Amino acid dipeptide frequency for Oikopleura dioica (Tunicate)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 standard dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
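The comparison against the uniform expectation can be sketched in a few lines of Python. This is only an illustration: the page does not state the exact rule used for the red/blue marking, so the simple "above or below 2.5‰" test and the function name `classify` are assumptions. The four values are copied from the table.

```python
# Uniform expectation over the 400 standard dipeptides, expressed in per mille.
EXPECTED_PER_MILLE = 1000 / 400  # 2.5

def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Return 'over', 'under' or 'expected' relative to the uniform expectation.

    Assumption: a plain comparison with the expected value; the page may use
    a different threshold for its colouring.
    """
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"

# A few values copied from the table (per mille).
observed = {"LeuLeu": 7.308, "SerSer": 8.285, "TrpTrp": 0.178, "CysCys": 0.536}

for dipeptide, freq in observed.items():
    ratio = freq / EXPECTED_PER_MILLE
    print(f"{dipeptide}: {freq} per mille, {ratio:.2f}x expected, {classify(freq)}")
```

Under this assumption, LeuLeu and SerSer come out as overrepresented, while TrpTrp and CysCys come out as underrepresented.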

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
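As a rough sketch of how such per mille values could be obtained, the Python snippet below counts overlapping residue pairs within each protein and scales the counts to per mille. The function name `dipeptide_per_mille`, the toy sequences, and the choice of overlapping windows are assumptions made for illustration; the page does not describe the exact counting procedure. One-letter codes are used, with X standing in for Xaa.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_per_mille(sequences):
    """Return dipeptide frequencies in per mille for a list of protein sequences."""
    counts = Counter()
    for seq in sequences:
        # Map any non-standard or unknown residue to X (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Count overlapping windows of length 2; pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000 * n / total for pair, n in counts.items()}

# Hypothetical toy proteins, not taken from the proteome.
toy_proteins = ["MKLLSA", "MSSLLK"]
freqs = dipeptide_per_mille(toy_proteins)

print(freqs["LL"])         # frequency in per mille
print(freqs["LL"] * 1e-3)  # the same value as a fraction
```

In this toy input, LL occurs in 2 of the 10 residue pairs, so its frequency is 200‰, which corresponds to a fraction of 0.2.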

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.513 ± 0.052
AlaCys: 1.18 ± 0.019
AlaAsp: 3.641 ± 0.029
AlaGlu: 4.893 ± 0.036
AlaPhe: 2.703 ± 0.023
AlaGly: 3.806 ± 0.041
AlaHis: 1.252 ± 0.014
AlaIle: 3.687 ± 0.027
AlaLys: 4.562 ± 0.034
AlaLeu: 5.571 ± 0.039
AlaMet: 1.434 ± 0.015
AlaAsn: 2.839 ± 0.02
AlaPro: 2.933 ± 0.03
AlaGln: 2.707 ± 0.023
AlaArg: 3.426 ± 0.025
AlaSer: 4.849 ± 0.029
AlaThr: 3.477 ± 0.024
AlaVal: 3.727 ± 0.025
AlaTrp: 0.658 ± 0.01
AlaTyr: 1.593 ± 0.019
AlaXaa: 0.002 ± 0.001
Cys
CysAla: 1.32 ± 0.02
CysCys: 0.536 ± 0.016
CysAsp: 1.373 ± 0.026
CysGlu: 1.519 ± 0.029
CysPhe: 0.996 ± 0.013
CysGly: 1.405 ± 0.023
CysHis: 0.489 ± 0.01
CysIle: 1.052 ± 0.015
CysLys: 1.232 ± 0.02
CysLeu: 1.698 ± 0.02
CysMet: 0.411 ± 0.009
CysAsn: 0.927 ± 0.017
CysPro: 1.278 ± 0.061
CysGln: 0.836 ± 0.014
CysArg: 1.055 ± 0.014
CysSer: 1.74 ± 0.026
CysThr: 1.062 ± 0.019
CysVal: 1.195 ± 0.02
CysTrp: 0.277 ± 0.007
CysTyr: 0.598 ± 0.011
CysXaa: 0.001 ± 0.0
Asp
AspAla: 3.512 ± 0.028
AspCys: 1.271 ± 0.023
AspAsp: 4.154 ± 0.033
AspGlu: 5.286 ± 0.037
AspPhe: 3.068 ± 0.024
AspGly: 3.669 ± 0.032
AspHis: 1.141 ± 0.013
AspIle: 3.295 ± 0.027
AspLys: 3.327 ± 0.024
AspLeu: 5.061 ± 0.033
AspMet: 1.173 ± 0.015
AspAsn: 2.562 ± 0.022
AspPro: 2.558 ± 0.022
AspGln: 2.126 ± 0.02
AspArg: 2.756 ± 0.027
AspSer: 4.408 ± 0.03
AspThr: 2.327 ± 0.021
AspVal: 3.223 ± 0.026
AspTrp: 0.81 ± 0.013
AspTyr: 1.923 ± 0.022
AspXaa: 0.001 ± 0.0
Glu
GluAla: 4.504 ± 0.041
GluCys: 1.5 ± 0.033
GluAsp: 4.52 ± 0.03
GluGlu: 6.888 ± 0.055
GluPhe: 2.968 ± 0.02
GluGly: 3.004 ± 0.027
GluHis: 1.409 ± 0.016
GluIle: 5.146 ± 0.03
GluLys: 6.341 ± 0.044
GluLeu: 6.387 ± 0.045
GluMet: 1.814 ± 0.018
GluAsn: 4.807 ± 0.035
GluPro: 2.583 ± 0.026
GluGln: 2.832 ± 0.026
GluArg: 3.902 ± 0.029
GluSer: 5.046 ± 0.033
GluThr: 3.869 ± 0.029
GluVal: 3.621 ± 0.028
GluTrp: 0.811 ± 0.014
GluTyr: 2.006 ± 0.016
GluXaa: 0.001 ± 0.0
Phe
PheAla: 2.803 ± 0.023
PheCys: 1.115 ± 0.019
PheAsp: 2.898 ± 0.023
PheGlu: 3.068 ± 0.023
PhePhe: 2.236 ± 0.022
PheGly: 2.992 ± 0.026
PheHis: 1.023 ± 0.013
PheIle: 2.55 ± 0.026
PheLys: 2.58 ± 0.022
PheLeu: 3.886 ± 0.029
PheMet: 1.047 ± 0.012
PheAsn: 2.157 ± 0.019
PhePro: 1.748 ± 0.018
PheGln: 1.73 ± 0.017
PheArg: 2.06 ± 0.023
PheSer: 3.663 ± 0.028
PheThr: 2.283 ± 0.021
PheVal: 2.59 ± 0.021
PheTrp: 0.616 ± 0.009
PheTyr: 1.558 ± 0.017
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.473 ± 0.032
GlyCys: 1.169 ± 0.019
GlyAsp: 3.158 ± 0.028
GlyGlu: 3.577 ± 0.034
GlyPhe: 2.778 ± 0.028
GlyGly: 3.805 ± 0.049
GlyHis: 1.251 ± 0.016
GlyIle: 3.157 ± 0.026
GlyLys: 3.657 ± 0.03
GlyLeu: 4.325 ± 0.032
GlyMet: 1.237 ± 0.016
GlyAsn: 2.725 ± 0.022
GlyPro: 2.016 ± 0.031
GlyGln: 2.125 ± 0.02
GlyArg: 2.975 ± 0.028
GlySer: 4.356 ± 0.041
GlyThr: 2.935 ± 0.035
GlyVal: 2.997 ± 0.026
GlyTrp: 0.822 ± 0.012
GlyTyr: 1.903 ± 0.029
GlyXaa: 0.002 ± 0.001
His
HisAla: 1.219 ± 0.015
HisCys: 0.499 ± 0.01
HisAsp: 1.071 ± 0.014
HisGlu: 1.384 ± 0.015
HisPhe: 1.107 ± 0.013
HisGly: 1.199 ± 0.014
HisHis: 0.614 ± 0.011
HisIle: 1.066 ± 0.012
HisLys: 1.196 ± 0.014
HisLeu: 1.857 ± 0.019
HisMet: 0.461 ± 0.009
HisAsn: 0.892 ± 0.012
HisPro: 1.062 ± 0.015
HisGln: 0.967 ± 0.014
HisArg: 1.179 ± 0.014
HisSer: 1.537 ± 0.016
HisThr: 0.881 ± 0.013
HisVal: 1.173 ± 0.013
HisTrp: 0.265 ± 0.007
HisTyr: 0.693 ± 0.011
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.754 ± 0.028
IleCys: 1.414 ± 0.017
IleAsp: 3.65 ± 0.025
IleGlu: 4.367 ± 0.034
IlePhe: 2.894 ± 0.029
IleGly: 3.139 ± 0.024
IleHis: 1.209 ± 0.014
IleIle: 3.397 ± 0.024
IleLys: 3.82 ± 0.029
IleLeu: 4.98 ± 0.036
IleMet: 1.255 ± 0.013
IleAsn: 2.784 ± 0.021
IlePro: 2.717 ± 0.022
IleGln: 2.328 ± 0.023
IleArg: 2.962 ± 0.022
IleSer: 4.846 ± 0.03
IleThr: 3.008 ± 0.029
IleVal: 3.308 ± 0.026
IleTrp: 0.705 ± 0.011
IleTyr: 1.754 ± 0.018
IleXaa: 0.001 ± 0.0
Lys
LysAla: 4.165 ± 0.034
LysCys: 1.429 ± 0.021
LysAsp: 3.491 ± 0.025
LysGlu: 5.201 ± 0.039
LysPhe: 2.768 ± 0.019
LysGly: 2.786 ± 0.026
LysHis: 1.27 ± 0.013
LysIle: 4.916 ± 0.032
LysLys: 6.441 ± 0.048
LysLeu: 6.035 ± 0.046
LysMet: 1.851 ± 0.018
LysAsn: 4.083 ± 0.031
LysPro: 2.715 ± 0.026
LysGln: 2.568 ± 0.022
LysArg: 4.075 ± 0.031
LysSer: 5.248 ± 0.035
LysThr: 4.155 ± 0.031
LysVal: 3.478 ± 0.023
LysTrp: 0.826 ± 0.011
LysTyr: 2.029 ± 0.019
LysXaa: 0.001 ± 0.0
Leu
LeuAla: 5.717 ± 0.039
LeuCys: 1.535 ± 0.02
LeuAsp: 4.825 ± 0.033
LeuGlu: 6.36 ± 0.054
LeuPhe: 3.604 ± 0.03
LeuGly: 4.221 ± 0.029
LeuHis: 1.749 ± 0.018
LeuIle: 4.586 ± 0.032
LeuLys: 6.07 ± 0.039
LeuLeu: 7.308 ± 0.057
LeuMet: 1.929 ± 0.019
LeuAsn: 4.092 ± 0.028
LeuPro: 4.08 ± 0.028
LeuGln: 3.643 ± 0.032
LeuArg: 4.546 ± 0.034
LeuSer: 6.718 ± 0.044
LeuThr: 4.673 ± 0.027
LeuVal: 4.643 ± 0.035
LeuTrp: 0.831 ± 0.011
LeuTyr: 2.201 ± 0.019
LeuXaa: 0.002 ± 0.001
Met
MetAla: 1.549 ± 0.017
MetCys: 0.365 ± 0.008
MetAsp: 1.266 ± 0.015
MetGlu: 1.637 ± 0.017
MetPhe: 0.853 ± 0.013
MetGly: 1.132 ± 0.012
MetHis: 0.424 ± 0.008
MetIle: 1.393 ± 0.015
MetLys: 1.842 ± 0.018
MetLeu: 1.852 ± 0.021
MetMet: 0.712 ± 0.013
MetAsn: 1.243 ± 0.013
MetPro: 1.059 ± 0.014
MetGln: 0.842 ± 0.012
MetArg: 1.263 ± 0.014
MetSer: 1.909 ± 0.018
MetThr: 1.399 ± 0.014
MetVal: 1.176 ± 0.013
MetTrp: 0.218 ± 0.006
MetTyr: 0.516 ± 0.009
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.088 ± 0.027
AsnCys: 1.123 ± 0.017
AsnAsp: 2.845 ± 0.023
AsnGlu: 3.762 ± 0.024
AsnPhe: 2.498 ± 0.023
AsnGly: 3.166 ± 0.032
AsnHis: 1.039 ± 0.013
AsnIle: 2.871 ± 0.023
AsnLys: 2.953 ± 0.023
AsnLeu: 4.34 ± 0.032
AsnMet: 1.193 ± 0.013
AsnAsn: 2.498 ± 0.024
AsnPro: 2.373 ± 0.022
AsnGln: 2.086 ± 0.018
AsnArg: 2.519 ± 0.023
AsnSer: 4.041 ± 0.027
AsnThr: 2.362 ± 0.023
AsnVal: 2.605 ± 0.022
AsnTrp: 0.71 ± 0.01
AsnTyr: 1.657 ± 0.019
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.198 ± 0.03
ProCys: 0.812 ± 0.027
ProAsp: 2.593 ± 0.022
ProGlu: 3.594 ± 0.031
ProPhe: 1.728 ± 0.018
ProGly: 2.585 ± 0.047
ProHis: 0.831 ± 0.013
ProIle: 2.393 ± 0.022
ProLys: 3.148 ± 0.028
ProLeu: 3.233 ± 0.025
ProMet: 0.876 ± 0.011
ProAsn: 2.074 ± 0.022
ProPro: 2.748 ± 0.038
ProGln: 1.747 ± 0.02
ProArg: 2.215 ± 0.022
ProSer: 3.701 ± 0.032
ProThr: 2.591 ± 0.028
ProVal: 2.868 ± 0.025
ProTrp: 0.475 ± 0.008
ProTyr: 1.133 ± 0.014
ProXaa: 0.001 ± 0.0
Gln
GlnAla: 2.55 ± 0.022
GlnCys: 0.747 ± 0.013
GlnAsp: 1.736 ± 0.016
GlnGlu: 2.755 ± 0.026
GlnPhe: 1.698 ± 0.016
GlnGly: 1.695 ± 0.02
GlnHis: 0.842 ± 0.013
GlnIle: 2.654 ± 0.018
GlnLys: 3.224 ± 0.024
GlnLeu: 3.624 ± 0.029
GlnMet: 1.12 ± 0.014
GlnAsn: 2.452 ± 0.02
GlnPro: 1.721 ± 0.021
GlnGln: 2.095 ± 0.028
GlnArg: 2.3 ± 0.022
GlnSer: 2.867 ± 0.025
GlnThr: 2.121 ± 0.02
GlnVal: 2.109 ± 0.02
GlnTrp: 0.431 ± 0.008
GlnTyr: 1.076 ± 0.013
GlnXaa: 0.001 ± 0.0
Arg
ArgAla: 3.327 ± 0.025
ArgCys: 0.994 ± 0.015
ArgAsp: 2.79 ± 0.025
ArgGlu: 3.651 ± 0.029
ArgPhe: 2.218 ± 0.02
ArgGly: 2.651 ± 0.03
ArgHis: 1.178 ± 0.014
ArgIle: 3.122 ± 0.025
ArgLys: 4.299 ± 0.032
ArgLeu: 4.517 ± 0.031
ArgMet: 1.219 ± 0.014
ArgAsn: 2.791 ± 0.023
ArgPro: 2.376 ± 0.026
ArgGln: 2.302 ± 0.019
ArgArg: 3.954 ± 0.039
ArgSer: 4.173 ± 0.035
ArgThr: 2.644 ± 0.021
ArgVal: 2.83 ± 0.023
ArgTrp: 0.612 ± 0.011
ArgTyr: 1.45 ± 0.015
ArgXaa: 0.002 ± 0.0
Ser
SerAla: 5.114 ± 0.033
SerCys: 1.789 ± 0.025
SerAsp: 4.687 ± 0.03
SerGlu: 5.479 ± 0.035
SerPhe: 3.591 ± 0.025
SerGly: 4.681 ± 0.035
SerHis: 1.479 ± 0.017
SerIle: 4.409 ± 0.031
SerLys: 5.006 ± 0.034
SerLeu: 6.305 ± 0.037
SerMet: 1.702 ± 0.017
SerAsn: 3.546 ± 0.025
SerPro: 3.663 ± 0.04
SerGln: 3.099 ± 0.028
SerArg: 4.263 ± 0.032
SerSer: 8.285 ± 0.059
SerThr: 4.759 ± 0.037
SerVal: 4.28 ± 0.031
SerTrp: 0.978 ± 0.014
SerTyr: 2.124 ± 0.02
SerXaa: 0.001 ± 0.0
Thr
ThrAla: 3.588 ± 0.031
ThrCys: 1.356 ± 0.026
ThrAsp: 2.965 ± 0.024
ThrGlu: 3.664 ± 0.033
ThrPhe: 2.314 ± 0.023
ThrGly: 3.209 ± 0.028
ThrHis: 1.015 ± 0.014
ThrIle: 3.246 ± 0.028
ThrLys: 3.47 ± 0.027
ThrLeu: 4.323 ± 0.031
ThrMet: 1.147 ± 0.014
ThrAsn: 2.364 ± 0.023
ThrPro: 2.727 ± 0.027
ThrGln: 1.897 ± 0.016
ThrArg: 2.676 ± 0.022
ThrSer: 4.474 ± 0.036
ThrThr: 4.276 ± 0.114
ThrVal: 3.326 ± 0.033
ThrTrp: 0.621 ± 0.01
ThrTyr: 1.367 ± 0.018
ThrXaa: 0.002 ± 0.001
Val
ValAla: 3.645 ± 0.03
ValCys: 1.183 ± 0.02
ValAsp: 3.511 ± 0.026
ValGlu: 4.062 ± 0.03
ValPhe: 2.614 ± 0.022
ValGly: 2.879 ± 0.03
ValHis: 1.157 ± 0.014
ValIle: 3.153 ± 0.023
ValLys: 3.734 ± 0.03
ValLeu: 4.565 ± 0.029
ValMet: 1.165 ± 0.012
ValAsn: 2.699 ± 0.023
ValPro: 2.557 ± 0.024
ValGln: 2.206 ± 0.022
ValArg: 2.735 ± 0.023
ValSer: 4.108 ± 0.026
ValThr: 3.072 ± 0.029
ValVal: 3.233 ± 0.028
ValTrp: 0.608 ± 0.01
ValTyr: 1.526 ± 0.014
ValXaa: 0.001 ± 0.0
Trp
TrpAla: 0.724 ± 0.012
TrpCys: 0.224 ± 0.007
TrpAsp: 0.672 ± 0.012
TrpGlu: 0.751 ± 0.013
TrpPhe: 0.494 ± 0.009
TrpGly: 0.641 ± 0.013
TrpHis: 0.25 ± 0.007
TrpIle: 0.757 ± 0.012
TrpLys: 0.88 ± 0.013
TrpLeu: 0.978 ± 0.013
TrpMet: 0.293 ± 0.007
TrpAsn: 0.742 ± 0.011
TrpPro: 0.417 ± 0.009
TrpGln: 0.479 ± 0.009
TrpArg: 0.689 ± 0.01
TrpSer: 0.99 ± 0.019
TrpThr: 0.776 ± 0.013
TrpVal: 0.558 ± 0.01
TrpTrp: 0.178 ± 0.005
TrpTyr: 0.326 ± 0.006
TrpXaa: 0.001 ± 0.0
Tyr
TyrAla: 1.693 ± 0.019
TyrCys: 0.758 ± 0.028
TyrAsp: 1.803 ± 0.017
TyrGlu: 2.057 ± 0.017
TyrPhe: 1.473 ± 0.018
TyrGly: 1.776 ± 0.024
TyrHis: 0.702 ± 0.01
TyrIle: 1.439 ± 0.015
TyrLys: 1.747 ± 0.018
TyrLeu: 2.49 ± 0.024
TyrMet: 0.613 ± 0.01
TyrAsn: 1.449 ± 0.018
TyrPro: 1.21 ± 0.015
TyrGln: 1.203 ± 0.014
TyrArg: 1.55 ± 0.015
TyrSer: 2.278 ± 0.021
TyrThr: 1.355 ± 0.017
TyrVal: 1.473 ± 0.016
TyrTrp: 0.367 ± 0.008
TyrTyr: 1.03 ± 0.017
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.001 ± 0.0
XaaGlu: 0.001 ± 0.0
XaaPhe: 0.002 ± 0.001
XaaGly: 0.001 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.001 ± 0.0
XaaLys: 0.002 ± 0.001
XaaLeu: 0.002 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.002 ± 0.0
XaaPro: 0.001 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.001 ± 0.0
XaaSer: 0.001 ± 0.0
XaaThr: 0.002 ± 0.0
XaaVal: 0.001 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 14.589 ± 1.752
Statistics based on 17050 proteins (6837326 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
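A protein-level bootstrap can be sketched as follows. This is an illustration under stated assumptions, not the procedure used by Proteome-pI: it assumes that whole proteins are resampled with replacement, that 100 replicates are drawn, and that the reported error is the standard deviation of the replicate estimates. All function names and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter

def dipeptide_counts(seq):
    """Count overlapping residue pairs within one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def per_mille(per_protein_counts, pair):
    """Frequency of one dipeptide, in per mille, over a list of per-protein counts."""
    total = sum(sum(c.values()) for c in per_protein_counts)
    hits = sum(c[pair] for c in per_protein_counts)
    return 1000 * hits / total if total else 0.0

def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)
    counts = [dipeptide_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement (assumed resampling unit).
        sample = rng.choices(counts, k=len(counts))
        estimates.append(per_mille(sample, pair))
    return statistics.stdev(estimates)

# Hypothetical toy proteins, not taken from the proteome.
toy_proteins = ["MKLLSA", "MSSLLK", "MAAGLL", "MKKEEL"]
counts = [dipeptide_counts(p) for p in toy_proteins]
print(per_mille(counts, "LL"), "±", bootstrap_error(toy_proteins, "LL"))
```

Resampling proteins rather than individual residue pairs keeps the pairs from one protein together, so the estimate reflects variation between proteins.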

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski