Amino acid dipepetide frequency for Chlorocebus sabaeus (Green monkey) (Cercopithecus sabaeus)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.915AlaAla: 6.915 ± 0.05
1.415AlaCys: 1.415 ± 0.015
2.836AlaAsp: 2.836 ± 0.019
4.741AlaGlu: 4.741 ± 0.033
2.773AlaPhe: 2.773 ± 0.023
4.837AlaGly: 4.837 ± 0.038
1.583AlaHis: 1.583 ± 0.014
2.742AlaIle: 2.742 ± 0.019
3.376AlaLys: 3.376 ± 0.025
7.228AlaLeu: 7.228 ± 0.039
1.467AlaMet: 1.467 ± 0.014
1.966AlaAsn: 1.966 ± 0.016
4.28AlaPro: 4.28 ± 0.039
3.224AlaGln: 3.224 ± 0.023
3.716AlaArg: 3.716 ± 0.027
5.736AlaSer: 5.736 ± 0.033
3.629AlaThr: 3.629 ± 0.028
4.677AlaVal: 4.677 ± 0.027
0.832AlaTrp: 0.832 ± 0.012
1.524AlaTyr: 1.524 ± 0.014
0.0AlaXaa: 0.0 ± 0.0
Cys
1.279CysAla: 1.279 ± 0.014
0.718CysCys: 0.718 ± 0.019
1.019CysAsp: 1.019 ± 0.015
1.354CysGlu: 1.354 ± 0.021
0.862CysPhe: 0.862 ± 0.011
1.986CysGly: 1.986 ± 0.03
0.679CysHis: 0.679 ± 0.01
0.938CysIle: 0.938 ± 0.012
1.222CysLys: 1.222 ± 0.017
2.206CysLeu: 2.206 ± 0.021
0.42CysMet: 0.42 ± 0.006
0.864CysAsn: 0.864 ± 0.013
1.421CysPro: 1.421 ± 0.02
1.121CysGln: 1.121 ± 0.016
1.333CysArg: 1.333 ± 0.015
2.048CysSer: 2.048 ± 0.02
1.12CysThr: 1.12 ± 0.014
1.306CysVal: 1.306 ± 0.016
0.303CysTrp: 0.303 ± 0.006
0.603CysTyr: 0.603 ± 0.009
0.0CysXaa: 0.0 ± 0.0
Asp
2.809AspAla: 2.809 ± 0.021
1.074AspCys: 1.074 ± 0.014
2.526AspAsp: 2.526 ± 0.021
3.357AspGlu: 3.357 ± 0.023
2.108AspPhe: 2.108 ± 0.019
3.204AspGly: 3.204 ± 0.027
1.121AspHis: 1.121 ± 0.013
2.54AspIle: 2.54 ± 0.019
2.452AspLys: 2.452 ± 0.019
4.897AspLeu: 4.897 ± 0.03
1.085AspMet: 1.085 ± 0.011
1.636AspAsn: 1.636 ± 0.016
2.89AspPro: 2.89 ± 0.022
1.777AspGln: 1.777 ± 0.014
2.385AspArg: 2.385 ± 0.018
4.126AspSer: 4.126 ± 0.027
2.448AspThr: 2.448 ± 0.017
3.007AspVal: 3.007 ± 0.027
0.627AspTrp: 0.627 ± 0.008
1.438AspTyr: 1.438 ± 0.013
0.0AspXaa: 0.0 ± 0.0
Glu
5.281GluAla: 5.281 ± 0.035
1.636GluCys: 1.636 ± 0.034
4.398GluAsp: 4.398 ± 0.026
7.973GluGlu: 7.973 ± 0.063
2.032GluPhe: 2.032 ± 0.015
4.167GluGly: 4.167 ± 0.027
1.495GluHis: 1.495 ± 0.013
3.092GluIle: 3.092 ± 0.027
5.502GluLys: 5.502 ± 0.045
6.397GluLeu: 6.397 ± 0.038
1.645GluMet: 1.645 ± 0.014
3.121GluAsn: 3.121 ± 0.023
3.308GluPro: 3.308 ± 0.033
3.103GluGln: 3.103 ± 0.028
3.983GluArg: 3.983 ± 0.035
4.304GluSer: 4.304 ± 0.029
3.39GluThr: 3.39 ± 0.025
4.148GluVal: 4.148 ± 0.03
0.702GluTrp: 0.702 ± 0.009
1.556GluTyr: 1.556 ± 0.019
0.001GluXaa: 0.001 ± 0.0
Phe
1.977PheAla: 1.977 ± 0.016
0.966PheCys: 0.966 ± 0.01
1.634PheAsp: 1.634 ± 0.014
1.983PheGlu: 1.983 ± 0.014
1.62PhePhe: 1.62 ± 0.018
2.217PheGly: 2.217 ± 0.021
1.059PheHis: 1.059 ± 0.011
1.834PheIle: 1.834 ± 0.018
1.812PheLys: 1.812 ± 0.017
4.199PheLeu: 4.199 ± 0.031
0.771PheMet: 0.771 ± 0.01
1.361PheAsn: 1.361 ± 0.016
2.031PhePro: 2.031 ± 0.018
1.84PheGln: 1.84 ± 0.015
2.01PheArg: 2.01 ± 0.02
3.521PheSer: 3.521 ± 0.025
2.076PheThr: 2.076 ± 0.016
2.131PheVal: 2.131 ± 0.018
0.508PheTrp: 0.508 ± 0.007
1.21PheTyr: 1.21 ± 0.011
0.0PheXaa: 0.0 ± 0.0
Gly
4.585GlyAla: 4.585 ± 0.035
1.334GlyCys: 1.334 ± 0.015
3.106GlyAsp: 3.106 ± 0.027
4.308GlyGlu: 4.308 ± 0.039
2.437GlyPhe: 2.437 ± 0.024
5.138GlyGly: 5.138 ± 0.051
1.699GlyHis: 1.699 ± 0.017
2.743GlyIle: 2.743 ± 0.022
3.965GlyLys: 3.965 ± 0.027
5.994GlyLeu: 5.994 ± 0.043
1.273GlyMet: 1.273 ± 0.014
2.34GlyAsn: 2.34 ± 0.019
4.383GlyPro: 4.383 ± 0.063
2.776GlyGln: 2.776 ± 0.027
3.729GlyArg: 3.729 ± 0.024
5.82GlySer: 5.82 ± 0.042
3.58GlyThr: 3.58 ± 0.03
3.533GlyVal: 3.533 ± 0.027
0.786GlyTrp: 0.786 ± 0.01
1.727GlyTyr: 1.727 ± 0.017
0.001GlyXaa: 0.001 ± 0.0
His
1.327HisAla: 1.327 ± 0.012
0.743HisCys: 0.743 ± 0.01
0.862HisAsp: 0.862 ± 0.01
1.324HisGlu: 1.324 ± 0.012
1.103HisPhe: 1.103 ± 0.012
1.547HisGly: 1.547 ± 0.018
0.923HisHis: 0.923 ± 0.015
1.219HisIle: 1.219 ± 0.013
1.327HisLys: 1.327 ± 0.017
2.99HisLeu: 2.99 ± 0.021
0.595HisMet: 0.595 ± 0.008
0.874HisAsn: 0.874 ± 0.011
1.654HisPro: 1.654 ± 0.015
1.421HisGln: 1.421 ± 0.019
1.61HisArg: 1.61 ± 0.014
2.342HisSer: 2.342 ± 0.022
1.711HisThr: 1.711 ± 0.025
1.472HisVal: 1.472 ± 0.013
0.367HisTrp: 0.367 ± 0.006
0.804HisTyr: 0.804 ± 0.01
0.0HisXaa: 0.0 ± 0.0
Ile
2.496IleAla: 2.496 ± 0.017
1.104IleCys: 1.104 ± 0.011
1.951IleAsp: 1.951 ± 0.018
2.487IleGlu: 2.487 ± 0.023
1.908IlePhe: 1.908 ± 0.016
2.155IleGly: 2.155 ± 0.016
1.512IleHis: 1.512 ± 0.021
2.363IleIle: 2.363 ± 0.02
2.57IleLys: 2.57 ± 0.025
4.591IleLeu: 4.591 ± 0.031
0.98IleMet: 0.98 ± 0.009
1.776IleAsn: 1.776 ± 0.015
2.59IlePro: 2.59 ± 0.019
2.248IleGln: 2.248 ± 0.019
2.329IleArg: 2.329 ± 0.019
3.683IleSer: 3.683 ± 0.024
2.542IleThr: 2.542 ± 0.028
2.452IleVal: 2.452 ± 0.022
0.516IleTrp: 0.516 ± 0.008
1.401IleTyr: 1.401 ± 0.013
0.0IleXaa: 0.0 ± 0.0
Lys
4.119LysAla: 4.119 ± 0.028
1.276LysCys: 1.276 ± 0.021
3.058LysAsp: 3.058 ± 0.026
5.082LysGlu: 5.082 ± 0.042
1.714LysPhe: 1.714 ± 0.016
3.236LysGly: 3.236 ± 0.035
1.414LysHis: 1.414 ± 0.014
2.795LysIle: 2.795 ± 0.024
4.735LysLys: 4.735 ± 0.04
5.11LysLeu: 5.11 ± 0.033
1.452LysMet: 1.452 ± 0.017
2.384LysAsn: 2.384 ± 0.019
3.281LysPro: 3.281 ± 0.033
2.567LysGln: 2.567 ± 0.022
3.298LysArg: 3.298 ± 0.023
3.903LysSer: 3.903 ± 0.023
3.112LysThr: 3.112 ± 0.02
3.449LysVal: 3.449 ± 0.036
0.603LysTrp: 0.603 ± 0.009
1.553LysTyr: 1.553 ± 0.013
0.0LysXaa: 0.0 ± 0.0
Leu
6.901LeuAla: 6.901 ± 0.038
2.228LeuCys: 2.228 ± 0.022
4.658LeuAsp: 4.658 ± 0.028
7.172LeuGlu: 7.172 ± 0.043
3.417LeuPhe: 3.417 ± 0.025
6.008LeuGly: 6.008 ± 0.038
2.744LeuHis: 2.744 ± 0.02
3.907LeuIle: 3.907 ± 0.024
5.719LeuLys: 5.719 ± 0.037
11.066LeuLeu: 11.066 ± 0.061
1.992LeuMet: 1.992 ± 0.015
3.481LeuAsn: 3.481 ± 0.023
6.074LeuPro: 6.074 ± 0.038
5.798LeuGln: 5.798 ± 0.038
5.977LeuArg: 5.977 ± 0.037
8.074LeuSer: 8.074 ± 0.034
5.178LeuThr: 5.178 ± 0.027
5.579LeuVal: 5.579 ± 0.03
1.179LeuTrp: 1.179 ± 0.012
2.566LeuTyr: 2.566 ± 0.021
0.001LeuXaa: 0.001 ± 0.0
Met
1.93MetAla: 1.93 ± 0.016
0.392MetCys: 0.392 ± 0.008
1.219MetAsp: 1.219 ± 0.013
1.828MetGlu: 1.828 ± 0.013
0.723MetPhe: 0.723 ± 0.009
1.279MetGly: 1.279 ± 0.014
0.464MetHis: 0.464 ± 0.007
0.829MetIle: 0.829 ± 0.01
1.419MetLys: 1.419 ± 0.014
1.953MetLeu: 1.953 ± 0.016
0.545MetMet: 0.545 ± 0.009
0.879MetAsn: 0.879 ± 0.012
1.087MetPro: 1.087 ± 0.016
0.899MetGln: 0.899 ± 0.012
1.025MetArg: 1.025 ± 0.01
1.526MetSer: 1.526 ± 0.014
1.11MetThr: 1.11 ± 0.012
1.336MetVal: 1.336 ± 0.011
0.244MetTrp: 0.244 ± 0.005
0.574MetTyr: 0.574 ± 0.008
0.0MetXaa: 0.0 ± 0.0
Asn
1.997AsnAla: 1.997 ± 0.018
0.854AsnCys: 0.854 ± 0.013
1.473AsnAsp: 1.473 ± 0.014
2.212AsnGlu: 2.212 ± 0.019
1.483AsnPhe: 1.483 ± 0.014
2.365AsnGly: 2.365 ± 0.023
0.945AsnHis: 0.945 ± 0.01
2.07AsnIle: 2.07 ± 0.018
2.182AsnLys: 2.182 ± 0.02
3.748AsnLeu: 3.748 ± 0.024
0.891AsnMet: 0.891 ± 0.009
1.518AsnAsn: 1.518 ± 0.017
2.173AsnPro: 2.173 ± 0.016
1.645AsnGln: 1.645 ± 0.018
1.841AsnArg: 1.841 ± 0.015
3.097AsnSer: 3.097 ± 0.022
1.926AsnThr: 1.926 ± 0.019
2.178AsnVal: 2.178 ± 0.018
0.454AsnTrp: 0.454 ± 0.007
1.116AsnTyr: 1.116 ± 0.011
0.0AsnXaa: 0.0 ± 0.0
Pro
4.923ProAla: 4.923 ± 0.037
1.177ProCys: 1.177 ± 0.017
2.778ProAsp: 2.778 ± 0.022
4.531ProGlu: 4.531 ± 0.039
1.95ProPhe: 1.95 ± 0.015
5.389ProGly: 5.389 ± 0.081
1.446ProHis: 1.446 ± 0.016
1.867ProIle: 1.867 ± 0.018
2.824ProLys: 2.824 ± 0.036
5.317ProLeu: 5.317 ± 0.028
1.036ProMet: 1.036 ± 0.013
1.78ProAsn: 1.78 ± 0.016
6.114ProPro: 6.114 ± 0.072
2.859ProGln: 2.859 ± 0.023
3.438ProArg: 3.438 ± 0.027
5.637ProSer: 5.637 ± 0.044
3.111ProThr: 3.111 ± 0.033
3.819ProVal: 3.819 ± 0.033
0.714ProTrp: 0.714 ± 0.01
1.673ProTyr: 1.673 ± 0.025
0.001ProXaa: 0.001 ± 0.0
Gln
3.53GlnAla: 3.53 ± 0.029
0.975GlnCys: 0.975 ± 0.014
2.32GlnAsp: 2.32 ± 0.017
3.932GlnGlu: 3.932 ± 0.03
1.351GlnPhe: 1.351 ± 0.012
2.892GlnGly: 2.892 ± 0.026
1.307GlnHis: 1.307 ± 0.015
1.954GlnIle: 1.954 ± 0.016
2.952GlnLys: 2.952 ± 0.027
4.752GlnLeu: 4.752 ± 0.034
1.08GlnMet: 1.08 ± 0.01
1.848GlnAsn: 1.848 ± 0.015
2.815GlnPro: 2.815 ± 0.025
3.018GlnGln: 3.018 ± 0.044
3.022GlnArg: 3.022 ± 0.023
3.113GlnSer: 3.113 ± 0.024
2.28GlnThr: 2.28 ± 0.017
2.789GlnVal: 2.789 ± 0.02
0.546GlnTrp: 0.546 ± 0.008
1.118GlnTyr: 1.118 ± 0.011
0.0GlnXaa: 0.0 ± 0.0
Arg
3.905ArgAla: 3.905 ± 0.027
1.242ArgCys: 1.242 ± 0.016
2.716ArgAsp: 2.716 ± 0.021
3.976ArgGlu: 3.976 ± 0.029
1.869ArgPhe: 1.869 ± 0.016
3.647ArgGly: 3.647 ± 0.036
1.602ArgHis: 1.602 ± 0.015
2.46ArgIle: 2.46 ± 0.019
3.706ArgLys: 3.706 ± 0.023
5.453ArgLeu: 5.453 ± 0.033
1.156ArgMet: 1.156 ± 0.012
2.117ArgAsn: 2.117 ± 0.014
3.284ArgPro: 3.284 ± 0.024
2.595ArgGln: 2.595 ± 0.022
4.382ArgArg: 4.382 ± 0.037
4.204ArgSer: 4.204 ± 0.038
2.79ArgThr: 2.79 ± 0.019
3.15ArgVal: 3.15 ± 0.026
0.705ArgTrp: 0.705 ± 0.01
1.455ArgTyr: 1.455 ± 0.013
0.0ArgXaa: 0.0 ± 0.0
Ser
5.272SerAla: 5.272 ± 0.031
1.925SerCys: 1.925 ± 0.023
3.774SerAsp: 3.774 ± 0.027
5.135SerGlu: 5.135 ± 0.036
3.113SerPhe: 3.113 ± 0.02
5.646SerGly: 5.646 ± 0.037
2.185SerHis: 2.185 ± 0.018
3.178SerIle: 3.178 ± 0.02
4.045SerLys: 4.045 ± 0.027
8.264SerLeu: 8.264 ± 0.036
1.574SerMet: 1.574 ± 0.015
2.683SerAsn: 2.683 ± 0.021
5.936SerPro: 5.936 ± 0.048
3.877SerGln: 3.877 ± 0.026
4.52SerArg: 4.52 ± 0.034
9.325SerSer: 9.325 ± 0.067
4.494SerThr: 4.494 ± 0.051
4.848SerVal: 4.848 ± 0.026
1.098SerTrp: 1.098 ± 0.014
2.091SerTyr: 2.091 ± 0.016
0.0SerXaa: 0.0 ± 0.0
Thr
3.753ThrAla: 3.753 ± 0.025
1.341ThrCys: 1.341 ± 0.019
2.39ThrAsp: 2.39 ± 0.018
3.497ThrGlu: 3.497 ± 0.026
2.159ThrPhe: 2.159 ± 0.016
3.749ThrGly: 3.749 ± 0.049
1.394ThrHis: 1.394 ± 0.019
2.374ThrIle: 2.374 ± 0.021
2.634ThrLys: 2.634 ± 0.025
5.365ThrLeu: 5.365 ± 0.03
1.097ThrMet: 1.097 ± 0.01
1.706ThrAsn: 1.706 ± 0.014
3.577ThrPro: 3.577 ± 0.043
2.311ThrGln: 2.311 ± 0.017
2.479ThrArg: 2.479 ± 0.018
4.626ThrSer: 4.626 ± 0.041
3.141ThrThr: 3.141 ± 0.066
3.826ThrVal: 3.826 ± 0.032
0.697ThrTrp: 0.697 ± 0.011
1.417ThrTyr: 1.417 ± 0.012
0.001ThrXaa: 0.001 ± 0.0
Val
4.278ValAla: 4.278 ± 0.025
1.46ValCys: 1.46 ± 0.015
2.875ValAsp: 2.875 ± 0.024
3.791ValGlu: 3.791 ± 0.028
2.438ValPhe: 2.438 ± 0.021
3.38ValGly: 3.38 ± 0.021
1.579ValHis: 1.579 ± 0.014
2.852ValIle: 2.852 ± 0.022
3.308ValLys: 3.308 ± 0.029
6.251ValLeu: 6.251 ± 0.032
1.3ValMet: 1.3 ± 0.013
2.204ValAsn: 2.204 ± 0.018
3.64ValPro: 3.64 ± 0.037
2.715ValGln: 2.715 ± 0.018
2.999ValArg: 2.999 ± 0.021
4.826ValSer: 4.826 ± 0.027
3.76ValThr: 3.76 ± 0.04
3.952ValVal: 3.952 ± 0.027
0.718ValTrp: 0.718 ± 0.009
1.613ValTyr: 1.613 ± 0.014
0.001ValXaa: 0.001 ± 0.0
Trp
0.822TrpAla: 0.822 ± 0.01
0.249TrpCys: 0.249 ± 0.006
0.651TrpAsp: 0.651 ± 0.009
0.829TrpGlu: 0.829 ± 0.011
0.454TrpPhe: 0.454 ± 0.007
0.753TrpGly: 0.753 ± 0.012
0.31TrpHis: 0.31 ± 0.007
0.549TrpIle: 0.549 ± 0.009
0.821TrpLys: 0.821 ± 0.01
1.236TrpLeu: 1.236 ± 0.013
0.31TrpMet: 0.31 ± 0.006
0.54TrpAsn: 0.54 ± 0.008
0.536TrpPro: 0.536 ± 0.009
0.541TrpGln: 0.541 ± 0.008
0.75TrpArg: 0.75 ± 0.009
0.878TrpSer: 0.878 ± 0.011
0.674TrpThr: 0.674 ± 0.01
0.718TrpVal: 0.718 ± 0.009
0.194TrpTrp: 0.194 ± 0.005
0.353TrpTyr: 0.353 ± 0.007
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.398TyrAla: 1.398 ± 0.014
0.691TyrCys: 0.691 ± 0.009
1.264TyrAsp: 1.264 ± 0.015
1.721TyrGlu: 1.721 ± 0.016
1.244TyrPhe: 1.244 ± 0.013
1.662TyrGly: 1.662 ± 0.015
0.754TyrHis: 0.754 ± 0.009
1.351TyrIle: 1.351 ± 0.013
1.568TyrLys: 1.568 ± 0.023
2.704TyrLeu: 2.704 ± 0.017
0.591TyrMet: 0.591 ± 0.008
1.09TyrAsn: 1.09 ± 0.012
1.306TyrPro: 1.306 ± 0.013
1.258TyrGln: 1.258 ± 0.012
1.592TyrArg: 1.592 ± 0.015
2.193TyrSer: 2.193 ± 0.019
1.461TyrThr: 1.461 ± 0.014
1.569TyrVal: 1.569 ± 0.015
0.374TyrTrp: 0.374 ± 0.008
0.953TyrTyr: 0.953 ± 0.013
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.001XaaAla: 0.001 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.001XaaLys: 0.001 ± 0.0
0.001XaaLeu: 0.001 ± 0.0
0.001XaaMet: 0.001 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.001XaaArg: 0.001 ± 0.0
0.001XaaSer: 0.001 ± 0.0
0.001XaaThr: 0.001 ± 0.0
0.001XaaVal: 0.001 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.011XaaXaa: 0.011 ± 0.002
Statistics based on 19229 proteins (10437685 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski