Amino acid dipepetide frequency for Pan paniscus (Pygmy chimpanzee) (Bonobo)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.43AlaAla: 6.43 ± 0.035
1.377AlaCys: 1.377 ± 0.01
2.884AlaAsp: 2.884 ± 0.012
4.761AlaGlu: 4.761 ± 0.024
2.642AlaPhe: 2.642 ± 0.014
4.578AlaGly: 4.578 ± 0.022
1.561AlaHis: 1.561 ± 0.01
2.81AlaIle: 2.81 ± 0.012
3.436AlaLys: 3.436 ± 0.018
7.032AlaLeu: 7.032 ± 0.027
1.507AlaMet: 1.507 ± 0.009
2.036AlaAsn: 2.036 ± 0.01
3.996AlaPro: 3.996 ± 0.022
3.255AlaGln: 3.255 ± 0.017
3.546AlaArg: 3.546 ± 0.018
5.649AlaSer: 5.649 ± 0.022
3.589AlaThr: 3.589 ± 0.016
4.65AlaVal: 4.65 ± 0.021
0.809AlaTrp: 0.809 ± 0.007
1.521AlaTyr: 1.521 ± 0.01
0.004AlaXaa: 0.004 ± 0.0
Cys
1.224CysAla: 1.224 ± 0.01
0.654CysCys: 0.654 ± 0.008
1.026CysAsp: 1.026 ± 0.011
1.306CysGlu: 1.306 ± 0.014
0.849CysPhe: 0.849 ± 0.007
1.753CysGly: 1.753 ± 0.017
0.684CysHis: 0.684 ± 0.007
0.967CysIle: 0.967 ± 0.009
1.18CysLys: 1.18 ± 0.011
2.21CysLeu: 2.21 ± 0.013
0.427CysMet: 0.427 ± 0.005
0.838CysAsn: 0.838 ± 0.009
1.386CysPro: 1.386 ± 0.013
1.085CysGln: 1.085 ± 0.011
1.292CysArg: 1.292 ± 0.011
2.002CysSer: 2.002 ± 0.014
1.113CysThr: 1.113 ± 0.01
1.304CysVal: 1.304 ± 0.01
0.305CysTrp: 0.305 ± 0.004
0.575CysTyr: 0.575 ± 0.006
0.001CysXaa: 0.001 ± 0.0
Asp
2.881AspAla: 2.881 ± 0.013
1.067AspCys: 1.067 ± 0.01
2.644AspAsp: 2.644 ± 0.016
3.467AspGlu: 3.467 ± 0.017
2.124AspPhe: 2.124 ± 0.012
3.313AspGly: 3.313 ± 0.022
1.14AspHis: 1.14 ± 0.007
2.639AspIle: 2.639 ± 0.013
2.568AspLys: 2.568 ± 0.015
5.039AspLeu: 5.039 ± 0.019
1.119AspMet: 1.119 ± 0.008
1.702AspAsn: 1.702 ± 0.011
2.893AspPro: 2.893 ± 0.013
1.839AspGln: 1.839 ± 0.01
2.419AspArg: 2.419 ± 0.014
4.181AspSer: 4.181 ± 0.018
2.48AspThr: 2.48 ± 0.012
3.096AspVal: 3.096 ± 0.017
0.628AspTrp: 0.628 ± 0.006
1.502AspTyr: 1.502 ± 0.009
0.001AspXaa: 0.001 ± 0.0
Glu
5.258GluAla: 5.258 ± 0.027
1.488GluCys: 1.488 ± 0.019
4.493GluAsp: 4.493 ± 0.021
8.039GluGlu: 8.039 ± 0.037
2.078GluPhe: 2.078 ± 0.011
4.195GluGly: 4.195 ± 0.021
1.534GluHis: 1.534 ± 0.009
3.242GluIle: 3.242 ± 0.016
5.495GluLys: 5.495 ± 0.03
6.534GluLeu: 6.534 ± 0.027
1.705GluMet: 1.705 ± 0.01
3.238GluAsn: 3.238 ± 0.013
3.187GluPro: 3.187 ± 0.018
3.179GluGln: 3.179 ± 0.016
3.998GluArg: 3.998 ± 0.02
4.4GluSer: 4.4 ± 0.019
3.439GluThr: 3.439 ± 0.017
4.207GluVal: 4.207 ± 0.018
0.717GluTrp: 0.717 ± 0.006
1.625GluTyr: 1.625 ± 0.012
0.001GluXaa: 0.001 ± 0.0
Phe
1.961PheAla: 1.961 ± 0.01
0.934PheCys: 0.934 ± 0.008
1.695PheAsp: 1.695 ± 0.009
2.033PheGlu: 2.033 ± 0.011
1.588PhePhe: 1.588 ± 0.011
2.198PheGly: 2.198 ± 0.012
1.062PheHis: 1.062 ± 0.008
1.838PheIle: 1.838 ± 0.011
1.832PheLys: 1.832 ± 0.01
4.082PheLeu: 4.082 ± 0.019
0.78PheMet: 0.78 ± 0.007
1.361PheAsn: 1.361 ± 0.011
2.019PhePro: 2.019 ± 0.01
1.814PheGln: 1.814 ± 0.011
1.973PheArg: 1.973 ± 0.013
3.427PheSer: 3.427 ± 0.017
2.055PheThr: 2.055 ± 0.011
2.135PheVal: 2.135 ± 0.011
0.497PheTrp: 0.497 ± 0.005
1.177PheTyr: 1.177 ± 0.008
0.001PheXaa: 0.001 ± 0.0
Gly
4.332GlyAla: 4.332 ± 0.022
1.288GlyCys: 1.288 ± 0.011
3.145GlyAsp: 3.145 ± 0.015
4.171GlyGlu: 4.171 ± 0.023
2.415GlyPhe: 2.415 ± 0.013
4.849GlyGly: 4.849 ± 0.029
1.683GlyHis: 1.683 ± 0.011
2.799GlyIle: 2.799 ± 0.014
3.871GlyLys: 3.871 ± 0.016
5.855GlyLeu: 5.855 ± 0.02
1.335GlyMet: 1.335 ± 0.009
2.39GlyAsn: 2.39 ± 0.013
4.135GlyPro: 4.135 ± 0.032
2.777GlyGln: 2.777 ± 0.014
3.707GlyArg: 3.707 ± 0.018
5.719GlySer: 5.719 ± 0.021
3.56GlyThr: 3.56 ± 0.015
3.542GlyVal: 3.542 ± 0.019
0.802GlyTrp: 0.802 ± 0.008
1.727GlyTyr: 1.727 ± 0.012
0.003GlyXaa: 0.003 ± 0.0
His
1.32HisAla: 1.32 ± 0.009
0.747HisCys: 0.747 ± 0.007
0.903HisAsp: 0.903 ± 0.007
1.345HisGlu: 1.345 ± 0.009
1.088HisPhe: 1.088 ± 0.006
1.561HisGly: 1.561 ± 0.009
0.901HisHis: 0.901 ± 0.009
1.278HisIle: 1.278 ± 0.008
1.298HisLys: 1.298 ± 0.009
2.972HisLeu: 2.972 ± 0.014
0.596HisMet: 0.596 ± 0.006
0.88HisAsn: 0.88 ± 0.007
1.661HisPro: 1.661 ± 0.01
1.38HisGln: 1.38 ± 0.01
1.597HisArg: 1.597 ± 0.01
2.32HisSer: 2.32 ± 0.013
1.562HisThr: 1.562 ± 0.012
1.486HisVal: 1.486 ± 0.01
0.35HisTrp: 0.35 ± 0.004
0.808HisTyr: 0.808 ± 0.007
0.001HisXaa: 0.001 ± 0.0
Ile
2.601IleAla: 2.601 ± 0.013
1.102IleCys: 1.102 ± 0.009
2.052IleAsp: 2.052 ± 0.012
2.635IleGlu: 2.635 ± 0.016
1.918IlePhe: 1.918 ± 0.013
2.227IleGly: 2.227 ± 0.013
1.403IleHis: 1.403 ± 0.01
2.436IleIle: 2.436 ± 0.014
2.692IleLys: 2.692 ± 0.014
4.629IleLeu: 4.629 ± 0.019
1.008IleMet: 1.008 ± 0.006
1.885IleAsn: 1.885 ± 0.011
2.629IlePro: 2.629 ± 0.013
2.351IleGln: 2.351 ± 0.014
2.399IleArg: 2.399 ± 0.013
3.755IleSer: 3.755 ± 0.016
2.599IleThr: 2.599 ± 0.016
2.517IleVal: 2.517 ± 0.016
0.522IleTrp: 0.522 ± 0.005
1.404IleTyr: 1.404 ± 0.01
0.001IleXaa: 0.001 ± 0.0
Lys
4.039LysAla: 4.039 ± 0.02
1.194LysCys: 1.194 ± 0.012
3.204LysAsp: 3.204 ± 0.017
5.211LysGlu: 5.211 ± 0.03
1.758LysPhe: 1.758 ± 0.01
3.273LysGly: 3.273 ± 0.019
1.423LysHis: 1.423 ± 0.01
2.855LysIle: 2.855 ± 0.014
4.727LysLys: 4.727 ± 0.029
5.263LysLeu: 5.263 ± 0.02
1.469LysMet: 1.469 ± 0.01
2.446LysAsn: 2.446 ± 0.012
3.144LysPro: 3.144 ± 0.018
2.658LysGln: 2.658 ± 0.014
3.321LysArg: 3.321 ± 0.017
3.925LysSer: 3.925 ± 0.02
3.149LysThr: 3.149 ± 0.013
3.496LysVal: 3.496 ± 0.018
0.62LysTrp: 0.62 ± 0.006
1.589LysTyr: 1.589 ± 0.009
0.001LysXaa: 0.001 ± 0.0
Leu
6.709LeuAla: 6.709 ± 0.026
2.211LeuCys: 2.211 ± 0.013
4.738LeuAsp: 4.738 ± 0.016
7.286LeuGlu: 7.286 ± 0.032
3.388LeuPhe: 3.388 ± 0.018
5.847LeuGly: 5.847 ± 0.022
2.761LeuHis: 2.761 ± 0.015
3.942LeuIle: 3.942 ± 0.017
5.899LeuLys: 5.899 ± 0.024
10.759LeuLeu: 10.759 ± 0.041
2.024LeuMet: 2.024 ± 0.011
3.541LeuAsn: 3.541 ± 0.018
5.972LeuPro: 5.972 ± 0.021
5.861LeuGln: 5.861 ± 0.028
5.865LeuArg: 5.865 ± 0.021
7.939LeuSer: 7.939 ± 0.027
5.121LeuThr: 5.121 ± 0.018
5.491LeuVal: 5.491 ± 0.021
1.148LeuTrp: 1.148 ± 0.009
2.546LeuTyr: 2.546 ± 0.011
0.003LeuXaa: 0.003 ± 0.0
Met
1.908MetAla: 1.908 ± 0.01
0.408MetCys: 0.408 ± 0.005
1.264MetAsp: 1.264 ± 0.007
1.912MetGlu: 1.912 ± 0.009
0.736MetPhe: 0.736 ± 0.006
1.322MetGly: 1.322 ± 0.01
0.479MetHis: 0.479 ± 0.005
0.862MetIle: 0.862 ± 0.007
1.477MetLys: 1.477 ± 0.009
1.995MetLeu: 1.995 ± 0.011
0.574MetMet: 0.574 ± 0.005
0.923MetAsn: 0.923 ± 0.008
1.104MetPro: 1.104 ± 0.011
0.931MetGln: 0.931 ± 0.008
1.059MetArg: 1.059 ± 0.007
1.573MetSer: 1.573 ± 0.01
1.135MetThr: 1.135 ± 0.007
1.39MetVal: 1.39 ± 0.009
0.247MetTrp: 0.247 ± 0.004
0.57MetTyr: 0.57 ± 0.005
0.001MetXaa: 0.001 ± 0.0
Asn
2.064AsnAla: 2.064 ± 0.011
0.855AsnCys: 0.855 ± 0.008
1.553AsnAsp: 1.553 ± 0.01
2.295AsnGlu: 2.295 ± 0.012
1.502AsnPhe: 1.502 ± 0.009
2.468AsnGly: 2.468 ± 0.014
0.978AsnHis: 0.978 ± 0.007
2.173AsnIle: 2.173 ± 0.012
2.26AsnLys: 2.26 ± 0.013
3.808AsnLeu: 3.808 ± 0.016
0.924AsnMet: 0.924 ± 0.007
1.562AsnAsn: 1.562 ± 0.012
2.193AsnPro: 2.193 ± 0.012
1.673AsnGln: 1.673 ± 0.012
1.862AsnArg: 1.862 ± 0.009
3.171AsnSer: 3.171 ± 0.014
1.971AsnThr: 1.971 ± 0.011
2.211AsnVal: 2.211 ± 0.011
0.46AsnTrp: 0.46 ± 0.005
1.144AsnTyr: 1.144 ± 0.008
0.0AsnXaa: 0.0 ± 0.0
Pro
4.677ProAla: 4.677 ± 0.024
1.138ProCys: 1.138 ± 0.011
2.773ProAsp: 2.773 ± 0.014
4.416ProGlu: 4.416 ± 0.02
1.914ProPhe: 1.914 ± 0.012
5.143ProGly: 5.143 ± 0.037
1.457ProHis: 1.457 ± 0.01
1.926ProIle: 1.926 ± 0.011
2.774ProLys: 2.774 ± 0.018
5.189ProLeu: 5.189 ± 0.019
1.062ProMet: 1.062 ± 0.009
1.839ProAsn: 1.839 ± 0.011
5.808ProPro: 5.808 ± 0.038
2.82ProGln: 2.82 ± 0.015
3.289ProArg: 3.289 ± 0.018
5.627ProSer: 5.627 ± 0.023
3.091ProThr: 3.091 ± 0.017
3.763ProVal: 3.763 ± 0.017
0.71ProTrp: 0.71 ± 0.007
1.538ProTyr: 1.538 ± 0.012
0.004ProXaa: 0.004 ± 0.0
Gln
3.544GlnAla: 3.544 ± 0.019
0.965GlnCys: 0.965 ± 0.01
2.356GlnAsp: 2.356 ± 0.013
3.994GlnGlu: 3.994 ± 0.02
1.386GlnPhe: 1.386 ± 0.008
2.88GlnGly: 2.88 ± 0.014
1.308GlnHis: 1.308 ± 0.01
2.026GlnIle: 2.026 ± 0.011
3.023GlnLys: 3.023 ± 0.016
4.762GlnLeu: 4.762 ± 0.024
1.137GlnMet: 1.137 ± 0.008
1.891GlnAsn: 1.891 ± 0.012
2.791GlnPro: 2.791 ± 0.016
3.144GlnGln: 3.144 ± 0.029
2.982GlnArg: 2.982 ± 0.017
3.13GlnSer: 3.13 ± 0.016
2.295GlnThr: 2.295 ± 0.013
2.834GlnVal: 2.834 ± 0.012
0.564GlnTrp: 0.564 ± 0.005
1.166GlnTyr: 1.166 ± 0.008
0.001GlnXaa: 0.001 ± 0.0
Arg
3.748ArgAla: 3.748 ± 0.017
1.222ArgCys: 1.222 ± 0.012
2.748ArgAsp: 2.748 ± 0.012
3.964ArgGlu: 3.964 ± 0.019
1.842ArgPhe: 1.842 ± 0.008
3.58ArgGly: 3.58 ± 0.02
1.541ArgHis: 1.541 ± 0.01
2.477ArgIle: 2.477 ± 0.012
3.685ArgLys: 3.685 ± 0.017
5.377ArgLeu: 5.377 ± 0.021
1.212ArgMet: 1.212 ± 0.009
2.16ArgAsn: 2.16 ± 0.011
3.169ArgPro: 3.169 ± 0.017
2.625ArgGln: 2.625 ± 0.014
4.223ArgArg: 4.223 ± 0.025
4.215ArgSer: 4.215 ± 0.024
2.811ArgThr: 2.811 ± 0.013
3.143ArgVal: 3.143 ± 0.019
0.715ArgTrp: 0.715 ± 0.007
1.45ArgTyr: 1.45 ± 0.009
0.002ArgXaa: 0.002 ± 0.0
Ser
5.209SerAla: 5.209 ± 0.02
1.866SerCys: 1.866 ± 0.013
3.873SerAsp: 3.873 ± 0.017
5.211SerGlu: 5.211 ± 0.021
3.064SerPhe: 3.064 ± 0.014
5.549SerGly: 5.549 ± 0.021
2.15SerHis: 2.15 ± 0.012
3.214SerIle: 3.214 ± 0.015
4.105SerLys: 4.105 ± 0.021
8.22SerLeu: 8.22 ± 0.029
1.608SerMet: 1.608 ± 0.009
2.731SerAsn: 2.731 ± 0.016
5.885SerPro: 5.885 ± 0.03
3.891SerGln: 3.891 ± 0.017
4.521SerArg: 4.521 ± 0.023
9.217SerSer: 9.217 ± 0.046
4.44SerThr: 4.44 ± 0.021
4.823SerVal: 4.823 ± 0.019
1.104SerTrp: 1.104 ± 0.008
2.09SerTyr: 2.09 ± 0.01
0.003SerXaa: 0.003 ± 0.0
Thr
3.741ThrAla: 3.741 ± 0.018
1.303ThrCys: 1.303 ± 0.012
2.469ThrAsp: 2.469 ± 0.012
3.557ThrGlu: 3.557 ± 0.017
2.102ThrPhe: 2.102 ± 0.012
3.57ThrGly: 3.57 ± 0.019
1.348ThrHis: 1.348 ± 0.009
2.41ThrIle: 2.41 ± 0.014
2.691ThrLys: 2.691 ± 0.013
5.288ThrLeu: 5.288 ± 0.018
1.127ThrMet: 1.127 ± 0.008
1.77ThrAsn: 1.77 ± 0.01
3.557ThrPro: 3.557 ± 0.02
2.312ThrGln: 2.312 ± 0.012
2.47ThrArg: 2.47 ± 0.011
4.613ThrSer: 4.613 ± 0.024
3.017ThrThr: 3.017 ± 0.023
3.839ThrVal: 3.839 ± 0.02
0.712ThrTrp: 0.712 ± 0.008
1.438ThrTyr: 1.438 ± 0.01
0.002ThrXaa: 0.002 ± 0.0
Val
4.235ValAla: 4.235 ± 0.016
1.431ValCys: 1.431 ± 0.01
2.958ValAsp: 2.958 ± 0.015
3.883ValGlu: 3.883 ± 0.019
2.403ValPhe: 2.403 ± 0.012
3.379ValGly: 3.379 ± 0.015
1.566ValHis: 1.566 ± 0.009
2.921ValIle: 2.921 ± 0.014
3.41ValLys: 3.41 ± 0.019
6.144ValLeu: 6.144 ± 0.021
1.326ValMet: 1.326 ± 0.009
2.272ValAsn: 2.272 ± 0.013
3.587ValPro: 3.587 ± 0.017
2.742ValGln: 2.742 ± 0.013
3.018ValArg: 3.018 ± 0.012
4.826ValSer: 4.826 ± 0.02
3.763ValThr: 3.763 ± 0.026
3.961ValVal: 3.961 ± 0.02
0.722ValTrp: 0.722 ± 0.006
1.623ValTyr: 1.623 ± 0.01
0.001ValXaa: 0.001 ± 0.0
Trp
0.797TrpAla: 0.797 ± 0.006
0.253TrpCys: 0.253 ± 0.004
0.664TrpAsp: 0.664 ± 0.006
0.838TrpGlu: 0.838 ± 0.007
0.442TrpPhe: 0.442 ± 0.005
0.75TrpGly: 0.75 ± 0.008
0.318TrpHis: 0.318 ± 0.004
0.556TrpIle: 0.556 ± 0.006
0.821TrpLys: 0.821 ± 0.006
1.232TrpLeu: 1.232 ± 0.009
0.322TrpMet: 0.322 ± 0.004
0.551TrpAsn: 0.551 ± 0.006
0.543TrpPro: 0.543 ± 0.005
0.552TrpGln: 0.552 ± 0.005
0.745TrpArg: 0.745 ± 0.006
0.894TrpSer: 0.894 ± 0.008
0.678TrpThr: 0.678 ± 0.008
0.707TrpVal: 0.707 ± 0.006
0.203TrpTrp: 0.203 ± 0.003
0.335TrpTyr: 0.335 ± 0.004
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.391TyrAla: 1.391 ± 0.009
0.679TyrCys: 0.679 ± 0.006
1.298TyrAsp: 1.298 ± 0.011
1.728TyrGlu: 1.728 ± 0.011
1.221TyrPhe: 1.221 ± 0.009
1.665TyrGly: 1.665 ± 0.011
0.757TyrHis: 0.757 ± 0.006
1.392TyrIle: 1.392 ± 0.009
1.518TyrLys: 1.518 ± 0.012
2.663TyrLeu: 2.663 ± 0.011
0.608TyrMet: 0.608 ± 0.006
1.11TyrAsn: 1.11 ± 0.009
1.296TyrPro: 1.296 ± 0.01
1.29TyrGln: 1.29 ± 0.008
1.621TyrArg: 1.621 ± 0.009
2.186TyrSer: 2.186 ± 0.011
1.464TyrThr: 1.464 ± 0.011
1.573TyrVal: 1.573 ± 0.009
0.367TyrTrp: 0.367 ± 0.006
0.949TyrTyr: 0.949 ± 0.007
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.004XaaAla: 0.004 ± 0.0
0.001XaaCys: 0.001 ± 0.0
0.001XaaAsp: 0.001 ± 0.0
0.002XaaGlu: 0.002 ± 0.0
0.001XaaPhe: 0.001 ± 0.0
0.004XaaGly: 0.004 ± 0.0
0.001XaaHis: 0.001 ± 0.0
0.001XaaIle: 0.001 ± 0.0
0.001XaaLys: 0.001 ± 0.0
0.003XaaLeu: 0.003 ± 0.0
0.001XaaMet: 0.001 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.003XaaPro: 0.003 ± 0.0
0.001XaaGln: 0.001 ± 0.0
0.003XaaArg: 0.003 ± 0.0
0.002XaaSer: 0.002 ± 0.0
0.001XaaThr: 0.001 ± 0.0
0.001XaaVal: 0.001 ± 0.0
0.001XaaTrp: 0.001 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.419XaaXaa: 0.419 ± 0.034
Statistics based on 42998 proteins (22681937 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski