Amino acid dipepetide frequency for Rhinopithecus bieti (Black snub-nosed monkey) (Pygathrix bieti)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.735AlaAla: 6.735 ± 0.034
1.394AlaCys: 1.394 ± 0.01
2.9AlaAsp: 2.9 ± 0.012
4.747AlaGlu: 4.747 ± 0.021
2.63AlaPhe: 2.63 ± 0.015
4.692AlaGly: 4.692 ± 0.025
1.554AlaHis: 1.554 ± 0.01
2.783AlaIle: 2.783 ± 0.013
3.462AlaLys: 3.462 ± 0.022
7.068AlaLeu: 7.068 ± 0.028
1.489AlaMet: 1.489 ± 0.009
2.031AlaAsn: 2.031 ± 0.01
4.112AlaPro: 4.112 ± 0.023
3.271AlaGln: 3.271 ± 0.017
3.667AlaArg: 3.667 ± 0.016
5.735AlaSer: 5.735 ± 0.022
3.636AlaThr: 3.636 ± 0.018
4.681AlaVal: 4.681 ± 0.018
0.816AlaTrp: 0.816 ± 0.007
1.529AlaTyr: 1.529 ± 0.011
0.001AlaXaa: 0.001 ± 0.0
Cys
1.235CysAla: 1.235 ± 0.009
0.665CysCys: 0.665 ± 0.009
1.002CysAsp: 1.002 ± 0.011
1.299CysGlu: 1.299 ± 0.011
0.857CysPhe: 0.857 ± 0.006
1.744CysGly: 1.744 ± 0.016
0.678CysHis: 0.678 ± 0.007
0.961CysIle: 0.961 ± 0.009
1.169CysLys: 1.169 ± 0.009
2.214CysLeu: 2.214 ± 0.012
0.427CysMet: 0.427 ± 0.004
0.822CysAsn: 0.822 ± 0.009
1.402CysPro: 1.402 ± 0.014
1.077CysGln: 1.077 ± 0.009
1.305CysArg: 1.305 ± 0.009
2.011CysSer: 2.011 ± 0.014
1.098CysThr: 1.098 ± 0.008
1.3CysVal: 1.3 ± 0.011
0.303CysTrp: 0.303 ± 0.004
0.579CysTyr: 0.579 ± 0.005
0.0CysXaa: 0.0 ± 0.0
Asp
2.856AspAla: 2.856 ± 0.012
1.068AspCys: 1.068 ± 0.011
2.646AspAsp: 2.646 ± 0.017
3.459AspGlu: 3.459 ± 0.017
2.136AspPhe: 2.136 ± 0.012
3.28AspGly: 3.28 ± 0.018
1.138AspHis: 1.138 ± 0.007
2.601AspIle: 2.601 ± 0.013
2.564AspLys: 2.564 ± 0.014
4.984AspLeu: 4.984 ± 0.017
1.126AspMet: 1.126 ± 0.008
1.692AspAsn: 1.692 ± 0.012
2.903AspPro: 2.903 ± 0.015
1.843AspGln: 1.843 ± 0.012
2.442AspArg: 2.442 ± 0.014
4.187AspSer: 4.187 ± 0.022
2.471AspThr: 2.471 ± 0.013
3.084AspVal: 3.084 ± 0.016
0.63AspTrp: 0.63 ± 0.006
1.506AspTyr: 1.506 ± 0.011
0.0AspXaa: 0.0 ± 0.0
Glu
5.251GluAla: 5.251 ± 0.025
1.478GluCys: 1.478 ± 0.016
4.471GluAsp: 4.471 ± 0.017
7.97GluGlu: 7.97 ± 0.044
2.058GluPhe: 2.058 ± 0.01
4.196GluGly: 4.196 ± 0.017
1.526GluHis: 1.526 ± 0.01
3.167GluIle: 3.167 ± 0.015
5.407GluLys: 5.407 ± 0.031
6.492GluLeu: 6.492 ± 0.027
1.707GluMet: 1.707 ± 0.011
3.187GluAsn: 3.187 ± 0.017
3.215GluPro: 3.215 ± 0.017
3.156GluGln: 3.156 ± 0.019
4.029GluArg: 4.029 ± 0.02
4.372GluSer: 4.372 ± 0.017
3.403GluThr: 3.403 ± 0.015
4.189GluVal: 4.189 ± 0.015
0.706GluTrp: 0.706 ± 0.006
1.608GluTyr: 1.608 ± 0.011
0.0GluXaa: 0.0 ± 0.0
Phe
1.973PheAla: 1.973 ± 0.011
0.926PheCys: 0.926 ± 0.007
1.685PheAsp: 1.685 ± 0.009
2.017PheGlu: 2.017 ± 0.011
1.586PhePhe: 1.586 ± 0.012
2.202PheGly: 2.202 ± 0.012
1.053PheHis: 1.053 ± 0.008
1.828PheIle: 1.828 ± 0.012
1.815PheLys: 1.815 ± 0.01
4.03PheLeu: 4.03 ± 0.018
0.778PheMet: 0.778 ± 0.007
1.345PheAsn: 1.345 ± 0.009
2.02PhePro: 2.02 ± 0.011
1.818PheGln: 1.818 ± 0.011
1.993PheArg: 1.993 ± 0.011
3.422PheSer: 3.422 ± 0.016
2.044PheThr: 2.044 ± 0.01
2.13PheVal: 2.13 ± 0.01
0.491PheTrp: 0.491 ± 0.005
1.165PheTyr: 1.165 ± 0.009
0.0PheXaa: 0.0 ± 0.0
Gly
4.492GlyAla: 4.492 ± 0.022
1.276GlyCys: 1.276 ± 0.009
3.139GlyAsp: 3.139 ± 0.015
4.18GlyGlu: 4.18 ± 0.023
2.416GlyPhe: 2.416 ± 0.013
4.931GlyGly: 4.931 ± 0.028
1.716GlyHis: 1.716 ± 0.01
2.789GlyIle: 2.789 ± 0.013
3.877GlyLys: 3.877 ± 0.02
5.882GlyLeu: 5.882 ± 0.025
1.324GlyMet: 1.324 ± 0.009
2.366GlyAsn: 2.366 ± 0.012
4.204GlyPro: 4.204 ± 0.036
2.741GlyGln: 2.741 ± 0.015
3.785GlyArg: 3.785 ± 0.017
5.807GlySer: 5.807 ± 0.025
3.58GlyThr: 3.58 ± 0.016
3.543GlyVal: 3.543 ± 0.015
0.81GlyTrp: 0.81 ± 0.008
1.727GlyTyr: 1.727 ± 0.011
0.0GlyXaa: 0.0 ± 0.0
His
1.344HisAla: 1.344 ± 0.009
0.753HisCys: 0.753 ± 0.007
0.892HisAsp: 0.892 ± 0.006
1.345HisGlu: 1.345 ± 0.01
1.088HisPhe: 1.088 ± 0.007
1.546HisGly: 1.546 ± 0.009
0.93HisHis: 0.93 ± 0.01
1.263HisIle: 1.263 ± 0.008
1.278HisLys: 1.278 ± 0.01
2.96HisLeu: 2.96 ± 0.013
0.599HisMet: 0.599 ± 0.005
0.877HisAsn: 0.877 ± 0.006
1.689HisPro: 1.689 ± 0.01
1.373HisGln: 1.373 ± 0.011
1.618HisArg: 1.618 ± 0.01
2.341HisSer: 2.341 ± 0.013
1.573HisThr: 1.573 ± 0.012
1.492HisVal: 1.492 ± 0.01
0.358HisTrp: 0.358 ± 0.004
0.802HisTyr: 0.802 ± 0.006
0.0HisXaa: 0.0 ± 0.0
Ile
2.587IleAla: 2.587 ± 0.013
1.083IleCys: 1.083 ± 0.008
2.009IleAsp: 2.009 ± 0.01
2.587IleGlu: 2.587 ± 0.016
1.885IlePhe: 1.885 ± 0.013
2.227IleGly: 2.227 ± 0.013
1.396IleHis: 1.396 ± 0.012
2.389IleIle: 2.389 ± 0.014
2.647IleLys: 2.647 ± 0.015
4.564IleLeu: 4.564 ± 0.018
1.014IleMet: 1.014 ± 0.006
1.849IleAsn: 1.849 ± 0.011
2.608IlePro: 2.608 ± 0.013
2.298IleGln: 2.298 ± 0.011
2.377IleArg: 2.377 ± 0.011
3.715IleSer: 3.715 ± 0.015
2.558IleThr: 2.558 ± 0.012
2.487IleVal: 2.487 ± 0.013
0.514IleTrp: 0.514 ± 0.005
1.382IleTyr: 1.382 ± 0.008
0.0IleXaa: 0.0 ± 0.0
Lys
4.039LysAla: 4.039 ± 0.021
1.203LysCys: 1.203 ± 0.012
3.159LysAsp: 3.159 ± 0.014
5.151LysGlu: 5.151 ± 0.028
1.743LysPhe: 1.743 ± 0.012
3.289LysGly: 3.289 ± 0.019
1.433LysHis: 1.433 ± 0.01
2.814LysIle: 2.814 ± 0.014
4.713LysLys: 4.713 ± 0.028
5.192LysLeu: 5.192 ± 0.021
1.453LysMet: 1.453 ± 0.01
2.399LysAsn: 2.399 ± 0.015
3.137LysPro: 3.137 ± 0.018
2.648LysGln: 2.648 ± 0.017
3.333LysArg: 3.333 ± 0.014
3.906LysSer: 3.906 ± 0.017
3.119LysThr: 3.119 ± 0.015
3.501LysVal: 3.501 ± 0.017
0.605LysTrp: 0.605 ± 0.006
1.593LysTyr: 1.593 ± 0.014
0.0LysXaa: 0.0 ± 0.0
Leu
6.767LeuAla: 6.767 ± 0.026
2.209LeuCys: 2.209 ± 0.013
4.724LeuAsp: 4.724 ± 0.017
7.195LeuGlu: 7.195 ± 0.027
3.366LeuPhe: 3.366 ± 0.017
5.887LeuGly: 5.887 ± 0.021
2.746LeuHis: 2.746 ± 0.012
3.853LeuIle: 3.853 ± 0.015
5.838LeuLys: 5.838 ± 0.025
10.76LeuLeu: 10.76 ± 0.041
2.014LeuMet: 2.014 ± 0.011
3.502LeuAsn: 3.502 ± 0.015
6.016LeuPro: 6.016 ± 0.026
5.81LeuGln: 5.81 ± 0.027
5.919LeuArg: 5.919 ± 0.023
7.933LeuSer: 7.933 ± 0.024
5.076LeuThr: 5.076 ± 0.017
5.518LeuVal: 5.518 ± 0.019
1.154LeuTrp: 1.154 ± 0.009
2.541LeuTyr: 2.541 ± 0.014
0.0LeuXaa: 0.0 ± 0.0
Met
1.93MetAla: 1.93 ± 0.011
0.403MetCys: 0.403 ± 0.004
1.254MetAsp: 1.254 ± 0.006
1.897MetGlu: 1.897 ± 0.011
0.728MetPhe: 0.728 ± 0.006
1.313MetGly: 1.313 ± 0.009
0.475MetHis: 0.475 ± 0.005
0.845MetIle: 0.845 ± 0.007
1.462MetLys: 1.462 ± 0.009
1.984MetLeu: 1.984 ± 0.01
0.575MetMet: 0.575 ± 0.006
0.913MetAsn: 0.913 ± 0.007
1.118MetPro: 1.118 ± 0.009
0.931MetGln: 0.931 ± 0.008
1.064MetArg: 1.064 ± 0.008
1.558MetSer: 1.558 ± 0.009
1.127MetThr: 1.127 ± 0.007
1.395MetVal: 1.395 ± 0.009
0.24MetTrp: 0.24 ± 0.003
0.558MetTyr: 0.558 ± 0.005
0.0MetXaa: 0.0 ± 0.0
Asn
2.052AsnAla: 2.052 ± 0.011
0.846AsnCys: 0.846 ± 0.008
1.541AsnAsp: 1.541 ± 0.011
2.259AsnGlu: 2.259 ± 0.013
1.488AsnPhe: 1.488 ± 0.01
2.454AsnGly: 2.454 ± 0.014
0.957AsnHis: 0.957 ± 0.007
2.122AsnIle: 2.122 ± 0.012
2.227AsnLys: 2.227 ± 0.013
3.761AsnLeu: 3.761 ± 0.014
0.919AsnMet: 0.919 ± 0.007
1.529AsnAsn: 1.529 ± 0.011
2.183AsnPro: 2.183 ± 0.012
1.664AsnGln: 1.664 ± 0.01
1.862AsnArg: 1.862 ± 0.01
3.142AsnSer: 3.142 ± 0.015
1.978AsnThr: 1.978 ± 0.013
2.207AsnVal: 2.207 ± 0.012
0.454AsnTrp: 0.454 ± 0.005
1.141AsnTyr: 1.141 ± 0.009
0.0AsnXaa: 0.0 ± 0.0
Pro
4.821ProAla: 4.821 ± 0.026
1.16ProCys: 1.16 ± 0.012
2.797ProAsp: 2.797 ± 0.016
4.427ProGlu: 4.427 ± 0.02
1.916ProPhe: 1.916 ± 0.012
5.24ProGly: 5.24 ± 0.042
1.481ProHis: 1.481 ± 0.01
1.902ProIle: 1.902 ± 0.011
2.775ProLys: 2.775 ± 0.017
5.276ProLeu: 5.276 ± 0.023
1.036ProMet: 1.036 ± 0.008
1.829ProAsn: 1.829 ± 0.011
5.866ProPro: 5.866 ± 0.046
2.83ProGln: 2.83 ± 0.016
3.366ProArg: 3.366 ± 0.017
5.728ProSer: 5.728 ± 0.032
3.133ProThr: 3.133 ± 0.019
3.798ProVal: 3.798 ± 0.017
0.727ProTrp: 0.727 ± 0.007
1.555ProTyr: 1.555 ± 0.012
0.001ProXaa: 0.001 ± 0.0
Gln
3.535GlnAla: 3.535 ± 0.018
0.953GlnCys: 0.953 ± 0.009
2.348GlnAsp: 2.348 ± 0.013
3.952GlnGlu: 3.952 ± 0.02
1.374GlnPhe: 1.374 ± 0.009
2.881GlnGly: 2.881 ± 0.017
1.315GlnHis: 1.315 ± 0.009
1.999GlnIle: 1.999 ± 0.01
2.982GlnLys: 2.982 ± 0.016
4.723GlnLeu: 4.723 ± 0.022
1.119GlnMet: 1.119 ± 0.008
1.882GlnAsn: 1.882 ± 0.012
2.825GlnPro: 2.825 ± 0.018
3.13GlnGln: 3.13 ± 0.027
3.015GlnArg: 3.015 ± 0.015
3.155GlnSer: 3.155 ± 0.015
2.292GlnThr: 2.292 ± 0.012
2.839GlnVal: 2.839 ± 0.013
0.556GlnTrp: 0.556 ± 0.005
1.145GlnTyr: 1.145 ± 0.008
0.0GlnXaa: 0.0 ± 0.0
Arg
3.841ArgAla: 3.841 ± 0.018
1.23ArgCys: 1.23 ± 0.011
2.791ArgAsp: 2.791 ± 0.013
4.003ArgGlu: 4.003 ± 0.018
1.837ArgPhe: 1.837 ± 0.01
3.67ArgGly: 3.67 ± 0.019
1.565ArgHis: 1.565 ± 0.01
2.467ArgIle: 2.467 ± 0.013
3.714ArgLys: 3.714 ± 0.018
5.434ArgLeu: 5.434 ± 0.021
1.216ArgMet: 1.216 ± 0.009
2.151ArgAsn: 2.151 ± 0.009
3.252ArgPro: 3.252 ± 0.018
2.65ArgGln: 2.65 ± 0.014
4.35ArgArg: 4.35 ± 0.023
4.292ArgSer: 4.292 ± 0.023
2.833ArgThr: 2.833 ± 0.012
3.151ArgVal: 3.151 ± 0.013
0.712ArgTrp: 0.712 ± 0.006
1.434ArgTyr: 1.434 ± 0.008
0.0ArgXaa: 0.0 ± 0.0
Ser
5.264SerAla: 5.264 ± 0.02
1.855SerCys: 1.855 ± 0.013
3.889SerAsp: 3.889 ± 0.022
5.193SerGlu: 5.193 ± 0.021
3.053SerPhe: 3.053 ± 0.015
5.619SerGly: 5.619 ± 0.026
2.17SerHis: 2.17 ± 0.012
3.205SerIle: 3.205 ± 0.016
4.086SerLys: 4.086 ± 0.017
8.23SerLeu: 8.23 ± 0.024
1.618SerMet: 1.618 ± 0.01
2.724SerAsn: 2.724 ± 0.014
6.029SerPro: 6.029 ± 0.035
3.914SerGln: 3.914 ± 0.017
4.588SerArg: 4.588 ± 0.022
9.336SerSer: 9.336 ± 0.05
4.436SerThr: 4.436 ± 0.024
4.833SerVal: 4.833 ± 0.02
1.079SerTrp: 1.079 ± 0.008
2.071SerTyr: 2.071 ± 0.01
0.0SerXaa: 0.0 ± 0.0
Thr
3.729ThrAla: 3.729 ± 0.018
1.297ThrCys: 1.297 ± 0.011
2.463ThrAsp: 2.463 ± 0.012
3.528ThrGlu: 3.528 ± 0.016
2.104ThrPhe: 2.104 ± 0.011
3.604ThrGly: 3.604 ± 0.018
1.342ThrHis: 1.342 ± 0.01
2.364ThrIle: 2.364 ± 0.012
2.654ThrLys: 2.654 ± 0.014
5.278ThrLeu: 5.278 ± 0.019
1.122ThrMet: 1.122 ± 0.008
1.764ThrAsn: 1.764 ± 0.011
3.612ThrPro: 3.612 ± 0.023
2.311ThrGln: 2.311 ± 0.011
2.474ThrArg: 2.474 ± 0.01
4.649ThrSer: 4.649 ± 0.02
3.008ThrThr: 3.008 ± 0.025
3.787ThrVal: 3.787 ± 0.015
0.698ThrTrp: 0.698 ± 0.006
1.417ThrTyr: 1.417 ± 0.009
0.0ThrXaa: 0.0 ± 0.0
Val
4.282ValAla: 4.282 ± 0.018
1.424ValCys: 1.424 ± 0.009
2.973ValAsp: 2.973 ± 0.015
3.831ValGlu: 3.831 ± 0.014
2.387ValPhe: 2.387 ± 0.012
3.396ValGly: 3.396 ± 0.016
1.571ValHis: 1.571 ± 0.009
2.897ValIle: 2.897 ± 0.014
3.387ValLys: 3.387 ± 0.015
6.138ValLeu: 6.138 ± 0.023
1.316ValMet: 1.316 ± 0.009
2.265ValAsn: 2.265 ± 0.012
3.625ValPro: 3.625 ± 0.018
2.734ValGln: 2.734 ± 0.013
3.037ValArg: 3.037 ± 0.013
4.835ValSer: 4.835 ± 0.019
3.738ValThr: 3.738 ± 0.017
3.94ValVal: 3.94 ± 0.017
0.724ValTrp: 0.724 ± 0.007
1.624ValTyr: 1.624 ± 0.01
0.0ValXaa: 0.0 ± 0.0
Trp
0.809TrpAla: 0.809 ± 0.006
0.251TrpCys: 0.251 ± 0.004
0.652TrpAsp: 0.652 ± 0.007
0.826TrpGlu: 0.826 ± 0.007
0.445TrpPhe: 0.445 ± 0.005
0.755TrpGly: 0.755 ± 0.008
0.318TrpHis: 0.318 ± 0.004
0.549TrpIle: 0.549 ± 0.006
0.822TrpLys: 0.822 ± 0.007
1.218TrpLeu: 1.218 ± 0.008
0.324TrpMet: 0.324 ± 0.004
0.546TrpAsn: 0.546 ± 0.006
0.552TrpPro: 0.552 ± 0.005
0.54TrpGln: 0.54 ± 0.005
0.746TrpArg: 0.746 ± 0.007
0.887TrpSer: 0.887 ± 0.007
0.673TrpThr: 0.673 ± 0.007
0.692TrpVal: 0.692 ± 0.006
0.199TrpTrp: 0.199 ± 0.003
0.334TrpTyr: 0.334 ± 0.004
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.387TyrAla: 1.387 ± 0.009
0.676TyrCys: 0.676 ± 0.006
1.28TyrAsp: 1.28 ± 0.011
1.722TyrGlu: 1.722 ± 0.011
1.221TyrPhe: 1.221 ± 0.009
1.659TyrGly: 1.659 ± 0.011
0.76TyrHis: 0.76 ± 0.007
1.378TyrIle: 1.378 ± 0.009
1.549TyrLys: 1.549 ± 0.019
2.64TyrLeu: 2.64 ± 0.012
0.597TyrMet: 0.597 ± 0.006
1.114TyrAsn: 1.114 ± 0.01
1.295TyrPro: 1.295 ± 0.008
1.279TyrGln: 1.279 ± 0.008
1.626TyrArg: 1.626 ± 0.011
2.18TyrSer: 2.18 ± 0.011
1.43TyrThr: 1.43 ± 0.01
1.556TyrVal: 1.556 ± 0.01
0.365TyrTrp: 0.365 ± 0.005
0.934TyrTyr: 0.934 ± 0.008
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.001XaaGly: 0.001 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.114XaaXaa: 0.114 ± 0.026
Statistics based on 42660 proteins (22132529 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski