Amino acid dipepetide frequency for Nomascus leucogenys (Northern white-cheeked gibbon) (Hylobates leucogenys)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.524AlaAla: 6.524 ± 0.037
1.39AlaCys: 1.39 ± 0.01
2.886AlaAsp: 2.886 ± 0.014
4.703AlaGlu: 4.703 ± 0.028
2.635AlaPhe: 2.635 ± 0.014
4.607AlaGly: 4.607 ± 0.025
1.558AlaHis: 1.558 ± 0.012
2.806AlaIle: 2.806 ± 0.014
3.483AlaLys: 3.483 ± 0.021
7.0AlaLeu: 7.0 ± 0.03
1.486AlaMet: 1.486 ± 0.009
2.032AlaAsn: 2.032 ± 0.011
4.01AlaPro: 4.01 ± 0.024
3.223AlaGln: 3.223 ± 0.019
3.565AlaArg: 3.565 ± 0.02
5.656AlaSer: 5.656 ± 0.022
3.59AlaThr: 3.59 ± 0.015
4.598AlaVal: 4.598 ± 0.017
0.802AlaTrp: 0.802 ± 0.008
1.526AlaTyr: 1.526 ± 0.01
0.003AlaXaa: 0.003 ± 0.0
Cys
1.224CysAla: 1.224 ± 0.011
0.681CysCys: 0.681 ± 0.01
1.008CysAsp: 1.008 ± 0.011
1.324CysGlu: 1.324 ± 0.013
0.856CysPhe: 0.856 ± 0.007
1.801CysGly: 1.801 ± 0.017
0.681CysHis: 0.681 ± 0.007
0.985CysIle: 0.985 ± 0.01
1.192CysLys: 1.192 ± 0.011
2.215CysLeu: 2.215 ± 0.013
0.427CysMet: 0.427 ± 0.005
0.844CysAsn: 0.844 ± 0.008
1.398CysPro: 1.398 ± 0.015
1.091CysGln: 1.091 ± 0.011
1.297CysArg: 1.297 ± 0.01
2.037CysSer: 2.037 ± 0.013
1.099CysThr: 1.099 ± 0.009
1.298CysVal: 1.298 ± 0.01
0.305CysTrp: 0.305 ± 0.004
0.577CysTyr: 0.577 ± 0.005
0.0CysXaa: 0.0 ± 0.0
Asp
2.864AspAla: 2.864 ± 0.014
1.067AspCys: 1.067 ± 0.011
2.639AspAsp: 2.639 ± 0.017
3.464AspGlu: 3.464 ± 0.018
2.129AspPhe: 2.129 ± 0.01
3.252AspGly: 3.252 ± 0.019
1.151AspHis: 1.151 ± 0.008
2.62AspIle: 2.62 ± 0.014
2.571AspLys: 2.571 ± 0.013
5.012AspLeu: 5.012 ± 0.018
1.123AspMet: 1.123 ± 0.009
1.705AspAsn: 1.705 ± 0.013
2.856AspPro: 2.856 ± 0.015
1.838AspGln: 1.838 ± 0.009
2.39AspArg: 2.39 ± 0.014
4.16AspSer: 4.16 ± 0.018
2.449AspThr: 2.449 ± 0.013
3.056AspVal: 3.056 ± 0.016
0.631AspTrp: 0.631 ± 0.007
1.498AspTyr: 1.498 ± 0.01
0.001AspXaa: 0.001 ± 0.0
Glu
5.207GluAla: 5.207 ± 0.03
1.527GluCys: 1.527 ± 0.021
4.452GluAsp: 4.452 ± 0.019
7.973GluGlu: 7.973 ± 0.049
2.08GluPhe: 2.08 ± 0.013
4.128GluGly: 4.128 ± 0.017
1.529GluHis: 1.529 ± 0.012
3.237GluIle: 3.237 ± 0.016
5.562GluLys: 5.562 ± 0.033
6.503GluLeu: 6.503 ± 0.03
1.717GluMet: 1.717 ± 0.01
3.216GluAsn: 3.216 ± 0.016
3.155GluPro: 3.155 ± 0.016
3.148GluGln: 3.148 ± 0.02
3.966GluArg: 3.966 ± 0.021
4.418GluSer: 4.418 ± 0.02
3.423GluThr: 3.423 ± 0.015
4.171GluVal: 4.171 ± 0.017
0.726GluTrp: 0.726 ± 0.007
1.609GluTyr: 1.609 ± 0.011
0.002GluXaa: 0.002 ± 0.0
Phe
1.953PheAla: 1.953 ± 0.011
0.933PheCys: 0.933 ± 0.008
1.671PheAsp: 1.671 ± 0.01
2.035PheGlu: 2.035 ± 0.011
1.597PhePhe: 1.597 ± 0.011
2.209PheGly: 2.209 ± 0.014
1.074PheHis: 1.074 ± 0.009
1.836PheIle: 1.836 ± 0.011
1.839PheLys: 1.839 ± 0.01
4.06PheLeu: 4.06 ± 0.019
0.783PheMet: 0.783 ± 0.006
1.364PheAsn: 1.364 ± 0.009
2.004PhePro: 2.004 ± 0.012
1.814PheGln: 1.814 ± 0.01
1.964PheArg: 1.964 ± 0.01
3.469PheSer: 3.469 ± 0.018
2.045PheThr: 2.045 ± 0.014
2.12PheVal: 2.12 ± 0.011
0.498PheTrp: 0.498 ± 0.005
1.168PheTyr: 1.168 ± 0.008
0.001PheXaa: 0.001 ± 0.0
Gly
4.378GlyAla: 4.378 ± 0.023
1.283GlyCys: 1.283 ± 0.011
3.12GlyAsp: 3.12 ± 0.017
4.184GlyGlu: 4.184 ± 0.025
2.419GlyPhe: 2.419 ± 0.017
4.896GlyGly: 4.896 ± 0.03
1.688GlyHis: 1.688 ± 0.012
2.801GlyIle: 2.801 ± 0.014
3.898GlyLys: 3.898 ± 0.022
5.82GlyLeu: 5.82 ± 0.026
1.321GlyMet: 1.321 ± 0.01
2.384GlyAsn: 2.384 ± 0.013
4.096GlyPro: 4.096 ± 0.035
2.765GlyGln: 2.765 ± 0.019
3.704GlyArg: 3.704 ± 0.018
5.747GlySer: 5.747 ± 0.028
3.547GlyThr: 3.547 ± 0.018
3.53GlyVal: 3.53 ± 0.018
0.81GlyTrp: 0.81 ± 0.009
1.726GlyTyr: 1.726 ± 0.014
0.002GlyXaa: 0.002 ± 0.0
His
1.325HisAla: 1.325 ± 0.008
0.751HisCys: 0.751 ± 0.007
0.897HisAsp: 0.897 ± 0.007
1.347HisGlu: 1.347 ± 0.011
1.094HisPhe: 1.094 ± 0.008
1.568HisGly: 1.568 ± 0.012
0.916HisHis: 0.916 ± 0.008
1.286HisIle: 1.286 ± 0.01
1.313HisLys: 1.313 ± 0.011
2.948HisLeu: 2.948 ± 0.014
0.598HisMet: 0.598 ± 0.007
0.887HisAsn: 0.887 ± 0.008
1.674HisPro: 1.674 ± 0.01
1.389HisGln: 1.389 ± 0.015
1.595HisArg: 1.595 ± 0.01
2.357HisSer: 2.357 ± 0.014
1.595HisThr: 1.595 ± 0.017
1.49HisVal: 1.49 ± 0.01
0.357HisTrp: 0.357 ± 0.005
0.8HisTyr: 0.8 ± 0.007
0.001HisXaa: 0.001 ± 0.0
Ile
2.601IleAla: 2.601 ± 0.014
1.106IleCys: 1.106 ± 0.008
2.047IleAsp: 2.047 ± 0.012
2.657IleGlu: 2.657 ± 0.016
1.914IlePhe: 1.914 ± 0.011
2.252IleGly: 2.252 ± 0.014
1.436IleHis: 1.436 ± 0.014
2.434IleIle: 2.434 ± 0.015
2.701IleLys: 2.701 ± 0.016
4.608IleLeu: 4.608 ± 0.017
1.016IleMet: 1.016 ± 0.008
1.888IleAsn: 1.888 ± 0.011
2.653IlePro: 2.653 ± 0.015
2.348IleGln: 2.348 ± 0.014
2.374IleArg: 2.374 ± 0.012
3.737IleSer: 3.737 ± 0.017
2.587IleThr: 2.587 ± 0.015
2.5IleVal: 2.5 ± 0.013
0.524IleTrp: 0.524 ± 0.005
1.401IleTyr: 1.401 ± 0.01
0.001IleXaa: 0.001 ± 0.0
Lys
4.058LysAla: 4.058 ± 0.023
1.226LysCys: 1.226 ± 0.012
3.193LysAsp: 3.193 ± 0.016
5.24LysGlu: 5.24 ± 0.029
1.761LysPhe: 1.761 ± 0.009
3.267LysGly: 3.267 ± 0.021
1.444LysHis: 1.444 ± 0.011
2.901LysIle: 2.901 ± 0.015
4.845LysLys: 4.845 ± 0.03
5.266LysLeu: 5.266 ± 0.024
1.471LysMet: 1.471 ± 0.01
2.464LysAsn: 2.464 ± 0.015
3.185LysPro: 3.185 ± 0.019
2.686LysGln: 2.686 ± 0.014
3.319LysArg: 3.319 ± 0.015
3.955LysSer: 3.955 ± 0.026
3.164LysThr: 3.164 ± 0.016
3.495LysVal: 3.495 ± 0.016
0.62LysTrp: 0.62 ± 0.006
1.624LysTyr: 1.624 ± 0.013
0.002LysXaa: 0.002 ± 0.0
Leu
6.706LeuAla: 6.706 ± 0.025
2.217LeuCys: 2.217 ± 0.014
4.699LeuAsp: 4.699 ± 0.018
7.21LeuGlu: 7.21 ± 0.033
3.38LeuPhe: 3.38 ± 0.018
5.816LeuGly: 5.816 ± 0.024
2.746LeuHis: 2.746 ± 0.013
3.92LeuIle: 3.92 ± 0.018
5.912LeuLys: 5.912 ± 0.024
10.659LeuLeu: 10.659 ± 0.039
2.032LeuMet: 2.032 ± 0.011
3.527LeuAsn: 3.527 ± 0.016
5.972LeuPro: 5.972 ± 0.025
5.826LeuGln: 5.826 ± 0.028
5.82LeuArg: 5.82 ± 0.024
7.898LeuSer: 7.898 ± 0.025
5.103LeuThr: 5.103 ± 0.018
5.433LeuVal: 5.433 ± 0.019
1.152LeuTrp: 1.152 ± 0.009
2.55LeuTyr: 2.55 ± 0.014
0.003LeuXaa: 0.003 ± 0.0
Met
1.913MetAla: 1.913 ± 0.01
0.412MetCys: 0.412 ± 0.005
1.265MetAsp: 1.265 ± 0.009
1.906MetGlu: 1.906 ± 0.012
0.734MetPhe: 0.734 ± 0.006
1.304MetGly: 1.304 ± 0.01
0.479MetHis: 0.479 ± 0.005
0.853MetIle: 0.853 ± 0.007
1.481MetLys: 1.481 ± 0.009
1.996MetLeu: 1.996 ± 0.011
0.572MetMet: 0.572 ± 0.006
0.928MetAsn: 0.928 ± 0.008
1.1MetPro: 1.1 ± 0.009
0.934MetGln: 0.934 ± 0.007
1.056MetArg: 1.056 ± 0.008
1.572MetSer: 1.572 ± 0.01
1.134MetThr: 1.134 ± 0.009
1.395MetVal: 1.395 ± 0.008
0.25MetTrp: 0.25 ± 0.003
0.556MetTyr: 0.556 ± 0.005
0.001MetXaa: 0.001 ± 0.0
Asn
2.068AsnAla: 2.068 ± 0.013
0.854AsnCys: 0.854 ± 0.008
1.56AsnAsp: 1.56 ± 0.013
2.298AsnGlu: 2.298 ± 0.015
1.501AsnPhe: 1.501 ± 0.008
2.459AsnGly: 2.459 ± 0.016
0.966AsnHis: 0.966 ± 0.007
2.172AsnIle: 2.172 ± 0.012
2.271AsnLys: 2.271 ± 0.013
3.8AsnLeu: 3.8 ± 0.019
0.933AsnMet: 0.933 ± 0.008
1.561AsnAsn: 1.561 ± 0.011
2.168AsnPro: 2.168 ± 0.012
1.681AsnGln: 1.681 ± 0.01
1.861AsnArg: 1.861 ± 0.011
3.179AsnSer: 3.179 ± 0.016
2.002AsnThr: 2.002 ± 0.013
2.212AsnVal: 2.212 ± 0.012
0.465AsnTrp: 0.465 ± 0.004
1.14AsnTyr: 1.14 ± 0.009
0.001AsnXaa: 0.001 ± 0.0
Pro
4.68ProAla: 4.68 ± 0.027
1.149ProCys: 1.149 ± 0.013
2.74ProAsp: 2.74 ± 0.014
4.36ProGlu: 4.36 ± 0.017
1.931ProPhe: 1.931 ± 0.013
5.143ProGly: 5.143 ± 0.048
1.453ProHis: 1.453 ± 0.01
1.916ProIle: 1.916 ± 0.01
2.775ProLys: 2.775 ± 0.016
5.18ProLeu: 5.18 ± 0.023
1.042ProMet: 1.042 ± 0.008
1.831ProAsn: 1.831 ± 0.011
5.831ProPro: 5.831 ± 0.044
2.819ProGln: 2.819 ± 0.017
3.282ProArg: 3.282 ± 0.019
5.639ProSer: 5.639 ± 0.027
3.092ProThr: 3.092 ± 0.017
3.765ProVal: 3.765 ± 0.016
0.712ProTrp: 0.712 ± 0.007
1.57ProTyr: 1.57 ± 0.014
0.003ProXaa: 0.003 ± 0.0
Gln
3.503GlnAla: 3.503 ± 0.02
0.977GlnCys: 0.977 ± 0.011
2.331GlnAsp: 2.331 ± 0.012
3.947GlnGlu: 3.947 ± 0.022
1.386GlnPhe: 1.386 ± 0.008
2.849GlnGly: 2.849 ± 0.017
1.335GlnHis: 1.335 ± 0.011
2.025GlnIle: 2.025 ± 0.011
3.044GlnLys: 3.044 ± 0.017
4.74GlnLeu: 4.74 ± 0.023
1.136GlnMet: 1.136 ± 0.009
1.896GlnAsn: 1.896 ± 0.009
2.796GlnPro: 2.796 ± 0.018
3.103GlnGln: 3.103 ± 0.031
2.989GlnArg: 2.989 ± 0.018
3.158GlnSer: 3.158 ± 0.018
2.295GlnThr: 2.295 ± 0.012
2.836GlnVal: 2.836 ± 0.012
0.551GlnTrp: 0.551 ± 0.005
1.167GlnTyr: 1.167 ± 0.009
0.001GlnXaa: 0.001 ± 0.0
Arg
3.754ArgAla: 3.754 ± 0.019
1.215ArgCys: 1.215 ± 0.011
2.741ArgAsp: 2.741 ± 0.015
3.951ArgGlu: 3.951 ± 0.021
1.836ArgPhe: 1.836 ± 0.01
3.584ArgGly: 3.584 ± 0.018
1.55ArgHis: 1.55 ± 0.011
2.451ArgIle: 2.451 ± 0.013
3.684ArgLys: 3.684 ± 0.016
5.34ArgLeu: 5.34 ± 0.023
1.203ArgMet: 1.203 ± 0.009
2.145ArgAsn: 2.145 ± 0.011
3.142ArgPro: 3.142 ± 0.018
2.621ArgGln: 2.621 ± 0.015
4.24ArgArg: 4.24 ± 0.025
4.235ArgSer: 4.235 ± 0.026
2.82ArgThr: 2.82 ± 0.015
3.111ArgVal: 3.111 ± 0.013
0.709ArgTrp: 0.709 ± 0.007
1.43ArgTyr: 1.43 ± 0.008
0.001ArgXaa: 0.001 ± 0.0
Ser
5.197SerAla: 5.197 ± 0.02
1.88SerCys: 1.88 ± 0.015
3.87SerAsp: 3.87 ± 0.019
5.204SerGlu: 5.204 ± 0.02
3.08SerPhe: 3.08 ± 0.015
5.603SerGly: 5.603 ± 0.027
2.161SerHis: 2.161 ± 0.012
3.248SerIle: 3.248 ± 0.017
4.116SerLys: 4.116 ± 0.021
8.194SerLeu: 8.194 ± 0.023
1.615SerMet: 1.615 ± 0.01
2.762SerAsn: 2.762 ± 0.016
5.933SerPro: 5.933 ± 0.032
3.93SerGln: 3.93 ± 0.017
4.525SerArg: 4.525 ± 0.024
9.358SerSer: 9.358 ± 0.052
4.477SerThr: 4.477 ± 0.027
4.833SerVal: 4.833 ± 0.02
1.095SerTrp: 1.095 ± 0.009
2.066SerTyr: 2.066 ± 0.012
0.003SerXaa: 0.003 ± 0.0
Thr
3.728ThrAla: 3.728 ± 0.017
1.301ThrCys: 1.301 ± 0.013
2.466ThrAsp: 2.466 ± 0.012
3.54ThrGlu: 3.54 ± 0.017
2.081ThrPhe: 2.081 ± 0.011
3.611ThrGly: 3.611 ± 0.018
1.36ThrHis: 1.36 ± 0.011
2.416ThrIle: 2.416 ± 0.013
2.683ThrLys: 2.683 ± 0.015
5.276ThrLeu: 5.276 ± 0.017
1.119ThrMet: 1.119 ± 0.009
1.775ThrAsn: 1.775 ± 0.011
3.554ThrPro: 3.554 ± 0.019
2.301ThrGln: 2.301 ± 0.012
2.456ThrArg: 2.456 ± 0.012
4.647ThrSer: 4.647 ± 0.024
3.057ThrThr: 3.057 ± 0.03
3.808ThrVal: 3.808 ± 0.017
0.705ThrTrp: 0.705 ± 0.007
1.423ThrTyr: 1.423 ± 0.01
0.002ThrXaa: 0.002 ± 0.0
Val
4.212ValAla: 4.212 ± 0.016
1.442ValCys: 1.442 ± 0.011
2.938ValAsp: 2.938 ± 0.013
3.837ValGlu: 3.837 ± 0.019
2.376ValPhe: 2.376 ± 0.012
3.374ValGly: 3.374 ± 0.015
1.573ValHis: 1.573 ± 0.011
2.927ValIle: 2.927 ± 0.014
3.412ValLys: 3.412 ± 0.015
6.084ValLeu: 6.084 ± 0.021
1.322ValMet: 1.322 ± 0.008
2.276ValAsn: 2.276 ± 0.014
3.547ValPro: 3.547 ± 0.017
2.72ValGln: 2.72 ± 0.014
2.999ValArg: 2.999 ± 0.015
4.843ValSer: 4.843 ± 0.017
3.706ValThr: 3.706 ± 0.016
3.889ValVal: 3.889 ± 0.019
0.724ValTrp: 0.724 ± 0.006
1.592ValTyr: 1.592 ± 0.011
0.002ValXaa: 0.002 ± 0.0
Trp
0.814TrpAla: 0.814 ± 0.008
0.248TrpCys: 0.248 ± 0.004
0.659TrpAsp: 0.659 ± 0.008
0.827TrpGlu: 0.827 ± 0.007
0.44TrpPhe: 0.44 ± 0.005
0.756TrpGly: 0.756 ± 0.007
0.318TrpHis: 0.318 ± 0.004
0.559TrpIle: 0.559 ± 0.007
0.828TrpLys: 0.828 ± 0.007
1.238TrpLeu: 1.238 ± 0.009
0.321TrpMet: 0.321 ± 0.004
0.551TrpAsn: 0.551 ± 0.006
0.548TrpPro: 0.548 ± 0.005
0.545TrpGln: 0.545 ± 0.006
0.734TrpArg: 0.734 ± 0.007
0.895TrpSer: 0.895 ± 0.009
0.677TrpThr: 0.677 ± 0.007
0.702TrpVal: 0.702 ± 0.007
0.205TrpTrp: 0.205 ± 0.003
0.339TrpTyr: 0.339 ± 0.005
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.374TyrAla: 1.374 ± 0.009
0.678TyrCys: 0.678 ± 0.006
1.292TyrAsp: 1.292 ± 0.011
1.74TyrGlu: 1.74 ± 0.011
1.205TyrPhe: 1.205 ± 0.01
1.64TyrGly: 1.64 ± 0.012
0.765TyrHis: 0.765 ± 0.007
1.391TyrIle: 1.391 ± 0.01
1.573TyrLys: 1.573 ± 0.021
2.639TyrLeu: 2.639 ± 0.013
0.605TyrMet: 0.605 ± 0.006
1.119TyrAsn: 1.119 ± 0.008
1.288TyrPro: 1.288 ± 0.01
1.282TyrGln: 1.282 ± 0.009
1.625TyrArg: 1.625 ± 0.012
2.185TyrSer: 2.185 ± 0.014
1.443TyrThr: 1.443 ± 0.011
1.555TyrVal: 1.555 ± 0.009
0.363TyrTrp: 0.363 ± 0.005
0.934TyrTyr: 0.934 ± 0.008
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.003XaaAla: 0.003 ± 0.0
0.001XaaCys: 0.001 ± 0.0
0.001XaaAsp: 0.001 ± 0.0
0.001XaaGlu: 0.001 ± 0.0
0.001XaaPhe: 0.001 ± 0.0
0.002XaaGly: 0.002 ± 0.0
0.002XaaHis: 0.002 ± 0.0
0.001XaaIle: 0.001 ± 0.0
0.002XaaLys: 0.002 ± 0.0
0.002XaaLeu: 0.002 ± 0.0
0.001XaaMet: 0.001 ± 0.0
0.001XaaAsn: 0.001 ± 0.0
0.003XaaPro: 0.003 ± 0.0
0.001XaaGln: 0.001 ± 0.0
0.002XaaArg: 0.002 ± 0.0
0.002XaaSer: 0.002 ± 0.0
0.002XaaThr: 0.002 ± 0.0
0.002XaaVal: 0.002 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
1.139XaaXaa: 1.139 ± 0.102
Statistics based on 40414 proteins (20721228 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski