Amino acid dipeptide frequency for Mesocricetus auratus (Golden hamster)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
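A minimal sketch of how such frequencies could be computed from a set of protein sequences. It assumes overlapping windows of two residues counted within each protein and pooled over the whole proteome, with any non-standard one-letter code mapped to X (Xaa). The function name dipeptide_per_mille and the toy sequences are illustrative and are not taken from Proteome-pI.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} pooled over all sequences."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard residues becomes 'X' (Xaa).
        residues = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        # Overlapping pairs of neighbours; a pair never spans two proteins.
        counts.update(a + b for a, b in zip(residues, residues[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input: two short made-up sequences, the second ending in a non-standard residue.
freqs = dipeptide_per_mille(["MALLA", "AAU"])
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

Pooling the counts before normalising means long proteins contribute more pairs than short ones; averaging per-protein frequencies instead would be a different statistic.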

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each one would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
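The uniform expectation and the over/under labelling can be sketched as follows. This assumes the threshold is simply 1/400 of all dipeptides (2.5‰); the exact rule used for the colouring on the page is not stated, and the function names are illustrative.

```python
def expected_uniform_per_mille(alphabet_size=20):
    """Per-mille frequency of each dipeptide if all pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, alphabet_size=20):
    """Label a dipeptide relative to the uniform expectation (assumed threshold)."""
    expected = expected_uniform_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"    # shown in red in the original table
    if observed_per_mille < expected:
        return "under"   # shown in blue in the original table
    return "as expected"


# Three values taken from the table in this document.
for name, value in [("LeuLeu", 10.808), ("AlaAsp", 2.943), ("TrpTrp", 0.175)]:
    print(name, value, classify(value))
```

A uniform expectation ignores the fact that single amino acids are not equally common; comparing each pair with the product of its two single-residue frequencies would be a stricter baseline, but that is outside what the text above describes.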

All values are given in per mille (‰); multiply by 10^-3 to obtain fractions.
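As a worked conversion, using the AlaAla value from the table:

```latex
6.862\,\text{\textperthousand} \;=\; 6.862 \times 10^{-3} \;=\; 0.006862 \;=\; 0.6862\,\%
```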

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.862 ± 0.035
AlaCys: 1.392 ± 0.011
AlaAsp: 2.943 ± 0.012
AlaGlu: 4.916 ± 0.028
AlaPhe: 2.574 ± 0.014
AlaGly: 4.646 ± 0.025
AlaHis: 1.571 ± 0.01
AlaIle: 2.729 ± 0.015
AlaLys: 3.332 ± 0.016
AlaLeu: 7.179 ± 0.032
AlaMet: 1.524 ± 0.01
AlaAsn: 1.996 ± 0.012
AlaPro: 4.316 ± 0.028
AlaGln: 3.419 ± 0.022
AlaArg: 3.743 ± 0.019
AlaSer: 6.163 ± 0.023
AlaThr: 3.753 ± 0.018
AlaVal: 4.673 ± 0.016
AlaTrp: 0.776 ± 0.007
AlaTyr: 1.49 ± 0.011
AlaXaa: 0.003 ± 0.0
Cys
CysAla: 1.243 ± 0.01
CysCys: 0.603 ± 0.01
CysAsp: 1.028 ± 0.009
CysGlu: 1.279 ± 0.013
CysPhe: 0.813 ± 0.006
CysGly: 1.652 ± 0.017
CysHis: 0.648 ± 0.006
CysIle: 0.906 ± 0.008
CysLys: 1.106 ± 0.01
CysLeu: 2.121 ± 0.014
CysMet: 0.415 ± 0.004
CysAsn: 0.741 ± 0.008
CysPro: 1.37 ± 0.016
CysGln: 1.056 ± 0.009
CysArg: 1.255 ± 0.01
CysSer: 2.028 ± 0.015
CysThr: 1.094 ± 0.01
CysVal: 1.307 ± 0.011
CysTrp: 0.275 ± 0.004
CysTyr: 0.555 ± 0.005
CysXaa: 0.002 ± 0.0
Asp
AspAla: 2.923 ± 0.017
AspCys: 1.031 ± 0.01
AspAsp: 2.644 ± 0.018
AspGlu: 3.412 ± 0.017
AspPhe: 2.045 ± 0.013
AspGly: 3.308 ± 0.021
AspHis: 1.16 ± 0.008
AspIle: 2.487 ± 0.015
AspLys: 2.481 ± 0.013
AspLeu: 4.995 ± 0.021
AspMet: 1.135 ± 0.008
AspAsn: 1.619 ± 0.011
AspPro: 2.972 ± 0.016
AspGln: 1.87 ± 0.01
AspArg: 2.533 ± 0.014
AspSer: 4.401 ± 0.018
AspThr: 2.585 ± 0.014
AspVal: 3.029 ± 0.015
AspTrp: 0.601 ± 0.006
AspTyr: 1.397 ± 0.01
AspXaa: 0.002 ± 0.0
Glu
GluAla: 5.347 ± 0.028
GluCys: 1.357 ± 0.016
GluAsp: 4.518 ± 0.019
GluGlu: 7.951 ± 0.045
GluPhe: 1.935 ± 0.011
GluGly: 4.176 ± 0.018
GluHis: 1.538 ± 0.009
GluIle: 2.907 ± 0.016
GluLys: 5.251 ± 0.029
GluLeu: 6.52 ± 0.035
GluMet: 1.645 ± 0.011
GluAsn: 2.94 ± 0.017
GluPro: 3.439 ± 0.023
GluGln: 3.269 ± 0.019
GluArg: 4.125 ± 0.024
GluSer: 4.563 ± 0.019
GluThr: 3.479 ± 0.016
GluVal: 4.147 ± 0.02
GluTrp: 0.665 ± 0.006
GluTyr: 1.496 ± 0.011
GluXaa: 0.003 ± 0.0
Phe
PheAla: 1.922 ± 0.012
PheCys: 0.924 ± 0.008
PheAsp: 1.604 ± 0.011
PheGlu: 1.897 ± 0.011
PhePhe: 1.535 ± 0.012
PheGly: 2.099 ± 0.015
PheHis: 0.995 ± 0.008
PheIle: 1.752 ± 0.012
PheLys: 1.626 ± 0.011
PheLeu: 3.987 ± 0.021
PheMet: 0.753 ± 0.008
PheAsn: 1.245 ± 0.009
PhePro: 2.02 ± 0.013
PheGln: 1.728 ± 0.011
PheArg: 1.902 ± 0.013
PheSer: 3.341 ± 0.015
PheThr: 1.936 ± 0.013
PheVal: 2.044 ± 0.012
PheTrp: 0.465 ± 0.006
PheTyr: 1.148 ± 0.009
PheXaa: 0.002 ± 0.0
Gly
GlyAla: 4.437 ± 0.025
GlyCys: 1.246 ± 0.012
GlyAsp: 3.187 ± 0.02
GlyGlu: 4.078 ± 0.025
GlyPhe: 2.312 ± 0.016
GlyGly: 4.888 ± 0.035
GlyHis: 1.724 ± 0.012
GlyIle: 2.584 ± 0.014
GlyLys: 3.622 ± 0.021
GlyLeu: 5.894 ± 0.026
GlyMet: 1.292 ± 0.009
GlyAsn: 2.303 ± 0.012
GlyPro: 4.461 ± 0.048
GlyGln: 2.817 ± 0.017
GlyArg: 3.745 ± 0.02
GlySer: 6.154 ± 0.023
GlyThr: 3.673 ± 0.015
GlyVal: 3.518 ± 0.019
GlyTrp: 0.743 ± 0.008
GlyTyr: 1.649 ± 0.012
GlyXaa: 0.003 ± 0.0
His
HisAla: 1.364 ± 0.009
HisCys: 0.715 ± 0.007
HisAsp: 0.897 ± 0.007
HisGlu: 1.326 ± 0.009
HisPhe: 1.064 ± 0.008
HisGly: 1.57 ± 0.011
HisHis: 0.942 ± 0.01
HisIle: 1.217 ± 0.008
HisLys: 1.282 ± 0.01
HisLeu: 2.933 ± 0.016
HisMet: 0.602 ± 0.006
HisAsn: 0.846 ± 0.008
HisPro: 1.726 ± 0.011
HisGln: 1.371 ± 0.013
HisArg: 1.668 ± 0.009
HisSer: 2.442 ± 0.014
HisThr: 1.565 ± 0.013
HisVal: 1.501 ± 0.009
HisTrp: 0.331 ± 0.004
HisTyr: 0.781 ± 0.007
HisXaa: 0.003 ± 0.0
Ile
IleAla: 2.456 ± 0.013
IleCys: 1.017 ± 0.008
IleAsp: 1.897 ± 0.013
IleGlu: 2.439 ± 0.014
IlePhe: 1.754 ± 0.013
IleGly: 2.086 ± 0.011
IleHis: 1.279 ± 0.011
IleIle: 2.231 ± 0.014
IleLys: 2.403 ± 0.014
IleLeu: 4.373 ± 0.022
IleMet: 0.952 ± 0.007
IleAsn: 1.679 ± 0.011
IlePro: 2.614 ± 0.013
IleGln: 2.19 ± 0.012
IleArg: 2.297 ± 0.012
IleSer: 3.549 ± 0.016
IleThr: 2.431 ± 0.015
IleVal: 2.401 ± 0.017
IleTrp: 0.466 ± 0.006
IleTyr: 1.295 ± 0.01
IleXaa: 0.002 ± 0.0
Lys
LysAla: 4.027 ± 0.017
LysCys: 1.082 ± 0.011
LysAsp: 3.113 ± 0.018
LysGlu: 4.948 ± 0.027
LysPhe: 1.608 ± 0.011
LysGly: 3.23 ± 0.024
LysHis: 1.374 ± 0.011
LysIle: 2.488 ± 0.014
LysLys: 4.435 ± 0.028
LysLeu: 5.003 ± 0.021
LysMet: 1.418 ± 0.011
LysAsn: 2.191 ± 0.013
LysPro: 3.155 ± 0.02
LysGln: 2.597 ± 0.017
LysArg: 3.292 ± 0.017
LysSer: 3.952 ± 0.02
LysThr: 3.04 ± 0.017
LysVal: 3.349 ± 0.018
LysTrp: 0.546 ± 0.007
LysTyr: 1.445 ± 0.01
LysXaa: 0.004 ± 0.001
Leu
LeuAla: 6.836 ± 0.029
LeuCys: 2.162 ± 0.012
LeuAsp: 4.767 ± 0.018
LeuGlu: 7.21 ± 0.033
LeuPhe: 3.248 ± 0.019
LeuGly: 5.868 ± 0.023
LeuHis: 2.778 ± 0.016
LeuIle: 3.672 ± 0.019
LeuLys: 5.664 ± 0.024
LeuLeu: 10.808 ± 0.046
LeuMet: 1.976 ± 0.011
LeuAsn: 3.375 ± 0.016
LeuPro: 6.179 ± 0.024
LeuGln: 6.032 ± 0.034
LeuArg: 6.126 ± 0.025
LeuSer: 8.366 ± 0.032
LeuThr: 5.088 ± 0.019
LeuVal: 5.439 ± 0.021
LeuTrp: 1.101 ± 0.008
LeuTyr: 2.431 ± 0.012
LeuXaa: 0.006 ± 0.001
Met
MetAla: 1.904 ± 0.009
MetCys: 0.377 ± 0.005
MetAsp: 1.276 ± 0.009
MetGlu: 1.873 ± 0.012
MetPhe: 0.698 ± 0.007
MetGly: 1.279 ± 0.009
MetHis: 0.475 ± 0.005
MetIle: 0.79 ± 0.007
MetLys: 1.399 ± 0.01
MetLeu: 1.978 ± 0.012
MetMet: 0.557 ± 0.006
MetAsn: 0.873 ± 0.007
MetPro: 1.137 ± 0.01
MetGln: 0.923 ± 0.009
MetArg: 1.045 ± 0.006
MetSer: 1.636 ± 0.01
MetThr: 1.142 ± 0.009
MetVal: 1.392 ± 0.01
MetTrp: 0.227 ± 0.003
MetTyr: 0.573 ± 0.005
MetXaa: 0.001 ± 0.0
Asn
AsnAla: 2.003 ± 0.014
AsnCys: 0.788 ± 0.008
AsnAsp: 1.457 ± 0.01
AsnGlu: 2.069 ± 0.013
AsnPhe: 1.375 ± 0.01
AsnGly: 2.408 ± 0.016
AsnHis: 0.937 ± 0.008
AsnIle: 1.926 ± 0.013
AsnLys: 2.041 ± 0.013
AsnLeu: 3.548 ± 0.017
AsnMet: 0.853 ± 0.006
AsnAsn: 1.384 ± 0.012
AsnPro: 2.155 ± 0.011
AsnGln: 1.611 ± 0.011
AsnArg: 1.802 ± 0.011
AsnSer: 3.107 ± 0.016
AsnThr: 1.943 ± 0.012
AsnVal: 2.071 ± 0.012
AsnTrp: 0.428 ± 0.005
AsnTyr: 1.021 ± 0.008
AsnXaa: 0.002 ± 0.0
Pro
ProAla: 5.158 ± 0.03
ProCys: 1.182 ± 0.013
ProAsp: 2.789 ± 0.013
ProGlu: 4.605 ± 0.022
ProPhe: 1.908 ± 0.012
ProGly: 5.534 ± 0.062
ProHis: 1.539 ± 0.011
ProIle: 1.845 ± 0.013
ProLys: 2.832 ± 0.016
ProLeu: 5.507 ± 0.026
ProMet: 1.117 ± 0.009
ProAsn: 1.798 ± 0.012
ProPro: 6.667 ± 0.056
ProGln: 3.053 ± 0.019
ProArg: 3.497 ± 0.02
ProSer: 6.259 ± 0.029
ProThr: 3.301 ± 0.02
ProVal: 4.022 ± 0.018
ProTrp: 0.671 ± 0.008
ProTyr: 1.496 ± 0.013
ProXaa: 0.004 ± 0.001
Gln
GlnAla: 3.699 ± 0.022
GlnCys: 0.964 ± 0.011
GlnAsp: 2.431 ± 0.015
GlnGlu: 4.031 ± 0.024
GlnPhe: 1.308 ± 0.009
GlnGly: 2.951 ± 0.017
GlnHis: 1.358 ± 0.01
GlnIle: 1.893 ± 0.011
GlnLys: 2.925 ± 0.016
GlnLeu: 4.859 ± 0.028
GlnMet: 1.136 ± 0.009
GlnAsn: 1.808 ± 0.011
GlnPro: 3.032 ± 0.022
GlnGln: 3.406 ± 0.035
GlnArg: 3.13 ± 0.021
GlnSer: 3.336 ± 0.019
GlnThr: 2.402 ± 0.015
GlnVal: 2.878 ± 0.015
GlnTrp: 0.525 ± 0.005
GlnTyr: 1.121 ± 0.009
GlnXaa: 0.002 ± 0.0
Arg
ArgAla: 3.946 ± 0.019
ArgCys: 1.189 ± 0.012
ArgAsp: 2.817 ± 0.014
ArgGlu: 4.069 ± 0.022
ArgPhe: 1.817 ± 0.01
ArgGly: 3.601 ± 0.024
ArgHis: 1.601 ± 0.01
ArgIle: 2.328 ± 0.013
ArgLys: 3.711 ± 0.017
ArgLeu: 5.551 ± 0.026
ArgMet: 1.192 ± 0.009
ArgAsn: 2.099 ± 0.011
ArgPro: 3.346 ± 0.021
ArgGln: 2.77 ± 0.019
ArgArg: 4.502 ± 0.022
ArgSer: 4.573 ± 0.025
ArgThr: 2.906 ± 0.014
ArgVal: 3.25 ± 0.014
ArgTrp: 0.665 ± 0.006
ArgTyr: 1.385 ± 0.009
ArgXaa: 0.003 ± 0.0
Ser
SerAla: 5.615 ± 0.021
SerCys: 1.866 ± 0.015
SerAsp: 4.019 ± 0.019
SerGlu: 5.369 ± 0.021
SerPhe: 3.007 ± 0.015
SerGly: 5.833 ± 0.023
SerHis: 2.26 ± 0.014
SerIle: 3.185 ± 0.016
SerLys: 4.173 ± 0.02
SerLeu: 8.594 ± 0.029
SerMet: 1.686 ± 0.01
SerAsn: 2.65 ± 0.013
SerPro: 6.687 ± 0.04
SerGln: 4.183 ± 0.02
SerArg: 4.824 ± 0.022
SerSer: 10.161 ± 0.05
SerThr: 4.676 ± 0.021
SerVal: 5.031 ± 0.02
SerTrp: 1.051 ± 0.009
SerTyr: 2.091 ± 0.012
SerXaa: 0.004 ± 0.001
Thr
ThrAla: 3.884 ± 0.017
ThrCys: 1.266 ± 0.012
ThrAsp: 2.46 ± 0.014
ThrGlu: 3.635 ± 0.017
ThrPhe: 2.014 ± 0.012
ThrGly: 3.605 ± 0.02
ThrHis: 1.362 ± 0.011
ThrIle: 2.292 ± 0.015
ThrLys: 2.606 ± 0.015
ThrLeu: 5.395 ± 0.019
ThrMet: 1.137 ± 0.008
ThrAsn: 1.675 ± 0.011
ThrPro: 3.837 ± 0.021
ThrGln: 2.43 ± 0.014
ThrArg: 2.525 ± 0.013
ThrSer: 4.841 ± 0.024
ThrThr: 3.055 ± 0.022
ThrVal: 3.928 ± 0.021
ThrTrp: 0.658 ± 0.006
ThrTyr: 1.389 ± 0.008
ThrXaa: 0.003 ± 0.0
Val
ValAla: 4.264 ± 0.017
ValCys: 1.442 ± 0.011
ValAsp: 2.936 ± 0.016
ValGlu: 3.791 ± 0.017
ValPhe: 2.34 ± 0.014
ValGly: 3.304 ± 0.015
ValHis: 1.576 ± 0.008
ValIle: 2.778 ± 0.015
ValLys: 3.278 ± 0.018
ValLeu: 6.148 ± 0.025
ValMet: 1.288 ± 0.009
ValAsn: 2.19 ± 0.013
ValPro: 3.881 ± 0.02
ValGln: 2.73 ± 0.013
ValArg: 3.058 ± 0.015
ValSer: 5.05 ± 0.022
ValThr: 3.85 ± 0.024
ValVal: 3.954 ± 0.02
ValTrp: 0.692 ± 0.006
ValTyr: 1.565 ± 0.009
ValXaa: 0.003 ± 0.0
Trp
TrpAla: 0.761 ± 0.007
TrpCys: 0.236 ± 0.004
TrpAsp: 0.621 ± 0.007
TrpGlu: 0.781 ± 0.007
TrpPhe: 0.425 ± 0.005
TrpGly: 0.689 ± 0.008
TrpHis: 0.289 ± 0.004
TrpIle: 0.494 ± 0.007
TrpLys: 0.755 ± 0.007
TrpLeu: 1.161 ± 0.009
TrpMet: 0.289 ± 0.004
TrpAsn: 0.503 ± 0.005
TrpPro: 0.521 ± 0.007
TrpGln: 0.52 ± 0.005
TrpArg: 0.706 ± 0.007
TrpSer: 0.86 ± 0.008
TrpThr: 0.634 ± 0.007
TrpVal: 0.666 ± 0.006
TrpTrp: 0.175 ± 0.003
TrpTyr: 0.319 ± 0.004
TrpXaa: 0.001 ± 0.0
Tyr
TyrAla: 1.347 ± 0.01
TyrCys: 0.656 ± 0.006
TyrAsp: 1.222 ± 0.008
TyrGlu: 1.587 ± 0.011
TyrPhe: 1.143 ± 0.009
TyrGly: 1.602 ± 0.012
TyrHis: 0.737 ± 0.006
TyrIle: 1.302 ± 0.01
TyrLys: 1.369 ± 0.012
TyrLeu: 2.575 ± 0.014
TyrMet: 0.588 ± 0.005
TyrAsn: 1.013 ± 0.009
TyrPro: 1.265 ± 0.01
TyrGln: 1.216 ± 0.009
TyrArg: 1.544 ± 0.009
TyrSer: 2.18 ± 0.012
TyrThr: 1.442 ± 0.01
TyrVal: 1.516 ± 0.01
TyrTrp: 0.344 ± 0.005
TyrTyr: 0.888 ± 0.008
TyrXaa: 0.002 ± 0.0
Xaa
XaaAla: 0.004 ± 0.001
XaaCys: 0.002 ± 0.0
XaaAsp: 0.002 ± 0.0
XaaGlu: 0.003 ± 0.0
XaaPhe: 0.003 ± 0.0
XaaGly: 0.004 ± 0.001
XaaHis: 0.002 ± 0.0
XaaIle: 0.003 ± 0.0
XaaLys: 0.003 ± 0.001
XaaLeu: 0.005 ± 0.001
XaaMet: 0.001 ± 0.0
XaaAsn: 0.002 ± 0.0
XaaPro: 0.004 ± 0.0
XaaGln: 0.003 ± 0.0
XaaArg: 0.003 ± 0.0
XaaSer: 0.004 ± 0.001
XaaThr: 0.003 ± 0.0
XaaVal: 0.003 ± 0.001
XaaTrp: 0.001 ± 0.0
XaaTyr: 0.001 ± 0.0
XaaXaa: 0.094 ± 0.014
Statistics based on 31698 proteins (20061817 amino acids)
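As a rough consistency check, the per-mille values can be turned back into approximate counts. This assumes that dipeptides are counted as overlapping pairs within each protein, so that a protein of length n contributes n - 1 pairs; the page does not state the counting scheme, so the numbers below are estimates only.

```latex
\begin{align*}
N_{\text{dipeptides}} &\approx 20\,061\,817 - 31\,698 = 20\,030\,119 \\
n_{\mathrm{AlaAla}} &\approx 6.862 \times 10^{-3} \times 20\,030\,119 \approx 137\,447
\end{align*}
```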

Note: The error has been estimated by bootstrapping (x100) at the protein level.
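A minimal sketch of what a protein-level bootstrap could look like: whole proteins are resampled with replacement 100 times, the per-mille frequency is recomputed for each resample, and the spread of those estimates is reported. Treating the ± value as a standard deviation is an assumption, as are the function name bootstrap_per_mille, the fixed seed, and the toy sequences; the actual Proteome-pI procedure may differ in detail.

```python
import random
import statistics
from collections import Counter


def bootstrap_per_mille(sequences, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide's per-mille frequency
    over n_boot resamples of whole proteins drawn with replacement."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    # Pre-compute, per protein, (occurrences of `pair`, total overlapping pairs).
    per_protein = []
    for seq in sequences:
        counts = Counter(a + b for a, b in zip(seq, seq[1:]))
        per_protein.append((counts[pair], sum(counts.values())))
    estimates = []
    for _ in range(n_boot):
        # Resample proteins, not individual residues or dipeptides.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy proteome of made-up sequences.
toy = ["MALLAALL", "MKKLLS", "MAAAGG", "MSSLLK"]
mean_ll, sd_ll = bootstrap_per_mille(toy, "LL")
print(f"LeuLeu: {mean_ll:.3f} ± {sd_ll:.3f} per mille")
```

Resampling at the protein level keeps the dipeptides of one protein together, so the error estimate reflects variation between proteins rather than treating every dipeptide as an independent observation.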

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski