Amino acid dipeptide frequency for Anncaliia algerae PRA339

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
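As an illustration of what such a calculation involves, here is a minimal Python sketch. It is not the code used to build this page: the function name dipeptide_permille is hypothetical, sequences are assumed to be given as one-letter codes, pairs are assumed to be counted in an overlapping fashion within each protein, and any non-standard letter is assumed to be mapped to X (Xaa).

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins."""
    alphabet = STANDARD + "X"
    # Start every one of the 21 x 21 = 441 pairs at zero so absent pairs are reported too.
    counts = Counter({a + b: 0 for a, b in product(alphabet, repeat=2)})
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping adjacent pairs; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {dp: 0.0 for dp in counts}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


freqs = dipeptide_permille(["AAK", "KA"])  # pairs counted: AA, AK, KA
print(round(freqs["AA"], 1))
```

With real data, the input would be the list of protein sequences of the proteome, and the one-letter keys (for example "AA") correspond to the three-letter names used in the table (AlaAla).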

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins randomly and with equal probability, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
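The expected value follows from simple arithmetic: 20 amino acids give 20 × 20 = 400 ordered pairs, so a uniform distribution assigns 1/400 to each of them, and 21 × 21 = 441 pairs when Xaa is included. A minimal Python sketch of that calculation and of the over/under labelling is shown below. The function names are hypothetical, and the plain greater-than/less-than comparison is an assumption, since the exact colouring rule used for the table is not stated here.

```python
def uniform_expectation_permille(n_amino_acids=20):
    """Per-mille frequency of each dipeptide if all ordered pairs were equally likely."""
    return 1000.0 / (n_amino_acids ** 2)


def representation(observed_permille, expected_permille):
    """Label a dipeptide relative to the expectation (assumed rule, see text above)."""
    if observed_permille > expected_permille:
        return "over"   # would correspond to red
    if observed_permille < expected_permille:
        return "under"  # would correspond to blue
    return "as expected"


expected = uniform_expectation_permille()   # 1000 / 400
print(representation(11.665, expected))     # LysLys value from the table
print(representation(0.037, expected))      # TrpTrp value from the table
```

Under this rule LysLys (11.665‰) is labelled as overrepresented and TrpTrp (0.037‰) as underrepresented.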

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
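To work with the values as plain fractions, each entry can be parsed and rescaled. The following Python sketch assumes entries of the form "AlaAla: 1.353 ± 0.092" as they appear in the table; the names ENTRY and parse_entry are hypothetical helpers, not part of any Proteome-pI tool.

```python
import re

# Two three-letter residue names, a colon, the value and its error (both in per mille).
ENTRY = re.compile(r"([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*([0-9.]+)\s*±\s*([0-9.]+)")


def parse_entry(line):
    """Return (first, second, fraction, fraction_error) for one table entry."""
    match = ENTRY.search(line)
    if match is None:
        raise ValueError(f"not a dipeptide entry: {line!r}")
    first, second, value, error = match.groups()
    # Per mille to fraction: multiply by 10^-3.
    return first, second, float(value) * 1e-3, float(error) * 1e-3


first, second, fraction, error = parse_entry("AlaAla: 1.353 ± 0.092")
print(first, second, f"{fraction:.6f}", f"{error:.6f}")
```

Because the pattern is searched for anywhere in the line, an entry that still carries a leading copy of the value in front of the dipeptide name is parsed the same way.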

For more information, see the sequence space article on Wikipedia.

Order of amino acids (rows: first residue; columns: second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 1.353 ± 0.092
AlaCys: 0.539 ± 0.024
AlaAsp: 1.334 ± 0.043
AlaGlu: 1.991 ± 0.053
AlaPhe: 1.839 ± 0.043
AlaGly: 1.114 ± 0.053
AlaHis: 0.56 ± 0.024
AlaIle: 2.79 ± 0.063
AlaLys: 2.773 ± 0.072
AlaLeu: 3.447 ± 0.07
AlaMet: 0.675 ± 0.029
AlaAsn: 1.925 ± 0.055
AlaPro: 0.777 ± 0.044
AlaGln: 0.923 ± 0.043
AlaArg: 0.983 ± 0.039
AlaSer: 2.13 ± 0.059
AlaThr: 1.382 ± 0.041
AlaVal: 1.564 ± 0.043
AlaTrp: 0.211 ± 0.014
AlaTyr: 1.371 ± 0.04
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.511 ± 0.023
CysCys: 0.386 ± 0.022
CysAsp: 0.973 ± 0.032
CysGlu: 1.025 ± 0.037
CysPhe: 1.222 ± 0.037
CysGly: 0.769 ± 0.034
CysHis: 0.402 ± 0.019
CysIle: 1.796 ± 0.046
CysLys: 1.916 ± 0.048
CysLeu: 1.829 ± 0.047
CysMet: 0.38 ± 0.021
CysAsn: 1.324 ± 0.041
CysPro: 0.442 ± 0.027
CysGln: 0.403 ± 0.017
CysArg: 0.691 ± 0.028
CysSer: 1.296 ± 0.044
CysThr: 0.826 ± 0.03
CysVal: 0.87 ± 0.033
CysTrp: 0.172 ± 0.014
CysTyr: 0.833 ± 0.032
CysXaa: 0.0 ± 0.0

Asp
AspAla: 1.563 ± 0.05
AspCys: 0.95 ± 0.037
AspAsp: 2.889 ± 0.065
AspGlu: 4.423 ± 0.077
AspPhe: 3.512 ± 0.055
AspGly: 1.715 ± 0.041
AspHis: 0.752 ± 0.03
AspIle: 4.651 ± 0.067
AspLys: 5.271 ± 0.07
AspLeu: 5.634 ± 0.082
AspMet: 0.915 ± 0.034
AspAsn: 3.472 ± 0.072
AspPro: 1.282 ± 0.039
AspGln: 1.118 ± 0.03
AspArg: 1.388 ± 0.044
AspSer: 3.765 ± 0.069
AspThr: 2.267 ± 0.051
AspVal: 2.308 ± 0.05
AspTrp: 0.304 ± 0.018
AspTyr: 2.442 ± 0.053
AspXaa: 0.0 ± 0.0

Glu
GluAla: 2.267 ± 0.063
GluCys: 1.273 ± 0.042
GluAsp: 3.796 ± 0.085
GluGlu: 8.219 ± 0.156
GluPhe: 4.136 ± 0.068
GluGly: 2.63 ± 0.065
GluHis: 1.124 ± 0.033
GluIle: 8.23 ± 0.117
GluLys: 8.13 ± 0.12
GluLeu: 6.599 ± 0.099
GluMet: 1.838 ± 0.051
GluAsn: 7.109 ± 0.107
GluPro: 1.229 ± 0.041
GluGln: 1.69 ± 0.055
GluArg: 2.847 ± 0.06
GluSer: 5.197 ± 0.098
GluThr: 3.28 ± 0.058
GluVal: 3.709 ± 0.057
GluTrp: 0.381 ± 0.022
GluTyr: 3.418 ± 0.063
GluXaa: 0.0 ± 0.0

Phe
PheAla: 1.781 ± 0.043
PheCys: 1.304 ± 0.04
PheAsp: 3.468 ± 0.056
PheGlu: 3.452 ± 0.07
PhePhe: 4.387 ± 0.1
PheGly: 2.032 ± 0.053
PheHis: 1.039 ± 0.034
PheIle: 6.767 ± 0.102
PheLys: 5.754 ± 0.093
PheLeu: 7.364 ± 0.121
PheMet: 1.383 ± 0.038
PheAsn: 4.879 ± 0.089
PhePro: 1.372 ± 0.041
PheGln: 1.179 ± 0.041
PheArg: 1.906 ± 0.047
PheSer: 4.578 ± 0.07
PheThr: 3.112 ± 0.072
PheVal: 3.437 ± 0.058
PheTrp: 0.338 ± 0.02
PheTyr: 3.818 ± 0.082
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 1.166 ± 0.044
GlyCys: 0.671 ± 0.029
GlyAsp: 1.531 ± 0.052
GlyGlu: 2.117 ± 0.059
GlyPhe: 1.898 ± 0.045
GlyGly: 1.772 ± 0.063
GlyHis: 0.735 ± 0.029
GlyIle: 3.676 ± 0.067
GlyLys: 3.397 ± 0.059
GlyLeu: 3.029 ± 0.061
GlyMet: 0.841 ± 0.035
GlyAsn: 2.35 ± 0.052
GlyPro: 0.756 ± 0.028
GlyGln: 0.667 ± 0.036
GlyArg: 1.496 ± 0.044
GlySer: 2.447 ± 0.056
GlyThr: 1.724 ± 0.044
GlyVal: 1.991 ± 0.053
GlyTrp: 0.247 ± 0.015
GlyTyr: 1.749 ± 0.048
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.483 ± 0.025
HisCys: 0.356 ± 0.019
HisAsp: 0.751 ± 0.029
HisGlu: 1.181 ± 0.032
HisPhe: 1.172 ± 0.039
HisGly: 0.678 ± 0.027
HisHis: 0.38 ± 0.026
HisIle: 1.35 ± 0.039
HisLys: 2.114 ± 0.052
HisLeu: 1.895 ± 0.045
HisMet: 0.312 ± 0.017
HisAsn: 1.241 ± 0.031
HisPro: 0.611 ± 0.027
HisGln: 0.476 ± 0.022
HisArg: 0.824 ± 0.032
HisSer: 1.552 ± 0.044
HisThr: 0.855 ± 0.027
HisVal: 0.761 ± 0.032
HisTrp: 0.098 ± 0.011
HisTyr: 0.736 ± 0.028
HisXaa: 0.0 ± 0.0

Ile
IleAla: 2.724 ± 0.057
IleCys: 1.928 ± 0.051
IleAsp: 4.912 ± 0.073
IleGlu: 6.596 ± 0.092
IlePhe: 6.787 ± 0.111
IleGly: 3.134 ± 0.066
IleHis: 1.801 ± 0.041
IleIle: 9.168 ± 0.129
IleLys: 11.128 ± 0.111
IleLeu: 10.436 ± 0.118
IleMet: 1.661 ± 0.046
IleAsn: 8.432 ± 0.122
IlePro: 2.874 ± 0.057
IleGln: 2.585 ± 0.053
IleArg: 3.088 ± 0.05
IleSer: 7.174 ± 0.091
IleThr: 4.052 ± 0.056
IleVal: 4.179 ± 0.07
IleTrp: 0.544 ± 0.023
IleTyr: 4.943 ± 0.084
IleXaa: 0.0 ± 0.0

Lys
LysAla: 2.768 ± 0.076
LysCys: 1.859 ± 0.048
LysAsp: 5.466 ± 0.095
LysGlu: 10.773 ± 0.155
LysPhe: 4.971 ± 0.075
LysGly: 3.3 ± 0.069
LysHis: 1.693 ± 0.042
LysIle: 11.049 ± 0.129
LysLys: 11.665 ± 0.163
LysLeu: 9.077 ± 0.102
LysMet: 2.45 ± 0.053
LysAsn: 9.802 ± 0.126
LysPro: 2.071 ± 0.059
LysGln: 2.543 ± 0.052
LysArg: 4.535 ± 0.069
LysSer: 6.886 ± 0.111
LysThr: 4.488 ± 0.077
LysVal: 4.682 ± 0.069
LysTrp: 0.46 ± 0.022
LysTyr: 5.399 ± 0.084
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 2.979 ± 0.065
LeuCys: 1.927 ± 0.049
LeuAsp: 4.456 ± 0.076
LeuGlu: 7.238 ± 0.093
LeuPhe: 6.563 ± 0.113
LeuGly: 3.452 ± 0.062
LeuHis: 1.768 ± 0.041
LeuIle: 10.109 ± 0.121
LeuLys: 11.022 ± 0.13
LeuLeu: 10.583 ± 0.123
LeuMet: 2.18 ± 0.04
LeuAsn: 8.371 ± 0.111
LeuPro: 2.598 ± 0.055
LeuGln: 2.918 ± 0.06
LeuArg: 3.683 ± 0.066
LeuSer: 7.304 ± 0.085
LeuThr: 4.493 ± 0.078
LeuVal: 4.533 ± 0.067
LeuTrp: 0.513 ± 0.02
LeuTyr: 4.231 ± 0.078
LeuXaa: 0.0 ± 0.0

Met
MetAla: 0.669 ± 0.028
MetCys: 0.329 ± 0.018
MetAsp: 1.021 ± 0.034
MetGlu: 1.426 ± 0.043
MetPhe: 1.317 ± 0.038
MetGly: 0.669 ± 0.026
MetHis: 0.518 ± 0.024
MetIle: 2.105 ± 0.049
MetLys: 2.113 ± 0.048
MetLeu: 2.122 ± 0.046
MetMet: 0.383 ± 0.019
MetAsn: 1.953 ± 0.046
MetPro: 0.531 ± 0.024
MetGln: 0.72 ± 0.026
MetArg: 0.722 ± 0.028
MetSer: 1.382 ± 0.041
MetThr: 0.872 ± 0.031
MetVal: 1.035 ± 0.037
MetTrp: 0.109 ± 0.01
MetTyr: 0.778 ± 0.029
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.222 ± 0.069
AsnCys: 1.187 ± 0.034
AsnAsp: 4.684 ± 0.083
AsnGlu: 6.948 ± 0.122
AsnPhe: 4.838 ± 0.081
AsnGly: 2.347 ± 0.056
AsnHis: 1.553 ± 0.045
AsnIle: 8.226 ± 0.119
AsnLys: 9.175 ± 0.113
AsnLeu: 8.238 ± 0.107
AsnMet: 1.596 ± 0.044
AsnAsn: 6.833 ± 0.169
AsnPro: 1.929 ± 0.05
AsnGln: 2.227 ± 0.052
AsnArg: 2.359 ± 0.05
AsnSer: 5.521 ± 0.088
AsnThr: 3.838 ± 0.068
AsnVal: 3.488 ± 0.066
AsnTrp: 0.412 ± 0.02
AsnTyr: 4.048 ± 0.085
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 0.715 ± 0.033
ProCys: 0.421 ± 0.021
ProAsp: 1.097 ± 0.035
ProGlu: 1.903 ± 0.055
ProPhe: 1.59 ± 0.043
ProGly: 1.013 ± 0.036
ProHis: 0.477 ± 0.024
ProIle: 2.341 ± 0.05
ProLys: 2.171 ± 0.054
ProLeu: 2.379 ± 0.056
ProMet: 0.539 ± 0.025
ProAsn: 1.779 ± 0.044
ProPro: 0.773 ± 0.059
ProGln: 0.868 ± 0.035
ProArg: 0.699 ± 0.028
ProSer: 1.942 ± 0.055
ProThr: 1.19 ± 0.046
ProVal: 1.207 ± 0.04
ProTrp: 0.142 ± 0.012
ProTyr: 1.011 ± 0.038
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 0.862 ± 0.04
GlnCys: 0.397 ± 0.018
GlnAsp: 1.077 ± 0.037
GlnGlu: 2.098 ± 0.054
GlnPhe: 1.241 ± 0.037
GlnGly: 0.834 ± 0.037
GlnHis: 0.398 ± 0.022
GlnIle: 2.549 ± 0.053
GlnLys: 2.731 ± 0.064
GlnLeu: 2.329 ± 0.056
GlnMet: 0.707 ± 0.028
GlnAsn: 2.369 ± 0.055
GlnPro: 0.748 ± 0.036
GlnGln: 0.844 ± 0.048
GlnArg: 1.261 ± 0.042
GlnSer: 1.977 ± 0.043
GlnThr: 1.269 ± 0.036
GlnVal: 1.109 ± 0.039
GlnTrp: 0.177 ± 0.012
GlnTyr: 0.884 ± 0.035
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 1.108 ± 0.038
ArgCys: 0.66 ± 0.027
ArgAsp: 1.473 ± 0.046
ArgGlu: 2.638 ± 0.054
ArgPhe: 1.767 ± 0.045
ArgGly: 1.384 ± 0.037
ArgHis: 0.588 ± 0.025
ArgIle: 3.796 ± 0.068
ArgLys: 4.328 ± 0.075
ArgLeu: 2.737 ± 0.064
ArgMet: 0.888 ± 0.034
ArgAsn: 3.235 ± 0.068
ArgPro: 0.664 ± 0.028
ArgGln: 0.76 ± 0.03
ArgArg: 1.781 ± 0.052
ArgSer: 2.284 ± 0.049
ArgThr: 1.533 ± 0.042
ArgVal: 1.576 ± 0.04
ArgTrp: 0.21 ± 0.015
ArgTyr: 1.39 ± 0.042
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 2.154 ± 0.056
SerCys: 1.227 ± 0.034
SerAsp: 3.841 ± 0.077
SerGlu: 4.903 ± 0.09
SerPhe: 5.504 ± 0.09
SerGly: 2.397 ± 0.057
SerHis: 1.348 ± 0.034
SerIle: 6.724 ± 0.081
SerLys: 7.235 ± 0.123
SerLeu: 7.981 ± 0.094
SerMet: 1.348 ± 0.039
SerAsn: 5.196 ± 0.078
SerPro: 1.631 ± 0.051
SerGln: 1.948 ± 0.045
SerArg: 2.051 ± 0.043
SerSer: 6.053 ± 0.169
SerThr: 3.543 ± 0.08
SerVal: 3.419 ± 0.066
SerTrp: 0.341 ± 0.019
SerTyr: 3.334 ± 0.061
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 1.296 ± 0.043
ThrCys: 0.735 ± 0.032
ThrAsp: 2.597 ± 0.05
ThrGlu: 3.111 ± 0.064
ThrPhe: 2.887 ± 0.058
ThrGly: 1.732 ± 0.048
ThrHis: 0.923 ± 0.034
ThrIle: 4.238 ± 0.076
ThrLys: 4.607 ± 0.068
ThrLeu: 4.539 ± 0.074
ThrMet: 0.858 ± 0.03
ThrAsn: 3.688 ± 0.057
ThrPro: 1.406 ± 0.054
ThrGln: 1.448 ± 0.044
ThrArg: 1.408 ± 0.037
ThrSer: 3.238 ± 0.068
ThrThr: 2.277 ± 0.069
ThrVal: 2.192 ± 0.047
ThrTrp: 0.25 ± 0.016
ThrTyr: 1.82 ± 0.042
ThrXaa: 0.0 ± 0.0

Val
ValAla: 1.467 ± 0.038
ValCys: 0.92 ± 0.035
ValAsp: 2.585 ± 0.056
ValGlu: 3.511 ± 0.059
ValPhe: 3.099 ± 0.06
ValGly: 1.645 ± 0.05
ValHis: 0.798 ± 0.029
ValIle: 4.38 ± 0.068
ValLys: 4.736 ± 0.083
ValLeu: 4.737 ± 0.066
ValMet: 0.841 ± 0.027
ValAsn: 3.726 ± 0.065
ValPro: 1.321 ± 0.039
ValGln: 1.207 ± 0.03
ValArg: 1.445 ± 0.046
ValSer: 3.328 ± 0.061
ValThr: 2.082 ± 0.05
ValVal: 2.474 ± 0.059
ValTrp: 0.308 ± 0.019
ValTyr: 2.229 ± 0.049
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.19 ± 0.014
TrpCys: 0.133 ± 0.012
TrpAsp: 0.264 ± 0.016
TrpGlu: 0.264 ± 0.017
TrpPhe: 0.323 ± 0.019
TrpGly: 0.181 ± 0.013
TrpHis: 0.102 ± 0.011
TrpIle: 0.519 ± 0.025
TrpLys: 0.525 ± 0.025
TrpLeu: 0.496 ± 0.023
TrpMet: 0.147 ± 0.013
TrpAsn: 0.38 ± 0.02
TrpPro: 0.109 ± 0.011
TrpGln: 0.135 ± 0.012
TrpArg: 0.336 ± 0.018
TrpSer: 0.524 ± 0.025
TrpThr: 0.247 ± 0.018
TrpVal: 0.336 ± 0.02
TrpTrp: 0.037 ± 0.007
TrpTyr: 0.256 ± 0.016
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 1.405 ± 0.043
TyrCys: 0.861 ± 0.03
TyrAsp: 2.407 ± 0.06
TyrGlu: 3.283 ± 0.071
TyrPhe: 4.297 ± 0.076
TyrGly: 1.464 ± 0.052
TyrHis: 0.868 ± 0.031
TyrIle: 3.682 ± 0.067
TyrLys: 5.144 ± 0.09
TyrLeu: 5.469 ± 0.095
TyrMet: 0.834 ± 0.03
TyrAsn: 3.605 ± 0.065
TyrPro: 1.132 ± 0.038
TyrGln: 1.204 ± 0.039
TyrArg: 1.294 ± 0.033
TyrSer: 3.601 ± 0.068
TyrThr: 1.92 ± 0.051
TyrVal: 1.969 ± 0.049
TyrTrp: 0.252 ± 0.017
TyrTyr: 2.274 ± 0.058
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 3576 proteins (953739 amino acids)

Note: The errors were estimated by bootstrapping (100 replicates) at the protein level.
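Bootstrapping at the protein level means that whole proteins, not individual residues or dipeptides, are resampled with replacement. The Python sketch below illustrates the idea under stated assumptions: the exact procedure used for this page is not described beyond the note, so taking the standard deviation over replicates as the error, the overlapping pair counting, and the names dipeptide_fraction and bootstrap_error are all hypothetical choices.

```python
import random
import statistics


def dipeptide_fraction(proteins, dipeptide):
    """Fraction of overlapping adjacent pairs equal to `dipeptide` (one-letter codes)."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Standard deviation (in per mille) of the frequency over bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size fixed.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(1000.0 * dipeptide_fraction(sample, dipeptide))
    return statistics.stdev(estimates)


toy_proteome = ["AKAK", "KKKK", "AAAA", "AKKA"]
print(bootstrap_error(toy_proteome, "AK"))
```

Resampling whole proteins keeps the dependence between neighbouring residues within a protein intact, which resampling individual dipeptides would not.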

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in UniProt
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski