Amino acid dipepetide frequency for Lentibacillus persicus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.608AlaAla: 6.608 ± 0.108
0.547AlaCys: 0.547 ± 0.022
4.159AlaAsp: 4.159 ± 0.082
5.467AlaGlu: 5.467 ± 0.081
3.44AlaPhe: 3.44 ± 0.067
6.147AlaGly: 6.147 ± 0.08
1.42AlaHis: 1.42 ± 0.046
5.796AlaIle: 5.796 ± 0.099
4.336AlaLys: 4.336 ± 0.067
7.113AlaLeu: 7.113 ± 0.095
2.169AlaMet: 2.169 ± 0.046
2.954AlaAsn: 2.954 ± 0.062
2.258AlaPro: 2.258 ± 0.048
2.332AlaGln: 2.332 ± 0.048
2.788AlaArg: 2.788 ± 0.062
4.549AlaSer: 4.549 ± 0.076
3.572AlaThr: 3.572 ± 0.064
6.067AlaVal: 6.067 ± 0.088
0.652AlaTrp: 0.652 ± 0.029
2.504AlaTyr: 2.504 ± 0.055
0.0AlaXaa: 0.0 ± 0.0
Cys
0.373CysAla: 0.373 ± 0.023
0.067CysCys: 0.067 ± 0.009
0.381CysAsp: 0.381 ± 0.021
0.357CysGlu: 0.357 ± 0.019
0.258CysPhe: 0.258 ± 0.016
0.597CysGly: 0.597 ± 0.025
0.179CysHis: 0.179 ± 0.015
0.429CysIle: 0.429 ± 0.022
0.261CysLys: 0.261 ± 0.016
0.493CysLeu: 0.493 ± 0.022
0.16CysMet: 0.16 ± 0.012
0.245CysAsn: 0.245 ± 0.018
0.325CysPro: 0.325 ± 0.02
0.21CysGln: 0.21 ± 0.016
0.266CysArg: 0.266 ± 0.017
0.422CysSer: 0.422 ± 0.024
0.333CysThr: 0.333 ± 0.019
0.368CysVal: 0.368 ± 0.019
0.051CysTrp: 0.051 ± 0.007
0.224CysTyr: 0.224 ± 0.015
0.0CysXaa: 0.0 ± 0.0
Asp
3.897AspAla: 3.897 ± 0.066
0.329AspCys: 0.329 ± 0.019
3.382AspAsp: 3.382 ± 0.069
5.051AspGlu: 5.051 ± 0.092
2.443AspPhe: 2.443 ± 0.054
3.746AspGly: 3.746 ± 0.067
1.273AspHis: 1.273 ± 0.039
4.431AspIle: 4.431 ± 0.072
3.496AspLys: 3.496 ± 0.064
5.078AspLeu: 5.078 ± 0.076
1.729AspMet: 1.729 ± 0.039
2.498AspAsn: 2.498 ± 0.051
2.177AspPro: 2.177 ± 0.053
2.026AspGln: 2.026 ± 0.041
2.41AspArg: 2.41 ± 0.053
2.871AspSer: 2.871 ± 0.062
2.835AspThr: 2.835 ± 0.06
4.364AspVal: 4.364 ± 0.07
0.69AspTrp: 0.69 ± 0.03
2.492AspTyr: 2.492 ± 0.053
0.0AspXaa: 0.0 ± 0.0
Glu
6.064GluAla: 6.064 ± 0.105
0.295GluCys: 0.295 ± 0.018
4.334GluAsp: 4.334 ± 0.076
6.771GluGlu: 6.771 ± 0.11
2.296GluPhe: 2.296 ± 0.054
4.449GluGly: 4.449 ± 0.074
1.526GluHis: 1.526 ± 0.041
4.996GluIle: 4.996 ± 0.077
6.057GluLys: 6.057 ± 0.094
6.875GluLeu: 6.875 ± 0.099
2.368GluMet: 2.368 ± 0.059
4.203GluAsn: 4.203 ± 0.071
2.258GluPro: 2.258 ± 0.056
3.515GluGln: 3.515 ± 0.064
3.62GluArg: 3.62 ± 0.071
3.942GluSer: 3.942 ± 0.072
4.44GluThr: 4.44 ± 0.077
4.721GluVal: 4.721 ± 0.082
0.8GluTrp: 0.8 ± 0.027
2.019GluTyr: 2.019 ± 0.044
0.0GluXaa: 0.0 ± 0.0
Phe
3.149PheAla: 3.149 ± 0.071
0.276PheCys: 0.276 ± 0.016
2.489PheAsp: 2.489 ± 0.05
2.632PheGlu: 2.632 ± 0.055
2.251PhePhe: 2.251 ± 0.064
3.304PheGly: 3.304 ± 0.069
0.934PheHis: 0.934 ± 0.036
3.737PheIle: 3.737 ± 0.076
2.27PheLys: 2.27 ± 0.055
4.308PheLeu: 4.308 ± 0.088
1.267PheMet: 1.267 ± 0.04
1.981PheAsn: 1.981 ± 0.048
1.622PhePro: 1.622 ± 0.046
1.535PheGln: 1.535 ± 0.043
1.53PheArg: 1.53 ± 0.04
3.105PheSer: 3.105 ± 0.058
2.506PheThr: 2.506 ± 0.052
2.907PheVal: 2.907 ± 0.067
0.484PheTrp: 0.484 ± 0.024
1.651PheTyr: 1.651 ± 0.043
0.0PheXaa: 0.0 ± 0.0
Gly
5.248GlyAla: 5.248 ± 0.088
0.518GlyCys: 0.518 ± 0.024
3.654GlyAsp: 3.654 ± 0.067
4.794GlyGlu: 4.794 ± 0.077
3.429GlyPhe: 3.429 ± 0.067
5.152GlyGly: 5.152 ± 0.103
1.413GlyHis: 1.413 ± 0.039
5.819GlyIle: 5.819 ± 0.087
4.6GlyLys: 4.6 ± 0.073
6.663GlyLeu: 6.663 ± 0.112
2.357GlyMet: 2.357 ± 0.049
2.84GlyAsn: 2.84 ± 0.064
1.921GlyPro: 1.921 ± 0.04
2.153GlyGln: 2.153 ± 0.046
2.676GlyArg: 2.676 ± 0.052
4.32GlySer: 4.32 ± 0.069
4.256GlyThr: 4.256 ± 0.061
5.426GlyVal: 5.426 ± 0.08
0.836GlyTrp: 0.836 ± 0.036
2.747GlyTyr: 2.747 ± 0.054
0.0GlyXaa: 0.0 ± 0.0
His
1.459HisAla: 1.459 ± 0.039
0.184HisCys: 0.184 ± 0.016
1.197HisAsp: 1.197 ± 0.033
1.513HisGlu: 1.513 ± 0.034
1.007HisPhe: 1.007 ± 0.037
1.531HisGly: 1.531 ± 0.045
0.71HisHis: 0.71 ± 0.03
1.571HisIle: 1.571 ± 0.041
1.06HisLys: 1.06 ± 0.035
1.949HisLeu: 1.949 ± 0.041
0.583HisMet: 0.583 ± 0.024
0.919HisAsn: 0.919 ± 0.029
1.112HisPro: 1.112 ± 0.039
0.799HisGln: 0.799 ± 0.03
0.915HisArg: 0.915 ± 0.032
1.085HisSer: 1.085 ± 0.033
1.075HisThr: 1.075 ± 0.035
1.531HisVal: 1.531 ± 0.042
0.216HisTrp: 0.216 ± 0.016
0.953HisTyr: 0.953 ± 0.033
0.0HisXaa: 0.0 ± 0.0
Ile
6.125IleAla: 6.125 ± 0.088
0.54IleCys: 0.54 ± 0.024
4.644IleAsp: 4.644 ± 0.079
5.183IleGlu: 5.183 ± 0.082
3.18IlePhe: 3.18 ± 0.076
5.969IleGly: 5.969 ± 0.096
1.516IleHis: 1.516 ± 0.04
5.639IleIle: 5.639 ± 0.103
4.008IleLys: 4.008 ± 0.069
6.523IleLeu: 6.523 ± 0.107
1.96IleMet: 1.96 ± 0.038
3.44IleAsn: 3.44 ± 0.055
3.221IlePro: 3.221 ± 0.061
2.604IleGln: 2.604 ± 0.051
2.929IleArg: 2.929 ± 0.057
4.812IleSer: 4.812 ± 0.069
4.392IleThr: 4.392 ± 0.064
5.237IleVal: 5.237 ± 0.085
0.673IleTrp: 0.673 ± 0.028
2.279IleTyr: 2.279 ± 0.045
0.0IleXaa: 0.0 ± 0.0
Lys
4.583LysAla: 4.583 ± 0.075
0.268LysCys: 0.268 ± 0.017
3.594LysAsp: 3.594 ± 0.071
5.601LysGlu: 5.601 ± 0.087
1.748LysPhe: 1.748 ± 0.044
3.883LysGly: 3.883 ± 0.069
1.396LysHis: 1.396 ± 0.041
3.884LysIle: 3.884 ± 0.06
4.747LysLys: 4.747 ± 0.074
5.15LysLeu: 5.15 ± 0.081
2.024LysMet: 2.024 ± 0.044
3.137LysAsn: 3.137 ± 0.063
2.168LysPro: 2.168 ± 0.054
3.277LysGln: 3.277 ± 0.067
3.179LysArg: 3.179 ± 0.057
3.234LysSer: 3.234 ± 0.057
3.552LysThr: 3.552 ± 0.064
3.792LysVal: 3.792 ± 0.071
0.731LysTrp: 0.731 ± 0.025
1.838LysTyr: 1.838 ± 0.046
0.0LysXaa: 0.0 ± 0.0
Leu
7.2LeuAla: 7.2 ± 0.105
0.514LeuCys: 0.514 ± 0.022
4.906LeuAsp: 4.906 ± 0.076
6.391LeuGlu: 6.391 ± 0.095
4.604LeuPhe: 4.604 ± 0.092
6.447LeuGly: 6.447 ± 0.102
1.842LeuHis: 1.842 ± 0.045
7.11LeuIle: 7.11 ± 0.126
5.845LeuLys: 5.845 ± 0.078
9.245LeuLeu: 9.245 ± 0.142
2.714LeuMet: 2.714 ± 0.052
4.452LeuAsn: 4.452 ± 0.067
3.888LeuPro: 3.888 ± 0.062
3.239LeuGln: 3.239 ± 0.059
3.459LeuArg: 3.459 ± 0.061
6.279LeuSer: 6.279 ± 0.091
5.576LeuThr: 5.576 ± 0.077
5.806LeuVal: 5.806 ± 0.094
0.804LeuTrp: 0.804 ± 0.034
3.04LeuTyr: 3.04 ± 0.059
0.0LeuXaa: 0.0 ± 0.0
Met
2.464MetAla: 2.464 ± 0.053
0.15MetCys: 0.15 ± 0.013
1.698MetAsp: 1.698 ± 0.042
2.141MetGlu: 2.141 ± 0.052
1.115MetPhe: 1.115 ± 0.035
1.894MetGly: 1.894 ± 0.044
0.541MetHis: 0.541 ± 0.025
2.102MetIle: 2.102 ± 0.051
2.215MetLys: 2.215 ± 0.048
2.808MetLeu: 2.808 ± 0.054
0.873MetMet: 0.873 ± 0.033
1.57MetAsn: 1.57 ± 0.04
1.112MetPro: 1.112 ± 0.04
1.094MetGln: 1.094 ± 0.034
1.254MetArg: 1.254 ± 0.037
1.746MetSer: 1.746 ± 0.038
1.93MetThr: 1.93 ± 0.042
1.919MetVal: 1.919 ± 0.044
0.214MetTrp: 0.214 ± 0.017
0.738MetTyr: 0.738 ± 0.026
0.0MetXaa: 0.0 ± 0.0
Asn
3.151AsnAla: 3.151 ± 0.056
0.268AsnCys: 0.268 ± 0.017
2.833AsnAsp: 2.833 ± 0.06
3.778AsnGlu: 3.778 ± 0.065
1.676AsnPhe: 1.676 ± 0.047
3.392AsnGly: 3.392 ± 0.079
1.019AsnHis: 1.019 ± 0.034
3.245AsnIle: 3.245 ± 0.068
2.69AsnLys: 2.69 ± 0.056
3.825AsnLeu: 3.825 ± 0.057
1.399AsnMet: 1.399 ± 0.037
2.202AsnAsn: 2.202 ± 0.063
2.096AsnPro: 2.096 ± 0.051
2.119AsnGln: 2.119 ± 0.05
2.112AsnArg: 2.112 ± 0.05
2.209AsnSer: 2.209 ± 0.053
2.364AsnThr: 2.364 ± 0.052
3.21AsnVal: 3.21 ± 0.056
0.54AsnTrp: 0.54 ± 0.024
1.624AsnTyr: 1.624 ± 0.044
0.0AsnXaa: 0.0 ± 0.0
Pro
2.61ProAla: 2.61 ± 0.063
0.197ProCys: 0.197 ± 0.015
2.555ProAsp: 2.555 ± 0.055
3.565ProGlu: 3.565 ± 0.061
1.92ProPhe: 1.92 ± 0.052
2.612ProGly: 2.612 ± 0.062
0.822ProHis: 0.822 ± 0.036
2.632ProIle: 2.632 ± 0.054
1.94ProLys: 1.94 ± 0.05
3.484ProLeu: 3.484 ± 0.061
0.89ProMet: 0.89 ± 0.031
1.493ProAsn: 1.493 ± 0.04
1.066ProPro: 1.066 ± 0.038
1.064ProGln: 1.064 ± 0.033
1.104ProArg: 1.104 ± 0.035
2.241ProSer: 2.241 ± 0.054
1.721ProThr: 1.721 ± 0.043
3.178ProVal: 3.178 ± 0.06
0.367ProTrp: 0.367 ± 0.02
1.453ProTyr: 1.453 ± 0.039
0.0ProXaa: 0.0 ± 0.0
Gln
2.917GlnAla: 2.917 ± 0.056
0.161GlnCys: 0.161 ± 0.015
1.862GlnAsp: 1.862 ± 0.043
2.945GlnGlu: 2.945 ± 0.06
1.603GlnPhe: 1.603 ± 0.042
2.126GlnGly: 2.126 ± 0.055
0.852GlnHis: 0.852 ± 0.032
2.447GlnIle: 2.447 ± 0.054
2.604GlnLys: 2.604 ± 0.055
3.908GlnLeu: 3.908 ± 0.065
1.185GlnMet: 1.185 ± 0.036
1.786GlnAsn: 1.786 ± 0.053
1.229GlnPro: 1.229 ± 0.036
1.947GlnGln: 1.947 ± 0.061
1.548GlnArg: 1.548 ± 0.044
2.267GlnSer: 2.267 ± 0.054
2.185GlnThr: 2.185 ± 0.054
2.242GlnVal: 2.242 ± 0.045
0.422GlnTrp: 0.422 ± 0.024
1.247GlnTyr: 1.247 ± 0.033
0.0GlnXaa: 0.0 ± 0.0
Arg
2.582ArgAla: 2.582 ± 0.059
0.219ArgCys: 0.219 ± 0.014
2.313ArgAsp: 2.313 ± 0.055
3.359ArgGlu: 3.359 ± 0.064
1.89ArgPhe: 1.89 ± 0.046
2.372ArgGly: 2.372 ± 0.058
1.0ArgHis: 1.0 ± 0.039
2.927ArgIle: 2.927 ± 0.055
3.062ArgLys: 3.062 ± 0.057
3.959ArgLeu: 3.959 ± 0.069
1.343ArgMet: 1.343 ± 0.043
1.97ArgAsn: 1.97 ± 0.041
1.33ArgPro: 1.33 ± 0.033
1.72ArgGln: 1.72 ± 0.046
1.946ArgArg: 1.946 ± 0.043
2.241ArgSer: 2.241 ± 0.05
2.071ArgThr: 2.071 ± 0.05
2.631ArgVal: 2.631 ± 0.058
0.375ArgTrp: 0.375 ± 0.022
1.539ArgTyr: 1.539 ± 0.039
0.0ArgXaa: 0.0 ± 0.0
Ser
4.089SerAla: 4.089 ± 0.072
0.319SerCys: 0.319 ± 0.017
3.541SerAsp: 3.541 ± 0.069
4.355SerGlu: 4.355 ± 0.073
2.969SerPhe: 2.969 ± 0.063
4.868SerGly: 4.868 ± 0.078
1.257SerHis: 1.257 ± 0.038
4.736SerIle: 4.736 ± 0.07
3.204SerLys: 3.204 ± 0.06
5.551SerLeu: 5.551 ± 0.085
1.72SerMet: 1.72 ± 0.044
2.53SerAsn: 2.53 ± 0.056
2.149SerPro: 2.149 ± 0.055
2.024SerGln: 2.024 ± 0.052
2.444SerArg: 2.444 ± 0.056
3.788SerSer: 3.788 ± 0.075
2.95SerThr: 2.95 ± 0.054
4.318SerVal: 4.318 ± 0.067
0.588SerTrp: 0.588 ± 0.023
2.125SerTyr: 2.125 ± 0.042
0.0SerXaa: 0.0 ± 0.0
Thr
4.449ThrAla: 4.449 ± 0.067
0.311ThrCys: 0.311 ± 0.021
3.19ThrAsp: 3.19 ± 0.056
3.874ThrGlu: 3.874 ± 0.074
2.682ThrPhe: 2.682 ± 0.06
4.638ThrGly: 4.638 ± 0.074
1.156ThrHis: 1.156 ± 0.034
4.502ThrIle: 4.502 ± 0.068
2.878ThrLys: 2.878 ± 0.055
5.039ThrLeu: 5.039 ± 0.079
1.5ThrMet: 1.5 ± 0.039
2.276ThrAsn: 2.276 ± 0.05
2.36ThrPro: 2.36 ± 0.048
1.512ThrGln: 1.512 ± 0.037
1.947ThrArg: 1.947 ± 0.043
3.389ThrSer: 3.389 ± 0.057
2.916ThrThr: 2.916 ± 0.06
4.302ThrVal: 4.302 ± 0.068
0.535ThrTrp: 0.535 ± 0.025
1.967ThrTyr: 1.967 ± 0.042
0.0ThrXaa: 0.0 ± 0.0
Val
4.959ValAla: 4.959 ± 0.083
0.524ValCys: 0.524 ± 0.024
3.85ValAsp: 3.85 ± 0.068
4.612ValGlu: 4.612 ± 0.074
3.192ValPhe: 3.192 ± 0.063
4.48ValGly: 4.48 ± 0.081
1.385ValHis: 1.385 ± 0.039
5.707ValIle: 5.707 ± 0.089
4.075ValLys: 4.075 ± 0.066
6.892ValLeu: 6.892 ± 0.089
2.094ValMet: 2.094 ± 0.048
3.236ValAsn: 3.236 ± 0.056
2.887ValPro: 2.887 ± 0.052
2.38ValGln: 2.38 ± 0.05
2.748ValArg: 2.748 ± 0.052
4.57ValSer: 4.57 ± 0.067
4.357ValThr: 4.357 ± 0.08
5.091ValVal: 5.091 ± 0.091
0.602ValTrp: 0.602 ± 0.026
2.368ValTyr: 2.368 ± 0.054
0.0ValXaa: 0.0 ± 0.0
Trp
0.643TrpAla: 0.643 ± 0.03
0.065TrpCys: 0.065 ± 0.009
0.521TrpAsp: 0.521 ± 0.022
0.673TrpGlu: 0.673 ± 0.03
0.495TrpPhe: 0.495 ± 0.024
0.676TrpGly: 0.676 ± 0.029
0.24TrpHis: 0.24 ± 0.017
0.735TrpIle: 0.735 ± 0.029
0.634TrpLys: 0.634 ± 0.031
1.2TrpLeu: 1.2 ± 0.041
0.35TrpMet: 0.35 ± 0.019
0.494TrpAsn: 0.494 ± 0.022
0.299TrpPro: 0.299 ± 0.018
0.426TrpGln: 0.426 ± 0.022
0.377TrpArg: 0.377 ± 0.02
0.54TrpSer: 0.54 ± 0.025
0.55TrpThr: 0.55 ± 0.025
0.679TrpVal: 0.679 ± 0.029
0.136TrpTrp: 0.136 ± 0.014
0.366TrpTyr: 0.366 ± 0.023
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.308TyrAla: 2.308 ± 0.053
0.25TyrCys: 0.25 ± 0.017
2.113TyrAsp: 2.113 ± 0.052
2.484TyrGlu: 2.484 ± 0.057
1.743TyrPhe: 1.743 ± 0.047
2.539TyrGly: 2.539 ± 0.054
0.887TyrHis: 0.887 ± 0.027
2.43TyrIle: 2.43 ± 0.054
1.799TyrLys: 1.799 ± 0.044
3.298TyrLeu: 3.298 ± 0.062
0.894TyrMet: 0.894 ± 0.028
1.527TyrAsn: 1.527 ± 0.04
1.435TyrPro: 1.435 ± 0.037
1.506TyrGln: 1.506 ± 0.04
1.598TyrArg: 1.598 ± 0.044
1.92TyrSer: 1.92 ± 0.043
1.84TyrThr: 1.84 ± 0.042
2.221TyrVal: 2.221 ± 0.045
0.383TyrTrp: 0.383 ± 0.02
1.401TyrTyr: 1.401 ± 0.045
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3300 proteins (945704 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski