Amino acid dipeptide frequency for Solimonas aquatica

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 400 equally likely, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
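The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the method used to build this page: it assumes dipeptides are counted as overlapping adjacent pairs within each protein, uses one-letter residue codes instead of the three-letter codes in the table, and maps every non-standard character to X (Xaa). The names dipeptide_per_mille and label are hypothetical.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    # Report all 441 pairs, including those that never occur.
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Uniform expectation from the text: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def label(freq, expected=EXPECTED_PER_MILLE):
    """Classify a frequency relative to the uniform expectation."""
    if freq > expected:
        return "over"   # would be shown in red
    if freq < expected:
        return "under"  # would be shown in blue
    return "expected"


# Toy input, not real proteome data.
toy = ["AAAL", "ALA"]
freqs = dipeptide_per_mille(toy)
```

With the toy input there are five dipeptides in total (AA, AA, AL from the first sequence; AL, LA from the second), so AA and AL each come out at 400‰ and LA at 200‰.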

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
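As a small worked example of the unit conversion, the snippet below turns a per mille value into a fraction and compares it with the uniform expectation of 2.5‰. The function names are hypothetical; the AlaAla value is taken from the table.

```python
# 2.5 per mille if all 400 standard dipeptides were equally likely.
UNIFORM_EXPECTED_PER_MILLE = 1000.0 / 400


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def fold_vs_uniform(value):
    """Ratio of an observed per mille value to the uniform expectation."""
    return value / UNIFORM_EXPECTED_PER_MILLE


ala_ala = 18.156  # AlaAla value from the table, in per mille
print(f"{per_mille_to_fraction(ala_ala):.6f}")  # prints 0.018156
print(f"{fold_vs_uniform(ala_ala):.2f}")        # prints 7.26
```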

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 18.156 ± 0.172
AlaCys: 1.4 ± 0.036
AlaAsp: 6.185 ± 0.072
AlaGlu: 7.114 ± 0.091
AlaPhe: 3.654 ± 0.062
AlaGly: 10.478 ± 0.107
AlaHis: 2.485 ± 0.048
AlaIle: 4.961 ± 0.067
AlaLys: 3.491 ± 0.07
AlaLeu: 15.227 ± 0.142
AlaMet: 3.062 ± 0.045
AlaAsn: 2.666 ± 0.052
AlaPro: 6.444 ± 0.094
AlaGln: 7.895 ± 0.111
AlaArg: 9.748 ± 0.117
AlaSer: 6.56 ± 0.077
AlaThr: 5.176 ± 0.076
AlaVal: 8.149 ± 0.091
AlaTrp: 1.744 ± 0.046
AlaTyr: 3.061 ± 0.053
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.391 ± 0.037
CysCys: 0.145 ± 0.01
CysAsp: 0.468 ± 0.019
CysGlu: 0.629 ± 0.026
CysPhe: 0.326 ± 0.016
CysGly: 1.108 ± 0.033
CysHis: 0.257 ± 0.014
CysIle: 0.395 ± 0.018
CysLys: 0.238 ± 0.012
CysLeu: 0.967 ± 0.025
CysMet: 0.173 ± 0.012
CysAsn: 0.229 ± 0.015
CysPro: 0.499 ± 0.025
CysGln: 0.322 ± 0.017
CysArg: 0.694 ± 0.024
CysSer: 0.616 ± 0.024
CysThr: 0.486 ± 0.018
CysVal: 0.699 ± 0.022
CysTrp: 0.15 ± 0.01
CysTyr: 0.259 ± 0.014
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.963 ± 0.09
AspCys: 0.554 ± 0.022
AspAsp: 2.361 ± 0.058
AspGlu: 3.248 ± 0.051
AspPhe: 2.058 ± 0.04
AspGly: 4.471 ± 0.081
AspHis: 1.073 ± 0.029
AspIle: 2.166 ± 0.042
AspLys: 1.634 ± 0.04
AspLeu: 5.385 ± 0.065
AspMet: 0.924 ± 0.025
AspAsn: 1.291 ± 0.037
AspPro: 2.951 ± 0.052
AspGln: 1.71 ± 0.037
AspArg: 3.13 ± 0.053
AspSer: 2.641 ± 0.056
AspThr: 2.348 ± 0.068
AspVal: 3.311 ± 0.056
AspTrp: 0.992 ± 0.026
AspTyr: 1.76 ± 0.04
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.617 ± 0.101
GluCys: 0.399 ± 0.02
GluAsp: 2.553 ± 0.041
GluGlu: 2.639 ± 0.06
GluPhe: 1.812 ± 0.044
GluGly: 3.653 ± 0.055
GluHis: 1.474 ± 0.038
GluIle: 2.742 ± 0.046
GluLys: 1.838 ± 0.043
GluLeu: 6.936 ± 0.083
GluMet: 1.094 ± 0.032
GluAsn: 1.415 ± 0.033
GluPro: 2.537 ± 0.044
GluGln: 3.209 ± 0.058
GluArg: 4.722 ± 0.078
GluSer: 2.546 ± 0.047
GluThr: 2.527 ± 0.049
GluVal: 3.799 ± 0.067
GluTrp: 0.628 ± 0.021
GluTyr: 1.292 ± 0.033
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.302 ± 0.053
PheCys: 0.411 ± 0.018
PheAsp: 2.287 ± 0.039
PheGlu: 1.979 ± 0.045
PhePhe: 1.326 ± 0.04
PheGly: 3.16 ± 0.055
PheHis: 0.719 ± 0.024
PheIle: 1.44 ± 0.034
PheLys: 1.074 ± 0.029
PheLeu: 3.038 ± 0.051
PheMet: 0.738 ± 0.024
PheAsn: 1.152 ± 0.029
PhePro: 1.422 ± 0.034
PheGln: 1.008 ± 0.028
PheArg: 1.961 ± 0.039
PheSer: 2.098 ± 0.044
PheThr: 1.805 ± 0.045
PheVal: 2.457 ± 0.042
PheTrp: 0.516 ± 0.02
PheTyr: 0.989 ± 0.026
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 9.272 ± 0.099
GlyCys: 0.909 ± 0.032
GlyAsp: 3.896 ± 0.072
GlyGlu: 5.108 ± 0.065
GlyPhe: 3.242 ± 0.052
GlyGly: 7.313 ± 0.12
GlyHis: 1.851 ± 0.035
GlyIle: 3.917 ± 0.059
GlyLys: 3.057 ± 0.053
GlyLeu: 9.022 ± 0.085
GlyMet: 1.933 ± 0.042
GlyAsn: 2.255 ± 0.06
GlyPro: 2.896 ± 0.052
GlyGln: 3.131 ± 0.055
GlyArg: 5.429 ± 0.065
GlySer: 5.128 ± 0.118
GlyThr: 3.793 ± 0.072
GlyVal: 6.017 ± 0.067
GlyTrp: 1.307 ± 0.033
GlyTyr: 2.512 ± 0.053
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.488 ± 0.045
HisCys: 0.329 ± 0.018
HisAsp: 1.127 ± 0.029
HisGlu: 1.302 ± 0.033
HisPhe: 0.895 ± 0.028
HisGly: 2.086 ± 0.043
HisHis: 0.68 ± 0.028
HisIle: 0.849 ± 0.023
HisLys: 0.544 ± 0.021
HisLeu: 2.442 ± 0.048
HisMet: 0.432 ± 0.017
HisAsn: 0.531 ± 0.018
HisPro: 1.35 ± 0.034
HisGln: 0.818 ± 0.027
HisArg: 1.645 ± 0.044
HisSer: 1.122 ± 0.029
HisThr: 0.892 ± 0.026
HisVal: 1.222 ± 0.029
HisTrp: 0.504 ± 0.02
HisTyr: 0.791 ± 0.027
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.194 ± 0.07
IleCys: 0.454 ± 0.02
IleAsp: 2.883 ± 0.056
IleGlu: 3.12 ± 0.055
IlePhe: 1.245 ± 0.033
IleGly: 4.105 ± 0.068
IleHis: 0.834 ± 0.026
IleIle: 1.548 ± 0.038
IleLys: 1.404 ± 0.034
IleLeu: 3.283 ± 0.056
IleMet: 0.653 ± 0.025
IleAsn: 1.338 ± 0.038
IlePro: 1.961 ± 0.04
IleGln: 1.291 ± 0.036
IleArg: 2.834 ± 0.049
IleSer: 2.494 ± 0.05
IleThr: 2.073 ± 0.039
IleVal: 3.245 ± 0.058
IleTrp: 0.537 ± 0.022
IleTyr: 1.022 ± 0.031
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.734 ± 0.063
LysCys: 0.174 ± 0.011
LysAsp: 1.482 ± 0.036
LysGlu: 1.272 ± 0.038
LysPhe: 0.814 ± 0.027
LysGly: 2.106 ± 0.046
LysHis: 0.564 ± 0.023
LysIle: 1.46 ± 0.036
LysLys: 1.208 ± 0.036
LysLeu: 3.868 ± 0.054
LysMet: 0.637 ± 0.022
LysAsn: 0.913 ± 0.029
LysPro: 2.141 ± 0.048
LysGln: 1.259 ± 0.033
LysArg: 2.212 ± 0.046
LysSer: 1.648 ± 0.034
LysThr: 1.813 ± 0.04
LysVal: 2.107 ± 0.042
LysTrp: 0.308 ± 0.014
LysTyr: 0.673 ± 0.027
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 15.051 ± 0.148
LeuCys: 1.285 ± 0.038
LeuAsp: 6.425 ± 0.071
LeuGlu: 5.409 ± 0.078
LeuPhe: 3.584 ± 0.054
LeuGly: 8.952 ± 0.099
LeuHis: 2.662 ± 0.05
LeuIle: 4.934 ± 0.065
LeuLys: 3.82 ± 0.059
LeuLeu: 14.246 ± 0.19
LeuMet: 2.379 ± 0.046
LeuAsn: 2.936 ± 0.054
LeuPro: 7.049 ± 0.079
LeuGln: 5.391 ± 0.076
LeuArg: 10.007 ± 0.133
LeuSer: 7.834 ± 0.098
LeuThr: 4.644 ± 0.064
LeuVal: 6.554 ± 0.076
LeuTrp: 1.456 ± 0.039
LeuTyr: 2.492 ± 0.052
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.453 ± 0.045
MetCys: 0.141 ± 0.01
MetAsp: 0.928 ± 0.027
MetGlu: 0.849 ± 0.027
MetPhe: 0.669 ± 0.025
MetGly: 1.488 ± 0.036
MetHis: 0.462 ± 0.016
MetIle: 0.906 ± 0.028
MetLys: 0.849 ± 0.024
MetLeu: 2.506 ± 0.051
MetMet: 0.489 ± 0.02
MetAsn: 0.846 ± 0.023
MetPro: 1.302 ± 0.033
MetGln: 0.972 ± 0.026
MetArg: 1.628 ± 0.037
MetSer: 1.741 ± 0.041
MetThr: 1.057 ± 0.03
MetVal: 1.258 ± 0.033
MetTrp: 0.178 ± 0.012
MetTyr: 0.313 ± 0.015
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.198 ± 0.063
AsnCys: 0.29 ± 0.015
AsnAsp: 1.222 ± 0.052
AsnGlu: 1.252 ± 0.032
AsnPhe: 0.998 ± 0.026
AsnGly: 2.19 ± 0.059
AsnHis: 0.511 ± 0.019
AsnIle: 1.202 ± 0.035
AsnLys: 0.783 ± 0.025
AsnLeu: 3.013 ± 0.05
AsnMet: 0.501 ± 0.018
AsnAsn: 0.762 ± 0.028
AsnPro: 1.799 ± 0.039
AsnGln: 0.961 ± 0.026
AsnArg: 1.689 ± 0.038
AsnSer: 1.364 ± 0.041
AsnThr: 1.41 ± 0.043
AsnVal: 1.717 ± 0.039
AsnTrp: 0.434 ± 0.018
AsnTyr: 0.723 ± 0.026
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 7.146 ± 0.097
ProCys: 0.362 ± 0.015
ProAsp: 2.952 ± 0.051
ProGlu: 3.497 ± 0.056
ProPhe: 1.563 ± 0.035
ProGly: 4.617 ± 0.074
ProHis: 1.04 ± 0.028
ProIle: 1.769 ± 0.04
ProLys: 1.461 ± 0.037
ProLeu: 5.882 ± 0.077
ProMet: 1.239 ± 0.033
ProAsn: 1.137 ± 0.033
ProPro: 2.988 ± 0.064
ProGln: 2.825 ± 0.05
ProArg: 3.403 ± 0.051
ProSer: 2.547 ± 0.048
ProThr: 2.013 ± 0.042
ProVal: 3.727 ± 0.059
ProTrp: 0.714 ± 0.024
ProTyr: 1.337 ± 0.031
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 6.664 ± 0.093
GlnCys: 0.334 ± 0.016
GlnAsp: 2.041 ± 0.04
GlnGlu: 1.853 ± 0.044
GlnPhe: 1.349 ± 0.038
GlnGly: 3.207 ± 0.053
GlnHis: 1.052 ± 0.029
GlnIle: 1.98 ± 0.04
GlnLys: 1.28 ± 0.032
GlnLeu: 6.221 ± 0.088
GlnMet: 0.909 ± 0.03
GlnAsn: 1.127 ± 0.031
GlnPro: 2.324 ± 0.047
GlnGln: 2.629 ± 0.055
GlnArg: 4.215 ± 0.07
GlnSer: 2.28 ± 0.041
GlnThr: 2.107 ± 0.044
GlnVal: 2.757 ± 0.047
GlnTrp: 0.692 ± 0.027
GlnTyr: 1.049 ± 0.029
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 8.386 ± 0.106
ArgCys: 0.741 ± 0.025
ArgAsp: 3.723 ± 0.051
ArgGlu: 5.34 ± 0.081
ArgPhe: 2.815 ± 0.047
ArgGly: 5.179 ± 0.067
ArgHis: 1.883 ± 0.047
ArgIle: 3.665 ± 0.056
ArgLys: 2.164 ± 0.047
ArgLeu: 9.243 ± 0.103
ArgMet: 1.632 ± 0.031
ArgAsn: 1.863 ± 0.037
ArgPro: 3.077 ± 0.045
ArgGln: 3.263 ± 0.057
ArgArg: 5.883 ± 0.098
ArgSer: 3.902 ± 0.058
ArgThr: 2.568 ± 0.049
ArgVal: 5.122 ± 0.072
ArgTrp: 1.255 ± 0.031
ArgTyr: 2.545 ± 0.044
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.151 ± 0.092
SerCys: 0.623 ± 0.021
SerAsp: 2.799 ± 0.058
SerGlu: 2.99 ± 0.048
SerPhe: 2.263 ± 0.043
SerGly: 5.843 ± 0.128
SerHis: 1.058 ± 0.031
SerIle: 2.236 ± 0.046
SerLys: 1.409 ± 0.033
SerLeu: 6.55 ± 0.081
SerMet: 1.149 ± 0.028
SerAsn: 1.405 ± 0.041
SerPro: 2.724 ± 0.047
SerGln: 2.0 ± 0.038
SerArg: 3.672 ± 0.05
SerSer: 3.532 ± 0.075
SerThr: 2.614 ± 0.056
SerVal: 3.896 ± 0.065
SerTrp: 0.937 ± 0.026
SerTyr: 1.501 ± 0.037
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.298 ± 0.078
ThrCys: 0.316 ± 0.016
ThrAsp: 1.888 ± 0.046
ThrGlu: 2.057 ± 0.044
ThrPhe: 1.278 ± 0.034
ThrGly: 4.002 ± 0.076
ThrHis: 1.052 ± 0.028
ThrIle: 1.782 ± 0.048
ThrLys: 0.968 ± 0.026
ThrLeu: 6.129 ± 0.085
ThrMet: 0.838 ± 0.025
ThrAsn: 0.904 ± 0.03
ThrPro: 3.257 ± 0.063
ThrGln: 2.234 ± 0.04
ThrArg: 3.283 ± 0.047
ThrSer: 2.024 ± 0.045
ThrThr: 2.013 ± 0.05
ThrVal: 3.618 ± 0.08
ThrTrp: 0.522 ± 0.022
ThrTyr: 1.004 ± 0.029
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.767 ± 0.09
ValCys: 0.74 ± 0.024
ValAsp: 3.737 ± 0.056
ValGlu: 3.708 ± 0.059
ValPhe: 2.427 ± 0.044
ValGly: 4.776 ± 0.069
ValHis: 1.431 ± 0.03
ValIle: 3.139 ± 0.055
ValLys: 2.097 ± 0.044
ValLeu: 8.08 ± 0.078
ValMet: 1.57 ± 0.033
ValAsn: 2.049 ± 0.044
ValPro: 3.55 ± 0.055
ValGln: 2.911 ± 0.043
ValArg: 4.525 ± 0.059
ValSer: 4.053 ± 0.073
ValThr: 3.158 ± 0.07
ValVal: 4.808 ± 0.068
ValTrp: 0.812 ± 0.027
ValTyr: 1.695 ± 0.035
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.243 ± 0.033
TrpCys: 0.17 ± 0.01
TrpAsp: 0.636 ± 0.021
TrpGlu: 0.534 ± 0.02
TrpPhe: 0.516 ± 0.022
TrpGly: 0.857 ± 0.029
TrpHis: 0.39 ± 0.017
TrpIle: 0.551 ± 0.021
TrpLys: 0.384 ± 0.016
TrpLeu: 2.179 ± 0.043
TrpMet: 0.285 ± 0.016
TrpAsn: 0.492 ± 0.018
TrpPro: 0.759 ± 0.022
TrpGln: 1.016 ± 0.031
TrpArg: 1.348 ± 0.034
TrpSer: 0.834 ± 0.029
TrpThr: 0.676 ± 0.022
TrpVal: 0.86 ± 0.027
TrpTrp: 0.251 ± 0.014
TrpTyr: 0.379 ± 0.017
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.181 ± 0.049
TyrCys: 0.277 ± 0.014
TyrAsp: 1.376 ± 0.034
TyrGlu: 1.526 ± 0.037
TyrPhe: 1.047 ± 0.031
TyrGly: 2.399 ± 0.045
TyrHis: 0.571 ± 0.02
TyrIle: 0.876 ± 0.027
TyrLys: 0.691 ± 0.024
TyrLeu: 2.894 ± 0.054
TyrMet: 0.399 ± 0.017
TyrAsn: 0.706 ± 0.026
TyrPro: 1.191 ± 0.03
TyrGln: 1.224 ± 0.034
TyrArg: 2.223 ± 0.044
TyrSer: 1.39 ± 0.034
TyrThr: 1.295 ± 0.037
TyrVal: 1.709 ± 0.036
TyrTrp: 0.422 ± 0.019
TyrTyr: 0.75 ± 0.026
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3891 proteins (1330244 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
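To make the note above concrete, here is a minimal sketch of a protein-level bootstrap. It is an assumption-laden illustration, not the code behind this page: it assumes that whole proteins are resampled with replacement, that dipeptides are counted as overlapping pairs within each protein, and that the reported error is the standard deviation across the 100 replicates. The function names and the one-letter toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Proteins, not individual dipeptides, are the resampling unit.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]  # count once, reuse
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[dipeptide] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input, not real proteome data.
mean_aa, err_aa = bootstrap_error(["AAAL", "ALA", "LLAA", "AALL"], "AA")
```

Resampling whole proteins keeps the dipeptides of one protein together, so the error reflects variation between proteins rather than treating every dipeptide as an independent observation.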

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski