Amino acid dipeptide frequency for Pelotomaculum thermopropionicum (strain DSM 13744 / JCM 10971 / SI)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
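As a rough illustration of what such a calculation involves, the sketch below counts adjacent residue pairs in a list of protein sequences and reports them in per mille. It is not the database's own code: the function name `dipeptide_per_mille`, the toy sequences, the use of one-letter codes, and the choice to count overlapping pairs within each protein (never across protein boundaries) are all assumptions made for the example.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: pairs are counted with an overlapping window of width 2
    inside each protein, and never across two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        # "MAAK" yields the pairs MA, AA, AK
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input, not taken from the proteome
toy = ["MAAK", "AAL"]
freqs = dipeptide_per_mille(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

The frequencies returned this way sum to 1000 ‰ over all observed pairs, which is the same unit used in the table.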

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with all amino acids equally likely, each dipeptide would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
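The unit conversion and the comparison against the random expectation can be sketched as follows. The page does not state exactly how the red and blue marking is decided, so the simple "greater than" or "less than" test against the uniform expectation of 0.25% (2.5 ‰) is an assumption, as are the helper names. The two observed values are the AlaAla and CysCys entries from the table.

```python
STANDARD_AMINO_ACIDS = 20


def uniform_expectation_per_mille(alphabet_size=STANDARD_AMINO_ACIDS):
    """Expected frequency of any one dipeptide if all pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(observed, expected):
    """Assumed rule: compare the observed value directly with the expectation."""
    if observed > expected:
        return "overrepresented"
    if observed < expected:
        return "underrepresented"
    return "as expected"


expected = uniform_expectation_per_mille()  # 1000 / 400 = 2.5 per mille
observed = {"AlaAla": 11.884, "CysCys": 0.235}  # values from the table

for name, value in observed.items():
    print(name, per_mille_to_fraction(value), classify(value, expected))
```

A stricter rule could also take the reported ± error into account before calling a dipeptide over- or underrepresented; that refinement is left out here.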

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
11.884AlaAla: 11.884 ± 0.173
1.38AlaCys: 1.38 ± 0.043
4.114AlaAsp: 4.114 ± 0.079
6.551AlaGlu: 6.551 ± 0.088
3.292AlaPhe: 3.292 ± 0.066
11.619AlaGly: 11.619 ± 0.159
1.313AlaHis: 1.313 ± 0.044
4.656AlaIle: 4.656 ± 0.076
3.724AlaLys: 3.724 ± 0.068
9.849AlaLeu: 9.849 ± 0.109
2.228AlaMet: 2.228 ± 0.055
2.204AlaAsn: 2.204 ± 0.054
3.104AlaPro: 3.104 ± 0.069
2.447AlaGln: 2.447 ± 0.056
7.051AlaArg: 7.051 ± 0.107
4.106AlaSer: 4.106 ± 0.07
3.269AlaThr: 3.269 ± 0.065
9.266AlaVal: 9.266 ± 0.147
0.905AlaTrp: 0.905 ± 0.037
2.36AlaTyr: 2.36 ± 0.055
0.0AlaXaa: 0.0 ± 0.0
Cys
0.944CysAla: 0.944 ± 0.036
0.235CysCys: 0.235 ± 0.019
0.513CysAsp: 0.513 ± 0.027
0.606CysGlu: 0.606 ± 0.025
0.519CysPhe: 0.519 ± 0.025
1.506CysGly: 1.506 ± 0.043
0.27CysHis: 0.27 ± 0.021
0.609CysIle: 0.609 ± 0.029
0.493CysLys: 0.493 ± 0.023
1.401CysLeu: 1.401 ± 0.043
0.258CysMet: 0.258 ± 0.018
0.401CysAsn: 0.401 ± 0.022
0.842CysPro: 0.842 ± 0.036
0.345CysGln: 0.345 ± 0.021
1.172CysArg: 1.172 ± 0.038
0.769CysSer: 0.769 ± 0.033
0.572CysThr: 0.572 ± 0.027
0.696CysVal: 0.696 ± 0.032
0.139CysTrp: 0.139 ± 0.014
0.398CysTyr: 0.398 ± 0.022
0.0CysXaa: 0.0 ± 0.0
Asp
3.502AspAla: 3.502 ± 0.075
0.588AspCys: 0.588 ± 0.026
1.839AspAsp: 1.839 ± 0.051
3.26AspGlu: 3.26 ± 0.068
2.129AspPhe: 2.129 ± 0.051
3.988AspGly: 3.988 ± 0.074
0.724AspHis: 0.724 ± 0.03
3.397AspIle: 3.397 ± 0.063
2.197AspLys: 2.197 ± 0.053
5.091AspLeu: 5.091 ± 0.078
1.119AspMet: 1.119 ± 0.037
1.313AspAsn: 1.313 ± 0.039
2.603AspPro: 2.603 ± 0.055
1.251AspGln: 1.251 ± 0.042
3.287AspArg: 3.287 ± 0.074
2.104AspSer: 2.104 ± 0.051
1.976AspThr: 1.976 ± 0.07
3.384AspVal: 3.384 ± 0.063
0.54AspTrp: 0.54 ± 0.024
1.74AspTyr: 1.74 ± 0.046
0.0AspXaa: 0.0 ± 0.0
Glu
6.908GluAla: 6.908 ± 0.109
0.619GluCys: 0.619 ± 0.03
3.118GluAsp: 3.118 ± 0.057
6.37GluGlu: 6.37 ± 0.103
2.325GluPhe: 2.325 ± 0.057
4.975GluGly: 4.975 ± 0.075
1.155GluHis: 1.155 ± 0.036
5.369GluIle: 5.369 ± 0.094
5.844GluLys: 5.844 ± 0.104
7.322GluLeu: 7.322 ± 0.118
2.12GluMet: 2.12 ± 0.051
2.807GluAsn: 2.807 ± 0.056
2.33GluPro: 2.33 ± 0.055
2.487GluGln: 2.487 ± 0.074
4.438GluArg: 4.438 ± 0.078
2.82GluSer: 2.82 ± 0.058
2.99GluThr: 2.99 ± 0.056
5.626GluVal: 5.626 ± 0.09
0.611GluTrp: 0.611 ± 0.031
2.07GluTyr: 2.07 ± 0.052
0.0GluXaa: 0.0 ± 0.0
Phe
3.227PheAla: 3.227 ± 0.068
0.637PheCys: 0.637 ± 0.024
1.971PheAsp: 1.971 ± 0.053
2.254PheGlu: 2.254 ± 0.047
1.893PhePhe: 1.893 ± 0.054
2.996PheGly: 2.996 ± 0.07
0.639PheHis: 0.639 ± 0.027
2.534PheIle: 2.534 ± 0.066
2.168PheLys: 2.168 ± 0.052
4.198PheLeu: 4.198 ± 0.073
0.938PheMet: 0.938 ± 0.033
1.563PheAsn: 1.563 ± 0.04
1.806PhePro: 1.806 ± 0.047
1.166PheGln: 1.166 ± 0.041
2.082PheArg: 2.082 ± 0.055
2.494PheSer: 2.494 ± 0.053
1.999PheThr: 1.999 ± 0.042
2.412PheVal: 2.412 ± 0.056
0.463PheTrp: 0.463 ± 0.024
1.443PheTyr: 1.443 ± 0.044
0.0PheXaa: 0.0 ± 0.0
Gly
7.236GlyAla: 7.236 ± 0.114
1.259GlyCys: 1.259 ± 0.047
3.653GlyAsp: 3.653 ± 0.063
5.856GlyGlu: 5.856 ± 0.086
3.337GlyPhe: 3.337 ± 0.063
7.07GlyGly: 7.07 ± 0.126
1.486GlyHis: 1.486 ± 0.046
5.617GlyIle: 5.617 ± 0.082
5.171GlyLys: 5.171 ± 0.083
9.015GlyLeu: 9.015 ± 0.121
2.204GlyMet: 2.204 ± 0.052
2.58GlyAsn: 2.58 ± 0.062
2.934GlyPro: 2.934 ± 0.065
2.749GlyGln: 2.749 ± 0.059
6.403GlyArg: 6.403 ± 0.094
4.314GlySer: 4.314 ± 0.072
4.167GlyThr: 4.167 ± 0.071
6.242GlyVal: 6.242 ± 0.106
0.915GlyTrp: 0.915 ± 0.032
2.893GlyTyr: 2.893 ± 0.065
0.0GlyXaa: 0.0 ± 0.0
His
1.238HisAla: 1.238 ± 0.044
0.296HisCys: 0.296 ± 0.019
0.729HisAsp: 0.729 ± 0.03
0.903HisGlu: 0.903 ± 0.032
0.825HisPhe: 0.825 ± 0.028
1.471HisGly: 1.471 ± 0.052
0.425HisHis: 0.425 ± 0.026
0.987HisIle: 0.987 ± 0.035
0.695HisLys: 0.695 ± 0.028
1.923HisLeu: 1.923 ± 0.053
0.303HisMet: 0.303 ± 0.018
0.549HisAsn: 0.549 ± 0.028
1.132HisPro: 1.132 ± 0.041
0.627HisGln: 0.627 ± 0.031
1.215HisArg: 1.215 ± 0.039
0.854HisSer: 0.854 ± 0.028
0.756HisThr: 0.756 ± 0.029
1.071HisVal: 1.071 ± 0.037
0.178HisTrp: 0.178 ± 0.014
0.648HisTyr: 0.648 ± 0.029
0.0HisXaa: 0.0 ± 0.0
Ile
5.529IleAla: 5.529 ± 0.081
0.834IleCys: 0.834 ± 0.029
3.129IleAsp: 3.129 ± 0.068
4.115IleGlu: 4.115 ± 0.08
2.433IlePhe: 2.433 ± 0.053
4.481IleGly: 4.481 ± 0.075
0.981IleHis: 0.981 ± 0.033
4.424IleIle: 4.424 ± 0.086
3.859IleLys: 3.859 ± 0.072
5.816IleLeu: 5.816 ± 0.089
1.537IleMet: 1.537 ± 0.042
2.497IleAsn: 2.497 ± 0.055
3.056IlePro: 3.056 ± 0.063
1.636IleGln: 1.636 ± 0.044
3.606IleArg: 3.606 ± 0.068
3.761IleSer: 3.761 ± 0.067
3.461IleThr: 3.461 ± 0.078
4.038IleVal: 4.038 ± 0.066
0.527IleTrp: 0.527 ± 0.024
1.926IleTyr: 1.926 ± 0.05
0.0IleXaa: 0.0 ± 0.0
Lys
5.103LysAla: 5.103 ± 0.079
0.544LysCys: 0.544 ± 0.028
2.74LysAsp: 2.74 ± 0.062
5.024LysGlu: 5.024 ± 0.096
1.711LysPhe: 1.711 ± 0.043
4.022LysGly: 4.022 ± 0.071
0.898LysHis: 0.898 ± 0.035
4.15LysIle: 4.15 ± 0.066
4.467LysLys: 4.467 ± 0.084
4.769LysLeu: 4.769 ± 0.077
1.549LysMet: 1.549 ± 0.035
2.379LysAsn: 2.379 ± 0.058
2.33LysPro: 2.33 ± 0.062
1.571LysGln: 1.571 ± 0.044
3.065LysArg: 3.065 ± 0.063
2.553LysSer: 2.553 ± 0.062
2.766LysThr: 2.766 ± 0.065
4.428LysVal: 4.428 ± 0.072
0.51LysTrp: 0.51 ± 0.028
1.784LysTyr: 1.784 ± 0.047
0.0LysXaa: 0.0 ± 0.0
Leu
11.632LeuAla: 11.632 ± 0.14
1.113LeuCys: 1.113 ± 0.04
5.101LeuAsp: 5.101 ± 0.08
7.558LeuGlu: 7.558 ± 0.121
3.836LeuPhe: 3.836 ± 0.069
7.512LeuGly: 7.512 ± 0.09
1.62LeuHis: 1.62 ± 0.044
5.697LeuIle: 5.697 ± 0.084
6.861LeuLys: 6.861 ± 0.111
10.224LeuLeu: 10.224 ± 0.151
2.136LeuMet: 2.136 ± 0.051
3.668LeuAsn: 3.668 ± 0.068
5.19LeuPro: 5.19 ± 0.087
2.97LeuGln: 2.97 ± 0.069
5.86LeuArg: 5.86 ± 0.083
5.937LeuSer: 5.937 ± 0.098
4.92LeuThr: 4.92 ± 0.076
7.437LeuVal: 7.437 ± 0.097
0.871LeuTrp: 0.871 ± 0.034
2.793LeuTyr: 2.793 ± 0.063
0.0LeuXaa: 0.0 ± 0.0
Met
2.784MetAla: 2.784 ± 0.062
0.211MetCys: 0.211 ± 0.017
1.159MetAsp: 1.159 ± 0.035
1.822MetGlu: 1.822 ± 0.053
0.757MetPhe: 0.757 ± 0.03
1.938MetGly: 1.938 ± 0.053
0.378MetHis: 0.378 ± 0.019
1.328MetIle: 1.328 ± 0.038
1.48MetLys: 1.48 ± 0.043
2.367MetLeu: 2.367 ± 0.048
0.542MetMet: 0.542 ± 0.029
0.827MetAsn: 0.827 ± 0.032
1.278MetPro: 1.278 ± 0.043
0.723MetGln: 0.723 ± 0.028
1.299MetArg: 1.299 ± 0.045
1.284MetSer: 1.284 ± 0.043
1.126MetThr: 1.126 ± 0.039
1.875MetVal: 1.875 ± 0.041
0.144MetTrp: 0.144 ± 0.013
0.535MetTyr: 0.535 ± 0.024
0.0MetXaa: 0.0 ± 0.0
Asn
2.482AsnAla: 2.482 ± 0.06
0.513AsnCys: 0.513 ± 0.024
1.18AsnAsp: 1.18 ± 0.041
1.925AsnGlu: 1.925 ± 0.051
1.388AsnPhe: 1.388 ± 0.038
2.696AsnGly: 2.696 ± 0.063
0.506AsnHis: 0.506 ± 0.026
2.553AsnIle: 2.553 ± 0.058
1.799AsnLys: 1.799 ± 0.049
3.706AsnLeu: 3.706 ± 0.062
0.854AsnMet: 0.854 ± 0.035
1.258AsnAsn: 1.258 ± 0.043
2.103AsnPro: 2.103 ± 0.056
0.919AsnGln: 0.919 ± 0.038
2.259AsnArg: 2.259 ± 0.057
1.73AsnSer: 1.73 ± 0.043
1.531AsnThr: 1.531 ± 0.045
2.182AsnVal: 2.182 ± 0.053
0.43AsnTrp: 0.43 ± 0.024
1.17AsnTyr: 1.17 ± 0.041
0.0AsnXaa: 0.0 ± 0.0
Pro
5.209ProAla: 5.209 ± 0.103
0.521ProCys: 0.521 ± 0.027
2.709ProAsp: 2.709 ± 0.061
4.281ProGlu: 4.281 ± 0.066
1.878ProPhe: 1.878 ± 0.046
5.192ProGly: 5.192 ± 0.089
0.795ProHis: 0.795 ± 0.033
1.533ProIle: 1.533 ± 0.039
1.652ProLys: 1.652 ± 0.05
4.257ProLeu: 4.257 ± 0.071
0.758ProMet: 0.758 ± 0.031
1.082ProAsn: 1.082 ± 0.035
2.182ProPro: 2.182 ± 0.073
1.354ProGln: 1.354 ± 0.048
2.389ProArg: 2.389 ± 0.056
2.078ProSer: 2.078 ± 0.048
1.507ProThr: 1.507 ± 0.043
4.802ProVal: 4.802 ± 0.076
0.496ProTrp: 0.496 ± 0.026
1.419ProTyr: 1.419 ± 0.042
0.0ProXaa: 0.0 ± 0.0
Gln
3.312GlnAla: 3.312 ± 0.067
0.239GlnCys: 0.239 ± 0.015
1.355GlnAsp: 1.355 ± 0.041
2.41GlnGlu: 2.41 ± 0.078
1.054GlnPhe: 1.054 ± 0.037
2.17GlnGly: 2.17 ± 0.053
0.534GlnHis: 0.534 ± 0.027
1.84GlnIle: 1.84 ± 0.046
2.122GlnLys: 2.122 ± 0.053
2.717GlnLeu: 2.717 ± 0.068
0.814GlnMet: 0.814 ± 0.034
1.036GlnAsn: 1.036 ± 0.035
1.32GlnPro: 1.32 ± 0.043
1.014GlnGln: 1.014 ± 0.041
1.624GlnArg: 1.624 ± 0.042
1.312GlnSer: 1.312 ± 0.037
1.285GlnThr: 1.285 ± 0.042
2.778GlnVal: 2.778 ± 0.063
0.262GlnTrp: 0.262 ± 0.017
0.842GlnTyr: 0.842 ± 0.037
0.0GlnXaa: 0.0 ± 0.0
Arg
5.251ArgAla: 5.251 ± 0.075
0.786ArgCys: 0.786 ± 0.033
2.807ArgAsp: 2.807 ± 0.051
5.814ArgGlu: 5.814 ± 0.093
2.529ArgPhe: 2.529 ± 0.062
4.376ArgGly: 4.376 ± 0.069
1.25ArgHis: 1.25 ± 0.04
3.831ArgIle: 3.831 ± 0.073
3.538ArgLys: 3.538 ± 0.069
7.183ArgLeu: 7.183 ± 0.115
1.626ArgMet: 1.626 ± 0.049
2.036ArgAsn: 2.036 ± 0.05
3.135ArgPro: 3.135 ± 0.064
2.761ArgGln: 2.761 ± 0.082
4.823ArgArg: 4.823 ± 0.093
2.761ArgSer: 2.761 ± 0.057
2.702ArgThr: 2.702 ± 0.05
4.837ArgVal: 4.837 ± 0.079
0.721ArgTrp: 0.721 ± 0.028
2.089ArgTyr: 2.089 ± 0.056
0.0ArgXaa: 0.0 ± 0.0
Ser
4.168SerAla: 4.168 ± 0.082
0.728SerCys: 0.728 ± 0.029
2.046SerAsp: 2.046 ± 0.057
2.938SerGlu: 2.938 ± 0.056
2.378SerPhe: 2.378 ± 0.051
5.029SerGly: 5.029 ± 0.088
0.869SerHis: 0.869 ± 0.032
3.005SerIle: 3.005 ± 0.06
2.239SerLys: 2.239 ± 0.046
5.759SerLeu: 5.759 ± 0.099
1.234SerMet: 1.234 ± 0.034
1.433SerAsn: 1.433 ± 0.044
2.628SerPro: 2.628 ± 0.055
1.413SerGln: 1.413 ± 0.039
3.604SerArg: 3.604 ± 0.071
2.798SerSer: 2.798 ± 0.073
2.266SerThr: 2.266 ± 0.047
3.574SerVal: 3.574 ± 0.069
0.531SerTrp: 0.531 ± 0.03
1.564SerTyr: 1.564 ± 0.048
0.0SerXaa: 0.0 ± 0.0
Thr
4.755ThrAla: 4.755 ± 0.081
0.632ThrCys: 0.632 ± 0.025
1.986ThrAsp: 1.986 ± 0.046
2.5ThrGlu: 2.5 ± 0.059
1.701ThrPhe: 1.701 ± 0.048
5.659ThrGly: 5.659 ± 0.083
0.76ThrHis: 0.76 ± 0.03
2.539ThrIle: 2.539 ± 0.053
1.605ThrLys: 1.605 ± 0.044
4.253ThrLeu: 4.253 ± 0.078
0.987ThrMet: 0.987 ± 0.04
1.221ThrAsn: 1.221 ± 0.042
2.391ThrPro: 2.391 ± 0.057
0.919ThrGln: 0.919 ± 0.033
2.662ThrArg: 2.662 ± 0.054
2.14ThrSer: 2.14 ± 0.054
2.078ThrThr: 2.078 ± 0.061
4.564ThrVal: 4.564 ± 0.081
0.477ThrTrp: 0.477 ± 0.03
1.251ThrTyr: 1.251 ± 0.041
0.0ThrXaa: 0.0 ± 0.0
Val
7.068ValAla: 7.068 ± 0.09
0.955ValCys: 0.955 ± 0.038
3.842ValAsp: 3.842 ± 0.063
5.599ValGlu: 5.599 ± 0.096
3.176ValPhe: 3.176 ± 0.069
4.9ValGly: 4.9 ± 0.092
1.343ValHis: 1.343 ± 0.041
5.262ValIle: 5.262 ± 0.086
4.535ValLys: 4.535 ± 0.066
8.422ValLeu: 8.422 ± 0.107
1.891ValMet: 1.891 ± 0.05
2.836ValAsn: 2.836 ± 0.063
3.755ValPro: 3.755 ± 0.072
2.207ValGln: 2.207 ± 0.056
4.794ValArg: 4.794 ± 0.082
4.264ValSer: 4.264 ± 0.077
3.854ValThr: 3.854 ± 0.07
6.328ValVal: 6.328 ± 0.109
0.64ValTrp: 0.64 ± 0.031
2.351ValTyr: 2.351 ± 0.054
0.0ValXaa: 0.0 ± 0.0
Trp
0.79TrpAla: 0.79 ± 0.032
0.111TrpCys: 0.111 ± 0.011
0.556TrpAsp: 0.556 ± 0.025
0.736TrpGlu: 0.736 ± 0.031
0.335TrpPhe: 0.335 ± 0.019
0.783TrpGly: 0.783 ± 0.035
0.222TrpHis: 0.222 ± 0.016
0.454TrpIle: 0.454 ± 0.025
0.505TrpLys: 0.505 ± 0.03
1.235TrpLeu: 1.235 ± 0.045
0.21TrpMet: 0.21 ± 0.015
0.349TrpAsn: 0.349 ± 0.021
0.464TrpPro: 0.464 ± 0.025
0.485TrpGln: 0.485 ± 0.024
0.656TrpArg: 0.656 ± 0.031
0.488TrpSer: 0.488 ± 0.028
0.388TrpThr: 0.388 ± 0.022
0.616TrpVal: 0.616 ± 0.028
0.147TrpTrp: 0.147 ± 0.014
0.303TrpTyr: 0.303 ± 0.017
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.3TyrAla: 2.3 ± 0.053
0.485TyrCys: 0.485 ± 0.024
1.486TyrAsp: 1.486 ± 0.042
1.781TyrGlu: 1.781 ± 0.044
1.385TyrPhe: 1.385 ± 0.044
2.72TyrGly: 2.72 ± 0.061
0.653TyrHis: 0.653 ± 0.028
1.862TyrIle: 1.862 ± 0.05
1.412TyrLys: 1.412 ± 0.039
3.433TyrLeu: 3.433 ± 0.066
0.547TyrMet: 0.547 ± 0.025
1.145TyrAsn: 1.145 ± 0.04
1.51TyrPro: 1.51 ± 0.04
0.994TyrGln: 0.994 ± 0.032
2.567TyrArg: 2.567 ± 0.056
1.639TyrSer: 1.639 ± 0.046
1.467TyrThr: 1.467 ± 0.041
1.865TyrVal: 1.865 ± 0.046
0.327TyrTrp: 0.327 ± 0.022
1.16TyrTyr: 1.16 ± 0.044
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2884 proteins (856099 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level
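Bootstrapping at the protein level can be sketched as below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the resulting estimates is reported. The page does not say which spread statistic the ± values represent, so the use of the standard deviation is an assumption, as are the function names, the fixed seed, and the toy sequences.

```python
import random
import statistics


def pair_frequency_per_mille(sequences, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    hits = 0
    total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Assumption: each resample draws len(sequences) whole proteins with
    replacement, so residues within a protein always stay together.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_frequency_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome, not taken from the database
toy_proteome = ["MAAK", "AAL", "GGAA", "KLAA"]
print(pair_frequency_per_mille(toy_proteome, "AA"))
print(bootstrap_error(toy_proteome, "AA"))
```

Resampling whole proteins rather than individual residues keeps the neighbour relationships inside each sequence intact, which is what a dipeptide statistic depends on.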

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski