Amino acid dipepetide frequency for Syntrophobotulus glycolicus (strain DSM 8271 / FlGlyR)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.053AlaAla: 8.053 ± 0.131
0.962AlaCys: 0.962 ± 0.031
4.304AlaAsp: 4.304 ± 0.073
5.962AlaGlu: 5.962 ± 0.092
3.002AlaPhe: 3.002 ± 0.058
7.394AlaGly: 7.394 ± 0.099
1.212AlaHis: 1.212 ± 0.041
5.168AlaIle: 5.168 ± 0.074
4.527AlaLys: 4.527 ± 0.079
8.568AlaLeu: 8.568 ± 0.106
1.918AlaMet: 1.918 ± 0.049
2.625AlaAsn: 2.625 ± 0.06
2.314AlaPro: 2.314 ± 0.061
2.827AlaGln: 2.827 ± 0.065
3.792AlaArg: 3.792 ± 0.069
4.073AlaSer: 4.073 ± 0.072
3.061AlaThr: 3.061 ± 0.087
6.733AlaVal: 6.733 ± 0.09
0.69AlaTrp: 0.69 ± 0.024
2.528AlaTyr: 2.528 ± 0.057
0.0AlaXaa: 0.0 ± 0.0
Cys
0.874CysAla: 0.874 ± 0.03
0.211CysCys: 0.211 ± 0.017
0.521CysAsp: 0.521 ± 0.025
0.593CysGlu: 0.593 ± 0.024
0.508CysPhe: 0.508 ± 0.025
1.218CysGly: 1.218 ± 0.042
0.285CysHis: 0.285 ± 0.019
0.743CysIle: 0.743 ± 0.028
0.512CysLys: 0.512 ± 0.025
1.253CysLeu: 1.253 ± 0.042
0.3CysMet: 0.3 ± 0.017
0.422CysAsn: 0.422 ± 0.019
0.611CysPro: 0.611 ± 0.026
0.397CysGln: 0.397 ± 0.019
0.726CysArg: 0.726 ± 0.032
0.769CysSer: 0.769 ± 0.035
0.544CysThr: 0.544 ± 0.02
0.647CysVal: 0.647 ± 0.026
0.113CysTrp: 0.113 ± 0.01
0.399CysTyr: 0.399 ± 0.025
0.0CysXaa: 0.0 ± 0.0
Asp
3.254AspAla: 3.254 ± 0.071
0.692AspCys: 0.692 ± 0.027
2.288AspAsp: 2.288 ± 0.063
3.552AspGlu: 3.552 ± 0.072
2.426AspPhe: 2.426 ± 0.055
3.527AspGly: 3.527 ± 0.071
1.137AspHis: 1.137 ± 0.037
4.185AspIle: 4.185 ± 0.065
3.013AspLys: 3.013 ± 0.051
5.354AspLeu: 5.354 ± 0.098
1.231AspMet: 1.231 ± 0.041
1.845AspAsn: 1.845 ± 0.052
2.099AspPro: 2.099 ± 0.053
2.024AspGln: 2.024 ± 0.049
2.684AspArg: 2.684 ± 0.052
2.99AspSer: 2.99 ± 0.055
2.503AspThr: 2.503 ± 0.068
3.054AspVal: 3.054 ± 0.055
0.571AspTrp: 0.571 ± 0.023
2.026AspTyr: 2.026 ± 0.05
0.0AspXaa: 0.0 ± 0.0
Glu
5.411GluAla: 5.411 ± 0.081
0.577GluCys: 0.577 ± 0.025
3.416GluAsp: 3.416 ± 0.068
6.188GluGlu: 6.188 ± 0.103
2.4GluPhe: 2.4 ± 0.052
4.324GluGly: 4.324 ± 0.076
1.182GluHis: 1.182 ± 0.039
6.181GluIle: 6.181 ± 0.091
6.132GluLys: 6.132 ± 0.087
6.778GluLeu: 6.778 ± 0.096
2.104GluMet: 2.104 ± 0.048
3.66GluAsn: 3.66 ± 0.069
1.948GluPro: 1.948 ± 0.048
3.045GluGln: 3.045 ± 0.066
3.337GluArg: 3.337 ± 0.071
3.185GluSer: 3.185 ± 0.063
3.563GluThr: 3.563 ± 0.084
4.36GluVal: 4.36 ± 0.069
0.589GluTrp: 0.589 ± 0.026
2.451GluTyr: 2.451 ± 0.055
0.0GluXaa: 0.0 ± 0.0
Phe
3.246PheAla: 3.246 ± 0.056
0.598PheCys: 0.598 ± 0.027
2.177PheAsp: 2.177 ± 0.055
2.289PheGlu: 2.289 ± 0.047
2.07PhePhe: 2.07 ± 0.056
3.169PheGly: 3.169 ± 0.061
0.81PheHis: 0.81 ± 0.031
3.072PheIle: 3.072 ± 0.063
2.103PheLys: 2.103 ± 0.047
4.555PheLeu: 4.555 ± 0.085
0.983PheMet: 0.983 ± 0.034
1.603PheAsn: 1.603 ± 0.047
1.615PhePro: 1.615 ± 0.04
1.507PheGln: 1.507 ± 0.037
1.824PheArg: 1.824 ± 0.046
3.188PheSer: 3.188 ± 0.059
2.167PheThr: 2.167 ± 0.05
2.627PheVal: 2.627 ± 0.053
0.506PheTrp: 0.506 ± 0.022
1.439PheTyr: 1.439 ± 0.039
0.0PheXaa: 0.0 ± 0.0
Gly
5.223GlyAla: 5.223 ± 0.096
1.059GlyCys: 1.059 ± 0.039
3.361GlyAsp: 3.361 ± 0.058
4.934GlyGlu: 4.934 ± 0.088
3.249GlyPhe: 3.249 ± 0.056
5.615GlyGly: 5.615 ± 0.124
1.245GlyHis: 1.245 ± 0.039
6.49GlyIle: 6.49 ± 0.097
5.365GlyLys: 5.365 ± 0.074
7.506GlyLeu: 7.506 ± 0.089
2.178GlyMet: 2.178 ± 0.05
2.899GlyAsn: 2.899 ± 0.071
1.939GlyPro: 1.939 ± 0.094
2.691GlyGln: 2.691 ± 0.06
3.664GlyArg: 3.664 ± 0.073
4.508GlySer: 4.508 ± 0.111
4.291GlyThr: 4.291 ± 0.093
4.769GlyVal: 4.769 ± 0.082
0.838GlyTrp: 0.838 ± 0.033
2.906GlyTyr: 2.906 ± 0.058
0.0GlyXaa: 0.0 ± 0.0
His
1.113HisAla: 1.113 ± 0.04
0.292HisCys: 0.292 ± 0.019
0.838HisAsp: 0.838 ± 0.038
1.009HisGlu: 1.009 ± 0.031
0.867HisPhe: 0.867 ± 0.033
1.264HisGly: 1.264 ± 0.035
0.528HisHis: 0.528 ± 0.025
1.317HisIle: 1.317 ± 0.034
0.938HisLys: 0.938 ± 0.035
1.787HisLeu: 1.787 ± 0.048
0.417HisMet: 0.417 ± 0.019
0.698HisAsn: 0.698 ± 0.023
0.944HisPro: 0.944 ± 0.033
0.726HisGln: 0.726 ± 0.028
0.867HisArg: 0.867 ± 0.032
1.109HisSer: 1.109 ± 0.033
0.874HisThr: 0.874 ± 0.034
0.956HisVal: 0.956 ± 0.034
0.202HisTrp: 0.202 ± 0.014
0.701HisTyr: 0.701 ± 0.029
0.0HisXaa: 0.0 ± 0.0
Ile
6.358IleAla: 6.358 ± 0.094
1.002IleCys: 1.002 ± 0.037
3.873IleAsp: 3.873 ± 0.071
5.051IleGlu: 5.051 ± 0.082
3.185IlePhe: 3.185 ± 0.066
5.52IleGly: 5.52 ± 0.088
1.254IleHis: 1.254 ± 0.033
5.655IleIle: 5.655 ± 0.095
4.628IleLys: 4.628 ± 0.074
7.813IleLeu: 7.813 ± 0.117
1.847IleMet: 1.847 ± 0.05
3.18IleAsn: 3.18 ± 0.062
3.384IlePro: 3.384 ± 0.063
2.487IleGln: 2.487 ± 0.056
3.841IleArg: 3.841 ± 0.067
5.299IleSer: 5.299 ± 0.081
3.968IleThr: 3.968 ± 0.073
5.055IleVal: 5.055 ± 0.088
0.653IleTrp: 0.653 ± 0.027
2.366IleTyr: 2.366 ± 0.053
0.0IleXaa: 0.0 ± 0.0
Lys
5.23LysAla: 5.23 ± 0.068
0.502LysCys: 0.502 ± 0.025
3.321LysAsp: 3.321 ± 0.063
5.363LysGlu: 5.363 ± 0.078
1.842LysPhe: 1.842 ± 0.048
4.17LysGly: 4.17 ± 0.072
0.96LysHis: 0.96 ± 0.034
5.312LysIle: 5.312 ± 0.076
4.874LysLys: 4.874 ± 0.086
5.551LysLeu: 5.551 ± 0.079
1.865LysMet: 1.865 ± 0.047
3.234LysAsn: 3.234 ± 0.064
2.099LysPro: 2.099 ± 0.048
2.248LysGln: 2.248 ± 0.056
2.9LysArg: 2.9 ± 0.06
3.262LysSer: 3.262 ± 0.056
3.832LysThr: 3.832 ± 0.068
4.256LysVal: 4.256 ± 0.068
0.579LysTrp: 0.579 ± 0.026
2.249LysTyr: 2.249 ± 0.049
0.0LysXaa: 0.0 ± 0.0
Leu
8.957LeuAla: 8.957 ± 0.111
1.209LeuCys: 1.209 ± 0.04
5.237LeuAsp: 5.237 ± 0.079
6.867LeuGlu: 6.867 ± 0.094
4.502LeuPhe: 4.502 ± 0.097
7.016LeuGly: 7.016 ± 0.101
1.643LeuHis: 1.643 ± 0.041
7.176LeuIle: 7.176 ± 0.112
6.658LeuLys: 6.658 ± 0.096
10.533LeuLeu: 10.533 ± 0.149
2.387LeuMet: 2.387 ± 0.056
4.404LeuAsn: 4.404 ± 0.071
4.231LeuPro: 4.231 ± 0.065
3.465LeuGln: 3.465 ± 0.06
4.837LeuArg: 4.837 ± 0.077
7.238LeuSer: 7.238 ± 0.098
5.698LeuThr: 5.698 ± 0.101
6.132LeuVal: 6.132 ± 0.08
0.884LeuTrp: 0.884 ± 0.031
3.004LeuTyr: 3.004 ± 0.057
0.0LeuXaa: 0.0 ± 0.0
Met
2.216MetAla: 2.216 ± 0.052
0.188MetCys: 0.188 ± 0.015
1.395MetAsp: 1.395 ± 0.044
1.813MetGlu: 1.813 ± 0.047
0.813MetPhe: 0.813 ± 0.033
1.85MetGly: 1.85 ± 0.048
0.335MetHis: 0.335 ± 0.017
2.133MetIle: 2.133 ± 0.053
1.932MetLys: 1.932 ± 0.042
2.474MetLeu: 2.474 ± 0.06
0.779MetMet: 0.779 ± 0.031
1.263MetAsn: 1.263 ± 0.036
1.011MetPro: 1.011 ± 0.034
0.783MetGln: 0.783 ± 0.029
1.254MetArg: 1.254 ± 0.039
1.518MetSer: 1.518 ± 0.045
1.504MetThr: 1.504 ± 0.039
1.659MetVal: 1.659 ± 0.043
0.148MetTrp: 0.148 ± 0.012
0.585MetTyr: 0.585 ± 0.026
0.0MetXaa: 0.0 ± 0.0
Asn
2.816AsnAla: 2.816 ± 0.057
0.471AsnCys: 0.471 ± 0.022
1.915AsnAsp: 1.915 ± 0.049
2.496AsnGlu: 2.496 ± 0.058
1.517AsnPhe: 1.517 ± 0.042
2.995AsnGly: 2.995 ± 0.065
0.907AsnHis: 0.907 ± 0.035
3.53AsnIle: 3.53 ± 0.062
2.61AsnLys: 2.61 ± 0.061
4.156AsnLeu: 4.156 ± 0.066
1.082AsnMet: 1.082 ± 0.035
1.749AsnAsn: 1.749 ± 0.047
2.114AsnPro: 2.114 ± 0.051
1.631AsnGln: 1.631 ± 0.041
2.151AsnArg: 2.151 ± 0.054
2.426AsnSer: 2.426 ± 0.055
2.28AsnThr: 2.28 ± 0.057
2.381AsnVal: 2.381 ± 0.056
0.432AsnTrp: 0.432 ± 0.022
1.486AsnTyr: 1.486 ± 0.044
0.0AsnXaa: 0.0 ± 0.0
Pro
3.067ProAla: 3.067 ± 0.063
0.417ProCys: 0.417 ± 0.021
2.415ProAsp: 2.415 ± 0.051
3.518ProGlu: 3.518 ± 0.062
1.553ProPhe: 1.553 ± 0.041
2.982ProGly: 2.982 ± 0.056
0.647ProHis: 0.647 ± 0.027
2.187ProIle: 2.187 ± 0.051
1.796ProLys: 1.796 ± 0.045
3.672ProLeu: 3.672 ± 0.066
0.739ProMet: 0.739 ± 0.025
1.273ProAsn: 1.273 ± 0.037
1.157ProPro: 1.157 ± 0.044
1.514ProGln: 1.514 ± 0.047
1.489ProArg: 1.489 ± 0.045
2.094ProSer: 2.094 ± 0.058
1.686ProThr: 1.686 ± 0.087
3.346ProVal: 3.346 ± 0.062
0.39ProTrp: 0.39 ± 0.018
1.347ProTyr: 1.347 ± 0.043
0.0ProXaa: 0.0 ± 0.0
Gln
3.237GlnAla: 3.237 ± 0.061
0.305GlnCys: 0.305 ± 0.019
1.791GlnAsp: 1.791 ± 0.043
3.012GlnGlu: 3.012 ± 0.058
1.274GlnPhe: 1.274 ± 0.037
2.531GlnGly: 2.531 ± 0.056
0.495GlnHis: 0.495 ± 0.023
2.864GlnIle: 2.864 ± 0.056
2.719GlnLys: 2.719 ± 0.057
3.104GlnLeu: 3.104 ± 0.061
1.033GlnMet: 1.033 ± 0.031
1.835GlnAsn: 1.835 ± 0.048
1.123GlnPro: 1.123 ± 0.033
1.286GlnGln: 1.286 ± 0.047
1.681GlnArg: 1.681 ± 0.043
2.126GlnSer: 2.126 ± 0.054
2.03GlnThr: 2.03 ± 0.05
2.274GlnVal: 2.274 ± 0.053
0.351GlnTrp: 0.351 ± 0.02
1.273GlnTyr: 1.273 ± 0.037
0.0GlnXaa: 0.0 ± 0.0
Arg
3.143ArgAla: 3.143 ± 0.061
0.523ArgCys: 0.523 ± 0.022
2.387ArgAsp: 2.387 ± 0.061
4.209ArgGlu: 4.209 ± 0.08
2.099ArgPhe: 2.099 ± 0.044
2.907ArgGly: 2.907 ± 0.052
0.891ArgHis: 0.891 ± 0.028
4.048ArgIle: 4.048 ± 0.063
3.415ArgLys: 3.415 ± 0.062
4.972ArgLeu: 4.972 ± 0.09
1.392ArgMet: 1.392 ± 0.04
2.02ArgAsn: 2.02 ± 0.05
1.702ArgPro: 1.702 ± 0.048
2.101ArgGln: 2.101 ± 0.05
2.527ArgArg: 2.527 ± 0.062
2.462ArgSer: 2.462 ± 0.059
2.318ArgThr: 2.318 ± 0.05
2.993ArgVal: 2.993 ± 0.058
0.466ArgTrp: 0.466 ± 0.022
1.755ArgTyr: 1.755 ± 0.049
0.0ArgXaa: 0.0 ± 0.0
Ser
4.697SerAla: 4.697 ± 0.085
0.664SerCys: 0.664 ± 0.03
2.89SerAsp: 2.89 ± 0.07
3.695SerGlu: 3.695 ± 0.07
2.861SerPhe: 2.861 ± 0.058
5.661SerGly: 5.661 ± 0.121
0.965SerHis: 0.965 ± 0.029
4.343SerIle: 4.343 ± 0.076
3.231SerLys: 3.231 ± 0.065
6.542SerLeu: 6.542 ± 0.093
1.573SerMet: 1.573 ± 0.045
2.07SerAsn: 2.07 ± 0.049
2.347SerPro: 2.347 ± 0.049
2.012SerGln: 2.012 ± 0.053
2.968SerArg: 2.968 ± 0.051
3.922SerSer: 3.922 ± 0.091
2.908SerThr: 2.908 ± 0.069
4.146SerVal: 4.146 ± 0.085
0.6SerTrp: 0.6 ± 0.027
2.141SerTyr: 2.141 ± 0.05
0.0SerXaa: 0.0 ± 0.0
Thr
4.903ThrAla: 4.903 ± 0.12
0.498ThrCys: 0.498 ± 0.02
2.669ThrAsp: 2.669 ± 0.055
3.412ThrGlu: 3.412 ± 0.063
2.021ThrPhe: 2.021 ± 0.053
5.331ThrGly: 5.331 ± 0.2
0.878ThrHis: 0.878 ± 0.032
3.791ThrIle: 3.791 ± 0.073
2.726ThrLys: 2.726 ± 0.059
5.064ThrLeu: 5.064 ± 0.086
1.162ThrMet: 1.162 ± 0.032
1.84ThrAsn: 1.84 ± 0.052
2.189ThrPro: 2.189 ± 0.054
1.361ThrGln: 1.361 ± 0.04
2.056ThrArg: 2.056 ± 0.047
2.766ThrSer: 2.766 ± 0.068
2.671ThrThr: 2.671 ± 0.073
4.585ThrVal: 4.585 ± 0.114
0.419ThrTrp: 0.419 ± 0.023
1.63ThrTyr: 1.63 ± 0.049
0.0ThrXaa: 0.0 ± 0.0
Val
4.775ValAla: 4.775 ± 0.085
0.897ValCys: 0.897 ± 0.036
3.297ValAsp: 3.297 ± 0.063
4.159ValGlu: 4.159 ± 0.073
3.311ValPhe: 3.311 ± 0.065
4.186ValGly: 4.186 ± 0.085
1.047ValHis: 1.047 ± 0.037
5.321ValIle: 5.321 ± 0.081
4.128ValLys: 4.128 ± 0.071
7.464ValLeu: 7.464 ± 0.102
1.741ValMet: 1.741 ± 0.042
2.861ValAsn: 2.861 ± 0.06
2.643ValPro: 2.643 ± 0.051
2.271ValGln: 2.271 ± 0.043
3.374ValArg: 3.374 ± 0.073
4.531ValSer: 4.531 ± 0.082
3.691ValThr: 3.691 ± 0.106
4.479ValVal: 4.479 ± 0.079
0.603ValTrp: 0.603 ± 0.026
2.108ValTyr: 2.108 ± 0.037
0.0ValXaa: 0.0 ± 0.0
Trp
0.661TrpAla: 0.661 ± 0.032
0.094TrpCys: 0.094 ± 0.01
0.521TrpAsp: 0.521 ± 0.022
0.659TrpGlu: 0.659 ± 0.028
0.368TrpPhe: 0.368 ± 0.021
0.635TrpGly: 0.635 ± 0.027
0.19TrpHis: 0.19 ± 0.014
0.67TrpIle: 0.67 ± 0.026
0.686TrpLys: 0.686 ± 0.031
1.093TrpLeu: 1.093 ± 0.031
0.235TrpMet: 0.235 ± 0.015
0.438TrpAsn: 0.438 ± 0.022
0.29TrpPro: 0.29 ± 0.018
0.48TrpGln: 0.48 ± 0.025
0.48TrpArg: 0.48 ± 0.023
0.556TrpSer: 0.556 ± 0.03
0.489TrpThr: 0.489 ± 0.023
0.547TrpVal: 0.547 ± 0.024
0.132TrpTrp: 0.132 ± 0.013
0.296TrpTyr: 0.296 ± 0.017
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.478TyrAla: 2.478 ± 0.057
0.485TyrCys: 0.485 ± 0.023
1.839TyrAsp: 1.839 ± 0.047
2.051TyrGlu: 2.051 ± 0.049
1.681TyrPhe: 1.681 ± 0.048
2.434TyrGly: 2.434 ± 0.049
0.835TyrHis: 0.835 ± 0.029
2.233TyrIle: 2.233 ± 0.052
1.654TyrLys: 1.654 ± 0.043
3.843TyrLeu: 3.843 ± 0.072
0.685TyrMet: 0.685 ± 0.028
1.225TyrAsn: 1.225 ± 0.039
1.529TyrPro: 1.529 ± 0.044
1.488TyrGln: 1.488 ± 0.038
1.879TyrArg: 1.879 ± 0.047
2.212TyrSer: 2.212 ± 0.06
1.894TyrThr: 1.894 ± 0.055
1.89TyrVal: 1.89 ± 0.046
0.353TyrTrp: 0.353 ± 0.019
1.419TyrTyr: 1.419 ± 0.043
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3105 proteins (954276 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski