Amino acid dipepetide frequency for Vulcaniibacterium tengchongense

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
24.203AlaAla: 24.203 ± 0.272
1.316AlaCys: 1.316 ± 0.039
7.566AlaAsp: 7.566 ± 0.09
8.565AlaGlu: 8.565 ± 0.131
4.337AlaPhe: 4.337 ± 0.074
13.28AlaGly: 13.28 ± 0.163
2.89AlaHis: 2.89 ± 0.054
5.268AlaIle: 5.268 ± 0.091
3.201AlaLys: 3.201 ± 0.081
18.039AlaLeu: 18.039 ± 0.197
3.064AlaMet: 3.064 ± 0.053
2.537AlaAsn: 2.537 ± 0.059
8.507AlaPro: 8.507 ± 0.157
5.664AlaGln: 5.664 ± 0.085
12.956AlaArg: 12.956 ± 0.17
5.89AlaSer: 5.89 ± 0.105
5.804AlaThr: 5.804 ± 0.109
9.497AlaVal: 9.497 ± 0.132
2.372AlaTrp: 2.372 ± 0.059
2.744AlaTyr: 2.744 ± 0.055
0.0AlaXaa: 0.0 ± 0.0
Cys
1.182CysAla: 1.182 ± 0.041
0.085CysCys: 0.085 ± 0.008
0.451CysAsp: 0.451 ± 0.019
0.471CysGlu: 0.471 ± 0.02
0.258CysPhe: 0.258 ± 0.015
0.885CysGly: 0.885 ± 0.027
0.207CysHis: 0.207 ± 0.013
0.265CysIle: 0.265 ± 0.017
0.153CysLys: 0.153 ± 0.014
0.742CysLeu: 0.742 ± 0.028
0.115CysMet: 0.115 ± 0.011
0.178CysAsn: 0.178 ± 0.013
0.441CysPro: 0.441 ± 0.024
0.188CysGln: 0.188 ± 0.014
0.675CysArg: 0.675 ± 0.027
0.36CysSer: 0.36 ± 0.018
0.375CysThr: 0.375 ± 0.02
0.629CysVal: 0.629 ± 0.023
0.117CysTrp: 0.117 ± 0.012
0.191CysTyr: 0.191 ± 0.012
0.0CysXaa: 0.0 ± 0.0
Asp
9.31AspAla: 9.31 ± 0.115
0.394AspCys: 0.394 ± 0.022
3.091AspAsp: 3.091 ± 0.062
3.515AspGlu: 3.515 ± 0.061
1.939AspPhe: 1.939 ± 0.043
6.043AspGly: 6.043 ± 0.088
1.006AspHis: 1.006 ± 0.029
1.797AspIle: 1.797 ± 0.049
1.247AspLys: 1.247 ± 0.039
5.565AspLeu: 5.565 ± 0.08
0.889AspMet: 0.889 ± 0.03
1.08AspAsn: 1.08 ± 0.042
4.004AspPro: 4.004 ± 0.064
1.304AspGln: 1.304 ± 0.037
4.396AspArg: 4.396 ± 0.077
1.986AspSer: 1.986 ± 0.045
2.213AspThr: 2.213 ± 0.054
3.974AspVal: 3.974 ± 0.064
1.137AspTrp: 1.137 ± 0.035
1.721AspTyr: 1.721 ± 0.043
0.0AspXaa: 0.0 ± 0.0
Glu
8.509GluAla: 8.509 ± 0.133
0.344GluCys: 0.344 ± 0.019
2.697GluAsp: 2.697 ± 0.061
2.607GluGlu: 2.607 ± 0.064
1.898GluPhe: 1.898 ± 0.049
4.264GluGly: 4.264 ± 0.067
1.422GluHis: 1.422 ± 0.039
2.452GluIle: 2.452 ± 0.055
1.315GluLys: 1.315 ± 0.038
6.546GluLeu: 6.546 ± 0.092
0.96GluMet: 0.96 ± 0.032
1.125GluAsn: 1.125 ± 0.035
3.167GluPro: 3.167 ± 0.058
2.348GluGln: 2.348 ± 0.059
6.586GluArg: 6.586 ± 0.103
2.249GluSer: 2.249 ± 0.045
2.55GluThr: 2.55 ± 0.049
4.044GluVal: 4.044 ± 0.063
0.793GluTrp: 0.793 ± 0.029
1.249GluTyr: 1.249 ± 0.042
0.0GluXaa: 0.0 ± 0.0
Phe
4.84PheAla: 4.84 ± 0.077
0.297PheCys: 0.297 ± 0.019
2.734PheAsp: 2.734 ± 0.063
2.112PheGlu: 2.112 ± 0.042
1.085PhePhe: 1.085 ± 0.036
3.444PheGly: 3.444 ± 0.062
0.706PheHis: 0.706 ± 0.027
0.858PheIle: 0.858 ± 0.032
0.724PheLys: 0.724 ± 0.029
2.95PheLeu: 2.95 ± 0.062
0.503PheMet: 0.503 ± 0.019
0.859PheAsn: 0.859 ± 0.03
1.422PhePro: 1.422 ± 0.039
0.778PheGln: 0.778 ± 0.028
2.352PheArg: 2.352 ± 0.047
1.541PheSer: 1.541 ± 0.038
1.313PheThr: 1.313 ± 0.037
2.759PheVal: 2.759 ± 0.058
0.465PheTrp: 0.465 ± 0.023
0.779PheTyr: 0.779 ± 0.026
0.0PheXaa: 0.0 ± 0.0
Gly
11.449GlyAla: 11.449 ± 0.163
0.797GlyCys: 0.797 ± 0.028
5.092GlyAsp: 5.092 ± 0.073
5.833GlyGlu: 5.833 ± 0.075
3.462GlyPhe: 3.462 ± 0.067
8.193GlyGly: 8.193 ± 0.119
1.992GlyHis: 1.992 ± 0.042
3.488GlyIle: 3.488 ± 0.062
2.648GlyLys: 2.648 ± 0.063
9.288GlyLeu: 9.288 ± 0.11
1.95GlyMet: 1.95 ± 0.047
2.105GlyAsn: 2.105 ± 0.069
3.617GlyPro: 3.617 ± 0.067
2.897GlyGln: 2.897 ± 0.058
7.796GlyArg: 7.796 ± 0.1
4.323GlySer: 4.323 ± 0.093
4.146GlyThr: 4.146 ± 0.087
6.531GlyVal: 6.531 ± 0.087
1.748GlyTrp: 1.748 ± 0.04
2.567GlyTyr: 2.567 ± 0.054
0.0GlyXaa: 0.0 ± 0.0
His
3.165HisAla: 3.165 ± 0.063
0.233HisCys: 0.233 ± 0.015
1.208HisAsp: 1.208 ± 0.038
1.128HisGlu: 1.128 ± 0.033
0.747HisPhe: 0.747 ± 0.029
2.398HisGly: 2.398 ± 0.05
0.525HisHis: 0.525 ± 0.026
0.591HisIle: 0.591 ± 0.026
0.431HisLys: 0.431 ± 0.02
2.07HisLeu: 2.07 ± 0.042
0.311HisMet: 0.311 ± 0.017
0.409HisAsn: 0.409 ± 0.02
1.567HisPro: 1.567 ± 0.041
0.479HisGln: 0.479 ± 0.023
1.844HisArg: 1.844 ± 0.037
0.8HisSer: 0.8 ± 0.03
0.792HisThr: 0.792 ± 0.026
1.487HisVal: 1.487 ± 0.034
0.415HisTrp: 0.415 ± 0.019
0.636HisTyr: 0.636 ± 0.027
0.0HisXaa: 0.0 ± 0.0
Ile
6.115IleAla: 6.115 ± 0.081
0.276IleCys: 0.276 ± 0.02
2.858IleAsp: 2.858 ± 0.057
2.955IleGlu: 2.955 ± 0.052
0.843IlePhe: 0.843 ± 0.031
3.997IleGly: 3.997 ± 0.087
0.664IleHis: 0.664 ± 0.024
0.772IleIle: 0.772 ± 0.038
0.91IleLys: 0.91 ± 0.033
2.634IleLeu: 2.634 ± 0.053
0.388IleMet: 0.388 ± 0.022
0.952IleAsn: 0.952 ± 0.039
1.82IlePro: 1.82 ± 0.044
0.898IleGln: 0.898 ± 0.033
2.624IleArg: 2.624 ± 0.049
1.46IleSer: 1.46 ± 0.043
1.572IleThr: 1.572 ± 0.048
3.112IleVal: 3.112 ± 0.063
0.363IleTrp: 0.363 ± 0.02
0.703IleTyr: 0.703 ± 0.031
0.0IleXaa: 0.0 ± 0.0
Lys
3.013LysAla: 3.013 ± 0.076
0.131LysCys: 0.131 ± 0.011
1.185LysAsp: 1.185 ± 0.037
1.019LysGlu: 1.019 ± 0.037
0.659LysPhe: 0.659 ± 0.023
1.711LysGly: 1.711 ± 0.048
0.488LysHis: 0.488 ± 0.021
1.025LysIle: 1.025 ± 0.035
0.886LysLys: 0.886 ± 0.053
2.627LysLeu: 2.627 ± 0.061
0.479LysMet: 0.479 ± 0.021
0.565LysAsn: 0.565 ± 0.021
1.787LysPro: 1.787 ± 0.047
0.937LysGln: 0.937 ± 0.032
2.133LysArg: 2.133 ± 0.05
1.109LysSer: 1.109 ± 0.033
1.271LysThr: 1.271 ± 0.042
1.778LysVal: 1.778 ± 0.052
0.294LysTrp: 0.294 ± 0.017
0.564LysTyr: 0.564 ± 0.028
0.0LysXaa: 0.0 ± 0.0
Leu
17.416LeuAla: 17.416 ± 0.183
0.897LeuCys: 0.897 ± 0.03
6.969LeuAsp: 6.969 ± 0.091
5.559LeuGlu: 5.559 ± 0.094
3.419LeuPhe: 3.419 ± 0.067
9.339LeuGly: 9.339 ± 0.104
2.41LeuHis: 2.41 ± 0.052
3.486LeuIle: 3.486 ± 0.077
2.534LeuLys: 2.534 ± 0.055
11.869LeuLeu: 11.869 ± 0.159
1.818LeuMet: 1.818 ± 0.05
2.139LeuAsn: 2.139 ± 0.046
6.696LeuPro: 6.696 ± 0.089
3.797LeuGln: 3.797 ± 0.066
10.579LeuArg: 10.579 ± 0.129
5.111LeuSer: 5.111 ± 0.089
4.241LeuThr: 4.241 ± 0.07
7.299LeuVal: 7.299 ± 0.098
1.454LeuTrp: 1.454 ± 0.046
2.273LeuTyr: 2.273 ± 0.049
0.0LeuXaa: 0.0 ± 0.0
Met
2.259MetAla: 2.259 ± 0.051
0.117MetCys: 0.117 ± 0.01
0.891MetAsp: 0.891 ± 0.031
0.736MetGlu: 0.736 ± 0.025
0.53MetPhe: 0.53 ± 0.026
1.254MetGly: 1.254 ± 0.036
0.386MetHis: 0.386 ± 0.019
0.713MetIle: 0.713 ± 0.029
0.659MetLys: 0.659 ± 0.028
2.043MetLeu: 2.043 ± 0.049
0.327MetMet: 0.327 ± 0.019
0.606MetAsn: 0.606 ± 0.022
1.299MetPro: 1.299 ± 0.037
0.728MetGln: 0.728 ± 0.025
1.736MetArg: 1.736 ± 0.046
1.294MetSer: 1.294 ± 0.036
1.081MetThr: 1.081 ± 0.034
1.094MetVal: 1.094 ± 0.035
0.177MetTrp: 0.177 ± 0.015
0.302MetTyr: 0.302 ± 0.017
0.0MetXaa: 0.0 ± 0.0
Asn
3.034AsnAla: 3.034 ± 0.069
0.175AsnCys: 0.175 ± 0.014
1.148AsnAsp: 1.148 ± 0.037
1.076AsnGlu: 1.076 ± 0.029
0.768AsnPhe: 0.768 ± 0.033
2.137AsnGly: 2.137 ± 0.058
0.405AsnHis: 0.405 ± 0.02
0.816AsnIle: 0.816 ± 0.033
0.501AsnLys: 0.501 ± 0.024
2.28AsnLeu: 2.28 ± 0.047
0.328AsnMet: 0.328 ± 0.024
0.529AsnAsn: 0.529 ± 0.03
1.645AsnPro: 1.645 ± 0.038
0.617AsnGln: 0.617 ± 0.029
1.64AsnArg: 1.64 ± 0.038
0.913AsnSer: 0.913 ± 0.044
0.99AsnThr: 0.99 ± 0.036
1.623AsnVal: 1.623 ± 0.047
0.346AsnTrp: 0.346 ± 0.019
0.562AsnTyr: 0.562 ± 0.026
0.0AsnXaa: 0.0 ± 0.0
Pro
9.205ProAla: 9.205 ± 0.144
0.358ProCys: 0.358 ± 0.019
3.531ProAsp: 3.531 ± 0.07
3.643ProGlu: 3.643 ± 0.065
1.708ProPhe: 1.708 ± 0.042
5.663ProGly: 5.663 ± 0.091
1.234ProHis: 1.234 ± 0.038
1.796ProIle: 1.796 ± 0.047
1.336ProLys: 1.336 ± 0.038
5.873ProLeu: 5.873 ± 0.101
1.128ProMet: 1.128 ± 0.036
1.193ProAsn: 1.193 ± 0.037
3.858ProPro: 3.858 ± 0.1
2.197ProGln: 2.197 ± 0.053
4.668ProArg: 4.668 ± 0.082
2.504ProSer: 2.504 ± 0.055
2.243ProThr: 2.243 ± 0.045
4.164ProVal: 4.164 ± 0.06
0.904ProTrp: 0.904 ± 0.034
1.28ProTyr: 1.28 ± 0.034
0.0ProXaa: 0.0 ± 0.0
Gln
5.638GlnAla: 5.638 ± 0.096
0.207GlnCys: 0.207 ± 0.013
1.412GlnAsp: 1.412 ± 0.041
1.265GlnGlu: 1.265 ± 0.042
0.952GlnPhe: 0.952 ± 0.03
2.584GlnGly: 2.584 ± 0.052
0.658GlnHis: 0.658 ± 0.024
1.316GlnIle: 1.316 ± 0.042
0.764GlnLys: 0.764 ± 0.026
3.597GlnLeu: 3.597 ± 0.063
0.624GlnMet: 0.624 ± 0.021
0.599GlnAsn: 0.599 ± 0.026
2.12GlnPro: 2.12 ± 0.046
1.442GlnGln: 1.442 ± 0.042
3.693GlnArg: 3.693 ± 0.067
1.309GlnSer: 1.309 ± 0.038
1.275GlnThr: 1.275 ± 0.036
2.655GlnVal: 2.655 ± 0.057
0.546GlnTrp: 0.546 ± 0.025
0.694GlnTyr: 0.694 ± 0.029
0.0GlnXaa: 0.0 ± 0.0
Arg
11.729ArgAla: 11.729 ± 0.142
0.694ArgCys: 0.694 ± 0.029
5.066ArgAsp: 5.066 ± 0.074
5.875ArgGlu: 5.875 ± 0.095
3.268ArgPhe: 3.268 ± 0.063
6.878ArgGly: 6.878 ± 0.085
2.158ArgHis: 2.158 ± 0.05
4.002ArgIle: 4.002 ± 0.067
2.005ArgLys: 2.005 ± 0.048
10.339ArgLeu: 10.339 ± 0.133
1.992ArgMet: 1.992 ± 0.041
1.886ArgAsn: 1.886 ± 0.039
4.616ArgPro: 4.616 ± 0.074
2.989ArgGln: 2.989 ± 0.068
8.409ArgArg: 8.409 ± 0.115
3.521ArgSer: 3.521 ± 0.059
3.504ArgThr: 3.504 ± 0.059
6.0ArgVal: 6.0 ± 0.084
1.628ArgTrp: 1.628 ± 0.046
2.671ArgTyr: 2.671 ± 0.062
0.0ArgXaa: 0.0 ± 0.0
Ser
5.849SerAla: 5.849 ± 0.091
0.316SerCys: 0.316 ± 0.017
2.202SerAsp: 2.202 ± 0.056
2.273SerGlu: 2.273 ± 0.052
1.591SerPhe: 1.591 ± 0.046
4.629SerGly: 4.629 ± 0.092
0.89SerHis: 0.89 ± 0.029
1.665SerIle: 1.665 ± 0.049
1.045SerLys: 1.045 ± 0.038
4.573SerLeu: 4.573 ± 0.072
0.754SerMet: 0.754 ± 0.026
1.098SerAsn: 1.098 ± 0.037
2.638SerPro: 2.638 ± 0.059
1.321SerGln: 1.321 ± 0.033
3.502SerArg: 3.502 ± 0.055
2.031SerSer: 2.031 ± 0.051
2.059SerThr: 2.059 ± 0.054
3.156SerVal: 3.156 ± 0.076
0.666SerTrp: 0.666 ± 0.03
1.113SerTyr: 1.113 ± 0.034
0.0SerXaa: 0.0 ± 0.0
Thr
5.458ThrAla: 5.458 ± 0.106
0.339ThrCys: 0.339 ± 0.02
2.108ThrAsp: 2.108 ± 0.055
2.018ThrGlu: 2.018 ± 0.047
1.323ThrPhe: 1.323 ± 0.045
4.241ThrGly: 4.241 ± 0.075
0.856ThrHis: 0.856 ± 0.033
1.697ThrIle: 1.697 ± 0.048
0.742ThrLys: 0.742 ± 0.03
5.408ThrLeu: 5.408 ± 0.093
0.666ThrMet: 0.666 ± 0.024
0.839ThrAsn: 0.839 ± 0.043
3.17ThrPro: 3.17 ± 0.062
1.212ThrGln: 1.212 ± 0.037
3.34ThrArg: 3.34 ± 0.053
1.674ThrSer: 1.674 ± 0.042
2.038ThrThr: 2.038 ± 0.055
3.832ThrVal: 3.832 ± 0.078
0.596ThrTrp: 0.596 ± 0.023
0.986ThrTyr: 0.986 ± 0.041
0.0ThrXaa: 0.0 ± 0.0
Val
10.469ValAla: 10.469 ± 0.127
0.639ValCys: 0.639 ± 0.024
4.196ValAsp: 4.196 ± 0.067
4.7ValGlu: 4.7 ± 0.078
2.339ValPhe: 2.339 ± 0.051
5.759ValGly: 5.759 ± 0.096
1.474ValHis: 1.474 ± 0.04
2.677ValIle: 2.677 ± 0.057
1.66ValLys: 1.66 ± 0.053
7.939ValLeu: 7.939 ± 0.102
1.227ValMet: 1.227 ± 0.037
1.782ValAsn: 1.782 ± 0.05
4.078ValPro: 4.078 ± 0.059
2.149ValGln: 2.149 ± 0.054
6.077ValArg: 6.077 ± 0.083
3.405ValSer: 3.405 ± 0.062
3.265ValThr: 3.265 ± 0.074
5.707ValVal: 5.707 ± 0.099
0.882ValTrp: 0.882 ± 0.03
1.585ValTyr: 1.585 ± 0.039
0.0ValXaa: 0.0 ± 0.0
Trp
1.571TrpAla: 1.571 ± 0.045
0.154TrpCys: 0.154 ± 0.011
0.746TrpAsp: 0.746 ± 0.03
0.571TrpGlu: 0.571 ± 0.023
0.558TrpPhe: 0.558 ± 0.024
0.959TrpGly: 0.959 ± 0.034
0.358TrpHis: 0.358 ± 0.02
0.628TrpIle: 0.628 ± 0.024
0.388TrpLys: 0.388 ± 0.022
2.343TrpLeu: 2.343 ± 0.057
0.377TrpMet: 0.377 ± 0.02
0.465TrpAsn: 0.465 ± 0.024
0.846TrpPro: 0.846 ± 0.036
0.696TrpGln: 0.696 ± 0.025
1.803TrpArg: 1.803 ± 0.052
0.843TrpSer: 0.843 ± 0.03
0.724TrpThr: 0.724 ± 0.027
0.894TrpVal: 0.894 ± 0.03
0.326TrpTrp: 0.326 ± 0.019
0.385TrpTyr: 0.385 ± 0.021
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.288TyrAla: 3.288 ± 0.065
0.198TyrCys: 0.198 ± 0.013
1.46TyrAsp: 1.46 ± 0.045
1.203TyrGlu: 1.203 ± 0.036
0.838TyrPhe: 0.838 ± 0.03
2.265TyrGly: 2.265 ± 0.061
0.506TyrHis: 0.506 ± 0.023
0.604TyrIle: 0.604 ± 0.025
0.516TyrLys: 0.516 ± 0.027
2.579TyrLeu: 2.579 ± 0.055
0.33TyrMet: 0.33 ± 0.019
0.587TyrAsn: 0.587 ± 0.034
1.186TyrPro: 1.186 ± 0.035
0.706TyrGln: 0.706 ± 0.024
2.421TyrArg: 2.421 ± 0.052
1.05TyrSer: 1.05 ± 0.04
1.088TyrThr: 1.088 ± 0.038
1.777TyrVal: 1.777 ± 0.041
0.405TyrTrp: 0.405 ± 0.022
0.646TyrTyr: 0.646 ± 0.027
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3153 proteins (1042108 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski