Amino acid dipeptide frequency for Firmicutes bacterium CAG:424

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
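As a rough illustration of what a dipeptide frequency is, the sketch below counts overlapping residue pairs in a list of protein sequences and reports them in per mille. It is a minimal sketch and not the database's own code: the function name dipeptide_per_mille, the use of one-letter codes, and the mapping of any non-standard letter to "X" (Xaa) are assumptions made for the example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard residues becomes 'X' (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from this proteome).
freqs = dipeptide_per_mille(["MKKLLA", "MKUA"])
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

The table on this page uses three-letter codes (for example AlaAla), so a real comparison would need a one-letter to three-letter mapping on top of this.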

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides were equally likely in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
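The expected value and the over/under-representation rule can be sketched as follows. This assumes the comparison is made against the uniform expectation of 1/400 (2.5‰). The page does not state the exact threshold used for colouring, so the classify helper is hypothetical; the four observed values are copied from the table.

```python
N_STANDARD = 20

# Uniform expectation: 1 / (20 * 20) = 0.25 % = 2.5 per mille.
expected_per_mille = 1000.0 / (N_STANDARD ** 2)

# With Xaa as a 21st residue there are 21 * 21 = 441 possible pairs.
expected_per_mille_with_xaa = 1000.0 / ((N_STANDARD + 1) ** 2)


def classify(observed_per_mille, expected=expected_per_mille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected:
        return "over"    # would be shown in red
    if observed_per_mille < expected:
        return "under"   # would be shown in blue
    return "expected"


# Observed values (per mille) taken from the table on this page.
examples = {"LeuLeu": 9.697, "GluGlu": 9.482, "ProPro": 0.633, "TrpTrp": 0.122}
labels = {name: classify(value) for name, value in examples.items()}
print(expected_per_mille, labels)
```

A stricter analysis would compare each pair against the product of its two single amino acid frequencies rather than against a flat 2.5‰, but that is outside what this page describes.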

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
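For readers who prefer fractions or percentages, the conversion is simple arithmetic. The helper names below are made up for illustration; the AlaAla value comes from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala = 6.597  # AlaAla frequency in per mille, from the table
print(per_mille_to_fraction(ala_ala))
print(per_mille_to_percent(ala_ala))
```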

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.597 ± 0.118
AlaCys: 1.178 ± 0.038
AlaAsp: 3.617 ± 0.067
AlaGlu: 5.157 ± 0.089
AlaPhe: 3.104 ± 0.063
AlaGly: 5.908 ± 0.1
AlaHis: 1.06 ± 0.036
AlaIle: 4.608 ± 0.083
AlaLys: 4.847 ± 0.091
AlaLeu: 6.838 ± 0.092
AlaMet: 2.427 ± 0.051
AlaAsn: 2.296 ± 0.057
AlaPro: 1.922 ± 0.048
AlaGln: 2.3 ± 0.06
AlaArg: 2.99 ± 0.058
AlaSer: 3.653 ± 0.066
AlaThr: 2.863 ± 0.063
AlaVal: 6.066 ± 0.087
AlaTrp: 0.655 ± 0.028
AlaTyr: 2.766 ± 0.057
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.065 ± 0.035
CysCys: 0.312 ± 0.021
CysAsp: 0.777 ± 0.035
CysGlu: 0.95 ± 0.033
CysPhe: 0.676 ± 0.026
CysGly: 1.527 ± 0.049
CysHis: 0.346 ± 0.021
CysIle: 1.118 ± 0.037
CysLys: 0.98 ± 0.034
CysLeu: 1.233 ± 0.037
CysMet: 0.501 ± 0.024
CysAsn: 0.546 ± 0.023
CysPro: 0.643 ± 0.033
CysGln: 0.537 ± 0.025
CysArg: 0.669 ± 0.03
CysSer: 0.962 ± 0.032
CysThr: 0.825 ± 0.029
CysVal: 1.088 ± 0.037
CysTrp: 0.134 ± 0.012
CysTyr: 0.62 ± 0.024
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.574 ± 0.073
AspCys: 0.822 ± 0.033
AspAsp: 2.061 ± 0.055
AspGlu: 4.013 ± 0.071
AspPhe: 2.66 ± 0.052
AspGly: 3.781 ± 0.076
AspHis: 0.774 ± 0.036
AspIle: 3.877 ± 0.072
AspLys: 3.218 ± 0.057
AspLeu: 4.362 ± 0.077
AspMet: 1.73 ± 0.041
AspAsn: 1.798 ± 0.048
AspPro: 1.453 ± 0.043
AspGln: 1.23 ± 0.036
AspArg: 1.97 ± 0.048
AspSer: 2.792 ± 0.059
AspThr: 2.918 ± 0.058
AspVal: 3.48 ± 0.062
AspTrp: 0.608 ± 0.033
AspTyr: 2.642 ± 0.06
AspXaa: 0.001 ± 0.001
Glu
GluAla: 5.907 ± 0.107
GluCys: 0.856 ± 0.033
GluAsp: 4.493 ± 0.073
GluGlu: 9.482 ± 0.159
GluPhe: 2.937 ± 0.055
GluGly: 4.817 ± 0.075
GluHis: 1.618 ± 0.044
GluIle: 6.242 ± 0.087
GluLys: 7.972 ± 0.107
GluLeu: 7.141 ± 0.103
GluMet: 2.618 ± 0.05
GluAsn: 4.628 ± 0.072
GluPro: 2.014 ± 0.05
GluGln: 3.5 ± 0.068
GluArg: 3.295 ± 0.064
GluSer: 3.241 ± 0.063
GluThr: 4.22 ± 0.08
GluVal: 4.839 ± 0.081
GluTrp: 0.638 ± 0.024
GluTyr: 3.357 ± 0.061
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.904 ± 0.061
PheCys: 0.789 ± 0.031
PheAsp: 2.279 ± 0.051
PheGlu: 2.723 ± 0.057
PhePhe: 2.152 ± 0.06
PheGly: 3.153 ± 0.06
PheHis: 0.98 ± 0.034
PheIle: 2.697 ± 0.06
PheLys: 1.8 ± 0.04
PheLeu: 4.861 ± 0.092
PheMet: 1.293 ± 0.038
PheAsn: 1.286 ± 0.039
PhePro: 1.554 ± 0.04
PheGln: 1.983 ± 0.046
PheArg: 1.692 ± 0.044
PheSer: 3.075 ± 0.063
PheThr: 2.321 ± 0.052
PheVal: 2.89 ± 0.063
PheTrp: 0.534 ± 0.026
PheTyr: 1.925 ± 0.05
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.944 ± 0.091
GlyCys: 1.345 ± 0.046
GlyAsp: 3.146 ± 0.063
GlyGlu: 5.231 ± 0.076
GlyPhe: 3.333 ± 0.061
GlyGly: 4.647 ± 0.086
GlyHis: 1.226 ± 0.043
GlyIle: 6.734 ± 0.096
GlyLys: 5.877 ± 0.081
GlyLeu: 5.937 ± 0.096
GlyMet: 2.659 ± 0.053
GlyAsn: 3.205 ± 0.058
GlyPro: 1.343 ± 0.036
GlyGln: 2.154 ± 0.045
GlyArg: 2.82 ± 0.062
GlySer: 3.819 ± 0.059
GlyThr: 4.079 ± 0.078
GlyVal: 5.01 ± 0.077
GlyTrp: 0.712 ± 0.028
GlyTyr: 3.287 ± 0.057
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.084 ± 0.032
HisCys: 0.34 ± 0.019
HisAsp: 0.86 ± 0.034
HisGlu: 1.074 ± 0.035
HisPhe: 0.85 ± 0.033
HisGly: 1.394 ± 0.043
HisHis: 0.501 ± 0.035
HisIle: 1.37 ± 0.04
HisLys: 1.032 ± 0.033
HisLeu: 1.641 ± 0.045
HisMet: 0.651 ± 0.027
HisAsn: 0.671 ± 0.027
HisPro: 0.96 ± 0.035
HisGln: 0.625 ± 0.026
HisArg: 0.786 ± 0.029
HisSer: 0.975 ± 0.035
HisThr: 1.059 ± 0.032
HisVal: 1.16 ± 0.04
HisTrp: 0.204 ± 0.015
HisTyr: 0.799 ± 0.031
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.19 ± 0.081
IleCys: 1.312 ± 0.043
IleAsp: 3.307 ± 0.065
IleGlu: 4.827 ± 0.085
IlePhe: 3.106 ± 0.072
IleGly: 4.933 ± 0.081
IleHis: 1.443 ± 0.037
IleIle: 4.272 ± 0.078
IleLys: 3.757 ± 0.064
IleLeu: 7.887 ± 0.113
IleMet: 1.946 ± 0.048
IleAsn: 2.364 ± 0.055
IlePro: 3.354 ± 0.07
IleGln: 3.045 ± 0.05
IleArg: 3.476 ± 0.07
IleSer: 4.819 ± 0.092
IleThr: 4.006 ± 0.063
IleVal: 4.388 ± 0.081
IleTrp: 0.695 ± 0.031
IleTyr: 2.793 ± 0.056
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.044 ± 0.08
LysCys: 0.725 ± 0.028
LysAsp: 3.721 ± 0.066
LysGlu: 8.199 ± 0.103
LysPhe: 1.956 ± 0.052
LysGly: 4.724 ± 0.076
LysHis: 1.11 ± 0.038
LysIle: 4.981 ± 0.068
LysLys: 6.693 ± 0.103
LysLeu: 5.434 ± 0.071
LysMet: 2.317 ± 0.056
LysAsn: 3.561 ± 0.063
LysPro: 2.171 ± 0.052
LysGln: 2.824 ± 0.053
LysArg: 3.2 ± 0.067
LysSer: 3.221 ± 0.056
LysThr: 3.822 ± 0.058
LysVal: 4.303 ± 0.077
LysTrp: 0.694 ± 0.029
LysTyr: 2.739 ± 0.057
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.786 ± 0.093
LeuCys: 1.739 ± 0.052
LeuAsp: 4.938 ± 0.082
LeuGlu: 7.748 ± 0.106
LeuPhe: 4.141 ± 0.086
LeuGly: 6.722 ± 0.093
LeuHis: 1.629 ± 0.038
LeuIle: 5.738 ± 0.094
LeuLys: 6.553 ± 0.074
LeuLeu: 9.697 ± 0.157
LeuMet: 2.695 ± 0.058
LeuAsn: 3.673 ± 0.065
LeuPro: 3.79 ± 0.068
LeuGln: 3.284 ± 0.068
LeuArg: 3.59 ± 0.063
LeuSer: 6.021 ± 0.097
LeuThr: 4.945 ± 0.082
LeuVal: 5.783 ± 0.076
LeuTrp: 0.87 ± 0.037
LeuTyr: 3.526 ± 0.071
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.509 ± 0.061
MetCys: 0.379 ± 0.021
MetAsp: 1.728 ± 0.047
MetGlu: 3.202 ± 0.06
MetPhe: 1.059 ± 0.034
MetGly: 2.333 ± 0.053
MetHis: 0.472 ± 0.024
MetIle: 2.194 ± 0.051
MetLys: 2.866 ± 0.053
MetLeu: 2.886 ± 0.06
MetMet: 1.028 ± 0.035
MetAsn: 1.544 ± 0.045
MetPro: 1.241 ± 0.04
MetGln: 1.071 ± 0.034
MetArg: 1.251 ± 0.034
MetSer: 1.693 ± 0.044
MetThr: 1.71 ± 0.048
MetVal: 2.092 ± 0.05
MetTrp: 0.236 ± 0.016
MetTyr: 0.981 ± 0.032
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.703 ± 0.065
AsnCys: 0.6 ± 0.027
AsnAsp: 1.522 ± 0.045
AsnGlu: 2.521 ± 0.056
AsnPhe: 1.701 ± 0.043
AsnGly: 3.149 ± 0.069
AsnHis: 0.883 ± 0.034
AsnIle: 3.021 ± 0.062
AsnLys: 2.384 ± 0.05
AsnLeu: 4.228 ± 0.07
AsnMet: 1.45 ± 0.041
AsnAsn: 1.529 ± 0.051
AsnPro: 2.158 ± 0.047
AsnGln: 1.834 ± 0.044
AsnArg: 1.917 ± 0.05
AsnSer: 2.125 ± 0.051
AsnThr: 2.3 ± 0.057
AsnVal: 2.521 ± 0.062
AsnTrp: 0.438 ± 0.025
AsnTyr: 1.773 ± 0.044
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.254 ± 0.052
ProCys: 0.491 ± 0.027
ProAsp: 2.073 ± 0.052
ProGlu: 3.495 ± 0.073
ProPhe: 1.7 ± 0.046
ProGly: 2.555 ± 0.055
ProHis: 0.556 ± 0.024
ProIle: 1.947 ± 0.048
ProLys: 2.122 ± 0.046
ProLeu: 3.026 ± 0.062
ProMet: 1.048 ± 0.036
ProAsn: 1.288 ± 0.038
ProPro: 0.633 ± 0.027
ProGln: 1.149 ± 0.038
ProArg: 1.023 ± 0.037
ProSer: 1.682 ± 0.039
ProThr: 1.456 ± 0.046
ProVal: 3.094 ± 0.059
ProTrp: 0.359 ± 0.024
ProTyr: 1.558 ± 0.043
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.656 ± 0.063
GlnCys: 0.382 ± 0.02
GlnAsp: 1.947 ± 0.048
GlnGlu: 4.093 ± 0.079
GlnPhe: 1.222 ± 0.035
GlnGly: 2.588 ± 0.058
GlnHis: 0.498 ± 0.026
GlnIle: 2.752 ± 0.053
GlnLys: 3.353 ± 0.062
GlnLeu: 2.923 ± 0.061
GlnMet: 1.358 ± 0.037
GlnAsn: 1.657 ± 0.043
GlnPro: 0.928 ± 0.038
GlnGln: 1.409 ± 0.052
GlnArg: 1.479 ± 0.041
GlnSer: 1.601 ± 0.048
GlnThr: 1.671 ± 0.04
GlnVal: 2.517 ± 0.05
GlnTrp: 0.395 ± 0.022
GlnTyr: 1.453 ± 0.043
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.474 ± 0.054
ArgCys: 0.556 ± 0.025
ArgAsp: 1.989 ± 0.049
ArgGlu: 4.182 ± 0.076
ArgPhe: 1.822 ± 0.049
ArgGly: 2.447 ± 0.062
ArgHis: 0.729 ± 0.028
ArgIle: 3.436 ± 0.063
ArgLys: 3.843 ± 0.061
ArgLeu: 3.563 ± 0.07
ArgMet: 1.525 ± 0.041
ArgAsn: 2.019 ± 0.043
ArgPro: 1.148 ± 0.036
ArgGln: 1.676 ± 0.04
ArgArg: 2.132 ± 0.056
ArgSer: 1.865 ± 0.048
ArgThr: 2.113 ± 0.051
ArgVal: 2.421 ± 0.057
ArgTrp: 0.343 ± 0.02
ArgTyr: 1.726 ± 0.045
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.637 ± 0.063
SerCys: 0.945 ± 0.03
SerAsp: 2.531 ± 0.058
SerGlu: 3.594 ± 0.068
SerPhe: 2.68 ± 0.056
SerGly: 4.319 ± 0.069
SerHis: 1.099 ± 0.034
SerIle: 3.953 ± 0.081
SerLys: 3.323 ± 0.064
SerLeu: 5.206 ± 0.077
SerMet: 1.95 ± 0.05
SerAsn: 1.954 ± 0.05
SerPro: 1.769 ± 0.047
SerGln: 2.141 ± 0.054
SerArg: 2.591 ± 0.051
SerSer: 3.177 ± 0.062
SerThr: 2.477 ± 0.053
SerVal: 4.255 ± 0.066
SerTrp: 0.598 ± 0.028
SerTyr: 2.386 ± 0.052
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.026 ± 0.067
ThrCys: 0.703 ± 0.038
ThrAsp: 2.714 ± 0.053
ThrGlu: 4.094 ± 0.076
ThrPhe: 2.15 ± 0.048
ThrGly: 4.568 ± 0.078
ThrHis: 0.905 ± 0.032
ThrIle: 3.678 ± 0.06
ThrLys: 2.921 ± 0.059
ThrLeu: 4.916 ± 0.083
ThrMet: 1.566 ± 0.051
ThrAsn: 1.806 ± 0.045
ThrPro: 2.084 ± 0.051
ThrGln: 1.637 ± 0.042
ThrArg: 2.03 ± 0.044
ThrSer: 2.764 ± 0.058
ThrThr: 2.579 ± 0.061
ThrVal: 4.091 ± 0.066
ThrTrp: 0.526 ± 0.029
ThrTyr: 2.065 ± 0.056
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.35 ± 0.088
ValCys: 1.274 ± 0.036
ValAsp: 3.292 ± 0.063
ValGlu: 4.922 ± 0.076
ValPhe: 3.276 ± 0.065
ValGly: 4.156 ± 0.077
ValHis: 1.083 ± 0.038
ValIle: 4.869 ± 0.075
ValLys: 4.685 ± 0.066
ValLeu: 7.236 ± 0.103
ValMet: 2.092 ± 0.048
ValAsn: 2.753 ± 0.047
ValPro: 2.545 ± 0.045
ValGln: 2.307 ± 0.051
ValArg: 2.734 ± 0.059
ValSer: 4.417 ± 0.066
ValThr: 3.83 ± 0.064
ValVal: 4.778 ± 0.092
ValTrp: 0.698 ± 0.032
ValTyr: 2.736 ± 0.057
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.542 ± 0.026
TrpCys: 0.141 ± 0.013
TrpAsp: 0.559 ± 0.027
TrpGlu: 0.847 ± 0.033
TrpPhe: 0.455 ± 0.022
TrpGly: 0.695 ± 0.028
TrpHis: 0.176 ± 0.015
TrpIle: 0.719 ± 0.029
TrpLys: 0.945 ± 0.037
TrpLeu: 0.833 ± 0.028
TrpMet: 0.398 ± 0.024
TrpAsn: 0.59 ± 0.026
TrpPro: 0.196 ± 0.015
TrpGln: 0.36 ± 0.023
TrpArg: 0.372 ± 0.022
TrpSer: 0.511 ± 0.025
TrpThr: 0.423 ± 0.023
TrpVal: 0.559 ± 0.026
TrpTrp: 0.122 ± 0.011
TrpTyr: 0.414 ± 0.021
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.604 ± 0.051
TyrCys: 0.621 ± 0.027
TyrAsp: 2.21 ± 0.053
TyrGlu: 3.46 ± 0.061
TyrPhe: 1.914 ± 0.057
TyrGly: 3.09 ± 0.059
TyrHis: 0.945 ± 0.036
TyrIle: 2.714 ± 0.054
TyrLys: 2.269 ± 0.059
TyrLeu: 4.019 ± 0.074
TyrMet: 1.22 ± 0.041
TyrAsn: 1.593 ± 0.044
TyrPro: 1.605 ± 0.046
TyrGln: 1.816 ± 0.048
TyrArg: 1.992 ± 0.053
TyrSer: 2.171 ± 0.051
TyrThr: 2.206 ± 0.054
TyrVal: 2.699 ± 0.061
TyrTrp: 0.4 ± 0.024
TyrTyr: 1.919 ± 0.052
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.001 ± 0.001
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.002 ± 0.002
Statistics based on 2835 proteins (903456 amino acids)
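The protein and residue totals allow a back-of-the-envelope conversion from per mille values to approximate absolute counts. This sketch assumes that every protein of length n contributes n - 1 overlapping dipeptides, so the total number of pairs is the number of amino acids minus the number of proteins. The page does not state how the database counts pairs, so the results are estimates only.

```python
N_PROTEINS = 2835       # from the statistics line on this page
N_AMINO_ACIDS = 903456  # from the statistics line on this page

# Assumption: each protein of length n yields n - 1 overlapping pairs.
total_pairs = N_AMINO_ACIDS - N_PROTEINS


def estimated_count(per_mille, total=total_pairs):
    """Approximate number of occurrences for a per mille frequency."""
    return per_mille * 1e-3 * total


ala_ala_count = estimated_count(6.597)  # AlaAla value from the table
xaa_xaa_count = estimated_count(0.002)  # XaaXaa value from the table
print(total_pairs, round(ala_ala_count), round(xaa_xaa_count))
```

Under this assumption AlaAla corresponds to roughly 5.9 thousand occurrences, while XaaXaa corresponds to only about two, which is why its relative error is so large.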

Note: The error was estimated by bootstrapping (×100) at the protein level.
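Bootstrapping at the protein level can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. This is an illustration under assumptions, not the database's actual procedure. In particular, using the standard deviation as the "±" value, the function names, and the toy sequences are all hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille."""
    rng = random.Random(seed)
    # Pre-compute counts once; resampling then works on whole proteins.
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # Counter returns 0 if absent
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy proteome (hypothetical sequences).
proteins = ["MKKLLA", "MAAKK", "MLLKA", "MKAAL"]
mean, err = bootstrap_error(proteins, "KK")
print(f"KK: {mean:.3f} ± {err:.3f}")
```

Resampling proteins rather than individual dipeptides keeps pairs from the same protein together, so the error reflects variation between proteins.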

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski