Amino acid dipeptide frequency for bacterium E08(2017)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the 400 standard dipeptides occurred in proteins with equal probability, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
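The counting described above can be sketched in Python as follows. This is a minimal illustration, not the code used to build the database: the function name `dipeptide_per_mille`, the use of one-letter codes (the table uses three-letter codes), the choice to count overlapping pairs, and the mapping of every non-standard letter to X (Xaa) are all assumptions.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes
ALPHABET = AMINO_ACIDS + "X"          # X stands for Xaa (assumed 21st symbol)

# Uniform expectation quoted in the text: 1/400 = 0.25% = 2.5 per mille.
UNIFORM_EXPECTATION_PER_MILLE = 1000.0 / len(AMINO_ACIDS) ** 2


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Assumptions: sequences are upper-case one-letter strings, pairs overlap
    (residues i and i+1), and pairs never span two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            # Anything outside the 20 standard letters is treated as Xaa.
            first = first if first in AMINO_ACIDS else "X"
            second = second if second in AMINO_ACIDS else "X"
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found (need a sequence of length >= 2)")
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(ALPHABET, repeat=2)
    }


if __name__ == "__main__":
    # Two made-up toy sequences, used only to show the call.
    freqs = dipeptide_per_mille(["MKVLAAGIV", "MAAK"])
    print(freqs["AA"], UNIFORM_EXPECTATION_PER_MILLE)
```

Because every pair is drawn from the same total, the 441 returned values always sum to 1000‰.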

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
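As a small worked example of the unit conversion, the sketch below converts two values from the table (AlaAla and CysCys) from per mille to fractions and compares them with the 0.25% uniform expectation mentioned in the text. The helper names `per_mille_to_fraction` and `classify` are hypothetical, and using the uniform expectation as the red/blue threshold is an assumption based on the description on this page.

```python
UNIFORM_PER_MILLE = 1000.0 / 400  # 0.25% expressed in per mille


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=UNIFORM_PER_MILLE):
    """Label a per mille frequency relative to the expected value."""
    if value > expected:
        return "over"    # would be marked in red
    if value < expected:
        return "under"   # would be marked in blue
    return "as expected"


if __name__ == "__main__":
    # Values copied from the table on this page.
    for name, value in [("AlaAla", 7.844), ("CysCys", 0.276)]:
        print(name, per_mille_to_fraction(value), classify(value))
```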

For more information, see the sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.844AlaAla: 7.844 ± 0.203
1.046AlaCys: 1.046 ± 0.054
5.182AlaAsp: 5.182 ± 0.114
5.922AlaGlu: 5.922 ± 0.16
3.126AlaPhe: 3.126 ± 0.091
7.095AlaGly: 7.095 ± 0.16
1.277AlaHis: 1.277 ± 0.058
5.258AlaIle: 5.258 ± 0.122
4.403AlaLys: 4.403 ± 0.183
7.232AlaLeu: 7.232 ± 0.152
2.339AlaMet: 2.339 ± 0.081
2.98AlaAsn: 2.98 ± 0.095
2.93AlaPro: 2.93 ± 0.117
2.4AlaGln: 2.4 ± 0.083
4.134AlaArg: 4.134 ± 0.122
5.38AlaSer: 5.38 ± 0.122
4.038AlaThr: 4.038 ± 0.136
6.316AlaVal: 6.316 ± 0.144
0.996AlaTrp: 0.996 ± 0.046
2.4AlaTyr: 2.4 ± 0.081
0.0AlaXaa: 0.0 ± 0.0
Cys
0.94CysAla: 0.94 ± 0.051
0.276CysCys: 0.276 ± 0.031
0.683CysAsp: 0.683 ± 0.038
0.756CysGlu: 0.756 ± 0.042
0.469CysPhe: 0.469 ± 0.037
1.131CysGly: 1.131 ± 0.06
0.309CysHis: 0.309 ± 0.031
0.787CysIle: 0.787 ± 0.05
0.542CysLys: 0.542 ± 0.04
1.124CysLeu: 1.124 ± 0.056
0.28CysMet: 0.28 ± 0.025
0.441CysAsn: 0.441 ± 0.037
0.62CysPro: 0.62 ± 0.044
0.278CysGln: 0.278 ± 0.022
0.726CysArg: 0.726 ± 0.047
0.909CysSer: 0.909 ± 0.05
0.608CysThr: 0.608 ± 0.038
0.85CysVal: 0.85 ± 0.044
0.163CysTrp: 0.163 ± 0.02
0.375CysTyr: 0.375 ± 0.038
0.0CysXaa: 0.0 ± 0.0
Asp
5.074AspAla: 5.074 ± 0.129
0.744AspCys: 0.744 ± 0.048
4.228AspAsp: 4.228 ± 0.126
4.53AspGlu: 4.53 ± 0.118
2.457AspPhe: 2.457 ± 0.091
5.418AspGly: 5.418 ± 0.194
1.072AspHis: 1.072 ± 0.05
4.801AspIle: 4.801 ± 0.124
3.411AspLys: 3.411 ± 0.089
5.314AspLeu: 5.314 ± 0.12
1.812AspMet: 1.812 ± 0.071
2.563AspAsn: 2.563 ± 0.086
2.775AspPro: 2.775 ± 0.101
1.635AspGln: 1.635 ± 0.069
2.81AspArg: 2.81 ± 0.09
4.101AspSer: 4.101 ± 0.113
3.183AspThr: 3.183 ± 0.104
4.389AspVal: 4.389 ± 0.111
1.086AspTrp: 1.086 ± 0.056
2.325AspTyr: 2.325 ± 0.091
0.0AspXaa: 0.0 ± 0.0
Glu
5.809GluAla: 5.809 ± 0.169
0.678GluCys: 0.678 ± 0.042
3.795GluAsp: 3.795 ± 0.109
5.022GluGlu: 5.022 ± 0.19
2.273GluPhe: 2.273 ± 0.083
4.334GluGly: 4.334 ± 0.138
1.42GluHis: 1.42 ± 0.06
4.29GluIle: 4.29 ± 0.103
4.511GluLys: 4.511 ± 0.181
6.457GluLeu: 6.457 ± 0.152
1.972GluMet: 1.972 ± 0.079
2.349GluAsn: 2.349 ± 0.08
2.306GluPro: 2.306 ± 0.083
2.506GluGln: 2.506 ± 0.092
3.562GluArg: 3.562 ± 0.111
4.146GluSer: 4.146 ± 0.112
3.512GluThr: 3.512 ± 0.093
4.488GluVal: 4.488 ± 0.096
0.93GluTrp: 0.93 ± 0.043
2.245GluTyr: 2.245 ± 0.069
0.0GluXaa: 0.0 ± 0.0
Phe
2.77PheAla: 2.77 ± 0.08
0.561PheCys: 0.561 ± 0.039
2.617PheAsp: 2.617 ± 0.098
2.445PheGlu: 2.445 ± 0.073
1.738PhePhe: 1.738 ± 0.076
2.874PheGly: 2.874 ± 0.092
0.648PheHis: 0.648 ± 0.039
2.405PheIle: 2.405 ± 0.094
1.936PheLys: 1.936 ± 0.064
3.387PheLeu: 3.387 ± 0.132
1.025PheMet: 1.025 ± 0.051
1.505PheAsn: 1.505 ± 0.052
1.451PhePro: 1.451 ± 0.068
0.966PheGln: 0.966 ± 0.05
1.953PheArg: 1.953 ± 0.07
2.82PheSer: 2.82 ± 0.077
2.287PheThr: 2.287 ± 0.078
2.777PheVal: 2.777 ± 0.09
0.554PheTrp: 0.554 ± 0.043
1.366PheTyr: 1.366 ± 0.062
0.0PheXaa: 0.0 ± 0.0
Gly
5.835GlyAla: 5.835 ± 0.143
1.074GlyCys: 1.074 ± 0.058
4.577GlyAsp: 4.577 ± 0.151
4.353GlyGlu: 4.353 ± 0.125
3.168GlyPhe: 3.168 ± 0.088
6.049GlyGly: 6.049 ± 0.205
1.67GlyHis: 1.67 ± 0.061
5.015GlyIle: 5.015 ± 0.13
4.9GlyLys: 4.9 ± 0.135
6.643GlyLeu: 6.643 ± 0.155
2.457GlyMet: 2.457 ± 0.089
3.039GlyAsn: 3.039 ± 0.108
2.28GlyPro: 2.28 ± 0.085
2.318GlyGln: 2.318 ± 0.075
3.885GlyArg: 3.885 ± 0.11
5.128GlySer: 5.128 ± 0.168
4.598GlyThr: 4.598 ± 0.162
5.385GlyVal: 5.385 ± 0.126
1.062GlyTrp: 1.062 ± 0.051
2.815GlyTyr: 2.815 ± 0.074
0.0GlyXaa: 0.0 ± 0.0
His
1.484HisAla: 1.484 ± 0.062
0.269HisCys: 0.269 ± 0.027
1.22HisAsp: 1.22 ± 0.059
1.234HisGlu: 1.234 ± 0.059
0.857HisPhe: 0.857 ± 0.044
1.663HisGly: 1.663 ± 0.073
0.478HisHis: 0.478 ± 0.033
1.265HisIle: 1.265 ± 0.061
1.058HisLys: 1.058 ± 0.048
1.762HisLeu: 1.762 ± 0.065
0.473HisMet: 0.473 ± 0.033
0.759HisAsn: 0.759 ± 0.044
1.006HisPro: 1.006 ± 0.054
0.535HisGln: 0.535 ± 0.041
0.928HisArg: 0.928 ± 0.042
1.211HisSer: 1.211 ± 0.054
0.999HisThr: 0.999 ± 0.046
1.272HisVal: 1.272 ± 0.057
0.261HisTrp: 0.261 ± 0.029
0.726HisTyr: 0.726 ± 0.039
0.0HisXaa: 0.0 ± 0.0
Ile
5.59IleAla: 5.59 ± 0.129
0.952IleCys: 0.952 ± 0.049
4.415IleAsp: 4.415 ± 0.113
4.448IleGlu: 4.448 ± 0.111
2.299IlePhe: 2.299 ± 0.087
4.579IleGly: 4.579 ± 0.108
1.241IleHis: 1.241 ± 0.059
4.127IleIle: 4.127 ± 0.112
3.687IleLys: 3.687 ± 0.121
5.482IleLeu: 5.482 ± 0.137
1.609IleMet: 1.609 ± 0.065
2.74IleAsn: 2.74 ± 0.084
2.895IlePro: 2.895 ± 0.086
1.797IleGln: 1.797 ± 0.067
3.251IleArg: 3.251 ± 0.107
4.586IleSer: 4.586 ± 0.104
3.885IleThr: 3.885 ± 0.12
4.488IleVal: 4.488 ± 0.112
0.648IleTrp: 0.648 ± 0.039
1.885IleTyr: 1.885 ± 0.066
0.0IleXaa: 0.0 ± 0.0
Lys
4.907LysAla: 4.907 ± 0.168
0.558LysCys: 0.558 ± 0.036
3.616LysAsp: 3.616 ± 0.112
4.214LysGlu: 4.214 ± 0.174
1.543LysPhe: 1.543 ± 0.053
3.95LysGly: 3.95 ± 0.108
1.27LysHis: 1.27 ± 0.056
3.272LysIle: 3.272 ± 0.088
4.471LysLys: 4.471 ± 0.145
5.131LysLeu: 5.131 ± 0.13
1.637LysMet: 1.637 ± 0.068
2.292LysAsn: 2.292 ± 0.081
2.711LysPro: 2.711 ± 0.084
2.066LysGln: 2.066 ± 0.09
3.175LysArg: 3.175 ± 0.098
3.493LysSer: 3.493 ± 0.102
3.291LysThr: 3.291 ± 0.101
3.932LysVal: 3.932 ± 0.11
0.7LysTrp: 0.7 ± 0.043
1.922LysTyr: 1.922 ± 0.072
0.0LysXaa: 0.0 ± 0.0
Leu
7.757LeuAla: 7.757 ± 0.155
1.093LeuCys: 1.093 ± 0.061
5.663LeuAsp: 5.663 ± 0.129
5.967LeuGlu: 5.967 ± 0.147
3.564LeuPhe: 3.564 ± 0.122
6.509LeuGly: 6.509 ± 0.149
1.63LeuHis: 1.63 ± 0.063
5.277LeuIle: 5.277 ± 0.158
5.583LeuLys: 5.583 ± 0.141
8.518LeuLeu: 8.518 ± 0.248
2.41LeuMet: 2.41 ± 0.089
3.637LeuAsn: 3.637 ± 0.097
4.014LeuPro: 4.014 ± 0.102
2.631LeuGln: 2.631 ± 0.08
4.45LeuArg: 4.45 ± 0.114
6.506LeuSer: 6.506 ± 0.139
5.072LeuThr: 5.072 ± 0.125
6.137LeuVal: 6.137 ± 0.133
0.916LeuTrp: 0.916 ± 0.05
2.695LeuTyr: 2.695 ± 0.09
0.0LeuXaa: 0.0 ± 0.0
Met
2.525MetAla: 2.525 ± 0.089
0.271MetCys: 0.271 ± 0.023
1.63MetAsp: 1.63 ± 0.059
1.59MetGlu: 1.59 ± 0.07
0.883MetPhe: 0.883 ± 0.049
1.854MetGly: 1.854 ± 0.079
0.488MetHis: 0.488 ± 0.034
1.781MetIle: 1.781 ± 0.065
1.882MetLys: 1.882 ± 0.064
2.688MetLeu: 2.688 ± 0.088
0.817MetMet: 0.817 ± 0.048
1.237MetAsn: 1.237 ± 0.063
1.345MetPro: 1.345 ± 0.055
0.933MetGln: 0.933 ± 0.053
1.479MetArg: 1.479 ± 0.065
1.92MetSer: 1.92 ± 0.063
1.703MetThr: 1.703 ± 0.053
1.8MetVal: 1.8 ± 0.065
0.273MetTrp: 0.273 ± 0.025
0.636MetTyr: 0.636 ± 0.038
0.0MetXaa: 0.0 ± 0.0
Asn
3.18AsnAla: 3.18 ± 0.093
0.521AsnCys: 0.521 ± 0.044
2.544AsnAsp: 2.544 ± 0.105
2.153AsnGlu: 2.153 ± 0.07
1.26AsnPhe: 1.26 ± 0.056
3.442AsnGly: 3.442 ± 0.134
0.777AsnHis: 0.777 ± 0.049
2.985AsnIle: 2.985 ± 0.094
2.054AsnLys: 2.054 ± 0.08
3.486AsnLeu: 3.486 ± 0.096
1.069AsnMet: 1.069 ± 0.054
1.781AsnAsn: 1.781 ± 0.076
2.21AsnPro: 2.21 ± 0.081
1.1AsnGln: 1.1 ± 0.053
1.976AsnArg: 1.976 ± 0.066
2.488AsnSer: 2.488 ± 0.099
2.273AsnThr: 2.273 ± 0.097
2.874AsnVal: 2.874 ± 0.1
0.655AsnTrp: 0.655 ± 0.041
1.279AsnTyr: 1.279 ± 0.062
0.0AsnXaa: 0.0 ± 0.0
Pro
3.522ProAla: 3.522 ± 0.115
0.405ProCys: 0.405 ± 0.032
3.154ProAsp: 3.154 ± 0.092
3.512ProGlu: 3.512 ± 0.106
1.63ProPhe: 1.63 ± 0.066
3.244ProGly: 3.244 ± 0.097
0.843ProHis: 0.843 ± 0.052
2.179ProIle: 2.179 ± 0.078
2.019ProLys: 2.019 ± 0.077
3.411ProLeu: 3.411 ± 0.104
1.093ProMet: 1.093 ± 0.051
1.541ProAsn: 1.541 ± 0.072
1.628ProPro: 1.628 ± 0.07
1.126ProGln: 1.126 ± 0.048
1.621ProArg: 1.621 ± 0.076
2.7ProSer: 2.7 ± 0.1
1.972ProThr: 1.972 ± 0.078
3.656ProVal: 3.656 ± 0.092
0.528ProTrp: 0.528 ± 0.042
1.307ProTyr: 1.307 ± 0.05
0.0ProXaa: 0.0 ± 0.0
Gln
2.584GlnAla: 2.584 ± 0.075
0.287GlnCys: 0.287 ± 0.028
1.689GlnAsp: 1.689 ± 0.059
2.226GlnGlu: 2.226 ± 0.084
1.06GlnPhe: 1.06 ± 0.051
2.078GlnGly: 2.078 ± 0.065
0.655GlnHis: 0.655 ± 0.041
1.734GlnIle: 1.734 ± 0.07
1.863GlnLys: 1.863 ± 0.082
2.756GlnLeu: 2.756 ± 0.091
0.923GlnMet: 0.923 ± 0.043
1.154GlnAsn: 1.154 ± 0.051
1.164GlnPro: 1.164 ± 0.053
1.15GlnGln: 1.15 ± 0.092
1.505GlnArg: 1.505 ± 0.067
1.816GlnSer: 1.816 ± 0.071
1.503GlnThr: 1.503 ± 0.057
2.148GlnVal: 2.148 ± 0.065
0.422GlnTrp: 0.422 ± 0.033
1.051GlnTyr: 1.051 ± 0.048
0.0GlnXaa: 0.0 ± 0.0
Arg
3.569ArgAla: 3.569 ± 0.101
0.558ArgCys: 0.558 ± 0.039
3.027ArgAsp: 3.027 ± 0.09
3.597ArgGlu: 3.597 ± 0.129
2.042ArgPhe: 2.042 ± 0.074
2.98ArgGly: 2.98 ± 0.093
1.065ArgHis: 1.065 ± 0.048
3.625ArgIle: 3.625 ± 0.108
3.543ArgLys: 3.543 ± 0.108
4.754ArgLeu: 4.754 ± 0.121
1.493ArgMet: 1.493 ± 0.057
1.934ArgAsn: 1.934 ± 0.071
1.781ArgPro: 1.781 ± 0.072
1.614ArgGln: 1.614 ± 0.067
2.756ArgArg: 2.756 ± 0.094
3.011ArgSer: 3.011 ± 0.081
2.457ArgThr: 2.457 ± 0.084
3.385ArgVal: 3.385 ± 0.085
0.702ArgTrp: 0.702 ± 0.041
1.835ArgTyr: 1.835 ± 0.07
0.0ArgXaa: 0.0 ± 0.0
Ser
5.211SerAla: 5.211 ± 0.128
0.824SerCys: 0.824 ± 0.05
4.443SerAsp: 4.443 ± 0.124
4.228SerGlu: 4.228 ± 0.123
2.867SerPhe: 2.867 ± 0.087
6.155SerGly: 6.155 ± 0.152
1.364SerHis: 1.364 ± 0.075
4.412SerIle: 4.412 ± 0.109
3.491SerLys: 3.491 ± 0.095
5.955SerLeu: 5.955 ± 0.134
1.797SerMet: 1.797 ± 0.062
2.756SerAsn: 2.756 ± 0.094
2.591SerPro: 2.591 ± 0.09
1.75SerGln: 1.75 ± 0.062
3.117SerArg: 3.117 ± 0.091
5.02SerSer: 5.02 ± 0.143
3.432SerThr: 3.432 ± 0.119
4.777SerVal: 4.777 ± 0.116
0.935SerTrp: 0.935 ± 0.052
2.021SerTyr: 2.021 ± 0.083
0.0SerXaa: 0.0 ± 0.0
Thr
4.834ThrAla: 4.834 ± 0.124
0.577ThrCys: 0.577 ± 0.039
3.566ThrAsp: 3.566 ± 0.114
3.081ThrGlu: 3.081 ± 0.09
2.108ThrPhe: 2.108 ± 0.083
4.636ThrGly: 4.636 ± 0.115
0.966ThrHis: 0.966 ± 0.048
3.727ThrIle: 3.727 ± 0.11
2.579ThrLys: 2.579 ± 0.099
4.794ThrLeu: 4.794 ± 0.112
1.293ThrMet: 1.293 ± 0.061
2.678ThrAsn: 2.678 ± 0.219
2.577ThrPro: 2.577 ± 0.095
1.357ThrGln: 1.357 ± 0.06
2.313ThrArg: 2.313 ± 0.08
3.428ThrSer: 3.428 ± 0.124
3.048ThrThr: 3.048 ± 0.123
4.737ThrVal: 4.737 ± 0.153
0.832ThrTrp: 0.832 ± 0.059
1.84ThrTyr: 1.84 ± 0.079
0.0ThrXaa: 0.0 ± 0.0
Val
5.59ValAla: 5.59 ± 0.125
0.949ValCys: 0.949 ± 0.057
4.605ValAsp: 4.605 ± 0.114
4.528ValGlu: 4.528 ± 0.117
2.843ValPhe: 2.843 ± 0.09
4.886ValGly: 4.886 ± 0.143
1.265ValHis: 1.265 ± 0.058
4.952ValIle: 4.952 ± 0.116
3.771ValLys: 3.771 ± 0.119
6.874ValLeu: 6.874 ± 0.13
1.993ValMet: 1.993 ± 0.072
2.78ValAsn: 2.78 ± 0.087
2.907ValPro: 2.907 ± 0.085
1.995ValGln: 1.995 ± 0.066
3.55ValArg: 3.55 ± 0.098
5.131ValSer: 5.131 ± 0.128
4.554ValThr: 4.554 ± 0.132
5.47ValVal: 5.47 ± 0.143
0.801ValTrp: 0.801 ± 0.057
2.372ValTyr: 2.372 ± 0.084
0.0ValXaa: 0.0 ± 0.0
Trp
0.912TrpAla: 0.912 ± 0.056
0.188TrpCys: 0.188 ± 0.019
0.737TrpAsp: 0.737 ± 0.049
0.747TrpGlu: 0.747 ± 0.043
0.504TrpPhe: 0.504 ± 0.038
0.956TrpGly: 0.956 ± 0.054
0.337TrpHis: 0.337 ± 0.032
0.763TrpIle: 0.763 ± 0.045
0.69TrpLys: 0.69 ± 0.043
1.293TrpLeu: 1.293 ± 0.054
0.36TrpMet: 0.36 ± 0.03
0.624TrpAsn: 0.624 ± 0.044
0.466TrpPro: 0.466 ± 0.039
0.605TrpGln: 0.605 ± 0.04
0.749TrpArg: 0.749 ± 0.048
0.971TrpSer: 0.971 ± 0.054
0.761TrpThr: 0.761 ± 0.055
0.806TrpVal: 0.806 ± 0.048
0.198TrpTrp: 0.198 ± 0.023
0.466TrpTyr: 0.466 ± 0.033
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.363TyrAla: 2.363 ± 0.083
0.431TyrCys: 0.431 ± 0.035
2.334TyrAsp: 2.334 ± 0.092
2.052TyrGlu: 2.052 ± 0.075
1.392TyrPhe: 1.392 ± 0.061
2.422TyrGly: 2.422 ± 0.091
0.686TyrHis: 0.686 ± 0.039
1.948TyrIle: 1.948 ± 0.076
1.757TyrLys: 1.757 ± 0.068
2.968TyrLeu: 2.968 ± 0.093
0.879TyrMet: 0.879 ± 0.046
1.319TyrAsn: 1.319 ± 0.058
1.432TyrPro: 1.432 ± 0.064
1.006TyrGln: 1.006 ± 0.049
1.788TyrArg: 1.788 ± 0.07
2.384TyrSer: 2.384 ± 0.093
1.797TyrThr: 1.797 ± 0.085
2.13TyrVal: 2.13 ± 0.093
0.473TyrTrp: 0.473 ± 0.038
1.456TyrTyr: 1.456 ± 0.08
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1267 proteins (424508 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
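A protein-level bootstrap of this kind can be sketched as below. This is an illustration under stated assumptions, not the database's own procedure: the function names are hypothetical, whole proteins are resampled with replacement, pairs are counted with overlap, and the reported error is taken to be the standard deviation across the 100 resamples.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping residue pairs in one protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Each resample draws len(proteins) whole proteins with replacement, so the
    unit being resampled is the protein, not the individual dipeptide.
    n_resamples must be at least 2 for the standard deviation to be defined.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    per_protein = [dipeptide_counts(seq) for seq in proteins]
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(counts.values()) for counts in sample)
        hits = sum(counts[dipeptide] for counts in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


if __name__ == "__main__":
    # Made-up toy sequences, used only to show the call.
    toy = ["MKVLAAGIV", "MAAK", "MLLAAALK", "MGSV"]
    mean, err = bootstrap_error(toy, "AA")
    print(f"AA: {mean:.3f} ± {err:.3f} per mille")
```

Resampling whole proteins rather than individual dipeptides keeps the correlation between neighbouring residues within each protein intact in every resample.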

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski