Amino acid dipeptide frequency for Bacillus sp. CAG:988

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
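The calculation described above can be sketched as follows. This is only an illustration under stated assumptions, not the procedure used to build this table: it assumes overlapping residue pairs are counted within each protein, that any non-standard letter is mapped to X (Xaa), and that "over" and "under" are judged against the uniform 1/400 expectation. The function names and the toy sequences are hypothetical, and one-letter codes are used instead of the three-letter codes shown in the table.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard one-letter codes
UNIFORM_PER_MILLE = 1000.0 / 400    # 2.5 per mille, i.e. 0.25 %


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs; pairs never span two different proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total) if total else 0.0
        for a, b in product(alphabet, repeat=2)
    }


def label(freq, expected=UNIFORM_PER_MILLE):
    """Classify a frequency relative to the uniform expectation."""
    if freq > expected:
        return "over"
    if freq < expected:
        return "under"
    return "expected"


# Hypothetical toy input, not taken from the proteome.
toy = ["MKKLLEEK", "MAXLLK"]
freqs = dipeptide_per_mille(toy)
print(label(freqs["LL"]), label(freqs["WW"]))
```

A stricter baseline would compare each dipeptide with the product of its two single amino acid frequencies; the uniform expectation is used here only because it is the one stated in the text.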

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.
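As a small worked example of this unit conversion, the sketch below converts the AlaAla value from the table into a fraction and a percentage. The helper names are hypothetical.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1 % = 10 per mille)."""
    return value / 10.0


ala_ala = 2.678  # AlaAla value from the table, in per mille
print(per_mille_to_fraction(ala_ala))  # about 0.002678
print(per_mille_to_percent(ala_ala))   # about 0.2678 %
```

On this scale the uniform expectation of 0.25% corresponds to 2.5‰, so AlaAla lies slightly above it.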

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
2.678AlaAla: 2.678 ± 0.106
0.676AlaCys: 0.676 ± 0.036
2.666AlaAsp: 2.666 ± 0.089
2.907AlaGlu: 2.907 ± 0.092
2.395AlaPhe: 2.395 ± 0.084
3.097AlaGly: 3.097 ± 0.096
0.894AlaHis: 0.894 ± 0.051
5.045AlaIle: 5.045 ± 0.135
4.591AlaLys: 4.591 ± 0.134
4.917AlaLeu: 4.917 ± 0.122
1.648AlaMet: 1.648 ± 0.067
2.84AlaAsn: 2.84 ± 0.106
1.484AlaPro: 1.484 ± 0.087
1.22AlaGln: 1.22 ± 0.057
2.146AlaArg: 2.146 ± 0.07
3.915AlaSer: 3.915 ± 0.095
3.482AlaThr: 3.482 ± 0.098
3.318AlaVal: 3.318 ± 0.092
0.323AlaTrp: 0.323 ± 0.027
2.257AlaTyr: 2.257 ± 0.085
0.0AlaXaa: 0.0 ± 0.0
Cys
0.573CysAla: 0.573 ± 0.039
0.171CysCys: 0.171 ± 0.023
0.73CysAsp: 0.73 ± 0.041
0.652CysGlu: 0.652 ± 0.035
0.583CysPhe: 0.583 ± 0.037
0.833CysGly: 0.833 ± 0.05
0.293CysHis: 0.293 ± 0.028
0.847CysIle: 0.847 ± 0.041
0.83CysLys: 0.83 ± 0.051
0.997CysLeu: 0.997 ± 0.057
0.245CysMet: 0.245 ± 0.023
0.595CysAsn: 0.595 ± 0.044
0.478CysPro: 0.478 ± 0.037
0.331CysGln: 0.331 ± 0.028
0.371CysArg: 0.371 ± 0.029
0.664CysSer: 0.664 ± 0.044
0.576CysThr: 0.576 ± 0.041
0.602CysVal: 0.602 ± 0.037
0.095CysTrp: 0.095 ± 0.015
0.604CysTyr: 0.604 ± 0.038
0.0CysXaa: 0.0 ± 0.0
Asp
3.744AspAla: 3.744 ± 0.097
0.623AspCys: 0.623 ± 0.041
3.925AspAsp: 3.925 ± 0.112
4.95AspGlu: 4.95 ± 0.131
3.064AspPhe: 3.064 ± 0.089
3.718AspGly: 3.718 ± 0.11
0.992AspHis: 0.992 ± 0.055
5.181AspIle: 5.181 ± 0.124
3.761AspLys: 3.761 ± 0.098
5.445AspLeu: 5.445 ± 0.113
1.622AspMet: 1.622 ± 0.063
2.847AspAsn: 2.847 ± 0.092
1.629AspPro: 1.629 ± 0.06
1.796AspGln: 1.796 ± 0.066
2.086AspArg: 2.086 ± 0.074
3.675AspSer: 3.675 ± 0.094
3.549AspThr: 3.549 ± 0.107
4.358AspVal: 4.358 ± 0.102
0.504AspTrp: 0.504 ± 0.033
3.463AspTyr: 3.463 ± 0.103
0.0AspXaa: 0.0 ± 0.0
Glu
3.561GluAla: 3.561 ± 0.113
0.647GluCys: 0.647 ± 0.038
4.21GluAsp: 4.21 ± 0.113
7.445GluGlu: 7.445 ± 0.182
2.885GluPhe: 2.885 ± 0.092
3.235GluGly: 3.235 ± 0.086
1.277GluHis: 1.277 ± 0.062
5.923GluIle: 5.923 ± 0.137
6.753GluLys: 6.753 ± 0.151
6.958GluLeu: 6.958 ± 0.153
2.229GluMet: 2.229 ± 0.067
4.586GluAsn: 4.586 ± 0.117
1.786GluPro: 1.786 ± 0.083
2.7GluGln: 2.7 ± 0.095
2.878GluArg: 2.878 ± 0.1
4.305GluSer: 4.305 ± 0.114
4.12GluThr: 4.12 ± 0.12
4.91GluVal: 4.91 ± 0.107
0.419GluTrp: 0.419 ± 0.031
3.468GluTyr: 3.468 ± 0.085
0.0GluXaa: 0.0 ± 0.0
Phe
2.605PheAla: 2.605 ± 0.078
0.607PheCys: 0.607 ± 0.042
2.916PheAsp: 2.916 ± 0.084
2.6PheGlu: 2.6 ± 0.085
2.343PhePhe: 2.343 ± 0.101
2.762PheGly: 2.762 ± 0.095
0.951PheHis: 0.951 ± 0.05
3.444PheIle: 3.444 ± 0.105
2.436PheLys: 2.436 ± 0.084
4.988PheLeu: 4.988 ± 0.164
1.049PheMet: 1.049 ± 0.039
1.922PheAsn: 1.922 ± 0.06
1.323PhePro: 1.323 ± 0.065
1.886PheGln: 1.886 ± 0.07
1.56PheArg: 1.56 ± 0.068
3.109PheSer: 3.109 ± 0.089
2.438PheThr: 2.438 ± 0.085
3.118PheVal: 3.118 ± 0.095
0.397PheTrp: 0.397 ± 0.03
2.414PheTyr: 2.414 ± 0.083
0.0PheXaa: 0.0 ± 0.0
Gly
2.916GlyAla: 2.916 ± 0.104
0.733GlyCys: 0.733 ± 0.042
2.895GlyAsp: 2.895 ± 0.089
3.563GlyGlu: 3.563 ± 0.094
2.588GlyPhe: 2.588 ± 0.078
3.185GlyGly: 3.185 ± 0.142
1.063GlyHis: 1.063 ± 0.05
5.128GlyIle: 5.128 ± 0.116
4.351GlyLys: 4.351 ± 0.128
4.538GlyLeu: 4.538 ± 0.114
1.682GlyMet: 1.682 ± 0.066
3.095GlyAsn: 3.095 ± 0.111
1.099GlyPro: 1.099 ± 0.059
1.28GlyGln: 1.28 ± 0.056
1.96GlyArg: 1.96 ± 0.075
3.561GlySer: 3.561 ± 0.126
3.996GlyThr: 3.996 ± 0.142
3.685GlyVal: 3.685 ± 0.107
0.511GlyTrp: 0.511 ± 0.049
3.028GlyTyr: 3.028 ± 0.093
0.002GlyXaa: 0.002 ± 0.002
His
0.937HisAla: 0.937 ± 0.047
0.245HisCys: 0.245 ± 0.027
0.98HisAsp: 0.98 ± 0.048
1.154HisGlu: 1.154 ± 0.059
0.904HisPhe: 0.904 ± 0.044
0.999HisGly: 0.999 ± 0.051
0.516HisHis: 0.516 ± 0.037
1.456HisIle: 1.456 ± 0.06
1.113HisLys: 1.113 ± 0.055
1.741HisLeu: 1.741 ± 0.066
0.44HisMet: 0.44 ± 0.032
0.844HisAsn: 0.844 ± 0.04
0.937HisPro: 0.937 ± 0.048
0.654HisGln: 0.654 ± 0.04
0.725HisArg: 0.725 ± 0.043
1.07HisSer: 1.07 ± 0.047
1.101HisThr: 1.101 ± 0.056
1.097HisVal: 1.097 ± 0.051
0.15HisTrp: 0.15 ± 0.029
1.006HisTyr: 1.006 ± 0.056
0.0HisXaa: 0.0 ± 0.0
Ile
4.907IleAla: 4.907 ± 0.111
1.016IleCys: 1.016 ± 0.047
5.535IleAsp: 5.535 ± 0.123
5.473IleGlu: 5.473 ± 0.135
3.554IlePhe: 3.554 ± 0.116
4.762IleGly: 4.762 ± 0.102
1.549IleHis: 1.549 ± 0.058
6.579IleIle: 6.579 ± 0.181
4.995IleLys: 4.995 ± 0.128
8.23IleLeu: 8.23 ± 0.166
1.915IleMet: 1.915 ± 0.065
3.775IleAsn: 3.775 ± 0.095
3.368IlePro: 3.368 ± 0.096
2.735IleGln: 2.735 ± 0.085
3.285IleArg: 3.285 ± 0.087
5.654IleSer: 5.654 ± 0.113
5.04IleThr: 5.04 ± 0.118
5.773IleVal: 5.773 ± 0.137
0.519IleTrp: 0.519 ± 0.037
3.58IleTyr: 3.58 ± 0.091
0.0IleXaa: 0.0 ± 0.0
Lys
3.371LysAla: 3.371 ± 0.109
0.645LysCys: 0.645 ± 0.044
4.99LysAsp: 4.99 ± 0.135
8.746LysGlu: 8.746 ± 0.168
2.264LysPhe: 2.264 ± 0.078
3.542LysGly: 3.542 ± 0.093
1.066LysHis: 1.066 ± 0.049
6.165LysIle: 6.165 ± 0.137
8.373LysLys: 8.373 ± 0.205
5.899LysLeu: 5.899 ± 0.131
2.345LysMet: 2.345 ± 0.089
4.8LysAsn: 4.8 ± 0.12
1.927LysPro: 1.927 ± 0.077
2.807LysGln: 2.807 ± 0.093
3.382LysArg: 3.382 ± 0.092
3.82LysSer: 3.82 ± 0.104
4.263LysThr: 4.263 ± 0.101
5.276LysVal: 5.276 ± 0.14
0.554LysTrp: 0.554 ± 0.039
3.516LysTyr: 3.516 ± 0.094
0.0LysXaa: 0.0 ± 0.0
Leu
5.331LeuAla: 5.331 ± 0.114
1.135LeuCys: 1.135 ± 0.056
5.735LeuAsp: 5.735 ± 0.131
7.174LeuGlu: 7.174 ± 0.171
4.736LeuPhe: 4.736 ± 0.153
5.259LeuGly: 5.259 ± 0.126
1.441LeuHis: 1.441 ± 0.061
7.126LeuIle: 7.126 ± 0.167
7.402LeuLys: 7.402 ± 0.135
9.591LeuLeu: 9.591 ± 0.219
2.091LeuMet: 2.091 ± 0.073
4.86LeuAsn: 4.86 ± 0.11
3.283LeuPro: 3.283 ± 0.092
3.107LeuGln: 3.107 ± 0.087
3.133LeuArg: 3.133 ± 0.088
6.843LeuSer: 6.843 ± 0.143
5.378LeuThr: 5.378 ± 0.128
5.752LeuVal: 5.752 ± 0.121
0.554LeuTrp: 0.554 ± 0.036
3.891LeuTyr: 3.891 ± 0.1
0.0LeuXaa: 0.0 ± 0.0
Met
1.56MetAla: 1.56 ± 0.066
0.186MetCys: 0.186 ± 0.02
1.689MetAsp: 1.689 ± 0.065
1.839MetGlu: 1.839 ± 0.062
1.03MetPhe: 1.03 ± 0.06
1.356MetGly: 1.356 ± 0.062
0.385MetHis: 0.385 ± 0.03
2.238MetIle: 2.238 ± 0.081
2.79MetLys: 2.79 ± 0.087
2.167MetLeu: 2.167 ± 0.07
0.816MetMet: 0.816 ± 0.04
1.691MetAsn: 1.691 ± 0.064
0.916MetPro: 0.916 ± 0.052
0.814MetGln: 0.814 ± 0.047
0.942MetArg: 0.942 ± 0.05
1.518MetSer: 1.518 ± 0.053
1.61MetThr: 1.61 ± 0.067
1.449MetVal: 1.449 ± 0.061
0.14MetTrp: 0.14 ± 0.02
0.949MetTyr: 0.949 ± 0.057
0.0MetXaa: 0.0 ± 0.0
Asn
2.833AsnAla: 2.833 ± 0.093
0.585AsnCys: 0.585 ± 0.039
3.147AsnAsp: 3.147 ± 0.093
3.777AsnGlu: 3.777 ± 0.098
2.119AsnPhe: 2.119 ± 0.064
3.404AsnGly: 3.404 ± 0.102
1.339AsnHis: 1.339 ± 0.047
4.282AsnIle: 4.282 ± 0.106
3.461AsnLys: 3.461 ± 0.108
5.052AsnLeu: 5.052 ± 0.116
1.33AsnMet: 1.33 ± 0.056
2.804AsnAsn: 2.804 ± 0.111
2.426AsnPro: 2.426 ± 0.096
2.602AsnGln: 2.602 ± 0.083
2.402AsnArg: 2.402 ± 0.076
2.964AsnSer: 2.964 ± 0.098
2.883AsnThr: 2.883 ± 0.101
3.373AsnVal: 3.373 ± 0.088
0.459AsnTrp: 0.459 ± 0.036
2.976AsnTyr: 2.976 ± 0.095
0.0AsnXaa: 0.0 ± 0.0
Pro
1.539ProAla: 1.539 ± 0.072
0.29ProCys: 0.29 ± 0.029
2.098ProAsp: 2.098 ± 0.074
2.517ProGlu: 2.517 ± 0.095
1.541ProPhe: 1.541 ± 0.07
1.47ProGly: 1.47 ± 0.07
0.473ProHis: 0.473 ± 0.037
2.719ProIle: 2.719 ± 0.087
2.469ProLys: 2.469 ± 0.075
2.362ProLeu: 2.362 ± 0.07
0.671ProMet: 0.671 ± 0.045
1.853ProAsn: 1.853 ± 0.074
0.628ProPro: 0.628 ± 0.055
0.73ProGln: 0.73 ± 0.045
0.849ProArg: 0.849 ± 0.046
2.284ProSer: 2.284 ± 0.092
2.222ProThr: 2.222 ± 0.104
2.352ProVal: 2.352 ± 0.107
0.209ProTrp: 0.209 ± 0.023
1.789ProTyr: 1.789 ± 0.066
0.0ProXaa: 0.0 ± 0.0
Gln
1.691GlnAla: 1.691 ± 0.075
0.209GlnCys: 0.209 ± 0.025
2.169GlnAsp: 2.169 ± 0.08
2.947GlnGlu: 2.947 ± 0.098
1.304GlnPhe: 1.304 ± 0.054
1.28GlnGly: 1.28 ± 0.053
0.45GlnHis: 0.45 ± 0.029
2.983GlnIle: 2.983 ± 0.096
3.485GlnLys: 3.485 ± 0.112
2.764GlnLeu: 2.764 ± 0.091
1.059GlnMet: 1.059 ± 0.055
2.46GlnAsn: 2.46 ± 0.083
0.794GlnPro: 0.794 ± 0.061
1.037GlnGln: 1.037 ± 0.061
0.97GlnArg: 0.97 ± 0.06
1.772GlnSer: 1.772 ± 0.075
2.01GlnThr: 2.01 ± 0.074
2.186GlnVal: 2.186 ± 0.074
0.186GlnTrp: 0.186 ± 0.021
1.71GlnTyr: 1.71 ± 0.066
0.0GlnXaa: 0.0 ± 0.0
Arg
1.784ArgAla: 1.784 ± 0.066
0.428ArgCys: 0.428 ± 0.03
2.231ArgAsp: 2.231 ± 0.077
3.083ArgGlu: 3.083 ± 0.098
1.541ArgPhe: 1.541 ± 0.061
1.679ArgGly: 1.679 ± 0.067
0.683ArgHis: 0.683 ± 0.039
3.128ArgIle: 3.128 ± 0.102
3.456ArgLys: 3.456 ± 0.106
3.482ArgLeu: 3.482 ± 0.086
1.192ArgMet: 1.192 ± 0.052
2.219ArgAsn: 2.219 ± 0.079
0.93ArgPro: 0.93 ± 0.05
1.192ArgGln: 1.192 ± 0.056
1.565ArgArg: 1.565 ± 0.062
2.169ArgSer: 2.169 ± 0.07
1.981ArgThr: 1.981 ± 0.073
2.326ArgVal: 2.326 ± 0.074
0.307ArgTrp: 0.307 ± 0.027
1.979ArgTyr: 1.979 ± 0.077
0.0ArgXaa: 0.0 ± 0.0
Ser
3.026SerAla: 3.026 ± 0.087
0.814SerCys: 0.814 ± 0.049
3.844SerAsp: 3.844 ± 0.103
3.908SerGlu: 3.908 ± 0.108
3.618SerPhe: 3.618 ± 0.112
3.658SerGly: 3.658 ± 0.125
1.056SerHis: 1.056 ± 0.053
5.133SerIle: 5.133 ± 0.121
5.254SerLys: 5.254 ± 0.119
6.582SerLeu: 6.582 ± 0.132
1.663SerMet: 1.663 ± 0.07
3.77SerAsn: 3.77 ± 0.104
1.663SerPro: 1.663 ± 0.069
1.996SerGln: 1.996 ± 0.066
2.217SerArg: 2.217 ± 0.076
5.25SerSer: 5.25 ± 0.168
3.57SerThr: 3.57 ± 0.107
4.289SerVal: 4.289 ± 0.104
0.442SerTrp: 0.442 ± 0.028
3.409SerTyr: 3.409 ± 0.092
0.0SerXaa: 0.0 ± 0.0
Thr
2.84ThrAla: 2.84 ± 0.097
0.668ThrCys: 0.668 ± 0.044
3.539ThrAsp: 3.539 ± 0.1
3.632ThrGlu: 3.632 ± 0.112
2.828ThrPhe: 2.828 ± 0.084
3.765ThrGly: 3.765 ± 0.115
1.047ThrHis: 1.047 ± 0.051
5.433ThrIle: 5.433 ± 0.121
4.505ThrLys: 4.505 ± 0.117
5.626ThrLeu: 5.626 ± 0.14
1.358ThrMet: 1.358 ± 0.06
3.197ThrAsn: 3.197 ± 0.094
2.291ThrPro: 2.291 ± 0.108
1.415ThrGln: 1.415 ± 0.069
1.872ThrArg: 1.872 ± 0.07
4.148ThrSer: 4.148 ± 0.11
3.699ThrThr: 3.699 ± 0.134
4.431ThrVal: 4.431 ± 0.139
0.447ThrTrp: 0.447 ± 0.032
3.007ThrTyr: 3.007 ± 0.104
0.002ThrXaa: 0.002 ± 0.002
Val
3.944ValAla: 3.944 ± 0.107
0.733ValCys: 0.733 ± 0.044
3.811ValAsp: 3.811 ± 0.107
4.196ValGlu: 4.196 ± 0.116
2.774ValPhe: 2.774 ± 0.094
3.711ValGly: 3.711 ± 0.095
1.125ValHis: 1.125 ± 0.058
5.307ValIle: 5.307 ± 0.123
4.505ValLys: 4.505 ± 0.107
6.751ValLeu: 6.751 ± 0.15
1.546ValMet: 1.546 ± 0.064
3.14ValAsn: 3.14 ± 0.091
2.412ValPro: 2.412 ± 0.085
1.934ValGln: 1.934 ± 0.068
2.429ValArg: 2.429 ± 0.088
5.176ValSer: 5.176 ± 0.125
4.66ValThr: 4.66 ± 0.143
4.695ValVal: 4.695 ± 0.135
0.438ValTrp: 0.438 ± 0.036
2.957ValTyr: 2.957 ± 0.084
0.0ValXaa: 0.0 ± 0.0
Trp
0.295TrpAla: 0.295 ± 0.025
0.088TrpCys: 0.088 ± 0.016
0.362TrpAsp: 0.362 ± 0.031
0.354TrpGlu: 0.354 ± 0.03
0.404TrpPhe: 0.404 ± 0.03
0.328TrpGly: 0.328 ± 0.031
0.15TrpHis: 0.15 ± 0.02
0.775TrpIle: 0.775 ± 0.044
0.497TrpLys: 0.497 ± 0.034
0.711TrpLeu: 0.711 ± 0.049
0.216TrpMet: 0.216 ± 0.025
0.533TrpAsn: 0.533 ± 0.039
0.14TrpPro: 0.14 ± 0.016
0.309TrpGln: 0.309 ± 0.027
0.247TrpArg: 0.247 ± 0.026
0.388TrpSer: 0.388 ± 0.035
0.371TrpThr: 0.371 ± 0.038
0.3TrpVal: 0.3 ± 0.026
0.076TrpTrp: 0.076 ± 0.011
0.502TrpTyr: 0.502 ± 0.042
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.367TyrAla: 2.367 ± 0.081
0.571TyrCys: 0.571 ± 0.042
3.459TyrAsp: 3.459 ± 0.1
3.378TyrGlu: 3.378 ± 0.092
2.395TyrPhe: 2.395 ± 0.083
2.816TyrGly: 2.816 ± 0.094
1.32TyrHis: 1.32 ± 0.059
3.349TyrIle: 3.349 ± 0.09
2.724TyrLys: 2.724 ± 0.084
5.021TyrLeu: 5.021 ± 0.122
0.966TyrMet: 0.966 ± 0.051
2.388TyrAsn: 2.388 ± 0.073
1.551TyrPro: 1.551 ± 0.056
2.921TyrGln: 2.921 ± 0.101
2.333TyrArg: 2.333 ± 0.08
2.876TyrSer: 2.876 ± 0.092
2.802TyrThr: 2.802 ± 0.112
2.952TyrVal: 2.952 ± 0.083
0.316TyrTrp: 0.316 ± 0.028
2.674TyrTyr: 2.674 ± 0.097
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.002XaaIle: 0.002 ± 0.002
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.002XaaArg: 0.002 ± 0.002
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.01XaaXaa: 0.01 ± 0.006
Statistics based on 1397 proteins (420405 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
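One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. This is an assumption about the procedure, not a description of the actual implementation; in particular, the use of the standard deviation as the error, the function names, and the fixed seed are all hypothetical.

```python
import random
from collections import Counter
from statistics import stdev


def pair_counts(seq):
    """Count overlapping residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Estimate the error (per mille) of one dipeptide frequency.

    Assumption: the error is the standard deviation over n_boot
    resamples in which whole proteins are drawn with replacement.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]  # counted once, reused
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[dipeptide] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


# Hypothetical toy input, not taken from the proteome.
toy = ["MKKLLEEK", "MALLK", "MKTAYIAK", "MLLLE"]
print(bootstrap_error(toy, "LL"))
```

Resampling proteins rather than individual residue pairs keeps the pairs from one protein together, so the estimate reflects variation between proteins.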
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski