Amino acid dipepetide frequency for Cloacibacillus evryensis DSM 19522

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
14.974AlaAla: 14.974 ± 0.331
1.392AlaCys: 1.392 ± 0.066
5.296AlaAsp: 5.296 ± 0.144
7.349AlaGlu: 7.349 ± 0.174
3.796AlaPhe: 3.796 ± 0.113
9.114AlaGly: 9.114 ± 0.191
1.608AlaHis: 1.608 ± 0.067
5.583AlaIle: 5.583 ± 0.153
5.349AlaLys: 5.349 ± 0.134
10.413AlaLeu: 10.413 ± 0.22
3.122AlaMet: 3.122 ± 0.104
2.746AlaAsn: 2.746 ± 0.097
3.639AlaPro: 3.639 ± 0.113
2.763AlaGln: 2.763 ± 0.1
5.263AlaArg: 5.263 ± 0.136
5.63AlaSer: 5.63 ± 0.165
3.92AlaThr: 3.92 ± 0.148
8.548AlaVal: 8.548 ± 0.216
0.912AlaTrp: 0.912 ± 0.05
2.627AlaTyr: 2.627 ± 0.085
0.0AlaXaa: 0.0 ± 0.0
Cys
1.826CysAla: 1.826 ± 0.08
0.265CysCys: 0.265 ± 0.025
0.787CysAsp: 0.787 ± 0.052
0.926CysGlu: 0.926 ± 0.055
0.66CysPhe: 0.66 ± 0.043
2.0CysGly: 2.0 ± 0.072
0.26CysHis: 0.26 ± 0.026
0.763CysIle: 0.763 ± 0.047
0.564CysLys: 0.564 ± 0.038
1.315CysLeu: 1.315 ± 0.061
0.345CysMet: 0.345 ± 0.03
0.362CysAsn: 0.362 ± 0.029
0.707CysPro: 0.707 ± 0.056
0.235CysGln: 0.235 ± 0.024
1.094CysArg: 1.094 ± 0.062
0.939CysSer: 0.939 ± 0.056
0.652CysThr: 0.652 ± 0.042
1.124CysVal: 1.124 ± 0.063
0.18CysTrp: 0.18 ± 0.021
0.503CysTyr: 0.503 ± 0.038
0.0CysXaa: 0.0 ± 0.0
Asp
4.813AspAla: 4.813 ± 0.128
0.779AspCys: 0.779 ± 0.049
2.713AspAsp: 2.713 ± 0.088
3.674AspGlu: 3.674 ± 0.104
2.448AspPhe: 2.448 ± 0.077
4.918AspGly: 4.918 ± 0.147
0.713AspHis: 0.713 ± 0.044
4.183AspIle: 4.183 ± 0.11
2.929AspLys: 2.929 ± 0.112
4.133AspLeu: 4.133 ± 0.137
1.619AspMet: 1.619 ± 0.067
1.68AspAsn: 1.68 ± 0.071
2.514AspPro: 2.514 ± 0.078
0.809AspGln: 0.809 ± 0.047
2.591AspArg: 2.591 ± 0.092
2.586AspSer: 2.586 ± 0.084
2.412AspThr: 2.412 ± 0.091
3.611AspVal: 3.611 ± 0.108
0.555AspTrp: 0.555 ± 0.041
1.854AspTyr: 1.854 ± 0.07
0.0AspXaa: 0.0 ± 0.0
Glu
6.863GluAla: 6.863 ± 0.182
0.856GluCys: 0.856 ± 0.051
3.111GluAsp: 3.111 ± 0.111
5.459GluGlu: 5.459 ± 0.175
2.241GluPhe: 2.241 ± 0.08
4.89GluGly: 4.89 ± 0.101
1.127GluHis: 1.127 ± 0.06
4.652GluIle: 4.652 ± 0.126
4.78GluLys: 4.78 ± 0.136
6.578GluLeu: 6.578 ± 0.155
2.312GluMet: 2.312 ± 0.091
2.763GluAsn: 2.763 ± 0.089
2.224GluPro: 2.224 ± 0.088
1.677GluGln: 1.677 ± 0.072
4.578GluArg: 4.578 ± 0.141
3.23GluSer: 3.23 ± 0.108
3.13GluThr: 3.13 ± 0.119
3.893GluVal: 3.893 ± 0.107
0.729GluTrp: 0.729 ± 0.047
2.108GluTyr: 2.108 ± 0.088
0.0GluXaa: 0.0 ± 0.0
Phe
4.318PheAla: 4.318 ± 0.147
0.716PheCys: 0.716 ± 0.042
2.249PheAsp: 2.249 ± 0.086
2.199PheGlu: 2.199 ± 0.079
2.127PhePhe: 2.127 ± 0.092
3.528PheGly: 3.528 ± 0.103
0.6PheHis: 0.6 ± 0.042
2.848PheIle: 2.848 ± 0.108
1.771PheLys: 1.771 ± 0.066
3.506PheLeu: 3.506 ± 0.119
1.243PheMet: 1.243 ± 0.068
1.194PheAsn: 1.194 ± 0.06
1.727PhePro: 1.727 ± 0.077
0.89PheGln: 0.89 ± 0.05
2.05PheArg: 2.05 ± 0.086
3.172PheSer: 3.172 ± 0.109
2.321PheThr: 2.321 ± 0.073
2.719PheVal: 2.719 ± 0.105
0.528PheTrp: 0.528 ± 0.042
1.423PheTyr: 1.423 ± 0.075
0.0PheXaa: 0.0 ± 0.0
Gly
8.443GlyAla: 8.443 ± 0.191
1.561GlyCys: 1.561 ± 0.075
4.191GlyAsp: 4.191 ± 0.116
5.47GlyGlu: 5.47 ± 0.114
3.445GlyPhe: 3.445 ± 0.113
8.194GlyGly: 8.194 ± 0.231
1.403GlyHis: 1.403 ± 0.06
5.802GlyIle: 5.802 ± 0.151
4.81GlyLys: 4.81 ± 0.128
7.479GlyLeu: 7.479 ± 0.164
2.633GlyMet: 2.633 ± 0.102
2.495GlyAsn: 2.495 ± 0.093
2.158GlyPro: 2.158 ± 0.097
1.837GlyGln: 1.837 ± 0.067
5.094GlyArg: 5.094 ± 0.12
5.028GlySer: 5.028 ± 0.138
4.614GlyThr: 4.614 ± 0.124
6.285GlyVal: 6.285 ± 0.123
0.995GlyTrp: 0.995 ± 0.054
2.948GlyTyr: 2.948 ± 0.092
0.0GlyXaa: 0.0 ± 0.0
His
1.326HisAla: 1.326 ± 0.068
0.276HisCys: 0.276 ± 0.026
0.867HisAsp: 0.867 ± 0.05
0.997HisGlu: 0.997 ± 0.063
0.696HisPhe: 0.696 ± 0.041
1.312HisGly: 1.312 ± 0.067
0.318HisHis: 0.318 ± 0.029
1.166HisIle: 1.166 ± 0.059
0.724HisLys: 0.724 ± 0.05
1.373HisLeu: 1.373 ± 0.065
0.47HisMet: 0.47 ± 0.037
0.66HisAsn: 0.66 ± 0.043
0.934HisPro: 0.934 ± 0.058
0.296HisGln: 0.296 ± 0.034
0.867HisArg: 0.867 ± 0.053
0.898HisSer: 0.898 ± 0.048
0.774HisThr: 0.774 ± 0.048
1.042HisVal: 1.042 ± 0.056
0.166HisTrp: 0.166 ± 0.02
0.528HisTyr: 0.528 ± 0.038
0.0HisXaa: 0.0 ± 0.0
Ile
6.564IleAla: 6.564 ± 0.17
1.138IleCys: 1.138 ± 0.058
3.406IleAsp: 3.406 ± 0.079
4.183IleGlu: 4.183 ± 0.119
2.929IlePhe: 2.929 ± 0.113
4.995IleGly: 4.995 ± 0.147
0.923IleHis: 0.923 ± 0.05
4.332IleIle: 4.332 ± 0.147
3.514IleLys: 3.514 ± 0.105
5.525IleLeu: 5.525 ± 0.146
1.818IleMet: 1.818 ± 0.072
2.163IleAsn: 2.163 ± 0.086
3.277IlePro: 3.277 ± 0.09
1.426IleGln: 1.426 ± 0.067
3.255IleArg: 3.255 ± 0.103
4.52IleSer: 4.52 ± 0.132
3.661IleThr: 3.661 ± 0.104
4.44IleVal: 4.44 ± 0.111
0.566IleTrp: 0.566 ± 0.045
2.147IleTyr: 2.147 ± 0.087
0.0IleXaa: 0.0 ± 0.0
Lys
4.909LysAla: 4.909 ± 0.133
0.696LysCys: 0.696 ± 0.047
3.158LysAsp: 3.158 ± 0.107
4.434LysGlu: 4.434 ± 0.138
1.829LysPhe: 1.829 ± 0.072
3.879LysGly: 3.879 ± 0.117
0.751LysHis: 0.751 ± 0.045
3.987LysIle: 3.987 ± 0.113
4.086LysLys: 4.086 ± 0.144
4.639LysLeu: 4.639 ± 0.127
1.793LysMet: 1.793 ± 0.079
2.478LysAsn: 2.478 ± 0.104
1.84LysPro: 1.84 ± 0.068
1.343LysGln: 1.343 ± 0.056
3.194LysArg: 3.194 ± 0.111
2.837LysSer: 2.837 ± 0.108
2.868LysThr: 2.868 ± 0.086
3.395LysVal: 3.395 ± 0.11
0.586LysTrp: 0.586 ± 0.041
2.078LysTyr: 2.078 ± 0.081
0.0LysXaa: 0.0 ± 0.0
Leu
9.885LeuAla: 9.885 ± 0.203
1.694LeuCys: 1.694 ± 0.071
4.697LeuAsp: 4.697 ± 0.129
5.514LeuGlu: 5.514 ± 0.139
4.221LeuPhe: 4.221 ± 0.161
7.238LeuGly: 7.238 ± 0.152
1.401LeuHis: 1.401 ± 0.065
5.274LeuIle: 5.274 ± 0.148
4.967LeuLys: 4.967 ± 0.14
9.029LeuLeu: 9.029 ± 0.218
2.658LeuMet: 2.658 ± 0.1
2.89LeuAsn: 2.89 ± 0.099
4.63LeuPro: 4.63 ± 0.124
1.862LeuGln: 1.862 ± 0.076
5.357LeuArg: 5.357 ± 0.168
6.708LeuSer: 6.708 ± 0.178
4.893LeuThr: 4.893 ± 0.123
5.807LeuVal: 5.807 ± 0.139
1.017LeuTrp: 1.017 ± 0.055
2.984LeuTyr: 2.984 ± 0.096
0.0LeuXaa: 0.0 ± 0.0
Met
3.069MetAla: 3.069 ± 0.091
0.373MetCys: 0.373 ± 0.033
1.456MetAsp: 1.456 ± 0.06
2.031MetGlu: 2.031 ± 0.076
0.912MetPhe: 0.912 ± 0.054
2.241MetGly: 2.241 ± 0.102
0.472MetHis: 0.472 ± 0.036
1.964MetIle: 1.964 ± 0.077
2.105MetLys: 2.105 ± 0.075
2.865MetLeu: 2.865 ± 0.09
0.895MetMet: 0.895 ± 0.052
1.18MetAsn: 1.18 ± 0.062
1.481MetPro: 1.481 ± 0.064
0.646MetGln: 0.646 ± 0.048
1.984MetArg: 1.984 ± 0.074
1.71MetSer: 1.71 ± 0.062
1.741MetThr: 1.741 ± 0.068
1.945MetVal: 1.945 ± 0.076
0.24MetTrp: 0.24 ± 0.023
0.641MetTyr: 0.641 ± 0.043
0.0MetXaa: 0.0 ± 0.0
Asn
2.926AsnAla: 2.926 ± 0.082
0.638AsnCys: 0.638 ± 0.064
1.868AsnAsp: 1.868 ± 0.081
1.898AsnGlu: 1.898 ± 0.078
1.495AsnPhe: 1.495 ± 0.077
3.056AsnGly: 3.056 ± 0.101
0.412AsnHis: 0.412 ± 0.037
2.672AsnIle: 2.672 ± 0.091
1.694AsnLys: 1.694 ± 0.076
2.782AsnLeu: 2.782 ± 0.095
1.102AsnMet: 1.102 ± 0.054
1.232AsnAsn: 1.232 ± 0.062
1.671AsnPro: 1.671 ± 0.071
0.721AsnGln: 0.721 ± 0.052
1.572AsnArg: 1.572 ± 0.074
1.97AsnSer: 1.97 ± 0.079
1.71AsnThr: 1.71 ± 0.095
2.39AsnVal: 2.39 ± 0.088
0.398AsnTrp: 0.398 ± 0.031
1.111AsnTyr: 1.111 ± 0.058
0.0AsnXaa: 0.0 ± 0.0
Pro
4.376ProAla: 4.376 ± 0.121
0.566ProCys: 0.566 ± 0.046
2.428ProAsp: 2.428 ± 0.082
3.641ProGlu: 3.641 ± 0.107
1.868ProPhe: 1.868 ± 0.071
3.373ProGly: 3.373 ± 0.093
0.765ProHis: 0.765 ± 0.049
2.1ProIle: 2.1 ± 0.08
1.851ProLys: 1.851 ± 0.071
4.21ProLeu: 4.21 ± 0.115
0.995ProMet: 0.995 ± 0.05
1.257ProAsn: 1.257 ± 0.06
1.453ProPro: 1.453 ± 0.068
1.285ProGln: 1.285 ± 0.054
2.033ProArg: 2.033 ± 0.085
2.254ProSer: 2.254 ± 0.088
1.881ProThr: 1.881 ± 0.074
3.713ProVal: 3.713 ± 0.114
0.481ProTrp: 0.481 ± 0.041
1.348ProTyr: 1.348 ± 0.061
0.0ProXaa: 0.0 ± 0.0
Gln
2.318GlnAla: 2.318 ± 0.096
0.293GlnCys: 0.293 ± 0.027
1.047GlnAsp: 1.047 ± 0.053
1.508GlnGlu: 1.508 ± 0.068
0.834GlnPhe: 0.834 ± 0.051
1.741GlnGly: 1.741 ± 0.054
0.378GlnHis: 0.378 ± 0.029
1.663GlnIle: 1.663 ± 0.063
1.395GlnLys: 1.395 ± 0.07
2.13GlnLeu: 2.13 ± 0.086
0.787GlnMet: 0.787 ± 0.046
1.091GlnAsn: 1.091 ± 0.057
0.87GlnPro: 0.87 ± 0.051
0.848GlnGln: 0.848 ± 0.05
1.42GlnArg: 1.42 ± 0.071
1.351GlnSer: 1.351 ± 0.066
1.116GlnThr: 1.116 ± 0.066
1.442GlnVal: 1.442 ± 0.064
0.332GlnTrp: 0.332 ± 0.029
0.754GlnTyr: 0.754 ± 0.05
0.0GlnXaa: 0.0 ± 0.0
Arg
5.31ArgAla: 5.31 ± 0.166
0.879ArgCys: 0.879 ± 0.055
2.857ArgAsp: 2.857 ± 0.101
4.675ArgGlu: 4.675 ± 0.153
2.036ArgPhe: 2.036 ± 0.08
4.578ArgGly: 4.578 ± 0.105
1.036ArgHis: 1.036 ± 0.054
3.647ArgIle: 3.647 ± 0.13
2.868ArgLys: 2.868 ± 0.106
5.418ArgLeu: 5.418 ± 0.145
1.743ArgMet: 1.743 ± 0.076
1.721ArgAsn: 1.721 ± 0.067
2.158ArgPro: 2.158 ± 0.079
1.544ArgGln: 1.544 ± 0.065
4.108ArgArg: 4.108 ± 0.147
2.973ArgSer: 2.973 ± 0.104
2.445ArgThr: 2.445 ± 0.088
3.755ArgVal: 3.755 ± 0.112
0.677ArgTrp: 0.677 ± 0.038
1.962ArgTyr: 1.962 ± 0.087
0.0ArgXaa: 0.0 ± 0.0
Ser
6.186SerAla: 6.186 ± 0.149
1.006SerCys: 1.006 ± 0.051
2.884SerAsp: 2.884 ± 0.102
3.279SerGlu: 3.279 ± 0.103
2.937SerPhe: 2.937 ± 0.097
6.081SerGly: 6.081 ± 0.182
1.011SerHis: 1.011 ± 0.063
3.299SerIle: 3.299 ± 0.115
2.406SerLys: 2.406 ± 0.099
6.042SerLeu: 6.042 ± 0.154
1.782SerMet: 1.782 ± 0.066
1.685SerAsn: 1.685 ± 0.08
2.749SerPro: 2.749 ± 0.096
1.445SerGln: 1.445 ± 0.062
3.067SerArg: 3.067 ± 0.106
3.445SerSer: 3.445 ± 0.12
2.578SerThr: 2.578 ± 0.098
4.744SerVal: 4.744 ± 0.135
0.787SerTrp: 0.787 ± 0.049
1.978SerTyr: 1.978 ± 0.076
0.0SerXaa: 0.0 ± 0.0
Thr
5.694ThrAla: 5.694 ± 0.142
0.558ThrCys: 0.558 ± 0.045
2.597ThrAsp: 2.597 ± 0.092
3.221ThrGlu: 3.221 ± 0.095
2.0ThrPhe: 2.0 ± 0.075
4.663ThrGly: 4.663 ± 0.122
0.785ThrHis: 0.785 ± 0.047
3.116ThrIle: 3.116 ± 0.113
2.556ThrLys: 2.556 ± 0.087
4.862ThrLeu: 4.862 ± 0.116
1.321ThrMet: 1.321 ± 0.048
1.578ThrAsn: 1.578 ± 0.118
2.788ThrPro: 2.788 ± 0.102
1.122ThrGln: 1.122 ± 0.063
2.246ThrArg: 2.246 ± 0.07
2.514ThrSer: 2.514 ± 0.108
2.547ThrThr: 2.547 ± 0.117
4.086ThrVal: 4.086 ± 0.137
0.497ThrTrp: 0.497 ± 0.039
1.243ThrTyr: 1.243 ± 0.065
0.0ThrXaa: 0.0 ± 0.0
Val
6.462ValAla: 6.462 ± 0.162
1.127ValCys: 1.127 ± 0.06
3.418ValAsp: 3.418 ± 0.105
4.061ValGlu: 4.061 ± 0.105
2.804ValPhe: 2.804 ± 0.093
5.288ValGly: 5.288 ± 0.157
0.981ValHis: 0.981 ± 0.051
5.15ValIle: 5.15 ± 0.142
4.161ValLys: 4.161 ± 0.12
6.294ValLeu: 6.294 ± 0.146
2.183ValMet: 2.183 ± 0.076
2.475ValAsn: 2.475 ± 0.085
3.318ValPro: 3.318 ± 0.099
1.484ValGln: 1.484 ± 0.071
4.034ValArg: 4.034 ± 0.122
5.053ValSer: 5.053 ± 0.139
4.534ValThr: 4.534 ± 0.154
5.445ValVal: 5.445 ± 0.154
0.677ValTrp: 0.677 ± 0.042
2.02ValTyr: 2.02 ± 0.09
0.0ValXaa: 0.0 ± 0.0
Trp
0.856TrpAla: 0.856 ± 0.056
0.157TrpCys: 0.157 ± 0.021
0.533TrpAsp: 0.533 ± 0.042
0.727TrpGlu: 0.727 ± 0.047
0.431TrpPhe: 0.431 ± 0.031
0.903TrpGly: 0.903 ± 0.051
0.24TrpHis: 0.24 ± 0.029
0.608TrpIle: 0.608 ± 0.046
0.658TrpLys: 0.658 ± 0.041
1.188TrpLeu: 1.188 ± 0.07
0.307TrpMet: 0.307 ± 0.039
0.517TrpAsn: 0.517 ± 0.04
0.395TrpPro: 0.395 ± 0.037
0.37TrpGln: 0.37 ± 0.032
0.685TrpArg: 0.685 ± 0.046
0.619TrpSer: 0.619 ± 0.042
0.547TrpThr: 0.547 ± 0.042
0.572TrpVal: 0.572 ± 0.038
0.193TrpTrp: 0.193 ± 0.021
0.334TrpTyr: 0.334 ± 0.03
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.926TyrAla: 2.926 ± 0.091
0.536TyrCys: 0.536 ± 0.038
1.97TyrAsp: 1.97 ± 0.073
1.956TyrGlu: 1.956 ± 0.075
1.417TyrPhe: 1.417 ± 0.063
2.893TyrGly: 2.893 ± 0.097
0.506TyrHis: 0.506 ± 0.039
1.973TyrIle: 1.973 ± 0.078
1.594TyrLys: 1.594 ± 0.066
2.945TyrLeu: 2.945 ± 0.09
0.821TyrMet: 0.821 ± 0.043
1.149TyrAsn: 1.149 ± 0.058
1.348TyrPro: 1.348 ± 0.057
0.718TyrGln: 0.718 ± 0.047
1.895TyrArg: 1.895 ± 0.073
1.953TyrSer: 1.953 ± 0.07
1.652TyrThr: 1.652 ± 0.066
2.011TyrVal: 2.011 ± 0.092
0.326TyrTrp: 0.326 ± 0.032
1.091TyrTyr: 1.091 ± 0.06
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1082 proteins (361961 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski