Amino acid dipeptide frequency for Escherichia virus CBA120

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
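The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names (`dipeptide_permille`, `classify`) are hypothetical, overlapping dipeptides are counted within each protein, any non-standard letter is mapped to X (Xaa), and frequencies are normalised by the total number of dipeptides. The database's exact normalisation is not stated on this page, so the numbers produced here need not match the table exactly.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids (one-letter codes)
EXPECTED_PERMILLE = 1000.0 / 400    # uniform expectation: 2.5 per mille (0.25%)


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping dipeptides are counted inside each protein
    (never across protein boundaries) and normalised by their total number.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(value_permille, expected=EXPECTED_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if value_permille > expected:
        return "over"    # would be marked red in the table
    if value_permille < expected:
        return "under"   # would be marked blue in the table
    return "expected"


if __name__ == "__main__":
    toy = ["MKAAL", "MAACK"]  # toy sequences, not taken from the proteome
    for dp, freq in sorted(dipeptide_permille(toy).items()):
        print(dp, round(freq, 1), classify(freq))
```

With the table values, `classify(5.305)` (AlaAla) gives "over" and `classify(0.689)` (AlaCys) gives "under".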

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
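As a quick check of the units, the conversion can be written out as below. The helper names are hypothetical, and the example value is the AlaAla entry from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0


ala_ala = 5.305  # AlaAla frequency from the table, in per mille
print(permille_to_fraction(ala_ala))  # roughly 0.0053
print(permille_to_percent(ala_ala))   # roughly 0.53 (percent)
```

On this scale the uniform expectation of 0.25% corresponds to 2.5‰, which is the value the table entries are compared against.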

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
5.305AlaAla: 5.305 ± 0.354
0.689AlaCys: 0.689 ± 0.115
4.219AlaAsp: 4.219 ± 0.33
4.24AlaGlu: 4.24 ± 0.409
2.59AlaPhe: 2.59 ± 0.242
4.282AlaGly: 4.282 ± 0.413
1.379AlaHis: 1.379 ± 0.176
4.324AlaIle: 4.324 ± 0.307
3.885AlaLys: 3.885 ± 0.262
5.159AlaLeu: 5.159 ± 0.311
1.796AlaMet: 1.796 ± 0.212
3.175AlaAsn: 3.175 ± 0.229
2.381AlaPro: 2.381 ± 0.22
2.632AlaGln: 2.632 ± 0.247
3.342AlaArg: 3.342 ± 0.247
3.948AlaSer: 3.948 ± 0.266
4.094AlaThr: 4.094 ± 0.312
4.909AlaVal: 4.909 ± 0.306
0.877AlaTrp: 0.877 ± 0.127
2.506AlaTyr: 2.506 ± 0.234
0.0AlaXaa: 0.0 ± 0.0
Cys
0.731CysAla: 0.731 ± 0.117
0.188CysCys: 0.188 ± 0.063
0.815CysAsp: 0.815 ± 0.167
0.919CysGlu: 0.919 ± 0.128
0.397CysPhe: 0.397 ± 0.089
0.794CysGly: 0.794 ± 0.138
0.439CysHis: 0.439 ± 0.093
0.752CysIle: 0.752 ± 0.117
0.627CysLys: 0.627 ± 0.124
0.606CysLeu: 0.606 ± 0.117
0.313CysMet: 0.313 ± 0.079
0.543CysAsn: 0.543 ± 0.117
0.627CysPro: 0.627 ± 0.108
0.272CysGln: 0.272 ± 0.065
0.46CysArg: 0.46 ± 0.103
1.023CysSer: 1.023 ± 0.147
0.689CysThr: 0.689 ± 0.115
0.961CysVal: 0.961 ± 0.132
0.125CysTrp: 0.125 ± 0.047
0.397CysTyr: 0.397 ± 0.097
0.0CysXaa: 0.0 ± 0.0
Asp
4.533AspAla: 4.533 ± 0.294
0.752AspCys: 0.752 ± 0.129
3.781AspAsp: 3.781 ± 0.346
3.885AspGlu: 3.885 ± 0.318
3.112AspPhe: 3.112 ± 0.284
4.888AspGly: 4.888 ± 0.302
0.982AspHis: 0.982 ± 0.152
4.449AspIle: 4.449 ± 0.263
3.739AspLys: 3.739 ± 0.27
6.162AspLeu: 6.162 ± 0.456
2.172AspMet: 2.172 ± 0.212
3.133AspAsn: 3.133 ± 0.295
2.757AspPro: 2.757 ± 0.255
1.817AspGln: 1.817 ± 0.181
2.318AspArg: 2.318 ± 0.23
4.094AspSer: 4.094 ± 0.275
3.614AspThr: 3.614 ± 0.309
4.219AspVal: 4.219 ± 0.252
0.94AspTrp: 0.94 ± 0.166
3.133AspTyr: 3.133 ± 0.278
0.0AspXaa: 0.0 ± 0.0
Glu
4.136GluAla: 4.136 ± 0.386
0.648GluCys: 0.648 ± 0.127
4.24GluAsp: 4.24 ± 0.285
4.345GluGlu: 4.345 ± 0.377
3.05GluPhe: 3.05 ± 0.281
4.136GluGly: 4.136 ± 0.311
1.379GluHis: 1.379 ± 0.178
4.386GluIle: 4.386 ± 0.266
3.614GluLys: 3.614 ± 0.362
6.162GluLeu: 6.162 ± 0.389
2.026GluMet: 2.026 ± 0.198
3.029GluAsn: 3.029 ± 0.226
1.901GluPro: 1.901 ± 0.214
2.778GluGln: 2.778 ± 0.262
3.634GluArg: 3.634 ± 0.331
3.593GluSer: 3.593 ± 0.317
3.53GluThr: 3.53 ± 0.264
4.428GluVal: 4.428 ± 0.365
1.086GluTrp: 1.086 ± 0.16
2.987GluTyr: 2.987 ± 0.279
0.0GluXaa: 0.0 ± 0.0
Phe
2.381PheAla: 2.381 ± 0.228
0.418PheCys: 0.418 ± 0.092
3.008PheAsp: 3.008 ± 0.225
3.029PheGlu: 3.029 ± 0.271
1.734PhePhe: 1.734 ± 0.204
3.217PheGly: 3.217 ± 0.26
0.961PheHis: 0.961 ± 0.167
2.694PheIle: 2.694 ± 0.208
2.694PheLys: 2.694 ± 0.264
2.799PheLeu: 2.799 ± 0.222
1.232PheMet: 1.232 ± 0.178
2.736PheAsn: 2.736 ± 0.242
1.42PhePro: 1.42 ± 0.183
1.567PheGln: 1.567 ± 0.151
2.256PheArg: 2.256 ± 0.234
2.966PheSer: 2.966 ± 0.258
2.778PheThr: 2.778 ± 0.249
3.196PheVal: 3.196 ± 0.235
0.752PheTrp: 0.752 ± 0.118
1.587PheTyr: 1.587 ± 0.186
0.0PheXaa: 0.0 ± 0.0
Gly
3.801GlyAla: 3.801 ± 0.367
0.982GlyCys: 0.982 ± 0.15
4.177GlyAsp: 4.177 ± 0.291
4.553GlyGlu: 4.553 ± 0.266
2.882GlyPhe: 2.882 ± 0.265
5.076GlyGly: 5.076 ± 0.509
1.295GlyHis: 1.295 ± 0.188
4.888GlyIle: 4.888 ± 0.303
5.096GlyLys: 5.096 ± 0.351
5.264GlyLeu: 5.264 ± 0.365
1.755GlyMet: 1.755 ± 0.181
3.572GlyAsn: 3.572 ± 0.295
1.149GlyPro: 1.149 ± 0.118
2.486GlyGln: 2.486 ± 0.199
2.715GlyArg: 2.715 ± 0.224
5.159GlySer: 5.159 ± 0.507
3.697GlyThr: 3.697 ± 0.315
4.95GlyVal: 4.95 ± 0.317
1.191GlyTrp: 1.191 ± 0.166
2.611GlyTyr: 2.611 ± 0.246
0.0GlyXaa: 0.0 ± 0.0
His
1.086HisAla: 1.086 ± 0.154
0.292HisCys: 0.292 ± 0.071
1.274HisAsp: 1.274 ± 0.186
0.731HisGlu: 0.731 ± 0.111
0.919HisPhe: 0.919 ± 0.146
0.919HisGly: 0.919 ± 0.145
0.501HisHis: 0.501 ± 0.111
1.399HisIle: 1.399 ± 0.196
1.149HisLys: 1.149 ± 0.17
1.587HisLeu: 1.587 ± 0.166
0.627HisMet: 0.627 ± 0.111
0.668HisAsn: 0.668 ± 0.112
1.044HisPro: 1.044 ± 0.154
0.543HisGln: 0.543 ± 0.107
1.107HisArg: 1.107 ± 0.16
0.961HisSer: 0.961 ± 0.138
1.253HisThr: 1.253 ± 0.193
1.42HisVal: 1.42 ± 0.205
0.209HisTrp: 0.209 ± 0.077
1.065HisTyr: 1.065 ± 0.169
0.0HisXaa: 0.0 ± 0.0
Ile
4.052IleAla: 4.052 ± 0.326
0.731IleCys: 0.731 ± 0.135
5.222IleAsp: 5.222 ± 0.286
4.783IleGlu: 4.783 ± 0.355
1.943IlePhe: 1.943 ± 0.22
3.843IleGly: 3.843 ± 0.271
1.399IleHis: 1.399 ± 0.209
3.551IleIle: 3.551 ± 0.31
3.906IleLys: 3.906 ± 0.274
4.533IleLeu: 4.533 ± 0.333
1.65IleMet: 1.65 ± 0.161
3.76IleAsn: 3.76 ± 0.358
2.924IlePro: 2.924 ± 0.253
2.736IleGln: 2.736 ± 0.272
3.175IleArg: 3.175 ± 0.247
3.718IleSer: 3.718 ± 0.301
4.512IleThr: 4.512 ± 0.309
4.219IleVal: 4.219 ± 0.334
0.815IleTrp: 0.815 ± 0.147
2.11IleTyr: 2.11 ± 0.207
0.0IleXaa: 0.0 ± 0.0
Lys
4.24LysAla: 4.24 ± 0.369
0.585LysCys: 0.585 ± 0.125
3.676LysAsp: 3.676 ± 0.242
4.428LysGlu: 4.428 ± 0.359
3.175LysPhe: 3.175 ± 0.216
3.697LysGly: 3.697 ± 0.298
1.17LysHis: 1.17 ± 0.212
4.115LysIle: 4.115 ± 0.288
4.24LysLys: 4.24 ± 0.407
5.076LysLeu: 5.076 ± 0.278
2.298LysMet: 2.298 ± 0.237
2.674LysAsn: 2.674 ± 0.229
2.632LysPro: 2.632 ± 0.227
3.05LysGln: 3.05 ± 0.243
3.07LysArg: 3.07 ± 0.29
4.136LysSer: 4.136 ± 0.268
4.073LysThr: 4.073 ± 0.242
4.219LysVal: 4.219 ± 0.327
0.982LysTrp: 0.982 ± 0.153
2.256LysTyr: 2.256 ± 0.228
0.0LysXaa: 0.0 ± 0.0
Leu
5.932LeuAla: 5.932 ± 0.429
0.815LeuCys: 0.815 ± 0.139
5.096LeuAsp: 5.096 ± 0.3
4.992LeuGlu: 4.992 ± 0.315
3.509LeuPhe: 3.509 ± 0.274
5.284LeuGly: 5.284 ± 0.344
1.253LeuHis: 1.253 ± 0.207
4.24LeuIle: 4.24 ± 0.342
5.786LeuLys: 5.786 ± 0.395
5.974LeuLeu: 5.974 ± 0.419
1.984LeuMet: 1.984 ± 0.229
4.533LeuAsn: 4.533 ± 0.309
3.676LeuPro: 3.676 ± 0.355
2.862LeuGln: 2.862 ± 0.256
3.781LeuArg: 3.781 ± 0.259
5.869LeuSer: 5.869 ± 0.382
4.95LeuThr: 4.95 ± 0.27
5.347LeuVal: 5.347 ± 0.365
0.71LeuTrp: 0.71 ± 0.145
3.238LeuTyr: 3.238 ± 0.238
0.0LeuXaa: 0.0 ± 0.0
Met
2.486MetAla: 2.486 ± 0.21
0.355MetCys: 0.355 ± 0.094
1.525MetAsp: 1.525 ± 0.176
1.504MetGlu: 1.504 ± 0.185
1.462MetPhe: 1.462 ± 0.196
1.379MetGly: 1.379 ± 0.156
0.397MetHis: 0.397 ± 0.096
1.587MetIle: 1.587 ± 0.168
2.339MetLys: 2.339 ± 0.218
2.402MetLeu: 2.402 ± 0.232
1.003MetMet: 1.003 ± 0.143
1.65MetAsn: 1.65 ± 0.223
1.065MetPro: 1.065 ± 0.158
0.961MetGln: 0.961 ± 0.145
1.608MetArg: 1.608 ± 0.201
2.172MetSer: 2.172 ± 0.202
1.88MetThr: 1.88 ± 0.202
1.692MetVal: 1.692 ± 0.197
0.251MetTrp: 0.251 ± 0.069
0.898MetTyr: 0.898 ± 0.129
0.0MetXaa: 0.0 ± 0.0
Asn
3.927AsnAla: 3.927 ± 0.286
0.794AsnCys: 0.794 ± 0.134
2.987AsnAsp: 2.987 ± 0.277
2.674AsnGlu: 2.674 ± 0.219
2.11AsnPhe: 2.11 ± 0.2
4.491AsnGly: 4.491 ± 0.333
1.086AsnHis: 1.086 ± 0.181
3.279AsnIle: 3.279 ± 0.281
3.321AsnLys: 3.321 ± 0.275
3.572AsnLeu: 3.572 ± 0.277
1.775AsnMet: 1.775 ± 0.209
3.07AsnAsn: 3.07 ± 0.304
2.381AsnPro: 2.381 ± 0.209
2.172AsnGln: 2.172 ± 0.203
2.506AsnArg: 2.506 ± 0.258
2.903AsnSer: 2.903 ± 0.281
2.841AsnThr: 2.841 ± 0.269
3.572AsnVal: 3.572 ± 0.296
0.71AsnTrp: 0.71 ± 0.123
1.796AsnTyr: 1.796 ± 0.217
0.0AsnXaa: 0.0 ± 0.0
Pro
2.465ProAla: 2.465 ± 0.232
0.439ProCys: 0.439 ± 0.098
3.029ProAsp: 3.029 ± 0.252
3.488ProGlu: 3.488 ± 0.294
1.692ProPhe: 1.692 ± 0.22
2.59ProGly: 2.59 ± 0.213
0.606ProHis: 0.606 ± 0.111
2.402ProIle: 2.402 ± 0.211
2.298ProLys: 2.298 ± 0.219
2.924ProLeu: 2.924 ± 0.233
0.856ProMet: 0.856 ± 0.109
1.922ProAsn: 1.922 ± 0.168
1.274ProPro: 1.274 ± 0.165
1.316ProGln: 1.316 ± 0.174
1.713ProArg: 1.713 ± 0.205
2.736ProSer: 2.736 ± 0.263
2.423ProThr: 2.423 ± 0.265
2.862ProVal: 2.862 ± 0.247
0.564ProTrp: 0.564 ± 0.095
1.379ProTyr: 1.379 ± 0.198
0.0ProXaa: 0.0 ± 0.0
Gln
2.632GlnAla: 2.632 ± 0.246
0.376GlnCys: 0.376 ± 0.072
2.047GlnAsp: 2.047 ± 0.214
2.277GlnGlu: 2.277 ± 0.273
1.943GlnPhe: 1.943 ± 0.21
2.318GlnGly: 2.318 ± 0.204
0.794GlnHis: 0.794 ± 0.12
2.632GlnIle: 2.632 ± 0.244
2.172GlnLys: 2.172 ± 0.209
3.363GlnLeu: 3.363 ± 0.313
1.128GlnMet: 1.128 ± 0.163
1.608GlnAsn: 1.608 ± 0.192
1.232GlnPro: 1.232 ± 0.176
1.859GlnGln: 1.859 ± 0.212
2.277GlnArg: 2.277 ± 0.206
2.444GlnSer: 2.444 ± 0.276
2.298GlnThr: 2.298 ± 0.204
2.862GlnVal: 2.862 ± 0.237
0.522GlnTrp: 0.522 ± 0.094
1.546GlnTyr: 1.546 ± 0.177
0.0GlnXaa: 0.0 ± 0.0
Arg
2.924ArgAla: 2.924 ± 0.227
0.668ArgCys: 0.668 ± 0.12
3.05ArgAsp: 3.05 ± 0.277
3.154ArgGlu: 3.154 ± 0.271
2.235ArgPhe: 2.235 ± 0.202
2.757ArgGly: 2.757 ± 0.221
1.003ArgHis: 1.003 ± 0.133
3.154ArgIle: 3.154 ± 0.276
3.05ArgLys: 3.05 ± 0.27
4.428ArgLeu: 4.428 ± 0.334
1.441ArgMet: 1.441 ± 0.178
2.59ArgAsn: 2.59 ± 0.229
1.734ArgPro: 1.734 ± 0.262
2.256ArgGln: 2.256 ± 0.183
2.966ArgArg: 2.966 ± 0.317
3.112ArgSer: 3.112 ± 0.286
2.318ArgThr: 2.318 ± 0.235
3.279ArgVal: 3.279 ± 0.252
0.815ArgTrp: 0.815 ± 0.119
2.193ArgTyr: 2.193 ± 0.215
0.0ArgXaa: 0.0 ± 0.0
Ser
3.739SerAla: 3.739 ± 0.311
0.648SerCys: 0.648 ± 0.134
3.614SerAsp: 3.614 ± 0.222
3.989SerGlu: 3.989 ± 0.262
2.841SerPhe: 2.841 ± 0.22
5.222SerGly: 5.222 ± 0.503
0.773SerHis: 0.773 ± 0.147
4.512SerIle: 4.512 ± 0.315
4.031SerLys: 4.031 ± 0.309
5.514SerLeu: 5.514 ± 0.388
1.901SerMet: 1.901 ± 0.204
3.634SerAsn: 3.634 ± 0.268
2.611SerPro: 2.611 ± 0.227
2.339SerGln: 2.339 ± 0.244
3.238SerArg: 3.238 ± 0.309
4.136SerSer: 4.136 ± 0.347
3.864SerThr: 3.864 ± 0.371
4.846SerVal: 4.846 ± 0.391
0.815SerTrp: 0.815 ± 0.135
2.527SerTyr: 2.527 ± 0.231
0.0SerXaa: 0.0 ± 0.0
Thr
4.031ThrAla: 4.031 ± 0.377
0.585ThrCys: 0.585 ± 0.113
3.634ThrAsp: 3.634 ± 0.285
3.822ThrGlu: 3.822 ± 0.327
2.59ThrPhe: 2.59 ± 0.287
4.449ThrGly: 4.449 ± 0.365
0.982ThrHis: 0.982 ± 0.166
4.094ThrIle: 4.094 ± 0.366
3.593ThrLys: 3.593 ± 0.314
4.888ThrLeu: 4.888 ± 0.355
1.399ThrMet: 1.399 ± 0.162
2.653ThrAsn: 2.653 ± 0.258
3.509ThrPro: 3.509 ± 0.293
2.318ThrGln: 2.318 ± 0.19
2.945ThrArg: 2.945 ± 0.27
3.76ThrSer: 3.76 ± 0.346
4.01ThrThr: 4.01 ± 0.349
4.47ThrVal: 4.47 ± 0.345
0.752ThrTrp: 0.752 ± 0.13
1.734ThrTyr: 1.734 ± 0.219
0.0ThrXaa: 0.0 ± 0.0
Val
3.948ValAla: 3.948 ± 0.285
0.835ValCys: 0.835 ± 0.163
5.18ValAsp: 5.18 ± 0.313
4.888ValGlu: 4.888 ± 0.388
2.799ValPhe: 2.799 ± 0.281
4.762ValGly: 4.762 ± 0.33
1.149ValHis: 1.149 ± 0.156
4.407ValIle: 4.407 ± 0.323
5.117ValLys: 5.117 ± 0.4
5.117ValLeu: 5.117 ± 0.306
1.65ValMet: 1.65 ± 0.157
3.906ValAsn: 3.906 ± 0.283
2.527ValPro: 2.527 ± 0.236
2.444ValGln: 2.444 ± 0.246
3.008ValArg: 3.008 ± 0.224
4.929ValSer: 4.929 ± 0.374
4.637ValThr: 4.637 ± 0.434
5.911ValVal: 5.911 ± 0.386
1.149ValTrp: 1.149 ± 0.163
3.091ValTyr: 3.091 ± 0.25
0.0ValXaa: 0.0 ± 0.0
Trp
0.94TrpAla: 0.94 ± 0.136
0.313TrpCys: 0.313 ± 0.088
1.065TrpAsp: 1.065 ± 0.176
1.149TrpGlu: 1.149 ± 0.169
0.648TrpPhe: 0.648 ± 0.127
0.71TrpGly: 0.71 ± 0.106
0.209TrpHis: 0.209 ± 0.074
0.689TrpIle: 0.689 ± 0.142
0.835TrpLys: 0.835 ± 0.125
1.379TrpLeu: 1.379 ± 0.147
0.46TrpMet: 0.46 ± 0.098
0.689TrpAsn: 0.689 ± 0.107
0.418TrpPro: 0.418 ± 0.095
0.376TrpGln: 0.376 ± 0.083
0.898TrpArg: 0.898 ± 0.137
0.689TrpSer: 0.689 ± 0.116
0.731TrpThr: 0.731 ± 0.127
1.149TrpVal: 1.149 ± 0.149
0.146TrpTrp: 0.146 ± 0.057
0.418TrpTyr: 0.418 ± 0.099
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.444TyrAla: 2.444 ± 0.187
0.564TyrCys: 0.564 ± 0.116
2.841TyrAsp: 2.841 ± 0.25
2.235TyrGlu: 2.235 ± 0.178
1.796TyrPhe: 1.796 ± 0.164
2.486TyrGly: 2.486 ± 0.224
1.003TyrHis: 1.003 ± 0.159
2.047TyrIle: 2.047 ± 0.232
2.298TyrLys: 2.298 ± 0.198
2.924TyrLeu: 2.924 ± 0.288
1.128TyrMet: 1.128 ± 0.146
2.423TyrAsn: 2.423 ± 0.202
1.713TyrPro: 1.713 ± 0.186
1.546TyrGln: 1.546 ± 0.197
2.11TyrArg: 2.11 ± 0.203
2.444TyrSer: 2.444 ± 0.219
2.026TyrThr: 2.026 ± 0.318
2.945TyrVal: 2.945 ± 0.235
0.501TyrTrp: 0.501 ± 0.099
1.483TyrTyr: 1.483 ± 0.162
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 204 proteins (47877 amino acids)

Note: The error was estimated by bootstrapping (100 resamplings) at the protein level.
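A protein-level bootstrap of this kind could look roughly like the sketch below. It is not the database's actual code. It assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each resample, and that the reported error is the standard deviation over the resamples. The function names and the fixed random seed are illustrative.

```python
import random
import statistics


def permille(proteins, dipeptide):
    """Frequency of one dipeptide (per mille) over a list of sequences."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide  # True counts as 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(permille(sample, dipeptide))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["MKAAL", "MAACK", "AAAAG", "MKLLV"]  # toy data, not the real proteome
    print(permille(toy, "AA"), "±", bootstrap_error(toy, "AA"))
```

Resampling whole proteins rather than individual dipeptides keeps each protein's internal composition intact, so the spread reflects variation between proteins.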

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski