Amino acid dipeptide frequency for Exiguobacterium phage vB_EalM-132

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
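A minimal sketch of how such frequencies could be counted is given here. It assumes one-letter amino acid codes, overlapping adjacent pairs, and normalization by the total number of pairs; the page does not state its exact denominator, so that choice is an assumption, and the function name and toy sequences are hypothetical.

```python
from collections import Counter

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for adjacent residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Each protein is scanned separately, so no pair spans two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalize by the total number of observed pairs.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Hypothetical toy sequences, not taken from the phage proteome.
toy_proteins = ["MKKLA", "MAKK"]
freqs = dipeptide_per_mille(toy_proteins)
```

Because the pairs are ordered, "AK" and "KA" are counted separately, which matches the table layout where AlaLys and LysAla have their own cells.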

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
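The comparison against the uniform expectation can be sketched as follows. The strict greater-than/less-than threshold is an assumption (the page does not say whether the error estimate plays a role in the coloring), and the function name is hypothetical; the three example values are taken from the table.

```python
N_STANDARD = 20  # standard amino acids, Xaa excluded

# Uniform expectation: 1 / 400 of all dipeptides, expressed in per mille.
expected_per_mille = 1000.0 / (N_STANDARD ** 2)

def classify(value_per_mille, expected=expected_per_mille):
    """Label a dipeptide as over- or underrepresented (assumed strict threshold)."""
    if value_per_mille > expected:
        return "red"    # more frequent than expected
    if value_per_mille < expected:
        return "blue"   # less frequent than expected
    return "neutral"

# Values copied from the table (per mille).
examples = {"GluGlu": 6.898, "AlaAla": 1.037, "CysCys": 0.052}
labels = {pair: classify(value) for pair, value in examples.items()}
```

Under this assumption GluGlu would be marked in red, while AlaAla and CysCys would be marked in blue.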

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
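As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and into a percentage. The helper names are hypothetical; the input value is the AlaAla entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert per mille to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert per mille to percent (divide by 10)."""
    return value / 10.0

ala_ala_fraction = per_mille_to_fraction(1.037)  # AlaAla from the table
uniform_percent = per_mille_to_percent(2.5)      # uniform expectation, 1/400
```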

For more information, see the sequence space article on Wikipedia.

Columns (second amino acid): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
1.037AlaAla: 1.037 ± 0.143
0.311AlaCys: 0.311 ± 0.086
4.253AlaAsp: 4.253 ± 0.329
4.46AlaGlu: 4.46 ± 0.369
2.178AlaPhe: 2.178 ± 0.295
3.89AlaGly: 3.89 ± 0.331
1.634AlaHis: 1.634 ± 0.243
5.394AlaIle: 5.394 ± 0.418
5.783AlaLys: 5.783 ± 0.47
5.523AlaLeu: 5.523 ± 0.352
1.867AlaMet: 1.867 ± 0.211
3.164AlaAsn: 3.164 ± 0.346
2.256AlaPro: 2.256 ± 0.231
2.178AlaGln: 2.178 ± 0.264
3.008AlaArg: 3.008 ± 0.264
3.993AlaSer: 3.993 ± 0.387
5.264AlaThr: 5.264 ± 0.609
4.564AlaVal: 4.564 ± 0.361
0.545AlaTrp: 0.545 ± 0.16
2.956AlaTyr: 2.956 ± 0.312
0.0AlaXaa: 0.0 ± 0.0
Cys
0.311CysAla: 0.311 ± 0.093
0.052CysCys: 0.052 ± 0.033
0.441CysAsp: 0.441 ± 0.115
0.648CysGlu: 0.648 ± 0.138
0.182CysPhe: 0.182 ± 0.063
0.57CysGly: 0.57 ± 0.141
0.052CysHis: 0.052 ± 0.04
0.285CysIle: 0.285 ± 0.078
0.545CysLys: 0.545 ± 0.142
0.622CysLeu: 0.622 ± 0.137
0.207CysMet: 0.207 ± 0.09
0.207CysAsn: 0.207 ± 0.082
0.389CysPro: 0.389 ± 0.125
0.311CysGln: 0.311 ± 0.091
0.285CysArg: 0.285 ± 0.095
0.467CysSer: 0.467 ± 0.114
0.519CysThr: 0.519 ± 0.104
0.519CysVal: 0.519 ± 0.098
0.078CysTrp: 0.078 ± 0.045
0.207CysTyr: 0.207 ± 0.08
0.0CysXaa: 0.0 ± 0.0
Asp
4.149AspAla: 4.149 ± 0.445
0.311AspCys: 0.311 ± 0.09
3.864AspAsp: 3.864 ± 0.464
4.305AspGlu: 4.305 ± 0.386
2.697AspPhe: 2.697 ± 0.254
3.967AspGly: 3.967 ± 0.379
1.322AspHis: 1.322 ± 0.302
5.264AspIle: 5.264 ± 0.458
4.123AspLys: 4.123 ± 0.38
6.197AspLeu: 6.197 ± 0.458
1.686AspMet: 1.686 ± 0.215
3.656AspAsn: 3.656 ± 0.361
2.23AspPro: 2.23 ± 0.218
1.348AspGln: 1.348 ± 0.201
2.775AspArg: 2.775 ± 0.331
4.46AspSer: 4.46 ± 0.329
4.642AspThr: 4.642 ± 0.408
3.838AspVal: 3.838 ± 0.304
0.934AspTrp: 0.934 ± 0.136
3.034AspTyr: 3.034 ± 0.311
0.0AspXaa: 0.0 ± 0.0
Glu
4.538GluAla: 4.538 ± 0.351
0.545GluCys: 0.545 ± 0.112
5.783GluAsp: 5.783 ± 0.561
6.898GluGlu: 6.898 ± 0.619
2.93GluPhe: 2.93 ± 0.256
4.901GluGly: 4.901 ± 0.32
1.867GluHis: 1.867 ± 0.326
4.305GluIle: 4.305 ± 0.321
4.849GluLys: 4.849 ± 0.505
6.327GluLeu: 6.327 ± 0.491
1.841GluMet: 1.841 ± 0.165
3.138GluAsn: 3.138 ± 0.315
1.841GluPro: 1.841 ± 0.215
3.371GluGln: 3.371 ± 0.31
3.371GluArg: 3.371 ± 0.374
4.279GluSer: 4.279 ± 0.37
3.345GluThr: 3.345 ± 0.269
5.783GluVal: 5.783 ± 0.406
0.959GluTrp: 0.959 ± 0.197
3.656GluTyr: 3.656 ± 0.355
0.0GluXaa: 0.0 ± 0.0
Phe
1.737PheAla: 1.737 ± 0.211
0.233PheCys: 0.233 ± 0.073
1.867PheAsp: 1.867 ± 0.194
1.841PheGlu: 1.841 ± 0.201
0.934PhePhe: 0.934 ± 0.155
2.023PheGly: 2.023 ± 0.238
0.674PheHis: 0.674 ± 0.151
2.152PheIle: 2.152 ± 0.235
2.541PheLys: 2.541 ± 0.256
2.36PheLeu: 2.36 ± 0.275
1.193PheMet: 1.193 ± 0.203
1.815PheAsn: 1.815 ± 0.224
1.374PhePro: 1.374 ± 0.195
0.882PheGln: 0.882 ± 0.142
1.4PheArg: 1.4 ± 0.183
2.308PheSer: 2.308 ± 0.262
2.515PheThr: 2.515 ± 0.316
2.36PheVal: 2.36 ± 0.302
0.363PheTrp: 0.363 ± 0.099
1.608PheTyr: 1.608 ± 0.171
0.0PheXaa: 0.0 ± 0.0
Gly
4.512GlyAla: 4.512 ± 0.385
0.233GlyCys: 0.233 ± 0.08
4.253GlyAsp: 4.253 ± 0.393
4.33GlyGlu: 4.33 ± 0.386
2.256GlyPhe: 2.256 ± 0.267
4.486GlyGly: 4.486 ± 0.509
1.556GlyHis: 1.556 ± 0.211
4.538GlyIle: 4.538 ± 0.292
5.005GlyLys: 5.005 ± 0.413
4.564GlyLeu: 4.564 ± 0.277
1.737GlyMet: 1.737 ± 0.201
3.501GlyAsn: 3.501 ± 0.308
0.156GlyPro: 0.156 ± 0.068
2.023GlyGln: 2.023 ± 0.235
2.489GlyArg: 2.489 ± 0.248
3.215GlySer: 3.215 ± 0.316
5.082GlyThr: 5.082 ± 0.613
5.031GlyVal: 5.031 ± 0.415
0.934GlyTrp: 0.934 ± 0.181
2.93GlyTyr: 2.93 ± 0.273
0.0GlyXaa: 0.0 ± 0.0
His
1.634HisAla: 1.634 ± 0.204
0.233HisCys: 0.233 ± 0.09
0.882HisAsp: 0.882 ± 0.173
1.608HisGlu: 1.608 ± 0.299
0.648HisPhe: 0.648 ± 0.125
1.193HisGly: 1.193 ± 0.177
1.037HisHis: 1.037 ± 0.242
1.867HisIle: 1.867 ± 0.209
1.841HisLys: 1.841 ± 0.246
2.515HisLeu: 2.515 ± 0.275
0.934HisMet: 0.934 ± 0.186
1.374HisAsn: 1.374 ± 0.178
1.374HisPro: 1.374 ± 0.208
0.648HisGln: 0.648 ± 0.113
1.089HisArg: 1.089 ± 0.168
1.348HisSer: 1.348 ± 0.175
1.634HisThr: 1.634 ± 0.229
1.66HisVal: 1.66 ± 0.218
0.441HisTrp: 0.441 ± 0.103
1.089HisTyr: 1.089 ± 0.162
0.0HisXaa: 0.0 ± 0.0
Ile
3.993IleAla: 3.993 ± 0.366
0.57IleCys: 0.57 ± 0.119
4.123IleAsp: 4.123 ± 0.319
4.849IleGlu: 4.849 ± 0.327
1.426IlePhe: 1.426 ± 0.177
3.734IleGly: 3.734 ± 0.335
1.504IleHis: 1.504 ± 0.182
3.864IleIle: 3.864 ± 0.376
5.394IleLys: 5.394 ± 0.451
5.186IleLeu: 5.186 ± 0.337
1.711IleMet: 1.711 ± 0.244
3.553IleAsn: 3.553 ± 0.377
2.826IlePro: 2.826 ± 0.281
2.126IleGln: 2.126 ± 0.195
3.138IleArg: 3.138 ± 0.266
3.838IleSer: 3.838 ± 0.388
5.031IleThr: 5.031 ± 0.442
3.967IleVal: 3.967 ± 0.388
0.467IleTrp: 0.467 ± 0.116
2.412IleTyr: 2.412 ± 0.309
0.0IleXaa: 0.0 ± 0.0
Lys
5.705LysAla: 5.705 ± 0.402
0.7LysCys: 0.7 ± 0.126
5.031LysAsp: 5.031 ± 0.504
6.275LysGlu: 6.275 ± 0.567
2.438LysPhe: 2.438 ± 0.264
4.693LysGly: 4.693 ± 0.367
2.334LysHis: 2.334 ± 0.344
3.786LysIle: 3.786 ± 0.255
6.016LysLys: 6.016 ± 0.745
5.523LysLeu: 5.523 ± 0.393
1.841LysMet: 1.841 ± 0.203
2.749LysAsn: 2.749 ± 0.28
2.593LysPro: 2.593 ± 0.299
2.982LysGln: 2.982 ± 0.284
2.878LysArg: 2.878 ± 0.332
4.849LysSer: 4.849 ± 0.559
3.241LysThr: 3.241 ± 0.3
4.616LysVal: 4.616 ± 0.385
0.804LysTrp: 0.804 ± 0.165
3.371LysTyr: 3.371 ± 0.285
0.0LysXaa: 0.0 ± 0.0
Leu
5.809LeuAla: 5.809 ± 0.445
0.57LeuCys: 0.57 ± 0.139
5.523LeuAsp: 5.523 ± 0.38
6.82LeuGlu: 6.82 ± 0.414
2.386LeuPhe: 2.386 ± 0.264
5.212LeuGly: 5.212 ± 0.456
1.841LeuHis: 1.841 ± 0.251
4.797LeuIle: 4.797 ± 0.369
5.29LeuLys: 5.29 ± 0.437
6.172LeuLeu: 6.172 ± 0.438
2.282LeuMet: 2.282 ± 0.244
4.33LeuAsn: 4.33 ± 0.323
3.008LeuPro: 3.008 ± 0.259
2.956LeuGln: 2.956 ± 0.283
4.279LeuArg: 4.279 ± 0.272
6.172LeuSer: 6.172 ± 0.348
5.653LeuThr: 5.653 ± 0.475
5.497LeuVal: 5.497 ± 0.424
1.037LeuTrp: 1.037 ± 0.163
3.319LeuTyr: 3.319 ± 0.288
0.0LeuXaa: 0.0 ± 0.0
Met
2.645MetAla: 2.645 ± 0.279
0.207MetCys: 0.207 ± 0.086
1.374MetAsp: 1.374 ± 0.154
2.334MetGlu: 2.334 ± 0.245
0.752MetPhe: 0.752 ± 0.143
1.322MetGly: 1.322 ± 0.213
0.674MetHis: 0.674 ± 0.148
1.193MetIle: 1.193 ± 0.18
2.049MetLys: 2.049 ± 0.248
2.334MetLeu: 2.334 ± 0.277
0.726MetMet: 0.726 ± 0.148
1.452MetAsn: 1.452 ± 0.206
1.089MetPro: 1.089 ± 0.158
1.167MetGln: 1.167 ± 0.185
1.348MetArg: 1.348 ± 0.192
2.671MetSer: 2.671 ± 0.304
1.322MetThr: 1.322 ± 0.182
1.478MetVal: 1.478 ± 0.212
0.233MetTrp: 0.233 ± 0.076
1.193MetTyr: 1.193 ± 0.155
0.0MetXaa: 0.0 ± 0.0
Asn
3.371AsnAla: 3.371 ± 0.302
0.259AsnCys: 0.259 ± 0.094
2.723AsnAsp: 2.723 ± 0.243
2.671AsnGlu: 2.671 ± 0.284
1.348AsnPhe: 1.348 ± 0.191
3.345AsnGly: 3.345 ± 0.327
1.219AsnHis: 1.219 ± 0.212
3.449AsnIle: 3.449 ± 0.3
3.501AsnLys: 3.501 ± 0.316
4.305AsnLeu: 4.305 ± 0.345
1.452AsnMet: 1.452 ± 0.211
2.801AsnAsn: 2.801 ± 0.321
3.008AsnPro: 3.008 ± 0.266
1.556AsnGln: 1.556 ± 0.258
2.749AsnArg: 2.749 ± 0.259
3.319AsnSer: 3.319 ± 0.337
3.501AsnThr: 3.501 ± 0.413
3.604AsnVal: 3.604 ± 0.369
0.726AsnTrp: 0.726 ± 0.103
2.308AsnTyr: 2.308 ± 0.229
0.0AsnXaa: 0.0 ± 0.0
Pro
2.204ProAla: 2.204 ± 0.206
0.207ProCys: 0.207 ± 0.07
2.178ProAsp: 2.178 ± 0.223
3.319ProGlu: 3.319 ± 0.349
0.934ProPhe: 0.934 ± 0.132
1.374ProGly: 1.374 ± 0.216
1.089ProHis: 1.089 ± 0.202
2.36ProIle: 2.36 ± 0.279
2.697ProLys: 2.697 ± 0.317
2.801ProLeu: 2.801 ± 0.261
0.959ProMet: 0.959 ± 0.202
2.074ProAsn: 2.074 ± 0.263
0.83ProPro: 0.83 ± 0.175
1.089ProGln: 1.089 ± 0.17
1.037ProArg: 1.037 ± 0.171
2.126ProSer: 2.126 ± 0.215
3.397ProThr: 3.397 ± 0.315
2.697ProVal: 2.697 ± 0.267
0.285ProTrp: 0.285 ± 0.093
1.763ProTyr: 1.763 ± 0.226
0.0ProXaa: 0.0 ± 0.0
Gln
2.567GlnAla: 2.567 ± 0.296
0.285GlnCys: 0.285 ± 0.084
1.867GlnAsp: 1.867 ± 0.224
2.36GlnGlu: 2.36 ± 0.26
1.141GlnPhe: 1.141 ± 0.226
2.074GlnGly: 2.074 ± 0.268
1.141GlnHis: 1.141 ± 0.194
1.997GlnIle: 1.997 ± 0.239
1.971GlnLys: 1.971 ± 0.265
3.578GlnLeu: 3.578 ± 0.361
0.959GlnMet: 0.959 ± 0.164
1.634GlnAsn: 1.634 ± 0.238
1.011GlnPro: 1.011 ± 0.163
1.4GlnGln: 1.4 ± 0.219
1.582GlnArg: 1.582 ± 0.227
1.945GlnSer: 1.945 ± 0.219
1.504GlnThr: 1.504 ± 0.21
2.723GlnVal: 2.723 ± 0.297
0.674GlnTrp: 0.674 ± 0.117
1.504GlnTyr: 1.504 ± 0.194
0.0GlnXaa: 0.0 ± 0.0
Arg
2.723ArgAla: 2.723 ± 0.267
0.156ArgCys: 0.156 ± 0.077
2.775ArgAsp: 2.775 ± 0.27
2.93ArgGlu: 2.93 ± 0.28
1.711ArgPhe: 1.711 ± 0.228
3.138ArgGly: 3.138 ± 0.276
0.856ArgHis: 0.856 ± 0.143
3.034ArgIle: 3.034 ± 0.293
3.604ArgLys: 3.604 ± 0.413
4.019ArgLeu: 4.019 ± 0.358
1.867ArgMet: 1.867 ± 0.227
2.386ArgAsn: 2.386 ± 0.268
1.193ArgPro: 1.193 ± 0.208
1.867ArgGln: 1.867 ± 0.226
1.634ArgArg: 1.634 ± 0.17
2.749ArgSer: 2.749 ± 0.246
2.567ArgThr: 2.567 ± 0.244
3.475ArgVal: 3.475 ± 0.291
0.415ArgTrp: 0.415 ± 0.107
1.426ArgTyr: 1.426 ± 0.182
0.0ArgXaa: 0.0 ± 0.0
Ser
3.812SerAla: 3.812 ± 0.431
0.337SerCys: 0.337 ± 0.092
4.071SerAsp: 4.071 ± 0.308
4.797SerGlu: 4.797 ± 0.341
1.711SerPhe: 1.711 ± 0.211
4.486SerGly: 4.486 ± 0.43
1.245SerHis: 1.245 ± 0.177
4.253SerIle: 4.253 ± 0.377
4.823SerLys: 4.823 ± 0.424
5.394SerLeu: 5.394 ± 0.383
1.711SerMet: 1.711 ± 0.218
3.578SerAsn: 3.578 ± 0.338
2.463SerPro: 2.463 ± 0.28
2.074SerGln: 2.074 ± 0.212
2.723SerArg: 2.723 ± 0.251
4.434SerSer: 4.434 ± 0.604
4.668SerThr: 4.668 ± 0.357
4.642SerVal: 4.642 ± 0.374
0.752SerTrp: 0.752 ± 0.152
2.93SerTyr: 2.93 ± 0.257
0.0SerXaa: 0.0 ± 0.0
Thr
5.212ThrAla: 5.212 ± 0.519
0.622ThrCys: 0.622 ± 0.131
4.227ThrAsp: 4.227 ± 0.461
4.227ThrGlu: 4.227 ± 0.33
2.723ThrPhe: 2.723 ± 0.314
5.082ThrGly: 5.082 ± 0.442
1.66ThrHis: 1.66 ± 0.198
4.59ThrIle: 4.59 ± 0.396
4.175ThrLys: 4.175 ± 0.292
5.705ThrLeu: 5.705 ± 0.397
1.089ThrMet: 1.089 ± 0.162
3.112ThrAsn: 3.112 ± 0.285
3.604ThrPro: 3.604 ± 0.355
1.945ThrGln: 1.945 ± 0.193
2.619ThrArg: 2.619 ± 0.318
4.097ThrSer: 4.097 ± 0.502
4.668ThrThr: 4.668 ± 0.633
4.616ThrVal: 4.616 ± 0.389
0.752ThrTrp: 0.752 ± 0.151
2.645ThrTyr: 2.645 ± 0.293
0.0ThrXaa: 0.0 ± 0.0
Val
4.538ValAla: 4.538 ± 0.49
0.57ValCys: 0.57 ± 0.134
5.342ValAsp: 5.342 ± 0.356
5.912ValGlu: 5.912 ± 0.403
2.049ValPhe: 2.049 ± 0.224
4.201ValGly: 4.201 ± 0.337
1.971ValHis: 1.971 ± 0.267
3.578ValIle: 3.578 ± 0.304
4.382ValLys: 4.382 ± 0.31
5.316ValLeu: 5.316 ± 0.431
1.737ValMet: 1.737 ± 0.23
3.423ValAsn: 3.423 ± 0.308
2.826ValPro: 2.826 ± 0.277
2.049ValGln: 2.049 ± 0.265
3.553ValArg: 3.553 ± 0.365
5.082ValSer: 5.082 ± 0.339
5.16ValThr: 5.16 ± 0.424
5.342ValVal: 5.342 ± 0.458
0.57ValTrp: 0.57 ± 0.121
2.801ValTyr: 2.801 ± 0.24
0.0ValXaa: 0.0 ± 0.0
Trp
0.908TrpAla: 0.908 ± 0.132
0.104TrpCys: 0.104 ± 0.058
0.959TrpAsp: 0.959 ± 0.142
0.856TrpGlu: 0.856 ± 0.124
0.311TrpPhe: 0.311 ± 0.101
0.648TrpGly: 0.648 ± 0.133
0.285TrpHis: 0.285 ± 0.075
0.648TrpIle: 0.648 ± 0.129
0.83TrpLys: 0.83 ± 0.176
0.83TrpLeu: 0.83 ± 0.182
0.545TrpMet: 0.545 ± 0.124
0.856TrpAsn: 0.856 ± 0.154
0.0TrpPro: 0.0 ± 0.0
0.389TrpGln: 0.389 ± 0.118
0.545TrpArg: 0.545 ± 0.139
0.7TrpSer: 0.7 ± 0.15
0.545TrpThr: 0.545 ± 0.115
1.115TrpVal: 1.115 ± 0.162
0.182TrpTrp: 0.182 ± 0.069
0.519TrpTyr: 0.519 ± 0.145
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.852TyrAla: 2.852 ± 0.258
0.389TyrCys: 0.389 ± 0.133
3.267TyrAsp: 3.267 ± 0.299
3.086TyrGlu: 3.086 ± 0.289
1.426TyrPhe: 1.426 ± 0.196
2.515TyrGly: 2.515 ± 0.259
1.089TyrHis: 1.089 ± 0.173
2.36TyrIle: 2.36 ± 0.289
3.008TyrLys: 3.008 ± 0.288
3.682TyrLeu: 3.682 ± 0.292
1.193TyrMet: 1.193 ± 0.194
2.412TyrAsn: 2.412 ± 0.237
1.348TyrPro: 1.348 ± 0.191
1.556TyrGln: 1.556 ± 0.168
2.023TyrArg: 2.023 ± 0.234
2.852TyrSer: 2.852 ± 0.245
3.112TyrThr: 3.112 ± 0.375
2.878TyrVal: 2.878 ± 0.247
0.622TyrTrp: 0.622 ± 0.121
1.608TyrTyr: 1.608 ± 0.21
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 168 proteins (38565 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
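A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is reported. Using the standard deviation as the error, the one-letter codes, the normalization by pair count, and all names below are assumptions; the page only states that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics

def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide in per mille over all adjacent pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)

# Hypothetical toy sequences, not taken from the phage proteome.
toy_proteins = ["MKKLA", "MAKK", "GGSG"]
kk_error = bootstrap_error(toy_proteins, "KK")
ww_error = bootstrap_error(toy_proteins, "WW")
```

Resampling proteins rather than individual residues keeps each sequence intact, so a dipeptide that never occurs in any protein gets an error of zero, consistent with the 0.0 ± 0.0 entries in the Xaa row and column.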

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski