Amino acid dipepetide frequency for Acinetobacter phage vB_AbaM_B09_Aci02-2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.858AlaAla: 3.858 ± 0.374
0.675AlaCys: 0.675 ± 0.147
3.504AlaAsp: 3.504 ± 0.441
3.986AlaGlu: 3.986 ± 0.476
2.7AlaPhe: 2.7 ± 0.286
3.697AlaGly: 3.697 ± 0.412
1.029AlaHis: 1.029 ± 0.193
4.533AlaIle: 4.533 ± 0.496
4.886AlaLys: 4.886 ± 0.417
6.494AlaLeu: 6.494 ± 0.503
1.961AlaMet: 1.961 ± 0.226
3.054AlaAsn: 3.054 ± 0.612
1.607AlaPro: 1.607 ± 0.19
2.282AlaGln: 2.282 ± 0.33
2.765AlaArg: 2.765 ± 0.303
3.954AlaSer: 3.954 ± 0.396
3.279AlaThr: 3.279 ± 0.374
3.118AlaVal: 3.118 ± 0.341
0.675AlaTrp: 0.675 ± 0.125
2.282AlaTyr: 2.282 ± 0.273
0.0AlaXaa: 0.0 ± 0.0
Cys
0.611CysAla: 0.611 ± 0.134
0.225CysCys: 0.225 ± 0.087
0.611CysAsp: 0.611 ± 0.153
1.093CysGlu: 1.093 ± 0.207
0.579CysPhe: 0.579 ± 0.16
1.318CysGly: 1.318 ± 0.204
0.579CysHis: 0.579 ± 0.173
0.868CysIle: 0.868 ± 0.189
1.189CysLys: 1.189 ± 0.229
0.9CysLeu: 0.9 ± 0.136
0.45CysMet: 0.45 ± 0.109
0.675CysAsn: 0.675 ± 0.126
0.418CysPro: 0.418 ± 0.135
0.386CysGln: 0.386 ± 0.125
0.675CysArg: 0.675 ± 0.141
0.547CysSer: 0.547 ± 0.146
0.675CysThr: 0.675 ± 0.159
0.611CysVal: 0.611 ± 0.136
0.096CysTrp: 0.096 ± 0.055
0.707CysTyr: 0.707 ± 0.142
0.0CysXaa: 0.0 ± 0.0
Asp
3.858AspAla: 3.858 ± 0.372
0.868AspCys: 0.868 ± 0.19
4.822AspAsp: 4.822 ± 0.555
4.983AspGlu: 4.983 ± 0.449
3.408AspPhe: 3.408 ± 0.298
4.79AspGly: 4.79 ± 0.426
0.997AspHis: 0.997 ± 0.185
4.404AspIle: 4.404 ± 0.35
4.243AspLys: 4.243 ± 0.421
5.979AspLeu: 5.979 ± 0.437
1.993AspMet: 1.993 ± 0.234
3.311AspAsn: 3.311 ± 0.4
2.668AspPro: 2.668 ± 0.357
2.282AspGln: 2.282 ± 0.288
2.122AspArg: 2.122 ± 0.255
4.018AspSer: 4.018 ± 0.415
2.54AspThr: 2.54 ± 0.316
5.272AspVal: 5.272 ± 0.484
1.125AspTrp: 1.125 ± 0.191
2.733AspTyr: 2.733 ± 0.352
0.0AspXaa: 0.0 ± 0.0
Glu
4.983GluAla: 4.983 ± 0.385
1.382GluCys: 1.382 ± 0.255
5.401GluAsp: 5.401 ± 0.427
5.754GluGlu: 5.754 ± 0.56
3.44GluPhe: 3.44 ± 0.323
4.018GluGly: 4.018 ± 0.39
1.704GluHis: 1.704 ± 0.278
4.758GluIle: 4.758 ± 0.326
4.629GluLys: 4.629 ± 0.465
6.462GluLeu: 6.462 ± 0.549
2.122GluMet: 2.122 ± 0.287
3.793GluAsn: 3.793 ± 0.31
1.575GluPro: 1.575 ± 0.229
2.797GluGln: 2.797 ± 0.367
3.118GluArg: 3.118 ± 0.318
4.115GluSer: 4.115 ± 0.338
3.022GluThr: 3.022 ± 0.337
5.69GluVal: 5.69 ± 0.404
1.157GluTrp: 1.157 ± 0.17
3.183GluTyr: 3.183 ± 0.278
0.0GluXaa: 0.0 ± 0.0
Phe
2.282PheAla: 2.282 ± 0.276
0.579PheCys: 0.579 ± 0.138
3.247PheAsp: 3.247 ± 0.305
3.44PheGlu: 3.44 ± 0.335
1.189PhePhe: 1.189 ± 0.186
3.472PheGly: 3.472 ± 0.347
0.932PheHis: 0.932 ± 0.212
3.215PheIle: 3.215 ± 0.312
3.89PheLys: 3.89 ± 0.452
2.99PheLeu: 2.99 ± 0.303
1.736PheMet: 1.736 ± 0.238
2.154PheAsn: 2.154 ± 0.239
1.672PhePro: 1.672 ± 0.221
1.511PheGln: 1.511 ± 0.167
1.865PheArg: 1.865 ± 0.234
3.536PheSer: 3.536 ± 0.384
2.154PheThr: 2.154 ± 0.282
2.7PheVal: 2.7 ± 0.256
0.45PheTrp: 0.45 ± 0.108
1.736PheTyr: 1.736 ± 0.282
0.0PheXaa: 0.0 ± 0.0
Gly
3.311GlyAla: 3.311 ± 0.294
1.125GlyCys: 1.125 ± 0.204
4.597GlyAsp: 4.597 ± 0.449
4.726GlyGlu: 4.726 ± 0.402
3.568GlyPhe: 3.568 ± 0.372
4.565GlyGly: 4.565 ± 0.407
1.125GlyHis: 1.125 ± 0.19
4.693GlyIle: 4.693 ± 0.365
4.886GlyLys: 4.886 ± 0.363
5.176GlyLeu: 5.176 ± 0.454
1.607GlyMet: 1.607 ± 0.21
3.568GlyAsn: 3.568 ± 0.316
0.707GlyPro: 0.707 ± 0.119
2.218GlyGln: 2.218 ± 0.334
2.7GlyArg: 2.7 ± 0.294
4.372GlySer: 4.372 ± 0.365
4.404GlyThr: 4.404 ± 0.51
4.822GlyVal: 4.822 ± 0.368
1.382GlyTrp: 1.382 ± 0.229
3.215GlyTyr: 3.215 ± 0.237
0.0GlyXaa: 0.0 ± 0.0
His
1.093HisAla: 1.093 ± 0.187
0.289HisCys: 0.289 ± 0.104
1.575HisAsp: 1.575 ± 0.292
1.093HisGlu: 1.093 ± 0.229
1.093HisPhe: 1.093 ± 0.194
1.736HisGly: 1.736 ± 0.275
0.45HisHis: 0.45 ± 0.132
1.254HisIle: 1.254 ± 0.196
1.447HisLys: 1.447 ± 0.261
1.64HisLeu: 1.64 ± 0.27
0.611HisMet: 0.611 ± 0.14
1.125HisAsn: 1.125 ± 0.219
0.579HisPro: 0.579 ± 0.155
0.547HisGln: 0.547 ± 0.124
0.964HisArg: 0.964 ± 0.219
1.414HisSer: 1.414 ± 0.206
0.868HisThr: 0.868 ± 0.162
1.543HisVal: 1.543 ± 0.316
0.45HisTrp: 0.45 ± 0.117
0.547HisTyr: 0.547 ± 0.119
0.0HisXaa: 0.0 ± 0.0
Ile
3.89IleAla: 3.89 ± 0.37
0.643IleCys: 0.643 ± 0.154
5.722IleAsp: 5.722 ± 0.392
5.594IleGlu: 5.594 ± 0.387
2.315IlePhe: 2.315 ± 0.264
3.986IleGly: 3.986 ± 0.374
1.414IleHis: 1.414 ± 0.216
3.472IleIle: 3.472 ± 0.376
5.658IleLys: 5.658 ± 0.407
4.758IleLeu: 4.758 ± 0.444
1.286IleMet: 1.286 ± 0.207
3.6IleAsn: 3.6 ± 0.362
2.186IlePro: 2.186 ± 0.245
2.572IleGln: 2.572 ± 0.297
2.7IleArg: 2.7 ± 0.285
4.308IleSer: 4.308 ± 0.486
3.89IleThr: 3.89 ± 0.565
4.372IleVal: 4.372 ± 0.395
0.964IleTrp: 0.964 ± 0.142
2.218IleTyr: 2.218 ± 0.256
0.0IleXaa: 0.0 ± 0.0
Lys
6.172LysAla: 6.172 ± 0.65
0.868LysCys: 0.868 ± 0.192
4.533LysAsp: 4.533 ± 0.386
4.919LysGlu: 4.919 ± 0.454
3.761LysPhe: 3.761 ± 0.399
4.661LysGly: 4.661 ± 0.376
2.186LysHis: 2.186 ± 0.327
4.79LysIle: 4.79 ± 0.462
5.529LysLys: 5.529 ± 0.643
6.622LysLeu: 6.622 ± 0.559
2.154LysMet: 2.154 ± 0.325
3.279LysAsn: 3.279 ± 0.319
1.961LysPro: 1.961 ± 0.242
2.893LysGln: 2.893 ± 0.358
3.086LysArg: 3.086 ± 0.368
5.433LysSer: 5.433 ± 0.479
4.147LysThr: 4.147 ± 0.34
6.526LysVal: 6.526 ± 0.427
0.997LysTrp: 0.997 ± 0.173
4.083LysTyr: 4.083 ± 0.442
0.0LysXaa: 0.0 ± 0.0
Leu
5.111LeuAla: 5.111 ± 0.509
1.093LeuCys: 1.093 ± 0.215
5.144LeuAsp: 5.144 ± 0.394
6.558LeuGlu: 6.558 ± 0.546
3.118LeuPhe: 3.118 ± 0.358
4.951LeuGly: 4.951 ± 0.378
1.447LeuHis: 1.447 ± 0.256
4.597LeuIle: 4.597 ± 0.376
6.59LeuLys: 6.59 ± 0.478
6.14LeuLeu: 6.14 ± 0.441
1.768LeuMet: 1.768 ± 0.242
5.176LeuAsn: 5.176 ± 0.404
2.218LeuPro: 2.218 ± 0.254
2.668LeuGln: 2.668 ± 0.271
3.729LeuArg: 3.729 ± 0.356
5.786LeuSer: 5.786 ± 0.494
5.336LeuThr: 5.336 ± 0.458
5.336LeuVal: 5.336 ± 0.405
0.964LeuTrp: 0.964 ± 0.186
2.797LeuTyr: 2.797 ± 0.312
0.0LeuXaa: 0.0 ± 0.0
Met
1.736MetAla: 1.736 ± 0.221
0.257MetCys: 0.257 ± 0.09
1.832MetAsp: 1.832 ± 0.254
2.122MetGlu: 2.122 ± 0.293
1.061MetPhe: 1.061 ± 0.172
1.447MetGly: 1.447 ± 0.243
0.289MetHis: 0.289 ± 0.099
1.993MetIle: 1.993 ± 0.323
2.765MetLys: 2.765 ± 0.367
1.832MetLeu: 1.832 ± 0.229
0.579MetMet: 0.579 ± 0.155
1.479MetAsn: 1.479 ± 0.213
0.739MetPro: 0.739 ± 0.144
0.611MetGln: 0.611 ± 0.163
1.189MetArg: 1.189 ± 0.217
2.282MetSer: 2.282 ± 0.296
1.511MetThr: 1.511 ± 0.199
1.607MetVal: 1.607 ± 0.212
0.225MetTrp: 0.225 ± 0.071
1.189MetTyr: 1.189 ± 0.206
0.0MetXaa: 0.0 ± 0.0
Asn
3.279AsnAla: 3.279 ± 0.723
0.772AsnCys: 0.772 ± 0.173
3.183AsnAsp: 3.183 ± 0.319
2.797AsnGlu: 2.797 ± 0.267
2.507AsnPhe: 2.507 ± 0.321
4.919AsnGly: 4.919 ± 0.391
0.804AsnHis: 0.804 ± 0.162
4.115AsnIle: 4.115 ± 0.363
3.665AsnLys: 3.665 ± 0.425
4.276AsnLeu: 4.276 ± 0.353
1.318AsnMet: 1.318 ± 0.169
2.507AsnAsn: 2.507 ± 0.299
1.961AsnPro: 1.961 ± 0.289
1.64AsnGln: 1.64 ± 0.237
1.961AsnArg: 1.961 ± 0.234
3.183AsnSer: 3.183 ± 0.489
2.668AsnThr: 2.668 ± 0.291
3.793AsnVal: 3.793 ± 0.338
0.579AsnTrp: 0.579 ± 0.139
1.865AsnTyr: 1.865 ± 0.269
0.0AsnXaa: 0.0 ± 0.0
Pro
1.286ProAla: 1.286 ± 0.211
0.225ProCys: 0.225 ± 0.078
1.961ProAsp: 1.961 ± 0.325
2.507ProGlu: 2.507 ± 0.328
1.222ProPhe: 1.222 ± 0.179
0.0ProGly: 0.0 ± 0.0
0.675ProHis: 0.675 ± 0.16
1.768ProIle: 1.768 ± 0.253
2.7ProLys: 2.7 ± 0.285
1.961ProLeu: 1.961 ± 0.306
0.804ProMet: 0.804 ± 0.172
2.025ProAsn: 2.025 ± 0.31
1.189ProPro: 1.189 ± 0.213
0.997ProGln: 0.997 ± 0.14
1.093ProArg: 1.093 ± 0.224
1.929ProSer: 1.929 ± 0.217
1.865ProThr: 1.865 ± 0.224
2.315ProVal: 2.315 ± 0.283
0.225ProTrp: 0.225 ± 0.092
1.64ProTyr: 1.64 ± 0.245
0.0ProXaa: 0.0 ± 0.0
Gln
3.215GlnAla: 3.215 ± 0.55
0.418GlnCys: 0.418 ± 0.101
1.832GlnAsp: 1.832 ± 0.229
2.861GlnGlu: 2.861 ± 0.274
1.382GlnPhe: 1.382 ± 0.206
2.925GlnGly: 2.925 ± 0.413
0.707GlnHis: 0.707 ± 0.151
2.025GlnIle: 2.025 ± 0.206
1.961GlnLys: 1.961 ± 0.248
2.925GlnLeu: 2.925 ± 0.349
1.447GlnMet: 1.447 ± 0.237
1.511GlnAsn: 1.511 ± 0.25
0.772GlnPro: 0.772 ± 0.158
1.093GlnGln: 1.093 ± 0.199
2.025GlnArg: 2.025 ± 0.213
1.8GlnSer: 1.8 ± 0.373
1.543GlnThr: 1.543 ± 0.237
2.09GlnVal: 2.09 ± 0.267
0.547GlnTrp: 0.547 ± 0.146
1.704GlnTyr: 1.704 ± 0.225
0.0GlnXaa: 0.0 ± 0.0
Arg
1.961ArgAla: 1.961 ± 0.267
0.611ArgCys: 0.611 ± 0.148
2.99ArgAsp: 2.99 ± 0.357
3.375ArgGlu: 3.375 ± 0.259
2.379ArgPhe: 2.379 ± 0.361
2.958ArgGly: 2.958 ± 0.292
0.932ArgHis: 0.932 ± 0.204
3.44ArgIle: 3.44 ± 0.358
3.633ArgLys: 3.633 ± 0.399
3.15ArgLeu: 3.15 ± 0.307
0.772ArgMet: 0.772 ± 0.185
2.411ArgAsn: 2.411 ± 0.256
1.029ArgPro: 1.029 ± 0.163
1.318ArgGln: 1.318 ± 0.22
2.25ArgArg: 2.25 ± 0.257
2.122ArgSer: 2.122 ± 0.277
1.865ArgThr: 1.865 ± 0.263
3.504ArgVal: 3.504 ± 0.357
0.643ArgTrp: 0.643 ± 0.14
1.832ArgTyr: 1.832 ± 0.269
0.0ArgXaa: 0.0 ± 0.0
Ser
3.44SerAla: 3.44 ± 0.526
0.804SerCys: 0.804 ± 0.156
3.697SerAsp: 3.697 ± 0.335
4.661SerGlu: 4.661 ± 0.393
2.572SerPhe: 2.572 ± 0.281
5.561SerGly: 5.561 ± 0.5
1.382SerHis: 1.382 ± 0.254
4.179SerIle: 4.179 ± 0.358
5.561SerLys: 5.561 ± 0.406
4.822SerLeu: 4.822 ± 0.414
1.318SerMet: 1.318 ± 0.229
3.15SerAsn: 3.15 ± 0.33
1.768SerPro: 1.768 ± 0.3
2.636SerGln: 2.636 ± 0.384
2.733SerArg: 2.733 ± 0.312
4.34SerSer: 4.34 ± 0.635
3.183SerThr: 3.183 ± 0.283
4.468SerVal: 4.468 ± 0.415
0.9SerTrp: 0.9 ± 0.166
3.086SerTyr: 3.086 ± 0.33
0.0SerXaa: 0.0 ± 0.0
Thr
3.408ThrAla: 3.408 ± 0.453
0.611ThrCys: 0.611 ± 0.163
2.893ThrAsp: 2.893 ± 0.343
3.44ThrGlu: 3.44 ± 0.37
2.797ThrPhe: 2.797 ± 0.307
4.34ThrGly: 4.34 ± 0.393
1.189ThrHis: 1.189 ± 0.177
3.568ThrIle: 3.568 ± 0.34
4.211ThrLys: 4.211 ± 0.454
4.436ThrLeu: 4.436 ± 0.375
1.318ThrMet: 1.318 ± 0.192
2.218ThrAsn: 2.218 ± 0.333
1.865ThrPro: 1.865 ± 0.242
1.8ThrGln: 1.8 ± 0.265
2.379ThrArg: 2.379 ± 0.271
3.183ThrSer: 3.183 ± 0.453
3.022ThrThr: 3.022 ± 0.363
3.568ThrVal: 3.568 ± 0.411
0.836ThrTrp: 0.836 ± 0.157
2.186ThrTyr: 2.186 ± 0.26
0.0ThrXaa: 0.0 ± 0.0
Val
3.793ValAla: 3.793 ± 0.319
0.804ValCys: 0.804 ± 0.189
4.661ValAsp: 4.661 ± 0.509
5.144ValGlu: 5.144 ± 0.368
2.925ValPhe: 2.925 ± 0.3
4.308ValGly: 4.308 ± 0.419
1.157ValHis: 1.157 ± 0.191
4.404ValIle: 4.404 ± 0.339
6.429ValLys: 6.429 ± 0.434
5.594ValLeu: 5.594 ± 0.47
1.865ValMet: 1.865 ± 0.244
4.276ValAsn: 4.276 ± 0.346
1.832ValPro: 1.832 ± 0.204
2.379ValGln: 2.379 ± 0.31
3.215ValArg: 3.215 ± 0.329
4.34ValSer: 4.34 ± 0.445
4.179ValThr: 4.179 ± 0.409
5.786ValVal: 5.786 ± 0.521
0.932ValTrp: 0.932 ± 0.143
3.022ValTyr: 3.022 ± 0.309
0.0ValXaa: 0.0 ± 0.0
Trp
0.707TrpAla: 0.707 ± 0.188
0.289TrpCys: 0.289 ± 0.086
1.061TrpAsp: 1.061 ± 0.181
1.093TrpGlu: 1.093 ± 0.173
0.868TrpPhe: 0.868 ± 0.171
0.547TrpGly: 0.547 ± 0.154
0.321TrpHis: 0.321 ± 0.1
1.061TrpIle: 1.061 ± 0.185
1.157TrpLys: 1.157 ± 0.231
0.997TrpLeu: 0.997 ± 0.179
0.547TrpMet: 0.547 ± 0.113
0.514TrpAsn: 0.514 ± 0.106
0.225TrpPro: 0.225 ± 0.085
0.354TrpGln: 0.354 ± 0.098
0.611TrpArg: 0.611 ± 0.148
0.868TrpSer: 0.868 ± 0.177
0.804TrpThr: 0.804 ± 0.144
1.093TrpVal: 1.093 ± 0.188
0.161TrpTrp: 0.161 ± 0.074
0.836TrpTyr: 0.836 ± 0.182
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.347TyrAla: 2.347 ± 0.275
0.675TyrCys: 0.675 ± 0.142
3.15TyrAsp: 3.15 ± 0.365
2.958TyrGlu: 2.958 ± 0.255
1.865TyrPhe: 1.865 ± 0.249
2.572TyrGly: 2.572 ± 0.269
0.997TyrHis: 0.997 ± 0.218
2.411TyrIle: 2.411 ± 0.298
3.568TyrLys: 3.568 ± 0.357
3.375TyrLeu: 3.375 ± 0.317
1.061TyrMet: 1.061 ± 0.152
1.961TyrAsn: 1.961 ± 0.202
1.414TyrPro: 1.414 ± 0.238
1.8TyrGln: 1.8 ± 0.292
2.025TyrArg: 2.025 ± 0.286
2.733TyrSer: 2.733 ± 0.329
2.315TyrThr: 2.315 ± 0.292
2.829TyrVal: 2.829 ± 0.294
0.804TyrTrp: 0.804 ± 0.17
1.993TyrTyr: 1.993 ± 0.326
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 171 proteins (31108 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski