Amino acid dipeptide frequency for Mycobacterium phage BiancaTri92

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
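
As a rough illustration of such a calculation, the sketch below counts overlapping pairs of adjacent residues within each protein and normalizes by the total number of pairs. The function name `dipeptide_per_mille`, the use of one-letter codes, and the exact counting and normalization scheme are assumptions made for this example; the page does not state how the database computes its values.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input: "MAAGL" gives MA, AA, AG, GL; "MAG" gives MA, AG (6 pairs in total).
freqs = dipeptide_per_mille(["MAAGL", "MAG"])
```

Here `freqs["MA"]` is 2 of 6 pairs, expressed in per mille, and the values over all observed dipeptides sum to 1000.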

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
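
The arithmetic behind this expectation can be sketched as follows. The helper names and the plain greater-than/less-than comparison are assumptions for illustration only; the page does not say whether the coloring takes the error estimates into account.

```python
def expected_uniform_per_mille(n_residue_types=20):
    """Per-mille frequency of each dipeptide if all pairs were equally likely."""
    return 1000.0 / (n_residue_types ** 2)


def classify(observed_per_mille, n_residue_types=20):
    """Label a value relative to the uniform expectation (errors are ignored)."""
    expected = expected_uniform_per_mille(n_residue_types)
    if observed_per_mille > expected:
        return "over"   # would correspond to red in the table
    if observed_per_mille < expected:
        return "under"  # would correspond to blue in the table
    return "expected"


uniform_20 = expected_uniform_per_mille()    # 1000 / 400
uniform_21 = expected_uniform_per_mille(21)  # 1000 / 441
ala_ala = classify(11.212)  # AlaAla value from the table
ala_cys = classify(0.493)   # AlaCys value from the table
```

With 20 residue types the expectation is 2.5‰, so under this simple rule AlaAla (11.212‰) counts as overrepresented and AlaCys (0.493‰) as underrepresented.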

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
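
A minimal sketch of the unit conversion, using the AlaAla value from the table; the function names are hypothetical and introduced only for this example.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (1% equals 10 per mille)."""
    return value / 10.0


ala_ala_fraction = per_mille_to_fraction(11.212)
ala_ala_percent = per_mille_to_percent(11.212)
```

So 11.212‰ corresponds to a fraction of about 0.0112, or about 1.12%.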

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 11.212 ± 0.905
AlaCys: 0.493 ± 0.152
AlaAsp: 5.544 ± 0.585
AlaGlu: 7.269 ± 0.826
AlaPhe: 4.312 ± 0.477
AlaGly: 7.947 ± 0.901
AlaHis: 1.971 ± 0.335
AlaIle: 4.435 ± 0.509
AlaLys: 5.359 ± 0.639
AlaLeu: 8.44 ± 1.004
AlaMet: 3.08 ± 0.406
AlaAsn: 3.327 ± 0.59
AlaPro: 4.559 ± 0.583
AlaGln: 3.696 ± 0.641
AlaArg: 6.283 ± 0.593
AlaSer: 4.682 ± 0.486
AlaThr: 5.667 ± 0.722
AlaVal: 7.392 ± 0.643
AlaTrp: 1.848 ± 0.337
AlaTyr: 2.526 ± 0.397
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.493 ± 0.154
CysCys: 0.0 ± 0.0
CysAsp: 0.678 ± 0.203
CysGlu: 0.308 ± 0.118
CysPhe: 0.308 ± 0.132
CysGly: 0.493 ± 0.159
CysHis: 0.308 ± 0.166
CysIle: 0.185 ± 0.113
CysLys: 0.246 ± 0.121
CysLeu: 0.739 ± 0.232
CysMet: 0.062 ± 0.049
CysAsn: 0.554 ± 0.18
CysPro: 0.431 ± 0.185
CysGln: 0.123 ± 0.083
CysArg: 0.493 ± 0.165
CysSer: 0.431 ± 0.15
CysThr: 0.554 ± 0.203
CysVal: 0.616 ± 0.18
CysTrp: 0.431 ± 0.157
CysTyr: 0.308 ± 0.142
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.791 ± 0.572
AspCys: 0.493 ± 0.193
AspAsp: 3.758 ± 0.474
AspGlu: 5.483 ± 0.714
AspPhe: 2.649 ± 0.43
AspGly: 6.345 ± 0.688
AspHis: 1.417 ± 0.395
AspIle: 3.388 ± 0.499
AspLys: 2.156 ± 0.364
AspLeu: 5.483 ± 0.638
AspMet: 1.786 ± 0.334
AspAsn: 1.54 ± 0.303
AspPro: 4.99 ± 0.678
AspGln: 2.094 ± 0.344
AspArg: 2.464 ± 0.373
AspSer: 2.711 ± 0.43
AspThr: 3.142 ± 0.498
AspVal: 4.127 ± 0.475
AspTrp: 1.109 ± 0.222
AspTyr: 2.094 ± 0.238
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.715 ± 0.684
GluCys: 0.308 ± 0.174
GluAsp: 4.682 ± 0.668
GluGlu: 4.99 ± 0.675
GluPhe: 2.526 ± 0.413
GluGly: 4.62 ± 0.476
GluHis: 1.786 ± 0.339
GluIle: 3.635 ± 0.473
GluLys: 2.218 ± 0.393
GluLeu: 7.824 ± 0.759
GluMet: 2.341 ± 0.318
GluAsn: 2.156 ± 0.341
GluPro: 2.957 ± 0.456
GluGln: 2.526 ± 0.308
GluArg: 4.559 ± 0.686
GluSer: 3.08 ± 0.421
GluThr: 3.819 ± 0.487
GluVal: 4.62 ± 0.596
GluTrp: 1.417 ± 0.28
GluTyr: 2.279 ± 0.383
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.019 ± 0.456
PheCys: 0.37 ± 0.141
PheAsp: 2.279 ± 0.453
PheGlu: 2.587 ± 0.348
PhePhe: 0.801 ± 0.23
PheGly: 3.265 ± 0.479
PheHis: 0.616 ± 0.279
PheIle: 1.478 ± 0.273
PheLys: 1.232 ± 0.293
PheLeu: 2.957 ± 0.511
PheMet: 0.493 ± 0.167
PheAsn: 2.094 ± 0.357
PhePro: 1.725 ± 0.288
PheGln: 1.294 ± 0.307
PheArg: 2.341 ± 0.368
PheSer: 1.725 ± 0.389
PheThr: 2.033 ± 0.323
PheVal: 2.464 ± 0.445
PheTrp: 0.493 ± 0.168
PheTyr: 0.616 ± 0.162
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.208 ± 1.096
GlyCys: 0.801 ± 0.193
GlyAsp: 6.222 ± 0.929
GlyGlu: 4.743 ± 0.619
GlyPhe: 3.142 ± 0.523
GlyGly: 7.269 ± 1.49
GlyHis: 1.725 ± 0.338
GlyIle: 4.004 ± 0.492
GlyLys: 3.943 ± 0.531
GlyLeu: 6.961 ± 0.811
GlyMet: 2.218 ± 0.321
GlyAsn: 3.388 ± 0.49
GlyPro: 4.743 ± 1.4
GlyGln: 3.635 ± 0.457
GlyArg: 3.696 ± 0.463
GlySer: 4.435 ± 0.663
GlyThr: 4.99 ± 0.591
GlyVal: 6.345 ± 0.692
GlyTrp: 1.232 ± 0.254
GlyTyr: 2.711 ± 0.362
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.355 ± 0.317
HisCys: 0.431 ± 0.158
HisAsp: 1.54 ± 0.241
HisGlu: 1.232 ± 0.263
HisPhe: 0.431 ± 0.165
HisGly: 1.786 ± 0.433
HisHis: 0.616 ± 0.187
HisIle: 1.294 ± 0.304
HisLys: 1.109 ± 0.293
HisLeu: 1.17 ± 0.283
HisMet: 0.37 ± 0.144
HisAsn: 0.493 ± 0.169
HisPro: 1.294 ± 0.294
HisGln: 1.047 ± 0.244
HisArg: 1.725 ± 0.37
HisSer: 0.678 ± 0.201
HisThr: 1.109 ± 0.256
HisVal: 1.17 ± 0.306
HisTrp: 0.493 ± 0.174
HisTyr: 0.801 ± 0.325
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.483 ± 0.586
IleCys: 0.493 ± 0.16
IleAsp: 3.573 ± 0.436
IleGlu: 4.62 ± 0.563
IlePhe: 0.986 ± 0.234
IleGly: 4.251 ± 0.601
IleHis: 0.862 ± 0.2
IleIle: 1.725 ± 0.312
IleLys: 2.526 ± 0.456
IleLeu: 4.312 ± 0.434
IleMet: 0.431 ± 0.14
IleAsn: 2.156 ± 0.393
IlePro: 3.265 ± 0.405
IleGln: 1.54 ± 0.29
IleArg: 3.327 ± 0.464
IleSer: 2.772 ± 0.443
IleThr: 2.895 ± 0.368
IleVal: 2.587 ± 0.37
IleTrp: 0.554 ± 0.164
IleTyr: 1.232 ± 0.308
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.914 ± 0.685
LysCys: 0.185 ± 0.106
LysAsp: 1.91 ± 0.323
LysGlu: 2.218 ± 0.494
LysPhe: 0.924 ± 0.235
LysGly: 4.497 ± 0.78
LysHis: 0.616 ± 0.185
LysIle: 2.464 ± 0.376
LysLys: 3.08 ± 0.517
LysLeu: 3.758 ± 0.511
LysMet: 1.047 ± 0.235
LysAsn: 1.54 ± 0.258
LysPro: 3.142 ± 0.54
LysGln: 1.848 ± 0.342
LysArg: 3.327 ± 0.514
LysSer: 2.156 ± 0.366
LysThr: 2.587 ± 0.361
LysVal: 3.758 ± 0.48
LysTrp: 1.047 ± 0.239
LysTyr: 1.355 ± 0.256
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.871 ± 0.836
LeuCys: 0.678 ± 0.19
LeuAsp: 5.236 ± 0.579
LeuGlu: 5.175 ± 0.679
LeuPhe: 2.279 ± 0.281
LeuGly: 6.468 ± 0.781
LeuHis: 1.725 ± 0.379
LeuIle: 4.189 ± 0.551
LeuLys: 3.511 ± 0.43
LeuLeu: 5.113 ± 0.591
LeuMet: 2.895 ± 0.458
LeuAsn: 2.094 ± 0.437
LeuPro: 4.99 ± 0.529
LeuGln: 3.08 ± 0.508
LeuArg: 6.407 ± 0.689
LeuSer: 5.421 ± 0.53
LeuThr: 5.298 ± 0.643
LeuVal: 4.374 ± 0.671
LeuTrp: 1.294 ± 0.256
LeuTyr: 2.279 ± 0.402
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.142 ± 0.48
MetCys: 0.062 ± 0.066
MetAsp: 1.047 ± 0.252
MetGlu: 1.54 ± 0.264
MetPhe: 0.801 ± 0.197
MetGly: 1.786 ± 0.304
MetHis: 0.554 ± 0.175
MetIle: 1.355 ± 0.264
MetLys: 1.232 ± 0.282
MetLeu: 1.663 ± 0.322
MetMet: 0.678 ± 0.192
MetAsn: 0.739 ± 0.197
MetPro: 1.232 ± 0.29
MetGln: 0.739 ± 0.184
MetArg: 1.54 ± 0.297
MetSer: 2.094 ± 0.293
MetThr: 2.526 ± 0.403
MetVal: 1.663 ± 0.376
MetTrp: 0.246 ± 0.108
MetTyr: 0.862 ± 0.207
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.265 ± 0.508
AsnCys: 0.493 ± 0.165
AsnAsp: 2.094 ± 0.337
AsnGlu: 2.279 ± 0.392
AsnPhe: 1.17 ± 0.32
AsnGly: 3.635 ± 0.627
AsnHis: 0.862 ± 0.219
AsnIle: 1.725 ± 0.258
AsnLys: 1.109 ± 0.235
AsnLeu: 2.957 ± 0.501
AsnMet: 0.493 ± 0.179
AsnAsn: 0.308 ± 0.127
AsnPro: 2.156 ± 0.371
AsnGln: 0.554 ± 0.22
AsnArg: 2.341 ± 0.409
AsnSer: 1.971 ± 0.382
AsnThr: 1.54 ± 0.321
AsnVal: 2.341 ± 0.357
AsnTrp: 0.924 ± 0.251
AsnTyr: 0.986 ± 0.218
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.483 ± 0.751
ProCys: 0.308 ± 0.158
ProAsp: 3.943 ± 0.523
ProGlu: 4.99 ± 0.58
ProPhe: 1.602 ± 0.343
ProGly: 4.805 ± 0.759
ProHis: 1.047 ± 0.264
ProIle: 2.649 ± 0.416
ProLys: 3.142 ± 0.657
ProLeu: 3.388 ± 0.537
ProMet: 1.047 ± 0.293
ProAsn: 2.094 ± 0.322
ProPro: 2.895 ± 0.487
ProGln: 2.279 ± 0.586
ProArg: 3.573 ± 0.58
ProSer: 2.587 ± 0.424
ProThr: 3.696 ± 0.444
ProVal: 3.881 ± 0.457
ProTrp: 1.232 ± 0.33
ProTyr: 1.478 ± 0.289
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.559 ± 0.612
GlnCys: 0.246 ± 0.121
GlnAsp: 1.417 ± 0.297
GlnGlu: 1.725 ± 0.397
GlnPhe: 1.54 ± 0.259
GlnGly: 3.45 ± 0.854
GlnHis: 0.616 ± 0.212
GlnIle: 2.526 ± 0.354
GlnLys: 1.54 ± 0.288
GlnLeu: 3.203 ± 0.515
GlnMet: 1.232 ± 0.273
GlnAsn: 1.047 ± 0.317
GlnPro: 1.417 ± 0.369
GlnGln: 1.971 ± 0.389
GlnArg: 2.218 ± 0.415
GlnSer: 1.663 ± 0.36
GlnThr: 2.033 ± 0.409
GlnVal: 3.573 ± 0.481
GlnTrp: 0.616 ± 0.185
GlnTyr: 0.862 ± 0.207
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.791 ± 0.799
ArgCys: 0.616 ± 0.231
ArgAsp: 4.004 ± 0.504
ArgGlu: 5.051 ± 0.672
ArgPhe: 2.526 ± 0.467
ArgGly: 4.682 ± 0.669
ArgHis: 1.417 ± 0.294
ArgIle: 3.511 ± 0.563
ArgLys: 3.635 ± 0.492
ArgLeu: 5.729 ± 0.576
ArgMet: 1.663 ± 0.318
ArgAsn: 1.971 ± 0.321
ArgPro: 2.957 ± 0.447
ArgGln: 2.403 ± 0.381
ArgArg: 5.606 ± 0.674
ArgSer: 2.711 ± 0.472
ArgThr: 3.019 ± 0.351
ArgVal: 4.374 ± 0.535
ArgTrp: 0.986 ± 0.229
ArgTyr: 1.663 ± 0.294
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.113 ± 0.606
SerCys: 0.308 ± 0.116
SerAsp: 3.881 ± 0.456
SerGlu: 3.45 ± 0.439
SerPhe: 1.848 ± 0.305
SerGly: 4.127 ± 0.53
SerHis: 0.739 ± 0.212
SerIle: 2.218 ± 0.323
SerLys: 2.341 ± 0.513
SerLeu: 4.189 ± 0.606
SerMet: 1.725 ± 0.311
SerAsn: 1.232 ± 0.303
SerPro: 3.08 ± 0.472
SerGln: 2.279 ± 0.34
SerArg: 3.388 ± 0.437
SerSer: 2.341 ± 0.388
SerThr: 3.203 ± 0.445
SerVal: 3.327 ± 0.448
SerTrp: 0.986 ± 0.246
SerTyr: 1.294 ± 0.292
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.99 ± 0.519
ThrCys: 0.493 ± 0.16
ThrAsp: 3.327 ± 0.499
ThrGlu: 3.573 ± 0.449
ThrPhe: 2.094 ± 0.383
ThrGly: 5.667 ± 0.896
ThrHis: 0.986 ± 0.278
ThrIle: 2.772 ± 0.517
ThrLys: 3.265 ± 0.47
ThrLeu: 4.62 ± 0.692
ThrMet: 1.478 ± 0.263
ThrAsn: 1.786 ± 0.297
ThrPro: 4.99 ± 0.662
ThrGln: 1.971 ± 0.293
ThrArg: 2.711 ± 0.347
ThrSer: 3.08 ± 0.416
ThrThr: 2.957 ± 0.539
ThrVal: 5.298 ± 0.6
ThrTrp: 1.294 ± 0.292
ThrTyr: 1.786 ± 0.316
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.776 ± 0.914
ValCys: 0.554 ± 0.179
ValAsp: 4.99 ± 0.515
ValGlu: 4.62 ± 0.563
ValPhe: 2.341 ± 0.409
ValGly: 5.051 ± 0.587
ValHis: 1.294 ± 0.243
ValIle: 3.635 ± 0.512
ValLys: 3.943 ± 0.446
ValLeu: 5.483 ± 0.695
ValMet: 1.478 ± 0.266
ValAsn: 2.772 ± 0.493
ValPro: 2.957 ± 0.464
ValGln: 2.156 ± 0.404
ValArg: 4.928 ± 0.595
ValSer: 3.943 ± 0.54
ValThr: 4.867 ± 0.588
ValVal: 5.359 ± 0.525
ValTrp: 1.478 ± 0.325
ValTyr: 2.033 ± 0.377
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.725 ± 0.363
TrpCys: 0.246 ± 0.161
TrpAsp: 0.986 ± 0.225
TrpGlu: 1.109 ± 0.238
TrpPhe: 0.801 ± 0.194
TrpGly: 1.294 ± 0.352
TrpHis: 0.493 ± 0.199
TrpIle: 1.17 ± 0.23
TrpLys: 0.554 ± 0.164
TrpLeu: 1.17 ± 0.301
TrpMet: 0.37 ± 0.127
TrpAsn: 0.862 ± 0.231
TrpPro: 0.678 ± 0.23
TrpGln: 1.17 ± 0.285
TrpArg: 1.109 ± 0.21
TrpSer: 1.17 ± 0.281
TrpThr: 1.663 ± 0.336
TrpVal: 1.109 ± 0.195
TrpTrp: 0.493 ± 0.178
TrpTyr: 0.431 ± 0.141
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.142 ± 0.415
TyrCys: 0.185 ± 0.101
TyrAsp: 1.91 ± 0.316
TyrGlu: 2.094 ± 0.356
TyrPhe: 0.801 ± 0.21
TyrGly: 1.971 ± 0.308
TyrHis: 0.493 ± 0.166
TyrIle: 1.232 ± 0.237
TyrLys: 1.232 ± 0.273
TyrLeu: 2.218 ± 0.384
TyrMet: 0.554 ± 0.202
TyrAsn: 1.047 ± 0.26
TyrPro: 1.663 ± 0.368
TyrGln: 1.047 ± 0.232
TyrArg: 2.279 ± 0.462
TyrSer: 1.478 ± 0.219
TyrThr: 1.54 ± 0.321
TyrVal: 2.279 ± 0.41
TyrTrp: 0.431 ± 0.15
TyrTyr: 0.678 ± 0.201
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 91 proteins (16234 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
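
One way such an estimate could be obtained is sketched below: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread of the resampled values is reported. This is only a sketch under stated assumptions. The function names, the fixed seed, the toy sequences, and the use of the standard deviation as the reported error are not taken from the page, which states only that bootstrapping (×100) at the protein level was used.

```python
import random
from collections import Counter
from statistics import mean, stdev


def pair_counts(seq):
    """Count overlapping dipeptides within one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_dipeptide(proteins, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, not individual residues.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return mean(estimates), stdev(estimates)


# Hypothetical toy proteome in one-letter code.
toy = ["MAAGLAA", "MKAAV", "MGLVK", "MAALAAG"]
aa_mean, aa_sd = bootstrap_dipeptide(toy, "AA")
```

Resampling at the protein level keeps each sequence intact, so the estimated error reflects variation between proteins rather than between individual positions.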

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski