Amino acid dipepetide frequency for Virgibacillus phage Mimir87

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.684AlaAla: 4.684 ± 1.063
0.344AlaCys: 0.344 ± 0.122
5.235AlaAsp: 5.235 ± 0.703
4.271AlaGlu: 4.271 ± 0.559
2.273AlaPhe: 2.273 ± 0.469
5.579AlaGly: 5.579 ± 1.006
1.309AlaHis: 1.309 ± 0.353
3.788AlaIle: 3.788 ± 0.538
6.199AlaLys: 6.199 ± 0.704
5.786AlaLeu: 5.786 ± 0.564
1.998AlaMet: 1.998 ± 0.395
4.339AlaAsn: 4.339 ± 0.54
1.171AlaPro: 1.171 ± 0.326
2.686AlaGln: 2.686 ± 0.426
2.824AlaArg: 2.824 ± 0.372
3.1AlaSer: 3.1 ± 0.44
3.375AlaThr: 3.375 ± 0.686
3.995AlaVal: 3.995 ± 0.576
0.62AlaTrp: 0.62 ± 0.197
2.893AlaTyr: 2.893 ± 0.5
0.0AlaXaa: 0.0 ± 0.0
Cys
0.207CysAla: 0.207 ± 0.147
0.069CysCys: 0.069 ± 0.071
0.482CysAsp: 0.482 ± 0.194
0.207CysGlu: 0.207 ± 0.115
0.344CysPhe: 0.344 ± 0.174
0.62CysGly: 0.62 ± 0.211
0.207CysHis: 0.207 ± 0.112
0.276CysIle: 0.276 ± 0.151
0.62CysLys: 0.62 ± 0.239
0.482CysLeu: 0.482 ± 0.241
0.207CysMet: 0.207 ± 0.112
0.207CysAsn: 0.207 ± 0.111
0.413CysPro: 0.413 ± 0.162
0.276CysGln: 0.276 ± 0.135
0.276CysArg: 0.276 ± 0.122
0.482CysSer: 0.482 ± 0.209
0.344CysThr: 0.344 ± 0.171
0.413CysVal: 0.413 ± 0.176
0.0CysTrp: 0.0 ± 0.0
0.276CysTyr: 0.276 ± 0.12
0.0CysXaa: 0.0 ± 0.0
Asp
3.651AspAla: 3.651 ± 0.547
0.138AspCys: 0.138 ± 0.094
5.648AspAsp: 5.648 ± 0.937
7.095AspGlu: 7.095 ± 0.644
3.306AspPhe: 3.306 ± 0.533
4.408AspGly: 4.408 ± 0.598
0.895AspHis: 0.895 ± 0.283
5.304AspIle: 5.304 ± 0.558
5.786AspLys: 5.786 ± 0.611
6.13AspLeu: 6.13 ± 0.735
1.446AspMet: 1.446 ± 0.354
3.444AspAsn: 3.444 ± 0.439
1.86AspPro: 1.86 ± 0.376
2.411AspGln: 2.411 ± 0.448
2.893AspArg: 2.893 ± 0.505
3.926AspSer: 3.926 ± 0.537
3.306AspThr: 3.306 ± 0.487
4.133AspVal: 4.133 ± 0.542
0.895AspTrp: 0.895 ± 0.266
2.962AspTyr: 2.962 ± 0.52
0.0AspXaa: 0.0 ± 0.0
Glu
5.51GluAla: 5.51 ± 0.595
0.276GluCys: 0.276 ± 0.14
4.477GluAsp: 4.477 ± 0.685
6.819GluGlu: 6.819 ± 0.814
3.237GluPhe: 3.237 ± 0.59
4.546GluGly: 4.546 ± 0.583
1.722GluHis: 1.722 ± 0.378
6.13GluIle: 6.13 ± 0.567
6.475GluLys: 6.475 ± 0.723
7.715GluLeu: 7.715 ± 0.668
3.1GluMet: 3.1 ± 0.567
4.615GluAsn: 4.615 ± 0.485
2.135GluPro: 2.135 ± 0.359
3.444GluGln: 3.444 ± 0.636
2.893GluArg: 2.893 ± 0.566
4.615GluSer: 4.615 ± 0.719
4.133GluThr: 4.133 ± 0.37
5.648GluVal: 5.648 ± 0.49
0.482GluTrp: 0.482 ± 0.153
2.962GluTyr: 2.962 ± 0.424
0.0GluXaa: 0.0 ± 0.0
Phe
2.273PheAla: 2.273 ± 0.336
0.207PheCys: 0.207 ± 0.114
3.168PheAsp: 3.168 ± 0.438
2.893PheGlu: 2.893 ± 0.449
0.758PhePhe: 0.758 ± 0.235
2.204PheGly: 2.204 ± 0.464
0.62PheHis: 0.62 ± 0.219
2.48PheIle: 2.48 ± 0.394
3.857PheLys: 3.857 ± 0.567
2.824PheLeu: 2.824 ± 0.52
1.102PheMet: 1.102 ± 0.239
2.411PheAsn: 2.411 ± 0.444
0.689PhePro: 0.689 ± 0.185
1.033PheGln: 1.033 ± 0.251
1.998PheArg: 1.998 ± 0.362
2.549PheSer: 2.549 ± 0.356
1.929PheThr: 1.929 ± 0.37
3.031PheVal: 3.031 ± 0.658
0.276PheTrp: 0.276 ± 0.122
1.998PheTyr: 1.998 ± 0.341
0.0PheXaa: 0.0 ± 0.0
Gly
3.995GlyAla: 3.995 ± 0.483
0.276GlyCys: 0.276 ± 0.151
5.097GlyAsp: 5.097 ± 0.691
3.788GlyGlu: 3.788 ± 0.443
2.686GlyPhe: 2.686 ± 0.447
5.097GlyGly: 5.097 ± 0.769
1.24GlyHis: 1.24 ± 0.246
3.788GlyIle: 3.788 ± 0.461
5.786GlyLys: 5.786 ± 1.107
5.786GlyLeu: 5.786 ± 1.028
1.929GlyMet: 1.929 ± 0.606
3.651GlyAsn: 3.651 ± 0.508
1.378GlyPro: 1.378 ± 0.496
2.342GlyGln: 2.342 ± 0.356
2.686GlyArg: 2.686 ± 0.417
3.926GlySer: 3.926 ± 0.794
3.651GlyThr: 3.651 ± 0.633
3.72GlyVal: 3.72 ± 0.541
1.446GlyTrp: 1.446 ± 0.292
2.066GlyTyr: 2.066 ± 0.415
0.0GlyXaa: 0.0 ± 0.0
His
0.895HisAla: 0.895 ± 0.236
0.138HisCys: 0.138 ± 0.095
0.964HisAsp: 0.964 ± 0.261
1.584HisGlu: 1.584 ± 0.319
0.758HisPhe: 0.758 ± 0.192
1.24HisGly: 1.24 ± 0.27
0.62HisHis: 0.62 ± 0.186
0.827HisIle: 0.827 ± 0.269
1.378HisLys: 1.378 ± 0.332
2.066HisLeu: 2.066 ± 0.383
0.207HisMet: 0.207 ± 0.133
1.171HisAsn: 1.171 ± 0.259
0.207HisPro: 0.207 ± 0.168
0.689HisGln: 0.689 ± 0.222
0.62HisArg: 0.62 ± 0.211
1.24HisSer: 1.24 ± 0.244
0.964HisThr: 0.964 ± 0.228
0.964HisVal: 0.964 ± 0.291
0.482HisTrp: 0.482 ± 0.179
0.62HisTyr: 0.62 ± 0.22
0.0HisXaa: 0.0 ± 0.0
Ile
4.615IleAla: 4.615 ± 0.624
0.482IleCys: 0.482 ± 0.214
5.304IleAsp: 5.304 ± 0.762
6.13IleGlu: 6.13 ± 0.638
1.86IlePhe: 1.86 ± 0.429
3.306IleGly: 3.306 ± 0.481
1.102IleHis: 1.102 ± 0.252
4.064IleIle: 4.064 ± 0.43
6.681IleLys: 6.681 ± 0.815
4.271IleLeu: 4.271 ± 0.666
1.515IleMet: 1.515 ± 0.338
3.857IleAsn: 3.857 ± 0.51
1.998IlePro: 1.998 ± 0.474
2.48IleGln: 2.48 ± 0.381
2.48IleArg: 2.48 ± 0.515
3.582IleSer: 3.582 ± 0.593
5.442IleThr: 5.442 ± 0.619
3.1IleVal: 3.1 ± 0.521
0.276IleTrp: 0.276 ± 0.131
2.549IleTyr: 2.549 ± 0.495
0.0IleXaa: 0.0 ± 0.0
Lys
5.579LysAla: 5.579 ± 0.801
0.758LysCys: 0.758 ± 0.271
5.648LysAsp: 5.648 ± 0.843
9.574LysGlu: 9.574 ± 1.18
3.237LysPhe: 3.237 ± 0.385
5.786LysGly: 5.786 ± 0.909
1.722LysHis: 1.722 ± 0.343
5.235LysIle: 5.235 ± 0.718
9.574LysLys: 9.574 ± 1.203
8.266LysLeu: 8.266 ± 0.724
1.998LysMet: 1.998 ± 0.406
4.477LysAsn: 4.477 ± 0.532
2.48LysPro: 2.48 ± 0.406
3.857LysGln: 3.857 ± 0.507
2.686LysArg: 2.686 ± 0.483
5.579LysSer: 5.579 ± 0.506
4.202LysThr: 4.202 ± 0.544
5.028LysVal: 5.028 ± 0.5
0.895LysTrp: 0.895 ± 0.228
3.031LysTyr: 3.031 ± 0.515
0.0LysXaa: 0.0 ± 0.0
Leu
5.648LeuAla: 5.648 ± 0.56
0.344LeuCys: 0.344 ± 0.175
5.304LeuAsp: 5.304 ± 0.491
7.232LeuGlu: 7.232 ± 0.762
3.237LeuPhe: 3.237 ± 0.441
5.442LeuGly: 5.442 ± 0.773
1.24LeuHis: 1.24 ± 0.359
3.995LeuIle: 3.995 ± 0.557
7.37LeuLys: 7.37 ± 0.755
4.753LeuLeu: 4.753 ± 0.549
2.342LeuMet: 2.342 ± 0.392
5.717LeuAsn: 5.717 ± 0.614
2.48LeuPro: 2.48 ± 0.573
3.168LeuGln: 3.168 ± 0.474
3.031LeuArg: 3.031 ± 0.421
6.544LeuSer: 6.544 ± 0.829
4.408LeuThr: 4.408 ± 0.533
4.064LeuVal: 4.064 ± 0.552
0.895LeuTrp: 0.895 ± 0.277
3.582LeuTyr: 3.582 ± 0.521
0.0LeuXaa: 0.0 ± 0.0
Met
2.273MetAla: 2.273 ± 0.447
0.069MetCys: 0.069 ± 0.084
1.86MetAsp: 1.86 ± 0.276
1.722MetGlu: 1.722 ± 0.325
1.24MetPhe: 1.24 ± 0.335
1.515MetGly: 1.515 ± 0.254
0.344MetHis: 0.344 ± 0.18
1.653MetIle: 1.653 ± 0.362
3.375MetLys: 3.375 ± 0.615
2.135MetLeu: 2.135 ± 0.398
1.033MetMet: 1.033 ± 0.3
1.998MetAsn: 1.998 ± 0.355
0.964MetPro: 0.964 ± 0.299
1.102MetGln: 1.102 ± 0.422
1.102MetArg: 1.102 ± 0.243
2.342MetSer: 2.342 ± 0.487
1.309MetThr: 1.309 ± 0.267
0.964MetVal: 0.964 ± 0.216
0.207MetTrp: 0.207 ± 0.102
0.827MetTyr: 0.827 ± 0.251
0.0MetXaa: 0.0 ± 0.0
Asn
3.995AsnAla: 3.995 ± 0.569
0.344AsnCys: 0.344 ± 0.188
3.031AsnAsp: 3.031 ± 0.428
4.822AsnGlu: 4.822 ± 0.623
2.48AsnPhe: 2.48 ± 0.391
3.857AsnGly: 3.857 ± 0.455
1.171AsnHis: 1.171 ± 0.335
3.1AsnIle: 3.1 ± 0.512
5.786AsnLys: 5.786 ± 0.544
4.822AsnLeu: 4.822 ± 0.517
1.584AsnMet: 1.584 ± 0.353
2.686AsnAsn: 2.686 ± 0.561
2.066AsnPro: 2.066 ± 0.339
2.273AsnGln: 2.273 ± 0.385
2.824AsnArg: 2.824 ± 0.442
3.031AsnSer: 3.031 ± 0.493
2.755AsnThr: 2.755 ± 0.623
3.513AsnVal: 3.513 ± 0.47
0.827AsnTrp: 0.827 ± 0.208
1.86AsnTyr: 1.86 ± 0.37
0.0AsnXaa: 0.0 ± 0.0
Pro
1.584ProAla: 1.584 ± 0.292
0.276ProCys: 0.276 ± 0.117
1.86ProAsp: 1.86 ± 0.4
2.755ProGlu: 2.755 ± 0.354
1.171ProPhe: 1.171 ± 0.316
1.033ProGly: 1.033 ± 0.249
0.138ProHis: 0.138 ± 0.097
2.204ProIle: 2.204 ± 0.395
2.066ProLys: 2.066 ± 0.489
1.791ProLeu: 1.791 ± 0.388
0.62ProMet: 0.62 ± 0.182
1.722ProAsn: 1.722 ± 0.302
0.482ProPro: 0.482 ± 0.161
1.033ProGln: 1.033 ± 0.282
1.515ProArg: 1.515 ± 0.402
1.998ProSer: 1.998 ± 0.32
1.722ProThr: 1.722 ± 0.342
2.204ProVal: 2.204 ± 0.346
0.276ProTrp: 0.276 ± 0.156
0.895ProTyr: 0.895 ± 0.261
0.0ProXaa: 0.0 ± 0.0
Gln
2.962GlnAla: 2.962 ± 0.793
0.138GlnCys: 0.138 ± 0.105
2.066GlnAsp: 2.066 ± 0.368
2.411GlnGlu: 2.411 ± 0.407
1.584GlnPhe: 1.584 ± 0.34
2.273GlnGly: 2.273 ± 0.532
0.62GlnHis: 0.62 ± 0.174
2.686GlnIle: 2.686 ± 0.424
3.031GlnLys: 3.031 ± 0.578
3.926GlnLeu: 3.926 ± 0.512
1.584GlnMet: 1.584 ± 0.331
1.86GlnAsn: 1.86 ± 0.302
1.171GlnPro: 1.171 ± 0.324
2.755GlnGln: 2.755 ± 0.662
2.411GlnArg: 2.411 ± 0.364
1.929GlnSer: 1.929 ± 0.333
2.342GlnThr: 2.342 ± 0.378
1.24GlnVal: 1.24 ± 0.247
0.276GlnTrp: 0.276 ± 0.133
1.653GlnTyr: 1.653 ± 0.311
0.0GlnXaa: 0.0 ± 0.0
Arg
2.273ArgAla: 2.273 ± 0.322
0.551ArgCys: 0.551 ± 0.186
3.651ArgAsp: 3.651 ± 0.409
3.031ArgGlu: 3.031 ± 0.46
1.86ArgPhe: 1.86 ± 0.324
2.066ArgGly: 2.066 ± 0.389
0.413ArgHis: 0.413 ± 0.157
3.168ArgIle: 3.168 ± 0.567
3.857ArgLys: 3.857 ± 0.599
3.031ArgLeu: 3.031 ± 0.45
1.378ArgMet: 1.378 ± 0.321
1.791ArgAsn: 1.791 ± 0.322
1.378ArgPro: 1.378 ± 0.342
1.309ArgGln: 1.309 ± 0.292
1.722ArgArg: 1.722 ± 0.345
2.411ArgSer: 2.411 ± 0.368
1.929ArgThr: 1.929 ± 0.333
2.411ArgVal: 2.411 ± 0.38
0.62ArgTrp: 0.62 ± 0.178
2.342ArgTyr: 2.342 ± 0.506
0.0ArgXaa: 0.0 ± 0.0
Ser
4.133SerAla: 4.133 ± 0.618
0.551SerCys: 0.551 ± 0.195
3.857SerAsp: 3.857 ± 0.683
4.339SerGlu: 4.339 ± 0.611
2.755SerPhe: 2.755 ± 0.459
4.89SerGly: 4.89 ± 0.587
1.033SerHis: 1.033 ± 0.28
5.235SerIle: 5.235 ± 0.628
4.959SerLys: 4.959 ± 0.509
5.028SerLeu: 5.028 ± 0.745
1.722SerMet: 1.722 ± 0.49
3.72SerAsn: 3.72 ± 0.489
1.515SerPro: 1.515 ± 0.333
1.791SerGln: 1.791 ± 0.352
2.342SerArg: 2.342 ± 0.311
4.339SerSer: 4.339 ± 0.579
3.513SerThr: 3.513 ± 0.769
3.995SerVal: 3.995 ± 0.535
0.62SerTrp: 0.62 ± 0.251
1.722SerTyr: 1.722 ± 0.355
0.0SerXaa: 0.0 ± 0.0
Thr
5.028ThrAla: 5.028 ± 0.839
0.276ThrCys: 0.276 ± 0.113
3.926ThrAsp: 3.926 ± 0.467
3.72ThrGlu: 3.72 ± 0.492
1.791ThrPhe: 1.791 ± 0.42
3.995ThrGly: 3.995 ± 0.46
1.033ThrHis: 1.033 ± 0.24
3.788ThrIle: 3.788 ± 0.57
4.477ThrLys: 4.477 ± 0.558
3.926ThrLeu: 3.926 ± 0.541
0.964ThrMet: 0.964 ± 0.213
2.893ThrAsn: 2.893 ± 0.566
1.584ThrPro: 1.584 ± 0.365
2.411ThrGln: 2.411 ± 0.475
2.066ThrArg: 2.066 ± 0.398
3.306ThrSer: 3.306 ± 0.426
3.995ThrThr: 3.995 ± 0.622
4.408ThrVal: 4.408 ± 0.41
0.758ThrTrp: 0.758 ± 0.211
1.998ThrTyr: 1.998 ± 0.321
0.0ThrXaa: 0.0 ± 0.0
Val
3.995ValAla: 3.995 ± 0.545
0.413ValCys: 0.413 ± 0.204
4.271ValAsp: 4.271 ± 0.472
4.615ValGlu: 4.615 ± 0.602
1.584ValPhe: 1.584 ± 0.333
2.893ValGly: 2.893 ± 0.399
1.102ValHis: 1.102 ± 0.246
4.271ValIle: 4.271 ± 0.447
4.615ValLys: 4.615 ± 0.618
4.408ValLeu: 4.408 ± 0.562
1.653ValMet: 1.653 ± 0.334
3.72ValAsn: 3.72 ± 0.549
1.929ValPro: 1.929 ± 0.405
2.273ValGln: 2.273 ± 0.26
2.686ValArg: 2.686 ± 0.311
4.753ValSer: 4.753 ± 0.664
4.133ValThr: 4.133 ± 0.579
3.582ValVal: 3.582 ± 0.435
0.62ValTrp: 0.62 ± 0.211
2.549ValTyr: 2.549 ± 0.466
0.0ValXaa: 0.0 ± 0.0
Trp
0.551TrpAla: 0.551 ± 0.199
0.069TrpCys: 0.069 ± 0.075
0.758TrpAsp: 0.758 ± 0.229
0.964TrpGlu: 0.964 ± 0.256
0.482TrpPhe: 0.482 ± 0.209
0.62TrpGly: 0.62 ± 0.194
0.138TrpHis: 0.138 ± 0.108
0.895TrpIle: 0.895 ± 0.256
1.24TrpLys: 1.24 ± 0.28
0.758TrpLeu: 0.758 ± 0.226
0.276TrpMet: 0.276 ± 0.183
0.207TrpAsn: 0.207 ± 0.112
0.069TrpPro: 0.069 ± 0.064
0.482TrpGln: 0.482 ± 0.17
0.344TrpArg: 0.344 ± 0.171
0.62TrpSer: 0.62 ± 0.201
0.827TrpThr: 0.827 ± 0.283
1.102TrpVal: 1.102 ± 0.399
0.0TrpTrp: 0.0 ± 0.0
0.413TrpTyr: 0.413 ± 0.177
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.617TyrAla: 2.617 ± 0.367
0.689TyrCys: 0.689 ± 0.301
3.031TyrAsp: 3.031 ± 0.438
2.962TyrGlu: 2.962 ± 0.356
1.653TyrPhe: 1.653 ± 0.368
2.962TyrGly: 2.962 ± 0.44
0.964TyrHis: 0.964 ± 0.273
2.204TyrIle: 2.204 ± 0.374
2.48TyrLys: 2.48 ± 0.363
2.893TyrLeu: 2.893 ± 0.453
1.309TyrMet: 1.309 ± 0.424
2.342TyrAsn: 2.342 ± 0.439
1.378TyrPro: 1.378 ± 0.243
1.102TyrGln: 1.102 ± 0.299
1.791TyrArg: 1.791 ± 0.357
1.791TyrSer: 1.791 ± 0.357
2.135TyrThr: 2.135 ± 0.372
2.549TyrVal: 2.549 ± 0.55
0.344TyrTrp: 0.344 ± 0.158
1.998TyrTyr: 1.998 ± 0.469
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 68 proteins (14519 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski