Amino acid dipeptide frequency for Mycoplasma phage phiMFV1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
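As a rough illustration of how such a frequency can be computed, the sketch below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. The function name `dipeptide_permille` and the toy sequences are made up for this example, and it uses one-letter codes rather than the three-letter codes of the table. The exact counting convention used by Proteome-pI (for instance, how protein boundaries are handled) is not stated on this page, so the convention here is an assumption.

```python
from collections import Counter

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for one-letter sequences.

    Assumption: overlapping pairs are counted within each protein only
    (no pair spans two proteins), so a protein of length n contributes
    n - 1 dipeptides.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input, not real phiMFV1 proteins.
freqs = dipeptide_permille(["MKKLN", "MKL"])
```

Here the two toy sequences contribute six dipeptides in total, so the resulting per mille values sum to 1000.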

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of them would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility, dipeptides that are more frequent than expected are marked in red and those that are underrepresented are marked in blue in the table.
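The uniform expectation and the red/blue marking can be sketched as follows. The helper name `classify` is hypothetical, and the plain comparison against the uniform expectation is an assumption: the page does not state the exact rule behind the coloring. The three example values are copied from the table on this page.

```python
N_STANDARD = 20  # standard amino acids
N_WITH_XAA = 21  # plus Xaa for non-standard or unknown residues

# Expected frequency of each dipeptide under a uniform random model, in per mille.
expected_permille = 1000.0 / N_STANDARD**2       # 2.5
expected_permille_xaa = 1000.0 / N_WITH_XAA**2   # about 2.27

def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected:
        return "over"   # would be marked red
    if observed_permille < expected:
        return "under"  # would be marked blue
    return "as expected"

# Values in per mille copied from the table.
observed = {"LeuLys": 13.878, "AlaAla": 2.449, "CysCys": 0.0}
labels = {pair: classify(value) for pair, value in observed.items()}
```

Under this rule, LeuLys comes out as overrepresented, while AlaAla (just below 2.5‰) and CysCys come out as underrepresented.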

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second; each cell is frequency ± error in ‰.

First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 2.449 ± 0.927 | 0.204 ± 0.195 | 1.224 ± 0.484 | 1.429 ± 0.471 | 4.082 ± 0.933 | 1.429 ± 0.51 | 0.612 ± 0.423 | 5.102 ± 0.813 | 4.898 ± 0.953 | 4.898 ± 0.831 | 0.816 ± 0.29 | 3.265 ± 0.901 | 1.02 ± 0.625 | 1.224 ± 0.406 | 1.429 ± 0.394 | 2.653 ± 0.656 | 1.633 ± 0.708 | 1.224 ± 0.683 | 1.02 ± 0.353 | 2.041 ± 0.432 | 0.0 ± 0.0
Cys | 0.408 ± 0.272 | 0.0 ± 0.0 | 0.408 ± 0.31 | 0.816 ± 0.379 | 0.408 ± 0.268 | 0.408 ± 0.279 | 0.204 ± 0.243 | 0.816 ± 0.313 | 0.408 ± 0.266 | 0.612 ± 0.375 | 0.0 ± 0.0 | 0.408 ± 0.278 | 0.0 ± 0.0 | 0.408 ± 0.261 | 0.204 ± 0.198 | 0.816 ± 0.462 | 0.0 ± 0.0 | 0.204 ± 0.186 | 0.0 ± 0.0 | 0.204 ± 0.243 | 0.0 ± 0.0
Asp | 0.612 ± 0.325 | 0.204 ± 0.211 | 1.429 ± 0.494 | 2.653 ± 0.676 | 5.51 ± 1.331 | 1.02 ± 0.401 | 0.816 ± 0.344 | 4.49 ± 0.782 | 4.49 ± 0.946 | 6.939 ± 0.948 | 1.02 ± 0.58 | 4.898 ± 1.104 | 1.429 ± 0.537 | 1.429 ± 0.455 | 1.429 ± 0.469 | 3.061 ± 1.006 | 1.224 ± 0.475 | 0.816 ± 0.355 | 0.408 ± 0.273 | 3.265 ± 0.916 | 0.0 ± 0.0
Glu | 3.673 ± 1.108 | 0.612 ± 0.356 | 3.061 ± 0.771 | 6.939 ± 1.049 | 1.837 ± 0.634 | 1.837 ± 0.6 | 0.816 ± 0.457 | 7.347 ± 1.526 | 7.755 ± 1.385 | 8.98 ± 1.814 | 1.224 ± 0.465 | 7.551 ± 1.551 | 2.449 ± 0.766 | 4.286 ± 0.883 | 2.041 ± 0.61 | 4.49 ± 1.44 | 3.265 ± 0.854 | 3.061 ± 0.565 | 1.429 ± 0.619 | 3.673 ± 0.893 | 0.0 ± 0.0
Phe | 3.061 ± 0.705 | 0.612 ± 0.366 | 3.061 ± 0.58 | 4.898 ± 0.845 | 3.265 ± 0.666 | 2.449 ± 0.506 | 0.204 ± 0.198 | 4.694 ± 0.784 | 8.163 ± 1.365 | 6.939 ± 1.292 | 1.02 ± 0.425 | 4.694 ± 0.98 | 0.816 ± 0.437 | 2.653 ± 0.758 | 0.612 ± 0.332 | 2.857 ± 0.514 | 2.653 ± 1.031 | 4.082 ± 0.868 | 0.816 ± 0.41 | 3.061 ± 0.796 | 0.0 ± 0.0
Gly | 1.429 ± 0.438 | 0.0 ± 0.0 | 1.02 ± 0.439 | 2.245 ± 0.691 | 2.653 ± 0.826 | 1.224 ± 0.441 | 0.612 ± 0.353 | 4.082 ± 0.893 | 1.837 ± 0.573 | 4.898 ± 0.903 | 1.02 ± 0.467 | 4.082 ± 0.706 | 0.0 ± 0.0 | 1.02 ± 0.571 | 0.816 ± 0.482 | 2.653 ± 0.682 | 3.061 ± 0.877 | 2.449 ± 0.796 | 0.408 ± 0.273 | 0.816 ± 0.374 | 0.0 ± 0.0
His | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.204 ± 0.211 | 0.816 ± 0.411 | 0.408 ± 0.288 | 0.408 ± 0.235 | 0.0 ± 0.0 | 1.02 ± 0.328 | 0.816 ± 0.52 | 1.224 ± 0.428 | 0.204 ± 0.211 | 1.224 ± 0.403 | 0.408 ± 0.408 | 0.408 ± 0.296 | 0.0 ± 0.0 | 1.02 ± 0.528 | 0.204 ± 0.211 | 0.612 ± 0.325 | 0.204 ± 0.204 | 1.224 ± 0.846 | 0.0 ± 0.0
Ile | 4.49 ± 1.039 | 0.816 ± 0.312 | 7.755 ± 1.017 | 7.347 ± 1.47 | 4.898 ± 0.687 | 3.061 ± 0.813 | 0.612 ± 0.256 | 5.918 ± 0.955 | 11.837 ± 1.424 | 4.082 ± 0.988 | 1.837 ± 0.447 | 9.592 ± 1.643 | 3.061 ± 0.685 | 3.061 ± 0.814 | 1.633 ± 0.637 | 5.306 ± 1.105 | 2.653 ± 0.898 | 4.286 ± 0.804 | 1.02 ± 0.363 | 5.102 ± 1.115 | 0.0 ± 0.0
Lys | 5.918 ± 1.559 | 0.612 ± 0.357 | 5.918 ± 1.201 | 11.02 ± 1.336 | 6.122 ± 1.183 | 3.673 ± 1.412 | 1.02 ± 0.414 | 10.408 ± 1.145 | 12.245 ± 1.614 | 9.592 ± 1.582 | 1.837 ± 0.602 | 12.041 ± 0.959 | 3.061 ± 0.765 | 4.694 ± 0.881 | 3.061 ± 0.801 | 7.347 ± 1.257 | 6.122 ± 1.108 | 5.306 ± 0.746 | 1.02 ± 0.391 | 4.898 ± 0.667 | 0.0 ± 0.0
Leu | 3.673 ± 0.96 | 0.408 ± 0.279 | 5.102 ± 0.871 | 8.163 ± 1.604 | 4.286 ± 0.76 | 3.061 ± 0.949 | 0.612 ± 0.313 | 8.163 ± 1.278 | 13.878 ± 1.877 | 9.592 ± 1.727 | 1.633 ± 0.615 | 10.816 ± 1.216 | 1.837 ± 1.028 | 2.653 ± 0.855 | 3.469 ± 0.783 | 8.367 ± 1.243 | 5.51 ± 0.797 | 5.51 ± 1.241 | 1.224 ± 0.418 | 3.469 ± 0.875 | 0.0 ± 0.0
Met | 0.816 ± 0.351 | 0.0 ± 0.0 | 1.02 ± 0.403 | 1.837 ± 0.739 | 2.041 ± 0.629 | 0.408 ± 0.266 | 0.408 ± 0.284 | 1.633 ± 0.621 | 1.224 ± 0.49 | 1.633 ± 0.516 | 0.0 ± 0.0 | 1.633 ± 0.57 | 0.816 ± 0.451 | 0.408 ± 0.254 | 0.204 ± 0.195 | 1.633 ± 0.483 | 0.816 ± 0.398 | 0.204 ± 0.204 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Asn | 3.469 ± 0.69 | 0.816 ± 0.48 | 3.265 ± 0.88 | 8.776 ± 1.794 | 5.714 ± 1.031 | 4.082 ± 0.949 | 1.02 ± 0.467 | 8.776 ± 1.071 | 11.429 ± 1.268 | 9.592 ± 1.468 | 0.816 ± 0.436 | 8.776 ± 1.522 | 2.857 ± 0.877 | 4.694 ± 1.076 | 1.02 ± 0.605 | 5.102 ± 1.185 | 3.878 ± 1.013 | 4.898 ± 0.76 | 2.245 ± 0.469 | 4.082 ± 0.857 | 0.0 ± 0.0
Pro | 1.02 ± 0.437 | 0.0 ± 0.0 | 1.02 ± 0.543 | 1.429 ± 0.479 | 1.429 ± 0.589 | 1.633 ± 0.57 | 0.204 ± 0.198 | 2.449 ± 0.821 | 4.694 ± 1.089 | 1.633 ± 0.503 | 0.0 ± 0.0 | 2.041 ± 0.694 | 0.816 ± 0.393 | 0.408 ± 0.235 | 1.02 ± 0.537 | 1.02 ± 0.325 | 1.224 ± 0.449 | 1.224 ± 0.32 | 0.816 ± 0.382 | 1.633 ± 0.426 | 0.0 ± 0.0
Gln | 2.245 ± 0.582 | 0.204 ± 0.198 | 1.429 ± 0.428 | 2.653 ± 0.842 | 1.224 ± 0.443 | 1.429 ± 0.702 | 0.408 ± 0.274 | 3.673 ± 1.081 | 5.102 ± 0.781 | 5.306 ± 0.998 | 1.429 ± 0.465 | 4.082 ± 0.939 | 1.224 ± 0.548 | 0.612 ± 0.287 | 0.612 ± 0.358 | 1.633 ± 0.503 | 1.633 ± 0.395 | 2.245 ± 0.611 | 0.408 ± 0.266 | 0.816 ± 0.306 | 0.0 ± 0.0
Arg | 1.02 ± 0.351 | 0.0 ± 0.0 | 0.612 ± 0.405 | 1.429 ± 0.615 | 2.245 ± 0.596 | 0.612 ± 0.467 | 0.204 ± 0.211 | 2.653 ± 0.632 | 2.041 ± 0.493 | 2.449 ± 0.677 | 0.408 ± 0.272 | 2.245 ± 0.783 | 0.612 ± 0.292 | 1.02 ± 0.467 | 0.204 ± 0.198 | 1.633 ± 0.492 | 1.837 ± 0.483 | 0.0 ± 0.0 | 0.612 ± 0.321 | 1.633 ± 0.575 | 0.0 ± 0.0
Ser | 2.041 ± 0.578 | 0.612 ± 0.319 | 3.265 ± 1.064 | 4.49 ± 1.154 | 4.694 ± 1.124 | 2.245 ± 0.682 | 0.816 ± 0.424 | 5.102 ± 0.953 | 7.755 ± 1.017 | 6.531 ± 1.18 | 1.429 ± 0.565 | 5.51 ± 0.691 | 1.429 ± 0.631 | 2.857 ± 0.583 | 1.429 ± 0.41 | 4.082 ± 1.446 | 3.061 ± 1.098 | 2.449 ± 0.728 | 1.837 ± 0.75 | 1.429 ± 0.413 | 0.0 ± 0.0
Thr | 1.633 ± 0.401 | 0.204 ± 0.19 | 1.02 ± 0.633 | 3.265 ± 0.839 | 3.265 ± 0.849 | 1.837 ± 0.592 | 0.612 ± 0.45 | 5.102 ± 0.812 | 5.918 ± 0.9 | 4.898 ± 1.119 | 0.612 ± 0.275 | 3.265 ± 0.785 | 1.837 ± 0.608 | 1.837 ± 0.616 | 1.837 ± 0.646 | 2.653 ± 0.805 | 2.857 ± 1.023 | 1.429 ± 0.748 | 0.204 ± 0.201 | 2.245 ± 0.858 | 0.0 ± 0.0
Val | 2.245 ± 0.545 | 0.612 ± 0.373 | 2.245 ± 0.6 | 3.469 ± 1.057 | 3.061 ± 0.752 | 1.837 ± 0.789 | 0.612 ± 0.447 | 2.653 ± 0.569 | 5.102 ± 0.992 | 3.673 ± 1.017 | 0.204 ± 0.249 | 5.102 ± 0.913 | 1.633 ± 0.49 | 2.449 ± 0.638 | 1.02 ± 0.329 | 2.449 ± 0.644 | 1.429 ± 0.564 | 2.245 ± 0.827 | 0.816 ± 0.335 | 2.653 ± 0.627 | 0.0 ± 0.0
Trp | 0.612 ± 0.456 | 0.204 ± 0.198 | 1.02 ± 0.444 | 0.816 ± 0.405 | 0.204 ± 0.211 | 1.429 ± 0.484 | 0.0 ± 0.0 | 0.816 ± 0.519 | 1.837 ± 0.822 | 2.653 ± 0.574 | 0.612 ± 0.394 | 1.429 ± 0.431 | 0.0 ± 0.0 | 0.408 ± 0.272 | 0.204 ± 0.204 | 0.408 ± 0.284 | 1.02 ± 0.385 | 1.224 ± 0.633 | 0.0 ± 0.0 | 0.204 ± 0.186 | 0.0 ± 0.0
Tyr | 1.837 ± 0.615 | 0.612 ± 0.388 | 3.061 ± 1.075 | 1.837 ± 0.739 | 3.265 ± 0.66 | 2.245 ± 0.73 | 0.612 ± 0.362 | 3.469 ± 0.599 | 4.49 ± 0.951 | 5.51 ± 0.91 | 0.408 ± 0.245 | 2.653 ± 0.849 | 0.408 ± 0.247 | 1.837 ± 0.713 | 1.224 ± 0.598 | 3.878 ± 0.623 | 2.449 ± 1.134 | 2.041 ± 0.727 | 0.612 ± 0.29 | 1.837 ± 0.601 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 17 proteins (4901 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
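A protein-level bootstrap of this kind might look like the sketch below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread across resamples is reported as the error. The function names, the fixed seed, the one-letter codes, the toy sequences, and the use of the standard deviation as the error measure are all assumptions made for illustration. The page itself only states that bootstrapping was done 100 times at the protein level.

```python
import random
import statistics

def pair_permille(proteins, pair):
    """Frequency of one dipeptide in per mille, counting overlapping pairs within each protein."""
    hits = sum(seq[i:i + 2] == pair for seq in proteins for i in range(len(seq) - 1))
    total = sum(max(len(seq) - 1, 0) for seq in proteins)
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples (assumed error measure)."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)

# Toy sequences, not real phiMFV1 proteins.
toy = ["MKKLNKK", "MLLNE", "MKELK", "MNNIK"]
err = bootstrap_error(toy, "KK")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects how much the statistic depends on which proteins happen to be in the proteome.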

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski