Amino acid dipeptide frequency for Vibrio phage K05K4_VK05K4_1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
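The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used to build the table: the function names (`dipeptide_per_mille`, `classify`) are hypothetical, and it assumes that dipeptides are counted as overlapping pairs within each protein and that any non-standard letter is mapped to X (Xaa).

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues
EXPECTED_PER_MILLE = 1000.0 / 400     # 0.25% = 2.5 per mille


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins.

    Assumption: overlapping pairs are counted, and pairs never span
    two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard letters to X (Xaa).
        seq = "".join(c if c in AMINO_ACIDS else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_per_mille):
    """Label a frequency relative to the uniform expectation of 2.5 per mille."""
    if freq_per_mille > EXPECTED_PER_MILLE:
        return "over"   # would be marked in red
    if freq_per_mille < EXPECTED_PER_MILLE:
        return "under"  # would be marked in blue
    return "expected"
```

For example, `classify(5.332)` returns `"over"` for AlaAla and `classify(0.79)` returns `"under"` for AlaCys, using the values from the table.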

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.332 ± 1.161
AlaCys: 0.79 ± 0.371
AlaAsp: 2.172 ± 0.614
AlaGlu: 3.357 ± 0.577
AlaPhe: 1.58 ± 0.549
AlaGly: 3.555 ± 0.539
AlaHis: 1.777 ± 0.433
AlaIle: 4.147 ± 0.801
AlaLys: 7.306 ± 1.215
AlaLeu: 8.491 ± 1.079
AlaMet: 2.37 ± 0.642
AlaAsn: 2.962 ± 0.765
AlaPro: 2.567 ± 0.964
AlaGln: 2.37 ± 0.811
AlaArg: 4.147 ± 0.734
AlaSer: 4.344 ± 0.97
AlaThr: 1.975 ± 0.72
AlaVal: 4.937 ± 0.967
AlaTrp: 1.58 ± 0.466
AlaTyr: 4.542 ± 1.1
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.58 ± 0.482
CysCys: 0.0 ± 0.0
CysAsp: 1.185 ± 0.417
CysGlu: 2.37 ± 0.862
CysPhe: 1.58 ± 0.448
CysGly: 1.975 ± 0.469
CysHis: 1.777 ± 0.523
CysIle: 0.987 ± 0.391
CysLys: 1.58 ± 0.377
CysLeu: 1.382 ± 0.454
CysMet: 0.592 ± 0.323
CysAsn: 0.395 ± 0.271
CysPro: 0.592 ± 0.323
CysGln: 0.592 ± 0.323
CysArg: 0.592 ± 0.31
CysSer: 0.592 ± 0.374
CysThr: 1.975 ± 0.728
CysVal: 0.197 ± 0.188
CysTrp: 0.0 ± 0.0
CysTyr: 1.185 ± 0.322
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.975 ± 0.487
AspCys: 1.975 ± 0.923
AspAsp: 6.714 ± 1.162
AspGlu: 2.765 ± 0.58
AspPhe: 2.37 ± 0.588
AspGly: 8.096 ± 1.155
AspHis: 0.79 ± 0.395
AspIle: 5.332 ± 0.939
AspLys: 1.185 ± 0.593
AspLeu: 6.319 ± 0.861
AspMet: 2.172 ± 0.431
AspAsn: 1.975 ± 0.764
AspPro: 4.147 ± 1.372
AspGln: 0.395 ± 0.319
AspArg: 0.79 ± 0.371
AspSer: 3.357 ± 0.903
AspThr: 7.109 ± 1.686
AspVal: 4.937 ± 1.154
AspTrp: 1.777 ± 0.454
AspTyr: 2.765 ± 0.567
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.739 ± 0.981
GluCys: 1.185 ± 0.68
GluAsp: 3.949 ± 0.854
GluGlu: 2.567 ± 0.555
GluPhe: 1.975 ± 0.495
GluGly: 1.58 ± 0.477
GluHis: 0.0 ± 0.0
GluIle: 4.344 ± 0.916
GluLys: 2.962 ± 0.754
GluLeu: 6.122 ± 1.059
GluMet: 0.79 ± 0.451
GluAsn: 3.752 ± 1.154
GluPro: 1.382 ± 0.579
GluGln: 5.529 ± 1.043
GluArg: 1.975 ± 0.394
GluSer: 3.949 ± 0.698
GluThr: 1.382 ± 0.569
GluVal: 2.37 ± 0.58
GluTrp: 1.185 ± 0.508
GluTyr: 2.567 ± 0.5
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.949 ± 0.995
PheCys: 0.592 ± 0.31
PheAsp: 3.949 ± 0.672
PheGlu: 3.357 ± 0.733
PhePhe: 1.58 ± 0.536
PheGly: 3.555 ± 1.063
PheHis: 1.185 ± 0.41
PheIle: 1.382 ± 0.483
PheLys: 2.172 ± 0.744
PheLeu: 1.58 ± 0.48
PheMet: 0.197 ± 0.187
PheAsn: 2.962 ± 0.61
PhePro: 1.58 ± 0.596
PheGln: 1.58 ± 0.582
PheArg: 2.172 ± 0.739
PheSer: 2.37 ± 0.619
PheThr: 2.962 ± 0.675
PheVal: 3.16 ± 0.589
PheTrp: 1.382 ± 0.378
PheTyr: 2.567 ± 0.748
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.949 ± 0.831
GlyCys: 2.172 ± 0.55
GlyAsp: 4.937 ± 1.165
GlyGlu: 2.765 ± 0.541
GlyPhe: 3.555 ± 0.833
GlyGly: 3.949 ± 0.778
GlyHis: 0.197 ± 0.188
GlyIle: 5.332 ± 0.951
GlyLys: 5.134 ± 0.893
GlyLeu: 7.109 ± 0.934
GlyMet: 2.37 ± 0.733
GlyAsn: 1.382 ± 0.511
GlyPro: 1.777 ± 0.385
GlyGln: 2.765 ± 0.633
GlyArg: 2.962 ± 0.856
GlySer: 4.542 ± 0.747
GlyThr: 4.739 ± 0.788
GlyVal: 5.529 ± 1.033
GlyTrp: 0.0 ± 0.0
GlyTyr: 2.37 ± 0.812
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.382 ± 0.502
HisCys: 0.79 ± 0.426
HisAsp: 1.777 ± 0.455
HisGlu: 1.777 ± 0.645
HisPhe: 1.382 ± 0.458
HisGly: 0.395 ± 0.266
HisHis: 0.79 ± 0.381
HisIle: 2.172 ± 0.746
HisLys: 0.592 ± 0.352
HisLeu: 1.185 ± 0.481
HisMet: 0.592 ± 0.285
HisAsn: 0.592 ± 0.279
HisPro: 0.395 ± 0.284
HisGln: 0.592 ± 0.366
HisArg: 0.79 ± 0.487
HisSer: 0.592 ± 0.322
HisThr: 0.197 ± 0.188
HisVal: 0.592 ± 0.31
HisTrp: 0.592 ± 0.31
HisTyr: 1.382 ± 0.43
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.517 ± 1.433
IleCys: 0.987 ± 0.329
IleAsp: 4.147 ± 0.674
IleGlu: 3.949 ± 0.629
IlePhe: 1.777 ± 0.595
IleGly: 3.555 ± 0.879
IleHis: 1.382 ± 0.428
IleIle: 2.765 ± 0.693
IleLys: 4.344 ± 0.929
IleLeu: 4.147 ± 0.77
IleMet: 0.987 ± 0.357
IleAsn: 3.16 ± 0.678
IlePro: 3.357 ± 0.742
IleGln: 2.172 ± 0.716
IleArg: 2.567 ± 0.627
IleSer: 4.344 ± 0.918
IleThr: 3.752 ± 0.786
IleVal: 1.382 ± 0.522
IleTrp: 0.197 ± 0.188
IleTyr: 3.357 ± 0.868
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.122 ± 0.787
LysCys: 0.592 ± 0.376
LysAsp: 4.147 ± 0.659
LysGlu: 0.79 ± 0.38
LysPhe: 2.962 ± 0.934
LysGly: 2.765 ± 0.902
LysHis: 2.765 ± 0.474
LysIle: 2.765 ± 0.609
LysLys: 6.714 ± 1.193
LysLeu: 4.344 ± 1.185
LysMet: 1.777 ± 0.822
LysAsn: 4.147 ± 0.663
LysPro: 2.37 ± 0.698
LysGln: 2.567 ± 0.548
LysArg: 4.344 ± 0.782
LysSer: 6.319 ± 0.892
LysThr: 1.777 ± 0.429
LysVal: 2.765 ± 1.111
LysTrp: 1.185 ± 0.426
LysTyr: 0.79 ± 0.383
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.319 ± 1.147
LeuCys: 1.777 ± 0.548
LeuAsp: 4.147 ± 0.85
LeuGlu: 6.517 ± 0.986
LeuPhe: 2.567 ± 0.838
LeuGly: 7.306 ± 0.946
LeuHis: 1.777 ± 0.737
LeuIle: 3.949 ± 0.725
LeuLys: 5.332 ± 0.627
LeuLeu: 7.899 ± 1.792
LeuMet: 2.962 ± 0.872
LeuAsn: 6.714 ± 0.902
LeuPro: 3.357 ± 0.606
LeuGln: 2.765 ± 0.596
LeuArg: 3.752 ± 0.699
LeuSer: 6.714 ± 1.22
LeuThr: 5.727 ± 1.32
LeuVal: 5.727 ± 1.111
LeuTrp: 0.79 ± 0.411
LeuTyr: 0.79 ± 0.353
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.172 ± 0.964
MetCys: 0.0 ± 0.0
MetAsp: 1.975 ± 0.614
MetGlu: 0.0 ± 0.0
MetPhe: 0.79 ± 0.419
MetGly: 0.395 ± 0.243
MetHis: 0.395 ± 0.3
MetIle: 1.185 ± 0.517
MetLys: 1.975 ± 0.77
MetLeu: 2.172 ± 0.867
MetMet: 0.197 ± 0.226
MetAsn: 2.765 ± 1.098
MetPro: 0.987 ± 0.464
MetGln: 0.197 ± 0.188
MetArg: 1.975 ± 0.559
MetSer: 1.58 ± 0.468
MetThr: 2.37 ± 0.644
MetVal: 2.765 ± 0.623
MetTrp: 0.592 ± 0.365
MetTyr: 1.185 ± 0.498
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.949 ± 1.256
AsnCys: 0.0 ± 0.0
AsnAsp: 2.765 ± 0.582
AsnGlu: 2.567 ± 0.696
AsnPhe: 1.58 ± 0.603
AsnGly: 2.765 ± 0.453
AsnHis: 0.197 ± 0.188
AsnIle: 3.16 ± 0.844
AsnLys: 4.937 ± 0.89
AsnLeu: 0.79 ± 0.418
AsnMet: 0.79 ± 0.376
AsnAsn: 0.987 ± 0.393
AsnPro: 4.344 ± 0.756
AsnGln: 4.542 ± 1.334
AsnArg: 0.197 ± 0.206
AsnSer: 3.752 ± 0.879
AsnThr: 6.912 ± 1.632
AsnVal: 2.962 ± 0.859
AsnTrp: 0.0 ± 0.0
AsnTyr: 1.58 ± 0.521
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.777 ± 0.648
ProCys: 0.0 ± 0.0
ProAsp: 7.306 ± 1.93
ProGlu: 2.172 ± 0.551
ProPhe: 3.555 ± 0.761
ProGly: 0.395 ± 0.279
ProHis: 0.592 ± 0.365
ProIle: 2.962 ± 0.52
ProLys: 0.592 ± 0.31
ProLeu: 2.765 ± 1.064
ProMet: 0.395 ± 0.257
ProAsn: 0.197 ± 0.204
ProPro: 2.37 ± 0.473
ProGln: 1.382 ± 0.528
ProArg: 1.777 ± 0.59
ProSer: 3.555 ± 1.131
ProThr: 4.937 ± 1.029
ProVal: 4.739 ± 1.01
ProTrp: 0.0 ± 0.0
ProTyr: 1.382 ± 0.764
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.555 ± 1.074
GlnCys: 1.58 ± 0.509
GlnAsp: 0.987 ± 0.429
GlnGlu: 1.777 ± 0.52
GlnPhe: 2.765 ± 0.523
GlnGly: 1.58 ± 0.452
GlnHis: 0.592 ± 0.372
GlnIle: 4.542 ± 0.666
GlnLys: 1.777 ± 0.402
GlnLeu: 4.147 ± 0.951
GlnMet: 0.197 ± 0.188
GlnAsn: 1.58 ± 0.522
GlnPro: 0.987 ± 0.476
GlnGln: 2.765 ± 0.882
GlnArg: 1.58 ± 0.457
GlnSer: 3.949 ± 0.953
GlnThr: 1.382 ± 0.414
GlnVal: 1.58 ± 0.317
GlnTrp: 1.58 ± 0.683
GlnTyr: 2.765 ± 0.805
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.16 ± 0.748
ArgCys: 2.765 ± 1.052
ArgAsp: 1.777 ± 0.705
ArgGlu: 2.765 ± 0.978
ArgPhe: 1.382 ± 0.424
ArgGly: 3.16 ± 0.888
ArgHis: 1.185 ± 0.459
ArgIle: 3.555 ± 0.894
ArgLys: 2.172 ± 0.66
ArgLeu: 4.937 ± 1.302
ArgMet: 1.185 ± 0.723
ArgAsn: 1.58 ± 0.481
ArgPro: 2.765 ± 0.635
ArgGln: 0.79 ± 0.352
ArgArg: 5.529 ± 1.1
ArgSer: 2.37 ± 0.956
ArgThr: 2.962 ± 0.758
ArgVal: 1.975 ± 0.736
ArgTrp: 0.395 ± 0.243
ArgTyr: 1.777 ± 0.518
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.949 ± 0.862
SerCys: 1.382 ± 0.545
SerAsp: 3.16 ± 0.681
SerGlu: 5.529 ± 1.024
SerPhe: 3.16 ± 1.086
SerGly: 6.319 ± 0.976
SerHis: 0.79 ± 0.322
SerIle: 5.134 ± 1.032
SerLys: 0.987 ± 0.441
SerLeu: 6.912 ± 1.463
SerMet: 4.344 ± 0.864
SerAsn: 1.975 ± 0.655
SerPro: 1.777 ± 0.54
SerGln: 2.765 ± 0.574
SerArg: 4.344 ± 0.904
SerSer: 2.172 ± 0.514
SerThr: 4.542 ± 1.032
SerVal: 5.332 ± 1.374
SerTrp: 0.197 ± 0.195
SerTyr: 1.382 ± 0.48
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.555 ± 0.678
ThrCys: 2.567 ± 0.972
ThrAsp: 2.962 ± 1.056
ThrGlu: 3.16 ± 0.89
ThrPhe: 1.58 ± 0.38
ThrGly: 8.294 ± 1.636
ThrHis: 0.79 ± 0.306
ThrIle: 2.37 ± 0.445
ThrLys: 5.924 ± 1.238
ThrLeu: 5.727 ± 1.186
ThrMet: 0.592 ± 0.306
ThrAsn: 3.949 ± 0.717
ThrPro: 3.752 ± 1.195
ThrGln: 2.172 ± 0.73
ThrArg: 1.58 ± 0.432
ThrSer: 4.937 ± 0.913
ThrThr: 1.58 ± 0.443
ThrVal: 4.739 ± 0.913
ThrTrp: 1.777 ± 0.504
ThrTyr: 1.382 ± 0.452
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.172 ± 0.835
ValCys: 1.777 ± 0.688
ValAsp: 4.542 ± 0.837
ValGlu: 2.765 ± 0.616
ValPhe: 5.924 ± 0.919
ValGly: 3.555 ± 0.571
ValHis: 1.382 ± 0.533
ValIle: 1.185 ± 0.492
ValLys: 2.962 ± 0.891
ValLeu: 6.912 ± 1.023
ValMet: 1.975 ± 0.681
ValAsn: 5.332 ± 1.051
ValPro: 2.567 ± 0.564
ValGln: 2.765 ± 0.603
ValArg: 3.949 ± 0.875
ValSer: 3.16 ± 0.698
ValThr: 4.344 ± 0.558
ValVal: 2.37 ± 0.991
ValTrp: 0.592 ± 0.31
ValTyr: 0.987 ± 0.444
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.79 ± 0.335
TrpCys: 0.0 ± 0.0
TrpAsp: 0.987 ± 0.501
TrpGlu: 1.185 ± 0.474
TrpPhe: 0.79 ± 0.358
TrpGly: 0.592 ± 0.323
TrpHis: 0.0 ± 0.0
TrpIle: 0.592 ± 0.359
TrpLys: 0.395 ± 0.243
TrpLeu: 1.777 ± 0.416
TrpMet: 0.592 ± 0.31
TrpAsn: 1.185 ± 0.439
TrpPro: 0.592 ± 0.386
TrpGln: 0.0 ± 0.0
TrpArg: 1.58 ± 0.632
TrpSer: 0.79 ± 0.38
TrpThr: 0.592 ± 0.323
TrpVal: 0.987 ± 0.444
TrpTrp: 0.197 ± 0.188
TrpTyr: 0.987 ± 0.365
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.357 ± 0.886
TyrCys: 0.197 ± 0.195
TyrAsp: 3.357 ± 0.795
TyrGlu: 2.172 ± 0.795
TyrPhe: 1.382 ± 0.46
TyrGly: 3.752 ± 1.061
TyrHis: 0.197 ± 0.188
TyrIle: 0.987 ± 0.463
TyrLys: 2.962 ± 0.746
TyrLeu: 2.172 ± 0.516
TyrMet: 0.395 ± 0.279
TyrAsn: 1.185 ± 0.621
TyrPro: 1.185 ± 0.543
TyrGln: 2.962 ± 0.935
TyrArg: 1.777 ± 0.675
TyrSer: 2.765 ± 0.563
TyrThr: 2.172 ± 0.747
TyrVal: 2.172 ± 0.512
TyrTrp: 0.592 ± 0.31
TyrTyr: 0.987 ± 0.482
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 28 proteins (5065 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
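A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions, not the database's actual procedure: the function names are hypothetical, whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the sample standard deviation across replicates is taken as the error.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides in a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Estimate the error (in per mille) of one dipeptide frequency.

    Assumption: proteins are resampled with replacement (same number of
    proteins per replicate) and the error is the standard deviation of
    the replicate frequencies.
    """
    rng = random.Random(seed)  # fixed seed only for reproducibility
    per_protein = [dipeptide_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)
```

Resampling whole proteins rather than individual dipeptides keeps the positions within a protein together, so the error reflects variation between proteins in the proteome.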

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski