Amino acid dipeptide frequency for Vibrio phage K05K4_VK05K4_2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.
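As a rough illustration of what such a calculation might look like, the Python sketch below counts overlapping adjacent residue pairs in one-letter protein sequences and reports them in per mille. The function name dipeptide_permille, the mapping of unknown residues to "X", and the choice to count pairs only within each protein are assumptions made for this example; the page does not state how the database handles these details.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for one-letter sequences.

    Hypothetical helper: pairs are counted within each protein only,
    and any non-standard residue is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        residues = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        # Overlapping adjacent pairs: positions (0,1), (1,2), ...
        counts.update(a + b for a, b in zip(residues, residues[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not data from this proteome.
freqs = dipeptide_permille(["MKKLA", "MKV"])
```

Dipeptides that never occur are simply absent from the returned dictionary, which corresponds to a frequency of 0.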

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with all 20 amino acids equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
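The expectation above is simple arithmetic, and a minimal Python sketch can make the comparison concrete. The helper names expected_permille and label are hypothetical, and the plain greater-than or less-than rule is an assumption; the page does not say whether its colouring uses the 400 or the 441 baseline, or whether it takes the error into account.

```python
def expected_permille(n_residue_types=20):
    """Uniform expectation in per mille: 1000 / (number of ordered pairs)."""
    return 1000.0 / n_residue_types ** 2


def label(freq_permille, expected):
    """Classify a dipeptide frequency against the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


baseline = expected_permille(20)          # 400 possible dipeptides
ala_leu = label(8.275, baseline)          # AlaLeu value from the table
cys_cys = label(0.0, baseline)            # CysCys value from the table
```

With the 20-residue baseline, AlaLeu (8.275‰) falls above the expectation and CysCys (0.0‰) below it.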

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Residue order (entries are grouped by first residue; the second residue runs in the same order): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Entry format: dipeptide: frequency ± error (‰)

Ala
AlaAla: 5.516 ± 1.471
AlaCys: 0.919 ± 0.437
AlaAsp: 2.452 ± 0.711
AlaGlu: 3.371 ± 0.865
AlaPhe: 1.226 ± 0.629
AlaGly: 3.065 ± 0.682
AlaHis: 1.532 ± 0.557
AlaIle: 4.291 ± 0.905
AlaLys: 6.742 ± 1.355
AlaLeu: 8.275 ± 1.372
AlaMet: 2.758 ± 0.885
AlaAsn: 2.758 ± 0.954
AlaPro: 2.145 ± 0.822
AlaGln: 2.758 ± 1.009
AlaArg: 3.065 ± 0.791
AlaSer: 3.984 ± 0.917
AlaThr: 2.452 ± 0.935
AlaVal: 6.129 ± 1.29
AlaTrp: 1.532 ± 0.555
AlaTyr: 3.984 ± 1.088
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.145 ± 0.891
CysCys: 0.0 ± 0.0
CysAsp: 1.226 ± 0.559
CysGlu: 2.145 ± 0.966
CysPhe: 1.226 ± 0.576
CysGly: 2.145 ± 0.66
CysHis: 2.145 ± 0.555
CysIle: 0.919 ± 0.363
CysLys: 1.226 ± 0.414
CysLeu: 1.839 ± 0.715
CysMet: 0.613 ± 0.332
CysAsn: 0.613 ± 0.508
CysPro: 0.613 ± 0.332
CysGln: 0.613 ± 0.332
CysArg: 0.306 ± 0.238
CysSer: 0.306 ± 0.312
CysThr: 2.145 ± 0.795
CysVal: 0.306 ± 0.286
CysTrp: 0.0 ± 0.0
CysTyr: 0.919 ± 0.358
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.452 ± 0.623
AspCys: 1.226 ± 0.706
AspAsp: 5.21 ± 1.363
AspGlu: 3.371 ± 0.75
AspPhe: 1.839 ± 0.631
AspGly: 7.662 ± 1.448
AspHis: 0.613 ± 0.384
AspIle: 4.903 ± 1.128
AspLys: 1.532 ± 0.783
AspLeu: 6.129 ± 1.426
AspMet: 2.452 ± 0.522
AspAsn: 2.145 ± 0.835
AspPro: 3.678 ± 1.257
AspGln: 0.306 ± 0.296
AspArg: 0.919 ± 0.492
AspSer: 3.065 ± 0.961
AspThr: 7.049 ± 1.739
AspVal: 4.903 ± 1.466
AspTrp: 1.839 ± 0.643
AspTyr: 3.065 ± 0.862
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.903 ± 1.261
GluCys: 1.226 ± 0.807
GluAsp: 3.678 ± 0.926
GluGlu: 2.145 ± 0.574
GluPhe: 1.839 ± 0.572
GluGly: 1.839 ± 0.796
GluHis: 0.306 ± 0.315
GluIle: 3.371 ± 0.948
GluLys: 3.371 ± 0.988
GluLeu: 5.516 ± 1.233
GluMet: 0.919 ± 0.537
GluAsn: 3.984 ± 1.195
GluPro: 1.839 ± 0.677
GluGln: 5.21 ± 1.702
GluArg: 1.839 ± 0.554
GluSer: 3.984 ± 1.044
GluThr: 1.226 ± 0.656
GluVal: 2.758 ± 0.756
GluTrp: 1.839 ± 0.8
GluTyr: 2.145 ± 0.574
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.371 ± 1.064
PheCys: 0.306 ± 0.238
PheAsp: 3.678 ± 0.89
PheGlu: 3.065 ± 0.751
PhePhe: 2.145 ± 0.654
PheGly: 3.371 ± 1.201
PheHis: 1.226 ± 0.54
PheIle: 0.919 ± 0.415
PheLys: 1.839 ± 0.783
PheLeu: 2.758 ± 0.841
PheMet: 0.613 ± 0.401
PheAsn: 2.758 ± 0.568
PhePro: 1.226 ± 0.548
PheGln: 1.226 ± 0.509
PheArg: 2.145 ± 1.0
PheSer: 2.758 ± 0.859
PheThr: 3.678 ± 0.753
PheVal: 2.758 ± 0.637
PheTrp: 0.919 ± 0.422
PheTyr: 2.452 ± 1.079
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.371 ± 0.986
GlyCys: 2.145 ± 0.679
GlyAsp: 3.984 ± 1.249
GlyGlu: 2.758 ± 0.708
GlyPhe: 3.371 ± 1.034
GlyGly: 3.678 ± 1.02
GlyHis: 0.306 ± 0.294
GlyIle: 4.597 ± 1.015
GlyLys: 4.291 ± 1.023
GlyLeu: 7.355 ± 1.332
GlyMet: 1.839 ± 0.704
GlyAsn: 1.532 ± 0.648
GlyPro: 1.532 ± 0.547
GlyGln: 3.371 ± 0.671
GlyArg: 3.371 ± 1.044
GlySer: 4.597 ± 0.921
GlyThr: 4.291 ± 0.795
GlyVal: 7.049 ± 1.612
GlyTrp: 0.0 ± 0.0
GlyTyr: 3.065 ± 0.971
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.226 ± 0.582
HisCys: 0.613 ± 0.476
HisAsp: 1.532 ± 0.627
HisGlu: 1.532 ± 0.783
HisPhe: 0.919 ± 0.456
HisGly: 0.613 ± 0.332
HisHis: 0.919 ± 0.545
HisIle: 2.452 ± 0.894
HisLys: 0.919 ± 0.516
HisLeu: 1.226 ± 0.638
HisMet: 0.919 ± 0.424
HisAsn: 0.613 ± 0.32
HisPro: 0.306 ± 0.264
HisGln: 0.306 ± 0.296
HisArg: 1.532 ± 0.72
HisSer: 1.226 ± 0.46
HisThr: 0.306 ± 0.294
HisVal: 0.306 ± 0.238
HisTrp: 0.306 ± 0.238
HisTyr: 1.839 ± 0.654
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.436 ± 1.606
IleCys: 0.919 ± 0.358
IleAsp: 4.291 ± 0.853
IleGlu: 4.291 ± 0.931
IlePhe: 0.919 ± 0.497
IleGly: 2.758 ± 0.836
IleHis: 1.532 ± 0.467
IleIle: 3.371 ± 0.93
IleLys: 5.21 ± 1.319
IleLeu: 5.516 ± 0.974
IleMet: 0.919 ± 0.412
IleAsn: 2.452 ± 0.82
IlePro: 3.371 ± 0.996
IleGln: 2.145 ± 0.725
IleArg: 2.758 ± 0.609
IleSer: 4.597 ± 1.274
IleThr: 3.984 ± 1.202
IleVal: 1.532 ± 0.561
IleTrp: 0.306 ± 0.286
IleTyr: 3.065 ± 0.819
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.516 ± 0.843
LysCys: 0.613 ± 0.4
LysAsp: 3.984 ± 0.843
LysGlu: 0.919 ± 0.445
LysPhe: 2.452 ± 0.951
LysGly: 3.065 ± 0.923
LysHis: 2.452 ± 0.608
LysIle: 3.371 ± 0.881
LysLys: 6.436 ± 1.337
LysLeu: 4.291 ± 1.12
LysMet: 2.145 ± 1.087
LysAsn: 3.371 ± 0.745
LysPro: 1.532 ± 0.677
LysGln: 2.758 ± 0.752
LysArg: 3.984 ± 1.17
LysSer: 6.742 ± 1.063
LysThr: 1.839 ± 0.577
LysVal: 2.758 ± 1.155
LysTrp: 0.919 ± 0.45
LysTyr: 0.919 ± 0.494
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.129 ± 1.183
LeuCys: 2.145 ± 0.693
LeuAsp: 3.678 ± 1.014
LeuGlu: 6.436 ± 0.987
LeuPhe: 3.371 ± 0.991
LeuGly: 7.355 ± 1.236
LeuHis: 1.226 ± 0.767
LeuIle: 5.21 ± 0.992
LeuLys: 5.823 ± 0.878
LeuLeu: 8.275 ± 1.806
LeuMet: 3.065 ± 1.251
LeuAsn: 6.129 ± 1.212
LeuPro: 3.984 ± 0.946
LeuGln: 3.065 ± 0.724
LeuArg: 3.065 ± 0.947
LeuSer: 7.662 ± 1.51
LeuThr: 7.049 ± 2.228
LeuVal: 7.049 ± 1.39
LeuTrp: 0.613 ± 0.417
LeuTyr: 1.532 ± 0.473
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.065 ± 1.238
MetCys: 0.0 ± 0.0
MetAsp: 2.145 ± 0.819
MetGlu: 0.0 ± 0.0
MetPhe: 1.226 ± 0.477
MetGly: 0.306 ± 0.286
MetHis: 0.613 ± 0.472
MetIle: 1.226 ± 0.582
MetLys: 1.532 ± 0.831
MetLeu: 3.065 ± 1.017
MetMet: 0.306 ± 0.304
MetAsn: 2.758 ± 1.211
MetPro: 1.226 ± 0.528
MetGln: 0.306 ± 0.294
MetArg: 1.532 ± 0.647
MetSer: 2.145 ± 0.85
MetThr: 2.758 ± 0.893
MetVal: 3.065 ± 0.756
MetTrp: 0.613 ± 0.417
MetTyr: 1.226 ± 0.612
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.291 ± 1.386
AsnCys: 0.0 ± 0.0
AsnAsp: 2.452 ± 0.771
AsnGlu: 3.065 ± 1.118
AsnPhe: 1.839 ± 0.682
AsnGly: 3.065 ± 0.654
AsnHis: 0.306 ± 0.294
AsnIle: 3.678 ± 1.011
AsnLys: 4.903 ± 1.327
AsnLeu: 1.226 ± 0.729
AsnMet: 2.145 ± 0.759
AsnAsn: 0.613 ± 0.36
AsnPro: 3.065 ± 0.82
AsnGln: 4.597 ± 1.769
AsnArg: 0.306 ± 0.319
AsnSer: 2.452 ± 0.874
AsnThr: 5.823 ± 1.772
AsnVal: 1.839 ± 0.737
AsnTrp: 0.0 ± 0.0
AsnTyr: 1.226 ± 0.554
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.226 ± 0.656
ProCys: 0.0 ± 0.0
ProAsp: 6.436 ± 2.055
ProGlu: 2.452 ± 0.749
ProPhe: 3.984 ± 0.78
ProGly: 0.613 ± 0.429
ProHis: 0.613 ± 0.417
ProIle: 3.065 ± 0.567
ProLys: 0.306 ± 0.238
ProLeu: 3.065 ± 1.032
ProMet: 0.306 ± 0.315
ProAsn: 0.306 ± 0.312
ProPro: 2.452 ± 0.612
ProGln: 1.532 ± 0.631
ProArg: 1.226 ± 0.569
ProSer: 3.371 ± 0.982
ProThr: 4.291 ± 1.293
ProVal: 3.678 ± 1.174
ProTrp: 0.0 ± 0.0
ProTyr: 1.532 ± 0.886
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.291 ± 1.051
GlnCys: 1.839 ± 0.6
GlnAsp: 0.919 ± 0.514
GlnGlu: 1.532 ± 0.711
GlnPhe: 3.371 ± 0.652
GlnGly: 1.532 ± 0.555
GlnHis: 0.613 ± 0.489
GlnIle: 4.597 ± 0.853
GlnLys: 1.532 ± 0.515
GlnLeu: 4.291 ± 0.942
GlnMet: 0.306 ± 0.294
GlnAsn: 2.145 ± 0.626
GlnPro: 1.226 ± 0.635
GlnGln: 3.065 ± 0.987
GlnArg: 1.532 ± 0.584
GlnSer: 3.984 ± 1.349
GlnThr: 0.919 ± 0.358
GlnVal: 1.226 ± 0.355
GlnTrp: 1.839 ± 0.89
GlnTyr: 2.452 ± 0.825
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.452 ± 1.005
ArgCys: 2.758 ± 0.948
ArgAsp: 1.839 ± 0.843
ArgGlu: 2.452 ± 0.986
ArgPhe: 0.919 ± 0.442
ArgGly: 2.452 ± 0.709
ArgHis: 1.226 ± 0.544
ArgIle: 2.758 ± 0.689
ArgLys: 2.452 ± 0.85
ArgLeu: 5.823 ± 1.639
ArgMet: 1.532 ± 0.841
ArgAsn: 1.532 ± 0.54
ArgPro: 2.145 ± 0.758
ArgGln: 0.306 ± 0.238
ArgArg: 4.291 ± 1.315
ArgSer: 2.452 ± 1.199
ArgThr: 2.145 ± 0.769
ArgVal: 2.145 ± 0.857
ArgTrp: 0.306 ± 0.286
ArgTyr: 1.839 ± 0.62
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.758 ± 0.957
SerCys: 1.532 ± 0.561
SerAsp: 3.984 ± 0.883
SerGlu: 4.597 ± 1.375
SerPhe: 1.839 ± 1.036
SerGly: 7.049 ± 1.215
SerHis: 0.919 ± 0.525
SerIle: 5.516 ± 1.184
SerLys: 0.919 ± 0.412
SerLeu: 7.968 ± 1.754
SerMet: 4.291 ± 1.306
SerAsn: 2.452 ± 0.949
SerPro: 1.839 ± 0.849
SerGln: 3.371 ± 0.805
SerArg: 3.984 ± 1.082
SerSer: 2.452 ± 0.791
SerThr: 4.291 ± 0.975
SerVal: 4.903 ± 1.399
SerTrp: 0.613 ± 0.423
SerTyr: 1.532 ± 0.708
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.065 ± 0.895
ThrCys: 3.065 ± 0.921
ThrAsp: 3.371 ± 1.231
ThrGlu: 3.371 ± 1.245
ThrPhe: 1.532 ± 0.476
ThrGly: 8.275 ± 1.655
ThrHis: 0.613 ± 0.376
ThrIle: 2.452 ± 0.601
ThrLys: 4.291 ± 1.123
ThrLeu: 6.742 ± 1.275
ThrMet: 0.613 ± 0.436
ThrAsn: 3.678 ± 0.857
ThrPro: 4.291 ± 1.212
ThrGln: 2.452 ± 0.707
ThrArg: 1.532 ± 0.445
ThrSer: 4.291 ± 0.769
ThrThr: 1.532 ± 0.506
ThrVal: 4.903 ± 1.394
ThrTrp: 1.532 ± 0.485
ThrTyr: 1.532 ± 0.65
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.758 ± 1.221
ValCys: 1.839 ± 0.711
ValAsp: 5.21 ± 1.0
ValGlu: 2.758 ± 0.838
ValPhe: 6.742 ± 1.037
ValGly: 3.678 ± 0.77
ValHis: 2.145 ± 0.668
ValIle: 1.839 ± 0.807
ValLys: 3.371 ± 1.156
ValLeu: 5.21 ± 0.921
ValMet: 1.532 ± 0.61
ValAsn: 5.823 ± 1.323
ValPro: 2.145 ± 0.663
ValGln: 2.452 ± 0.88
ValArg: 3.984 ± 0.931
ValSer: 3.065 ± 1.12
ValThr: 4.597 ± 0.519
ValVal: 2.145 ± 1.122
ValTrp: 0.306 ± 0.238
ValTyr: 1.532 ± 0.54
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.919 ± 0.575
TrpCys: 0.0 ± 0.0
TrpAsp: 0.919 ± 0.594
TrpGlu: 1.226 ± 0.571
TrpPhe: 0.919 ± 0.432
TrpGly: 0.306 ± 0.261
TrpHis: 0.0 ± 0.0
TrpIle: 0.613 ± 0.405
TrpLys: 0.306 ± 0.286
TrpLeu: 1.839 ± 0.601
TrpMet: 0.613 ± 0.326
TrpAsn: 0.613 ± 0.344
TrpPro: 0.613 ± 0.571
TrpGln: 0.306 ± 0.261
TrpArg: 1.226 ± 0.544
TrpSer: 0.613 ± 0.376
TrpThr: 0.613 ± 0.332
TrpVal: 1.226 ± 0.457
TrpTrp: 0.306 ± 0.286
TrpTyr: 0.919 ± 0.391
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.065 ± 1.135
TyrCys: 0.306 ± 0.386
TyrAsp: 3.371 ± 0.95
TyrGlu: 2.452 ± 0.853
TyrPhe: 1.532 ± 0.501
TyrGly: 3.678 ± 1.201
TyrHis: 0.306 ± 0.286
TyrIle: 1.226 ± 0.701
TyrLys: 3.065 ± 1.033
TyrLeu: 2.452 ± 0.818
TyrMet: 0.919 ± 0.504
TyrAsn: 0.613 ± 0.476
TyrPro: 1.226 ± 0.536
TyrGln: 3.065 ± 0.909
TyrArg: 1.226 ± 0.656
TyrSer: 3.371 ± 0.704
TyrThr: 2.145 ± 0.708
TyrVal: 2.452 ± 0.671
TyrTrp: 0.306 ± 0.238
TyrTyr: 1.226 ± 0.648
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 20 proteins (3264 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
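A protein-level bootstrap of this kind can be sketched in Python as follows. Whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting estimates is taken as the error. The names pair_permille and bootstrap_error, the fixed seed, and the use of the sample standard deviation as the error measure are assumptions for illustration; the page does not specify which spread statistic is reported.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy input, not data from this proteome.
toy_proteins = ["MKKLA", "MKV", "AAKK"]
kk_error = bootstrap_error(toy_proteins, "KK")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins in the proteome.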

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: UniProt
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski