Amino acid dipeptide frequency for Vibrio phage KSF1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, dipeptides that occur more often than expected are marked in red in the table, and underrepresented ones are marked in blue, for better visibility.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
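The calculation described above can be sketched in a few lines of Python. This is only an illustration, not the code used to build the database: the function name `dipeptide_per_mille`, the toy sequences, and the choice to count overlapping pairs within each protein (never across protein boundaries) are assumptions, and the uniform 2.5‰ baseline is taken as the threshold for "more than expected".

```python
from collections import Counter
from itertools import product

# One-letter to three-letter codes, in the order used by the table.
# "X" stands for Xaa (any non-standard or unknown residue).
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe", "G": "Gly",
    "H": "His", "I": "Ile", "K": "Lys", "L": "Leu", "M": "Met", "N": "Asn",
    "P": "Pro", "Q": "Gln", "R": "Arg", "S": "Ser", "T": "Thr", "V": "Val",
    "W": "Trp", "Y": "Tyr", "X": "Xaa",
}


def dipeptide_per_mille(proteins):
    """Return all 441 dipeptide frequencies in per mille (‰)."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(c if c in ONE_TO_THREE else "X" for c in seq.upper())
        # Count overlapping adjacent pairs within this protein only (assumption).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        ONE_TO_THREE[a] + ONE_TO_THREE[b]:
            (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ONE_TO_THREE, repeat=2)
    }


# Uniform expectation over the 400 standard dipeptides: 0.25% = 2.5‰.
EXPECTED_UNIFORM = 1000.0 / 400

# Toy input, not the Vibrio phage KSF1 proteome.
toy = ["MKLAAL", "ALAG"]
freqs = dipeptide_per_mille(toy)
overrepresented = sorted(k for k, v in freqs.items() if v > EXPECTED_UNIFORM)
print(freqs["AlaLeu"], overrepresented)
```

Multiplying any returned value by 10⁻³ gives the plain fraction, so the values over all 441 keys sum to 1000‰.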

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰), listed as FirstSecond: frequency ± error and grouped by the first residue.
Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 4.065 ± 2.209
AlaCys: 1.161 ± 0.913
AlaAsp: 2.323 ± 0.801
AlaGlu: 4.065 ± 0.783
AlaPhe: 7.549 ± 2.576
AlaGly: 2.904 ± 1.002
AlaHis: 1.742 ± 0.535
AlaIle: 5.226 ± 1.853
AlaLys: 2.323 ± 0.91
AlaLeu: 12.776 ± 3.408
AlaMet: 0.581 ± 0.451
AlaAsn: 5.226 ± 1.028
AlaPro: 2.904 ± 0.937
AlaGln: 4.065 ± 1.137
AlaArg: 1.742 ± 0.868
AlaSer: 4.646 ± 1.684
AlaThr: 3.484 ± 1.066
AlaVal: 3.484 ± 1.034
AlaTrp: 1.161 ± 0.755
AlaTyr: 5.807 ± 1.731
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 1.161 ± 0.852
CysCys: 0.0 ± 0.0
CysAsp: 0.581 ± 0.451
CysGlu: 0.581 ± 0.457
CysPhe: 1.161 ± 0.589
CysGly: 1.161 ± 0.903
CysHis: 0.581 ± 0.474
CysIle: 0.0 ± 0.0
CysLys: 1.742 ± 0.865
CysLeu: 1.742 ± 0.825
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 1.161 ± 0.52
CysGln: 1.161 ± 0.553
CysArg: 0.581 ± 0.474
CysSer: 0.581 ± 0.451
CysThr: 1.161 ± 0.903
CysVal: 0.581 ± 0.474
CysTrp: 0.581 ± 0.457
CysTyr: 1.161 ± 0.679
CysXaa: 0.0 ± 0.0

Asp
AspAla: 3.484 ± 1.363
AspCys: 0.581 ± 0.457
AspAsp: 3.484 ± 1.065
AspGlu: 2.323 ± 1.09
AspPhe: 2.323 ± 0.957
AspGly: 1.742 ± 0.923
AspHis: 0.581 ± 0.451
AspIle: 4.646 ± 1.437
AspLys: 2.323 ± 0.801
AspLeu: 4.646 ± 2.242
AspMet: 2.904 ± 0.801
AspAsn: 2.904 ± 1.547
AspPro: 2.904 ± 1.29
AspGln: 1.161 ± 0.848
AspArg: 1.161 ± 0.52
AspSer: 5.226 ± 1.847
AspThr: 4.646 ± 0.995
AspVal: 1.161 ± 0.946
AspTrp: 0.581 ± 0.457
AspTyr: 3.484 ± 1.376
AspXaa: 0.0 ± 0.0

Glu
GluAla: 4.646 ± 1.454
GluCys: 2.323 ± 1.04
GluAsp: 0.581 ± 0.768
GluGlu: 1.161 ± 0.52
GluPhe: 1.742 ± 0.822
GluGly: 2.323 ± 0.801
GluHis: 0.0 ± 0.0
GluIle: 1.742 ± 1.21
GluLys: 3.484 ± 1.003
GluLeu: 4.065 ± 1.483
GluMet: 1.742 ± 1.226
GluAsn: 1.742 ± 0.816
GluPro: 1.161 ± 0.754
GluGln: 2.323 ± 1.39
GluArg: 1.161 ± 1.015
GluSer: 1.742 ± 1.293
GluThr: 2.904 ± 1.456
GluVal: 1.742 ± 1.128
GluTrp: 0.0 ± 0.0
GluTyr: 2.323 ± 1.485
GluXaa: 0.0 ± 0.0

Phe
PheAla: 5.226 ± 1.822
PheCys: 0.581 ± 0.457
PheAsp: 2.323 ± 1.272
PheGlu: 1.742 ± 0.714
PhePhe: 4.646 ± 1.925
PheGly: 2.904 ± 0.947
PheHis: 1.161 ± 0.52
PheIle: 5.807 ± 1.716
PheLys: 3.484 ± 2.152
PheLeu: 6.969 ± 2.743
PheMet: 3.484 ± 1.02
PheAsn: 5.226 ± 2.78
PhePro: 3.484 ± 1.222
PheGln: 0.581 ± 0.457
PheArg: 1.742 ± 0.86
PheSer: 2.323 ± 1.286
PheThr: 2.323 ± 1.33
PheVal: 5.226 ± 1.414
PheTrp: 0.0 ± 0.0
PheTyr: 2.904 ± 1.794
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 5.807 ± 1.355
GlyCys: 0.581 ± 0.451
GlyAsp: 2.904 ± 0.947
GlyGlu: 2.323 ± 1.147
GlyPhe: 4.646 ± 1.739
GlyGly: 3.484 ± 1.329
GlyHis: 1.161 ± 0.913
GlyIle: 6.969 ± 1.181
GlyLys: 1.161 ± 0.679
GlyLeu: 5.807 ± 1.748
GlyMet: 1.161 ± 0.589
GlyAsn: 4.065 ± 1.945
GlyPro: 1.161 ± 0.829
GlyGln: 5.226 ± 1.001
GlyArg: 4.646 ± 1.857
GlySer: 4.646 ± 1.349
GlyThr: 2.904 ± 0.867
GlyVal: 4.065 ± 1.964
GlyTrp: 1.161 ± 0.882
GlyTyr: 1.742 ± 0.535
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.0 ± 0.0
HisCys: 0.581 ± 0.474
HisAsp: 1.161 ± 0.829
HisGlu: 0.581 ± 0.451
HisPhe: 1.161 ± 0.553
HisGly: 1.742 ± 0.891
HisHis: 0.581 ± 0.451
HisIle: 0.0 ± 0.0
HisLys: 2.323 ± 1.272
HisLeu: 2.904 ± 0.942
HisMet: 1.742 ± 1.184
HisAsn: 0.581 ± 0.474
HisPro: 1.161 ± 0.52
HisGln: 0.581 ± 0.457
HisArg: 2.323 ± 1.04
HisSer: 0.581 ± 0.886
HisThr: 0.581 ± 0.457
HisVal: 0.0 ± 0.0
HisTrp: 0.581 ± 0.451
HisTyr: 1.161 ± 0.913
HisXaa: 0.0 ± 0.0

Ile
IleAla: 5.226 ± 1.675
IleCys: 0.581 ± 0.451
IleAsp: 3.484 ± 1.065
IleGlu: 6.388 ± 1.737
IlePhe: 2.904 ± 1.523
IleGly: 5.807 ± 1.64
IleHis: 0.581 ± 0.457
IleIle: 1.161 ± 0.679
IleLys: 2.904 ± 1.484
IleLeu: 5.807 ± 1.448
IleMet: 1.742 ± 1.532
IleAsn: 2.904 ± 0.942
IlePro: 4.646 ± 1.395
IleGln: 0.581 ± 0.474
IleArg: 4.646 ± 1.772
IleSer: 4.065 ± 2.078
IleThr: 5.807 ± 0.957
IleVal: 2.904 ± 1.263
IleTrp: 0.581 ± 0.451
IleTyr: 4.065 ± 1.56
IleXaa: 0.0 ± 0.0

Lys
LysAla: 3.484 ± 1.276
LysCys: 1.161 ± 0.903
LysAsp: 2.323 ± 1.06
LysGlu: 1.161 ± 0.52
LysPhe: 2.323 ± 1.459
LysGly: 4.065 ± 1.596
LysHis: 1.161 ± 0.903
LysIle: 3.484 ± 1.694
LysLys: 4.065 ± 2.194
LysLeu: 5.226 ± 1.062
LysMet: 1.742 ± 0.998
LysAsn: 5.807 ± 1.558
LysPro: 2.323 ± 1.106
LysGln: 1.161 ± 0.742
LysArg: 2.904 ± 1.022
LysSer: 5.807 ± 2.398
LysThr: 6.388 ± 2.313
LysVal: 2.323 ± 0.821
LysTrp: 0.0 ± 0.0
LysTyr: 0.581 ± 0.451
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 9.292 ± 3.235
LeuCys: 0.0 ± 0.0
LeuAsp: 8.13 ± 1.742
LeuGlu: 5.226 ± 1.107
LeuPhe: 6.388 ± 1.79
LeuGly: 8.13 ± 1.461
LeuHis: 1.161 ± 0.903
LeuIle: 6.388 ± 1.297
LeuLys: 6.388 ± 1.126
LeuLeu: 8.711 ± 2.524
LeuMet: 1.742 ± 1.012
LeuAsn: 5.226 ± 2.706
LeuPro: 4.065 ± 1.672
LeuGln: 2.323 ± 0.914
LeuArg: 5.226 ± 1.631
LeuSer: 7.549 ± 2.077
LeuThr: 2.323 ± 1.403
LeuVal: 4.646 ± 1.401
LeuTrp: 1.742 ± 0.701
LeuTyr: 2.323 ± 1.311
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.904 ± 0.841
MetCys: 0.581 ± 0.869
MetAsp: 1.161 ± 0.553
MetGlu: 2.323 ± 1.005
MetPhe: 0.0 ± 0.0
MetGly: 2.323 ± 1.147
MetHis: 0.0 ± 0.0
MetIle: 1.161 ± 1.117
MetLys: 0.581 ± 0.768
MetLeu: 4.646 ± 1.243
MetMet: 0.0 ± 0.0
MetAsn: 0.581 ± 0.819
MetPro: 2.323 ± 1.178
MetGln: 1.161 ± 0.788
MetArg: 1.742 ± 0.856
MetSer: 3.484 ± 1.435
MetThr: 3.484 ± 1.456
MetVal: 2.323 ± 1.752
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 6.388 ± 2.744
AsnCys: 1.161 ± 1.113
AsnAsp: 3.484 ± 1.552
AsnGlu: 1.742 ± 0.966
AsnPhe: 3.484 ± 2.119
AsnGly: 4.646 ± 1.271
AsnHis: 1.742 ± 1.422
AsnIle: 3.484 ± 1.488
AsnLys: 4.646 ± 1.786
AsnLeu: 2.904 ± 1.393
AsnMet: 2.323 ± 1.154
AsnAsn: 1.161 ± 0.589
AsnPro: 2.323 ± 0.913
AsnGln: 0.581 ± 0.457
AsnArg: 0.581 ± 0.474
AsnSer: 2.904 ± 2.04
AsnThr: 5.807 ± 2.346
AsnVal: 4.646 ± 1.864
AsnTrp: 0.581 ± 0.457
AsnTyr: 0.581 ± 0.819
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.904 ± 1.342
ProCys: 1.161 ± 0.818
ProAsp: 6.388 ± 1.54
ProGlu: 0.581 ± 0.457
ProPhe: 2.323 ± 1.116
ProGly: 1.161 ± 0.857
ProHis: 2.323 ± 1.45
ProIle: 2.323 ± 1.273
ProLys: 2.323 ± 0.911
ProLeu: 2.904 ± 1.801
ProMet: 1.742 ± 0.535
ProAsn: 2.904 ± 1.254
ProPro: 1.742 ± 0.868
ProGln: 3.484 ± 1.441
ProArg: 0.581 ± 0.451
ProSer: 3.484 ± 1.003
ProThr: 1.742 ± 0.895
ProVal: 2.323 ± 1.399
ProTrp: 1.742 ± 1.089
ProTyr: 0.581 ± 0.474
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 5.226 ± 1.552
GlnCys: 1.161 ± 0.589
GlnAsp: 1.161 ± 0.52
GlnGlu: 0.581 ± 0.869
GlnPhe: 1.161 ± 0.788
GlnGly: 2.904 ± 1.704
GlnHis: 0.581 ± 0.457
GlnIle: 4.065 ± 1.053
GlnLys: 1.742 ± 0.535
GlnLeu: 3.484 ± 1.771
GlnMet: 1.742 ± 0.535
GlnAsn: 1.742 ± 0.701
GlnPro: 1.742 ± 1.123
GlnGln: 0.581 ± 0.474
GlnArg: 0.581 ± 0.768
GlnSer: 1.742 ± 1.37
GlnThr: 2.904 ± 1.746
GlnVal: 1.742 ± 0.86
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.581 ± 0.697
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 1.742 ± 1.016
ArgCys: 0.0 ± 0.0
ArgAsp: 1.161 ± 0.913
ArgGlu: 1.161 ± 0.88
ArgPhe: 2.323 ± 0.685
ArgGly: 2.904 ± 1.296
ArgHis: 2.904 ± 1.356
ArgIle: 5.226 ± 1.688
ArgLys: 2.323 ± 1.618
ArgLeu: 4.646 ± 1.484
ArgMet: 1.742 ± 0.923
ArgAsn: 2.323 ± 1.03
ArgPro: 2.904 ± 1.062
ArgGln: 2.323 ± 0.852
ArgArg: 3.484 ± 1.718
ArgSer: 1.742 ± 1.012
ArgThr: 2.323 ± 0.867
ArgVal: 0.581 ± 0.457
ArgTrp: 0.581 ± 0.474
ArgTyr: 2.323 ± 0.963
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 4.646 ± 1.524
SerCys: 0.0 ± 0.0
SerAsp: 4.065 ± 1.606
SerGlu: 1.742 ± 1.416
SerPhe: 3.484 ± 1.077
SerGly: 6.388 ± 2.478
SerHis: 1.742 ± 1.117
SerIle: 2.904 ± 0.818
SerLys: 4.065 ± 1.45
SerLeu: 3.484 ± 1.544
SerMet: 2.323 ± 1.394
SerAsn: 4.065 ± 1.786
SerPro: 2.904 ± 1.189
SerGln: 4.065 ± 1.651
SerArg: 2.323 ± 0.821
SerSer: 4.065 ± 1.223
SerThr: 5.226 ± 1.519
SerVal: 3.484 ± 1.258
SerTrp: 0.581 ± 0.474
SerTyr: 3.484 ± 1.117
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 2.323 ± 1.455
ThrCys: 2.323 ± 1.359
ThrAsp: 1.742 ± 0.829
ThrGlu: 1.742 ± 1.052
ThrPhe: 5.807 ± 1.821
ThrGly: 6.969 ± 2.165
ThrHis: 1.161 ± 1.198
ThrIle: 4.065 ± 1.394
ThrLys: 5.807 ± 2.065
ThrLeu: 6.969 ± 1.834
ThrMet: 1.742 ± 1.23
ThrAsn: 1.742 ± 0.923
ThrPro: 1.161 ± 0.52
ThrGln: 1.161 ± 0.903
ThrArg: 2.904 ± 0.924
ThrSer: 4.065 ± 1.6
ThrThr: 4.065 ± 1.713
ThrVal: 6.388 ± 1.441
ThrTrp: 0.0 ± 0.0
ThrTyr: 2.323 ± 1.064
ThrXaa: 0.0 ± 0.0

Val
ValAla: 5.807 ± 2.393
ValCys: 1.161 ± 0.589
ValAsp: 2.323 ± 1.493
ValGlu: 2.323 ± 1.507
ValPhe: 3.484 ± 1.302
ValGly: 1.742 ± 1.126
ValHis: 0.581 ± 0.457
ValIle: 6.969 ± 1.92
ValLys: 3.484 ± 0.872
ValLeu: 2.904 ± 1.825
ValMet: 1.161 ± 0.882
ValAsn: 2.904 ± 1.99
ValPro: 2.904 ± 1.018
ValGln: 1.161 ± 0.589
ValArg: 2.323 ± 1.347
ValSer: 3.484 ± 0.985
ValThr: 2.904 ± 1.343
ValVal: 3.484 ± 2.459
ValTrp: 1.742 ± 1.853
ValTyr: 0.581 ± 0.451
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.581 ± 0.457
TrpAsp: 1.161 ± 0.553
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.0 ± 0.0
TrpGly: 1.161 ± 0.788
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.581 ± 0.746
TrpLeu: 1.742 ± 1.261
TrpMet: 0.581 ± 0.465
TrpAsn: 0.581 ± 0.697
TrpPro: 0.581 ± 0.474
TrpGln: 0.581 ± 0.457
TrpArg: 1.161 ± 0.589
TrpSer: 0.581 ± 0.451
TrpThr: 0.0 ± 0.0
TrpVal: 0.581 ± 0.451
TrpTrp: 0.581 ± 0.451
TrpTyr: 2.323 ± 1.727
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.904 ± 0.903
TyrCys: 0.0 ± 0.0
TyrAsp: 2.323 ± 1.577
TyrGlu: 1.161 ± 0.52
TyrPhe: 5.226 ± 2.633
TyrGly: 1.161 ± 1.05
TyrHis: 0.581 ± 0.451
TyrIle: 1.161 ± 0.798
TyrLys: 1.742 ± 1.049
TyrLeu: 4.646 ± 2.165
TyrMet: 0.0 ± 0.0
TyrAsn: 3.484 ± 1.065
TyrPro: 1.161 ± 0.843
TyrGln: 1.161 ± 1.384
TyrArg: 2.904 ± 0.767
TyrSer: 2.904 ± 2.001
TyrThr: 3.484 ± 1.535
TyrVal: 1.742 ± 1.049
TyrTrp: 0.581 ± 0.451
TyrTyr: 1.742 ± 1.049
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 12 proteins (1723 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
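As a rough illustration of what bootstrapping at the protein level means, the sketch below resamples whole proteins with replacement 100 times, recomputes one dipeptide frequency each time, and reports the spread of those estimates. The function names, the fixed seed, the use of the sample standard deviation as the error measure, and the toy sequences are all assumptions; the exact procedure used by the database is not described here.

```python
import random
from statistics import stdev


def per_mille(proteins, pair):
    """Frequency (‰) of one dipeptide, given in one-letter code (e.g. "AL")."""
    hits = total = 0
    for seq in proteins:
        # Overlapping adjacent pairs, counted within each protein (assumption).
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(per_mille(sample, pair))
    return stdev(estimates)


# Toy input, not the Vibrio phage KSF1 proteome.
toy = ["MKLAAL", "ALAG", "MALLK"]
print(per_mille(toy, "AL"), bootstrap_error(toy, "AL"))
```

Because whole proteins are resampled rather than individual residues, a dipeptide concentrated in one or two proteins tends to get a larger error than one spread evenly across the proteome.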

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski