Amino acid dipepetide frequency for Vibrio phage ND1-fs1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.671AlaAla: 5.671 ± 2.152
0.0AlaCys: 0.0 ± 0.0
2.836AlaAsp: 2.836 ± 0.993
3.308AlaGlu: 3.308 ± 0.798
4.726AlaPhe: 4.726 ± 0.811
2.836AlaGly: 2.836 ± 1.281
1.89AlaHis: 1.89 ± 0.861
8.507AlaIle: 8.507 ± 2.17
5.198AlaLys: 5.198 ± 1.421
7.089AlaLeu: 7.089 ± 1.733
2.836AlaMet: 2.836 ± 1.504
2.836AlaAsn: 2.836 ± 1.335
2.363AlaPro: 2.363 ± 1.122
5.198AlaGln: 5.198 ± 1.055
1.418AlaArg: 1.418 ± 1.274
1.89AlaSer: 1.89 ± 1.053
0.945AlaThr: 0.945 ± 0.471
5.198AlaVal: 5.198 ± 1.012
0.945AlaTrp: 0.945 ± 0.625
3.781AlaTyr: 3.781 ± 1.066
0.0AlaXaa: 0.0 ± 0.0
Cys
0.473CysAla: 0.473 ± 0.375
0.0CysCys: 0.0 ± 0.0
1.418CysAsp: 1.418 ± 0.588
0.473CysGlu: 0.473 ± 0.387
2.363CysPhe: 2.363 ± 0.852
1.89CysGly: 1.89 ± 1.034
0.473CysHis: 0.473 ± 0.525
1.418CysIle: 1.418 ± 0.736
1.418CysLys: 1.418 ± 0.701
0.473CysLeu: 0.473 ± 0.503
0.473CysMet: 0.473 ± 0.387
0.0CysAsn: 0.0 ± 0.0
0.945CysPro: 0.945 ± 0.648
0.473CysGln: 0.473 ± 0.387
0.945CysArg: 0.945 ± 0.529
2.836CysSer: 2.836 ± 1.023
1.89CysThr: 1.89 ± 0.942
0.945CysVal: 0.945 ± 0.438
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
5.198AspAla: 5.198 ± 1.21
0.945AspCys: 0.945 ± 0.593
4.726AspAsp: 4.726 ± 1.265
2.836AspGlu: 2.836 ± 1.025
2.836AspPhe: 2.836 ± 0.829
3.308AspGly: 3.308 ± 1.365
1.89AspHis: 1.89 ± 0.488
3.781AspIle: 3.781 ± 1.453
1.418AspLys: 1.418 ± 1.054
5.198AspLeu: 5.198 ± 1.891
1.89AspMet: 1.89 ± 0.867
1.418AspAsn: 1.418 ± 0.956
5.671AspPro: 5.671 ± 2.322
1.418AspGln: 1.418 ± 0.411
1.89AspArg: 1.89 ± 0.938
2.363AspSer: 2.363 ± 1.108
3.781AspThr: 3.781 ± 1.33
3.308AspVal: 3.308 ± 1.895
1.418AspTrp: 1.418 ± 0.565
1.89AspTyr: 1.89 ± 0.665
0.0AspXaa: 0.0 ± 0.0
Glu
5.198GluAla: 5.198 ± 0.957
1.89GluCys: 1.89 ± 0.908
1.418GluAsp: 1.418 ± 0.736
2.363GluGlu: 2.363 ± 0.84
3.308GluPhe: 3.308 ± 1.102
0.945GluGly: 0.945 ± 0.72
1.89GluHis: 1.89 ± 0.631
1.89GluIle: 1.89 ± 1.037
3.308GluLys: 3.308 ± 0.772
4.726GluLeu: 4.726 ± 1.453
0.945GluMet: 0.945 ± 0.645
2.363GluAsn: 2.363 ± 0.876
4.726GluPro: 4.726 ± 1.627
3.308GluGln: 3.308 ± 1.357
0.0GluArg: 0.0 ± 0.0
4.253GluSer: 4.253 ± 1.556
3.781GluThr: 3.781 ± 1.727
2.363GluVal: 2.363 ± 0.926
0.945GluTrp: 0.945 ± 0.947
1.418GluTyr: 1.418 ± 0.542
0.0GluXaa: 0.0 ± 0.0
Phe
4.726PheAla: 4.726 ± 1.723
0.945PheCys: 0.945 ± 0.529
3.308PheAsp: 3.308 ± 1.038
2.836PheGlu: 2.836 ± 0.855
0.945PhePhe: 0.945 ± 0.748
4.726PheGly: 4.726 ± 1.23
0.945PheHis: 0.945 ± 0.647
2.363PheIle: 2.363 ± 0.675
1.89PheLys: 1.89 ± 1.018
3.781PheLeu: 3.781 ± 1.267
1.89PheMet: 1.89 ± 0.942
2.836PheAsn: 2.836 ± 1.349
1.418PhePro: 1.418 ± 0.678
0.473PheGln: 0.473 ± 0.375
2.363PheArg: 2.363 ± 1.356
5.671PheSer: 5.671 ± 1.305
2.363PheThr: 2.363 ± 0.618
3.308PheVal: 3.308 ± 0.929
1.418PheTrp: 1.418 ± 0.582
2.836PheTyr: 2.836 ± 1.0
0.0PheXaa: 0.0 ± 0.0
Gly
3.781GlyAla: 3.781 ± 0.986
1.418GlyCys: 1.418 ± 0.531
5.198GlyAsp: 5.198 ± 1.464
2.836GlyGlu: 2.836 ± 0.76
2.836GlyPhe: 2.836 ± 0.69
4.253GlyGly: 4.253 ± 1.434
1.418GlyHis: 1.418 ± 0.958
8.034GlyIle: 8.034 ± 1.657
1.89GlyLys: 1.89 ± 0.828
6.144GlyLeu: 6.144 ± 2.048
1.89GlyMet: 1.89 ± 0.75
1.89GlyAsn: 1.89 ± 0.942
0.473GlyPro: 0.473 ± 0.386
2.836GlyGln: 2.836 ± 1.646
1.89GlyArg: 1.89 ± 0.841
5.671GlySer: 5.671 ± 1.277
1.89GlyThr: 1.89 ± 0.793
3.308GlyVal: 3.308 ± 1.363
0.945GlyTrp: 0.945 ± 0.593
2.363GlyTyr: 2.363 ± 0.954
0.0GlyXaa: 0.0 ± 0.0
His
2.363HisAla: 2.363 ± 0.952
0.473HisCys: 0.473 ± 0.375
0.945HisAsp: 0.945 ± 0.595
0.473HisGlu: 0.473 ± 0.386
0.945HisPhe: 0.945 ± 0.773
0.945HisGly: 0.945 ± 0.442
0.945HisHis: 0.945 ± 0.771
0.473HisIle: 0.473 ± 0.375
1.418HisLys: 1.418 ± 0.855
1.418HisLeu: 1.418 ± 0.769
0.945HisMet: 0.945 ± 0.442
0.473HisAsn: 0.473 ± 0.387
0.473HisPro: 0.473 ± 0.386
0.473HisGln: 0.473 ± 0.452
1.89HisArg: 1.89 ± 1.196
0.473HisSer: 0.473 ± 0.606
0.473HisThr: 0.473 ± 0.375
0.945HisVal: 0.945 ± 0.751
0.473HisTrp: 0.473 ± 0.386
1.89HisTyr: 1.89 ± 1.273
0.0HisXaa: 0.0 ± 0.0
Ile
4.726IleAla: 4.726 ± 1.543
2.363IleCys: 2.363 ± 0.737
7.089IleAsp: 7.089 ± 2.575
4.726IleGlu: 4.726 ± 1.477
2.836IlePhe: 2.836 ± 1.134
2.363IleGly: 2.363 ± 0.963
0.945IleHis: 0.945 ± 0.705
3.781IleIle: 3.781 ± 1.456
4.253IleLys: 4.253 ± 1.747
3.781IleLeu: 3.781 ± 1.536
1.418IleMet: 1.418 ± 0.653
5.198IleAsn: 5.198 ± 1.235
5.198IlePro: 5.198 ± 1.619
2.363IleGln: 2.363 ± 0.807
1.418IleArg: 1.418 ± 0.815
6.144IleSer: 6.144 ± 2.44
7.089IleThr: 7.089 ± 1.689
3.781IleVal: 3.781 ± 0.941
1.418IleTrp: 1.418 ± 0.59
3.781IleTyr: 3.781 ± 0.861
0.0IleXaa: 0.0 ± 0.0
Lys
4.253LysAla: 4.253 ± 1.166
1.418LysCys: 1.418 ± 0.769
2.836LysAsp: 2.836 ± 1.044
1.89LysGlu: 1.89 ± 0.769
0.945LysPhe: 0.945 ± 0.548
2.363LysGly: 2.363 ± 1.119
0.945LysHis: 0.945 ± 0.771
4.726LysIle: 4.726 ± 0.7
6.144LysLys: 6.144 ± 2.058
5.198LysLeu: 5.198 ± 2.039
3.781LysMet: 3.781 ± 1.409
3.781LysAsn: 3.781 ± 0.937
1.418LysPro: 1.418 ± 0.736
4.253LysGln: 4.253 ± 1.298
4.253LysArg: 4.253 ± 1.857
2.363LysSer: 2.363 ± 0.678
4.253LysThr: 4.253 ± 0.775
4.726LysVal: 4.726 ± 1.434
0.0LysTrp: 0.0 ± 0.0
1.89LysTyr: 1.89 ± 0.729
0.0LysXaa: 0.0 ± 0.0
Leu
5.198LeuAla: 5.198 ± 2.279
2.363LeuCys: 2.363 ± 1.106
3.781LeuAsp: 3.781 ± 1.064
3.308LeuGlu: 3.308 ± 1.274
2.363LeuPhe: 2.363 ± 1.166
7.561LeuGly: 7.561 ± 1.072
1.418LeuHis: 1.418 ± 0.769
8.507LeuIle: 8.507 ± 2.661
4.253LeuLys: 4.253 ± 1.068
8.034LeuLeu: 8.034 ± 2.75
1.89LeuMet: 1.89 ± 0.923
4.253LeuAsn: 4.253 ± 1.169
4.726LeuPro: 4.726 ± 1.747
1.89LeuGln: 1.89 ± 0.787
2.836LeuArg: 2.836 ± 1.181
5.198LeuSer: 5.198 ± 1.496
5.198LeuThr: 5.198 ± 1.734
4.253LeuVal: 4.253 ± 2.081
1.418LeuTrp: 1.418 ± 0.85
3.781LeuTyr: 3.781 ± 1.074
0.0LeuXaa: 0.0 ± 0.0
Met
2.363MetAla: 2.363 ± 1.081
0.0MetCys: 0.0 ± 0.0
0.945MetAsp: 0.945 ± 1.006
1.418MetGlu: 1.418 ± 0.983
1.89MetPhe: 1.89 ± 0.884
1.418MetGly: 1.418 ± 0.68
0.473MetHis: 0.473 ± 0.375
1.89MetIle: 1.89 ± 0.929
0.945MetLys: 0.945 ± 0.734
2.363MetLeu: 2.363 ± 1.169
0.945MetMet: 0.945 ± 0.669
1.89MetAsn: 1.89 ± 0.786
1.89MetPro: 1.89 ± 1.014
0.945MetGln: 0.945 ± 1.143
1.89MetArg: 1.89 ± 0.721
1.89MetSer: 1.89 ± 0.858
2.363MetThr: 2.363 ± 1.356
3.308MetVal: 3.308 ± 0.702
0.0MetTrp: 0.0 ± 0.0
0.473MetTyr: 0.473 ± 0.525
0.0MetXaa: 0.0 ± 0.0
Asn
1.89AsnAla: 1.89 ± 0.939
0.473AsnCys: 0.473 ± 0.387
2.363AsnAsp: 2.363 ± 1.085
4.726AsnGlu: 4.726 ± 2.603
1.89AsnPhe: 1.89 ± 0.778
1.89AsnGly: 1.89 ± 0.922
0.473AsnHis: 0.473 ± 0.375
3.308AsnIle: 3.308 ± 1.545
5.198AsnLys: 5.198 ± 1.576
4.253AsnLeu: 4.253 ± 0.866
0.473AsnMet: 0.473 ± 0.572
1.89AsnAsn: 1.89 ± 0.665
4.253AsnPro: 4.253 ± 1.714
0.945AsnGln: 0.945 ± 0.438
2.363AsnArg: 2.363 ± 0.78
3.781AsnSer: 3.781 ± 1.454
2.363AsnThr: 2.363 ± 0.679
1.89AsnVal: 1.89 ± 1.031
0.945AsnTrp: 0.945 ± 0.741
0.945AsnTyr: 0.945 ± 0.816
0.0AsnXaa: 0.0 ± 0.0
Pro
1.418ProAla: 1.418 ± 0.411
0.945ProCys: 0.945 ± 0.471
4.726ProAsp: 4.726 ± 1.882
4.253ProGlu: 4.253 ± 2.613
3.781ProPhe: 3.781 ± 0.824
0.473ProGly: 0.473 ± 0.387
1.418ProHis: 1.418 ± 1.157
0.945ProIle: 0.945 ± 0.645
4.253ProLys: 4.253 ± 1.007
5.671ProLeu: 5.671 ± 2.123
1.89ProMet: 1.89 ± 0.858
2.836ProAsn: 2.836 ± 1.372
2.836ProPro: 2.836 ± 0.778
3.308ProGln: 3.308 ± 1.035
2.363ProArg: 2.363 ± 1.071
5.198ProSer: 5.198 ± 1.844
4.726ProThr: 4.726 ± 2.019
2.836ProVal: 2.836 ± 1.022
0.0ProTrp: 0.0 ± 0.0
0.473ProTyr: 0.473 ± 0.599
0.0ProXaa: 0.0 ± 0.0
Gln
1.89GlnAla: 1.89 ± 0.845
0.945GlnCys: 0.945 ± 0.625
2.363GlnAsp: 2.363 ± 0.552
1.418GlnGlu: 1.418 ± 0.837
1.89GlnPhe: 1.89 ± 1.177
2.836GlnGly: 2.836 ± 1.604
0.473GlnHis: 0.473 ± 0.387
3.781GlnIle: 3.781 ± 0.918
1.418GlnLys: 1.418 ± 0.59
4.253GlnLeu: 4.253 ± 1.338
0.473GlnMet: 0.473 ± 0.386
1.418GlnAsn: 1.418 ± 0.519
1.89GlnPro: 1.89 ± 0.733
1.89GlnGln: 1.89 ± 0.773
2.363GlnArg: 2.363 ± 0.874
3.781GlnSer: 3.781 ± 1.793
1.89GlnThr: 1.89 ± 0.805
3.781GlnVal: 3.781 ± 1.088
0.473GlnTrp: 0.473 ± 0.606
0.945GlnTyr: 0.945 ± 0.727
0.0GlnXaa: 0.0 ± 0.0
Arg
2.836ArgAla: 2.836 ± 1.254
0.0ArgCys: 0.0 ± 0.0
0.945ArgAsp: 0.945 ± 0.766
2.363ArgGlu: 2.363 ± 1.066
3.781ArgPhe: 3.781 ± 1.188
1.418ArgGly: 1.418 ± 1.157
0.473ArgHis: 0.473 ± 0.386
5.198ArgIle: 5.198 ± 1.338
2.836ArgLys: 2.836 ± 1.079
5.198ArgLeu: 5.198 ± 1.658
0.473ArgMet: 0.473 ± 0.606
2.836ArgAsn: 2.836 ± 1.565
3.308ArgPro: 3.308 ± 0.962
0.473ArgGln: 0.473 ± 0.386
2.363ArgArg: 2.363 ± 2.356
2.836ArgSer: 2.836 ± 1.412
2.836ArgThr: 2.836 ± 1.346
2.363ArgVal: 2.363 ± 0.867
0.945ArgTrp: 0.945 ± 0.645
1.418ArgTyr: 1.418 ± 0.81
0.0ArgXaa: 0.0 ± 0.0
Ser
6.616SerAla: 6.616 ± 1.804
0.473SerCys: 0.473 ± 0.387
3.781SerAsp: 3.781 ± 2.174
2.363SerGlu: 2.363 ± 1.19
4.253SerPhe: 4.253 ± 1.225
6.144SerGly: 6.144 ± 2.09
0.945SerHis: 0.945 ± 0.438
2.836SerIle: 2.836 ± 0.847
6.616SerLys: 6.616 ± 1.024
3.308SerLeu: 3.308 ± 1.099
4.726SerMet: 4.726 ± 1.557
2.836SerAsn: 2.836 ± 0.708
2.363SerPro: 2.363 ± 0.563
2.363SerGln: 2.363 ± 1.33
4.253SerArg: 4.253 ± 1.373
4.726SerSer: 4.726 ± 1.191
2.363SerThr: 2.363 ± 1.038
5.198SerVal: 5.198 ± 1.709
0.0SerTrp: 0.0 ± 0.0
3.308SerTyr: 3.308 ± 0.916
0.0SerXaa: 0.0 ± 0.0
Thr
4.253ThrAla: 4.253 ± 1.37
2.363ThrCys: 2.363 ± 1.148
2.836ThrAsp: 2.836 ± 1.001
2.363ThrGlu: 2.363 ± 0.678
2.363ThrPhe: 2.363 ± 0.589
6.144ThrGly: 6.144 ± 1.48
0.0ThrHis: 0.0 ± 0.0
3.308ThrIle: 3.308 ± 1.233
3.308ThrLys: 3.308 ± 0.815
3.308ThrLeu: 3.308 ± 1.058
1.418ThrMet: 1.418 ± 1.011
2.363ThrAsn: 2.363 ± 0.999
3.308ThrPro: 3.308 ± 1.748
1.89ThrGln: 1.89 ± 0.631
3.781ThrArg: 3.781 ± 0.861
2.363ThrSer: 2.363 ± 1.33
2.363ThrThr: 2.363 ± 1.085
4.726ThrVal: 4.726 ± 1.159
0.473ThrTrp: 0.473 ± 0.375
2.363ThrTyr: 2.363 ± 1.091
0.0ThrXaa: 0.0 ± 0.0
Val
3.308ValAla: 3.308 ± 0.883
1.418ValCys: 1.418 ± 0.588
3.308ValAsp: 3.308 ± 1.39
4.726ValGlu: 4.726 ± 1.321
3.781ValPhe: 3.781 ± 1.132
2.363ValGly: 2.363 ± 0.949
0.0ValHis: 0.0 ± 0.0
7.089ValIle: 7.089 ± 1.876
3.308ValLys: 3.308 ± 0.971
3.308ValLeu: 3.308 ± 1.34
0.473ValMet: 0.473 ± 0.387
3.308ValAsn: 3.308 ± 1.209
4.253ValPro: 4.253 ± 1.339
2.363ValGln: 2.363 ± 1.017
3.308ValArg: 3.308 ± 1.021
6.144ValSer: 6.144 ± 1.027
3.781ValThr: 3.781 ± 1.841
1.89ValVal: 1.89 ± 0.765
0.945ValTrp: 0.945 ± 0.83
2.363ValTyr: 2.363 ± 1.182
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.473TrpAsp: 0.473 ± 0.606
0.473TrpGlu: 0.473 ± 0.525
0.0TrpPhe: 0.0 ± 0.0
3.308TrpGly: 3.308 ± 1.225
0.473TrpHis: 0.473 ± 0.606
0.473TrpIle: 0.473 ± 0.387
0.473TrpLys: 0.473 ± 0.375
1.418TrpLeu: 1.418 ± 0.85
0.0TrpMet: 0.0 ± 0.0
0.473TrpAsn: 0.473 ± 0.387
1.89TrpPro: 1.89 ± 1.035
0.473TrpGln: 0.473 ± 0.713
0.945TrpArg: 0.945 ± 0.442
0.473TrpSer: 0.473 ± 0.386
0.0TrpThr: 0.0 ± 0.0
1.418TrpVal: 1.418 ± 1.067
0.945TrpTrp: 0.945 ± 0.72
0.473TrpTyr: 0.473 ± 0.386
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.781TyrAla: 3.781 ± 1.036
0.473TyrCys: 0.473 ± 0.375
1.89TyrAsp: 1.89 ± 0.789
1.89TyrGlu: 1.89 ± 0.665
3.308TyrPhe: 3.308 ± 1.049
3.781TyrGly: 3.781 ± 1.117
1.418TyrHis: 1.418 ± 0.723
2.363TyrIle: 2.363 ± 0.988
1.89TyrLys: 1.89 ± 1.206
3.308TyrLeu: 3.308 ± 1.301
0.0TyrMet: 0.0 ± 0.0
1.418TyrAsn: 1.418 ± 0.881
0.945TyrPro: 0.945 ± 0.773
2.836TyrGln: 2.836 ± 1.239
2.363TyrArg: 2.363 ± 0.694
1.418TyrSer: 1.418 ± 0.974
0.945TyrThr: 0.945 ± 0.442
1.89TyrVal: 1.89 ± 1.177
0.473TyrTrp: 0.473 ± 0.386
2.363TyrTyr: 2.363 ± 0.967
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 12 proteins (2117 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski