Amino acid dipepetide frequency for Vibrio phage VCY-phi

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.294AlaAla: 3.294 ± 2.616
0.471AlaCys: 0.471 ± 0.397
2.353AlaAsp: 2.353 ± 0.913
3.294AlaGlu: 3.294 ± 0.73
2.824AlaPhe: 2.824 ± 1.663
3.294AlaGly: 3.294 ± 1.0
1.412AlaHis: 1.412 ± 0.738
5.647AlaIle: 5.647 ± 1.469
5.176AlaLys: 5.176 ± 1.607
8.471AlaLeu: 8.471 ± 1.058
0.941AlaMet: 0.941 ± 0.881
7.529AlaAsn: 7.529 ± 2.031
2.824AlaPro: 2.824 ± 1.477
3.765AlaGln: 3.765 ± 0.899
1.412AlaArg: 1.412 ± 1.207
2.353AlaSer: 2.353 ± 1.117
4.235AlaThr: 4.235 ± 0.949
4.706AlaVal: 4.706 ± 1.869
0.941AlaTrp: 0.941 ± 0.631
4.235AlaTyr: 4.235 ± 1.434
0.0AlaXaa: 0.0 ± 0.0
Cys
0.471CysAla: 0.471 ± 0.397
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.471CysGlu: 0.471 ± 0.367
1.882CysPhe: 1.882 ± 1.041
1.882CysGly: 1.882 ± 1.021
0.941CysHis: 0.941 ± 0.436
0.471CysIle: 0.471 ± 0.531
1.412CysLys: 1.412 ± 0.691
0.471CysLeu: 0.471 ± 0.397
0.0CysMet: 0.0 ± 0.0
1.412CysAsn: 1.412 ± 0.827
0.471CysPro: 0.471 ± 0.397
0.941CysGln: 0.941 ± 0.436
0.471CysArg: 0.471 ± 0.389
0.471CysSer: 0.471 ± 0.59
0.471CysThr: 0.471 ± 0.492
1.882CysVal: 1.882 ± 0.753
0.471CysTrp: 0.471 ± 0.397
0.941CysTyr: 0.941 ± 0.518
0.0CysXaa: 0.0 ± 0.0
Asp
1.412AspAla: 1.412 ± 0.592
0.941AspCys: 0.941 ± 0.567
4.235AspAsp: 4.235 ± 1.068
4.706AspGlu: 4.706 ± 1.633
3.765AspPhe: 3.765 ± 1.043
2.824AspGly: 2.824 ± 1.072
0.471AspHis: 0.471 ± 0.389
4.706AspIle: 4.706 ± 1.774
3.294AspLys: 3.294 ± 1.448
6.588AspLeu: 6.588 ± 1.023
1.412AspMet: 1.412 ± 0.818
2.824AspAsn: 2.824 ± 1.202
4.235AspPro: 4.235 ± 1.205
1.412AspGln: 1.412 ± 0.871
2.824AspArg: 2.824 ± 1.194
3.294AspSer: 3.294 ± 0.79
3.294AspThr: 3.294 ± 0.712
2.353AspVal: 2.353 ± 0.887
0.471AspTrp: 0.471 ± 0.397
0.941AspTyr: 0.941 ± 0.673
0.0AspXaa: 0.0 ± 0.0
Glu
4.235GluAla: 4.235 ± 1.249
1.882GluCys: 1.882 ± 0.732
0.471GluAsp: 0.471 ± 0.389
2.353GluGlu: 2.353 ± 0.955
2.824GluPhe: 2.824 ± 1.413
5.176GluGly: 5.176 ± 1.343
0.471GluHis: 0.471 ± 0.59
3.294GluIle: 3.294 ± 0.966
1.882GluLys: 1.882 ± 0.671
5.176GluLeu: 5.176 ± 1.21
1.412GluMet: 1.412 ± 0.831
5.647GluAsn: 5.647 ± 1.237
1.412GluPro: 1.412 ± 0.453
1.882GluGln: 1.882 ± 1.041
2.353GluArg: 2.353 ± 1.042
3.765GluSer: 3.765 ± 1.416
1.882GluThr: 1.882 ± 0.762
2.824GluVal: 2.824 ± 1.205
0.471GluTrp: 0.471 ± 0.591
2.824GluTyr: 2.824 ± 1.113
0.0GluXaa: 0.0 ± 0.0
Phe
5.176PheAla: 5.176 ± 1.528
0.471PheCys: 0.471 ± 0.397
2.353PheAsp: 2.353 ± 0.763
1.412PheGlu: 1.412 ± 0.847
4.706PhePhe: 4.706 ± 2.315
5.176PheGly: 5.176 ± 1.939
0.471PheHis: 0.471 ± 0.397
3.294PheIle: 3.294 ± 0.803
3.294PheLys: 3.294 ± 1.241
6.118PheLeu: 6.118 ± 1.768
3.294PheMet: 3.294 ± 0.836
5.176PheAsn: 5.176 ± 2.053
1.882PhePro: 1.882 ± 0.747
0.941PheGln: 0.941 ± 0.436
3.294PheArg: 3.294 ± 1.355
3.765PheSer: 3.765 ± 0.765
2.824PheThr: 2.824 ± 1.054
2.824PheVal: 2.824 ± 1.671
0.0PheTrp: 0.0 ± 0.0
1.882PheTyr: 1.882 ± 0.7
0.0PheXaa: 0.0 ± 0.0
Gly
5.176GlyAla: 5.176 ± 2.216
0.941GlyCys: 0.941 ± 0.734
4.706GlyAsp: 4.706 ± 1.622
2.824GlyGlu: 2.824 ± 1.102
3.294GlyPhe: 3.294 ± 0.858
7.529GlyGly: 7.529 ± 3.911
1.412GlyHis: 1.412 ± 0.648
4.706GlyIle: 4.706 ± 1.864
4.235GlyLys: 4.235 ± 1.668
10.353GlyLeu: 10.353 ± 2.824
1.412GlyMet: 1.412 ± 0.43
1.882GlyAsn: 1.882 ± 0.667
0.0GlyPro: 0.0 ± 0.0
3.765GlyGln: 3.765 ± 1.385
5.176GlyArg: 5.176 ± 1.844
7.529GlySer: 7.529 ± 1.744
1.412GlyThr: 1.412 ± 0.805
4.235GlyVal: 4.235 ± 2.097
0.941GlyTrp: 0.941 ± 0.673
3.294GlyTyr: 3.294 ± 1.155
0.0GlyXaa: 0.0 ± 0.0
His
0.941HisAla: 0.941 ± 0.594
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
0.0HisGlu: 0.0 ± 0.0
0.941HisPhe: 0.941 ± 0.734
0.941HisGly: 0.941 ± 0.436
0.0HisHis: 0.0 ± 0.0
0.941HisIle: 0.941 ± 0.778
1.412HisLys: 1.412 ± 0.805
0.941HisLeu: 0.941 ± 0.794
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
0.941HisPro: 0.941 ± 0.518
0.0HisGln: 0.0 ± 0.0
0.471HisArg: 0.471 ± 0.397
1.412HisSer: 1.412 ± 0.769
0.471HisThr: 0.471 ± 0.397
1.412HisVal: 1.412 ± 0.846
0.471HisTrp: 0.471 ± 0.531
0.471HisTyr: 0.471 ± 0.397
0.0HisXaa: 0.0 ± 0.0
Ile
6.588IleAla: 6.588 ± 1.637
0.471IleCys: 0.471 ± 0.591
6.588IleAsp: 6.588 ± 1.274
3.294IleGlu: 3.294 ± 0.98
3.294IlePhe: 3.294 ± 1.826
5.176IleGly: 5.176 ± 1.628
0.471IleHis: 0.471 ± 0.397
0.941IleIle: 0.941 ± 0.518
4.235IleLys: 4.235 ± 0.698
6.118IleLeu: 6.118 ± 1.284
0.941IleMet: 0.941 ± 0.436
2.824IleAsn: 2.824 ± 1.039
4.235IlePro: 4.235 ± 1.732
1.412IleGln: 1.412 ± 1.095
1.882IleArg: 1.882 ± 0.697
7.529IleSer: 7.529 ± 2.818
3.765IleThr: 3.765 ± 1.131
2.353IleVal: 2.353 ± 0.755
0.0IleTrp: 0.0 ± 0.0
2.824IleTyr: 2.824 ± 1.586
0.0IleXaa: 0.0 ± 0.0
Lys
3.294LysAla: 3.294 ± 1.021
0.471LysCys: 0.471 ± 0.367
1.882LysAsp: 1.882 ± 0.7
3.294LysGlu: 3.294 ± 1.026
1.882LysPhe: 1.882 ± 1.113
5.176LysGly: 5.176 ± 1.376
0.471LysHis: 0.471 ± 0.367
4.706LysIle: 4.706 ± 0.777
3.765LysLys: 3.765 ± 2.118
8.0LysLeu: 8.0 ± 2.084
0.471LysMet: 0.471 ± 0.555
5.647LysAsn: 5.647 ± 1.571
2.824LysPro: 2.824 ± 1.056
2.353LysGln: 2.353 ± 1.555
2.824LysArg: 2.824 ± 1.62
2.824LysSer: 2.824 ± 1.221
3.294LysThr: 3.294 ± 0.825
2.353LysVal: 2.353 ± 0.671
0.0LysTrp: 0.0 ± 0.0
0.941LysTyr: 0.941 ± 0.636
0.0LysXaa: 0.0 ± 0.0
Leu
8.0LeuAla: 8.0 ± 2.981
3.765LeuCys: 3.765 ± 1.817
6.118LeuAsp: 6.118 ± 1.306
5.647LeuGlu: 5.647 ± 1.633
6.588LeuPhe: 6.588 ± 2.095
6.588LeuGly: 6.588 ± 2.195
0.941LeuHis: 0.941 ± 0.594
6.588LeuIle: 6.588 ± 2.006
4.706LeuLys: 4.706 ± 1.072
10.824LeuLeu: 10.824 ± 1.783
1.412LeuMet: 1.412 ± 1.217
6.118LeuAsn: 6.118 ± 2.004
4.235LeuPro: 4.235 ± 1.33
4.235LeuGln: 4.235 ± 0.962
3.765LeuArg: 3.765 ± 0.805
4.706LeuSer: 4.706 ± 1.266
5.647LeuThr: 5.647 ± 1.884
3.294LeuVal: 3.294 ± 1.047
0.941LeuTrp: 0.941 ± 0.582
3.765LeuTyr: 3.765 ± 0.987
0.0LeuXaa: 0.0 ± 0.0
Met
1.882MetAla: 1.882 ± 0.786
0.0MetCys: 0.0 ± 0.0
0.471MetAsp: 0.471 ± 0.564
2.824MetGlu: 2.824 ± 1.259
0.0MetPhe: 0.0 ± 0.0
1.882MetGly: 1.882 ± 0.828
0.471MetHis: 0.471 ± 0.367
0.941MetIle: 0.941 ± 0.603
0.941MetLys: 0.941 ± 0.859
3.765MetLeu: 3.765 ± 0.816
0.0MetMet: 0.0 ± 0.0
0.471MetAsn: 0.471 ± 0.591
1.412MetPro: 1.412 ± 0.74
0.941MetGln: 0.941 ± 0.918
1.412MetArg: 1.412 ± 0.829
2.353MetSer: 2.353 ± 0.926
1.412MetThr: 1.412 ± 0.835
2.824MetVal: 2.824 ± 1.208
0.0MetTrp: 0.0 ± 0.0
0.941MetTyr: 0.941 ± 0.788
0.0MetXaa: 0.0 ± 0.0
Asn
6.588AsnAla: 6.588 ± 2.224
0.941AsnCys: 0.941 ± 0.589
3.765AsnAsp: 3.765 ± 1.502
3.765AsnGlu: 3.765 ± 2.076
0.941AsnPhe: 0.941 ± 0.61
5.176AsnGly: 5.176 ± 1.593
0.471AsnHis: 0.471 ± 0.367
2.824AsnIle: 2.824 ± 1.091
2.353AsnLys: 2.353 ± 1.474
3.765AsnLeu: 3.765 ± 1.165
2.353AsnMet: 2.353 ± 0.94
1.882AsnAsn: 1.882 ± 0.99
1.882AsnPro: 1.882 ± 0.761
2.353AsnGln: 2.353 ± 0.678
1.882AsnArg: 1.882 ± 0.906
4.235AsnSer: 4.235 ± 1.689
4.706AsnThr: 4.706 ± 2.164
3.765AsnVal: 3.765 ± 1.684
2.353AsnTrp: 2.353 ± 0.669
2.353AsnTyr: 2.353 ± 1.113
0.0AsnXaa: 0.0 ± 0.0
Pro
1.412ProAla: 1.412 ± 0.453
0.0ProCys: 0.0 ± 0.0
4.706ProAsp: 4.706 ± 1.153
1.882ProGlu: 1.882 ± 0.758
0.941ProPhe: 0.941 ± 0.436
0.471ProGly: 0.471 ± 0.367
0.0ProHis: 0.0 ± 0.0
1.882ProIle: 1.882 ± 0.979
1.882ProLys: 1.882 ± 0.814
2.353ProLeu: 2.353 ± 1.087
0.0ProMet: 0.0 ± 0.0
3.765ProAsn: 3.765 ± 1.103
1.412ProPro: 1.412 ± 0.691
2.824ProGln: 2.824 ± 1.205
0.941ProArg: 0.941 ± 0.67
5.176ProSer: 5.176 ± 1.494
2.353ProThr: 2.353 ± 1.011
2.353ProVal: 2.353 ± 0.671
0.471ProTrp: 0.471 ± 0.389
2.353ProTyr: 2.353 ± 0.762
0.0ProXaa: 0.0 ± 0.0
Gln
3.765GlnAla: 3.765 ± 1.088
0.0GlnCys: 0.0 ± 0.0
1.882GlnAsp: 1.882 ± 0.794
1.412GlnGlu: 1.412 ± 0.598
3.294GlnPhe: 3.294 ± 1.438
1.882GlnGly: 1.882 ± 1.047
0.471GlnHis: 0.471 ± 0.389
4.706GlnIle: 4.706 ± 1.558
2.353GlnLys: 2.353 ± 1.654
3.765GlnLeu: 3.765 ± 1.227
2.353GlnMet: 2.353 ± 0.738
1.882GlnAsn: 1.882 ± 1.34
1.412GlnPro: 1.412 ± 0.821
1.882GlnGln: 1.882 ± 0.535
1.882GlnArg: 1.882 ± 0.794
3.294GlnSer: 3.294 ± 0.974
1.882GlnThr: 1.882 ± 1.588
1.882GlnVal: 1.882 ± 0.678
0.0GlnTrp: 0.0 ± 0.0
2.353GlnTyr: 2.353 ± 0.671
0.0GlnXaa: 0.0 ± 0.0
Arg
2.824ArgAla: 2.824 ± 1.316
0.471ArgCys: 0.471 ± 0.367
3.294ArgAsp: 3.294 ± 0.759
2.824ArgGlu: 2.824 ± 0.998
3.294ArgPhe: 3.294 ± 0.983
1.412ArgGly: 1.412 ± 0.805
1.412ArgHis: 1.412 ± 1.191
3.765ArgIle: 3.765 ± 1.355
2.824ArgLys: 2.824 ± 1.339
3.294ArgLeu: 3.294 ± 1.425
2.353ArgMet: 2.353 ± 1.286
1.412ArgAsn: 1.412 ± 1.193
1.412ArgPro: 1.412 ± 0.517
0.471ArgGln: 0.471 ± 0.397
0.941ArgArg: 0.941 ± 0.604
1.882ArgSer: 1.882 ± 0.777
2.824ArgThr: 2.824 ± 0.758
1.882ArgVal: 1.882 ± 0.906
0.941ArgTrp: 0.941 ± 1.182
1.882ArgTyr: 1.882 ± 0.784
0.0ArgXaa: 0.0 ± 0.0
Ser
5.176SerAla: 5.176 ± 1.658
0.471SerCys: 0.471 ± 0.59
3.294SerAsp: 3.294 ± 1.019
6.588SerGlu: 6.588 ± 1.425
6.588SerPhe: 6.588 ± 1.176
9.412SerGly: 9.412 ± 3.389
0.471SerHis: 0.471 ± 0.531
4.235SerIle: 4.235 ± 1.144
2.824SerLys: 2.824 ± 1.24
4.706SerLeu: 4.706 ± 1.5
2.824SerMet: 2.824 ± 2.219
2.353SerAsn: 2.353 ± 0.95
0.941SerPro: 0.941 ± 0.552
4.235SerGln: 4.235 ± 1.695
2.353SerArg: 2.353 ± 0.989
6.118SerSer: 6.118 ± 2.515
4.706SerThr: 4.706 ± 1.682
4.706SerVal: 4.706 ± 1.125
0.0SerTrp: 0.0 ± 0.0
1.882SerTyr: 1.882 ± 0.855
0.0SerXaa: 0.0 ± 0.0
Thr
2.824ThrAla: 2.824 ± 1.401
1.882ThrCys: 1.882 ± 1.078
2.353ThrAsp: 2.353 ± 0.822
1.412ThrGlu: 1.412 ± 0.775
2.824ThrPhe: 2.824 ± 1.083
2.824ThrGly: 2.824 ± 1.046
0.0ThrHis: 0.0 ± 0.0
2.824ThrIle: 2.824 ± 1.028
4.235ThrLys: 4.235 ± 1.394
5.647ThrLeu: 5.647 ± 1.701
1.882ThrMet: 1.882 ± 1.028
2.353ThrAsn: 2.353 ± 1.037
2.824ThrPro: 2.824 ± 0.988
3.294ThrGln: 3.294 ± 1.387
2.824ThrArg: 2.824 ± 0.774
5.647ThrSer: 5.647 ± 1.583
2.353ThrThr: 2.353 ± 0.627
3.765ThrVal: 3.765 ± 1.564
0.471ThrTrp: 0.471 ± 0.63
2.353ThrTyr: 2.353 ± 1.194
0.0ThrXaa: 0.0 ± 0.0
Val
4.235ValAla: 4.235 ± 0.952
0.941ValCys: 0.941 ± 0.866
3.765ValAsp: 3.765 ± 1.481
2.824ValGlu: 2.824 ± 1.412
4.235ValPhe: 4.235 ± 1.012
2.824ValGly: 2.824 ± 1.01
0.471ValHis: 0.471 ± 0.397
7.059ValIle: 7.059 ± 1.636
2.824ValLys: 2.824 ± 1.066
3.294ValLeu: 3.294 ± 1.508
1.412ValMet: 1.412 ± 1.095
2.824ValAsn: 2.824 ± 1.519
0.471ValPro: 0.471 ± 0.559
2.353ValGln: 2.353 ± 0.671
3.765ValArg: 3.765 ± 1.371
4.706ValSer: 4.706 ± 0.963
3.294ValThr: 3.294 ± 1.324
4.706ValVal: 4.706 ± 2.593
1.412ValTrp: 1.412 ± 1.206
0.941ValTyr: 0.941 ± 0.552
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.941TrpCys: 0.941 ± 0.69
1.412TrpAsp: 1.412 ± 0.555
0.471TrpGlu: 0.471 ± 0.564
0.0TrpPhe: 0.0 ± 0.0
1.882TrpGly: 1.882 ± 0.906
0.471TrpHis: 0.471 ± 0.591
0.0TrpIle: 0.0 ± 0.0
0.471TrpLys: 0.471 ± 0.63
0.471TrpLeu: 0.471 ± 0.63
0.0TrpMet: 0.0 ± 0.0
0.471TrpAsn: 0.471 ± 0.367
0.0TrpPro: 0.0 ± 0.0
0.941TrpGln: 0.941 ± 0.552
0.0TrpArg: 0.0 ± 0.0
0.941TrpSer: 0.941 ± 0.778
1.412TrpThr: 1.412 ± 0.887
0.471TrpVal: 0.471 ± 0.59
0.471TrpTrp: 0.471 ± 0.591
0.471TrpTyr: 0.471 ± 0.63
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.882TyrAla: 1.882 ± 1.024
0.471TyrCys: 0.471 ± 0.367
2.353TyrAsp: 2.353 ± 1.335
0.941TyrGlu: 0.941 ± 0.844
4.235TyrPhe: 4.235 ± 2.126
3.294TyrGly: 3.294 ± 1.379
0.471TyrHis: 0.471 ± 0.367
1.412TyrIle: 1.412 ± 0.591
2.353TyrLys: 2.353 ± 1.108
3.765TyrLeu: 3.765 ± 1.624
0.0TyrMet: 0.0 ± 0.0
1.882TyrAsn: 1.882 ± 0.557
2.353TyrPro: 2.353 ± 1.489
2.353TyrGln: 2.353 ± 1.44
0.941TyrArg: 0.941 ± 0.794
2.353TyrSer: 2.353 ± 0.763
2.353TyrThr: 2.353 ± 0.763
3.765TyrVal: 3.765 ± 1.298
0.471TyrTrp: 0.471 ± 0.531
2.824TyrTyr: 2.824 ± 1.512
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 11 proteins (2126 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski