Amino acid dipeptide frequency for Enterobacteria phage WA13

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random with equal probability, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones are marked in blue in the table.
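The counting described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the Proteome-pI implementation: the function name `dipeptide_per_mille`, the toy sequences, and the choice to normalize by the total number of dipeptides counted are all assumptions, and the sequences use one-letter codes rather than the three-letter codes shown in the table.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Hypothetical helper: pairs are counted within each protein only
    (no pair spans two proteins) and normalized by the total pair count.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs: positions (0,1), (1,2), ...
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input for illustration only, not real phage sequences.
toy_proteins = ["MKKLA", "AAKL"]
freqs = dipeptide_per_mille(toy_proteins)

# Uniform expectation for the 20 x 20 standard dipeptides, in per mille.
uniform_expectation = 1000.0 / 400

for pair, value in sorted(freqs.items()):
    label = "over" if value > uniform_expectation else "under"
    print(f"{pair}: {value:.3f} ({label}represented vs. uniform)")
```

With such a small toy input every observed pair lies far above the uniform expectation; the comparison only becomes meaningful for a full proteome.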

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Dipeptide: frequency ± error (‰), grouped by first residue; second residues appear in the order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 10.438 ± 3.938
AlaCys: 2.088 ± 1.163
AlaAsp: 2.61 ± 0.667
AlaGlu: 5.741 ± 1.611
AlaPhe: 3.653 ± 1.424
AlaGly: 7.307 ± 3.214
AlaHis: 2.61 ± 0.92
AlaIle: 3.132 ± 0.772
AlaLys: 5.741 ± 2.09
AlaLeu: 8.351 ± 1.388
AlaMet: 0.522 ± 0.39
AlaAsn: 2.61 ± 1.264
AlaPro: 4.697 ± 1.584
AlaGln: 4.697 ± 1.211
AlaArg: 4.175 ± 1.553
AlaSer: 9.916 ± 3.031
AlaThr: 6.263 ± 1.754
AlaVal: 4.697 ± 1.45
AlaTrp: 1.044 ± 0.451
AlaTyr: 2.088 ± 0.985
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.044 ± 0.619
CysCys: 0.0 ± 0.0
CysAsp: 0.522 ± 0.401
CysGlu: 0.522 ± 0.536
CysPhe: 0.0 ± 0.0
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.522 ± 0.39
CysLeu: 1.044 ± 0.611
CysMet: 0.0 ± 0.0
CysAsn: 0.522 ± 0.401
CysPro: 1.044 ± 0.416
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 0.522 ± 0.401
CysThr: 0.522 ± 0.401
CysVal: 3.653 ± 1.739
CysTrp: 0.0 ± 0.0
CysTyr: 0.522 ± 0.39
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.219 ± 1.579
AspCys: 1.044 ± 0.644
AspAsp: 3.132 ± 1.426
AspGlu: 3.132 ± 1.333
AspPhe: 3.653 ± 1.719
AspGly: 3.132 ± 1.279
AspHis: 1.566 ± 0.864
AspIle: 3.653 ± 0.844
AspLys: 1.566 ± 0.616
AspLeu: 5.741 ± 1.727
AspMet: 1.566 ± 0.559
AspAsn: 2.088 ± 0.831
AspPro: 2.088 ± 0.733
AspGln: 1.566 ± 0.433
AspArg: 2.088 ± 0.509
AspSer: 6.263 ± 1.332
AspThr: 3.132 ± 0.883
AspVal: 3.653 ± 0.439
AspTrp: 1.044 ± 0.655
AspTyr: 2.088 ± 0.668
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.566 ± 0.779
GluCys: 0.522 ± 0.39
GluAsp: 2.61 ± 0.857
GluGlu: 1.566 ± 1.077
GluPhe: 4.175 ± 1.279
GluGly: 2.61 ± 0.68
GluHis: 1.566 ± 0.923
GluIle: 2.088 ± 0.885
GluLys: 5.741 ± 2.397
GluLeu: 5.219 ± 1.745
GluMet: 2.61 ± 0.729
GluAsn: 2.61 ± 1.147
GluPro: 0.522 ± 0.484
GluGln: 1.044 ± 0.802
GluArg: 3.653 ± 1.005
GluSer: 3.653 ± 1.553
GluThr: 4.175 ± 0.911
GluVal: 3.132 ± 1.072
GluTrp: 0.0 ± 0.0
GluTyr: 0.0 ± 0.0
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.088 ± 1.265
PheCys: 0.522 ± 0.401
PheAsp: 4.175 ± 1.115
PheGlu: 1.566 ± 0.536
PhePhe: 2.61 ± 0.634
PheGly: 4.175 ± 1.13
PheHis: 0.522 ± 0.401
PheIle: 1.566 ± 1.159
PheLys: 3.132 ± 0.99
PheLeu: 2.61 ± 0.927
PheMet: 2.61 ± 1.241
PheAsn: 2.61 ± 0.676
PhePro: 2.088 ± 0.886
PheGln: 2.088 ± 0.872
PheArg: 2.61 ± 1.148
PheSer: 3.653 ± 2.412
PheThr: 3.132 ± 0.782
PheVal: 2.088 ± 1.271
PheTrp: 0.522 ± 0.401
PheTyr: 1.566 ± 0.906
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.263 ± 2.225
GlyCys: 0.522 ± 0.484
GlyAsp: 1.566 ± 0.537
GlyGlu: 1.566 ± 0.855
GlyPhe: 2.088 ± 0.612
GlyGly: 4.175 ± 1.446
GlyHis: 0.522 ± 0.484
GlyIle: 3.132 ± 1.556
GlyLys: 4.697 ± 1.502
GlyLeu: 2.61 ± 0.922
GlyMet: 1.566 ± 0.722
GlyAsn: 3.132 ± 1.588
GlyPro: 0.0 ± 0.0
GlyGln: 4.175 ± 2.047
GlyArg: 3.653 ± 1.244
GlySer: 4.697 ± 1.124
GlyThr: 1.566 ± 0.845
GlyVal: 5.741 ± 1.363
GlyTrp: 1.566 ± 0.699
GlyTyr: 2.088 ± 0.932
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.566 ± 0.879
HisCys: 0.0 ± 0.0
HisAsp: 1.044 ± 0.416
HisGlu: 1.566 ± 0.703
HisPhe: 0.522 ± 0.484
HisGly: 1.044 ± 0.416
HisHis: 0.0 ± 0.0
HisIle: 0.0 ± 0.0
HisLys: 0.522 ± 0.562
HisLeu: 3.132 ± 0.968
HisMet: 1.044 ± 0.779
HisAsn: 0.522 ± 0.435
HisPro: 0.522 ± 0.532
HisGln: 1.566 ± 0.433
HisArg: 1.566 ± 0.614
HisSer: 2.088 ± 1.603
HisThr: 0.522 ± 0.401
HisVal: 2.61 ± 0.818
HisTrp: 1.566 ± 0.61
HisTyr: 0.522 ± 0.401
HisXaa: 0.0 ± 0.0
Ile
IleAla: 7.307 ± 2.413
IleCys: 1.566 ± 0.879
IleAsp: 2.61 ± 0.93
IleGlu: 0.522 ± 0.562
IlePhe: 0.522 ± 0.39
IleGly: 2.61 ± 0.922
IleHis: 1.044 ± 0.709
IleIle: 0.522 ± 0.39
IleLys: 1.566 ± 0.919
IleLeu: 1.044 ± 0.451
IleMet: 3.653 ± 1.273
IleAsn: 1.044 ± 0.779
IlePro: 4.175 ± 1.701
IleGln: 2.61 ± 1.315
IleArg: 1.566 ± 0.699
IleSer: 1.044 ± 0.779
IleThr: 2.088 ± 0.798
IleVal: 2.088 ± 0.612
IleTrp: 1.044 ± 0.582
IleTyr: 1.044 ± 0.802
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.219 ± 1.664
LysCys: 1.044 ± 0.779
LysAsp: 4.175 ± 1.908
LysGlu: 2.61 ± 0.613
LysPhe: 2.088 ± 1.235
LysGly: 2.61 ± 1.291
LysHis: 2.088 ± 0.733
LysIle: 2.61 ± 1.01
LysLys: 6.263 ± 2.086
LysLeu: 4.175 ± 1.45
LysMet: 2.61 ± 0.937
LysAsn: 3.132 ± 1.444
LysPro: 2.61 ± 0.836
LysGln: 4.175 ± 1.214
LysArg: 2.088 ± 0.928
LysSer: 1.566 ± 0.722
LysThr: 2.61 ± 0.956
LysVal: 3.132 ± 0.935
LysTrp: 1.044 ± 0.738
LysTyr: 2.61 ± 1.513
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.263 ± 2.349
LeuCys: 0.522 ± 0.39
LeuAsp: 6.785 ± 1.548
LeuGlu: 3.132 ± 1.142
LeuPhe: 2.088 ± 0.509
LeuGly: 2.088 ± 0.605
LeuHis: 2.088 ± 0.831
LeuIle: 2.088 ± 1.039
LeuLys: 7.307 ± 1.076
LeuLeu: 8.873 ± 4.365
LeuMet: 3.653 ± 0.808
LeuAsn: 3.132 ± 0.934
LeuPro: 3.653 ± 1.146
LeuGln: 2.61 ± 0.536
LeuArg: 6.785 ± 2.113
LeuSer: 10.438 ± 2.273
LeuThr: 5.219 ± 1.375
LeuVal: 4.697 ± 1.596
LeuTrp: 2.088 ± 1.032
LeuTyr: 1.044 ± 0.779
LeuXaa: 0.0 ± 0.0
Met
MetAla: 5.219 ± 1.231
MetCys: 0.0 ± 0.0
MetAsp: 1.566 ± 0.703
MetGlu: 3.653 ± 1.092
MetPhe: 2.088 ± 1.02
MetGly: 0.522 ± 0.401
MetHis: 1.566 ± 0.871
MetIle: 2.088 ± 0.762
MetLys: 2.088 ± 0.816
MetLeu: 2.61 ± 0.959
MetMet: 1.044 ± 0.781
MetAsn: 0.0 ± 0.0
MetPro: 0.522 ± 0.401
MetGln: 4.697 ± 1.052
MetArg: 1.566 ± 0.796
MetSer: 2.088 ± 0.886
MetThr: 2.61 ± 0.729
MetVal: 1.044 ± 0.779
MetTrp: 0.522 ± 0.39
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.175 ± 1.241
AsnCys: 0.522 ± 0.401
AsnAsp: 2.088 ± 0.793
AsnGlu: 2.088 ± 0.605
AsnPhe: 1.044 ± 0.611
AsnGly: 3.653 ± 1.193
AsnHis: 0.522 ± 0.562
AsnIle: 2.61 ± 1.078
AsnLys: 0.522 ± 0.39
AsnLeu: 3.653 ± 1.213
AsnMet: 3.132 ± 1.742
AsnAsn: 1.566 ± 0.703
AsnPro: 2.088 ± 0.692
AsnGln: 1.566 ± 0.877
AsnArg: 2.61 ± 0.71
AsnSer: 4.175 ± 1.117
AsnThr: 3.132 ± 1.259
AsnVal: 2.088 ± 1.204
AsnTrp: 0.522 ± 0.39
AsnTyr: 2.61 ± 0.997
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.132 ± 1.277
ProCys: 1.044 ± 0.954
ProAsp: 2.61 ± 1.208
ProGlu: 4.175 ± 1.167
ProPhe: 1.566 ± 0.536
ProGly: 1.044 ± 0.619
ProHis: 1.044 ± 0.802
ProIle: 0.522 ± 0.39
ProLys: 0.522 ± 0.401
ProLeu: 6.785 ± 1.452
ProMet: 0.0 ± 0.0
ProAsn: 2.61 ± 0.823
ProPro: 1.566 ± 1.202
ProGln: 0.0 ± 0.0
ProArg: 2.61 ± 1.269
ProSer: 4.697 ± 1.978
ProThr: 2.088 ± 0.889
ProVal: 3.653 ± 1.457
ProTrp: 0.522 ± 0.401
ProTyr: 1.566 ± 0.699
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.697 ± 1.868
GlnCys: 0.522 ± 0.401
GlnAsp: 1.044 ± 0.87
GlnGlu: 4.175 ± 1.194
GlnPhe: 1.566 ± 0.95
GlnGly: 2.61 ± 1.458
GlnHis: 1.566 ± 0.722
GlnIle: 3.132 ± 1.307
GlnLys: 4.175 ± 1.419
GlnLeu: 4.697 ± 1.343
GlnMet: 1.044 ± 0.721
GlnAsn: 1.566 ± 1.305
GlnPro: 3.132 ± 1.366
GlnGln: 4.175 ± 1.646
GlnArg: 1.044 ± 0.416
GlnSer: 4.697 ± 1.152
GlnThr: 3.653 ± 0.904
GlnVal: 2.61 ± 1.774
GlnTrp: 2.61 ± 0.838
GlnTyr: 1.044 ± 0.416
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.263 ± 1.178
ArgCys: 0.522 ± 0.532
ArgAsp: 3.653 ± 0.99
ArgGlu: 1.566 ± 0.639
ArgPhe: 4.175 ± 1.663
ArgGly: 1.566 ± 0.718
ArgHis: 1.044 ± 0.605
ArgIle: 2.61 ± 0.548
ArgLys: 4.697 ± 1.745
ArgLeu: 5.741 ± 1.764
ArgMet: 2.61 ± 0.789
ArgAsn: 2.088 ± 0.685
ArgPro: 2.61 ± 1.372
ArgGln: 4.175 ± 2.133
ArgArg: 3.132 ± 1.17
ArgSer: 3.132 ± 1.665
ArgThr: 2.61 ± 1.148
ArgVal: 4.175 ± 1.345
ArgTrp: 0.0 ± 0.0
ArgTyr: 3.653 ± 1.109
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.829 ± 2.35
SerCys: 0.0 ± 0.0
SerAsp: 4.697 ± 1.262
SerGlu: 3.132 ± 1.58
SerPhe: 4.175 ± 1.114
SerGly: 6.785 ± 1.902
SerHis: 1.566 ± 0.718
SerIle: 3.132 ± 1.042
SerLys: 2.088 ± 0.837
SerLeu: 3.653 ± 2.0
SerMet: 3.653 ± 1.039
SerAsn: 5.741 ± 1.111
SerPro: 2.61 ± 1.334
SerGln: 5.741 ± 1.639
SerArg: 5.219 ± 1.122
SerSer: 7.829 ± 2.262
SerThr: 3.132 ± 1.516
SerVal: 5.741 ± 2.276
SerTrp: 1.044 ± 0.745
SerTyr: 2.088 ± 0.945
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.175 ± 1.393
ThrCys: 0.0 ± 0.0
ThrAsp: 3.653 ± 1.517
ThrGlu: 4.175 ± 1.053
ThrPhe: 2.61 ± 0.872
ThrGly: 2.61 ± 0.941
ThrHis: 1.044 ± 0.416
ThrIle: 2.088 ± 1.086
ThrLys: 3.132 ± 1.286
ThrLeu: 7.307 ± 2.548
ThrMet: 0.522 ± 0.401
ThrAsn: 2.088 ± 0.978
ThrPro: 2.61 ± 0.956
ThrGln: 3.132 ± 1.353
ThrArg: 4.697 ± 1.337
ThrSer: 4.697 ± 1.696
ThrThr: 0.522 ± 0.401
ThrVal: 3.132 ± 1.37
ThrTrp: 0.0 ± 0.0
ThrTyr: 2.088 ± 0.831
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.263 ± 2.228
ValCys: 0.0 ± 0.0
ValAsp: 5.741 ± 1.575
ValGlu: 3.132 ± 1.688
ValPhe: 3.653 ± 1.593
ValGly: 3.132 ± 0.793
ValHis: 1.044 ± 0.611
ValIle: 2.088 ± 0.798
ValLys: 3.653 ± 1.703
ValLeu: 3.132 ± 1.234
ValMet: 2.088 ± 0.965
ValAsn: 4.697 ± 1.996
ValPro: 2.61 ± 1.058
ValGln: 3.132 ± 1.047
ValArg: 7.307 ± 2.339
ValSer: 3.132 ± 1.366
ValThr: 4.175 ± 1.757
ValVal: 4.697 ± 2.078
ValTrp: 0.522 ± 0.39
ValTyr: 3.132 ± 0.761
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.522 ± 0.401
TrpCys: 0.0 ± 0.0
TrpAsp: 1.044 ± 0.745
TrpGlu: 0.522 ± 0.435
TrpPhe: 1.044 ± 0.733
TrpGly: 0.522 ± 0.484
TrpHis: 0.522 ± 0.39
TrpIle: 1.044 ± 0.582
TrpLys: 1.044 ± 0.818
TrpLeu: 1.044 ± 0.733
TrpMet: 0.522 ± 0.401
TrpAsn: 1.044 ± 0.56
TrpPro: 1.566 ± 1.169
TrpGln: 0.522 ± 0.39
TrpArg: 0.0 ± 0.0
TrpSer: 0.522 ± 0.39
TrpThr: 2.088 ± 0.831
TrpVal: 0.522 ± 0.435
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.566 ± 0.681
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.132 ± 0.991
TyrCys: 0.0 ± 0.0
TyrAsp: 2.088 ± 1.086
TyrGlu: 0.522 ± 0.39
TyrPhe: 2.61 ± 0.867
TyrGly: 3.132 ± 0.666
TyrHis: 0.0 ± 0.0
TyrIle: 1.566 ± 0.861
TyrLys: 0.0 ± 0.0
TyrLeu: 3.132 ± 1.438
TyrMet: 0.522 ± 0.39
TyrAsn: 1.566 ± 0.718
TyrPro: 1.044 ± 0.702
TyrGln: 2.088 ± 1.558
TyrArg: 3.653 ± 1.289
TyrSer: 1.044 ± 0.416
TyrThr: 1.044 ± 0.582
TyrVal: 4.175 ± 1.234
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.044 ± 0.723
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (1917 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
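A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration under stated assumptions, not the code behind the table: the helper names `pair_freqs` and `bootstrap_error`, the toy sequences, the fixed seed, and the use of the standard deviation across replicates as the reported error are all assumptions made here.

```python
import random
import statistics
from collections import Counter


def pair_freqs(proteins):
    """Dipeptide frequencies in per mille over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of one dipeptide frequency across bootstrap replicates.

    Each replicate resamples whole proteins with replacement (the
    'protein level'), keeping the number of proteins unchanged.
    Returning the standard deviation is an assumption of this sketch.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_freqs(sample).get(pair, 0.0))
    return statistics.pstdev(estimates)


# Toy input for illustration only, not real phage sequences.
toy_proteins = ["MKKLA", "AAKL", "KLKL"]
error_kl = bootstrap_error(toy_proteins, "KL")
print(f"KL: {pair_freqs(toy_proteins)['KL']:.3f} ± {error_kl:.3f}")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects how much the frequency depends on which proteins happen to be in the proteome.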

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski