Amino acid dipeptide frequency for Enterobacteria phage M13 (Bacteriophage M13)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
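As a rough illustration of how such a frequency could be computed, the sketch below counts overlapping pairs of adjacent residues in a list of one-letter protein sequences. The function name `dipeptide_frequencies`, the toy sequences, and the choice to normalise by the total number of dipeptides counted within proteins are assumptions made for this example; the database may count or normalise differently.

```python
from collections import Counter
from itertools import product

# One-letter codes in the same order as the table (Ala, Cys, Asp, ...).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"  # "X" stands for Xaa (non-standard or unknown)


def dipeptide_frequencies(proteins):
    """Return a dict mapping each of the 441 dipeptides to its per-mille frequency.

    Pairs are counted with a sliding window inside each protein, so no
    dipeptide spans two proteins. Any letter outside STANDARD becomes "X".
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


# Toy input (made-up sequences, not the M13 proteome).
freqs = dipeptide_frequencies(["MKKLLA", "GGSGGS"])
print(freqs["GG"], freqs["KK"])
```

With real data, the sequences would come from the proteome's FASTA file rather than from hard-coded strings.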

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
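The arithmetic behind the expected frequency can be sketched as follows. The `classify` helper is hypothetical, and it assumes that the red/blue marking simply compares each value with the uniform expectation; the page does not state the exact rule. The three example values are taken from the table.

```python
N_STANDARD = 20   # standard amino acids
N_WITH_XAA = 21   # plus Xaa for non-standard or unknown residues

possible = N_STANDARD ** 2            # ordered pairs of standard residues
possible_with_xaa = N_WITH_XAA ** 2   # ordered pairs including Xaa
expected_percent = 100 / possible     # uniform expectation in percent
expected_per_mille = 1000 / possible  # the same expectation in per mille


def classify(per_mille, expected=expected_per_mille):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if per_mille > expected:
        return "over"
    if per_mille < expected:
        return "under"
    return "expected"


# Values copied from the table, in per mille.
examples = {"GlyGly": 13.87, "AlaAla": 2.312, "CysCys": 0.0}
labels = {pair: classify(value) for pair, value in examples.items()}
print(possible, possible_with_xaa, expected_percent, expected_per_mille)
print(labels)
```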

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
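As a small worked example of the unit conversion, the snippet below turns a per-mille value into a fraction and a percentage. The helper names are made up for illustration; the GlyGly value is taken from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to a percentage (divide by 10)."""
    return value / 10.0


gly_gly = 13.87  # GlyGly frequency from the table, in per mille
fraction = per_mille_to_fraction(gly_gly)
percent = per_mille_to_percent(gly_gly)
print(f"{gly_gly} per mille = {fraction:.5f} = {percent:.3f}%")
```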

For more information, see the sequence space article on Wikipedia.

Rows: first residue; columns: second residue. Each entry is the frequency ± error in ‰.
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.312 ± 0.978
AlaCys: 0.0 ± 0.0
AlaAsp: 1.387 ± 0.768
AlaGlu: 1.387 ± 0.657
AlaPhe: 4.161 ± 1.296
AlaGly: 4.161 ± 1.524
AlaHis: 0.0 ± 0.0
AlaIle: 5.548 ± 1.794
AlaLys: 5.086 ± 1.212
AlaLeu: 6.935 ± 1.677
AlaMet: 2.774 ± 1.033
AlaAsn: 4.161 ± 2.476
AlaPro: 1.849 ± 0.979
AlaGln: 1.849 ± 0.773
AlaArg: 2.312 ± 1.375
AlaSer: 5.548 ± 2.341
AlaThr: 6.01 ± 1.713
AlaVal: 1.849 ± 0.751
AlaTrp: 0.462 ± 0.448
AlaTyr: 1.849 ± 0.63
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.462 ± 0.479
CysCys: 0.0 ± 0.0
CysAsp: 0.462 ± 0.479
CysGlu: 0.462 ± 0.479
CysPhe: 0.925 ± 0.979
CysGly: 1.387 ± 0.609
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.462 ± 0.396
CysLeu: 2.774 ± 1.105
CysMet: 0.0 ± 0.0
CysAsn: 1.849 ± 0.782
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.925 ± 0.573
CysSer: 0.925 ± 0.838
CysThr: 1.387 ± 0.767
CysVal: 0.462 ± 0.419
CysTrp: 0.0 ± 0.0
CysTyr: 0.925 ± 0.643
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.236 ± 1.049
AspCys: 0.925 ± 0.958
AspAsp: 3.236 ± 0.709
AspGlu: 2.774 ± 1.0
AspPhe: 4.623 ± 0.967
AspGly: 2.774 ± 1.181
AspHis: 0.462 ± 0.419
AspIle: 1.849 ± 0.872
AspLys: 4.161 ± 2.13
AspLeu: 8.784 ± 3.009
AspMet: 0.925 ± 0.55
AspAsn: 2.312 ± 1.577
AspPro: 1.387 ± 0.994
AspGln: 0.462 ± 0.424
AspArg: 1.849 ± 0.624
AspSer: 7.859 ± 1.787
AspThr: 1.387 ± 0.855
AspVal: 3.236 ± 1.478
AspTrp: 1.849 ± 0.806
AspTyr: 2.312 ± 0.857
AspXaa: 0.0 ± 0.0
Glu
GluAla: 0.925 ± 0.838
GluCys: 1.849 ± 0.581
GluAsp: 0.462 ± 0.479
GluGlu: 0.925 ± 0.505
GluPhe: 0.925 ± 0.478
GluGly: 7.397 ± 3.565
GluHis: 0.462 ± 0.396
GluIle: 1.849 ± 0.918
GluLys: 0.462 ± 0.479
GluLeu: 1.849 ± 1.099
GluMet: 0.462 ± 0.424
GluAsn: 3.236 ± 1.549
GluPro: 0.925 ± 0.599
GluGln: 1.849 ± 1.028
GluArg: 1.387 ± 0.755
GluSer: 3.699 ± 1.27
GluThr: 1.849 ± 0.994
GluVal: 1.387 ± 0.707
GluTrp: 0.0 ± 0.0
GluTyr: 2.774 ± 0.799
GluXaa: 0.0 ± 0.0
Phe
PheAla: 7.859 ± 1.322
PheCys: 0.0 ± 0.0
PheAsp: 3.699 ± 0.933
PheGlu: 1.849 ± 1.265
PhePhe: 0.925 ± 0.848
PheGly: 3.699 ± 1.567
PheHis: 0.462 ± 0.479
PheIle: 3.236 ± 0.961
PheLys: 3.699 ± 1.259
PheLeu: 4.623 ± 2.072
PheMet: 0.925 ± 0.958
PheAsn: 1.849 ± 0.522
PhePro: 0.462 ± 0.424
PheGln: 0.925 ± 0.505
PheArg: 1.849 ± 1.916
PheSer: 6.01 ± 0.887
PheThr: 4.623 ± 1.77
PheVal: 5.548 ± 1.411
PheTrp: 0.925 ± 0.599
PheTyr: 2.312 ± 0.913
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.312 ± 1.9
GlyCys: 0.925 ± 0.573
GlyAsp: 6.01 ± 2.632
GlyGlu: 0.925 ± 0.848
GlyPhe: 4.161 ± 1.112
GlyGly: 13.87 ± 10.66
GlyHis: 0.462 ± 0.419
GlyIle: 4.161 ± 1.725
GlyLys: 6.472 ± 1.937
GlyLeu: 5.548 ± 1.196
GlyMet: 0.462 ± 0.419
GlyAsn: 3.236 ± 0.834
GlyPro: 0.0 ± 0.0
GlyGln: 4.161 ± 1.526
GlyArg: 1.849 ± 1.112
GlySer: 10.171 ± 5.242
GlyThr: 5.086 ± 2.34
GlyVal: 3.699 ± 1.021
GlyTrp: 0.925 ± 0.786
GlyTyr: 3.699 ± 1.28
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.462 ± 0.396
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 0.0 ± 0.0
HisPhe: 0.925 ± 0.599
HisGly: 0.462 ± 0.396
HisHis: 0.0 ± 0.0
HisIle: 0.462 ± 0.419
HisLys: 0.0 ± 0.0
HisLeu: 0.925 ± 0.63
HisMet: 0.0 ± 0.0
HisAsn: 0.462 ± 0.419
HisPro: 0.462 ± 0.419
HisGln: 0.462 ± 0.419
HisArg: 0.462 ± 0.419
HisSer: 1.387 ± 0.888
HisThr: 0.462 ± 0.479
HisVal: 1.387 ± 0.835
HisTrp: 0.0 ± 0.0
HisTyr: 0.462 ± 0.419
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.086 ± 2.281
IleCys: 0.0 ± 0.0
IleAsp: 6.935 ± 1.557
IleGlu: 1.387 ± 1.272
IlePhe: 2.774 ± 1.135
IleGly: 4.161 ± 1.283
IleHis: 0.0 ± 0.0
IleIle: 2.312 ± 1.175
IleLys: 3.236 ± 1.554
IleLeu: 3.699 ± 1.979
IleMet: 0.0 ± 0.0
IleAsn: 3.699 ± 0.754
IlePro: 4.161 ± 1.079
IleGln: 2.774 ± 1.173
IleArg: 0.925 ± 0.55
IleSer: 3.699 ± 1.584
IleThr: 5.086 ± 1.593
IleVal: 4.161 ± 1.469
IleTrp: 0.0 ± 0.0
IleTyr: 2.774 ± 1.289
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.086 ± 1.747
LysCys: 0.925 ± 0.693
LysAsp: 2.774 ± 0.637
LysGlu: 1.387 ± 0.523
LysPhe: 2.774 ± 1.479
LysGly: 4.161 ± 1.323
LysHis: 1.387 ± 0.89
LysIle: 7.397 ± 1.4
LysLys: 4.623 ± 2.309
LysLeu: 5.086 ± 2.125
LysMet: 2.774 ± 0.97
LysAsn: 1.849 ± 0.765
LysPro: 5.086 ± 1.528
LysGln: 3.236 ± 1.192
LysArg: 0.0 ± 0.0
LysSer: 3.236 ± 1.49
LysThr: 3.699 ± 0.694
LysVal: 2.774 ± 1.414
LysTrp: 0.0 ± 0.0
LysTyr: 0.925 ± 0.467
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.161 ± 1.572
LeuCys: 3.236 ± 1.8
LeuAsp: 6.472 ± 0.776
LeuGlu: 0.925 ± 0.478
LeuPhe: 4.161 ± 1.125
LeuGly: 4.623 ± 2.017
LeuHis: 0.925 ± 0.793
LeuIle: 5.086 ± 1.642
LeuLys: 4.623 ± 1.645
LeuLeu: 8.322 ± 2.483
LeuMet: 3.236 ± 0.977
LeuAsn: 4.161 ± 0.848
LeuPro: 7.397 ± 1.473
LeuGln: 3.236 ± 1.103
LeuArg: 5.548 ± 1.239
LeuSer: 12.02 ± 3.705
LeuThr: 6.01 ± 1.216
LeuVal: 10.633 ± 2.928
LeuTrp: 0.925 ± 0.478
LeuTyr: 3.236 ± 1.006
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.849 ± 0.842
MetCys: 0.0 ± 0.0
MetAsp: 0.0 ± 0.0
MetGlu: 0.925 ± 1.158
MetPhe: 0.925 ± 0.505
MetGly: 0.0 ± 0.0
MetHis: 0.0 ± 0.0
MetIle: 1.849 ± 1.534
MetLys: 2.312 ± 0.771
MetLeu: 0.925 ± 0.664
MetMet: 0.0 ± 0.0
MetAsn: 3.236 ± 1.287
MetPro: 1.849 ± 0.806
MetGln: 0.0 ± 0.0
MetArg: 0.0 ± 0.0
MetSer: 0.925 ± 0.656
MetThr: 0.925 ± 0.478
MetVal: 0.925 ± 0.561
MetTrp: 0.0 ± 0.0
MetTyr: 0.925 ± 0.958
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.699 ± 2.204
AsnCys: 0.0 ± 0.0
AsnAsp: 1.849 ± 0.765
AsnGlu: 6.01 ± 1.413
AsnPhe: 2.774 ± 1.189
AsnGly: 2.774 ± 1.087
AsnHis: 0.0 ± 0.0
AsnIle: 2.312 ± 0.755
AsnLys: 1.387 ± 0.979
AsnLeu: 6.935 ± 1.635
AsnMet: 0.925 ± 0.654
AsnAsn: 4.161 ± 1.897
AsnPro: 3.236 ± 1.586
AsnGln: 1.387 ± 1.272
AsnArg: 0.925 ± 0.958
AsnSer: 7.397 ± 0.855
AsnThr: 2.312 ± 1.061
AsnVal: 6.472 ± 3.001
AsnTrp: 0.462 ± 0.419
AsnTyr: 0.925 ± 0.573
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.312 ± 1.12
ProCys: 1.387 ± 0.915
ProAsp: 2.312 ± 0.845
ProGlu: 2.312 ± 0.929
ProPhe: 4.161 ± 1.064
ProGly: 1.387 ± 1.003
ProHis: 0.462 ± 0.479
ProIle: 1.387 ± 0.979
ProLys: 2.774 ± 0.888
ProLeu: 5.548 ± 1.212
ProMet: 0.462 ± 0.448
ProAsn: 1.387 ± 0.448
ProPro: 1.387 ± 1.437
ProGln: 2.312 ± 0.905
ProArg: 1.849 ± 0.622
ProSer: 5.548 ± 1.101
ProThr: 1.387 ± 0.536
ProVal: 3.699 ± 1.24
ProTrp: 0.0 ± 0.0
ProTyr: 1.387 ± 0.714
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.774 ± 1.325
GlnCys: 0.462 ± 0.479
GlnAsp: 1.849 ± 0.933
GlnGlu: 0.462 ± 0.424
GlnPhe: 1.387 ± 1.117
GlnGly: 4.623 ± 0.928
GlnHis: 0.462 ± 0.419
GlnIle: 1.387 ± 0.923
GlnLys: 3.236 ± 0.981
GlnLeu: 3.699 ± 1.57
GlnMet: 0.462 ± 0.479
GlnAsn: 3.236 ± 0.948
GlnPro: 2.774 ± 1.31
GlnGln: 0.925 ± 0.55
GlnArg: 3.236 ± 1.242
GlnSer: 2.774 ± 1.218
GlnThr: 2.774 ± 1.213
GlnVal: 2.312 ± 0.692
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.925 ± 0.958
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.312 ± 1.685
ArgCys: 0.0 ± 0.0
ArgAsp: 1.387 ± 0.8
ArgGlu: 0.0 ± 0.0
ArgPhe: 3.236 ± 1.107
ArgGly: 1.387 ± 0.484
ArgHis: 0.462 ± 0.419
ArgIle: 2.312 ± 0.976
ArgLys: 0.462 ± 0.396
ArgLeu: 5.548 ± 1.901
ArgMet: 0.462 ± 0.626
ArgAsn: 2.774 ± 0.85
ArgPro: 1.387 ± 0.62
ArgGln: 2.312 ± 0.877
ArgArg: 1.387 ± 0.609
ArgSer: 3.236 ± 1.137
ArgThr: 0.925 ± 0.838
ArgVal: 2.312 ± 1.052
ArgTrp: 0.462 ± 0.396
ArgTyr: 4.161 ± 1.256
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 8.322 ± 1.477
SerCys: 0.462 ± 0.479
SerAsp: 5.548 ± 1.242
SerGlu: 4.161 ± 3.078
SerPhe: 6.935 ± 2.361
SerGly: 10.171 ± 1.947
SerHis: 1.849 ± 0.356
SerIle: 5.086 ± 1.811
SerLys: 6.935 ± 1.824
SerLeu: 6.472 ± 1.497
SerMet: 2.312 ± 0.736
SerAsn: 5.086 ± 1.188
SerPro: 2.312 ± 0.582
SerGln: 7.397 ± 1.258
SerArg: 4.161 ± 1.562
SerSer: 9.246 ± 3.332
SerThr: 3.699 ± 1.397
SerVal: 7.397 ± 2.334
SerTrp: 0.462 ± 0.396
SerTyr: 3.699 ± 1.895
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 0.0 ± 0.0
ThrCys: 0.925 ± 0.599
ThrAsp: 2.774 ± 1.127
ThrGlu: 1.849 ± 1.439
ThrPhe: 2.774 ± 1.184
ThrGly: 4.161 ± 1.774
ThrHis: 0.462 ± 0.419
ThrIle: 4.161 ± 1.418
ThrLys: 2.312 ± 0.882
ThrLeu: 5.548 ± 1.073
ThrMet: 0.925 ± 0.599
ThrAsn: 1.849 ± 0.707
ThrPro: 2.774 ± 0.986
ThrGln: 2.774 ± 0.966
ThrArg: 2.312 ± 1.203
ThrSer: 4.623 ± 1.991
ThrThr: 1.849 ± 0.955
ThrVal: 6.935 ± 1.266
ThrTrp: 1.849 ± 0.356
ThrTyr: 4.623 ± 2.433
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.086 ± 1.077
ValCys: 1.387 ± 0.86
ValAsp: 2.774 ± 0.919
ValGlu: 4.623 ± 1.127
ValPhe: 3.699 ± 1.012
ValGly: 5.086 ± 1.48
ValHis: 0.462 ± 0.529
ValIle: 3.236 ± 1.671
ValLys: 6.01 ± 1.925
ValLeu: 10.633 ± 3.104
ValMet: 0.0 ± 0.0
ValAsn: 3.236 ± 1.226
ValPro: 4.623 ± 1.449
ValGln: 1.849 ± 1.045
ValArg: 2.774 ± 1.456
ValSer: 7.397 ± 0.966
ValThr: 3.699 ± 1.322
ValVal: 6.472 ± 1.249
ValTrp: 0.462 ± 0.479
ValTyr: 2.774 ± 1.226
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.462 ± 0.448
TrpCys: 0.462 ± 0.628
TrpAsp: 0.925 ± 0.636
TrpGlu: 0.462 ± 0.419
TrpPhe: 0.925 ± 0.793
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.462 ± 0.479
TrpLeu: 0.462 ± 0.396
TrpMet: 0.0 ± 0.0
TrpAsn: 1.849 ± 0.853
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 0.925 ± 0.599
TrpSer: 0.462 ± 0.419
TrpThr: 0.0 ± 0.0
TrpVal: 0.462 ± 0.479
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.387 ± 0.703
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.387 ± 0.746
TyrCys: 0.462 ± 0.396
TyrAsp: 5.548 ± 1.419
TyrGlu: 1.849 ± 1.308
TyrPhe: 2.312 ± 1.161
TyrGly: 1.849 ± 1.431
TyrHis: 0.462 ± 0.419
TyrIle: 3.236 ± 1.351
TyrLys: 0.462 ± 0.424
TyrLeu: 4.623 ± 1.77
TyrMet: 0.0 ± 0.0
TyrAsn: 2.312 ± 0.845
TyrPro: 1.387 ± 0.595
TyrGln: 1.849 ± 0.984
TyrArg: 1.849 ± 0.782
TyrSer: 5.548 ± 1.807
TyrThr: 2.312 ± 1.427
TyrVal: 4.161 ± 0.892
TyrTrp: 0.462 ± 0.479
TyrTyr: 0.462 ± 0.479
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 11 proteins (2164 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
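A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resampled values is reported. Taking the sample standard deviation as the error, the function names, the fixed seed, and the toy sequences are all assumptions for illustration; the page does not specify how the ± value is derived.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, pair):
    """Per-mille frequency of one dipeptide, counted within each protein."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Each resample draws len(proteins) whole proteins with replacement.
    The returned value is the sample standard deviation of the resampled
    frequencies (an assumed definition of the error).
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy input (made-up sequences, not the M13 proteome).
toy = ["MKKLLGGS", "GGSGGSAA", "MALLKKGG", "SSGGAALL"]
value = dipeptide_per_mille(toy, "GG")
error = bootstrap_error(toy, "GG")
print(f"GG: {value:.3f} ± {error:.3f}")
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, which is why a single unusual protein can produce a large error for a dipeptide it dominates.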

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski