Amino acid dipeptide frequency for Lactococcus phage 936 group phage Phi43

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
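As a rough illustration of what such a calculation involves, the following minimal sketch counts overlapping residue pairs in a set of protein sequences and reports each pair's share in per mille. It is not the database's own code: the function name, the one-letter sequence input, the choice to count pairs only within a protein, and the normalization by the total number of pairs are all assumptions made for this example, and the toy sequences are made up.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Hypothetical helper: pairs are counted within each protein only and
    normalized by the total number of pairs (both are assumptions).
    """
    counts = Counter()
    for seq in proteins:
        seq = seq.upper()
        # Slide a window of two residues along the sequence.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


toy = ["MKKLA", "MKA"]  # made-up sequences, one-letter codes
freqs = dipeptide_frequencies(toy)
print(sorted(freqs.items()))
```

The table on this page labels dipeptides with three-letter codes (for example LysLys rather than KK); mapping between the two notations is left out of the sketch.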

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
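The uniform expectation and the over/under split can be sketched in a few lines. This assumes the threshold is exactly the uniform expectation for a 20-letter alphabet; the page does not state the precise rule used for colouring, and the function names are hypothetical. The two example values are taken from the table on this page.

```python
def uniform_expectation_permille(alphabet_size=20):
    """Expected per-mille frequency of each dipeptide if all were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(freq_permille, expected):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"


expected = uniform_expectation_permille(20)  # 1000 / 400
examples = {"GluLeu": 9.513, "CysCys": 0.113}  # values from the table, in per mille
labels = {name: classify(value, expected) for name, value in examples.items()}
print(expected, labels)
```

With Xaa included as a 21st residue, the same function gives 1000 / 441, roughly 2.27‰.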

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
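For readers who prefer fractions or percentages, the conversion is a simple rescaling. The helper names below are made up for this example; the GluLeu value is taken from the table on this page.

```python
def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per-mille value to a percentage (divide by 10)."""
    return value_permille / 10.0


glu_leu = 9.513  # GluLeu frequency from the table, in per mille
fraction = permille_to_fraction(glu_leu)
percent = permille_to_percent(glu_leu)
print(fraction, percent)
```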

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.906 ± 0.416
AlaCys: 0.227 ± 0.19
AlaAsp: 3.284 ± 0.651
AlaGlu: 4.757 ± 0.993
AlaPhe: 3.851 ± 0.965
AlaGly: 4.077 ± 0.843
AlaHis: 0.793 ± 0.359
AlaIle: 4.643 ± 0.861
AlaLys: 6.229 ± 0.945
AlaLeu: 6.569 ± 0.989
AlaMet: 2.492 ± 0.67
AlaAsn: 4.417 ± 0.843
AlaPro: 0.793 ± 0.355
AlaGln: 2.718 ± 0.559
AlaArg: 2.265 ± 0.522
AlaSer: 2.945 ± 0.835
AlaThr: 3.511 ± 0.741
AlaVal: 4.077 ± 0.875
AlaTrp: 2.152 ± 0.975
AlaTyr: 1.699 ± 0.385
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.227 ± 0.141
CysCys: 0.113 ± 0.125
CysAsp: 0.34 ± 0.174
CysGlu: 0.227 ± 0.163
CysPhe: 0.227 ± 0.152
CysGly: 0.793 ± 0.287
CysHis: 0.227 ± 0.154
CysIle: 0.34 ± 0.192
CysLys: 0.793 ± 0.339
CysLeu: 0.34 ± 0.18
CysMet: 0.227 ± 0.143
CysAsn: 0.68 ± 0.291
CysPro: 0.227 ± 0.142
CysGln: 0.227 ± 0.153
CysArg: 0.68 ± 0.277
CysSer: 0.113 ± 0.121
CysThr: 0.113 ± 0.125
CysVal: 0.34 ± 0.195
CysTrp: 0.34 ± 0.197
CysTyr: 0.227 ± 0.162
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.378 ± 0.572
AspCys: 0.227 ± 0.161
AspAsp: 2.718 ± 0.645
AspGlu: 3.737 ± 0.685
AspPhe: 3.511 ± 0.649
AspGly: 3.737 ± 0.54
AspHis: 0.793 ± 0.342
AspIle: 3.737 ± 0.792
AspLys: 5.549 ± 0.702
AspLeu: 6.455 ± 0.84
AspMet: 1.133 ± 0.264
AspAsn: 3.624 ± 0.721
AspPro: 1.586 ± 0.446
AspGln: 0.68 ± 0.325
AspArg: 1.359 ± 0.448
AspSer: 3.624 ± 0.681
AspThr: 3.737 ± 0.699
AspVal: 3.058 ± 0.703
AspTrp: 0.793 ± 0.256
AspTyr: 2.378 ± 0.518
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.19 ± 0.635
GluCys: 0.566 ± 0.252
GluAsp: 3.058 ± 0.562
GluGlu: 4.983 ± 1.04
GluPhe: 3.284 ± 0.543
GluGly: 2.378 ± 0.567
GluHis: 1.133 ± 0.346
GluIle: 6.342 ± 0.869
GluLys: 5.436 ± 1.024
GluLeu: 9.513 ± 1.457
GluMet: 2.152 ± 0.506
GluAsn: 4.983 ± 0.825
GluPro: 1.133 ± 0.345
GluGln: 4.077 ± 0.817
GluArg: 3.171 ± 0.585
GluSer: 3.851 ± 0.61
GluThr: 4.304 ± 0.688
GluVal: 4.757 ± 0.778
GluTrp: 1.019 ± 0.314
GluTyr: 2.945 ± 0.574
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.831 ± 0.586
PheCys: 0.227 ± 0.18
PheAsp: 3.058 ± 0.56
PheGlu: 2.718 ± 0.641
PhePhe: 1.699 ± 0.561
PheGly: 2.152 ± 0.498
PheHis: 0.34 ± 0.253
PheIle: 3.058 ± 0.58
PheLys: 3.737 ± 0.568
PheLeu: 2.718 ± 0.488
PheMet: 0.68 ± 0.262
PheAsn: 2.831 ± 0.656
PhePro: 1.019 ± 0.385
PheGln: 1.472 ± 0.436
PheArg: 1.472 ± 0.339
PheSer: 4.077 ± 0.91
PheThr: 3.511 ± 0.514
PheVal: 2.152 ± 0.431
PheTrp: 0.227 ± 0.157
PheTyr: 1.586 ± 0.385
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.851 ± 1.117
GlyCys: 0.566 ± 0.235
GlyAsp: 3.171 ± 0.671
GlyGlu: 4.19 ± 0.573
GlyPhe: 2.492 ± 0.647
GlyGly: 4.304 ± 0.721
GlyHis: 0.793 ± 0.342
GlyIle: 4.757 ± 1.284
GlyLys: 6.795 ± 1.094
GlyLeu: 5.889 ± 1.136
GlyMet: 1.359 ± 0.395
GlyAsn: 3.964 ± 0.79
GlyPro: 0.227 ± 0.162
GlyGln: 2.265 ± 0.414
GlyArg: 2.152 ± 0.433
GlySer: 4.643 ± 0.841
GlyThr: 3.964 ± 0.986
GlyVal: 5.549 ± 1.207
GlyTrp: 1.246 ± 0.299
GlyTyr: 3.058 ± 0.576
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.793 ± 0.24
HisCys: 0.793 ± 0.333
HisAsp: 0.68 ± 0.249
HisGlu: 0.566 ± 0.246
HisPhe: 0.34 ± 0.213
HisGly: 2.039 ± 0.572
HisHis: 0.113 ± 0.121
HisIle: 1.133 ± 0.366
HisLys: 1.019 ± 0.314
HisLeu: 0.906 ± 0.297
HisMet: 0.0 ± 0.0
HisAsn: 1.359 ± 0.379
HisPro: 0.113 ± 0.096
HisGln: 0.227 ± 0.146
HisArg: 0.453 ± 0.228
HisSer: 0.34 ± 0.282
HisThr: 1.246 ± 0.407
HisVal: 0.453 ± 0.208
HisTrp: 0.0 ± 0.0
HisTyr: 0.566 ± 0.27
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.323 ± 0.806
IleCys: 0.0 ± 0.0
IleAsp: 4.304 ± 0.677
IleGlu: 6.908 ± 0.933
IlePhe: 3.171 ± 0.66
IleGly: 4.077 ± 0.897
IleHis: 1.019 ± 0.291
IleIle: 4.757 ± 0.682
IleLys: 6.455 ± 0.77
IleLeu: 5.21 ± 0.795
IleMet: 1.699 ± 0.381
IleAsn: 4.87 ± 0.695
IlePro: 1.925 ± 0.45
IleGln: 2.492 ± 0.53
IleArg: 1.699 ± 0.443
IleSer: 3.851 ± 0.838
IleThr: 5.21 ± 0.693
IleVal: 4.417 ± 0.644
IleTrp: 1.133 ± 0.336
IleTyr: 2.378 ± 0.495
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.569 ± 1.06
LysCys: 0.566 ± 0.258
LysAsp: 4.53 ± 0.557
LysGlu: 7.928 ± 1.561
LysPhe: 2.152 ± 0.483
LysGly: 5.889 ± 0.93
LysHis: 1.359 ± 0.464
LysIle: 5.323 ± 0.88
LysLys: 8.834 ± 1.02
LysLeu: 7.814 ± 0.724
LysMet: 3.171 ± 0.422
LysAsn: 5.663 ± 0.823
LysPro: 1.359 ± 0.47
LysGln: 3.284 ± 0.704
LysArg: 3.058 ± 0.8
LysSer: 5.663 ± 0.847
LysThr: 5.436 ± 0.807
LysVal: 5.549 ± 0.922
LysTrp: 1.359 ± 0.305
LysTyr: 3.398 ± 0.701
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.983 ± 0.742
LeuCys: 0.227 ± 0.164
LeuAsp: 4.757 ± 0.631
LeuGlu: 5.436 ± 0.768
LeuPhe: 3.398 ± 0.711
LeuGly: 5.096 ± 0.956
LeuHis: 1.359 ± 0.433
LeuIle: 7.588 ± 0.961
LeuLys: 7.814 ± 0.923
LeuLeu: 6.795 ± 1.18
LeuMet: 1.586 ± 0.482
LeuAsn: 4.983 ± 0.855
LeuPro: 3.058 ± 0.549
LeuGln: 3.058 ± 0.486
LeuArg: 2.945 ± 0.56
LeuSer: 4.983 ± 0.733
LeuThr: 6.002 ± 0.743
LeuVal: 5.889 ± 0.672
LeuTrp: 1.472 ± 0.394
LeuTyr: 4.643 ± 0.823
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.925 ± 0.465
MetCys: 0.113 ± 0.109
MetAsp: 1.246 ± 0.433
MetGlu: 1.812 ± 0.562
MetPhe: 0.453 ± 0.207
MetGly: 1.019 ± 0.331
MetHis: 0.227 ± 0.164
MetIle: 2.265 ± 0.418
MetLys: 2.831 ± 0.599
MetLeu: 1.246 ± 0.355
MetMet: 0.34 ± 0.199
MetAsn: 2.152 ± 0.503
MetPro: 0.34 ± 0.17
MetGln: 1.812 ± 0.377
MetArg: 0.453 ± 0.198
MetSer: 1.586 ± 0.398
MetThr: 1.925 ± 0.548
MetVal: 1.019 ± 0.286
MetTrp: 0.227 ± 0.154
MetTyr: 1.133 ± 0.377
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.663 ± 1.007
AsnCys: 0.227 ± 0.166
AsnAsp: 4.757 ± 0.822
AsnGlu: 4.87 ± 0.746
AsnPhe: 1.812 ± 0.565
AsnGly: 5.776 ± 0.829
AsnHis: 0.793 ± 0.295
AsnIle: 4.304 ± 0.66
AsnLys: 5.323 ± 0.96
AsnLeu: 5.776 ± 0.699
AsnMet: 1.133 ± 0.31
AsnAsn: 4.304 ± 1.029
AsnPro: 2.378 ± 0.507
AsnGln: 2.039 ± 0.557
AsnArg: 2.492 ± 0.443
AsnSer: 4.53 ± 0.587
AsnThr: 3.851 ± 0.681
AsnVal: 3.624 ± 0.602
AsnTrp: 1.019 ± 0.36
AsnTyr: 2.605 ± 0.607
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.133 ± 0.369
ProCys: 0.113 ± 0.092
ProAsp: 1.472 ± 0.508
ProGlu: 1.925 ± 0.501
ProPhe: 0.906 ± 0.314
ProGly: 0.34 ± 0.197
ProHis: 0.0 ± 0.0
ProIle: 1.472 ± 0.421
ProLys: 1.812 ± 0.418
ProLeu: 1.812 ± 0.372
ProMet: 0.566 ± 0.25
ProAsn: 2.378 ± 0.82
ProPro: 0.68 ± 0.303
ProGln: 0.68 ± 0.304
ProArg: 0.566 ± 0.207
ProSer: 1.019 ± 0.331
ProThr: 2.605 ± 0.639
ProVal: 1.586 ± 0.395
ProTrp: 0.227 ± 0.171
ProTyr: 0.906 ± 0.318
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.398 ± 0.595
GlnCys: 0.227 ± 0.132
GlnAsp: 1.925 ± 0.437
GlnGlu: 2.718 ± 0.519
GlnPhe: 1.359 ± 0.412
GlnGly: 2.718 ± 0.479
GlnHis: 0.34 ± 0.191
GlnIle: 1.359 ± 0.251
GlnLys: 2.831 ± 0.541
GlnLeu: 2.831 ± 0.487
GlnMet: 1.246 ± 0.293
GlnAsn: 1.699 ± 0.417
GlnPro: 1.133 ± 0.334
GlnGln: 1.586 ± 0.391
GlnArg: 1.925 ± 0.536
GlnSer: 2.718 ± 0.502
GlnThr: 2.831 ± 0.729
GlnVal: 2.605 ± 0.611
GlnTrp: 0.793 ± 0.274
GlnTyr: 1.472 ± 0.334
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.925 ± 0.474
ArgCys: 0.34 ± 0.201
ArgAsp: 1.925 ± 0.508
ArgGlu: 1.925 ± 0.509
ArgPhe: 0.793 ± 0.258
ArgGly: 2.492 ± 0.558
ArgHis: 0.906 ± 0.285
ArgIle: 2.152 ± 0.501
ArgLys: 3.964 ± 0.924
ArgLeu: 3.511 ± 0.654
ArgMet: 0.906 ± 0.341
ArgAsn: 2.492 ± 0.634
ArgPro: 0.453 ± 0.224
ArgGln: 1.812 ± 0.354
ArgArg: 2.378 ± 0.628
ArgSer: 1.586 ± 0.431
ArgThr: 2.039 ± 0.501
ArgVal: 1.812 ± 0.483
ArgTrp: 0.227 ± 0.142
ArgTyr: 2.039 ± 0.542
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.983 ± 1.113
SerCys: 0.566 ± 0.264
SerAsp: 3.058 ± 0.58
SerGlu: 3.737 ± 0.562
SerPhe: 2.605 ± 0.688
SerGly: 6.455 ± 1.602
SerHis: 0.566 ± 0.238
SerIle: 4.983 ± 0.749
SerLys: 4.077 ± 0.677
SerLeu: 5.889 ± 1.181
SerMet: 1.699 ± 0.361
SerAsn: 4.304 ± 0.787
SerPro: 1.472 ± 0.48
SerGln: 2.152 ± 0.443
SerArg: 2.265 ± 0.471
SerSer: 6.002 ± 1.298
SerThr: 2.718 ± 0.626
SerVal: 4.19 ± 1.07
SerTrp: 1.359 ± 0.35
SerTyr: 1.699 ± 0.365
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.663 ± 0.803
ThrCys: 0.227 ± 0.16
ThrAsp: 3.171 ± 0.639
ThrGlu: 5.323 ± 0.881
ThrPhe: 3.058 ± 0.492
ThrGly: 4.87 ± 0.839
ThrHis: 0.113 ± 0.128
ThrIle: 4.304 ± 0.805
ThrLys: 5.776 ± 0.7
ThrLeu: 4.983 ± 0.802
ThrMet: 1.019 ± 0.314
ThrAsn: 4.417 ± 0.584
ThrPro: 1.699 ± 0.326
ThrGln: 3.171 ± 0.523
ThrArg: 1.812 ± 0.413
ThrSer: 4.417 ± 0.662
ThrThr: 4.53 ± 0.673
ThrVal: 4.643 ± 0.883
ThrTrp: 1.359 ± 0.368
ThrTyr: 1.812 ± 0.435
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.511 ± 0.556
ValCys: 0.68 ± 0.321
ValAsp: 4.304 ± 0.666
ValGlu: 4.757 ± 0.732
ValPhe: 2.718 ± 0.475
ValGly: 3.171 ± 0.555
ValHis: 0.68 ± 0.213
ValIle: 4.983 ± 0.499
ValLys: 5.889 ± 0.775
ValLeu: 3.398 ± 0.552
ValMet: 1.472 ± 0.394
ValAsn: 3.171 ± 0.762
ValPro: 1.586 ± 0.511
ValGln: 1.699 ± 0.554
ValArg: 2.831 ± 0.734
ValSer: 6.342 ± 1.594
ValThr: 4.983 ± 1.027
ValVal: 3.284 ± 0.759
ValTrp: 0.227 ± 0.164
ValTyr: 2.831 ± 0.521
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.906 ± 0.375
TrpCys: 0.34 ± 0.204
TrpAsp: 0.906 ± 0.348
TrpGlu: 0.566 ± 0.25
TrpPhe: 1.359 ± 0.597
TrpGly: 1.019 ± 0.378
TrpHis: 0.227 ± 0.168
TrpIle: 0.793 ± 0.301
TrpLys: 1.133 ± 0.335
TrpLeu: 1.133 ± 0.406
TrpMet: 0.34 ± 0.172
TrpAsn: 1.812 ± 0.498
TrpPro: 0.0 ± 0.0
TrpGln: 1.019 ± 0.308
TrpArg: 0.453 ± 0.302
TrpSer: 1.019 ± 0.271
TrpThr: 1.019 ± 0.367
TrpVal: 0.566 ± 0.223
TrpTrp: 0.113 ± 0.122
TrpTyr: 1.019 ± 0.291
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.246 ± 0.368
TyrCys: 0.453 ± 0.288
TyrAsp: 2.152 ± 0.557
TyrGlu: 4.19 ± 0.777
TyrPhe: 2.378 ± 0.513
TyrGly: 2.831 ± 0.629
TyrHis: 1.359 ± 0.387
TyrIle: 2.718 ± 0.559
TyrLys: 2.605 ± 0.692
TyrLeu: 3.284 ± 0.69
TyrMet: 0.793 ± 0.431
TyrAsn: 3.171 ± 0.544
TyrPro: 1.019 ± 0.434
TyrGln: 1.246 ± 0.502
TyrArg: 1.246 ± 0.457
TyrSer: 1.472 ± 0.451
TyrThr: 2.831 ± 0.554
TyrVal: 2.945 ± 0.513
TyrTrp: 0.453 ± 0.252
TyrTyr: 2.152 ± 0.526
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 54 proteins (8831 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
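One plausible reading of a protein-level bootstrap is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the 100 replicate values is reported. This is an assumption-laden sketch, not the database's procedure. In particular, taking the sample standard deviation as the "error", the counting convention, the function names, and the fixed seed are all choices made for this example, and the toy sequences are made up.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Per-mille frequency of one dipeptide over overlapping pairs (assumed convention)."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    values = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        values.append(pair_permille(sample, pair))
    return statistics.stdev(values)


toy = ["MKKLA", "MKA", "AKKKL", "MLLA"]  # made-up sequences, one-letter codes
err = bootstrap_error(toy, "KK")
print(err)
```

Resampling proteins rather than individual residues keeps each sequence intact, so dipeptides that cluster in a few proteins produce a larger spread than evenly distributed ones.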

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski