Amino acid dipeptide frequency for Lactococcus phage CHPC781

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
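The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used to build this page: it assumes overlapping dipeptides are counted within each protein (never across protein boundaries), that the counts are normalized by the total number of dipeptides, and that any residue outside the 20 standard ones is treated as Xaa. The names `dipeptide_permille`, `label` and `UNIFORM_PERMILLE` are hypothetical, and one-letter residue codes are used for brevity.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes
ALPHABET = STANDARD + "X"          # 'X' stands for Xaa (assumption of this sketch)

# Expected frequency if all 400 standard dipeptides were equally likely.
UNIFORM_PERMILLE = 1000.0 / 400    # 2.5 per mille, i.e. 0.25%


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Overlapping pairs are counted inside each protein only. Residues that
    are not among the 20 standard ones are mapped to 'X'.
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    pairs = [a + b for a, b in product(ALPHABET, repeat=2)]  # 441 keys
    if total == 0:
        return {p: 0.0 for p in pairs}
    return {p: 1000.0 * counts[p] / total for p in pairs}


def label(freq_permille):
    """Classify a frequency relative to the uniform expectation."""
    if freq_permille > UNIFORM_PERMILLE:
        return "over"
    if freq_permille < UNIFORM_PERMILLE:
        return "under"
    return "expected"


# Toy input (made-up sequences, not taken from this proteome).
freqs = dipeptide_permille(["MKKLA", "MKXA"])
print(len(freqs), label(freqs["MK"]), label(freqs["AA"]))
```

Multiplying any returned value by 10^-3 gives the plain fraction, and the `label` helper mirrors the over/under split described for the table colors, assuming the threshold is the uniform 2.5‰ expectation.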

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.816 ± 0.405
AlaCys: 0.233 ± 0.176
AlaAsp: 3.031 ± 0.589
AlaGlu: 5.13 ± 0.963
AlaPhe: 3.381 ± 0.822
AlaGly: 4.081 ± 0.752
AlaHis: 0.816 ± 0.34
AlaIle: 4.78 ± 0.739
AlaLys: 5.48 ± 0.91
AlaLeu: 7.112 ± 1.062
AlaMet: 1.865 ± 0.574
AlaAsn: 3.731 ± 0.823
AlaPro: 0.933 ± 0.36
AlaGln: 2.215 ± 0.646
AlaArg: 1.749 ± 0.48
AlaSer: 2.915 ± 0.683
AlaThr: 3.381 ± 0.663
AlaVal: 4.197 ± 1.451
AlaTrp: 1.982 ± 0.96
AlaTyr: 1.749 ± 0.403
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.35 ± 0.186
CysCys: 0.117 ± 0.106
CysAsp: 0.35 ± 0.209
CysGlu: 0.583 ± 0.309
CysPhe: 0.7 ± 0.416
CysGly: 0.816 ± 0.314
CysHis: 0.35 ± 0.233
CysIle: 0.233 ± 0.158
CysLys: 1.049 ± 0.435
CysLeu: 0.35 ± 0.202
CysMet: 0.0 ± 0.0
CysAsn: 0.466 ± 0.238
CysPro: 0.233 ± 0.174
CysGln: 0.466 ± 0.209
CysArg: 0.816 ± 0.301
CysSer: 0.233 ± 0.157
CysThr: 0.117 ± 0.106
CysVal: 0.35 ± 0.19
CysTrp: 0.117 ± 0.121
CysTyr: 0.35 ± 0.202
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.982 ± 0.559
AspCys: 0.466 ± 0.296
AspAsp: 3.031 ± 0.709
AspGlu: 4.43 ± 0.856
AspPhe: 3.614 ± 0.538
AspGly: 4.081 ± 0.594
AspHis: 1.049 ± 0.378
AspIle: 4.664 ± 0.621
AspLys: 5.247 ± 0.64
AspLeu: 5.596 ± 0.87
AspMet: 1.049 ± 0.243
AspAsn: 3.847 ± 0.721
AspPro: 1.166 ± 0.404
AspGln: 0.816 ± 0.307
AspArg: 1.982 ± 0.605
AspSer: 2.798 ± 0.567
AspThr: 4.081 ± 0.764
AspVal: 2.682 ± 0.642
AspTrp: 1.049 ± 0.408
AspTyr: 2.798 ± 0.505
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.614 ± 0.633
GluCys: 0.233 ± 0.19
GluAsp: 3.381 ± 0.669
GluGlu: 5.363 ± 0.997
GluPhe: 3.964 ± 0.705
GluGly: 2.565 ± 0.47
GluHis: 0.933 ± 0.36
GluIle: 5.713 ± 0.831
GluLys: 6.412 ± 1.442
GluLeu: 9.327 ± 1.735
GluMet: 2.448 ± 0.593
GluAsn: 5.247 ± 0.805
GluPro: 1.282 ± 0.413
GluGln: 3.498 ± 0.774
GluArg: 2.915 ± 0.717
GluSer: 3.148 ± 0.519
GluThr: 5.247 ± 0.757
GluVal: 4.43 ± 0.703
GluTrp: 0.933 ± 0.302
GluTyr: 3.614 ± 0.76
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.565 ± 0.69
PheCys: 0.233 ± 0.158
PheAsp: 3.614 ± 0.67
PheGlu: 2.682 ± 0.562
PhePhe: 2.099 ± 0.847
PheGly: 2.448 ± 0.643
PheHis: 0.7 ± 0.349
PheIle: 3.614 ± 0.663
PheLys: 4.314 ± 0.795
PheLeu: 2.915 ± 0.546
PheMet: 0.7 ± 0.277
PheAsn: 3.265 ± 0.854
PhePro: 1.049 ± 0.388
PheGln: 1.516 ± 0.461
PheArg: 1.282 ± 0.297
PheSer: 3.731 ± 0.746
PheThr: 2.798 ± 0.579
PheVal: 2.798 ± 0.446
PheTrp: 0.35 ± 0.205
PheTyr: 1.982 ± 0.448
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.614 ± 1.047
GlyCys: 0.233 ± 0.143
GlyAsp: 3.498 ± 0.657
GlyGlu: 3.731 ± 0.541
GlyPhe: 2.215 ± 0.461
GlyGly: 3.731 ± 0.804
GlyHis: 1.166 ± 0.385
GlyIle: 4.664 ± 1.532
GlyLys: 6.063 ± 0.733
GlyLeu: 5.596 ± 1.15
GlyMet: 1.282 ± 0.402
GlyAsn: 3.731 ± 0.617
GlyPro: 0.117 ± 0.098
GlyGln: 2.099 ± 0.495
GlyArg: 2.332 ± 0.465
GlySer: 5.13 ± 1.094
GlyThr: 3.614 ± 0.822
GlyVal: 5.596 ± 1.225
GlyTrp: 1.049 ± 0.35
GlyTyr: 4.43 ± 0.884
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.049 ± 0.353
HisCys: 0.583 ± 0.301
HisAsp: 0.933 ± 0.366
HisGlu: 0.466 ± 0.27
HisPhe: 0.466 ± 0.243
HisGly: 1.516 ± 0.437
HisHis: 0.0 ± 0.0
HisIle: 1.399 ± 0.432
HisLys: 0.816 ± 0.344
HisLeu: 1.166 ± 0.519
HisMet: 0.0 ± 0.0
HisAsn: 1.282 ± 0.386
HisPro: 0.233 ± 0.186
HisGln: 0.233 ± 0.172
HisArg: 0.7 ± 0.34
HisSer: 0.233 ± 0.178
HisThr: 1.166 ± 0.437
HisVal: 0.816 ± 0.301
HisTrp: 0.233 ± 0.3
HisTyr: 0.466 ± 0.217
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.547 ± 0.596
IleCys: 0.35 ± 0.202
IleAsp: 5.247 ± 0.669
IleGlu: 6.412 ± 1.099
IlePhe: 2.798 ± 0.595
IleGly: 3.964 ± 0.932
IleHis: 1.282 ± 0.377
IleIle: 4.897 ± 0.76
IleLys: 6.995 ± 0.958
IleLeu: 4.547 ± 0.865
IleMet: 1.982 ± 0.505
IleAsn: 5.363 ± 0.702
IlePro: 1.399 ± 0.366
IleGln: 2.215 ± 0.517
IleArg: 1.749 ± 0.409
IleSer: 3.614 ± 0.815
IleThr: 5.13 ± 0.722
IleVal: 4.897 ± 0.816
IleTrp: 1.166 ± 0.416
IleTyr: 2.565 ± 0.526
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.412 ± 0.908
LysCys: 0.583 ± 0.262
LysAsp: 5.247 ± 0.556
LysGlu: 7.695 ± 1.598
LysPhe: 2.682 ± 0.575
LysGly: 6.296 ± 1.05
LysHis: 0.583 ± 0.314
LysIle: 5.946 ± 0.917
LysLys: 9.91 ± 1.151
LysLeu: 8.278 ± 1.064
LysMet: 2.798 ± 0.471
LysAsn: 4.78 ± 0.729
LysPro: 1.982 ± 0.683
LysGln: 3.031 ± 0.837
LysArg: 3.498 ± 0.766
LysSer: 5.013 ± 0.708
LysThr: 5.363 ± 0.761
LysVal: 6.646 ± 0.764
LysTrp: 1.166 ± 0.272
LysTyr: 4.081 ± 0.839
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.48 ± 0.603
LeuCys: 0.466 ± 0.225
LeuAsp: 4.197 ± 0.606
LeuGlu: 5.946 ± 0.882
LeuPhe: 3.847 ± 0.691
LeuGly: 5.48 ± 0.675
LeuHis: 1.282 ± 0.391
LeuIle: 7.112 ± 0.999
LeuLys: 7.812 ± 0.903
LeuLeu: 6.296 ± 1.081
LeuMet: 2.215 ± 0.54
LeuAsn: 5.946 ± 0.932
LeuPro: 2.448 ± 0.727
LeuGln: 2.915 ± 0.541
LeuArg: 2.565 ± 0.482
LeuSer: 5.247 ± 0.835
LeuThr: 6.296 ± 0.819
LeuVal: 5.83 ± 0.86
LeuTrp: 1.049 ± 0.327
LeuTyr: 4.197 ± 0.811
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.099 ± 0.52
MetCys: 0.117 ± 0.104
MetAsp: 1.516 ± 0.453
MetGlu: 2.099 ± 0.6
MetPhe: 0.466 ± 0.239
MetGly: 0.816 ± 0.238
MetHis: 0.233 ± 0.155
MetIle: 2.332 ± 0.552
MetLys: 2.798 ± 0.489
MetLeu: 1.166 ± 0.364
MetMet: 0.466 ± 0.231
MetAsn: 2.448 ± 0.584
MetPro: 0.35 ± 0.251
MetGln: 1.516 ± 0.412
MetArg: 0.583 ± 0.206
MetSer: 1.516 ± 0.398
MetThr: 1.399 ± 0.416
MetVal: 1.516 ± 0.426
MetTrp: 0.117 ± 0.12
MetTyr: 1.399 ± 0.416
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.43 ± 1.223
AsnCys: 0.117 ± 0.115
AsnAsp: 3.731 ± 0.717
AsnGlu: 4.314 ± 0.834
AsnPhe: 1.749 ± 0.62
AsnGly: 7.345 ± 0.928
AsnHis: 0.933 ± 0.328
AsnIle: 4.081 ± 0.661
AsnLys: 5.946 ± 1.115
AsnLeu: 6.063 ± 0.8
AsnMet: 1.749 ± 0.472
AsnAsn: 3.731 ± 0.744
AsnPro: 2.099 ± 0.479
AsnGln: 2.565 ± 0.568
AsnArg: 2.215 ± 0.44
AsnSer: 4.081 ± 0.561
AsnThr: 3.847 ± 0.701
AsnVal: 3.847 ± 0.624
AsnTrp: 1.049 ± 0.382
AsnTyr: 2.215 ± 0.604
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.166 ± 0.421
ProCys: 0.233 ± 0.171
ProAsp: 1.516 ± 0.463
ProGlu: 1.865 ± 0.563
ProPhe: 0.816 ± 0.285
ProGly: 0.233 ± 0.149
ProHis: 0.117 ± 0.118
ProIle: 2.099 ± 0.58
ProLys: 1.982 ± 0.461
ProLeu: 1.749 ± 0.355
ProMet: 0.35 ± 0.195
ProAsn: 2.215 ± 0.734
ProPro: 0.35 ± 0.218
ProGln: 0.35 ± 0.173
ProArg: 0.466 ± 0.212
ProSer: 0.933 ± 0.296
ProThr: 2.332 ± 0.443
ProVal: 1.632 ± 0.595
ProTrp: 0.117 ± 0.116
ProTyr: 0.466 ± 0.197
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.148 ± 0.873
GlnCys: 0.466 ± 0.257
GlnAsp: 1.516 ± 0.409
GlnGlu: 2.565 ± 0.514
GlnPhe: 1.166 ± 0.352
GlnGly: 1.982 ± 0.566
GlnHis: 0.35 ± 0.192
GlnIle: 1.282 ± 0.276
GlnLys: 3.031 ± 0.664
GlnLeu: 3.614 ± 0.597
GlnMet: 1.049 ± 0.261
GlnAsn: 2.332 ± 0.413
GlnPro: 1.166 ± 0.349
GlnGln: 1.632 ± 0.461
GlnArg: 1.632 ± 0.503
GlnSer: 2.215 ± 0.587
GlnThr: 2.215 ± 0.429
GlnVal: 1.982 ± 0.51
GlnTrp: 0.583 ± 0.23
GlnTyr: 1.049 ± 0.352
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.798 ± 0.781
ArgCys: 0.583 ± 0.283
ArgAsp: 1.049 ± 0.337
ArgGlu: 2.565 ± 0.518
ArgPhe: 1.166 ± 0.319
ArgGly: 2.332 ± 0.54
ArgHis: 0.933 ± 0.329
ArgIle: 1.865 ± 0.457
ArgLys: 3.731 ± 0.636
ArgLeu: 3.614 ± 0.622
ArgMet: 0.7 ± 0.3
ArgAsn: 2.215 ± 0.561
ArgPro: 0.583 ± 0.256
ArgGln: 1.632 ± 0.383
ArgArg: 2.099 ± 0.55
ArgSer: 1.865 ± 0.542
ArgThr: 1.865 ± 0.392
ArgVal: 2.215 ± 0.572
ArgTrp: 0.35 ± 0.199
ArgTyr: 1.749 ± 0.458
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.664 ± 1.349
SerCys: 0.816 ± 0.321
SerAsp: 4.081 ± 0.693
SerGlu: 3.148 ± 0.465
SerPhe: 3.265 ± 0.691
SerGly: 5.013 ± 1.187
SerHis: 0.7 ± 0.281
SerIle: 4.43 ± 0.934
SerLys: 4.78 ± 0.825
SerLeu: 4.897 ± 0.629
SerMet: 2.215 ± 0.4
SerAsn: 3.498 ± 0.588
SerPro: 1.282 ± 0.486
SerGln: 1.516 ± 0.408
SerArg: 2.332 ± 0.364
SerSer: 4.547 ± 0.981
SerThr: 2.448 ± 0.524
SerVal: 4.081 ± 0.629
SerTrp: 0.933 ± 0.335
SerTyr: 2.099 ± 0.437
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.013 ± 0.722
ThrCys: 0.35 ± 0.217
ThrAsp: 3.031 ± 0.685
ThrGlu: 5.83 ± 0.772
ThrPhe: 2.915 ± 0.759
ThrGly: 3.847 ± 0.813
ThrHis: 0.233 ± 0.148
ThrIle: 4.197 ± 0.771
ThrLys: 4.78 ± 0.595
ThrLeu: 5.713 ± 0.763
ThrMet: 0.933 ± 0.352
ThrAsn: 4.081 ± 0.735
ThrPro: 1.632 ± 0.32
ThrGln: 2.798 ± 0.458
ThrArg: 2.099 ± 0.528
ThrSer: 4.43 ± 0.616
ThrThr: 3.731 ± 0.769
ThrVal: 4.43 ± 0.711
ThrTrp: 0.933 ± 0.405
ThrTyr: 2.448 ± 0.592
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.498 ± 0.538
ValCys: 0.583 ± 0.295
ValAsp: 4.547 ± 0.753
ValGlu: 4.547 ± 0.584
ValPhe: 3.731 ± 0.756
ValGly: 3.148 ± 0.623
ValHis: 0.816 ± 0.321
ValIle: 4.314 ± 0.574
ValLys: 6.646 ± 0.877
ValLeu: 3.498 ± 0.573
ValMet: 1.749 ± 0.441
ValAsn: 2.915 ± 0.569
ValPro: 1.632 ± 0.463
ValGln: 1.865 ± 0.442
ValArg: 3.148 ± 0.706
ValSer: 6.296 ± 1.649
ValThr: 5.48 ± 1.006
ValVal: 3.847 ± 0.925
ValTrp: 0.466 ± 0.215
ValTyr: 2.682 ± 0.472
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.466 ± 0.191
TrpCys: 0.35 ± 0.199
TrpAsp: 1.049 ± 0.618
TrpGlu: 0.583 ± 0.274
TrpPhe: 1.049 ± 0.369
TrpGly: 0.933 ± 0.391
TrpHis: 0.233 ± 0.18
TrpIle: 0.583 ± 0.269
TrpLys: 0.816 ± 0.273
TrpLeu: 1.049 ± 0.34
TrpMet: 0.35 ± 0.196
TrpAsn: 1.282 ± 0.443
TrpPro: 0.233 ± 0.148
TrpGln: 0.7 ± 0.217
TrpArg: 0.583 ± 0.275
TrpSer: 1.282 ± 0.332
TrpThr: 0.583 ± 0.273
TrpVal: 0.933 ± 0.388
TrpTrp: 0.117 ± 0.137
TrpTyr: 0.7 ± 0.264
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.865 ± 0.545
TyrCys: 0.933 ± 0.439
TyrAsp: 1.865 ± 0.663
TyrGlu: 4.314 ± 0.843
TyrPhe: 2.565 ± 0.51
TyrGly: 3.265 ± 0.588
TyrHis: 1.049 ± 0.281
TyrIle: 2.915 ± 0.679
TyrLys: 3.498 ± 0.834
TyrLeu: 4.081 ± 0.917
TyrMet: 0.933 ± 0.495
TyrAsn: 3.731 ± 0.536
TyrPro: 0.816 ± 0.377
TyrGln: 1.399 ± 0.578
TyrArg: 1.166 ± 0.343
TyrSer: 1.516 ± 0.404
TyrThr: 2.332 ± 0.511
TyrVal: 2.682 ± 0.65
TyrTrp: 0.117 ± 0.113
TyrTyr: 2.215 ± 0.483
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 56 proteins (8578 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
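As a rough illustration of what a protein-level bootstrap can look like, the sketch below resamples whole proteins with replacement 100 times and reports the standard deviation of one dipeptide's frequency across the resamples. The exact procedure behind the ± values on this page is not described here, so the use of the standard deviation, the overlapping-pair counting, and the function names `pair_permille` and `bootstrap_error` are all assumptions of this example.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Frequency of one dipeptide (in per mille) over all overlapping pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the dipeptide frequency across bootstrap resamples.

    Each resample draws len(proteins) whole proteins with replacement,
    so the resampling unit is the protein, not the individual dipeptide.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy input (made-up sequences, one-letter codes; not from this proteome).
toy = ["MKKLA", "MKKKA", "MALLA", "AAAA"]
print(pair_permille(toy, "KK"), bootstrap_error(toy, "KK"))
```

Resampling whole proteins keeps the dipeptides of one protein together, so the spread reflects variation between proteins rather than between individual positions.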

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski