Amino acid dipepetide frequency for Streptococcus phage CHPC595

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.413AlaAla: 3.413 ± 0.717
0.195AlaCys: 0.195 ± 0.179
3.706AlaAsp: 3.706 ± 0.426
3.413AlaGlu: 3.413 ± 0.544
2.633AlaPhe: 2.633 ± 0.847
4.193AlaGly: 4.193 ± 0.784
0.78AlaHis: 0.78 ± 0.296
5.656AlaIle: 5.656 ± 0.903
6.632AlaLys: 6.632 ± 1.017
5.851AlaLeu: 5.851 ± 0.668
1.755AlaMet: 1.755 ± 0.48
4.193AlaAsn: 4.193 ± 0.894
1.853AlaPro: 1.853 ± 0.371
2.048AlaGln: 2.048 ± 0.451
2.633AlaArg: 2.633 ± 0.458
4.486AlaSer: 4.486 ± 0.603
3.803AlaThr: 3.803 ± 0.952
3.121AlaVal: 3.121 ± 0.713
0.975AlaTrp: 0.975 ± 0.241
2.633AlaTyr: 2.633 ± 0.541
0.0AlaXaa: 0.0 ± 0.0
Cys
0.195CysAla: 0.195 ± 0.13
0.0CysCys: 0.0 ± 0.0
0.78CysAsp: 0.78 ± 0.376
0.293CysGlu: 0.293 ± 0.161
0.293CysPhe: 0.293 ± 0.258
0.293CysGly: 0.293 ± 0.179
0.195CysHis: 0.195 ± 0.116
0.195CysIle: 0.195 ± 0.157
0.293CysLys: 0.293 ± 0.182
0.585CysLeu: 0.585 ± 0.27
0.098CysMet: 0.098 ± 0.116
0.39CysAsn: 0.39 ± 0.216
0.293CysPro: 0.293 ± 0.217
0.293CysGln: 0.293 ± 0.17
0.488CysArg: 0.488 ± 0.242
0.293CysSer: 0.293 ± 0.27
0.39CysThr: 0.39 ± 0.174
0.195CysVal: 0.195 ± 0.099
0.195CysTrp: 0.195 ± 0.162
0.098CysTyr: 0.098 ± 0.113
0.0CysXaa: 0.0 ± 0.0
Asp
3.413AspAla: 3.413 ± 0.607
0.195AspCys: 0.195 ± 0.149
3.998AspAsp: 3.998 ± 0.689
3.511AspGlu: 3.511 ± 0.582
3.608AspPhe: 3.608 ± 0.584
6.827AspGly: 6.827 ± 1.429
1.17AspHis: 1.17 ± 0.346
4.779AspIle: 4.779 ± 0.707
4.779AspLys: 4.779 ± 0.672
4.096AspLeu: 4.096 ± 0.765
2.438AspMet: 2.438 ± 0.45
3.901AspAsn: 3.901 ± 0.895
2.243AspPro: 2.243 ± 0.485
1.365AspGln: 1.365 ± 0.283
3.316AspArg: 3.316 ± 0.622
3.608AspSer: 3.608 ± 0.459
3.511AspThr: 3.511 ± 0.575
4.486AspVal: 4.486 ± 0.782
0.975AspTrp: 0.975 ± 0.28
2.926AspTyr: 2.926 ± 0.517
0.0AspXaa: 0.0 ± 0.0
Glu
3.706GluAla: 3.706 ± 0.589
0.293GluCys: 0.293 ± 0.148
3.511GluAsp: 3.511 ± 0.718
3.608GluGlu: 3.608 ± 0.811
2.341GluPhe: 2.341 ± 0.528
3.121GluGly: 3.121 ± 0.452
0.975GluHis: 0.975 ± 0.34
5.949GluIle: 5.949 ± 0.778
3.413GluLys: 3.413 ± 0.735
5.949GluLeu: 5.949 ± 0.855
1.755GluMet: 1.755 ± 0.434
4.389GluAsn: 4.389 ± 0.648
2.146GluPro: 2.146 ± 0.628
3.121GluGln: 3.121 ± 0.475
3.121GluArg: 3.121 ± 0.617
3.218GluSer: 3.218 ± 0.542
3.511GluThr: 3.511 ± 0.464
4.779GluVal: 4.779 ± 0.762
1.56GluTrp: 1.56 ± 0.287
3.121GluTyr: 3.121 ± 0.48
0.0GluXaa: 0.0 ± 0.0
Phe
2.633PheAla: 2.633 ± 0.414
0.195PheCys: 0.195 ± 0.153
3.511PheAsp: 3.511 ± 0.518
2.146PheGlu: 2.146 ± 0.567
1.56PhePhe: 1.56 ± 0.381
3.511PheGly: 3.511 ± 0.739
0.585PheHis: 0.585 ± 0.204
2.341PheIle: 2.341 ± 0.521
4.876PheLys: 4.876 ± 0.703
3.023PheLeu: 3.023 ± 0.555
0.488PheMet: 0.488 ± 0.171
2.926PheAsn: 2.926 ± 0.619
0.488PhePro: 0.488 ± 0.217
1.073PheGln: 1.073 ± 0.282
1.56PheArg: 1.56 ± 0.319
3.023PheSer: 3.023 ± 0.582
2.731PheThr: 2.731 ± 0.605
2.731PheVal: 2.731 ± 0.502
0.78PheTrp: 0.78 ± 0.262
1.56PheTyr: 1.56 ± 0.447
0.0PheXaa: 0.0 ± 0.0
Gly
2.731GlyAla: 2.731 ± 0.662
0.39GlyCys: 0.39 ± 0.196
4.681GlyAsp: 4.681 ± 0.637
4.193GlyGlu: 4.193 ± 0.585
3.023GlyPhe: 3.023 ± 0.508
4.681GlyGly: 4.681 ± 0.864
0.683GlyHis: 0.683 ± 0.271
6.241GlyIle: 6.241 ± 0.946
7.119GlyLys: 7.119 ± 1.026
6.729GlyLeu: 6.729 ± 0.753
1.658GlyMet: 1.658 ± 0.433
3.998GlyAsn: 3.998 ± 0.535
1.755GlyPro: 1.755 ± 0.804
2.828GlyGln: 2.828 ± 0.652
3.413GlyArg: 3.413 ± 0.58
4.389GlySer: 4.389 ± 0.692
4.486GlyThr: 4.486 ± 0.812
3.608GlyVal: 3.608 ± 0.706
1.365GlyTrp: 1.365 ± 0.384
3.121GlyTyr: 3.121 ± 0.46
0.0GlyXaa: 0.0 ± 0.0
His
0.293HisAla: 0.293 ± 0.154
0.195HisCys: 0.195 ± 0.133
1.073HisAsp: 1.073 ± 0.292
0.683HisGlu: 0.683 ± 0.281
0.585HisPhe: 0.585 ± 0.245
1.073HisGly: 1.073 ± 0.335
0.39HisHis: 0.39 ± 0.182
1.17HisIle: 1.17 ± 0.339
0.78HisLys: 0.78 ± 0.207
1.268HisLeu: 1.268 ± 0.284
0.39HisMet: 0.39 ± 0.183
0.878HisAsn: 0.878 ± 0.311
0.683HisPro: 0.683 ± 0.238
0.683HisGln: 0.683 ± 0.255
0.878HisArg: 0.878 ± 0.283
0.975HisSer: 0.975 ± 0.25
0.488HisThr: 0.488 ± 0.253
1.365HisVal: 1.365 ± 0.278
0.0HisTrp: 0.0 ± 0.0
0.878HisTyr: 0.878 ± 0.384
0.0HisXaa: 0.0 ± 0.0
Ile
5.559IleAla: 5.559 ± 0.901
0.585IleCys: 0.585 ± 0.269
5.461IleAsp: 5.461 ± 0.816
4.876IleGlu: 4.876 ± 0.575
1.658IlePhe: 1.658 ± 0.447
4.193IleGly: 4.193 ± 0.732
0.975IleHis: 0.975 ± 0.204
3.413IleIle: 3.413 ± 0.659
6.827IleLys: 6.827 ± 0.689
3.901IleLeu: 3.901 ± 0.794
2.243IleMet: 2.243 ± 0.601
4.193IleAsn: 4.193 ± 0.569
3.706IlePro: 3.706 ± 0.526
2.731IleGln: 2.731 ± 0.421
2.828IleArg: 2.828 ± 0.539
4.876IleSer: 4.876 ± 0.688
3.901IleThr: 3.901 ± 0.699
3.023IleVal: 3.023 ± 0.629
0.878IleTrp: 0.878 ± 0.28
2.243IleTyr: 2.243 ± 0.481
0.0IleXaa: 0.0 ± 0.0
Lys
6.339LysAla: 6.339 ± 0.506
0.293LysCys: 0.293 ± 0.19
5.071LysAsp: 5.071 ± 0.905
6.729LysGlu: 6.729 ± 0.767
3.803LysPhe: 3.803 ± 0.699
6.437LysGly: 6.437 ± 0.765
1.365LysHis: 1.365 ± 0.497
5.461LysIle: 5.461 ± 0.901
6.924LysLys: 6.924 ± 1.211
6.339LysLeu: 6.339 ± 0.895
1.853LysMet: 1.853 ± 0.483
5.461LysAsn: 5.461 ± 0.598
2.731LysPro: 2.731 ± 0.535
3.998LysGln: 3.998 ± 0.587
3.511LysArg: 3.511 ± 0.717
4.193LysSer: 4.193 ± 0.497
5.364LysThr: 5.364 ± 0.816
4.193LysVal: 4.193 ± 0.63
1.365LysTrp: 1.365 ± 0.297
2.926LysTyr: 2.926 ± 0.616
0.0LysXaa: 0.0 ± 0.0
Leu
6.534LeuAla: 6.534 ± 0.814
0.683LeuCys: 0.683 ± 0.265
4.681LeuAsp: 4.681 ± 0.595
6.339LeuGlu: 6.339 ± 0.979
2.828LeuPhe: 2.828 ± 0.458
5.949LeuGly: 5.949 ± 0.978
0.878LeuHis: 0.878 ± 0.28
3.901LeuIle: 3.901 ± 0.535
6.632LeuLys: 6.632 ± 0.733
4.779LeuLeu: 4.779 ± 0.8
2.341LeuMet: 2.341 ± 0.454
5.266LeuAsn: 5.266 ± 0.734
2.926LeuPro: 2.926 ± 0.509
3.023LeuGln: 3.023 ± 0.548
3.218LeuArg: 3.218 ± 0.764
4.876LeuSer: 4.876 ± 0.782
5.169LeuThr: 5.169 ± 0.686
4.193LeuVal: 4.193 ± 0.612
0.683LeuTrp: 0.683 ± 0.294
2.048LeuTyr: 2.048 ± 0.441
0.0LeuXaa: 0.0 ± 0.0
Met
1.658MetAla: 1.658 ± 0.335
0.0MetCys: 0.0 ± 0.0
0.878MetAsp: 0.878 ± 0.265
1.365MetGlu: 1.365 ± 0.327
1.268MetPhe: 1.268 ± 0.392
1.268MetGly: 1.268 ± 0.294
0.293MetHis: 0.293 ± 0.187
1.95MetIle: 1.95 ± 0.424
3.023MetLys: 3.023 ± 0.634
1.95MetLeu: 1.95 ± 0.355
0.293MetMet: 0.293 ± 0.288
1.17MetAsn: 1.17 ± 0.341
1.073MetPro: 1.073 ± 0.288
0.878MetGln: 0.878 ± 0.268
1.073MetArg: 1.073 ± 0.287
1.56MetSer: 1.56 ± 0.651
1.56MetThr: 1.56 ± 0.338
2.048MetVal: 2.048 ± 0.453
0.098MetTrp: 0.098 ± 0.076
1.073MetTyr: 1.073 ± 0.36
0.0MetXaa: 0.0 ± 0.0
Asn
5.266AsnAla: 5.266 ± 1.1
0.39AsnCys: 0.39 ± 0.19
3.803AsnAsp: 3.803 ± 0.579
3.803AsnGlu: 3.803 ± 0.792
2.828AsnPhe: 2.828 ± 0.493
6.827AsnGly: 6.827 ± 1.187
0.975AsnHis: 0.975 ± 0.328
3.413AsnIle: 3.413 ± 0.607
3.998AsnLys: 3.998 ± 0.459
4.486AsnLeu: 4.486 ± 0.639
1.17AsnMet: 1.17 ± 0.325
3.998AsnAsn: 3.998 ± 0.626
3.413AsnPro: 3.413 ± 0.649
2.243AsnGln: 2.243 ± 0.376
2.146AsnArg: 2.146 ± 0.59
3.511AsnSer: 3.511 ± 0.532
3.901AsnThr: 3.901 ± 0.53
3.706AsnVal: 3.706 ± 0.483
1.268AsnTrp: 1.268 ± 0.319
2.341AsnTyr: 2.341 ± 0.516
0.0AsnXaa: 0.0 ± 0.0
Pro
1.853ProAla: 1.853 ± 0.35
0.0ProCys: 0.0 ± 0.0
1.658ProAsp: 1.658 ± 0.461
2.633ProGlu: 2.633 ± 0.495
1.365ProPhe: 1.365 ± 0.264
1.17ProGly: 1.17 ± 0.384
0.39ProHis: 0.39 ± 0.174
1.95ProIle: 1.95 ± 0.429
3.803ProLys: 3.803 ± 0.658
2.828ProLeu: 2.828 ± 0.506
0.293ProMet: 0.293 ± 0.171
2.633ProAsn: 2.633 ± 0.423
0.975ProPro: 0.975 ± 0.378
1.755ProGln: 1.755 ± 0.375
0.683ProArg: 0.683 ± 0.325
2.438ProSer: 2.438 ± 0.441
2.828ProThr: 2.828 ± 0.501
1.658ProVal: 1.658 ± 0.416
0.585ProTrp: 0.585 ± 0.212
0.878ProTyr: 0.878 ± 0.29
0.0ProXaa: 0.0 ± 0.0
Gln
3.121GlnAla: 3.121 ± 0.709
0.195GlnCys: 0.195 ± 0.147
1.853GlnAsp: 1.853 ± 0.378
2.731GlnGlu: 2.731 ± 0.49
1.463GlnPhe: 1.463 ± 0.362
3.608GlnGly: 3.608 ± 0.917
0.488GlnHis: 0.488 ± 0.206
2.438GlnIle: 2.438 ± 0.713
3.218GlnLys: 3.218 ± 0.453
2.926GlnLeu: 2.926 ± 0.462
1.463GlnMet: 1.463 ± 0.321
3.023GlnAsn: 3.023 ± 0.439
0.39GlnPro: 0.39 ± 0.162
2.536GlnGln: 2.536 ± 0.515
1.853GlnArg: 1.853 ± 0.428
2.341GlnSer: 2.341 ± 0.528
2.536GlnThr: 2.536 ± 0.464
2.341GlnVal: 2.341 ± 0.559
0.488GlnTrp: 0.488 ± 0.222
2.536GlnTyr: 2.536 ± 0.571
0.0GlnXaa: 0.0 ± 0.0
Arg
1.755ArgAla: 1.755 ± 0.394
0.195ArgCys: 0.195 ± 0.132
2.828ArgAsp: 2.828 ± 0.425
2.341ArgGlu: 2.341 ± 0.666
2.438ArgPhe: 2.438 ± 0.515
2.926ArgGly: 2.926 ± 0.563
0.78ArgHis: 0.78 ± 0.336
2.926ArgIle: 2.926 ± 0.568
3.121ArgLys: 3.121 ± 0.594
3.218ArgLeu: 3.218 ± 0.716
1.463ArgMet: 1.463 ± 0.368
2.828ArgAsn: 2.828 ± 0.445
1.268ArgPro: 1.268 ± 0.262
2.438ArgGln: 2.438 ± 0.458
1.463ArgArg: 1.463 ± 0.402
1.463ArgSer: 1.463 ± 0.388
2.731ArgThr: 2.731 ± 0.719
2.633ArgVal: 2.633 ± 0.418
1.365ArgTrp: 1.365 ± 0.331
2.048ArgTyr: 2.048 ± 0.486
0.0ArgXaa: 0.0 ± 0.0
Ser
3.413SerAla: 3.413 ± 0.451
0.488SerCys: 0.488 ± 0.236
3.998SerAsp: 3.998 ± 0.598
4.193SerGlu: 4.193 ± 0.537
2.731SerPhe: 2.731 ± 0.525
4.389SerGly: 4.389 ± 0.521
0.488SerHis: 0.488 ± 0.15
4.681SerIle: 4.681 ± 0.691
5.461SerLys: 5.461 ± 1.014
3.998SerLeu: 3.998 ± 0.574
1.853SerMet: 1.853 ± 0.346
4.096SerAsn: 4.096 ± 0.566
1.755SerPro: 1.755 ± 0.296
3.023SerGln: 3.023 ± 0.674
2.536SerArg: 2.536 ± 0.652
4.096SerSer: 4.096 ± 0.564
3.803SerThr: 3.803 ± 0.718
5.461SerVal: 5.461 ± 0.827
0.585SerTrp: 0.585 ± 0.298
1.463SerTyr: 1.463 ± 0.416
0.0SerXaa: 0.0 ± 0.0
Thr
4.389ThrAla: 4.389 ± 0.764
0.39ThrCys: 0.39 ± 0.189
4.486ThrAsp: 4.486 ± 0.653
3.608ThrGlu: 3.608 ± 0.529
2.731ThrPhe: 2.731 ± 0.563
3.998ThrGly: 3.998 ± 0.588
0.975ThrHis: 0.975 ± 0.231
4.389ThrIle: 4.389 ± 0.731
5.071ThrLys: 5.071 ± 0.696
6.144ThrLeu: 6.144 ± 0.849
0.683ThrMet: 0.683 ± 0.209
3.901ThrAsn: 3.901 ± 0.644
1.365ThrPro: 1.365 ± 0.452
2.438ThrGln: 2.438 ± 0.437
1.95ThrArg: 1.95 ± 0.401
3.901ThrSer: 3.901 ± 0.563
2.926ThrThr: 2.926 ± 0.501
3.608ThrVal: 3.608 ± 0.632
0.878ThrTrp: 0.878 ± 0.296
3.511ThrTyr: 3.511 ± 0.703
0.0ThrXaa: 0.0 ± 0.0
Val
4.291ValAla: 4.291 ± 0.627
0.39ValCys: 0.39 ± 0.188
4.876ValAsp: 4.876 ± 0.629
3.803ValGlu: 3.803 ± 0.658
2.341ValPhe: 2.341 ± 0.454
4.389ValGly: 4.389 ± 0.547
0.878ValHis: 0.878 ± 0.268
3.901ValIle: 3.901 ± 0.479
4.681ValLys: 4.681 ± 0.721
3.706ValLeu: 3.706 ± 0.685
1.268ValMet: 1.268 ± 0.434
3.511ValAsn: 3.511 ± 0.633
1.95ValPro: 1.95 ± 0.453
1.658ValGln: 1.658 ± 0.409
2.048ValArg: 2.048 ± 0.434
5.266ValSer: 5.266 ± 0.782
4.486ValThr: 4.486 ± 0.82
3.511ValVal: 3.511 ± 0.566
1.463ValTrp: 1.463 ± 0.312
1.95ValTyr: 1.95 ± 0.357
0.0ValXaa: 0.0 ± 0.0
Trp
0.488TrpAla: 0.488 ± 0.199
0.195TrpCys: 0.195 ± 0.131
1.268TrpAsp: 1.268 ± 0.349
1.17TrpGlu: 1.17 ± 0.292
0.878TrpPhe: 0.878 ± 0.258
0.488TrpGly: 0.488 ± 0.217
0.39TrpHis: 0.39 ± 0.186
0.683TrpIle: 0.683 ± 0.2
1.073TrpLys: 1.073 ± 0.343
1.56TrpLeu: 1.56 ± 0.4
0.098TrpMet: 0.098 ± 0.089
1.268TrpAsn: 1.268 ± 0.375
0.195TrpPro: 0.195 ± 0.133
0.975TrpGln: 0.975 ± 0.376
1.073TrpArg: 1.073 ± 0.285
1.56TrpSer: 1.56 ± 0.461
1.073TrpThr: 1.073 ± 0.35
1.17TrpVal: 1.17 ± 0.292
0.293TrpTrp: 0.293 ± 0.163
0.098TrpTyr: 0.098 ± 0.086
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.926TyrAla: 2.926 ± 0.484
0.585TyrCys: 0.585 ± 0.247
3.023TyrAsp: 3.023 ± 0.57
2.243TyrGlu: 2.243 ± 0.44
1.365TyrPhe: 1.365 ± 0.323
1.755TyrGly: 1.755 ± 0.39
1.073TyrHis: 1.073 ± 0.428
2.633TyrIle: 2.633 ± 0.45
2.633TyrLys: 2.633 ± 0.506
3.608TyrLeu: 3.608 ± 0.446
0.78TyrMet: 0.78 ± 0.261
1.463TyrAsn: 1.463 ± 0.393
1.17TyrPro: 1.17 ± 0.386
2.438TyrGln: 2.438 ± 0.411
2.438TyrArg: 2.438 ± 0.599
2.438TyrSer: 2.438 ± 0.539
2.146TyrThr: 2.146 ± 0.494
2.536TyrVal: 2.536 ± 0.516
0.195TyrTrp: 0.195 ± 0.116
2.731TyrTyr: 2.731 ± 0.699
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 46 proteins (10255 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski