Amino acid dipepetide frequency for Streptococcus phage Javan249

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.83AlaAla: 3.83 ± 0.887
0.16AlaCys: 0.16 ± 0.116
4.388AlaAsp: 4.388 ± 0.622
5.266AlaGlu: 5.266 ± 0.698
2.553AlaPhe: 2.553 ± 0.831
3.67AlaGly: 3.67 ± 0.848
1.197AlaHis: 1.197 ± 0.41
7.101AlaIle: 7.101 ± 0.875
6.782AlaLys: 6.782 ± 0.64
6.064AlaLeu: 6.064 ± 1.343
2.792AlaMet: 2.792 ± 0.649
4.947AlaAsn: 4.947 ± 0.954
1.915AlaPro: 1.915 ± 0.457
5.186AlaGln: 5.186 ± 0.882
2.952AlaArg: 2.952 ± 0.631
4.228AlaSer: 4.228 ± 1.136
5.106AlaThr: 5.106 ± 0.831
4.388AlaVal: 4.388 ± 0.615
1.117AlaTrp: 1.117 ± 0.29
2.314AlaTyr: 2.314 ± 0.511
0.0AlaXaa: 0.0 ± 0.0
Cys
0.16CysAla: 0.16 ± 0.118
0.08CysCys: 0.08 ± 0.087
0.239CysAsp: 0.239 ± 0.141
0.16CysGlu: 0.16 ± 0.108
0.239CysPhe: 0.239 ± 0.131
0.479CysGly: 0.479 ± 0.196
0.319CysHis: 0.319 ± 0.187
0.16CysIle: 0.16 ± 0.118
0.558CysLys: 0.558 ± 0.25
0.16CysLeu: 0.16 ± 0.118
0.239CysMet: 0.239 ± 0.136
0.319CysAsn: 0.319 ± 0.143
0.319CysPro: 0.319 ± 0.186
0.08CysGln: 0.08 ± 0.089
0.479CysArg: 0.479 ± 0.212
0.239CysSer: 0.239 ± 0.124
0.0CysThr: 0.0 ± 0.0
0.239CysVal: 0.239 ± 0.143
0.0CysTrp: 0.0 ± 0.0
0.16CysTyr: 0.16 ± 0.109
0.0CysXaa: 0.0 ± 0.0
Asp
4.468AspAla: 4.468 ± 1.045
0.638AspCys: 0.638 ± 0.232
3.83AspAsp: 3.83 ± 0.657
4.867AspGlu: 4.867 ± 0.785
2.792AspPhe: 2.792 ± 0.432
4.947AspGly: 4.947 ± 0.684
0.479AspHis: 0.479 ± 0.16
3.271AspIle: 3.271 ± 0.501
6.143AspLys: 6.143 ± 0.887
5.186AspLeu: 5.186 ± 0.721
1.117AspMet: 1.117 ± 0.331
4.149AspAsn: 4.149 ± 0.684
1.835AspPro: 1.835 ± 0.461
1.835AspGln: 1.835 ± 0.516
2.314AspArg: 2.314 ± 0.566
3.431AspSer: 3.431 ± 0.483
3.75AspThr: 3.75 ± 0.467
2.553AspVal: 2.553 ± 0.555
1.037AspTrp: 1.037 ± 0.308
3.351AspTyr: 3.351 ± 0.6
0.0AspXaa: 0.0 ± 0.0
Glu
4.468GluAla: 4.468 ± 0.53
0.239GluCys: 0.239 ± 0.119
3.83GluAsp: 3.83 ± 0.641
5.106GluGlu: 5.106 ± 0.94
2.154GluPhe: 2.154 ± 0.392
2.713GluGly: 2.713 ± 0.453
0.878GluHis: 0.878 ± 0.243
3.83GluIle: 3.83 ± 0.426
5.505GluLys: 5.505 ± 0.939
8.776GluLeu: 8.776 ± 1.469
1.915GluMet: 1.915 ± 0.574
3.75GluAsn: 3.75 ± 0.582
1.436GluPro: 1.436 ± 0.245
3.271GluGln: 3.271 ± 0.67
4.069GluArg: 4.069 ± 0.835
3.59GluSer: 3.59 ± 0.534
3.989GluThr: 3.989 ± 0.766
4.149GluVal: 4.149 ± 0.547
0.957GluTrp: 0.957 ± 0.316
2.792GluTyr: 2.792 ± 0.486
0.0GluXaa: 0.0 ± 0.0
Phe
3.112PheAla: 3.112 ± 0.481
0.08PheCys: 0.08 ± 0.077
2.872PheAsp: 2.872 ± 0.514
3.032PheGlu: 3.032 ± 0.481
1.436PhePhe: 1.436 ± 0.299
3.351PheGly: 3.351 ± 0.492
0.239PheHis: 0.239 ± 0.124
2.314PheIle: 2.314 ± 0.43
3.51PheLys: 3.51 ± 0.559
2.553PheLeu: 2.553 ± 0.465
1.037PheMet: 1.037 ± 0.299
3.032PheAsn: 3.032 ± 0.49
0.957PhePro: 0.957 ± 0.269
0.957PheGln: 0.957 ± 0.234
1.197PheArg: 1.197 ± 0.288
2.393PheSer: 2.393 ± 0.437
2.393PheThr: 2.393 ± 0.369
2.074PheVal: 2.074 ± 0.368
0.479PheTrp: 0.479 ± 0.232
1.037PheTyr: 1.037 ± 0.27
0.0PheXaa: 0.0 ± 0.0
Gly
4.308GlyAla: 4.308 ± 0.904
0.239GlyCys: 0.239 ± 0.117
3.909GlyAsp: 3.909 ± 0.965
2.952GlyGlu: 2.952 ± 0.461
3.112GlyPhe: 3.112 ± 0.74
3.191GlyGly: 3.191 ± 0.673
1.117GlyHis: 1.117 ± 0.402
3.989GlyIle: 3.989 ± 0.902
5.824GlyLys: 5.824 ± 0.722
6.223GlyLeu: 6.223 ± 1.234
1.516GlyMet: 1.516 ± 0.357
3.431GlyAsn: 3.431 ± 0.466
1.037GlyPro: 1.037 ± 0.276
2.393GlyGln: 2.393 ± 0.68
2.713GlyArg: 2.713 ± 0.453
3.909GlySer: 3.909 ± 0.701
4.149GlyThr: 4.149 ± 0.725
4.548GlyVal: 4.548 ± 0.821
0.798GlyTrp: 0.798 ± 0.311
2.872GlyTyr: 2.872 ± 0.35
0.0GlyXaa: 0.0 ± 0.0
His
1.037HisAla: 1.037 ± 0.354
0.08HisCys: 0.08 ± 0.089
0.638HisAsp: 0.638 ± 0.227
0.798HisGlu: 0.798 ± 0.285
0.878HisPhe: 0.878 ± 0.299
0.479HisGly: 0.479 ± 0.239
0.479HisHis: 0.479 ± 0.212
1.277HisIle: 1.277 ± 0.397
1.277HisLys: 1.277 ± 0.328
1.356HisLeu: 1.356 ± 0.347
0.08HisMet: 0.08 ± 0.086
0.479HisAsn: 0.479 ± 0.151
0.558HisPro: 0.558 ± 0.255
0.399HisGln: 0.399 ± 0.176
0.638HisArg: 0.638 ± 0.253
0.878HisSer: 0.878 ± 0.273
1.117HisThr: 1.117 ± 0.33
0.399HisVal: 0.399 ± 0.136
0.08HisTrp: 0.08 ± 0.074
0.878HisTyr: 0.878 ± 0.267
0.0HisXaa: 0.0 ± 0.0
Ile
6.064IleAla: 6.064 ± 0.706
0.479IleCys: 0.479 ± 0.331
5.984IleAsp: 5.984 ± 0.579
5.345IleGlu: 5.345 ± 0.794
2.234IlePhe: 2.234 ± 0.384
4.627IleGly: 4.627 ± 0.657
0.558IleHis: 0.558 ± 0.206
3.351IleIle: 3.351 ± 0.662
4.707IleLys: 4.707 ± 0.72
4.069IleLeu: 4.069 ± 0.685
1.037IleMet: 1.037 ± 0.233
3.112IleAsn: 3.112 ± 0.483
2.314IlePro: 2.314 ± 0.375
3.112IleGln: 3.112 ± 0.596
1.995IleArg: 1.995 ± 0.301
5.505IleSer: 5.505 ± 0.896
4.308IleThr: 4.308 ± 1.109
4.308IleVal: 4.308 ± 0.601
0.558IleTrp: 0.558 ± 0.256
2.952IleTyr: 2.952 ± 0.578
0.0IleXaa: 0.0 ± 0.0
Lys
6.064LysAla: 6.064 ± 0.693
0.319LysCys: 0.319 ± 0.169
3.83LysAsp: 3.83 ± 0.574
6.383LysGlu: 6.383 ± 0.877
2.393LysPhe: 2.393 ± 0.399
4.867LysGly: 4.867 ± 0.628
1.516LysHis: 1.516 ± 0.395
6.941LysIle: 6.941 ± 0.835
5.585LysLys: 5.585 ± 0.945
6.782LysLeu: 6.782 ± 0.664
2.872LysMet: 2.872 ± 0.697
5.266LysAsn: 5.266 ± 0.757
1.915LysPro: 1.915 ± 0.497
2.872LysGln: 2.872 ± 0.558
3.909LysArg: 3.909 ± 0.645
6.223LysSer: 6.223 ± 0.677
5.425LysThr: 5.425 ± 0.654
5.026LysVal: 5.026 ± 0.971
0.798LysTrp: 0.798 ± 0.226
2.314LysTyr: 2.314 ± 0.423
0.0LysXaa: 0.0 ± 0.0
Leu
6.542LeuAla: 6.542 ± 0.736
0.319LeuCys: 0.319 ± 0.19
6.303LeuAsp: 6.303 ± 0.695
5.904LeuGlu: 5.904 ± 1.036
2.393LeuPhe: 2.393 ± 0.482
5.744LeuGly: 5.744 ± 1.075
0.878LeuHis: 0.878 ± 0.253
4.707LeuIle: 4.707 ± 0.692
5.186LeuLys: 5.186 ± 0.773
7.34LeuLeu: 7.34 ± 0.866
2.234LeuMet: 2.234 ± 0.436
5.505LeuAsn: 5.505 ± 0.79
3.909LeuPro: 3.909 ± 0.557
3.351LeuGln: 3.351 ± 0.583
2.952LeuArg: 2.952 ± 0.465
7.978LeuSer: 7.978 ± 1.293
5.984LeuThr: 5.984 ± 0.607
4.707LeuVal: 4.707 ± 0.397
0.558LeuTrp: 0.558 ± 0.238
1.835LeuTyr: 1.835 ± 0.404
0.0LeuXaa: 0.0 ± 0.0
Met
3.351MetAla: 3.351 ± 1.147
0.0MetCys: 0.0 ± 0.0
1.516MetAsp: 1.516 ± 0.411
1.596MetGlu: 1.596 ± 0.454
0.399MetPhe: 0.399 ± 0.2
1.197MetGly: 1.197 ± 0.328
0.399MetHis: 0.399 ± 0.177
1.277MetIle: 1.277 ± 0.277
1.835MetLys: 1.835 ± 0.423
1.915MetLeu: 1.915 ± 0.42
0.479MetMet: 0.479 ± 0.223
1.596MetAsn: 1.596 ± 0.547
0.558MetPro: 0.558 ± 0.229
1.117MetGln: 1.117 ± 0.314
1.277MetArg: 1.277 ± 0.358
1.995MetSer: 1.995 ± 0.611
2.074MetThr: 2.074 ± 0.394
1.356MetVal: 1.356 ± 0.293
0.319MetTrp: 0.319 ± 0.173
0.479MetTyr: 0.479 ± 0.197
0.0MetXaa: 0.0 ± 0.0
Asn
4.867AsnAla: 4.867 ± 0.767
0.16AsnCys: 0.16 ± 0.1
2.792AsnAsp: 2.792 ± 0.494
3.59AsnGlu: 3.59 ± 0.602
2.234AsnPhe: 2.234 ± 0.446
5.345AsnGly: 5.345 ± 0.606
0.798AsnHis: 0.798 ± 0.237
3.989AsnIle: 3.989 ± 0.561
4.468AsnLys: 4.468 ± 0.785
5.585AsnLeu: 5.585 ± 0.648
1.596AsnMet: 1.596 ± 0.413
3.989AsnAsn: 3.989 ± 0.553
1.995AsnPro: 1.995 ± 0.42
2.792AsnGln: 2.792 ± 0.525
2.633AsnArg: 2.633 ± 0.434
4.388AsnSer: 4.388 ± 0.679
2.393AsnThr: 2.393 ± 0.393
3.75AsnVal: 3.75 ± 0.429
1.037AsnTrp: 1.037 ± 0.341
1.915AsnTyr: 1.915 ± 0.403
0.0AsnXaa: 0.0 ± 0.0
Pro
2.553ProAla: 2.553 ± 0.689
0.08ProCys: 0.08 ± 0.089
2.154ProAsp: 2.154 ± 0.478
1.516ProGlu: 1.516 ± 0.34
1.037ProPhe: 1.037 ± 0.308
0.718ProGly: 0.718 ± 0.189
0.16ProHis: 0.16 ± 0.093
1.755ProIle: 1.755 ± 0.368
3.032ProLys: 3.032 ± 0.536
1.755ProLeu: 1.755 ± 0.451
0.319ProMet: 0.319 ± 0.158
1.675ProAsn: 1.675 ± 0.382
0.479ProPro: 0.479 ± 0.22
1.277ProGln: 1.277 ± 0.485
0.878ProArg: 0.878 ± 0.259
1.835ProSer: 1.835 ± 0.422
1.915ProThr: 1.915 ± 0.457
1.995ProVal: 1.995 ± 0.461
0.16ProTrp: 0.16 ± 0.113
1.516ProTyr: 1.516 ± 0.458
0.0ProXaa: 0.0 ± 0.0
Gln
3.909GlnAla: 3.909 ± 0.746
0.399GlnCys: 0.399 ± 0.176
1.675GlnAsp: 1.675 ± 0.34
2.872GlnGlu: 2.872 ± 0.592
1.995GlnPhe: 1.995 ± 0.346
2.792GlnGly: 2.792 ± 0.349
0.718GlnHis: 0.718 ± 0.292
2.633GlnIle: 2.633 ± 0.413
3.75GlnLys: 3.75 ± 0.567
4.548GlnLeu: 4.548 ± 0.796
1.277GlnMet: 1.277 ± 0.336
2.234GlnAsn: 2.234 ± 0.491
1.277GlnPro: 1.277 ± 0.516
2.393GlnGln: 2.393 ± 0.665
1.755GlnArg: 1.755 ± 0.405
2.952GlnSer: 2.952 ± 0.673
2.872GlnThr: 2.872 ± 0.625
1.436GlnVal: 1.436 ± 0.321
0.319GlnTrp: 0.319 ± 0.172
1.516GlnTyr: 1.516 ± 0.379
0.0GlnXaa: 0.0 ± 0.0
Arg
2.154ArgAla: 2.154 ± 0.387
0.399ArgCys: 0.399 ± 0.228
1.596ArgAsp: 1.596 ± 0.347
2.074ArgGlu: 2.074 ± 0.437
1.835ArgPhe: 1.835 ± 0.353
1.995ArgGly: 1.995 ± 0.48
0.798ArgHis: 0.798 ± 0.229
3.51ArgIle: 3.51 ± 0.579
3.67ArgLys: 3.67 ± 0.713
5.106ArgLeu: 5.106 ± 0.792
1.197ArgMet: 1.197 ± 0.3
3.032ArgAsn: 3.032 ± 0.456
0.798ArgPro: 0.798 ± 0.358
1.197ArgGln: 1.197 ± 0.346
1.995ArgArg: 1.995 ± 0.415
2.393ArgSer: 2.393 ± 0.433
1.835ArgThr: 1.835 ± 0.358
3.67ArgVal: 3.67 ± 0.68
0.479ArgTrp: 0.479 ± 0.25
2.314ArgTyr: 2.314 ± 0.573
0.0ArgXaa: 0.0 ± 0.0
Ser
6.383SerAla: 6.383 ± 2.229
0.16SerCys: 0.16 ± 0.118
4.468SerAsp: 4.468 ± 0.695
4.867SerGlu: 4.867 ± 0.569
3.112SerPhe: 3.112 ± 0.673
4.468SerGly: 4.468 ± 1.17
0.957SerHis: 0.957 ± 0.313
4.627SerIle: 4.627 ± 0.76
5.744SerLys: 5.744 ± 0.743
4.787SerLeu: 4.787 ± 0.834
1.436SerMet: 1.436 ± 0.389
3.83SerAsn: 3.83 ± 0.652
1.117SerPro: 1.117 ± 0.253
3.271SerGln: 3.271 ± 0.545
2.792SerArg: 2.792 ± 0.325
6.702SerSer: 6.702 ± 1.555
5.744SerThr: 5.744 ± 0.755
4.548SerVal: 4.548 ± 0.709
0.798SerTrp: 0.798 ± 0.195
2.234SerTyr: 2.234 ± 0.428
0.0SerXaa: 0.0 ± 0.0
Thr
4.867ThrAla: 4.867 ± 0.999
0.319ThrCys: 0.319 ± 0.129
4.228ThrAsp: 4.228 ± 0.706
3.67ThrGlu: 3.67 ± 0.675
2.473ThrPhe: 2.473 ± 0.438
5.266ThrGly: 5.266 ± 0.696
0.479ThrHis: 0.479 ± 0.218
4.947ThrIle: 4.947 ± 0.463
3.989ThrLys: 3.989 ± 0.588
4.867ThrLeu: 4.867 ± 0.51
1.516ThrMet: 1.516 ± 0.384
3.112ThrAsn: 3.112 ± 0.524
1.516ThrPro: 1.516 ± 0.35
3.351ThrGln: 3.351 ± 0.716
2.234ThrArg: 2.234 ± 0.477
4.787ThrSer: 4.787 ± 0.941
5.744ThrThr: 5.744 ± 0.897
5.026ThrVal: 5.026 ± 0.745
0.718ThrTrp: 0.718 ± 0.224
2.234ThrTyr: 2.234 ± 0.483
0.0ThrXaa: 0.0 ± 0.0
Val
4.707ValAla: 4.707 ± 0.648
0.16ValCys: 0.16 ± 0.131
3.51ValAsp: 3.51 ± 0.484
4.228ValGlu: 4.228 ± 0.877
2.553ValPhe: 2.553 ± 0.576
3.431ValGly: 3.431 ± 0.507
0.878ValHis: 0.878 ± 0.209
3.59ValIle: 3.59 ± 0.64
6.542ValLys: 6.542 ± 0.643
4.228ValLeu: 4.228 ± 0.769
1.197ValMet: 1.197 ± 0.323
3.51ValAsn: 3.51 ± 0.456
1.596ValPro: 1.596 ± 0.392
1.835ValGln: 1.835 ± 0.501
2.393ValArg: 2.393 ± 0.399
5.824ValSer: 5.824 ± 0.708
3.75ValThr: 3.75 ± 0.689
4.228ValVal: 4.228 ± 0.731
0.479ValTrp: 0.479 ± 0.17
2.393ValTyr: 2.393 ± 0.975
0.0ValXaa: 0.0 ± 0.0
Trp
0.878TrpAla: 0.878 ± 0.283
0.0TrpCys: 0.0 ± 0.0
1.197TrpAsp: 1.197 ± 0.301
0.957TrpGlu: 0.957 ± 0.3
0.16TrpPhe: 0.16 ± 0.123
0.479TrpGly: 0.479 ± 0.232
0.399TrpHis: 0.399 ± 0.193
0.878TrpIle: 0.878 ± 0.371
0.718TrpLys: 0.718 ± 0.217
1.117TrpLeu: 1.117 ± 0.298
0.319TrpMet: 0.319 ± 0.139
0.718TrpAsn: 0.718 ± 0.266
0.16TrpPro: 0.16 ± 0.117
0.638TrpGln: 0.638 ± 0.225
0.638TrpArg: 0.638 ± 0.193
0.399TrpSer: 0.399 ± 0.146
0.239TrpThr: 0.239 ± 0.157
0.798TrpVal: 0.798 ± 0.247
0.08TrpTrp: 0.08 ± 0.083
0.399TrpTyr: 0.399 ± 0.169
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.234TyrAla: 2.234 ± 0.399
0.319TyrCys: 0.319 ± 0.193
3.032TyrAsp: 3.032 ± 0.652
2.234TyrGlu: 2.234 ± 0.439
2.074TyrPhe: 2.074 ± 0.386
2.473TyrGly: 2.473 ± 0.921
0.558TyrHis: 0.558 ± 0.192
2.473TyrIle: 2.473 ± 0.507
2.553TyrLys: 2.553 ± 0.656
1.915TyrLeu: 1.915 ± 0.419
0.479TyrMet: 0.479 ± 0.195
2.633TyrAsn: 2.633 ± 0.514
1.117TyrPro: 1.117 ± 0.314
1.995TyrGln: 1.995 ± 0.663
2.234TyrArg: 2.234 ± 0.42
2.393TyrSer: 2.393 ± 0.542
2.553TyrThr: 2.553 ± 0.5
1.835TyrVal: 1.835 ± 0.515
0.399TyrTrp: 0.399 ± 0.155
1.356TyrTyr: 1.356 ± 0.316
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 52 proteins (12535 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski