Amino acid dipepetide frequency for Streptococcus phage Javan116

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.243AlaAla: 5.243 ± 1.356
0.244AlaCys: 0.244 ± 0.243
4.268AlaAsp: 4.268 ± 0.643
4.877AlaGlu: 4.877 ± 0.664
2.805AlaPhe: 2.805 ± 0.605
4.39AlaGly: 4.39 ± 0.979
0.732AlaHis: 0.732 ± 0.336
6.341AlaIle: 6.341 ± 1.199
4.756AlaLys: 4.756 ± 0.737
6.585AlaLeu: 6.585 ± 0.775
2.805AlaMet: 2.805 ± 0.584
3.658AlaAsn: 3.658 ± 0.97
1.097AlaPro: 1.097 ± 0.393
3.536AlaGln: 3.536 ± 0.842
2.561AlaArg: 2.561 ± 0.481
3.78AlaSer: 3.78 ± 0.707
3.17AlaThr: 3.17 ± 0.559
4.268AlaVal: 4.268 ± 0.836
0.975AlaTrp: 0.975 ± 0.398
2.073AlaTyr: 2.073 ± 0.589
0.0AlaXaa: 0.0 ± 0.0
Cys
0.244CysAla: 0.244 ± 0.154
0.0CysCys: 0.0 ± 0.0
0.244CysAsp: 0.244 ± 0.194
0.244CysGlu: 0.244 ± 0.186
0.0CysPhe: 0.0 ± 0.0
0.366CysGly: 0.366 ± 0.244
0.244CysHis: 0.244 ± 0.169
0.488CysIle: 0.488 ± 0.238
0.854CysLys: 0.854 ± 0.399
0.366CysLeu: 0.366 ± 0.193
0.0CysMet: 0.0 ± 0.0
0.61CysAsn: 0.61 ± 0.257
0.122CysPro: 0.122 ± 0.129
0.488CysGln: 0.488 ± 0.222
0.244CysArg: 0.244 ± 0.148
0.366CysSer: 0.366 ± 0.206
0.366CysThr: 0.366 ± 0.234
0.488CysVal: 0.488 ± 0.203
0.0CysTrp: 0.0 ± 0.0
0.366CysTyr: 0.366 ± 0.208
0.0CysXaa: 0.0 ± 0.0
Asp
2.805AspAla: 2.805 ± 0.482
0.61AspCys: 0.61 ± 0.272
4.146AspAsp: 4.146 ± 0.799
5.243AspGlu: 5.243 ± 1.077
3.658AspPhe: 3.658 ± 0.628
4.756AspGly: 4.756 ± 0.607
0.61AspHis: 0.61 ± 0.244
6.097AspIle: 6.097 ± 0.755
5.365AspLys: 5.365 ± 0.516
4.877AspLeu: 4.877 ± 0.968
2.317AspMet: 2.317 ± 0.599
3.048AspAsn: 3.048 ± 0.692
1.463AspPro: 1.463 ± 0.416
0.732AspGln: 0.732 ± 0.261
2.439AspArg: 2.439 ± 0.563
3.414AspSer: 3.414 ± 0.528
3.658AspThr: 3.658 ± 0.712
4.39AspVal: 4.39 ± 0.748
1.585AspTrp: 1.585 ± 0.454
2.317AspTyr: 2.317 ± 0.557
0.0AspXaa: 0.0 ± 0.0
Glu
6.463GluAla: 6.463 ± 1.261
0.366GluCys: 0.366 ± 0.234
3.292GluAsp: 3.292 ± 0.551
7.56GluGlu: 7.56 ± 1.328
2.926GluPhe: 2.926 ± 0.567
3.536GluGly: 3.536 ± 0.807
0.488GluHis: 0.488 ± 0.236
6.219GluIle: 6.219 ± 0.849
7.56GluLys: 7.56 ± 1.236
8.901GluLeu: 8.901 ± 1.087
2.561GluMet: 2.561 ± 0.655
5.121GluAsn: 5.121 ± 0.649
1.707GluPro: 1.707 ± 0.561
3.78GluGln: 3.78 ± 0.644
4.024GluArg: 4.024 ± 0.606
3.414GluSer: 3.414 ± 0.557
4.024GluThr: 4.024 ± 0.747
4.146GluVal: 4.146 ± 0.79
0.975GluTrp: 0.975 ± 0.334
2.073GluTyr: 2.073 ± 0.373
0.0GluXaa: 0.0 ± 0.0
Phe
2.683PheAla: 2.683 ± 0.511
0.244PheCys: 0.244 ± 0.172
3.048PheAsp: 3.048 ± 0.54
4.024PheGlu: 4.024 ± 0.793
1.219PhePhe: 1.219 ± 0.398
3.292PheGly: 3.292 ± 0.544
0.244PheHis: 0.244 ± 0.171
3.414PheIle: 3.414 ± 0.673
3.17PheLys: 3.17 ± 0.598
3.292PheLeu: 3.292 ± 0.622
1.463PheMet: 1.463 ± 0.413
2.439PheAsn: 2.439 ± 0.483
1.463PhePro: 1.463 ± 0.373
1.707PheGln: 1.707 ± 0.4
0.854PheArg: 0.854 ± 0.204
2.805PheSer: 2.805 ± 0.581
1.341PheThr: 1.341 ± 0.411
2.195PheVal: 2.195 ± 0.485
0.366PheTrp: 0.366 ± 0.199
1.585PheTyr: 1.585 ± 0.553
0.0PheXaa: 0.0 ± 0.0
Gly
4.756GlyAla: 4.756 ± 1.265
0.366GlyCys: 0.366 ± 0.228
4.512GlyAsp: 4.512 ± 0.735
2.561GlyGlu: 2.561 ± 0.556
2.683GlyPhe: 2.683 ± 0.621
3.536GlyGly: 3.536 ± 0.734
0.61GlyHis: 0.61 ± 0.315
4.634GlyIle: 4.634 ± 0.846
5.365GlyLys: 5.365 ± 0.929
3.78GlyLeu: 3.78 ± 0.604
1.707GlyMet: 1.707 ± 0.485
3.048GlyAsn: 3.048 ± 0.813
0.732GlyPro: 0.732 ± 0.283
1.463GlyGln: 1.463 ± 0.419
3.536GlyArg: 3.536 ± 0.558
3.414GlySer: 3.414 ± 0.62
3.536GlyThr: 3.536 ± 0.678
4.146GlyVal: 4.146 ± 0.732
1.219GlyTrp: 1.219 ± 0.367
3.902GlyTyr: 3.902 ± 0.614
0.0GlyXaa: 0.0 ± 0.0
His
0.732HisAla: 0.732 ± 0.325
0.244HisCys: 0.244 ± 0.246
0.854HisAsp: 0.854 ± 0.27
0.975HisGlu: 0.975 ± 0.275
1.341HisPhe: 1.341 ± 0.457
0.488HisGly: 0.488 ± 0.26
0.488HisHis: 0.488 ± 0.232
0.732HisIle: 0.732 ± 0.299
1.097HisLys: 1.097 ± 0.389
1.341HisLeu: 1.341 ± 0.454
0.122HisMet: 0.122 ± 0.122
0.732HisAsn: 0.732 ± 0.26
0.0HisPro: 0.0 ± 0.0
0.732HisGln: 0.732 ± 0.284
0.366HisArg: 0.366 ± 0.204
0.975HisSer: 0.975 ± 0.403
0.854HisThr: 0.854 ± 0.266
0.61HisVal: 0.61 ± 0.24
0.366HisTrp: 0.366 ± 0.226
0.732HisTyr: 0.732 ± 0.335
0.0HisXaa: 0.0 ± 0.0
Ile
5.121IleAla: 5.121 ± 0.635
0.732IleCys: 0.732 ± 0.278
6.097IleAsp: 6.097 ± 0.774
7.804IleGlu: 7.804 ± 1.134
2.439IlePhe: 2.439 ± 0.591
4.39IleGly: 4.39 ± 0.8
0.61IleHis: 0.61 ± 0.263
4.024IleIle: 4.024 ± 0.815
5.609IleLys: 5.609 ± 0.679
5.731IleLeu: 5.731 ± 0.701
2.317IleMet: 2.317 ± 0.472
3.78IleAsn: 3.78 ± 0.681
2.317IlePro: 2.317 ± 0.509
2.073IleGln: 2.073 ± 0.382
3.658IleArg: 3.658 ± 0.602
5.365IleSer: 5.365 ± 1.109
2.561IleThr: 2.561 ± 0.608
4.756IleVal: 4.756 ± 0.712
0.61IleTrp: 0.61 ± 0.249
2.195IleTyr: 2.195 ± 0.524
0.0IleXaa: 0.0 ± 0.0
Lys
6.341LysAla: 6.341 ± 0.934
0.244LysCys: 0.244 ± 0.159
5.731LysAsp: 5.731 ± 1.07
6.097LysGlu: 6.097 ± 0.921
2.195LysPhe: 2.195 ± 0.531
4.146LysGly: 4.146 ± 0.799
0.61LysHis: 0.61 ± 0.311
4.512LysIle: 4.512 ± 0.695
6.097LysLys: 6.097 ± 0.609
7.682LysLeu: 7.682 ± 0.877
2.683LysMet: 2.683 ± 0.617
4.756LysAsn: 4.756 ± 0.698
2.561LysPro: 2.561 ± 0.423
3.536LysGln: 3.536 ± 0.563
4.268LysArg: 4.268 ± 0.768
4.39LysSer: 4.39 ± 0.615
4.39LysThr: 4.39 ± 0.834
5.731LysVal: 5.731 ± 0.672
0.488LysTrp: 0.488 ± 0.213
3.658LysTyr: 3.658 ± 0.631
0.0LysXaa: 0.0 ± 0.0
Leu
5.365LeuAla: 5.365 ± 0.718
0.244LeuCys: 0.244 ± 0.165
5.487LeuAsp: 5.487 ± 0.597
6.585LeuGlu: 6.585 ± 0.861
3.902LeuPhe: 3.902 ± 0.75
5.365LeuGly: 5.365 ± 0.711
1.463LeuHis: 1.463 ± 0.407
5.121LeuIle: 5.121 ± 0.722
7.072LeuLys: 7.072 ± 0.785
5.609LeuLeu: 5.609 ± 1.068
2.073LeuMet: 2.073 ± 0.465
5.243LeuAsn: 5.243 ± 0.631
2.926LeuPro: 2.926 ± 0.65
3.78LeuGln: 3.78 ± 0.875
3.17LeuArg: 3.17 ± 0.583
6.097LeuSer: 6.097 ± 0.854
4.999LeuThr: 4.999 ± 0.702
4.756LeuVal: 4.756 ± 1.062
0.61LeuTrp: 0.61 ± 0.283
3.902LeuTyr: 3.902 ± 0.673
0.0LeuXaa: 0.0 ± 0.0
Met
1.951MetAla: 1.951 ± 0.525
0.244MetCys: 0.244 ± 0.179
1.463MetAsp: 1.463 ± 0.406
2.439MetGlu: 2.439 ± 0.528
1.341MetPhe: 1.341 ± 0.303
0.854MetGly: 0.854 ± 0.261
0.366MetHis: 0.366 ± 0.257
1.463MetIle: 1.463 ± 0.33
2.683MetLys: 2.683 ± 0.514
2.439MetLeu: 2.439 ± 0.61
0.975MetMet: 0.975 ± 0.368
2.561MetAsn: 2.561 ± 0.537
0.61MetPro: 0.61 ± 0.264
0.854MetGln: 0.854 ± 0.318
1.219MetArg: 1.219 ± 0.376
1.951MetSer: 1.951 ± 0.428
2.317MetThr: 2.317 ± 0.537
1.829MetVal: 1.829 ± 0.514
0.366MetTrp: 0.366 ± 0.21
0.732MetTyr: 0.732 ± 0.447
0.0MetXaa: 0.0 ± 0.0
Asn
4.146AsnAla: 4.146 ± 1.164
0.122AsnCys: 0.122 ± 0.123
3.17AsnAsp: 3.17 ± 0.583
3.536AsnGlu: 3.536 ± 0.653
1.097AsnPhe: 1.097 ± 0.237
4.146AsnGly: 4.146 ± 0.95
0.854AsnHis: 0.854 ± 0.283
2.805AsnIle: 2.805 ± 0.65
4.39AsnLys: 4.39 ± 0.604
4.39AsnLeu: 4.39 ± 0.737
0.975AsnMet: 0.975 ± 0.389
3.536AsnAsn: 3.536 ± 0.604
1.951AsnPro: 1.951 ± 0.576
3.658AsnGln: 3.658 ± 0.743
2.561AsnArg: 2.561 ± 0.605
3.536AsnSer: 3.536 ± 0.726
2.926AsnThr: 2.926 ± 0.536
4.634AsnVal: 4.634 ± 0.885
1.463AsnTrp: 1.463 ± 0.318
1.951AsnTyr: 1.951 ± 0.473
0.0AsnXaa: 0.0 ± 0.0
Pro
2.317ProAla: 2.317 ± 0.52
0.244ProCys: 0.244 ± 0.204
1.219ProAsp: 1.219 ± 0.526
1.951ProGlu: 1.951 ± 0.52
1.097ProPhe: 1.097 ± 0.313
0.732ProGly: 0.732 ± 0.329
0.732ProHis: 0.732 ± 0.287
2.926ProIle: 2.926 ± 0.551
2.683ProLys: 2.683 ± 0.487
2.439ProLeu: 2.439 ± 0.501
0.366ProMet: 0.366 ± 0.196
1.463ProAsn: 1.463 ± 0.382
1.585ProPro: 1.585 ± 0.444
0.975ProGln: 0.975 ± 0.344
1.341ProArg: 1.341 ± 0.424
2.317ProSer: 2.317 ± 0.57
1.951ProThr: 1.951 ± 0.362
1.463ProVal: 1.463 ± 0.401
0.488ProTrp: 0.488 ± 0.176
0.488ProTyr: 0.488 ± 0.231
0.0ProXaa: 0.0 ± 0.0
Gln
3.17GlnAla: 3.17 ± 0.776
0.244GlnCys: 0.244 ± 0.182
1.707GlnAsp: 1.707 ± 0.329
4.268GlnGlu: 4.268 ± 0.742
1.341GlnPhe: 1.341 ± 0.403
2.073GlnGly: 2.073 ± 0.421
0.61GlnHis: 0.61 ± 0.251
2.805GlnIle: 2.805 ± 0.571
2.561GlnLys: 2.561 ± 0.475
3.048GlnLeu: 3.048 ± 0.46
0.732GlnMet: 0.732 ± 0.34
2.073GlnAsn: 2.073 ± 0.478
1.341GlnPro: 1.341 ± 0.38
1.707GlnGln: 1.707 ± 0.471
2.561GlnArg: 2.561 ± 0.557
2.926GlnSer: 2.926 ± 0.613
3.292GlnThr: 3.292 ± 0.523
2.195GlnVal: 2.195 ± 0.537
0.0GlnTrp: 0.0 ± 0.0
2.073GlnTyr: 2.073 ± 0.498
0.0GlnXaa: 0.0 ± 0.0
Arg
3.048ArgAla: 3.048 ± 0.552
0.488ArgCys: 0.488 ± 0.226
1.707ArgAsp: 1.707 ± 0.378
3.048ArgGlu: 3.048 ± 0.625
1.341ArgPhe: 1.341 ± 0.352
2.195ArgGly: 2.195 ± 0.715
0.975ArgHis: 0.975 ± 0.356
3.658ArgIle: 3.658 ± 0.444
4.756ArgLys: 4.756 ± 0.927
4.512ArgLeu: 4.512 ± 0.979
1.585ArgMet: 1.585 ± 0.41
2.439ArgAsn: 2.439 ± 0.658
1.951ArgPro: 1.951 ± 0.428
1.829ArgGln: 1.829 ± 0.628
2.683ArgArg: 2.683 ± 0.642
3.536ArgSer: 3.536 ± 0.832
1.585ArgThr: 1.585 ± 0.368
1.829ArgVal: 1.829 ± 0.468
0.488ArgTrp: 0.488 ± 0.27
2.073ArgTyr: 2.073 ± 0.52
0.0ArgXaa: 0.0 ± 0.0
Ser
3.414SerAla: 3.414 ± 0.927
0.244SerCys: 0.244 ± 0.163
4.756SerAsp: 4.756 ± 0.956
5.487SerGlu: 5.487 ± 0.584
2.805SerPhe: 2.805 ± 0.603
4.999SerGly: 4.999 ± 0.978
1.219SerHis: 1.219 ± 0.417
4.512SerIle: 4.512 ± 0.713
3.536SerLys: 3.536 ± 0.684
5.731SerLeu: 5.731 ± 0.848
2.073SerMet: 2.073 ± 0.41
3.902SerAsn: 3.902 ± 0.722
2.073SerPro: 2.073 ± 0.456
2.073SerGln: 2.073 ± 0.519
2.805SerArg: 2.805 ± 0.438
3.902SerSer: 3.902 ± 0.872
3.048SerThr: 3.048 ± 0.651
4.39SerVal: 4.39 ± 0.929
0.975SerTrp: 0.975 ± 0.282
2.439SerTyr: 2.439 ± 0.674
0.0SerXaa: 0.0 ± 0.0
Thr
3.048ThrAla: 3.048 ± 0.666
0.122ThrCys: 0.122 ± 0.129
3.17ThrAsp: 3.17 ± 0.679
4.634ThrGlu: 4.634 ± 0.814
2.805ThrPhe: 2.805 ± 0.652
3.78ThrGly: 3.78 ± 1.138
0.975ThrHis: 0.975 ± 0.339
5.487ThrIle: 5.487 ± 0.674
3.536ThrLys: 3.536 ± 0.592
4.146ThrLeu: 4.146 ± 0.647
0.975ThrMet: 0.975 ± 0.375
2.317ThrAsn: 2.317 ± 0.539
2.439ThrPro: 2.439 ± 0.535
1.951ThrGln: 1.951 ± 0.447
2.195ThrArg: 2.195 ± 0.475
3.048ThrSer: 3.048 ± 0.709
2.073ThrThr: 2.073 ± 0.483
3.902ThrVal: 3.902 ± 0.681
0.122ThrTrp: 0.122 ± 0.099
2.317ThrTyr: 2.317 ± 0.719
0.0ThrXaa: 0.0 ± 0.0
Val
4.39ValAla: 4.39 ± 0.625
0.366ValCys: 0.366 ± 0.227
4.39ValAsp: 4.39 ± 0.634
3.78ValGlu: 3.78 ± 0.637
2.317ValPhe: 2.317 ± 0.478
3.048ValGly: 3.048 ± 0.576
1.097ValHis: 1.097 ± 0.36
3.78ValIle: 3.78 ± 0.599
5.609ValLys: 5.609 ± 0.836
4.877ValLeu: 4.877 ± 0.8
1.707ValMet: 1.707 ± 0.716
2.561ValAsn: 2.561 ± 0.436
1.341ValPro: 1.341 ± 0.443
3.048ValGln: 3.048 ± 0.779
2.439ValArg: 2.439 ± 0.512
5.365ValSer: 5.365 ± 0.949
4.268ValThr: 4.268 ± 0.652
5.609ValVal: 5.609 ± 0.803
0.975ValTrp: 0.975 ± 0.391
2.805ValTyr: 2.805 ± 0.632
0.0ValXaa: 0.0 ± 0.0
Trp
0.732TrpAla: 0.732 ± 0.29
0.0TrpCys: 0.0 ± 0.0
0.366TrpAsp: 0.366 ± 0.194
0.854TrpGlu: 0.854 ± 0.338
0.975TrpPhe: 0.975 ± 0.292
1.097TrpGly: 1.097 ± 0.548
0.122TrpHis: 0.122 ± 0.134
0.61TrpIle: 0.61 ± 0.332
1.097TrpLys: 1.097 ± 0.436
1.219TrpLeu: 1.219 ± 0.383
0.122TrpMet: 0.122 ± 0.123
1.341TrpAsn: 1.341 ± 0.477
0.0TrpPro: 0.0 ± 0.0
0.732TrpGln: 0.732 ± 0.253
0.61TrpArg: 0.61 ± 0.254
1.097TrpSer: 1.097 ± 0.308
0.61TrpThr: 0.61 ± 0.218
0.366TrpVal: 0.366 ± 0.213
0.122TrpTrp: 0.122 ± 0.099
0.488TrpTyr: 0.488 ± 0.23
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.195TyrAla: 2.195 ± 0.558
0.732TyrCys: 0.732 ± 0.472
3.902TyrAsp: 3.902 ± 0.81
3.17TyrGlu: 3.17 ± 0.62
2.561TyrPhe: 2.561 ± 0.635
2.317TyrGly: 2.317 ± 0.618
0.732TyrHis: 0.732 ± 0.291
2.561TyrIle: 2.561 ± 0.51
2.317TyrLys: 2.317 ± 0.595
2.805TyrLeu: 2.805 ± 0.736
0.975TyrMet: 0.975 ± 0.414
1.219TyrAsn: 1.219 ± 0.401
1.097TyrPro: 1.097 ± 0.34
2.073TyrGln: 2.073 ± 0.57
2.195TyrArg: 2.195 ± 0.625
2.683TyrSer: 2.683 ± 0.548
2.073TyrThr: 2.073 ± 0.501
2.073TyrVal: 2.073 ± 0.603
0.366TyrTrp: 0.366 ± 0.185
1.341TyrTyr: 1.341 ± 0.358
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 44 proteins (8202 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski