Amino acid dipeptide frequency for Streptococcus phage 20617

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each dipeptide would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
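The expected value quoted above follows from simple counting. As a worked check, assuming all residues are equally likely and adjacent positions are independent (a simplification, since real amino acid frequencies are not uniform):

```latex
\[
f_{20} = \frac{1}{20 \times 20} = \frac{1}{400} = 0.25\% = 2.5 \times 10^{-3},
\qquad
f_{21} = \frac{1}{21 \times 21} = \frac{1}{441} \approx 0.227\% \approx 2.27 \times 10^{-3}.
\]
```

In the units used by the table, 0.25% corresponds to 2.5‰, so for example AlaAla (6.466‰) lies above this expectation and CysCys (0.0‰) lies below it.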

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
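A minimal sketch of how such per mille values could be computed is given here. It assumes one-letter protein sequences, overlapping adjacent pairs counted within each protein, and any non-standard letter mapped to X (Xaa). The function names dipeptide_per_mille and classify are hypothetical and not part of Proteome-pI, and the table itself uses three-letter codes.

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code; anything else is mapped to "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X (assumption of this sketch).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_per_mille, expected=1000.0 / 400):
    """Label a frequency relative to the uniform expectation (2.5 per mille)."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# Toy input, not real phage proteins.
freqs = dipeptide_per_mille(["MKLLA", "MKA"])
print(freqs["MK"], classify(freqs["MK"]))
```

The result always contains all 441 keys, so dipeptides that never occur are reported as 0.0, as in the Xaa rows of the table.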

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰, value ± error), grouped by first residue. Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.466 ± 1.999
AlaCys: 0.337 ± 0.129
AlaAsp: 4.647 ± 0.741
AlaGlu: 5.86 ± 0.691
AlaPhe: 3.368 ± 0.733
AlaGly: 4.917 ± 0.98
AlaHis: 0.471 ± 0.181
AlaIle: 6.129 ± 1.412
AlaLys: 5.388 ± 0.543
AlaLeu: 6.197 ± 0.866
AlaMet: 2.492 ± 0.906
AlaAsn: 4.58 ± 0.522
AlaPro: 2.694 ± 0.392
AlaGln: 3.3 ± 0.799
AlaArg: 3.368 ± 0.632
AlaSer: 5.388 ± 1.151
AlaThr: 3.772 ± 0.81
AlaVal: 5.388 ± 1.041
AlaTrp: 0.876 ± 0.276
AlaTyr: 2.29 ± 0.403
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.269 ± 0.133
CysCys: 0.0 ± 0.0
CysAsp: 0.404 ± 0.185
CysGlu: 0.606 ± 0.228
CysPhe: 0.067 ± 0.066
CysGly: 0.606 ± 0.288
CysHis: 0.067 ± 0.073
CysIle: 0.202 ± 0.103
CysLys: 0.269 ± 0.129
CysLeu: 0.269 ± 0.122
CysMet: 0.0 ± 0.0
CysAsn: 0.135 ± 0.1
CysPro: 0.202 ± 0.119
CysGln: 0.0 ± 0.0
CysArg: 0.539 ± 0.203
CysSer: 0.606 ± 0.188
CysThr: 0.135 ± 0.104
CysVal: 0.404 ± 0.155
CysTrp: 0.067 ± 0.063
CysTyr: 0.269 ± 0.122
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.964 ± 0.412
AspCys: 0.606 ± 0.193
AspAsp: 4.243 ± 0.467
AspGlu: 4.109 ± 0.695
AspPhe: 3.839 ± 0.564
AspGly: 5.59 ± 0.903
AspHis: 0.741 ± 0.271
AspIle: 3.704 ± 0.572
AspLys: 4.513 ± 0.527
AspLeu: 4.513 ± 0.615
AspMet: 1.347 ± 0.319
AspAsn: 4.243 ± 0.536
AspPro: 1.01 ± 0.286
AspGln: 2.088 ± 0.323
AspArg: 2.223 ± 0.382
AspSer: 3.772 ± 0.59
AspThr: 3.368 ± 0.625
AspVal: 3.031 ± 0.495
AspTrp: 1.01 ± 0.298
AspTyr: 3.233 ± 0.543
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.58 ± 0.748
GluCys: 0.067 ± 0.048
GluAsp: 3.031 ± 0.492
GluGlu: 4.513 ± 0.86
GluPhe: 2.492 ± 0.385
GluGly: 3.098 ± 0.449
GluHis: 1.347 ± 0.318
GluIle: 4.917 ± 0.697
GluLys: 6.129 ± 0.994
GluLeu: 8.015 ± 1.19
GluMet: 2.627 ± 0.6
GluAsn: 4.917 ± 0.592
GluPro: 1.482 ± 0.392
GluGln: 3.839 ± 0.512
GluArg: 3.3 ± 0.646
GluSer: 2.694 ± 0.496
GluThr: 4.715 ± 0.683
GluVal: 5.658 ± 0.661
GluTrp: 0.876 ± 0.281
GluTyr: 3.3 ± 0.686
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.031 ± 0.473
PheCys: 0.135 ± 0.098
PheAsp: 3.031 ± 0.4
PheGlu: 4.109 ± 0.678
PhePhe: 1.549 ± 0.452
PheGly: 3.57 ± 0.568
PheHis: 0.539 ± 0.204
PheIle: 2.29 ± 0.317
PheLys: 3.907 ± 0.522
PheLeu: 2.223 ± 0.505
PheMet: 0.741 ± 0.259
PheAsn: 2.357 ± 0.403
PhePro: 0.943 ± 0.279
PheGln: 1.28 ± 0.273
PheArg: 1.145 ± 0.226
PheSer: 3.839 ± 0.666
PheThr: 2.762 ± 0.454
PheVal: 2.088 ± 0.389
PheTrp: 0.471 ± 0.216
PheTyr: 1.953 ± 0.414
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.58 ± 0.897
GlyCys: 0.606 ± 0.228
GlyAsp: 3.166 ± 0.435
GlyGlu: 3.502 ± 0.583
GlyPhe: 3.166 ± 0.466
GlyGly: 4.243 ± 0.61
GlyHis: 0.741 ± 0.22
GlyIle: 5.725 ± 1.084
GlyLys: 4.647 ± 0.573
GlyLeu: 7.072 ± 0.762
GlyMet: 1.616 ± 0.509
GlyAsn: 3.233 ± 0.523
GlyPro: 0.741 ± 0.279
GlyGln: 3.031 ± 0.381
GlyArg: 3.368 ± 0.518
GlySer: 4.243 ± 0.78
GlyThr: 4.647 ± 0.669
GlyVal: 4.58 ± 0.62
GlyTrp: 0.674 ± 0.205
GlyTyr: 3.166 ± 0.58
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.943 ± 0.237
HisCys: 0.0 ± 0.0
HisAsp: 0.741 ± 0.21
HisGlu: 0.674 ± 0.216
HisPhe: 0.741 ± 0.247
HisGly: 0.539 ± 0.243
HisHis: 0.471 ± 0.178
HisIle: 1.01 ± 0.253
HisLys: 0.606 ± 0.196
HisLeu: 1.212 ± 0.284
HisMet: 0.269 ± 0.147
HisAsn: 0.674 ± 0.284
HisPro: 0.269 ± 0.13
HisGln: 0.606 ± 0.218
HisArg: 0.674 ± 0.212
HisSer: 1.078 ± 0.36
HisThr: 0.539 ± 0.19
HisVal: 1.01 ± 0.277
HisTrp: 0.202 ± 0.116
HisTyr: 0.471 ± 0.227
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.927 ± 0.933
IleCys: 0.606 ± 0.192
IleAsp: 4.849 ± 0.582
IleGlu: 5.321 ± 0.832
IlePhe: 2.29 ± 0.316
IleGly: 4.984 ± 0.881
IleHis: 1.212 ± 0.315
IleIle: 3.637 ± 0.723
IleLys: 5.186 ± 0.443
IleLeu: 4.109 ± 0.477
IleMet: 1.347 ± 0.325
IleAsn: 3.166 ± 0.473
IlePro: 2.492 ± 0.518
IleGln: 2.762 ± 0.585
IleArg: 2.829 ± 0.457
IleSer: 5.456 ± 1.207
IleThr: 4.243 ± 0.644
IleVal: 4.109 ± 0.478
IleTrp: 0.539 ± 0.183
IleTyr: 2.762 ± 0.574
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.129 ± 0.78
LysCys: 0.337 ± 0.14
LysAsp: 4.647 ± 0.554
LysGlu: 6.668 ± 0.881
LysPhe: 2.896 ± 0.537
LysGly: 4.715 ± 0.46
LysHis: 1.078 ± 0.245
LysIle: 4.58 ± 0.58
LysLys: 6.533 ± 0.974
LysLeu: 6.129 ± 0.617
LysMet: 2.357 ± 0.436
LysAsn: 2.694 ± 0.424
LysPro: 2.694 ± 0.411
LysGln: 4.041 ± 0.717
LysArg: 4.782 ± 0.791
LysSer: 3.907 ± 0.507
LysThr: 4.378 ± 0.567
LysVal: 3.907 ± 0.544
LysTrp: 1.01 ± 0.235
LysTyr: 3.435 ± 0.498
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.937 ± 0.665
LeuCys: 0.337 ± 0.171
LeuAsp: 5.321 ± 0.659
LeuGlu: 7.678 ± 0.97
LeuPhe: 2.762 ± 0.474
LeuGly: 5.388 ± 0.663
LeuHis: 0.674 ± 0.205
LeuIle: 4.715 ± 0.463
LeuLys: 6.129 ± 0.839
LeuLeu: 5.725 ± 0.724
LeuMet: 2.694 ± 0.458
LeuAsn: 4.917 ± 0.564
LeuPro: 2.29 ± 0.366
LeuGln: 1.886 ± 0.34
LeuArg: 3.435 ± 0.517
LeuSer: 6.331 ± 0.674
LeuThr: 5.456 ± 0.75
LeuVal: 4.782 ± 0.52
LeuTrp: 0.471 ± 0.142
LeuTyr: 3.098 ± 0.512
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.627 ± 0.729
MetCys: 0.135 ± 0.102
MetAsp: 1.347 ± 0.27
MetGlu: 1.414 ± 0.336
MetPhe: 0.943 ± 0.209
MetGly: 1.347 ± 0.308
MetHis: 0.067 ± 0.06
MetIle: 1.751 ± 0.339
MetLys: 2.425 ± 0.416
MetLeu: 1.953 ± 0.307
MetMet: 0.808 ± 0.336
MetAsn: 1.145 ± 0.351
MetPro: 0.471 ± 0.205
MetGln: 1.414 ± 0.403
MetArg: 1.212 ± 0.265
MetSer: 2.559 ± 0.473
MetThr: 1.616 ± 0.358
MetVal: 1.886 ± 0.389
MetTrp: 0.067 ± 0.059
MetTyr: 0.943 ± 0.218
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.637 ± 0.461
AsnCys: 0.202 ± 0.106
AsnAsp: 3.233 ± 0.545
AsnGlu: 3.637 ± 0.591
AsnPhe: 2.155 ± 0.393
AsnGly: 4.984 ± 0.749
AsnHis: 0.943 ± 0.349
AsnIle: 3.368 ± 0.471
AsnLys: 5.052 ± 0.593
AsnLeu: 4.176 ± 0.575
AsnMet: 0.943 ± 0.268
AsnAsn: 2.762 ± 0.493
AsnPro: 2.492 ± 0.315
AsnGln: 2.694 ± 0.664
AsnArg: 2.357 ± 0.479
AsnSer: 2.964 ± 0.417
AsnThr: 2.896 ± 0.499
AsnVal: 3.233 ± 0.449
AsnTrp: 1.01 ± 0.276
AsnTyr: 2.29 ± 0.455
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.751 ± 0.367
ProCys: 0.067 ± 0.064
ProAsp: 2.088 ± 0.395
ProGlu: 2.088 ± 0.451
ProPhe: 1.28 ± 0.404
ProGly: 1.145 ± 0.379
ProHis: 0.269 ± 0.115
ProIle: 1.886 ± 0.365
ProLys: 2.559 ± 0.368
ProLeu: 2.223 ± 0.381
ProMet: 0.269 ± 0.148
ProAsn: 2.021 ± 0.536
ProPro: 0.674 ± 0.19
ProGln: 1.28 ± 0.244
ProArg: 1.414 ± 0.356
ProSer: 2.021 ± 0.378
ProThr: 1.819 ± 0.42
ProVal: 1.616 ± 0.294
ProTrp: 0.202 ± 0.109
ProTyr: 1.01 ± 0.282
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.378 ± 0.911
GlnCys: 0.202 ± 0.123
GlnAsp: 2.559 ± 0.433
GlnGlu: 2.627 ± 0.476
GlnPhe: 2.223 ± 0.393
GlnGly: 2.357 ± 0.548
GlnHis: 0.471 ± 0.201
GlnIle: 2.223 ± 0.48
GlnLys: 2.559 ± 0.459
GlnLeu: 4.243 ± 0.477
GlnMet: 1.953 ± 0.337
GlnAsn: 1.684 ± 0.272
GlnPro: 0.876 ± 0.213
GlnGln: 2.559 ± 0.632
GlnArg: 1.616 ± 0.347
GlnSer: 2.829 ± 0.521
GlnThr: 2.559 ± 0.471
GlnVal: 1.953 ± 0.438
GlnTrp: 0.674 ± 0.22
GlnTyr: 1.347 ± 0.308
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.704 ± 0.436
ArgCys: 0.269 ± 0.132
ArgAsp: 2.425 ± 0.308
ArgGlu: 3.3 ± 0.564
ArgPhe: 1.414 ± 0.275
ArgGly: 3.031 ± 0.489
ArgHis: 0.471 ± 0.17
ArgIle: 3.098 ± 0.518
ArgLys: 3.502 ± 0.551
ArgLeu: 4.647 ± 0.573
ArgMet: 1.414 ± 0.311
ArgAsn: 2.357 ± 0.499
ArgPro: 0.674 ± 0.164
ArgGln: 1.616 ± 0.413
ArgArg: 1.347 ± 0.415
ArgSer: 2.762 ± 0.369
ArgThr: 2.155 ± 0.411
ArgVal: 2.425 ± 0.391
ArgTrp: 0.606 ± 0.23
ArgTyr: 2.492 ± 0.588
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.86 ± 1.962
SerCys: 0.404 ± 0.181
SerAsp: 4.243 ± 0.626
SerGlu: 4.041 ± 0.517
SerPhe: 2.896 ± 0.444
SerGly: 4.715 ± 0.562
SerHis: 0.741 ± 0.209
SerIle: 5.658 ± 0.872
SerLys: 3.637 ± 0.545
SerLeu: 5.186 ± 0.694
SerMet: 1.751 ± 0.304
SerAsn: 3.098 ± 0.483
SerPro: 2.021 ± 0.304
SerGln: 3.3 ± 0.785
SerArg: 2.829 ± 0.38
SerSer: 5.254 ± 1.056
SerThr: 4.647 ± 0.653
SerVal: 4.378 ± 0.689
SerTrp: 0.606 ± 0.172
SerTyr: 2.694 ± 0.442
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.658 ± 1.593
ThrCys: 0.135 ± 0.097
ThrAsp: 2.964 ± 0.502
ThrGlu: 3.772 ± 0.531
ThrPhe: 3.098 ± 0.395
ThrGly: 4.311 ± 0.532
ThrHis: 0.876 ± 0.261
ThrIle: 5.254 ± 0.673
ThrLys: 4.445 ± 0.691
ThrLeu: 4.715 ± 0.532
ThrMet: 1.482 ± 0.614
ThrAsn: 3.435 ± 0.543
ThrPro: 2.021 ± 0.398
ThrGln: 2.492 ± 0.333
ThrArg: 2.155 ± 0.346
ThrSer: 4.58 ± 0.692
ThrThr: 3.704 ± 0.58
ThrVal: 4.445 ± 0.441
ThrTrp: 0.471 ± 0.24
ThrTyr: 2.425 ± 0.462
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.58 ± 0.966
ValCys: 0.404 ± 0.159
ValAsp: 4.109 ± 0.574
ValGlu: 4.58 ± 0.636
ValPhe: 2.492 ± 0.353
ValGly: 3.772 ± 0.597
ValHis: 0.741 ± 0.222
ValIle: 4.109 ± 0.477
ValLys: 5.59 ± 0.652
ValLeu: 4.243 ± 0.508
ValMet: 0.943 ± 0.242
ValAsn: 4.445 ± 0.671
ValPro: 2.223 ± 0.495
ValGln: 1.886 ± 0.376
ValArg: 2.223 ± 0.281
ValSer: 4.311 ± 0.524
ValThr: 4.647 ± 0.587
ValVal: 3.435 ± 0.508
ValTrp: 0.606 ± 0.182
ValTyr: 1.684 ± 0.377
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.539 ± 0.185
TrpCys: 0.0 ± 0.0
TrpAsp: 0.404 ± 0.132
TrpGlu: 0.876 ± 0.23
TrpPhe: 0.337 ± 0.144
TrpGly: 0.943 ± 0.217
TrpHis: 0.202 ± 0.107
TrpIle: 0.741 ± 0.253
TrpLys: 0.876 ± 0.224
TrpLeu: 0.876 ± 0.229
TrpMet: 0.0 ± 0.0
TrpAsn: 0.674 ± 0.324
TrpPro: 0.135 ± 0.093
TrpGln: 0.471 ± 0.159
TrpArg: 0.539 ± 0.222
TrpSer: 0.943 ± 0.357
TrpThr: 1.078 ± 0.31
TrpVal: 0.674 ± 0.218
TrpTrp: 0.337 ± 0.173
TrpTyr: 0.404 ± 0.154
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.368 ± 0.489
TyrCys: 0.269 ± 0.124
TyrAsp: 2.694 ± 0.541
TyrGlu: 2.29 ± 0.476
TyrPhe: 1.953 ± 0.45
TyrGly: 2.492 ± 0.547
TyrHis: 0.606 ± 0.201
TyrIle: 2.829 ± 0.512
TyrLys: 2.964 ± 0.457
TyrLeu: 3.3 ± 0.579
TyrMet: 1.01 ± 0.291
TyrAsn: 2.492 ± 0.456
TyrPro: 1.28 ± 0.307
TyrGln: 1.347 ± 0.298
TyrArg: 2.357 ± 0.484
TyrSer: 2.425 ± 0.404
TyrThr: 3.3 ± 0.581
TyrVal: 2.021 ± 0.351
TyrTrp: 0.269 ± 0.131
TyrTyr: 1.819 ± 0.471
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 68 proteins (14848 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
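A protein-level bootstrap of this kind could look roughly like the sketch below. It is an illustration under stated assumptions, not the Proteome-pI implementation: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the standard deviation across replicates is taken as the error. The names pair_counts and bootstrap_error are hypothetical, and whether the published ± values are standard deviations is an assumption.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Estimate the spread of one dipeptide frequency (per mille) by resampling proteins."""
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Standard deviation across replicates, used here as the error (assumption).
    return statistics.stdev(estimates)


# Toy input, not real phage proteins.
toy = ["MKLLA", "MKA", "AAKL"]
print(bootstrap_error(toy, "MK"))
```

Resampling whole proteins rather than individual residue pairs keeps the pairs from one protein together, which is what "at the protein level" suggests. A dipeptide that never occurs yields an error of 0.0, consistent with the 0.0 ± 0.0 entries in the table.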

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski