Amino acid dipepetide frequency for Streptococcus phage CHPC640

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.897AlaAla: 5.897 ± 1.438
0.249AlaCys: 0.249 ± 0.141
4.236AlaAsp: 4.236 ± 0.626
4.402AlaGlu: 4.402 ± 0.69
2.658AlaPhe: 2.658 ± 1.121
5.731AlaGly: 5.731 ± 1.205
0.831AlaHis: 0.831 ± 0.242
6.395AlaIle: 6.395 ± 1.594
5.814AlaLys: 5.814 ± 0.75
6.894AlaLeu: 6.894 ± 0.9
2.492AlaMet: 2.492 ± 0.788
3.571AlaAsn: 3.571 ± 0.592
2.326AlaPro: 2.326 ± 0.494
2.575AlaGln: 2.575 ± 0.78
3.239AlaArg: 3.239 ± 0.627
6.229AlaSer: 6.229 ± 1.401
4.651AlaThr: 4.651 ± 0.631
3.654AlaVal: 3.654 ± 0.843
0.997AlaTrp: 0.997 ± 0.373
2.326AlaTyr: 2.326 ± 0.536
0.0AlaXaa: 0.0 ± 0.0
Cys
0.249CysAla: 0.249 ± 0.139
0.0CysCys: 0.0 ± 0.0
0.664CysAsp: 0.664 ± 0.253
0.415CysGlu: 0.415 ± 0.217
0.083CysPhe: 0.083 ± 0.083
0.748CysGly: 0.748 ± 0.338
0.166CysHis: 0.166 ± 0.126
0.166CysIle: 0.166 ± 0.11
0.498CysLys: 0.498 ± 0.224
0.415CysLeu: 0.415 ± 0.231
0.083CysMet: 0.083 ± 0.083
0.332CysAsn: 0.332 ± 0.147
0.083CysPro: 0.083 ± 0.068
0.083CysGln: 0.083 ± 0.068
0.249CysArg: 0.249 ± 0.15
0.498CysSer: 0.498 ± 0.294
0.0CysThr: 0.0 ± 0.0
0.166CysVal: 0.166 ± 0.108
0.083CysTrp: 0.083 ± 0.082
0.581CysTyr: 0.581 ± 0.344
0.0CysXaa: 0.0 ± 0.0
Asp
3.156AspAla: 3.156 ± 0.493
0.748AspCys: 0.748 ± 0.272
4.734AspAsp: 4.734 ± 0.815
3.488AspGlu: 3.488 ± 0.729
3.239AspPhe: 3.239 ± 0.557
6.395AspGly: 6.395 ± 1.53
0.581AspHis: 0.581 ± 0.238
3.488AspIle: 3.488 ± 0.523
5.066AspLys: 5.066 ± 0.664
4.402AspLeu: 4.402 ± 0.641
1.495AspMet: 1.495 ± 0.364
4.568AspAsn: 4.568 ± 0.727
0.997AspPro: 0.997 ± 0.326
1.495AspGln: 1.495 ± 0.31
2.99AspArg: 2.99 ± 0.542
4.236AspSer: 4.236 ± 0.738
3.239AspThr: 3.239 ± 0.461
3.239AspVal: 3.239 ± 0.574
1.329AspTrp: 1.329 ± 0.356
3.322AspTyr: 3.322 ± 0.595
0.0AspXaa: 0.0 ± 0.0
Glu
5.399GluAla: 5.399 ± 0.823
0.332GluCys: 0.332 ± 0.176
2.492GluAsp: 2.492 ± 0.481
3.239GluGlu: 3.239 ± 0.568
2.907GluPhe: 2.907 ± 0.523
3.405GluGly: 3.405 ± 0.513
1.163GluHis: 1.163 ± 0.292
5.316GluIle: 5.316 ± 0.768
4.568GluLys: 4.568 ± 0.924
6.811GluLeu: 6.811 ± 0.979
1.993GluMet: 1.993 ± 0.416
4.485GluAsn: 4.485 ± 0.712
1.91GluPro: 1.91 ± 0.591
2.409GluGln: 2.409 ± 0.655
3.322GluArg: 3.322 ± 0.754
2.907GluSer: 2.907 ± 0.518
3.821GluThr: 3.821 ± 0.659
4.734GluVal: 4.734 ± 0.837
1.163GluTrp: 1.163 ± 0.265
3.571GluTyr: 3.571 ± 0.738
0.0GluXaa: 0.0 ± 0.0
Phe
2.243PheAla: 2.243 ± 0.346
0.415PheCys: 0.415 ± 0.21
2.907PheAsp: 2.907 ± 0.597
3.239PheGlu: 3.239 ± 0.672
1.246PhePhe: 1.246 ± 0.327
3.405PheGly: 3.405 ± 0.618
0.415PheHis: 0.415 ± 0.199
2.575PheIle: 2.575 ± 0.44
4.9PheLys: 4.9 ± 0.803
1.993PheLeu: 1.993 ± 0.485
0.914PheMet: 0.914 ± 0.292
2.907PheAsn: 2.907 ± 0.46
0.581PhePro: 0.581 ± 0.28
1.163PheGln: 1.163 ± 0.342
1.578PheArg: 1.578 ± 0.345
2.409PheSer: 2.409 ± 0.483
2.575PheThr: 2.575 ± 0.598
1.744PheVal: 1.744 ± 0.388
0.664PheTrp: 0.664 ± 0.202
1.08PheTyr: 1.08 ± 0.359
0.0PheXaa: 0.0 ± 0.0
Gly
5.233GlyAla: 5.233 ± 0.766
0.083GlyCys: 0.083 ± 0.074
3.571GlyAsp: 3.571 ± 0.501
3.987GlyGlu: 3.987 ± 0.477
2.658GlyPhe: 2.658 ± 0.586
3.405GlyGly: 3.405 ± 0.506
0.748GlyHis: 0.748 ± 0.233
6.312GlyIle: 6.312 ± 1.727
7.807GlyLys: 7.807 ± 0.95
6.561GlyLeu: 6.561 ± 0.762
1.993GlyMet: 1.993 ± 0.573
3.322GlyAsn: 3.322 ± 0.592
0.914GlyPro: 0.914 ± 0.363
2.907GlyGln: 2.907 ± 0.53
3.488GlyArg: 3.488 ± 0.799
4.236GlySer: 4.236 ± 0.545
5.648GlyThr: 5.648 ± 0.892
4.485GlyVal: 4.485 ± 0.735
0.748GlyTrp: 0.748 ± 0.221
3.073GlyTyr: 3.073 ± 0.544
0.0GlyXaa: 0.0 ± 0.0
His
0.581HisAla: 0.581 ± 0.226
0.083HisCys: 0.083 ± 0.092
1.08HisAsp: 1.08 ± 0.277
0.831HisGlu: 0.831 ± 0.276
0.415HisPhe: 0.415 ± 0.163
0.914HisGly: 0.914 ± 0.354
0.498HisHis: 0.498 ± 0.197
0.831HisIle: 0.831 ± 0.244
0.748HisLys: 0.748 ± 0.264
1.163HisLeu: 1.163 ± 0.315
0.166HisMet: 0.166 ± 0.125
0.498HisAsn: 0.498 ± 0.198
0.581HisPro: 0.581 ± 0.235
0.166HisGln: 0.166 ± 0.107
0.748HisArg: 0.748 ± 0.286
0.997HisSer: 0.997 ± 0.282
0.914HisThr: 0.914 ± 0.265
0.914HisVal: 0.914 ± 0.292
0.083HisTrp: 0.083 ± 0.083
0.581HisTyr: 0.581 ± 0.202
0.0HisXaa: 0.0 ± 0.0
Ile
5.731IleAla: 5.731 ± 1.197
0.498IleCys: 0.498 ± 0.2
5.066IleAsp: 5.066 ± 0.495
4.651IleGlu: 4.651 ± 0.694
1.91IlePhe: 1.91 ± 0.392
4.734IleGly: 4.734 ± 1.1
0.748IleHis: 0.748 ± 0.243
3.654IleIle: 3.654 ± 0.605
6.229IleLys: 6.229 ± 0.61
3.405IleLeu: 3.405 ± 0.499
1.91IleMet: 1.91 ± 0.452
3.654IleAsn: 3.654 ± 0.639
2.492IlePro: 2.492 ± 0.413
2.99IleGln: 2.99 ± 0.477
2.824IleArg: 2.824 ± 0.502
6.063IleSer: 6.063 ± 1.343
4.153IleThr: 4.153 ± 0.824
3.904IleVal: 3.904 ± 0.883
0.581IleTrp: 0.581 ± 0.197
3.322IleTyr: 3.322 ± 0.64
0.0IleXaa: 0.0 ± 0.0
Lys
7.226LysAla: 7.226 ± 0.761
0.332LysCys: 0.332 ± 0.185
4.651LysAsp: 4.651 ± 0.788
7.89LysGlu: 7.89 ± 1.11
2.658LysPhe: 2.658 ± 0.513
6.063LysGly: 6.063 ± 0.965
0.664LysHis: 0.664 ± 0.256
4.153LysIle: 4.153 ± 0.668
5.897LysLys: 5.897 ± 1.084
6.561LysLeu: 6.561 ± 0.792
1.993LysMet: 1.993 ± 0.51
4.568LysAsn: 4.568 ± 0.699
2.741LysPro: 2.741 ± 0.364
2.326LysGln: 2.326 ± 0.426
4.734LysArg: 4.734 ± 0.752
4.568LysSer: 4.568 ± 0.67
5.316LysThr: 5.316 ± 0.662
4.402LysVal: 4.402 ± 0.558
1.163LysTrp: 1.163 ± 0.283
4.153LysTyr: 4.153 ± 0.91
0.0LysXaa: 0.0 ± 0.0
Leu
6.146LeuAla: 6.146 ± 0.878
0.249LeuCys: 0.249 ± 0.147
5.15LeuAsp: 5.15 ± 0.653
6.478LeuGlu: 6.478 ± 0.92
3.073LeuPhe: 3.073 ± 0.407
5.482LeuGly: 5.482 ± 1.036
0.664LeuHis: 0.664 ± 0.214
4.153LeuIle: 4.153 ± 0.569
6.312LeuLys: 6.312 ± 0.758
4.319LeuLeu: 4.319 ± 0.619
1.661LeuMet: 1.661 ± 0.331
5.565LeuAsn: 5.565 ± 0.647
3.073LeuPro: 3.073 ± 0.505
2.907LeuGln: 2.907 ± 0.411
2.741LeuArg: 2.741 ± 0.578
5.15LeuSer: 5.15 ± 0.766
6.312LeuThr: 6.312 ± 1.021
4.983LeuVal: 4.983 ± 0.664
0.997LeuTrp: 0.997 ± 0.395
2.326LeuTyr: 2.326 ± 0.384
0.0LeuXaa: 0.0 ± 0.0
Met
2.824MetAla: 2.824 ± 0.785
0.083MetCys: 0.083 ± 0.068
1.163MetAsp: 1.163 ± 0.35
1.329MetGlu: 1.329 ± 0.31
0.914MetPhe: 0.914 ± 0.242
1.329MetGly: 1.329 ± 0.405
0.581MetHis: 0.581 ± 0.213
1.412MetIle: 1.412 ± 0.443
2.741MetLys: 2.741 ± 0.442
1.661MetLeu: 1.661 ± 0.366
1.163MetMet: 1.163 ± 0.491
1.08MetAsn: 1.08 ± 0.292
0.581MetPro: 0.581 ± 0.205
1.495MetGln: 1.495 ± 0.501
1.163MetArg: 1.163 ± 0.332
1.91MetSer: 1.91 ± 0.435
1.329MetThr: 1.329 ± 0.282
1.412MetVal: 1.412 ± 0.374
0.083MetTrp: 0.083 ± 0.08
0.664MetTyr: 0.664 ± 0.27
0.0MetXaa: 0.0 ± 0.0
Asn
3.405AsnAla: 3.405 ± 0.634
0.498AsnCys: 0.498 ± 0.176
3.987AsnAsp: 3.987 ± 0.82
3.904AsnGlu: 3.904 ± 0.649
2.326AsnPhe: 2.326 ± 0.466
5.15AsnGly: 5.15 ± 0.718
1.495AsnHis: 1.495 ± 0.409
3.738AsnIle: 3.738 ± 0.679
3.322AsnLys: 3.322 ± 0.605
4.485AsnLeu: 4.485 ± 0.499
1.08AsnMet: 1.08 ± 0.378
3.073AsnAsn: 3.073 ± 0.585
2.492AsnPro: 2.492 ± 0.465
1.661AsnGln: 1.661 ± 0.405
1.827AsnArg: 1.827 ± 0.412
2.99AsnSer: 2.99 ± 0.489
3.156AsnThr: 3.156 ± 0.589
3.904AsnVal: 3.904 ± 0.466
1.578AsnTrp: 1.578 ± 0.432
1.993AsnTyr: 1.993 ± 0.426
0.0AsnXaa: 0.0 ± 0.0
Pro
1.744ProAla: 1.744 ± 0.459
0.249ProCys: 0.249 ± 0.184
1.993ProAsp: 1.993 ± 0.429
1.744ProGlu: 1.744 ± 0.434
0.997ProPhe: 0.997 ± 0.276
1.827ProGly: 1.827 ± 0.528
0.332ProHis: 0.332 ± 0.204
1.993ProIle: 1.993 ± 0.373
2.658ProLys: 2.658 ± 0.467
1.993ProLeu: 1.993 ± 0.531
0.415ProMet: 0.415 ± 0.181
2.076ProAsn: 2.076 ± 0.555
0.914ProPro: 0.914 ± 0.263
1.08ProGln: 1.08 ± 0.303
0.748ProArg: 0.748 ± 0.274
2.243ProSer: 2.243 ± 0.446
1.993ProThr: 1.993 ± 0.454
2.409ProVal: 2.409 ± 0.504
0.166ProTrp: 0.166 ± 0.119
0.748ProTyr: 0.748 ± 0.241
0.0ProXaa: 0.0 ± 0.0
Gln
3.904GlnAla: 3.904 ± 0.695
0.249GlnCys: 0.249 ± 0.113
1.495GlnAsp: 1.495 ± 0.298
2.907GlnGlu: 2.907 ± 0.602
1.578GlnPhe: 1.578 ± 0.379
2.326GlnGly: 2.326 ± 0.636
0.332GlnHis: 0.332 ± 0.14
2.824GlnIle: 2.824 ± 0.742
2.326GlnLys: 2.326 ± 0.436
3.156GlnLeu: 3.156 ± 0.461
0.748GlnMet: 0.748 ± 0.24
1.744GlnAsn: 1.744 ± 0.255
1.08GlnPro: 1.08 ± 0.312
1.246GlnGln: 1.246 ± 0.34
1.412GlnArg: 1.412 ± 0.279
2.741GlnSer: 2.741 ± 0.623
2.326GlnThr: 2.326 ± 0.334
2.326GlnVal: 2.326 ± 0.411
0.415GlnTrp: 0.415 ± 0.167
1.329GlnTyr: 1.329 ± 0.407
0.0GlnXaa: 0.0 ± 0.0
Arg
2.824ArgAla: 2.824 ± 0.479
0.415ArgCys: 0.415 ± 0.212
2.159ArgAsp: 2.159 ± 0.402
2.492ArgGlu: 2.492 ± 0.519
1.661ArgPhe: 1.661 ± 0.464
2.824ArgGly: 2.824 ± 0.545
0.498ArgHis: 0.498 ± 0.198
3.073ArgIle: 3.073 ± 0.69
3.488ArgLys: 3.488 ± 0.772
3.322ArgLeu: 3.322 ± 0.753
1.661ArgMet: 1.661 ± 0.492
2.159ArgAsn: 2.159 ± 0.428
0.997ArgPro: 0.997 ± 0.285
1.495ArgGln: 1.495 ± 0.319
2.159ArgArg: 2.159 ± 0.465
2.243ArgSer: 2.243 ± 0.412
2.326ArgThr: 2.326 ± 0.535
3.405ArgVal: 3.405 ± 0.589
0.664ArgTrp: 0.664 ± 0.228
2.575ArgTyr: 2.575 ± 0.345
0.0ArgXaa: 0.0 ± 0.0
Ser
5.565SerAla: 5.565 ± 2.263
0.166SerCys: 0.166 ± 0.134
4.402SerAsp: 4.402 ± 0.683
3.821SerGlu: 3.821 ± 0.658
2.575SerPhe: 2.575 ± 0.513
4.983SerGly: 4.983 ± 0.675
0.664SerHis: 0.664 ± 0.269
5.399SerIle: 5.399 ± 0.888
4.568SerLys: 4.568 ± 0.64
5.066SerLeu: 5.066 ± 0.728
1.744SerMet: 1.744 ± 0.342
3.904SerAsn: 3.904 ± 0.589
1.495SerPro: 1.495 ± 0.326
3.571SerGln: 3.571 ± 0.721
2.741SerArg: 2.741 ± 0.552
5.731SerSer: 5.731 ± 0.978
3.654SerThr: 3.654 ± 0.526
5.814SerVal: 5.814 ± 0.755
0.581SerTrp: 0.581 ± 0.225
1.993SerTyr: 1.993 ± 0.403
0.0SerXaa: 0.0 ± 0.0
Thr
4.402ThrAla: 4.402 ± 1.091
0.332ThrCys: 0.332 ± 0.186
3.987ThrAsp: 3.987 ± 0.854
3.156ThrGlu: 3.156 ± 0.536
3.239ThrPhe: 3.239 ± 0.488
4.153ThrGly: 4.153 ± 0.566
1.163ThrHis: 1.163 ± 0.218
5.066ThrIle: 5.066 ± 0.55
6.063ThrLys: 6.063 ± 0.73
5.731ThrLeu: 5.731 ± 0.655
0.997ThrMet: 0.997 ± 0.437
1.993ThrAsn: 1.993 ± 0.323
1.495ThrPro: 1.495 ± 0.333
2.326ThrGln: 2.326 ± 0.424
1.993ThrArg: 1.993 ± 0.38
4.236ThrSer: 4.236 ± 0.926
4.236ThrThr: 4.236 ± 0.641
5.399ThrVal: 5.399 ± 0.668
0.581ThrTrp: 0.581 ± 0.277
3.239ThrTyr: 3.239 ± 0.784
0.0ThrXaa: 0.0 ± 0.0
Val
4.568ValAla: 4.568 ± 0.573
0.249ValCys: 0.249 ± 0.138
4.568ValAsp: 4.568 ± 0.782
4.9ValGlu: 4.9 ± 0.808
2.409ValPhe: 2.409 ± 0.486
4.485ValGly: 4.485 ± 0.685
0.581ValHis: 0.581 ± 0.191
4.9ValIle: 4.9 ± 0.831
5.15ValLys: 5.15 ± 0.554
4.485ValLeu: 4.485 ± 0.553
0.914ValMet: 0.914 ± 0.196
3.987ValAsn: 3.987 ± 0.784
2.243ValPro: 2.243 ± 0.389
2.409ValGln: 2.409 ± 0.411
1.91ValArg: 1.91 ± 0.414
5.316ValSer: 5.316 ± 0.666
4.568ValThr: 4.568 ± 0.546
4.319ValVal: 4.319 ± 0.566
1.08ValTrp: 1.08 ± 0.277
1.744ValTyr: 1.744 ± 0.406
0.0ValXaa: 0.0 ± 0.0
Trp
0.748TrpAla: 0.748 ± 0.258
0.083TrpCys: 0.083 ± 0.084
0.831TrpAsp: 0.831 ± 0.299
1.08TrpGlu: 1.08 ± 0.319
0.581TrpPhe: 0.581 ± 0.22
0.997TrpGly: 0.997 ± 0.28
0.166TrpHis: 0.166 ± 0.097
0.748TrpIle: 0.748 ± 0.251
0.997TrpLys: 0.997 ± 0.245
0.914TrpLeu: 0.914 ± 0.287
0.083TrpMet: 0.083 ± 0.088
0.831TrpAsn: 0.831 ± 0.244
0.249TrpPro: 0.249 ± 0.12
0.498TrpGln: 0.498 ± 0.195
0.664TrpArg: 0.664 ± 0.244
1.246TrpSer: 1.246 ± 0.458
1.08TrpThr: 1.08 ± 0.462
1.08TrpVal: 1.08 ± 0.317
0.166TrpTrp: 0.166 ± 0.114
0.332TrpTyr: 0.332 ± 0.138
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.322TyrAla: 3.322 ± 0.681
0.249TyrCys: 0.249 ± 0.124
3.073TyrAsp: 3.073 ± 0.703
1.993TyrGlu: 1.993 ± 0.433
1.827TyrPhe: 1.827 ± 0.357
2.824TyrGly: 2.824 ± 0.478
0.581TyrHis: 0.581 ± 0.202
2.326TyrIle: 2.326 ± 0.622
2.907TyrLys: 2.907 ± 0.489
4.402TyrLeu: 4.402 ± 0.804
1.329TyrMet: 1.329 ± 0.357
1.744TyrAsn: 1.744 ± 0.336
1.08TyrPro: 1.08 ± 0.322
1.661TyrGln: 1.661 ± 0.434
1.91TyrArg: 1.91 ± 0.639
2.492TyrSer: 2.492 ± 0.483
2.492TyrThr: 2.492 ± 0.55
2.492TyrVal: 2.492 ± 0.54
0.249TyrTrp: 0.249 ± 0.13
2.326TyrTyr: 2.326 ± 0.523
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 55 proteins (12041 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski