Amino acid dipepetide frequency for Streptococcus phage CHPC952

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.318AlaAla: 6.318 ± 2.245
0.366AlaCys: 0.366 ± 0.189
5.31AlaAsp: 5.31 ± 0.831
4.486AlaGlu: 4.486 ± 0.659
3.296AlaPhe: 3.296 ± 1.18
4.761AlaGly: 4.761 ± 1.315
0.916AlaHis: 0.916 ± 0.26
5.677AlaIle: 5.677 ± 1.545
4.303AlaLys: 4.303 ± 0.568
5.951AlaLeu: 5.951 ± 1.05
2.289AlaMet: 2.289 ± 0.975
4.029AlaAsn: 4.029 ± 0.796
2.289AlaPro: 2.289 ± 0.529
3.205AlaGln: 3.205 ± 1.003
3.571AlaArg: 3.571 ± 0.713
6.226AlaSer: 6.226 ± 1.433
3.845AlaThr: 3.845 ± 0.949
5.768AlaVal: 5.768 ± 1.367
0.458AlaTrp: 0.458 ± 0.167
2.014AlaTyr: 2.014 ± 0.451
0.0AlaXaa: 0.0 ± 0.0
Cys
0.275CysAla: 0.275 ± 0.165
0.0CysCys: 0.0 ± 0.0
0.732CysAsp: 0.732 ± 0.291
0.549CysGlu: 0.549 ± 0.223
0.183CysPhe: 0.183 ± 0.134
0.366CysGly: 0.366 ± 0.216
0.092CysHis: 0.092 ± 0.109
0.183CysIle: 0.183 ± 0.116
0.458CysLys: 0.458 ± 0.217
0.275CysLeu: 0.275 ± 0.168
0.092CysMet: 0.092 ± 0.081
0.183CysAsn: 0.183 ± 0.129
0.092CysPro: 0.092 ± 0.102
0.0CysGln: 0.0 ± 0.0
0.183CysArg: 0.183 ± 0.139
0.641CysSer: 0.641 ± 0.236
0.0CysThr: 0.0 ± 0.0
0.458CysVal: 0.458 ± 0.185
0.183CysTrp: 0.183 ± 0.132
0.092CysTyr: 0.092 ± 0.075
0.0CysXaa: 0.0 ± 0.0
Asp
3.021AspAla: 3.021 ± 0.542
0.366AspCys: 0.366 ± 0.192
4.578AspAsp: 4.578 ± 0.883
3.937AspGlu: 3.937 ± 0.781
3.845AspPhe: 3.845 ± 0.711
6.318AspGly: 6.318 ± 1.23
0.275AspHis: 0.275 ± 0.196
3.662AspIle: 3.662 ± 0.775
4.486AspLys: 4.486 ± 0.983
5.219AspLeu: 5.219 ± 0.614
1.465AspMet: 1.465 ± 0.372
4.395AspAsn: 4.395 ± 0.662
0.824AspPro: 0.824 ± 0.3
1.19AspGln: 1.19 ± 0.313
2.747AspArg: 2.747 ± 0.702
4.486AspSer: 4.486 ± 0.951
3.479AspThr: 3.479 ± 0.683
3.021AspVal: 3.021 ± 0.553
1.099AspTrp: 1.099 ± 0.425
3.205AspTyr: 3.205 ± 0.699
0.0AspXaa: 0.0 ± 0.0
Glu
5.036GluAla: 5.036 ± 0.847
0.183GluCys: 0.183 ± 0.129
2.838GluAsp: 2.838 ± 0.469
4.395GluGlu: 4.395 ± 1.047
2.564GluPhe: 2.564 ± 0.524
3.021GluGly: 3.021 ± 0.557
1.465GluHis: 1.465 ± 0.408
4.212GluIle: 4.212 ± 0.699
5.31GluLys: 5.31 ± 1.171
7.233GluLeu: 7.233 ± 1.245
2.747GluMet: 2.747 ± 0.602
4.303GluAsn: 4.303 ± 0.78
1.465GluPro: 1.465 ± 0.433
2.838GluGln: 2.838 ± 0.549
4.395GluArg: 4.395 ± 0.871
2.655GluSer: 2.655 ± 0.598
3.296GluThr: 3.296 ± 0.656
5.768GluVal: 5.768 ± 0.969
0.916GluTrp: 0.916 ± 0.387
3.205GluTyr: 3.205 ± 0.787
0.0GluXaa: 0.0 ± 0.0
Phe
2.289PheAla: 2.289 ± 0.407
0.092PheCys: 0.092 ± 0.084
2.564PheAsp: 2.564 ± 0.636
4.303PheGlu: 4.303 ± 0.797
1.282PhePhe: 1.282 ± 0.442
4.12PheGly: 4.12 ± 0.761
0.366PheHis: 0.366 ± 0.202
3.021PheIle: 3.021 ± 0.511
5.31PheLys: 5.31 ± 0.82
2.014PheLeu: 2.014 ± 0.524
0.641PheMet: 0.641 ± 0.212
3.388PheAsn: 3.388 ± 0.457
0.641PhePro: 0.641 ± 0.282
1.282PheGln: 1.282 ± 0.338
1.373PheArg: 1.373 ± 0.327
4.12PheSer: 4.12 ± 0.676
2.655PheThr: 2.655 ± 0.716
1.923PheVal: 1.923 ± 0.459
0.732PheTrp: 0.732 ± 0.257
1.465PheTyr: 1.465 ± 0.505
0.0PheXaa: 0.0 ± 0.0
Gly
4.944GlyAla: 4.944 ± 1.019
0.366GlyCys: 0.366 ± 0.168
3.479GlyAsp: 3.479 ± 0.474
2.838GlyGlu: 2.838 ± 0.536
3.296GlyPhe: 3.296 ± 0.531
2.93GlyGly: 2.93 ± 0.538
1.282GlyHis: 1.282 ± 0.443
6.775GlyIle: 6.775 ± 1.546
6.226GlyLys: 6.226 ± 0.846
6.318GlyLeu: 6.318 ± 0.98
1.556GlyMet: 1.556 ± 0.766
3.662GlyAsn: 3.662 ± 0.727
1.373GlyPro: 1.373 ± 0.64
3.205GlyGln: 3.205 ± 0.499
3.021GlyArg: 3.021 ± 0.586
3.571GlySer: 3.571 ± 0.888
4.669GlyThr: 4.669 ± 0.795
5.219GlyVal: 5.219 ± 0.757
0.641GlyTrp: 0.641 ± 0.3
2.838GlyTyr: 2.838 ± 0.511
0.0GlyXaa: 0.0 ± 0.0
His
1.099HisAla: 1.099 ± 0.294
0.092HisCys: 0.092 ± 0.107
0.824HisAsp: 0.824 ± 0.254
0.641HisGlu: 0.641 ± 0.326
0.641HisPhe: 0.641 ± 0.222
1.007HisGly: 1.007 ± 0.33
0.458HisHis: 0.458 ± 0.223
0.824HisIle: 0.824 ± 0.272
1.19HisLys: 1.19 ± 0.324
1.282HisLeu: 1.282 ± 0.375
0.366HisMet: 0.366 ± 0.172
0.641HisAsn: 0.641 ± 0.36
0.275HisPro: 0.275 ± 0.15
0.366HisGln: 0.366 ± 0.188
0.824HisArg: 0.824 ± 0.29
1.099HisSer: 1.099 ± 0.388
1.007HisThr: 1.007 ± 0.3
1.282HisVal: 1.282 ± 0.5
0.092HisTrp: 0.092 ± 0.084
0.458HisTyr: 0.458 ± 0.228
0.0HisXaa: 0.0 ± 0.0
Ile
4.761IleAla: 4.761 ± 1.048
0.275IleCys: 0.275 ± 0.146
5.677IleAsp: 5.677 ± 0.685
4.303IleGlu: 4.303 ± 0.778
1.465IlePhe: 1.465 ± 0.492
5.86IleGly: 5.86 ± 1.128
0.916IleHis: 0.916 ± 0.26
2.93IleIle: 2.93 ± 0.585
4.395IleLys: 4.395 ± 0.575
3.388IleLeu: 3.388 ± 0.768
1.923IleMet: 1.923 ± 0.387
3.754IleAsn: 3.754 ± 0.803
2.655IlePro: 2.655 ± 0.734
3.571IleGln: 3.571 ± 0.615
2.655IleArg: 2.655 ± 0.671
5.677IleSer: 5.677 ± 1.67
4.303IleThr: 4.303 ± 0.794
4.12IleVal: 4.12 ± 0.704
0.732IleTrp: 0.732 ± 0.281
2.472IleTyr: 2.472 ± 0.669
0.0IleXaa: 0.0 ± 0.0
Lys
6.775LysAla: 6.775 ± 1.004
0.275LysCys: 0.275 ± 0.235
4.761LysAsp: 4.761 ± 0.753
7.05LysGlu: 7.05 ± 1.487
2.014LysPhe: 2.014 ± 0.37
5.036LysGly: 5.036 ± 0.71
1.282LysHis: 1.282 ± 0.385
5.402LysIle: 5.402 ± 0.617
5.036LysLys: 5.036 ± 1.094
6.958LysLeu: 6.958 ± 0.958
1.74LysMet: 1.74 ± 0.489
2.93LysAsn: 2.93 ± 0.464
3.754LysPro: 3.754 ± 0.642
2.747LysGln: 2.747 ± 0.547
4.669LysArg: 4.669 ± 0.775
4.578LysSer: 4.578 ± 0.652
4.761LysThr: 4.761 ± 0.664
3.296LysVal: 3.296 ± 0.635
1.099LysTrp: 1.099 ± 0.316
4.029LysTyr: 4.029 ± 0.748
0.0LysXaa: 0.0 ± 0.0
Leu
5.86LeuAla: 5.86 ± 1.037
0.183LeuCys: 0.183 ± 0.16
4.212LeuAsp: 4.212 ± 0.665
5.951LeuGlu: 5.951 ± 1.077
3.388LeuPhe: 3.388 ± 0.576
6.134LeuGly: 6.134 ± 0.879
0.824LeuHis: 0.824 ± 0.298
4.303LeuIle: 4.303 ± 0.569
6.318LeuLys: 6.318 ± 0.896
4.578LeuLeu: 4.578 ± 0.734
1.556LeuMet: 1.556 ± 0.355
5.31LeuAsn: 5.31 ± 0.645
1.923LeuPro: 1.923 ± 0.53
2.564LeuGln: 2.564 ± 0.492
2.93LeuArg: 2.93 ± 0.634
5.951LeuSer: 5.951 ± 0.683
6.318LeuThr: 6.318 ± 0.755
5.219LeuVal: 5.219 ± 0.645
0.458LeuTrp: 0.458 ± 0.213
3.021LeuTyr: 3.021 ± 0.632
0.0LeuXaa: 0.0 ± 0.0
Met
2.747MetAla: 2.747 ± 1.055
0.0MetCys: 0.0 ± 0.0
1.099MetAsp: 1.099 ± 0.28
1.007MetGlu: 1.007 ± 0.331
1.373MetPhe: 1.373 ± 0.252
1.099MetGly: 1.099 ± 0.311
0.366MetHis: 0.366 ± 0.232
1.465MetIle: 1.465 ± 0.446
2.106MetLys: 2.106 ± 0.549
1.465MetLeu: 1.465 ± 0.387
1.007MetMet: 1.007 ± 0.453
1.099MetAsn: 1.099 ± 0.322
0.549MetPro: 0.549 ± 0.226
1.373MetGln: 1.373 ± 0.469
1.007MetArg: 1.007 ± 0.289
2.381MetSer: 2.381 ± 0.378
1.648MetThr: 1.648 ± 0.349
1.648MetVal: 1.648 ± 0.459
0.0MetTrp: 0.0 ± 0.0
0.641MetTyr: 0.641 ± 0.255
0.0MetXaa: 0.0 ± 0.0
Asn
3.937AsnAla: 3.937 ± 0.661
0.275AsnCys: 0.275 ± 0.139
3.113AsnAsp: 3.113 ± 0.819
4.395AsnGlu: 4.395 ± 0.9
2.564AsnPhe: 2.564 ± 0.391
5.768AsnGly: 5.768 ± 1.12
1.282AsnHis: 1.282 ± 0.4
2.655AsnIle: 2.655 ± 0.55
4.761AsnLys: 4.761 ± 0.771
4.12AsnLeu: 4.12 ± 0.664
0.916AsnMet: 0.916 ± 0.321
3.479AsnAsn: 3.479 ± 0.698
2.564AsnPro: 2.564 ± 0.566
2.197AsnGln: 2.197 ± 0.497
2.564AsnArg: 2.564 ± 0.566
3.845AsnSer: 3.845 ± 0.568
2.93AsnThr: 2.93 ± 0.537
2.93AsnVal: 2.93 ± 0.645
1.19AsnTrp: 1.19 ± 0.393
1.831AsnTyr: 1.831 ± 0.443
0.0AsnXaa: 0.0 ± 0.0
Pro
1.74ProAla: 1.74 ± 0.303
0.183ProCys: 0.183 ± 0.186
1.74ProAsp: 1.74 ± 0.496
1.648ProGlu: 1.648 ± 0.427
1.282ProPhe: 1.282 ± 0.414
1.19ProGly: 1.19 ± 0.411
0.366ProHis: 0.366 ± 0.158
1.74ProIle: 1.74 ± 0.381
2.93ProLys: 2.93 ± 0.418
1.831ProLeu: 1.831 ± 0.403
0.092ProMet: 0.092 ± 0.085
2.197ProAsn: 2.197 ± 0.55
0.824ProPro: 0.824 ± 0.24
2.197ProGln: 2.197 ± 0.658
1.19ProArg: 1.19 ± 0.407
1.831ProSer: 1.831 ± 0.384
1.556ProThr: 1.556 ± 0.522
1.648ProVal: 1.648 ± 0.357
0.366ProTrp: 0.366 ± 0.174
1.099ProTyr: 1.099 ± 0.39
0.0ProXaa: 0.0 ± 0.0
Gln
3.845GlnAla: 3.845 ± 0.859
0.275GlnCys: 0.275 ± 0.166
2.747GlnAsp: 2.747 ± 0.492
2.93GlnGlu: 2.93 ± 0.703
2.472GlnPhe: 2.472 ± 0.598
2.838GlnGly: 2.838 ± 0.851
0.366GlnHis: 0.366 ± 0.166
2.381GlnIle: 2.381 ± 0.581
2.747GlnLys: 2.747 ± 0.492
4.212GlnLeu: 4.212 ± 0.477
1.465GlnMet: 1.465 ± 0.322
1.556GlnAsn: 1.556 ± 0.361
1.282GlnPro: 1.282 ± 0.362
1.648GlnGln: 1.648 ± 0.505
0.824GlnArg: 0.824 ± 0.283
2.564GlnSer: 2.564 ± 0.712
3.021GlnThr: 3.021 ± 0.519
2.381GlnVal: 2.381 ± 0.426
0.732GlnTrp: 0.732 ± 0.317
1.465GlnTyr: 1.465 ± 0.44
0.0GlnXaa: 0.0 ± 0.0
Arg
3.754ArgAla: 3.754 ± 0.48
0.732ArgCys: 0.732 ± 0.292
2.472ArgAsp: 2.472 ± 0.706
3.754ArgGlu: 3.754 ± 0.675
2.197ArgPhe: 2.197 ± 0.443
2.747ArgGly: 2.747 ± 0.514
0.458ArgHis: 0.458 ± 0.231
2.747ArgIle: 2.747 ± 0.698
3.479ArgLys: 3.479 ± 0.733
3.754ArgLeu: 3.754 ± 0.651
1.19ArgMet: 1.19 ± 0.328
1.465ArgAsn: 1.465 ± 0.429
0.824ArgPro: 0.824 ± 0.27
2.106ArgGln: 2.106 ± 0.447
1.465ArgArg: 1.465 ± 0.454
2.747ArgSer: 2.747 ± 0.518
2.106ArgThr: 2.106 ± 0.478
2.289ArgVal: 2.289 ± 0.504
0.641ArgTrp: 0.641 ± 0.284
2.106ArgTyr: 2.106 ± 0.524
0.0ArgXaa: 0.0 ± 0.0
Ser
7.05SerAla: 7.05 ± 2.88
0.549SerCys: 0.549 ± 0.215
4.761SerAsp: 4.761 ± 0.795
3.388SerGlu: 3.388 ± 0.652
2.655SerPhe: 2.655 ± 0.478
4.212SerGly: 4.212 ± 0.648
0.916SerHis: 0.916 ± 0.328
5.493SerIle: 5.493 ± 0.757
5.127SerLys: 5.127 ± 0.625
4.669SerLeu: 4.669 ± 0.889
1.282SerMet: 1.282 ± 0.263
4.12SerAsn: 4.12 ± 0.791
1.556SerPro: 1.556 ± 0.458
4.12SerGln: 4.12 ± 1.05
2.289SerArg: 2.289 ± 0.462
4.12SerSer: 4.12 ± 1.07
5.31SerThr: 5.31 ± 0.667
5.219SerVal: 5.219 ± 0.738
0.458SerTrp: 0.458 ± 0.234
2.014SerTyr: 2.014 ± 0.449
0.0SerXaa: 0.0 ± 0.0
Thr
4.669ThrAla: 4.669 ± 1.588
0.0ThrCys: 0.0 ± 0.0
3.113ThrAsp: 3.113 ± 0.616
3.937ThrGlu: 3.937 ± 0.654
3.571ThrPhe: 3.571 ± 0.483
4.12ThrGly: 4.12 ± 0.489
1.099ThrHis: 1.099 ± 0.325
4.761ThrIle: 4.761 ± 0.795
6.043ThrLys: 6.043 ± 0.819
5.31ThrLeu: 5.31 ± 0.611
1.465ThrMet: 1.465 ± 0.799
3.662ThrAsn: 3.662 ± 0.721
1.556ThrPro: 1.556 ± 0.367
3.021ThrGln: 3.021 ± 0.508
2.106ThrArg: 2.106 ± 0.455
3.296ThrSer: 3.296 ± 0.936
4.578ThrThr: 4.578 ± 0.891
4.944ThrVal: 4.944 ± 0.611
0.458ThrTrp: 0.458 ± 0.236
2.747ThrTyr: 2.747 ± 0.702
0.0ThrXaa: 0.0 ± 0.0
Val
4.12ValAla: 4.12 ± 1.143
0.366ValCys: 0.366 ± 0.182
4.212ValAsp: 4.212 ± 0.803
5.493ValGlu: 5.493 ± 0.774
2.655ValPhe: 2.655 ± 0.492
3.388ValGly: 3.388 ± 0.679
1.099ValHis: 1.099 ± 0.387
4.212ValIle: 4.212 ± 0.626
4.944ValLys: 4.944 ± 0.644
4.486ValLeu: 4.486 ± 0.485
0.732ValMet: 0.732 ± 0.29
4.486ValAsn: 4.486 ± 1.035
1.923ValPro: 1.923 ± 0.365
2.381ValGln: 2.381 ± 0.705
2.381ValArg: 2.381 ± 0.6
5.493ValSer: 5.493 ± 0.722
5.036ValThr: 5.036 ± 0.742
4.578ValVal: 4.578 ± 0.731
0.916ValTrp: 0.916 ± 0.333
1.923ValTyr: 1.923 ± 0.464
0.0ValXaa: 0.0 ± 0.0
Trp
0.366TrpAla: 0.366 ± 0.161
0.0TrpCys: 0.0 ± 0.0
0.549TrpAsp: 0.549 ± 0.22
0.824TrpGlu: 0.824 ± 0.247
0.549TrpPhe: 0.549 ± 0.234
0.824TrpGly: 0.824 ± 0.313
0.092TrpHis: 0.092 ± 0.093
0.549TrpIle: 0.549 ± 0.263
0.641TrpLys: 0.641 ± 0.25
0.732TrpLeu: 0.732 ± 0.253
0.183TrpMet: 0.183 ± 0.136
1.007TrpAsn: 1.007 ± 0.339
0.092TrpPro: 0.092 ± 0.091
0.183TrpGln: 0.183 ± 0.129
0.549TrpArg: 0.549 ± 0.233
1.556TrpSer: 1.556 ± 0.634
1.373TrpThr: 1.373 ± 0.578
1.099TrpVal: 1.099 ± 0.32
0.275TrpTrp: 0.275 ± 0.202
0.458TrpTyr: 0.458 ± 0.235
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.472TyrAla: 2.472 ± 0.409
0.458TyrCys: 0.458 ± 0.186
2.838TyrAsp: 2.838 ± 0.776
2.014TyrGlu: 2.014 ± 0.501
2.289TyrPhe: 2.289 ± 0.563
2.472TyrGly: 2.472 ± 0.518
0.549TyrHis: 0.549 ± 0.269
2.838TyrIle: 2.838 ± 0.651
2.381TyrLys: 2.381 ± 0.508
3.296TyrLeu: 3.296 ± 0.697
1.282TyrMet: 1.282 ± 0.427
1.648TyrAsn: 1.648 ± 0.506
1.19TyrPro: 1.19 ± 0.404
1.556TyrGln: 1.556 ± 0.411
2.289TyrArg: 2.289 ± 0.656
2.472TyrSer: 2.472 ± 0.541
2.564TyrThr: 2.564 ± 0.765
2.014TyrVal: 2.014 ± 0.361
0.458TyrTrp: 0.458 ± 0.204
1.74TyrTyr: 1.74 ± 0.606
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 46 proteins (10923 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski