Amino acid dipeptide frequency for Streptococcus phage CHPC929

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequencies.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
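The uniform expectation can be made concrete with a minimal sketch. It counts overlapping residue pairs and compares each frequency with 1000 / 20² = 2.5‰. The function names, the one-letter sequence input, and the normalization by the total number of counted pairs are assumptions for illustration, not necessarily what the database does.

```python
from collections import Counter

def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Assumption: sequences are one-letter-code strings and frequencies are
    normalized by the total number of pairs counted.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

def classify(freq_permille, n_letters=20):
    """Label a frequency relative to the uniform expectation 1000 / n_letters**2."""
    expected = 1000.0 / n_letters ** 2
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"

UNIFORM_20 = 1000.0 / 20 ** 2  # 2.5 per mille for 400 dipeptides
UNIFORM_21 = 1000.0 / 21 ** 2  # roughly 2.27 per mille for 441 dipeptides
```

With this convention, a value such as 4.902‰ would be labelled "over" and 0.395‰ "under".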

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
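As a small illustration of the unit conversion, the following sketch turns a per mille value into a fraction or a percentage. The helper names are hypothetical.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def permille_to_percent(value_permille):
    """Convert a per mille value to percent (1 per mille = 0.1 percent)."""
    return value_permille * 0.1

# Example: the uniform expectation of 2.5 per mille corresponds to 0.25 percent.
uniform_percent = permille_to_percent(2.5)
```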

For more information, see the "Sequence space" article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.902 ± 1.647
AlaCys: 0.395 ± 0.265
AlaAsp: 4.744 ± 1.001
AlaGlu: 4.981 ± 0.75
AlaPhe: 2.451 ± 0.713
AlaGly: 5.298 ± 1.025
AlaHis: 0.474 ± 0.157
AlaIle: 5.772 ± 1.511
AlaLys: 5.219 ± 0.769
AlaLeu: 5.614 ± 0.876
AlaMet: 2.214 ± 0.923
AlaAsn: 3.479 ± 0.663
AlaPro: 2.056 ± 0.548
AlaGln: 2.926 ± 0.911
AlaArg: 3.558 ± 0.549
AlaSer: 5.14 ± 1.104
AlaThr: 4.191 ± 0.65
AlaVal: 4.665 ± 0.768
AlaTrp: 0.712 ± 0.241
AlaTyr: 3.084 ± 0.498
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.237 ± 0.146
CysCys: 0.158 ± 0.117
CysAsp: 0.474 ± 0.232
CysGlu: 0.633 ± 0.277
CysPhe: 0.316 ± 0.152
CysGly: 0.474 ± 0.249
CysHis: 0.158 ± 0.11
CysIle: 0.395 ± 0.161
CysLys: 0.949 ± 0.289
CysLeu: 0.553 ± 0.211
CysMet: 0.079 ± 0.064
CysAsn: 0.712 ± 0.242
CysPro: 0.158 ± 0.104
CysGln: 0.0 ± 0.0
CysArg: 0.316 ± 0.202
CysSer: 0.553 ± 0.31
CysThr: 0.395 ± 0.193
CysVal: 0.158 ± 0.112
CysTrp: 0.079 ± 0.103
CysTyr: 0.553 ± 0.287
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.163 ± 0.461
AspCys: 0.712 ± 0.244
AspAsp: 3.716 ± 0.562
AspGlu: 4.191 ± 0.905
AspPhe: 3.242 ± 0.444
AspGly: 6.088 ± 1.124
AspHis: 0.87 ± 0.29
AspIle: 4.112 ± 0.6
AspLys: 4.744 ± 0.852
AspLeu: 4.507 ± 0.748
AspMet: 1.423 ± 0.298
AspAsn: 4.665 ± 0.766
AspPro: 1.186 ± 0.3
AspGln: 1.581 ± 0.391
AspArg: 1.977 ± 0.261
AspSer: 4.586 ± 0.658
AspThr: 3.716 ± 0.529
AspVal: 3.242 ± 0.481
AspTrp: 1.107 ± 0.354
AspTyr: 2.767 ± 0.443
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.428 ± 0.738
GluCys: 0.237 ± 0.143
GluAsp: 3.637 ± 0.596
GluGlu: 4.349 ± 0.825
GluPhe: 3.084 ± 0.572
GluGly: 3.4 ± 0.59
GluHis: 1.344 ± 0.388
GluIle: 5.456 ± 0.875
GluLys: 4.981 ± 1.166
GluLeu: 6.642 ± 1.25
GluMet: 2.056 ± 0.317
GluAsn: 3.874 ± 0.652
GluPro: 1.423 ± 0.428
GluGln: 2.926 ± 0.602
GluArg: 3.321 ± 0.72
GluSer: 3.321 ± 0.602
GluThr: 3.005 ± 0.536
GluVal: 4.981 ± 0.723
GluTrp: 1.186 ± 0.334
GluTyr: 2.767 ± 0.557
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.372 ± 0.442
PheCys: 0.474 ± 0.236
PheAsp: 3.005 ± 0.409
PheGlu: 3.558 ± 0.761
PhePhe: 1.819 ± 0.341
PheGly: 3.321 ± 0.682
PheHis: 0.395 ± 0.157
PheIle: 2.056 ± 0.361
PheLys: 4.586 ± 0.711
PheLeu: 2.372 ± 0.653
PheMet: 0.791 ± 0.214
PheAsn: 2.926 ± 0.428
PhePro: 0.553 ± 0.244
PheGln: 1.344 ± 0.287
PheArg: 1.581 ± 0.354
PheSer: 3.637 ± 0.652
PheThr: 2.847 ± 0.506
PheVal: 2.214 ± 0.548
PheTrp: 0.553 ± 0.191
PheTyr: 1.66 ± 0.407
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.535 ± 0.798
GlyCys: 0.395 ± 0.258
GlyAsp: 3.163 ± 0.368
GlyGlu: 3.163 ± 0.499
GlyPhe: 2.926 ± 0.485
GlyGly: 3.716 ± 0.602
GlyHis: 0.712 ± 0.254
GlyIle: 6.088 ± 1.943
GlyLys: 6.8 ± 0.859
GlyLeu: 4.902 ± 0.795
GlyMet: 1.977 ± 0.566
GlyAsn: 4.744 ± 0.773
GlyPro: 1.265 ± 0.49
GlyGln: 3.005 ± 0.496
GlyArg: 3.637 ± 0.689
GlySer: 4.349 ± 0.785
GlyThr: 4.586 ± 0.645
GlyVal: 4.27 ± 0.571
GlyTrp: 1.107 ± 0.292
GlyTyr: 2.451 ± 0.544
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.553 ± 0.235
HisCys: 0.0 ± 0.0
HisAsp: 0.949 ± 0.27
HisGlu: 0.553 ± 0.224
HisPhe: 0.553 ± 0.216
HisGly: 0.791 ± 0.287
HisHis: 0.553 ± 0.179
HisIle: 0.949 ± 0.298
HisLys: 1.344 ± 0.348
HisLeu: 1.028 ± 0.273
HisMet: 0.079 ± 0.072
HisAsn: 0.791 ± 0.262
HisPro: 0.316 ± 0.141
HisGln: 0.553 ± 0.189
HisArg: 0.474 ± 0.18
HisSer: 0.791 ± 0.238
HisThr: 0.633 ± 0.2
HisVal: 0.791 ± 0.253
HisTrp: 0.079 ± 0.083
HisTyr: 0.87 ± 0.257
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.456 ± 1.234
IleCys: 0.316 ± 0.162
IleAsp: 6.009 ± 0.815
IleGlu: 4.823 ± 0.748
IlePhe: 1.66 ± 0.382
IleGly: 4.823 ± 0.891
IleHis: 0.791 ± 0.205
IleIle: 4.349 ± 0.71
IleLys: 4.665 ± 0.474
IleLeu: 4.507 ± 0.498
IleMet: 1.344 ± 0.339
IleAsn: 3.558 ± 0.512
IlePro: 2.451 ± 0.524
IleGln: 2.688 ± 0.46
IleArg: 2.609 ± 0.449
IleSer: 5.93 ± 1.147
IleThr: 4.823 ± 0.873
IleVal: 3.637 ± 0.636
IleTrp: 1.107 ± 0.36
IleTyr: 3.084 ± 0.618
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.167 ± 0.825
LysCys: 0.158 ± 0.114
LysAsp: 4.349 ± 0.784
LysGlu: 6.405 ± 0.949
LysPhe: 2.53 ± 0.452
LysGly: 5.614 ± 0.661
LysHis: 1.186 ± 0.308
LysIle: 4.981 ± 0.67
LysLys: 5.535 ± 1.162
LysLeu: 6.484 ± 0.919
LysMet: 1.66 ± 0.388
LysAsn: 4.823 ± 0.818
LysPro: 2.609 ± 0.458
LysGln: 2.451 ± 0.453
LysArg: 4.428 ± 0.687
LysSer: 4.586 ± 0.501
LysThr: 6.247 ± 0.734
LysVal: 4.665 ± 0.647
LysTrp: 1.028 ± 0.261
LysTyr: 3.4 ± 0.826
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.93 ± 0.715
LeuCys: 0.633 ± 0.22
LeuAsp: 5.456 ± 0.837
LeuGlu: 6.247 ± 1.039
LeuPhe: 2.609 ± 0.353
LeuGly: 4.507 ± 0.743
LeuHis: 0.791 ± 0.289
LeuIle: 3.005 ± 0.42
LeuLys: 6.484 ± 0.729
LeuLeu: 4.665 ± 0.582
LeuMet: 1.898 ± 0.353
LeuAsn: 4.981 ± 0.46
LeuPro: 2.53 ± 0.563
LeuGln: 3.005 ± 0.461
LeuArg: 2.293 ± 0.467
LeuSer: 5.298 ± 0.863
LeuThr: 6.879 ± 1.023
LeuVal: 4.112 ± 0.67
LeuTrp: 0.712 ± 0.22
LeuTyr: 3.242 ± 0.523
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.451 ± 0.573
MetCys: 0.158 ± 0.12
MetAsp: 0.949 ± 0.265
MetGlu: 1.502 ± 0.381
MetPhe: 0.791 ± 0.258
MetGly: 1.186 ± 0.512
MetHis: 0.474 ± 0.21
MetIle: 1.74 ± 0.471
MetLys: 2.135 ± 0.374
MetLeu: 1.819 ± 0.35
MetMet: 0.87 ± 0.447
MetAsn: 0.949 ± 0.257
MetPro: 0.553 ± 0.214
MetGln: 1.265 ± 0.549
MetArg: 0.791 ± 0.212
MetSer: 1.898 ± 0.448
MetThr: 1.581 ± 0.389
MetVal: 1.423 ± 0.405
MetTrp: 0.0 ± 0.0
MetTyr: 0.791 ± 0.25
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.349 ± 0.691
AsnCys: 0.712 ± 0.225
AsnAsp: 3.005 ± 0.696
AsnGlu: 4.586 ± 0.767
AsnPhe: 2.293 ± 0.447
AsnGly: 6.247 ± 0.996
AsnHis: 1.107 ± 0.389
AsnIle: 3.005 ± 0.406
AsnLys: 3.637 ± 0.598
AsnLeu: 4.033 ± 0.501
AsnMet: 1.344 ± 0.37
AsnAsn: 4.191 ± 0.695
AsnPro: 2.451 ± 0.377
AsnGln: 1.898 ± 0.401
AsnArg: 2.847 ± 0.601
AsnSer: 3.242 ± 0.494
AsnThr: 3.637 ± 0.7
AsnVal: 4.191 ± 0.639
AsnTrp: 1.107 ± 0.296
AsnTyr: 2.214 ± 0.467
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.502 ± 0.318
ProCys: 0.158 ± 0.168
ProAsp: 1.819 ± 0.387
ProGlu: 1.74 ± 0.403
ProPhe: 1.344 ± 0.281
ProGly: 1.423 ± 0.292
ProHis: 0.316 ± 0.163
ProIle: 2.293 ± 0.378
ProLys: 2.609 ± 0.394
ProLeu: 1.186 ± 0.315
ProMet: 0.158 ± 0.113
ProAsn: 2.214 ± 0.568
ProPro: 1.107 ± 0.244
ProGln: 1.423 ± 0.413
ProArg: 1.423 ± 0.325
ProSer: 2.372 ± 0.414
ProThr: 1.028 ± 0.297
ProVal: 2.135 ± 0.326
ProTrp: 0.316 ± 0.15
ProTyr: 1.344 ± 0.436
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.005 ± 0.697
GlnCys: 0.316 ± 0.13
GlnAsp: 2.056 ± 0.515
GlnGlu: 2.214 ± 0.42
GlnPhe: 1.898 ± 0.398
GlnGly: 2.293 ± 0.77
GlnHis: 0.237 ± 0.137
GlnIle: 2.53 ± 0.457
GlnLys: 2.135 ± 0.44
GlnLeu: 3.479 ± 0.385
GlnMet: 0.949 ± 0.276
GlnAsn: 2.135 ± 0.363
GlnPro: 1.186 ± 0.331
GlnGln: 1.186 ± 0.257
GlnArg: 1.423 ± 0.385
GlnSer: 3.558 ± 0.72
GlnThr: 3.084 ± 0.461
GlnVal: 2.609 ± 0.428
GlnTrp: 0.316 ± 0.125
GlnTyr: 1.581 ± 0.406
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.926 ± 0.484
ArgCys: 0.474 ± 0.188
ArgAsp: 1.898 ± 0.338
ArgGlu: 2.926 ± 0.555
ArgPhe: 2.293 ± 0.501
ArgGly: 2.53 ± 0.404
ArgHis: 0.237 ± 0.186
ArgIle: 2.293 ± 0.491
ArgLys: 3.795 ± 0.73
ArgLeu: 3.716 ± 0.5
ArgMet: 1.581 ± 0.402
ArgAsn: 2.135 ± 0.433
ArgPro: 0.87 ± 0.22
ArgGln: 1.502 ± 0.346
ArgArg: 1.423 ± 0.406
ArgSer: 1.977 ± 0.437
ArgThr: 2.056 ± 0.359
ArgVal: 3.4 ± 0.491
ArgTrp: 0.791 ± 0.247
ArgTyr: 2.293 ± 0.474
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.326 ± 1.937
SerCys: 0.553 ± 0.253
SerAsp: 4.586 ± 0.587
SerGlu: 3.321 ± 0.546
SerPhe: 3.4 ± 0.568
SerGly: 5.14 ± 0.916
SerHis: 0.712 ± 0.293
SerIle: 5.219 ± 0.579
SerLys: 4.981 ± 0.687
SerLeu: 5.298 ± 0.661
SerMet: 1.74 ± 0.375
SerAsn: 3.163 ± 0.603
SerPro: 1.977 ± 0.374
SerGln: 3.242 ± 0.693
SerArg: 2.056 ± 0.515
SerSer: 4.428 ± 1.225
SerThr: 5.219 ± 0.69
SerVal: 5.456 ± 0.852
SerTrp: 0.87 ± 0.348
SerTyr: 1.819 ± 0.425
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.428 ± 1.065
ThrCys: 0.474 ± 0.282
ThrAsp: 4.033 ± 0.713
ThrGlu: 3.163 ± 0.492
ThrPhe: 3.558 ± 0.529
ThrGly: 3.637 ± 0.505
ThrHis: 0.791 ± 0.242
ThrIle: 6.326 ± 1.22
ThrLys: 5.377 ± 0.659
ThrLeu: 5.93 ± 0.719
ThrMet: 0.87 ± 0.391
ThrAsn: 3.479 ± 0.662
ThrPro: 2.372 ± 0.494
ThrGln: 2.53 ± 0.533
ThrArg: 2.056 ± 0.405
ThrSer: 4.665 ± 0.909
ThrThr: 4.665 ± 0.675
ThrVal: 5.14 ± 0.642
ThrTrp: 0.474 ± 0.308
ThrTyr: 3.005 ± 0.536
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.112 ± 0.779
ValCys: 0.316 ± 0.14
ValAsp: 4.033 ± 0.723
ValGlu: 5.06 ± 1.016
ValPhe: 2.926 ± 0.569
ValGly: 4.744 ± 0.947
ValHis: 0.553 ± 0.241
ValIle: 4.191 ± 0.535
ValLys: 5.772 ± 0.576
ValLeu: 4.902 ± 0.629
ValMet: 1.186 ± 0.297
ValAsn: 4.349 ± 0.729
ValPro: 1.423 ± 0.371
ValGln: 2.53 ± 0.388
ValArg: 1.74 ± 0.374
ValSer: 5.14 ± 0.639
ValThr: 4.744 ± 0.691
ValVal: 4.507 ± 0.657
ValTrp: 1.265 ± 0.269
ValTyr: 1.423 ± 0.341
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.633 ± 0.254
TrpCys: 0.237 ± 0.148
TrpAsp: 0.791 ± 0.226
TrpGlu: 0.949 ± 0.272
TrpPhe: 0.712 ± 0.283
TrpGly: 0.87 ± 0.214
TrpHis: 0.237 ± 0.134
TrpIle: 0.87 ± 0.302
TrpLys: 0.633 ± 0.238
TrpLeu: 0.87 ± 0.36
TrpMet: 0.237 ± 0.116
TrpAsn: 0.791 ± 0.227
TrpPro: 0.158 ± 0.119
TrpGln: 0.474 ± 0.22
TrpArg: 0.791 ± 0.294
TrpSer: 1.502 ± 0.597
TrpThr: 0.949 ± 0.338
TrpVal: 0.949 ± 0.241
TrpTrp: 0.316 ± 0.203
TrpTyr: 0.395 ± 0.2
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.4 ± 0.576
TyrCys: 0.474 ± 0.201
TyrAsp: 2.926 ± 0.652
TyrGlu: 2.056 ± 0.486
TyrPhe: 1.66 ± 0.301
TyrGly: 2.688 ± 0.526
TyrHis: 0.712 ± 0.313
TyrIle: 3.005 ± 0.511
TyrLys: 2.926 ± 0.467
TyrLeu: 2.847 ± 0.564
TyrMet: 0.791 ± 0.223
TyrAsn: 2.135 ± 0.463
TyrPro: 1.265 ± 0.316
TyrGln: 1.66 ± 0.389
TyrArg: 2.372 ± 0.521
TyrSer: 2.53 ± 0.436
TyrThr: 2.688 ± 0.589
TyrVal: 2.372 ± 0.577
TyrTrp: 0.237 ± 0.145
TyrTyr: 1.898 ± 0.667
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 58 proteins (12648 amino acids)
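The table entries follow a regular "dipeptide: value ± error" pattern, so they can be read into a lookup structure. The sketch below is one possible way to do that; the function name and the dictionary layout are assumptions made for illustration.

```python
import re

# Two three-letter residue codes, then "value ± error".
# The pattern is searched within the text, so anything preceding the
# dipeptide label on a line is ignored.
ENTRY = re.compile(r"([A-Z][a-z]{2})([A-Z][a-z]{2}): ([0-9.]+) ± ([0-9.]+)")

def parse_entries(text):
    """Return {(first, second): (frequency_permille, error_permille)}."""
    table = {}
    for first, second, mean, err in ENTRY.findall(text):
        table[(first, second)] = (float(mean), float(err))
    return table

# Three entries copied from the table, used as a small sample.
sample = """AlaAla: 4.902 ± 1.647
AlaCys: 0.395 ± 0.265
LeuThr: 6.879 ± 1.023"""

table = parse_entries(sample)
# Most frequent dipeptide within this sample only.
top = max(table, key=lambda pair: table[pair][0])
```

Keying the dictionary by a (first, second) tuple keeps the row and column residues separate, which makes it straightforward to sum over rows or columns later.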

Note: The error was estimated by bootstrapping (x100) at the protein level.
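A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is taken as the error. This is a minimal sketch under stated assumptions, not the database's actual code; the function names, the one-letter sequences, the normalization by pair count, and the use of the population standard deviation are all illustrative choices.

```python
import random
import statistics

def pair_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide among all overlapping pairs."""
    count = 0
    total = 0
    for seq in proteins:
        total += max(len(seq) - 1, 0)
        count += sum(1 for i in range(len(seq) - 1) if seq[i:i + 2] == pair)
    return 1000.0 * count / total if total else 0.0

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the frequency over resamples of whole proteins.

    Assumption: the error is the population standard deviation of the
    resampled estimates; the source does not say which statistic is used.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome contains, with replacement.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(resample, pair))
    return statistics.pstdev(estimates)
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.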

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski