Amino acid dipepetide frequency for Flavobacterium phage 2A

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.708AlaAla: 2.708 ± 0.744
0.285AlaCys: 0.285 ± 0.144
2.921AlaAsp: 2.921 ± 0.606
4.204AlaGlu: 4.204 ± 0.663
3.278AlaPhe: 3.278 ± 0.404
3.919AlaGly: 3.919 ± 0.779
0.784AlaHis: 0.784 ± 0.202
4.774AlaIle: 4.774 ± 0.59
5.059AlaLys: 5.059 ± 1.139
4.988AlaLeu: 4.988 ± 0.699
0.926AlaMet: 0.926 ± 0.267
3.919AlaAsn: 3.919 ± 0.589
1.069AlaPro: 1.069 ± 0.343
2.565AlaGln: 2.565 ± 0.668
1.71AlaArg: 1.71 ± 0.307
3.99AlaSer: 3.99 ± 0.53
3.206AlaThr: 3.206 ± 0.553
2.779AlaVal: 2.779 ± 0.423
0.641AlaTrp: 0.641 ± 0.208
2.138AlaTyr: 2.138 ± 0.408
0.0AlaXaa: 0.0 ± 0.0
Cys
0.356CysAla: 0.356 ± 0.134
0.285CysCys: 0.285 ± 0.118
0.784CysAsp: 0.784 ± 0.244
0.356CysGlu: 0.356 ± 0.185
0.356CysPhe: 0.356 ± 0.133
0.356CysGly: 0.356 ± 0.15
0.214CysHis: 0.214 ± 0.13
0.143CysIle: 0.143 ± 0.099
0.784CysLys: 0.784 ± 0.282
0.499CysLeu: 0.499 ± 0.194
0.214CysMet: 0.214 ± 0.094
0.428CysAsn: 0.428 ± 0.204
0.57CysPro: 0.57 ± 0.215
0.356CysGln: 0.356 ± 0.141
0.285CysArg: 0.285 ± 0.175
0.57CysSer: 0.57 ± 0.19
0.499CysThr: 0.499 ± 0.17
0.499CysVal: 0.499 ± 0.17
0.143CysTrp: 0.143 ± 0.098
0.641CysTyr: 0.641 ± 0.226
0.0CysXaa: 0.0 ± 0.0
Asp
4.062AspAla: 4.062 ± 0.641
0.641AspCys: 0.641 ± 0.221
2.494AspAsp: 2.494 ± 0.476
4.204AspGlu: 4.204 ± 0.562
3.278AspPhe: 3.278 ± 0.455
3.064AspGly: 3.064 ± 0.474
0.641AspHis: 0.641 ± 0.216
4.632AspIle: 4.632 ± 0.52
5.202AspLys: 5.202 ± 0.415
5.059AspLeu: 5.059 ± 0.637
1.069AspMet: 1.069 ± 0.293
2.779AspAsn: 2.779 ± 0.376
0.641AspPro: 0.641 ± 0.147
1.354AspGln: 1.354 ± 0.281
2.138AspArg: 2.138 ± 0.363
3.42AspSer: 3.42 ± 0.554
2.138AspThr: 2.138 ± 0.501
3.634AspVal: 3.634 ± 0.477
1.069AspTrp: 1.069 ± 0.275
3.278AspTyr: 3.278 ± 0.472
0.0AspXaa: 0.0 ± 0.0
Glu
3.848GluAla: 3.848 ± 0.453
0.855GluCys: 0.855 ± 0.257
3.777GluAsp: 3.777 ± 0.499
5.344GluGlu: 5.344 ± 0.638
3.135GluPhe: 3.135 ± 0.432
3.634GluGly: 3.634 ± 0.454
1.14GluHis: 1.14 ± 0.264
7.268GluIle: 7.268 ± 0.827
6.841GluLys: 6.841 ± 0.738
8.194GluLeu: 8.194 ± 0.844
1.639GluMet: 1.639 ± 0.306
5.629GluAsn: 5.629 ± 0.643
1.14GluPro: 1.14 ± 0.347
2.708GluGln: 2.708 ± 0.394
2.85GluArg: 2.85 ± 0.443
3.919GluSer: 3.919 ± 0.46
3.064GluThr: 3.064 ± 0.415
4.56GluVal: 4.56 ± 0.734
0.855GluTrp: 0.855 ± 0.296
3.064GluTyr: 3.064 ± 0.511
0.0GluXaa: 0.0 ± 0.0
Phe
1.995PheAla: 1.995 ± 0.375
0.713PheCys: 0.713 ± 0.176
3.206PheAsp: 3.206 ± 0.397
4.845PheGlu: 4.845 ± 0.612
2.138PhePhe: 2.138 ± 0.431
2.636PheGly: 2.636 ± 0.541
0.641PheHis: 0.641 ± 0.222
3.99PheIle: 3.99 ± 0.531
3.99PheLys: 3.99 ± 0.538
4.418PheLeu: 4.418 ± 0.581
0.784PheMet: 0.784 ± 0.256
2.708PheAsn: 2.708 ± 0.452
1.568PhePro: 1.568 ± 0.312
2.066PheGln: 2.066 ± 0.387
2.636PheArg: 2.636 ± 0.416
3.99PheSer: 3.99 ± 0.515
2.708PheThr: 2.708 ± 0.44
2.494PheVal: 2.494 ± 0.47
0.926PheTrp: 0.926 ± 0.254
2.708PheTyr: 2.708 ± 0.473
0.0PheXaa: 0.0 ± 0.0
Gly
2.921GlyAla: 2.921 ± 0.677
0.428GlyCys: 0.428 ± 0.196
1.924GlyAsp: 1.924 ± 0.356
2.993GlyGlu: 2.993 ± 0.527
3.135GlyPhe: 3.135 ± 0.435
2.423GlyGly: 2.423 ± 0.474
0.57GlyHis: 0.57 ± 0.194
4.774GlyIle: 4.774 ± 0.621
4.062GlyLys: 4.062 ± 0.741
4.56GlyLeu: 4.56 ± 0.615
1.568GlyMet: 1.568 ± 0.305
4.062GlyAsn: 4.062 ± 0.582
0.713GlyPro: 0.713 ± 0.218
1.425GlyGln: 1.425 ± 0.335
1.995GlyArg: 1.995 ± 0.424
2.779GlySer: 2.779 ± 0.454
3.064GlyThr: 3.064 ± 0.319
3.278GlyVal: 3.278 ± 0.696
0.998GlyTrp: 0.998 ± 0.208
2.565GlyTyr: 2.565 ± 0.399
0.0GlyXaa: 0.0 ± 0.0
His
0.428HisAla: 0.428 ± 0.165
0.071HisCys: 0.071 ± 0.067
0.641HisAsp: 0.641 ± 0.208
0.926HisGlu: 0.926 ± 0.256
1.14HisPhe: 1.14 ± 0.282
0.998HisGly: 0.998 ± 0.212
0.214HisHis: 0.214 ± 0.121
1.639HisIle: 1.639 ± 0.314
1.71HisLys: 1.71 ± 0.405
1.853HisLeu: 1.853 ± 0.397
0.285HisMet: 0.285 ± 0.117
1.211HisAsn: 1.211 ± 0.245
0.356HisPro: 0.356 ± 0.184
0.499HisGln: 0.499 ± 0.156
0.784HisArg: 0.784 ± 0.205
0.998HisSer: 0.998 ± 0.262
0.855HisThr: 0.855 ± 0.198
0.428HisVal: 0.428 ± 0.175
0.071HisTrp: 0.071 ± 0.075
0.784HisTyr: 0.784 ± 0.2
0.0HisXaa: 0.0 ± 0.0
Ile
5.202IleAla: 5.202 ± 0.812
0.713IleCys: 0.713 ± 0.274
5.202IleAsp: 5.202 ± 0.619
8.194IleGlu: 8.194 ± 0.612
4.133IlePhe: 4.133 ± 0.532
2.993IleGly: 2.993 ± 0.381
1.995IleHis: 1.995 ± 0.328
5.772IleIle: 5.772 ± 0.556
8.622IleLys: 8.622 ± 0.703
6.556IleLeu: 6.556 ± 0.589
2.423IleMet: 2.423 ± 0.392
6.128IleAsn: 6.128 ± 0.675
2.779IlePro: 2.779 ± 0.363
3.42IleGln: 3.42 ± 0.535
3.135IleArg: 3.135 ± 0.585
6.057IleSer: 6.057 ± 0.573
4.774IleThr: 4.774 ± 0.651
4.275IleVal: 4.275 ± 0.42
0.57IleTrp: 0.57 ± 0.21
2.993IleTyr: 2.993 ± 0.467
0.0IleXaa: 0.0 ± 0.0
Lys
5.914LysAla: 5.914 ± 0.784
0.356LysCys: 0.356 ± 0.151
4.204LysAsp: 4.204 ± 0.447
7.482LysGlu: 7.482 ± 0.627
3.705LysPhe: 3.705 ± 0.52
3.848LysGly: 3.848 ± 0.49
1.425LysHis: 1.425 ± 0.319
9.049LysIle: 9.049 ± 0.787
9.334LysLys: 9.334 ± 1.111
7.553LysLeu: 7.553 ± 0.723
2.28LysMet: 2.28 ± 0.414
7.054LysAsn: 7.054 ± 0.717
2.494LysPro: 2.494 ± 0.403
3.919LysGln: 3.919 ± 0.832
4.062LysArg: 4.062 ± 0.566
6.128LysSer: 6.128 ± 0.561
5.273LysThr: 5.273 ± 0.649
5.772LysVal: 5.772 ± 0.562
1.211LysTrp: 1.211 ± 0.261
4.703LysTyr: 4.703 ± 0.545
0.0LysXaa: 0.0 ± 0.0
Leu
4.988LeuAla: 4.988 ± 0.564
0.713LeuCys: 0.713 ± 0.254
4.489LeuAsp: 4.489 ± 0.573
6.199LeuGlu: 6.199 ± 0.541
4.133LeuPhe: 4.133 ± 0.525
4.062LeuGly: 4.062 ± 0.676
1.496LeuHis: 1.496 ± 0.306
6.841LeuIle: 6.841 ± 0.82
9.691LeuLys: 9.691 ± 0.735
8.052LeuLeu: 8.052 ± 0.763
2.138LeuMet: 2.138 ± 0.419
6.199LeuAsn: 6.199 ± 0.71
3.777LeuPro: 3.777 ± 0.63
4.062LeuGln: 4.062 ± 0.476
3.705LeuArg: 3.705 ± 0.5
6.413LeuSer: 6.413 ± 0.576
4.632LeuThr: 4.632 ± 0.6
3.848LeuVal: 3.848 ± 0.445
0.998LeuTrp: 0.998 ± 0.212
3.206LeuTyr: 3.206 ± 0.354
0.0LeuXaa: 0.0 ± 0.0
Met
1.853MetAla: 1.853 ± 0.363
0.143MetCys: 0.143 ± 0.132
1.425MetAsp: 1.425 ± 0.302
1.069MetGlu: 1.069 ± 0.308
0.784MetPhe: 0.784 ± 0.213
0.57MetGly: 0.57 ± 0.173
0.285MetHis: 0.285 ± 0.146
1.14MetIle: 1.14 ± 0.216
2.921MetLys: 2.921 ± 0.44
2.209MetLeu: 2.209 ± 0.471
0.855MetMet: 0.855 ± 0.301
1.639MetAsn: 1.639 ± 0.307
0.713MetPro: 0.713 ± 0.23
1.069MetGln: 1.069 ± 0.292
0.926MetArg: 0.926 ± 0.185
1.71MetSer: 1.71 ± 0.307
1.853MetThr: 1.853 ± 0.378
0.998MetVal: 0.998 ± 0.264
0.214MetTrp: 0.214 ± 0.105
0.926MetTyr: 0.926 ± 0.326
0.0MetXaa: 0.0 ± 0.0
Asn
3.705AsnAla: 3.705 ± 0.466
0.428AsnCys: 0.428 ± 0.16
4.204AsnAsp: 4.204 ± 0.525
4.632AsnGlu: 4.632 ± 0.597
3.563AsnPhe: 3.563 ± 0.555
3.777AsnGly: 3.777 ± 0.521
0.998AsnHis: 0.998 ± 0.247
6.342AsnIle: 6.342 ± 0.776
6.128AsnLys: 6.128 ± 0.588
5.7AsnLeu: 5.7 ± 0.696
1.14AsnMet: 1.14 ± 0.263
4.418AsnAsn: 4.418 ± 0.61
2.066AsnPro: 2.066 ± 0.364
2.28AsnGln: 2.28 ± 0.393
3.206AsnArg: 3.206 ± 0.439
4.418AsnSer: 4.418 ± 0.585
3.848AsnThr: 3.848 ± 0.572
3.919AsnVal: 3.919 ± 0.562
0.641AsnTrp: 0.641 ± 0.24
2.779AsnTyr: 2.779 ± 0.48
0.0AsnXaa: 0.0 ± 0.0
Pro
1.568ProAla: 1.568 ± 0.353
0.214ProCys: 0.214 ± 0.113
1.283ProAsp: 1.283 ± 0.33
2.779ProGlu: 2.779 ± 0.431
1.568ProPhe: 1.568 ± 0.403
1.069ProGly: 1.069 ± 0.292
0.285ProHis: 0.285 ± 0.132
2.209ProIle: 2.209 ± 0.319
2.28ProLys: 2.28 ± 0.435
2.28ProLeu: 2.28 ± 0.445
0.499ProMet: 0.499 ± 0.202
2.28ProAsn: 2.28 ± 0.26
0.926ProPro: 0.926 ± 0.23
1.069ProGln: 1.069 ± 0.272
0.713ProArg: 0.713 ± 0.241
1.924ProSer: 1.924 ± 0.403
1.568ProThr: 1.568 ± 0.379
1.853ProVal: 1.853 ± 0.51
0.285ProTrp: 0.285 ± 0.132
1.283ProTyr: 1.283 ± 0.293
0.0ProXaa: 0.0 ± 0.0
Gln
2.351GlnAla: 2.351 ± 0.684
0.285GlnCys: 0.285 ± 0.133
1.853GlnAsp: 1.853 ± 0.317
3.563GlnGlu: 3.563 ± 0.509
1.425GlnPhe: 1.425 ± 0.318
1.71GlnGly: 1.71 ± 0.315
0.713GlnHis: 0.713 ± 0.169
3.919GlnIle: 3.919 ± 0.48
4.062GlnLys: 4.062 ± 0.573
3.705GlnLeu: 3.705 ± 0.482
0.926GlnMet: 0.926 ± 0.243
1.924GlnAsn: 1.924 ± 0.332
0.998GlnPro: 0.998 ± 0.243
1.781GlnGln: 1.781 ± 0.339
1.568GlnArg: 1.568 ± 0.387
1.496GlnSer: 1.496 ± 0.253
2.565GlnThr: 2.565 ± 0.391
1.853GlnVal: 1.853 ± 0.37
0.499GlnTrp: 0.499 ± 0.182
1.354GlnTyr: 1.354 ± 0.268
0.0GlnXaa: 0.0 ± 0.0
Arg
2.28ArgAla: 2.28 ± 0.357
0.57ArgCys: 0.57 ± 0.191
2.565ArgAsp: 2.565 ± 0.495
2.708ArgGlu: 2.708 ± 0.468
2.351ArgPhe: 2.351 ± 0.416
1.924ArgGly: 1.924 ± 0.307
0.713ArgHis: 0.713 ± 0.249
3.705ArgIle: 3.705 ± 0.458
4.204ArgLys: 4.204 ± 0.498
2.993ArgLeu: 2.993 ± 0.46
0.784ArgMet: 0.784 ± 0.29
2.423ArgAsn: 2.423 ± 0.328
0.713ArgPro: 0.713 ± 0.171
1.496ArgGln: 1.496 ± 0.324
1.283ArgArg: 1.283 ± 0.346
1.924ArgSer: 1.924 ± 0.37
1.425ArgThr: 1.425 ± 0.288
2.494ArgVal: 2.494 ± 0.363
0.214ArgTrp: 0.214 ± 0.147
0.926ArgTyr: 0.926 ± 0.225
0.0ArgXaa: 0.0 ± 0.0
Ser
3.634SerAla: 3.634 ± 0.449
0.499SerCys: 0.499 ± 0.186
4.632SerAsp: 4.632 ± 0.475
4.133SerGlu: 4.133 ± 0.519
4.418SerPhe: 4.418 ± 0.538
3.99SerGly: 3.99 ± 0.721
0.855SerHis: 0.855 ± 0.236
5.059SerIle: 5.059 ± 0.513
5.415SerLys: 5.415 ± 0.636
5.487SerLeu: 5.487 ± 0.428
1.853SerMet: 1.853 ± 0.473
3.705SerAsn: 3.705 ± 0.453
1.781SerPro: 1.781 ± 0.308
2.565SerGln: 2.565 ± 0.384
1.853SerArg: 1.853 ± 0.39
4.489SerSer: 4.489 ± 0.557
3.563SerThr: 3.563 ± 0.514
3.42SerVal: 3.42 ± 0.514
0.499SerTrp: 0.499 ± 0.162
1.853SerTyr: 1.853 ± 0.368
0.0SerXaa: 0.0 ± 0.0
Thr
3.278ThrAla: 3.278 ± 0.625
0.285ThrCys: 0.285 ± 0.137
3.064ThrAsp: 3.064 ± 0.568
3.705ThrGlu: 3.705 ± 0.527
2.993ThrPhe: 2.993 ± 0.53
4.062ThrGly: 4.062 ± 0.541
0.784ThrHis: 0.784 ± 0.223
5.13ThrIle: 5.13 ± 0.547
4.56ThrLys: 4.56 ± 0.564
4.56ThrLeu: 4.56 ± 0.475
0.998ThrMet: 0.998 ± 0.223
3.42ThrAsn: 3.42 ± 0.516
2.565ThrPro: 2.565 ± 0.444
1.568ThrGln: 1.568 ± 0.374
1.354ThrArg: 1.354 ± 0.377
2.494ThrSer: 2.494 ± 0.394
2.423ThrThr: 2.423 ± 0.433
2.28ThrVal: 2.28 ± 0.352
0.499ThrTrp: 0.499 ± 0.151
2.28ThrTyr: 2.28 ± 0.539
0.0ThrXaa: 0.0 ± 0.0
Val
2.921ValAla: 2.921 ± 0.661
0.57ValCys: 0.57 ± 0.18
3.492ValAsp: 3.492 ± 0.378
3.492ValGlu: 3.492 ± 0.462
2.708ValPhe: 2.708 ± 0.406
2.565ValGly: 2.565 ± 0.371
1.283ValHis: 1.283 ± 0.365
4.632ValIle: 4.632 ± 0.475
3.99ValLys: 3.99 ± 0.525
5.273ValLeu: 5.273 ± 0.521
1.568ValMet: 1.568 ± 0.401
4.062ValAsn: 4.062 ± 0.546
1.354ValPro: 1.354 ± 0.415
1.924ValGln: 1.924 ± 0.347
1.354ValArg: 1.354 ± 0.252
3.777ValSer: 3.777 ± 0.434
2.209ValThr: 2.209 ± 0.361
3.206ValVal: 3.206 ± 0.398
1.069ValTrp: 1.069 ± 0.245
2.209ValTyr: 2.209 ± 0.436
0.0ValXaa: 0.0 ± 0.0
Trp
0.499TrpAla: 0.499 ± 0.208
0.071TrpCys: 0.071 ± 0.067
0.641TrpAsp: 0.641 ± 0.292
0.57TrpGlu: 0.57 ± 0.167
0.499TrpPhe: 0.499 ± 0.13
0.641TrpGly: 0.641 ± 0.184
0.143TrpHis: 0.143 ± 0.093
0.998TrpIle: 0.998 ± 0.354
1.496TrpLys: 1.496 ± 0.341
1.71TrpLeu: 1.71 ± 0.339
0.356TrpMet: 0.356 ± 0.169
0.428TrpAsn: 0.428 ± 0.158
0.214TrpPro: 0.214 ± 0.126
0.57TrpGln: 0.57 ± 0.176
0.499TrpArg: 0.499 ± 0.192
0.57TrpSer: 0.57 ± 0.202
0.499TrpThr: 0.499 ± 0.13
0.784TrpVal: 0.784 ± 0.198
0.143TrpTrp: 0.143 ± 0.09
0.641TrpTyr: 0.641 ± 0.203
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.425TyrAla: 1.425 ± 0.312
0.214TyrCys: 0.214 ± 0.109
1.995TyrAsp: 1.995 ± 0.298
2.066TyrGlu: 2.066 ± 0.301
2.423TyrPhe: 2.423 ± 0.449
1.995TyrGly: 1.995 ± 0.39
0.713TyrHis: 0.713 ± 0.185
3.919TyrIle: 3.919 ± 0.531
5.415TyrLys: 5.415 ± 0.612
3.99TyrLeu: 3.99 ± 0.615
0.926TyrMet: 0.926 ± 0.234
3.777TyrAsn: 3.777 ± 0.496
1.496TyrPro: 1.496 ± 0.297
1.853TyrGln: 1.853 ± 0.375
1.639TyrArg: 1.639 ± 0.341
2.636TyrSer: 2.636 ± 0.383
2.138TyrThr: 2.138 ± 0.349
1.283TyrVal: 1.283 ± 0.363
0.428TyrTrp: 0.428 ± 0.171
1.283TyrTyr: 1.283 ± 0.344
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 61 proteins (14035 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski