Amino acid dipepetide frequency for Escherichia phage vB_EcoS_ACG-M12

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.856AlaAla: 7.856 ± 0.999
0.714AlaCys: 0.714 ± 0.205
4.357AlaAsp: 4.357 ± 0.608
4.856AlaGlu: 4.856 ± 0.588
3.0AlaPhe: 3.0 ± 0.43
6.142AlaGly: 6.142 ± 0.587
1.214AlaHis: 1.214 ± 0.335
6.285AlaIle: 6.285 ± 0.593
6.285AlaLys: 6.285 ± 1.09
7.213AlaLeu: 7.213 ± 1.154
2.357AlaMet: 2.357 ± 0.416
4.214AlaAsn: 4.214 ± 0.655
1.928AlaPro: 1.928 ± 0.34
3.285AlaGln: 3.285 ± 0.567
4.571AlaArg: 4.571 ± 0.689
5.428AlaSer: 5.428 ± 0.688
4.571AlaThr: 4.571 ± 0.629
5.142AlaVal: 5.142 ± 0.642
1.071AlaTrp: 1.071 ± 0.247
2.143AlaTyr: 2.143 ± 0.296
0.0AlaXaa: 0.0 ± 0.0
Cys
0.928CysAla: 0.928 ± 0.28
0.214CysCys: 0.214 ± 0.127
1.071CysAsp: 1.071 ± 0.266
1.071CysGlu: 1.071 ± 0.297
0.214CysPhe: 0.214 ± 0.126
1.286CysGly: 1.286 ± 0.35
0.214CysHis: 0.214 ± 0.143
0.214CysIle: 0.214 ± 0.108
0.857CysLys: 0.857 ± 0.266
1.143CysLeu: 1.143 ± 0.321
0.571CysMet: 0.571 ± 0.226
0.786CysAsn: 0.786 ± 0.268
0.143CysPro: 0.143 ± 0.098
0.286CysGln: 0.286 ± 0.134
0.643CysArg: 0.643 ± 0.216
1.5CysSer: 1.5 ± 0.315
0.714CysThr: 0.714 ± 0.212
0.857CysVal: 0.857 ± 0.265
0.286CysTrp: 0.286 ± 0.127
0.429CysTyr: 0.429 ± 0.152
0.0CysXaa: 0.0 ± 0.0
Asp
4.928AspAla: 4.928 ± 0.841
0.786AspCys: 0.786 ± 0.262
3.857AspAsp: 3.857 ± 0.583
4.142AspGlu: 4.142 ± 0.539
2.857AspPhe: 2.857 ± 0.392
7.07AspGly: 7.07 ± 0.71
0.786AspHis: 0.786 ± 0.261
4.071AspIle: 4.071 ± 0.504
4.571AspLys: 4.571 ± 0.572
3.999AspLeu: 3.999 ± 0.497
1.357AspMet: 1.357 ± 0.313
3.285AspAsn: 3.285 ± 0.467
1.857AspPro: 1.857 ± 0.367
1.071AspGln: 1.071 ± 0.212
2.0AspArg: 2.0 ± 0.383
4.071AspSer: 4.071 ± 0.605
2.571AspThr: 2.571 ± 0.416
4.428AspVal: 4.428 ± 0.464
1.357AspTrp: 1.357 ± 0.285
2.785AspTyr: 2.785 ± 0.374
0.0AspXaa: 0.0 ± 0.0
Glu
4.928GluAla: 4.928 ± 0.557
0.928GluCys: 0.928 ± 0.301
3.142GluAsp: 3.142 ± 0.441
4.428GluGlu: 4.428 ± 0.716
3.142GluPhe: 3.142 ± 0.511
3.785GluGly: 3.785 ± 0.54
0.714GluHis: 0.714 ± 0.221
5.214GluIle: 5.214 ± 0.559
3.857GluLys: 3.857 ± 0.601
5.356GluLeu: 5.356 ± 0.66
2.928GluMet: 2.928 ± 0.54
3.428GluAsn: 3.428 ± 0.511
1.857GluPro: 1.857 ± 0.302
2.571GluGln: 2.571 ± 0.672
2.928GluArg: 2.928 ± 0.558
3.928GluSer: 3.928 ± 0.522
3.357GluThr: 3.357 ± 0.426
5.928GluVal: 5.928 ± 0.701
0.928GluTrp: 0.928 ± 0.305
3.214GluTyr: 3.214 ± 0.508
0.0GluXaa: 0.0 ± 0.0
Phe
2.071PheAla: 2.071 ± 0.41
0.857PheCys: 0.857 ± 0.282
3.285PheAsp: 3.285 ± 0.417
3.428PheGlu: 3.428 ± 0.648
1.143PhePhe: 1.143 ± 0.308
3.714PheGly: 3.714 ± 0.45
0.571PheHis: 0.571 ± 0.226
2.714PheIle: 2.714 ± 0.453
2.857PheLys: 2.857 ± 0.482
2.285PheLeu: 2.285 ± 0.524
1.0PheMet: 1.0 ± 0.253
1.643PheAsn: 1.643 ± 0.337
1.214PhePro: 1.214 ± 0.27
1.5PheGln: 1.5 ± 0.346
2.214PheArg: 2.214 ± 0.482
2.642PheSer: 2.642 ± 0.425
2.214PheThr: 2.214 ± 0.368
2.928PheVal: 2.928 ± 0.446
0.357PheTrp: 0.357 ± 0.116
0.857PheTyr: 0.857 ± 0.23
0.0PheXaa: 0.0 ± 0.0
Gly
5.142GlyAla: 5.142 ± 0.776
1.643GlyCys: 1.643 ± 0.383
4.499GlyAsp: 4.499 ± 0.548
4.499GlyGlu: 4.499 ± 0.532
3.0GlyPhe: 3.0 ± 0.389
6.499GlyGly: 6.499 ± 1.156
1.0GlyHis: 1.0 ± 0.29
5.571GlyIle: 5.571 ± 0.612
5.142GlyLys: 5.142 ± 0.8
6.713GlyLeu: 6.713 ± 0.693
2.071GlyMet: 2.071 ± 0.401
3.5GlyAsn: 3.5 ± 0.473
0.643GlyPro: 0.643 ± 0.228
2.214GlyGln: 2.214 ± 0.348
3.285GlyArg: 3.285 ± 0.432
5.142GlySer: 5.142 ± 0.592
3.142GlyThr: 3.142 ± 0.551
5.856GlyVal: 5.856 ± 0.634
0.857GlyTrp: 0.857 ± 0.264
3.785GlyTyr: 3.785 ± 0.583
0.0GlyXaa: 0.0 ± 0.0
His
0.928HisAla: 0.928 ± 0.273
0.214HisCys: 0.214 ± 0.101
1.071HisAsp: 1.071 ± 0.25
0.714HisGlu: 0.714 ± 0.234
0.429HisPhe: 0.429 ± 0.166
1.0HisGly: 1.0 ± 0.298
0.571HisHis: 0.571 ± 0.247
1.143HisIle: 1.143 ± 0.331
1.214HisLys: 1.214 ± 0.308
1.143HisLeu: 1.143 ± 0.32
0.5HisMet: 0.5 ± 0.259
1.0HisAsn: 1.0 ± 0.22
0.0HisPro: 0.0 ± 0.0
0.857HisGln: 0.857 ± 0.29
0.857HisArg: 0.857 ± 0.238
0.571HisSer: 0.571 ± 0.205
1.071HisThr: 1.071 ± 0.302
1.0HisVal: 1.0 ± 0.272
0.214HisTrp: 0.214 ± 0.127
0.429HisTyr: 0.429 ± 0.172
0.0HisXaa: 0.0 ± 0.0
Ile
6.57IleAla: 6.57 ± 0.993
0.857IleCys: 0.857 ± 0.285
5.999IleAsp: 5.999 ± 0.522
3.5IleGlu: 3.5 ± 0.462
1.785IlePhe: 1.785 ± 0.334
4.357IleGly: 4.357 ± 0.628
1.214IleHis: 1.214 ± 0.269
3.714IleIle: 3.714 ± 0.676
4.714IleLys: 4.714 ± 0.634
3.214IleLeu: 3.214 ± 0.527
1.643IleMet: 1.643 ± 0.345
4.142IleAsn: 4.142 ± 0.525
3.071IlePro: 3.071 ± 0.477
2.5IleGln: 2.5 ± 0.529
3.0IleArg: 3.0 ± 0.405
5.214IleSer: 5.214 ± 0.608
4.928IleThr: 4.928 ± 0.519
4.285IleVal: 4.285 ± 0.629
0.786IleTrp: 0.786 ± 0.23
2.714IleTyr: 2.714 ± 0.512
0.0IleXaa: 0.0 ± 0.0
Lys
6.928LysAla: 6.928 ± 0.928
0.5LysCys: 0.5 ± 0.181
3.928LysAsp: 3.928 ± 0.524
5.571LysGlu: 5.571 ± 1.028
3.071LysPhe: 3.071 ± 0.475
3.285LysGly: 3.285 ± 0.453
0.928LysHis: 0.928 ± 0.284
4.357LysIle: 4.357 ± 0.544
4.357LysLys: 4.357 ± 0.681
5.428LysLeu: 5.428 ± 0.797
2.714LysMet: 2.714 ± 0.442
2.571LysAsn: 2.571 ± 0.414
1.857LysPro: 1.857 ± 0.402
2.0LysGln: 2.0 ± 0.36
2.785LysArg: 2.785 ± 0.671
4.428LysSer: 4.428 ± 0.488
3.714LysThr: 3.714 ± 0.5
4.785LysVal: 4.785 ± 0.701
0.928LysTrp: 0.928 ± 0.262
3.0LysTyr: 3.0 ± 0.479
0.0LysXaa: 0.0 ± 0.0
Leu
6.356LeuAla: 6.356 ± 0.819
1.071LeuCys: 1.071 ± 0.235
3.857LeuAsp: 3.857 ± 0.553
4.428LeuGlu: 4.428 ± 0.708
2.0LeuPhe: 2.0 ± 0.325
3.928LeuGly: 3.928 ± 0.633
0.857LeuHis: 0.857 ± 0.268
5.285LeuIle: 5.285 ± 0.573
3.928LeuLys: 3.928 ± 0.511
3.642LeuLeu: 3.642 ± 0.414
1.286LeuMet: 1.286 ± 0.314
3.785LeuAsn: 3.785 ± 0.437
3.0LeuPro: 3.0 ± 0.445
2.785LeuGln: 2.785 ± 0.808
4.142LeuArg: 4.142 ± 0.596
5.642LeuSer: 5.642 ± 0.712
4.499LeuThr: 4.499 ± 0.699
5.356LeuVal: 5.356 ± 0.552
0.5LeuTrp: 0.5 ± 0.224
1.928LeuTyr: 1.928 ± 0.401
0.0LeuXaa: 0.0 ± 0.0
Met
2.857MetAla: 2.857 ± 0.392
0.357MetCys: 0.357 ± 0.153
0.928MetAsp: 0.928 ± 0.263
1.428MetGlu: 1.428 ± 0.343
1.5MetPhe: 1.5 ± 0.334
0.786MetGly: 0.786 ± 0.255
0.5MetHis: 0.5 ± 0.209
2.428MetIle: 2.428 ± 0.402
1.714MetLys: 1.714 ± 0.351
1.428MetLeu: 1.428 ± 0.362
0.928MetMet: 0.928 ± 0.318
1.428MetAsn: 1.428 ± 0.308
0.5MetPro: 0.5 ± 0.213
0.857MetGln: 0.857 ± 0.223
2.071MetArg: 2.071 ± 0.354
1.714MetSer: 1.714 ± 0.414
2.285MetThr: 2.285 ± 0.44
1.785MetVal: 1.785 ± 0.383
0.357MetTrp: 0.357 ± 0.179
0.643MetTyr: 0.643 ± 0.225
0.0MetXaa: 0.0 ± 0.0
Asn
4.357AsnAla: 4.357 ± 0.576
0.714AsnCys: 0.714 ± 0.223
3.285AsnAsp: 3.285 ± 0.461
3.857AsnGlu: 3.857 ± 0.512
1.785AsnPhe: 1.785 ± 0.299
5.713AsnGly: 5.713 ± 1.097
1.214AsnHis: 1.214 ± 0.316
2.357AsnIle: 2.357 ± 0.357
3.285AsnLys: 3.285 ± 0.49
3.285AsnLeu: 3.285 ± 0.549
0.928AsnMet: 0.928 ± 0.225
3.428AsnAsn: 3.428 ± 0.544
1.928AsnPro: 1.928 ± 0.32
2.143AsnGln: 2.143 ± 0.468
2.214AsnArg: 2.214 ± 0.375
4.142AsnSer: 4.142 ± 0.592
2.5AsnThr: 2.5 ± 0.608
3.714AsnVal: 3.714 ± 0.429
0.857AsnTrp: 0.857 ± 0.22
1.857AsnTyr: 1.857 ± 0.406
0.0AsnXaa: 0.0 ± 0.0
Pro
2.928ProAla: 2.928 ± 0.446
0.5ProCys: 0.5 ± 0.219
1.785ProAsp: 1.785 ± 0.426
2.357ProGlu: 2.357 ± 0.381
1.571ProPhe: 1.571 ± 0.332
2.071ProGly: 2.071 ± 0.348
0.5ProHis: 0.5 ± 0.183
2.0ProIle: 2.0 ± 0.301
1.286ProLys: 1.286 ± 0.305
1.428ProLeu: 1.428 ± 0.318
0.714ProMet: 0.714 ± 0.218
1.286ProAsn: 1.286 ± 0.301
0.714ProPro: 0.714 ± 0.241
1.0ProGln: 1.0 ± 0.29
1.571ProArg: 1.571 ± 0.307
1.5ProSer: 1.5 ± 0.326
1.714ProThr: 1.714 ± 0.399
2.857ProVal: 2.857 ± 0.525
0.571ProTrp: 0.571 ± 0.213
1.5ProTyr: 1.5 ± 0.306
0.0ProXaa: 0.0 ± 0.0
Gln
3.5GlnAla: 3.5 ± 0.941
0.214GlnCys: 0.214 ± 0.117
1.357GlnAsp: 1.357 ± 0.271
2.357GlnGlu: 2.357 ± 0.433
1.214GlnPhe: 1.214 ± 0.32
2.357GlnGly: 2.357 ± 0.369
0.357GlnHis: 0.357 ± 0.175
3.357GlnIle: 3.357 ± 0.598
2.5GlnLys: 2.5 ± 0.352
2.571GlnLeu: 2.571 ± 0.597
0.643GlnMet: 0.643 ± 0.191
1.928GlnAsn: 1.928 ± 0.469
1.071GlnPro: 1.071 ± 0.257
2.214GlnGln: 2.214 ± 0.64
1.286GlnArg: 1.286 ± 0.36
3.285GlnSer: 3.285 ± 0.5
1.643GlnThr: 1.643 ± 0.314
2.428GlnVal: 2.428 ± 0.356
0.571GlnTrp: 0.571 ± 0.245
1.357GlnTyr: 1.357 ± 0.278
0.0GlnXaa: 0.0 ± 0.0
Arg
3.928ArgAla: 3.928 ± 0.556
0.643ArgCys: 0.643 ± 0.333
2.642ArgAsp: 2.642 ± 0.367
3.714ArgGlu: 3.714 ± 0.464
2.214ArgPhe: 2.214 ± 0.396
2.428ArgGly: 2.428 ± 0.36
0.786ArgHis: 0.786 ± 0.243
3.571ArgIle: 3.571 ± 0.47
4.214ArgLys: 4.214 ± 0.485
3.142ArgLeu: 3.142 ± 0.445
1.214ArgMet: 1.214 ± 0.326
2.714ArgAsn: 2.714 ± 0.478
1.643ArgPro: 1.643 ± 0.342
1.714ArgGln: 1.714 ± 0.34
2.428ArgArg: 2.428 ± 0.356
2.928ArgSer: 2.928 ± 0.467
1.785ArgThr: 1.785 ± 0.402
3.714ArgVal: 3.714 ± 0.44
0.714ArgTrp: 0.714 ± 0.222
2.285ArgTyr: 2.285 ± 0.353
0.0ArgXaa: 0.0 ± 0.0
Ser
4.999SerAla: 4.999 ± 0.63
0.786SerCys: 0.786 ± 0.22
5.499SerAsp: 5.499 ± 0.576
4.928SerGlu: 4.928 ± 0.517
3.0SerPhe: 3.0 ± 0.427
6.356SerGly: 6.356 ± 0.538
1.143SerHis: 1.143 ± 0.305
3.999SerIle: 3.999 ± 0.462
3.928SerLys: 3.928 ± 0.629
4.928SerLeu: 4.928 ± 0.551
1.428SerMet: 1.428 ± 0.396
3.428SerAsn: 3.428 ± 0.515
2.357SerPro: 2.357 ± 0.406
2.428SerGln: 2.428 ± 0.409
3.642SerArg: 3.642 ± 0.404
4.571SerSer: 4.571 ± 0.822
4.214SerThr: 4.214 ± 0.524
5.285SerVal: 5.285 ± 0.467
0.643SerTrp: 0.643 ± 0.217
2.428SerTyr: 2.428 ± 0.527
0.0SerXaa: 0.0 ± 0.0
Thr
4.714ThrAla: 4.714 ± 0.717
0.5ThrCys: 0.5 ± 0.221
2.642ThrAsp: 2.642 ± 0.453
2.857ThrGlu: 2.857 ± 0.337
2.285ThrPhe: 2.285 ± 0.376
5.571ThrGly: 5.571 ± 0.658
0.643ThrHis: 0.643 ± 0.265
3.785ThrIle: 3.785 ± 0.443
3.357ThrLys: 3.357 ± 0.499
3.571ThrLeu: 3.571 ± 0.441
1.357ThrMet: 1.357 ± 0.294
3.357ThrAsn: 3.357 ± 0.568
2.143ThrPro: 2.143 ± 0.334
2.428ThrGln: 2.428 ± 0.488
1.928ThrArg: 1.928 ± 0.292
3.714ThrSer: 3.714 ± 0.494
2.928ThrThr: 2.928 ± 0.555
3.357ThrVal: 3.357 ± 0.501
0.5ThrTrp: 0.5 ± 0.182
2.857ThrTyr: 2.857 ± 0.492
0.0ThrXaa: 0.0 ± 0.0
Val
5.356ValAla: 5.356 ± 0.77
1.0ValCys: 1.0 ± 0.284
4.642ValAsp: 4.642 ± 0.51
4.785ValGlu: 4.785 ± 0.663
3.142ValPhe: 3.142 ± 0.46
3.999ValGly: 3.999 ± 0.626
0.643ValHis: 0.643 ± 0.234
4.499ValIle: 4.499 ± 0.51
5.713ValLys: 5.713 ± 0.687
3.999ValLeu: 3.999 ± 0.558
1.928ValMet: 1.928 ± 0.375
4.928ValAsn: 4.928 ± 0.472
2.071ValPro: 2.071 ± 0.386
2.571ValGln: 2.571 ± 0.597
4.142ValArg: 4.142 ± 0.542
5.856ValSer: 5.856 ± 0.595
4.071ValThr: 4.071 ± 0.583
5.142ValVal: 5.142 ± 0.646
0.928ValTrp: 0.928 ± 0.252
2.714ValTyr: 2.714 ± 0.412
0.0ValXaa: 0.0 ± 0.0
Trp
0.786TrpAla: 0.786 ± 0.169
0.214TrpCys: 0.214 ± 0.127
1.0TrpAsp: 1.0 ± 0.232
0.786TrpGlu: 0.786 ± 0.248
0.928TrpPhe: 0.928 ± 0.27
0.714TrpGly: 0.714 ± 0.183
0.571TrpHis: 0.571 ± 0.211
0.928TrpIle: 0.928 ± 0.268
1.286TrpLys: 1.286 ± 0.294
1.214TrpLeu: 1.214 ± 0.288
0.143TrpMet: 0.143 ± 0.096
0.5TrpAsn: 0.5 ± 0.155
0.429TrpPro: 0.429 ± 0.197
0.357TrpGln: 0.357 ± 0.157
0.786TrpArg: 0.786 ± 0.211
0.714TrpSer: 0.714 ± 0.259
0.5TrpThr: 0.5 ± 0.195
1.143TrpVal: 1.143 ± 0.265
0.071TrpTrp: 0.071 ± 0.063
0.357TrpTyr: 0.357 ± 0.15
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.571TyrAla: 2.571 ± 0.438
0.714TyrCys: 0.714 ± 0.204
3.0TyrAsp: 3.0 ± 0.483
2.857TyrGlu: 2.857 ± 0.465
1.214TyrPhe: 1.214 ± 0.345
3.142TyrGly: 3.142 ± 0.429
0.5TyrHis: 0.5 ± 0.179
2.714TyrIle: 2.714 ± 0.527
2.143TyrLys: 2.143 ± 0.466
2.571TyrLeu: 2.571 ± 0.338
0.714TyrMet: 0.714 ± 0.249
2.214TyrAsn: 2.214 ± 0.387
1.428TyrPro: 1.428 ± 0.349
1.428TyrGln: 1.428 ± 0.325
1.928TyrArg: 1.928 ± 0.348
3.0TyrSer: 3.0 ± 0.527
2.071TyrThr: 2.071 ± 0.345
2.143TyrVal: 2.143 ± 0.381
0.928TyrTrp: 0.928 ± 0.237
1.428TyrTyr: 1.428 ± 0.276
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 78 proteins (14003 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski