Amino acid dipeptide frequency for Enterobacteria phage HK225

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility the dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
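The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the Proteome-pI implementation: it assumes that dipeptides are counted as overlapping pairs within each protein, that frequencies are normalized by the total number of dipeptides, and that the reference value is the uniform expectation of 1/400. The function names and the toy sequences are hypothetical.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)
EXPECTED = 1000.0 / 400            # uniform expectation: 2.5 per mille (0.25%)


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs.

    Assumption: any non-standard letter is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in sequences:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq):
    """Label a per mille frequency relative to the uniform expectation."""
    if freq > EXPECTED:
        return "over"   # would be marked in red
    if freq < EXPECTED:
        return "under"  # would be marked in blue
    return "expected"


# Toy input (hypothetical sequences, not taken from the proteome).
toy = ["MKAAL", "AAK"]
freqs = dipeptide_per_mille(toy)
```

With the values from the table, `classify(11.64)` (AlaAla) gives `"over"` and `classify(0.857)` (AlaCys) gives `"under"`.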

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Dipeptide: frequency (‰) ± error

Ala
AlaAla: 11.64 ± 1.475
AlaCys: 0.857 ± 0.24
AlaAsp: 6.284 ± 0.628
AlaGlu: 7.426 ± 0.723
AlaPhe: 3.356 ± 0.389
AlaGly: 7.212 ± 0.777
AlaHis: 1.571 ± 0.382
AlaIle: 5.356 ± 0.652
AlaLys: 4.284 ± 0.503
AlaLeu: 7.712 ± 0.929
AlaMet: 3.927 ± 0.533
AlaAsn: 3.499 ± 0.539
AlaPro: 2.571 ± 0.478
AlaGln: 5.427 ± 0.842
AlaArg: 5.356 ± 0.759
AlaSer: 6.213 ± 0.768
AlaThr: 5.498 ± 0.925
AlaVal: 7.069 ± 0.727
AlaTrp: 1.642 ± 0.333
AlaTyr: 2.499 ± 0.432
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 1.214 ± 0.307
CysCys: 0.214 ± 0.122
CysAsp: 0.928 ± 0.236
CysGlu: 0.571 ± 0.169
CysPhe: 0.286 ± 0.128
CysGly: 1.143 ± 0.335
CysHis: 0.286 ± 0.161
CysIle: 0.428 ± 0.176
CysLys: 0.643 ± 0.227
CysLeu: 0.857 ± 0.235
CysMet: 0.286 ± 0.148
CysAsn: 0.286 ± 0.121
CysPro: 0.428 ± 0.184
CysGln: 0.643 ± 0.204
CysArg: 0.785 ± 0.243
CysSer: 0.643 ± 0.254
CysThr: 0.5 ± 0.154
CysVal: 0.5 ± 0.211
CysTrp: 0.357 ± 0.178
CysTyr: 0.143 ± 0.104
CysXaa: 0.0 ± 0.0

Asp
AspAla: 5.784 ± 0.719
AspCys: 0.928 ± 0.311
AspAsp: 4.213 ± 0.597
AspGlu: 4.499 ± 0.858
AspPhe: 2.285 ± 0.374
AspGly: 5.141 ± 0.605
AspHis: 0.5 ± 0.173
AspIle: 2.785 ± 0.416
AspLys: 2.571 ± 0.402
AspLeu: 4.284 ± 0.624
AspMet: 1.785 ± 0.413
AspAsn: 2.071 ± 0.383
AspPro: 1.928 ± 0.52
AspGln: 1.642 ± 0.292
AspArg: 2.499 ± 0.394
AspSer: 3.428 ± 0.535
AspThr: 3.999 ± 0.52
AspVal: 3.142 ± 0.369
AspTrp: 1.785 ± 0.316
AspTyr: 2.499 ± 0.485
AspXaa: 0.0 ± 0.0

Glu
GluAla: 6.284 ± 0.738
GluCys: 0.857 ± 0.288
GluAsp: 2.356 ± 0.365
GluGlu: 4.356 ± 0.684
GluPhe: 1.999 ± 0.357
GluGly: 3.999 ± 0.521
GluHis: 1.071 ± 0.267
GluIle: 3.428 ± 0.572
GluLys: 3.927 ± 0.511
GluLeu: 6.355 ± 0.598
GluMet: 1.642 ± 0.289
GluAsn: 2.856 ± 0.41
GluPro: 2.928 ± 0.599
GluGln: 3.856 ± 0.503
GluArg: 3.57 ± 0.535
GluSer: 4.142 ± 0.507
GluThr: 3.856 ± 0.498
GluVal: 4.642 ± 0.565
GluTrp: 0.928 ± 0.27
GluTyr: 2.356 ± 0.396
GluXaa: 0.0 ± 0.0

Phe
PheAla: 1.928 ± 0.28
PheCys: 0.571 ± 0.175
PheAsp: 2.285 ± 0.357
PheGlu: 2.142 ± 0.387
PhePhe: 1.0 ± 0.212
PheGly: 1.857 ± 0.349
PheHis: 0.643 ± 0.189
PheIle: 1.571 ± 0.337
PheLys: 1.5 ± 0.267
PheLeu: 2.071 ± 0.42
PheMet: 0.928 ± 0.247
PheAsn: 1.857 ± 0.358
PhePro: 1.785 ± 0.366
PheGln: 0.785 ± 0.241
PheArg: 2.071 ± 0.411
PheSer: 2.071 ± 0.342
PheThr: 2.214 ± 0.35
PheVal: 1.928 ± 0.284
PheTrp: 0.785 ± 0.195
PheTyr: 1.0 ± 0.235
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 4.927 ± 0.686
GlyCys: 0.857 ± 0.271
GlyAsp: 4.856 ± 0.473
GlyGlu: 3.713 ± 0.5
GlyPhe: 1.928 ± 0.369
GlyGly: 6.07 ± 1.041
GlyHis: 0.714 ± 0.279
GlyIle: 5.141 ± 0.566
GlyLys: 4.142 ± 0.504
GlyLeu: 5.641 ± 0.643
GlyMet: 2.571 ± 0.494
GlyAsn: 4.642 ± 0.593
GlyPro: 1.5 ± 0.325
GlyGln: 2.571 ± 0.411
GlyArg: 3.713 ± 0.537
GlySer: 4.07 ± 0.594
GlyThr: 4.784 ± 0.748
GlyVal: 5.213 ± 0.631
GlyTrp: 1.214 ± 0.272
GlyTyr: 2.999 ± 0.456
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.357 ± 0.303
HisCys: 0.286 ± 0.147
HisAsp: 1.071 ± 0.261
HisGlu: 0.785 ± 0.284
HisPhe: 0.928 ± 0.307
HisGly: 0.928 ± 0.248
HisHis: 0.428 ± 0.171
HisIle: 1.071 ± 0.253
HisLys: 0.928 ± 0.245
HisLeu: 1.143 ± 0.458
HisMet: 0.571 ± 0.258
HisAsn: 0.643 ± 0.207
HisPro: 0.428 ± 0.159
HisGln: 0.857 ± 0.209
HisArg: 1.071 ± 0.262
HisSer: 1.0 ± 0.269
HisThr: 0.785 ± 0.198
HisVal: 1.071 ± 0.25
HisTrp: 0.357 ± 0.171
HisTyr: 0.5 ± 0.193
HisXaa: 0.0 ± 0.0

Ile
IleAla: 5.641 ± 0.888
IleCys: 0.714 ± 0.225
IleAsp: 3.213 ± 0.409
IleGlu: 4.427 ± 0.619
IlePhe: 1.642 ± 0.279
IleGly: 2.785 ± 0.54
IleHis: 0.785 ± 0.197
IleIle: 3.285 ± 0.537
IleLys: 2.642 ± 0.479
IleLeu: 3.213 ± 0.426
IleMet: 0.714 ± 0.265
IleAsn: 2.499 ± 0.447
IlePro: 2.785 ± 0.417
IleGln: 1.999 ± 0.397
IleArg: 2.928 ± 0.324
IleSer: 4.642 ± 0.657
IleThr: 3.356 ± 0.641
IleVal: 3.499 ± 0.478
IleTrp: 1.071 ± 0.283
IleTyr: 1.285 ± 0.305
IleXaa: 0.0 ± 0.0

Lys
LysAla: 4.07 ± 0.577
LysCys: 0.428 ± 0.19
LysAsp: 2.714 ± 0.37
LysGlu: 4.07 ± 0.614
LysPhe: 1.285 ± 0.32
LysGly: 2.928 ± 0.529
LysHis: 0.714 ± 0.231
LysIle: 2.928 ± 0.492
LysLys: 3.927 ± 0.643
LysLeu: 4.142 ± 0.494
LysMet: 1.285 ± 0.35
LysAsn: 2.142 ± 0.425
LysPro: 3.142 ± 0.513
LysGln: 2.642 ± 0.385
LysArg: 2.928 ± 0.659
LysSer: 3.785 ± 0.619
LysThr: 4.356 ± 0.529
LysVal: 3.428 ± 0.624
LysTrp: 0.857 ± 0.282
LysTyr: 1.785 ± 0.391
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 7.926 ± 0.841
LeuCys: 0.785 ± 0.287
LeuAsp: 5.141 ± 0.578
LeuGlu: 4.784 ± 0.69
LeuPhe: 1.785 ± 0.314
LeuGly: 4.57 ± 0.651
LeuHis: 1.642 ± 0.314
LeuIle: 3.213 ± 0.532
LeuLys: 4.427 ± 0.501
LeuLeu: 5.855 ± 0.669
LeuMet: 2.071 ± 0.329
LeuAsn: 2.999 ± 0.423
LeuPro: 4.356 ± 0.546
LeuGln: 3.356 ± 0.583
LeuArg: 5.284 ± 0.435
LeuSer: 5.356 ± 0.619
LeuThr: 5.427 ± 0.627
LeuVal: 5.284 ± 0.598
LeuTrp: 0.928 ± 0.233
LeuTyr: 1.857 ± 0.337
LeuXaa: 0.0 ± 0.0

Met
MetAla: 3.57 ± 0.495
MetCys: 0.357 ± 0.173
MetAsp: 1.143 ± 0.3
MetGlu: 1.214 ± 0.285
MetPhe: 0.643 ± 0.236
MetGly: 1.428 ± 0.324
MetHis: 0.857 ± 0.261
MetIle: 1.357 ± 0.322
MetLys: 2.428 ± 0.34
MetLeu: 2.285 ± 0.362
MetMet: 0.714 ± 0.202
MetAsn: 0.857 ± 0.257
MetPro: 1.214 ± 0.291
MetGln: 1.0 ± 0.283
MetArg: 1.571 ± 0.271
MetSer: 2.999 ± 0.499
MetThr: 2.071 ± 0.385
MetVal: 1.785 ± 0.331
MetTrp: 0.286 ± 0.118
MetTyr: 0.5 ± 0.205
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 4.427 ± 0.682
AsnCys: 0.571 ± 0.23
AsnAsp: 2.071 ± 0.406
AsnGlu: 2.785 ± 0.423
AsnPhe: 1.071 ± 0.22
AsnGly: 4.284 ± 0.572
AsnHis: 1.0 ± 0.254
AsnIle: 2.356 ± 0.365
AsnLys: 2.214 ± 0.416
AsnLeu: 2.642 ± 0.45
AsnMet: 1.214 ± 0.309
AsnAsn: 1.999 ± 0.384
AsnPro: 2.499 ± 0.39
AsnGln: 1.785 ± 0.379
AsnArg: 2.928 ± 0.405
AsnSer: 2.428 ± 0.411
AsnThr: 2.928 ± 0.524
AsnVal: 2.356 ± 0.435
AsnTrp: 1.0 ± 0.261
AsnTyr: 1.5 ± 0.245
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 4.784 ± 0.6
ProCys: 0.571 ± 0.226
ProAsp: 3.213 ± 0.533
ProGlu: 2.642 ± 0.517
ProPhe: 1.571 ± 0.304
ProGly: 3.356 ± 0.481
ProHis: 0.643 ± 0.19
ProIle: 1.428 ± 0.281
ProLys: 2.499 ± 0.396
ProLeu: 2.785 ± 0.375
ProMet: 0.714 ± 0.201
ProAsn: 1.143 ± 0.235
ProPro: 1.285 ± 0.334
ProGln: 2.071 ± 0.444
ProArg: 1.999 ± 0.439
ProSer: 2.428 ± 0.534
ProThr: 2.714 ± 0.444
ProVal: 4.356 ± 0.563
ProTrp: 0.571 ± 0.203
ProTyr: 1.143 ± 0.305
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 4.427 ± 0.617
GlnCys: 0.286 ± 0.166
GlnAsp: 1.428 ± 0.286
GlnGlu: 3.285 ± 0.49
GlnPhe: 1.5 ± 0.27
GlnGly: 1.928 ± 0.398
GlnHis: 0.357 ± 0.18
GlnIle: 2.142 ± 0.328
GlnLys: 2.071 ± 0.425
GlnLeu: 4.356 ± 0.463
GlnMet: 1.357 ± 0.392
GlnAsn: 2.142 ± 0.333
GlnPro: 2.428 ± 0.457
GlnGln: 2.999 ± 0.542
GlnArg: 2.571 ± 0.361
GlnSer: 3.142 ± 0.423
GlnThr: 3.213 ± 0.515
GlnVal: 3.142 ± 0.461
GlnTrp: 0.857 ± 0.222
GlnTyr: 1.143 ± 0.353
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 5.927 ± 0.769
ArgCys: 0.214 ± 0.132
ArgAsp: 2.428 ± 0.359
ArgGlu: 4.142 ± 0.595
ArgPhe: 1.857 ± 0.326
ArgGly: 3.499 ± 0.456
ArgHis: 0.785 ± 0.234
ArgIle: 3.57 ± 0.44
ArgLys: 3.428 ± 0.416
ArgLeu: 4.642 ± 0.561
ArgMet: 1.928 ± 0.431
ArgAsn: 3.213 ± 0.472
ArgPro: 1.5 ± 0.293
ArgGln: 2.571 ± 0.435
ArgArg: 4.284 ± 0.635
ArgSer: 2.785 ± 0.422
ArgThr: 2.785 ± 0.387
ArgVal: 4.856 ± 0.613
ArgTrp: 0.857 ± 0.26
ArgTyr: 1.928 ± 0.388
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 6.784 ± 0.761
SerCys: 0.357 ± 0.144
SerAsp: 4.57 ± 0.512
SerGlu: 3.927 ± 0.49
SerPhe: 1.928 ± 0.32
SerGly: 6.498 ± 0.949
SerHis: 1.143 ± 0.308
SerIle: 3.142 ± 0.608
SerLys: 2.214 ± 0.465
SerLeu: 4.642 ± 0.604
SerMet: 1.642 ± 0.324
SerAsn: 2.999 ± 0.365
SerPro: 2.785 ± 0.413
SerGln: 2.499 ± 0.368
SerArg: 3.785 ± 0.511
SerSer: 4.427 ± 0.91
SerThr: 4.284 ± 0.55
SerVal: 5.713 ± 0.67
SerTrp: 1.357 ± 0.296
SerTyr: 1.642 ± 0.322
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 7.712 ± 0.819
ThrCys: 0.785 ± 0.243
ThrAsp: 3.927 ± 0.516
ThrGlu: 3.428 ± 0.428
ThrPhe: 1.642 ± 0.324
ThrGly: 5.07 ± 0.665
ThrHis: 1.143 ± 0.347
ThrIle: 3.356 ± 0.45
ThrLys: 2.499 ± 0.387
ThrLeu: 5.427 ± 0.549
ThrMet: 1.928 ± 0.392
ThrAsn: 2.356 ± 0.453
ThrPro: 3.356 ± 0.507
ThrGln: 3.213 ± 0.357
ThrArg: 3.213 ± 0.543
ThrSer: 3.071 ± 0.455
ThrThr: 3.856 ± 0.639
ThrVal: 4.713 ± 0.715
ThrTrp: 1.0 ± 0.244
ThrTyr: 2.499 ± 0.423
ThrXaa: 0.0 ± 0.0

Val
ValAla: 6.213 ± 0.58
ValCys: 0.5 ± 0.234
ValAsp: 3.713 ± 0.496
ValGlu: 4.784 ± 0.634
ValPhe: 2.571 ± 0.375
ValGly: 4.856 ± 0.692
ValHis: 0.785 ± 0.241
ValIle: 4.213 ± 0.565
ValLys: 4.856 ± 0.778
ValLeu: 4.927 ± 0.674
ValMet: 2.356 ± 0.534
ValAsn: 3.713 ± 0.54
ValPro: 2.642 ± 0.397
ValGln: 2.285 ± 0.358
ValArg: 3.642 ± 0.486
ValSer: 6.07 ± 0.822
ValThr: 4.142 ± 0.605
ValVal: 3.642 ± 0.498
ValTrp: 1.0 ± 0.257
ValTyr: 2.071 ± 0.467
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.714 ± 0.242
TrpCys: 0.357 ± 0.129
TrpAsp: 0.928 ± 0.244
TrpGlu: 0.643 ± 0.219
TrpPhe: 0.714 ± 0.22
TrpGly: 1.5 ± 0.407
TrpHis: 0.428 ± 0.181
TrpIle: 1.214 ± 0.279
TrpLys: 0.928 ± 0.24
TrpLeu: 1.714 ± 0.331
TrpMet: 0.357 ± 0.173
TrpAsn: 0.714 ± 0.242
TrpPro: 0.643 ± 0.216
TrpGln: 0.928 ± 0.252
TrpArg: 1.0 ± 0.204
TrpSer: 1.0 ± 0.217
TrpThr: 1.0 ± 0.361
TrpVal: 0.928 ± 0.241
TrpTrp: 0.143 ± 0.092
TrpTyr: 0.643 ± 0.201
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 3.071 ± 0.396
TyrCys: 0.643 ± 0.26
TyrAsp: 1.285 ± 0.31
TyrGlu: 1.785 ± 0.338
TyrPhe: 1.071 ± 0.281
TyrGly: 2.214 ± 0.403
TyrHis: 0.714 ± 0.243
TyrIle: 1.143 ± 0.291
TyrLys: 1.428 ± 0.301
TyrLeu: 2.356 ± 0.471
TyrMet: 0.214 ± 0.108
TyrAsn: 1.428 ± 0.267
TyrPro: 1.714 ± 0.379
TyrGln: 1.642 ± 0.3
TyrArg: 1.999 ± 0.343
TyrSer: 2.642 ± 0.611
TyrThr: 2.428 ± 0.526
TyrVal: 1.714 ± 0.402
TyrTrp: 0.5 ± 0.177
TyrTyr: 0.857 ± 0.226
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 69 proteins (14005 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
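A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions, not the code used by Proteome-pI: it assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each of the 100 replicates, and that the reported error is the standard deviation across replicates. The function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_freq(sequences, dipeptide):
    """Per mille frequency of one dipeptide over all overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(sequences, dipeptide, n_rounds=100, seed=0):
    """Standard deviation of the frequency across bootstrap replicates.

    Each replicate draws len(sequences) whole proteins with replacement,
    so the resampling unit is the protein, not the residue.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_freq(sample, dipeptide))
    return statistics.stdev(estimates)


# Toy proteome (hypothetical sequences).
proteins = ["MKAAL", "AAK", "GGAAV"]
error_aa = bootstrap_error(proteins, "AA")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the spread reflects how much a frequency depends on which proteins happen to be in the proteome.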

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski