Amino acid dipepetide frequency for Gordonia phage Mellie

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
16.665AlaAla: 16.665 ± 1.151
0.641AlaCys: 0.641 ± 0.216
7.435AlaAsp: 7.435 ± 0.804
6.794AlaGlu: 6.794 ± 0.601
3.269AlaPhe: 3.269 ± 0.599
9.614AlaGly: 9.614 ± 1.03
2.243AlaHis: 2.243 ± 0.43
5.128AlaIle: 5.128 ± 0.517
3.141AlaLys: 3.141 ± 0.628
10.127AlaLeu: 10.127 ± 0.98
2.948AlaMet: 2.948 ± 0.434
3.205AlaAsn: 3.205 ± 0.603
5.64AlaPro: 5.64 ± 0.647
4.487AlaGln: 4.487 ± 0.662
8.14AlaArg: 8.14 ± 0.784
5.32AlaSer: 5.32 ± 0.701
7.691AlaThr: 7.691 ± 0.741
8.012AlaVal: 8.012 ± 1.013
1.666AlaTrp: 1.666 ± 0.331
2.243AlaTyr: 2.243 ± 0.417
0.0AlaXaa: 0.0 ± 0.0
Cys
0.641CysAla: 0.641 ± 0.194
0.192CysCys: 0.192 ± 0.148
0.961CysAsp: 0.961 ± 0.282
0.256CysGlu: 0.256 ± 0.129
0.064CysPhe: 0.064 ± 0.059
0.897CysGly: 0.897 ± 0.286
0.449CysHis: 0.449 ± 0.155
0.192CysIle: 0.192 ± 0.111
0.192CysLys: 0.192 ± 0.109
0.385CysLeu: 0.385 ± 0.143
0.192CysMet: 0.192 ± 0.117
0.385CysAsn: 0.385 ± 0.195
0.705CysPro: 0.705 ± 0.221
0.32CysGln: 0.32 ± 0.135
1.09CysArg: 1.09 ± 0.361
0.256CysSer: 0.256 ± 0.139
0.641CysThr: 0.641 ± 0.223
0.513CysVal: 0.513 ± 0.174
0.192CysTrp: 0.192 ± 0.16
0.064CysTyr: 0.064 ± 0.065
0.0CysXaa: 0.0 ± 0.0
Asp
7.82AspAla: 7.82 ± 0.707
0.449AspCys: 0.449 ± 0.154
5.512AspAsp: 5.512 ± 0.704
4.615AspGlu: 4.615 ± 0.729
1.666AspPhe: 1.666 ± 0.341
6.538AspGly: 6.538 ± 0.886
2.115AspHis: 2.115 ± 0.467
2.628AspIle: 2.628 ± 0.372
1.666AspLys: 1.666 ± 0.321
6.089AspLeu: 6.089 ± 0.748
1.346AspMet: 1.346 ± 0.328
2.243AspAsn: 2.243 ± 0.442
4.358AspPro: 4.358 ± 0.589
2.692AspGln: 2.692 ± 0.364
4.935AspArg: 4.935 ± 0.699
3.012AspSer: 3.012 ± 0.344
4.615AspThr: 4.615 ± 0.555
6.025AspVal: 6.025 ± 0.691
0.961AspTrp: 0.961 ± 0.229
1.923AspTyr: 1.923 ± 0.389
0.0AspXaa: 0.0 ± 0.0
Glu
5.063GluAla: 5.063 ± 0.652
0.32GluCys: 0.32 ± 0.151
3.012GluAsp: 3.012 ± 0.465
2.436GluGlu: 2.436 ± 0.544
2.436GluPhe: 2.436 ± 0.403
3.782GluGly: 3.782 ± 0.52
1.538GluHis: 1.538 ± 0.248
2.5GluIle: 2.5 ± 0.307
2.243GluLys: 2.243 ± 0.315
5.64GluLeu: 5.64 ± 0.842
1.154GluMet: 1.154 ± 0.23
1.538GluAsn: 1.538 ± 0.324
4.23GluPro: 4.23 ± 0.758
2.948GluGln: 2.948 ± 0.435
4.358GluArg: 4.358 ± 0.644
2.371GluSer: 2.371 ± 0.361
2.692GluThr: 2.692 ± 0.367
4.999GluVal: 4.999 ± 0.705
1.09GluTrp: 1.09 ± 0.244
1.474GluTyr: 1.474 ± 0.358
0.0GluXaa: 0.0 ± 0.0
Phe
2.756PheAla: 2.756 ± 0.372
0.256PheCys: 0.256 ± 0.136
2.307PheAsp: 2.307 ± 0.378
1.795PheGlu: 1.795 ± 0.334
0.833PhePhe: 0.833 ± 0.225
2.371PheGly: 2.371 ± 0.415
0.32PheHis: 0.32 ± 0.188
0.769PheIle: 0.769 ± 0.237
0.961PheLys: 0.961 ± 0.346
1.795PheLeu: 1.795 ± 0.375
0.449PheMet: 0.449 ± 0.148
0.641PheAsn: 0.641 ± 0.232
1.41PhePro: 1.41 ± 0.239
0.513PheGln: 0.513 ± 0.154
1.987PheArg: 1.987 ± 0.284
1.538PheSer: 1.538 ± 0.35
2.692PheThr: 2.692 ± 0.378
2.82PheVal: 2.82 ± 0.431
0.256PheTrp: 0.256 ± 0.14
0.577PheTyr: 0.577 ± 0.187
0.0PheXaa: 0.0 ± 0.0
Gly
8.717GlyAla: 8.717 ± 1.116
0.577GlyCys: 0.577 ± 0.193
6.474GlyAsp: 6.474 ± 0.536
4.679GlyGlu: 4.679 ± 0.62
2.82GlyPhe: 2.82 ± 0.494
7.114GlyGly: 7.114 ± 1.025
1.987GlyHis: 1.987 ± 0.388
3.653GlyIle: 3.653 ± 0.603
3.589GlyLys: 3.589 ± 0.551
8.332GlyLeu: 8.332 ± 1.187
1.474GlyMet: 1.474 ± 0.287
2.436GlyAsn: 2.436 ± 0.405
3.91GlyPro: 3.91 ± 0.494
3.012GlyGln: 3.012 ± 0.349
6.345GlyArg: 6.345 ± 0.594
4.166GlySer: 4.166 ± 0.493
4.935GlyThr: 4.935 ± 0.684
6.409GlyVal: 6.409 ± 0.513
2.436GlyTrp: 2.436 ± 0.388
2.371GlyTyr: 2.371 ± 0.358
0.0GlyXaa: 0.0 ± 0.0
His
2.179HisAla: 2.179 ± 0.394
0.385HisCys: 0.385 ± 0.159
1.474HisAsp: 1.474 ± 0.285
0.833HisGlu: 0.833 ± 0.22
0.449HisPhe: 0.449 ± 0.198
1.731HisGly: 1.731 ± 0.399
0.705HisHis: 0.705 ± 0.323
1.09HisIle: 1.09 ± 0.235
0.577HisLys: 0.577 ± 0.179
1.859HisLeu: 1.859 ± 0.416
0.128HisMet: 0.128 ± 0.081
0.449HisAsn: 0.449 ± 0.17
2.371HisPro: 2.371 ± 0.497
0.705HisGln: 0.705 ± 0.211
1.731HisArg: 1.731 ± 0.38
0.961HisSer: 0.961 ± 0.264
1.666HisThr: 1.666 ± 0.377
1.154HisVal: 1.154 ± 0.249
0.513HisTrp: 0.513 ± 0.187
0.705HisTyr: 0.705 ± 0.225
0.0HisXaa: 0.0 ± 0.0
Ile
5.64IleAla: 5.64 ± 0.518
0.192IleCys: 0.192 ± 0.12
3.525IleAsp: 3.525 ± 0.464
2.628IleGlu: 2.628 ± 0.386
0.897IlePhe: 0.897 ± 0.233
4.935IleGly: 4.935 ± 0.759
0.705IleHis: 0.705 ± 0.195
1.538IleIle: 1.538 ± 0.354
1.602IleLys: 1.602 ± 0.433
2.436IleLeu: 2.436 ± 0.365
0.128IleMet: 0.128 ± 0.081
1.09IleAsn: 1.09 ± 0.255
2.884IlePro: 2.884 ± 0.392
0.641IleGln: 0.641 ± 0.187
4.102IleArg: 4.102 ± 0.526
2.371IleSer: 2.371 ± 0.343
2.82IleThr: 2.82 ± 0.37
4.679IleVal: 4.679 ± 0.486
0.192IleTrp: 0.192 ± 0.119
0.897IleTyr: 0.897 ± 0.239
0.0IleXaa: 0.0 ± 0.0
Lys
3.782LysAla: 3.782 ± 0.458
0.192LysCys: 0.192 ± 0.091
1.923LysAsp: 1.923 ± 0.442
1.218LysGlu: 1.218 ± 0.267
1.154LysPhe: 1.154 ± 0.292
2.692LysGly: 2.692 ± 0.441
0.385LysHis: 0.385 ± 0.176
1.538LysIle: 1.538 ± 0.35
1.923LysLys: 1.923 ± 0.314
2.948LysLeu: 2.948 ± 0.423
0.449LysMet: 0.449 ± 0.19
0.961LysAsn: 0.961 ± 0.288
2.628LysPro: 2.628 ± 0.37
0.769LysGln: 0.769 ± 0.26
2.115LysArg: 2.115 ± 0.392
2.115LysSer: 2.115 ± 0.362
1.987LysThr: 1.987 ± 0.34
2.692LysVal: 2.692 ± 0.301
0.641LysTrp: 0.641 ± 0.191
0.833LysTyr: 0.833 ± 0.29
0.0LysXaa: 0.0 ± 0.0
Leu
10.704LeuAla: 10.704 ± 0.947
0.833LeuCys: 0.833 ± 0.269
5.897LeuAsp: 5.897 ± 0.803
3.653LeuGlu: 3.653 ± 0.602
1.859LeuPhe: 1.859 ± 0.279
6.153LeuGly: 6.153 ± 0.825
1.218LeuHis: 1.218 ± 0.288
3.269LeuIle: 3.269 ± 0.463
1.859LeuLys: 1.859 ± 0.316
4.615LeuLeu: 4.615 ± 0.642
2.051LeuMet: 2.051 ± 0.394
2.179LeuAsn: 2.179 ± 0.44
4.358LeuPro: 4.358 ± 0.529
1.987LeuGln: 1.987 ± 0.328
5.32LeuArg: 5.32 ± 0.488
4.423LeuSer: 4.423 ± 0.587
5.833LeuThr: 5.833 ± 0.704
6.666LeuVal: 6.666 ± 0.673
2.436LeuTrp: 2.436 ± 0.358
1.41LeuTyr: 1.41 ± 0.242
0.0LeuXaa: 0.0 ± 0.0
Met
3.525MetAla: 3.525 ± 0.516
0.256MetCys: 0.256 ± 0.126
0.705MetAsp: 0.705 ± 0.187
0.769MetGlu: 0.769 ± 0.18
0.641MetPhe: 0.641 ± 0.193
1.474MetGly: 1.474 ± 0.384
0.385MetHis: 0.385 ± 0.166
0.513MetIle: 0.513 ± 0.169
0.577MetLys: 0.577 ± 0.186
1.731MetLeu: 1.731 ± 0.32
0.192MetMet: 0.192 ± 0.12
0.449MetAsn: 0.449 ± 0.174
1.795MetPro: 1.795 ± 0.327
0.513MetGln: 0.513 ± 0.186
2.051MetArg: 2.051 ± 0.575
1.923MetSer: 1.923 ± 0.37
2.307MetThr: 2.307 ± 0.325
0.769MetVal: 0.769 ± 0.232
0.513MetTrp: 0.513 ± 0.277
0.385MetTyr: 0.385 ± 0.16
0.0MetXaa: 0.0 ± 0.0
Asn
2.5AsnAla: 2.5 ± 0.473
0.192AsnCys: 0.192 ± 0.115
1.923AsnAsp: 1.923 ± 0.343
1.154AsnGlu: 1.154 ± 0.28
0.513AsnPhe: 0.513 ± 0.143
4.166AsnGly: 4.166 ± 0.59
0.577AsnHis: 0.577 ± 0.176
0.961AsnIle: 0.961 ± 0.292
0.897AsnLys: 0.897 ± 0.23
2.115AsnLeu: 2.115 ± 0.359
0.641AsnMet: 0.641 ± 0.197
0.833AsnAsn: 0.833 ± 0.257
2.628AsnPro: 2.628 ± 0.397
0.897AsnGln: 0.897 ± 0.195
1.859AsnArg: 1.859 ± 0.364
1.731AsnSer: 1.731 ± 0.345
2.179AsnThr: 2.179 ± 0.461
1.731AsnVal: 1.731 ± 0.406
0.513AsnTrp: 0.513 ± 0.15
1.09AsnTyr: 1.09 ± 0.241
0.0AsnXaa: 0.0 ± 0.0
Pro
6.986ProAla: 6.986 ± 0.94
0.833ProCys: 0.833 ± 0.298
4.551ProAsp: 4.551 ± 0.564
4.166ProGlu: 4.166 ± 0.578
1.602ProPhe: 1.602 ± 0.302
5.063ProGly: 5.063 ± 0.675
1.474ProHis: 1.474 ± 0.371
3.269ProIle: 3.269 ± 0.495
2.756ProLys: 2.756 ± 0.375
2.756ProLeu: 2.756 ± 0.38
1.731ProMet: 1.731 ± 0.462
1.795ProAsn: 1.795 ± 0.253
3.461ProPro: 3.461 ± 0.587
2.051ProGln: 2.051 ± 0.355
4.166ProArg: 4.166 ± 0.682
2.692ProSer: 2.692 ± 0.312
3.91ProThr: 3.91 ± 0.527
4.166ProVal: 4.166 ± 0.468
1.218ProTrp: 1.218 ± 0.296
1.026ProTyr: 1.026 ± 0.25
0.0ProXaa: 0.0 ± 0.0
Gln
2.82GlnAla: 2.82 ± 0.505
0.064GlnCys: 0.064 ± 0.059
0.961GlnAsp: 0.961 ± 0.21
1.282GlnGlu: 1.282 ± 0.333
0.961GlnPhe: 0.961 ± 0.265
2.115GlnGly: 2.115 ± 0.428
1.09GlnHis: 1.09 ± 0.26
1.731GlnIle: 1.731 ± 0.36
1.026GlnLys: 1.026 ± 0.18
2.884GlnLeu: 2.884 ± 0.391
1.09GlnMet: 1.09 ± 0.261
0.897GlnAsn: 0.897 ± 0.207
1.987GlnPro: 1.987 ± 0.396
1.666GlnGln: 1.666 ± 0.397
3.141GlnArg: 3.141 ± 0.426
2.564GlnSer: 2.564 ± 0.437
1.987GlnThr: 1.987 ± 0.39
2.564GlnVal: 2.564 ± 0.458
1.09GlnTrp: 1.09 ± 0.25
0.833GlnTyr: 0.833 ± 0.193
0.0GlnXaa: 0.0 ± 0.0
Arg
8.332ArgAla: 8.332 ± 0.928
1.154ArgCys: 1.154 ± 0.318
5.961ArgAsp: 5.961 ± 0.693
4.23ArgGlu: 4.23 ± 0.522
1.923ArgPhe: 1.923 ± 0.32
6.986ArgGly: 6.986 ± 0.772
1.538ArgHis: 1.538 ± 0.332
3.846ArgIle: 3.846 ± 0.552
2.307ArgLys: 2.307 ± 0.326
6.217ArgLeu: 6.217 ± 0.566
2.179ArgMet: 2.179 ± 0.402
2.5ArgAsn: 2.5 ± 0.389
3.717ArgPro: 3.717 ± 0.585
2.564ArgGln: 2.564 ± 0.423
8.204ArgArg: 8.204 ± 1.145
4.23ArgSer: 4.23 ± 0.432
4.23ArgThr: 4.23 ± 0.459
5.192ArgVal: 5.192 ± 0.665
1.474ArgTrp: 1.474 ± 0.377
1.666ArgTyr: 1.666 ± 0.312
0.0ArgXaa: 0.0 ± 0.0
Ser
5.64SerAla: 5.64 ± 0.873
0.064SerCys: 0.064 ± 0.062
3.525SerAsp: 3.525 ± 0.43
3.589SerGlu: 3.589 ± 0.593
1.346SerPhe: 1.346 ± 0.271
5.448SerGly: 5.448 ± 0.653
0.833SerHis: 0.833 ± 0.216
2.884SerIle: 2.884 ± 0.402
1.346SerLys: 1.346 ± 0.246
3.012SerLeu: 3.012 ± 0.397
1.41SerMet: 1.41 ± 0.288
1.666SerAsn: 1.666 ± 0.433
2.82SerPro: 2.82 ± 0.353
1.09SerGln: 1.09 ± 0.324
3.974SerArg: 3.974 ± 0.639
2.115SerSer: 2.115 ± 0.365
3.589SerThr: 3.589 ± 0.529
4.358SerVal: 4.358 ± 0.562
1.538SerTrp: 1.538 ± 0.265
0.833SerTyr: 0.833 ± 0.24
0.0SerXaa: 0.0 ± 0.0
Thr
7.179ThrAla: 7.179 ± 0.619
0.449ThrCys: 0.449 ± 0.17
5.256ThrAsp: 5.256 ± 0.61
4.743ThrGlu: 4.743 ± 0.607
2.179ThrPhe: 2.179 ± 0.557
5.063ThrGly: 5.063 ± 0.699
1.666ThrHis: 1.666 ± 0.35
3.397ThrIle: 3.397 ± 0.583
2.371ThrLys: 2.371 ± 0.385
5.192ThrLeu: 5.192 ± 0.64
1.218ThrMet: 1.218 ± 0.21
2.307ThrAsn: 2.307 ± 0.506
4.358ThrPro: 4.358 ± 0.575
1.923ThrGln: 1.923 ± 0.333
4.487ThrArg: 4.487 ± 0.476
2.5ThrSer: 2.5 ± 0.416
4.551ThrThr: 4.551 ± 0.621
5.704ThrVal: 5.704 ± 0.533
1.282ThrTrp: 1.282 ± 0.289
1.666ThrTyr: 1.666 ± 0.392
0.0ThrXaa: 0.0 ± 0.0
Val
9.422ValAla: 9.422 ± 0.934
1.154ValCys: 1.154 ± 0.323
7.114ValAsp: 7.114 ± 0.812
4.743ValGlu: 4.743 ± 0.641
1.41ValPhe: 1.41 ± 0.405
5.833ValGly: 5.833 ± 0.611
1.218ValHis: 1.218 ± 0.248
3.333ValIle: 3.333 ± 0.393
2.243ValLys: 2.243 ± 0.366
5.448ValLeu: 5.448 ± 0.709
1.666ValMet: 1.666 ± 0.392
1.923ValAsn: 1.923 ± 0.32
3.846ValPro: 3.846 ± 0.509
2.307ValGln: 2.307 ± 0.416
6.73ValArg: 6.73 ± 0.673
4.038ValSer: 4.038 ± 0.611
5.833ValThr: 5.833 ± 0.743
6.858ValVal: 6.858 ± 0.742
1.602ValTrp: 1.602 ± 0.372
1.987ValTyr: 1.987 ± 0.259
0.0ValXaa: 0.0 ± 0.0
Trp
1.666TrpAla: 1.666 ± 0.356
0.128TrpCys: 0.128 ± 0.096
1.41TrpAsp: 1.41 ± 0.374
0.833TrpGlu: 0.833 ± 0.24
0.577TrpPhe: 0.577 ± 0.225
0.897TrpGly: 0.897 ± 0.269
0.513TrpHis: 0.513 ± 0.217
0.897TrpIle: 0.897 ± 0.235
0.769TrpLys: 0.769 ± 0.188
2.051TrpLeu: 2.051 ± 0.369
0.385TrpMet: 0.385 ± 0.162
0.897TrpAsn: 0.897 ± 0.388
1.41TrpPro: 1.41 ± 0.285
0.833TrpGln: 0.833 ± 0.231
1.731TrpArg: 1.731 ± 0.313
1.41TrpSer: 1.41 ± 0.29
1.474TrpThr: 1.474 ± 0.286
1.538TrpVal: 1.538 ± 0.303
0.513TrpTrp: 0.513 ± 0.176
0.513TrpTyr: 0.513 ± 0.189
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.371TyrAla: 2.371 ± 0.377
0.256TyrCys: 0.256 ± 0.119
1.282TyrAsp: 1.282 ± 0.294
1.41TyrGlu: 1.41 ± 0.356
0.449TyrPhe: 0.449 ± 0.158
2.564TyrGly: 2.564 ± 0.42
0.705TyrHis: 0.705 ± 0.219
0.769TyrIle: 0.769 ± 0.256
0.897TyrLys: 0.897 ± 0.218
1.09TyrLeu: 1.09 ± 0.299
0.577TyrMet: 0.577 ± 0.191
0.705TyrAsn: 0.705 ± 0.208
1.154TyrPro: 1.154 ± 0.269
0.833TyrGln: 0.833 ± 0.256
2.051TyrArg: 2.051 ± 0.361
1.346TyrSer: 1.346 ± 0.291
1.859TyrThr: 1.859 ± 0.331
1.859TyrVal: 1.859 ± 0.34
0.32TyrTrp: 0.32 ± 0.175
0.577TyrTyr: 0.577 ± 0.224
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 76 proteins (15603 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski