Amino acid dipepetide frequency for Escherichia phage SRT8

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.079AlaAla: 9.079 ± 0.772
0.74AlaCys: 0.74 ± 0.197
4.775AlaAsp: 4.775 ± 0.523
5.783AlaGlu: 5.783 ± 0.8
2.69AlaPhe: 2.69 ± 0.485
6.523AlaGly: 6.523 ± 0.727
0.941AlaHis: 0.941 ± 0.273
6.658AlaIle: 6.658 ± 0.673
6.254AlaLys: 6.254 ± 0.88
7.599AlaLeu: 7.599 ± 0.794
2.892AlaMet: 2.892 ± 0.39
3.631AlaAsn: 3.631 ± 0.565
2.286AlaPro: 2.286 ± 0.345
3.43AlaGln: 3.43 ± 0.829
5.044AlaArg: 5.044 ± 0.613
4.64AlaSer: 4.64 ± 0.619
3.833AlaThr: 3.833 ± 0.465
5.649AlaVal: 5.649 ± 0.509
0.874AlaTrp: 0.874 ± 0.23
2.555AlaTyr: 2.555 ± 0.439
0.0AlaXaa: 0.0 ± 0.0
Cys
1.076CysAla: 1.076 ± 0.334
0.134CysCys: 0.134 ± 0.096
0.941CysAsp: 0.941 ± 0.264
1.21CysGlu: 1.21 ± 0.322
0.403CysPhe: 0.403 ± 0.177
1.076CysGly: 1.076 ± 0.305
0.202CysHis: 0.202 ± 0.116
0.672CysIle: 0.672 ± 0.234
1.076CysLys: 1.076 ± 0.375
0.874CysLeu: 0.874 ± 0.289
0.336CysMet: 0.336 ± 0.151
0.74CysAsn: 0.74 ± 0.237
0.403CysPro: 0.403 ± 0.177
0.067CysGln: 0.067 ± 0.066
0.807CysArg: 0.807 ± 0.244
0.336CysSer: 0.336 ± 0.2
0.74CysThr: 0.74 ± 0.299
1.143CysVal: 1.143 ± 0.256
0.134CysTrp: 0.134 ± 0.102
0.403CysTyr: 0.403 ± 0.199
0.0CysXaa: 0.0 ± 0.0
Asp
4.371AspAla: 4.371 ± 0.508
0.74AspCys: 0.74 ± 0.24
3.295AspAsp: 3.295 ± 0.495
4.842AspGlu: 4.842 ± 0.533
3.228AspPhe: 3.228 ± 0.568
5.514AspGly: 5.514 ± 0.759
0.672AspHis: 0.672 ± 0.181
2.824AspIle: 2.824 ± 0.443
2.488AspLys: 2.488 ± 0.507
4.237AspLeu: 4.237 ± 0.621
1.681AspMet: 1.681 ± 0.375
2.488AspAsn: 2.488 ± 0.359
2.286AspPro: 2.286 ± 0.309
2.017AspGln: 2.017 ± 0.354
2.69AspArg: 2.69 ± 0.493
3.699AspSer: 3.699 ± 0.499
3.093AspThr: 3.093 ± 0.39
4.842AspVal: 4.842 ± 0.557
0.74AspTrp: 0.74 ± 0.223
2.757AspTyr: 2.757 ± 0.403
0.0AspXaa: 0.0 ± 0.0
Glu
6.859GluAla: 6.859 ± 0.765
0.605GluCys: 0.605 ± 0.221
3.833GluAsp: 3.833 ± 0.487
4.976GluGlu: 4.976 ± 0.754
4.035GluPhe: 4.035 ± 0.488
4.438GluGly: 4.438 ± 0.57
1.345GluHis: 1.345 ± 0.338
4.976GluIle: 4.976 ± 0.625
3.9GluLys: 3.9 ± 0.574
4.438GluLeu: 4.438 ± 0.502
3.43GluMet: 3.43 ± 0.381
2.555GluAsn: 2.555 ± 0.515
1.681GluPro: 1.681 ± 0.265
3.43GluGln: 3.43 ± 0.736
2.892GluArg: 2.892 ± 0.601
4.304GluSer: 4.304 ± 0.498
3.9GluThr: 3.9 ± 0.509
5.313GluVal: 5.313 ± 0.631
0.74GluTrp: 0.74 ± 0.256
2.824GluTyr: 2.824 ± 0.34
0.0GluXaa: 0.0 ± 0.0
Phe
2.892PheAla: 2.892 ± 0.467
1.412PheCys: 1.412 ± 0.427
3.228PheAsp: 3.228 ± 0.549
2.555PheGlu: 2.555 ± 0.452
1.076PhePhe: 1.076 ± 0.284
3.766PheGly: 3.766 ± 0.505
0.672PheHis: 0.672 ± 0.184
2.421PheIle: 2.421 ± 0.374
3.093PheLys: 3.093 ± 0.412
2.421PheLeu: 2.421 ± 0.462
1.143PheMet: 1.143 ± 0.277
2.219PheAsn: 2.219 ± 0.314
1.614PhePro: 1.614 ± 0.422
1.009PheGln: 1.009 ± 0.27
2.085PheArg: 2.085 ± 0.364
2.152PheSer: 2.152 ± 0.369
3.766PheThr: 3.766 ± 0.555
2.085PheVal: 2.085 ± 0.351
0.941PheTrp: 0.941 ± 0.226
1.21PheTyr: 1.21 ± 0.284
0.0PheXaa: 0.0 ± 0.0
Gly
5.985GlyAla: 5.985 ± 0.809
1.614GlyCys: 1.614 ± 0.379
4.506GlyAsp: 4.506 ± 0.501
4.909GlyGlu: 4.909 ± 0.438
2.488GlyPhe: 2.488 ± 0.319
8.272GlyGly: 8.272 ± 1.087
1.479GlyHis: 1.479 ± 0.371
4.842GlyIle: 4.842 ± 0.52
7.061GlyLys: 7.061 ± 0.721
5.111GlyLeu: 5.111 ± 0.45
3.026GlyMet: 3.026 ± 0.448
3.766GlyAsn: 3.766 ± 0.613
0.0GlyPro: 0.0 ± 0.0
3.161GlyGln: 3.161 ± 0.442
3.497GlyArg: 3.497 ± 0.467
4.169GlySer: 4.169 ± 0.547
3.699GlyThr: 3.699 ± 0.518
5.851GlyVal: 5.851 ± 0.586
1.479GlyTrp: 1.479 ± 0.334
2.892GlyTyr: 2.892 ± 0.456
0.0GlyXaa: 0.0 ± 0.0
His
1.21HisAla: 1.21 ± 0.321
0.134HisCys: 0.134 ± 0.1
1.21HisAsp: 1.21 ± 0.366
1.681HisGlu: 1.681 ± 0.327
1.278HisPhe: 1.278 ± 0.312
1.345HisGly: 1.345 ± 0.307
0.941HisHis: 0.941 ± 0.274
0.807HisIle: 0.807 ± 0.293
1.345HisLys: 1.345 ± 0.321
1.547HisLeu: 1.547 ± 0.324
0.269HisMet: 0.269 ± 0.138
0.941HisAsn: 0.941 ± 0.326
0.605HisPro: 0.605 ± 0.181
0.336HisGln: 0.336 ± 0.138
1.143HisArg: 1.143 ± 0.298
1.009HisSer: 1.009 ± 0.296
0.807HisThr: 0.807 ± 0.26
1.816HisVal: 1.816 ± 0.398
0.067HisTrp: 0.067 ± 0.059
1.479HisTyr: 1.479 ± 0.338
0.0HisXaa: 0.0 ± 0.0
Ile
5.582IleAla: 5.582 ± 0.529
0.941IleCys: 0.941 ± 0.224
4.237IleAsp: 4.237 ± 0.506
5.447IleGlu: 5.447 ± 0.578
2.354IlePhe: 2.354 ± 0.336
4.169IleGly: 4.169 ± 0.57
1.278IleHis: 1.278 ± 0.348
3.228IleIle: 3.228 ± 0.431
4.237IleLys: 4.237 ± 0.405
3.362IleLeu: 3.362 ± 0.538
1.614IleMet: 1.614 ± 0.404
3.699IleAsn: 3.699 ± 0.67
2.085IlePro: 2.085 ± 0.423
2.286IleGln: 2.286 ± 0.464
3.497IleArg: 3.497 ± 0.452
3.564IleSer: 3.564 ± 0.422
4.573IleThr: 4.573 ± 0.571
3.564IleVal: 3.564 ± 0.486
0.874IleTrp: 0.874 ± 0.209
1.95IleTyr: 1.95 ± 0.331
0.0IleXaa: 0.0 ± 0.0
Lys
6.12LysAla: 6.12 ± 0.707
0.807LysCys: 0.807 ± 0.292
4.371LysAsp: 4.371 ± 0.556
4.842LysGlu: 4.842 ± 0.627
2.354LysPhe: 2.354 ± 0.38
4.102LysGly: 4.102 ± 0.59
1.345LysHis: 1.345 ± 0.278
3.497LysIle: 3.497 ± 0.49
4.371LysLys: 4.371 ± 0.523
3.9LysLeu: 3.9 ± 0.428
3.093LysMet: 3.093 ± 0.426
3.093LysAsn: 3.093 ± 0.415
3.093LysPro: 3.093 ± 0.514
2.555LysGln: 2.555 ± 0.443
3.699LysArg: 3.699 ± 0.549
4.035LysSer: 4.035 ± 0.518
3.497LysThr: 3.497 ± 0.526
5.447LysVal: 5.447 ± 0.593
0.941LysTrp: 0.941 ± 0.272
2.219LysTyr: 2.219 ± 0.34
0.0LysXaa: 0.0 ± 0.0
Leu
5.447LeuAla: 5.447 ± 0.663
0.672LeuCys: 0.672 ± 0.19
3.228LeuAsp: 3.228 ± 0.394
3.9LeuGlu: 3.9 ± 0.588
2.017LeuPhe: 2.017 ± 0.35
4.035LeuGly: 4.035 ± 0.633
1.345LeuHis: 1.345 ± 0.358
5.044LeuIle: 5.044 ± 0.545
5.918LeuLys: 5.918 ± 0.679
4.707LeuLeu: 4.707 ± 0.539
1.95LeuMet: 1.95 ± 0.389
4.169LeuAsn: 4.169 ± 0.753
3.699LeuPro: 3.699 ± 0.512
2.017LeuGln: 2.017 ± 0.342
4.102LeuArg: 4.102 ± 0.539
4.573LeuSer: 4.573 ± 0.466
3.564LeuThr: 3.564 ± 0.503
3.9LeuVal: 3.9 ± 0.48
1.076LeuTrp: 1.076 ± 0.298
2.354LeuTyr: 2.354 ± 0.37
0.0LeuXaa: 0.0 ± 0.0
Met
2.824MetAla: 2.824 ± 0.446
0.134MetCys: 0.134 ± 0.09
0.807MetAsp: 0.807 ± 0.269
2.219MetGlu: 2.219 ± 0.392
1.009MetPhe: 1.009 ± 0.231
1.547MetGly: 1.547 ± 0.325
1.009MetHis: 1.009 ± 0.252
2.623MetIle: 2.623 ± 0.384
3.497MetLys: 3.497 ± 0.501
2.824MetLeu: 2.824 ± 0.459
1.345MetMet: 1.345 ± 0.432
1.748MetAsn: 1.748 ± 0.283
0.807MetPro: 0.807 ± 0.213
1.143MetGln: 1.143 ± 0.277
1.479MetArg: 1.479 ± 0.373
1.547MetSer: 1.547 ± 0.248
1.614MetThr: 1.614 ± 0.355
2.152MetVal: 2.152 ± 0.412
0.202MetTrp: 0.202 ± 0.121
1.143MetTyr: 1.143 ± 0.25
0.0MetXaa: 0.0 ± 0.0
Asn
4.707AsnAla: 4.707 ± 0.501
0.403AsnCys: 0.403 ± 0.167
2.757AsnAsp: 2.757 ± 0.381
3.026AsnGlu: 3.026 ± 0.411
1.883AsnPhe: 1.883 ± 0.37
5.514AsnGly: 5.514 ± 0.648
1.278AsnHis: 1.278 ± 0.353
2.69AsnIle: 2.69 ± 0.384
3.766AsnLys: 3.766 ± 0.48
2.286AsnLeu: 2.286 ± 0.534
0.874AsnMet: 0.874 ± 0.221
2.623AsnAsn: 2.623 ± 0.478
2.017AsnPro: 2.017 ± 0.403
1.816AsnGln: 1.816 ± 0.498
1.816AsnArg: 1.816 ± 0.347
2.959AsnSer: 2.959 ± 0.449
2.085AsnThr: 2.085 ± 0.298
3.497AsnVal: 3.497 ± 0.673
0.74AsnTrp: 0.74 ± 0.157
1.547AsnTyr: 1.547 ± 0.34
0.067AsnXaa: 0.067 ± 0.081
Pro
2.555ProAla: 2.555 ± 0.398
0.538ProCys: 0.538 ± 0.161
1.816ProAsp: 1.816 ± 0.401
3.026ProGlu: 3.026 ± 0.584
1.748ProPhe: 1.748 ± 0.373
2.757ProGly: 2.757 ± 0.451
1.009ProHis: 1.009 ± 0.244
1.95ProIle: 1.95 ± 0.349
1.681ProLys: 1.681 ± 0.415
1.816ProLeu: 1.816 ± 0.314
1.278ProMet: 1.278 ± 0.262
1.816ProAsn: 1.816 ± 0.352
0.874ProPro: 0.874 ± 0.244
1.21ProGln: 1.21 ± 0.337
1.345ProArg: 1.345 ± 0.341
1.681ProSer: 1.681 ± 0.356
1.412ProThr: 1.412 ± 0.261
3.362ProVal: 3.362 ± 0.369
0.336ProTrp: 0.336 ± 0.136
0.874ProTyr: 0.874 ± 0.221
0.0ProXaa: 0.0 ± 0.0
Gln
3.497GlnAla: 3.497 ± 0.731
0.538GlnCys: 0.538 ± 0.181
1.95GlnAsp: 1.95 ± 0.363
2.152GlnGlu: 2.152 ± 0.367
1.143GlnPhe: 1.143 ± 0.239
1.883GlnGly: 1.883 ± 0.39
0.874GlnHis: 0.874 ± 0.247
2.959GlnIle: 2.959 ± 0.448
2.152GlnLys: 2.152 ± 0.49
2.824GlnLeu: 2.824 ± 0.438
1.21GlnMet: 1.21 ± 0.278
1.547GlnAsn: 1.547 ± 0.288
1.21GlnPro: 1.21 ± 0.34
2.286GlnGln: 2.286 ± 0.511
1.95GlnArg: 1.95 ± 0.376
2.959GlnSer: 2.959 ± 0.468
1.614GlnThr: 1.614 ± 0.27
2.488GlnVal: 2.488 ± 0.493
0.74GlnTrp: 0.74 ± 0.268
2.017GlnTyr: 2.017 ± 0.372
0.0GlnXaa: 0.0 ± 0.0
Arg
4.169ArgAla: 4.169 ± 0.51
0.807ArgCys: 0.807 ± 0.334
2.555ArgAsp: 2.555 ± 0.471
3.9ArgGlu: 3.9 ± 0.499
2.892ArgPhe: 2.892 ± 0.393
2.757ArgGly: 2.757 ± 0.374
0.874ArgHis: 0.874 ± 0.279
2.824ArgIle: 2.824 ± 0.478
3.968ArgLys: 3.968 ± 0.457
3.968ArgLeu: 3.968 ± 0.503
1.95ArgMet: 1.95 ± 0.431
2.354ArgAsn: 2.354 ± 0.498
1.681ArgPro: 1.681 ± 0.376
2.286ArgGln: 2.286 ± 0.342
2.959ArgArg: 2.959 ± 0.517
2.757ArgSer: 2.757 ± 0.504
1.816ArgThr: 1.816 ± 0.328
3.968ArgVal: 3.968 ± 0.578
0.403ArgTrp: 0.403 ± 0.146
2.354ArgTyr: 2.354 ± 0.459
0.0ArgXaa: 0.0 ± 0.0
Ser
4.304SerAla: 4.304 ± 0.434
0.471SerCys: 0.471 ± 0.184
3.564SerAsp: 3.564 ± 0.473
3.968SerGlu: 3.968 ± 0.568
3.093SerPhe: 3.093 ± 0.385
6.725SerGly: 6.725 ± 0.711
0.874SerHis: 0.874 ± 0.269
3.362SerIle: 3.362 ± 0.389
2.555SerLys: 2.555 ± 0.386
4.237SerLeu: 4.237 ± 0.462
1.614SerMet: 1.614 ± 0.359
2.555SerAsn: 2.555 ± 0.373
1.614SerPro: 1.614 ± 0.363
2.421SerGln: 2.421 ± 0.408
3.093SerArg: 3.093 ± 0.394
2.152SerSer: 2.152 ± 0.452
2.757SerThr: 2.757 ± 0.421
4.775SerVal: 4.775 ± 0.629
0.874SerTrp: 0.874 ± 0.379
1.95SerTyr: 1.95 ± 0.328
0.0SerXaa: 0.0 ± 0.0
Thr
4.371ThrAla: 4.371 ± 0.6
0.74ThrCys: 0.74 ± 0.189
2.959ThrAsp: 2.959 ± 0.499
4.102ThrGlu: 4.102 ± 0.508
2.623ThrPhe: 2.623 ± 0.452
5.178ThrGly: 5.178 ± 0.522
1.143ThrHis: 1.143 ± 0.299
3.766ThrIle: 3.766 ± 0.565
2.69ThrLys: 2.69 ± 0.586
3.9ThrLeu: 3.9 ± 0.504
1.009ThrMet: 1.009 ± 0.247
2.757ThrAsn: 2.757 ± 0.448
2.757ThrPro: 2.757 ± 0.478
1.748ThrGln: 1.748 ± 0.365
2.219ThrArg: 2.219 ± 0.379
2.892ThrSer: 2.892 ± 0.492
2.757ThrThr: 2.757 ± 0.488
3.295ThrVal: 3.295 ± 0.513
0.672ThrTrp: 0.672 ± 0.212
2.354ThrTyr: 2.354 ± 0.456
0.0ThrXaa: 0.0 ± 0.0
Val
6.389ValAla: 6.389 ± 0.678
0.941ValCys: 0.941 ± 0.29
4.169ValAsp: 4.169 ± 0.464
4.506ValGlu: 4.506 ± 0.652
3.161ValPhe: 3.161 ± 0.412
4.64ValGly: 4.64 ± 0.614
1.345ValHis: 1.345 ± 0.264
4.775ValIle: 4.775 ± 0.7
4.573ValLys: 4.573 ± 0.509
4.64ValLeu: 4.64 ± 0.562
1.95ValMet: 1.95 ± 0.462
3.161ValAsn: 3.161 ± 0.352
2.354ValPro: 2.354 ± 0.463
2.824ValGln: 2.824 ± 0.523
3.43ValArg: 3.43 ± 0.558
4.573ValSer: 4.573 ± 0.564
5.178ValThr: 5.178 ± 0.627
4.506ValVal: 4.506 ± 0.586
1.681ValTrp: 1.681 ± 0.321
2.286ValTyr: 2.286 ± 0.467
0.0ValXaa: 0.0 ± 0.0
Trp
0.941TrpAla: 0.941 ± 0.202
0.067TrpCys: 0.067 ± 0.066
1.076TrpAsp: 1.076 ± 0.245
0.74TrpGlu: 0.74 ± 0.252
0.403TrpPhe: 0.403 ± 0.239
1.009TrpGly: 1.009 ± 0.217
0.538TrpHis: 0.538 ± 0.189
1.009TrpIle: 1.009 ± 0.312
0.807TrpLys: 0.807 ± 0.204
1.21TrpLeu: 1.21 ± 0.304
0.403TrpMet: 0.403 ± 0.142
0.403TrpAsn: 0.403 ± 0.178
0.269TrpPro: 0.269 ± 0.158
0.202TrpGln: 0.202 ± 0.112
1.143TrpArg: 1.143 ± 0.244
0.874TrpSer: 0.874 ± 0.23
1.009TrpThr: 1.009 ± 0.26
1.009TrpVal: 1.009 ± 0.23
0.202TrpTrp: 0.202 ± 0.122
0.605TrpTyr: 0.605 ± 0.204
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.497TyrAla: 3.497 ± 0.43
0.336TyrCys: 0.336 ± 0.133
3.228TyrAsp: 3.228 ± 0.431
2.69TyrGlu: 2.69 ± 0.425
1.883TyrPhe: 1.883 ± 0.409
2.555TyrGly: 2.555 ± 0.416
0.807TyrHis: 0.807 ± 0.284
1.614TyrIle: 1.614 ± 0.36
1.95TyrLys: 1.95 ± 0.404
1.816TyrLeu: 1.816 ± 0.319
0.538TyrMet: 0.538 ± 0.174
2.085TyrAsn: 2.085 ± 0.375
1.748TyrPro: 1.748 ± 0.328
1.816TyrGln: 1.816 ± 0.363
2.286TyrArg: 2.286 ± 0.429
2.085TyrSer: 2.085 ± 0.378
2.219TyrThr: 2.219 ± 0.346
2.421TyrVal: 2.421 ± 0.394
0.202TyrTrp: 0.202 ± 0.107
1.076TyrTyr: 1.076 ± 0.283
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.067XaaArg: 0.067 ± 0.081
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 84 proteins (14871 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski