Amino acid dipeptide frequency for Escherichia phage UAB_Phi78

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (1/400). Since this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
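The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the database's own code: the function name `dipeptide_frequencies` is hypothetical, sequences are given in one-letter codes (the table uses three-letter codes), pairs are counted as overlapping windows that never span two proteins, and frequencies are normalised by the total number of pairs counted. The page does not state which of these conventions Proteome-pI uses.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes


def dipeptide_frequencies(proteins):
    """Return {dipeptide: fraction} for all 441 pairs (20 standard residues + X)."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes 'X' (Xaa).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Assumption: overlapping windows of length 2, never spanning two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Toy input, not taken from the phage proteome.
freqs = dipeptide_frequencies(["MKAAL", "AAGX"])
uniform_expectation = 1 / 400  # 0.25% when only the 20 standard residues are used
print(len(freqs), freqs["AA"], uniform_expectation)
```

In the toy input there are seven pairs in total and "AA" occurs twice, so its frequency is 2/7, far above the uniform expectation simply because the example is tiny.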

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions; on this scale the uniform expectation of 0.25% corresponds to 2.5‰.
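As a small worked example of the unit conversion, the sketch below turns per mille values into fractions and compares them with the uniform expectation. The helper names are hypothetical, and the simple threshold rule is an assumption: the page does not say whether its red/blue marking uses the 400 or 441 baseline, or whether it takes the error into account.

```python
UNIFORM_PER_MILLE = 1000 / 400  # 0.25% expressed in per mille, i.e. 2.5


def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def classify(value_per_mille, expected=UNIFORM_PER_MILLE):
    """Assumed rule: compare directly against the uniform expectation."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"


# Three values copied from the table (in per mille).
examples = {"AlaAla": 8.577, "CysCys": 0.075, "GluAla": 9.33}
for name, value in examples.items():
    print(name, per_mille_to_fraction(value), classify(value))
```

Under this assumed rule AlaAla and GluAla count as overrepresented and CysCys as underrepresented.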

For more information, see the sequence space article on Wikipedia.

Second residue (columns): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
First residue (rows) below; each entry is value ± error in ‰.
Ala
AlaAla: 8.577 ± 1.246
AlaCys: 0.828 ± 0.31
AlaAsp: 4.815 ± 0.695
AlaGlu: 6.847 ± 0.972
AlaPhe: 2.633 ± 0.519
AlaGly: 7.75 ± 0.914
AlaHis: 1.204 ± 0.271
AlaIle: 5.116 ± 0.544
AlaLys: 5.116 ± 0.536
AlaLeu: 6.997 ± 0.659
AlaMet: 3.762 ± 0.741
AlaAsn: 3.461 ± 0.486
AlaPro: 3.311 ± 0.668
AlaGln: 4.665 ± 0.832
AlaArg: 5.267 ± 0.635
AlaSer: 5.116 ± 0.764
AlaThr: 4.514 ± 0.549
AlaVal: 7.373 ± 0.859
AlaTrp: 1.43 ± 0.321
AlaTyr: 2.859 ± 0.45
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.451 ± 0.15
CysCys: 0.075 ± 0.072
CysAsp: 0.602 ± 0.232
CysGlu: 0.376 ± 0.196
CysPhe: 0.602 ± 0.272
CysGly: 1.279 ± 0.409
CysHis: 0.451 ± 0.209
CysIle: 0.527 ± 0.203
CysLys: 0.752 ± 0.256
CysLeu: 1.053 ± 0.321
CysMet: 0.451 ± 0.205
CysAsn: 0.451 ± 0.205
CysPro: 0.527 ± 0.215
CysGln: 0.376 ± 0.178
CysArg: 0.978 ± 0.325
CysSer: 0.602 ± 0.212
CysThr: 0.15 ± 0.117
CysVal: 0.828 ± 0.269
CysTrp: 0.075 ± 0.081
CysTyr: 0.527 ± 0.203
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.793 ± 0.67
AspCys: 0.752 ± 0.317
AspAsp: 3.611 ± 0.673
AspGlu: 3.912 ± 0.591
AspPhe: 1.806 ± 0.396
AspGly: 4.213 ± 0.69
AspHis: 0.978 ± 0.301
AspIle: 3.311 ± 0.481
AspLys: 4.213 ± 0.508
AspLeu: 5.492 ± 0.792
AspMet: 1.881 ± 0.316
AspAsn: 2.408 ± 0.403
AspPro: 2.257 ± 0.42
AspGln: 0.903 ± 0.249
AspArg: 3.01 ± 0.323
AspSer: 4.213 ± 0.556
AspThr: 2.934 ± 0.422
AspVal: 4.364 ± 0.589
AspTrp: 1.053 ± 0.349
AspTyr: 2.408 ± 0.457
AspXaa: 0.0 ± 0.0
Glu
GluAla: 9.33 ± 1.116
GluCys: 0.903 ± 0.312
GluAsp: 5.793 ± 0.58
GluGlu: 5.793 ± 0.894
GluPhe: 2.784 ± 0.39
GluGly: 5.718 ± 0.783
GluHis: 1.204 ± 0.297
GluIle: 2.558 ± 0.401
GluLys: 4.289 ± 0.599
GluLeu: 5.492 ± 0.797
GluMet: 2.332 ± 0.5
GluAsn: 1.806 ± 0.504
GluPro: 1.655 ± 0.322
GluGln: 3.16 ± 0.468
GluArg: 3.837 ± 0.572
GluSer: 3.311 ± 0.5
GluThr: 2.784 ± 0.448
GluVal: 5.041 ± 0.534
GluTrp: 1.505 ± 0.318
GluTyr: 1.655 ± 0.3
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.859 ± 0.45
PheCys: 0.15 ± 0.112
PheAsp: 3.311 ± 0.594
PheGlu: 3.16 ± 0.479
PhePhe: 0.903 ± 0.265
PheGly: 2.182 ± 0.635
PheHis: 0.752 ± 0.262
PheIle: 2.332 ± 0.364
PheLys: 2.332 ± 0.36
PheLeu: 3.01 ± 0.455
PheMet: 1.43 ± 0.375
PheAsn: 1.655 ± 0.324
PhePro: 0.903 ± 0.25
PheGln: 0.978 ± 0.261
PheArg: 1.956 ± 0.345
PheSer: 2.332 ± 0.395
PheThr: 1.806 ± 0.345
PheVal: 0.978 ± 0.283
PheTrp: 0.226 ± 0.169
PheTyr: 0.828 ± 0.252
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.148 ± 0.771
GlyCys: 1.204 ± 0.407
GlyAsp: 5.643 ± 0.723
GlyGlu: 4.966 ± 0.547
GlyPhe: 3.687 ± 0.707
GlyGly: 5.793 ± 0.912
GlyHis: 2.107 ± 0.422
GlyIle: 4.815 ± 0.638
GlyLys: 5.869 ± 1.001
GlyLeu: 4.213 ± 0.633
GlyMet: 2.031 ± 0.348
GlyAsn: 3.085 ± 0.554
GlyPro: 0.376 ± 0.143
GlyGln: 3.386 ± 0.454
GlyArg: 4.364 ± 0.452
GlySer: 5.116 ± 0.831
GlyThr: 4.74 ± 0.723
GlyVal: 5.643 ± 0.689
GlyTrp: 1.58 ± 0.321
GlyTyr: 3.16 ± 0.418
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.978 ± 0.388
HisCys: 0.301 ± 0.136
HisAsp: 1.354 ± 0.308
HisGlu: 1.58 ± 0.451
HisPhe: 0.752 ± 0.223
HisGly: 1.73 ± 0.441
HisHis: 0.677 ± 0.267
HisIle: 1.354 ± 0.433
HisLys: 1.279 ± 0.346
HisLeu: 2.784 ± 0.516
HisMet: 0.752 ± 0.324
HisAsn: 0.828 ± 0.201
HisPro: 0.527 ± 0.206
HisGln: 0.752 ± 0.178
HisArg: 1.279 ± 0.404
HisSer: 0.602 ± 0.197
HisThr: 1.279 ± 0.311
HisVal: 1.354 ± 0.289
HisTrp: 0.075 ± 0.072
HisTyr: 1.053 ± 0.283
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.116 ± 0.591
IleCys: 0.226 ± 0.139
IleAsp: 3.762 ± 0.529
IleGlu: 3.311 ± 0.421
IlePhe: 1.806 ± 0.453
IleGly: 5.191 ± 0.723
IleHis: 1.204 ± 0.262
IleIle: 3.687 ± 0.704
IleLys: 4.289 ± 0.698
IleLeu: 3.386 ± 0.374
IleMet: 1.505 ± 0.259
IleAsn: 2.483 ± 0.361
IlePro: 2.107 ± 0.386
IleGln: 1.956 ± 0.372
IleArg: 3.762 ± 0.468
IleSer: 2.709 ± 0.482
IleThr: 3.01 ± 0.478
IleVal: 3.461 ± 0.555
IleTrp: 0.451 ± 0.149
IleTyr: 1.956 ± 0.446
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.148 ± 0.958
LysCys: 0.903 ± 0.292
LysAsp: 3.235 ± 0.503
LysGlu: 5.116 ± 0.778
LysPhe: 1.505 ± 0.266
LysGly: 5.568 ± 0.547
LysHis: 1.655 ± 0.534
LysIle: 2.558 ± 0.371
LysLys: 4.213 ± 0.699
LysLeu: 4.665 ± 0.849
LysMet: 0.978 ± 0.244
LysAsn: 1.881 ± 0.379
LysPro: 3.16 ± 0.461
LysGln: 2.408 ± 0.398
LysArg: 3.837 ± 0.486
LysSer: 4.364 ± 0.56
LysThr: 2.408 ± 0.387
LysVal: 4.966 ± 0.718
LysTrp: 0.903 ± 0.306
LysTyr: 2.332 ± 0.417
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.245 ± 0.971
LeuCys: 1.053 ± 0.327
LeuAsp: 5.417 ± 0.652
LeuGlu: 5.267 ± 0.655
LeuPhe: 2.332 ± 0.49
LeuGly: 5.568 ± 0.638
LeuHis: 1.129 ± 0.328
LeuIle: 4.364 ± 0.406
LeuLys: 5.342 ± 0.702
LeuLeu: 5.417 ± 0.575
LeuMet: 3.01 ± 0.561
LeuAsn: 3.762 ± 0.465
LeuPro: 3.386 ± 0.586
LeuGln: 3.687 ± 0.606
LeuArg: 4.289 ± 0.636
LeuSer: 5.417 ± 0.769
LeuThr: 5.417 ± 0.79
LeuVal: 5.041 ± 0.546
LeuTrp: 1.053 ± 0.241
LeuTyr: 2.257 ± 0.387
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.611 ± 0.386
MetCys: 0.301 ± 0.178
MetAsp: 1.354 ± 0.301
MetGlu: 2.107 ± 0.289
MetPhe: 0.828 ± 0.283
MetGly: 2.332 ± 0.659
MetHis: 0.527 ± 0.289
MetIle: 1.43 ± 0.367
MetLys: 2.332 ± 0.399
MetLeu: 2.859 ± 0.488
MetMet: 0.903 ± 0.252
MetAsn: 1.204 ± 0.266
MetPro: 1.354 ± 0.279
MetGln: 1.505 ± 0.401
MetArg: 1.881 ± 0.486
MetSer: 2.859 ± 0.388
MetThr: 1.881 ± 0.346
MetVal: 2.182 ± 0.448
MetTrp: 0.451 ± 0.169
MetTyr: 0.828 ± 0.251
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.01 ± 0.509
AsnCys: 0.376 ± 0.18
AsnAsp: 2.031 ± 0.339
AsnGlu: 1.806 ± 0.537
AsnPhe: 1.73 ± 0.28
AsnGly: 3.085 ± 0.489
AsnHis: 1.204 ± 0.314
AsnIle: 2.182 ± 0.566
AsnLys: 2.257 ± 0.37
AsnLeu: 3.687 ± 0.404
AsnMet: 0.903 ± 0.247
AsnAsn: 1.73 ± 0.336
AsnPro: 1.73 ± 0.446
AsnGln: 1.806 ± 0.503
AsnArg: 3.311 ± 0.475
AsnSer: 1.806 ± 0.357
AsnThr: 2.709 ± 0.467
AsnVal: 3.01 ± 0.515
AsnTrp: 0.602 ± 0.223
AsnTyr: 1.58 ± 0.408
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.085 ± 0.535
ProCys: 0.376 ± 0.162
ProAsp: 2.483 ± 0.602
ProGlu: 3.235 ± 0.466
ProPhe: 1.204 ± 0.269
ProGly: 1.956 ± 0.426
ProHis: 0.828 ± 0.312
ProIle: 0.903 ± 0.273
ProLys: 1.806 ± 0.42
ProLeu: 2.483 ± 0.357
ProMet: 1.354 ± 0.225
ProAsn: 1.58 ± 0.337
ProPro: 0.677 ± 0.16
ProGln: 1.204 ± 0.26
ProArg: 1.129 ± 0.259
ProSer: 1.881 ± 0.35
ProThr: 2.558 ± 0.476
ProVal: 3.611 ± 0.554
ProTrp: 0.828 ± 0.211
ProTyr: 1.58 ± 0.32
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.289 ± 0.523
GlnCys: 0.451 ± 0.162
GlnAsp: 1.881 ± 0.378
GlnGlu: 2.332 ± 0.449
GlnPhe: 1.655 ± 0.331
GlnGly: 2.182 ± 0.467
GlnHis: 0.828 ± 0.249
GlnIle: 2.483 ± 0.34
GlnLys: 1.73 ± 0.401
GlnLeu: 4.289 ± 0.577
GlnMet: 1.505 ± 0.284
GlnAsn: 1.655 ± 0.295
GlnPro: 1.204 ± 0.369
GlnGln: 2.107 ± 0.465
GlnArg: 2.182 ± 0.475
GlnSer: 2.257 ± 0.393
GlnThr: 2.182 ± 0.392
GlnVal: 2.934 ± 0.368
GlnTrp: 0.752 ± 0.299
GlnTyr: 1.881 ± 0.466
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.665 ± 0.642
ArgCys: 0.602 ± 0.276
ArgAsp: 3.085 ± 0.451
ArgGlu: 4.815 ± 0.664
ArgPhe: 1.73 ± 0.332
ArgGly: 4.74 ± 0.518
ArgHis: 1.279 ± 0.321
ArgIle: 3.837 ± 0.645
ArgLys: 3.085 ± 0.533
ArgLeu: 4.74 ± 0.763
ArgMet: 2.483 ± 0.321
ArgAsn: 2.408 ± 0.367
ArgPro: 1.806 ± 0.417
ArgGln: 2.257 ± 0.392
ArgArg: 3.461 ± 0.523
ArgSer: 2.859 ± 0.42
ArgThr: 2.483 ± 0.463
ArgVal: 3.386 ± 0.437
ArgTrp: 0.752 ± 0.216
ArgTyr: 1.956 ± 0.457
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.966 ± 0.581
SerCys: 0.602 ± 0.199
SerAsp: 2.934 ± 0.395
SerGlu: 3.912 ± 0.607
SerPhe: 2.031 ± 0.353
SerGly: 5.944 ± 0.685
SerHis: 1.58 ± 0.289
SerIle: 3.687 ± 0.508
SerLys: 3.386 ± 0.537
SerLeu: 5.041 ± 0.709
SerMet: 1.806 ± 0.412
SerAsn: 2.483 ± 0.544
SerPro: 1.505 ± 0.248
SerGln: 2.859 ± 0.423
SerArg: 3.01 ± 0.42
SerSer: 2.483 ± 0.549
SerThr: 2.934 ± 0.479
SerVal: 3.988 ± 0.595
SerTrp: 0.978 ± 0.27
SerTyr: 2.332 ± 0.466
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.461 ± 0.557
ThrCys: 0.828 ± 0.248
ThrAsp: 2.107 ± 0.512
ThrGlu: 3.988 ± 0.668
ThrPhe: 2.483 ± 0.511
ThrGly: 4.439 ± 0.576
ThrHis: 1.505 ± 0.459
ThrIle: 3.762 ± 0.612
ThrLys: 3.235 ± 0.389
ThrLeu: 5.267 ± 0.714
ThrMet: 1.505 ± 0.325
ThrAsn: 1.956 ± 0.518
ThrPro: 2.182 ± 0.3
ThrGln: 2.257 ± 0.436
ThrArg: 3.085 ± 0.538
ThrSer: 2.784 ± 0.359
ThrThr: 2.332 ± 0.469
ThrVal: 3.386 ± 0.492
ThrTrp: 0.903 ± 0.253
ThrTyr: 1.505 ± 0.386
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.997 ± 0.807
ValCys: 0.527 ± 0.209
ValAsp: 3.235 ± 0.493
ValGlu: 4.665 ± 0.5
ValPhe: 2.031 ± 0.459
ValGly: 5.267 ± 0.69
ValHis: 1.279 ± 0.359
ValIle: 3.912 ± 0.652
ValLys: 5.267 ± 0.61
ValLeu: 4.665 ± 0.791
ValMet: 2.408 ± 0.354
ValAsn: 3.01 ± 0.401
ValPro: 3.536 ± 0.506
ValGln: 2.257 ± 0.518
ValArg: 3.988 ± 0.614
ValSer: 4.59 ± 0.624
ValThr: 4.138 ± 0.69
ValVal: 4.289 ± 0.617
ValTrp: 0.527 ± 0.201
ValTyr: 2.031 ± 0.371
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.053 ± 0.234
TrpCys: 0.301 ± 0.138
TrpAsp: 1.354 ± 0.334
TrpGlu: 1.053 ± 0.277
TrpPhe: 0.376 ± 0.203
TrpGly: 1.053 ± 0.281
TrpHis: 0.301 ± 0.155
TrpIle: 0.376 ± 0.162
TrpLys: 1.129 ± 0.356
TrpLeu: 0.903 ± 0.236
TrpMet: 0.677 ± 0.161
TrpAsn: 0.376 ± 0.143
TrpPro: 1.279 ± 0.301
TrpGln: 0.978 ± 0.273
TrpArg: 0.677 ± 0.209
TrpSer: 0.978 ± 0.359
TrpThr: 0.301 ± 0.166
TrpVal: 1.279 ± 0.279
TrpTrp: 0.226 ± 0.13
TrpTyr: 0.15 ± 0.115
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.859 ± 0.356
TyrCys: 0.376 ± 0.164
TyrAsp: 1.505 ± 0.282
TyrGlu: 2.709 ± 0.548
TyrPhe: 1.204 ± 0.261
TyrGly: 2.934 ± 0.49
TyrHis: 0.602 ± 0.264
TyrIle: 2.182 ± 0.316
TyrLys: 1.881 ± 0.316
TyrLeu: 3.01 ± 0.463
TyrMet: 0.978 ± 0.245
TyrAsn: 2.408 ± 0.383
TyrPro: 1.279 ± 0.333
TyrGln: 1.279 ± 0.32
TyrArg: 1.129 ± 0.3
TyrSer: 2.182 ± 0.536
TyrThr: 2.408 ± 0.343
TyrVal: 1.505 ± 0.323
TyrTrp: 0.451 ± 0.227
TyrTyr: 0.828 ± 0.282
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 58 proteins (13292 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
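A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration under assumptions, not the procedure actually used by Proteome-pI: the function names are hypothetical, whole proteins are resampled with replacement 100 times, and the reported error is taken to be the sample standard deviation of the resampled frequencies, which the page does not confirm.

```python
import random
import statistics
from collections import Counter


def dipeptide_fraction(proteins, dipeptide):
    """Fraction of all overlapping within-protein pairs equal to `dipeptide`."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Resample whole proteins with replacement and return the spread."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_fraction(sample, dipeptide))
    # Assumption: the error is the sample standard deviation of the estimates.
    return statistics.stdev(estimates)


# Toy sequences in one-letter code, not taken from the phage proteome.
toy_proteome = ["MKAALAAG", "MAAKL", "MGGSAA", "MKLV"]
error = bootstrap_error(toy_proteome, "AA")
print(error)
```

Resampling proteins rather than individual residues keeps each sequence intact, so dipeptides that cluster in a few proteins show a correspondingly larger error.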

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski