Amino acid dipepetide frequency for Escherichia phage vB_EcoS_XY2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.812AlaAla: 10.812 ± 1.468
1.242AlaCys: 1.242 ± 0.343
6.429AlaAsp: 6.429 ± 0.727
7.963AlaGlu: 7.963 ± 1.064
3.945AlaPhe: 3.945 ± 0.638
7.67AlaGly: 7.67 ± 0.648
1.461AlaHis: 1.461 ± 0.292
5.698AlaIle: 5.698 ± 0.661
5.406AlaLys: 5.406 ± 0.605
9.058AlaLeu: 9.058 ± 0.683
1.826AlaMet: 1.826 ± 0.456
3.799AlaAsn: 3.799 ± 0.435
3.726AlaPro: 3.726 ± 0.483
3.506AlaGln: 3.506 ± 0.876
4.456AlaArg: 4.456 ± 0.568
6.136AlaSer: 6.136 ± 0.776
6.209AlaThr: 6.209 ± 0.846
6.282AlaVal: 6.282 ± 0.681
1.753AlaTrp: 1.753 ± 0.292
3.287AlaTyr: 3.287 ± 0.407
0.0AlaXaa: 0.0 ± 0.0
Cys
1.242CysAla: 1.242 ± 0.272
0.219CysCys: 0.219 ± 0.143
0.95CysAsp: 0.95 ± 0.238
1.242CysGlu: 1.242 ± 0.393
0.365CysPhe: 0.365 ± 0.171
1.242CysGly: 1.242 ± 0.311
0.438CysHis: 0.438 ± 0.175
0.511CysIle: 0.511 ± 0.181
0.584CysLys: 0.584 ± 0.181
0.731CysLeu: 0.731 ± 0.242
0.219CysMet: 0.219 ± 0.114
0.219CysAsn: 0.219 ± 0.126
0.219CysPro: 0.219 ± 0.11
0.365CysGln: 0.365 ± 0.158
0.511CysArg: 0.511 ± 0.182
0.365CysSer: 0.365 ± 0.177
0.877CysThr: 0.877 ± 0.209
0.584CysVal: 0.584 ± 0.202
0.146CysTrp: 0.146 ± 0.105
0.438CysTyr: 0.438 ± 0.2
0.0CysXaa: 0.0 ± 0.0
Asp
6.429AspAla: 6.429 ± 0.769
0.95AspCys: 0.95 ± 0.296
4.456AspAsp: 4.456 ± 0.583
4.529AspGlu: 4.529 ± 0.613
2.776AspPhe: 2.776 ± 0.39
5.625AspGly: 5.625 ± 0.645
0.804AspHis: 0.804 ± 0.263
3.872AspIle: 3.872 ± 0.417
3.068AspLys: 3.068 ± 0.482
4.529AspLeu: 4.529 ± 0.558
1.607AspMet: 1.607 ± 0.282
2.118AspAsn: 2.118 ± 0.494
1.826AspPro: 1.826 ± 0.356
1.023AspGln: 1.023 ± 0.285
2.922AspArg: 2.922 ± 0.443
2.411AspSer: 2.411 ± 0.389
4.675AspThr: 4.675 ± 0.646
3.872AspVal: 3.872 ± 0.422
0.877AspTrp: 0.877 ± 0.22
2.265AspTyr: 2.265 ± 0.439
0.0AspXaa: 0.0 ± 0.0
Glu
6.721GluAla: 6.721 ± 0.842
0.731GluCys: 0.731 ± 0.242
3.433GluAsp: 3.433 ± 0.502
5.844GluGlu: 5.844 ± 0.933
2.922GluPhe: 2.922 ± 0.542
4.456GluGly: 4.456 ± 0.556
1.096GluHis: 1.096 ± 0.337
2.776GluIle: 2.776 ± 0.433
4.31GluLys: 4.31 ± 0.711
6.209GluLeu: 6.209 ± 0.637
2.922GluMet: 2.922 ± 0.49
2.045GluAsn: 2.045 ± 0.393
2.192GluPro: 2.192 ± 0.383
3.433GluGln: 3.433 ± 0.78
3.726GluArg: 3.726 ± 0.625
3.068GluSer: 3.068 ± 0.522
3.945GluThr: 3.945 ± 0.496
5.187GluVal: 5.187 ± 0.61
1.242GluTrp: 1.242 ± 0.308
2.411GluTyr: 2.411 ± 0.43
0.0GluXaa: 0.0 ± 0.0
Phe
2.703PheAla: 2.703 ± 0.414
0.657PheCys: 0.657 ± 0.246
3.58PheAsp: 3.58 ± 0.648
1.972PheGlu: 1.972 ± 0.446
0.877PhePhe: 0.877 ± 0.26
3.141PheGly: 3.141 ± 0.488
0.584PheHis: 0.584 ± 0.2
2.63PheIle: 2.63 ± 0.467
1.826PheLys: 1.826 ± 0.363
1.972PheLeu: 1.972 ± 0.356
0.584PheMet: 0.584 ± 0.194
1.899PheAsn: 1.899 ± 0.362
1.388PhePro: 1.388 ± 0.369
1.534PheGln: 1.534 ± 0.343
2.192PheArg: 2.192 ± 0.313
2.557PheSer: 2.557 ± 0.529
3.141PheThr: 3.141 ± 0.546
2.411PheVal: 2.411 ± 0.486
0.877PheTrp: 0.877 ± 0.268
1.242PheTyr: 1.242 ± 0.274
0.0PheXaa: 0.0 ± 0.0
Gly
7.67GlyAla: 7.67 ± 0.723
0.95GlyCys: 0.95 ± 0.248
4.675GlyAsp: 4.675 ± 0.617
4.748GlyGlu: 4.748 ± 0.743
3.653GlyPhe: 3.653 ± 0.452
6.209GlyGly: 6.209 ± 0.766
1.096GlyHis: 1.096 ± 0.366
2.995GlyIle: 2.995 ± 0.477
4.894GlyLys: 4.894 ± 0.722
5.479GlyLeu: 5.479 ± 0.603
2.118GlyMet: 2.118 ± 0.428
3.506GlyAsn: 3.506 ± 0.55
2.118GlyPro: 2.118 ± 0.4
2.411GlyGln: 2.411 ± 0.478
3.872GlyArg: 3.872 ± 0.57
5.479GlySer: 5.479 ± 0.682
4.602GlyThr: 4.602 ± 0.8
5.552GlyVal: 5.552 ± 0.54
1.169GlyTrp: 1.169 ± 0.288
2.63GlyTyr: 2.63 ± 0.543
0.0GlyXaa: 0.0 ± 0.0
His
1.461HisAla: 1.461 ± 0.387
0.511HisCys: 0.511 ± 0.187
1.096HisAsp: 1.096 ± 0.249
0.804HisGlu: 0.804 ± 0.282
0.804HisPhe: 0.804 ± 0.231
1.242HisGly: 1.242 ± 0.313
0.731HisHis: 0.731 ± 0.239
1.096HisIle: 1.096 ± 0.246
1.169HisLys: 1.169 ± 0.323
1.315HisLeu: 1.315 ± 0.323
0.292HisMet: 0.292 ± 0.135
0.804HisAsn: 0.804 ± 0.276
0.877HisPro: 0.877 ± 0.315
0.804HisGln: 0.804 ± 0.202
0.877HisArg: 0.877 ± 0.284
0.584HisSer: 0.584 ± 0.177
0.657HisThr: 0.657 ± 0.266
1.169HisVal: 1.169 ± 0.297
0.219HisTrp: 0.219 ± 0.134
0.511HisTyr: 0.511 ± 0.195
0.0HisXaa: 0.0 ± 0.0
Ile
5.844IleAla: 5.844 ± 0.792
0.877IleCys: 0.877 ± 0.24
3.945IleAsp: 3.945 ± 0.569
3.068IleGlu: 3.068 ± 0.382
1.315IlePhe: 1.315 ± 0.348
3.726IleGly: 3.726 ± 0.488
0.511IleHis: 0.511 ± 0.196
2.265IleIle: 2.265 ± 0.315
3.36IleLys: 3.36 ± 0.601
3.506IleLeu: 3.506 ± 0.481
0.95IleMet: 0.95 ± 0.297
2.995IleAsn: 2.995 ± 0.506
2.776IlePro: 2.776 ± 0.447
1.68IleGln: 1.68 ± 0.355
2.776IleArg: 2.776 ± 0.341
3.653IleSer: 3.653 ± 0.626
4.237IleThr: 4.237 ± 0.637
4.529IleVal: 4.529 ± 0.415
0.731IleTrp: 0.731 ± 0.248
1.315IleTyr: 1.315 ± 0.278
0.0IleXaa: 0.0 ± 0.0
Lys
5.99LysAla: 5.99 ± 0.932
0.438LysCys: 0.438 ± 0.249
2.922LysAsp: 2.922 ± 0.484
3.872LysGlu: 3.872 ± 0.729
2.118LysPhe: 2.118 ± 0.373
3.872LysGly: 3.872 ± 0.474
1.461LysHis: 1.461 ± 0.282
2.338LysIle: 2.338 ± 0.424
2.63LysLys: 2.63 ± 0.457
4.821LysLeu: 4.821 ± 0.652
2.63LysMet: 2.63 ± 0.555
2.484LysAsn: 2.484 ± 0.472
2.338LysPro: 2.338 ± 0.463
2.338LysGln: 2.338 ± 0.554
3.726LysArg: 3.726 ± 0.529
2.849LysSer: 2.849 ± 0.617
3.653LysThr: 3.653 ± 0.454
3.287LysVal: 3.287 ± 0.574
0.731LysTrp: 0.731 ± 0.202
2.338LysTyr: 2.338 ± 0.364
0.0LysXaa: 0.0 ± 0.0
Leu
8.474LeuAla: 8.474 ± 0.818
0.731LeuCys: 0.731 ± 0.25
3.945LeuAsp: 3.945 ± 0.493
5.479LeuGlu: 5.479 ± 0.81
2.411LeuPhe: 2.411 ± 0.429
5.479LeuGly: 5.479 ± 0.498
1.242LeuHis: 1.242 ± 0.297
5.26LeuIle: 5.26 ± 0.481
3.36LeuLys: 3.36 ± 0.548
4.821LeuLeu: 4.821 ± 0.628
1.461LeuMet: 1.461 ± 0.276
3.214LeuAsn: 3.214 ± 0.456
3.726LeuPro: 3.726 ± 0.533
2.922LeuGln: 2.922 ± 0.512
5.26LeuArg: 5.26 ± 0.666
4.237LeuSer: 4.237 ± 0.508
5.552LeuThr: 5.552 ± 0.752
4.383LeuVal: 4.383 ± 0.571
0.95LeuTrp: 0.95 ± 0.31
1.972LeuTyr: 1.972 ± 0.275
0.0LeuXaa: 0.0 ± 0.0
Met
2.557MetAla: 2.557 ± 0.399
0.438MetCys: 0.438 ± 0.141
0.877MetAsp: 0.877 ± 0.266
1.315MetGlu: 1.315 ± 0.361
0.95MetPhe: 0.95 ± 0.298
1.753MetGly: 1.753 ± 0.352
0.365MetHis: 0.365 ± 0.179
1.315MetIle: 1.315 ± 0.279
1.607MetLys: 1.607 ± 0.384
1.972MetLeu: 1.972 ± 0.355
0.511MetMet: 0.511 ± 0.189
0.877MetAsn: 0.877 ± 0.251
1.169MetPro: 1.169 ± 0.276
0.657MetGln: 0.657 ± 0.206
1.242MetArg: 1.242 ± 0.259
1.826MetSer: 1.826 ± 0.381
2.045MetThr: 2.045 ± 0.408
2.118MetVal: 2.118 ± 0.371
0.292MetTrp: 0.292 ± 0.169
0.804MetTyr: 0.804 ± 0.228
0.0MetXaa: 0.0 ± 0.0
Asn
4.748AsnAla: 4.748 ± 0.764
0.365AsnCys: 0.365 ± 0.166
2.265AsnAsp: 2.265 ± 0.319
2.411AsnGlu: 2.411 ± 0.423
0.804AsnPhe: 0.804 ± 0.221
3.506AsnGly: 3.506 ± 0.437
0.657AsnHis: 0.657 ± 0.209
2.338AsnIle: 2.338 ± 0.494
2.265AsnLys: 2.265 ± 0.455
3.068AsnLeu: 3.068 ± 0.516
0.804AsnMet: 0.804 ± 0.285
2.338AsnAsn: 2.338 ± 0.404
1.68AsnPro: 1.68 ± 0.393
1.461AsnGln: 1.461 ± 0.298
2.338AsnArg: 2.338 ± 0.347
2.557AsnSer: 2.557 ± 0.352
2.484AsnThr: 2.484 ± 0.471
3.653AsnVal: 3.653 ± 0.445
0.584AsnTrp: 0.584 ± 0.209
1.315AsnTyr: 1.315 ± 0.278
0.0AsnXaa: 0.0 ± 0.0
Pro
3.653ProAla: 3.653 ± 0.553
0.511ProCys: 0.511 ± 0.202
2.995ProAsp: 2.995 ± 0.535
3.872ProGlu: 3.872 ± 0.469
1.972ProPhe: 1.972 ± 0.419
2.557ProGly: 2.557 ± 0.509
0.657ProHis: 0.657 ± 0.222
1.753ProIle: 1.753 ± 0.39
1.461ProLys: 1.461 ± 0.32
2.703ProLeu: 2.703 ± 0.53
0.511ProMet: 0.511 ± 0.218
1.388ProAsn: 1.388 ± 0.296
1.023ProPro: 1.023 ± 0.26
1.607ProGln: 1.607 ± 0.292
1.68ProArg: 1.68 ± 0.292
2.557ProSer: 2.557 ± 0.48
1.972ProThr: 1.972 ± 0.354
3.799ProVal: 3.799 ± 0.493
0.219ProTrp: 0.219 ± 0.115
1.169ProTyr: 1.169 ± 0.301
0.0ProXaa: 0.0 ± 0.0
Gln
3.653GlnAla: 3.653 ± 0.633
0.584GlnCys: 0.584 ± 0.186
2.192GlnAsp: 2.192 ± 0.383
2.63GlnGlu: 2.63 ± 0.437
0.877GlnPhe: 0.877 ± 0.225
1.826GlnGly: 1.826 ± 0.306
1.096GlnHis: 1.096 ± 0.312
2.265GlnIle: 2.265 ± 0.48
2.411GlnLys: 2.411 ± 0.488
3.214GlnLeu: 3.214 ± 0.539
1.169GlnMet: 1.169 ± 0.245
1.534GlnAsn: 1.534 ± 0.455
1.461GlnPro: 1.461 ± 0.3
2.63GlnGln: 2.63 ± 0.764
2.192GlnArg: 2.192 ± 0.36
1.972GlnSer: 1.972 ± 0.365
1.899GlnThr: 1.899 ± 0.384
2.484GlnVal: 2.484 ± 0.479
0.731GlnTrp: 0.731 ± 0.259
1.607GlnTyr: 1.607 ± 0.308
0.0GlnXaa: 0.0 ± 0.0
Arg
4.675ArgAla: 4.675 ± 0.616
0.365ArgCys: 0.365 ± 0.134
2.995ArgAsp: 2.995 ± 0.486
3.726ArgGlu: 3.726 ± 0.609
1.899ArgPhe: 1.899 ± 0.339
3.214ArgGly: 3.214 ± 0.505
0.95ArgHis: 0.95 ± 0.227
3.653ArgIle: 3.653 ± 0.565
4.091ArgLys: 4.091 ± 0.693
3.799ArgLeu: 3.799 ± 0.508
1.534ArgMet: 1.534 ± 0.298
2.703ArgAsn: 2.703 ± 0.356
1.607ArgPro: 1.607 ± 0.34
2.995ArgGln: 2.995 ± 0.396
4.31ArgArg: 4.31 ± 0.608
2.922ArgSer: 2.922 ± 0.351
2.849ArgThr: 2.849 ± 0.36
3.506ArgVal: 3.506 ± 0.409
0.804ArgTrp: 0.804 ± 0.264
1.826ArgTyr: 1.826 ± 0.344
0.0ArgXaa: 0.0 ± 0.0
Ser
5.698SerAla: 5.698 ± 0.736
0.438SerCys: 0.438 ± 0.168
2.63SerAsp: 2.63 ± 0.328
2.922SerGlu: 2.922 ± 0.413
2.63SerPhe: 2.63 ± 0.516
6.429SerGly: 6.429 ± 0.967
1.023SerHis: 1.023 ± 0.261
3.068SerIle: 3.068 ± 0.672
3.506SerLys: 3.506 ± 0.51
4.529SerLeu: 4.529 ± 0.574
1.096SerMet: 1.096 ± 0.265
3.068SerAsn: 3.068 ± 0.413
2.338SerPro: 2.338 ± 0.484
1.607SerGln: 1.607 ± 0.317
3.36SerArg: 3.36 ± 0.482
2.703SerSer: 2.703 ± 0.442
4.456SerThr: 4.456 ± 0.745
5.041SerVal: 5.041 ± 0.558
0.511SerTrp: 0.511 ± 0.145
1.972SerTyr: 1.972 ± 0.348
0.0SerXaa: 0.0 ± 0.0
Thr
6.867ThrAla: 6.867 ± 0.929
0.365ThrCys: 0.365 ± 0.16
4.237ThrAsp: 4.237 ± 0.435
3.799ThrGlu: 3.799 ± 0.468
3.068ThrPhe: 3.068 ± 0.715
6.136ThrGly: 6.136 ± 1.065
0.877ThrHis: 0.877 ± 0.246
3.433ThrIle: 3.433 ± 0.406
3.872ThrLys: 3.872 ± 0.488
4.456ThrLeu: 4.456 ± 0.592
1.388ThrMet: 1.388 ± 0.353
1.972ThrAsn: 1.972 ± 0.492
3.945ThrPro: 3.945 ± 0.602
1.899ThrGln: 1.899 ± 0.334
2.849ThrArg: 2.849 ± 0.432
4.675ThrSer: 4.675 ± 0.909
4.237ThrThr: 4.237 ± 0.856
4.748ThrVal: 4.748 ± 0.82
0.804ThrTrp: 0.804 ± 0.247
2.411ThrTyr: 2.411 ± 0.401
0.0ThrXaa: 0.0 ± 0.0
Val
6.94ValAla: 6.94 ± 0.692
0.584ValCys: 0.584 ± 0.172
4.164ValAsp: 4.164 ± 0.438
5.333ValGlu: 5.333 ± 0.853
1.972ValPhe: 1.972 ± 0.406
4.164ValGly: 4.164 ± 0.417
0.95ValHis: 0.95 ± 0.22
4.383ValIle: 4.383 ± 0.559
3.653ValLys: 3.653 ± 0.487
4.967ValLeu: 4.967 ± 0.717
1.753ValMet: 1.753 ± 0.408
2.703ValAsn: 2.703 ± 0.55
2.118ValPro: 2.118 ± 0.418
3.506ValGln: 3.506 ± 0.5
3.506ValArg: 3.506 ± 0.433
5.479ValSer: 5.479 ± 0.596
5.698ValThr: 5.698 ± 0.674
5.625ValVal: 5.625 ± 0.756
1.169ValTrp: 1.169 ± 0.304
2.849ValTyr: 2.849 ± 0.418
0.0ValXaa: 0.0 ± 0.0
Trp
1.315TrpAla: 1.315 ± 0.371
0.219TrpCys: 0.219 ± 0.124
0.95TrpAsp: 0.95 ± 0.304
0.657TrpGlu: 0.657 ± 0.193
0.95TrpPhe: 0.95 ± 0.268
0.804TrpGly: 0.804 ± 0.226
0.365TrpHis: 0.365 ± 0.193
0.219TrpIle: 0.219 ± 0.138
0.95TrpLys: 0.95 ± 0.267
1.753TrpLeu: 1.753 ± 0.319
0.657TrpMet: 0.657 ± 0.207
0.657TrpAsn: 0.657 ± 0.211
0.365TrpPro: 0.365 ± 0.193
0.584TrpGln: 0.584 ± 0.264
0.877TrpArg: 0.877 ± 0.342
0.657TrpSer: 0.657 ± 0.255
0.804TrpThr: 0.804 ± 0.208
0.95TrpVal: 0.95 ± 0.24
0.073TrpTrp: 0.073 ± 0.066
0.365TrpTyr: 0.365 ± 0.151
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.287TyrAla: 3.287 ± 0.454
0.292TyrCys: 0.292 ± 0.138
1.899TyrAsp: 1.899 ± 0.493
2.484TyrGlu: 2.484 ± 0.427
1.461TyrPhe: 1.461 ± 0.362
2.776TyrGly: 2.776 ± 0.44
0.657TyrHis: 0.657 ± 0.195
2.192TyrIle: 2.192 ± 0.437
2.411TyrLys: 2.411 ± 0.379
1.972TyrLeu: 1.972 ± 0.357
0.584TyrMet: 0.584 ± 0.191
1.315TyrAsn: 1.315 ± 0.273
1.169TyrPro: 1.169 ± 0.363
1.461TyrGln: 1.461 ± 0.24
1.753TyrArg: 1.753 ± 0.356
2.338TyrSer: 2.338 ± 0.411
2.045TyrThr: 2.045 ± 0.462
2.338TyrVal: 2.338 ± 0.414
0.292TyrTrp: 0.292 ± 0.139
1.461TyrTyr: 1.461 ± 0.258
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 73 proteins (13690 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski