Amino acid dipeptide frequency for Escherichia virus LS2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if every dipeptide occurred in proteins with equal probability, each would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so each must be multiplied by 10⁻³ to obtain a fraction.
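The calculation described above can be sketched in a few lines of Python. This is only an illustration, not the code used to build the database: the function names are hypothetical, and it assumes that overlapping residue pairs are counted within each protein (never across protein boundaries), that frequencies are normalised by the total number of counted pairs, and that "more than expected" means strictly above the uniform value of 2.5‰.

```python
from collections import Counter
from itertools import product

# One-letter codes in the same order as the table (Ala, Cys, Asp, ... Tyr).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"  # X stands for Xaa (non-standard or unknown)

# Uniform expectation: 1 of 400 dipeptides, i.e. 0.25% expressed in per mille.
UNIFORM_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides."""
    counts = Counter()
    for seq in proteins:
        # Assumption: every non-standard symbol is mapped to X.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs inside one protein only.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def representation(value_permille):
    """Classify a frequency relative to the uniform expectation."""
    if value_permille > UNIFORM_PERMILLE:
        return "over"   # would correspond to red in the table
    if value_permille < UNIFORM_PERMILLE:
        return "under"  # would correspond to blue in the table
    return "as expected"


if __name__ == "__main__":
    freqs = dipeptide_permille(["MKAAV", "AAGX"])  # toy sequences
    print(freqs["AA"], representation(freqs["AA"]))
```

To turn a per mille value back into a fraction, multiply it by 10⁻³; for example, 9.583‰ for AlaAla corresponds to 0.009583.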

For more information, see the sequence space article on Wikipedia.

Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.583 ± 1.215
AlaCys: 0.44 ± 0.207
AlaAsp: 4.572 ± 0.548
AlaGlu: 5.451 ± 1.02
AlaPhe: 3.253 ± 0.536
AlaGly: 6.77 ± 0.903
AlaHis: 0.528 ± 0.193
AlaIle: 5.979 ± 0.781
AlaLys: 6.858 ± 0.849
AlaLeu: 7.297 ± 1.034
AlaMet: 2.638 ± 0.572
AlaAsn: 2.638 ± 0.432
AlaPro: 2.462 ± 0.71
AlaGln: 2.374 ± 0.575
AlaArg: 4.66 ± 0.796
AlaSer: 5.011 ± 0.785
AlaThr: 4.572 ± 0.638
AlaVal: 5.803 ± 0.871
AlaTrp: 1.055 ± 0.411
AlaTyr: 2.989 ± 0.843
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.791 ± 0.298
CysCys: 0.088 ± 0.088
CysAsp: 0.528 ± 0.335
CysGlu: 0.615 ± 0.214
CysPhe: 0.703 ± 0.375
CysGly: 0.615 ± 0.247
CysHis: 0.176 ± 0.132
CysIle: 0.088 ± 0.1
CysLys: 0.528 ± 0.225
CysLeu: 1.319 ± 0.432
CysMet: 0.352 ± 0.208
CysAsn: 0.176 ± 0.124
CysPro: 0.967 ± 0.462
CysGln: 0.44 ± 0.239
CysArg: 0.879 ± 0.33
CysSer: 1.055 ± 0.445
CysThr: 0.264 ± 0.17
CysVal: 0.703 ± 0.295
CysTrp: 0.352 ± 0.203
CysTyr: 0.44 ± 0.295
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.242 ± 0.782
AspCys: 0.879 ± 0.286
AspAsp: 4.044 ± 0.749
AspGlu: 4.044 ± 0.565
AspPhe: 1.67 ± 0.452
AspGly: 6.154 ± 0.782
AspHis: 1.495 ± 0.414
AspIle: 3.077 ± 0.783
AspLys: 3.605 ± 0.559
AspLeu: 6.418 ± 0.963
AspMet: 2.462 ± 0.666
AspAsn: 2.726 ± 0.499
AspPro: 2.374 ± 0.496
AspGln: 2.022 ± 0.827
AspArg: 2.11 ± 0.462
AspSer: 4.748 ± 0.742
AspThr: 3.605 ± 0.641
AspVal: 3.429 ± 0.59
AspTrp: 1.055 ± 0.384
AspTyr: 2.11 ± 0.456
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.913 ± 1.27
GluCys: 0.264 ± 0.164
GluAsp: 4.66 ± 1.012
GluGlu: 4.396 ± 0.926
GluPhe: 2.901 ± 0.602
GluGly: 3.956 ± 0.772
GluHis: 0.879 ± 0.302
GluIle: 3.253 ± 0.493
GluLys: 2.374 ± 0.596
GluLeu: 5.451 ± 0.967
GluMet: 1.67 ± 0.433
GluAsn: 2.286 ± 0.403
GluPro: 1.846 ± 0.415
GluGln: 2.11 ± 0.522
GluArg: 3.693 ± 0.562
GluSer: 2.638 ± 0.615
GluThr: 3.693 ± 0.573
GluVal: 3.781 ± 0.627
GluTrp: 1.143 ± 0.314
GluTyr: 3.341 ± 0.588
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.374 ± 0.456
PheCys: 0.528 ± 0.377
PheAsp: 3.605 ± 0.518
PheGlu: 1.583 ± 0.321
PhePhe: 0.967 ± 0.287
PheGly: 1.846 ± 0.451
PheHis: 0.44 ± 0.209
PheIle: 1.934 ± 0.544
PheLys: 2.55 ± 0.572
PheLeu: 3.165 ± 0.514
PheMet: 1.055 ± 0.36
PheAsn: 2.638 ± 0.593
PhePro: 1.67 ± 0.45
PheGln: 1.055 ± 0.437
PheArg: 1.846 ± 0.488
PheSer: 2.813 ± 0.434
PheThr: 2.11 ± 0.366
PheVal: 2.638 ± 0.648
PheTrp: 0.264 ± 0.153
PheTyr: 0.879 ± 0.29
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.627 ± 0.918
GlyCys: 0.615 ± 0.273
GlyAsp: 4.396 ± 0.953
GlyGlu: 4.748 ± 0.703
GlyPhe: 2.022 ± 0.472
GlyGly: 4.484 ± 0.672
GlyHis: 0.528 ± 0.258
GlyIle: 3.693 ± 0.539
GlyLys: 4.924 ± 0.924
GlyLeu: 6.066 ± 0.63
GlyMet: 2.11 ± 0.392
GlyAsn: 2.55 ± 0.682
GlyPro: 1.67 ± 0.456
GlyGln: 2.022 ± 0.403
GlyArg: 5.539 ± 0.778
GlySer: 4.924 ± 1.006
GlyThr: 4.66 ± 0.651
GlyVal: 5.187 ± 0.811
GlyTrp: 0.967 ± 0.284
GlyTyr: 2.813 ± 0.741
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.703 ± 0.336
HisCys: 0.44 ± 0.222
HisAsp: 1.319 ± 0.483
HisGlu: 0.967 ± 0.423
HisPhe: 0.44 ± 0.237
HisGly: 1.407 ± 0.403
HisHis: 0.264 ± 0.192
HisIle: 1.055 ± 0.344
HisLys: 1.143 ± 0.386
HisLeu: 1.495 ± 0.509
HisMet: 0.264 ± 0.154
HisAsn: 0.615 ± 0.275
HisPro: 0.264 ± 0.147
HisGln: 0.615 ± 0.363
HisArg: 0.352 ± 0.161
HisSer: 1.055 ± 0.337
HisThr: 1.231 ± 0.374
HisVal: 0.967 ± 0.266
HisTrp: 0.44 ± 0.212
HisTyr: 0.703 ± 0.299
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.253 ± 0.634
IleCys: 0.703 ± 0.298
IleAsp: 3.253 ± 0.464
IleGlu: 2.813 ± 0.562
IlePhe: 1.495 ± 0.373
IleGly: 3.517 ± 0.773
IleHis: 1.143 ± 0.38
IleIle: 2.638 ± 0.612
IleLys: 3.253 ± 0.701
IleLeu: 2.989 ± 0.524
IleMet: 0.967 ± 0.283
IleAsn: 3.781 ± 0.649
IlePro: 1.495 ± 0.49
IleGln: 2.11 ± 0.589
IleArg: 2.901 ± 0.429
IleSer: 2.374 ± 0.553
IleThr: 4.748 ± 1.046
IleVal: 4.836 ± 0.513
IleTrp: 0.703 ± 0.352
IleTyr: 1.846 ± 0.412
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.034 ± 0.942
LysCys: 0.528 ± 0.236
LysAsp: 3.781 ± 0.573
LysGlu: 4.044 ± 0.544
LysPhe: 2.374 ± 0.556
LysGly: 3.605 ± 0.602
LysHis: 0.791 ± 0.4
LysIle: 2.198 ± 0.589
LysLys: 3.517 ± 0.802
LysLeu: 5.011 ± 0.736
LysMet: 1.319 ± 0.433
LysAsn: 1.758 ± 0.464
LysPro: 2.11 ± 0.582
LysGln: 2.11 ± 0.579
LysArg: 3.693 ± 0.602
LysSer: 4.396 ± 0.573
LysThr: 3.253 ± 0.445
LysVal: 5.891 ± 0.927
LysTrp: 1.495 ± 0.549
LysTyr: 2.462 ± 0.505
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.946 ± 0.78
LeuCys: 0.791 ± 0.308
LeuAsp: 4.044 ± 0.527
LeuGlu: 5.803 ± 1.08
LeuPhe: 2.022 ± 0.38
LeuGly: 4.572 ± 0.553
LeuHis: 1.319 ± 0.473
LeuIle: 3.605 ± 0.893
LeuLys: 6.33 ± 0.767
LeuLeu: 5.627 ± 0.666
LeuMet: 3.517 ± 0.586
LeuAsn: 4.044 ± 0.699
LeuPro: 3.077 ± 0.4
LeuGln: 4.836 ± 0.725
LeuArg: 7.209 ± 0.945
LeuSer: 5.891 ± 1.312
LeuThr: 6.682 ± 1.158
LeuVal: 5.715 ± 0.787
LeuTrp: 0.967 ± 0.383
LeuTyr: 2.286 ± 0.418
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.341 ± 0.507
MetCys: 0.44 ± 0.232
MetAsp: 2.286 ± 0.678
MetGlu: 2.286 ± 0.487
MetPhe: 1.583 ± 0.533
MetGly: 2.022 ± 0.427
MetHis: 0.528 ± 0.247
MetIle: 1.055 ± 0.263
MetLys: 0.879 ± 0.291
MetLeu: 2.813 ± 0.485
MetMet: 0.791 ± 0.27
MetAsn: 1.319 ± 0.409
MetPro: 0.967 ± 0.261
MetGln: 0.703 ± 0.258
MetArg: 1.407 ± 0.384
MetSer: 1.495 ± 0.369
MetThr: 2.11 ± 0.38
MetVal: 3.165 ± 0.671
MetTrp: 0.615 ± 0.316
MetTyr: 0.791 ± 0.265
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.484 ± 0.79
AsnCys: 0.44 ± 0.246
AsnAsp: 2.374 ± 0.63
AsnGlu: 1.319 ± 0.271
AsnPhe: 1.055 ± 0.254
AsnGly: 4.836 ± 0.826
AsnHis: 0.791 ± 0.254
AsnIle: 2.813 ± 0.556
AsnLys: 2.989 ± 0.551
AsnLeu: 3.605 ± 0.567
AsnMet: 0.967 ± 0.382
AsnAsn: 2.638 ± 0.706
AsnPro: 2.462 ± 0.558
AsnGln: 1.407 ± 0.418
AsnArg: 2.726 ± 0.626
AsnSer: 2.989 ± 0.893
AsnThr: 2.286 ± 0.632
AsnVal: 2.286 ± 0.58
AsnTrp: 0.088 ± 0.091
AsnTyr: 2.286 ± 0.523
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.901 ± 0.556
ProCys: 0.703 ± 0.299
ProAsp: 2.55 ± 0.646
ProGlu: 3.341 ± 0.765
ProPhe: 1.583 ± 0.387
ProGly: 1.143 ± 0.336
ProHis: 0.44 ± 0.213
ProIle: 1.583 ± 0.485
ProLys: 2.813 ± 0.741
ProLeu: 2.286 ± 0.508
ProMet: 1.495 ± 0.473
ProAsn: 2.022 ± 0.546
ProPro: 0.352 ± 0.206
ProGln: 1.055 ± 0.348
ProArg: 2.11 ± 0.466
ProSer: 3.605 ± 0.657
ProThr: 2.198 ± 0.433
ProVal: 2.989 ± 0.509
ProTrp: 0.44 ± 0.203
ProTyr: 0.791 ± 0.287
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.868 ± 0.557
GlnCys: 0.176 ± 0.137
GlnAsp: 2.726 ± 0.801
GlnGlu: 2.374 ± 0.488
GlnPhe: 1.846 ± 0.409
GlnGly: 2.022 ± 0.601
GlnHis: 0.615 ± 0.278
GlnIle: 1.143 ± 0.374
GlnLys: 2.11 ± 0.516
GlnLeu: 2.638 ± 0.661
GlnMet: 1.231 ± 0.435
GlnAsn: 1.67 ± 0.413
GlnPro: 0.967 ± 0.363
GlnGln: 1.583 ± 0.467
GlnArg: 2.55 ± 0.883
GlnSer: 2.55 ± 0.544
GlnThr: 2.022 ± 0.618
GlnVal: 2.726 ± 0.553
GlnTrp: 0.44 ± 0.229
GlnTyr: 1.143 ± 0.399
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.693 ± 0.648
ArgCys: 0.615 ± 0.263
ArgAsp: 4.132 ± 0.837
ArgGlu: 2.989 ± 0.484
ArgPhe: 1.758 ± 0.351
ArgGly: 3.429 ± 0.626
ArgHis: 1.055 ± 0.334
ArgIle: 4.396 ± 1.023
ArgLys: 3.253 ± 0.661
ArgLeu: 6.33 ± 0.606
ArgMet: 1.583 ± 0.446
ArgAsn: 2.286 ± 0.611
ArgPro: 2.022 ± 0.441
ArgGln: 1.846 ± 0.484
ArgArg: 3.781 ± 0.772
ArgSer: 4.132 ± 0.635
ArgThr: 3.868 ± 0.78
ArgVal: 3.605 ± 0.679
ArgTrp: 1.583 ± 0.42
ArgTyr: 2.11 ± 0.629
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.693 ± 0.708
SerCys: 1.495 ± 0.53
SerAsp: 4.572 ± 0.484
SerGlu: 3.429 ± 0.536
SerPhe: 3.341 ± 0.662
SerGly: 5.627 ± 1.196
SerHis: 2.11 ± 0.598
SerIle: 3.077 ± 0.606
SerLys: 2.638 ± 0.451
SerLeu: 5.363 ± 0.902
SerMet: 1.846 ± 0.39
SerAsn: 3.429 ± 0.76
SerPro: 3.165 ± 0.558
SerGln: 2.374 ± 0.646
SerArg: 2.901 ± 0.645
SerSer: 4.836 ± 0.778
SerThr: 4.22 ± 0.756
SerVal: 3.868 ± 0.574
SerTrp: 1.407 ± 0.48
SerTyr: 2.901 ± 0.619
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.341 ± 0.63
ThrCys: 0.528 ± 0.225
ThrAsp: 5.187 ± 0.643
ThrGlu: 4.22 ± 0.601
ThrPhe: 3.341 ± 0.521
ThrGly: 5.187 ± 0.974
ThrHis: 0.615 ± 0.296
ThrIle: 3.253 ± 0.778
ThrLys: 3.781 ± 0.645
ThrLeu: 6.77 ± 0.776
ThrMet: 2.726 ± 0.727
ThrAsn: 2.022 ± 0.584
ThrPro: 3.077 ± 0.696
ThrGln: 2.462 ± 0.707
ThrArg: 3.077 ± 0.738
ThrSer: 3.781 ± 0.842
ThrThr: 3.341 ± 0.69
ThrVal: 5.187 ± 0.741
ThrTrp: 0.44 ± 0.238
ThrTyr: 1.143 ± 0.305
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.836 ± 0.776
ValCys: 0.791 ± 0.294
ValAsp: 3.956 ± 0.627
ValGlu: 4.924 ± 0.645
ValPhe: 1.934 ± 0.445
ValGly: 4.748 ± 0.747
ValHis: 0.879 ± 0.399
ValIle: 3.429 ± 0.667
ValLys: 5.011 ± 0.761
ValLeu: 5.539 ± 0.793
ValMet: 2.198 ± 0.575
ValAsn: 3.165 ± 0.643
ValPro: 3.341 ± 0.685
ValGln: 3.341 ± 0.443
ValArg: 4.396 ± 0.784
ValSer: 5.275 ± 0.961
ValThr: 5.011 ± 1.015
ValVal: 6.33 ± 1.179
ValTrp: 0.967 ± 0.25
ValTyr: 2.286 ± 0.561
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.528 ± 0.232
TrpCys: 0.088 ± 0.101
TrpAsp: 0.615 ± 0.24
TrpGlu: 0.967 ± 0.305
TrpPhe: 0.615 ± 0.268
TrpGly: 0.703 ± 0.272
TrpHis: 0.528 ± 0.305
TrpIle: 0.44 ± 0.184
TrpLys: 1.055 ± 0.383
TrpLeu: 2.374 ± 0.526
TrpMet: 0.528 ± 0.213
TrpAsn: 1.143 ± 0.34
TrpPro: 0.352 ± 0.24
TrpGln: 0.615 ± 0.207
TrpArg: 0.528 ± 0.219
TrpSer: 0.703 ± 0.362
TrpThr: 1.319 ± 0.458
TrpVal: 1.319 ± 0.416
TrpTrp: 0.264 ± 0.183
TrpTyr: 0.352 ± 0.174
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.781 ± 0.597
TyrCys: 0.528 ± 0.362
TyrAsp: 1.758 ± 0.381
TyrGlu: 1.934 ± 0.563
TyrPhe: 1.055 ± 0.358
TyrGly: 2.901 ± 0.487
TyrHis: 0.615 ± 0.264
TyrIle: 2.11 ± 0.523
TyrLys: 1.758 ± 0.505
TyrLeu: 2.726 ± 0.389
TyrMet: 0.791 ± 0.356
TyrAsn: 1.934 ± 0.475
TyrPro: 1.758 ± 0.461
TyrGln: 1.319 ± 0.509
TyrArg: 2.11 ± 0.617
TyrSer: 2.022 ± 0.35
TyrThr: 2.286 ± 0.489
TyrVal: 1.934 ± 0.556
TyrTrp: 0.352 ± 0.185
TyrTyr: 1.231 ± 0.424
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 44 proteins (11375 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
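A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration under stated assumptions, not the database's actual procedure: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the reported error is taken to be the sample standard deviation of the replicate estimates. The function names and the fixed seed are hypothetical.

```python
import random
import statistics
from collections import Counter


def permille(proteins, dipeptide):
    """Frequency of one dipeptide in per mille over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins (not residues) with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(permille(sample, dipeptide))
    # Assumption: the error is the standard deviation across replicates.
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["MKAAV", "AAGAA", "MKKLV", "GAVAA"]  # toy sequences
    print(permille(toy, "AA"), bootstrap_error(toy, "AA"))
```

Resampling at the protein level keeps the residues of each protein together, so the estimate reflects variation between proteins rather than between individual positions.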

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski