Amino acid dipeptide frequency for Wad Medani virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.
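The calculation described above can be sketched in a few lines of Python. This is only an illustration, not the database's own code: the function names, the toy sequences, the use of overlapping residue pairs that never span two proteins, and the comparison against the uniform expectation of 2.5‰ are all assumptions made for the example.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 standard amino acids
UNIFORM_PERMILLE = 1000.0 / 400    # 2.5 per mille, i.e. 0.25 %


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all given sequences."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard letters becomes X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Assumption: overlapping pairs, counted within each protein only.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def label(freq_permille, expected=UNIFORM_PERMILLE):
    """Classify a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


# Hypothetical toy input, not taken from the Wad Medani virus proteome.
toy = ["MKALAL", "ALXA"]
freqs = dipeptide_permille(toy)
for dp, f in sorted(freqs.items()):
    print(dp, round(f, 3), label(f))
```

With the values from the table, `label(10.398)` (AlaAla) gives `"over"` and `label(0.812)` (AlaCys) gives `"under"`.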

Rows give the first residue of the dipeptide, columns the second; each cell is frequency ± error (‰).

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 10.398 ± 1.844 | 0.812 ± 0.296 | 4.712 ± 0.752 | 5.686 ± 0.816 | 2.437 ± 0.564 | 3.737 ± 1.03 | 2.762 ± 0.708 | 4.387 ± 1.173 | 3.574 ± 1.187 | 13.647 ± 1.826 | 2.6 ± 0.658 | 2.762 ± 0.86 | 4.874 ± 0.863 | 5.199 ± 0.828 | 6.824 ± 0.822 | 6.661 ± 1.157 | 6.986 ± 1.444 | 7.474 ± 1.116 | 1.3 ± 0.293 | 3.249 ± 0.541 | 0.0 ± 0.0
Cys | 1.95 ± 0.851 | 0.325 ± 0.25 | 0.487 ± 0.237 | 0.812 ± 0.333 | 0.812 ± 0.308 | 0.975 ± 0.622 | 0.65 ± 0.314 | 0.65 ± 0.288 | 0.325 ± 0.228 | 1.137 ± 0.774 | 0.162 ± 0.166 | 0.162 ± 0.129 | 0.487 ± 0.271 | 0.487 ± 0.391 | 1.137 ± 0.64 | 0.812 ± 0.338 | 0.487 ± 0.237 | 1.137 ± 0.439 | 0.325 ± 0.21 | 0.812 ± 0.425 | 0.0 ± 0.0
Asp | 6.661 ± 1.245 | 0.812 ± 0.401 | 3.737 ± 0.894 | 2.924 ± 0.574 | 2.762 ± 0.615 | 4.549 ± 0.802 | 0.975 ± 0.269 | 2.6 ± 0.625 | 1.3 ± 0.432 | 6.661 ± 1.247 | 0.812 ± 0.311 | 0.487 ± 0.339 | 4.062 ± 0.669 | 2.275 ± 0.704 | 3.899 ± 0.736 | 2.762 ± 0.761 | 3.737 ± 0.626 | 5.361 ± 1.006 | 0.162 ± 0.174 | 1.462 ± 0.422 | 0.0 ± 0.0
Glu | 4.387 ± 0.859 | 0.975 ± 0.49 | 3.412 ± 0.758 | 4.062 ± 1.253 | 1.787 ± 0.635 | 3.412 ± 0.594 | 1.137 ± 0.414 | 2.437 ± 0.577 | 3.574 ± 0.723 | 3.574 ± 0.9 | 2.6 ± 0.595 | 1.3 ± 0.676 | 2.762 ± 0.47 | 0.65 ± 0.306 | 5.037 ± 0.848 | 3.412 ± 0.542 | 3.574 ± 0.759 | 3.899 ± 0.789 | 1.3 ± 0.338 | 1.625 ± 0.519 | 0.0 ± 0.0
Phe | 2.762 ± 0.554 | 0.975 ± 0.476 | 2.6 ± 0.762 | 1.787 ± 0.517 | 1.95 ± 0.351 | 1.625 ± 0.405 | 1.3 ± 0.426 | 1.95 ± 0.488 | 0.65 ± 0.268 | 2.112 ± 0.767 | 0.487 ± 0.271 | 0.812 ± 0.274 | 1.95 ± 0.532 | 1.137 ± 0.443 | 3.412 ± 0.606 | 3.087 ± 0.967 | 1.95 ± 0.705 | 2.437 ± 0.422 | 0.812 ± 0.391 | 0.812 ± 0.28 | 0.0 ± 0.0
Gly | 6.986 ± 0.934 | 0.487 ± 0.425 | 4.874 ± 1.424 | 2.762 ± 0.648 | 2.6 ± 0.453 | 3.737 ± 1.094 | 1.625 ± 0.478 | 2.6 ± 0.869 | 0.975 ± 0.465 | 4.549 ± 0.881 | 3.087 ± 0.635 | 1.462 ± 0.4 | 4.062 ± 0.777 | 1.3 ± 0.32 | 4.549 ± 0.612 | 4.062 ± 0.606 | 2.6 ± 0.646 | 4.874 ± 0.785 | 0.325 ± 0.213 | 2.112 ± 0.62 | 0.0 ± 0.0
His | 2.762 ± 0.932 | 0.487 ± 0.301 | 1.787 ± 0.517 | 0.487 ± 0.388 | 0.487 ± 0.391 | 1.462 ± 0.504 | 0.487 ± 0.32 | 1.137 ± 0.51 | 0.325 ± 0.183 | 1.787 ± 0.629 | 0.65 ± 0.277 | 0.812 ± 0.372 | 2.112 ± 0.727 | 0.975 ± 0.302 | 2.762 ± 0.651 | 1.787 ± 0.536 | 1.462 ± 0.576 | 3.412 ± 0.675 | 1.137 ± 0.512 | 0.162 ± 0.129 | 0.0 ± 0.0
Ile | 4.062 ± 0.937 | 0.65 ± 0.211 | 2.762 ± 0.438 | 3.737 ± 0.759 | 2.275 ± 0.626 | 2.275 ± 0.756 | 1.787 ± 0.575 | 2.924 ± 0.665 | 1.625 ± 0.566 | 5.361 ± 0.865 | 0.812 ± 0.379 | 1.137 ± 0.475 | 2.437 ± 0.68 | 1.3 ± 0.291 | 5.199 ± 0.996 | 3.412 ± 0.865 | 2.275 ± 0.709 | 2.437 ± 0.543 | 0.65 ± 0.256 | 1.95 ± 0.374 | 0.0 ± 0.0
Lys | 1.787 ± 0.904 | 0.65 ± 0.42 | 1.462 ± 0.506 | 2.6 ± 0.736 | 0.487 ± 0.233 | 1.462 ± 0.261 | 1.462 ± 0.423 | 1.787 ± 0.753 | 2.762 ± 1.038 | 2.762 ± 0.635 | 1.95 ± 0.527 | 1.787 ± 0.325 | 0.325 ± 0.222 | 1.137 ± 0.307 | 1.787 ± 0.495 | 1.3 ± 0.523 | 1.95 ± 0.505 | 2.275 ± 0.396 | 0.162 ± 0.17 | 0.65 ± 0.308 | 0.0 ± 0.0
Leu | 12.348 ± 1.475 | 0.975 ± 0.436 | 4.224 ± 0.846 | 5.037 ± 0.683 | 3.899 ± 0.915 | 6.011 ± 0.586 | 2.762 ± 0.786 | 3.249 ± 0.999 | 3.412 ± 0.621 | 9.423 ± 1.175 | 1.625 ± 0.414 | 3.249 ± 0.453 | 6.011 ± 0.772 | 3.737 ± 0.944 | 9.261 ± 1.105 | 8.286 ± 1.208 | 5.199 ± 0.912 | 4.224 ± 0.81 | 1.787 ± 0.63 | 1.787 ± 0.643 | 0.0 ± 0.0
Met | 3.249 ± 1.015 | 1.137 ± 0.52 | 1.625 ± 0.686 | 1.625 ± 0.399 | 0.975 ± 0.436 | 1.625 ± 0.394 | 0.65 ± 0.336 | 1.462 ± 0.511 | 0.812 ± 0.334 | 3.249 ± 0.779 | 1.137 ± 0.641 | 0.487 ± 0.257 | 0.975 ± 0.448 | 0.812 ± 0.491 | 2.6 ± 0.366 | 2.437 ± 0.675 | 2.275 ± 0.759 | 0.65 ± 0.34 | 0.162 ± 0.202 | 0.975 ± 0.454 | 0.0 ± 0.0
Asn | 3.899 ± 0.806 | 0.162 ± 0.17 | 1.95 ± 0.481 | 2.762 ± 0.519 | 0.812 ± 0.298 | 1.625 ± 0.554 | 0.487 ± 0.248 | 1.787 ± 0.548 | 0.162 ± 0.172 | 1.95 ± 0.571 | 0.325 ± 0.364 | 0.65 ± 0.317 | 1.462 ± 0.485 | 0.975 ± 0.498 | 2.275 ± 0.767 | 1.625 ± 0.484 | 2.112 ± 0.546 | 2.275 ± 0.45 | 0.162 ± 0.174 | 1.137 ± 0.622 | 0.0 ± 0.0
Pro | 5.037 ± 1.207 | 0.975 ± 0.471 | 3.412 ± 0.685 | 3.249 ± 0.572 | 1.625 ± 0.32 | 3.249 ± 1.031 | 1.95 ± 0.621 | 2.112 ± 0.403 | 0.487 ± 0.386 | 7.311 ± 1.046 | 0.975 ± 0.545 | 2.112 ± 0.572 | 5.524 ± 1.365 | 1.787 ± 0.444 | 3.737 ± 0.849 | 3.087 ± 0.862 | 3.574 ± 0.873 | 4.062 ± 0.846 | 0.975 ± 0.379 | 2.275 ± 0.429 | 0.0 ± 0.0
Gln | 1.787 ± 0.661 | 0.325 ± 0.248 | 1.787 ± 0.284 | 0.975 ± 0.341 | 0.812 ± 0.25 | 1.95 ± 0.479 | 1.625 ± 0.489 | 1.95 ± 0.536 | 1.3 ± 0.493 | 3.574 ± 0.506 | 1.787 ± 0.681 | 1.137 ± 0.42 | 1.462 ± 0.392 | 0.812 ± 0.34 | 3.087 ± 1.449 | 2.437 ± 0.511 | 2.275 ± 0.619 | 2.275 ± 0.652 | 0.162 ± 0.172 | 1.137 ± 0.377 | 0.0 ± 0.0
Arg | 10.398 ± 1.076 | 1.137 ± 0.441 | 6.174 ± 1.076 | 6.336 ± 1.524 | 2.437 ± 0.735 | 5.524 ± 1.11 | 1.625 ± 0.624 | 3.574 ± 0.672 | 2.437 ± 0.709 | 6.824 ± 0.8 | 2.6 ± 0.597 | 3.249 ± 0.436 | 4.387 ± 0.64 | 3.087 ± 0.782 | 7.149 ± 2.154 | 6.011 ± 0.988 | 4.549 ± 1.026 | 7.149 ± 1.166 | 0.975 ± 0.445 | 1.462 ± 0.537 | 0.0 ± 0.0
Ser | 6.824 ± 1.407 | 0.975 ± 0.394 | 4.387 ± 0.832 | 3.574 ± 0.844 | 2.6 ± 0.987 | 4.874 ± 0.974 | 1.462 ± 0.602 | 5.037 ± 0.7 | 2.112 ± 0.523 | 6.336 ± 0.793 | 2.112 ± 0.459 | 2.437 ± 0.705 | 3.737 ± 1.116 | 2.112 ± 0.735 | 7.311 ± 1.183 | 4.712 ± 1.145 | 2.762 ± 0.844 | 4.062 ± 0.467 | 0.812 ± 0.353 | 1.787 ± 0.701 | 0.0 ± 0.0
Thr | 4.712 ± 0.919 | 0.325 ± 0.259 | 2.762 ± 0.641 | 2.6 ± 0.833 | 2.6 ± 1.081 | 4.224 ± 0.867 | 1.137 ± 0.436 | 3.574 ± 0.992 | 1.462 ± 0.594 | 5.849 ± 1.44 | 1.3 ± 0.515 | 1.625 ± 0.402 | 5.037 ± 0.996 | 1.625 ± 0.48 | 6.011 ± 0.65 | 5.524 ± 0.973 | 2.924 ± 0.629 | 3.249 ± 0.696 | 0.812 ± 0.299 | 2.275 ± 0.511 | 0.0 ± 0.0
Val | 6.174 ± 0.552 | 0.487 ± 0.293 | 4.062 ± 0.854 | 2.6 ± 0.791 | 2.112 ± 0.467 | 4.224 ± 0.783 | 0.812 ± 0.333 | 2.6 ± 0.62 | 1.787 ± 0.654 | 6.499 ± 1.098 | 2.6 ± 0.795 | 1.625 ± 0.518 | 4.387 ± 0.863 | 2.6 ± 0.707 | 7.799 ± 0.89 | 5.849 ± 0.861 | 4.387 ± 1.232 | 2.6 ± 0.587 | 1.3 ± 0.559 | 3.249 ± 0.758 | 0.0 ± 0.0
Trp | 0.975 ± 0.406 | 0.65 ± 0.5 | 0.812 ± 0.25 | 0.325 ± 0.203 | 0.487 ± 0.259 | 0.812 ± 0.344 | 0.487 ± 0.189 | 1.625 ± 1.056 | 0.812 ± 0.353 | 1.137 ± 0.351 | 0.487 ± 0.213 | 0.325 ± 0.257 | 0.325 ± 0.202 | 0.325 ± 0.21 | 1.3 ± 0.449 | 0.65 ± 0.305 | 0.975 ± 0.41 | 0.812 ± 0.357 | 0.325 ± 0.221 | 0.162 ± 0.172 | 0.0 ± 0.0
Tyr | 2.924 ± 0.791 | 0.812 ± 0.305 | 0.975 ± 0.329 | 0.975 ± 0.357 | 0.65 ± 0.279 | 2.275 ± 0.752 | 1.3 ± 0.353 | 1.625 ± 0.52 | 0.65 ± 0.272 | 2.762 ± 0.603 | 0.65 ± 0.324 | 1.137 ± 0.592 | 1.137 ± 0.658 | 0.325 ± 0.242 | 1.95 ± 0.381 | 2.112 ± 0.479 | 3.574 ± 0.73 | 3.087 ± 0.267 | 0.162 ± 0.172 | 0.975 ± 0.436 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 12 proteins (6156 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
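A protein-level bootstrap of this kind might look like the sketch below. It is an illustration under stated assumptions rather than the procedure actually used for this table: the helper names are hypothetical, whole proteins are resampled with replacement, the frequency is recomputed in each of 100 rounds, and the sample standard deviation of those estimates is taken as the error.

```python
import random
from collections import Counter
from statistics import stdev


def dipeptide_permille(proteins):
    """Frequencies (per mille) of overlapping residue pairs within each protein."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Estimate the error of one dipeptide frequency by resampling proteins."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as the proteome has, with replacement.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(dipeptide_permille(sample).get(dipeptide, 0.0))
    # Assumption: the reported error is the spread of the bootstrap estimates.
    return stdev(estimates)


# Hypothetical toy proteome, not the 12 Wad Medani virus proteins.
toy = ["MKALAL", "ALAAGG", "GGKKAL", "LLLLAA"]
print(round(bootstrap_error(toy, "AL"), 3))
```

Because whole proteins are resampled, a dipeptide concentrated in one or two proteins gets a larger error than one spread evenly across the proteome.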

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski