Amino acid dipepetide frequency for Wuhan horsefly Virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.054AlaAla: 1.054 ± 0.556
1.054AlaCys: 1.054 ± 0.23
2.107AlaAsp: 2.107 ± 0.227
1.264AlaGlu: 1.264 ± 0.588
1.475AlaPhe: 1.475 ± 0.484
1.264AlaGly: 1.264 ± 0.993
0.632AlaHis: 0.632 ± 0.321
2.95AlaIle: 2.95 ± 1.102
2.528AlaLys: 2.528 ± 0.676
2.528AlaLeu: 2.528 ± 0.744
0.843AlaMet: 0.843 ± 0.259
1.264AlaAsn: 1.264 ± 0.336
1.264AlaPro: 1.264 ± 0.604
1.054AlaGln: 1.054 ± 0.556
1.264AlaArg: 1.264 ± 0.59
3.371AlaSer: 3.371 ± 0.717
2.318AlaThr: 2.318 ± 0.306
1.054AlaVal: 1.054 ± 0.298
0.211AlaTrp: 0.211 ± 0.107
1.475AlaTyr: 1.475 ± 0.473
0.0AlaXaa: 0.0 ± 0.0
Cys
0.421CysAla: 0.421 ± 0.214
0.0CysCys: 0.0 ± 0.0
1.264CysAsp: 1.264 ± 0.573
0.843CysGlu: 0.843 ± 0.382
0.0CysPhe: 0.0 ± 0.0
0.843CysGly: 0.843 ± 0.513
0.211CysHis: 0.211 ± 0.107
1.264CysIle: 1.264 ± 0.375
1.264CysLys: 1.264 ± 0.887
3.582CysLeu: 3.582 ± 0.924
0.211CysMet: 0.211 ± 0.49
0.421CysAsn: 0.421 ± 0.191
0.211CysPro: 0.211 ± 0.107
0.843CysGln: 0.843 ± 0.207
1.686CysArg: 1.686 ± 0.414
2.107CysSer: 2.107 ± 0.423
1.475CysThr: 1.475 ± 1.144
1.475CysVal: 1.475 ± 1.144
0.0CysTrp: 0.0 ± 0.0
0.632CysTyr: 0.632 ± 0.443
0.0CysXaa: 0.0 ± 0.0
Asp
1.896AspAla: 1.896 ± 0.485
1.475AspCys: 1.475 ± 0.361
2.95AspAsp: 2.95 ± 0.721
5.689AspGlu: 5.689 ± 1.297
3.793AspPhe: 3.793 ± 0.561
2.528AspGly: 2.528 ± 0.892
0.0AspHis: 0.0 ± 0.0
4.846AspIle: 4.846 ± 1.05
5.268AspLys: 5.268 ± 2.003
6.11AspLeu: 6.11 ± 0.67
1.896AspMet: 1.896 ± 0.482
3.582AspAsn: 3.582 ± 0.689
2.528AspPro: 2.528 ± 0.795
1.264AspGln: 1.264 ± 0.375
2.739AspArg: 2.739 ± 0.847
5.057AspSer: 5.057 ± 1.054
1.264AspThr: 1.264 ± 0.187
2.318AspVal: 2.318 ± 0.072
1.264AspTrp: 1.264 ± 0.365
1.054AspTyr: 1.054 ± 0.236
0.0AspXaa: 0.0 ± 0.0
Glu
2.318GluAla: 2.318 ± 0.628
0.421GluCys: 0.421 ± 0.214
5.9GluAsp: 5.9 ± 0.654
5.9GluGlu: 5.9 ± 1.515
2.95GluPhe: 2.95 ± 0.947
2.739GluGly: 2.739 ± 0.523
1.054GluHis: 1.054 ± 0.631
5.9GluIle: 5.9 ± 0.767
8.007GluLys: 8.007 ± 1.412
7.164GluLeu: 7.164 ± 0.699
1.896GluMet: 1.896 ± 0.5
5.057GluAsn: 5.057 ± 0.358
1.264GluPro: 1.264 ± 0.365
2.95GluGln: 2.95 ± 0.639
2.739GluArg: 2.739 ± 0.533
6.532GluSer: 6.532 ± 0.499
3.161GluThr: 3.161 ± 0.966
3.582GluVal: 3.582 ± 0.875
0.632GluTrp: 0.632 ± 0.321
1.686GluTyr: 1.686 ± 0.855
0.0GluXaa: 0.0 ± 0.0
Phe
0.421PheAla: 0.421 ± 0.191
0.843PheCys: 0.843 ± 0.382
2.739PheAsp: 2.739 ± 0.728
2.107PheGlu: 2.107 ± 0.227
1.896PhePhe: 1.896 ± 0.332
1.264PheGly: 1.264 ± 0.375
1.054PheHis: 1.054 ± 0.556
5.057PheIle: 5.057 ± 0.68
4.214PheLys: 4.214 ± 1.32
4.214PheLeu: 4.214 ± 2.004
1.264PheMet: 1.264 ± 0.375
3.371PheAsn: 3.371 ± 0.717
1.896PhePro: 1.896 ± 0.503
1.264PheGln: 1.264 ± 0.336
2.528PheArg: 2.528 ± 0.399
4.846PheSer: 4.846 ± 0.346
4.214PheThr: 4.214 ± 0.743
1.896PheVal: 1.896 ± 0.178
0.421PheTrp: 0.421 ± 0.214
2.107PheTyr: 2.107 ± 0.252
0.0PheXaa: 0.0 ± 0.0
Gly
1.475GlyAla: 1.475 ± 0.893
0.843GlyCys: 0.843 ± 0.382
2.95GlyAsp: 2.95 ± 0.482
2.95GlyGlu: 2.95 ± 0.494
2.739GlyPhe: 2.739 ± 1.275
1.475GlyGly: 1.475 ± 0.341
0.632GlyHis: 0.632 ± 0.168
3.161GlyIle: 3.161 ± 0.477
3.793GlyLys: 3.793 ± 0.299
3.582GlyLeu: 3.582 ± 0.791
0.843GlyMet: 0.843 ± 0.336
2.318GlyAsn: 2.318 ± 0.399
1.475GlyPro: 1.475 ± 0.585
0.632GlyGln: 0.632 ± 0.261
1.264GlyArg: 1.264 ± 0.187
2.739GlySer: 2.739 ± 1.098
0.843GlyThr: 0.843 ± 0.207
2.318GlyVal: 2.318 ± 0.659
0.632GlyTrp: 0.632 ± 0.321
1.896GlyTyr: 1.896 ± 0.485
0.0GlyXaa: 0.0 ± 0.0
His
0.843HisAla: 0.843 ± 0.428
0.421HisCys: 0.421 ± 0.279
1.054HisAsp: 1.054 ± 0.283
1.054HisGlu: 1.054 ± 0.515
2.318HisPhe: 2.318 ± 0.072
1.264HisGly: 1.264 ± 0.375
0.0HisHis: 0.0 ± 0.0
2.318HisIle: 2.318 ± 0.072
1.264HisLys: 1.264 ± 0.332
3.161HisLeu: 3.161 ± 0.911
0.843HisMet: 0.843 ± 0.336
1.054HisAsn: 1.054 ± 0.298
1.475HisPro: 1.475 ± 0.572
0.843HisGln: 0.843 ± 0.757
0.632HisArg: 0.632 ± 0.261
1.896HisSer: 1.896 ± 0.457
0.421HisThr: 0.421 ± 0.279
1.054HisVal: 1.054 ± 0.669
0.211HisTrp: 0.211 ± 0.107
1.475HisTyr: 1.475 ± 0.307
0.0HisXaa: 0.0 ± 0.0
Ile
2.95IleAla: 2.95 ± 0.178
2.318IleCys: 2.318 ± 0.675
4.003IleAsp: 4.003 ± 0.378
6.743IleGlu: 6.743 ± 1.576
5.057IlePhe: 5.057 ± 1.028
4.425IleGly: 4.425 ± 0.528
2.739IleHis: 2.739 ± 0.698
8.428IleIle: 8.428 ± 1.441
8.007IleLys: 8.007 ± 1.571
9.06IleLeu: 9.06 ± 1.463
3.371IleMet: 3.371 ± 0.247
3.582IleAsn: 3.582 ± 1.43
3.161IlePro: 3.161 ± 0.377
3.582IleGln: 3.582 ± 0.421
3.161IleArg: 3.161 ± 0.95
7.585IleSer: 7.585 ± 2.32
5.268IleThr: 5.268 ± 0.715
5.478IleVal: 5.478 ± 1.312
0.421IleTrp: 0.421 ± 0.505
2.318IleTyr: 2.318 ± 0.675
0.0IleXaa: 0.0 ± 0.0
Lys
4.214LysAla: 4.214 ± 1.25
1.475LysCys: 1.475 ± 0.82
5.057LysAsp: 5.057 ± 0.596
5.9LysGlu: 5.9 ± 1.251
3.371LysPhe: 3.371 ± 0.247
2.739LysGly: 2.739 ± 0.675
2.107LysHis: 2.107 ± 0.603
8.428LysIle: 8.428 ± 1.475
5.268LysLys: 5.268 ± 1.717
7.796LysLeu: 7.796 ± 0.223
2.95LysMet: 2.95 ± 0.659
5.057LysAsn: 5.057 ± 2.035
2.318LysPro: 2.318 ± 0.565
2.107LysGln: 2.107 ± 0.584
4.214LysArg: 4.214 ± 0.785
5.689LysSer: 5.689 ± 0.768
5.9LysThr: 5.9 ± 0.356
2.107LysVal: 2.107 ± 0.687
1.264LysTrp: 1.264 ± 0.642
4.425LysTyr: 4.425 ± 0.213
0.0LysXaa: 0.0 ± 0.0
Leu
3.582LeuAla: 3.582 ± 0.977
1.896LeuCys: 1.896 ± 0.485
7.375LeuAsp: 7.375 ± 0.933
4.846LeuGlu: 4.846 ± 0.947
2.107LeuPhe: 2.107 ± 0.524
4.003LeuGly: 4.003 ± 1.176
3.161LeuHis: 3.161 ± 0.61
7.796LeuIle: 7.796 ± 1.289
9.06LeuLys: 9.06 ± 2.061
6.953LeuLeu: 6.953 ± 0.693
2.528LeuMet: 2.528 ± 0.372
5.9LeuAsn: 5.9 ± 1.17
2.318LeuPro: 2.318 ± 0.886
3.793LeuGln: 3.793 ± 0.356
5.268LeuArg: 5.268 ± 0.213
9.692LeuSer: 9.692 ± 0.334
4.425LeuThr: 4.425 ± 1.441
5.689LeuVal: 5.689 ± 0.758
1.054LeuTrp: 1.054 ± 0.283
4.425LeuTyr: 4.425 ± 1.046
0.0LeuXaa: 0.0 ± 0.0
Met
1.475MetAla: 1.475 ± 0.956
0.211MetCys: 0.211 ± 0.107
1.686MetAsp: 1.686 ± 0.29
2.528MetGlu: 2.528 ± 0.663
1.475MetPhe: 1.475 ± 0.193
1.264MetGly: 1.264 ± 0.375
0.421MetHis: 0.421 ± 0.279
1.896MetIle: 1.896 ± 0.482
2.528MetLys: 2.528 ± 0.75
2.95MetLeu: 2.95 ± 0.86
1.686MetMet: 1.686 ± 0.54
1.475MetAsn: 1.475 ± 0.58
0.632MetPro: 0.632 ± 0.321
0.843MetGln: 0.843 ± 0.259
0.632MetArg: 0.632 ± 0.168
2.318MetSer: 2.318 ± 0.303
2.318MetThr: 2.318 ± 0.886
2.318MetVal: 2.318 ± 0.941
0.211MetTrp: 0.211 ± 0.107
0.421MetTyr: 0.421 ± 0.214
0.0MetXaa: 0.0 ± 0.0
Asn
2.107AsnAla: 2.107 ± 0.535
1.475AsnCys: 1.475 ± 1.479
3.371AsnAsp: 3.371 ± 0.581
3.582AsnGlu: 3.582 ± 1.082
4.214AsnPhe: 4.214 ± 0.956
2.739AsnGly: 2.739 ± 1.074
1.686AsnHis: 1.686 ± 0.275
3.793AsnIle: 3.793 ± 1.355
6.11AsnLys: 6.11 ± 0.387
5.9AsnLeu: 5.9 ± 0.638
1.896AsnMet: 1.896 ± 0.503
4.425AsnAsn: 4.425 ± 1.296
3.371AsnPro: 3.371 ± 1.356
2.107AsnGln: 2.107 ± 0.567
2.107AsnArg: 2.107 ± 0.573
4.003AsnSer: 4.003 ± 0.677
2.318AsnThr: 2.318 ± 0.415
1.475AsnVal: 1.475 ± 0.361
0.211AsnTrp: 0.211 ± 0.107
2.107AsnTyr: 2.107 ± 0.603
0.0AsnXaa: 0.0 ± 0.0
Pro
0.421ProAla: 0.421 ± 0.303
0.421ProCys: 0.421 ± 0.403
1.896ProAsp: 1.896 ± 0.476
2.95ProGlu: 2.95 ± 0.494
2.739ProPhe: 2.739 ± 0.847
1.686ProGly: 1.686 ± 1.068
1.054ProHis: 1.054 ± 0.535
4.003ProIle: 4.003 ± 0.677
2.528ProLys: 2.528 ± 0.443
1.686ProLeu: 1.686 ± 0.575
1.054ProMet: 1.054 ± 0.558
1.264ProAsn: 1.264 ± 0.642
0.421ProPro: 0.421 ± 0.214
1.054ProGln: 1.054 ± 0.283
1.896ProArg: 1.896 ± 0.729
3.582ProSer: 3.582 ± 1.141
1.896ProThr: 1.896 ± 0.476
1.475ProVal: 1.475 ± 0.833
0.632ProTrp: 0.632 ± 0.443
2.107ProTyr: 2.107 ± 0.435
0.0ProXaa: 0.0 ± 0.0
Gln
1.054GlnAla: 1.054 ± 0.972
0.421GlnCys: 0.421 ± 0.191
1.475GlnAsp: 1.475 ± 0.594
3.161GlnGlu: 3.161 ± 0.616
1.054GlnPhe: 1.054 ± 0.236
0.632GlnGly: 0.632 ± 0.671
0.843GlnHis: 0.843 ± 0.607
3.582GlnIle: 3.582 ± 0.664
1.686GlnLys: 1.686 ± 0.386
3.582GlnLeu: 3.582 ± 0.861
1.686GlnMet: 1.686 ± 0.301
1.686GlnAsn: 1.686 ± 0.249
1.264GlnPro: 1.264 ± 0.249
1.054GlnGln: 1.054 ± 0.348
1.475GlnArg: 1.475 ± 0.572
2.528GlnSer: 2.528 ± 0.374
1.264GlnThr: 1.264 ± 0.522
1.475GlnVal: 1.475 ± 0.362
0.0GlnTrp: 0.0 ± 0.0
1.686GlnTyr: 1.686 ± 0.909
0.0GlnXaa: 0.0 ± 0.0
Arg
1.686ArgAla: 1.686 ± 0.413
0.421ArgCys: 0.421 ± 0.191
2.739ArgAsp: 2.739 ± 0.759
4.425ArgGlu: 4.425 ± 1.302
2.528ArgPhe: 2.528 ± 0.773
1.475ArgGly: 1.475 ± 0.594
1.475ArgHis: 1.475 ± 0.529
4.003ArgIle: 4.003 ± 1.282
4.003ArgLys: 4.003 ± 0.968
2.95ArgLeu: 2.95 ± 0.98
0.421ArgMet: 0.421 ± 0.214
2.739ArgAsn: 2.739 ± 0.764
1.686ArgPro: 1.686 ± 0.855
0.632ArgGln: 0.632 ± 0.261
0.632ArgArg: 0.632 ± 0.289
4.425ArgSer: 4.425 ± 0.541
2.739ArgThr: 2.739 ± 0.312
2.95ArgVal: 2.95 ± 1.204
0.632ArgTrp: 0.632 ± 0.168
1.475ArgTyr: 1.475 ± 0.484
0.0ArgXaa: 0.0 ± 0.0
Ser
2.528SerAla: 2.528 ± 0.395
2.528SerCys: 2.528 ± 1.145
4.846SerAsp: 4.846 ± 0.878
6.743SerGlu: 6.743 ± 1.219
4.003SerPhe: 4.003 ± 0.677
2.528SerGly: 2.528 ± 0.613
2.528SerHis: 2.528 ± 1.3
9.06SerIle: 9.06 ± 0.52
6.11SerLys: 6.11 ± 0.461
8.217SerLeu: 8.217 ± 1.257
1.896SerMet: 1.896 ± 0.178
4.846SerAsn: 4.846 ± 0.704
1.686SerPro: 1.686 ± 0.515
2.528SerGln: 2.528 ± 0.395
4.425SerArg: 4.425 ± 0.831
10.535SerSer: 10.535 ± 1.235
4.425SerThr: 4.425 ± 1.431
4.846SerVal: 4.846 ± 1.335
0.421SerTrp: 0.421 ± 0.505
5.268SerTyr: 5.268 ± 0.77
0.0SerXaa: 0.0 ± 0.0
Thr
1.054ThrAla: 1.054 ± 0.23
1.054ThrCys: 1.054 ± 0.631
1.896ThrAsp: 1.896 ± 0.482
4.425ThrGlu: 4.425 ± 0.475
2.528ThrPhe: 2.528 ± 0.072
1.896ThrGly: 1.896 ± 0.868
0.843ThrHis: 0.843 ± 0.336
5.057ThrIle: 5.057 ± 0.826
3.161ThrLys: 3.161 ± 0.677
6.11ThrLeu: 6.11 ± 0.625
1.054ThrMet: 1.054 ± 0.535
3.371ThrAsn: 3.371 ± 0.984
3.161ThrPro: 3.161 ± 0.538
2.739ThrGln: 2.739 ± 0.328
2.528ThrArg: 2.528 ± 0.602
4.214ThrSer: 4.214 ± 1.683
3.371ThrThr: 3.371 ± 1.051
1.475ThrVal: 1.475 ± 0.448
0.843ThrTrp: 0.843 ± 0.258
2.318ThrTyr: 2.318 ± 1.59
0.0ThrXaa: 0.0 ± 0.0
Val
0.421ValAla: 0.421 ± 0.527
1.054ValCys: 1.054 ± 0.631
2.318ValAsp: 2.318 ± 1.177
3.582ValGlu: 3.582 ± 0.602
1.264ValPhe: 1.264 ± 0.522
2.318ValGly: 2.318 ± 0.621
1.475ValHis: 1.475 ± 0.72
5.268ValIle: 5.268 ± 1.476
4.003ValLys: 4.003 ± 0.803
3.161ValLeu: 3.161 ± 0.77
1.054ValMet: 1.054 ± 0.298
3.371ValAsn: 3.371 ± 0.244
2.528ValPro: 2.528 ± 0.613
1.054ValGln: 1.054 ± 0.348
2.318ValArg: 2.318 ± 0.487
5.689ValSer: 5.689 ± 0.926
2.95ValThr: 2.95 ± 1.458
1.896ValVal: 1.896 ± 0.503
0.421ValTrp: 0.421 ± 0.191
1.896ValTyr: 1.896 ± 0.332
0.0ValXaa: 0.0 ± 0.0
Trp
0.211TrpAla: 0.211 ± 0.26
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.211TrpGlu: 0.211 ± 0.107
0.421TrpPhe: 0.421 ± 0.191
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
1.054TrpIle: 1.054 ± 0.535
1.475TrpLys: 1.475 ± 0.307
2.318TrpLeu: 2.318 ± 0.524
0.211TrpMet: 0.211 ± 0.107
1.054TrpAsn: 1.054 ± 0.515
0.0TrpPro: 0.0 ± 0.0
0.211TrpGln: 0.211 ± 0.107
0.843TrpArg: 0.843 ± 0.315
0.632TrpSer: 0.632 ± 0.321
0.421TrpThr: 0.421 ± 0.214
0.421TrpVal: 0.421 ± 0.214
0.0TrpTrp: 0.0 ± 0.0
0.211TrpTyr: 0.211 ± 0.26
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.843TyrAla: 0.843 ± 0.336
0.632TyrCys: 0.632 ± 0.168
1.896TyrAsp: 1.896 ± 0.718
2.95TyrGlu: 2.95 ± 0.947
1.686TyrPhe: 1.686 ± 0.506
1.475TyrGly: 1.475 ± 0.72
1.686TyrHis: 1.686 ± 0.764
3.793TyrIle: 3.793 ± 0.638
2.107TyrLys: 2.107 ± 0.252
4.846TyrLeu: 4.846 ± 0.631
1.475TyrMet: 1.475 ± 0.728
3.161TyrAsn: 3.161 ± 1.047
2.107TyrPro: 2.107 ± 0.758
1.054TyrGln: 1.054 ± 0.669
1.686TyrArg: 1.686 ± 0.676
2.528TyrSer: 2.528 ± 0.866
2.107TyrThr: 2.107 ± 0.954
2.739TyrVal: 2.739 ± 1.102
0.211TyrTrp: 0.211 ± 0.107
2.739TyrTyr: 2.739 ± 0.483
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (4747 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski