Amino acid dipepetide frequency for Wuhan Louse Fly Virus 10

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.732AlaAla: 2.732 ± 1.292
1.093AlaCys: 1.093 ± 0.311
3.005AlaAsp: 3.005 ± 0.638
3.005AlaGlu: 3.005 ± 0.732
1.366AlaPhe: 1.366 ± 0.764
1.913AlaGly: 1.913 ± 0.785
0.273AlaHis: 0.273 ± 0.393
4.098AlaIle: 4.098 ± 2.335
2.459AlaLys: 2.459 ± 0.962
3.552AlaLeu: 3.552 ± 1.57
0.0AlaMet: 0.0 ± 0.0
1.639AlaAsn: 1.639 ± 0.278
0.273AlaPro: 0.273 ± 0.393
3.005AlaGln: 3.005 ± 0.515
1.366AlaArg: 1.366 ± 0.676
3.825AlaSer: 3.825 ± 1.607
1.093AlaThr: 1.093 ± 0.611
2.459AlaVal: 2.459 ± 0.813
0.273AlaTrp: 0.273 ± 0.406
1.093AlaTyr: 1.093 ± 0.332
0.0AlaXaa: 0.0 ± 0.0
Cys
0.546CysAla: 0.546 ± 0.305
0.0CysCys: 0.0 ± 0.0
1.093CysAsp: 1.093 ± 0.356
1.093CysGlu: 1.093 ± 0.346
0.546CysPhe: 0.546 ± 0.316
0.546CysGly: 0.546 ± 0.237
0.82CysHis: 0.82 ± 0.527
1.093CysIle: 1.093 ± 0.474
0.546CysLys: 0.546 ± 0.237
2.186CysLeu: 2.186 ± 0.946
0.273CysMet: 0.273 ± 0.406
0.82CysAsn: 0.82 ± 0.458
0.273CysPro: 0.273 ± 0.307
0.273CysGln: 0.273 ± 0.153
0.546CysArg: 0.546 ± 0.35
1.366CysSer: 1.366 ± 0.43
0.546CysThr: 0.546 ± 0.237
1.366CysVal: 1.366 ± 0.471
0.82CysTrp: 0.82 ± 0.334
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.093AspAla: 1.093 ± 0.332
0.273AspCys: 0.273 ± 0.153
1.366AspAsp: 1.366 ± 0.252
2.459AspGlu: 2.459 ± 0.909
3.005AspPhe: 3.005 ± 1.421
2.186AspGly: 2.186 ± 0.436
2.732AspHis: 2.732 ± 0.896
3.552AspIle: 3.552 ± 0.638
4.098AspLys: 4.098 ± 0.755
4.918AspLeu: 4.918 ± 0.874
1.093AspMet: 1.093 ± 0.348
1.913AspAsn: 1.913 ± 0.858
4.645AspPro: 4.645 ± 1.168
2.186AspGln: 2.186 ± 0.396
1.366AspArg: 1.366 ± 0.787
4.098AspSer: 4.098 ± 1.101
1.366AspThr: 1.366 ± 0.471
1.366AspVal: 1.366 ± 0.448
0.82AspTrp: 0.82 ± 0.286
2.732AspTyr: 2.732 ± 0.813
0.0AspXaa: 0.0 ± 0.0
Glu
3.279GluAla: 3.279 ± 0.546
1.093GluCys: 1.093 ± 0.346
2.459GluAsp: 2.459 ± 1.013
4.918GluGlu: 4.918 ± 1.348
3.005GluPhe: 3.005 ± 0.302
3.552GluGly: 3.552 ± 0.945
1.913GluHis: 1.913 ± 1.059
4.918GluIle: 4.918 ± 0.678
6.284GluLys: 6.284 ± 0.733
7.104GluLeu: 7.104 ± 0.758
0.82GluMet: 0.82 ± 0.404
2.732GluAsn: 2.732 ± 1.418
2.459GluPro: 2.459 ± 0.541
2.186GluGln: 2.186 ± 0.545
1.913GluArg: 1.913 ± 0.438
3.279GluSer: 3.279 ± 1.034
5.191GluThr: 5.191 ± 1.719
3.552GluVal: 3.552 ± 1.105
1.366GluTrp: 1.366 ± 1.349
2.732GluTyr: 2.732 ± 0.369
0.0GluXaa: 0.0 ± 0.0
Phe
3.005PheAla: 3.005 ± 1.11
0.546PheCys: 0.546 ± 0.237
1.913PheAsp: 1.913 ± 0.99
2.459PheGlu: 2.459 ± 0.618
1.366PhePhe: 1.366 ± 0.676
4.098PheGly: 4.098 ± 0.94
0.82PheHis: 0.82 ± 0.255
2.732PheIle: 2.732 ± 0.468
3.279PheLys: 3.279 ± 0.818
4.098PheLeu: 4.098 ± 0.523
0.82PheMet: 0.82 ± 0.448
2.459PheAsn: 2.459 ± 0.526
3.279PhePro: 3.279 ± 0.494
1.639PheGln: 1.639 ± 0.449
3.005PheArg: 3.005 ± 0.615
3.552PheSer: 3.552 ± 0.613
2.186PheThr: 2.186 ± 0.628
2.732PheVal: 2.732 ± 0.338
0.273PheTrp: 0.273 ± 0.153
1.366PheTyr: 1.366 ± 0.448
0.0PheXaa: 0.0 ± 0.0
Gly
1.093GlyAla: 1.093 ± 0.448
0.546GlyCys: 0.546 ± 0.316
3.005GlyAsp: 3.005 ± 0.54
2.732GlyGlu: 2.732 ± 0.369
2.186GlyPhe: 2.186 ± 0.715
2.186GlyGly: 2.186 ± 0.693
1.639GlyHis: 1.639 ± 0.7
5.464GlyIle: 5.464 ± 1.445
3.005GlyLys: 3.005 ± 0.892
6.011GlyLeu: 6.011 ± 1.913
0.546GlyMet: 0.546 ± 0.35
4.372GlyAsn: 4.372 ± 0.425
1.093GlyPro: 1.093 ± 0.399
1.366GlyGln: 1.366 ± 0.764
2.186GlyArg: 2.186 ± 0.686
5.738GlySer: 5.738 ± 0.806
4.098GlyThr: 4.098 ± 2.011
2.732GlyVal: 2.732 ± 0.867
0.546GlyTrp: 0.546 ± 0.305
2.459GlyTyr: 2.459 ± 0.602
0.0GlyXaa: 0.0 ± 0.0
His
0.82HisAla: 0.82 ± 0.458
0.546HisCys: 0.546 ± 0.331
1.366HisAsp: 1.366 ± 0.662
1.366HisGlu: 1.366 ± 0.43
2.459HisPhe: 2.459 ± 0.78
1.639HisGly: 1.639 ± 0.916
0.82HisHis: 0.82 ± 0.397
3.279HisIle: 3.279 ± 0.427
1.913HisLys: 1.913 ± 1.11
3.552HisLeu: 3.552 ± 0.47
0.273HisMet: 0.273 ± 0.331
0.546HisAsn: 0.546 ± 0.237
2.459HisPro: 2.459 ± 0.801
1.639HisGln: 1.639 ± 0.449
1.913HisArg: 1.913 ± 0.321
1.639HisSer: 1.639 ± 0.278
1.093HisThr: 1.093 ± 1.078
1.093HisVal: 1.093 ± 0.311
1.093HisTrp: 1.093 ± 0.332
1.639HisTyr: 1.639 ± 0.278
0.0HisXaa: 0.0 ± 0.0
Ile
2.459IleAla: 2.459 ± 0.726
1.913IleCys: 1.913 ± 0.589
4.098IleAsp: 4.098 ± 1.56
4.372IleGlu: 4.372 ± 0.617
2.459IlePhe: 2.459 ± 1.047
4.918IleGly: 4.918 ± 0.628
3.005IleHis: 3.005 ± 0.993
7.65IleIle: 7.65 ± 1.104
7.923IleLys: 7.923 ± 0.869
9.563IleLeu: 9.563 ± 0.938
1.093IleMet: 1.093 ± 0.355
1.639IleAsn: 1.639 ± 0.629
5.464IlePro: 5.464 ± 1.884
3.825IleGln: 3.825 ± 0.998
4.918IleArg: 4.918 ± 1.665
7.104IleSer: 7.104 ± 1.422
4.098IleThr: 4.098 ± 2.163
4.098IleVal: 4.098 ± 0.741
1.366IleTrp: 1.366 ± 0.471
2.732IleTyr: 2.732 ± 0.547
0.0IleXaa: 0.0 ± 0.0
Lys
3.005LysAla: 3.005 ± 0.508
0.82LysCys: 0.82 ± 0.527
3.552LysAsp: 3.552 ± 0.869
4.372LysGlu: 4.372 ± 2.02
3.005LysPhe: 3.005 ± 1.283
4.098LysGly: 4.098 ± 0.792
1.366LysHis: 1.366 ± 0.764
7.923LysIle: 7.923 ± 1.08
3.825LysLys: 3.825 ± 0.798
6.557LysLeu: 6.557 ± 1.333
1.366LysMet: 1.366 ± 0.252
3.825LysAsn: 3.825 ± 0.814
3.005LysPro: 3.005 ± 1.058
1.639LysGln: 1.639 ± 0.831
2.732LysArg: 2.732 ± 1.343
6.011LysSer: 6.011 ± 1.814
4.372LysThr: 4.372 ± 1.343
4.098LysVal: 4.098 ± 0.712
1.639LysTrp: 1.639 ± 0.449
3.005LysTyr: 3.005 ± 0.819
0.0LysXaa: 0.0 ± 0.0
Leu
3.825LeuAla: 3.825 ± 0.715
1.639LeuCys: 1.639 ± 0.273
4.918LeuAsp: 4.918 ± 1.277
7.104LeuGlu: 7.104 ± 0.86
3.825LeuPhe: 3.825 ± 0.896
4.645LeuGly: 4.645 ± 0.975
2.459LeuHis: 2.459 ± 1.047
10.383LeuIle: 10.383 ± 1.583
8.743LeuLys: 8.743 ± 2.304
11.202LeuLeu: 11.202 ± 1.192
4.098LeuMet: 4.098 ± 0.698
6.831LeuAsn: 6.831 ± 1.455
3.552LeuPro: 3.552 ± 1.608
2.186LeuGln: 2.186 ± 0.26
5.464LeuArg: 5.464 ± 1.744
7.923LeuSer: 7.923 ± 0.887
8.743LeuThr: 8.743 ± 1.583
4.372LeuVal: 4.372 ± 1.855
0.273LeuTrp: 0.273 ± 0.153
4.372LeuTyr: 4.372 ± 1.38
0.0LeuXaa: 0.0 ± 0.0
Met
1.366MetAla: 1.366 ± 0.471
0.546MetCys: 0.546 ± 0.812
1.366MetAsp: 1.366 ± 0.471
2.186MetGlu: 2.186 ± 1.045
0.273MetPhe: 0.273 ± 0.406
0.82MetGly: 0.82 ± 0.458
0.0MetHis: 0.0 ± 0.0
2.186MetIle: 2.186 ± 0.946
2.459MetLys: 2.459 ± 0.733
1.639MetLeu: 1.639 ± 0.572
0.546MetMet: 0.546 ± 0.648
0.82MetAsn: 0.82 ± 0.432
0.0MetPro: 0.0 ± 0.0
0.0MetGln: 0.0 ± 0.0
0.82MetArg: 0.82 ± 0.435
1.639MetSer: 1.639 ± 0.779
1.366MetThr: 1.366 ± 0.676
0.82MetVal: 0.82 ± 0.776
0.273MetTrp: 0.273 ± 0.153
1.093MetTyr: 1.093 ± 0.311
0.0MetXaa: 0.0 ± 0.0
Asn
1.366AsnAla: 1.366 ± 0.583
1.093AsnCys: 1.093 ± 0.611
1.639AsnAsp: 1.639 ± 0.721
2.186AsnGlu: 2.186 ± 0.545
3.552AsnPhe: 3.552 ± 0.886
2.186AsnGly: 2.186 ± 1.586
1.366AsnHis: 1.366 ± 0.504
2.732AsnIle: 2.732 ± 0.369
3.005AsnLys: 3.005 ± 0.666
6.557AsnLeu: 6.557 ± 0.37
1.093AsnMet: 1.093 ± 0.792
3.279AsnAsn: 3.279 ± 0.768
4.918AsnPro: 4.918 ± 0.664
1.639AsnGln: 1.639 ± 0.916
1.639AsnArg: 1.639 ± 1.098
4.918AsnSer: 4.918 ± 1.199
3.005AsnThr: 3.005 ± 0.515
2.186AsnVal: 2.186 ± 0.26
3.279AsnTrp: 3.279 ± 0.637
1.093AsnTyr: 1.093 ± 0.611
0.0AsnXaa: 0.0 ± 0.0
Pro
1.366ProAla: 1.366 ± 0.344
0.0ProCys: 0.0 ± 0.0
2.732ProAsp: 2.732 ± 0.75
4.372ProGlu: 4.372 ± 1.655
2.732ProPhe: 2.732 ± 0.722
1.913ProGly: 1.913 ± 0.971
3.005ProHis: 3.005 ± 0.677
4.098ProIle: 4.098 ± 0.66
2.732ProLys: 2.732 ± 1.201
3.552ProLeu: 3.552 ± 0.949
1.093ProMet: 1.093 ± 0.474
4.098ProAsn: 4.098 ± 1.063
1.913ProPro: 1.913 ± 0.388
0.82ProGln: 0.82 ± 0.711
1.913ProArg: 1.913 ± 0.512
4.918ProSer: 4.918 ± 0.705
1.639ProThr: 1.639 ± 0.273
2.186ProVal: 2.186 ± 0.495
0.273ProTrp: 0.273 ± 0.307
2.186ProTyr: 2.186 ± 1.08
0.0ProXaa: 0.0 ± 0.0
Gln
1.639GlnAla: 1.639 ± 0.948
0.546GlnCys: 0.546 ± 0.237
0.82GlnAsp: 0.82 ± 0.482
1.639GlnGlu: 1.639 ± 0.273
2.459GlnPhe: 2.459 ± 0.591
2.186GlnGly: 2.186 ± 0.484
1.366GlnHis: 1.366 ± 0.43
1.913GlnIle: 1.913 ± 0.397
1.366GlnLys: 1.366 ± 0.592
3.005GlnLeu: 3.005 ± 0.659
1.093GlnMet: 1.093 ± 0.765
2.732GlnAsn: 2.732 ± 0.942
1.093GlnPro: 1.093 ± 0.664
0.82GlnGln: 0.82 ± 0.708
1.913GlnArg: 1.913 ± 0.469
2.732GlnSer: 2.732 ± 0.815
1.639GlnThr: 1.639 ± 0.273
3.552GlnVal: 3.552 ± 0.734
0.273GlnTrp: 0.273 ± 0.393
0.273GlnTyr: 0.273 ± 0.307
0.0GlnXaa: 0.0 ± 0.0
Arg
2.459ArgAla: 2.459 ± 0.68
0.0ArgCys: 0.0 ± 0.0
2.186ArgAsp: 2.186 ± 0.484
3.825ArgGlu: 3.825 ± 0.634
3.279ArgPhe: 3.279 ± 0.877
3.005ArgGly: 3.005 ± 0.615
1.639ArgHis: 1.639 ± 0.849
3.279ArgIle: 3.279 ± 1.417
2.186ArgLys: 2.186 ± 1.088
4.645ArgLeu: 4.645 ± 0.689
0.82ArgMet: 0.82 ± 0.397
2.732ArgAsn: 2.732 ± 0.861
1.366ArgPro: 1.366 ± 0.764
1.366ArgGln: 1.366 ± 0.583
3.552ArgArg: 3.552 ± 1.773
3.005ArgSer: 3.005 ± 0.917
3.279ArgThr: 3.279 ± 0.804
3.005ArgVal: 3.005 ± 1.13
0.546ArgTrp: 0.546 ± 0.305
1.913ArgTyr: 1.913 ± 0.371
0.0ArgXaa: 0.0 ± 0.0
Ser
4.645SerAla: 4.645 ± 1.301
1.093SerCys: 1.093 ± 0.356
4.098SerAsp: 4.098 ± 1.03
5.738SerGlu: 5.738 ± 1.457
2.459SerPhe: 2.459 ± 0.604
3.552SerGly: 3.552 ± 0.624
2.186SerHis: 2.186 ± 0.26
6.831SerIle: 6.831 ± 2.866
7.65SerLys: 7.65 ± 1.124
9.29SerLeu: 9.29 ± 1.73
1.366SerMet: 1.366 ± 0.978
3.279SerAsn: 3.279 ± 1.412
3.825SerPro: 3.825 ± 1.032
3.552SerGln: 3.552 ± 1.922
3.825SerArg: 3.825 ± 0.175
6.557SerSer: 6.557 ± 1.499
4.372SerThr: 4.372 ± 0.917
2.459SerVal: 2.459 ± 0.48
2.186SerTrp: 2.186 ± 0.693
2.459SerTyr: 2.459 ± 0.926
0.0SerXaa: 0.0 ± 0.0
Thr
1.366ThrAla: 1.366 ± 0.676
1.639ThrCys: 1.639 ± 0.509
2.459ThrAsp: 2.459 ± 0.853
3.005ThrGlu: 3.005 ± 0.607
2.459ThrPhe: 2.459 ± 0.618
4.098ThrGly: 4.098 ± 0.856
2.459ThrHis: 2.459 ± 0.853
5.191ThrIle: 5.191 ± 2.811
4.098ThrLys: 4.098 ± 1.127
6.831ThrLeu: 6.831 ± 1.67
1.366ThrMet: 1.366 ± 0.374
3.005ThrAsn: 3.005 ± 0.49
1.913ThrPro: 1.913 ± 0.6
0.82ThrGln: 0.82 ± 0.776
3.552ThrArg: 3.552 ± 1.304
3.279ThrSer: 3.279 ± 1.063
3.552ThrThr: 3.552 ± 0.723
2.732ThrVal: 2.732 ± 1.413
0.546ThrTrp: 0.546 ± 0.305
3.279ThrTyr: 3.279 ± 0.783
0.0ThrXaa: 0.0 ± 0.0
Val
1.093ValAla: 1.093 ± 0.332
0.82ValCys: 0.82 ± 0.527
2.732ValAsp: 2.732 ± 0.535
3.279ValGlu: 3.279 ± 0.572
1.639ValPhe: 1.639 ± 0.506
2.459ValGly: 2.459 ± 0.393
1.639ValHis: 1.639 ± 0.851
3.279ValIle: 3.279 ± 1.039
2.186ValLys: 2.186 ± 0.472
5.464ValLeu: 5.464 ± 1.045
1.093ValMet: 1.093 ± 0.477
3.552ValAsn: 3.552 ± 1.018
3.279ValPro: 3.279 ± 0.952
1.639ValGln: 1.639 ± 0.572
2.459ValArg: 2.459 ± 0.971
5.191ValSer: 5.191 ± 1.488
3.005ValThr: 3.005 ± 0.635
1.913ValVal: 1.913 ± 0.478
0.0ValTrp: 0.0 ± 0.0
1.639ValTyr: 1.639 ± 0.397
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.546TrpAsp: 0.546 ± 0.747
1.913TrpGlu: 1.913 ± 0.752
1.366TrpPhe: 1.366 ± 0.583
1.913TrpGly: 1.913 ± 0.764
0.273TrpHis: 0.273 ± 0.153
1.639TrpIle: 1.639 ± 0.506
1.093TrpLys: 1.093 ± 0.474
0.82TrpLeu: 0.82 ± 0.527
0.273TrpMet: 0.273 ± 0.153
0.82TrpAsn: 0.82 ± 0.458
0.82TrpPro: 0.82 ± 0.458
0.273TrpGln: 0.273 ± 0.153
0.82TrpArg: 0.82 ± 0.458
1.366TrpSer: 1.366 ± 0.337
1.093TrpThr: 1.093 ± 0.885
1.093TrpVal: 1.093 ± 1.262
0.546TrpTrp: 0.546 ± 0.237
0.273TrpTyr: 0.273 ± 0.307
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.366TyrAla: 1.366 ± 0.43
0.546TyrCys: 0.546 ± 0.305
1.913TyrAsp: 1.913 ± 0.397
3.005TyrGlu: 3.005 ± 0.696
2.186TyrPhe: 2.186 ± 0.693
1.093TyrGly: 1.093 ± 0.611
1.366TyrHis: 1.366 ± 0.375
2.459TyrIle: 2.459 ± 0.366
1.366TyrLys: 1.366 ± 0.252
6.011TyrLeu: 6.011 ± 0.775
0.82TyrMet: 0.82 ± 0.286
1.639TyrAsn: 1.639 ± 0.449
1.913TyrPro: 1.913 ± 0.604
2.186TyrGln: 2.186 ± 0.653
2.186TyrArg: 2.186 ± 0.601
3.279TyrSer: 3.279 ± 0.722
1.913TyrThr: 1.913 ± 0.586
0.546TyrVal: 0.546 ± 0.305
0.546TyrTrp: 0.546 ± 0.541
1.093TyrTyr: 1.093 ± 0.611
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3661 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski