Amino acid dipepetide frequency for Wenling hepe-like virus 2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.679AlaAla: 7.679 ± 1.028
0.512AlaCys: 0.512 ± 0.315
3.071AlaAsp: 3.071 ± 1.139
2.815AlaGlu: 2.815 ± 0.51
4.607AlaPhe: 4.607 ± 0.71
4.607AlaGly: 4.607 ± 1.147
0.768AlaHis: 0.768 ± 0.472
4.351AlaIle: 4.351 ± 0.358
3.071AlaLys: 3.071 ± 1.208
4.863AlaLeu: 4.863 ± 2.284
1.28AlaMet: 1.28 ± 0.471
5.375AlaAsn: 5.375 ± 1.048
3.071AlaPro: 3.071 ± 0.791
3.583AlaGln: 3.583 ± 1.173
3.327AlaArg: 3.327 ± 0.577
5.887AlaSer: 5.887 ± 1.223
4.095AlaThr: 4.095 ± 0.38
3.839AlaVal: 3.839 ± 0.347
0.256AlaTrp: 0.256 ± 0.272
1.792AlaTyr: 1.792 ± 0.77
0.0AlaXaa: 0.0 ± 0.0
Cys
1.28CysAla: 1.28 ± 0.25
0.0CysCys: 0.0 ± 0.0
1.792CysAsp: 1.792 ± 0.444
0.512CysGlu: 0.512 ± 0.315
1.28CysPhe: 1.28 ± 0.66
0.256CysGly: 0.256 ± 0.157
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
1.024CysLys: 1.024 ± 0.445
0.768CysLeu: 0.768 ± 0.472
0.0CysMet: 0.0 ± 0.0
0.512CysAsn: 0.512 ± 0.315
0.256CysPro: 0.256 ± 0.157
0.512CysGln: 0.512 ± 0.192
1.024CysArg: 1.024 ± 0.333
2.815CysSer: 2.815 ± 0.397
1.28CysThr: 1.28 ± 0.764
1.024CysVal: 1.024 ± 0.778
0.0CysTrp: 0.0 ± 0.0
1.024CysTyr: 1.024 ± 0.333
0.0CysXaa: 0.0 ± 0.0
Asp
2.304AspAla: 2.304 ± 0.808
1.28AspCys: 1.28 ± 0.572
4.863AspAsp: 4.863 ± 0.552
5.119AspGlu: 5.119 ± 0.604
3.327AspPhe: 3.327 ± 2.121
4.607AspGly: 4.607 ± 0.678
1.024AspHis: 1.024 ± 0.629
3.583AspIle: 3.583 ± 0.449
2.815AspLys: 2.815 ± 1.417
5.119AspLeu: 5.119 ± 1.126
0.512AspMet: 0.512 ± 0.402
2.304AspAsn: 2.304 ± 0.573
5.119AspPro: 5.119 ± 0.748
1.792AspGln: 1.792 ± 0.177
1.792AspArg: 1.792 ± 0.444
6.143AspSer: 6.143 ± 0.975
2.304AspThr: 2.304 ± 0.271
2.815AspVal: 2.815 ± 0.39
0.512AspTrp: 0.512 ± 0.315
2.304AspTyr: 2.304 ± 0.65
0.0AspXaa: 0.0 ± 0.0
Glu
4.351GluAla: 4.351 ± 0.849
0.768GluCys: 0.768 ± 0.472
2.56GluAsp: 2.56 ± 0.519
4.863GluGlu: 4.863 ± 0.552
3.583GluPhe: 3.583 ± 0.526
1.792GluGly: 1.792 ± 0.868
1.792GluHis: 1.792 ± 0.564
1.792GluIle: 1.792 ± 0.815
1.536GluLys: 1.536 ± 1.067
4.095GluLeu: 4.095 ± 0.44
1.28GluMet: 1.28 ± 0.383
1.536GluAsn: 1.536 ± 0.226
3.583GluPro: 3.583 ± 1.032
1.792GluGln: 1.792 ± 0.543
3.071GluArg: 3.071 ± 1.044
4.863GluSer: 4.863 ± 1.609
5.119GluThr: 5.119 ± 1.428
3.839GluVal: 3.839 ± 0.757
0.0GluTrp: 0.0 ± 0.0
3.327GluTyr: 3.327 ± 2.045
0.0GluXaa: 0.0 ± 0.0
Phe
3.839PheAla: 3.839 ± 1.914
0.768PheCys: 0.768 ± 0.472
2.56PheAsp: 2.56 ± 0.423
4.351PheGlu: 4.351 ± 1.005
1.792PhePhe: 1.792 ± 0.177
2.304PheGly: 2.304 ± 0.487
2.048PheHis: 2.048 ± 0.471
2.048PheIle: 2.048 ± 0.325
3.583PheLys: 3.583 ± 0.81
2.56PheLeu: 2.56 ± 0.657
0.768PheMet: 0.768 ± 0.472
2.048PheAsn: 2.048 ± 0.65
3.071PhePro: 3.071 ± 0.441
1.28PheGln: 1.28 ± 0.707
3.327PheArg: 3.327 ± 1.152
2.048PheSer: 2.048 ± 0.961
4.095PheThr: 4.095 ± 1.349
3.839PheVal: 3.839 ± 0.347
0.768PheTrp: 0.768 ± 0.472
1.28PheTyr: 1.28 ± 0.383
0.0PheXaa: 0.0 ± 0.0
Gly
3.327GlyAla: 3.327 ± 0.694
1.024GlyCys: 1.024 ± 0.384
4.095GlyAsp: 4.095 ± 0.291
2.048GlyGlu: 2.048 ± 0.767
3.327GlyPhe: 3.327 ± 0.994
2.304GlyGly: 2.304 ± 1.004
1.28GlyHis: 1.28 ± 1.159
2.56GlyIle: 2.56 ± 0.302
3.583GlyLys: 3.583 ± 1.27
2.304GlyLeu: 2.304 ± 0.664
1.024GlyMet: 1.024 ± 0.384
1.536GlyAsn: 1.536 ± 0.818
2.048GlyPro: 2.048 ± 0.767
2.304GlyGln: 2.304 ± 1.343
2.815GlyArg: 2.815 ± 0.863
3.327GlySer: 3.327 ± 0.445
6.143GlyThr: 6.143 ± 0.794
6.143GlyVal: 6.143 ± 2.001
1.024GlyTrp: 1.024 ± 0.384
3.839GlyTyr: 3.839 ± 0.802
0.0GlyXaa: 0.0 ± 0.0
His
1.536HisAla: 1.536 ± 0.324
0.0HisCys: 0.0 ± 0.0
1.536HisAsp: 1.536 ± 0.619
1.792HisGlu: 1.792 ± 0.789
1.024HisPhe: 1.024 ± 0.445
2.815HisGly: 2.815 ± 0.436
0.512HisHis: 0.512 ± 0.544
1.024HisIle: 1.024 ± 0.384
1.024HisLys: 1.024 ± 0.333
2.048HisLeu: 2.048 ± 0.657
0.256HisMet: 0.256 ± 0.157
1.536HisAsn: 1.536 ± 0.615
1.024HisPro: 1.024 ± 0.629
1.28HisGln: 1.28 ± 1.022
1.792HisArg: 1.792 ± 0.815
1.792HisSer: 1.792 ± 0.177
1.792HisThr: 1.792 ± 0.895
1.536HisVal: 1.536 ± 0.693
0.0HisTrp: 0.0 ± 0.0
2.048HisTyr: 2.048 ± 0.956
0.0HisXaa: 0.0 ± 0.0
Ile
2.815IleAla: 2.815 ± 0.436
1.28IleCys: 1.28 ± 0.471
1.792IleAsp: 1.792 ± 0.543
2.815IleGlu: 2.815 ± 0.436
2.304IlePhe: 2.304 ± 0.784
2.56IleGly: 2.56 ± 0.5
1.792IleHis: 1.792 ± 0.177
1.792IleIle: 1.792 ± 0.586
4.095IleLys: 4.095 ± 1.314
4.607IleLeu: 4.607 ± 0.184
1.28IleMet: 1.28 ± 0.801
3.327IleAsn: 3.327 ± 0.976
2.56IlePro: 2.56 ± 0.302
2.304IleGln: 2.304 ± 0.664
3.327IleArg: 3.327 ± 0.373
4.863IleSer: 4.863 ± 0.539
3.583IleThr: 3.583 ± 0.883
2.815IleVal: 2.815 ± 1.197
0.256IleTrp: 0.256 ± 0.157
1.28IleTyr: 1.28 ± 0.981
0.0IleXaa: 0.0 ± 0.0
Lys
4.607LysAla: 4.607 ± 0.667
0.768LysCys: 0.768 ± 0.513
3.583LysAsp: 3.583 ± 1.084
3.071LysGlu: 3.071 ± 1.044
3.327LysPhe: 3.327 ± 0.336
3.327LysGly: 3.327 ± 0.634
1.28LysHis: 1.28 ± 0.383
5.119LysIle: 5.119 ± 1.732
3.839LysLys: 3.839 ± 1.627
3.583LysLeu: 3.583 ± 0.449
1.792LysMet: 1.792 ± 0.77
2.048LysAsn: 2.048 ± 0.665
2.815LysPro: 2.815 ± 0.669
1.536LysGln: 1.536 ± 0.226
3.071LysArg: 3.071 ± 1.348
3.071LysSer: 3.071 ± 1.044
3.583LysThr: 3.583 ± 0.531
4.351LysVal: 4.351 ± 0.98
0.768LysTrp: 0.768 ± 0.472
1.792LysTyr: 1.792 ± 0.993
0.0LysXaa: 0.0 ± 0.0
Leu
3.583LeuAla: 3.583 ± 0.594
1.28LeuCys: 1.28 ± 0.572
4.351LeuAsp: 4.351 ± 0.998
4.351LeuGlu: 4.351 ± 2.057
2.048LeuPhe: 2.048 ± 0.923
2.815LeuGly: 2.815 ± 0.721
2.56LeuHis: 2.56 ± 1.457
2.304LeuIle: 2.304 ± 0.487
5.631LeuLys: 5.631 ± 1.448
3.839LeuLeu: 3.839 ± 1.496
2.048LeuMet: 2.048 ± 0.582
5.119LeuAsn: 5.119 ± 0.7
4.351LeuPro: 4.351 ± 0.864
2.56LeuGln: 2.56 ± 0.957
4.095LeuArg: 4.095 ± 1.847
7.934LeuSer: 7.934 ± 0.984
4.863LeuThr: 4.863 ± 1.809
4.863LeuVal: 4.863 ± 1.957
0.512LeuTrp: 0.512 ± 0.402
1.792LeuTyr: 1.792 ± 0.543
0.0LeuXaa: 0.0 ± 0.0
Met
1.792MetAla: 1.792 ± 1.099
0.512MetCys: 0.512 ± 0.544
1.536MetAsp: 1.536 ± 0.324
0.768MetGlu: 0.768 ± 0.221
0.512MetPhe: 0.512 ± 0.192
1.024MetGly: 1.024 ± 0.384
0.512MetHis: 0.512 ± 0.315
1.536MetIle: 1.536 ± 0.226
1.28MetLys: 1.28 ± 0.25
1.792MetLeu: 1.792 ± 0.868
0.256MetMet: 0.256 ± 0.454
1.536MetAsn: 1.536 ± 0.615
0.512MetPro: 0.512 ± 0.315
0.256MetGln: 0.256 ± 0.157
0.256MetArg: 0.256 ± 0.157
1.536MetSer: 1.536 ± 0.324
2.304MetThr: 2.304 ± 0.728
3.071MetVal: 3.071 ± 0.945
0.768MetTrp: 0.768 ± 0.355
1.024MetTyr: 1.024 ± 0.488
0.0MetXaa: 0.0 ± 0.0
Asn
4.095AsnAla: 4.095 ± 0.949
1.536AsnCys: 1.536 ± 0.324
2.048AsnAsp: 2.048 ± 1.259
2.048AsnGlu: 2.048 ± 0.582
3.839AsnPhe: 3.839 ± 0.702
1.536AsnGly: 1.536 ± 0.652
2.048AsnHis: 2.048 ± 1.136
3.583AsnIle: 3.583 ± 0.526
3.583AsnLys: 3.583 ± 0.526
4.095AsnLeu: 4.095 ± 1.054
0.512AsnMet: 0.512 ± 0.397
4.351AsnAsn: 4.351 ± 0.892
2.304AsnPro: 2.304 ± 0.91
2.048AsnGln: 2.048 ± 0.582
1.792AsnArg: 1.792 ± 0.632
5.375AsnSer: 5.375 ± 0.676
3.327AsnThr: 3.327 ± 0.976
3.839AsnVal: 3.839 ± 0.711
0.0AsnTrp: 0.0 ± 0.0
2.048AsnTyr: 2.048 ± 1.259
0.0AsnXaa: 0.0 ± 0.0
Pro
2.304ProAla: 2.304 ± 0.271
0.0ProCys: 0.0 ± 0.0
2.815ProAsp: 2.815 ± 0.557
3.583ProGlu: 3.583 ± 0.848
3.071ProPhe: 3.071 ± 0.724
2.815ProGly: 2.815 ± 1.194
0.768ProHis: 0.768 ± 0.472
2.815ProIle: 2.815 ± 0.397
3.583ProLys: 3.583 ± 0.81
4.607ProLeu: 4.607 ± 0.71
1.28ProMet: 1.28 ± 0.572
2.304ProAsn: 2.304 ± 1.036
3.839ProPro: 3.839 ± 0.4
1.536ProGln: 1.536 ± 0.443
3.071ProArg: 3.071 ± 0.864
3.839ProSer: 3.839 ± 1.15
5.119ProThr: 5.119 ± 0.903
4.095ProVal: 4.095 ± 0.38
1.28ProTrp: 1.28 ± 1.361
1.792ProTyr: 1.792 ± 0.433
0.0ProXaa: 0.0 ± 0.0
Gln
3.327GlnAla: 3.327 ± 1.083
0.512GlnCys: 0.512 ± 0.192
3.327GlnAsp: 3.327 ± 0.336
1.536GlnGlu: 1.536 ± 0.615
1.28GlnPhe: 1.28 ± 0.427
1.792GlnGly: 1.792 ± 0.543
1.536GlnHis: 1.536 ± 0.226
1.28GlnIle: 1.28 ± 0.981
1.792GlnLys: 1.792 ± 0.564
3.327GlnLeu: 3.327 ± 1.129
1.28GlnMet: 1.28 ± 0.594
1.792GlnAsn: 1.792 ± 0.993
2.304GlnPro: 2.304 ± 0.664
2.56GlnGln: 2.56 ± 0.302
1.536GlnArg: 1.536 ± 0.226
2.048GlnSer: 2.048 ± 0.687
2.56GlnThr: 2.56 ± 0.766
2.56GlnVal: 2.56 ± 0.5
0.768GlnTrp: 0.768 ± 0.221
1.28GlnTyr: 1.28 ± 0.25
0.0GlnXaa: 0.0 ± 0.0
Arg
2.304ArgAla: 2.304 ± 0.271
1.536ArgCys: 1.536 ± 1.167
2.56ArgAsp: 2.56 ± 0.786
3.327ArgGlu: 3.327 ± 0.689
1.792ArgPhe: 1.792 ± 1.101
4.095ArgGly: 4.095 ± 0.789
1.536ArgHis: 1.536 ± 0.975
3.327ArgIle: 3.327 ± 0.558
2.048ArgLys: 2.048 ± 0.247
4.351ArgLeu: 4.351 ± 0.998
1.536ArgMet: 1.536 ± 0.268
2.56ArgAsn: 2.56 ± 0.302
2.048ArgPro: 2.048 ± 0.665
1.792ArgGln: 1.792 ± 0.547
3.071ArgArg: 3.071 ± 2.052
2.304ArgSer: 2.304 ± 1.146
4.351ArgThr: 4.351 ± 0.628
4.351ArgVal: 4.351 ± 1.168
0.0ArgTrp: 0.0 ± 0.0
2.304ArgTyr: 2.304 ± 0.857
0.0ArgXaa: 0.0 ± 0.0
Ser
5.119SerAla: 5.119 ± 1.318
1.28SerCys: 1.28 ± 0.787
4.607SerAsp: 4.607 ± 1.531
3.071SerGlu: 3.071 ± 1.151
2.304SerPhe: 2.304 ± 1.692
3.839SerGly: 3.839 ± 0.347
1.536SerHis: 1.536 ± 0.324
3.583SerIle: 3.583 ± 0.844
3.839SerLys: 3.839 ± 0.813
6.655SerLeu: 6.655 ± 2.152
3.071SerMet: 3.071 ± 2.144
4.095SerAsn: 4.095 ± 0.416
2.56SerPro: 2.56 ± 0.766
3.839SerGln: 3.839 ± 0.965
3.839SerArg: 3.839 ± 0.965
3.583SerSer: 3.583 ± 0.354
6.911SerThr: 6.911 ± 0.976
6.655SerVal: 6.655 ± 0.994
1.024SerTrp: 1.024 ± 0.445
1.28SerTyr: 1.28 ± 0.471
0.0SerXaa: 0.0 ± 0.0
Thr
6.911ThrAla: 6.911 ± 2.09
0.768ThrCys: 0.768 ± 0.541
3.327ThrAsp: 3.327 ± 1.573
4.351ThrGlu: 4.351 ± 0.699
4.351ThrPhe: 4.351 ± 1.317
5.119ThrGly: 5.119 ± 1.689
1.28ThrHis: 1.28 ± 0.383
4.863ThrIle: 4.863 ± 1.389
4.095ThrLys: 4.095 ± 1.494
4.863ThrLeu: 4.863 ± 0.901
1.28ThrMet: 1.28 ± 0.383
3.583ThrAsn: 3.583 ± 1.084
6.655ThrPro: 6.655 ± 1.098
2.56ThrGln: 2.56 ± 0.775
5.119ThrArg: 5.119 ± 0.698
3.327ThrSer: 3.327 ± 0.968
5.631ThrThr: 5.631 ± 1.427
5.631ThrVal: 5.631 ± 1.867
0.256ThrTrp: 0.256 ± 0.454
3.071ThrTyr: 3.071 ± 0.536
0.0ThrXaa: 0.0 ± 0.0
Val
4.607ValAla: 4.607 ± 0.184
1.024ValCys: 1.024 ± 0.333
5.631ValAsp: 5.631 ± 0.807
3.327ValGlu: 3.327 ± 0.978
2.815ValPhe: 2.815 ± 0.291
4.351ValGly: 4.351 ± 1.973
2.56ValHis: 2.56 ± 0.766
2.304ValIle: 2.304 ± 0.271
4.863ValLys: 4.863 ± 0.297
4.095ValLeu: 4.095 ± 0.291
2.048ValMet: 2.048 ± 0.665
5.119ValAsn: 5.119 ± 1.038
4.095ValPro: 4.095 ± 1.738
2.815ValGln: 2.815 ± 0.669
3.327ValArg: 3.327 ± 0.761
5.119ValSer: 5.119 ± 1.997
7.167ValThr: 7.167 ± 1.347
4.607ValVal: 4.607 ± 1.219
0.256ValTrp: 0.256 ± 0.157
2.815ValTyr: 2.815 ± 0.48
0.0ValXaa: 0.0 ± 0.0
Trp
0.512TrpAla: 0.512 ± 0.544
0.256TrpCys: 0.256 ± 0.157
0.0TrpAsp: 0.0 ± 0.0
0.256TrpGlu: 0.256 ± 0.454
0.512TrpPhe: 0.512 ± 0.192
1.024TrpGly: 1.024 ± 0.445
0.256TrpHis: 0.256 ± 0.157
0.256TrpIle: 0.256 ± 0.272
0.0TrpLys: 0.0 ± 0.0
1.024TrpLeu: 1.024 ± 0.629
0.256TrpMet: 0.256 ± 0.157
1.536TrpAsn: 1.536 ± 0.443
0.256TrpPro: 0.256 ± 0.454
0.0TrpGln: 0.0 ± 0.0
0.256TrpArg: 0.256 ± 0.454
0.512TrpSer: 0.512 ± 0.192
0.256TrpThr: 0.256 ± 0.157
1.024TrpVal: 1.024 ± 0.384
0.0TrpTrp: 0.0 ± 0.0
0.768TrpTyr: 0.768 ± 0.221
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.56TyrAla: 2.56 ± 0.436
0.256TyrCys: 0.256 ± 0.157
3.583TyrAsp: 3.583 ± 0.887
1.536TyrGlu: 1.536 ± 0.443
1.28TyrPhe: 1.28 ± 0.427
3.071TyrGly: 3.071 ± 0.625
1.28TyrHis: 1.28 ± 0.471
3.327TyrIle: 3.327 ± 0.663
2.048TyrLys: 2.048 ± 0.89
2.304TyrLeu: 2.304 ± 0.664
0.768TyrMet: 0.768 ± 0.221
2.048TyrAsn: 2.048 ± 0.596
2.048TyrPro: 2.048 ± 0.598
2.048TyrGln: 2.048 ± 0.596
1.28TyrArg: 1.28 ± 0.66
2.304TyrSer: 2.304 ± 1.078
2.56TyrThr: 2.56 ± 0.775
2.048TyrVal: 2.048 ± 0.767
0.512TyrTrp: 0.512 ± 0.315
0.768TyrTyr: 0.768 ± 0.472
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3908 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski