Amino acid dipeptide frequency for Wenling hepe-like virus 3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
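As an illustration of how such a frequency could be computed, here is a minimal Python sketch. It is not the Proteome-pI implementation: the function name `dipeptide_permille` is hypothetical, and it assumes that dipeptides are counted as overlapping windows of two residues, that pairs never span two proteins, and that any residue outside the 20 standard ones is mapped to X (Xaa).

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumptions (not taken from the source page): overlapping windows,
    no pairs across protein boundaries, non-standard residues become X.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Slide a window of length 2 along the sequence.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not real viral proteins.
freqs = dipeptide_permille(["MAAL", "AAB"])
for pair, value in sorted(freqs.items()):
    print(pair, value)
```

In the toy input, "B" is not a standard residue, so the second sequence contributes the pairs AA and AX.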

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
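The comparison against the uniform expectation can be sketched as follows. This is only an illustration: the helper names are hypothetical, and the exact rule the page uses to pick red or blue is not stated in the text, so a plain greater-than / less-than comparison is assumed here.

```python
def uniform_expectation_permille(alphabet_size=20):
    """Expected per-mille frequency of each dipeptide if all were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(freq_permille, expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if freq_permille > expected_permille:
        return "overrepresented"   # shown in red on the page
    if freq_permille < expected_permille:
        return "underrepresented"  # shown in blue on the page
    return "as expected"


expected = uniform_expectation_permille(20)  # 1000 / 400
# Two values taken from the table: AlaAla and CysCys.
print(expected, classify(5.917, expected), classify(0.0, expected))
```

With 21 symbols (including Xaa) the same function gives 1000 / 441, i.e. slightly below 2.3‰.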

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
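For readers who prefer fractions or percentages, the unit conversion is simple arithmetic; a short sketch with hypothetical helper names:

```python
def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per-mille value to percent (divide by 10)."""
    return value_permille / 10.0


# The AlaAla value from the table, 5.917 per mille, as fraction and percent.
print(permille_to_fraction(5.917), permille_to_percent(5.917))
```

In these units the uniform expectation of 0.25% corresponds to 2.5‰.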

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± error). Rows give the first amino acid of the dipeptide, columns the second.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 5.917 ± 0.567 | 1.614 ± 0.413 | 3.228 ± 0.82 | 5.379 ± 0.506 | 1.614 ± 1.007 | 4.034 ± 0.209 | 1.883 ± 0.122 | 4.034 ± 0.777 | 2.152 ± 0.615 | 5.917 ± 1.243 | 1.614 ± 0.576 | 3.765 ± 0.636 | 4.572 ± 1.086 | 2.959 ± 0.882 | 5.917 ± 0.554 | 3.765 ± 0.607 | 4.572 ± 1.488 | 4.303 ± 1.015 | 0.269 ± 0.149 | 2.421 ± 0.923 | 0.0 ± 0.0
Cys | 1.614 ± 0.413 | 0.0 ± 0.0 | 1.345 ± 0.71 | 2.152 ± 1.189 | 0.538 ± 0.186 | 0.807 ± 0.446 | 0.269 ± 0.149 | 1.076 ± 0.308 | 1.076 ± 0.559 | 1.076 ± 0.559 | 1.076 ± 0.594 | 1.614 ± 1.158 | 1.076 ± 0.594 | 0.269 ± 0.149 | 0.538 ± 0.186 | 2.152 ± 0.863 | 1.883 ± 1.04 | 1.883 ± 0.424 | 0.0 ± 0.0 | 0.538 ± 0.297 | 0.0 ± 0.0
Asp | 3.765 ± 1.246 | 0.807 ± 0.369 | 5.11 ± 0.751 | 6.186 ± 1.668 | 4.034 ± 0.452 | 2.421 ± 0.241 | 1.345 ± 0.743 | 4.034 ± 1.204 | 2.959 ± 0.461 | 4.841 ± 0.378 | 2.421 ± 0.741 | 3.228 ± 0.902 | 2.421 ± 0.975 | 1.345 ± 0.71 | 2.421 ± 0.894 | 3.497 ± 0.345 | 3.765 ± 0.922 | 5.917 ± 0.495 | 0.538 ± 0.297 | 3.497 ± 1.437 | 0.0 ± 0.0
Glu | 4.303 ± 0.268 | 1.076 ± 0.26 | 3.765 ± 0.655 | 5.11 ± 0.997 | 4.034 ± 0.579 | 3.228 ± 0.397 | 1.345 ± 0.493 | 5.11 ± 1.393 | 3.765 ± 1.386 | 7.262 ± 0.868 | 1.883 ± 0.797 | 1.883 ± 0.503 | 3.228 ± 0.527 | 2.421 ± 0.741 | 4.034 ± 0.504 | 4.572 ± 0.928 | 3.765 ± 0.407 | 3.765 ± 0.667 | 1.076 ± 0.594 | 5.917 ± 1.076 | 0.0 ± 0.0
Phe | 2.152 ± 1.1 | 1.345 ± 0.493 | 4.034 ± 0.209 | 3.228 ± 0.991 | 1.345 ± 0.202 | 2.421 ± 0.256 | 1.076 ± 0.26 | 3.228 ± 0.801 | 2.69 ± 0.49 | 1.345 ± 0.202 | 0.538 ± 0.386 | 2.421 ± 0.483 | 1.345 ± 0.375 | 1.614 ± 0.413 | 3.765 ± 1.269 | 2.69 ± 0.646 | 3.228 ± 0.66 | 4.034 ± 0.839 | 0.0 ± 0.0 | 1.345 ± 0.71 | 0.0 ± 0.0
Gly | 2.152 ± 0.134 | 1.883 ± 1.04 | 2.959 ± 0.541 | 5.917 ± 1.076 | 3.228 ± 0.381 | 2.69 ± 0.217 | 0.807 ± 0.446 | 2.69 ± 2.343 | 4.572 ± 1.086 | 4.303 ± 1.184 | 1.614 ± 0.236 | 2.959 ± 0.35 | 1.883 ± 0.581 | 1.076 ± 0.26 | 3.765 ± 0.667 | 2.69 ± 0.728 | 2.152 ± 1.1 | 5.648 ± 0.287 | 0.269 ± 0.266 | 2.421 ± 0.73 | 0.0 ± 0.0
His | 2.152 ± 0.134 | 0.0 ± 0.0 | 2.152 ± 0.475 | 1.345 ± 0.202 | 0.538 ± 0.297 | 2.69 ± 1.155 | 0.0 ± 0.0 | 1.076 ± 0.594 | 1.076 ± 0.308 | 1.345 ± 0.743 | 0.0 ± 0.0 | 0.538 ± 0.297 | 0.807 ± 0.435 | 0.807 ± 0.206 | 0.807 ± 0.206 | 1.076 ± 0.41 | 1.345 ± 0.437 | 2.152 ± 1.189 | 0.269 ± 0.149 | 1.345 ± 0.437 | 0.0 ± 0.0
Ile | 4.572 ± 0.385 | 0.538 ± 0.54 | 4.572 ± 2.245 | 5.648 ± 0.287 | 2.421 ± 0.588 | 4.034 ± 0.452 | 1.345 ± 0.202 | 4.303 ± 1.112 | 3.497 ± 0.594 | 2.69 ± 0.473 | 1.614 ± 0.674 | 2.421 ± 0.241 | 1.076 ± 0.372 | 2.421 ± 1.51 | 2.69 ± 0.874 | 3.497 ± 1.591 | 2.421 ± 0.917 | 5.379 ± 1.202 | 0.269 ± 0.149 | 2.69 ± 0.666 | 0.0 ± 0.0
Lys | 4.303 ± 1.278 | 1.076 ± 0.594 | 3.228 ± 1.151 | 5.379 ± 1.337 | 2.959 ± 1.013 | 4.034 ± 0.564 | 1.614 ± 0.576 | 7.262 ± 1.194 | 4.034 ± 0.964 | 5.379 ± 0.506 | 1.614 ± 0.426 | 2.69 ± 0.874 | 1.883 ± 0.503 | 3.228 ± 0.397 | 1.614 ± 0.241 | 4.303 ± 1.071 | 1.614 ± 0.603 | 3.228 ± 1.151 | 0.0 ± 0.0 | 3.228 ± 0.66 | 0.0 ± 0.0
Leu | 6.993 ± 0.69 | 1.614 ± 0.891 | 6.455 ± 0.402 | 4.034 ± 1.771 | 1.345 ± 0.437 | 3.497 ± 1.729 | 1.345 ± 0.614 | 2.959 ± 1.55 | 5.648 ± 1.071 | 4.303 ± 1.814 | 1.076 ± 0.52 | 3.497 ± 0.528 | 4.303 ± 0.333 | 4.034 ± 1.032 | 4.572 ± 0.285 | 5.648 ± 0.287 | 4.841 ± 0.463 | 4.841 ± 0.785 | 1.614 ± 0.236 | 1.883 ± 0.718 | 0.0 ± 0.0
Met | 1.883 ± 0.461 | 0.269 ± 0.149 | 2.152 ± 0.829 | 0.807 ± 0.446 | 2.69 ± 0.95 | 0.538 ± 0.297 | 1.076 ± 0.594 | 1.076 ± 0.41 | 1.345 ± 0.364 | 2.69 ± 0.646 | 0.807 ± 0.666 | 1.076 ± 0.308 | 0.538 ± 0.186 | 1.076 ± 0.372 | 1.345 ± 0.202 | 1.883 ± 0.503 | 1.345 ± 0.202 | 0.538 ± 0.297 | 0.269 ± 0.149 | 1.883 ± 0.122 | 0.0 ± 0.0
Asn | 2.152 ± 0.959 | 1.883 ± 0.525 | 3.765 ± 1.08 | 4.303 ± 0.684 | 3.497 ± 0.345 | 2.959 ± 1.274 | 1.345 ± 0.743 | 3.765 ± 0.56 | 3.228 ± 0.381 | 5.379 ± 1.14 | 1.614 ± 0.241 | 2.421 ± 0.588 | 1.345 ± 0.375 | 0.807 ± 0.369 | 2.152 ± 0.863 | 2.421 ± 1.334 | 1.345 ± 0.364 | 4.303 ± 0.953 | 0.0 ± 0.0 | 1.883 ± 0.503 | 0.0 ± 0.0
Pro | 3.228 ± 1.325 | 0.538 ± 0.297 | 1.883 ± 0.503 | 1.883 ± 0.122 | 2.69 ± 0.396 | 4.034 ± 1.474 | 1.345 ± 0.437 | 2.69 ± 0.95 | 2.421 ± 0.741 | 4.303 ± 1.749 | 0.538 ± 0.186 | 1.345 ± 0.364 | 1.614 ± 0.241 | 1.076 ± 0.308 | 1.614 ± 0.469 | 2.421 ± 0.741 | 2.421 ± 0.588 | 2.69 ± 1.238 | 0.269 ± 0.454 | 2.421 ± 1.009 | 0.0 ± 0.0
Gln | 0.538 ± 0.504 | 1.345 ± 0.202 | 1.345 ± 0.614 | 2.421 ± 0.61 | 1.076 ± 0.52 | 2.69 ± 0.217 | 1.076 ± 0.308 | 0.269 ± 0.149 | 0.538 ± 0.186 | 2.421 ± 0.969 | 0.807 ± 0.206 | 2.421 ± 1.009 | 1.614 ± 0.241 | 1.076 ± 0.26 | 2.69 ± 0.785 | 3.228 ± 0.801 | 1.883 ± 0.54 | 2.959 ± 0.35 | 0.269 ± 0.266 | 1.345 ± 1.161 | 0.0 ± 0.0
Arg | 5.379 ± 1.538 | 0.538 ± 0.186 | 2.959 ± 0.609 | 2.421 ± 0.994 | 1.345 ± 0.873 | 3.228 ± 0.66 | 0.807 ± 0.446 | 2.421 ± 0.594 | 4.572 ± 1.871 | 2.959 ± 0.35 | 1.614 ± 0.241 | 2.69 ± 1.311 | 1.614 ± 0.241 | 1.076 ± 0.809 | 4.303 ± 0.333 | 4.034 ± 0.842 | 4.034 ± 1.208 | 5.11 ± 1.147 | 0.269 ± 0.266 | 2.69 ± 1.131 | 0.0 ± 0.0
Ser | 4.572 ± 1.269 | 0.538 ± 0.297 | 4.841 ± 1.056 | 5.11 ± 1.318 | 2.421 ± 0.733 | 4.841 ± 1.609 | 1.345 ± 0.743 | 2.69 ± 1.832 | 3.497 ± 0.528 | 5.648 ± 0.635 | 1.076 ± 0.308 | 2.421 ± 0.894 | 2.959 ± 1.359 | 0.807 ± 0.372 | 3.765 ± 0.968 | 4.303 ± 0.698 | 5.648 ± 2.754 | 5.379 ± 0.578 | 1.076 ± 0.594 | 2.421 ± 0.619 | 0.0 ± 0.0
Thr | 4.303 ± 0.903 | 0.807 ± 0.446 | 2.421 ± 1.361 | 1.614 ± 0.241 | 2.69 ± 0.857 | 2.152 ± 0.615 | 1.614 ± 0.576 | 3.497 ± 1.205 | 4.572 ± 0.285 | 4.572 ± 0.844 | 1.614 ± 0.745 | 3.228 ± 0.723 | 2.421 ± 0.894 | 1.883 ± 0.625 | 2.421 ± 0.612 | 4.303 ± 0.606 | 4.841 ± 0.309 | 5.917 ± 0.814 | 0.269 ± 0.266 | 2.69 ± 0.405 | 0.0 ± 0.0
Val | 5.11 ± 0.579 | 2.69 ± 0.57 | 4.841 ± 0.884 | 5.648 ± 0.759 | 2.959 ± 0.891 | 3.765 ± 0.55 | 1.076 ± 0.308 | 3.765 ± 0.607 | 7.262 ± 1.464 | 4.841 ± 0.884 | 1.614 ± 0.632 | 6.455 ± 1.006 | 4.841 ± 0.309 | 2.959 ± 0.283 | 3.497 ± 0.696 | 4.841 ± 0.406 | 4.303 ± 0.268 | 4.841 ± 1.507 | 0.807 ± 0.777 | 1.614 ± 0.413 | 0.0 ± 0.0
Trp | 0.538 ± 0.297 | 0.0 ± 0.0 | 0.538 ± 0.297 | 0.807 ± 0.666 | 0.538 ± 0.297 | 0.269 ± 0.149 | 0.269 ± 0.149 | 0.269 ± 0.149 | 0.269 ± 0.266 | 0.807 ± 0.539 | 0.269 ± 0.144 | 0.538 ± 0.297 | 0.0 ± 0.0 | 0.269 ± 0.266 | 0.538 ± 0.186 | 0.807 ± 0.372 | 0.538 ± 0.186 | 0.538 ± 0.186 | 0.269 ± 0.149 | 0.0 ± 0.0 | 0.0 ± 0.0
Tyr | 4.303 ± 1.448 | 2.421 ± 0.612 | 2.959 ± 0.35 | 2.421 ± 0.619 | 1.883 ± 0.346 | 1.345 ± 0.623 | 0.538 ± 0.297 | 1.883 ± 0.65 | 3.497 ± 1.293 | 2.421 ± 0.256 | 1.614 ± 0.413 | 2.959 ± 0.764 | 1.883 ± 0.625 | 0.807 ± 0.529 | 1.614 ± 0.632 | 3.228 ± 1.111 | 1.883 ± 0.54 | 4.303 ± 0.753 | 0.269 ± 0.266 | 2.152 ± 0.615 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 4 proteins (3719 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
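A protein-level bootstrap of this kind can be sketched as below. The exact procedure used by Proteome-pI is not described here, so this is an assumption: whole proteins are resampled with replacement 100 times, the frequency is recomputed for each resample, and the standard deviation of those estimates is reported as the error. The function names and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Per-mille frequency of one dipeptide over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy sequences, not the four proteins of this proteome.
toy = ["MAALAA", "GAAVK", "MKKLL", "AAGAA"]
print(pair_permille(toy, "AA"), bootstrap_error(toy, "AA"))
```

With only a handful of proteins, as here, each resample can differ strongly from the original set, which is one reason the reported errors can be large relative to the values.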

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski