Amino acid dipeptide frequency for Hubei tombus-like virus 18

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with all 20 standard amino acids equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
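The counting described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the code used by the database: the function names (`dipeptide_per_mille`, `classify`) are hypothetical, sequences are assumed to be given in one-letter codes, and pairs are assumed to be counted within each protein (overlapping, never across protein boundaries).

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard residues, one-letter codes
UNIFORM_PER_MILLE = 1000 / 400      # 2.5 per mille, i.e. 0.25%


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs within one protein: positions (0,1), (1,2), ...
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000 * n / total for pair, n in counts.items()}


def classify(freq_per_mille, expected=UNIFORM_PER_MILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"    # would be marked red
    if freq_per_mille < expected:
        return "under"   # would be marked blue
    return "expected"
```

With the values from the table, `classify(11.737)` (GlyLeu) gives `"over"` and `classify(0.782)` (AlaPhe) gives `"under"`, since the uniform expectation is 2.5‰.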

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
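As a small worked example of the unit conversion, the following sketch turns a per mille value from the table into a fraction, a percentage, and a ratio relative to the uniform expectation of 1/400. The helper names are hypothetical and chosen only for illustration.

```python
def per_mille_to_fraction(value):
    """Per mille -> plain fraction (multiply by 10^-3)."""
    return value / 1000


def per_mille_to_percent(value):
    """Per mille -> percent (1 per mille = 0.1%)."""
    return value / 10


def fold_over_uniform(value, n_dipeptides=400):
    """Ratio of an observed per mille value to the uniform expectation 1/n."""
    return per_mille_to_fraction(value) * n_dipeptides


# GlyLeu is listed as 11.737 per mille in the table.
gly_leu = 11.737
print(per_mille_to_fraction(gly_leu))   # fraction of all dipeptides
print(per_mille_to_percent(gly_leu))    # the same value in percent
print(fold_over_uniform(gly_leu))       # roughly 4.7 times the uniform 0.25%
```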

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.695 ± 2.925
AlaCys: 3.912 ± 1.521
AlaAsp: 3.13 ± 0.875
AlaGlu: 6.26 ± 3.27
AlaPhe: 0.782 ± 0.76
AlaGly: 6.26 ± 2.413
AlaHis: 1.565 ± 1.263
AlaIle: 5.477 ± 1.649
AlaLys: 3.912 ± 0.537
AlaLeu: 4.695 ± 1.68
AlaMet: 3.13 ± 1.741
AlaAsn: 5.477 ± 2.52
AlaPro: 2.347 ± 0.528
AlaGln: 5.477 ± 1.695
AlaArg: 3.13 ± 1.025
AlaSer: 3.13 ± 1.161
AlaThr: 4.695 ± 1.371
AlaVal: 5.477 ± 2.028
AlaTrp: 1.565 ± 1.114
AlaTyr: 2.347 ± 0.528
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.565 ± 1.114
CysCys: 0.0 ± 0.0
CysAsp: 0.782 ± 0.631
CysGlu: 0.782 ± 0.557
CysPhe: 0.0 ± 0.0
CysGly: 1.565 ± 0.768
CysHis: 0.0 ± 0.0
CysIle: 0.782 ± 0.687
CysLys: 2.347 ± 1.396
CysLeu: 3.13 ± 0.897
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.782 ± 0.687
CysGln: 1.565 ± 0.811
CysArg: 0.782 ± 0.687
CysSer: 3.13 ± 1.145
CysThr: 0.0 ± 0.0
CysVal: 0.782 ± 0.687
CysTrp: 0.782 ± 0.76
CysTyr: 1.565 ± 1.114
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.13 ± 1.692
AspCys: 0.0 ± 0.0
AspAsp: 1.565 ± 1.114
AspGlu: 1.565 ± 1.263
AspPhe: 1.565 ± 1.114
AspGly: 2.347 ± 1.102
AspHis: 2.347 ± 0.975
AspIle: 1.565 ± 1.263
AspLys: 3.912 ± 1.156
AspLeu: 5.477 ± 2.658
AspMet: 0.782 ± 0.687
AspAsn: 0.782 ± 0.631
AspPro: 3.912 ± 2.346
AspGln: 1.565 ± 0.783
AspArg: 2.347 ± 0.975
AspSer: 1.565 ± 0.603
AspThr: 2.347 ± 0.528
AspVal: 3.912 ± 1.522
AspTrp: 0.782 ± 0.76
AspTyr: 0.782 ± 0.687
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.912 ± 0.496
GluCys: 2.347 ± 0.896
GluAsp: 3.13 ± 1.025
GluGlu: 4.695 ± 1.056
GluPhe: 3.13 ± 1.207
GluGly: 2.347 ± 1.202
GluHis: 0.782 ± 0.631
GluIle: 7.042 ± 2.171
GluLys: 3.13 ± 1.7
GluLeu: 2.347 ± 1.102
GluMet: 3.13 ± 1.018
GluAsn: 2.347 ± 1.154
GluPro: 1.565 ± 1.263
GluGln: 2.347 ± 1.396
GluArg: 7.042 ± 2.171
GluSer: 3.13 ± 2.228
GluThr: 5.477 ± 2.249
GluVal: 0.0 ± 0.0
GluTrp: 1.565 ± 0.811
GluTyr: 0.782 ± 0.631
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.347 ± 1.422
PheCys: 1.565 ± 1.114
PheAsp: 3.13 ± 1.468
PheGlu: 3.13 ± 1.145
PhePhe: 0.782 ± 0.631
PheGly: 2.347 ± 1.102
PheHis: 2.347 ± 1.105
PheIle: 1.565 ± 1.114
PheLys: 0.782 ± 0.557
PheLeu: 0.782 ± 0.687
PheMet: 0.782 ± 0.631
PheAsn: 2.347 ± 0.528
PhePro: 0.0 ± 0.0
PheGln: 0.0 ± 0.0
PheArg: 4.695 ± 1.127
PheSer: 0.782 ± 0.687
PheThr: 2.347 ± 1.408
PheVal: 2.347 ± 1.154
PheTrp: 0.782 ± 0.557
PheTyr: 0.782 ± 0.76
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.695 ± 1.826
GlyCys: 1.565 ± 0.811
GlyAsp: 4.695 ± 2.205
GlyGlu: 3.912 ± 1.214
GlyPhe: 1.565 ± 0.811
GlyGly: 6.26 ± 3.387
GlyHis: 0.782 ± 0.631
GlyIle: 4.695 ± 1.371
GlyLys: 2.347 ± 1.671
GlyLeu: 11.737 ± 1.881
GlyMet: 0.782 ± 0.631
GlyAsn: 3.13 ± 1.161
GlyPro: 2.347 ± 1.102
GlyGln: 0.782 ± 0.76
GlyArg: 10.955 ± 6.855
GlySer: 3.13 ± 2.296
GlyThr: 3.13 ± 1.207
GlyVal: 3.912 ± 1.373
GlyTrp: 3.13 ± 1.573
GlyTyr: 1.565 ± 0.783
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 0.0 ± 0.0
HisPhe: 0.782 ± 0.557
HisGly: 0.0 ± 0.0
HisHis: 0.0 ± 0.0
HisIle: 0.782 ± 0.631
HisLys: 0.782 ± 0.557
HisLeu: 1.565 ± 0.603
HisMet: 0.0 ± 0.0
HisAsn: 0.782 ± 0.557
HisPro: 0.782 ± 0.631
HisGln: 1.565 ± 1.114
HisArg: 3.912 ± 3.157
HisSer: 0.782 ± 0.557
HisThr: 0.782 ± 0.557
HisVal: 0.782 ± 0.76
HisTrp: 0.0 ± 0.0
HisTyr: 1.565 ± 0.811
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.477 ± 2.126
IleCys: 0.782 ± 0.687
IleAsp: 1.565 ± 0.603
IleGlu: 1.565 ± 0.811
IlePhe: 0.782 ± 0.631
IleGly: 2.347 ± 1.102
IleHis: 0.0 ± 0.0
IleIle: 3.912 ± 1.432
IleLys: 3.912 ± 2.085
IleLeu: 4.695 ± 1.296
IleMet: 2.347 ± 0.586
IleAsn: 3.13 ± 0.752
IlePro: 3.912 ± 2.304
IleGln: 4.695 ± 2.114
IleArg: 4.695 ± 1.079
IleSer: 3.13 ± 1.621
IleThr: 1.565 ± 0.958
IleVal: 4.695 ± 1.826
IleTrp: 0.0 ± 0.0
IleTyr: 0.0 ± 0.0
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.912 ± 2.222
LysCys: 1.565 ± 0.603
LysAsp: 0.782 ± 0.687
LysGlu: 1.565 ± 1.374
LysPhe: 3.912 ± 2.785
LysGly: 2.347 ± 1.105
LysHis: 0.0 ± 0.0
LysIle: 3.13 ± 1.535
LysLys: 0.782 ± 0.557
LysLeu: 3.13 ± 2.045
LysMet: 1.565 ± 1.114
LysAsn: 4.695 ± 1.894
LysPro: 5.477 ± 1.188
LysGln: 1.565 ± 1.374
LysArg: 1.565 ± 0.71
LysSer: 2.347 ± 1.483
LysThr: 3.13 ± 0.355
LysVal: 5.477 ± 1.175
LysTrp: 0.0 ± 0.0
LysTyr: 4.695 ± 2.536
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.825 ± 2.797
LeuCys: 2.347 ± 1.246
LeuAsp: 4.695 ± 2.536
LeuGlu: 7.042 ± 1.201
LeuPhe: 2.347 ± 1.154
LeuGly: 9.39 ± 2.747
LeuHis: 2.347 ± 0.975
LeuIle: 3.912 ± 1.432
LeuLys: 4.695 ± 2.046
LeuLeu: 10.172 ± 2.931
LeuMet: 3.13 ± 1.621
LeuAsn: 3.13 ± 1.468
LeuPro: 5.477 ± 1.697
LeuGln: 5.477 ± 1.903
LeuArg: 7.042 ± 2.171
LeuSer: 8.607 ± 2.357
LeuThr: 1.565 ± 0.71
LeuVal: 5.477 ± 0.797
LeuTrp: 1.565 ± 1.374
LeuTyr: 2.347 ± 1.396
LeuXaa: 0.0 ± 0.0
Met
MetAla: 4.695 ± 0.251
MetCys: 0.0 ± 0.0
MetAsp: 0.0 ± 0.0
MetGlu: 3.13 ± 0.897
MetPhe: 3.912 ± 0.842
MetGly: 0.0 ± 0.0
MetHis: 0.782 ± 0.631
MetIle: 0.782 ± 0.687
MetLys: 2.347 ± 2.061
MetLeu: 2.347 ± 1.671
MetMet: 1.565 ± 0.738
MetAsn: 0.782 ± 0.687
MetPro: 0.782 ± 0.631
MetGln: 2.347 ± 0.765
MetArg: 1.565 ± 0.71
MetSer: 2.347 ± 0.896
MetThr: 0.0 ± 0.0
MetVal: 0.0 ± 0.0
MetTrp: 0.0 ± 0.0
MetTyr: 3.13 ± 2.045
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.347 ± 1.102
AsnCys: 0.0 ± 0.0
AsnAsp: 0.782 ± 0.631
AsnGlu: 3.912 ± 1.809
AsnPhe: 2.347 ± 0.765
AsnGly: 1.565 ± 0.603
AsnHis: 0.0 ± 0.0
AsnIle: 3.13 ± 0.752
AsnLys: 2.347 ± 1.102
AsnLeu: 3.912 ± 2.222
AsnMet: 1.565 ± 0.768
AsnAsn: 0.782 ± 0.557
AsnPro: 2.347 ± 1.105
AsnGln: 1.565 ± 0.958
AsnArg: 2.347 ± 1.105
AsnSer: 2.347 ± 0.528
AsnThr: 2.347 ± 1.209
AsnVal: 3.912 ± 0.496
AsnTrp: 0.0 ± 0.0
AsnTyr: 1.565 ± 0.958
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 7.042 ± 3.455
ProCys: 1.565 ± 0.811
ProAsp: 3.912 ± 0.842
ProGlu: 3.13 ± 1.025
ProPhe: 0.782 ± 0.557
ProGly: 5.477 ± 2.126
ProHis: 0.0 ± 0.0
ProIle: 0.782 ± 0.687
ProLys: 2.347 ± 0.528
ProLeu: 7.825 ± 1.832
ProMet: 0.782 ± 0.687
ProAsn: 1.565 ± 1.263
ProPro: 3.13 ± 0.752
ProGln: 3.13 ± 1.025
ProArg: 2.347 ± 0.528
ProSer: 1.565 ± 0.71
ProThr: 2.347 ± 1.154
ProVal: 2.347 ± 0.586
ProTrp: 0.0 ± 0.0
ProTyr: 3.13 ± 1.145
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.13 ± 1.419
GlnCys: 0.0 ± 0.0
GlnAsp: 2.347 ± 1.894
GlnGlu: 3.13 ± 2.045
GlnPhe: 2.347 ± 1.422
GlnGly: 2.347 ± 0.586
GlnHis: 0.782 ± 0.557
GlnIle: 0.0 ± 0.0
GlnLys: 1.565 ± 0.811
GlnLeu: 7.042 ± 3.157
GlnMet: 0.782 ± 0.76
GlnAsn: 0.782 ± 0.687
GlnPro: 4.695 ± 2.205
GlnGln: 3.912 ± 1.614
GlnArg: 3.13 ± 1.161
GlnSer: 3.13 ± 1.145
GlnThr: 3.912 ± 1.769
GlnVal: 4.695 ± 2.335
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.565 ± 0.603
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.477 ± 0.797
ArgCys: 1.565 ± 1.114
ArgAsp: 6.26 ± 4.192
ArgGlu: 6.26 ± 4.176
ArgPhe: 2.347 ± 1.154
ArgGly: 10.955 ± 7.762
ArgHis: 0.0 ± 0.0
ArgIle: 3.912 ± 0.537
ArgLys: 3.13 ± 0.91
ArgLeu: 7.042 ± 1.201
ArgMet: 3.912 ± 1.266
ArgAsn: 2.347 ± 1.209
ArgPro: 3.13 ± 0.875
ArgGln: 0.782 ± 0.631
ArgArg: 3.912 ± 1.214
ArgSer: 3.912 ± 1.156
ArgThr: 4.695 ± 2.129
ArgVal: 5.477 ± 1.301
ArgTrp: 1.565 ± 0.603
ArgTyr: 5.477 ± 2.79
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.695 ± 1.777
SerCys: 0.0 ± 0.0
SerAsp: 1.565 ± 0.811
SerGlu: 2.347 ± 1.396
SerPhe: 1.565 ± 1.114
SerGly: 4.695 ± 2.21
SerHis: 1.565 ± 1.114
SerIle: 3.912 ± 1.769
SerLys: 3.13 ± 1.468
SerLeu: 7.042 ± 2.33
SerMet: 0.782 ± 0.687
SerAsn: 0.782 ± 0.631
SerPro: 0.782 ± 0.687
SerGln: 2.347 ± 1.586
SerArg: 7.042 ± 1.268
SerSer: 3.912 ± 1.661
SerThr: 0.782 ± 0.557
SerVal: 2.347 ± 1.202
SerTrp: 0.0 ± 0.0
SerTyr: 2.347 ± 1.586
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.13 ± 0.875
ThrCys: 0.782 ± 0.687
ThrAsp: 0.782 ± 0.631
ThrGlu: 3.13 ± 1.535
ThrPhe: 0.782 ± 0.76
ThrGly: 6.26 ± 1.49
ThrHis: 0.782 ± 0.631
ThrIle: 2.347 ± 1.105
ThrLys: 0.782 ± 0.687
ThrLeu: 3.13 ± 1.126
ThrMet: 2.347 ± 0.513
ThrAsn: 0.0 ± 0.0
ThrPro: 4.695 ± 2.081
ThrGln: 2.347 ± 1.154
ThrArg: 5.477 ± 1.567
ThrSer: 1.565 ± 1.374
ThrThr: 0.782 ± 0.557
ThrVal: 3.13 ± 0.355
ThrTrp: 0.782 ± 0.687
ThrTyr: 0.782 ± 0.557
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.695 ± 2.815
ValCys: 2.347 ± 1.209
ValAsp: 1.565 ± 1.52
ValGlu: 4.695 ± 1.894
ValPhe: 2.347 ± 1.422
ValGly: 6.26 ± 2.291
ValHis: 0.0 ± 0.0
ValIle: 3.13 ± 1.207
ValLys: 3.13 ± 1.4
ValLeu: 3.13 ± 1.419
ValMet: 0.0 ± 0.0
ValAsn: 4.695 ± 1.628
ValPro: 4.695 ± 2.033
ValGln: 3.13 ± 1.207
ValArg: 3.912 ± 1.661
ValSer: 1.565 ± 0.783
ValThr: 3.13 ± 1.161
ValVal: 8.607 ± 2.909
ValTrp: 0.782 ± 0.631
ValTyr: 5.477 ± 0.581
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.565 ± 0.811
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.0 ± 0.0
TrpGly: 0.782 ± 0.557
TrpHis: 0.0 ± 0.0
TrpIle: 0.782 ± 0.631
TrpLys: 1.565 ± 0.811
TrpLeu: 2.347 ± 1.209
TrpMet: 1.565 ± 0.603
TrpAsn: 0.782 ± 0.557
TrpPro: 0.0 ± 0.0
TrpGln: 1.565 ± 0.768
TrpArg: 1.565 ± 0.958
TrpSer: 0.0 ± 0.0
TrpThr: 0.0 ± 0.0
TrpVal: 0.782 ± 0.76
TrpTrp: 1.565 ± 0.603
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.13 ± 0.897
TyrCys: 0.0 ± 0.0
TyrAsp: 1.565 ± 0.603
TyrGlu: 0.782 ± 0.76
TyrPhe: 1.565 ± 0.768
TyrGly: 1.565 ± 0.811
TyrHis: 0.782 ± 0.687
TyrIle: 1.565 ± 0.603
TyrLys: 4.695 ± 1.772
TyrLeu: 7.042 ± 2.92
TyrMet: 0.782 ± 0.631
TyrAsn: 0.782 ± 0.557
TyrPro: 2.347 ± 1.483
TyrGln: 3.13 ± 1.145
TyrArg: 4.695 ± 0.906
TyrSer: 1.565 ± 1.114
TyrThr: 0.782 ± 0.687
TyrVal: 3.13 ± 0.355
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.782 ± 0.76
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1279 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
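A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration under stated assumptions, not the database's actual procedure: the function names are hypothetical, whole proteins are resampled with replacement, the frequency is recomputed on each resample, and the reported error is assumed to be the standard deviation across the 100 replicates (the page does not say which spread measure is used).

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Return (mean, spread) of a dipeptide frequency in per mille.

    Each replicate draws len(proteins) proteins with replacement,
    so the resampling unit is the protein, not the residue.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)  # Counter gives 0 if absent
        estimates.append(1000 * hits / total if total else 0.0)
    # Assumption: population standard deviation of the replicates as the error.
    return statistics.mean(estimates), statistics.pstdev(estimates)
```

Under this scheme a dipeptide that never occurs in any protein gets an error of exactly zero, which matches entries such as `0.0 ± 0.0` in the table. With only 4 proteins, the replicates can differ strongly from each other, so large errors relative to the values are to be expected.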

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski