Amino acid dipeptide frequency for Hubei tombus-like virus 16

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred with equal probability in proteins, each of the 400 would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
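The counting described above can be sketched in Python. This is a minimal illustration, not the database's own code: the function names, the toy sequences, and the choice to count overlapping pairs within each protein (never across protein boundaries) are assumptions, and sequences are assumed to be given in one-letter codes. The over/under threshold is assumed to be the uniform expectation of 1/400.

```python
from collections import Counter

# Uniform expectation for 400 dipeptides: 1/400 = 0.25 % = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; pairs never span two proteins
        # (an assumption of this sketch).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille, expected=UNIFORM_PER_MILLE):
    """Label a frequency relative to the assumed uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# Hypothetical toy input, not the real proteome.
toy = ["AAAC", "ACA"]
freqs = dipeptide_per_mille(toy)
```

With this rule, a table value such as 11.354 (AlaAla) would be classified as "over" and 0.873 (AlaCys) as "under".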

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
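As a small worked example of this unit conversion, the following Python sketch converts a per mille value to a fraction and to a percentage so it can be compared with the 0.25% uniform expectation. The helper names are hypothetical.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


# AlaAla is listed as 11.354 per mille in the table.
ala_ala_fraction = per_mille_to_fraction(11.354)
ala_ala_percent = per_mille_to_percent(11.354)

# The percentage is well above the 0.25 % uniform expectation.
is_above_uniform = ala_ala_percent > 0.25
```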

For more information, see the sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide, columns the second (e.g. row Ala, column Cys is AlaCys). Each cell shows frequency ± error in ‰.

| 1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 11.354 ± 4.265 | 0.873 ± 0.718 | 0.873 ± 0.638 | 3.493 ± 2.102 | 5.24 ± 1.008 | 5.24 ± 2.436 | 0.873 ± 0.638 | 3.493 ± 1.156 | 2.62 ± 0.486 | 4.367 ± 0.973 | 1.747 ± 0.578 | 0.873 ± 1.028 | 5.24 ± 2.786 | 1.747 ± 0.882 | 6.114 ± 2.793 | 1.747 ± 1.051 | 4.367 ± 2.65 | 2.62 ± 0.984 | 1.747 ± 0.882 | 5.24 ± 0.972 | 0.0 ± 0.0 |
| Cys | 0.873 ± 1.028 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.747 ± 1.436 | 0.873 ± 0.718 | 1.747 ± 0.578 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.747 ± 1.436 | 2.62 ± 1.137 | 0.0 ± 0.0 | 0.873 ± 0.718 | 1.747 ± 1.277 | 0.0 ± 0.0 | 0.873 ± 0.718 | 2.62 ± 0.984 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.62 ± 1.951 | 0.0 ± 0.0 |
| Asp | 1.747 ± 2.056 | 2.62 ± 1.137 | 1.747 ± 1.436 | 5.24 ± 1.967 | 0.873 ± 0.638 | 1.747 ± 1.436 | 1.747 ± 1.436 | 7.86 ± 3.821 | 2.62 ± 1.137 | 3.493 ± 2.554 | 0.873 ± 0.638 | 0.873 ± 0.638 | 3.493 ± 0.852 | 1.747 ± 0.578 | 0.0 ± 0.0 | 2.62 ± 2.155 | 2.62 ± 2.155 | 0.873 ± 0.638 | 0.873 ± 0.638 | 1.747 ± 1.436 | 0.0 ± 0.0 |
| Glu | 2.62 ± 0.984 | 0.0 ± 0.0 | 2.62 ± 0.486 | 6.987 ± 1.746 | 2.62 ± 1.478 | 1.747 ± 0.882 | 3.493 ± 1.156 | 5.24 ± 1.967 | 5.24 ± 1.967 | 7.86 ± 2.951 | 0.873 ± 0.718 | 1.747 ± 0.578 | 7.86 ± 4.654 | 0.0 ± 0.0 | 6.987 ± 1.746 | 0.873 ± 0.718 | 1.747 ± 0.578 | 3.493 ± 1.811 | 0.0 ± 0.0 | 2.62 ± 0.486 | 0.0 ± 0.0 |
| Phe | 0.873 ± 1.028 | 0.873 ± 0.718 | 1.747 ± 1.436 | 5.24 ± 2.273 | 2.62 ± 0.984 | 0.873 ± 0.638 | 1.747 ± 0.578 | 1.747 ± 0.578 | 3.493 ± 1.156 | 0.873 ± 0.718 | 1.747 ± 1.023 | 5.24 ± 2.436 | 1.747 ± 1.277 | 0.873 ± 0.718 | 2.62 ± 1.147 | 1.747 ± 1.436 | 1.747 ± 0.882 | 3.493 ± 1.554 | 0.0 ± 0.0 | 0.873 ± 0.638 | 0.0 ± 0.0 |
| Gly | 4.367 ± 1.23 | 0.873 ± 0.718 | 4.367 ± 1.499 | 1.747 ± 0.578 | 1.747 ± 1.436 | 5.24 ± 4.959 | 0.0 ± 0.0 | 2.62 ± 1.478 | 2.62 ± 0.486 | 3.493 ± 1.156 | 0.873 ± 0.637 | 5.24 ± 1.734 | 3.493 ± 1.501 | 3.493 ± 4.111 | 8.734 ± 2.46 | 3.493 ± 1.764 | 3.493 ± 1.501 | 3.493 ± 2.102 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| His | 1.747 ± 1.277 | 0.873 ± 0.638 | 0.873 ± 0.638 | 1.747 ± 1.436 | 0.873 ± 0.718 | 0.0 ± 0.0 | 0.873 ± 0.638 | 0.0 ± 0.0 | 0.873 ± 0.718 | 3.493 ± 1.811 | 0.873 ± 0.638 | 0.873 ± 0.638 | 3.493 ± 1.501 | 1.747 ± 1.277 | 0.873 ± 0.718 | 0.873 ± 0.718 | 0.873 ± 0.638 | 0.873 ± 1.028 | 0.0 ± 0.0 | 2.62 ± 2.155 | 0.0 ± 0.0 |
| Ile | 0.873 ± 0.718 | 0.0 ± 0.0 | 4.367 ± 0.29 | 3.493 ± 0.852 | 0.873 ± 1.028 | 2.62 ± 1.915 | 2.62 ± 0.486 | 1.747 ± 0.578 | 6.114 ± 0.657 | 1.747 ± 0.882 | 0.873 ± 0.718 | 5.24 ± 1.261 | 1.747 ± 0.578 | 3.493 ± 1.156 | 2.62 ± 0.486 | 4.367 ± 1.654 | 1.747 ± 1.051 | 3.493 ± 1.156 | 0.873 ± 0.638 | 0.873 ± 0.718 | 0.0 ± 0.0 |
| Lys | 3.493 ± 0.852 | 1.747 ± 1.051 | 1.747 ± 1.277 | 1.747 ± 1.277 | 1.747 ± 0.578 | 3.493 ± 1.156 | 4.367 ± 1.481 | 3.493 ± 0.852 | 2.62 ± 1.147 | 5.24 ± 0.507 | 2.62 ± 1.137 | 3.493 ± 0.429 | 4.367 ± 1.654 | 3.493 ± 1.554 | 6.987 ± 2.447 | 2.62 ± 1.478 | 2.62 ± 0.486 | 2.62 ± 0.984 | 1.747 ± 1.277 | 4.367 ± 2.511 | 0.0 ± 0.0 |
| Leu | 7.86 ± 2.921 | 1.747 ± 1.436 | 6.987 ± 1.746 | 7.86 ± 2.951 | 4.367 ± 1.481 | 6.114 ± 1.187 | 1.747 ± 1.436 | 2.62 ± 0.984 | 2.62 ± 0.486 | 7.86 ± 2.951 | 0.873 ± 0.638 | 0.873 ± 0.718 | 5.24 ± 2.796 | 4.367 ± 0.973 | 3.493 ± 1.811 | 4.367 ± 0.29 | 3.493 ± 1.633 | 2.62 ± 1.147 | 1.747 ± 1.436 | 2.62 ± 1.147 | 0.0 ± 0.0 |
| Met | 0.873 ± 1.028 | 0.873 ± 0.718 | 0.873 ± 0.638 | 1.747 ± 1.277 | 0.873 ± 0.718 | 0.873 ± 0.718 | 1.747 ± 1.051 | 0.873 ± 0.638 | 1.747 ± 0.578 | 2.62 ± 1.478 | 1.747 ± 1.436 | 0.873 ± 0.638 | 0.873 ± 0.638 | 0.873 ± 1.028 | 1.747 ± 0.882 | 2.62 ± 1.137 | 0.873 ± 0.718 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Asn | 3.493 ± 1.764 | 1.747 ± 0.882 | 1.747 ± 0.578 | 0.0 ± 0.0 | 2.62 ± 1.137 | 2.62 ± 2.155 | 0.0 ± 0.0 | 2.62 ± 0.486 | 2.62 ± 0.984 | 3.493 ± 0.852 | 0.873 ± 1.028 | 0.873 ± 0.718 | 1.747 ± 0.882 | 0.873 ± 1.028 | 2.62 ± 1.915 | 4.367 ± 2.96 | 3.493 ± 0.429 | 4.367 ± 2.199 | 0.0 ± 0.0 | 4.367 ± 0.29 | 0.0 ± 0.0 |
| Pro | 6.987 ± 1.746 | 0.873 ± 0.718 | 2.62 ± 1.137 | 6.987 ± 5.108 | 3.493 ± 0.429 | 1.747 ± 0.882 | 1.747 ± 1.277 | 1.747 ± 0.578 | 5.24 ± 2.786 | 3.493 ± 2.554 | 2.62 ± 0.486 | 1.747 ± 1.277 | 10.48 ± 7.224 | 0.873 ± 0.638 | 9.607 ± 3.479 | 6.114 ± 3.408 | 5.24 ± 2.646 | 3.493 ± 0.429 | 2.62 ± 1.147 | 0.873 ± 1.028 | 0.0 ± 0.0 |
| Gln | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.873 ± 0.718 | 0.873 ± 0.638 | 4.367 ± 1.23 | 0.0 ± 0.0 | 0.0 ± 0.0 | 3.493 ± 1.501 | 1.747 ± 1.277 | 5.24 ± 0.972 | 0.0 ± 0.0 | 1.747 ± 1.051 | 7.86 ± 2.591 | 1.747 ± 0.578 | 3.493 ± 0.852 | 1.747 ± 0.578 | 0.873 ± 0.638 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.747 ± 0.578 | 0.0 ± 0.0 |
| Arg | 7.86 ± 1.487 | 0.0 ± 0.0 | 4.367 ± 1.481 | 4.367 ± 2.163 | 3.493 ± 2.554 | 8.734 ± 6.458 | 0.0 ± 0.0 | 2.62 ± 0.486 | 3.493 ± 1.156 | 6.114 ± 0.657 | 0.873 ± 0.718 | 0.873 ± 0.638 | 5.24 ± 1.589 | 2.62 ± 1.147 | 8.734 ± 5.3 | 4.367 ± 1.508 | 6.987 ± 2.921 | 6.114 ± 0.774 | 0.873 ± 1.028 | 3.493 ± 0.429 | 0.0 ± 0.0 |
| Ser | 5.24 ± 1.967 | 0.873 ± 0.638 | 0.873 ± 0.638 | 2.62 ± 1.915 | 1.747 ± 1.051 | 6.114 ± 2.327 | 0.0 ± 0.0 | 1.747 ± 1.436 | 4.367 ± 3.591 | 2.62 ± 0.984 | 0.0 ± 0.0 | 6.114 ± 1.98 | 0.873 ± 1.028 | 0.0 ± 0.0 | 5.24 ± 1.008 | 2.62 ± 2.155 | 5.24 ± 3.153 | 6.114 ± 1.726 | 1.747 ± 1.436 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Thr | 3.493 ± 2.936 | 0.873 ± 1.028 | 0.873 ± 0.638 | 4.367 ± 1.23 | 0.873 ± 0.638 | 4.367 ± 1.23 | 1.747 ± 1.051 | 2.62 ± 0.486 | 1.747 ± 0.578 | 5.24 ± 2.436 | 0.873 ± 0.718 | 2.62 ± 0.486 | 4.367 ± 0.973 | 1.747 ± 0.882 | 4.367 ± 1.654 | 2.62 ± 1.478 | 7.86 ± 5.44 | 3.493 ± 1.156 | 1.747 ± 0.578 | 2.62 ± 1.806 | 0.0 ± 0.0 |
| Val | 0.873 ± 1.028 | 1.747 ± 1.277 | 1.747 ± 0.578 | 2.62 ± 1.137 | 0.0 ± 0.0 | 2.62 ± 0.486 | 0.0 ± 0.0 | 2.62 ± 2.155 | 9.607 ± 2.662 | 4.367 ± 0.29 | 0.873 ± 0.638 | 1.747 ± 0.882 | 6.114 ± 2.218 | 3.493 ± 1.156 | 3.493 ± 0.429 | 1.747 ± 1.436 | 2.62 ± 1.147 | 3.493 ± 0.429 | 0.873 ± 0.718 | 1.747 ± 0.578 | 0.0 ± 0.0 |
| Trp | 0.873 ± 0.638 | 0.873 ± 0.718 | 1.747 ± 1.277 | 0.0 ± 0.0 | 0.873 ± 0.638 | 1.747 ± 0.578 | 0.0 ± 0.0 | 0.873 ± 1.028 | 1.747 ± 1.051 | 1.747 ± 0.578 | 0.873 ± 1.028 | 0.873 ± 0.638 | 0.0 ± 0.0 | 0.873 ± 0.718 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.873 ± 0.718 | 0.873 ± 0.638 | 0.873 ± 0.718 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Tyr | 4.367 ± 1.481 | 0.873 ± 0.718 | 4.367 ± 2.727 | 1.747 ± 1.051 | 0.873 ± 0.638 | 1.747 ± 2.056 | 1.747 ± 0.578 | 1.747 ± 1.051 | 1.747 ± 0.578 | 3.493 ± 1.633 | 1.747 ± 1.051 | 1.747 ± 1.051 | 2.62 ± 1.806 | 1.747 ± 1.436 | 2.62 ± 0.486 | 2.62 ± 1.137 | 1.747 ± 0.578 | 0.873 ± 0.638 | 0.0 ± 0.0 | 1.747 ± 0.882 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 3 proteins (1146 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
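A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each of 100 replicates, and the error is taken as the population standard deviation of those replicates. The exact procedure used by the database is not described here, and the function names, the seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs within each protein (assumption of this sketch).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Spread of the frequency across protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    # Assumption: the reported error is a standard deviation.
    return statistics.pstdev(estimates)


# Hypothetical toy proteome in one-letter codes.
toy = ["AAAC", "ACA", "CCAA"]
err = bootstrap_error(toy, "AA")
```

A dipeptide that never occurs has a frequency of zero in every replicate, so its error is also zero, which matches the 0.0 ± 0.0 entries in the table. With only 3 proteins, replicates differ strongly from one another, which is consistent with the large errors relative to the means.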

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski