Amino acid dipeptide frequency for Hubei tombus-like virus 33

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25%. As this is not the case in nature, for better visibility the dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
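The counting described above can be sketched as follows. This is a minimal illustration, not the Proteome-pI implementation: the function name `dipeptide_permille`, the use of one-letter codes (with `X` standing for Xaa), and the choice to count overlapping pairs that never span two proteins are all assumptions made for the example.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes


def dipeptide_permille(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: overlapping adjacent pairs are counted within each protein,
    and any non-standard residue is mapped to X (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Sliding window of length 2; pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Uniform expectation over the 400 standard dipeptides: 2.5 per mille = 0.25 %.
uniform_permille = 1000.0 / 400

# Toy sequences (hypothetical, not taken from this proteome).
freqs = dipeptide_permille(["MKVLAAGK", "AAGKU"])
```

Here the toy input yields 11 dipeptides in total, so every frequency is a multiple of 1000/11 per mille; the trailing `U` is treated as Xaa.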

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
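As a small worked example of the unit conversion and of the comparison against the uniform expectation, consider the sketch below. The helper names are hypothetical, and the colouring rule (strictly above or below 2.5‰) is an assumption: the page only says that dipeptides occurring more often than expected are red and underrepresented ones are blue.

```python
# Assumption: "expected" means the uniform value over 400 dipeptides.
EXPECTED_PERMILLE = 1000.0 / 400  # 2.5 per mille = 0.25 %


def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def colour(value_permille, expected=EXPECTED_PERMILLE):
    """Hypothetical colouring rule: red above expectation, blue below."""
    if value_permille > expected:
        return "red"
    if value_permille < expected:
        return "blue"
    return "none"


# Two values taken from the table: SerSer and AlaGln.
ser_ser = permille_to_fraction(10.679)
ser_ser_colour = colour(10.679)
ala_gln_colour = colour(0.763)
```

Under these assumptions SerSer (10.679‰, a fraction of about 0.0107) would be marked red and AlaGln (0.763‰) blue.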

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.577 ± 3.052
AlaCys: 2.288 ± 1.285
AlaAsp: 3.051 ± 0.308
AlaGlu: 4.577 ± 1.164
AlaPhe: 2.288 ± 1.526
AlaGly: 2.288 ± 0.121
AlaHis: 2.288 ± 1.285
AlaIle: 7.628 ± 1.339
AlaLys: 4.577 ± 0.241
AlaLeu: 5.339 ± 2.624
AlaMet: 1.526 ± 0.413
AlaAsn: 3.051 ± 0.308
AlaPro: 2.288 ± 1.285
AlaGln: 0.763 ± 0.977
AlaArg: 1.526 ± 0.857
AlaSer: 3.051 ± 3.909
AlaThr: 7.628 ± 5.556
AlaVal: 6.102 ± 0.615
AlaTrp: 0.763 ± 0.428
AlaTyr: 0.763 ± 0.977
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.526 ± 0.549
CysCys: 0.0 ± 0.0
CysAsp: 1.526 ± 0.857
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 1.526 ± 0.857
CysLeu: 0.763 ± 0.428
CysMet: 0.0 ± 0.0
CysAsn: 0.763 ± 0.428
CysPro: 1.526 ± 0.549
CysGln: 0.763 ± 0.428
CysArg: 0.0 ± 0.0
CysSer: 1.526 ± 0.857
CysThr: 0.0 ± 0.0
CysVal: 0.763 ± 0.428
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.526 ± 0.857
AspCys: 0.763 ± 0.428
AspAsp: 5.339 ± 2.998
AspGlu: 3.051 ± 1.713
AspPhe: 2.288 ± 0.121
AspGly: 0.0 ± 0.0
AspHis: 0.0 ± 0.0
AspIle: 3.051 ± 1.713
AspLys: 4.577 ± 1.164
AspLeu: 4.577 ± 1.164
AspMet: 0.0 ± 0.0
AspAsn: 2.288 ± 1.285
AspPro: 2.288 ± 1.526
AspGln: 3.051 ± 1.098
AspArg: 2.288 ± 1.285
AspSer: 6.102 ± 0.615
AspThr: 3.051 ± 0.308
AspVal: 4.577 ± 0.241
AspTrp: 0.0 ± 0.0
AspTyr: 1.526 ± 0.857
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.288 ± 0.121
GluCys: 0.763 ± 0.428
GluAsp: 1.526 ± 0.857
GluGlu: 2.288 ± 1.285
GluPhe: 2.288 ± 1.285
GluGly: 4.577 ± 0.241
GluHis: 0.763 ± 0.428
GluIle: 2.288 ± 1.285
GluLys: 3.814 ± 0.736
GluLeu: 7.628 ± 1.472
GluMet: 0.763 ± 0.428
GluAsn: 3.814 ± 0.67
GluPro: 0.0 ± 0.0
GluGln: 0.763 ± 0.428
GluArg: 0.0 ± 0.0
GluSer: 6.102 ± 3.427
GluThr: 2.288 ± 0.121
GluVal: 1.526 ± 0.549
GluTrp: 2.288 ± 1.285
GluTyr: 1.526 ± 0.857
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.526 ± 0.549
PheCys: 0.763 ± 0.977
PheAsp: 3.051 ± 0.308
PheGlu: 1.526 ± 0.549
PhePhe: 2.288 ± 1.285
PheGly: 2.288 ± 0.121
PheHis: 0.0 ± 0.0
PheIle: 2.288 ± 0.121
PheLys: 5.339 ± 0.187
PheLeu: 4.577 ± 3.052
PheMet: 1.526 ± 0.857
PheAsn: 3.051 ± 0.308
PhePro: 2.288 ± 1.526
PheGln: 0.763 ± 0.977
PheArg: 1.526 ± 0.857
PheSer: 3.814 ± 0.736
PheThr: 1.526 ± 1.955
PheVal: 2.288 ± 1.285
PheTrp: 0.763 ± 0.977
PheTyr: 3.051 ± 1.713
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 9.153 ± 6.105
GlyCys: 0.763 ± 0.977
GlyAsp: 5.339 ± 1.219
GlyGlu: 0.763 ± 0.428
GlyPhe: 1.526 ± 0.857
GlyGly: 2.288 ± 0.121
GlyHis: 0.763 ± 0.428
GlyIle: 4.577 ± 0.241
GlyLys: 1.526 ± 0.857
GlyLeu: 5.339 ± 1.593
GlyMet: 1.526 ± 1.955
GlyAsn: 4.577 ± 0.241
GlyPro: 0.763 ± 0.428
GlyGln: 0.763 ± 0.428
GlyArg: 3.814 ± 0.736
GlySer: 3.814 ± 2.075
GlyThr: 6.102 ± 3.601
GlyVal: 6.865 ± 1.767
GlyTrp: 1.526 ± 0.857
GlyTyr: 5.339 ± 0.187
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.0 ± 0.0
HisAsp: 3.814 ± 2.142
HisGlu: 1.526 ± 0.857
HisPhe: 0.763 ± 0.428
HisGly: 1.526 ± 0.549
HisHis: 0.763 ± 0.977
HisIle: 2.288 ± 0.121
HisLys: 3.051 ± 1.713
HisLeu: 0.763 ± 0.977
HisMet: 0.0 ± 0.0
HisAsn: 3.051 ± 1.713
HisPro: 0.763 ± 0.428
HisGln: 0.0 ± 0.0
HisArg: 0.763 ± 0.428
HisSer: 0.763 ± 0.428
HisThr: 3.814 ± 0.736
HisVal: 1.526 ± 0.857
HisTrp: 0.0 ± 0.0
HisTyr: 1.526 ± 0.857
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.102 ± 0.79
IleCys: 0.0 ± 0.0
IleAsp: 5.339 ± 0.187
IleGlu: 3.051 ± 1.098
IlePhe: 2.288 ± 0.121
IleGly: 4.577 ± 0.241
IleHis: 1.526 ± 0.857
IleIle: 3.814 ± 2.142
IleLys: 2.288 ± 0.121
IleLeu: 4.577 ± 1.164
IleMet: 2.288 ± 1.285
IleAsn: 2.288 ± 1.285
IlePro: 1.526 ± 0.549
IleGln: 2.288 ± 1.285
IleArg: 2.288 ± 0.121
IleSer: 3.051 ± 1.098
IleThr: 5.339 ± 0.187
IleVal: 2.288 ± 0.121
IleTrp: 0.0 ± 0.0
IleTyr: 2.288 ± 2.932
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.102 ± 2.021
LysCys: 0.763 ± 0.428
LysAsp: 3.814 ± 0.736
LysGlu: 1.526 ± 0.857
LysPhe: 3.814 ± 2.142
LysGly: 2.288 ± 0.121
LysHis: 1.526 ± 0.857
LysIle: 4.577 ± 1.164
LysLys: 4.577 ± 1.647
LysLeu: 8.391 ± 1.9
LysMet: 3.814 ± 2.142
LysAsn: 7.628 ± 0.066
LysPro: 3.814 ± 2.142
LysGln: 1.526 ± 1.955
LysArg: 2.288 ± 1.285
LysSer: 7.628 ± 0.066
LysThr: 4.577 ± 1.164
LysVal: 2.288 ± 0.121
LysTrp: 0.0 ± 0.0
LysTyr: 4.577 ± 2.57
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.577 ± 1.164
LeuCys: 0.0 ± 0.0
LeuAsp: 3.814 ± 2.142
LeuGlu: 3.814 ± 0.67
LeuPhe: 1.526 ± 0.549
LeuGly: 9.916 ± 0.054
LeuHis: 3.051 ± 0.308
LeuIle: 2.288 ± 0.121
LeuLys: 8.391 ± 3.306
LeuLeu: 6.102 ± 0.79
LeuMet: 1.526 ± 0.549
LeuAsn: 5.339 ± 0.187
LeuPro: 3.814 ± 0.67
LeuGln: 3.814 ± 2.075
LeuArg: 2.288 ± 0.121
LeuSer: 5.339 ± 1.593
LeuThr: 6.865 ± 3.855
LeuVal: 5.339 ± 1.219
LeuTrp: 0.0 ± 0.0
LeuTyr: 2.288 ± 1.285
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.526 ± 0.857
MetCys: 0.0 ± 0.0
MetAsp: 0.0 ± 0.0
MetGlu: 3.814 ± 2.142
MetPhe: 1.526 ± 0.549
MetGly: 3.051 ± 1.098
MetHis: 0.763 ± 0.428
MetIle: 0.0 ± 0.0
MetLys: 2.288 ± 1.285
MetLeu: 3.051 ± 1.713
MetMet: 2.288 ± 1.075
MetAsn: 0.0 ± 0.0
MetPro: 1.526 ± 0.857
MetGln: 0.0 ± 0.0
MetArg: 0.0 ± 0.0
MetSer: 1.526 ± 0.549
MetThr: 0.0 ± 0.0
MetVal: 2.288 ± 0.121
MetTrp: 0.0 ± 0.0
MetTyr: 1.526 ± 1.955
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.577 ± 4.458
AsnCys: 0.0 ± 0.0
AsnAsp: 1.526 ± 0.857
AsnGlu: 3.814 ± 0.736
AsnPhe: 3.051 ± 1.713
AsnGly: 3.814 ± 0.67
AsnHis: 0.0 ± 0.0
AsnIle: 3.814 ± 2.142
AsnLys: 6.865 ± 1.044
AsnLeu: 6.865 ± 1.044
AsnMet: 1.526 ± 0.549
AsnAsn: 3.814 ± 3.481
AsnPro: 0.0 ± 0.0
AsnGln: 3.051 ± 1.098
AsnArg: 4.577 ± 0.241
AsnSer: 3.814 ± 0.67
AsnThr: 3.814 ± 2.075
AsnVal: 4.577 ± 1.164
AsnTrp: 0.763 ± 0.428
AsnTyr: 1.526 ± 0.857
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.288 ± 0.121
ProCys: 0.0 ± 0.0
ProAsp: 1.526 ± 0.549
ProGlu: 2.288 ± 1.526
ProPhe: 3.814 ± 3.481
ProGly: 2.288 ± 1.285
ProHis: 1.526 ± 0.857
ProIle: 1.526 ± 0.549
ProLys: 0.763 ± 0.428
ProLeu: 0.763 ± 0.428
ProMet: 0.0 ± 0.0
ProAsn: 1.526 ± 0.549
ProPro: 1.526 ± 0.857
ProGln: 0.763 ± 0.977
ProArg: 0.763 ± 0.428
ProSer: 5.339 ± 2.624
ProThr: 0.763 ± 0.977
ProVal: 6.865 ± 1.767
ProTrp: 0.0 ± 0.0
ProTyr: 1.526 ± 0.549
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.526 ± 0.549
GlnCys: 0.0 ± 0.0
GlnAsp: 0.0 ± 0.0
GlnGlu: 2.288 ± 1.285
GlnPhe: 2.288 ± 0.121
GlnGly: 4.577 ± 0.241
GlnHis: 0.763 ± 0.428
GlnIle: 2.288 ± 2.932
GlnLys: 1.526 ± 0.549
GlnLeu: 2.288 ± 1.526
GlnMet: 0.763 ± 0.977
GlnAsn: 2.288 ± 0.121
GlnPro: 0.763 ± 0.977
GlnGln: 0.763 ± 0.428
GlnArg: 1.526 ± 0.549
GlnSer: 1.526 ± 0.549
GlnThr: 1.526 ± 0.549
GlnVal: 3.814 ± 2.075
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.526 ± 0.857
ArgCys: 0.0 ± 0.0
ArgAsp: 0.0 ± 0.0
ArgGlu: 0.763 ± 0.428
ArgPhe: 0.763 ± 0.977
ArgGly: 3.051 ± 1.713
ArgHis: 2.288 ± 1.285
ArgIle: 1.526 ± 0.549
ArgLys: 5.339 ± 1.593
ArgLeu: 2.288 ± 0.121
ArgMet: 1.526 ± 0.549
ArgAsn: 1.526 ± 1.955
ArgPro: 0.0 ± 0.0
ArgGln: 2.288 ± 0.121
ArgArg: 0.0 ± 0.0
ArgSer: 0.763 ± 0.977
ArgThr: 0.763 ± 0.428
ArgVal: 5.339 ± 0.187
ArgTrp: 0.763 ± 0.428
ArgTyr: 1.526 ± 0.857
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.102 ± 3.601
SerCys: 1.526 ± 0.857
SerAsp: 3.814 ± 2.142
SerGlu: 3.051 ± 0.308
SerPhe: 3.051 ± 1.098
SerGly: 6.865 ± 3.173
SerHis: 3.051 ± 0.308
SerIle: 4.577 ± 1.647
SerLys: 5.339 ± 0.187
SerLeu: 5.339 ± 1.593
SerMet: 3.814 ± 2.142
SerAsn: 2.288 ± 1.526
SerPro: 2.288 ± 2.932
SerGln: 2.288 ± 0.121
SerArg: 1.526 ± 0.857
SerSer: 10.679 ± 1.78
SerThr: 8.391 ± 3.722
SerVal: 5.339 ± 1.593
SerTrp: 0.763 ± 0.977
SerTyr: 5.339 ± 2.998
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.288 ± 2.932
ThrCys: 1.526 ± 0.857
ThrAsp: 3.814 ± 0.67
ThrGlu: 3.051 ± 1.713
ThrPhe: 1.526 ± 0.549
ThrGly: 4.577 ± 3.052
ThrHis: 2.288 ± 0.121
ThrIle: 3.051 ± 2.504
ThrLys: 5.339 ± 2.998
ThrLeu: 6.102 ± 2.021
ThrMet: 0.0 ± 0.0
ThrAsn: 4.577 ± 0.241
ThrPro: 5.339 ± 2.624
ThrGln: 2.288 ± 0.121
ThrArg: 4.577 ± 1.647
ThrSer: 8.391 ± 2.316
ThrThr: 7.628 ± 2.745
ThrVal: 7.628 ± 4.15
ThrTrp: 0.0 ± 0.0
ThrTyr: 1.526 ± 0.549
ThrXaa: 0.0 ± 0.0
Val
ValAla: 9.153 ± 3.734
ValCys: 0.763 ± 0.428
ValAsp: 2.288 ± 0.121
ValGlu: 3.814 ± 2.142
ValPhe: 4.577 ± 1.647
ValGly: 4.577 ± 3.052
ValHis: 3.051 ± 1.713
ValIle: 3.814 ± 0.736
ValLys: 4.577 ± 1.164
ValLeu: 3.051 ± 0.308
ValMet: 1.526 ± 0.857
ValAsn: 8.391 ± 0.911
ValPro: 5.339 ± 4.03
ValGln: 2.288 ± 1.526
ValArg: 1.526 ± 0.549
ValSer: 9.153 ± 0.482
ValThr: 6.102 ± 2.196
ValVal: 3.814 ± 0.67
ValTrp: 1.526 ± 0.857
ValTyr: 0.763 ± 0.428
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.0 ± 0.0
TrpPhe: 2.288 ± 1.285
TrpGly: 1.526 ± 0.549
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.763 ± 0.428
TrpLeu: 0.0 ± 0.0
TrpMet: 0.0 ± 0.0
TrpAsn: 0.763 ± 0.428
TrpPro: 0.0 ± 0.0
TrpGln: 0.763 ± 0.977
TrpArg: 0.0 ± 0.0
TrpSer: 1.526 ± 0.857
TrpThr: 0.763 ± 0.428
TrpVal: 0.763 ± 0.428
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.763 ± 0.428
TyrCys: 0.763 ± 0.428
TyrAsp: 0.763 ± 0.428
TyrGlu: 2.288 ± 1.285
TyrPhe: 2.288 ± 1.526
TyrGly: 3.051 ± 1.098
TyrHis: 2.288 ± 0.121
TyrIle: 3.814 ± 2.142
TyrLys: 3.051 ± 0.308
TyrLeu: 2.288 ± 1.285
TyrMet: 0.0 ± 0.0
TyrAsn: 0.763 ± 0.428
TyrPro: 0.0 ± 0.0
TyrGln: 1.526 ± 0.549
TyrArg: 0.763 ± 0.977
TyrSer: 1.526 ± 0.857
TyrThr: 4.577 ± 0.241
TyrVal: 6.102 ± 3.427
TyrTrp: 0.0 ± 0.0
TyrTyr: 2.288 ± 1.285
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1312 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
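A protein-level bootstrap of this kind could look roughly like the sketch below. It is an illustration under stated assumptions, not the Proteome-pI code: the function names are hypothetical, whole proteins are resampled with replacement, and the reported error is assumed to be the standard deviation across the 100 replicates (the page does not say which spread statistic is used).

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins):
    """Per mille frequency of each adjacent residue pair (one-letter codes)."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {p: 1000.0 * n / total for p, n in counts.items()} if total else {}


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Spread of one dipeptide frequency over protein-level resamples.

    Assumption: the error is the population standard deviation of the
    frequency across replicates.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    samples = []
    for _ in range(replicates):
        # Draw as many proteins as in the proteome, with replacement.
        resampled = rng.choices(proteins, k=len(proteins))
        samples.append(pair_permille(resampled).get(pair, 0.0))
    return statistics.pstdev(samples)


# Toy proteome of two hypothetical sequences.
toy = ["MKVLAAGK", "AAGKSSAA"]
err = bootstrap_error(toy, "AA")
```

With only 2 proteins, a replicate can contain just three distinct combinations (the first protein twice, the second twice, or one of each), so only a few distinct error values can arise.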

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski