Amino acid dipepetide frequency for Hubei tombus-like virus 28

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.002AlaAla: 6.002 ± 3.659
1.2AlaCys: 1.2 ± 0.732
2.401AlaAsp: 2.401 ± 0.572
4.802AlaGlu: 4.802 ± 1.144
3.601AlaPhe: 3.601 ± 0.16
0.0AlaGly: 0.0 ± 0.0
3.601AlaHis: 3.601 ± 0.16
3.601AlaIle: 3.601 ± 0.16
3.601AlaLys: 3.601 ± 0.16
2.401AlaLeu: 2.401 ± 2.607
3.601AlaMet: 3.601 ± 1.875
1.2AlaAsn: 1.2 ± 1.304
1.2AlaPro: 1.2 ± 0.732
2.401AlaGln: 2.401 ± 2.607
4.802AlaArg: 4.802 ± 0.892
7.203AlaSer: 7.203 ± 3.751
1.2AlaThr: 1.2 ± 0.732
0.0AlaVal: 0.0 ± 0.0
0.0AlaTrp: 0.0 ± 0.0
1.2AlaTyr: 1.2 ± 1.304
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
2.401CysGlu: 2.401 ± 0.572
0.0CysPhe: 0.0 ± 0.0
1.2CysGly: 1.2 ± 0.732
0.0CysHis: 0.0 ± 0.0
1.2CysIle: 1.2 ± 0.732
0.0CysLys: 0.0 ± 0.0
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
2.401CysGln: 2.401 ± 1.464
1.2CysArg: 1.2 ± 0.732
0.0CysSer: 0.0 ± 0.0
1.2CysThr: 1.2 ± 1.304
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
1.2CysTyr: 1.2 ± 0.732
0.0CysXaa: 0.0 ± 0.0
Asp
3.601AspAla: 3.601 ± 1.875
0.0AspCys: 0.0 ± 0.0
4.802AspAsp: 4.802 ± 0.892
6.002AspGlu: 6.002 ± 0.412
1.2AspPhe: 1.2 ± 0.732
4.802AspGly: 4.802 ± 1.144
6.002AspHis: 6.002 ± 1.624
0.0AspIle: 0.0 ± 0.0
2.401AspLys: 2.401 ± 0.572
1.2AspLeu: 1.2 ± 0.732
3.601AspMet: 3.601 ± 0.16
0.0AspAsn: 0.0 ± 0.0
0.0AspPro: 0.0 ± 0.0
2.401AspGln: 2.401 ± 0.572
2.401AspArg: 2.401 ± 0.572
1.2AspSer: 1.2 ± 0.732
3.601AspThr: 3.601 ± 0.16
2.401AspVal: 2.401 ± 1.464
0.0AspTrp: 0.0 ± 0.0
4.802AspTyr: 4.802 ± 1.144
0.0AspXaa: 0.0 ± 0.0
Glu
3.601GluAla: 3.601 ± 1.875
1.2GluCys: 1.2 ± 0.732
2.401GluAsp: 2.401 ± 0.572
1.2GluGlu: 1.2 ± 1.304
2.401GluPhe: 2.401 ± 1.464
6.002GluGly: 6.002 ± 0.412
0.0GluHis: 0.0 ± 0.0
6.002GluIle: 6.002 ± 1.624
2.401GluLys: 2.401 ± 1.464
16.807GluLeu: 16.807 ± 1.967
0.0GluMet: 0.0 ± 0.0
4.802GluAsn: 4.802 ± 1.144
1.2GluPro: 1.2 ± 0.732
4.802GluGln: 4.802 ± 2.927
1.2GluArg: 1.2 ± 0.732
4.802GluSer: 4.802 ± 1.144
1.2GluThr: 1.2 ± 0.732
1.2GluVal: 1.2 ± 1.304
1.2GluTrp: 1.2 ± 1.304
2.401GluTyr: 2.401 ± 1.464
0.0GluXaa: 0.0 ± 0.0
Phe
0.0PheAla: 0.0 ± 0.0
1.2PheCys: 1.2 ± 0.732
1.2PheAsp: 1.2 ± 0.732
1.2PheGlu: 1.2 ± 1.304
0.0PhePhe: 0.0 ± 0.0
6.002PheGly: 6.002 ± 1.624
0.0PheHis: 0.0 ± 0.0
6.002PheIle: 6.002 ± 3.659
2.401PheLys: 2.401 ± 1.464
2.401PheLeu: 2.401 ± 1.464
0.0PheMet: 0.0 ± 0.771
1.2PheAsn: 1.2 ± 0.732
1.2PhePro: 1.2 ± 0.732
0.0PheGln: 0.0 ± 0.0
1.2PheArg: 1.2 ± 0.732
1.2PheSer: 1.2 ± 0.732
2.401PheThr: 2.401 ± 1.464
2.401PheVal: 2.401 ± 1.464
1.2PheTrp: 1.2 ± 1.304
3.601PheTyr: 3.601 ± 0.16
0.0PheXaa: 0.0 ± 0.0
Gly
0.0GlyAla: 0.0 ± 0.0
1.2GlyCys: 1.2 ± 1.304
4.802GlyAsp: 4.802 ± 2.927
2.401GlyGlu: 2.401 ± 0.572
0.0GlyPhe: 0.0 ± 0.0
2.401GlyGly: 2.401 ± 2.607
1.2GlyHis: 1.2 ± 1.304
2.401GlyIle: 2.401 ± 0.572
6.002GlyLys: 6.002 ± 1.624
4.802GlyLeu: 4.802 ± 0.892
4.802GlyMet: 4.802 ± 1.144
2.401GlyAsn: 2.401 ± 0.572
0.0GlyPro: 0.0 ± 0.0
1.2GlyGln: 1.2 ± 1.304
4.802GlyArg: 4.802 ± 0.892
7.203GlySer: 7.203 ± 0.32
0.0GlyThr: 0.0 ± 0.0
0.0GlyVal: 0.0 ± 0.0
2.401GlyTrp: 2.401 ± 1.464
2.401GlyTyr: 2.401 ± 0.572
0.0GlyXaa: 0.0 ± 0.0
His
1.2HisAla: 1.2 ± 1.304
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
1.2HisGlu: 1.2 ± 1.304
1.2HisPhe: 1.2 ± 0.732
3.601HisGly: 3.601 ± 2.196
1.2HisHis: 1.2 ± 0.732
4.802HisIle: 4.802 ± 0.892
2.401HisLys: 2.401 ± 0.572
2.401HisLeu: 2.401 ± 0.572
1.2HisMet: 1.2 ± 0.542
2.401HisAsn: 2.401 ± 1.464
0.0HisPro: 0.0 ± 0.0
0.0HisGln: 0.0 ± 0.0
2.401HisArg: 2.401 ± 1.464
1.2HisSer: 1.2 ± 0.732
3.601HisThr: 3.601 ± 2.196
1.2HisVal: 1.2 ± 1.304
0.0HisTrp: 0.0 ± 0.0
1.2HisTyr: 1.2 ± 0.732
0.0HisXaa: 0.0 ± 0.0
Ile
9.604IleAla: 9.604 ± 0.252
0.0IleCys: 0.0 ± 0.0
4.802IleAsp: 4.802 ± 3.179
4.802IleGlu: 4.802 ± 0.892
1.2IlePhe: 1.2 ± 0.732
4.802IleGly: 4.802 ± 0.892
3.601IleHis: 3.601 ± 0.16
4.802IleIle: 4.802 ± 0.892
8.403IleLys: 8.403 ± 1.052
4.802IleLeu: 4.802 ± 0.892
1.2IleMet: 1.2 ± 1.304
4.802IleAsn: 4.802 ± 2.927
0.0IlePro: 0.0 ± 0.0
1.2IleGln: 1.2 ± 0.732
3.601IleArg: 3.601 ± 0.16
4.802IleSer: 4.802 ± 1.144
8.403IleThr: 8.403 ± 0.984
2.401IleVal: 2.401 ± 0.572
2.401IleTrp: 2.401 ± 0.572
6.002IleTyr: 6.002 ± 1.624
0.0IleXaa: 0.0 ± 0.0
Lys
3.601LysAla: 3.601 ± 1.875
1.2LysCys: 1.2 ± 1.304
3.601LysAsp: 3.601 ± 1.875
4.802LysGlu: 4.802 ± 0.892
4.802LysPhe: 4.802 ± 0.892
3.601LysGly: 3.601 ± 0.16
6.002LysHis: 6.002 ± 3.659
6.002LysIle: 6.002 ± 2.447
2.401LysLys: 2.401 ± 0.572
7.203LysLeu: 7.203 ± 2.356
3.601LysMet: 3.601 ± 2.196
2.401LysAsn: 2.401 ± 0.572
6.002LysPro: 6.002 ± 3.659
2.401LysGln: 2.401 ± 0.572
6.002LysArg: 6.002 ± 1.624
0.0LysSer: 0.0 ± 0.0
4.802LysThr: 4.802 ± 2.927
3.601LysVal: 3.601 ± 3.911
2.401LysTrp: 2.401 ± 0.572
4.802LysTyr: 4.802 ± 1.144
0.0LysXaa: 0.0 ± 0.0
Leu
4.802LeuAla: 4.802 ± 3.179
1.2LeuCys: 1.2 ± 0.732
4.802LeuAsp: 4.802 ± 1.144
9.604LeuGlu: 9.604 ± 1.784
2.401LeuPhe: 2.401 ± 1.464
4.802LeuGly: 4.802 ± 1.144
0.0LeuHis: 0.0 ± 0.0
3.601LeuIle: 3.601 ± 0.16
8.403LeuLys: 8.403 ± 5.055
9.604LeuLeu: 9.604 ± 2.287
4.802LeuMet: 4.802 ± 1.144
1.2LeuAsn: 1.2 ± 1.304
4.802LeuPro: 4.802 ± 3.179
0.0LeuGln: 0.0 ± 0.0
2.401LeuArg: 2.401 ± 1.464
9.604LeuSer: 9.604 ± 1.784
3.601LeuThr: 3.601 ± 0.16
3.601LeuVal: 3.601 ± 0.16
2.401LeuTrp: 2.401 ± 2.607
4.802LeuTyr: 4.802 ± 2.927
0.0LeuXaa: 0.0 ± 0.0
Met
2.401MetAla: 2.401 ± 1.464
0.0MetCys: 0.0 ± 0.0
2.401MetAsp: 2.401 ± 0.572
2.401MetGlu: 2.401 ± 1.464
0.0MetPhe: 0.0 ± 0.0
0.0MetGly: 0.0 ± 0.0
0.0MetHis: 0.0 ± 0.0
7.203MetIle: 7.203 ± 1.715
6.002MetLys: 6.002 ± 2.447
1.2MetLeu: 1.2 ± 1.304
1.2MetMet: 1.2 ± 0.732
3.601MetAsn: 3.601 ± 0.16
2.401MetPro: 2.401 ± 0.572
1.2MetGln: 1.2 ± 0.732
0.0MetArg: 0.0 ± 0.0
4.802MetSer: 4.802 ± 0.892
2.401MetThr: 2.401 ± 0.572
2.401MetVal: 2.401 ± 1.464
1.2MetTrp: 1.2 ± 0.732
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.601AsnAla: 3.601 ± 0.16
1.2AsnCys: 1.2 ± 0.732
1.2AsnAsp: 1.2 ± 0.732
1.2AsnGlu: 1.2 ± 1.304
2.401AsnPhe: 2.401 ± 0.572
2.401AsnGly: 2.401 ± 1.464
1.2AsnHis: 1.2 ± 0.732
7.203AsnIle: 7.203 ± 0.32
4.802AsnLys: 4.802 ± 0.892
4.802AsnLeu: 4.802 ± 3.179
1.2AsnMet: 1.2 ± 0.732
3.601AsnAsn: 3.601 ± 0.16
4.802AsnPro: 4.802 ± 1.144
1.2AsnGln: 1.2 ± 1.304
6.002AsnArg: 6.002 ± 1.624
2.401AsnSer: 2.401 ± 0.572
2.401AsnThr: 2.401 ± 0.572
0.0AsnVal: 0.0 ± 0.0
0.0AsnTrp: 0.0 ± 0.0
3.601AsnTyr: 3.601 ± 1.875
0.0AsnXaa: 0.0 ± 0.0
Pro
2.401ProAla: 2.401 ± 0.572
0.0ProCys: 0.0 ± 0.0
2.401ProAsp: 2.401 ± 1.464
4.802ProGlu: 4.802 ± 0.892
1.2ProPhe: 1.2 ± 0.732
0.0ProGly: 0.0 ± 0.0
0.0ProHis: 0.0 ± 0.0
3.601ProIle: 3.601 ± 0.16
0.0ProLys: 0.0 ± 0.0
3.601ProLeu: 3.601 ± 0.16
0.0ProMet: 0.0 ± 0.0
3.601ProAsn: 3.601 ± 1.875
0.0ProPro: 0.0 ± 0.0
0.0ProGln: 0.0 ± 0.0
3.601ProArg: 3.601 ± 0.16
2.401ProSer: 2.401 ± 0.572
3.601ProThr: 3.601 ± 1.875
1.2ProVal: 1.2 ± 0.732
1.2ProTrp: 1.2 ± 1.304
2.401ProTyr: 2.401 ± 1.464
0.0ProXaa: 0.0 ± 0.0
Gln
2.401GlnAla: 2.401 ± 0.572
0.0GlnCys: 0.0 ± 0.0
1.2GlnAsp: 1.2 ± 0.732
0.0GlnGlu: 0.0 ± 0.0
2.401GlnPhe: 2.401 ± 1.464
0.0GlnGly: 0.0 ± 0.0
1.2GlnHis: 1.2 ± 1.304
0.0GlnIle: 0.0 ± 0.0
2.401GlnLys: 2.401 ± 0.572
3.601GlnLeu: 3.601 ± 0.16
1.2GlnMet: 1.2 ± 0.732
2.401GlnAsn: 2.401 ± 2.607
3.601GlnPro: 3.601 ± 1.875
3.601GlnGln: 3.601 ± 1.875
3.601GlnArg: 3.601 ± 1.875
1.2GlnSer: 1.2 ± 1.304
2.401GlnThr: 2.401 ± 1.464
0.0GlnVal: 0.0 ± 0.0
1.2GlnTrp: 1.2 ± 0.732
2.401GlnTyr: 2.401 ± 1.464
0.0GlnXaa: 0.0 ± 0.0
Arg
2.401ArgAla: 2.401 ± 0.572
1.2ArgCys: 1.2 ± 0.732
2.401ArgAsp: 2.401 ± 0.572
1.2ArgGlu: 1.2 ± 0.732
2.401ArgPhe: 2.401 ± 1.464
0.0ArgGly: 0.0 ± 0.0
3.601ArgHis: 3.601 ± 0.16
6.002ArgIle: 6.002 ± 1.624
4.802ArgLys: 4.802 ± 0.892
8.403ArgLeu: 8.403 ± 5.055
3.601ArgMet: 3.601 ± 0.16
4.802ArgAsn: 4.802 ± 2.927
0.0ArgPro: 0.0 ± 0.0
1.2ArgGln: 1.2 ± 0.732
3.601ArgArg: 3.601 ± 2.196
1.2ArgSer: 1.2 ± 0.732
4.802ArgThr: 4.802 ± 0.892
0.0ArgVal: 0.0 ± 0.0
0.0ArgTrp: 0.0 ± 0.0
3.601ArgTyr: 3.601 ± 2.196
0.0ArgXaa: 0.0 ± 0.0
Ser
2.401SerAla: 2.401 ± 0.572
0.0SerCys: 0.0 ± 0.0
3.601SerAsp: 3.601 ± 0.16
4.802SerGlu: 4.802 ± 0.892
2.401SerPhe: 2.401 ± 0.572
3.601SerGly: 3.601 ± 1.875
1.2SerHis: 1.2 ± 0.732
4.802SerIle: 4.802 ± 0.892
6.002SerLys: 6.002 ± 0.412
4.802SerLeu: 4.802 ± 0.892
3.601SerMet: 3.601 ± 0.16
1.2SerAsn: 1.2 ± 1.304
1.2SerPro: 1.2 ± 0.732
1.2SerGln: 1.2 ± 1.304
3.601SerArg: 3.601 ± 0.16
1.2SerSer: 1.2 ± 0.732
4.802SerThr: 4.802 ± 1.144
2.401SerVal: 2.401 ± 2.607
0.0SerTrp: 0.0 ± 0.0
2.401SerTyr: 2.401 ± 0.572
0.0SerXaa: 0.0 ± 0.0
Thr
2.401ThrAla: 2.401 ± 1.464
1.2ThrCys: 1.2 ± 0.732
2.401ThrAsp: 2.401 ± 0.572
3.601ThrGlu: 3.601 ± 0.16
2.401ThrPhe: 2.401 ± 1.464
3.601ThrGly: 3.601 ± 1.875
2.401ThrHis: 2.401 ± 1.464
3.601ThrIle: 3.601 ± 0.16
10.804ThrLys: 10.804 ± 4.551
2.401ThrLeu: 2.401 ± 0.572
3.601ThrMet: 3.601 ± 0.16
3.601ThrAsn: 3.601 ± 1.875
3.601ThrPro: 3.601 ± 0.16
3.601ThrGln: 3.601 ± 1.875
2.401ThrArg: 2.401 ± 0.572
2.401ThrSer: 2.401 ± 0.572
9.604ThrThr: 9.604 ± 1.784
4.802ThrVal: 4.802 ± 0.892
1.2ThrTrp: 1.2 ± 1.304
1.2ThrTyr: 1.2 ± 1.304
0.0ThrXaa: 0.0 ± 0.0
Val
1.2ValAla: 1.2 ± 1.304
0.0ValCys: 0.0 ± 0.0
3.601ValAsp: 3.601 ± 0.16
4.802ValGlu: 4.802 ± 0.892
0.0ValPhe: 0.0 ± 0.0
1.2ValGly: 1.2 ± 0.732
0.0ValHis: 0.0 ± 0.0
1.2ValIle: 1.2 ± 1.304
0.0ValLys: 0.0 ± 0.0
3.601ValLeu: 3.601 ± 0.16
1.2ValMet: 1.2 ± 0.732
3.601ValAsn: 3.601 ± 2.196
2.401ValPro: 2.401 ± 0.572
1.2ValGln: 1.2 ± 0.732
1.2ValArg: 1.2 ± 1.304
2.401ValSer: 2.401 ± 2.607
3.601ValThr: 3.601 ± 1.875
2.401ValVal: 2.401 ± 1.464
0.0ValTrp: 0.0 ± 0.0
1.2ValTyr: 1.2 ± 1.304
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
4.802TrpIle: 4.802 ± 1.144
1.2TrpLys: 1.2 ± 1.304
0.0TrpLeu: 0.0 ± 0.0
1.2TrpMet: 1.2 ± 0.732
2.401TrpAsn: 2.401 ± 2.607
1.2TrpPro: 1.2 ± 0.732
2.401TrpGln: 2.401 ± 0.572
0.0TrpArg: 0.0 ± 0.0
0.0TrpSer: 0.0 ± 0.0
1.2TrpThr: 1.2 ± 0.732
1.2TrpVal: 1.2 ± 1.304
1.2TrpTrp: 1.2 ± 1.304
2.401TrpTyr: 2.401 ± 0.572
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.401TyrAla: 2.401 ± 0.572
0.0TyrCys: 0.0 ± 0.0
2.401TyrAsp: 2.401 ± 1.464
3.601TyrGlu: 3.601 ± 0.16
6.002TyrPhe: 6.002 ± 1.624
2.401TyrGly: 2.401 ± 0.572
0.0TyrHis: 0.0 ± 0.0
4.802TyrIle: 4.802 ± 1.144
4.802TyrLys: 4.802 ± 2.927
2.401TyrLeu: 2.401 ± 0.572
0.0TyrMet: 0.0 ± 0.0
6.002TyrAsn: 6.002 ± 1.624
2.401TyrPro: 2.401 ± 0.572
2.401TyrGln: 2.401 ± 0.572
1.2TyrArg: 1.2 ± 0.732
0.0TyrSer: 0.0 ± 0.0
6.002TyrThr: 6.002 ± 2.447
3.601TyrVal: 3.601 ± 0.16
1.2TyrTrp: 1.2 ± 0.732
2.401TyrTyr: 2.401 ± 0.572
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (834 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski