Amino acid dipeptide frequency for Wuhan insect virus 22

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 standard dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
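
As an illustration of the quantities described above, here is a minimal Python sketch that counts overlapping residue pairs and reports them in per mille. It is not the Proteome-pI implementation: the function name dipeptide_permille, the one-letter codes (the table uses three-letter codes), the choice to count pairs within each protein separately, and the toy sequences are all assumptions made for this example.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code; any other symbol is
# mapped to "X" (Xaa). This mapping is an assumption of the sketch.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs within one protein: positions (0,1), (1,2), ...
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    alphabet = STANDARD + "X"
    # Report all 21 x 21 = 441 combinations, including those never observed.
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(alphabet, repeat=2)
    }


# Made-up toy sequences, not the proteome described on this page.
toy = ["MAAL", "ALPX"]
freqs = dipeptide_permille(toy)

# Uniform expectation for the 400 standard dipeptides, in per mille.
expected_uniform = 1000.0 / 400
```

Multiplying a per mille value by 10⁻³ gives the fraction, so the uniform expectation of 2.5‰ corresponds to 0.0025, that is 0.25%.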

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
6.598AlaAla: 6.598 ± 0.384
0.0AlaCys: 0.0 ± 0.0
0.0AlaAsp: 0.0 ± 0.0
2.828AlaGlu: 2.828 ± 0.743
3.77AlaPhe: 3.77 ± 0.144
3.77AlaGly: 3.77 ± 1.415
0.0AlaHis: 0.0 ± 0.0
3.77AlaIle: 3.77 ± 1.127
5.655AlaLys: 5.655 ± 1.055
6.598AlaLeu: 6.598 ± 1.655
2.828AlaMet: 2.828 ± 0.528
3.77AlaAsn: 3.77 ± 1.127
4.713AlaPro: 4.713 ± 0.456
1.885AlaGln: 1.885 ± 1.199
1.885AlaArg: 1.885 ± 1.199
6.598AlaSer: 6.598 ± 0.887
3.77AlaThr: 3.77 ± 2.686
5.655AlaVal: 5.655 ± 1.055
1.885AlaTrp: 1.885 ± 0.072
5.655AlaTyr: 5.655 ± 0.216
0.0AlaXaa: 0.0 ± 0.0
Cys
1.885CysAla: 1.885 ± 1.199
0.0CysCys: 0.0 ± 0.0
1.885CysAsp: 1.885 ± 0.072
0.0CysGlu: 0.0 ± 0.0
0.943CysPhe: 0.943 ± 0.6
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.0CysLys: 0.0 ± 0.0
0.943CysLeu: 0.943 ± 0.671
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
0.943CysGln: 0.943 ± 0.671
0.0CysArg: 0.0 ± 0.0
2.828CysSer: 2.828 ± 0.528
0.0CysThr: 0.0 ± 0.0
0.943CysVal: 0.943 ± 0.6
0.0CysTrp: 0.0 ± 0.0
1.885CysTyr: 1.885 ± 1.343
0.0CysXaa: 0.0 ± 0.0
Asp
2.828AspAla: 2.828 ± 0.743
1.885AspCys: 1.885 ± 1.199
2.828AspAsp: 2.828 ± 0.528
1.885AspGlu: 1.885 ± 0.072
1.885AspPhe: 1.885 ± 0.072
4.713AspGly: 4.713 ± 0.815
0.0AspHis: 0.0 ± 0.0
3.77AspIle: 3.77 ± 1.127
2.828AspLys: 2.828 ± 0.743
2.828AspLeu: 2.828 ± 1.799
1.885AspMet: 1.885 ± 1.343
1.885AspAsn: 1.885 ± 0.072
1.885AspPro: 1.885 ± 1.343
1.885AspGln: 1.885 ± 0.072
3.77AspArg: 3.77 ± 0.144
0.943AspSer: 0.943 ± 0.671
4.713AspThr: 4.713 ± 0.456
7.54AspVal: 7.54 ± 0.984
4.713AspTrp: 4.713 ± 0.815
0.0AspTyr: 0.0 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
1.885GluAla: 1.885 ± 0.072
0.943GluCys: 0.943 ± 0.671
3.77GluAsp: 3.77 ± 1.415
5.655GluGlu: 5.655 ± 2.326
1.885GluPhe: 1.885 ± 0.072
2.828GluGly: 2.828 ± 0.528
0.0GluHis: 0.0 ± 0.0
1.885GluIle: 1.885 ± 1.199
2.828GluLys: 2.828 ± 0.743
6.598GluLeu: 6.598 ± 0.384
2.828GluMet: 2.828 ± 0.743
2.828GluAsn: 2.828 ± 0.528
1.885GluPro: 1.885 ± 0.072
0.943GluGln: 0.943 ± 0.6
5.655GluArg: 5.655 ± 1.487
2.828GluSer: 2.828 ± 1.799
0.943GluThr: 0.943 ± 0.671
2.828GluVal: 2.828 ± 0.743
0.0GluTrp: 0.0 ± 0.0
2.828GluTyr: 2.828 ± 0.528
0.0GluXaa: 0.0 ± 0.0
Phe
4.713PheAla: 4.713 ± 0.456
0.943PheCys: 0.943 ± 0.671
2.828PheAsp: 2.828 ± 0.528
1.885PheGlu: 1.885 ± 1.199
0.0PhePhe: 0.0 ± 0.0
4.713PheGly: 4.713 ± 0.456
1.885PheHis: 1.885 ± 1.343
1.885PheIle: 1.885 ± 1.199
1.885PheLys: 1.885 ± 1.199
3.77PheLeu: 3.77 ± 0.144
0.0PheMet: 0.0 ± 0.0
4.713PheAsn: 4.713 ± 0.456
4.713PhePro: 4.713 ± 2.086
0.0PheGln: 0.0 ± 0.0
4.713PheArg: 4.713 ± 0.815
3.77PheSer: 3.77 ± 0.144
1.885PheThr: 1.885 ± 1.199
1.885PheVal: 1.885 ± 1.199
0.0PheTrp: 0.0 ± 0.0
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
2.828GlyAla: 2.828 ± 0.528
1.885GlyCys: 1.885 ± 1.199
6.598GlyAsp: 6.598 ± 0.887
1.885GlyGlu: 1.885 ± 1.199
1.885GlyPhe: 1.885 ± 1.199
0.943GlyGly: 0.943 ± 0.6
0.0GlyHis: 0.0 ± 0.0
0.0GlyIle: 0.0 ± 0.0
1.885GlyLys: 1.885 ± 1.199
8.483GlyLeu: 8.483 ± 0.312
3.77GlyMet: 3.77 ± 2.398
0.943GlyAsn: 0.943 ± 0.671
1.885GlyPro: 1.885 ± 1.343
3.77GlyGln: 3.77 ± 0.144
1.885GlyArg: 1.885 ± 0.072
4.713GlySer: 4.713 ± 0.815
1.885GlyThr: 1.885 ± 0.072
5.655GlyVal: 5.655 ± 1.055
1.885GlyTrp: 1.885 ± 0.072
2.828GlyTyr: 2.828 ± 0.743
0.0GlyXaa: 0.0 ± 0.0
His
1.885HisAla: 1.885 ± 0.072
0.0HisCys: 0.0 ± 0.0
0.943HisAsp: 0.943 ± 0.671
0.0HisGlu: 0.0 ± 0.0
0.943HisPhe: 0.943 ± 0.6
3.77HisGly: 3.77 ± 1.127
0.0HisHis: 0.0 ± 0.0
0.943HisIle: 0.943 ± 0.6
0.0HisLys: 0.0 ± 0.0
0.0HisLeu: 0.0 ± 0.0
0.943HisMet: 0.943 ± 0.6
0.943HisAsn: 0.943 ± 0.671
1.885HisPro: 1.885 ± 0.072
0.943HisGln: 0.943 ± 0.671
0.0HisArg: 0.0 ± 0.0
0.943HisSer: 0.943 ± 0.671
0.943HisThr: 0.943 ± 0.6
0.943HisVal: 0.943 ± 0.6
0.0HisTrp: 0.0 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
5.655IleAla: 5.655 ± 1.055
0.0IleCys: 0.0 ± 0.0
3.77IleAsp: 3.77 ± 1.127
3.77IleGlu: 3.77 ± 2.398
2.828IlePhe: 2.828 ± 1.799
1.885IleGly: 1.885 ± 0.072
0.943IleHis: 0.943 ± 0.6
2.828IleIle: 2.828 ± 0.528
1.885IleLys: 1.885 ± 0.072
0.943IleLeu: 0.943 ± 0.671
0.943IleMet: 0.943 ± 0.962
5.655IleAsn: 5.655 ± 0.216
2.828IlePro: 2.828 ± 0.743
2.828IleGln: 2.828 ± 0.528
0.943IleArg: 0.943 ± 0.6
4.713IleSer: 4.713 ± 0.456
3.77IleThr: 3.77 ± 1.127
3.77IleVal: 3.77 ± 1.415
0.943IleTrp: 0.943 ± 0.6
0.0IleTyr: 0.0 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
0.943LysAla: 0.943 ± 0.6
0.0LysCys: 0.0 ± 0.0
4.713LysAsp: 4.713 ± 0.456
0.943LysGlu: 0.943 ± 0.6
2.828LysPhe: 2.828 ± 0.528
4.713LysGly: 4.713 ± 1.727
0.943LysHis: 0.943 ± 0.6
7.54LysIle: 7.54 ± 0.984
5.655LysLys: 5.655 ± 1.055
2.828LysLeu: 2.828 ± 1.799
0.943LysMet: 0.943 ± 0.671
1.885LysAsn: 1.885 ± 1.343
1.885LysPro: 1.885 ± 1.343
0.0LysGln: 0.0 ± 0.0
3.77LysArg: 3.77 ± 0.144
2.828LysSer: 2.828 ± 0.528
2.828LysThr: 2.828 ± 0.743
3.77LysVal: 3.77 ± 0.144
0.0LysTrp: 0.0 ± 0.0
4.713LysTyr: 4.713 ± 1.727
0.0LysXaa: 0.0 ± 0.0
Leu
3.77LeuAla: 3.77 ± 2.398
1.885LeuCys: 1.885 ± 1.343
2.828LeuAsp: 2.828 ± 0.743
4.713LeuGlu: 4.713 ± 0.815
3.77LeuPhe: 3.77 ± 1.415
2.828LeuGly: 2.828 ± 0.528
2.828LeuHis: 2.828 ± 0.743
6.598LeuIle: 6.598 ± 0.887
4.713LeuLys: 4.713 ± 0.456
5.655LeuLeu: 5.655 ± 1.055
1.885LeuMet: 1.885 ± 0.072
2.828LeuAsn: 2.828 ± 0.528
12.253LeuPro: 12.253 ± 3.981
0.943LeuGln: 0.943 ± 0.671
6.598LeuArg: 6.598 ± 1.655
7.54LeuSer: 7.54 ± 1.559
0.943LeuThr: 0.943 ± 0.6
4.713LeuVal: 4.713 ± 0.815
0.0LeuTrp: 0.0 ± 0.0
2.828LeuTyr: 2.828 ± 0.743
0.0LeuXaa: 0.0 ± 0.0
Met
4.713MetAla: 4.713 ± 0.815
0.943MetCys: 0.943 ± 0.6
0.943MetAsp: 0.943 ± 0.6
2.828MetGlu: 2.828 ± 1.799
0.943MetPhe: 0.943 ± 0.6
1.885MetGly: 1.885 ± 0.072
0.0MetHis: 0.0 ± 0.0
0.943MetIle: 0.943 ± 0.6
0.943MetLys: 0.943 ± 0.6
2.828MetLeu: 2.828 ± 0.743
1.885MetMet: 1.885 ± 1.199
0.943MetAsn: 0.943 ± 0.671
1.885MetPro: 1.885 ± 0.072
0.943MetGln: 0.943 ± 0.6
0.943MetArg: 0.943 ± 0.671
1.885MetSer: 1.885 ± 1.199
6.598MetThr: 6.598 ± 0.887
1.885MetVal: 1.885 ± 0.072
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
4.713AsnAla: 4.713 ± 0.815
0.943AsnCys: 0.943 ± 0.6
0.0AsnAsp: 0.0 ± 0.0
1.885AsnGlu: 1.885 ± 1.199
4.713AsnPhe: 4.713 ± 2.086
4.713AsnGly: 4.713 ± 1.727
2.828AsnHis: 2.828 ± 0.528
4.713AsnIle: 4.713 ± 0.456
4.713AsnLys: 4.713 ± 0.815
1.885AsnLeu: 1.885 ± 1.343
0.943AsnMet: 0.943 ± 0.671
5.655AsnAsn: 5.655 ± 2.758
3.77AsnPro: 3.77 ± 0.144
0.0AsnGln: 0.0 ± 0.0
4.713AsnArg: 4.713 ± 2.086
3.77AsnSer: 3.77 ± 2.686
2.828AsnThr: 2.828 ± 0.528
1.885AsnVal: 1.885 ± 1.343
0.0AsnTrp: 0.0 ± 0.0
3.77AsnTyr: 3.77 ± 1.127
0.0AsnXaa: 0.0 ± 0.0
Pro
3.77ProAla: 3.77 ± 0.144
0.0ProCys: 0.0 ± 0.0
4.713ProAsp: 4.713 ± 0.456
6.598ProGlu: 6.598 ± 2.158
2.828ProPhe: 2.828 ± 0.743
3.77ProGly: 3.77 ± 1.127
0.0ProHis: 0.0 ± 0.0
2.828ProIle: 2.828 ± 0.528
0.943ProLys: 0.943 ± 0.6
7.54ProLeu: 7.54 ± 1.559
1.885ProMet: 1.885 ± 0.072
2.828ProAsn: 2.828 ± 0.743
4.713ProPro: 4.713 ± 2.086
7.54ProGln: 7.54 ± 0.288
2.828ProArg: 2.828 ± 0.528
5.655ProSer: 5.655 ± 1.055
2.828ProThr: 2.828 ± 0.743
4.713ProVal: 4.713 ± 0.815
0.943ProTrp: 0.943 ± 0.6
2.828ProTyr: 2.828 ± 1.799
0.0ProXaa: 0.0 ± 0.0
Gln
4.713GlnAla: 4.713 ± 0.456
0.0GlnCys: 0.0 ± 0.0
0.943GlnAsp: 0.943 ± 0.6
1.885GlnGlu: 1.885 ± 1.199
1.885GlnPhe: 1.885 ± 0.072
2.828GlnGly: 2.828 ± 0.528
0.943GlnHis: 0.943 ± 0.671
0.943GlnIle: 0.943 ± 0.671
0.943GlnLys: 0.943 ± 0.6
2.828GlnLeu: 2.828 ± 2.014
1.885GlnMet: 1.885 ± 0.072
2.828GlnAsn: 2.828 ± 0.743
2.828GlnPro: 2.828 ± 0.743
1.885GlnGln: 1.885 ± 1.343
3.77GlnArg: 3.77 ± 1.415
0.0GlnSer: 0.0 ± 0.0
2.828GlnThr: 2.828 ± 0.743
0.943GlnVal: 0.943 ± 0.671
0.943GlnTrp: 0.943 ± 0.671
1.885GlnTyr: 1.885 ± 1.199
0.0GlnXaa: 0.0 ± 0.0
Arg
5.655ArgAla: 5.655 ± 0.216
0.943ArgCys: 0.943 ± 0.671
7.54ArgAsp: 7.54 ± 1.559
3.77ArgGlu: 3.77 ± 1.415
3.77ArgPhe: 3.77 ± 0.144
0.0ArgGly: 0.0 ± 0.0
0.943ArgHis: 0.943 ± 0.6
4.713ArgIle: 4.713 ± 0.456
3.77ArgLys: 3.77 ± 1.127
4.713ArgLeu: 4.713 ± 2.086
2.828ArgMet: 2.828 ± 0.933
3.77ArgAsn: 3.77 ± 2.686
3.77ArgPro: 3.77 ± 1.127
2.828ArgGln: 2.828 ± 0.528
7.54ArgArg: 7.54 ± 0.288
3.77ArgSer: 3.77 ± 1.127
4.713ArgThr: 4.713 ± 0.456
2.828ArgVal: 2.828 ± 0.743
0.943ArgTrp: 0.943 ± 0.671
0.0ArgTyr: 0.0 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
3.77SerAla: 3.77 ± 1.415
0.943SerCys: 0.943 ± 0.671
3.77SerAsp: 3.77 ± 0.144
3.77SerGlu: 3.77 ± 2.686
4.713SerPhe: 4.713 ± 0.815
5.655SerGly: 5.655 ± 0.216
1.885SerHis: 1.885 ± 1.199
0.0SerIle: 0.0 ± 0.0
4.713SerLys: 4.713 ± 0.456
6.598SerLeu: 6.598 ± 0.887
2.828SerMet: 2.828 ± 0.528
3.77SerAsn: 3.77 ± 1.415
4.713SerPro: 4.713 ± 1.727
6.598SerGln: 6.598 ± 3.429
7.54SerArg: 7.54 ± 0.288
7.54SerSer: 7.54 ± 1.559
0.943SerThr: 0.943 ± 0.671
2.828SerVal: 2.828 ± 0.743
0.943SerTrp: 0.943 ± 0.6
2.828SerTyr: 2.828 ± 1.799
0.0SerXaa: 0.0 ± 0.0
Thr
3.77ThrAla: 3.77 ± 0.144
0.0ThrCys: 0.0 ± 0.0
2.828ThrAsp: 2.828 ± 0.528
0.943ThrGlu: 0.943 ± 0.671
0.943ThrPhe: 0.943 ± 0.6
2.828ThrGly: 2.828 ± 2.014
0.943ThrHis: 0.943 ± 0.6
4.713ThrIle: 4.713 ± 0.456
3.77ThrLys: 3.77 ± 1.127
3.77ThrLeu: 3.77 ± 0.144
0.943ThrMet: 0.943 ± 0.6
2.828ThrAsn: 2.828 ± 0.528
4.713ThrPro: 4.713 ± 0.456
0.943ThrGln: 0.943 ± 0.671
3.77ThrArg: 3.77 ± 1.415
7.54ThrSer: 7.54 ± 4.101
2.828ThrThr: 2.828 ± 0.743
1.885ThrVal: 1.885 ± 1.343
0.943ThrTrp: 0.943 ± 0.6
0.943ThrTyr: 0.943 ± 0.671
0.0ThrXaa: 0.0 ± 0.0
Val
5.655ValAla: 5.655 ± 0.216
0.0ValCys: 0.0 ± 0.0
2.828ValAsp: 2.828 ± 0.743
3.77ValGlu: 3.77 ± 0.144
1.885ValPhe: 1.885 ± 0.072
0.943ValGly: 0.943 ± 0.6
0.0ValHis: 0.0 ± 0.0
0.943ValIle: 0.943 ± 0.6
2.828ValLys: 2.828 ± 0.743
2.828ValLeu: 2.828 ± 1.799
1.885ValMet: 1.885 ± 0.072
7.54ValAsn: 7.54 ± 0.288
5.655ValPro: 5.655 ± 1.487
1.885ValGln: 1.885 ± 1.343
4.713ValArg: 4.713 ± 0.456
5.655ValSer: 5.655 ± 1.487
4.713ValThr: 4.713 ± 3.357
0.0ValVal: 0.0 ± 0.0
0.943ValTrp: 0.943 ± 0.6
2.828ValTyr: 2.828 ± 1.799
0.0ValXaa: 0.0 ± 0.0
Trp
1.885TrpAla: 1.885 ± 1.199
0.0TrpCys: 0.0 ± 0.0
0.943TrpAsp: 0.943 ± 0.6
0.943TrpGlu: 0.943 ± 0.671
0.0TrpPhe: 0.0 ± 0.0
0.943TrpGly: 0.943 ± 0.671
0.0TrpHis: 0.0 ± 0.0
0.943TrpIle: 0.943 ± 0.671
1.885TrpLys: 1.885 ± 0.072
2.828TrpLeu: 2.828 ± 0.528
0.943TrpMet: 0.943 ± 0.671
1.885TrpAsn: 1.885 ± 1.199
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.943TrpArg: 0.943 ± 0.6
0.943TrpSer: 0.943 ± 0.671
0.0TrpThr: 0.0 ± 0.0
0.943TrpVal: 0.943 ± 0.6
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.0TyrAla: 0.0 ± 0.0
0.943TyrCys: 0.943 ± 0.6
0.0TyrAsp: 0.0 ± 0.0
1.885TyrGlu: 1.885 ± 1.343
3.77TyrPhe: 3.77 ± 2.398
1.885TyrGly: 1.885 ± 1.199
1.885TyrHis: 1.885 ± 0.072
1.885TyrIle: 1.885 ± 1.199
1.885TyrLys: 1.885 ± 0.072
5.655TyrLeu: 5.655 ± 2.326
0.943TyrMet: 0.943 ± 0.6
0.943TyrAsn: 0.943 ± 0.671
3.77TyrPro: 3.77 ± 1.127
0.943TyrGln: 0.943 ± 0.6
2.828TyrArg: 2.828 ± 0.743
1.885TyrSer: 1.885 ± 0.072
1.885TyrThr: 1.885 ± 0.072
1.885TyrVal: 1.885 ± 1.343
0.943TyrTrp: 0.943 ± 0.6
0.943TyrTyr: 0.943 ± 0.671
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1062 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
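
One way to picture a protein-level bootstrap is sketched below in Python: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is taken as the error. The page does not state which spread statistic is reported, so the use of the standard deviation here is an assumption, as are the function names pair_permille and bootstrap_error, the fixed seed, and the toy sequences.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency across protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement (same number as the input).
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    # Assumption: the error is the standard deviation of the replicates.
    return statistics.pstdev(estimates)


# Made-up toy sequences, not the proteome described on this page.
toy = ["MAAL", "ALPXGG"]
error_al = bootstrap_error(toy, "AL")
```

With only two proteins, each replicate can contain just three distinct combinations of proteins, so an estimate of this kind is necessarily coarse.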

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski