Amino acid dipepetide frequency for Wuhan insect virus 23

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.561AlaAla: 4.561 ± 1.925
0.0AlaCys: 0.0 ± 0.0
2.281AlaAsp: 2.281 ± 1.805
1.14AlaGlu: 1.14 ± 0.903
2.281AlaPhe: 2.281 ± 1.567
7.982AlaGly: 7.982 ± 2.946
1.14AlaHis: 1.14 ± 0.903
9.122AlaIle: 9.122 ± 2.163
2.281AlaLys: 2.281 ± 0.119
5.701AlaLeu: 5.701 ± 1.141
2.281AlaMet: 2.281 ± 0.119
3.421AlaAsn: 3.421 ± 1.022
2.281AlaPro: 2.281 ± 0.119
2.281AlaGln: 2.281 ± 1.805
5.701AlaArg: 5.701 ± 1.141
3.421AlaSer: 3.421 ± 0.664
4.561AlaThr: 4.561 ± 1.925
9.122AlaVal: 9.122 ± 1.209
0.0AlaTrp: 0.0 ± 0.0
5.701AlaTyr: 5.701 ± 2.231
0.0AlaXaa: 0.0 ± 0.0
Cys
1.14CysAla: 1.14 ± 0.783
1.14CysCys: 1.14 ± 0.783
1.14CysAsp: 1.14 ± 0.783
0.0CysGlu: 0.0 ± 0.0
2.281CysPhe: 2.281 ± 1.567
1.14CysGly: 1.14 ± 0.783
1.14CysHis: 1.14 ± 0.903
0.0CysIle: 0.0 ± 0.0
1.14CysLys: 1.14 ± 0.783
1.14CysLeu: 1.14 ± 0.903
0.0CysMet: 0.0 ± 0.0
2.281CysAsn: 2.281 ± 1.567
1.14CysPro: 1.14 ± 0.903
1.14CysGln: 1.14 ± 0.903
1.14CysArg: 1.14 ± 0.783
0.0CysSer: 0.0 ± 0.0
1.14CysThr: 1.14 ± 0.903
0.0CysVal: 0.0 ± 0.0
1.14CysTrp: 1.14 ± 0.783
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
4.561AspAla: 4.561 ± 1.925
1.14AspCys: 1.14 ± 0.783
4.561AspAsp: 4.561 ± 1.448
4.561AspGlu: 4.561 ± 1.925
5.701AspPhe: 5.701 ± 2.231
1.14AspGly: 1.14 ± 0.783
1.14AspHis: 1.14 ± 0.783
7.982AspIle: 7.982 ± 0.426
3.421AspLys: 3.421 ± 2.35
6.842AspLeu: 6.842 ± 3.015
1.14AspMet: 1.14 ± 0.783
1.14AspAsn: 1.14 ± 0.903
2.281AspPro: 2.281 ± 1.567
2.281AspGln: 2.281 ± 0.119
2.281AspArg: 2.281 ± 0.119
3.421AspSer: 3.421 ± 1.022
0.0AspThr: 0.0 ± 0.0
3.421AspVal: 3.421 ± 0.664
0.0AspTrp: 0.0 ± 0.0
1.14AspTyr: 1.14 ± 0.783
0.0AspXaa: 0.0 ± 0.0
Glu
5.701GluAla: 5.701 ± 1.141
0.0GluCys: 0.0 ± 0.0
2.281GluAsp: 2.281 ± 0.119
2.281GluGlu: 2.281 ± 0.119
1.14GluPhe: 1.14 ± 0.903
1.14GluGly: 1.14 ± 0.903
0.0GluHis: 0.0 ± 0.0
3.421GluIle: 3.421 ± 1.022
1.14GluLys: 1.14 ± 0.783
3.421GluLeu: 3.421 ± 2.708
0.0GluMet: 0.0 ± 0.0
3.421GluAsn: 3.421 ± 2.708
2.281GluPro: 2.281 ± 0.119
1.14GluGln: 1.14 ± 0.903
12.543GluArg: 12.543 ± 3.185
1.14GluSer: 1.14 ± 0.903
2.281GluThr: 2.281 ± 1.567
4.561GluVal: 4.561 ± 1.448
0.0GluTrp: 0.0 ± 0.0
0.0GluTyr: 0.0 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
4.561PheAla: 4.561 ± 0.238
0.0PheCys: 0.0 ± 0.0
3.421PheAsp: 3.421 ± 2.35
3.421PheGlu: 3.421 ± 2.35
0.0PhePhe: 0.0 ± 0.0
6.842PheGly: 6.842 ± 4.701
1.14PheHis: 1.14 ± 0.783
2.281PheIle: 2.281 ± 1.805
0.0PheLys: 0.0 ± 0.0
5.701PheLeu: 5.701 ± 3.917
1.14PheMet: 1.14 ± 0.783
1.14PheAsn: 1.14 ± 0.783
0.0PhePro: 0.0 ± 0.0
1.14PheGln: 1.14 ± 0.783
3.421PheArg: 3.421 ± 1.022
2.281PheSer: 2.281 ± 1.805
4.561PheThr: 4.561 ± 0.238
0.0PheVal: 0.0 ± 0.0
3.421PheTrp: 3.421 ± 0.664
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
3.421GlyAla: 3.421 ± 2.708
1.14GlyCys: 1.14 ± 0.783
2.281GlyAsp: 2.281 ± 0.119
5.701GlyGlu: 5.701 ± 0.545
6.842GlyPhe: 6.842 ± 0.358
2.281GlyGly: 2.281 ± 0.119
0.0GlyHis: 0.0 ± 0.0
6.842GlyIle: 6.842 ± 1.329
4.561GlyLys: 4.561 ± 0.238
7.982GlyLeu: 7.982 ± 0.426
1.14GlyMet: 1.14 ± 0.783
2.281GlyAsn: 2.281 ± 1.567
2.281GlyPro: 2.281 ± 0.119
3.421GlyGln: 3.421 ± 1.022
1.14GlyArg: 1.14 ± 0.903
4.561GlySer: 4.561 ± 1.448
1.14GlyThr: 1.14 ± 0.903
2.281GlyVal: 2.281 ± 0.119
0.0GlyTrp: 0.0 ± 0.0
5.701GlyTyr: 5.701 ± 2.231
0.0GlyXaa: 0.0 ± 0.0
His
1.14HisAla: 1.14 ± 0.903
0.0HisCys: 0.0 ± 0.0
1.14HisAsp: 1.14 ± 0.783
1.14HisGlu: 1.14 ± 0.903
1.14HisPhe: 1.14 ± 0.783
2.281HisGly: 2.281 ± 1.567
0.0HisHis: 0.0 ± 0.0
1.14HisIle: 1.14 ± 0.783
0.0HisLys: 0.0 ± 0.0
5.701HisLeu: 5.701 ± 1.141
3.421HisMet: 3.421 ± 0.664
1.14HisAsn: 1.14 ± 0.903
1.14HisPro: 1.14 ± 0.783
0.0HisGln: 0.0 ± 0.0
1.14HisArg: 1.14 ± 0.783
3.421HisSer: 3.421 ± 2.708
0.0HisThr: 0.0 ± 0.0
1.14HisVal: 1.14 ± 0.903
0.0HisTrp: 0.0 ± 0.0
2.281HisTyr: 2.281 ± 1.567
0.0HisXaa: 0.0 ± 0.0
Ile
6.842IleAla: 6.842 ± 2.044
1.14IleCys: 1.14 ± 0.783
3.421IleAsp: 3.421 ± 0.664
2.281IleGlu: 2.281 ± 1.805
0.0IlePhe: 0.0 ± 0.0
5.701IleGly: 5.701 ± 1.141
2.281IleHis: 2.281 ± 0.119
0.0IleIle: 0.0 ± 0.0
5.701IleLys: 5.701 ± 2.231
6.842IleLeu: 6.842 ± 0.358
1.14IleMet: 1.14 ± 0.783
4.561IleAsn: 4.561 ± 1.448
3.421IlePro: 3.421 ± 1.022
1.14IleGln: 1.14 ± 0.783
6.842IleArg: 6.842 ± 0.358
2.281IleSer: 2.281 ± 0.119
4.561IleThr: 4.561 ± 3.611
4.561IleVal: 4.561 ± 0.238
1.14IleTrp: 1.14 ± 0.783
3.421IleTyr: 3.421 ± 2.708
0.0IleXaa: 0.0 ± 0.0
Lys
1.14LysAla: 1.14 ± 0.903
2.281LysCys: 2.281 ± 1.567
1.14LysAsp: 1.14 ± 0.783
0.0LysGlu: 0.0 ± 0.0
1.14LysPhe: 1.14 ± 0.783
3.421LysGly: 3.421 ± 0.664
0.0LysHis: 0.0 ± 0.0
4.561LysIle: 4.561 ± 0.238
7.982LysLys: 7.982 ± 3.798
0.0LysLeu: 0.0 ± 0.0
1.14LysMet: 1.14 ± 0.783
3.421LysAsn: 3.421 ± 2.35
2.281LysPro: 2.281 ± 1.567
1.14LysGln: 1.14 ± 0.783
3.421LysArg: 3.421 ± 1.022
6.842LysSer: 6.842 ± 1.329
1.14LysThr: 1.14 ± 0.783
2.281LysVal: 2.281 ± 0.119
0.0LysTrp: 0.0 ± 0.0
1.14LysTyr: 1.14 ± 0.783
0.0LysXaa: 0.0 ± 0.0
Leu
11.403LeuAla: 11.403 ± 2.776
1.14LeuCys: 1.14 ± 0.903
7.982LeuAsp: 7.982 ± 0.426
1.14LeuGlu: 1.14 ± 0.903
0.0LeuPhe: 0.0 ± 0.0
4.561LeuGly: 4.561 ± 1.448
2.281LeuHis: 2.281 ± 1.567
1.14LeuIle: 1.14 ± 0.783
2.281LeuLys: 2.281 ± 0.119
7.982LeuLeu: 7.982 ± 2.112
1.14LeuMet: 1.14 ± 0.903
3.421LeuAsn: 3.421 ± 0.664
9.122LeuPro: 9.122 ± 2.896
1.14LeuGln: 1.14 ± 0.783
2.281LeuArg: 2.281 ± 0.119
5.701LeuSer: 5.701 ± 1.141
9.122LeuThr: 9.122 ± 3.849
10.262LeuVal: 10.262 ± 1.38
2.281LeuTrp: 2.281 ± 1.805
2.281LeuTyr: 2.281 ± 0.119
0.0LeuXaa: 0.0 ± 0.0
Met
0.0MetAla: 0.0 ± 0.0
1.14MetCys: 1.14 ± 0.783
0.0MetAsp: 0.0 ± 0.0
0.0MetGlu: 0.0 ± 0.0
3.421MetPhe: 3.421 ± 0.664
1.14MetGly: 1.14 ± 0.903
1.14MetHis: 1.14 ± 0.783
2.281MetIle: 2.281 ± 0.119
1.14MetLys: 1.14 ± 0.783
2.281MetLeu: 2.281 ± 1.567
1.14MetMet: 1.14 ± 0.903
1.14MetAsn: 1.14 ± 0.903
0.0MetPro: 0.0 ± 0.0
1.14MetGln: 1.14 ± 0.783
0.0MetArg: 0.0 ± 0.0
2.281MetSer: 2.281 ± 1.567
2.281MetThr: 2.281 ± 1.567
1.14MetVal: 1.14 ± 0.783
0.0MetTrp: 0.0 ± 0.0
2.281MetTyr: 2.281 ± 0.119
0.0MetXaa: 0.0 ± 0.0
Asn
1.14AsnAla: 1.14 ± 0.783
1.14AsnCys: 1.14 ± 0.903
4.561AsnAsp: 4.561 ± 1.448
0.0AsnGlu: 0.0 ± 0.0
3.421AsnPhe: 3.421 ± 0.664
3.421AsnGly: 3.421 ± 0.664
1.14AsnHis: 1.14 ± 0.783
3.421AsnIle: 3.421 ± 1.022
0.0AsnLys: 0.0 ± 0.0
3.421AsnLeu: 3.421 ± 1.022
1.14AsnMet: 1.14 ± 0.783
1.14AsnAsn: 1.14 ± 0.903
1.14AsnPro: 1.14 ± 0.903
0.0AsnGln: 0.0 ± 0.0
1.14AsnArg: 1.14 ± 0.783
2.281AsnSer: 2.281 ± 1.805
3.421AsnThr: 3.421 ± 0.664
4.561AsnVal: 4.561 ± 1.448
0.0AsnTrp: 0.0 ± 0.0
1.14AsnTyr: 1.14 ± 0.783
0.0AsnXaa: 0.0 ± 0.0
Pro
4.561ProAla: 4.561 ± 0.238
0.0ProCys: 0.0 ± 0.0
5.701ProAsp: 5.701 ± 2.231
2.281ProGlu: 2.281 ± 0.119
1.14ProPhe: 1.14 ± 0.783
4.561ProGly: 4.561 ± 1.925
1.14ProHis: 1.14 ± 0.903
4.561ProIle: 4.561 ± 0.238
0.0ProLys: 0.0 ± 0.0
4.561ProLeu: 4.561 ± 1.448
1.14ProMet: 1.14 ± 0.783
0.0ProAsn: 0.0 ± 0.0
1.14ProPro: 1.14 ± 0.783
0.0ProGln: 0.0 ± 0.0
2.281ProArg: 2.281 ± 1.805
2.281ProSer: 2.281 ± 0.119
5.701ProThr: 5.701 ± 2.231
3.421ProVal: 3.421 ± 0.664
2.281ProTrp: 2.281 ± 1.567
1.14ProTyr: 1.14 ± 0.783
0.0ProXaa: 0.0 ± 0.0
Gln
3.421GlnAla: 3.421 ± 1.022
2.281GlnCys: 2.281 ± 0.119
0.0GlnAsp: 0.0 ± 0.0
6.842GlnGlu: 6.842 ± 0.358
2.281GlnPhe: 2.281 ± 0.119
2.281GlnGly: 2.281 ± 1.805
0.0GlnHis: 0.0 ± 0.0
2.281GlnIle: 2.281 ± 1.805
0.0GlnLys: 0.0 ± 0.0
5.701GlnLeu: 5.701 ± 1.141
0.0GlnMet: 0.0 ± 0.0
0.0GlnAsn: 0.0 ± 0.0
0.0GlnPro: 0.0 ± 0.0
3.421GlnGln: 3.421 ± 1.022
4.561GlnArg: 4.561 ± 1.448
0.0GlnSer: 0.0 ± 0.0
0.0GlnThr: 0.0 ± 0.0
1.14GlnVal: 1.14 ± 0.903
0.0GlnTrp: 0.0 ± 0.0
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
6.842ArgAla: 6.842 ± 2.044
1.14ArgCys: 1.14 ± 0.783
9.122ArgAsp: 9.122 ± 1.209
7.982ArgGlu: 7.982 ± 4.633
3.421ArgPhe: 3.421 ± 0.664
3.421ArgGly: 3.421 ± 1.022
3.421ArgHis: 3.421 ± 1.022
6.842ArgIle: 6.842 ± 0.358
0.0ArgLys: 0.0 ± 0.0
2.281ArgLeu: 2.281 ± 1.805
1.14ArgMet: 1.14 ± 0.508
0.0ArgAsn: 0.0 ± 0.0
3.421ArgPro: 3.421 ± 2.35
3.421ArgGln: 3.421 ± 2.708
6.842ArgArg: 6.842 ± 0.358
2.281ArgSer: 2.281 ± 0.119
6.842ArgThr: 6.842 ± 0.358
5.701ArgVal: 5.701 ± 1.141
0.0ArgTrp: 0.0 ± 0.0
1.14ArgTyr: 1.14 ± 0.783
0.0ArgXaa: 0.0 ± 0.0
Ser
1.14SerAla: 1.14 ± 0.903
0.0SerCys: 0.0 ± 0.0
1.14SerAsp: 1.14 ± 0.903
1.14SerGlu: 1.14 ± 0.783
2.281SerPhe: 2.281 ± 0.119
6.842SerGly: 6.842 ± 1.329
0.0SerHis: 0.0 ± 0.0
2.281SerIle: 2.281 ± 0.119
5.701SerLys: 5.701 ± 1.141
4.561SerLeu: 4.561 ± 1.925
2.281SerMet: 2.281 ± 0.119
1.14SerAsn: 1.14 ± 0.783
3.421SerPro: 3.421 ± 0.664
2.281SerGln: 2.281 ± 0.119
4.561SerArg: 4.561 ± 1.925
6.842SerSer: 6.842 ± 3.015
4.561SerThr: 4.561 ± 1.925
5.701SerVal: 5.701 ± 2.827
1.14SerTrp: 1.14 ± 0.903
2.281SerTyr: 2.281 ± 1.567
0.0SerXaa: 0.0 ± 0.0
Thr
3.421ThrAla: 3.421 ± 0.664
1.14ThrCys: 1.14 ± 0.903
3.421ThrAsp: 3.421 ± 1.022
1.14ThrGlu: 1.14 ± 0.903
3.421ThrPhe: 3.421 ± 0.664
1.14ThrGly: 1.14 ± 0.783
1.14ThrHis: 1.14 ± 0.783
2.281ThrIle: 2.281 ± 1.805
3.421ThrLys: 3.421 ± 2.35
4.561ThrLeu: 4.561 ± 0.238
1.14ThrMet: 1.14 ± 0.783
0.0ThrAsn: 0.0 ± 0.0
4.561ThrPro: 4.561 ± 3.611
4.561ThrGln: 4.561 ± 1.925
6.842ThrArg: 6.842 ± 2.044
3.421ThrSer: 3.421 ± 1.022
5.701ThrThr: 5.701 ± 2.827
7.982ThrVal: 7.982 ± 1.26
2.281ThrTrp: 2.281 ± 1.805
1.14ThrTyr: 1.14 ± 0.783
0.0ThrXaa: 0.0 ± 0.0
Val
4.561ValAla: 4.561 ± 1.925
2.281ValCys: 2.281 ± 0.119
2.281ValAsp: 2.281 ± 1.567
6.842ValGlu: 6.842 ± 2.044
3.421ValPhe: 3.421 ± 2.35
3.421ValGly: 3.421 ± 1.022
7.982ValHis: 7.982 ± 1.26
3.421ValIle: 3.421 ± 1.022
2.281ValLys: 2.281 ± 1.567
1.14ValLeu: 1.14 ± 0.783
1.14ValMet: 1.14 ± 0.551
5.701ValAsn: 5.701 ± 0.545
3.421ValPro: 3.421 ± 0.664
2.281ValGln: 2.281 ± 0.119
5.701ValArg: 5.701 ± 0.545
2.281ValSer: 2.281 ± 0.119
4.561ValThr: 4.561 ± 1.925
3.421ValVal: 3.421 ± 0.664
3.421ValTrp: 3.421 ± 0.664
3.421ValTyr: 3.421 ± 2.708
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
1.14TrpCys: 1.14 ± 0.783
2.281TrpAsp: 2.281 ± 0.119
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
1.14TrpGly: 1.14 ± 0.783
1.14TrpHis: 1.14 ± 0.783
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
2.281TrpLeu: 2.281 ± 0.119
1.14TrpMet: 1.14 ± 0.783
0.0TrpAsn: 0.0 ± 0.0
2.281TrpPro: 2.281 ± 0.119
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
4.561TrpSer: 4.561 ± 1.925
0.0TrpThr: 0.0 ± 0.0
1.14TrpVal: 1.14 ± 0.903
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
5.701TyrAla: 5.701 ± 0.545
0.0TyrCys: 0.0 ± 0.0
0.0TyrAsp: 0.0 ± 0.0
0.0TyrGlu: 0.0 ± 0.0
1.14TyrPhe: 1.14 ± 0.783
1.14TyrGly: 1.14 ± 0.783
1.14TyrHis: 1.14 ± 0.903
3.421TyrIle: 3.421 ± 2.35
3.421TyrLys: 3.421 ± 0.664
4.561TyrLeu: 4.561 ± 1.448
0.0TyrMet: 0.0 ± 0.0
2.281TyrAsn: 2.281 ± 0.119
2.281TyrPro: 2.281 ± 1.567
2.281TyrGln: 2.281 ± 0.119
4.561TyrArg: 4.561 ± 0.238
0.0TyrSer: 0.0 ± 0.0
1.14TyrThr: 1.14 ± 0.903
1.14TyrVal: 1.14 ± 0.783
0.0TyrTrp: 0.0 ± 0.0
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (878 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski