Amino acid dipeptide frequency for Wuhan insect virus 27

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
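As a minimal sketch of how such a frequency could be computed, the snippet below counts overlapping pairs of adjacent residues and reports them in per mille. It assumes one-letter residue codes and that pairs never span two proteins; the function name `dipeptide_frequencies` is hypothetical, and the exact counting rules used for the table on this page are not stated, so treat this as an illustration only.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return per-mille frequencies of adjacent residue pairs.

    Assumptions (not confirmed by the source): sequences use one-letter
    codes, pairs overlap, and no pair spans two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        # zip(seq, seq[1:]) yields every overlapping pair of neighbours.
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1

    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


if __name__ == "__main__":
    # Toy sequences, not taken from the proteome on this page.
    for pair, value in sorted(dipeptide_frequencies(["ACAC", "AAG"]).items()):
        print(f"{pair}: {value:.3f}")
```

Pairs that never occur are simply absent from the returned dictionary, which corresponds to the 0.0 entries in the table.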

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
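The uniform expectation and the red/blue marking can be sketched as follows. The helper names `expected_per_mille` and `classify` are hypothetical, and the sketch assumes the comparison is made against the plain uniform value (2.5‰ for 20 amino acids); the page does not say whether 400 or 441 combinations are used as the baseline.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(value_per_mille, alphabet_size=20):
    """Label a table value relative to the uniform expectation.

    Assumption: 'red' means above and 'blue' means below the uniform
    value; the exact threshold used by the page is not documented here.
    """
    expected = expected_per_mille(alphabet_size)
    if value_per_mille > expected:
        return "red"   # overrepresented
    if value_per_mille < expected:
        return "blue"  # underrepresented
    return "none"


if __name__ == "__main__":
    print(expected_per_mille(20))  # 400 combinations
    print(expected_per_mille(21))  # 441 combinations, Xaa included
    print(classify(8.048))         # AlaAla value from the table
    print(classify(0.671))         # AlaPhe value from the table
```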

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
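For readers who want to convert the table values, here is a small sketch of the unit arithmetic. The function names are hypothetical; only the factor of 10^-3 comes from the text above.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0


if __name__ == "__main__":
    alaala = 8.048  # AlaAla value from the table, in per mille
    print(f"{per_mille_to_fraction(alaala):.6f}")
    print(f"{per_mille_to_percent(alaala):.4f} %")
```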

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
8.048AlaAla: 8.048 ± 0.422
1.341AlaCys: 1.341 ± 1.007
4.695AlaAsp: 4.695 ± 2.096
2.012AlaGlu: 2.012 ± 0.363
0.671AlaPhe: 0.671 ± 0.433
3.353AlaGly: 3.353 ± 1.23
1.341AlaHis: 1.341 ± 0.867
3.353AlaIle: 3.353 ± 0.644
4.695AlaLys: 4.695 ± 1.159
6.036AlaLeu: 6.036 ± 2.026
1.341AlaMet: 1.341 ± 0.867
4.024AlaAsn: 4.024 ± 2.085
3.353AlaPro: 3.353 ± 0.644
0.671AlaGln: 0.671 ± 0.433
6.036AlaArg: 6.036 ± 0.152
4.695AlaSer: 4.695 ± 0.222
10.731AlaThr: 10.731 ± 3.374
6.707AlaVal: 6.707 ± 0.585
0.671AlaTrp: 0.671 ± 0.433
3.353AlaTyr: 3.353 ± 0.644
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
1.341CysAsp: 1.341 ± 0.07
2.012CysGlu: 2.012 ± 0.574
1.341CysPhe: 1.341 ± 1.007
0.0CysGly: 0.0 ± 0.0
0.671CysHis: 0.671 ± 0.504
0.671CysIle: 0.671 ± 0.433
0.0CysLys: 0.0 ± 0.0
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.671CysAsn: 0.671 ± 0.433
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.671CysArg: 0.671 ± 0.504
0.0CysSer: 0.0 ± 0.0
0.0CysThr: 0.0 ± 0.0
2.683CysVal: 2.683 ± 2.015
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.341AspAla: 1.341 ± 0.867
2.012AspCys: 2.012 ± 1.511
2.012AspAsp: 2.012 ± 0.574
2.012AspGlu: 2.012 ± 0.574
4.024AspPhe: 4.024 ± 0.726
3.353AspGly: 3.353 ± 1.582
0.671AspHis: 0.671 ± 0.433
1.341AspIle: 1.341 ± 1.007
2.012AspLys: 2.012 ± 0.574
8.048AspLeu: 8.048 ± 1.452
2.683AspMet: 2.683 ± 0.796
3.353AspAsn: 3.353 ± 0.644
3.353AspPro: 3.353 ± 1.582
1.341AspGln: 1.341 ± 0.07
2.012AspArg: 2.012 ± 0.363
4.024AspSer: 4.024 ± 0.211
2.683AspThr: 2.683 ± 2.015
6.036AspVal: 6.036 ± 2.026
0.671AspTrp: 0.671 ± 0.433
1.341AspTyr: 1.341 ± 0.07
0.0AspXaa: 0.0 ± 0.0
Glu
7.378GluAla: 7.378 ± 1.793
0.671GluCys: 0.671 ± 0.504
4.695GluAsp: 4.695 ± 0.222
2.683GluGlu: 2.683 ± 1.078
3.353GluPhe: 3.353 ± 0.293
1.341GluGly: 1.341 ± 1.007
0.671GluHis: 0.671 ± 0.433
3.353GluIle: 3.353 ± 0.644
2.683GluLys: 2.683 ± 0.141
6.707GluLeu: 6.707 ± 1.522
0.671GluMet: 0.671 ± 0.795
2.012GluAsn: 2.012 ± 0.574
2.012GluPro: 2.012 ± 0.574
2.012GluGln: 2.012 ± 0.574
3.353GluArg: 3.353 ± 2.167
4.695GluSer: 4.695 ± 1.159
2.012GluThr: 2.012 ± 0.574
5.366GluVal: 5.366 ± 0.655
1.341GluTrp: 1.341 ± 0.867
2.683GluTyr: 2.683 ± 0.796
0.0GluXaa: 0.0 ± 0.0
Phe
5.366PheAla: 5.366 ± 1.219
1.341PheCys: 1.341 ± 0.07
1.341PheAsp: 1.341 ± 1.007
2.012PheGlu: 2.012 ± 1.3
1.341PhePhe: 1.341 ± 0.07
2.683PheGly: 2.683 ± 0.141
0.0PheHis: 0.0 ± 0.0
1.341PheIle: 1.341 ± 0.07
1.341PheLys: 1.341 ± 1.007
2.683PheLeu: 2.683 ± 0.796
0.671PheMet: 0.671 ± 0.433
4.024PheAsn: 4.024 ± 0.726
0.671PhePro: 0.671 ± 0.433
1.341PheGln: 1.341 ± 1.007
2.012PheArg: 2.012 ± 0.363
3.353PheSer: 3.353 ± 0.644
2.012PheThr: 2.012 ± 0.363
4.695PheVal: 4.695 ± 1.159
0.671PheTrp: 0.671 ± 0.433
2.012PheTyr: 2.012 ± 0.574
0.0PheXaa: 0.0 ± 0.0
Gly
4.695GlyAla: 4.695 ± 1.652
1.341GlyCys: 1.341 ± 0.07
1.341GlyAsp: 1.341 ± 0.867
5.366GlyGlu: 5.366 ± 2.53
2.012GlyPhe: 2.012 ± 0.363
3.353GlyGly: 3.353 ± 1.23
0.0GlyHis: 0.0 ± 0.0
2.012GlyIle: 2.012 ± 0.363
3.353GlyLys: 3.353 ± 1.23
3.353GlyLeu: 3.353 ± 0.644
0.0GlyMet: 0.0 ± 0.0
0.671GlyAsn: 0.671 ± 0.504
2.012GlyPro: 2.012 ± 0.363
2.012GlyGln: 2.012 ± 0.574
2.012GlyArg: 2.012 ± 1.511
6.036GlySer: 6.036 ± 2.026
4.024GlyThr: 4.024 ± 0.211
3.353GlyVal: 3.353 ± 0.644
0.671GlyTrp: 0.671 ± 0.433
2.683GlyTyr: 2.683 ± 1.078
0.0GlyXaa: 0.0 ± 0.0
His
2.012HisAla: 2.012 ± 0.574
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
2.012HisGlu: 2.012 ± 1.511
2.683HisPhe: 2.683 ± 1.733
0.0HisGly: 0.0 ± 0.0
0.671HisHis: 0.671 ± 0.433
0.671HisIle: 0.671 ± 0.433
1.341HisLys: 1.341 ± 0.07
1.341HisLeu: 1.341 ± 1.007
0.0HisMet: 0.0 ± 0.0
0.671HisAsn: 0.671 ± 0.433
0.0HisPro: 0.0 ± 0.0
0.0HisGln: 0.0 ± 0.0
1.341HisArg: 1.341 ± 0.867
3.353HisSer: 3.353 ± 0.293
0.671HisThr: 0.671 ± 0.504
2.012HisVal: 2.012 ± 0.363
0.0HisTrp: 0.0 ± 0.0
2.012HisTyr: 2.012 ± 1.511
0.0HisXaa: 0.0 ± 0.0
Ile
5.366IleAla: 5.366 ± 2.53
0.0IleCys: 0.0 ± 0.0
6.036IleAsp: 6.036 ± 0.152
4.695IleGlu: 4.695 ± 0.222
0.0IlePhe: 0.0 ± 0.0
1.341IleGly: 1.341 ± 0.07
1.341IleHis: 1.341 ± 0.867
0.671IleIle: 0.671 ± 0.433
4.024IleLys: 4.024 ± 0.726
0.671IleLeu: 0.671 ± 0.433
0.671IleMet: 0.671 ± 0.433
2.012IleAsn: 2.012 ± 0.574
2.683IlePro: 2.683 ± 1.078
0.671IleGln: 0.671 ± 0.433
4.024IleArg: 4.024 ± 1.663
4.695IleSer: 4.695 ± 0.222
4.695IleThr: 4.695 ± 0.222
2.012IleVal: 2.012 ± 1.511
0.671IleTrp: 0.671 ± 0.433
1.341IleTyr: 1.341 ± 0.867
0.0IleXaa: 0.0 ± 0.0
Lys
4.695LysAla: 4.695 ± 1.159
0.671LysCys: 0.671 ± 0.433
0.0LysAsp: 0.0 ± 0.0
4.024LysGlu: 4.024 ± 2.6
2.012LysPhe: 2.012 ± 1.3
2.012LysGly: 2.012 ± 0.574
0.671LysHis: 0.671 ± 0.504
3.353LysIle: 3.353 ± 0.293
5.366LysLys: 5.366 ± 0.655
5.366LysLeu: 5.366 ± 0.282
2.012LysMet: 2.012 ± 0.574
2.683LysAsn: 2.683 ± 0.141
3.353LysPro: 3.353 ± 0.644
2.012LysGln: 2.012 ± 1.3
3.353LysArg: 3.353 ± 0.644
3.353LysSer: 3.353 ± 0.293
2.683LysThr: 2.683 ± 0.796
2.012LysVal: 2.012 ± 0.363
0.0LysTrp: 0.0 ± 0.0
3.353LysTyr: 3.353 ± 0.293
0.0LysXaa: 0.0 ± 0.0
Leu
6.036LeuAla: 6.036 ± 0.152
0.671LeuCys: 0.671 ± 0.504
4.695LeuAsp: 4.695 ± 1.652
4.695LeuGlu: 4.695 ± 0.715
2.683LeuPhe: 2.683 ± 0.141
5.366LeuGly: 5.366 ± 0.655
0.671LeuHis: 0.671 ± 0.504
5.366LeuIle: 5.366 ± 1.593
3.353LeuLys: 3.353 ± 0.644
3.353LeuLeu: 3.353 ± 0.644
0.671LeuMet: 0.671 ± 0.433
5.366LeuAsn: 5.366 ± 0.282
4.024LeuPro: 4.024 ± 0.726
4.695LeuGln: 4.695 ± 0.222
7.378LeuArg: 7.378 ± 1.018
6.707LeuSer: 6.707 ± 1.522
8.048LeuThr: 8.048 ± 1.359
7.378LeuVal: 7.378 ± 2.892
2.012LeuTrp: 2.012 ± 1.511
5.366LeuTyr: 5.366 ± 1.219
0.0LeuXaa: 0.0 ± 0.0
Met
2.012MetAla: 2.012 ± 0.363
0.0MetCys: 0.0 ± 0.0
2.012MetAsp: 2.012 ± 0.574
0.671MetGlu: 0.671 ± 0.433
0.671MetPhe: 0.671 ± 0.433
1.341MetGly: 1.341 ± 0.867
2.012MetHis: 2.012 ± 1.511
2.012MetIle: 2.012 ± 0.363
1.341MetLys: 1.341 ± 0.867
1.341MetLeu: 1.341 ± 1.007
0.0MetMet: 0.0 ± 0.0
0.671MetAsn: 0.671 ± 0.504
3.353MetPro: 3.353 ± 0.644
1.341MetGln: 1.341 ± 0.867
0.671MetArg: 0.671 ± 0.433
2.012MetSer: 2.012 ± 0.574
2.012MetThr: 2.012 ± 1.3
0.0MetVal: 0.0 ± 0.0
0.671MetTrp: 0.671 ± 0.504
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
5.366AsnAla: 5.366 ± 0.655
1.341AsnCys: 1.341 ± 0.07
3.353AsnAsp: 3.353 ± 1.582
3.353AsnGlu: 3.353 ± 0.644
2.683AsnPhe: 2.683 ± 2.015
2.683AsnGly: 2.683 ± 1.078
0.0AsnHis: 0.0 ± 0.0
1.341AsnIle: 1.341 ± 0.07
1.341AsnLys: 1.341 ± 0.867
4.024AsnLeu: 4.024 ± 0.211
1.341AsnMet: 1.341 ± 0.07
2.683AsnAsn: 2.683 ± 0.796
2.683AsnPro: 2.683 ± 0.141
1.341AsnGln: 1.341 ± 0.867
4.024AsnArg: 4.024 ± 1.148
3.353AsnSer: 3.353 ± 0.293
1.341AsnThr: 1.341 ± 0.07
4.024AsnVal: 4.024 ± 1.663
0.671AsnTrp: 0.671 ± 0.504
2.683AsnTyr: 2.683 ± 0.141
0.0AsnXaa: 0.0 ± 0.0
Pro
3.353ProAla: 3.353 ± 1.582
0.0ProCys: 0.0 ± 0.0
0.671ProAsp: 0.671 ± 0.433
4.695ProGlu: 4.695 ± 1.652
2.683ProPhe: 2.683 ± 0.141
2.012ProGly: 2.012 ± 1.3
0.0ProHis: 0.0 ± 0.0
2.683ProIle: 2.683 ± 0.796
2.012ProLys: 2.012 ± 1.3
2.683ProLeu: 2.683 ± 2.015
1.341ProMet: 1.341 ± 0.07
2.012ProAsn: 2.012 ± 1.511
0.0ProPro: 0.0 ± 0.0
1.341ProGln: 1.341 ± 0.867
2.012ProArg: 2.012 ± 0.574
2.683ProSer: 2.683 ± 1.078
2.012ProThr: 2.012 ± 0.363
6.036ProVal: 6.036 ± 1.722
1.341ProTrp: 1.341 ± 1.007
1.341ProTyr: 1.341 ± 1.007
0.0ProXaa: 0.0 ± 0.0
Gln
3.353GlnAla: 3.353 ± 2.167
0.0GlnCys: 0.0 ± 0.0
1.341GlnAsp: 1.341 ± 0.867
4.024GlnGlu: 4.024 ± 0.211
0.671GlnPhe: 0.671 ± 0.433
0.671GlnGly: 0.671 ± 0.433
2.012GlnHis: 2.012 ± 0.363
1.341GlnIle: 1.341 ± 0.867
2.683GlnLys: 2.683 ± 1.078
3.353GlnLeu: 3.353 ± 1.582
0.671GlnMet: 0.671 ± 0.504
1.341GlnAsn: 1.341 ± 0.07
2.012GlnPro: 2.012 ± 1.511
0.671GlnGln: 0.671 ± 0.433
2.012GlnArg: 2.012 ± 0.363
3.353GlnSer: 3.353 ± 1.582
1.341GlnThr: 1.341 ± 0.867
1.341GlnVal: 1.341 ± 0.07
1.341GlnTrp: 1.341 ± 0.867
2.012GlnTyr: 2.012 ± 1.3
0.0GlnXaa: 0.0 ± 0.0
Arg
4.024ArgAla: 4.024 ± 0.726
0.0ArgCys: 0.0 ± 0.0
5.366ArgAsp: 5.366 ± 0.282
3.353ArgGlu: 3.353 ± 0.293
4.024ArgPhe: 4.024 ± 0.211
4.024ArgGly: 4.024 ± 0.726
2.012ArgHis: 2.012 ± 0.363
2.683ArgIle: 2.683 ± 1.733
5.366ArgLys: 5.366 ± 0.282
7.378ArgLeu: 7.378 ± 1.955
0.671ArgMet: 0.671 ± 0.504
2.012ArgAsn: 2.012 ± 1.3
2.012ArgPro: 2.012 ± 0.363
0.671ArgGln: 0.671 ± 0.504
2.683ArgArg: 2.683 ± 0.141
4.695ArgSer: 4.695 ± 1.159
3.353ArgThr: 3.353 ± 0.644
7.378ArgVal: 7.378 ± 1.018
0.671ArgTrp: 0.671 ± 0.433
0.0ArgTyr: 0.0 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
1.341SerAla: 1.341 ± 0.867
0.0SerCys: 0.0 ± 0.0
4.024SerAsp: 4.024 ± 0.726
3.353SerGlu: 3.353 ± 0.644
3.353SerPhe: 3.353 ± 0.644
5.366SerGly: 5.366 ± 0.282
4.024SerHis: 4.024 ± 1.148
4.695SerIle: 4.695 ± 3.033
2.012SerLys: 2.012 ± 1.3
8.048SerLeu: 8.048 ± 1.359
4.024SerMet: 4.024 ± 0.726
2.012SerAsn: 2.012 ± 1.3
2.683SerPro: 2.683 ± 0.141
2.683SerGln: 2.683 ± 1.733
5.366SerArg: 5.366 ± 0.655
2.683SerSer: 2.683 ± 1.078
4.695SerThr: 4.695 ± 2.589
7.378SerVal: 7.378 ± 0.081
0.0SerTrp: 0.0 ± 0.0
5.366SerTyr: 5.366 ± 0.282
0.0SerXaa: 0.0 ± 0.0
Thr
3.353ThrAla: 3.353 ± 1.582
0.671ThrCys: 0.671 ± 0.504
3.353ThrAsp: 3.353 ± 0.644
2.683ThrGlu: 2.683 ± 0.141
2.012ThrPhe: 2.012 ± 0.363
4.024ThrGly: 4.024 ± 0.211
2.012ThrHis: 2.012 ± 0.574
3.353ThrIle: 3.353 ± 1.582
3.353ThrLys: 3.353 ± 1.23
8.719ThrLeu: 8.719 ± 1.863
3.353ThrMet: 3.353 ± 0.893
2.012ThrAsn: 2.012 ± 0.574
2.012ThrPro: 2.012 ± 1.511
4.695ThrGln: 4.695 ± 2.589
5.366ThrArg: 5.366 ± 1.593
3.353ThrSer: 3.353 ± 0.293
5.366ThrThr: 5.366 ± 2.156
6.036ThrVal: 6.036 ± 1.089
0.0ThrTrp: 0.0 ± 0.0
1.341ThrTyr: 1.341 ± 1.007
0.0ThrXaa: 0.0 ± 0.0
Val
6.707ValAla: 6.707 ± 0.585
0.0ValCys: 0.0 ± 0.0
4.695ValAsp: 4.695 ± 0.222
4.024ValGlu: 4.024 ± 0.211
3.353ValPhe: 3.353 ± 1.582
2.012ValGly: 2.012 ± 1.3
2.012ValHis: 2.012 ± 0.363
5.366ValIle: 5.366 ± 0.655
4.695ValLys: 4.695 ± 0.715
9.39ValLeu: 9.39 ± 4.192
2.012ValMet: 2.012 ± 0.363
4.695ValAsn: 4.695 ± 0.222
4.695ValPro: 4.695 ± 0.715
4.024ValGln: 4.024 ± 0.211
4.024ValArg: 4.024 ± 2.6
7.378ValSer: 7.378 ± 0.856
5.366ValThr: 5.366 ± 1.219
8.719ValVal: 8.719 ± 1.863
1.341ValTrp: 1.341 ± 0.07
3.353ValTyr: 3.353 ± 0.293
0.0ValXaa: 0.0 ± 0.0
Trp
1.341TrpAla: 1.341 ± 0.07
0.0TrpCys: 0.0 ± 0.0
1.341TrpAsp: 1.341 ± 1.007
0.671TrpGlu: 0.671 ± 0.504
0.0TrpPhe: 0.0 ± 0.0
1.341TrpGly: 1.341 ± 0.867
0.0TrpHis: 0.0 ± 0.0
0.671TrpIle: 0.671 ± 0.504
0.671TrpLys: 0.671 ± 0.433
0.671TrpLeu: 0.671 ± 0.433
0.671TrpMet: 0.671 ± 0.504
0.671TrpAsn: 0.671 ± 0.433
0.0TrpPro: 0.0 ± 0.0
1.341TrpGln: 1.341 ± 0.07
1.341TrpArg: 1.341 ± 0.07
1.341TrpSer: 1.341 ± 0.867
0.671TrpThr: 0.671 ± 0.433
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.341TyrAla: 1.341 ± 0.867
0.0TyrCys: 0.0 ± 0.0
2.012TyrAsp: 2.012 ± 1.511
2.012TyrGlu: 2.012 ± 0.363
1.341TyrPhe: 1.341 ± 0.07
4.024TyrGly: 4.024 ± 1.148
0.0TyrHis: 0.0 ± 0.0
1.341TyrIle: 1.341 ± 0.07
1.341TyrLys: 1.341 ± 0.867
5.366TyrLeu: 5.366 ± 2.156
0.671TyrMet: 0.671 ± 0.433
6.036TyrAsn: 6.036 ± 0.785
0.0TyrPro: 0.0 ± 0.0
2.683TyrGln: 2.683 ± 0.796
2.683TyrArg: 2.683 ± 0.141
1.341TyrSer: 1.341 ± 0.07
3.353TyrThr: 3.353 ± 0.644
4.695TyrVal: 4.695 ± 0.715
0.0TyrTrp: 0.0 ± 0.0
4.024TyrTyr: 4.024 ± 1.148
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1492 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
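A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is reported. This is an assumption-laden illustration; the function names are hypothetical, and the page does not state which spread measure (here the population standard deviation) or which random seed was used.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Estimate the error of one dipeptide frequency (per mille).

    Assumption: proteins are resampled with replacement and the error is
    the standard deviation over replicates; the source may differ.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(seq) for seq in proteins]
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as the proteome contains, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)


if __name__ == "__main__":
    # Toy sequences, not the two proteins behind the table.
    print(bootstrap_error(["AAAA", "CCCC"], "AA"))
```

With only two proteins, as in this proteome, each replicate can take very few distinct values, so the resulting error bars are coarse.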

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski