Amino acid dipeptide frequency for Beihai weivirus-like virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
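As an illustration of the idea, the following minimal Python sketch counts overlapping residue pairs and reports them in per mille. It is not the code used by Proteome-pI: the function name dipeptide_frequencies, the one-letter input alphabet, the mapping of non-standard residues to X (Xaa), and the choice not to count pairs that span two proteins are all assumptions made for this example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (the table uses three-letter codes).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} pooled over all sequences.

    Hypothetical helper for illustration only.
    """
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input: "AAGA" gives AA, AG, GA; "AGU" becomes "AGX" and gives AG, GX.
freqs = dipeptide_frequencies(["AAGA", "AGU"])
```

In the toy input, AG accounts for two of the five counted pairs, so it receives the largest share.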

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 standard dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red in the table, and underrepresented ones are marked in blue.
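The uniform expectation can be checked with a few lines of Python. This is only a sketch: the classify helper and its plain comparison against the expected value are assumptions, since the page does not state the exact rule behind the red and blue marking (for example, whether the bootstrap error is taken into account). The three example values are taken from the table.

```python
N_STANDARD = 20                                  # standard amino acids
EXPECTED_PER_MILLE = 1000.0 / N_STANDARD ** 2    # 1/400 = 0.25% = 2.5 per mille


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Hypothetical rule: compare a frequency with the uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"


# Values copied from the table (in per mille).
examples = {"AlaAla": 23.994, "CysCys": 0.0, "AspAsp": 2.322}
labels = {pair: classify(value) for pair, value in examples.items()}
```

Note that this comparison ignores the single amino acid composition of the proteome, so a dipeptide made of two abundant residues will tend to look overrepresented.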

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
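For readers who prefer fractions or percentages, the conversion is a single multiplication. The helper names below are made up for this example.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


# AlaAla from the table: 23.994 per mille.
ala_ala_fraction = per_mille_to_fraction(23.994)
ala_ala_percent = per_mille_to_percent(23.994)
```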

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± bootstrap error), grouped by first residue. Within each group the second residue runs in the order Ala, Cys, Asp, Glu, Phe, Gly, His, Ile, Lys, Leu, Met, Asn, Pro, Gln, Arg, Ser, Thr, Val, Trp, Tyr, Xaa.

Ala
AlaAla: 23.994 ± 4.884
AlaCys: 0.0 ± 0.0
AlaAsp: 5.418 ± 1.32
AlaGlu: 5.418 ± 1.32
AlaPhe: 2.322 ± 0.244
AlaGly: 13.158 ± 4.69
AlaHis: 3.096 ± 0.147
AlaIle: 10.062 ± 2.249
AlaLys: 4.644 ± 0.929
AlaLeu: 7.74 ± 1.076
AlaMet: 3.096 ± 1.27
AlaAsn: 2.322 ± 0.244
AlaPro: 17.028 ± 2.934
AlaGln: 4.644 ± 1.905
AlaArg: 10.836 ± 5.863
AlaSer: 4.644 ± 0.929
AlaThr: 3.096 ± 1.27
AlaVal: 10.836 ± 3.029
AlaTrp: 3.096 ± 1.564
AlaTyr: 2.322 ± 0.244
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.774 ± 0.391
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 1.548 ± 0.782
CysGly: 0.774 ± 0.391
CysHis: 0.774 ± 1.026
CysIle: 0.774 ± 0.391
CysLys: 0.774 ± 0.391
CysLeu: 2.322 ± 1.173
CysMet: 0.0 ± 0.0
CysAsn: 0.774 ± 0.391
CysPro: 0.774 ± 0.391
CysGln: 1.548 ± 0.782
CysArg: 0.0 ± 0.0
CysSer: 1.548 ± 0.782
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0

Asp
AspAla: 5.418 ± 1.32
AspCys: 0.774 ± 0.391
AspAsp: 2.322 ± 0.244
AspGlu: 4.644 ± 0.929
AspPhe: 2.322 ± 0.244
AspGly: 3.87 ± 0.538
AspHis: 0.774 ± 0.391
AspIle: 0.774 ± 0.391
AspLys: 1.548 ± 0.782
AspLeu: 3.096 ± 0.147
AspMet: 1.548 ± 0.782
AspAsn: 0.774 ± 0.391
AspPro: 3.096 ± 0.147
AspGln: 0.774 ± 1.026
AspArg: 6.966 ± 2.102
AspSer: 2.322 ± 1.661
AspThr: 3.096 ± 2.687
AspVal: 2.322 ± 1.173
AspTrp: 0.0 ± 0.0
AspTyr: 3.096 ± 0.147
AspXaa: 0.0 ± 0.0

Glu
GluAla: 6.966 ± 2.102
GluCys: 0.0 ± 0.0
GluAsp: 5.418 ± 2.737
GluGlu: 3.096 ± 1.564
GluPhe: 2.322 ± 0.244
GluGly: 6.966 ± 0.732
GluHis: 1.548 ± 0.782
GluIle: 0.774 ± 0.391
GluLys: 3.87 ± 1.955
GluLeu: 3.096 ± 0.147
GluMet: 3.096 ± 1.564
GluAsn: 0.774 ± 0.391
GluPro: 5.418 ± 1.32
GluGln: 2.322 ± 0.244
GluArg: 3.87 ± 1.955
GluSer: 2.322 ± 1.173
GluThr: 3.096 ± 1.564
GluVal: 6.966 ± 3.519
GluTrp: 0.774 ± 0.391
GluTyr: 0.774 ± 0.391
GluXaa: 0.0 ± 0.0

Phe
PheAla: 4.644 ± 0.929
PheCys: 1.548 ± 0.782
PheAsp: 1.548 ± 0.635
PheGlu: 3.87 ± 1.955
PhePhe: 1.548 ± 0.782
PheGly: 2.322 ± 1.173
PheHis: 0.774 ± 0.391
PheIle: 1.548 ± 0.635
PheLys: 1.548 ± 0.782
PheLeu: 2.322 ± 0.244
PheMet: 0.0 ± 0.0
PheAsn: 1.548 ± 0.635
PhePro: 0.774 ± 1.026
PheGln: 0.0 ± 0.0
PheArg: 2.322 ± 1.173
PheSer: 1.548 ± 0.782
PheThr: 0.774 ± 0.391
PheVal: 3.096 ± 0.147
PheTrp: 2.322 ± 0.244
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 10.836 ± 4.446
GlyCys: 0.774 ± 0.391
GlyAsp: 3.096 ± 1.564
GlyGlu: 2.322 ± 1.173
GlyPhe: 3.096 ± 1.564
GlyGly: 11.61 ± 4.055
GlyHis: 2.322 ± 0.244
GlyIle: 3.096 ± 0.147
GlyLys: 3.096 ± 1.564
GlyLeu: 5.418 ± 1.32
GlyMet: 2.322 ± 0.244
GlyAsn: 3.096 ± 1.27
GlyPro: 2.322 ± 0.244
GlyGln: 3.87 ± 0.879
GlyArg: 2.322 ± 1.661
GlySer: 5.418 ± 1.514
GlyThr: 3.096 ± 2.687
GlyVal: 7.74 ± 0.341
GlyTrp: 3.096 ± 1.27
GlyTyr: 0.774 ± 0.391
GlyXaa: 0.0 ± 0.0

His
HisAla: 2.322 ± 1.173
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 2.322 ± 1.173
HisPhe: 0.774 ± 0.391
HisGly: 2.322 ± 0.244
HisHis: 0.774 ± 0.391
HisIle: 1.548 ± 2.052
HisLys: 0.0 ± 0.0
HisLeu: 3.096 ± 1.564
HisMet: 0.774 ± 0.391
HisAsn: 0.774 ± 1.026
HisPro: 2.322 ± 1.661
HisGln: 0.0 ± 0.0
HisArg: 1.548 ± 0.782
HisSer: 0.774 ± 0.391
HisThr: 0.0 ± 0.0
HisVal: 3.096 ± 1.564
HisTrp: 0.0 ± 0.0
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0

Ile
IleAla: 6.966 ± 0.685
IleCys: 0.774 ± 1.026
IleAsp: 0.0 ± 0.0
IleGlu: 5.418 ± 0.097
IlePhe: 1.548 ± 0.635
IleGly: 1.548 ± 0.782
IleHis: 0.0 ± 0.0
IleIle: 1.548 ± 0.782
IleLys: 1.548 ± 0.782
IleLeu: 4.644 ± 0.929
IleMet: 0.774 ± 0.391
IleAsn: 0.0 ± 0.0
IlePro: 2.322 ± 1.661
IleGln: 0.0 ± 0.0
IleArg: 1.548 ± 0.782
IleSer: 0.774 ± 0.391
IleThr: 3.096 ± 0.147
IleVal: 1.548 ± 0.635
IleTrp: 1.548 ± 0.635
IleTyr: 1.548 ± 0.635
IleXaa: 0.0 ± 0.0

Lys
LysAla: 4.644 ± 0.488
LysCys: 0.774 ± 0.391
LysAsp: 1.548 ± 0.782
LysGlu: 3.87 ± 1.955
LysPhe: 3.87 ± 1.955
LysGly: 1.548 ± 0.635
LysHis: 1.548 ± 0.782
LysIle: 1.548 ± 0.782
LysLys: 3.87 ± 1.955
LysLeu: 2.322 ± 1.173
LysMet: 0.0 ± 0.0
LysAsn: 1.548 ± 0.782
LysPro: 2.322 ± 0.244
LysGln: 1.548 ± 0.782
LysArg: 4.644 ± 0.929
LysSer: 1.548 ± 0.782
LysThr: 0.774 ± 1.026
LysVal: 2.322 ± 0.244
LysTrp: 0.774 ± 0.391
LysTyr: 2.322 ± 0.244
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 10.836 ± 4.058
LeuCys: 0.774 ± 0.391
LeuAsp: 6.192 ± 1.123
LeuGlu: 2.322 ± 1.173
LeuPhe: 5.418 ± 1.32
LeuGly: 2.322 ± 0.244
LeuHis: 1.548 ± 0.782
LeuIle: 3.096 ± 0.147
LeuLys: 3.096 ± 1.27
LeuLeu: 8.514 ± 0.05
LeuMet: 3.096 ± 0.814
LeuAsn: 3.87 ± 0.879
LeuPro: 4.644 ± 0.488
LeuGln: 1.548 ± 0.635
LeuArg: 3.87 ± 1.955
LeuSer: 4.644 ± 1.905
LeuThr: 2.322 ± 0.244
LeuVal: 3.87 ± 0.538
LeuTrp: 3.096 ± 1.564
LeuTyr: 0.774 ± 0.391
LeuXaa: 0.0 ± 0.0

Met
MetAla: 4.644 ± 0.488
MetCys: 0.0 ± 0.0
MetAsp: 1.548 ± 0.782
MetGlu: 1.548 ± 0.635
MetPhe: 0.0 ± 0.0
MetGly: 2.322 ± 1.661
MetHis: 1.548 ± 0.782
MetIle: 0.774 ± 0.391
MetLys: 1.548 ± 0.635
MetLeu: 3.096 ± 1.27
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 0.774 ± 0.391
MetGln: 0.774 ± 0.391
MetArg: 1.548 ± 0.782
MetSer: 0.774 ± 1.026
MetThr: 0.774 ± 1.026
MetVal: 1.548 ± 0.782
MetTrp: 0.774 ± 0.391
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 5.418 ± 7.183
AsnCys: 0.0 ± 0.0
AsnAsp: 0.774 ± 0.391
AsnGlu: 1.548 ± 0.782
AsnPhe: 0.774 ± 0.391
AsnGly: 1.548 ± 0.782
AsnHis: 0.0 ± 0.0
AsnIle: 0.774 ± 0.391
AsnLys: 2.322 ± 0.244
AsnLeu: 2.322 ± 0.244
AsnMet: 1.548 ± 0.782
AsnAsn: 0.0 ± 0.0
AsnPro: 1.548 ± 0.635
AsnGln: 0.774 ± 0.391
AsnArg: 0.774 ± 1.026
AsnSer: 0.774 ± 0.391
AsnThr: 3.87 ± 0.879
AsnVal: 2.322 ± 0.244
AsnTrp: 0.0 ± 0.0
AsnTyr: 1.548 ± 0.782
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 13.158 ± 0.979
ProCys: 0.774 ± 0.391
ProAsp: 2.322 ± 0.244
ProGlu: 6.192 ± 3.128
ProPhe: 2.322 ± 0.244
ProGly: 2.322 ± 0.244
ProHis: 0.774 ± 0.391
ProIle: 0.0 ± 0.0
ProLys: 1.548 ± 2.052
ProLeu: 6.192 ± 2.54
ProMet: 2.322 ± 3.079
ProAsn: 0.774 ± 0.391
ProPro: 9.288 ± 4.693
ProGln: 1.548 ± 0.635
ProArg: 9.288 ± 0.976
ProSer: 6.966 ± 2.149
ProThr: 2.322 ± 1.661
ProVal: 1.548 ± 0.782
ProTrp: 0.0 ± 0.0
ProTyr: 0.0 ± 0.0
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 3.096 ± 2.687
GlnCys: 0.0 ± 0.0
GlnAsp: 1.548 ± 0.782
GlnGlu: 1.548 ± 0.782
GlnPhe: 0.0 ± 0.0
GlnGly: 3.096 ± 0.147
GlnHis: 0.774 ± 0.391
GlnIle: 0.774 ± 0.391
GlnLys: 1.548 ± 0.782
GlnLeu: 0.0 ± 0.0
GlnMet: 0.774 ± 0.391
GlnAsn: 2.322 ± 3.079
GlnPro: 2.322 ± 1.661
GlnGln: 0.0 ± 0.0
GlnArg: 3.096 ± 2.687
GlnSer: 1.548 ± 0.782
GlnThr: 1.548 ± 0.635
GlnVal: 3.096 ± 0.147
GlnTrp: 0.774 ± 1.026
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 12.384 ± 2.246
ArgCys: 1.548 ± 0.782
ArgAsp: 3.87 ± 0.538
ArgGlu: 5.418 ± 1.32
ArgPhe: 2.322 ± 1.173
ArgGly: 6.966 ± 3.567
ArgHis: 0.774 ± 0.391
ArgIle: 2.322 ± 1.173
ArgLys: 3.096 ± 1.564
ArgLeu: 6.192 ± 3.128
ArgMet: 0.0 ± 0.0
ArgAsn: 0.774 ± 1.026
ArgPro: 4.644 ± 3.323
ArgGln: 1.548 ± 2.052
ArgArg: 9.288 ± 0.976
ArgSer: 5.418 ± 0.097
ArgThr: 3.096 ± 0.147
ArgVal: 6.192 ± 1.711
ArgTrp: 2.322 ± 0.244
ArgTyr: 1.548 ± 0.635
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 6.192 ± 1.123
SerCys: 0.774 ± 0.391
SerAsp: 1.548 ± 0.782
SerGlu: 1.548 ± 0.635
SerPhe: 0.0 ± 0.0
SerGly: 5.418 ± 1.32
SerHis: 1.548 ± 2.052
SerIle: 1.548 ± 0.635
SerLys: 3.87 ± 0.538
SerLeu: 4.644 ± 0.488
SerMet: 0.0 ± 0.0
SerAsn: 3.87 ± 1.955
SerPro: 2.322 ± 1.173
SerGln: 0.774 ± 0.391
SerArg: 3.096 ± 1.27
SerSer: 3.096 ± 1.27
SerThr: 4.644 ± 1.905
SerVal: 6.192 ± 0.294
SerTrp: 1.548 ± 0.635
SerTyr: 0.774 ± 0.391
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 5.418 ± 1.514
ThrCys: 0.774 ± 0.391
ThrAsp: 3.096 ± 2.687
ThrGlu: 3.096 ± 1.564
ThrPhe: 1.548 ± 2.052
ThrGly: 3.096 ± 1.27
ThrHis: 0.774 ± 0.391
ThrIle: 3.096 ± 4.105
ThrLys: 0.774 ± 0.391
ThrLeu: 2.322 ± 0.244
ThrMet: 1.548 ± 0.635
ThrAsn: 0.774 ± 1.026
ThrPro: 1.548 ± 0.635
ThrGln: 1.548 ± 2.052
ThrArg: 4.644 ± 0.488
ThrSer: 2.322 ± 1.173
ThrThr: 3.096 ± 0.147
ThrVal: 3.87 ± 0.538
ThrTrp: 0.0 ± 0.0
ThrTyr: 1.548 ± 0.635
ThrXaa: 0.0 ± 0.0

Val
ValAla: 6.966 ± 0.685
ValCys: 2.322 ± 1.173
ValAsp: 4.644 ± 0.488
ValGlu: 6.192 ± 3.128
ValPhe: 0.774 ± 0.391
ValGly: 7.74 ± 2.493
ValHis: 0.774 ± 0.391
ValIle: 3.096 ± 0.147
ValLys: 3.096 ± 1.564
ValLeu: 5.418 ± 0.097
ValMet: 2.322 ± 0.287
ValAsn: 3.87 ± 0.879
ValPro: 3.87 ± 0.879
ValGln: 3.87 ± 0.879
ValArg: 4.644 ± 0.929
ValSer: 3.87 ± 0.879
ValThr: 4.644 ± 0.488
ValVal: 6.192 ± 3.958
ValTrp: 0.0 ± 0.0
ValTyr: 1.548 ± 0.782
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.774 ± 0.391
TrpCys: 0.0 ± 0.0
TrpAsp: 3.096 ± 0.147
TrpGlu: 1.548 ± 0.782
TrpPhe: 1.548 ± 0.782
TrpGly: 0.0 ± 0.0
TrpHis: 0.774 ± 1.026
TrpIle: 0.0 ± 0.0
TrpLys: 0.0 ± 0.0
TrpLeu: 1.548 ± 0.635
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 2.322 ± 1.661
TrpGln: 0.0 ± 0.0
TrpArg: 3.096 ± 1.564
TrpSer: 1.548 ± 0.635
TrpThr: 0.774 ± 0.391
TrpVal: 3.87 ± 0.538
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.322 ± 1.173
TyrCys: 0.774 ± 0.391
TyrAsp: 1.548 ± 0.635
TyrGlu: 1.548 ± 0.635
TyrPhe: 0.0 ± 0.0
TyrGly: 0.774 ± 1.026
TyrHis: 1.548 ± 0.782
TyrIle: 0.0 ± 0.0
TyrLys: 1.548 ± 0.782
TyrLeu: 1.548 ± 0.635
TyrMet: 0.0 ± 0.0
TyrAsn: 0.774 ± 0.391
TyrPro: 0.0 ± 0.0
TyrGln: 0.774 ± 0.391
TyrArg: 2.322 ± 0.244
TyrSer: 1.548 ± 0.782
TyrThr: 0.774 ± 1.026
TyrVal: 0.0 ± 0.0
TyrTrp: 0.774 ± 0.391
TyrTyr: 0.774 ± 1.026
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 2 proteins (1293 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
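A protein-level bootstrap can be sketched as follows. This is an assumption-laden illustration, not the Proteome-pI implementation: the function names, the use of the population standard deviation as the reported error, the fixed random seed, and the pooling of dipeptide counts across the resampled proteins are all choices made for this example.

```python
import random
import statistics
from collections import Counter


def pair_frequency(sequences, pair):
    """Frequency (per mille) of one dipeptide, pooled over the given sequences."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Resample whole proteins with replacement and return the spread of the estimate."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_frequency(sample, pair))
    # Assumption: the error is the standard deviation across resamples.
    return statistics.pstdev(estimates)


# Toy proteome with two very different proteins.
toy_error = bootstrap_error(["AAAA", "GGGG"], "AA")
```

With only two proteins in the proteome, each resample contains one of very few possible protein combinations, so the error estimates in the table should be read with that limitation in mind.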

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski