Amino acid dipeptide frequency for Beihai weivirus-like virus 5

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
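As an illustration only, dipeptide frequencies can be sketched as counts of overlapping adjacent residue pairs, scanned within each protein and expressed in per mille. The function name `dipeptide_per_mille`, the one-letter sequences, and the choice of denominator (total number of pairs) are assumptions made for this example; the database may count or normalise differently.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Hypothetical helper: sequences are one-letter strings, and pairs are
    counted inside each protein only, never across two proteins.
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (not the real proteome): 7 + 3 = 10 overlapping pairs in total.
freqs = dipeptide_per_mille(["MKVLAAGA", "AAGK"])
print(freqs["AA"])  # "AA" occurs twice among the 10 pairs
```

The frequencies of all observed dipeptides sum to 1000 per mille, which is a quick sanity check on any such table.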

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred with equal probability in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones in blue, in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
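The arithmetic behind these statements can be checked with a short sketch. It assumes the uniform expectation over the 400 standard dipeptides (1/400 = 0.25% = 2.5‰) is the reference for over- and underrepresentation; the exact threshold used for the colouring is not stated here, so `classify` and its cut-off are hypothetical.

```python
N_STANDARD = 20                      # standard amino acids
N_PAIRS = N_STANDARD ** 2            # 400 ordered dipeptides
UNIFORM_PER_MILLE = 1000 / N_PAIRS   # 2.5 per mille, i.e. 0.25%


def to_fraction(per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return per_mille * 1e-3


def classify(per_mille, expected=UNIFORM_PER_MILLE):
    """Label a value relative to the assumed uniform expectation."""
    if per_mille > expected:
        return "over"
    if per_mille < expected:
        return "under"
    return "expected"


# Two values taken from the table: AlaAla (10.223) and CysAla (0.929).
print(to_fraction(10.223), classify(10.223))
print(to_fraction(0.929), classify(0.929))
```

If Xaa were included as a 21st residue, the same calculation with 441 pairs would give an expectation of roughly 2.27‰ instead.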

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 10.223 ± 0.722
AlaCys: 3.717 ± 3.912
AlaAsp: 1.859 ± 0.911
AlaGlu: 2.788 ± 1.367
AlaPhe: 5.576 ± 1.089
AlaGly: 7.435 ± 0.178
AlaHis: 2.788 ± 0.545
AlaIle: 5.576 ± 0.823
AlaLys: 7.435 ± 1.734
AlaLeu: 10.223 ± 1.189
AlaMet: 2.788 ± 2.249
AlaAsn: 3.717 ± 2.0
AlaPro: 4.647 ± 0.367
AlaGln: 2.788 ± 0.545
AlaArg: 4.647 ± 3.456
AlaSer: 3.717 ± 0.089
AlaThr: 5.576 ± 3.001
AlaVal: 9.294 ± 1.178
AlaTrp: 1.859 ± 0.911
AlaTyr: 1.859 ± 1.0
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.929 ± 1.456
CysCys: 0.0 ± 0.0
CysAsp: 1.859 ± 1.0
CysGlu: 0.929 ± 0.456
CysPhe: 1.859 ± 0.911
CysGly: 1.859 ± 1.0
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 1.859 ± 0.911
CysLeu: 0.929 ± 0.456
CysMet: 2.788 ± 1.367
CysAsn: 1.859 ± 0.911
CysPro: 1.859 ± 1.0
CysGln: 1.859 ± 0.911
CysArg: 0.0 ± 0.0
CysSer: 0.0 ± 0.0
CysThr: 1.859 ± 0.911
CysVal: 0.929 ± 1.456
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.647 ± 2.278
AspCys: 0.0 ± 0.0
AspAsp: 2.788 ± 1.367
AspGlu: 4.647 ± 2.278
AspPhe: 3.717 ± 1.823
AspGly: 5.576 ± 2.734
AspHis: 0.929 ± 0.456
AspIle: 0.0 ± 0.0
AspLys: 1.859 ± 0.911
AspLeu: 2.788 ± 1.367
AspMet: 1.859 ± 1.0
AspAsn: 0.0 ± 0.0
AspPro: 2.788 ± 1.367
AspGln: 0.0 ± 0.0
AspArg: 2.788 ± 1.367
AspSer: 1.859 ± 1.0
AspThr: 3.717 ± 0.089
AspVal: 5.576 ± 1.089
AspTrp: 0.0 ± 0.0
AspTyr: 0.929 ± 0.456
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.647 ± 0.367
GluCys: 1.859 ± 0.911
GluAsp: 2.788 ± 1.367
GluGlu: 5.576 ± 2.734
GluPhe: 0.929 ± 0.456
GluGly: 6.506 ± 3.19
GluHis: 2.788 ± 1.367
GluIle: 0.929 ± 0.456
GluLys: 2.788 ± 1.367
GluLeu: 4.647 ± 2.278
GluMet: 0.929 ± 0.456
GluAsn: 3.717 ± 1.823
GluPro: 2.788 ± 0.545
GluGln: 0.0 ± 0.0
GluArg: 2.788 ± 1.367
GluSer: 0.929 ± 0.456
GluThr: 1.859 ± 0.911
GluVal: 0.929 ± 0.456
GluTrp: 1.859 ± 0.911
GluTyr: 2.788 ± 0.545
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.647 ± 0.367
PheCys: 0.929 ± 0.456
PheAsp: 1.859 ± 0.911
PheGlu: 5.576 ± 2.734
PhePhe: 1.859 ± 1.0
PheGly: 3.717 ± 2.0
PheHis: 0.0 ± 0.0
PheIle: 0.929 ± 1.456
PheLys: 2.788 ± 1.367
PheLeu: 2.788 ± 1.367
PheMet: 0.929 ± 0.456
PheAsn: 2.788 ± 0.545
PhePro: 0.929 ± 0.456
PheGln: 0.0 ± 0.0
PheArg: 3.717 ± 0.089
PheSer: 4.647 ± 2.278
PheThr: 2.788 ± 2.456
PheVal: 2.788 ± 1.367
PheTrp: 0.929 ± 0.456
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.647 ± 3.456
GlyCys: 3.717 ± 0.089
GlyAsp: 7.435 ± 1.734
GlyGlu: 3.717 ± 0.089
GlyPhe: 2.788 ± 1.367
GlyGly: 8.364 ± 3.545
GlyHis: 0.929 ± 0.456
GlyIle: 1.859 ± 0.911
GlyLys: 4.647 ± 1.545
GlyLeu: 5.576 ± 1.089
GlyMet: 0.929 ± 1.456
GlyAsn: 2.788 ± 1.367
GlyPro: 3.717 ± 0.089
GlyGln: 4.647 ± 0.367
GlyArg: 4.647 ± 0.367
GlySer: 4.647 ± 0.367
GlyThr: 5.576 ± 0.823
GlyVal: 6.506 ± 4.456
GlyTrp: 2.788 ± 0.545
GlyTyr: 0.929 ± 0.456
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.929 ± 1.456
HisCys: 0.0 ± 0.0
HisAsp: 1.859 ± 1.0
HisGlu: 0.0 ± 0.0
HisPhe: 2.788 ± 0.545
HisGly: 0.929 ± 0.456
HisHis: 0.0 ± 0.0
HisIle: 0.929 ± 0.456
HisLys: 0.929 ± 0.456
HisLeu: 1.859 ± 1.0
HisMet: 1.859 ± 0.911
HisAsn: 0.0 ± 0.0
HisPro: 0.929 ± 0.456
HisGln: 0.929 ± 0.456
HisArg: 1.859 ± 0.911
HisSer: 0.0 ± 0.0
HisThr: 0.929 ± 1.456
HisVal: 2.788 ± 1.367
HisTrp: 0.0 ± 0.0
HisTyr: 0.929 ± 0.456
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.859 ± 0.911
IleCys: 0.0 ± 0.0
IleAsp: 1.859 ± 0.911
IleGlu: 1.859 ± 0.911
IlePhe: 0.929 ± 1.456
IleGly: 3.717 ± 1.823
IleHis: 0.0 ± 0.0
IleIle: 0.0 ± 0.0
IleLys: 1.859 ± 0.911
IleLeu: 0.929 ± 0.456
IleMet: 0.929 ± 0.456
IleAsn: 2.788 ± 0.545
IlePro: 0.929 ± 0.456
IleGln: 0.0 ± 0.0
IleArg: 4.647 ± 0.367
IleSer: 1.859 ± 1.0
IleThr: 6.506 ± 0.633
IleVal: 2.788 ± 1.367
IleTrp: 0.0 ± 0.0
IleTyr: 0.0 ± 0.0
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.506 ± 0.633
LysCys: 1.859 ± 0.911
LysAsp: 3.717 ± 1.823
LysGlu: 2.788 ± 1.367
LysPhe: 0.929 ± 0.456
LysGly: 2.788 ± 2.456
LysHis: 1.859 ± 0.911
LysIle: 0.929 ± 0.456
LysLys: 4.647 ± 1.545
LysLeu: 5.576 ± 0.823
LysMet: 0.929 ± 0.456
LysAsn: 0.929 ± 1.456
LysPro: 4.647 ± 0.367
LysGln: 0.0 ± 0.0
LysArg: 2.788 ± 1.367
LysSer: 4.647 ± 0.367
LysThr: 3.717 ± 1.823
LysVal: 0.929 ± 1.456
LysTrp: 0.929 ± 0.456
LysTyr: 1.859 ± 0.911
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 10.223 ± 1.189
LeuCys: 2.788 ± 1.367
LeuAsp: 4.647 ± 0.367
LeuGlu: 2.788 ± 1.367
LeuPhe: 1.859 ± 0.911
LeuGly: 6.506 ± 4.456
LeuHis: 0.0 ± 0.0
LeuIle: 2.788 ± 1.367
LeuLys: 4.647 ± 0.367
LeuLeu: 3.717 ± 0.089
LeuMet: 0.929 ± 0.456
LeuAsn: 2.788 ± 0.545
LeuPro: 9.294 ± 3.089
LeuGln: 2.788 ± 2.456
LeuArg: 8.364 ± 2.19
LeuSer: 1.859 ± 0.911
LeuThr: 5.576 ± 1.089
LeuVal: 4.647 ± 2.278
LeuTrp: 2.788 ± 1.367
LeuTyr: 1.859 ± 0.911
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.717 ± 2.0
MetCys: 0.929 ± 1.456
MetAsp: 2.788 ± 0.545
MetGlu: 3.717 ± 0.089
MetPhe: 0.0 ± 0.0
MetGly: 0.929 ± 0.456
MetHis: 0.0 ± 0.0
MetIle: 1.859 ± 1.0
MetLys: 0.929 ± 0.456
MetLeu: 1.859 ± 0.911
MetMet: 0.0 ± 0.0
MetAsn: 0.929 ± 0.456
MetPro: 2.788 ± 0.545
MetGln: 0.0 ± 0.0
MetArg: 2.788 ± 1.367
MetSer: 3.717 ± 2.0
MetThr: 2.788 ± 0.545
MetVal: 1.859 ± 0.911
MetTrp: 0.0 ± 0.0
MetTyr: 0.929 ± 0.456
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 9.294 ± 6.913
AsnCys: 0.929 ± 0.456
AsnAsp: 0.929 ± 0.456
AsnGlu: 0.929 ± 0.456
AsnPhe: 2.788 ± 1.367
AsnGly: 3.717 ± 0.089
AsnHis: 0.929 ± 0.456
AsnIle: 0.929 ± 0.456
AsnLys: 0.929 ± 0.456
AsnLeu: 1.859 ± 0.911
AsnMet: 0.929 ± 0.456
AsnAsn: 2.788 ± 0.545
AsnPro: 1.859 ± 1.0
AsnGln: 1.859 ± 1.0
AsnArg: 1.859 ± 1.0
AsnSer: 1.859 ± 1.0
AsnThr: 0.929 ± 1.456
AsnVal: 3.717 ± 1.823
AsnTrp: 0.929 ± 0.456
AsnTyr: 0.0 ± 0.0
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.647 ± 1.545
ProCys: 0.929 ± 0.456
ProAsp: 2.788 ± 1.367
ProGlu: 4.647 ± 2.278
ProPhe: 0.0 ± 0.0
ProGly: 2.788 ± 0.545
ProHis: 1.859 ± 0.911
ProIle: 1.859 ± 0.911
ProLys: 2.788 ± 0.545
ProLeu: 2.788 ± 2.456
ProMet: 2.788 ± 0.545
ProAsn: 3.717 ± 0.089
ProPro: 5.576 ± 2.734
ProGln: 0.929 ± 0.456
ProArg: 6.506 ± 4.456
ProSer: 4.647 ± 0.367
ProThr: 4.647 ± 0.367
ProVal: 3.717 ± 2.0
ProTrp: 1.859 ± 1.0
ProTyr: 1.859 ± 2.912
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.788 ± 0.545
GlnCys: 0.0 ± 0.0
GlnAsp: 0.929 ± 0.456
GlnGlu: 1.859 ± 0.911
GlnPhe: 1.859 ± 2.912
GlnGly: 0.929 ± 1.456
GlnHis: 0.0 ± 0.0
GlnIle: 0.929 ± 0.456
GlnLys: 0.0 ± 0.0
GlnLeu: 2.788 ± 0.545
GlnMet: 0.0 ± 0.0
GlnAsn: 0.929 ± 1.456
GlnPro: 0.0 ± 0.0
GlnGln: 0.929 ± 1.456
GlnArg: 1.859 ± 1.0
GlnSer: 3.717 ± 0.089
GlnThr: 0.929 ± 0.456
GlnVal: 2.788 ± 2.456
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.929 ± 0.456
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.435 ± 1.734
ArgCys: 1.859 ± 0.911
ArgAsp: 1.859 ± 0.911
ArgGlu: 1.859 ± 0.911
ArgPhe: 3.717 ± 1.823
ArgGly: 5.576 ± 0.823
ArgHis: 2.788 ± 0.545
ArgIle: 3.717 ± 1.823
ArgLys: 1.859 ± 1.0
ArgLeu: 6.506 ± 2.545
ArgMet: 0.929 ± 0.456
ArgAsn: 2.788 ± 1.367
ArgPro: 5.576 ± 3.001
ArgGln: 0.0 ± 0.0
ArgArg: 7.435 ± 2.089
ArgSer: 4.647 ± 0.367
ArgThr: 5.576 ± 0.823
ArgVal: 3.717 ± 2.0
ArgTrp: 2.788 ± 2.456
ArgTyr: 1.859 ± 0.911
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.506 ± 0.633
SerCys: 0.929 ± 0.456
SerAsp: 1.859 ± 0.911
SerGlu: 0.929 ± 1.456
SerPhe: 2.788 ± 0.545
SerGly: 8.364 ± 0.278
SerHis: 0.929 ± 0.456
SerIle: 4.647 ± 0.367
SerLys: 1.859 ± 0.911
SerLeu: 10.223 ± 0.722
SerMet: 0.929 ± 1.456
SerAsn: 0.929 ± 0.456
SerPro: 0.929 ± 0.456
SerGln: 0.929 ± 0.456
SerArg: 2.788 ± 1.367
SerSer: 1.859 ± 1.0
SerThr: 1.859 ± 0.911
SerVal: 6.506 ± 1.278
SerTrp: 1.859 ± 0.911
SerTyr: 0.0 ± 0.0
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.859 ± 2.912
ThrCys: 0.929 ± 0.456
ThrAsp: 3.717 ± 1.823
ThrGlu: 0.929 ± 0.456
ThrPhe: 3.717 ± 0.089
ThrGly: 4.647 ± 1.545
ThrHis: 0.0 ± 0.0
ThrIle: 0.0 ± 0.0
ThrLys: 4.647 ± 1.545
ThrLeu: 3.717 ± 0.089
ThrMet: 5.576 ± 1.089
ThrAsn: 0.929 ± 0.456
ThrPro: 5.576 ± 0.823
ThrGln: 4.647 ± 3.456
ThrArg: 6.506 ± 2.545
ThrSer: 3.717 ± 1.823
ThrThr: 7.435 ± 4.001
ThrVal: 4.647 ± 2.278
ThrTrp: 1.859 ± 1.0
ThrTyr: 0.929 ± 0.456
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.576 ± 2.734
ValCys: 0.0 ± 0.0
ValAsp: 0.0 ± 0.0
ValGlu: 2.788 ± 1.367
ValPhe: 4.647 ± 2.278
ValGly: 4.647 ± 0.367
ValHis: 1.859 ± 2.912
ValIle: 3.717 ± 2.0
ValLys: 4.647 ± 2.278
ValLeu: 10.223 ± 1.189
ValMet: 2.788 ± 1.755
ValAsn: 2.788 ± 2.456
ValPro: 4.647 ± 3.456
ValGln: 0.929 ± 1.456
ValArg: 3.717 ± 1.823
ValSer: 5.576 ± 0.823
ValThr: 3.717 ± 2.0
ValVal: 4.647 ± 1.545
ValTrp: 0.929 ± 0.456
ValTyr: 1.859 ± 0.911
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.859 ± 0.911
TrpCys: 0.0 ± 0.0
TrpAsp: 0.929 ± 0.456
TrpGlu: 0.929 ± 0.456
TrpPhe: 0.929 ± 0.456
TrpGly: 0.929 ± 1.456
TrpHis: 0.929 ± 0.456
TrpIle: 0.929 ± 0.456
TrpLys: 1.859 ± 1.0
TrpLeu: 0.0 ± 0.0
TrpMet: 1.859 ± 0.911
TrpAsn: 2.788 ± 2.456
TrpPro: 0.929 ± 0.456
TrpGln: 0.0 ± 0.0
TrpArg: 2.788 ± 1.367
TrpSer: 1.859 ± 0.911
TrpThr: 0.0 ± 0.0
TrpVal: 0.929 ± 0.456
TrpTrp: 2.788 ± 1.367
TrpTyr: 0.929 ± 1.456
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 6.506 ± 0.633
TyrCys: 0.0 ± 0.0
TyrAsp: 0.0 ± 0.0
TyrGlu: 2.788 ± 1.367
TyrPhe: 0.929 ± 0.456
TyrGly: 0.929 ± 0.456
TyrHis: 1.859 ± 1.0
TyrIle: 0.929 ± 0.456
TyrLys: 0.0 ± 0.0
TyrLeu: 1.859 ± 0.911
TyrMet: 0.929 ± 0.456
TyrAsn: 0.0 ± 0.0
TyrPro: 0.929 ± 1.456
TyrGln: 0.929 ± 1.456
TyrArg: 0.0 ± 0.0
TyrSer: 1.859 ± 0.911
TyrThr: 0.0 ± 0.0
TyrVal: 0.0 ± 0.0
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1077 amino acids)

Note: The error has been estimated by bootstrapping (100 resamplings) at the protein level.
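A minimal sketch of what a protein-level bootstrap could look like follows. The exact procedure behind the reported errors is not specified here, so this is an assumption: whole proteins are resampled with replacement 100 times, the frequency is recomputed each time, and the standard deviation of the resampled values is taken as the error. The names `per_mille` and `bootstrap_error` and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def per_mille(proteins, pair):
    """Frequency of one dipeptide (per mille) over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    samples = []
    for _ in range(rounds):
        # Draw as many proteins as the proteome has, with replacement.
        resampled = rng.choices(proteins, k=len(proteins))
        samples.append(per_mille(resampled, pair))
    return statistics.pstdev(samples)


toy_proteome = ["MKVLAAGA", "AAGK"]  # toy data, not the real sequences
print(bootstrap_error(toy_proteome, "AA"))
```

With only two proteins, a resample can take just three distinct compositions (either protein twice, or one of each), so error estimates for such a small proteome are necessarily coarse.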

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski