Amino acid dipeptide frequency for Beauveria bassiana RNA virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
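As an illustration, dipeptide frequencies can be computed by sliding a two-residue window along each protein. The following is a minimal sketch, not the code used by the database: it assumes one-letter amino acid codes, overlapping windows, and normalization by the total number of dipeptides, and the function name `dipeptide_frequencies` is hypothetical. The database's exact counting convention may differ.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: overlapping adjacent pairs are counted within each protein,
    and pairs never span the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Scale to per mille so the numbers are comparable to the table.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


freqs = dipeptide_frequencies(["AAC", "AC"])
```

In this toy input the pairs are AA, AC and AC, so `freqs["AC"]` is two thirds of 1000‰.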

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides were equally likely in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones in blue, in the table.
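The comparison against the uniform expectation can be sketched as follows. This is an illustrative example only: the helper names are hypothetical, and whether the page uses 400 or 441 as the denominator for its colouring is not stated, so the alphabet size is left as a parameter.

```python
def expected_uniform_per_mille(alphabet_size=20):
    """Expected frequency (per mille) if every dipeptide were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(freq_per_mille, expected):
    """Label a dipeptide relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"   # shown in red on the page
    if freq_per_mille < expected:
        return "under"  # shown in blue on the page
    return "expected"


expected = expected_uniform_per_mille()   # 2.5 per mille for 20 amino acids
label_ala_ala = classify(8.85, expected)  # AlaAla from the table
label_ala_cys = classify(0.0, expected)   # AlaCys from the table
```

With 20 amino acids the threshold is 2.5‰, so AlaAla (8.85‰) counts as overrepresented and AlaCys (0.0‰) as underrepresented.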

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
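For readers who prefer fractions or percentages, the unit conversion is a one-liner. The helper names below are hypothetical and serve only to make the arithmetic explicit.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala_fraction = per_mille_to_fraction(8.85)  # AlaAla from the table
uniform_percent = per_mille_to_percent(2.5)     # uniform expectation, 1/400
```

For example, the AlaAla value of 8.85‰ corresponds to a fraction of about 0.00885, and the uniform expectation of 2.5‰ corresponds to 0.25%.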

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.85 ± 8.696
AlaCys: 0.0 ± 0.0
AlaAsp: 5.531 ± 0.434
AlaGlu: 3.319 ± 1.594
AlaPhe: 4.425 ± 0.653
AlaGly: 6.637 ± 0.146
AlaHis: 0.0 ± 0.0
AlaIle: 3.319 ± 1.594
AlaLys: 1.106 ± 1.087
AlaLeu: 9.956 ± 4.782
AlaMet: 1.106 ± 0.58
AlaAsn: 1.106 ± 0.58
AlaPro: 3.319 ± 0.073
AlaGln: 3.319 ± 1.594
AlaArg: 8.85 ± 5.362
AlaSer: 5.531 ± 0.434
AlaThr: 11.062 ± 4.203
AlaVal: 6.637 ± 3.188
AlaTrp: 2.212 ± 1.16
AlaTyr: 3.319 ± 1.74
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 1.106 ± 0.58
CysGly: 2.212 ± 1.16
CysHis: 1.106 ± 0.58
CysIle: 0.0 ± 0.0
CysLys: 0.0 ± 0.0
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 1.106 ± 0.58
CysSer: 1.106 ± 0.58
CysThr: 0.0 ± 0.0
CysVal: 1.106 ± 0.58
CysTrp: 0.0 ± 0.0
CysTyr: 1.106 ± 0.58
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.425 ± 2.681
AspCys: 0.0 ± 0.0
AspAsp: 6.637 ± 0.146
AspGlu: 4.425 ± 1.014
AspPhe: 3.319 ± 0.073
AspGly: 4.425 ± 0.653
AspHis: 1.106 ± 0.58
AspIle: 2.212 ± 1.16
AspLys: 0.0 ± 0.0
AspLeu: 6.637 ± 3.188
AspMet: 3.319 ± 1.492
AspAsn: 0.0 ± 0.0
AspPro: 4.425 ± 1.014
AspGln: 3.319 ± 1.74
AspArg: 0.0 ± 0.0
AspSer: 2.212 ± 0.507
AspThr: 3.319 ± 0.073
AspVal: 7.743 ± 4.275
AspTrp: 2.212 ± 1.16
AspTyr: 0.0 ± 0.0
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.637 ± 3.188
GluCys: 0.0 ± 0.0
GluAsp: 1.106 ± 0.58
GluGlu: 4.425 ± 0.653
GluPhe: 5.531 ± 2.101
GluGly: 4.425 ± 0.653
GluHis: 2.212 ± 0.507
GluIle: 3.319 ± 0.073
GluLys: 2.212 ± 1.16
GluLeu: 4.425 ± 2.32
GluMet: 2.212 ± 0.507
GluAsn: 1.106 ± 0.58
GluPro: 3.319 ± 1.74
GluGln: 3.319 ± 1.594
GluArg: 3.319 ± 3.261
GluSer: 4.425 ± 0.653
GluThr: 2.212 ± 1.16
GluVal: 6.637 ± 0.146
GluTrp: 4.425 ± 0.653
GluTyr: 1.106 ± 0.58
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.106 ± 0.58
PheCys: 1.106 ± 0.58
PheAsp: 1.106 ± 0.58
PheGlu: 1.106 ± 0.58
PhePhe: 2.212 ± 1.16
PheGly: 4.425 ± 0.653
PheHis: 1.106 ± 0.58
PheIle: 3.319 ± 0.073
PheLys: 2.212 ± 0.507
PheLeu: 7.743 ± 2.393
PheMet: 0.0 ± 0.67
PheAsn: 1.106 ± 1.087
PhePro: 1.106 ± 0.58
PheGln: 1.106 ± 0.58
PheArg: 3.319 ± 1.74
PheSer: 2.212 ± 1.16
PheThr: 0.0 ± 0.0
PheVal: 1.106 ± 0.58
PheTrp: 0.0 ± 0.0
PheTyr: 1.106 ± 0.58
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.425 ± 2.681
GlyCys: 1.106 ± 0.58
GlyAsp: 3.319 ± 0.073
GlyGlu: 6.637 ± 1.521
GlyPhe: 3.319 ± 0.073
GlyGly: 5.531 ± 2.9
GlyHis: 2.212 ± 1.16
GlyIle: 3.319 ± 0.073
GlyLys: 3.319 ± 0.073
GlyLeu: 4.425 ± 0.653
GlyMet: 3.319 ± 1.74
GlyAsn: 3.319 ± 0.073
GlyPro: 3.319 ± 1.74
GlyGln: 1.106 ± 0.58
GlyArg: 3.319 ± 0.073
GlySer: 11.062 ± 4.132
GlyThr: 3.319 ± 0.073
GlyVal: 7.743 ± 2.393
GlyTrp: 0.0 ± 0.0
GlyTyr: 2.212 ± 0.507
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 1.106 ± 0.58
HisAsp: 0.0 ± 0.0
HisGlu: 0.0 ± 0.0
HisPhe: 0.0 ± 0.0
HisGly: 1.106 ± 0.58
HisHis: 0.0 ± 0.0
HisIle: 2.212 ± 1.16
HisLys: 0.0 ± 0.0
HisLeu: 5.531 ± 1.233
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 1.106 ± 1.087
HisGln: 0.0 ± 0.0
HisArg: 2.212 ± 1.16
HisSer: 2.212 ± 1.16
HisThr: 1.106 ± 0.58
HisVal: 1.106 ± 1.087
HisTrp: 1.106 ± 1.087
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 7.743 ± 0.726
IleCys: 1.106 ± 0.58
IleAsp: 3.319 ± 1.594
IleGlu: 1.106 ± 0.58
IlePhe: 1.106 ± 0.58
IleGly: 2.212 ± 0.507
IleHis: 0.0 ± 0.0
IleIle: 1.106 ± 0.58
IleLys: 0.0 ± 0.0
IleLeu: 3.319 ± 0.073
IleMet: 0.0 ± 0.0
IleAsn: 0.0 ± 0.0
IlePro: 3.319 ± 0.073
IleGln: 0.0 ± 0.0
IleArg: 3.319 ± 1.74
IleSer: 2.212 ± 0.507
IleThr: 0.0 ± 0.0
IleVal: 3.319 ± 0.073
IleTrp: 1.106 ± 0.58
IleTyr: 1.106 ± 0.58
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.425 ± 2.681
LysCys: 1.106 ± 0.58
LysAsp: 3.319 ± 1.594
LysGlu: 4.425 ± 2.681
LysPhe: 3.319 ± 1.74
LysGly: 2.212 ± 0.507
LysHis: 1.106 ± 0.58
LysIle: 2.212 ± 1.16
LysLys: 4.425 ± 0.653
LysLeu: 2.212 ± 1.16
LysMet: 0.0 ± 0.0
LysAsn: 0.0 ± 0.0
LysPro: 1.106 ± 0.58
LysGln: 0.0 ± 0.0
LysArg: 4.425 ± 0.653
LysSer: 2.212 ± 0.507
LysThr: 1.106 ± 0.58
LysVal: 0.0 ± 0.0
LysTrp: 1.106 ± 0.58
LysTyr: 1.106 ± 0.58
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.743 ± 2.608
LeuCys: 0.0 ± 0.0
LeuAsp: 6.637 ± 3.188
LeuGlu: 6.637 ± 0.146
LeuPhe: 4.425 ± 2.32
LeuGly: 11.062 ± 0.798
LeuHis: 1.106 ± 0.58
LeuIle: 3.319 ± 0.073
LeuLys: 5.531 ± 1.233
LeuLeu: 7.743 ± 4.06
LeuMet: 3.319 ± 0.073
LeuAsn: 3.319 ± 0.073
LeuPro: 5.531 ± 0.434
LeuGln: 3.319 ± 0.073
LeuArg: 11.062 ± 4.203
LeuSer: 1.106 ± 0.58
LeuThr: 2.212 ± 0.507
LeuVal: 5.531 ± 2.9
LeuTrp: 1.106 ± 0.58
LeuTyr: 1.106 ± 1.087
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.319 ± 0.073
MetCys: 0.0 ± 0.0
MetAsp: 3.319 ± 0.073
MetGlu: 1.106 ± 0.58
MetPhe: 0.0 ± 0.0
MetGly: 2.212 ± 1.16
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 0.0 ± 0.0
MetLeu: 1.106 ± 0.58
MetMet: 2.212 ± 0.507
MetAsn: 2.212 ± 1.16
MetPro: 2.212 ± 1.16
MetGln: 0.0 ± 0.0
MetArg: 0.0 ± 0.0
MetSer: 2.212 ± 1.16
MetThr: 1.106 ± 1.087
MetVal: 3.319 ± 0.073
MetTrp: 0.0 ± 0.0
MetTyr: 3.319 ± 1.74
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.212 ± 0.507
AsnCys: 0.0 ± 0.0
AsnAsp: 1.106 ± 0.58
AsnGlu: 2.212 ± 1.16
AsnPhe: 0.0 ± 0.0
AsnGly: 1.106 ± 0.58
AsnHis: 1.106 ± 0.58
AsnIle: 1.106 ± 0.58
AsnLys: 0.0 ± 0.0
AsnLeu: 0.0 ± 0.0
AsnMet: 1.106 ± 1.087
AsnAsn: 0.0 ± 0.0
AsnPro: 1.106 ± 0.58
AsnGln: 1.106 ± 1.087
AsnArg: 3.319 ± 0.073
AsnSer: 0.0 ± 0.0
AsnThr: 2.212 ± 2.174
AsnVal: 1.106 ± 0.58
AsnTrp: 2.212 ± 0.507
AsnTyr: 0.0 ± 0.0
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.425 ± 0.653
ProCys: 1.106 ± 0.58
ProAsp: 5.531 ± 1.233
ProGlu: 7.743 ± 0.941
ProPhe: 1.106 ± 0.58
ProGly: 7.743 ± 0.726
ProHis: 1.106 ± 1.087
ProIle: 1.106 ± 0.58
ProLys: 2.212 ± 0.507
ProLeu: 1.106 ± 0.58
ProMet: 1.106 ± 0.58
ProAsn: 1.106 ± 0.58
ProPro: 4.425 ± 0.653
ProGln: 3.319 ± 1.594
ProArg: 1.106 ± 0.58
ProSer: 7.743 ± 0.941
ProThr: 2.212 ± 0.507
ProVal: 6.637 ± 4.855
ProTrp: 2.212 ± 1.16
ProTyr: 1.106 ± 0.58
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.319 ± 1.594
GlnCys: 0.0 ± 0.0
GlnAsp: 2.212 ± 1.16
GlnGlu: 1.106 ± 1.087
GlnPhe: 0.0 ± 0.0
GlnGly: 0.0 ± 0.0
GlnHis: 0.0 ± 0.0
GlnIle: 0.0 ± 0.0
GlnLys: 2.212 ± 0.507
GlnLeu: 4.425 ± 1.014
GlnMet: 0.0 ± 0.0
GlnAsn: 0.0 ± 0.0
GlnPro: 1.106 ± 0.58
GlnGln: 2.212 ± 0.507
GlnArg: 5.531 ± 0.434
GlnSer: 2.212 ± 0.507
GlnThr: 1.106 ± 0.58
GlnVal: 3.319 ± 0.073
GlnTrp: 1.106 ± 0.58
GlnTyr: 2.212 ± 1.16
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.743 ± 2.608
ArgCys: 0.0 ± 0.0
ArgAsp: 5.531 ± 3.768
ArgGlu: 3.319 ± 1.74
ArgPhe: 1.106 ± 0.58
ArgGly: 8.85 ± 0.361
ArgHis: 2.212 ± 1.16
ArgIle: 2.212 ± 0.507
ArgLys: 3.319 ± 0.073
ArgLeu: 11.062 ± 0.798
ArgMet: 2.212 ± 1.16
ArgAsn: 1.106 ± 1.087
ArgPro: 4.425 ± 1.014
ArgGln: 1.106 ± 0.58
ArgArg: 5.531 ± 0.434
ArgSer: 5.531 ± 3.768
ArgThr: 2.212 ± 0.507
ArgVal: 5.531 ± 1.233
ArgTrp: 2.212 ± 0.507
ArgTyr: 4.425 ± 0.653
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.637 ± 0.146
SerCys: 1.106 ± 0.58
SerAsp: 2.212 ± 0.507
SerGlu: 5.531 ± 1.233
SerPhe: 3.319 ± 1.74
SerGly: 5.531 ± 1.233
SerHis: 2.212 ± 0.507
SerIle: 1.106 ± 0.58
SerLys: 2.212 ± 0.507
SerLeu: 6.637 ± 0.146
SerMet: 2.212 ± 1.16
SerAsn: 1.106 ± 1.087
SerPro: 8.85 ± 3.695
SerGln: 2.212 ± 1.16
SerArg: 4.425 ± 2.32
SerSer: 7.743 ± 2.608
SerThr: 2.212 ± 0.507
SerVal: 4.425 ± 2.681
SerTrp: 1.106 ± 0.58
SerTyr: 2.212 ± 1.16
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.531 ± 2.101
ThrCys: 1.106 ± 0.58
ThrAsp: 0.0 ± 0.0
ThrGlu: 3.319 ± 1.594
ThrPhe: 1.106 ± 0.58
ThrGly: 4.425 ± 0.653
ThrHis: 0.0 ± 0.0
ThrIle: 1.106 ± 1.087
ThrLys: 2.212 ± 1.16
ThrLeu: 2.212 ± 2.174
ThrMet: 1.106 ± 0.58
ThrAsn: 1.106 ± 1.087
ThrPro: 3.319 ± 1.594
ThrGln: 1.106 ± 1.087
ThrArg: 4.425 ± 1.014
ThrSer: 5.531 ± 1.233
ThrThr: 3.319 ± 3.261
ThrVal: 4.425 ± 0.653
ThrTrp: 0.0 ± 0.0
ThrTyr: 0.0 ± 0.0
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.85 ± 2.028
ValCys: 0.0 ± 0.0
ValAsp: 5.531 ± 1.233
ValGlu: 5.531 ± 1.233
ValPhe: 1.106 ± 0.58
ValGly: 0.0 ± 0.0
ValHis: 2.212 ± 0.507
ValIle: 0.0 ± 0.0
ValLys: 4.425 ± 1.014
ValLeu: 8.85 ± 2.028
ValMet: 2.212 ± 1.16
ValAsn: 2.212 ± 0.507
ValPro: 11.062 ± 0.798
ValGln: 2.212 ± 0.507
ValArg: 9.956 ± 1.448
ValSer: 6.637 ± 1.521
ValThr: 3.319 ± 1.594
ValVal: 5.531 ± 1.233
ValTrp: 1.106 ± 0.58
ValTyr: 0.0 ± 0.0
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.106 ± 0.58
TrpCys: 0.0 ± 0.0
TrpAsp: 2.212 ± 0.507
TrpGlu: 3.319 ± 1.74
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 4.425 ± 0.653
TrpLys: 1.106 ± 0.58
TrpLeu: 3.319 ± 0.073
TrpMet: 0.0 ± 0.0
TrpAsn: 1.106 ± 0.58
TrpPro: 0.0 ± 0.0
TrpGln: 1.106 ± 0.58
TrpArg: 2.212 ± 0.507
TrpSer: 1.106 ± 0.58
TrpThr: 2.212 ± 1.16
TrpVal: 1.106 ± 0.58
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.106 ± 0.58
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.106 ± 0.58
TyrCys: 0.0 ± 0.0
TyrAsp: 1.106 ± 0.58
TyrGlu: 1.106 ± 0.58
TyrPhe: 1.106 ± 0.58
TyrGly: 2.212 ± 1.16
TyrHis: 0.0 ± 0.0
TyrIle: 0.0 ± 0.0
TyrLys: 3.319 ± 0.073
TyrLeu: 1.106 ± 0.58
TyrMet: 1.106 ± 0.58
TyrAsn: 1.106 ± 0.58
TyrPro: 1.106 ± 1.087
TyrGln: 2.212 ± 1.16
TyrArg: 2.212 ± 1.16
TyrSer: 0.0 ± 0.0
TyrThr: 1.106 ± 0.58
TyrVal: 4.425 ± 0.653
TyrTrp: 2.212 ± 1.16
TyrTyr: 1.106 ± 0.58
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (905 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
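A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the statistic is recomputed for each resample, and the spread of the resulting values is taken as the error. This is a minimal sketch under stated assumptions, not the database's implementation: the function names are hypothetical, sequences use one-letter codes, and the use of the standard deviation as the reported error is an assumption.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, pair):
    """Frequency of one dipeptide (per mille) over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Assumption: the error is the population standard deviation of the
    bootstrap estimates; the original page does not state which spread
    measure it uses.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the set size.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(dipeptide_per_mille(sample, pair))
    return statistics.pstdev(estimates)


error = bootstrap_error(["AAAA", "CCCC"], "AA")
```

With only two proteins, as in this proteome, each resample can take very few distinct compositions, which is one reason errors of this kind can be large relative to the values themselves.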

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski