Amino acid dipepetide frequency for Boiling Springs Lake RNA-DNA hybrid virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.098AlaAla: 6.098 ± 3.123
0.0AlaCys: 0.0 ± 0.0
2.439AlaAsp: 2.439 ± 1.249
1.22AlaGlu: 1.22 ± 0.625
4.878AlaPhe: 4.878 ± 2.498
3.659AlaGly: 3.659 ± 1.874
2.439AlaHis: 2.439 ± 1.249
4.878AlaIle: 4.878 ± 1.468
4.878AlaLys: 4.878 ± 3.001
6.098AlaLeu: 6.098 ± 4.515
2.439AlaMet: 2.439 ± 1.416
3.659AlaAsn: 3.659 ± 1.874
4.878AlaPro: 4.878 ± 2.498
2.439AlaGln: 2.439 ± 1.249
7.317AlaArg: 7.317 ± 2.377
3.659AlaSer: 3.659 ± 1.874
4.878AlaThr: 4.878 ± 2.498
6.098AlaVal: 6.098 ± 3.123
2.439AlaTrp: 2.439 ± 3.489
3.659AlaTyr: 3.659 ± 1.874
0.0AlaXaa: 0.0 ± 0.0
Cys
1.22CysAla: 1.22 ± 1.745
0.0CysCys: 0.0 ± 0.0
2.439CysAsp: 2.439 ± 1.501
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
1.22CysGly: 1.22 ± 0.625
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
1.22CysLys: 1.22 ± 0.625
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
2.439CysPro: 2.439 ± 1.416
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
0.0CysSer: 0.0 ± 0.0
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
6.098AspAla: 6.098 ± 1.889
0.0AspCys: 0.0 ± 0.0
1.22AspAsp: 1.22 ± 0.625
4.878AspGlu: 4.878 ± 1.184
2.439AspPhe: 2.439 ± 1.501
0.0AspGly: 0.0 ± 0.0
2.439AspHis: 2.439 ± 1.249
6.098AspIle: 6.098 ± 2.782
2.439AspLys: 2.439 ± 1.249
3.659AspLeu: 3.659 ± 3.116
2.439AspMet: 2.439 ± 3.726
7.317AspAsn: 7.317 ± 2.135
0.0AspPro: 0.0 ± 0.0
1.22AspGln: 1.22 ± 0.625
1.22AspArg: 1.22 ± 0.625
2.439AspSer: 2.439 ± 1.416
1.22AspThr: 1.22 ± 1.745
1.22AspVal: 1.22 ± 0.625
0.0AspTrp: 0.0 ± 0.0
3.659AspTyr: 3.659 ± 5.589
0.0AspXaa: 0.0 ± 0.0
Glu
1.22GluAla: 1.22 ± 1.745
1.22GluCys: 1.22 ± 0.625
3.659GluAsp: 3.659 ± 5.589
1.22GluGlu: 1.22 ± 1.863
3.659GluPhe: 3.659 ± 1.346
1.22GluGly: 1.22 ± 1.863
1.22GluHis: 1.22 ± 0.625
3.659GluIle: 3.659 ± 1.808
1.22GluLys: 1.22 ± 1.863
3.659GluLeu: 3.659 ± 3.116
0.0GluMet: 0.0 ± 0.0
1.22GluAsn: 1.22 ± 1.745
1.22GluPro: 1.22 ± 0.625
3.659GluGln: 3.659 ± 1.321
2.439GluArg: 2.439 ± 3.726
1.22GluSer: 1.22 ± 0.625
1.22GluThr: 1.22 ± 0.625
0.0GluVal: 0.0 ± 0.0
0.0GluTrp: 0.0 ± 0.0
4.878GluTyr: 4.878 ± 2.498
0.0GluXaa: 0.0 ± 0.0
Phe
2.439PheAla: 2.439 ± 1.416
0.0PheCys: 0.0 ± 0.0
4.878PheAsp: 4.878 ± 2.498
2.439PheGlu: 2.439 ± 3.726
2.439PhePhe: 2.439 ± 1.416
7.317PheGly: 7.317 ± 2.643
2.439PheHis: 2.439 ± 1.249
3.659PheIle: 3.659 ± 1.808
1.22PheLys: 1.22 ± 1.745
6.098PheLeu: 6.098 ± 4.81
0.0PheMet: 0.0 ± 0.0
2.439PheAsn: 2.439 ± 3.726
1.22PhePro: 1.22 ± 0.625
4.878PheGln: 4.878 ± 1.468
1.22PheArg: 1.22 ± 0.625
6.098PheSer: 6.098 ± 2.66
4.878PheThr: 4.878 ± 2.498
1.22PheVal: 1.22 ± 0.625
0.0PheTrp: 0.0 ± 0.0
1.22PheTyr: 1.22 ± 1.863
0.0PheXaa: 0.0 ± 0.0
Gly
7.317GlyAla: 7.317 ± 3.747
0.0GlyCys: 0.0 ± 0.0
1.22GlyAsp: 1.22 ± 0.625
2.439GlyGlu: 2.439 ± 1.416
1.22GlyPhe: 1.22 ± 0.625
3.659GlyGly: 3.659 ± 1.874
2.439GlyHis: 2.439 ± 1.501
2.439GlyIle: 2.439 ± 1.416
3.659GlyLys: 3.659 ± 1.346
9.756GlyLeu: 9.756 ± 3.486
0.0GlyMet: 0.0 ± 0.0
2.439GlyAsn: 2.439 ± 1.416
2.439GlyPro: 2.439 ± 1.416
4.878GlyGln: 4.878 ± 2.498
3.659GlyArg: 3.659 ± 1.808
6.098GlySer: 6.098 ± 3.123
12.195GlyThr: 12.195 ± 6.245
7.317GlyVal: 7.317 ± 2.377
1.22GlyTrp: 1.22 ± 1.863
2.439GlyTyr: 2.439 ± 1.501
0.0GlyXaa: 0.0 ± 0.0
His
2.439HisAla: 2.439 ± 2.433
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
1.22HisGlu: 1.22 ± 0.625
1.22HisPhe: 1.22 ± 0.625
3.659HisGly: 3.659 ± 1.874
0.0HisHis: 0.0 ± 0.0
0.0HisIle: 0.0 ± 0.0
0.0HisLys: 0.0 ± 0.0
1.22HisLeu: 1.22 ± 0.625
1.22HisMet: 1.22 ± 1.006
0.0HisAsn: 0.0 ± 0.0
0.0HisPro: 0.0 ± 0.0
0.0HisGln: 0.0 ± 0.0
1.22HisArg: 1.22 ± 0.625
2.439HisSer: 2.439 ± 1.249
2.439HisThr: 2.439 ± 1.501
2.439HisVal: 2.439 ± 1.501
0.0HisTrp: 0.0 ± 0.0
2.439HisTyr: 2.439 ± 1.249
0.0HisXaa: 0.0 ± 0.0
Ile
6.098IleAla: 6.098 ± 3.123
1.22IleCys: 1.22 ± 1.745
2.439IleAsp: 2.439 ± 1.501
2.439IleGlu: 2.439 ± 1.501
2.439IlePhe: 2.439 ± 1.501
4.878IleGly: 4.878 ± 2.498
0.0IleHis: 0.0 ± 0.0
1.22IleIle: 1.22 ± 0.625
3.659IleLys: 3.659 ± 3.966
4.878IleLeu: 4.878 ± 3.387
2.439IleMet: 2.439 ± 1.501
3.659IleAsn: 3.659 ± 1.321
3.659IlePro: 3.659 ± 1.346
1.22IleGln: 1.22 ± 0.625
0.0IleArg: 0.0 ± 0.0
2.439IleSer: 2.439 ± 1.416
3.659IleThr: 3.659 ± 1.346
1.22IleVal: 1.22 ± 0.625
1.22IleTrp: 1.22 ± 0.625
2.439IleTyr: 2.439 ± 1.416
0.0IleXaa: 0.0 ± 0.0
Lys
3.659LysAla: 3.659 ± 1.346
0.0LysCys: 0.0 ± 0.0
4.878LysAsp: 4.878 ± 3.221
1.22LysGlu: 1.22 ± 1.863
4.878LysPhe: 4.878 ± 1.468
3.659LysGly: 3.659 ± 1.346
0.0LysHis: 0.0 ± 0.0
4.878LysIle: 4.878 ± 1.184
4.878LysLys: 4.878 ± 3.001
3.659LysLeu: 3.659 ± 1.346
0.0LysMet: 0.0 ± 0.0
4.878LysAsn: 4.878 ± 3.001
2.439LysPro: 2.439 ± 1.249
1.22LysGln: 1.22 ± 0.625
8.537LysArg: 8.537 ± 4.924
1.22LysSer: 1.22 ± 0.625
1.22LysThr: 1.22 ± 0.625
3.659LysVal: 3.659 ± 1.321
2.439LysTrp: 2.439 ± 1.501
2.439LysTyr: 2.439 ± 3.726
0.0LysXaa: 0.0 ± 0.0
Leu
6.098LeuAla: 6.098 ± 3.123
1.22LeuCys: 1.22 ± 1.745
0.0LeuAsp: 0.0 ± 0.0
3.659LeuGlu: 3.659 ± 1.321
8.537LeuPhe: 8.537 ± 1.68
7.317LeuGly: 7.317 ± 2.377
2.439LeuHis: 2.439 ± 3.489
3.659LeuIle: 3.659 ± 1.808
6.098LeuLys: 6.098 ± 0.56
2.439LeuLeu: 2.439 ± 1.416
2.439LeuMet: 2.439 ± 1.249
3.659LeuAsn: 3.659 ± 3.116
6.098LeuPro: 6.098 ± 2.782
3.659LeuGln: 3.659 ± 3.116
4.878LeuArg: 4.878 ± 1.468
9.756LeuSer: 9.756 ± 4.503
1.22LeuThr: 1.22 ± 0.625
3.659LeuVal: 3.659 ± 1.874
3.659LeuTrp: 3.659 ± 1.874
2.439LeuTyr: 2.439 ± 1.501
0.0LeuXaa: 0.0 ± 0.0
Met
1.22MetAla: 1.22 ± 1.863
0.0MetCys: 0.0 ± 0.0
1.22MetAsp: 1.22 ± 1.863
2.439MetGlu: 2.439 ± 1.501
0.0MetPhe: 0.0 ± 0.0
2.439MetGly: 2.439 ± 1.501
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
0.0MetLys: 0.0 ± 0.0
1.22MetLeu: 1.22 ± 0.625
0.0MetMet: 0.0 ± 0.0
1.22MetAsn: 1.22 ± 1.745
0.0MetPro: 0.0 ± 0.0
0.0MetGln: 0.0 ± 0.0
0.0MetArg: 0.0 ± 0.0
6.098MetSer: 6.098 ± 1.889
2.439MetThr: 2.439 ± 1.501
2.439MetVal: 2.439 ± 1.416
0.0MetTrp: 0.0 ± 0.0
1.22MetTyr: 1.22 ± 1.745
0.0MetXaa: 0.0 ± 0.0
Asn
1.22AsnAla: 1.22 ± 0.625
1.22AsnCys: 1.22 ± 0.625
3.659AsnAsp: 3.659 ± 3.116
2.439AsnGlu: 2.439 ± 3.726
2.439AsnPhe: 2.439 ± 1.416
7.317AsnGly: 7.317 ± 2.135
0.0AsnHis: 0.0 ± 0.0
4.878AsnIle: 4.878 ± 1.468
4.878AsnLys: 4.878 ± 3.001
3.659AsnLeu: 3.659 ± 1.321
1.22AsnMet: 1.22 ± 1.745
7.317AsnAsn: 7.317 ± 2.693
7.317AsnPro: 7.317 ± 4.341
1.22AsnGln: 1.22 ± 0.625
0.0AsnArg: 0.0 ± 0.0
9.756AsnSer: 9.756 ± 1.315
2.439AsnThr: 2.439 ± 1.249
2.439AsnVal: 2.439 ± 1.416
1.22AsnTrp: 1.22 ± 1.745
2.439AsnTyr: 2.439 ± 1.416
0.0AsnXaa: 0.0 ± 0.0
Pro
2.439ProAla: 2.439 ± 1.249
0.0ProCys: 0.0 ± 0.0
4.878ProAsp: 4.878 ± 5.392
0.0ProGlu: 0.0 ± 0.0
4.878ProPhe: 4.878 ± 3.221
2.439ProGly: 2.439 ± 1.249
0.0ProHis: 0.0 ± 0.0
1.22ProIle: 1.22 ± 0.625
2.439ProLys: 2.439 ± 1.249
7.317ProLeu: 7.317 ± 3.747
2.439ProMet: 2.439 ± 1.109
1.22ProAsn: 1.22 ± 1.745
2.439ProPro: 2.439 ± 2.433
1.22ProGln: 1.22 ± 0.625
2.439ProArg: 2.439 ± 3.489
1.22ProSer: 1.22 ± 0.625
2.439ProThr: 2.439 ± 1.249
3.659ProVal: 3.659 ± 1.808
1.22ProTrp: 1.22 ± 1.863
1.22ProTyr: 1.22 ± 0.625
0.0ProXaa: 0.0 ± 0.0
Gln
3.659GlnAla: 3.659 ± 1.874
1.22GlnCys: 1.22 ± 0.625
3.659GlnAsp: 3.659 ± 1.874
0.0GlnGlu: 0.0 ± 0.0
0.0GlnPhe: 0.0 ± 0.0
2.439GlnGly: 2.439 ± 1.249
0.0GlnHis: 0.0 ± 0.0
2.439GlnIle: 2.439 ± 1.249
1.22GlnLys: 1.22 ± 0.625
2.439GlnLeu: 2.439 ± 1.249
1.22GlnMet: 1.22 ± 0.625
3.659GlnAsn: 3.659 ± 1.874
4.878GlnPro: 4.878 ± 1.184
1.22GlnGln: 1.22 ± 0.625
1.22GlnArg: 1.22 ± 0.625
2.439GlnSer: 2.439 ± 1.416
1.22GlnThr: 1.22 ± 1.745
1.22GlnVal: 1.22 ± 0.625
0.0GlnTrp: 0.0 ± 0.0
2.439GlnTyr: 2.439 ± 1.416
0.0GlnXaa: 0.0 ± 0.0
Arg
3.659ArgAla: 3.659 ± 1.321
0.0ArgCys: 0.0 ± 0.0
0.0ArgAsp: 0.0 ± 0.0
1.22ArgGlu: 1.22 ± 0.625
2.439ArgPhe: 2.439 ± 1.501
4.878ArgGly: 4.878 ± 2.832
2.439ArgHis: 2.439 ± 1.501
3.659ArgIle: 3.659 ± 1.808
7.317ArgLys: 7.317 ± 2.693
4.878ArgLeu: 4.878 ± 2.832
1.22ArgMet: 1.22 ± 1.745
2.439ArgAsn: 2.439 ± 2.433
1.22ArgPro: 1.22 ± 0.625
2.439ArgGln: 2.439 ± 1.416
0.0ArgArg: 0.0 ± 0.0
3.659ArgSer: 3.659 ± 1.874
1.22ArgThr: 1.22 ± 1.863
2.439ArgVal: 2.439 ± 1.416
0.0ArgTrp: 0.0 ± 0.0
1.22ArgTyr: 1.22 ± 1.863
0.0ArgXaa: 0.0 ± 0.0
Ser
3.659SerAla: 3.659 ± 1.874
1.22SerCys: 1.22 ± 0.625
2.439SerAsp: 2.439 ± 1.249
3.659SerGlu: 3.659 ± 1.808
2.439SerPhe: 2.439 ± 2.433
7.317SerGly: 7.317 ± 3.747
4.878SerHis: 4.878 ± 1.468
4.878SerIle: 4.878 ± 2.498
6.098SerLys: 6.098 ± 2.66
4.878SerLeu: 4.878 ± 3.387
0.0SerMet: 0.0 ± 0.0
4.878SerAsn: 4.878 ± 2.498
1.22SerPro: 1.22 ± 1.745
2.439SerGln: 2.439 ± 1.249
2.439SerArg: 2.439 ± 1.501
2.439SerSer: 2.439 ± 1.249
7.317SerThr: 7.317 ± 3.747
3.659SerVal: 3.659 ± 1.321
0.0SerTrp: 0.0 ± 0.0
4.878SerTyr: 4.878 ± 2.832
0.0SerXaa: 0.0 ± 0.0
Thr
6.098ThrAla: 6.098 ± 3.123
0.0ThrCys: 0.0 ± 0.0
3.659ThrAsp: 3.659 ± 1.346
1.22ThrGlu: 1.22 ± 0.625
7.317ThrPhe: 7.317 ± 0.072
6.098ThrGly: 6.098 ± 3.123
0.0ThrHis: 0.0 ± 0.0
1.22ThrIle: 1.22 ± 0.625
1.22ThrLys: 1.22 ± 1.745
8.537ThrLeu: 8.537 ± 2.803
1.22ThrMet: 1.22 ± 0.625
6.098ThrAsn: 6.098 ± 1.81
1.22ThrPro: 1.22 ± 1.745
2.439ThrGln: 2.439 ± 1.249
2.439ThrArg: 2.439 ± 1.416
3.659ThrSer: 3.659 ± 1.874
8.537ThrThr: 8.537 ± 4.372
4.878ThrVal: 4.878 ± 1.468
0.0ThrTrp: 0.0 ± 0.0
4.878ThrTyr: 4.878 ± 2.498
0.0ThrXaa: 0.0 ± 0.0
Val
4.878ValAla: 4.878 ± 2.498
0.0ValCys: 0.0 ± 0.0
3.659ValAsp: 3.659 ± 1.874
3.659ValGlu: 3.659 ± 1.321
0.0ValPhe: 0.0 ± 0.0
6.098ValGly: 6.098 ± 3.123
1.22ValHis: 1.22 ± 0.625
1.22ValIle: 1.22 ± 0.625
3.659ValLys: 3.659 ± 1.808
3.659ValLeu: 3.659 ± 1.321
0.0ValMet: 0.0 ± 0.0
4.878ValAsn: 4.878 ± 4.848
0.0ValPro: 0.0 ± 0.0
0.0ValGln: 0.0 ± 0.0
2.439ValArg: 2.439 ± 1.416
3.659ValSer: 3.659 ± 1.346
8.537ValThr: 8.537 ± 0.691
2.439ValVal: 2.439 ± 1.249
1.22ValTrp: 1.22 ± 0.625
2.439ValTyr: 2.439 ± 1.249
0.0ValXaa: 0.0 ± 0.0
Trp
1.22TrpAla: 1.22 ± 0.625
0.0TrpCys: 0.0 ± 0.0
1.22TrpAsp: 1.22 ± 1.863
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
1.22TrpHis: 1.22 ± 0.625
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
2.439TrpLeu: 2.439 ± 1.416
0.0TrpMet: 0.0 ± 0.0
1.22TrpAsn: 1.22 ± 1.745
1.22TrpPro: 1.22 ± 1.745
0.0TrpGln: 0.0 ± 0.0
2.439TrpArg: 2.439 ± 1.501
1.22TrpSer: 1.22 ± 1.863
0.0TrpThr: 0.0 ± 0.0
1.22TrpVal: 1.22 ± 0.625
1.22TrpTrp: 1.22 ± 1.863
1.22TrpTyr: 1.22 ± 0.625
0.0TrpXaa: 0.0 ± 0.0
Tyr
7.317TyrAla: 7.317 ± 2.377
1.22TyrCys: 1.22 ± 1.863
2.439TyrAsp: 2.439 ± 1.416
2.439TyrGlu: 2.439 ± 1.416
4.878TyrPhe: 4.878 ± 3.001
1.22TyrGly: 1.22 ± 1.863
0.0TyrHis: 0.0 ± 0.0
1.22TyrIle: 1.22 ± 1.863
3.659TyrLys: 3.659 ± 1.346
2.439TyrLeu: 2.439 ± 1.501
2.439TyrMet: 2.439 ± 1.501
4.878TyrAsn: 4.878 ± 1.468
1.22TyrPro: 1.22 ± 0.625
2.439TyrGln: 2.439 ± 1.249
2.439TyrArg: 2.439 ± 1.416
1.22TyrSer: 1.22 ± 0.625
3.659TyrThr: 3.659 ± 1.321
2.439TyrVal: 2.439 ± 1.416
0.0TyrTrp: 0.0 ± 0.0
1.22TyrTyr: 1.22 ± 1.863
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (821 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski