Amino acid dipeptide frequency for Lake Sarah-associated circular virus-46

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
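The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names `dipeptide_per_mille` and `classify` are hypothetical, dipeptides are counted as overlapping residue pairs within each protein, any symbol outside the 20 standard residues is mapped to Xaa, and the uniform expectation of 2.5‰ is used as the threshold. The database's own counting procedure may differ in detail.

```python
from collections import Counter
from itertools import product

# One-letter codes in the column order of the table; "X" stands for Xaa.
ALPHABET = "ACDEFGHIKLMNPQRSTVWYX"

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes Xaa.
        seq = "".join(aa if aa in ALPHABET else "X" for aa in seq.upper())
        # Overlapping pairs; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found")
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(ALPHABET, repeat=2)}

def classify(per_mille, expected=1000.0 / 400):
    """Label each dipeptide relative to the uniform expectation (2.5 per mille)."""
    labels = {}
    for dipeptide, value in per_mille.items():
        if value > expected:
            labels[dipeptide] = "over"      # shown in red in the table
        elif value < expected:
            labels[dipeptide] = "under"     # shown in blue in the table
        else:
            labels[dipeptide] = "expected"
    return labels

# Toy input, not the real proteome: pairs are AC, CD, DA and AC.
freqs = dipeptide_per_mille(["ACDA", "AC"])
labels = classify(freqs)
```

For the toy input, `AC` makes up two of the four pairs and is labelled as overrepresented, while every dipeptide that never occurs is labelled as underrepresented.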

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions (for example, 3.466‰ corresponds to 0.003466, or about 0.35%).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.466 ± 0.115
AlaCys: 1.733 ± 1.219
AlaAsp: 6.932 ± 0.23
AlaGlu: 3.466 ± 0.115
AlaPhe: 8.666 ± 1.564
AlaGly: 1.733 ± 1.219
AlaHis: 1.733 ± 1.219
AlaIle: 3.466 ± 0.115
AlaLys: 8.666 ± 4.118
AlaLeu: 5.199 ± 3.658
AlaMet: 1.733 ± 0.846
AlaAsn: 1.733 ± 1.334
AlaPro: 3.466 ± 0.115
AlaGln: 5.199 ± 1.104
AlaArg: 8.666 ± 1.564
AlaSer: 0.0 ± 0.0
AlaThr: 3.466 ± 0.115
AlaVal: 1.733 ± 1.334
AlaTrp: 0.0 ± 0.0
AlaTyr: 1.733 ± 1.334
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 1.733 ± 1.219
CysGlu: 1.733 ± 1.219
CysPhe: 0.0 ± 0.0
CysGly: 1.733 ± 1.219
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.0 ± 0.0
CysLeu: 0.0 ± 0.0
CysMet: 3.466 ± 0.115
CysAsn: 0.0 ± 0.0
CysPro: 1.733 ± 1.219
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 1.733 ± 1.219
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.733 ± 1.334
AspCys: 0.0 ± 0.0
AspAsp: 1.733 ± 1.334
AspGlu: 6.932 ± 2.324
AspPhe: 1.733 ± 1.334
AspGly: 6.932 ± 2.784
AspHis: 6.932 ± 2.324
AspIle: 3.466 ± 0.115
AspLys: 0.0 ± 0.0
AspLeu: 6.932 ± 0.23
AspMet: 0.0 ± 0.0
AspAsn: 0.0 ± 0.0
AspPro: 1.733 ± 1.334
AspGln: 0.0 ± 0.0
AspArg: 3.466 ± 0.115
AspSer: 3.466 ± 0.115
AspThr: 1.733 ± 1.219
AspVal: 3.466 ± 2.669
AspTrp: 5.199 ± 1.104
AspTyr: 3.466 ± 2.439
AspXaa: 0.0 ± 0.0
Glu
GluAla: 8.666 ± 3.543
GluCys: 1.733 ± 1.219
GluAsp: 1.733 ± 1.219
GluGlu: 3.466 ± 0.115
GluPhe: 0.0 ± 0.0
GluGly: 1.733 ± 1.219
GluHis: 0.0 ± 0.0
GluIle: 1.733 ± 1.219
GluLys: 1.733 ± 1.219
GluLeu: 1.733 ± 1.219
GluMet: 0.0 ± 0.0
GluAsn: 0.0 ± 0.0
GluPro: 3.466 ± 2.439
GluGln: 3.466 ± 0.115
GluArg: 1.733 ± 1.219
GluSer: 1.733 ± 1.219
GluThr: 3.466 ± 0.115
GluVal: 3.466 ± 0.115
GluTrp: 0.0 ± 0.0
GluTyr: 3.466 ± 0.115
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.466 ± 0.115
PheCys: 0.0 ± 0.0
PheAsp: 3.466 ± 0.115
PheGlu: 1.733 ± 1.219
PhePhe: 0.0 ± 0.0
PheGly: 0.0 ± 0.0
PheHis: 0.0 ± 0.0
PheIle: 1.733 ± 1.219
PheLys: 8.666 ± 1.564
PheLeu: 3.466 ± 0.115
PheMet: 0.0 ± 0.0
PheAsn: 1.733 ± 1.334
PhePro: 3.466 ± 2.669
PheGln: 1.733 ± 1.219
PheArg: 1.733 ± 1.334
PheSer: 1.733 ± 1.334
PheThr: 1.733 ± 1.219
PheVal: 1.733 ± 1.219
PheTrp: 0.0 ± 0.0
PheTyr: 1.733 ± 1.219
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.733 ± 1.334
GlyCys: 0.0 ± 0.0
GlyAsp: 1.733 ± 1.219
GlyGlu: 3.466 ± 2.439
GlyPhe: 1.733 ± 1.334
GlyGly: 6.932 ± 0.23
GlyHis: 0.0 ± 0.0
GlyIle: 5.199 ± 1.104
GlyLys: 5.199 ± 3.658
GlyLeu: 1.733 ± 1.334
GlyMet: 0.0 ± 0.889
GlyAsn: 3.466 ± 0.115
GlyPro: 1.733 ± 1.219
GlyGln: 12.132 ± 0.874
GlyArg: 0.0 ± 0.0
GlySer: 12.132 ± 1.68
GlyThr: 3.466 ± 2.669
GlyVal: 3.466 ± 0.115
GlyTrp: 0.0 ± 0.0
GlyTyr: 1.733 ± 1.334
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 1.733 ± 1.219
HisPhe: 1.733 ± 1.219
HisGly: 5.199 ± 3.658
HisHis: 1.733 ± 1.219
HisIle: 1.733 ± 1.219
HisLys: 0.0 ± 0.0
HisLeu: 0.0 ± 0.0
HisMet: 0.0 ± 0.0
HisAsn: 3.466 ± 0.115
HisPro: 3.466 ± 0.115
HisGln: 1.733 ± 1.219
HisArg: 0.0 ± 0.0
HisSer: 0.0 ± 0.0
HisThr: 0.0 ± 0.0
HisVal: 5.199 ± 3.658
HisTrp: 3.466 ± 2.439
HisTyr: 1.733 ± 1.334
HisXaa: 0.0 ± 0.0
Ile
IleAla: 10.399 ± 0.345
IleCys: 0.0 ± 0.0
IleAsp: 0.0 ± 0.0
IleGlu: 1.733 ± 1.219
IlePhe: 3.466 ± 2.439
IleGly: 3.466 ± 2.669
IleHis: 1.733 ± 1.219
IleIle: 1.733 ± 1.219
IleLys: 0.0 ± 0.0
IleLeu: 3.466 ± 0.115
IleMet: 1.733 ± 1.219
IleAsn: 1.733 ± 1.334
IlePro: 1.733 ± 1.219
IleGln: 1.733 ± 1.219
IleArg: 3.466 ± 0.115
IleSer: 3.466 ± 2.669
IleThr: 5.199 ± 3.658
IleVal: 5.199 ± 1.449
IleTrp: 0.0 ± 0.0
IleTyr: 1.733 ± 1.219
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.466 ± 2.439
LysCys: 0.0 ± 0.0
LysAsp: 1.733 ± 1.334
LysGlu: 1.733 ± 1.219
LysPhe: 3.466 ± 0.115
LysGly: 8.666 ± 1.564
LysHis: 0.0 ± 0.0
LysIle: 3.466 ± 0.115
LysLys: 6.932 ± 2.784
LysLeu: 3.466 ± 0.115
LysMet: 1.733 ± 1.334
LysAsn: 0.0 ± 0.0
LysPro: 3.466 ± 2.669
LysGln: 1.733 ± 1.219
LysArg: 5.199 ± 3.658
LysSer: 5.199 ± 1.449
LysThr: 6.932 ± 5.337
LysVal: 6.932 ± 2.784
LysTrp: 0.0 ± 0.0
LysTyr: 5.199 ± 1.449
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 1.733 ± 1.219
LeuCys: 0.0 ± 0.0
LeuAsp: 8.666 ± 1.564
LeuGlu: 1.733 ± 1.219
LeuPhe: 0.0 ± 0.0
LeuGly: 1.733 ± 1.334
LeuHis: 5.199 ± 1.104
LeuIle: 8.666 ± 1.564
LeuLys: 3.466 ± 2.669
LeuLeu: 1.733 ± 1.219
LeuMet: 0.0 ± 0.0
LeuAsn: 1.733 ± 1.334
LeuPro: 3.466 ± 2.669
LeuGln: 0.0 ± 0.0
LeuArg: 8.666 ± 0.989
LeuSer: 8.666 ± 3.543
LeuThr: 1.733 ± 1.334
LeuVal: 1.733 ± 1.334
LeuTrp: 0.0 ± 0.0
LeuTyr: 5.199 ± 1.104
LeuXaa: 0.0 ± 0.0
Met
MetAla: 5.199 ± 1.449
MetCys: 0.0 ± 0.0
MetAsp: 0.0 ± 0.0
MetGlu: 0.0 ± 0.0
MetPhe: 0.0 ± 0.0
MetGly: 0.0 ± 0.0
MetHis: 0.0 ± 0.0
MetIle: 1.733 ± 1.219
MetLys: 0.0 ± 0.0
MetLeu: 1.733 ± 1.334
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 0.0 ± 0.0
MetGln: 0.0 ± 0.0
MetArg: 1.733 ± 1.334
MetSer: 1.733 ± 1.219
MetThr: 0.0 ± 0.0
MetVal: 0.0 ± 0.0
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.466 ± 2.669
AsnCys: 0.0 ± 0.0
AsnAsp: 3.466 ± 2.669
AsnGlu: 3.466 ± 2.669
AsnPhe: 0.0 ± 0.0
AsnGly: 3.466 ± 0.115
AsnHis: 3.466 ± 0.115
AsnIle: 1.733 ± 1.219
AsnLys: 0.0 ± 0.0
AsnLeu: 3.466 ± 2.669
AsnMet: 0.0 ± 0.0
AsnAsn: 8.666 ± 4.118
AsnPro: 3.466 ± 2.439
AsnGln: 0.0 ± 0.0
AsnArg: 1.733 ± 1.219
AsnSer: 1.733 ± 1.334
AsnThr: 6.932 ± 2.784
AsnVal: 0.0 ± 0.0
AsnTrp: 0.0 ± 0.0
AsnTyr: 1.733 ± 1.219
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.733 ± 1.334
ProCys: 0.0 ± 0.0
ProAsp: 3.466 ± 0.115
ProGlu: 3.466 ± 2.439
ProPhe: 3.466 ± 0.115
ProGly: 0.0 ± 0.0
ProHis: 1.733 ± 1.219
ProIle: 3.466 ± 2.669
ProLys: 5.199 ± 1.449
ProLeu: 1.733 ± 1.219
ProMet: 0.0 ± 0.0
ProAsn: 3.466 ± 2.669
ProPro: 1.733 ± 1.219
ProGln: 6.932 ± 0.23
ProArg: 5.199 ± 3.658
ProSer: 5.199 ± 1.449
ProThr: 3.466 ± 0.115
ProVal: 3.466 ± 0.115
ProTrp: 1.733 ± 1.334
ProTyr: 5.199 ± 1.104
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.466 ± 0.115
GlnCys: 1.733 ± 1.219
GlnAsp: 0.0 ± 0.0
GlnGlu: 3.466 ± 0.115
GlnPhe: 1.733 ± 1.219
GlnGly: 5.199 ± 1.104
GlnHis: 0.0 ± 0.0
GlnIle: 3.466 ± 0.115
GlnLys: 1.733 ± 1.219
GlnLeu: 5.199 ± 1.104
GlnMet: 0.0 ± 0.0
GlnAsn: 3.466 ± 0.115
GlnPro: 3.466 ± 0.115
GlnGln: 0.0 ± 0.0
GlnArg: 1.733 ± 1.219
GlnSer: 1.733 ± 1.334
GlnThr: 3.466 ± 2.669
GlnVal: 5.199 ± 1.104
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.733 ± 1.219
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.466 ± 0.115
ArgCys: 3.466 ± 2.439
ArgAsp: 3.466 ± 2.439
ArgGlu: 0.0 ± 0.0
ArgPhe: 5.199 ± 1.104
ArgGly: 3.466 ± 2.669
ArgHis: 3.466 ± 2.439
ArgIle: 1.733 ± 1.334
ArgLys: 3.466 ± 0.115
ArgLeu: 5.199 ± 4.003
ArgMet: 0.0 ± 0.0
ArgAsn: 3.466 ± 2.439
ArgPro: 1.733 ± 1.219
ArgGln: 1.733 ± 1.334
ArgArg: 3.466 ± 2.669
ArgSer: 5.199 ± 4.003
ArgThr: 5.199 ± 3.658
ArgVal: 3.466 ± 0.115
ArgTrp: 1.733 ± 1.219
ArgTyr: 3.466 ± 2.439
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.199 ± 1.104
SerCys: 0.0 ± 0.0
SerAsp: 3.466 ± 0.115
SerGlu: 0.0 ± 0.0
SerPhe: 1.733 ± 1.219
SerGly: 3.466 ± 0.115
SerHis: 1.733 ± 1.219
SerIle: 3.466 ± 0.115
SerLys: 6.932 ± 2.784
SerLeu: 3.466 ± 2.669
SerMet: 0.0 ± 0.0
SerAsn: 5.199 ± 3.658
SerPro: 6.932 ± 2.784
SerGln: 5.199 ± 1.449
SerArg: 5.199 ± 1.449
SerSer: 0.0 ± 0.0
SerThr: 0.0 ± 0.0
SerVal: 6.932 ± 2.784
SerTrp: 0.0 ± 0.0
SerTyr: 0.0 ± 0.0
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.466 ± 2.669
ThrCys: 0.0 ± 0.0
ThrAsp: 6.932 ± 0.23
ThrGlu: 0.0 ± 0.0
ThrPhe: 3.466 ± 2.669
ThrGly: 3.466 ± 0.115
ThrHis: 1.733 ± 1.219
ThrIle: 0.0 ± 0.0
ThrLys: 5.199 ± 1.449
ThrLeu: 5.199 ± 1.104
ThrMet: 0.0 ± 0.0
ThrAsn: 0.0 ± 0.0
ThrPro: 1.733 ± 1.334
ThrGln: 1.733 ± 1.334
ThrArg: 6.932 ± 2.324
ThrSer: 1.733 ± 1.219
ThrThr: 5.199 ± 1.449
ThrVal: 8.666 ± 1.564
ThrTrp: 3.466 ± 2.439
ThrTyr: 0.0 ± 0.0
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.932 ± 2.324
ValCys: 0.0 ± 0.0
ValAsp: 5.199 ± 1.449
ValGlu: 1.733 ± 1.219
ValPhe: 0.0 ± 0.0
ValGly: 5.199 ± 1.449
ValHis: 1.733 ± 1.219
ValIle: 3.466 ± 2.439
ValLys: 8.666 ± 1.564
ValLeu: 6.932 ± 2.784
ValMet: 1.733 ± 1.334
ValAsn: 5.199 ± 4.003
ValPro: 8.666 ± 3.543
ValGln: 0.0 ± 0.0
ValArg: 0.0 ± 0.0
ValSer: 3.466 ± 0.115
ValThr: 3.466 ± 0.115
ValVal: 5.199 ± 1.449
ValTrp: 0.0 ± 0.0
ValTyr: 5.199 ± 1.449
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.733 ± 1.219
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.0 ± 0.0
TrpGly: 1.733 ± 1.219
TrpHis: 0.0 ± 0.0
TrpIle: 1.733 ± 1.219
TrpLys: 0.0 ± 0.0
TrpLeu: 0.0 ± 0.0
TrpMet: 0.0 ± 0.0
TrpAsn: 1.733 ± 1.334
TrpPro: 0.0 ± 0.0
TrpGln: 3.466 ± 2.439
TrpArg: 0.0 ± 0.0
TrpSer: 1.733 ± 1.219
TrpThr: 0.0 ± 0.0
TrpVal: 1.733 ± 1.219
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.733 ± 1.334
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.466 ± 2.669
TyrCys: 3.466 ± 0.115
TyrAsp: 5.199 ± 1.104
TyrGlu: 3.466 ± 2.439
TyrPhe: 1.733 ± 1.334
TyrGly: 3.466 ± 2.439
TyrHis: 0.0 ± 0.0
TyrIle: 0.0 ± 0.0
TyrLys: 3.466 ± 0.115
TyrLeu: 3.466 ± 2.439
TyrMet: 0.0 ± 0.0
TyrAsn: 1.733 ± 1.334
TyrPro: 5.199 ± 1.449
TyrGln: 0.0 ± 0.0
TyrArg: 3.466 ± 2.669
TyrSer: 0.0 ± 0.0
TyrThr: 3.466 ± 2.439
TyrVal: 5.199 ± 3.658
TyrTrp: 0.0 ± 0.0
TyrTyr: 10.399 ± 5.452
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (578 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
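Bootstrapping at the protein level can be illustrated as follows. This is a minimal sketch, not the database's actual procedure: it assumes that whole proteins are resampled with replacement 100 times, that the frequency is recomputed for each resample, and that the reported error is the standard deviation of those estimates. The function names `dipeptide_freq` and `bootstrap_error` and the fixed random seed are hypothetical.

```python
import random
import statistics

def dipeptide_freq(proteins, dipeptide):
    """Per-mille frequency of one dipeptide over overlapping residue pairs."""
    total = sum(max(len(seq) - 1, 0) for seq in proteins)
    hits = sum(seq[i:i + 2] == dipeptide
               for seq in proteins
               for i in range(len(seq) - 1))
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, dipeptide, rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(rounds):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, dipeptide))
    return statistics.pstdev(estimates)

# Toy two-protein "proteome" (one-letter codes), not the real sequences.
toy = ["ACAC", "GGGG"]
point_estimate = dipeptide_freq(toy, "AC")
error = bootstrap_error(toy, "AC")
```

With only two proteins, each resample can be one of just a few combinations, so the bootstrap estimates take a small number of distinct values. This may be why the same few error values recur throughout the table.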

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski