Amino acid dipepetide frequency for Lake Sarah-associated circular virus-26

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.637AlaAla: 4.637 ± 3.014
0.0AlaCys: 0.0 ± 0.0
0.0AlaAsp: 0.0 ± 0.0
3.091AlaGlu: 3.091 ± 2.219
6.182AlaPhe: 6.182 ± 2.323
7.728AlaGly: 7.728 ± 5.024
1.546AlaHis: 1.546 ± 1.109
4.637AlaIle: 4.637 ± 3.014
1.546AlaLys: 1.546 ± 1.109
6.182AlaLeu: 6.182 ± 0.209
0.0AlaMet: 0.0 ± 0.0
3.091AlaAsn: 3.091 ± 2.01
0.0AlaPro: 0.0 ± 0.0
1.546AlaGln: 1.546 ± 1.005
6.182AlaArg: 6.182 ± 0.209
3.091AlaSer: 3.091 ± 0.105
10.819AlaThr: 10.819 ± 2.805
7.728AlaVal: 7.728 ± 3.433
1.546AlaTrp: 1.546 ± 1.005
0.0AlaTyr: 0.0 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
3.091CysGlu: 3.091 ± 2.219
0.0CysPhe: 0.0 ± 0.0
0.0CysGly: 0.0 ± 0.0
1.546CysHis: 1.546 ± 1.109
0.0CysIle: 0.0 ± 0.0
0.0CysLys: 0.0 ± 0.0
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
1.546CysGln: 1.546 ± 1.109
1.546CysArg: 1.546 ± 1.005
0.0CysSer: 0.0 ± 0.0
0.0CysThr: 0.0 ± 0.0
3.091CysVal: 3.091 ± 2.219
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.546AspAla: 1.546 ± 1.005
0.0AspCys: 0.0 ± 0.0
1.546AspAsp: 1.546 ± 1.109
3.091AspGlu: 3.091 ± 2.219
7.728AspPhe: 7.728 ± 3.433
1.546AspGly: 1.546 ± 1.109
3.091AspHis: 3.091 ± 2.01
0.0AspIle: 0.0 ± 0.0
1.546AspLys: 1.546 ± 1.005
6.182AspLeu: 6.182 ± 2.323
0.0AspMet: 0.0 ± 0.0
3.091AspAsn: 3.091 ± 0.105
3.091AspPro: 3.091 ± 0.105
0.0AspGln: 0.0 ± 0.0
3.091AspArg: 3.091 ± 2.219
1.546AspSer: 1.546 ± 1.109
0.0AspThr: 0.0 ± 0.0
3.091AspVal: 3.091 ± 0.105
6.182AspTrp: 6.182 ± 2.323
0.0AspTyr: 0.0 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
4.637GluAla: 4.637 ± 3.328
0.0GluCys: 0.0 ± 0.0
3.091GluAsp: 3.091 ± 2.219
1.546GluGlu: 1.546 ± 1.005
0.0GluPhe: 0.0 ± 0.0
3.091GluGly: 3.091 ± 2.219
0.0GluHis: 0.0 ± 0.0
0.0GluIle: 0.0 ± 0.0
7.728GluLys: 7.728 ± 3.433
3.091GluLeu: 3.091 ± 0.105
0.0GluMet: 0.0 ± 0.0
1.546GluAsn: 1.546 ± 1.109
1.546GluPro: 1.546 ± 1.005
0.0GluGln: 0.0 ± 0.0
3.091GluArg: 3.091 ± 2.219
3.091GluSer: 3.091 ± 2.01
4.637GluThr: 4.637 ± 3.014
1.546GluVal: 1.546 ± 1.005
1.546GluTrp: 1.546 ± 1.109
6.182GluTyr: 6.182 ± 2.323
0.0GluXaa: 0.0 ± 0.0
Phe
6.182PheAla: 6.182 ± 0.209
1.546PheCys: 1.546 ± 1.109
4.637PheAsp: 4.637 ± 3.328
1.546PheGlu: 1.546 ± 1.109
1.546PhePhe: 1.546 ± 1.109
1.546PheGly: 1.546 ± 1.109
1.546PheHis: 1.546 ± 1.109
3.091PheIle: 3.091 ± 2.219
4.637PheLys: 4.637 ± 3.014
3.091PheLeu: 3.091 ± 0.105
1.546PheMet: 1.546 ± 1.005
7.728PheAsn: 7.728 ± 0.796
0.0PhePro: 0.0 ± 0.0
0.0PheGln: 0.0 ± 0.0
6.182PheArg: 6.182 ± 0.209
6.182PheSer: 6.182 ± 1.905
4.637PheThr: 4.637 ± 1.214
3.091PheVal: 3.091 ± 0.105
0.0PheTrp: 0.0 ± 0.0
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
7.728GlyAla: 7.728 ± 1.319
4.637GlyCys: 4.637 ± 3.328
1.546GlyAsp: 1.546 ± 1.005
3.091GlyGlu: 3.091 ± 2.01
1.546GlyPhe: 1.546 ± 1.005
3.091GlyGly: 3.091 ± 0.105
0.0GlyHis: 0.0 ± 0.0
4.637GlyIle: 4.637 ± 0.9
4.637GlyLys: 4.637 ± 0.9
4.637GlyLeu: 4.637 ± 1.214
0.0GlyMet: 0.0 ± 0.0
4.637GlyAsn: 4.637 ± 0.9
1.546GlyPro: 1.546 ± 1.005
1.546GlyGln: 1.546 ± 1.109
0.0GlyArg: 0.0 ± 0.0
3.091GlySer: 3.091 ± 2.01
6.182GlyThr: 6.182 ± 1.905
3.091GlyVal: 3.091 ± 2.01
0.0GlyTrp: 0.0 ± 0.0
4.637GlyTyr: 4.637 ± 0.9
0.0GlyXaa: 0.0 ± 0.0
His
6.182HisAla: 6.182 ± 0.209
0.0HisCys: 0.0 ± 0.0
1.546HisAsp: 1.546 ± 1.109
1.546HisGlu: 1.546 ± 1.005
4.637HisPhe: 4.637 ± 1.214
0.0HisGly: 0.0 ± 0.0
0.0HisHis: 0.0 ± 0.0
1.546HisIle: 1.546 ± 1.109
0.0HisLys: 0.0 ± 0.0
1.546HisLeu: 1.546 ± 1.109
0.0HisMet: 0.0 ± 0.0
1.546HisAsn: 1.546 ± 1.109
1.546HisPro: 1.546 ± 1.109
1.546HisGln: 1.546 ± 1.109
0.0HisArg: 0.0 ± 0.0
1.546HisSer: 1.546 ± 1.109
3.091HisThr: 3.091 ± 2.219
3.091HisVal: 3.091 ± 2.219
0.0HisTrp: 0.0 ± 0.0
3.091HisTyr: 3.091 ± 2.01
0.0HisXaa: 0.0 ± 0.0
Ile
1.546IleAla: 1.546 ± 1.005
0.0IleCys: 0.0 ± 0.0
4.637IleAsp: 4.637 ± 1.214
0.0IleGlu: 0.0 ± 0.0
1.546IlePhe: 1.546 ± 1.109
1.546IleGly: 1.546 ± 1.109
0.0IleHis: 0.0 ± 0.0
0.0IleIle: 0.0 ± 0.0
3.091IleLys: 3.091 ± 2.219
1.546IleLeu: 1.546 ± 1.005
3.091IleMet: 3.091 ± 0.105
3.091IleAsn: 3.091 ± 2.01
4.637IlePro: 4.637 ± 1.214
0.0IleGln: 0.0 ± 0.0
3.091IleArg: 3.091 ± 2.219
3.091IleSer: 3.091 ± 0.105
4.637IleThr: 4.637 ± 3.014
4.637IleVal: 4.637 ± 1.214
0.0IleTrp: 0.0 ± 0.0
1.546IleTyr: 1.546 ± 1.109
0.0IleXaa: 0.0 ± 0.0
Lys
1.546LysAla: 1.546 ± 1.109
0.0LysCys: 0.0 ± 0.0
3.091LysAsp: 3.091 ± 0.105
6.182LysGlu: 6.182 ± 2.323
1.546LysPhe: 1.546 ± 1.005
3.091LysGly: 3.091 ± 2.01
7.728LysHis: 7.728 ± 1.319
3.091LysIle: 3.091 ± 0.105
3.091LysLys: 3.091 ± 2.01
3.091LysLeu: 3.091 ± 2.219
1.546LysMet: 1.546 ± 1.005
3.091LysAsn: 3.091 ± 2.01
1.546LysPro: 1.546 ± 1.109
3.091LysGln: 3.091 ± 0.105
7.728LysArg: 7.728 ± 2.91
3.091LysSer: 3.091 ± 0.105
7.728LysThr: 7.728 ± 2.91
1.546LysVal: 1.546 ± 1.005
1.546LysTrp: 1.546 ± 1.005
4.637LysTyr: 4.637 ± 0.9
0.0LysXaa: 0.0 ± 0.0
Leu
4.637LeuAla: 4.637 ± 0.9
0.0LeuCys: 0.0 ± 0.0
0.0LeuAsp: 0.0 ± 0.0
6.182LeuGlu: 6.182 ± 2.323
1.546LeuPhe: 1.546 ± 1.109
6.182LeuGly: 6.182 ± 2.323
3.091LeuHis: 3.091 ± 2.219
0.0LeuIle: 0.0 ± 0.0
1.546LeuLys: 1.546 ± 1.005
3.091LeuLeu: 3.091 ± 2.219
3.091LeuMet: 3.091 ± 2.01
1.546LeuAsn: 1.546 ± 1.005
0.0LeuPro: 0.0 ± 0.0
4.637LeuGln: 4.637 ± 0.9
6.182LeuArg: 6.182 ± 2.323
7.728LeuSer: 7.728 ± 2.91
7.728LeuThr: 7.728 ± 1.319
1.546LeuVal: 1.546 ± 1.109
0.0LeuTrp: 0.0 ± 0.0
1.546LeuTyr: 1.546 ± 1.109
0.0LeuXaa: 0.0 ± 0.0
Met
0.0MetAla: 0.0 ± 0.0
0.0MetCys: 0.0 ± 0.0
1.546MetAsp: 1.546 ± 1.005
0.0MetGlu: 0.0 ± 0.0
0.0MetPhe: 0.0 ± 0.0
3.091MetGly: 3.091 ± 2.01
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
3.091MetLys: 3.091 ± 2.01
0.0MetLeu: 0.0 ± 0.0
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
1.546MetPro: 1.546 ± 1.005
1.546MetGln: 1.546 ± 1.109
0.0MetArg: 0.0 ± 0.0
6.182MetSer: 6.182 ± 4.019
1.546MetThr: 1.546 ± 1.109
3.091MetVal: 3.091 ± 0.105
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
4.637AsnAla: 4.637 ± 0.9
0.0AsnCys: 0.0 ± 0.0
3.091AsnAsp: 3.091 ± 0.105
0.0AsnGlu: 0.0 ± 0.0
3.091AsnPhe: 3.091 ± 0.105
3.091AsnGly: 3.091 ± 2.01
3.091AsnHis: 3.091 ± 2.219
3.091AsnIle: 3.091 ± 0.105
3.091AsnLys: 3.091 ± 0.105
3.091AsnLeu: 3.091 ± 2.01
3.091AsnMet: 3.091 ± 2.01
3.091AsnAsn: 3.091 ± 0.105
0.0AsnPro: 0.0 ± 0.0
3.091AsnGln: 3.091 ± 0.105
1.546AsnArg: 1.546 ± 1.005
4.637AsnSer: 4.637 ± 3.014
1.546AsnThr: 1.546 ± 1.005
10.819AsnVal: 10.819 ± 1.423
0.0AsnTrp: 0.0 ± 0.0
7.728AsnTyr: 7.728 ± 2.91
0.0AsnXaa: 0.0 ± 0.0
Pro
1.546ProAla: 1.546 ± 1.005
0.0ProCys: 0.0 ± 0.0
1.546ProAsp: 1.546 ± 1.109
3.091ProGlu: 3.091 ± 0.105
1.546ProPhe: 1.546 ± 1.005
0.0ProGly: 0.0 ± 0.0
0.0ProHis: 0.0 ± 0.0
1.546ProIle: 1.546 ± 1.109
1.546ProLys: 1.546 ± 1.005
0.0ProLeu: 0.0 ± 0.0
1.546ProMet: 1.546 ± 1.109
3.091ProAsn: 3.091 ± 0.105
0.0ProPro: 0.0 ± 0.0
3.091ProGln: 3.091 ± 2.219
3.091ProArg: 3.091 ± 2.219
1.546ProSer: 1.546 ± 1.109
1.546ProThr: 1.546 ± 1.005
3.091ProVal: 3.091 ± 0.105
0.0ProTrp: 0.0 ± 0.0
1.546ProTyr: 1.546 ± 1.005
0.0ProXaa: 0.0 ± 0.0
Gln
0.0GlnAla: 0.0 ± 0.0
0.0GlnCys: 0.0 ± 0.0
3.091GlnAsp: 3.091 ± 2.219
3.091GlnGlu: 3.091 ± 0.105
3.091GlnPhe: 3.091 ± 0.105
0.0GlnGly: 0.0 ± 0.0
1.546GlnHis: 1.546 ± 1.109
3.091GlnIle: 3.091 ± 0.105
1.546GlnLys: 1.546 ± 1.109
3.091GlnLeu: 3.091 ± 0.105
0.0GlnMet: 0.0 ± 0.0
4.637GlnAsn: 4.637 ± 0.9
1.546GlnPro: 1.546 ± 1.109
1.546GlnGln: 1.546 ± 1.109
0.0GlnArg: 0.0 ± 0.0
1.546GlnSer: 1.546 ± 1.109
4.637GlnThr: 4.637 ± 0.9
1.546GlnVal: 1.546 ± 1.005
0.0GlnTrp: 0.0 ± 0.0
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
3.091ArgAla: 3.091 ± 0.105
0.0ArgCys: 0.0 ± 0.0
4.637ArgAsp: 4.637 ± 1.214
3.091ArgGlu: 3.091 ± 2.219
3.091ArgPhe: 3.091 ± 0.105
1.546ArgGly: 1.546 ± 1.005
4.637ArgHis: 4.637 ± 3.328
0.0ArgIle: 0.0 ± 0.0
9.274ArgLys: 9.274 ± 1.8
4.637ArgLeu: 4.637 ± 1.214
1.546ArgMet: 1.546 ± 1.005
7.728ArgAsn: 7.728 ± 1.319
4.637ArgPro: 4.637 ± 3.328
0.0ArgGln: 0.0 ± 0.0
3.091ArgArg: 3.091 ± 2.01
4.637ArgSer: 4.637 ± 1.214
1.546ArgThr: 1.546 ± 1.005
1.546ArgVal: 1.546 ± 1.005
1.546ArgTrp: 1.546 ± 1.109
6.182ArgTyr: 6.182 ± 0.209
0.0ArgXaa: 0.0 ± 0.0
Ser
6.182SerAla: 6.182 ± 0.209
0.0SerCys: 0.0 ± 0.0
3.091SerAsp: 3.091 ± 2.01
1.546SerGlu: 1.546 ± 1.005
6.182SerPhe: 6.182 ± 1.905
7.728SerGly: 7.728 ± 0.796
0.0SerHis: 0.0 ± 0.0
4.637SerIle: 4.637 ± 1.214
1.546SerLys: 1.546 ± 1.005
6.182SerLeu: 6.182 ± 0.209
1.546SerMet: 1.546 ± 1.005
6.182SerAsn: 6.182 ± 1.905
1.546SerPro: 1.546 ± 1.005
3.091SerGln: 3.091 ± 0.105
6.182SerArg: 6.182 ± 2.323
4.637SerSer: 4.637 ± 3.014
6.182SerThr: 6.182 ± 4.019
4.637SerVal: 4.637 ± 0.9
1.546SerTrp: 1.546 ± 1.109
0.0SerTyr: 0.0 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
9.274ThrAla: 9.274 ± 6.029
1.546ThrCys: 1.546 ± 1.109
6.182ThrAsp: 6.182 ± 2.323
3.091ThrGlu: 3.091 ± 0.105
7.728ThrPhe: 7.728 ± 0.796
4.637ThrGly: 4.637 ± 3.014
0.0ThrHis: 0.0 ± 0.0
4.637ThrIle: 4.637 ± 0.9
6.182ThrLys: 6.182 ± 1.905
4.637ThrLeu: 4.637 ± 0.9
0.0ThrMet: 0.0 ± 0.753
6.182ThrAsn: 6.182 ± 1.905
3.091ThrPro: 3.091 ± 0.105
0.0ThrGln: 0.0 ± 0.0
3.091ThrArg: 3.091 ± 2.219
3.091ThrSer: 3.091 ± 0.105
12.365ThrThr: 12.365 ± 5.924
6.182ThrVal: 6.182 ± 4.019
0.0ThrTrp: 0.0 ± 0.0
4.637ThrTyr: 4.637 ± 1.214
0.0ThrXaa: 0.0 ± 0.0
Val
0.0ValAla: 0.0 ± 0.0
1.546ValCys: 1.546 ± 1.005
1.546ValAsp: 1.546 ± 1.109
0.0ValGlu: 0.0 ± 0.0
7.728ValPhe: 7.728 ± 1.319
3.091ValGly: 3.091 ± 0.105
1.546ValHis: 1.546 ± 1.005
6.182ValIle: 6.182 ± 2.323
7.728ValLys: 7.728 ± 0.796
3.091ValLeu: 3.091 ± 2.01
3.091ValMet: 3.091 ± 0.715
1.546ValAsn: 1.546 ± 1.005
1.546ValPro: 1.546 ± 1.005
4.637ValGln: 4.637 ± 0.9
6.182ValArg: 6.182 ± 2.323
7.728ValSer: 7.728 ± 1.319
6.182ValThr: 6.182 ± 0.209
4.637ValVal: 4.637 ± 0.9
0.0ValTrp: 0.0 ± 0.0
3.091ValTyr: 3.091 ± 2.01
0.0ValXaa: 0.0 ± 0.0
Trp
1.546TrpAla: 1.546 ± 1.109
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
1.546TrpGlu: 1.546 ± 1.005
0.0TrpPhe: 0.0 ± 0.0
3.091TrpGly: 3.091 ± 2.219
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
3.091TrpLys: 3.091 ± 0.105
0.0TrpLeu: 0.0 ± 0.0
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
3.091TrpGln: 3.091 ± 0.105
0.0TrpArg: 0.0 ± 0.0
1.546TrpSer: 1.546 ± 1.109
1.546TrpThr: 1.546 ± 1.109
0.0TrpVal: 0.0 ± 0.0
1.546TrpTrp: 1.546 ± 1.109
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.637TyrAla: 4.637 ± 0.9
1.546TyrCys: 1.546 ± 1.109
1.546TyrAsp: 1.546 ± 1.005
1.546TyrGlu: 1.546 ± 1.109
0.0TyrPhe: 0.0 ± 0.0
6.182TyrGly: 6.182 ± 1.905
3.091TyrHis: 3.091 ± 0.105
1.546TyrIle: 1.546 ± 1.109
3.091TyrLys: 3.091 ± 0.105
3.091TyrLeu: 3.091 ± 2.219
0.0TyrMet: 0.0 ± 0.0
0.0TyrAsn: 0.0 ± 0.0
1.546TyrPro: 1.546 ± 1.109
0.0TyrGln: 0.0 ± 0.0
6.182TyrArg: 6.182 ± 4.019
4.637TyrSer: 4.637 ± 3.014
1.546TyrThr: 1.546 ± 1.109
3.091TyrVal: 3.091 ± 2.01
1.546TyrTrp: 1.546 ± 1.109
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (648 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski