Amino acid dipeptide frequency for Lake Sarah-associated circular virus-12

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
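The counting described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the code used by Proteome-pI: the function name `dipeptide_permille`, the use of one-letter codes, the overlapping-window convention, and the toy sequences are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids (one-letter codes); anything else is counted as X (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all given sequences."""
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X.
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2; a window never spans two proteins
        # (an assumption of this sketch).
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Uniform expectation over the 400 standard dipeptides, in per mille.
EXPECTED_PERMILLE = 1000.0 / 400  # 2.5

# Hypothetical toy sequences, not the proteins behind the table.
freqs = dipeptide_permille(["MKLLA", "GSGSK"])
over = sorted(d for d, f in freqs.items() if f > EXPECTED_PERMILLE)
print(len(freqs), over)
```

Dipeptides whose value exceeds `EXPECTED_PERMILLE` correspond to the overrepresented (red) cells, and those below it to the underrepresented (blue) cells.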

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
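As a small worked example of the unit conversion, the sketch below turns a per mille value from the table into a plain fraction and into percent. The helper names are hypothetical and exist only for this illustration.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (1 per mille = 0.1 percent)."""
    return value_permille / 10.0


# AlaAsp is listed as 5.405 per mille in the table.
ala_asp = 5.405
print(permille_to_fraction(ala_asp), permille_to_percent(ala_asp))
```

On this scale, the uniform expectation of 0.25% corresponds to 2.5‰, so a table value can be compared with it directly.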

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
0.0AlaAla: 0.0 ± 0.0
0.0AlaCys: 0.0 ± 0.0
5.405AlaAsp: 5.405 ± 1.168
5.405AlaGlu: 5.405 ± 1.168
5.405AlaPhe: 5.405 ± 3.788
3.604AlaGly: 3.604 ± 2.714
1.802AlaHis: 1.802 ± 1.263
5.405AlaIle: 5.405 ± 1.168
5.405AlaLys: 5.405 ± 1.168
7.207AlaLeu: 7.207 ± 0.189
1.802AlaMet: 1.802 ± 1.263
0.0AlaAsn: 0.0 ± 0.0
7.207AlaPro: 7.207 ± 5.428
5.405AlaGln: 5.405 ± 1.168
0.0AlaArg: 0.0 ± 0.0
7.207AlaSer: 7.207 ± 2.808
0.0AlaThr: 0.0 ± 0.0
3.604AlaVal: 3.604 ± 0.094
1.802AlaTrp: 1.802 ± 1.263
3.604AlaTyr: 3.604 ± 0.094
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.0CysGlu: 0.0 ± 0.0
1.802CysPhe: 1.802 ± 1.263
0.0CysGly: 0.0 ± 0.0
1.802CysHis: 1.802 ± 1.263
1.802CysIle: 1.802 ± 1.357
0.0CysLys: 0.0 ± 0.0
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
1.802CysAsn: 1.802 ± 1.263
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
0.0CysSer: 0.0 ± 0.0
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
1.802CysTyr: 1.802 ± 1.357
0.0CysXaa: 0.0 ± 0.0
Asp
3.604AspAla: 3.604 ± 0.094
0.0AspCys: 0.0 ± 0.0
0.0AspAsp: 0.0 ± 0.0
1.802AspGlu: 1.802 ± 1.357
1.802AspPhe: 1.802 ± 1.357
9.009AspGly: 9.009 ± 1.074
0.0AspHis: 0.0 ± 0.0
5.405AspIle: 5.405 ± 4.071
1.802AspLys: 1.802 ± 1.263
9.009AspLeu: 9.009 ± 1.546
0.0AspMet: 0.0 ± 0.0
3.604AspAsn: 3.604 ± 0.094
0.0AspPro: 0.0 ± 0.0
0.0AspGln: 0.0 ± 0.0
5.405AspArg: 5.405 ± 1.451
3.604AspSer: 3.604 ± 2.525
1.802AspThr: 1.802 ± 1.263
9.009AspVal: 9.009 ± 1.074
0.0AspTrp: 0.0 ± 0.0
0.0AspTyr: 0.0 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
5.405GluAla: 5.405 ± 3.788
0.0GluCys: 0.0 ± 0.0
3.604GluAsp: 3.604 ± 0.094
3.604GluGlu: 3.604 ± 2.525
0.0GluPhe: 0.0 ± 0.0
1.802GluGly: 1.802 ± 1.263
0.0GluHis: 0.0 ± 0.0
0.0GluIle: 0.0 ± 0.0
0.0GluLys: 0.0 ± 0.0
3.604GluLeu: 3.604 ± 0.094
0.0GluMet: 0.0 ± 0.909
1.802GluAsn: 1.802 ± 1.263
1.802GluPro: 1.802 ± 1.263
0.0GluGln: 0.0 ± 0.0
5.405GluArg: 5.405 ± 1.451
3.604GluSer: 3.604 ± 2.714
3.604GluThr: 3.604 ± 2.525
9.009GluVal: 9.009 ± 3.693
0.0GluTrp: 0.0 ± 0.0
0.0GluTyr: 0.0 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
3.604PheAla: 3.604 ± 0.094
1.802PheCys: 1.802 ± 1.263
0.0PheAsp: 0.0 ± 0.0
0.0PheGlu: 0.0 ± 0.0
1.802PhePhe: 1.802 ± 1.357
5.405PheGly: 5.405 ± 3.788
0.0PheHis: 0.0 ± 0.0
1.802PheIle: 1.802 ± 1.263
3.604PheLys: 3.604 ± 0.094
0.0PheLeu: 0.0 ± 0.0
0.0PheMet: 0.0 ± 0.0
0.0PheAsn: 0.0 ± 0.0
3.604PhePro: 3.604 ± 0.094
1.802PheGln: 1.802 ± 1.357
9.009PheArg: 9.009 ± 4.165
3.604PheSer: 3.604 ± 2.714
9.009PheThr: 9.009 ± 1.546
0.0PheVal: 0.0 ± 0.0
0.0PheTrp: 0.0 ± 0.0
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
3.604GlyAla: 3.604 ± 2.714
0.0GlyCys: 0.0 ± 0.0
5.405GlyAsp: 5.405 ± 1.451
5.405GlyGlu: 5.405 ± 1.168
0.0GlyPhe: 0.0 ± 0.0
0.0GlyGly: 0.0 ± 0.0
0.0GlyHis: 0.0 ± 0.0
1.802GlyIle: 1.802 ± 1.263
9.009GlyLys: 9.009 ± 1.546
1.802GlyLeu: 1.802 ± 1.357
0.0GlyMet: 0.0 ± 0.0
1.802GlyAsn: 1.802 ± 1.263
5.405GlyPro: 5.405 ± 1.168
0.0GlyGln: 0.0 ± 0.0
0.0GlyArg: 0.0 ± 0.0
10.811GlySer: 10.811 ± 2.336
7.207GlyThr: 7.207 ± 2.431
3.604GlyVal: 3.604 ± 2.525
0.0GlyTrp: 0.0 ± 0.0
5.405GlyTyr: 5.405 ± 1.168
0.0GlyXaa: 0.0 ± 0.0
His
1.802HisAla: 1.802 ± 1.263
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
3.604HisGlu: 3.604 ± 2.525
0.0HisPhe: 0.0 ± 0.0
0.0HisGly: 0.0 ± 0.0
1.802HisHis: 1.802 ± 1.357
7.207HisIle: 7.207 ± 2.431
0.0HisLys: 0.0 ± 0.0
5.405HisLeu: 5.405 ± 1.451
0.0HisMet: 0.0 ± 0.0
3.604HisAsn: 3.604 ± 2.525
0.0HisPro: 0.0 ± 0.0
1.802HisGln: 1.802 ± 1.357
0.0HisArg: 0.0 ± 0.0
1.802HisSer: 1.802 ± 1.263
1.802HisThr: 1.802 ± 1.263
0.0HisVal: 0.0 ± 0.0
1.802HisTrp: 1.802 ± 1.263
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
5.405IleAla: 5.405 ± 1.451
1.802IleCys: 1.802 ± 1.357
1.802IleAsp: 1.802 ± 1.263
3.604IleGlu: 3.604 ± 2.525
0.0IlePhe: 0.0 ± 0.0
5.405IleGly: 5.405 ± 1.168
0.0IleHis: 0.0 ± 0.0
0.0IleIle: 0.0 ± 0.0
5.405IleLys: 5.405 ± 1.168
10.811IleLeu: 10.811 ± 0.283
0.0IleMet: 0.0 ± 0.0
3.604IleAsn: 3.604 ± 2.714
1.802IlePro: 1.802 ± 1.263
3.604IleGln: 3.604 ± 2.525
3.604IleArg: 3.604 ± 2.525
9.009IleSer: 9.009 ± 1.546
1.802IleThr: 1.802 ± 1.263
1.802IleVal: 1.802 ± 1.357
0.0IleTrp: 0.0 ± 0.0
1.802IleTyr: 1.802 ± 1.263
0.0IleXaa: 0.0 ± 0.0
Lys
3.604LysAla: 3.604 ± 0.094
1.802LysCys: 1.802 ± 1.263
3.604LysAsp: 3.604 ± 2.714
0.0LysGlu: 0.0 ± 0.0
1.802LysPhe: 1.802 ± 1.263
1.802LysGly: 1.802 ± 1.357
7.207LysHis: 7.207 ± 2.431
3.604LysIle: 3.604 ± 2.525
9.009LysLys: 9.009 ± 4.165
3.604LysLeu: 3.604 ± 2.525
1.802LysMet: 1.802 ± 0.875
1.802LysAsn: 1.802 ± 1.357
5.405LysPro: 5.405 ± 1.168
0.0LysGln: 0.0 ± 0.0
5.405LysArg: 5.405 ± 1.451
7.207LysSer: 7.207 ± 0.189
7.207LysThr: 7.207 ± 0.189
5.405LysVal: 5.405 ± 4.071
3.604LysTrp: 3.604 ± 2.525
1.802LysTyr: 1.802 ± 1.357
0.0LysXaa: 0.0 ± 0.0
Leu
1.802LeuAla: 1.802 ± 1.357
0.0LeuCys: 0.0 ± 0.0
3.604LeuAsp: 3.604 ± 0.094
1.802LeuGlu: 1.802 ± 1.263
3.604LeuPhe: 3.604 ± 2.714
5.405LeuGly: 5.405 ± 1.168
7.207LeuHis: 7.207 ± 0.189
1.802LeuIle: 1.802 ± 1.263
0.0LeuLys: 0.0 ± 0.0
7.207LeuLeu: 7.207 ± 2.808
0.0LeuMet: 0.0 ± 0.0
7.207LeuAsn: 7.207 ± 2.808
5.405LeuPro: 5.405 ± 1.451
1.802LeuGln: 1.802 ± 1.263
3.604LeuArg: 3.604 ± 0.094
1.802LeuSer: 1.802 ± 1.357
9.009LeuThr: 9.009 ± 1.074
7.207LeuVal: 7.207 ± 2.808
0.0LeuTrp: 0.0 ± 0.0
1.802LeuTyr: 1.802 ± 1.263
0.0LeuXaa: 0.0 ± 0.0
Met
3.604MetAla: 3.604 ± 2.525
0.0MetCys: 0.0 ± 0.0
0.0MetAsp: 0.0 ± 0.0
0.0MetGlu: 0.0 ± 0.0
1.802MetPhe: 1.802 ± 1.357
0.0MetGly: 0.0 ± 0.0
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
0.0MetLys: 0.0 ± 0.0
1.802MetLeu: 1.802 ± 1.263
1.802MetMet: 1.802 ± 1.263
0.0MetAsn: 0.0 ± 0.0
1.802MetPro: 1.802 ± 1.357
0.0MetGln: 0.0 ± 0.0
1.802MetArg: 1.802 ± 1.263
0.0MetSer: 0.0 ± 0.0
0.0MetThr: 0.0 ± 0.0
0.0MetVal: 0.0 ± 0.0
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
0.0AsnAla: 0.0 ± 0.0
0.0AsnCys: 0.0 ± 0.0
5.405AsnAsp: 5.405 ± 1.451
3.604AsnGlu: 3.604 ± 2.714
7.207AsnPhe: 7.207 ± 2.808
0.0AsnGly: 0.0 ± 0.0
1.802AsnHis: 1.802 ± 1.263
1.802AsnIle: 1.802 ± 1.357
7.207AsnLys: 7.207 ± 2.431
0.0AsnLeu: 0.0 ± 0.0
0.0AsnMet: 0.0 ± 0.0
5.405AsnAsn: 5.405 ± 1.451
3.604AsnPro: 3.604 ± 0.094
1.802AsnGln: 1.802 ± 1.357
0.0AsnArg: 0.0 ± 0.0
3.604AsnSer: 3.604 ± 2.714
5.405AsnThr: 5.405 ± 1.451
1.802AsnVal: 1.802 ± 1.357
0.0AsnTrp: 0.0 ± 0.0
3.604AsnTyr: 3.604 ± 2.525
0.0AsnXaa: 0.0 ± 0.0
Pro
7.207ProAla: 7.207 ± 2.808
0.0ProCys: 0.0 ± 0.0
3.604ProAsp: 3.604 ± 2.714
3.604ProGlu: 3.604 ± 2.525
1.802ProPhe: 1.802 ± 1.357
3.604ProGly: 3.604 ± 0.094
5.405ProHis: 5.405 ± 3.788
1.802ProIle: 1.802 ± 1.357
7.207ProLys: 7.207 ± 2.431
0.0ProLeu: 0.0 ± 0.0
1.802ProMet: 1.802 ± 1.357
3.604ProAsn: 3.604 ± 2.714
5.405ProPro: 5.405 ± 1.168
1.802ProGln: 1.802 ± 1.263
1.802ProArg: 1.802 ± 1.263
1.802ProSer: 1.802 ± 1.357
5.405ProThr: 5.405 ± 3.788
3.604ProVal: 3.604 ± 2.714
0.0ProTrp: 0.0 ± 0.0
3.604ProTyr: 3.604 ± 2.714
0.0ProXaa: 0.0 ± 0.0
Gln
3.604GlnAla: 3.604 ± 2.714
0.0GlnCys: 0.0 ± 0.0
1.802GlnAsp: 1.802 ± 1.357
0.0GlnGlu: 0.0 ± 0.0
0.0GlnPhe: 0.0 ± 0.0
3.604GlnGly: 3.604 ± 0.094
0.0GlnHis: 0.0 ± 0.0
1.802GlnIle: 1.802 ± 1.263
1.802GlnLys: 1.802 ± 1.263
0.0GlnLeu: 0.0 ± 0.0
3.604GlnMet: 3.604 ± 2.525
0.0GlnAsn: 0.0 ± 0.0
3.604GlnPro: 3.604 ± 2.714
0.0GlnGln: 0.0 ± 0.0
0.0GlnArg: 0.0 ± 0.0
1.802GlnSer: 1.802 ± 1.263
1.802GlnThr: 1.802 ± 1.357
0.0GlnVal: 0.0 ± 0.0
0.0GlnTrp: 0.0 ± 0.0
3.604GlnTyr: 3.604 ± 0.094
0.0GlnXaa: 0.0 ± 0.0
Arg
1.802ArgAla: 1.802 ± 1.263
0.0ArgCys: 0.0 ± 0.0
5.405ArgAsp: 5.405 ± 3.788
1.802ArgGlu: 1.802 ± 1.357
5.405ArgPhe: 5.405 ± 3.788
5.405ArgGly: 5.405 ± 1.168
0.0ArgHis: 0.0 ± 0.0
3.604ArgIle: 3.604 ± 2.714
3.604ArgLys: 3.604 ± 0.094
9.009ArgLeu: 9.009 ± 1.546
0.0ArgMet: 0.0 ± 0.0
0.0ArgAsn: 0.0 ± 0.0
3.604ArgPro: 3.604 ± 0.094
0.0ArgGln: 0.0 ± 0.0
1.802ArgArg: 1.802 ± 1.263
1.802ArgSer: 1.802 ± 1.357
3.604ArgThr: 3.604 ± 2.714
3.604ArgVal: 3.604 ± 2.714
0.0ArgTrp: 0.0 ± 0.0
1.802ArgTyr: 1.802 ± 1.263
0.0ArgXaa: 0.0 ± 0.0
Ser
7.207SerAla: 7.207 ± 2.808
0.0SerCys: 0.0 ± 0.0
0.0SerAsp: 0.0 ± 0.0
0.0SerGlu: 0.0 ± 0.0
7.207SerPhe: 7.207 ± 0.189
1.802SerGly: 1.802 ± 1.263
1.802SerHis: 1.802 ± 1.357
5.405SerIle: 5.405 ± 1.168
7.207SerLys: 7.207 ± 0.189
1.802SerLeu: 1.802 ± 1.357
0.0SerMet: 0.0 ± 0.0
5.405SerAsn: 5.405 ± 1.168
1.802SerPro: 1.802 ± 1.263
7.207SerGln: 7.207 ± 2.808
5.405SerArg: 5.405 ± 1.451
0.0SerSer: 0.0 ± 0.0
3.604SerThr: 3.604 ± 2.714
7.207SerVal: 7.207 ± 0.189
0.0SerTrp: 0.0 ± 0.0
1.802SerTyr: 1.802 ± 1.357
0.0SerXaa: 0.0 ± 0.0
Thr
5.405ThrAla: 5.405 ± 1.168
1.802ThrCys: 1.802 ± 1.263
9.009ThrAsp: 9.009 ± 1.546
1.802ThrGlu: 1.802 ± 1.263
3.604ThrPhe: 3.604 ± 2.714
5.405ThrGly: 5.405 ± 1.168
1.802ThrHis: 1.802 ± 1.263
3.604ThrIle: 3.604 ± 0.094
7.207ThrLys: 7.207 ± 2.808
1.802ThrLeu: 1.802 ± 1.263
0.0ThrMet: 0.0 ± 0.0
1.802ThrAsn: 1.802 ± 1.357
5.405ThrPro: 5.405 ± 3.788
0.0ThrGln: 0.0 ± 0.0
1.802ThrArg: 1.802 ± 1.263
7.207ThrSer: 7.207 ± 0.189
7.207ThrThr: 7.207 ± 2.808
5.405ThrVal: 5.405 ± 1.168
3.604ThrTrp: 3.604 ± 2.714
3.604ThrTyr: 3.604 ± 0.094
0.0ThrXaa: 0.0 ± 0.0
Val
3.604ValAla: 3.604 ± 2.525
0.0ValCys: 0.0 ± 0.0
0.0ValAsp: 0.0 ± 0.0
3.604ValGlu: 3.604 ± 0.094
1.802ValPhe: 1.802 ± 1.357
3.604ValGly: 3.604 ± 0.094
0.0ValHis: 0.0 ± 0.0
3.604ValIle: 3.604 ± 2.714
5.405ValLys: 5.405 ± 1.451
7.207ValLeu: 7.207 ± 2.808
0.0ValMet: 0.0 ± 0.0
7.207ValAsn: 7.207 ± 5.428
1.802ValPro: 1.802 ± 1.357
0.0ValGln: 0.0 ± 0.0
7.207ValArg: 7.207 ± 2.431
0.0ValSer: 0.0 ± 0.0
7.207ValThr: 7.207 ± 0.189
7.207ValVal: 7.207 ± 5.05
0.0ValTrp: 0.0 ± 0.0
9.009ValTyr: 9.009 ± 3.693
0.0ValXaa: 0.0 ± 0.0
Trp
1.802TrpAla: 1.802 ± 1.263
0.0TrpCys: 0.0 ± 0.0
1.802TrpAsp: 1.802 ± 1.263
3.604TrpGlu: 3.604 ± 0.094
0.0TrpPhe: 0.0 ± 0.0
1.802TrpGly: 1.802 ± 1.357
0.0TrpHis: 0.0 ± 0.0
1.802TrpIle: 1.802 ± 1.263
0.0TrpLys: 0.0 ± 0.0
0.0TrpLeu: 0.0 ± 0.0
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
1.802TrpPro: 1.802 ± 1.263
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
0.0TrpSer: 0.0 ± 0.0
0.0TrpThr: 0.0 ± 0.0
0.0TrpVal: 0.0 ± 0.0
1.802TrpTrp: 1.802 ± 1.263
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
7.207TyrAla: 7.207 ± 2.431
1.802TyrCys: 1.802 ± 1.357
5.405TyrAsp: 5.405 ± 1.168
1.802TyrGlu: 1.802 ± 1.263
0.0TyrPhe: 0.0 ± 0.0
3.604TyrGly: 3.604 ± 0.094
0.0TyrHis: 0.0 ± 0.0
9.009TyrIle: 9.009 ± 6.313
1.802TyrLys: 1.802 ± 1.357
0.0TyrLeu: 0.0 ± 0.0
0.0TyrMet: 0.0 ± 0.0
3.604TyrAsn: 3.604 ± 0.094
3.604TyrPro: 3.604 ± 2.714
1.802TyrGln: 1.802 ± 1.357
0.0TyrArg: 0.0 ± 0.0
0.0TyrSer: 0.0 ± 0.0
1.802TyrThr: 1.802 ± 1.357
0.0TyrVal: 0.0 ± 0.0
1.802TyrTrp: 1.802 ± 1.263
1.802TyrTyr: 1.802 ± 1.357
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (556 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
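One way to picture protein-level bootstrapping is the sketch below: whole proteins are drawn with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those values is taken as the error. This is a minimal sketch under stated assumptions, not the database's implementation: the function names, the fixed seed, the toy sequences, and the choice of the standard deviation as the error estimate are all assumptions.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins):
    """Per mille frequency of each dipeptide observed in the given sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {d: 1000.0 * n / total for d, n in counts.items()} if total else {}


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Spread of one dipeptide's frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw whole proteins with replacement, keeping the proteome size.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(dipeptide_permille(sample).get(dipeptide, 0.0))
    # Assumption: the error is the standard deviation of the resampled values.
    return statistics.pstdev(estimates)


# Hypothetical toy proteome of two sequences, mirroring the two-protein case.
toy = ["MKLLAGS", "GSGSKKT"]
print(bootstrap_error(toy, "GS"), bootstrap_error(toy, "WW"))
```

With only two proteins, each resample can take just a few distinct compositions, so errors estimated this way are coarse; a dipeptide absent from every protein always gets an error of 0.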

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski