Amino acid dipeptide frequency for Paguma larvata circovirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
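As a rough illustration of what such a calculation involves, the sketch below counts overlapping pairs of adjacent residues and reports them in per mille. The function name, the one-letter alphabet, the mapping of any non-standard letter to X (Xaa), and the choice not to count pairs across protein boundaries are all assumptions made for this example; the exact counting conventions used for this page are not stated here.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs."""
    alphabet = STANDARD + "X"
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping adjacent pairs; pairs never span two proteins.
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    total = sum(counts.values())
    pairs = [a + b for a, b in product(alphabet, repeat=2)]
    if total == 0:
        return {p: 0.0 for p in pairs}
    return {p: 1000.0 * counts[p] / total for p in pairs}


# Two made-up toy sequences, not the proteins behind this page.
freqs = dipeptide_permille(["MKRRA", "ARRU"])
print(len(freqs))             # 441 entries (21 x 21)
print(round(freqs["RR"], 3))  # 2 of 7 pairs
```

The result always has 441 entries, so dipeptides that never occur show up explicitly with a frequency of 0.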

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
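The arithmetic behind the expected value, and the over/under comparison, can be sketched as follows. The function names are hypothetical, and comparing directly against the uniform expectation is an assumption; the exact threshold used for the red and blue marking is not given here.

```python
def expected_permille(include_xaa=False):
    """Expected per-mille frequency if every dipeptide were equally likely."""
    n = 21 if include_xaa else 20  # 20 standard residues, optionally plus Xaa
    return 1000.0 / (n * n)


def classify(observed_permille, expected):
    """Label a dipeptide relative to the uniform expectation."""
    if observed_permille > expected:
        return "over"   # assumed to correspond to the red marking
    if observed_permille < expected:
        return "under"  # assumed to correspond to the blue marking
    return "as expected"


expected = expected_permille()     # 400 possible dipeptides
print(expected)                    # 2.5
print(classify(19.82, expected))   # ArgArg value from the table -> over
print(classify(1.802, expected))   # AlaCys value from the table -> under
```

With Xaa included, the expectation drops slightly, to 1000/441 (about 2.27‰).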

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.604 ± 2.425
AlaCys: 1.802 ± 1.212
AlaAsp: 1.802 ± 1.401
AlaGlu: 5.405 ± 3.637
AlaPhe: 0.0 ± 0.0
AlaGly: 7.207 ± 5.603
AlaHis: 1.802 ± 1.212
AlaIle: 1.802 ± 1.401
AlaLys: 0.0 ± 0.0
AlaLeu: 1.802 ± 1.212
AlaMet: 0.0 ± 0.0
AlaAsn: 0.0 ± 0.0
AlaPro: 0.0 ± 0.0
AlaGln: 0.0 ± 0.0
AlaArg: 1.802 ± 1.212
AlaSer: 0.0 ± 0.0
AlaThr: 0.0 ± 0.0
AlaVal: 3.604 ± 2.425
AlaTrp: 0.0 ± 0.0
AlaTyr: 3.604 ± 2.425
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 1.802 ± 1.212
CysGly: 5.405 ± 3.637
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 1.802 ± 1.212
CysLeu: 1.802 ± 1.212
CysMet: 1.802 ± 0.942
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 1.802 ± 1.212
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 1.802 ± 1.212
CysTrp: 0.0 ± 0.0
CysTyr: 1.802 ± 1.212
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.802 ± 1.401
AspCys: 0.0 ± 0.0
AspAsp: 3.604 ± 2.425
AspGlu: 5.405 ± 1.589
AspPhe: 7.207 ± 2.237
AspGly: 1.802 ± 1.212
AspHis: 1.802 ± 1.212
AspIle: 0.0 ± 0.0
AspLys: 1.802 ± 1.401
AspLeu: 9.009 ± 0.836
AspMet: 0.0 ± 0.0
AspAsn: 0.0 ± 0.0
AspPro: 5.405 ± 4.202
AspGln: 1.802 ± 1.212
AspArg: 5.405 ± 1.024
AspSer: 3.604 ± 0.188
AspThr: 0.0 ± 0.0
AspVal: 0.0 ± 0.0
AspTrp: 1.802 ± 1.401
AspTyr: 1.802 ± 1.212
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.207 ± 4.85
GluCys: 1.802 ± 1.401
GluAsp: 1.802 ± 1.401
GluGlu: 1.802 ± 1.212
GluPhe: 3.604 ± 0.188
GluGly: 5.405 ± 1.024
GluHis: 0.0 ± 0.0
GluIle: 0.0 ± 0.0
GluLys: 3.604 ± 2.425
GluLeu: 3.604 ± 2.425
GluMet: 0.0 ± 0.0
GluAsn: 3.604 ± 0.188
GluPro: 1.802 ± 1.401
GluGln: 1.802 ± 1.212
GluArg: 1.802 ± 1.212
GluSer: 3.604 ± 0.188
GluThr: 0.0 ± 0.0
GluVal: 5.405 ± 3.637
GluTrp: 1.802 ± 1.212
GluTyr: 1.802 ± 1.212
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.0 ± 0.0
PheCys: 0.0 ± 0.0
PheAsp: 3.604 ± 0.188
PheGlu: 1.802 ± 1.212
PhePhe: 7.207 ± 5.603
PheGly: 3.604 ± 2.425
PheHis: 1.802 ± 1.401
PheIle: 3.604 ± 2.802
PheLys: 9.009 ± 4.391
PheLeu: 7.207 ± 5.603
PheMet: 1.802 ± 1.401
PheAsn: 1.802 ± 1.401
PhePro: 1.802 ± 1.212
PheGln: 1.802 ± 1.401
PheArg: 5.405 ± 4.202
PheSer: 0.0 ± 0.0
PheThr: 9.009 ± 4.391
PheVal: 3.604 ± 2.425
PheTrp: 5.405 ± 4.202
PheTyr: 1.802 ± 1.212
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.802 ± 1.212
GlyCys: 1.802 ± 1.212
GlyAsp: 5.405 ± 1.024
GlyGlu: 0.0 ± 0.0
GlyPhe: 1.802 ± 1.401
GlyGly: 5.405 ± 3.637
GlyHis: 0.0 ± 0.0
GlyIle: 1.802 ± 1.401
GlyLys: 7.207 ± 4.85
GlyLeu: 10.811 ± 0.565
GlyMet: 0.0 ± 0.0
GlyAsn: 7.207 ± 4.85
GlyPro: 5.405 ± 1.024
GlyGln: 3.604 ± 0.188
GlyArg: 5.405 ± 3.637
GlySer: 1.802 ± 1.401
GlyThr: 3.604 ± 2.425
GlyVal: 1.802 ± 1.401
GlyTrp: 1.802 ± 1.212
GlyTyr: 7.207 ± 0.377
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 0.0 ± 0.0
HisPhe: 7.207 ± 5.603
HisGly: 1.802 ± 1.212
HisHis: 12.613 ± 9.806
HisIle: 1.802 ± 1.401
HisLys: 0.0 ± 0.0
HisLeu: 3.604 ± 2.425
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 3.604 ± 2.802
HisGln: 0.0 ± 0.0
HisArg: 3.604 ± 2.802
HisSer: 0.0 ± 0.0
HisThr: 0.0 ± 0.0
HisVal: 1.802 ± 1.401
HisTrp: 0.0 ± 0.0
HisTyr: 1.802 ± 1.212
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.802 ± 1.212
IleCys: 1.802 ± 1.212
IleAsp: 3.604 ± 0.188
IleGlu: 0.0 ± 0.0
IlePhe: 0.0 ± 0.0
IleGly: 0.0 ± 0.0
IleHis: 3.604 ± 0.188
IleIle: 0.0 ± 0.0
IleLys: 1.802 ± 1.401
IleLeu: 3.604 ± 0.188
IleMet: 1.802 ± 1.401
IleAsn: 3.604 ± 2.802
IlePro: 3.604 ± 2.802
IleGln: 1.802 ± 1.401
IleArg: 0.0 ± 0.0
IleSer: 0.0 ± 0.0
IleThr: 9.009 ± 0.836
IleVal: 1.802 ± 1.401
IleTrp: 0.0 ± 0.0
IleTyr: 0.0 ± 0.0
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.604 ± 2.802
LysCys: 0.0 ± 0.0
LysAsp: 1.802 ± 1.212
LysGlu: 9.009 ± 0.836
LysPhe: 3.604 ± 2.802
LysGly: 1.802 ± 1.212
LysHis: 1.802 ± 1.401
LysIle: 0.0 ± 0.0
LysLys: 3.604 ± 2.425
LysLeu: 0.0 ± 0.0
LysMet: 0.0 ± 0.0
LysAsn: 0.0 ± 0.0
LysPro: 9.009 ± 4.391
LysGln: 9.009 ± 0.836
LysArg: 9.009 ± 0.836
LysSer: 0.0 ± 0.0
LysThr: 9.009 ± 1.777
LysVal: 1.802 ± 1.212
LysTrp: 0.0 ± 0.0
LysTyr: 3.604 ± 2.425
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 0.0 ± 0.0
LeuCys: 1.802 ± 1.212
LeuAsp: 5.405 ± 1.024
LeuGlu: 5.405 ± 3.637
LeuPhe: 3.604 ± 2.802
LeuGly: 9.009 ± 0.836
LeuHis: 1.802 ± 1.401
LeuIle: 0.0 ± 0.0
LeuLys: 5.405 ± 1.024
LeuLeu: 1.802 ± 1.212
LeuMet: 0.0 ± 0.0
LeuAsn: 10.811 ± 3.178
LeuPro: 1.802 ± 1.212
LeuGln: 5.405 ± 3.637
LeuArg: 9.009 ± 3.449
LeuSer: 1.802 ± 1.212
LeuThr: 5.405 ± 1.589
LeuVal: 1.802 ± 1.401
LeuTrp: 3.604 ± 0.188
LeuTyr: 3.604 ± 0.188
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.802 ± 1.212
MetCys: 0.0 ± 0.0
MetAsp: 0.0 ± 0.0
MetGlu: 1.802 ± 1.212
MetPhe: 1.802 ± 1.212
MetGly: 1.802 ± 1.401
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 1.802 ± 1.212
MetLeu: 1.802 ± 1.401
MetMet: 0.0 ± 0.0
MetAsn: 3.604 ± 0.188
MetPro: 0.0 ± 0.0
MetGln: 0.0 ± 0.0
MetArg: 1.802 ± 1.401
MetSer: 0.0 ± 0.0
MetThr: 0.0 ± 0.0
MetVal: 0.0 ± 0.0
MetTrp: 0.0 ± 0.0
MetTyr: 1.802 ± 1.401
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 0.0 ± 0.0
AsnCys: 0.0 ± 0.0
AsnAsp: 1.802 ± 1.401
AsnGlu: 5.405 ± 1.024
AsnPhe: 5.405 ± 4.202
AsnGly: 1.802 ± 1.212
AsnHis: 0.0 ± 0.0
AsnIle: 5.405 ± 1.589
AsnLys: 0.0 ± 0.0
AsnLeu: 3.604 ± 2.425
AsnMet: 0.0 ± 0.0
AsnAsn: 1.802 ± 1.212
AsnPro: 3.604 ± 0.188
AsnGln: 1.802 ± 1.401
AsnArg: 0.0 ± 0.0
AsnSer: 5.405 ± 1.024
AsnThr: 0.0 ± 0.0
AsnVal: 1.802 ± 1.401
AsnTrp: 3.604 ± 0.188
AsnTyr: 0.0 ± 0.0
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.802 ± 1.401
ProCys: 0.0 ± 0.0
ProAsp: 3.604 ± 0.188
ProGlu: 1.802 ± 1.401
ProPhe: 1.802 ± 1.401
ProGly: 7.207 ± 4.85
ProHis: 3.604 ± 2.802
ProIle: 7.207 ± 2.99
ProLys: 1.802 ± 1.401
ProLeu: 1.802 ± 1.212
ProMet: 5.405 ± 1.589
ProAsn: 0.0 ± 0.0
ProPro: 5.405 ± 1.024
ProGln: 3.604 ± 2.802
ProArg: 1.802 ± 1.212
ProSer: 0.0 ± 0.0
ProThr: 3.604 ± 0.188
ProVal: 3.604 ± 2.425
ProTrp: 1.802 ± 1.401
ProTyr: 1.802 ± 1.401
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.802 ± 1.212
GlnCys: 0.0 ± 0.0
GlnAsp: 1.802 ± 1.401
GlnGlu: 5.405 ± 3.637
GlnPhe: 3.604 ± 2.802
GlnGly: 5.405 ± 1.024
GlnHis: 1.802 ± 1.212
GlnIle: 0.0 ± 0.0
GlnLys: 1.802 ± 1.401
GlnLeu: 1.802 ± 1.212
GlnMet: 0.0 ± 0.0
GlnAsn: 1.802 ± 1.401
GlnPro: 0.0 ± 0.0
GlnGln: 1.802 ± 1.401
GlnArg: 3.604 ± 2.425
GlnSer: 1.802 ± 1.401
GlnThr: 3.604 ± 2.802
GlnVal: 5.405 ± 1.024
GlnTrp: 1.802 ± 1.212
GlnTyr: 3.604 ± 2.802
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.604 ± 2.425
ArgCys: 0.0 ± 0.0
ArgAsp: 5.405 ± 1.589
ArgGlu: 3.604 ± 2.425
ArgPhe: 1.802 ± 1.401
ArgGly: 5.405 ± 1.024
ArgHis: 5.405 ± 4.202
ArgIle: 5.405 ± 1.024
ArgLys: 5.405 ± 1.589
ArgLeu: 3.604 ± 2.425
ArgMet: 0.0 ± 0.0
ArgAsn: 1.802 ± 1.401
ArgPro: 0.0 ± 0.0
ArgGln: 1.802 ± 1.401
ArgArg: 19.82 ± 10.182
ArgSer: 1.802 ± 1.212
ArgThr: 1.802 ± 1.212
ArgVal: 7.207 ± 4.85
ArgTrp: 1.802 ± 1.212
ArgTyr: 7.207 ± 2.237
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 0.0 ± 0.0
SerCys: 0.0 ± 0.0
SerAsp: 0.0 ± 0.0
SerGlu: 1.802 ± 1.212
SerPhe: 3.604 ± 2.802
SerGly: 1.802 ± 1.401
SerHis: 0.0 ± 0.0
SerIle: 1.802 ± 1.212
SerLys: 7.207 ± 2.99
SerLeu: 1.802 ± 1.401
SerMet: 0.0 ± 0.0
SerAsn: 3.604 ± 2.425
SerPro: 1.802 ± 1.212
SerGln: 3.604 ± 0.188
SerArg: 1.802 ± 1.212
SerSer: 1.802 ± 1.212
SerThr: 1.802 ± 1.212
SerVal: 0.0 ± 0.0
SerTrp: 1.802 ± 1.401
SerTyr: 0.0 ± 0.0
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.604 ± 2.802
ThrCys: 0.0 ± 0.0
ThrAsp: 5.405 ± 1.589
ThrGlu: 1.802 ± 1.401
ThrPhe: 3.604 ± 0.188
ThrGly: 0.0 ± 0.0
ThrHis: 0.0 ± 0.0
ThrIle: 1.802 ± 1.212
ThrLys: 5.405 ± 4.202
ThrLeu: 5.405 ± 1.589
ThrMet: 1.802 ± 1.212
ThrAsn: 0.0 ± 0.0
ThrPro: 5.405 ± 1.024
ThrGln: 7.207 ± 2.237
ThrArg: 1.802 ± 1.212
ThrSer: 7.207 ± 0.377
ThrThr: 9.009 ± 4.391
ThrVal: 3.604 ± 2.425
ThrTrp: 1.802 ± 1.212
ThrTyr: 0.0 ± 0.0
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.802 ± 1.212
ValCys: 1.802 ± 1.212
ValAsp: 1.802 ± 1.212
ValGlu: 0.0 ± 0.0
ValPhe: 5.405 ± 1.024
ValGly: 5.405 ± 3.637
ValHis: 0.0 ± 0.0
ValIle: 3.604 ± 0.188
ValLys: 9.009 ± 3.449
ValLeu: 3.604 ± 2.425
ValMet: 0.0 ± 0.0
ValAsn: 1.802 ± 1.401
ValPro: 7.207 ± 0.377
ValGln: 0.0 ± 0.0
ValArg: 3.604 ± 0.188
ValSer: 1.802 ± 1.212
ValThr: 1.802 ± 1.212
ValVal: 3.604 ± 0.188
ValTrp: 0.0 ± 0.0
ValTyr: 3.604 ± 2.425
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 1.802 ± 1.212
TrpAsp: 5.405 ± 3.637
TrpGlu: 0.0 ± 0.0
TrpPhe: 5.405 ± 4.202
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.0 ± 0.0
TrpLeu: 3.604 ± 2.802
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 1.802 ± 1.401
TrpArg: 0.0 ± 0.0
TrpSer: 0.0 ± 0.0
TrpThr: 3.604 ± 0.188
TrpVal: 3.604 ± 0.188
TrpTrp: 5.405 ± 1.589
TrpTyr: 1.802 ± 1.212
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.0 ± 0.0
TyrCys: 5.405 ± 3.637
TyrAsp: 0.0 ± 0.0
TyrGlu: 0.0 ± 0.0
TyrPhe: 1.802 ± 1.212
TyrGly: 3.604 ± 0.188
TyrHis: 1.802 ± 1.401
TyrIle: 3.604 ± 0.188
TyrLys: 0.0 ± 0.0
TyrLeu: 7.207 ± 2.237
TyrMet: 3.604 ± 3.1
TyrAsn: 0.0 ± 0.0
TyrPro: 1.802 ± 1.212
TyrGln: 0.0 ± 0.0
TyrArg: 5.405 ± 1.589
TyrSer: 3.604 ± 2.802
TyrThr: 3.604 ± 2.425
TyrVal: 3.604 ± 2.425
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.802 ± 1.401
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (556 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
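A minimal sketch of what protein-level bootstrapping could look like is given below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of those estimates is reported. The function names, the fixed seed, the use of one-letter codes, and the choice of the population standard deviation as the error measure are assumptions for illustration; only the resampling unit (proteins) and the number of rounds (100) come from the note above.

```python
import random
import statistics


def pair_permille(sequences, dipeptide):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    hits = 0
    total = 0
    for seq in sequences:
        for a, b in zip(seq, seq[1:]):  # overlapping adjacent pairs
            total += 1
            hits += (a + b == dipeptide)
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, dipeptide, rounds=100, seed=0):
    """Spread of the frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(rounds):
        # Resample whole proteins with replacement, keeping the same count.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_permille(sample, dipeptide))
    # Assumption: the error is summarised as a standard deviation.
    return statistics.pstdev(estimates)


# Made-up toy sequences, not the proteins behind this page.
toy = ["MKRRARRA", "MATTLLGG"]
print(bootstrap_error(toy, "RR"))
print(bootstrap_error(toy, "WW"))  # absent dipeptide: every resample gives 0
```

A dipeptide that occurs in none of the proteins has a frequency of 0 in every resample, which is consistent with the "0.0 ± 0.0" entries in the table. With only two proteins, each resample is one of just a few combinations, so the estimated errors are coarse.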

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski