Amino acid dipeptide frequency for Circovirus-like genome SAR-A

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
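As a minimal sketch of how such a frequency could be computed, the snippet below counts overlapping residue pairs within each protein and divides by the total number of pairs. The function name `dipeptide_frequencies`, the use of one-letter codes, and the choice of overlapping windows that never span two proteins are assumptions made for illustration; the database's exact counting procedure is not described on this page and may differ.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return {dipeptide: fraction} over all overlapping pairs in the proteins."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}

# Two toy sequences (hypothetical, not taken from this proteome).
freqs = dipeptide_frequencies(["MKKA", "AKK"])
```

Here "KK" occurs in two of the five pairs, so its fraction is 0.4; "AA" is absent because the end of one protein is not joined to the start of the next.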

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred in proteins at random with equal probability, each one would be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
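The expectation and the colouring rule can be sketched as follows. This assumes a simple comparison against the uniform expectation (1/400, or 1/441 when Xaa is included); the helper names `uniform_expectation` and `classify` are hypothetical, and the exact threshold used by the page is not stated.

```python
def uniform_expectation(alphabet_size=20):
    """Expected fraction of each dipeptide if all pairs were equally likely."""
    return 1.0 / (alphabet_size ** 2)

def classify(fraction, alphabet_size=20):
    """Label a dipeptide fraction relative to the uniform expectation."""
    expected = uniform_expectation(alphabet_size)
    if fraction > expected:
        return "over"   # corresponds to red in the table
    if fraction < expected:
        return "under"  # corresponds to blue in the table
    return "expected"

# ValGly is listed at 13.216 per mille, i.e. a fraction of 0.013216.
label = classify(0.013216)
```

With 20 amino acids the expectation is 0.0025, so ValGly would be labelled as overrepresented, while any dipeptide with a frequency of 0.0 would be labelled as underrepresented.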

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
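For readers who want to work with the values programmatically, the following sketch parses one table entry of the form "AlaAla: 4.405 ± 3.969" and converts both numbers from per mille to plain fractions. The regular expression and the function name `parse_entry` are assumptions based only on how the entries appear on this page.

```python
import re

# First residue, second residue, value and error, e.g. "AlaAla: 4.405 ± 3.969".
ENTRY = re.compile(r"([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*([\d.]+)\s*±\s*([\d.]+)")

def parse_entry(line):
    """Return (first, second, fraction, error_fraction), or None if no match."""
    match = ENTRY.search(line)
    if match is None:
        return None
    first, second, value, error = match.groups()
    # Per-mille values become plain fractions when multiplied by 10^-3.
    return first, second, float(value) * 1e-3, float(error) * 1e-3

entry = parse_entry("AlaAla: 4.405 ± 3.969")
```

Because the pattern is applied with `search`, it also tolerates text before the dipeptide label, and it returns `None` for lines that carry no value, such as a bare row label.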

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
4.405AlaAla: 4.405 ± 3.969
0.0AlaCys: 0.0 ± 0.0
0.0AlaAsp: 0.0 ± 0.0
2.203AlaGlu: 2.203 ± 1.286
2.203AlaPhe: 2.203 ± 1.286
4.405AlaGly: 4.405 ± 2.571
0.0AlaHis: 0.0 ± 0.0
0.0AlaIle: 0.0 ± 0.0
6.608AlaLys: 6.608 ± 5.953
4.405AlaLeu: 4.405 ± 0.699
6.608AlaMet: 6.608 ± 5.953
2.203AlaAsn: 2.203 ± 1.286
4.405AlaPro: 4.405 ± 2.571
6.608AlaGln: 6.608 ± 0.587
2.203AlaArg: 2.203 ± 1.984
2.203AlaSer: 2.203 ± 1.286
2.203AlaThr: 2.203 ± 1.984
2.203AlaVal: 2.203 ± 1.286
2.203AlaTrp: 2.203 ± 1.984
0.0AlaTyr: 0.0 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
2.203CysGlu: 2.203 ± 1.984
4.405CysPhe: 4.405 ± 3.969
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.0CysLys: 0.0 ± 0.0
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
2.203CysArg: 2.203 ± 1.286
2.203CysSer: 2.203 ± 1.984
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
2.203CysTyr: 2.203 ± 1.984
0.0CysXaa: 0.0 ± 0.0
Asp
4.405AspAla: 4.405 ± 0.699
2.203AspCys: 2.203 ± 1.984
2.203AspAsp: 2.203 ± 1.984
4.405AspGlu: 4.405 ± 0.699
4.405AspPhe: 4.405 ± 3.969
4.405AspGly: 4.405 ± 3.969
0.0AspHis: 0.0 ± 0.0
4.405AspIle: 4.405 ± 0.699
6.608AspLys: 6.608 ± 5.953
8.811AspLeu: 8.811 ± 1.873
0.0AspMet: 0.0 ± 0.0
0.0AspAsn: 0.0 ± 0.0
0.0AspPro: 0.0 ± 0.0
0.0AspGln: 0.0 ± 0.0
0.0AspArg: 0.0 ± 0.0
0.0AspSer: 0.0 ± 0.0
0.0AspThr: 0.0 ± 0.0
2.203AspVal: 2.203 ± 1.286
0.0AspTrp: 0.0 ± 0.0
2.203AspTyr: 2.203 ± 1.286
0.0AspXaa: 0.0 ± 0.0
Glu
6.608GluAla: 6.608 ± 2.683
0.0GluCys: 0.0 ± 0.0
2.203GluAsp: 2.203 ± 1.286
2.203GluGlu: 2.203 ± 1.286
0.0GluPhe: 0.0 ± 0.0
4.405GluGly: 4.405 ± 0.699
2.203GluHis: 2.203 ± 1.984
4.405GluIle: 4.405 ± 3.969
4.405GluLys: 4.405 ± 3.969
4.405GluLeu: 4.405 ± 2.571
0.0GluMet: 0.0 ± 0.0
2.203GluAsn: 2.203 ± 1.984
2.203GluPro: 2.203 ± 1.286
0.0GluGln: 0.0 ± 0.0
2.203GluArg: 2.203 ± 1.984
4.405GluSer: 4.405 ± 2.571
6.608GluThr: 6.608 ± 2.683
8.811GluVal: 8.811 ± 1.873
0.0GluTrp: 0.0 ± 0.0
2.203GluTyr: 2.203 ± 1.984
0.0GluXaa: 0.0 ± 0.0
Phe
2.203PheAla: 2.203 ± 1.286
0.0PheCys: 0.0 ± 0.0
2.203PheAsp: 2.203 ± 1.984
4.405PheGlu: 4.405 ± 3.969
2.203PhePhe: 2.203 ± 1.286
0.0PheGly: 0.0 ± 0.0
0.0PheHis: 0.0 ± 0.0
4.405PheIle: 4.405 ± 3.969
4.405PheLys: 4.405 ± 2.571
2.203PheLeu: 2.203 ± 1.286
2.203PheMet: 2.203 ± 1.984
2.203PheAsn: 2.203 ± 1.286
0.0PhePro: 0.0 ± 0.0
4.405PheGln: 4.405 ± 2.571
8.811PheArg: 8.811 ± 1.873
4.405PheSer: 4.405 ± 0.699
2.203PheThr: 2.203 ± 1.984
2.203PheVal: 2.203 ± 1.286
0.0PheTrp: 0.0 ± 0.0
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
4.405GlyAla: 4.405 ± 3.969
2.203GlyCys: 2.203 ± 1.984
4.405GlyAsp: 4.405 ± 0.699
6.608GlyGlu: 6.608 ± 3.857
4.405GlyPhe: 4.405 ± 2.571
4.405GlyGly: 4.405 ± 2.571
2.203GlyHis: 2.203 ± 1.286
4.405GlyIle: 4.405 ± 0.699
2.203GlyLys: 2.203 ± 1.984
2.203GlyLeu: 2.203 ± 1.286
0.0GlyMet: 0.0 ± 0.0
4.405GlyAsn: 4.405 ± 0.699
0.0GlyPro: 0.0 ± 0.0
4.405GlyGln: 4.405 ± 0.699
2.203GlyArg: 2.203 ± 1.984
4.405GlySer: 4.405 ± 2.571
8.811GlyThr: 8.811 ± 1.873
2.203GlyVal: 2.203 ± 1.286
0.0GlyTrp: 0.0 ± 0.0
6.608GlyTyr: 6.608 ± 0.587
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
0.0HisGlu: 0.0 ± 0.0
2.203HisPhe: 2.203 ± 1.984
4.405HisGly: 4.405 ± 0.699
0.0HisHis: 0.0 ± 0.0
2.203HisIle: 2.203 ± 1.984
2.203HisLys: 2.203 ± 1.286
4.405HisLeu: 4.405 ± 0.699
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
0.0HisPro: 0.0 ± 0.0
0.0HisGln: 0.0 ± 0.0
0.0HisArg: 0.0 ± 0.0
2.203HisSer: 2.203 ± 1.286
4.405HisThr: 4.405 ± 2.571
2.203HisVal: 2.203 ± 1.286
0.0HisTrp: 0.0 ± 0.0
2.203HisTyr: 2.203 ± 1.286
0.0HisXaa: 0.0 ± 0.0
Ile
2.203IleAla: 2.203 ± 1.286
0.0IleCys: 0.0 ± 0.0
2.203IleAsp: 2.203 ± 1.984
6.608IleGlu: 6.608 ± 2.683
2.203IlePhe: 2.203 ± 1.286
4.405IleGly: 4.405 ± 0.699
2.203IleHis: 2.203 ± 1.286
0.0IleIle: 0.0 ± 0.0
4.405IleLys: 4.405 ± 2.571
0.0IleLeu: 0.0 ± 0.0
2.203IleMet: 2.203 ± 3.07
4.405IleAsn: 4.405 ± 0.699
0.0IlePro: 0.0 ± 0.0
0.0IleGln: 0.0 ± 0.0
0.0IleArg: 0.0 ± 0.0
4.405IleSer: 4.405 ± 0.699
4.405IleThr: 4.405 ± 0.699
2.203IleVal: 2.203 ± 1.984
2.203IleTrp: 2.203 ± 1.984
0.0IleTyr: 0.0 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
4.405LysAla: 4.405 ± 3.969
0.0LysCys: 0.0 ± 0.0
6.608LysAsp: 6.608 ± 2.683
8.811LysGlu: 8.811 ± 1.397
2.203LysPhe: 2.203 ± 1.984
2.203LysGly: 2.203 ± 1.984
2.203LysHis: 2.203 ± 1.286
0.0LysIle: 0.0 ± 0.0
11.013LysLys: 11.013 ± 0.112
4.405LysLeu: 4.405 ± 2.571
2.203LysMet: 2.203 ± 1.286
6.608LysAsn: 6.608 ± 0.587
6.608LysPro: 6.608 ± 0.587
4.405LysGln: 4.405 ± 0.699
0.0LysArg: 0.0 ± 0.0
8.811LysSer: 8.811 ± 5.143
0.0LysThr: 0.0 ± 0.0
2.203LysVal: 2.203 ± 1.984
2.203LysTrp: 2.203 ± 1.286
4.405LysTyr: 4.405 ± 3.969
0.0LysXaa: 0.0 ± 0.0
Leu
0.0LeuAla: 0.0 ± 0.0
0.0LeuCys: 0.0 ± 0.0
2.203LeuAsp: 2.203 ± 1.286
2.203LeuGlu: 2.203 ± 1.286
2.203LeuPhe: 2.203 ± 1.286
2.203LeuGly: 2.203 ± 1.286
2.203LeuHis: 2.203 ± 1.286
0.0LeuIle: 0.0 ± 0.0
4.405LeuLys: 4.405 ± 2.571
6.608LeuLeu: 6.608 ± 3.857
0.0LeuMet: 0.0 ± 0.0
4.405LeuAsn: 4.405 ± 0.699
4.405LeuPro: 4.405 ± 0.699
6.608LeuGln: 6.608 ± 0.587
2.203LeuArg: 2.203 ± 1.984
0.0LeuSer: 0.0 ± 0.0
8.811LeuThr: 8.811 ± 5.143
4.405LeuVal: 4.405 ± 0.699
2.203LeuTrp: 2.203 ± 1.286
6.608LeuTyr: 6.608 ± 0.587
0.0LeuXaa: 0.0 ± 0.0
Met
2.203MetAla: 2.203 ± 1.286
0.0MetCys: 0.0 ± 0.0
2.203MetAsp: 2.203 ± 1.984
0.0MetGlu: 0.0 ± 0.0
0.0MetPhe: 0.0 ± 0.0
2.203MetGly: 2.203 ± 1.286
6.608MetHis: 6.608 ± 2.683
0.0MetIle: 0.0 ± 0.0
2.203MetLys: 2.203 ± 1.286
0.0MetLeu: 0.0 ± 0.0
4.405MetMet: 4.405 ± 0.699
0.0MetAsn: 0.0 ± 0.0
4.405MetPro: 4.405 ± 0.699
2.203MetGln: 2.203 ± 1.984
0.0MetArg: 0.0 ± 0.0
2.203MetSer: 2.203 ± 1.286
2.203MetThr: 2.203 ± 1.984
2.203MetVal: 2.203 ± 1.984
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.203AsnAla: 2.203 ± 1.984
0.0AsnCys: 0.0 ± 0.0
4.405AsnAsp: 4.405 ± 0.699
6.608AsnGlu: 6.608 ± 2.683
4.405AsnPhe: 4.405 ± 2.571
6.608AsnGly: 6.608 ± 0.587
0.0AsnHis: 0.0 ± 0.0
6.608AsnIle: 6.608 ± 0.587
2.203AsnLys: 2.203 ± 1.286
4.405AsnLeu: 4.405 ± 2.571
2.203AsnMet: 2.203 ± 0.998
8.811AsnAsn: 8.811 ± 1.873
8.811AsnPro: 8.811 ± 1.873
2.203AsnGln: 2.203 ± 1.286
2.203AsnArg: 2.203 ± 1.984
0.0AsnSer: 0.0 ± 0.0
4.405AsnThr: 4.405 ± 2.571
0.0AsnVal: 0.0 ± 0.0
4.405AsnTrp: 4.405 ± 2.571
6.608AsnTyr: 6.608 ± 5.953
0.0AsnXaa: 0.0 ± 0.0
Pro
2.203ProAla: 2.203 ± 1.984
2.203ProCys: 2.203 ± 1.984
2.203ProAsp: 2.203 ± 1.286
2.203ProGlu: 2.203 ± 1.984
2.203ProPhe: 2.203 ± 1.286
2.203ProGly: 2.203 ± 1.286
4.405ProHis: 4.405 ± 0.699
2.203ProIle: 2.203 ± 1.984
2.203ProLys: 2.203 ± 1.286
2.203ProLeu: 2.203 ± 1.984
0.0ProMet: 0.0 ± 0.0
6.608ProAsn: 6.608 ± 3.857
0.0ProPro: 0.0 ± 0.0
0.0ProGln: 0.0 ± 0.0
6.608ProArg: 6.608 ± 0.587
2.203ProSer: 2.203 ± 1.286
0.0ProThr: 0.0 ± 0.0
4.405ProVal: 4.405 ± 2.571
0.0ProTrp: 0.0 ± 0.0
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
4.405GlnAla: 4.405 ± 0.699
0.0GlnCys: 0.0 ± 0.0
0.0GlnAsp: 0.0 ± 0.0
0.0GlnGlu: 0.0 ± 0.0
2.203GlnPhe: 2.203 ± 1.286
6.608GlnGly: 6.608 ± 2.683
2.203GlnHis: 2.203 ± 1.286
0.0GlnIle: 0.0 ± 0.0
2.203GlnLys: 2.203 ± 1.984
0.0GlnLeu: 0.0 ± 0.0
0.0GlnMet: 0.0 ± 0.0
4.405GlnAsn: 4.405 ± 0.699
2.203GlnPro: 2.203 ± 1.286
2.203GlnGln: 2.203 ± 1.286
6.608GlnArg: 6.608 ± 0.587
2.203GlnSer: 2.203 ± 1.286
0.0GlnThr: 0.0 ± 0.0
4.405GlnVal: 4.405 ± 2.571
0.0GlnTrp: 0.0 ± 0.0
4.405GlnTyr: 4.405 ± 2.571
0.0GlnXaa: 0.0 ± 0.0
Arg
4.405ArgAla: 4.405 ± 0.699
0.0ArgCys: 0.0 ± 0.0
2.203ArgAsp: 2.203 ± 1.984
2.203ArgGlu: 2.203 ± 1.286
6.608ArgPhe: 6.608 ± 2.683
0.0ArgGly: 0.0 ± 0.0
0.0ArgHis: 0.0 ± 0.0
4.405ArgIle: 4.405 ± 2.571
2.203ArgLys: 2.203 ± 1.984
2.203ArgLeu: 2.203 ± 1.286
2.203ArgMet: 2.203 ± 1.286
0.0ArgAsn: 0.0 ± 0.0
2.203ArgPro: 2.203 ± 1.984
2.203ArgGln: 2.203 ± 1.286
2.203ArgArg: 2.203 ± 1.286
6.608ArgSer: 6.608 ± 3.857
2.203ArgThr: 2.203 ± 1.286
2.203ArgVal: 2.203 ± 1.984
0.0ArgTrp: 0.0 ± 0.0
4.405ArgTyr: 4.405 ± 3.969
0.0ArgXaa: 0.0 ± 0.0
Ser
2.203SerAla: 2.203 ± 1.286
2.203SerCys: 2.203 ± 1.286
4.405SerAsp: 4.405 ± 0.699
4.405SerGlu: 4.405 ± 0.699
0.0SerPhe: 0.0 ± 0.0
4.405SerGly: 4.405 ± 2.571
0.0SerHis: 0.0 ± 0.0
0.0SerIle: 0.0 ± 0.0
6.608SerLys: 6.608 ± 0.587
4.405SerLeu: 4.405 ± 2.571
0.0SerMet: 0.0 ± 0.0
11.013SerAsn: 11.013 ± 3.158
2.203SerPro: 2.203 ± 1.286
4.405SerGln: 4.405 ± 2.571
2.203SerArg: 2.203 ± 1.286
2.203SerSer: 2.203 ± 1.286
2.203SerThr: 2.203 ± 1.286
8.811SerVal: 8.811 ± 5.143
0.0SerTrp: 0.0 ± 0.0
2.203SerTyr: 2.203 ± 1.286
0.0SerXaa: 0.0 ± 0.0
Thr
4.405ThrAla: 4.405 ± 2.571
0.0ThrCys: 0.0 ± 0.0
2.203ThrAsp: 2.203 ± 1.984
2.203ThrGlu: 2.203 ± 1.984
2.203ThrPhe: 2.203 ± 1.286
2.203ThrGly: 2.203 ± 1.286
0.0ThrHis: 0.0 ± 0.0
6.608ThrIle: 6.608 ± 0.587
2.203ThrLys: 2.203 ± 1.286
2.203ThrLeu: 2.203 ± 1.984
2.203ThrMet: 2.203 ± 1.286
8.811ThrAsn: 8.811 ± 1.397
2.203ThrPro: 2.203 ± 1.984
0.0ThrGln: 0.0 ± 0.0
4.405ThrArg: 4.405 ± 2.571
8.811ThrSer: 8.811 ± 5.143
4.405ThrThr: 4.405 ± 2.571
4.405ThrVal: 4.405 ± 0.699
0.0ThrTrp: 0.0 ± 0.0
4.405ThrTyr: 4.405 ± 0.699
0.0ThrXaa: 0.0 ± 0.0
Val
2.203ValAla: 2.203 ± 1.286
0.0ValCys: 0.0 ± 0.0
2.203ValAsp: 2.203 ± 1.286
2.203ValGlu: 2.203 ± 1.984
2.203ValPhe: 2.203 ± 1.286
13.216ValGly: 13.216 ± 1.174
0.0ValHis: 0.0 ± 0.0
2.203ValIle: 2.203 ± 1.286
6.608ValLys: 6.608 ± 3.857
4.405ValLeu: 4.405 ± 0.699
2.203ValMet: 2.203 ± 1.286
6.608ValAsn: 6.608 ± 0.587
0.0ValPro: 0.0 ± 0.0
2.203ValGln: 2.203 ± 1.984
2.203ValArg: 2.203 ± 1.984
4.405ValSer: 4.405 ± 2.571
2.203ValThr: 2.203 ± 1.286
2.203ValVal: 2.203 ± 1.286
0.0ValTrp: 0.0 ± 0.0
0.0ValTyr: 0.0 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
2.203TrpCys: 2.203 ± 1.984
0.0TrpAsp: 0.0 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
2.203TrpGly: 2.203 ± 1.286
0.0TrpHis: 0.0 ± 0.0
2.203TrpIle: 2.203 ± 1.286
2.203TrpLys: 2.203 ± 1.984
2.203TrpLeu: 2.203 ± 1.286
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
2.203TrpPro: 2.203 ± 1.286
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
0.0TrpSer: 0.0 ± 0.0
0.0TrpThr: 0.0 ± 0.0
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.203TyrAla: 2.203 ± 1.286
2.203TyrCys: 2.203 ± 1.984
4.405TyrAsp: 4.405 ± 3.969
0.0TyrGlu: 0.0 ± 0.0
2.203TyrPhe: 2.203 ± 1.984
0.0TyrGly: 0.0 ± 0.0
2.203TyrHis: 2.203 ± 1.286
2.203TyrIle: 2.203 ± 1.984
4.405TyrLys: 4.405 ± 0.699
2.203TyrLeu: 2.203 ± 1.286
4.405TyrMet: 4.405 ± 0.699
4.405TyrAsn: 4.405 ± 0.699
2.203TyrPro: 2.203 ± 1.984
2.203TyrGln: 2.203 ± 1.286
2.203TyrArg: 2.203 ± 1.286
2.203TyrSer: 2.203 ± 1.984
8.811TyrThr: 8.811 ± 1.397
0.0TyrVal: 0.0 ± 0.0
0.0TyrTrp: 0.0 ± 0.0
2.203TyrTyr: 2.203 ± 1.984
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (455 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
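A protein-level bootstrap of this kind might look like the sketch below. It assumes that "×100" means 100 resamples, that whole proteins are drawn with replacement, and that the reported error is the standard deviation across resamples; none of these details is confirmed on this page, and the function names are hypothetical.

```python
import random
import statistics
from collections import Counter

def per_mille(proteins, pair):
    """Frequency of `pair` (two one-letter codes) in per mille."""
    counts = Counter(seq[i:i + 2] for seq in proteins for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    values = []
    for _ in range(replicates):
        # Draw whole proteins with replacement, keeping the sample size fixed.
        sample = rng.choices(proteins, k=len(proteins))
        values.append(per_mille(sample, pair))
    return statistics.pstdev(values)

# Toy input (hypothetical sequences, not this proteome).
error = bootstrap_error(["MKKA", "AKKR"], "KK")
```

Under these assumptions, a dipeptide that is absent from every protein has the same frequency in every resample and so gets an error of 0.0, which matches the "0.0 ± 0.0" entries in the table. With only two proteins, each resample can take very few distinct compositions, so the estimated errors are coarse.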

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski