Amino acid dipeptide frequency for Golden silk orbweaver associated circular virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
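As an illustration of the calculation described above, here is a minimal Python sketch that counts overlapping residue pairs and compares each frequency with the uniform expectation of 2.5‰. It is not the code used to build this page: the function names, the one-letter input alphabet, and the choice to count pairs only within each protein are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids (one-letter codes); anything else becomes X (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"

# Uniform expectation over the 400 standard dipeptides: 0.25 % = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides.

    Assumption: pairs are counted inside each protein only, and the
    frequencies are normalised by the total number of pairs.
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_permille):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > EXPECTED_PERMILLE:
        return "over"
    if freq_permille < EXPECTED_PERMILLE:
        return "under"
    return "expected"
```

For example, `dipeptide_permille(["AAAK", "KK"])` sees four pairs (AA, AA, AK, KK), so AA is reported at 500‰ and classified as "over", while any pair that never occurs is reported at 0‰ and classified as "under".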

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
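The unit conversion can be sketched in a couple of lines of Python. The helper names are hypothetical and serve only to illustrate the arithmetic.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0
```

Applied to the IleLys entry of the table (13.245‰), this gives a fraction of about 0.013245, or about 1.3245%.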

For more information, see the sequence space article on Wikipedia.

Dipeptide: frequency ± error (‰), grouped by first residue. Residue order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 1.656 ± 1.034
AlaCys: 1.656 ± 1.034
AlaAsp: 1.656 ± 1.177
AlaGlu: 3.311 ± 2.069
AlaPhe: 3.311 ± 2.069
AlaGly: 1.656 ± 1.034
AlaHis: 1.656 ± 1.034
AlaIle: 1.656 ± 1.034
AlaLys: 4.967 ± 3.531
AlaLeu: 3.311 ± 0.143
AlaMet: 0.0 ± 0.0
AlaAsn: 0.0 ± 0.0
AlaPro: 1.656 ± 1.177
AlaGln: 1.656 ± 1.177
AlaArg: 3.311 ± 2.069
AlaSer: 4.967 ± 3.531
AlaThr: 1.656 ± 1.034
AlaVal: 6.623 ± 0.286
AlaTrp: 1.656 ± 1.177
AlaTyr: 0.0 ± 0.0
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 3.311 ± 2.069
CysGlu: 1.656 ± 1.177
CysPhe: 1.656 ± 1.034
CysGly: 3.311 ± 2.069
CysHis: 0.0 ± 0.0
CysIle: 3.311 ± 2.069
CysLys: 4.967 ± 3.103
CysLeu: 1.656 ± 1.034
CysMet: 0.0 ± 0.0
CysAsn: 3.311 ± 2.069
CysPro: 1.656 ± 1.034
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 1.656 ± 1.177
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0

Asp
AspAla: 3.311 ± 2.354
AspCys: 0.0 ± 0.0
AspAsp: 0.0 ± 0.0
AspGlu: 3.311 ± 2.069
AspPhe: 4.967 ± 1.32
AspGly: 4.967 ± 3.103
AspHis: 1.656 ± 1.034
AspIle: 6.623 ± 0.286
AspLys: 0.0 ± 0.0
AspLeu: 1.656 ± 1.034
AspMet: 1.656 ± 1.034
AspAsn: 4.967 ± 0.892
AspPro: 0.0 ± 0.0
AspGln: 0.0 ± 0.0
AspArg: 1.656 ± 1.177
AspSer: 1.656 ± 1.177
AspThr: 4.967 ± 1.32
AspVal: 4.967 ± 0.892
AspTrp: 0.0 ± 0.0
AspTyr: 6.623 ± 1.926
AspXaa: 0.0 ± 0.0

Glu
GluAla: 1.656 ± 1.034
GluCys: 0.0 ± 0.0
GluAsp: 3.311 ± 2.069
GluGlu: 4.967 ± 3.103
GluPhe: 0.0 ± 0.0
GluGly: 0.0 ± 0.0
GluHis: 1.656 ± 1.034
GluIle: 6.623 ± 0.286
GluLys: 1.656 ± 1.034
GluLeu: 3.311 ± 2.354
GluMet: 1.656 ± 0.809
GluAsn: 3.311 ± 0.143
GluPro: 3.311 ± 2.069
GluGln: 1.656 ± 1.177
GluArg: 4.967 ± 1.32
GluSer: 3.311 ± 2.069
GluThr: 1.656 ± 1.034
GluVal: 3.311 ± 2.069
GluTrp: 1.656 ± 1.034
GluTyr: 3.311 ± 0.143
GluXaa: 0.0 ± 0.0

Phe
PheAla: 1.656 ± 1.177
PheCys: 0.0 ± 0.0
PheAsp: 1.656 ± 1.177
PheGlu: 3.311 ± 0.143
PhePhe: 3.311 ± 2.354
PheGly: 3.311 ± 2.069
PheHis: 1.656 ± 1.177
PheIle: 3.311 ± 0.143
PheLys: 3.311 ± 2.069
PheLeu: 1.656 ± 1.177
PheMet: 4.967 ± 1.32
PheAsn: 9.934 ± 4.851
PhePro: 1.656 ± 1.177
PheGln: 1.656 ± 1.177
PheArg: 3.311 ± 2.354
PheSer: 1.656 ± 1.034
PheThr: 4.967 ± 0.892
PheVal: 0.0 ± 0.0
PheTrp: 0.0 ± 0.0
PheTyr: 6.623 ± 2.497
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 1.656 ± 1.034
GlyCys: 1.656 ± 1.034
GlyAsp: 3.311 ± 2.069
GlyGlu: 1.656 ± 1.034
GlyPhe: 0.0 ± 0.0
GlyGly: 3.311 ± 0.143
GlyHis: 1.656 ± 1.177
GlyIle: 6.623 ± 2.497
GlyLys: 3.311 ± 2.069
GlyLeu: 1.656 ± 1.034
GlyMet: 0.0 ± 0.0
GlyAsn: 4.967 ± 1.32
GlyPro: 1.656 ± 1.034
GlyGln: 0.0 ± 0.0
GlyArg: 4.967 ± 0.892
GlySer: 3.311 ± 2.069
GlyThr: 1.656 ± 1.034
GlyVal: 6.623 ± 0.286
GlyTrp: 0.0 ± 0.0
GlyTyr: 4.967 ± 3.103
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.0 ± 0.0
HisCys: 0.0 ± 0.0
HisAsp: 1.656 ± 1.177
HisGlu: 6.623 ± 0.286
HisPhe: 1.656 ± 1.034
HisGly: 1.656 ± 1.034
HisHis: 0.0 ± 0.0
HisIle: 0.0 ± 0.0
HisLys: 0.0 ± 0.0
HisLeu: 3.311 ± 2.069
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 0.0 ± 0.0
HisGln: 0.0 ± 0.0
HisArg: 0.0 ± 0.0
HisSer: 0.0 ± 0.0
HisThr: 0.0 ± 0.0
HisVal: 1.656 ± 1.177
HisTrp: 0.0 ± 0.0
HisTyr: 1.656 ± 1.177
HisXaa: 0.0 ± 0.0

Ile
IleAla: 3.311 ± 2.354
IleCys: 0.0 ± 0.0
IleAsp: 1.656 ± 1.177
IleGlu: 3.311 ± 2.069
IlePhe: 1.656 ± 1.034
IleGly: 1.656 ± 1.034
IleHis: 3.311 ± 0.143
IleIle: 1.656 ± 1.177
IleLys: 13.245 ± 0.571
IleLeu: 4.967 ± 0.892
IleMet: 0.0 ± 0.0
IleAsn: 0.0 ± 0.0
IlePro: 3.311 ± 0.143
IleGln: 1.656 ± 1.177
IleArg: 3.311 ± 0.143
IleSer: 8.278 ± 2.96
IleThr: 6.623 ± 0.286
IleVal: 1.656 ± 1.034
IleTrp: 3.311 ± 0.143
IleTyr: 1.656 ± 1.177
IleXaa: 0.0 ± 0.0

Lys
LysAla: 1.656 ± 1.034
LysCys: 3.311 ± 0.143
LysAsp: 4.967 ± 3.103
LysGlu: 6.623 ± 1.926
LysPhe: 6.623 ± 2.497
LysGly: 1.656 ± 1.034
LysHis: 0.0 ± 0.0
LysIle: 8.278 ± 2.96
LysLys: 9.934 ± 0.428
LysLeu: 3.311 ± 0.143
LysMet: 1.656 ± 1.034
LysAsn: 8.278 ± 0.749
LysPro: 1.656 ± 1.177
LysGln: 1.656 ± 1.177
LysArg: 9.934 ± 2.64
LysSer: 1.656 ± 1.177
LysThr: 0.0 ± 0.0
LysVal: 1.656 ± 1.177
LysTrp: 4.967 ± 3.103
LysTyr: 1.656 ± 1.177
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 3.311 ± 2.069
LeuCys: 3.311 ± 2.069
LeuAsp: 4.967 ± 0.892
LeuGlu: 0.0 ± 0.0
LeuPhe: 1.656 ± 1.034
LeuGly: 1.656 ± 1.177
LeuHis: 1.656 ± 1.034
LeuIle: 3.311 ± 0.143
LeuLys: 6.623 ± 0.286
LeuLeu: 1.656 ± 1.034
LeuMet: 1.656 ± 1.177
LeuAsn: 1.656 ± 1.034
LeuPro: 1.656 ± 1.177
LeuGln: 4.967 ± 3.103
LeuArg: 3.311 ± 0.143
LeuSer: 6.623 ± 1.926
LeuThr: 1.656 ± 1.177
LeuVal: 3.311 ± 0.143
LeuTrp: 0.0 ± 0.0
LeuTyr: 0.0 ± 0.0
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.656 ± 1.177
MetCys: 0.0 ± 0.0
MetAsp: 0.0 ± 0.0
MetGlu: 0.0 ± 0.0
MetPhe: 1.656 ± 1.177
MetGly: 0.0 ± 0.0
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 3.311 ± 2.069
MetLeu: 1.656 ± 1.034
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 0.0 ± 0.0
MetGln: 1.656 ± 1.177
MetArg: 0.0 ± 0.0
MetSer: 3.311 ± 2.069
MetThr: 1.656 ± 1.034
MetVal: 1.656 ± 1.177
MetTrp: 1.656 ± 1.177
MetTyr: 1.656 ± 1.034
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 6.623 ± 2.497
AsnCys: 0.0 ± 0.0
AsnAsp: 0.0 ± 0.0
AsnGlu: 1.656 ± 1.034
AsnPhe: 1.656 ± 1.177
AsnGly: 1.656 ± 1.034
AsnHis: 1.656 ± 1.177
AsnIle: 6.623 ± 2.497
AsnLys: 3.311 ± 0.143
AsnLeu: 1.656 ± 1.177
AsnMet: 0.0 ± 0.0
AsnAsn: 4.967 ± 3.531
AsnPro: 0.0 ± 0.0
AsnGln: 0.0 ± 0.0
AsnArg: 4.967 ± 3.103
AsnSer: 3.311 ± 0.143
AsnThr: 6.623 ± 0.286
AsnVal: 9.934 ± 1.783
AsnTrp: 1.656 ± 1.177
AsnTyr: 3.311 ± 0.143
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 0.0 ± 0.0
ProCys: 0.0 ± 0.0
ProAsp: 3.311 ± 0.143
ProGlu: 1.656 ± 1.177
ProPhe: 6.623 ± 2.497
ProGly: 3.311 ± 2.069
ProHis: 0.0 ± 0.0
ProIle: 0.0 ± 0.0
ProLys: 1.656 ± 1.034
ProLeu: 1.656 ± 1.177
ProMet: 0.0 ± 0.0
ProAsn: 0.0 ± 0.0
ProPro: 1.656 ± 1.034
ProGln: 0.0 ± 0.0
ProArg: 1.656 ± 1.034
ProSer: 11.589 ± 1.605
ProThr: 6.623 ± 0.286
ProVal: 0.0 ± 0.0
ProTrp: 0.0 ± 0.0
ProTyr: 1.656 ± 1.177
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 3.311 ± 2.354
GlnCys: 0.0 ± 0.0
GlnAsp: 1.656 ± 1.177
GlnGlu: 1.656 ± 1.034
GlnPhe: 0.0 ± 0.0
GlnGly: 1.656 ± 1.034
GlnHis: 0.0 ± 0.0
GlnIle: 0.0 ± 0.0
GlnLys: 1.656 ± 1.177
GlnLeu: 1.656 ± 1.034
GlnMet: 1.656 ± 1.177
GlnAsn: 1.656 ± 1.177
GlnPro: 1.656 ± 1.177
GlnGln: 0.0 ± 0.0
GlnArg: 0.0 ± 0.0
GlnSer: 3.311 ± 2.354
GlnThr: 1.656 ± 1.034
GlnVal: 0.0 ± 0.0
GlnTrp: 0.0 ± 0.0
GlnTyr: 3.311 ± 2.354
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 4.967 ± 0.892
ArgCys: 1.656 ± 1.034
ArgAsp: 3.311 ± 0.143
ArgGlu: 1.656 ± 1.034
ArgPhe: 4.967 ± 3.531
ArgGly: 3.311 ± 0.143
ArgHis: 1.656 ± 1.034
ArgIle: 4.967 ± 1.32
ArgLys: 0.0 ± 0.0
ArgLeu: 1.656 ± 1.177
ArgMet: 1.656 ± 1.034
ArgAsn: 1.656 ± 1.177
ArgPro: 3.311 ± 0.143
ArgGln: 4.967 ± 1.32
ArgArg: 11.589 ± 6.028
ArgSer: 8.278 ± 3.674
ArgThr: 0.0 ± 0.0
ArgVal: 3.311 ± 2.354
ArgTrp: 1.656 ± 1.034
ArgTyr: 3.311 ± 0.143
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 3.311 ± 2.069
SerCys: 4.967 ± 0.892
SerAsp: 6.623 ± 0.286
SerGlu: 4.967 ± 0.892
SerPhe: 8.278 ± 3.674
SerGly: 8.278 ± 0.749
SerHis: 0.0 ± 0.0
SerIle: 1.656 ± 1.034
SerLys: 1.656 ± 1.034
SerLeu: 1.656 ± 1.034
SerMet: 1.656 ± 0.759
SerAsn: 4.967 ± 3.103
SerPro: 4.967 ± 0.892
SerGln: 3.311 ± 2.354
SerArg: 3.311 ± 2.354
SerSer: 9.934 ± 1.783
SerThr: 3.311 ± 0.143
SerVal: 4.967 ± 1.32
SerTrp: 0.0 ± 0.0
SerTyr: 1.656 ± 1.034
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 0.0 ± 0.0
ThrCys: 4.967 ± 3.103
ThrAsp: 1.656 ± 1.034
ThrGlu: 3.311 ± 0.143
ThrPhe: 1.656 ± 1.177
ThrGly: 8.278 ± 3.674
ThrHis: 0.0 ± 0.0
ThrIle: 0.0 ± 0.0
ThrLys: 6.623 ± 0.286
ThrLeu: 4.967 ± 0.892
ThrMet: 0.0 ± 0.0
ThrAsn: 4.967 ± 1.32
ThrPro: 3.311 ± 0.143
ThrGln: 0.0 ± 0.0
ThrArg: 0.0 ± 0.0
ThrSer: 4.967 ± 3.103
ThrThr: 3.311 ± 0.143
ThrVal: 6.623 ± 2.497
ThrTrp: 0.0 ± 0.0
ThrTyr: 4.967 ± 1.32
ThrXaa: 0.0 ± 0.0

Val
ValAla: 4.967 ± 0.892
ValCys: 1.656 ± 1.034
ValAsp: 1.656 ± 1.177
ValGlu: 1.656 ± 1.177
ValPhe: 4.967 ± 0.892
ValGly: 0.0 ± 0.0
ValHis: 0.0 ± 0.0
ValIle: 1.656 ± 1.034
ValLys: 3.311 ± 2.354
ValLeu: 4.967 ± 0.892
ValMet: 0.0 ± 0.0
ValAsn: 1.656 ± 1.177
ValPro: 3.311 ± 0.143
ValGln: 1.656 ± 1.177
ValArg: 4.967 ± 1.32
ValSer: 1.656 ± 1.034
ValThr: 11.589 ± 3.817
ValVal: 1.656 ± 1.034
ValTrp: 1.656 ± 1.034
ValTyr: 9.934 ± 0.428
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.0 ± 0.0
TrpCys: 1.656 ± 1.034
TrpAsp: 4.967 ± 1.32
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.0 ± 0.0
TrpGly: 3.311 ± 0.143
TrpHis: 0.0 ± 0.0
TrpIle: 3.311 ± 2.069
TrpLys: 0.0 ± 0.0
TrpLeu: 3.311 ± 2.069
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 3.311 ± 2.354
TrpSer: 0.0 ± 0.0
TrpThr: 0.0 ± 0.0
TrpVal: 0.0 ± 0.0
TrpTrp: 3.311 ± 0.143
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 3.311 ± 2.069
TyrCys: 1.656 ± 1.034
TyrAsp: 3.311 ± 2.069
TyrGlu: 0.0 ± 0.0
TyrPhe: 3.311 ± 2.354
TyrGly: 0.0 ± 0.0
TyrHis: 1.656 ± 1.177
TyrIle: 4.967 ± 1.32
TyrLys: 9.934 ± 0.428
TyrLeu: 3.311 ± 2.069
TyrMet: 1.656 ± 1.034
TyrAsn: 3.311 ± 0.143
TyrPro: 6.623 ± 2.497
TyrGln: 0.0 ± 0.0
TyrArg: 3.311 ± 2.354
TyrSer: 3.311 ± 0.143
TyrThr: 1.656 ± 1.177
TyrVal: 4.967 ± 1.32
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.656 ± 1.034
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 2 proteins (605 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
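One way to read "bootstrapping at the protein level" is that whole proteins are resampled with replacement and the statistic is recomputed for each resample. The Python sketch below illustrates that idea under several assumptions that this page does not confirm: the error is taken as the sample standard deviation over the replicates, pairs are counted within each protein, and the function names and the fixed random seed are hypothetical.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Frequency of one dipeptide (per mille) over all overlapping pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Spread of the dipeptide frequency across protein-level resamples.

    Assumption: each replicate draws len(proteins) proteins with
    replacement; the error is the standard deviation of the estimates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)
```

With only two proteins, as here, each replicate contains one of just a few possible combinations of proteins, so error estimates obtained this way are necessarily coarse.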

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski