Amino acid dipepetide frequency for Grasshopper associated circular virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.252AlaAla: 3.252 ± 2.776
1.626AlaCys: 1.626 ± 0.941
3.252AlaAsp: 3.252 ± 1.882
4.878AlaGlu: 4.878 ± 2.823
1.626AlaPhe: 1.626 ± 1.388
8.13AlaGly: 8.13 ± 2.376
0.0AlaHis: 0.0 ± 0.0
3.252AlaIle: 3.252 ± 1.882
4.878AlaLys: 4.878 ± 4.164
3.252AlaLeu: 3.252 ± 0.447
0.0AlaMet: 0.0 ± 0.916
6.504AlaAsn: 6.504 ± 3.764
3.252AlaPro: 3.252 ± 0.447
1.626AlaGln: 1.626 ± 1.388
8.13AlaArg: 8.13 ± 2.282
8.13AlaSer: 8.13 ± 2.282
4.878AlaThr: 4.878 ± 0.494
3.252AlaVal: 3.252 ± 1.882
1.626AlaTrp: 1.626 ± 1.388
1.626AlaTyr: 1.626 ± 1.388
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
3.252CysAsp: 3.252 ± 1.882
0.0CysGlu: 0.0 ± 0.0
3.252CysPhe: 3.252 ± 0.447
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.0CysLys: 0.0 ± 0.0
1.626CysLeu: 1.626 ± 0.941
0.0CysMet: 0.0 ± 0.0
3.252CysAsn: 3.252 ± 1.882
3.252CysPro: 3.252 ± 0.447
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
0.0CysSer: 0.0 ± 0.0
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
1.626CysTyr: 1.626 ± 1.388
0.0CysXaa: 0.0 ± 0.0
Asp
3.252AspAla: 3.252 ± 1.882
1.626AspCys: 1.626 ± 0.941
3.252AspAsp: 3.252 ± 1.882
0.0AspGlu: 0.0 ± 0.0
4.878AspPhe: 4.878 ± 2.823
0.0AspGly: 0.0 ± 0.0
0.0AspHis: 0.0 ± 0.0
3.252AspIle: 3.252 ± 1.882
6.504AspLys: 6.504 ± 0.894
0.0AspLeu: 0.0 ± 0.0
0.0AspMet: 0.0 ± 0.0
4.878AspAsn: 4.878 ± 1.835
4.878AspPro: 4.878 ± 0.494
3.252AspGln: 3.252 ± 0.447
1.626AspArg: 1.626 ± 0.941
3.252AspSer: 3.252 ± 0.447
3.252AspThr: 3.252 ± 2.776
0.0AspVal: 0.0 ± 0.0
6.504AspTrp: 6.504 ± 1.435
1.626AspTyr: 1.626 ± 0.941
0.0AspXaa: 0.0 ± 0.0
Glu
1.626GluAla: 1.626 ± 0.941
0.0GluCys: 0.0 ± 0.0
0.0GluAsp: 0.0 ± 0.0
1.626GluGlu: 1.626 ± 0.941
4.878GluPhe: 4.878 ± 2.823
1.626GluGly: 1.626 ± 1.388
0.0GluHis: 0.0 ± 0.0
4.878GluIle: 4.878 ± 0.494
3.252GluLys: 3.252 ± 0.447
3.252GluLeu: 3.252 ± 1.882
1.626GluMet: 1.626 ± 0.941
1.626GluAsn: 1.626 ± 0.941
3.252GluPro: 3.252 ± 1.882
1.626GluGln: 1.626 ± 0.941
3.252GluArg: 3.252 ± 1.882
1.626GluSer: 1.626 ± 0.941
1.626GluThr: 1.626 ± 0.941
0.0GluVal: 0.0 ± 0.0
1.626GluTrp: 1.626 ± 0.941
1.626GluTyr: 1.626 ± 0.941
0.0GluXaa: 0.0 ± 0.0
Phe
4.878PheAla: 4.878 ± 1.835
0.0PheCys: 0.0 ± 0.0
4.878PheAsp: 4.878 ± 0.494
4.878PheGlu: 4.878 ± 2.823
4.878PhePhe: 4.878 ± 2.823
8.13PheGly: 8.13 ± 2.376
1.626PheHis: 1.626 ± 0.941
3.252PheIle: 3.252 ± 1.882
1.626PheLys: 1.626 ± 0.941
0.0PheLeu: 0.0 ± 0.0
0.0PheMet: 0.0 ± 0.0
1.626PheAsn: 1.626 ± 0.941
1.626PhePro: 1.626 ± 0.941
0.0PheGln: 0.0 ± 0.0
3.252PheArg: 3.252 ± 0.447
8.13PheSer: 8.13 ± 0.047
1.626PheThr: 1.626 ± 1.388
4.878PheVal: 4.878 ± 2.823
0.0PheTrp: 0.0 ± 0.0
3.252PheTyr: 3.252 ± 0.447
0.0PheXaa: 0.0 ± 0.0
Gly
3.252GlyAla: 3.252 ± 1.882
1.626GlyCys: 1.626 ± 0.941
3.252GlyAsp: 3.252 ± 1.882
0.0GlyGlu: 0.0 ± 0.0
3.252GlyPhe: 3.252 ± 1.882
16.26GlyGly: 16.26 ± 0.095
3.252GlyHis: 3.252 ± 1.882
0.0GlyIle: 0.0 ± 0.0
8.13GlyLys: 8.13 ± 2.376
6.504GlyLeu: 6.504 ± 0.894
1.626GlyMet: 1.626 ± 1.388
3.252GlyAsn: 3.252 ± 0.447
6.504GlyPro: 6.504 ± 1.435
3.252GlyGln: 3.252 ± 0.447
4.878GlyArg: 4.878 ± 2.823
4.878GlySer: 4.878 ± 0.494
9.756GlyThr: 9.756 ± 1.341
4.878GlyVal: 4.878 ± 4.164
1.626GlyTrp: 1.626 ± 1.388
0.0GlyTyr: 0.0 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
3.252HisAla: 3.252 ± 0.447
1.626HisCys: 1.626 ± 0.941
0.0HisAsp: 0.0 ± 0.0
0.0HisGlu: 0.0 ± 0.0
3.252HisPhe: 3.252 ± 1.882
0.0HisGly: 0.0 ± 0.0
0.0HisHis: 0.0 ± 0.0
0.0HisIle: 0.0 ± 0.0
3.252HisLys: 3.252 ± 1.882
0.0HisLeu: 0.0 ± 0.0
1.626HisMet: 1.626 ± 0.941
0.0HisAsn: 0.0 ± 0.0
1.626HisPro: 1.626 ± 0.941
0.0HisGln: 0.0 ± 0.0
3.252HisArg: 3.252 ± 0.447
1.626HisSer: 1.626 ± 0.941
0.0HisThr: 0.0 ± 0.0
0.0HisVal: 0.0 ± 0.0
0.0HisTrp: 0.0 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
0.0IleAla: 0.0 ± 0.0
1.626IleCys: 1.626 ± 1.388
1.626IleAsp: 1.626 ± 0.941
0.0IleGlu: 0.0 ± 0.0
3.252IlePhe: 3.252 ± 1.882
1.626IleGly: 1.626 ± 0.941
0.0IleHis: 0.0 ± 0.0
8.13IleIle: 8.13 ± 0.047
1.626IleLys: 1.626 ± 0.941
3.252IleLeu: 3.252 ± 0.447
0.0IleMet: 0.0 ± 0.0
1.626IleAsn: 1.626 ± 0.941
0.0IlePro: 0.0 ± 0.0
4.878IleGln: 4.878 ± 4.164
1.626IleArg: 1.626 ± 0.941
4.878IleSer: 4.878 ± 0.494
1.626IleThr: 1.626 ± 0.941
0.0IleVal: 0.0 ± 0.0
3.252IleTrp: 3.252 ± 1.882
0.0IleTyr: 0.0 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
3.252LysAla: 3.252 ± 0.447
0.0LysCys: 0.0 ± 0.0
3.252LysAsp: 3.252 ± 0.447
1.626LysGlu: 1.626 ± 1.388
1.626LysPhe: 1.626 ± 0.941
1.626LysGly: 1.626 ± 1.388
3.252LysHis: 3.252 ± 1.882
1.626LysIle: 1.626 ± 1.388
1.626LysLys: 1.626 ± 1.388
8.13LysLeu: 8.13 ± 2.376
1.626LysMet: 1.626 ± 1.388
0.0LysAsn: 0.0 ± 0.0
6.504LysPro: 6.504 ± 3.764
4.878LysGln: 4.878 ± 1.835
8.13LysArg: 8.13 ± 2.282
0.0LysSer: 0.0 ± 0.0
3.252LysThr: 3.252 ± 0.447
0.0LysVal: 0.0 ± 0.0
1.626LysTrp: 1.626 ± 0.941
4.878LysTyr: 4.878 ± 0.494
0.0LysXaa: 0.0 ± 0.0
Leu
6.504LeuAla: 6.504 ± 3.764
1.626LeuCys: 1.626 ± 0.941
4.878LeuAsp: 4.878 ± 0.494
1.626LeuGlu: 1.626 ± 0.941
14.634LeuPhe: 14.634 ± 3.175
9.756LeuGly: 9.756 ± 3.317
1.626LeuHis: 1.626 ± 0.941
3.252LeuIle: 3.252 ± 1.882
3.252LeuLys: 3.252 ± 0.447
3.252LeuLeu: 3.252 ± 1.882
0.0LeuMet: 0.0 ± 0.0
3.252LeuAsn: 3.252 ± 0.447
1.626LeuPro: 1.626 ± 0.941
1.626LeuGln: 1.626 ± 1.388
1.626LeuArg: 1.626 ± 0.941
0.0LeuSer: 0.0 ± 0.0
3.252LeuThr: 3.252 ± 0.447
3.252LeuVal: 3.252 ± 0.447
3.252LeuTrp: 3.252 ± 0.447
1.626LeuTyr: 1.626 ± 0.941
0.0LeuXaa: 0.0 ± 0.0
Met
1.626MetAla: 1.626 ± 1.388
0.0MetCys: 0.0 ± 0.0
0.0MetAsp: 0.0 ± 0.0
3.252MetGlu: 3.252 ± 1.882
0.0MetPhe: 0.0 ± 0.0
0.0MetGly: 0.0 ± 0.0
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
0.0MetLys: 0.0 ± 0.0
1.626MetLeu: 1.626 ± 0.941
0.0MetMet: 0.0 ± 0.0
1.626MetAsn: 1.626 ± 1.388
0.0MetPro: 0.0 ± 0.0
1.626MetGln: 1.626 ± 1.388
3.252MetArg: 3.252 ± 2.776
1.626MetSer: 1.626 ± 0.941
3.252MetThr: 3.252 ± 2.776
1.626MetVal: 1.626 ± 0.941
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.252AsnAla: 3.252 ± 0.447
1.626AsnCys: 1.626 ± 0.941
3.252AsnAsp: 3.252 ± 0.447
0.0AsnGlu: 0.0 ± 0.0
3.252AsnPhe: 3.252 ± 1.882
3.252AsnGly: 3.252 ± 0.447
1.626AsnHis: 1.626 ± 1.388
1.626AsnIle: 1.626 ± 0.941
0.0AsnLys: 0.0 ± 0.0
1.626AsnLeu: 1.626 ± 1.388
1.626AsnMet: 1.626 ± 0.941
1.626AsnAsn: 1.626 ± 0.941
1.626AsnPro: 1.626 ± 0.941
0.0AsnGln: 0.0 ± 0.0
0.0AsnArg: 0.0 ± 0.0
1.626AsnSer: 1.626 ± 0.941
6.504AsnThr: 6.504 ± 0.894
6.504AsnVal: 6.504 ± 0.894
1.626AsnTrp: 1.626 ± 1.388
6.504AsnTyr: 6.504 ± 3.223
0.0AsnXaa: 0.0 ± 0.0
Pro
1.626ProAla: 1.626 ± 0.941
0.0ProCys: 0.0 ± 0.0
3.252ProAsp: 3.252 ± 1.882
1.626ProGlu: 1.626 ± 1.388
1.626ProPhe: 1.626 ± 0.941
4.878ProGly: 4.878 ± 2.823
3.252ProHis: 3.252 ± 1.882
1.626ProIle: 1.626 ± 0.941
3.252ProLys: 3.252 ± 1.882
4.878ProLeu: 4.878 ± 1.835
0.0ProMet: 0.0 ± 0.0
1.626ProAsn: 1.626 ± 0.941
1.626ProPro: 1.626 ± 0.941
1.626ProGln: 1.626 ± 0.941
3.252ProArg: 3.252 ± 1.882
9.756ProSer: 9.756 ± 0.988
6.504ProThr: 6.504 ± 3.223
1.626ProVal: 1.626 ± 0.941
1.626ProTrp: 1.626 ± 1.388
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
3.252GlnAla: 3.252 ± 2.776
1.626GlnCys: 1.626 ± 0.941
1.626GlnAsp: 1.626 ± 1.388
1.626GlnGlu: 1.626 ± 1.388
3.252GlnPhe: 3.252 ± 0.447
0.0GlnGly: 0.0 ± 0.0
0.0GlnHis: 0.0 ± 0.0
1.626GlnIle: 1.626 ± 1.388
0.0GlnLys: 0.0 ± 0.0
6.504GlnLeu: 6.504 ± 1.435
0.0GlnMet: 0.0 ± 0.0
1.626GlnAsn: 1.626 ± 1.388
1.626GlnPro: 1.626 ± 1.388
0.0GlnGln: 0.0 ± 0.0
0.0GlnArg: 0.0 ± 0.0
1.626GlnSer: 1.626 ± 1.388
3.252GlnThr: 3.252 ± 0.447
0.0GlnVal: 0.0 ± 0.0
3.252GlnTrp: 3.252 ± 0.447
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
4.878ArgAla: 4.878 ± 2.823
0.0ArgCys: 0.0 ± 0.0
0.0ArgAsp: 0.0 ± 0.0
3.252ArgGlu: 3.252 ± 1.882
1.626ArgPhe: 1.626 ± 0.941
6.504ArgGly: 6.504 ± 0.894
0.0ArgHis: 0.0 ± 0.0
1.626ArgIle: 1.626 ± 1.388
6.504ArgLys: 6.504 ± 3.223
6.504ArgLeu: 6.504 ± 3.764
1.626ArgMet: 1.626 ± 1.388
4.878ArgAsn: 4.878 ± 1.835
4.878ArgPro: 4.878 ± 0.494
0.0ArgGln: 0.0 ± 0.0
16.26ArgArg: 16.26 ± 9.221
8.13ArgSer: 8.13 ± 4.611
4.878ArgThr: 4.878 ± 1.835
3.252ArgVal: 3.252 ± 2.776
1.626ArgTrp: 1.626 ± 0.941
6.504ArgTyr: 6.504 ± 0.894
0.0ArgXaa: 0.0 ± 0.0
Ser
8.13SerAla: 8.13 ± 0.047
0.0SerCys: 0.0 ± 0.0
4.878SerAsp: 4.878 ± 1.835
6.504SerGlu: 6.504 ± 3.764
0.0SerPhe: 0.0 ± 0.0
9.756SerGly: 9.756 ± 3.67
1.626SerHis: 1.626 ± 0.941
0.0SerIle: 0.0 ± 0.0
1.626SerLys: 1.626 ± 1.388
8.13SerLeu: 8.13 ± 2.376
0.0SerMet: 0.0 ± 0.0
1.626SerAsn: 1.626 ± 1.388
4.878SerPro: 4.878 ± 0.494
0.0SerGln: 0.0 ± 0.0
8.13SerArg: 8.13 ± 4.611
0.0SerSer: 0.0 ± 0.0
4.878SerThr: 4.878 ± 4.164
3.252SerVal: 3.252 ± 0.447
0.0SerTrp: 0.0 ± 0.0
6.504SerTyr: 6.504 ± 0.894
0.0SerXaa: 0.0 ± 0.0
Thr
8.13ThrAla: 8.13 ± 4.611
1.626ThrCys: 1.626 ± 1.388
6.504ThrAsp: 6.504 ± 1.435
4.878ThrGlu: 4.878 ± 2.823
0.0ThrPhe: 0.0 ± 0.0
8.13ThrGly: 8.13 ± 2.282
3.252ThrHis: 3.252 ± 0.447
1.626ThrIle: 1.626 ± 1.388
1.626ThrLys: 1.626 ± 1.388
1.626ThrLeu: 1.626 ± 1.388
3.252ThrMet: 3.252 ± 1.095
4.878ThrAsn: 4.878 ± 4.164
3.252ThrPro: 3.252 ± 0.447
3.252ThrGln: 3.252 ± 0.447
6.504ThrArg: 6.504 ± 3.223
4.878ThrSer: 4.878 ± 4.164
4.878ThrThr: 4.878 ± 0.494
3.252ThrVal: 3.252 ± 0.447
0.0ThrTrp: 0.0 ± 0.0
1.626ThrTyr: 1.626 ± 0.941
0.0ThrXaa: 0.0 ± 0.0
Val
3.252ValAla: 3.252 ± 2.776
0.0ValCys: 0.0 ± 0.0
1.626ValAsp: 1.626 ± 1.388
1.626ValGlu: 1.626 ± 0.941
0.0ValPhe: 0.0 ± 0.0
4.878ValGly: 4.878 ± 2.823
0.0ValHis: 0.0 ± 0.0
1.626ValIle: 1.626 ± 1.388
3.252ValLys: 3.252 ± 0.447
4.878ValLeu: 4.878 ± 0.494
1.626ValMet: 1.626 ± 1.388
1.626ValAsn: 1.626 ± 0.941
0.0ValPro: 0.0 ± 0.0
1.626ValGln: 1.626 ± 0.941
1.626ValArg: 1.626 ± 0.941
1.626ValSer: 1.626 ± 1.388
6.504ValThr: 6.504 ± 0.894
1.626ValVal: 1.626 ± 0.941
1.626ValTrp: 1.626 ± 0.941
1.626ValTyr: 1.626 ± 0.941
0.0ValXaa: 0.0 ± 0.0
Trp
4.878TrpAla: 4.878 ± 0.494
1.626TrpCys: 1.626 ± 1.388
1.626TrpAsp: 1.626 ± 0.941
1.626TrpGlu: 1.626 ± 0.941
0.0TrpPhe: 0.0 ± 0.0
1.626TrpGly: 1.626 ± 0.941
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
4.878TrpLeu: 4.878 ± 0.494
1.626TrpMet: 1.626 ± 0.941
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
1.626TrpGln: 1.626 ± 1.388
4.878TrpArg: 4.878 ± 1.835
1.626TrpSer: 1.626 ± 0.941
1.626TrpThr: 1.626 ± 1.388
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
1.626TrpTyr: 1.626 ± 0.941
0.0TrpXaa: 0.0 ± 0.0
Tyr
6.504TyrAla: 6.504 ± 1.435
0.0TyrCys: 0.0 ± 0.0
1.626TyrAsp: 1.626 ± 1.388
0.0TyrGlu: 0.0 ± 0.0
3.252TyrPhe: 3.252 ± 0.447
0.0TyrGly: 0.0 ± 0.0
0.0TyrHis: 0.0 ± 0.0
1.626TyrIle: 1.626 ± 0.941
6.504TyrLys: 6.504 ± 3.764
0.0TyrLeu: 0.0 ± 0.0
1.626TyrMet: 1.626 ± 1.388
1.626TyrAsn: 1.626 ± 1.388
3.252TyrPro: 3.252 ± 0.447
0.0TyrGln: 0.0 ± 0.0
3.252TyrArg: 3.252 ± 0.447
6.504TyrSer: 6.504 ± 3.223
1.626TyrThr: 1.626 ± 1.388
3.252TyrVal: 3.252 ± 1.882
0.0TyrTrp: 0.0 ± 0.0
1.626TyrTyr: 1.626 ± 1.388
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (616 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski