Amino acid dipeptide frequency for Dragonfly larvae associated circular virus-4

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each one would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
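The counting and scaling described above can be sketched as follows. This is a minimal illustration, not the database's actual pipeline: it assumes that dipeptides are counted as overlapping pairs within each protein (never across protein boundaries), that any non-standard letter is mapped to Xaa, and that "expected" means the uniform 1/400 value. The names `dipeptide_permille`, `colour` and `to_fraction` are hypothetical helpers introduced only for this example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # one-letter codes of the 20 standard amino acids
ALPHABET = STANDARD + "X"           # X stands in for Xaa (non-standard or unknown)
EXPECTED_PERMILLE = 1000.0 / 400    # 2.5 per mille, the uniform expectation


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Assumption: overlapping pairs are counted inside each protein only.
    """
    counts = Counter()
    for seq in proteins:
        # Map every residue outside the standard alphabet to X (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def colour(permille):
    """Mimic the table colouring: red above the expectation, blue below."""
    if permille > EXPECTED_PERMILLE:
        return "red"
    if permille < EXPECTED_PERMILLE:
        return "blue"
    return "none"


def to_fraction(permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return permille * 1e-3


if __name__ == "__main__":
    # Toy sequences, not the viral proteins behind the table.
    freqs = dipeptide_permille(["AAAC", "ACB"])
    for pair in ("AA", "AC", "CX", "GG"):
        print(pair, freqs[pair], colour(freqs[pair]))
```

With only a few hundred residues, most of the 441 cells are necessarily either zero or a small multiple of one observation, which is why so many values in the table repeat.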

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
7.418AlaAla: 7.418 ± 4.808
0.0AlaCys: 0.0 ± 0.0
4.451AlaAsp: 4.451 ± 0.684
0.0AlaGlu: 0.0 ± 0.0
4.451AlaPhe: 4.451 ± 0.684
2.967AlaGly: 2.967 ± 0.278
1.484AlaHis: 1.484 ± 1.239
4.451AlaIle: 4.451 ± 3.717
4.451AlaLys: 4.451 ± 2.885
2.967AlaLeu: 2.967 ± 1.923
0.0AlaMet: 0.0 ± 0.699
4.451AlaAsn: 4.451 ± 2.885
4.451AlaPro: 4.451 ± 2.885
2.967AlaGln: 2.967 ± 2.478
4.451AlaArg: 4.451 ± 1.517
2.967AlaSer: 2.967 ± 1.923
4.451AlaThr: 4.451 ± 2.885
2.967AlaVal: 2.967 ± 0.278
0.0AlaTrp: 0.0 ± 0.0
1.484AlaTyr: 1.484 ± 1.239
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
1.484CysGlu: 1.484 ± 1.239
2.967CysPhe: 2.967 ± 0.278
1.484CysGly: 1.484 ± 1.239
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
2.967CysLys: 2.967 ± 2.478
1.484CysLeu: 1.484 ± 0.962
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
1.484CysArg: 1.484 ± 0.962
0.0CysSer: 0.0 ± 0.0
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
0.0AspAla: 0.0 ± 0.0
0.0AspCys: 0.0 ± 0.0
1.484AspAsp: 1.484 ± 1.239
2.967AspGlu: 2.967 ± 0.278
2.967AspPhe: 2.967 ± 0.278
1.484AspGly: 1.484 ± 0.962
0.0AspHis: 0.0 ± 0.0
4.451AspIle: 4.451 ± 0.684
1.484AspLys: 1.484 ± 1.239
8.902AspLeu: 8.902 ± 3.569
0.0AspMet: 0.0 ± 0.0
1.484AspAsn: 1.484 ± 1.239
10.386AspPro: 10.386 ± 0.129
1.484AspGln: 1.484 ± 0.962
1.484AspArg: 1.484 ± 1.239
1.484AspSer: 1.484 ± 0.962
0.0AspThr: 0.0 ± 0.0
2.967AspVal: 2.967 ± 0.278
0.0AspTrp: 0.0 ± 0.0
2.967AspTyr: 2.967 ± 2.478
0.0AspXaa: 0.0 ± 0.0
Glu
0.0GluAla: 0.0 ± 0.0
2.967GluCys: 2.967 ± 0.278
0.0GluAsp: 0.0 ± 0.0
8.902GluGlu: 8.902 ± 5.234
2.967GluPhe: 2.967 ± 2.478
7.418GluGly: 7.418 ± 0.407
1.484GluHis: 1.484 ± 1.239
8.902GluIle: 8.902 ± 3.033
4.451GluLys: 4.451 ± 1.517
0.0GluLeu: 0.0 ± 0.0
0.0GluMet: 0.0 ± 0.0
2.967GluAsn: 2.967 ± 0.278
2.967GluPro: 2.967 ± 2.478
0.0GluGln: 0.0 ± 0.0
2.967GluArg: 2.967 ± 2.478
1.484GluSer: 1.484 ± 0.962
0.0GluThr: 0.0 ± 0.0
1.484GluVal: 1.484 ± 1.239
4.451GluTrp: 4.451 ± 3.717
0.0GluTyr: 0.0 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
2.967PheAla: 2.967 ± 0.278
0.0PheCys: 0.0 ± 0.0
1.484PheAsp: 1.484 ± 0.962
2.967PheGlu: 2.967 ± 0.278
0.0PhePhe: 0.0 ± 0.0
0.0PheGly: 0.0 ± 0.0
0.0PheHis: 0.0 ± 0.0
1.484PheIle: 1.484 ± 0.962
4.451PheLys: 4.451 ± 3.717
5.935PheLeu: 5.935 ± 2.756
1.484PheMet: 1.484 ± 0.962
5.935PheAsn: 5.935 ± 0.555
2.967PhePro: 2.967 ± 2.478
2.967PheGln: 2.967 ± 0.278
4.451PheArg: 4.451 ± 0.684
1.484PheSer: 1.484 ± 0.962
4.451PheThr: 4.451 ± 3.717
5.935PheVal: 5.935 ± 1.646
0.0PheTrp: 0.0 ± 0.0
1.484PheTyr: 1.484 ± 1.239
0.0PheXaa: 0.0 ± 0.0
Gly
5.935GlyAla: 5.935 ± 1.646
0.0GlyCys: 0.0 ± 0.0
2.967GlyAsp: 2.967 ± 1.923
4.451GlyGlu: 4.451 ± 3.717
7.418GlyPhe: 7.418 ± 1.794
7.418GlyGly: 7.418 ± 4.808
0.0GlyHis: 0.0 ± 0.0
1.484GlyIle: 1.484 ± 1.239
5.935GlyLys: 5.935 ± 2.756
2.967GlyLeu: 2.967 ± 0.278
1.484GlyMet: 1.484 ± 0.962
4.451GlyAsn: 4.451 ± 2.885
4.451GlyPro: 4.451 ± 1.517
5.935GlyGln: 5.935 ± 0.555
2.967GlyArg: 2.967 ± 1.923
2.967GlySer: 2.967 ± 1.923
10.386GlyThr: 10.386 ± 0.129
5.935GlyVal: 5.935 ± 3.846
2.967GlyTrp: 2.967 ± 0.278
0.0GlyTyr: 0.0 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
1.484HisAla: 1.484 ± 0.962
0.0HisCys: 0.0 ± 0.0
1.484HisAsp: 1.484 ± 0.962
0.0HisGlu: 0.0 ± 0.0
1.484HisPhe: 1.484 ± 1.239
0.0HisGly: 0.0 ± 0.0
0.0HisHis: 0.0 ± 0.0
0.0HisIle: 0.0 ± 0.0
0.0HisLys: 0.0 ± 0.0
1.484HisLeu: 1.484 ± 1.239
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
0.0HisPro: 0.0 ± 0.0
0.0HisGln: 0.0 ± 0.0
0.0HisArg: 0.0 ± 0.0
1.484HisSer: 1.484 ± 1.239
0.0HisThr: 0.0 ± 0.0
1.484HisVal: 1.484 ± 1.239
0.0HisTrp: 0.0 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
4.451IleAla: 4.451 ± 2.885
0.0IleCys: 0.0 ± 0.0
2.967IleAsp: 2.967 ± 0.278
2.967IleGlu: 2.967 ± 2.478
2.967IlePhe: 2.967 ± 0.278
4.451IleGly: 4.451 ± 2.885
1.484IleHis: 1.484 ± 1.239
4.451IleIle: 4.451 ± 1.517
2.967IleLys: 2.967 ± 1.923
4.451IleLeu: 4.451 ± 3.717
1.484IleMet: 1.484 ± 0.962
5.935IleAsn: 5.935 ± 0.555
1.484IlePro: 1.484 ± 0.962
1.484IleGln: 1.484 ± 1.239
4.451IleArg: 4.451 ± 0.684
7.418IleSer: 7.418 ± 1.794
5.935IleThr: 5.935 ± 3.846
2.967IleVal: 2.967 ± 2.478
0.0IleTrp: 0.0 ± 0.0
4.451IleTyr: 4.451 ± 0.684
0.0IleXaa: 0.0 ± 0.0
Lys
1.484LysAla: 1.484 ± 1.239
0.0LysCys: 0.0 ± 0.0
5.935LysAsp: 5.935 ± 2.756
4.451LysGlu: 4.451 ± 3.717
2.967LysPhe: 2.967 ± 0.278
1.484LysGly: 1.484 ± 1.239
0.0LysHis: 0.0 ± 0.0
7.418LysIle: 7.418 ± 0.407
10.386LysLys: 10.386 ± 2.33
5.935LysLeu: 5.935 ± 4.957
0.0LysMet: 0.0 ± 0.0
0.0LysAsn: 0.0 ± 0.0
1.484LysPro: 1.484 ± 0.962
1.484LysGln: 1.484 ± 1.239
7.418LysArg: 7.418 ± 0.407
1.484LysSer: 1.484 ± 1.239
1.484LysThr: 1.484 ± 0.962
7.418LysVal: 7.418 ± 2.607
1.484LysTrp: 1.484 ± 1.239
7.418LysTyr: 7.418 ± 4.808
0.0LysXaa: 0.0 ± 0.0
Leu
2.967LeuAla: 2.967 ± 2.478
1.484LeuCys: 1.484 ± 1.239
5.935LeuAsp: 5.935 ± 2.756
4.451LeuGlu: 4.451 ± 1.517
1.484LeuPhe: 1.484 ± 0.962
2.967LeuGly: 2.967 ± 1.923
0.0LeuHis: 0.0 ± 0.0
1.484LeuIle: 1.484 ± 0.962
4.451LeuLys: 4.451 ± 3.717
4.451LeuLeu: 4.451 ± 1.517
0.0LeuMet: 0.0 ± 0.0
8.902LeuAsn: 8.902 ± 3.569
1.484LeuPro: 1.484 ± 0.962
4.451LeuGln: 4.451 ± 1.517
4.451LeuArg: 4.451 ± 3.717
4.451LeuSer: 4.451 ± 0.684
5.935LeuThr: 5.935 ± 1.646
1.484LeuVal: 1.484 ± 1.239
0.0LeuTrp: 0.0 ± 0.0
2.967LeuTyr: 2.967 ± 1.923
0.0LeuXaa: 0.0 ± 0.0
Met
0.0MetAla: 0.0 ± 0.0
1.484MetCys: 1.484 ± 0.962
0.0MetAsp: 0.0 ± 0.0
0.0MetGlu: 0.0 ± 0.0
0.0MetPhe: 0.0 ± 0.0
1.484MetGly: 1.484 ± 1.239
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
1.484MetLys: 1.484 ± 0.962
0.0MetLeu: 0.0 ± 0.0
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
1.484MetPro: 1.484 ± 0.962
2.967MetGln: 2.967 ± 1.923
1.484MetArg: 1.484 ± 0.962
0.0MetSer: 0.0 ± 0.0
1.484MetThr: 1.484 ± 0.962
1.484MetVal: 1.484 ± 1.239
0.0MetTrp: 0.0 ± 0.0
2.967MetTyr: 2.967 ± 1.923
0.0MetXaa: 0.0 ± 0.0
Asn
7.418AsnAla: 7.418 ± 2.607
0.0AsnCys: 0.0 ± 0.0
2.967AsnAsp: 2.967 ± 0.278
4.451AsnGlu: 4.451 ± 1.517
1.484AsnPhe: 1.484 ± 1.239
1.484AsnGly: 1.484 ± 1.239
0.0AsnHis: 0.0 ± 0.0
5.935AsnIle: 5.935 ± 1.646
2.967AsnLys: 2.967 ± 1.923
1.484AsnLeu: 1.484 ± 0.962
1.484AsnMet: 1.484 ± 0.962
2.967AsnAsn: 2.967 ± 0.278
2.967AsnPro: 2.967 ± 1.923
1.484AsnGln: 1.484 ± 0.962
2.967AsnArg: 2.967 ± 1.923
5.935AsnSer: 5.935 ± 0.555
4.451AsnThr: 4.451 ± 2.885
1.484AsnVal: 1.484 ± 0.962
0.0AsnTrp: 0.0 ± 0.0
5.935AsnTyr: 5.935 ± 2.756
0.0AsnXaa: 0.0 ± 0.0
Pro
7.418ProAla: 7.418 ± 0.407
0.0ProCys: 0.0 ± 0.0
2.967ProAsp: 2.967 ± 0.278
1.484ProGlu: 1.484 ± 1.239
2.967ProPhe: 2.967 ± 0.278
5.935ProGly: 5.935 ± 3.846
1.484ProHis: 1.484 ± 1.239
0.0ProIle: 0.0 ± 0.0
1.484ProLys: 1.484 ± 0.962
4.451ProLeu: 4.451 ± 1.517
2.967ProMet: 2.967 ± 1.926
2.967ProAsn: 2.967 ± 0.278
0.0ProPro: 0.0 ± 0.0
0.0ProGln: 0.0 ± 0.0
1.484ProArg: 1.484 ± 1.239
2.967ProSer: 2.967 ± 0.278
5.935ProThr: 5.935 ± 1.646
0.0ProVal: 0.0 ± 0.0
4.451ProTrp: 4.451 ± 0.684
1.484ProTyr: 1.484 ± 1.239
0.0ProXaa: 0.0 ± 0.0
Gln
2.967GlnAla: 2.967 ± 2.478
1.484GlnCys: 1.484 ± 0.962
0.0GlnAsp: 0.0 ± 0.0
2.967GlnGlu: 2.967 ± 0.278
1.484GlnPhe: 1.484 ± 0.962
5.935GlnGly: 5.935 ± 0.555
1.484GlnHis: 1.484 ± 0.962
1.484GlnIle: 1.484 ± 0.962
2.967GlnLys: 2.967 ± 2.478
4.451GlnLeu: 4.451 ± 1.517
0.0GlnMet: 0.0 ± 0.0
0.0GlnAsn: 0.0 ± 0.0
0.0GlnPro: 0.0 ± 0.0
2.967GlnGln: 2.967 ± 0.278
7.418GlnArg: 7.418 ± 3.995
2.967GlnSer: 2.967 ± 1.923
1.484GlnThr: 1.484 ± 0.962
2.967GlnVal: 2.967 ± 1.923
0.0GlnTrp: 0.0 ± 0.0
1.484GlnTyr: 1.484 ± 1.239
0.0GlnXaa: 0.0 ± 0.0
Arg
4.451ArgAla: 4.451 ± 0.684
2.967ArgCys: 2.967 ± 2.478
5.935ArgAsp: 5.935 ± 1.646
1.484ArgGlu: 1.484 ± 1.239
2.967ArgPhe: 2.967 ± 0.278
7.418ArgGly: 7.418 ± 3.995
0.0ArgHis: 0.0 ± 0.0
1.484ArgIle: 1.484 ± 1.239
2.967ArgLys: 2.967 ± 1.923
2.967ArgLeu: 2.967 ± 0.278
2.967ArgMet: 2.967 ± 1.923
4.451ArgAsn: 4.451 ± 2.885
2.967ArgPro: 2.967 ± 2.478
1.484ArgGln: 1.484 ± 1.239
10.386ArgArg: 10.386 ± 2.33
5.935ArgSer: 5.935 ± 1.646
4.451ArgThr: 4.451 ± 2.885
2.967ArgVal: 2.967 ± 0.278
1.484ArgTrp: 1.484 ± 1.239
8.902ArgTyr: 8.902 ± 0.833
0.0ArgXaa: 0.0 ± 0.0
Ser
2.967SerAla: 2.967 ± 1.923
0.0SerCys: 0.0 ± 0.0
0.0SerAsp: 0.0 ± 0.0
1.484SerGlu: 1.484 ± 0.962
5.935SerPhe: 5.935 ± 0.555
5.935SerGly: 5.935 ± 1.646
0.0SerHis: 0.0 ± 0.0
4.451SerIle: 4.451 ± 0.684
4.451SerLys: 4.451 ± 3.717
2.967SerLeu: 2.967 ± 0.278
1.484SerMet: 1.484 ± 0.962
4.451SerAsn: 4.451 ± 0.684
1.484SerPro: 1.484 ± 0.962
1.484SerGln: 1.484 ± 0.962
8.902SerArg: 8.902 ± 1.368
1.484SerSer: 1.484 ± 0.962
4.451SerThr: 4.451 ± 2.885
1.484SerVal: 1.484 ± 0.962
0.0SerTrp: 0.0 ± 0.0
1.484SerTyr: 1.484 ± 0.962
0.0SerXaa: 0.0 ± 0.0
Thr
2.967ThrAla: 2.967 ± 1.923
0.0ThrCys: 0.0 ± 0.0
1.484ThrAsp: 1.484 ± 0.962
1.484ThrGlu: 1.484 ± 1.239
0.0ThrPhe: 0.0 ± 0.0
8.902ThrGly: 8.902 ± 1.368
0.0ThrHis: 0.0 ± 0.0
10.386ThrIle: 10.386 ± 4.53
4.451ThrLys: 4.451 ± 2.885
2.967ThrLeu: 2.967 ± 1.923
0.0ThrMet: 0.0 ± 0.0
1.484ThrAsn: 1.484 ± 1.239
2.967ThrPro: 2.967 ± 0.278
5.935ThrGln: 5.935 ± 1.646
7.418ThrArg: 7.418 ± 2.607
2.967ThrSer: 2.967 ± 1.923
5.935ThrThr: 5.935 ± 1.646
2.967ThrVal: 2.967 ± 1.923
0.0ThrTrp: 0.0 ± 0.0
2.967ThrTyr: 2.967 ± 1.923
0.0ThrXaa: 0.0 ± 0.0
Val
1.484ValAla: 1.484 ± 1.239
0.0ValCys: 0.0 ± 0.0
2.967ValAsp: 2.967 ± 1.923
4.451ValGlu: 4.451 ± 0.684
1.484ValPhe: 1.484 ± 1.239
4.451ValGly: 4.451 ± 0.684
1.484ValHis: 1.484 ± 0.962
5.935ValIle: 5.935 ± 0.555
2.967ValLys: 2.967 ± 0.278
2.967ValLeu: 2.967 ± 0.278
0.0ValMet: 0.0 ± 0.0
2.967ValAsn: 2.967 ± 0.278
1.484ValPro: 1.484 ± 0.962
2.967ValGln: 2.967 ± 0.278
1.484ValArg: 1.484 ± 0.962
4.451ValSer: 4.451 ± 2.885
2.967ValThr: 2.967 ± 1.923
1.484ValVal: 1.484 ± 1.239
1.484ValTrp: 1.484 ± 1.239
1.484ValTyr: 1.484 ± 1.239
0.0ValXaa: 0.0 ± 0.0
Trp
1.484TrpAla: 1.484 ± 1.239
0.0TrpCys: 0.0 ± 0.0
2.967TrpAsp: 2.967 ± 2.478
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
2.967TrpGly: 2.967 ± 2.478
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
1.484TrpLys: 1.484 ± 1.239
1.484TrpLeu: 1.484 ± 0.962
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
1.484TrpPro: 1.484 ± 1.239
0.0TrpGln: 0.0 ± 0.0
1.484TrpArg: 1.484 ± 0.962
0.0TrpSer: 0.0 ± 0.0
1.484TrpThr: 1.484 ± 0.962
1.484TrpVal: 1.484 ± 1.239
4.451TrpTrp: 4.451 ± 3.717
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.451TyrAla: 4.451 ± 2.885
1.484TyrCys: 1.484 ± 1.239
0.0TyrAsp: 0.0 ± 0.0
2.967TyrGlu: 2.967 ± 0.278
4.451TyrPhe: 4.451 ± 1.517
5.935TyrGly: 5.935 ± 1.646
0.0TyrHis: 0.0 ± 0.0
2.967TyrIle: 2.967 ± 0.278
2.967TyrLys: 2.967 ± 1.923
1.484TyrLeu: 1.484 ± 0.962
1.484TyrMet: 1.484 ± 0.962
4.451TyrAsn: 4.451 ± 0.684
5.935TyrPro: 5.935 ± 2.756
4.451TyrGln: 4.451 ± 1.517
1.484TyrArg: 1.484 ± 1.239
2.967TyrSer: 2.967 ± 0.278
0.0TyrThr: 0.0 ± 0.0
0.0TyrVal: 0.0 ± 0.0
0.0TyrTrp: 0.0 ± 0.0
2.967TyrTyr: 2.967 ± 2.478
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (675 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
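As a rough illustration of what a protein-level bootstrap could look like: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of those estimates is reported as the error. The page does not spell out the exact procedure, so the resampling details, the use of the population standard deviation, and the function names `pair_permille` and `bootstrap_error` are all assumptions made for this sketch.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over overlapping pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Spread of the frequency when whole proteins are resampled.

    Assumption: each replicate draws len(proteins) proteins with
    replacement, and the error is the standard deviation of the
    replicate estimates.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)


if __name__ == "__main__":
    # Toy sequences, not the viral proteins behind the table.
    toy = ["AAAAGT", "CCGTCC"]
    print(pair_permille(toy, "GT"), "±", bootstrap_error(toy, "GT"))
```

With a proteome of only two proteins, each replicate can take just a few distinct compositions, so errors estimated this way are coarse and can be large relative to the value itself.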

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski