Amino acid dipepetide frequency for Fly associated circular virus 3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.419AlaAla: 3.419 ± 2.269
0.0AlaCys: 0.0 ± 0.0
6.838AlaAsp: 6.838 ± 1.961
0.0AlaGlu: 0.0 ± 0.0
1.709AlaPhe: 1.709 ± 1.135
5.128AlaGly: 5.128 ± 3.404
1.709AlaHis: 1.709 ± 1.443
3.419AlaIle: 3.419 ± 0.308
3.419AlaLys: 3.419 ± 0.308
0.0AlaLeu: 0.0 ± 0.0
1.709AlaMet: 1.709 ± 1.443
1.709AlaAsn: 1.709 ± 1.135
0.0AlaPro: 0.0 ± 0.0
5.128AlaGln: 5.128 ± 1.752
1.709AlaArg: 1.709 ± 1.135
8.547AlaSer: 8.547 ± 2.06
3.419AlaThr: 3.419 ± 0.308
8.547AlaVal: 8.547 ± 0.518
1.709AlaTrp: 1.709 ± 1.443
1.709AlaTyr: 1.709 ± 1.135
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
1.709CysAsp: 1.709 ± 1.135
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.0CysLys: 0.0 ± 0.0
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
1.709CysArg: 1.709 ± 1.443
1.709CysSer: 1.709 ± 1.443
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.419AspAla: 3.419 ± 0.308
1.709AspCys: 1.709 ± 1.135
0.0AspAsp: 0.0 ± 0.0
1.709AspGlu: 1.709 ± 1.135
0.0AspPhe: 0.0 ± 0.0
11.966AspGly: 11.966 ± 2.787
1.709AspHis: 1.709 ± 1.443
1.709AspIle: 1.709 ± 1.443
3.419AspLys: 3.419 ± 2.886
5.128AspLeu: 5.128 ± 1.752
0.0AspMet: 0.0 ± 0.0
1.709AspAsn: 1.709 ± 1.443
8.547AspPro: 8.547 ± 0.518
5.128AspGln: 5.128 ± 3.404
5.128AspArg: 5.128 ± 1.752
3.419AspSer: 3.419 ± 2.269
6.838AspThr: 6.838 ± 3.195
6.838AspVal: 6.838 ± 0.617
0.0AspTrp: 0.0 ± 0.0
3.419AspTyr: 3.419 ± 0.308
0.0AspXaa: 0.0 ± 0.0
Glu
6.838GluAla: 6.838 ± 0.617
0.0GluCys: 0.0 ± 0.0
5.128GluAsp: 5.128 ± 3.404
1.709GluGlu: 1.709 ± 1.443
0.0GluPhe: 0.0 ± 0.0
0.0GluGly: 0.0 ± 0.0
0.0GluHis: 0.0 ± 0.0
0.0GluIle: 0.0 ± 0.0
0.0GluLys: 0.0 ± 0.0
3.419GluLeu: 3.419 ± 2.269
1.709GluMet: 1.709 ± 1.135
1.709GluAsn: 1.709 ± 1.135
0.0GluPro: 0.0 ± 0.0
0.0GluGln: 0.0 ± 0.0
3.419GluArg: 3.419 ± 2.886
1.709GluSer: 1.709 ± 1.135
3.419GluThr: 3.419 ± 2.886
5.128GluVal: 5.128 ± 1.752
3.419GluTrp: 3.419 ± 2.886
1.709GluTyr: 1.709 ± 1.443
0.0GluXaa: 0.0 ± 0.0
Phe
0.0PheAla: 0.0 ± 0.0
0.0PheCys: 0.0 ± 0.0
1.709PheAsp: 1.709 ± 1.135
0.0PheGlu: 0.0 ± 0.0
5.128PhePhe: 5.128 ± 0.826
1.709PheGly: 1.709 ± 1.443
0.0PheHis: 0.0 ± 0.0
0.0PheIle: 0.0 ± 0.0
6.838PheLys: 6.838 ± 1.961
0.0PheLeu: 0.0 ± 0.0
0.0PheMet: 0.0 ± 0.0
0.0PheAsn: 0.0 ± 0.0
5.128PhePro: 5.128 ± 3.404
5.128PheGln: 5.128 ± 0.826
3.419PheArg: 3.419 ± 0.308
3.419PheSer: 3.419 ± 0.308
1.709PheThr: 1.709 ± 1.135
3.419PheVal: 3.419 ± 2.269
0.0PheTrp: 0.0 ± 0.0
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
1.709GlyAla: 1.709 ± 1.135
0.0GlyCys: 0.0 ± 0.0
5.128GlyAsp: 5.128 ± 0.826
5.128GlyGlu: 5.128 ± 3.404
0.0GlyPhe: 0.0 ± 0.0
6.838GlyGly: 6.838 ± 4.539
3.419GlyHis: 3.419 ± 0.308
3.419GlyIle: 3.419 ± 2.886
5.128GlyLys: 5.128 ± 1.752
10.256GlyLeu: 10.256 ± 4.23
1.709GlyMet: 1.709 ± 1.443
1.709GlyAsn: 1.709 ± 1.443
3.419GlyPro: 3.419 ± 2.269
3.419GlyGln: 3.419 ± 0.308
1.709GlyArg: 1.709 ± 1.443
5.128GlySer: 5.128 ± 0.826
3.419GlyThr: 3.419 ± 2.269
8.547GlyVal: 8.547 ± 3.096
3.419GlyTrp: 3.419 ± 0.308
5.128GlyTyr: 5.128 ± 0.826
0.0GlyXaa: 0.0 ± 0.0
His
1.709HisAla: 1.709 ± 1.135
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
0.0HisGlu: 0.0 ± 0.0
0.0HisPhe: 0.0 ± 0.0
1.709HisGly: 1.709 ± 1.443
0.0HisHis: 0.0 ± 0.0
3.419HisIle: 3.419 ± 2.886
3.419HisLys: 3.419 ± 0.308
1.709HisLeu: 1.709 ± 1.443
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
0.0HisPro: 0.0 ± 0.0
3.419HisGln: 3.419 ± 2.269
1.709HisArg: 1.709 ± 1.135
0.0HisSer: 0.0 ± 0.0
1.709HisThr: 1.709 ± 1.443
0.0HisVal: 0.0 ± 0.0
1.709HisTrp: 1.709 ± 1.443
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
0.0IleAla: 0.0 ± 0.0
0.0IleCys: 0.0 ± 0.0
5.128IleAsp: 5.128 ± 1.752
5.128IleGlu: 5.128 ± 1.752
0.0IlePhe: 0.0 ± 0.0
1.709IleGly: 1.709 ± 1.135
0.0IleHis: 0.0 ± 0.0
5.128IleIle: 5.128 ± 1.752
3.419IleLys: 3.419 ± 2.886
0.0IleLeu: 0.0 ± 0.0
0.0IleMet: 0.0 ± 0.0
1.709IleAsn: 1.709 ± 1.443
5.128IlePro: 5.128 ± 0.826
3.419IleGln: 3.419 ± 0.308
3.419IleArg: 3.419 ± 2.886
1.709IleSer: 1.709 ± 1.135
3.419IleThr: 3.419 ± 2.269
3.419IleVal: 3.419 ± 0.308
0.0IleTrp: 0.0 ± 0.0
1.709IleTyr: 1.709 ± 1.443
0.0IleXaa: 0.0 ± 0.0
Lys
3.419LysAla: 3.419 ± 0.308
1.709LysCys: 1.709 ± 1.443
3.419LysAsp: 3.419 ± 2.886
3.419LysGlu: 3.419 ± 2.886
3.419LysPhe: 3.419 ± 0.308
5.128LysGly: 5.128 ± 1.752
0.0LysHis: 0.0 ± 0.0
0.0LysIle: 0.0 ± 0.0
1.709LysLys: 1.709 ± 1.443
10.256LysLeu: 10.256 ± 1.652
3.419LysMet: 3.419 ± 2.269
0.0LysAsn: 0.0 ± 0.0
1.709LysPro: 1.709 ± 1.135
1.709LysGln: 1.709 ± 1.443
3.419LysArg: 3.419 ± 2.269
5.128LysSer: 5.128 ± 1.752
6.838LysThr: 6.838 ± 3.195
3.419LysVal: 3.419 ± 0.308
6.838LysTrp: 6.838 ± 3.195
1.709LysTyr: 1.709 ± 1.135
0.0LysXaa: 0.0 ± 0.0
Leu
3.419LeuAla: 3.419 ± 0.308
0.0LeuCys: 0.0 ± 0.0
5.128LeuAsp: 5.128 ± 0.826
0.0LeuGlu: 0.0 ± 0.0
3.419LeuPhe: 3.419 ± 2.269
5.128LeuGly: 5.128 ± 0.826
3.419LeuHis: 3.419 ± 2.269
1.709LeuIle: 1.709 ± 1.443
3.419LeuLys: 3.419 ± 0.308
3.419LeuLeu: 3.419 ± 0.308
1.709LeuMet: 1.709 ± 0.923
1.709LeuAsn: 1.709 ± 1.443
6.838LeuPro: 6.838 ± 4.539
3.419LeuGln: 3.419 ± 2.269
1.709LeuArg: 1.709 ± 1.443
5.128LeuSer: 5.128 ± 0.826
8.547LeuThr: 8.547 ± 3.096
1.709LeuVal: 1.709 ± 1.443
0.0LeuTrp: 0.0 ± 0.0
5.128LeuTyr: 5.128 ± 1.752
0.0LeuXaa: 0.0 ± 0.0
Met
1.709MetAla: 1.709 ± 1.135
0.0MetCys: 0.0 ± 0.0
1.709MetAsp: 1.709 ± 1.443
1.709MetGlu: 1.709 ± 1.443
1.709MetPhe: 1.709 ± 1.443
1.709MetGly: 1.709 ± 1.443
1.709MetHis: 1.709 ± 1.135
3.419MetIle: 3.419 ± 2.269
1.709MetLys: 1.709 ± 1.135
1.709MetLeu: 1.709 ± 1.443
1.709MetMet: 1.709 ± 1.135
0.0MetAsn: 0.0 ± 0.0
3.419MetPro: 3.419 ± 0.308
1.709MetGln: 1.709 ± 1.135
0.0MetArg: 0.0 ± 0.0
0.0MetSer: 0.0 ± 0.0
1.709MetThr: 1.709 ± 1.443
0.0MetVal: 0.0 ± 0.0
0.0MetTrp: 0.0 ± 0.0
1.709MetTyr: 1.709 ± 1.135
0.0MetXaa: 0.0 ± 0.0
Asn
0.0AsnAla: 0.0 ± 0.0
0.0AsnCys: 0.0 ± 0.0
5.128AsnAsp: 5.128 ± 1.752
0.0AsnGlu: 0.0 ± 0.0
1.709AsnPhe: 1.709 ± 1.135
3.419AsnGly: 3.419 ± 0.308
1.709AsnHis: 1.709 ± 1.443
0.0AsnIle: 0.0 ± 0.0
3.419AsnLys: 3.419 ± 0.308
3.419AsnLeu: 3.419 ± 2.269
0.0AsnMet: 0.0 ± 0.0
0.0AsnAsn: 0.0 ± 0.0
1.709AsnPro: 1.709 ± 1.135
0.0AsnGln: 0.0 ± 0.0
0.0AsnArg: 0.0 ± 0.0
1.709AsnSer: 1.709 ± 1.135
1.709AsnThr: 1.709 ± 1.135
1.709AsnVal: 1.709 ± 1.135
1.709AsnTrp: 1.709 ± 1.443
1.709AsnTyr: 1.709 ± 1.135
0.0AsnXaa: 0.0 ± 0.0
Pro
3.419ProAla: 3.419 ± 2.269
0.0ProCys: 0.0 ± 0.0
1.709ProAsp: 1.709 ± 1.443
1.709ProGlu: 1.709 ± 1.443
3.419ProPhe: 3.419 ± 2.269
5.128ProGly: 5.128 ± 3.404
0.0ProHis: 0.0 ± 0.0
3.419ProIle: 3.419 ± 0.308
1.709ProLys: 1.709 ± 1.443
0.0ProLeu: 0.0 ± 0.0
3.419ProMet: 3.419 ± 2.269
0.0ProAsn: 0.0 ± 0.0
1.709ProPro: 1.709 ± 1.135
1.709ProGln: 1.709 ± 1.135
8.547ProArg: 8.547 ± 2.06
3.419ProSer: 3.419 ± 2.269
6.838ProThr: 6.838 ± 4.539
5.128ProVal: 5.128 ± 3.404
0.0ProTrp: 0.0 ± 0.0
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
8.547GlnAla: 8.547 ± 4.638
0.0GlnCys: 0.0 ± 0.0
5.128GlnAsp: 5.128 ± 1.752
1.709GlnGlu: 1.709 ± 1.443
1.709GlnPhe: 1.709 ± 1.135
0.0GlnGly: 0.0 ± 0.0
0.0GlnHis: 0.0 ± 0.0
5.128GlnIle: 5.128 ± 0.826
1.709GlnLys: 1.709 ± 1.135
6.838GlnLeu: 6.838 ± 1.961
0.0GlnMet: 0.0 ± 0.808
1.709GlnAsn: 1.709 ± 1.135
1.709GlnPro: 1.709 ± 1.135
1.709GlnGln: 1.709 ± 1.135
5.128GlnArg: 5.128 ± 0.826
1.709GlnSer: 1.709 ± 1.135
3.419GlnThr: 3.419 ± 0.308
3.419GlnVal: 3.419 ± 0.308
1.709GlnTrp: 1.709 ± 1.135
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
0.0ArgAla: 0.0 ± 0.0
0.0ArgCys: 0.0 ± 0.0
3.419ArgAsp: 3.419 ± 0.308
3.419ArgGlu: 3.419 ± 0.308
5.128ArgPhe: 5.128 ± 1.752
6.838ArgGly: 6.838 ± 1.961
1.709ArgHis: 1.709 ± 1.443
3.419ArgIle: 3.419 ± 0.308
6.838ArgLys: 6.838 ± 3.195
3.419ArgLeu: 3.419 ± 0.308
0.0ArgMet: 0.0 ± 0.0
1.709ArgAsn: 1.709 ± 1.135
3.419ArgPro: 3.419 ± 0.308
1.709ArgGln: 1.709 ± 1.443
5.128ArgArg: 5.128 ± 1.752
1.709ArgSer: 1.709 ± 1.135
1.709ArgThr: 1.709 ± 1.443
1.709ArgVal: 1.709 ± 1.135
3.419ArgTrp: 3.419 ± 0.308
5.128ArgTyr: 5.128 ± 1.752
0.0ArgXaa: 0.0 ± 0.0
Ser
10.256SerAla: 10.256 ± 4.23
0.0SerCys: 0.0 ± 0.0
5.128SerAsp: 5.128 ± 1.752
1.709SerGlu: 1.709 ± 1.443
0.0SerPhe: 0.0 ± 0.0
6.838SerGly: 6.838 ± 1.961
0.0SerHis: 0.0 ± 0.0
1.709SerIle: 1.709 ± 1.135
3.419SerLys: 3.419 ± 0.308
1.709SerLeu: 1.709 ± 1.135
1.709SerMet: 1.709 ± 1.135
3.419SerAsn: 3.419 ± 0.308
1.709SerPro: 1.709 ± 1.135
0.0SerGln: 0.0 ± 0.0
1.709SerArg: 1.709 ± 1.135
3.419SerSer: 3.419 ± 2.886
6.838SerThr: 6.838 ± 3.195
1.709SerVal: 1.709 ± 1.135
1.709SerTrp: 1.709 ± 1.443
1.709SerTyr: 1.709 ± 1.135
0.0SerXaa: 0.0 ± 0.0
Thr
1.709ThrAla: 1.709 ± 1.443
1.709ThrCys: 1.709 ± 1.443
5.128ThrAsp: 5.128 ± 0.826
3.419ThrGlu: 3.419 ± 2.269
1.709ThrPhe: 1.709 ± 1.443
6.838ThrGly: 6.838 ± 0.617
0.0ThrHis: 0.0 ± 0.0
1.709ThrIle: 1.709 ± 1.135
3.419ThrLys: 3.419 ± 2.269
3.419ThrLeu: 3.419 ± 0.308
3.419ThrMet: 3.419 ± 2.886
8.547ThrAsn: 8.547 ± 3.096
5.128ThrPro: 5.128 ± 0.826
1.709ThrGln: 1.709 ± 1.443
1.709ThrArg: 1.709 ± 1.135
3.419ThrSer: 3.419 ± 0.308
1.709ThrThr: 1.709 ± 1.135
13.675ThrVal: 13.675 ± 1.234
5.128ThrTrp: 5.128 ± 4.329
0.0ThrTyr: 0.0 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
8.547ValAla: 8.547 ± 2.06
0.0ValCys: 0.0 ± 0.0
3.419ValAsp: 3.419 ± 2.269
1.709ValGlu: 1.709 ± 1.135
3.419ValPhe: 3.419 ± 2.269
5.128ValGly: 5.128 ± 0.826
5.128ValHis: 5.128 ± 1.752
5.128ValIle: 5.128 ± 1.752
5.128ValLys: 5.128 ± 1.752
5.128ValLeu: 5.128 ± 0.826
0.0ValMet: 0.0 ± 0.0
3.419ValAsn: 3.419 ± 2.269
1.709ValPro: 1.709 ± 1.443
6.838ValGln: 6.838 ± 3.195
3.419ValArg: 3.419 ± 0.308
1.709ValSer: 1.709 ± 1.135
5.128ValThr: 5.128 ± 0.826
11.966ValVal: 11.966 ± 4.946
1.709ValTrp: 1.709 ± 1.443
6.838ValTyr: 6.838 ± 4.539
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
1.709TrpAsp: 1.709 ± 1.443
0.0TrpGlu: 0.0 ± 0.0
3.419TrpPhe: 3.419 ± 2.269
1.709TrpGly: 1.709 ± 1.443
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
1.709TrpLys: 1.709 ± 1.443
3.419TrpLeu: 3.419 ± 2.886
5.128TrpMet: 5.128 ± 1.752
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
5.128TrpGln: 5.128 ± 0.826
3.419TrpArg: 3.419 ± 2.886
0.0TrpSer: 0.0 ± 0.0
3.419TrpThr: 3.419 ± 2.886
3.419TrpVal: 3.419 ± 2.886
0.0TrpTrp: 0.0 ± 0.0
1.709TrpTyr: 1.709 ± 1.443
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.709TyrAla: 1.709 ± 1.135
0.0TyrCys: 0.0 ± 0.0
5.128TyrAsp: 5.128 ± 1.752
6.838TyrGlu: 6.838 ± 0.617
1.709TyrPhe: 1.709 ± 1.135
1.709TyrGly: 1.709 ± 1.135
0.0TyrHis: 0.0 ± 0.0
1.709TyrIle: 1.709 ± 1.443
6.838TyrLys: 6.838 ± 0.617
1.709TyrLeu: 1.709 ± 1.135
0.0TyrMet: 0.0 ± 0.0
0.0TyrAsn: 0.0 ± 0.0
0.0TyrPro: 0.0 ± 0.0
1.709TyrGln: 1.709 ± 1.135
3.419TyrArg: 3.419 ± 2.269
1.709TyrSer: 1.709 ± 1.443
1.709TyrThr: 1.709 ± 1.135
1.709TyrVal: 1.709 ± 1.443
1.709TyrTrp: 1.709 ± 1.135
5.128TyrTyr: 5.128 ± 3.404
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (586 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski