Amino acid dipepetide frequency for Fly associated circular virus 4

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.0AlaAla: 0.0 ± 0.0
3.311AlaCys: 3.311 ± 2.193
6.623AlaAsp: 6.623 ± 0.475
1.656AlaGlu: 1.656 ± 1.097
1.656AlaPhe: 1.656 ± 1.097
4.967AlaGly: 4.967 ± 1.572
1.656AlaHis: 1.656 ± 1.334
6.623AlaIle: 6.623 ± 0.475
3.311AlaLys: 3.311 ± 2.193
3.311AlaLeu: 3.311 ± 2.668
4.967AlaMet: 4.967 ± 3.29
6.623AlaAsn: 6.623 ± 4.387
1.656AlaPro: 1.656 ± 1.097
1.656AlaGln: 1.656 ± 1.097
1.656AlaArg: 1.656 ± 1.334
3.311AlaSer: 3.311 ± 2.193
1.656AlaThr: 1.656 ± 1.097
1.656AlaVal: 1.656 ± 1.097
0.0AlaTrp: 0.0 ± 0.0
3.311AlaTyr: 3.311 ± 2.668
0.0AlaXaa: 0.0 ± 0.0
Cys
4.967CysAla: 4.967 ± 1.572
0.0CysCys: 0.0 ± 0.0
1.656CysAsp: 1.656 ± 1.097
0.0CysGlu: 0.0 ± 0.0
1.656CysPhe: 1.656 ± 1.334
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
1.656CysIle: 1.656 ± 1.097
1.656CysLys: 1.656 ± 1.334
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
1.656CysAsn: 1.656 ± 1.334
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
0.0CysSer: 0.0 ± 0.0
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
4.967AspAla: 4.967 ± 3.29
1.656AspCys: 1.656 ± 1.334
3.311AspAsp: 3.311 ± 2.193
1.656AspGlu: 1.656 ± 1.334
0.0AspPhe: 0.0 ± 0.0
3.311AspGly: 3.311 ± 2.668
0.0AspHis: 0.0 ± 0.0
1.656AspIle: 1.656 ± 1.334
3.311AspLys: 3.311 ± 0.237
9.934AspLeu: 9.934 ± 1.719
1.656AspMet: 1.656 ± 1.097
3.311AspAsn: 3.311 ± 0.237
3.311AspPro: 3.311 ± 0.237
4.967AspGln: 4.967 ± 0.859
4.967AspArg: 4.967 ± 4.003
4.967AspSer: 4.967 ± 0.859
0.0AspThr: 0.0 ± 0.0
4.967AspVal: 4.967 ± 0.859
1.656AspTrp: 1.656 ± 1.334
1.656AspTyr: 1.656 ± 1.097
0.0AspXaa: 0.0 ± 0.0
Glu
0.0GluAla: 0.0 ± 0.0
0.0GluCys: 0.0 ± 0.0
1.656GluAsp: 1.656 ± 1.334
1.656GluGlu: 1.656 ± 1.334
3.311GluPhe: 3.311 ± 2.193
1.656GluGly: 1.656 ± 1.334
1.656GluHis: 1.656 ± 1.334
4.967GluIle: 4.967 ± 3.29
0.0GluLys: 0.0 ± 0.0
0.0GluLeu: 0.0 ± 0.0
0.0GluMet: 0.0 ± 0.0
0.0GluAsn: 0.0 ± 0.0
3.311GluPro: 3.311 ± 2.193
0.0GluGln: 0.0 ± 0.0
4.967GluArg: 4.967 ± 4.003
6.623GluSer: 6.623 ± 0.475
6.623GluThr: 6.623 ± 2.906
1.656GluVal: 1.656 ± 1.334
0.0GluTrp: 0.0 ± 0.0
1.656GluTyr: 1.656 ± 1.334
0.0GluXaa: 0.0 ± 0.0
Phe
0.0PheAla: 0.0 ± 0.0
0.0PheCys: 0.0 ± 0.0
4.967PheAsp: 4.967 ± 1.572
0.0PheGlu: 0.0 ± 0.0
1.656PhePhe: 1.656 ± 1.334
3.311PheGly: 3.311 ± 0.237
0.0PheHis: 0.0 ± 0.0
0.0PheIle: 0.0 ± 0.0
0.0PheLys: 0.0 ± 0.0
0.0PheLeu: 0.0 ± 0.0
1.656PheMet: 1.656 ± 1.097
1.656PheAsn: 1.656 ± 1.097
0.0PhePro: 0.0 ± 0.0
6.623PheGln: 6.623 ± 1.956
1.656PheArg: 1.656 ± 1.097
6.623PheSer: 6.623 ± 1.956
0.0PheThr: 0.0 ± 0.0
3.311PheVal: 3.311 ± 2.193
0.0PheTrp: 0.0 ± 0.0
1.656PheTyr: 1.656 ± 1.334
0.0PheXaa: 0.0 ± 0.0
Gly
3.311GlyAla: 3.311 ± 0.237
1.656GlyCys: 1.656 ± 1.334
3.311GlyAsp: 3.311 ± 2.193
3.311GlyGlu: 3.311 ± 2.193
1.656GlyPhe: 1.656 ± 1.097
3.311GlyGly: 3.311 ± 0.237
0.0GlyHis: 0.0 ± 0.0
1.656GlyIle: 1.656 ± 1.097
3.311GlyLys: 3.311 ± 0.237
9.934GlyLeu: 9.934 ± 1.719
0.0GlyMet: 0.0 ± 0.0
4.967GlyAsn: 4.967 ± 0.859
1.656GlyPro: 1.656 ± 1.097
0.0GlyGln: 0.0 ± 0.0
6.623GlyArg: 6.623 ± 2.906
4.967GlySer: 4.967 ± 0.859
1.656GlyThr: 1.656 ± 1.334
6.623GlyVal: 6.623 ± 1.956
1.656GlyTrp: 1.656 ± 1.097
3.311GlyTyr: 3.311 ± 2.668
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
0.0HisGlu: 0.0 ± 0.0
0.0HisPhe: 0.0 ± 0.0
1.656HisGly: 1.656 ± 1.334
0.0HisHis: 0.0 ± 0.0
1.656HisIle: 1.656 ± 1.334
0.0HisLys: 0.0 ± 0.0
1.656HisLeu: 1.656 ± 1.334
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
0.0HisPro: 0.0 ± 0.0
0.0HisGln: 0.0 ± 0.0
3.311HisArg: 3.311 ± 0.237
1.656HisSer: 1.656 ± 1.334
1.656HisThr: 1.656 ± 1.097
0.0HisVal: 0.0 ± 0.0
0.0HisTrp: 0.0 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
6.623IleAla: 6.623 ± 1.956
0.0IleCys: 0.0 ± 0.0
1.656IleAsp: 1.656 ± 1.334
6.623IleGlu: 6.623 ± 2.906
0.0IlePhe: 0.0 ± 0.0
6.623IleGly: 6.623 ± 4.387
1.656IleHis: 1.656 ± 1.097
4.967IleIle: 4.967 ± 1.572
3.311IleLys: 3.311 ± 0.237
8.278IleLeu: 8.278 ± 1.809
1.656IleMet: 1.656 ± 1.097
0.0IleAsn: 0.0 ± 0.0
3.311IlePro: 3.311 ± 0.237
4.967IleGln: 4.967 ± 1.572
1.656IleArg: 1.656 ± 1.334
4.967IleSer: 4.967 ± 0.859
1.656IleThr: 1.656 ± 1.097
3.311IleVal: 3.311 ± 0.237
0.0IleTrp: 0.0 ± 0.0
0.0IleTyr: 0.0 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
1.656LysAla: 1.656 ± 1.097
0.0LysCys: 0.0 ± 0.0
4.967LysAsp: 4.967 ± 1.572
1.656LysGlu: 1.656 ± 1.334
1.656LysPhe: 1.656 ± 1.097
3.311LysGly: 3.311 ± 0.237
0.0LysHis: 0.0 ± 0.0
1.656LysIle: 1.656 ± 1.097
1.656LysLys: 1.656 ± 1.097
6.623LysLeu: 6.623 ± 0.475
3.311LysMet: 3.311 ± 2.193
1.656LysAsn: 1.656 ± 1.097
0.0LysPro: 0.0 ± 0.0
1.656LysGln: 1.656 ± 1.334
4.967LysArg: 4.967 ± 4.003
1.656LysSer: 1.656 ± 1.334
1.656LysThr: 1.656 ± 1.097
4.967LysVal: 4.967 ± 1.572
1.656LysTrp: 1.656 ± 1.334
1.656LysTyr: 1.656 ± 1.097
0.0LysXaa: 0.0 ± 0.0
Leu
6.623LeuAla: 6.623 ± 1.956
1.656LeuCys: 1.656 ± 1.334
3.311LeuAsp: 3.311 ± 2.668
0.0LeuGlu: 0.0 ± 0.0
0.0LeuPhe: 0.0 ± 0.0
1.656LeuGly: 1.656 ± 1.097
0.0LeuHis: 0.0 ± 0.0
4.967LeuIle: 4.967 ± 1.572
6.623LeuLys: 6.623 ± 2.906
8.278LeuLeu: 8.278 ± 1.809
1.656LeuMet: 1.656 ± 1.097
6.623LeuAsn: 6.623 ± 1.956
8.278LeuPro: 8.278 ± 1.809
8.278LeuGln: 8.278 ± 0.622
3.311LeuArg: 3.311 ± 2.668
4.967LeuSer: 4.967 ± 0.859
1.656LeuThr: 1.656 ± 1.334
4.967LeuVal: 4.967 ± 1.572
1.656LeuTrp: 1.656 ± 1.334
9.934LeuTyr: 9.934 ± 1.719
0.0LeuXaa: 0.0 ± 0.0
Met
1.656MetAla: 1.656 ± 1.097
0.0MetCys: 0.0 ± 0.0
0.0MetAsp: 0.0 ± 0.0
1.656MetGlu: 1.656 ± 1.097
6.623MetPhe: 6.623 ± 4.387
1.656MetGly: 1.656 ± 1.097
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
0.0MetLys: 0.0 ± 0.0
1.656MetLeu: 1.656 ± 1.097
0.0MetMet: 0.0 ± 0.0
1.656MetAsn: 1.656 ± 1.097
3.311MetPro: 3.311 ± 2.193
0.0MetGln: 0.0 ± 0.0
1.656MetArg: 1.656 ± 1.097
4.967MetSer: 4.967 ± 0.859
0.0MetThr: 0.0 ± 0.0
4.967MetVal: 4.967 ± 1.572
1.656MetTrp: 1.656 ± 1.097
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
1.656AsnAla: 1.656 ± 1.334
0.0AsnCys: 0.0 ± 0.0
3.311AsnAsp: 3.311 ± 0.237
0.0AsnGlu: 0.0 ± 0.0
3.311AsnPhe: 3.311 ± 0.237
1.656AsnGly: 1.656 ± 1.097
1.656AsnHis: 1.656 ± 1.097
3.311AsnIle: 3.311 ± 0.237
1.656AsnLys: 1.656 ± 1.097
1.656AsnLeu: 1.656 ± 1.097
0.0AsnMet: 0.0 ± 0.0
1.656AsnAsn: 1.656 ± 1.097
0.0AsnPro: 0.0 ± 0.0
1.656AsnGln: 1.656 ± 1.097
3.311AsnArg: 3.311 ± 0.237
8.278AsnSer: 8.278 ± 3.053
6.623AsnThr: 6.623 ± 1.956
4.967AsnVal: 4.967 ± 3.29
0.0AsnTrp: 0.0 ± 0.0
1.656AsnTyr: 1.656 ± 1.097
0.0AsnXaa: 0.0 ± 0.0
Pro
4.967ProAla: 4.967 ± 3.29
1.656ProCys: 1.656 ± 1.334
1.656ProAsp: 1.656 ± 1.097
0.0ProGlu: 0.0 ± 0.0
1.656ProPhe: 1.656 ± 1.334
3.311ProGly: 3.311 ± 0.237
0.0ProHis: 0.0 ± 0.0
1.656ProIle: 1.656 ± 1.097
1.656ProLys: 1.656 ± 1.334
4.967ProLeu: 4.967 ± 1.572
3.311ProMet: 3.311 ± 0.237
0.0ProAsn: 0.0 ± 0.0
1.656ProPro: 1.656 ± 1.097
3.311ProGln: 3.311 ± 2.193
4.967ProArg: 4.967 ± 1.572
3.311ProSer: 3.311 ± 2.193
1.656ProThr: 1.656 ± 1.097
1.656ProVal: 1.656 ± 1.097
0.0ProTrp: 0.0 ± 0.0
1.656ProTyr: 1.656 ± 1.097
0.0ProXaa: 0.0 ± 0.0
Gln
1.656GlnAla: 1.656 ± 1.097
0.0GlnCys: 0.0 ± 0.0
3.311GlnAsp: 3.311 ± 0.237
0.0GlnGlu: 0.0 ± 0.0
3.311GlnPhe: 3.311 ± 0.237
0.0GlnGly: 0.0 ± 0.0
0.0GlnHis: 0.0 ± 0.0
1.656GlnIle: 1.656 ± 1.097
3.311GlnLys: 3.311 ± 0.237
3.311GlnLeu: 3.311 ± 2.193
1.656GlnMet: 1.656 ± 1.097
3.311GlnAsn: 3.311 ± 2.193
1.656GlnPro: 1.656 ± 1.334
0.0GlnGln: 0.0 ± 0.0
1.656GlnArg: 1.656 ± 1.334
9.934GlnSer: 9.934 ± 4.149
1.656GlnThr: 1.656 ± 1.334
6.623GlnVal: 6.623 ± 0.475
0.0GlnTrp: 0.0 ± 0.0
3.311GlnTyr: 3.311 ± 0.237
0.0GlnXaa: 0.0 ± 0.0
Arg
4.967ArgAla: 4.967 ± 1.572
0.0ArgCys: 0.0 ± 0.0
1.656ArgAsp: 1.656 ± 1.334
3.311ArgGlu: 3.311 ± 2.668
1.656ArgPhe: 1.656 ± 1.334
8.278ArgGly: 8.278 ± 4.24
3.311ArgHis: 3.311 ± 2.668
6.623ArgIle: 6.623 ± 0.475
3.311ArgLys: 3.311 ± 0.237
1.656ArgLeu: 1.656 ± 1.097
1.656ArgMet: 1.656 ± 1.741
0.0ArgAsn: 0.0 ± 0.0
0.0ArgPro: 0.0 ± 0.0
1.656ArgGln: 1.656 ± 1.334
1.656ArgArg: 1.656 ± 1.334
0.0ArgSer: 0.0 ± 0.0
3.311ArgThr: 3.311 ± 2.668
6.623ArgVal: 6.623 ± 5.337
4.967ArgTrp: 4.967 ± 4.003
4.967ArgTyr: 4.967 ± 1.572
0.0ArgXaa: 0.0 ± 0.0
Ser
6.623SerAla: 6.623 ± 0.475
1.656SerCys: 1.656 ± 1.097
8.278SerAsp: 8.278 ± 0.622
8.278SerGlu: 8.278 ± 0.622
1.656SerPhe: 1.656 ± 1.334
6.623SerGly: 6.623 ± 4.387
0.0SerHis: 0.0 ± 0.0
4.967SerIle: 4.967 ± 0.859
1.656SerLys: 1.656 ± 1.097
4.967SerLeu: 4.967 ± 0.859
6.623SerMet: 6.623 ± 2.681
6.623SerAsn: 6.623 ± 4.387
4.967SerPro: 4.967 ± 0.859
0.0SerGln: 0.0 ± 0.0
1.656SerArg: 1.656 ± 1.334
13.245SerSer: 13.245 ± 0.95
9.934SerThr: 9.934 ± 0.712
8.278SerVal: 8.278 ± 0.622
1.656SerTrp: 1.656 ± 1.334
4.967SerTyr: 4.967 ± 0.859
0.0SerXaa: 0.0 ± 0.0
Thr
1.656ThrAla: 1.656 ± 1.097
0.0ThrCys: 0.0 ± 0.0
0.0ThrAsp: 0.0 ± 0.0
1.656ThrGlu: 1.656 ± 1.097
1.656ThrPhe: 1.656 ± 1.097
4.967ThrGly: 4.967 ± 0.859
0.0ThrHis: 0.0 ± 0.0
4.967ThrIle: 4.967 ± 0.859
3.311ThrLys: 3.311 ± 2.668
1.656ThrLeu: 1.656 ± 1.334
0.0ThrMet: 0.0 ± 0.0
1.656ThrAsn: 1.656 ± 1.334
3.311ThrPro: 3.311 ± 2.193
1.656ThrGln: 1.656 ± 1.097
1.656ThrArg: 1.656 ± 1.334
9.934ThrSer: 9.934 ± 0.712
0.0ThrThr: 0.0 ± 0.0
3.311ThrVal: 3.311 ± 2.668
3.311ThrTrp: 3.311 ± 0.237
3.311ThrTyr: 3.311 ± 2.193
0.0ThrXaa: 0.0 ± 0.0
Val
6.623ValAla: 6.623 ± 0.475
0.0ValCys: 0.0 ± 0.0
6.623ValAsp: 6.623 ± 1.956
3.311ValGlu: 3.311 ± 0.237
1.656ValPhe: 1.656 ± 1.097
3.311ValGly: 3.311 ± 2.193
1.656ValHis: 1.656 ± 1.334
3.311ValIle: 3.311 ± 0.237
3.311ValLys: 3.311 ± 0.237
9.934ValLeu: 9.934 ± 3.143
1.656ValMet: 1.656 ± 1.334
1.656ValAsn: 1.656 ± 1.334
1.656ValPro: 1.656 ± 1.334
8.278ValGln: 8.278 ± 0.622
1.656ValArg: 1.656 ± 1.334
8.278ValSer: 8.278 ± 3.053
4.967ValThr: 4.967 ± 0.859
4.967ValVal: 4.967 ± 1.572
1.656ValTrp: 1.656 ± 1.334
1.656ValTyr: 1.656 ± 1.334
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
1.656TrpGlu: 1.656 ± 1.334
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
3.311TrpIle: 3.311 ± 2.668
3.311TrpLys: 3.311 ± 0.237
1.656TrpLeu: 1.656 ± 1.334
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
3.311TrpArg: 3.311 ± 0.237
1.656TrpSer: 1.656 ± 1.334
1.656TrpThr: 1.656 ± 1.097
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
3.311TrpTyr: 3.311 ± 2.668
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.656TyrAla: 1.656 ± 1.097
1.656TyrCys: 1.656 ± 1.334
4.967TyrAsp: 4.967 ± 0.859
3.311TyrGlu: 3.311 ± 2.668
0.0TyrPhe: 0.0 ± 0.0
3.311TyrGly: 3.311 ± 2.193
0.0TyrHis: 0.0 ± 0.0
1.656TyrIle: 1.656 ± 1.334
1.656TyrLys: 1.656 ± 1.097
4.967TyrLeu: 4.967 ± 4.003
0.0TyrMet: 0.0 ± 0.0
3.311TyrAsn: 3.311 ± 0.237
4.967TyrPro: 4.967 ± 0.859
1.656TyrGln: 1.656 ± 1.097
6.623TyrArg: 6.623 ± 0.475
3.311TyrSer: 3.311 ± 2.668
1.656TyrThr: 1.656 ± 1.097
3.311TyrVal: 3.311 ± 0.237
0.0TyrTrp: 0.0 ± 0.0
4.967TyrTyr: 4.967 ± 3.29
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (605 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski