Amino acid dipepetide frequency for Sewage-associated circular DNA virus-36

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.279AlaAla: 3.279 ± 0.542
0.0AlaCys: 0.0 ± 0.0
3.279AlaAsp: 3.279 ± 2.06
1.639AlaGlu: 1.639 ± 1.03
0.0AlaPhe: 0.0 ± 0.0
6.557AlaGly: 6.557 ± 3.685
1.639AlaHis: 1.639 ± 1.03
8.197AlaIle: 8.197 ± 2.655
4.918AlaLys: 4.918 ± 3.09
8.197AlaLeu: 8.197 ± 2.549
3.279AlaMet: 3.279 ± 2.413
1.639AlaAsn: 1.639 ± 1.03
1.639AlaPro: 1.639 ± 1.03
6.557AlaGln: 6.557 ± 1.519
4.918AlaArg: 4.918 ± 3.09
3.279AlaSer: 3.279 ± 0.542
0.0AlaThr: 0.0 ± 0.0
6.557AlaVal: 6.557 ± 1.083
0.0AlaTrp: 0.0 ± 0.0
1.639AlaTyr: 1.639 ± 1.572
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
1.639CysAsp: 1.639 ± 1.03
0.0CysGlu: 0.0 ± 0.0
1.639CysPhe: 1.639 ± 1.03
1.639CysGly: 1.639 ± 1.03
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.0CysLys: 0.0 ± 0.0
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
1.639CysGln: 1.639 ± 1.03
0.0CysArg: 0.0 ± 0.0
1.639CysSer: 1.639 ± 1.03
0.0CysThr: 0.0 ± 0.0
1.639CysVal: 1.639 ± 1.03
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
4.918AspAla: 4.918 ± 4.716
0.0AspCys: 0.0 ± 0.0
6.557AspAsp: 6.557 ± 4.121
8.197AspGlu: 8.197 ± 2.549
3.279AspPhe: 3.279 ± 2.06
1.639AspGly: 1.639 ± 1.03
1.639AspHis: 1.639 ± 1.03
6.557AspIle: 6.557 ± 1.083
3.279AspLys: 3.279 ± 0.542
6.557AspLeu: 6.557 ± 1.519
0.0AspMet: 0.0 ± 0.0
1.639AspAsn: 1.639 ± 1.03
0.0AspPro: 0.0 ± 0.0
1.639AspGln: 1.639 ± 1.572
4.918AspArg: 4.918 ± 0.488
3.279AspSer: 3.279 ± 0.542
3.279AspThr: 3.279 ± 0.542
3.279AspVal: 3.279 ± 2.06
3.279AspTrp: 3.279 ± 2.06
1.639AspTyr: 1.639 ± 1.03
0.0AspXaa: 0.0 ± 0.0
Glu
0.0GluAla: 0.0 ± 0.0
0.0GluCys: 0.0 ± 0.0
0.0GluAsp: 0.0 ± 0.0
4.918GluGlu: 4.918 ± 3.09
4.918GluPhe: 4.918 ± 3.09
0.0GluGly: 0.0 ± 0.0
3.279GluHis: 3.279 ± 2.06
3.279GluIle: 3.279 ± 0.542
1.639GluLys: 1.639 ± 1.03
0.0GluLeu: 0.0 ± 0.0
0.0GluMet: 0.0 ± 0.0
1.639GluAsn: 1.639 ± 1.03
3.279GluPro: 3.279 ± 2.06
1.639GluGln: 1.639 ± 1.572
3.279GluArg: 3.279 ± 2.06
9.836GluSer: 9.836 ± 0.977
3.279GluThr: 3.279 ± 0.542
9.836GluVal: 9.836 ± 0.977
1.639GluTrp: 1.639 ± 1.03
0.0GluTyr: 0.0 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
1.639PheAla: 1.639 ± 1.572
0.0PheCys: 0.0 ± 0.0
6.557PheAsp: 6.557 ± 4.121
3.279PheGlu: 3.279 ± 0.542
1.639PhePhe: 1.639 ± 1.03
0.0PheGly: 0.0 ± 0.0
3.279PheHis: 3.279 ± 0.542
3.279PheIle: 3.279 ± 0.542
1.639PheLys: 1.639 ± 1.572
6.557PheLeu: 6.557 ± 1.519
0.0PheMet: 0.0 ± 0.0
0.0PheAsn: 0.0 ± 0.0
0.0PhePro: 0.0 ± 0.0
3.279PheGln: 3.279 ± 0.542
1.639PheArg: 1.639 ± 1.572
1.639PheSer: 1.639 ± 1.03
4.918PheThr: 4.918 ± 0.488
0.0PheVal: 0.0 ± 0.0
0.0PheTrp: 0.0 ± 0.0
3.279PheTyr: 3.279 ± 3.144
0.0PheXaa: 0.0 ± 0.0
Gly
4.918GlyAla: 4.918 ± 0.488
0.0GlyCys: 0.0 ± 0.0
4.918GlyAsp: 4.918 ± 2.114
3.279GlyGlu: 3.279 ± 0.542
0.0GlyPhe: 0.0 ± 0.0
1.639GlyGly: 1.639 ± 1.03
1.639GlyHis: 1.639 ± 1.572
4.918GlyIle: 4.918 ± 0.488
3.279GlyLys: 3.279 ± 2.06
3.279GlyLeu: 3.279 ± 0.542
3.279GlyMet: 3.279 ± 2.06
1.639GlyAsn: 1.639 ± 1.572
6.557GlyPro: 6.557 ± 1.083
3.279GlyGln: 3.279 ± 3.144
4.918GlyArg: 4.918 ± 3.09
3.279GlySer: 3.279 ± 3.144
3.279GlyThr: 3.279 ± 3.144
3.279GlyVal: 3.279 ± 2.06
0.0GlyTrp: 0.0 ± 0.0
1.639GlyTyr: 1.639 ± 1.572
0.0GlyXaa: 0.0 ± 0.0
His
3.279HisAla: 3.279 ± 2.06
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
0.0HisGlu: 0.0 ± 0.0
0.0HisPhe: 0.0 ± 0.0
3.279HisGly: 3.279 ± 0.542
1.639HisHis: 1.639 ± 1.03
0.0HisIle: 0.0 ± 0.0
1.639HisLys: 1.639 ± 1.03
0.0HisLeu: 0.0 ± 0.0
1.639HisMet: 1.639 ± 1.572
1.639HisAsn: 1.639 ± 1.03
1.639HisPro: 1.639 ± 1.03
1.639HisGln: 1.639 ± 1.03
3.279HisArg: 3.279 ± 3.144
1.639HisSer: 1.639 ± 1.03
1.639HisThr: 1.639 ± 1.03
3.279HisVal: 3.279 ± 2.06
0.0HisTrp: 0.0 ± 0.0
4.918HisTyr: 4.918 ± 3.09
0.0HisXaa: 0.0 ± 0.0
Ile
4.918IleAla: 4.918 ± 0.488
1.639IleCys: 1.639 ± 1.03
3.279IleAsp: 3.279 ± 0.542
1.639IleGlu: 1.639 ± 1.03
1.639IlePhe: 1.639 ± 1.572
3.279IleGly: 3.279 ± 3.144
1.639IleHis: 1.639 ± 1.03
4.918IleIle: 4.918 ± 3.09
4.918IleLys: 4.918 ± 2.114
6.557IleLeu: 6.557 ± 1.519
0.0IleMet: 0.0 ± 0.744
3.279IleAsn: 3.279 ± 3.144
3.279IlePro: 3.279 ± 3.144
0.0IleGln: 0.0 ± 0.0
4.918IleArg: 4.918 ± 0.488
3.279IleSer: 3.279 ± 3.144
1.639IleThr: 1.639 ± 1.572
4.918IleVal: 4.918 ± 0.488
3.279IleTrp: 3.279 ± 2.06
1.639IleTyr: 1.639 ± 1.572
0.0IleXaa: 0.0 ± 0.0
Lys
4.918LysAla: 4.918 ± 0.488
0.0LysCys: 0.0 ± 0.0
1.639LysAsp: 1.639 ± 1.572
1.639LysGlu: 1.639 ± 1.03
4.918LysPhe: 4.918 ± 2.114
6.557LysGly: 6.557 ± 1.519
0.0LysHis: 0.0 ± 0.0
1.639LysIle: 1.639 ± 1.572
3.279LysLys: 3.279 ± 0.542
0.0LysLeu: 0.0 ± 0.0
0.0LysMet: 0.0 ± 0.0
1.639LysAsn: 1.639 ± 1.03
1.639LysPro: 1.639 ± 1.03
1.639LysGln: 1.639 ± 1.03
4.918LysArg: 4.918 ± 4.716
4.918LysSer: 4.918 ± 3.09
8.197LysThr: 8.197 ± 0.053
1.639LysVal: 1.639 ± 1.03
1.639LysTrp: 1.639 ± 1.03
1.639LysTyr: 1.639 ± 1.03
0.0LysXaa: 0.0 ± 0.0
Leu
4.918LeuAla: 4.918 ± 3.09
1.639LeuCys: 1.639 ± 1.03
9.836LeuAsp: 9.836 ± 3.579
4.918LeuGlu: 4.918 ± 0.488
0.0LeuPhe: 0.0 ± 0.0
3.279LeuGly: 3.279 ± 2.06
1.639LeuHis: 1.639 ± 1.03
3.279LeuIle: 3.279 ± 2.06
0.0LeuLys: 0.0 ± 0.0
3.279LeuLeu: 3.279 ± 2.06
1.639LeuMet: 1.639 ± 1.572
3.279LeuAsn: 3.279 ± 2.06
4.918LeuPro: 4.918 ± 2.114
0.0LeuGln: 0.0 ± 0.0
8.197LeuArg: 8.197 ± 2.655
3.279LeuSer: 3.279 ± 3.144
4.918LeuThr: 4.918 ± 2.114
3.279LeuVal: 3.279 ± 2.06
0.0LeuTrp: 0.0 ± 0.0
1.639LeuTyr: 1.639 ± 1.572
0.0LeuXaa: 0.0 ± 0.0
Met
3.279MetAla: 3.279 ± 0.542
1.639MetCys: 1.639 ± 1.03
1.639MetAsp: 1.639 ± 1.03
0.0MetGlu: 0.0 ± 0.0
0.0MetPhe: 0.0 ± 0.0
1.639MetGly: 1.639 ± 1.572
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
0.0MetLys: 0.0 ± 0.0
0.0MetLeu: 0.0 ± 0.0
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
0.0MetPro: 0.0 ± 0.0
0.0MetGln: 0.0 ± 0.0
3.279MetArg: 3.279 ± 0.542
0.0MetSer: 0.0 ± 0.0
1.639MetThr: 1.639 ± 1.572
3.279MetVal: 3.279 ± 3.144
3.279MetTrp: 3.279 ± 0.542
1.639MetTyr: 1.639 ± 1.572
0.0MetXaa: 0.0 ± 0.0
Asn
4.918AsnAla: 4.918 ± 0.488
1.639AsnCys: 1.639 ± 1.03
1.639AsnAsp: 1.639 ± 1.03
0.0AsnGlu: 0.0 ± 0.0
3.279AsnPhe: 3.279 ± 0.542
1.639AsnGly: 1.639 ± 1.572
3.279AsnHis: 3.279 ± 2.06
1.639AsnIle: 1.639 ± 1.03
0.0AsnLys: 0.0 ± 0.0
3.279AsnLeu: 3.279 ± 3.144
0.0AsnMet: 0.0 ± 0.0
3.279AsnAsn: 3.279 ± 0.542
3.279AsnPro: 3.279 ± 0.542
1.639AsnGln: 1.639 ± 1.03
4.918AsnArg: 4.918 ± 3.09
3.279AsnSer: 3.279 ± 3.144
6.557AsnThr: 6.557 ± 3.685
0.0AsnVal: 0.0 ± 0.0
1.639AsnTrp: 1.639 ± 1.03
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
1.639ProAla: 1.639 ± 1.03
1.639ProCys: 1.639 ± 1.03
4.918ProAsp: 4.918 ± 0.488
3.279ProGlu: 3.279 ± 2.06
1.639ProPhe: 1.639 ± 1.03
3.279ProGly: 3.279 ± 3.144
0.0ProHis: 0.0 ± 0.0
0.0ProIle: 0.0 ± 0.0
4.918ProLys: 4.918 ± 0.488
1.639ProLeu: 1.639 ± 1.572
0.0ProMet: 0.0 ± 0.0
1.639ProAsn: 1.639 ± 1.03
4.918ProPro: 4.918 ± 3.09
0.0ProGln: 0.0 ± 0.0
4.918ProArg: 4.918 ± 0.488
1.639ProSer: 1.639 ± 1.03
8.197ProThr: 8.197 ± 0.053
0.0ProVal: 0.0 ± 0.0
0.0ProTrp: 0.0 ± 0.0
1.639ProTyr: 1.639 ± 1.03
0.0ProXaa: 0.0 ± 0.0
Gln
1.639GlnAla: 1.639 ± 1.03
0.0GlnCys: 0.0 ± 0.0
1.639GlnAsp: 1.639 ± 1.03
4.918GlnGlu: 4.918 ± 3.09
0.0GlnPhe: 0.0 ± 0.0
3.279GlnGly: 3.279 ± 3.144
1.639GlnHis: 1.639 ± 1.572
1.639GlnIle: 1.639 ± 1.572
0.0GlnLys: 0.0 ± 0.0
1.639GlnLeu: 1.639 ± 1.572
0.0GlnMet: 0.0 ± 0.0
1.639GlnAsn: 1.639 ± 1.03
1.639GlnPro: 1.639 ± 1.03
4.918GlnGln: 4.918 ± 4.716
4.918GlnArg: 4.918 ± 0.488
1.639GlnSer: 1.639 ± 1.572
1.639GlnThr: 1.639 ± 1.572
1.639GlnVal: 1.639 ± 1.572
0.0GlnTrp: 0.0 ± 0.0
1.639GlnTyr: 1.639 ± 1.03
0.0GlnXaa: 0.0 ± 0.0
Arg
3.279ArgAla: 3.279 ± 0.542
0.0ArgCys: 0.0 ± 0.0
4.918ArgAsp: 4.918 ± 0.488
3.279ArgGlu: 3.279 ± 0.542
3.279ArgPhe: 3.279 ± 0.542
6.557ArgGly: 6.557 ± 1.083
1.639ArgHis: 1.639 ± 1.03
1.639ArgIle: 1.639 ± 1.572
13.115ArgLys: 13.115 ± 2.167
6.557ArgLeu: 6.557 ± 1.519
1.639ArgMet: 1.639 ± 1.572
3.279ArgAsn: 3.279 ± 0.542
4.918ArgPro: 4.918 ± 3.09
0.0ArgGln: 0.0 ± 0.0
8.197ArgArg: 8.197 ± 2.655
3.279ArgSer: 3.279 ± 0.542
4.918ArgThr: 4.918 ± 3.09
3.279ArgVal: 3.279 ± 0.542
3.279ArgTrp: 3.279 ± 0.542
4.918ArgTyr: 4.918 ± 0.488
0.0ArgXaa: 0.0 ± 0.0
Ser
4.918SerAla: 4.918 ± 3.09
0.0SerCys: 0.0 ± 0.0
8.197SerAsp: 8.197 ± 2.655
4.918SerGlu: 4.918 ± 0.488
4.918SerPhe: 4.918 ± 4.716
4.918SerGly: 4.918 ± 0.488
0.0SerHis: 0.0 ± 0.0
4.918SerIle: 4.918 ± 2.114
1.639SerLys: 1.639 ± 1.572
3.279SerLeu: 3.279 ± 0.542
1.639SerMet: 1.639 ± 1.572
4.918SerAsn: 4.918 ± 2.114
0.0SerPro: 0.0 ± 0.0
4.918SerGln: 4.918 ± 2.114
0.0SerArg: 0.0 ± 0.0
1.639SerSer: 1.639 ± 1.572
8.197SerThr: 8.197 ± 2.655
1.639SerVal: 1.639 ± 1.03
3.279SerTrp: 3.279 ± 2.06
1.639SerTyr: 1.639 ± 1.03
0.0SerXaa: 0.0 ± 0.0
Thr
6.557ThrAla: 6.557 ± 3.685
1.639ThrCys: 1.639 ± 1.03
4.918ThrAsp: 4.918 ± 2.114
3.279ThrGlu: 3.279 ± 2.06
0.0ThrPhe: 0.0 ± 0.0
3.279ThrGly: 3.279 ± 0.542
1.639ThrHis: 1.639 ± 1.03
8.197ThrIle: 8.197 ± 2.655
3.279ThrLys: 3.279 ± 0.542
1.639ThrLeu: 1.639 ± 1.03
1.639ThrMet: 1.639 ± 1.572
4.918ThrAsn: 4.918 ± 2.114
3.279ThrPro: 3.279 ± 0.542
1.639ThrGln: 1.639 ± 1.572
6.557ThrArg: 6.557 ± 1.519
3.279ThrSer: 3.279 ± 0.542
4.918ThrThr: 4.918 ± 0.488
4.918ThrVal: 4.918 ± 4.716
1.639ThrTrp: 1.639 ± 1.572
1.639ThrTyr: 1.639 ± 1.03
0.0ThrXaa: 0.0 ± 0.0
Val
3.279ValAla: 3.279 ± 2.06
0.0ValCys: 0.0 ± 0.0
0.0ValAsp: 0.0 ± 0.0
3.279ValGlu: 3.279 ± 0.542
6.557ValPhe: 6.557 ± 1.519
1.639ValGly: 1.639 ± 1.03
1.639ValHis: 1.639 ± 1.03
1.639ValIle: 1.639 ± 1.572
1.639ValLys: 1.639 ± 1.03
8.197ValLeu: 8.197 ± 0.053
1.639ValMet: 1.639 ± 1.03
8.197ValAsn: 8.197 ± 0.053
3.279ValPro: 3.279 ± 2.06
1.639ValGln: 1.639 ± 1.03
3.279ValArg: 3.279 ± 0.542
6.557ValSer: 6.557 ± 3.685
1.639ValThr: 1.639 ± 1.572
4.918ValVal: 4.918 ± 0.488
1.639ValTrp: 1.639 ± 1.03
4.918ValTyr: 4.918 ± 2.114
0.0ValXaa: 0.0 ± 0.0
Trp
1.639TrpAla: 1.639 ± 1.03
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
4.918TrpGly: 4.918 ± 3.09
1.639TrpHis: 1.639 ± 1.03
1.639TrpIle: 1.639 ± 1.03
0.0TrpLys: 0.0 ± 0.0
1.639TrpLeu: 1.639 ± 1.03
0.0TrpMet: 0.0 ± 0.0
1.639TrpAsn: 1.639 ± 1.572
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
1.639TrpArg: 1.639 ± 1.03
3.279TrpSer: 3.279 ± 3.144
0.0TrpThr: 0.0 ± 0.0
6.557TrpVal: 6.557 ± 1.519
0.0TrpTrp: 0.0 ± 0.0
1.639TrpTyr: 1.639 ± 1.03
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.279TyrAla: 3.279 ± 0.542
0.0TyrCys: 0.0 ± 0.0
0.0TyrAsp: 0.0 ± 0.0
0.0TyrGlu: 0.0 ± 0.0
4.918TyrPhe: 4.918 ± 2.114
0.0TyrGly: 0.0 ± 0.0
3.279TyrHis: 3.279 ± 0.542
4.918TyrIle: 4.918 ± 0.488
3.279TyrLys: 3.279 ± 2.06
1.639TyrLeu: 1.639 ± 1.03
3.279TyrMet: 3.279 ± 0.542
0.0TyrAsn: 0.0 ± 0.0
1.639TyrPro: 1.639 ± 1.03
0.0TyrGln: 0.0 ± 0.0
3.279TyrArg: 3.279 ± 3.144
4.918TyrSer: 4.918 ± 3.09
0.0TyrThr: 0.0 ± 0.0
1.639TyrVal: 1.639 ± 1.572
1.639TyrTrp: 1.639 ± 1.572
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (611 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski