Amino acid dipepetide frequency for Sewage-associated circular DNA virus-18

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.836AlaAla: 9.836 ± 8.799
0.0AlaCys: 0.0 ± 0.0
1.639AlaAsp: 1.639 ± 1.466
4.918AlaGlu: 4.918 ± 1.932
0.0AlaPhe: 0.0 ± 0.0
1.639AlaGly: 1.639 ± 1.466
0.0AlaHis: 0.0 ± 0.0
1.639AlaIle: 1.639 ± 1.001
3.279AlaLys: 3.279 ± 2.002
3.279AlaLeu: 3.279 ± 2.933
1.639AlaMet: 1.639 ± 1.466
4.918AlaAsn: 4.918 ± 1.932
4.918AlaPro: 4.918 ± 3.004
3.279AlaGln: 3.279 ± 2.933
4.918AlaArg: 4.918 ± 0.536
4.918AlaSer: 4.918 ± 4.399
6.557AlaThr: 6.557 ± 0.93
8.197AlaVal: 8.197 ± 2.397
0.0AlaTrp: 0.0 ± 0.0
1.639AlaTyr: 1.639 ± 1.466
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
1.639CysAsp: 1.639 ± 1.001
3.279CysGlu: 3.279 ± 2.002
1.639CysPhe: 1.639 ± 1.001
0.0CysGly: 0.0 ± 0.0
1.639CysHis: 1.639 ± 1.001
0.0CysIle: 0.0 ± 0.0
0.0CysLys: 0.0 ± 0.0
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
1.639CysAsn: 1.639 ± 1.466
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
1.639CysSer: 1.639 ± 1.001
1.639CysThr: 1.639 ± 1.001
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
1.639CysTyr: 1.639 ± 1.001
0.0CysXaa: 0.0 ± 0.0
Asp
3.279AspAla: 3.279 ± 0.465
1.639AspCys: 1.639 ± 1.001
3.279AspAsp: 3.279 ± 2.002
1.639AspGlu: 1.639 ± 1.001
8.197AspPhe: 8.197 ± 4.865
4.918AspGly: 4.918 ± 3.004
0.0AspHis: 0.0 ± 0.0
1.639AspIle: 1.639 ± 1.001
0.0AspLys: 0.0 ± 0.0
4.918AspLeu: 4.918 ± 3.004
1.639AspMet: 1.639 ± 1.466
1.639AspAsn: 1.639 ± 1.001
3.279AspPro: 3.279 ± 0.465
0.0AspGln: 0.0 ± 0.0
3.279AspArg: 3.279 ± 0.465
3.279AspSer: 3.279 ± 0.465
1.639AspThr: 1.639 ± 1.001
6.557AspVal: 6.557 ± 1.537
1.639AspTrp: 1.639 ± 1.001
1.639AspTyr: 1.639 ± 1.001
0.0AspXaa: 0.0 ± 0.0
Glu
1.639GluAla: 1.639 ± 1.001
0.0GluCys: 0.0 ± 0.0
1.639GluAsp: 1.639 ± 1.001
6.557GluGlu: 6.557 ± 4.005
3.279GluPhe: 3.279 ± 2.002
1.639GluGly: 1.639 ± 1.001
0.0GluHis: 0.0 ± 0.0
4.918GluIle: 4.918 ± 3.004
1.639GluLys: 1.639 ± 1.001
1.639GluLeu: 1.639 ± 1.466
0.0GluMet: 0.0 ± 0.0
1.639GluAsn: 1.639 ± 1.466
1.639GluPro: 1.639 ± 1.001
3.279GluGln: 3.279 ± 2.002
3.279GluArg: 3.279 ± 2.002
1.639GluSer: 1.639 ± 1.001
4.918GluThr: 4.918 ± 0.536
1.639GluVal: 1.639 ± 1.001
3.279GluTrp: 3.279 ± 2.002
1.639GluTyr: 1.639 ± 1.001
0.0GluXaa: 0.0 ± 0.0
Phe
0.0PheAla: 0.0 ± 0.0
0.0PheCys: 0.0 ± 0.0
1.639PheAsp: 1.639 ± 1.001
0.0PheGlu: 0.0 ± 0.0
3.279PhePhe: 3.279 ± 2.933
1.639PheGly: 1.639 ± 1.001
0.0PheHis: 0.0 ± 0.0
4.918PheIle: 4.918 ± 3.004
4.918PheLys: 4.918 ± 3.004
4.918PheLeu: 4.918 ± 0.536
0.0PheMet: 0.0 ± 0.0
8.197PheAsn: 8.197 ± 4.865
6.557PhePro: 6.557 ± 0.93
0.0PheGln: 0.0 ± 0.0
1.639PheArg: 1.639 ± 1.001
3.279PheSer: 3.279 ± 2.933
1.639PheThr: 1.639 ± 1.001
3.279PheVal: 3.279 ± 2.933
0.0PheTrp: 0.0 ± 0.0
6.557PheTyr: 6.557 ± 0.93
0.0PheXaa: 0.0 ± 0.0
Gly
6.557GlyAla: 6.557 ± 5.866
0.0GlyCys: 0.0 ± 0.0
4.918GlyAsp: 4.918 ± 0.536
0.0GlyGlu: 0.0 ± 0.0
3.279GlyPhe: 3.279 ± 0.465
3.279GlyGly: 3.279 ± 0.465
0.0GlyHis: 0.0 ± 0.0
1.639GlyIle: 1.639 ± 1.466
6.557GlyLys: 6.557 ± 4.005
3.279GlyLeu: 3.279 ± 2.002
1.639GlyMet: 1.639 ± 1.466
3.279GlyAsn: 3.279 ± 0.465
4.918GlyPro: 4.918 ± 1.932
4.918GlyGln: 4.918 ± 0.536
6.557GlyArg: 6.557 ± 3.398
13.115GlySer: 13.115 ± 4.329
9.836GlyThr: 9.836 ± 6.331
3.279GlyVal: 3.279 ± 2.933
0.0GlyTrp: 0.0 ± 0.0
3.279GlyTyr: 3.279 ± 2.002
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
1.639HisCys: 1.639 ± 1.001
0.0HisAsp: 0.0 ± 0.0
0.0HisGlu: 0.0 ± 0.0
0.0HisPhe: 0.0 ± 0.0
0.0HisGly: 0.0 ± 0.0
1.639HisHis: 1.639 ± 1.001
1.639HisIle: 1.639 ± 1.001
1.639HisLys: 1.639 ± 1.001
1.639HisLeu: 1.639 ± 1.001
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
0.0HisPro: 0.0 ± 0.0
0.0HisGln: 0.0 ± 0.0
0.0HisArg: 0.0 ± 0.0
0.0HisSer: 0.0 ± 0.0
0.0HisThr: 0.0 ± 0.0
1.639HisVal: 1.639 ± 1.001
0.0HisTrp: 0.0 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
4.918IleAla: 4.918 ± 1.932
3.279IleCys: 3.279 ± 2.002
4.918IleAsp: 4.918 ± 1.932
6.557IleGlu: 6.557 ± 4.005
3.279IlePhe: 3.279 ± 2.933
4.918IleGly: 4.918 ± 1.932
0.0IleHis: 0.0 ± 0.0
1.639IleIle: 1.639 ± 1.001
3.279IleLys: 3.279 ± 2.002
6.557IleLeu: 6.557 ± 1.537
0.0IleMet: 0.0 ± 0.0
4.918IleAsn: 4.918 ± 0.536
1.639IlePro: 1.639 ± 1.001
1.639IleGln: 1.639 ± 1.001
3.279IleArg: 3.279 ± 2.002
1.639IleSer: 1.639 ± 1.466
3.279IleThr: 3.279 ± 2.002
1.639IleVal: 1.639 ± 1.466
0.0IleTrp: 0.0 ± 0.0
0.0IleTyr: 0.0 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
6.557LysAla: 6.557 ± 1.537
3.279LysCys: 3.279 ± 0.465
3.279LysAsp: 3.279 ± 2.002
3.279LysGlu: 3.279 ± 0.465
4.918LysPhe: 4.918 ± 0.536
11.475LysGly: 11.475 ± 4.541
0.0LysHis: 0.0 ± 0.0
1.639LysIle: 1.639 ± 1.466
11.475LysLys: 11.475 ± 4.541
3.279LysLeu: 3.279 ± 2.002
1.639LysMet: 1.639 ± 1.001
6.557LysAsn: 6.557 ± 4.005
0.0LysPro: 0.0 ± 0.0
4.918LysGln: 4.918 ± 0.536
3.279LysArg: 3.279 ± 0.465
8.197LysSer: 8.197 ± 2.538
4.918LysThr: 4.918 ± 3.004
0.0LysVal: 0.0 ± 0.0
0.0LysTrp: 0.0 ± 0.0
1.639LysTyr: 1.639 ± 1.466
0.0LysXaa: 0.0 ± 0.0
Leu
3.279LeuAla: 3.279 ± 0.465
0.0LeuCys: 0.0 ± 0.0
4.918LeuAsp: 4.918 ± 3.004
6.557LeuGlu: 6.557 ± 4.005
6.557LeuPhe: 6.557 ± 0.93
8.197LeuGly: 8.197 ± 0.071
1.639LeuHis: 1.639 ± 1.001
6.557LeuIle: 6.557 ± 0.93
1.639LeuLys: 1.639 ± 1.466
1.639LeuLeu: 1.639 ± 1.001
1.639LeuMet: 1.639 ± 1.609
4.918LeuAsn: 4.918 ± 1.932
1.639LeuPro: 1.639 ± 1.001
3.279LeuGln: 3.279 ± 2.002
3.279LeuArg: 3.279 ± 2.933
8.197LeuSer: 8.197 ± 5.006
4.918LeuThr: 4.918 ± 0.536
3.279LeuVal: 3.279 ± 2.002
0.0LeuTrp: 0.0 ± 0.0
3.279LeuTyr: 3.279 ± 2.002
0.0LeuXaa: 0.0 ± 0.0
Met
0.0MetAla: 0.0 ± 0.0
0.0MetCys: 0.0 ± 0.0
1.639MetAsp: 1.639 ± 1.001
1.639MetGlu: 1.639 ± 1.001
0.0MetPhe: 0.0 ± 0.0
0.0MetGly: 0.0 ± 0.0
0.0MetHis: 0.0 ± 0.0
3.279MetIle: 3.279 ± 0.465
4.918MetLys: 4.918 ± 0.536
1.639MetLeu: 1.639 ± 1.001
1.639MetMet: 1.639 ± 0.932
3.279MetAsn: 3.279 ± 2.002
0.0MetPro: 0.0 ± 0.0
0.0MetGln: 0.0 ± 0.0
1.639MetArg: 1.639 ± 1.466
1.639MetSer: 1.639 ± 1.466
0.0MetThr: 0.0 ± 0.0
0.0MetVal: 0.0 ± 0.0
0.0MetTrp: 0.0 ± 0.0
1.639MetTyr: 1.639 ± 1.466
0.0MetXaa: 0.0 ± 0.0
Asn
8.197AsnAla: 8.197 ± 4.865
0.0AsnCys: 0.0 ± 0.0
1.639AsnAsp: 1.639 ± 1.466
0.0AsnGlu: 0.0 ± 0.0
6.557AsnPhe: 6.557 ± 1.537
6.557AsnGly: 6.557 ± 5.866
0.0AsnHis: 0.0 ± 0.0
3.279AsnIle: 3.279 ± 2.002
9.836AsnLys: 9.836 ± 1.072
6.557AsnLeu: 6.557 ± 1.537
0.0AsnMet: 0.0 ± 0.0
1.639AsnAsn: 1.639 ± 1.466
0.0AsnPro: 0.0 ± 0.0
3.279AsnGln: 3.279 ± 0.465
1.639AsnArg: 1.639 ± 1.466
1.639AsnSer: 1.639 ± 1.466
1.639AsnThr: 1.639 ± 1.466
0.0AsnVal: 0.0 ± 0.0
0.0AsnTrp: 0.0 ± 0.0
6.557AsnTyr: 6.557 ± 1.537
0.0AsnXaa: 0.0 ± 0.0
Pro
3.279ProAla: 3.279 ± 2.933
0.0ProCys: 0.0 ± 0.0
0.0ProAsp: 0.0 ± 0.0
1.639ProGlu: 1.639 ± 1.001
0.0ProPhe: 0.0 ± 0.0
4.918ProGly: 4.918 ± 1.932
1.639ProHis: 1.639 ± 1.001
0.0ProIle: 0.0 ± 0.0
4.918ProLys: 4.918 ± 3.004
3.279ProLeu: 3.279 ± 0.465
1.639ProMet: 1.639 ± 1.001
4.918ProAsn: 4.918 ± 1.932
3.279ProPro: 3.279 ± 0.465
6.557ProGln: 6.557 ± 1.537
1.639ProArg: 1.639 ± 1.001
0.0ProSer: 0.0 ± 0.0
4.918ProThr: 4.918 ± 0.536
1.639ProVal: 1.639 ± 1.001
0.0ProTrp: 0.0 ± 0.0
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
3.279GlnAla: 3.279 ± 2.002
1.639GlnCys: 1.639 ± 1.001
0.0GlnAsp: 0.0 ± 0.0
0.0GlnGlu: 0.0 ± 0.0
0.0GlnPhe: 0.0 ± 0.0
8.197GlnGly: 8.197 ± 2.397
1.639GlnHis: 1.639 ± 1.001
3.279GlnIle: 3.279 ± 0.465
3.279GlnLys: 3.279 ± 2.002
3.279GlnLeu: 3.279 ± 2.002
1.639GlnMet: 1.639 ± 1.466
1.639GlnAsn: 1.639 ± 1.466
3.279GlnPro: 3.279 ± 0.465
0.0GlnGln: 0.0 ± 0.0
1.639GlnArg: 1.639 ± 1.466
0.0GlnSer: 0.0 ± 0.0
1.639GlnThr: 1.639 ± 1.001
1.639GlnVal: 1.639 ± 1.001
1.639GlnTrp: 1.639 ± 1.001
4.918GlnTyr: 4.918 ± 0.536
0.0GlnXaa: 0.0 ± 0.0
Arg
4.918ArgAla: 4.918 ± 4.399
0.0ArgCys: 0.0 ± 0.0
4.918ArgAsp: 4.918 ± 0.536
0.0ArgGlu: 0.0 ± 0.0
1.639ArgPhe: 1.639 ± 1.466
4.918ArgGly: 4.918 ± 4.399
0.0ArgHis: 0.0 ± 0.0
3.279ArgIle: 3.279 ± 0.465
9.836ArgLys: 9.836 ± 1.396
4.918ArgLeu: 4.918 ± 3.004
0.0ArgMet: 0.0 ± 0.0
3.279ArgAsn: 3.279 ± 0.465
0.0ArgPro: 0.0 ± 0.0
1.639ArgGln: 1.639 ± 1.001
0.0ArgArg: 0.0 ± 0.0
3.279ArgSer: 3.279 ± 0.465
0.0ArgThr: 0.0 ± 0.0
3.279ArgVal: 3.279 ± 2.933
0.0ArgTrp: 0.0 ± 0.0
1.639ArgTyr: 1.639 ± 1.466
0.0ArgXaa: 0.0 ± 0.0
Ser
0.0SerAla: 0.0 ± 0.0
0.0SerCys: 0.0 ± 0.0
1.639SerAsp: 1.639 ± 1.001
3.279SerGlu: 3.279 ± 2.002
3.279SerPhe: 3.279 ± 2.002
6.557SerGly: 6.557 ± 3.398
0.0SerHis: 0.0 ± 0.0
3.279SerIle: 3.279 ± 2.933
1.639SerLys: 1.639 ± 1.001
6.557SerLeu: 6.557 ± 1.537
1.639SerMet: 1.639 ± 1.466
3.279SerAsn: 3.279 ± 2.002
3.279SerPro: 3.279 ± 0.465
4.918SerGln: 4.918 ± 1.932
4.918SerArg: 4.918 ± 4.399
4.918SerSer: 4.918 ± 4.399
4.918SerThr: 4.918 ± 1.932
3.279SerVal: 3.279 ± 0.465
1.639SerTrp: 1.639 ± 1.001
3.279SerTyr: 3.279 ± 2.933
0.0SerXaa: 0.0 ± 0.0
Thr
8.197ThrAla: 8.197 ± 2.397
0.0ThrCys: 0.0 ± 0.0
6.557ThrAsp: 6.557 ± 0.93
3.279ThrGlu: 3.279 ± 2.002
4.918ThrPhe: 4.918 ± 0.536
3.279ThrGly: 3.279 ± 0.465
0.0ThrHis: 0.0 ± 0.0
6.557ThrIle: 6.557 ± 1.537
4.918ThrLys: 4.918 ± 0.536
6.557ThrLeu: 6.557 ± 1.537
1.639ThrMet: 1.639 ± 1.001
1.639ThrAsn: 1.639 ± 1.466
6.557ThrPro: 6.557 ± 0.93
0.0ThrGln: 0.0 ± 0.0
1.639ThrArg: 1.639 ± 1.466
3.279ThrSer: 3.279 ± 0.465
6.557ThrThr: 6.557 ± 3.398
1.639ThrVal: 1.639 ± 1.001
3.279ThrTrp: 3.279 ± 0.465
1.639ThrTyr: 1.639 ± 1.001
0.0ThrXaa: 0.0 ± 0.0
Val
3.279ValAla: 3.279 ± 2.002
1.639ValCys: 1.639 ± 1.001
4.918ValAsp: 4.918 ± 0.536
1.639ValGlu: 1.639 ± 1.001
1.639ValPhe: 1.639 ± 1.001
4.918ValGly: 4.918 ± 4.399
0.0ValHis: 0.0 ± 0.0
4.918ValIle: 4.918 ± 0.536
1.639ValLys: 1.639 ± 1.001
3.279ValLeu: 3.279 ± 2.933
1.639ValMet: 1.639 ± 1.001
0.0ValAsn: 0.0 ± 0.0
1.639ValPro: 1.639 ± 1.001
1.639ValGln: 1.639 ± 1.001
4.918ValArg: 4.918 ± 1.932
0.0ValSer: 0.0 ± 0.0
6.557ValThr: 6.557 ± 1.537
3.279ValVal: 3.279 ± 0.465
0.0ValTrp: 0.0 ± 0.0
4.918ValTyr: 4.918 ± 1.932
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
1.639TrpIle: 1.639 ± 1.001
0.0TrpLys: 0.0 ± 0.0
1.639TrpLeu: 1.639 ± 1.001
1.639TrpMet: 1.639 ± 1.001
1.639TrpAsn: 1.639 ± 1.466
0.0TrpPro: 0.0 ± 0.0
1.639TrpGln: 1.639 ± 1.466
0.0TrpArg: 0.0 ± 0.0
0.0TrpSer: 0.0 ± 0.0
0.0TrpThr: 0.0 ± 0.0
4.918TrpVal: 4.918 ± 3.004
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.0TyrAla: 0.0 ± 0.0
1.639TyrCys: 1.639 ± 1.001
4.918TyrAsp: 4.918 ± 0.536
1.639TyrGlu: 1.639 ± 1.001
0.0TyrPhe: 0.0 ± 0.0
3.279TyrGly: 3.279 ± 2.933
1.639TyrHis: 1.639 ± 1.001
1.639TyrIle: 1.639 ± 1.466
4.918TyrLys: 4.918 ± 1.932
8.197TyrLeu: 8.197 ± 0.071
1.639TyrMet: 1.639 ± 1.001
0.0TyrAsn: 0.0 ± 0.0
1.639TyrPro: 1.639 ± 1.001
1.639TyrGln: 1.639 ± 1.001
0.0TyrArg: 0.0 ± 0.0
1.639TyrSer: 1.639 ± 1.466
6.557TyrThr: 6.557 ± 0.93
3.279TyrVal: 3.279 ± 2.002
1.639TyrTrp: 1.639 ± 1.466
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (611 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski