Amino acid dipeptide frequency for Human feces pecovirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
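The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used by Proteome-pI: the function name `dipeptide_per_mille`, the one-letter alphabet with `X` standing for Xaa, and the toy sequences are all assumptions made for the example. It also assumes that dipeptides are counted as overlapping windows that never span two proteins, which the page does not state.

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code; "X" stands in for Xaa (assumption).
ALPHABET = "ACDEFGHIKLMNPQRSTVWYX"

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all given sequences."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard letters to X (Xaa).
        seq = "".join(c if c in ALPHABET else "X" for c in seq.upper())
        # Overlapping windows of length 2, counted within each protein only.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    pairs = [a + b for a, b in product(ALPHABET, repeat=2)]  # 441 pairs
    if total == 0:
        return {p: 0.0 for p in pairs}
    return {p: 1000.0 * counts[p] / total for p in pairs}

# Toy sequences, purely illustrative (not the pecovirus proteins).
toy = ["MRRAG", "GRRY"]
freqs = dipeptide_per_mille(toy)

# Uniform expectation if all 400 standard pairs were equally likely: 2.5 per mille.
uniform = 1000.0 / 400
print(freqs["RR"], freqs["RR"] > uniform)
```

In the toy input, RR occurs twice among seven dipeptides, so it lies far above the uniform expectation; a real proteome would be compared against the same 2.5‰ baseline.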

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.825 ± 0.199
AlaCys: 1.412 ± 0.92
AlaAsp: 8.475 ± 2.636
AlaGlu: 2.825 ± 0.199
AlaPhe: 1.412 ± 0.92
AlaGly: 8.475 ± 0.596
AlaHis: 1.412 ± 1.119
AlaIle: 0.0 ± 0.0
AlaLys: 5.65 ± 0.397
AlaLeu: 7.062 ± 4.601
AlaMet: 1.412 ± 1.119
AlaAsn: 1.412 ± 1.119
AlaPro: 2.825 ± 1.841
AlaGln: 1.412 ± 0.92
AlaArg: 1.412 ± 0.92
AlaSer: 2.825 ± 1.841
AlaThr: 1.412 ± 0.92
AlaVal: 2.825 ± 0.199
AlaTrp: 1.412 ± 0.92
AlaTyr: 1.412 ± 1.119
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 1.412 ± 0.92
CysPhe: 0.0 ± 0.0
CysGly: 1.412 ± 0.92
CysHis: 0.0 ± 0.0
CysIle: 2.825 ± 1.841
CysLys: 2.825 ± 2.238
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 1.412 ± 0.92
CysPro: 2.825 ± 2.238
CysGln: 0.0 ± 0.0
CysArg: 2.825 ± 0.199
CysSer: 2.825 ± 1.841
CysThr: 0.0 ± 0.0
CysVal: 2.825 ± 2.238
CysTrp: 1.412 ± 0.92
CysTyr: 1.412 ± 1.119
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.237 ± 1.318
AspCys: 1.412 ± 1.119
AspAsp: 1.412 ± 1.119
AspGlu: 5.65 ± 2.437
AspPhe: 2.825 ± 0.199
AspGly: 7.062 ± 3.556
AspHis: 0.0 ± 0.0
AspIle: 2.825 ± 0.199
AspLys: 1.412 ± 0.92
AspLeu: 5.65 ± 1.642
AspMet: 2.825 ± 0.199
AspAsn: 4.237 ± 0.722
AspPro: 2.825 ± 0.199
AspGln: 1.412 ± 1.119
AspArg: 1.412 ± 1.119
AspSer: 2.825 ± 0.199
AspThr: 5.65 ± 1.642
AspVal: 4.237 ± 0.722
AspTrp: 1.412 ± 0.92
AspTyr: 1.412 ± 1.119
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.825 ± 0.199
GluCys: 0.0 ± 0.0
GluAsp: 1.412 ± 1.119
GluGlu: 2.825 ± 0.199
GluPhe: 1.412 ± 1.119
GluGly: 2.825 ± 0.199
GluHis: 0.0 ± 0.0
GluIle: 0.0 ± 0.0
GluLys: 1.412 ± 1.119
GluLeu: 2.825 ± 2.238
GluMet: 0.0 ± 0.0
GluAsn: 0.0 ± 0.0
GluPro: 4.237 ± 3.357
GluGln: 2.825 ± 1.841
GluArg: 4.237 ± 1.318
GluSer: 5.65 ± 1.642
GluThr: 2.825 ± 2.238
GluVal: 1.412 ± 1.119
GluTrp: 5.65 ± 2.437
GluTyr: 5.65 ± 0.397
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.825 ± 0.199
PheCys: 0.0 ± 0.0
PheAsp: 2.825 ± 2.238
PheGlu: 0.0 ± 0.0
PhePhe: 0.0 ± 0.0
PheGly: 5.65 ± 0.397
PheHis: 0.0 ± 0.0
PheIle: 0.0 ± 0.0
PheLys: 0.0 ± 0.0
PheLeu: 1.412 ± 1.119
PheMet: 0.0 ± 0.0
PheAsn: 5.65 ± 1.642
PhePro: 2.825 ± 0.199
PheGln: 0.0 ± 0.0
PheArg: 1.412 ± 0.92
PheSer: 2.825 ± 0.199
PheThr: 5.65 ± 0.397
PheVal: 1.412 ± 1.119
PheTrp: 0.0 ± 0.0
PheTyr: 1.412 ± 0.92
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 0.0 ± 0.0
GlyCys: 0.0 ± 0.0
GlyAsp: 4.237 ± 1.318
GlyGlu: 7.062 ± 3.556
GlyPhe: 4.237 ± 0.722
GlyGly: 8.475 ± 3.482
GlyHis: 0.0 ± 0.0
GlyIle: 4.237 ± 0.722
GlyLys: 5.65 ± 4.476
GlyLeu: 0.0 ± 0.0
GlyMet: 2.825 ± 1.841
GlyAsn: 2.825 ± 1.841
GlyPro: 5.65 ± 0.397
GlyGln: 1.412 ± 1.119
GlyArg: 9.887 ± 0.324
GlySer: 7.062 ± 3.556
GlyThr: 7.062 ± 2.562
GlyVal: 2.825 ± 1.841
GlyTrp: 2.825 ± 0.199
GlyTyr: 7.062 ± 1.517
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.825 ± 0.199
HisCys: 1.412 ± 1.119
HisAsp: 1.412 ± 1.119
HisGlu: 0.0 ± 0.0
HisPhe: 1.412 ± 1.119
HisGly: 0.0 ± 0.0
HisHis: 0.0 ± 0.0
HisIle: 1.412 ± 0.92
HisLys: 0.0 ± 0.0
HisLeu: 2.825 ± 1.841
HisMet: 2.825 ± 0.821
HisAsn: 0.0 ± 0.0
HisPro: 1.412 ± 0.92
HisGln: 0.0 ± 0.0
HisArg: 1.412 ± 0.92
HisSer: 0.0 ± 0.0
HisThr: 0.0 ± 0.0
HisVal: 0.0 ± 0.0
HisTrp: 2.825 ± 2.238
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 8.475 ± 1.443
IleCys: 0.0 ± 0.0
IleAsp: 7.062 ± 0.523
IleGlu: 1.412 ± 0.92
IlePhe: 1.412 ± 1.119
IleGly: 1.412 ± 0.92
IleHis: 0.0 ± 0.0
IleIle: 2.825 ± 0.199
IleLys: 2.825 ± 2.238
IleLeu: 1.412 ± 0.92
IleMet: 0.0 ± 0.0
IleAsn: 1.412 ± 0.92
IlePro: 0.0 ± 0.0
IleGln: 0.0 ± 0.0
IleArg: 4.237 ± 2.761
IleSer: 1.412 ± 1.119
IleThr: 1.412 ± 1.119
IleVal: 2.825 ± 2.238
IleTrp: 2.825 ± 0.199
IleTyr: 2.825 ± 0.199
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.412 ± 0.92
LysCys: 0.0 ± 0.0
LysAsp: 4.237 ± 3.357
LysGlu: 2.825 ± 2.238
LysPhe: 0.0 ± 0.0
LysGly: 1.412 ± 1.119
LysHis: 0.0 ± 0.0
LysIle: 2.825 ± 2.238
LysLys: 8.475 ± 6.714
LysLeu: 2.825 ± 0.199
LysMet: 1.412 ± 0.92
LysAsn: 2.825 ± 0.199
LysPro: 2.825 ± 1.841
LysGln: 2.825 ± 2.238
LysArg: 1.412 ± 1.119
LysSer: 2.825 ± 2.238
LysThr: 5.65 ± 0.397
LysVal: 1.412 ± 1.119
LysTrp: 0.0 ± 0.0
LysTyr: 4.237 ± 1.318
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.237 ± 2.761
LeuCys: 2.825 ± 1.841
LeuAsp: 1.412 ± 0.92
LeuGlu: 2.825 ± 0.199
LeuPhe: 1.412 ± 1.119
LeuGly: 2.825 ± 0.199
LeuHis: 0.0 ± 0.0
LeuIle: 2.825 ± 0.199
LeuLys: 1.412 ± 1.119
LeuLeu: 4.237 ± 2.761
LeuMet: 0.0 ± 0.0
LeuAsn: 1.412 ± 0.92
LeuPro: 9.887 ± 4.403
LeuGln: 2.825 ± 0.199
LeuArg: 5.65 ± 3.681
LeuSer: 2.825 ± 1.841
LeuThr: 4.237 ± 2.761
LeuVal: 5.65 ± 0.397
LeuTrp: 1.412 ± 0.92
LeuTyr: 2.825 ± 0.199
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.825 ± 0.199
MetCys: 0.0 ± 0.0
MetAsp: 0.0 ± 0.0
MetGlu: 1.412 ± 1.119
MetPhe: 0.0 ± 0.0
MetGly: 1.412 ± 1.119
MetHis: 0.0 ± 0.0
MetIle: 1.412 ± 0.92
MetLys: 0.0 ± 0.0
MetLeu: 1.412 ± 0.92
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 1.412 ± 1.119
MetGln: 1.412 ± 1.119
MetArg: 4.237 ± 1.318
MetSer: 4.237 ± 1.318
MetThr: 2.825 ± 1.841
MetVal: 2.825 ± 0.199
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.412 ± 1.119
AsnCys: 1.412 ± 0.92
AsnAsp: 1.412 ± 0.92
AsnGlu: 1.412 ± 0.92
AsnPhe: 2.825 ± 1.841
AsnGly: 5.65 ± 1.642
AsnHis: 0.0 ± 0.0
AsnIle: 1.412 ± 1.119
AsnLys: 4.237 ± 0.722
AsnLeu: 1.412 ± 0.92
AsnMet: 0.0 ± 0.0
AsnAsn: 2.825 ± 1.841
AsnPro: 1.412 ± 0.92
AsnGln: 2.825 ± 1.841
AsnArg: 1.412 ± 0.92
AsnSer: 2.825 ± 0.199
AsnThr: 0.0 ± 0.0
AsnVal: 4.237 ± 2.761
AsnTrp: 2.825 ± 0.199
AsnTyr: 0.0 ± 0.0
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.412 ± 1.119
ProCys: 1.412 ± 0.92
ProAsp: 5.65 ± 1.642
ProGlu: 5.65 ± 0.397
ProPhe: 0.0 ± 0.0
ProGly: 2.825 ± 0.199
ProHis: 1.412 ± 1.119
ProIle: 1.412 ± 0.92
ProLys: 2.825 ± 0.199
ProLeu: 1.412 ± 0.92
ProMet: 0.0 ± 0.747
ProAsn: 0.0 ± 0.0
ProPro: 1.412 ± 0.92
ProGln: 4.237 ± 2.761
ProArg: 4.237 ± 0.722
ProSer: 4.237 ± 0.722
ProThr: 0.0 ± 0.0
ProVal: 7.062 ± 2.562
ProTrp: 0.0 ± 0.0
ProTyr: 2.825 ± 0.199
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.237 ± 0.722
GlnCys: 1.412 ± 0.92
GlnAsp: 0.0 ± 0.0
GlnGlu: 1.412 ± 1.119
GlnPhe: 0.0 ± 0.0
GlnGly: 1.412 ± 1.119
GlnHis: 0.0 ± 0.0
GlnIle: 7.062 ± 3.556
GlnLys: 0.0 ± 0.0
GlnLeu: 4.237 ± 0.722
GlnMet: 1.412 ± 1.119
GlnAsn: 1.412 ± 0.92
GlnPro: 0.0 ± 0.0
GlnGln: 1.412 ± 1.119
GlnArg: 5.65 ± 0.397
GlnSer: 1.412 ± 0.92
GlnThr: 1.412 ± 0.92
GlnVal: 0.0 ± 0.0
GlnTrp: 5.65 ± 0.397
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.65 ± 0.397
ArgCys: 2.825 ± 1.841
ArgAsp: 1.412 ± 0.92
ArgGlu: 1.412 ± 1.119
ArgPhe: 5.65 ± 0.397
ArgGly: 5.65 ± 0.397
ArgHis: 2.825 ± 1.841
ArgIle: 2.825 ± 0.199
ArgLys: 0.0 ± 0.0
ArgLeu: 5.65 ± 1.642
ArgMet: 2.825 ± 0.199
ArgAsn: 1.412 ± 1.119
ArgPro: 4.237 ± 2.761
ArgGln: 4.237 ± 1.318
ArgArg: 29.661 ± 15.247
ArgSer: 5.65 ± 0.397
ArgThr: 4.237 ± 1.318
ArgVal: 5.65 ± 0.397
ArgTrp: 0.0 ± 0.0
ArgTyr: 7.062 ± 0.523
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 1.412 ± 0.92
SerCys: 0.0 ± 0.0
SerAsp: 2.825 ± 0.199
SerGlu: 1.412 ± 0.92
SerPhe: 4.237 ± 0.722
SerGly: 9.887 ± 1.715
SerHis: 2.825 ± 1.841
SerIle: 1.412 ± 0.92
SerLys: 4.237 ± 0.722
SerLeu: 1.412 ± 0.92
SerMet: 4.237 ± 1.318
SerAsn: 2.825 ± 1.841
SerPro: 1.412 ± 0.92
SerGln: 2.825 ± 0.199
SerArg: 5.65 ± 2.437
SerSer: 2.825 ± 0.199
SerThr: 1.412 ± 1.119
SerVal: 7.062 ± 0.523
SerTrp: 1.412 ± 1.119
SerTyr: 4.237 ± 0.722
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.412 ± 0.92
ThrCys: 5.65 ± 2.437
ThrAsp: 2.825 ± 1.841
ThrGlu: 1.412 ± 0.92
ThrPhe: 0.0 ± 0.0
ThrGly: 4.237 ± 0.722
ThrHis: 2.825 ± 0.199
ThrIle: 4.237 ± 0.722
ThrLys: 4.237 ± 1.318
ThrLeu: 0.0 ± 0.0
ThrMet: 1.412 ± 1.119
ThrAsn: 7.062 ± 4.601
ThrPro: 1.412 ± 0.92
ThrGln: 2.825 ± 0.199
ThrArg: 4.237 ± 0.722
ThrSer: 1.412 ± 1.119
ThrThr: 0.0 ± 0.0
ThrVal: 8.475 ± 1.443
ThrTrp: 0.0 ± 0.0
ThrTyr: 1.412 ± 1.119
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.65 ± 3.681
ValCys: 1.412 ± 1.119
ValAsp: 7.062 ± 1.517
ValGlu: 5.65 ± 4.476
ValPhe: 2.825 ± 0.199
ValGly: 8.475 ± 1.443
ValHis: 2.825 ± 0.199
ValIle: 1.412 ± 1.119
ValLys: 1.412 ± 1.119
ValLeu: 5.65 ± 3.681
ValMet: 2.825 ± 0.199
ValAsn: 2.825 ± 0.199
ValPro: 1.412 ± 0.92
ValGln: 2.825 ± 0.199
ValArg: 1.412 ± 1.119
ValSer: 5.65 ± 1.642
ValThr: 2.825 ± 0.199
ValVal: 4.237 ± 0.722
ValTrp: 0.0 ± 0.0
ValTyr: 1.412 ± 1.119
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.412 ± 1.119
TrpCys: 2.825 ± 2.238
TrpAsp: 2.825 ± 1.841
TrpGlu: 0.0 ± 0.0
TrpPhe: 2.825 ± 0.199
TrpGly: 2.825 ± 2.238
TrpHis: 2.825 ± 0.199
TrpIle: 2.825 ± 1.841
TrpLys: 1.412 ± 1.119
TrpLeu: 1.412 ± 0.92
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 1.412 ± 1.119
TrpGln: 0.0 ± 0.0
TrpArg: 0.0 ± 0.0
TrpSer: 1.412 ± 0.92
TrpThr: 5.65 ± 0.397
TrpVal: 0.0 ± 0.0
TrpTrp: 1.412 ± 1.119
TrpTyr: 1.412 ± 1.119
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.825 ± 0.199
TyrCys: 0.0 ± 0.0
TyrAsp: 4.237 ± 1.318
TyrGlu: 1.412 ± 1.119
TyrPhe: 1.412 ± 1.119
TyrGly: 1.412 ± 0.92
TyrHis: 2.825 ± 2.238
TyrIle: 0.0 ± 0.0
TyrLys: 1.412 ± 1.119
TyrLeu: 9.887 ± 0.324
TyrMet: 0.0 ± 0.0
TyrAsn: 0.0 ± 0.0
TyrPro: 0.0 ± 0.0
TyrGln: 2.825 ± 2.238
TyrArg: 8.475 ± 0.596
TyrSer: 2.825 ± 1.841
TyrThr: 2.825 ± 1.841
TyrVal: 2.825 ± 2.238
TyrTrp: 1.412 ± 1.119
TyrTyr: 2.825 ± 0.199
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (709 amino acids)

Note: The error was estimated by bootstrapping (100×) at the protein level.
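A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions, not the Proteome-pI implementation: the helper names `per_mille` and `bootstrap_error` are hypothetical, the toy sequences are made up, and taking the ± value to be the standard deviation across the 100 resampled estimates is an assumption the page does not confirm.

```python
import random
import statistics
from collections import Counter

def per_mille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2 within each protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the estimate when whole proteins are resampled."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(per_mille(sample, pair))
    # Assumption: report the standard deviation of the resampled estimates.
    return statistics.pstdev(estimates)

# Toy sequences, purely illustrative (not the pecovirus proteins).
toy = ["MRRAG", "GRRY"]
print(per_mille(toy, "RR"), bootstrap_error(toy, "RR"))
```

With only two proteins, as in this proteome, each resample can contain just three distinct combinations of proteins, so the resulting error estimates are necessarily coarse.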

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski