Amino acid dipeptide frequency for Alces alces faeces associated genomovirus MP84

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteins.
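A minimal sketch of how such dipeptide frequencies could be computed is given below. It assumes one-letter amino acid codes, overlapping pairs counted within each protein (never across protein boundaries), and normalization by the total number of dipeptides; the exact counting convention used by the database is not stated here, and the function name and toy sequences are hypothetical.

```python
from collections import Counter


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping adjacent pairs are counted inside each protein,
    and the result is normalized by the total number of pairs.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2 over the sequence.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences (not the MP84 proteins).
freqs = dipeptide_permille(["MAAG", "GAA"])
print(freqs["AA"])
```

Here the two toy sequences contain five dipeptides in total (MA, AA, AG, GA, AA), so the frequencies of all observed dipeptides sum to 1000‰.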

There are 400 possible dipeptides (441 if we count Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
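The expected value and the red/blue marking can be sketched as follows. This is only an illustration: the function names are hypothetical, and it assumes that the baseline is the uniform expectation over 20 amino acids (1/400) and that any value above or below it counts as over- or underrepresented; the page does not state the exact threshold it uses.

```python
STANDARD_AA = 20  # use 21 to include Xaa


def expected_permille(n_amino_acids=STANDARD_AA):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / (n_amino_acids ** 2)


def classify(observed_permille, n_amino_acids=STANDARD_AA):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    expected = expected_permille(n_amino_acids)
    if observed_permille > expected:
        return "over"      # would be marked in red
    if observed_permille < expected:
        return "under"     # would be marked in blue
    return "expected"


print(expected_permille())    # 2.5 per mille, i.e. 0.25%
print(classify(19.386))       # GlyGly value from the table
print(classify(1.616))        # AlaAla value from the table
```

With this rule, GlyGly (19.386‰) falls above the 2.5‰ baseline and AlaAla (1.616‰) falls below it.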

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Entries are grouped by first residue; within each group the second residue runs in the order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.616 ± 1.084
AlaCys: 0.0 ± 0.0
AlaAsp: 1.616 ± 1.084
AlaGlu: 6.462 ± 0.0
AlaPhe: 1.616 ± 1.084
AlaGly: 6.462 ± 2.167
AlaHis: 0.0 ± 0.0
AlaIle: 1.616 ± 1.084
AlaLys: 1.616 ± 1.084
AlaLeu: 1.616 ± 1.084
AlaMet: 1.616 ± 1.084
AlaAsn: 6.462 ± 2.167
AlaPro: 4.847 ± 1.084
AlaGln: 3.231 ± 2.167
AlaArg: 9.693 ± 0.0
AlaSer: 3.231 ± 0.0
AlaThr: 8.078 ± 5.418
AlaVal: 1.616 ± 1.084
AlaTrp: 1.616 ± 1.084
AlaTyr: 4.847 ± 3.251
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.616 ± 1.084
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 1.616 ± 1.084
CysPhe: 4.847 ± 1.084
CysGly: 1.616 ± 1.084
CysHis: 0.0 ± 0.0
CysIle: 1.616 ± 1.084
CysLys: 0.0 ± 0.0
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 1.616 ± 1.084
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 3.231 ± 2.167
CysThr: 1.616 ± 1.084
CysVal: 1.616 ± 1.084
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 0.0 ± 0.0
AspCys: 1.616 ± 1.084
AspAsp: 8.078 ± 1.084
AspGlu: 1.616 ± 1.084
AspPhe: 1.616 ± 1.084
AspGly: 8.078 ± 5.418
AspHis: 0.0 ± 0.0
AspIle: 4.847 ± 1.084
AspLys: 1.616 ± 1.084
AspLeu: 8.078 ± 3.251
AspMet: 0.0 ± 0.0
AspAsn: 3.231 ± 2.167
AspPro: 8.078 ± 3.251
AspGln: 0.0 ± 0.0
AspArg: 0.0 ± 0.0
AspSer: 1.616 ± 1.084
AspThr: 1.616 ± 1.084
AspVal: 4.847 ± 3.251
AspTrp: 3.231 ± 0.0
AspTyr: 3.231 ± 0.0
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.231 ± 0.0
GluCys: 1.616 ± 1.084
GluAsp: 1.616 ± 1.084
GluGlu: 3.231 ± 2.167
GluPhe: 3.231 ± 2.167
GluGly: 0.0 ± 0.0
GluHis: 0.0 ± 0.0
GluIle: 1.616 ± 1.084
GluLys: 3.231 ± 0.0
GluLeu: 4.847 ± 1.084
GluMet: 0.0 ± 0.0
GluAsn: 3.231 ± 2.167
GluPro: 1.616 ± 1.084
GluGln: 0.0 ± 0.0
GluArg: 1.616 ± 1.084
GluSer: 1.616 ± 1.084
GluThr: 1.616 ± 1.084
GluVal: 1.616 ± 1.084
GluTrp: 3.231 ± 2.167
GluTyr: 3.231 ± 2.167
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.231 ± 2.167
PheCys: 0.0 ± 0.0
PheAsp: 4.847 ± 3.251
PheGlu: 1.616 ± 1.084
PhePhe: 1.616 ± 1.084
PheGly: 3.231 ± 2.167
PheHis: 0.0 ± 0.0
PheIle: 1.616 ± 1.084
PheLys: 3.231 ± 0.0
PheLeu: 6.462 ± 0.0
PheMet: 1.616 ± 1.084
PheAsn: 0.0 ± 0.0
PhePro: 1.616 ± 1.084
PheGln: 3.231 ± 2.167
PheArg: 6.462 ± 2.167
PheSer: 0.0 ± 0.0
PheThr: 4.847 ± 1.084
PheVal: 3.231 ± 2.167
PheTrp: 1.616 ± 1.084
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 9.693 ± 2.167
GlyCys: 0.0 ± 0.0
GlyAsp: 4.847 ± 1.084
GlyGlu: 3.231 ± 2.167
GlyPhe: 3.231 ± 0.0
GlyGly: 19.386 ± 4.334
GlyHis: 0.0 ± 0.0
GlyIle: 3.231 ± 2.167
GlyLys: 8.078 ± 1.084
GlyLeu: 4.847 ± 1.084
GlyMet: 1.616 ± 0.795
GlyAsn: 8.078 ± 3.251
GlyPro: 1.616 ± 1.084
GlyGln: 4.847 ± 1.084
GlyArg: 4.847 ± 1.084
GlySer: 8.078 ± 1.084
GlyThr: 6.462 ± 0.0
GlyVal: 3.231 ± 2.167
GlyTrp: 0.0 ± 0.0
GlyTyr: 1.616 ± 1.084
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.616 ± 1.084
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 3.231 ± 0.0
HisPhe: 1.616 ± 1.084
HisGly: 0.0 ± 0.0
HisHis: 0.0 ± 0.0
HisIle: 1.616 ± 1.084
HisLys: 0.0 ± 0.0
HisLeu: 1.616 ± 1.084
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 1.616 ± 1.084
HisGln: 0.0 ± 0.0
HisArg: 0.0 ± 0.0
HisSer: 0.0 ± 0.0
HisThr: 0.0 ± 0.0
HisVal: 3.231 ± 2.167
HisTrp: 0.0 ± 0.0
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.231 ± 2.167
IleCys: 1.616 ± 1.084
IleAsp: 0.0 ± 0.0
IleGlu: 1.616 ± 1.084
IlePhe: 4.847 ± 1.084
IleGly: 1.616 ± 1.084
IleHis: 1.616 ± 1.084
IleIle: 1.616 ± 1.084
IleLys: 8.078 ± 1.084
IleLeu: 1.616 ± 1.084
IleMet: 0.0 ± 0.0
IleAsn: 0.0 ± 0.0
IlePro: 0.0 ± 0.0
IleGln: 0.0 ± 0.0
IleArg: 1.616 ± 1.084
IleSer: 1.616 ± 1.084
IleThr: 3.231 ± 2.167
IleVal: 0.0 ± 0.0
IleTrp: 1.616 ± 1.084
IleTyr: 1.616 ± 1.084
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.231 ± 0.0
LysCys: 0.0 ± 0.0
LysAsp: 3.231 ± 0.0
LysGlu: 1.616 ± 1.084
LysPhe: 4.847 ± 3.251
LysGly: 1.616 ± 1.084
LysHis: 0.0 ± 0.0
LysIle: 0.0 ± 0.0
LysLys: 4.847 ± 3.251
LysLeu: 4.847 ± 1.084
LysMet: 3.231 ± 1.729
LysAsn: 1.616 ± 1.084
LysPro: 1.616 ± 1.084
LysGln: 1.616 ± 1.084
LysArg: 9.693 ± 2.167
LysSer: 1.616 ± 1.084
LysThr: 3.231 ± 0.0
LysVal: 0.0 ± 0.0
LysTrp: 0.0 ± 0.0
LysTyr: 4.847 ± 1.084
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 1.616 ± 1.084
LeuCys: 3.231 ± 2.167
LeuAsp: 3.231 ± 2.167
LeuGlu: 6.462 ± 0.0
LeuPhe: 0.0 ± 0.0
LeuGly: 11.309 ± 1.084
LeuHis: 1.616 ± 1.084
LeuIle: 1.616 ± 1.084
LeuLys: 0.0 ± 0.0
LeuLeu: 4.847 ± 1.084
LeuMet: 0.0 ± 0.0
LeuAsn: 1.616 ± 1.084
LeuPro: 1.616 ± 1.084
LeuGln: 1.616 ± 1.084
LeuArg: 1.616 ± 1.084
LeuSer: 8.078 ± 3.251
LeuThr: 3.231 ± 2.167
LeuVal: 9.693 ± 0.0
LeuTrp: 6.462 ± 0.0
LeuTyr: 4.847 ± 1.084
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.616 ± 1.084
MetCys: 0.0 ± 0.0
MetAsp: 3.231 ± 0.0
MetGlu: 0.0 ± 0.0
MetPhe: 0.0 ± 0.0
MetGly: 1.616 ± 1.084
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 0.0 ± 0.0
MetLeu: 1.616 ± 1.084
MetMet: 0.0 ± 0.0
MetAsn: 1.616 ± 1.084
MetPro: 3.231 ± 0.0
MetGln: 0.0 ± 0.0
MetArg: 1.616 ± 1.084
MetSer: 3.231 ± 0.0
MetThr: 0.0 ± 0.0
MetVal: 0.0 ± 0.0
MetTrp: 1.616 ± 1.084
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.847 ± 1.084
AsnCys: 0.0 ± 0.0
AsnAsp: 3.231 ± 2.167
AsnGlu: 0.0 ± 0.0
AsnPhe: 1.616 ± 1.084
AsnGly: 3.231 ± 2.167
AsnHis: 0.0 ± 0.0
AsnIle: 1.616 ± 1.084
AsnLys: 1.616 ± 1.084
AsnLeu: 6.462 ± 4.334
AsnMet: 0.0 ± 0.0
AsnAsn: 1.616 ± 1.084
AsnPro: 3.231 ± 0.0
AsnGln: 0.0 ± 0.0
AsnArg: 3.231 ± 2.167
AsnSer: 4.847 ± 3.251
AsnThr: 6.462 ± 2.167
AsnVal: 6.462 ± 2.167
AsnTrp: 0.0 ± 0.0
AsnTyr: 1.616 ± 1.084
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.847 ± 1.084
ProCys: 0.0 ± 0.0
ProAsp: 3.231 ± 0.0
ProGlu: 1.616 ± 1.084
ProPhe: 0.0 ± 0.0
ProGly: 4.847 ± 1.084
ProHis: 3.231 ± 2.167
ProIle: 1.616 ± 1.084
ProLys: 3.231 ± 2.167
ProLeu: 0.0 ± 0.0
ProMet: 3.231 ± 0.0
ProAsn: 1.616 ± 1.084
ProPro: 0.0 ± 0.0
ProGln: 3.231 ± 2.167
ProArg: 3.231 ± 2.167
ProSer: 3.231 ± 2.167
ProThr: 1.616 ± 1.084
ProVal: 4.847 ± 3.251
ProTrp: 0.0 ± 0.0
ProTyr: 1.616 ± 1.084
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.231 ± 2.167
GlnCys: 3.231 ± 2.167
GlnAsp: 0.0 ± 0.0
GlnGlu: 0.0 ± 0.0
GlnPhe: 0.0 ± 0.0
GlnGly: 3.231 ± 2.167
GlnHis: 0.0 ± 0.0
GlnIle: 1.616 ± 1.084
GlnLys: 1.616 ± 1.084
GlnLeu: 6.462 ± 2.167
GlnMet: 0.0 ± 0.0
GlnAsn: 0.0 ± 0.0
GlnPro: 1.616 ± 1.084
GlnGln: 0.0 ± 0.0
GlnArg: 1.616 ± 1.084
GlnSer: 1.616 ± 1.084
GlnThr: 3.231 ± 2.167
GlnVal: 0.0 ± 0.0
GlnTrp: 1.616 ± 1.084
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.231 ± 2.167
ArgCys: 1.616 ± 1.084
ArgAsp: 4.847 ± 1.084
ArgGlu: 1.616 ± 1.084
ArgPhe: 6.462 ± 2.167
ArgGly: 4.847 ± 1.084
ArgHis: 3.231 ± 2.167
ArgIle: 3.231 ± 2.167
ArgLys: 6.462 ± 2.167
ArgLeu: 1.616 ± 1.084
ArgMet: 0.0 ± 0.0
ArgAsn: 1.616 ± 1.084
ArgPro: 3.231 ± 0.0
ArgGln: 1.616 ± 1.084
ArgArg: 11.309 ± 3.251
ArgSer: 9.693 ± 0.0
ArgThr: 8.078 ± 1.084
ArgVal: 3.231 ± 2.167
ArgTrp: 0.0 ± 0.0
ArgTyr: 1.616 ± 1.084
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 1.616 ± 1.084
SerCys: 0.0 ± 0.0
SerAsp: 3.231 ± 0.0
SerGlu: 1.616 ± 1.084
SerPhe: 3.231 ± 0.0
SerGly: 11.309 ± 3.251
SerHis: 1.616 ± 1.084
SerIle: 4.847 ± 1.084
SerLys: 3.231 ± 0.0
SerLeu: 3.231 ± 2.167
SerMet: 1.616 ± 1.084
SerAsn: 4.847 ± 1.084
SerPro: 3.231 ± 2.167
SerGln: 3.231 ± 0.0
SerArg: 6.462 ± 2.167
SerSer: 8.078 ± 5.418
SerThr: 8.078 ± 3.251
SerVal: 0.0 ± 0.0
SerTrp: 3.231 ± 2.167
SerTyr: 3.231 ± 2.167
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.462 ± 2.167
ThrCys: 1.616 ± 1.084
ThrAsp: 1.616 ± 1.084
ThrGlu: 3.231 ± 2.167
ThrPhe: 3.231 ± 0.0
ThrGly: 6.462 ± 0.0
ThrHis: 0.0 ± 0.0
ThrIle: 1.616 ± 1.084
ThrLys: 1.616 ± 1.084
ThrLeu: 4.847 ± 1.084
ThrMet: 1.616 ± 1.084
ThrAsn: 4.847 ± 3.251
ThrPro: 3.231 ± 0.0
ThrGln: 1.616 ± 1.084
ThrArg: 3.231 ± 2.167
ThrSer: 6.462 ± 4.334
ThrThr: 6.462 ± 4.334
ThrVal: 6.462 ± 2.167
ThrTrp: 1.616 ± 1.084
ThrTyr: 4.847 ± 1.084
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.231 ± 0.0
ValCys: 1.616 ± 1.084
ValAsp: 8.078 ± 3.251
ValGlu: 1.616 ± 1.084
ValPhe: 4.847 ± 1.084
ValGly: 3.231 ± 0.0
ValHis: 0.0 ± 0.0
ValIle: 1.616 ± 1.084
ValLys: 3.231 ± 2.167
ValLeu: 4.847 ± 1.084
ValMet: 1.616 ± 1.084
ValAsn: 4.847 ± 3.251
ValPro: 1.616 ± 1.084
ValGln: 1.616 ± 1.084
ValArg: 1.616 ± 1.084
ValSer: 4.847 ± 3.251
ValThr: 3.231 ± 0.0
ValVal: 1.616 ± 1.084
ValTrp: 0.0 ± 0.0
ValTyr: 1.616 ± 1.084
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.616 ± 1.084
TrpCys: 3.231 ± 0.0
TrpAsp: 1.616 ± 1.084
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.0 ± 0.0
TrpGly: 3.231 ± 2.167
TrpHis: 1.616 ± 1.084
TrpIle: 0.0 ± 0.0
TrpLys: 0.0 ± 0.0
TrpLeu: 3.231 ± 2.167
TrpMet: 1.616 ± 1.084
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 1.616 ± 1.084
TrpArg: 4.847 ± 1.084
TrpSer: 3.231 ± 0.0
TrpThr: 0.0 ± 0.0
TrpVal: 1.616 ± 1.084
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 8.078 ± 3.251
TyrCys: 0.0 ± 0.0
TyrAsp: 4.847 ± 1.084
TyrGlu: 0.0 ± 0.0
TyrPhe: 1.616 ± 1.084
TyrGly: 3.231 ± 0.0
TyrHis: 1.616 ± 1.084
TyrIle: 0.0 ± 0.0
TyrLys: 0.0 ± 0.0
TyrLeu: 1.616 ± 1.084
TyrMet: 0.0 ± 0.0
TyrAsn: 3.231 ± 0.0
TyrPro: 3.231 ± 2.167
TyrGln: 1.616 ± 1.084
TyrArg: 4.847 ± 1.084
TyrSer: 1.616 ± 1.084
TyrThr: 0.0 ± 0.0
TyrVal: 1.616 ± 1.084
TyrTrp: 1.616 ± 1.084
TyrTyr: 1.616 ± 1.084
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (620 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
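One way such a protein-level bootstrap could look is sketched below. It is an assumption-laden illustration, not the database's actual procedure: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the sample standard deviation over replicates is reported as the error. The function names, the seed, and the toy sequences are hypothetical.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Frequency of one dipeptide (in per mille) over a list of sequences."""
    total = sum(max(len(seq) - 1, 0) for seq in proteins)
    if total == 0:
        return 0.0
    hits = sum(
        1
        for seq in proteins
        for i in range(len(seq) - 1)
        if seq[i:i + 2] == pair
    )
    return 1000.0 * hits / total


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Assumes a non-empty protein list and n_rounds >= 2.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome (not the MP84 proteins).
toy = ["MAAGAA", "MGGGKR"]
print(bootstrap_error(toy, "AA"))
```

With only two proteins, as in this proteome, each replicate can contain just three distinct combinations of proteins, so the error estimates take only a few distinct values.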

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski