Amino acid dipeptide frequency for Alces alces faeces associated microvirus MP21 4718

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and those that are underrepresented are marked in blue in the table.
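The comparison against the uniform expectation can be illustrated with a minimal sketch. It assumes that dipeptides are counted as overlapping pairs within each protein and that only the 20 standard residues are tabulated. The function name `dipeptide_permille` and the toy sequences are hypothetical and are not taken from Proteome-pI.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues (one-letter codes)
EXPECTED_PERMILLE = 1000.0 / 400      # 2.5 per mille if all 400 dipeptides were equally likely


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 400 standard dipeptides.

    Assumption: overlapping pairs are counted within each protein,
    never across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {a + b: 0.0 for a, b in product(AMINO_ACIDS, repeat=2)}
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(AMINO_ACIDS, repeat=2)}


# Hypothetical toy sequences, not the real proteome.
toy_proteins = ["MAAGSL", "MKAAG"]
freqs = dipeptide_permille(toy_proteins)

# Dipeptides above the uniform expectation would be "red", those below "blue".
overrepresented = sorted(d for d, f in freqs.items() if f > EXPECTED_PERMILLE)
underrepresented = sorted(d for d, f in freqs.items() if f < EXPECTED_PERMILLE)
```

With only a few proteins, most of the 400 dipeptides are necessarily absent or rare, so the split into over- and underrepresented pairs mainly reflects the small sample size.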

All values are given in per mille (‰) and must therefore be multiplied by 10⁻³ to obtain fractions.
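As a small illustration of the unit conversion, the sketch below reads one table entry of the form `AlaAla: 8.467 ± 2.022` and converts both the value and its error from per mille to plain fractions. The helper `parse_cell` and its regular expression are assumptions made for this example and are not part of Proteome-pI.

```python
import re

# Two three-letter residue codes, a colon, the value, and the error after "±".
CELL_PATTERN = re.compile(
    r"([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*([0-9.]+)\s*±\s*([0-9.]+)"
)


def parse_cell(text):
    """Return (first, second, fraction, fraction_error) for one table entry.

    The per mille numbers are multiplied by 1e-3 to give fractions.
    """
    match = CELL_PATTERN.search(text)
    if match is None:
        raise ValueError(f"not a dipeptide entry: {text!r}")
    first, second, value, error = match.groups()
    return first, second, float(value) * 1e-3, float(error) * 1e-3


first, second, fraction, fraction_error = parse_cell("AlaAla: 8.467 ± 2.022")
```

Here `fraction` is about 0.0085, that is, roughly 0.85% of all dipeptides.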

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.467 ± 2.022
AlaCys: 0.0 ± 0.0
AlaAsp: 5.644 ± 2.59
AlaGlu: 1.881 ± 0.707
AlaPhe: 0.941 ± 0.588
AlaGly: 8.467 ± 5.155
AlaHis: 0.0 ± 0.0
AlaIle: 4.704 ± 2.282
AlaLys: 3.763 ± 1.336
AlaLeu: 6.585 ± 2.314
AlaMet: 2.822 ± 1.007
AlaAsn: 3.763 ± 1.107
AlaPro: 2.822 ± 0.941
AlaGln: 4.704 ± 1.894
AlaArg: 5.644 ± 1.304
AlaSer: 7.526 ± 2.627
AlaThr: 0.941 ± 0.588
AlaVal: 3.763 ± 2.349
AlaTrp: 1.881 ± 0.749
AlaTyr: 1.881 ± 0.707
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 0.941 ± 0.963
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.941 ± 0.588
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 0.941 ± 0.963
CysTrp: 0.0 ± 0.0
CysTyr: 0.941 ± 0.963
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.585 ± 2.715
AspCys: 0.0 ± 0.0
AspAsp: 1.881 ± 0.707
AspGlu: 3.763 ± 1.498
AspPhe: 2.822 ± 1.764
AspGly: 5.644 ± 0.691
AspHis: 1.881 ± 0.749
AspIle: 1.881 ± 0.749
AspLys: 0.941 ± 0.963
AspLeu: 4.704 ± 0.937
AspMet: 1.881 ± 1.051
AspAsn: 1.881 ± 0.707
AspPro: 0.941 ± 0.823
AspGln: 1.881 ± 1.176
AspArg: 0.0 ± 0.0
AspSer: 2.822 ± 2.468
AspThr: 5.644 ± 1.304
AspVal: 4.704 ± 0.717
AspTrp: 0.941 ± 0.588
AspTyr: 3.763 ± 1.415
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.704 ± 1.515
GluCys: 0.0 ± 0.0
GluAsp: 3.763 ± 0.139
GluGlu: 3.763 ± 1.498
GluPhe: 4.704 ± 0.717
GluGly: 1.881 ± 0.749
GluHis: 3.763 ± 0.139
GluIle: 1.881 ± 0.749
GluLys: 1.881 ± 0.749
GluLeu: 3.763 ± 1.379
GluMet: 1.881 ± 1.926
GluAsn: 2.822 ± 0.465
GluPro: 3.763 ± 1.379
GluGln: 2.822 ± 1.417
GluArg: 5.644 ± 0.929
GluSer: 2.822 ± 2.888
GluThr: 1.881 ± 0.707
GluVal: 3.763 ± 1.49
GluTrp: 0.941 ± 0.588
GluTyr: 4.704 ± 0.711
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.704 ± 1.639
PheCys: 0.0 ± 0.0
PheAsp: 1.881 ± 1.051
PheGlu: 1.881 ± 1.176
PhePhe: 0.0 ± 0.0
PheGly: 5.644 ± 1.304
PheHis: 0.0 ± 0.0
PheIle: 5.644 ± 0.813
PheLys: 0.0 ± 0.0
PheLeu: 2.822 ± 0.465
PheMet: 1.881 ± 0.749
PheAsn: 0.0 ± 0.0
PhePro: 2.822 ± 1.007
PheGln: 1.881 ± 1.926
PheArg: 2.822 ± 0.941
PheSer: 4.704 ± 1.597
PheThr: 0.941 ± 0.588
PheVal: 0.0 ± 0.0
PheTrp: 0.941 ± 0.588
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.585 ± 1.068
GlyCys: 0.0 ± 0.0
GlyAsp: 7.526 ± 2.114
GlyGlu: 4.704 ± 1.515
GlyPhe: 1.881 ± 1.176
GlyGly: 6.585 ± 1.34
GlyHis: 0.941 ± 0.588
GlyIle: 5.644 ± 1.304
GlyLys: 2.822 ± 0.465
GlyLeu: 8.467 ± 0.849
GlyMet: 0.941 ± 0.823
GlyAsn: 2.822 ± 1.764
GlyPro: 3.763 ± 0.139
GlyGln: 4.704 ± 1.894
GlyArg: 6.585 ± 4.57
GlySer: 9.407 ± 5.031
GlyThr: 4.704 ± 2.939
GlyVal: 3.763 ± 0.139
GlyTrp: 0.941 ± 0.823
GlyTyr: 5.644 ± 0.691
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.881 ± 1.926
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 1.881 ± 1.926
HisPhe: 0.0 ± 0.0
HisGly: 2.822 ± 0.941
HisHis: 0.941 ± 0.588
HisIle: 0.941 ± 0.588
HisLys: 0.941 ± 0.963
HisLeu: 2.822 ± 1.007
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 1.881 ± 0.707
HisGln: 1.881 ± 1.645
HisArg: 0.941 ± 0.588
HisSer: 0.941 ± 0.588
HisThr: 0.0 ± 0.0
HisVal: 0.941 ± 0.588
HisTrp: 0.941 ± 0.588
HisTyr: 1.881 ± 0.749
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.822 ± 1.007
IleCys: 0.0 ± 0.0
IleAsp: 5.644 ± 0.929
IleGlu: 4.704 ± 2.336
IlePhe: 0.941 ± 0.588
IleGly: 6.585 ± 1.485
IleHis: 1.881 ± 0.749
IleIle: 0.941 ± 0.588
IleLys: 1.881 ± 0.749
IleLeu: 1.881 ± 0.749
IleMet: 2.822 ± 1.297
IleAsn: 1.881 ± 0.749
IlePro: 0.941 ± 0.588
IleGln: 4.704 ± 0.717
IleArg: 3.763 ± 2.207
IleSer: 2.822 ± 1.764
IleThr: 3.763 ± 1.379
IleVal: 1.881 ± 0.749
IleTrp: 0.0 ± 0.0
IleTyr: 4.704 ± 0.717
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.763 ± 1.336
LysCys: 0.0 ± 0.0
LysAsp: 0.941 ± 0.823
LysGlu: 4.704 ± 1.515
LysPhe: 2.822 ± 0.941
LysGly: 2.822 ± 0.465
LysHis: 0.0 ± 0.0
LysIle: 1.881 ± 0.749
LysLys: 4.704 ± 4.814
LysLeu: 2.822 ± 2.888
LysMet: 2.822 ± 0.465
LysAsn: 0.0 ± 0.0
LysPro: 1.881 ± 1.926
LysGln: 2.822 ± 1.622
LysArg: 3.763 ± 1.336
LysSer: 0.941 ± 0.588
LysThr: 0.941 ± 0.963
LysVal: 4.704 ± 0.711
LysTrp: 0.0 ± 0.0
LysTyr: 2.822 ± 1.622
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.644 ± 2.59
LeuCys: 1.881 ± 1.926
LeuAsp: 2.822 ± 0.941
LeuGlu: 8.467 ± 3.614
LeuPhe: 3.763 ± 2.56
LeuGly: 7.526 ± 1.392
LeuHis: 0.941 ± 0.823
LeuIle: 3.763 ± 0.139
LeuLys: 2.822 ± 1.84
LeuLeu: 3.763 ± 1.107
LeuMet: 2.822 ± 0.465
LeuAsn: 3.763 ± 1.49
LeuPro: 8.467 ± 2.359
LeuGln: 7.526 ± 2.479
LeuArg: 8.467 ± 2.039
LeuSer: 2.822 ± 1.417
LeuThr: 5.644 ± 2.247
LeuVal: 1.881 ± 0.749
LeuTrp: 0.0 ± 0.0
LeuTyr: 0.0 ± 0.0
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.763 ± 1.415
MetCys: 0.0 ± 0.0
MetAsp: 0.941 ± 0.823
MetGlu: 0.0 ± 0.0
MetPhe: 0.941 ± 0.963
MetGly: 2.822 ± 0.465
MetHis: 1.881 ± 0.749
MetIle: 0.0 ± 0.0
MetLys: 0.941 ± 0.588
MetLeu: 4.704 ± 1.597
MetMet: 0.0 ± 0.0
MetAsn: 0.941 ± 0.823
MetPro: 0.0 ± 0.0
MetGln: 0.941 ± 0.823
MetArg: 2.822 ± 0.941
MetSer: 2.822 ± 1.623
MetThr: 0.941 ± 0.963
MetVal: 1.881 ± 1.051
MetTrp: 0.0 ± 0.0
MetTyr: 0.941 ± 0.588
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.881 ± 0.749
AsnCys: 0.941 ± 0.963
AsnAsp: 1.881 ± 1.176
AsnGlu: 3.763 ± 0.139
AsnPhe: 1.881 ± 1.645
AsnGly: 1.881 ± 0.707
AsnHis: 1.881 ± 0.707
AsnIle: 2.822 ± 0.941
AsnLys: 2.822 ± 1.622
AsnLeu: 2.822 ± 0.465
AsnMet: 0.0 ± 0.0
AsnAsn: 0.941 ± 0.963
AsnPro: 3.763 ± 1.379
AsnGln: 1.881 ± 1.645
AsnArg: 2.822 ± 1.007
AsnSer: 2.822 ± 0.465
AsnThr: 0.941 ± 0.823
AsnVal: 0.941 ± 0.823
AsnTrp: 0.0 ± 0.0
AsnTyr: 1.881 ± 0.707
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.941 ± 0.588
ProCys: 0.0 ± 0.0
ProAsp: 3.763 ± 1.336
ProGlu: 2.822 ± 0.941
ProPhe: 2.822 ± 0.465
ProGly: 5.644 ± 1.304
ProHis: 0.941 ± 0.963
ProIle: 4.704 ± 0.937
ProLys: 2.822 ± 0.941
ProLeu: 2.822 ± 0.465
ProMet: 0.941 ± 0.588
ProAsn: 0.941 ± 0.823
ProPro: 2.822 ± 0.465
ProGln: 0.941 ± 0.588
ProArg: 1.881 ± 0.749
ProSer: 3.763 ± 1.107
ProThr: 6.585 ± 1.34
ProVal: 6.585 ± 2.288
ProTrp: 0.941 ± 0.588
ProTyr: 1.881 ± 1.051
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 6.585 ± 2.314
GlnCys: 0.0 ± 0.0
GlnAsp: 1.881 ± 0.707
GlnGlu: 4.704 ± 2.939
GlnPhe: 1.881 ± 0.749
GlnGly: 1.881 ± 0.707
GlnHis: 1.881 ± 1.176
GlnIle: 1.881 ± 0.707
GlnLys: 3.763 ± 1.107
GlnLeu: 4.704 ± 2.608
GlnMet: 2.822 ± 1.623
GlnAsn: 3.763 ± 2.207
GlnPro: 0.941 ± 0.963
GlnGln: 4.704 ± 2.029
GlnArg: 4.704 ± 0.711
GlnSer: 1.881 ± 1.645
GlnThr: 2.822 ± 1.764
GlnVal: 0.941 ± 0.588
GlnTrp: 0.941 ± 0.823
GlnTyr: 1.881 ± 1.926
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.526 ± 3.176
ArgCys: 0.0 ± 0.0
ArgAsp: 6.585 ± 1.754
ArgGlu: 2.822 ± 1.007
ArgPhe: 1.881 ± 1.176
ArgGly: 3.763 ± 1.498
ArgHis: 0.0 ± 0.0
ArgIle: 3.763 ± 1.107
ArgLys: 7.526 ± 4.666
ArgLeu: 5.644 ± 1.883
ArgMet: 0.0 ± 0.0
ArgAsn: 3.763 ± 1.336
ArgPro: 4.704 ± 0.717
ArgGln: 1.881 ± 0.707
ArgArg: 5.644 ± 1.304
ArgSer: 2.822 ± 0.941
ArgThr: 1.881 ± 0.707
ArgVal: 0.941 ± 0.588
ArgTrp: 0.941 ± 0.823
ArgTyr: 5.644 ± 1.883
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.822 ± 1.417
SerCys: 0.0 ± 0.0
SerAsp: 5.644 ± 2.015
SerGlu: 1.881 ± 0.707
SerPhe: 2.822 ± 1.622
SerGly: 9.407 ± 3.277
SerHis: 0.941 ± 0.588
SerIle: 8.467 ± 0.849
SerLys: 1.881 ± 0.749
SerLeu: 3.763 ± 1.379
SerMet: 0.941 ± 0.823
SerAsn: 3.763 ± 1.336
SerPro: 2.822 ± 2.468
SerGln: 1.881 ± 1.645
SerArg: 4.704 ± 1.515
SerSer: 8.467 ± 2.022
SerThr: 1.881 ± 1.645
SerVal: 4.704 ± 1.639
SerTrp: 0.941 ± 0.823
SerTyr: 1.881 ± 0.707
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.763 ± 1.49
ThrCys: 0.941 ± 0.588
ThrAsp: 1.881 ± 0.749
ThrGlu: 3.763 ± 1.49
ThrPhe: 0.941 ± 0.588
ThrGly: 5.644 ± 0.813
ThrHis: 0.941 ± 0.823
ThrIle: 3.763 ± 2.352
ThrLys: 0.941 ± 0.963
ThrLeu: 3.763 ± 1.336
ThrMet: 0.0 ± 0.0
ThrAsn: 1.881 ± 0.707
ThrPro: 3.763 ± 1.107
ThrGln: 2.822 ± 1.417
ThrArg: 0.941 ± 0.963
ThrSer: 3.763 ± 2.352
ThrThr: 3.763 ± 2.352
ThrVal: 2.822 ± 1.764
ThrTrp: 0.941 ± 0.588
ThrTyr: 1.881 ± 1.051
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.822 ± 0.465
ValCys: 0.0 ± 0.0
ValAsp: 0.941 ± 0.963
ValGlu: 3.763 ± 1.107
ValPhe: 2.822 ± 1.764
ValGly: 4.704 ± 2.083
ValHis: 0.941 ± 0.963
ValIle: 0.941 ± 0.823
ValLys: 1.881 ± 1.176
ValLeu: 7.526 ± 1.517
ValMet: 0.941 ± 0.588
ValAsn: 2.822 ± 0.941
ValPro: 6.585 ± 2.31
ValGln: 2.822 ± 0.941
ValArg: 2.822 ± 1.623
ValSer: 5.644 ± 2.015
ValThr: 3.763 ± 1.379
ValVal: 1.881 ± 1.645
ValTrp: 0.0 ± 0.0
ValTyr: 0.0 ± 0.0
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.941 ± 0.588
TrpPhe: 0.941 ± 0.823
TrpGly: 0.941 ± 0.823
TrpHis: 0.941 ± 0.588
TrpIle: 0.0 ± 0.0
TrpLys: 0.941 ± 0.823
TrpLeu: 2.822 ± 1.007
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.941 ± 0.588
TrpGln: 0.941 ± 0.588
TrpArg: 0.0 ± 0.0
TrpSer: 0.941 ± 0.963
TrpThr: 0.0 ± 0.0
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.881 ± 1.176
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.941 ± 0.823
TyrCys: 0.0 ± 0.0
TyrAsp: 0.941 ± 0.588
TyrGlu: 0.941 ± 0.823
TyrPhe: 3.763 ± 1.107
TyrGly: 2.822 ± 0.941
TyrHis: 0.941 ± 0.963
TyrIle: 1.881 ± 0.749
TyrLys: 0.941 ± 0.963
TyrLeu: 6.585 ± 3.066
TyrMet: 2.822 ± 1.093
TyrAsn: 2.822 ± 0.941
TyrPro: 0.941 ± 0.823
TyrGln: 2.822 ± 1.007
TyrArg: 3.763 ± 1.49
TyrSer: 1.881 ± 1.645
TyrThr: 1.881 ± 1.645
TyrVal: 6.585 ± 2.31
TyrTrp: 0.941 ± 0.588
TyrTyr: 1.881 ± 0.749
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (1064 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
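The idea of a protein-level bootstrap can be sketched as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each of 100 resamples, and the standard deviation of those estimates is taken as the error. The exact procedure used by Proteome-pI is not described here, and the function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Each round draws len(proteins) whole proteins with replacement
    (resampling at the protein level, not at the residue level).
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [dipeptide_counts(seq) for seq in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = [rng.choice(per_protein) for _ in per_protein]
        total = sum(sum(counts.values()) for counts in sample)
        hits = sum(counts[dipeptide] for counts in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.pstdev(estimates)


# Hypothetical toy proteome of three short sequences.
toy_proteins = ["MAAGSL", "MKAAG", "MSSGL"]
mean_aa, err_aa = bootstrap_error(toy_proteins, "AA")
```

With only three proteins, each resample differs substantially from the next, which is consistent with the large errors relative to the values seen in the table.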

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski