Amino acid dipeptide frequency for Alces alces faeces associated microvirus MP12 5423

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
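The page does not state exactly how the pairs are counted, so the following is only a minimal sketch under stated assumptions: sequences are given in one-letter code, pairs are overlapping adjacent residues, and no pair spans two proteins. The function name `dipeptide_frequencies` and the toy sequences are hypothetical and are not taken from the MP12 proteome.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return {dipeptide: fraction} over all overlapping adjacent residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: pairs are counted within one protein only.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}


# Toy sequences for illustration only (not the MP12 proteins).
toy = ["MKKLA", "AKK"]
freqs = dipeptide_frequencies(toy)
for pair, fraction in sorted(freqs.items()):
    print(pair, round(fraction, 3))
```

The result is a plain fraction per dipeptide; the fractions of all observed dipeptides sum to 1.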

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 400 equally likely, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue, in the table.
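The uniform expectation is simply one over the number of possible ordered pairs. The sketch below works this out in per mille and compares a few values from the table against it. Whether the page's colouring uses exactly this threshold, or a 20- rather than 21-letter alphabet, is not stated, so `classify` and its labels are illustrative assumptions.

```python
def expected_per_mille(alphabet_size):
    """Uniform expectation for one ordered pair, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, alphabet_size=20):
    """Label a value relative to the uniform expectation (assumed threshold)."""
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


# Three values copied from the table, in per mille.
observed = {"LysLys": 11.552, "AlaPhe": 1.444, "TrpTrp": 0.0}
labels = {pair: classify(value) for pair, value in observed.items()}

print(expected_per_mille(20))            # 20 standard amino acids
print(round(expected_per_mille(21), 3))  # with Xaa as a 21st symbol
print(labels)
```

With 20 amino acids the expectation is 1000/400 = 2.5‰; counting Xaa as well lowers it to 1000/441, about 2.268‰.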

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions; for example, 5.054‰ corresponds to 0.005054 (0.5054%).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.054 ± 3.974
AlaCys: 0.0 ± 0.0
AlaAsp: 4.332 ± 1.525
AlaGlu: 4.332 ± 2.898
AlaPhe: 1.444 ± 1.012
AlaGly: 2.888 ± 1.558
AlaHis: 0.722 ± 0.506
AlaIle: 5.054 ± 2.19
AlaLys: 5.776 ± 1.026
AlaLeu: 1.444 ± 1.012
AlaMet: 0.722 ± 0.853
AlaAsn: 5.054 ± 1.714
AlaPro: 2.166 ± 0.884
AlaGln: 5.776 ± 1.644
AlaArg: 5.776 ± 1.203
AlaSer: 8.664 ± 4.984
AlaThr: 2.888 ± 1.321
AlaVal: 6.498 ± 2.029
AlaTrp: 0.722 ± 0.779
AlaTyr: 2.888 ± 1.558
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 1.444 ± 1.234
CysAsp: 1.444 ± 0.987
CysGlu: 1.444 ± 0.9
CysPhe: 0.0 ± 0.0
CysGly: 1.444 ± 1.249
CysHis: 0.0 ± 0.0
CysIle: 0.722 ± 0.625
CysLys: 0.722 ± 0.812
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 0.722 ± 0.506
CysPro: 1.444 ± 0.737
CysGln: 0.722 ± 0.625
CysArg: 0.722 ± 0.625
CysSer: 0.0 ± 0.0
CysThr: 0.722 ± 0.625
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.722 ± 0.506
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.054 ± 1.532
AspCys: 0.722 ± 0.625
AspAsp: 2.888 ± 1.148
AspGlu: 5.054 ± 2.759
AspPhe: 5.054 ± 1.214
AspGly: 2.888 ± 1.321
AspHis: 0.722 ± 0.625
AspIle: 3.61 ± 1.147
AspLys: 2.166 ± 1.294
AspLeu: 7.942 ± 2.06
AspMet: 1.444 ± 0.9
AspAsn: 2.166 ± 1.111
AspPro: 0.0 ± 0.0
AspGln: 1.444 ± 1.012
AspArg: 1.444 ± 1.141
AspSer: 2.166 ± 0.884
AspThr: 2.166 ± 1.056
AspVal: 2.166 ± 1.058
AspTrp: 0.722 ± 0.506
AspTyr: 2.166 ± 1.027
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.332 ± 0.79
GluCys: 2.888 ± 1.8
GluAsp: 7.22 ± 1.32
GluGlu: 5.054 ± 1.846
GluPhe: 3.61 ± 1.799
GluGly: 2.166 ± 0.934
GluHis: 2.166 ± 1.056
GluIle: 7.22 ± 3.724
GluLys: 9.386 ± 5.092
GluLeu: 3.61 ± 1.303
GluMet: 2.888 ± 0.617
GluAsn: 2.888 ± 1.321
GluPro: 0.0 ± 0.0
GluGln: 4.332 ± 2.709
GluArg: 3.61 ± 0.94
GluSer: 4.332 ± 0.717
GluThr: 4.332 ± 1.57
GluVal: 1.444 ± 0.9
GluTrp: 2.888 ± 1.911
GluTyr: 6.498 ± 1.457
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.332 ± 2.376
PheCys: 0.0 ± 0.0
PheAsp: 2.166 ± 0.793
PheGlu: 2.166 ± 0.884
PhePhe: 0.722 ± 0.625
PheGly: 2.888 ± 1.321
PheHis: 1.444 ± 1.012
PheIle: 0.722 ± 0.506
PheLys: 2.166 ± 0.884
PheLeu: 2.166 ± 1.088
PheMet: 2.888 ± 1.408
PheAsn: 2.888 ± 0.565
PhePro: 0.0 ± 0.0
PheGln: 1.444 ± 0.987
PheArg: 0.722 ± 0.506
PheSer: 2.166 ± 0.556
PheThr: 2.888 ± 1.554
PheVal: 0.722 ± 0.853
PheTrp: 0.0 ± 0.0
PheTyr: 5.054 ± 0.739
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.888 ± 1.558
GlyCys: 0.0 ± 0.0
GlyAsp: 1.444 ± 1.012
GlyGlu: 9.386 ± 2.176
GlyPhe: 0.722 ± 0.506
GlyGly: 5.054 ± 2.092
GlyHis: 2.166 ± 1.363
GlyIle: 2.166 ± 0.986
GlyLys: 2.888 ± 0.848
GlyLeu: 2.166 ± 0.724
GlyMet: 0.0 ± 0.0
GlyAsn: 3.61 ± 1.859
GlyPro: 0.722 ± 0.506
GlyGln: 3.61 ± 1.689
GlyArg: 0.722 ± 0.625
GlySer: 1.444 ± 0.574
GlyThr: 7.22 ± 1.787
GlyVal: 2.888 ± 0.565
GlyTrp: 2.166 ± 0.94
GlyTyr: 2.888 ± 1.148
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.444 ± 0.918
HisCys: 0.722 ± 0.506
HisAsp: 1.444 ± 0.574
HisGlu: 1.444 ± 0.779
HisPhe: 0.722 ± 0.506
HisGly: 0.0 ± 0.0
HisHis: 0.0 ± 0.0
HisIle: 0.722 ± 0.506
HisLys: 2.166 ± 0.986
HisLeu: 0.722 ± 0.506
HisMet: 0.722 ± 0.506
HisAsn: 0.0 ± 0.0
HisPro: 0.0 ± 0.0
HisGln: 0.722 ± 0.625
HisArg: 0.722 ± 0.853
HisSer: 0.722 ± 0.779
HisThr: 0.722 ± 0.779
HisVal: 1.444 ± 0.794
HisTrp: 0.722 ± 0.506
HisTyr: 3.61 ± 1.722
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.332 ± 1.11
IleCys: 0.0 ± 0.0
IleAsp: 2.166 ± 0.556
IleGlu: 5.054 ± 1.203
IlePhe: 2.888 ± 1.114
IleGly: 4.332 ± 1.449
IleHis: 0.722 ± 0.779
IleIle: 0.722 ± 0.625
IleLys: 7.22 ± 2.176
IleLeu: 3.61 ± 1.605
IleMet: 0.722 ± 0.506
IleAsn: 5.054 ± 2.093
IlePro: 2.888 ± 1.475
IleGln: 2.166 ± 1.513
IleArg: 5.054 ± 2.13
IleSer: 2.888 ± 1.321
IleThr: 2.166 ± 0.724
IleVal: 1.444 ± 1.707
IleTrp: 0.0 ± 0.0
IleTyr: 2.888 ± 0.892
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.776 ± 2.406
LysCys: 0.722 ± 0.506
LysAsp: 4.332 ± 0.806
LysGlu: 5.054 ± 2.852
LysPhe: 3.61 ± 1.121
LysGly: 5.776 ± 3.184
LysHis: 0.722 ± 0.506
LysIle: 5.776 ± 0.831
LysLys: 11.552 ± 7.308
LysLeu: 7.942 ± 3.252
LysMet: 2.166 ± 0.881
LysAsn: 5.054 ± 1.83
LysPro: 0.722 ± 0.506
LysGln: 0.722 ± 0.506
LysArg: 4.332 ± 2.587
LysSer: 2.888 ± 0.924
LysThr: 6.498 ± 2.765
LysVal: 2.888 ± 0.863
LysTrp: 0.722 ± 0.625
LysTyr: 4.332 ± 2.137
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 2.166 ± 1.519
LeuCys: 0.0 ± 0.0
LeuAsp: 0.0 ± 0.0
LeuGlu: 4.332 ± 0.79
LeuPhe: 2.888 ± 1.321
LeuGly: 2.888 ± 1.454
LeuHis: 0.722 ± 0.779
LeuIle: 3.61 ± 1.859
LeuLys: 7.22 ± 2.682
LeuLeu: 3.61 ± 0.74
LeuMet: 1.444 ± 1.026
LeuAsn: 2.166 ± 1.118
LeuPro: 7.22 ± 2.326
LeuGln: 2.166 ± 0.934
LeuArg: 2.166 ± 1.438
LeuSer: 5.054 ± 1.951
LeuThr: 6.498 ± 1.662
LeuVal: 2.166 ± 1.519
LeuTrp: 2.888 ± 0.892
LeuTyr: 2.888 ± 0.848
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.166 ± 1.335
MetCys: 1.444 ± 1.249
MetAsp: 1.444 ± 0.574
MetGlu: 0.0 ± 0.0
MetPhe: 1.444 ± 1.012
MetGly: 0.722 ± 0.506
MetHis: 0.722 ± 0.956
MetIle: 1.444 ± 1.249
MetLys: 2.166 ± 1.294
MetLeu: 2.888 ± 1.713
MetMet: 0.722 ± 0.625
MetAsn: 2.888 ± 1.308
MetPro: 0.722 ± 0.506
MetGln: 0.722 ± 0.506
MetArg: 1.444 ± 0.95
MetSer: 2.888 ± 1.558
MetThr: 2.888 ± 1.475
MetVal: 1.444 ± 0.779
MetTrp: 0.722 ± 0.506
MetTyr: 0.722 ± 0.625
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.444 ± 0.779
AsnCys: 0.722 ± 0.812
AsnAsp: 7.22 ± 1.813
AsnGlu: 5.054 ± 2.776
AsnPhe: 1.444 ± 0.574
AsnGly: 3.61 ± 1.369
AsnHis: 1.444 ± 0.794
AsnIle: 2.888 ± 1.747
AsnLys: 5.776 ± 1.423
AsnLeu: 1.444 ± 0.574
AsnMet: 1.444 ± 0.574
AsnAsn: 4.332 ± 4.032
AsnPro: 3.61 ± 1.086
AsnGln: 0.722 ± 0.779
AsnArg: 5.054 ± 1.071
AsnSer: 3.61 ± 1.24
AsnThr: 3.61 ± 1.287
AsnVal: 9.386 ± 3.273
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.722 ± 0.779
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.722 ± 0.506
ProCys: 0.722 ± 0.625
ProAsp: 1.444 ± 0.794
ProGlu: 1.444 ± 0.9
ProPhe: 1.444 ± 0.987
ProGly: 1.444 ± 1.012
ProHis: 1.444 ± 0.9
ProIle: 2.888 ± 1.308
ProLys: 4.332 ± 3.025
ProLeu: 2.888 ± 1.321
ProMet: 1.444 ± 0.574
ProAsn: 0.0 ± 0.0
ProPro: 1.444 ± 0.574
ProGln: 1.444 ± 1.012
ProArg: 4.332 ± 1.25
ProSer: 2.888 ± 0.977
ProThr: 3.61 ± 2.531
ProVal: 2.888 ± 0.565
ProTrp: 0.0 ± 0.0
ProTyr: 0.722 ± 0.506
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 8.664 ± 6.466
GlnCys: 0.0 ± 0.0
GlnAsp: 2.888 ± 0.565
GlnGlu: 2.888 ± 0.924
GlnPhe: 0.722 ± 0.506
GlnGly: 2.166 ± 1.519
GlnHis: 0.0 ± 0.0
GlnIle: 0.722 ± 0.812
GlnLys: 2.166 ± 1.118
GlnLeu: 0.722 ± 0.506
GlnMet: 2.166 ± 2.338
GlnAsn: 8.664 ± 5.287
GlnPro: 2.166 ± 1.519
GlnGln: 2.888 ± 2.67
GlnArg: 2.166 ± 1.111
GlnSer: 1.444 ± 0.918
GlnThr: 5.054 ± 1.412
GlnVal: 0.722 ± 0.506
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.722 ± 0.853
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.61 ± 2.282
ArgCys: 0.0 ± 0.0
ArgAsp: 2.166 ± 0.986
ArgGlu: 4.332 ± 1.248
ArgPhe: 2.166 ± 1.571
ArgGly: 2.888 ± 1.35
ArgHis: 0.0 ± 0.0
ArgIle: 3.61 ± 1.059
ArgLys: 1.444 ± 1.249
ArgLeu: 3.61 ± 0.991
ArgMet: 2.888 ± 1.24
ArgAsn: 2.888 ± 1.235
ArgPro: 3.61 ± 1.102
ArgGln: 0.722 ± 0.779
ArgArg: 0.722 ± 0.956
ArgSer: 3.61 ± 1.626
ArgThr: 7.22 ± 3.259
ArgVal: 0.722 ± 0.625
ArgTrp: 0.0 ± 0.0
ArgTyr: 2.166 ± 1.088
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.332 ± 1.525
SerCys: 0.0 ± 0.0
SerAsp: 1.444 ± 0.737
SerGlu: 4.332 ± 1.248
SerPhe: 1.444 ± 1.249
SerGly: 1.444 ± 0.779
SerHis: 2.166 ± 0.884
SerIle: 2.888 ± 0.565
SerLys: 3.61 ± 1.722
SerLeu: 7.22 ± 1.9
SerMet: 1.444 ± 0.737
SerAsn: 3.61 ± 1.751
SerPro: 2.888 ± 0.848
SerGln: 7.942 ± 4.718
SerArg: 2.166 ± 0.986
SerSer: 3.61 ± 1.788
SerThr: 2.888 ± 0.848
SerVal: 5.054 ± 1.6
SerTrp: 0.0 ± 0.0
SerTyr: 3.61 ± 1.402
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.776 ± 2.222
ThrCys: 0.722 ± 0.625
ThrAsp: 3.61 ± 1.402
ThrGlu: 7.942 ± 1.368
ThrPhe: 2.888 ± 1.308
ThrGly: 6.498 ± 1.138
ThrHis: 0.722 ± 0.506
ThrIle: 6.498 ± 2.172
ThrLys: 4.332 ± 0.79
ThrLeu: 4.332 ± 2.071
ThrMet: 3.61 ± 0.918
ThrAsn: 5.054 ± 2.49
ThrPro: 4.332 ± 1.438
ThrGln: 2.166 ± 0.934
ThrArg: 2.888 ± 1.075
ThrSer: 6.498 ± 1.733
ThrThr: 4.332 ± 2.22
ThrVal: 0.0 ± 0.0
ThrTrp: 0.722 ± 0.506
ThrTyr: 3.61 ± 1.795
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.61 ± 2.225
ValCys: 1.444 ± 0.574
ValAsp: 1.444 ± 1.012
ValGlu: 3.61 ± 2.153
ValPhe: 0.722 ± 0.853
ValGly: 2.166 ± 0.793
ValHis: 1.444 ± 0.574
ValIle: 2.166 ± 0.934
ValLys: 2.888 ± 1.8
ValLeu: 1.444 ± 0.779
ValMet: 1.444 ± 0.744
ValAsn: 3.61 ± 1.767
ValPro: 3.61 ± 1.751
ValGln: 3.61 ± 4.267
ValArg: 2.888 ± 0.848
ValSer: 2.166 ± 1.519
ValThr: 5.054 ± 1.248
ValVal: 2.166 ± 1.438
ValTrp: 0.0 ± 0.0
ValTyr: 0.722 ± 0.506
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.888 ± 1.558
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 1.444 ± 0.574
TrpPhe: 1.444 ± 1.012
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.722 ± 0.853
TrpLys: 1.444 ± 0.987
TrpLeu: 0.0 ± 0.0
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 1.444 ± 0.779
TrpArg: 0.722 ± 0.506
TrpSer: 2.166 ± 1.088
TrpThr: 0.722 ± 0.625
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.61 ± 1.215
TyrCys: 0.722 ± 0.812
TyrAsp: 2.888 ± 0.924
TyrGlu: 6.498 ± 2.011
TyrPhe: 2.166 ± 0.884
TyrGly: 2.888 ± 1.106
TyrHis: 1.444 ± 0.918
TyrIle: 2.166 ± 1.874
TyrLys: 2.888 ± 0.924
TyrLeu: 4.332 ± 1.768
TyrMet: 0.722 ± 0.506
TyrAsn: 2.888 ± 0.848
TyrPro: 0.0 ± 0.0
TyrGln: 2.166 ± 1.54
TyrArg: 0.722 ± 0.506
TyrSer: 2.888 ± 1.464
TyrThr: 5.054 ± 2.703
TyrVal: 2.166 ± 1.874
TyrTrp: 0.722 ± 0.506
TyrTyr: 1.444 ± 0.9
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1386 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
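A protein-level bootstrap can be sketched as follows. This is not the database's actual code: it assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each replicate, and that the reported error is the sample standard deviation across replicates. The names `per_mille` and `bootstrap_error`, the fixed seed, and the toy sequences are all hypothetical.

```python
import random
import statistics
from collections import Counter


def per_mille(sequences):
    """Dipeptide frequencies in per mille over overlapping adjacent pairs."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Standard deviation of one dipeptide's frequency across bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    replicates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = [rng.choice(sequences) for _ in sequences]
        replicates.append(per_mille(sample).get(pair, 0.0))
    return statistics.stdev(replicates)


# Toy one-letter sequences for illustration only (not the MP12 proteins).
toy = ["MKKLA", "AAAA", "AKK"]
print(bootstrap_error(toy, "KK"))
print(bootstrap_error(toy, "WW"))  # never observed, so every replicate is 0
```

Under these assumptions a dipeptide that never occurs in any protein gets an error of exactly zero, which matches the 0.0 ± 0.0 entries in the table. With only 6 proteins to resample, large errors relative to the values are to be expected.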

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski