Amino acid dipepetide frequency for Alces alces faeces associated microvirus MP10 5560

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.882AlaAla: 5.882 ± 2.863
1.307AlaCys: 1.307 ± 1.214
3.268AlaAsp: 3.268 ± 1.207
7.19AlaGlu: 7.19 ± 3.974
4.575AlaPhe: 4.575 ± 2.026
5.229AlaGly: 5.229 ± 1.688
1.961AlaHis: 1.961 ± 0.684
5.229AlaIle: 5.229 ± 2.195
3.922AlaLys: 3.922 ± 0.931
3.922AlaLeu: 3.922 ± 0.936
2.614AlaMet: 2.614 ± 2.83
3.268AlaAsn: 3.268 ± 1.375
3.922AlaPro: 3.922 ± 1.988
5.229AlaGln: 5.229 ± 3.665
2.614AlaArg: 2.614 ± 1.139
2.614AlaSer: 2.614 ± 1.918
7.19AlaThr: 7.19 ± 3.846
3.268AlaVal: 3.268 ± 0.885
1.307AlaTrp: 1.307 ± 0.903
4.575AlaTyr: 4.575 ± 1.115
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.654CysAsp: 0.654 ± 0.607
1.307CysGlu: 1.307 ± 0.775
0.0CysPhe: 0.0 ± 0.0
1.961CysGly: 1.961 ± 1.821
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
2.614CysLys: 2.614 ± 0.987
0.654CysLeu: 0.654 ± 0.451
1.961CysMet: 1.961 ± 0.61
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
0.0CysSer: 0.0 ± 0.0
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.654CysTyr: 0.654 ± 0.607
0.0CysXaa: 0.0 ± 0.0
Asp
1.961AspAla: 1.961 ± 0.994
0.0AspCys: 0.0 ± 0.0
1.961AspAsp: 1.961 ± 1.032
5.229AspGlu: 5.229 ± 1.241
3.922AspPhe: 3.922 ± 0.77
3.268AspGly: 3.268 ± 1.179
1.307AspHis: 1.307 ± 0.903
5.229AspIle: 5.229 ± 1.359
3.268AspLys: 3.268 ± 1.966
3.922AspLeu: 3.922 ± 1.321
1.307AspMet: 1.307 ± 0.685
3.922AspAsn: 3.922 ± 2.095
0.654AspPro: 0.654 ± 0.451
1.961AspGln: 1.961 ± 1.354
0.654AspArg: 0.654 ± 0.607
2.614AspSer: 2.614 ± 1.37
3.922AspThr: 3.922 ± 1.238
1.307AspVal: 1.307 ± 0.636
1.961AspTrp: 1.961 ± 0.846
5.229AspTyr: 5.229 ± 1.712
0.0AspXaa: 0.0 ± 0.0
Glu
6.536GluAla: 6.536 ± 2.298
0.0GluCys: 0.0 ± 0.0
1.961GluAsp: 1.961 ± 0.761
7.843GluGlu: 7.843 ± 2.067
3.268GluPhe: 3.268 ± 1.516
2.614GluGly: 2.614 ± 0.923
2.614GluHis: 2.614 ± 1.04
4.575GluIle: 4.575 ± 1.736
7.19GluLys: 7.19 ± 2.694
5.229GluLeu: 5.229 ± 2.844
1.307GluMet: 1.307 ± 1.335
8.497GluAsn: 8.497 ± 2.549
1.961GluPro: 1.961 ± 1.209
4.575GluGln: 4.575 ± 1.647
3.268GluArg: 3.268 ± 1.853
2.614GluSer: 2.614 ± 1.494
6.536GluThr: 6.536 ± 0.88
3.922GluVal: 3.922 ± 0.957
0.654GluTrp: 0.654 ± 0.451
3.922GluTyr: 3.922 ± 0.934
0.0GluXaa: 0.0 ± 0.0
Phe
3.922PheAla: 3.922 ± 1.461
0.0PheCys: 0.0 ± 0.0
1.961PheAsp: 1.961 ± 0.761
1.307PheGlu: 1.307 ± 1.16
2.614PhePhe: 2.614 ± 1.139
4.575PheGly: 4.575 ± 1.558
0.0PheHis: 0.0 ± 0.0
4.575PheIle: 4.575 ± 1.285
2.614PheLys: 2.614 ± 1.356
2.614PheLeu: 2.614 ± 1.561
1.307PheMet: 1.307 ± 0.52
3.922PheAsn: 3.922 ± 1.245
2.614PhePro: 2.614 ± 1.064
1.961PheGln: 1.961 ± 1.226
1.961PheArg: 1.961 ± 1.354
1.307PheSer: 1.307 ± 0.959
2.614PheThr: 2.614 ± 1.139
1.307PheVal: 1.307 ± 0.903
0.654PheTrp: 0.654 ± 0.658
3.922PheTyr: 3.922 ± 0.934
0.0PheXaa: 0.0 ± 0.0
Gly
5.229GlyAla: 5.229 ± 2.006
0.654GlyCys: 0.654 ± 0.607
7.843GlyAsp: 7.843 ± 2.313
7.843GlyGlu: 7.843 ± 2.088
0.0GlyPhe: 0.0 ± 0.0
5.882GlyGly: 5.882 ± 2.026
1.307GlyHis: 1.307 ± 0.685
2.614GlyIle: 2.614 ± 0.743
2.614GlyLys: 2.614 ± 1.829
3.922GlyLeu: 3.922 ± 0.932
1.307GlyMet: 1.307 ± 0.701
3.922GlyAsn: 3.922 ± 2.595
0.654GlyPro: 0.654 ± 0.451
1.961GlyGln: 1.961 ± 0.846
1.307GlyArg: 1.307 ± 0.52
3.922GlySer: 3.922 ± 1.489
5.882GlyThr: 5.882 ± 1.569
4.575GlyVal: 4.575 ± 1.899
0.654GlyTrp: 0.654 ± 0.451
3.268GlyTyr: 3.268 ± 0.869
0.0GlyXaa: 0.0 ± 0.0
His
1.307HisAla: 1.307 ± 1.214
0.0HisCys: 0.0 ± 0.0
1.307HisAsp: 1.307 ± 0.712
0.0HisGlu: 0.0 ± 0.0
0.654HisPhe: 0.654 ± 0.451
1.307HisGly: 1.307 ± 0.685
0.654HisHis: 0.654 ± 0.451
1.307HisIle: 1.307 ± 1.316
2.614HisLys: 2.614 ± 1.154
0.0HisLeu: 0.0 ± 0.0
1.961HisMet: 1.961 ± 0.965
1.307HisAsn: 1.307 ± 0.836
0.654HisPro: 0.654 ± 0.607
0.654HisGln: 0.654 ± 0.607
0.654HisArg: 0.654 ± 0.451
0.0HisSer: 0.0 ± 0.0
1.961HisThr: 1.961 ± 1.036
1.307HisVal: 1.307 ± 1.16
1.307HisTrp: 1.307 ± 0.903
1.961HisTyr: 1.961 ± 0.61
0.0HisXaa: 0.0 ± 0.0
Ile
4.575IleAla: 4.575 ± 1.535
0.654IleCys: 0.654 ± 0.658
7.19IleAsp: 7.19 ± 1.383
2.614IleGlu: 2.614 ± 1.403
3.922IlePhe: 3.922 ± 1.797
1.961IleGly: 1.961 ± 0.918
0.654IleHis: 0.654 ± 0.658
5.229IleIle: 5.229 ± 2.57
5.882IleLys: 5.882 ± 2.437
3.922IleLeu: 3.922 ± 1.53
1.307IleMet: 1.307 ± 0.798
5.229IleAsn: 5.229 ± 1.751
3.268IlePro: 3.268 ± 2.257
1.307IleGln: 1.307 ± 0.701
3.922IleArg: 3.922 ± 1.066
4.575IleSer: 4.575 ± 1.416
1.307IleThr: 1.307 ± 0.775
0.654IleVal: 0.654 ± 0.603
0.654IleTrp: 0.654 ± 0.607
5.882IleTyr: 5.882 ± 1.75
0.0IleXaa: 0.0 ± 0.0
Lys
6.536LysAla: 6.536 ± 1.566
0.0LysCys: 0.0 ± 0.0
3.268LysAsp: 3.268 ± 1.172
3.922LysGlu: 3.922 ± 1.298
4.575LysPhe: 4.575 ± 1.631
4.575LysGly: 4.575 ± 1.445
1.307LysHis: 1.307 ± 1.214
5.229LysIle: 5.229 ± 2.542
7.19LysLys: 7.19 ± 4.351
1.961LysLeu: 1.961 ± 1.821
1.961LysMet: 1.961 ± 1.655
3.922LysAsn: 3.922 ± 1.543
4.575LysPro: 4.575 ± 2.498
3.922LysGln: 3.922 ± 2.072
5.882LysArg: 5.882 ± 2.305
2.614LysSer: 2.614 ± 0.914
7.19LysThr: 7.19 ± 3.02
1.307LysVal: 1.307 ± 0.916
1.961LysTrp: 1.961 ± 0.984
1.961LysTyr: 1.961 ± 1.036
0.0LysXaa: 0.0 ± 0.0
Leu
3.922LeuAla: 3.922 ± 1.854
0.654LeuCys: 0.654 ± 0.607
2.614LeuAsp: 2.614 ± 0.548
3.268LeuGlu: 3.268 ± 1.172
3.922LeuPhe: 3.922 ± 1.79
3.922LeuGly: 3.922 ± 1.536
1.961LeuHis: 1.961 ± 1.204
2.614LeuIle: 2.614 ± 1.064
5.882LeuLys: 5.882 ± 3.441
3.268LeuLeu: 3.268 ± 0.947
1.961LeuMet: 1.961 ± 0.761
3.268LeuAsn: 3.268 ± 2.243
7.843LeuPro: 7.843 ± 3.073
2.614LeuGln: 2.614 ± 1.37
2.614LeuArg: 2.614 ± 0.743
1.961LeuSer: 1.961 ± 0.994
3.268LeuThr: 3.268 ± 1.223
1.961LeuVal: 1.961 ± 1.036
0.654LeuTrp: 0.654 ± 1.01
3.268LeuTyr: 3.268 ± 1.966
0.0LeuXaa: 0.0 ± 0.0
Met
1.307MetAla: 1.307 ± 0.52
0.654MetCys: 0.654 ± 0.451
1.307MetAsp: 1.307 ± 0.903
1.307MetGlu: 1.307 ± 0.712
1.307MetPhe: 1.307 ± 0.685
2.614MetGly: 2.614 ± 1.197
0.0MetHis: 0.0 ± 0.0
0.654MetIle: 0.654 ± 0.658
1.961MetLys: 1.961 ± 1.204
3.268MetLeu: 3.268 ± 1.149
0.654MetMet: 0.654 ± 0.707
0.654MetAsn: 0.654 ± 0.451
1.307MetPro: 1.307 ± 1.206
2.614MetGln: 2.614 ± 1.368
0.0MetArg: 0.0 ± 0.0
3.922MetSer: 3.922 ± 1.984
2.614MetThr: 2.614 ± 2.244
0.654MetVal: 0.654 ± 0.451
0.0MetTrp: 0.0 ± 0.0
1.307MetTyr: 1.307 ± 0.701
0.0MetXaa: 0.0 ± 0.0
Asn
3.922AsnAla: 3.922 ± 1.664
0.654AsnCys: 0.654 ± 0.658
1.961AsnAsp: 1.961 ± 1.226
3.268AsnGlu: 3.268 ± 1.172
0.0AsnPhe: 0.0 ± 0.0
1.961AsnGly: 1.961 ± 1.974
2.614AsnHis: 2.614 ± 1.154
6.536AsnIle: 6.536 ± 1.02
3.922AsnLys: 3.922 ± 0.806
5.229AsnLeu: 5.229 ± 1.476
1.307AsnMet: 1.307 ± 0.679
3.922AsnAsn: 3.922 ± 1.861
2.614AsnPro: 2.614 ± 2.209
1.307AsnGln: 1.307 ± 0.685
3.922AsnArg: 3.922 ± 0.932
2.614AsnSer: 2.614 ± 1.356
5.882AsnThr: 5.882 ± 1.952
2.614AsnVal: 2.614 ± 1.04
0.654AsnTrp: 0.654 ± 0.607
1.961AsnTyr: 1.961 ± 1.036
0.0AsnXaa: 0.0 ± 0.0
Pro
1.307ProAla: 1.307 ± 0.636
2.614ProCys: 2.614 ± 0.743
3.268ProAsp: 3.268 ± 0.985
4.575ProGlu: 4.575 ± 1.74
1.307ProPhe: 1.307 ± 0.903
1.961ProGly: 1.961 ± 0.761
0.654ProHis: 0.654 ± 0.607
5.882ProIle: 5.882 ± 2.068
1.307ProLys: 1.307 ± 1.206
1.961ProLeu: 1.961 ± 0.761
0.654ProMet: 0.654 ± 0.658
1.307ProAsn: 1.307 ± 0.903
0.654ProPro: 0.654 ± 0.607
3.268ProGln: 3.268 ± 1.6
0.654ProArg: 0.654 ± 0.607
1.961ProSer: 1.961 ± 1.017
3.268ProThr: 3.268 ± 0.978
3.268ProVal: 3.268 ± 1.282
0.0ProTrp: 0.0 ± 0.0
1.307ProTyr: 1.307 ± 0.52
0.0ProXaa: 0.0 ± 0.0
Gln
3.922GlnAla: 3.922 ± 1.907
0.0GlnCys: 0.0 ± 0.0
2.614GlnAsp: 2.614 ± 1.158
5.229GlnGlu: 5.229 ± 1.118
1.961GlnPhe: 1.961 ± 0.83
5.882GlnGly: 5.882 ± 1.417
0.0GlnHis: 0.0 ± 0.0
2.614GlnIle: 2.614 ± 0.845
3.922GlnLys: 3.922 ± 1.56
3.268GlnLeu: 3.268 ± 0.869
1.307GlnMet: 1.307 ± 0.636
2.614GlnAsn: 2.614 ± 1.424
1.961GlnPro: 1.961 ± 1.354
6.536GlnGln: 6.536 ± 2.372
1.307GlnArg: 1.307 ± 1.415
2.614GlnSer: 2.614 ± 1.672
4.575GlnThr: 4.575 ± 2.04
0.654GlnVal: 0.654 ± 0.451
0.0GlnTrp: 0.0 ± 0.0
3.922GlnTyr: 3.922 ± 1.199
0.0GlnXaa: 0.0 ± 0.0
Arg
2.614ArgAla: 2.614 ± 0.969
0.654ArgCys: 0.654 ± 0.607
2.614ArgAsp: 2.614 ± 1.04
1.307ArgGlu: 1.307 ± 0.712
1.961ArgPhe: 1.961 ± 0.709
1.961ArgGly: 1.961 ± 0.846
0.0ArgHis: 0.0 ± 0.0
1.307ArgIle: 1.307 ± 1.316
3.922ArgLys: 3.922 ± 2.153
1.961ArgLeu: 1.961 ± 0.761
1.961ArgMet: 1.961 ± 0.864
1.307ArgAsn: 1.307 ± 0.685
3.922ArgPro: 3.922 ± 0.955
3.922ArgGln: 3.922 ± 1.001
1.307ArgArg: 1.307 ± 0.52
1.307ArgSer: 1.307 ± 0.903
3.268ArgThr: 3.268 ± 1.088
0.654ArgVal: 0.654 ± 0.451
0.654ArgTrp: 0.654 ± 0.451
2.614ArgTyr: 2.614 ± 1.04
0.0ArgXaa: 0.0 ± 0.0
Ser
4.575SerAla: 4.575 ± 1.037
0.0SerCys: 0.0 ± 0.0
2.614SerAsp: 2.614 ± 1.099
5.882SerGlu: 5.882 ± 2.437
0.654SerPhe: 0.654 ± 0.451
6.536SerGly: 6.536 ± 3.379
0.654SerHis: 0.654 ± 0.658
2.614SerIle: 2.614 ± 1.332
0.654SerLys: 0.654 ± 0.607
3.268SerLeu: 3.268 ± 0.618
1.961SerMet: 1.961 ± 1.267
3.268SerAsn: 3.268 ± 1.074
1.307SerPro: 1.307 ± 0.903
0.654SerGln: 0.654 ± 0.707
3.922SerArg: 3.922 ± 1.988
2.614SerSer: 2.614 ± 1.37
3.922SerThr: 3.922 ± 0.712
1.307SerVal: 1.307 ± 0.903
0.654SerTrp: 0.654 ± 0.451
0.654SerTyr: 0.654 ± 0.658
0.0SerXaa: 0.0 ± 0.0
Thr
7.843ThrAla: 7.843 ± 4.749
0.654ThrCys: 0.654 ± 0.607
5.229ThrAsp: 5.229 ± 1.486
5.882ThrGlu: 5.882 ± 2.105
3.268ThrPhe: 3.268 ± 1.228
4.575ThrGly: 4.575 ± 1.728
1.307ThrHis: 1.307 ± 0.712
6.536ThrIle: 6.536 ± 1.677
7.19ThrLys: 7.19 ± 2.064
5.882ThrLeu: 5.882 ± 1.853
1.961ThrMet: 1.961 ± 0.846
1.961ThrAsn: 1.961 ± 0.709
1.307ThrPro: 1.307 ± 0.903
3.922ThrGln: 3.922 ± 1.499
2.614ThrArg: 2.614 ± 1.332
3.922ThrSer: 3.922 ± 1.268
3.922ThrThr: 3.922 ± 1.461
1.307ThrVal: 1.307 ± 0.52
1.307ThrTrp: 1.307 ± 0.712
5.229ThrTyr: 5.229 ± 1.308
0.0ThrXaa: 0.0 ± 0.0
Val
3.922ValAla: 3.922 ± 1.834
0.0ValCys: 0.0 ± 0.0
0.654ValAsp: 0.654 ± 0.451
3.922ValGlu: 3.922 ± 1.111
1.961ValPhe: 1.961 ± 1.354
1.961ValGly: 1.961 ± 1.017
0.654ValHis: 0.654 ± 0.658
1.307ValIle: 1.307 ± 0.52
1.961ValLys: 1.961 ± 1.209
3.268ValLeu: 3.268 ± 1.179
0.654ValMet: 0.654 ± 0.451
1.307ValAsn: 1.307 ± 0.903
1.307ValPro: 1.307 ± 0.52
1.307ValGln: 1.307 ± 0.903
1.307ValArg: 1.307 ± 0.903
3.268ValSer: 3.268 ± 1.207
3.922ValThr: 3.922 ± 1.523
0.654ValVal: 0.654 ± 0.451
0.0ValTrp: 0.0 ± 0.0
0.654ValTyr: 0.654 ± 0.603
0.0ValXaa: 0.0 ± 0.0
Trp
3.268TrpAla: 3.268 ± 0.885
0.0TrpCys: 0.0 ± 0.0
0.654TrpAsp: 0.654 ± 0.607
2.614TrpGlu: 2.614 ± 1.37
0.654TrpPhe: 0.654 ± 0.451
0.654TrpGly: 0.654 ± 0.607
0.654TrpHis: 0.654 ± 0.451
0.654TrpIle: 0.654 ± 0.451
1.307TrpLys: 1.307 ± 1.151
0.654TrpLeu: 0.654 ± 0.658
0.0TrpMet: 0.0 ± 0.0
0.654TrpAsn: 0.654 ± 0.658
0.0TrpPro: 0.0 ± 0.0
0.654TrpGln: 0.654 ± 0.707
0.0TrpArg: 0.0 ± 0.0
0.654TrpSer: 0.654 ± 0.451
0.0TrpThr: 0.0 ± 0.0
1.307TrpVal: 1.307 ± 0.712
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
6.536TyrAla: 6.536 ± 2.335
1.307TyrCys: 1.307 ± 1.214
0.654TyrAsp: 0.654 ± 0.607
5.229TyrGlu: 5.229 ± 1.308
5.229TyrPhe: 5.229 ± 1.97
1.961TyrGly: 1.961 ± 0.761
2.614TyrHis: 2.614 ± 1.04
0.0TyrIle: 0.0 ± 0.0
3.922TyrLys: 3.922 ± 2.486
3.268TyrLeu: 3.268 ± 1.282
0.0TyrMet: 0.0 ± 0.0
2.614TyrAsn: 2.614 ± 0.845
0.654TyrPro: 0.654 ± 0.451
6.536TyrGln: 6.536 ± 1.857
1.307TyrArg: 1.307 ± 0.903
2.614TyrSer: 2.614 ± 0.969
4.575TyrThr: 4.575 ± 1.286
1.961TyrVal: 1.961 ± 1.036
1.307TyrTrp: 1.307 ± 0.775
1.961TyrTyr: 1.961 ± 0.761
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1531 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski