Amino acid dipeptide frequency for Microbacterium phage Ixel

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteins.
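As a rough illustration of the idea, the sketch below counts overlapping pairs of adjacent residues and reports them in per mille. It is not the code used by Proteome-pI: the function name `dipeptide_per_mille`, the one-letter residue codes, the toy sequences, and the choice to normalize by the total number of counted dipeptides are all assumptions made for this example.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins.

    Assumption: frequencies are normalized by the total number of
    dipeptides counted; the database may normalize differently.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of length 2 along each protein separately,
        # so a pair never spans the boundary between two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences (one-letter codes), not taken from phage Ixel.
toy_proteins = ["MAAL", "MKAA"]
freqs = dipeptide_per_mille(toy_proteins)
```

Here "MAAL" contributes MA, AA, AL and "MKAA" contributes MK, KA, AA, so `freqs["AA"]` is two out of six dipeptides.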

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
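The arithmetic behind this baseline can be sketched as follows. The helper names `uniform_expectation_per_mille` and `classify` are hypothetical, and the simple greater-than or less-than comparison against the uniform expectation is an assumption; the page does not state the exact rule used for the red and blue marking.

```python
def uniform_expectation_per_mille(n_residues=20):
    """Expected per mille frequency if all ordered pairs were equally likely."""
    return 1000.0 / (n_residues ** 2)


def classify(observed_per_mille, expected_per_mille):
    """Label a dipeptide relative to the expectation (assumed rule)."""
    if observed_per_mille > expected_per_mille:
        return "over"   # would be marked in red
    if observed_per_mille < expected_per_mille:
        return "under"  # would be marked in blue
    return "as expected"


expected_20 = uniform_expectation_per_mille()    # 20 standard amino acids
expected_21 = uniform_expectation_per_mille(21)  # Xaa counted as a 21st
ala_ala = classify(12.219, expected_20)  # AlaAla value from the table
ala_cys = classify(0.928, expected_20)   # AlaCys value from the table
```

With 20 amino acids the expectation is 1000/400 = 2.5‰, so AlaAla (12.219‰) comes out as overrepresented and AlaCys (0.928‰) as underrepresented under this rule.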

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions (for example, 12.219‰ corresponds to 0.012219).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 12.219 ± 1.248
AlaCys: 0.928 ± 0.278
AlaAsp: 4.408 ± 0.697
AlaGlu: 7.888 ± 0.937
AlaPhe: 3.557 ± 0.483
AlaGly: 6.96 ± 0.973
AlaHis: 1.779 ± 0.447
AlaIle: 5.955 ± 0.958
AlaLys: 4.717 ± 0.758
AlaLeu: 10.749 ± 1.137
AlaMet: 2.011 ± 0.4
AlaAsn: 2.552 ± 0.523
AlaPro: 4.099 ± 0.722
AlaGln: 5.104 ± 0.582
AlaArg: 6.109 ± 0.783
AlaSer: 6.341 ± 0.908
AlaThr: 5.877 ± 0.921
AlaVal: 7.037 ± 0.776
AlaTrp: 2.32 ± 0.487
AlaTyr: 1.856 ± 0.293
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.464 ± 0.204
CysCys: 0.0 ± 0.0
CysAsp: 0.309 ± 0.153
CysGlu: 0.155 ± 0.102
CysPhe: 0.155 ± 0.114
CysGly: 0.619 ± 0.214
CysHis: 0.232 ± 0.136
CysIle: 0.077 ± 0.072
CysLys: 0.309 ± 0.156
CysLeu: 0.309 ± 0.161
CysMet: 0.0 ± 0.0
CysAsn: 0.232 ± 0.124
CysPro: 0.696 ± 0.244
CysGln: 0.309 ± 0.173
CysArg: 0.619 ± 0.234
CysSer: 0.464 ± 0.18
CysThr: 0.309 ± 0.133
CysVal: 0.387 ± 0.139
CysTrp: 0.309 ± 0.134
CysTyr: 0.232 ± 0.129
CysXaa: 0.0 ± 0.0

Asp
AspAla: 3.789 ± 0.628
AspCys: 0.387 ± 0.182
AspAsp: 5.104 ± 1.211
AspGlu: 5.413 ± 1.257
AspPhe: 1.469 ± 0.286
AspGly: 4.563 ± 0.715
AspHis: 1.083 ± 0.265
AspIle: 3.171 ± 0.504
AspLys: 1.701 ± 0.371
AspLeu: 5.568 ± 0.664
AspMet: 1.547 ± 0.353
AspAsn: 1.083 ± 0.318
AspPro: 3.712 ± 0.528
AspGln: 2.939 ± 0.593
AspArg: 3.325 ± 0.57
AspSer: 3.016 ± 0.476
AspThr: 3.635 ± 0.495
AspVal: 4.021 ± 0.63
AspTrp: 1.779 ± 0.465
AspTyr: 1.547 ± 0.334
AspXaa: 0.0 ± 0.0

Glu
GluAla: 8.507 ± 1.213
GluCys: 0.541 ± 0.229
GluAsp: 5.259 ± 1.155
GluGlu: 5.645 ± 1.306
GluPhe: 1.933 ± 0.444
GluGly: 4.949 ± 0.679
GluHis: 1.701 ± 0.298
GluIle: 1.933 ± 0.37
GluLys: 2.861 ± 0.531
GluLeu: 6.032 ± 0.695
GluMet: 1.856 ± 0.411
GluAsn: 1.779 ± 0.397
GluPro: 2.552 ± 0.574
GluGln: 2.552 ± 0.436
GluArg: 5.181 ± 0.747
GluSer: 2.475 ± 0.442
GluThr: 4.408 ± 0.529
GluVal: 4.485 ± 0.678
GluTrp: 0.773 ± 0.218
GluTyr: 2.243 ± 0.299
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.243 ± 0.39
PheCys: 0.077 ± 0.076
PheAsp: 1.701 ± 0.378
PheGlu: 2.707 ± 0.673
PhePhe: 0.696 ± 0.202
PheGly: 2.243 ± 0.494
PheHis: 0.541 ± 0.22
PheIle: 0.928 ± 0.273
PheLys: 1.392 ± 0.275
PheLeu: 2.32 ± 0.374
PheMet: 1.237 ± 0.318
PheAsn: 0.773 ± 0.219
PhePro: 1.392 ± 0.384
PheGln: 1.083 ± 0.278
PheArg: 1.933 ± 0.429
PheSer: 1.933 ± 0.324
PheThr: 1.701 ± 0.339
PheVal: 1.856 ± 0.383
PheTrp: 0.464 ± 0.166
PheTyr: 0.464 ± 0.187
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 7.424 ± 0.961
GlyCys: 0.464 ± 0.195
GlyAsp: 4.408 ± 0.471
GlyGlu: 3.248 ± 0.446
GlyPhe: 3.016 ± 0.444
GlyGly: 6.651 ± 0.995
GlyHis: 1.237 ± 0.359
GlyIle: 4.485 ± 0.792
GlyLys: 4.64 ± 0.667
GlyLeu: 7.192 ± 0.743
GlyMet: 2.397 ± 0.442
GlyAsn: 3.171 ± 0.47
GlyPro: 3.48 ± 0.513
GlyGln: 4.099 ± 0.579
GlyArg: 4.563 ± 0.681
GlySer: 4.795 ± 0.758
GlyThr: 5.568 ± 0.557
GlyVal: 5.568 ± 0.699
GlyTrp: 1.005 ± 0.228
GlyTyr: 2.552 ± 0.489
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.547 ± 0.394
HisCys: 0.155 ± 0.102
HisAsp: 1.005 ± 0.225
HisGlu: 1.315 ± 0.35
HisPhe: 1.005 ± 0.303
HisGly: 2.165 ± 0.402
HisHis: 0.387 ± 0.184
HisIle: 1.315 ± 0.352
HisLys: 1.392 ± 0.418
HisLeu: 1.624 ± 0.382
HisMet: 0.464 ± 0.159
HisAsn: 0.851 ± 0.265
HisPro: 1.547 ± 0.388
HisGln: 0.541 ± 0.194
HisArg: 0.851 ± 0.205
HisSer: 0.696 ± 0.185
HisThr: 0.851 ± 0.274
HisVal: 1.005 ± 0.289
HisTrp: 0.077 ± 0.074
HisTyr: 1.083 ± 0.257
HisXaa: 0.0 ± 0.0

Ile
IleAla: 5.413 ± 0.731
IleCys: 0.232 ± 0.126
IleAsp: 3.867 ± 0.536
IleGlu: 4.253 ± 0.624
IlePhe: 0.696 ± 0.231
IleGly: 4.176 ± 0.622
IleHis: 1.083 ± 0.281
IleIle: 3.712 ± 0.643
IleLys: 2.243 ± 0.42
IleLeu: 2.32 ± 0.492
IleMet: 1.005 ± 0.321
IleAsn: 1.933 ± 0.384
IlePro: 2.861 ± 0.697
IleGln: 2.707 ± 0.85
IleArg: 2.397 ± 0.527
IleSer: 3.325 ± 0.64
IleThr: 4.331 ± 0.705
IleVal: 3.712 ± 0.779
IleTrp: 0.619 ± 0.223
IleTyr: 1.005 ± 0.317
IleXaa: 0.0 ± 0.0

Lys
LysAla: 6.419 ± 0.863
LysCys: 0.077 ± 0.085
LysAsp: 2.629 ± 0.498
LysGlu: 2.243 ± 0.486
LysPhe: 0.773 ± 0.337
LysGly: 3.789 ± 0.674
LysHis: 0.619 ± 0.199
LysIle: 2.475 ± 0.466
LysLys: 2.552 ± 0.681
LysLeu: 4.408 ± 0.441
LysMet: 0.851 ± 0.224
LysAsn: 1.237 ± 0.269
LysPro: 3.248 ± 0.64
LysGln: 1.547 ± 0.35
LysArg: 2.397 ± 0.513
LysSer: 2.784 ± 0.424
LysThr: 3.325 ± 0.467
LysVal: 3.789 ± 0.616
LysTrp: 1.16 ± 0.342
LysTyr: 0.541 ± 0.198
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 8.661 ± 0.842
LeuCys: 0.619 ± 0.214
LeuAsp: 5.568 ± 0.614
LeuGlu: 4.717 ± 0.595
LeuPhe: 2.088 ± 0.531
LeuGly: 7.192 ± 0.918
LeuHis: 1.779 ± 0.405
LeuIle: 5.259 ± 0.969
LeuLys: 3.557 ± 0.547
LeuLeu: 7.656 ± 0.738
LeuMet: 1.701 ± 0.383
LeuAsn: 3.171 ± 0.472
LeuPro: 4.64 ± 0.726
LeuGln: 2.707 ± 0.353
LeuArg: 5.568 ± 0.887
LeuSer: 4.717 ± 0.567
LeuThr: 6.109 ± 0.857
LeuVal: 6.651 ± 0.756
LeuTrp: 1.16 ± 0.24
LeuTyr: 2.011 ± 0.349
LeuXaa: 0.0 ± 0.0

Met
MetAla: 3.403 ± 0.443
MetCys: 0.155 ± 0.116
MetAsp: 1.392 ± 0.315
MetGlu: 1.005 ± 0.266
MetPhe: 0.619 ± 0.242
MetGly: 1.624 ± 0.428
MetHis: 0.387 ± 0.178
MetIle: 1.237 ± 0.269
MetLys: 0.696 ± 0.226
MetLeu: 2.011 ± 0.453
MetMet: 0.619 ± 0.236
MetAsn: 0.619 ± 0.209
MetPro: 1.315 ± 0.285
MetGln: 1.005 ± 0.238
MetArg: 1.083 ± 0.304
MetSer: 2.243 ± 0.45
MetThr: 1.624 ± 0.364
MetVal: 1.624 ± 0.328
MetTrp: 0.309 ± 0.152
MetTyr: 0.696 ± 0.227
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.093 ± 0.485
AsnCys: 0.0 ± 0.0
AsnAsp: 1.392 ± 0.307
AsnGlu: 2.475 ± 0.394
AsnPhe: 0.696 ± 0.186
AsnGly: 3.171 ± 0.652
AsnHis: 0.387 ± 0.151
AsnIle: 1.469 ± 0.297
AsnLys: 1.469 ± 0.301
AsnLeu: 2.784 ± 0.458
AsnMet: 0.696 ± 0.213
AsnAsn: 1.005 ± 0.248
AsnPro: 2.165 ± 0.462
AsnGln: 1.547 ± 0.336
AsnArg: 1.237 ± 0.306
AsnSer: 2.165 ± 0.388
AsnThr: 1.933 ± 0.432
AsnVal: 2.243 ± 0.469
AsnTrp: 0.387 ± 0.181
AsnTyr: 1.392 ± 0.292
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 6.496 ± 0.909
ProCys: 0.077 ± 0.091
ProAsp: 3.171 ± 0.509
ProGlu: 5.104 ± 0.774
ProPhe: 1.083 ± 0.293
ProGly: 4.485 ± 0.711
ProHis: 0.851 ± 0.301
ProIle: 2.088 ± 0.377
ProLys: 2.243 ± 0.504
ProLeu: 4.099 ± 0.604
ProMet: 0.773 ± 0.243
ProAsn: 1.547 ± 0.324
ProPro: 1.392 ± 0.514
ProGln: 2.861 ± 0.762
ProArg: 2.243 ± 0.517
ProSer: 3.248 ± 0.512
ProThr: 3.789 ± 0.636
ProVal: 4.64 ± 0.619
ProTrp: 1.083 ± 0.36
ProTyr: 1.392 ± 0.279
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 5.104 ± 0.602
GlnCys: 0.155 ± 0.169
GlnAsp: 2.243 ± 0.386
GlnGlu: 3.248 ± 0.569
GlnPhe: 0.541 ± 0.193
GlnGly: 3.248 ± 0.646
GlnHis: 1.392 ± 0.259
GlnIle: 2.243 ± 0.505
GlnLys: 1.624 ± 0.384
GlnLeu: 3.867 ± 0.696
GlnMet: 1.392 ± 0.302
GlnAsn: 1.779 ± 0.32
GlnPro: 2.707 ± 0.369
GlnGln: 2.707 ± 0.508
GlnArg: 2.861 ± 0.502
GlnSer: 2.397 ± 0.376
GlnThr: 1.933 ± 0.493
GlnVal: 2.784 ± 0.501
GlnTrp: 0.464 ± 0.167
GlnTyr: 1.16 ± 0.348
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 4.795 ± 0.75
ArgCys: 0.619 ± 0.209
ArgAsp: 2.629 ± 0.425
ArgGlu: 3.789 ± 0.616
ArgPhe: 1.16 ± 0.37
ArgGly: 4.099 ± 0.581
ArgHis: 0.928 ± 0.308
ArgIle: 2.784 ± 0.467
ArgLys: 3.48 ± 0.577
ArgLeu: 5.181 ± 0.622
ArgMet: 1.701 ± 0.399
ArgAsn: 2.011 ± 0.386
ArgPro: 2.397 ± 0.368
ArgGln: 1.933 ± 0.433
ArgArg: 3.635 ± 0.648
ArgSer: 3.712 ± 0.497
ArgThr: 2.629 ± 0.455
ArgVal: 5.645 ± 0.654
ArgTrp: 1.083 ± 0.3
ArgTyr: 1.392 ± 0.524
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 5.723 ± 0.725
SerCys: 0.232 ± 0.138
SerAsp: 3.325 ± 0.503
SerGlu: 2.861 ± 0.441
SerPhe: 2.011 ± 0.372
SerGly: 5.336 ± 0.775
SerHis: 1.083 ± 0.294
SerIle: 2.784 ± 0.465
SerLys: 2.939 ± 0.421
SerLeu: 4.795 ± 0.707
SerMet: 2.243 ± 0.366
SerAsn: 1.547 ± 0.425
SerPro: 3.635 ± 0.454
SerGln: 2.243 ± 0.437
SerArg: 2.475 ± 0.438
SerSer: 3.712 ± 0.525
SerThr: 5.336 ± 0.751
SerVal: 3.635 ± 0.59
SerTrp: 1.16 ± 0.306
SerTyr: 2.011 ± 0.443
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 6.651 ± 1.029
ThrCys: 0.309 ± 0.196
ThrAsp: 2.32 ± 0.564
ThrGlu: 3.248 ± 0.658
ThrPhe: 3.016 ± 0.493
ThrGly: 5.8 ± 0.778
ThrHis: 1.701 ± 0.412
ThrIle: 3.944 ± 0.766
ThrLys: 2.552 ± 0.451
ThrLeu: 6.187 ± 0.761
ThrMet: 1.237 ± 0.343
ThrAsn: 2.32 ± 0.734
ThrPro: 4.485 ± 0.739
ThrGln: 2.397 ± 0.432
ThrArg: 3.403 ± 0.517
ThrSer: 3.789 ± 0.545
ThrThr: 4.408 ± 0.822
ThrVal: 5.8 ± 0.982
ThrTrp: 1.237 ± 0.284
ThrTyr: 1.469 ± 0.264
ThrXaa: 0.0 ± 0.0

Val
ValAla: 7.269 ± 0.674
ValCys: 0.619 ± 0.241
ValAsp: 5.104 ± 0.609
ValGlu: 5.104 ± 0.554
ValPhe: 1.856 ± 0.422
ValGly: 5.877 ± 0.895
ValHis: 1.469 ± 0.382
ValIle: 4.176 ± 0.732
ValLys: 4.331 ± 0.495
ValLeu: 4.949 ± 0.755
ValMet: 1.315 ± 0.296
ValAsn: 1.779 ± 0.313
ValPro: 3.712 ± 0.752
ValGln: 3.867 ± 0.654
ValArg: 3.403 ± 0.521
ValSer: 3.867 ± 0.616
ValThr: 5.413 ± 0.778
ValVal: 5.723 ± 0.619
ValTrp: 2.784 ± 0.601
ValTyr: 1.779 ± 0.372
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.16 ± 0.298
TrpCys: 0.309 ± 0.19
TrpAsp: 1.392 ± 0.278
TrpGlu: 1.392 ± 0.324
TrpPhe: 0.619 ± 0.216
TrpGly: 0.696 ± 0.29
TrpHis: 0.851 ± 0.245
TrpIle: 1.16 ± 0.29
TrpLys: 0.851 ± 0.254
TrpLeu: 1.856 ± 0.429
TrpMet: 0.232 ± 0.123
TrpAsn: 1.083 ± 0.276
TrpPro: 0.851 ± 0.292
TrpGln: 0.851 ± 0.262
TrpArg: 0.619 ± 0.195
TrpSer: 1.315 ± 0.471
TrpThr: 1.315 ± 0.298
TrpVal: 1.624 ± 0.347
TrpTrp: 0.387 ± 0.195
TrpTyr: 0.619 ± 0.208
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.165 ± 0.425
TyrCys: 0.232 ± 0.133
TyrAsp: 1.624 ± 0.312
TyrGlu: 1.856 ± 0.472
TyrPhe: 0.696 ± 0.219
TyrGly: 2.32 ± 0.346
TyrHis: 0.387 ± 0.138
TyrIle: 0.696 ± 0.222
TyrLys: 1.315 ± 0.393
TyrLeu: 1.624 ± 0.299
TyrMet: 0.309 ± 0.138
TyrAsn: 1.237 ± 0.285
TyrPro: 1.779 ± 0.427
TyrGln: 0.851 ± 0.239
TyrArg: 1.701 ± 0.343
TyrSer: 2.088 ± 0.384
TyrThr: 1.779 ± 0.38
TyrVal: 2.165 ± 0.397
TyrTrp: 0.619 ± 0.245
TyrTyr: 1.16 ± 0.358
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 60 proteins (12932 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
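One plausible reading of this note is sketched below: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. The function names, the toy sequences, the fixed seed, and the use of the sample standard deviation as the error are assumptions for illustration; the page does not specify these details.

```python
import random
import statistics


def count_pair(seq, pair):
    """Count overlapping occurrences of a dipeptide in one sequence."""
    return sum(1 for i in range(len(seq) - 1) if seq[i:i + 2] == pair)


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread (in per mille) of a dipeptide frequency over protein resamples.

    Assumption: the error is the sample standard deviation of the
    per-resample frequencies.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(proteins, k=len(proteins))
        hits = sum(count_pair(s, pair) for s in sample)
        total = sum(max(len(s) - 1, 0) for s in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)


# Hypothetical toy sequences (one-letter codes), not taken from phage Ixel.
toy_proteins = ["MAAL", "MKAA", "MGLV", "MAAAK"]
error_aa = bootstrap_error(toy_proteins, "AA")
error_identical = bootstrap_error(["MAAL"] * 5, "AA")
```

Resampling at the protein level rather than at the residue level keeps each protein intact, so the estimate reflects variation between proteins. When all proteins are identical, every resample gives the same frequency and the estimated error is zero.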

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski