Amino acid dipeptide frequency for Microbacterium phage Naby

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each dipeptide would have a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red, and underrepresented ones are marked in blue, in the table.
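As an illustration of the counting described above, here is a minimal Python sketch. It is not the Proteome-pI implementation: the function name `dipeptide_per_mille`, the one-letter alphabet, the toy sequences, and the choice to count overlapping pairs within each protein only are all assumptions made for this example.

```python
from collections import Counter
from itertools import product

# 20 standard residues in one-letter code, in the table's order (Ala, Cys, Asp, ...)
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"  # X stands for Xaa (non-standard or unknown)

# Uniform expectation: 1 of 400 dipeptides = 0.25 % = 2.5 per mille
UNIFORM_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted inside each protein only,
    and every non-standard letter is mapped to 'X'.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


# Hypothetical toy input, not taken from the phage proteome
toy_proteins = ["MAAGL", "AAK"]
freqs = dipeptide_per_mille(toy_proteins)

# Dipeptides above the uniform expectation (the ones the page marks in red)
overrepresented = sorted(dp for dp, f in freqs.items() if f > UNIFORM_PER_MILLE)
print(len(freqs), overrepresented)
```

With only six dipeptides in the toy input, every observed pair lands far above 2.5‰; on a real proteome the same comparison separates over- and underrepresented pairs.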

All values are presented in per mille (‰); to obtain fractions, multiply them by 10^-3.
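The unit conversion can be sketched in a few lines of Python. The helper names are hypothetical; the AlaAla value is the one listed in the table, and the comparison assumes the uniform expectation of 1/400 per dipeptide.

```python
# Uniform expectation for 400 equally likely dipeptides, in per mille
UNIFORM_PER_MILLE = 1000.0 / 400


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


def fold_vs_uniform(value):
    """Ratio of an observed per mille value to the uniform expectation."""
    return value / UNIFORM_PER_MILLE


ala_ala = 27.732  # AlaAla frequency from the table, in per mille
print(per_mille_to_fraction(ala_ala))
print(per_mille_to_percent(ala_ala))
print(fold_vs_uniform(ala_ala))
```

A ratio above 1 corresponds to a dipeptide that is more frequent than the uniform expectation, and a ratio below 1 to an underrepresented one.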

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰, value ± error), grouped by first amino acid; within each group the second amino acid follows the order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 27.732 ± 2.888
AlaCys: 0.37 ± 0.218
AlaAsp: 9.429 ± 1.354
AlaGlu: 7.765 ± 1.054
AlaPhe: 4.807 ± 0.854
AlaGly: 20.891 ± 2.737
AlaHis: 1.479 ± 0.548
AlaIle: 7.58 ± 1.202
AlaLys: 3.698 ± 1.105
AlaLeu: 14.236 ± 1.799
AlaMet: 3.513 ± 0.75
AlaAsn: 2.403 ± 0.74
AlaPro: 7.025 ± 1.196
AlaGln: 4.252 ± 0.899
AlaArg: 9.244 ± 1.659
AlaSer: 7.395 ± 1.418
AlaThr: 7.21 ± 1.403
AlaVal: 7.395 ± 1.614
AlaTrp: 2.403 ± 0.605
AlaTyr: 2.773 ± 0.701
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.37 ± 0.244
CysCys: 0.0 ± 0.0
CysAsp: 0.555 ± 0.257
CysGlu: 0.185 ± 0.198
CysPhe: 0.0 ± 0.0
CysGly: 0.185 ± 0.181
CysHis: 0.185 ± 0.181
CysIle: 0.0 ± 0.0
CysLys: 0.0 ± 0.0
CysLeu: 0.185 ± 0.172
CysMet: 0.0 ± 0.0
CysAsn: 0.37 ± 0.225
CysPro: 0.37 ± 0.261
CysGln: 0.185 ± 0.198
CysArg: 0.555 ± 0.363
CysSer: 0.555 ± 0.313
CysThr: 0.74 ± 0.338
CysVal: 0.37 ± 0.232
CysTrp: 0.37 ± 0.243
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 11.462 ± 1.184
AspCys: 0.37 ± 0.361
AspAsp: 1.849 ± 0.844
AspGlu: 5.731 ± 1.479
AspPhe: 1.664 ± 0.486
AspGly: 4.807 ± 1.021
AspHis: 1.109 ± 0.51
AspIle: 0.924 ± 0.53
AspLys: 0.555 ± 0.347
AspLeu: 6.101 ± 1.107
AspMet: 0.555 ± 0.402
AspAsn: 1.109 ± 0.448
AspPro: 2.219 ± 0.778
AspGln: 1.294 ± 0.431
AspArg: 2.958 ± 1.017
AspSer: 2.588 ± 0.529
AspThr: 2.034 ± 0.46
AspVal: 4.437 ± 0.978
AspTrp: 0.37 ± 0.238
AspTyr: 1.664 ± 0.486
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.916 ± 1.133
GluCys: 0.0 ± 0.0
GluAsp: 2.588 ± 0.775
GluGlu: 0.74 ± 0.495
GluPhe: 0.924 ± 0.38
GluGly: 2.958 ± 0.814
GluHis: 0.555 ± 0.32
GluIle: 4.437 ± 0.909
GluLys: 0.185 ± 0.159
GluLeu: 9.429 ± 1.764
GluMet: 1.109 ± 0.361
GluAsn: 1.664 ± 0.655
GluPro: 3.513 ± 0.876
GluGln: 2.403 ± 0.627
GluArg: 4.067 ± 1.215
GluSer: 2.219 ± 0.529
GluThr: 4.622 ± 1.148
GluVal: 1.849 ± 0.527
GluTrp: 1.294 ± 0.441
GluTyr: 1.294 ± 0.476
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.882 ± 0.904
PheCys: 0.185 ± 0.198
PheAsp: 2.403 ± 0.802
PheGlu: 1.479 ± 0.608
PhePhe: 0.555 ± 0.321
PheGly: 4.437 ± 0.959
PheHis: 0.37 ± 0.289
PheIle: 0.555 ± 0.3
PheLys: 1.294 ± 0.506
PheLeu: 3.143 ± 0.996
PheMet: 0.37 ± 0.232
PheAsn: 0.924 ± 0.391
PhePro: 0.37 ± 0.209
PheGln: 0.924 ± 0.432
PheArg: 1.294 ± 0.502
PheSer: 1.479 ± 0.752
PheThr: 3.143 ± 0.678
PheVal: 2.034 ± 0.508
PheTrp: 0.185 ± 0.2
PheTyr: 0.74 ± 0.36
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 12.017 ± 2.022
GlyCys: 0.555 ± 0.323
GlyAsp: 4.252 ± 1.043
GlyGlu: 4.252 ± 1.233
GlyPhe: 2.958 ± 0.681
GlyGly: 6.286 ± 1.112
GlyHis: 1.109 ± 0.724
GlyIle: 3.698 ± 0.799
GlyLys: 3.513 ± 0.716
GlyLeu: 8.689 ± 1.67
GlyMet: 1.479 ± 0.404
GlyAsn: 3.143 ± 0.858
GlyPro: 2.773 ± 0.937
GlyGln: 4.067 ± 0.761
GlyArg: 6.471 ± 0.987
GlySer: 5.546 ± 0.709
GlyThr: 5.546 ± 1.119
GlyVal: 8.874 ± 1.083
GlyTrp: 1.294 ± 0.532
GlyTyr: 3.328 ± 0.9
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.403 ± 0.757
HisCys: 0.0 ± 0.0
HisAsp: 0.74 ± 0.347
HisGlu: 0.74 ± 0.344
HisPhe: 0.37 ± 0.242
HisGly: 1.479 ± 0.52
HisHis: 0.185 ± 0.157
HisIle: 0.37 ± 0.244
HisLys: 0.37 ± 0.277
HisLeu: 1.109 ± 0.452
HisMet: 0.0 ± 0.0
HisAsn: 0.37 ± 0.21
HisPro: 0.74 ± 0.308
HisGln: 0.0 ± 0.0
HisArg: 0.924 ± 0.495
HisSer: 0.555 ± 0.254
HisThr: 0.0 ± 0.0
HisVal: 2.034 ± 0.607
HisTrp: 0.0 ± 0.0
HisTyr: 0.555 ± 0.265
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.437 ± 1.019
IleCys: 0.37 ± 0.278
IleAsp: 3.698 ± 0.652
IleGlu: 5.177 ± 1.541
IlePhe: 0.924 ± 0.298
IleGly: 2.773 ± 0.804
IleHis: 0.74 ± 0.342
IleIle: 1.664 ± 0.62
IleLys: 1.109 ± 0.363
IleLeu: 2.403 ± 0.612
IleMet: 0.185 ± 0.238
IleAsn: 0.555 ± 0.319
IlePro: 2.588 ± 0.834
IleGln: 2.773 ± 0.679
IleArg: 3.328 ± 0.833
IleSer: 2.958 ± 0.735
IleThr: 2.958 ± 0.58
IleVal: 3.513 ± 0.785
IleTrp: 0.37 ± 0.247
IleTyr: 0.185 ± 0.172
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.403 ± 0.451
LysCys: 0.0 ± 0.0
LysAsp: 2.034 ± 0.684
LysGlu: 0.0 ± 0.0
LysPhe: 0.74 ± 0.396
LysGly: 1.294 ± 0.49
LysHis: 0.185 ± 0.216
LysIle: 1.109 ± 0.465
LysLys: 0.185 ± 0.202
LysLeu: 2.773 ± 0.667
LysMet: 0.37 ± 0.191
LysAsn: 1.479 ± 0.756
LysPro: 0.924 ± 0.457
LysGln: 0.555 ± 0.339
LysArg: 2.034 ± 0.999
LysSer: 1.479 ± 0.446
LysThr: 1.479 ± 0.597
LysVal: 0.37 ± 0.267
LysTrp: 0.37 ± 0.252
LysTyr: 0.185 ± 0.202
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 14.42 ± 1.341
LeuCys: 0.555 ± 0.315
LeuAsp: 7.025 ± 1.506
LeuGlu: 2.219 ± 0.74
LeuPhe: 4.437 ± 1.174
LeuGly: 9.244 ± 1.421
LeuHis: 1.294 ± 0.421
LeuIle: 4.437 ± 1.44
LeuLys: 1.109 ± 0.476
LeuLeu: 8.135 ± 1.625
LeuMet: 2.219 ± 0.505
LeuAsn: 4.252 ± 0.837
LeuPro: 7.395 ± 1.654
LeuGln: 4.622 ± 0.885
LeuArg: 9.614 ± 1.488
LeuSer: 4.807 ± 0.843
LeuThr: 4.807 ± 1.014
LeuVal: 8.874 ± 1.922
LeuTrp: 0.74 ± 0.412
LeuTyr: 1.479 ± 0.377
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.143 ± 0.577
MetCys: 0.0 ± 0.0
MetAsp: 1.294 ± 0.453
MetGlu: 0.185 ± 0.226
MetPhe: 0.37 ± 0.206
MetGly: 1.664 ± 0.659
MetHis: 0.185 ± 0.166
MetIle: 0.185 ± 0.159
MetLys: 0.185 ± 0.172
MetLeu: 1.479 ± 0.793
MetMet: 0.0 ± 0.0
MetAsn: 0.74 ± 0.32
MetPro: 1.479 ± 0.615
MetGln: 0.37 ± 0.292
MetArg: 0.924 ± 0.338
MetSer: 1.849 ± 0.561
MetThr: 2.403 ± 0.701
MetVal: 2.034 ± 0.465
MetTrp: 0.37 ± 0.244
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.731 ± 1.418
AsnCys: 0.185 ± 0.172
AsnAsp: 1.294 ± 0.525
AsnGlu: 1.109 ± 0.502
AsnPhe: 0.185 ± 0.172
AsnGly: 2.034 ± 0.571
AsnHis: 0.74 ± 0.296
AsnIle: 0.924 ± 0.366
AsnLys: 0.74 ± 0.39
AsnLeu: 1.479 ± 0.52
AsnMet: 0.185 ± 0.159
AsnAsn: 0.555 ± 0.267
AsnPro: 1.664 ± 0.651
AsnGln: 0.74 ± 0.485
AsnArg: 2.219 ± 0.618
AsnSer: 1.664 ± 0.501
AsnThr: 1.479 ± 0.499
AsnVal: 2.588 ± 0.73
AsnTrp: 0.555 ± 0.445
AsnTyr: 0.555 ± 0.355
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 10.723 ± 1.533
ProCys: 0.185 ± 0.179
ProAsp: 3.328 ± 0.967
ProGlu: 2.403 ± 0.702
ProPhe: 0.924 ± 0.395
ProGly: 2.403 ± 0.603
ProHis: 0.924 ± 0.514
ProIle: 1.479 ± 0.465
ProLys: 1.109 ± 0.438
ProLeu: 4.622 ± 0.948
ProMet: 1.849 ± 0.733
ProAsn: 0.555 ± 0.262
ProPro: 1.109 ± 0.614
ProGln: 3.513 ± 1.14
ProArg: 2.219 ± 0.605
ProSer: 2.588 ± 0.572
ProThr: 2.219 ± 0.96
ProVal: 3.698 ± 0.642
ProTrp: 0.74 ± 0.327
ProTyr: 2.034 ± 0.484
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.513 ± 0.49
GlnCys: 0.0 ± 0.0
GlnAsp: 0.924 ± 0.283
GlnGlu: 0.74 ± 0.332
GlnPhe: 2.219 ± 0.628
GlnGly: 1.479 ± 0.545
GlnHis: 0.37 ± 0.24
GlnIle: 1.849 ± 0.649
GlnLys: 0.555 ± 0.295
GlnLeu: 13.126 ± 1.933
GlnMet: 0.74 ± 0.495
GlnAsn: 0.924 ± 0.319
GlnPro: 2.773 ± 0.847
GlnGln: 3.882 ± 0.856
GlnArg: 4.067 ± 0.828
GlnSer: 1.294 ± 0.483
GlnThr: 2.219 ± 0.633
GlnVal: 1.109 ± 0.474
GlnTrp: 0.924 ± 0.466
GlnTyr: 0.74 ± 0.35
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 9.429 ± 1.944
ArgCys: 0.555 ± 0.31
ArgAsp: 4.807 ± 0.855
ArgGlu: 5.916 ± 1.454
ArgPhe: 2.588 ± 0.622
ArgGly: 6.101 ± 1.057
ArgHis: 1.479 ± 0.42
ArgIle: 2.958 ± 0.896
ArgLys: 1.294 ± 0.383
ArgLeu: 6.101 ± 1.081
ArgMet: 0.924 ± 0.42
ArgAsn: 1.479 ± 0.642
ArgPro: 2.219 ± 0.805
ArgGln: 4.252 ± 0.957
ArgArg: 7.395 ± 2.291
ArgSer: 3.698 ± 0.716
ArgThr: 2.958 ± 0.85
ArgVal: 4.992 ± 0.777
ArgTrp: 2.588 ± 0.703
ArgTyr: 2.219 ± 0.516
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 9.244 ± 1.234
SerCys: 0.74 ± 0.448
SerAsp: 0.74 ± 0.307
SerGlu: 3.143 ± 0.671
SerPhe: 1.294 ± 0.597
SerGly: 4.992 ± 1.053
SerHis: 0.74 ± 0.421
SerIle: 3.513 ± 0.665
SerLys: 1.109 ± 0.461
SerLeu: 3.882 ± 0.995
SerMet: 1.664 ± 0.687
SerAsn: 1.109 ± 0.393
SerPro: 1.664 ± 0.566
SerGln: 2.773 ± 0.536
SerArg: 2.958 ± 0.532
SerSer: 4.622 ± 1.047
SerThr: 3.698 ± 0.801
SerVal: 3.698 ± 0.712
SerTrp: 2.034 ± 0.492
SerTyr: 2.034 ± 0.735
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 8.135 ± 1.702
ThrCys: 0.185 ± 0.182
ThrAsp: 2.403 ± 0.602
ThrGlu: 3.513 ± 0.634
ThrPhe: 2.219 ± 0.82
ThrGly: 5.546 ± 1.041
ThrHis: 0.0 ± 0.0
ThrIle: 3.698 ± 1.132
ThrLys: 1.109 ± 0.467
ThrLeu: 5.731 ± 1.113
ThrMet: 1.479 ± 0.574
ThrAsn: 1.479 ± 0.432
ThrPro: 4.252 ± 0.795
ThrGln: 1.109 ± 0.396
ThrArg: 4.622 ± 0.868
ThrSer: 3.328 ± 0.988
ThrThr: 4.807 ± 1.133
ThrVal: 6.286 ± 1.483
ThrTrp: 0.74 ± 0.34
ThrTyr: 0.74 ± 0.356
ThrXaa: 0.0 ± 0.0
Val
ValAla: 11.462 ± 1.764
ValCys: 0.185 ± 0.181
ValAsp: 2.958 ± 0.602
ValGlu: 4.067 ± 1.303
ValPhe: 0.924 ± 0.528
ValGly: 6.84 ± 1.303
ValHis: 0.37 ± 0.311
ValIle: 2.034 ± 0.577
ValLys: 1.294 ± 0.355
ValLeu: 4.437 ± 1.209
ValMet: 1.664 ± 0.633
ValAsn: 2.403 ± 0.571
ValPro: 3.328 ± 0.609
ValGln: 3.328 ± 0.937
ValArg: 5.916 ± 1.058
ValSer: 4.807 ± 0.981
ValThr: 7.95 ± 1.589
ValVal: 5.361 ± 0.86
ValTrp: 0.74 ± 0.339
ValTyr: 2.034 ± 0.488
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.403 ± 0.641
TrpCys: 0.0 ± 0.0
TrpAsp: 0.185 ± 0.181
TrpGlu: 1.109 ± 0.405
TrpPhe: 1.294 ± 0.465
TrpGly: 1.109 ± 0.47
TrpHis: 0.74 ± 0.402
TrpIle: 0.74 ± 0.317
TrpLys: 0.185 ± 0.157
TrpLeu: 3.882 ± 1.047
TrpMet: 0.37 ± 0.221
TrpAsn: 0.37 ± 0.242
TrpPro: 0.74 ± 0.407
TrpGln: 0.924 ± 0.322
TrpArg: 0.924 ± 0.344
TrpSer: 0.555 ± 0.355
TrpThr: 0.555 ± 0.401
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.37 ± 0.25
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.773 ± 0.792
TyrCys: 0.74 ± 0.432
TyrAsp: 1.294 ± 0.387
TyrGlu: 1.294 ± 0.343
TyrPhe: 0.37 ± 0.256
TyrGly: 3.328 ± 0.717
TyrHis: 0.185 ± 0.157
TyrIle: 0.74 ± 0.367
TyrLys: 0.37 ± 0.264
TyrLeu: 0.74 ± 0.343
TyrMet: 0.0 ± 0.0
TyrAsn: 0.74 ± 0.299
TyrPro: 1.479 ± 0.417
TyrGln: 1.479 ± 0.44
TyrArg: 2.219 ± 0.797
TyrSer: 1.664 ± 0.533
TyrThr: 0.555 ± 0.305
TyrVal: 2.588 ± 0.775
TyrTrp: 0.37 ± 0.314
TyrTyr: 0.74 ± 0.412
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 26 proteins (5410 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
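A protein-level bootstrap can be sketched as follows. This is only an illustration under stated assumptions: the page does not say which statistic is reported as the error, so the standard deviation of the resampled estimates is used here; the function names, the toy sequences, and the fixed seed are hypothetical.

```python
import random
import statistics


def dipeptide_freq(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide over a list of sequences."""
    hits = total = 0
    for seq in proteins:
        pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
        hits += pairs.count(dipeptide)
        total += len(pairs)
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, rounds=100, seed=0):
    """Resample whole proteins with replacement and return the spread.

    Assumption: the error is the sample standard deviation of the
    frequencies obtained from `rounds` resampled proteomes.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(rounds):
        # Draw as many proteins as the proteome has, with replacement
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, dipeptide))
    return statistics.stdev(estimates)


# Hypothetical toy proteome, not the 26 proteins of the phage
toy_proteome = ["MAAGL", "AAK", "MKLV", "GAAL"]
value = dipeptide_freq(toy_proteome, "AA")
error = bootstrap_error(toy_proteome, "AA")
print(f"AlaAla: {value:.3f} ± {error:.3f}")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so a dipeptide that is concentrated in a few proteins gets a correspondingly larger error. A dipeptide that never occurs yields 0.0 ± 0.0, as in the Xaa rows of the table.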

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski