Amino acid dipepetide frequency for Mycobacterium phage Ruotula

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.057AlaAla: 12.057 ± 1.193
0.603AlaCys: 0.603 ± 0.186
6.27AlaAsp: 6.27 ± 0.574
6.028AlaGlu: 6.028 ± 0.79
3.255AlaPhe: 3.255 ± 0.415
7.174AlaGly: 7.174 ± 0.871
1.266AlaHis: 1.266 ± 0.341
4.582AlaIle: 4.582 ± 0.611
3.798AlaLys: 3.798 ± 0.558
8.681AlaLeu: 8.681 ± 0.876
2.653AlaMet: 2.653 ± 0.442
2.291AlaAsn: 2.291 ± 0.376
5.064AlaPro: 5.064 ± 0.81
3.316AlaGln: 3.316 ± 0.516
5.908AlaArg: 5.908 ± 0.592
5.064AlaSer: 5.064 ± 0.576
6.27AlaThr: 6.27 ± 0.813
7.716AlaVal: 7.716 ± 0.626
1.869AlaTrp: 1.869 ± 0.341
3.075AlaTyr: 3.075 ± 0.447
0.0AlaXaa: 0.0 ± 0.0
Cys
0.844CysAla: 0.844 ± 0.249
0.0CysCys: 0.0 ± 0.0
0.844CysAsp: 0.844 ± 0.273
0.543CysGlu: 0.543 ± 0.177
0.181CysPhe: 0.181 ± 0.1
0.543CysGly: 0.543 ± 0.217
0.181CysHis: 0.181 ± 0.097
0.181CysIle: 0.181 ± 0.099
0.121CysLys: 0.121 ± 0.078
0.482CysLeu: 0.482 ± 0.167
0.241CysMet: 0.241 ± 0.129
0.301CysAsn: 0.301 ± 0.119
0.543CysPro: 0.543 ± 0.203
0.121CysGln: 0.121 ± 0.079
0.362CysArg: 0.362 ± 0.128
0.362CysSer: 0.362 ± 0.139
0.301CysThr: 0.301 ± 0.153
0.603CysVal: 0.603 ± 0.177
0.241CysTrp: 0.241 ± 0.117
0.121CysTyr: 0.121 ± 0.102
0.0CysXaa: 0.0 ± 0.0
Asp
6.39AspAla: 6.39 ± 0.618
0.543AspCys: 0.543 ± 0.186
4.34AspAsp: 4.34 ± 0.484
3.497AspGlu: 3.497 ± 0.423
2.231AspPhe: 2.231 ± 0.391
5.908AspGly: 5.908 ± 0.639
1.387AspHis: 1.387 ± 0.288
2.713AspIle: 2.713 ± 0.469
2.411AspLys: 2.411 ± 0.369
6.571AspLeu: 6.571 ± 0.779
1.025AspMet: 1.025 ± 0.205
1.869AspAsn: 1.869 ± 0.347
4.521AspPro: 4.521 ± 0.559
1.688AspGln: 1.688 ± 0.358
3.436AspArg: 3.436 ± 0.412
3.376AspSer: 3.376 ± 0.503
3.798AspThr: 3.798 ± 0.409
3.858AspVal: 3.858 ± 0.528
2.05AspTrp: 2.05 ± 0.376
2.351AspTyr: 2.351 ± 0.379
0.0AspXaa: 0.0 ± 0.0
Glu
5.486GluAla: 5.486 ± 0.723
0.422GluCys: 0.422 ± 0.176
4.22GluAsp: 4.22 ± 0.456
4.823GluGlu: 4.823 ± 0.579
2.231GluPhe: 2.231 ± 0.464
4.22GluGly: 4.22 ± 0.417
1.567GluHis: 1.567 ± 0.376
3.316GluIle: 3.316 ± 0.49
2.833GluLys: 2.833 ± 0.451
7.053GluLeu: 7.053 ± 0.635
1.688GluMet: 1.688 ± 0.289
1.869GluAsn: 1.869 ± 0.33
2.291GluPro: 2.291 ± 0.419
2.833GluGln: 2.833 ± 0.38
3.918GluArg: 3.918 ± 0.56
3.195GluSer: 3.195 ± 0.418
3.316GluThr: 3.316 ± 0.462
5.908GluVal: 5.908 ± 0.713
1.567GluTrp: 1.567 ± 0.342
2.231GluTyr: 2.231 ± 0.427
0.0GluXaa: 0.0 ± 0.0
Phe
2.17PheAla: 2.17 ± 0.367
0.301PheCys: 0.301 ± 0.143
2.833PheAsp: 2.833 ± 0.293
2.351PheGlu: 2.351 ± 0.314
0.603PhePhe: 0.603 ± 0.172
3.918PheGly: 3.918 ± 0.509
0.844PheHis: 0.844 ± 0.253
1.447PheIle: 1.447 ± 0.298
1.206PheLys: 1.206 ± 0.253
2.713PheLeu: 2.713 ± 0.557
0.603PheMet: 0.603 ± 0.192
1.145PheAsn: 1.145 ± 0.258
1.688PhePro: 1.688 ± 0.325
1.206PheGln: 1.206 ± 0.264
1.748PheArg: 1.748 ± 0.347
2.231PheSer: 2.231 ± 0.36
1.809PheThr: 1.809 ± 0.349
2.291PheVal: 2.291 ± 0.366
0.663PheTrp: 0.663 ± 0.197
0.904PheTyr: 0.904 ± 0.246
0.0PheXaa: 0.0 ± 0.0
Gly
6.812GlyAla: 6.812 ± 0.931
0.904GlyCys: 0.904 ± 0.214
6.089GlyAsp: 6.089 ± 0.494
4.823GlyGlu: 4.823 ± 0.609
3.014GlyPhe: 3.014 ± 0.512
9.826GlyGly: 9.826 ± 2.524
2.11GlyHis: 2.11 ± 0.378
4.702GlyIle: 4.702 ± 0.712
3.436GlyLys: 3.436 ± 0.491
7.716GlyLeu: 7.716 ± 0.757
1.688GlyMet: 1.688 ± 0.311
3.436GlyAsn: 3.436 ± 0.526
4.22GlyPro: 4.22 ± 0.612
2.411GlyGln: 2.411 ± 0.376
5.486GlyArg: 5.486 ± 0.605
5.365GlySer: 5.365 ± 0.807
5.184GlyThr: 5.184 ± 0.658
5.546GlyVal: 5.546 ± 0.566
2.773GlyTrp: 2.773 ± 0.378
3.075GlyTyr: 3.075 ± 0.445
0.0GlyXaa: 0.0 ± 0.0
His
1.567HisAla: 1.567 ± 0.346
0.181HisCys: 0.181 ± 0.128
1.447HisAsp: 1.447 ± 0.255
1.266HisGlu: 1.266 ± 0.307
0.784HisPhe: 0.784 ± 0.205
1.809HisGly: 1.809 ± 0.343
0.723HisHis: 0.723 ± 0.196
1.025HisIle: 1.025 ± 0.26
1.025HisLys: 1.025 ± 0.309
1.206HisLeu: 1.206 ± 0.279
0.121HisMet: 0.121 ± 0.087
0.422HisAsn: 0.422 ± 0.149
1.507HisPro: 1.507 ± 0.264
1.145HisGln: 1.145 ± 0.303
1.567HisArg: 1.567 ± 0.347
0.663HisSer: 0.663 ± 0.184
1.206HisThr: 1.206 ± 0.264
1.567HisVal: 1.567 ± 0.312
0.543HisTrp: 0.543 ± 0.165
0.844HisTyr: 0.844 ± 0.261
0.0HisXaa: 0.0 ± 0.0
Ile
6.39IleAla: 6.39 ± 0.64
0.362IleCys: 0.362 ± 0.152
3.376IleAsp: 3.376 ± 0.362
3.979IleGlu: 3.979 ± 0.444
0.904IlePhe: 0.904 ± 0.273
3.979IleGly: 3.979 ± 0.533
0.965IleHis: 0.965 ± 0.244
1.869IleIle: 1.869 ± 0.364
1.628IleLys: 1.628 ± 0.322
3.376IleLeu: 3.376 ± 0.425
0.965IleMet: 0.965 ± 0.232
1.688IleAsn: 1.688 ± 0.327
3.014IlePro: 3.014 ± 0.42
1.688IleGln: 1.688 ± 0.361
3.255IleArg: 3.255 ± 0.474
3.195IleSer: 3.195 ± 0.436
3.436IleThr: 3.436 ± 0.485
2.833IleVal: 2.833 ± 0.509
0.723IleTrp: 0.723 ± 0.175
1.507IleTyr: 1.507 ± 0.245
0.0IleXaa: 0.0 ± 0.0
Lys
3.497LysAla: 3.497 ± 0.451
0.301LysCys: 0.301 ± 0.129
2.532LysAsp: 2.532 ± 0.437
1.688LysGlu: 1.688 ± 0.29
1.447LysPhe: 1.447 ± 0.336
2.532LysGly: 2.532 ± 0.425
1.387LysHis: 1.387 ± 0.271
2.231LysIle: 2.231 ± 0.386
2.472LysLys: 2.472 ± 0.5
3.135LysLeu: 3.135 ± 0.417
0.844LysMet: 0.844 ± 0.176
1.628LysAsn: 1.628 ± 0.302
2.833LysPro: 2.833 ± 0.472
1.507LysGln: 1.507 ± 0.274
3.075LysArg: 3.075 ± 0.485
2.411LysSer: 2.411 ± 0.346
2.17LysThr: 2.17 ± 0.372
3.436LysVal: 3.436 ± 0.465
0.784LysTrp: 0.784 ± 0.243
1.145LysTyr: 1.145 ± 0.305
0.0LysXaa: 0.0 ± 0.0
Leu
9.163LeuAla: 9.163 ± 1.034
0.362LeuCys: 0.362 ± 0.162
5.848LeuAsp: 5.848 ± 0.521
5.004LeuGlu: 5.004 ± 0.605
2.472LeuPhe: 2.472 ± 0.462
6.993LeuGly: 6.993 ± 0.599
1.387LeuHis: 1.387 ± 0.289
4.823LeuIle: 4.823 ± 0.526
3.738LeuLys: 3.738 ± 0.478
5.787LeuLeu: 5.787 ± 0.628
1.748LeuMet: 1.748 ± 0.281
2.894LeuAsn: 2.894 ± 0.366
5.184LeuPro: 5.184 ± 0.595
2.592LeuGln: 2.592 ± 0.454
5.908LeuArg: 5.908 ± 0.503
5.908LeuSer: 5.908 ± 0.608
5.968LeuThr: 5.968 ± 0.689
5.184LeuVal: 5.184 ± 0.587
1.145LeuTrp: 1.145 ± 0.308
2.231LeuTyr: 2.231 ± 0.377
0.0LeuXaa: 0.0 ± 0.0
Met
2.17MetAla: 2.17 ± 0.251
0.121MetCys: 0.121 ± 0.114
1.025MetAsp: 1.025 ± 0.265
1.628MetGlu: 1.628 ± 0.337
0.663MetPhe: 0.663 ± 0.183
1.688MetGly: 1.688 ± 0.346
0.603MetHis: 0.603 ± 0.233
0.844MetIle: 0.844 ± 0.247
1.266MetLys: 1.266 ± 0.271
0.965MetLeu: 0.965 ± 0.247
0.181MetMet: 0.181 ± 0.108
0.965MetAsn: 0.965 ± 0.226
1.326MetPro: 1.326 ± 0.279
0.543MetGln: 0.543 ± 0.202
1.145MetArg: 1.145 ± 0.266
1.869MetSer: 1.869 ± 0.366
2.05MetThr: 2.05 ± 0.29
1.206MetVal: 1.206 ± 0.297
0.241MetTrp: 0.241 ± 0.114
0.362MetTyr: 0.362 ± 0.144
0.0MetXaa: 0.0 ± 0.0
Asn
3.014AsnAla: 3.014 ± 0.552
0.121AsnCys: 0.121 ± 0.084
1.628AsnAsp: 1.628 ± 0.329
2.17AsnGlu: 2.17 ± 0.353
0.965AsnPhe: 0.965 ± 0.223
3.617AsnGly: 3.617 ± 0.563
0.543AsnHis: 0.543 ± 0.183
1.507AsnIle: 1.507 ± 0.314
0.844AsnLys: 0.844 ± 0.235
2.291AsnLeu: 2.291 ± 0.36
0.663AsnMet: 0.663 ± 0.179
0.844AsnAsn: 0.844 ± 0.22
2.833AsnPro: 2.833 ± 0.392
1.206AsnGln: 1.206 ± 0.275
1.628AsnArg: 1.628 ± 0.384
1.809AsnSer: 1.809 ± 0.316
1.748AsnThr: 1.748 ± 0.328
2.532AsnVal: 2.532 ± 0.41
0.844AsnTrp: 0.844 ± 0.192
1.206AsnTyr: 1.206 ± 0.296
0.0AsnXaa: 0.0 ± 0.0
Pro
5.064ProAla: 5.064 ± 0.598
0.422ProCys: 0.422 ± 0.163
3.858ProAsp: 3.858 ± 0.519
4.702ProGlu: 4.702 ± 0.504
2.231ProPhe: 2.231 ± 0.388
5.606ProGly: 5.606 ± 0.699
1.025ProHis: 1.025 ± 0.28
2.532ProIle: 2.532 ± 0.388
2.17ProLys: 2.17 ± 0.331
4.28ProLeu: 4.28 ± 0.527
1.085ProMet: 1.085 ± 0.252
1.688ProAsn: 1.688 ± 0.322
2.954ProPro: 2.954 ± 0.531
1.567ProGln: 1.567 ± 0.312
2.954ProArg: 2.954 ± 0.476
3.858ProSer: 3.858 ± 0.484
3.738ProThr: 3.738 ± 0.443
3.918ProVal: 3.918 ± 0.437
0.844ProTrp: 0.844 ± 0.32
1.688ProTyr: 1.688 ± 0.328
0.0ProXaa: 0.0 ± 0.0
Gln
3.135GlnAla: 3.135 ± 0.59
0.06GlnCys: 0.06 ± 0.053
1.266GlnAsp: 1.266 ± 0.344
1.809GlnGlu: 1.809 ± 0.364
1.206GlnPhe: 1.206 ± 0.353
2.833GlnGly: 2.833 ± 0.404
0.482GlnHis: 0.482 ± 0.165
2.351GlnIle: 2.351 ± 0.409
1.266GlnLys: 1.266 ± 0.259
3.918GlnLeu: 3.918 ± 0.52
0.844GlnMet: 0.844 ± 0.187
0.603GlnAsn: 0.603 ± 0.203
1.929GlnPro: 1.929 ± 0.414
1.869GlnGln: 1.869 ± 0.397
1.809GlnArg: 1.809 ± 0.371
1.869GlnSer: 1.869 ± 0.26
1.809GlnThr: 1.809 ± 0.343
2.532GlnVal: 2.532 ± 0.343
0.784GlnTrp: 0.784 ± 0.195
0.904GlnTyr: 0.904 ± 0.247
0.0GlnXaa: 0.0 ± 0.0
Arg
5.426ArgAla: 5.426 ± 0.54
0.603ArgCys: 0.603 ± 0.223
3.195ArgAsp: 3.195 ± 0.394
4.762ArgGlu: 4.762 ± 0.614
2.17ArgPhe: 2.17 ± 0.494
5.486ArgGly: 5.486 ± 0.58
1.145ArgHis: 1.145 ± 0.268
2.954ArgIle: 2.954 ± 0.483
3.436ArgLys: 3.436 ± 0.516
5.305ArgLeu: 5.305 ± 0.63
1.929ArgMet: 1.929 ± 0.316
2.17ArgAsn: 2.17 ± 0.437
2.894ArgPro: 2.894 ± 0.553
1.748ArgGln: 1.748 ± 0.355
4.461ArgArg: 4.461 ± 0.575
3.617ArgSer: 3.617 ± 0.455
3.075ArgThr: 3.075 ± 0.547
4.883ArgVal: 4.883 ± 0.594
1.025ArgTrp: 1.025 ± 0.256
1.929ArgTyr: 1.929 ± 0.299
0.0ArgXaa: 0.0 ± 0.0
Ser
6.27SerAla: 6.27 ± 0.622
0.422SerCys: 0.422 ± 0.173
3.316SerAsp: 3.316 ± 0.427
3.798SerGlu: 3.798 ± 0.49
2.05SerPhe: 2.05 ± 0.391
6.45SerGly: 6.45 ± 0.617
1.447SerHis: 1.447 ± 0.292
2.833SerIle: 2.833 ± 0.457
2.11SerLys: 2.11 ± 0.298
5.124SerLeu: 5.124 ± 0.531
1.326SerMet: 1.326 ± 0.259
2.291SerAsn: 2.291 ± 0.424
2.713SerPro: 2.713 ± 0.472
2.17SerGln: 2.17 ± 0.28
2.713SerArg: 2.713 ± 0.324
3.135SerSer: 3.135 ± 0.61
3.617SerThr: 3.617 ± 0.54
4.34SerVal: 4.34 ± 0.401
1.266SerTrp: 1.266 ± 0.267
1.447SerTyr: 1.447 ± 0.322
0.0SerXaa: 0.0 ± 0.0
Thr
6.33ThrAla: 6.33 ± 0.812
0.362ThrCys: 0.362 ± 0.166
3.738ThrAsp: 3.738 ± 0.582
4.16ThrGlu: 4.16 ± 0.465
2.17ThrPhe: 2.17 ± 0.414
6.631ThrGly: 6.631 ± 0.702
1.025ThrHis: 1.025 ± 0.305
2.351ThrIle: 2.351 ± 0.588
2.592ThrLys: 2.592 ± 0.342
5.787ThrLeu: 5.787 ± 0.674
0.663ThrMet: 0.663 ± 0.177
1.748ThrAsn: 1.748 ± 0.34
4.099ThrPro: 4.099 ± 0.499
1.688ThrGln: 1.688 ± 0.355
3.858ThrArg: 3.858 ± 0.536
3.798ThrSer: 3.798 ± 0.554
4.039ThrThr: 4.039 ± 0.576
5.305ThrVal: 5.305 ± 0.523
1.145ThrTrp: 1.145 ± 0.251
1.869ThrTyr: 1.869 ± 0.42
0.0ThrXaa: 0.0 ± 0.0
Val
7.294ValAla: 7.294 ± 0.75
0.482ValCys: 0.482 ± 0.177
5.184ValAsp: 5.184 ± 0.541
4.582ValGlu: 4.582 ± 0.573
2.653ValPhe: 2.653 ± 0.376
4.823ValGly: 4.823 ± 0.676
1.447ValHis: 1.447 ± 0.317
4.22ValIle: 4.22 ± 0.531
3.014ValLys: 3.014 ± 0.476
5.305ValLeu: 5.305 ± 0.51
1.387ValMet: 1.387 ± 0.357
2.351ValAsn: 2.351 ± 0.369
4.28ValPro: 4.28 ± 0.536
2.05ValGln: 2.05 ± 0.382
5.064ValArg: 5.064 ± 0.704
4.582ValSer: 4.582 ± 0.511
6.028ValThr: 6.028 ± 0.573
4.823ValVal: 4.823 ± 0.597
1.145ValTrp: 1.145 ± 0.264
2.05ValTyr: 2.05 ± 0.328
0.0ValXaa: 0.0 ± 0.0
Trp
1.507TrpAla: 1.507 ± 0.269
0.241TrpCys: 0.241 ± 0.105
1.748TrpAsp: 1.748 ± 0.272
0.965TrpGlu: 0.965 ± 0.254
0.844TrpPhe: 0.844 ± 0.23
1.688TrpGly: 1.688 ± 0.35
0.422TrpHis: 0.422 ± 0.172
1.145TrpIle: 1.145 ± 0.24
0.301TrpLys: 0.301 ± 0.158
1.929TrpLeu: 1.929 ± 0.338
0.482TrpMet: 0.482 ± 0.17
0.603TrpAsn: 0.603 ± 0.207
0.904TrpPro: 0.904 ± 0.242
0.904TrpGln: 0.904 ± 0.228
1.266TrpArg: 1.266 ± 0.321
0.904TrpSer: 0.904 ± 0.256
1.688TrpThr: 1.688 ± 0.406
2.231TrpVal: 2.231 ± 0.368
0.844TrpTrp: 0.844 ± 0.284
0.301TrpTyr: 0.301 ± 0.142
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.351TyrAla: 2.351 ± 0.397
0.301TyrCys: 0.301 ± 0.173
1.206TyrAsp: 1.206 ± 0.292
2.411TyrGlu: 2.411 ± 0.364
0.603TyrPhe: 0.603 ± 0.164
2.894TyrGly: 2.894 ± 0.415
0.663TyrHis: 0.663 ± 0.183
1.507TyrIle: 1.507 ± 0.359
1.387TyrLys: 1.387 ± 0.307
2.713TyrLeu: 2.713 ± 0.442
0.663TyrMet: 0.663 ± 0.186
1.206TyrAsn: 1.206 ± 0.267
1.387TyrPro: 1.387 ± 0.305
1.025TyrGln: 1.025 ± 0.286
2.653TyrArg: 2.653 ± 0.415
1.688TyrSer: 1.688 ± 0.296
2.11TyrThr: 2.11 ± 0.418
2.05TyrVal: 2.05 ± 0.31
0.422TyrTrp: 0.422 ± 0.17
0.904TyrTyr: 0.904 ± 0.231
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 94 proteins (16589 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski