Amino acid dipeptide frequency for Mycobacterium phage LittleCherry

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present at a frequency of 0.25% (2.5‰). Since this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
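As a rough illustration of how such frequencies could be computed, the sketch below counts overlapping residue pairs in a set of protein sequences and reports each of the 441 dipeptides in per mille. The function name `dipeptide_per_mille`, the toy sequences, and the counting convention (pairs counted within each protein, normalized by the total number of pairs) are assumptions made for this example; the database may count or normalize differently.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 pairs."""
    counts = Counter()
    for seq in sequences:
        # Map every non-standard or unknown letter to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs inside one protein (assumed convention).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


toy = ["MAAGL", "MKAAB"]      # hypothetical sequences; B is treated as Xaa
freqs = dipeptide_per_mille(toy)
uniform = 1000.0 / 400        # 2.5 per mille, i.e. 0.25 %

print(len(freqs))             # 441 dipeptides including Xaa
print(freqs["AA"], uniform)   # AA occurs in 2 of the 8 pairs
```

With real data, the sequences would come from the proteome's FASTA file rather than from a hard-coded list.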

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
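The unit conversion and the comparison with the random expectation can be sketched as follows. The three values are taken from the table; the helper names and the use of the uniform 2.5‰ baseline as the threshold for "over" and "under" are assumptions for illustration, not necessarily the rule used for the colouring.

```python
UNIFORM_PER_MILLE = 1000.0 / 400  # 0.25 % expressed in per mille (assumed baseline)


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=UNIFORM_PER_MILLE):
    """Label a per mille value relative to the expected frequency."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"


# Values copied from the table: AlaAla, CysCys, IleSer.
table_values = {"AlaAla": 10.626, "CysCys": 0.063, "IleSer": 2.467}
labels = {name: classify(v) for name, v in table_values.items()}

for name, v in table_values.items():
    print(name, round(per_mille_to_fraction(v), 6), labels[name])
```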

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 10.626 ± 1.075
AlaCys: 0.506 ± 0.182
AlaAsp: 6.388 ± 0.673
AlaGlu: 7.653 ± 1.013
AlaPhe: 3.289 ± 0.498
AlaGly: 7.527 ± 0.727
AlaHis: 1.202 ± 0.252
AlaIle: 4.554 ± 0.6
AlaLys: 4.934 ± 0.931
AlaLeu: 7.4 ± 0.736
AlaMet: 2.593 ± 0.482
AlaAsn: 2.973 ± 0.409
AlaPro: 4.617 ± 0.492
AlaGln: 2.593 ± 0.416
AlaArg: 5.187 ± 0.576
AlaSer: 4.87 ± 0.64
AlaThr: 5.503 ± 0.553
AlaVal: 6.894 ± 0.709
AlaTrp: 1.708 ± 0.373
AlaTyr: 2.34 ± 0.409
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.506 ± 0.173
CysCys: 0.063 ± 0.073
CysAsp: 0.633 ± 0.184
CysGlu: 0.569 ± 0.185
CysPhe: 0.316 ± 0.137
CysGly: 1.012 ± 0.234
CysHis: 0.253 ± 0.124
CysIle: 0.506 ± 0.175
CysLys: 0.443 ± 0.154
CysLeu: 0.759 ± 0.227
CysMet: 0.063 ± 0.061
CysAsn: 0.38 ± 0.15
CysPro: 0.38 ± 0.156
CysGln: 0.38 ± 0.161
CysArg: 0.949 ± 0.302
CysSer: 0.569 ± 0.182
CysThr: 0.253 ± 0.125
CysVal: 0.506 ± 0.174
CysTrp: 0.506 ± 0.176
CysTyr: 0.19 ± 0.11
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.199 ± 0.641
AspCys: 0.886 ± 0.264
AspAsp: 4.428 ± 0.421
AspGlu: 4.934 ± 0.606
AspPhe: 2.973 ± 0.478
AspGly: 5.819 ± 0.6
AspHis: 1.455 ± 0.325
AspIle: 2.783 ± 0.539
AspLys: 2.53 ± 0.613
AspLeu: 6.831 ± 0.852
AspMet: 1.898 ± 0.263
AspAsn: 2.151 ± 0.428
AspPro: 4.617 ± 0.637
AspGln: 2.973 ± 0.476
AspArg: 3.732 ± 0.477
AspSer: 2.783 ± 0.5
AspThr: 3.289 ± 0.416
AspVal: 5.629 ± 0.653
AspTrp: 1.328 ± 0.311
AspTyr: 2.91 ± 0.439
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.464 ± 0.683
GluCys: 0.569 ± 0.176
GluAsp: 4.997 ± 0.616
GluGlu: 5.566 ± 0.635
GluPhe: 1.898 ± 0.392
GluGly: 5.629 ± 0.534
GluHis: 1.455 ± 0.298
GluIle: 3.795 ± 0.414
GluLys: 3.858 ± 0.447
GluLeu: 7.464 ± 0.759
GluMet: 1.898 ± 0.353
GluAsn: 1.961 ± 0.378
GluPro: 3.036 ± 0.462
GluGln: 2.783 ± 0.505
GluArg: 5.187 ± 0.646
GluSer: 3.036 ± 0.375
GluThr: 3.163 ± 0.486
GluVal: 5.819 ± 0.474
GluTrp: 1.518 ± 0.36
GluTyr: 2.467 ± 0.462
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.289 ± 0.505
PheCys: 0.19 ± 0.143
PheAsp: 3.099 ± 0.452
PheGlu: 2.024 ± 0.353
PhePhe: 0.822 ± 0.276
PheGly: 3.289 ± 0.336
PheHis: 0.443 ± 0.159
PheIle: 1.455 ± 0.293
PheLys: 1.771 ± 0.363
PheLeu: 3.226 ± 0.563
PheMet: 0.443 ± 0.147
PheAsn: 1.645 ± 0.359
PhePro: 1.328 ± 0.352
PheGln: 1.265 ± 0.335
PheArg: 2.151 ± 0.347
PheSer: 1.518 ± 0.345
PheThr: 1.834 ± 0.359
PheVal: 2.53 ± 0.348
PheTrp: 0.696 ± 0.266
PheTyr: 1.202 ± 0.272
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.135 ± 0.901
GlyCys: 0.886 ± 0.245
GlyAsp: 5.187 ± 0.656
GlyGlu: 5.629 ± 0.548
GlyPhe: 2.657 ± 0.351
GlyGly: 9.741 ± 2.056
GlyHis: 1.708 ± 0.313
GlyIle: 4.048 ± 0.58
GlyLys: 4.617 ± 0.632
GlyLeu: 7.147 ± 0.809
GlyMet: 1.898 ± 0.315
GlyAsn: 3.732 ± 0.445
GlyPro: 3.795 ± 0.557
GlyGln: 2.846 ± 0.496
GlyArg: 4.554 ± 0.56
GlySer: 4.997 ± 0.61
GlyThr: 4.428 ± 0.646
GlyVal: 5.882 ± 0.629
GlyTrp: 1.961 ± 0.375
GlyTyr: 2.72 ± 0.561
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.518 ± 0.276
HisCys: 0.127 ± 0.094
HisAsp: 1.834 ± 0.308
HisGlu: 1.392 ± 0.232
HisPhe: 0.759 ± 0.172
HisGly: 2.024 ± 0.361
HisHis: 0.569 ± 0.178
HisIle: 1.012 ± 0.227
HisLys: 0.696 ± 0.196
HisLeu: 1.392 ± 0.27
HisMet: 0.063 ± 0.065
HisAsn: 0.506 ± 0.148
HisPro: 1.012 ± 0.238
HisGln: 0.822 ± 0.225
HisArg: 1.265 ± 0.394
HisSer: 0.759 ± 0.185
HisThr: 0.949 ± 0.228
HisVal: 1.708 ± 0.297
HisTrp: 0.569 ± 0.202
HisTyr: 0.506 ± 0.197
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.123 ± 0.559
IleCys: 0.506 ± 0.166
IleAsp: 4.048 ± 0.511
IleGlu: 4.491 ± 0.612
IlePhe: 1.392 ± 0.336
IleGly: 3.289 ± 0.4
IleHis: 1.265 ± 0.291
IleIle: 2.214 ± 0.345
IleLys: 2.214 ± 0.39
IleLeu: 3.416 ± 0.405
IleMet: 0.633 ± 0.2
IleAsn: 2.024 ± 0.38
IlePro: 3.605 ± 0.515
IleGln: 1.392 ± 0.282
IleArg: 3.352 ± 0.466
IleSer: 2.467 ± 0.424
IleThr: 2.91 ± 0.442
IleVal: 3.226 ± 0.498
IleTrp: 0.633 ± 0.191
IleTyr: 1.139 ± 0.246
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.376 ± 0.907
LysCys: 0.316 ± 0.135
LysAsp: 3.226 ± 0.4
LysGlu: 2.593 ± 0.504
LysPhe: 1.581 ± 0.353
LysGly: 2.973 ± 0.434
LysHis: 0.759 ± 0.243
LysIle: 2.467 ± 0.493
LysLys: 2.973 ± 0.562
LysLeu: 4.111 ± 0.489
LysMet: 1.328 ± 0.262
LysAsn: 0.949 ± 0.287
LysPro: 2.214 ± 0.355
LysGln: 1.455 ± 0.436
LysArg: 2.593 ± 0.475
LysSer: 2.214 ± 0.499
LysThr: 3.036 ± 0.452
LysVal: 4.111 ± 0.547
LysTrp: 0.949 ± 0.242
LysTyr: 1.455 ± 0.342
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.906 ± 1.034
LeuCys: 0.633 ± 0.179
LeuAsp: 6.578 ± 0.73
LeuGlu: 6.388 ± 0.668
LeuPhe: 1.961 ± 0.367
LeuGly: 6.009 ± 0.865
LeuHis: 1.012 ± 0.296
LeuIle: 4.175 ± 0.55
LeuLys: 4.491 ± 0.802
LeuLeu: 6.072 ± 0.643
LeuMet: 2.087 ± 0.329
LeuAsn: 2.593 ± 0.472
LeuPro: 3.732 ± 0.592
LeuGln: 2.467 ± 0.557
LeuArg: 5.44 ± 0.503
LeuSer: 5.187 ± 0.623
LeuThr: 5.44 ± 0.765
LeuVal: 6.135 ± 0.658
LeuTrp: 1.265 ± 0.29
LeuTyr: 2.973 ± 0.525
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.898 ± 0.392
MetCys: 0.253 ± 0.129
MetAsp: 1.075 ± 0.256
MetGlu: 1.328 ± 0.237
MetPhe: 1.075 ± 0.241
MetGly: 1.771 ± 0.309
MetHis: 0.443 ± 0.176
MetIle: 0.443 ± 0.2
MetLys: 1.328 ± 0.269
MetLeu: 2.087 ± 0.372
MetMet: 0.38 ± 0.174
MetAsn: 0.696 ± 0.216
MetPro: 1.202 ± 0.278
MetGln: 0.506 ± 0.202
MetArg: 1.645 ± 0.404
MetSer: 1.834 ± 0.356
MetThr: 2.783 ± 0.437
MetVal: 1.328 ± 0.343
MetTrp: 0.316 ± 0.112
MetTyr: 0.569 ± 0.207
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.593 ± 0.376
AsnCys: 0.38 ± 0.161
AsnAsp: 2.277 ± 0.348
AsnGlu: 2.087 ± 0.325
AsnPhe: 1.898 ± 0.357
AsnGly: 3.858 ± 0.495
AsnHis: 0.759 ± 0.204
AsnIle: 1.518 ± 0.313
AsnLys: 0.886 ± 0.202
AsnLeu: 2.72 ± 0.395
AsnMet: 0.696 ± 0.198
AsnAsn: 0.886 ± 0.222
AsnPro: 2.214 ± 0.443
AsnGln: 0.696 ± 0.235
AsnArg: 1.961 ± 0.372
AsnSer: 1.645 ± 0.361
AsnThr: 1.834 ± 0.327
AsnVal: 2.34 ± 0.369
AsnTrp: 0.759 ± 0.213
AsnTyr: 0.696 ± 0.22
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.175 ± 0.553
ProCys: 0.316 ± 0.141
ProAsp: 2.783 ± 0.479
ProGlu: 4.428 ± 0.54
ProPhe: 1.834 ± 0.417
ProGly: 4.491 ± 0.716
ProHis: 1.012 ± 0.296
ProIle: 2.657 ± 0.394
ProLys: 1.834 ± 0.389
ProLeu: 3.226 ± 0.482
ProMet: 1.012 ± 0.228
ProAsn: 1.834 ± 0.349
ProPro: 3.099 ± 0.692
ProGln: 1.898 ± 0.363
ProArg: 1.898 ± 0.411
ProSer: 3.289 ± 0.327
ProThr: 4.238 ± 0.663
ProVal: 4.491 ± 0.546
ProTrp: 1.012 ± 0.383
ProTyr: 1.328 ± 0.337
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.985 ± 0.633
GlnCys: 0.19 ± 0.11
GlnAsp: 1.771 ± 0.352
GlnGlu: 2.72 ± 0.402
GlnPhe: 1.518 ± 0.29
GlnGly: 2.404 ± 0.431
GlnHis: 0.949 ± 0.241
GlnIle: 2.151 ± 0.384
GlnLys: 1.202 ± 0.347
GlnLeu: 3.732 ± 0.768
GlnMet: 0.759 ± 0.224
GlnAsn: 0.759 ± 0.193
GlnPro: 1.012 ± 0.266
GlnGln: 0.949 ± 0.226
GlnArg: 1.708 ± 0.388
GlnSer: 1.518 ± 0.331
GlnThr: 1.708 ± 0.319
GlnVal: 2.783 ± 0.317
GlnTrp: 0.506 ± 0.186
GlnTyr: 1.265 ± 0.284
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.06 ± 0.649
ArgCys: 0.886 ± 0.261
ArgAsp: 3.542 ± 0.404
ArgGlu: 5.06 ± 0.594
ArgPhe: 2.91 ± 0.493
ArgGly: 4.364 ± 0.532
ArgHis: 1.581 ± 0.389
ArgIle: 3.416 ± 0.509
ArgLys: 3.352 ± 0.508
ArgLeu: 5.693 ± 0.564
ArgMet: 1.581 ± 0.29
ArgAsn: 2.34 ± 0.349
ArgPro: 2.72 ± 0.377
ArgGln: 2.214 ± 0.438
ArgArg: 4.681 ± 0.566
ArgSer: 2.783 ± 0.488
ArgThr: 3.036 ± 0.411
ArgVal: 3.542 ± 0.491
ArgTrp: 1.139 ± 0.352
ArgTyr: 1.518 ± 0.312
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.681 ± 0.431
SerCys: 0.633 ± 0.179
SerAsp: 4.111 ± 0.569
SerGlu: 3.669 ± 0.495
SerPhe: 2.214 ± 0.335
SerGly: 5.187 ± 0.718
SerHis: 0.886 ± 0.246
SerIle: 2.277 ± 0.349
SerLys: 2.34 ± 0.409
SerLeu: 3.099 ± 0.437
SerMet: 1.328 ± 0.308
SerAsn: 1.139 ± 0.366
SerPro: 2.277 ± 0.442
SerGln: 2.214 ± 0.331
SerArg: 3.985 ± 0.624
SerSer: 3.226 ± 0.557
SerThr: 3.352 ± 0.519
SerVal: 3.416 ± 0.481
SerTrp: 0.949 ± 0.222
SerTyr: 1.518 ± 0.334
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.123 ± 0.508
ThrCys: 0.569 ± 0.199
ThrAsp: 4.111 ± 0.475
ThrGlu: 4.554 ± 0.462
ThrPhe: 2.024 ± 0.325
ThrGly: 4.744 ± 0.687
ThrHis: 1.202 ± 0.296
ThrIle: 3.163 ± 0.473
ThrLys: 2.34 ± 0.36
ThrLeu: 4.554 ± 0.433
ThrMet: 1.581 ± 0.31
ThrAsn: 1.898 ± 0.354
ThrPro: 4.048 ± 0.554
ThrGln: 2.214 ± 0.376
ThrArg: 3.352 ± 0.475
ThrSer: 2.593 ± 0.433
ThrThr: 2.91 ± 0.431
ThrVal: 4.364 ± 0.49
ThrTrp: 1.392 ± 0.269
ThrTyr: 1.581 ± 0.372
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.211 ± 0.701
ValCys: 1.075 ± 0.287
ValAsp: 5.756 ± 0.628
ValGlu: 4.997 ± 0.646
ValPhe: 1.834 ± 0.386
ValGly: 6.262 ± 0.756
ValHis: 1.392 ± 0.308
ValIle: 3.922 ± 0.506
ValLys: 3.542 ± 0.503
ValLeu: 5.44 ± 0.774
ValMet: 1.581 ± 0.345
ValAsn: 2.53 ± 0.498
ValPro: 3.605 ± 0.475
ValGln: 2.024 ± 0.39
ValArg: 4.301 ± 0.521
ValSer: 4.554 ± 0.535
ValThr: 4.87 ± 0.542
ValVal: 6.325 ± 0.731
ValTrp: 1.645 ± 0.37
ValTyr: 2.214 ± 0.452
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.087 ± 0.37
TrpCys: 0.19 ± 0.113
TrpAsp: 1.581 ± 0.312
TrpGlu: 1.139 ± 0.309
TrpPhe: 0.506 ± 0.177
TrpGly: 1.518 ± 0.339
TrpHis: 0.506 ± 0.2
TrpIle: 1.139 ± 0.314
TrpLys: 0.696 ± 0.195
TrpLeu: 1.455 ± 0.319
TrpMet: 0.443 ± 0.147
TrpAsn: 0.886 ± 0.244
TrpPro: 0.822 ± 0.253
TrpGln: 0.949 ± 0.219
TrpArg: 1.012 ± 0.231
TrpSer: 1.265 ± 0.29
TrpThr: 1.202 ± 0.256
TrpVal: 1.455 ± 0.232
TrpTrp: 0.506 ± 0.203
TrpTyr: 0.38 ± 0.147
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.404 ± 0.38
TyrCys: 0.063 ± 0.059
TyrAsp: 2.72 ± 0.494
TyrGlu: 2.34 ± 0.494
TyrPhe: 0.759 ± 0.26
TyrGly: 2.72 ± 0.404
TyrHis: 0.633 ± 0.194
TyrIle: 1.645 ± 0.292
TyrLys: 0.633 ± 0.209
TyrLeu: 2.72 ± 0.43
TyrMet: 0.633 ± 0.205
TyrAsn: 0.822 ± 0.197
TyrPro: 1.518 ± 0.287
TyrGln: 0.949 ± 0.247
TyrArg: 2.467 ± 0.476
TyrSer: 1.392 ± 0.371
TyrThr: 1.581 ± 0.41
TyrVal: 2.593 ± 0.56
TyrTrp: 0.38 ± 0.175
TyrTyr: 0.759 ± 0.223
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 87 proteins (15811 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
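A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the 100 estimates serves as the error. The function names, the toy sequences, the fixed seed, and the choice of the sample standard deviation as the error measure are assumptions for illustration; the database's exact procedure is not described here.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs within one protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille."""
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


toy = ["MAAGL", "MKAAV", "MGGLK"]          # hypothetical sequences
mean_aa, sd_aa = bootstrap_error(toy, "AA")
print(f"AA: {mean_aa:.3f} ± {sd_aa:.3f}")
print(bootstrap_error(toy, "WW"))           # absent dipeptide gives zero mean and zero spread
```

Resampling proteins rather than individual residues keeps each protein's internal composition intact, so the error reflects variation between proteins.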

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski