Amino acid dipepetide frequency for Mycobacterium phage Blinn1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.11AlaAla: 11.11 ± 1.053
0.621AlaCys: 0.621 ± 0.185
5.648AlaAsp: 5.648 ± 0.615
7.696AlaGlu: 7.696 ± 0.84
3.165AlaPhe: 3.165 ± 0.514
8.565AlaGly: 8.565 ± 0.894
1.49AlaHis: 1.49 ± 0.335
3.786AlaIle: 3.786 ± 0.397
5.338AlaLys: 5.338 ± 0.577
7.386AlaLeu: 7.386 ± 0.763
2.048AlaMet: 2.048 ± 0.364
2.917AlaAsn: 2.917 ± 0.483
4.469AlaPro: 4.469 ± 0.587
3.352AlaGln: 3.352 ± 0.551
5.772AlaArg: 5.772 ± 0.66
4.531AlaSer: 4.531 ± 0.55
4.407AlaThr: 4.407 ± 0.553
6.765AlaVal: 6.765 ± 0.846
1.8AlaTrp: 1.8 ± 0.361
2.545AlaTyr: 2.545 ± 0.479
0.0AlaXaa: 0.0 ± 0.0
Cys
0.497CysAla: 0.497 ± 0.16
0.0CysCys: 0.0 ± 0.0
0.745CysAsp: 0.745 ± 0.245
0.559CysGlu: 0.559 ± 0.19
0.31CysPhe: 0.31 ± 0.121
0.993CysGly: 0.993 ± 0.265
0.186CysHis: 0.186 ± 0.088
0.248CysIle: 0.248 ± 0.127
0.248CysLys: 0.248 ± 0.136
0.745CysLeu: 0.745 ± 0.247
0.124CysMet: 0.124 ± 0.099
0.434CysAsn: 0.434 ± 0.131
0.931CysPro: 0.931 ± 0.285
0.186CysGln: 0.186 ± 0.13
0.621CysArg: 0.621 ± 0.263
0.559CysSer: 0.559 ± 0.237
0.683CysThr: 0.683 ± 0.22
0.683CysVal: 0.683 ± 0.193
0.372CysTrp: 0.372 ± 0.169
0.186CysTyr: 0.186 ± 0.118
0.0CysXaa: 0.0 ± 0.0
Asp
6.579AspAla: 6.579 ± 0.653
0.683AspCys: 0.683 ± 0.243
4.096AspAsp: 4.096 ± 0.544
4.283AspGlu: 4.283 ± 0.595
2.917AspPhe: 2.917 ± 0.503
6.144AspGly: 6.144 ± 0.682
1.552AspHis: 1.552 ± 0.405
3.289AspIle: 3.289 ± 0.503
2.545AspLys: 2.545 ± 0.402
5.524AspLeu: 5.524 ± 0.728
1.676AspMet: 1.676 ± 0.332
1.738AspAsn: 1.738 ± 0.339
4.593AspPro: 4.593 ± 0.599
1.8AspGln: 1.8 ± 0.318
3.289AspArg: 3.289 ± 0.473
2.048AspSer: 2.048 ± 0.391
3.662AspThr: 3.662 ± 0.413
4.903AspVal: 4.903 ± 0.551
1.49AspTrp: 1.49 ± 0.271
2.172AspTyr: 2.172 ± 0.336
0.0AspXaa: 0.0 ± 0.0
Glu
6.393GluAla: 6.393 ± 0.694
0.497GluCys: 0.497 ± 0.201
4.965GluAsp: 4.965 ± 0.714
5.772GluGlu: 5.772 ± 0.665
3.289GluPhe: 3.289 ± 0.472
5.462GluGly: 5.462 ± 0.625
1.241GluHis: 1.241 ± 0.332
3.972GluIle: 3.972 ± 0.441
2.669GluLys: 2.669 ± 0.438
7.2GluLeu: 7.2 ± 0.858
2.234GluMet: 2.234 ± 0.349
2.296GluAsn: 2.296 ± 0.353
2.669GluPro: 2.669 ± 0.503
3.041GluGln: 3.041 ± 0.427
4.034GluArg: 4.034 ± 0.471
3.6GluSer: 3.6 ± 0.372
3.724GluThr: 3.724 ± 0.507
4.655GluVal: 4.655 ± 0.52
1.055GluTrp: 1.055 ± 0.352
2.296GluTyr: 2.296 ± 0.412
0.0GluXaa: 0.0 ± 0.0
Phe
3.041PheAla: 3.041 ± 0.408
0.31PheCys: 0.31 ± 0.151
2.669PheAsp: 2.669 ± 0.362
3.414PheGlu: 3.414 ± 0.511
0.993PhePhe: 0.993 ± 0.317
3.476PheGly: 3.476 ± 0.551
0.434PheHis: 0.434 ± 0.154
1.552PheIle: 1.552 ± 0.267
1.614PheLys: 1.614 ± 0.293
2.234PheLeu: 2.234 ± 0.408
0.497PheMet: 0.497 ± 0.166
1.8PheAsn: 1.8 ± 0.4
2.11PhePro: 2.11 ± 0.382
1.117PheGln: 1.117 ± 0.247
2.172PheArg: 2.172 ± 0.314
2.234PheSer: 2.234 ± 0.453
2.172PheThr: 2.172 ± 0.342
2.358PheVal: 2.358 ± 0.369
0.683PheTrp: 0.683 ± 0.201
0.807PheTyr: 0.807 ± 0.2
0.0PheXaa: 0.0 ± 0.0
Gly
6.082GlyAla: 6.082 ± 0.729
0.869GlyCys: 0.869 ± 0.3
6.641GlyAsp: 6.641 ± 0.794
4.531GlyGlu: 4.531 ± 0.652
2.731GlyPhe: 2.731 ± 0.491
10.427GlyGly: 10.427 ± 2.708
2.11GlyHis: 2.11 ± 0.315
4.965GlyIle: 4.965 ± 0.588
4.531GlyLys: 4.531 ± 0.556
7.075GlyLeu: 7.075 ± 0.927
2.234GlyMet: 2.234 ± 0.339
2.793GlyAsn: 2.793 ± 0.517
4.593GlyPro: 4.593 ± 0.896
2.793GlyGln: 2.793 ± 0.489
4.655GlyArg: 4.655 ± 0.523
5.027GlySer: 5.027 ± 0.696
5.834GlyThr: 5.834 ± 0.745
5.648GlyVal: 5.648 ± 0.666
1.614GlyTrp: 1.614 ± 0.29
3.041GlyTyr: 3.041 ± 0.461
0.0GlyXaa: 0.0 ± 0.0
His
1.8HisAla: 1.8 ± 0.454
0.186HisCys: 0.186 ± 0.09
1.49HisAsp: 1.49 ± 0.296
1.676HisGlu: 1.676 ± 0.362
0.807HisPhe: 0.807 ± 0.262
1.552HisGly: 1.552 ± 0.274
0.497HisHis: 0.497 ± 0.169
1.179HisIle: 1.179 ± 0.273
1.055HisLys: 1.055 ± 0.288
1.303HisLeu: 1.303 ± 0.283
0.372HisMet: 0.372 ± 0.155
0.745HisAsn: 0.745 ± 0.216
1.49HisPro: 1.49 ± 0.293
0.745HisGln: 0.745 ± 0.212
1.552HisArg: 1.552 ± 0.355
0.869HisSer: 0.869 ± 0.252
0.559HisThr: 0.559 ± 0.187
0.683HisVal: 0.683 ± 0.198
0.497HisTrp: 0.497 ± 0.191
0.683HisTyr: 0.683 ± 0.276
0.0HisXaa: 0.0 ± 0.0
Ile
4.841IleAla: 4.841 ± 0.516
0.559IleCys: 0.559 ± 0.171
4.096IleAsp: 4.096 ± 0.49
4.22IleGlu: 4.22 ± 0.486
1.49IlePhe: 1.49 ± 0.323
3.972IleGly: 3.972 ± 0.671
1.117IleHis: 1.117 ± 0.238
1.862IleIle: 1.862 ± 0.402
3.165IleLys: 3.165 ± 0.434
3.538IleLeu: 3.538 ± 0.454
0.434IleMet: 0.434 ± 0.139
2.358IleAsn: 2.358 ± 0.359
3.227IlePro: 3.227 ± 0.447
1.303IleGln: 1.303 ± 0.396
3.289IleArg: 3.289 ± 0.429
2.855IleSer: 2.855 ± 0.394
3.476IleThr: 3.476 ± 0.43
3.414IleVal: 3.414 ± 0.442
0.869IleTrp: 0.869 ± 0.181
1.055IleTyr: 1.055 ± 0.222
0.0IleXaa: 0.0 ± 0.0
Lys
4.655LysAla: 4.655 ± 0.555
0.248LysCys: 0.248 ± 0.132
2.731LysAsp: 2.731 ± 0.389
3.662LysGlu: 3.662 ± 0.501
1.428LysPhe: 1.428 ± 0.317
4.22LysGly: 4.22 ± 0.677
0.869LysHis: 0.869 ± 0.222
2.731LysIle: 2.731 ± 0.424
3.91LysLys: 3.91 ± 0.611
3.972LysLeu: 3.972 ± 0.618
0.807LysMet: 0.807 ± 0.209
1.241LysAsn: 1.241 ± 0.269
3.6LysPro: 3.6 ± 0.587
1.738LysGln: 1.738 ± 0.32
2.917LysArg: 2.917 ± 0.542
2.669LysSer: 2.669 ± 0.441
3.227LysThr: 3.227 ± 0.418
3.538LysVal: 3.538 ± 0.536
1.117LysTrp: 1.117 ± 0.374
1.49LysTyr: 1.49 ± 0.302
0.0LysXaa: 0.0 ± 0.0
Leu
8.813LeuAla: 8.813 ± 0.814
0.621LeuCys: 0.621 ± 0.201
4.22LeuAsp: 4.22 ± 0.477
5.027LeuGlu: 5.027 ± 0.531
2.669LeuPhe: 2.669 ± 0.358
5.151LeuGly: 5.151 ± 0.485
1.738LeuHis: 1.738 ± 0.337
5.151LeuIle: 5.151 ± 0.509
3.972LeuLys: 3.972 ± 0.544
4.779LeuLeu: 4.779 ± 0.492
2.731LeuMet: 2.731 ± 0.387
3.103LeuAsn: 3.103 ± 0.425
4.407LeuPro: 4.407 ± 0.474
1.738LeuGln: 1.738 ± 0.288
5.276LeuArg: 5.276 ± 0.551
4.469LeuSer: 4.469 ± 0.515
4.965LeuThr: 4.965 ± 0.553
4.841LeuVal: 4.841 ± 0.613
1.49LeuTrp: 1.49 ± 0.296
2.545LeuTyr: 2.545 ± 0.481
0.0LeuXaa: 0.0 ± 0.0
Met
1.862MetAla: 1.862 ± 0.365
0.124MetCys: 0.124 ± 0.098
0.993MetAsp: 0.993 ± 0.269
1.8MetGlu: 1.8 ± 0.285
0.434MetPhe: 0.434 ± 0.142
1.49MetGly: 1.49 ± 0.315
0.559MetHis: 0.559 ± 0.199
0.931MetIle: 0.931 ± 0.275
1.676MetLys: 1.676 ± 0.354
1.428MetLeu: 1.428 ± 0.374
0.497MetMet: 0.497 ± 0.158
0.745MetAsn: 0.745 ± 0.239
1.055MetPro: 1.055 ± 0.288
0.683MetGln: 0.683 ± 0.245
1.552MetArg: 1.552 ± 0.286
1.738MetSer: 1.738 ± 0.316
2.545MetThr: 2.545 ± 0.375
1.303MetVal: 1.303 ± 0.34
0.248MetTrp: 0.248 ± 0.119
0.931MetTyr: 0.931 ± 0.233
0.0MetXaa: 0.0 ± 0.0
Asn
3.352AsnAla: 3.352 ± 0.539
0.372AsnCys: 0.372 ± 0.16
1.862AsnAsp: 1.862 ± 0.252
2.296AsnGlu: 2.296 ± 0.332
1.055AsnPhe: 1.055 ± 0.299
3.662AsnGly: 3.662 ± 0.547
0.745AsnHis: 0.745 ± 0.192
1.117AsnIle: 1.117 ± 0.286
1.428AsnLys: 1.428 ± 0.312
2.669AsnLeu: 2.669 ± 0.454
0.807AsnMet: 0.807 ± 0.195
0.559AsnAsn: 0.559 ± 0.148
2.607AsnPro: 2.607 ± 0.424
1.365AsnGln: 1.365 ± 0.293
2.296AsnArg: 2.296 ± 0.33
1.862AsnSer: 1.862 ± 0.466
1.738AsnThr: 1.738 ± 0.358
2.11AsnVal: 2.11 ± 0.355
0.807AsnTrp: 0.807 ± 0.228
0.931AsnTyr: 0.931 ± 0.253
0.0AsnXaa: 0.0 ± 0.0
Pro
4.717ProAla: 4.717 ± 0.522
0.434ProCys: 0.434 ± 0.187
3.662ProAsp: 3.662 ± 0.568
4.158ProGlu: 4.158 ± 0.589
2.669ProPhe: 2.669 ± 0.413
4.903ProGly: 4.903 ± 0.718
0.869ProHis: 0.869 ± 0.258
2.917ProIle: 2.917 ± 0.442
2.607ProLys: 2.607 ± 0.5
3.103ProLeu: 3.103 ± 0.474
0.931ProMet: 0.931 ± 0.27
2.545ProAsn: 2.545 ± 0.37
2.172ProPro: 2.172 ± 0.436
2.669ProGln: 2.669 ± 0.734
2.855ProArg: 2.855 ± 0.515
3.289ProSer: 3.289 ± 0.431
3.972ProThr: 3.972 ± 0.429
3.786ProVal: 3.786 ± 0.397
1.055ProTrp: 1.055 ± 0.346
1.365ProTyr: 1.365 ± 0.26
0.0ProXaa: 0.0 ± 0.0
Gln
4.283GlnAla: 4.283 ± 0.551
0.248GlnCys: 0.248 ± 0.132
1.8GlnAsp: 1.8 ± 0.329
1.8GlnGlu: 1.8 ± 0.349
1.179GlnPhe: 1.179 ± 0.257
3.91GlnGly: 3.91 ± 1.48
1.055GlnHis: 1.055 ± 0.244
2.545GlnIle: 2.545 ± 0.401
1.49GlnLys: 1.49 ± 0.414
3.289GlnLeu: 3.289 ± 0.499
1.117GlnMet: 1.117 ± 0.278
0.683GlnAsn: 0.683 ± 0.175
1.055GlnPro: 1.055 ± 0.27
1.862GlnGln: 1.862 ± 0.399
1.552GlnArg: 1.552 ± 0.302
1.614GlnSer: 1.614 ± 0.356
1.552GlnThr: 1.552 ± 0.269
2.234GlnVal: 2.234 ± 0.425
0.621GlnTrp: 0.621 ± 0.193
1.055GlnTyr: 1.055 ± 0.3
0.0GlnXaa: 0.0 ± 0.0
Arg
4.655ArgAla: 4.655 ± 0.567
1.055ArgCys: 1.055 ± 0.376
3.662ArgAsp: 3.662 ± 0.492
4.903ArgGlu: 4.903 ± 0.643
2.545ArgPhe: 2.545 ± 0.47
4.22ArgGly: 4.22 ± 0.467
0.993ArgHis: 0.993 ± 0.275
3.724ArgIle: 3.724 ± 0.47
3.724ArgLys: 3.724 ± 0.615
5.027ArgLeu: 5.027 ± 0.601
1.676ArgMet: 1.676 ± 0.324
1.738ArgAsn: 1.738 ± 0.323
1.862ArgPro: 1.862 ± 0.327
2.11ArgGln: 2.11 ± 0.362
4.841ArgArg: 4.841 ± 0.661
3.414ArgSer: 3.414 ± 0.447
2.979ArgThr: 2.979 ± 0.417
4.158ArgVal: 4.158 ± 0.492
0.993ArgTrp: 0.993 ± 0.225
2.234ArgTyr: 2.234 ± 0.413
0.0ArgXaa: 0.0 ± 0.0
Ser
5.027SerAla: 5.027 ± 0.515
0.372SerCys: 0.372 ± 0.16
3.352SerAsp: 3.352 ± 0.593
3.289SerGlu: 3.289 ± 0.424
2.234SerPhe: 2.234 ± 0.481
5.027SerGly: 5.027 ± 0.728
0.807SerHis: 0.807 ± 0.184
2.731SerIle: 2.731 ± 0.349
2.607SerLys: 2.607 ± 0.39
4.593SerLeu: 4.593 ± 0.552
1.055SerMet: 1.055 ± 0.222
1.365SerAsn: 1.365 ± 0.336
4.034SerPro: 4.034 ± 0.421
2.11SerGln: 2.11 ± 0.361
3.786SerArg: 3.786 ± 0.548
3.165SerSer: 3.165 ± 0.532
3.227SerThr: 3.227 ± 0.355
3.662SerVal: 3.662 ± 0.499
1.055SerTrp: 1.055 ± 0.257
1.365SerTyr: 1.365 ± 0.261
0.0SerXaa: 0.0 ± 0.0
Thr
4.965ThrAla: 4.965 ± 0.53
0.559ThrCys: 0.559 ± 0.201
3.662ThrAsp: 3.662 ± 0.536
3.352ThrGlu: 3.352 ± 0.383
2.234ThrPhe: 2.234 ± 0.374
5.958ThrGly: 5.958 ± 0.708
1.179ThrHis: 1.179 ± 0.311
3.041ThrIle: 3.041 ± 0.461
3.289ThrLys: 3.289 ± 0.434
4.779ThrLeu: 4.779 ± 0.548
1.055ThrMet: 1.055 ± 0.251
1.924ThrAsn: 1.924 ± 0.368
4.096ThrPro: 4.096 ± 0.503
2.421ThrGln: 2.421 ± 0.351
2.979ThrArg: 2.979 ± 0.482
3.6ThrSer: 3.6 ± 0.48
2.917ThrThr: 2.917 ± 0.47
4.22ThrVal: 4.22 ± 0.455
1.303ThrTrp: 1.303 ± 0.277
1.428ThrTyr: 1.428 ± 0.278
0.0ThrXaa: 0.0 ± 0.0
Val
6.393ValAla: 6.393 ± 0.793
0.745ValCys: 0.745 ± 0.253
4.841ValAsp: 4.841 ± 0.61
4.593ValGlu: 4.593 ± 0.58
2.421ValPhe: 2.421 ± 0.458
5.214ValGly: 5.214 ± 0.481
1.365ValHis: 1.365 ± 0.337
3.165ValIle: 3.165 ± 0.483
3.6ValLys: 3.6 ± 0.472
5.214ValLeu: 5.214 ± 0.581
1.49ValMet: 1.49 ± 0.32
2.545ValAsn: 2.545 ± 0.387
3.414ValPro: 3.414 ± 0.493
1.986ValGln: 1.986 ± 0.352
4.407ValArg: 4.407 ± 0.603
3.91ValSer: 3.91 ± 0.57
4.469ValThr: 4.469 ± 0.609
5.214ValVal: 5.214 ± 0.712
1.552ValTrp: 1.552 ± 0.313
1.738ValTyr: 1.738 ± 0.306
0.0ValXaa: 0.0 ± 0.0
Trp
1.738TrpAla: 1.738 ± 0.37
0.31TrpCys: 0.31 ± 0.142
1.8TrpAsp: 1.8 ± 0.333
1.676TrpGlu: 1.676 ± 0.301
0.497TrpPhe: 0.497 ± 0.18
1.303TrpGly: 1.303 ± 0.356
0.497TrpHis: 0.497 ± 0.179
0.931TrpIle: 0.931 ± 0.278
0.621TrpLys: 0.621 ± 0.196
0.993TrpLeu: 0.993 ± 0.207
0.497TrpMet: 0.497 ± 0.171
1.117TrpAsn: 1.117 ± 0.357
0.869TrpPro: 0.869 ± 0.239
0.931TrpGln: 0.931 ± 0.301
0.993TrpArg: 0.993 ± 0.224
1.241TrpSer: 1.241 ± 0.318
1.117TrpThr: 1.117 ± 0.244
1.49TrpVal: 1.49 ± 0.327
0.931TrpTrp: 0.931 ± 0.283
0.434TrpTyr: 0.434 ± 0.178
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.607TyrAla: 2.607 ± 0.368
0.497TyrCys: 0.497 ± 0.216
2.048TyrAsp: 2.048 ± 0.325
2.358TyrGlu: 2.358 ± 0.367
0.621TyrPhe: 0.621 ± 0.201
2.545TyrGly: 2.545 ± 0.37
0.434TyrHis: 0.434 ± 0.162
1.303TyrIle: 1.303 ± 0.241
0.683TyrLys: 0.683 ± 0.243
2.855TyrLeu: 2.855 ± 0.398
0.248TyrMet: 0.248 ± 0.161
0.993TyrAsn: 0.993 ± 0.24
1.428TyrPro: 1.428 ± 0.311
0.993TyrGln: 0.993 ± 0.27
1.676TyrArg: 1.676 ± 0.333
2.234TyrSer: 2.234 ± 0.331
1.738TyrThr: 1.738 ± 0.379
2.607TyrVal: 2.607 ± 0.42
0.434TyrTrp: 0.434 ± 0.175
1.241TyrTyr: 1.241 ± 0.287
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 97 proteins (16113 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski