Amino acid dipeptide frequency for Mycobacterium phage Konstantine

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
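As a minimal sketch of how such a frequency could be computed, the Python snippet below counts overlapping adjacent residue pairs within each protein and reports them in per mille. The function name `dipeptide_frequencies`, the one-letter toy sequences, and the choice to normalise by the total number of counted pairs are assumptions made for illustration; the exact normalisation used by the database is not stated on this page.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} for overlapping adjacent pairs.

    Assumption: frequencies are normalised by the total number of pairs counted.
    """
    counts = Counter()
    for seq in sequences:
        # Each protein is scanned on its own, so no pair spans two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy proteome in one-letter code (not taken from the table).
toy_proteome = ["MAAK", "MAG"]
freqs = dipeptide_frequencies(toy_proteome)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f}")
```

The table on this page uses three-letter codes (for example AlaAla), so a real pipeline would need a mapping between the two notations.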

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely to occur in proteins, each of the 400 would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
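The comparison against the uniform expectation can be sketched as follows. The snippet assumes that the threshold is simply 1000/400 = 2.5‰ and that values are given in per mille; the `classify` helper is hypothetical, and the four observed values are copied from the table on this page.

```python
# Uniform expectation: 1 of 400 dipeptides, i.e. 0.25 % or 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide relative to the uniform expectation (assumed threshold)."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"


# A few observed values (per mille) taken from the table on this page.
observed = {"AlaAla": 9.854, "CysCys": 0.237, "LeuAla": 7.248, "TrpTrp": 0.284}
labels = {pair: classify(value) for pair, value in observed.items()}
for pair, label in labels.items():
    print(pair, label)
```

Under this assumption AlaAla and LeuAla come out as overrepresented, while CysCys and TrpTrp come out as underrepresented.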

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.854 ± 1.281
AlaCys: 0.474 ± 0.134
AlaAsp: 5.875 ± 0.556
AlaGlu: 6.727 ± 0.677
AlaPhe: 4.074 ± 0.803
AlaGly: 6.869 ± 0.72
AlaHis: 1.516 ± 0.271
AlaIle: 4.595 ± 0.657
AlaLys: 5.638 ± 0.625
AlaLeu: 6.68 ± 0.715
AlaMet: 2.132 ± 0.32
AlaAsn: 3.316 ± 0.352
AlaPro: 3.79 ± 0.502
AlaGln: 4.027 ± 0.567
AlaArg: 6.206 ± 0.759
AlaSer: 4.264 ± 0.644
AlaThr: 4.406 ± 0.399
AlaVal: 6.017 ± 0.681
AlaTrp: 1.516 ± 0.319
AlaTyr: 2.416 ± 0.323
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.9 ± 0.258
CysCys: 0.237 ± 0.13
CysAsp: 0.663 ± 0.21
CysGlu: 0.521 ± 0.168
CysPhe: 0.142 ± 0.089
CysGly: 0.948 ± 0.237
CysHis: 0.19 ± 0.097
CysIle: 0.332 ± 0.114
CysLys: 0.474 ± 0.155
CysLeu: 0.237 ± 0.111
CysMet: 0.19 ± 0.092
CysAsn: 0.237 ± 0.098
CysPro: 0.474 ± 0.159
CysGln: 0.142 ± 0.089
CysArg: 0.711 ± 0.219
CysSer: 0.379 ± 0.119
CysThr: 0.521 ± 0.173
CysVal: 0.521 ± 0.164
CysTrp: 0.0 ± 0.0
CysTyr: 0.237 ± 0.097
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.69 ± 0.423
AspCys: 0.379 ± 0.126
AspAsp: 4.88 ± 0.625
AspGlu: 5.59 ± 0.695
AspPhe: 3.079 ± 0.425
AspGly: 5.448 ± 0.573
AspHis: 1.658 ± 0.33
AspIle: 3.458 ± 0.36
AspLys: 3.364 ± 0.499
AspLeu: 5.922 ± 0.564
AspMet: 1.563 ± 0.272
AspAsn: 2.037 ± 0.285
AspPro: 4.785 ± 0.589
AspGln: 2.037 ± 0.298
AspArg: 4.027 ± 0.418
AspSer: 4.027 ± 0.438
AspThr: 3.269 ± 0.362
AspVal: 3.98 ± 0.352
AspTrp: 1.184 ± 0.309
AspTyr: 2.179 ± 0.373
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.585 ± 0.687
GluCys: 0.616 ± 0.151
GluAsp: 5.543 ± 0.538
GluGlu: 4.169 ± 0.555
GluPhe: 3.127 ± 0.382
GluGly: 3.743 ± 0.382
GluHis: 1.279 ± 0.255
GluIle: 4.406 ± 0.414
GluLys: 3.079 ± 0.487
GluLeu: 4.927 ± 0.431
GluMet: 1.611 ± 0.267
GluAsn: 3.269 ± 0.411
GluPro: 2.606 ± 0.391
GluGln: 2.369 ± 0.313
GluArg: 3.837 ± 0.578
GluSer: 4.027 ± 0.489
GluThr: 4.074 ± 0.45
GluVal: 3.932 ± 0.393
GluTrp: 1.137 ± 0.284
GluTyr: 1.706 ± 0.341
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.411 ± 0.512
PheCys: 0.142 ± 0.084
PheAsp: 2.795 ± 0.359
PheGlu: 2.558 ± 0.344
PhePhe: 0.758 ± 0.192
PheGly: 4.406 ± 0.537
PheHis: 0.569 ± 0.162
PheIle: 1.99 ± 0.338
PheLys: 1.8 ± 0.292
PheLeu: 2.369 ± 0.352
PheMet: 0.948 ± 0.221
PheAsn: 1.942 ± 0.468
PhePro: 2.037 ± 0.348
PheGln: 0.711 ± 0.21
PheArg: 2.795 ± 0.366
PheSer: 1.421 ± 0.228
PheThr: 2.085 ± 0.269
PheVal: 2.558 ± 0.26
PheTrp: 0.426 ± 0.162
PheTyr: 0.995 ± 0.274
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.111 ± 1.068
GlyCys: 0.474 ± 0.171
GlyAsp: 4.643 ± 0.456
GlyGlu: 5.59 ± 0.458
GlyPhe: 3.174 ± 0.339
GlyGly: 6.917 ± 0.74
GlyHis: 1.374 ± 0.275
GlyIle: 4.738 ± 0.435
GlyLys: 4.501 ± 0.4
GlyLeu: 6.869 ± 0.781
GlyMet: 2.321 ± 0.311
GlyAsn: 3.885 ± 0.545
GlyPro: 3.316 ± 0.399
GlyGln: 2.7 ± 0.345
GlyArg: 4.69 ± 0.509
GlySer: 5.211 ± 0.498
GlyThr: 6.254 ± 0.659
GlyVal: 6.111 ± 0.413
GlyTrp: 2.369 ± 0.308
GlyTyr: 3.222 ± 0.421
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.516 ± 0.23
HisCys: 0.332 ± 0.128
HisAsp: 1.042 ± 0.191
HisGlu: 1.09 ± 0.25
HisPhe: 0.711 ± 0.199
HisGly: 1.232 ± 0.249
HisHis: 0.379 ± 0.163
HisIle: 0.711 ± 0.19
HisLys: 1.184 ± 0.25
HisLeu: 1.184 ± 0.284
HisMet: 0.284 ± 0.109
HisAsn: 0.995 ± 0.225
HisPro: 1.09 ± 0.29
HisGln: 0.711 ± 0.187
HisArg: 1.611 ± 0.295
HisSer: 0.805 ± 0.193
HisThr: 0.948 ± 0.221
HisVal: 1.184 ± 0.243
HisTrp: 0.569 ± 0.171
HisTyr: 0.805 ± 0.199
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.496 ± 0.627
IleCys: 0.616 ± 0.177
IleAsp: 4.074 ± 0.402
IleGlu: 3.743 ± 0.424
IlePhe: 2.085 ± 0.334
IleGly: 4.359 ± 0.512
IleHis: 0.853 ± 0.219
IleIle: 2.179 ± 0.323
IleLys: 2.179 ± 0.358
IleLeu: 3.837 ± 0.359
IleMet: 1.279 ± 0.229
IleAsn: 2.416 ± 0.408
IlePro: 3.079 ± 0.456
IleGln: 1.611 ± 0.263
IleArg: 4.501 ± 0.427
IleSer: 3.506 ± 0.403
IleThr: 3.458 ± 0.369
IleVal: 4.264 ± 0.372
IleTrp: 0.805 ± 0.188
IleTyr: 1.374 ± 0.275
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.543 ± 0.679
LysCys: 0.19 ± 0.119
LysAsp: 2.843 ± 0.374
LysGlu: 1.8 ± 0.36
LysPhe: 1.99 ± 0.332
LysGly: 3.411 ± 0.433
LysHis: 0.995 ± 0.263
LysIle: 4.027 ± 0.43
LysLys: 3.127 ± 0.554
LysLeu: 5.401 ± 0.497
LysMet: 1.753 ± 0.277
LysAsn: 1.848 ± 0.258
LysPro: 2.321 ± 0.347
LysGln: 2.274 ± 0.421
LysArg: 3.695 ± 0.476
LysSer: 2.606 ± 0.417
LysThr: 2.937 ± 0.397
LysVal: 3.222 ± 0.492
LysTrp: 0.758 ± 0.235
LysTyr: 1.232 ± 0.231
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.248 ± 0.956
LeuCys: 0.569 ± 0.192
LeuAsp: 4.406 ± 0.405
LeuGlu: 4.595 ± 0.441
LeuPhe: 2.037 ± 0.297
LeuGly: 5.78 ± 0.623
LeuHis: 0.995 ± 0.207
LeuIle: 4.359 ± 0.569
LeuLys: 3.885 ± 0.39
LeuLeu: 4.88 ± 0.546
LeuMet: 1.279 ± 0.22
LeuAsn: 3.648 ± 0.424
LeuPro: 5.211 ± 0.562
LeuGln: 2.227 ± 0.312
LeuArg: 5.022 ± 0.512
LeuSer: 4.453 ± 0.509
LeuThr: 4.216 ± 0.552
LeuVal: 4.311 ± 0.478
LeuTrp: 1.137 ± 0.212
LeuTyr: 1.848 ± 0.363
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.227 ± 0.391
MetCys: 0.284 ± 0.125
MetAsp: 1.8 ± 0.28
MetGlu: 1.09 ± 0.223
MetPhe: 0.616 ± 0.153
MetGly: 2.085 ± 0.337
MetHis: 0.474 ± 0.163
MetIle: 1.232 ± 0.275
MetLys: 1.516 ± 0.266
MetLeu: 1.374 ± 0.27
MetMet: 0.616 ± 0.199
MetAsn: 0.948 ± 0.214
MetPro: 0.948 ± 0.238
MetGln: 1.232 ± 0.212
MetArg: 1.232 ± 0.231
MetSer: 1.516 ± 0.26
MetThr: 1.753 ± 0.328
MetVal: 1.658 ± 0.282
MetTrp: 0.569 ± 0.196
MetTyr: 0.521 ± 0.193
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.458 ± 0.52
AsnCys: 0.332 ± 0.163
AsnAsp: 2.369 ± 0.3
AsnGlu: 2.416 ± 0.283
AsnPhe: 1.516 ± 0.373
AsnGly: 3.932 ± 0.361
AsnHis: 1.042 ± 0.248
AsnIle: 1.611 ± 0.258
AsnLys: 1.942 ± 0.311
AsnLeu: 3.222 ± 0.346
AsnMet: 0.616 ± 0.173
AsnAsn: 1.042 ± 0.226
AsnPro: 3.553 ± 0.471
AsnGln: 1.374 ± 0.296
AsnArg: 3.032 ± 0.367
AsnSer: 2.132 ± 0.31
AsnThr: 2.558 ± 0.329
AsnVal: 3.316 ± 0.378
AsnTrp: 0.9 ± 0.209
AsnTyr: 1.232 ± 0.259
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.88 ± 0.521
ProCys: 0.332 ± 0.127
ProAsp: 4.264 ± 0.675
ProGlu: 4.406 ± 0.609
ProPhe: 1.706 ± 0.27
ProGly: 4.785 ± 0.448
ProHis: 0.616 ± 0.196
ProIle: 2.274 ± 0.291
ProLys: 2.606 ± 0.455
ProLeu: 3.506 ± 0.428
ProMet: 0.758 ± 0.217
ProAsn: 2.653 ± 0.422
ProPro: 2.321 ± 0.37
ProGln: 2.037 ± 0.307
ProArg: 2.511 ± 0.378
ProSer: 3.364 ± 0.483
ProThr: 2.274 ± 0.322
ProVal: 3.932 ± 0.509
ProTrp: 1.279 ± 0.221
ProTyr: 1.374 ± 0.228
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.7 ± 0.417
GlnCys: 0.19 ± 0.102
GlnAsp: 2.179 ± 0.362
GlnGlu: 2.085 ± 0.314
GlnPhe: 1.374 ± 0.259
GlnGly: 2.937 ± 0.517
GlnHis: 0.758 ± 0.244
GlnIle: 2.748 ± 0.309
GlnLys: 2.085 ± 0.4
GlnLeu: 2.511 ± 0.36
GlnMet: 1.137 ± 0.214
GlnAsn: 1.279 ± 0.215
GlnPro: 1.753 ± 0.343
GlnGln: 2.132 ± 0.499
GlnArg: 2.416 ± 0.348
GlnSer: 1.99 ± 0.28
GlnThr: 1.895 ± 0.348
GlnVal: 2.464 ± 0.337
GlnTrp: 0.426 ± 0.131
GlnTyr: 0.853 ± 0.236
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.922 ± 0.633
ArgCys: 0.711 ± 0.232
ArgAsp: 4.501 ± 0.553
ArgGlu: 4.785 ± 0.57
ArgPhe: 2.89 ± 0.349
ArgGly: 4.88 ± 0.477
ArgHis: 1.279 ± 0.237
ArgIle: 3.695 ± 0.368
ArgLys: 4.359 ± 0.67
ArgLeu: 3.932 ± 0.373
ArgMet: 2.274 ± 0.347
ArgAsn: 2.748 ± 0.322
ArgPro: 2.558 ± 0.377
ArgGln: 2.321 ± 0.325
ArgArg: 6.017 ± 0.682
ArgSer: 3.458 ± 0.436
ArgThr: 3.601 ± 0.363
ArgVal: 4.264 ± 0.539
ArgTrp: 1.421 ± 0.231
ArgTyr: 1.895 ± 0.341
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.306 ± 0.625
SerCys: 0.521 ± 0.137
SerAsp: 3.648 ± 0.514
SerGlu: 3.837 ± 0.527
SerPhe: 2.037 ± 0.306
SerGly: 6.585 ± 0.542
SerHis: 1.232 ± 0.274
SerIle: 2.369 ± 0.366
SerLys: 2.227 ± 0.353
SerLeu: 3.222 ± 0.529
SerMet: 1.8 ± 0.237
SerAsn: 2.037 ± 0.303
SerPro: 2.653 ± 0.359
SerGln: 1.469 ± 0.369
SerArg: 3.79 ± 0.434
SerSer: 3.364 ± 0.485
SerThr: 3.079 ± 0.407
SerVal: 4.216 ± 0.495
SerTrp: 0.995 ± 0.236
SerTyr: 1.753 ± 0.395
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.974 ± 0.508
ThrCys: 0.474 ± 0.164
ThrAsp: 3.648 ± 0.381
ThrGlu: 3.648 ± 0.437
ThrPhe: 1.658 ± 0.329
ThrGly: 6.301 ± 0.53
ThrHis: 1.184 ± 0.191
ThrIle: 3.269 ± 0.441
ThrLys: 2.748 ± 0.309
ThrLeu: 3.601 ± 0.491
ThrMet: 0.9 ± 0.198
ThrAsn: 2.416 ± 0.375
ThrPro: 3.458 ± 0.451
ThrGln: 1.99 ± 0.267
ThrArg: 3.079 ± 0.324
ThrSer: 3.648 ± 0.455
ThrThr: 2.369 ± 0.337
ThrVal: 4.264 ± 0.419
ThrTrp: 1.232 ± 0.33
ThrTyr: 1.848 ± 0.256
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.064 ± 0.513
ValCys: 0.758 ± 0.177
ValAsp: 5.164 ± 0.44
ValGlu: 4.359 ± 0.475
ValPhe: 2.321 ± 0.326
ValGly: 5.543 ± 0.528
ValHis: 1.09 ± 0.227
ValIle: 4.738 ± 0.492
ValLys: 3.269 ± 0.387
ValLeu: 4.595 ± 0.536
ValMet: 1.184 ± 0.243
ValAsn: 2.7 ± 0.278
ValPro: 3.695 ± 0.475
ValGln: 2.748 ± 0.3
ValArg: 4.738 ± 0.529
ValSer: 3.98 ± 0.415
ValThr: 3.932 ± 0.419
ValVal: 4.501 ± 0.557
ValTrp: 1.374 ± 0.237
ValTyr: 1.327 ± 0.275
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.516 ± 0.256
TrpCys: 0.19 ± 0.078
TrpAsp: 1.042 ± 0.228
TrpGlu: 1.327 ± 0.277
TrpPhe: 0.616 ± 0.195
TrpGly: 1.563 ± 0.232
TrpHis: 0.426 ± 0.162
TrpIle: 1.516 ± 0.32
TrpLys: 0.758 ± 0.168
TrpLeu: 1.374 ± 0.242
TrpMet: 0.237 ± 0.101
TrpAsn: 1.09 ± 0.249
TrpPro: 0.616 ± 0.196
TrpGln: 0.853 ± 0.173
TrpArg: 1.611 ± 0.315
TrpSer: 0.616 ± 0.151
TrpThr: 1.327 ± 0.19
TrpVal: 1.611 ± 0.3
TrpTrp: 0.284 ± 0.12
TrpTyr: 0.474 ± 0.134
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.037 ± 0.347
TyrCys: 0.237 ± 0.113
TyrAsp: 2.179 ± 0.315
TyrGlu: 1.706 ± 0.264
TyrPhe: 0.805 ± 0.185
TyrGly: 2.748 ± 0.388
TyrHis: 0.474 ± 0.117
TyrIle: 1.469 ± 0.261
TyrLys: 1.232 ± 0.25
TyrLeu: 2.464 ± 0.405
TyrMet: 0.758 ± 0.227
TyrAsn: 1.09 ± 0.212
TyrPro: 1.611 ± 0.337
TyrGln: 0.948 ± 0.219
TyrArg: 1.99 ± 0.365
TyrSer: 1.374 ± 0.313
TyrThr: 1.753 ± 0.262
TyrVal: 1.753 ± 0.313
TyrTrp: 0.663 ± 0.181
TyrTyr: 0.758 ± 0.171
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 95 proteins (21109 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
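Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicate values is reported. This is only an illustration under stated assumptions. The helper names, the fixed seed, the toy sequences, and the use of the sample standard deviation as the error measure are hypothetical choices, not the database's documented procedure.

```python
import random
import statistics


def pair_frequency(sequences, pair):
    """Per-mille frequency of one dipeptide among all adjacent pairs."""
    hits = 0
    total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        resampled = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_frequency(resampled, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome in one-letter code.
toy_proteome = ["MAAK", "MAG", "AAAA", "MKKL"]
error = bootstrap_error(toy_proteome, "AA")
print(f"AA: {pair_frequency(toy_proteome, 'AA'):.1f} ± {error:.1f}")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the dipeptides inside one protein stay together in every replicate.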

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski