Amino acid dipeptide frequency for Mycobacterium phage MyraDee

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
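A minimal sketch of how such a frequency could be computed is shown below. It assumes overlapping residue pairs are counted within each protein (never across protein boundaries), pooled over the whole proteome, and normalised by the total number of counted pairs. The function name `dipeptide_permille` is hypothetical, and the database may normalise differently.

```python
from collections import Counter


def dipeptide_permille(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: overlapping pairs are counted inside each protein and
    the counts are normalised by the total number of pairs.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2 along the sequence.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input using one-letter codes (not real MyraDee sequences).
freqs = dipeptide_permille(["AAC", "AC"])
print(freqs)
```

In the toy input the pairs are AA, AC and AC, so AC accounts for two thirds of all dipeptides.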

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
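The arithmetic behind the expected value can be sketched as follows. The helper names `expected_permille` and `classify` are hypothetical, and the exact rule the page uses for colouring is not stated, so a plain comparison against the uniform expectation is assumed here.

```python
def expected_permille(n_residue_types=20):
    """Uniform expectation per dipeptide, in per mille."""
    return 1000.0 / (n_residue_types ** 2)


def classify(observed_permille, n_residue_types=20):
    """Label a value relative to the uniform expectation.

    Assumption: simple comparison, no tolerance band or error bars.
    """
    expected = expected_permille(n_residue_types)
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "expected"


print(expected_permille(20))            # 2.5 (20 x 20 = 400 dipeptides)
print(round(expected_permille(21), 3))  # 2.268 (21 x 21 = 441 dipeptides)
print(classify(9.381))                  # over (AlaAla in the table)
print(classify(0.063))                  # under (CysCys in the table)
```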

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.381 ± 0.909
AlaCys: 0.57 ± 0.154
AlaAsp: 6.528 ± 0.648
AlaGlu: 7.226 ± 0.914
AlaPhe: 4.247 ± 0.563
AlaGly: 7.479 ± 0.929
AlaHis: 1.458 ± 0.333
AlaIle: 4.564 ± 0.621
AlaLys: 4.944 ± 0.682
AlaLeu: 9.191 ± 0.658
AlaMet: 2.789 ± 0.358
AlaAsn: 2.979 ± 0.399
AlaPro: 4.437 ± 0.775
AlaGln: 3.042 ± 0.388
AlaArg: 6.212 ± 0.732
AlaSer: 4.373 ± 0.373
AlaThr: 4.817 ± 0.604
AlaVal: 5.578 ± 0.679
AlaTrp: 2.028 ± 0.349
AlaTyr: 2.028 ± 0.397
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.697 ± 0.194
CysCys: 0.063 ± 0.062
CysAsp: 0.697 ± 0.186
CysGlu: 0.761 ± 0.199
CysPhe: 0.444 ± 0.139
CysGly: 1.078 ± 0.318
CysHis: 0.254 ± 0.107
CysIle: 0.317 ± 0.183
CysLys: 0.444 ± 0.154
CysLeu: 0.57 ± 0.191
CysMet: 0.127 ± 0.088
CysAsn: 0.38 ± 0.136
CysPro: 0.444 ± 0.194
CysGln: 0.063 ± 0.06
CysArg: 0.444 ± 0.176
CysSer: 0.38 ± 0.167
CysThr: 0.254 ± 0.122
CysVal: 0.761 ± 0.209
CysTrp: 0.38 ± 0.169
CysTyr: 0.127 ± 0.102
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.338 ± 0.662
AspCys: 0.824 ± 0.225
AspAsp: 4.5 ± 0.778
AspGlu: 5.261 ± 0.856
AspPhe: 2.218 ± 0.373
AspGly: 5.895 ± 0.513
AspHis: 1.965 ± 0.451
AspIle: 3.296 ± 0.517
AspLys: 2.852 ± 0.454
AspLeu: 6.655 ± 0.754
AspMet: 1.078 ± 0.274
AspAsn: 1.838 ± 0.339
AspPro: 4.881 ± 0.691
AspGln: 2.092 ± 0.394
AspArg: 4.12 ± 0.554
AspSer: 3.169 ± 0.376
AspThr: 4.057 ± 0.448
AspVal: 4.627 ± 0.474
AspTrp: 1.838 ± 0.328
AspTyr: 2.789 ± 0.401
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.845 ± 0.748
GluCys: 0.254 ± 0.12
GluAsp: 4.944 ± 0.605
GluGlu: 3.549 ± 0.47
GluPhe: 2.282 ± 0.341
GluGly: 5.831 ± 0.478
GluHis: 1.711 ± 0.358
GluIle: 3.866 ± 0.426
GluLys: 2.662 ± 0.431
GluLeu: 7.099 ± 0.685
GluMet: 1.648 ± 0.355
GluAsn: 2.345 ± 0.391
GluPro: 3.613 ± 0.603
GluGln: 3.233 ± 0.425
GluArg: 4.944 ± 0.682
GluSer: 2.852 ± 0.341
GluThr: 3.74 ± 0.501
GluVal: 5.071 ± 0.697
GluTrp: 1.711 ± 0.287
GluTyr: 2.092 ± 0.387
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.169 ± 0.409
PheCys: 0.507 ± 0.185
PheAsp: 3.106 ± 0.436
PheGlu: 2.599 ± 0.402
PhePhe: 0.761 ± 0.199
PheGly: 3.359 ± 0.501
PheHis: 0.761 ± 0.218
PheIle: 1.458 ± 0.318
PheLys: 1.521 ± 0.295
PheLeu: 3.233 ± 0.556
PheMet: 0.697 ± 0.182
PheAsn: 1.141 ± 0.208
PhePro: 1.775 ± 0.287
PheGln: 1.521 ± 0.326
PheArg: 2.409 ± 0.404
PheSer: 2.409 ± 0.377
PheThr: 1.331 ± 0.276
PheVal: 2.662 ± 0.447
PheTrp: 0.57 ± 0.152
PheTyr: 1.078 ± 0.277
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.212 ± 1.041
GlyCys: 0.887 ± 0.203
GlyAsp: 6.972 ± 0.915
GlyGlu: 5.388 ± 0.567
GlyPhe: 3.866 ± 0.465
GlyGly: 8.874 ± 1.404
GlyHis: 2.282 ± 0.352
GlyIle: 4.627 ± 0.794
GlyLys: 4.373 ± 0.535
GlyLeu: 6.719 ± 0.751
GlyMet: 2.852 ± 0.396
GlyAsn: 3.74 ± 0.652
GlyPro: 3.359 ± 0.572
GlyGln: 3.613 ± 0.612
GlyArg: 3.866 ± 0.453
GlySer: 4.817 ± 0.739
GlyThr: 5.768 ± 0.886
GlyVal: 5.641 ± 0.678
GlyTrp: 1.838 ± 0.33
GlyTyr: 2.535 ± 0.466
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.775 ± 0.363
HisCys: 0.19 ± 0.105
HisAsp: 1.331 ± 0.288
HisGlu: 1.141 ± 0.233
HisPhe: 0.887 ± 0.224
HisGly: 1.521 ± 0.375
HisHis: 0.38 ± 0.134
HisIle: 1.585 ± 0.311
HisLys: 0.634 ± 0.19
HisLeu: 1.521 ± 0.345
HisMet: 0.444 ± 0.187
HisAsn: 0.317 ± 0.152
HisPro: 1.204 ± 0.238
HisGln: 0.507 ± 0.181
HisArg: 1.521 ± 0.321
HisSer: 1.268 ± 0.324
HisThr: 1.204 ± 0.348
HisVal: 1.458 ± 0.366
HisTrp: 0.507 ± 0.177
HisTyr: 0.951 ± 0.298
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.134 ± 0.614
IleCys: 0.38 ± 0.144
IleAsp: 3.866 ± 0.451
IleGlu: 4.373 ± 0.497
IlePhe: 1.141 ± 0.229
IleGly: 4.183 ± 0.414
IleHis: 1.141 ± 0.239
IleIle: 1.902 ± 0.354
IleLys: 2.155 ± 0.516
IleLeu: 3.042 ± 0.395
IleMet: 0.634 ± 0.172
IleAsn: 2.028 ± 0.379
IlePro: 3.296 ± 0.473
IleGln: 1.838 ± 0.58
IleArg: 3.169 ± 0.449
IleSer: 2.599 ± 0.428
IleThr: 3.803 ± 0.473
IleVal: 2.979 ± 0.418
IleTrp: 0.761 ± 0.201
IleTyr: 1.078 ± 0.267
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.247 ± 0.598
LysCys: 0.063 ± 0.059
LysAsp: 2.155 ± 0.411
LysGlu: 1.902 ± 0.37
LysPhe: 1.711 ± 0.34
LysGly: 4.437 ± 0.786
LysHis: 1.078 ± 0.262
LysIle: 2.472 ± 0.378
LysLys: 2.472 ± 0.455
LysLeu: 3.169 ± 0.399
LysMet: 1.394 ± 0.389
LysAsn: 0.951 ± 0.243
LysPro: 2.789 ± 0.529
LysGln: 1.648 ± 0.305
LysArg: 2.662 ± 0.401
LysSer: 2.409 ± 0.453
LysThr: 2.599 ± 0.485
LysVal: 3.486 ± 0.416
LysTrp: 1.078 ± 0.247
LysTyr: 0.824 ± 0.205
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.493 ± 0.789
LeuCys: 0.57 ± 0.211
LeuAsp: 6.592 ± 0.7
LeuGlu: 5.895 ± 0.623
LeuPhe: 2.028 ± 0.374
LeuGly: 6.719 ± 0.674
LeuHis: 1.331 ± 0.354
LeuIle: 3.803 ± 0.52
LeuLys: 3.803 ± 0.605
LeuLeu: 6.275 ± 0.584
LeuMet: 2.599 ± 0.438
LeuAsn: 3.549 ± 0.488
LeuPro: 3.866 ± 0.481
LeuGln: 2.535 ± 0.429
LeuArg: 5.641 ± 0.608
LeuSer: 5.451 ± 0.461
LeuThr: 4.373 ± 0.655
LeuVal: 4.817 ± 0.538
LeuTrp: 1.648 ± 0.365
LeuTyr: 2.662 ± 0.42
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.789 ± 0.43
MetCys: 0.0 ± 0.0
MetAsp: 1.521 ± 0.295
MetGlu: 1.775 ± 0.353
MetPhe: 1.014 ± 0.254
MetGly: 1.838 ± 0.329
MetHis: 0.444 ± 0.168
MetIle: 0.697 ± 0.221
MetLys: 0.887 ± 0.288
MetLeu: 1.014 ± 0.255
MetMet: 0.444 ± 0.178
MetAsn: 1.078 ± 0.275
MetPro: 1.204 ± 0.268
MetGln: 0.824 ± 0.243
MetArg: 1.711 ± 0.32
MetSer: 2.599 ± 0.417
MetThr: 2.916 ± 0.438
MetVal: 1.458 ± 0.279
MetTrp: 0.507 ± 0.161
MetTyr: 0.444 ± 0.173
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.676 ± 0.492
AsnCys: 0.19 ± 0.094
AsnAsp: 1.711 ± 0.311
AsnGlu: 2.218 ± 0.333
AsnPhe: 0.634 ± 0.184
AsnGly: 4.754 ± 0.791
AsnHis: 0.697 ± 0.209
AsnIle: 1.711 ± 0.324
AsnLys: 1.711 ± 0.306
AsnLeu: 2.472 ± 0.388
AsnMet: 0.697 ± 0.182
AsnAsn: 0.951 ± 0.293
AsnPro: 2.472 ± 0.422
AsnGln: 1.014 ± 0.365
AsnArg: 2.155 ± 0.353
AsnSer: 1.838 ± 0.405
AsnThr: 1.268 ± 0.325
AsnVal: 2.725 ± 0.338
AsnTrp: 0.634 ± 0.207
AsnTyr: 1.078 ± 0.24
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.071 ± 0.729
ProCys: 0.634 ± 0.238
ProAsp: 3.423 ± 0.522
ProGlu: 4.881 ± 0.509
ProPhe: 1.648 ± 0.366
ProGly: 4.247 ± 0.704
ProHis: 0.761 ± 0.253
ProIle: 2.599 ± 0.39
ProLys: 1.902 ± 0.55
ProLeu: 3.359 ± 0.452
ProMet: 1.078 ± 0.248
ProAsn: 1.775 ± 0.395
ProPro: 2.535 ± 0.464
ProGln: 2.028 ± 0.273
ProArg: 2.662 ± 0.456
ProSer: 2.409 ± 0.387
ProThr: 3.169 ± 0.544
ProVal: 3.866 ± 0.574
ProTrp: 1.268 ± 0.495
ProTyr: 1.394 ± 0.319
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.437 ± 0.468
GlnCys: 0.19 ± 0.111
GlnAsp: 1.838 ± 0.354
GlnGlu: 2.472 ± 0.465
GlnPhe: 1.141 ± 0.27
GlnGly: 3.866 ± 1.536
GlnHis: 0.951 ± 0.22
GlnIle: 1.902 ± 0.44
GlnLys: 1.141 ± 0.338
GlnLeu: 3.169 ± 0.5
GlnMet: 0.824 ± 0.241
GlnAsn: 1.204 ± 0.344
GlnPro: 1.268 ± 0.395
GlnGln: 1.648 ± 0.552
GlnArg: 1.711 ± 0.292
GlnSer: 1.648 ± 0.424
GlnThr: 1.965 ± 0.37
GlnVal: 2.725 ± 0.315
GlnTrp: 0.761 ± 0.215
GlnTyr: 1.078 ± 0.247
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.085 ± 0.6
ArgCys: 0.951 ± 0.251
ArgAsp: 4.5 ± 0.507
ArgGlu: 4.437 ± 0.518
ArgPhe: 2.852 ± 0.409
ArgGly: 4.437 ± 0.6
ArgHis: 1.014 ± 0.239
ArgIle: 2.345 ± 0.352
ArgLys: 2.662 ± 0.38
ArgLeu: 5.514 ± 0.511
ArgMet: 1.838 ± 0.352
ArgAsn: 1.775 ± 0.359
ArgPro: 2.282 ± 0.356
ArgGln: 2.409 ± 0.446
ArgArg: 5.071 ± 0.589
ArgSer: 2.725 ± 0.394
ArgThr: 3.296 ± 0.472
ArgVal: 4.5 ± 0.462
ArgTrp: 1.204 ± 0.263
ArgTyr: 2.028 ± 0.393
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.057 ± 0.568
SerCys: 0.38 ± 0.149
SerAsp: 4.627 ± 0.656
SerGlu: 3.169 ± 0.382
SerPhe: 2.218 ± 0.357
SerGly: 4.5 ± 0.586
SerHis: 1.204 ± 0.258
SerIle: 3.106 ± 0.452
SerLys: 1.838 ± 0.295
SerLeu: 5.514 ± 0.654
SerMet: 1.458 ± 0.299
SerAsn: 2.472 ± 0.325
SerPro: 2.409 ± 0.492
SerGln: 2.092 ± 0.417
SerArg: 3.803 ± 0.445
SerSer: 2.472 ± 0.349
SerThr: 2.409 ± 0.328
SerVal: 2.662 ± 0.442
SerTrp: 0.951 ± 0.252
SerTyr: 1.585 ± 0.324
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.197 ± 0.701
ThrCys: 0.444 ± 0.143
ThrAsp: 3.549 ± 0.393
ThrGlu: 4.12 ± 0.506
ThrPhe: 2.345 ± 0.416
ThrGly: 5.578 ± 0.623
ThrHis: 0.951 ± 0.254
ThrIle: 2.725 ± 0.454
ThrLys: 2.282 ± 0.404
ThrLeu: 5.007 ± 0.659
ThrMet: 1.585 ± 0.357
ThrAsn: 1.775 ± 0.342
ThrPro: 3.613 ± 0.555
ThrGln: 1.775 ± 0.331
ThrArg: 2.725 ± 0.381
ThrSer: 2.789 ± 0.416
ThrThr: 2.345 ± 0.371
ThrVal: 4.5 ± 0.378
ThrTrp: 1.331 ± 0.283
ThrTyr: 1.775 ± 0.314
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.958 ± 0.564
ValCys: 0.951 ± 0.244
ValAsp: 4.437 ± 0.666
ValGlu: 4.754 ± 0.572
ValPhe: 2.472 ± 0.48
ValGly: 5.451 ± 0.612
ValHis: 1.331 ± 0.277
ValIle: 3.803 ± 0.644
ValLys: 3.803 ± 0.527
ValLeu: 4.944 ± 0.524
ValMet: 1.331 ± 0.312
ValAsn: 2.155 ± 0.354
ValPro: 3.486 ± 0.451
ValGln: 2.472 ± 0.526
ValArg: 3.74 ± 0.459
ValSer: 3.549 ± 0.515
ValThr: 4.31 ± 0.487
ValVal: 5.261 ± 0.653
ValTrp: 1.521 ± 0.292
ValTyr: 2.028 ± 0.312
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.965 ± 0.377
TrpCys: 0.317 ± 0.14
TrpAsp: 1.775 ± 0.367
TrpGlu: 1.458 ± 0.303
TrpPhe: 1.141 ± 0.343
TrpGly: 1.648 ± 0.263
TrpHis: 0.317 ± 0.139
TrpIle: 1.268 ± 0.243
TrpLys: 0.57 ± 0.177
TrpLeu: 1.648 ± 0.327
TrpMet: 0.634 ± 0.173
TrpAsn: 1.078 ± 0.242
TrpPro: 0.634 ± 0.19
TrpGln: 0.697 ± 0.227
TrpArg: 0.761 ± 0.241
TrpSer: 1.711 ± 0.286
TrpThr: 1.394 ± 0.363
TrpVal: 1.521 ± 0.318
TrpTrp: 0.507 ± 0.168
TrpTyr: 0.824 ± 0.24
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.345 ± 0.355
TyrCys: 0.317 ± 0.132
TyrAsp: 2.218 ± 0.411
TyrGlu: 2.916 ± 0.413
TyrPhe: 1.014 ± 0.246
TyrGly: 2.535 ± 0.489
TyrHis: 0.317 ± 0.162
TyrIle: 1.458 ± 0.288
TyrLys: 0.887 ± 0.258
TyrLeu: 2.535 ± 0.358
TyrMet: 0.951 ± 0.237
TyrAsn: 1.078 ± 0.267
TyrPro: 1.141 ± 0.243
TyrGln: 0.887 ± 0.195
TyrArg: 2.599 ± 0.482
TyrSer: 1.458 ± 0.317
TyrThr: 1.394 ± 0.251
TyrVal: 1.458 ± 0.244
TyrTrp: 0.887 ± 0.236
TyrTyr: 0.824 ± 0.231
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 94 proteins (15778 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
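One way to read this note is sketched below: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. The function name `bootstrap_error`, the use of the population standard deviation, and the normalisation by the number of counted pairs are all assumptions; the database's actual procedure is not described here.

```python
import random
import statistics
from collections import Counter


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Estimate the error (in per mille) of one dipeptide frequency.

    Assumption: proteins, not residues, are the resampling unit, and
    the error is the standard deviation over the bootstrap rounds.
    """
    rng = random.Random(seed)
    # Count overlapping pairs once per protein so resampling is cheap.
    per_protein = [
        Counter(seq[i:i + 2] for i in range(len(seq) - 1))
        for seq in proteins
    ]
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)


# Toy input using one-letter codes (not real MyraDee sequences).
toy = ["AAAC", "ACAC", "CCCA", "AACC", "CACA"]
print(bootstrap_error(toy, "AC"))
```

Resampling at the protein level keeps each protein's dipeptides together, so proteins with unusual composition contribute to the error as a unit.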

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: UniProt
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski