Amino acid dipeptide frequency for Mycobacterium phage BodEinwohner17

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the 400 standard dipeptides occurred uniformly at random in proteins, each would be present with a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
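As a rough illustration of how such values can be obtained, the sketch below counts dipeptides and converts the counts to per mille, then compares a value with the uniform expectation of 0.25% (2.5‰). It assumes sequences in one-letter code, overlapping pairs, and pairs that never span two proteins; the function names `dipeptide_per_mille` and `classify` are hypothetical, and the database's actual counting and colouring rules are not described on this page.

```python
from collections import Counter

# Uniform expectation for 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille.
UNIFORM_EXPECTED = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins.

    Assumption: overlapping pairs are counted within each protein only.
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if freq_per_mille > UNIFORM_EXPECTED:
        return "over"
    if freq_per_mille < UNIFORM_EXPECTED:
        return "under"
    return "expected"


# Toy input, not taken from the proteome: pairs are AA, AG and GA.
freqs = dipeptide_per_mille(["AAG", "GA"])
```

With the table values, `classify(12.67)` (AlaAla) gives "over" and `classify(0.868)` (AlaCys) gives "under" under this assumed rule.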

For more information, see the sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide, columns the second. Each cell shows the frequency ± error in ‰.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 12.67 ± 1.389 | 0.868 ± 0.219 | 6.948 ± 0.532 | 7.561 ± 0.74 | 3.065 ± 0.406 | 9.553 ± 1.251 | 2.503 ± 0.362 | 3.576 ± 0.515 | 4.087 ± 0.376 | 7.51 ± 0.663 | 2.963 ± 0.406 | 2.759 ± 0.455 | 4.802 ± 0.552 | 3.372 ± 0.42 | 6.948 ± 0.712 | 5.313 ± 0.646 | 6.233 ± 0.487 | 6.641 ± 0.556 | 2.146 ± 0.343 | 2.299 ± 0.355 | 0.0 ± 0.0
Cys | 0.971 ± 0.314 | 0.0 ± 0.0 | 1.226 ± 0.282 | 0.92 ± 0.279 | 0.204 ± 0.107 | 1.686 ± 0.33 | 0.307 ± 0.12 | 0.153 ± 0.094 | 0.664 ± 0.213 | 0.817 ± 0.238 | 0.102 ± 0.074 | 0.46 ± 0.15 | 1.226 ± 0.276 | 0.255 ± 0.105 | 0.664 ± 0.202 | 0.613 ± 0.236 | 0.92 ± 0.255 | 0.868 ± 0.177 | 0.255 ± 0.112 | 0.153 ± 0.089 | 0.0 ± 0.0
Asp | 6.488 ± 0.551 | 0.613 ± 0.184 | 4.7 ± 0.497 | 3.065 ± 0.388 | 1.584 ± 0.231 | 6.59 ± 0.597 | 1.43 ± 0.254 | 2.197 ± 0.311 | 1.941 ± 0.316 | 6.028 ± 0.501 | 1.277 ± 0.303 | 1.533 ± 0.346 | 4.445 ± 0.568 | 2.708 ± 0.316 | 5.415 ± 0.643 | 3.321 ± 0.535 | 4.138 ± 0.492 | 4.496 ± 0.507 | 1.737 ± 0.3 | 2.146 ± 0.307 | 0.0 ± 0.0
Glu | 5.875 ± 0.685 | 0.92 ± 0.222 | 2.759 ± 0.37 | 2.861 ± 0.479 | 2.248 ± 0.341 | 3.832 ± 0.397 | 1.737 ± 0.412 | 2.452 ± 0.448 | 1.992 ± 0.266 | 5.926 ± 0.703 | 1.533 ± 0.315 | 2.299 ± 0.333 | 2.81 ± 0.405 | 3.014 ± 0.393 | 5.466 ± 0.626 | 3.167 ± 0.413 | 3.985 ± 0.551 | 4.291 ± 0.529 | 1.43 ± 0.289 | 1.89 ± 0.281 | 0.0 ± 0.0
Phe | 3.167 ± 0.374 | 0.409 ± 0.141 | 2.503 ± 0.457 | 1.686 ± 0.305 | 0.92 ± 0.287 | 2.963 ± 0.762 | 0.562 ± 0.182 | 1.277 ± 0.351 | 0.971 ± 0.26 | 1.686 ± 0.263 | 0.715 ± 0.174 | 1.175 ± 0.332 | 1.533 ± 0.307 | 1.482 ± 0.299 | 1.533 ± 0.295 | 1.737 ± 0.296 | 2.197 ± 0.395 | 1.839 ± 0.252 | 0.92 ± 0.215 | 0.817 ± 0.234 | 0.0 ± 0.0
Gly | 8.685 ± 1.149 | 1.124 ± 0.263 | 5.722 ± 0.53 | 4.649 ± 0.549 | 2.605 ± 0.379 | 10.677 ± 2.597 | 2.299 ± 0.311 | 3.781 ± 0.499 | 2.861 ± 0.352 | 6.182 ± 0.567 | 1.992 ± 0.427 | 3.372 ± 0.431 | 3.934 ± 0.541 | 2.657 ± 0.728 | 4.956 ± 0.567 | 5.926 ± 0.856 | 5.977 ± 0.712 | 6.488 ± 0.617 | 2.197 ± 0.331 | 1.992 ± 0.331 | 0.0 ± 0.0
His | 2.095 ± 0.344 | 0.409 ± 0.17 | 1.175 ± 0.233 | 1.226 ± 0.292 | 0.511 ± 0.133 | 1.635 ± 0.25 | 0.971 ± 0.279 | 1.635 ± 0.298 | 0.868 ± 0.24 | 1.584 ± 0.269 | 0.46 ± 0.143 | 0.92 ± 0.234 | 1.686 ± 0.309 | 0.971 ± 0.226 | 2.299 ± 0.393 | 0.971 ± 0.177 | 1.584 ± 0.289 | 1.686 ± 0.324 | 0.562 ± 0.181 | 0.868 ± 0.182 | 0.0 ± 0.0
Ile | 4.802 ± 0.551 | 0.817 ± 0.309 | 3.627 ± 0.481 | 3.678 ± 0.407 | 0.766 ± 0.209 | 3.627 ± 0.317 | 1.43 ± 0.28 | 1.482 ± 0.286 | 0.971 ± 0.202 | 2.35 ± 0.359 | 0.255 ± 0.118 | 1.635 ± 0.253 | 2.503 ± 0.276 | 1.328 ± 0.262 | 2.197 ± 0.424 | 2.35 ± 0.469 | 3.525 ± 0.43 | 3.576 ± 0.417 | 0.92 ± 0.21 | 1.022 ± 0.194 | 0.0 ± 0.0
Lys | 3.985 ± 0.508 | 0.715 ± 0.245 | 1.686 ± 0.273 | 1.584 ± 0.264 | 1.226 ± 0.2 | 2.503 ± 0.336 | 1.022 ± 0.229 | 0.715 ± 0.171 | 1.328 ± 0.394 | 2.605 ± 0.446 | 0.868 ± 0.218 | 1.073 ± 0.231 | 2.401 ± 0.377 | 1.737 ± 0.306 | 2.452 ± 0.405 | 1.992 ± 0.306 | 1.89 ± 0.326 | 2.248 ± 0.376 | 0.92 ± 0.275 | 0.92 ± 0.239 | 0.0 ± 0.0
Leu | 8.021 ± 0.732 | 0.715 ± 0.229 | 4.445 ± 0.562 | 4.291 ± 0.509 | 2.81 ± 0.332 | 5.16 ± 0.703 | 1.073 ± 0.265 | 3.372 ± 0.404 | 2.35 ± 0.453 | 4.649 ± 0.629 | 1.941 ± 0.311 | 2.605 ± 0.378 | 4.956 ± 0.607 | 2.708 ± 0.453 | 4.904 ± 0.678 | 5.262 ± 0.532 | 5.518 ± 0.536 | 4.751 ± 0.518 | 1.482 ± 0.289 | 2.044 ± 0.378 | 0.0 ± 0.0
Met | 2.044 ± 0.323 | 0.255 ± 0.189 | 1.328 ± 0.271 | 1.022 ± 0.197 | 0.613 ± 0.154 | 2.044 ± 0.345 | 0.153 ± 0.08 | 0.971 ± 0.284 | 0.715 ± 0.183 | 1.941 ± 0.253 | 0.409 ± 0.183 | 0.868 ± 0.179 | 1.379 ± 0.277 | 0.358 ± 0.135 | 1.533 ± 0.279 | 2.861 ± 0.395 | 2.401 ± 0.337 | 1.175 ± 0.291 | 0.255 ± 0.123 | 0.46 ± 0.172 | 0.0 ± 0.0
Asn | 3.474 ± 0.37 | 0.204 ± 0.096 | 1.89 ± 0.321 | 1.992 ± 0.359 | 0.868 ± 0.28 | 4.138 ± 0.51 | 1.124 ± 0.19 | 1.584 ± 0.438 | 0.971 ± 0.272 | 2.299 ± 0.376 | 0.613 ± 0.143 | 1.788 ± 0.341 | 2.452 ± 0.359 | 1.277 ± 0.336 | 2.554 ± 0.471 | 1.379 ± 0.302 | 1.941 ± 0.26 | 1.635 ± 0.302 | 0.766 ± 0.224 | 0.715 ± 0.138 | 0.0 ± 0.0
Pro | 4.649 ± 0.58 | 0.511 ± 0.152 | 4.445 ± 0.539 | 4.24 ± 0.466 | 1.635 ± 0.283 | 5.926 ± 0.658 | 1.635 ± 0.287 | 1.941 ± 0.321 | 2.146 ± 0.432 | 4.189 ± 0.497 | 1.635 ± 0.316 | 1.992 ± 0.394 | 4.138 ± 0.565 | 2.503 ± 0.407 | 3.116 ± 0.512 | 3.27 ± 0.383 | 3.372 ± 0.475 | 4.445 ± 0.534 | 1.175 ± 0.291 | 1.686 ± 0.302 | 0.0 ± 0.0
Gln | 4.904 ± 0.65 | 0.562 ± 0.256 | 1.277 ± 0.294 | 1.839 ± 0.282 | 1.073 ± 0.256 | 2.248 ± 0.398 | 0.868 ± 0.206 | 1.839 ± 0.278 | 1.277 ± 0.225 | 2.963 ± 0.433 | 0.971 ± 0.219 | 0.868 ± 0.26 | 2.605 ± 0.418 | 1.43 ± 0.293 | 3.014 ± 0.448 | 2.299 ± 0.353 | 1.737 ± 0.372 | 2.554 ± 0.361 | 0.664 ± 0.185 | 1.022 ± 0.286 | 0.0 ± 0.0
Arg | 6.386 ± 0.574 | 1.022 ± 0.303 | 5.109 ± 0.638 | 5.569 ± 0.662 | 1.89 ± 0.368 | 4.291 ± 0.447 | 1.584 ± 0.266 | 4.24 ± 0.575 | 2.197 ± 0.398 | 4.904 ± 0.558 | 2.095 ± 0.356 | 2.35 ± 0.341 | 3.781 ± 0.543 | 2.146 ± 0.436 | 5.313 ± 0.785 | 3.781 ± 0.407 | 3.525 ± 0.459 | 5.364 ± 0.67 | 2.095 ± 0.374 | 2.248 ± 0.29 | 0.0 ± 0.0
Ser | 5.364 ± 0.796 | 0.562 ± 0.206 | 4.24 ± 0.498 | 3.627 ± 0.437 | 2.044 ± 0.394 | 6.131 ± 0.748 | 1.022 ± 0.191 | 2.861 ± 0.389 | 2.146 ± 0.357 | 3.985 ± 0.434 | 1.533 ± 0.282 | 2.197 ± 0.355 | 3.372 ± 0.314 | 1.737 ± 0.267 | 3.474 ± 0.389 | 3.832 ± 0.602 | 3.525 ± 0.464 | 4.547 ± 0.544 | 1.328 ± 0.229 | 1.328 ± 0.233 | 0.0 ± 0.0
Thr | 6.539 ± 0.593 | 0.817 ± 0.242 | 3.985 ± 0.519 | 3.576 ± 0.347 | 2.197 ± 0.314 | 6.131 ± 0.539 | 1.788 ± 0.299 | 3.576 ± 0.417 | 2.197 ± 0.377 | 4.598 ± 0.466 | 1.073 ± 0.268 | 1.941 ± 0.288 | 4.138 ± 0.49 | 1.788 ± 0.301 | 4.445 ± 0.44 | 3.627 ± 0.418 | 4.7 ± 0.611 | 5.518 ± 0.609 | 1.328 ± 0.267 | 1.686 ± 0.257 | 0.0 ± 0.0
Val | 6.846 ± 0.558 | 1.328 ± 0.282 | 5.773 ± 0.563 | 4.342 ± 0.557 | 2.299 ± 0.356 | 5.466 ± 0.592 | 1.482 ± 0.273 | 3.065 ± 0.387 | 2.503 ± 0.363 | 4.956 ± 0.645 | 1.277 ± 0.241 | 2.401 ± 0.351 | 4.087 ± 0.412 | 2.759 ± 0.311 | 5.058 ± 0.751 | 4.7 ± 0.573 | 5.262 ± 0.54 | 6.233 ± 0.573 | 1.686 ± 0.306 | 1.328 ± 0.317 | 0.0 ± 0.0
Trp | 2.503 ± 0.297 | 0.307 ± 0.12 | 1.43 ± 0.262 | 1.073 ± 0.305 | 0.817 ± 0.205 | 1.124 ± 0.307 | 0.562 ± 0.166 | 0.817 ± 0.227 | 1.022 ± 0.179 | 1.839 ± 0.306 | 0.766 ± 0.207 | 0.613 ± 0.236 | 1.124 ± 0.265 | 1.022 ± 0.224 | 2.248 ± 0.476 | 1.379 ± 0.262 | 1.482 ± 0.295 | 1.992 ± 0.454 | 1.073 ± 0.206 | 0.307 ± 0.119 | 0.0 ± 0.0
Tyr | 2.35 ± 0.34 | 0.307 ± 0.125 | 1.584 ± 0.313 | 1.686 ± 0.277 | 0.715 ± 0.193 | 2.35 ± 0.387 | 0.562 ± 0.2 | 1.328 ± 0.224 | 0.664 ± 0.203 | 2.044 ± 0.273 | 0.255 ± 0.122 | 0.817 ± 0.183 | 1.277 ± 0.241 | 0.817 ± 0.196 | 2.299 ± 0.396 | 1.073 ± 0.266 | 1.737 ± 0.316 | 2.503 ± 0.323 | 0.562 ± 0.158 | 0.511 ± 0.15 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 116 proteins (19575 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
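One way to read this note is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the 100 estimates is reported as the error. This is only an assumed procedure; the page does not say which spread measure is used (standard deviation is taken here), and the names `pair_per_mille` and `bootstrap_error` are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide in per mille, pooled over all proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Each round draws len(proteins) whole proteins with replacement, so the
    resampling unit is the protein, not the individual residue or dipeptide.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy sequences in one-letter code, not taken from the proteome.
toy = ["AAGA", "GACA", "CCAA", "AAAA"]
error = bootstrap_error(toy, "AA")
```

Resampling whole proteins rather than single dipeptides keeps the composition of each protein intact, so a few unusual proteins show up as a larger error.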

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski