Amino acid dipepetide frequency for Mycobacterium phage 32HC

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
16.897AlaAla: 16.897 ± 1.415
0.488AlaCys: 0.488 ± 0.18
8.845AlaAsp: 8.845 ± 0.94
9.394AlaGlu: 9.394 ± 0.925
3.172AlaPhe: 3.172 ± 0.418
15.433AlaGly: 15.433 ± 1.438
1.83AlaHis: 1.83 ± 0.379
4.819AlaIle: 4.819 ± 0.465
3.477AlaLys: 3.477 ± 0.492
8.967AlaLeu: 8.967 ± 0.758
2.989AlaMet: 2.989 ± 0.514
3.66AlaAsn: 3.66 ± 0.457
6.161AlaPro: 6.161 ± 0.582
4.209AlaGln: 4.209 ± 0.691
9.028AlaArg: 9.028 ± 0.988
5.734AlaSer: 5.734 ± 0.795
6.039AlaThr: 6.039 ± 0.682
7.747AlaVal: 7.747 ± 0.765
1.952AlaTrp: 1.952 ± 0.37
2.318AlaTyr: 2.318 ± 0.343
0.0AlaXaa: 0.0 ± 0.0
Cys
1.342CysAla: 1.342 ± 0.304
0.244CysCys: 0.244 ± 0.146
0.732CysAsp: 0.732 ± 0.316
0.732CysGlu: 0.732 ± 0.226
0.122CysPhe: 0.122 ± 0.094
1.464CysGly: 1.464 ± 0.291
0.244CysHis: 0.244 ± 0.12
0.305CysIle: 0.305 ± 0.129
0.488CysLys: 0.488 ± 0.2
0.671CysLeu: 0.671 ± 0.213
0.122CysMet: 0.122 ± 0.093
0.427CysAsn: 0.427 ± 0.165
1.22CysPro: 1.22 ± 0.364
0.183CysGln: 0.183 ± 0.104
0.976CysArg: 0.976 ± 0.243
0.488CysSer: 0.488 ± 0.173
0.549CysThr: 0.549 ± 0.271
0.427CysVal: 0.427 ± 0.176
0.366CysTrp: 0.366 ± 0.155
0.305CysTyr: 0.305 ± 0.139
0.0CysXaa: 0.0 ± 0.0
Asp
8.906AspAla: 8.906 ± 0.84
1.037AspCys: 1.037 ± 0.29
5.49AspAsp: 5.49 ± 0.735
5.124AspGlu: 5.124 ± 0.717
1.891AspPhe: 1.891 ± 0.347
6.344AspGly: 6.344 ± 0.667
1.037AspHis: 1.037 ± 0.275
2.745AspIle: 2.745 ± 0.499
1.891AspLys: 1.891 ± 0.317
5.368AspLeu: 5.368 ± 0.6
1.037AspMet: 1.037 ± 0.272
1.952AspAsn: 1.952 ± 0.468
6.039AspPro: 6.039 ± 0.68
2.135AspGln: 2.135 ± 0.386
4.88AspArg: 4.88 ± 0.482
3.782AspSer: 3.782 ± 0.661
4.026AspThr: 4.026 ± 0.48
5.551AspVal: 5.551 ± 0.624
1.098AspTrp: 1.098 ± 0.203
1.281AspTyr: 1.281 ± 0.318
0.0AspXaa: 0.0 ± 0.0
Glu
6.039GluAla: 6.039 ± 0.868
0.549GluCys: 0.549 ± 0.281
3.477GluAsp: 3.477 ± 0.515
2.379GluGlu: 2.379 ± 0.355
1.83GluPhe: 1.83 ± 0.32
2.013GluGly: 2.013 ± 0.396
1.281GluHis: 1.281 ± 0.234
3.416GluIle: 3.416 ± 0.513
1.464GluLys: 1.464 ± 0.316
6.771GluLeu: 6.771 ± 0.588
1.464GluMet: 1.464 ± 0.339
0.854GluAsn: 0.854 ± 0.231
4.087GluPro: 4.087 ± 0.522
2.867GluGln: 2.867 ± 0.456
4.88GluArg: 4.88 ± 0.689
2.623GluSer: 2.623 ± 0.327
2.806GluThr: 2.806 ± 0.429
3.416GluVal: 3.416 ± 0.385
1.403GluTrp: 1.403 ± 0.278
1.037GluTyr: 1.037 ± 0.238
0.0GluXaa: 0.0 ± 0.0
Phe
3.294PheAla: 3.294 ± 0.464
0.366PheCys: 0.366 ± 0.168
2.562PheAsp: 2.562 ± 0.383
1.464PheGlu: 1.464 ± 0.365
1.037PhePhe: 1.037 ± 0.256
4.026PheGly: 4.026 ± 0.685
0.793PheHis: 0.793 ± 0.206
0.793PheIle: 0.793 ± 0.226
0.61PheLys: 0.61 ± 0.182
1.769PheLeu: 1.769 ± 0.277
0.305PheMet: 0.305 ± 0.134
0.854PheAsn: 0.854 ± 0.205
1.952PhePro: 1.952 ± 0.308
0.549PheGln: 0.549 ± 0.216
1.891PheArg: 1.891 ± 0.381
1.159PheSer: 1.159 ± 0.291
2.318PheThr: 2.318 ± 0.336
1.83PheVal: 1.83 ± 0.324
0.671PheTrp: 0.671 ± 0.179
0.427PheTyr: 0.427 ± 0.132
0.0PheXaa: 0.0 ± 0.0
Gly
9.455GlyAla: 9.455 ± 1.025
1.159GlyCys: 1.159 ± 0.312
7.259GlyAsp: 7.259 ± 0.689
5.002GlyGlu: 5.002 ± 0.558
2.745GlyPhe: 2.745 ± 0.484
10.492GlyGly: 10.492 ± 1.853
1.647GlyHis: 1.647 ± 0.39
4.819GlyIle: 4.819 ± 0.512
2.806GlyLys: 2.806 ± 0.462
6.283GlyLeu: 6.283 ± 0.741
1.647GlyMet: 1.647 ± 0.296
3.233GlyAsn: 3.233 ± 0.518
5.063GlyPro: 5.063 ± 0.714
3.233GlyGln: 3.233 ± 0.515
6.588GlyArg: 6.588 ± 0.654
6.222GlySer: 6.222 ± 0.84
4.514GlyThr: 4.514 ± 0.606
6.161GlyVal: 6.161 ± 0.777
2.013GlyTrp: 2.013 ± 0.335
2.257GlyTyr: 2.257 ± 0.366
0.0GlyXaa: 0.0 ± 0.0
His
1.281HisAla: 1.281 ± 0.318
0.305HisCys: 0.305 ± 0.119
1.464HisAsp: 1.464 ± 0.277
0.671HisGlu: 0.671 ± 0.215
0.427HisPhe: 0.427 ± 0.171
1.586HisGly: 1.586 ± 0.367
0.427HisHis: 0.427 ± 0.149
1.159HisIle: 1.159 ± 0.286
0.244HisLys: 0.244 ± 0.11
1.403HisLeu: 1.403 ± 0.327
0.122HisMet: 0.122 ± 0.074
0.549HisAsn: 0.549 ± 0.213
1.708HisPro: 1.708 ± 0.316
0.305HisGln: 0.305 ± 0.131
1.525HisArg: 1.525 ± 0.362
1.098HisSer: 1.098 ± 0.35
0.976HisThr: 0.976 ± 0.218
1.586HisVal: 1.586 ± 0.277
0.61HisTrp: 0.61 ± 0.162
0.671HisTyr: 0.671 ± 0.217
0.0HisXaa: 0.0 ± 0.0
Ile
7.93IleAla: 7.93 ± 0.65
0.732IleCys: 0.732 ± 0.262
4.514IleAsp: 4.514 ± 0.512
3.721IleGlu: 3.721 ± 0.499
0.671IlePhe: 0.671 ± 0.172
4.026IleGly: 4.026 ± 0.572
0.549IleHis: 0.549 ± 0.183
1.83IleIle: 1.83 ± 0.308
1.037IleLys: 1.037 ± 0.316
3.355IleLeu: 3.355 ± 0.471
0.366IleMet: 0.366 ± 0.17
1.159IleAsn: 1.159 ± 0.231
3.05IlePro: 3.05 ± 0.515
1.464IleGln: 1.464 ± 0.298
3.782IleArg: 3.782 ± 0.553
3.05IleSer: 3.05 ± 0.434
2.318IleThr: 2.318 ± 0.391
4.087IleVal: 4.087 ± 0.668
0.305IleTrp: 0.305 ± 0.137
1.464IleTyr: 1.464 ± 0.267
0.0IleXaa: 0.0 ± 0.0
Lys
2.928LysAla: 2.928 ± 0.425
0.549LysCys: 0.549 ± 0.178
2.318LysAsp: 2.318 ± 0.438
1.098LysGlu: 1.098 ± 0.243
1.281LysPhe: 1.281 ± 0.268
1.525LysGly: 1.525 ± 0.307
0.427LysHis: 0.427 ± 0.185
1.647LysIle: 1.647 ± 0.284
0.854LysLys: 0.854 ± 0.226
2.562LysLeu: 2.562 ± 0.419
0.732LysMet: 0.732 ± 0.185
1.342LysAsn: 1.342 ± 0.277
2.135LysPro: 2.135 ± 0.404
2.013LysGln: 2.013 ± 0.368
2.867LysArg: 2.867 ± 0.423
1.342LysSer: 1.342 ± 0.246
1.525LysThr: 1.525 ± 0.29
1.891LysVal: 1.891 ± 0.349
0.732LysTrp: 0.732 ± 0.225
0.976LysTyr: 0.976 ± 0.297
0.0LysXaa: 0.0 ± 0.0
Leu
10.797LeuAla: 10.797 ± 0.669
0.61LeuCys: 0.61 ± 0.2
6.649LeuAsp: 6.649 ± 0.759
3.172LeuGlu: 3.172 ± 0.419
2.074LeuPhe: 2.074 ± 0.363
7.381LeuGly: 7.381 ± 0.692
1.037LeuHis: 1.037 ± 0.223
3.66LeuIle: 3.66 ± 0.518
2.501LeuLys: 2.501 ± 0.361
5.917LeuLeu: 5.917 ± 0.727
1.403LeuMet: 1.403 ± 0.314
2.257LeuAsn: 2.257 ± 0.341
5.612LeuPro: 5.612 ± 0.59
1.464LeuGln: 1.464 ± 0.298
5.795LeuArg: 5.795 ± 0.611
3.721LeuSer: 3.721 ± 0.464
5.185LeuThr: 5.185 ± 0.649
4.575LeuVal: 4.575 ± 0.533
1.525LeuTrp: 1.525 ± 0.359
1.586LeuTyr: 1.586 ± 0.276
0.0LeuXaa: 0.0 ± 0.0
Met
2.989MetAla: 2.989 ± 0.363
0.183MetCys: 0.183 ± 0.116
0.732MetAsp: 0.732 ± 0.226
0.549MetGlu: 0.549 ± 0.209
0.854MetPhe: 0.854 ± 0.247
1.464MetGly: 1.464 ± 0.284
0.244MetHis: 0.244 ± 0.115
1.037MetIle: 1.037 ± 0.261
1.098MetLys: 1.098 ± 0.247
0.671MetLeu: 0.671 ± 0.162
0.244MetMet: 0.244 ± 0.123
0.915MetAsn: 0.915 ± 0.228
2.013MetPro: 2.013 ± 0.358
0.305MetGln: 0.305 ± 0.119
0.915MetArg: 0.915 ± 0.244
1.891MetSer: 1.891 ± 0.352
1.952MetThr: 1.952 ± 0.327
0.854MetVal: 0.854 ± 0.313
0.244MetTrp: 0.244 ± 0.111
0.244MetTyr: 0.244 ± 0.119
0.0MetXaa: 0.0 ± 0.0
Asn
3.294AsnAla: 3.294 ± 0.418
0.488AsnCys: 0.488 ± 0.186
1.159AsnAsp: 1.159 ± 0.295
1.647AsnGlu: 1.647 ± 0.279
0.61AsnPhe: 0.61 ± 0.184
3.904AsnGly: 3.904 ± 0.593
0.671AsnHis: 0.671 ± 0.205
1.098AsnIle: 1.098 ± 0.302
0.549AsnLys: 0.549 ± 0.183
1.891AsnLeu: 1.891 ± 0.26
0.61AsnMet: 0.61 ± 0.194
0.549AsnAsn: 0.549 ± 0.186
3.904AsnPro: 3.904 ± 0.624
1.098AsnGln: 1.098 ± 0.239
2.074AsnArg: 2.074 ± 0.339
1.22AsnSer: 1.22 ± 0.348
1.281AsnThr: 1.281 ± 0.284
1.952AsnVal: 1.952 ± 0.326
0.61AsnTrp: 0.61 ± 0.176
0.732AsnTyr: 0.732 ± 0.203
0.0AsnXaa: 0.0 ± 0.0
Pro
11.468ProAla: 11.468 ± 1.332
0.488ProCys: 0.488 ± 0.145
6.222ProAsp: 6.222 ± 0.698
3.355ProGlu: 3.355 ± 0.482
2.135ProPhe: 2.135 ± 0.37
6.527ProGly: 6.527 ± 0.812
1.037ProHis: 1.037 ± 0.273
3.172ProIle: 3.172 ± 0.546
2.501ProLys: 2.501 ± 0.466
4.209ProLeu: 4.209 ± 0.642
1.586ProMet: 1.586 ± 0.409
1.708ProAsn: 1.708 ± 0.291
3.416ProPro: 3.416 ± 0.452
2.196ProGln: 2.196 ± 0.339
4.026ProArg: 4.026 ± 0.534
3.172ProSer: 3.172 ± 0.346
3.66ProThr: 3.66 ± 0.481
4.087ProVal: 4.087 ± 0.58
1.281ProTrp: 1.281 ± 0.378
1.037ProTyr: 1.037 ± 0.256
0.0ProXaa: 0.0 ± 0.0
Gln
3.416GlnAla: 3.416 ± 0.559
0.305GlnCys: 0.305 ± 0.133
1.159GlnAsp: 1.159 ± 0.461
1.098GlnGlu: 1.098 ± 0.304
1.342GlnPhe: 1.342 ± 0.303
1.891GlnGly: 1.891 ± 0.376
1.159GlnHis: 1.159 ± 0.276
2.501GlnIle: 2.501 ± 0.384
0.793GlnLys: 0.793 ± 0.236
3.477GlnLeu: 3.477 ± 0.403
0.854GlnMet: 0.854 ± 0.234
0.793GlnAsn: 0.793 ± 0.287
2.745GlnPro: 2.745 ± 0.548
2.013GlnGln: 2.013 ± 0.443
2.989GlnArg: 2.989 ± 0.409
2.013GlnSer: 2.013 ± 0.358
2.44GlnThr: 2.44 ± 0.419
2.135GlnVal: 2.135 ± 0.339
0.854GlnTrp: 0.854 ± 0.192
0.732GlnTyr: 0.732 ± 0.207
0.0GlnXaa: 0.0 ± 0.0
Arg
8.479ArgAla: 8.479 ± 0.871
0.976ArgCys: 0.976 ± 0.301
3.416ArgAsp: 3.416 ± 0.398
4.758ArgGlu: 4.758 ± 0.607
2.318ArgPhe: 2.318 ± 0.396
5.368ArgGly: 5.368 ± 0.648
1.586ArgHis: 1.586 ± 0.377
4.758ArgIle: 4.758 ± 0.619
3.355ArgLys: 3.355 ± 0.603
5.429ArgLeu: 5.429 ± 0.508
2.074ArgMet: 2.074 ± 0.39
2.989ArgAsn: 2.989 ± 0.436
4.88ArgPro: 4.88 ± 0.589
3.416ArgGln: 3.416 ± 0.482
7.93ArgArg: 7.93 ± 0.886
3.538ArgSer: 3.538 ± 0.419
2.623ArgThr: 2.623 ± 0.376
3.294ArgVal: 3.294 ± 0.574
1.586ArgTrp: 1.586 ± 0.378
1.769ArgTyr: 1.769 ± 0.39
0.0ArgXaa: 0.0 ± 0.0
Ser
7.503SerAla: 7.503 ± 0.782
0.366SerCys: 0.366 ± 0.177
3.355SerAsp: 3.355 ± 0.503
2.379SerGlu: 2.379 ± 0.345
1.342SerPhe: 1.342 ± 0.292
6.1SerGly: 6.1 ± 0.706
0.915SerHis: 0.915 ± 0.264
2.257SerIle: 2.257 ± 0.349
1.769SerLys: 1.769 ± 0.403
3.904SerLeu: 3.904 ± 0.466
1.22SerMet: 1.22 ± 0.283
1.586SerAsn: 1.586 ± 0.314
3.599SerPro: 3.599 ± 0.522
1.464SerGln: 1.464 ± 0.297
4.27SerArg: 4.27 ± 0.537
4.209SerSer: 4.209 ± 0.567
3.172SerThr: 3.172 ± 0.369
2.867SerVal: 2.867 ± 0.415
0.915SerTrp: 0.915 ± 0.259
0.976SerTyr: 0.976 ± 0.301
0.0SerXaa: 0.0 ± 0.0
Thr
6.283ThrAla: 6.283 ± 0.611
0.549ThrCys: 0.549 ± 0.17
3.782ThrAsp: 3.782 ± 0.367
2.074ThrGlu: 2.074 ± 0.369
1.647ThrPhe: 1.647 ± 0.311
5.734ThrGly: 5.734 ± 0.831
1.342ThrHis: 1.342 ± 0.296
3.599ThrIle: 3.599 ± 0.566
2.196ThrLys: 2.196 ± 0.442
5.429ThrLeu: 5.429 ± 0.621
0.671ThrMet: 0.671 ± 0.195
1.525ThrAsn: 1.525 ± 0.35
3.599ThrPro: 3.599 ± 0.431
1.403ThrGln: 1.403 ± 0.317
2.867ThrArg: 2.867 ± 0.466
2.684ThrSer: 2.684 ± 0.441
4.819ThrThr: 4.819 ± 0.617
5.063ThrVal: 5.063 ± 0.638
0.976ThrTrp: 0.976 ± 0.233
1.098ThrTyr: 1.098 ± 0.27
0.0ThrXaa: 0.0 ± 0.0
Val
7.381ValAla: 7.381 ± 0.723
1.098ValCys: 1.098 ± 0.29
5.551ValAsp: 5.551 ± 0.7
4.148ValGlu: 4.148 ± 0.572
2.013ValPhe: 2.013 ± 0.33
4.575ValGly: 4.575 ± 0.526
0.854ValHis: 0.854 ± 0.222
3.416ValIle: 3.416 ± 0.505
1.83ValLys: 1.83 ± 0.385
5.307ValLeu: 5.307 ± 0.665
1.037ValMet: 1.037 ± 0.279
1.83ValAsn: 1.83 ± 0.338
3.599ValPro: 3.599 ± 0.529
2.745ValGln: 2.745 ± 0.446
4.148ValArg: 4.148 ± 0.379
3.538ValSer: 3.538 ± 0.401
4.453ValThr: 4.453 ± 0.485
4.331ValVal: 4.331 ± 0.549
1.464ValTrp: 1.464 ± 0.305
1.525ValTyr: 1.525 ± 0.3
0.0ValXaa: 0.0 ± 0.0
Trp
0.915TrpAla: 0.915 ± 0.198
0.488TrpCys: 0.488 ± 0.195
1.342TrpAsp: 1.342 ± 0.268
0.793TrpGlu: 0.793 ± 0.213
0.732TrpPhe: 0.732 ± 0.303
1.037TrpGly: 1.037 ± 0.269
0.366TrpHis: 0.366 ± 0.141
1.403TrpIle: 1.403 ± 0.329
0.915TrpLys: 0.915 ± 0.238
2.318TrpLeu: 2.318 ± 0.391
0.366TrpMet: 0.366 ± 0.136
0.488TrpAsn: 0.488 ± 0.162
1.525TrpPro: 1.525 ± 0.319
0.854TrpGln: 0.854 ± 0.268
1.281TrpArg: 1.281 ± 0.292
1.342TrpSer: 1.342 ± 0.297
1.342TrpThr: 1.342 ± 0.258
0.793TrpVal: 0.793 ± 0.207
0.549TrpTrp: 0.549 ± 0.206
0.671TrpTyr: 0.671 ± 0.214
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.562TyrAla: 2.562 ± 0.581
0.427TyrCys: 0.427 ± 0.161
1.586TyrAsp: 1.586 ± 0.292
1.098TyrGlu: 1.098 ± 0.286
0.427TyrPhe: 0.427 ± 0.175
1.708TyrGly: 1.708 ± 0.302
0.671TyrHis: 0.671 ± 0.27
1.037TyrIle: 1.037 ± 0.316
0.366TyrLys: 0.366 ± 0.125
1.22TyrLeu: 1.22 ± 0.316
0.488TyrMet: 0.488 ± 0.152
0.671TyrAsn: 0.671 ± 0.194
0.976TyrPro: 0.976 ± 0.263
0.793TyrGln: 0.793 ± 0.199
1.83TyrArg: 1.83 ± 0.387
1.342TyrSer: 1.342 ± 0.281
1.342TyrThr: 1.342 ± 0.305
2.196TyrVal: 2.196 ± 0.435
0.366TyrTrp: 0.366 ± 0.152
0.427TyrTyr: 0.427 ± 0.172
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 86 proteins (16394 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski