Amino acid dipepetide frequency for Mycobacterium phage Kenuha5

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
14.856AlaAla: 14.856 ± 2.071
0.935AlaCys: 0.935 ± 0.224
7.868AlaAsp: 7.868 ± 0.624
6.823AlaGlu: 6.823 ± 0.923
2.641AlaPhe: 2.641 ± 0.35
10.565AlaGly: 10.565 ± 1.408
2.916AlaHis: 2.916 ± 0.447
4.347AlaIle: 4.347 ± 0.633
3.742AlaLys: 3.742 ± 0.512
8.088AlaLeu: 8.088 ± 0.735
2.311AlaMet: 2.311 ± 0.392
2.806AlaAsn: 2.806 ± 0.44
4.457AlaPro: 4.457 ± 0.663
3.852AlaGln: 3.852 ± 0.506
7.538AlaArg: 7.538 ± 0.779
5.282AlaSer: 5.282 ± 0.554
5.943AlaThr: 5.943 ± 0.59
7.483AlaVal: 7.483 ± 0.672
2.366AlaTrp: 2.366 ± 0.394
2.476AlaTyr: 2.476 ± 0.367
0.0AlaXaa: 0.0 ± 0.0
Cys
0.935CysAla: 0.935 ± 0.241
0.0CysCys: 0.0 ± 0.0
1.376CysAsp: 1.376 ± 0.3
0.88CysGlu: 0.88 ± 0.295
0.33CysPhe: 0.33 ± 0.154
1.596CysGly: 1.596 ± 0.368
0.275CysHis: 0.275 ± 0.137
0.165CysIle: 0.165 ± 0.118
0.385CysLys: 0.385 ± 0.145
0.44CysLeu: 0.44 ± 0.192
0.275CysMet: 0.275 ± 0.131
0.33CysAsn: 0.33 ± 0.145
0.99CysPro: 0.99 ± 0.239
0.22CysGln: 0.22 ± 0.116
0.99CysArg: 0.99 ± 0.314
0.495CysSer: 0.495 ± 0.178
0.77CysThr: 0.77 ± 0.182
0.66CysVal: 0.66 ± 0.19
0.22CysTrp: 0.22 ± 0.104
0.165CysTyr: 0.165 ± 0.077
0.0CysXaa: 0.0 ± 0.0
Asp
6.768AspAla: 6.768 ± 0.553
1.155AspCys: 1.155 ± 0.275
4.567AspAsp: 4.567 ± 0.534
3.742AspGlu: 3.742 ± 0.442
1.816AspPhe: 1.816 ± 0.249
6.988AspGly: 6.988 ± 0.648
1.321AspHis: 1.321 ± 0.244
2.641AspIle: 2.641 ± 0.341
1.431AspLys: 1.431 ± 0.24
6.273AspLeu: 6.273 ± 0.649
0.99AspMet: 0.99 ± 0.258
1.376AspAsn: 1.376 ± 0.267
3.907AspPro: 3.907 ± 0.514
2.421AspGln: 2.421 ± 0.407
4.842AspArg: 4.842 ± 0.532
3.136AspSer: 3.136 ± 0.523
4.512AspThr: 4.512 ± 0.414
4.292AspVal: 4.292 ± 0.521
1.706AspTrp: 1.706 ± 0.24
1.761AspTyr: 1.761 ± 0.328
0.0AspXaa: 0.0 ± 0.0
Glu
6.273GluAla: 6.273 ± 0.77
0.825GluCys: 0.825 ± 0.216
2.971GluAsp: 2.971 ± 0.382
3.136GluGlu: 3.136 ± 0.505
2.036GluPhe: 2.036 ± 0.376
3.246GluGly: 3.246 ± 0.394
1.651GluHis: 1.651 ± 0.332
1.816GluIle: 1.816 ± 0.321
1.651GluLys: 1.651 ± 0.281
5.117GluLeu: 5.117 ± 0.627
1.761GluMet: 1.761 ± 0.291
1.486GluAsn: 1.486 ± 0.236
3.356GluPro: 3.356 ± 0.504
3.081GluGln: 3.081 ± 0.417
4.897GluArg: 4.897 ± 0.688
2.531GluSer: 2.531 ± 0.47
4.512GluThr: 4.512 ± 0.59
4.402GluVal: 4.402 ± 0.547
1.486GluTrp: 1.486 ± 0.271
1.981GluTyr: 1.981 ± 0.382
0.0GluXaa: 0.0 ± 0.0
Phe
2.916PheAla: 2.916 ± 0.372
0.275PheCys: 0.275 ± 0.119
2.641PheAsp: 2.641 ± 0.573
1.651PheGlu: 1.651 ± 0.389
0.715PhePhe: 0.715 ± 0.203
2.696PheGly: 2.696 ± 0.452
0.605PheHis: 0.605 ± 0.207
1.871PheIle: 1.871 ± 0.421
0.825PheLys: 0.825 ± 0.232
1.486PheLeu: 1.486 ± 0.245
0.77PheMet: 0.77 ± 0.248
0.935PheAsn: 0.935 ± 0.295
1.541PhePro: 1.541 ± 0.338
0.77PheGln: 0.77 ± 0.284
1.376PheArg: 1.376 ± 0.269
1.706PheSer: 1.706 ± 0.327
2.531PheThr: 2.531 ± 0.37
1.761PheVal: 1.761 ± 0.304
0.605PheTrp: 0.605 ± 0.197
0.825PheTyr: 0.825 ± 0.237
0.0PheXaa: 0.0 ± 0.0
Gly
8.749GlyAla: 8.749 ± 1.46
0.825GlyCys: 0.825 ± 0.189
5.392GlyAsp: 5.392 ± 0.614
4.127GlyGlu: 4.127 ± 0.593
2.366GlyPhe: 2.366 ± 0.373
10.675GlyGly: 10.675 ± 2.296
1.871GlyHis: 1.871 ± 0.352
4.072GlyIle: 4.072 ± 0.454
2.586GlyLys: 2.586 ± 0.348
5.612GlyLeu: 5.612 ± 0.624
1.981GlyMet: 1.981 ± 0.386
3.081GlyAsn: 3.081 ± 0.431
4.567GlyPro: 4.567 ± 0.675
3.411GlyGln: 3.411 ± 0.566
5.557GlyArg: 5.557 ± 0.531
5.833GlySer: 5.833 ± 0.899
5.943GlyThr: 5.943 ± 0.71
6.328GlyVal: 6.328 ± 0.621
2.421GlyTrp: 2.421 ± 0.401
2.476GlyTyr: 2.476 ± 0.513
0.0GlyXaa: 0.0 ± 0.0
His
1.926HisAla: 1.926 ± 0.327
0.275HisCys: 0.275 ± 0.136
1.211HisAsp: 1.211 ± 0.303
1.266HisGlu: 1.266 ± 0.29
0.385HisPhe: 0.385 ± 0.134
1.706HisGly: 1.706 ± 0.278
0.605HisHis: 0.605 ± 0.215
1.211HisIle: 1.211 ± 0.255
0.77HisLys: 0.77 ± 0.231
1.431HisLeu: 1.431 ± 0.312
0.66HisMet: 0.66 ± 0.162
0.66HisAsn: 0.66 ± 0.191
1.541HisPro: 1.541 ± 0.251
0.99HisGln: 0.99 ± 0.19
2.586HisArg: 2.586 ± 0.4
1.155HisSer: 1.155 ± 0.284
1.596HisThr: 1.596 ± 0.412
1.761HisVal: 1.761 ± 0.363
0.495HisTrp: 0.495 ± 0.149
1.045HisTyr: 1.045 ± 0.204
0.0HisXaa: 0.0 ± 0.0
Ile
5.227IleAla: 5.227 ± 0.555
0.55IleCys: 0.55 ± 0.196
3.962IleAsp: 3.962 ± 0.436
3.522IleGlu: 3.522 ± 0.36
1.045IlePhe: 1.045 ± 0.26
3.797IleGly: 3.797 ± 0.481
1.376IleHis: 1.376 ± 0.315
1.761IleIle: 1.761 ± 0.28
1.321IleLys: 1.321 ± 0.281
2.366IleLeu: 2.366 ± 0.335
0.495IleMet: 0.495 ± 0.149
2.146IleAsn: 2.146 ± 0.368
2.971IlePro: 2.971 ± 0.392
1.155IleGln: 1.155 ± 0.236
2.476IleArg: 2.476 ± 0.443
1.816IleSer: 1.816 ± 0.361
3.301IleThr: 3.301 ± 0.516
2.916IleVal: 2.916 ± 0.369
0.99IleTrp: 0.99 ± 0.262
0.88IleTyr: 0.88 ± 0.238
0.0IleXaa: 0.0 ± 0.0
Lys
3.742LysAla: 3.742 ± 0.464
0.55LysCys: 0.55 ± 0.182
1.761LysAsp: 1.761 ± 0.295
1.211LysGlu: 1.211 ± 0.253
1.045LysPhe: 1.045 ± 0.197
2.366LysGly: 2.366 ± 0.292
0.99LysHis: 0.99 ± 0.293
1.155LysIle: 1.155 ± 0.227
1.211LysLys: 1.211 ± 0.298
2.586LysLeu: 2.586 ± 0.522
0.715LysMet: 0.715 ± 0.163
0.88LysAsn: 0.88 ± 0.217
2.201LysPro: 2.201 ± 0.336
1.266LysGln: 1.266 ± 0.333
2.256LysArg: 2.256 ± 0.312
1.981LysSer: 1.981 ± 0.3
2.036LysThr: 2.036 ± 0.355
2.531LysVal: 2.531 ± 0.397
0.825LysTrp: 0.825 ± 0.191
1.1LysTyr: 1.1 ± 0.315
0.0LysXaa: 0.0 ± 0.0
Leu
8.309LeuAla: 8.309 ± 0.874
0.825LeuCys: 0.825 ± 0.256
4.237LeuAsp: 4.237 ± 0.496
3.907LeuGlu: 3.907 ± 0.625
2.311LeuPhe: 2.311 ± 0.339
5.667LeuGly: 5.667 ± 0.719
0.99LeuHis: 0.99 ± 0.236
3.687LeuIle: 3.687 ± 0.465
1.871LeuLys: 1.871 ± 0.353
5.833LeuLeu: 5.833 ± 0.674
1.596LeuMet: 1.596 ± 0.321
2.861LeuAsn: 2.861 ± 0.414
4.897LeuPro: 4.897 ± 0.595
2.751LeuGln: 2.751 ± 0.369
5.447LeuArg: 5.447 ± 0.666
5.117LeuSer: 5.117 ± 0.522
6.328LeuThr: 6.328 ± 0.678
4.457LeuVal: 4.457 ± 0.49
1.266LeuTrp: 1.266 ± 0.241
1.706LeuTyr: 1.706 ± 0.368
0.0LeuXaa: 0.0 ± 0.0
Met
2.476MetAla: 2.476 ± 0.382
0.165MetCys: 0.165 ± 0.095
1.431MetAsp: 1.431 ± 0.235
1.045MetGlu: 1.045 ± 0.214
0.66MetPhe: 0.66 ± 0.206
1.431MetGly: 1.431 ± 0.268
0.385MetHis: 0.385 ± 0.152
0.99MetIle: 0.99 ± 0.228
0.77MetLys: 0.77 ± 0.236
1.541MetLeu: 1.541 ± 0.255
0.66MetMet: 0.66 ± 0.241
0.825MetAsn: 0.825 ± 0.2
1.321MetPro: 1.321 ± 0.293
0.55MetGln: 0.55 ± 0.14
1.431MetArg: 1.431 ± 0.279
2.751MetSer: 2.751 ± 0.366
2.311MetThr: 2.311 ± 0.354
1.376MetVal: 1.376 ± 0.324
0.22MetTrp: 0.22 ± 0.093
0.495MetTyr: 0.495 ± 0.202
0.0MetXaa: 0.0 ± 0.0
Asn
3.136AsnAla: 3.136 ± 0.36
0.11AsnCys: 0.11 ± 0.079
1.706AsnAsp: 1.706 ± 0.301
1.596AsnGlu: 1.596 ± 0.297
0.715AsnPhe: 0.715 ± 0.203
4.017AsnGly: 4.017 ± 0.45
0.77AsnHis: 0.77 ± 0.166
1.266AsnIle: 1.266 ± 0.305
0.88AsnLys: 0.88 ± 0.195
3.026AsnLeu: 3.026 ± 0.406
0.66AsnMet: 0.66 ± 0.188
1.871AsnAsn: 1.871 ± 0.388
2.641AsnPro: 2.641 ± 0.371
0.99AsnGln: 0.99 ± 0.337
2.311AsnArg: 2.311 ± 0.363
1.431AsnSer: 1.431 ± 0.31
2.256AsnThr: 2.256 ± 0.336
1.926AsnVal: 1.926 ± 0.34
0.605AsnTrp: 0.605 ± 0.197
0.66AsnTyr: 0.66 ± 0.179
0.0AsnXaa: 0.0 ± 0.0
Pro
5.777ProAla: 5.777 ± 0.64
0.77ProCys: 0.77 ± 0.282
3.962ProAsp: 3.962 ± 0.487
3.687ProGlu: 3.687 ± 0.374
1.926ProPhe: 1.926 ± 0.332
6.548ProGly: 6.548 ± 0.72
1.981ProHis: 1.981 ± 0.338
2.146ProIle: 2.146 ± 0.351
2.146ProLys: 2.146 ± 0.351
3.962ProLeu: 3.962 ± 0.498
1.981ProMet: 1.981 ± 0.379
2.036ProAsn: 2.036 ± 0.285
3.962ProPro: 3.962 ± 0.702
1.981ProGln: 1.981 ± 0.29
3.301ProArg: 3.301 ± 0.568
2.751ProSer: 2.751 ± 0.391
3.742ProThr: 3.742 ± 0.591
4.512ProVal: 4.512 ± 0.558
0.99ProTrp: 0.99 ± 0.201
1.321ProTyr: 1.321 ± 0.27
0.0ProXaa: 0.0 ± 0.0
Gln
4.237GlnAla: 4.237 ± 0.538
0.33GlnCys: 0.33 ± 0.175
1.596GlnAsp: 1.596 ± 0.3
1.486GlnGlu: 1.486 ± 0.323
1.211GlnPhe: 1.211 ± 0.207
2.256GlnGly: 2.256 ± 0.388
0.77GlnHis: 0.77 ± 0.205
1.871GlnIle: 1.871 ± 0.304
1.431GlnLys: 1.431 ± 0.261
3.356GlnLeu: 3.356 ± 0.436
0.55GlnMet: 0.55 ± 0.198
0.77GlnAsn: 0.77 ± 0.211
2.311GlnPro: 2.311 ± 0.619
1.321GlnGln: 1.321 ± 0.274
3.081GlnArg: 3.081 ± 0.431
2.146GlnSer: 2.146 ± 0.326
1.486GlnThr: 1.486 ± 0.292
2.861GlnVal: 2.861 ± 0.332
0.88GlnTrp: 0.88 ± 0.204
0.77GlnTyr: 0.77 ± 0.275
0.0GlnXaa: 0.0 ± 0.0
Arg
7.098ArgAla: 7.098 ± 0.84
1.266ArgCys: 1.266 ± 0.38
4.732ArgAsp: 4.732 ± 0.53
4.567ArgGlu: 4.567 ± 0.703
2.421ArgPhe: 2.421 ± 0.412
4.072ArgGly: 4.072 ± 0.408
1.981ArgHis: 1.981 ± 0.344
3.962ArgIle: 3.962 ± 0.474
3.081ArgLys: 3.081 ± 0.415
5.282ArgLeu: 5.282 ± 0.663
2.256ArgMet: 2.256 ± 0.364
2.201ArgAsn: 2.201 ± 0.399
2.806ArgPro: 2.806 ± 0.454
2.201ArgGln: 2.201 ± 0.333
6.273ArgArg: 6.273 ± 0.88
4.402ArgSer: 4.402 ± 0.402
3.687ArgThr: 3.687 ± 0.526
5.062ArgVal: 5.062 ± 0.654
2.201ArgTrp: 2.201 ± 0.367
1.706ArgTyr: 1.706 ± 0.326
0.0ArgXaa: 0.0 ± 0.0
Ser
5.888SerAla: 5.888 ± 0.871
0.33SerCys: 0.33 ± 0.136
3.852SerAsp: 3.852 ± 0.471
3.632SerGlu: 3.632 ± 0.5
1.981SerPhe: 1.981 ± 0.319
6.218SerGly: 6.218 ± 0.885
0.77SerHis: 0.77 ± 0.19
2.421SerIle: 2.421 ± 0.335
2.091SerLys: 2.091 ± 0.39
3.907SerLeu: 3.907 ± 0.458
1.596SerMet: 1.596 ± 0.349
2.201SerAsn: 2.201 ± 0.364
3.466SerPro: 3.466 ± 0.462
1.706SerGln: 1.706 ± 0.264
3.632SerArg: 3.632 ± 0.533
3.742SerSer: 3.742 ± 0.6
3.742SerThr: 3.742 ± 0.484
4.127SerVal: 4.127 ± 0.575
1.321SerTrp: 1.321 ± 0.263
1.155SerTyr: 1.155 ± 0.231
0.0SerXaa: 0.0 ± 0.0
Thr
7.153ThrAla: 7.153 ± 0.711
0.77ThrCys: 0.77 ± 0.216
4.457ThrAsp: 4.457 ± 0.644
4.017ThrGlu: 4.017 ± 0.367
1.431ThrPhe: 1.431 ± 0.324
6.548ThrGly: 6.548 ± 0.854
1.541ThrHis: 1.541 ± 0.297
3.632ThrIle: 3.632 ± 0.496
2.311ThrLys: 2.311 ± 0.362
4.457ThrLeu: 4.457 ± 0.534
1.045ThrMet: 1.045 ± 0.232
2.421ThrAsn: 2.421 ± 0.345
5.777ThrPro: 5.777 ± 0.715
1.761ThrGln: 1.761 ± 0.3
3.852ThrArg: 3.852 ± 0.411
3.687ThrSer: 3.687 ± 0.385
4.952ThrThr: 4.952 ± 0.574
6.108ThrVal: 6.108 ± 0.68
1.155ThrTrp: 1.155 ± 0.319
2.091ThrTyr: 2.091 ± 0.348
0.0ThrXaa: 0.0 ± 0.0
Val
7.263ValAla: 7.263 ± 0.673
0.88ValCys: 0.88 ± 0.209
5.117ValAsp: 5.117 ± 0.536
5.337ValGlu: 5.337 ± 0.565
2.146ValPhe: 2.146 ± 0.311
4.842ValGly: 4.842 ± 0.508
1.045ValHis: 1.045 ± 0.235
3.356ValIle: 3.356 ± 0.469
2.256ValLys: 2.256 ± 0.363
5.007ValLeu: 5.007 ± 0.567
1.376ValMet: 1.376 ± 0.27
2.036ValAsn: 2.036 ± 0.322
3.852ValPro: 3.852 ± 0.391
2.366ValGln: 2.366 ± 0.37
5.392ValArg: 5.392 ± 0.693
5.447ValSer: 5.447 ± 0.759
5.502ValThr: 5.502 ± 0.567
5.447ValVal: 5.447 ± 0.673
1.926ValTrp: 1.926 ± 0.337
1.155ValTyr: 1.155 ± 0.275
0.0ValXaa: 0.0 ± 0.0
Trp
2.311TrpAla: 2.311 ± 0.326
0.22TrpCys: 0.22 ± 0.107
1.155TrpAsp: 1.155 ± 0.257
0.935TrpGlu: 0.935 ± 0.297
0.605TrpPhe: 0.605 ± 0.174
1.045TrpGly: 1.045 ± 0.238
0.825TrpHis: 0.825 ± 0.203
0.825TrpIle: 0.825 ± 0.221
0.935TrpLys: 0.935 ± 0.244
1.871TrpLeu: 1.871 ± 0.324
0.935TrpMet: 0.935 ± 0.23
0.495TrpAsn: 0.495 ± 0.162
1.376TrpPro: 1.376 ± 0.297
1.1TrpGln: 1.1 ± 0.238
2.036TrpArg: 2.036 ± 0.412
1.376TrpSer: 1.376 ± 0.278
1.871TrpThr: 1.871 ± 0.314
1.651TrpVal: 1.651 ± 0.333
0.66TrpTrp: 0.66 ± 0.202
0.495TrpTyr: 0.495 ± 0.151
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.256TyrAla: 2.256 ± 0.421
0.44TyrCys: 0.44 ± 0.158
1.651TyrAsp: 1.651 ± 0.354
1.651TyrGlu: 1.651 ± 0.296
0.715TyrPhe: 0.715 ± 0.204
1.596TyrGly: 1.596 ± 0.312
0.605TyrHis: 0.605 ± 0.171
0.88TyrIle: 0.88 ± 0.234
0.715TyrLys: 0.715 ± 0.207
2.421TyrLeu: 2.421 ± 0.409
0.165TyrMet: 0.165 ± 0.088
1.155TyrAsn: 1.155 ± 0.231
1.541TyrPro: 1.541 ± 0.271
0.825TyrGln: 0.825 ± 0.186
2.036TyrArg: 2.036 ± 0.332
0.99TyrSer: 0.99 ± 0.223
2.146TyrThr: 2.146 ± 0.385
2.091TyrVal: 2.091 ± 0.3
0.385TyrTrp: 0.385 ± 0.157
0.605TyrTyr: 0.605 ± 0.155
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 96 proteins (18175 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski