Amino acid dipepetide frequency for Mycobacterium phage Chuckly

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
14.079AlaAla: 14.079 ± 1.46
1.03AlaCys: 1.03 ± 0.264
7.154AlaAsp: 7.154 ± 0.641
6.696AlaGlu: 6.696 ± 0.662
3.033AlaPhe: 3.033 ± 0.429
10.588AlaGly: 10.588 ± 1.165
2.633AlaHis: 2.633 ± 0.424
5.151AlaIle: 5.151 ± 0.617
4.006AlaLys: 4.006 ± 0.46
7.497AlaLeu: 7.497 ± 0.678
2.575AlaMet: 2.575 ± 0.37
2.461AlaAsn: 2.461 ± 0.347
5.838AlaPro: 5.838 ± 0.699
3.777AlaGln: 3.777 ± 0.575
7.44AlaArg: 7.44 ± 0.811
5.208AlaSer: 5.208 ± 0.58
6.811AlaThr: 6.811 ± 0.616
7.039AlaVal: 7.039 ± 0.717
2.633AlaTrp: 2.633 ± 0.454
2.175AlaTyr: 2.175 ± 0.304
0.0AlaXaa: 0.0 ± 0.0
Cys
0.801CysAla: 0.801 ± 0.234
0.057CysCys: 0.057 ± 0.056
1.259CysAsp: 1.259 ± 0.378
1.03CysGlu: 1.03 ± 0.233
0.114CysPhe: 0.114 ± 0.082
1.66CysGly: 1.66 ± 0.352
0.401CysHis: 0.401 ± 0.162
0.057CysIle: 0.057 ± 0.062
0.458CysLys: 0.458 ± 0.166
0.858CysLeu: 0.858 ± 0.256
0.343CysMet: 0.343 ± 0.146
0.401CysAsn: 0.401 ± 0.155
1.259CysPro: 1.259 ± 0.276
0.343CysGln: 0.343 ± 0.133
0.687CysArg: 0.687 ± 0.213
0.572CysSer: 0.572 ± 0.188
0.858CysThr: 0.858 ± 0.237
0.63CysVal: 0.63 ± 0.189
0.343CysTrp: 0.343 ± 0.128
0.229CysTyr: 0.229 ± 0.111
0.0CysXaa: 0.0 ± 0.0
Asp
6.639AspAla: 6.639 ± 0.561
0.858AspCys: 0.858 ± 0.235
3.949AspAsp: 3.949 ± 0.534
3.548AspGlu: 3.548 ± 0.483
1.774AspPhe: 1.774 ± 0.287
6.696AspGly: 6.696 ± 0.589
1.259AspHis: 1.259 ± 0.225
2.232AspIle: 2.232 ± 0.331
1.717AspLys: 1.717 ± 0.272
5.78AspLeu: 5.78 ± 0.495
0.801AspMet: 0.801 ± 0.237
1.831AspAsn: 1.831 ± 0.371
5.265AspPro: 5.265 ± 0.542
2.175AspGln: 2.175 ± 0.369
5.208AspArg: 5.208 ± 0.738
3.777AspSer: 3.777 ± 0.61
3.834AspThr: 3.834 ± 0.472
4.35AspVal: 4.35 ± 0.581
1.488AspTrp: 1.488 ± 0.276
2.003AspTyr: 2.003 ± 0.334
0.0AspXaa: 0.0 ± 0.0
Glu
7.154GluAla: 7.154 ± 0.761
0.973GluCys: 0.973 ± 0.311
2.862GluAsp: 2.862 ± 0.369
2.747GluGlu: 2.747 ± 0.501
2.06GluPhe: 2.06 ± 0.359
3.319GluGly: 3.319 ± 0.441
1.717GluHis: 1.717 ± 0.36
2.346GluIle: 2.346 ± 0.389
2.003GluLys: 2.003 ± 0.275
5.838GluLeu: 5.838 ± 0.665
1.431GluMet: 1.431 ± 0.291
1.889GluAsn: 1.889 ± 0.315
3.205GluPro: 3.205 ± 0.466
2.461GluGln: 2.461 ± 0.362
5.38GluArg: 5.38 ± 0.627
2.976GluSer: 2.976 ± 0.43
4.292GluThr: 4.292 ± 0.564
4.178GluVal: 4.178 ± 0.488
1.259GluTrp: 1.259 ± 0.251
1.889GluTyr: 1.889 ± 0.339
0.0GluXaa: 0.0 ± 0.0
Phe
3.319PheAla: 3.319 ± 0.44
0.172PheCys: 0.172 ± 0.091
2.175PheAsp: 2.175 ± 0.388
1.831PheGlu: 1.831 ± 0.321
0.916PhePhe: 0.916 ± 0.25
3.262PheGly: 3.262 ± 0.64
0.458PheHis: 0.458 ± 0.16
1.316PheIle: 1.316 ± 0.34
1.03PheLys: 1.03 ± 0.219
1.602PheLeu: 1.602 ± 0.282
0.744PheMet: 0.744 ± 0.187
0.916PheAsn: 0.916 ± 0.252
1.545PhePro: 1.545 ± 0.266
1.259PheGln: 1.259 ± 0.331
1.374PheArg: 1.374 ± 0.249
1.431PheSer: 1.431 ± 0.298
2.06PheThr: 2.06 ± 0.382
2.06PheVal: 2.06 ± 0.281
0.63PheTrp: 0.63 ± 0.148
0.916PheTyr: 0.916 ± 0.257
0.0PheXaa: 0.0 ± 0.0
Gly
9.558GlyAla: 9.558 ± 1.307
1.259GlyCys: 1.259 ± 0.298
6.238GlyAsp: 6.238 ± 0.668
4.464GlyGlu: 4.464 ± 0.567
2.575GlyPhe: 2.575 ± 0.469
10.302GlyGly: 10.302 ± 2.151
1.946GlyHis: 1.946 ± 0.355
4.407GlyIle: 4.407 ± 0.575
2.461GlyLys: 2.461 ± 0.362
5.952GlyLeu: 5.952 ± 0.567
2.461GlyMet: 2.461 ± 0.459
3.205GlyAsn: 3.205 ± 0.435
4.578GlyPro: 4.578 ± 0.549
2.175GlyGln: 2.175 ± 0.585
5.265GlyArg: 5.265 ± 0.59
5.723GlySer: 5.723 ± 0.857
6.238GlyThr: 6.238 ± 0.811
5.666GlyVal: 5.666 ± 0.61
2.289GlyTrp: 2.289 ± 0.365
2.289GlyTyr: 2.289 ± 0.434
0.0GlyXaa: 0.0 ± 0.0
His
2.175HisAla: 2.175 ± 0.393
0.458HisCys: 0.458 ± 0.178
0.973HisAsp: 0.973 ± 0.22
1.488HisGlu: 1.488 ± 0.275
0.401HisPhe: 0.401 ± 0.134
1.66HisGly: 1.66 ± 0.273
1.202HisHis: 1.202 ± 0.303
1.259HisIle: 1.259 ± 0.265
1.202HisLys: 1.202 ± 0.271
1.602HisLeu: 1.602 ± 0.281
0.343HisMet: 0.343 ± 0.13
0.801HisAsn: 0.801 ± 0.21
1.66HisPro: 1.66 ± 0.295
0.801HisGln: 0.801 ± 0.201
2.118HisArg: 2.118 ± 0.415
0.801HisSer: 0.801 ± 0.21
1.316HisThr: 1.316 ± 0.297
1.431HisVal: 1.431 ± 0.282
0.401HisTrp: 0.401 ± 0.142
0.744HisTyr: 0.744 ± 0.203
0.0HisXaa: 0.0 ± 0.0
Ile
4.979IleAla: 4.979 ± 0.554
0.63IleCys: 0.63 ± 0.216
3.949IleAsp: 3.949 ± 0.546
3.262IleGlu: 3.262 ± 0.408
0.801IlePhe: 0.801 ± 0.244
3.72IleGly: 3.72 ± 0.426
1.545IleHis: 1.545 ± 0.354
1.374IleIle: 1.374 ± 0.258
0.973IleLys: 0.973 ± 0.25
2.346IleLeu: 2.346 ± 0.463
0.343IleMet: 0.343 ± 0.143
2.003IleAsn: 2.003 ± 0.333
3.033IlePro: 3.033 ± 0.354
1.545IleGln: 1.545 ± 0.239
2.633IleArg: 2.633 ± 0.439
2.461IleSer: 2.461 ± 0.412
3.262IleThr: 3.262 ± 0.392
2.862IleVal: 2.862 ± 0.38
0.858IleTrp: 0.858 ± 0.23
0.687IleTyr: 0.687 ± 0.176
0.0IleXaa: 0.0 ± 0.0
Lys
3.834LysAla: 3.834 ± 0.466
0.458LysCys: 0.458 ± 0.152
1.66LysAsp: 1.66 ± 0.298
1.545LysGlu: 1.545 ± 0.282
1.202LysPhe: 1.202 ± 0.204
2.518LysGly: 2.518 ± 0.33
0.973LysHis: 0.973 ± 0.218
1.03LysIle: 1.03 ± 0.289
1.374LysLys: 1.374 ± 0.345
2.518LysLeu: 2.518 ± 0.463
0.744LysMet: 0.744 ± 0.179
0.973LysAsn: 0.973 ± 0.226
2.06LysPro: 2.06 ± 0.26
1.946LysGln: 1.946 ± 0.294
2.118LysArg: 2.118 ± 0.321
1.946LysSer: 1.946 ± 0.346
2.346LysThr: 2.346 ± 0.375
2.633LysVal: 2.633 ± 0.43
0.744LysTrp: 0.744 ± 0.22
0.687LysTyr: 0.687 ± 0.256
0.0LysXaa: 0.0 ± 0.0
Leu
7.555LeuAla: 7.555 ± 0.667
0.63LeuCys: 0.63 ± 0.195
4.75LeuAsp: 4.75 ± 0.582
4.292LeuGlu: 4.292 ± 0.524
2.575LeuPhe: 2.575 ± 0.246
5.208LeuGly: 5.208 ± 0.54
1.03LeuHis: 1.03 ± 0.218
3.033LeuIle: 3.033 ± 0.468
2.175LeuLys: 2.175 ± 0.354
4.979LeuLeu: 4.979 ± 0.569
1.545LeuMet: 1.545 ± 0.261
2.69LeuAsn: 2.69 ± 0.422
5.78LeuPro: 5.78 ± 0.688
2.862LeuGln: 2.862 ± 0.411
5.094LeuArg: 5.094 ± 0.68
5.208LeuSer: 5.208 ± 0.514
5.036LeuThr: 5.036 ± 0.535
5.036LeuVal: 5.036 ± 0.532
1.145LeuTrp: 1.145 ± 0.278
2.175LeuTyr: 2.175 ± 0.38
0.0LeuXaa: 0.0 ± 0.0
Met
2.175MetAla: 2.175 ± 0.362
0.343MetCys: 0.343 ± 0.196
1.374MetAsp: 1.374 ± 0.285
1.259MetGlu: 1.259 ± 0.233
0.572MetPhe: 0.572 ± 0.173
1.774MetGly: 1.774 ± 0.294
0.114MetHis: 0.114 ± 0.085
0.801MetIle: 0.801 ± 0.202
0.343MetLys: 0.343 ± 0.132
1.831MetLeu: 1.831 ± 0.32
0.515MetMet: 0.515 ± 0.193
0.916MetAsn: 0.916 ± 0.206
1.374MetPro: 1.374 ± 0.306
0.515MetGln: 0.515 ± 0.16
1.488MetArg: 1.488 ± 0.291
3.033MetSer: 3.033 ± 0.411
1.946MetThr: 1.946 ± 0.302
1.545MetVal: 1.545 ± 0.314
0.515MetTrp: 0.515 ± 0.167
0.286MetTyr: 0.286 ± 0.115
0.0MetXaa: 0.0 ± 0.0
Asn
3.491AsnAla: 3.491 ± 0.377
0.172AsnCys: 0.172 ± 0.105
2.232AsnAsp: 2.232 ± 0.369
1.602AsnGlu: 1.602 ± 0.33
0.801AsnPhe: 0.801 ± 0.244
4.178AsnGly: 4.178 ± 0.51
0.858AsnHis: 0.858 ± 0.172
1.774AsnIle: 1.774 ± 0.444
1.145AsnLys: 1.145 ± 0.26
2.175AsnLeu: 2.175 ± 0.347
0.572AsnMet: 0.572 ± 0.167
1.717AsnAsn: 1.717 ± 0.413
2.518AsnPro: 2.518 ± 0.362
1.145AsnGln: 1.145 ± 0.312
2.06AsnArg: 2.06 ± 0.372
1.431AsnSer: 1.431 ± 0.245
2.289AsnThr: 2.289 ± 0.367
2.06AsnVal: 2.06 ± 0.351
0.744AsnTrp: 0.744 ± 0.165
0.63AsnTyr: 0.63 ± 0.161
0.0AsnXaa: 0.0 ± 0.0
Pro
6.009ProAla: 6.009 ± 0.7
0.744ProCys: 0.744 ± 0.207
4.807ProAsp: 4.807 ± 0.557
4.121ProGlu: 4.121 ± 0.448
1.717ProPhe: 1.717 ± 0.326
6.753ProGly: 6.753 ± 0.588
1.374ProHis: 1.374 ± 0.272
2.175ProIle: 2.175 ± 0.333
2.346ProLys: 2.346 ± 0.363
4.807ProLeu: 4.807 ± 0.624
1.66ProMet: 1.66 ± 0.358
2.289ProAsn: 2.289 ± 0.328
3.606ProPro: 3.606 ± 0.571
2.232ProGln: 2.232 ± 0.4
3.205ProArg: 3.205 ± 0.49
3.548ProSer: 3.548 ± 0.402
2.976ProThr: 2.976 ± 0.54
4.578ProVal: 4.578 ± 0.529
1.202ProTrp: 1.202 ± 0.289
1.66ProTyr: 1.66 ± 0.278
0.0ProXaa: 0.0 ± 0.0
Gln
3.834GlnAla: 3.834 ± 0.521
0.343GlnCys: 0.343 ± 0.16
1.545GlnAsp: 1.545 ± 0.246
1.66GlnGlu: 1.66 ± 0.309
1.145GlnPhe: 1.145 ± 0.22
2.633GlnGly: 2.633 ± 0.528
1.03GlnHis: 1.03 ± 0.314
1.774GlnIle: 1.774 ± 0.365
1.545GlnLys: 1.545 ± 0.264
3.262GlnLeu: 3.262 ± 0.372
0.572GlnMet: 0.572 ± 0.174
0.916GlnAsn: 0.916 ± 0.281
2.804GlnPro: 2.804 ± 0.433
0.916GlnGln: 0.916 ± 0.24
2.633GlnArg: 2.633 ± 0.444
2.346GlnSer: 2.346 ± 0.347
1.602GlnThr: 1.602 ± 0.351
2.118GlnVal: 2.118 ± 0.416
0.687GlnTrp: 0.687 ± 0.201
1.087GlnTyr: 1.087 ± 0.269
0.0GlnXaa: 0.0 ± 0.0
Arg
7.383ArgAla: 7.383 ± 0.71
1.316ArgCys: 1.316 ± 0.344
4.464ArgAsp: 4.464 ± 0.612
4.636ArgGlu: 4.636 ± 0.586
1.946ArgPhe: 1.946 ± 0.442
3.663ArgGly: 3.663 ± 0.455
1.431ArgHis: 1.431 ± 0.309
3.777ArgIle: 3.777 ± 0.528
2.404ArgLys: 2.404 ± 0.376
5.265ArgLeu: 5.265 ± 0.674
2.575ArgMet: 2.575 ± 0.406
2.633ArgAsn: 2.633 ± 0.449
3.606ArgPro: 3.606 ± 0.543
1.774ArgGln: 1.774 ± 0.333
5.151ArgArg: 5.151 ± 0.726
3.949ArgSer: 3.949 ± 0.457
3.491ArgThr: 3.491 ± 0.468
4.578ArgVal: 4.578 ± 0.513
2.232ArgTrp: 2.232 ± 0.349
1.946ArgTyr: 1.946 ± 0.343
0.0ArgXaa: 0.0 ± 0.0
Ser
5.895SerAla: 5.895 ± 0.742
0.458SerCys: 0.458 ± 0.153
3.892SerAsp: 3.892 ± 0.485
3.434SerGlu: 3.434 ± 0.466
1.946SerPhe: 1.946 ± 0.294
6.009SerGly: 6.009 ± 0.905
0.916SerHis: 0.916 ± 0.267
3.148SerIle: 3.148 ± 0.444
2.518SerLys: 2.518 ± 0.449
4.063SerLeu: 4.063 ± 0.462
1.374SerMet: 1.374 ± 0.276
2.346SerAsn: 2.346 ± 0.431
3.491SerPro: 3.491 ± 0.375
1.717SerGln: 1.717 ± 0.281
3.377SerArg: 3.377 ± 0.37
4.006SerSer: 4.006 ± 0.585
3.319SerThr: 3.319 ± 0.415
4.865SerVal: 4.865 ± 0.557
1.316SerTrp: 1.316 ± 0.277
1.431SerTyr: 1.431 ± 0.241
0.0SerXaa: 0.0 ± 0.0
Thr
6.582ThrAla: 6.582 ± 0.598
0.572ThrCys: 0.572 ± 0.19
3.434ThrAsp: 3.434 ± 0.499
3.72ThrGlu: 3.72 ± 0.356
1.717ThrPhe: 1.717 ± 0.309
6.295ThrGly: 6.295 ± 0.662
1.66ThrHis: 1.66 ± 0.391
2.976ThrIle: 2.976 ± 0.411
2.118ThrLys: 2.118 ± 0.367
4.006ThrLeu: 4.006 ± 0.473
1.602ThrMet: 1.602 ± 0.327
2.232ThrAsn: 2.232 ± 0.416
3.72ThrPro: 3.72 ± 0.561
2.175ThrGln: 2.175 ± 0.31
3.606ThrArg: 3.606 ± 0.477
4.006ThrSer: 4.006 ± 0.513
4.807ThrThr: 4.807 ± 0.68
5.838ThrVal: 5.838 ± 0.524
1.145ThrTrp: 1.145 ± 0.338
1.66ThrTyr: 1.66 ± 0.297
0.0ThrXaa: 0.0 ± 0.0
Val
7.555ValAla: 7.555 ± 0.599
1.316ValCys: 1.316 ± 0.256
4.979ValAsp: 4.979 ± 0.518
5.38ValGlu: 5.38 ± 0.655
2.404ValPhe: 2.404 ± 0.423
5.78ValGly: 5.78 ± 0.647
1.259ValHis: 1.259 ± 0.252
2.404ValIle: 2.404 ± 0.368
2.06ValLys: 2.06 ± 0.343
5.151ValLeu: 5.151 ± 0.553
1.316ValMet: 1.316 ± 0.216
1.946ValAsn: 1.946 ± 0.379
3.834ValPro: 3.834 ± 0.441
2.919ValGln: 2.919 ± 0.38
4.807ValArg: 4.807 ± 0.561
4.75ValSer: 4.75 ± 0.553
4.521ValThr: 4.521 ± 0.467
6.124ValVal: 6.124 ± 0.688
1.831ValTrp: 1.831 ± 0.329
1.488ValTyr: 1.488 ± 0.275
0.0ValXaa: 0.0 ± 0.0
Trp
2.06TrpAla: 2.06 ± 0.325
0.229TrpCys: 0.229 ± 0.115
1.602TrpAsp: 1.602 ± 0.287
1.259TrpGlu: 1.259 ± 0.332
0.687TrpPhe: 0.687 ± 0.17
1.03TrpGly: 1.03 ± 0.272
0.801TrpHis: 0.801 ± 0.195
1.202TrpIle: 1.202 ± 0.231
0.63TrpLys: 0.63 ± 0.183
1.488TrpLeu: 1.488 ± 0.307
0.801TrpMet: 0.801 ± 0.228
0.572TrpAsn: 0.572 ± 0.201
1.316TrpPro: 1.316 ± 0.246
0.916TrpGln: 0.916 ± 0.265
2.232TrpArg: 2.232 ± 0.46
1.316TrpSer: 1.316 ± 0.249
1.488TrpThr: 1.488 ± 0.305
1.774TrpVal: 1.774 ± 0.441
1.087TrpTrp: 1.087 ± 0.207
0.63TrpTyr: 0.63 ± 0.158
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.747TyrAla: 2.747 ± 0.412
0.343TyrCys: 0.343 ± 0.126
1.889TyrAsp: 1.889 ± 0.377
2.06TyrGlu: 2.06 ± 0.342
0.63TyrPhe: 0.63 ± 0.173
2.003TyrGly: 2.003 ± 0.352
0.229TyrHis: 0.229 ± 0.102
1.03TyrIle: 1.03 ± 0.255
0.801TyrLys: 0.801 ± 0.235
1.774TyrLeu: 1.774 ± 0.281
0.343TyrMet: 0.343 ± 0.13
0.858TyrAsn: 0.858 ± 0.237
1.316TyrPro: 1.316 ± 0.227
0.801TyrGln: 0.801 ± 0.216
2.346TyrArg: 2.346 ± 0.423
1.03TyrSer: 1.03 ± 0.247
1.374TyrThr: 1.374 ± 0.358
2.461TyrVal: 2.461 ± 0.354
0.572TyrTrp: 0.572 ± 0.175
0.63TyrTyr: 0.63 ± 0.169
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 102 proteins (17474 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski