Amino acid dipeptide frequency for Mycobacterium phage Dori

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
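As an illustration of what a dipeptide frequency is, here is a minimal sketch that counts overlapping adjacent residue pairs in a set of protein sequences. The function name dipeptide_frequencies, the toy sequences, the one-letter residue codes, and the choice to normalize by the total number of counted pairs are all assumptions made for this example; the database may count or normalize differently.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: pairs are overlapping, never span two proteins, and are
    normalized by the total number of pairs counted.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2 along each protein sequence.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy proteome (one-letter codes), not data from this page.
toy_proteome = ["MAAGL", "AAK"]
freqs = dipeptide_frequencies(toy_proteome)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f} per mille")
```

The result is a dictionary keyed by the two-letter pair, so a 20 x 20 (or 21 x 21) table like the one below can be filled by looking up each combination and using 0.0 for pairs that never occur.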

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
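The arithmetic above can be sketched in a few lines. This example assumes that the over/under threshold is exactly the uniform expectation of 1/400; the names expected_permille, observed and classify are hypothetical, and the four values are copied from the table below.

```python
N_STANDARD = 20  # standard amino acids, Xaa excluded

# Uniform expectation: 1 / 400 = 0.25 % = 2.5 per mille.
expected_permille = 1000.0 / (N_STANDARD ** 2)

# A few observed values taken from the table (per mille).
observed = {"AlaAla": 17.642, "GlyGly": 12.107, "GlnCys": 0.049, "CysCys": 0.0}


def classify(value_permille, expected=expected_permille):
    """Label a dipeptide relative to the assumed uniform expectation."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "as expected"


labels = {name: classify(value) for name, value in observed.items()}

# Per mille to plain fraction: multiply by 10^-3.
fractions = {name: value * 1e-3 for name, value in observed.items()}

for name in observed:
    print(name, labels[name], fractions[name])
```

Under this assumption AlaAla and GlyGly fall on the overrepresented side, while GlnCys and CysCys fall on the underrepresented side.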

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 17.642 ± 1.544
AlaCys: 0.84 ± 0.221
AlaAsp: 6.671 ± 0.539
AlaGlu: 9.834 ± 0.911
AlaPhe: 2.372 ± 0.355
AlaGly: 10.378 ± 0.864
AlaHis: 2.076 ± 0.329
AlaIle: 4.448 ± 0.562
AlaLys: 3.459 ± 0.447
AlaLeu: 10.575 ± 0.878
AlaMet: 2.817 ± 0.383
AlaAsn: 3.706 ± 0.401
AlaPro: 7.017 ± 0.569
AlaGln: 5.189 ± 0.536
AlaArg: 8.055 ± 0.768
AlaSer: 5.881 ± 0.613
AlaThr: 6.572 ± 0.657
AlaVal: 9.735 ± 0.788
AlaTrp: 1.532 ± 0.292
AlaTyr: 1.977 ± 0.384
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.741 ± 0.21
CysCys: 0.0 ± 0.0
CysAsp: 0.791 ± 0.22
CysGlu: 0.89 ± 0.247
CysPhe: 0.148 ± 0.107
CysGly: 1.384 ± 0.304
CysHis: 0.198 ± 0.097
CysIle: 0.198 ± 0.114
CysLys: 0.297 ± 0.126
CysLeu: 0.692 ± 0.215
CysMet: 0.099 ± 0.077
CysAsn: 0.247 ± 0.127
CysPro: 0.84 ± 0.252
CysGln: 0.198 ± 0.105
CysArg: 0.84 ± 0.228
CysSer: 0.692 ± 0.193
CysThr: 0.692 ± 0.19
CysVal: 0.692 ± 0.173
CysTrp: 0.297 ± 0.146
CysTyr: 0.395 ± 0.125
CysXaa: 0.0 ± 0.0
Asp
AspAla: 8.006 ± 0.576
AspCys: 0.544 ± 0.185
AspAsp: 4.497 ± 0.58
AspGlu: 5.535 ± 0.632
AspPhe: 1.334 ± 0.278
AspGly: 6.572 ± 0.542
AspHis: 0.84 ± 0.209
AspIle: 2.076 ± 0.27
AspLys: 1.779 ± 0.295
AspLeu: 5.386 ± 0.591
AspMet: 0.89 ± 0.226
AspAsn: 1.631 ± 0.322
AspPro: 4.299 ± 0.581
AspGln: 1.779 ± 0.303
AspArg: 4.299 ± 0.463
AspSer: 3.163 ± 0.473
AspThr: 3.262 ± 0.478
AspVal: 5.386 ± 0.568
AspTrp: 0.988 ± 0.207
AspTyr: 1.137 ± 0.299
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.363 ± 0.68
GluCys: 0.89 ± 0.242
GluAsp: 3.509 ± 0.448
GluGlu: 2.52 ± 0.467
GluPhe: 2.026 ± 0.374
GluGly: 4.448 ± 0.586
GluHis: 1.927 ± 0.309
GluIle: 3.558 ± 0.388
GluLys: 1.828 ± 0.321
GluLeu: 6.77 ± 0.665
GluMet: 1.137 ± 0.297
GluAsn: 1.631 ± 0.342
GluPro: 3.607 ± 0.499
GluGln: 2.817 ± 0.305
GluArg: 4.497 ± 0.513
GluSer: 3.014 ± 0.34
GluThr: 3.262 ± 0.435
GluVal: 4.892 ± 0.615
GluTrp: 1.483 ± 0.282
GluTyr: 1.384 ± 0.338
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.163 ± 0.317
PheCys: 0.395 ± 0.142
PheAsp: 1.927 ± 0.384
PheGlu: 1.73 ± 0.308
PhePhe: 0.593 ± 0.164
PheGly: 2.57 ± 0.39
PheHis: 0.395 ± 0.142
PheIle: 0.84 ± 0.214
PheLys: 0.741 ± 0.179
PheLeu: 1.878 ± 0.307
PheMet: 0.593 ± 0.171
PheAsn: 0.84 ± 0.189
PhePro: 1.087 ± 0.273
PheGln: 0.741 ± 0.186
PheArg: 1.68 ± 0.337
PheSer: 1.384 ± 0.203
PheThr: 1.779 ± 0.282
PheVal: 1.73 ± 0.28
PheTrp: 0.593 ± 0.182
PheTyr: 0.642 ± 0.209
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.5 ± 0.996
GlyCys: 0.988 ± 0.218
GlyAsp: 4.596 ± 0.386
GlyGlu: 5.09 ± 0.538
GlyPhe: 2.076 ± 0.312
GlyGly: 12.107 ± 2.024
GlyHis: 2.076 ± 0.329
GlyIle: 3.657 ± 0.454
GlyLys: 3.311 ± 0.495
GlyLeu: 6.622 ± 0.702
GlyMet: 2.026 ± 0.282
GlyAsn: 2.52 ± 0.352
GlyPro: 4.25 ± 0.617
GlyGln: 3.262 ± 0.363
GlyArg: 5.782 ± 0.57
GlySer: 5.584 ± 0.859
GlyThr: 5.831 ± 0.575
GlyVal: 7.758 ± 0.839
GlyTrp: 1.73 ± 0.304
GlyTyr: 2.57 ± 0.411
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.174 ± 0.319
HisCys: 0.148 ± 0.118
HisAsp: 1.285 ± 0.266
HisGlu: 1.235 ± 0.275
HisPhe: 0.544 ± 0.153
HisGly: 2.076 ± 0.31
HisHis: 0.494 ± 0.198
HisIle: 0.939 ± 0.221
HisLys: 0.297 ± 0.104
HisLeu: 1.977 ± 0.332
HisMet: 0.544 ± 0.171
HisAsn: 0.544 ± 0.137
HisPro: 1.038 ± 0.269
HisGln: 0.642 ± 0.186
HisArg: 1.532 ± 0.295
HisSer: 0.791 ± 0.209
HisThr: 1.235 ± 0.251
HisVal: 0.988 ± 0.203
HisTrp: 0.445 ± 0.148
HisTyr: 0.741 ± 0.186
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.325 ± 0.533
IleCys: 0.395 ± 0.143
IleAsp: 3.113 ± 0.387
IleGlu: 3.212 ± 0.424
IlePhe: 1.038 ± 0.234
IleGly: 3.36 ± 0.535
IleHis: 0.642 ± 0.162
IleIle: 1.235 ± 0.24
IleLys: 1.483 ± 0.281
IleLeu: 2.965 ± 0.351
IleMet: 0.247 ± 0.109
IleAsn: 1.631 ± 0.333
IlePro: 2.471 ± 0.299
IleGln: 1.186 ± 0.267
IleArg: 2.965 ± 0.495
IleSer: 1.878 ± 0.294
IleThr: 3.014 ± 0.357
IleVal: 3.41 ± 0.411
IleTrp: 0.445 ± 0.147
IleTyr: 0.494 ± 0.165
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.262 ± 0.479
LysCys: 0.297 ± 0.096
LysAsp: 1.828 ± 0.297
LysGlu: 0.988 ± 0.19
LysPhe: 0.84 ± 0.194
LysGly: 2.076 ± 0.4
LysHis: 0.593 ± 0.146
LysIle: 1.581 ± 0.226
LysLys: 0.939 ± 0.217
LysLeu: 2.57 ± 0.42
LysMet: 0.544 ± 0.151
LysAsn: 0.791 ± 0.171
LysPro: 1.779 ± 0.314
LysGln: 1.384 ± 0.24
LysArg: 2.52 ± 0.484
LysSer: 1.581 ± 0.284
LysThr: 2.026 ± 0.283
LysVal: 1.977 ± 0.295
LysTrp: 0.791 ± 0.217
LysTyr: 0.692 ± 0.183
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 11.218 ± 0.962
LeuCys: 1.038 ± 0.205
LeuAsp: 5.139 ± 0.462
LeuGlu: 4.349 ± 0.515
LeuPhe: 2.372 ± 0.336
LeuGly: 7.314 ± 0.553
LeuHis: 1.433 ± 0.251
LeuIle: 3.41 ± 0.43
LeuLys: 2.125 ± 0.327
LeuLeu: 6.029 ± 0.672
LeuMet: 1.73 ± 0.264
LeuAsn: 2.273 ± 0.407
LeuPro: 5.436 ± 0.6
LeuGln: 2.767 ± 0.34
LeuArg: 5.732 ± 0.748
LeuSer: 3.805 ± 0.442
LeuThr: 6.276 ± 0.533
LeuVal: 6.276 ± 0.59
LeuTrp: 1.483 ± 0.333
LeuTyr: 1.532 ± 0.321
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.125 ± 0.345
MetCys: 0.198 ± 0.102
MetAsp: 0.988 ± 0.245
MetGlu: 0.741 ± 0.174
MetPhe: 0.642 ± 0.168
MetGly: 1.977 ± 0.29
MetHis: 0.247 ± 0.119
MetIle: 1.038 ± 0.206
MetLys: 0.346 ± 0.14
MetLeu: 1.68 ± 0.305
MetMet: 0.297 ± 0.119
MetAsn: 0.494 ± 0.142
MetPro: 1.334 ± 0.323
MetGln: 0.494 ± 0.136
MetArg: 1.137 ± 0.233
MetSer: 1.334 ± 0.2
MetThr: 2.669 ± 0.373
MetVal: 1.334 ± 0.295
MetTrp: 0.297 ± 0.127
MetTyr: 0.494 ± 0.14
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.41 ± 0.454
AsnCys: 0.297 ± 0.129
AsnAsp: 1.581 ± 0.258
AsnGlu: 1.334 ± 0.305
AsnPhe: 0.445 ± 0.152
AsnGly: 3.41 ± 0.409
AsnHis: 0.741 ± 0.247
AsnIle: 0.89 ± 0.214
AsnLys: 0.692 ± 0.177
AsnLeu: 2.076 ± 0.356
AsnMet: 0.346 ± 0.129
AsnAsn: 1.137 ± 0.257
AsnPro: 2.52 ± 0.35
AsnGln: 1.137 ± 0.214
AsnArg: 2.224 ± 0.394
AsnSer: 1.532 ± 0.255
AsnThr: 1.977 ± 0.323
AsnVal: 1.977 ± 0.279
AsnTrp: 0.544 ± 0.155
AsnTyr: 0.494 ± 0.143
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.177 ± 0.645
ProCys: 0.494 ± 0.184
ProAsp: 5.288 ± 0.545
ProGlu: 5.09 ± 0.629
ProPhe: 1.384 ± 0.277
ProGly: 6.128 ± 0.702
ProHis: 1.038 ± 0.253
ProIle: 2.224 ± 0.446
ProLys: 1.878 ± 0.322
ProLeu: 3.953 ± 0.479
ProMet: 1.285 ± 0.255
ProAsn: 1.433 ± 0.285
ProPro: 4.596 ± 0.487
ProGln: 2.421 ± 0.339
ProArg: 3.855 ± 0.53
ProSer: 3.163 ± 0.409
ProThr: 3.509 ± 0.358
ProVal: 5.436 ± 0.555
ProTrp: 1.186 ± 0.234
ProTyr: 0.939 ± 0.219
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.843 ± 0.573
GlnCys: 0.049 ± 0.05
GlnAsp: 1.384 ± 0.329
GlnGlu: 1.483 ± 0.279
GlnPhe: 1.235 ± 0.249
GlnGly: 2.323 ± 0.49
GlnHis: 0.741 ± 0.171
GlnIle: 2.076 ± 0.311
GlnLys: 1.038 ± 0.194
GlnLeu: 3.509 ± 0.421
GlnMet: 1.038 ± 0.23
GlnAsn: 0.741 ± 0.213
GlnPro: 2.273 ± 0.416
GlnGln: 1.779 ± 0.404
GlnArg: 2.916 ± 0.431
GlnSer: 1.334 ± 0.292
GlnThr: 2.619 ± 0.373
GlnVal: 2.669 ± 0.392
GlnTrp: 1.087 ± 0.232
GlnTyr: 0.89 ± 0.265
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 8.648 ± 0.991
ArgCys: 1.186 ± 0.251
ArgAsp: 4.299 ± 0.468
ArgGlu: 4.497 ± 0.506
ArgPhe: 1.828 ± 0.273
ArgGly: 4.843 ± 0.444
ArgHis: 1.581 ± 0.306
ArgIle: 2.669 ± 0.348
ArgLys: 2.817 ± 0.572
ArgLeu: 6.029 ± 0.673
ArgMet: 2.026 ± 0.304
ArgAsn: 1.977 ± 0.273
ArgPro: 4.2 ± 0.539
ArgGln: 2.174 ± 0.333
ArgArg: 6.424 ± 0.888
ArgSer: 3.657 ± 0.393
ArgThr: 4.102 ± 0.495
ArgVal: 5.041 ± 0.521
ArgTrp: 2.174 ± 0.328
ArgTyr: 1.828 ± 0.328
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.93 ± 0.58
SerCys: 0.642 ± 0.285
SerAsp: 4.349 ± 0.449
SerGlu: 2.965 ± 0.379
SerPhe: 1.334 ± 0.241
SerGly: 5.485 ± 0.637
SerHis: 1.087 ± 0.242
SerIle: 1.828 ± 0.28
SerLys: 1.235 ± 0.253
SerLeu: 3.558 ± 0.447
SerMet: 1.433 ± 0.279
SerAsn: 1.878 ± 0.349
SerPro: 2.372 ± 0.289
SerGln: 1.384 ± 0.215
SerArg: 3.459 ± 0.437
SerSer: 2.619 ± 0.354
SerThr: 3.805 ± 0.489
SerVal: 4.25 ± 0.44
SerTrp: 0.89 ± 0.237
SerTyr: 1.087 ± 0.272
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 8.104 ± 0.899
ThrCys: 0.642 ± 0.246
ThrAsp: 3.904 ± 0.532
ThrGlu: 3.41 ± 0.419
ThrPhe: 1.779 ± 0.272
ThrGly: 5.436 ± 0.586
ThrHis: 1.186 ± 0.27
ThrIle: 3.311 ± 0.377
ThrLys: 1.878 ± 0.249
ThrLeu: 5.09 ± 0.523
ThrMet: 1.384 ± 0.233
ThrAsn: 1.73 ± 0.304
ThrPro: 4.497 ± 0.456
ThrGln: 2.471 ± 0.461
ThrArg: 4.546 ± 0.483
ThrSer: 3.262 ± 0.483
ThrThr: 3.904 ± 0.48
ThrVal: 5.979 ± 0.682
ThrTrp: 1.532 ± 0.314
ThrTyr: 1.384 ± 0.239
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.253 ± 0.579
ValCys: 0.642 ± 0.178
ValAsp: 5.535 ± 0.624
ValGlu: 5.09 ± 0.598
ValPhe: 1.977 ± 0.302
ValGly: 5.881 ± 0.523
ValHis: 1.68 ± 0.376
ValIle: 3.756 ± 0.458
ValLys: 1.828 ± 0.342
ValLeu: 6.721 ± 0.744
ValMet: 0.84 ± 0.188
ValAsn: 2.471 ± 0.293
ValPro: 5.238 ± 0.561
ValGln: 2.718 ± 0.402
ValArg: 6.078 ± 0.596
ValSer: 5.041 ± 0.558
ValThr: 6.078 ± 0.599
ValVal: 6.078 ± 0.63
ValTrp: 1.433 ± 0.243
ValTyr: 1.73 ± 0.353
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.323 ± 0.385
TrpCys: 0.346 ± 0.123
TrpAsp: 1.038 ± 0.226
TrpGlu: 0.988 ± 0.217
TrpPhe: 0.84 ± 0.212
TrpGly: 1.285 ± 0.292
TrpHis: 0.445 ± 0.159
TrpIle: 1.038 ± 0.203
TrpLys: 0.544 ± 0.146
TrpLeu: 1.68 ± 0.274
TrpMet: 0.198 ± 0.093
TrpAsn: 0.642 ± 0.149
TrpPro: 0.988 ± 0.262
TrpGln: 0.791 ± 0.225
TrpArg: 1.68 ± 0.302
TrpSer: 0.89 ± 0.217
TrpThr: 1.483 ± 0.286
TrpVal: 1.532 ± 0.282
TrpTrp: 0.593 ± 0.217
TrpTyr: 0.544 ± 0.146
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.57 ± 0.391
TyrCys: 0.297 ± 0.13
TyrAsp: 1.68 ± 0.298
TyrGlu: 1.581 ± 0.299
TyrPhe: 0.494 ± 0.126
TyrGly: 1.285 ± 0.279
TyrHis: 0.247 ± 0.108
TyrIle: 0.741 ± 0.176
TyrLys: 0.544 ± 0.163
TyrLeu: 1.977 ± 0.424
TyrMet: 0.346 ± 0.142
TyrAsn: 0.445 ± 0.13
TyrPro: 1.631 ± 0.382
TyrGln: 0.692 ± 0.169
TyrArg: 1.828 ± 0.385
TyrSer: 0.988 ± 0.216
TyrThr: 1.334 ± 0.266
TyrVal: 1.927 ± 0.35
TyrTrp: 0.346 ± 0.136
TyrTyr: 0.297 ± 0.111
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 94 proteins (20237 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level
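A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the resampled estimates is taken as the error. This is only an illustration; the function names, the toy sequences, the fixed seed, and the use of the sample standard deviation as the reported error are assumptions, not the database's documented procedure.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Frequency of one dipeptide (per mille) over a list of sequences."""
    count = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                count += 1
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome (one-letter codes), not data from this page.
toy_proteome = ["MAAGL", "AAK", "GLAAV", "MKTLL"]
point = pair_permille(toy_proteome, "AA")
error = bootstrap_error(toy_proteome, "AA")
print(f"AA: {point:.3f} ± {error:.3f} per mille")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins in the proteome.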

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski