Amino acid dipeptide frequency for Mycobacterium phage LilMcDreamy

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. the frequency of each ordered pair of adjacent amino acids.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random with equal probability, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
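The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used to build this page: it assumes that dipeptides are counted as overlapping adjacent pairs within each protein, that frequencies are normalized by the total number of pairs, and that any non-standard letter is mapped to X (Xaa). The function names and the toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes
ALPHABET = STANDARD + "X"          # X stands for Xaa (non-standard or unknown)

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all adjacent pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs, never spanning two different proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }

def classify(freq_per_mille, expected_per_mille=1000.0 / 400):
    """Label a dipeptide relative to the uniform expectation (2.5 per mille)."""
    if freq_per_mille > expected_per_mille:
        return "over"
    if freq_per_mille < expected_per_mille:
        return "under"
    return "expected"

# Toy input, not real phage proteins.
toy = ["MAAG", "AAXA"]
freqs = dipeptide_per_mille(toy)
print(len(freqs), round(freqs["AA"], 3), classify(freqs["AA"]))
```

With the 441-dipeptide alphabet the uniform expectation would be 1000/441 per mille instead; pass that as expected_per_mille if Xaa is counted.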

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
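As a worked example of the unit conversion, the AlaAla value from the table (17.728‰) can be turned into a fraction and a percentage. The variable names are illustrative, and the comparison against 2.5‰ assumes the uniform expectation for 400 dipeptides mentioned above.

```python
ala_ala_per_mille = 17.728             # AlaAla value from the table, in per mille
fraction = ala_ala_per_mille * 1e-3    # per mille -> fraction
percent = ala_ala_per_mille / 10       # per mille -> percent
uniform_per_mille = 1000 / 400         # uniform expectation: 2.5 per mille (0.25%)
ratio = ala_ala_per_mille / uniform_per_mille

print(f"{fraction:.6f} {percent:.4f}% {ratio:.2f}x")
# prints: 0.017728 1.7728% 7.09x
```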

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 17.728 ± 1.636
AlaCys: 0.915 ± 0.252
AlaAsp: 8.799 ± 0.622
AlaGlu: 7.753 ± 0.624
AlaPhe: 2.918 ± 0.368
AlaGly: 10.541 ± 1.382
AlaHis: 2.221 ± 0.333
AlaIle: 4.966 ± 0.513
AlaLys: 3.92 ± 0.439
AlaLeu: 10.715 ± 0.875
AlaMet: 2.918 ± 0.356
AlaAsn: 3.746 ± 0.486
AlaPro: 7.492 ± 0.595
AlaGln: 4.051 ± 0.465
AlaArg: 8.058 ± 0.728
AlaSer: 5.793 ± 0.452
AlaThr: 8.363 ± 0.669
AlaVal: 8.886 ± 0.664
AlaTrp: 1.96 ± 0.307
AlaTyr: 2.918 ± 0.357
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.566 ± 0.141
CysCys: 0.087 ± 0.058
CysAsp: 0.74 ± 0.15
CysGlu: 0.436 ± 0.161
CysPhe: 0.392 ± 0.149
CysGly: 1.568 ± 0.325
CysHis: 0.218 ± 0.088
CysIle: 0.305 ± 0.127
CysLys: 0.087 ± 0.058
CysLeu: 0.653 ± 0.167
CysMet: 0.218 ± 0.104
CysAsn: 0.174 ± 0.089
CysPro: 0.871 ± 0.218
CysGln: 0.218 ± 0.094
CysArg: 1.133 ± 0.257
CysSer: 0.479 ± 0.167
CysThr: 0.74 ± 0.165
CysVal: 0.479 ± 0.156
CysTrp: 0.174 ± 0.083
CysTyr: 0.087 ± 0.066
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.971 ± 0.566
AspCys: 0.784 ± 0.199
AspAsp: 4.704 ± 0.636
AspGlu: 4.661 ± 0.463
AspPhe: 1.917 ± 0.287
AspGly: 6.229 ± 0.563
AspHis: 1.089 ± 0.192
AspIle: 3.093 ± 0.389
AspLys: 1.35 ± 0.247
AspLeu: 5.575 ± 0.443
AspMet: 1.089 ± 0.25
AspAsn: 1.655 ± 0.267
AspPro: 5.27 ± 0.592
AspGln: 2.004 ± 0.272
AspArg: 5.27 ± 0.56
AspSer: 3.398 ± 0.409
AspThr: 4.007 ± 0.432
AspVal: 5.837 ± 0.474
AspTrp: 1.35 ± 0.222
AspTyr: 1.002 ± 0.23
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.884 ± 0.704
GluCys: 0.784 ± 0.223
GluAsp: 2.091 ± 0.387
GluGlu: 2.396 ± 0.483
GluPhe: 1.612 ± 0.31
GluGly: 4.356 ± 0.536
GluHis: 1.307 ± 0.267
GluIle: 3.18 ± 0.304
GluLys: 1.133 ± 0.198
GluLeu: 4.312 ± 0.461
GluMet: 1.525 ± 0.301
GluAsn: 1.089 ± 0.247
GluPro: 3.136 ± 0.452
GluGln: 2.962 ± 0.404
GluArg: 3.79 ± 0.379
GluSer: 2.352 ± 0.345
GluThr: 4.704 ± 0.431
GluVal: 5.75 ± 0.533
GluTrp: 1.655 ± 0.272
GluTyr: 1.176 ± 0.209
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.441 ± 0.384
PheCys: 0.174 ± 0.086
PheAsp: 2.091 ± 0.381
PheGlu: 1.437 ± 0.264
PhePhe: 0.566 ± 0.178
PheGly: 3.354 ± 0.416
PheHis: 0.958 ± 0.213
PheIle: 1.045 ± 0.322
PheLys: 1.002 ± 0.228
PheLeu: 1.133 ± 0.262
PheMet: 0.131 ± 0.068
PheAsn: 0.784 ± 0.19
PhePro: 1.35 ± 0.195
PheGln: 0.348 ± 0.143
PheArg: 1.699 ± 0.241
PheSer: 1.045 ± 0.216
PheThr: 1.96 ± 0.256
PheVal: 1.699 ± 0.239
PheTrp: 0.305 ± 0.105
PheTyr: 0.523 ± 0.195
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 9.365 ± 1.622
GlyCys: 1.002 ± 0.245
GlyAsp: 6.664 ± 0.539
GlyGlu: 5.663 ± 0.517
GlyPhe: 2.831 ± 0.436
GlyGly: 11.456 ± 1.822
GlyHis: 1.699 ± 0.291
GlyIle: 3.79 ± 0.428
GlyLys: 3.136 ± 0.393
GlyLeu: 8.058 ± 0.819
GlyMet: 2.091 ± 0.308
GlyAsn: 2.701 ± 0.466
GlyPro: 4.574 ± 0.506
GlyGln: 2.744 ± 0.29
GlyArg: 6.621 ± 0.597
GlySer: 5.401 ± 0.484
GlyThr: 6.534 ± 0.506
GlyVal: 6.403 ± 0.802
GlyTrp: 2.221 ± 0.415
GlyTyr: 2.657 ± 0.351
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.178 ± 0.352
HisCys: 0.218 ± 0.112
HisAsp: 1.394 ± 0.281
HisGlu: 0.74 ± 0.211
HisPhe: 0.566 ± 0.162
HisGly: 1.307 ± 0.231
HisHis: 0.436 ± 0.148
HisIle: 0.915 ± 0.189
HisLys: 0.436 ± 0.128
HisLeu: 1.655 ± 0.267
HisMet: 0.261 ± 0.142
HisAsn: 0.348 ± 0.12
HisPro: 1.612 ± 0.302
HisGln: 0.523 ± 0.141
HisArg: 1.699 ± 0.299
HisSer: 0.61 ± 0.161
HisThr: 1.22 ± 0.251
HisVal: 0.958 ± 0.196
HisTrp: 0.566 ± 0.151
HisTyr: 0.261 ± 0.115
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.664 ± 0.536
IleCys: 0.566 ± 0.169
IleAsp: 4.443 ± 0.525
IleGlu: 3.79 ± 0.494
IlePhe: 0.828 ± 0.16
IleGly: 4.356 ± 0.673
IleHis: 0.479 ± 0.14
IleIle: 1.568 ± 0.305
IleLys: 1.525 ± 0.287
IleLeu: 2.309 ± 0.265
IleMet: 0.61 ± 0.127
IleAsn: 1.917 ± 0.297
IlePro: 2.309 ± 0.303
IleGln: 0.958 ± 0.196
IleArg: 1.612 ± 0.272
IleSer: 1.742 ± 0.247
IleThr: 3.833 ± 0.339
IleVal: 3.441 ± 0.359
IleTrp: 0.523 ± 0.155
IleTyr: 0.915 ± 0.175
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.702 ± 0.505
LysCys: 0.131 ± 0.07
LysAsp: 0.871 ± 0.186
LysGlu: 0.958 ± 0.184
LysPhe: 0.479 ± 0.147
LysGly: 2.613 ± 0.358
LysHis: 0.566 ± 0.148
LysIle: 1.437 ± 0.24
LysLys: 1.176 ± 0.248
LysLeu: 2.439 ± 0.353
LysMet: 0.784 ± 0.205
LysAsn: 0.697 ± 0.136
LysPro: 1.829 ± 0.358
LysGln: 1.133 ± 0.225
LysArg: 2.047 ± 0.323
LysSer: 2.265 ± 0.277
LysThr: 1.96 ± 0.267
LysVal: 2.744 ± 0.417
LysTrp: 0.523 ± 0.16
LysTyr: 0.523 ± 0.153
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 11.02 ± 0.762
LeuCys: 0.523 ± 0.138
LeuAsp: 5.793 ± 0.61
LeuGlu: 3.572 ± 0.316
LeuPhe: 1.742 ± 0.322
LeuGly: 7.492 ± 0.812
LeuHis: 1.568 ± 0.301
LeuIle: 4.007 ± 0.395
LeuLys: 1.742 ± 0.314
LeuLeu: 5.793 ± 0.455
LeuMet: 2.004 ± 0.272
LeuAsn: 2.57 ± 0.44
LeuPro: 5.488 ± 0.522
LeuGln: 2.309 ± 0.327
LeuArg: 5.096 ± 0.593
LeuSer: 4.617 ± 0.439
LeuThr: 5.967 ± 0.523
LeuVal: 5.053 ± 0.375
LeuTrp: 0.958 ± 0.213
LeuTyr: 1.612 ± 0.249
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.221 ± 0.333
MetCys: 0.131 ± 0.07
MetAsp: 1.176 ± 0.207
MetGlu: 0.653 ± 0.183
MetPhe: 0.479 ± 0.131
MetGly: 1.394 ± 0.218
MetHis: 0.436 ± 0.142
MetIle: 1.176 ± 0.245
MetLys: 0.566 ± 0.164
MetLeu: 1.525 ± 0.263
MetMet: 0.523 ± 0.131
MetAsn: 0.653 ± 0.192
MetPro: 1.655 ± 0.282
MetGln: 0.828 ± 0.171
MetArg: 1.655 ± 0.293
MetSer: 1.437 ± 0.237
MetThr: 2.613 ± 0.352
MetVal: 0.915 ± 0.187
MetTrp: 0.305 ± 0.11
MetTyr: 0.348 ± 0.121
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.528 ± 0.437
AsnCys: 0.392 ± 0.127
AsnAsp: 2.788 ± 0.435
AsnGlu: 1.35 ± 0.244
AsnPhe: 0.479 ± 0.153
AsnGly: 2.875 ± 0.366
AsnHis: 0.523 ± 0.15
AsnIle: 0.74 ± 0.17
AsnLys: 1.133 ± 0.208
AsnLeu: 2.265 ± 0.325
AsnMet: 0.61 ± 0.176
AsnAsn: 0.828 ± 0.241
AsnPro: 2.221 ± 0.342
AsnGln: 1.002 ± 0.231
AsnArg: 1.437 ± 0.25
AsnSer: 1.307 ± 0.223
AsnThr: 1.568 ± 0.342
AsnVal: 2.352 ± 0.344
AsnTrp: 0.653 ± 0.169
AsnTyr: 0.784 ± 0.182
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.664 ± 0.506
ProCys: 0.261 ± 0.126
ProAsp: 5.227 ± 0.624
ProGlu: 4.661 ± 0.56
ProPhe: 1.873 ± 0.295
ProGly: 7.448 ± 0.637
ProHis: 0.958 ± 0.208
ProIle: 2.57 ± 0.307
ProLys: 1.699 ± 0.26
ProLeu: 3.485 ± 0.387
ProMet: 1.002 ± 0.182
ProAsn: 1.917 ± 0.298
ProPro: 4.399 ± 0.453
ProGln: 1.612 ± 0.28
ProArg: 3.659 ± 0.513
ProSer: 3.223 ± 0.333
ProThr: 4.835 ± 0.516
ProVal: 4.748 ± 0.376
ProTrp: 1.045 ± 0.252
ProTyr: 0.915 ± 0.203
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.225 ± 0.452
GlnCys: 0.261 ± 0.105
GlnAsp: 1.045 ± 0.244
GlnGlu: 1.133 ± 0.211
GlnPhe: 1.089 ± 0.196
GlnGly: 2.613 ± 0.278
GlnHis: 0.436 ± 0.131
GlnIle: 1.568 ± 0.282
GlnLys: 0.61 ± 0.173
GlnLeu: 3.398 ± 0.341
GlnMet: 0.784 ± 0.169
GlnAsn: 0.697 ± 0.214
GlnPro: 1.96 ± 0.318
GlnGln: 1.045 ± 0.212
GlnArg: 2.918 ± 0.422
GlnSer: 1.307 ± 0.27
GlnThr: 2.265 ± 0.315
GlnVal: 2.613 ± 0.296
GlnTrp: 0.784 ± 0.204
GlnTyr: 0.74 ± 0.147
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.84 ± 0.765
ArgCys: 0.915 ± 0.197
ArgAsp: 4.661 ± 0.501
ArgGlu: 4.661 ± 0.508
ArgPhe: 1.917 ± 0.286
ArgGly: 4.835 ± 0.594
ArgHis: 1.133 ± 0.246
ArgIle: 2.831 ± 0.428
ArgLys: 1.829 ± 0.326
ArgLeu: 6.403 ± 0.518
ArgMet: 1.873 ± 0.291
ArgAsn: 2.483 ± 0.308
ArgPro: 3.964 ± 0.491
ArgGln: 2.134 ± 0.276
ArgArg: 5.793 ± 0.7
ArgSer: 3.659 ± 0.363
ArgThr: 4.574 ± 0.43
ArgVal: 4.356 ± 0.572
ArgTrp: 1.699 ± 0.267
ArgTyr: 1.873 ± 0.282
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.055 ± 0.667
SerCys: 0.523 ± 0.138
SerAsp: 2.831 ± 0.346
SerGlu: 2.657 ± 0.472
SerPhe: 1.089 ± 0.215
SerGly: 5.488 ± 0.67
SerHis: 0.61 ± 0.156
SerIle: 2.396 ± 0.298
SerLys: 1.35 ± 0.279
SerLeu: 4.443 ± 0.354
SerMet: 1.22 ± 0.208
SerAsn: 1.612 ± 0.22
SerPro: 3.136 ± 0.439
SerGln: 2.134 ± 0.339
SerArg: 3.528 ± 0.439
SerSer: 1.873 ± 0.376
SerThr: 2.788 ± 0.308
SerVal: 3.398 ± 0.393
SerTrp: 1.133 ± 0.201
SerTyr: 1.481 ± 0.247
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 9.408 ± 0.743
ThrCys: 0.915 ± 0.271
ThrAsp: 4.661 ± 0.446
ThrGlu: 4.312 ± 0.426
ThrPhe: 1.612 ± 0.242
ThrGly: 8.015 ± 0.579
ThrHis: 1.176 ± 0.254
ThrIle: 3.485 ± 0.376
ThrLys: 2.309 ± 0.292
ThrLeu: 5.009 ± 0.439
ThrMet: 0.915 ± 0.2
ThrAsn: 1.35 ± 0.273
ThrPro: 4.138 ± 0.481
ThrGln: 1.829 ± 0.225
ThrArg: 4.704 ± 0.46
ThrSer: 3.877 ± 0.42
ThrThr: 4.312 ± 0.586
ThrVal: 5.837 ± 0.466
ThrTrp: 1.612 ± 0.268
ThrTyr: 1.22 ± 0.255
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.755 ± 0.591
ValCys: 0.74 ± 0.219
ValAsp: 5.27 ± 0.533
ValGlu: 4.007 ± 0.336
ValPhe: 1.394 ± 0.236
ValGly: 7.274 ± 0.555
ValHis: 1.263 ± 0.227
ValIle: 4.094 ± 0.342
ValLys: 2.57 ± 0.316
ValLeu: 6.229 ± 0.523
ValMet: 1.394 ± 0.216
ValAsn: 2.265 ± 0.325
ValPro: 4.661 ± 0.375
ValGln: 2.091 ± 0.335
ValArg: 5.096 ± 0.531
ValSer: 3.528 ± 0.473
ValThr: 5.663 ± 0.451
ValVal: 5.924 ± 0.527
ValTrp: 1.176 ± 0.246
ValTyr: 1.699 ± 0.281
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.788 ± 0.469
TrpCys: 0.218 ± 0.108
TrpAsp: 1.394 ± 0.24
TrpGlu: 0.74 ± 0.173
TrpPhe: 0.74 ± 0.249
TrpGly: 0.784 ± 0.173
TrpHis: 0.523 ± 0.169
TrpIle: 0.74 ± 0.174
TrpLys: 0.697 ± 0.17
TrpLeu: 1.612 ± 0.246
TrpMet: 0.479 ± 0.134
TrpAsn: 0.523 ± 0.137
TrpPro: 1.176 ± 0.232
TrpGln: 0.653 ± 0.155
TrpArg: 1.22 ± 0.232
TrpSer: 1.263 ± 0.204
TrpThr: 1.525 ± 0.311
TrpVal: 1.655 ± 0.315
TrpTrp: 0.61 ± 0.169
TrpTyr: 0.348 ± 0.131
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.701 ± 0.303
TyrCys: 0.087 ± 0.058
TyrAsp: 1.612 ± 0.257
TyrGlu: 1.612 ± 0.3
TyrPhe: 0.523 ± 0.136
TyrGly: 1.612 ± 0.306
TyrHis: 0.305 ± 0.146
TyrIle: 0.479 ± 0.182
TyrLys: 0.61 ± 0.171
TyrLeu: 1.917 ± 0.289
TyrMet: 0.218 ± 0.109
TyrAsn: 0.828 ± 0.229
TyrPro: 0.958 ± 0.256
TyrGln: 0.915 ± 0.228
TyrArg: 2.396 ± 0.336
TyrSer: 0.566 ± 0.144
TyrThr: 1.263 ± 0.259
TyrVal: 2.047 ± 0.326
TyrTrp: 0.436 ± 0.132
TyrTyr: 0.566 ± 0.143
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 99 proteins (22959 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
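A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration, not the database's actual procedure: it assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each resample, and that the reported error is the standard deviation across resamples (the page does not state which spread measure is used). The function names, the seed, and the toy sequences are hypothetical.

```python
import random
import statistics

def count_pair(seq, pair):
    """Count overlapping occurrences of a two-letter dipeptide in one protein."""
    return sum(1 for i in range(len(seq) - 1) if seq[i:i + 2] == pair)

def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of the per mille frequency of `pair`
    over n_boot resamples of whole proteins."""
    rng = random.Random(seed)
    # Per-protein (hits, number of adjacent pairs), computed once.
    stats = [(count_pair(s, pair), max(len(s) - 1, 0)) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        # Resample proteins, not residues, so within-protein structure is kept.
        sample = rng.choices(stats, k=len(stats))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)

# Toy input, not real phage proteins.
toy_proteins = ["AAAA", "GGGG", "AAGG"]
mean_aa, err_aa = bootstrap_error(toy_proteins, "AA")
print(f"AA: {mean_aa:.3f} ± {err_aa:.3f}")
```

Resampling at the protein level rather than the residue level means that proteins with unusual composition contribute to the spread as a unit, which is why the error for a dipeptide concentrated in a few proteins can be comparatively large.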

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski