Amino acid dipepetide frequency for Clostridium phage CDMH1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.385AlaAla: 1.385 ± 0.31
0.819AlaCys: 0.819 ± 0.24
2.204AlaAsp: 2.204 ± 0.417
3.337AlaGlu: 3.337 ± 0.398
1.826AlaPhe: 1.826 ± 0.331
3.274AlaGly: 3.274 ± 0.752
0.441AlaHis: 0.441 ± 0.18
4.345AlaIle: 4.345 ± 0.605
4.66AlaLys: 4.66 ± 0.534
3.967AlaLeu: 3.967 ± 0.501
1.511AlaMet: 1.511 ± 0.266
2.834AlaAsn: 2.834 ± 0.473
1.07AlaPro: 1.07 ± 0.234
1.637AlaGln: 1.637 ± 0.34
1.763AlaArg: 1.763 ± 0.455
3.148AlaSer: 3.148 ± 0.478
3.463AlaThr: 3.463 ± 0.579
2.771AlaVal: 2.771 ± 0.612
0.567AlaTrp: 0.567 ± 0.24
1.448AlaTyr: 1.448 ± 0.315
0.0AlaXaa: 0.0 ± 0.0
Cys
0.315CysAla: 0.315 ± 0.162
0.315CysCys: 0.315 ± 0.134
1.259CysAsp: 1.259 ± 0.294
1.322CysGlu: 1.322 ± 0.262
0.441CysPhe: 0.441 ± 0.199
0.945CysGly: 0.945 ± 0.254
0.189CysHis: 0.189 ± 0.118
1.7CysIle: 1.7 ± 0.463
1.511CysLys: 1.511 ± 0.393
1.007CysLeu: 1.007 ± 0.236
0.378CysMet: 0.378 ± 0.146
0.756CysAsn: 0.756 ± 0.245
0.441CysPro: 0.441 ± 0.148
0.189CysGln: 0.189 ± 0.116
0.945CysArg: 0.945 ± 0.269
0.945CysSer: 0.945 ± 0.222
0.504CysThr: 0.504 ± 0.17
0.504CysVal: 0.504 ± 0.153
0.315CysTrp: 0.315 ± 0.151
0.693CysTyr: 0.693 ± 0.229
0.0CysXaa: 0.0 ± 0.0
Asp
2.834AspAla: 2.834 ± 0.395
0.819AspCys: 0.819 ± 0.275
3.463AspAsp: 3.463 ± 0.537
4.849AspGlu: 4.849 ± 0.585
3.4AspPhe: 3.4 ± 0.453
3.022AspGly: 3.022 ± 0.452
0.252AspHis: 0.252 ± 0.116
6.675AspIle: 6.675 ± 0.6
6.738AspLys: 6.738 ± 0.73
4.723AspLeu: 4.723 ± 0.522
1.322AspMet: 1.322 ± 0.252
3.526AspAsn: 3.526 ± 0.352
0.693AspPro: 0.693 ± 0.193
0.567AspGln: 0.567 ± 0.16
2.519AspArg: 2.519 ± 0.509
2.897AspSer: 2.897 ± 0.416
3.841AspThr: 3.841 ± 0.505
3.904AspVal: 3.904 ± 0.453
0.693AspTrp: 0.693 ± 0.197
2.771AspTyr: 2.771 ± 0.339
0.0AspXaa: 0.0 ± 0.0
Glu
3.904GluAla: 3.904 ± 0.507
1.574GluCys: 1.574 ± 0.386
4.974GluAsp: 4.974 ± 0.639
7.997GluGlu: 7.997 ± 0.936
3.337GluPhe: 3.337 ± 0.429
3.589GluGly: 3.589 ± 0.514
0.882GluHis: 0.882 ± 0.187
7.052GluIle: 7.052 ± 0.767
9.76GluLys: 9.76 ± 0.87
10.138GluLeu: 10.138 ± 1.059
2.267GluMet: 2.267 ± 0.39
5.793GluAsn: 5.793 ± 0.669
1.196GluPro: 1.196 ± 0.325
2.645GluGln: 2.645 ± 0.423
2.96GluArg: 2.96 ± 0.522
3.967GluSer: 3.967 ± 0.411
3.022GluThr: 3.022 ± 0.576
3.778GluVal: 3.778 ± 0.529
0.882GluTrp: 0.882 ± 0.238
4.66GluTyr: 4.66 ± 0.525
0.0GluXaa: 0.0 ± 0.0
Phe
1.763PheAla: 1.763 ± 0.37
0.693PheCys: 0.693 ± 0.215
3.463PheAsp: 3.463 ± 0.38
4.093PheGlu: 4.093 ± 0.588
2.078PhePhe: 2.078 ± 0.395
2.204PheGly: 2.204 ± 0.332
0.567PheHis: 0.567 ± 0.16
4.156PheIle: 4.156 ± 0.631
4.156PheLys: 4.156 ± 0.521
3.841PheLeu: 3.841 ± 0.621
0.945PheMet: 0.945 ± 0.25
3.526PheAsn: 3.526 ± 0.42
0.819PhePro: 0.819 ± 0.303
0.945PheGln: 0.945 ± 0.268
1.448PheArg: 1.448 ± 0.363
2.141PheSer: 2.141 ± 0.47
2.582PheThr: 2.582 ± 0.434
2.078PheVal: 2.078 ± 0.386
0.126PheTrp: 0.126 ± 0.095
1.7PheTyr: 1.7 ± 0.321
0.0PheXaa: 0.0 ± 0.0
Gly
2.897GlyAla: 2.897 ± 0.582
0.693GlyCys: 0.693 ± 0.23
2.33GlyAsp: 2.33 ± 0.516
4.156GlyGlu: 4.156 ± 0.537
2.582GlyPhe: 2.582 ± 0.486
4.723GlyGly: 4.723 ± 1.523
0.756GlyHis: 0.756 ± 0.246
4.912GlyIle: 4.912 ± 0.659
4.219GlyLys: 4.219 ± 0.559
3.778GlyLeu: 3.778 ± 0.7
1.07GlyMet: 1.07 ± 0.301
3.4GlyAsn: 3.4 ± 0.439
0.441GlyPro: 0.441 ± 0.175
1.07GlyGln: 1.07 ± 0.259
1.826GlyArg: 1.826 ± 0.392
3.4GlySer: 3.4 ± 0.512
2.015GlyThr: 2.015 ± 0.413
3.904GlyVal: 3.904 ± 0.551
0.819GlyTrp: 0.819 ± 0.207
2.519GlyTyr: 2.519 ± 0.345
0.0GlyXaa: 0.0 ± 0.0
His
0.378HisAla: 0.378 ± 0.167
0.441HisCys: 0.441 ± 0.154
0.63HisAsp: 0.63 ± 0.177
0.819HisGlu: 0.819 ± 0.216
0.693HisPhe: 0.693 ± 0.17
0.378HisGly: 0.378 ± 0.133
0.189HisHis: 0.189 ± 0.108
0.567HisIle: 0.567 ± 0.249
1.511HisLys: 1.511 ± 0.275
0.882HisLeu: 0.882 ± 0.24
0.504HisMet: 0.504 ± 0.158
0.567HisAsn: 0.567 ± 0.193
0.567HisPro: 0.567 ± 0.23
0.252HisGln: 0.252 ± 0.112
0.315HisArg: 0.315 ± 0.138
0.693HisSer: 0.693 ± 0.172
0.819HisThr: 0.819 ± 0.233
0.693HisVal: 0.693 ± 0.193
0.189HisTrp: 0.189 ± 0.111
0.504HisTyr: 0.504 ± 0.177
0.0HisXaa: 0.0 ± 0.0
Ile
5.226IleAla: 5.226 ± 0.662
1.322IleCys: 1.322 ± 0.326
6.36IleAsp: 6.36 ± 0.621
8.627IleGlu: 8.627 ± 0.882
3.274IlePhe: 3.274 ± 0.521
3.841IleGly: 3.841 ± 0.543
1.259IleHis: 1.259 ± 0.259
6.423IleIle: 6.423 ± 0.82
10.012IleLys: 10.012 ± 0.84
6.675IleLeu: 6.675 ± 0.677
1.889IleMet: 1.889 ± 0.317
6.612IleAsn: 6.612 ± 0.593
2.582IlePro: 2.582 ± 0.402
3.085IleGln: 3.085 ± 0.437
4.156IleArg: 4.156 ± 0.545
6.675IleSer: 6.675 ± 0.691
4.156IleThr: 4.156 ± 0.506
4.282IleVal: 4.282 ± 0.556
0.441IleTrp: 0.441 ± 0.15
3.337IleTyr: 3.337 ± 0.474
0.0IleXaa: 0.0 ± 0.0
Lys
4.345LysAla: 4.345 ± 0.546
1.007LysCys: 1.007 ± 0.255
7.367LysAsp: 7.367 ± 0.726
10.705LysGlu: 10.705 ± 0.966
3.715LysPhe: 3.715 ± 0.504
4.534LysGly: 4.534 ± 0.493
1.259LysHis: 1.259 ± 0.269
8.753LysIle: 8.753 ± 0.974
10.516LysLys: 10.516 ± 0.92
9.067LysLeu: 9.067 ± 0.883
3.211LysMet: 3.211 ± 0.394
8.375LysAsn: 8.375 ± 0.6
1.574LysPro: 1.574 ± 0.391
3.148LysGln: 3.148 ± 0.496
4.345LysArg: 4.345 ± 0.52
6.864LysSer: 6.864 ± 0.683
5.163LysThr: 5.163 ± 0.633
7.367LysVal: 7.367 ± 0.782
1.196LysTrp: 1.196 ± 0.306
5.289LysTyr: 5.289 ± 0.599
0.0LysXaa: 0.0 ± 0.0
Leu
3.463LeuAla: 3.463 ± 0.46
1.007LeuCys: 1.007 ± 0.292
5.667LeuAsp: 5.667 ± 0.496
7.367LeuGlu: 7.367 ± 0.765
3.085LeuPhe: 3.085 ± 0.487
4.597LeuGly: 4.597 ± 0.585
1.133LeuHis: 1.133 ± 0.235
7.808LeuIle: 7.808 ± 0.887
10.39LeuLys: 10.39 ± 0.782
6.864LeuLeu: 6.864 ± 0.815
1.763LeuMet: 1.763 ± 0.317
7.367LeuAsn: 7.367 ± 0.785
2.204LeuPro: 2.204 ± 0.414
2.582LeuGln: 2.582 ± 0.335
3.967LeuArg: 3.967 ± 0.511
4.282LeuSer: 4.282 ± 0.48
4.282LeuThr: 4.282 ± 0.576
3.967LeuVal: 3.967 ± 0.544
0.441LeuTrp: 0.441 ± 0.152
3.463LeuTyr: 3.463 ± 0.444
0.0LeuXaa: 0.0 ± 0.0
Met
1.763MetAla: 1.763 ± 0.3
0.252MetCys: 0.252 ± 0.127
1.511MetAsp: 1.511 ± 0.372
2.141MetGlu: 2.141 ± 0.405
0.693MetPhe: 0.693 ± 0.206
0.693MetGly: 0.693 ± 0.269
0.189MetHis: 0.189 ± 0.102
1.826MetIle: 1.826 ± 0.363
2.897MetLys: 2.897 ± 0.459
2.393MetLeu: 2.393 ± 0.39
0.441MetMet: 0.441 ± 0.155
1.889MetAsn: 1.889 ± 0.388
0.315MetPro: 0.315 ± 0.146
0.441MetGln: 0.441 ± 0.177
0.819MetArg: 0.819 ± 0.24
1.889MetSer: 1.889 ± 0.331
1.637MetThr: 1.637 ± 0.278
0.63MetVal: 0.63 ± 0.179
0.189MetTrp: 0.189 ± 0.101
0.882MetTyr: 0.882 ± 0.226
0.0MetXaa: 0.0 ± 0.0
Asn
3.841AsnAla: 3.841 ± 0.63
0.945AsnCys: 0.945 ± 0.236
3.463AsnAsp: 3.463 ± 0.596
5.289AsnGlu: 5.289 ± 0.559
3.085AsnPhe: 3.085 ± 0.41
3.337AsnGly: 3.337 ± 0.431
0.819AsnHis: 0.819 ± 0.245
7.241AsnIle: 7.241 ± 0.733
8.627AsnLys: 8.627 ± 0.935
5.667AsnLeu: 5.667 ± 0.514
1.889AsnMet: 1.889 ± 0.361
6.36AsnAsn: 6.36 ± 0.792
2.015AsnPro: 2.015 ± 0.37
1.826AsnGln: 1.826 ± 0.323
2.519AsnArg: 2.519 ± 0.361
5.037AsnSer: 5.037 ± 0.618
3.841AsnThr: 3.841 ± 0.418
3.337AsnVal: 3.337 ± 0.431
0.693AsnTrp: 0.693 ± 0.25
2.645AsnTyr: 2.645 ± 0.407
0.0AsnXaa: 0.0 ± 0.0
Pro
0.819ProAla: 0.819 ± 0.201
0.441ProCys: 0.441 ± 0.174
0.882ProAsp: 0.882 ± 0.274
1.07ProGlu: 1.07 ± 0.237
0.819ProPhe: 0.819 ± 0.254
1.133ProGly: 1.133 ± 0.301
0.441ProHis: 0.441 ± 0.168
2.33ProIle: 2.33 ± 0.423
2.015ProLys: 2.015 ± 0.399
1.448ProLeu: 1.448 ± 0.305
0.252ProMet: 0.252 ± 0.104
1.448ProAsn: 1.448 ± 0.223
0.504ProPro: 0.504 ± 0.156
0.693ProGln: 0.693 ± 0.181
0.441ProArg: 0.441 ± 0.166
1.574ProSer: 1.574 ± 0.304
1.7ProThr: 1.7 ± 0.335
1.196ProVal: 1.196 ± 0.284
0.252ProTrp: 0.252 ± 0.151
0.504ProTyr: 0.504 ± 0.212
0.0ProXaa: 0.0 ± 0.0
Gln
1.511GlnAla: 1.511 ± 0.291
0.441GlnCys: 0.441 ± 0.168
1.574GlnAsp: 1.574 ± 0.358
2.456GlnGlu: 2.456 ± 0.492
1.133GlnPhe: 1.133 ± 0.268
1.574GlnGly: 1.574 ± 0.358
0.378GlnHis: 0.378 ± 0.156
2.771GlnIle: 2.771 ± 0.499
2.582GlnLys: 2.582 ± 0.459
2.393GlnLeu: 2.393 ± 0.411
0.882GlnMet: 0.882 ± 0.218
2.267GlnAsn: 2.267 ± 0.349
0.189GlnPro: 0.189 ± 0.144
1.007GlnGln: 1.007 ± 0.305
1.007GlnArg: 1.007 ± 0.267
1.385GlnSer: 1.385 ± 0.316
1.889GlnThr: 1.889 ± 0.282
1.322GlnVal: 1.322 ± 0.264
0.126GlnTrp: 0.126 ± 0.09
1.007GlnTyr: 1.007 ± 0.241
0.0GlnXaa: 0.0 ± 0.0
Arg
1.448ArgAla: 1.448 ± 0.282
0.756ArgCys: 0.756 ± 0.221
2.141ArgAsp: 2.141 ± 0.444
3.715ArgGlu: 3.715 ± 0.555
1.763ArgPhe: 1.763 ± 0.324
1.889ArgGly: 1.889 ± 0.294
0.252ArgHis: 0.252 ± 0.103
3.904ArgIle: 3.904 ± 0.571
3.967ArgLys: 3.967 ± 0.538
3.589ArgLeu: 3.589 ± 0.448
1.07ArgMet: 1.07 ± 0.277
1.826ArgAsn: 1.826 ± 0.388
0.819ArgPro: 0.819 ± 0.255
1.385ArgGln: 1.385 ± 0.247
1.7ArgArg: 1.7 ± 0.334
1.574ArgSer: 1.574 ± 0.348
2.015ArgThr: 2.015 ± 0.369
2.33ArgVal: 2.33 ± 0.336
0.504ArgTrp: 0.504 ± 0.163
1.7ArgTyr: 1.7 ± 0.281
0.0ArgXaa: 0.0 ± 0.0
Ser
2.897SerAla: 2.897 ± 0.503
0.756SerCys: 0.756 ± 0.219
3.211SerAsp: 3.211 ± 0.42
3.904SerGlu: 3.904 ± 0.492
3.463SerPhe: 3.463 ± 0.561
2.519SerGly: 2.519 ± 0.42
0.441SerHis: 0.441 ± 0.157
5.73SerIle: 5.73 ± 0.618
8.564SerLys: 8.564 ± 0.612
4.597SerLeu: 4.597 ± 0.498
1.259SerMet: 1.259 ± 0.298
5.1SerAsn: 5.1 ± 0.611
0.693SerPro: 0.693 ± 0.216
2.015SerGln: 2.015 ± 0.414
1.952SerArg: 1.952 ± 0.375
3.904SerSer: 3.904 ± 0.755
3.022SerThr: 3.022 ± 0.422
3.463SerVal: 3.463 ± 0.475
0.756SerTrp: 0.756 ± 0.184
2.897SerTyr: 2.897 ± 0.371
0.0SerXaa: 0.0 ± 0.0
Thr
2.771ThrAla: 2.771 ± 0.571
0.567ThrCys: 0.567 ± 0.248
2.897ThrAsp: 2.897 ± 0.435
3.715ThrGlu: 3.715 ± 0.516
3.085ThrPhe: 3.085 ± 0.386
3.904ThrGly: 3.904 ± 0.431
1.007ThrHis: 1.007 ± 0.257
5.226ThrIle: 5.226 ± 0.636
4.723ThrLys: 4.723 ± 0.592
4.723ThrLeu: 4.723 ± 0.602
0.882ThrMet: 0.882 ± 0.208
3.148ThrAsn: 3.148 ± 0.414
1.637ThrPro: 1.637 ± 0.352
1.574ThrGln: 1.574 ± 0.283
1.826ThrArg: 1.826 ± 0.295
3.085ThrSer: 3.085 ± 0.481
3.715ThrThr: 3.715 ± 0.753
3.022ThrVal: 3.022 ± 0.449
0.378ThrTrp: 0.378 ± 0.139
2.015ThrTyr: 2.015 ± 0.424
0.0ThrXaa: 0.0 ± 0.0
Val
2.645ValAla: 2.645 ± 0.398
0.945ValCys: 0.945 ± 0.212
3.211ValAsp: 3.211 ± 0.528
4.66ValGlu: 4.66 ± 0.541
2.519ValPhe: 2.519 ± 0.409
3.211ValGly: 3.211 ± 0.49
0.819ValHis: 0.819 ± 0.235
3.841ValIle: 3.841 ± 0.541
4.974ValLys: 4.974 ± 0.484
4.849ValLeu: 4.849 ± 0.525
0.945ValMet: 0.945 ± 0.232
4.471ValAsn: 4.471 ± 0.675
1.259ValPro: 1.259 ± 0.293
1.322ValGln: 1.322 ± 0.271
1.7ValArg: 1.7 ± 0.435
4.093ValSer: 4.093 ± 0.571
2.897ValThr: 2.897 ± 0.499
3.274ValVal: 3.274 ± 0.472
0.63ValTrp: 0.63 ± 0.19
2.393ValTyr: 2.393 ± 0.348
0.0ValXaa: 0.0 ± 0.0
Trp
0.378TrpAla: 0.378 ± 0.136
0.252TrpCys: 0.252 ± 0.119
0.252TrpAsp: 0.252 ± 0.11
1.07TrpGlu: 1.07 ± 0.263
0.378TrpPhe: 0.378 ± 0.144
0.567TrpGly: 0.567 ± 0.193
0.0TrpHis: 0.0 ± 0.0
0.63TrpIle: 0.63 ± 0.212
0.882TrpLys: 0.882 ± 0.324
1.322TrpLeu: 1.322 ± 0.32
0.252TrpMet: 0.252 ± 0.125
0.63TrpAsn: 0.63 ± 0.199
0.063TrpPro: 0.063 ± 0.058
0.252TrpGln: 0.252 ± 0.141
0.441TrpArg: 0.441 ± 0.184
0.63TrpSer: 0.63 ± 0.231
0.504TrpThr: 0.504 ± 0.211
0.819TrpVal: 0.819 ± 0.234
0.0TrpTrp: 0.0 ± 0.0
0.441TrpTyr: 0.441 ± 0.236
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.889TyrAla: 1.889 ± 0.364
0.63TyrCys: 0.63 ± 0.193
2.645TyrAsp: 2.645 ± 0.398
3.085TyrGlu: 3.085 ± 0.548
1.952TyrPhe: 1.952 ± 0.355
1.574TyrGly: 1.574 ± 0.285
0.315TyrHis: 0.315 ± 0.122
4.345TyrIle: 4.345 ± 0.531
5.1TyrLys: 5.1 ± 0.592
3.778TyrLeu: 3.778 ± 0.409
0.693TyrMet: 0.693 ± 0.214
2.645TyrAsn: 2.645 ± 0.387
0.882TyrPro: 0.882 ± 0.219
1.259TyrGln: 1.259 ± 0.246
1.7TyrArg: 1.7 ± 0.313
2.96TyrSer: 2.96 ± 0.413
2.834TyrThr: 2.834 ± 0.473
2.015TyrVal: 2.015 ± 0.36
0.567TyrTrp: 0.567 ± 0.17
2.33TyrTyr: 2.33 ± 0.537
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 84 proteins (15882 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski