Amino acid dipepetide frequency for Mycobacterium phage Pollywog

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
13.699AlaAla: 13.699 ± 1.731
1.456AlaCys: 1.456 ± 0.319
6.58AlaAsp: 6.58 ± 0.557
7.55AlaGlu: 7.55 ± 0.843
2.751AlaPhe: 2.751 ± 0.384
9.492AlaGly: 9.492 ± 1.222
1.942AlaHis: 1.942 ± 0.346
4.422AlaIle: 4.422 ± 0.554
4.584AlaLys: 4.584 ± 0.442
8.467AlaLeu: 8.467 ± 0.814
2.427AlaMet: 2.427 ± 0.389
3.398AlaAsn: 3.398 ± 0.534
5.016AlaPro: 5.016 ± 0.483
3.128AlaGln: 3.128 ± 0.48
6.849AlaArg: 6.849 ± 0.693
5.825AlaSer: 5.825 ± 0.71
6.094AlaThr: 6.094 ± 0.584
7.496AlaVal: 7.496 ± 0.602
2.535AlaTrp: 2.535 ± 0.419
2.319AlaTyr: 2.319 ± 0.356
0.0AlaXaa: 0.0 ± 0.0
Cys
0.917CysAla: 0.917 ± 0.287
0.108CysCys: 0.108 ± 0.078
0.863CysAsp: 0.863 ± 0.192
0.755CysGlu: 0.755 ± 0.181
0.27CysPhe: 0.27 ± 0.117
1.672CysGly: 1.672 ± 0.406
0.324CysHis: 0.324 ± 0.132
0.431CysIle: 0.431 ± 0.163
0.216CysLys: 0.216 ± 0.114
0.917CysLeu: 0.917 ± 0.266
0.054CysMet: 0.054 ± 0.052
0.431CysAsn: 0.431 ± 0.156
1.24CysPro: 1.24 ± 0.274
0.431CysGln: 0.431 ± 0.167
0.917CysArg: 0.917 ± 0.227
0.755CysSer: 0.755 ± 0.201
0.593CysThr: 0.593 ± 0.22
0.917CysVal: 0.917 ± 0.202
0.216CysTrp: 0.216 ± 0.12
0.378CysTyr: 0.378 ± 0.128
0.0CysXaa: 0.0 ± 0.0
Asp
6.741AspAla: 6.741 ± 0.549
0.863AspCys: 0.863 ± 0.189
4.746AspAsp: 4.746 ± 0.521
3.775AspGlu: 3.775 ± 0.437
1.186AspPhe: 1.186 ± 0.244
5.986AspGly: 5.986 ± 0.519
1.294AspHis: 1.294 ± 0.274
2.643AspIle: 2.643 ± 0.39
1.78AspLys: 1.78 ± 0.279
5.663AspLeu: 5.663 ± 0.563
0.917AspMet: 0.917 ± 0.18
1.51AspAsn: 1.51 ± 0.304
4.153AspPro: 4.153 ± 0.569
3.02AspGln: 3.02 ± 0.394
5.016AspArg: 5.016 ± 0.776
3.452AspSer: 3.452 ± 0.526
3.883AspThr: 3.883 ± 0.373
4.8AspVal: 4.8 ± 0.583
1.456AspTrp: 1.456 ± 0.237
1.78AspTyr: 1.78 ± 0.299
0.0AspXaa: 0.0 ± 0.0
Glu
6.094GluAla: 6.094 ± 0.672
0.917GluCys: 0.917 ± 0.226
3.613GluAsp: 3.613 ± 0.431
2.697GluGlu: 2.697 ± 0.403
2.157GluPhe: 2.157 ± 0.316
2.966GluGly: 2.966 ± 0.456
1.133GluHis: 1.133 ± 0.3
2.211GluIle: 2.211 ± 0.348
1.456GluLys: 1.456 ± 0.285
6.256GluLeu: 6.256 ± 0.756
1.834GluMet: 1.834 ± 0.291
1.834GluAsn: 1.834 ± 0.288
3.613GluPro: 3.613 ± 0.473
3.236GluGln: 3.236 ± 0.356
5.285GluArg: 5.285 ± 0.621
2.966GluSer: 2.966 ± 0.479
4.638GluThr: 4.638 ± 0.477
4.638GluVal: 4.638 ± 0.525
1.186GluTrp: 1.186 ± 0.208
1.564GluTyr: 1.564 ± 0.315
0.0GluXaa: 0.0 ± 0.0
Phe
2.912PheAla: 2.912 ± 0.476
0.324PheCys: 0.324 ± 0.131
2.265PheAsp: 2.265 ± 0.345
1.672PheGlu: 1.672 ± 0.343
0.647PhePhe: 0.647 ± 0.158
2.966PheGly: 2.966 ± 0.461
0.539PheHis: 0.539 ± 0.169
1.456PheIle: 1.456 ± 0.351
1.564PheLys: 1.564 ± 0.334
1.78PheLeu: 1.78 ± 0.307
0.593PheMet: 0.593 ± 0.164
1.025PheAsn: 1.025 ± 0.311
1.402PhePro: 1.402 ± 0.222
1.025PheGln: 1.025 ± 0.312
1.402PheArg: 1.402 ± 0.267
1.294PheSer: 1.294 ± 0.282
2.049PheThr: 2.049 ± 0.256
1.618PheVal: 1.618 ± 0.257
0.593PheTrp: 0.593 ± 0.175
0.647PheTyr: 0.647 ± 0.187
0.0PheXaa: 0.0 ± 0.0
Gly
9.654GlyAla: 9.654 ± 1.173
1.294GlyCys: 1.294 ± 0.247
5.447GlyAsp: 5.447 ± 0.385
4.476GlyGlu: 4.476 ± 0.574
2.535GlyPhe: 2.535 ± 0.363
10.948GlyGly: 10.948 ± 2.131
1.726GlyHis: 1.726 ± 0.273
3.344GlyIle: 3.344 ± 0.485
2.643GlyLys: 2.643 ± 0.356
5.177GlyLeu: 5.177 ± 0.633
2.049GlyMet: 2.049 ± 0.442
3.29GlyAsn: 3.29 ± 0.436
3.991GlyPro: 3.991 ± 0.497
2.427GlyGln: 2.427 ± 0.552
5.07GlyArg: 5.07 ± 0.577
6.256GlySer: 6.256 ± 0.858
6.256GlyThr: 6.256 ± 0.835
6.31GlyVal: 6.31 ± 0.672
2.481GlyTrp: 2.481 ± 0.331
2.211GlyTyr: 2.211 ± 0.388
0.0GlyXaa: 0.0 ± 0.0
His
1.78HisAla: 1.78 ± 0.314
0.378HisCys: 0.378 ± 0.148
1.025HisAsp: 1.025 ± 0.254
1.402HisGlu: 1.402 ± 0.277
0.431HisPhe: 0.431 ± 0.143
1.834HisGly: 1.834 ± 0.283
0.809HisHis: 0.809 ± 0.236
1.348HisIle: 1.348 ± 0.266
0.971HisLys: 0.971 ± 0.199
1.402HisLeu: 1.402 ± 0.273
0.431HisMet: 0.431 ± 0.114
0.971HisAsn: 0.971 ± 0.19
1.186HisPro: 1.186 ± 0.259
0.809HisGln: 0.809 ± 0.203
1.564HisArg: 1.564 ± 0.282
0.755HisSer: 0.755 ± 0.193
1.025HisThr: 1.025 ± 0.255
1.24HisVal: 1.24 ± 0.298
0.539HisTrp: 0.539 ± 0.166
0.971HisTyr: 0.971 ± 0.187
0.0HisXaa: 0.0 ± 0.0
Ile
5.339IleAla: 5.339 ± 0.5
0.593IleCys: 0.593 ± 0.235
3.667IleAsp: 3.667 ± 0.435
2.966IleGlu: 2.966 ± 0.373
0.755IlePhe: 0.755 ± 0.187
4.099IleGly: 4.099 ± 0.508
1.456IleHis: 1.456 ± 0.274
1.186IleIle: 1.186 ± 0.249
1.348IleLys: 1.348 ± 0.305
2.804IleLeu: 2.804 ± 0.359
0.324IleMet: 0.324 ± 0.121
1.995IleAsn: 1.995 ± 0.379
2.427IlePro: 2.427 ± 0.422
1.24IleGln: 1.24 ± 0.255
2.697IleArg: 2.697 ± 0.41
1.942IleSer: 1.942 ± 0.398
3.667IleThr: 3.667 ± 0.584
3.074IleVal: 3.074 ± 0.331
0.809IleTrp: 0.809 ± 0.203
0.647IleTyr: 0.647 ± 0.168
0.0IleXaa: 0.0 ± 0.0
Lys
3.721LysAla: 3.721 ± 0.441
0.485LysCys: 0.485 ± 0.177
1.618LysAsp: 1.618 ± 0.302
1.24LysGlu: 1.24 ± 0.27
0.917LysPhe: 0.917 ± 0.204
2.373LysGly: 2.373 ± 0.387
0.917LysHis: 0.917 ± 0.288
0.917LysIle: 0.917 ± 0.228
1.564LysLys: 1.564 ± 0.271
2.804LysLeu: 2.804 ± 0.501
0.485LysMet: 0.485 ± 0.153
1.133LysAsn: 1.133 ± 0.276
3.128LysPro: 3.128 ± 0.406
1.564LysGln: 1.564 ± 0.235
2.427LysArg: 2.427 ± 0.328
2.211LysSer: 2.211 ± 0.336
2.535LysThr: 2.535 ± 0.44
2.481LysVal: 2.481 ± 0.367
1.079LysTrp: 1.079 ± 0.28
0.701LysTyr: 0.701 ± 0.201
0.0LysXaa: 0.0 ± 0.0
Leu
9.061LeuAla: 9.061 ± 0.803
0.485LeuCys: 0.485 ± 0.142
4.908LeuAsp: 4.908 ± 0.572
4.153LeuGlu: 4.153 ± 0.51
1.672LeuPhe: 1.672 ± 0.252
5.447LeuGly: 5.447 ± 0.585
0.917LeuHis: 0.917 ± 0.211
3.02LeuIle: 3.02 ± 0.332
2.319LeuLys: 2.319 ± 0.407
5.285LeuLeu: 5.285 ± 0.639
1.456LeuMet: 1.456 ± 0.287
2.751LeuAsn: 2.751 ± 0.397
5.393LeuPro: 5.393 ± 0.608
2.643LeuGln: 2.643 ± 0.451
6.04LeuArg: 6.04 ± 0.701
4.8LeuSer: 4.8 ± 0.428
5.663LeuThr: 5.663 ± 0.503
5.717LeuVal: 5.717 ± 0.644
0.863LeuTrp: 0.863 ± 0.175
2.427LeuTyr: 2.427 ± 0.391
0.0LeuXaa: 0.0 ± 0.0
Met
1.51MetAla: 1.51 ± 0.275
0.216MetCys: 0.216 ± 0.2
1.186MetAsp: 1.186 ± 0.256
0.647MetGlu: 0.647 ± 0.183
0.647MetPhe: 0.647 ± 0.218
1.618MetGly: 1.618 ± 0.278
0.216MetHis: 0.216 ± 0.086
0.917MetIle: 0.917 ± 0.215
0.755MetLys: 0.755 ± 0.2
1.456MetLeu: 1.456 ± 0.233
0.593MetMet: 0.593 ± 0.207
1.079MetAsn: 1.079 ± 0.224
1.618MetPro: 1.618 ± 0.31
0.485MetGln: 0.485 ± 0.16
1.51MetArg: 1.51 ± 0.281
2.481MetSer: 2.481 ± 0.388
1.78MetThr: 1.78 ± 0.262
1.672MetVal: 1.672 ± 0.33
0.485MetTrp: 0.485 ± 0.159
0.539MetTyr: 0.539 ± 0.163
0.0MetXaa: 0.0 ± 0.0
Asn
3.29AsnAla: 3.29 ± 0.407
0.378AsnCys: 0.378 ± 0.145
1.186AsnAsp: 1.186 ± 0.222
2.211AsnGlu: 2.211 ± 0.318
0.539AsnPhe: 0.539 ± 0.205
4.53AsnGly: 4.53 ± 0.544
0.755AsnHis: 0.755 ± 0.173
1.24AsnIle: 1.24 ± 0.273
0.863AsnLys: 0.863 ± 0.238
2.966AsnLeu: 2.966 ± 0.345
0.863AsnMet: 0.863 ± 0.181
1.618AsnAsn: 1.618 ± 0.388
2.535AsnPro: 2.535 ± 0.409
1.186AsnGln: 1.186 ± 0.362
2.319AsnArg: 2.319 ± 0.326
1.402AsnSer: 1.402 ± 0.289
2.319AsnThr: 2.319 ± 0.33
1.942AsnVal: 1.942 ± 0.324
0.485AsnTrp: 0.485 ± 0.157
0.593AsnTyr: 0.593 ± 0.15
0.0AsnXaa: 0.0 ± 0.0
Pro
6.202ProAla: 6.202 ± 0.62
0.701ProCys: 0.701 ± 0.179
4.476ProAsp: 4.476 ± 0.438
5.016ProGlu: 5.016 ± 0.571
1.726ProPhe: 1.726 ± 0.314
6.256ProGly: 6.256 ± 0.731
1.726ProHis: 1.726 ± 0.266
2.265ProIle: 2.265 ± 0.317
1.995ProLys: 1.995 ± 0.281
4.368ProLeu: 4.368 ± 0.56
1.618ProMet: 1.618 ± 0.286
1.78ProAsn: 1.78 ± 0.249
3.613ProPro: 3.613 ± 0.508
2.481ProGln: 2.481 ± 0.371
2.966ProArg: 2.966 ± 0.508
3.344ProSer: 3.344 ± 0.418
3.398ProThr: 3.398 ± 0.532
4.207ProVal: 4.207 ± 0.43
1.186ProTrp: 1.186 ± 0.218
1.995ProTyr: 1.995 ± 0.284
0.0ProXaa: 0.0 ± 0.0
Gln
4.207GlnAla: 4.207 ± 0.515
0.324GlnCys: 0.324 ± 0.146
1.672GlnAsp: 1.672 ± 0.265
1.726GlnGlu: 1.726 ± 0.288
1.079GlnPhe: 1.079 ± 0.256
2.481GlnGly: 2.481 ± 0.457
0.863GlnHis: 0.863 ± 0.203
1.672GlnIle: 1.672 ± 0.327
1.456GlnLys: 1.456 ± 0.251
2.966GlnLeu: 2.966 ± 0.476
0.971GlnMet: 0.971 ± 0.209
1.133GlnAsn: 1.133 ± 0.254
2.751GlnPro: 2.751 ± 0.521
1.726GlnGln: 1.726 ± 0.445
2.643GlnArg: 2.643 ± 0.375
2.481GlnSer: 2.481 ± 0.335
1.51GlnThr: 1.51 ± 0.329
2.535GlnVal: 2.535 ± 0.344
0.755GlnTrp: 0.755 ± 0.188
1.186GlnTyr: 1.186 ± 0.297
0.0GlnXaa: 0.0 ± 0.0
Arg
6.688ArgAla: 6.688 ± 0.655
1.348ArgCys: 1.348 ± 0.368
4.854ArgAsp: 4.854 ± 0.563
5.447ArgGlu: 5.447 ± 0.714
2.211ArgPhe: 2.211 ± 0.336
3.559ArgGly: 3.559 ± 0.405
1.456ArgHis: 1.456 ± 0.346
4.153ArgIle: 4.153 ± 0.488
2.373ArgLys: 2.373 ± 0.295
5.393ArgLeu: 5.393 ± 0.569
2.427ArgMet: 2.427 ± 0.387
1.995ArgAsn: 1.995 ± 0.377
3.721ArgPro: 3.721 ± 0.49
2.265ArgGln: 2.265 ± 0.367
5.663ArgArg: 5.663 ± 0.783
4.045ArgSer: 4.045 ± 0.42
3.721ArgThr: 3.721 ± 0.537
4.53ArgVal: 4.53 ± 0.546
1.78ArgTrp: 1.78 ± 0.306
2.103ArgTyr: 2.103 ± 0.319
0.0ArgXaa: 0.0 ± 0.0
Ser
5.932SerAla: 5.932 ± 0.702
0.27SerCys: 0.27 ± 0.146
4.207SerAsp: 4.207 ± 0.491
3.182SerGlu: 3.182 ± 0.396
2.049SerPhe: 2.049 ± 0.351
6.202SerGly: 6.202 ± 0.831
0.809SerHis: 0.809 ± 0.197
2.966SerIle: 2.966 ± 0.354
1.888SerLys: 1.888 ± 0.335
3.937SerLeu: 3.937 ± 0.423
1.294SerMet: 1.294 ± 0.255
1.942SerAsn: 1.942 ± 0.345
3.883SerPro: 3.883 ± 0.464
1.995SerGln: 1.995 ± 0.28
3.721SerArg: 3.721 ± 0.457
4.207SerSer: 4.207 ± 0.533
3.883SerThr: 3.883 ± 0.51
3.991SerVal: 3.991 ± 0.522
1.186SerTrp: 1.186 ± 0.273
1.78SerTyr: 1.78 ± 0.271
0.0SerXaa: 0.0 ± 0.0
Thr
6.04ThrAla: 6.04 ± 0.661
0.539ThrCys: 0.539 ± 0.208
3.775ThrAsp: 3.775 ± 0.494
4.422ThrGlu: 4.422 ± 0.489
1.888ThrPhe: 1.888 ± 0.366
6.526ThrGly: 6.526 ± 0.844
1.51ThrHis: 1.51 ± 0.238
3.775ThrIle: 3.775 ± 0.6
1.942ThrLys: 1.942 ± 0.332
4.099ThrLeu: 4.099 ± 0.482
0.863ThrMet: 0.863 ± 0.214
1.888ThrAsn: 1.888 ± 0.369
4.8ThrPro: 4.8 ± 0.56
1.888ThrGln: 1.888 ± 0.288
4.153ThrArg: 4.153 ± 0.499
3.398ThrSer: 3.398 ± 0.474
4.261ThrThr: 4.261 ± 0.663
6.849ThrVal: 6.849 ± 0.69
1.079ThrTrp: 1.079 ± 0.28
1.888ThrTyr: 1.888 ± 0.276
0.0ThrXaa: 0.0 ± 0.0
Val
8.09ValAla: 8.09 ± 0.684
1.133ValCys: 1.133 ± 0.291
5.177ValAsp: 5.177 ± 0.599
4.53ValGlu: 4.53 ± 0.509
2.589ValPhe: 2.589 ± 0.408
5.339ValGly: 5.339 ± 0.573
1.456ValHis: 1.456 ± 0.258
3.398ValIle: 3.398 ± 0.44
2.858ValLys: 2.858 ± 0.361
5.555ValLeu: 5.555 ± 0.654
0.917ValMet: 0.917 ± 0.215
2.049ValAsn: 2.049 ± 0.312
4.207ValPro: 4.207 ± 0.457
2.373ValGln: 2.373 ± 0.307
5.285ValArg: 5.285 ± 0.689
4.638ValSer: 4.638 ± 0.532
5.124ValThr: 5.124 ± 0.571
5.771ValVal: 5.771 ± 0.73
1.672ValTrp: 1.672 ± 0.262
1.348ValTyr: 1.348 ± 0.298
0.0ValXaa: 0.0 ± 0.0
Trp
1.888TrpAla: 1.888 ± 0.318
0.431TrpCys: 0.431 ± 0.14
1.294TrpAsp: 1.294 ± 0.296
0.863TrpGlu: 0.863 ± 0.194
0.863TrpPhe: 0.863 ± 0.245
1.24TrpGly: 1.24 ± 0.247
0.539TrpHis: 0.539 ± 0.171
0.809TrpIle: 0.809 ± 0.221
0.809TrpLys: 0.809 ± 0.174
1.726TrpLeu: 1.726 ± 0.303
0.863TrpMet: 0.863 ± 0.216
0.378TrpAsn: 0.378 ± 0.132
1.079TrpPro: 1.079 ± 0.216
1.294TrpGln: 1.294 ± 0.245
1.834TrpArg: 1.834 ± 0.326
1.456TrpSer: 1.456 ± 0.262
1.402TrpThr: 1.402 ± 0.328
1.618TrpVal: 1.618 ± 0.339
0.809TrpTrp: 0.809 ± 0.216
0.485TrpTyr: 0.485 ± 0.169
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.265TyrAla: 2.265 ± 0.373
0.162TyrCys: 0.162 ± 0.096
2.103TyrAsp: 2.103 ± 0.377
1.564TyrGlu: 1.564 ± 0.266
0.863TyrPhe: 0.863 ± 0.208
2.049TyrGly: 2.049 ± 0.405
0.485TyrHis: 0.485 ± 0.123
0.863TyrIle: 0.863 ± 0.185
0.917TyrLys: 0.917 ± 0.243
1.995TyrLeu: 1.995 ± 0.282
0.378TyrMet: 0.378 ± 0.145
1.025TyrAsn: 1.025 ± 0.24
1.564TyrPro: 1.564 ± 0.263
0.917TyrGln: 0.917 ± 0.225
2.373TyrArg: 2.373 ± 0.37
1.618TyrSer: 1.618 ± 0.278
1.78TyrThr: 1.78 ± 0.256
2.103TyrVal: 2.103 ± 0.346
0.539TyrTrp: 0.539 ± 0.183
0.539TyrTyr: 0.539 ± 0.127
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 105 proteins (18543 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski