Amino acid dipepetide frequency for Microbacterium phage TinyTimothy

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.121AlaAla: 10.121 ± 0.866
0.471AlaCys: 0.471 ± 0.235
5.061AlaAsp: 5.061 ± 0.548
6.296AlaGlu: 6.296 ± 0.648
3.178AlaPhe: 3.178 ± 0.419
6.473AlaGly: 6.473 ± 0.748
1.471AlaHis: 1.471 ± 0.231
4.413AlaIle: 4.413 ± 0.477
3.413AlaLys: 3.413 ± 0.87
8.415AlaLeu: 8.415 ± 0.65
2.471AlaMet: 2.471 ± 0.54
4.766AlaAsn: 4.766 ± 0.771
4.649AlaPro: 4.649 ± 0.839
3.472AlaGln: 3.472 ± 0.428
6.708AlaArg: 6.708 ± 0.787
5.884AlaSer: 5.884 ± 0.59
5.59AlaThr: 5.59 ± 0.473
5.178AlaVal: 5.178 ± 0.571
1.53AlaTrp: 1.53 ± 0.264
2.883AlaTyr: 2.883 ± 0.468
0.0AlaXaa: 0.0 ± 0.0
Cys
0.294CysAla: 0.294 ± 0.184
0.0CysCys: 0.0 ± 0.0
0.118CysAsp: 0.118 ± 0.089
0.353CysGlu: 0.353 ± 0.207
0.118CysPhe: 0.118 ± 0.077
0.471CysGly: 0.471 ± 0.221
0.177CysHis: 0.177 ± 0.148
0.294CysIle: 0.294 ± 0.142
0.177CysLys: 0.177 ± 0.112
0.235CysLeu: 0.235 ± 0.141
0.235CysMet: 0.235 ± 0.135
0.177CysAsn: 0.177 ± 0.119
0.412CysPro: 0.412 ± 0.237
0.118CysGln: 0.118 ± 0.108
0.059CysArg: 0.059 ± 0.061
0.235CysSer: 0.235 ± 0.131
0.353CysThr: 0.353 ± 0.172
0.294CysVal: 0.294 ± 0.181
0.059CysTrp: 0.059 ± 0.063
0.177CysTyr: 0.177 ± 0.108
0.0CysXaa: 0.0 ± 0.0
Asp
6.944AspAla: 6.944 ± 0.715
0.412AspCys: 0.412 ± 0.192
4.531AspAsp: 4.531 ± 0.56
4.766AspGlu: 4.766 ± 0.471
2.001AspPhe: 2.001 ± 0.353
5.649AspGly: 5.649 ± 0.514
0.883AspHis: 0.883 ± 0.226
4.001AspIle: 4.001 ± 0.472
3.001AspLys: 3.001 ± 0.603
5.531AspLeu: 5.531 ± 0.67
1.706AspMet: 1.706 ± 0.314
2.471AspAsn: 2.471 ± 0.422
2.766AspPro: 2.766 ± 0.47
2.413AspGln: 2.413 ± 0.396
3.413AspArg: 3.413 ± 0.534
3.648AspSer: 3.648 ± 0.348
3.766AspThr: 3.766 ± 0.463
3.707AspVal: 3.707 ± 0.549
1.353AspTrp: 1.353 ± 0.258
2.295AspTyr: 2.295 ± 0.404
0.0AspXaa: 0.0 ± 0.0
Glu
5.826GluAla: 5.826 ± 0.601
0.118GluCys: 0.118 ± 0.103
5.237GluAsp: 5.237 ± 0.804
4.119GluGlu: 4.119 ± 0.738
2.53GluPhe: 2.53 ± 0.465
4.649GluGly: 4.649 ± 0.447
1.118GluHis: 1.118 ± 0.295
3.354GluIle: 3.354 ± 0.598
3.001GluLys: 3.001 ± 0.539
6.002GluLeu: 6.002 ± 0.785
1.177GluMet: 1.177 ± 0.279
3.354GluAsn: 3.354 ± 0.447
2.53GluPro: 2.53 ± 0.517
3.119GluGln: 3.119 ± 0.477
4.413GluArg: 4.413 ± 0.659
4.119GluSer: 4.119 ± 0.476
4.119GluThr: 4.119 ± 0.634
4.708GluVal: 4.708 ± 0.657
1.648GluTrp: 1.648 ± 0.383
2.589GluTyr: 2.589 ± 0.341
0.0GluXaa: 0.0 ± 0.0
Phe
2.413PheAla: 2.413 ± 0.436
0.412PheCys: 0.412 ± 0.189
2.236PheAsp: 2.236 ± 0.439
2.942PheGlu: 2.942 ± 0.526
1.0PhePhe: 1.0 ± 0.219
2.648PheGly: 2.648 ± 0.314
0.353PheHis: 0.353 ± 0.148
1.353PheIle: 1.353 ± 0.234
1.589PheLys: 1.589 ± 0.256
2.53PheLeu: 2.53 ± 0.333
0.824PheMet: 0.824 ± 0.243
1.412PheAsn: 1.412 ± 0.286
1.706PhePro: 1.706 ± 0.31
1.53PheGln: 1.53 ± 0.45
2.06PheArg: 2.06 ± 0.474
2.236PheSer: 2.236 ± 0.347
2.118PheThr: 2.118 ± 0.45
2.354PheVal: 2.354 ± 0.41
0.412PheTrp: 0.412 ± 0.145
0.883PheTyr: 0.883 ± 0.188
0.0PheXaa: 0.0 ± 0.0
Gly
6.708GlyAla: 6.708 ± 0.725
0.118GlyCys: 0.118 ± 0.098
4.884GlyAsp: 4.884 ± 0.385
4.766GlyGlu: 4.766 ± 0.495
2.53GlyPhe: 2.53 ± 0.283
6.708GlyGly: 6.708 ± 0.885
0.883GlyHis: 0.883 ± 0.245
4.472GlyIle: 4.472 ± 0.455
3.001GlyLys: 3.001 ± 0.537
5.649GlyLeu: 5.649 ± 0.518
2.53GlyMet: 2.53 ± 0.357
3.06GlyAsn: 3.06 ± 0.464
1.706GlyPro: 1.706 ± 0.283
3.178GlyGln: 3.178 ± 0.344
4.59GlyArg: 4.59 ± 0.638
5.355GlySer: 5.355 ± 0.581
5.884GlyThr: 5.884 ± 0.602
4.884GlyVal: 4.884 ± 0.478
1.177GlyTrp: 1.177 ± 0.238
2.942GlyTyr: 2.942 ± 0.355
0.0GlyXaa: 0.0 ± 0.0
His
0.765HisAla: 0.765 ± 0.221
0.235HisCys: 0.235 ± 0.148
1.118HisAsp: 1.118 ± 0.277
1.118HisGlu: 1.118 ± 0.287
0.588HisPhe: 0.588 ± 0.193
1.236HisGly: 1.236 ± 0.293
0.353HisHis: 0.353 ± 0.163
0.53HisIle: 0.53 ± 0.163
1.177HisLys: 1.177 ± 0.453
1.118HisLeu: 1.118 ± 0.267
0.294HisMet: 0.294 ± 0.101
0.647HisAsn: 0.647 ± 0.148
1.118HisPro: 1.118 ± 0.211
0.588HisGln: 0.588 ± 0.153
1.177HisArg: 1.177 ± 0.23
0.765HisSer: 0.765 ± 0.206
0.588HisThr: 0.588 ± 0.27
1.236HisVal: 1.236 ± 0.262
0.235HisTrp: 0.235 ± 0.096
0.353HisTyr: 0.353 ± 0.149
0.0HisXaa: 0.0 ± 0.0
Ile
4.296IleAla: 4.296 ± 0.474
0.177IleCys: 0.177 ± 0.128
3.825IleAsp: 3.825 ± 0.61
4.119IleGlu: 4.119 ± 0.538
1.059IlePhe: 1.059 ± 0.261
2.825IleGly: 2.825 ± 0.705
0.824IleHis: 0.824 ± 0.245
2.53IleIle: 2.53 ± 0.519
2.413IleLys: 2.413 ± 0.451
4.354IleLeu: 4.354 ± 0.428
0.942IleMet: 0.942 ± 0.217
2.413IleAsn: 2.413 ± 0.501
2.06IlePro: 2.06 ± 0.346
2.354IleGln: 2.354 ± 0.422
3.295IleArg: 3.295 ± 0.424
2.942IleSer: 2.942 ± 0.441
3.531IleThr: 3.531 ± 0.508
2.883IleVal: 2.883 ± 0.576
0.824IleTrp: 0.824 ± 0.188
0.883IleTyr: 0.883 ± 0.278
0.0IleXaa: 0.0 ± 0.0
Lys
4.001LysAla: 4.001 ± 0.677
0.118LysCys: 0.118 ± 0.099
2.354LysAsp: 2.354 ± 0.319
2.766LysGlu: 2.766 ± 0.525
1.177LysPhe: 1.177 ± 0.266
2.177LysGly: 2.177 ± 0.373
0.471LysHis: 0.471 ± 0.196
2.236LysIle: 2.236 ± 0.433
2.589LysLys: 2.589 ± 0.621
3.59LysLeu: 3.59 ± 0.512
1.177LysMet: 1.177 ± 0.305
2.177LysAsn: 2.177 ± 0.439
2.471LysPro: 2.471 ± 0.355
1.589LysGln: 1.589 ± 0.393
2.53LysArg: 2.53 ± 0.49
2.942LysSer: 2.942 ± 0.418
3.472LysThr: 3.472 ± 0.65
2.413LysVal: 2.413 ± 0.344
0.883LysTrp: 0.883 ± 0.204
2.177LysTyr: 2.177 ± 0.421
0.0LysXaa: 0.0 ± 0.0
Leu
7.002LeuAla: 7.002 ± 0.801
0.706LeuCys: 0.706 ± 0.316
6.237LeuAsp: 6.237 ± 0.675
5.826LeuGlu: 5.826 ± 0.806
3.119LeuPhe: 3.119 ± 0.525
6.002LeuGly: 6.002 ± 0.662
1.53LeuHis: 1.53 ± 0.296
3.707LeuIle: 3.707 ± 0.731
3.707LeuLys: 3.707 ± 0.48
6.885LeuLeu: 6.885 ± 0.786
2.707LeuMet: 2.707 ± 0.592
3.707LeuAsn: 3.707 ± 0.558
4.237LeuPro: 4.237 ± 0.575
4.119LeuGln: 4.119 ± 0.667
6.414LeuArg: 6.414 ± 0.486
5.649LeuSer: 5.649 ± 0.834
5.178LeuThr: 5.178 ± 0.579
5.002LeuVal: 5.002 ± 0.553
1.177LeuTrp: 1.177 ± 0.25
2.942LeuTyr: 2.942 ± 0.343
0.0LeuXaa: 0.0 ± 0.0
Met
3.06MetAla: 3.06 ± 0.399
0.059MetCys: 0.059 ± 0.052
1.295MetAsp: 1.295 ± 0.275
1.118MetGlu: 1.118 ± 0.249
1.236MetPhe: 1.236 ± 0.222
2.001MetGly: 2.001 ± 0.658
0.412MetHis: 0.412 ± 0.112
0.765MetIle: 0.765 ± 0.196
1.177MetLys: 1.177 ± 0.256
1.883MetLeu: 1.883 ± 0.383
0.765MetMet: 0.765 ± 0.234
1.236MetAsn: 1.236 ± 0.259
1.824MetPro: 1.824 ± 0.393
1.059MetGln: 1.059 ± 0.238
1.118MetArg: 1.118 ± 0.314
2.295MetSer: 2.295 ± 0.468
1.706MetThr: 1.706 ± 0.401
1.471MetVal: 1.471 ± 0.286
0.177MetTrp: 0.177 ± 0.087
0.53MetTyr: 0.53 ± 0.184
0.0MetXaa: 0.0 ± 0.0
Asn
4.354AsnAla: 4.354 ± 0.72
0.235AsnCys: 0.235 ± 0.131
2.06AsnAsp: 2.06 ± 0.316
2.766AsnGlu: 2.766 ± 0.428
1.295AsnPhe: 1.295 ± 0.294
5.119AsnGly: 5.119 ± 0.631
0.647AsnHis: 0.647 ± 0.238
2.118AsnIle: 2.118 ± 0.356
2.295AsnLys: 2.295 ± 0.485
3.413AsnLeu: 3.413 ± 0.467
1.118AsnMet: 1.118 ± 0.28
2.177AsnAsn: 2.177 ± 0.349
3.472AsnPro: 3.472 ± 0.577
1.942AsnGln: 1.942 ± 0.283
1.706AsnArg: 1.706 ± 0.407
2.53AsnSer: 2.53 ± 0.468
3.648AsnThr: 3.648 ± 0.638
3.766AsnVal: 3.766 ± 0.51
0.706AsnTrp: 0.706 ± 0.198
1.883AsnTyr: 1.883 ± 0.29
0.0AsnXaa: 0.0 ± 0.0
Pro
5.002ProAla: 5.002 ± 0.823
0.177ProCys: 0.177 ± 0.11
3.001ProAsp: 3.001 ± 0.36
4.531ProGlu: 4.531 ± 0.537
1.236ProPhe: 1.236 ± 0.304
2.648ProGly: 2.648 ± 0.395
0.53ProHis: 0.53 ± 0.223
2.354ProIle: 2.354 ± 0.4
1.824ProLys: 1.824 ± 0.499
4.178ProLeu: 4.178 ± 0.418
0.824ProMet: 0.824 ± 0.224
3.236ProAsn: 3.236 ± 0.319
1.942ProPro: 1.942 ± 0.426
1.295ProGln: 1.295 ± 0.26
2.177ProArg: 2.177 ± 0.42
3.119ProSer: 3.119 ± 0.411
3.943ProThr: 3.943 ± 0.455
3.648ProVal: 3.648 ± 0.485
0.942ProTrp: 0.942 ± 0.228
1.295ProTyr: 1.295 ± 0.202
0.0ProXaa: 0.0 ± 0.0
Gln
5.296GlnAla: 5.296 ± 0.549
0.177GlnCys: 0.177 ± 0.116
2.648GlnAsp: 2.648 ± 0.396
2.707GlnGlu: 2.707 ± 0.549
1.295GlnPhe: 1.295 ± 0.259
2.648GlnGly: 2.648 ± 0.31
0.765GlnHis: 0.765 ± 0.159
2.471GlnIle: 2.471 ± 0.363
1.765GlnLys: 1.765 ± 0.39
4.06GlnLeu: 4.06 ± 0.382
1.177GlnMet: 1.177 ± 0.243
2.295GlnAsn: 2.295 ± 0.494
1.53GlnPro: 1.53 ± 0.342
3.119GlnGln: 3.119 ± 0.599
2.589GlnArg: 2.589 ± 0.499
2.413GlnSer: 2.413 ± 0.301
2.825GlnThr: 2.825 ± 0.434
2.177GlnVal: 2.177 ± 0.404
0.824GlnTrp: 0.824 ± 0.209
1.589GlnTyr: 1.589 ± 0.296
0.0GlnXaa: 0.0 ± 0.0
Arg
5.59ArgAla: 5.59 ± 0.596
0.059ArgCys: 0.059 ± 0.061
4.296ArgAsp: 4.296 ± 0.577
4.472ArgGlu: 4.472 ± 0.482
2.236ArgPhe: 2.236 ± 0.438
4.354ArgGly: 4.354 ± 0.575
0.942ArgHis: 0.942 ± 0.243
2.883ArgIle: 2.883 ± 0.397
2.354ArgLys: 2.354 ± 0.329
5.708ArgLeu: 5.708 ± 0.63
1.295ArgMet: 1.295 ± 0.27
3.001ArgAsn: 3.001 ± 0.384
2.589ArgPro: 2.589 ± 0.373
2.883ArgGln: 2.883 ± 0.342
5.002ArgArg: 5.002 ± 0.715
3.06ArgSer: 3.06 ± 0.728
3.884ArgThr: 3.884 ± 0.727
4.237ArgVal: 4.237 ± 0.472
0.588ArgTrp: 0.588 ± 0.171
1.412ArgTyr: 1.412 ± 0.337
0.0ArgXaa: 0.0 ± 0.0
Ser
5.943SerAla: 5.943 ± 0.638
0.177SerCys: 0.177 ± 0.112
3.648SerAsp: 3.648 ± 0.446
4.001SerGlu: 4.001 ± 0.533
3.236SerPhe: 3.236 ± 0.358
5.708SerGly: 5.708 ± 0.702
0.883SerHis: 0.883 ± 0.196
2.236SerIle: 2.236 ± 0.482
2.942SerLys: 2.942 ± 0.317
5.296SerLeu: 5.296 ± 0.733
1.53SerMet: 1.53 ± 0.27
2.883SerAsn: 2.883 ± 0.433
2.942SerPro: 2.942 ± 0.413
2.942SerGln: 2.942 ± 0.369
3.413SerArg: 3.413 ± 0.384
4.237SerSer: 4.237 ± 0.559
4.649SerThr: 4.649 ± 0.576
3.354SerVal: 3.354 ± 0.543
1.118SerTrp: 1.118 ± 0.191
2.06SerTyr: 2.06 ± 0.354
0.0SerXaa: 0.0 ± 0.0
Thr
6.002ThrAla: 6.002 ± 0.563
0.059ThrCys: 0.059 ± 0.054
4.119ThrAsp: 4.119 ± 0.445
4.06ThrGlu: 4.06 ± 0.424
1.53ThrPhe: 1.53 ± 0.278
6.296ThrGly: 6.296 ± 0.873
0.883ThrHis: 0.883 ± 0.232
3.06ThrIle: 3.06 ± 0.43
2.471ThrLys: 2.471 ± 0.494
6.473ThrLeu: 6.473 ± 0.763
1.295ThrMet: 1.295 ± 0.312
3.178ThrAsn: 3.178 ± 0.383
4.06ThrPro: 4.06 ± 0.575
2.883ThrGln: 2.883 ± 0.364
3.766ThrArg: 3.766 ± 0.346
4.825ThrSer: 4.825 ± 0.741
5.473ThrThr: 5.473 ± 0.864
4.178ThrVal: 4.178 ± 0.55
1.353ThrTrp: 1.353 ± 0.322
1.648ThrTyr: 1.648 ± 0.336
0.0ThrXaa: 0.0 ± 0.0
Val
5.002ValAla: 5.002 ± 0.543
0.235ValCys: 0.235 ± 0.137
4.943ValAsp: 4.943 ± 0.623
4.354ValGlu: 4.354 ± 0.754
1.824ValPhe: 1.824 ± 0.271
4.06ValGly: 4.06 ± 0.524
1.295ValHis: 1.295 ± 0.225
3.119ValIle: 3.119 ± 0.345
2.295ValLys: 2.295 ± 0.343
6.355ValLeu: 6.355 ± 0.603
1.648ValMet: 1.648 ± 0.246
2.471ValAsn: 2.471 ± 0.342
4.237ValPro: 4.237 ± 0.619
3.119ValGln: 3.119 ± 0.421
3.825ValArg: 3.825 ± 0.469
3.648ValSer: 3.648 ± 0.452
3.766ValThr: 3.766 ± 0.547
4.06ValVal: 4.06 ± 0.671
1.177ValTrp: 1.177 ± 0.207
2.177ValTyr: 2.177 ± 0.395
0.0ValXaa: 0.0 ± 0.0
Trp
0.942TrpAla: 0.942 ± 0.198
0.118TrpCys: 0.118 ± 0.091
1.765TrpAsp: 1.765 ± 0.321
0.824TrpGlu: 0.824 ± 0.244
0.706TrpPhe: 0.706 ± 0.204
1.118TrpGly: 1.118 ± 0.338
0.118TrpHis: 0.118 ± 0.103
0.765TrpIle: 0.765 ± 0.202
0.765TrpLys: 0.765 ± 0.251
1.353TrpLeu: 1.353 ± 0.299
0.471TrpMet: 0.471 ± 0.139
0.647TrpAsn: 0.647 ± 0.231
0.353TrpPro: 0.353 ± 0.186
1.0TrpGln: 1.0 ± 0.306
0.883TrpArg: 0.883 ± 0.171
1.295TrpSer: 1.295 ± 0.331
1.236TrpThr: 1.236 ± 0.272
1.295TrpVal: 1.295 ± 0.227
0.235TrpTrp: 0.235 ± 0.117
0.824TrpTyr: 0.824 ± 0.26
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.883TyrAla: 2.883 ± 0.395
0.294TyrCys: 0.294 ± 0.153
1.883TyrAsp: 1.883 ± 0.343
1.824TyrGlu: 1.824 ± 0.271
1.118TyrPhe: 1.118 ± 0.3
2.236TyrGly: 2.236 ± 0.313
0.706TyrHis: 0.706 ± 0.205
1.883TyrIle: 1.883 ± 0.351
1.353TyrLys: 1.353 ± 0.339
2.825TyrLeu: 2.825 ± 0.433
0.942TyrMet: 0.942 ± 0.216
1.942TyrAsn: 1.942 ± 0.392
1.295TyrPro: 1.295 ± 0.274
1.648TyrGln: 1.648 ± 0.276
1.706TyrArg: 1.706 ± 0.361
2.06TyrSer: 2.06 ± 0.369
1.883TyrThr: 1.883 ± 0.391
2.766TyrVal: 2.766 ± 0.471
0.294TyrTrp: 0.294 ± 0.132
1.0TyrTyr: 1.0 ± 0.238
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 53 proteins (16995 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski