Amino acid dipepetide frequency for Escherichia phage vB_EcoM-Ro157lw

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.324AlaAla: 9.324 ± 1.018
0.478AlaCys: 0.478 ± 0.153
4.83AlaAsp: 4.83 ± 0.416
5.882AlaGlu: 5.882 ± 0.876
3.682AlaPhe: 3.682 ± 0.466
6.503AlaGly: 6.503 ± 0.645
1.578AlaHis: 1.578 ± 0.268
5.451AlaIle: 5.451 ± 0.461
5.595AlaLys: 5.595 ± 0.975
6.599AlaLeu: 6.599 ± 0.478
2.726AlaMet: 2.726 ± 0.312
3.586AlaAsn: 3.586 ± 0.565
2.486AlaPro: 2.486 ± 0.303
3.347AlaGln: 3.347 ± 0.484
3.778AlaArg: 3.778 ± 0.443
4.925AlaSer: 4.925 ± 0.573
5.451AlaThr: 5.451 ± 0.512
5.738AlaVal: 5.738 ± 0.505
1.243AlaTrp: 1.243 ± 0.221
2.821AlaTyr: 2.821 ± 0.339
0.0AlaXaa: 0.0 ± 0.0
Cys
0.909CysAla: 0.909 ± 0.211
0.096CysCys: 0.096 ± 0.069
0.622CysAsp: 0.622 ± 0.187
0.765CysGlu: 0.765 ± 0.182
0.287CysPhe: 0.287 ± 0.109
0.765CysGly: 0.765 ± 0.154
0.287CysHis: 0.287 ± 0.129
0.622CysIle: 0.622 ± 0.163
0.383CysLys: 0.383 ± 0.145
0.717CysLeu: 0.717 ± 0.175
0.287CysMet: 0.287 ± 0.11
0.43CysAsn: 0.43 ± 0.151
0.574CysPro: 0.574 ± 0.15
0.43CysGln: 0.43 ± 0.127
0.478CysArg: 0.478 ± 0.164
0.43CysSer: 0.43 ± 0.153
0.478CysThr: 0.478 ± 0.141
0.622CysVal: 0.622 ± 0.15
0.143CysTrp: 0.143 ± 0.083
0.43CysTyr: 0.43 ± 0.146
0.0CysXaa: 0.0 ± 0.0
Asp
5.308AspAla: 5.308 ± 0.615
0.669AspCys: 0.669 ± 0.207
3.634AspAsp: 3.634 ± 0.454
4.208AspGlu: 4.208 ± 0.549
2.726AspPhe: 2.726 ± 0.334
5.356AspGly: 5.356 ± 0.542
1.1AspHis: 1.1 ± 0.207
4.208AspIle: 4.208 ± 0.431
3.395AspLys: 3.395 ± 0.445
4.59AspLeu: 4.59 ± 0.526
1.291AspMet: 1.291 ± 0.234
2.678AspAsn: 2.678 ± 0.301
2.726AspPro: 2.726 ± 0.353
2.391AspGln: 2.391 ± 0.289
2.008AspArg: 2.008 ± 0.34
3.204AspSer: 3.204 ± 0.43
2.821AspThr: 2.821 ± 0.383
4.877AspVal: 4.877 ± 0.512
1.148AspTrp: 1.148 ± 0.321
2.247AspTyr: 2.247 ± 0.402
0.0AspXaa: 0.0 ± 0.0
Glu
6.073GluAla: 6.073 ± 0.664
0.478GluCys: 0.478 ± 0.148
3.969GluAsp: 3.969 ± 0.558
5.499GluGlu: 5.499 ± 0.677
3.108GluPhe: 3.108 ± 0.418
4.638GluGly: 4.638 ± 0.485
1.291GluHis: 1.291 ± 0.23
4.399GluIle: 4.399 ± 0.556
3.538GluLys: 3.538 ± 0.604
5.499GluLeu: 5.499 ± 0.493
2.2GluMet: 2.2 ± 0.377
2.726GluAsn: 2.726 ± 0.412
1.578GluPro: 1.578 ± 0.31
2.391GluGln: 2.391 ± 0.39
4.351GluArg: 4.351 ± 0.56
4.017GluSer: 4.017 ± 0.435
2.917GluThr: 2.917 ± 0.303
4.782GluVal: 4.782 ± 0.529
1.195GluTrp: 1.195 ± 0.239
2.63GluTyr: 2.63 ± 0.372
0.0GluXaa: 0.0 ± 0.0
Phe
2.821PheAla: 2.821 ± 0.405
0.335PheCys: 0.335 ± 0.114
3.443PheAsp: 3.443 ± 0.445
2.917PheGlu: 2.917 ± 0.373
1.243PhePhe: 1.243 ± 0.279
3.538PheGly: 3.538 ± 0.331
0.287PheHis: 0.287 ± 0.118
2.391PheIle: 2.391 ± 0.356
3.252PheLys: 3.252 ± 0.4
2.821PheLeu: 2.821 ± 0.387
0.909PheMet: 0.909 ± 0.207
2.678PheAsn: 2.678 ± 0.41
1.482PhePro: 1.482 ± 0.27
1.339PheGln: 1.339 ± 0.277
1.482PheArg: 1.482 ± 0.303
2.056PheSer: 2.056 ± 0.305
2.008PheThr: 2.008 ± 0.366
3.443PheVal: 3.443 ± 0.425
0.335PheTrp: 0.335 ± 0.118
1.291PheTyr: 1.291 ± 0.249
0.0PheXaa: 0.0 ± 0.0
Gly
6.168GlyAla: 6.168 ± 0.625
0.909GlyCys: 0.909 ± 0.213
4.925GlyAsp: 4.925 ± 0.476
5.547GlyGlu: 5.547 ± 0.501
2.965GlyPhe: 2.965 ± 0.364
4.973GlyGly: 4.973 ± 0.622
0.813GlyHis: 0.813 ± 0.217
5.26GlyIle: 5.26 ± 0.506
4.064GlyLys: 4.064 ± 0.474
4.782GlyLeu: 4.782 ± 0.462
2.2GlyMet: 2.2 ± 0.279
3.586GlyAsn: 3.586 ± 0.454
1.53GlyPro: 1.53 ± 0.328
2.965GlyGln: 2.965 ± 0.374
4.208GlyArg: 4.208 ± 0.393
4.351GlySer: 4.351 ± 0.528
4.017GlyThr: 4.017 ± 0.624
6.168GlyVal: 6.168 ± 0.642
1.387GlyTrp: 1.387 ± 0.261
2.773GlyTyr: 2.773 ± 0.361
0.0GlyXaa: 0.0 ± 0.0
His
0.669HisAla: 0.669 ± 0.161
0.383HisCys: 0.383 ± 0.136
1.148HisAsp: 1.148 ± 0.267
1.387HisGlu: 1.387 ± 0.277
0.813HisPhe: 0.813 ± 0.21
1.195HisGly: 1.195 ± 0.248
0.526HisHis: 0.526 ± 0.171
1.1HisIle: 1.1 ± 0.216
0.669HisLys: 0.669 ± 0.207
1.1HisLeu: 1.1 ± 0.21
0.383HisMet: 0.383 ± 0.123
0.861HisAsn: 0.861 ± 0.234
0.526HisPro: 0.526 ± 0.145
0.813HisGln: 0.813 ± 0.24
0.717HisArg: 0.717 ± 0.189
0.909HisSer: 0.909 ± 0.217
0.622HisThr: 0.622 ± 0.223
1.195HisVal: 1.195 ± 0.31
0.191HisTrp: 0.191 ± 0.095
0.813HisTyr: 0.813 ± 0.226
0.0HisXaa: 0.0 ± 0.0
Ile
5.26IleAla: 5.26 ± 0.562
0.861IleCys: 0.861 ± 0.211
5.499IleAsp: 5.499 ± 0.545
4.447IleGlu: 4.447 ± 0.505
1.865IlePhe: 1.865 ± 0.27
4.399IleGly: 4.399 ± 0.44
1.1IleHis: 1.1 ± 0.262
3.347IleIle: 3.347 ± 0.466
5.642IleLys: 5.642 ± 0.582
4.064IleLeu: 4.064 ± 0.463
1.674IleMet: 1.674 ± 0.278
3.586IleAsn: 3.586 ± 0.385
3.204IlePro: 3.204 ± 0.351
2.726IleGln: 2.726 ± 0.353
2.582IleArg: 2.582 ± 0.322
3.969IleSer: 3.969 ± 0.488
4.208IleThr: 4.208 ± 0.594
4.543IleVal: 4.543 ± 0.502
0.43IleTrp: 0.43 ± 0.157
1.626IleTyr: 1.626 ± 0.258
0.0IleXaa: 0.0 ± 0.0
Lys
6.742LysAla: 6.742 ± 0.944
0.622LysCys: 0.622 ± 0.163
4.017LysAsp: 4.017 ± 0.479
5.069LysGlu: 5.069 ± 0.79
2.582LysPhe: 2.582 ± 0.399
4.495LysGly: 4.495 ± 0.553
0.813LysHis: 0.813 ± 0.183
3.443LysIle: 3.443 ± 0.48
4.017LysLys: 4.017 ± 0.56
4.351LysLeu: 4.351 ± 0.546
2.63LysMet: 2.63 ± 0.437
2.821LysAsn: 2.821 ± 0.341
2.008LysPro: 2.008 ± 0.351
2.534LysGln: 2.534 ± 0.36
3.73LysArg: 3.73 ± 0.517
4.208LysSer: 4.208 ± 0.497
3.108LysThr: 3.108 ± 0.427
4.734LysVal: 4.734 ± 0.49
0.622LysTrp: 0.622 ± 0.157
2.678LysTyr: 2.678 ± 0.441
0.0LysXaa: 0.0 ± 0.0
Leu
5.977LeuAla: 5.977 ± 0.54
0.861LeuCys: 0.861 ± 0.183
3.73LeuAsp: 3.73 ± 0.413
4.782LeuGlu: 4.782 ± 0.46
2.247LeuPhe: 2.247 ± 0.318
4.399LeuGly: 4.399 ± 0.496
1.052LeuHis: 1.052 ± 0.221
4.495LeuIle: 4.495 ± 0.465
6.599LeuLys: 6.599 ± 0.651
4.782LeuLeu: 4.782 ± 0.395
2.295LeuMet: 2.295 ± 0.343
4.351LeuAsn: 4.351 ± 0.504
3.252LeuPro: 3.252 ± 0.514
3.108LeuGln: 3.108 ± 0.418
4.017LeuArg: 4.017 ± 0.465
5.499LeuSer: 5.499 ± 0.59
5.642LeuThr: 5.642 ± 0.404
4.017LeuVal: 4.017 ± 0.463
0.765LeuTrp: 0.765 ± 0.174
2.056LeuTyr: 2.056 ± 0.303
0.0LeuXaa: 0.0 ± 0.0
Met
2.63MetAla: 2.63 ± 0.381
0.287MetCys: 0.287 ± 0.108
1.291MetAsp: 1.291 ± 0.273
1.578MetGlu: 1.578 ± 0.383
0.861MetPhe: 0.861 ± 0.198
1.482MetGly: 1.482 ± 0.217
0.239MetHis: 0.239 ± 0.116
1.53MetIle: 1.53 ± 0.29
2.2MetLys: 2.2 ± 0.326
1.961MetLeu: 1.961 ± 0.283
0.526MetMet: 0.526 ± 0.164
2.104MetAsn: 2.104 ± 0.277
1.339MetPro: 1.339 ± 0.269
1.53MetGln: 1.53 ± 0.271
1.578MetArg: 1.578 ± 0.239
1.817MetSer: 1.817 ± 0.34
2.152MetThr: 2.152 ± 0.414
2.104MetVal: 2.104 ± 0.315
0.287MetTrp: 0.287 ± 0.098
0.813MetTyr: 0.813 ± 0.213
0.0MetXaa: 0.0 ± 0.0
Asn
4.16AsnAla: 4.16 ± 0.482
0.43AsnCys: 0.43 ± 0.163
2.2AsnAsp: 2.2 ± 0.254
2.773AsnGlu: 2.773 ± 0.324
1.913AsnPhe: 1.913 ± 0.302
5.26AsnGly: 5.26 ± 0.58
1.148AsnHis: 1.148 ± 0.33
3.682AsnIle: 3.682 ± 0.508
2.63AsnLys: 2.63 ± 0.41
3.634AsnLeu: 3.634 ± 0.38
1.291AsnMet: 1.291 ± 0.219
2.439AsnAsn: 2.439 ± 0.369
2.821AsnPro: 2.821 ± 0.385
1.674AsnGln: 1.674 ± 0.235
2.534AsnArg: 2.534 ± 0.302
3.06AsnSer: 3.06 ± 0.305
3.347AsnThr: 3.347 ± 0.432
2.965AsnVal: 2.965 ± 0.343
0.622AsnTrp: 0.622 ± 0.168
1.913AsnTyr: 1.913 ± 0.373
0.0AsnXaa: 0.0 ± 0.0
Pro
2.439ProAla: 2.439 ± 0.331
0.335ProCys: 0.335 ± 0.109
2.63ProAsp: 2.63 ± 0.336
3.06ProGlu: 3.06 ± 0.42
1.482ProPhe: 1.482 ± 0.271
2.2ProGly: 2.2 ± 0.299
0.765ProHis: 0.765 ± 0.211
2.152ProIle: 2.152 ± 0.458
2.247ProLys: 2.247 ± 0.294
2.678ProLeu: 2.678 ± 0.302
1.1ProMet: 1.1 ± 0.269
2.486ProAsn: 2.486 ± 0.329
1.148ProPro: 1.148 ± 0.256
1.052ProGln: 1.052 ± 0.204
1.387ProArg: 1.387 ± 0.23
2.391ProSer: 2.391 ± 0.36
2.391ProThr: 2.391 ± 0.374
3.06ProVal: 3.06 ± 0.469
0.669ProTrp: 0.669 ± 0.22
1.195ProTyr: 1.195 ± 0.231
0.0ProXaa: 0.0 ± 0.0
Gln
3.491GlnAla: 3.491 ± 0.496
0.239GlnCys: 0.239 ± 0.104
1.674GlnAsp: 1.674 ± 0.302
1.913GlnGlu: 1.913 ± 0.354
1.817GlnPhe: 1.817 ± 0.26
2.295GlnGly: 2.295 ± 0.393
0.813GlnHis: 0.813 ± 0.199
3.06GlnIle: 3.06 ± 0.353
2.582GlnLys: 2.582 ± 0.344
3.538GlnLeu: 3.538 ± 0.406
0.813GlnMet: 0.813 ± 0.203
1.769GlnAsn: 1.769 ± 0.263
1.291GlnPro: 1.291 ± 0.214
1.674GlnGln: 1.674 ± 0.282
2.773GlnArg: 2.773 ± 0.413
2.582GlnSer: 2.582 ± 0.458
2.008GlnThr: 2.008 ± 0.365
2.534GlnVal: 2.534 ± 0.381
0.383GlnTrp: 0.383 ± 0.138
1.578GlnTyr: 1.578 ± 0.277
0.0GlnXaa: 0.0 ± 0.0
Arg
4.208ArgAla: 4.208 ± 0.471
0.765ArgCys: 0.765 ± 0.196
2.726ArgAsp: 2.726 ± 0.336
2.965ArgGlu: 2.965 ± 0.391
2.008ArgPhe: 2.008 ± 0.312
2.965ArgGly: 2.965 ± 0.388
0.861ArgHis: 0.861 ± 0.21
3.538ArgIle: 3.538 ± 0.442
3.395ArgLys: 3.395 ± 0.411
4.064ArgLeu: 4.064 ± 0.488
1.387ArgMet: 1.387 ± 0.235
2.582ArgAsn: 2.582 ± 0.384
1.674ArgPro: 1.674 ± 0.284
2.295ArgGln: 2.295 ± 0.405
2.247ArgArg: 2.247 ± 0.385
2.773ArgSer: 2.773 ± 0.385
2.439ArgThr: 2.439 ± 0.351
3.73ArgVal: 3.73 ± 0.485
0.526ArgTrp: 0.526 ± 0.145
2.247ArgTyr: 2.247 ± 0.308
0.0ArgXaa: 0.0 ± 0.0
Ser
4.638SerAla: 4.638 ± 0.515
0.43SerCys: 0.43 ± 0.126
3.347SerAsp: 3.347 ± 0.386
3.825SerGlu: 3.825 ± 0.479
3.012SerPhe: 3.012 ± 0.387
5.642SerGly: 5.642 ± 0.503
0.956SerHis: 0.956 ± 0.226
5.164SerIle: 5.164 ± 0.666
3.586SerLys: 3.586 ± 0.447
4.59SerLeu: 4.59 ± 0.61
2.104SerMet: 2.104 ± 0.318
3.443SerAsn: 3.443 ± 0.431
2.2SerPro: 2.2 ± 0.352
1.961SerGln: 1.961 ± 0.33
2.582SerArg: 2.582 ± 0.342
3.825SerSer: 3.825 ± 0.465
4.112SerThr: 4.112 ± 0.646
4.351SerVal: 4.351 ± 0.45
0.765SerTrp: 0.765 ± 0.238
1.769SerTyr: 1.769 ± 0.299
0.0SerXaa: 0.0 ± 0.0
Thr
5.595ThrAla: 5.595 ± 0.629
0.43ThrCys: 0.43 ± 0.147
2.726ThrAsp: 2.726 ± 0.385
2.439ThrGlu: 2.439 ± 0.374
2.678ThrPhe: 2.678 ± 0.41
5.356ThrGly: 5.356 ± 0.549
0.861ThrHis: 0.861 ± 0.23
3.252ThrIle: 3.252 ± 0.423
3.443ThrLys: 3.443 ± 0.382
5.308ThrLeu: 5.308 ± 0.546
1.148ThrMet: 1.148 ± 0.241
2.056ThrAsn: 2.056 ± 0.306
2.534ThrPro: 2.534 ± 0.321
2.295ThrGln: 2.295 ± 0.318
2.63ThrArg: 2.63 ± 0.385
4.399ThrSer: 4.399 ± 0.597
3.395ThrThr: 3.395 ± 0.547
4.543ThrVal: 4.543 ± 0.548
0.669ThrTrp: 0.669 ± 0.171
2.2ThrTyr: 2.2 ± 0.324
0.0ThrXaa: 0.0 ± 0.0
Val
5.786ValAla: 5.786 ± 0.512
0.717ValCys: 0.717 ± 0.181
4.877ValAsp: 4.877 ± 0.565
5.547ValGlu: 5.547 ± 0.522
2.678ValPhe: 2.678 ± 0.339
4.351ValGly: 4.351 ± 0.414
0.717ValHis: 0.717 ± 0.225
4.973ValIle: 4.973 ± 0.564
5.116ValLys: 5.116 ± 0.531
5.069ValLeu: 5.069 ± 0.452
2.152ValMet: 2.152 ± 0.335
4.064ValAsn: 4.064 ± 0.489
2.773ValPro: 2.773 ± 0.32
2.295ValGln: 2.295 ± 0.362
3.443ValArg: 3.443 ± 0.443
4.59ValSer: 4.59 ± 0.585
3.634ValThr: 3.634 ± 0.429
4.16ValVal: 4.16 ± 0.438
0.765ValTrp: 0.765 ± 0.182
2.152ValTyr: 2.152 ± 0.449
0.0ValXaa: 0.0 ± 0.0
Trp
1.148TrpAla: 1.148 ± 0.205
0.191TrpCys: 0.191 ± 0.093
0.622TrpAsp: 0.622 ± 0.17
0.861TrpGlu: 0.861 ± 0.277
0.478TrpPhe: 0.478 ± 0.153
0.526TrpGly: 0.526 ± 0.153
0.191TrpHis: 0.191 ± 0.101
0.956TrpIle: 0.956 ± 0.208
0.574TrpLys: 0.574 ± 0.209
1.482TrpLeu: 1.482 ± 0.253
0.43TrpMet: 0.43 ± 0.108
0.622TrpAsn: 0.622 ± 0.159
0.43TrpPro: 0.43 ± 0.13
0.526TrpGln: 0.526 ± 0.166
0.813TrpArg: 0.813 ± 0.195
0.909TrpSer: 0.909 ± 0.248
0.861TrpThr: 0.861 ± 0.209
0.526TrpVal: 0.526 ± 0.17
0.048TrpTrp: 0.048 ± 0.046
0.717TrpTyr: 0.717 ± 0.194
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.678TyrAla: 2.678 ± 0.291
0.239TyrCys: 0.239 ± 0.134
2.678TyrAsp: 2.678 ± 0.279
1.817TyrGlu: 1.817 ± 0.252
1.626TyrPhe: 1.626 ± 0.318
3.252TyrGly: 3.252 ± 0.405
0.622TyrHis: 0.622 ± 0.187
2.008TyrIle: 2.008 ± 0.316
2.295TyrLys: 2.295 ± 0.387
2.295TyrLeu: 2.295 ± 0.271
0.717TyrMet: 0.717 ± 0.19
1.626TyrAsn: 1.626 ± 0.274
1.291TyrPro: 1.291 ± 0.215
1.53TyrGln: 1.53 ± 0.244
2.056TyrArg: 2.056 ± 0.357
2.391TyrSer: 2.391 ± 0.434
2.343TyrThr: 2.343 ± 0.332
1.769TyrVal: 1.769 ± 0.312
0.717TyrTrp: 0.717 ± 0.191
1.626TyrTyr: 1.626 ± 0.287
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 102 proteins (20914 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski