Amino acid dipepetide frequency for Ralstonia phage RS138

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
22.003AlaAla: 22.003 ± 4.227
0.695AlaCys: 0.695 ± 0.228
7.18AlaAsp: 7.18 ± 0.701
9.033AlaGlu: 9.033 ± 1.425
3.242AlaPhe: 3.242 ± 0.573
11.426AlaGly: 11.426 ± 0.894
2.162AlaHis: 2.162 ± 0.448
4.401AlaIle: 4.401 ± 0.717
6.022AlaLys: 6.022 ± 0.88
12.198AlaLeu: 12.198 ± 1.235
3.242AlaMet: 3.242 ± 0.615
2.856AlaAsn: 2.856 ± 0.492
5.945AlaPro: 5.945 ± 1.013
8.106AlaGln: 8.106 ± 1.479
10.577AlaArg: 10.577 ± 1.719
7.72AlaSer: 7.72 ± 0.999
9.033AlaThr: 9.033 ± 0.815
8.955AlaVal: 8.955 ± 0.848
1.467AlaTrp: 1.467 ± 0.376
3.783AlaTyr: 3.783 ± 0.482
0.0AlaXaa: 0.0 ± 0.0
Cys
0.695CysAla: 0.695 ± 0.283
0.309CysCys: 0.309 ± 0.186
0.463CysAsp: 0.463 ± 0.238
0.154CysGlu: 0.154 ± 0.112
0.077CysPhe: 0.077 ± 0.087
0.463CysGly: 0.463 ± 0.236
0.154CysHis: 0.154 ± 0.125
0.077CysIle: 0.077 ± 0.069
0.309CysLys: 0.309 ± 0.159
0.463CysLeu: 0.463 ± 0.176
0.154CysMet: 0.154 ± 0.13
0.154CysAsn: 0.154 ± 0.104
0.618CysPro: 0.618 ± 0.282
0.54CysGln: 0.54 ± 0.219
0.926CysArg: 0.926 ± 0.346
0.386CysSer: 0.386 ± 0.185
0.618CysThr: 0.618 ± 0.239
0.54CysVal: 0.54 ± 0.206
0.077CysTrp: 0.077 ± 0.093
0.232CysTyr: 0.232 ± 0.125
0.0CysXaa: 0.0 ± 0.0
Asp
7.566AspAla: 7.566 ± 0.899
0.0AspCys: 0.0 ± 0.0
2.316AspAsp: 2.316 ± 0.385
3.165AspGlu: 3.165 ± 0.494
1.544AspPhe: 1.544 ± 0.487
5.095AspGly: 5.095 ± 0.821
1.235AspHis: 1.235 ± 0.31
3.165AspIle: 3.165 ± 0.497
2.162AspLys: 2.162 ± 0.521
5.095AspLeu: 5.095 ± 0.605
1.698AspMet: 1.698 ± 0.517
1.544AspAsn: 1.544 ± 0.383
2.393AspPro: 2.393 ± 0.477
2.47AspGln: 2.47 ± 0.474
3.551AspArg: 3.551 ± 0.529
2.47AspSer: 2.47 ± 0.447
4.478AspThr: 4.478 ± 0.672
3.783AspVal: 3.783 ± 0.568
1.235AspTrp: 1.235 ± 0.351
1.853AspTyr: 1.853 ± 0.373
0.0AspXaa: 0.0 ± 0.0
Glu
6.717GluAla: 6.717 ± 1.134
0.463GluCys: 0.463 ± 0.227
2.239GluAsp: 2.239 ± 0.427
2.316GluGlu: 2.316 ± 0.487
1.621GluPhe: 1.621 ± 0.384
3.629GluGly: 3.629 ± 0.746
1.081GluHis: 1.081 ± 0.33
3.165GluIle: 3.165 ± 0.472
1.776GluLys: 1.776 ± 0.336
6.717GluLeu: 6.717 ± 1.227
1.004GluMet: 1.004 ± 0.268
1.467GluAsn: 1.467 ± 0.383
1.39GluPro: 1.39 ± 0.405
2.393GluGln: 2.393 ± 0.531
4.787GluArg: 4.787 ± 0.838
1.312GluSer: 1.312 ± 0.28
3.397GluThr: 3.397 ± 0.542
3.397GluVal: 3.397 ± 0.493
0.695GluTrp: 0.695 ± 0.296
1.235GluTyr: 1.235 ± 0.281
0.0GluXaa: 0.0 ± 0.0
Phe
2.779PheAla: 2.779 ± 0.446
0.232PheCys: 0.232 ± 0.181
1.776PheAsp: 1.776 ± 0.351
1.39PheGlu: 1.39 ± 0.339
0.463PhePhe: 0.463 ± 0.198
3.783PheGly: 3.783 ± 0.689
0.309PheHis: 0.309 ± 0.165
0.618PheIle: 0.618 ± 0.168
0.849PheLys: 0.849 ± 0.268
1.467PheLeu: 1.467 ± 0.316
0.463PheMet: 0.463 ± 0.215
1.235PheAsn: 1.235 ± 0.422
1.776PhePro: 1.776 ± 0.526
1.467PheGln: 1.467 ± 0.397
1.621PheArg: 1.621 ± 0.4
1.081PheSer: 1.081 ± 0.267
2.162PheThr: 2.162 ± 0.579
1.776PheVal: 1.776 ± 0.35
0.232PheTrp: 0.232 ± 0.178
0.695PheTyr: 0.695 ± 0.233
0.0PheXaa: 0.0 ± 0.0
Gly
9.65GlyAla: 9.65 ± 1.106
0.618GlyCys: 0.618 ± 0.27
4.632GlyAsp: 4.632 ± 0.577
3.32GlyGlu: 3.32 ± 0.407
1.853GlyPhe: 1.853 ± 0.452
6.176GlyGly: 6.176 ± 0.786
1.93GlyHis: 1.93 ± 0.474
3.86GlyIle: 3.86 ± 0.574
3.937GlyLys: 3.937 ± 0.613
6.022GlyLeu: 6.022 ± 0.725
2.239GlyMet: 2.239 ± 0.453
3.551GlyAsn: 3.551 ± 0.651
2.548GlyPro: 2.548 ± 0.462
3.86GlyGln: 3.86 ± 0.572
6.485GlyArg: 6.485 ± 0.84
3.629GlySer: 3.629 ± 0.864
6.871GlyThr: 6.871 ± 1.073
5.404GlyVal: 5.404 ± 0.581
1.312GlyTrp: 1.312 ± 0.283
2.316GlyTyr: 2.316 ± 0.464
0.0GlyXaa: 0.0 ± 0.0
His
2.084HisAla: 2.084 ± 0.563
0.695HisCys: 0.695 ± 0.233
1.081HisAsp: 1.081 ± 0.339
0.463HisGlu: 0.463 ± 0.176
0.695HisPhe: 0.695 ± 0.249
1.853HisGly: 1.853 ± 0.524
1.004HisHis: 1.004 ± 0.468
0.849HisIle: 0.849 ± 0.273
0.695HisLys: 0.695 ± 0.233
1.081HisLeu: 1.081 ± 0.306
0.386HisMet: 0.386 ± 0.21
0.618HisAsn: 0.618 ± 0.21
1.235HisPro: 1.235 ± 0.402
1.081HisGln: 1.081 ± 0.423
1.312HisArg: 1.312 ± 0.308
0.772HisSer: 0.772 ± 0.277
0.926HisThr: 0.926 ± 0.275
1.544HisVal: 1.544 ± 0.296
0.309HisTrp: 0.309 ± 0.156
0.232HisTyr: 0.232 ± 0.126
0.0HisXaa: 0.0 ± 0.0
Ile
6.717IleAla: 6.717 ± 0.737
0.077IleCys: 0.077 ± 0.082
3.011IleAsp: 3.011 ± 0.483
3.242IleGlu: 3.242 ± 0.601
0.54IlePhe: 0.54 ± 0.196
3.011IleGly: 3.011 ± 0.45
1.081IleHis: 1.081 ± 0.251
1.081IleIle: 1.081 ± 0.294
1.312IleLys: 1.312 ± 0.311
2.625IleLeu: 2.625 ± 0.559
0.309IleMet: 0.309 ± 0.155
1.235IleAsn: 1.235 ± 0.308
1.39IlePro: 1.39 ± 0.467
1.698IleGln: 1.698 ± 0.375
3.011IleArg: 3.011 ± 0.554
1.621IleSer: 1.621 ± 0.423
2.779IleThr: 2.779 ± 0.631
2.316IleVal: 2.316 ± 0.395
0.618IleTrp: 0.618 ± 0.249
0.772IleTyr: 0.772 ± 0.241
0.0IleXaa: 0.0 ± 0.0
Lys
5.945LysAla: 5.945 ± 0.839
0.154LysCys: 0.154 ± 0.118
1.544LysAsp: 1.544 ± 0.338
1.544LysGlu: 1.544 ± 0.432
0.54LysPhe: 0.54 ± 0.195
2.316LysGly: 2.316 ± 0.36
0.695LysHis: 0.695 ± 0.264
0.54LysIle: 0.54 ± 0.195
1.312LysLys: 1.312 ± 0.502
4.015LysLeu: 4.015 ± 0.523
0.386LysMet: 0.386 ± 0.186
0.772LysAsn: 0.772 ± 0.237
2.47LysPro: 2.47 ± 0.67
1.544LysGln: 1.544 ± 0.412
2.702LysArg: 2.702 ± 0.569
2.084LysSer: 2.084 ± 0.397
3.088LysThr: 3.088 ± 0.523
2.779LysVal: 2.779 ± 0.496
0.54LysTrp: 0.54 ± 0.211
0.618LysTyr: 0.618 ± 0.203
0.0LysXaa: 0.0 ± 0.0
Leu
12.893LeuAla: 12.893 ± 2.028
0.386LeuCys: 0.386 ± 0.233
6.176LeuAsp: 6.176 ± 0.718
4.864LeuGlu: 4.864 ± 0.684
2.625LeuPhe: 2.625 ± 0.525
5.79LeuGly: 5.79 ± 0.746
2.007LeuHis: 2.007 ± 0.444
2.779LeuIle: 2.779 ± 0.449
2.007LeuLys: 2.007 ± 0.35
8.724LeuLeu: 8.724 ± 0.751
1.312LeuMet: 1.312 ± 0.323
2.856LeuAsn: 2.856 ± 0.485
5.327LeuPro: 5.327 ± 0.54
5.404LeuGln: 5.404 ± 0.699
6.871LeuArg: 6.871 ± 0.666
5.713LeuSer: 5.713 ± 0.587
5.636LeuThr: 5.636 ± 0.586
6.562LeuVal: 6.562 ± 0.822
1.158LeuTrp: 1.158 ± 0.296
1.467LeuTyr: 1.467 ± 0.416
0.0LeuXaa: 0.0 ± 0.0
Met
3.629MetAla: 3.629 ± 0.395
0.154MetCys: 0.154 ± 0.113
1.467MetAsp: 1.467 ± 0.419
1.004MetGlu: 1.004 ± 0.33
0.54MetPhe: 0.54 ± 0.168
1.158MetGly: 1.158 ± 0.336
0.232MetHis: 0.232 ± 0.138
0.926MetIle: 0.926 ± 0.255
0.618MetLys: 0.618 ± 0.199
1.698MetLeu: 1.698 ± 0.39
0.54MetMet: 0.54 ± 0.18
0.463MetAsn: 0.463 ± 0.168
1.235MetPro: 1.235 ± 0.391
1.544MetGln: 1.544 ± 0.339
2.47MetArg: 2.47 ± 0.493
0.926MetSer: 0.926 ± 0.268
1.467MetThr: 1.467 ± 0.393
1.853MetVal: 1.853 ± 0.36
0.077MetTrp: 0.077 ± 0.073
0.386MetTyr: 0.386 ± 0.159
0.0MetXaa: 0.0 ± 0.0
Asn
4.169AsnAla: 4.169 ± 0.603
0.154AsnCys: 0.154 ± 0.127
1.312AsnAsp: 1.312 ± 0.271
1.081AsnGlu: 1.081 ± 0.222
0.386AsnPhe: 0.386 ± 0.164
3.629AsnGly: 3.629 ± 0.646
0.618AsnHis: 0.618 ± 0.205
0.849AsnIle: 0.849 ± 0.215
1.081AsnLys: 1.081 ± 0.353
2.393AsnLeu: 2.393 ± 0.54
0.772AsnMet: 0.772 ± 0.23
0.926AsnAsn: 0.926 ± 0.286
1.621AsnPro: 1.621 ± 0.396
0.849AsnGln: 0.849 ± 0.261
1.93AsnArg: 1.93 ± 0.398
1.312AsnSer: 1.312 ± 0.317
1.467AsnThr: 1.467 ± 0.483
2.779AsnVal: 2.779 ± 0.476
0.926AsnTrp: 0.926 ± 0.218
1.004AsnTyr: 1.004 ± 0.438
0.0AsnXaa: 0.0 ± 0.0
Pro
7.257ProAla: 7.257 ± 0.99
0.077ProCys: 0.077 ± 0.079
2.934ProAsp: 2.934 ± 0.533
3.242ProGlu: 3.242 ± 0.555
1.544ProPhe: 1.544 ± 0.417
3.242ProGly: 3.242 ± 0.755
0.926ProHis: 0.926 ± 0.358
1.853ProIle: 1.853 ± 0.412
1.776ProLys: 1.776 ± 0.399
3.551ProLeu: 3.551 ± 0.687
1.312ProMet: 1.312 ± 0.377
1.004ProAsn: 1.004 ± 0.324
1.853ProPro: 1.853 ± 0.424
1.621ProGln: 1.621 ± 0.356
3.011ProArg: 3.011 ± 0.471
2.702ProSer: 2.702 ± 0.547
3.088ProThr: 3.088 ± 0.434
3.783ProVal: 3.783 ± 0.674
0.618ProTrp: 0.618 ± 0.205
1.621ProTyr: 1.621 ± 0.426
0.0ProXaa: 0.0 ± 0.0
Gln
8.029GlnAla: 8.029 ± 1.277
0.232GlnCys: 0.232 ± 0.134
1.93GlnAsp: 1.93 ± 0.367
2.007GlnGlu: 2.007 ± 0.439
1.312GlnPhe: 1.312 ± 0.323
2.934GlnGly: 2.934 ± 0.453
0.463GlnHis: 0.463 ± 0.2
2.393GlnIle: 2.393 ± 0.393
0.926GlnLys: 0.926 ± 0.277
6.871GlnLeu: 6.871 ± 0.885
1.467GlnMet: 1.467 ± 0.401
0.849GlnAsn: 0.849 ± 0.229
2.779GlnPro: 2.779 ± 0.536
2.856GlnGln: 2.856 ± 0.575
4.864GlnArg: 4.864 ± 0.815
2.393GlnSer: 2.393 ± 0.421
3.242GlnThr: 3.242 ± 0.54
3.242GlnVal: 3.242 ± 0.414
0.618GlnTrp: 0.618 ± 0.307
1.158GlnTyr: 1.158 ± 0.336
0.0GlnXaa: 0.0 ± 0.0
Arg
9.264ArgAla: 9.264 ± 1.464
0.54ArgCys: 0.54 ± 0.226
4.169ArgAsp: 4.169 ± 0.463
4.555ArgGlu: 4.555 ± 0.776
2.162ArgPhe: 2.162 ± 0.278
4.092ArgGly: 4.092 ± 0.433
1.544ArgHis: 1.544 ± 0.491
3.86ArgIle: 3.86 ± 0.585
2.702ArgLys: 2.702 ± 0.409
7.566ArgLeu: 7.566 ± 1.01
1.853ArgMet: 1.853 ± 0.332
2.239ArgAsn: 2.239 ± 0.472
3.86ArgPro: 3.86 ± 0.658
3.551ArgGln: 3.551 ± 0.653
5.327ArgArg: 5.327 ± 1.217
4.092ArgSer: 4.092 ± 0.61
3.937ArgThr: 3.937 ± 0.716
5.481ArgVal: 5.481 ± 0.736
1.776ArgTrp: 1.776 ± 0.319
2.239ArgTyr: 2.239 ± 0.424
0.0ArgXaa: 0.0 ± 0.0
Ser
8.415SerAla: 8.415 ± 0.999
0.232SerCys: 0.232 ± 0.135
3.011SerAsp: 3.011 ± 0.399
1.853SerGlu: 1.853 ± 0.377
1.158SerPhe: 1.158 ± 0.333
5.713SerGly: 5.713 ± 1.067
0.695SerHis: 0.695 ± 0.266
1.698SerIle: 1.698 ± 0.543
1.621SerLys: 1.621 ± 0.353
4.169SerLeu: 4.169 ± 0.657
1.158SerMet: 1.158 ± 0.288
1.93SerAsn: 1.93 ± 0.375
2.393SerPro: 2.393 ± 0.448
2.393SerGln: 2.393 ± 0.433
3.32SerArg: 3.32 ± 0.586
2.856SerSer: 2.856 ± 0.563
3.242SerThr: 3.242 ± 0.489
4.015SerVal: 4.015 ± 0.632
1.081SerTrp: 1.081 ± 0.317
1.312SerTyr: 1.312 ± 0.295
0.0SerXaa: 0.0 ± 0.0
Thr
9.033ThrAla: 9.033 ± 1.25
0.695ThrCys: 0.695 ± 0.271
4.478ThrAsp: 4.478 ± 0.606
2.779ThrGlu: 2.779 ± 0.551
2.084ThrPhe: 2.084 ± 0.478
7.18ThrGly: 7.18 ± 0.971
0.849ThrHis: 0.849 ± 0.258
2.162ThrIle: 2.162 ± 0.341
2.007ThrLys: 2.007 ± 0.518
5.867ThrLeu: 5.867 ± 0.698
1.698ThrMet: 1.698 ± 0.366
1.776ThrAsn: 1.776 ± 0.312
3.011ThrPro: 3.011 ± 0.575
2.779ThrGln: 2.779 ± 0.48
3.242ThrArg: 3.242 ± 0.45
3.474ThrSer: 3.474 ± 0.606
3.706ThrThr: 3.706 ± 0.607
6.485ThrVal: 6.485 ± 1.195
1.39ThrTrp: 1.39 ± 0.376
1.621ThrTyr: 1.621 ± 0.365
0.0ThrXaa: 0.0 ± 0.0
Val
9.419ValAla: 9.419 ± 0.86
1.235ValCys: 1.235 ± 0.334
5.095ValAsp: 5.095 ± 0.631
2.702ValGlu: 2.702 ± 0.48
2.316ValPhe: 2.316 ± 0.366
5.559ValGly: 5.559 ± 0.72
1.235ValHis: 1.235 ± 0.319
2.702ValIle: 2.702 ± 0.478
3.088ValLys: 3.088 ± 0.458
6.331ValLeu: 6.331 ± 0.729
1.235ValMet: 1.235 ± 0.397
2.548ValAsn: 2.548 ± 0.504
3.242ValPro: 3.242 ± 0.512
4.015ValGln: 4.015 ± 0.489
4.941ValArg: 4.941 ± 0.682
4.555ValSer: 4.555 ± 0.725
4.401ValThr: 4.401 ± 0.764
4.632ValVal: 4.632 ± 0.809
0.54ValTrp: 0.54 ± 0.216
1.93ValTyr: 1.93 ± 0.406
0.0ValXaa: 0.0 ± 0.0
Trp
1.158TrpAla: 1.158 ± 0.303
0.077TrpCys: 0.077 ± 0.088
0.386TrpAsp: 0.386 ± 0.152
0.618TrpGlu: 0.618 ± 0.271
0.54TrpPhe: 0.54 ± 0.205
1.081TrpGly: 1.081 ± 0.295
0.232TrpHis: 0.232 ± 0.151
0.618TrpIle: 0.618 ± 0.272
0.54TrpLys: 0.54 ± 0.199
1.467TrpLeu: 1.467 ± 0.337
0.386TrpMet: 0.386 ± 0.171
0.309TrpAsn: 0.309 ± 0.136
0.926TrpPro: 0.926 ± 0.271
1.081TrpGln: 1.081 ± 0.355
1.544TrpArg: 1.544 ± 0.463
1.544TrpSer: 1.544 ± 0.503
1.235TrpThr: 1.235 ± 0.33
0.772TrpVal: 0.772 ± 0.245
0.386TrpTrp: 0.386 ± 0.19
0.386TrpTyr: 0.386 ± 0.18
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.625TyrAla: 2.625 ± 0.515
0.386TyrCys: 0.386 ± 0.179
1.621TyrAsp: 1.621 ± 0.374
1.467TyrGlu: 1.467 ± 0.259
0.926TyrPhe: 0.926 ± 0.324
2.084TyrGly: 2.084 ± 0.455
0.463TyrHis: 0.463 ± 0.213
0.849TyrIle: 0.849 ± 0.211
0.849TyrLys: 0.849 ± 0.267
2.239TyrLeu: 2.239 ± 0.45
0.695TyrMet: 0.695 ± 0.253
0.926TyrAsn: 0.926 ± 0.313
1.004TyrPro: 1.004 ± 0.272
1.312TyrGln: 1.312 ± 0.297
2.316TyrArg: 2.316 ± 0.436
1.698TyrSer: 1.698 ± 0.397
1.544TyrThr: 1.544 ± 0.33
1.621TyrVal: 1.621 ± 0.37
0.232TyrTrp: 0.232 ± 0.13
0.54TyrTyr: 0.54 ± 0.193
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 55 proteins (12954 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski