Amino acid dipepetide frequency for Ralstonia phage phiAp1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
16.267AlaAla: 16.267 ± 1.583
1.518AlaCys: 1.518 ± 0.297
6.507AlaAsp: 6.507 ± 0.566
6.001AlaGlu: 6.001 ± 0.68
3.398AlaPhe: 3.398 ± 0.503
10.338AlaGly: 10.338 ± 0.951
2.603AlaHis: 2.603 ± 0.52
4.844AlaIle: 4.844 ± 0.551
4.41AlaLys: 4.41 ± 0.595
10.7AlaLeu: 10.7 ± 1.018
3.326AlaMet: 3.326 ± 0.535
4.699AlaAsn: 4.699 ± 0.628
5.278AlaPro: 5.278 ± 0.868
5.567AlaGln: 5.567 ± 0.661
6.217AlaArg: 6.217 ± 0.776
6.29AlaSer: 6.29 ± 1.356
6.651AlaThr: 6.651 ± 1.197
8.892AlaVal: 8.892 ± 0.807
2.097AlaTrp: 2.097 ± 0.383
3.615AlaTyr: 3.615 ± 0.685
0.0AlaXaa: 0.0 ± 0.0
Cys
0.94CysAla: 0.94 ± 0.335
0.0CysCys: 0.0 ± 0.0
0.651CysAsp: 0.651 ± 0.291
0.434CysGlu: 0.434 ± 0.176
0.289CysPhe: 0.289 ± 0.158
0.94CysGly: 0.94 ± 0.299
0.145CysHis: 0.145 ± 0.105
0.434CysIle: 0.434 ± 0.162
0.651CysLys: 0.651 ± 0.27
0.506CysLeu: 0.506 ± 0.189
0.217CysMet: 0.217 ± 0.103
0.361CysAsn: 0.361 ± 0.16
0.651CysPro: 0.651 ± 0.214
0.289CysGln: 0.289 ± 0.174
0.94CysArg: 0.94 ± 0.33
0.651CysSer: 0.651 ± 0.218
0.723CysThr: 0.723 ± 0.209
0.578CysVal: 0.578 ± 0.226
0.072CysTrp: 0.072 ± 0.071
0.651CysTyr: 0.651 ± 0.206
0.0CysXaa: 0.0 ± 0.0
Asp
7.157AspAla: 7.157 ± 1.022
0.506AspCys: 0.506 ± 0.253
3.615AspAsp: 3.615 ± 0.593
2.964AspGlu: 2.964 ± 0.578
1.807AspPhe: 1.807 ± 0.311
4.121AspGly: 4.121 ± 0.522
0.94AspHis: 0.94 ± 0.337
4.121AspIle: 4.121 ± 0.551
3.326AspLys: 3.326 ± 0.713
4.482AspLeu: 4.482 ± 0.524
1.229AspMet: 1.229 ± 0.247
1.88AspAsn: 1.88 ± 0.331
3.398AspPro: 3.398 ± 0.602
2.169AspGln: 2.169 ± 0.368
2.964AspArg: 2.964 ± 0.647
2.964AspSer: 2.964 ± 0.351
4.121AspThr: 4.121 ± 0.494
4.193AspVal: 4.193 ± 0.525
1.012AspTrp: 1.012 ± 0.273
1.446AspTyr: 1.446 ± 0.424
0.0AspXaa: 0.0 ± 0.0
Glu
6.868GluAla: 6.868 ± 0.652
0.795GluCys: 0.795 ± 0.251
2.53GluAsp: 2.53 ± 0.442
2.964GluGlu: 2.964 ± 0.569
2.241GluPhe: 2.241 ± 0.484
3.976GluGly: 3.976 ± 0.52
1.88GluHis: 1.88 ± 0.434
1.88GluIle: 1.88 ± 0.398
2.892GluLys: 2.892 ± 0.517
4.772GluLeu: 4.772 ± 0.614
2.024GluMet: 2.024 ± 0.357
1.663GluAsn: 1.663 ± 0.347
1.591GluPro: 1.591 ± 0.358
3.109GluGln: 3.109 ± 0.543
3.253GluArg: 3.253 ± 0.566
2.82GluSer: 2.82 ± 0.418
2.675GluThr: 2.675 ± 0.456
4.049GluVal: 4.049 ± 0.59
0.506GluTrp: 0.506 ± 0.235
2.024GluTyr: 2.024 ± 0.407
0.0GluXaa: 0.0 ± 0.0
Phe
3.253PheAla: 3.253 ± 0.396
0.361PheCys: 0.361 ± 0.17
2.675PheAsp: 2.675 ± 0.398
2.458PheGlu: 2.458 ± 0.497
1.301PhePhe: 1.301 ± 0.367
2.53PheGly: 2.53 ± 0.259
0.795PheHis: 0.795 ± 0.234
1.518PheIle: 1.518 ± 0.236
1.735PheLys: 1.735 ± 0.346
1.952PheLeu: 1.952 ± 0.388
0.868PheMet: 0.868 ± 0.192
1.88PheAsn: 1.88 ± 0.502
1.518PhePro: 1.518 ± 0.369
1.229PheGln: 1.229 ± 0.277
1.301PheArg: 1.301 ± 0.33
2.386PheSer: 2.386 ± 0.401
1.663PheThr: 1.663 ± 0.378
2.241PheVal: 2.241 ± 0.368
0.361PheTrp: 0.361 ± 0.167
0.868PheTyr: 0.868 ± 0.223
0.0PheXaa: 0.0 ± 0.0
Gly
7.953GlyAla: 7.953 ± 0.838
0.578GlyCys: 0.578 ± 0.236
3.832GlyAsp: 3.832 ± 0.447
4.121GlyGlu: 4.121 ± 0.568
2.82GlyPhe: 2.82 ± 0.513
6.434GlyGly: 6.434 ± 0.99
1.663GlyHis: 1.663 ± 0.376
3.615GlyIle: 3.615 ± 0.563
4.699GlyLys: 4.699 ± 0.744
5.928GlyLeu: 5.928 ± 0.733
2.53GlyMet: 2.53 ± 0.45
3.181GlyAsn: 3.181 ± 0.591
2.675GlyPro: 2.675 ± 0.417
3.036GlyGln: 3.036 ± 0.49
4.121GlyArg: 4.121 ± 0.477
5.278GlySer: 5.278 ± 1.093
5.784GlyThr: 5.784 ± 1.109
5.856GlyVal: 5.856 ± 0.7
1.084GlyTrp: 1.084 ± 0.269
1.807GlyTyr: 1.807 ± 0.258
0.0GlyXaa: 0.0 ± 0.0
His
2.241HisAla: 2.241 ± 0.573
0.145HisCys: 0.145 ± 0.1
1.157HisAsp: 1.157 ± 0.242
1.012HisGlu: 1.012 ± 0.229
0.651HisPhe: 0.651 ± 0.213
1.663HisGly: 1.663 ± 0.392
0.651HisHis: 0.651 ± 0.287
0.868HisIle: 0.868 ± 0.199
1.446HisLys: 1.446 ± 0.379
2.603HisLeu: 2.603 ± 0.574
0.434HisMet: 0.434 ± 0.17
1.084HisAsn: 1.084 ± 0.295
1.012HisPro: 1.012 ± 0.354
0.723HisGln: 0.723 ± 0.215
0.868HisArg: 0.868 ± 0.228
1.012HisSer: 1.012 ± 0.256
0.795HisThr: 0.795 ± 0.227
1.374HisVal: 1.374 ± 0.353
0.361HisTrp: 0.361 ± 0.163
0.651HisTyr: 0.651 ± 0.249
0.0HisXaa: 0.0 ± 0.0
Ile
5.205IleAla: 5.205 ± 0.666
0.289IleCys: 0.289 ± 0.112
3.181IleAsp: 3.181 ± 0.415
3.326IleGlu: 3.326 ± 0.526
1.012IlePhe: 1.012 ± 0.278
3.181IleGly: 3.181 ± 0.541
0.868IleHis: 0.868 ± 0.317
1.807IleIle: 1.807 ± 0.379
2.169IleLys: 2.169 ± 0.43
2.964IleLeu: 2.964 ± 0.491
1.012IleMet: 1.012 ± 0.242
1.952IleAsn: 1.952 ± 0.408
2.53IlePro: 2.53 ± 0.457
2.53IleGln: 2.53 ± 0.506
2.964IleArg: 2.964 ± 0.518
1.735IleSer: 1.735 ± 0.412
2.747IleThr: 2.747 ± 0.502
3.47IleVal: 3.47 ± 0.514
0.506IleTrp: 0.506 ± 0.17
1.012IleTyr: 1.012 ± 0.231
0.0IleXaa: 0.0 ± 0.0
Lys
5.35LysAla: 5.35 ± 0.573
0.217LysCys: 0.217 ± 0.114
3.036LysAsp: 3.036 ± 0.449
3.253LysGlu: 3.253 ± 0.673
1.88LysPhe: 1.88 ± 0.29
3.326LysGly: 3.326 ± 0.471
1.518LysHis: 1.518 ± 0.351
1.807LysIle: 1.807 ± 0.319
2.241LysLys: 2.241 ± 0.462
5.205LysLeu: 5.205 ± 0.873
1.663LysMet: 1.663 ± 0.36
1.807LysAsn: 1.807 ± 0.351
1.952LysPro: 1.952 ± 0.338
2.747LysGln: 2.747 ± 0.431
3.47LysArg: 3.47 ± 0.392
2.169LysSer: 2.169 ± 0.363
2.097LysThr: 2.097 ± 0.378
2.747LysVal: 2.747 ± 0.416
0.506LysTrp: 0.506 ± 0.179
1.735LysTyr: 1.735 ± 0.293
0.0LysXaa: 0.0 ± 0.0
Leu
8.459LeuAla: 8.459 ± 0.893
1.229LeuCys: 1.229 ± 0.32
4.844LeuAsp: 4.844 ± 0.772
3.976LeuGlu: 3.976 ± 0.485
2.169LeuPhe: 2.169 ± 0.38
4.772LeuGly: 4.772 ± 0.886
1.807LeuHis: 1.807 ± 0.356
3.687LeuIle: 3.687 ± 0.545
4.555LeuLys: 4.555 ± 0.616
6.217LeuLeu: 6.217 ± 0.926
2.458LeuMet: 2.458 ± 0.414
3.759LeuAsn: 3.759 ± 0.606
3.904LeuPro: 3.904 ± 0.584
3.036LeuGln: 3.036 ± 0.441
5.856LeuArg: 5.856 ± 0.95
5.205LeuSer: 5.205 ± 0.672
6.651LeuThr: 6.651 ± 0.665
4.338LeuVal: 4.338 ± 0.7
1.084LeuTrp: 1.084 ± 0.235
2.675LeuTyr: 2.675 ± 0.417
0.0LeuXaa: 0.0 ± 0.0
Met
2.964MetAla: 2.964 ± 0.494
0.289MetCys: 0.289 ± 0.139
1.518MetAsp: 1.518 ± 0.24
1.446MetGlu: 1.446 ± 0.305
0.94MetPhe: 0.94 ± 0.247
1.88MetGly: 1.88 ± 0.389
0.723MetHis: 0.723 ± 0.224
0.651MetIle: 0.651 ± 0.186
1.591MetLys: 1.591 ± 0.319
1.807MetLeu: 1.807 ± 0.318
1.157MetMet: 1.157 ± 0.318
1.012MetAsn: 1.012 ± 0.254
1.229MetPro: 1.229 ± 0.285
1.663MetGln: 1.663 ± 0.389
1.735MetArg: 1.735 ± 0.376
2.82MetSer: 2.82 ± 0.535
2.386MetThr: 2.386 ± 0.407
1.518MetVal: 1.518 ± 0.384
0.217MetTrp: 0.217 ± 0.108
1.012MetTyr: 1.012 ± 0.26
0.0MetXaa: 0.0 ± 0.0
Asn
4.916AsnAla: 4.916 ± 0.684
0.145AsnCys: 0.145 ± 0.096
2.458AsnAsp: 2.458 ± 0.296
1.952AsnGlu: 1.952 ± 0.362
1.591AsnPhe: 1.591 ± 0.271
3.615AsnGly: 3.615 ± 0.632
0.361AsnHis: 0.361 ± 0.144
2.241AsnIle: 2.241 ± 0.341
2.024AsnLys: 2.024 ± 0.335
2.675AsnLeu: 2.675 ± 0.459
0.795AsnMet: 0.795 ± 0.236
1.807AsnAsn: 1.807 ± 0.372
2.241AsnPro: 2.241 ± 0.316
1.446AsnGln: 1.446 ± 0.295
2.241AsnArg: 2.241 ± 0.423
2.747AsnSer: 2.747 ± 0.562
2.964AsnThr: 2.964 ± 0.466
2.747AsnVal: 2.747 ± 0.409
0.434AsnTrp: 0.434 ± 0.375
1.084AsnTyr: 1.084 ± 0.272
0.0AsnXaa: 0.0 ± 0.0
Pro
6.29ProAla: 6.29 ± 0.997
0.434ProCys: 0.434 ± 0.142
3.326ProAsp: 3.326 ± 0.466
2.964ProGlu: 2.964 ± 0.503
1.807ProPhe: 1.807 ± 0.329
3.398ProGly: 3.398 ± 0.509
0.506ProHis: 0.506 ± 0.176
1.807ProIle: 1.807 ± 0.337
2.313ProLys: 2.313 ± 0.323
2.82ProLeu: 2.82 ± 0.388
0.868ProMet: 0.868 ± 0.307
1.591ProAsn: 1.591 ± 0.348
1.735ProPro: 1.735 ± 0.467
1.374ProGln: 1.374 ± 0.296
2.241ProArg: 2.241 ± 0.35
1.88ProSer: 1.88 ± 0.366
3.687ProThr: 3.687 ± 0.401
3.253ProVal: 3.253 ± 0.586
0.651ProTrp: 0.651 ± 0.218
1.952ProTyr: 1.952 ± 0.505
0.0ProXaa: 0.0 ± 0.0
Gln
6.073GlnAla: 6.073 ± 0.669
0.361GlnCys: 0.361 ± 0.162
2.747GlnAsp: 2.747 ± 0.431
2.313GlnGlu: 2.313 ± 0.406
1.663GlnPhe: 1.663 ± 0.313
3.253GlnGly: 3.253 ± 0.521
0.868GlnHis: 0.868 ± 0.29
2.241GlnIle: 2.241 ± 0.534
2.169GlnLys: 2.169 ± 0.352
3.253GlnLeu: 3.253 ± 0.366
1.374GlnMet: 1.374 ± 0.349
2.169GlnAsn: 2.169 ± 0.349
1.518GlnPro: 1.518 ± 0.318
3.109GlnGln: 3.109 ± 0.522
3.543GlnArg: 3.543 ± 0.502
2.386GlnSer: 2.386 ± 0.455
2.675GlnThr: 2.675 ± 0.463
2.458GlnVal: 2.458 ± 0.347
0.795GlnTrp: 0.795 ± 0.246
1.952GlnTyr: 1.952 ± 0.33
0.0GlnXaa: 0.0 ± 0.0
Arg
6.579ArgAla: 6.579 ± 0.851
0.651ArgCys: 0.651 ± 0.241
3.109ArgAsp: 3.109 ± 0.488
3.759ArgGlu: 3.759 ± 0.639
2.024ArgPhe: 2.024 ± 0.396
4.627ArgGly: 4.627 ± 0.563
0.868ArgHis: 0.868 ± 0.247
3.326ArgIle: 3.326 ± 0.424
2.82ArgLys: 2.82 ± 0.527
4.699ArgLeu: 4.699 ± 0.604
1.952ArgMet: 1.952 ± 0.317
2.892ArgAsn: 2.892 ± 0.402
1.88ArgPro: 1.88 ± 0.375
2.892ArgGln: 2.892 ± 0.387
4.916ArgArg: 4.916 ± 0.536
4.121ArgSer: 4.121 ± 0.427
2.53ArgThr: 2.53 ± 0.373
3.326ArgVal: 3.326 ± 0.549
0.868ArgTrp: 0.868 ± 0.304
1.591ArgTyr: 1.591 ± 0.349
0.0ArgXaa: 0.0 ± 0.0
Ser
8.169SerAla: 8.169 ± 2.05
0.434SerCys: 0.434 ± 0.177
3.47SerAsp: 3.47 ± 0.464
1.663SerGlu: 1.663 ± 0.319
2.169SerPhe: 2.169 ± 0.505
5.133SerGly: 5.133 ± 0.865
1.157SerHis: 1.157 ± 0.321
2.892SerIle: 2.892 ± 0.496
2.458SerLys: 2.458 ± 0.509
5.205SerLeu: 5.205 ± 0.529
1.735SerMet: 1.735 ± 0.344
2.024SerAsn: 2.024 ± 0.404
2.024SerPro: 2.024 ± 0.482
2.241SerGln: 2.241 ± 0.382
3.253SerArg: 3.253 ± 0.506
2.53SerSer: 2.53 ± 0.675
4.193SerThr: 4.193 ± 0.654
3.904SerVal: 3.904 ± 0.393
1.229SerTrp: 1.229 ± 0.418
1.663SerTyr: 1.663 ± 0.326
0.0SerXaa: 0.0 ± 0.0
Thr
7.013ThrAla: 7.013 ± 0.833
0.723ThrCys: 0.723 ± 0.234
3.759ThrAsp: 3.759 ± 0.48
3.326ThrGlu: 3.326 ± 0.526
2.024ThrPhe: 2.024 ± 0.384
4.772ThrGly: 4.772 ± 0.64
0.506ThrHis: 0.506 ± 0.169
2.386ThrIle: 2.386 ± 0.351
2.024ThrLys: 2.024 ± 0.396
6.145ThrLeu: 6.145 ± 0.685
2.024ThrMet: 2.024 ± 0.4
2.603ThrAsn: 2.603 ± 0.711
3.615ThrPro: 3.615 ± 0.614
3.181ThrGln: 3.181 ± 0.573
3.036ThrArg: 3.036 ± 0.365
3.904ThrSer: 3.904 ± 0.502
4.121ThrThr: 4.121 ± 0.601
5.205ThrVal: 5.205 ± 0.781
1.012ThrTrp: 1.012 ± 0.193
1.663ThrTyr: 1.663 ± 0.385
0.0ThrXaa: 0.0 ± 0.0
Val
8.892ValAla: 8.892 ± 0.89
0.868ValCys: 0.868 ± 0.252
3.615ValAsp: 3.615 ± 0.395
3.181ValGlu: 3.181 ± 0.549
1.663ValPhe: 1.663 ± 0.305
5.711ValGly: 5.711 ± 0.753
2.097ValHis: 2.097 ± 0.45
2.313ValIle: 2.313 ± 0.419
3.398ValLys: 3.398 ± 0.53
4.338ValLeu: 4.338 ± 0.499
1.591ValMet: 1.591 ± 0.3
1.952ValAsn: 1.952 ± 0.418
4.193ValPro: 4.193 ± 0.426
4.193ValGln: 4.193 ± 0.486
3.832ValArg: 3.832 ± 0.498
3.904ValSer: 3.904 ± 0.618
4.049ValThr: 4.049 ± 0.635
5.422ValVal: 5.422 ± 0.861
1.229ValTrp: 1.229 ± 0.36
1.518ValTyr: 1.518 ± 0.224
0.0ValXaa: 0.0 ± 0.0
Trp
2.024TrpAla: 2.024 ± 0.469
0.217TrpCys: 0.217 ± 0.118
0.723TrpAsp: 0.723 ± 0.25
0.94TrpGlu: 0.94 ± 0.403
0.578TrpPhe: 0.578 ± 0.173
0.795TrpGly: 0.795 ± 0.221
0.361TrpHis: 0.361 ± 0.189
0.506TrpIle: 0.506 ± 0.227
0.434TrpLys: 0.434 ± 0.181
1.229TrpLeu: 1.229 ± 0.356
0.434TrpMet: 0.434 ± 0.15
0.506TrpAsn: 0.506 ± 0.247
0.651TrpPro: 0.651 ± 0.175
0.868TrpGln: 0.868 ± 0.198
0.723TrpArg: 0.723 ± 0.219
1.157TrpSer: 1.157 ± 0.253
1.229TrpThr: 1.229 ± 0.309
0.578TrpVal: 0.578 ± 0.221
0.217TrpTrp: 0.217 ± 0.143
0.289TrpTyr: 0.289 ± 0.128
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.603TyrAla: 2.603 ± 0.317
0.361TyrCys: 0.361 ± 0.163
1.663TyrAsp: 1.663 ± 0.344
2.458TyrGlu: 2.458 ± 0.38
0.868TyrPhe: 0.868 ± 0.229
2.53TyrGly: 2.53 ± 0.527
0.434TyrHis: 0.434 ± 0.16
1.518TyrIle: 1.518 ± 0.33
1.374TyrLys: 1.374 ± 0.273
3.181TyrLeu: 3.181 ± 0.442
0.795TyrMet: 0.795 ± 0.222
1.663TyrAsn: 1.663 ± 0.386
1.374TyrPro: 1.374 ± 0.308
1.518TyrGln: 1.518 ± 0.326
1.952TyrArg: 1.952 ± 0.285
1.591TyrSer: 1.591 ± 0.456
1.301TyrThr: 1.301 ± 0.279
1.807TyrVal: 1.807 ± 0.363
0.217TyrTrp: 0.217 ± 0.12
0.723TyrTyr: 0.723 ± 0.223
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 56 proteins (13833 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski