Amino acid dipepetide frequency for Ralstonia phage phiRSP

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
18.151AlaAla: 18.151 ± 1.369
1.167AlaCys: 1.167 ± 0.343
8.816AlaAsp: 8.816 ± 1.092
7.26AlaGlu: 7.26 ± 1.011
3.112AlaPhe: 3.112 ± 0.592
12.447AlaGly: 12.447 ± 1.256
2.204AlaHis: 2.204 ± 0.696
4.927AlaIle: 4.927 ± 0.875
5.705AlaLys: 5.705 ± 0.982
8.298AlaLeu: 8.298 ± 1.107
3.76AlaMet: 3.76 ± 0.461
5.445AlaAsn: 5.445 ± 1.034
5.834AlaPro: 5.834 ± 0.808
4.278AlaGln: 4.278 ± 0.636
7.131AlaArg: 7.131 ± 0.743
6.094AlaSer: 6.094 ± 0.699
7.779AlaThr: 7.779 ± 1.264
8.946AlaVal: 8.946 ± 1.208
1.685AlaTrp: 1.685 ± 0.398
2.723AlaTyr: 2.723 ± 0.513
0.0AlaXaa: 0.0 ± 0.0
Cys
0.648CysAla: 0.648 ± 0.287
0.389CysCys: 0.389 ± 0.199
0.648CysAsp: 0.648 ± 0.389
0.259CysGlu: 0.259 ± 0.174
0.13CysPhe: 0.13 ± 0.126
0.908CysGly: 0.908 ± 0.329
0.13CysHis: 0.13 ± 0.126
0.519CysIle: 0.519 ± 0.257
0.648CysLys: 0.648 ± 0.251
0.13CysLeu: 0.13 ± 0.112
0.259CysMet: 0.259 ± 0.213
0.13CysAsn: 0.13 ± 0.108
0.778CysPro: 0.778 ± 0.321
0.13CysGln: 0.13 ± 0.161
0.259CysArg: 0.259 ± 0.204
0.908CysSer: 0.908 ± 0.284
0.259CysThr: 0.259 ± 0.166
0.259CysVal: 0.259 ± 0.226
0.13CysTrp: 0.13 ± 0.133
0.259CysTyr: 0.259 ± 0.176
0.0CysXaa: 0.0 ± 0.0
Asp
6.872AspAla: 6.872 ± 0.846
0.519AspCys: 0.519 ± 0.264
3.89AspAsp: 3.89 ± 0.82
4.538AspGlu: 4.538 ± 0.878
2.204AspPhe: 2.204 ± 0.6
8.038AspGly: 8.038 ± 1.339
1.426AspHis: 1.426 ± 0.365
2.334AspIle: 2.334 ± 0.522
2.852AspLys: 2.852 ± 0.447
4.538AspLeu: 4.538 ± 0.729
2.074AspMet: 2.074 ± 0.517
1.945AspAsn: 1.945 ± 0.429
3.76AspPro: 3.76 ± 0.627
2.463AspGln: 2.463 ± 0.583
3.501AspArg: 3.501 ± 0.818
2.334AspSer: 2.334 ± 0.68
2.852AspThr: 2.852 ± 0.475
6.483AspVal: 6.483 ± 0.816
1.037AspTrp: 1.037 ± 0.293
1.297AspTyr: 1.297 ± 0.475
0.0AspXaa: 0.0 ± 0.0
Glu
5.186GluAla: 5.186 ± 0.956
0.648GluCys: 0.648 ± 0.296
1.945GluAsp: 1.945 ± 0.583
2.204GluGlu: 2.204 ± 0.593
2.463GluPhe: 2.463 ± 0.565
3.371GluGly: 3.371 ± 0.55
1.297GluHis: 1.297 ± 0.404
3.112GluIle: 3.112 ± 0.694
2.463GluLys: 2.463 ± 0.523
5.575GluLeu: 5.575 ± 0.974
1.815GluMet: 1.815 ± 0.48
1.556GluAsn: 1.556 ± 0.499
2.204GluPro: 2.204 ± 0.526
2.074GluGln: 2.074 ± 0.487
4.797GluArg: 4.797 ± 0.631
2.463GluSer: 2.463 ± 0.515
2.982GluThr: 2.982 ± 0.555
2.723GluVal: 2.723 ± 0.563
1.815GluTrp: 1.815 ± 0.606
1.685GluTyr: 1.685 ± 0.406
0.0GluXaa: 0.0 ± 0.0
Phe
3.241PheAla: 3.241 ± 0.512
0.13PheCys: 0.13 ± 0.112
2.723PheAsp: 2.723 ± 0.629
1.167PheGlu: 1.167 ± 0.342
0.519PhePhe: 0.519 ± 0.201
3.501PheGly: 3.501 ± 0.503
0.648PheHis: 0.648 ± 0.244
1.685PheIle: 1.685 ± 0.572
1.815PheLys: 1.815 ± 0.457
2.074PheLeu: 2.074 ± 0.63
0.648PheMet: 0.648 ± 0.266
1.945PheAsn: 1.945 ± 0.498
1.426PhePro: 1.426 ± 0.296
1.037PheGln: 1.037 ± 0.365
2.074PheArg: 2.074 ± 0.46
1.945PheSer: 1.945 ± 0.525
1.945PheThr: 1.945 ± 0.523
2.204PheVal: 2.204 ± 0.63
0.519PheTrp: 0.519 ± 0.239
0.13PheTyr: 0.13 ± 0.113
0.0PheXaa: 0.0 ± 0.0
Gly
11.409GlyAla: 11.409 ± 1.55
0.389GlyCys: 0.389 ± 0.229
7.26GlyAsp: 7.26 ± 0.756
4.797GlyGlu: 4.797 ± 0.935
3.63GlyPhe: 3.63 ± 0.867
6.742GlyGly: 6.742 ± 1.225
1.297GlyHis: 1.297 ± 0.4
4.927GlyIle: 4.927 ± 0.657
4.538GlyLys: 4.538 ± 0.711
5.186GlyLeu: 5.186 ± 0.869
2.074GlyMet: 2.074 ± 0.585
3.76GlyAsn: 3.76 ± 0.755
2.204GlyPro: 2.204 ± 0.553
4.149GlyGln: 4.149 ± 0.691
5.964GlyArg: 5.964 ± 0.935
4.927GlySer: 4.927 ± 0.793
5.834GlyThr: 5.834 ± 1.07
6.612GlyVal: 6.612 ± 0.898
1.815GlyTrp: 1.815 ± 0.456
2.334GlyTyr: 2.334 ± 0.467
0.0GlyXaa: 0.0 ± 0.0
His
2.334HisAla: 2.334 ± 0.488
0.0HisCys: 0.0 ± 0.0
0.778HisAsp: 0.778 ± 0.275
1.037HisGlu: 1.037 ± 0.338
0.648HisPhe: 0.648 ± 0.247
1.945HisGly: 1.945 ± 0.465
0.778HisHis: 0.778 ± 0.271
1.297HisIle: 1.297 ± 0.434
0.778HisLys: 0.778 ± 0.376
1.685HisLeu: 1.685 ± 0.44
0.519HisMet: 0.519 ± 0.31
0.259HisAsn: 0.259 ± 0.206
1.037HisPro: 1.037 ± 0.418
0.519HisGln: 0.519 ± 0.223
0.259HisArg: 0.259 ± 0.172
0.778HisSer: 0.778 ± 0.299
0.778HisThr: 0.778 ± 0.498
1.167HisVal: 1.167 ± 0.33
0.648HisTrp: 0.648 ± 0.215
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
5.964IleAla: 5.964 ± 0.884
0.519IleCys: 0.519 ± 0.256
4.408IleAsp: 4.408 ± 0.771
3.63IleGlu: 3.63 ± 0.812
1.426IlePhe: 1.426 ± 0.506
4.019IleGly: 4.019 ± 0.834
1.167IleHis: 1.167 ± 0.356
2.593IleIle: 2.593 ± 0.695
2.463IleLys: 2.463 ± 0.416
2.463IleLeu: 2.463 ± 0.549
0.13IleMet: 0.13 ± 0.108
2.204IleAsn: 2.204 ± 0.462
1.945IlePro: 1.945 ± 0.599
1.945IleGln: 1.945 ± 0.431
3.241IleArg: 3.241 ± 0.752
2.723IleSer: 2.723 ± 0.581
3.112IleThr: 3.112 ± 0.778
3.112IleVal: 3.112 ± 0.597
0.389IleTrp: 0.389 ± 0.207
0.259IleTyr: 0.259 ± 0.152
0.0IleXaa: 0.0 ± 0.0
Lys
4.667LysAla: 4.667 ± 0.712
0.259LysCys: 0.259 ± 0.201
2.204LysAsp: 2.204 ± 0.636
1.685LysGlu: 1.685 ± 0.4
1.297LysPhe: 1.297 ± 0.42
5.445LysGly: 5.445 ± 0.873
0.259LysHis: 0.259 ± 0.237
3.112LysIle: 3.112 ± 0.621
1.037LysLys: 1.037 ± 0.313
3.89LysLeu: 3.89 ± 0.606
0.778LysMet: 0.778 ± 0.319
1.297LysAsn: 1.297 ± 0.303
2.463LysPro: 2.463 ± 0.643
1.815LysGln: 1.815 ± 0.554
3.63LysArg: 3.63 ± 0.832
2.852LysSer: 2.852 ± 0.502
2.852LysThr: 2.852 ± 0.559
2.723LysVal: 2.723 ± 0.518
0.648LysTrp: 0.648 ± 0.274
1.297LysTyr: 1.297 ± 0.323
0.0LysXaa: 0.0 ± 0.0
Leu
10.242LeuAla: 10.242 ± 1.358
0.389LeuCys: 0.389 ± 0.205
4.797LeuAsp: 4.797 ± 0.767
3.89LeuGlu: 3.89 ± 0.814
2.723LeuPhe: 2.723 ± 0.547
5.316LeuGly: 5.316 ± 0.852
1.426LeuHis: 1.426 ± 0.436
2.852LeuIle: 2.852 ± 0.598
3.63LeuLys: 3.63 ± 0.744
6.094LeuLeu: 6.094 ± 0.932
2.074LeuMet: 2.074 ± 0.487
2.593LeuAsn: 2.593 ± 0.714
4.667LeuPro: 4.667 ± 0.752
3.501LeuGln: 3.501 ± 0.591
5.445LeuArg: 5.445 ± 0.879
4.278LeuSer: 4.278 ± 1.018
3.76LeuThr: 3.76 ± 0.709
6.223LeuVal: 6.223 ± 0.62
1.167LeuTrp: 1.167 ± 0.408
1.037LeuTyr: 1.037 ± 0.357
0.0LeuXaa: 0.0 ± 0.0
Met
3.371MetAla: 3.371 ± 0.675
0.259MetCys: 0.259 ± 0.164
1.297MetAsp: 1.297 ± 0.412
1.556MetGlu: 1.556 ± 0.543
0.519MetPhe: 0.519 ± 0.284
3.112MetGly: 3.112 ± 0.547
0.13MetHis: 0.13 ± 0.12
1.037MetIle: 1.037 ± 0.426
1.556MetLys: 1.556 ± 0.382
1.556MetLeu: 1.556 ± 0.38
0.389MetMet: 0.389 ± 0.199
0.648MetAsn: 0.648 ± 0.25
1.426MetPro: 1.426 ± 0.609
1.037MetGln: 1.037 ± 0.279
2.463MetArg: 2.463 ± 0.538
1.815MetSer: 1.815 ± 0.408
1.297MetThr: 1.297 ± 0.397
1.556MetVal: 1.556 ± 0.513
0.389MetTrp: 0.389 ± 0.201
0.519MetTyr: 0.519 ± 0.179
0.0MetXaa: 0.0 ± 0.0
Asn
5.316AsnAla: 5.316 ± 1.003
0.648AsnCys: 0.648 ± 0.221
2.204AsnAsp: 2.204 ± 0.548
1.815AsnGlu: 1.815 ± 0.454
0.778AsnPhe: 0.778 ± 0.431
2.723AsnGly: 2.723 ± 0.564
0.13AsnHis: 0.13 ± 0.108
1.426AsnIle: 1.426 ± 0.428
1.426AsnLys: 1.426 ± 0.313
4.149AsnLeu: 4.149 ± 0.541
0.519AsnMet: 0.519 ± 0.351
0.908AsnAsn: 0.908 ± 0.357
2.593AsnPro: 2.593 ± 0.492
0.778AsnGln: 0.778 ± 0.313
2.204AsnArg: 2.204 ± 0.612
2.074AsnSer: 2.074 ± 0.477
2.334AsnThr: 2.334 ± 0.724
3.63AsnVal: 3.63 ± 0.433
0.389AsnTrp: 0.389 ± 0.18
0.778AsnTyr: 0.778 ± 0.302
0.0AsnXaa: 0.0 ± 0.0
Pro
6.483ProAla: 6.483 ± 0.947
0.13ProCys: 0.13 ± 0.112
3.76ProAsp: 3.76 ± 0.639
1.685ProGlu: 1.685 ± 0.474
0.648ProPhe: 0.648 ± 0.309
5.186ProGly: 5.186 ± 0.717
1.037ProHis: 1.037 ± 0.299
1.167ProIle: 1.167 ± 0.374
1.556ProLys: 1.556 ± 0.4
3.89ProLeu: 3.89 ± 0.67
1.685ProMet: 1.685 ± 0.392
2.723ProAsn: 2.723 ± 0.668
2.334ProPro: 2.334 ± 0.508
2.463ProGln: 2.463 ± 0.561
2.982ProArg: 2.982 ± 0.559
1.685ProSer: 1.685 ± 0.327
4.408ProThr: 4.408 ± 0.733
2.463ProVal: 2.463 ± 0.759
1.167ProTrp: 1.167 ± 0.352
1.426ProTyr: 1.426 ± 0.438
0.0ProXaa: 0.0 ± 0.0
Gln
5.705GlnAla: 5.705 ± 0.709
0.13GlnCys: 0.13 ± 0.126
1.815GlnAsp: 1.815 ± 0.442
1.815GlnGlu: 1.815 ± 0.554
0.778GlnPhe: 0.778 ± 0.231
2.723GlnGly: 2.723 ± 0.926
0.778GlnHis: 0.778 ± 0.321
1.945GlnIle: 1.945 ± 0.471
1.037GlnLys: 1.037 ± 0.411
2.982GlnLeu: 2.982 ± 0.717
1.815GlnMet: 1.815 ± 0.606
1.685GlnAsn: 1.685 ± 0.421
2.074GlnPro: 2.074 ± 0.417
1.945GlnGln: 1.945 ± 0.447
4.019GlnArg: 4.019 ± 0.618
2.334GlnSer: 2.334 ± 0.516
1.685GlnThr: 1.685 ± 0.394
2.723GlnVal: 2.723 ± 0.547
0.648GlnTrp: 0.648 ± 0.311
1.167GlnTyr: 1.167 ± 0.407
0.0GlnXaa: 0.0 ± 0.0
Arg
9.076ArgAla: 9.076 ± 1.041
0.13ArgCys: 0.13 ± 0.132
3.89ArgAsp: 3.89 ± 0.977
4.667ArgGlu: 4.667 ± 0.741
2.074ArgPhe: 2.074 ± 0.522
5.186ArgGly: 5.186 ± 0.922
0.778ArgHis: 0.778 ± 0.308
2.723ArgIle: 2.723 ± 0.834
2.852ArgLys: 2.852 ± 0.566
5.316ArgLeu: 5.316 ± 0.884
1.945ArgMet: 1.945 ± 0.391
1.945ArgAsn: 1.945 ± 0.366
3.371ArgPro: 3.371 ± 0.56
2.723ArgGln: 2.723 ± 0.633
4.927ArgArg: 4.927 ± 0.927
2.723ArgSer: 2.723 ± 0.501
4.278ArgThr: 4.278 ± 0.872
3.241ArgVal: 3.241 ± 0.493
2.074ArgTrp: 2.074 ± 0.62
2.723ArgTyr: 2.723 ± 0.629
0.0ArgXaa: 0.0 ± 0.0
Ser
6.612SerAla: 6.612 ± 0.922
0.13SerCys: 0.13 ± 0.126
3.371SerAsp: 3.371 ± 0.699
2.204SerGlu: 2.204 ± 0.497
2.334SerPhe: 2.334 ± 0.481
4.667SerGly: 4.667 ± 0.578
0.648SerHis: 0.648 ± 0.232
2.204SerIle: 2.204 ± 0.427
2.723SerLys: 2.723 ± 0.438
4.408SerLeu: 4.408 ± 0.637
0.778SerMet: 0.778 ± 0.261
2.074SerAsn: 2.074 ± 0.493
1.815SerPro: 1.815 ± 0.39
1.945SerGln: 1.945 ± 0.416
3.501SerArg: 3.501 ± 0.659
2.593SerSer: 2.593 ± 0.796
3.371SerThr: 3.371 ± 0.661
2.852SerVal: 2.852 ± 0.555
1.037SerTrp: 1.037 ± 0.415
1.426SerTyr: 1.426 ± 0.54
0.0SerXaa: 0.0 ± 0.0
Thr
7.001ThrAla: 7.001 ± 0.999
0.908ThrCys: 0.908 ± 0.25
4.019ThrAsp: 4.019 ± 0.568
1.815ThrGlu: 1.815 ± 0.363
2.852ThrPhe: 2.852 ± 0.501
5.186ThrGly: 5.186 ± 0.815
1.426ThrHis: 1.426 ± 0.665
5.056ThrIle: 5.056 ± 0.859
1.945ThrLys: 1.945 ± 0.584
5.316ThrLeu: 5.316 ± 0.927
1.297ThrMet: 1.297 ± 0.413
1.685ThrAsn: 1.685 ± 0.448
2.852ThrPro: 2.852 ± 0.432
2.463ThrGln: 2.463 ± 0.56
3.112ThrArg: 3.112 ± 0.956
3.112ThrSer: 3.112 ± 0.497
3.89ThrThr: 3.89 ± 0.893
5.316ThrVal: 5.316 ± 0.78
0.778ThrTrp: 0.778 ± 0.32
1.297ThrTyr: 1.297 ± 0.405
0.0ThrXaa: 0.0 ± 0.0
Val
8.038ValAla: 8.038 ± 0.779
0.519ValCys: 0.519 ± 0.244
4.278ValAsp: 4.278 ± 0.656
4.538ValGlu: 4.538 ± 0.643
1.945ValPhe: 1.945 ± 0.416
4.667ValGly: 4.667 ± 1.024
1.167ValHis: 1.167 ± 0.289
3.112ValIle: 3.112 ± 0.707
3.501ValLys: 3.501 ± 0.884
5.575ValLeu: 5.575 ± 0.652
1.815ValMet: 1.815 ± 0.519
2.852ValAsn: 2.852 ± 0.607
4.149ValPro: 4.149 ± 0.638
2.723ValGln: 2.723 ± 0.394
4.927ValArg: 4.927 ± 0.503
3.241ValSer: 3.241 ± 0.488
5.056ValThr: 5.056 ± 0.827
6.223ValVal: 6.223 ± 1.481
0.908ValTrp: 0.908 ± 0.462
1.815ValTyr: 1.815 ± 0.445
0.0ValXaa: 0.0 ± 0.0
Trp
2.334TrpAla: 2.334 ± 0.47
0.389TrpCys: 0.389 ± 0.17
0.908TrpAsp: 0.908 ± 0.356
0.778TrpGlu: 0.778 ± 0.348
0.908TrpPhe: 0.908 ± 0.293
1.556TrpGly: 1.556 ± 0.444
0.259TrpHis: 0.259 ± 0.17
0.908TrpIle: 0.908 ± 0.423
0.648TrpLys: 0.648 ± 0.282
1.815TrpLeu: 1.815 ± 0.411
0.778TrpMet: 0.778 ± 0.325
0.648TrpAsn: 0.648 ± 0.336
0.519TrpPro: 0.519 ± 0.238
0.519TrpGln: 0.519 ± 0.258
0.908TrpArg: 0.908 ± 0.328
1.037TrpSer: 1.037 ± 0.273
0.778TrpThr: 0.778 ± 0.273
1.297TrpVal: 1.297 ± 0.321
0.519TrpTrp: 0.519 ± 0.262
0.389TrpTyr: 0.389 ± 0.231
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.241TyrAla: 3.241 ± 0.632
0.13TyrCys: 0.13 ± 0.125
1.945TyrAsp: 1.945 ± 0.486
1.037TyrGlu: 1.037 ± 0.508
0.389TyrPhe: 0.389 ± 0.245
3.112TyrGly: 3.112 ± 0.601
0.259TyrHis: 0.259 ± 0.171
0.778TyrIle: 0.778 ± 0.238
1.037TyrLys: 1.037 ± 0.388
1.167TyrLeu: 1.167 ± 0.281
0.648TyrMet: 0.648 ± 0.247
0.519TyrAsn: 0.519 ± 0.191
1.426TyrPro: 1.426 ± 0.347
1.297TyrGln: 1.297 ± 0.285
1.297TyrArg: 1.297 ± 0.319
0.778TyrSer: 0.778 ± 0.21
2.074TyrThr: 2.074 ± 0.45
1.297TyrVal: 1.297 ± 0.439
0.13TyrTrp: 0.13 ± 0.108
1.037TyrTyr: 1.037 ± 0.448
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 24 proteins (7714 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski