Amino acid dipepetide frequency for Helicobacter phage Pt21299RU

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.947AlaAla: 0.947 ± 0.384
0.541AlaCys: 0.541 ± 0.339
1.893AlaAsp: 1.893 ± 0.517
3.652AlaGlu: 3.652 ± 0.887
4.463AlaPhe: 4.463 ± 0.879
2.84AlaGly: 2.84 ± 0.842
0.947AlaHis: 0.947 ± 0.299
5.816AlaIle: 5.816 ± 0.992
6.492AlaLys: 6.492 ± 1.082
11.631AlaLeu: 11.631 ± 1.357
1.217AlaMet: 1.217 ± 0.461
5.545AlaAsn: 5.545 ± 1.013
1.082AlaPro: 1.082 ± 0.336
2.705AlaGln: 2.705 ± 0.617
3.246AlaArg: 3.246 ± 0.658
2.975AlaSer: 2.975 ± 0.671
2.434AlaThr: 2.434 ± 0.632
2.029AlaVal: 2.029 ± 0.636
0.27AlaTrp: 0.27 ± 0.198
2.57AlaTyr: 2.57 ± 0.652
0.0AlaXaa: 0.0 ± 0.0
Cys
0.541CysAla: 0.541 ± 0.312
0.135CysCys: 0.135 ± 0.14
0.541CysAsp: 0.541 ± 0.559
0.947CysGlu: 0.947 ± 0.384
0.541CysPhe: 0.541 ± 0.336
0.406CysGly: 0.406 ± 0.315
0.0CysHis: 0.0 ± 0.0
0.406CysIle: 0.406 ± 0.352
0.135CysLys: 0.135 ± 0.13
1.082CysLeu: 1.082 ± 0.569
0.0CysMet: 0.0 ± 0.0
0.406CysAsn: 0.406 ± 0.305
0.406CysPro: 0.406 ± 0.252
0.135CysGln: 0.135 ± 0.13
0.135CysArg: 0.135 ± 0.13
0.0CysSer: 0.0 ± 0.0
0.27CysThr: 0.27 ± 0.196
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.27CysTyr: 0.27 ± 0.187
0.0CysXaa: 0.0 ± 0.0
Asp
2.434AspAla: 2.434 ± 0.567
0.135AspCys: 0.135 ± 0.155
1.217AspAsp: 1.217 ± 0.306
3.652AspGlu: 3.652 ± 0.672
4.598AspPhe: 4.598 ± 0.801
1.352AspGly: 1.352 ± 0.347
0.541AspHis: 0.541 ± 0.231
3.246AspIle: 3.246 ± 0.949
6.357AspLys: 6.357 ± 1.128
6.627AspLeu: 6.627 ± 1.077
1.217AspMet: 1.217 ± 0.525
5.004AspAsn: 5.004 ± 1.006
1.623AspPro: 1.623 ± 0.569
0.541AspGln: 0.541 ± 0.308
2.164AspArg: 2.164 ± 0.661
3.381AspSer: 3.381 ± 0.704
1.352AspThr: 1.352 ± 0.471
1.623AspVal: 1.623 ± 0.496
0.0AspTrp: 0.0 ± 0.0
3.246AspTyr: 3.246 ± 0.779
0.0AspXaa: 0.0 ± 0.0
Glu
6.221GluAla: 6.221 ± 0.96
0.406GluCys: 0.406 ± 0.235
2.299GluAsp: 2.299 ± 0.484
4.463GluGlu: 4.463 ± 0.682
4.463GluPhe: 4.463 ± 0.693
1.893GluGly: 1.893 ± 0.722
1.488GluHis: 1.488 ± 0.427
7.709GluIle: 7.709 ± 1.162
9.061GluLys: 9.061 ± 1.726
9.197GluLeu: 9.197 ± 1.632
1.217GluMet: 1.217 ± 0.344
7.303GluAsn: 7.303 ± 0.866
2.164GluPro: 2.164 ± 0.686
6.897GluGln: 6.897 ± 1.815
6.627GluArg: 6.627 ± 1.098
7.844GluSer: 7.844 ± 1.331
4.734GluThr: 4.734 ± 0.639
4.057GluVal: 4.057 ± 1.09
0.406GluTrp: 0.406 ± 0.227
1.623GluTyr: 1.623 ± 0.364
0.0GluXaa: 0.0 ± 0.0
Phe
1.082PheAla: 1.082 ± 0.371
0.406PheCys: 0.406 ± 0.222
2.975PheAsp: 2.975 ± 0.752
3.246PheGlu: 3.246 ± 0.695
3.516PhePhe: 3.516 ± 0.733
1.488PheGly: 1.488 ± 0.38
0.676PheHis: 0.676 ± 0.246
3.922PheIle: 3.922 ± 0.592
7.709PheLys: 7.709 ± 0.886
6.357PheLeu: 6.357 ± 1.016
0.541PheMet: 0.541 ± 0.361
2.705PheAsn: 2.705 ± 0.617
0.406PhePro: 0.406 ± 0.195
0.947PheGln: 0.947 ± 0.314
1.488PheArg: 1.488 ± 0.517
4.869PheSer: 4.869 ± 0.878
2.705PheThr: 2.705 ± 0.584
1.217PheVal: 1.217 ± 0.436
0.406PheTrp: 0.406 ± 0.224
1.893PheTyr: 1.893 ± 0.5
0.0PheXaa: 0.0 ± 0.0
Gly
3.111GlyAla: 3.111 ± 1.089
0.406GlyCys: 0.406 ± 0.268
2.029GlyAsp: 2.029 ± 0.525
2.164GlyGlu: 2.164 ± 0.581
2.434GlyPhe: 2.434 ± 0.683
2.84GlyGly: 2.84 ± 0.793
0.541GlyHis: 0.541 ± 0.317
2.57GlyIle: 2.57 ± 0.476
2.029GlyLys: 2.029 ± 0.478
4.869GlyLeu: 4.869 ± 0.77
1.217GlyMet: 1.217 ± 0.334
3.381GlyAsn: 3.381 ± 0.763
0.0GlyPro: 0.0 ± 0.0
1.623GlyGln: 1.623 ± 0.371
1.488GlyArg: 1.488 ± 0.57
2.164GlySer: 2.164 ± 0.802
1.082GlyThr: 1.082 ± 0.374
3.516GlyVal: 3.516 ± 1.5
0.0GlyTrp: 0.0 ± 0.0
1.488GlyTyr: 1.488 ± 0.379
0.0GlyXaa: 0.0 ± 0.0
His
0.541HisAla: 0.541 ± 0.23
0.135HisCys: 0.135 ± 0.14
0.541HisAsp: 0.541 ± 0.251
0.947HisGlu: 0.947 ± 0.299
0.676HisPhe: 0.676 ± 0.387
0.27HisGly: 0.27 ± 0.183
0.0HisHis: 0.0 ± 0.0
1.082HisIle: 1.082 ± 0.358
1.623HisLys: 1.623 ± 0.469
1.758HisLeu: 1.758 ± 0.555
0.27HisMet: 0.27 ± 0.182
1.082HisAsn: 1.082 ± 0.467
0.406HisPro: 0.406 ± 0.207
0.27HisGln: 0.27 ± 0.16
0.676HisArg: 0.676 ± 0.229
0.811HisSer: 0.811 ± 0.339
0.947HisThr: 0.947 ± 0.382
0.135HisVal: 0.135 ± 0.102
0.0HisTrp: 0.0 ± 0.0
0.811HisTyr: 0.811 ± 0.254
0.0HisXaa: 0.0 ± 0.0
Ile
4.598IleAla: 4.598 ± 0.958
0.811IleCys: 0.811 ± 0.495
3.922IleAsp: 3.922 ± 0.856
6.221IleGlu: 6.221 ± 1.028
2.029IlePhe: 2.029 ± 0.348
1.623IleGly: 1.623 ± 0.64
1.082IleHis: 1.082 ± 0.436
3.516IleIle: 3.516 ± 0.605
8.791IleLys: 8.791 ± 1.224
7.168IleLeu: 7.168 ± 0.758
0.811IleMet: 0.811 ± 0.311
4.193IleAsn: 4.193 ± 0.942
1.893IlePro: 1.893 ± 0.503
3.922IleGln: 3.922 ± 0.766
3.246IleArg: 3.246 ± 0.586
3.652IleSer: 3.652 ± 0.676
4.328IleThr: 4.328 ± 0.863
3.246IleVal: 3.246 ± 0.659
0.135IleTrp: 0.135 ± 0.16
2.84IleTyr: 2.84 ± 0.638
0.0IleXaa: 0.0 ± 0.0
Lys
8.791LysAla: 8.791 ± 1.533
0.0LysCys: 0.0 ± 0.0
8.115LysAsp: 8.115 ± 1.727
15.012LysGlu: 15.012 ± 2.359
2.84LysPhe: 2.84 ± 0.411
3.652LysGly: 3.652 ± 0.608
1.758LysHis: 1.758 ± 0.679
8.25LysIle: 8.25 ± 1.213
9.332LysLys: 9.332 ± 1.491
8.52LysLeu: 8.52 ± 1.111
1.488LysMet: 1.488 ± 0.508
10.008LysAsn: 10.008 ± 1.248
4.057LysPro: 4.057 ± 0.91
6.897LysGln: 6.897 ± 1.276
4.598LysArg: 4.598 ± 1.073
6.221LysSer: 6.221 ± 1.069
5.816LysThr: 5.816 ± 0.756
4.598LysVal: 4.598 ± 0.936
0.947LysTrp: 0.947 ± 0.397
2.57LysTyr: 2.57 ± 0.635
0.0LysXaa: 0.0 ± 0.0
Leu
4.869LeuAla: 4.869 ± 0.805
1.758LeuCys: 1.758 ± 0.761
5.004LeuAsp: 5.004 ± 0.739
12.578LeuGlu: 12.578 ± 1.627
2.434LeuPhe: 2.434 ± 0.602
5.68LeuGly: 5.68 ± 0.731
0.541LeuHis: 0.541 ± 0.228
5.545LeuIle: 5.545 ± 1.069
19.205LeuLys: 19.205 ± 1.839
7.438LeuLeu: 7.438 ± 1.235
2.57LeuMet: 2.57 ± 0.707
11.766LeuAsn: 11.766 ± 2.298
2.164LeuPro: 2.164 ± 0.475
3.246LeuGln: 3.246 ± 0.895
3.787LeuArg: 3.787 ± 0.934
6.086LeuSer: 6.086 ± 0.734
5.816LeuThr: 5.816 ± 0.816
3.787LeuVal: 3.787 ± 0.529
0.27LeuTrp: 0.27 ± 0.257
1.893LeuTyr: 1.893 ± 0.461
0.0LeuXaa: 0.0 ± 0.0
Met
0.541MetAla: 0.541 ± 0.243
0.0MetCys: 0.0 ± 0.0
1.623MetAsp: 1.623 ± 0.517
0.947MetGlu: 0.947 ± 0.498
1.082MetPhe: 1.082 ± 0.364
1.082MetGly: 1.082 ± 0.427
0.27MetHis: 0.27 ± 0.212
0.947MetIle: 0.947 ± 0.419
1.758MetLys: 1.758 ± 0.424
1.893MetLeu: 1.893 ± 0.544
0.135MetMet: 0.135 ± 0.129
1.758MetAsn: 1.758 ± 0.501
1.217MetPro: 1.217 ± 0.401
1.217MetGln: 1.217 ± 0.342
0.676MetArg: 0.676 ± 0.418
1.488MetSer: 1.488 ± 0.374
0.541MetThr: 0.541 ± 0.251
0.541MetVal: 0.541 ± 0.219
0.27MetTrp: 0.27 ± 0.323
0.406MetTyr: 0.406 ± 0.201
0.0MetXaa: 0.0 ± 0.0
Asn
8.791AsnAla: 8.791 ± 1.266
0.27AsnCys: 0.27 ± 0.218
3.652AsnAsp: 3.652 ± 0.531
8.52AsnGlu: 8.52 ± 1.658
4.057AsnPhe: 4.057 ± 0.865
2.705AsnGly: 2.705 ± 0.52
0.676AsnHis: 0.676 ± 0.322
4.057AsnIle: 4.057 ± 0.643
7.574AsnLys: 7.574 ± 1.068
7.574AsnLeu: 7.574 ± 1.058
1.352AsnMet: 1.352 ± 0.415
5.545AsnAsn: 5.545 ± 1.005
1.893AsnPro: 1.893 ± 0.598
5.004AsnGln: 5.004 ± 1.209
2.705AsnArg: 2.705 ± 0.644
4.734AsnSer: 4.734 ± 0.99
3.922AsnThr: 3.922 ± 0.627
1.623AsnVal: 1.623 ± 0.525
0.27AsnTrp: 0.27 ± 0.181
4.057AsnTyr: 4.057 ± 1.263
0.0AsnXaa: 0.0 ± 0.0
Pro
0.406ProAla: 0.406 ± 0.173
0.0ProCys: 0.0 ± 0.0
1.082ProAsp: 1.082 ± 0.378
1.893ProGlu: 1.893 ± 0.464
1.758ProPhe: 1.758 ± 0.388
0.406ProGly: 0.406 ± 0.3
0.406ProHis: 0.406 ± 0.239
2.299ProIle: 2.299 ± 0.515
4.598ProLys: 4.598 ± 0.835
2.57ProLeu: 2.57 ± 0.731
0.27ProMet: 0.27 ± 0.186
2.705ProAsn: 2.705 ± 0.493
0.406ProPro: 0.406 ± 0.227
0.676ProGln: 0.676 ± 0.251
0.676ProArg: 0.676 ± 0.254
2.164ProSer: 2.164 ± 0.524
1.623ProThr: 1.623 ± 0.493
1.217ProVal: 1.217 ± 0.417
0.0ProTrp: 0.0 ± 0.0
0.947ProTyr: 0.947 ± 0.411
0.0ProXaa: 0.0 ± 0.0
Gln
4.734GlnAla: 4.734 ± 0.636
0.27GlnCys: 0.27 ± 0.189
2.434GlnAsp: 2.434 ± 0.458
5.41GlnGlu: 5.41 ± 1.186
1.623GlnPhe: 1.623 ± 0.443
2.299GlnGly: 2.299 ± 0.887
0.135GlnHis: 0.135 ± 0.157
2.57GlnIle: 2.57 ± 0.431
7.033GlnLys: 7.033 ± 0.865
3.246GlnLeu: 3.246 ± 0.698
1.082GlnMet: 1.082 ± 0.419
2.705GlnAsn: 2.705 ± 0.666
1.082GlnPro: 1.082 ± 0.326
2.299GlnGln: 2.299 ± 0.978
1.623GlnArg: 1.623 ± 0.464
2.705GlnSer: 2.705 ± 0.532
2.029GlnThr: 2.029 ± 0.58
1.488GlnVal: 1.488 ± 0.446
0.27GlnTrp: 0.27 ± 0.149
0.541GlnTyr: 0.541 ± 0.347
0.0GlnXaa: 0.0 ± 0.0
Arg
3.652ArgAla: 3.652 ± 0.667
0.135ArgCys: 0.135 ± 0.13
2.164ArgAsp: 2.164 ± 0.54
4.057ArgGlu: 4.057 ± 0.731
2.705ArgPhe: 2.705 ± 0.526
1.352ArgGly: 1.352 ± 0.416
0.541ArgHis: 0.541 ± 0.337
2.57ArgIle: 2.57 ± 0.783
3.111ArgLys: 3.111 ± 0.775
6.221ArgLeu: 6.221 ± 1.031
0.406ArgMet: 0.406 ± 0.293
1.893ArgAsn: 1.893 ± 0.456
0.947ArgPro: 0.947 ± 0.396
1.217ArgGln: 1.217 ± 0.444
1.082ArgArg: 1.082 ± 0.473
3.381ArgSer: 3.381 ± 0.953
1.352ArgThr: 1.352 ± 0.514
1.352ArgVal: 1.352 ± 0.514
0.135ArgTrp: 0.135 ± 0.102
2.164ArgTyr: 2.164 ± 0.618
0.0ArgXaa: 0.0 ± 0.0
Ser
5.275SerAla: 5.275 ± 0.718
0.135SerCys: 0.135 ± 0.118
4.734SerAsp: 4.734 ± 0.806
6.627SerGlu: 6.627 ± 1.209
4.193SerPhe: 4.193 ± 0.65
2.975SerGly: 2.975 ± 0.684
0.676SerHis: 0.676 ± 0.345
4.193SerIle: 4.193 ± 0.952
6.086SerLys: 6.086 ± 0.854
7.979SerLeu: 7.979 ± 1.502
0.947SerMet: 0.947 ± 0.425
3.246SerAsn: 3.246 ± 0.525
1.082SerPro: 1.082 ± 0.33
2.434SerGln: 2.434 ± 0.585
1.488SerArg: 1.488 ± 0.433
2.299SerSer: 2.299 ± 0.625
2.029SerThr: 2.029 ± 0.572
5.275SerVal: 5.275 ± 1.29
0.541SerTrp: 0.541 ± 0.25
3.246SerTyr: 3.246 ± 0.526
0.0SerXaa: 0.0 ± 0.0
Thr
2.434ThrAla: 2.434 ± 0.672
0.27ThrCys: 0.27 ± 0.227
2.434ThrAsp: 2.434 ± 0.584
2.84ThrGlu: 2.84 ± 0.523
1.082ThrPhe: 1.082 ± 0.342
2.57ThrGly: 2.57 ± 0.729
0.676ThrHis: 0.676 ± 0.335
3.381ThrIle: 3.381 ± 0.75
5.004ThrLys: 5.004 ± 1.142
4.328ThrLeu: 4.328 ± 0.807
1.217ThrMet: 1.217 ± 0.456
3.922ThrAsn: 3.922 ± 0.864
3.516ThrPro: 3.516 ± 0.743
2.705ThrGln: 2.705 ± 1.055
1.758ThrArg: 1.758 ± 0.445
3.652ThrSer: 3.652 ± 0.844
2.84ThrThr: 2.84 ± 0.562
1.217ThrVal: 1.217 ± 0.473
0.27ThrTrp: 0.27 ± 0.203
1.352ThrTyr: 1.352 ± 0.524
0.0ThrXaa: 0.0 ± 0.0
Val
2.975ValAla: 2.975 ± 0.644
0.27ValCys: 0.27 ± 0.271
1.758ValAsp: 1.758 ± 0.636
3.111ValGlu: 3.111 ± 0.629
1.758ValPhe: 1.758 ± 0.561
2.57ValGly: 2.57 ± 0.798
0.27ValHis: 0.27 ± 0.262
3.516ValIle: 3.516 ± 0.825
4.734ValLys: 4.734 ± 0.861
4.193ValLeu: 4.193 ± 1.11
0.947ValMet: 0.947 ± 0.505
2.705ValAsn: 2.705 ± 0.734
0.406ValPro: 0.406 ± 0.259
0.541ValGln: 0.541 ± 0.288
1.488ValArg: 1.488 ± 0.38
3.787ValSer: 3.787 ± 1.302
1.488ValThr: 1.488 ± 0.357
2.299ValVal: 2.299 ± 0.842
0.406ValTrp: 0.406 ± 0.265
0.947ValTyr: 0.947 ± 0.515
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.135TrpAsp: 0.135 ± 0.13
0.676TrpGlu: 0.676 ± 0.391
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.135TrpHis: 0.135 ± 0.13
0.27TrpIle: 0.27 ± 0.206
0.406TrpLys: 0.406 ± 0.217
0.135TrpLeu: 0.135 ± 0.143
0.27TrpMet: 0.27 ± 0.211
0.541TrpAsn: 0.541 ± 0.341
0.0TrpPro: 0.0 ± 0.0
0.27TrpGln: 0.27 ± 0.168
0.27TrpArg: 0.27 ± 0.17
0.27TrpSer: 0.27 ± 0.181
0.406TrpThr: 0.406 ± 0.239
0.541TrpVal: 0.541 ± 0.284
0.0TrpTrp: 0.0 ± 0.0
0.27TrpTyr: 0.27 ± 0.257
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.352TyrAla: 1.352 ± 0.435
0.27TyrCys: 0.27 ± 0.28
1.758TyrAsp: 1.758 ± 0.414
3.111TyrGlu: 3.111 ± 0.719
2.299TyrPhe: 2.299 ± 0.768
0.947TyrGly: 0.947 ± 0.243
1.623TyrHis: 1.623 ± 0.561
2.299TyrIle: 2.299 ± 0.387
2.434TyrLys: 2.434 ± 0.53
3.652TyrLeu: 3.652 ± 0.807
1.082TyrMet: 1.082 ± 0.32
2.57TyrAsn: 2.57 ± 0.609
1.352TyrPro: 1.352 ± 0.378
2.029TyrGln: 2.029 ± 0.477
1.217TyrArg: 1.217 ± 0.443
2.975TyrSer: 2.975 ± 0.709
1.758TyrThr: 1.758 ± 0.763
0.406TyrVal: 0.406 ± 0.254
0.0TyrTrp: 0.0 ± 0.0
1.217TyrTyr: 1.217 ± 0.345
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 24 proteins (7395 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski