Amino acid dipepetide frequency for Helicobacter phage UKEN32U

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.052AlaAla: 1.052 ± 0.478
0.947AlaCys: 0.947 ± 0.365
1.894AlaAsp: 1.894 ± 0.378
3.577AlaGlu: 3.577 ± 0.753
3.472AlaPhe: 3.472 ± 0.71
2.63AlaGly: 2.63 ± 0.802
0.736AlaHis: 0.736 ± 0.351
5.892AlaIle: 5.892 ± 0.837
8.101AlaLys: 8.101 ± 0.821
11.888AlaLeu: 11.888 ± 1.224
1.578AlaMet: 1.578 ± 0.457
6.207AlaAsn: 6.207 ± 0.953
1.262AlaPro: 1.262 ± 0.3
2.841AlaGln: 2.841 ± 0.601
3.156AlaArg: 3.156 ± 0.505
3.682AlaSer: 3.682 ± 0.722
3.156AlaThr: 3.156 ± 0.864
1.894AlaVal: 1.894 ± 0.495
0.105AlaTrp: 0.105 ± 0.107
2.209AlaTyr: 2.209 ± 0.634
0.0AlaXaa: 0.0 ± 0.0
Cys
0.421CysAla: 0.421 ± 0.221
0.105CysCys: 0.105 ± 0.095
0.736CysAsp: 0.736 ± 0.34
0.631CysGlu: 0.631 ± 0.291
0.736CysPhe: 0.736 ± 0.301
0.421CysGly: 0.421 ± 0.232
0.0CysHis: 0.0 ± 0.0
0.21CysIle: 0.21 ± 0.152
0.421CysLys: 0.421 ± 0.232
1.052CysLeu: 1.052 ± 0.442
0.0CysMet: 0.0 ± 0.0
0.421CysAsn: 0.421 ± 0.164
0.421CysPro: 0.421 ± 0.159
0.21CysGln: 0.21 ± 0.147
0.21CysArg: 0.21 ± 0.213
0.526CysSer: 0.526 ± 0.397
0.421CysThr: 0.421 ± 0.196
0.631CysVal: 0.631 ± 0.239
0.0CysTrp: 0.0 ± 0.0
0.421CysTyr: 0.421 ± 0.208
0.0CysXaa: 0.0 ± 0.0
Asp
2.209AspAla: 2.209 ± 0.572
0.526AspCys: 0.526 ± 0.294
1.683AspAsp: 1.683 ± 0.414
3.156AspGlu: 3.156 ± 0.641
4.103AspPhe: 4.103 ± 0.509
1.052AspGly: 1.052 ± 0.297
0.631AspHis: 0.631 ± 0.258
2.42AspIle: 2.42 ± 0.723
6.523AspLys: 6.523 ± 1.116
7.154AspLeu: 7.154 ± 1.17
1.262AspMet: 1.262 ± 0.371
4.419AspAsn: 4.419 ± 0.81
1.578AspPro: 1.578 ± 0.336
0.842AspGln: 0.842 ± 0.303
1.578AspArg: 1.578 ± 0.453
2.841AspSer: 2.841 ± 0.518
1.683AspThr: 1.683 ± 0.471
1.157AspVal: 1.157 ± 0.403
0.105AspTrp: 0.105 ± 0.105
3.893AspTyr: 3.893 ± 0.756
0.0AspXaa: 0.0 ± 0.0
Glu
7.154GluAla: 7.154 ± 0.964
0.526GluCys: 0.526 ± 0.271
1.789GluAsp: 1.789 ± 0.406
4.629GluGlu: 4.629 ± 0.897
3.577GluPhe: 3.577 ± 0.698
1.683GluGly: 1.683 ± 0.439
1.157GluHis: 1.157 ± 0.305
8.627GluIle: 8.627 ± 1.105
7.785GluLys: 7.785 ± 0.857
8.943GluLeu: 8.943 ± 1.11
1.368GluMet: 1.368 ± 0.325
6.628GluAsn: 6.628 ± 1.04
1.368GluPro: 1.368 ± 0.376
5.26GluGln: 5.26 ± 1.038
4.524GluArg: 4.524 ± 0.736
7.259GluSer: 7.259 ± 0.915
4.945GluThr: 4.945 ± 0.615
4.103GluVal: 4.103 ± 0.89
0.526GluTrp: 0.526 ± 0.2
2.209GluTyr: 2.209 ± 0.427
0.0GluXaa: 0.0 ± 0.0
Phe
1.578PheAla: 1.578 ± 0.413
0.631PheCys: 0.631 ± 0.262
2.63PheAsp: 2.63 ± 0.619
3.367PheGlu: 3.367 ± 0.447
3.682PhePhe: 3.682 ± 0.635
1.157PheGly: 1.157 ± 0.266
0.736PheHis: 0.736 ± 0.215
3.261PheIle: 3.261 ± 0.605
6.839PheLys: 6.839 ± 0.784
6.839PheLeu: 6.839 ± 0.877
0.421PheMet: 0.421 ± 0.303
3.261PheAsn: 3.261 ± 0.655
0.421PhePro: 0.421 ± 0.197
0.631PheGln: 0.631 ± 0.203
1.789PheArg: 1.789 ± 0.412
5.05PheSer: 5.05 ± 0.732
2.315PheThr: 2.315 ± 0.542
1.683PheVal: 1.683 ± 0.331
0.316PheTrp: 0.316 ± 0.266
2.315PheTyr: 2.315 ± 0.675
0.0PheXaa: 0.0 ± 0.0
Gly
2.841GlyAla: 2.841 ± 0.762
0.421GlyCys: 0.421 ± 0.174
1.578GlyAsp: 1.578 ± 0.376
1.999GlyGlu: 1.999 ± 0.467
3.367GlyPhe: 3.367 ± 0.656
2.946GlyGly: 2.946 ± 0.742
0.316GlyHis: 0.316 ± 0.27
3.051GlyIle: 3.051 ± 0.491
2.315GlyLys: 2.315 ± 0.502
5.471GlyLeu: 5.471 ± 0.634
1.473GlyMet: 1.473 ± 0.359
3.367GlyAsn: 3.367 ± 0.553
0.0GlyPro: 0.0 ± 0.0
1.052GlyGln: 1.052 ± 0.248
1.157GlyArg: 1.157 ± 0.335
3.682GlySer: 3.682 ± 0.618
0.736GlyThr: 0.736 ± 0.329
4.524GlyVal: 4.524 ± 0.844
0.105GlyTrp: 0.105 ± 0.118
1.368GlyTyr: 1.368 ± 0.379
0.0GlyXaa: 0.0 ± 0.0
His
0.947HisAla: 0.947 ± 0.323
0.105HisCys: 0.105 ± 0.093
1.052HisAsp: 1.052 ± 0.427
0.947HisGlu: 0.947 ± 0.279
1.052HisPhe: 1.052 ± 0.302
0.105HisGly: 0.105 ± 0.118
0.21HisHis: 0.21 ± 0.186
0.947HisIle: 0.947 ± 0.358
1.894HisLys: 1.894 ± 0.39
1.578HisLeu: 1.578 ± 0.462
0.21HisMet: 0.21 ± 0.197
0.842HisAsn: 0.842 ± 0.26
0.316HisPro: 0.316 ± 0.186
0.316HisGln: 0.316 ± 0.263
0.947HisArg: 0.947 ± 0.271
0.947HisSer: 0.947 ± 0.285
0.736HisThr: 0.736 ± 0.271
0.21HisVal: 0.21 ± 0.133
0.0HisTrp: 0.0 ± 0.0
0.842HisTyr: 0.842 ± 0.229
0.0HisXaa: 0.0 ± 0.0
Ile
4.419IleAla: 4.419 ± 0.642
0.631IleCys: 0.631 ± 0.31
4.734IleAsp: 4.734 ± 0.632
5.366IleGlu: 5.366 ± 1.143
1.999IlePhe: 1.999 ± 0.423
2.104IleGly: 2.104 ± 0.504
0.842IleHis: 0.842 ± 0.255
3.787IleIle: 3.787 ± 0.584
10.1IleLys: 10.1 ± 1.097
6.523IleLeu: 6.523 ± 0.771
1.262IleMet: 1.262 ± 0.286
5.786IleAsn: 5.786 ± 1.296
1.999IlePro: 1.999 ± 0.46
3.156IleGln: 3.156 ± 0.655
2.525IleArg: 2.525 ± 0.44
4.629IleSer: 4.629 ± 0.608
5.892IleThr: 5.892 ± 1.316
3.261IleVal: 3.261 ± 0.562
0.105IleTrp: 0.105 ± 0.107
2.104IleTyr: 2.104 ± 0.5
0.0IleXaa: 0.0 ± 0.0
Lys
9.89LysAla: 9.89 ± 1.525
0.421LysCys: 0.421 ± 0.209
6.523LysAsp: 6.523 ± 1.243
14.308LysGlu: 14.308 ± 1.71
3.261LysPhe: 3.261 ± 0.562
3.577LysGly: 3.577 ± 0.653
2.315LysHis: 2.315 ± 0.603
7.154LysIle: 7.154 ± 0.907
7.785LysLys: 7.785 ± 1.332
8.522LysLeu: 8.522 ± 1.067
1.368LysMet: 1.368 ± 0.318
9.995LysAsn: 9.995 ± 1.395
3.682LysPro: 3.682 ± 0.54
5.576LysGln: 5.576 ± 0.989
4.734LysArg: 4.734 ± 0.764
5.26LysSer: 5.26 ± 0.69
6.523LysThr: 6.523 ± 1.106
4.629LysVal: 4.629 ± 0.55
0.316LysTrp: 0.316 ± 0.175
2.315LysTyr: 2.315 ± 0.364
0.0LysXaa: 0.0 ± 0.0
Leu
6.733LeuAla: 6.733 ± 0.773
1.368LeuCys: 1.368 ± 0.443
4.419LeuAsp: 4.419 ± 0.615
12.52LeuGlu: 12.52 ± 1.691
3.156LeuPhe: 3.156 ± 0.539
5.892LeuGly: 5.892 ± 0.683
1.052LeuHis: 1.052 ± 0.361
7.259LeuIle: 7.259 ± 1.09
17.78LeuLys: 17.78 ± 1.831
7.259LeuLeu: 7.259 ± 0.921
1.894LeuMet: 1.894 ± 0.389
11.362LeuAsn: 11.362 ± 1.271
2.104LeuPro: 2.104 ± 0.391
4.734LeuGln: 4.734 ± 0.829
3.787LeuArg: 3.787 ± 0.457
5.576LeuSer: 5.576 ± 0.69
4.208LeuThr: 4.208 ± 0.691
3.893LeuVal: 3.893 ± 0.623
0.526LeuTrp: 0.526 ± 0.258
2.42LeuTyr: 2.42 ± 0.523
0.0LeuXaa: 0.0 ± 0.0
Met
0.842MetAla: 0.842 ± 0.339
0.105MetCys: 0.105 ± 0.122
1.262MetAsp: 1.262 ± 0.379
0.842MetGlu: 0.842 ± 0.279
0.842MetPhe: 0.842 ± 0.333
1.368MetGly: 1.368 ± 0.475
0.105MetHis: 0.105 ± 0.112
0.947MetIle: 0.947 ± 0.318
1.999MetLys: 1.999 ± 0.513
2.209MetLeu: 2.209 ± 0.457
0.21MetMet: 0.21 ± 0.156
0.947MetAsn: 0.947 ± 0.272
1.052MetPro: 1.052 ± 0.294
1.683MetGln: 1.683 ± 0.45
0.736MetArg: 0.736 ± 0.334
0.631MetSer: 0.631 ± 0.237
0.421MetThr: 0.421 ± 0.279
0.316MetVal: 0.316 ± 0.15
0.316MetTrp: 0.316 ± 0.176
0.21MetTyr: 0.21 ± 0.134
0.0MetXaa: 0.0 ± 0.0
Asn
10.626AsnAla: 10.626 ± 1.472
0.21AsnCys: 0.21 ± 0.138
4.419AsnAsp: 4.419 ± 0.593
8.206AsnGlu: 8.206 ± 1.552
3.682AsnPhe: 3.682 ± 0.737
3.472AsnGly: 3.472 ± 0.453
1.262AsnHis: 1.262 ± 0.272
3.577AsnIle: 3.577 ± 0.749
7.365AsnLys: 7.365 ± 0.812
7.996AsnLeu: 7.996 ± 1.022
1.052AsnMet: 1.052 ± 0.273
6.418AsnAsn: 6.418 ± 0.912
1.789AsnPro: 1.789 ± 0.446
4.84AsnGln: 4.84 ± 1.041
2.525AsnArg: 2.525 ± 0.404
3.682AsnSer: 3.682 ± 0.583
4.419AsnThr: 4.419 ± 0.581
2.104AsnVal: 2.104 ± 0.48
0.21AsnTrp: 0.21 ± 0.153
3.367AsnTyr: 3.367 ± 0.729
0.0AsnXaa: 0.0 ± 0.0
Pro
0.421ProAla: 0.421 ± 0.16
0.0ProCys: 0.0 ± 0.0
1.052ProAsp: 1.052 ± 0.462
1.368ProGlu: 1.368 ± 0.392
1.789ProPhe: 1.789 ± 0.458
0.316ProGly: 0.316 ± 0.236
0.316ProHis: 0.316 ± 0.192
2.315ProIle: 2.315 ± 0.409
3.367ProLys: 3.367 ± 0.65
2.104ProLeu: 2.104 ± 0.617
0.21ProMet: 0.21 ± 0.173
2.209ProAsn: 2.209 ± 0.445
0.316ProPro: 0.316 ± 0.182
0.842ProGln: 0.842 ± 0.261
0.947ProArg: 0.947 ± 0.394
2.525ProSer: 2.525 ± 0.47
1.894ProThr: 1.894 ± 0.394
0.526ProVal: 0.526 ± 0.217
0.105ProTrp: 0.105 ± 0.134
0.842ProTyr: 0.842 ± 0.297
0.0ProXaa: 0.0 ± 0.0
Gln
4.629GlnAla: 4.629 ± 0.688
0.105GlnCys: 0.105 ± 0.109
1.578GlnAsp: 1.578 ± 0.4
5.155GlnGlu: 5.155 ± 0.779
1.473GlnPhe: 1.473 ± 0.287
2.525GlnGly: 2.525 ± 0.581
0.631GlnHis: 0.631 ± 0.464
3.261GlnIle: 3.261 ± 0.651
5.155GlnLys: 5.155 ± 0.849
3.577GlnLeu: 3.577 ± 0.381
0.842GlnMet: 0.842 ± 0.343
4.208GlnAsn: 4.208 ± 0.741
0.526GlnPro: 0.526 ± 0.247
2.525GlnGln: 2.525 ± 0.504
1.368GlnArg: 1.368 ± 0.362
3.577GlnSer: 3.577 ± 0.539
2.104GlnThr: 2.104 ± 0.409
1.894GlnVal: 1.894 ± 0.451
0.421GlnTrp: 0.421 ± 0.188
1.052GlnTyr: 1.052 ± 0.459
0.0GlnXaa: 0.0 ± 0.0
Arg
2.735ArgAla: 2.735 ± 0.479
0.21ArgCys: 0.21 ± 0.167
2.42ArgAsp: 2.42 ± 0.513
2.946ArgGlu: 2.946 ± 0.637
2.42ArgPhe: 2.42 ± 0.333
1.157ArgGly: 1.157 ± 0.302
0.631ArgHis: 0.631 ± 0.235
3.367ArgIle: 3.367 ± 0.708
3.472ArgLys: 3.472 ± 0.656
5.471ArgLeu: 5.471 ± 0.698
0.526ArgMet: 0.526 ± 0.252
2.209ArgAsn: 2.209 ± 0.501
0.947ArgPro: 0.947 ± 0.395
1.894ArgGln: 1.894 ± 0.643
0.526ArgArg: 0.526 ± 0.216
2.209ArgSer: 2.209 ± 0.348
1.052ArgThr: 1.052 ± 0.237
1.683ArgVal: 1.683 ± 0.363
0.105ArgTrp: 0.105 ± 0.082
1.683ArgTyr: 1.683 ± 0.364
0.0ArgXaa: 0.0 ± 0.0
Ser
4.629SerAla: 4.629 ± 0.838
0.421SerCys: 0.421 ± 0.233
5.05SerAsp: 5.05 ± 0.589
5.892SerGlu: 5.892 ± 1.055
3.787SerPhe: 3.787 ± 0.695
4.103SerGly: 4.103 ± 0.659
0.631SerHis: 0.631 ± 0.232
3.367SerIle: 3.367 ± 0.415
5.05SerLys: 5.05 ± 0.632
8.206SerLeu: 8.206 ± 0.805
1.262SerMet: 1.262 ± 0.32
3.893SerAsn: 3.893 ± 0.836
0.947SerPro: 0.947 ± 0.291
2.841SerGln: 2.841 ± 0.437
1.683SerArg: 1.683 ± 0.425
2.841SerSer: 2.841 ± 0.595
1.368SerThr: 1.368 ± 0.281
5.681SerVal: 5.681 ± 0.794
0.421SerTrp: 0.421 ± 0.195
2.63SerTyr: 2.63 ± 0.481
0.0SerXaa: 0.0 ± 0.0
Thr
1.999ThrAla: 1.999 ± 0.725
0.316ThrCys: 0.316 ± 0.182
2.735ThrAsp: 2.735 ± 0.467
3.261ThrGlu: 3.261 ± 0.652
0.842ThrPhe: 0.842 ± 0.3
2.42ThrGly: 2.42 ± 0.611
1.157ThrHis: 1.157 ± 0.341
4.734ThrIle: 4.734 ± 1.031
4.103ThrLys: 4.103 ± 1.267
4.314ThrLeu: 4.314 ± 0.668
0.947ThrMet: 0.947 ± 0.289
3.998ThrAsn: 3.998 ± 0.674
2.735ThrPro: 2.735 ± 0.474
3.682ThrGln: 3.682 ± 1.032
1.578ThrArg: 1.578 ± 0.442
3.998ThrSer: 3.998 ± 0.69
3.156ThrThr: 3.156 ± 0.832
0.421ThrVal: 0.421 ± 0.176
0.526ThrTrp: 0.526 ± 0.218
1.683ThrTyr: 1.683 ± 0.471
0.0ThrXaa: 0.0 ± 0.0
Val
1.683ValAla: 1.683 ± 0.481
0.736ValCys: 0.736 ± 0.313
1.368ValAsp: 1.368 ± 0.448
2.104ValGlu: 2.104 ± 0.443
3.156ValPhe: 3.156 ± 0.611
3.367ValGly: 3.367 ± 0.719
0.21ValHis: 0.21 ± 0.16
3.893ValIle: 3.893 ± 0.59
4.419ValLys: 4.419 ± 0.645
5.155ValLeu: 5.155 ± 0.991
0.421ValMet: 0.421 ± 0.232
2.315ValAsn: 2.315 ± 0.541
0.736ValPro: 0.736 ± 0.286
1.052ValGln: 1.052 ± 0.298
1.999ValArg: 1.999 ± 0.456
3.367ValSer: 3.367 ± 0.656
2.104ValThr: 2.104 ± 0.428
1.894ValVal: 1.894 ± 0.524
0.21ValTrp: 0.21 ± 0.145
1.473ValTyr: 1.473 ± 0.502
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.631TrpGlu: 0.631 ± 0.259
0.0TrpPhe: 0.0 ± 0.0
0.316TrpGly: 0.316 ± 0.171
0.105TrpHis: 0.105 ± 0.107
0.421TrpIle: 0.421 ± 0.23
0.421TrpLys: 0.421 ± 0.305
0.105TrpLeu: 0.105 ± 0.118
0.105TrpMet: 0.105 ± 0.094
0.421TrpAsn: 0.421 ± 0.239
0.0TrpPro: 0.0 ± 0.0
0.105TrpGln: 0.105 ± 0.082
0.316TrpArg: 0.316 ± 0.2
0.421TrpSer: 0.421 ± 0.28
0.21TrpThr: 0.21 ± 0.129
0.631TrpVal: 0.631 ± 0.316
0.0TrpTrp: 0.0 ± 0.0
0.21TrpTyr: 0.21 ± 0.198
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.473TyrAla: 1.473 ± 0.378
0.21TyrCys: 0.21 ± 0.127
1.894TyrAsp: 1.894 ± 0.413
3.156TyrGlu: 3.156 ± 0.608
2.525TyrPhe: 2.525 ± 0.733
1.157TyrGly: 1.157 ± 0.328
1.157TyrHis: 1.157 ± 0.321
2.735TyrIle: 2.735 ± 0.535
3.261TyrLys: 3.261 ± 0.686
3.577TyrLeu: 3.577 ± 0.485
0.526TyrMet: 0.526 ± 0.252
2.315TyrAsn: 2.315 ± 0.426
1.368TyrPro: 1.368 ± 0.429
2.525TyrGln: 2.525 ± 0.454
1.578TyrArg: 1.578 ± 0.422
1.999TyrSer: 1.999 ± 0.412
1.262TyrThr: 1.262 ± 0.398
0.526TyrVal: 0.526 ± 0.313
0.0TyrTrp: 0.0 ± 0.0
1.999TyrTyr: 1.999 ± 0.516
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 35 proteins (9506 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski