Amino acid dipeptide frequency for Helicobacter phage Pt5322G

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
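A minimal sketch of how such dipeptide frequencies could be computed from a set of protein sequences. The function name `dipeptide_frequencies` and the toy sequences are hypothetical, and the normalisation (dividing by the total number of dipeptides) is an assumption; the database may normalise differently.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: frequencies are normalised by the total number of
    dipeptides counted; this is not confirmed by the source.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy one-letter sequences, not taken from the phage proteome.
freqs = dipeptide_frequencies(["MKLLE", "AKLE"])
```

The sketch uses one-letter codes, whereas the table below uses three-letter codes (for example, `KL` corresponds to LysLeu).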

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each one would be expected at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
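The unit conversion and the comparison with the uniform expectation can be sketched as follows. The helper names are hypothetical. The threshold of exactly 2.5‰ (1/400) is an assumption taken from the 0.25% figure in the text; the page's colouring may use a different rule.

```python
# Uniform expectation over the 400 standard dipeptides: 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide relative to the assumed uniform expectation."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "expected"


# Values copied from the table: LeuLys = 17.635, AlaCys = 0.333.
leu_lys_fraction = per_mille_to_fraction(17.635)
leu_lys_label = classify(17.635)
ala_cys_label = classify(0.333)
```

Under this assumption, LeuLys (17.635‰) falls well above the expected 2.5‰, while AlaCys (0.333‰) falls well below it.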

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.442 ± 0.467
AlaCys: 0.333 ± 0.18
AlaAsp: 1.886 ± 0.314
AlaGlu: 3.771 ± 0.88
AlaPhe: 4.547 ± 0.893
AlaGly: 2.662 ± 0.602
AlaHis: 0.665 ± 0.283
AlaIle: 5.324 ± 1.064
AlaLys: 7.875 ± 1.106
AlaLeu: 11.424 ± 1.134
AlaMet: 1.331 ± 0.574
AlaAsn: 5.989 ± 0.822
AlaPro: 1.553 ± 0.341
AlaGln: 3.106 ± 0.59
AlaArg: 2.773 ± 0.689
AlaSer: 3.882 ± 0.657
AlaThr: 2.551 ± 0.507
AlaVal: 1.996 ± 0.473
AlaTrp: 0.222 ± 0.162
AlaTyr: 2.107 ± 0.499
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.444 ± 0.251
CysCys: 0.0 ± 0.0
CysAsp: 0.665 ± 0.359
CysGlu: 0.887 ± 0.339
CysPhe: 0.665 ± 0.357
CysGly: 0.222 ± 0.161
CysHis: 0.0 ± 0.0
CysIle: 0.444 ± 0.348
CysLys: 0.111 ± 0.104
CysLeu: 1.109 ± 0.486
CysMet: 0.0 ± 0.0
CysAsn: 0.333 ± 0.211
CysPro: 0.333 ± 0.175
CysGln: 0.222 ± 0.14
CysArg: 0.111 ± 0.104
CysSer: 0.111 ± 0.114
CysThr: 0.555 ± 0.318
CysVal: 0.333 ± 0.252
CysTrp: 0.0 ± 0.0
CysTyr: 0.222 ± 0.148
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.551 ± 0.483
AspCys: 0.333 ± 0.238
AspAsp: 1.886 ± 0.362
AspGlu: 4.104 ± 0.583
AspPhe: 4.104 ± 0.663
AspGly: 1.22 ± 0.355
AspHis: 0.555 ± 0.234
AspIle: 3.438 ± 1.037
AspLys: 6.322 ± 0.822
AspLeu: 7.875 ± 0.917
AspMet: 1.22 ± 0.556
AspAsn: 4.326 ± 0.671
AspPro: 1.553 ± 0.371
AspGln: 0.887 ± 0.432
AspArg: 1.996 ± 0.524
AspSer: 2.662 ± 0.596
AspThr: 2.107 ± 0.585
AspVal: 1.109 ± 0.428
AspTrp: 0.111 ± 0.114
AspTyr: 3.549 ± 0.608
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.1 ± 0.851
GluCys: 0.444 ± 0.237
GluAsp: 2.107 ± 0.499
GluGlu: 5.213 ± 1.088
GluPhe: 4.215 ± 0.694
GluGly: 1.442 ± 0.332
GluHis: 1.22 ± 0.331
GluIle: 7.653 ± 0.887
GluLys: 9.428 ± 0.908
GluLeu: 9.539 ± 1.342
GluMet: 0.998 ± 0.315
GluAsn: 7.764 ± 0.6
GluPro: 1.886 ± 0.474
GluGln: 5.435 ± 1.052
GluArg: 4.658 ± 0.883
GluSer: 6.766 ± 1.121
GluThr: 4.991 ± 0.724
GluVal: 4.104 ± 0.906
GluTrp: 0.444 ± 0.196
GluTyr: 2.218 ± 0.402
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.442 ± 0.38
PheCys: 0.998 ± 0.465
PheAsp: 2.884 ± 0.547
PheGlu: 3.438 ± 0.412
PhePhe: 3.217 ± 0.631
PheGly: 1.442 ± 0.396
PheHis: 0.665 ± 0.184
PheIle: 3.438 ± 0.347
PheLys: 6.988 ± 0.679
PheLeu: 6.877 ± 0.993
PheMet: 0.776 ± 0.361
PheAsn: 3.549 ± 0.56
PhePro: 0.665 ± 0.245
PheGln: 0.555 ± 0.216
PheArg: 1.886 ± 0.589
PheSer: 4.88 ± 0.623
PheThr: 2.329 ± 0.558
PheVal: 1.775 ± 0.39
PheTrp: 0.111 ± 0.101
PheTyr: 1.886 ± 0.701
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.995 ± 0.972
GlyCys: 0.444 ± 0.231
GlyAsp: 2.218 ± 0.477
GlyGlu: 2.218 ± 0.521
GlyPhe: 2.551 ± 0.596
GlyGly: 2.995 ± 0.706
GlyHis: 0.665 ± 0.266
GlyIle: 3.217 ± 0.476
GlyLys: 2.551 ± 0.428
GlyLeu: 4.215 ± 0.61
GlyMet: 1.331 ± 0.33
GlyAsn: 3.106 ± 0.472
GlyPro: 0.0 ± 0.0
GlyGln: 0.887 ± 0.255
GlyArg: 1.22 ± 0.295
GlySer: 2.551 ± 0.567
GlyThr: 1.109 ± 0.289
GlyVal: 3.66 ± 1.037
GlyTrp: 0.222 ± 0.218
GlyTyr: 1.775 ± 0.446
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.998 ± 0.316
HisCys: 0.0 ± 0.0
HisAsp: 1.109 ± 0.502
HisGlu: 1.442 ± 0.427
HisPhe: 0.665 ± 0.23
HisGly: 0.333 ± 0.157
HisHis: 0.0 ± 0.0
HisIle: 0.998 ± 0.317
HisLys: 1.886 ± 0.483
HisLeu: 1.331 ± 0.267
HisMet: 0.333 ± 0.187
HisAsn: 0.776 ± 0.213
HisPro: 0.222 ± 0.141
HisGln: 0.333 ± 0.159
HisArg: 0.444 ± 0.21
HisSer: 1.109 ± 0.384
HisThr: 0.776 ± 0.258
HisVal: 0.333 ± 0.161
HisTrp: 0.0 ± 0.0
HisTyr: 0.665 ± 0.259
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.88 ± 0.67
IleCys: 0.776 ± 0.398
IleAsp: 4.547 ± 0.639
IleGlu: 6.322 ± 0.96
IlePhe: 1.886 ± 0.476
IleGly: 1.886 ± 0.513
IleHis: 0.998 ± 0.307
IleIle: 4.326 ± 0.709
IleLys: 8.762 ± 1.132
IleLeu: 6.211 ± 0.671
IleMet: 1.331 ± 0.392
IleAsn: 5.768 ± 0.935
IlePro: 0.998 ± 0.296
IleGln: 4.326 ± 0.48
IleArg: 2.329 ± 0.408
IleSer: 3.66 ± 0.497
IleThr: 3.771 ± 0.577
IleVal: 3.217 ± 0.447
IleTrp: 0.222 ± 0.152
IleTyr: 2.884 ± 0.647
IleXaa: 0.0 ± 0.0
Lys
LysAla: 8.208 ± 1.1
LysCys: 0.333 ± 0.224
LysAsp: 8.097 ± 1.523
LysGlu: 13.531 ± 1.571
LysPhe: 3.217 ± 0.445
LysGly: 3.549 ± 0.727
LysHis: 2.662 ± 0.652
LysIle: 7.875 ± 1.01
LysLys: 8.54 ± 1.205
LysLeu: 8.651 ± 1.206
LysMet: 1.22 ± 0.372
LysAsn: 11.535 ± 1.422
LysPro: 3.771 ± 0.692
LysGln: 5.657 ± 0.869
LysArg: 4.88 ± 0.993
LysSer: 5.102 ± 1.192
LysThr: 5.435 ± 0.831
LysVal: 4.104 ± 0.81
LysTrp: 0.555 ± 0.269
LysTyr: 2.662 ± 0.44
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.877 ± 0.971
LeuCys: 1.442 ± 0.622
LeuAsp: 5.102 ± 0.59
LeuGlu: 11.757 ± 1.357
LeuPhe: 3.327 ± 0.75
LeuGly: 6.211 ± 0.57
LeuHis: 0.111 ± 0.088
LeuIle: 6.433 ± 1.087
LeuLys: 17.635 ± 1.564
LeuLeu: 7.764 ± 0.956
LeuMet: 2.107 ± 0.352
LeuAsn: 11.091 ± 1.218
LeuPro: 2.107 ± 0.387
LeuGln: 4.104 ± 0.78
LeuArg: 3.438 ± 0.505
LeuSer: 6.877 ± 0.993
LeuThr: 4.769 ± 0.717
LeuVal: 3.66 ± 0.569
LeuTrp: 0.665 ± 0.31
LeuTyr: 1.996 ± 0.364
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.555 ± 0.324
MetCys: 0.0 ± 0.0
MetAsp: 1.331 ± 0.455
MetGlu: 0.887 ± 0.267
MetPhe: 1.331 ± 0.391
MetGly: 1.109 ± 0.403
MetHis: 0.222 ± 0.151
MetIle: 0.998 ± 0.381
MetLys: 2.218 ± 0.536
MetLeu: 1.996 ± 0.36
MetMet: 0.111 ± 0.104
MetAsn: 1.664 ± 0.427
MetPro: 0.998 ± 0.367
MetGln: 1.442 ± 0.41
MetArg: 0.776 ± 0.333
MetSer: 0.998 ± 0.343
MetThr: 0.444 ± 0.314
MetVal: 0.333 ± 0.16
MetTrp: 0.333 ± 0.199
MetTyr: 0.555 ± 0.219
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 9.539 ± 1.714
AsnCys: 0.111 ± 0.104
AsnAsp: 3.993 ± 0.56
AsnGlu: 7.875 ± 1.079
AsnPhe: 3.993 ± 0.784
AsnGly: 2.773 ± 0.342
AsnHis: 1.886 ± 0.505
AsnIle: 4.104 ± 0.673
AsnLys: 7.875 ± 0.834
AsnLeu: 7.764 ± 0.978
AsnMet: 1.442 ± 0.429
AsnAsn: 7.098 ± 1.081
AsnPro: 1.996 ± 0.585
AsnGln: 4.547 ± 1.125
AsnArg: 3.106 ± 0.558
AsnSer: 4.215 ± 0.461
AsnThr: 3.66 ± 0.602
AsnVal: 2.218 ± 0.4
AsnTrp: 0.222 ± 0.131
AsnTyr: 4.215 ± 0.81
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.555 ± 0.207
ProCys: 0.111 ± 0.101
ProAsp: 0.998 ± 0.329
ProGlu: 1.331 ± 0.4
ProPhe: 1.996 ± 0.453
ProGly: 0.333 ± 0.252
ProHis: 0.222 ± 0.176
ProIle: 1.996 ± 0.349
ProLys: 3.771 ± 0.655
ProLeu: 2.44 ± 0.756
ProMet: 0.555 ± 0.29
ProAsn: 2.662 ± 0.557
ProPro: 0.333 ± 0.148
ProGln: 0.887 ± 0.269
ProArg: 0.665 ± 0.199
ProSer: 2.884 ± 0.533
ProThr: 1.553 ± 0.357
ProVal: 0.998 ± 0.326
ProTrp: 0.111 ± 0.109
ProTyr: 0.887 ± 0.417
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.547 ± 0.781
GlnCys: 0.222 ± 0.156
GlnAsp: 1.886 ± 0.377
GlnGlu: 4.215 ± 0.717
GlnPhe: 1.553 ± 0.327
GlnGly: 2.44 ± 0.583
GlnHis: 0.444 ± 0.199
GlnIle: 3.327 ± 0.649
GlnLys: 5.102 ± 0.916
GlnLeu: 3.217 ± 0.439
GlnMet: 0.998 ± 0.388
GlnAsn: 3.217 ± 0.716
GlnPro: 0.998 ± 0.236
GlnGln: 2.884 ± 0.764
GlnArg: 1.442 ± 0.377
GlnSer: 3.882 ± 0.585
GlnThr: 2.551 ± 0.43
GlnVal: 2.107 ± 0.435
GlnTrp: 0.444 ± 0.177
GlnTyr: 0.665 ± 0.308
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.327 ± 0.734
ArgCys: 0.111 ± 0.104
ArgAsp: 2.107 ± 0.439
ArgGlu: 3.993 ± 0.634
ArgPhe: 2.884 ± 0.476
ArgGly: 0.998 ± 0.262
ArgHis: 0.555 ± 0.192
ArgIle: 2.884 ± 0.691
ArgLys: 2.773 ± 0.659
ArgLeu: 5.102 ± 0.6
ArgMet: 0.665 ± 0.367
ArgAsn: 1.996 ± 0.485
ArgPro: 0.998 ± 0.332
ArgGln: 1.553 ± 0.577
ArgArg: 0.776 ± 0.216
ArgSer: 2.329 ± 0.537
ArgThr: 1.442 ± 0.488
ArgVal: 1.664 ± 0.439
ArgTrp: 0.222 ± 0.176
ArgTyr: 1.553 ± 0.384
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.547 ± 0.648
SerCys: 0.333 ± 0.269
SerAsp: 5.546 ± 0.602
SerGlu: 6.877 ± 1.019
SerPhe: 3.549 ± 0.807
SerGly: 3.771 ± 0.731
SerHis: 0.444 ± 0.263
SerIle: 2.662 ± 0.464
SerLys: 4.991 ± 0.673
SerLeu: 8.651 ± 1.11
SerMet: 1.442 ± 0.425
SerAsn: 3.549 ± 0.71
SerPro: 1.886 ± 0.47
SerGln: 2.662 ± 0.561
SerArg: 1.886 ± 0.427
SerSer: 2.329 ± 0.685
SerThr: 1.22 ± 0.339
SerVal: 4.991 ± 0.871
SerTrp: 0.555 ± 0.165
SerTyr: 2.44 ± 0.446
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.664 ± 0.446
ThrCys: 0.222 ± 0.165
ThrAsp: 2.107 ± 0.444
ThrGlu: 2.773 ± 0.476
ThrPhe: 0.776 ± 0.215
ThrGly: 1.775 ± 0.575
ThrHis: 0.998 ± 0.315
ThrIle: 3.438 ± 0.718
ThrLys: 4.437 ± 1.093
ThrLeu: 4.991 ± 0.77
ThrMet: 0.887 ± 0.352
ThrAsn: 3.217 ± 0.378
ThrPro: 3.106 ± 0.414
ThrGln: 3.106 ± 0.738
ThrArg: 2.107 ± 0.561
ThrSer: 4.437 ± 0.717
ThrThr: 2.884 ± 0.623
ThrVal: 0.555 ± 0.294
ThrTrp: 0.555 ± 0.259
ThrTyr: 1.331 ± 0.365
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.773 ± 0.66
ValCys: 0.222 ± 0.165
ValAsp: 1.996 ± 0.433
ValGlu: 2.329 ± 0.408
ValPhe: 2.884 ± 0.614
ValGly: 3.217 ± 0.577
ValHis: 0.222 ± 0.139
ValIle: 3.438 ± 0.602
ValLys: 3.993 ± 0.533
ValLeu: 4.88 ± 0.98
ValMet: 0.555 ± 0.282
ValAsn: 2.218 ± 0.46
ValPro: 0.887 ± 0.338
ValGln: 0.887 ± 0.278
ValArg: 1.996 ± 0.441
ValSer: 3.106 ± 0.77
ValThr: 1.664 ± 0.308
ValVal: 2.329 ± 0.583
ValTrp: 0.222 ± 0.119
ValTyr: 0.776 ± 0.327
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.111 ± 0.113
TrpCys: 0.111 ± 0.134
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.444 ± 0.263
TrpPhe: 0.0 ± 0.0
TrpGly: 0.555 ± 0.214
TrpHis: 0.222 ± 0.162
TrpIle: 0.444 ± 0.263
TrpLys: 0.333 ± 0.136
TrpLeu: 0.222 ± 0.162
TrpMet: 0.222 ± 0.157
TrpAsn: 0.333 ± 0.221
TrpPro: 0.0 ± 0.0
TrpGln: 0.222 ± 0.12
TrpArg: 0.222 ± 0.131
TrpSer: 0.444 ± 0.243
TrpThr: 0.222 ± 0.134
TrpVal: 0.887 ± 0.299
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.222 ± 0.209
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.886 ± 0.525
TyrCys: 0.222 ± 0.152
TyrAsp: 1.442 ± 0.391
TyrGlu: 3.217 ± 0.613
TyrPhe: 2.44 ± 0.816
TyrGly: 0.998 ± 0.261
TyrHis: 0.998 ± 0.309
TyrIle: 2.662 ± 0.623
TyrLys: 3.993 ± 0.661
TyrLeu: 3.217 ± 0.759
TyrMet: 0.665 ± 0.225
TyrAsn: 2.44 ± 0.449
TyrPro: 1.109 ± 0.27
TyrGln: 2.44 ± 0.338
TyrArg: 1.22 ± 0.307
TyrSer: 2.107 ± 0.492
TyrThr: 1.331 ± 0.433
TyrVal: 0.222 ± 0.161
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.22 ± 0.348
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 31 proteins (9017 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
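One way a protein-level bootstrap of this kind could look is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is reported as the error. The function names, the fixed seed, and the use of the sample standard deviation are assumptions for illustration; the source does not state the exact procedure behind the ± values.

```python
import random
from collections import Counter
from statistics import stdev


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Estimate the error (per mille) of one dipeptide frequency.

    Proteins, not individual residues, are resampled with replacement.
    n_boot must be at least 2 for the standard deviation to be defined.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


# Toy one-letter sequences, not taken from the phage proteome.
toy = ["MKLLEKL", "AKLE", "MKKKLLN", "MSNKLE"]
err = bootstrap_error(toy, "KL")
```

Resampling whole proteins keeps the dependence between neighbouring dipeptides within a protein intact, which resampling individual dipeptides would not.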

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski