Amino acid dipepetide frequency for Aeromonas phage 62AhydR11PP

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
14.009AlaAla: 14.009 ± 1.318
0.82AlaCys: 0.82 ± 0.189
6.408AlaAsp: 6.408 ± 0.628
8.793AlaGlu: 8.793 ± 1.324
3.204AlaPhe: 3.204 ± 0.475
9.613AlaGly: 9.613 ± 1.043
1.863AlaHis: 1.863 ± 0.407
6.706AlaIle: 6.706 ± 0.794
5.589AlaLys: 5.589 ± 0.686
7.824AlaLeu: 7.824 ± 0.751
3.428AlaMet: 3.428 ± 0.499
4.024AlaAsn: 4.024 ± 0.582
4.769AlaPro: 4.769 ± 0.593
4.694AlaGln: 4.694 ± 0.653
5.365AlaArg: 5.365 ± 0.745
5.216AlaSer: 5.216 ± 0.751
6.93AlaThr: 6.93 ± 0.786
6.632AlaVal: 6.632 ± 0.815
1.788AlaTrp: 1.788 ± 0.329
3.13AlaTyr: 3.13 ± 0.417
0.0AlaXaa: 0.0 ± 0.0
Cys
1.416CysAla: 1.416 ± 0.353
0.075CysCys: 0.075 ± 0.069
0.969CysAsp: 0.969 ± 0.263
0.969CysGlu: 0.969 ± 0.265
0.447CysPhe: 0.447 ± 0.153
0.894CysGly: 0.894 ± 0.265
0.224CysHis: 0.224 ± 0.121
0.447CysIle: 0.447 ± 0.196
0.894CysLys: 0.894 ± 0.261
1.043CysLeu: 1.043 ± 0.362
0.224CysMet: 0.224 ± 0.118
0.447CysAsn: 0.447 ± 0.172
0.82CysPro: 0.82 ± 0.256
0.224CysGln: 0.224 ± 0.119
1.192CysArg: 1.192 ± 0.284
0.745CysSer: 0.745 ± 0.196
0.82CysThr: 0.82 ± 0.229
0.82CysVal: 0.82 ± 0.205
0.149CysTrp: 0.149 ± 0.118
0.522CysTyr: 0.522 ± 0.201
0.0CysXaa: 0.0 ± 0.0
Asp
7.079AspAla: 7.079 ± 0.672
1.192AspCys: 1.192 ± 0.275
2.832AspAsp: 2.832 ± 0.572
3.353AspGlu: 3.353 ± 0.45
2.534AspPhe: 2.534 ± 0.407
5.887AspGly: 5.887 ± 0.686
1.565AspHis: 1.565 ± 0.304
3.428AspIle: 3.428 ± 0.424
2.683AspLys: 2.683 ± 0.513
3.577AspLeu: 3.577 ± 0.507
2.012AspMet: 2.012 ± 0.371
2.161AspAsn: 2.161 ± 0.445
3.502AspPro: 3.502 ± 0.484
1.565AspGln: 1.565 ± 0.333
3.204AspArg: 3.204 ± 0.451
3.353AspSer: 3.353 ± 0.494
2.981AspThr: 2.981 ± 0.512
3.726AspVal: 3.726 ± 0.559
1.565AspTrp: 1.565 ± 0.272
1.714AspTyr: 1.714 ± 0.315
0.0AspXaa: 0.0 ± 0.0
Glu
7.601GluAla: 7.601 ± 1.045
1.043GluCys: 1.043 ± 0.34
1.937GluAsp: 1.937 ± 0.339
3.502GluGlu: 3.502 ± 0.733
3.13GluPhe: 3.13 ± 0.506
3.875GluGly: 3.875 ± 0.601
1.043GluHis: 1.043 ± 0.284
3.428GluIle: 3.428 ± 0.605
3.353GluLys: 3.353 ± 0.561
7.154GluLeu: 7.154 ± 0.762
2.981GluMet: 2.981 ± 0.489
1.416GluAsn: 1.416 ± 0.267
2.459GluPro: 2.459 ± 0.377
3.502GluGln: 3.502 ± 0.608
3.875GluArg: 3.875 ± 0.625
2.683GluSer: 2.683 ± 0.337
2.534GluThr: 2.534 ± 0.387
2.981GluVal: 2.981 ± 0.405
1.416GluTrp: 1.416 ± 0.311
1.937GluTyr: 1.937 ± 0.302
0.0GluXaa: 0.0 ± 0.0
Phe
3.13PheAla: 3.13 ± 0.474
0.745PheCys: 0.745 ± 0.195
2.832PheAsp: 2.832 ± 0.414
1.639PheGlu: 1.639 ± 0.286
1.341PhePhe: 1.341 ± 0.24
3.13PheGly: 3.13 ± 0.571
0.522PheHis: 0.522 ± 0.259
1.043PheIle: 1.043 ± 0.244
1.49PheLys: 1.49 ± 0.278
2.385PheLeu: 2.385 ± 0.418
1.192PheMet: 1.192 ± 0.281
2.534PheAsn: 2.534 ± 0.399
1.714PhePro: 1.714 ± 0.375
1.416PheGln: 1.416 ± 0.28
1.49PheArg: 1.49 ± 0.303
2.31PheSer: 2.31 ± 0.429
2.235PheThr: 2.235 ± 0.371
2.086PheVal: 2.086 ± 0.416
0.596PheTrp: 0.596 ± 0.241
1.341PheTyr: 1.341 ± 0.274
0.0PheXaa: 0.0 ± 0.0
Gly
6.408GlyAla: 6.408 ± 0.758
1.267GlyCys: 1.267 ± 0.315
4.918GlyAsp: 4.918 ± 0.57
4.471GlyGlu: 4.471 ± 0.539
4.173GlyPhe: 4.173 ± 0.745
6.483GlyGly: 6.483 ± 0.755
1.118GlyHis: 1.118 ± 0.309
4.396GlyIle: 4.396 ± 0.56
5.365GlyLys: 5.365 ± 0.589
5.887GlyLeu: 5.887 ± 0.637
1.937GlyMet: 1.937 ± 0.379
2.31GlyAsn: 2.31 ± 0.391
2.086GlyPro: 2.086 ± 0.376
3.949GlyGln: 3.949 ± 0.505
3.949GlyArg: 3.949 ± 0.481
4.62GlySer: 4.62 ± 0.683
4.471GlyThr: 4.471 ± 0.622
6.483GlyVal: 6.483 ± 0.511
1.639GlyTrp: 1.639 ± 0.318
2.683GlyTyr: 2.683 ± 0.753
0.0GlyXaa: 0.0 ± 0.0
His
1.118HisAla: 1.118 ± 0.284
0.596HisCys: 0.596 ± 0.175
1.043HisAsp: 1.043 ± 0.339
0.745HisGlu: 0.745 ± 0.199
0.522HisPhe: 0.522 ± 0.196
1.49HisGly: 1.49 ± 0.309
0.373HisHis: 0.373 ± 0.188
0.969HisIle: 0.969 ± 0.264
0.82HisLys: 0.82 ± 0.226
1.192HisLeu: 1.192 ± 0.254
0.522HisMet: 0.522 ± 0.197
0.447HisAsn: 0.447 ± 0.174
0.745HisPro: 0.745 ± 0.279
0.82HisGln: 0.82 ± 0.251
1.267HisArg: 1.267 ± 0.306
0.894HisSer: 0.894 ± 0.281
1.118HisThr: 1.118 ± 0.28
0.969HisVal: 0.969 ± 0.254
0.522HisTrp: 0.522 ± 0.191
0.82HisTyr: 0.82 ± 0.245
0.0HisXaa: 0.0 ± 0.0
Ile
4.918IleAla: 4.918 ± 0.57
0.298IleCys: 0.298 ± 0.146
4.396IleAsp: 4.396 ± 0.489
4.322IleGlu: 4.322 ± 0.456
1.49IlePhe: 1.49 ± 0.319
3.949IleGly: 3.949 ± 0.659
0.969IleHis: 0.969 ± 0.307
2.385IleIle: 2.385 ± 0.459
3.875IleLys: 3.875 ± 0.631
3.279IleLeu: 3.279 ± 0.373
1.192IleMet: 1.192 ± 0.294
2.385IleAsn: 2.385 ± 0.444
2.161IlePro: 2.161 ± 0.424
2.235IleGln: 2.235 ± 0.508
2.981IleArg: 2.981 ± 0.395
2.981IleSer: 2.981 ± 0.472
4.471IleThr: 4.471 ± 0.659
3.13IleVal: 3.13 ± 0.331
0.298IleTrp: 0.298 ± 0.134
1.416IleTyr: 1.416 ± 0.319
0.0IleXaa: 0.0 ± 0.0
Lys
6.557LysAla: 6.557 ± 0.689
0.596LysCys: 0.596 ± 0.202
3.279LysAsp: 3.279 ± 0.421
3.279LysGlu: 3.279 ± 0.533
1.639LysPhe: 1.639 ± 0.413
3.502LysGly: 3.502 ± 0.579
1.043LysHis: 1.043 ± 0.289
2.235LysIle: 2.235 ± 0.378
3.279LysLys: 3.279 ± 0.526
4.471LysLeu: 4.471 ± 0.668
2.235LysMet: 2.235 ± 0.357
1.565LysAsn: 1.565 ± 0.36
2.683LysPro: 2.683 ± 0.364
2.31LysGln: 2.31 ± 0.414
3.279LysArg: 3.279 ± 0.573
2.534LysSer: 2.534 ± 0.525
2.608LysThr: 2.608 ± 0.506
3.055LysVal: 3.055 ± 0.43
0.522LysTrp: 0.522 ± 0.223
0.745LysTyr: 0.745 ± 0.22
0.0LysXaa: 0.0 ± 0.0
Leu
9.463LeuAla: 9.463 ± 0.718
0.745LeuCys: 0.745 ± 0.257
4.322LeuAsp: 4.322 ± 0.578
4.993LeuGlu: 4.993 ± 0.604
1.937LeuPhe: 1.937 ± 0.332
5.291LeuGly: 5.291 ± 0.685
1.118LeuHis: 1.118 ± 0.267
3.428LeuIle: 3.428 ± 0.501
3.13LeuLys: 3.13 ± 0.484
4.694LeuLeu: 4.694 ± 0.721
1.267LeuMet: 1.267 ± 0.282
3.651LeuAsn: 3.651 ± 0.596
3.651LeuPro: 3.651 ± 0.507
2.832LeuGln: 2.832 ± 0.449
5.142LeuArg: 5.142 ± 0.66
4.993LeuSer: 4.993 ± 0.808
5.291LeuThr: 5.291 ± 0.491
6.11LeuVal: 6.11 ± 0.663
0.671LeuTrp: 0.671 ± 0.22
2.012LeuTyr: 2.012 ± 0.395
0.0LeuXaa: 0.0 ± 0.0
Met
4.098MetAla: 4.098 ± 0.586
0.596MetCys: 0.596 ± 0.197
1.267MetAsp: 1.267 ± 0.278
1.565MetGlu: 1.565 ± 0.339
0.745MetPhe: 0.745 ± 0.239
2.31MetGly: 2.31 ± 0.409
0.447MetHis: 0.447 ± 0.198
1.639MetIle: 1.639 ± 0.317
1.565MetLys: 1.565 ± 0.329
2.235MetLeu: 2.235 ± 0.434
0.522MetMet: 0.522 ± 0.208
1.043MetAsn: 1.043 ± 0.281
1.192MetPro: 1.192 ± 0.325
1.043MetGln: 1.043 ± 0.232
1.639MetArg: 1.639 ± 0.34
2.608MetSer: 2.608 ± 0.337
2.012MetThr: 2.012 ± 0.478
1.788MetVal: 1.788 ± 0.348
0.298MetTrp: 0.298 ± 0.165
0.298MetTyr: 0.298 ± 0.12
0.0MetXaa: 0.0 ± 0.0
Asn
4.396AsnAla: 4.396 ± 0.615
0.224AsnCys: 0.224 ± 0.153
1.639AsnAsp: 1.639 ± 0.228
1.341AsnGlu: 1.341 ± 0.304
1.118AsnPhe: 1.118 ± 0.3
3.055AsnGly: 3.055 ± 0.618
0.447AsnHis: 0.447 ± 0.195
1.863AsnIle: 1.863 ± 0.375
2.385AsnLys: 2.385 ± 0.367
2.906AsnLeu: 2.906 ± 0.516
0.894AsnMet: 0.894 ± 0.222
1.341AsnAsn: 1.341 ± 0.354
2.683AsnPro: 2.683 ± 0.383
1.341AsnGln: 1.341 ± 0.333
2.235AsnArg: 2.235 ± 0.457
2.757AsnSer: 2.757 ± 0.485
2.757AsnThr: 2.757 ± 0.457
2.534AsnVal: 2.534 ± 0.45
0.82AsnTrp: 0.82 ± 0.237
0.522AsnTyr: 0.522 ± 0.18
0.0AsnXaa: 0.0 ± 0.0
Pro
5.44ProAla: 5.44 ± 0.457
0.671ProCys: 0.671 ± 0.241
3.875ProAsp: 3.875 ± 0.545
3.502ProGlu: 3.502 ± 0.496
1.341ProPhe: 1.341 ± 0.295
4.098ProGly: 4.098 ± 0.736
0.373ProHis: 0.373 ± 0.159
2.012ProIle: 2.012 ± 0.361
1.49ProLys: 1.49 ± 0.282
3.13ProLeu: 3.13 ± 0.5
0.894ProMet: 0.894 ± 0.273
1.192ProAsn: 1.192 ± 0.301
2.012ProPro: 2.012 ± 0.347
1.49ProGln: 1.49 ± 0.323
1.639ProArg: 1.639 ± 0.371
3.055ProSer: 3.055 ± 0.423
2.981ProThr: 2.981 ± 0.466
3.353ProVal: 3.353 ± 0.561
0.447ProTrp: 0.447 ± 0.173
1.863ProTyr: 1.863 ± 0.371
0.0ProXaa: 0.0 ± 0.0
Gln
5.663GlnAla: 5.663 ± 0.642
0.522GlnCys: 0.522 ± 0.204
2.161GlnAsp: 2.161 ± 0.535
2.012GlnGlu: 2.012 ± 0.379
1.043GlnPhe: 1.043 ± 0.208
2.608GlnGly: 2.608 ± 0.549
0.969GlnHis: 0.969 ± 0.225
2.832GlnIle: 2.832 ± 0.501
1.639GlnLys: 1.639 ± 0.372
3.577GlnLeu: 3.577 ± 0.468
1.043GlnMet: 1.043 ± 0.284
1.788GlnAsn: 1.788 ± 0.294
1.416GlnPro: 1.416 ± 0.313
2.906GlnGln: 2.906 ± 0.568
2.608GlnArg: 2.608 ± 0.528
3.055GlnSer: 3.055 ± 0.534
1.788GlnThr: 1.788 ± 0.294
3.204GlnVal: 3.204 ± 0.45
0.82GlnTrp: 0.82 ± 0.214
1.341GlnTyr: 1.341 ± 0.373
0.0GlnXaa: 0.0 ± 0.0
Arg
5.961ArgAla: 5.961 ± 0.801
0.82ArgCys: 0.82 ± 0.215
3.13ArgAsp: 3.13 ± 0.508
3.279ArgGlu: 3.279 ± 0.59
2.31ArgPhe: 2.31 ± 0.392
3.204ArgGly: 3.204 ± 0.491
1.267ArgHis: 1.267 ± 0.293
2.981ArgIle: 2.981 ± 0.355
2.757ArgLys: 2.757 ± 0.516
4.173ArgLeu: 4.173 ± 0.507
2.086ArgMet: 2.086 ± 0.414
1.937ArgAsn: 1.937 ± 0.28
2.534ArgPro: 2.534 ± 0.392
2.906ArgGln: 2.906 ± 0.464
3.726ArgArg: 3.726 ± 0.698
4.396ArgSer: 4.396 ± 0.513
2.608ArgThr: 2.608 ± 0.457
3.651ArgVal: 3.651 ± 0.521
1.043ArgTrp: 1.043 ± 0.219
1.937ArgTyr: 1.937 ± 0.423
0.0ArgXaa: 0.0 ± 0.0
Ser
6.706SerAla: 6.706 ± 0.672
0.969SerCys: 0.969 ± 0.247
3.279SerAsp: 3.279 ± 0.574
3.726SerGlu: 3.726 ± 0.605
1.937SerPhe: 1.937 ± 0.439
5.44SerGly: 5.44 ± 0.6
0.894SerHis: 0.894 ± 0.265
4.545SerIle: 4.545 ± 0.585
3.279SerLys: 3.279 ± 0.537
3.353SerLeu: 3.353 ± 0.451
1.267SerMet: 1.267 ± 0.292
2.459SerAsn: 2.459 ± 0.359
2.086SerPro: 2.086 ± 0.378
2.683SerGln: 2.683 ± 0.568
2.981SerArg: 2.981 ± 0.369
4.322SerSer: 4.322 ± 0.64
3.204SerThr: 3.204 ± 0.572
5.514SerVal: 5.514 ± 0.796
1.49SerTrp: 1.49 ± 0.414
1.49SerTyr: 1.49 ± 0.336
0.0SerXaa: 0.0 ± 0.0
Thr
5.663ThrAla: 5.663 ± 0.73
0.447ThrCys: 0.447 ± 0.219
3.204ThrAsp: 3.204 ± 0.639
3.502ThrGlu: 3.502 ± 0.623
1.788ThrPhe: 1.788 ± 0.374
5.738ThrGly: 5.738 ± 0.596
1.416ThrHis: 1.416 ± 0.301
3.279ThrIle: 3.279 ± 0.401
1.937ThrLys: 1.937 ± 0.391
5.44ThrLeu: 5.44 ± 0.787
1.118ThrMet: 1.118 ± 0.276
1.937ThrAsn: 1.937 ± 0.354
4.024ThrPro: 4.024 ± 0.514
1.565ThrGln: 1.565 ± 0.336
2.683ThrArg: 2.683 ± 0.425
3.502ThrSer: 3.502 ± 0.484
2.683ThrThr: 2.683 ± 0.402
4.396ThrVal: 4.396 ± 0.691
0.894ThrTrp: 0.894 ± 0.273
1.788ThrTyr: 1.788 ± 0.359
0.0ThrXaa: 0.0 ± 0.0
Val
7.303ValAla: 7.303 ± 0.81
0.82ValCys: 0.82 ± 0.256
5.067ValAsp: 5.067 ± 0.641
5.291ValGlu: 5.291 ± 0.697
2.459ValPhe: 2.459 ± 0.494
4.694ValGly: 4.694 ± 1.049
0.671ValHis: 0.671 ± 0.203
3.577ValIle: 3.577 ± 0.489
4.173ValLys: 4.173 ± 0.537
4.993ValLeu: 4.993 ± 0.647
2.385ValMet: 2.385 ± 0.373
2.981ValAsn: 2.981 ± 0.509
2.459ValPro: 2.459 ± 0.5
2.459ValGln: 2.459 ± 0.329
3.502ValArg: 3.502 ± 0.49
4.694ValSer: 4.694 ± 0.518
3.875ValThr: 3.875 ± 0.553
5.067ValVal: 5.067 ± 0.554
0.969ValTrp: 0.969 ± 0.263
1.714ValTyr: 1.714 ± 0.385
0.0ValXaa: 0.0 ± 0.0
Trp
1.863TrpAla: 1.863 ± 0.405
0.522TrpCys: 0.522 ± 0.227
1.565TrpAsp: 1.565 ± 0.321
0.745TrpGlu: 0.745 ± 0.204
0.894TrpPhe: 0.894 ± 0.261
1.043TrpGly: 1.043 ± 0.273
0.298TrpHis: 0.298 ± 0.145
0.671TrpIle: 0.671 ± 0.187
0.224TrpLys: 0.224 ± 0.116
1.192TrpLeu: 1.192 ± 0.256
0.745TrpMet: 0.745 ± 0.246
0.298TrpAsn: 0.298 ± 0.148
1.043TrpPro: 1.043 ± 0.247
0.969TrpGln: 0.969 ± 0.255
1.267TrpArg: 1.267 ± 0.271
0.894TrpSer: 0.894 ± 0.297
0.522TrpThr: 0.522 ± 0.161
1.267TrpVal: 1.267 ± 0.299
0.298TrpTrp: 0.298 ± 0.143
0.447TrpTyr: 0.447 ± 0.183
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.012TyrAla: 2.012 ± 0.352
0.373TyrCys: 0.373 ± 0.149
1.788TyrAsp: 1.788 ± 0.347
0.82TyrGlu: 0.82 ± 0.249
1.118TyrPhe: 1.118 ± 0.239
2.31TyrGly: 2.31 ± 0.344
0.373TyrHis: 0.373 ± 0.201
1.267TyrIle: 1.267 ± 0.216
1.565TyrLys: 1.565 ± 0.286
2.235TyrLeu: 2.235 ± 0.451
0.894TyrMet: 0.894 ± 0.187
1.341TyrAsn: 1.341 ± 0.395
0.82TyrPro: 0.82 ± 0.2
2.086TyrGln: 2.086 ± 0.353
2.683TyrArg: 2.683 ± 0.469
2.012TyrSer: 2.012 ± 0.322
1.118TyrThr: 1.118 ± 0.337
2.385TyrVal: 2.385 ± 0.418
0.522TyrTrp: 0.522 ± 0.207
1.043TyrTyr: 1.043 ± 0.286
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 64 proteins (13421 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski