Amino acid dipeptide frequency for Vibrio phage H188

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be expected at a frequency of 0.25% (2.5‰). Because this is not the case in nature, dipeptides that are more frequent than expected are marked in red in the table, and underrepresented ones are marked in blue, for better visibility.
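As an illustration of the idea, the following is a minimal sketch of how dipeptide frequencies could be counted and compared with the uniform expectation. It is not the database's own code. The function names, the use of one-letter residue codes, and the normalization by the total number of dipeptides are assumptions made for this example; the database may normalize differently.

```python
from collections import Counter

# Uniform expectation for 400 dipeptides, in per mille (0.25% = 2.5 per mille).
UNIFORM_EXPECTATION = 1000.0 / 400


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumption: overlapping pairs are counted within each protein only
    (no pair spans two proteins) and normalized by the total pair count.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille, expected=UNIFORM_EXPECTATION):
    """Label a frequency relative to the expectation ("over" ~ red, "under" ~ blue)."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# Toy input (hypothetical sequences, not taken from the proteome).
freqs = dipeptide_per_mille(["MKAA", "AAK"])
labels = {pair: classify(f) for pair, f in freqs.items()}
```

With the values from the table, classify(6.334) gives "over" for AlaAla and classify(0.326) gives "under" for CysCys.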

All values are given in per mille (‰), so they must be multiplied by 10⁻³ to obtain fractions.
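The unit conversion can be written out as a short sketch; the helper names below are hypothetical and chosen only for this example.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille / 1000.0


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to percent (1 percent = 10 per mille)."""
    return value_per_mille / 10.0


# AlaAla is listed as 6.334 per mille in the table.
ala_ala_fraction = per_mille_to_fraction(6.334)
ala_ala_percent = per_mille_to_percent(6.334)
```

For AlaAla this corresponds to a fraction of about 0.0063, or about 0.63%.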

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.334 ± 0.713
AlaCys: 0.849 ± 0.225
AlaAsp: 4.767 ± 0.539
AlaGlu: 5.289 ± 0.914
AlaPhe: 2.285 ± 0.351
AlaGly: 5.681 ± 0.628
AlaHis: 1.241 ± 0.262
AlaIle: 5.485 ± 0.596
AlaLys: 5.877 ± 0.723
AlaLeu: 6.595 ± 0.793
AlaMet: 2.22 ± 0.363
AlaAsn: 4.963 ± 0.52
AlaPro: 2.873 ± 0.379
AlaGln: 3.983 ± 0.522
AlaArg: 2.743 ± 0.391
AlaSer: 5.42 ± 0.701
AlaThr: 5.355 ± 0.536
AlaVal: 5.028 ± 0.786
AlaTrp: 1.306 ± 0.317
AlaTyr: 2.481 ± 0.428
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.849 ± 0.225
CysCys: 0.326 ± 0.126
CysAsp: 1.11 ± 0.303
CysGlu: 0.784 ± 0.202
CysPhe: 0.196 ± 0.122
CysGly: 1.306 ± 0.321
CysHis: 0.653 ± 0.189
CysIle: 0.392 ± 0.156
CysLys: 1.437 ± 0.33
CysLeu: 0.914 ± 0.305
CysMet: 0.196 ± 0.092
CysAsn: 0.784 ± 0.227
CysPro: 1.045 ± 0.263
CysGln: 0.457 ± 0.184
CysArg: 0.522 ± 0.201
CysSer: 0.914 ± 0.268
CysThr: 0.588 ± 0.239
CysVal: 0.849 ± 0.179
CysTrp: 0.261 ± 0.135
CysTyr: 0.457 ± 0.181
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.746 ± 0.598
AspCys: 0.653 ± 0.227
AspAsp: 2.808 ± 0.46
AspGlu: 3.2 ± 0.479
AspPhe: 3.265 ± 0.449
AspGly: 7.509 ± 0.872
AspHis: 0.914 ± 0.212
AspIle: 4.571 ± 0.566
AspLys: 3.787 ± 0.57
AspLeu: 4.636 ± 0.454
AspMet: 1.763 ± 0.328
AspAsn: 3.396 ± 0.422
AspPro: 2.22 ± 0.45
AspGln: 1.371 ± 0.316
AspArg: 3.069 ± 0.429
AspSer: 4.049 ± 0.49
AspThr: 3.265 ± 0.46
AspVal: 4.375 ± 0.51
AspTrp: 0.849 ± 0.248
AspTyr: 2.285 ± 0.433
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.224 ± 0.716
GluCys: 0.849 ± 0.282
GluAsp: 2.808 ± 0.374
GluGlu: 3.853 ± 0.523
GluPhe: 3.33 ± 0.495
GluGly: 3.526 ± 0.423
GluHis: 1.11 ± 0.27
GluIle: 4.049 ± 0.478
GluLys: 4.114 ± 0.515
GluLeu: 7.248 ± 0.895
GluMet: 2.547 ± 0.464
GluAsn: 2.155 ± 0.29
GluPro: 1.502 ± 0.353
GluGln: 3.265 ± 0.538
GluArg: 2.938 ± 0.516
GluSer: 4.31 ± 0.512
GluThr: 4.114 ± 0.556
GluVal: 4.897 ± 0.679
GluTrp: 1.306 ± 0.297
GluTyr: 2.155 ± 0.363
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.069 ± 0.471
PheCys: 0.326 ± 0.101
PheAsp: 3.265 ± 0.451
PheGlu: 2.808 ± 0.415
PhePhe: 1.241 ± 0.272
PheGly: 2.677 ± 0.456
PheHis: 0.196 ± 0.105
PheIle: 3.787 ± 0.456
PheLys: 2.808 ± 0.419
PheLeu: 2.09 ± 0.291
PheMet: 1.045 ± 0.31
PheAsn: 2.612 ± 0.366
PhePro: 1.045 ± 0.188
PheGln: 0.718 ± 0.224
PheArg: 1.241 ± 0.233
PheSer: 3.134 ± 0.456
PheThr: 2.416 ± 0.419
PheVal: 2.547 ± 0.511
PheTrp: 0.261 ± 0.128
PheTyr: 1.045 ± 0.267
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.073 ± 0.737
GlyCys: 1.306 ± 0.386
GlyAsp: 5.159 ± 0.591
GlyGlu: 5.028 ± 0.555
GlyPhe: 2.938 ± 0.408
GlyGly: 5.55 ± 0.559
GlyHis: 1.632 ± 0.384
GlyIle: 4.636 ± 0.676
GlyLys: 5.093 ± 0.579
GlyLeu: 6.138 ± 0.626
GlyMet: 1.632 ± 0.324
GlyAsn: 4.179 ± 0.56
GlyPro: 0.979 ± 0.226
GlyGln: 2.677 ± 0.408
GlyArg: 2.873 ± 0.431
GlySer: 5.681 ± 0.76
GlyThr: 3.918 ± 0.626
GlyVal: 5.877 ± 0.561
GlyTrp: 0.979 ± 0.201
GlyTyr: 2.416 ± 0.412
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.914 ± 0.261
HisCys: 0.392 ± 0.172
HisAsp: 1.567 ± 0.296
HisGlu: 0.979 ± 0.234
HisPhe: 0.457 ± 0.194
HisGly: 1.894 ± 0.521
HisHis: 0.653 ± 0.207
HisIle: 0.392 ± 0.156
HisLys: 1.437 ± 0.279
HisLeu: 1.371 ± 0.316
HisMet: 0.457 ± 0.197
HisAsn: 1.11 ± 0.238
HisPro: 0.653 ± 0.192
HisGln: 0.588 ± 0.18
HisArg: 0.522 ± 0.164
HisSer: 1.11 ± 0.273
HisThr: 0.653 ± 0.2
HisVal: 1.045 ± 0.234
HisTrp: 0.196 ± 0.108
HisTyr: 1.11 ± 0.253
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.42 ± 0.626
IleCys: 0.979 ± 0.291
IleAsp: 5.616 ± 0.619
IleGlu: 5.159 ± 0.675
IlePhe: 2.155 ± 0.49
IleGly: 5.093 ± 0.581
IleHis: 0.914 ± 0.251
IleIle: 3.265 ± 0.44
IleLys: 4.571 ± 0.56
IleLeu: 3.526 ± 0.503
IleMet: 0.653 ± 0.207
IleAsn: 3.983 ± 0.419
IlePro: 2.416 ± 0.355
IleGln: 2.155 ± 0.361
IleArg: 1.763 ± 0.309
IleSer: 3.983 ± 0.544
IleThr: 5.355 ± 0.519
IleVal: 3.2 ± 0.388
IleTrp: 0.522 ± 0.209
IleTyr: 1.567 ± 0.315
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.922 ± 0.772
LysCys: 0.784 ± 0.207
LysAsp: 3.526 ± 0.45
LysGlu: 4.506 ± 0.578
LysPhe: 2.416 ± 0.342
LysGly: 3.461 ± 0.516
LysHis: 2.285 ± 0.355
LysIle: 4.179 ± 0.645
LysLys: 3.918 ± 0.634
LysLeu: 5.616 ± 0.64
LysMet: 2.612 ± 0.425
LysAsn: 3.069 ± 0.463
LysPro: 3.657 ± 0.442
LysGln: 2.547 ± 0.446
LysArg: 3.2 ± 0.495
LysSer: 4.571 ± 0.719
LysThr: 4.832 ± 0.587
LysVal: 3.722 ± 0.541
LysTrp: 1.11 ± 0.327
LysTyr: 1.632 ± 0.339
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.355 ± 0.502
LeuCys: 1.11 ± 0.265
LeuAsp: 5.289 ± 0.572
LeuGlu: 4.702 ± 0.595
LeuPhe: 2.22 ± 0.369
LeuGly: 4.636 ± 0.49
LeuHis: 0.849 ± 0.193
LeuIle: 5.746 ± 0.603
LeuLys: 4.897 ± 0.442
LeuLeu: 4.636 ± 0.462
LeuMet: 1.763 ± 0.341
LeuAsn: 4.179 ± 0.5
LeuPro: 2.938 ± 0.472
LeuGln: 2.808 ± 0.38
LeuArg: 3.265 ± 0.422
LeuSer: 5.355 ± 0.617
LeuThr: 4.506 ± 0.584
LeuVal: 5.746 ± 0.625
LeuTrp: 0.784 ± 0.213
LeuTyr: 2.285 ± 0.38
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.155 ± 0.44
MetCys: 0.326 ± 0.142
MetAsp: 1.371 ± 0.331
MetGlu: 1.437 ± 0.347
MetPhe: 0.979 ± 0.286
MetGly: 2.22 ± 0.529
MetHis: 0.653 ± 0.218
MetIle: 1.698 ± 0.342
MetLys: 2.743 ± 0.386
MetLeu: 1.632 ± 0.329
MetMet: 0.784 ± 0.201
MetAsn: 1.763 ± 0.345
MetPro: 1.175 ± 0.287
MetGln: 0.784 ± 0.218
MetArg: 1.437 ± 0.29
MetSer: 2.285 ± 0.398
MetThr: 2.024 ± 0.409
MetVal: 1.11 ± 0.243
MetTrp: 0.065 ± 0.063
MetTyr: 0.457 ± 0.141
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.702 ± 0.583
AsnCys: 0.718 ± 0.217
AsnAsp: 3.134 ± 0.45
AsnGlu: 3.265 ± 0.417
AsnPhe: 2.09 ± 0.26
AsnGly: 5.616 ± 0.715
AsnHis: 0.914 ± 0.248
AsnIle: 2.481 ± 0.502
AsnLys: 3.787 ± 0.57
AsnLeu: 3.591 ± 0.406
AsnMet: 1.698 ± 0.307
AsnAsn: 2.481 ± 0.375
AsnPro: 2.285 ± 0.48
AsnGln: 2.285 ± 0.395
AsnArg: 2.351 ± 0.327
AsnSer: 3.918 ± 0.473
AsnThr: 2.808 ± 0.418
AsnVal: 2.873 ± 0.455
AsnTrp: 0.653 ± 0.214
AsnTyr: 2.024 ± 0.3
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.416 ± 0.363
ProCys: 0.588 ± 0.207
ProAsp: 2.416 ± 0.422
ProGlu: 2.938 ± 0.363
ProPhe: 1.502 ± 0.286
ProGly: 1.437 ± 0.321
ProHis: 0.457 ± 0.166
ProIle: 2.22 ± 0.392
ProLys: 2.351 ± 0.374
ProLeu: 2.808 ± 0.474
ProMet: 1.437 ± 0.306
ProAsn: 2.612 ± 0.35
ProPro: 1.11 ± 0.275
ProGln: 1.371 ± 0.278
ProArg: 1.698 ± 0.384
ProSer: 2.351 ± 0.414
ProThr: 2.873 ± 0.496
ProVal: 3.265 ± 0.481
ProTrp: 0.718 ± 0.197
ProTyr: 0.979 ± 0.201
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.657 ± 0.465
GlnCys: 0.457 ± 0.167
GlnAsp: 1.959 ± 0.421
GlnGlu: 1.894 ± 0.281
GlnPhe: 1.11 ± 0.28
GlnGly: 2.155 ± 0.302
GlnHis: 0.653 ± 0.235
GlnIle: 1.959 ± 0.372
GlnLys: 2.09 ± 0.437
GlnLeu: 3.069 ± 0.69
GlnMet: 1.959 ± 0.324
GlnAsn: 1.763 ± 0.267
GlnPro: 1.894 ± 0.36
GlnGln: 1.437 ± 0.382
GlnArg: 2.285 ± 0.325
GlnSer: 2.938 ± 0.419
GlnThr: 2.351 ± 0.361
GlnVal: 2.481 ± 0.304
GlnTrp: 0.588 ± 0.189
GlnTyr: 1.371 ± 0.241
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.265 ± 0.434
ArgCys: 0.392 ± 0.167
ArgAsp: 3.2 ± 0.514
ArgGlu: 2.743 ± 0.532
ArgPhe: 1.959 ± 0.384
ArgGly: 2.09 ± 0.402
ArgHis: 0.718 ± 0.231
ArgIle: 2.612 ± 0.443
ArgLys: 3.591 ± 0.553
ArgLeu: 2.808 ± 0.495
ArgMet: 0.588 ± 0.197
ArgAsn: 2.024 ± 0.467
ArgPro: 1.437 ± 0.264
ArgGln: 1.763 ± 0.313
ArgArg: 1.763 ± 0.387
ArgSer: 2.22 ± 0.301
ArgThr: 2.612 ± 0.449
ArgVal: 3.004 ± 0.466
ArgTrp: 0.457 ± 0.158
ArgTyr: 1.828 ± 0.362
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.636 ± 0.591
SerCys: 1.175 ± 0.281
SerAsp: 4.244 ± 0.499
SerGlu: 4.44 ± 0.452
SerPhe: 3.33 ± 0.465
SerGly: 5.942 ± 0.621
SerHis: 0.784 ± 0.198
SerIle: 4.963 ± 0.465
SerLys: 5.159 ± 0.573
SerLeu: 4.44 ± 0.42
SerMet: 1.241 ± 0.265
SerAsn: 3.722 ± 0.523
SerPro: 2.351 ± 0.394
SerGln: 2.743 ± 0.352
SerArg: 2.09 ± 0.418
SerSer: 4.44 ± 0.608
SerThr: 3.853 ± 0.535
SerVal: 5.42 ± 0.721
SerTrp: 1.045 ± 0.216
SerTyr: 2.155 ± 0.454
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.746 ± 0.645
ThrCys: 1.045 ± 0.374
ThrAsp: 3.591 ± 0.526
ThrGlu: 3.265 ± 0.549
ThrPhe: 2.22 ± 0.356
ThrGly: 5.093 ± 0.624
ThrHis: 1.567 ± 0.323
ThrIle: 3.591 ± 0.479
ThrLys: 4.114 ± 0.495
ThrLeu: 4.832 ± 0.564
ThrMet: 1.567 ± 0.37
ThrAsn: 3.787 ± 0.502
ThrPro: 3.526 ± 0.47
ThrGln: 3.2 ± 0.392
ThrArg: 2.22 ± 0.358
ThrSer: 2.743 ± 0.444
ThrThr: 4.049 ± 0.552
ThrVal: 5.681 ± 0.701
ThrTrp: 0.718 ± 0.252
ThrTyr: 1.175 ± 0.218
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.42 ± 0.66
ValCys: 1.045 ± 0.24
ValAsp: 4.897 ± 0.481
ValGlu: 5.812 ± 0.794
ValPhe: 2.22 ± 0.327
ValGly: 5.42 ± 0.554
ValHis: 0.718 ± 0.224
ValIle: 3.591 ± 0.602
ValLys: 4.114 ± 0.523
ValLeu: 4.31 ± 0.438
ValMet: 1.828 ± 0.382
ValAsn: 3.526 ± 0.482
ValPro: 2.481 ± 0.304
ValGln: 2.22 ± 0.504
ValArg: 2.938 ± 0.358
ValSer: 5.355 ± 0.605
ValThr: 4.963 ± 0.742
ValVal: 4.767 ± 0.514
ValTrp: 0.522 ± 0.218
ValTyr: 2.285 ± 0.46
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.653 ± 0.157
TrpCys: 0.196 ± 0.113
TrpAsp: 0.718 ± 0.228
TrpGlu: 0.979 ± 0.284
TrpPhe: 0.784 ± 0.215
TrpGly: 0.718 ± 0.219
TrpHis: 0.196 ± 0.13
TrpIle: 0.979 ± 0.215
TrpLys: 0.914 ± 0.267
TrpLeu: 1.045 ± 0.244
TrpMet: 0.261 ± 0.134
TrpAsn: 0.588 ± 0.16
TrpPro: 0.457 ± 0.164
TrpGln: 0.392 ± 0.15
TrpArg: 0.522 ± 0.185
TrpSer: 0.914 ± 0.275
TrpThr: 0.718 ± 0.276
TrpVal: 1.175 ± 0.241
TrpTrp: 0.196 ± 0.099
TrpTyr: 0.392 ± 0.147
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.22 ± 0.376
TyrCys: 0.522 ± 0.257
TyrAsp: 2.285 ± 0.439
TyrGlu: 2.22 ± 0.393
TyrPhe: 1.632 ± 0.323
TyrGly: 2.743 ± 0.435
TyrHis: 0.392 ± 0.174
TyrIle: 1.698 ± 0.339
TyrLys: 1.828 ± 0.294
TyrLeu: 1.763 ± 0.327
TyrMet: 0.784 ± 0.2
TyrAsn: 0.979 ± 0.246
TyrPro: 1.502 ± 0.304
TyrGln: 1.306 ± 0.314
TyrArg: 1.632 ± 0.311
TyrSer: 2.481 ± 0.41
TyrThr: 2.481 ± 0.386
TyrVal: 1.371 ± 0.295
TyrTrp: 0.261 ± 0.103
TyrTyr: 0.914 ± 0.242
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 76 proteins (15315 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
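A protein-level bootstrap of this kind can be sketched as follows. This is an illustration under stated assumptions rather than the procedure actually used by the database: it assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each replicate, and that the reported error is the standard deviation across replicates. The function names and the one-letter residue codes are hypothetical.

```python
import random
import statistics


def pair_per_mille(sequences, pair):
    """Frequency of one dipeptide (per mille) over overlapping pairs within proteins."""
    hits = 0
    total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, replicates=100, seed=0):
    """Standard deviation of the frequency across protein-level bootstrap replicates.

    Assumption: each replicate draws len(sequences) whole proteins with replacement.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy input (hypothetical sequences, not taken from the proteome).
toy_proteins = ["MKAA", "AAK", "MKKA", "AAAA"]
toy_error = bootstrap_error(toy_proteins, "AA")
```

Resampling whole proteins rather than individual residues keeps the dipeptides of each protein together, so the estimate reflects variation between proteins.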

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski