Amino acid dipeptide frequency for Proteus phage P16-2532

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins randomly and with equal probability, each of the 400 would have a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
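The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's actual pipeline: the function names `dipeptide_permille` and `classify` are hypothetical, counting overlapping pairs within each protein and normalising by the total number of pairs counted are assumptions, and the uniform 2.5‰ baseline is taken from the paragraph above.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)

# Uniform expectation over the 400 standard dipeptides: 1/400 = 0.25 % = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / (len(STANDARD) ** 2)


def dipeptide_permille(proteins):
    """Return dipeptide frequencies in per mille for a list of protein sequences.

    Assumptions (not confirmed by the source): overlapping pairs are counted
    inside each protein, pairs never span two proteins, and the denominator
    is the total number of pairs counted.
    """
    counts = Counter()
    for seq in proteins:
        # Map every residue outside the 20 standard ones to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille, expected=EXPECTED_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


if __name__ == "__main__":
    toy = ["MKVLA", "MKZA"]  # toy sequences, not taken from the proteome
    print(dipeptide_permille(toy))
    print(classify(6.644))  # LeuAla value from the table
    print(classify(0.062))  # CysHis value from the table
```

With a real proteome, `proteins` would hold the sequences read from a FASTA file; the classification then mirrors the red/blue marking only under the uniform-baseline assumption stated above.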

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Rows: first residue; columns: second residue. Each cell gives frequency ± error in ‰.

| | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
| Ala | 3.167 ± 0.631 | 0.435 ± 0.212 | 4.595 ± 0.489 | 5.775 ± 0.548 | 3.602 ± 0.49 | 4.781 ± 0.621 | 0.931 ± 0.244 | 4.533 ± 0.642 | 5.527 ± 0.79 | 5.961 ± 0.665 | 2.235 ± 0.413 | 3.291 ± 0.454 | 2.608 ± 0.45 | 2.298 ± 0.348 | 4.223 ± 0.646 | 5.651 ± 0.667 | 4.657 ± 0.555 | 5.154 ± 0.561 | 1.428 ± 0.318 | 2.422 ± 0.381 | 0.0 ± 0.0 |
| Cys | 0.621 ± 0.174 | 0.0 ± 0.0 | 0.683 ± 0.241 | 0.745 ± 0.253 | 0.435 ± 0.167 | 0.745 ± 0.229 | 0.062 ± 0.061 | 0.248 ± 0.106 | 0.435 ± 0.135 | 0.621 ± 0.184 | 0.373 ± 0.17 | 0.248 ± 0.114 | 0.807 ± 0.267 | 0.435 ± 0.177 | 0.559 ± 0.194 | 0.31 ± 0.129 | 0.373 ± 0.145 | 0.621 ± 0.213 | 0.124 ± 0.078 | 0.186 ± 0.113 | 0.0 ± 0.0 |
| Asp | 4.719 ± 0.495 | 0.497 ± 0.168 | 4.533 ± 0.719 | 4.968 ± 0.559 | 2.298 ± 0.373 | 6.148 ± 0.784 | 1.118 ± 0.253 | 4.098 ± 0.527 | 4.285 ± 0.501 | 5.278 ± 0.534 | 2.235 ± 0.424 | 3.353 ± 0.452 | 3.353 ± 0.662 | 1.552 ± 0.308 | 3.353 ± 0.539 | 2.919 ± 0.438 | 3.477 ± 0.31 | 4.719 ± 0.566 | 1.18 ± 0.318 | 2.235 ± 0.323 | 0.0 ± 0.0 |
| Glu | 5.092 ± 0.678 | 0.683 ± 0.201 | 4.285 ± 0.538 | 5.713 ± 0.663 | 2.298 ± 0.377 | 2.608 ± 0.361 | 0.994 ± 0.239 | 4.657 ± 0.515 | 4.595 ± 0.528 | 6.085 ± 0.736 | 2.36 ± 0.349 | 3.85 ± 0.484 | 2.484 ± 0.355 | 2.422 ± 0.379 | 3.912 ± 0.529 | 4.781 ± 0.527 | 5.092 ± 0.583 | 4.906 ± 0.597 | 1.366 ± 0.355 | 2.049 ± 0.451 | 0.0 ± 0.0 |
| Phe | 2.732 ± 0.404 | 0.559 ± 0.196 | 3.415 ± 0.451 | 4.595 ± 0.494 | 1.801 ± 0.325 | 3.602 ± 0.399 | 0.497 ± 0.196 | 2.546 ± 0.359 | 2.546 ± 0.354 | 2.919 ± 0.372 | 0.994 ± 0.284 | 1.739 ± 0.306 | 1.304 ± 0.345 | 1.056 ± 0.258 | 1.801 ± 0.323 | 2.173 ± 0.34 | 2.856 ± 0.523 | 2.484 ± 0.452 | 0.31 ± 0.165 | 1.118 ± 0.195 | 0.0 ± 0.0 |
| Gly | 4.657 ± 0.609 | 0.994 ± 0.303 | 4.719 ± 0.706 | 4.595 ± 0.59 | 3.353 ± 0.493 | 5.216 ± 0.847 | 1.18 ± 0.262 | 4.719 ± 0.498 | 4.781 ± 0.539 | 4.347 ± 0.427 | 1.615 ± 0.355 | 2.298 ± 0.404 | 1.552 ± 0.246 | 3.353 ± 0.44 | 3.788 ± 0.486 | 5.03 ± 0.591 | 3.85 ± 0.489 | 6.085 ± 0.582 | 0.994 ± 0.254 | 2.794 ± 0.353 | 0.0 ± 0.0 |
| His | 0.994 ± 0.269 | 0.062 ± 0.063 | 1.118 ± 0.244 | 0.807 ± 0.169 | 0.559 ± 0.182 | 0.994 ± 0.243 | 0.435 ± 0.165 | 1.056 ± 0.24 | 0.745 ± 0.22 | 1.552 ± 0.282 | 0.435 ± 0.197 | 0.683 ± 0.189 | 1.056 ± 0.248 | 0.559 ± 0.191 | 1.49 ± 0.287 | 0.931 ± 0.233 | 0.994 ± 0.237 | 0.807 ± 0.168 | 0.186 ± 0.114 | 0.621 ± 0.218 | 0.0 ± 0.0 |
| Ile | 4.223 ± 0.463 | 0.497 ± 0.192 | 4.471 ± 0.619 | 5.03 ± 0.758 | 1.987 ± 0.295 | 3.602 ± 0.41 | 1.428 ± 0.29 | 3.415 ± 0.474 | 4.968 ± 0.66 | 4.098 ± 0.523 | 1.18 ± 0.317 | 3.353 ± 0.489 | 2.732 ± 0.427 | 1.615 ± 0.352 | 3.912 ± 0.508 | 4.471 ± 0.563 | 3.912 ± 0.474 | 3.477 ± 0.512 | 0.373 ± 0.141 | 1.801 ± 0.323 | 0.0 ± 0.0 |
| Lys | 5.03 ± 0.709 | 0.435 ± 0.173 | 4.16 ± 0.447 | 4.098 ± 0.402 | 2.546 ± 0.384 | 3.353 ± 0.403 | 1.118 ± 0.247 | 3.043 ± 0.404 | 3.415 ± 0.432 | 5.03 ± 0.595 | 2.608 ± 0.424 | 2.794 ± 0.408 | 3.105 ± 0.556 | 2.235 ± 0.391 | 4.347 ± 0.566 | 4.036 ± 0.541 | 3.664 ± 0.492 | 4.16 ± 0.585 | 1.18 ± 0.346 | 1.987 ± 0.372 | 0.0 ± 0.0 |
| Leu | 6.644 ± 0.74 | 0.931 ± 0.278 | 4.781 ± 0.557 | 4.781 ± 0.588 | 2.235 ± 0.29 | 5.092 ± 0.683 | 0.869 ± 0.227 | 3.974 ± 0.53 | 4.595 ± 0.612 | 4.906 ± 0.549 | 2.422 ± 0.33 | 4.098 ± 0.543 | 3.415 ± 0.49 | 2.111 ± 0.381 | 5.527 ± 0.623 | 5.651 ± 0.668 | 5.216 ± 0.565 | 5.775 ± 0.512 | 0.248 ± 0.113 | 1.863 ± 0.275 | 0.0 ± 0.0 |
| Met | 3.477 ± 0.423 | 0.124 ± 0.081 | 1.118 ± 0.284 | 1.428 ± 0.315 | 1.056 ± 0.243 | 1.428 ± 0.324 | 0.31 ± 0.143 | 2.235 ± 0.35 | 1.49 ± 0.32 | 2.173 ± 0.385 | 0.621 ± 0.198 | 1.615 ± 0.347 | 1.18 ± 0.268 | 0.497 ± 0.205 | 1.739 ± 0.312 | 1.801 ± 0.429 | 1.987 ± 0.347 | 2.422 ± 0.434 | 0.497 ± 0.166 | 0.807 ± 0.211 | 0.0 ± 0.0 |
| Asn | 4.16 ± 0.633 | 0.248 ± 0.122 | 2.794 ± 0.469 | 2.856 ± 0.369 | 1.552 ± 0.261 | 4.285 ± 0.512 | 0.683 ± 0.23 | 2.981 ± 0.488 | 2.856 ± 0.431 | 3.664 ± 0.503 | 0.745 ± 0.232 | 2.36 ± 0.365 | 2.794 ± 0.5 | 1.49 ± 0.324 | 2.546 ± 0.499 | 3.043 ± 0.523 | 3.291 ± 0.472 | 3.043 ± 0.529 | 0.497 ± 0.155 | 1.615 ± 0.353 | 0.0 ± 0.0 |
| Pro | 2.546 ± 0.413 | 0.373 ± 0.196 | 2.919 ± 0.625 | 3.291 ± 0.472 | 1.552 ± 0.296 | 3.726 ± 0.639 | 1.242 ± 0.327 | 3.105 ± 0.469 | 2.794 ± 0.383 | 3.105 ± 0.342 | 1.18 ± 0.257 | 2.36 ± 0.385 | 1.925 ± 0.42 | 1.304 ± 0.509 | 1.925 ± 0.383 | 2.981 ± 0.398 | 2.484 ± 0.38 | 3.229 ± 0.435 | 0.869 ± 0.275 | 1.863 ± 0.325 | 0.0 ± 0.0 |
| Gln | 2.546 ± 0.48 | 0.248 ± 0.112 | 1.677 ± 0.347 | 1.49 ± 0.324 | 1.739 ± 0.344 | 1.49 ± 0.389 | 0.497 ± 0.175 | 2.794 ± 0.401 | 1.801 ± 0.349 | 3.229 ± 0.436 | 0.931 ± 0.236 | 1.552 ± 0.357 | 1.18 ± 0.276 | 1.615 ± 0.415 | 1.677 ± 0.335 | 2.484 ± 0.351 | 1.552 ± 0.289 | 2.919 ± 0.518 | 0.435 ± 0.152 | 0.745 ± 0.206 | 0.0 ± 0.0 |
| Arg | 3.974 ± 0.396 | 0.435 ± 0.184 | 3.602 ± 0.434 | 3.539 ± 0.538 | 2.608 ± 0.459 | 4.595 ± 0.644 | 1.056 ± 0.224 | 3.726 ± 0.457 | 4.968 ± 0.533 | 4.098 ± 0.438 | 1.925 ± 0.324 | 2.484 ± 0.352 | 3.229 ± 0.488 | 2.049 ± 0.405 | 3.602 ± 0.524 | 2.919 ± 0.44 | 2.36 ± 0.329 | 4.223 ± 0.622 | 0.497 ± 0.163 | 1.552 ± 0.321 | 0.0 ± 0.0 |
| Ser | 5.651 ± 0.713 | 0.559 ± 0.16 | 4.719 ± 0.524 | 4.968 ± 0.546 | 3.291 ± 0.443 | 5.03 ± 0.633 | 0.869 ± 0.239 | 3.043 ± 0.486 | 3.353 ± 0.453 | 5.589 ± 0.708 | 2.298 ± 0.367 | 3.229 ± 0.39 | 3.415 ± 0.639 | 1.801 ± 0.292 | 2.981 ± 0.496 | 4.16 ± 0.747 | 3.291 ± 0.529 | 5.154 ± 0.744 | 0.807 ± 0.212 | 2.173 ± 0.454 | 0.0 ± 0.0 |
| Thr | 4.285 ± 0.545 | 0.435 ± 0.157 | 3.85 ± 0.531 | 3.477 ± 0.346 | 2.235 ± 0.392 | 4.657 ± 0.55 | 1.056 ± 0.273 | 3.477 ± 0.439 | 3.353 ± 0.647 | 4.223 ± 0.436 | 1.49 ± 0.296 | 2.36 ± 0.385 | 2.67 ± 0.414 | 1.801 ± 0.324 | 3.353 ± 0.418 | 3.353 ± 0.676 | 3.415 ± 0.517 | 6.21 ± 0.684 | 1.118 ± 0.277 | 2.36 ± 0.38 | 0.0 ± 0.0 |
| Val | 5.527 ± 0.442 | 0.621 ± 0.21 | 5.03 ± 0.499 | 5.216 ± 0.54 | 3.353 ± 0.438 | 5.464 ± 0.599 | 0.869 ± 0.19 | 4.781 ± 0.487 | 3.788 ± 0.42 | 4.409 ± 0.484 | 1.366 ± 0.275 | 3.788 ± 0.491 | 2.981 ± 0.379 | 1.987 ± 0.39 | 4.098 ± 0.513 | 6.396 ± 0.606 | 4.471 ± 0.595 | 5.154 ± 0.626 | 1.18 ± 0.269 | 3.167 ± 0.496 | 0.0 ± 0.0 |
| Trp | 0.931 ± 0.23 | 0.062 ± 0.063 | 1.304 ± 0.272 | 1.18 ± 0.243 | 0.807 ± 0.226 | 0.745 ± 0.196 | 0.0 ± 0.0 | 0.435 ± 0.172 | 0.621 ± 0.194 | 1.056 ± 0.26 | 0.124 ± 0.087 | 0.745 ± 0.218 | 0.745 ± 0.219 | 0.807 ± 0.201 | 0.931 ± 0.259 | 0.807 ± 0.257 | 0.745 ± 0.202 | 1.056 ± 0.283 | 0.186 ± 0.095 | 0.435 ± 0.159 | 0.0 ± 0.0 |
| Tyr | 2.298 ± 0.431 | 0.435 ± 0.187 | 2.546 ± 0.392 | 1.677 ± 0.334 | 1.801 ± 0.322 | 2.36 ± 0.394 | 0.869 ± 0.203 | 1.677 ± 0.312 | 1.428 ± 0.327 | 2.794 ± 0.346 | 0.807 ± 0.236 | 1.304 ± 0.27 | 2.235 ± 0.511 | 1.615 ± 0.407 | 1.677 ± 0.312 | 2.484 ± 0.398 | 1.49 ± 0.306 | 2.049 ± 0.42 | 0.248 ± 0.127 | 1.428 ± 0.311 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 86 proteins (16105 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
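A protein-level bootstrap of the kind mentioned in the note can be sketched as follows. This is an illustration under stated assumptions, not the database's code: the helper names `pair_permille` and `bootstrap_error` are hypothetical, whole proteins are resampled with replacement 100 times, and the reported error is assumed to be the standard deviation of the resampled estimates.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency of one dipeptide in per mille (overlapping pairs, per protein)."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Estimate the spread of a dipeptide frequency by resampling whole proteins.

    Assumption: the error is the sample standard deviation over the replicates.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["MKVLA", "MKKA", "AVLMK"]  # toy sequences, not taken from the proteome
    print(pair_permille(toy, "MK"), "+/-", bootstrap_error(toy, "MK"))
```

Resampling proteins rather than individual residues keeps the dipeptides of one protein together, which is the sense of "at the protein level" assumed here.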

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski