Amino acid dipepetide frequency for Proteus phage PM 116

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.712AlaAla: 2.712 ± 0.629
0.513AlaCys: 0.513 ± 0.226
4.764AlaAsp: 4.764 ± 0.591
5.497AlaGlu: 5.497 ± 0.559
2.858AlaPhe: 2.858 ± 0.54
5.497AlaGly: 5.497 ± 0.98
1.319AlaHis: 1.319 ± 0.303
5.79AlaIle: 5.79 ± 0.719
5.79AlaLys: 5.79 ± 0.852
6.303AlaLeu: 6.303 ± 0.684
3.371AlaMet: 3.371 ± 0.677
2.931AlaAsn: 2.931 ± 0.347
2.712AlaPro: 2.712 ± 0.567
4.397AlaGln: 4.397 ± 0.723
4.617AlaArg: 4.617 ± 0.772
5.643AlaSer: 5.643 ± 0.816
3.664AlaThr: 3.664 ± 0.536
3.957AlaVal: 3.957 ± 0.589
0.733AlaTrp: 0.733 ± 0.208
2.418AlaTyr: 2.418 ± 0.379
0.0AlaXaa: 0.0 ± 0.0
Cys
0.513CysAla: 0.513 ± 0.145
0.147CysCys: 0.147 ± 0.102
0.66CysAsp: 0.66 ± 0.307
0.586CysGlu: 0.586 ± 0.236
0.293CysPhe: 0.293 ± 0.167
0.953CysGly: 0.953 ± 0.38
0.22CysHis: 0.22 ± 0.164
0.44CysIle: 0.44 ± 0.205
0.953CysLys: 0.953 ± 0.317
1.173CysLeu: 1.173 ± 0.354
0.66CysMet: 0.66 ± 0.224
0.806CysAsn: 0.806 ± 0.304
0.44CysPro: 0.44 ± 0.17
0.293CysGln: 0.293 ± 0.13
0.66CysArg: 0.66 ± 0.226
0.293CysSer: 0.293 ± 0.167
0.44CysThr: 0.44 ± 0.306
1.026CysVal: 1.026 ± 0.369
0.147CysTrp: 0.147 ± 0.104
0.586CysTyr: 0.586 ± 0.268
0.0CysXaa: 0.0 ± 0.0
Asp
5.277AspAla: 5.277 ± 0.576
0.366AspCys: 0.366 ± 0.206
3.518AspAsp: 3.518 ± 0.506
4.91AspGlu: 4.91 ± 0.814
2.272AspPhe: 2.272 ± 0.365
4.91AspGly: 4.91 ± 0.71
1.099AspHis: 1.099 ± 0.272
4.251AspIle: 4.251 ± 0.533
5.057AspLys: 5.057 ± 0.556
5.057AspLeu: 5.057 ± 0.615
2.052AspMet: 2.052 ± 0.318
3.591AspAsn: 3.591 ± 0.56
1.686AspPro: 1.686 ± 0.306
0.806AspGln: 0.806 ± 0.274
2.638AspArg: 2.638 ± 0.336
3.884AspSer: 3.884 ± 0.563
3.591AspThr: 3.591 ± 0.511
5.057AspVal: 5.057 ± 0.743
1.099AspTrp: 1.099 ± 0.426
2.272AspTyr: 2.272 ± 0.42
0.0AspXaa: 0.0 ± 0.0
Glu
6.303GluAla: 6.303 ± 0.762
1.246GluCys: 1.246 ± 0.464
4.764GluAsp: 4.764 ± 0.412
6.523GluGlu: 6.523 ± 0.906
2.418GluPhe: 2.418 ± 0.431
4.837GluGly: 4.837 ± 0.539
1.686GluHis: 1.686 ± 0.393
3.957GluIle: 3.957 ± 0.451
4.69GluLys: 4.69 ± 0.61
6.596GluLeu: 6.596 ± 0.868
2.272GluMet: 2.272 ± 0.485
2.565GluAsn: 2.565 ± 0.438
1.759GluPro: 1.759 ± 0.328
3.957GluGln: 3.957 ± 0.552
3.664GluArg: 3.664 ± 0.611
2.712GluSer: 2.712 ± 0.484
2.492GluThr: 2.492 ± 0.388
6.01GluVal: 6.01 ± 0.676
1.173GluTrp: 1.173 ± 0.268
1.979GluTyr: 1.979 ± 0.395
0.0GluXaa: 0.0 ± 0.0
Phe
2.272PheAla: 2.272 ± 0.409
0.44PheCys: 0.44 ± 0.178
3.371PheAsp: 3.371 ± 0.578
2.418PheGlu: 2.418 ± 0.398
1.612PhePhe: 1.612 ± 0.34
3.078PheGly: 3.078 ± 0.555
0.66PheHis: 0.66 ± 0.209
2.638PheIle: 2.638 ± 0.259
3.371PheLys: 3.371 ± 0.487
3.591PheLeu: 3.591 ± 0.473
1.466PheMet: 1.466 ± 0.249
2.565PheAsn: 2.565 ± 0.432
1.246PhePro: 1.246 ± 0.298
1.246PheGln: 1.246 ± 0.341
1.539PheArg: 1.539 ± 0.285
1.905PheSer: 1.905 ± 0.285
2.125PheThr: 2.125 ± 0.36
1.686PheVal: 1.686 ± 0.344
0.073PheTrp: 0.073 ± 0.07
1.026PheTyr: 1.026 ± 0.261
0.0PheXaa: 0.0 ± 0.0
Gly
5.57GlyAla: 5.57 ± 0.693
0.953GlyCys: 0.953 ± 0.339
6.449GlyAsp: 6.449 ± 0.81
4.397GlyGlu: 4.397 ± 0.562
3.884GlyPhe: 3.884 ± 0.657
5.277GlyGly: 5.277 ± 0.811
1.539GlyHis: 1.539 ± 0.303
5.057GlyIle: 5.057 ± 0.93
6.303GlyLys: 6.303 ± 0.783
5.57GlyLeu: 5.57 ± 0.759
1.832GlyMet: 1.832 ± 0.355
3.664GlyAsn: 3.664 ± 0.703
0.0GlyPro: 0.0 ± 0.0
2.565GlyGln: 2.565 ± 0.391
3.078GlyArg: 3.078 ± 0.513
4.471GlySer: 4.471 ± 0.558
5.13GlyThr: 5.13 ± 0.566
4.69GlyVal: 4.69 ± 0.747
1.392GlyTrp: 1.392 ± 0.315
3.444GlyTyr: 3.444 ± 0.437
0.0GlyXaa: 0.0 ± 0.0
His
0.806HisAla: 0.806 ± 0.209
0.147HisCys: 0.147 ± 0.102
1.319HisAsp: 1.319 ± 0.338
1.173HisGlu: 1.173 ± 0.279
1.099HisPhe: 1.099 ± 0.17
2.052HisGly: 2.052 ± 0.509
0.44HisHis: 0.44 ± 0.176
0.953HisIle: 0.953 ± 0.225
1.539HisLys: 1.539 ± 0.452
2.712HisLeu: 2.712 ± 0.49
0.806HisMet: 0.806 ± 0.383
1.099HisAsn: 1.099 ± 0.37
0.366HisPro: 0.366 ± 0.144
0.733HisGln: 0.733 ± 0.233
1.026HisArg: 1.026 ± 0.375
0.953HisSer: 0.953 ± 0.302
0.879HisThr: 0.879 ± 0.255
0.879HisVal: 0.879 ± 0.248
0.147HisTrp: 0.147 ± 0.077
1.173HisTyr: 1.173 ± 0.367
0.0HisXaa: 0.0 ± 0.0
Ile
4.471IleAla: 4.471 ± 0.531
0.66IleCys: 0.66 ± 0.219
3.884IleAsp: 3.884 ± 0.483
4.324IleGlu: 4.324 ± 0.619
1.832IlePhe: 1.832 ± 0.432
4.764IleGly: 4.764 ± 0.542
1.099IleHis: 1.099 ± 0.251
4.324IleIle: 4.324 ± 0.696
4.984IleLys: 4.984 ± 0.641
3.957IleLeu: 3.957 ± 0.628
2.125IleMet: 2.125 ± 0.496
3.371IleAsn: 3.371 ± 0.427
2.565IlePro: 2.565 ± 0.447
2.345IleGln: 2.345 ± 0.451
3.298IleArg: 3.298 ± 0.504
3.664IleSer: 3.664 ± 0.538
3.664IleThr: 3.664 ± 0.935
3.738IleVal: 3.738 ± 0.625
0.513IleTrp: 0.513 ± 0.23
1.686IleTyr: 1.686 ± 0.435
0.0IleXaa: 0.0 ± 0.0
Lys
6.816LysAla: 6.816 ± 1.009
1.026LysCys: 1.026 ± 0.289
3.957LysAsp: 3.957 ± 0.665
5.79LysGlu: 5.79 ± 0.629
1.246LysPhe: 1.246 ± 0.353
5.423LysGly: 5.423 ± 0.704
1.612LysHis: 1.612 ± 0.429
3.518LysIle: 3.518 ± 0.504
3.811LysLys: 3.811 ± 0.669
5.863LysLeu: 5.863 ± 0.717
1.392LysMet: 1.392 ± 0.332
2.199LysAsn: 2.199 ± 0.425
3.298LysPro: 3.298 ± 0.355
3.005LysGln: 3.005 ± 0.443
3.957LysArg: 3.957 ± 0.458
4.397LysSer: 4.397 ± 0.483
2.858LysThr: 2.858 ± 0.452
5.79LysVal: 5.79 ± 0.821
1.099LysTrp: 1.099 ± 0.282
2.272LysTyr: 2.272 ± 0.369
0.0LysXaa: 0.0 ± 0.0
Leu
6.523LeuAla: 6.523 ± 0.881
0.733LeuCys: 0.733 ± 0.263
5.277LeuAsp: 5.277 ± 0.565
5.79LeuGlu: 5.79 ± 0.622
2.858LeuPhe: 2.858 ± 0.613
5.423LeuGly: 5.423 ± 0.639
1.173LeuHis: 1.173 ± 0.332
3.957LeuIle: 3.957 ± 0.637
5.79LeuLys: 5.79 ± 0.768
6.669LeuLeu: 6.669 ± 0.622
1.686LeuMet: 1.686 ± 0.327
4.177LeuAsn: 4.177 ± 0.473
3.371LeuPro: 3.371 ± 0.471
4.837LeuGln: 4.837 ± 0.861
4.177LeuArg: 4.177 ± 0.629
5.643LeuSer: 5.643 ± 0.776
4.471LeuThr: 4.471 ± 0.457
4.69LeuVal: 4.69 ± 0.527
0.879LeuTrp: 0.879 ± 0.223
2.931LeuTyr: 2.931 ± 0.474
0.0LeuXaa: 0.0 ± 0.0
Met
3.005MetAla: 3.005 ± 0.432
0.366MetCys: 0.366 ± 0.179
1.099MetAsp: 1.099 ± 0.301
1.905MetGlu: 1.905 ± 0.389
1.539MetPhe: 1.539 ± 0.373
2.199MetGly: 2.199 ± 0.437
0.586MetHis: 0.586 ± 0.173
1.686MetIle: 1.686 ± 0.427
2.492MetLys: 2.492 ± 0.43
3.005MetLeu: 3.005 ± 0.46
0.66MetMet: 0.66 ± 0.237
1.099MetAsn: 1.099 ± 0.284
0.953MetPro: 0.953 ± 0.206
1.466MetGln: 1.466 ± 0.359
1.832MetArg: 1.832 ± 0.459
2.858MetSer: 2.858 ± 0.461
1.905MetThr: 1.905 ± 0.29
1.612MetVal: 1.612 ± 0.307
0.366MetTrp: 0.366 ± 0.226
1.392MetTyr: 1.392 ± 0.357
0.0MetXaa: 0.0 ± 0.0
Asn
2.712AsnAla: 2.712 ± 0.318
0.879AsnCys: 0.879 ± 0.267
1.905AsnAsp: 1.905 ± 0.463
2.712AsnGlu: 2.712 ± 0.578
1.686AsnPhe: 1.686 ± 0.368
3.591AsnGly: 3.591 ± 0.582
1.246AsnHis: 1.246 ± 0.332
3.078AsnIle: 3.078 ± 0.7
3.298AsnLys: 3.298 ± 0.435
3.884AsnLeu: 3.884 ± 0.356
1.686AsnMet: 1.686 ± 0.374
2.785AsnAsn: 2.785 ± 0.541
1.832AsnPro: 1.832 ± 0.386
2.345AsnGln: 2.345 ± 0.428
3.005AsnArg: 3.005 ± 0.411
1.979AsnSer: 1.979 ± 0.41
2.931AsnThr: 2.931 ± 0.432
3.298AsnVal: 3.298 ± 0.46
0.513AsnTrp: 0.513 ± 0.198
1.466AsnTyr: 1.466 ± 0.299
0.0AsnXaa: 0.0 ± 0.0
Pro
3.151ProAla: 3.151 ± 0.471
0.44ProCys: 0.44 ± 0.16
2.785ProAsp: 2.785 ± 0.503
3.151ProGlu: 3.151 ± 0.482
1.246ProPhe: 1.246 ± 0.281
0.293ProGly: 0.293 ± 0.143
0.733ProHis: 0.733 ± 0.23
1.246ProIle: 1.246 ± 0.29
2.199ProLys: 2.199 ± 0.356
1.979ProLeu: 1.979 ± 0.341
0.879ProMet: 0.879 ± 0.195
1.466ProAsn: 1.466 ± 0.364
0.733ProPro: 0.733 ± 0.205
1.173ProGln: 1.173 ± 0.249
1.319ProArg: 1.319 ± 0.26
1.832ProSer: 1.832 ± 0.356
2.418ProThr: 2.418 ± 0.372
2.418ProVal: 2.418 ± 0.499
0.66ProTrp: 0.66 ± 0.225
1.539ProTyr: 1.539 ± 0.341
0.0ProXaa: 0.0 ± 0.0
Gln
3.884GlnAla: 3.884 ± 0.574
0.366GlnCys: 0.366 ± 0.166
2.638GlnAsp: 2.638 ± 0.359
3.225GlnGlu: 3.225 ± 0.486
1.979GlnPhe: 1.979 ± 0.349
4.251GlnGly: 4.251 ± 0.582
0.806GlnHis: 0.806 ± 0.236
2.125GlnIle: 2.125 ± 0.297
1.392GlnLys: 1.392 ± 0.324
3.591GlnLeu: 3.591 ± 0.536
1.905GlnMet: 1.905 ± 0.406
1.246GlnAsn: 1.246 ± 0.226
1.392GlnPro: 1.392 ± 0.289
2.125GlnGln: 2.125 ± 0.584
2.272GlnArg: 2.272 ± 0.465
2.125GlnSer: 2.125 ± 0.482
2.565GlnThr: 2.565 ± 0.407
2.345GlnVal: 2.345 ± 0.316
0.733GlnTrp: 0.733 ± 0.205
1.612GlnTyr: 1.612 ± 0.333
0.0GlnXaa: 0.0 ± 0.0
Arg
4.617ArgAla: 4.617 ± 0.931
0.66ArgCys: 0.66 ± 0.283
3.371ArgAsp: 3.371 ± 0.51
3.884ArgGlu: 3.884 ± 0.466
1.979ArgPhe: 1.979 ± 0.298
4.471ArgGly: 4.471 ± 0.744
1.466ArgHis: 1.466 ± 0.346
3.444ArgIle: 3.444 ± 0.639
2.345ArgLys: 2.345 ± 0.445
3.738ArgLeu: 3.738 ± 0.699
1.979ArgMet: 1.979 ± 0.386
2.199ArgAsn: 2.199 ± 0.379
1.099ArgPro: 1.099 ± 0.296
1.539ArgGln: 1.539 ± 0.278
3.151ArgArg: 3.151 ± 0.471
2.931ArgSer: 2.931 ± 0.53
2.199ArgThr: 2.199 ± 0.475
3.738ArgVal: 3.738 ± 0.531
0.66ArgTrp: 0.66 ± 0.207
2.125ArgTyr: 2.125 ± 0.392
0.0ArgXaa: 0.0 ± 0.0
Ser
4.397SerAla: 4.397 ± 0.737
0.586SerCys: 0.586 ± 0.209
3.884SerAsp: 3.884 ± 0.44
3.444SerGlu: 3.444 ± 0.482
2.418SerPhe: 2.418 ± 0.343
4.984SerGly: 4.984 ± 0.448
0.879SerHis: 0.879 ± 0.241
4.617SerIle: 4.617 ± 0.505
3.738SerLys: 3.738 ± 0.59
5.13SerLeu: 5.13 ± 0.663
1.832SerMet: 1.832 ± 0.289
3.078SerAsn: 3.078 ± 0.467
1.612SerPro: 1.612 ± 0.341
2.418SerGln: 2.418 ± 0.466
2.785SerArg: 2.785 ± 0.514
3.078SerSer: 3.078 ± 0.661
2.858SerThr: 2.858 ± 0.538
4.177SerVal: 4.177 ± 0.517
1.026SerTrp: 1.026 ± 0.232
1.979SerTyr: 1.979 ± 0.379
0.0SerXaa: 0.0 ± 0.0
Thr
3.591ThrAla: 3.591 ± 0.423
0.44ThrCys: 0.44 ± 0.176
2.858ThrAsp: 2.858 ± 0.529
3.738ThrGlu: 3.738 ± 0.511
2.712ThrPhe: 2.712 ± 0.493
4.031ThrGly: 4.031 ± 0.526
1.392ThrHis: 1.392 ± 0.294
3.444ThrIle: 3.444 ± 0.422
3.884ThrLys: 3.884 ± 0.617
5.203ThrLeu: 5.203 ± 0.591
1.832ThrMet: 1.832 ± 0.369
2.345ThrAsn: 2.345 ± 0.535
1.979ThrPro: 1.979 ± 0.305
2.638ThrGln: 2.638 ± 0.39
2.565ThrArg: 2.565 ± 0.355
3.225ThrSer: 3.225 ± 0.542
3.664ThrThr: 3.664 ± 0.724
2.712ThrVal: 2.712 ± 0.59
0.806ThrTrp: 0.806 ± 0.353
1.612ThrTyr: 1.612 ± 0.346
0.0ThrXaa: 0.0 ± 0.0
Val
5.35ValAla: 5.35 ± 0.524
0.586ValCys: 0.586 ± 0.193
4.251ValAsp: 4.251 ± 0.556
4.91ValGlu: 4.91 ± 0.599
2.345ValPhe: 2.345 ± 0.456
4.984ValGly: 4.984 ± 0.611
1.173ValHis: 1.173 ± 0.362
4.251ValIle: 4.251 ± 0.725
4.471ValLys: 4.471 ± 0.539
3.298ValLeu: 3.298 ± 0.604
2.199ValMet: 2.199 ± 0.322
3.298ValAsn: 3.298 ± 0.623
2.712ValPro: 2.712 ± 0.349
2.638ValGln: 2.638 ± 0.485
4.031ValArg: 4.031 ± 0.68
3.957ValSer: 3.957 ± 0.676
3.811ValThr: 3.811 ± 0.706
4.91ValVal: 4.91 ± 0.761
0.806ValTrp: 0.806 ± 0.261
2.418ValTyr: 2.418 ± 0.448
0.0ValXaa: 0.0 ± 0.0
Trp
0.733TrpAla: 0.733 ± 0.195
0.22TrpCys: 0.22 ± 0.122
0.513TrpAsp: 0.513 ± 0.18
1.246TrpGlu: 1.246 ± 0.347
0.586TrpPhe: 0.586 ± 0.243
1.099TrpGly: 1.099 ± 0.266
0.513TrpHis: 0.513 ± 0.21
0.44TrpIle: 0.44 ± 0.138
1.246TrpLys: 1.246 ± 0.269
1.246TrpLeu: 1.246 ± 0.313
0.22TrpMet: 0.22 ± 0.12
0.513TrpAsn: 0.513 ± 0.218
0.879TrpPro: 0.879 ± 0.278
0.293TrpGln: 0.293 ± 0.138
0.586TrpArg: 0.586 ± 0.222
1.173TrpSer: 1.173 ± 0.406
0.513TrpThr: 0.513 ± 0.178
1.099TrpVal: 1.099 ± 0.296
0.147TrpTrp: 0.147 ± 0.105
0.073TrpTyr: 0.073 ± 0.069
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.785TyrAla: 2.785 ± 0.385
0.513TyrCys: 0.513 ± 0.196
2.125TyrAsp: 2.125 ± 0.304
1.979TyrGlu: 1.979 ± 0.425
1.466TyrPhe: 1.466 ± 0.355
3.151TyrGly: 3.151 ± 0.473
0.66TyrHis: 0.66 ± 0.233
2.052TyrIle: 2.052 ± 0.322
1.979TyrLys: 1.979 ± 0.39
2.565TyrLeu: 2.565 ± 0.405
1.099TyrMet: 1.099 ± 0.298
1.832TyrAsn: 1.832 ± 0.31
1.026TyrPro: 1.026 ± 0.32
1.759TyrGln: 1.759 ± 0.355
1.392TyrArg: 1.392 ± 0.365
2.199TyrSer: 2.199 ± 0.457
2.492TyrThr: 2.492 ± 0.34
2.565TyrVal: 2.565 ± 0.412
0.366TyrTrp: 0.366 ± 0.203
0.66TyrTyr: 0.66 ± 0.206
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 50 proteins (13646 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski