Amino acid dipeptide frequency for Oenococcus phage phi9805

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
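As a rough illustration of what a dipeptide frequency is, the sketch below counts overlapping residue pairs in a list of protein sequences and reports each pair in per mille. The function name, the toy sequences, and the choice to normalise by the total number of pairs are assumptions made for this example; the page does not state the exact normalisation used by the database, and it labels residues with three-letter codes rather than the one-letter codes used here.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs (0,1), (1,2), ...; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Made-up sequences for illustration only, not taken from the phage proteome.
toy = ["MKKLA", "AKKS"]
freqs = dipeptide_frequencies(toy)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

Counting within each protein separately keeps the last residue of one protein from being paired with the first residue of the next.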

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
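The expected value quoted above follows from simple arithmetic, sketched here together with a comparison against it. The function names are hypothetical, and treating the uniform expectation as the exact threshold for the red and blue marking is an assumption; the page does not spell out the rule it applies.

```python
N_STANDARD = 20  # standard amino acids; use 21 to include Xaa


def uniform_expectation_permille(alphabet_size=N_STANDARD):
    """Expected per-mille frequency of each dipeptide if all were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_permille, expected_permille=None):
    """Label a dipeptide as over- or underrepresented relative to the expectation."""
    if expected_permille is None:
        expected_permille = uniform_expectation_permille()
    if observed_permille > expected_permille:
        return "over"
    if observed_permille < expected_permille:
        return "under"
    return "as expected"


print(uniform_expectation_permille())  # 1000 / 400
print(classify(5.651))  # AlaAla value from the table
print(classify(0.0))    # CysCys value from the table
```

With 21 symbols the same calculation gives 1000 / 441, i.e. roughly 2.27‰ per dipeptide.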

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
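The unit conversion can be written out as a minimal sketch; the helper names are made up for this example.

```python
def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per-mille value to percent (divide by 10)."""
    return value_permille / 10.0


ala_ala = 5.651  # AlaAla value from the table, in per mille
print(permille_to_fraction(ala_ala))  # about 0.005651
print(permille_to_percent(ala_ala))   # about 0.5651 (%)
```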

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide and columns the second; each cell is the frequency ± error in ‰.

| | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 5.651 ± 1.203 | 0.286 ± 0.183 | 3.934 ± 0.593 | 3.004 ± 0.568 | 2.718 ± 0.333 | 4.506 ± 0.826 | 0.501 ± 0.222 | 5.722 ± 0.636 | 6.008 ± 0.744 | 6.151 ± 0.767 | 1.931 ± 0.559 | 4.649 ± 0.581 | 2.074 ± 0.381 | 2.432 ± 0.425 | 2.36 ± 0.449 | 5.937 ± 0.753 | 4.721 ± 0.661 | 4.005 ± 0.531 | 1.073 ± 0.528 | 3.004 ± 0.586 | 0.0 ± 0.0 |
| Cys | 0.143 ± 0.098 | 0.0 ± 0.0 | 0.429 ± 0.24 | 0.143 ± 0.11 | 0.358 ± 0.208 | 0.072 ± 0.08 | 0.072 ± 0.08 | 0.286 ± 0.159 | 0.286 ± 0.151 | 0.787 ± 0.359 | 0.072 ± 0.057 | 0.358 ± 0.182 | 0.072 ± 0.068 | 0.143 ± 0.089 | 0.286 ± 0.199 | 0.286 ± 0.127 | 0.072 ± 0.073 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.072 ± 0.057 | 0.0 ± 0.0 |
| Asp | 3.004 ± 0.474 | 0.072 ± 0.08 | 3.648 ± 0.658 | 3.648 ± 0.616 | 3.076 ± 0.413 | 5.865 ± 0.872 | 0.787 ± 0.234 | 5.651 ± 0.669 | 4.435 ± 0.817 | 6.151 ± 0.596 | 1.359 ± 0.341 | 3.076 ± 0.519 | 2.36 ± 0.48 | 2.79 ± 0.478 | 2.432 ± 0.416 | 5.293 ± 0.553 | 3.791 ± 0.733 | 3.719 ± 0.508 | 1.001 ± 0.274 | 3.862 ± 0.639 | 0.0 ± 0.0 |
| Glu | 3.719 ± 0.411 | 0.143 ± 0.145 | 2.718 ± 0.442 | 2.933 ± 0.72 | 1.287 ± 0.329 | 2.646 ± 0.478 | 0.644 ± 0.206 | 4.077 ± 0.656 | 5.364 ± 0.889 | 5.15 ± 0.664 | 1.502 ± 0.326 | 3.29 ± 0.567 | 1.001 ± 0.359 | 2.503 ± 0.517 | 1.502 ± 0.302 | 3.719 ± 0.486 | 2.503 ± 0.51 | 2.146 ± 0.367 | 0.501 ± 0.195 | 2.217 ± 0.438 | 0.0 ± 0.0 |
| Phe | 3.219 ± 0.424 | 0.358 ± 0.174 | 3.076 ± 0.489 | 1.359 ± 0.333 | 3.147 ± 0.665 | 2.79 ± 0.454 | 0.858 ± 0.275 | 2.718 ± 0.602 | 2.79 ± 0.511 | 4.292 ± 0.972 | 0.93 ± 0.253 | 2.79 ± 0.457 | 0.858 ± 0.251 | 1.001 ± 0.202 | 1.431 ± 0.415 | 5.293 ± 0.856 | 2.718 ± 0.383 | 2.074 ± 0.444 | 0.501 ± 0.202 | 1.717 ± 0.356 | 0.0 ± 0.0 |
| Gly | 2.79 ± 0.597 | 0.215 ± 0.134 | 3.862 ± 0.592 | 3.648 ± 0.49 | 2.432 ± 0.387 | 3.505 ± 1.01 | 1.216 ± 0.227 | 5.293 ± 0.554 | 5.651 ± 0.664 | 5.15 ± 0.834 | 1.502 ± 0.411 | 4.435 ± 0.846 | 1.073 ± 0.288 | 3.219 ± 0.431 | 1.86 ± 0.303 | 5.865 ± 1.056 | 5.15 ± 1.526 | 3.29 ± 0.455 | 0.572 ± 0.158 | 2.003 ± 0.411 | 0.0 ± 0.0 |
| His | 1.001 ± 0.356 | 0.072 ± 0.078 | 1.287 ± 0.282 | 0.286 ± 0.161 | 1.431 ± 0.316 | 0.858 ± 0.334 | 0.072 ± 0.075 | 1.287 ± 0.303 | 1.001 ± 0.318 | 1.216 ± 0.311 | 0.215 ± 0.184 | 1.216 ± 0.253 | 0.358 ± 0.193 | 0.501 ± 0.164 | 0.501 ± 0.164 | 0.858 ± 0.297 | 0.644 ± 0.205 | 0.715 ± 0.226 | 0.501 ± 0.175 | 0.429 ± 0.142 | 0.0 ± 0.0 |
| Ile | 5.651 ± 0.731 | 0.715 ± 0.284 | 5.937 ± 0.745 | 4.292 ± 0.794 | 3.219 ± 1.084 | 3.648 ± 0.503 | 1.359 ± 0.316 | 5.15 ± 1.077 | 6.938 ± 0.725 | 4.22 ± 0.563 | 1.431 ± 0.356 | 4.649 ± 0.693 | 2.718 ± 0.449 | 3.219 ± 0.441 | 2.79 ± 0.42 | 6.294 ± 0.74 | 5.507 ± 0.773 | 4.721 ± 0.699 | 1.001 ± 0.246 | 3.004 ± 0.429 | 0.0 ± 0.0 |
| Lys | 7.081 ± 0.712 | 0.358 ± 0.18 | 5.651 ± 0.887 | 4.578 ± 0.907 | 2.36 ± 0.467 | 4.292 ± 0.603 | 1.359 ± 0.444 | 6.223 ± 0.888 | 6.437 ± 1.07 | 6.223 ± 0.442 | 2.146 ± 0.433 | 5.293 ± 0.717 | 1.86 ± 0.403 | 4.005 ± 0.786 | 3.076 ± 0.63 | 6.151 ± 0.976 | 5.507 ± 0.639 | 4.005 ± 0.468 | 1.144 ± 0.303 | 3.219 ± 0.663 | 0.0 ± 0.0 |
| Leu | 6.652 ± 0.798 | 0.358 ± 0.167 | 4.864 ± 0.652 | 4.506 ± 0.743 | 4.005 ± 0.729 | 4.721 ± 0.583 | 0.787 ± 0.218 | 7.01 ± 0.934 | 6.938 ± 0.87 | 7.224 ± 0.916 | 2.003 ± 0.359 | 6.294 ± 0.717 | 3.362 ± 0.388 | 3.219 ± 0.481 | 2.646 ± 0.403 | 7.939 ± 0.938 | 5.507 ± 0.555 | 4.435 ± 0.599 | 1.073 ± 0.364 | 2.003 ± 0.38 | 0.0 ± 0.0 |
| Met | 1.431 ± 0.411 | 0.072 ± 0.057 | 1.001 ± 0.242 | 1.001 ± 0.236 | 0.715 ± 0.251 | 1.287 ± 0.349 | 0.429 ± 0.166 | 1.574 ± 0.406 | 1.788 ± 0.38 | 1.931 ± 0.287 | 0.358 ± 0.134 | 2.074 ± 0.506 | 1.073 ± 0.276 | 1.144 ± 0.241 | 0.572 ± 0.226 | 1.502 ± 0.33 | 2.289 ± 0.407 | 1.144 ± 0.27 | 0.072 ± 0.061 | 0.358 ± 0.16 | 0.0 ± 0.0 |
| Asn | 3.362 ± 0.479 | 0.215 ± 0.111 | 5.436 ± 0.619 | 2.79 ± 0.472 | 2.503 ± 0.561 | 4.721 ± 0.797 | 1.287 ± 0.369 | 4.148 ± 0.589 | 5.007 ± 0.751 | 5.794 ± 0.583 | 1.216 ± 0.286 | 4.148 ± 0.586 | 1.645 ± 0.381 | 3.004 ± 0.594 | 2.36 ± 0.431 | 4.363 ± 0.777 | 2.933 ± 0.479 | 3.505 ± 0.455 | 0.93 ± 0.27 | 3.29 ± 0.695 | 0.0 ± 0.0 |
| Pro | 2.289 ± 0.553 | 0.0 ± 0.0 | 2.074 ± 0.38 | 1.931 ± 0.432 | 1.144 ± 0.389 | 1.216 ± 0.339 | 0.143 ± 0.082 | 1.86 ± 0.39 | 2.575 ± 0.52 | 2.432 ± 0.462 | 0.572 ± 0.166 | 1.86 ± 0.424 | 0.429 ± 0.19 | 0.787 ± 0.236 | 0.787 ± 0.3 | 2.146 ± 0.37 | 1.717 ± 0.528 | 2.289 ± 0.389 | 0.215 ± 0.131 | 1.216 ± 0.277 | 0.0 ± 0.0 |
| Gln | 3.934 ± 0.612 | 0.072 ± 0.057 | 2.646 ± 0.496 | 2.146 ± 0.487 | 1.86 ± 0.324 | 3.076 ± 0.446 | 0.787 ± 0.281 | 3.219 ± 0.545 | 3.29 ± 0.564 | 2.861 ± 0.626 | 1.001 ± 0.315 | 1.788 ± 0.354 | 0.858 ± 0.224 | 2.074 ± 0.324 | 1.931 ± 0.484 | 3.719 ± 0.51 | 3.29 ± 0.564 | 2.933 ± 0.458 | 0.215 ± 0.144 | 1.073 ± 0.289 | 0.0 ± 0.0 |
| Arg | 2.575 ± 0.475 | 0.143 ± 0.106 | 1.788 ± 0.406 | 1.931 ± 0.439 | 1.788 ± 0.322 | 1.359 ± 0.342 | 0.429 ± 0.179 | 2.146 ± 0.401 | 2.646 ± 0.57 | 3.576 ± 0.647 | 0.787 ± 0.239 | 2.217 ± 0.43 | 0.501 ± 0.173 | 1.717 ± 0.302 | 1.144 ± 0.343 | 1.86 ± 0.463 | 2.432 ± 0.615 | 2.074 ± 0.377 | 0.358 ± 0.167 | 1.788 ± 0.354 | 0.0 ± 0.0 |
| Ser | 5.579 ± 0.743 | 0.429 ± 0.172 | 5.078 ± 0.767 | 3.505 ± 0.593 | 4.005 ± 0.497 | 6.938 ± 0.891 | 1.359 ± 0.333 | 5.722 ± 0.73 | 6.294 ± 0.855 | 7.296 ± 0.931 | 2.217 ± 0.472 | 4.792 ± 0.663 | 1.86 ± 0.451 | 3.29 ± 0.479 | 1.717 ± 0.368 | 8.369 ± 1.367 | 5.722 ± 1.268 | 4.435 ± 0.613 | 1.431 ± 0.351 | 3.362 ± 0.755 | 0.0 ± 0.0 |
| Thr | 5.794 ± 0.895 | 0.072 ± 0.068 | 5.507 ± 0.794 | 2.79 ± 0.515 | 2.503 ± 0.599 | 5.221 ± 1.078 | 0.93 ± 0.276 | 6.223 ± 0.716 | 4.792 ± 0.642 | 3.934 ± 0.481 | 1.073 ± 0.257 | 2.933 ± 0.534 | 1.86 ± 0.409 | 1.574 ± 0.289 | 1.717 ± 0.372 | 5.221 ± 1.367 | 3.076 ± 0.885 | 5.15 ± 0.643 | 0.572 ± 0.262 | 2.933 ± 0.754 | 0.0 ± 0.0 |
| Val | 3.505 ± 0.681 | 0.143 ± 0.093 | 4.578 ± 0.565 | 2.79 ± 0.52 | 2.003 ± 0.462 | 3.076 ± 0.508 | 0.501 ± 0.155 | 4.506 ± 0.539 | 5.078 ± 0.778 | 5.794 ± 1.102 | 0.644 ± 0.27 | 3.934 ± 0.496 | 1.788 ± 0.364 | 3.29 ± 0.449 | 1.574 ± 0.3 | 4.578 ± 0.659 | 2.861 ± 0.504 | 3.505 ± 0.636 | 1.073 ± 0.254 | 2.217 ± 0.438 | 0.0 ± 0.0 |
| Trp | 0.715 ± 0.229 | 0.072 ± 0.08 | 0.572 ± 0.247 | 0.715 ± 0.255 | 1.144 ± 0.316 | 0.715 ± 0.279 | 0.286 ± 0.125 | 1.144 ± 0.412 | 0.93 ± 0.267 | 0.715 ± 0.291 | 0.215 ± 0.107 | 1.001 ± 0.604 | 0.143 ± 0.101 | 0.787 ± 0.203 | 0.286 ± 0.12 | 1.073 ± 0.291 | 0.93 ± 0.29 | 0.93 ± 0.244 | 0.143 ± 0.103 | 0.358 ± 0.148 | 0.0 ± 0.0 |
| Tyr | 2.575 ± 0.438 | 0.143 ± 0.1 | 1.931 ± 0.367 | 1.645 ± 0.337 | 2.217 ± 0.542 | 2.217 ± 0.339 | 0.644 ± 0.227 | 2.503 ± 0.432 | 2.718 ± 0.448 | 4.649 ± 0.535 | 0.572 ± 0.223 | 1.86 ± 0.383 | 1.788 ± 0.37 | 2.074 ± 0.352 | 2.289 ± 0.523 | 2.933 ± 0.472 | 2.503 ± 0.624 | 2.36 ± 0.454 | 0.501 ± 0.197 | 1.86 ± 0.52 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 55 proteins (13982 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
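A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the resulting estimates is reported. Taking the error to be the standard deviation of the estimates is an assumption, as are the function names, the fixed seed, and the toy sequences; the page states only that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Per-mille frequency of one dipeptide among all overlapping pairs."""
    total = 0
    hits = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Made-up sequences for illustration only, not taken from the phage proteome.
toy = ["MKKLA", "AKKS", "MSTLL", "GKKKA"]
print(pair_permille(toy, "KK"), bootstrap_error(toy, "KK"))
```

A dipeptide that never occurs has the same estimate of 0 in every resample, so its error is 0 as well, which is consistent with the 0.0 ± 0.0 entries in the table.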

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in UniProt.
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski