Amino acid dipepetide frequency for Entercoccus phage phiM1EF2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.519AlaAla: 6.519 ± 1.357
0.362AlaCys: 0.362 ± 0.173
4.467AlaAsp: 4.467 ± 0.54
6.278AlaGlu: 6.278 ± 0.664
2.777AlaPhe: 2.777 ± 0.412
4.346AlaGly: 4.346 ± 0.742
0.724AlaHis: 0.724 ± 0.233
5.614AlaIle: 5.614 ± 0.542
4.588AlaLys: 4.588 ± 0.597
6.097AlaLeu: 6.097 ± 0.642
2.475AlaMet: 2.475 ± 0.43
3.743AlaAsn: 3.743 ± 0.586
1.932AlaPro: 1.932 ± 0.35
2.113AlaGln: 2.113 ± 0.41
3.682AlaArg: 3.682 ± 0.467
4.407AlaSer: 4.407 ± 0.459
3.863AlaThr: 3.863 ± 0.499
4.588AlaVal: 4.588 ± 0.61
0.483AlaTrp: 0.483 ± 0.186
3.441AlaTyr: 3.441 ± 0.488
0.0AlaXaa: 0.0 ± 0.0
Cys
0.241CysAla: 0.241 ± 0.124
0.241CysCys: 0.241 ± 0.162
0.423CysAsp: 0.423 ± 0.14
0.664CysGlu: 0.664 ± 0.234
0.423CysPhe: 0.423 ± 0.144
0.785CysGly: 0.785 ± 0.294
0.06CysHis: 0.06 ± 0.057
0.302CysIle: 0.302 ± 0.155
0.302CysLys: 0.302 ± 0.215
0.664CysLeu: 0.664 ± 0.178
0.181CysMet: 0.181 ± 0.099
0.181CysAsn: 0.181 ± 0.098
0.06CysPro: 0.06 ± 0.053
0.362CysGln: 0.362 ± 0.141
0.181CysArg: 0.181 ± 0.097
0.423CysSer: 0.423 ± 0.139
0.483CysThr: 0.483 ± 0.228
0.362CysVal: 0.362 ± 0.148
0.121CysTrp: 0.121 ± 0.089
0.181CysTyr: 0.181 ± 0.089
0.0CysXaa: 0.0 ± 0.0
Asp
4.286AspAla: 4.286 ± 0.538
0.543AspCys: 0.543 ± 0.206
2.837AspAsp: 2.837 ± 0.381
4.588AspGlu: 4.588 ± 0.539
3.803AspPhe: 3.803 ± 0.394
3.743AspGly: 3.743 ± 0.558
0.543AspHis: 0.543 ± 0.191
4.226AspIle: 4.226 ± 0.47
4.467AspLys: 4.467 ± 0.447
5.735AspLeu: 5.735 ± 0.553
1.811AspMet: 1.811 ± 0.472
2.777AspAsn: 2.777 ± 0.381
1.388AspPro: 1.388 ± 0.295
1.207AspGln: 1.207 ± 0.276
2.233AspArg: 2.233 ± 0.36
3.562AspSer: 3.562 ± 0.447
4.286AspThr: 4.286 ± 0.777
4.527AspVal: 4.527 ± 0.542
1.026AspTrp: 1.026 ± 0.353
3.803AspTyr: 3.803 ± 0.446
0.0AspXaa: 0.0 ± 0.0
Glu
7.968GluAla: 7.968 ± 0.789
0.543GluCys: 0.543 ± 0.148
6.097GluAsp: 6.097 ± 0.655
10.443GluGlu: 10.443 ± 1.084
3.682GluPhe: 3.682 ± 0.618
5.976GluGly: 5.976 ± 0.611
0.845GluHis: 0.845 ± 0.228
4.226GluIle: 4.226 ± 0.502
5.191GluLys: 5.191 ± 0.809
8.149GluLeu: 8.149 ± 0.837
2.656GluMet: 2.656 ± 0.441
4.346GluAsn: 4.346 ± 0.569
1.63GluPro: 1.63 ± 0.352
3.079GluGln: 3.079 ± 0.49
4.105GluArg: 4.105 ± 0.505
5.433GluSer: 5.433 ± 0.715
4.044GluThr: 4.044 ± 0.517
6.157GluVal: 6.157 ± 0.488
1.449GluTrp: 1.449 ± 0.324
3.622GluTyr: 3.622 ± 0.494
0.0GluXaa: 0.0 ± 0.0
Phe
2.656PheAla: 2.656 ± 0.473
0.241PheCys: 0.241 ± 0.123
2.716PheAsp: 2.716 ± 0.412
3.38PheGlu: 3.38 ± 0.47
1.871PhePhe: 1.871 ± 0.335
3.38PheGly: 3.38 ± 0.39
0.724PheHis: 0.724 ± 0.201
3.38PheIle: 3.38 ± 0.537
3.38PheLys: 3.38 ± 0.408
3.38PheLeu: 3.38 ± 0.459
1.026PheMet: 1.026 ± 0.293
2.656PheAsn: 2.656 ± 0.303
1.328PhePro: 1.328 ± 0.304
1.087PheGln: 1.087 ± 0.238
1.509PheArg: 1.509 ± 0.26
2.535PheSer: 2.535 ± 0.462
2.596PheThr: 2.596 ± 0.363
2.716PheVal: 2.716 ± 0.464
0.604PheTrp: 0.604 ± 0.19
1.63PheTyr: 1.63 ± 0.406
0.0PheXaa: 0.0 ± 0.0
Gly
3.743GlyAla: 3.743 ± 0.425
0.362GlyCys: 0.362 ± 0.173
3.682GlyAsp: 3.682 ± 0.484
4.407GlyGlu: 4.407 ± 0.471
3.199GlyPhe: 3.199 ± 0.464
3.682GlyGly: 3.682 ± 0.759
1.026GlyHis: 1.026 ± 0.23
4.829GlyIle: 4.829 ± 0.662
5.614GlyLys: 5.614 ± 0.696
4.467GlyLeu: 4.467 ± 0.533
1.63GlyMet: 1.63 ± 0.254
3.139GlyAsn: 3.139 ± 0.39
0.121GlyPro: 0.121 ± 0.092
1.871GlyGln: 1.871 ± 0.393
2.535GlyArg: 2.535 ± 0.424
3.682GlySer: 3.682 ± 0.35
4.95GlyThr: 4.95 ± 0.71
3.984GlyVal: 3.984 ± 0.606
1.026GlyTrp: 1.026 ± 0.342
3.562GlyTyr: 3.562 ± 0.391
0.0GlyXaa: 0.0 ± 0.0
His
1.026HisAla: 1.026 ± 0.242
0.241HisCys: 0.241 ± 0.1
0.423HisAsp: 0.423 ± 0.167
0.966HisGlu: 0.966 ± 0.238
0.905HisPhe: 0.905 ± 0.208
1.147HisGly: 1.147 ± 0.265
0.302HisHis: 0.302 ± 0.154
1.328HisIle: 1.328 ± 0.223
0.966HisLys: 0.966 ± 0.25
1.147HisLeu: 1.147 ± 0.289
0.241HisMet: 0.241 ± 0.114
0.905HisAsn: 0.905 ± 0.187
0.664HisPro: 0.664 ± 0.21
0.483HisGln: 0.483 ± 0.18
0.724HisArg: 0.724 ± 0.178
0.543HisSer: 0.543 ± 0.189
0.543HisThr: 0.543 ± 0.162
0.905HisVal: 0.905 ± 0.202
0.06HisTrp: 0.06 ± 0.055
0.785HisTyr: 0.785 ± 0.223
0.0HisXaa: 0.0 ± 0.0
Ile
4.95IleAla: 4.95 ± 0.52
0.483IleCys: 0.483 ± 0.186
4.407IleAsp: 4.407 ± 0.492
5.312IleGlu: 5.312 ± 0.657
2.294IlePhe: 2.294 ± 0.419
3.26IleGly: 3.26 ± 0.404
1.268IleHis: 1.268 ± 0.258
3.441IleIle: 3.441 ± 0.478
5.554IleLys: 5.554 ± 0.669
3.803IleLeu: 3.803 ± 0.539
1.509IleMet: 1.509 ± 0.262
4.467IleAsn: 4.467 ± 0.46
1.932IlePro: 1.932 ± 0.345
2.656IleGln: 2.656 ± 0.493
3.32IleArg: 3.32 ± 0.442
3.562IleSer: 3.562 ± 0.521
4.226IleThr: 4.226 ± 0.697
3.622IleVal: 3.622 ± 0.426
0.483IleTrp: 0.483 ± 0.196
3.32IleTyr: 3.32 ± 0.472
0.0IleXaa: 0.0 ± 0.0
Lys
7.063LysAla: 7.063 ± 0.772
0.181LysCys: 0.181 ± 0.108
5.372LysAsp: 5.372 ± 0.456
7.002LysGlu: 7.002 ± 0.874
2.898LysPhe: 2.898 ± 0.432
4.286LysGly: 4.286 ± 0.457
1.207LysHis: 1.207 ± 0.307
3.984LysIle: 3.984 ± 0.408
5.433LysLys: 5.433 ± 0.73
7.123LysLeu: 7.123 ± 0.651
2.294LysMet: 2.294 ± 0.408
3.139LysAsn: 3.139 ± 0.534
3.018LysPro: 3.018 ± 0.414
2.777LysGln: 2.777 ± 0.489
2.958LysArg: 2.958 ± 0.573
4.346LysSer: 4.346 ± 0.624
4.165LysThr: 4.165 ± 0.447
5.372LysVal: 5.372 ± 0.523
0.966LysTrp: 0.966 ± 0.237
2.475LysTyr: 2.475 ± 0.419
0.0LysXaa: 0.0 ± 0.0
Leu
5.372LeuAla: 5.372 ± 0.532
0.483LeuCys: 0.483 ± 0.202
4.769LeuAsp: 4.769 ± 0.401
9.417LeuGlu: 9.417 ± 0.915
2.898LeuPhe: 2.898 ± 0.433
5.252LeuGly: 5.252 ± 0.653
1.268LeuHis: 1.268 ± 0.26
5.674LeuIle: 5.674 ± 0.672
6.519LeuLys: 6.519 ± 0.678
6.459LeuLeu: 6.459 ± 0.976
2.415LeuMet: 2.415 ± 0.323
4.527LeuAsn: 4.527 ± 0.531
2.898LeuPro: 2.898 ± 0.378
3.501LeuGln: 3.501 ± 0.455
2.898LeuArg: 2.898 ± 0.38
5.372LeuSer: 5.372 ± 0.452
7.364LeuThr: 7.364 ± 0.686
5.252LeuVal: 5.252 ± 0.729
0.905LeuTrp: 0.905 ± 0.236
2.294LeuTyr: 2.294 ± 0.387
0.0LeuXaa: 0.0 ± 0.0
Met
2.777MetAla: 2.777 ± 0.539
0.121MetCys: 0.121 ± 0.098
1.569MetAsp: 1.569 ± 0.244
2.777MetGlu: 2.777 ± 0.451
0.966MetPhe: 0.966 ± 0.279
1.207MetGly: 1.207 ± 0.286
0.241MetHis: 0.241 ± 0.116
1.569MetIle: 1.569 ± 0.347
1.932MetLys: 1.932 ± 0.378
3.018MetLeu: 3.018 ± 0.405
0.604MetMet: 0.604 ± 0.18
1.569MetAsn: 1.569 ± 0.255
1.026MetPro: 1.026 ± 0.188
0.724MetGln: 0.724 ± 0.185
1.268MetArg: 1.268 ± 0.262
1.751MetSer: 1.751 ± 0.282
1.087MetThr: 1.087 ± 0.214
1.569MetVal: 1.569 ± 0.279
0.241MetTrp: 0.241 ± 0.122
0.785MetTyr: 0.785 ± 0.19
0.0MetXaa: 0.0 ± 0.0
Asn
3.622AsnAla: 3.622 ± 0.538
0.241AsnCys: 0.241 ± 0.121
2.958AsnAsp: 2.958 ± 0.366
4.165AsnGlu: 4.165 ± 0.454
1.992AsnPhe: 1.992 ± 0.293
3.984AsnGly: 3.984 ± 0.399
1.026AsnHis: 1.026 ± 0.271
3.622AsnIle: 3.622 ± 0.471
4.588AsnLys: 4.588 ± 0.662
4.105AsnLeu: 4.105 ± 0.471
1.087AsnMet: 1.087 ± 0.219
2.052AsnAsn: 2.052 ± 0.376
2.777AsnPro: 2.777 ± 0.465
1.932AsnGln: 1.932 ± 0.38
2.173AsnArg: 2.173 ± 0.308
3.018AsnSer: 3.018 ± 0.365
2.777AsnThr: 2.777 ± 0.364
3.079AsnVal: 3.079 ± 0.379
0.664AsnTrp: 0.664 ± 0.19
1.932AsnTyr: 1.932 ± 0.329
0.0AsnXaa: 0.0 ± 0.0
Pro
1.69ProAla: 1.69 ± 0.336
0.121ProCys: 0.121 ± 0.077
1.871ProAsp: 1.871 ± 0.351
3.441ProGlu: 3.441 ± 0.506
1.449ProPhe: 1.449 ± 0.296
0.181ProGly: 0.181 ± 0.087
0.483ProHis: 0.483 ± 0.162
1.992ProIle: 1.992 ± 0.325
2.898ProLys: 2.898 ± 0.424
2.052ProLeu: 2.052 ± 0.383
0.845ProMet: 0.845 ± 0.26
1.751ProAsn: 1.751 ± 0.421
0.845ProPro: 0.845 ± 0.226
0.604ProGln: 0.604 ± 0.216
1.147ProArg: 1.147 ± 0.321
2.716ProSer: 2.716 ± 0.425
1.569ProThr: 1.569 ± 0.271
1.871ProVal: 1.871 ± 0.351
0.423ProTrp: 0.423 ± 0.153
1.509ProTyr: 1.509 ± 0.357
0.0ProXaa: 0.0 ± 0.0
Gln
2.475GlnAla: 2.475 ± 0.425
0.06GlnCys: 0.06 ± 0.059
1.751GlnAsp: 1.751 ± 0.307
2.837GlnGlu: 2.837 ± 0.399
1.388GlnPhe: 1.388 ± 0.259
2.837GlnGly: 2.837 ± 0.386
0.604GlnHis: 0.604 ± 0.179
1.751GlnIle: 1.751 ± 0.386
2.233GlnLys: 2.233 ± 0.453
3.622GlnLeu: 3.622 ± 0.495
0.845GlnMet: 0.845 ± 0.277
1.268GlnAsn: 1.268 ± 0.304
0.905GlnPro: 0.905 ± 0.194
1.63GlnGln: 1.63 ± 0.441
1.69GlnArg: 1.69 ± 0.263
2.113GlnSer: 2.113 ± 0.359
2.535GlnThr: 2.535 ± 0.331
2.354GlnVal: 2.354 ± 0.355
0.483GlnTrp: 0.483 ± 0.164
1.569GlnTyr: 1.569 ± 0.334
0.0GlnXaa: 0.0 ± 0.0
Arg
2.596ArgAla: 2.596 ± 0.39
0.362ArgCys: 0.362 ± 0.13
2.354ArgAsp: 2.354 ± 0.296
3.26ArgGlu: 3.26 ± 0.382
1.992ArgPhe: 1.992 ± 0.338
2.354ArgGly: 2.354 ± 0.381
0.724ArgHis: 0.724 ± 0.227
3.38ArgIle: 3.38 ± 0.422
3.26ArgLys: 3.26 ± 0.424
4.286ArgLeu: 4.286 ± 0.587
0.724ArgMet: 0.724 ± 0.203
1.811ArgAsn: 1.811 ± 0.305
1.449ArgPro: 1.449 ± 0.373
1.751ArgGln: 1.751 ± 0.258
1.63ArgArg: 1.63 ± 0.306
1.992ArgSer: 1.992 ± 0.347
1.69ArgThr: 1.69 ± 0.305
2.475ArgVal: 2.475 ± 0.41
0.423ArgTrp: 0.423 ± 0.147
2.052ArgTyr: 2.052 ± 0.349
0.0ArgXaa: 0.0 ± 0.0
Ser
3.018SerAla: 3.018 ± 0.417
0.362SerCys: 0.362 ± 0.16
3.743SerAsp: 3.743 ± 0.498
4.467SerGlu: 4.467 ± 0.429
3.018SerPhe: 3.018 ± 0.342
4.165SerGly: 4.165 ± 0.42
0.724SerHis: 0.724 ± 0.217
3.682SerIle: 3.682 ± 0.453
5.191SerLys: 5.191 ± 0.451
5.372SerLeu: 5.372 ± 0.557
1.509SerMet: 1.509 ± 0.299
3.441SerAsn: 3.441 ± 0.402
1.69SerPro: 1.69 ± 0.279
2.113SerGln: 2.113 ± 0.414
2.173SerArg: 2.173 ± 0.35
2.837SerSer: 2.837 ± 0.404
4.226SerThr: 4.226 ± 0.471
4.105SerVal: 4.105 ± 0.381
0.905SerTrp: 0.905 ± 0.249
2.716SerTyr: 2.716 ± 0.364
0.0SerXaa: 0.0 ± 0.0
Thr
5.191ThrAla: 5.191 ± 0.744
0.604ThrCys: 0.604 ± 0.202
4.044ThrAsp: 4.044 ± 0.42
4.286ThrGlu: 4.286 ± 0.439
2.294ThrPhe: 2.294 ± 0.399
3.562ThrGly: 3.562 ± 0.65
0.905ThrHis: 0.905 ± 0.245
4.165ThrIle: 4.165 ± 0.612
3.682ThrLys: 3.682 ± 0.363
6.7ThrLeu: 6.7 ± 0.646
1.449ThrMet: 1.449 ± 0.311
3.018ThrAsn: 3.018 ± 0.511
2.475ThrPro: 2.475 ± 0.434
1.871ThrGln: 1.871 ± 0.347
1.63ThrArg: 1.63 ± 0.397
3.924ThrSer: 3.924 ± 0.586
2.716ThrThr: 2.716 ± 0.49
5.071ThrVal: 5.071 ± 0.781
0.724ThrTrp: 0.724 ± 0.207
2.656ThrTyr: 2.656 ± 0.42
0.0ThrXaa: 0.0 ± 0.0
Val
4.588ValAla: 4.588 ± 0.539
0.664ValCys: 0.664 ± 0.22
4.527ValAsp: 4.527 ± 0.465
5.674ValGlu: 5.674 ± 0.723
2.958ValPhe: 2.958 ± 0.389
3.562ValGly: 3.562 ± 0.494
0.845ValHis: 0.845 ± 0.21
3.863ValIle: 3.863 ± 0.469
5.735ValLys: 5.735 ± 0.665
4.95ValLeu: 4.95 ± 0.524
1.69ValMet: 1.69 ± 0.297
3.501ValAsn: 3.501 ± 0.443
2.113ValPro: 2.113 ± 0.38
2.596ValGln: 2.596 ± 0.302
2.475ValArg: 2.475 ± 0.411
3.682ValSer: 3.682 ± 0.54
4.044ValThr: 4.044 ± 0.498
3.863ValVal: 3.863 ± 0.55
1.569ValTrp: 1.569 ± 0.314
2.233ValTyr: 2.233 ± 0.425
0.0ValXaa: 0.0 ± 0.0
Trp
0.604TrpAla: 0.604 ± 0.177
0.06TrpCys: 0.06 ± 0.064
0.664TrpAsp: 0.664 ± 0.209
1.569TrpGlu: 1.569 ± 0.278
0.664TrpPhe: 0.664 ± 0.198
1.087TrpGly: 1.087 ± 0.295
0.241TrpHis: 0.241 ± 0.093
1.207TrpIle: 1.207 ± 0.242
0.845TrpLys: 0.845 ± 0.217
0.483TrpLeu: 0.483 ± 0.2
0.121TrpMet: 0.121 ± 0.08
0.966TrpAsn: 0.966 ± 0.196
0.06TrpPro: 0.06 ± 0.059
0.604TrpGln: 0.604 ± 0.226
0.362TrpArg: 0.362 ± 0.14
0.966TrpSer: 0.966 ± 0.228
1.087TrpThr: 1.087 ± 0.301
0.604TrpVal: 0.604 ± 0.169
0.121TrpTrp: 0.121 ± 0.072
0.785TrpTyr: 0.785 ± 0.174
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.113TyrAla: 2.113 ± 0.366
0.362TyrCys: 0.362 ± 0.185
2.716TyrAsp: 2.716 ± 0.459
4.708TyrGlu: 4.708 ± 0.728
1.328TyrPhe: 1.328 ± 0.293
2.716TyrGly: 2.716 ± 0.425
0.543TyrHis: 0.543 ± 0.215
1.63TyrIle: 1.63 ± 0.313
3.743TyrLys: 3.743 ± 0.383
3.924TyrLeu: 3.924 ± 0.432
1.63TyrMet: 1.63 ± 0.347
2.535TyrAsn: 2.535 ± 0.386
1.147TyrPro: 1.147 ± 0.24
1.992TyrGln: 1.992 ± 0.341
1.871TyrArg: 1.871 ± 0.31
2.596TyrSer: 2.596 ± 0.365
2.716TyrThr: 2.716 ± 0.423
2.656TyrVal: 2.656 ± 0.359
0.423TyrTrp: 0.423 ± 0.153
2.173TyrTyr: 2.173 ± 0.351
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 91 proteins (16567 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski