Amino acid dipepetide frequency for Enterococcus phage PEf771

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.09AlaAla: 0.09 ± 0.055
0.61AlaCys: 0.61 ± 0.114
4.226AlaAsp: 4.226 ± 0.427
4.881AlaGlu: 4.881 ± 0.372
2.554AlaPhe: 2.554 ± 0.243
3.503AlaGly: 3.503 ± 0.504
1.085AlaHis: 1.085 ± 0.162
4.588AlaIle: 4.588 ± 0.318
5.379AlaLys: 5.379 ± 0.361
5.401AlaLeu: 5.401 ± 0.36
1.537AlaMet: 1.537 ± 0.201
3.367AlaAsn: 3.367 ± 0.393
2.35AlaPro: 2.35 ± 0.306
2.983AlaGln: 2.983 ± 0.322
2.893AlaArg: 2.893 ± 0.247
3.684AlaSer: 3.684 ± 0.402
4.655AlaThr: 4.655 ± 0.476
3.977AlaVal: 3.977 ± 0.301
0.588AlaTrp: 0.588 ± 0.109
3.028AlaTyr: 3.028 ± 0.198
0.0AlaXaa: 0.0 ± 0.0
Cys
0.429CysAla: 0.429 ± 0.11
0.09CysCys: 0.09 ± 0.043
0.655CysAsp: 0.655 ± 0.135
0.384CysGlu: 0.384 ± 0.1
0.339CysPhe: 0.339 ± 0.083
0.588CysGly: 0.588 ± 0.112
0.158CysHis: 0.158 ± 0.064
0.384CysIle: 0.384 ± 0.1
0.836CysLys: 0.836 ± 0.167
0.588CysLeu: 0.588 ± 0.102
0.068CysMet: 0.068 ± 0.035
0.339CysAsn: 0.339 ± 0.105
0.565CysPro: 0.565 ± 0.114
0.181CysGln: 0.181 ± 0.06
0.429CysArg: 0.429 ± 0.118
0.768CysSer: 0.768 ± 0.134
0.429CysThr: 0.429 ± 0.094
0.339CysVal: 0.339 ± 0.098
0.045CysTrp: 0.045 ± 0.029
0.542CysTyr: 0.542 ± 0.106
0.0CysXaa: 0.0 ± 0.0
Asp
3.3AspAla: 3.3 ± 0.288
0.475AspCys: 0.475 ± 0.099
3.119AspAsp: 3.119 ± 0.261
5.266AspGlu: 5.266 ± 0.465
2.87AspPhe: 2.87 ± 0.215
3.977AspGly: 3.977 ± 0.381
0.565AspHis: 0.565 ± 0.103
4.542AspIle: 4.542 ± 0.295
5.537AspLys: 5.537 ± 0.423
5.627AspLeu: 5.627 ± 0.345
2.215AspMet: 2.215 ± 0.322
3.028AspAsn: 3.028 ± 0.274
1.831AspPro: 1.831 ± 0.23
1.333AspGln: 1.333 ± 0.183
2.283AspArg: 2.283 ± 0.253
3.503AspSer: 3.503 ± 0.274
4.384AspThr: 4.384 ± 0.303
4.565AspVal: 4.565 ± 0.339
0.881AspTrp: 0.881 ± 0.12
4.113AspTyr: 4.113 ± 0.4
0.0AspXaa: 0.0 ± 0.0
Glu
5.65GluAla: 5.65 ± 0.39
0.678GluCys: 0.678 ± 0.136
5.107GluAsp: 5.107 ± 0.439
8.452GluGlu: 8.452 ± 0.758
2.735GluPhe: 2.735 ± 0.258
4.271GluGly: 4.271 ± 0.284
1.582GluHis: 1.582 ± 0.211
5.107GluIle: 5.107 ± 0.336
6.034GluLys: 6.034 ± 0.383
7.729GluLeu: 7.729 ± 0.496
2.17GluMet: 2.17 ± 0.225
4.497GluAsn: 4.497 ± 0.342
2.26GluPro: 2.26 ± 0.229
4.068GluGln: 4.068 ± 0.315
3.39GluArg: 3.39 ± 0.309
3.751GluSer: 3.751 ± 0.303
4.791GluThr: 4.791 ± 0.303
5.65GluVal: 5.65 ± 0.365
1.04GluTrp: 1.04 ± 0.136
3.413GluTyr: 3.413 ± 0.275
0.0GluXaa: 0.0 ± 0.0
Phe
2.283PheAla: 2.283 ± 0.215
0.362PheCys: 0.362 ± 0.091
2.463PheAsp: 2.463 ± 0.24
2.712PheGlu: 2.712 ± 0.268
1.13PhePhe: 1.13 ± 0.193
2.509PheGly: 2.509 ± 0.304
0.565PheHis: 0.565 ± 0.107
2.667PheIle: 2.667 ± 0.281
2.757PheLys: 2.757 ± 0.276
2.712PheLeu: 2.712 ± 0.227
1.085PheMet: 1.085 ± 0.174
2.576PheAsn: 2.576 ± 0.241
1.266PhePro: 1.266 ± 0.186
0.994PheGln: 0.994 ± 0.118
1.446PheArg: 1.446 ± 0.178
2.961PheSer: 2.961 ± 0.307
2.667PheThr: 2.667 ± 0.236
2.87PheVal: 2.87 ± 0.247
0.316PheTrp: 0.316 ± 0.105
1.966PheTyr: 1.966 ± 0.237
0.0PheXaa: 0.0 ± 0.0
Gly
3.797GlyAla: 3.797 ± 0.423
0.61GlyCys: 0.61 ± 0.134
3.729GlyAsp: 3.729 ± 0.315
4.542GlyGlu: 4.542 ± 0.295
2.712GlyPhe: 2.712 ± 0.254
4.497GlyGly: 4.497 ± 0.58
1.175GlyHis: 1.175 ± 0.156
4.339GlyIle: 4.339 ± 0.288
5.13GlyLys: 5.13 ± 0.364
4.384GlyLeu: 4.384 ± 0.314
1.492GlyMet: 1.492 ± 0.236
3.571GlyAsn: 3.571 ± 0.321
0.023GlyPro: 0.023 ± 0.02
2.237GlyGln: 2.237 ± 0.298
2.712GlyArg: 2.712 ± 0.296
3.729GlySer: 3.729 ± 0.421
4.655GlyThr: 4.655 ± 0.399
4.723GlyVal: 4.723 ± 0.311
0.904GlyTrp: 0.904 ± 0.172
3.096GlyTyr: 3.096 ± 0.242
0.0GlyXaa: 0.0 ± 0.0
His
0.836HisAla: 0.836 ± 0.134
0.113HisCys: 0.113 ± 0.046
0.814HisAsp: 0.814 ± 0.138
1.311HisGlu: 1.311 ± 0.192
0.791HisPhe: 0.791 ± 0.132
0.881HisGly: 0.881 ± 0.144
0.407HisHis: 0.407 ± 0.09
1.062HisIle: 1.062 ± 0.192
1.22HisLys: 1.22 ± 0.161
1.401HisLeu: 1.401 ± 0.143
0.362HisMet: 0.362 ± 0.102
0.881HisAsn: 0.881 ± 0.121
0.475HisPro: 0.475 ± 0.094
0.429HisGln: 0.429 ± 0.09
0.542HisArg: 0.542 ± 0.108
0.881HisSer: 0.881 ± 0.156
0.994HisThr: 0.994 ± 0.123
1.266HisVal: 1.266 ± 0.185
0.294HisTrp: 0.294 ± 0.094
1.062HisTyr: 1.062 ± 0.183
0.0HisXaa: 0.0 ± 0.0
Ile
4.565IleAla: 4.565 ± 0.308
0.497IleCys: 0.497 ± 0.096
4.565IleAsp: 4.565 ± 0.317
5.559IleGlu: 5.559 ± 0.455
2.011IlePhe: 2.011 ± 0.184
3.706IleGly: 3.706 ± 0.282
1.017IleHis: 1.017 ± 0.171
4.181IleIle: 4.181 ± 0.354
4.836IleLys: 4.836 ± 0.363
4.316IleLeu: 4.316 ± 0.344
1.627IleMet: 1.627 ± 0.192
3.503IleAsn: 3.503 ± 0.314
2.215IlePro: 2.215 ± 0.27
2.599IleGln: 2.599 ± 0.225
2.848IleArg: 2.848 ± 0.287
4.181IleSer: 4.181 ± 0.315
4.249IleThr: 4.249 ± 0.36
3.48IleVal: 3.48 ± 0.29
0.565IleTrp: 0.565 ± 0.128
2.418IleTyr: 2.418 ± 0.218
0.0IleXaa: 0.0 ± 0.0
Lys
5.537LysAla: 5.537 ± 0.347
0.655LysCys: 0.655 ± 0.142
4.972LysAsp: 4.972 ± 0.302
8.362LysGlu: 8.362 ± 0.524
2.237LysPhe: 2.237 ± 0.204
4.203LysGly: 4.203 ± 0.358
1.379LysHis: 1.379 ± 0.201
3.458LysIle: 3.458 ± 0.262
5.785LysLys: 5.785 ± 0.419
6.283LysLeu: 6.283 ± 0.353
2.441LysMet: 2.441 ± 0.261
3.661LysAsn: 3.661 ± 0.311
2.938LysPro: 2.938 ± 0.275
3.413LysGln: 3.413 ± 0.274
3.774LysArg: 3.774 ± 0.337
4.203LysSer: 4.203 ± 0.385
4.52LysThr: 4.52 ± 0.274
5.446LysVal: 5.446 ± 0.375
0.723LysTrp: 0.723 ± 0.135
3.39LysTyr: 3.39 ± 0.284
0.0LysXaa: 0.0 ± 0.0
Leu
5.446LeuAla: 5.446 ± 0.361
0.655LeuCys: 0.655 ± 0.124
5.831LeuAsp: 5.831 ± 0.414
6.486LeuGlu: 6.486 ± 0.419
3.028LeuPhe: 3.028 ± 0.245
5.65LeuGly: 5.65 ± 0.381
1.356LeuHis: 1.356 ± 0.196
4.768LeuIle: 4.768 ± 0.293
5.989LeuLys: 5.989 ± 0.296
6.147LeuLeu: 6.147 ± 0.433
1.65LeuMet: 1.65 ± 0.192
4.927LeuAsn: 4.927 ± 0.375
2.825LeuPro: 2.825 ± 0.28
3.751LeuGln: 3.751 ± 0.252
4.09LeuArg: 4.09 ± 0.374
4.972LeuSer: 4.972 ± 0.311
5.514LeuThr: 5.514 ± 0.378
5.198LeuVal: 5.198 ± 0.406
0.791LeuTrp: 0.791 ± 0.14
3.187LeuTyr: 3.187 ± 0.254
0.0LeuXaa: 0.0 ± 0.0
Met
1.695MetAla: 1.695 ± 0.233
0.226MetCys: 0.226 ± 0.09
1.107MetAsp: 1.107 ± 0.138
1.966MetGlu: 1.966 ± 0.201
0.927MetPhe: 0.927 ± 0.138
1.333MetGly: 1.333 ± 0.203
0.316MetHis: 0.316 ± 0.069
1.424MetIle: 1.424 ± 0.185
2.486MetLys: 2.486 ± 0.25
2.576MetLeu: 2.576 ± 0.282
0.542MetMet: 0.542 ± 0.101
1.65MetAsn: 1.65 ± 0.184
0.655MetPro: 0.655 ± 0.135
1.04MetGln: 1.04 ± 0.177
1.266MetArg: 1.266 ± 0.156
1.898MetSer: 1.898 ± 0.182
1.695MetThr: 1.695 ± 0.177
1.22MetVal: 1.22 ± 0.157
0.203MetTrp: 0.203 ± 0.064
1.582MetTyr: 1.582 ± 0.202
0.0MetXaa: 0.0 ± 0.0
Asn
3.141AsnAla: 3.141 ± 0.306
0.339AsnCys: 0.339 ± 0.095
2.757AsnAsp: 2.757 ± 0.239
3.842AsnGlu: 3.842 ± 0.339
1.853AsnPhe: 1.853 ± 0.226
4.181AsnGly: 4.181 ± 0.31
0.927AsnHis: 0.927 ± 0.15
3.435AsnIle: 3.435 ± 0.346
4.565AsnLys: 4.565 ± 0.328
4.429AsnLeu: 4.429 ± 0.337
1.492AsnMet: 1.492 ± 0.196
2.893AsnAsn: 2.893 ± 0.292
2.215AsnPro: 2.215 ± 0.198
1.831AsnGln: 1.831 ± 0.178
2.893AsnArg: 2.893 ± 0.268
3.526AsnSer: 3.526 ± 0.298
3.3AsnThr: 3.3 ± 0.325
3.548AsnVal: 3.548 ± 0.295
0.723AsnTrp: 0.723 ± 0.123
2.509AsnTyr: 2.509 ± 0.233
0.0AsnXaa: 0.0 ± 0.0
Pro
1.989ProAla: 1.989 ± 0.228
0.203ProCys: 0.203 ± 0.071
2.373ProAsp: 2.373 ± 0.243
2.893ProGlu: 2.893 ± 0.294
1.243ProPhe: 1.243 ± 0.18
0.497ProGly: 0.497 ± 0.115
0.542ProHis: 0.542 ± 0.124
2.011ProIle: 2.011 ± 0.173
2.576ProLys: 2.576 ± 0.244
2.441ProLeu: 2.441 ± 0.234
0.859ProMet: 0.859 ± 0.151
2.011ProAsn: 2.011 ± 0.256
0.655ProPro: 0.655 ± 0.124
1.107ProGln: 1.107 ± 0.257
1.175ProArg: 1.175 ± 0.14
2.396ProSer: 2.396 ± 0.264
1.944ProThr: 1.944 ± 0.219
3.074ProVal: 3.074 ± 0.378
0.384ProTrp: 0.384 ± 0.1
1.446ProTyr: 1.446 ± 0.186
0.0ProXaa: 0.0 ± 0.0
Gln
4.407GlnAla: 4.407 ± 0.483
0.249GlnCys: 0.249 ± 0.086
1.944GlnAsp: 1.944 ± 0.214
3.593GlnGlu: 3.593 ± 0.344
1.288GlnPhe: 1.288 ± 0.182
2.599GlnGly: 2.599 ± 0.255
0.475GlnHis: 0.475 ± 0.103
1.944GlnIle: 1.944 ± 0.201
2.622GlnLys: 2.622 ± 0.264
3.48GlnLeu: 3.48 ± 0.323
1.13GlnMet: 1.13 ± 0.207
1.469GlnAsn: 1.469 ± 0.187
1.333GlnPro: 1.333 ± 0.264
1.763GlnGln: 1.763 ± 0.317
1.559GlnArg: 1.559 ± 0.223
2.057GlnSer: 2.057 ± 0.214
1.876GlnThr: 1.876 ± 0.217
2.825GlnVal: 2.825 ± 0.233
0.226GlnTrp: 0.226 ± 0.078
1.785GlnTyr: 1.785 ± 0.199
0.0GlnXaa: 0.0 ± 0.0
Arg
2.441ArgAla: 2.441 ± 0.299
0.203ArgCys: 0.203 ± 0.061
2.78ArgAsp: 2.78 ± 0.241
3.616ArgGlu: 3.616 ± 0.374
1.785ArgPhe: 1.785 ± 0.22
2.599ArgGly: 2.599 ± 0.262
0.588ArgHis: 0.588 ± 0.113
3.096ArgIle: 3.096 ± 0.308
3.819ArgLys: 3.819 ± 0.312
4.158ArgLeu: 4.158 ± 0.256
1.311ArgMet: 1.311 ± 0.169
2.305ArgAsn: 2.305 ± 0.225
1.198ArgPro: 1.198 ± 0.197
1.695ArgGln: 1.695 ± 0.204
1.492ArgArg: 1.492 ± 0.193
2.057ArgSer: 2.057 ± 0.24
2.463ArgThr: 2.463 ± 0.196
3.006ArgVal: 3.006 ± 0.311
0.429ArgTrp: 0.429 ± 0.104
1.944ArgTyr: 1.944 ± 0.216
0.0ArgXaa: 0.0 ± 0.0
Ser
3.548SerAla: 3.548 ± 0.323
0.362SerCys: 0.362 ± 0.09
4.09SerAsp: 4.09 ± 0.285
3.819SerGlu: 3.819 ± 0.283
2.961SerPhe: 2.961 ± 0.238
4.859SerGly: 4.859 ± 0.411
0.768SerHis: 0.768 ± 0.142
4.136SerIle: 4.136 ± 0.267
4.52SerLys: 4.52 ± 0.339
5.198SerLeu: 5.198 ± 0.393
1.243SerMet: 1.243 ± 0.151
2.825SerAsn: 2.825 ± 0.297
1.831SerPro: 1.831 ± 0.239
1.944SerGln: 1.944 ± 0.213
2.305SerArg: 2.305 ± 0.269
3.774SerSer: 3.774 ± 0.35
3.526SerThr: 3.526 ± 0.344
4.09SerVal: 4.09 ± 0.274
0.836SerTrp: 0.836 ± 0.123
2.667SerTyr: 2.667 ± 0.273
0.0SerXaa: 0.0 ± 0.0
Thr
4.407ThrAla: 4.407 ± 0.45
0.52ThrCys: 0.52 ± 0.13
4.339ThrAsp: 4.339 ± 0.303
4.497ThrGlu: 4.497 ± 0.314
3.277ThrPhe: 3.277 ± 0.308
4.09ThrGly: 4.09 ± 0.309
1.107ThrHis: 1.107 ± 0.14
4.475ThrIle: 4.475 ± 0.353
3.887ThrLys: 3.887 ± 0.253
5.627ThrLeu: 5.627 ± 0.35
1.808ThrMet: 1.808 ± 0.221
3.209ThrAsn: 3.209 ± 0.297
3.006ThrPro: 3.006 ± 0.271
2.373ThrGln: 2.373 ± 0.342
2.418ThrArg: 2.418 ± 0.217
3.39ThrSer: 3.39 ± 0.351
4.249ThrThr: 4.249 ± 0.464
5.243ThrVal: 5.243 ± 0.452
0.859ThrTrp: 0.859 ± 0.141
2.599ThrTyr: 2.599 ± 0.306
0.0ThrXaa: 0.0 ± 0.0
Val
4.701ValAla: 4.701 ± 0.326
0.475ValCys: 0.475 ± 0.118
5.062ValAsp: 5.062 ± 0.368
5.763ValGlu: 5.763 ± 0.48
2.644ValPhe: 2.644 ± 0.255
4.384ValGly: 4.384 ± 0.408
1.107ValHis: 1.107 ± 0.181
3.955ValIle: 3.955 ± 0.347
5.153ValLys: 5.153 ± 0.338
5.062ValLeu: 5.062 ± 0.383
1.446ValMet: 1.446 ± 0.195
3.91ValAsn: 3.91 ± 0.293
2.283ValPro: 2.283 ± 0.271
2.644ValGln: 2.644 ± 0.265
3.119ValArg: 3.119 ± 0.318
4.09ValSer: 4.09 ± 0.315
5.243ValThr: 5.243 ± 0.574
4.746ValVal: 4.746 ± 0.406
0.588ValTrp: 0.588 ± 0.104
3.187ValTyr: 3.187 ± 0.296
0.0ValXaa: 0.0 ± 0.0
Trp
0.633TrpAla: 0.633 ± 0.134
0.203TrpCys: 0.203 ± 0.068
0.61TrpAsp: 0.61 ± 0.125
1.198TrpGlu: 1.198 ± 0.166
0.497TrpPhe: 0.497 ± 0.116
0.746TrpGly: 0.746 ± 0.127
0.113TrpHis: 0.113 ± 0.05
0.678TrpIle: 0.678 ± 0.124
0.701TrpLys: 0.701 ± 0.137
0.881TrpLeu: 0.881 ± 0.147
0.136TrpMet: 0.136 ± 0.053
0.565TrpAsn: 0.565 ± 0.114
0.0TrpPro: 0.0 ± 0.0
0.452TrpGln: 0.452 ± 0.111
0.384TrpArg: 0.384 ± 0.086
0.565TrpSer: 0.565 ± 0.141
0.949TrpThr: 0.949 ± 0.142
0.904TrpVal: 0.904 ± 0.202
0.226TrpTrp: 0.226 ± 0.08
0.701TrpTyr: 0.701 ± 0.124
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.486TyrAla: 2.486 ± 0.239
0.633TyrCys: 0.633 ± 0.113
2.87TyrAsp: 2.87 ± 0.259
3.232TyrGlu: 3.232 ± 0.284
1.424TyrPhe: 1.424 ± 0.179
2.983TyrGly: 2.983 ± 0.269
0.791TyrHis: 0.791 ± 0.136
2.689TyrIle: 2.689 ± 0.307
3.548TyrLys: 3.548 ± 0.363
3.819TyrLeu: 3.819 ± 0.34
1.175TyrMet: 1.175 ± 0.162
3.209TyrAsn: 3.209 ± 0.261
1.898TyrPro: 1.898 ± 0.224
1.785TyrGln: 1.785 ± 0.203
1.966TyrArg: 1.966 ± 0.243
2.915TyrSer: 2.915 ± 0.287
3.277TyrThr: 3.277 ± 0.301
3.413TyrVal: 3.413 ± 0.274
0.52TyrTrp: 0.52 ± 0.119
2.057TyrTyr: 2.057 ± 0.201
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 197 proteins (44250 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski