Amino acid dipepetide frequency for Enterococcus phage EFP01

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.201AlaAla: 0.201 ± 0.072
0.558AlaCys: 0.558 ± 0.102
4.311AlaAsp: 4.311 ± 0.477
4.624AlaGlu: 4.624 ± 0.417
2.614AlaPhe: 2.614 ± 0.241
3.395AlaGly: 3.395 ± 0.39
1.05AlaHis: 1.05 ± 0.147
4.423AlaIle: 4.423 ± 0.325
5.741AlaLys: 5.741 ± 0.36
5.518AlaLeu: 5.518 ± 0.404
1.497AlaMet: 1.497 ± 0.206
3.395AlaAsn: 3.395 ± 0.47
2.256AlaPro: 2.256 ± 0.33
3.06AlaGln: 3.06 ± 0.466
2.748AlaArg: 2.748 ± 0.251
3.686AlaSer: 3.686 ± 0.344
4.892AlaThr: 4.892 ± 0.516
3.775AlaVal: 3.775 ± 0.315
0.558AlaTrp: 0.558 ± 0.099
2.815AlaTyr: 2.815 ± 0.255
0.0AlaXaa: 0.0 ± 0.0
Cys
0.491CysAla: 0.491 ± 0.126
0.156CysCys: 0.156 ± 0.059
0.469CysAsp: 0.469 ± 0.126
0.581CysGlu: 0.581 ± 0.107
0.313CysPhe: 0.313 ± 0.074
0.961CysGly: 0.961 ± 0.191
0.179CysHis: 0.179 ± 0.068
0.313CysIle: 0.313 ± 0.079
1.139CysLys: 1.139 ± 0.188
0.581CysLeu: 0.581 ± 0.11
0.246CysMet: 0.246 ± 0.077
0.268CysAsn: 0.268 ± 0.073
0.692CysPro: 0.692 ± 0.141
0.201CysGln: 0.201 ± 0.064
0.469CysArg: 0.469 ± 0.111
0.648CysSer: 0.648 ± 0.131
0.469CysThr: 0.469 ± 0.109
0.402CysVal: 0.402 ± 0.1
0.067CysTrp: 0.067 ± 0.04
0.536CysTyr: 0.536 ± 0.118
0.0CysXaa: 0.0 ± 0.0
Asp
3.686AspAla: 3.686 ± 0.345
0.469AspCys: 0.469 ± 0.1
2.904AspAsp: 2.904 ± 0.298
4.78AspGlu: 4.78 ± 0.461
2.949AspPhe: 2.949 ± 0.272
3.999AspGly: 3.999 ± 0.408
0.581AspHis: 0.581 ± 0.116
4.244AspIle: 4.244 ± 0.339
5.451AspLys: 5.451 ± 0.446
5.406AspLeu: 5.406 ± 0.376
1.921AspMet: 1.921 ± 0.202
3.15AspAsn: 3.15 ± 0.293
1.698AspPro: 1.698 ± 0.224
1.318AspGln: 1.318 ± 0.192
2.234AspArg: 2.234 ± 0.202
3.708AspSer: 3.708 ± 0.26
4.691AspThr: 4.691 ± 0.368
4.378AspVal: 4.378 ± 0.291
0.983AspTrp: 0.983 ± 0.112
3.44AspTyr: 3.44 ± 0.313
0.0AspXaa: 0.0 ± 0.0
Glu
5.54GluAla: 5.54 ± 0.417
0.692GluCys: 0.692 ± 0.125
4.914GluAsp: 4.914 ± 0.498
7.997GluGlu: 7.997 ± 0.656
2.39GluPhe: 2.39 ± 0.251
4.2GluGly: 4.2 ± 0.3
1.787GluHis: 1.787 ± 0.248
4.847GluIle: 4.847 ± 0.294
5.853GluLys: 5.853 ± 0.407
7.707GluLeu: 7.707 ± 0.572
2.122GluMet: 2.122 ± 0.209
4.088GluAsn: 4.088 ± 0.295
2.457GluPro: 2.457 ± 0.283
3.909GluGln: 3.909 ± 0.322
3.775GluArg: 3.775 ± 0.318
3.619GluSer: 3.619 ± 0.312
4.11GluThr: 4.11 ± 0.249
5.361GluVal: 5.361 ± 0.392
1.05GluTrp: 1.05 ± 0.165
3.485GluTyr: 3.485 ± 0.319
0.0GluXaa: 0.0 ± 0.0
Phe
2.033PheAla: 2.033 ± 0.195
0.357PheCys: 0.357 ± 0.096
2.256PheAsp: 2.256 ± 0.234
2.413PheGlu: 2.413 ± 0.281
1.184PhePhe: 1.184 ± 0.181
2.569PheGly: 2.569 ± 0.258
0.603PheHis: 0.603 ± 0.109
2.904PheIle: 2.904 ± 0.304
2.837PheLys: 2.837 ± 0.227
2.815PheLeu: 2.815 ± 0.276
1.117PheMet: 1.117 ± 0.158
2.524PheAsn: 2.524 ± 0.244
1.43PhePro: 1.43 ± 0.183
1.005PheGln: 1.005 ± 0.145
1.519PheArg: 1.519 ± 0.162
2.748PheSer: 2.748 ± 0.24
2.524PheThr: 2.524 ± 0.229
2.904PheVal: 2.904 ± 0.254
0.268PheTrp: 0.268 ± 0.09
2.033PheTyr: 2.033 ± 0.245
0.0PheXaa: 0.0 ± 0.0
Gly
3.686GlyAla: 3.686 ± 0.277
0.67GlyCys: 0.67 ± 0.13
3.552GlyAsp: 3.552 ± 0.301
4.602GlyGlu: 4.602 ± 0.318
2.614GlyPhe: 2.614 ± 0.264
4.378GlyGly: 4.378 ± 0.532
1.139GlyHis: 1.139 ± 0.145
4.267GlyIle: 4.267 ± 0.344
4.981GlyLys: 4.981 ± 0.336
4.758GlyLeu: 4.758 ± 0.301
1.519GlyMet: 1.519 ± 0.217
3.686GlyAsn: 3.686 ± 0.309
0.022GlyPro: 0.022 ± 0.02
2.279GlyGln: 2.279 ± 0.272
2.949GlyArg: 2.949 ± 0.274
3.82GlySer: 3.82 ± 0.473
4.825GlyThr: 4.825 ± 0.419
4.736GlyVal: 4.736 ± 0.303
0.715GlyTrp: 0.715 ± 0.138
3.284GlyTyr: 3.284 ± 0.258
0.0GlyXaa: 0.0 ± 0.0
His
0.938HisAla: 0.938 ± 0.134
0.112HisCys: 0.112 ± 0.042
0.715HisAsp: 0.715 ± 0.128
1.206HisGlu: 1.206 ± 0.186
0.827HisPhe: 0.827 ± 0.154
1.229HisGly: 1.229 ± 0.161
0.424HisHis: 0.424 ± 0.094
1.072HisIle: 1.072 ± 0.182
1.34HisLys: 1.34 ± 0.173
1.296HisLeu: 1.296 ± 0.166
0.514HisMet: 0.514 ± 0.103
1.028HisAsn: 1.028 ± 0.136
0.447HisPro: 0.447 ± 0.092
0.402HisGln: 0.402 ± 0.084
0.558HisArg: 0.558 ± 0.117
0.871HisSer: 0.871 ± 0.152
1.095HisThr: 1.095 ± 0.16
1.184HisVal: 1.184 ± 0.198
0.268HisTrp: 0.268 ± 0.074
0.983HisTyr: 0.983 ± 0.143
0.0HisXaa: 0.0 ± 0.0
Ile
4.267IleAla: 4.267 ± 0.314
0.581IleCys: 0.581 ± 0.122
4.624IleAsp: 4.624 ± 0.326
5.183IleGlu: 5.183 ± 0.443
1.943IlePhe: 1.943 ± 0.214
3.753IleGly: 3.753 ± 0.295
1.028IleHis: 1.028 ± 0.168
3.842IleIle: 3.842 ± 0.31
4.937IleLys: 4.937 ± 0.32
4.244IleLeu: 4.244 ± 0.315
1.474IleMet: 1.474 ± 0.169
3.574IleAsn: 3.574 ± 0.323
2.167IlePro: 2.167 ± 0.204
2.703IleGln: 2.703 ± 0.251
2.904IleArg: 2.904 ± 0.24
4.177IleSer: 4.177 ± 0.3
4.11IleThr: 4.11 ± 0.373
3.574IleVal: 3.574 ± 0.361
0.625IleTrp: 0.625 ± 0.122
2.435IleTyr: 2.435 ± 0.237
0.0IleXaa: 0.0 ± 0.0
Lys
5.652LysAla: 5.652 ± 0.413
0.737LysCys: 0.737 ± 0.158
5.16LysAsp: 5.16 ± 0.264
8.221LysGlu: 8.221 ± 0.57
2.614LysPhe: 2.614 ± 0.216
4.713LysGly: 4.713 ± 0.474
1.318LysHis: 1.318 ± 0.173
3.596LysIle: 3.596 ± 0.308
6.031LysLys: 6.031 ± 0.383
6.165LysLeu: 6.165 ± 0.381
2.256LysMet: 2.256 ± 0.207
3.731LysAsn: 3.731 ± 0.317
2.993LysPro: 2.993 ± 0.247
3.351LysGln: 3.351 ± 0.242
3.976LysArg: 3.976 ± 0.423
4.378LysSer: 4.378 ± 0.382
4.736LysThr: 4.736 ± 0.36
5.562LysVal: 5.562 ± 0.381
0.648LysTrp: 0.648 ± 0.137
3.418LysTyr: 3.418 ± 0.268
0.0LysXaa: 0.0 ± 0.0
Leu
5.629LeuAla: 5.629 ± 0.371
0.692LeuCys: 0.692 ± 0.139
6.076LeuAsp: 6.076 ± 0.37
6.255LeuGlu: 6.255 ± 0.408
2.926LeuPhe: 2.926 ± 0.247
5.361LeuGly: 5.361 ± 0.36
1.43LeuHis: 1.43 ± 0.18
4.825LeuIle: 4.825 ± 0.307
6.433LeuLys: 6.433 ± 0.314
6.523LeuLeu: 6.523 ± 0.422
1.72LeuMet: 1.72 ± 0.197
5.272LeuAsn: 5.272 ± 0.403
2.591LeuPro: 2.591 ± 0.266
3.395LeuGln: 3.395 ± 0.254
4.177LeuArg: 4.177 ± 0.395
5.183LeuSer: 5.183 ± 0.363
5.384LeuThr: 5.384 ± 0.367
5.138LeuVal: 5.138 ± 0.366
0.804LeuTrp: 0.804 ± 0.131
3.306LeuTyr: 3.306 ± 0.297
0.0LeuXaa: 0.0 ± 0.0
Met
1.72MetAla: 1.72 ± 0.217
0.29MetCys: 0.29 ± 0.101
1.229MetAsp: 1.229 ± 0.163
1.899MetGlu: 1.899 ± 0.196
0.849MetPhe: 0.849 ± 0.129
1.206MetGly: 1.206 ± 0.193
0.29MetHis: 0.29 ± 0.088
1.497MetIle: 1.497 ± 0.224
2.279MetLys: 2.279 ± 0.175
2.39MetLeu: 2.39 ± 0.261
0.469MetMet: 0.469 ± 0.096
1.519MetAsn: 1.519 ± 0.203
0.625MetPro: 0.625 ± 0.148
1.05MetGln: 1.05 ± 0.174
1.296MetArg: 1.296 ± 0.199
1.899MetSer: 1.899 ± 0.201
1.698MetThr: 1.698 ± 0.218
0.983MetVal: 0.983 ± 0.127
0.246MetTrp: 0.246 ± 0.065
1.296MetTyr: 1.296 ± 0.161
0.0MetXaa: 0.0 ± 0.0
Asn
3.105AsnAla: 3.105 ± 0.333
0.447AsnCys: 0.447 ± 0.088
2.725AsnAsp: 2.725 ± 0.233
3.932AsnGlu: 3.932 ± 0.276
1.921AsnPhe: 1.921 ± 0.254
4.2AsnGly: 4.2 ± 0.315
1.162AsnHis: 1.162 ± 0.194
3.284AsnIle: 3.284 ± 0.284
4.691AsnLys: 4.691 ± 0.297
4.512AsnLeu: 4.512 ± 0.346
1.586AsnMet: 1.586 ± 0.215
2.77AsnAsn: 2.77 ± 0.282
2.323AsnPro: 2.323 ± 0.216
1.966AsnGln: 1.966 ± 0.199
2.77AsnArg: 2.77 ± 0.254
3.485AsnSer: 3.485 ± 0.341
3.552AsnThr: 3.552 ± 0.332
3.328AsnVal: 3.328 ± 0.293
0.737AsnTrp: 0.737 ± 0.135
2.547AsnTyr: 2.547 ± 0.233
0.0AsnXaa: 0.0 ± 0.0
Pro
1.943ProAla: 1.943 ± 0.246
0.201ProCys: 0.201 ± 0.07
2.167ProAsp: 2.167 ± 0.236
2.904ProGlu: 2.904 ± 0.329
1.363ProPhe: 1.363 ± 0.182
0.514ProGly: 0.514 ± 0.107
0.424ProHis: 0.424 ± 0.1
2.077ProIle: 2.077 ± 0.219
2.882ProLys: 2.882 ± 0.284
2.681ProLeu: 2.681 ± 0.235
0.715ProMet: 0.715 ± 0.13
2.167ProAsn: 2.167 ± 0.288
0.692ProPro: 0.692 ± 0.168
1.028ProGln: 1.028 ± 0.236
1.072ProArg: 1.072 ± 0.126
2.279ProSer: 2.279 ± 0.262
2.346ProThr: 2.346 ± 0.213
2.703ProVal: 2.703 ± 0.372
0.402ProTrp: 0.402 ± 0.094
1.653ProTyr: 1.653 ± 0.193
0.0ProXaa: 0.0 ± 0.0
Gln
3.954GlnAla: 3.954 ± 0.502
0.246GlnCys: 0.246 ± 0.064
1.631GlnAsp: 1.631 ± 0.192
3.306GlnGlu: 3.306 ± 0.343
1.318GlnPhe: 1.318 ± 0.191
2.368GlnGly: 2.368 ± 0.244
0.581GlnHis: 0.581 ± 0.12
2.122GlnIle: 2.122 ± 0.201
2.681GlnLys: 2.681 ± 0.297
3.351GlnLeu: 3.351 ± 0.251
0.871GlnMet: 0.871 ± 0.167
1.608GlnAsn: 1.608 ± 0.176
1.296GlnPro: 1.296 ± 0.283
1.876GlnGln: 1.876 ± 0.306
1.742GlnArg: 1.742 ± 0.2
2.301GlnSer: 2.301 ± 0.226
2.167GlnThr: 2.167 ± 0.265
2.882GlnVal: 2.882 ± 0.244
0.313GlnTrp: 0.313 ± 0.089
1.631GlnTyr: 1.631 ± 0.204
0.0GlnXaa: 0.0 ± 0.0
Arg
2.614ArgAla: 2.614 ± 0.338
0.424ArgCys: 0.424 ± 0.147
2.859ArgAsp: 2.859 ± 0.259
3.842ArgGlu: 3.842 ± 0.312
1.765ArgPhe: 1.765 ± 0.206
2.703ArgGly: 2.703 ± 0.262
0.715ArgHis: 0.715 ± 0.159
2.971ArgIle: 2.971 ± 0.232
3.485ArgLys: 3.485 ± 0.346
4.691ArgLeu: 4.691 ± 0.296
1.229ArgMet: 1.229 ± 0.158
2.167ArgAsn: 2.167 ± 0.213
1.005ArgPro: 1.005 ± 0.173
1.787ArgGln: 1.787 ± 0.234
1.631ArgArg: 1.631 ± 0.192
1.876ArgSer: 1.876 ± 0.221
2.48ArgThr: 2.48 ± 0.269
3.306ArgVal: 3.306 ± 0.301
0.402ArgTrp: 0.402 ± 0.081
2.033ArgTyr: 2.033 ± 0.269
0.0ArgXaa: 0.0 ± 0.0
Ser
3.485SerAla: 3.485 ± 0.296
0.558SerCys: 0.558 ± 0.14
3.932SerAsp: 3.932 ± 0.31
4.133SerGlu: 4.133 ± 0.322
2.815SerPhe: 2.815 ± 0.266
4.78SerGly: 4.78 ± 0.39
0.715SerHis: 0.715 ± 0.112
4.066SerIle: 4.066 ± 0.312
4.803SerLys: 4.803 ± 0.333
4.959SerLeu: 4.959 ± 0.369
1.117SerMet: 1.117 ± 0.181
2.859SerAsn: 2.859 ± 0.25
2.01SerPro: 2.01 ± 0.228
1.899SerGln: 1.899 ± 0.218
2.256SerArg: 2.256 ± 0.23
3.596SerSer: 3.596 ± 0.281
3.731SerThr: 3.731 ± 0.306
3.999SerVal: 3.999 ± 0.269
0.849SerTrp: 0.849 ± 0.122
2.636SerTyr: 2.636 ± 0.232
0.0SerXaa: 0.0 ± 0.0
Thr
4.244ThrAla: 4.244 ± 0.514
0.581ThrCys: 0.581 ± 0.127
4.334ThrAsp: 4.334 ± 0.383
4.512ThrGlu: 4.512 ± 0.273
3.261ThrPhe: 3.261 ± 0.293
4.177ThrGly: 4.177 ± 0.358
1.117ThrHis: 1.117 ± 0.126
4.468ThrIle: 4.468 ± 0.349
4.356ThrLys: 4.356 ± 0.275
5.83ThrLeu: 5.83 ± 0.422
1.72ThrMet: 1.72 ± 0.227
3.552ThrAsn: 3.552 ± 0.303
3.239ThrPro: 3.239 ± 0.295
2.279ThrGln: 2.279 ± 0.231
2.658ThrArg: 2.658 ± 0.265
3.596ThrSer: 3.596 ± 0.276
4.021ThrThr: 4.021 ± 0.442
5.361ThrVal: 5.361 ± 0.505
0.804ThrTrp: 0.804 ± 0.203
2.681ThrTyr: 2.681 ± 0.321
0.0ThrXaa: 0.0 ± 0.0
Val
4.758ValAla: 4.758 ± 0.386
0.715ValCys: 0.715 ± 0.134
4.803ValAsp: 4.803 ± 0.363
5.406ValGlu: 5.406 ± 0.474
2.591ValPhe: 2.591 ± 0.25
4.423ValGly: 4.423 ± 0.375
1.05ValHis: 1.05 ± 0.167
3.82ValIle: 3.82 ± 0.282
4.937ValLys: 4.937 ± 0.336
4.959ValLeu: 4.959 ± 0.401
1.452ValMet: 1.452 ± 0.198
4.088ValAsn: 4.088 ± 0.308
2.48ValPro: 2.48 ± 0.237
2.301ValGln: 2.301 ± 0.234
2.904ValArg: 2.904 ± 0.267
3.731ValSer: 3.731 ± 0.278
5.607ValThr: 5.607 ± 0.462
4.803ValVal: 4.803 ± 0.42
0.558ValTrp: 0.558 ± 0.117
3.44ValTyr: 3.44 ± 0.302
0.0ValXaa: 0.0 ± 0.0
Trp
0.648TrpAla: 0.648 ± 0.134
0.179TrpCys: 0.179 ± 0.065
0.491TrpAsp: 0.491 ± 0.096
1.095TrpGlu: 1.095 ± 0.139
0.447TrpPhe: 0.447 ± 0.108
0.737TrpGly: 0.737 ± 0.135
0.156TrpHis: 0.156 ± 0.059
0.692TrpIle: 0.692 ± 0.112
0.692TrpLys: 0.692 ± 0.132
0.804TrpLeu: 0.804 ± 0.141
0.134TrpMet: 0.134 ± 0.054
0.603TrpAsn: 0.603 ± 0.116
0.0TrpPro: 0.0 ± 0.0
0.536TrpGln: 0.536 ± 0.118
0.424TrpArg: 0.424 ± 0.093
0.558TrpSer: 0.558 ± 0.103
0.961TrpThr: 0.961 ± 0.123
0.849TrpVal: 0.849 ± 0.157
0.246TrpTrp: 0.246 ± 0.077
0.67TrpTyr: 0.67 ± 0.122
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.524TyrAla: 2.524 ± 0.264
0.514TyrCys: 0.514 ± 0.111
2.837TyrAsp: 2.837 ± 0.266
3.016TyrGlu: 3.016 ± 0.26
1.34TyrPhe: 1.34 ± 0.201
2.815TyrGly: 2.815 ± 0.252
0.76TyrHis: 0.76 ± 0.122
2.815TyrIle: 2.815 ± 0.251
3.708TyrLys: 3.708 ± 0.299
3.932TyrLeu: 3.932 ± 0.321
1.028TyrMet: 1.028 ± 0.162
2.993TyrAsn: 2.993 ± 0.257
1.787TyrPro: 1.787 ± 0.216
1.899TyrGln: 1.899 ± 0.198
1.921TyrArg: 1.921 ± 0.215
3.038TyrSer: 3.038 ± 0.232
3.395TyrThr: 3.395 ± 0.289
3.596TyrVal: 3.596 ± 0.313
0.38TyrTrp: 0.38 ± 0.084
2.01TyrTyr: 2.01 ± 0.278
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 193 proteins (44767 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski