Amino acid dipeptide frequency for Erwinia phage phiEa2809

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random with equal probability, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and underrepresented ones are marked in blue.
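
The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the database's own code: sequences are taken to be one-letter strings, dipeptides are counted as overlapping pairs within each protein, and the counts are normalised by the total number of pairs (the database may normalise differently). The names `dipeptide_per_mille` and `classify` and the toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes

# Uniform expectation from the text: 1/400 = 0.25% = 2.5 per mille.
UNIFORM_EXPECTATION = 1000.0 / 400


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for the 400 standard pairs.

    Assumption: overlapping windows (residues i and i+1), never spanning
    two proteins, normalised by the total number of pairs counted.
    """
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {a + b: 0.0 for a, b in product(AMINO_ACIDS, repeat=2)}
    # Pairs containing non-standard letters still count towards the total,
    # but are not listed in the returned dictionary.
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(AMINO_ACIDS, repeat=2)}


def classify(freq):
    """Label a per-mille frequency relative to the uniform expectation."""
    if freq > UNIFORM_EXPECTATION:
        return "over"   # would be marked red
    if freq < UNIFORM_EXPECTATION:
        return "under"  # would be marked blue
    return "expected"


# Toy input, not taken from the phage proteome.
toy_proteins = ["MAAAL", "MKAL"]
freqs = dipeptide_per_mille(toy_proteins)
print(round(freqs["AA"], 3), classify(freqs["AA"]))
```

With only seven pairs in the toy input, almost every dipeptide is either absent or far above 2.5‰; on a real proteome the values spread around the expectation as in the table.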

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
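
As a small worked example of the unit conversion, the snippet below turns the AlaAla value from the table (7.373‰) into a fraction and compares it with the uniform expectation of 1/400. The helper name `per_mille_to_fraction` is hypothetical.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


ala_ala = per_mille_to_fraction(7.373)  # AlaAla entry from the table
uniform = 1 / 400                       # 0.0025, i.e. 0.25% or 2.5 per mille
fold = ala_ala / uniform                # enrichment relative to the uniform case

print(f"fraction={ala_ala:.6f}, percent={100 * ala_ala:.4f}%, fold={fold:.2f}")
```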

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.373 ± 0.492
AlaCys: 0.73 ± 0.14
AlaAsp: 5.465 ± 0.404
AlaGlu: 5.041 ± 0.344
AlaPhe: 3.251 ± 0.272
AlaGly: 5.842 ± 0.4
AlaHis: 1.06 ± 0.15
AlaIle: 4.782 ± 0.288
AlaLys: 4.664 ± 0.358
AlaLeu: 6.313 ± 0.386
AlaMet: 2.308 ± 0.203
AlaAsn: 3.816 ± 0.305
AlaPro: 3.251 ± 0.348
AlaGln: 3.274 ± 0.347
AlaArg: 3.722 ± 0.296
AlaSer: 4.099 ± 0.339
AlaThr: 5.135 ± 0.341
AlaVal: 5.253 ± 0.459
AlaTrp: 1.06 ± 0.154
AlaTyr: 2.403 ± 0.25
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.801 ± 0.181
CysCys: 0.094 ± 0.042
CysAsp: 0.542 ± 0.104
CysGlu: 0.683 ± 0.138
CysPhe: 0.283 ± 0.076
CysGly: 0.824 ± 0.133
CysHis: 0.259 ± 0.088
CysIle: 0.471 ± 0.085
CysLys: 0.448 ± 0.098
CysLeu: 0.471 ± 0.114
CysMet: 0.212 ± 0.069
CysAsn: 0.4 ± 0.089
CysPro: 0.424 ± 0.097
CysGln: 0.306 ± 0.079
CysArg: 0.495 ± 0.101
CysSer: 0.66 ± 0.146
CysThr: 0.518 ± 0.123
CysVal: 0.73 ± 0.133
CysTrp: 0.188 ± 0.067
CysTyr: 0.283 ± 0.079
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.007 ± 0.397
AspCys: 0.518 ± 0.109
AspAsp: 3.675 ± 0.294
AspGlu: 4.311 ± 0.322
AspPhe: 2.662 ± 0.249
AspGly: 4.994 ± 0.286
AspHis: 1.343 ± 0.173
AspIle: 4.287 ± 0.281
AspLys: 3.627 ± 0.31
AspLeu: 5.111 ± 0.322
AspMet: 2.12 ± 0.232
AspAsn: 3.109 ± 0.246
AspPro: 2.567 ± 0.229
AspGln: 2.426 ± 0.254
AspArg: 2.638 ± 0.273
AspSer: 3.463 ± 0.249
AspThr: 3.769 ± 0.266
AspVal: 4.523 ± 0.384
AspTrp: 1.107 ± 0.165
AspTyr: 3.133 ± 0.258
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.617 ± 0.331
GluCys: 0.542 ± 0.105
GluAsp: 3.651 ± 0.295
GluGlu: 3.839 ± 0.379
GluPhe: 3.463 ± 0.296
GluGly: 4.546 ± 0.365
GluHis: 1.107 ± 0.156
GluIle: 3.957 ± 0.292
GluLys: 3.604 ± 0.312
GluLeu: 5.724 ± 0.443
GluMet: 2.12 ± 0.203
GluAsn: 2.756 ± 0.258
GluPro: 2.167 ± 0.23
GluGln: 2.944 ± 0.287
GluArg: 3.321 ± 0.344
GluSer: 3.463 ± 0.279
GluThr: 3.463 ± 0.303
GluVal: 4.57 ± 0.305
GluTrp: 1.084 ± 0.166
GluTyr: 2.709 ± 0.222
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.803 ± 0.208
PheCys: 0.518 ± 0.127
PheAsp: 3.133 ± 0.264
PheGlu: 3.392 ± 0.294
PhePhe: 1.39 ± 0.154
PheGly: 2.968 ± 0.236
PheHis: 0.895 ± 0.155
PheIle: 2.426 ± 0.244
PheLys: 2.615 ± 0.199
PheLeu: 3.039 ± 0.244
PheMet: 1.154 ± 0.166
PheAsn: 2.214 ± 0.214
PhePro: 1.437 ± 0.217
PheGln: 1.531 ± 0.151
PheArg: 2.143 ± 0.274
PheSer: 2.921 ± 0.31
PheThr: 3.321 ± 0.311
PheVal: 2.638 ± 0.181
PheTrp: 0.754 ± 0.151
PheTyr: 1.531 ± 0.168
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.499 ± 0.415
GlyCys: 0.707 ± 0.126
GlyAsp: 4.452 ± 0.291
GlyGlu: 4.122 ± 0.312
GlyPhe: 2.756 ± 0.249
GlyGly: 4.899 ± 0.479
GlyHis: 0.942 ± 0.159
GlyIle: 3.745 ± 0.295
GlyLys: 5.3 ± 0.434
GlyLeu: 5.488 ± 0.34
GlyMet: 1.814 ± 0.201
GlyAsn: 3.486 ± 0.414
GlyPro: 1.649 ± 0.202
GlyGln: 2.897 ± 0.232
GlyArg: 3.58 ± 0.33
GlySer: 4.24 ± 0.461
GlyThr: 4.004 ± 0.401
GlyVal: 5.253 ± 0.371
GlyTrp: 1.036 ± 0.147
GlyTyr: 2.874 ± 0.223
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.754 ± 0.136
HisCys: 0.188 ± 0.08
HisAsp: 0.989 ± 0.154
HisGlu: 1.084 ± 0.154
HisPhe: 0.848 ± 0.144
HisGly: 1.201 ± 0.154
HisHis: 0.4 ± 0.095
HisIle: 1.036 ± 0.185
HisLys: 1.225 ± 0.165
HisLeu: 1.625 ± 0.212
HisMet: 0.471 ± 0.111
HisAsn: 0.683 ± 0.117
HisPro: 0.942 ± 0.146
HisGln: 0.542 ± 0.125
HisArg: 0.848 ± 0.147
HisSer: 1.225 ± 0.162
HisThr: 1.296 ± 0.178
HisVal: 0.966 ± 0.136
HisTrp: 0.283 ± 0.079
HisTyr: 0.801 ± 0.137
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.381 ± 0.328
IleCys: 0.236 ± 0.075
IleAsp: 4.899 ± 0.388
IleGlu: 4.24 ± 0.368
IlePhe: 1.814 ± 0.219
IleGly: 3.298 ± 0.231
IleHis: 0.848 ± 0.12
IleIle: 3.486 ± 0.332
IleLys: 4.169 ± 0.332
IleLeu: 3.557 ± 0.309
IleMet: 1.272 ± 0.198
IleAsn: 3.651 ± 0.269
IlePro: 2.544 ± 0.227
IleGln: 2.191 ± 0.228
IleArg: 3.368 ± 0.245
IleSer: 3.345 ± 0.231
IleThr: 3.792 ± 0.286
IleVal: 3.58 ± 0.276
IleTrp: 0.872 ± 0.118
IleTyr: 1.649 ± 0.192
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.653 ± 0.335
LysCys: 0.424 ± 0.11
LysAsp: 3.769 ± 0.337
LysGlu: 3.91 ± 0.397
LysPhe: 2.52 ± 0.223
LysGly: 3.981 ± 0.327
LysHis: 1.131 ± 0.152
LysIle: 4.24 ± 0.278
LysLys: 3.887 ± 0.357
LysLeu: 4.994 ± 0.374
LysMet: 2.049 ± 0.22
LysAsn: 2.85 ± 0.242
LysPro: 3.015 ± 0.29
LysGln: 2.473 ± 0.21
LysArg: 3.133 ± 0.288
LysSer: 3.651 ± 0.238
LysThr: 3.722 ± 0.244
LysVal: 3.91 ± 0.313
LysTrp: 0.801 ± 0.143
LysTyr: 2.662 ± 0.211
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.831 ± 0.393
LeuCys: 0.589 ± 0.121
LeuAsp: 5.418 ± 0.388
LeuGlu: 5.3 ± 0.431
LeuPhe: 3.439 ± 0.256
LeuGly: 4.475 ± 0.366
LeuHis: 1.46 ± 0.217
LeuIle: 3.274 ± 0.229
LeuLys: 5.795 ± 0.4
LeuLeu: 5.583 ± 0.413
LeuMet: 1.932 ± 0.223
LeuAsn: 4.122 ± 0.267
LeuPro: 3.58 ± 0.299
LeuGln: 3.486 ± 0.257
LeuArg: 4.099 ± 0.287
LeuSer: 4.829 ± 0.297
LeuThr: 5.159 ± 0.371
LeuVal: 5.559 ± 0.34
LeuTrp: 0.683 ± 0.142
LeuTyr: 2.662 ± 0.263
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.52 ± 0.217
MetCys: 0.283 ± 0.085
MetAsp: 1.79 ± 0.169
MetGlu: 1.437 ± 0.176
MetPhe: 1.272 ± 0.201
MetGly: 1.672 ± 0.244
MetHis: 0.424 ± 0.098
MetIle: 1.437 ± 0.188
MetLys: 1.908 ± 0.196
MetLeu: 2.261 ± 0.282
MetMet: 0.966 ± 0.134
MetAsn: 1.814 ± 0.192
MetPro: 1.154 ± 0.174
MetGln: 1.06 ± 0.149
MetArg: 1.39 ± 0.172
MetSer: 1.955 ± 0.221
MetThr: 1.72 ± 0.198
MetVal: 1.861 ± 0.21
MetTrp: 0.259 ± 0.088
MetTyr: 0.895 ± 0.157
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.051 ± 0.315
AsnCys: 0.471 ± 0.097
AsnAsp: 3.015 ± 0.236
AsnGlu: 2.827 ± 0.279
AsnPhe: 2.143 ± 0.242
AsnGly: 4.499 ± 0.439
AsnHis: 1.178 ± 0.16
AsnIle: 2.897 ± 0.279
AsnLys: 2.591 ± 0.246
AsnLeu: 3.533 ± 0.23
AsnMet: 1.39 ± 0.165
AsnAsn: 2.897 ± 0.276
AsnPro: 2.355 ± 0.266
AsnGln: 2.261 ± 0.211
AsnArg: 2.403 ± 0.256
AsnSer: 3.557 ± 0.296
AsnThr: 3.557 ± 0.357
AsnVal: 3.109 ± 0.296
AsnTrp: 0.73 ± 0.157
AsnTyr: 1.767 ± 0.189
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.368 ± 0.359
ProCys: 0.283 ± 0.088
ProAsp: 2.732 ± 0.237
ProGlu: 3.439 ± 0.302
ProPhe: 1.743 ± 0.194
ProGly: 2.426 ± 0.23
ProHis: 0.707 ± 0.128
ProIle: 2.002 ± 0.185
ProLys: 2.567 ± 0.166
ProLeu: 2.756 ± 0.232
ProMet: 1.343 ± 0.171
ProAsn: 1.837 ± 0.227
ProPro: 1.107 ± 0.223
ProGln: 1.508 ± 0.189
ProArg: 1.743 ± 0.211
ProSer: 2.662 ± 0.248
ProThr: 2.403 ± 0.234
ProVal: 3.18 ± 0.32
ProTrp: 0.66 ± 0.113
ProTyr: 1.296 ± 0.167
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.463 ± 0.369
GlnCys: 0.377 ± 0.079
GlnAsp: 2.709 ± 0.275
GlnGlu: 2.308 ± 0.25
GlnPhe: 2.167 ± 0.236
GlnGly: 2.473 ± 0.251
GlnHis: 0.707 ± 0.143
GlnIle: 2.615 ± 0.287
GlnLys: 2.497 ± 0.24
GlnLeu: 4.004 ± 0.333
GlnMet: 1.154 ± 0.143
GlnAsn: 2.073 ± 0.196
GlnPro: 1.248 ± 0.196
GlnGln: 1.884 ± 0.259
GlnArg: 2.544 ± 0.238
GlnSer: 2.073 ± 0.187
GlnThr: 2.85 ± 0.266
GlnVal: 2.921 ± 0.279
GlnTrp: 0.565 ± 0.093
GlnTyr: 1.672 ± 0.209
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.439 ± 0.263
ArgCys: 0.707 ± 0.128
ArgAsp: 2.944 ± 0.308
ArgGlu: 2.944 ± 0.263
ArgPhe: 2.285 ± 0.278
ArgGly: 3.133 ± 0.324
ArgHis: 0.895 ± 0.132
ArgIle: 2.85 ± 0.217
ArgLys: 3.18 ± 0.262
ArgLeu: 4.546 ± 0.346
ArgMet: 1.555 ± 0.189
ArgAsn: 2.002 ± 0.175
ArgPro: 1.484 ± 0.206
ArgGln: 2.379 ± 0.249
ArgArg: 2.567 ± 0.256
ArgSer: 3.039 ± 0.301
ArgThr: 2.779 ± 0.219
ArgVal: 4.004 ± 0.363
ArgTrp: 0.683 ± 0.126
ArgTyr: 2.073 ± 0.245
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.687 ± 0.417
SerCys: 0.495 ± 0.133
SerAsp: 3.769 ± 0.29
SerGlu: 3.486 ± 0.241
SerPhe: 2.567 ± 0.233
SerGly: 4.852 ± 0.416
SerHis: 0.777 ± 0.162
SerIle: 4.004 ± 0.292
SerLys: 3.627 ± 0.29
SerLeu: 4.523 ± 0.344
SerMet: 1.508 ± 0.187
SerAsn: 3.203 ± 0.289
SerPro: 2.332 ± 0.248
SerGln: 3.109 ± 0.243
SerArg: 2.52 ± 0.244
SerSer: 3.698 ± 0.297
SerThr: 3.863 ± 0.328
SerVal: 5.064 ± 0.442
SerTrp: 0.872 ± 0.137
SerTyr: 2.379 ± 0.211
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.899 ± 0.493
ThrCys: 0.424 ± 0.099
ThrAsp: 4.004 ± 0.317
ThrGlu: 3.769 ± 0.251
ThrPhe: 3.18 ± 0.308
ThrGly: 4.64 ± 0.402
ThrHis: 0.848 ± 0.147
ThrIle: 3.321 ± 0.327
ThrLys: 3.863 ± 0.33
ThrLeu: 4.97 ± 0.356
ThrMet: 1.343 ± 0.196
ThrAsn: 3.133 ± 0.336
ThrPro: 3.533 ± 0.343
ThrGln: 2.85 ± 0.308
ThrArg: 2.756 ± 0.248
ThrSer: 4.169 ± 0.384
ThrThr: 3.722 ± 0.405
ThrVal: 5.276 ± 0.382
ThrTrp: 0.919 ± 0.121
ThrTyr: 1.72 ± 0.21
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.829 ± 0.404
ValCys: 0.707 ± 0.13
ValAsp: 5.3 ± 0.347
ValGlu: 4.829 ± 0.35
ValPhe: 2.615 ± 0.251
ValGly: 4.57 ± 0.436
ValHis: 1.39 ± 0.209
ValIle: 3.722 ± 0.227
ValLys: 4.64 ± 0.347
ValLeu: 5.3 ± 0.344
ValMet: 1.979 ± 0.205
ValAsn: 4.193 ± 0.306
ValPro: 2.685 ± 0.22
ValGln: 3.039 ± 0.242
ValArg: 3.18 ± 0.223
ValSer: 5.088 ± 0.349
ValThr: 4.994 ± 0.417
ValVal: 4.852 ± 0.36
ValTrp: 0.872 ± 0.13
ValTyr: 1.861 ± 0.229
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.942 ± 0.146
TrpCys: 0.259 ± 0.096
TrpAsp: 0.989 ± 0.149
TrpGlu: 1.107 ± 0.154
TrpPhe: 0.683 ± 0.143
TrpGly: 0.471 ± 0.108
TrpHis: 0.306 ± 0.082
TrpIle: 0.73 ± 0.128
TrpLys: 0.636 ± 0.126
TrpLeu: 1.46 ± 0.213
TrpMet: 0.4 ± 0.085
TrpAsn: 0.73 ± 0.133
TrpPro: 0.542 ± 0.113
TrpGln: 0.612 ± 0.118
TrpArg: 0.872 ± 0.145
TrpSer: 0.754 ± 0.125
TrpThr: 0.777 ± 0.135
TrpVal: 1.154 ± 0.208
TrpTrp: 0.212 ± 0.062
TrpTyr: 0.542 ± 0.127
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.921 ± 0.235
TyrCys: 0.542 ± 0.125
TyrAsp: 2.143 ± 0.251
TyrGlu: 1.578 ± 0.164
TyrPhe: 1.743 ± 0.169
TyrGly: 1.979 ± 0.208
TyrHis: 0.73 ± 0.129
TyrIle: 1.908 ± 0.195
TyrLys: 1.932 ± 0.214
TyrLeu: 3.015 ± 0.24
TyrMet: 0.966 ± 0.132
TyrAsn: 2.261 ± 0.217
TyrPro: 1.743 ± 0.186
TyrGln: 1.531 ± 0.154
TyrArg: 2.167 ± 0.234
TyrSer: 2.45 ± 0.226
TyrThr: 2.497 ± 0.311
TyrVal: 2.238 ± 0.216
TyrTrp: 0.542 ± 0.099
TyrTyr: 1.578 ± 0.187
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 145 proteins (42455 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
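
A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration under assumptions the page does not confirm: "x100" is read as 100 resamples, whole proteins are drawn with replacement, and the reported error is taken to be the standard deviation across replicates. The function names and toy sequences are hypothetical.

```python
import random
import statistics


def dipeptide_frequency(proteins, dipeptide):
    """Per-mille frequency of one dipeptide over overlapping residue pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide  # True counts as 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Assumption: each replicate draws len(proteins) whole proteins with
    replacement, so residues within a protein always stay together.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    replicates = []
    for _ in range(n_replicates):
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(dipeptide_frequency(sample, dipeptide))
    return statistics.stdev(replicates)


# Toy input, not taken from the phage proteome.
toy_proteins = ["MAAAL", "MKAL", "MAALKK", "MGGSAA"]
estimate = dipeptide_frequency(toy_proteins, "AA")
error = bootstrap_error(toy_proteins, "AA")
print(f"AA: {estimate:.3f} ± {error:.3f} (per mille)")
```

Resampling whole proteins rather than individual residues keeps neighbouring residues together, so the spread reflects variation between proteins.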

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski