Amino acid dipepetide frequency for Staphylococcus phage Twort (strain DSM 17442 / HER 48) (Bacteriophage Twort)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.051AlaAla: 0.051 ± 0.032
0.384AlaCys: 0.384 ± 0.103
1.896AlaAsp: 1.896 ± 0.232
2.05AlaGlu: 2.05 ± 0.306
1.537AlaPhe: 1.537 ± 0.232
1.666AlaGly: 1.666 ± 0.309
0.564AlaHis: 0.564 ± 0.108
2.896AlaIle: 2.896 ± 0.292
3.895AlaLys: 3.895 ± 0.357
3.741AlaLeu: 3.741 ± 0.368
1.102AlaMet: 1.102 ± 0.2
1.409AlaAsn: 1.409 ± 0.208
1.179AlaPro: 1.179 ± 0.203
1.409AlaGln: 1.409 ± 0.197
1.691AlaArg: 1.691 ± 0.207
2.691AlaSer: 2.691 ± 0.361
2.742AlaThr: 2.742 ± 0.268
2.511AlaVal: 2.511 ± 0.283
0.333AlaTrp: 0.333 ± 0.086
2.024AlaTyr: 2.024 ± 0.24
0.0AlaXaa: 0.0 ± 0.0
Cys
0.231CysAla: 0.231 ± 0.079
0.154CysCys: 0.154 ± 0.057
0.307CysAsp: 0.307 ± 0.082
0.359CysGlu: 0.359 ± 0.096
0.179CysPhe: 0.179 ± 0.066
0.666CysGly: 0.666 ± 0.158
0.102CysHis: 0.102 ± 0.052
0.564CysIle: 0.564 ± 0.157
0.564CysLys: 0.564 ± 0.137
0.589CysLeu: 0.589 ± 0.135
0.179CysMet: 0.179 ± 0.074
0.641CysAsn: 0.641 ± 0.143
0.333CysPro: 0.333 ± 0.101
0.154CysGln: 0.154 ± 0.071
0.307CysArg: 0.307 ± 0.096
0.461CysSer: 0.461 ± 0.117
0.384CysThr: 0.384 ± 0.087
0.359CysVal: 0.359 ± 0.093
0.154CysTrp: 0.154 ± 0.056
0.564CysTyr: 0.564 ± 0.127
0.0CysXaa: 0.0 ± 0.0
Asp
2.665AspAla: 2.665 ± 0.328
0.359AspCys: 0.359 ± 0.098
2.921AspAsp: 2.921 ± 0.243
4.254AspGlu: 4.254 ± 0.36
3.049AspPhe: 3.049 ± 0.277
3.126AspGly: 3.126 ± 0.327
0.487AspHis: 0.487 ± 0.118
6.406AspIle: 6.406 ± 0.516
6.534AspLys: 6.534 ± 0.483
5.279AspLeu: 5.279 ± 0.361
1.409AspMet: 1.409 ± 0.16
5.202AspAsn: 5.202 ± 0.427
1.64AspPro: 1.64 ± 0.2
0.794AspGln: 0.794 ± 0.141
2.434AspArg: 2.434 ± 0.279
4.202AspSer: 4.202 ± 0.302
4.074AspThr: 4.074 ± 0.281
4.228AspVal: 4.228 ± 0.275
0.589AspTrp: 0.589 ± 0.102
3.895AspTyr: 3.895 ± 0.36
0.0AspXaa: 0.0 ± 0.0
Glu
2.46GluAla: 2.46 ± 0.263
0.564GluCys: 0.564 ± 0.097
5.894GluAsp: 5.894 ± 0.537
8.328GluGlu: 8.328 ± 0.8
2.921GluPhe: 2.921 ± 0.227
4.049GluGly: 4.049 ± 0.322
1.127GluHis: 1.127 ± 0.192
4.869GluIle: 4.869 ± 0.334
6.944GluLys: 6.944 ± 0.533
7.841GluLeu: 7.841 ± 0.428
2.281GluMet: 2.281 ± 0.252
4.587GluAsn: 4.587 ± 0.311
2.152GluPro: 2.152 ± 0.365
3.946GluGln: 3.946 ± 0.394
2.665GluArg: 2.665 ± 0.253
4.792GluSer: 4.792 ± 0.298
3.716GluThr: 3.716 ± 0.288
5.689GluVal: 5.689 ± 0.381
0.666GluTrp: 0.666 ± 0.145
4.561GluTyr: 4.561 ± 0.382
0.0GluXaa: 0.0 ± 0.0
Phe
1.127PheAla: 1.127 ± 0.169
0.41PheCys: 0.41 ± 0.1
2.793PheAsp: 2.793 ± 0.271
3.101PheGlu: 3.101 ± 0.244
1.409PhePhe: 1.409 ± 0.163
1.896PheGly: 1.896 ± 0.234
0.564PheHis: 0.564 ± 0.127
3.562PheIle: 3.562 ± 0.328
4.228PheLys: 4.228 ± 0.319
2.87PheLeu: 2.87 ± 0.241
0.999PheMet: 0.999 ± 0.199
3.562PheAsn: 3.562 ± 0.295
0.871PhePro: 0.871 ± 0.149
1.153PheGln: 1.153 ± 0.185
1.025PheArg: 1.025 ± 0.167
2.588PheSer: 2.588 ± 0.235
3.254PheThr: 3.254 ± 0.261
2.409PheVal: 2.409 ± 0.294
0.231PheTrp: 0.231 ± 0.079
2.562PheTyr: 2.562 ± 0.277
0.0PheXaa: 0.0 ± 0.0
Gly
2.332GlyAla: 2.332 ± 0.307
0.461GlyCys: 0.461 ± 0.108
2.665GlyAsp: 2.665 ± 0.27
3.434GlyGlu: 3.434 ± 0.366
2.562GlyPhe: 2.562 ± 0.237
3.177GlyGly: 3.177 ± 0.524
0.999GlyHis: 0.999 ± 0.15
4.1GlyIle: 4.1 ± 0.339
5.612GlyLys: 5.612 ± 0.557
4.869GlyLeu: 4.869 ± 0.402
1.153GlyMet: 1.153 ± 0.189
3.946GlyAsn: 3.946 ± 0.365
0.051GlyPro: 0.051 ± 0.031
1.871GlyGln: 1.871 ± 0.249
2.127GlyArg: 2.127 ± 0.255
3.946GlySer: 3.946 ± 0.563
3.767GlyThr: 3.767 ± 0.487
3.562GlyVal: 3.562 ± 0.312
0.564GlyTrp: 0.564 ± 0.143
3.28GlyTyr: 3.28 ± 0.273
0.0GlyXaa: 0.0 ± 0.0
His
0.436HisAla: 0.436 ± 0.119
0.179HisCys: 0.179 ± 0.07
0.922HisAsp: 0.922 ± 0.149
0.922HisGlu: 0.922 ± 0.147
0.82HisPhe: 0.82 ± 0.138
0.999HisGly: 0.999 ± 0.139
0.359HisHis: 0.359 ± 0.089
1.332HisIle: 1.332 ± 0.199
1.717HisLys: 1.717 ± 0.206
1.332HisLeu: 1.332 ± 0.214
0.436HisMet: 0.436 ± 0.125
0.999HisAsn: 0.999 ± 0.148
0.512HisPro: 0.512 ± 0.113
0.589HisGln: 0.589 ± 0.122
0.615HisArg: 0.615 ± 0.129
0.794HisSer: 0.794 ± 0.124
0.82HisThr: 0.82 ± 0.13
1.179HisVal: 1.179 ± 0.193
0.179HisTrp: 0.179 ± 0.067
0.692HisTyr: 0.692 ± 0.14
0.0HisXaa: 0.0 ± 0.0
Ile
2.716IleAla: 2.716 ± 0.262
0.487IleCys: 0.487 ± 0.136
5.484IleAsp: 5.484 ± 0.371
6.201IleGlu: 6.201 ± 0.457
2.691IlePhe: 2.691 ± 0.25
3.306IleGly: 3.306 ± 0.334
1.102IleHis: 1.102 ± 0.149
5.919IleIle: 5.919 ± 0.461
7.559IleLys: 7.559 ± 0.477
6.073IleLeu: 6.073 ± 0.494
1.614IleMet: 1.614 ± 0.227
5.253IleAsn: 5.253 ± 0.326
2.101IlePro: 2.101 ± 0.23
2.562IleGln: 2.562 ± 0.268
2.383IleArg: 2.383 ± 0.22
4.894IleSer: 4.894 ± 0.376
4.92IleThr: 4.92 ± 0.395
4.177IleVal: 4.177 ± 0.352
0.461IleTrp: 0.461 ± 0.114
3.434IleTyr: 3.434 ± 0.38
0.0IleXaa: 0.0 ± 0.0
Lys
3.767LysAla: 3.767 ± 0.443
0.692LysCys: 0.692 ± 0.152
7.457LysAsp: 7.457 ± 0.478
10.66LysGlu: 10.66 ± 0.71
3.331LysPhe: 3.331 ± 0.242
5.868LysGly: 5.868 ± 0.594
1.666LysHis: 1.666 ± 0.224
5.151LysIle: 5.151 ± 0.352
8.994LysLys: 8.994 ± 0.598
7.662LysLeu: 7.662 ± 0.51
1.64LysMet: 1.64 ± 0.194
6.15LysAsn: 6.15 ± 0.395
2.819LysPro: 2.819 ± 0.301
4.177LysGln: 4.177 ± 0.42
3.331LysArg: 3.331 ± 0.321
5.227LysSer: 5.227 ± 0.384
5.022LysThr: 5.022 ± 0.405
6.97LysVal: 6.97 ± 0.357
0.743LysTrp: 0.743 ± 0.124
5.202LysTyr: 5.202 ± 0.367
0.0LysXaa: 0.0 ± 0.0
Leu
3.229LeuAla: 3.229 ± 0.307
0.384LeuCys: 0.384 ± 0.102
5.996LeuAsp: 5.996 ± 0.412
7.995LeuGlu: 7.995 ± 0.547
3.306LeuPhe: 3.306 ± 0.277
4.715LeuGly: 4.715 ± 0.449
1.512LeuHis: 1.512 ± 0.222
5.151LeuIle: 5.151 ± 0.391
7.457LeuLys: 7.457 ± 0.474
7.38LeuLeu: 7.38 ± 0.59
2.152LeuMet: 2.152 ± 0.237
5.996LeuAsn: 5.996 ± 0.357
2.767LeuPro: 2.767 ± 0.249
3.511LeuGln: 3.511 ± 0.338
3.203LeuArg: 3.203 ± 0.323
6.457LeuSer: 6.457 ± 0.46
5.125LeuThr: 5.125 ± 0.367
5.304LeuVal: 5.304 ± 0.35
0.461LeuTrp: 0.461 ± 0.105
3.511LeuTyr: 3.511 ± 0.292
0.0LeuXaa: 0.0 ± 0.0
Met
1.051MetAla: 1.051 ± 0.153
0.102MetCys: 0.102 ± 0.049
1.256MetAsp: 1.256 ± 0.194
1.973MetGlu: 1.973 ± 0.24
0.948MetPhe: 0.948 ± 0.171
1.179MetGly: 1.179 ± 0.246
0.282MetHis: 0.282 ± 0.092
1.435MetIle: 1.435 ± 0.206
2.101MetLys: 2.101 ± 0.235
1.717MetLeu: 1.717 ± 0.25
0.359MetMet: 0.359 ± 0.101
1.486MetAsn: 1.486 ± 0.196
0.615MetPro: 0.615 ± 0.134
0.641MetGln: 0.641 ± 0.15
1.256MetArg: 1.256 ± 0.183
1.666MetSer: 1.666 ± 0.234
1.127MetThr: 1.127 ± 0.17
1.486MetVal: 1.486 ± 0.21
0.231MetTrp: 0.231 ± 0.084
1.537MetTyr: 1.537 ± 0.192
0.0MetXaa: 0.0 ± 0.0
Asn
2.383AsnAla: 2.383 ± 0.227
0.615AsnCys: 0.615 ± 0.134
3.664AsnAsp: 3.664 ± 0.31
4.715AsnGlu: 4.715 ± 0.268
2.691AsnPhe: 2.691 ± 0.247
4.279AsnGly: 4.279 ± 0.373
0.999AsnHis: 0.999 ± 0.159
5.637AsnIle: 5.637 ± 0.356
7.995AsnLys: 7.995 ± 0.485
5.535AsnLeu: 5.535 ± 0.378
1.256AsnMet: 1.256 ± 0.175
6.176AsnAsn: 6.176 ± 0.519
2.588AsnPro: 2.588 ± 0.281
1.819AsnGln: 1.819 ± 0.202
2.332AsnArg: 2.332 ± 0.249
4.356AsnSer: 4.356 ± 0.431
5.099AsnThr: 5.099 ± 0.345
3.921AsnVal: 3.921 ± 0.359
0.615AsnTrp: 0.615 ± 0.139
3.664AsnTyr: 3.664 ± 0.346
0.0AsnXaa: 0.0 ± 0.0
Pro
0.769ProAla: 0.769 ± 0.131
0.205ProCys: 0.205 ± 0.086
1.486ProAsp: 1.486 ± 0.238
2.152ProGlu: 2.152 ± 0.218
1.179ProPhe: 1.179 ± 0.186
0.922ProGly: 0.922 ± 0.136
0.564ProHis: 0.564 ± 0.116
1.845ProIle: 1.845 ± 0.231
2.716ProLys: 2.716 ± 0.316
2.076ProLeu: 2.076 ± 0.24
0.615ProMet: 0.615 ± 0.124
2.152ProAsn: 2.152 ± 0.222
0.846ProPro: 0.846 ± 0.24
1.256ProGln: 1.256 ± 0.205
1.102ProArg: 1.102 ± 0.159
2.127ProSer: 2.127 ± 0.258
2.178ProThr: 2.178 ± 0.295
1.794ProVal: 1.794 ± 0.234
0.128ProTrp: 0.128 ± 0.059
1.666ProTyr: 1.666 ± 0.206
0.0ProXaa: 0.0 ± 0.0
Gln
1.794GlnAla: 1.794 ± 0.24
0.179GlnCys: 0.179 ± 0.061
2.332GlnAsp: 2.332 ± 0.215
3.229GlnGlu: 3.229 ± 0.314
1.204GlnPhe: 1.204 ± 0.186
2.152GlnGly: 2.152 ± 0.295
0.538GlnHis: 0.538 ± 0.099
2.152GlnIle: 2.152 ± 0.271
3.024GlnLys: 3.024 ± 0.32
3.306GlnLeu: 3.306 ± 0.369
0.589GlnMet: 0.589 ± 0.113
1.845GlnAsn: 1.845 ± 0.229
1.179GlnPro: 1.179 ± 0.285
2.076GlnGln: 2.076 ± 0.339
1.281GlnArg: 1.281 ± 0.194
2.46GlnSer: 2.46 ± 0.267
1.537GlnThr: 1.537 ± 0.169
2.409GlnVal: 2.409 ± 0.277
0.359GlnTrp: 0.359 ± 0.092
1.819GlnTyr: 1.819 ± 0.229
0.0GlnXaa: 0.0 ± 0.0
Arg
1.461ArgAla: 1.461 ± 0.203
0.256ArgCys: 0.256 ± 0.096
2.178ArgAsp: 2.178 ± 0.223
2.178ArgGlu: 2.178 ± 0.228
1.563ArgPhe: 1.563 ± 0.197
1.947ArgGly: 1.947 ± 0.25
0.589ArgHis: 0.589 ± 0.126
2.793ArgIle: 2.793 ± 0.32
3.177ArgLys: 3.177 ± 0.287
3.767ArgLeu: 3.767 ± 0.319
1.127ArgMet: 1.127 ± 0.177
2.486ArgAsn: 2.486 ± 0.31
0.974ArgPro: 0.974 ± 0.143
1.358ArgGln: 1.358 ± 0.166
1.435ArgArg: 1.435 ± 0.209
1.947ArgSer: 1.947 ± 0.233
1.999ArgThr: 1.999 ± 0.193
2.204ArgVal: 2.204 ± 0.268
0.333ArgTrp: 0.333 ± 0.096
1.922ArgTyr: 1.922 ± 0.237
0.0ArgXaa: 0.0 ± 0.0
Ser
2.665SerAla: 2.665 ± 0.284
0.461SerCys: 0.461 ± 0.123
4.049SerAsp: 4.049 ± 0.385
4.407SerGlu: 4.407 ± 0.368
3.382SerPhe: 3.382 ± 0.312
4.177SerGly: 4.177 ± 0.482
0.82SerHis: 0.82 ± 0.148
4.971SerIle: 4.971 ± 0.43
6.765SerLys: 6.765 ± 0.38
5.535SerLeu: 5.535 ± 0.393
1.435SerMet: 1.435 ± 0.214
5.048SerAsn: 5.048 ± 0.452
1.537SerPro: 1.537 ± 0.175
1.691SerGln: 1.691 ± 0.227
2.229SerArg: 2.229 ± 0.214
5.561SerSer: 5.561 ± 0.674
3.818SerThr: 3.818 ± 0.396
4.023SerVal: 4.023 ± 0.357
0.666SerTrp: 0.666 ± 0.161
3.408SerTyr: 3.408 ± 0.318
0.0SerXaa: 0.0 ± 0.0
Thr
2.05ThrAla: 2.05 ± 0.287
0.282ThrCys: 0.282 ± 0.097
4.023ThrAsp: 4.023 ± 0.338
3.741ThrGlu: 3.741 ± 0.292
2.972ThrPhe: 2.972 ± 0.359
3.536ThrGly: 3.536 ± 0.37
1.358ThrHis: 1.358 ± 0.192
5.074ThrIle: 5.074 ± 0.396
5.996ThrLys: 5.996 ± 0.356
5.253ThrLeu: 5.253 ± 0.414
1.076ThrMet: 1.076 ± 0.165
4.382ThrAsn: 4.382 ± 0.397
2.434ThrPro: 2.434 ± 0.275
2.639ThrGln: 2.639 ± 0.223
2.229ThrArg: 2.229 ± 0.274
3.741ThrSer: 3.741 ± 0.439
4.049ThrThr: 4.049 ± 0.427
4.202ThrVal: 4.202 ± 0.366
0.538ThrTrp: 0.538 ± 0.132
2.921ThrTyr: 2.921 ± 0.319
0.0ThrXaa: 0.0 ± 0.0
Val
2.434ValAla: 2.434 ± 0.335
0.615ValCys: 0.615 ± 0.122
4.433ValAsp: 4.433 ± 0.35
5.894ValGlu: 5.894 ± 0.455
2.742ValPhe: 2.742 ± 0.279
3.639ValGly: 3.639 ± 0.347
1.102ValHis: 1.102 ± 0.147
4.433ValIle: 4.433 ± 0.389
5.33ValLys: 5.33 ± 0.412
5.458ValLeu: 5.458 ± 0.329
1.358ValMet: 1.358 ± 0.184
4.305ValAsn: 4.305 ± 0.32
1.666ValPro: 1.666 ± 0.209
1.947ValGln: 1.947 ± 0.202
1.973ValArg: 1.973 ± 0.238
4.792ValSer: 4.792 ± 0.409
4.561ValThr: 4.561 ± 0.414
4.382ValVal: 4.382 ± 0.447
0.436ValTrp: 0.436 ± 0.125
2.921ValTyr: 2.921 ± 0.245
0.0ValXaa: 0.0 ± 0.0
Trp
0.256TrpAla: 0.256 ± 0.088
0.051TrpCys: 0.051 ± 0.034
0.564TrpAsp: 0.564 ± 0.125
0.717TrpGlu: 0.717 ± 0.125
0.282TrpPhe: 0.282 ± 0.077
0.641TrpGly: 0.641 ± 0.15
0.128TrpHis: 0.128 ± 0.057
0.692TrpIle: 0.692 ± 0.124
0.692TrpLys: 0.692 ± 0.112
0.82TrpLeu: 0.82 ± 0.159
0.231TrpMet: 0.231 ± 0.071
0.589TrpAsn: 0.589 ± 0.126
0.0TrpPro: 0.0 ± 0.0
0.205TrpGln: 0.205 ± 0.064
0.282TrpArg: 0.282 ± 0.092
0.41TrpSer: 0.41 ± 0.122
0.436TrpThr: 0.436 ± 0.101
0.666TrpVal: 0.666 ± 0.115
0.102TrpTrp: 0.102 ± 0.069
0.487TrpTyr: 0.487 ± 0.114
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.896TyrAla: 1.896 ± 0.22
0.41TyrCys: 0.41 ± 0.112
3.229TyrAsp: 3.229 ± 0.318
3.741TyrGlu: 3.741 ± 0.285
1.845TyrPhe: 1.845 ± 0.222
2.537TyrGly: 2.537 ± 0.286
0.974TyrHis: 0.974 ± 0.145
4.356TyrIle: 4.356 ± 0.388
5.253TyrLys: 5.253 ± 0.417
4.51TyrLeu: 4.51 ± 0.36
1.409TyrMet: 1.409 ± 0.159
4.126TyrAsn: 4.126 ± 0.357
1.512TyrPro: 1.512 ± 0.229
1.666TyrGln: 1.666 ± 0.207
1.794TyrArg: 1.794 ± 0.22
3.536TyrSer: 3.536 ± 0.428
3.869TyrThr: 3.869 ± 0.454
2.947TyrVal: 2.947 ± 0.251
0.461TyrTrp: 0.461 ± 0.091
3.357TyrTyr: 3.357 ± 0.359
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 195 proteins (39026 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski