Amino acid dipeptide frequency for Escherichia phage JS98

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each one should be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
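The comparison against the uniform expectation can be sketched in Python as follows. This is a minimal illustration, not the Proteome-pI implementation: the function names `dipeptide_per_mille` and `classify` are hypothetical, the toy sequences are invented, and counting overlapping pairs within each protein (normalised by the total number of pairs) is an assumption about how such frequencies are typically computed.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        # Map anything outside the 20 standard letters to 'X' (Xaa).
        clean = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2, never spanning two proteins.
        counts.update(clean[i:i + 2] for i in range(len(clean) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    # Report all 441 combinations, including those never observed.
    return {a + b: (1000.0 * counts[a + b] / total if total else 0.0)
            for a, b in product(alphabet, repeat=2)}


def classify(freq_per_mille, expected=1000.0 / 400):
    """Label a frequency relative to the uniform expectation (2.5 per mille)."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


toy = ["MKAAL", "AAXK"]  # hypothetical toy sequences, not JS98 data
freqs = dipeptide_per_mille(toy)
```

With values from the table, `classify(5.989)` (AlaAla) gives `"over"` and `classify(0.383)` (AlaCys) gives `"under"`. Note that the 2.5‰ baseline assumes all 20 amino acids are equally common, which real proteomes do not satisfy.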

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
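As a worked conversion, taking the AlaAla value from the table and the uniform expectation of one in 400:

```latex
\[
5.989\ \text{\textperthousand} = 5.989 \times 10^{-3} = 0.5989\,\%,
\qquad
\frac{1}{400} = 0.25\,\% = 2.5\ \text{\textperthousand}
\]
```

AlaAla therefore occurs roughly 2.4 times as often as the uniform expectation.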

For more information, see the sequence space article on Wikipedia.

Dipeptide frequency ± error (‰). Rows give the first residue of the dipeptide, columns the second.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 5.989 ± 0.476 | 0.383 ± 0.097 | 4.037 ± 0.295 | 5.491 ± 0.399 | 2.468 ± 0.206 | 5.07 ± 0.46 | 1.339 ± 0.158 | 4.477 ± 0.322 | 4.688 ± 0.344 | 6.391 ± 0.393 | 1.856 ± 0.177 | 3.234 ± 0.244 | 2.832 ± 0.255 | 2.583 ± 0.239 | 3.1 ± 0.234 | 4.707 ± 0.348 | 3.674 ± 0.412 | 4.994 ± 0.317 | 0.957 ± 0.136 | 3.061 ± 0.255 | 0.0 ± 0.0
Cys | 0.823 ± 0.132 | 0.153 ± 0.063 | 0.708 ± 0.121 | 0.899 ± 0.156 | 0.555 ± 0.098 | 0.593 ± 0.113 | 0.287 ± 0.087 | 0.67 ± 0.12 | 0.631 ± 0.102 | 0.67 ± 0.11 | 0.383 ± 0.083 | 0.402 ± 0.082 | 0.497 ± 0.102 | 0.325 ± 0.075 | 0.574 ± 0.111 | 0.631 ± 0.112 | 0.478 ± 0.085 | 0.689 ± 0.133 | 0.153 ± 0.049 | 0.344 ± 0.084 | 0.0 ± 0.0
Asp | 4.362 ± 0.257 | 0.727 ± 0.102 | 4.209 ± 0.334 | 4.688 ± 0.317 | 3.482 ± 0.283 | 5.204 ± 0.312 | 0.784 ± 0.116 | 5.166 ± 0.29 | 4.764 ± 0.321 | 5.319 ± 0.33 | 1.645 ± 0.187 | 2.966 ± 0.214 | 2.277 ± 0.214 | 1.837 ± 0.181 | 2.334 ± 0.231 | 3.367 ± 0.245 | 3.31 ± 0.318 | 4.267 ± 0.301 | 1.301 ± 0.163 | 3.138 ± 0.273 | 0.0 ± 0.0
Glu | 5.434 ± 0.375 | 0.823 ± 0.117 | 4.515 ± 0.317 | 4.841 ± 0.378 | 3.712 ± 0.288 | 3.788 ± 0.31 | 1.167 ± 0.152 | 6.008 ± 0.41 | 4.362 ± 0.339 | 7.003 ± 0.429 | 2.143 ± 0.22 | 4.037 ± 0.28 | 1.971 ± 0.197 | 2.564 ± 0.231 | 2.545 ± 0.231 | 3.731 ± 0.293 | 4.228 ± 0.324 | 5.357 ± 0.263 | 1.071 ± 0.146 | 3.674 ± 0.312 | 0.0 ± 0.0
Phe | 2.87 ± 0.189 | 0.44 ± 0.097 | 3.214 ± 0.284 | 3.463 ± 0.304 | 1.378 ± 0.161 | 3.1 ± 0.228 | 0.689 ± 0.098 | 2.947 ± 0.23 | 4.19 ± 0.333 | 2.219 ± 0.199 | 1.358 ± 0.166 | 2.774 ± 0.271 | 1.11 ± 0.138 | 1.512 ± 0.184 | 2.047 ± 0.195 | 2.717 ± 0.211 | 2.277 ± 0.184 | 2.832 ± 0.242 | 0.689 ± 0.102 | 1.76 ± 0.172 | 0.0 ± 0.0
Gly | 3.559 ± 0.333 | 0.689 ± 0.115 | 4.496 ± 0.312 | 4.018 ± 0.243 | 2.755 ± 0.224 | 3.98 ± 0.625 | 0.918 ± 0.138 | 4.037 ± 0.272 | 4.171 ± 0.322 | 5.396 ± 0.26 | 1.818 ± 0.217 | 3.635 ± 0.401 | 1.952 ± 0.158 | 2.162 ± 0.239 | 2.755 ± 0.256 | 4.056 ± 0.315 | 4.573 ± 0.417 | 3.769 ± 0.279 | 1.148 ± 0.169 | 2.832 ± 0.262 | 0.0 ± 0.0
His | 0.88 ± 0.118 | 0.287 ± 0.078 | 0.899 ± 0.107 | 1.091 ± 0.157 | 0.842 ± 0.13 | 1.014 ± 0.119 | 0.383 ± 0.089 | 1.282 ± 0.166 | 1.244 ± 0.165 | 1.32 ± 0.164 | 0.325 ± 0.092 | 0.861 ± 0.139 | 1.071 ± 0.131 | 0.478 ± 0.096 | 0.746 ± 0.108 | 1.091 ± 0.121 | 0.957 ± 0.124 | 1.129 ± 0.147 | 0.249 ± 0.068 | 0.689 ± 0.132 | 0.0 ± 0.0
Ile | 4.86 ± 0.362 | 0.593 ± 0.1 | 5.262 ± 0.341 | 5.109 ± 0.383 | 2.373 ± 0.202 | 3.234 ± 0.215 | 1.282 ± 0.172 | 4.267 ± 0.298 | 6.965 ± 0.398 | 4.075 ± 0.292 | 1.913 ± 0.192 | 3.922 ± 0.258 | 2.87 ± 0.234 | 2.621 ± 0.196 | 3.521 ± 0.311 | 4.114 ± 0.285 | 4.324 ± 0.28 | 4.286 ± 0.318 | 0.689 ± 0.108 | 2.181 ± 0.198 | 0.0 ± 0.0
Lys | 6.046 ± 0.443 | 0.899 ± 0.144 | 4.764 ± 0.307 | 5.549 ± 0.385 | 3.291 ± 0.253 | 4.305 ± 0.31 | 1.55 ± 0.2 | 4.86 ± 0.338 | 4.458 ± 0.323 | 6.046 ± 0.352 | 2.526 ± 0.206 | 3.578 ± 0.287 | 2.583 ± 0.225 | 2.602 ± 0.252 | 3.176 ± 0.261 | 4.037 ± 0.255 | 4.114 ± 0.288 | 5.53 ± 0.338 | 1.014 ± 0.123 | 3.138 ± 0.245 | 0.0 ± 0.0
Leu | 5.874 ± 0.298 | 0.536 ± 0.115 | 5.051 ± 0.288 | 5.434 ± 0.395 | 3.291 ± 0.265 | 4.095 ± 0.289 | 1.186 ± 0.165 | 4.841 ± 0.299 | 6.18 ± 0.38 | 5.319 ± 0.345 | 2.373 ± 0.263 | 4.802 ± 0.304 | 2.87 ± 0.217 | 2.43 ± 0.187 | 3.425 ± 0.223 | 5.013 ± 0.281 | 4.669 ± 0.342 | 4.707 ± 0.261 | 0.689 ± 0.109 | 3.444 ± 0.267 | 0.0 ± 0.0
Met | 2.296 ± 0.182 | 0.402 ± 0.086 | 1.971 ± 0.218 | 1.645 ± 0.147 | 1.339 ± 0.163 | 1.358 ± 0.161 | 0.364 ± 0.085 | 1.645 ± 0.197 | 2.545 ± 0.213 | 1.856 ± 0.186 | 0.708 ± 0.127 | 1.741 ± 0.174 | 0.861 ± 0.122 | 0.918 ± 0.126 | 1.091 ± 0.145 | 1.875 ± 0.171 | 1.856 ± 0.18 | 1.799 ± 0.174 | 0.364 ± 0.096 | 1.052 ± 0.132 | 0.0 ± 0.0
Asn | 3.693 ± 0.29 | 0.517 ± 0.109 | 3.272 ± 0.262 | 3.884 ± 0.273 | 2.526 ± 0.223 | 4.228 ± 0.342 | 1.052 ± 0.139 | 3.846 ± 0.28 | 3.654 ± 0.238 | 3.482 ± 0.268 | 1.435 ± 0.202 | 2.927 ± 0.295 | 2.526 ± 0.224 | 1.779 ± 0.175 | 2.086 ± 0.195 | 3.674 ± 0.266 | 3.004 ± 0.271 | 3.253 ± 0.275 | 0.555 ± 0.114 | 2.086 ± 0.183 | 0.0 ± 0.0
Pro | 2.373 ± 0.191 | 0.344 ± 0.076 | 2.774 ± 0.233 | 3.348 ± 0.301 | 1.435 ± 0.162 | 2.564 ± 0.214 | 0.555 ± 0.092 | 2.545 ± 0.238 | 2.526 ± 0.223 | 2.392 ± 0.191 | 0.651 ± 0.124 | 1.799 ± 0.169 | 0.899 ± 0.153 | 1.148 ± 0.132 | 1.282 ± 0.199 | 2.277 ± 0.209 | 2.296 ± 0.191 | 2.698 ± 0.213 | 0.612 ± 0.129 | 1.492 ± 0.19 | 0.0 ± 0.0
Gln | 3.004 ± 0.243 | 0.325 ± 0.082 | 1.818 ± 0.185 | 2.258 ± 0.229 | 1.55 ± 0.149 | 2.028 ± 0.196 | 0.536 ± 0.102 | 2.602 ± 0.252 | 2.124 ± 0.197 | 2.793 ± 0.241 | 1.071 ± 0.159 | 1.531 ± 0.175 | 0.976 ± 0.128 | 1.014 ± 0.166 | 1.952 ± 0.185 | 1.894 ± 0.204 | 2.219 ± 0.285 | 2.296 ± 0.224 | 0.938 ± 0.126 | 1.588 ± 0.179 | 0.0 ± 0.0
Arg | 2.698 ± 0.26 | 0.497 ± 0.095 | 2.64 ± 0.201 | 3.272 ± 0.246 | 1.952 ± 0.171 | 2.526 ± 0.25 | 0.727 ± 0.118 | 3.31 ± 0.24 | 3.138 ± 0.226 | 3.597 ± 0.276 | 1.244 ± 0.165 | 2.124 ± 0.207 | 1.32 ± 0.151 | 1.799 ± 0.183 | 2.162 ± 0.207 | 2.353 ± 0.222 | 2.353 ± 0.231 | 2.947 ± 0.236 | 0.67 ± 0.113 | 1.703 ± 0.179 | 0.0 ± 0.0
Ser | 3.482 ± 0.284 | 0.727 ± 0.136 | 3.98 ± 0.267 | 4.439 ± 0.343 | 2.564 ± 0.26 | 4.439 ± 0.315 | 1.052 ± 0.134 | 4.056 ± 0.25 | 4.515 ± 0.295 | 4.956 ± 0.307 | 1.492 ± 0.162 | 2.793 ± 0.29 | 2.258 ± 0.228 | 2.239 ± 0.177 | 2.545 ± 0.219 | 4.305 ± 0.331 | 3.463 ± 0.293 | 3.961 ± 0.234 | 0.899 ± 0.14 | 2.736 ± 0.218 | 0.0 ± 0.0
Thr | 4.63 ± 0.364 | 0.536 ± 0.092 | 3.348 ± 0.256 | 4.42 ± 0.324 | 2.736 ± 0.27 | 4.267 ± 0.334 | 0.823 ± 0.12 | 3.884 ± 0.328 | 3.961 ± 0.261 | 4.592 ± 0.395 | 1.225 ± 0.16 | 2.851 ± 0.242 | 2.564 ± 0.22 | 2.066 ± 0.28 | 2.889 ± 0.287 | 3.348 ± 0.303 | 3.463 ± 0.371 | 3.98 ± 0.305 | 0.67 ± 0.126 | 2.162 ± 0.2 | 0.0 ± 0.0
Val | 4.688 ± 0.266 | 0.861 ± 0.123 | 4.496 ± 0.292 | 5.415 ± 0.367 | 2.755 ± 0.218 | 3.616 ± 0.327 | 0.976 ± 0.132 | 4.286 ± 0.294 | 5.376 ± 0.293 | 4.879 ± 0.315 | 1.894 ± 0.171 | 4.037 ± 0.271 | 2.602 ± 0.234 | 2.411 ± 0.19 | 2.679 ± 0.214 | 4.037 ± 0.306 | 3.808 ± 0.316 | 4.248 ± 0.324 | 0.804 ± 0.126 | 3.1 ± 0.248 | 0.0 ± 0.0
Trp | 0.899 ± 0.123 | 0.172 ± 0.047 | 0.823 ± 0.121 | 0.823 ± 0.122 | 0.67 ± 0.108 | 0.555 ± 0.139 | 0.325 ± 0.076 | 0.823 ± 0.12 | 1.473 ± 0.195 | 1.091 ± 0.166 | 0.536 ± 0.102 | 0.842 ± 0.133 | 0.44 ± 0.102 | 0.478 ± 0.093 | 0.478 ± 0.101 | 0.804 ± 0.135 | 0.784 ± 0.129 | 1.129 ± 0.163 | 0.249 ± 0.058 | 0.88 ± 0.128 | 0.0 ± 0.0
Tyr | 2.87 ± 0.287 | 0.555 ± 0.111 | 2.87 ± 0.242 | 2.889 ± 0.245 | 1.913 ± 0.178 | 2.755 ± 0.241 | 0.842 ± 0.114 | 2.908 ± 0.259 | 3.1 ± 0.283 | 2.832 ± 0.272 | 1.071 ± 0.126 | 2.621 ± 0.228 | 1.531 ± 0.155 | 1.55 ± 0.19 | 1.645 ± 0.188 | 2.87 ± 0.261 | 2.564 ± 0.232 | 3.119 ± 0.225 | 0.555 ± 0.105 | 1.856 ± 0.212 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 266 proteins (52266 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
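A protein-level bootstrap of this kind can be sketched as below. This is only an illustration under stated assumptions: the source gives the number of replicates (100) and the resampling unit (whole proteins), but the choice of the standard deviation across replicates as the reported error, the function names `pair_counts` and `bootstrap_error`, and the toy sequences are all hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within one protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, dipeptide, n_boot=100, seed=0):
    """Std. dev. of a dipeptide's per-mille frequency over protein-level resamples."""
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]  # count once, reuse
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the spread of the replicates is reported as the error.
    return statistics.pstdev(estimates)


toy = ["MKAAL", "AAGKAA", "GGSAAK", "MLLK"]  # hypothetical sequences
err = bootstrap_error(toy, "AA")
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, so the error reflects variation between proteins.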

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski