Amino acid dipeptide frequency for Escherichia phage nieznany

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
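
A minimal sketch of how such frequencies could be computed is given here. The function name `dipeptide_frequencies`, the one-letter toy sequences, and the choice to normalise by the total number of counted pairs are assumptions for illustration only; the database may use a different denominator or procedure.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Overlapping pairs are counted within each protein only, so no pair
    spans the boundary between two proteins (an assumption of this sketch).
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Hypothetical toy input in one-letter codes (not taken from this proteome)
freqs = dipeptide_frequencies(["MAAK", "AAG"])
```

In this toy input there are five pairs in total (MA, AA, AK, AA, AG), so `freqs["AA"]` is 400.0 per mille.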

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide should be present with a frequency of 1/400 = 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
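
The comparison against the uniform expectation can be sketched as follows. The helper names are hypothetical, and the simple greater-than or less-than rule (ignoring the error estimates) is an assumption; the page does not state exactly how the colouring threshold is applied. The two observed values are taken from the table.

```python
STANDARD_AMINO_ACIDS = 20

def expected_uniform_per_mille(alphabet_size=STANDARD_AMINO_ACIDS):
    """Expected per-mille frequency of each dipeptide under a uniform random model."""
    return 1000.0 / (alphabet_size ** 2)

def classify(observed_per_mille, expected_per_mille):
    """Label a dipeptide relative to the expectation (assumed simple comparison)."""
    if observed_per_mille > expected_per_mille:
        return "over"
    if observed_per_mille < expected_per_mille:
        return "under"
    return "as expected"

expected = expected_uniform_per_mille()  # 1000 / 400 = 2.5 per mille, i.e. 0.25%
labels = {
    "GluGlu": classify(7.721, expected),  # value from the table
    "CysCys": classify(0.388, expected),  # value from the table
}
```

With these inputs GluGlu is labelled "over" and CysCys "under". Passing `alphabet_size=21` gives the expectation when Xaa is included (1000/441 per mille).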

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.774 ± 0.388
AlaCys: 0.937 ± 0.154
AlaAsp: 3.495 ± 0.254
AlaGlu: 5.003 ± 0.357
AlaPhe: 2.878 ± 0.246
AlaGly: 5.094 ± 0.412
AlaHis: 1.325 ± 0.18
AlaIle: 4.363 ± 0.323
AlaLys: 5.026 ± 0.344
AlaLeu: 5.894 ± 0.369
AlaMet: 1.965 ± 0.197
AlaAsn: 3.678 ± 0.306
AlaPro: 2.01 ± 0.206
AlaGln: 2.079 ± 0.201
AlaArg: 2.673 ± 0.237
AlaSer: 3.838 ± 0.385
AlaThr: 4.523 ± 0.397
AlaVal: 4.637 ± 0.365
AlaTrp: 1.279 ± 0.168
AlaTyr: 3.541 ± 0.33
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.005 ± 0.142
CysCys: 0.388 ± 0.098
CysAsp: 0.685 ± 0.134
CysGlu: 0.868 ± 0.131
CysPhe: 1.119 ± 0.16
CysGly: 1.074 ± 0.167
CysHis: 0.274 ± 0.08
CysIle: 0.754 ± 0.17
CysLys: 1.142 ± 0.145
CysLeu: 1.028 ± 0.17
CysMet: 0.548 ± 0.12
CysAsn: 0.525 ± 0.113
CysPro: 0.662 ± 0.139
CysGln: 0.503 ± 0.102
CysArg: 0.685 ± 0.138
CysSer: 0.754 ± 0.132
CysThr: 0.617 ± 0.12
CysVal: 1.051 ± 0.145
CysTrp: 0.297 ± 0.106
CysTyr: 0.662 ± 0.114
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.203 ± 0.308
AspCys: 0.731 ± 0.137
AspAsp: 3.541 ± 0.243
AspGlu: 4.158 ± 0.366
AspPhe: 2.947 ± 0.246
AspGly: 4.889 ± 0.38
AspHis: 1.416 ± 0.167
AspIle: 4.226 ± 0.319
AspLys: 4.226 ± 0.323
AspLeu: 5.46 ± 0.33
AspMet: 1.69 ± 0.173
AspAsn: 3.495 ± 0.272
AspPro: 2.421 ± 0.228
AspGln: 2.079 ± 0.208
AspArg: 2.353 ± 0.24
AspSer: 3.29 ± 0.32
AspThr: 3.267 ± 0.27
AspVal: 4.5 ± 0.292
AspTrp: 1.279 ± 0.183
AspTyr: 3.404 ± 0.265
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.574 ± 0.363
GluCys: 0.822 ± 0.138
GluAsp: 5.734 ± 0.442
GluGlu: 7.721 ± 0.559
GluPhe: 2.833 ± 0.234
GluGly: 4.66 ± 0.31
GluHis: 1.371 ± 0.189
GluIle: 4.523 ± 0.312
GluLys: 4.98 ± 0.377
GluLeu: 5.3 ± 0.423
GluMet: 2.718 ± 0.246
GluAsn: 3.655 ± 0.325
GluPro: 1.782 ± 0.219
GluGln: 2.787 ± 0.272
GluArg: 2.947 ± 0.271
GluSer: 3.267 ± 0.284
GluThr: 4.043 ± 0.28
GluVal: 4.911 ± 0.302
GluTrp: 1.759 ± 0.201
GluTyr: 3.221 ± 0.313
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.061 ± 0.234
PheCys: 0.617 ± 0.114
PheAsp: 3.244 ± 0.251
PheGlu: 3.152 ± 0.244
PhePhe: 1.987 ± 0.24
PheGly: 2.901 ± 0.276
PheHis: 0.754 ± 0.119
PheIle: 2.33 ± 0.242
PheLys: 2.97 ± 0.251
PheLeu: 3.29 ± 0.254
PheMet: 1.005 ± 0.179
PheAsn: 2.604 ± 0.264
PhePro: 1.645 ± 0.174
PheGln: 1.302 ± 0.191
PheArg: 1.828 ± 0.231
PheSer: 2.856 ± 0.258
PheThr: 2.467 ± 0.277
PheVal: 3.312 ± 0.28
PheTrp: 0.617 ± 0.119
PheTyr: 2.193 ± 0.285
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.066 ± 0.325
GlyCys: 1.279 ± 0.206
GlyAsp: 4.615 ± 0.537
GlyGlu: 4.866 ± 0.29
GlyPhe: 3.29 ± 0.272
GlyGly: 4.432 ± 0.406
GlyHis: 1.393 ± 0.161
GlyIle: 4.112 ± 0.387
GlyLys: 4.957 ± 0.277
GlyLeu: 5.254 ± 0.358
GlyMet: 1.782 ± 0.194
GlyAsn: 3.587 ± 0.309
GlyPro: 1.074 ± 0.218
GlyGln: 1.965 ± 0.222
GlyArg: 2.81 ± 0.241
GlySer: 3.883 ± 0.314
GlyThr: 4.043 ± 0.485
GlyVal: 5.208 ± 0.344
GlyTrp: 1.325 ± 0.162
GlyTyr: 3.609 ± 0.293
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.074 ± 0.149
HisCys: 0.434 ± 0.098
HisAsp: 1.028 ± 0.154
HisGlu: 1.051 ± 0.189
HisPhe: 1.074 ± 0.17
HisGly: 1.416 ± 0.204
HisHis: 0.274 ± 0.094
HisIle: 1.142 ± 0.189
HisLys: 1.416 ± 0.156
HisLeu: 1.508 ± 0.187
HisMet: 0.662 ± 0.124
HisAsn: 0.982 ± 0.146
HisPro: 0.8 ± 0.153
HisGln: 0.503 ± 0.105
HisArg: 0.982 ± 0.137
HisSer: 0.959 ± 0.155
HisThr: 0.891 ± 0.153
HisVal: 1.531 ± 0.208
HisTrp: 0.32 ± 0.091
HisTyr: 1.142 ± 0.151
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.637 ± 0.366
IleCys: 1.028 ± 0.162
IleAsp: 3.998 ± 0.29
IleGlu: 4.455 ± 0.338
IlePhe: 2.307 ± 0.201
IleGly: 3.587 ± 0.278
IleHis: 0.959 ± 0.139
IleIle: 3.701 ± 0.351
IleLys: 4.066 ± 0.307
IleLeu: 4.866 ± 0.385
IleMet: 1.645 ± 0.218
IleAsn: 4.089 ± 0.278
IlePro: 2.833 ± 0.273
IleGln: 2.147 ± 0.226
IleArg: 2.604 ± 0.244
IleSer: 3.29 ± 0.261
IleThr: 4.318 ± 0.366
IleVal: 4.409 ± 0.316
IleTrp: 0.548 ± 0.112
IleTyr: 2.376 ± 0.229
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.528 ± 0.366
LysCys: 0.685 ± 0.141
LysAsp: 4.615 ± 0.394
LysGlu: 5.848 ± 0.37
LysPhe: 2.353 ± 0.25
LysGly: 4.752 ± 0.547
LysHis: 1.599 ± 0.206
LysIle: 4.295 ± 0.302
LysLys: 4.386 ± 0.429
LysLeu: 4.843 ± 0.336
LysMet: 2.513 ± 0.27
LysAsn: 3.312 ± 0.296
LysPro: 2.193 ± 0.25
LysGln: 2.741 ± 0.264
LysArg: 2.924 ± 0.265
LysSer: 3.609 ± 0.316
LysThr: 3.746 ± 0.31
LysVal: 4.569 ± 0.358
LysTrp: 0.959 ± 0.151
LysTyr: 2.627 ± 0.268
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.008 ± 0.369
LeuCys: 1.074 ± 0.147
LeuAsp: 5.505 ± 0.334
LeuGlu: 6.396 ± 0.406
LeuPhe: 3.13 ± 0.307
LeuGly: 5.231 ± 0.311
LeuHis: 1.393 ± 0.169
LeuIle: 4.295 ± 0.354
LeuLys: 5.711 ± 0.329
LeuLeu: 5.505 ± 0.376
LeuMet: 2.307 ± 0.215
LeuAsn: 4.226 ± 0.362
LeuPro: 3.244 ± 0.288
LeuGln: 2.399 ± 0.252
LeuArg: 3.29 ± 0.283
LeuSer: 5.483 ± 0.327
LeuThr: 5.117 ± 0.317
LeuVal: 5.094 ± 0.309
LeuTrp: 1.051 ± 0.146
LeuTyr: 3.221 ± 0.233
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.102 ± 0.22
MetCys: 0.411 ± 0.108
MetAsp: 1.508 ± 0.188
MetGlu: 1.759 ± 0.217
MetPhe: 1.119 ± 0.151
MetGly: 1.736 ± 0.199
MetHis: 0.708 ± 0.114
MetIle: 2.01 ± 0.197
MetLys: 1.942 ± 0.189
MetLeu: 2.262 ± 0.218
MetMet: 0.731 ± 0.147
MetAsn: 0.845 ± 0.163
MetPro: 0.731 ± 0.143
MetGln: 1.074 ± 0.175
MetArg: 1.645 ± 0.205
MetSer: 2.262 ± 0.236
MetThr: 1.873 ± 0.211
MetVal: 1.828 ± 0.2
MetTrp: 0.343 ± 0.1
MetTyr: 1.005 ± 0.143
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.472 ± 0.269
AsnCys: 0.662 ± 0.1
AsnAsp: 2.741 ± 0.245
AsnGlu: 2.993 ± 0.255
AsnPhe: 1.965 ± 0.223
AsnGly: 4.272 ± 0.364
AsnHis: 0.891 ± 0.138
AsnIle: 4.226 ± 0.354
AsnLys: 3.906 ± 0.32
AsnLeu: 4.226 ± 0.319
AsnMet: 1.462 ± 0.189
AsnAsn: 3.015 ± 0.317
AsnPro: 2.947 ± 0.261
AsnGln: 1.393 ± 0.179
AsnArg: 1.942 ± 0.19
AsnSer: 3.312 ± 0.288
AsnThr: 3.678 ± 0.303
AsnVal: 3.107 ± 0.253
AsnTrp: 0.8 ± 0.145
AsnTyr: 2.376 ± 0.232
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.079 ± 0.225
ProCys: 0.8 ± 0.144
ProAsp: 2.467 ± 0.237
ProGlu: 3.198 ± 0.27
ProPhe: 1.873 ± 0.248
ProGly: 1.873 ± 0.218
ProHis: 0.685 ± 0.135
ProIle: 1.782 ± 0.19
ProLys: 2.125 ± 0.233
ProLeu: 2.421 ± 0.24
ProMet: 0.617 ± 0.128
ProAsn: 2.353 ± 0.243
ProPro: 1.005 ± 0.144
ProGln: 0.937 ± 0.165
ProArg: 1.325 ± 0.18
ProSer: 2.421 ± 0.208
ProThr: 2.239 ± 0.226
ProVal: 2.696 ± 0.236
ProTrp: 0.548 ± 0.113
ProTyr: 1.782 ± 0.201
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.467 ± 0.243
GlnCys: 0.343 ± 0.078
GlnAsp: 1.805 ± 0.198
GlnGlu: 3.038 ± 0.247
GlnPhe: 1.302 ± 0.178
GlnGly: 1.85 ± 0.256
GlnHis: 0.594 ± 0.122
GlnIle: 2.627 ± 0.206
GlnLys: 1.508 ± 0.209
GlnLeu: 2.147 ± 0.218
GlnMet: 0.959 ± 0.164
GlnAsn: 1.736 ± 0.214
GlnPro: 1.188 ± 0.176
GlnGln: 0.891 ± 0.14
GlnArg: 1.256 ± 0.154
GlnSer: 1.645 ± 0.168
GlnThr: 1.942 ± 0.202
GlnVal: 2.056 ± 0.231
GlnTrp: 0.571 ± 0.136
GlnTyr: 1.393 ± 0.151
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.49 ± 0.183
ArgCys: 0.845 ± 0.15
ArgAsp: 3.015 ± 0.268
ArgGlu: 2.764 ± 0.321
ArgPhe: 1.782 ± 0.19
ArgGly: 2.741 ± 0.242
ArgHis: 0.662 ± 0.114
ArgIle: 2.513 ± 0.263
ArgLys: 3.29 ± 0.277
ArgLeu: 3.335 ± 0.278
ArgMet: 1.416 ± 0.188
ArgAsn: 2.307 ± 0.224
ArgPro: 1.234 ± 0.182
ArgGln: 1.508 ± 0.175
ArgArg: 1.85 ± 0.215
ArgSer: 2.787 ± 0.241
ArgThr: 1.645 ± 0.199
ArgVal: 2.513 ± 0.214
ArgTrp: 0.411 ± 0.097
ArgTyr: 1.439 ± 0.183
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.203 ± 0.332
SerCys: 0.891 ± 0.157
SerAsp: 3.335 ± 0.264
SerGlu: 3.449 ± 0.246
SerPhe: 3.061 ± 0.241
SerGly: 4.203 ± 0.406
SerHis: 1.097 ± 0.177
SerIle: 3.312 ± 0.255
SerLys: 3.952 ± 0.309
SerLeu: 5.368 ± 0.35
SerMet: 1.302 ± 0.153
SerAsn: 3.381 ± 0.28
SerPro: 2.17 ± 0.213
SerGln: 1.828 ± 0.244
SerArg: 2.33 ± 0.219
SerSer: 3.632 ± 0.397
SerThr: 3.472 ± 0.297
SerVal: 4.18 ± 0.364
SerTrp: 0.8 ± 0.129
SerTyr: 2.581 ± 0.257
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.769 ± 0.362
ThrCys: 0.548 ± 0.101
ThrAsp: 2.787 ± 0.257
ThrGlu: 4.112 ± 0.258
ThrPhe: 3.175 ± 0.308
ThrGly: 4.797 ± 0.362
ThrHis: 1.119 ± 0.146
ThrIle: 3.678 ± 0.314
ThrLys: 4.203 ± 0.348
ThrLeu: 5.505 ± 0.393
ThrMet: 1.165 ± 0.146
ThrAsn: 2.741 ± 0.268
ThrPro: 2.993 ± 0.227
ThrGln: 1.69 ± 0.196
ThrArg: 1.919 ± 0.182
ThrSer: 3.29 ± 0.313
ThrThr: 4.021 ± 0.396
ThrVal: 4.455 ± 0.34
ThrTrp: 1.119 ± 0.175
ThrTyr: 2.696 ± 0.234
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.5 ± 0.366
ValCys: 1.211 ± 0.174
ValAsp: 4.683 ± 0.315
ValGlu: 5.46 ± 0.372
ValPhe: 3.175 ± 0.339
ValGly: 4.477 ± 0.373
ValHis: 1.256 ± 0.158
ValIle: 4.409 ± 0.362
ValLys: 4.843 ± 0.386
ValLeu: 5.802 ± 0.398
ValMet: 1.828 ± 0.169
ValAsn: 3.404 ± 0.295
ValPro: 2.102 ± 0.181
ValGln: 1.531 ± 0.186
ValArg: 2.513 ± 0.235
ValSer: 4.18 ± 0.348
ValThr: 4.318 ± 0.289
ValVal: 5.962 ± 0.435
ValTrp: 0.868 ± 0.149
ValTyr: 3.244 ± 0.285
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.868 ± 0.147
TrpCys: 0.206 ± 0.07
TrpAsp: 1.302 ± 0.164
TrpGlu: 1.325 ± 0.192
TrpPhe: 0.982 ± 0.169
TrpGly: 0.822 ± 0.167
TrpHis: 0.388 ± 0.092
TrpIle: 1.119 ± 0.17
TrpLys: 1.051 ± 0.156
TrpLeu: 1.805 ± 0.225
TrpMet: 0.457 ± 0.086
TrpAsn: 0.937 ± 0.169
TrpPro: 0.251 ± 0.075
TrpGln: 0.366 ± 0.103
TrpArg: 0.685 ± 0.139
TrpSer: 0.8 ± 0.159
TrpThr: 0.777 ± 0.117
TrpVal: 0.959 ± 0.139
TrpTrp: 0.411 ± 0.108
TrpTyr: 0.525 ± 0.106
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.13 ± 0.265
TyrCys: 0.731 ± 0.127
TyrAsp: 3.472 ± 0.266
TyrGlu: 2.993 ± 0.261
TyrPhe: 1.782 ± 0.207
TyrGly: 2.741 ± 0.249
TyrHis: 1.028 ± 0.158
TyrIle: 2.444 ± 0.26
TyrLys: 2.399 ± 0.231
TyrLeu: 3.861 ± 0.232
TyrMet: 1.097 ± 0.151
TyrAsn: 2.513 ± 0.257
TyrPro: 1.942 ± 0.23
TyrGln: 1.599 ± 0.202
TyrArg: 1.987 ± 0.19
TyrSer: 3.015 ± 0.281
TyrThr: 2.787 ± 0.225
TyrVal: 2.787 ± 0.219
TyrTrp: 0.731 ± 0.147
TyrTyr: 2.102 ± 0.229
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 256 proteins (43776 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
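
Protein-level bootstrapping can be sketched as below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the estimates is reported. The function names, the toy sequences, the use of the standard deviation as the error measure, and normalisation by the number of pairs are all assumptions of this sketch; the exact procedure used by the database is not described here.

```python
import random
import statistics
from collections import Counter

def dipeptide_per_mille(proteins, pair):
    """Frequency of one dipeptide (per mille) over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, pair))
    return statistics.stdev(estimates)

# Hypothetical toy proteome in one-letter codes (not taken from this proteome)
toy = ["MAAK", "AAG", "MKKA", "GAAL"]
error = bootstrap_error(toy, "AA")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins. The fixed `seed` only makes the sketch reproducible.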

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski