Amino acid dipeptide frequency for Escherichia phage CJ20

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
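As an illustration of the calculation described above, here is a minimal Python sketch that counts overlapping dipeptides in a set of protein sequences and compares them with the uniform expectation. The function name `dipeptide_per_mille`, the one-letter amino acid codes, the toy sequences, and the normalization by the total number of dipeptides are all assumptions made for this example; the page does not state how the database itself computes or normalizes the values.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code (the table uses three-letter codes).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 400 standard dipeptides.

    Assumption: frequencies are normalized by the total number of overlapping
    dipeptides; pairs never span two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of length 2 along each sequence.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found in the input sequences")
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(AMINO_ACIDS, repeat=2)}


# Hypothetical toy sequences, not taken from the phage proteome.
toy = ["MKAAL", "AAK"]
freqs = dipeptide_per_mille(toy)

# Uniform expectation: 1000 per mille spread over 400 dipeptides.
uniform = 1000.0 / len(AMINO_ACIDS) ** 2

print(f"AA: {freqs['AA']:.1f} per mille (uniform expectation {uniform} per mille)")
```

With such tiny input the frequencies are far from the uniform value; the sketch only shows the counting step.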

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
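The unit conversion can be sketched in a few lines of Python. The helper names below are hypothetical, and the sketch assumes that "more than expected" means above the uniform 2.5‰ level mentioned earlier; the page does not spell out the exact threshold used for the coloring. The two sample values (AlaAla and CysTrp) are taken from the table.

```python
# Uniform expectation for 400 dipeptides, in per mille (equals 0.25%).
UNIFORM_PER_MILLE = 1000.0 / 400


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def representation(value, expected=UNIFORM_PER_MILLE):
    """Classify a per mille value relative to the assumed uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


ala_ala = 5.431  # AlaAla, from the table
cys_trp = 0.172  # CysTrp, from the table

for name, value in (("AlaAla", ala_ala), ("CysTrp", cys_trp)):
    print(name, per_mille_to_fraction(value), representation(value))
```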

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± error), grouped by first amino acid. Order within each group: Ala, Cys, Asp, Glu, Phe, Gly, His, Ile, Lys, Leu, Met, Asn, Pro, Gln, Arg, Ser, Thr, Val, Trp, Tyr, Xaa.
Ala
AlaAla: 5.431 ± 0.38
AlaCys: 0.672 ± 0.103
AlaAsp: 3.965 ± 0.266
AlaGlu: 4.862 ± 0.374
AlaPhe: 2.845 ± 0.208
AlaGly: 4.862 ± 0.435
AlaHis: 1.052 ± 0.128
AlaIle: 5.034 ± 0.319
AlaLys: 4.759 ± 0.355
AlaLeu: 6.0 ± 0.349
AlaMet: 1.845 ± 0.166
AlaAsn: 3.103 ± 0.238
AlaPro: 2.879 ± 0.217
AlaGln: 2.552 ± 0.21
AlaArg: 2.983 ± 0.222
AlaSer: 4.81 ± 0.405
AlaThr: 4.259 ± 0.468
AlaVal: 4.103 ± 0.303
AlaTrp: 1.052 ± 0.142
AlaTyr: 2.379 ± 0.198
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.931 ± 0.127
CysCys: 0.276 ± 0.086
CysAsp: 0.724 ± 0.1
CysGlu: 0.862 ± 0.147
CysPhe: 0.759 ± 0.136
CysGly: 0.828 ± 0.14
CysHis: 0.31 ± 0.074
CysIle: 0.759 ± 0.119
CysLys: 0.759 ± 0.131
CysLeu: 0.897 ± 0.13
CysMet: 0.466 ± 0.073
CysAsn: 0.517 ± 0.084
CysPro: 0.466 ± 0.1
CysGln: 0.517 ± 0.091
CysArg: 0.672 ± 0.118
CysSer: 0.931 ± 0.123
CysThr: 0.621 ± 0.108
CysVal: 0.81 ± 0.127
CysTrp: 0.172 ± 0.045
CysTyr: 0.466 ± 0.078
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.965 ± 0.258
AspCys: 0.862 ± 0.113
AspAsp: 3.69 ± 0.307
AspGlu: 4.121 ± 0.283
AspPhe: 2.81 ± 0.223
AspGly: 4.724 ± 0.335
AspHis: 0.759 ± 0.111
AspIle: 4.448 ± 0.27
AspLys: 4.017 ± 0.291
AspLeu: 4.534 ± 0.283
AspMet: 1.828 ± 0.193
AspAsn: 3.155 ± 0.234
AspPro: 1.879 ± 0.211
AspGln: 1.672 ± 0.161
AspArg: 1.793 ± 0.187
AspSer: 3.552 ± 0.244
AspThr: 2.81 ± 0.223
AspVal: 4.103 ± 0.283
AspTrp: 1.103 ± 0.147
AspTyr: 2.948 ± 0.255
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.569 ± 0.385
GluCys: 0.862 ± 0.127
GluAsp: 3.586 ± 0.28
GluGlu: 4.345 ± 0.381
GluPhe: 3.586 ± 0.268
GluGly: 3.224 ± 0.209
GluHis: 1.172 ± 0.151
GluIle: 5.276 ± 0.251
GluLys: 4.069 ± 0.317
GluLeu: 6.0 ± 0.353
GluMet: 2.034 ± 0.172
GluAsn: 3.19 ± 0.237
GluPro: 1.845 ± 0.151
GluGln: 2.379 ± 0.222
GluArg: 2.586 ± 0.214
GluSer: 3.672 ± 0.271
GluThr: 3.793 ± 0.28
GluVal: 4.483 ± 0.266
GluTrp: 0.897 ± 0.125
GluTyr: 3.069 ± 0.267
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.069 ± 0.203
PheCys: 0.638 ± 0.089
PheAsp: 3.034 ± 0.251
PheGlu: 3.172 ± 0.278
PhePhe: 2.138 ± 0.22
PheGly: 3.086 ± 0.225
PheHis: 1.052 ± 0.145
PheIle: 3.724 ± 0.24
PheLys: 3.948 ± 0.331
PheLeu: 2.862 ± 0.236
PheMet: 1.414 ± 0.162
PheAsn: 3.276 ± 0.275
PhePro: 1.345 ± 0.16
PheGln: 1.465 ± 0.167
PheArg: 2.276 ± 0.17
PheSer: 2.793 ± 0.22
PheThr: 2.793 ± 0.266
PheVal: 3.138 ± 0.267
PheTrp: 0.69 ± 0.109
PheTyr: 1.845 ± 0.209
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.517 ± 0.31
GlyCys: 0.741 ± 0.117
GlyAsp: 3.793 ± 0.328
GlyGlu: 3.31 ± 0.244
GlyPhe: 2.931 ± 0.218
GlyGly: 3.672 ± 0.55
GlyHis: 1.0 ± 0.132
GlyIle: 4.241 ± 0.293
GlyLys: 4.0 ± 0.282
GlyLeu: 5.155 ± 0.24
GlyMet: 1.897 ± 0.217
GlyAsn: 3.293 ± 0.45
GlyPro: 2.121 ± 0.203
GlyGln: 2.379 ± 0.228
GlyArg: 2.379 ± 0.198
GlySer: 3.879 ± 0.3
GlyThr: 3.879 ± 0.415
GlyVal: 3.879 ± 0.266
GlyTrp: 1.086 ± 0.166
GlyTyr: 2.586 ± 0.231
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.931 ± 0.114
HisCys: 0.328 ± 0.072
HisAsp: 0.948 ± 0.131
HisGlu: 1.241 ± 0.139
HisPhe: 1.017 ± 0.161
HisGly: 1.069 ± 0.147
HisHis: 0.466 ± 0.09
HisIle: 1.586 ± 0.197
HisLys: 1.259 ± 0.165
HisLeu: 1.345 ± 0.148
HisMet: 0.603 ± 0.085
HisAsn: 1.017 ± 0.146
HisPro: 1.069 ± 0.132
HisGln: 0.724 ± 0.121
HisArg: 0.828 ± 0.115
HisSer: 1.086 ± 0.136
HisThr: 1.034 ± 0.151
HisVal: 1.293 ± 0.139
HisTrp: 0.224 ± 0.068
HisTyr: 0.828 ± 0.121
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.19 ± 0.387
IleCys: 0.948 ± 0.141
IleAsp: 4.379 ± 0.272
IleGlu: 4.724 ± 0.269
IlePhe: 3.0 ± 0.239
IleGly: 3.414 ± 0.277
IleHis: 1.345 ± 0.178
IleIle: 4.793 ± 0.327
IleLys: 6.086 ± 0.349
IleLeu: 4.379 ± 0.293
IleMet: 2.397 ± 0.209
IleAsn: 4.431 ± 0.295
IlePro: 2.81 ± 0.226
IleGln: 2.793 ± 0.196
IleArg: 3.896 ± 0.258
IleSer: 4.724 ± 0.305
IleThr: 4.621 ± 0.303
IleVal: 4.293 ± 0.284
IleTrp: 0.638 ± 0.109
IleTyr: 2.224 ± 0.17
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.293 ± 0.341
LysCys: 0.655 ± 0.101
LysAsp: 4.086 ± 0.307
LysGlu: 5.19 ± 0.328
LysPhe: 3.431 ± 0.283
LysGly: 3.896 ± 0.287
LysHis: 1.655 ± 0.187
LysIle: 4.534 ± 0.275
LysLys: 4.379 ± 0.359
LysLeu: 6.31 ± 0.365
LysMet: 2.569 ± 0.242
LysAsn: 3.948 ± 0.323
LysPro: 2.845 ± 0.234
LysGln: 2.603 ± 0.252
LysArg: 3.19 ± 0.26
LysSer: 4.379 ± 0.297
LysThr: 3.879 ± 0.245
LysVal: 4.931 ± 0.291
LysTrp: 0.966 ± 0.13
LysTyr: 2.81 ± 0.207
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.965 ± 0.291
LeuCys: 0.966 ± 0.141
LeuAsp: 4.776 ± 0.273
LeuGlu: 5.121 ± 0.303
LeuPhe: 3.621 ± 0.243
LeuGly: 3.948 ± 0.264
LeuHis: 1.259 ± 0.141
LeuIle: 5.293 ± 0.266
LeuLys: 6.345 ± 0.378
LeuLeu: 5.81 ± 0.398
LeuMet: 2.414 ± 0.202
LeuAsn: 5.121 ± 0.321
LeuPro: 3.345 ± 0.314
LeuGln: 2.931 ± 0.256
LeuArg: 3.517 ± 0.241
LeuSer: 4.845 ± 0.278
LeuThr: 5.138 ± 0.292
LeuVal: 4.552 ± 0.313
LeuTrp: 0.776 ± 0.118
LeuTyr: 3.276 ± 0.228
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.172 ± 0.186
MetCys: 0.379 ± 0.084
MetAsp: 1.431 ± 0.177
MetGlu: 1.724 ± 0.156
MetPhe: 1.552 ± 0.205
MetGly: 1.603 ± 0.182
MetHis: 0.569 ± 0.1
MetIle: 2.034 ± 0.204
MetLys: 2.31 ± 0.221
MetLeu: 2.465 ± 0.21
MetMet: 0.914 ± 0.117
MetAsn: 1.931 ± 0.184
MetPro: 0.81 ± 0.116
MetGln: 1.034 ± 0.118
MetArg: 1.259 ± 0.143
MetSer: 1.914 ± 0.173
MetThr: 1.879 ± 0.186
MetVal: 2.172 ± 0.191
MetTrp: 0.241 ± 0.066
MetTyr: 1.172 ± 0.161
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.983 ± 0.299
AsnCys: 0.845 ± 0.122
AsnAsp: 2.948 ± 0.297
AsnGlu: 3.517 ± 0.246
AsnPhe: 2.586 ± 0.231
AsnGly: 4.069 ± 0.417
AsnHis: 1.224 ± 0.156
AsnIle: 3.793 ± 0.265
AsnLys: 3.948 ± 0.273
AsnLeu: 4.345 ± 0.292
AsnMet: 1.465 ± 0.183
AsnAsn: 3.086 ± 0.316
AsnPro: 2.31 ± 0.228
AsnGln: 2.207 ± 0.225
AsnArg: 2.328 ± 0.182
AsnSer: 3.465 ± 0.274
AsnThr: 3.172 ± 0.259
AsnVal: 3.31 ± 0.277
AsnTrp: 0.603 ± 0.101
AsnTyr: 2.224 ± 0.188
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.621 ± 0.244
ProCys: 0.483 ± 0.102
ProAsp: 2.569 ± 0.227
ProGlu: 3.172 ± 0.233
ProPhe: 1.759 ± 0.193
ProGly: 2.397 ± 0.2
ProHis: 0.759 ± 0.11
ProIle: 2.103 ± 0.196
ProLys: 2.465 ± 0.191
ProLeu: 2.655 ± 0.213
ProMet: 0.707 ± 0.11
ProAsn: 1.707 ± 0.157
ProPro: 1.172 ± 0.242
ProGln: 0.862 ± 0.127
ProArg: 1.379 ± 0.16
ProSer: 2.448 ± 0.228
ProThr: 2.534 ± 0.21
ProVal: 3.31 ± 0.256
ProTrp: 0.569 ± 0.109
ProTyr: 1.534 ± 0.166
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.621 ± 0.224
GlnCys: 0.31 ± 0.078
GlnAsp: 1.724 ± 0.192
GlnGlu: 2.483 ± 0.214
GlnPhe: 1.69 ± 0.161
GlnGly: 2.155 ± 0.207
GlnHis: 0.776 ± 0.117
GlnIle: 2.897 ± 0.235
GlnLys: 2.5 ± 0.224
GlnLeu: 3.0 ± 0.222
GlnMet: 1.052 ± 0.132
GlnAsn: 1.707 ± 0.172
GlnPro: 1.362 ± 0.142
GlnGln: 1.345 ± 0.169
GlnArg: 1.776 ± 0.186
GlnSer: 2.086 ± 0.178
GlnThr: 2.707 ± 0.215
GlnVal: 2.345 ± 0.191
GlnTrp: 0.707 ± 0.115
GlnTyr: 1.672 ± 0.15
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.983 ± 0.245
ArgCys: 0.655 ± 0.127
ArgAsp: 2.741 ± 0.184
ArgGlu: 2.948 ± 0.252
ArgPhe: 2.034 ± 0.196
ArgGly: 2.276 ± 0.19
ArgHis: 1.052 ± 0.148
ArgIle: 3.276 ± 0.246
ArgLys: 3.121 ± 0.235
ArgLeu: 3.569 ± 0.248
ArgMet: 1.397 ± 0.152
ArgAsn: 2.155 ± 0.205
ArgPro: 1.552 ± 0.202
ArgGln: 1.724 ± 0.166
ArgArg: 2.569 ± 0.242
ArgSer: 2.638 ± 0.224
ArgThr: 2.397 ± 0.19
ArgVal: 2.81 ± 0.186
ArgTrp: 0.586 ± 0.122
ArgTyr: 1.586 ± 0.17
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.828 ± 0.265
SerCys: 0.966 ± 0.15
SerAsp: 3.396 ± 0.284
SerGlu: 3.759 ± 0.273
SerPhe: 3.431 ± 0.253
SerGly: 4.31 ± 0.311
SerHis: 1.19 ± 0.141
SerIle: 4.655 ± 0.296
SerLys: 3.896 ± 0.297
SerLeu: 5.396 ± 0.299
SerMet: 1.879 ± 0.133
SerAsn: 3.362 ± 0.281
SerPro: 2.414 ± 0.224
SerGln: 2.276 ± 0.186
SerArg: 3.241 ± 0.229
SerSer: 4.569 ± 0.36
SerThr: 3.448 ± 0.292
SerVal: 4.241 ± 0.289
SerTrp: 0.724 ± 0.122
SerTyr: 2.638 ± 0.205
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.259 ± 0.287
ThrCys: 0.569 ± 0.093
ThrAsp: 3.19 ± 0.235
ThrGlu: 3.569 ± 0.232
ThrPhe: 3.138 ± 0.252
ThrGly: 4.276 ± 0.344
ThrHis: 0.862 ± 0.122
ThrIle: 4.293 ± 0.307
ThrLys: 4.207 ± 0.287
ThrLeu: 4.603 ± 0.29
ThrMet: 1.328 ± 0.148
ThrAsn: 3.345 ± 0.223
ThrPro: 2.379 ± 0.235
ThrGln: 2.259 ± 0.217
ThrArg: 2.552 ± 0.267
ThrSer: 3.965 ± 0.351
ThrThr: 3.396 ± 0.334
ThrVal: 4.345 ± 0.263
ThrTrp: 0.897 ± 0.13
ThrTyr: 2.172 ± 0.188
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.396 ± 0.276
ValCys: 1.0 ± 0.114
ValAsp: 4.207 ± 0.286
ValGlu: 4.414 ± 0.315
ValPhe: 2.793 ± 0.238
ValGly: 3.259 ± 0.28
ValHis: 1.172 ± 0.144
ValIle: 4.379 ± 0.269
ValLys: 5.345 ± 0.316
ValLeu: 4.741 ± 0.276
ValMet: 1.776 ± 0.184
ValAsn: 3.948 ± 0.253
ValPro: 2.759 ± 0.235
ValGln: 2.914 ± 0.232
ValArg: 2.69 ± 0.19
ValSer: 4.5 ± 0.28
ValThr: 4.0 ± 0.243
ValVal: 4.241 ± 0.273
ValTrp: 0.879 ± 0.135
ValTyr: 2.552 ± 0.19
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.862 ± 0.113
TrpCys: 0.138 ± 0.053
TrpAsp: 0.897 ± 0.118
TrpGlu: 0.828 ± 0.115
TrpPhe: 0.672 ± 0.107
TrpGly: 0.362 ± 0.087
TrpHis: 0.259 ± 0.067
TrpIle: 0.931 ± 0.137
TrpLys: 1.276 ± 0.163
TrpLeu: 1.069 ± 0.144
TrpMet: 0.448 ± 0.096
TrpAsn: 0.793 ± 0.106
TrpPro: 0.431 ± 0.086
TrpGln: 0.517 ± 0.094
TrpArg: 0.483 ± 0.094
TrpSer: 0.776 ± 0.114
TrpThr: 1.052 ± 0.108
TrpVal: 0.931 ± 0.126
TrpTrp: 0.19 ± 0.053
TrpTyr: 0.672 ± 0.1
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.879 ± 0.235
TyrCys: 0.448 ± 0.083
TyrAsp: 2.69 ± 0.22
TyrGlu: 2.276 ± 0.194
TyrPhe: 1.965 ± 0.181
TyrGly: 2.31 ± 0.204
TyrHis: 0.966 ± 0.13
TyrIle: 3.034 ± 0.253
TyrLys: 2.862 ± 0.217
TyrLeu: 2.879 ± 0.217
TyrMet: 1.121 ± 0.154
TyrAsn: 2.345 ± 0.224
TyrPro: 1.414 ± 0.166
TyrGln: 1.759 ± 0.191
TyrArg: 1.672 ± 0.166
TyrSer: 2.586 ± 0.211
TyrThr: 2.19 ± 0.209
TyrVal: 2.672 ± 0.168
TyrTrp: 0.586 ± 0.098
TyrTyr: 1.741 ± 0.193
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 307 proteins (58002 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
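A protein-level bootstrap of this kind could look roughly like the Python sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. The function names, the fixed random seed, and the use of the sample standard deviation as the reported error are assumptions made for illustration; the page only states that 100 bootstrap rounds were run at the protein level.

```python
import random
from collections import Counter
from statistics import stdev


def dipeptide_freq(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide among all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Spread of the frequency across protein-level bootstrap resamples.

    Assumption: the error is the sample standard deviation of the
    resampled estimates.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, dipeptide))
    return stdev(estimates)


# Hypothetical toy proteome, not the phage data.
toy = ["AAAA", "KKKK", "AAKK"]
print(f"AA: {dipeptide_freq(toy, 'AA'):.1f} ± {bootstrap_error(toy, 'AA'):.1f} per mille")
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, so the error reflects variation between proteins.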

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski