Amino acid dipepetide frequency for Escherichia phage JES2013

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.051AlaAla: 6.051 ± 0.516
1.307AlaCys: 1.307 ± 0.189
4.333AlaAsp: 4.333 ± 0.281
4.913AlaGlu: 4.913 ± 0.324
2.687AlaPhe: 2.687 ± 0.234
5.107AlaGly: 5.107 ± 0.443
1.138AlaHis: 1.138 ± 0.129
3.824AlaIle: 3.824 ± 0.381
5.422AlaLys: 5.422 ± 0.417
5.349AlaLeu: 5.349 ± 0.416
2.445AlaMet: 2.445 ± 0.284
3.51AlaAsn: 3.51 ± 0.315
2.106AlaPro: 2.106 ± 0.236
2.13AlaGln: 2.13 ± 0.28
2.953AlaArg: 2.953 ± 0.27
3.364AlaSer: 3.364 ± 0.357
3.873AlaThr: 3.873 ± 0.489
4.55AlaVal: 4.55 ± 0.263
1.259AlaTrp: 1.259 ± 0.177
3.195AlaTyr: 3.195 ± 0.272
0.0AlaXaa: 0.0 ± 0.0
Cys
0.92CysAla: 0.92 ± 0.171
0.411CysCys: 0.411 ± 0.099
1.041CysAsp: 1.041 ± 0.156
0.968CysGlu: 0.968 ± 0.136
0.557CysPhe: 0.557 ± 0.135
1.404CysGly: 1.404 ± 0.177
0.484CysHis: 0.484 ± 0.104
0.799CysIle: 0.799 ± 0.162
1.089CysLys: 1.089 ± 0.179
1.113CysLeu: 1.113 ± 0.204
0.508CysMet: 0.508 ± 0.118
0.847CysAsn: 0.847 ± 0.129
0.799CysPro: 0.799 ± 0.127
0.436CysGln: 0.436 ± 0.112
1.065CysArg: 1.065 ± 0.157
0.823CysSer: 0.823 ± 0.133
0.75CysThr: 0.75 ± 0.151
1.331CysVal: 1.331 ± 0.153
0.315CysTrp: 0.315 ± 0.094
0.678CysTyr: 0.678 ± 0.129
0.0CysXaa: 0.0 ± 0.0
Asp
4.333AspAla: 4.333 ± 0.273
1.21AspCys: 1.21 ± 0.16
4.212AspAsp: 4.212 ± 0.377
4.526AspGlu: 4.526 ± 0.321
3.364AspPhe: 3.364 ± 0.274
4.889AspGly: 4.889 ± 0.367
1.694AspHis: 1.694 ± 0.203
4.042AspIle: 4.042 ± 0.334
4.986AspLys: 4.986 ± 0.326
6.075AspLeu: 6.075 ± 0.367
2.033AspMet: 2.033 ± 0.226
3.389AspAsn: 3.389 ± 0.316
3.413AspPro: 3.413 ± 0.298
1.84AspGln: 1.84 ± 0.211
2.832AspArg: 2.832 ± 0.262
3.752AspSer: 3.752 ± 0.321
3.292AspThr: 3.292 ± 0.256
4.623AspVal: 4.623 ± 0.301
1.234AspTrp: 1.234 ± 0.16
2.783AspTyr: 2.783 ± 0.289
0.0AspXaa: 0.0 ± 0.0
Glu
4.962GluAla: 4.962 ± 0.313
0.75GluCys: 0.75 ± 0.154
5.01GluAsp: 5.01 ± 0.389
6.729GluGlu: 6.729 ± 0.768
2.856GluPhe: 2.856 ± 0.214
5.494GluGly: 5.494 ± 0.361
1.234GluHis: 1.234 ± 0.194
4.599GluIle: 4.599 ± 0.359
4.478GluLys: 4.478 ± 0.463
6.342GluLeu: 6.342 ± 0.445
2.275GluMet: 2.275 ± 0.262
3.219GluAsn: 3.219 ± 0.275
1.912GluPro: 1.912 ± 0.207
2.009GluGln: 2.009 ± 0.249
3.171GluArg: 3.171 ± 0.318
3.389GluSer: 3.389 ± 0.281
3.001GluThr: 3.001 ± 0.292
5.083GluVal: 5.083 ± 0.288
1.259GluTrp: 1.259 ± 0.181
2.929GluTyr: 2.929 ± 0.247
0.0GluXaa: 0.0 ± 0.0
Phe
2.469PheAla: 2.469 ± 0.253
0.581PheCys: 0.581 ± 0.125
3.243PheAsp: 3.243 ± 0.29
3.219PheGlu: 3.219 ± 0.295
1.525PhePhe: 1.525 ± 0.196
3.147PheGly: 3.147 ± 0.272
0.799PheHis: 0.799 ± 0.12
2.566PheIle: 2.566 ± 0.245
3.026PheLys: 3.026 ± 0.209
3.219PheLeu: 3.219 ± 0.283
1.452PheMet: 1.452 ± 0.21
2.372PheAsn: 2.372 ± 0.228
1.428PhePro: 1.428 ± 0.173
1.113PheGln: 1.113 ± 0.174
1.936PheArg: 1.936 ± 0.212
2.566PheSer: 2.566 ± 0.281
2.154PheThr: 2.154 ± 0.215
2.759PheVal: 2.759 ± 0.217
0.678PheTrp: 0.678 ± 0.12
2.227PheTyr: 2.227 ± 0.281
0.0PheXaa: 0.0 ± 0.0
Gly
4.575GlyAla: 4.575 ± 0.385
1.138GlyCys: 1.138 ± 0.177
5.034GlyAsp: 5.034 ± 0.306
4.962GlyGlu: 4.962 ± 0.316
3.413GlyPhe: 3.413 ± 0.337
5.131GlyGly: 5.131 ± 0.521
1.259GlyHis: 1.259 ± 0.186
4.091GlyIle: 4.091 ± 0.342
5.543GlyLys: 5.543 ± 0.359
4.986GlyLeu: 4.986 ± 0.324
1.912GlyMet: 1.912 ± 0.184
3.776GlyAsn: 3.776 ± 0.3
1.38GlyPro: 1.38 ± 0.238
2.009GlyGln: 2.009 ± 0.213
3.316GlyArg: 3.316 ± 0.273
3.582GlySer: 3.582 ± 0.294
4.333GlyThr: 4.333 ± 0.625
4.744GlyVal: 4.744 ± 0.312
1.307GlyTrp: 1.307 ± 0.199
3.8GlyTyr: 3.8 ± 0.293
0.0GlyXaa: 0.0 ± 0.0
His
0.944HisAla: 0.944 ± 0.157
0.605HisCys: 0.605 ± 0.133
1.138HisAsp: 1.138 ± 0.183
0.992HisGlu: 0.992 ± 0.161
0.823HisPhe: 0.823 ± 0.139
1.065HisGly: 1.065 ± 0.152
0.411HisHis: 0.411 ± 0.11
1.162HisIle: 1.162 ± 0.164
1.525HisLys: 1.525 ± 0.176
1.67HisLeu: 1.67 ± 0.185
0.508HisMet: 0.508 ± 0.092
0.992HisAsn: 0.992 ± 0.141
1.138HisPro: 1.138 ± 0.187
0.629HisGln: 0.629 ± 0.11
0.92HisArg: 0.92 ± 0.144
1.162HisSer: 1.162 ± 0.159
1.428HisThr: 1.428 ± 0.202
1.138HisVal: 1.138 ± 0.176
0.315HisTrp: 0.315 ± 0.089
1.162HisTyr: 1.162 ± 0.163
0.0HisXaa: 0.0 ± 0.0
Ile
4.212IleAla: 4.212 ± 0.337
0.944IleCys: 0.944 ± 0.162
4.018IleAsp: 4.018 ± 0.373
3.534IleGlu: 3.534 ± 0.264
2.057IlePhe: 2.057 ± 0.219
3.582IleGly: 3.582 ± 0.309
1.234IleHis: 1.234 ± 0.161
4.091IleIle: 4.091 ± 0.382
3.776IleLys: 3.776 ± 0.315
4.671IleLeu: 4.671 ± 0.435
1.597IleMet: 1.597 ± 0.163
3.05IleAsn: 3.05 ± 0.27
2.808IlePro: 2.808 ± 0.235
1.936IleGln: 1.936 ± 0.231
2.905IleArg: 2.905 ± 0.271
3.558IleSer: 3.558 ± 0.291
3.727IleThr: 3.727 ± 0.27
4.623IleVal: 4.623 ± 0.346
0.75IleTrp: 0.75 ± 0.128
2.372IleTyr: 2.372 ± 0.241
0.0IleXaa: 0.0 ± 0.0
Lys
5.591LysAla: 5.591 ± 0.461
1.331LysCys: 1.331 ± 0.172
4.913LysAsp: 4.913 ± 0.38
5.301LysGlu: 5.301 ± 0.419
2.783LysPhe: 2.783 ± 0.279
4.623LysGly: 4.623 ± 0.33
1.283LysHis: 1.283 ± 0.192
4.744LysIle: 4.744 ± 0.378
4.526LysLys: 4.526 ± 0.437
4.865LysLeu: 4.865 ± 0.307
2.348LysMet: 2.348 ± 0.223
3.195LysAsn: 3.195 ± 0.307
2.396LysPro: 2.396 ± 0.224
2.227LysGln: 2.227 ± 0.241
2.832LysArg: 2.832 ± 0.258
3.437LysSer: 3.437 ± 0.31
3.97LysThr: 3.97 ± 0.321
5.325LysVal: 5.325 ± 0.393
1.017LysTrp: 1.017 ± 0.147
2.638LysTyr: 2.638 ± 0.231
0.0LysXaa: 0.0 ± 0.0
Leu
5.567LeuAla: 5.567 ± 0.357
0.968LeuCys: 0.968 ± 0.153
5.64LeuAsp: 5.64 ± 0.349
6.463LeuGlu: 6.463 ± 0.504
2.735LeuPhe: 2.735 ± 0.258
4.55LeuGly: 4.55 ± 0.297
1.719LeuHis: 1.719 ± 0.186
4.284LeuIle: 4.284 ± 0.33
5.277LeuLys: 5.277 ± 0.398
5.204LeuLeu: 5.204 ± 0.483
2.203LeuMet: 2.203 ± 0.235
4.187LeuAsn: 4.187 ± 0.276
3.97LeuPro: 3.97 ± 0.273
2.88LeuGln: 2.88 ± 0.297
3.413LeuArg: 3.413 ± 0.27
4.429LeuSer: 4.429 ± 0.342
4.429LeuThr: 4.429 ± 0.369
5.131LeuVal: 5.131 ± 0.395
1.501LeuTrp: 1.501 ± 0.204
2.735LeuTyr: 2.735 ± 0.25
0.0LeuXaa: 0.0 ± 0.0
Met
2.445MetAla: 2.445 ± 0.24
0.436MetCys: 0.436 ± 0.109
1.355MetAsp: 1.355 ± 0.176
1.573MetGlu: 1.573 ± 0.217
1.162MetPhe: 1.162 ± 0.176
1.815MetGly: 1.815 ± 0.229
0.484MetHis: 0.484 ± 0.105
1.912MetIle: 1.912 ± 0.185
2.977MetLys: 2.977 ± 0.278
2.372MetLeu: 2.372 ± 0.236
0.847MetMet: 0.847 ± 0.134
1.186MetAsn: 1.186 ± 0.179
0.944MetPro: 0.944 ± 0.148
1.186MetGln: 1.186 ± 0.165
1.113MetArg: 1.113 ± 0.155
2.106MetSer: 2.106 ± 0.218
1.646MetThr: 1.646 ± 0.198
1.646MetVal: 1.646 ± 0.171
0.629MetTrp: 0.629 ± 0.121
0.847MetTyr: 0.847 ± 0.113
0.0MetXaa: 0.0 ± 0.0
Asn
3.703AsnAla: 3.703 ± 0.269
0.726AsnCys: 0.726 ± 0.125
3.026AsnAsp: 3.026 ± 0.221
2.396AsnGlu: 2.396 ± 0.265
2.42AsnPhe: 2.42 ± 0.243
4.502AsnGly: 4.502 ± 0.438
0.92AsnHis: 0.92 ± 0.14
3.122AsnIle: 3.122 ± 0.316
3.389AsnLys: 3.389 ± 0.312
4.066AsnLeu: 4.066 ± 0.292
1.21AsnMet: 1.21 ± 0.156
3.098AsnAsn: 3.098 ± 0.359
2.251AsnPro: 2.251 ± 0.215
1.501AsnGln: 1.501 ± 0.202
2.203AsnArg: 2.203 ± 0.221
2.42AsnSer: 2.42 ± 0.219
3.001AsnThr: 3.001 ± 0.281
2.905AsnVal: 2.905 ± 0.279
0.896AsnTrp: 0.896 ± 0.124
1.646AsnTyr: 1.646 ± 0.202
0.0AsnXaa: 0.0 ± 0.0
Pro
2.082ProAla: 2.082 ± 0.24
0.557ProCys: 0.557 ± 0.121
3.485ProAsp: 3.485 ± 0.302
3.848ProGlu: 3.848 ± 0.3
1.985ProPhe: 1.985 ± 0.223
2.493ProGly: 2.493 ± 0.336
0.678ProHis: 0.678 ± 0.101
1.525ProIle: 1.525 ± 0.192
2.493ProLys: 2.493 ± 0.325
2.759ProLeu: 2.759 ± 0.285
0.871ProMet: 0.871 ± 0.154
1.501ProAsn: 1.501 ± 0.193
1.041ProPro: 1.041 ± 0.164
1.38ProGln: 1.38 ± 0.187
1.549ProArg: 1.549 ± 0.2
2.59ProSer: 2.59 ± 0.253
2.203ProThr: 2.203 ± 0.239
3.026ProVal: 3.026 ± 0.306
0.436ProTrp: 0.436 ± 0.108
1.38ProTyr: 1.38 ± 0.175
0.0ProXaa: 0.0 ± 0.0
Gln
2.541GlnAla: 2.541 ± 0.245
0.266GlnCys: 0.266 ± 0.085
1.936GlnAsp: 1.936 ± 0.229
2.59GlnGlu: 2.59 ± 0.271
1.331GlnPhe: 1.331 ± 0.16
1.791GlnGly: 1.791 ± 0.172
0.484GlnHis: 0.484 ± 0.092
1.84GlnIle: 1.84 ± 0.217
1.912GlnLys: 1.912 ± 0.223
2.711GlnLeu: 2.711 ± 0.22
1.089GlnMet: 1.089 ± 0.154
1.234GlnAsn: 1.234 ± 0.157
1.428GlnPro: 1.428 ± 0.194
1.791GlnGln: 1.791 ± 0.259
1.549GlnArg: 1.549 ± 0.17
1.525GlnSer: 1.525 ± 0.227
1.694GlnThr: 1.694 ± 0.227
2.203GlnVal: 2.203 ± 0.264
0.775GlnTrp: 0.775 ± 0.121
1.452GlnTyr: 1.452 ± 0.216
0.0GlnXaa: 0.0 ± 0.0
Arg
2.711ArgAla: 2.711 ± 0.26
0.944ArgCys: 0.944 ± 0.16
3.195ArgAsp: 3.195 ± 0.237
2.905ArgGlu: 2.905 ± 0.3
1.743ArgPhe: 1.743 ± 0.206
2.759ArgGly: 2.759 ± 0.245
0.75ArgHis: 0.75 ± 0.131
2.711ArgIle: 2.711 ± 0.212
3.413ArgLys: 3.413 ± 0.261
3.703ArgLeu: 3.703 ± 0.298
1.525ArgMet: 1.525 ± 0.171
2.154ArgAsn: 2.154 ± 0.251
1.355ArgPro: 1.355 ± 0.192
1.501ArgGln: 1.501 ± 0.186
2.493ArgArg: 2.493 ± 0.258
2.662ArgSer: 2.662 ± 0.256
2.445ArgThr: 2.445 ± 0.271
3.34ArgVal: 3.34 ± 0.287
0.799ArgTrp: 0.799 ± 0.124
1.961ArgTyr: 1.961 ± 0.209
0.0ArgXaa: 0.0 ± 0.0
Ser
3.945SerAla: 3.945 ± 0.35
1.065SerCys: 1.065 ± 0.152
3.413SerAsp: 3.413 ± 0.302
3.243SerGlu: 3.243 ± 0.293
2.59SerPhe: 2.59 ± 0.322
5.083SerGly: 5.083 ± 0.384
1.259SerHis: 1.259 ± 0.147
3.219SerIle: 3.219 ± 0.269
3.848SerLys: 3.848 ± 0.331
3.51SerLeu: 3.51 ± 0.289
1.452SerMet: 1.452 ± 0.197
2.856SerAsn: 2.856 ± 0.269
2.42SerPro: 2.42 ± 0.258
1.622SerGln: 1.622 ± 0.172
2.469SerArg: 2.469 ± 0.204
3.243SerSer: 3.243 ± 0.311
3.34SerThr: 3.34 ± 0.292
3.897SerVal: 3.897 ± 0.325
0.896SerTrp: 0.896 ± 0.147
1.936SerTyr: 1.936 ± 0.228
0.0SerXaa: 0.0 ± 0.0
Thr
4.042ThrAla: 4.042 ± 0.361
0.896ThrCys: 0.896 ± 0.159
3.195ThrAsp: 3.195 ± 0.283
3.703ThrGlu: 3.703 ± 0.363
2.808ThrPhe: 2.808 ± 0.23
4.454ThrGly: 4.454 ± 0.396
1.283ThrHis: 1.283 ± 0.166
3.679ThrIle: 3.679 ± 0.359
3.316ThrLys: 3.316 ± 0.339
4.817ThrLeu: 4.817 ± 0.437
1.21ThrMet: 1.21 ± 0.166
2.324ThrAsn: 2.324 ± 0.254
2.541ThrPro: 2.541 ± 0.245
1.864ThrGln: 1.864 ± 0.24
2.082ThrArg: 2.082 ± 0.247
3.534ThrSer: 3.534 ± 0.429
3.558ThrThr: 3.558 ± 0.409
4.454ThrVal: 4.454 ± 0.394
0.992ThrTrp: 0.992 ± 0.157
2.324ThrTyr: 2.324 ± 0.29
0.0ThrXaa: 0.0 ± 0.0
Val
4.623ValAla: 4.623 ± 0.343
1.138ValCys: 1.138 ± 0.147
5.978ValAsp: 5.978 ± 0.38
5.083ValGlu: 5.083 ± 0.496
3.34ValPhe: 3.34 ± 0.292
4.478ValGly: 4.478 ± 0.347
1.186ValHis: 1.186 ± 0.208
4.139ValIle: 4.139 ± 0.32
4.696ValLys: 4.696 ± 0.367
4.526ValLeu: 4.526 ± 0.366
1.936ValMet: 1.936 ± 0.216
3.51ValAsn: 3.51 ± 0.309
1.985ValPro: 1.985 ± 0.227
1.912ValGln: 1.912 ± 0.228
3.243ValArg: 3.243 ± 0.226
3.679ValSer: 3.679 ± 0.25
4.429ValThr: 4.429 ± 0.415
6.656ValVal: 6.656 ± 0.534
1.525ValTrp: 1.525 ± 0.184
2.832ValTyr: 2.832 ± 0.226
0.0ValXaa: 0.0 ± 0.0
Trp
1.017TrpAla: 1.017 ± 0.221
0.436TrpCys: 0.436 ± 0.107
1.162TrpAsp: 1.162 ± 0.151
1.694TrpGlu: 1.694 ± 0.197
0.775TrpPhe: 0.775 ± 0.168
0.992TrpGly: 0.992 ± 0.207
0.363TrpHis: 0.363 ± 0.094
1.307TrpIle: 1.307 ± 0.175
1.138TrpLys: 1.138 ± 0.183
1.404TrpLeu: 1.404 ± 0.219
0.387TrpMet: 0.387 ± 0.103
0.871TrpAsn: 0.871 ± 0.149
0.532TrpPro: 0.532 ± 0.096
0.581TrpGln: 0.581 ± 0.12
0.847TrpArg: 0.847 ± 0.139
0.896TrpSer: 0.896 ± 0.137
1.065TrpThr: 1.065 ± 0.142
1.089TrpVal: 1.089 ± 0.177
0.484TrpTrp: 0.484 ± 0.141
0.896TrpTyr: 0.896 ± 0.137
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.783TyrAla: 2.783 ± 0.267
0.629TyrCys: 0.629 ± 0.139
3.195TyrAsp: 3.195 ± 0.287
2.057TyrGlu: 2.057 ± 0.272
1.525TyrPhe: 1.525 ± 0.17
2.929TyrGly: 2.929 ± 0.293
1.065TyrHis: 1.065 ± 0.151
1.815TyrIle: 1.815 ± 0.221
2.517TyrLys: 2.517 ± 0.222
3.921TyrLeu: 3.921 ± 0.307
0.847TyrMet: 0.847 ± 0.132
2.13TyrAsn: 2.13 ± 0.221
2.033TyrPro: 2.033 ± 0.23
1.597TyrGln: 1.597 ± 0.216
2.178TyrArg: 2.178 ± 0.267
2.59TyrSer: 2.59 ± 0.251
2.735TyrThr: 2.735 ± 0.253
2.372TyrVal: 2.372 ± 0.255
0.871TyrTrp: 0.871 ± 0.134
2.178TyrTyr: 2.178 ± 0.223
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 220 proteins (41316 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski