Amino acid dipepetide frequency for Escherichia phage Av-05

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.868AlaAla: 5.868 ± 0.652
0.68AlaCys: 0.68 ± 0.148
3.572AlaAsp: 3.572 ± 0.279
4.45AlaGlu: 4.45 ± 0.391
2.976AlaPhe: 2.976 ± 0.292
4.734AlaGly: 4.734 ± 0.32
0.964AlaHis: 0.964 ± 0.143
4.422AlaIle: 4.422 ± 0.33
4.961AlaLys: 4.961 ± 0.465
5.811AlaLeu: 5.811 ± 0.406
2.523AlaMet: 2.523 ± 0.305
3.345AlaAsn: 3.345 ± 0.343
2.013AlaPro: 2.013 ± 0.245
2.636AlaGln: 2.636 ± 0.253
3.345AlaArg: 3.345 ± 0.418
3.883AlaSer: 3.883 ± 0.375
3.742AlaThr: 3.742 ± 0.424
4.791AlaVal: 4.791 ± 0.357
0.992AlaTrp: 0.992 ± 0.187
3.26AlaTyr: 3.26 ± 0.301
0.0AlaXaa: 0.0 ± 0.0
Cys
0.624CysAla: 0.624 ± 0.141
0.255CysCys: 0.255 ± 0.077
0.879CysAsp: 0.879 ± 0.161
0.85CysGlu: 0.85 ± 0.153
0.68CysPhe: 0.68 ± 0.14
1.049CysGly: 1.049 ± 0.165
0.34CysHis: 0.34 ± 0.099
0.567CysIle: 0.567 ± 0.134
0.935CysLys: 0.935 ± 0.161
1.276CysLeu: 1.276 ± 0.185
0.34CysMet: 0.34 ± 0.093
0.737CysAsn: 0.737 ± 0.141
0.68CysPro: 0.68 ± 0.132
0.312CysGln: 0.312 ± 0.109
0.68CysArg: 0.68 ± 0.138
0.822CysSer: 0.822 ± 0.164
0.652CysThr: 0.652 ± 0.155
0.85CysVal: 0.85 ± 0.183
0.255CysTrp: 0.255 ± 0.082
0.539CysTyr: 0.539 ± 0.141
0.0CysXaa: 0.0 ± 0.0
Asp
4.11AspAla: 4.11 ± 0.333
0.709AspCys: 0.709 ± 0.165
4.252AspAsp: 4.252 ± 0.355
4.11AspGlu: 4.11 ± 0.314
2.948AspPhe: 2.948 ± 0.306
5.017AspGly: 5.017 ± 0.384
1.389AspHis: 1.389 ± 0.201
4.904AspIle: 4.904 ± 0.403
4.252AspLys: 4.252 ± 0.28
5.244AspLeu: 5.244 ± 0.413
1.843AspMet: 1.843 ± 0.221
3.487AspAsn: 3.487 ± 0.301
2.806AspPro: 2.806 ± 0.266
1.672AspGln: 1.672 ± 0.182
1.956AspArg: 1.956 ± 0.218
3.317AspSer: 3.317 ± 0.309
2.806AspThr: 2.806 ± 0.259
5.017AspVal: 5.017 ± 0.46
1.361AspTrp: 1.361 ± 0.225
3.005AspTyr: 3.005 ± 0.297
0.0AspXaa: 0.0 ± 0.0
Glu
4.535GluAla: 4.535 ± 0.396
0.68GluCys: 0.68 ± 0.158
4.961GluAsp: 4.961 ± 0.383
5.414GluGlu: 5.414 ± 0.41
2.693GluPhe: 2.693 ± 0.344
3.146GluGly: 3.146 ± 0.28
0.879GluHis: 0.879 ± 0.16
4.876GluIle: 4.876 ± 0.386
5.471GluLys: 5.471 ± 0.369
5.726GluLeu: 5.726 ± 0.421
2.211GluMet: 2.211 ± 0.251
3.628GluAsn: 3.628 ± 0.291
1.672GluPro: 1.672 ± 0.225
2.409GluGln: 2.409 ± 0.286
3.26GluArg: 3.26 ± 0.305
3.883GluSer: 3.883 ± 0.357
3.118GluThr: 3.118 ± 0.311
3.515GluVal: 3.515 ± 0.292
1.162GluTrp: 1.162 ± 0.2
3.43GluTyr: 3.43 ± 0.323
0.028GluXaa: 0.028 ± 0.028
Phe
2.891PheAla: 2.891 ± 0.356
0.737PheCys: 0.737 ± 0.166
3.005PheAsp: 3.005 ± 0.293
2.296PheGlu: 2.296 ± 0.252
1.531PhePhe: 1.531 ± 0.197
2.948PheGly: 2.948 ± 0.298
0.68PheHis: 0.68 ± 0.124
3.061PheIle: 3.061 ± 0.296
3.146PheLys: 3.146 ± 0.298
3.146PheLeu: 3.146 ± 0.309
1.247PheMet: 1.247 ± 0.204
2.353PheAsn: 2.353 ± 0.259
1.502PhePro: 1.502 ± 0.158
1.191PheGln: 1.191 ± 0.173
1.672PheArg: 1.672 ± 0.21
3.373PheSer: 3.373 ± 0.311
2.551PheThr: 2.551 ± 0.303
2.863PheVal: 2.863 ± 0.286
0.68PheTrp: 0.68 ± 0.156
1.871PheTyr: 1.871 ± 0.229
0.0PheXaa: 0.0 ± 0.0
Gly
3.77GlyAla: 3.77 ± 0.289
0.879GlyCys: 0.879 ± 0.151
3.912GlyAsp: 3.912 ± 0.337
4.592GlyGlu: 4.592 ± 0.358
3.317GlyPhe: 3.317 ± 0.259
4.535GlyGly: 4.535 ± 0.437
1.219GlyHis: 1.219 ± 0.188
4.139GlyIle: 4.139 ± 0.35
6.009GlyLys: 6.009 ± 0.432
4.932GlyLeu: 4.932 ± 0.363
2.239GlyMet: 2.239 ± 0.215
3.94GlyAsn: 3.94 ± 0.356
0.935GlyPro: 0.935 ± 0.144
2.268GlyGln: 2.268 ± 0.257
2.409GlyArg: 2.409 ± 0.297
3.883GlySer: 3.883 ± 0.348
3.77GlyThr: 3.77 ± 0.352
4.337GlyVal: 4.337 ± 0.296
1.587GlyTrp: 1.587 ± 0.249
3.402GlyTyr: 3.402 ± 0.278
0.0GlyXaa: 0.0 ± 0.0
His
1.02HisAla: 1.02 ± 0.204
0.312HisCys: 0.312 ± 0.108
1.049HisAsp: 1.049 ± 0.164
1.191HisGlu: 1.191 ± 0.198
0.737HisPhe: 0.737 ± 0.166
1.077HisGly: 1.077 ± 0.18
0.51HisHis: 0.51 ± 0.105
1.389HisIle: 1.389 ± 0.24
1.757HisLys: 1.757 ± 0.234
1.276HisLeu: 1.276 ± 0.177
0.369HisMet: 0.369 ± 0.099
0.964HisAsn: 0.964 ± 0.174
0.709HisPro: 0.709 ± 0.155
0.737HisGln: 0.737 ± 0.133
0.624HisArg: 0.624 ± 0.137
0.935HisSer: 0.935 ± 0.139
1.304HisThr: 1.304 ± 0.185
0.992HisVal: 0.992 ± 0.185
0.425HisTrp: 0.425 ± 0.114
0.709HisTyr: 0.709 ± 0.156
0.0HisXaa: 0.0 ± 0.0
Ile
4.309IleAla: 4.309 ± 0.35
1.049IleCys: 1.049 ± 0.189
3.515IleAsp: 3.515 ± 0.373
4.337IleGlu: 4.337 ± 0.34
2.863IlePhe: 2.863 ± 0.279
3.883IleGly: 3.883 ± 0.35
1.162IleHis: 1.162 ± 0.174
4.139IleIle: 4.139 ± 0.362
4.904IleLys: 4.904 ± 0.407
5.216IleLeu: 5.216 ± 0.394
1.446IleMet: 1.446 ± 0.177
3.345IleAsn: 3.345 ± 0.349
3.402IlePro: 3.402 ± 0.316
1.757IleGln: 1.757 ± 0.198
3.09IleArg: 3.09 ± 0.37
4.592IleSer: 4.592 ± 0.403
4.252IleThr: 4.252 ± 0.357
4.224IleVal: 4.224 ± 0.392
0.539IleTrp: 0.539 ± 0.16
2.551IleTyr: 2.551 ± 0.209
0.0IleXaa: 0.0 ± 0.0
Lys
6.435LysAla: 6.435 ± 0.608
0.964LysCys: 0.964 ± 0.186
5.357LysAsp: 5.357 ± 0.396
6.633LysGlu: 6.633 ± 0.447
2.693LysPhe: 2.693 ± 0.28
4.989LysGly: 4.989 ± 0.417
1.134LysHis: 1.134 ± 0.188
5.272LysIle: 5.272 ± 0.469
6.123LysLys: 6.123 ± 0.505
5.783LysLeu: 5.783 ± 0.456
2.239LysMet: 2.239 ± 0.266
3.6LysAsn: 3.6 ± 0.333
2.154LysPro: 2.154 ± 0.257
2.721LysGln: 2.721 ± 0.245
3.742LysArg: 3.742 ± 0.39
4.195LysSer: 4.195 ± 0.371
4.564LysThr: 4.564 ± 0.395
4.705LysVal: 4.705 ± 0.342
0.964LysTrp: 0.964 ± 0.161
2.948LysTyr: 2.948 ± 0.304
0.0LysXaa: 0.0 ± 0.0
Leu
5.726LeuAla: 5.726 ± 0.427
1.162LeuCys: 1.162 ± 0.196
5.301LeuAsp: 5.301 ± 0.398
5.669LeuGlu: 5.669 ± 0.408
2.948LeuPhe: 2.948 ± 0.274
4.904LeuGly: 4.904 ± 0.332
1.361LeuHis: 1.361 ± 0.177
4.167LeuIle: 4.167 ± 0.355
6.718LeuLys: 6.718 ± 0.499
5.499LeuLeu: 5.499 ± 0.469
2.126LeuMet: 2.126 ± 0.308
4.11LeuAsn: 4.11 ± 0.329
3.912LeuPro: 3.912 ± 0.299
2.948LeuGln: 2.948 ± 0.286
4.309LeuArg: 4.309 ± 0.346
6.463LeuSer: 6.463 ± 0.433
5.102LeuThr: 5.102 ± 0.378
4.479LeuVal: 4.479 ± 0.322
1.276LeuTrp: 1.276 ± 0.191
3.317LeuTyr: 3.317 ± 0.295
0.0LeuXaa: 0.0 ± 0.0
Met
2.608MetAla: 2.608 ± 0.336
0.454MetCys: 0.454 ± 0.116
1.502MetAsp: 1.502 ± 0.212
1.417MetGlu: 1.417 ± 0.195
1.474MetPhe: 1.474 ± 0.203
1.446MetGly: 1.446 ± 0.19
0.482MetHis: 0.482 ± 0.116
1.871MetIle: 1.871 ± 0.225
2.778MetLys: 2.778 ± 0.282
1.928MetLeu: 1.928 ± 0.203
0.964MetMet: 0.964 ± 0.171
1.276MetAsn: 1.276 ± 0.21
0.964MetPro: 0.964 ± 0.178
0.765MetGln: 0.765 ± 0.163
0.992MetArg: 0.992 ± 0.175
1.644MetSer: 1.644 ± 0.211
1.899MetThr: 1.899 ± 0.242
1.871MetVal: 1.871 ± 0.218
0.397MetTrp: 0.397 ± 0.105
0.935MetTyr: 0.935 ± 0.155
0.0MetXaa: 0.0 ± 0.0
Asn
3.883AsnAla: 3.883 ± 0.37
0.539AsnCys: 0.539 ± 0.148
2.778AsnAsp: 2.778 ± 0.301
2.154AsnGlu: 2.154 ± 0.247
2.211AsnPhe: 2.211 ± 0.27
4.62AsnGly: 4.62 ± 0.371
0.765AsnHis: 0.765 ± 0.146
3.997AsnIle: 3.997 ± 0.337
3.543AsnLys: 3.543 ± 0.294
4.479AsnLeu: 4.479 ± 0.419
1.757AsnMet: 1.757 ± 0.193
2.75AsnAsn: 2.75 ± 0.296
2.438AsnPro: 2.438 ± 0.292
1.474AsnGln: 1.474 ± 0.221
2.154AsnArg: 2.154 ± 0.256
2.863AsnSer: 2.863 ± 0.299
2.721AsnThr: 2.721 ± 0.281
3.118AsnVal: 3.118 ± 0.348
0.624AsnTrp: 0.624 ± 0.149
2.183AsnTyr: 2.183 ± 0.251
0.0AsnXaa: 0.0 ± 0.0
Pro
2.013ProAla: 2.013 ± 0.244
0.34ProCys: 0.34 ± 0.094
2.976ProAsp: 2.976 ± 0.386
3.061ProGlu: 3.061 ± 0.273
1.644ProPhe: 1.644 ± 0.235
1.814ProGly: 1.814 ± 0.255
0.737ProHis: 0.737 ± 0.145
1.871ProIle: 1.871 ± 0.273
2.92ProLys: 2.92 ± 0.309
3.061ProLeu: 3.061 ± 0.325
0.595ProMet: 0.595 ± 0.129
1.616ProAsn: 1.616 ± 0.22
0.992ProPro: 0.992 ± 0.18
1.247ProGln: 1.247 ± 0.18
1.247ProArg: 1.247 ± 0.204
2.806ProSer: 2.806 ± 0.251
2.58ProThr: 2.58 ± 0.3
3.033ProVal: 3.033 ± 0.286
0.283ProTrp: 0.283 ± 0.086
1.332ProTyr: 1.332 ± 0.181
0.0ProXaa: 0.0 ± 0.0
Gln
2.324GlnAla: 2.324 ± 0.268
0.397GlnCys: 0.397 ± 0.105
1.786GlnAsp: 1.786 ± 0.2
2.154GlnGlu: 2.154 ± 0.222
1.644GlnPhe: 1.644 ± 0.203
2.098GlnGly: 2.098 ± 0.239
0.737GlnHis: 0.737 ± 0.148
2.353GlnIle: 2.353 ± 0.27
2.381GlnLys: 2.381 ± 0.294
2.608GlnLeu: 2.608 ± 0.281
0.907GlnMet: 0.907 ± 0.188
1.332GlnAsn: 1.332 ± 0.212
1.304GlnPro: 1.304 ± 0.178
1.899GlnGln: 1.899 ± 0.252
1.644GlnArg: 1.644 ± 0.203
2.069GlnSer: 2.069 ± 0.284
1.956GlnThr: 1.956 ± 0.234
2.551GlnVal: 2.551 ± 0.285
0.482GlnTrp: 0.482 ± 0.126
1.247GlnTyr: 1.247 ± 0.205
0.0GlnXaa: 0.0 ± 0.0
Arg
2.268ArgAla: 2.268 ± 0.304
0.34ArgCys: 0.34 ± 0.1
2.778ArgAsp: 2.778 ± 0.322
3.146ArgGlu: 3.146 ± 0.294
1.701ArgPhe: 1.701 ± 0.238
2.863ArgGly: 2.863 ± 0.278
0.822ArgHis: 0.822 ± 0.156
2.806ArgIle: 2.806 ± 0.28
3.742ArgLys: 3.742 ± 0.39
4.025ArgLeu: 4.025 ± 0.341
1.276ArgMet: 1.276 ± 0.21
1.899ArgAsn: 1.899 ± 0.211
1.361ArgPro: 1.361 ± 0.19
1.531ArgGln: 1.531 ± 0.196
2.381ArgArg: 2.381 ± 0.291
2.268ArgSer: 2.268 ± 0.329
2.211ArgThr: 2.211 ± 0.233
3.203ArgVal: 3.203 ± 0.325
0.85ArgTrp: 0.85 ± 0.151
1.984ArgTyr: 1.984 ± 0.274
0.0ArgXaa: 0.0 ± 0.0
Ser
3.657SerAla: 3.657 ± 0.353
0.822SerCys: 0.822 ± 0.163
3.968SerAsp: 3.968 ± 0.389
3.685SerGlu: 3.685 ± 0.378
2.778SerPhe: 2.778 ± 0.28
5.102SerGly: 5.102 ± 0.407
0.879SerHis: 0.879 ± 0.142
3.373SerIle: 3.373 ± 0.331
4.535SerLys: 4.535 ± 0.433
5.783SerLeu: 5.783 ± 0.43
1.361SerMet: 1.361 ± 0.176
3.402SerAsn: 3.402 ± 0.309
2.58SerPro: 2.58 ± 0.236
2.041SerGln: 2.041 ± 0.237
2.693SerArg: 2.693 ± 0.322
3.6SerSer: 3.6 ± 0.387
3.09SerThr: 3.09 ± 0.341
4.535SerVal: 4.535 ± 0.419
1.304SerTrp: 1.304 ± 0.163
2.381SerTyr: 2.381 ± 0.274
0.0SerXaa: 0.0 ± 0.0
Thr
4.054ThrAla: 4.054 ± 0.384
0.567ThrCys: 0.567 ± 0.132
3.43ThrAsp: 3.43 ± 0.295
3.628ThrGlu: 3.628 ± 0.318
2.665ThrPhe: 2.665 ± 0.305
4.819ThrGly: 4.819 ± 0.447
1.361ThrHis: 1.361 ± 0.222
3.997ThrIle: 3.997 ± 0.354
4.224ThrLys: 4.224 ± 0.355
4.309ThrLeu: 4.309 ± 0.34
1.049ThrMet: 1.049 ± 0.154
2.438ThrAsn: 2.438 ± 0.318
2.58ThrPro: 2.58 ± 0.267
2.013ThrGln: 2.013 ± 0.264
1.928ThrArg: 1.928 ± 0.215
3.203ThrSer: 3.203 ± 0.296
3.515ThrThr: 3.515 ± 0.377
4.337ThrVal: 4.337 ± 0.421
0.964ThrTrp: 0.964 ± 0.147
2.041ThrTyr: 2.041 ± 0.239
0.0ThrXaa: 0.0 ± 0.0
Val
4.734ValAla: 4.734 ± 0.369
1.247ValCys: 1.247 ± 0.181
4.961ValAsp: 4.961 ± 0.403
4.705ValGlu: 4.705 ± 0.389
2.693ValPhe: 2.693 ± 0.264
3.657ValGly: 3.657 ± 0.317
1.162ValHis: 1.162 ± 0.17
3.912ValIle: 3.912 ± 0.365
4.876ValLys: 4.876 ± 0.376
5.839ValLeu: 5.839 ± 0.412
1.672ValMet: 1.672 ± 0.244
3.713ValAsn: 3.713 ± 0.475
2.438ValPro: 2.438 ± 0.289
2.268ValGln: 2.268 ± 0.255
2.58ValArg: 2.58 ± 0.252
4.592ValSer: 4.592 ± 0.332
3.968ValThr: 3.968 ± 0.456
4.791ValVal: 4.791 ± 0.443
1.304ValTrp: 1.304 ± 0.214
2.409ValTyr: 2.409 ± 0.27
0.0ValXaa: 0.0 ± 0.0
Trp
1.162TrpAla: 1.162 ± 0.207
0.283TrpCys: 0.283 ± 0.086
1.049TrpAsp: 1.049 ± 0.176
1.162TrpGlu: 1.162 ± 0.182
0.794TrpPhe: 0.794 ± 0.143
0.794TrpGly: 0.794 ± 0.147
0.255TrpHis: 0.255 ± 0.096
0.794TrpIle: 0.794 ± 0.172
1.134TrpLys: 1.134 ± 0.201
1.814TrpLeu: 1.814 ± 0.265
0.595TrpMet: 0.595 ± 0.115
0.794TrpAsn: 0.794 ± 0.16
0.312TrpPro: 0.312 ± 0.083
0.539TrpGln: 0.539 ± 0.129
0.85TrpArg: 0.85 ± 0.181
0.794TrpSer: 0.794 ± 0.178
0.85TrpThr: 0.85 ± 0.14
1.389TrpVal: 1.389 ± 0.213
0.51TrpTrp: 0.51 ± 0.123
0.794TrpTyr: 0.794 ± 0.119
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.976TyrAla: 2.976 ± 0.279
0.907TyrCys: 0.907 ± 0.159
3.175TyrAsp: 3.175 ± 0.333
2.041TyrGlu: 2.041 ± 0.256
1.757TyrPhe: 1.757 ± 0.232
2.58TyrGly: 2.58 ± 0.305
1.247TyrHis: 1.247 ± 0.2
2.665TyrIle: 2.665 ± 0.346
2.806TyrLys: 2.806 ± 0.279
3.742TyrLeu: 3.742 ± 0.335
0.879TyrMet: 0.879 ± 0.152
2.324TyrAsn: 2.324 ± 0.239
1.417TyrPro: 1.417 ± 0.178
1.446TyrGln: 1.446 ± 0.177
1.899TyrArg: 1.899 ± 0.238
2.381TyrSer: 2.381 ± 0.261
2.466TyrThr: 2.466 ± 0.265
2.92TyrVal: 2.92 ± 0.283
0.709TyrTrp: 0.709 ± 0.128
1.162TyrTyr: 1.162 ± 0.175
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.028XaaGlu: 0.028 ± 0.028
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 209 proteins (35279 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski