Amino acid dipepetide frequency for Aeromonas phage phiA8-29

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.572AlaAla: 5.572 ± 0.576
0.575AlaCys: 0.575 ± 0.122
3.626AlaAsp: 3.626 ± 0.308
4.378AlaGlu: 4.378 ± 0.381
2.786AlaPhe: 2.786 ± 0.252
4.334AlaGly: 4.334 ± 0.376
1.194AlaHis: 1.194 ± 0.14
3.648AlaIle: 3.648 ± 0.254
4.909AlaLys: 4.909 ± 0.38
6.147AlaLeu: 6.147 ± 0.424
1.68AlaMet: 1.68 ± 0.164
2.322AlaAsn: 2.322 ± 0.287
2.454AlaPro: 2.454 ± 0.236
2.675AlaGln: 2.675 ± 0.209
3.029AlaArg: 3.029 ± 0.237
4.312AlaSer: 4.312 ± 0.297
4.489AlaThr: 4.489 ± 0.384
5.152AlaVal: 5.152 ± 0.371
0.973AlaTrp: 0.973 ± 0.179
2.167AlaTyr: 2.167 ± 0.202
0.0AlaXaa: 0.0 ± 0.0
Cys
0.597CysAla: 0.597 ± 0.113
0.332CysCys: 0.332 ± 0.104
0.84CysAsp: 0.84 ± 0.139
0.907CysGlu: 0.907 ± 0.15
0.531CysPhe: 0.531 ± 0.111
0.73CysGly: 0.73 ± 0.137
0.243CysHis: 0.243 ± 0.078
0.509CysIle: 0.509 ± 0.096
0.641CysLys: 0.641 ± 0.124
0.907CysLeu: 0.907 ± 0.129
0.221CysMet: 0.221 ± 0.064
0.73CysAsn: 0.73 ± 0.15
0.641CysPro: 0.641 ± 0.11
0.31CysGln: 0.31 ± 0.083
0.752CysArg: 0.752 ± 0.125
1.172CysSer: 1.172 ± 0.178
0.509CysThr: 0.509 ± 0.094
0.973CysVal: 0.973 ± 0.143
0.265CysTrp: 0.265 ± 0.076
0.376CysTyr: 0.376 ± 0.077
0.0CysXaa: 0.0 ± 0.0
Asp
3.648AspAla: 3.648 ± 0.28
0.619AspCys: 0.619 ± 0.128
3.118AspAsp: 3.118 ± 0.284
4.444AspGlu: 4.444 ± 0.434
3.25AspPhe: 3.25 ± 0.277
4.002AspGly: 4.002 ± 0.274
0.752AspHis: 0.752 ± 0.122
4.201AspIle: 4.201 ± 0.326
3.361AspLys: 3.361 ± 0.289
5.484AspLeu: 5.484 ± 0.341
1.614AspMet: 1.614 ± 0.206
2.322AspAsn: 2.322 ± 0.219
2.786AspPro: 2.786 ± 0.206
2.123AspGln: 2.123 ± 0.24
2.698AspArg: 2.698 ± 0.238
4.489AspSer: 4.489 ± 0.36
3.029AspThr: 3.029 ± 0.231
3.626AspVal: 3.626 ± 0.271
1.15AspTrp: 1.15 ± 0.169
2.211AspTyr: 2.211 ± 0.207
0.0AspXaa: 0.0 ± 0.0
Glu
4.909GluAla: 4.909 ± 0.379
0.752GluCys: 0.752 ± 0.127
4.179GluAsp: 4.179 ± 0.341
5.904GluGlu: 5.904 ± 0.499
3.737GluPhe: 3.737 ± 0.304
5.24GluGly: 5.24 ± 0.309
1.924GluHis: 1.924 ± 0.204
5.086GluIle: 5.086 ± 0.323
4.444GluLys: 4.444 ± 0.38
7.12GluLeu: 7.12 ± 0.423
2.631GluMet: 2.631 ± 0.247
3.096GluAsn: 3.096 ± 0.226
2.366GluPro: 2.366 ± 0.233
2.631GluGln: 2.631 ± 0.281
3.737GluArg: 3.737 ± 0.279
4.201GluSer: 4.201 ± 0.306
3.892GluThr: 3.892 ± 0.291
5.24GluVal: 5.24 ± 0.331
1.15GluTrp: 1.15 ± 0.164
2.388GluTyr: 2.388 ± 0.208
0.0GluXaa: 0.0 ± 0.0
Phe
2.543PheAla: 2.543 ± 0.226
0.752PheCys: 0.752 ± 0.116
3.051PheAsp: 3.051 ± 0.265
3.317PheGlu: 3.317 ± 0.244
1.99PhePhe: 1.99 ± 0.193
2.963PheGly: 2.963 ± 0.217
1.194PheHis: 1.194 ± 0.184
2.764PheIle: 2.764 ± 0.24
4.113PheLys: 4.113 ± 0.296
3.14PheLeu: 3.14 ± 0.236
1.548PheMet: 1.548 ± 0.212
2.101PheAsn: 2.101 ± 0.24
1.548PhePro: 1.548 ± 0.202
1.725PheGln: 1.725 ± 0.171
2.277PheArg: 2.277 ± 0.244
3.405PheSer: 3.405 ± 0.293
2.852PheThr: 2.852 ± 0.272
3.162PheVal: 3.162 ± 0.27
0.774PheTrp: 0.774 ± 0.136
1.57PheTyr: 1.57 ± 0.173
0.0PheXaa: 0.0 ± 0.0
Gly
4.842GlyAla: 4.842 ± 0.351
0.995GlyCys: 0.995 ± 0.172
4.002GlyAsp: 4.002 ± 0.257
4.732GlyGlu: 4.732 ± 0.323
3.184GlyPhe: 3.184 ± 0.257
4.997GlyGly: 4.997 ± 0.359
0.995GlyHis: 0.995 ± 0.151
3.25GlyIle: 3.25 ± 0.29
4.798GlyLys: 4.798 ± 0.376
5.528GlyLeu: 5.528 ± 0.347
1.548GlyMet: 1.548 ± 0.166
2.653GlyAsn: 2.653 ± 0.248
1.791GlyPro: 1.791 ± 0.206
2.344GlyGln: 2.344 ± 0.247
3.295GlyArg: 3.295 ± 0.285
5.461GlySer: 5.461 ± 0.33
3.184GlyThr: 3.184 ± 0.365
6.633GlyVal: 6.633 ± 0.457
1.15GlyTrp: 1.15 ± 0.146
2.123GlyTyr: 2.123 ± 0.204
0.0GlyXaa: 0.0 ± 0.0
His
0.929HisAla: 0.929 ± 0.132
0.177HisCys: 0.177 ± 0.057
1.106HisAsp: 1.106 ± 0.18
1.282HisGlu: 1.282 ± 0.142
0.884HisPhe: 0.884 ± 0.154
1.194HisGly: 1.194 ± 0.148
0.531HisHis: 0.531 ± 0.103
1.194HisIle: 1.194 ± 0.167
0.973HisLys: 0.973 ± 0.136
1.459HisLeu: 1.459 ± 0.224
0.597HisMet: 0.597 ± 0.109
0.597HisAsn: 0.597 ± 0.104
0.929HisPro: 0.929 ± 0.147
0.73HisGln: 0.73 ± 0.115
0.995HisArg: 0.995 ± 0.162
1.017HisSer: 1.017 ± 0.138
1.061HisThr: 1.061 ± 0.169
1.172HisVal: 1.172 ± 0.193
0.221HisTrp: 0.221 ± 0.074
0.708HisTyr: 0.708 ± 0.127
0.0HisXaa: 0.0 ± 0.0
Ile
3.516IleAla: 3.516 ± 0.288
0.685IleCys: 0.685 ± 0.142
3.67IleAsp: 3.67 ± 0.259
4.312IleGlu: 4.312 ± 0.344
2.078IlePhe: 2.078 ± 0.248
3.715IleGly: 3.715 ± 0.267
1.128IleHis: 1.128 ± 0.155
2.786IleIle: 2.786 ± 0.234
4.046IleLys: 4.046 ± 0.308
4.82IleLeu: 4.82 ± 0.352
1.592IleMet: 1.592 ± 0.183
3.272IleAsn: 3.272 ± 0.299
2.675IlePro: 2.675 ± 0.25
2.322IleGln: 2.322 ± 0.181
2.985IleArg: 2.985 ± 0.235
4.599IleSer: 4.599 ± 0.302
3.228IleThr: 3.228 ± 0.28
3.516IleVal: 3.516 ± 0.263
0.663IleTrp: 0.663 ± 0.126
1.282IleTyr: 1.282 ± 0.14
0.0IleXaa: 0.0 ± 0.0
Lys
4.621LysAla: 4.621 ± 0.361
0.685LysCys: 0.685 ± 0.13
4.201LysAsp: 4.201 ± 0.351
5.859LysGlu: 5.859 ± 0.416
3.007LysPhe: 3.007 ± 0.215
4.378LysGly: 4.378 ± 0.359
0.951LysHis: 0.951 ± 0.153
4.533LysIle: 4.533 ± 0.309
4.864LysLys: 4.864 ± 0.428
6.081LysLeu: 6.081 ± 0.382
2.3LysMet: 2.3 ± 0.237
3.14LysAsn: 3.14 ± 0.245
2.366LysPro: 2.366 ± 0.257
2.454LysGln: 2.454 ± 0.249
4.046LysArg: 4.046 ± 0.339
3.914LysSer: 3.914 ± 0.266
4.179LysThr: 4.179 ± 0.238
5.086LysVal: 5.086 ± 0.362
1.106LysTrp: 1.106 ± 0.154
2.476LysTyr: 2.476 ± 0.262
0.0LysXaa: 0.0 ± 0.0
Leu
5.285LeuAla: 5.285 ± 0.327
1.061LeuCys: 1.061 ± 0.178
4.754LeuAsp: 4.754 ± 0.287
6.213LeuGlu: 6.213 ± 0.383
3.869LeuPhe: 3.869 ± 0.249
5.484LeuGly: 5.484 ± 0.37
1.194LeuHis: 1.194 ± 0.164
4.665LeuIle: 4.665 ± 0.269
7.628LeuLys: 7.628 ± 0.453
5.727LeuLeu: 5.727 ± 0.395
2.211LeuMet: 2.211 ± 0.228
4.223LeuAsn: 4.223 ± 0.294
3.648LeuPro: 3.648 ± 0.305
3.184LeuGln: 3.184 ± 0.3
4.356LeuArg: 4.356 ± 0.275
5.97LeuSer: 5.97 ± 0.356
4.798LeuThr: 4.798 ± 0.348
5.749LeuVal: 5.749 ± 0.385
1.106LeuTrp: 1.106 ± 0.186
2.698LeuTyr: 2.698 ± 0.221
0.0LeuXaa: 0.0 ± 0.0
Met
2.123MetAla: 2.123 ± 0.252
0.464MetCys: 0.464 ± 0.113
1.437MetAsp: 1.437 ± 0.145
1.924MetGlu: 1.924 ± 0.2
1.415MetPhe: 1.415 ± 0.167
1.349MetGly: 1.349 ± 0.163
0.332MetHis: 0.332 ± 0.098
1.172MetIle: 1.172 ± 0.158
2.476MetLys: 2.476 ± 0.272
1.946MetLeu: 1.946 ± 0.226
0.597MetMet: 0.597 ± 0.097
1.658MetAsn: 1.658 ± 0.192
1.039MetPro: 1.039 ± 0.171
0.862MetGln: 0.862 ± 0.159
1.437MetArg: 1.437 ± 0.166
2.587MetSer: 2.587 ± 0.221
1.57MetThr: 1.57 ± 0.196
1.592MetVal: 1.592 ± 0.237
0.177MetTrp: 0.177 ± 0.067
0.752MetTyr: 0.752 ± 0.14
0.0MetXaa: 0.0 ± 0.0
Asn
2.985AsnAla: 2.985 ± 0.248
0.685AsnCys: 0.685 ± 0.145
2.189AsnAsp: 2.189 ± 0.21
3.096AsnGlu: 3.096 ± 0.328
2.432AsnPhe: 2.432 ± 0.25
3.98AsnGly: 3.98 ± 0.292
0.818AsnHis: 0.818 ± 0.133
2.808AsnIle: 2.808 ± 0.276
2.587AsnLys: 2.587 ± 0.2
3.516AsnLeu: 3.516 ± 0.247
1.238AsnMet: 1.238 ± 0.164
1.968AsnAsn: 1.968 ± 0.25
2.432AsnPro: 2.432 ± 0.261
1.791AsnGln: 1.791 ± 0.216
2.366AsnArg: 2.366 ± 0.237
3.073AsnSer: 3.073 ± 0.306
2.388AsnThr: 2.388 ± 0.24
3.14AsnVal: 3.14 ± 0.243
0.752AsnTrp: 0.752 ± 0.116
1.592AsnTyr: 1.592 ± 0.195
0.0AsnXaa: 0.0 ± 0.0
Pro
2.432ProAla: 2.432 ± 0.219
0.442ProCys: 0.442 ± 0.097
2.211ProAsp: 2.211 ± 0.234
3.14ProGlu: 3.14 ± 0.317
1.769ProPhe: 1.769 ± 0.212
2.322ProGly: 2.322 ± 0.223
0.663ProHis: 0.663 ± 0.105
1.636ProIle: 1.636 ± 0.174
2.808ProLys: 2.808 ± 0.222
3.14ProLeu: 3.14 ± 0.225
0.907ProMet: 0.907 ± 0.126
1.857ProAsn: 1.857 ± 0.183
1.061ProPro: 1.061 ± 0.157
1.26ProGln: 1.26 ± 0.164
1.857ProArg: 1.857 ± 0.228
3.449ProSer: 3.449 ± 0.271
1.968ProThr: 1.968 ± 0.215
3.516ProVal: 3.516 ± 0.267
0.774ProTrp: 0.774 ± 0.136
1.371ProTyr: 1.371 ± 0.157
0.0ProXaa: 0.0 ± 0.0
Gln
2.963GlnAla: 2.963 ± 0.289
0.287GlnCys: 0.287 ± 0.081
1.924GlnAsp: 1.924 ± 0.21
3.14GlnGlu: 3.14 ± 0.275
1.57GlnPhe: 1.57 ± 0.165
1.99GlnGly: 1.99 ± 0.203
0.464GlnHis: 0.464 ± 0.103
2.322GlnIle: 2.322 ± 0.203
2.742GlnLys: 2.742 ± 0.349
3.405GlnLeu: 3.405 ± 0.24
0.973GlnMet: 0.973 ± 0.142
1.769GlnAsn: 1.769 ± 0.186
1.305GlnPro: 1.305 ± 0.171
1.437GlnGln: 1.437 ± 0.176
1.636GlnArg: 1.636 ± 0.192
1.902GlnSer: 1.902 ± 0.225
2.543GlnThr: 2.543 ± 0.261
2.432GlnVal: 2.432 ± 0.248
0.376GlnTrp: 0.376 ± 0.09
1.349GlnTyr: 1.349 ± 0.213
0.0GlnXaa: 0.0 ± 0.0
Arg
3.073ArgAla: 3.073 ± 0.269
0.862ArgCys: 0.862 ± 0.127
2.985ArgAsp: 2.985 ± 0.239
3.825ArgGlu: 3.825 ± 0.313
2.985ArgPhe: 2.985 ± 0.237
3.494ArgGly: 3.494 ± 0.266
0.84ArgHis: 0.84 ± 0.141
2.985ArgIle: 2.985 ± 0.288
3.339ArgLys: 3.339 ± 0.271
5.13ArgLeu: 5.13 ± 0.374
1.305ArgMet: 1.305 ± 0.14
2.432ArgAsn: 2.432 ± 0.225
1.305ArgPro: 1.305 ± 0.168
1.592ArgGln: 1.592 ± 0.173
2.764ArgArg: 2.764 ± 0.253
2.83ArgSer: 2.83 ± 0.275
2.078ArgThr: 2.078 ± 0.195
3.825ArgVal: 3.825 ± 0.252
0.752ArgTrp: 0.752 ± 0.143
1.946ArgTyr: 1.946 ± 0.205
0.0ArgXaa: 0.0 ± 0.0
Ser
4.113SerAla: 4.113 ± 0.338
0.708SerCys: 0.708 ± 0.148
4.024SerAsp: 4.024 ± 0.262
5.285SerGlu: 5.285 ± 0.375
3.56SerPhe: 3.56 ± 0.257
4.776SerGly: 4.776 ± 0.384
1.26SerHis: 1.26 ± 0.173
4.135SerIle: 4.135 ± 0.306
5.108SerLys: 5.108 ± 0.409
5.66SerLeu: 5.66 ± 0.329
1.548SerMet: 1.548 ± 0.191
3.184SerAsn: 3.184 ± 0.312
3.007SerPro: 3.007 ± 0.293
2.587SerGln: 2.587 ± 0.254
3.295SerArg: 3.295 ± 0.27
5.019SerSer: 5.019 ± 0.348
3.892SerThr: 3.892 ± 0.346
4.798SerVal: 4.798 ± 0.345
1.106SerTrp: 1.106 ± 0.164
2.322SerTyr: 2.322 ± 0.237
0.0SerXaa: 0.0 ± 0.0
Thr
4.068ThrAla: 4.068 ± 0.34
0.486ThrCys: 0.486 ± 0.112
3.051ThrAsp: 3.051 ± 0.327
4.201ThrGlu: 4.201 ± 0.324
2.587ThrPhe: 2.587 ± 0.273
4.29ThrGly: 4.29 ± 0.361
0.862ThrHis: 0.862 ± 0.123
3.14ThrIle: 3.14 ± 0.256
3.56ThrLys: 3.56 ± 0.299
4.864ThrLeu: 4.864 ± 0.352
1.327ThrMet: 1.327 ± 0.177
2.609ThrAsn: 2.609 ± 0.216
2.985ThrPro: 2.985 ± 0.235
1.902ThrGln: 1.902 ± 0.2
2.499ThrArg: 2.499 ± 0.24
3.494ThrSer: 3.494 ± 0.359
3.295ThrThr: 3.295 ± 0.406
4.776ThrVal: 4.776 ± 0.393
0.973ThrTrp: 0.973 ± 0.123
1.548ThrTyr: 1.548 ± 0.172
0.0ThrXaa: 0.0 ± 0.0
Val
4.842ValAla: 4.842 ± 0.327
0.973ValCys: 0.973 ± 0.163
5.24ValAsp: 5.24 ± 0.399
5.528ValGlu: 5.528 ± 0.365
3.029ValPhe: 3.029 ± 0.265
4.334ValGly: 4.334 ± 0.269
1.57ValHis: 1.57 ± 0.177
4.113ValIle: 4.113 ± 0.306
5.174ValLys: 5.174 ± 0.347
5.506ValLeu: 5.506 ± 0.344
2.012ValMet: 2.012 ± 0.19
3.604ValAsn: 3.604 ± 0.333
2.432ValPro: 2.432 ± 0.304
2.631ValGln: 2.631 ± 0.249
3.317ValArg: 3.317 ± 0.261
5.506ValSer: 5.506 ± 0.357
4.975ValThr: 4.975 ± 0.488
5.13ValVal: 5.13 ± 0.303
0.862ValTrp: 0.862 ± 0.132
2.3ValTyr: 2.3 ± 0.184
0.0ValXaa: 0.0 ± 0.0
Trp
1.194TrpAla: 1.194 ± 0.177
0.265TrpCys: 0.265 ± 0.075
1.061TrpAsp: 1.061 ± 0.174
1.061TrpGlu: 1.061 ± 0.149
0.862TrpPhe: 0.862 ± 0.131
0.995TrpGly: 0.995 ± 0.164
0.199TrpHis: 0.199 ± 0.062
0.663TrpIle: 0.663 ± 0.1
1.216TrpLys: 1.216 ± 0.147
1.481TrpLeu: 1.481 ± 0.175
0.332TrpMet: 0.332 ± 0.086
0.73TrpAsn: 0.73 ± 0.144
0.464TrpPro: 0.464 ± 0.098
0.376TrpGln: 0.376 ± 0.098
0.818TrpArg: 0.818 ± 0.147
0.84TrpSer: 0.84 ± 0.124
0.796TrpThr: 0.796 ± 0.125
1.061TrpVal: 1.061 ± 0.165
0.265TrpTrp: 0.265 ± 0.08
0.442TrpTyr: 0.442 ± 0.095
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.167TyrAla: 2.167 ± 0.179
0.376TyrCys: 0.376 ± 0.094
2.255TyrAsp: 2.255 ± 0.281
2.078TyrGlu: 2.078 ± 0.251
1.282TyrPhe: 1.282 ± 0.153
2.941TyrGly: 2.941 ± 0.265
0.641TyrHis: 0.641 ± 0.109
1.282TyrIle: 1.282 ± 0.176
1.459TyrLys: 1.459 ± 0.198
2.764TyrLeu: 2.764 ± 0.236
0.708TyrMet: 0.708 ± 0.127
1.57TyrAsn: 1.57 ± 0.177
1.349TyrPro: 1.349 ± 0.172
1.592TyrGln: 1.592 ± 0.197
2.189TyrArg: 2.189 ± 0.248
2.145TyrSer: 2.145 ± 0.211
1.791TyrThr: 1.791 ± 0.186
2.499TyrVal: 2.499 ± 0.289
0.531TyrTrp: 0.531 ± 0.116
0.907TyrTyr: 0.907 ± 0.151
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 185 proteins (45227 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski