Amino acid dipepetide frequency for Erwinia phage PhiEaH1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.742AlaAla: 6.742 ± 0.372
0.553AlaCys: 0.553 ± 0.091
4.805AlaAsp: 4.805 ± 0.311
4.034AlaGlu: 4.034 ± 0.273
2.752AlaPhe: 2.752 ± 0.224
5.577AlaGly: 5.577 ± 0.344
1.558AlaHis: 1.558 ± 0.175
4.281AlaIle: 4.281 ± 0.252
4.368AlaLys: 4.368 ± 0.278
7.004AlaLeu: 7.004 ± 0.327
2.228AlaMet: 2.228 ± 0.192
3.917AlaAsn: 3.917 ± 0.287
2.752AlaPro: 2.752 ± 0.193
3.204AlaGln: 3.204 ± 0.246
3.902AlaArg: 3.902 ± 0.246
4.048AlaSer: 4.048 ± 0.263
4.674AlaThr: 4.674 ± 0.257
5.795AlaVal: 5.795 ± 0.278
0.976AlaTrp: 0.976 ± 0.146
2.708AlaTyr: 2.708 ± 0.245
0.0AlaXaa: 0.0 ± 0.0
Cys
0.67CysAla: 0.67 ± 0.111
0.102CysCys: 0.102 ± 0.031
0.437CysAsp: 0.437 ± 0.079
0.597CysGlu: 0.597 ± 0.098
0.291CysPhe: 0.291 ± 0.069
0.684CysGly: 0.684 ± 0.112
0.233CysHis: 0.233 ± 0.066
0.451CysIle: 0.451 ± 0.077
0.422CysLys: 0.422 ± 0.075
0.539CysLeu: 0.539 ± 0.082
0.233CysMet: 0.233 ± 0.056
0.306CysAsn: 0.306 ± 0.065
0.335CysPro: 0.335 ± 0.072
0.262CysGln: 0.262 ± 0.064
0.582CysArg: 0.582 ± 0.112
0.306CysSer: 0.306 ± 0.059
0.524CysThr: 0.524 ± 0.095
0.553CysVal: 0.553 ± 0.08
0.175CysTrp: 0.175 ± 0.049
0.422CysTyr: 0.422 ± 0.081
0.0CysXaa: 0.0 ± 0.0
Asp
4.965AspAla: 4.965 ± 0.296
0.524AspCys: 0.524 ± 0.077
4.135AspAsp: 4.135 ± 0.255
3.597AspGlu: 3.597 ± 0.248
3.014AspPhe: 3.014 ± 0.264
5.024AspGly: 5.024 ± 0.351
1.005AspHis: 1.005 ± 0.125
3.655AspIle: 3.655 ± 0.211
3.393AspLys: 3.393 ± 0.317
5.271AspLeu: 5.271 ± 0.277
1.471AspMet: 1.471 ± 0.134
2.548AspAsn: 2.548 ± 0.205
3.043AspPro: 3.043 ± 0.204
1.922AspGln: 1.922 ± 0.158
3.713AspArg: 3.713 ± 0.311
3.378AspSer: 3.378 ± 0.229
4.368AspThr: 4.368 ± 0.273
4.747AspVal: 4.747 ± 0.24
0.961AspTrp: 0.961 ± 0.104
2.505AspTyr: 2.505 ± 0.215
0.0AspXaa: 0.0 ± 0.0
Glu
4.864GluAla: 4.864 ± 0.315
0.451GluCys: 0.451 ± 0.092
3.393GluAsp: 3.393 ± 0.272
5.213GluGlu: 5.213 ± 0.381
2.971GluPhe: 2.971 ± 0.21
3.64GluGly: 3.64 ± 0.251
1.485GluHis: 1.485 ± 0.16
3.451GluIle: 3.451 ± 0.193
3.684GluLys: 3.684 ± 0.279
6.232GluLeu: 6.232 ± 0.347
2.505GluMet: 2.505 ± 0.197
2.534GluAsn: 2.534 ± 0.199
1.835GluPro: 1.835 ± 0.166
2.941GluGln: 2.941 ± 0.226
3.742GluArg: 3.742 ± 0.285
2.927GluSer: 2.927 ± 0.249
3.713GluThr: 3.713 ± 0.271
4.179GluVal: 4.179 ± 0.303
1.092GluTrp: 1.092 ± 0.127
2.344GluTyr: 2.344 ± 0.192
0.0GluXaa: 0.0 ± 0.0
Phe
3.0PheAla: 3.0 ± 0.203
0.539PheCys: 0.539 ± 0.086
2.854PheAsp: 2.854 ± 0.233
2.141PheGlu: 2.141 ± 0.157
1.806PhePhe: 1.806 ± 0.19
2.898PheGly: 2.898 ± 0.217
0.947PheHis: 0.947 ± 0.114
1.878PheIle: 1.878 ± 0.187
2.155PheLys: 2.155 ± 0.203
3.145PheLeu: 3.145 ± 0.255
1.019PheMet: 1.019 ± 0.134
2.432PheAsn: 2.432 ± 0.195
1.602PhePro: 1.602 ± 0.158
1.704PheGln: 1.704 ± 0.139
2.257PheArg: 2.257 ± 0.171
2.621PheSer: 2.621 ± 0.22
2.869PheThr: 2.869 ± 0.234
2.796PheVal: 2.796 ± 0.201
0.335PheTrp: 0.335 ± 0.067
1.66PheTyr: 1.66 ± 0.184
0.0PheXaa: 0.0 ± 0.0
Gly
4.645GlyAla: 4.645 ± 0.291
0.466GlyCys: 0.466 ± 0.088
4.762GlyAsp: 4.762 ± 0.637
4.135GlyGlu: 4.135 ± 0.264
2.971GlyPhe: 2.971 ± 0.177
5.155GlyGly: 5.155 ± 0.463
1.281GlyHis: 1.281 ± 0.147
3.844GlyIle: 3.844 ± 0.212
4.747GlyLys: 4.747 ± 0.351
5.679GlyLeu: 5.679 ± 0.305
1.951GlyMet: 1.951 ± 0.176
3.699GlyAsn: 3.699 ± 0.334
1.922GlyPro: 1.922 ± 0.217
2.344GlyGln: 2.344 ± 0.182
3.961GlyArg: 3.961 ± 0.258
4.252GlySer: 4.252 ± 0.251
5.067GlyThr: 5.067 ± 0.508
5.315GlyVal: 5.315 ± 0.309
1.107GlyTrp: 1.107 ± 0.116
3.305GlyTyr: 3.305 ± 0.243
0.0GlyXaa: 0.0 ± 0.0
His
1.544HisAla: 1.544 ± 0.161
0.233HisCys: 0.233 ± 0.057
1.005HisAsp: 1.005 ± 0.125
0.976HisGlu: 0.976 ± 0.112
1.048HisPhe: 1.048 ± 0.135
1.34HisGly: 1.34 ± 0.178
0.466HisHis: 0.466 ± 0.092
0.947HisIle: 0.947 ± 0.137
0.801HisLys: 0.801 ± 0.106
2.082HisLeu: 2.082 ± 0.179
0.568HisMet: 0.568 ± 0.089
0.932HisAsn: 0.932 ± 0.121
1.136HisPro: 1.136 ± 0.121
0.83HisGln: 0.83 ± 0.107
1.544HisArg: 1.544 ± 0.165
1.267HisSer: 1.267 ± 0.123
0.845HisThr: 0.845 ± 0.113
1.15HisVal: 1.15 ± 0.157
0.306HisTrp: 0.306 ± 0.059
0.99HisTyr: 0.99 ± 0.105
0.0HisXaa: 0.0 ± 0.0
Ile
4.034IleAla: 4.034 ± 0.251
0.379IleCys: 0.379 ± 0.078
3.99IleAsp: 3.99 ± 0.23
3.349IleGlu: 3.349 ± 0.252
1.529IlePhe: 1.529 ± 0.151
3.029IleGly: 3.029 ± 0.202
1.209IleHis: 1.209 ± 0.134
2.519IleIle: 2.519 ± 0.179
2.912IleLys: 2.912 ± 0.208
4.267IleLeu: 4.267 ± 0.24
1.194IleMet: 1.194 ± 0.122
2.708IleAsn: 2.708 ± 0.188
3.058IlePro: 3.058 ± 0.224
2.024IleGln: 2.024 ± 0.192
3.349IleArg: 3.349 ± 0.235
3.116IleSer: 3.116 ± 0.198
3.305IleThr: 3.305 ± 0.154
2.796IleVal: 2.796 ± 0.201
0.728IleTrp: 0.728 ± 0.103
1.704IleTyr: 1.704 ± 0.141
0.0IleXaa: 0.0 ± 0.0
Lys
3.684LysAla: 3.684 ± 0.245
0.291LysCys: 0.291 ± 0.061
3.029LysAsp: 3.029 ± 0.222
3.932LysGlu: 3.932 ± 0.3
1.995LysPhe: 1.995 ± 0.18
4.325LysGly: 4.325 ± 0.723
0.903LysHis: 0.903 ± 0.111
2.738LysIle: 2.738 ± 0.211
2.927LysLys: 2.927 ± 0.237
4.82LysLeu: 4.82 ± 0.249
1.5LysMet: 1.5 ± 0.164
2.577LysAsn: 2.577 ± 0.25
2.039LysPro: 2.039 ± 0.193
2.315LysGln: 2.315 ± 0.213
3.014LysArg: 3.014 ± 0.222
2.548LysSer: 2.548 ± 0.19
2.912LysThr: 2.912 ± 0.199
4.281LysVal: 4.281 ± 0.309
0.655LysTrp: 0.655 ± 0.078
2.082LysTyr: 2.082 ± 0.223
0.0LysXaa: 0.0 ± 0.0
Leu
6.276LeuAla: 6.276 ± 0.327
0.888LeuCys: 0.888 ± 0.133
5.839LeuAsp: 5.839 ± 0.321
6.218LeuGlu: 6.218 ± 0.442
3.145LeuPhe: 3.145 ± 0.208
5.49LeuGly: 5.49 ± 0.272
1.645LeuHis: 1.645 ± 0.16
4.368LeuIle: 4.368 ± 0.259
4.878LeuLys: 4.878 ± 0.318
7.237LeuLeu: 7.237 ± 0.359
2.723LeuMet: 2.723 ± 0.207
4.98LeuAsn: 4.98 ± 0.272
4.674LeuPro: 4.674 ± 0.25
3.335LeuGln: 3.335 ± 0.261
5.373LeuArg: 5.373 ± 0.286
6.305LeuSer: 6.305 ± 0.323
6.596LeuThr: 6.596 ± 0.334
5.752LeuVal: 5.752 ± 0.298
0.961LeuTrp: 0.961 ± 0.118
3.407LeuTyr: 3.407 ± 0.241
0.0LeuXaa: 0.0 ± 0.0
Met
3.058MetAla: 3.058 ± 0.217
0.277MetCys: 0.277 ± 0.059
1.369MetAsp: 1.369 ± 0.162
2.155MetGlu: 2.155 ± 0.184
0.976MetPhe: 0.976 ± 0.105
1.922MetGly: 1.922 ± 0.173
0.422MetHis: 0.422 ± 0.09
1.325MetIle: 1.325 ± 0.17
1.514MetLys: 1.514 ± 0.161
2.082MetLeu: 2.082 ± 0.166
0.699MetMet: 0.699 ± 0.098
1.587MetAsn: 1.587 ± 0.156
0.903MetPro: 0.903 ± 0.107
0.874MetGln: 0.874 ± 0.137
1.849MetArg: 1.849 ± 0.195
2.534MetSer: 2.534 ± 0.18
1.908MetThr: 1.908 ± 0.148
1.98MetVal: 1.98 ± 0.186
0.306MetTrp: 0.306 ± 0.063
1.048MetTyr: 1.048 ± 0.114
0.0MetXaa: 0.0 ± 0.0
Asn
3.684AsnAla: 3.684 ± 0.254
0.277AsnCys: 0.277 ± 0.067
2.621AsnAsp: 2.621 ± 0.185
2.84AsnGlu: 2.84 ± 0.195
2.184AsnPhe: 2.184 ± 0.168
4.383AsnGly: 4.383 ± 0.306
0.947AsnHis: 0.947 ± 0.111
2.49AsnIle: 2.49 ± 0.187
2.359AsnLys: 2.359 ± 0.18
4.616AsnLeu: 4.616 ± 0.222
1.267AsnMet: 1.267 ± 0.163
2.548AsnAsn: 2.548 ± 0.216
2.971AsnPro: 2.971 ± 0.223
1.937AsnGln: 1.937 ± 0.149
3.0AsnArg: 3.0 ± 0.217
2.694AsnSer: 2.694 ± 0.202
2.985AsnThr: 2.985 ± 0.226
3.247AsnVal: 3.247 ± 0.243
0.67AsnTrp: 0.67 ± 0.126
1.893AsnTyr: 1.893 ± 0.174
0.0AsnXaa: 0.0 ± 0.0
Pro
2.941ProAla: 2.941 ± 0.189
0.32ProCys: 0.32 ± 0.077
3.102ProAsp: 3.102 ± 0.232
3.218ProGlu: 3.218 ± 0.237
1.733ProPhe: 1.733 ± 0.149
2.81ProGly: 2.81 ± 0.186
0.874ProHis: 0.874 ± 0.151
2.082ProIle: 2.082 ± 0.159
2.155ProLys: 2.155 ± 0.219
3.859ProLeu: 3.859 ± 0.281
1.427ProMet: 1.427 ± 0.128
2.097ProAsn: 2.097 ± 0.178
1.529ProPro: 1.529 ± 0.172
1.5ProGln: 1.5 ± 0.181
1.849ProArg: 1.849 ± 0.172
2.592ProSer: 2.592 ± 0.206
2.694ProThr: 2.694 ± 0.186
3.655ProVal: 3.655 ± 0.248
0.524ProTrp: 0.524 ± 0.088
1.325ProTyr: 1.325 ± 0.138
0.0ProXaa: 0.0 ± 0.0
Gln
3.072GlnAla: 3.072 ± 0.239
0.349GlnCys: 0.349 ± 0.069
1.864GlnAsp: 1.864 ± 0.156
2.534GlnGlu: 2.534 ± 0.182
1.616GlnPhe: 1.616 ± 0.136
2.65GlnGly: 2.65 ± 0.402
0.961GlnHis: 0.961 ± 0.127
1.747GlnIle: 1.747 ± 0.153
1.602GlnLys: 1.602 ± 0.17
3.771GlnLeu: 3.771 ± 0.297
1.267GlnMet: 1.267 ± 0.157
1.645GlnAsn: 1.645 ± 0.137
1.325GlnPro: 1.325 ± 0.129
1.893GlnGln: 1.893 ± 0.225
1.893GlnArg: 1.893 ± 0.151
2.111GlnSer: 2.111 ± 0.179
2.388GlnThr: 2.388 ± 0.173
3.262GlnVal: 3.262 ± 0.233
0.655GlnTrp: 0.655 ± 0.105
1.893GlnTyr: 1.893 ± 0.163
0.0GlnXaa: 0.0 ± 0.0
Arg
3.917ArgAla: 3.917 ± 0.226
0.539ArgCys: 0.539 ± 0.091
4.092ArgAsp: 4.092 ± 0.319
3.684ArgGlu: 3.684 ± 0.284
2.607ArgPhe: 2.607 ± 0.197
3.99ArgGly: 3.99 ± 0.327
1.092ArgHis: 1.092 ± 0.15
3.32ArgIle: 3.32 ± 0.188
3.058ArgLys: 3.058 ± 0.249
5.839ArgLeu: 5.839 ± 0.298
1.806ArgMet: 1.806 ± 0.193
2.883ArgAsn: 2.883 ± 0.218
2.082ArgPro: 2.082 ± 0.185
2.068ArgGln: 2.068 ± 0.18
3.495ArgArg: 3.495 ± 0.301
3.029ArgSer: 3.029 ± 0.275
3.32ArgThr: 3.32 ± 0.226
4.194ArgVal: 4.194 ± 0.219
0.714ArgTrp: 0.714 ± 0.104
2.315ArgTyr: 2.315 ± 0.208
0.0ArgXaa: 0.0 ± 0.0
Ser
4.907SerAla: 4.907 ± 0.318
0.393SerCys: 0.393 ± 0.078
3.509SerAsp: 3.509 ± 0.249
3.364SerGlu: 3.364 ± 0.201
2.344SerPhe: 2.344 ± 0.187
4.529SerGly: 4.529 ± 0.284
1.311SerHis: 1.311 ± 0.15
3.16SerIle: 3.16 ± 0.218
2.81SerLys: 2.81 ± 0.212
5.49SerLeu: 5.49 ± 0.306
1.529SerMet: 1.529 ± 0.142
2.621SerAsn: 2.621 ± 0.192
2.446SerPro: 2.446 ± 0.17
2.097SerGln: 2.097 ± 0.191
3.83SerArg: 3.83 ± 0.281
3.407SerSer: 3.407 ± 0.265
3.917SerThr: 3.917 ± 0.218
3.844SerVal: 3.844 ± 0.296
0.801SerTrp: 0.801 ± 0.103
2.009SerTyr: 2.009 ± 0.155
0.0SerXaa: 0.0 ± 0.0
Thr
4.747ThrAla: 4.747 ± 0.335
0.481ThrCys: 0.481 ± 0.09
4.237ThrAsp: 4.237 ± 0.276
3.67ThrGlu: 3.67 ± 0.244
2.49ThrPhe: 2.49 ± 0.2
5.315ThrGly: 5.315 ± 0.426
1.092ThrHis: 1.092 ± 0.119
3.0ThrIle: 3.0 ± 0.23
3.058ThrLys: 3.058 ± 0.24
6.363ThrLeu: 6.363 ± 0.333
1.733ThrMet: 1.733 ± 0.165
2.912ThrAsn: 2.912 ± 0.251
3.247ThrPro: 3.247 ± 0.214
2.49ThrGln: 2.49 ± 0.198
3.262ThrArg: 3.262 ± 0.197
3.495ThrSer: 3.495 ± 0.224
4.427ThrThr: 4.427 ± 0.303
4.951ThrVal: 4.951 ± 0.281
1.15ThrTrp: 1.15 ± 0.133
2.286ThrTyr: 2.286 ± 0.221
0.0ThrXaa: 0.0 ± 0.0
Val
5.825ValAla: 5.825 ± 0.315
0.422ValCys: 0.422 ± 0.073
4.922ValAsp: 4.922 ± 0.246
4.703ValGlu: 4.703 ± 0.247
2.636ValPhe: 2.636 ± 0.21
4.121ValGly: 4.121 ± 0.345
1.296ValHis: 1.296 ± 0.143
3.771ValIle: 3.771 ± 0.239
3.582ValLys: 3.582 ± 0.229
6.203ValLeu: 6.203 ± 0.265
2.403ValMet: 2.403 ± 0.15
3.917ValAsn: 3.917 ± 0.232
3.0ValPro: 3.0 ± 0.229
2.403ValGln: 2.403 ± 0.184
3.961ValArg: 3.961 ± 0.236
4.572ValSer: 4.572 ± 0.286
5.024ValThr: 5.024 ± 0.24
5.082ValVal: 5.082 ± 0.301
0.947ValTrp: 0.947 ± 0.121
2.81ValTyr: 2.81 ± 0.211
0.0ValXaa: 0.0 ± 0.0
Trp
0.961TrpAla: 0.961 ± 0.129
0.131TrpCys: 0.131 ± 0.045
0.801TrpAsp: 0.801 ± 0.108
1.005TrpGlu: 1.005 ± 0.112
0.597TrpPhe: 0.597 ± 0.089
0.83TrpGly: 0.83 ± 0.104
0.306TrpHis: 0.306 ± 0.061
0.597TrpIle: 0.597 ± 0.096
0.655TrpLys: 0.655 ± 0.083
1.529TrpLeu: 1.529 ± 0.161
0.291TrpMet: 0.291 ± 0.066
0.815TrpAsn: 0.815 ± 0.115
0.408TrpPro: 0.408 ± 0.08
0.568TrpGln: 0.568 ± 0.073
0.772TrpArg: 0.772 ± 0.1
0.801TrpSer: 0.801 ± 0.115
0.728TrpThr: 0.728 ± 0.118
1.296TrpVal: 1.296 ± 0.21
0.248TrpTrp: 0.248 ± 0.066
0.553TrpTyr: 0.553 ± 0.094
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.344TyrAla: 2.344 ± 0.175
0.51TyrCys: 0.51 ± 0.083
2.475TyrAsp: 2.475 ± 0.163
1.835TyrGlu: 1.835 ± 0.161
1.951TyrPhe: 1.951 ± 0.173
2.898TyrGly: 2.898 ± 0.213
0.976TyrHis: 0.976 ± 0.114
1.573TyrIle: 1.573 ± 0.18
1.485TyrLys: 1.485 ± 0.142
4.063TyrLeu: 4.063 ± 0.24
0.961TyrMet: 0.961 ± 0.113
1.951TyrAsn: 1.951 ± 0.178
1.98TyrPro: 1.98 ± 0.205
1.791TyrGln: 1.791 ± 0.164
2.694TyrArg: 2.694 ± 0.212
2.315TyrSer: 2.315 ± 0.207
2.141TyrThr: 2.141 ± 0.195
2.738TyrVal: 2.738 ± 0.21
0.582TyrTrp: 0.582 ± 0.104
1.718TyrTyr: 1.718 ± 0.187
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 241 proteins (68675 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski