Amino acid dipepetide frequency for Erwinia phage vB_EamM_Kwan

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.982AlaAla: 7.982 ± 0.51
0.816AlaCys: 0.816 ± 0.106
4.911AlaAsp: 4.911 ± 0.275
4.768AlaGlu: 4.768 ± 0.258
3.641AlaPhe: 3.641 ± 0.228
5.468AlaGly: 5.468 ± 0.367
1.581AlaHis: 1.581 ± 0.122
4.626AlaIle: 4.626 ± 0.239
4.354AlaLys: 4.354 ± 0.268
8.28AlaLeu: 8.28 ± 0.331
2.747AlaMet: 2.747 ± 0.222
3.887AlaAsn: 3.887 ± 0.303
3.084AlaPro: 3.084 ± 0.204
3.265AlaGln: 3.265 ± 0.241
4.172AlaArg: 4.172 ± 0.261
4.354AlaSer: 4.354 ± 0.325
4.963AlaThr: 4.963 ± 0.243
6.271AlaVal: 6.271 ± 0.31
1.011AlaTrp: 1.011 ± 0.094
3.058AlaTyr: 3.058 ± 0.207
0.0AlaXaa: 0.0 ± 0.0
Cys
0.531CysAla: 0.531 ± 0.085
0.078CysCys: 0.078 ± 0.034
0.596CysAsp: 0.596 ± 0.104
0.544CysGlu: 0.544 ± 0.103
0.35CysPhe: 0.35 ± 0.061
0.583CysGly: 0.583 ± 0.097
0.35CysHis: 0.35 ± 0.063
0.466CysIle: 0.466 ± 0.087
0.415CysLys: 0.415 ± 0.071
0.739CysLeu: 0.739 ± 0.092
0.194CysMet: 0.194 ± 0.048
0.518CysAsn: 0.518 ± 0.107
0.466CysPro: 0.466 ± 0.083
0.415CysGln: 0.415 ± 0.065
0.324CysArg: 0.324 ± 0.071
0.479CysSer: 0.479 ± 0.082
0.544CysThr: 0.544 ± 0.092
0.726CysVal: 0.726 ± 0.084
0.117CysTrp: 0.117 ± 0.038
0.298CysTyr: 0.298 ± 0.064
0.0CysXaa: 0.0 ± 0.0
Asp
5.533AspAla: 5.533 ± 0.287
0.609AspCys: 0.609 ± 0.104
4.263AspAsp: 4.263 ± 0.258
4.004AspGlu: 4.004 ± 0.246
3.123AspPhe: 3.123 ± 0.195
4.185AspGly: 4.185 ± 0.242
1.257AspHis: 1.257 ± 0.132
3.81AspIle: 3.81 ± 0.212
3.201AspLys: 3.201 ± 0.222
5.455AspLeu: 5.455 ± 0.24
1.827AspMet: 1.827 ± 0.155
2.773AspAsn: 2.773 ± 0.198
2.786AspPro: 2.786 ± 0.173
2.138AspGln: 2.138 ± 0.172
3.019AspArg: 3.019 ± 0.201
3.252AspSer: 3.252 ± 0.183
3.758AspThr: 3.758 ± 0.199
4.846AspVal: 4.846 ± 0.238
0.985AspTrp: 0.985 ± 0.119
2.579AspTyr: 2.579 ± 0.161
0.0AspXaa: 0.0 ± 0.0
Glu
4.742GluAla: 4.742 ± 0.289
0.57GluCys: 0.57 ± 0.083
3.615GluAsp: 3.615 ± 0.224
4.043GluGlu: 4.043 ± 0.309
2.579GluPhe: 2.579 ± 0.199
3.589GluGly: 3.589 ± 0.223
1.477GluHis: 1.477 ± 0.139
3.304GluIle: 3.304 ± 0.204
3.33GluLys: 3.33 ± 0.254
6.66GluLeu: 6.66 ± 0.368
1.892GluMet: 1.892 ± 0.142
2.384GluAsn: 2.384 ± 0.182
2.177GluPro: 2.177 ± 0.192
2.851GluGln: 2.851 ± 0.236
3.408GluArg: 3.408 ± 0.208
3.524GluSer: 3.524 ± 0.199
3.201GluThr: 3.201 ± 0.199
4.082GluVal: 4.082 ± 0.252
1.024GluTrp: 1.024 ± 0.127
2.177GluTyr: 2.177 ± 0.191
0.0GluXaa: 0.0 ± 0.0
Phe
3.317PheAla: 3.317 ± 0.175
0.441PheCys: 0.441 ± 0.087
3.019PheAsp: 3.019 ± 0.211
2.345PheGlu: 2.345 ± 0.181
1.84PhePhe: 1.84 ± 0.144
2.643PheGly: 2.643 ± 0.188
0.842PheHis: 0.842 ± 0.093
2.008PheIle: 2.008 ± 0.16
1.995PheLys: 1.995 ± 0.163
2.63PheLeu: 2.63 ± 0.211
1.218PheMet: 1.218 ± 0.126
2.954PheAsn: 2.954 ± 0.227
1.684PhePro: 1.684 ± 0.154
1.231PheGln: 1.231 ± 0.115
2.164PheArg: 2.164 ± 0.154
2.747PheSer: 2.747 ± 0.196
3.032PheThr: 3.032 ± 0.2
2.928PheVal: 2.928 ± 0.208
0.376PheTrp: 0.376 ± 0.06
1.879PheTyr: 1.879 ± 0.155
0.0PheXaa: 0.0 ± 0.0
Gly
4.315GlyAla: 4.315 ± 0.313
0.376GlyCys: 0.376 ± 0.078
3.758GlyAsp: 3.758 ± 0.239
4.444GlyGlu: 4.444 ± 0.271
3.084GlyPhe: 3.084 ± 0.189
5.079GlyGly: 5.079 ± 0.616
1.088GlyHis: 1.088 ± 0.133
3.771GlyIle: 3.771 ± 0.204
3.978GlyLys: 3.978 ± 0.232
5.637GlyLeu: 5.637 ± 0.239
2.397GlyMet: 2.397 ± 0.185
3.097GlyAsn: 3.097 ± 0.192
1.568GlyPro: 1.568 ± 0.162
2.579GlyGln: 2.579 ± 0.183
3.835GlyArg: 3.835 ± 0.313
4.276GlySer: 4.276 ± 0.331
4.043GlyThr: 4.043 ± 0.261
4.95GlyVal: 4.95 ± 0.242
1.088GlyTrp: 1.088 ± 0.136
2.799GlyTyr: 2.799 ± 0.208
0.0GlyXaa: 0.0 ± 0.0
His
1.659HisAla: 1.659 ± 0.15
0.22HisCys: 0.22 ± 0.056
1.399HisAsp: 1.399 ± 0.13
1.114HisGlu: 1.114 ± 0.135
0.777HisPhe: 0.777 ± 0.097
1.153HisGly: 1.153 ± 0.134
0.713HisHis: 0.713 ± 0.096
1.231HisIle: 1.231 ± 0.137
0.92HisLys: 0.92 ± 0.115
2.034HisLeu: 2.034 ± 0.178
0.441HisMet: 0.441 ± 0.075
0.829HisAsn: 0.829 ± 0.095
1.309HisPro: 1.309 ± 0.133
0.752HisGln: 0.752 ± 0.092
1.412HisArg: 1.412 ± 0.119
1.063HisSer: 1.063 ± 0.099
1.024HisThr: 1.024 ± 0.118
1.84HisVal: 1.84 ± 0.164
0.337HisTrp: 0.337 ± 0.072
1.296HisTyr: 1.296 ± 0.155
0.0HisXaa: 0.0 ± 0.0
Ile
4.872IleAla: 4.872 ± 0.224
0.454IleCys: 0.454 ± 0.078
4.108IleAsp: 4.108 ± 0.261
3.602IleGlu: 3.602 ± 0.247
1.412IlePhe: 1.412 ± 0.147
3.265IleGly: 3.265 ± 0.223
1.27IleHis: 1.27 ± 0.133
2.475IleIle: 2.475 ± 0.18
2.915IleLys: 2.915 ± 0.193
3.745IleLeu: 3.745 ± 0.213
1.101IleMet: 1.101 ± 0.11
2.721IleAsn: 2.721 ± 0.16
3.019IlePro: 3.019 ± 0.205
1.814IleGln: 1.814 ± 0.145
3.343IleArg: 3.343 ± 0.198
3.239IleSer: 3.239 ± 0.19
3.628IleThr: 3.628 ± 0.239
3.434IleVal: 3.434 ± 0.215
0.531IleTrp: 0.531 ± 0.077
1.801IleTyr: 1.801 ± 0.15
0.0IleXaa: 0.0 ± 0.0
Lys
4.509LysAla: 4.509 ± 0.214
0.35LysCys: 0.35 ± 0.07
3.434LysAsp: 3.434 ± 0.193
3.291LysGlu: 3.291 ± 0.248
2.086LysPhe: 2.086 ± 0.172
3.252LysGly: 3.252 ± 0.225
1.399LysHis: 1.399 ± 0.142
2.397LysIle: 2.397 ± 0.188
2.682LysLys: 2.682 ± 0.223
4.937LysLeu: 4.937 ± 0.253
1.71LysMet: 1.71 ± 0.158
2.047LysAsn: 2.047 ± 0.155
2.151LysPro: 2.151 ± 0.14
1.814LysGln: 1.814 ± 0.159
2.838LysArg: 2.838 ± 0.181
2.669LysSer: 2.669 ± 0.194
3.46LysThr: 3.46 ± 0.19
3.978LysVal: 3.978 ± 0.226
0.557LysTrp: 0.557 ± 0.079
2.021LysTyr: 2.021 ± 0.187
0.0LysXaa: 0.0 ± 0.0
Leu
8.021LeuAla: 8.021 ± 0.309
0.907LeuCys: 0.907 ± 0.129
5.3LeuAsp: 5.3 ± 0.247
5.481LeuGlu: 5.481 ± 0.303
3.537LeuPhe: 3.537 ± 0.234
5.585LeuGly: 5.585 ± 0.249
2.073LeuHis: 2.073 ± 0.163
4.159LeuIle: 4.159 ± 0.265
4.691LeuLys: 4.691 ± 0.242
7.787LeuLeu: 7.787 ± 0.31
2.436LeuMet: 2.436 ± 0.182
4.794LeuAsn: 4.794 ± 0.301
4.794LeuPro: 4.794 ± 0.252
3.589LeuGln: 3.589 ± 0.217
5.274LeuArg: 5.274 ± 0.272
6.233LeuSer: 6.233 ± 0.269
6.349LeuThr: 6.349 ± 0.319
5.429LeuVal: 5.429 ± 0.229
1.101LeuTrp: 1.101 ± 0.13
3.136LeuTyr: 3.136 ± 0.226
0.0LeuXaa: 0.0 ± 0.0
Met
2.527MetAla: 2.527 ± 0.194
0.155MetCys: 0.155 ± 0.037
1.697MetAsp: 1.697 ± 0.149
1.581MetGlu: 1.581 ± 0.146
1.27MetPhe: 1.27 ± 0.122
1.905MetGly: 1.905 ± 0.171
0.363MetHis: 0.363 ± 0.074
1.063MetIle: 1.063 ± 0.122
1.594MetLys: 1.594 ± 0.17
2.592MetLeu: 2.592 ± 0.216
0.687MetMet: 0.687 ± 0.097
1.503MetAsn: 1.503 ± 0.125
1.037MetPro: 1.037 ± 0.105
1.192MetGln: 1.192 ± 0.114
1.931MetArg: 1.931 ± 0.147
1.879MetSer: 1.879 ± 0.132
1.723MetThr: 1.723 ± 0.124
1.814MetVal: 1.814 ± 0.174
0.376MetTrp: 0.376 ± 0.078
0.855MetTyr: 0.855 ± 0.108
0.0MetXaa: 0.0 ± 0.0
Asn
4.315AsnAla: 4.315 ± 0.217
0.363AsnCys: 0.363 ± 0.063
3.291AsnAsp: 3.291 ± 0.23
2.527AsnGlu: 2.527 ± 0.16
1.581AsnPhe: 1.581 ± 0.131
4.444AsnGly: 4.444 ± 0.212
0.946AsnHis: 0.946 ± 0.106
2.488AsnIle: 2.488 ± 0.183
2.281AsnLys: 2.281 ± 0.186
3.797AsnLeu: 3.797 ± 0.199
1.14AsnMet: 1.14 ± 0.111
2.306AsnAsn: 2.306 ± 0.236
2.708AsnPro: 2.708 ± 0.223
1.905AsnGln: 1.905 ± 0.181
2.695AsnArg: 2.695 ± 0.192
2.643AsnSer: 2.643 ± 0.22
3.11AsnThr: 3.11 ± 0.202
3.395AsnVal: 3.395 ± 0.162
0.661AsnTrp: 0.661 ± 0.09
1.723AsnTyr: 1.723 ± 0.147
0.0AsnXaa: 0.0 ± 0.0
Pro
3.952ProAla: 3.952 ± 0.238
0.259ProCys: 0.259 ± 0.06
3.291ProAsp: 3.291 ± 0.2
3.084ProGlu: 3.084 ± 0.231
1.62ProPhe: 1.62 ± 0.14
2.682ProGly: 2.682 ± 0.165
1.088ProHis: 1.088 ± 0.111
2.397ProIle: 2.397 ± 0.201
1.957ProLys: 1.957 ± 0.147
4.03ProLeu: 4.03 ± 0.225
0.972ProMet: 0.972 ± 0.098
1.97ProAsn: 1.97 ± 0.171
1.827ProPro: 1.827 ± 0.197
1.581ProGln: 1.581 ± 0.144
2.06ProArg: 2.06 ± 0.191
2.669ProSer: 2.669 ± 0.208
3.032ProThr: 3.032 ± 0.206
3.473ProVal: 3.473 ± 0.22
0.428ProTrp: 0.428 ± 0.073
1.451ProTyr: 1.451 ± 0.154
0.0ProXaa: 0.0 ± 0.0
Gln
3.486GlnAla: 3.486 ± 0.232
0.311GlnCys: 0.311 ± 0.07
1.594GlnAsp: 1.594 ± 0.139
2.306GlnGlu: 2.306 ± 0.169
1.749GlnPhe: 1.749 ± 0.153
2.177GlnGly: 2.177 ± 0.186
0.868GlnHis: 0.868 ± 0.135
1.827GlnIle: 1.827 ± 0.124
1.516GlnLys: 1.516 ± 0.13
3.9GlnLeu: 3.9 ± 0.224
1.114GlnMet: 1.114 ± 0.136
1.931GlnAsn: 1.931 ± 0.173
1.646GlnPro: 1.646 ± 0.188
1.827GlnGln: 1.827 ± 0.163
2.553GlnArg: 2.553 ± 0.197
2.242GlnSer: 2.242 ± 0.196
2.125GlnThr: 2.125 ± 0.165
2.436GlnVal: 2.436 ± 0.178
0.803GlnTrp: 0.803 ± 0.103
1.516GlnTyr: 1.516 ± 0.138
0.0GlnXaa: 0.0 ± 0.0
Arg
3.356ArgAla: 3.356 ± 0.229
0.648ArgCys: 0.648 ± 0.108
3.239ArgAsp: 3.239 ± 0.202
3.434ArgGlu: 3.434 ± 0.251
2.488ArgPhe: 2.488 ± 0.169
3.563ArgGly: 3.563 ± 0.272
1.386ArgHis: 1.386 ± 0.139
3.239ArgIle: 3.239 ± 0.182
3.33ArgLys: 3.33 ± 0.238
5.546ArgLeu: 5.546 ± 0.32
1.723ArgMet: 1.723 ± 0.149
2.902ArgAsn: 2.902 ± 0.181
1.944ArgPro: 1.944 ± 0.163
2.164ArgGln: 2.164 ± 0.183
3.33ArgArg: 3.33 ± 0.246
3.071ArgSer: 3.071 ± 0.19
2.967ArgThr: 2.967 ± 0.22
4.354ArgVal: 4.354 ± 0.264
0.855ArgTrp: 0.855 ± 0.116
2.151ArgTyr: 2.151 ± 0.152
0.0ArgXaa: 0.0 ± 0.0
Ser
4.276SerAla: 4.276 ± 0.23
0.363SerCys: 0.363 ± 0.065
3.511SerAsp: 3.511 ± 0.223
3.123SerGlu: 3.123 ± 0.239
2.475SerPhe: 2.475 ± 0.176
4.315SerGly: 4.315 ± 0.267
0.985SerHis: 0.985 ± 0.108
3.369SerIle: 3.369 ± 0.201
3.136SerLys: 3.136 ± 0.204
5.714SerLeu: 5.714 ± 0.254
1.529SerMet: 1.529 ± 0.143
2.838SerAsn: 2.838 ± 0.199
2.514SerPro: 2.514 ± 0.187
2.099SerGln: 2.099 ± 0.164
3.045SerArg: 3.045 ± 0.196
3.797SerSer: 3.797 ± 0.29
3.732SerThr: 3.732 ± 0.231
4.976SerVal: 4.976 ± 0.294
0.829SerTrp: 0.829 ± 0.104
2.021SerTyr: 2.021 ± 0.165
0.0SerXaa: 0.0 ± 0.0
Thr
5.339ThrAla: 5.339 ± 0.301
0.557ThrCys: 0.557 ± 0.086
3.939ThrAsp: 3.939 ± 0.219
3.265ThrGlu: 3.265 ± 0.218
2.708ThrPhe: 2.708 ± 0.205
4.47ThrGly: 4.47 ± 0.255
1.27ThrHis: 1.27 ± 0.125
3.499ThrIle: 3.499 ± 0.194
2.812ThrLys: 2.812 ± 0.171
6.233ThrLeu: 6.233 ± 0.301
1.451ThrMet: 1.451 ± 0.135
2.799ThrAsn: 2.799 ± 0.205
3.641ThrPro: 3.641 ± 0.205
2.268ThrGln: 2.268 ± 0.158
3.226ThrArg: 3.226 ± 0.245
3.343ThrSer: 3.343 ± 0.183
4.159ThrThr: 4.159 ± 0.282
5.377ThrVal: 5.377 ± 0.282
0.816ThrTrp: 0.816 ± 0.11
2.021ThrTyr: 2.021 ± 0.164
0.0ThrXaa: 0.0 ± 0.0
Val
6.595ValAla: 6.595 ± 0.326
0.687ValCys: 0.687 ± 0.086
5.157ValAsp: 5.157 ± 0.253
5.079ValGlu: 5.079 ± 0.262
2.76ValPhe: 2.76 ± 0.163
4.393ValGly: 4.393 ± 0.209
1.309ValHis: 1.309 ± 0.149
4.017ValIle: 4.017 ± 0.206
3.978ValLys: 3.978 ± 0.203
6.077ValLeu: 6.077 ± 0.311
1.775ValMet: 1.775 ± 0.154
3.576ValAsn: 3.576 ± 0.235
3.408ValPro: 3.408 ± 0.207
2.255ValGln: 2.255 ± 0.162
3.835ValArg: 3.835 ± 0.217
4.328ValSer: 4.328 ± 0.256
5.144ValThr: 5.144 ± 0.303
5.339ValVal: 5.339 ± 0.318
0.998ValTrp: 0.998 ± 0.096
2.708ValTyr: 2.708 ± 0.195
0.0ValXaa: 0.0 ± 0.0
Trp
1.05TrpAla: 1.05 ± 0.11
0.13TrpCys: 0.13 ± 0.044
0.907TrpAsp: 0.907 ± 0.09
0.752TrpGlu: 0.752 ± 0.103
0.635TrpPhe: 0.635 ± 0.086
0.777TrpGly: 0.777 ± 0.086
0.285TrpHis: 0.285 ± 0.063
0.7TrpIle: 0.7 ± 0.09
0.842TrpLys: 0.842 ± 0.098
1.451TrpLeu: 1.451 ± 0.138
0.518TrpMet: 0.518 ± 0.069
0.57TrpAsn: 0.57 ± 0.078
0.389TrpPro: 0.389 ± 0.056
0.596TrpGln: 0.596 ± 0.083
0.79TrpArg: 0.79 ± 0.096
0.726TrpSer: 0.726 ± 0.095
0.661TrpThr: 0.661 ± 0.099
1.127TrpVal: 1.127 ± 0.11
0.233TrpTrp: 0.233 ± 0.063
0.415TrpTyr: 0.415 ± 0.078
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.864TyrAla: 2.864 ± 0.212
0.441TyrCys: 0.441 ± 0.078
2.371TyrAsp: 2.371 ± 0.168
1.84TyrGlu: 1.84 ± 0.151
1.581TyrPhe: 1.581 ± 0.14
2.63TyrGly: 2.63 ± 0.154
0.972TyrHis: 0.972 ± 0.116
1.905TyrIle: 1.905 ± 0.204
1.659TyrLys: 1.659 ± 0.149
3.434TyrLeu: 3.434 ± 0.213
0.855TyrMet: 0.855 ± 0.109
1.918TyrAsn: 1.918 ± 0.168
1.672TyrPro: 1.672 ± 0.124
1.607TyrGln: 1.607 ± 0.153
2.436TyrArg: 2.436 ± 0.175
2.112TyrSer: 2.112 ± 0.182
2.54TyrThr: 2.54 ± 0.199
2.643TyrVal: 2.643 ± 0.185
0.428TyrTrp: 0.428 ± 0.069
1.788TyrTyr: 1.788 ± 0.156
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 285 proteins (77176 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski