Amino acid dipepetide frequency for Erwinia phage phiEaH2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.576AlaAla: 7.576 ± 0.541
0.715AlaCys: 0.715 ± 0.114
4.611AlaAsp: 4.611 ± 0.312
5.123AlaGlu: 5.123 ± 0.235
3.074AlaPhe: 3.074 ± 0.196
5.676AlaGly: 5.676 ± 0.422
1.564AlaHis: 1.564 ± 0.158
4.745AlaIle: 4.745 ± 0.248
4.381AlaLys: 4.381 ± 0.351
7.293AlaLeu: 7.293 ± 0.338
2.44AlaMet: 2.44 ± 0.167
3.64AlaAsn: 3.64 ± 0.23
2.952AlaPro: 2.952 ± 0.214
3.033AlaGln: 3.033 ± 0.212
3.734AlaArg: 3.734 ± 0.243
4.139AlaSer: 4.139 ± 0.224
4.934AlaThr: 4.934 ± 0.236
6.08AlaVal: 6.08 ± 0.315
1.173AlaTrp: 1.173 ± 0.135
2.939AlaTyr: 2.939 ± 0.187
0.0AlaXaa: 0.0 ± 0.0
Cys
0.391CysAla: 0.391 ± 0.075
0.067CysCys: 0.067 ± 0.026
0.391CysAsp: 0.391 ± 0.065
0.539CysGlu: 0.539 ± 0.097
0.283CysPhe: 0.283 ± 0.069
0.62CysGly: 0.62 ± 0.098
0.31CysHis: 0.31 ± 0.058
0.472CysIle: 0.472 ± 0.079
0.431CysLys: 0.431 ± 0.082
0.728CysLeu: 0.728 ± 0.117
0.297CysMet: 0.297 ± 0.062
0.418CysAsn: 0.418 ± 0.083
0.337CysPro: 0.337 ± 0.064
0.256CysGln: 0.256 ± 0.058
0.472CysArg: 0.472 ± 0.086
0.634CysSer: 0.634 ± 0.1
0.526CysThr: 0.526 ± 0.089
0.674CysVal: 0.674 ± 0.104
0.081CysTrp: 0.081 ± 0.028
0.404CysTyr: 0.404 ± 0.072
0.0CysXaa: 0.0 ± 0.0
Asp
5.163AspAla: 5.163 ± 0.257
0.485AspCys: 0.485 ± 0.09
4.435AspAsp: 4.435 ± 0.329
4.341AspGlu: 4.341 ± 0.274
2.764AspPhe: 2.764 ± 0.187
4.853AspGly: 4.853 ± 0.241
1.146AspHis: 1.146 ± 0.112
3.721AspIle: 3.721 ± 0.224
3.289AspLys: 3.289 ± 0.226
5.851AspLeu: 5.851 ± 0.274
1.577AspMet: 1.577 ± 0.158
3.155AspAsn: 3.155 ± 0.237
2.993AspPro: 2.993 ± 0.189
2.009AspGln: 2.009 ± 0.166
2.764AspArg: 2.764 ± 0.19
2.75AspSer: 2.75 ± 0.193
3.856AspThr: 3.856 ± 0.21
5.258AspVal: 5.258 ± 0.277
1.146AspTrp: 1.146 ± 0.115
2.481AspTyr: 2.481 ± 0.188
0.0AspXaa: 0.0 ± 0.0
Glu
4.691GluAla: 4.691 ± 0.264
0.634GluCys: 0.634 ± 0.101
3.532GluAsp: 3.532 ± 0.237
4.071GluGlu: 4.071 ± 0.291
2.656GluPhe: 2.656 ± 0.216
3.68GluGly: 3.68 ± 0.238
1.389GluHis: 1.389 ± 0.156
3.546GluIle: 3.546 ± 0.194
3.707GluLys: 3.707 ± 0.248
6.687GluLeu: 6.687 ± 0.293
1.847GluMet: 1.847 ± 0.149
3.087GluAsn: 3.087 ± 0.198
2.076GluPro: 2.076 ± 0.182
2.386GluGln: 2.386 ± 0.157
3.546GluArg: 3.546 ± 0.203
3.262GluSer: 3.262 ± 0.23
3.64GluThr: 3.64 ± 0.208
4.368GluVal: 4.368 ± 0.246
1.025GluTrp: 1.025 ± 0.128
2.454GluTyr: 2.454 ± 0.195
0.0GluXaa: 0.0 ± 0.0
Phe
3.006PheAla: 3.006 ± 0.213
0.377PheCys: 0.377 ± 0.061
2.872PheAsp: 2.872 ± 0.217
2.521PheGlu: 2.521 ± 0.172
1.699PhePhe: 1.699 ± 0.148
3.006PheGly: 3.006 ± 0.214
0.809PheHis: 0.809 ± 0.119
2.211PheIle: 2.211 ± 0.161
2.224PheLys: 2.224 ± 0.146
2.952PheLeu: 2.952 ± 0.192
1.119PheMet: 1.119 ± 0.132
2.629PheAsn: 2.629 ± 0.165
1.604PhePro: 1.604 ± 0.178
1.052PheGln: 1.052 ± 0.132
2.076PheArg: 2.076 ± 0.189
2.629PheSer: 2.629 ± 0.186
2.804PheThr: 2.804 ± 0.188
2.656PheVal: 2.656 ± 0.201
0.566PheTrp: 0.566 ± 0.096
1.86PheTyr: 1.86 ± 0.147
0.0PheXaa: 0.0 ± 0.0
Gly
4.651GlyAla: 4.651 ± 0.306
0.512GlyCys: 0.512 ± 0.078
3.815GlyAsp: 3.815 ± 0.256
4.395GlyGlu: 4.395 ± 0.253
2.993GlyPhe: 2.993 ± 0.203
5.325GlyGly: 5.325 ± 0.571
1.186GlyHis: 1.186 ± 0.128
3.883GlyIle: 3.883 ± 0.217
4.435GlyLys: 4.435 ± 0.287
6.107GlyLeu: 6.107 ± 0.298
1.928GlyMet: 1.928 ± 0.155
3.626GlyAsn: 3.626 ± 0.211
1.685GlyPro: 1.685 ± 0.208
2.561GlyGln: 2.561 ± 0.166
3.667GlyArg: 3.667 ± 0.276
4.031GlySer: 4.031 ± 0.284
4.139GlyThr: 4.139 ± 0.247
4.948GlyVal: 4.948 ± 0.215
1.281GlyTrp: 1.281 ± 0.141
2.818GlyTyr: 2.818 ± 0.191
0.0GlyXaa: 0.0 ± 0.0
His
1.631HisAla: 1.631 ± 0.18
0.175HisCys: 0.175 ± 0.045
1.254HisAsp: 1.254 ± 0.128
1.213HisGlu: 1.213 ± 0.127
1.011HisPhe: 1.011 ± 0.113
1.294HisGly: 1.294 ± 0.138
0.566HisHis: 0.566 ± 0.08
1.335HisIle: 1.335 ± 0.14
0.917HisLys: 0.917 ± 0.132
2.036HisLeu: 2.036 ± 0.153
0.566HisMet: 0.566 ± 0.09
0.917HisAsn: 0.917 ± 0.101
1.267HisPro: 1.267 ± 0.152
0.876HisGln: 0.876 ± 0.113
1.213HisArg: 1.213 ± 0.137
0.822HisSer: 0.822 ± 0.101
1.119HisThr: 1.119 ± 0.137
1.092HisVal: 1.092 ± 0.131
0.256HisTrp: 0.256 ± 0.05
0.957HisTyr: 0.957 ± 0.146
0.0HisXaa: 0.0 ± 0.0
Ile
4.503IleAla: 4.503 ± 0.222
0.499IleCys: 0.499 ± 0.077
4.031IleAsp: 4.031 ± 0.236
3.721IleGlu: 3.721 ± 0.219
1.618IlePhe: 1.618 ± 0.127
3.64IleGly: 3.64 ± 0.262
1.079IleHis: 1.079 ± 0.121
2.508IleIle: 2.508 ± 0.199
2.548IleLys: 2.548 ± 0.184
4.233IleLeu: 4.233 ± 0.253
1.159IleMet: 1.159 ± 0.148
2.818IleAsn: 2.818 ± 0.223
2.845IlePro: 2.845 ± 0.208
1.847IleGln: 1.847 ± 0.17
3.141IleArg: 3.141 ± 0.219
3.411IleSer: 3.411 ± 0.209
3.68IleThr: 3.68 ± 0.216
3.276IleVal: 3.276 ± 0.184
0.539IleTrp: 0.539 ± 0.095
2.305IleTyr: 2.305 ± 0.164
0.0IleXaa: 0.0 ± 0.0
Lys
4.462LysAla: 4.462 ± 0.315
0.351LysCys: 0.351 ± 0.063
3.289LysAsp: 3.289 ± 0.214
3.842LysGlu: 3.842 ± 0.219
2.197LysPhe: 2.197 ± 0.173
3.478LysGly: 3.478 ± 0.295
1.132LysHis: 1.132 ± 0.122
2.534LysIle: 2.534 ± 0.18
2.858LysLys: 2.858 ± 0.212
5.676LysLeu: 5.676 ± 0.268
1.941LysMet: 1.941 ± 0.139
2.481LysAsn: 2.481 ± 0.158
2.548LysPro: 2.548 ± 0.179
2.211LysGln: 2.211 ± 0.159
2.885LysArg: 2.885 ± 0.191
2.966LysSer: 2.966 ± 0.192
3.101LysThr: 3.101 ± 0.195
4.031LysVal: 4.031 ± 0.267
0.58LysTrp: 0.58 ± 0.073
1.806LysTyr: 1.806 ± 0.192
0.0LysXaa: 0.0 ± 0.0
Leu
7.415LeuAla: 7.415 ± 0.309
0.984LeuCys: 0.984 ± 0.138
5.918LeuAsp: 5.918 ± 0.276
5.406LeuGlu: 5.406 ± 0.247
3.357LeuPhe: 3.357 ± 0.204
5.271LeuGly: 5.271 ± 0.295
1.726LeuHis: 1.726 ± 0.169
3.842LeuIle: 3.842 ± 0.221
5.433LeuLys: 5.433 ± 0.231
7.212LeuLeu: 7.212 ± 0.305
2.494LeuMet: 2.494 ± 0.202
5.163LeuAsn: 5.163 ± 0.29
4.961LeuPro: 4.961 ± 0.267
3.262LeuGln: 3.262 ± 0.267
5.568LeuArg: 5.568 ± 0.22
6.296LeuSer: 6.296 ± 0.274
6.619LeuThr: 6.619 ± 0.366
5.73LeuVal: 5.73 ± 0.292
1.038LeuTrp: 1.038 ± 0.115
3.276LeuTyr: 3.276 ± 0.221
0.0LeuXaa: 0.0 ± 0.0
Met
2.4MetAla: 2.4 ± 0.176
0.27MetCys: 0.27 ± 0.055
1.577MetAsp: 1.577 ± 0.148
1.658MetGlu: 1.658 ± 0.19
1.186MetPhe: 1.186 ± 0.114
1.699MetGly: 1.699 ± 0.167
0.458MetHis: 0.458 ± 0.075
1.173MetIle: 1.173 ± 0.121
1.86MetLys: 1.86 ± 0.172
2.764MetLeu: 2.764 ± 0.214
0.741MetMet: 0.741 ± 0.097
1.267MetAsn: 1.267 ± 0.137
0.93MetPro: 0.93 ± 0.124
0.957MetGln: 0.957 ± 0.109
1.604MetArg: 1.604 ± 0.142
1.928MetSer: 1.928 ± 0.157
1.726MetThr: 1.726 ± 0.155
1.766MetVal: 1.766 ± 0.166
0.364MetTrp: 0.364 ± 0.076
0.984MetTyr: 0.984 ± 0.116
0.0MetXaa: 0.0 ± 0.0
Asn
4.462AsnAla: 4.462 ± 0.267
0.31AsnCys: 0.31 ± 0.06
2.845AsnAsp: 2.845 ± 0.192
2.831AsnGlu: 2.831 ± 0.201
1.968AsnPhe: 1.968 ± 0.144
4.112AsnGly: 4.112 ± 0.232
0.957AsnHis: 0.957 ± 0.117
2.588AsnIle: 2.588 ± 0.206
2.481AsnLys: 2.481 ± 0.186
4.327AsnLeu: 4.327 ± 0.221
1.065AsnMet: 1.065 ± 0.115
2.332AsnAsn: 2.332 ± 0.196
3.02AsnPro: 3.02 ± 0.219
1.645AsnGln: 1.645 ± 0.174
2.629AsnArg: 2.629 ± 0.184
2.845AsnSer: 2.845 ± 0.205
3.155AsnThr: 3.155 ± 0.215
3.559AsnVal: 3.559 ± 0.219
1.011AsnTrp: 1.011 ± 0.136
1.833AsnTyr: 1.833 ± 0.161
0.0AsnXaa: 0.0 ± 0.0
Pro
3.424ProAla: 3.424 ± 0.23
0.175ProCys: 0.175 ± 0.047
3.384ProAsp: 3.384 ± 0.228
3.397ProGlu: 3.397 ± 0.216
1.699ProPhe: 1.699 ± 0.148
2.683ProGly: 2.683 ± 0.185
0.903ProHis: 0.903 ± 0.108
2.278ProIle: 2.278 ± 0.193
2.184ProLys: 2.184 ± 0.18
3.546ProLeu: 3.546 ± 0.218
1.186ProMet: 1.186 ± 0.114
2.049ProAsn: 2.049 ± 0.186
1.699ProPro: 1.699 ± 0.162
1.456ProGln: 1.456 ± 0.142
1.995ProArg: 1.995 ± 0.16
2.508ProSer: 2.508 ± 0.181
3.262ProThr: 3.262 ± 0.205
3.626ProVal: 3.626 ± 0.266
0.566ProTrp: 0.566 ± 0.084
1.51ProTyr: 1.51 ± 0.155
0.0ProXaa: 0.0 ± 0.0
Gln
3.316GlnAla: 3.316 ± 0.287
0.404GlnCys: 0.404 ± 0.077
1.806GlnAsp: 1.806 ± 0.157
1.982GlnGlu: 1.982 ± 0.146
1.631GlnPhe: 1.631 ± 0.137
2.184GlnGly: 2.184 ± 0.171
0.957GlnHis: 0.957 ± 0.127
1.847GlnIle: 1.847 ± 0.156
1.496GlnLys: 1.496 ± 0.158
3.829GlnLeu: 3.829 ± 0.223
1.173GlnMet: 1.173 ± 0.12
1.564GlnAsn: 1.564 ± 0.126
1.618GlnPro: 1.618 ± 0.134
1.833GlnGln: 1.833 ± 0.174
2.481GlnArg: 2.481 ± 0.25
1.793GlnSer: 1.793 ± 0.166
2.157GlnThr: 2.157 ± 0.174
2.373GlnVal: 2.373 ± 0.151
0.526GlnTrp: 0.526 ± 0.081
1.726GlnTyr: 1.726 ± 0.165
0.0GlnXaa: 0.0 ± 0.0
Arg
3.734ArgAla: 3.734 ± 0.199
0.512ArgCys: 0.512 ± 0.085
3.411ArgAsp: 3.411 ± 0.249
3.222ArgGlu: 3.222 ± 0.194
2.588ArgPhe: 2.588 ± 0.172
3.182ArgGly: 3.182 ± 0.266
1.119ArgHis: 1.119 ± 0.113
3.289ArgIle: 3.289 ± 0.194
3.236ArgLys: 3.236 ± 0.217
5.055ArgLeu: 5.055 ± 0.27
1.685ArgMet: 1.685 ± 0.123
3.006ArgAsn: 3.006 ± 0.179
1.995ArgPro: 1.995 ± 0.182
1.739ArgGln: 1.739 ± 0.143
3.303ArgArg: 3.303 ± 0.322
2.818ArgSer: 2.818 ± 0.23
2.966ArgThr: 2.966 ± 0.203
4.004ArgVal: 4.004 ± 0.252
1.119ArgTrp: 1.119 ± 0.131
2.197ArgTyr: 2.197 ± 0.197
0.0ArgXaa: 0.0 ± 0.0
Ser
4.314SerAla: 4.314 ± 0.267
0.404SerCys: 0.404 ± 0.072
3.626SerAsp: 3.626 ± 0.208
3.128SerGlu: 3.128 ± 0.252
2.346SerPhe: 2.346 ± 0.17
4.098SerGly: 4.098 ± 0.265
0.944SerHis: 0.944 ± 0.118
3.478SerIle: 3.478 ± 0.183
3.236SerLys: 3.236 ± 0.242
5.406SerLeu: 5.406 ± 0.269
1.429SerMet: 1.429 ± 0.142
2.561SerAsn: 2.561 ± 0.15
2.494SerPro: 2.494 ± 0.175
2.09SerGln: 2.09 ± 0.173
3.195SerArg: 3.195 ± 0.228
3.357SerSer: 3.357 ± 0.234
3.451SerThr: 3.451 ± 0.228
3.95SerVal: 3.95 ± 0.223
0.917SerTrp: 0.917 ± 0.105
1.941SerTyr: 1.941 ± 0.176
0.0SerXaa: 0.0 ± 0.0
Thr
4.894ThrAla: 4.894 ± 0.317
0.526ThrCys: 0.526 ± 0.092
4.193ThrAsp: 4.193 ± 0.268
3.519ThrGlu: 3.519 ± 0.228
2.777ThrPhe: 2.777 ± 0.225
4.705ThrGly: 4.705 ± 0.32
1.362ThrHis: 1.362 ± 0.123
3.465ThrIle: 3.465 ± 0.256
2.966ThrLys: 2.966 ± 0.17
6.592ThrLeu: 6.592 ± 0.333
1.51ThrMet: 1.51 ± 0.154
2.872ThrAsn: 2.872 ± 0.204
3.114ThrPro: 3.114 ± 0.194
2.494ThrGln: 2.494 ± 0.198
3.168ThrArg: 3.168 ± 0.195
3.182ThrSer: 3.182 ± 0.216
4.638ThrThr: 4.638 ± 0.313
5.19ThrVal: 5.19 ± 0.267
0.903ThrTrp: 0.903 ± 0.102
2.103ThrTyr: 2.103 ± 0.17
0.0ThrXaa: 0.0 ± 0.0
Val
6.053ValAla: 6.053 ± 0.34
0.391ValCys: 0.391 ± 0.078
4.961ValAsp: 4.961 ± 0.265
4.584ValGlu: 4.584 ± 0.264
2.669ValPhe: 2.669 ± 0.15
4.88ValGly: 4.88 ± 0.212
1.416ValHis: 1.416 ± 0.147
4.004ValIle: 4.004 ± 0.208
4.125ValLys: 4.125 ± 0.288
5.77ValLeu: 5.77 ± 0.317
1.726ValMet: 1.726 ± 0.137
3.667ValAsn: 3.667 ± 0.262
3.168ValPro: 3.168 ± 0.251
2.629ValGln: 2.629 ± 0.235
3.411ValArg: 3.411 ± 0.192
4.004ValSer: 4.004 ± 0.243
5.527ValThr: 5.527 ± 0.329
5.379ValVal: 5.379 ± 0.338
0.984ValTrp: 0.984 ± 0.129
2.642ValTyr: 2.642 ± 0.205
0.0ValXaa: 0.0 ± 0.0
Trp
0.998TrpAla: 0.998 ± 0.103
0.148TrpCys: 0.148 ± 0.052
1.254TrpAsp: 1.254 ± 0.13
0.903TrpGlu: 0.903 ± 0.131
0.607TrpPhe: 0.607 ± 0.088
0.782TrpGly: 0.782 ± 0.097
0.404TrpHis: 0.404 ± 0.082
0.768TrpIle: 0.768 ± 0.097
0.809TrpLys: 0.809 ± 0.091
1.591TrpLeu: 1.591 ± 0.145
0.404TrpMet: 0.404 ± 0.071
0.566TrpAsn: 0.566 ± 0.081
0.526TrpPro: 0.526 ± 0.092
0.593TrpGln: 0.593 ± 0.096
0.89TrpArg: 0.89 ± 0.126
0.822TrpSer: 0.822 ± 0.096
0.62TrpThr: 0.62 ± 0.076
1.308TrpVal: 1.308 ± 0.128
0.229TrpTrp: 0.229 ± 0.059
0.593TrpTyr: 0.593 ± 0.095
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.737TyrAla: 2.737 ± 0.199
0.391TyrCys: 0.391 ± 0.083
2.831TyrAsp: 2.831 ± 0.194
1.887TyrGlu: 1.887 ± 0.145
1.55TyrPhe: 1.55 ± 0.138
2.818TyrGly: 2.818 ± 0.211
1.092TyrHis: 1.092 ± 0.138
1.712TyrIle: 1.712 ± 0.141
1.753TyrLys: 1.753 ± 0.144
3.316TyrLeu: 3.316 ± 0.19
1.011TyrMet: 1.011 ± 0.121
2.157TyrAsn: 2.157 ± 0.172
1.699TyrPro: 1.699 ± 0.153
1.766TyrGln: 1.766 ± 0.167
2.44TyrArg: 2.44 ± 0.216
2.184TyrSer: 2.184 ± 0.208
2.251TyrThr: 2.251 ± 0.184
2.723TyrVal: 2.723 ± 0.228
0.512TyrTrp: 0.512 ± 0.079
1.739TyrTyr: 1.739 ± 0.186
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 262 proteins (74178 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski