Amino acid dipeptide frequency for Yersinia phage fHe-Yen9-04

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each of the 400 dipeptides would have a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red, and underrepresented ones are marked in blue in the table.
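As a rough illustration of the calculation described above, the following minimal sketch counts overlapping residue pairs within each protein and compares the result with the uniform expectation of 1/400. The function names, the mapping of non-standard letters to X, and the choice to normalise by the total number of dipeptides are assumptions made for this example; the database may use a different procedure or denominator.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids (one-letter codes)
UNIFORM_EXPECTED = 1000.0 / 400     # 2.5 per mille if all 400 dipeptides were equally likely


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumption: overlapping pairs are counted inside each protein only
    (no pair spans two proteins) and any non-standard letter is mapped to 'X'.
    """
    counts = Counter()
    for seq in proteins:
        cleaned = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        for first, second in zip(cleaned, cleaned[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_per_mille):
    """Label a frequency relative to the uniform expectation (2.5 per mille)."""
    if freq_per_mille > UNIFORM_EXPECTED:
        return "over"
    if freq_per_mille < UNIFORM_EXPECTED:
        return "under"
    return "as expected"


if __name__ == "__main__":
    toy = ["AAC", "AC"]              # toy input, not real phage proteins
    for dp, freq in sorted(dipeptide_per_mille(toy).items()):
        print(dp, round(freq, 3), classify(freq))
```

With the values from the table, `classify(3.476)` (AlaAla) gives "over" and `classify(0.511)` (AlaCys) gives "under".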

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
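The unit conversion can be written out as a short sketch. The helper names are hypothetical and only illustrate the arithmetic.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1 per mille = 0.1%)."""
    return value * 0.1


if __name__ == "__main__":
    ala_ala = 3.476  # AlaAla value taken from the table, in per mille
    print(per_mille_to_fraction(ala_ala))
    print(per_mille_to_percent(ala_ala))
```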

For more information, see the sequence space article on Wikipedia.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 3.476 ± 0.267 | 0.511 ± 0.075 | 2.76 ± 0.194 | 3.114 ± 0.254 | 2.1 ± 0.158 | 3.123 ± 0.204 | 0.753 ± 0.076 | 3.56 ± 0.171 | 3.411 ± 0.192 | 4.127 ± 0.185 | 1.422 ± 0.125 | 3.011 ± 0.192 | 1.561 ± 0.155 | 1.747 ± 0.133 | 1.896 ± 0.13 | 3.067 ± 0.241 | 3.048 ± 0.278 | 3.197 ± 0.192 | 0.53 ± 0.066 | 2.017 ± 0.153 | 0.0 ± 0.0
Cys | 0.455 ± 0.07 | 0.167 ± 0.043 | 0.725 ± 0.094 | 0.725 ± 0.089 | 0.483 ± 0.066 | 0.939 ± 0.103 | 0.232 ± 0.045 | 0.836 ± 0.086 | 0.697 ± 0.096 | 0.827 ± 0.091 | 0.325 ± 0.062 | 0.781 ± 0.086 | 0.483 ± 0.08 | 0.409 ± 0.067 | 0.39 ± 0.058 | 0.911 ± 0.098 | 0.799 ± 0.081 | 0.688 ± 0.089 | 0.186 ± 0.044 | 0.511 ± 0.065 | 0.0 ± 0.0
Asp | 3.132 ± 0.199 | 0.66 ± 0.079 | 4.647 ± 0.271 | 5.939 ± 0.305 | 3.327 ± 0.206 | 4.405 ± 0.228 | 1.022 ± 0.091 | 5.409 ± 0.216 | 4.052 ± 0.235 | 4.814 ± 0.212 | 1.757 ± 0.122 | 4.099 ± 0.219 | 2.035 ± 0.135 | 1.608 ± 0.12 | 1.989 ± 0.132 | 4.396 ± 0.198 | 3.522 ± 0.18 | 4.127 ± 0.188 | 0.846 ± 0.087 | 3.216 ± 0.188 | 0.0 ± 0.0
Glu | 2.993 ± 0.172 | 0.994 ± 0.11 | 4.182 ± 0.235 | 4.721 ± 0.26 | 4.006 ± 0.179 | 2.844 ± 0.186 | 1.757 ± 0.146 | 5.298 ± 0.27 | 4.192 ± 0.23 | 6.998 ± 0.295 | 2.333 ± 0.157 | 4.043 ± 0.207 | 1.71 ± 0.125 | 2.64 ± 0.179 | 2.844 ± 0.175 | 4.22 ± 0.23 | 3.392 ± 0.148 | 4.498 ± 0.205 | 0.799 ± 0.08 | 3.55 ± 0.213 | 0.0 ± 0.0
Phe | 1.682 ± 0.111 | 0.604 ± 0.082 | 3.718 ± 0.206 | 3.364 ± 0.184 | 1.868 ± 0.138 | 2.946 ± 0.182 | 0.818 ± 0.087 | 4.238 ± 0.219 | 3.151 ± 0.197 | 2.937 ± 0.17 | 1.292 ± 0.116 | 3.504 ± 0.181 | 1.171 ± 0.104 | 1.394 ± 0.125 | 1.329 ± 0.098 | 3.364 ± 0.186 | 2.64 ± 0.14 | 2.928 ± 0.167 | 0.744 ± 0.077 | 2.073 ± 0.115 | 0.0 ± 0.0
Gly | 2.649 ± 0.221 | 0.762 ± 0.089 | 3.411 ± 0.182 | 3.225 ± 0.167 | 2.268 ± 0.177 | 2.946 ± 0.199 | 0.948 ± 0.099 | 4.554 ± 0.243 | 4.192 ± 0.215 | 4.387 ± 0.227 | 1.292 ± 0.122 | 4.257 ± 0.264 | 0.595 ± 0.077 | 1.85 ± 0.145 | 2.184 ± 0.147 | 4.907 ± 0.241 | 5.056 ± 0.426 | 3.597 ± 0.233 | 0.669 ± 0.08 | 3.03 ± 0.186 | 0.0 ± 0.0
His | 0.994 ± 0.089 | 0.242 ± 0.047 | 1.357 ± 0.116 | 1.524 ± 0.108 | 1.004 ± 0.121 | 1.245 ± 0.107 | 0.428 ± 0.068 | 1.273 ± 0.106 | 1.31 ± 0.118 | 1.58 ± 0.118 | 0.502 ± 0.065 | 1.366 ± 0.144 | 0.678 ± 0.081 | 0.539 ± 0.076 | 0.613 ± 0.077 | 1.255 ± 0.111 | 0.976 ± 0.091 | 1.125 ± 0.1 | 0.251 ± 0.053 | 0.79 ± 0.085 | 0.0 ± 0.0
Ile | 3.857 ± 0.219 | 0.985 ± 0.104 | 5.391 ± 0.211 | 6.088 ± 0.279 | 3.132 ± 0.169 | 4.08 ± 0.23 | 1.692 ± 0.145 | 6.301 ± 0.281 | 6.041 ± 0.297 | 6.19 ± 0.244 | 1.868 ± 0.152 | 5.363 ± 0.248 | 2.835 ± 0.178 | 3.011 ± 0.161 | 3.318 ± 0.163 | 6.246 ± 0.268 | 4.982 ± 0.256 | 4.777 ± 0.181 | 0.855 ± 0.092 | 2.89 ± 0.166 | 0.0 ± 0.0
Lys | 2.77 ± 0.175 | 0.781 ± 0.115 | 4.173 ± 0.209 | 4.322 ± 0.244 | 3.346 ± 0.212 | 2.742 ± 0.213 | 1.543 ± 0.113 | 5.456 ± 0.259 | 4.47 ± 0.296 | 6.088 ± 0.273 | 2.193 ± 0.136 | 4.74 ± 0.214 | 1.98 ± 0.129 | 2.621 ± 0.18 | 2.686 ± 0.173 | 4.257 ± 0.195 | 3.848 ± 0.186 | 4.452 ± 0.211 | 0.604 ± 0.084 | 3.894 ± 0.204 | 0.0 ± 0.0
Leu | 4.192 ± 0.222 | 0.948 ± 0.097 | 5.484 ± 0.229 | 5.446 ± 0.256 | 3.076 ± 0.176 | 3.792 ± 0.215 | 1.394 ± 0.115 | 5.576 ± 0.254 | 5.865 ± 0.284 | 6.041 ± 0.269 | 1.998 ± 0.15 | 5.8 ± 0.213 | 3.03 ± 0.179 | 2.881 ± 0.175 | 3.355 ± 0.164 | 6.45 ± 0.234 | 5.0 ± 0.263 | 4.982 ± 0.195 | 0.846 ± 0.088 | 3.931 ± 0.229 | 0.0 ± 0.0
Met | 1.645 ± 0.112 | 0.325 ± 0.057 | 1.58 ± 0.118 | 1.552 ± 0.138 | 1.422 ± 0.123 | 1.366 ± 0.127 | 0.362 ± 0.056 | 2.063 ± 0.14 | 2.509 ± 0.171 | 1.738 ± 0.119 | 0.864 ± 0.084 | 2.203 ± 0.134 | 0.678 ± 0.085 | 0.883 ± 0.09 | 0.985 ± 0.091 | 2.008 ± 0.108 | 1.524 ± 0.112 | 1.403 ± 0.1 | 0.316 ± 0.05 | 1.441 ± 0.136 | 0.0 ± 0.0
Asn | 3.327 ± 0.198 | 0.79 ± 0.098 | 3.95 ± 0.214 | 4.759 ± 0.241 | 2.965 ± 0.176 | 5.047 ± 0.216 | 1.245 ± 0.105 | 6.143 ± 0.259 | 4.461 ± 0.248 | 5.214 ± 0.2 | 1.664 ± 0.112 | 5.102 ± 0.271 | 2.509 ± 0.201 | 1.877 ± 0.146 | 2.463 ± 0.13 | 5.075 ± 0.258 | 4.805 ± 0.259 | 4.312 ± 0.262 | 0.576 ± 0.078 | 3.011 ± 0.189 | 0.0 ± 0.0
Pro | 1.506 ± 0.134 | 0.335 ± 0.056 | 2.435 ± 0.149 | 2.705 ± 0.179 | 1.664 ± 0.127 | 1.125 ± 0.112 | 0.483 ± 0.066 | 2.472 ± 0.143 | 1.794 ± 0.118 | 2.11 ± 0.135 | 0.771 ± 0.083 | 2.082 ± 0.154 | 0.734 ± 0.103 | 1.143 ± 0.112 | 0.874 ± 0.103 | 2.314 ± 0.128 | 2.156 ± 0.169 | 2.24 ± 0.153 | 0.26 ± 0.052 | 1.31 ± 0.106 | 0.0 ± 0.0
Gln | 1.71 ± 0.125 | 0.465 ± 0.067 | 1.915 ± 0.133 | 2.398 ± 0.173 | 1.608 ± 0.114 | 1.626 ± 0.132 | 0.744 ± 0.08 | 2.612 ± 0.137 | 2.054 ± 0.153 | 3.151 ± 0.194 | 1.19 ± 0.099 | 2.454 ± 0.145 | 0.911 ± 0.098 | 1.524 ± 0.138 | 1.357 ± 0.111 | 2.231 ± 0.161 | 1.803 ± 0.121 | 1.877 ± 0.122 | 0.455 ± 0.067 | 2.528 ± 0.18 | 0.0 ± 0.0
Arg | 1.552 ± 0.142 | 0.335 ± 0.047 | 2.184 ± 0.142 | 2.37 ± 0.172 | 1.747 ± 0.127 | 2.203 ± 0.145 | 0.632 ± 0.082 | 3.206 ± 0.173 | 2.751 ± 0.187 | 3.011 ± 0.168 | 1.078 ± 0.107 | 2.565 ± 0.146 | 1.087 ± 0.089 | 1.329 ± 0.105 | 1.524 ± 0.124 | 2.537 ± 0.17 | 2.286 ± 0.161 | 2.324 ± 0.145 | 0.483 ± 0.07 | 1.877 ± 0.169 | 0.0 ± 0.0
Ser | 3.68 ± 0.213 | 0.539 ± 0.071 | 4.266 ± 0.206 | 4.266 ± 0.199 | 3.179 ± 0.179 | 4.498 ± 0.251 | 1.143 ± 0.108 | 6.134 ± 0.244 | 4.322 ± 0.211 | 5.344 ± 0.205 | 1.487 ± 0.126 | 4.712 ± 0.226 | 2.175 ± 0.124 | 2.203 ± 0.148 | 2.519 ± 0.162 | 5.158 ± 0.289 | 7.184 ± 0.299 | 4.647 ± 0.206 | 0.994 ± 0.101 | 3.03 ± 0.186 | 0.0 ± 0.0
Thr | 3.476 ± 0.281 | 0.483 ± 0.064 | 4.052 ± 0.207 | 4.117 ± 0.188 | 3.179 ± 0.216 | 4.721 ± 0.406 | 1.162 ± 0.102 | 5.418 ± 0.25 | 3.838 ± 0.218 | 5.307 ± 0.382 | 1.534 ± 0.112 | 4.192 ± 0.289 | 2.212 ± 0.149 | 1.812 ± 0.158 | 1.924 ± 0.14 | 4.647 ± 0.284 | 4.015 ± 0.422 | 4.879 ± 0.316 | 0.753 ± 0.106 | 3.104 ± 0.2 | 0.0 ± 0.0
Val | 2.844 ± 0.165 | 0.632 ± 0.068 | 4.359 ± 0.227 | 3.764 ± 0.21 | 2.482 ± 0.142 | 3.29 ± 0.177 | 1.385 ± 0.116 | 4.694 ± 0.237 | 3.718 ± 0.198 | 6.543 ± 0.239 | 1.831 ± 0.149 | 4.006 ± 0.224 | 2.398 ± 0.185 | 3.048 ± 0.184 | 2.398 ± 0.145 | 4.508 ± 0.207 | 3.922 ± 0.409 | 4.089 ± 0.189 | 0.539 ± 0.079 | 3.253 ± 0.188 | 0.0 ± 0.0
Trp | 0.502 ± 0.078 | 0.158 ± 0.037 | 0.818 ± 0.093 | 0.641 ± 0.085 | 0.52 ± 0.065 | 0.66 ± 0.076 | 0.251 ± 0.047 | 0.911 ± 0.098 | 0.92 ± 0.092 | 0.669 ± 0.072 | 0.39 ± 0.06 | 0.985 ± 0.091 | 0.177 ± 0.041 | 0.307 ± 0.054 | 0.372 ± 0.055 | 0.669 ± 0.082 | 0.734 ± 0.083 | 0.734 ± 0.1 | 0.102 ± 0.032 | 0.762 ± 0.083 | 0.0 ± 0.0
Tyr | 1.989 ± 0.137 | 0.855 ± 0.083 | 3.606 ± 0.164 | 2.974 ± 0.194 | 2.389 ± 0.14 | 3.179 ± 0.212 | 1.06 ± 0.093 | 3.746 ± 0.194 | 3.216 ± 0.171 | 3.011 ± 0.149 | 1.069 ± 0.11 | 4.034 ± 0.246 | 1.552 ± 0.126 | 1.784 ± 0.13 | 2.063 ± 0.148 | 3.262 ± 0.21 | 3.132 ± 0.183 | 2.825 ± 0.226 | 0.474 ± 0.063 | 2.547 ± 0.143 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 531 proteins (107596 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
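Bootstrapping at the protein level can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is taken as the error. This is only a sketch under stated assumptions. The function names, the fixed seed, the normalisation by total dipeptide count, and the use of the standard deviation as the reported error are all choices made for this example and are not confirmed by the source.

```python
import random
import statistics
from collections import Counter


def dipeptide_freqs(proteins):
    """Per mille frequency of each overlapping residue pair within proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency over resamples.

    Each resample draws len(proteins) whole proteins with replacement,
    so the protein, not the residue, is the resampling unit.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freqs(sample).get(dipeptide, 0.0))
    return statistics.mean(estimates), statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["MKLAAG", "MAAKL", "MSTAA", "MKKLS"]  # toy sequences, not real data
    mean, err = bootstrap_error(toy, "AA")
    print(f"AA: {mean:.3f} ± {err:.3f}")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimated error reflects variation between proteins.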

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski