Amino acid dipeptide frequency for Tenacibaculum phage PTm5

Apart from single amino acid frequencies, one can also calculate so-called amino acid dipeptide frequencies.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 standard amino acids equally abundant, each dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
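The counting behind such a table can be sketched as follows. This is a minimal illustration, not the database's actual pipeline: it assumes overlapping windows of two residues, pairs that never span two proteins, normalization by the total number of dipeptides, and one-letter residue codes. The function name `dipeptide_permille` and the toy sequences are hypothetical.

```python
from collections import Counter

def dipeptide_permille(proteins):
    """Return overlapping dipeptide frequencies in per mille over all proteins."""
    counts = Counter()
    for seq in proteins:
        # Slide a window of length 2 along each protein separately,
        # so no dipeptide is formed across a protein boundary.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (hypothetical sequences, not taken from the PTm5 proteome).
toy = ["MKLLK", "MKL"]
freqs = dipeptide_permille(toy)

# Uniform expectation for 400 equally likely dipeptides, in per mille.
uniform_expectation = 1000.0 / 400
```

Under these assumptions, a dipeptide whose value exceeds `uniform_expectation` would count as overrepresented and one below it as underrepresented.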

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
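As a small worked example of the unit conversion, the LysLeu value from the table (8.596‰) can be turned into a fraction and a percentage, and compared with the uniform expectation of 2.5‰. The helper name `permille_to_fraction` is only illustrative.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

lys_leu = 8.596  # LysLeu value from the table, in per mille

fraction = permille_to_fraction(lys_leu)  # about 0.008596
percent = fraction * 100                  # about 0.8596 %

# Ratio to the uniform expectation of 1/400 = 2.5 per mille.
fold_over_uniform = lys_leu / 2.5
```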

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide and columns the second; each cell is the frequency ± bootstrap error in ‰.

| First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 0.045 ± 0.026 | 0.445 ± 0.083 | 2.257 ± 0.169 | 3.058 ± 0.237 | 1.796 ± 0.17 | 2.851 ± 0.325 | 0.727 ± 0.121 | 2.776 ± 0.251 | 3.949 ± 0.245 | 4.024 ± 0.318 | 0.98 ± 0.116 | 3.014 ± 0.24 | 1.203 ± 0.178 | 1.47 ± 0.147 | 1.618 ± 0.17 | 3.444 ± 0.291 | 3.355 ± 0.292 | 2.984 ± 0.224 | 0.341 ± 0.076 | 2.286 ± 0.164 | 0.0 ± 0.0 |
| Cys | 0.312 ± 0.062 | 0.074 ± 0.032 | 0.921 ± 0.139 | 0.713 ± 0.103 | 0.475 ± 0.078 | 0.668 ± 0.112 | 0.178 ± 0.051 | 0.876 ± 0.112 | 0.891 ± 0.147 | 0.638 ± 0.096 | 0.178 ± 0.05 | 0.564 ± 0.092 | 0.356 ± 0.069 | 0.193 ± 0.051 | 0.282 ± 0.067 | 0.668 ± 0.103 | 0.564 ± 0.077 | 0.698 ± 0.119 | 0.148 ± 0.047 | 0.401 ± 0.081 | 0.0 ± 0.0 |
| Asp | 3.103 ± 0.202 | 0.698 ± 0.104 | 3.979 ± 0.251 | 4.365 ± 0.269 | 3.534 ± 0.213 | 4.261 ± 0.3 | 0.698 ± 0.075 | 5.909 ± 0.301 | 5.33 ± 0.28 | 5.449 ± 0.277 | 1.455 ± 0.142 | 3.727 ± 0.229 | 1.366 ± 0.152 | 0.921 ± 0.125 | 2.227 ± 0.196 | 4.41 ± 0.223 | 3.964 ± 0.204 | 4.677 ± 0.257 | 0.624 ± 0.1 | 3.029 ± 0.241 | 0.0 ± 0.0 |
| Glu | 2.672 ± 0.251 | 0.787 ± 0.129 | 3.415 ± 0.255 | 3.801 ± 0.305 | 3.385 ± 0.225 | 1.633 ± 0.197 | 1.514 ± 0.145 | 5.553 ± 0.294 | 4.053 ± 0.28 | 7.646 ± 0.453 | 1.529 ± 0.165 | 4.395 ± 0.284 | 1.544 ± 0.166 | 2.94 ± 0.183 | 2.598 ± 0.186 | 4.543 ± 0.364 | 3.548 ± 0.249 | 4.528 ± 0.27 | 0.698 ± 0.103 | 3.801 ± 0.246 | 0.0 ± 0.0 |
| Phe | 1.841 ± 0.169 | 0.534 ± 0.091 | 4.068 ± 0.274 | 3.43 ± 0.236 | 1.9 ± 0.169 | 2.806 ± 0.211 | 0.594 ± 0.095 | 3.563 ± 0.233 | 4.573 ± 0.237 | 2.658 ± 0.178 | 1.158 ± 0.139 | 4.068 ± 0.255 | 1.114 ± 0.118 | 0.935 ± 0.117 | 2.004 ± 0.169 | 2.999 ± 0.21 | 3.014 ± 0.221 | 2.88 ± 0.192 | 0.431 ± 0.074 | 1.722 ± 0.187 | 0.0 ± 0.0 |
| Gly | 2.331 ± 0.305 | 0.505 ± 0.089 | 3.385 ± 0.232 | 2.806 ± 0.212 | 2.153 ± 0.154 | 2.747 ± 0.305 | 1.114 ± 0.182 | 3.816 ± 0.277 | 4.113 ± 0.239 | 3.712 ± 0.239 | 1.648 ± 0.156 | 3.637 ± 0.292 | 0.193 ± 0.049 | 1.396 ± 0.204 | 1.737 ± 0.138 | 3.593 ± 0.242 | 4.009 ± 0.337 | 3.667 ± 0.261 | 0.549 ± 0.093 | 2.628 ± 0.28 | 0.0 ± 0.0 |
| His | 0.742 ± 0.1 | 0.223 ± 0.048 | 1.247 ± 0.128 | 0.921 ± 0.132 | 0.861 ± 0.101 | 1.158 ± 0.126 | 0.579 ± 0.095 | 1.826 ± 0.154 | 1.693 ± 0.161 | 1.351 ± 0.127 | 0.371 ± 0.07 | 1.455 ± 0.142 | 0.594 ± 0.082 | 0.609 ± 0.099 | 0.965 ± 0.118 | 1.366 ± 0.141 | 1.396 ± 0.148 | 0.906 ± 0.148 | 0.163 ± 0.047 | 0.861 ± 0.114 | 0.0 ± 0.0 |
| Ile | 3.177 ± 0.243 | 0.683 ± 0.099 | 5.686 ± 0.284 | 5.805 ± 0.282 | 3.029 ± 0.22 | 3.682 ± 0.362 | 1.47 ± 0.153 | 5.701 ± 0.328 | 7.958 ± 0.367 | 6.147 ± 0.314 | 1.693 ± 0.158 | 5.82 ± 0.294 | 2.91 ± 0.223 | 3.192 ± 0.177 | 2.672 ± 0.22 | 5.345 ± 0.312 | 5.048 ± 0.286 | 3.979 ± 0.254 | 0.757 ± 0.115 | 3.073 ± 0.248 | 0.0 ± 0.0 |
| Lys | 3.741 ± 0.227 | 0.861 ± 0.12 | 5.092 ± 0.273 | 6.161 ± 0.377 | 4.009 ± 0.254 | 3.845 ± 0.28 | 2.108 ± 0.16 | 6.562 ± 0.345 | 7.409 ± 0.464 | 8.596 ± 0.401 | 2.257 ± 0.209 | 6.399 ± 0.317 | 2.583 ± 0.194 | 3.727 ± 0.242 | 3.444 ± 0.256 | 6.251 ± 0.304 | 5.553 ± 0.322 | 5.063 ± 0.289 | 1.084 ± 0.156 | 5.345 ± 0.314 | 0.0 ± 0.0 |
| Leu | 3.311 ± 0.274 | 0.831 ± 0.111 | 5.063 ± 0.263 | 5.018 ± 0.309 | 3.415 ± 0.22 | 3.89 ± 0.245 | 1.707 ± 0.158 | 5.196 ± 0.249 | 8.344 ± 0.448 | 6.206 ± 0.373 | 2.093 ± 0.199 | 6.34 ± 0.32 | 2.865 ± 0.212 | 2.94 ± 0.191 | 3.712 ± 0.226 | 6.666 ± 0.387 | 6.102 ± 0.318 | 5.122 ± 0.28 | 0.653 ± 0.095 | 3.905 ± 0.319 | 0.0 ± 0.0 |
| Met | 1.039 ± 0.132 | 0.208 ± 0.054 | 1.336 ± 0.134 | 1.351 ± 0.151 | 1.425 ± 0.123 | 0.965 ± 0.114 | 0.46 ± 0.082 | 1.648 ± 0.151 | 2.286 ± 0.169 | 2.242 ± 0.181 | 0.579 ± 0.083 | 1.559 ± 0.132 | 0.698 ± 0.104 | 0.876 ± 0.125 | 0.727 ± 0.105 | 1.752 ± 0.169 | 1.247 ± 0.149 | 1.425 ± 0.135 | 0.163 ± 0.051 | 1.01 ± 0.126 | 0.0 ± 0.0 |
| Asn | 3.251 ± 0.265 | 0.445 ± 0.082 | 4.024 ± 0.239 | 5.018 ± 0.333 | 2.851 ± 0.184 | 4.246 ± 0.272 | 1.232 ± 0.15 | 6.458 ± 0.37 | 7.409 ± 0.377 | 5.211 ± 0.284 | 1.574 ± 0.158 | 5.419 ± 0.324 | 2.286 ± 0.168 | 2.272 ± 0.177 | 2.851 ± 0.205 | 4.558 ± 0.264 | 5.419 ± 0.432 | 4.38 ± 0.27 | 0.624 ± 0.088 | 2.999 ± 0.212 | 0.0 ± 0.0 |
| Pro | 1.217 ± 0.137 | 0.178 ± 0.053 | 1.143 ± 0.12 | 1.203 ± 0.135 | 1.455 ± 0.171 | 0.327 ± 0.093 | 0.653 ± 0.095 | 2.316 ± 0.17 | 2.851 ± 0.247 | 2.331 ± 0.185 | 0.431 ± 0.08 | 2.613 ± 0.196 | 0.549 ± 0.087 | 0.846 ± 0.104 | 0.861 ± 0.121 | 2.331 ± 0.175 | 2.628 ± 0.244 | 1.737 ± 0.179 | 0.341 ± 0.072 | 1.589 ± 0.152 | 0.0 ± 0.0 |
| Gln | 1.633 ± 0.164 | 0.267 ± 0.068 | 1.47 ± 0.138 | 2.064 ± 0.165 | 1.5 ± 0.151 | 1.44 ± 0.13 | 0.594 ± 0.102 | 2.465 ± 0.181 | 2.851 ± 0.202 | 3.192 ± 0.212 | 0.935 ± 0.132 | 2.212 ± 0.186 | 1.158 ± 0.136 | 1.47 ± 0.188 | 1.321 ± 0.125 | 2.212 ± 0.168 | 2.242 ± 0.196 | 1.9 ± 0.191 | 0.297 ± 0.067 | 1.752 ± 0.137 | 0.0 ± 0.0 |
| Arg | 1.559 ± 0.15 | 0.431 ± 0.079 | 2.004 ± 0.167 | 2.776 ± 0.213 | 1.915 ± 0.158 | 1.707 ± 0.151 | 0.787 ± 0.109 | 2.851 ± 0.193 | 3.519 ± 0.296 | 3.311 ± 0.228 | 0.965 ± 0.111 | 2.836 ± 0.188 | 0.935 ± 0.106 | 1.069 ± 0.141 | 1.425 ± 0.142 | 1.96 ± 0.173 | 2.494 ± 0.161 | 2.583 ± 0.2 | 0.475 ± 0.079 | 2.019 ± 0.166 | 0.0 ± 0.0 |
| Ser | 3.578 ± 0.316 | 0.609 ± 0.087 | 4.617 ± 0.27 | 4.885 ± 0.304 | 3.222 ± 0.199 | 3.682 ± 0.34 | 1.173 ± 0.124 | 5.82 ± 0.276 | 6.34 ± 0.419 | 5.315 ± 0.291 | 1.603 ± 0.173 | 5.196 ± 0.351 | 1.871 ± 0.163 | 1.975 ± 0.195 | 2.227 ± 0.199 | 4.187 ± 0.285 | 4.721 ± 0.277 | 4.825 ± 0.247 | 0.698 ± 0.105 | 3.148 ± 0.198 | 0.0 ± 0.0 |
| Thr | 3.519 ± 0.287 | 0.549 ± 0.087 | 4.142 ± 0.325 | 4.038 ± 0.294 | 3.43 ± 0.238 | 3.133 ± 0.485 | 1.396 ± 0.134 | 5.434 ± 0.306 | 6.325 ± 0.287 | 6.117 ± 0.323 | 1.039 ± 0.131 | 5.107 ± 0.259 | 2.286 ± 0.177 | 2.479 ± 0.185 | 1.989 ± 0.158 | 4.751 ± 0.288 | 4.84 ± 0.385 | 4.127 ± 0.374 | 0.564 ± 0.086 | 2.91 ± 0.187 | 0.0 ± 0.0 |
| Val | 3.103 ± 0.218 | 0.609 ± 0.093 | 5.3 ± 0.265 | 3.994 ± 0.328 | 2.851 ± 0.2 | 3.251 ± 0.277 | 1.114 ± 0.11 | 4.692 ± 0.296 | 4.959 ± 0.24 | 4.38 ± 0.248 | 1.307 ± 0.147 | 4.291 ± 0.241 | 1.767 ± 0.168 | 1.93 ± 0.199 | 2.598 ± 0.209 | 5.003 ± 0.255 | 4.231 ± 0.293 | 4.32 ± 0.283 | 0.52 ± 0.078 | 2.732 ± 0.207 | 0.0 ± 0.0 |
| Trp | 0.341 ± 0.085 | 0.267 ± 0.058 | 0.609 ± 0.092 | 0.371 ± 0.068 | 0.653 ± 0.096 | 0.386 ± 0.096 | 0.223 ± 0.056 | 0.698 ± 0.105 | 0.846 ± 0.115 | 1.024 ± 0.131 | 0.267 ± 0.062 | 0.683 ± 0.104 | 0.0 ± 0.0 | 0.49 ± 0.107 | 0.327 ± 0.077 | 0.698 ± 0.104 | 0.534 ± 0.109 | 0.594 ± 0.097 | 0.089 ± 0.038 | 0.727 ± 0.105 | 0.0 ± 0.0 |
| Tyr | 2.108 ± 0.193 | 0.594 ± 0.09 | 3.964 ± 0.228 | 2.806 ± 0.213 | 2.435 ± 0.203 | 3.058 ± 0.218 | 0.965 ± 0.122 | 3.667 ± 0.251 | 4.395 ± 0.286 | 3.831 ± 0.285 | 0.921 ± 0.127 | 3.192 ± 0.241 | 1.321 ± 0.142 | 1.247 ± 0.123 | 2.049 ± 0.199 | 2.955 ± 0.236 | 3.266 ± 0.241 | 2.554 ± 0.212 | 0.609 ± 0.092 | 2.346 ± 0.222 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 306 proteins (67355 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
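A protein-level bootstrap of this kind might look like the sketch below. It is an illustration under stated assumptions, not the database's code: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the error is taken as the sample standard deviation of the replicate estimates. The function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter

def pair_permille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)

# Toy input (hypothetical sequences, not taken from the PTm5 proteome).
toy = ["MKLLK", "MKL", "MLKKL", "MKKKL"]
error_kl = bootstrap_error(toy, "KL")
```

Resampling whole proteins rather than individual dipeptides keeps the dependence between neighbouring residues within a protein intact in every replicate.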

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski