Amino acid dipeptide frequency for Salmonella phage ViI

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
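As an illustration of what such a calculation can look like, here is a minimal Python sketch that counts overlapping adjacent pairs in a set of protein sequences. It is not the database's own code: the function name, the one-letter sequence input (the table uses three-letter codes), and the choice to normalise by the total number of pairs are all assumptions made for the example.

```python
from collections import Counter

def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Assumption: frequencies are normalised by the total number of pairs;
    the database may use a different denominator.
    """
    counts = Counter()
    for seq in sequences:
        # Overlapping pairs at positions (0,1), (1,2), ...;
        # a pair never spans two different proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Hypothetical toy input, not taken from the phage proteome.
toy = ["MKKA", "AKK"]
freqs = dipeptide_frequencies(toy)
print(freqs["KK"])  # 2 of the 5 pairs are KK
```

Counting overlapping pairs means a protein of length n contributes n - 1 dipeptides.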

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
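The uniform expectation and the over/under comparison can be sketched as follows. The helper names are hypothetical, and the plain comparison against the uniform value is an assumption: the page does not state the exact rule behind the colouring (for example, whether the error estimate is taken into account).

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2

def classify(observed_per_mille, alphabet_size=20):
    """Compare an observed value with the uniform expectation.

    Assumption: a plain greater/less comparison; the page's real
    colouring rule is not documented here.
    """
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"

print(expected_per_mille())  # 2.5 per mille, i.e. 0.25%
print(classify(5.639))       # AlaAla value from the table
print(classify(0.124))       # TrpTrp value from the table
```

With Xaa included, `expected_per_mille(21)` gives the 441-dipeptide expectation instead.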

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
5.639AlaAla: 5.639 ± 0.394
0.622AlaCys: 0.622 ± 0.115
4.25AlaAsp: 4.25 ± 0.354
4.478AlaGlu: 4.478 ± 0.413
2.508AlaPhe: 2.508 ± 0.209
4.519AlaGly: 4.519 ± 0.409
1.389AlaHis: 1.389 ± 0.187
4.395AlaIle: 4.395 ± 0.296
4.105AlaLys: 4.105 ± 0.298
5.535AlaLeu: 5.535 ± 0.36
2.114AlaMet: 2.114 ± 0.193
3.13AlaAsn: 3.13 ± 0.261
2.405AlaPro: 2.405 ± 0.218
2.716AlaGln: 2.716 ± 0.212
3.42AlaArg: 3.42 ± 0.254
3.959AlaSer: 3.959 ± 0.314
3.877AlaThr: 3.877 ± 0.353
4.747AlaVal: 4.747 ± 0.329
0.767AlaTrp: 0.767 ± 0.126
2.467AlaTyr: 2.467 ± 0.255
0.0AlaXaa: 0.0 ± 0.0
Cys
0.746CysAla: 0.746 ± 0.122
0.166CysCys: 0.166 ± 0.061
0.891CysAsp: 0.891 ± 0.156
0.954CysGlu: 0.954 ± 0.151
0.394CysPhe: 0.394 ± 0.088
0.726CysGly: 0.726 ± 0.137
0.456CysHis: 0.456 ± 0.089
0.684CysIle: 0.684 ± 0.118
0.622CysLys: 0.622 ± 0.109
0.601CysLeu: 0.601 ± 0.103
0.29CysMet: 0.29 ± 0.075
0.684CysAsn: 0.684 ± 0.101
0.498CysPro: 0.498 ± 0.092
0.29CysGln: 0.29 ± 0.071
0.435CysArg: 0.435 ± 0.092
0.891CysSer: 0.891 ± 0.138
0.726CysThr: 0.726 ± 0.127
0.954CysVal: 0.954 ± 0.134
0.145CysTrp: 0.145 ± 0.048
0.311CysTyr: 0.311 ± 0.087
0.0CysXaa: 0.0 ± 0.0
Asp
4.561AspAla: 4.561 ± 0.271
0.622AspCys: 0.622 ± 0.115
3.773AspAsp: 3.773 ± 0.282
3.835AspGlu: 3.835 ± 0.3
2.964AspPhe: 2.964 ± 0.226
5.037AspGly: 5.037 ± 0.322
1.037AspHis: 1.037 ± 0.148
4.498AspIle: 4.498 ± 0.302
3.586AspLys: 3.586 ± 0.267
5.804AspLeu: 5.804 ± 0.365
2.052AspMet: 2.052 ± 0.204
2.944AspAsn: 2.944 ± 0.221
2.674AspPro: 2.674 ± 0.234
2.052AspGln: 2.052 ± 0.198
2.114AspArg: 2.114 ± 0.197
3.939AspSer: 3.939 ± 0.282
3.503AspThr: 3.503 ± 0.29
4.457AspVal: 4.457 ± 0.255
1.037AspTrp: 1.037 ± 0.172
3.151AspTyr: 3.151 ± 0.289
0.0AspXaa: 0.0 ± 0.0
Glu
4.208GluAla: 4.208 ± 0.433
0.622GluCys: 0.622 ± 0.107
4.27GluAsp: 4.27 ± 0.3
4.312GluGlu: 4.312 ± 0.373
3.089GluPhe: 3.089 ± 0.231
4.208GluGly: 4.208 ± 0.301
1.347GluHis: 1.347 ± 0.169
4.395GluIle: 4.395 ± 0.282
3.628GluLys: 3.628 ± 0.349
6.198GluLeu: 6.198 ± 0.33
2.052GluMet: 2.052 ± 0.197
3.13GluAsn: 3.13 ± 0.252
1.969GluPro: 1.969 ± 0.221
2.695GluGln: 2.695 ± 0.254
3.669GluArg: 3.669 ± 0.324
3.69GluSer: 3.69 ± 0.31
3.669GluThr: 3.669 ± 0.239
4.229GluVal: 4.229 ± 0.336
1.099GluTrp: 1.099 ± 0.149
2.964GluTyr: 2.964 ± 0.322
0.0GluXaa: 0.0 ± 0.0
Phe
2.343PheAla: 2.343 ± 0.221
0.477PheCys: 0.477 ± 0.108
2.633PheAsp: 2.633 ± 0.232
2.964PheGlu: 2.964 ± 0.289
1.7PhePhe: 1.7 ± 0.198
3.275PheGly: 3.275 ± 0.317
0.891PheHis: 0.891 ± 0.169
2.84PheIle: 2.84 ± 0.253
2.695PheLys: 2.695 ± 0.247
2.778PheLeu: 2.778 ± 0.245
1.285PheMet: 1.285 ± 0.155
2.591PheAsn: 2.591 ± 0.238
1.472PhePro: 1.472 ± 0.179
1.555PheGln: 1.555 ± 0.174
2.301PheArg: 2.301 ± 0.241
3.047PheSer: 3.047 ± 0.224
2.467PheThr: 2.467 ± 0.245
3.068PheVal: 3.068 ± 0.269
0.684PheTrp: 0.684 ± 0.125
1.451PheTyr: 1.451 ± 0.159
0.0PheXaa: 0.0 ± 0.0
Gly
3.649GlyAla: 3.649 ± 0.294
0.933GlyCys: 0.933 ± 0.146
4.167GlyAsp: 4.167 ± 0.279
4.623GlyGlu: 4.623 ± 0.327
2.571GlyPhe: 2.571 ± 0.246
4.83GlyGly: 4.83 ± 0.486
1.41GlyHis: 1.41 ± 0.167
5.079GlyIle: 5.079 ± 0.305
5.058GlyLys: 5.058 ± 0.398
4.851GlyLeu: 4.851 ± 0.302
2.094GlyMet: 2.094 ± 0.199
3.794GlyAsn: 3.794 ± 0.371
1.244GlyPro: 1.244 ± 0.177
2.529GlyGln: 2.529 ± 0.2
3.006GlyArg: 3.006 ± 0.252
4.664GlySer: 4.664 ± 0.394
3.566GlyThr: 3.566 ± 0.38
5.141GlyVal: 5.141 ± 0.37
1.202GlyTrp: 1.202 ± 0.146
2.778GlyTyr: 2.778 ± 0.289
0.0GlyXaa: 0.0 ± 0.0
His
1.037HisAla: 1.037 ± 0.138
0.269HisCys: 0.269 ± 0.086
1.202HisAsp: 1.202 ± 0.149
0.726HisGlu: 0.726 ± 0.103
1.099HisPhe: 1.099 ± 0.163
1.016HisGly: 1.016 ± 0.129
0.498HisHis: 0.498 ± 0.111
1.534HisIle: 1.534 ± 0.235
1.306HisLys: 1.306 ± 0.155
1.658HisLeu: 1.658 ± 0.161
0.539HisMet: 0.539 ± 0.096
0.746HisAsn: 0.746 ± 0.148
0.933HisPro: 0.933 ± 0.141
0.58HisGln: 0.58 ± 0.105
1.057HisArg: 1.057 ± 0.146
1.119HisSer: 1.119 ± 0.16
1.099HisThr: 1.099 ± 0.171
1.368HisVal: 1.368 ± 0.164
0.207HisTrp: 0.207 ± 0.074
0.995HisTyr: 0.995 ± 0.166
0.0HisXaa: 0.0 ± 0.0
Ile
4.416IleAla: 4.416 ± 0.272
0.767IleCys: 0.767 ± 0.11
4.954IleAsp: 4.954 ± 0.289
4.664IleGlu: 4.664 ± 0.327
1.783IlePhe: 1.783 ± 0.181
3.752IleGly: 3.752 ± 0.281
1.43IleHis: 1.43 ± 0.207
3.711IleIle: 3.711 ± 0.342
3.794IleLys: 3.794 ± 0.294
4.561IleLeu: 4.561 ± 0.335
1.783IleMet: 1.783 ± 0.169
3.4IleAsn: 3.4 ± 0.274
3.13IlePro: 3.13 ± 0.258
2.757IleGln: 2.757 ± 0.263
3.296IleArg: 3.296 ± 0.258
3.711IleSer: 3.711 ± 0.262
4.623IleThr: 4.623 ± 0.39
4.229IleVal: 4.229 ± 0.277
0.746IleTrp: 0.746 ± 0.129
2.156IleTyr: 2.156 ± 0.216
0.0IleXaa: 0.0 ± 0.0
Lys
4.353LysAla: 4.353 ± 0.36
0.456LysCys: 0.456 ± 0.098
3.794LysAsp: 3.794 ± 0.289
4.436LysGlu: 4.436 ± 0.423
3.151LysPhe: 3.151 ± 0.251
3.794LysGly: 3.794 ± 0.297
1.016LysHis: 1.016 ± 0.153
3.814LysIle: 3.814 ± 0.242
3.959LysLys: 3.959 ± 0.332
4.975LysLeu: 4.975 ± 0.336
2.28LysMet: 2.28 ± 0.221
2.508LysAsn: 2.508 ± 0.21
2.508LysPro: 2.508 ± 0.223
2.923LysGln: 2.923 ± 0.241
3.213LysArg: 3.213 ± 0.248
4.187LysSer: 4.187 ± 0.285
3.939LysThr: 3.939 ± 0.266
4.498LysVal: 4.498 ± 0.304
0.954LysTrp: 0.954 ± 0.139
2.156LysTyr: 2.156 ± 0.194
0.0LysXaa: 0.0 ± 0.0
Leu
6.136LeuAla: 6.136 ± 0.38
0.705LeuCys: 0.705 ± 0.108
5.12LeuAsp: 5.12 ± 0.291
5.037LeuGlu: 5.037 ± 0.321
3.379LeuPhe: 3.379 ± 0.28
4.975LeuGly: 4.975 ± 0.302
1.347LeuHis: 1.347 ± 0.198
4.063LeuIle: 4.063 ± 0.289
6.074LeuLys: 6.074 ± 0.385
6.24LeuLeu: 6.24 ± 0.428
2.156LeuMet: 2.156 ± 0.205
4.789LeuAsn: 4.789 ± 0.322
3.275LeuPro: 3.275 ± 0.262
2.799LeuGln: 2.799 ± 0.217
3.897LeuArg: 3.897 ± 0.289
5.659LeuSer: 5.659 ± 0.301
4.996LeuThr: 4.996 ± 0.322
5.224LeuVal: 5.224 ± 0.285
0.726LeuTrp: 0.726 ± 0.137
3.234LeuTyr: 3.234 ± 0.271
0.0LeuXaa: 0.0 ± 0.0
Met
2.508MetAla: 2.508 ± 0.277
0.394MetCys: 0.394 ± 0.097
1.555MetAsp: 1.555 ± 0.167
1.513MetGlu: 1.513 ± 0.191
1.389MetPhe: 1.389 ± 0.2
1.596MetGly: 1.596 ± 0.154
0.435MetHis: 0.435 ± 0.104
1.949MetIle: 1.949 ± 0.224
2.343MetLys: 2.343 ± 0.223
2.405MetLeu: 2.405 ± 0.224
1.099MetMet: 1.099 ± 0.152
1.389MetAsn: 1.389 ± 0.171
1.037MetPro: 1.037 ± 0.16
0.995MetGln: 0.995 ± 0.146
1.741MetArg: 1.741 ± 0.185
2.26MetSer: 2.26 ± 0.211
1.866MetThr: 1.866 ± 0.176
1.721MetVal: 1.721 ± 0.202
0.228MetTrp: 0.228 ± 0.072
0.933MetTyr: 0.933 ± 0.107
0.0MetXaa: 0.0 ± 0.0
Asn
3.814AsnAla: 3.814 ± 0.319
0.684AsnCys: 0.684 ± 0.117
3.089AsnAsp: 3.089 ± 0.296
2.653AsnGlu: 2.653 ± 0.28
2.073AsnPhe: 2.073 ± 0.248
4.084AsnGly: 4.084 ± 0.299
0.995AsnHis: 0.995 ± 0.166
3.11AsnIle: 3.11 ± 0.271
3.234AsnLys: 3.234 ± 0.278
3.586AsnLeu: 3.586 ± 0.278
1.679AsnMet: 1.679 ± 0.173
3.275AsnAsn: 3.275 ± 0.332
2.467AsnPro: 2.467 ± 0.244
2.135AsnGln: 2.135 ± 0.211
2.591AsnArg: 2.591 ± 0.253
2.964AsnSer: 2.964 ± 0.281
2.757AsnThr: 2.757 ± 0.306
3.566AsnVal: 3.566 ± 0.298
0.663AsnTrp: 0.663 ± 0.125
2.011AsnTyr: 2.011 ± 0.225
0.0AsnXaa: 0.0 ± 0.0
Pro
2.467ProAla: 2.467 ± 0.263
0.518ProCys: 0.518 ± 0.11
2.923ProAsp: 2.923 ± 0.241
3.317ProGlu: 3.317 ± 0.29
1.721ProPhe: 1.721 ± 0.206
2.529ProGly: 2.529 ± 0.246
0.643ProHis: 0.643 ± 0.122
2.322ProIle: 2.322 ± 0.227
1.969ProLys: 1.969 ± 0.175
3.047ProLeu: 3.047 ± 0.261
0.954ProMet: 0.954 ± 0.129
1.824ProAsn: 1.824 ± 0.201
1.472ProPro: 1.472 ± 0.248
1.285ProGln: 1.285 ± 0.143
1.824ProArg: 1.824 ± 0.187
2.819ProSer: 2.819 ± 0.275
2.508ProThr: 2.508 ± 0.264
2.695ProVal: 2.695 ± 0.234
0.601ProTrp: 0.601 ± 0.102
1.389ProTyr: 1.389 ± 0.179
0.0ProXaa: 0.0 ± 0.0
Gln
2.55GlnAla: 2.55 ± 0.257
0.352GlnCys: 0.352 ± 0.085
2.011GlnAsp: 2.011 ± 0.222
2.322GlnGlu: 2.322 ± 0.243
1.866GlnPhe: 1.866 ± 0.198
2.363GlnGly: 2.363 ± 0.221
0.788GlnHis: 0.788 ± 0.135
2.778GlnIle: 2.778 ± 0.231
2.032GlnLys: 2.032 ± 0.216
3.089GlnLeu: 3.089 ± 0.299
0.995GlnMet: 0.995 ± 0.154
1.575GlnAsn: 1.575 ± 0.169
1.389GlnPro: 1.389 ± 0.177
1.866GlnGln: 1.866 ± 0.259
2.425GlnArg: 2.425 ± 0.22
2.529GlnSer: 2.529 ± 0.295
2.26GlnThr: 2.26 ± 0.19
2.778GlnVal: 2.778 ± 0.233
0.56GlnTrp: 0.56 ± 0.118
1.493GlnTyr: 1.493 ± 0.195
0.0GlnXaa: 0.0 ± 0.0
Arg
3.11ArgAla: 3.11 ± 0.219
0.829ArgCys: 0.829 ± 0.136
2.799ArgAsp: 2.799 ± 0.248
3.047ArgGlu: 3.047 ± 0.265
2.052ArgPhe: 2.052 ± 0.218
2.819ArgGly: 2.819 ± 0.205
1.14ArgHis: 1.14 ± 0.154
3.42ArgIle: 3.42 ± 0.28
3.13ArgLys: 3.13 ± 0.293
4.644ArgLeu: 4.644 ± 0.331
1.555ArgMet: 1.555 ± 0.172
2.55ArgAsn: 2.55 ± 0.246
1.783ArgPro: 1.783 ± 0.239
1.99ArgGln: 1.99 ± 0.189
3.338ArgArg: 3.338 ± 0.285
3.089ArgSer: 3.089 ± 0.265
2.488ArgThr: 2.488 ± 0.237
3.296ArgVal: 3.296 ± 0.316
0.808ArgTrp: 0.808 ± 0.135
2.343ArgTyr: 2.343 ± 0.206
0.0ArgXaa: 0.0 ± 0.0
Ser
3.98SerAla: 3.98 ± 0.354
0.518SerCys: 0.518 ± 0.111
3.939SerAsp: 3.939 ± 0.297
4.063SerGlu: 4.063 ± 0.264
2.819SerPhe: 2.819 ± 0.24
4.83SerGly: 4.83 ± 0.421
0.912SerHis: 0.912 ± 0.162
4.374SerIle: 4.374 ± 0.313
3.649SerLys: 3.649 ± 0.327
5.825SerLeu: 5.825 ± 0.356
1.886SerMet: 1.886 ± 0.219
3.794SerAsn: 3.794 ± 0.32
2.571SerPro: 2.571 ± 0.218
2.26SerGln: 2.26 ± 0.218
2.861SerArg: 2.861 ± 0.242
4.084SerSer: 4.084 ± 0.34
3.835SerThr: 3.835 ± 0.344
4.892SerVal: 4.892 ± 0.317
0.767SerTrp: 0.767 ± 0.114
2.612SerTyr: 2.612 ± 0.251
0.0SerXaa: 0.0 ± 0.0
Thr
4.146ThrAla: 4.146 ± 0.384
0.726ThrCys: 0.726 ± 0.114
3.4ThrAsp: 3.4 ± 0.26
3.959ThrGlu: 3.959 ± 0.319
2.653ThrPhe: 2.653 ± 0.208
4.644ThrGly: 4.644 ± 0.362
0.912ThrHis: 0.912 ± 0.137
4.042ThrIle: 4.042 ± 0.32
3.628ThrLys: 3.628 ± 0.267
4.768ThrLeu: 4.768 ± 0.306
1.202ThrMet: 1.202 ± 0.181
2.488ThrAsn: 2.488 ± 0.203
3.607ThrPro: 3.607 ± 0.305
2.073ThrGln: 2.073 ± 0.188
2.923ThrArg: 2.923 ± 0.24
3.649ThrSer: 3.649 ± 0.385
3.918ThrThr: 3.918 ± 0.342
4.498ThrVal: 4.498 ± 0.373
0.829ThrTrp: 0.829 ± 0.146
1.804ThrTyr: 1.804 ± 0.25
0.0ThrXaa: 0.0 ± 0.0
Val
3.959ValAla: 3.959 ± 0.304
0.871ValCys: 0.871 ± 0.152
5.307ValAsp: 5.307 ± 0.336
5.183ValGlu: 5.183 ± 0.374
2.778ValPhe: 2.778 ± 0.278
4.664ValGly: 4.664 ± 0.346
1.099ValHis: 1.099 ± 0.16
3.897ValIle: 3.897 ± 0.274
5.245ValLys: 5.245 ± 0.353
5.141ValLeu: 5.141 ± 0.376
1.638ValMet: 1.638 ± 0.201
3.649ValAsn: 3.649 ± 0.285
2.405ValPro: 2.405 ± 0.209
2.529ValGln: 2.529 ± 0.245
3.192ValArg: 3.192 ± 0.26
4.996ValSer: 4.996 ± 0.296
4.872ValThr: 4.872 ± 0.448
6.219ValVal: 6.219 ± 0.443
1.182ValTrp: 1.182 ± 0.164
3.11ValTyr: 3.11 ± 0.245
0.0ValXaa: 0.0 ± 0.0
Trp
0.995TrpAla: 0.995 ± 0.158
0.352TrpCys: 0.352 ± 0.09
0.995TrpAsp: 0.995 ± 0.13
1.016TrpGlu: 1.016 ± 0.137
0.788TrpPhe: 0.788 ± 0.119
0.871TrpGly: 0.871 ± 0.119
0.187TrpHis: 0.187 ± 0.069
0.663TrpIle: 0.663 ± 0.118
0.767TrpLys: 0.767 ± 0.133
1.244TrpLeu: 1.244 ± 0.181
0.539TrpMet: 0.539 ± 0.096
0.726TrpAsn: 0.726 ± 0.107
0.394TrpPro: 0.394 ± 0.077
0.352TrpGln: 0.352 ± 0.075
0.85TrpArg: 0.85 ± 0.129
0.788TrpSer: 0.788 ± 0.128
0.643TrpThr: 0.643 ± 0.108
1.099TrpVal: 1.099 ± 0.143
0.124TrpTrp: 0.124 ± 0.053
0.456TrpTyr: 0.456 ± 0.109
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.425TyrAla: 2.425 ± 0.216
0.58TyrCys: 0.58 ± 0.098
2.571TyrAsp: 2.571 ± 0.22
2.343TyrGlu: 2.343 ± 0.214
1.741TyrPhe: 1.741 ± 0.201
2.653TyrGly: 2.653 ± 0.28
1.057TyrHis: 1.057 ± 0.156
2.094TyrIle: 2.094 ± 0.212
2.094TyrLys: 2.094 ± 0.231
2.881TyrLeu: 2.881 ± 0.244
1.14TyrMet: 1.14 ± 0.161
2.446TyrAsn: 2.446 ± 0.191
1.596TyrPro: 1.596 ± 0.166
1.638TyrGln: 1.638 ± 0.189
2.073TyrArg: 2.073 ± 0.218
2.446TyrSer: 2.446 ± 0.194
2.26TyrThr: 2.26 ± 0.306
3.255TyrVal: 3.255 ± 0.256
0.518TyrTrp: 0.518 ± 0.106
1.513TyrTyr: 1.513 ± 0.161
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 208 proteins (48240 amino acids)
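For readers who want to work with the listing above programmatically, the following Python sketch parses rows of the form "5.639AlaAla: 5.639 ± 0.394" into a dictionary. The regular expression and function name are assumptions based only on the row layout shown here, not on the format of the downloadable CSV file.

```python
import re

# Matches e.g. "5.639AlaAla: 5.639 ± 0.394"; the leading repeated value
# is optional, so "AlaAla: 5.639 ± 0.394" is accepted as well.
ROW = re.compile(r"^[\d.]*([A-Z][a-z]{2})([A-Z][a-z]{2}): ([\d.]+) ± ([\d.]+)$")

def parse_rows(lines):
    """Return {(first, second): (value_per_mille, error_per_mille)}."""
    table = {}
    for line in lines:
        match = ROW.match(line.strip())
        if match:  # row labels such as "Ala" do not match and are skipped
            first, second, value, error = match.groups()
            table[(first, second)] = (float(value), float(error))
    return table

sample = ["Ala", "5.639AlaAla: 5.639 ± 0.394", "0.622AlaCys: 0.622 ± 0.115"]
table = parse_rows(sample)
print(table[("Ala", "Ala")])
```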

Note: The error has been estimated by bootstrapping (×100) at the protein level.
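A protein-level bootstrap of this kind could look roughly like the sketch below. It reads "×100" as 100 replicates and reports the standard deviation across replicates as the error; both are assumptions, as are the function names and the one-letter toy sequences. The database's actual procedure may differ.

```python
import random
import statistics
from collections import Counter

def pair_per_mille(sequences, pair):
    """Frequency of one dipeptide, in per mille of all overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(sequences, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)

# Hypothetical toy proteome, not the phage data.
toy = ["MKKA", "AKK", "MAAK", "KKKK"]
err = bootstrap_error(toy, "KK")
print(err)
```

Resampling whole proteins rather than individual residues keeps each protein's internal pair structure intact within a replicate.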

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in UniProt.
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski