Amino acid dipepetide frequency for Ralstonia phage phiRSL1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.109AlaAla: 10.109 ± 0.561
0.918AlaCys: 0.918 ± 0.107
4.913AlaAsp: 4.913 ± 0.279
4.791AlaGlu: 4.791 ± 0.343
3.442AlaPhe: 3.442 ± 0.236
5.817AlaGly: 5.817 ± 0.433
1.822AlaHis: 1.822 ± 0.173
3.685AlaIle: 3.685 ± 0.235
4.764AlaLys: 4.764 ± 0.398
8.341AlaLeu: 8.341 ± 0.406
1.984AlaMet: 1.984 ± 0.183
3.55AlaAsn: 3.55 ± 0.286
4.899AlaPro: 4.899 ± 0.292
4.413AlaGln: 4.413 ± 0.233
5.318AlaArg: 5.318 ± 0.271
6.276AlaSer: 6.276 ± 0.354
6.492AlaThr: 6.492 ± 0.617
7.383AlaVal: 7.383 ± 0.319
1.134AlaTrp: 1.134 ± 0.168
2.956AlaTyr: 2.956 ± 0.193
0.0AlaXaa: 0.0 ± 0.0
Cys
0.891CysAla: 0.891 ± 0.119
0.189CysCys: 0.189 ± 0.05
0.756CysAsp: 0.756 ± 0.096
0.445CysGlu: 0.445 ± 0.093
0.364CysPhe: 0.364 ± 0.077
0.985CysGly: 0.985 ± 0.148
0.324CysHis: 0.324 ± 0.076
0.499CysIle: 0.499 ± 0.103
0.418CysLys: 0.418 ± 0.079
0.837CysLeu: 0.837 ± 0.118
0.337CysMet: 0.337 ± 0.076
0.256CysAsn: 0.256 ± 0.056
0.675CysPro: 0.675 ± 0.11
0.418CysGln: 0.418 ± 0.081
0.688CysArg: 0.688 ± 0.09
0.769CysSer: 0.769 ± 0.112
0.702CysThr: 0.702 ± 0.107
0.567CysVal: 0.567 ± 0.085
0.135CysTrp: 0.135 ± 0.046
0.297CysTyr: 0.297 ± 0.073
0.0CysXaa: 0.0 ± 0.0
Asp
4.602AspAla: 4.602 ± 0.271
0.499AspCys: 0.499 ± 0.087
3.833AspAsp: 3.833 ± 0.646
4.008AspGlu: 4.008 ± 0.537
2.47AspPhe: 2.47 ± 0.17
4.332AspGly: 4.332 ± 0.301
1.161AspHis: 1.161 ± 0.151
2.105AspIle: 2.105 ± 0.16
2.281AspLys: 2.281 ± 0.253
5.547AspLeu: 5.547 ± 0.346
1.377AspMet: 1.377 ± 0.138
1.862AspAsn: 1.862 ± 0.163
3.806AspPro: 3.806 ± 0.256
2.699AspGln: 2.699 ± 0.175
3.172AspArg: 3.172 ± 0.249
2.983AspSer: 2.983 ± 0.214
3.347AspThr: 3.347 ± 0.216
4.035AspVal: 4.035 ± 0.249
0.918AspTrp: 0.918 ± 0.127
1.701AspTyr: 1.701 ± 0.151
0.0AspXaa: 0.0 ± 0.0
Glu
5.088GluAla: 5.088 ± 0.31
0.54GluCys: 0.54 ± 0.099
3.765GluAsp: 3.765 ± 0.478
3.347GluGlu: 3.347 ± 0.325
2.119GluPhe: 2.119 ± 0.212
2.753GluGly: 2.753 ± 0.219
1.35GluHis: 1.35 ± 0.146
1.795GluIle: 1.795 ± 0.164
2.51GluLys: 2.51 ± 0.189
5.21GluLeu: 5.21 ± 0.359
1.066GluMet: 1.066 ± 0.112
1.336GluAsn: 1.336 ± 0.119
1.943GluPro: 1.943 ± 0.175
3.01GluGln: 3.01 ± 0.303
3.752GluArg: 3.752 ± 0.269
2.308GluSer: 2.308 ± 0.199
2.186GluThr: 2.186 ± 0.161
4.076GluVal: 4.076 ± 0.29
0.958GluTrp: 0.958 ± 0.139
1.566GluTyr: 1.566 ± 0.162
0.0GluXaa: 0.0 ± 0.0
Phe
3.01PheAla: 3.01 ± 0.204
0.432PheCys: 0.432 ± 0.088
2.497PheAsp: 2.497 ± 0.225
1.633PheGlu: 1.633 ± 0.156
1.282PhePhe: 1.282 ± 0.143
2.402PheGly: 2.402 ± 0.169
0.945PheHis: 0.945 ± 0.113
1.431PheIle: 1.431 ± 0.153
2.092PheLys: 2.092 ± 0.205
2.78PheLeu: 2.78 ± 0.195
1.255PheMet: 1.255 ± 0.124
1.93PheAsn: 1.93 ± 0.163
1.674PhePro: 1.674 ± 0.159
1.728PheGln: 1.728 ± 0.138
1.66PheArg: 1.66 ± 0.143
2.173PheSer: 2.173 ± 0.174
2.78PheThr: 2.78 ± 0.202
2.807PheVal: 2.807 ± 0.192
0.405PheTrp: 0.405 ± 0.069
1.35PheTyr: 1.35 ± 0.132
0.0PheXaa: 0.0 ± 0.0
Gly
6.262GlyAla: 6.262 ± 0.424
0.769GlyCys: 0.769 ± 0.118
3.563GlyAsp: 3.563 ± 0.264
2.686GlyGlu: 2.686 ± 0.221
2.537GlyPhe: 2.537 ± 0.187
5.183GlyGly: 5.183 ± 0.523
1.431GlyHis: 1.431 ± 0.136
2.969GlyIle: 2.969 ± 0.271
3.455GlyLys: 3.455 ± 0.299
5.534GlyLeu: 5.534 ± 0.294
1.755GlyMet: 1.755 ± 0.162
2.888GlyAsn: 2.888 ± 0.327
2.794GlyPro: 2.794 ± 0.174
3.023GlyGln: 3.023 ± 0.195
3.685GlyArg: 3.685 ± 0.259
4.562GlySer: 4.562 ± 0.415
5.507GlyThr: 5.507 ± 0.555
5.48GlyVal: 5.48 ± 0.295
0.972GlyTrp: 0.972 ± 0.109
2.47GlyTyr: 2.47 ± 0.172
0.0GlyXaa: 0.0 ± 0.0
His
1.849HisAla: 1.849 ± 0.175
0.405HisCys: 0.405 ± 0.08
1.026HisAsp: 1.026 ± 0.131
1.161HisGlu: 1.161 ± 0.137
0.891HisPhe: 0.891 ± 0.128
1.552HisGly: 1.552 ± 0.154
0.729HisHis: 0.729 ± 0.113
1.107HisIle: 1.107 ± 0.104
1.161HisLys: 1.161 ± 0.127
2.348HisLeu: 2.348 ± 0.201
0.769HisMet: 0.769 ± 0.104
0.864HisAsn: 0.864 ± 0.106
1.201HisPro: 1.201 ± 0.143
1.012HisGln: 1.012 ± 0.124
1.498HisArg: 1.498 ± 0.172
1.404HisSer: 1.404 ± 0.124
1.35HisThr: 1.35 ± 0.15
1.687HisVal: 1.687 ± 0.181
0.378HisTrp: 0.378 ± 0.062
0.702HisTyr: 0.702 ± 0.106
0.0HisXaa: 0.0 ± 0.0
Ile
3.738IleAla: 3.738 ± 0.239
0.472IleCys: 0.472 ± 0.071
2.834IleAsp: 2.834 ± 0.193
2.402IleGlu: 2.402 ± 0.191
1.201IlePhe: 1.201 ± 0.114
2.618IleGly: 2.618 ± 0.231
0.972IleHis: 0.972 ± 0.133
1.714IleIle: 1.714 ± 0.14
2.132IleLys: 2.132 ± 0.183
3.064IleLeu: 3.064 ± 0.231
1.12IleMet: 1.12 ± 0.127
1.97IleAsn: 1.97 ± 0.196
2.456IlePro: 2.456 ± 0.158
2.335IleGln: 2.335 ± 0.166
2.524IleArg: 2.524 ± 0.195
3.064IleSer: 3.064 ± 0.216
3.32IleThr: 3.32 ± 0.228
3.32IleVal: 3.32 ± 0.186
0.459IleTrp: 0.459 ± 0.081
1.12IleTyr: 1.12 ± 0.126
0.0IleXaa: 0.0 ± 0.0
Lys
5.547LysAla: 5.547 ± 0.389
0.432LysCys: 0.432 ± 0.085
2.929LysAsp: 2.929 ± 0.254
2.915LysGlu: 2.915 ± 0.303
1.606LysPhe: 1.606 ± 0.155
3.185LysGly: 3.185 ± 0.3
0.85LysHis: 0.85 ± 0.116
1.943LysIle: 1.943 ± 0.201
3.927LysLys: 3.927 ± 0.398
4.17LysLeu: 4.17 ± 0.314
1.714LysMet: 1.714 ± 0.174
1.782LysAsn: 1.782 ± 0.174
2.78LysPro: 2.78 ± 0.233
2.2LysGln: 2.2 ± 0.167
2.996LysArg: 2.996 ± 0.288
2.686LysSer: 2.686 ± 0.203
2.861LysThr: 2.861 ± 0.225
3.28LysVal: 3.28 ± 0.255
1.08LysTrp: 1.08 ± 0.114
1.606LysTyr: 1.606 ± 0.165
0.0LysXaa: 0.0 ± 0.0
Leu
7.463LeuAla: 7.463 ± 0.387
0.972LeuCys: 0.972 ± 0.111
4.994LeuAsp: 4.994 ± 0.316
4.818LeuGlu: 4.818 ± 0.3
2.834LeuPhe: 2.834 ± 0.216
5.439LeuGly: 5.439 ± 0.273
2.24LeuHis: 2.24 ± 0.17
3.671LeuIle: 3.671 ± 0.213
4.872LeuLys: 4.872 ± 0.338
7.49LeuLeu: 7.49 ± 0.358
2.092LeuMet: 2.092 ± 0.174
4.224LeuAsn: 4.224 ± 0.283
5.318LeuPro: 5.318 ± 0.268
4.062LeuGln: 4.062 ± 0.258
5.412LeuArg: 5.412 ± 0.375
6.424LeuSer: 6.424 ± 0.349
5.749LeuThr: 5.749 ± 0.28
5.803LeuVal: 5.803 ± 0.277
1.255LeuTrp: 1.255 ± 0.159
2.888LeuTyr: 2.888 ± 0.21
0.0LeuXaa: 0.0 ± 0.0
Met
2.672MetAla: 2.672 ± 0.192
0.27MetCys: 0.27 ± 0.057
1.485MetAsp: 1.485 ± 0.17
1.471MetGlu: 1.471 ± 0.143
0.756MetPhe: 0.756 ± 0.097
1.458MetGly: 1.458 ± 0.152
0.54MetHis: 0.54 ± 0.077
0.796MetIle: 0.796 ± 0.113
1.444MetLys: 1.444 ± 0.147
2.065MetLeu: 2.065 ± 0.177
0.58MetMet: 0.58 ± 0.096
1.147MetAsn: 1.147 ± 0.124
1.363MetPro: 1.363 ± 0.12
1.012MetGln: 1.012 ± 0.1
1.404MetArg: 1.404 ± 0.161
1.822MetSer: 1.822 ± 0.145
1.943MetThr: 1.943 ± 0.156
1.66MetVal: 1.66 ± 0.148
0.391MetTrp: 0.391 ± 0.074
0.715MetTyr: 0.715 ± 0.102
0.0MetXaa: 0.0 ± 0.0
Asn
3.631AsnAla: 3.631 ± 0.242
0.391AsnCys: 0.391 ± 0.09
1.836AsnAsp: 1.836 ± 0.153
1.593AsnGlu: 1.593 ± 0.13
1.323AsnPhe: 1.323 ± 0.153
4.089AsnGly: 4.089 ± 0.348
0.823AsnHis: 0.823 ± 0.108
1.957AsnIle: 1.957 ± 0.2
1.916AsnLys: 1.916 ± 0.165
3.509AsnLeu: 3.509 ± 0.242
0.796AsnMet: 0.796 ± 0.107
1.674AsnAsn: 1.674 ± 0.222
2.699AsnPro: 2.699 ± 0.209
1.741AsnGln: 1.741 ± 0.216
2.173AsnArg: 2.173 ± 0.167
2.821AsnSer: 2.821 ± 0.241
2.699AsnThr: 2.699 ± 0.263
3.455AsnVal: 3.455 ± 0.282
0.54AsnTrp: 0.54 ± 0.092
1.255AsnTyr: 1.255 ± 0.128
0.0AsnXaa: 0.0 ± 0.0
Pro
5.115ProAla: 5.115 ± 0.345
0.472ProCys: 0.472 ± 0.094
3.01ProAsp: 3.01 ± 0.205
2.794ProGlu: 2.794 ± 0.253
1.795ProPhe: 1.795 ± 0.159
3.199ProGly: 3.199 ± 0.213
1.417ProHis: 1.417 ± 0.142
2.186ProIle: 2.186 ± 0.171
2.578ProLys: 2.578 ± 0.212
4.049ProLeu: 4.049 ± 0.242
0.972ProMet: 0.972 ± 0.12
2.213ProAsn: 2.213 ± 0.189
2.578ProPro: 2.578 ± 0.214
2.254ProGln: 2.254 ± 0.196
2.767ProArg: 2.767 ± 0.223
4.008ProSer: 4.008 ± 0.27
4.562ProThr: 4.562 ± 0.297
3.995ProVal: 3.995 ± 0.26
0.445ProTrp: 0.445 ± 0.081
1.417ProTyr: 1.417 ± 0.14
0.0ProXaa: 0.0 ± 0.0
Gln
4.602GlnAla: 4.602 ± 0.226
0.324GlnCys: 0.324 ± 0.066
2.146GlnAsp: 2.146 ± 0.153
2.051GlnGlu: 2.051 ± 0.193
2.254GlnPhe: 2.254 ± 0.175
2.767GlnGly: 2.767 ± 0.189
1.12GlnHis: 1.12 ± 0.137
2.429GlnIle: 2.429 ± 0.182
1.93GlnLys: 1.93 ± 0.19
4.292GlnLeu: 4.292 ± 0.275
1.431GlnMet: 1.431 ± 0.146
1.566GlnAsn: 1.566 ± 0.199
2.119GlnPro: 2.119 ± 0.16
2.996GlnGln: 2.996 ± 0.271
2.902GlnArg: 2.902 ± 0.234
2.902GlnSer: 2.902 ± 0.203
2.74GlnThr: 2.74 ± 0.221
3.765GlnVal: 3.765 ± 0.223
0.769GlnTrp: 0.769 ± 0.106
1.741GlnTyr: 1.741 ± 0.16
0.0GlnXaa: 0.0 ± 0.0
Arg
4.953ArgAla: 4.953 ± 0.341
0.837ArgCys: 0.837 ± 0.1
2.753ArgAsp: 2.753 ± 0.207
3.05ArgGlu: 3.05 ± 0.242
2.119ArgPhe: 2.119 ± 0.204
3.496ArgGly: 3.496 ± 0.243
1.593ArgHis: 1.593 ± 0.163
3.104ArgIle: 3.104 ± 0.204
2.659ArgLys: 2.659 ± 0.203
6.154ArgLeu: 6.154 ± 0.363
1.363ArgMet: 1.363 ± 0.145
2.362ArgAsn: 2.362 ± 0.207
2.686ArgPro: 2.686 ± 0.227
3.064ArgGln: 3.064 ± 0.238
4.764ArgArg: 4.764 ± 0.41
3.131ArgSer: 3.131 ± 0.196
3.765ArgThr: 3.765 ± 0.239
4.278ArgVal: 4.278 ± 0.304
1.174ArgTrp: 1.174 ± 0.134
1.809ArgTyr: 1.809 ± 0.182
0.0ArgXaa: 0.0 ± 0.0
Ser
5.574SerAla: 5.574 ± 0.323
0.81SerCys: 0.81 ± 0.113
3.617SerAsp: 3.617 ± 0.259
2.551SerGlu: 2.551 ± 0.208
2.632SerPhe: 2.632 ± 0.204
5.493SerGly: 5.493 ± 0.548
1.431SerHis: 1.431 ± 0.137
2.902SerIle: 2.902 ± 0.221
3.266SerLys: 3.266 ± 0.238
5.426SerLeu: 5.426 ± 0.262
1.809SerMet: 1.809 ± 0.156
2.672SerAsn: 2.672 ± 0.284
3.077SerPro: 3.077 ± 0.251
2.456SerGln: 2.456 ± 0.186
3.819SerArg: 3.819 ± 0.262
4.481SerSer: 4.481 ± 0.418
4.94SerThr: 4.94 ± 0.377
5.088SerVal: 5.088 ± 0.319
0.985SerTrp: 0.985 ± 0.107
2.375SerTyr: 2.375 ± 0.21
0.0SerXaa: 0.0 ± 0.0
Thr
6.87ThrAla: 6.87 ± 0.565
0.675ThrCys: 0.675 ± 0.096
3.415ThrAsp: 3.415 ± 0.213
2.645ThrGlu: 2.645 ± 0.187
2.483ThrPhe: 2.483 ± 0.186
5.156ThrGly: 5.156 ± 0.611
1.525ThrHis: 1.525 ± 0.147
3.415ThrIle: 3.415 ± 0.316
3.01ThrLys: 3.01 ± 0.242
6.505ThrLeu: 6.505 ± 0.436
1.498ThrMet: 1.498 ± 0.131
3.05ThrAsn: 3.05 ± 0.288
3.671ThrPro: 3.671 ± 0.24
2.875ThrGln: 2.875 ± 0.239
3.118ThrArg: 3.118 ± 0.214
4.967ThrSer: 4.967 ± 0.38
5.318ThrThr: 5.318 ± 0.536
5.587ThrVal: 5.587 ± 0.482
0.958ThrTrp: 0.958 ± 0.105
2.159ThrTyr: 2.159 ± 0.162
0.0ThrXaa: 0.0 ± 0.0
Val
7.18ValAla: 7.18 ± 0.451
0.607ValCys: 0.607 ± 0.085
4.494ValAsp: 4.494 ± 0.272
3.792ValGlu: 3.792 ± 0.253
2.389ValPhe: 2.389 ± 0.195
4.643ValGly: 4.643 ± 0.34
1.889ValHis: 1.889 ± 0.146
3.118ValIle: 3.118 ± 0.241
3.86ValLys: 3.86 ± 0.284
6.478ValLeu: 6.478 ± 0.331
1.822ValMet: 1.822 ± 0.175
3.617ValAsn: 3.617 ± 0.264
4.197ValPro: 4.197 ± 0.25
3.428ValGln: 3.428 ± 0.221
4.602ValArg: 4.602 ± 0.258
5.021ValSer: 5.021 ± 0.331
5.237ValThr: 5.237 ± 0.424
5.925ValVal: 5.925 ± 0.304
1.282ValTrp: 1.282 ± 0.149
2.24ValTyr: 2.24 ± 0.167
0.0ValXaa: 0.0 ± 0.0
Trp
1.431TrpAla: 1.431 ± 0.15
0.216TrpCys: 0.216 ± 0.059
0.756TrpAsp: 0.756 ± 0.105
0.769TrpGlu: 0.769 ± 0.102
0.499TrpPhe: 0.499 ± 0.072
0.729TrpGly: 0.729 ± 0.107
0.27TrpHis: 0.27 ± 0.068
0.769TrpIle: 0.769 ± 0.105
0.648TrpLys: 0.648 ± 0.091
1.687TrpLeu: 1.687 ± 0.159
0.459TrpMet: 0.459 ± 0.09
0.702TrpAsn: 0.702 ± 0.108
0.553TrpPro: 0.553 ± 0.106
0.675TrpGln: 0.675 ± 0.094
0.81TrpArg: 0.81 ± 0.108
1.039TrpSer: 1.039 ± 0.129
1.188TrpThr: 1.188 ± 0.155
1.242TrpVal: 1.242 ± 0.139
0.364TrpTrp: 0.364 ± 0.07
0.364TrpTyr: 0.364 ± 0.073
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.416TyrAla: 2.416 ± 0.191
0.324TyrCys: 0.324 ± 0.062
2.119TyrAsp: 2.119 ± 0.173
1.755TyrGlu: 1.755 ± 0.167
1.269TyrPhe: 1.269 ± 0.118
2.227TyrGly: 2.227 ± 0.182
0.661TyrHis: 0.661 ± 0.101
1.242TyrIle: 1.242 ± 0.135
1.566TyrLys: 1.566 ± 0.154
2.551TyrLeu: 2.551 ± 0.192
0.783TyrMet: 0.783 ± 0.101
1.255TyrAsn: 1.255 ± 0.137
1.336TyrPro: 1.336 ± 0.117
1.444TyrGln: 1.444 ± 0.147
2.051TyrArg: 2.051 ± 0.197
2.564TyrSer: 2.564 ± 0.2
2.213TyrThr: 2.213 ± 0.194
2.389TyrVal: 2.389 ± 0.177
0.567TyrTrp: 0.567 ± 0.083
1.066TyrTyr: 1.066 ± 0.125
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 343 proteins (74095 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski