Amino acid dipeptide frequency for Synechococcus phage ACG-2014i

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all over-represented dipeptides are marked in red, and under-represented ones are marked in blue in the table.
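The comparison against the uniform expectation can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the database's own code: the function names, the use of overlapping two-residue windows, the normalisation by the total number of dipeptides, and the toy sequences are all hypothetical. Sequences use one-letter codes, whereas the table uses three-letter codes.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (Xaa is left out here).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation: 1 of 400 possible dipeptides, expressed in per mille.
UNIFORM_PERMILLE = 1000.0 / (len(STANDARD) ** 2)  # 2.5


def dipeptide_permille(proteins):
    """Return dipeptide frequencies in per mille over all given sequences.

    Assumption: overlapping windows of two residues are counted within each
    protein, and counts are normalised by the total number of dipeptides.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille):
    """Label a frequency relative to the uniform expectation of 2.5 per mille."""
    if freq_permille > UNIFORM_PERMILLE:
        return "over"
    if freq_permille < UNIFORM_PERMILLE:
        return "under"
    return "as expected"


# Hypothetical toy input, not taken from the phage proteome.
toy = ["MGGSGA", "MAAGG"]
freqs = dipeptide_permille(toy)
```

With the values from the table, `classify(5.656)` (AlaAla) gives "over" and `classify(0.514)` (AlaCys) gives "under".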

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
5.656AlaAla: 5.656 ± 0.501
0.514AlaCys: 0.514 ± 0.109
4.147AlaAsp: 4.147 ± 0.257
3.533AlaGlu: 3.533 ± 0.286
2.787AlaPhe: 2.787 ± 0.256
6.751AlaGly: 6.751 ± 0.567
0.896AlaHis: 0.896 ± 0.149
4.097AlaIle: 4.097 ± 0.263
3.649AlaLys: 3.649 ± 0.334
4.562AlaLeu: 4.562 ± 0.36
1.161AlaMet: 1.161 ± 0.176
4.429AlaAsn: 4.429 ± 0.363
2.671AlaPro: 2.671 ± 0.219
2.173AlaGln: 2.173 ± 0.193
2.272AlaArg: 2.272 ± 0.155
5.358AlaSer: 5.358 ± 0.457
5.822AlaThr: 5.822 ± 0.444
4.114AlaVal: 4.114 ± 0.283
0.564AlaTrp: 0.564 ± 0.083
2.223AlaTyr: 2.223 ± 0.206
0.0AlaXaa: 0.0 ± 0.0
Cys
0.564CysAla: 0.564 ± 0.105
0.1CysCys: 0.1 ± 0.044
0.514CysAsp: 0.514 ± 0.097
0.431CysGlu: 0.431 ± 0.09
0.448CysPhe: 0.448 ± 0.096
0.564CysGly: 0.564 ± 0.136
0.216CysHis: 0.216 ± 0.071
0.431CysIle: 0.431 ± 0.096
0.63CysLys: 0.63 ± 0.147
0.597CysLeu: 0.597 ± 0.122
0.133CysMet: 0.133 ± 0.054
0.481CysAsn: 0.481 ± 0.079
0.398CysPro: 0.398 ± 0.091
0.464CysGln: 0.464 ± 0.086
0.332CysArg: 0.332 ± 0.072
0.547CysSer: 0.547 ± 0.104
0.448CysThr: 0.448 ± 0.086
0.68CysVal: 0.68 ± 0.111
0.1CysTrp: 0.1 ± 0.04
0.332CysTyr: 0.332 ± 0.078
0.0CysXaa: 0.0 ± 0.0
Asp
4.81AspAla: 4.81 ± 0.441
0.581AspCys: 0.581 ± 0.108
4.329AspAsp: 4.329 ± 0.358
3.898AspGlu: 3.898 ± 0.337
3.085AspPhe: 3.085 ± 0.26
5.889AspGly: 5.889 ± 0.354
0.995AspHis: 0.995 ± 0.162
4.495AspIle: 4.495 ± 0.285
3.351AspLys: 3.351 ± 0.345
4.843AspLeu: 4.843 ± 0.293
1.36AspMet: 1.36 ± 0.201
3.666AspAsn: 3.666 ± 0.325
3.085AspPro: 3.085 ± 0.299
2.123AspGln: 2.123 ± 0.23
2.389AspArg: 2.389 ± 0.249
4.429AspSer: 4.429 ± 0.288
5.026AspThr: 5.026 ± 0.31
4.147AspVal: 4.147 ± 0.24
1.161AspTrp: 1.161 ± 0.177
3.218AspTyr: 3.218 ± 0.204
0.0AspXaa: 0.0 ± 0.0
Glu
3.052GluAla: 3.052 ± 0.307
0.547GluCys: 0.547 ± 0.101
3.699GluAsp: 3.699 ± 0.272
4.495GluGlu: 4.495 ± 0.441
3.035GluPhe: 3.035 ± 0.213
3.948GluGly: 3.948 ± 0.3
0.846GluHis: 0.846 ± 0.162
4.23GluIle: 4.23 ± 0.311
3.218GluLys: 3.218 ± 0.351
4.744GluLeu: 4.744 ± 0.378
1.576GluMet: 1.576 ± 0.292
2.853GluAsn: 2.853 ± 0.266
1.609GluPro: 1.609 ± 0.194
2.04GluGln: 2.04 ± 0.212
2.372GluArg: 2.372 ± 0.294
3.45GluSer: 3.45 ± 0.298
3.931GluThr: 3.931 ± 0.239
4.479GluVal: 4.479 ± 0.254
1.028GluTrp: 1.028 ± 0.154
2.704GluTyr: 2.704 ± 0.25
0.0GluXaa: 0.0 ± 0.0
Phe
3.168PheAla: 3.168 ± 0.284
0.63PheCys: 0.63 ± 0.117
3.251PheAsp: 3.251 ± 0.211
2.488PheGlu: 2.488 ± 0.192
1.675PhePhe: 1.675 ± 0.205
2.919PheGly: 2.919 ± 0.27
0.697PheHis: 0.697 ± 0.124
2.289PheIle: 2.289 ± 0.227
2.272PheLys: 2.272 ± 0.196
3.135PheLeu: 3.135 ± 0.259
0.78PheMet: 0.78 ± 0.148
2.803PheAsn: 2.803 ± 0.244
1.443PhePro: 1.443 ± 0.171
1.642PheGln: 1.642 ± 0.166
1.725PheArg: 1.725 ± 0.176
3.069PheSer: 3.069 ± 0.198
3.649PheThr: 3.649 ± 0.286
2.853PheVal: 2.853 ± 0.203
0.531PheTrp: 0.531 ± 0.11
1.941PheTyr: 1.941 ± 0.189
0.0PheXaa: 0.0 ± 0.0
Gly
6.038GlyAla: 6.038 ± 0.447
0.498GlyCys: 0.498 ± 0.099
5.723GlyAsp: 5.723 ± 0.455
4.197GlyGlu: 4.197 ± 0.287
2.903GlyPhe: 2.903 ± 0.255
9.654GlyGly: 9.654 ± 1.298
1.095GlyHis: 1.095 ± 0.123
4.329GlyIle: 4.329 ± 0.296
3.666GlyLys: 3.666 ± 0.318
4.562GlyLeu: 4.562 ± 0.286
1.393GlyMet: 1.393 ± 0.199
5.557GlyAsn: 5.557 ± 0.511
1.99GlyPro: 1.99 ± 0.198
2.77GlyGln: 2.77 ± 0.23
2.986GlyArg: 2.986 ± 0.27
8.476GlySer: 8.476 ± 0.708
7.232GlyThr: 7.232 ± 0.616
5.49GlyVal: 5.49 ± 0.296
0.879GlyTrp: 0.879 ± 0.121
3.218GlyTyr: 3.218 ± 0.225
0.0GlyXaa: 0.0 ± 0.0
His
0.829HisAla: 0.829 ± 0.133
0.199HisCys: 0.199 ± 0.062
0.945HisAsp: 0.945 ± 0.172
0.929HisGlu: 0.929 ± 0.177
0.663HisPhe: 0.663 ± 0.113
0.979HisGly: 0.979 ± 0.143
0.265HisHis: 0.265 ± 0.079
0.713HisIle: 0.713 ± 0.11
0.962HisLys: 0.962 ± 0.16
1.028HisLeu: 1.028 ± 0.169
0.216HisMet: 0.216 ± 0.062
0.73HisAsn: 0.73 ± 0.133
0.796HisPro: 0.796 ± 0.156
0.464HisGln: 0.464 ± 0.091
0.63HisArg: 0.63 ± 0.098
0.846HisSer: 0.846 ± 0.126
0.73HisThr: 0.73 ± 0.118
0.846HisVal: 0.846 ± 0.158
0.1HisTrp: 0.1 ± 0.046
0.713HisTyr: 0.713 ± 0.135
0.0HisXaa: 0.0 ± 0.0
Ile
4.13IleAla: 4.13 ± 0.347
0.514IleCys: 0.514 ± 0.118
4.794IleAsp: 4.794 ± 0.343
4.362IleGlu: 4.362 ± 0.252
2.239IlePhe: 2.239 ± 0.168
4.562IleGly: 4.562 ± 0.32
0.663IleHis: 0.663 ± 0.113
4.296IleIle: 4.296 ± 0.298
4.097IleLys: 4.097 ± 0.3
4.18IleLeu: 4.18 ± 0.291
0.962IleMet: 0.962 ± 0.138
4.28IleAsn: 4.28 ± 0.237
2.538IlePro: 2.538 ± 0.24
2.505IleGln: 2.505 ± 0.199
2.571IleArg: 2.571 ± 0.193
5.109IleSer: 5.109 ± 0.307
6.469IleThr: 6.469 ± 0.447
4.396IleVal: 4.396 ± 0.412
0.481IleTrp: 0.481 ± 0.087
2.007IleTyr: 2.007 ± 0.195
0.0IleXaa: 0.0 ± 0.0
Lys
2.787LysAla: 2.787 ± 0.323
0.431LysCys: 0.431 ± 0.088
3.019LysAsp: 3.019 ± 0.257
3.566LysGlu: 3.566 ± 0.349
2.521LysPhe: 2.521 ± 0.275
2.919LysGly: 2.919 ± 0.233
1.012LysHis: 1.012 ± 0.184
4.263LysIle: 4.263 ± 0.363
3.848LysLys: 3.848 ± 0.515
4.362LysLeu: 4.362 ± 0.349
1.393LysMet: 1.393 ± 0.24
2.87LysAsn: 2.87 ± 0.268
1.609LysPro: 1.609 ± 0.226
2.289LysGln: 2.289 ± 0.23
2.339LysArg: 2.339 ± 0.286
3.533LysSer: 3.533 ± 0.303
3.699LysThr: 3.699 ± 0.307
3.367LysVal: 3.367 ± 0.278
0.581LysTrp: 0.581 ± 0.108
2.787LysTyr: 2.787 ± 0.264
0.0LysXaa: 0.0 ± 0.0
Leu
4.429LeuAla: 4.429 ± 0.269
0.547LeuCys: 0.547 ± 0.118
5.407LeuAsp: 5.407 ± 0.289
4.495LeuGlu: 4.495 ± 0.34
2.571LeuPhe: 2.571 ± 0.205
4.926LeuGly: 4.926 ± 0.287
1.062LeuHis: 1.062 ± 0.176
4.014LeuIle: 4.014 ± 0.272
4.197LeuLys: 4.197 ± 0.371
4.81LeuLeu: 4.81 ± 0.401
1.543LeuMet: 1.543 ± 0.198
4.479LeuAsn: 4.479 ± 0.258
3.384LeuPro: 3.384 ± 0.238
2.488LeuGln: 2.488 ± 0.217
3.118LeuArg: 3.118 ± 0.203
5.291LeuSer: 5.291 ± 0.302
6.386LeuThr: 6.386 ± 0.383
4.18LeuVal: 4.18 ± 0.245
0.63LeuTrp: 0.63 ± 0.096
2.87LeuTyr: 2.87 ± 0.206
0.0LeuXaa: 0.0 ± 0.0
Met
1.393MetAla: 1.393 ± 0.184
0.182MetCys: 0.182 ± 0.065
1.045MetAsp: 1.045 ± 0.152
0.879MetGlu: 0.879 ± 0.163
0.796MetPhe: 0.796 ± 0.158
0.995MetGly: 0.995 ± 0.153
0.232MetHis: 0.232 ± 0.078
0.945MetIle: 0.945 ± 0.174
1.493MetLys: 1.493 ± 0.253
2.073MetLeu: 2.073 ± 0.277
0.68MetMet: 0.68 ± 0.141
1.277MetAsn: 1.277 ± 0.199
1.062MetPro: 1.062 ± 0.144
0.63MetGln: 0.63 ± 0.119
0.879MetArg: 0.879 ± 0.156
1.277MetSer: 1.277 ± 0.199
1.327MetThr: 1.327 ± 0.193
1.178MetVal: 1.178 ± 0.147
0.265MetTrp: 0.265 ± 0.079
0.78MetTyr: 0.78 ± 0.138
0.0MetXaa: 0.0 ± 0.0
Asn
3.649AsnAla: 3.649 ± 0.365
0.531AsnCys: 0.531 ± 0.081
3.649AsnAsp: 3.649 ± 0.222
3.367AsnGlu: 3.367 ± 0.272
2.82AsnPhe: 2.82 ± 0.225
5.391AsnGly: 5.391 ± 0.451
0.746AsnHis: 0.746 ± 0.116
4.545AsnIle: 4.545 ± 0.377
2.903AsnLys: 2.903 ± 0.217
4.147AsnLeu: 4.147 ± 0.269
1.045AsnMet: 1.045 ± 0.167
3.716AsnAsn: 3.716 ± 0.264
3.201AsnPro: 3.201 ± 0.244
2.206AsnGln: 2.206 ± 0.183
2.239AsnArg: 2.239 ± 0.199
4.661AsnSer: 4.661 ± 0.432
5.125AsnThr: 5.125 ± 0.381
4.263AsnVal: 4.263 ± 0.277
0.796AsnTrp: 0.796 ± 0.1
2.687AsnTyr: 2.687 ± 0.286
0.017AsnXaa: 0.017 ± 0.014
Pro
2.671ProAla: 2.671 ± 0.235
0.166ProCys: 0.166 ± 0.057
3.069ProAsp: 3.069 ± 0.26
2.355ProGlu: 2.355 ± 0.231
1.808ProPhe: 1.808 ± 0.168
3.367ProGly: 3.367 ± 0.247
0.547ProHis: 0.547 ± 0.116
2.14ProIle: 2.14 ± 0.182
1.742ProLys: 1.742 ± 0.214
2.389ProLeu: 2.389 ± 0.207
0.663ProMet: 0.663 ± 0.154
2.355ProAsn: 2.355 ± 0.198
1.808ProPro: 1.808 ± 0.187
1.31ProGln: 1.31 ± 0.204
1.327ProArg: 1.327 ± 0.181
3.434ProSer: 3.434 ± 0.278
3.649ProThr: 3.649 ± 0.225
2.753ProVal: 2.753 ± 0.193
0.382ProTrp: 0.382 ± 0.081
1.626ProTyr: 1.626 ± 0.172
0.0ProXaa: 0.0 ± 0.0
Gln
2.306GlnAla: 2.306 ± 0.225
0.249GlnCys: 0.249 ± 0.07
2.04GlnAsp: 2.04 ± 0.236
2.339GlnGlu: 2.339 ± 0.236
1.775GlnPhe: 1.775 ± 0.156
2.306GlnGly: 2.306 ± 0.228
0.647GlnHis: 0.647 ± 0.096
2.588GlnIle: 2.588 ± 0.237
1.924GlnLys: 1.924 ± 0.22
2.654GlnLeu: 2.654 ± 0.209
0.713GlnMet: 0.713 ± 0.139
2.107GlnAsn: 2.107 ± 0.258
1.178GlnPro: 1.178 ± 0.171
1.294GlnGln: 1.294 ± 0.147
1.427GlnArg: 1.427 ± 0.144
2.405GlnSer: 2.405 ± 0.226
2.107GlnThr: 2.107 ± 0.252
2.588GlnVal: 2.588 ± 0.201
0.746GlnTrp: 0.746 ± 0.102
1.924GlnTyr: 1.924 ± 0.163
0.0GlnXaa: 0.0 ± 0.0
Arg
2.505ArgAla: 2.505 ± 0.172
0.365ArgCys: 0.365 ± 0.068
2.156ArgAsp: 2.156 ± 0.193
1.957ArgGlu: 1.957 ± 0.23
1.791ArgPhe: 1.791 ± 0.196
2.853ArgGly: 2.853 ± 0.216
0.448ArgHis: 0.448 ± 0.105
2.687ArgIle: 2.687 ± 0.2
2.422ArgLys: 2.422 ± 0.192
3.633ArgLeu: 3.633 ± 0.305
1.062ArgMet: 1.062 ± 0.148
2.04ArgAsn: 2.04 ± 0.17
1.344ArgPro: 1.344 ± 0.139
1.41ArgGln: 1.41 ± 0.128
1.725ArgArg: 1.725 ± 0.165
2.355ArgSer: 2.355 ± 0.171
2.472ArgThr: 2.472 ± 0.221
2.272ArgVal: 2.272 ± 0.198
0.547ArgTrp: 0.547 ± 0.098
2.073ArgTyr: 2.073 ± 0.183
0.0ArgXaa: 0.0 ± 0.0
Ser
5.341SerAla: 5.341 ± 0.276
0.547SerCys: 0.547 ± 0.1
4.562SerAsp: 4.562 ± 0.271
3.517SerGlu: 3.517 ± 0.26
3.317SerPhe: 3.317 ± 0.304
9.156SerGly: 9.156 ± 0.85
0.813SerHis: 0.813 ± 0.119
5.573SerIle: 5.573 ± 0.451
3.5SerLys: 3.5 ± 0.299
5.225SerLeu: 5.225 ± 0.298
1.261SerMet: 1.261 ± 0.164
5.009SerAsn: 5.009 ± 0.401
3.002SerPro: 3.002 ± 0.263
2.77SerGln: 2.77 ± 0.205
2.14SerArg: 2.14 ± 0.186
6.9SerSer: 6.9 ± 0.622
5.607SerThr: 5.607 ± 0.391
4.794SerVal: 4.794 ± 0.386
0.697SerTrp: 0.697 ± 0.133
3.085SerTyr: 3.085 ± 0.262
0.0SerXaa: 0.0 ± 0.0
Thr
6.121ThrAla: 6.121 ± 0.528
0.647ThrCys: 0.647 ± 0.112
5.175ThrAsp: 5.175 ± 0.325
3.964ThrGlu: 3.964 ± 0.275
3.599ThrPhe: 3.599 ± 0.299
7.232ThrGly: 7.232 ± 0.611
0.896ThrHis: 0.896 ± 0.128
5.772ThrIle: 5.772 ± 0.488
2.87ThrLys: 2.87 ± 0.284
5.656ThrLeu: 5.656 ± 0.373
0.962ThrMet: 0.962 ± 0.154
4.744ThrAsn: 4.744 ± 0.397
3.616ThrPro: 3.616 ± 0.217
2.488ThrGln: 2.488 ± 0.257
2.422ThrArg: 2.422 ± 0.169
5.756ThrSer: 5.756 ± 0.485
6.552ThrThr: 6.552 ± 0.546
5.756ThrVal: 5.756 ± 0.453
0.763ThrTrp: 0.763 ± 0.118
3.616ThrTyr: 3.616 ± 0.238
0.017ThrXaa: 0.017 ± 0.014
Val
4.462ValAla: 4.462 ± 0.297
0.581ValCys: 0.581 ± 0.101
5.192ValAsp: 5.192 ± 0.323
4.047ValGlu: 4.047 ± 0.303
2.82ValPhe: 2.82 ± 0.217
4.893ValGly: 4.893 ± 0.348
0.68ValHis: 0.68 ± 0.121
4.23ValIle: 4.23 ± 0.275
3.467ValLys: 3.467 ± 0.34
4.346ValLeu: 4.346 ± 0.286
1.327ValMet: 1.327 ± 0.166
4.462ValAsn: 4.462 ± 0.296
2.87ValPro: 2.87 ± 0.213
2.156ValGln: 2.156 ± 0.194
2.787ValArg: 2.787 ± 0.238
6.303ValSer: 6.303 ± 0.513
4.313ValThr: 4.313 ± 0.341
5.092ValVal: 5.092 ± 0.388
0.647ValTrp: 0.647 ± 0.118
2.588ValTyr: 2.588 ± 0.255
0.0ValXaa: 0.0 ± 0.0
Trp
0.829TrpAla: 0.829 ± 0.131
0.1TrpCys: 0.1 ± 0.041
1.078TrpAsp: 1.078 ± 0.149
0.697TrpGlu: 0.697 ± 0.121
0.348TrpPhe: 0.348 ± 0.076
0.78TrpGly: 0.78 ± 0.14
0.166TrpHis: 0.166 ± 0.058
0.663TrpIle: 0.663 ± 0.114
0.647TrpLys: 0.647 ± 0.123
0.514TrpLeu: 0.514 ± 0.114
0.332TrpMet: 0.332 ± 0.076
0.863TrpAsn: 0.863 ± 0.11
0.232TrpPro: 0.232 ± 0.072
0.498TrpGln: 0.498 ± 0.112
0.547TrpArg: 0.547 ± 0.083
0.763TrpSer: 0.763 ± 0.116
0.995TrpThr: 0.995 ± 0.135
0.796TrpVal: 0.796 ± 0.126
0.083TrpTrp: 0.083 ± 0.035
0.547TrpTyr: 0.547 ± 0.084
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.637TyrAla: 2.637 ± 0.21
0.481TyrCys: 0.481 ± 0.113
3.085TyrAsp: 3.085 ± 0.25
2.339TyrGlu: 2.339 ± 0.202
1.957TyrPhe: 1.957 ± 0.194
2.571TyrGly: 2.571 ± 0.23
0.663TyrHis: 0.663 ± 0.129
2.853TyrIle: 2.853 ± 0.2
2.339TyrLys: 2.339 ± 0.246
3.367TyrLeu: 3.367 ± 0.207
0.813TyrMet: 0.813 ± 0.156
2.986TyrAsn: 2.986 ± 0.221
1.725TyrPro: 1.725 ± 0.17
1.592TyrGln: 1.592 ± 0.176
1.891TyrArg: 1.891 ± 0.167
2.853TyrSer: 2.853 ± 0.217
3.085TyrThr: 3.085 ± 0.23
3.218TyrVal: 3.218 ± 0.257
0.498TyrTrp: 0.498 ± 0.087
1.841TyrTyr: 1.841 ± 0.205
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.017XaaGlu: 0.017 ± 0.014
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.017XaaTyr: 0.017 ± 0.014
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 212 proteins (60288 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
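A protein-level bootstrap of this kind could look roughly like the sketch below. It is an assumption-laden illustration rather than the procedure actually used for the table: whole proteins are resampled with replacement 100 times, and the standard deviation of the resulting per mille frequencies is taken as the error. The function names, the fixed seed, and the choice of standard deviation as the error measure are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Estimate mean and spread of one dipeptide frequency (per mille).

    Assumption: each replicate draws len(proteins) whole proteins with
    replacement, and the error is the standard deviation across replicates.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    per_protein = [dipeptide_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        if total:  # skip replicates that contain no dipeptides at all
            estimates.append(1000.0 * hits / total)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Hypothetical toy input, not taken from the phage proteome.
mean_gg, err_gg = bootstrap_error(["MGGSGA", "MAAGG", "GGGG"], "GG")
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, so the spread reflects variation between proteins.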

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski