Amino acid dipepetide frequency for Prochlorococcus phage Syn33

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.494AlaAla: 6.494 ± 0.569
0.553AlaCys: 0.553 ± 0.129
3.871AlaAsp: 3.871 ± 0.335
4.014AlaGlu: 4.014 ± 0.31
3.104AlaPhe: 3.104 ± 0.238
6.832AlaGly: 6.832 ± 0.618
0.731AlaHis: 0.731 ± 0.1
4.442AlaIle: 4.442 ± 0.346
3.532AlaLys: 3.532 ± 0.335
4.87AlaLeu: 4.87 ± 0.349
1.445AlaMet: 1.445 ± 0.2
3.978AlaAsn: 3.978 ± 0.347
2.426AlaPro: 2.426 ± 0.193
2.444AlaGln: 2.444 ± 0.212
2.676AlaArg: 2.676 ± 0.227
5.405AlaSer: 5.405 ± 0.389
5.851AlaThr: 5.851 ± 0.566
4.727AlaVal: 4.727 ± 0.382
0.696AlaTrp: 0.696 ± 0.128
2.301AlaTyr: 2.301 ± 0.194
0.0AlaXaa: 0.0 ± 0.0
Cys
0.66CysAla: 0.66 ± 0.115
0.089CysCys: 0.089 ± 0.04
0.571CysAsp: 0.571 ± 0.138
0.446CysGlu: 0.446 ± 0.09
0.428CysPhe: 0.428 ± 0.099
0.553CysGly: 0.553 ± 0.111
0.25CysHis: 0.25 ± 0.079
0.517CysIle: 0.517 ± 0.117
0.66CysLys: 0.66 ± 0.107
0.642CysLeu: 0.642 ± 0.106
0.321CysMet: 0.321 ± 0.077
0.535CysAsn: 0.535 ± 0.115
0.285CysPro: 0.285 ± 0.093
0.303CysGln: 0.303 ± 0.089
0.428CysArg: 0.428 ± 0.105
0.731CysSer: 0.731 ± 0.131
0.553CysThr: 0.553 ± 0.117
0.482CysVal: 0.482 ± 0.106
0.143CysTrp: 0.143 ± 0.053
0.285CysTyr: 0.285 ± 0.067
0.0CysXaa: 0.0 ± 0.0
Asp
5.441AspAla: 5.441 ± 0.41
0.535AspCys: 0.535 ± 0.112
4.121AspAsp: 4.121 ± 0.292
3.693AspGlu: 3.693 ± 0.384
2.747AspPhe: 2.747 ± 0.274
6.101AspGly: 6.101 ± 0.439
0.624AspHis: 0.624 ± 0.124
4.157AspIle: 4.157 ± 0.382
3.336AspLys: 3.336 ± 0.347
4.478AspLeu: 4.478 ± 0.338
1.534AspMet: 1.534 ± 0.207
3.693AspAsn: 3.693 ± 0.346
2.872AspPro: 2.872 ± 0.222
2.016AspGln: 2.016 ± 0.165
2.462AspArg: 2.462 ± 0.201
4.513AspSer: 4.513 ± 0.277
4.674AspThr: 4.674 ± 0.457
4.085AspVal: 4.085 ± 0.308
0.999AspTrp: 0.999 ± 0.139
3.068AspTyr: 3.068 ± 0.226
0.0AspXaa: 0.0 ± 0.0
Glu
3.104GluAla: 3.104 ± 0.319
0.767GluCys: 0.767 ± 0.137
4.264GluAsp: 4.264 ± 0.335
4.513GluGlu: 4.513 ± 0.513
3.211GluPhe: 3.211 ± 0.231
3.942GluGly: 3.942 ± 0.27
0.856GluHis: 0.856 ± 0.146
3.711GluIle: 3.711 ± 0.378
3.55GluLys: 3.55 ± 0.387
4.906GluLeu: 4.906 ± 0.305
1.748GluMet: 1.748 ± 0.235
3.14GluAsn: 3.14 ± 0.218
1.57GluPro: 1.57 ± 0.168
2.373GluGln: 2.373 ± 0.2
2.801GluArg: 2.801 ± 0.329
3.514GluSer: 3.514 ± 0.342
4.139GluThr: 4.139 ± 0.247
4.62GluVal: 4.62 ± 0.278
0.714GluTrp: 0.714 ± 0.124
2.961GluTyr: 2.961 ± 0.272
0.0GluXaa: 0.0 ± 0.0
Phe
2.515PheAla: 2.515 ± 0.209
0.553PheCys: 0.553 ± 0.121
3.372PheAsp: 3.372 ± 0.227
2.533PheGlu: 2.533 ± 0.247
1.766PhePhe: 1.766 ± 0.185
3.015PheGly: 3.015 ± 0.271
0.535PheHis: 0.535 ± 0.129
2.729PheIle: 2.729 ± 0.246
2.319PheLys: 2.319 ± 0.215
3.193PheLeu: 3.193 ± 0.301
1.106PheMet: 1.106 ± 0.171
2.801PheAsn: 2.801 ± 0.239
1.606PhePro: 1.606 ± 0.193
1.713PheGln: 1.713 ± 0.142
1.409PheArg: 1.409 ± 0.173
3.14PheSer: 3.14 ± 0.253
3.461PheThr: 3.461 ± 0.261
2.676PheVal: 2.676 ± 0.27
0.321PheTrp: 0.321 ± 0.081
1.713PheTyr: 1.713 ± 0.147
0.0PheXaa: 0.0 ± 0.0
Gly
6.565GlyAla: 6.565 ± 0.614
0.749GlyCys: 0.749 ± 0.136
5.334GlyAsp: 5.334 ± 0.494
4.424GlyGlu: 4.424 ± 0.331
3.229GlyPhe: 3.229 ± 0.346
7.617GlyGly: 7.617 ± 0.838
0.963GlyHis: 0.963 ± 0.137
4.067GlyIle: 4.067 ± 0.341
3.639GlyLys: 3.639 ± 0.362
4.371GlyLeu: 4.371 ± 0.297
1.73GlyMet: 1.73 ± 0.309
4.21GlyAsn: 4.21 ± 0.454
1.927GlyPro: 1.927 ± 0.221
2.498GlyGln: 2.498 ± 0.22
2.926GlyArg: 2.926 ± 0.268
6.815GlySer: 6.815 ± 0.647
7.243GlyThr: 7.243 ± 0.812
5.28GlyVal: 5.28 ± 0.301
1.035GlyTrp: 1.035 ± 0.141
3.443GlyTyr: 3.443 ± 0.339
0.0GlyXaa: 0.0 ± 0.0
His
0.731HisAla: 0.731 ± 0.118
0.214HisCys: 0.214 ± 0.067
0.928HisAsp: 0.928 ± 0.148
0.607HisGlu: 0.607 ± 0.132
0.714HisPhe: 0.714 ± 0.129
0.892HisGly: 0.892 ± 0.138
0.285HisHis: 0.285 ± 0.085
0.696HisIle: 0.696 ± 0.114
0.874HisLys: 0.874 ± 0.133
1.017HisLeu: 1.017 ± 0.18
0.303HisMet: 0.303 ± 0.07
0.66HisAsn: 0.66 ± 0.128
0.838HisPro: 0.838 ± 0.137
0.464HisGln: 0.464 ± 0.09
0.535HisArg: 0.535 ± 0.122
0.945HisSer: 0.945 ± 0.135
0.856HisThr: 0.856 ± 0.133
0.981HisVal: 0.981 ± 0.143
0.232HisTrp: 0.232 ± 0.079
0.928HisTyr: 0.928 ± 0.137
0.0HisXaa: 0.0 ± 0.0
Ile
4.299IleAla: 4.299 ± 0.349
0.589IleCys: 0.589 ± 0.116
4.335IleAsp: 4.335 ± 0.329
4.246IleGlu: 4.246 ± 0.267
2.426IlePhe: 2.426 ± 0.18
4.085IleGly: 4.085 ± 0.322
0.589IleHis: 0.589 ± 0.1
3.942IleIle: 3.942 ± 0.298
4.032IleLys: 4.032 ± 0.324
4.335IleLeu: 4.335 ± 0.276
1.088IleMet: 1.088 ± 0.204
4.067IleAsn: 4.067 ± 0.265
2.997IlePro: 2.997 ± 0.293
2.498IleGln: 2.498 ± 0.201
2.123IleArg: 2.123 ± 0.172
4.817IleSer: 4.817 ± 0.57
5.798IleThr: 5.798 ± 0.53
4.067IleVal: 4.067 ± 0.35
0.678IleTrp: 0.678 ± 0.112
2.105IleTyr: 2.105 ± 0.246
0.0IleXaa: 0.0 ± 0.0
Lys
3.14LysAla: 3.14 ± 0.341
0.607LysCys: 0.607 ± 0.111
3.389LysAsp: 3.389 ± 0.379
3.96LysGlu: 3.96 ± 0.51
2.39LysPhe: 2.39 ± 0.252
3.229LysGly: 3.229 ± 0.331
0.856LysHis: 0.856 ± 0.18
3.604LysIle: 3.604 ± 0.365
4.478LysLys: 4.478 ± 0.618
4.549LysLeu: 4.549 ± 0.354
1.481LysMet: 1.481 ± 0.23
3.282LysAsn: 3.282 ± 0.276
2.034LysPro: 2.034 ± 0.291
2.052LysGln: 2.052 ± 0.293
2.337LysArg: 2.337 ± 0.274
4.228LysSer: 4.228 ± 0.494
3.621LysThr: 3.621 ± 0.253
3.907LysVal: 3.907 ± 0.265
0.714LysTrp: 0.714 ± 0.137
2.587LysTyr: 2.587 ± 0.325
0.0LysXaa: 0.0 ± 0.0
Leu
4.924LeuAla: 4.924 ± 0.283
0.767LeuCys: 0.767 ± 0.152
5.387LeuAsp: 5.387 ± 0.385
4.656LeuGlu: 4.656 ± 0.37
2.408LeuPhe: 2.408 ± 0.181
4.763LeuGly: 4.763 ± 0.394
1.391LeuHis: 1.391 ± 0.18
3.925LeuIle: 3.925 ± 0.256
4.834LeuLys: 4.834 ± 0.412
5.387LeuLeu: 5.387 ± 0.376
1.499LeuMet: 1.499 ± 0.265
4.567LeuAsn: 4.567 ± 0.305
3.015LeuPro: 3.015 ± 0.28
2.854LeuGln: 2.854 ± 0.241
3.247LeuArg: 3.247 ± 0.257
5.263LeuSer: 5.263 ± 0.338
5.334LeuThr: 5.334 ± 0.503
4.603LeuVal: 4.603 ± 0.289
0.803LeuTrp: 0.803 ± 0.131
3.336LeuTyr: 3.336 ± 0.237
0.0LeuXaa: 0.0 ± 0.0
Met
1.552MetAla: 1.552 ± 0.237
0.143MetCys: 0.143 ± 0.05
1.177MetAsp: 1.177 ± 0.173
1.374MetGlu: 1.374 ± 0.218
0.928MetPhe: 0.928 ± 0.162
1.374MetGly: 1.374 ± 0.204
0.624MetHis: 0.624 ± 0.13
1.195MetIle: 1.195 ± 0.164
1.784MetLys: 1.784 ± 0.304
1.713MetLeu: 1.713 ± 0.202
0.714MetMet: 0.714 ± 0.155
1.267MetAsn: 1.267 ± 0.194
1.053MetPro: 1.053 ± 0.18
0.928MetGln: 0.928 ± 0.17
1.124MetArg: 1.124 ± 0.197
1.534MetSer: 1.534 ± 0.263
1.57MetThr: 1.57 ± 0.259
1.088MetVal: 1.088 ± 0.139
0.285MetTrp: 0.285 ± 0.093
0.571MetTyr: 0.571 ± 0.114
0.0MetXaa: 0.0 ± 0.0
Asn
3.728AsnAla: 3.728 ± 0.376
0.392AsnCys: 0.392 ± 0.083
3.229AsnAsp: 3.229 ± 0.217
3.407AsnGlu: 3.407 ± 0.216
2.747AsnPhe: 2.747 ± 0.214
4.496AsnGly: 4.496 ± 0.339
0.66AsnHis: 0.66 ± 0.105
4.139AsnIle: 4.139 ± 0.363
2.712AsnLys: 2.712 ± 0.282
5.28AsnLeu: 5.28 ± 0.382
0.785AsnMet: 0.785 ± 0.141
3.229AsnAsn: 3.229 ± 0.315
2.926AsnPro: 2.926 ± 0.23
2.159AsnGln: 2.159 ± 0.165
2.052AsnArg: 2.052 ± 0.185
3.978AsnSer: 3.978 ± 0.279
4.406AsnThr: 4.406 ± 0.493
4.281AsnVal: 4.281 ± 0.286
0.874AsnTrp: 0.874 ± 0.126
2.462AsnTyr: 2.462 ± 0.183
0.0AsnXaa: 0.0 ± 0.0
Pro
2.462ProAla: 2.462 ± 0.215
0.303ProCys: 0.303 ± 0.09
2.23ProAsp: 2.23 ± 0.266
2.854ProGlu: 2.854 ± 0.267
1.534ProPhe: 1.534 ± 0.173
3.086ProGly: 3.086 ± 0.269
0.607ProHis: 0.607 ± 0.092
2.515ProIle: 2.515 ± 0.239
2.034ProLys: 2.034 ± 0.258
2.212ProLeu: 2.212 ± 0.225
0.749ProMet: 0.749 ± 0.148
2.266ProAsn: 2.266 ± 0.191
1.606ProPro: 1.606 ± 0.217
1.445ProGln: 1.445 ± 0.192
1.32ProArg: 1.32 ± 0.143
3.033ProSer: 3.033 ± 0.235
3.122ProThr: 3.122 ± 0.26
2.569ProVal: 2.569 ± 0.232
0.589ProTrp: 0.589 ± 0.109
1.606ProTyr: 1.606 ± 0.191
0.0ProXaa: 0.0 ± 0.0
Gln
2.123GlnAla: 2.123 ± 0.213
0.375GlnCys: 0.375 ± 0.106
2.212GlnAsp: 2.212 ± 0.193
2.569GlnGlu: 2.569 ± 0.296
1.552GlnPhe: 1.552 ± 0.189
2.747GlnGly: 2.747 ± 0.255
0.571GlnHis: 0.571 ± 0.124
2.64GlnIle: 2.64 ± 0.242
2.159GlnLys: 2.159 ± 0.26
3.265GlnLeu: 3.265 ± 0.263
0.91GlnMet: 0.91 ± 0.162
1.677GlnAsn: 1.677 ± 0.151
1.302GlnPro: 1.302 ± 0.155
1.463GlnGln: 1.463 ± 0.172
1.516GlnArg: 1.516 ± 0.19
2.23GlnSer: 2.23 ± 0.257
2.355GlnThr: 2.355 ± 0.235
2.747GlnVal: 2.747 ± 0.248
0.607GlnTrp: 0.607 ± 0.1
1.855GlnTyr: 1.855 ± 0.188
0.0GlnXaa: 0.0 ± 0.0
Arg
2.301ArgAla: 2.301 ± 0.199
0.25ArgCys: 0.25 ± 0.067
2.141ArgAsp: 2.141 ± 0.19
2.426ArgGlu: 2.426 ± 0.293
1.606ArgPhe: 1.606 ± 0.207
2.605ArgGly: 2.605 ± 0.237
0.66ArgHis: 0.66 ± 0.118
2.658ArgIle: 2.658 ± 0.231
2.694ArgLys: 2.694 ± 0.387
3.479ArgLeu: 3.479 ± 0.251
0.981ArgMet: 0.981 ± 0.176
2.194ArgAsn: 2.194 ± 0.207
1.213ArgPro: 1.213 ± 0.147
1.748ArgGln: 1.748 ± 0.191
1.962ArgArg: 1.962 ± 0.271
2.283ArgSer: 2.283 ± 0.252
2.587ArgThr: 2.587 ± 0.231
2.961ArgVal: 2.961 ± 0.289
0.571ArgTrp: 0.571 ± 0.119
2.034ArgTyr: 2.034 ± 0.198
0.0ArgXaa: 0.0 ± 0.0
Ser
5.923SerAla: 5.923 ± 0.402
0.41SerCys: 0.41 ± 0.108
4.264SerAsp: 4.264 ± 0.34
3.389SerGlu: 3.389 ± 0.27
3.604SerPhe: 3.604 ± 0.325
7.243SerGly: 7.243 ± 0.695
0.785SerHis: 0.785 ± 0.15
4.763SerIle: 4.763 ± 0.382
3.871SerLys: 3.871 ± 0.348
5.031SerLeu: 5.031 ± 0.297
1.516SerMet: 1.516 ± 0.189
4.21SerAsn: 4.21 ± 0.337
2.605SerPro: 2.605 ± 0.248
2.729SerGln: 2.729 ± 0.258
2.408SerArg: 2.408 ± 0.212
6.351SerSer: 6.351 ± 0.599
5.923SerThr: 5.923 ± 0.612
4.995SerVal: 4.995 ± 0.448
0.642SerTrp: 0.642 ± 0.103
2.89SerTyr: 2.89 ± 0.214
0.0SerXaa: 0.0 ± 0.0
Thr
6.422ThrAla: 6.422 ± 0.589
0.464ThrCys: 0.464 ± 0.098
4.745ThrAsp: 4.745 ± 0.404
3.996ThrGlu: 3.996 ± 0.337
3.086ThrPhe: 3.086 ± 0.35
6.904ThrGly: 6.904 ± 0.778
0.945ThrHis: 0.945 ± 0.162
5.495ThrIle: 5.495 ± 0.504
3.247ThrLys: 3.247 ± 0.254
6.297ThrLeu: 6.297 ± 0.555
1.195ThrMet: 1.195 ± 0.155
4.317ThrAsn: 4.317 ± 0.548
3.497ThrPro: 3.497 ± 0.256
2.515ThrGln: 2.515 ± 0.166
2.569ThrArg: 2.569 ± 0.217
5.94ThrSer: 5.94 ± 0.724
6.761ThrThr: 6.761 ± 0.709
6.155ThrVal: 6.155 ± 0.766
0.785ThrTrp: 0.785 ± 0.105
2.783ThrTyr: 2.783 ± 0.176
0.0ThrXaa: 0.0 ± 0.0
Val
4.727ValAla: 4.727 ± 0.328
0.482ValCys: 0.482 ± 0.118
5.495ValAsp: 5.495 ± 0.488
4.014ValGlu: 4.014 ± 0.253
2.729ValPhe: 2.729 ± 0.254
5.387ValGly: 5.387 ± 0.575
0.91ValHis: 0.91 ± 0.129
4.406ValIle: 4.406 ± 0.343
3.389ValLys: 3.389 ± 0.315
4.228ValLeu: 4.228 ± 0.295
1.57ValMet: 1.57 ± 0.218
4.085ValAsn: 4.085 ± 0.364
2.64ValPro: 2.64 ± 0.229
2.551ValGln: 2.551 ± 0.2
2.765ValArg: 2.765 ± 0.235
5.298ValSer: 5.298 ± 0.378
5.958ValThr: 5.958 ± 0.66
4.852ValVal: 4.852 ± 0.378
0.696ValTrp: 0.696 ± 0.079
2.426ValTyr: 2.426 ± 0.253
0.0ValXaa: 0.0 ± 0.0
Trp
0.91TrpAla: 0.91 ± 0.139
0.161TrpCys: 0.161 ± 0.059
0.838TrpAsp: 0.838 ± 0.15
0.714TrpGlu: 0.714 ± 0.125
0.482TrpPhe: 0.482 ± 0.095
0.678TrpGly: 0.678 ± 0.109
0.357TrpHis: 0.357 ± 0.103
0.678TrpIle: 0.678 ± 0.119
0.963TrpLys: 0.963 ± 0.186
0.731TrpLeu: 0.731 ± 0.124
0.303TrpMet: 0.303 ± 0.092
1.053TrpAsn: 1.053 ± 0.166
0.161TrpPro: 0.161 ± 0.053
0.482TrpGln: 0.482 ± 0.081
0.517TrpArg: 0.517 ± 0.076
0.767TrpSer: 0.767 ± 0.112
0.803TrpThr: 0.803 ± 0.123
0.874TrpVal: 0.874 ± 0.123
0.178TrpTrp: 0.178 ± 0.059
0.41TrpTyr: 0.41 ± 0.074
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.373TyrAla: 2.373 ± 0.148
0.482TyrCys: 0.482 ± 0.111
3.247TyrAsp: 3.247 ± 0.308
2.569TyrGlu: 2.569 ± 0.299
1.784TyrPhe: 1.784 ± 0.189
2.515TyrGly: 2.515 ± 0.259
0.5TyrHis: 0.5 ± 0.075
2.836TyrIle: 2.836 ± 0.243
2.337TyrLys: 2.337 ± 0.269
2.997TyrLeu: 2.997 ± 0.249
0.999TyrMet: 0.999 ± 0.167
2.694TyrAsn: 2.694 ± 0.227
1.57TyrPro: 1.57 ± 0.155
1.641TyrGln: 1.641 ± 0.179
2.176TyrArg: 2.176 ± 0.22
2.765TyrSer: 2.765 ± 0.255
3.104TyrThr: 3.104 ± 0.369
2.658TyrVal: 2.658 ± 0.18
0.5TyrTrp: 0.5 ± 0.111
1.98TyrTyr: 1.98 ± 0.197
0.018TyrXaa: 0.018 ± 0.017
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.018XaaAsn: 0.018 ± 0.017
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 227 proteins (56057 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski