Amino acid dipepetide frequency for Prochlorococcus phage P-SSM7

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.522AlaAla: 6.522 ± 0.562
0.477AlaCys: 0.477 ± 0.099
4.138AlaAsp: 4.138 ± 0.29
3.644AlaGlu: 3.644 ± 0.334
2.384AlaPhe: 2.384 ± 0.194
6.181AlaGly: 6.181 ± 0.455
1.09AlaHis: 1.09 ± 0.176
4.359AlaIle: 4.359 ± 0.297
4.07AlaLys: 4.07 ± 0.381
4.342AlaLeu: 4.342 ± 0.279
1.464AlaMet: 1.464 ± 0.207
3.491AlaAsn: 3.491 ± 0.327
2.367AlaPro: 2.367 ± 0.214
2.163AlaGln: 2.163 ± 0.14
2.554AlaArg: 2.554 ± 0.212
5.381AlaSer: 5.381 ± 0.537
5.977AlaThr: 5.977 ± 0.743
4.325AlaVal: 4.325 ± 0.328
0.681AlaTrp: 0.681 ± 0.117
2.026AlaTyr: 2.026 ± 0.18
0.0AlaXaa: 0.0 ± 0.0
Cys
0.596CysAla: 0.596 ± 0.096
0.068CysCys: 0.068 ± 0.031
0.63CysAsp: 0.63 ± 0.11
0.579CysGlu: 0.579 ± 0.098
0.409CysPhe: 0.409 ± 0.098
0.528CysGly: 0.528 ± 0.115
0.272CysHis: 0.272 ± 0.078
0.477CysIle: 0.477 ± 0.111
0.613CysLys: 0.613 ± 0.113
0.613CysLeu: 0.613 ± 0.101
0.255CysMet: 0.255 ± 0.064
0.324CysAsn: 0.324 ± 0.098
0.358CysPro: 0.358 ± 0.083
0.375CysGln: 0.375 ± 0.107
0.272CysArg: 0.272 ± 0.078
0.698CysSer: 0.698 ± 0.149
0.749CysThr: 0.749 ± 0.127
0.511CysVal: 0.511 ± 0.113
0.119CysTrp: 0.119 ± 0.049
0.341CysTyr: 0.341 ± 0.083
0.0CysXaa: 0.0 ± 0.0
Asp
4.581AspAla: 4.581 ± 0.295
0.613AspCys: 0.613 ± 0.099
4.581AspAsp: 4.581 ± 0.46
4.393AspGlu: 4.393 ± 0.332
2.946AspPhe: 2.946 ± 0.224
6.113AspGly: 6.113 ± 0.408
1.124AspHis: 1.124 ± 0.17
4.53AspIle: 4.53 ± 0.326
3.559AspLys: 3.559 ± 0.319
4.785AspLeu: 4.785 ± 0.306
1.464AspMet: 1.464 ± 0.187
3.44AspAsn: 3.44 ± 0.289
3.372AspPro: 3.372 ± 0.247
2.35AspGln: 2.35 ± 0.248
2.316AspArg: 2.316 ± 0.213
4.495AspSer: 4.495 ± 0.38
4.581AspThr: 4.581 ± 0.299
4.019AspVal: 4.019 ± 0.319
1.107AspTrp: 1.107 ± 0.148
3.286AspTyr: 3.286 ± 0.244
0.0AspXaa: 0.0 ± 0.0
Glu
2.793GluAla: 2.793 ± 0.262
0.63GluCys: 0.63 ± 0.114
3.985GluAsp: 3.985 ± 0.392
4.274GluGlu: 4.274 ± 0.542
3.065GluPhe: 3.065 ± 0.264
3.542GluGly: 3.542 ± 0.287
1.039GluHis: 1.039 ± 0.146
4.359GluIle: 4.359 ± 0.309
4.036GluLys: 4.036 ± 0.444
4.751GluLeu: 4.751 ± 0.36
1.362GluMet: 1.362 ± 0.237
3.235GluAsn: 3.235 ± 0.254
1.498GluPro: 1.498 ± 0.156
2.163GluGln: 2.163 ± 0.196
2.435GluArg: 2.435 ± 0.256
3.627GluSer: 3.627 ± 0.337
3.627GluThr: 3.627 ± 0.292
4.581GluVal: 4.581 ± 0.279
0.783GluTrp: 0.783 ± 0.119
2.844GluTyr: 2.844 ± 0.256
0.0GluXaa: 0.0 ± 0.0
Phe
2.673PheAla: 2.673 ± 0.215
0.477PheCys: 0.477 ± 0.102
3.508PheAsp: 3.508 ± 0.229
2.435PheGlu: 2.435 ± 0.204
1.72PhePhe: 1.72 ± 0.199
2.895PheGly: 2.895 ± 0.234
0.443PheHis: 0.443 ± 0.095
2.639PheIle: 2.639 ± 0.199
2.35PheLys: 2.35 ± 0.254
3.014PheLeu: 3.014 ± 0.268
1.039PheMet: 1.039 ± 0.16
2.656PheAsn: 2.656 ± 0.188
1.567PhePro: 1.567 ± 0.191
1.243PheGln: 1.243 ± 0.153
1.447PheArg: 1.447 ± 0.191
3.167PheSer: 3.167 ± 0.336
3.763PheThr: 3.763 ± 0.328
2.861PheVal: 2.861 ± 0.219
0.392PheTrp: 0.392 ± 0.084
1.567PheTyr: 1.567 ± 0.15
0.0PheXaa: 0.0 ± 0.0
Gly
5.687GlyAla: 5.687 ± 0.615
0.698GlyCys: 0.698 ± 0.147
5.296GlyAsp: 5.296 ± 0.422
3.934GlyGlu: 3.934 ± 0.292
3.031GlyPhe: 3.031 ± 0.228
8.361GlyGly: 8.361 ± 0.969
0.988GlyHis: 0.988 ± 0.162
4.359GlyIle: 4.359 ± 0.243
4.581GlyLys: 4.581 ± 0.464
4.938GlyLeu: 4.938 ± 0.346
1.447GlyMet: 1.447 ± 0.236
4.734GlyAsn: 4.734 ± 0.376
1.72GlyPro: 1.72 ± 0.201
2.418GlyGln: 2.418 ± 0.201
2.759GlyArg: 2.759 ± 0.289
6.641GlySer: 6.641 ± 0.551
7.918GlyThr: 7.918 ± 0.767
5.074GlyVal: 5.074 ± 0.394
1.073GlyTrp: 1.073 ± 0.138
3.269GlyTyr: 3.269 ± 0.31
0.0GlyXaa: 0.0 ± 0.0
His
0.63HisAla: 0.63 ± 0.114
0.204HisCys: 0.204 ± 0.062
1.073HisAsp: 1.073 ± 0.193
1.056HisGlu: 1.056 ± 0.172
0.851HisPhe: 0.851 ± 0.13
0.954HisGly: 0.954 ± 0.146
0.443HisHis: 0.443 ± 0.099
1.107HisIle: 1.107 ± 0.136
0.937HisLys: 0.937 ± 0.14
0.988HisLeu: 0.988 ± 0.173
0.272HisMet: 0.272 ± 0.078
0.817HisAsn: 0.817 ± 0.131
1.005HisPro: 1.005 ± 0.195
0.477HisGln: 0.477 ± 0.094
0.698HisArg: 0.698 ± 0.119
1.039HisSer: 1.039 ± 0.139
1.158HisThr: 1.158 ± 0.153
1.073HisVal: 1.073 ± 0.149
0.17HisTrp: 0.17 ± 0.059
0.732HisTyr: 0.732 ± 0.146
0.0HisXaa: 0.0 ± 0.0
Ile
3.985IleAla: 3.985 ± 0.286
0.562IleCys: 0.562 ± 0.138
4.87IleAsp: 4.87 ± 0.303
4.615IleGlu: 4.615 ± 0.341
2.452IlePhe: 2.452 ± 0.171
4.632IleGly: 4.632 ± 0.351
0.92IleHis: 0.92 ± 0.165
3.763IleIle: 3.763 ± 0.237
4.053IleLys: 4.053 ± 0.283
4.683IleLeu: 4.683 ± 0.304
1.243IleMet: 1.243 ± 0.193
4.087IleAsn: 4.087 ± 0.217
2.707IlePro: 2.707 ± 0.238
2.452IleGln: 2.452 ± 0.237
2.588IleArg: 2.588 ± 0.211
4.989IleSer: 4.989 ± 0.349
6.232IleThr: 6.232 ± 0.772
3.882IleVal: 3.882 ± 0.307
0.613IleTrp: 0.613 ± 0.105
2.146IleTyr: 2.146 ± 0.259
0.0IleXaa: 0.0 ± 0.0
Lys
3.78LysAla: 3.78 ± 0.454
0.477LysCys: 0.477 ± 0.093
4.019LysAsp: 4.019 ± 0.304
3.865LysGlu: 3.865 ± 0.443
2.81LysPhe: 2.81 ± 0.253
4.07LysGly: 4.07 ± 0.41
1.107LysHis: 1.107 ± 0.194
4.461LysIle: 4.461 ± 0.362
5.262LysLys: 5.262 ± 0.772
4.444LysLeu: 4.444 ± 0.354
1.533LysMet: 1.533 ± 0.229
3.014LysAsn: 3.014 ± 0.217
2.129LysPro: 2.129 ± 0.26
2.231LysGln: 2.231 ± 0.29
2.367LysArg: 2.367 ± 0.309
4.07LysSer: 4.07 ± 0.411
4.053LysThr: 4.053 ± 0.258
4.291LysVal: 4.291 ± 0.288
0.851LysTrp: 0.851 ± 0.124
2.725LysTyr: 2.725 ± 0.291
0.0LysXaa: 0.0 ± 0.0
Leu
4.734LeuAla: 4.734 ± 0.293
0.8LeuCys: 0.8 ± 0.167
5.773LeuAsp: 5.773 ± 0.309
4.002LeuGlu: 4.002 ± 0.334
2.81LeuPhe: 2.81 ± 0.221
4.751LeuGly: 4.751 ± 0.325
1.175LeuHis: 1.175 ± 0.179
4.24LeuIle: 4.24 ± 0.299
5.057LeuLys: 5.057 ± 0.404
4.938LeuLeu: 4.938 ± 0.389
1.43LeuMet: 1.43 ± 0.232
4.223LeuAsn: 4.223 ± 0.286
2.588LeuPro: 2.588 ± 0.28
2.827LeuGln: 2.827 ± 0.202
2.895LeuArg: 2.895 ± 0.206
5.177LeuSer: 5.177 ± 0.266
5.892LeuThr: 5.892 ± 0.549
4.291LeuVal: 4.291 ± 0.266
0.562LeuTrp: 0.562 ± 0.101
2.895LeuTyr: 2.895 ± 0.197
0.0LeuXaa: 0.0 ± 0.0
Met
1.43MetAla: 1.43 ± 0.223
0.136MetCys: 0.136 ± 0.047
0.834MetAsp: 0.834 ± 0.118
1.243MetGlu: 1.243 ± 0.185
0.681MetPhe: 0.681 ± 0.119
1.039MetGly: 1.039 ± 0.178
0.477MetHis: 0.477 ± 0.107
1.192MetIle: 1.192 ± 0.2
1.652MetLys: 1.652 ± 0.24
1.379MetLeu: 1.379 ± 0.2
0.511MetMet: 0.511 ± 0.117
1.141MetAsn: 1.141 ± 0.162
1.022MetPro: 1.022 ± 0.172
0.681MetGln: 0.681 ± 0.134
1.226MetArg: 1.226 ± 0.166
1.686MetSer: 1.686 ± 0.268
1.703MetThr: 1.703 ± 0.229
0.834MetVal: 0.834 ± 0.102
0.238MetTrp: 0.238 ± 0.074
0.647MetTyr: 0.647 ± 0.131
0.0MetXaa: 0.0 ± 0.0
Asn
4.07AsnAla: 4.07 ± 0.343
0.511AsnCys: 0.511 ± 0.093
2.98AsnAsp: 2.98 ± 0.189
2.827AsnGlu: 2.827 ± 0.242
2.282AsnPhe: 2.282 ± 0.224
4.768AsnGly: 4.768 ± 0.403
0.92AsnHis: 0.92 ± 0.125
4.376AsnIle: 4.376 ± 0.405
3.372AsnLys: 3.372 ± 0.306
4.938AsnLeu: 4.938 ± 0.358
0.766AsnMet: 0.766 ± 0.138
3.542AsnAsn: 3.542 ± 0.273
3.235AsnPro: 3.235 ± 0.235
2.06AsnGln: 2.06 ± 0.215
2.248AsnArg: 2.248 ± 0.219
3.899AsnSer: 3.899 ± 0.311
4.206AsnThr: 4.206 ± 0.525
4.155AsnVal: 4.155 ± 0.324
0.749AsnTrp: 0.749 ± 0.137
2.588AsnTyr: 2.588 ± 0.18
0.0AsnXaa: 0.0 ± 0.0
Pro
2.18ProAla: 2.18 ± 0.232
0.307ProCys: 0.307 ± 0.089
2.827ProAsp: 2.827 ± 0.274
2.537ProGlu: 2.537 ± 0.214
1.703ProPhe: 1.703 ± 0.201
2.707ProGly: 2.707 ± 0.217
0.817ProHis: 0.817 ± 0.175
2.537ProIle: 2.537 ± 0.23
2.18ProLys: 2.18 ± 0.301
2.384ProLeu: 2.384 ± 0.233
0.477ProMet: 0.477 ± 0.111
2.554ProAsn: 2.554 ± 0.207
1.396ProPro: 1.396 ± 0.195
1.277ProGln: 1.277 ± 0.192
1.686ProArg: 1.686 ± 0.228
2.946ProSer: 2.946 ± 0.249
3.082ProThr: 3.082 ± 0.194
2.282ProVal: 2.282 ± 0.221
0.681ProTrp: 0.681 ± 0.133
1.533ProTyr: 1.533 ± 0.185
0.0ProXaa: 0.0 ± 0.0
Gln
2.197GlnAla: 2.197 ± 0.223
0.255GlnCys: 0.255 ± 0.053
1.992GlnAsp: 1.992 ± 0.166
2.333GlnGlu: 2.333 ± 0.215
1.873GlnPhe: 1.873 ± 0.141
2.248GlnGly: 2.248 ± 0.171
0.409GlnHis: 0.409 ± 0.077
2.656GlnIle: 2.656 ± 0.213
2.248GlnLys: 2.248 ± 0.261
2.69GlnLeu: 2.69 ± 0.233
0.698GlnMet: 0.698 ± 0.114
1.686GlnAsn: 1.686 ± 0.178
1.192GlnPro: 1.192 ± 0.165
1.277GlnGln: 1.277 ± 0.17
1.635GlnArg: 1.635 ± 0.179
2.077GlnSer: 2.077 ± 0.215
2.486GlnThr: 2.486 ± 0.234
2.554GlnVal: 2.554 ± 0.209
0.426GlnTrp: 0.426 ± 0.087
1.652GlnTyr: 1.652 ± 0.164
0.0GlnXaa: 0.0 ± 0.0
Arg
2.367ArgAla: 2.367 ± 0.187
0.307ArgCys: 0.307 ± 0.089
2.094ArgAsp: 2.094 ± 0.179
2.316ArgGlu: 2.316 ± 0.276
1.924ArgPhe: 1.924 ± 0.197
2.571ArgGly: 2.571 ± 0.285
0.664ArgHis: 0.664 ± 0.12
2.929ArgIle: 2.929 ± 0.233
2.656ArgLys: 2.656 ± 0.374
3.099ArgLeu: 3.099 ± 0.254
0.92ArgMet: 0.92 ± 0.164
2.009ArgAsn: 2.009 ± 0.22
1.447ArgPro: 1.447 ± 0.204
1.635ArgGln: 1.635 ± 0.178
1.601ArgArg: 1.601 ± 0.275
2.742ArgSer: 2.742 ± 0.221
2.384ArgThr: 2.384 ± 0.282
2.742ArgVal: 2.742 ± 0.271
0.443ArgTrp: 0.443 ± 0.105
1.618ArgTyr: 1.618 ± 0.202
0.0ArgXaa: 0.0 ± 0.0
Ser
5.177SerAla: 5.177 ± 0.407
0.528SerCys: 0.528 ± 0.125
4.615SerAsp: 4.615 ± 0.219
3.542SerGlu: 3.542 ± 0.273
3.065SerPhe: 3.065 ± 0.296
8.003SerGly: 8.003 ± 0.6
0.971SerHis: 0.971 ± 0.126
4.478SerIle: 4.478 ± 0.367
4.036SerLys: 4.036 ± 0.318
4.734SerLeu: 4.734 ± 0.289
1.362SerMet: 1.362 ± 0.186
4.478SerAsn: 4.478 ± 0.261
2.622SerPro: 2.622 ± 0.222
2.299SerGln: 2.299 ± 0.189
2.231SerArg: 2.231 ± 0.224
6.573SerSer: 6.573 ± 0.445
5.807SerThr: 5.807 ± 0.381
5.398SerVal: 5.398 ± 0.472
0.715SerTrp: 0.715 ± 0.109
2.878SerTyr: 2.878 ± 0.234
0.0SerXaa: 0.0 ± 0.0
Thr
6.624ThrAla: 6.624 ± 0.899
0.647ThrCys: 0.647 ± 0.139
5.177ThrAsp: 5.177 ± 0.433
3.831ThrGlu: 3.831 ± 0.273
3.661ThrPhe: 3.661 ± 0.445
7.067ThrGly: 7.067 ± 0.727
1.107ThrHis: 1.107 ± 0.142
5.977ThrIle: 5.977 ± 0.581
3.78ThrLys: 3.78 ± 0.262
6.011ThrLeu: 6.011 ± 0.489
1.158ThrMet: 1.158 ± 0.162
4.751ThrAsn: 4.751 ± 0.516
3.065ThrPro: 3.065 ± 0.256
2.725ThrGln: 2.725 ± 0.213
2.469ThrArg: 2.469 ± 0.235
5.841ThrSer: 5.841 ± 0.586
6.488ThrThr: 6.488 ± 0.84
6.317ThrVal: 6.317 ± 0.545
0.647ThrTrp: 0.647 ± 0.091
2.946ThrTyr: 2.946 ± 0.278
0.0ThrXaa: 0.0 ± 0.0
Val
4.717ValAla: 4.717 ± 0.315
0.477ValCys: 0.477 ± 0.097
5.262ValAsp: 5.262 ± 0.376
4.223ValGlu: 4.223 ± 0.327
2.248ValPhe: 2.248 ± 0.195
5.33ValGly: 5.33 ± 0.382
0.834ValHis: 0.834 ± 0.142
3.729ValIle: 3.729 ± 0.277
3.968ValLys: 3.968 ± 0.265
4.649ValLeu: 4.649 ± 0.292
1.192ValMet: 1.192 ± 0.165
4.887ValAsn: 4.887 ± 0.309
2.69ValPro: 2.69 ± 0.227
2.146ValGln: 2.146 ± 0.21
2.554ValArg: 2.554 ± 0.21
5.16ValSer: 5.16 ± 0.432
6.096ValThr: 6.096 ± 0.667
4.598ValVal: 4.598 ± 0.368
0.511ValTrp: 0.511 ± 0.081
2.333ValTyr: 2.333 ± 0.206
0.0ValXaa: 0.0 ± 0.0
Trp
0.562TrpAla: 0.562 ± 0.102
0.136TrpCys: 0.136 ± 0.049
0.698TrpAsp: 0.698 ± 0.117
0.749TrpGlu: 0.749 ± 0.149
0.579TrpPhe: 0.579 ± 0.103
0.664TrpGly: 0.664 ± 0.123
0.289TrpHis: 0.289 ± 0.073
0.715TrpIle: 0.715 ± 0.117
0.732TrpLys: 0.732 ± 0.147
0.902TrpLeu: 0.902 ± 0.165
0.341TrpMet: 0.341 ± 0.086
0.817TrpAsn: 0.817 ± 0.119
0.238TrpPro: 0.238 ± 0.068
0.392TrpGln: 0.392 ± 0.055
0.409TrpArg: 0.409 ± 0.109
0.715TrpSer: 0.715 ± 0.101
0.868TrpThr: 0.868 ± 0.126
0.885TrpVal: 0.885 ± 0.146
0.153TrpTrp: 0.153 ± 0.066
0.511TrpTyr: 0.511 ± 0.093
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.333TyrAla: 2.333 ± 0.175
0.477TyrCys: 0.477 ± 0.091
3.303TyrAsp: 3.303 ± 0.239
2.231TyrGlu: 2.231 ± 0.279
1.362TyrPhe: 1.362 ± 0.185
2.503TyrGly: 2.503 ± 0.195
0.596TyrHis: 0.596 ± 0.107
2.588TyrIle: 2.588 ± 0.239
2.418TyrLys: 2.418 ± 0.218
2.776TyrLeu: 2.776 ± 0.208
0.8TyrMet: 0.8 ± 0.151
2.605TyrAsn: 2.605 ± 0.2
1.873TyrPro: 1.873 ± 0.259
1.396TyrGln: 1.396 ± 0.122
2.077TyrArg: 2.077 ± 0.212
2.588TyrSer: 2.588 ± 0.202
3.235TyrThr: 3.235 ± 0.329
2.929TyrVal: 2.929 ± 0.197
0.443TyrTrp: 0.443 ± 0.104
2.06TyrTyr: 2.06 ± 0.186
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 237 proteins (58727 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski