Amino acid dipepetide frequency for Synechococcus phage S-SKS1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.533AlaAla: 4.533 ± 0.512
0.571AlaCys: 0.571 ± 0.112
2.927AlaAsp: 2.927 ± 0.222
3.284AlaGlu: 3.284 ± 0.293
2.231AlaPhe: 2.231 ± 0.234
4.837AlaGly: 4.837 ± 0.364
0.892AlaHis: 0.892 ± 0.142
3.641AlaIle: 3.641 ± 0.256
3.052AlaLys: 3.052 ± 0.263
4.301AlaLeu: 4.301 ± 0.281
1.428AlaMet: 1.428 ± 0.172
3.302AlaAsn: 3.302 ± 0.31
2.463AlaPro: 2.463 ± 0.261
1.928AlaGln: 1.928 ± 0.173
2.445AlaArg: 2.445 ± 0.272
4.159AlaSer: 4.159 ± 0.346
4.569AlaThr: 4.569 ± 0.707
3.213AlaVal: 3.213 ± 0.27
0.553AlaTrp: 0.553 ± 0.115
1.928AlaTyr: 1.928 ± 0.184
0.0AlaXaa: 0.0 ± 0.0
Cys
0.589CysAla: 0.589 ± 0.098
0.089CysCys: 0.089 ± 0.039
1.017CysAsp: 1.017 ± 0.143
0.857CysGlu: 0.857 ± 0.125
0.553CysPhe: 0.553 ± 0.132
0.75CysGly: 0.75 ± 0.159
0.411CysHis: 0.411 ± 0.084
0.696CysIle: 0.696 ± 0.114
0.678CysLys: 0.678 ± 0.123
0.839CysLeu: 0.839 ± 0.137
0.321CysMet: 0.321 ± 0.078
0.696CysAsn: 0.696 ± 0.121
0.428CysPro: 0.428 ± 0.088
0.303CysGln: 0.303 ± 0.072
0.607CysArg: 0.607 ± 0.123
0.767CysSer: 0.767 ± 0.134
0.625CysThr: 0.625 ± 0.125
0.678CysVal: 0.678 ± 0.116
0.161CysTrp: 0.161 ± 0.055
0.66CysTyr: 0.66 ± 0.115
0.0CysXaa: 0.0 ± 0.0
Asp
4.052AspAla: 4.052 ± 0.27
0.982AspCys: 0.982 ± 0.146
4.212AspAsp: 4.212 ± 0.369
4.873AspGlu: 4.873 ± 0.341
3.409AspPhe: 3.409 ± 0.325
4.819AspGly: 4.819 ± 0.372
0.964AspHis: 0.964 ± 0.124
4.765AspIle: 4.765 ± 0.449
3.748AspLys: 3.748 ± 0.304
5.372AspLeu: 5.372 ± 0.317
1.553AspMet: 1.553 ± 0.18
3.445AspAsn: 3.445 ± 0.282
2.624AspPro: 2.624 ± 0.205
1.767AspGln: 1.767 ± 0.181
2.874AspArg: 2.874 ± 0.333
3.909AspSer: 3.909 ± 0.334
3.873AspThr: 3.873 ± 0.302
4.319AspVal: 4.319 ± 0.31
0.982AspTrp: 0.982 ± 0.147
3.266AspTyr: 3.266 ± 0.28
0.0AspXaa: 0.0 ± 0.0
Glu
2.999GluAla: 2.999 ± 0.276
0.875GluCys: 0.875 ± 0.137
3.837GluAsp: 3.837 ± 0.269
5.908GluGlu: 5.908 ± 0.572
2.963GluPhe: 2.963 ± 0.25
3.927GluGly: 3.927 ± 0.305
1.124GluHis: 1.124 ± 0.142
4.962GluIle: 4.962 ± 0.349
4.533GluLys: 4.533 ± 0.391
5.658GluLeu: 5.658 ± 0.39
1.892GluMet: 1.892 ± 0.233
4.105GluAsn: 4.105 ± 0.306
1.856GluPro: 1.856 ± 0.22
2.499GluGln: 2.499 ± 0.265
2.356GluArg: 2.356 ± 0.214
4.123GluSer: 4.123 ± 0.271
4.694GluThr: 4.694 ± 0.386
4.409GluVal: 4.409 ± 0.278
0.928GluTrp: 0.928 ± 0.139
3.427GluTyr: 3.427 ± 0.316
0.0GluXaa: 0.0 ± 0.0
Phe
2.374PheAla: 2.374 ± 0.238
0.553PheCys: 0.553 ± 0.133
3.248PheAsp: 3.248 ± 0.275
2.945PheGlu: 2.945 ± 0.31
1.785PhePhe: 1.785 ± 0.199
2.945PheGly: 2.945 ± 0.278
0.66PheHis: 0.66 ± 0.129
2.731PheIle: 2.731 ± 0.203
2.356PheLys: 2.356 ± 0.191
3.231PheLeu: 3.231 ± 0.258
0.91PheMet: 0.91 ± 0.134
3.052PheAsn: 3.052 ± 0.266
1.535PhePro: 1.535 ± 0.2
1.196PheGln: 1.196 ± 0.189
1.535PheArg: 1.535 ± 0.145
3.552PheSer: 3.552 ± 0.306
2.802PheThr: 2.802 ± 0.272
3.284PheVal: 3.284 ± 0.283
0.464PheTrp: 0.464 ± 0.08
1.767PheTyr: 1.767 ± 0.167
0.0PheXaa: 0.0 ± 0.0
Gly
3.962GlyAla: 3.962 ± 0.286
0.732GlyCys: 0.732 ± 0.152
4.052GlyAsp: 4.052 ± 0.365
4.052GlyGlu: 4.052 ± 0.301
2.856GlyPhe: 2.856 ± 0.212
7.157GlyGly: 7.157 ± 0.688
1.089GlyHis: 1.089 ± 0.149
6.193GlyIle: 6.193 ± 0.562
4.069GlyLys: 4.069 ± 0.352
5.14GlyLeu: 5.14 ± 0.34
1.803GlyMet: 1.803 ± 0.201
5.015GlyAsn: 5.015 ± 0.446
1.446GlyPro: 1.446 ± 0.159
2.124GlyGln: 2.124 ± 0.207
2.499GlyArg: 2.499 ± 0.202
6.336GlySer: 6.336 ± 0.546
6.425GlyThr: 6.425 ± 0.857
4.944GlyVal: 4.944 ± 0.325
1.142GlyTrp: 1.142 ± 0.137
3.427GlyTyr: 3.427 ± 0.315
0.0GlyXaa: 0.0 ± 0.0
His
0.643HisAla: 0.643 ± 0.129
0.143HisCys: 0.143 ± 0.051
0.982HisAsp: 0.982 ± 0.137
1.107HisGlu: 1.107 ± 0.15
0.785HisPhe: 0.785 ± 0.124
0.892HisGly: 0.892 ± 0.17
0.375HisHis: 0.375 ± 0.078
1.124HisIle: 1.124 ± 0.139
1.035HisLys: 1.035 ± 0.151
1.089HisLeu: 1.089 ± 0.144
0.375HisMet: 0.375 ± 0.079
0.714HisAsn: 0.714 ± 0.128
0.803HisPro: 0.803 ± 0.12
0.66HisGln: 0.66 ± 0.114
0.803HisArg: 0.803 ± 0.11
1.089HisSer: 1.089 ± 0.163
1.142HisThr: 1.142 ± 0.145
0.821HisVal: 0.821 ± 0.13
0.303HisTrp: 0.303 ± 0.078
1.053HisTyr: 1.053 ± 0.159
0.0HisXaa: 0.0 ± 0.0
Ile
3.998IleAla: 3.998 ± 0.287
1.089IleCys: 1.089 ± 0.137
6.193IleAsp: 6.193 ± 0.414
4.944IleGlu: 4.944 ± 0.341
2.606IlePhe: 2.606 ± 0.22
4.855IleGly: 4.855 ± 0.388
1.178IleHis: 1.178 ± 0.159
4.73IleIle: 4.73 ± 0.362
4.908IleLys: 4.908 ± 0.315
4.587IleLeu: 4.587 ± 0.276
1.321IleMet: 1.321 ± 0.141
3.784IleAsn: 3.784 ± 0.218
2.481IlePro: 2.481 ± 0.229
2.267IleGln: 2.267 ± 0.197
2.963IleArg: 2.963 ± 0.236
5.337IleSer: 5.337 ± 0.431
5.711IleThr: 5.711 ± 0.651
4.462IleVal: 4.462 ± 0.261
0.66IleTrp: 0.66 ± 0.105
2.534IleTyr: 2.534 ± 0.222
0.0IleXaa: 0.0 ± 0.0
Lys
3.231LysAla: 3.231 ± 0.347
0.625LysCys: 0.625 ± 0.116
3.677LysAsp: 3.677 ± 0.279
5.444LysGlu: 5.444 ± 0.451
2.891LysPhe: 2.891 ± 0.262
3.409LysGly: 3.409 ± 0.293
0.821LysHis: 0.821 ± 0.123
4.783LysIle: 4.783 ± 0.311
5.069LysLys: 5.069 ± 0.523
5.051LysLeu: 5.051 ± 0.362
1.838LysMet: 1.838 ± 0.235
3.927LysAsn: 3.927 ± 0.327
2.088LysPro: 2.088 ± 0.209
2.106LysGln: 2.106 ± 0.201
2.499LysArg: 2.499 ± 0.304
4.141LysSer: 4.141 ± 0.289
3.159LysThr: 3.159 ± 0.257
4.194LysVal: 4.194 ± 0.314
0.678LysTrp: 0.678 ± 0.1
3.106LysTyr: 3.106 ± 0.279
0.0LysXaa: 0.0 ± 0.0
Leu
4.658LeuAla: 4.658 ± 0.36
0.946LeuCys: 0.946 ± 0.161
5.926LeuAsp: 5.926 ± 0.273
4.837LeuGlu: 4.837 ± 0.337
2.981LeuPhe: 2.981 ± 0.196
4.284LeuGly: 4.284 ± 0.3
1.374LeuHis: 1.374 ± 0.172
4.426LeuIle: 4.426 ± 0.275
5.676LeuLys: 5.676 ± 0.404
4.783LeuLeu: 4.783 ± 0.452
1.892LeuMet: 1.892 ± 0.211
4.694LeuAsn: 4.694 ± 0.397
3.284LeuPro: 3.284 ± 0.262
2.106LeuGln: 2.106 ± 0.185
3.266LeuArg: 3.266 ± 0.234
5.801LeuSer: 5.801 ± 0.373
5.319LeuThr: 5.319 ± 0.516
3.962LeuVal: 3.962 ± 0.291
0.803LeuTrp: 0.803 ± 0.132
3.177LeuTyr: 3.177 ± 0.226
0.0LeuXaa: 0.0 ± 0.0
Met
1.428MetAla: 1.428 ± 0.15
0.375MetCys: 0.375 ± 0.085
1.267MetAsp: 1.267 ± 0.179
1.731MetGlu: 1.731 ± 0.177
0.785MetPhe: 0.785 ± 0.124
1.374MetGly: 1.374 ± 0.194
0.411MetHis: 0.411 ± 0.087
1.642MetIle: 1.642 ± 0.206
1.749MetLys: 1.749 ± 0.234
1.303MetLeu: 1.303 ± 0.172
0.732MetMet: 0.732 ± 0.119
1.66MetAsn: 1.66 ± 0.166
0.839MetPro: 0.839 ± 0.128
0.785MetGln: 0.785 ± 0.135
1.303MetArg: 1.303 ± 0.163
1.874MetSer: 1.874 ± 0.194
1.481MetThr: 1.481 ± 0.183
1.0MetVal: 1.0 ± 0.144
0.393MetTrp: 0.393 ± 0.088
0.91MetTyr: 0.91 ± 0.116
0.0MetXaa: 0.0 ± 0.0
Asn
3.052AsnAla: 3.052 ± 0.241
0.91AsnCys: 0.91 ± 0.142
3.516AsnAsp: 3.516 ± 0.204
3.677AsnGlu: 3.677 ± 0.318
2.963AsnPhe: 2.963 ± 0.255
4.98AsnGly: 4.98 ± 0.404
0.964AsnHis: 0.964 ± 0.125
4.873AsnIle: 4.873 ± 0.453
3.338AsnLys: 3.338 ± 0.28
5.586AsnLeu: 5.586 ± 0.493
1.142AsnMet: 1.142 ± 0.143
3.784AsnAsn: 3.784 ± 0.288
2.981AsnPro: 2.981 ± 0.221
2.106AsnGln: 2.106 ± 0.201
2.124AsnArg: 2.124 ± 0.186
4.194AsnSer: 4.194 ± 0.328
4.212AsnThr: 4.212 ± 0.341
4.034AsnVal: 4.034 ± 0.338
0.767AsnTrp: 0.767 ± 0.13
2.642AsnTyr: 2.642 ± 0.195
0.0AsnXaa: 0.0 ± 0.0
Pro
1.749ProAla: 1.749 ± 0.199
0.268ProCys: 0.268 ± 0.071
2.285ProAsp: 2.285 ± 0.197
2.945ProGlu: 2.945 ± 0.278
1.606ProPhe: 1.606 ± 0.148
2.874ProGly: 2.874 ± 0.281
0.589ProHis: 0.589 ± 0.096
2.374ProIle: 2.374 ± 0.197
2.195ProLys: 2.195 ± 0.195
2.463ProLeu: 2.463 ± 0.238
0.625ProMet: 0.625 ± 0.12
2.32ProAsn: 2.32 ± 0.204
1.571ProPro: 1.571 ± 0.225
1.339ProGln: 1.339 ± 0.147
1.16ProArg: 1.16 ± 0.139
3.141ProSer: 3.141 ± 0.237
2.552ProThr: 2.552 ± 0.186
2.195ProVal: 2.195 ± 0.202
0.411ProTrp: 0.411 ± 0.086
1.678ProTyr: 1.678 ± 0.192
0.0ProXaa: 0.0 ± 0.0
Gln
1.535GlnAla: 1.535 ± 0.21
0.303GlnCys: 0.303 ± 0.074
1.981GlnAsp: 1.981 ± 0.178
2.302GlnGlu: 2.302 ± 0.244
1.571GlnPhe: 1.571 ± 0.15
2.053GlnGly: 2.053 ± 0.181
0.5GlnHis: 0.5 ± 0.108
2.392GlnIle: 2.392 ± 0.2
2.302GlnLys: 2.302 ± 0.228
2.659GlnLeu: 2.659 ± 0.19
0.821GlnMet: 0.821 ± 0.143
1.892GlnAsn: 1.892 ± 0.169
0.91GlnPro: 0.91 ± 0.141
1.321GlnGln: 1.321 ± 0.164
1.356GlnArg: 1.356 ± 0.14
2.41GlnSer: 2.41 ± 0.174
1.713GlnThr: 1.713 ± 0.156
2.195GlnVal: 2.195 ± 0.202
0.464GlnTrp: 0.464 ± 0.097
1.428GlnTyr: 1.428 ± 0.179
0.0GlnXaa: 0.0 ± 0.0
Arg
2.07ArgAla: 2.07 ± 0.271
0.357ArgCys: 0.357 ± 0.068
2.856ArgAsp: 2.856 ± 0.214
2.517ArgGlu: 2.517 ± 0.278
1.588ArgPhe: 1.588 ± 0.163
2.552ArgGly: 2.552 ± 0.229
0.678ArgHis: 0.678 ± 0.121
3.248ArgIle: 3.248 ± 0.277
2.766ArgLys: 2.766 ± 0.269
3.195ArgLeu: 3.195 ± 0.263
1.071ArgMet: 1.071 ± 0.139
2.32ArgAsn: 2.32 ± 0.204
1.196ArgPro: 1.196 ± 0.158
1.499ArgGln: 1.499 ± 0.203
1.821ArgArg: 1.821 ± 0.221
2.517ArgSer: 2.517 ± 0.215
2.053ArgThr: 2.053 ± 0.184
2.106ArgVal: 2.106 ± 0.192
0.482ArgTrp: 0.482 ± 0.095
1.981ArgTyr: 1.981 ± 0.195
0.0ArgXaa: 0.0 ± 0.0
Ser
4.587SerAla: 4.587 ± 0.349
0.803SerCys: 0.803 ± 0.153
4.926SerAsp: 4.926 ± 0.311
4.105SerGlu: 4.105 ± 0.327
3.177SerPhe: 3.177 ± 0.336
6.836SerGly: 6.836 ± 0.492
1.0SerHis: 1.0 ± 0.133
5.051SerIle: 5.051 ± 0.405
4.23SerLys: 4.23 ± 0.326
5.872SerLeu: 5.872 ± 0.337
1.446SerMet: 1.446 ± 0.166
5.301SerAsn: 5.301 ± 0.368
2.874SerPro: 2.874 ± 0.222
2.374SerGln: 2.374 ± 0.219
2.338SerArg: 2.338 ± 0.198
7.461SerSer: 7.461 ± 0.707
5.122SerThr: 5.122 ± 0.389
4.819SerVal: 4.819 ± 0.305
0.857SerTrp: 0.857 ± 0.143
3.195SerTyr: 3.195 ± 0.31
0.0SerXaa: 0.0 ± 0.0
Thr
4.605ThrAla: 4.605 ± 0.844
0.643ThrCys: 0.643 ± 0.095
4.052ThrAsp: 4.052 ± 0.294
4.123ThrGlu: 4.123 ± 0.264
3.016ThrPhe: 3.016 ± 0.226
6.675ThrGly: 6.675 ± 0.945
0.892ThrHis: 0.892 ± 0.139
5.105ThrIle: 5.105 ± 0.441
3.48ThrLys: 3.48 ± 0.257
4.712ThrLeu: 4.712 ± 0.329
1.089ThrMet: 1.089 ± 0.15
4.533ThrAsn: 4.533 ± 0.481
2.82ThrPro: 2.82 ± 0.269
2.32ThrGln: 2.32 ± 0.209
2.142ThrArg: 2.142 ± 0.184
5.872ThrSer: 5.872 ± 0.505
6.515ThrThr: 6.515 ± 0.773
4.658ThrVal: 4.658 ± 0.379
0.767ThrTrp: 0.767 ± 0.17
2.856ThrTyr: 2.856 ± 0.247
0.0ThrXaa: 0.0 ± 0.0
Val
3.32ValAla: 3.32 ± 0.275
0.714ValCys: 0.714 ± 0.115
4.409ValAsp: 4.409 ± 0.243
4.069ValGlu: 4.069 ± 0.279
2.195ValPhe: 2.195 ± 0.23
5.14ValGly: 5.14 ± 0.467
1.035ValHis: 1.035 ± 0.154
4.176ValIle: 4.176 ± 0.261
3.909ValLys: 3.909 ± 0.302
4.034ValLeu: 4.034 ± 0.289
1.428ValMet: 1.428 ± 0.148
3.766ValAsn: 3.766 ± 0.333
2.302ValPro: 2.302 ± 0.205
1.642ValGln: 1.642 ± 0.168
2.713ValArg: 2.713 ± 0.224
5.729ValSer: 5.729 ± 0.294
5.087ValThr: 5.087 ± 0.437
3.909ValVal: 3.909 ± 0.308
0.553ValTrp: 0.553 ± 0.109
2.57ValTyr: 2.57 ± 0.218
0.0ValXaa: 0.0 ± 0.0
Trp
0.518TrpAla: 0.518 ± 0.081
0.125TrpCys: 0.125 ± 0.054
0.785TrpAsp: 0.785 ± 0.121
0.839TrpGlu: 0.839 ± 0.135
0.607TrpPhe: 0.607 ± 0.097
0.714TrpGly: 0.714 ± 0.106
0.161TrpHis: 0.161 ± 0.054
0.928TrpIle: 0.928 ± 0.141
1.053TrpLys: 1.053 ± 0.158
0.928TrpLeu: 0.928 ± 0.134
0.428TrpMet: 0.428 ± 0.088
0.785TrpAsn: 0.785 ± 0.118
0.321TrpPro: 0.321 ± 0.074
0.411TrpGln: 0.411 ± 0.089
0.464TrpArg: 0.464 ± 0.097
0.696TrpSer: 0.696 ± 0.137
0.857TrpThr: 0.857 ± 0.101
0.714TrpVal: 0.714 ± 0.116
0.196TrpTrp: 0.196 ± 0.062
0.5TrpTyr: 0.5 ± 0.099
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.356TyrAla: 2.356 ± 0.225
0.625TyrCys: 0.625 ± 0.104
3.659TyrAsp: 3.659 ± 0.255
2.534TyrGlu: 2.534 ± 0.25
2.356TyrPhe: 2.356 ± 0.269
3.391TyrGly: 3.391 ± 0.304
0.821TyrHis: 0.821 ± 0.113
2.766TyrIle: 2.766 ± 0.224
2.588TyrLys: 2.588 ± 0.257
3.391TyrLeu: 3.391 ± 0.229
0.91TyrMet: 0.91 ± 0.136
2.749TyrAsn: 2.749 ± 0.248
1.517TyrPro: 1.517 ± 0.155
1.356TyrGln: 1.356 ± 0.15
1.606TyrArg: 1.606 ± 0.183
3.266TyrSer: 3.266 ± 0.213
2.963TyrThr: 2.963 ± 0.223
2.784TyrVal: 2.784 ± 0.205
0.464TyrTrp: 0.464 ± 0.092
1.981TyrTyr: 1.981 ± 0.172
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 281 proteins (56029 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski