Amino acid dipepetide frequency for Synechococcus phage B3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.728AlaAla: 3.728 ± 0.518
0.534AlaCys: 0.534 ± 0.09
2.806AlaAsp: 2.806 ± 0.263
3.006AlaGlu: 3.006 ± 0.237
2.164AlaPhe: 2.164 ± 0.21
4.97AlaGly: 4.97 ± 1.213
0.842AlaHis: 0.842 ± 0.117
3.981AlaIle: 3.981 ± 0.24
3.1AlaLys: 3.1 ± 0.254
3.834AlaLeu: 3.834 ± 0.295
1.122AlaMet: 1.122 ± 0.143
3.354AlaAsn: 3.354 ± 0.383
1.79AlaPro: 1.79 ± 0.189
2.098AlaGln: 2.098 ± 0.194
2.084AlaArg: 2.084 ± 0.202
3.768AlaSer: 3.768 ± 0.229
5.264AlaThr: 5.264 ± 1.53
3.126AlaVal: 3.126 ± 0.192
0.468AlaTrp: 0.468 ± 0.071
2.298AlaTyr: 2.298 ± 0.184
0.027AlaXaa: 0.027 ± 0.019
Cys
0.561CysAla: 0.561 ± 0.098
0.174CysCys: 0.174 ± 0.054
0.949CysAsp: 0.949 ± 0.127
0.975CysGlu: 0.975 ± 0.129
0.601CysPhe: 0.601 ± 0.09
1.162CysGly: 1.162 ± 0.167
0.281CysHis: 0.281 ± 0.077
0.641CysIle: 0.641 ± 0.087
0.721CysLys: 0.721 ± 0.101
1.202CysLeu: 1.202 ± 0.133
0.267CysMet: 0.267 ± 0.061
0.548CysAsn: 0.548 ± 0.109
0.521CysPro: 0.521 ± 0.078
0.387CysGln: 0.387 ± 0.079
0.454CysArg: 0.454 ± 0.076
0.882CysSer: 0.882 ± 0.116
0.561CysThr: 0.561 ± 0.082
0.708CysVal: 0.708 ± 0.098
0.174CysTrp: 0.174 ± 0.048
0.641CysTyr: 0.641 ± 0.087
0.013CysXaa: 0.013 ± 0.015
Asp
2.913AspAla: 2.913 ± 0.23
0.842AspCys: 0.842 ± 0.1
3.701AspAsp: 3.701 ± 0.307
4.169AspGlu: 4.169 ± 0.319
3.287AspPhe: 3.287 ± 0.223
3.875AspGly: 3.875 ± 0.238
1.029AspHis: 1.029 ± 0.133
4.877AspIle: 4.877 ± 0.273
3.26AspLys: 3.26 ± 0.244
5.959AspLeu: 5.959 ± 0.296
1.363AspMet: 1.363 ± 0.15
3.447AspAsn: 3.447 ± 0.235
2.084AspPro: 2.084 ± 0.184
1.51AspGln: 1.51 ± 0.136
2.031AspArg: 2.031 ± 0.146
4.155AspSer: 4.155 ± 0.32
3.313AspThr: 3.313 ± 0.203
3.941AspVal: 3.941 ± 0.255
1.029AspTrp: 1.029 ± 0.12
3.3AspTyr: 3.3 ± 0.267
0.04AspXaa: 0.04 ± 0.029
Glu
3.1GluAla: 3.1 ± 0.208
0.962GluCys: 0.962 ± 0.127
3.915GluAsp: 3.915 ± 0.306
5.251GluGlu: 5.251 ± 0.446
3.046GluPhe: 3.046 ± 0.217
3.006GluGly: 3.006 ± 0.226
1.189GluHis: 1.189 ± 0.139
5.865GluIle: 5.865 ± 0.383
5.371GluLys: 5.371 ± 0.361
5.785GluLeu: 5.785 ± 0.356
1.39GluMet: 1.39 ± 0.153
4.756GluAsn: 4.756 ± 0.289
1.777GluPro: 1.777 ± 0.154
2.204GluGln: 2.204 ± 0.214
2.498GluArg: 2.498 ± 0.177
4.142GluSer: 4.142 ± 0.246
3.674GluThr: 3.674 ± 0.239
3.567GluVal: 3.567 ± 0.233
1.069GluTrp: 1.069 ± 0.154
3.647GluTyr: 3.647 ± 0.266
0.027GluXaa: 0.027 ± 0.016
Phe
3.06PheAla: 3.06 ± 0.948
0.668PheCys: 0.668 ± 0.112
3.647PheAsp: 3.647 ± 0.219
2.699PheGlu: 2.699 ± 0.205
1.911PhePhe: 1.911 ± 0.192
2.619PheGly: 2.619 ± 0.208
0.561PheHis: 0.561 ± 0.093
3.367PheIle: 3.367 ± 0.176
2.458PheLys: 2.458 ± 0.176
3.487PheLeu: 3.487 ± 0.21
0.989PheMet: 0.989 ± 0.126
3.073PheAsn: 3.073 ± 0.201
1.416PhePro: 1.416 ± 0.127
1.256PheGln: 1.256 ± 0.156
1.47PheArg: 1.47 ± 0.149
3.1PheSer: 3.1 ± 0.233
3.033PheThr: 3.033 ± 0.288
2.752PheVal: 2.752 ± 0.222
0.561PheTrp: 0.561 ± 0.096
2.178PheTyr: 2.178 ± 0.19
0.0PheXaa: 0.0 ± 0.0
Gly
3.407GlyAla: 3.407 ± 0.326
0.721GlyCys: 0.721 ± 0.109
3.621GlyAsp: 3.621 ± 0.282
3.447GlyGlu: 3.447 ± 0.201
2.979GlyPhe: 2.979 ± 0.26
4.369GlyGly: 4.369 ± 0.421
0.815GlyHis: 0.815 ± 0.117
6.253GlyIle: 6.253 ± 0.907
4.142GlyLys: 4.142 ± 0.328
4.142GlyLeu: 4.142 ± 0.284
0.962GlyMet: 0.962 ± 0.116
3.647GlyAsn: 3.647 ± 0.269
1.577GlyPro: 1.577 ± 0.155
2.218GlyGln: 2.218 ± 0.243
2.779GlyArg: 2.779 ± 0.24
3.981GlySer: 3.981 ± 0.273
4.235GlyThr: 4.235 ± 0.433
4.462GlyVal: 4.462 ± 0.298
0.855GlyTrp: 0.855 ± 0.105
3.808GlyTyr: 3.808 ± 0.449
0.0GlyXaa: 0.0 ± 0.0
His
0.588HisAla: 0.588 ± 0.09
0.347HisCys: 0.347 ± 0.065
1.136HisAsp: 1.136 ± 0.123
1.122HisGlu: 1.122 ± 0.129
0.655HisPhe: 0.655 ± 0.101
0.989HisGly: 0.989 ± 0.116
0.321HisHis: 0.321 ± 0.067
1.483HisIle: 1.483 ± 0.154
1.216HisLys: 1.216 ± 0.139
1.523HisLeu: 1.523 ± 0.168
0.374HisMet: 0.374 ± 0.081
1.122HisAsn: 1.122 ± 0.119
0.681HisPro: 0.681 ± 0.1
0.494HisGln: 0.494 ± 0.088
0.735HisArg: 0.735 ± 0.114
1.136HisSer: 1.136 ± 0.131
0.882HisThr: 0.882 ± 0.125
0.815HisVal: 0.815 ± 0.124
0.334HisTrp: 0.334 ± 0.087
1.109HisTyr: 1.109 ± 0.133
0.027HisXaa: 0.027 ± 0.016
Ile
4.649IleAla: 4.649 ± 0.666
0.909IleCys: 0.909 ± 0.11
5.184IleAsp: 5.184 ± 0.31
6.092IleGlu: 6.092 ± 0.339
2.913IlePhe: 2.913 ± 0.228
4.102IleGly: 4.102 ± 0.392
1.349IleHis: 1.349 ± 0.152
5.745IleIle: 5.745 ± 0.364
6.199IleLys: 6.199 ± 0.321
5.785IleLeu: 5.785 ± 0.298
1.323IleMet: 1.323 ± 0.128
5.264IleAsn: 5.264 ± 0.339
3.313IlePro: 3.313 ± 0.225
2.886IleGln: 2.886 ± 0.198
3.3IleArg: 3.3 ± 0.214
6.6IleSer: 6.6 ± 0.433
5.05IleThr: 5.05 ± 0.271
5.05IleVal: 5.05 ± 0.295
0.534IleTrp: 0.534 ± 0.086
3.1IleTyr: 3.1 ± 0.232
0.04IleXaa: 0.04 ± 0.026
Lys
3.313LysAla: 3.313 ± 0.29
0.788LysCys: 0.788 ± 0.113
3.474LysAsp: 3.474 ± 0.259
5.024LysGlu: 5.024 ± 0.376
2.766LysPhe: 2.766 ± 0.217
2.792LysGly: 2.792 ± 0.259
1.136LysHis: 1.136 ± 0.146
6.199LysIle: 6.199 ± 0.358
5.812LysLys: 5.812 ± 0.428
6.079LysLeu: 6.079 ± 0.358
1.67LysMet: 1.67 ± 0.203
4.89LysAsn: 4.89 ± 0.303
2.579LysPro: 2.579 ± 0.224
2.552LysGln: 2.552 ± 0.241
2.552LysArg: 2.552 ± 0.216
4.249LysSer: 4.249 ± 0.253
3.714LysThr: 3.714 ± 0.234
4.142LysVal: 4.142 ± 0.239
0.802LysTrp: 0.802 ± 0.125
3.581LysTyr: 3.581 ± 0.263
0.013LysXaa: 0.013 ± 0.013
Leu
4.302LeuAla: 4.302 ± 0.253
1.096LeuCys: 1.096 ± 0.143
5.064LeuAsp: 5.064 ± 0.312
5.611LeuGlu: 5.611 ± 0.383
2.979LeuPhe: 2.979 ± 0.208
4.195LeuGly: 4.195 ± 0.27
1.283LeuHis: 1.283 ± 0.134
5.945LeuIle: 5.945 ± 0.307
5.464LeuLys: 5.464 ± 0.358
5.972LeuLeu: 5.972 ± 0.357
2.138LeuMet: 2.138 ± 0.179
5.732LeuAsn: 5.732 ± 0.332
3.46LeuPro: 3.46 ± 0.2
2.792LeuGln: 2.792 ± 0.229
3.875LeuArg: 3.875 ± 0.307
5.518LeuSer: 5.518 ± 0.242
4.756LeuThr: 4.756 ± 0.253
4.77LeuVal: 4.77 ± 0.247
0.708LeuTrp: 0.708 ± 0.093
3.621LeuTyr: 3.621 ± 0.205
0.053LeuXaa: 0.053 ± 0.03
Met
1.403MetAla: 1.403 ± 0.141
0.347MetCys: 0.347 ± 0.07
0.949MetAsp: 0.949 ± 0.123
1.176MetGlu: 1.176 ± 0.152
0.909MetPhe: 0.909 ± 0.116
0.935MetGly: 0.935 ± 0.11
0.347MetHis: 0.347 ± 0.067
1.657MetIle: 1.657 ± 0.167
2.191MetLys: 2.191 ± 0.191
1.323MetLeu: 1.323 ± 0.168
0.428MetMet: 0.428 ± 0.084
1.349MetAsn: 1.349 ± 0.156
0.615MetPro: 0.615 ± 0.089
0.468MetGln: 0.468 ± 0.084
0.922MetArg: 0.922 ± 0.113
1.777MetSer: 1.777 ± 0.19
1.536MetThr: 1.536 ± 0.198
1.002MetVal: 1.002 ± 0.136
0.321MetTrp: 0.321 ± 0.061
0.628MetTyr: 0.628 ± 0.115
0.0MetXaa: 0.0 ± 0.0
Asn
2.739AsnAla: 2.739 ± 0.209
0.989AsnCys: 0.989 ± 0.124
3.581AsnAsp: 3.581 ± 0.231
3.567AsnGlu: 3.567 ± 0.24
2.899AsnPhe: 2.899 ± 0.196
4.262AsnGly: 4.262 ± 0.361
1.176AsnHis: 1.176 ± 0.109
5.745AsnIle: 5.745 ± 0.397
4.583AsnLys: 4.583 ± 0.315
5.905AsnLeu: 5.905 ± 0.432
1.309AsnMet: 1.309 ± 0.153
3.888AsnAsn: 3.888 ± 0.267
3.354AsnPro: 3.354 ± 0.214
2.432AsnGln: 2.432 ± 0.19
2.699AsnArg: 2.699 ± 0.188
4.529AsnSer: 4.529 ± 0.311
4.222AsnThr: 4.222 ± 0.25
3.567AsnVal: 3.567 ± 0.267
0.748AsnTrp: 0.748 ± 0.098
3.447AsnTyr: 3.447 ± 0.231
0.053AsnXaa: 0.053 ± 0.03
Pro
1.617ProAla: 1.617 ± 0.186
0.387ProCys: 0.387 ± 0.064
2.164ProAsp: 2.164 ± 0.18
2.993ProGlu: 2.993 ± 0.209
1.51ProPhe: 1.51 ± 0.133
2.605ProGly: 2.605 ± 0.2
0.962ProHis: 0.962 ± 0.147
2.939ProIle: 2.939 ± 0.266
2.498ProLys: 2.498 ± 0.196
2.565ProLeu: 2.565 ± 0.221
0.775ProMet: 0.775 ± 0.102
3.006ProAsn: 3.006 ± 0.201
1.67ProPro: 1.67 ± 0.169
1.162ProGln: 1.162 ± 0.135
1.523ProArg: 1.523 ± 0.151
2.458ProSer: 2.458 ± 0.2
2.806ProThr: 2.806 ± 0.205
2.311ProVal: 2.311 ± 0.162
0.468ProTrp: 0.468 ± 0.093
1.911ProTyr: 1.911 ± 0.176
0.027ProXaa: 0.027 ± 0.025
Gln
1.443GlnAla: 1.443 ± 0.171
0.401GlnCys: 0.401 ± 0.092
1.764GlnAsp: 1.764 ± 0.165
2.432GlnGlu: 2.432 ± 0.224
1.67GlnPhe: 1.67 ± 0.15
1.563GlnGly: 1.563 ± 0.17
0.588GlnHis: 0.588 ± 0.081
2.979GlnIle: 2.979 ± 0.168
2.726GlnLys: 2.726 ± 0.296
3.06GlnLeu: 3.06 ± 0.215
0.708GlnMet: 0.708 ± 0.108
2.138GlnAsn: 2.138 ± 0.193
1.176GlnPro: 1.176 ± 0.128
1.336GlnGln: 1.336 ± 0.155
1.523GlnArg: 1.523 ± 0.154
2.098GlnSer: 2.098 ± 0.2
1.924GlnThr: 1.924 ± 0.193
1.897GlnVal: 1.897 ± 0.143
0.548GlnTrp: 0.548 ± 0.079
1.75GlnTyr: 1.75 ± 0.146
0.013GlnXaa: 0.013 ± 0.013
Arg
2.445ArgAla: 2.445 ± 0.251
0.387ArgCys: 0.387 ± 0.077
2.365ArgAsp: 2.365 ± 0.202
2.659ArgGlu: 2.659 ± 0.189
1.79ArgPhe: 1.79 ± 0.139
2.258ArgGly: 2.258 ± 0.203
0.641ArgHis: 0.641 ± 0.118
3.207ArgIle: 3.207 ± 0.187
2.873ArgLys: 2.873 ± 0.226
3.14ArgLeu: 3.14 ± 0.214
0.922ArgMet: 0.922 ± 0.113
2.485ArgAsn: 2.485 ± 0.189
1.47ArgPro: 1.47 ± 0.138
1.336ArgGln: 1.336 ± 0.142
2.111ArgArg: 2.111 ± 0.168
2.979ArgSer: 2.979 ± 0.202
2.258ArgThr: 2.258 ± 0.217
2.418ArgVal: 2.418 ± 0.181
0.508ArgTrp: 0.508 ± 0.083
2.004ArgTyr: 2.004 ± 0.181
0.0ArgXaa: 0.0 ± 0.0
Ser
4.048SerAla: 4.048 ± 0.411
0.788SerCys: 0.788 ± 0.115
4.035SerAsp: 4.035 ± 0.266
3.701SerGlu: 3.701 ± 0.232
3.046SerPhe: 3.046 ± 0.181
5.839SerGly: 5.839 ± 0.469
1.136SerHis: 1.136 ± 0.135
5.545SerIle: 5.545 ± 0.397
4.329SerLys: 4.329 ± 0.292
5.411SerLeu: 5.411 ± 0.29
1.403SerMet: 1.403 ± 0.169
4.422SerAsn: 4.422 ± 0.293
2.779SerPro: 2.779 ± 0.222
2.258SerGln: 2.258 ± 0.181
2.873SerArg: 2.873 ± 0.205
6.026SerSer: 6.026 ± 0.469
4.743SerThr: 4.743 ± 0.374
4.289SerVal: 4.289 ± 0.308
0.641SerTrp: 0.641 ± 0.098
3.474SerTyr: 3.474 ± 0.239
0.013SerXaa: 0.013 ± 0.013
Thr
4.422ThrAla: 4.422 ± 0.7
0.481ThrCys: 0.481 ± 0.093
3.153ThrAsp: 3.153 ± 0.252
4.075ThrGlu: 4.075 ± 0.259
3.968ThrPhe: 3.968 ± 0.747
4.743ThrGly: 4.743 ± 0.45
1.269ThrHis: 1.269 ± 0.131
4.73ThrIle: 4.73 ± 0.243
3.46ThrLys: 3.46 ± 0.204
4.863ThrLeu: 4.863 ± 0.273
0.762ThrMet: 0.762 ± 0.096
3.888ThrAsn: 3.888 ± 0.316
3.487ThrPro: 3.487 ± 0.262
2.218ThrGln: 2.218 ± 0.222
1.964ThrArg: 1.964 ± 0.146
4.195ThrSer: 4.195 ± 0.321
5.144ThrThr: 5.144 ± 0.619
3.968ThrVal: 3.968 ± 0.284
0.494ThrTrp: 0.494 ± 0.09
2.899ThrTyr: 2.899 ± 0.253
0.04ThrXaa: 0.04 ± 0.027
Val
3.233ValAla: 3.233 ± 0.272
0.668ValCys: 0.668 ± 0.087
4.449ValAsp: 4.449 ± 0.265
4.275ValGlu: 4.275 ± 0.295
2.445ValPhe: 2.445 ± 0.201
4.663ValGly: 4.663 ± 0.4
0.975ValHis: 0.975 ± 0.139
4.262ValIle: 4.262 ± 0.241
3.968ValLys: 3.968 ± 0.255
4.342ValLeu: 4.342 ± 0.243
1.162ValMet: 1.162 ± 0.147
4.329ValAsn: 4.329 ± 0.265
2.378ValPro: 2.378 ± 0.192
1.977ValGln: 1.977 ± 0.15
2.351ValArg: 2.351 ± 0.176
4.556ValSer: 4.556 ± 0.347
3.781ValThr: 3.781 ± 0.318
4.235ValVal: 4.235 ± 0.308
0.802ValTrp: 0.802 ± 0.103
2.432ValTyr: 2.432 ± 0.212
0.0ValXaa: 0.0 ± 0.0
Trp
0.708TrpAla: 0.708 ± 0.1
0.321TrpCys: 0.321 ± 0.074
0.815TrpAsp: 0.815 ± 0.119
0.828TrpGlu: 0.828 ± 0.118
0.468TrpPhe: 0.468 ± 0.085
0.748TrpGly: 0.748 ± 0.106
0.281TrpHis: 0.281 ± 0.061
0.641TrpIle: 0.641 ± 0.089
0.748TrpLys: 0.748 ± 0.079
0.949TrpLeu: 0.949 ± 0.119
0.307TrpMet: 0.307 ± 0.068
0.655TrpAsn: 0.655 ± 0.087
0.321TrpPro: 0.321 ± 0.066
0.441TrpGln: 0.441 ± 0.089
0.575TrpArg: 0.575 ± 0.078
0.721TrpSer: 0.721 ± 0.093
0.655TrpThr: 0.655 ± 0.086
1.029TrpVal: 1.029 ± 0.131
0.334TrpTrp: 0.334 ± 0.072
0.561TrpTyr: 0.561 ± 0.101
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.672TyrAla: 2.672 ± 0.494
0.615TyrCys: 0.615 ± 0.082
3.34TyrAsp: 3.34 ± 0.212
3.073TyrGlu: 3.073 ± 0.241
2.285TyrPhe: 2.285 ± 0.225
2.699TyrGly: 2.699 ± 0.214
0.909TyrHis: 0.909 ± 0.137
3.287TyrIle: 3.287 ± 0.209
3.019TyrLys: 3.019 ± 0.214
3.995TyrLeu: 3.995 ± 0.216
0.882TyrMet: 0.882 ± 0.104
3.634TyrAsn: 3.634 ± 0.253
1.857TyrPro: 1.857 ± 0.17
1.697TyrGln: 1.697 ± 0.143
1.937TyrArg: 1.937 ± 0.165
3.915TyrSer: 3.915 ± 0.292
2.645TyrThr: 2.645 ± 0.18
3.22TyrVal: 3.22 ± 0.212
0.708TyrTrp: 0.708 ± 0.105
2.659TyrTyr: 2.659 ± 0.203
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.013XaaAla: 0.013 ± 0.013
0.0XaaCys: 0.0 ± 0.0
0.013XaaAsp: 0.013 ± 0.013
0.067XaaGlu: 0.067 ± 0.063
0.013XaaPhe: 0.013 ± 0.013
0.027XaaGly: 0.027 ± 0.025
0.0XaaHis: 0.0 ± 0.0
0.04XaaIle: 0.04 ± 0.026
0.0XaaLys: 0.0 ± 0.0
0.013XaaLeu: 0.013 ± 0.013
0.013XaaMet: 0.013 ± 0.013
0.067XaaAsn: 0.067 ± 0.032
0.053XaaPro: 0.053 ± 0.03
0.013XaaGln: 0.013 ± 0.016
0.0XaaArg: 0.0 ± 0.0
0.013XaaSer: 0.013 ± 0.013
0.027XaaThr: 0.027 ± 0.025
0.013XaaVal: 0.013 ± 0.013
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.013XaaXaa: 0.013 ± 0.013
Statistics based on 370 proteins (74848 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski