Amino acid dipepetide frequency for Halocynthia phage JM-2012

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.766AlaAla: 3.766 ± 0.332
0.637AlaCys: 0.637 ± 0.122
3.322AlaAsp: 3.322 ± 0.311
3.052AlaGlu: 3.052 ± 0.232
1.796AlaPhe: 1.796 ± 0.235
3.187AlaGly: 3.187 ± 0.275
0.966AlaHis: 0.966 ± 0.153
3.843AlaIle: 3.843 ± 0.264
4.597AlaLys: 4.597 ± 0.316
5.331AlaLeu: 5.331 ± 0.3
1.68AlaMet: 1.68 ± 0.182
3.534AlaAsn: 3.534 ± 0.285
1.449AlaPro: 1.449 ± 0.18
1.564AlaGln: 1.564 ± 0.192
1.642AlaArg: 1.642 ± 0.172
4.114AlaSer: 4.114 ± 0.317
3.38AlaThr: 3.38 ± 0.289
3.728AlaVal: 3.728 ± 0.258
0.56AlaTrp: 0.56 ± 0.117
1.777AlaTyr: 1.777 ± 0.162
0.0AlaXaa: 0.0 ± 0.0
Cys
0.27CysAla: 0.27 ± 0.095
0.097CysCys: 0.097 ± 0.043
0.444CysAsp: 0.444 ± 0.088
0.464CysGlu: 0.464 ± 0.1
0.502CysPhe: 0.502 ± 0.101
0.618CysGly: 0.618 ± 0.123
0.193CysHis: 0.193 ± 0.075
0.715CysIle: 0.715 ± 0.123
0.637CysLys: 0.637 ± 0.097
0.715CysLeu: 0.715 ± 0.116
0.212CysMet: 0.212 ± 0.057
0.579CysAsn: 0.579 ± 0.114
0.29CysPro: 0.29 ± 0.079
0.174CysGln: 0.174 ± 0.079
0.406CysArg: 0.406 ± 0.1
0.927CysSer: 0.927 ± 0.151
0.85CysThr: 0.85 ± 0.14
0.425CysVal: 0.425 ± 0.088
0.019CysTrp: 0.019 ± 0.023
0.386CysTyr: 0.386 ± 0.084
0.0CysXaa: 0.0 ± 0.0
Asp
3.747AspAla: 3.747 ± 0.235
0.464AspCys: 0.464 ± 0.107
4.674AspAsp: 4.674 ± 0.346
4.346AspGlu: 4.346 ± 0.282
3.052AspPhe: 3.052 ± 0.263
4.906AspGly: 4.906 ± 0.28
0.753AspHis: 0.753 ± 0.128
6.354AspIle: 6.354 ± 0.387
5.524AspLys: 5.524 ± 0.37
6.509AspLeu: 6.509 ± 0.367
1.333AspMet: 1.333 ± 0.154
4.519AspAsn: 4.519 ± 0.309
1.7AspPro: 1.7 ± 0.22
0.792AspGln: 0.792 ± 0.132
2.318AspArg: 2.318 ± 0.26
4.384AspSer: 4.384 ± 0.267
4.365AspThr: 4.365 ± 0.295
4.925AspVal: 4.925 ± 0.324
0.541AspTrp: 0.541 ± 0.095
3.09AspTyr: 3.09 ± 0.222
0.0AspXaa: 0.0 ± 0.0
Glu
3.766GluAla: 3.766 ± 0.274
0.676GluCys: 0.676 ± 0.133
4.751GluAsp: 4.751 ± 0.302
4.597GluGlu: 4.597 ± 0.375
3.11GluPhe: 3.11 ± 0.232
3.534GluGly: 3.534 ± 0.263
1.082GluHis: 1.082 ± 0.153
4.095GluIle: 4.095 ± 0.258
2.897GluLys: 2.897 ± 0.257
8.614GluLeu: 8.614 ± 0.435
1.68GluMet: 1.68 ± 0.189
2.279GluAsn: 2.279 ± 0.185
1.796GluPro: 1.796 ± 0.182
1.545GluGln: 1.545 ± 0.17
2.955GluArg: 2.955 ± 0.243
4.751GluSer: 4.751 ± 0.321
3.65GluThr: 3.65 ± 0.246
5.833GluVal: 5.833 ± 0.347
0.637GluTrp: 0.637 ± 0.125
3.11GluTyr: 3.11 ± 0.192
0.0GluXaa: 0.0 ± 0.0
Phe
1.854PheAla: 1.854 ± 0.218
0.483PheCys: 0.483 ± 0.092
2.665PheAsp: 2.665 ± 0.229
1.931PheGlu: 1.931 ± 0.192
1.333PhePhe: 1.333 ± 0.178
2.569PheGly: 2.569 ± 0.199
0.579PheHis: 0.579 ± 0.115
2.627PheIle: 2.627 ± 0.226
3.65PheLys: 3.65 ± 0.243
2.511PheLeu: 2.511 ± 0.24
0.888PheMet: 0.888 ± 0.14
3.438PheAsn: 3.438 ± 0.249
0.966PhePro: 0.966 ± 0.13
0.502PheGln: 0.502 ± 0.101
1.12PheArg: 1.12 ± 0.142
3.032PheSer: 3.032 ± 0.226
2.723PheThr: 2.723 ± 0.211
2.24PheVal: 2.24 ± 0.213
0.232PheTrp: 0.232 ± 0.064
1.7PheTyr: 1.7 ± 0.198
0.0PheXaa: 0.0 ± 0.0
Gly
2.646GlyAla: 2.646 ± 0.23
0.502GlyCys: 0.502 ± 0.109
4.172GlyAsp: 4.172 ± 0.27
4.191GlyGlu: 4.191 ± 0.307
2.202GlyPhe: 2.202 ± 0.207
3.979GlyGly: 3.979 ± 0.332
0.85GlyHis: 0.85 ± 0.107
4.964GlyIle: 4.964 ± 0.343
4.307GlyLys: 4.307 ± 0.498
5.717GlyLeu: 5.717 ± 0.305
2.125GlyMet: 2.125 ± 0.286
3.399GlyAsn: 3.399 ± 0.297
0.869GlyPro: 0.869 ± 0.118
1.449GlyGln: 1.449 ± 0.178
2.202GlyArg: 2.202 ± 0.225
4.404GlySer: 4.404 ± 0.308
3.901GlyThr: 3.901 ± 0.246
4.037GlyVal: 4.037 ± 0.268
0.676GlyTrp: 0.676 ± 0.114
2.492GlyTyr: 2.492 ± 0.181
0.0GlyXaa: 0.0 ± 0.0
His
0.734HisAla: 0.734 ± 0.14
0.155HisCys: 0.155 ± 0.054
1.12HisAsp: 1.12 ± 0.147
1.024HisGlu: 1.024 ± 0.159
0.56HisPhe: 0.56 ± 0.114
0.985HisGly: 0.985 ± 0.122
0.541HisHis: 0.541 ± 0.107
1.603HisIle: 1.603 ± 0.144
1.255HisLys: 1.255 ± 0.183
2.028HisLeu: 2.028 ± 0.221
0.521HisMet: 0.521 ± 0.096
1.159HisAsn: 1.159 ± 0.156
0.869HisPro: 0.869 ± 0.164
0.599HisGln: 0.599 ± 0.12
1.082HisArg: 1.082 ± 0.149
1.294HisSer: 1.294 ± 0.161
1.255HisThr: 1.255 ± 0.163
1.275HisVal: 1.275 ± 0.165
0.155HisTrp: 0.155 ± 0.051
0.637HisTyr: 0.637 ± 0.105
0.0HisXaa: 0.0 ± 0.0
Ile
3.515IleAla: 3.515 ± 0.258
0.792IleCys: 0.792 ± 0.117
5.64IleAsp: 5.64 ± 0.332
5.195IleGlu: 5.195 ± 0.343
2.105IlePhe: 2.105 ± 0.212
3.515IleGly: 3.515 ± 0.234
1.391IleHis: 1.391 ± 0.155
4.095IleIle: 4.095 ± 0.296
5.041IleLys: 5.041 ± 0.258
6.084IleLeu: 6.084 ± 0.348
1.468IleMet: 1.468 ± 0.191
5.331IleAsn: 5.331 ± 0.312
3.206IlePro: 3.206 ± 0.252
1.642IleGln: 1.642 ± 0.186
3.225IleArg: 3.225 ± 0.278
5.524IleSer: 5.524 ± 0.409
6.238IleThr: 6.238 ± 0.329
4.404IleVal: 4.404 ± 0.296
0.599IleTrp: 0.599 ± 0.126
2.897IleTyr: 2.897 ± 0.273
0.0IleXaa: 0.0 ± 0.0
Lys
4.5LysAla: 4.5 ± 0.412
0.541LysCys: 0.541 ± 0.114
5.253LysAsp: 5.253 ± 0.401
5.002LysGlu: 5.002 ± 0.33
2.839LysPhe: 2.839 ± 0.223
3.612LysGly: 3.612 ± 0.287
1.584LysHis: 1.584 ± 0.171
3.786LysIle: 3.786 ± 0.26
3.438LysLys: 3.438 ± 0.297
8.035LysLeu: 8.035 ± 0.44
1.506LysMet: 1.506 ± 0.189
3.013LysAsn: 3.013 ± 0.235
2.53LysPro: 2.53 ± 0.229
1.642LysGln: 1.642 ± 0.197
2.858LysArg: 2.858 ± 0.24
4.404LysSer: 4.404 ± 0.304
3.959LysThr: 3.959 ± 0.317
5.195LysVal: 5.195 ± 0.297
0.464LysTrp: 0.464 ± 0.092
3.071LysTyr: 3.071 ± 0.26
0.0LysXaa: 0.0 ± 0.0
Leu
6.219LeuAla: 6.219 ± 0.403
0.811LeuCys: 0.811 ± 0.158
7.03LeuAsp: 7.03 ± 0.378
6.876LeuGlu: 6.876 ± 0.417
3.071LeuPhe: 3.071 ± 0.224
5.215LeuGly: 5.215 ± 0.406
2.047LeuHis: 2.047 ± 0.23
6.876LeuIle: 6.876 ± 0.375
6.045LeuLys: 6.045 ± 0.409
8.46LeuLeu: 8.46 ± 0.447
2.492LeuMet: 2.492 ± 0.229
7.223LeuAsn: 7.223 ± 0.374
4.017LeuPro: 4.017 ± 0.337
2.125LeuGln: 2.125 ± 0.177
4.307LeuArg: 4.307 ± 0.26
7.861LeuSer: 7.861 ± 0.463
7.436LeuThr: 7.436 ± 0.388
6.837LeuVal: 6.837 ± 0.357
0.715LeuTrp: 0.715 ± 0.119
3.708LeuTyr: 3.708 ± 0.285
0.0LeuXaa: 0.0 ± 0.0
Met
1.873MetAla: 1.873 ± 0.182
0.174MetCys: 0.174 ± 0.061
1.603MetAsp: 1.603 ± 0.158
1.622MetGlu: 1.622 ± 0.188
1.004MetPhe: 1.004 ± 0.146
1.835MetGly: 1.835 ± 0.219
0.386MetHis: 0.386 ± 0.078
1.275MetIle: 1.275 ± 0.147
1.468MetLys: 1.468 ± 0.161
2.453MetLeu: 2.453 ± 0.203
0.773MetMet: 0.773 ± 0.124
1.294MetAsn: 1.294 ± 0.125
1.004MetPro: 1.004 ± 0.163
0.811MetGln: 0.811 ± 0.138
1.082MetArg: 1.082 ± 0.172
2.318MetSer: 2.318 ± 0.193
1.178MetThr: 1.178 ± 0.169
1.68MetVal: 1.68 ± 0.18
0.135MetTrp: 0.135 ± 0.051
0.966MetTyr: 0.966 ± 0.127
0.0MetXaa: 0.0 ± 0.0
Asn
3.264AsnAla: 3.264 ± 0.199
0.502AsnCys: 0.502 ± 0.1
3.901AsnAsp: 3.901 ± 0.268
3.419AsnGlu: 3.419 ± 0.23
2.337AsnPhe: 2.337 ± 0.233
3.477AsnGly: 3.477 ± 0.301
1.255AsnHis: 1.255 ± 0.145
4.635AsnIle: 4.635 ± 0.273
4.249AsnLys: 4.249 ± 0.276
6.219AsnLeu: 6.219 ± 0.371
1.178AsnMet: 1.178 ± 0.148
4.075AsnAsn: 4.075 ± 0.286
2.781AsnPro: 2.781 ± 0.224
2.279AsnGln: 2.279 ± 0.254
2.434AsnArg: 2.434 ± 0.216
4.346AsnSer: 4.346 ± 0.282
4.423AsnThr: 4.423 ± 0.326
4.539AsnVal: 4.539 ± 0.324
0.464AsnTrp: 0.464 ± 0.096
2.762AsnTyr: 2.762 ± 0.258
0.0AsnXaa: 0.0 ± 0.0
Pro
1.564ProAla: 1.564 ± 0.2
0.328ProCys: 0.328 ± 0.074
2.182ProAsp: 2.182 ± 0.227
2.916ProGlu: 2.916 ± 0.246
1.371ProPhe: 1.371 ± 0.178
1.313ProGly: 1.313 ± 0.179
0.618ProHis: 0.618 ± 0.112
2.511ProIle: 2.511 ± 0.235
2.414ProLys: 2.414 ± 0.237
3.09ProLeu: 3.09 ± 0.272
0.734ProMet: 0.734 ± 0.14
2.569ProAsn: 2.569 ± 0.278
0.831ProPro: 0.831 ± 0.129
0.831ProGln: 0.831 ± 0.121
1.294ProArg: 1.294 ± 0.19
2.511ProSer: 2.511 ± 0.251
2.916ProThr: 2.916 ± 0.246
2.453ProVal: 2.453 ± 0.239
0.232ProTrp: 0.232 ± 0.072
1.313ProTyr: 1.313 ± 0.162
0.0ProXaa: 0.0 ± 0.0
Gln
1.41GlnAla: 1.41 ± 0.192
0.232GlnCys: 0.232 ± 0.062
1.352GlnAsp: 1.352 ± 0.127
1.854GlnGlu: 1.854 ± 0.191
1.043GlnPhe: 1.043 ± 0.16
1.12GlnGly: 1.12 ± 0.136
0.618GlnHis: 0.618 ± 0.097
1.622GlnIle: 1.622 ± 0.192
1.004GlnLys: 1.004 ± 0.14
3.322GlnLeu: 3.322 ± 0.337
0.811GlnMet: 0.811 ± 0.118
0.946GlnAsn: 0.946 ± 0.133
0.753GlnPro: 0.753 ± 0.131
0.946GlnGln: 0.946 ± 0.18
1.12GlnArg: 1.12 ± 0.157
2.028GlnSer: 2.028 ± 0.184
1.68GlnThr: 1.68 ± 0.214
1.719GlnVal: 1.719 ± 0.167
0.212GlnTrp: 0.212 ± 0.061
1.506GlnTyr: 1.506 ± 0.142
0.0GlnXaa: 0.0 ± 0.0
Arg
2.047ArgAla: 2.047 ± 0.163
0.425ArgCys: 0.425 ± 0.093
2.627ArgAsp: 2.627 ± 0.234
2.839ArgGlu: 2.839 ± 0.205
1.719ArgPhe: 1.719 ± 0.162
2.569ArgGly: 2.569 ± 0.228
0.85ArgHis: 0.85 ± 0.137
2.781ArgIle: 2.781 ± 0.286
2.762ArgLys: 2.762 ± 0.249
4.326ArgLeu: 4.326 ± 0.304
1.178ArgMet: 1.178 ± 0.134
2.743ArgAsn: 2.743 ± 0.219
1.062ArgPro: 1.062 ± 0.142
1.12ArgGln: 1.12 ± 0.149
1.758ArgArg: 1.758 ± 0.189
2.376ArgSer: 2.376 ± 0.254
2.685ArgThr: 2.685 ± 0.212
2.762ArgVal: 2.762 ± 0.259
0.212ArgTrp: 0.212 ± 0.073
1.449ArgTyr: 1.449 ± 0.196
0.0ArgXaa: 0.0 ± 0.0
Ser
3.419SerAla: 3.419 ± 0.297
0.541SerCys: 0.541 ± 0.123
4.732SerAsp: 4.732 ± 0.32
4.5SerGlu: 4.5 ± 0.268
2.376SerPhe: 2.376 ± 0.223
4.5SerGly: 4.5 ± 0.38
1.275SerHis: 1.275 ± 0.175
6.026SerIle: 6.026 ± 0.335
5.871SerLys: 5.871 ± 0.321
7.146SerLeu: 7.146 ± 0.466
2.067SerMet: 2.067 ± 0.288
4.809SerAsn: 4.809 ± 0.303
2.549SerPro: 2.549 ± 0.232
1.719SerGln: 1.719 ± 0.205
2.955SerArg: 2.955 ± 0.222
5.195SerSer: 5.195 ± 0.386
5.331SerThr: 5.331 ± 0.272
4.655SerVal: 4.655 ± 0.278
0.869SerTrp: 0.869 ± 0.136
2.955SerTyr: 2.955 ± 0.251
0.0SerXaa: 0.0 ± 0.0
Thr
3.554ThrAla: 3.554 ± 0.268
0.425ThrCys: 0.425 ± 0.095
4.674ThrAsp: 4.674 ± 0.341
4.191ThrGlu: 4.191 ± 0.309
2.685ThrPhe: 2.685 ± 0.201
4.191ThrGly: 4.191 ± 0.276
1.41ThrHis: 1.41 ± 0.177
5.447ThrIle: 5.447 ± 0.339
4.21ThrLys: 4.21 ± 0.259
7.339ThrLeu: 7.339 ± 0.462
1.391ThrMet: 1.391 ± 0.167
3.979ThrAsn: 3.979 ± 0.253
2.607ThrPro: 2.607 ± 0.256
2.144ThrGln: 2.144 ± 0.179
2.549ThrArg: 2.549 ± 0.242
4.597ThrSer: 4.597 ± 0.296
5.022ThrThr: 5.022 ± 0.353
5.331ThrVal: 5.331 ± 0.396
0.811ThrTrp: 0.811 ± 0.118
2.549ThrTyr: 2.549 ± 0.22
0.0ThrXaa: 0.0 ± 0.0
Val
3.612ValAla: 3.612 ± 0.25
0.599ValCys: 0.599 ± 0.105
4.964ValAsp: 4.964 ± 0.33
4.597ValGlu: 4.597 ± 0.349
2.105ValPhe: 2.105 ± 0.18
4.848ValGly: 4.848 ± 0.384
1.082ValHis: 1.082 ± 0.152
4.751ValIle: 4.751 ± 0.31
5.273ValLys: 5.273 ± 0.308
6.451ValLeu: 6.451 ± 0.304
1.816ValMet: 1.816 ± 0.154
4.693ValAsn: 4.693 ± 0.27
2.839ValPro: 2.839 ± 0.233
1.468ValGln: 1.468 ± 0.174
2.414ValArg: 2.414 ± 0.217
5.35ValSer: 5.35 ± 0.339
5.389ValThr: 5.389 ± 0.342
4.867ValVal: 4.867 ± 0.394
0.502ValTrp: 0.502 ± 0.1
2.878ValTyr: 2.878 ± 0.316
0.0ValXaa: 0.0 ± 0.0
Trp
0.464TrpAla: 0.464 ± 0.098
0.155TrpCys: 0.155 ± 0.069
0.676TrpAsp: 0.676 ± 0.114
0.599TrpGlu: 0.599 ± 0.095
0.29TrpPhe: 0.29 ± 0.08
0.541TrpGly: 0.541 ± 0.118
0.097TrpHis: 0.097 ± 0.05
0.521TrpIle: 0.521 ± 0.087
0.29TrpLys: 0.29 ± 0.08
0.85TrpLeu: 0.85 ± 0.146
0.251TrpMet: 0.251 ± 0.07
0.56TrpAsn: 0.56 ± 0.106
0.135TrpPro: 0.135 ± 0.052
0.406TrpGln: 0.406 ± 0.087
0.406TrpArg: 0.406 ± 0.101
0.579TrpSer: 0.579 ± 0.097
0.348TrpThr: 0.348 ± 0.072
0.85TrpVal: 0.85 ± 0.126
0.077TrpTrp: 0.077 ± 0.033
0.464TrpTyr: 0.464 ± 0.096
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.7TyrAla: 1.7 ± 0.149
0.386TyrCys: 0.386 ± 0.085
2.511TyrAsp: 2.511 ± 0.223
2.144TyrGlu: 2.144 ± 0.228
1.545TyrPhe: 1.545 ± 0.192
2.569TyrGly: 2.569 ± 0.211
1.275TyrHis: 1.275 ± 0.161
3.399TyrIle: 3.399 ± 0.261
2.298TyrLys: 2.298 ± 0.231
4.172TyrLeu: 4.172 ± 0.373
0.811TyrMet: 0.811 ± 0.114
2.395TyrAsn: 2.395 ± 0.224
1.738TyrPro: 1.738 ± 0.195
1.545TyrGln: 1.545 ± 0.198
2.163TyrArg: 2.163 ± 0.207
3.496TyrSer: 3.496 ± 0.253
2.376TyrThr: 2.376 ± 0.227
2.743TyrVal: 2.743 ± 0.25
0.444TyrTrp: 0.444 ± 0.1
1.603TyrTyr: 1.603 ± 0.203
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 173 proteins (51777 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski