Amino acid dipeptide frequency for Human cytomegalovirus (strain Merlin) (HHV-5) (Human herpesvirus 5)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and all underrepresented ones are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
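The calculation described above can be sketched in Python as follows. This is a minimal illustration, not the code used by the database: the function names `dipeptide_per_mille` and `classify` are hypothetical, overlapping pairs are counted within each protein, and frequencies are normalized by the total number of dipeptides, which is an assumption (the page does not state its exact normalization).

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes
ALPHABET = STANDARD + "X"          # X stands for Xaa (non-standard or unknown)

# Uniform expectation: 1 out of 400 dipeptides, i.e. 0.25 % or 2.5 per mille.
UNIFORM_EXPECTATION = 1000.0 / 400


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs.

    Assumption: pairs overlap (residues i and i+1) and never span two proteins.
    """
    counts = Counter()
    for seq in sequences:
        # Map every symbol outside the 20 standard residues to X.
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(value_per_mille, expected=UNIFORM_EXPECTATION):
    """Label a frequency relative to the uniform expectation."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "expected"
```

With these definitions, the AlaAla value from the table (11.071‰) is classified as "over" and the CysTrp value (0.269‰) as "under" relative to the 2.5‰ uniform expectation.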

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 11.071 ± 0.818
AlaCys: 1.882 ± 0.188
AlaAsp: 3.108 ± 0.227
AlaGlu: 3.795 ± 0.266
AlaPhe: 2.973 ± 0.192
AlaGly: 5.319 ± 0.362
AlaHis: 1.673 ± 0.171
AlaIle: 2.077 ± 0.196
AlaLys: 1.808 ± 0.16
AlaLeu: 8.531 ± 0.382
AlaMet: 1.539 ± 0.183
AlaAsn: 1.763 ± 0.163
AlaPro: 4.153 ± 0.311
AlaGln: 2.405 ± 0.212
AlaArg: 5.139 ± 0.326
AlaSer: 6.618 ± 0.39
AlaThr: 5.483 ± 0.364
AlaVal: 7.395 ± 0.386
AlaTrp: 1.135 ± 0.127
AlaTyr: 1.957 ± 0.19
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.808 ± 0.168
CysCys: 0.941 ± 0.142
CysAsp: 1.24 ± 0.157
CysGlu: 1.225 ± 0.167
CysPhe: 0.926 ± 0.112
CysGly: 1.494 ± 0.13
CysHis: 0.762 ± 0.099
CysIle: 0.956 ± 0.132
CysLys: 0.493 ± 0.09
CysLeu: 2.794 ± 0.223
CysMet: 0.553 ± 0.108
CysAsn: 0.881 ± 0.143
CysPro: 1.3 ± 0.125
CysGln: 0.926 ± 0.114
CysArg: 1.868 ± 0.194
CysSer: 1.494 ± 0.195
CysThr: 1.449 ± 0.161
CysVal: 2.136 ± 0.182
CysTrp: 0.269 ± 0.067
CysTyr: 1.091 ± 0.147
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.303 ± 0.285
AspCys: 0.896 ± 0.095
AspAsp: 3.989 ± 0.373
AspGlu: 4.273 ± 0.277
AspPhe: 1.643 ± 0.157
AspGly: 3.511 ± 0.277
AspHis: 1.374 ± 0.156
AspIle: 1.942 ± 0.176
AspLys: 1.24 ± 0.126
AspLeu: 5.289 ± 0.295
AspMet: 1.061 ± 0.121
AspAsn: 1.449 ± 0.156
AspPro: 2.644 ± 0.22
AspGln: 1.016 ± 0.106
AspArg: 2.868 ± 0.236
AspSer: 3.466 ± 0.205
AspThr: 2.824 ± 0.23
AspVal: 3.645 ± 0.261
AspTrp: 0.553 ± 0.116
AspTyr: 1.688 ± 0.16
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.557 ± 0.251
GluCys: 0.896 ± 0.126
GluAsp: 4.123 ± 0.275
GluGlu: 5.154 ± 0.438
GluPhe: 1.449 ± 0.157
GluGly: 2.585 ± 0.198
GluHis: 1.464 ± 0.169
GluIle: 1.688 ± 0.154
GluLys: 1.748 ± 0.157
GluLeu: 5.468 ± 0.247
GluMet: 1.001 ± 0.142
GluAsn: 2.465 ± 0.174
GluPro: 2.6 ± 0.187
GluGln: 2.226 ± 0.264
GluArg: 4.392 ± 0.318
GluSer: 3.421 ± 0.234
GluThr: 3.959 ± 0.231
GluVal: 2.928 ± 0.254
GluTrp: 0.553 ± 0.092
GluTyr: 1.285 ± 0.133
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.316 ± 0.179
PheCys: 1.345 ± 0.148
PheAsp: 1.688 ± 0.188
PheGlu: 1.584 ± 0.144
PhePhe: 2.047 ± 0.191
PheGly: 2.077 ± 0.156
PheHis: 1.135 ± 0.124
PheIle: 1.449 ± 0.182
PheLys: 1.031 ± 0.12
PheLeu: 4.168 ± 0.244
PheMet: 1.001 ± 0.125
PheAsn: 1.165 ± 0.135
PhePro: 1.853 ± 0.15
PheGln: 1.36 ± 0.168
PheArg: 2.57 ± 0.198
PheSer: 2.54 ± 0.26
PheThr: 2.54 ± 0.206
PheVal: 3.167 ± 0.249
PheTrp: 0.672 ± 0.099
PheTyr: 1.569 ± 0.149
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.005 ± 0.352
GlyCys: 1.225 ± 0.141
GlyAsp: 3.122 ± 0.22
GlyGlu: 3.481 ± 0.227
GlyPhe: 1.987 ± 0.172
GlyGly: 8.247 ± 1.051
GlyHis: 1.658 ± 0.198
GlyIle: 1.703 ± 0.163
GlyLys: 1.718 ± 0.166
GlyLeu: 6.096 ± 0.321
GlyMet: 0.822 ± 0.107
GlyAsn: 2.196 ± 0.162
GlyPro: 3.003 ± 0.256
GlyGln: 2.002 ± 0.135
GlyArg: 4.064 ± 0.296
GlySer: 4.87 ± 0.292
GlyThr: 3.944 ± 0.228
GlyVal: 4.363 ± 0.29
GlyTrp: 1.135 ± 0.12
GlyTyr: 1.688 ± 0.14
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.271 ± 0.223
HisCys: 0.627 ± 0.096
HisAsp: 1.793 ± 0.159
HisGlu: 1.554 ± 0.134
HisPhe: 0.971 ± 0.132
HisGly: 2.6 ± 0.205
HisHis: 1.942 ± 0.209
HisIle: 0.747 ± 0.08
HisLys: 0.837 ± 0.115
HisLeu: 3.063 ± 0.221
HisMet: 0.568 ± 0.092
HisAsn: 1.195 ± 0.118
HisPro: 1.987 ± 0.194
HisGln: 1.374 ± 0.2
HisArg: 2.898 ± 0.194
HisSer: 1.524 ± 0.189
HisThr: 2.047 ± 0.197
HisVal: 2.375 ± 0.228
HisTrp: 0.329 ± 0.082
HisTyr: 1.016 ± 0.119
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.062 ± 0.198
IleCys: 1.255 ± 0.14
IleAsp: 1.404 ± 0.121
IleGlu: 1.091 ± 0.114
IlePhe: 1.673 ± 0.195
IleGly: 1.688 ± 0.152
IleHis: 0.926 ± 0.132
IleIle: 1.897 ± 0.223
IleLys: 1.061 ± 0.131
IleLeu: 3.466 ± 0.281
IleMet: 1.076 ± 0.127
IleAsn: 1.135 ± 0.155
IlePro: 1.882 ± 0.15
IleGln: 1.33 ± 0.136
IleArg: 2.301 ± 0.22
IleSer: 2.659 ± 0.265
IleThr: 2.644 ± 0.226
IleVal: 2.734 ± 0.214
IleTrp: 0.433 ± 0.081
IleTyr: 1.628 ± 0.145
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.868 ± 0.167
LysCys: 0.672 ± 0.092
LysAsp: 1.18 ± 0.135
LysGlu: 1.464 ± 0.149
LysPhe: 0.837 ± 0.129
LysGly: 1.554 ± 0.144
LysHis: 1.18 ± 0.136
LysIle: 1.18 ± 0.148
LysLys: 2.107 ± 0.222
LysLeu: 2.6 ± 0.204
LysMet: 0.672 ± 0.097
LysAsn: 1.33 ± 0.167
LysPro: 1.688 ± 0.244
LysGln: 1.21 ± 0.115
LysArg: 2.749 ± 0.155
LysSer: 1.838 ± 0.183
LysThr: 1.912 ± 0.178
LysVal: 1.658 ± 0.179
LysTrp: 0.314 ± 0.058
LysTyr: 1.016 ± 0.125
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.231 ± 0.344
LeuCys: 3.586 ± 0.267
LeuAsp: 4.646 ± 0.297
LeuGlu: 4.736 ± 0.355
LeuPhe: 4.751 ± 0.283
LeuGly: 5.543 ± 0.37
LeuHis: 3.332 ± 0.248
LeuIle: 4.198 ± 0.297
LeuLys: 3.227 ± 0.247
LeuLeu: 12.445 ± 0.596
LeuMet: 2.435 ± 0.186
LeuAsn: 3.287 ± 0.245
LeuPro: 6.125 ± 0.361
LeuGln: 3.451 ± 0.306
LeuArg: 8.695 ± 0.464
LeuSer: 7.605 ± 0.337
LeuThr: 6.604 ± 0.429
LeuVal: 6.693 ± 0.313
LeuTrp: 1.539 ± 0.183
LeuTyr: 3.391 ± 0.251
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.404 ± 0.162
MetCys: 0.478 ± 0.089
MetAsp: 1.195 ± 0.128
MetGlu: 1.195 ± 0.134
MetPhe: 0.747 ± 0.108
MetGly: 1.091 ± 0.123
MetHis: 0.478 ± 0.085
MetIle: 0.881 ± 0.127
MetLys: 0.642 ± 0.114
MetLeu: 2.435 ± 0.212
MetMet: 0.613 ± 0.104
MetAsn: 0.747 ± 0.113
MetPro: 1.046 ± 0.103
MetGln: 0.553 ± 0.087
MetArg: 1.524 ± 0.152
MetSer: 1.554 ± 0.184
MetThr: 1.33 ± 0.123
MetVal: 1.404 ± 0.127
MetTrp: 0.493 ± 0.082
MetTyr: 0.747 ± 0.104
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.316 ± 0.177
AsnCys: 0.792 ± 0.138
AsnAsp: 1.628 ± 0.148
AsnGlu: 1.703 ± 0.173
AsnPhe: 1.24 ± 0.141
AsnGly: 2.181 ± 0.163
AsnHis: 1.15 ± 0.11
AsnIle: 1.315 ± 0.17
AsnLys: 1.255 ± 0.132
AsnLeu: 2.898 ± 0.257
AsnMet: 0.583 ± 0.087
AsnAsn: 1.763 ± 0.18
AsnPro: 1.509 ± 0.15
AsnGln: 1.15 ± 0.147
AsnArg: 1.882 ± 0.167
AsnSer: 2.241 ± 0.222
AsnThr: 2.405 ± 0.242
AsnVal: 3.242 ± 0.24
AsnTrp: 0.359 ± 0.075
AsnTyr: 1.046 ± 0.138
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.602 ± 0.34
ProCys: 1.195 ± 0.129
ProAsp: 2.525 ± 0.172
ProGlu: 3.257 ± 0.23
ProPhe: 1.599 ± 0.146
ProGly: 3.152 ± 0.255
ProHis: 2.002 ± 0.171
ProIle: 1.225 ± 0.127
ProLys: 1.688 ± 0.18
ProLeu: 5.408 ± 0.262
ProMet: 1.255 ± 0.134
ProAsn: 1.404 ± 0.144
ProPro: 7.246 ± 0.688
ProGln: 2.465 ± 0.221
ProArg: 4.661 ± 0.359
ProSer: 5.603 ± 0.459
ProThr: 3.66 ± 0.304
ProVal: 4.243 ± 0.254
ProTrp: 0.672 ± 0.115
ProTyr: 1.36 ± 0.12
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.196 ± 0.225
GlnCys: 0.777 ± 0.106
GlnAsp: 1.584 ± 0.206
GlnGlu: 1.942 ± 0.212
GlnPhe: 1.076 ± 0.115
GlnGly: 1.479 ± 0.161
GlnHis: 1.509 ± 0.165
GlnIle: 1.389 ± 0.164
GlnLys: 1.584 ± 0.153
GlnLeu: 3.765 ± 0.36
GlnMet: 0.837 ± 0.132
GlnAsn: 1.225 ± 0.152
GlnPro: 1.957 ± 0.192
GlnGln: 2.615 ± 0.375
GlnArg: 4.064 ± 0.306
GlnSer: 2.256 ± 0.18
GlnThr: 2.644 ± 0.208
GlnVal: 1.987 ± 0.183
GlnTrp: 0.538 ± 0.078
GlnTyr: 1.225 ± 0.137
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.169 ± 0.341
ArgCys: 1.912 ± 0.163
ArgAsp: 4.557 ± 0.269
ArgGlu: 4.094 ± 0.241
ArgPhe: 2.809 ± 0.227
ArgGly: 4.437 ± 0.271
ArgHis: 3.332 ± 0.227
ArgIle: 2.331 ± 0.182
ArgLys: 2.286 ± 0.183
ArgLeu: 8.74 ± 0.46
ArgMet: 1.106 ± 0.118
ArgAsn: 2.45 ± 0.207
ArgPro: 4.019 ± 0.383
ArgGln: 3.571 ± 0.257
ArgArg: 9.412 ± 0.595
ArgSer: 4.258 ± 0.273
ArgThr: 3.571 ± 0.206
ArgVal: 5.139 ± 0.281
ArgTrp: 1.374 ± 0.15
ArgTyr: 2.839 ± 0.218
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.484 ± 0.462
SerCys: 1.554 ± 0.171
SerAsp: 3.451 ± 0.218
SerGlu: 3.825 ± 0.226
SerPhe: 2.361 ± 0.177
SerGly: 5.812 ± 0.333
SerHis: 2.435 ± 0.18
SerIle: 2.375 ± 0.21
SerLys: 1.599 ± 0.13
SerLeu: 6.618 ± 0.33
SerMet: 1.3 ± 0.141
SerAsn: 2.062 ± 0.191
SerPro: 5.11 ± 0.381
SerGln: 2.764 ± 0.193
SerArg: 4.781 ± 0.277
SerSer: 10.159 ± 0.823
SerThr: 5.558 ± 0.44
SerVal: 6.155 ± 0.415
SerTrp: 1.001 ± 0.124
SerTyr: 2.316 ± 0.201
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.931 ± 0.35
ThrCys: 1.509 ± 0.157
ThrAsp: 2.585 ± 0.197
ThrGlu: 3.376 ± 0.289
ThrPhe: 2.704 ± 0.206
ThrGly: 3.122 ± 0.22
ThrHis: 2.077 ± 0.218
ThrIle: 2.002 ± 0.232
ThrLys: 1.479 ± 0.163
ThrLeu: 6.813 ± 0.397
ThrMet: 1.285 ± 0.114
ThrAsn: 1.912 ± 0.228
ThrPro: 4.736 ± 0.363
ThrGln: 2.107 ± 0.248
ThrArg: 4.213 ± 0.238
ThrSer: 5.931 ± 0.412
ThrThr: 6.887 ± 0.575
ThrVal: 6.215 ± 0.327
ThrTrp: 0.837 ± 0.115
ThrTyr: 2.121 ± 0.182
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.096 ± 0.312
ValCys: 1.912 ± 0.184
ValAsp: 3.391 ± 0.224
ValGlu: 3.84 ± 0.247
ValPhe: 3.511 ± 0.299
ValGly: 3.72 ± 0.284
ValHis: 1.703 ± 0.164
ValIle: 3.212 ± 0.271
ValLys: 1.897 ± 0.151
ValLeu: 7.575 ± 0.369
ValMet: 1.748 ± 0.173
ValAsn: 2.51 ± 0.223
ValPro: 4.333 ± 0.22
ValGln: 2.435 ± 0.264
ValArg: 5.334 ± 0.342
ValSer: 6.574 ± 0.351
ValThr: 5.677 ± 0.314
ValVal: 5.483 ± 0.293
ValTrp: 1.285 ± 0.142
ValTyr: 2.734 ± 0.188
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.896 ± 0.119
TrpCys: 0.493 ± 0.101
TrpAsp: 0.583 ± 0.092
TrpGlu: 0.732 ± 0.118
TrpPhe: 0.583 ± 0.084
TrpGly: 0.538 ± 0.111
TrpHis: 0.388 ± 0.076
TrpIle: 0.672 ± 0.108
TrpLys: 0.613 ± 0.103
TrpLeu: 2.017 ± 0.193
TrpMet: 0.493 ± 0.09
TrpAsn: 0.374 ± 0.067
TrpPro: 0.657 ± 0.106
TrpGln: 0.627 ± 0.093
TrpArg: 1.135 ± 0.139
TrpSer: 0.896 ± 0.115
TrpThr: 0.881 ± 0.126
TrpVal: 0.837 ± 0.107
TrpTrp: 0.329 ± 0.079
TrpTyr: 0.553 ± 0.096
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.241 ± 0.165
TyrCys: 0.657 ± 0.094
TyrAsp: 1.882 ± 0.172
TyrGlu: 1.643 ± 0.168
TyrPhe: 1.389 ± 0.155
TyrGly: 2.181 ± 0.202
TyrHis: 1.195 ± 0.148
TyrIle: 1.106 ± 0.132
TyrLys: 0.717 ± 0.089
TyrLeu: 3.332 ± 0.293
TyrMet: 0.598 ± 0.088
TyrAsn: 1.345 ± 0.156
TyrPro: 1.374 ± 0.17
TyrGln: 1.061 ± 0.112
TyrArg: 2.854 ± 0.18
TyrSer: 2.211 ± 0.2
TyrThr: 1.823 ± 0.196
TyrVal: 3.182 ± 0.257
TyrTrp: 0.478 ± 0.095
TyrTyr: 1.121 ± 0.157
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 180 proteins (66935 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
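A protein-level bootstrap of this kind could look roughly like the sketch below. It is only an illustration under stated assumptions: the function name `bootstrap_error` is hypothetical, whole proteins are resampled with replacement 100 times, and the reported error is taken to be the standard deviation of the resampled frequencies (the page does not say which spread measure it uses).

```python
import random
import statistics
from collections import Counter


def bootstrap_error(sequences, dipeptide, n_resamples=100, seed=0):
    """Estimate the spread (in per mille) of one dipeptide frequency.

    Assumptions: proteins are the resampling unit, and the error is the
    sample standard deviation across the bootstrap replicates.
    """
    rng = random.Random(seed)
    # Count overlapping residue pairs once per protein.
    per_protein = [
        Counter(seq[i:i + 2] for i in range(len(seq) - 1)) for seq in sequences
    ]
    totals = [sum(c.values()) for c in per_protein]
    indices = range(len(per_protein))

    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(indices, k=len(per_protein))
        total = sum(totals[i] for i in sample)
        hits = sum(per_protein[i][dipeptide] for i in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)
```

Resampling whole proteins rather than individual dipeptides keeps the residue pairs of one protein together, so the estimate reflects variation between proteins in the proteome.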

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski