Amino acid dipepetide frequency for Human cytomegalovirus (strain AD169) (HHV-5) (Human herpesvirus 5)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.036AlaAla: 11.036 ± 0.639
2.042AlaCys: 2.042 ± 0.212
3.056AlaAsp: 3.056 ± 0.218
3.751AlaGlu: 3.751 ± 0.236
2.926AlaPhe: 2.926 ± 0.216
5.446AlaGly: 5.446 ± 0.418
1.651AlaHis: 1.651 ± 0.15
1.999AlaIle: 1.999 ± 0.176
1.825AlaLys: 1.825 ± 0.177
7.98AlaLeu: 7.98 ± 0.374
1.347AlaMet: 1.347 ± 0.158
1.839AlaAsn: 1.839 ± 0.169
4.62AlaPro: 4.62 ± 0.298
2.52AlaGln: 2.52 ± 0.2
5.648AlaArg: 5.648 ± 0.344
6.59AlaSer: 6.59 ± 0.375
5.359AlaThr: 5.359 ± 0.248
6.952AlaVal: 6.952 ± 0.385
1.072AlaTrp: 1.072 ± 0.136
1.868AlaTyr: 1.868 ± 0.193
0.0AlaXaa: 0.0 ± 0.0
Cys
1.912CysAla: 1.912 ± 0.23
0.869CysCys: 0.869 ± 0.123
1.231CysAsp: 1.231 ± 0.148
1.115CysGlu: 1.115 ± 0.145
1.115CysPhe: 1.115 ± 0.141
1.535CysGly: 1.535 ± 0.147
0.71CysHis: 0.71 ± 0.089
0.898CysIle: 0.898 ± 0.113
0.478CysLys: 0.478 ± 0.091
2.665CysLeu: 2.665 ± 0.192
0.594CysMet: 0.594 ± 0.085
0.797CysAsn: 0.797 ± 0.132
1.376CysPro: 1.376 ± 0.141
0.883CysGln: 0.883 ± 0.119
2.144CysArg: 2.144 ± 0.22
1.521CysSer: 1.521 ± 0.158
1.246CysThr: 1.246 ± 0.139
2.433CysVal: 2.433 ± 0.245
0.275CysTrp: 0.275 ± 0.061
1.043CysTyr: 1.043 ± 0.134
0.0CysXaa: 0.0 ± 0.0
Asp
4.142AspAla: 4.142 ± 0.246
0.782AspCys: 0.782 ± 0.101
3.809AspAsp: 3.809 ± 0.354
3.939AspGlu: 3.939 ± 0.264
1.651AspPhe: 1.651 ± 0.171
3.606AspGly: 3.606 ± 0.227
1.332AspHis: 1.332 ± 0.137
1.767AspIle: 1.767 ± 0.162
1.202AspLys: 1.202 ± 0.128
5.084AspLeu: 5.084 ± 0.301
0.941AspMet: 0.941 ± 0.129
1.419AspAsn: 1.419 ± 0.132
2.708AspPro: 2.708 ± 0.225
0.985AspGln: 0.985 ± 0.119
2.882AspArg: 2.882 ± 0.194
3.186AspSer: 3.186 ± 0.216
2.839AspThr: 2.839 ± 0.219
3.49AspVal: 3.49 ± 0.246
0.608AspTrp: 0.608 ± 0.115
1.651AspTyr: 1.651 ± 0.167
0.0AspXaa: 0.0 ± 0.0
Glu
4.533GluAla: 4.533 ± 0.255
0.883GluCys: 0.883 ± 0.108
3.896GluAsp: 3.896 ± 0.251
5.011GluGlu: 5.011 ± 0.477
1.434GluPhe: 1.434 ± 0.15
2.607GluGly: 2.607 ± 0.192
1.448GluHis: 1.448 ± 0.138
1.666GluIle: 1.666 ± 0.162
1.97GluLys: 1.97 ± 0.199
5.228GluLeu: 5.228 ± 0.255
0.855GluMet: 0.855 ± 0.117
2.303GluAsn: 2.303 ± 0.182
2.404GluPro: 2.404 ± 0.162
2.1GluGln: 2.1 ± 0.272
4.417GluArg: 4.417 ± 0.308
3.36GluSer: 3.36 ± 0.23
3.838GluThr: 3.838 ± 0.292
2.81GluVal: 2.81 ± 0.228
0.478GluTrp: 0.478 ± 0.096
1.188GluTyr: 1.188 ± 0.122
0.0GluXaa: 0.0 ± 0.0
Phe
2.303PheAla: 2.303 ± 0.174
1.332PheCys: 1.332 ± 0.141
1.637PheAsp: 1.637 ± 0.187
1.535PheGlu: 1.535 ± 0.152
2.346PhePhe: 2.346 ± 0.195
2.172PheGly: 2.172 ± 0.174
1.188PheHis: 1.188 ± 0.153
1.622PheIle: 1.622 ± 0.153
0.97PheLys: 0.97 ± 0.111
4.577PheLeu: 4.577 ± 0.251
1.057PheMet: 1.057 ± 0.146
1.101PheAsn: 1.101 ± 0.115
1.97PhePro: 1.97 ± 0.17
1.419PheGln: 1.419 ± 0.155
2.621PheArg: 2.621 ± 0.201
2.766PheSer: 2.766 ± 0.234
2.39PheThr: 2.39 ± 0.198
3.186PheVal: 3.186 ± 0.22
0.623PheTrp: 0.623 ± 0.093
1.521PheTyr: 1.521 ± 0.132
0.0PheXaa: 0.0 ± 0.0
Gly
5.098GlyAla: 5.098 ± 0.337
1.405GlyCys: 1.405 ± 0.164
3.157GlyAsp: 3.157 ± 0.231
3.462GlyGlu: 3.462 ± 0.263
2.158GlyPhe: 2.158 ± 0.153
8.4GlyGly: 8.4 ± 0.975
1.724GlyHis: 1.724 ± 0.204
1.825GlyIle: 1.825 ± 0.168
1.767GlyLys: 1.767 ± 0.144
5.909GlyLeu: 5.909 ± 0.338
0.797GlyMet: 0.797 ± 0.109
2.274GlyAsn: 2.274 ± 0.17
3.201GlyPro: 3.201 ± 0.273
1.984GlyGln: 1.984 ± 0.119
4.461GlyArg: 4.461 ± 0.323
4.881GlySer: 4.881 ± 0.302
3.968GlyThr: 3.968 ± 0.245
4.475GlyVal: 4.475 ± 0.309
1.057GlyTrp: 1.057 ± 0.111
1.593GlyTyr: 1.593 ± 0.141
0.0GlyXaa: 0.0 ± 0.0
His
2.317HisAla: 2.317 ± 0.165
0.666HisCys: 0.666 ± 0.097
1.883HisAsp: 1.883 ± 0.178
1.622HisGlu: 1.622 ± 0.13
0.999HisPhe: 0.999 ± 0.145
2.535HisGly: 2.535 ± 0.214
1.984HisHis: 1.984 ± 0.236
0.855HisIle: 0.855 ± 0.124
0.84HisLys: 0.84 ± 0.108
3.172HisLeu: 3.172 ± 0.196
0.594HisMet: 0.594 ± 0.08
1.101HisAsn: 1.101 ± 0.123
2.172HisPro: 2.172 ± 0.248
1.26HisGln: 1.26 ± 0.184
3.259HisArg: 3.259 ± 0.219
1.622HisSer: 1.622 ± 0.141
2.144HisThr: 2.144 ± 0.23
2.462HisVal: 2.462 ± 0.215
0.275HisTrp: 0.275 ± 0.063
0.927HisTyr: 0.927 ± 0.104
0.0HisXaa: 0.0 ± 0.0
Ile
2.129IleAla: 2.129 ± 0.191
1.159IleCys: 1.159 ± 0.132
1.405IleAsp: 1.405 ± 0.152
1.115IleGlu: 1.115 ± 0.117
1.651IlePhe: 1.651 ± 0.169
1.767IleGly: 1.767 ± 0.15
1.028IleHis: 1.028 ± 0.13
2.042IleIle: 2.042 ± 0.236
1.072IleLys: 1.072 ± 0.139
3.244IleLeu: 3.244 ± 0.234
0.927IleMet: 0.927 ± 0.116
1.159IleAsn: 1.159 ± 0.137
1.912IlePro: 1.912 ± 0.169
1.405IleGln: 1.405 ± 0.165
2.346IleArg: 2.346 ± 0.196
2.81IleSer: 2.81 ± 0.289
2.593IleThr: 2.593 ± 0.22
2.593IleVal: 2.593 ± 0.197
0.348IleTrp: 0.348 ± 0.071
1.651IleTyr: 1.651 ± 0.157
0.0IleXaa: 0.0 ± 0.0
Lys
1.97LysAla: 1.97 ± 0.19
0.666LysCys: 0.666 ± 0.09
1.246LysAsp: 1.246 ± 0.135
1.405LysGlu: 1.405 ± 0.158
0.855LysPhe: 0.855 ± 0.099
1.492LysGly: 1.492 ± 0.162
1.101LysHis: 1.101 ± 0.142
1.202LysIle: 1.202 ± 0.147
2.404LysLys: 2.404 ± 0.229
2.535LysLeu: 2.535 ± 0.217
0.71LysMet: 0.71 ± 0.116
1.492LysAsn: 1.492 ± 0.158
1.608LysPro: 1.608 ± 0.203
1.275LysGln: 1.275 ± 0.146
2.868LysArg: 2.868 ± 0.225
1.912LysSer: 1.912 ± 0.174
2.115LysThr: 2.115 ± 0.208
1.622LysVal: 1.622 ± 0.194
0.348LysTrp: 0.348 ± 0.063
1.101LysTyr: 1.101 ± 0.132
0.0LysXaa: 0.0 ± 0.0
Leu
6.966LeuAla: 6.966 ± 0.345
3.505LeuCys: 3.505 ± 0.292
4.446LeuAsp: 4.446 ± 0.325
4.403LeuGlu: 4.403 ± 0.329
4.765LeuPhe: 4.765 ± 0.318
5.518LeuGly: 5.518 ± 0.367
3.418LeuHis: 3.418 ± 0.195
3.983LeuIle: 3.983 ± 0.261
3.186LeuLys: 3.186 ± 0.231
11.92LeuLeu: 11.92 ± 0.582
2.216LeuMet: 2.216 ± 0.175
3.099LeuAsn: 3.099 ± 0.21
6.083LeuPro: 6.083 ± 0.317
3.36LeuGln: 3.36 ± 0.306
8.835LeuArg: 8.835 ± 0.443
7.502LeuSer: 7.502 ± 0.403
6.286LeuThr: 6.286 ± 0.392
6.358LeuVal: 6.358 ± 0.312
1.477LeuTrp: 1.477 ± 0.188
3.317LeuTyr: 3.317 ± 0.24
0.0LeuXaa: 0.0 ± 0.0
Met
1.318MetAla: 1.318 ± 0.138
0.521MetCys: 0.521 ± 0.079
1.057MetAsp: 1.057 ± 0.115
1.159MetGlu: 1.159 ± 0.136
0.768MetPhe: 0.768 ± 0.107
1.115MetGly: 1.115 ± 0.12
0.507MetHis: 0.507 ± 0.076
0.883MetIle: 0.883 ± 0.107
0.608MetLys: 0.608 ± 0.106
2.245MetLeu: 2.245 ± 0.205
0.71MetMet: 0.71 ± 0.118
0.666MetAsn: 0.666 ± 0.107
1.028MetPro: 1.028 ± 0.115
0.55MetGln: 0.55 ± 0.098
1.506MetArg: 1.506 ± 0.141
1.492MetSer: 1.492 ± 0.156
1.347MetThr: 1.347 ± 0.149
1.303MetVal: 1.303 ± 0.126
0.521MetTrp: 0.521 ± 0.088
0.724MetTyr: 0.724 ± 0.108
0.0MetXaa: 0.0 ± 0.0
Asn
2.245AsnAla: 2.245 ± 0.197
0.724AsnCys: 0.724 ± 0.107
1.593AsnAsp: 1.593 ± 0.155
1.608AsnGlu: 1.608 ± 0.142
1.217AsnPhe: 1.217 ± 0.125
2.028AsnGly: 2.028 ± 0.154
1.217AsnHis: 1.217 ± 0.14
1.13AsnIle: 1.13 ± 0.153
1.26AsnLys: 1.26 ± 0.159
2.824AsnLeu: 2.824 ± 0.194
0.623AsnMet: 0.623 ± 0.093
1.68AsnAsn: 1.68 ± 0.157
1.593AsnPro: 1.593 ± 0.155
1.231AsnGln: 1.231 ± 0.157
1.941AsnArg: 1.941 ± 0.164
2.172AsnSer: 2.172 ± 0.17
2.65AsnThr: 2.65 ± 0.263
3.143AsnVal: 3.143 ± 0.243
0.304AsnTrp: 0.304 ± 0.066
0.985AsnTyr: 0.985 ± 0.132
0.0AsnXaa: 0.0 ± 0.0
Pro
4.895ProAla: 4.895 ± 0.352
1.217ProCys: 1.217 ± 0.135
2.332ProAsp: 2.332 ± 0.181
3.317ProGlu: 3.317 ± 0.233
1.608ProPhe: 1.608 ± 0.152
3.577ProGly: 3.577 ± 0.35
2.303ProHis: 2.303 ± 0.219
1.434ProIle: 1.434 ± 0.128
1.608ProLys: 1.608 ± 0.153
5.591ProLeu: 5.591 ± 0.268
1.144ProMet: 1.144 ± 0.121
1.39ProAsn: 1.39 ± 0.137
7.633ProPro: 7.633 ± 0.618
2.39ProGln: 2.39 ± 0.205
5.127ProArg: 5.127 ± 0.321
6.213ProSer: 6.213 ± 0.395
3.462ProThr: 3.462 ± 0.235
4.388ProVal: 4.388 ± 0.248
0.637ProTrp: 0.637 ± 0.092
1.492ProTyr: 1.492 ± 0.153
0.0ProXaa: 0.0 ± 0.0
Gln
2.115GlnAla: 2.115 ± 0.178
0.681GlnCys: 0.681 ± 0.098
1.55GlnAsp: 1.55 ± 0.196
1.868GlnGlu: 1.868 ± 0.228
1.101GlnPhe: 1.101 ± 0.131
1.521GlnGly: 1.521 ± 0.153
1.695GlnHis: 1.695 ± 0.16
1.376GlnIle: 1.376 ± 0.156
1.593GlnLys: 1.593 ± 0.186
3.664GlnLeu: 3.664 ± 0.32
0.84GlnMet: 0.84 ± 0.135
1.289GlnAsn: 1.289 ± 0.163
1.912GlnPro: 1.912 ± 0.173
2.404GlnGln: 2.404 ± 0.373
3.997GlnArg: 3.997 ± 0.31
2.158GlnSer: 2.158 ± 0.157
2.65GlnThr: 2.65 ± 0.192
1.81GlnVal: 1.81 ± 0.168
0.492GlnTrp: 0.492 ± 0.081
1.202GlnTyr: 1.202 ± 0.146
0.0GlnXaa: 0.0 ± 0.0
Arg
5.547ArgAla: 5.547 ± 0.302
2.028ArgCys: 2.028 ± 0.213
4.62ArgAsp: 4.62 ± 0.306
4.229ArgGlu: 4.229 ± 0.265
2.781ArgPhe: 2.781 ± 0.219
5.069ArgGly: 5.069 ± 0.393
3.635ArgHis: 3.635 ± 0.253
2.578ArgIle: 2.578 ± 0.189
2.361ArgLys: 2.361 ± 0.207
8.632ArgLeu: 8.632 ± 0.423
1.202ArgMet: 1.202 ± 0.12
2.477ArgAsn: 2.477 ± 0.217
4.446ArgPro: 4.446 ± 0.313
3.592ArgGln: 3.592 ± 0.226
10.167ArgArg: 10.167 ± 0.553
4.649ArgSer: 4.649 ± 0.292
3.954ArgThr: 3.954 ± 0.287
5.243ArgVal: 5.243 ± 0.29
1.463ArgTrp: 1.463 ± 0.185
2.824ArgTyr: 2.824 ± 0.207
0.0ArgXaa: 0.0 ± 0.0
Ser
6.286SerAla: 6.286 ± 0.432
1.724SerCys: 1.724 ± 0.167
3.128SerAsp: 3.128 ± 0.231
3.49SerGlu: 3.49 ± 0.209
2.578SerPhe: 2.578 ± 0.209
5.605SerGly: 5.605 ± 0.345
2.332SerHis: 2.332 ± 0.176
2.317SerIle: 2.317 ± 0.193
1.854SerLys: 1.854 ± 0.216
6.532SerLeu: 6.532 ± 0.308
1.332SerMet: 1.332 ± 0.127
2.042SerAsn: 2.042 ± 0.183
5.735SerPro: 5.735 ± 0.403
2.694SerGln: 2.694 ± 0.198
5.533SerArg: 5.533 ± 0.306
9.675SerSer: 9.675 ± 0.77
5.417SerThr: 5.417 ± 0.337
5.996SerVal: 5.996 ± 0.387
0.941SerTrp: 0.941 ± 0.117
2.332SerTyr: 2.332 ± 0.222
0.0SerXaa: 0.0 ± 0.0
Thr
5.808ThrAla: 5.808 ± 0.318
1.477ThrCys: 1.477 ± 0.152
2.462ThrAsp: 2.462 ± 0.21
3.462ThrGlu: 3.462 ± 0.259
2.737ThrPhe: 2.737 ± 0.219
3.244ThrGly: 3.244 ± 0.19
2.086ThrHis: 2.086 ± 0.217
2.144ThrIle: 2.144 ± 0.22
1.738ThrLys: 1.738 ± 0.184
6.373ThrLeu: 6.373 ± 0.437
1.231ThrMet: 1.231 ± 0.148
1.796ThrAsn: 1.796 ± 0.269
4.635ThrPro: 4.635 ± 0.311
2.158ThrGln: 2.158 ± 0.235
4.417ThrArg: 4.417 ± 0.239
5.706ThrSer: 5.706 ± 0.375
6.749ThrThr: 6.749 ± 0.609
6.17ThrVal: 6.17 ± 0.362
0.797ThrTrp: 0.797 ± 0.133
2.158ThrTyr: 2.158 ± 0.181
0.0ThrXaa: 0.0 ± 0.0
Val
5.924ValAla: 5.924 ± 0.279
2.013ValCys: 2.013 ± 0.208
3.172ValAsp: 3.172 ± 0.195
3.65ValGlu: 3.65 ± 0.2
3.679ValPhe: 3.679 ± 0.22
3.78ValGly: 3.78 ± 0.302
1.738ValHis: 1.738 ± 0.173
2.984ValIle: 2.984 ± 0.22
1.825ValLys: 1.825 ± 0.158
7.271ValLeu: 7.271 ± 0.383
1.781ValMet: 1.781 ± 0.189
2.491ValAsn: 2.491 ± 0.245
4.678ValPro: 4.678 ± 0.293
2.129ValGln: 2.129 ± 0.215
5.33ValArg: 5.33 ± 0.339
6.358ValSer: 6.358 ± 0.325
5.533ValThr: 5.533 ± 0.334
5.373ValVal: 5.373 ± 0.308
1.173ValTrp: 1.173 ± 0.166
2.766ValTyr: 2.766 ± 0.171
0.0ValXaa: 0.0 ± 0.0
Trp
0.768TrpAla: 0.768 ± 0.123
0.492TrpCys: 0.492 ± 0.095
0.55TrpAsp: 0.55 ± 0.09
0.695TrpGlu: 0.695 ± 0.093
0.594TrpPhe: 0.594 ± 0.097
0.594TrpGly: 0.594 ± 0.11
0.406TrpHis: 0.406 ± 0.074
0.55TrpIle: 0.55 ± 0.101
0.55TrpLys: 0.55 ± 0.097
1.941TrpLeu: 1.941 ± 0.197
0.521TrpMet: 0.521 ± 0.082
0.304TrpAsn: 0.304 ± 0.07
0.608TrpPro: 0.608 ± 0.077
0.536TrpGln: 0.536 ± 0.091
1.231TrpArg: 1.231 ± 0.165
0.855TrpSer: 0.855 ± 0.109
0.855TrpThr: 0.855 ± 0.123
0.797TrpVal: 0.797 ± 0.098
0.348TrpTrp: 0.348 ± 0.088
0.521TrpTyr: 0.521 ± 0.099
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.216TyrAla: 2.216 ± 0.173
0.652TyrCys: 0.652 ± 0.082
1.796TyrAsp: 1.796 ± 0.161
1.593TyrGlu: 1.593 ± 0.167
1.376TyrPhe: 1.376 ± 0.141
1.984TyrGly: 1.984 ± 0.199
1.188TyrHis: 1.188 ± 0.129
1.101TyrIle: 1.101 ± 0.143
0.811TyrLys: 0.811 ± 0.112
3.259TyrLeu: 3.259 ± 0.224
0.594TyrMet: 0.594 ± 0.093
1.448TyrAsn: 1.448 ± 0.175
1.434TyrPro: 1.434 ± 0.151
1.043TyrGln: 1.043 ± 0.123
2.766TyrArg: 2.766 ± 0.177
2.129TyrSer: 2.129 ± 0.204
1.955TyrThr: 1.955 ± 0.201
3.041TyrVal: 3.041 ± 0.192
0.478TyrTrp: 0.478 ± 0.085
1.144TyrTyr: 1.144 ± 0.17
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 202 proteins (69046 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski