Amino acid dipeptide frequency for Macacine betaherpesvirus 3 (Rhesus cytomegalovirus)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
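The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used by the database: it assumes that dipeptides are counted as overlapping pairs within each protein, that any non-standard residue is mapped to `X` (Xaa), and that the expectation is the uniform 2.5‰ mentioned above. The function names `dipeptide_per_mille` and `classify` are hypothetical.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"      # the 20 standard amino acids
UNIFORM_EXPECTATION = 1000.0 / 400     # 2.5 per mille (assumes 400 equally likely dipeptides)


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping pairs.

    Assumption: pairs never span two proteins, and residues outside
    the 20 standard letters are collapsed to 'X' (Xaa).
    """
    counts = Counter()
    for seq in sequences:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq, expected=UNIFORM_EXPECTATION):
    """Label a per-mille frequency relative to the uniform expectation."""
    if freq > expected:
        return "over"
    if freq < expected:
        return "under"
    return "expected"


# Toy input: two short, made-up sequences (not from this proteome).
freqs = dipeptide_per_mille(["MALLA", "LLK"])
```

With the values from the table, `classify(6.342)` (AlaAla) gives `"over"` and `classify(0.316)` (TrpTrp) gives `"under"`, which corresponds to the red and blue marking described above.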

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
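As a small worked example of the unit conversion, the sketch below turns two table values into plain fractions and compares them with the uniform expectation of 2.5‰. The helper names are hypothetical and only illustrate the arithmetic.

```python
UNIFORM_EXPECTATION = 2.5  # per mille, i.e. 1/400 (uniform assumption)


def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def fold_vs_uniform(value, expected=UNIFORM_EXPECTATION):
    """Ratio of an observed per-mille frequency to the uniform expectation."""
    return value / expected


# Values taken from the table: LeuLeu = 10.328, TrpTrp = 0.316 (both in per mille).
for name, value in (("LeuLeu", 10.328), ("TrpTrp", 0.316)):
    print(f"{name}: fraction {per_mille_to_fraction(value):.6f}, "
          f"{fold_vs_uniform(value):.2f}x the uniform expectation")
```

Under these assumptions LeuLeu occurs roughly four times more often than a uniform model predicts, while TrpTrp occurs at roughly one eighth of the uniform rate.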

For more information, see the sequence space article on Wikipedia.

Rows: first residue of the dipeptide; columns: second residue. Each cell gives the frequency ± error in ‰.

    | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 6.342 ± 0.553 | 1.787 ± 0.177 | 2.847 ± 0.212 | 3.495 ± 0.221 | 2.704 ± 0.227 | 2.958 ± 0.246 | 1.74 ± 0.203 | 3.353 ± 0.231 | 2.167 ± 0.211 | 6.864 ± 0.335 | 1.645 ± 0.172 | 2.531 ± 0.193 | 3.511 ± 0.326 | 2.42 ± 0.24 | 3.369 ± 0.28 | 5.678 ± 0.337 | 4.84 ± 0.342 | 4.935 ± 0.305 | 1.012 ± 0.133 | 1.676 ± 0.155 | 0.0 ± 0.0
Cys | 1.724 ± 0.163 | 0.949 ± 0.13 | 1.344 ± 0.155 | 1.139 ± 0.135 | 1.139 ± 0.136 | 1.613 ± 0.184 | 0.807 ± 0.115 | 1.329 ± 0.156 | 0.775 ± 0.116 | 2.799 ± 0.211 | 0.838 ± 0.106 | 1.313 ± 0.178 | 1.281 ± 0.165 | 1.044 ± 0.12 | 1.629 ± 0.187 | 1.661 ± 0.188 | 1.708 ± 0.149 | 1.756 ± 0.188 | 0.427 ± 0.077 | 1.218 ± 0.163 | 0.0 ± 0.0
Asp | 2.973 ± 0.22 | 0.696 ± 0.116 | 3.574 ± 0.373 | 3.938 ± 0.263 | 2.04 ± 0.216 | 2.103 ± 0.241 | 1.091 ± 0.122 | 2.847 ± 0.256 | 1.408 ± 0.179 | 4.982 ± 0.313 | 1.376 ± 0.128 | 1.819 ± 0.176 | 2.515 ± 0.213 | 1.36 ± 0.153 | 2.91 ± 0.222 | 3.464 ± 0.246 | 2.958 ± 0.191 | 3.037 ± 0.219 | 0.633 ± 0.108 | 1.882 ± 0.151 | 0.0 ± 0.0
Glu | 3.764 ± 0.255 | 1.297 ± 0.147 | 3.147 ± 0.225 | 4.049 ± 0.32 | 1.534 ± 0.154 | 2.088 ± 0.236 | 1.914 ± 0.191 | 2.436 ± 0.191 | 2.277 ± 0.239 | 4.95 ± 0.341 | 1.36 ± 0.118 | 2.388 ± 0.209 | 2.515 ± 0.196 | 1.835 ± 0.203 | 3.4 ± 0.28 | 3.511 ± 0.238 | 3.938 ± 0.296 | 3.305 ± 0.275 | 0.554 ± 0.085 | 1.566 ± 0.142 | 0.0 ± 0.0
Phe | 2.183 ± 0.179 | 1.329 ± 0.189 | 1.819 ± 0.165 | 2.04 ± 0.196 | 1.93 ± 0.173 | 2.103 ± 0.192 | 1.091 ± 0.147 | 2.42 ± 0.198 | 1.313 ± 0.148 | 4.191 ± 0.212 | 1.218 ± 0.152 | 1.819 ± 0.165 | 2.009 ± 0.166 | 1.55 ± 0.157 | 2.499 ± 0.247 | 3.068 ± 0.269 | 3.037 ± 0.225 | 3.132 ± 0.238 | 0.791 ± 0.138 | 1.882 ± 0.192 | 0.0 ± 0.0
Gly | 2.578 ± 0.209 | 1.091 ± 0.116 | 2.325 ± 0.203 | 2.309 ± 0.217 | 1.85 ± 0.195 | 3.021 ± 0.296 | 1.012 ± 0.123 | 2.562 ± 0.198 | 1.676 ± 0.176 | 4.618 ± 0.248 | 0.854 ± 0.127 | 2.214 ± 0.192 | 1.977 ± 0.209 | 1.961 ± 0.179 | 2.863 ± 0.23 | 3.258 ± 0.206 | 3.4 ± 0.233 | 3.511 ± 0.265 | 0.68 ± 0.105 | 1.756 ± 0.149 | 0.0 ± 0.0
His | 2.167 ± 0.193 | 0.633 ± 0.105 | 1.629 ± 0.145 | 1.408 ± 0.148 | 1.091 ± 0.143 | 1.55 ± 0.135 | 1.55 ± 0.26 | 1.423 ± 0.167 | 0.996 ± 0.114 | 2.799 ± 0.248 | 0.807 ± 0.108 | 1.487 ± 0.138 | 1.597 ± 0.193 | 1.249 ± 0.157 | 2.325 ± 0.17 | 2.04 ± 0.223 | 2.23 ± 0.191 | 2.357 ± 0.202 | 0.316 ± 0.069 | 0.838 ± 0.114 | 0.0 ± 0.0
Ile | 2.831 ± 0.244 | 1.708 ± 0.187 | 1.993 ± 0.159 | 2.009 ± 0.161 | 2.277 ± 0.184 | 2.103 ± 0.187 | 1.344 ± 0.126 | 3.4 ± 0.236 | 2.056 ± 0.188 | 4.887 ± 0.316 | 1.487 ± 0.157 | 2.388 ± 0.19 | 3.274 ± 0.274 | 2.309 ± 0.233 | 2.704 ± 0.232 | 3.685 ± 0.295 | 3.938 ± 0.302 | 4.239 ± 0.28 | 0.933 ± 0.127 | 2.531 ± 0.209 | 0.0 ± 0.0
Lys | 2.546 ± 0.205 | 0.949 ± 0.103 | 1.771 ± 0.178 | 1.85 ± 0.217 | 1.313 ± 0.175 | 1.518 ± 0.178 | 1.534 ± 0.141 | 2.088 ± 0.192 | 2.404 ± 0.223 | 3.559 ± 0.261 | 1.123 ± 0.12 | 2.119 ± 0.2 | 2.277 ± 0.303 | 1.724 ± 0.167 | 2.989 ± 0.189 | 2.451 ± 0.197 | 2.768 ± 0.243 | 1.787 ± 0.206 | 0.395 ± 0.078 | 1.439 ± 0.138 | 0.0 ± 0.0
Leu | 6.231 ± 0.263 | 3.274 ± 0.237 | 4.001 ± 0.308 | 4.144 ± 0.28 | 4.808 ± 0.284 | 4.523 ± 0.288 | 3.021 ± 0.264 | 5.346 ± 0.313 | 4.286 ± 0.29 | 10.328 ± 0.523 | 3.258 ± 0.242 | 3.764 ± 0.281 | 5.188 ± 0.307 | 3.875 ± 0.328 | 6.643 ± 0.401 | 7.781 ± 0.459 | 7.133 ± 0.347 | 5.915 ± 0.377 | 1.471 ± 0.157 | 3.717 ± 0.339 | 0.0 ± 0.0
Met | 2.024 ± 0.209 | 0.696 ± 0.113 | 1.091 ± 0.141 | 1.518 ± 0.162 | 1.265 ± 0.144 | 0.949 ± 0.126 | 0.712 ± 0.108 | 1.392 ± 0.155 | 0.996 ± 0.123 | 2.847 ± 0.254 | 0.87 ± 0.129 | 1.107 ± 0.141 | 1.202 ± 0.116 | 0.996 ± 0.152 | 1.502 ± 0.145 | 2.119 ± 0.205 | 1.582 ± 0.154 | 1.566 ± 0.156 | 0.395 ± 0.084 | 1.202 ± 0.118 | 0.0 ± 0.0
Asn | 3.116 ± 0.209 | 0.917 ± 0.139 | 1.993 ± 0.169 | 2.198 ± 0.161 | 1.329 ± 0.155 | 1.945 ± 0.175 | 1.234 ± 0.124 | 2.357 ± 0.207 | 1.708 ± 0.149 | 4.16 ± 0.308 | 1.218 ± 0.129 | 2.293 ± 0.221 | 2.072 ± 0.175 | 1.645 ± 0.202 | 2.42 ± 0.238 | 3.479 ± 0.287 | 3.638 ± 0.385 | 3.606 ± 0.252 | 0.38 ± 0.076 | 1.613 ± 0.158 | 0.0 ± 0.0
Pro | 3.954 ± 0.246 | 1.36 ± 0.16 | 2.831 ± 0.235 | 2.878 ± 0.235 | 2.056 ± 0.183 | 2.246 ± 0.181 | 1.708 ± 0.196 | 2.388 ± 0.182 | 2.04 ± 0.243 | 4.27 ± 0.257 | 1.202 ± 0.147 | 1.724 ± 0.193 | 5.393 ± 0.494 | 2.167 ± 0.24 | 3.559 ± 0.309 | 4.745 ± 0.382 | 3.733 ± 0.301 | 3.922 ± 0.285 | 0.648 ± 0.133 | 1.629 ± 0.173 | 0.0 ± 0.0
Gln | 2.499 ± 0.222 | 0.981 ± 0.129 | 1.629 ± 0.195 | 2.23 ± 0.174 | 1.455 ± 0.162 | 1.392 ± 0.157 | 1.313 ± 0.141 | 2.009 ± 0.228 | 2.135 ± 0.199 | 4.017 ± 0.299 | 0.901 ± 0.105 | 1.882 ± 0.176 | 2.088 ± 0.192 | 2.404 ± 0.308 | 2.878 ± 0.206 | 2.451 ± 0.202 | 2.784 ± 0.257 | 2.357 ± 0.188 | 0.332 ± 0.069 | 1.502 ± 0.154 | 0.0 ± 0.0
Arg | 3.606 ± 0.254 | 1.645 ± 0.157 | 3.305 ± 0.272 | 3.179 ± 0.232 | 2.341 ± 0.225 | 3.1 ± 0.262 | 2.72 ± 0.248 | 2.815 ± 0.238 | 2.768 ± 0.184 | 6.738 ± 0.361 | 1.313 ± 0.141 | 2.973 ± 0.243 | 3.211 ± 0.273 | 3.147 ± 0.239 | 5.63 ± 0.464 | 4.318 ± 0.253 | 3.653 ± 0.267 | 3.906 ± 0.293 | 1.075 ± 0.138 | 2.594 ± 0.184 | 0.0 ± 0.0
Ser | 5.108 ± 0.287 | 1.708 ± 0.165 | 3.891 ± 0.224 | 4.27 ± 0.249 | 3.132 ± 0.225 | 4.381 ± 0.274 | 2.072 ± 0.187 | 3.369 ± 0.255 | 2.499 ± 0.224 | 7.354 ± 0.351 | 1.898 ± 0.193 | 3.1 ± 0.231 | 4.587 ± 0.351 | 2.752 ± 0.178 | 5.093 ± 0.337 | 9.221 ± 0.605 | 6.247 ± 0.754 | 5.267 ± 0.345 | 1.028 ± 0.128 | 2.499 ± 0.209 | 0.0 ± 0.0
Thr | 4.998 ± 0.413 | 2.103 ± 0.183 | 2.815 ± 0.216 | 3.432 ± 0.235 | 3.258 ± 0.262 | 2.989 ± 0.247 | 2.072 ± 0.184 | 3.59 ± 0.288 | 2.926 ± 0.266 | 6.611 ± 0.358 | 1.534 ± 0.139 | 3.005 ± 0.336 | 4.334 ± 0.365 | 2.499 ± 0.182 | 3.812 ± 0.286 | 7.038 ± 0.728 | 7.908 ± 1.336 | 6.105 ± 0.371 | 0.949 ± 0.136 | 2.372 ± 0.212 | 0.0 ± 0.0
Val | 4.397 ± 0.259 | 1.914 ± 0.168 | 3.084 ± 0.229 | 2.91 ± 0.195 | 3.495 ± 0.257 | 2.736 ± 0.211 | 1.85 ± 0.184 | 3.97 ± 0.221 | 2.483 ± 0.232 | 7.149 ± 0.415 | 1.787 ± 0.186 | 3.132 ± 0.237 | 3.479 ± 0.231 | 2.357 ± 0.237 | 4.049 ± 0.332 | 5.931 ± 0.332 | 5.536 ± 0.376 | 5.077 ± 0.305 | 1.028 ± 0.136 | 3.021 ± 0.236 | 0.0 ± 0.0
Trp | 0.569 ± 0.103 | 0.506 ± 0.102 | 0.68 ± 0.109 | 0.68 ± 0.122 | 0.696 ± 0.101 | 0.49 ± 0.105 | 0.506 ± 0.089 | 0.917 ± 0.135 | 0.538 ± 0.108 | 1.629 ± 0.153 | 0.427 ± 0.08 | 0.554 ± 0.094 | 0.807 ± 0.125 | 0.554 ± 0.089 | 0.996 ± 0.146 | 1.012 ± 0.117 | 0.728 ± 0.1 | 0.648 ± 0.083 | 0.316 ± 0.077 | 0.712 ± 0.104 | 0.0 ± 0.0
Tyr | 2.262 ± 0.204 | 0.949 ± 0.116 | 1.835 ± 0.171 | 2.151 ± 0.212 | 1.708 ± 0.169 | 1.708 ± 0.175 | 1.202 ± 0.144 | 2.088 ± 0.198 | 1.155 ± 0.153 | 3.954 ± 0.232 | 0.901 ± 0.118 | 1.708 ± 0.228 | 1.139 ± 0.126 | 1.392 ± 0.183 | 2.768 ± 0.191 | 2.562 ± 0.195 | 2.594 ± 0.251 | 2.973 ± 0.241 | 0.601 ± 0.107 | 1.455 ± 0.173 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 174 proteins (63229 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
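A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the error is taken as the standard deviation across replicates. The page does not say which statistic is actually reported, and the function names and the fixed `seed` are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_dipeptide_error(sequences, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Assumption: each replicate draws len(sequences) proteins with
    replacement, so the protein (not the residue) is the resampling unit.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # Counter returns 0 for missing pairs
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input: three short, made-up sequences (not from this proteome).
mean, sd = bootstrap_dipeptide_error(["MALLA", "LLK", "AAAA"], "LL")
```

Resampling whole proteins rather than individual dipeptides keeps the pairs from one protein together, so a single repeat-rich protein widens the error of the dipeptides it contains.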

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski