Amino acid dipeptide frequency for Rhesus cytomegalovirus (strain 68-1) (RhCMV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred randomly and with equal probability in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
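A minimal sketch of how such frequencies could be computed, assuming each protein is a string of one-letter amino acid codes and that overlapping adjacent pairs are counted within each protein. The exact counting procedure used for the table is not stated on this page; the function name `dipeptide_frequencies` and the toy sequences are illustrative only.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins.

    Overlapping adjacent pairs are counted within each protein and any
    non-standard letter is mapped to 'X' (Xaa). This counting scheme is
    an assumption, not necessarily the one used for the table.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Toy input, not real RhCMV sequences.
freqs = dipeptide_frequencies(["MALLLA", "MKLL"])
uniform = 1000.0 / 400  # 2.5 per mille if all 400 dipeptides were equally likely
overrepresented = sorted(d for d, f in freqs.items() if f > uniform)
print(overrepresented)
```

The result always contains all 441 keys, so dipeptides that never occur (such as those involving Xaa in this proteome) appear with a frequency of 0.0.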

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
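As a worked example of the unit conversion, the sketch below turns three per mille values copied from the table into fractions and compares them with the uniform expectation of 1/400. The helper names `per_mille_to_fraction` and `fold_vs_uniform` are illustrative.

```python
UNIFORM = 1 / 400  # expected fraction per dipeptide under a uniform model


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def fold_vs_uniform(value):
    """Ratio of an observed per mille value to the uniform expectation."""
    return per_mille_to_fraction(value) / UNIFORM


# Values taken from the table on this page.
table = {"LeuLeu": 10.777, "AlaAla": 6.739, "TrpTrp": 0.465}
for name, pm in table.items():
    ratio = fold_vs_uniform(pm)
    label = "over" if ratio > 1 else "under"
    print(f"{name}: {pm} per mille = {per_mille_to_fraction(pm):.6f} "
          f"({ratio:.2f}x uniform, {label}represented)")
```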

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰, value ± error), grouped by first amino acid. Amino acid order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.739 ± 0.672
AlaCys: 1.681 ± 0.157
AlaAsp: 2.762 ± 0.226
AlaGlu: 3.317 ± 0.228
AlaPhe: 2.927 ± 0.233
AlaGly: 3.092 ± 0.264
AlaHis: 1.816 ± 0.179
AlaIle: 3.227 ± 0.221
AlaLys: 2.056 ± 0.216
AlaLeu: 6.694 ± 0.343
AlaMet: 1.516 ± 0.133
AlaAsn: 2.371 ± 0.224
AlaPro: 3.707 ± 0.285
AlaGln: 2.567 ± 0.209
AlaArg: 3.917 ± 0.279
AlaSer: 5.598 ± 0.283
AlaThr: 4.863 ± 0.335
AlaVal: 5.013 ± 0.298
AlaTrp: 0.916 ± 0.137
AlaTyr: 1.621 ± 0.167
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.741 ± 0.183
CysCys: 0.931 ± 0.164
CysAsp: 1.246 ± 0.133
CysGlu: 1.186 ± 0.135
CysPhe: 1.036 ± 0.122
CysGly: 1.741 ± 0.167
CysHis: 0.916 ± 0.118
CysIle: 1.231 ± 0.134
CysLys: 0.765 ± 0.105
CysLeu: 2.792 ± 0.216
CysMet: 0.81 ± 0.112
CysAsn: 1.141 ± 0.151
CysPro: 1.366 ± 0.166
CysGln: 0.976 ± 0.101
CysArg: 1.861 ± 0.19
CysSer: 1.861 ± 0.184
CysThr: 1.651 ± 0.16
CysVal: 1.876 ± 0.2
CysTrp: 0.45 ± 0.076
CysTyr: 1.171 ± 0.137
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.762 ± 0.2
AspCys: 0.765 ± 0.121
AspAsp: 3.347 ± 0.309
AspGlu: 3.452 ± 0.257
AspPhe: 1.996 ± 0.177
AspGly: 2.071 ± 0.231
AspHis: 1.051 ± 0.128
AspIle: 2.492 ± 0.256
AspLys: 1.231 ± 0.134
AspLeu: 4.623 ± 0.315
AspMet: 1.231 ± 0.135
AspAsn: 1.801 ± 0.158
AspPro: 2.477 ± 0.215
AspGln: 1.201 ± 0.145
AspArg: 2.897 ± 0.219
AspSer: 3.317 ± 0.232
AspThr: 2.777 ± 0.197
AspVal: 3.002 ± 0.223
AspTrp: 0.63 ± 0.114
AspTyr: 1.666 ± 0.159
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.542 ± 0.232
GluCys: 1.201 ± 0.121
GluAsp: 2.792 ± 0.237
GluGlu: 3.407 ± 0.223
GluPhe: 1.441 ± 0.156
GluGly: 1.876 ± 0.164
GluHis: 1.771 ± 0.146
GluIle: 2.251 ± 0.192
GluLys: 2.086 ± 0.189
GluLeu: 4.578 ± 0.336
GluMet: 1.321 ± 0.117
GluAsn: 2.296 ± 0.179
GluPro: 2.522 ± 0.218
GluGln: 1.621 ± 0.188
GluArg: 3.497 ± 0.256
GluSer: 3.362 ± 0.211
GluThr: 3.557 ± 0.259
GluVal: 3.047 ± 0.23
GluTrp: 0.585 ± 0.091
GluTyr: 1.486 ± 0.157
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.101 ± 0.188
PheCys: 1.516 ± 0.176
PheAsp: 1.771 ± 0.18
PheGlu: 1.786 ± 0.179
PhePhe: 2.026 ± 0.203
PheGly: 2.416 ± 0.203
PheHis: 1.066 ± 0.118
PheIle: 2.446 ± 0.188
PheLys: 1.231 ± 0.148
PheLeu: 4.022 ± 0.244
PheMet: 1.171 ± 0.137
PheAsn: 1.741 ± 0.139
PhePro: 2.221 ± 0.159
PheGln: 1.621 ± 0.167
PheArg: 2.537 ± 0.231
PheSer: 3.242 ± 0.26
PheThr: 2.747 ± 0.206
PheVal: 3.407 ± 0.22
PheTrp: 0.901 ± 0.139
PheTyr: 1.741 ± 0.18
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.777 ± 0.242
GlyCys: 1.171 ± 0.146
GlyAsp: 2.236 ± 0.171
GlyGlu: 2.251 ± 0.208
GlyPhe: 1.951 ± 0.194
GlyGly: 3.737 ± 0.373
GlyHis: 1.291 ± 0.131
GlyIle: 2.462 ± 0.211
GlyLys: 1.726 ± 0.164
GlyLeu: 5.013 ± 0.269
GlyMet: 1.021 ± 0.126
GlyAsn: 2.086 ± 0.18
GlyPro: 1.996 ± 0.192
GlyGln: 1.906 ± 0.176
GlyArg: 3.572 ± 0.312
GlySer: 3.497 ± 0.217
GlyThr: 3.302 ± 0.255
GlyVal: 4.007 ± 0.29
GlyTrp: 0.735 ± 0.097
GlyTyr: 1.666 ± 0.168
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.356 ± 0.203
HisCys: 0.675 ± 0.115
HisAsp: 1.606 ± 0.138
HisGlu: 1.471 ± 0.154
HisPhe: 1.231 ± 0.154
HisGly: 1.696 ± 0.168
HisHis: 1.591 ± 0.233
HisIle: 1.576 ± 0.139
HisLys: 1.036 ± 0.132
HisLeu: 3.257 ± 0.282
HisMet: 0.705 ± 0.109
HisAsn: 1.591 ± 0.12
HisPro: 1.591 ± 0.162
HisGln: 1.246 ± 0.155
HisArg: 2.927 ± 0.219
HisSer: 2.176 ± 0.212
HisThr: 2.221 ± 0.206
HisVal: 2.717 ± 0.224
HisTrp: 0.345 ± 0.067
HisTyr: 0.826 ± 0.122
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.777 ± 0.22
IleCys: 1.606 ± 0.147
IleAsp: 1.951 ± 0.182
IleGlu: 1.921 ± 0.169
IlePhe: 2.281 ± 0.214
IleGly: 2.086 ± 0.177
IleHis: 1.591 ± 0.151
IleIle: 3.197 ± 0.25
IleLys: 1.831 ± 0.196
IleLeu: 4.938 ± 0.274
IleMet: 1.636 ± 0.152
IleAsn: 2.251 ± 0.192
IlePro: 3.242 ± 0.217
IleGln: 2.221 ± 0.197
IleArg: 2.522 ± 0.229
IleSer: 3.662 ± 0.262
IleThr: 3.632 ± 0.282
IleVal: 4.323 ± 0.328
IleTrp: 0.931 ± 0.15
IleTyr: 2.341 ± 0.179
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.251 ± 0.195
LysCys: 0.991 ± 0.109
LysAsp: 1.441 ± 0.171
LysGlu: 1.531 ± 0.198
LysPhe: 1.246 ± 0.155
LysGly: 1.501 ± 0.168
LysHis: 1.531 ± 0.144
LysIle: 1.936 ± 0.187
LysLys: 2.131 ± 0.219
LysLeu: 3.287 ± 0.257
LysMet: 1.051 ± 0.124
LysAsn: 1.876 ± 0.148
LysPro: 1.981 ± 0.24
LysGln: 1.591 ± 0.154
LysArg: 2.912 ± 0.208
LysSer: 2.356 ± 0.181
LysThr: 2.507 ± 0.229
LysVal: 2.056 ± 0.201
LysTrp: 0.375 ± 0.075
LysTyr: 1.291 ± 0.124
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.439 ± 0.292
LeuCys: 3.137 ± 0.234
LeuAsp: 3.752 ± 0.294
LeuGlu: 4.097 ± 0.305
LeuPhe: 4.728 ± 0.305
LeuGly: 4.878 ± 0.29
LeuHis: 3.287 ± 0.229
LeuIle: 5.028 ± 0.355
LeuLys: 3.752 ± 0.271
LeuLeu: 10.777 ± 0.511
LeuMet: 2.957 ± 0.245
LeuAsn: 3.557 ± 0.242
LeuPro: 5.523 ± 0.311
LeuGln: 3.752 ± 0.287
LeuArg: 7.354 ± 0.384
LeuSer: 7.97 ± 0.42
LeuThr: 6.919 ± 0.314
LeuVal: 5.899 ± 0.301
LeuTrp: 1.471 ± 0.188
LeuTyr: 3.587 ± 0.279
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.816 ± 0.226
MetCys: 0.795 ± 0.097
MetAsp: 0.961 ± 0.119
MetGlu: 1.291 ± 0.128
MetPhe: 1.351 ± 0.137
MetGly: 0.871 ± 0.128
MetHis: 0.705 ± 0.094
MetIle: 1.471 ± 0.171
MetLys: 0.946 ± 0.103
MetLeu: 2.567 ± 0.214
MetMet: 0.916 ± 0.134
MetAsn: 1.036 ± 0.123
MetPro: 1.231 ± 0.118
MetGln: 0.976 ± 0.151
MetArg: 1.591 ± 0.155
MetSer: 2.056 ± 0.18
MetThr: 1.591 ± 0.158
MetVal: 1.561 ± 0.164
MetTrp: 0.435 ± 0.082
MetTyr: 1.171 ± 0.138
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.912 ± 0.21
AsnCys: 0.916 ± 0.122
AsnAsp: 1.891 ± 0.174
AsnGlu: 2.026 ± 0.197
AsnPhe: 1.351 ± 0.16
AsnGly: 2.101 ± 0.171
AsnHis: 1.291 ± 0.12
AsnIle: 2.236 ± 0.217
AsnLys: 1.531 ± 0.157
AsnLeu: 3.887 ± 0.295
AsnMet: 1.111 ± 0.141
AsnAsn: 2.176 ± 0.208
AsnPro: 2.146 ± 0.188
AsnGln: 1.531 ± 0.179
AsnArg: 2.326 ± 0.202
AsnSer: 3.287 ± 0.266
AsnThr: 3.347 ± 0.302
AsnVal: 3.422 ± 0.225
AsnTrp: 0.42 ± 0.081
AsnTyr: 1.366 ± 0.147
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.308 ± 0.282
ProCys: 1.441 ± 0.138
ProAsp: 2.912 ± 0.207
ProGlu: 2.642 ± 0.21
ProPhe: 2.116 ± 0.167
ProGly: 2.642 ± 0.262
ProHis: 1.831 ± 0.197
ProIle: 2.431 ± 0.17
ProLys: 1.891 ± 0.199
ProLeu: 4.623 ± 0.232
ProMet: 1.186 ± 0.114
ProAsn: 1.741 ± 0.162
ProPro: 5.508 ± 0.5
ProGln: 2.266 ± 0.225
ProArg: 4.278 ± 0.389
ProSer: 4.923 ± 0.33
ProThr: 3.662 ± 0.255
ProVal: 3.827 ± 0.278
ProTrp: 0.675 ± 0.118
ProTyr: 1.636 ± 0.159
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.401 ± 0.235
GlnCys: 0.976 ± 0.121
GlnAsp: 1.561 ± 0.169
GlnGlu: 1.936 ± 0.175
GlnPhe: 1.321 ± 0.141
GlnGly: 1.441 ± 0.15
GlnHis: 1.651 ± 0.196
GlnIle: 2.176 ± 0.202
GlnLys: 2.011 ± 0.185
GlnLeu: 3.962 ± 0.297
GlnMet: 0.901 ± 0.111
GlnAsn: 1.936 ± 0.149
GlnPro: 2.146 ± 0.207
GlnGln: 2.507 ± 0.315
GlnArg: 3.032 ± 0.21
GlnSer: 2.341 ± 0.179
GlnThr: 2.642 ± 0.251
GlnVal: 2.326 ± 0.193
GlnTrp: 0.375 ± 0.071
GlnTyr: 1.396 ± 0.143
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.173 ± 0.277
ArgCys: 1.861 ± 0.17
ArgAsp: 3.107 ± 0.266
ArgGlu: 3.212 ± 0.223
ArgPhe: 2.612 ± 0.192
ArgGly: 3.542 ± 0.291
ArgHis: 3.137 ± 0.228
ArgIle: 2.792 ± 0.227
ArgLys: 2.296 ± 0.19
ArgLeu: 7.58 ± 0.327
ArgMet: 1.291 ± 0.124
ArgAsn: 2.987 ± 0.285
ArgPro: 4.022 ± 0.358
ArgGln: 3.347 ± 0.24
ArgArg: 6.814 ± 0.624
ArgSer: 5.088 ± 0.302
ArgThr: 4.052 ± 0.267
ArgVal: 4.278 ± 0.301
ArgTrp: 1.246 ± 0.172
ArgTyr: 2.657 ± 0.185
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.043 ± 0.27
SerCys: 1.756 ± 0.145
SerAsp: 3.767 ± 0.25
SerGlu: 4.173 ± 0.275
SerPhe: 3.212 ± 0.246
SerGly: 4.353 ± 0.268
SerHis: 2.221 ± 0.174
SerIle: 3.602 ± 0.279
SerLys: 2.477 ± 0.194
SerLeu: 7.279 ± 0.323
SerMet: 1.816 ± 0.179
SerAsn: 2.822 ± 0.241
SerPro: 4.683 ± 0.34
SerGln: 2.792 ± 0.174
SerArg: 5.598 ± 0.342
SerSer: 9.096 ± 0.559
SerThr: 6.124 ± 0.54
SerVal: 5.223 ± 0.307
SerTrp: 1.231 ± 0.138
SerTyr: 2.401 ± 0.199
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.088 ± 0.347
ThrCys: 2.161 ± 0.177
ThrAsp: 2.401 ± 0.186
ThrGlu: 3.077 ± 0.22
ThrPhe: 3.107 ± 0.24
ThrGly: 2.912 ± 0.215
ThrHis: 2.071 ± 0.191
ThrIle: 3.617 ± 0.236
ThrLys: 2.687 ± 0.222
ThrLeu: 6.379 ± 0.355
ThrMet: 1.591 ± 0.154
ThrAsn: 2.747 ± 0.262
ThrPro: 4.278 ± 0.266
ThrGln: 2.612 ± 0.221
ThrArg: 4.128 ± 0.266
ThrSer: 6.394 ± 0.618
ThrThr: 7.58 ± 1.107
ThrVal: 6.124 ± 0.343
ThrTrp: 1.021 ± 0.141
ThrTyr: 2.161 ± 0.198
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.443 ± 0.291
ValCys: 1.756 ± 0.169
ValAsp: 2.897 ± 0.242
ValGlu: 2.927 ± 0.21
ValPhe: 3.632 ± 0.252
ValGly: 3.062 ± 0.211
ValHis: 2.191 ± 0.208
ValIle: 3.782 ± 0.204
ValLys: 2.507 ± 0.225
ValLeu: 7.339 ± 0.428
ValMet: 1.846 ± 0.162
ValAsn: 3.017 ± 0.243
ValPro: 3.857 ± 0.242
ValGln: 2.371 ± 0.191
ValArg: 4.383 ± 0.299
ValSer: 5.854 ± 0.349
ValThr: 5.703 ± 0.322
ValVal: 5.538 ± 0.436
ValTrp: 1.111 ± 0.141
ValTyr: 2.927 ± 0.215
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.6 ± 0.093
TrpCys: 0.465 ± 0.099
TrpAsp: 0.63 ± 0.087
TrpGlu: 0.615 ± 0.108
TrpPhe: 0.555 ± 0.087
TrpGly: 0.63 ± 0.125
TrpHis: 0.615 ± 0.124
TrpIle: 0.886 ± 0.135
TrpLys: 0.555 ± 0.1
TrpLeu: 1.756 ± 0.172
TrpMet: 0.45 ± 0.081
TrpAsn: 0.525 ± 0.092
TrpPro: 0.871 ± 0.105
TrpGln: 0.615 ± 0.099
TrpArg: 1.351 ± 0.191
TrpSer: 1.066 ± 0.11
TrpThr: 0.795 ± 0.112
TrpVal: 0.645 ± 0.103
TrpTrp: 0.465 ± 0.101
TrpTyr: 0.735 ± 0.112
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.131 ± 0.194
TyrCys: 0.946 ± 0.117
TyrAsp: 1.756 ± 0.19
TyrGlu: 1.921 ± 0.173
TyrPhe: 1.591 ± 0.18
TyrGly: 1.756 ± 0.151
TyrHis: 1.216 ± 0.173
TyrIle: 2.101 ± 0.173
TyrLys: 1.096 ± 0.134
TyrLeu: 3.512 ± 0.229
TyrMet: 0.795 ± 0.112
TyrAsn: 1.576 ± 0.184
TyrPro: 1.171 ± 0.144
TyrGln: 1.321 ± 0.183
TyrArg: 2.597 ± 0.2
TyrSer: 2.522 ± 0.212
TyrThr: 2.462 ± 0.181
TyrVal: 2.852 ± 0.214
TyrTrp: 0.51 ± 0.092
TyrTyr: 1.336 ± 0.142
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 223 proteins (66627 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
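One way such a protein-level bootstrap could look is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed on each resample, and the standard deviation across replicates is taken as the error. This is an assumption about the procedure, not a description of the database's actual code; the function names, the fixed seed, and the toy sequences are illustrative only.

```python
import random
import statistics


def frequency_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide (one-letter codes) among all adjacent pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(frequency_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


proteins = ["MALLLA", "MKLLAV", "MSTTTS", "MLLRRA"]  # toy data, not RhCMV
mean = frequency_per_mille(proteins, "LL")
err = bootstrap_error(proteins, "LL")
print(f"LL: {mean:.3f} ± {err:.3f} per mille")
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, so the error reflects variation between proteins.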

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski