Amino acid dipepetide frequency for Rat cytomegalovirus (isolate England) (RCMV-E) (Murid herpesvirus 8)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.308AlaAla: 5.308 ± 0.689
1.537AlaCys: 1.537 ± 0.187
3.181AlaAsp: 3.181 ± 0.233
2.842AlaGlu: 2.842 ± 0.25
2.502AlaPhe: 2.502 ± 0.239
3.003AlaGly: 3.003 ± 0.279
1.394AlaHis: 1.394 ± 0.204
3.485AlaIle: 3.485 ± 0.241
2.127AlaLys: 2.127 ± 0.178
5.666AlaLeu: 5.666 ± 0.333
1.68AlaMet: 1.68 ± 0.179
2.216AlaAsn: 2.216 ± 0.185
2.699AlaPro: 2.699 ± 0.257
1.519AlaGln: 1.519 ± 0.173
3.575AlaArg: 3.575 ± 0.304
5.308AlaSer: 5.308 ± 0.332
4.182AlaThr: 4.182 ± 0.268
4.861AlaVal: 4.861 ± 0.318
0.715AlaTrp: 0.715 ± 0.159
1.823AlaTyr: 1.823 ± 0.165
0.0AlaXaa: 0.0 ± 0.0
Cys
1.519CysAla: 1.519 ± 0.199
0.608CysCys: 0.608 ± 0.103
1.769CysAsp: 1.769 ± 0.182
1.501CysGlu: 1.501 ± 0.163
0.912CysPhe: 0.912 ± 0.129
1.215CysGly: 1.215 ± 0.14
0.447CysHis: 0.447 ± 0.086
1.68CysIle: 1.68 ± 0.169
1.001CysLys: 1.001 ± 0.13
2.091CysLeu: 2.091 ± 0.183
0.518CysMet: 0.518 ± 0.104
0.912CysAsn: 0.912 ± 0.136
1.001CysPro: 1.001 ± 0.124
0.679CysGln: 0.679 ± 0.126
1.912CysArg: 1.912 ± 0.206
1.68CysSer: 1.68 ± 0.181
1.126CysThr: 1.126 ± 0.174
2.359CysVal: 2.359 ± 0.197
0.161CysTrp: 0.161 ± 0.049
1.09CysTyr: 1.09 ± 0.132
0.0CysXaa: 0.0 ± 0.0
Asp
3.843AspAla: 3.843 ± 0.297
0.947AspCys: 0.947 ± 0.143
4.647AspAsp: 4.647 ± 0.339
4.254AspGlu: 4.254 ± 0.31
2.377AspPhe: 2.377 ± 0.213
4.468AspGly: 4.468 ± 0.36
1.18AspHis: 1.18 ± 0.122
3.843AspIle: 3.843 ± 0.3
2.377AspLys: 2.377 ± 0.214
4.915AspLeu: 4.915 ± 0.379
1.805AspMet: 1.805 ± 0.171
2.27AspAsn: 2.27 ± 0.187
3.36AspPro: 3.36 ± 0.25
1.162AspGln: 1.162 ± 0.128
3.896AspArg: 3.896 ± 0.282
4.736AspSer: 4.736 ± 0.342
3.432AspThr: 3.432 ± 0.274
4.844AspVal: 4.844 ± 0.385
0.5AspTrp: 0.5 ± 0.101
1.895AspTyr: 1.895 ± 0.195
0.0AspXaa: 0.0 ± 0.0
Glu
3.092GluAla: 3.092 ± 0.233
1.287GluCys: 1.287 ± 0.187
3.914GluAsp: 3.914 ± 0.303
3.682GluGlu: 3.682 ± 0.238
2.037GluPhe: 2.037 ± 0.188
2.538GluGly: 2.538 ± 0.221
1.519GluHis: 1.519 ± 0.157
3.342GluIle: 3.342 ± 0.252
2.86GluLys: 2.86 ± 0.23
4.486GluLeu: 4.486 ± 0.296
1.233GluMet: 1.233 ± 0.118
3.181GluAsn: 3.181 ± 0.253
2.091GluPro: 2.091 ± 0.161
1.358GluGln: 1.358 ± 0.174
3.449GluArg: 3.449 ± 0.243
4.45GluSer: 4.45 ± 0.273
4.236GluThr: 4.236 ± 0.365
2.878GluVal: 2.878 ± 0.212
0.411GluTrp: 0.411 ± 0.088
1.859GluTyr: 1.859 ± 0.207
0.0GluXaa: 0.0 ± 0.0
Phe
2.538PheAla: 2.538 ± 0.191
1.233PheCys: 1.233 ± 0.153
2.466PheAsp: 2.466 ± 0.219
2.037PheGlu: 2.037 ± 0.165
2.091PhePhe: 2.091 ± 0.222
2.663PheGly: 2.663 ± 0.243
1.144PheHis: 1.144 ± 0.13
3.003PheIle: 3.003 ± 0.219
1.93PheLys: 1.93 ± 0.157
4.236PheLeu: 4.236 ± 0.281
1.197PheMet: 1.197 ± 0.147
1.823PheAsn: 1.823 ± 0.178
2.002PhePro: 2.002 ± 0.18
0.894PheGln: 0.894 ± 0.143
2.538PheArg: 2.538 ± 0.182
3.646PheSer: 3.646 ± 0.292
2.538PheThr: 2.538 ± 0.212
3.181PheVal: 3.181 ± 0.261
0.465PheTrp: 0.465 ± 0.085
1.769PheTyr: 1.769 ± 0.211
0.0PheXaa: 0.0 ± 0.0
Gly
3.485GlyAla: 3.485 ± 0.331
1.072GlyCys: 1.072 ± 0.144
3.324GlyAsp: 3.324 ± 0.254
3.414GlyGlu: 3.414 ± 0.25
2.198GlyPhe: 2.198 ± 0.24
4.236GlyGly: 4.236 ± 0.738
1.197GlyHis: 1.197 ± 0.153
2.967GlyIle: 2.967 ± 0.238
2.627GlyLys: 2.627 ± 0.17
4.54GlyLeu: 4.54 ± 0.294
1.448GlyMet: 1.448 ± 0.157
2.502GlyAsn: 2.502 ± 0.228
2.216GlyPro: 2.216 ± 0.212
1.501GlyGln: 1.501 ± 0.2
3.807GlyArg: 3.807 ± 0.329
4.397GlySer: 4.397 ± 0.336
3.825GlyThr: 3.825 ± 0.267
4.504GlyVal: 4.504 ± 0.33
0.679GlyTrp: 0.679 ± 0.103
2.073GlyTyr: 2.073 ± 0.21
0.0GlyXaa: 0.0 ± 0.0
His
1.626HisAla: 1.626 ± 0.197
0.34HisCys: 0.34 ± 0.076
1.251HisAsp: 1.251 ± 0.151
1.305HisGlu: 1.305 ± 0.156
1.144HisPhe: 1.144 ± 0.157
1.466HisGly: 1.466 ± 0.161
0.608HisHis: 0.608 ± 0.105
1.555HisIle: 1.555 ± 0.148
1.09HisLys: 1.09 ± 0.133
1.662HisLeu: 1.662 ± 0.152
0.465HisMet: 0.465 ± 0.102
1.215HisAsn: 1.215 ± 0.153
1.233HisPro: 1.233 ± 0.169
0.608HisGln: 0.608 ± 0.123
1.984HisArg: 1.984 ± 0.157
1.626HisSer: 1.626 ± 0.199
2.055HisThr: 2.055 ± 0.192
1.805HisVal: 1.805 ± 0.165
0.197HisTrp: 0.197 ± 0.073
0.84HisTyr: 0.84 ± 0.112
0.0HisXaa: 0.0 ± 0.0
Ile
3.306IleAla: 3.306 ± 0.248
1.591IleCys: 1.591 ± 0.191
3.128IleAsp: 3.128 ± 0.236
3.199IleGlu: 3.199 ± 0.282
2.86IlePhe: 2.86 ± 0.249
3.021IleGly: 3.021 ± 0.248
1.305IleHis: 1.305 ± 0.152
3.735IleIle: 3.735 ± 0.304
2.717IleLys: 2.717 ± 0.226
5.791IleLeu: 5.791 ± 0.33
1.269IleMet: 1.269 ± 0.148
2.627IleAsn: 2.627 ± 0.21
3.503IlePro: 3.503 ± 0.216
1.805IleGln: 1.805 ± 0.179
4.361IleArg: 4.361 ± 0.316
5.344IleSer: 5.344 ± 0.409
3.932IleThr: 3.932 ± 0.304
3.986IleVal: 3.986 ± 0.253
0.536IleTrp: 0.536 ± 0.09
2.592IleTyr: 2.592 ± 0.287
0.0IleXaa: 0.0 ± 0.0
Lys
1.734LysAla: 1.734 ± 0.177
0.929LysCys: 0.929 ± 0.128
3.074LysAsp: 3.074 ± 0.233
2.127LysGlu: 2.127 ± 0.214
1.877LysPhe: 1.877 ± 0.224
1.948LysGly: 1.948 ± 0.16
1.466LysHis: 1.466 ± 0.173
3.378LysIle: 3.378 ± 0.248
3.056LysLys: 3.056 ± 0.304
4.004LysLeu: 4.004 ± 0.321
1.251LysMet: 1.251 ± 0.162
2.86LysAsn: 2.86 ± 0.24
1.895LysPro: 1.895 ± 0.207
1.34LysGln: 1.34 ± 0.163
4.004LysArg: 4.004 ± 0.24
3.253LysSer: 3.253 ± 0.262
3.521LysThr: 3.521 ± 0.251
2.717LysVal: 2.717 ± 0.211
0.375LysTrp: 0.375 ± 0.085
1.43LysTyr: 1.43 ± 0.159
0.0LysXaa: 0.0 ± 0.0
Leu
4.826LeuAla: 4.826 ± 0.298
2.931LeuCys: 2.931 ± 0.235
4.969LeuAsp: 4.969 ± 0.325
4.039LeuGlu: 4.039 ± 0.287
3.968LeuPhe: 3.968 ± 0.301
4.522LeuGly: 4.522 ± 0.326
2.02LeuHis: 2.02 ± 0.233
5.022LeuIle: 5.022 ± 0.317
3.878LeuLys: 3.878 ± 0.31
8.615LeuLeu: 8.615 ± 0.523
2.288LeuMet: 2.288 ± 0.185
3.7LeuAsn: 3.7 ± 0.242
3.986LeuPro: 3.986 ± 0.308
2.556LeuGln: 2.556 ± 0.224
6.095LeuArg: 6.095 ± 0.4
7.918LeuSer: 7.918 ± 0.322
5.791LeuThr: 5.791 ± 0.356
5.308LeuVal: 5.308 ± 0.337
0.769LeuTrp: 0.769 ± 0.134
3.128LeuTyr: 3.128 ± 0.303
0.0LeuXaa: 0.0 ± 0.0
Met
1.537MetAla: 1.537 ± 0.135
0.769MetCys: 0.769 ± 0.126
1.662MetAsp: 1.662 ± 0.208
1.287MetGlu: 1.287 ± 0.14
1.537MetPhe: 1.537 ± 0.183
1.001MetGly: 1.001 ± 0.161
0.357MetHis: 0.357 ± 0.07
1.394MetIle: 1.394 ± 0.18
1.18MetLys: 1.18 ± 0.148
2.359MetLeu: 2.359 ± 0.214
0.786MetMet: 0.786 ± 0.131
1.162MetAsn: 1.162 ± 0.138
0.697MetPro: 0.697 ± 0.113
0.483MetGln: 0.483 ± 0.097
1.823MetArg: 1.823 ± 0.194
2.091MetSer: 2.091 ± 0.191
1.466MetThr: 1.466 ± 0.149
1.215MetVal: 1.215 ± 0.165
0.375MetTrp: 0.375 ± 0.078
1.019MetTyr: 1.019 ± 0.135
0.0MetXaa: 0.0 ± 0.0
Asn
3.021AsnAla: 3.021 ± 0.248
1.019AsnCys: 1.019 ± 0.134
2.824AsnAsp: 2.824 ± 0.27
2.341AsnGlu: 2.341 ± 0.248
1.466AsnPhe: 1.466 ± 0.173
2.735AsnGly: 2.735 ± 0.211
0.804AsnHis: 0.804 ± 0.112
3.003AsnIle: 3.003 ± 0.265
2.037AsnLys: 2.037 ± 0.175
3.485AsnLeu: 3.485 ± 0.23
1.108AsnMet: 1.108 ± 0.15
2.145AsnAsn: 2.145 ± 0.207
1.93AsnPro: 1.93 ± 0.213
1.233AsnGln: 1.233 ± 0.161
2.752AsnArg: 2.752 ± 0.216
3.003AsnSer: 3.003 ± 0.334
3.217AsnThr: 3.217 ± 0.255
4.558AsnVal: 4.558 ± 0.307
0.518AsnTrp: 0.518 ± 0.094
1.18AsnTyr: 1.18 ± 0.14
0.0AsnXaa: 0.0 ± 0.0
Pro
2.323ProAla: 2.323 ± 0.229
1.019ProCys: 1.019 ± 0.139
2.788ProAsp: 2.788 ± 0.206
2.717ProGlu: 2.717 ± 0.229
2.127ProPhe: 2.127 ± 0.206
2.627ProGly: 2.627 ± 0.266
1.287ProHis: 1.287 ± 0.161
2.538ProIle: 2.538 ± 0.184
2.252ProLys: 2.252 ± 0.186
3.771ProLeu: 3.771 ± 0.271
1.019ProMet: 1.019 ± 0.14
1.769ProAsn: 1.769 ± 0.192
3.396ProPro: 3.396 ± 0.323
1.555ProGln: 1.555 ± 0.188
3.485ProArg: 3.485 ± 0.329
4.558ProSer: 4.558 ± 0.387
3.163ProThr: 3.163 ± 0.311
4.075ProVal: 4.075 ± 0.308
0.554ProTrp: 0.554 ± 0.102
1.573ProTyr: 1.573 ± 0.16
0.0ProXaa: 0.0 ± 0.0
Gln
1.43GlnAla: 1.43 ± 0.198
0.643GlnCys: 0.643 ± 0.115
1.144GlnAsp: 1.144 ± 0.116
1.501GlnGlu: 1.501 ± 0.208
1.126GlnPhe: 1.126 ± 0.109
1.144GlnGly: 1.144 ± 0.124
0.822GlnHis: 0.822 ± 0.134
1.895GlnIle: 1.895 ± 0.2
1.376GlnLys: 1.376 ± 0.151
2.055GlnLeu: 2.055 ± 0.169
0.661GlnMet: 0.661 ± 0.122
1.394GlnAsn: 1.394 ± 0.178
1.734GlnPro: 1.734 ± 0.213
1.358GlnGln: 1.358 ± 0.263
2.413GlnArg: 2.413 ± 0.214
1.912GlnSer: 1.912 ± 0.169
2.252GlnThr: 2.252 ± 0.245
1.394GlnVal: 1.394 ± 0.179
0.25GlnTrp: 0.25 ± 0.064
0.804GlnTyr: 0.804 ± 0.116
0.0GlnXaa: 0.0 ± 0.0
Arg
3.753ArgAla: 3.753 ± 0.301
1.966ArgCys: 1.966 ± 0.194
4.039ArgAsp: 4.039 ± 0.311
3.789ArgGlu: 3.789 ± 0.298
3.253ArgPhe: 3.253 ± 0.23
4.057ArgGly: 4.057 ± 0.364
1.912ArgHis: 1.912 ± 0.163
3.914ArgIle: 3.914 ± 0.289
3.199ArgLys: 3.199 ± 0.244
6.255ArgLeu: 6.255 ± 0.398
1.609ArgMet: 1.609 ± 0.15
2.931ArgAsn: 2.931 ± 0.289
2.806ArgPro: 2.806 ± 0.254
1.895ArgGln: 1.895 ± 0.186
6.649ArgArg: 6.649 ± 0.438
5.362ArgSer: 5.362 ± 0.318
3.682ArgThr: 3.682 ± 0.239
4.647ArgVal: 4.647 ± 0.3
0.983ArgTrp: 0.983 ± 0.16
2.377ArgTyr: 2.377 ± 0.293
0.0ArgXaa: 0.0 ± 0.0
Ser
4.951SerAla: 4.951 ± 0.29
1.823SerCys: 1.823 ± 0.179
5.809SerAsp: 5.809 ± 0.432
4.772SerGlu: 4.772 ± 0.324
3.324SerPhe: 3.324 ± 0.259
5.326SerGly: 5.326 ± 0.418
2.109SerHis: 2.109 ± 0.195
4.808SerIle: 4.808 ± 0.331
3.557SerLys: 3.557 ± 0.231
6.506SerLeu: 6.506 ± 0.359
1.698SerMet: 1.698 ± 0.142
3.378SerAsn: 3.378 ± 0.245
4.701SerPro: 4.701 ± 0.378
2.216SerGln: 2.216 ± 0.267
4.987SerArg: 4.987 ± 0.319
7.775SerSer: 7.775 ± 0.569
6.023SerThr: 6.023 ± 0.552
6.488SerVal: 6.488 ± 0.404
0.786SerTrp: 0.786 ± 0.112
2.735SerTyr: 2.735 ± 0.26
0.0SerXaa: 0.0 ± 0.0
Thr
4.236ThrAla: 4.236 ± 0.306
1.466ThrCys: 1.466 ± 0.16
3.789ThrAsp: 3.789 ± 0.294
3.485ThrGlu: 3.485 ± 0.275
3.235ThrPhe: 3.235 ± 0.303
3.861ThrGly: 3.861 ± 0.295
1.787ThrHis: 1.787 ± 0.183
3.861ThrIle: 3.861 ± 0.282
3.217ThrLys: 3.217 ± 0.33
5.701ThrLeu: 5.701 ± 0.377
1.483ThrMet: 1.483 ± 0.156
2.484ThrAsn: 2.484 ± 0.275
3.753ThrPro: 3.753 ± 0.396
2.52ThrGln: 2.52 ± 0.259
3.682ThrArg: 3.682 ± 0.306
5.63ThrSer: 5.63 ± 0.512
5.362ThrThr: 5.362 ± 0.583
5.612ThrVal: 5.612 ± 0.287
0.661ThrTrp: 0.661 ± 0.119
2.538ThrTyr: 2.538 ± 0.186
0.0ThrXaa: 0.0 ± 0.0
Val
4.379ValAla: 4.379 ± 0.359
2.002ValCys: 2.002 ± 0.203
4.093ValAsp: 4.093 ± 0.252
3.324ValGlu: 3.324 ± 0.258
3.378ValPhe: 3.378 ± 0.262
3.664ValGly: 3.664 ± 0.292
1.823ValHis: 1.823 ± 0.195
4.593ValIle: 4.593 ± 0.324
3.449ValLys: 3.449 ± 0.296
6.166ValLeu: 6.166 ± 0.43
1.609ValMet: 1.609 ± 0.19
3.414ValAsn: 3.414 ± 0.263
3.735ValPro: 3.735 ± 0.256
1.644ValGln: 1.644 ± 0.175
4.361ValArg: 4.361 ± 0.28
7.632ValSer: 7.632 ± 0.476
5.147ValThr: 5.147 ± 0.365
5.076ValVal: 5.076 ± 0.385
0.643ValTrp: 0.643 ± 0.109
3.128ValTyr: 3.128 ± 0.273
0.0ValXaa: 0.0 ± 0.0
Trp
0.518TrpAla: 0.518 ± 0.104
0.286TrpCys: 0.286 ± 0.071
0.447TrpAsp: 0.447 ± 0.087
0.483TrpGlu: 0.483 ± 0.093
0.59TrpPhe: 0.59 ± 0.092
0.375TrpGly: 0.375 ± 0.076
0.107TrpHis: 0.107 ± 0.048
0.626TrpIle: 0.626 ± 0.107
0.554TrpLys: 0.554 ± 0.076
1.019TrpLeu: 1.019 ± 0.13
0.25TrpMet: 0.25 ± 0.067
0.34TrpAsn: 0.34 ± 0.075
0.572TrpPro: 0.572 ± 0.105
0.268TrpGln: 0.268 ± 0.069
0.733TrpArg: 0.733 ± 0.13
0.983TrpSer: 0.983 ± 0.126
0.661TrpThr: 0.661 ± 0.103
0.608TrpVal: 0.608 ± 0.132
0.232TrpTrp: 0.232 ± 0.079
0.483TrpTyr: 0.483 ± 0.093
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.198TyrAla: 2.198 ± 0.212
0.715TyrCys: 0.715 ± 0.135
2.431TyrAsp: 2.431 ± 0.182
1.823TyrGlu: 1.823 ± 0.209
1.394TyrPhe: 1.394 ± 0.154
2.145TyrGly: 2.145 ± 0.235
0.786TyrHis: 0.786 ± 0.122
1.966TyrIle: 1.966 ± 0.194
1.752TyrLys: 1.752 ± 0.204
3.003TyrLeu: 3.003 ± 0.237
0.804TyrMet: 0.804 ± 0.133
2.055TyrAsn: 2.055 ± 0.233
1.287TyrPro: 1.287 ± 0.153
0.715TyrGln: 0.715 ± 0.111
2.574TyrArg: 2.574 ± 0.248
2.466TyrSer: 2.466 ± 0.194
2.752TyrThr: 2.752 ± 0.265
3.146TyrVal: 3.146 ± 0.268
0.322TyrTrp: 0.322 ± 0.076
1.466TyrTyr: 1.466 ± 0.158
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 137 proteins (55952 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski