Amino acid dipepetide frequency for Wood mouse herpesvirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.622AlaAla: 4.622 ± 0.415
1.598AlaCys: 1.598 ± 0.206
2.34AlaAsp: 2.34 ± 0.282
3.196AlaGlu: 3.196 ± 0.301
2.254AlaPhe: 2.254 ± 0.261
3.081AlaGly: 3.081 ± 0.36
1.541AlaHis: 1.541 ± 0.254
3.937AlaIle: 3.937 ± 0.281
2.653AlaLys: 2.653 ± 0.318
5.535AlaLeu: 5.535 ± 0.505
1.683AlaMet: 1.683 ± 0.24
2.34AlaAsn: 2.34 ± 0.289
3.709AlaPro: 3.709 ± 0.474
1.855AlaGln: 1.855 ± 0.213
2.283AlaArg: 2.283 ± 0.277
4.422AlaSer: 4.422 ± 0.353
5.136AlaThr: 5.136 ± 0.448
4.137AlaVal: 4.137 ± 0.426
0.713AlaTrp: 0.713 ± 0.222
1.826AlaTyr: 1.826 ± 0.268
0.0AlaXaa: 0.0 ± 0.0
Cys
1.541CysAla: 1.541 ± 0.243
0.457CysCys: 0.457 ± 0.124
1.227CysAsp: 1.227 ± 0.175
1.284CysGlu: 1.284 ± 0.196
1.484CysPhe: 1.484 ± 0.195
1.655CysGly: 1.655 ± 0.215
0.799CysHis: 0.799 ± 0.158
1.512CysIle: 1.512 ± 0.216
1.37CysLys: 1.37 ± 0.176
3.081CysLeu: 3.081 ± 0.323
0.514CysMet: 0.514 ± 0.115
1.227CysAsn: 1.227 ± 0.214
1.227CysPro: 1.227 ± 0.186
1.056CysGln: 1.056 ± 0.192
0.913CysArg: 0.913 ± 0.173
2.083CysSer: 2.083 ± 0.28
1.797CysThr: 1.797 ± 0.304
1.912CysVal: 1.912 ± 0.189
0.257CysTrp: 0.257 ± 0.085
0.97CysTyr: 0.97 ± 0.172
0.0CysXaa: 0.0 ± 0.0
Asp
2.653AspAla: 2.653 ± 0.232
1.056AspCys: 1.056 ± 0.196
2.425AspAsp: 2.425 ± 0.312
2.768AspGlu: 2.768 ± 0.227
2.311AspPhe: 2.311 ± 0.241
2.168AspGly: 2.168 ± 0.274
1.227AspHis: 1.227 ± 0.218
3.395AspIle: 3.395 ± 0.378
2.111AspLys: 2.111 ± 0.34
5.164AspLeu: 5.164 ± 0.427
1.427AspMet: 1.427 ± 0.168
1.883AspAsn: 1.883 ± 0.207
3.253AspPro: 3.253 ± 0.382
1.598AspGln: 1.598 ± 0.198
1.74AspArg: 1.74 ± 0.248
4.137AspSer: 4.137 ± 0.471
3.253AspThr: 3.253 ± 0.268
3.31AspVal: 3.31 ± 0.34
0.457AspTrp: 0.457 ± 0.136
1.712AspTyr: 1.712 ± 0.19
0.0AspXaa: 0.0 ± 0.0
Glu
3.253GluAla: 3.253 ± 0.294
1.427GluCys: 1.427 ± 0.241
2.768GluAsp: 2.768 ± 0.29
3.623GluGlu: 3.623 ± 0.344
1.855GluPhe: 1.855 ± 0.256
1.969GluGly: 1.969 ± 0.233
1.455GluHis: 1.455 ± 0.155
3.424GluIle: 3.424 ± 0.297
2.91GluLys: 2.91 ± 0.331
5.136GluLeu: 5.136 ± 0.368
1.427GluMet: 1.427 ± 0.196
2.796GluAsn: 2.796 ± 0.331
2.511GluPro: 2.511 ± 0.365
1.797GluGln: 1.797 ± 0.246
1.797GluArg: 1.797 ± 0.224
4.28GluSer: 4.28 ± 0.408
4.479GluThr: 4.479 ± 0.413
2.91GluVal: 2.91 ± 0.3
0.485GluTrp: 0.485 ± 0.107
1.626GluTyr: 1.626 ± 0.214
0.0GluXaa: 0.0 ± 0.0
Phe
2.111PheAla: 2.111 ± 0.26
1.398PheCys: 1.398 ± 0.248
2.168PheAsp: 2.168 ± 0.239
1.598PheGlu: 1.598 ± 0.187
2.14PhePhe: 2.14 ± 0.297
2.197PheGly: 2.197 ± 0.261
1.341PheHis: 1.341 ± 0.163
3.167PheIle: 3.167 ± 0.357
2.71PheLys: 2.71 ± 0.239
5.963PheLeu: 5.963 ± 0.506
1.084PheMet: 1.084 ± 0.177
2.254PheAsn: 2.254 ± 0.217
2.71PhePro: 2.71 ± 0.278
1.769PheGln: 1.769 ± 0.226
1.74PheArg: 1.74 ± 0.193
3.224PheSer: 3.224 ± 0.328
2.682PheThr: 2.682 ± 0.272
2.397PheVal: 2.397 ± 0.335
0.428PheTrp: 0.428 ± 0.104
2.054PheTyr: 2.054 ± 0.237
0.0PheXaa: 0.0 ± 0.0
Gly
3.253GlyAla: 3.253 ± 0.304
0.942GlyCys: 0.942 ± 0.19
2.311GlyAsp: 2.311 ± 0.256
2.454GlyGlu: 2.454 ± 0.176
2.397GlyPhe: 2.397 ± 0.253
3.138GlyGly: 3.138 ± 0.369
1.683GlyHis: 1.683 ± 0.219
2.796GlyIle: 2.796 ± 0.28
3.053GlyLys: 3.053 ± 0.379
5.136GlyLeu: 5.136 ± 0.379
1.312GlyMet: 1.312 ± 0.185
1.997GlyAsn: 1.997 ± 0.266
3.024GlyPro: 3.024 ± 0.28
2.825GlyGln: 2.825 ± 0.302
2.625GlyArg: 2.625 ± 0.286
3.852GlySer: 3.852 ± 0.398
3.11GlyThr: 3.11 ± 0.273
2.882GlyVal: 2.882 ± 0.254
0.571GlyTrp: 0.571 ± 0.108
1.655GlyTyr: 1.655 ± 0.274
0.0GlyXaa: 0.0 ± 0.0
His
1.569HisAla: 1.569 ± 0.188
0.656HisCys: 0.656 ± 0.123
1.37HisAsp: 1.37 ± 0.171
1.655HisGlu: 1.655 ± 0.185
1.341HisPhe: 1.341 ± 0.18
1.769HisGly: 1.769 ± 0.228
0.913HisHis: 0.913 ± 0.184
1.626HisIle: 1.626 ± 0.224
1.398HisLys: 1.398 ± 0.181
3.138HisLeu: 3.138 ± 0.376
0.656HisMet: 0.656 ± 0.145
1.512HisAsn: 1.512 ± 0.177
1.655HisPro: 1.655 ± 0.186
1.141HisGln: 1.141 ± 0.196
1.37HisArg: 1.37 ± 0.194
1.883HisSer: 1.883 ± 0.165
1.969HisThr: 1.969 ± 0.24
1.997HisVal: 1.997 ± 0.252
0.143HisTrp: 0.143 ± 0.066
0.856HisTyr: 0.856 ± 0.186
0.0HisXaa: 0.0 ± 0.0
Ile
2.768IleAla: 2.768 ± 0.291
1.74IleCys: 1.74 ± 0.257
2.996IleAsp: 2.996 ± 0.345
2.625IleGlu: 2.625 ± 0.294
2.739IlePhe: 2.739 ± 0.325
2.197IleGly: 2.197 ± 0.226
1.626IleHis: 1.626 ± 0.201
4.137IleIle: 4.137 ± 0.4
3.595IleLys: 3.595 ± 0.451
6.363IleLeu: 6.363 ± 0.39
1.512IleMet: 1.512 ± 0.215
3.167IleAsn: 3.167 ± 0.293
3.823IlePro: 3.823 ± 0.363
2.254IleGln: 2.254 ± 0.254
2.853IleArg: 2.853 ± 0.283
5.164IleSer: 5.164 ± 0.622
4.651IleThr: 4.651 ± 0.364
3.452IleVal: 3.452 ± 0.335
0.856IleTrp: 0.856 ± 0.147
2.653IleTyr: 2.653 ± 0.304
0.0IleXaa: 0.0 ± 0.0
Lys
2.511LysAla: 2.511 ± 0.308
1.284LysCys: 1.284 ± 0.203
2.454LysAsp: 2.454 ± 0.313
2.71LysGlu: 2.71 ± 0.223
2.254LysPhe: 2.254 ± 0.258
2.168LysGly: 2.168 ± 0.298
1.484LysHis: 1.484 ± 0.21
3.681LysIle: 3.681 ± 0.41
3.395LysLys: 3.395 ± 0.378
5.45LysLeu: 5.45 ± 0.4
1.084LysMet: 1.084 ± 0.176
2.625LysAsn: 2.625 ± 0.315
2.368LysPro: 2.368 ± 0.311
1.912LysGln: 1.912 ± 0.276
2.967LysArg: 2.967 ± 0.338
3.481LysSer: 3.481 ± 0.363
4.451LysThr: 4.451 ± 0.443
3.081LysVal: 3.081 ± 0.316
0.371LysTrp: 0.371 ± 0.115
2.111LysTyr: 2.111 ± 0.308
0.0LysXaa: 0.0 ± 0.0
Leu
6.562LeuAla: 6.562 ± 0.446
3.081LeuCys: 3.081 ± 0.322
5.706LeuAsp: 5.706 ± 0.442
5.564LeuGlu: 5.564 ± 0.445
5.079LeuPhe: 5.079 ± 0.474
4.907LeuGly: 4.907 ± 0.396
3.167LeuHis: 3.167 ± 0.301
5.592LeuIle: 5.592 ± 0.358
5.849LeuLys: 5.849 ± 0.521
11.784LeuLeu: 11.784 ± 0.751
2.511LeuMet: 2.511 ± 0.248
4.137LeuAsn: 4.137 ± 0.363
6.305LeuPro: 6.305 ± 0.486
3.909LeuGln: 3.909 ± 0.386
3.795LeuArg: 3.795 ± 0.293
8.445LeuSer: 8.445 ± 0.512
8.303LeuThr: 8.303 ± 0.71
5.535LeuVal: 5.535 ± 0.362
0.913LeuTrp: 0.913 ± 0.167
3.253LeuTyr: 3.253 ± 0.328
0.0LeuXaa: 0.0 ± 0.0
Met
2.168MetAla: 2.168 ± 0.233
1.056MetCys: 1.056 ± 0.186
1.37MetAsp: 1.37 ± 0.193
1.427MetGlu: 1.427 ± 0.203
1.541MetPhe: 1.541 ± 0.241
1.255MetGly: 1.255 ± 0.189
0.514MetHis: 0.514 ± 0.117
1.027MetIle: 1.027 ± 0.242
0.656MetLys: 0.656 ± 0.133
2.825MetLeu: 2.825 ± 0.323
0.799MetMet: 0.799 ± 0.146
0.685MetAsn: 0.685 ± 0.151
0.97MetPro: 0.97 ± 0.194
0.656MetGln: 0.656 ± 0.168
1.284MetArg: 1.284 ± 0.222
2.311MetSer: 2.311 ± 0.209
1.541MetThr: 1.541 ± 0.195
1.569MetVal: 1.569 ± 0.202
0.342MetTrp: 0.342 ± 0.101
0.913MetTyr: 0.913 ± 0.168
0.0MetXaa: 0.0 ± 0.0
Asn
1.712AsnAla: 1.712 ± 0.217
0.77AsnCys: 0.77 ± 0.14
1.484AsnAsp: 1.484 ± 0.178
1.341AsnGlu: 1.341 ± 0.219
2.34AsnPhe: 2.34 ± 0.347
2.168AsnGly: 2.168 ± 0.348
1.312AsnHis: 1.312 ± 0.173
3.623AsnIle: 3.623 ± 0.392
3.024AsnLys: 3.024 ± 0.373
5.05AsnLeu: 5.05 ± 0.443
1.312AsnMet: 1.312 ± 0.213
2.14AsnAsn: 2.14 ± 0.296
2.482AsnPro: 2.482 ± 0.273
1.341AsnGln: 1.341 ± 0.209
1.826AsnArg: 1.826 ± 0.23
3.766AsnSer: 3.766 ± 0.3
3.024AsnThr: 3.024 ± 0.324
2.739AsnVal: 2.739 ± 0.256
0.599AsnTrp: 0.599 ± 0.135
1.569AsnTyr: 1.569 ± 0.184
0.0AsnXaa: 0.0 ± 0.0
Pro
4.051ProAla: 4.051 ± 0.578
1.398ProCys: 1.398 ± 0.196
2.796ProAsp: 2.796 ± 0.32
4.023ProGlu: 4.023 ± 0.424
1.855ProPhe: 1.855 ± 0.185
4.109ProGly: 4.109 ± 0.409
1.569ProHis: 1.569 ± 0.232
3.966ProIle: 3.966 ± 0.343
2.14ProLys: 2.14 ± 0.247
5.621ProLeu: 5.621 ± 0.479
1.427ProMet: 1.427 ± 0.215
1.769ProAsn: 1.769 ± 0.219
5.278ProPro: 5.278 ± 0.807
2.511ProGln: 2.511 ± 0.337
2.568ProArg: 2.568 ± 0.411
5.25ProSer: 5.25 ± 0.634
5.193ProThr: 5.193 ± 0.643
4.337ProVal: 4.337 ± 0.37
0.77ProTrp: 0.77 ± 0.139
1.284ProTyr: 1.284 ± 0.177
0.0ProXaa: 0.0 ± 0.0
Gln
2.197GlnAla: 2.197 ± 0.301
0.999GlnCys: 0.999 ± 0.162
2.14GlnAsp: 2.14 ± 0.213
2.539GlnGlu: 2.539 ± 0.222
1.912GlnPhe: 1.912 ± 0.189
1.912GlnGly: 1.912 ± 0.2
1.084GlnHis: 1.084 ± 0.185
2.111GlnIle: 2.111 ± 0.192
2.197GlnLys: 2.197 ± 0.272
3.681GlnLeu: 3.681 ± 0.418
0.799GlnMet: 0.799 ± 0.171
1.484GlnAsn: 1.484 ± 0.175
2.111GlnPro: 2.111 ± 0.269
1.883GlnGln: 1.883 ± 0.222
1.655GlnArg: 1.655 ± 0.231
2.996GlnSer: 2.996 ± 0.271
2.14GlnThr: 2.14 ± 0.174
1.94GlnVal: 1.94 ± 0.203
0.599GlnTrp: 0.599 ± 0.105
1.027GlnTyr: 1.027 ± 0.208
0.0GlnXaa: 0.0 ± 0.0
Arg
2.825ArgAla: 2.825 ± 0.378
1.198ArgCys: 1.198 ± 0.218
2.083ArgAsp: 2.083 ± 0.277
2.796ArgGlu: 2.796 ± 0.324
1.341ArgPhe: 1.341 ± 0.171
2.768ArgGly: 2.768 ± 0.306
1.541ArgHis: 1.541 ± 0.228
2.254ArgIle: 2.254 ± 0.265
2.111ArgLys: 2.111 ± 0.273
4.537ArgLeu: 4.537 ± 0.369
0.713ArgMet: 0.713 ± 0.152
2.083ArgAsn: 2.083 ± 0.287
2.996ArgPro: 2.996 ± 0.359
1.255ArgGln: 1.255 ± 0.191
2.482ArgArg: 2.482 ± 0.332
2.653ArgSer: 2.653 ± 0.255
2.539ArgThr: 2.539 ± 0.3
2.425ArgVal: 2.425 ± 0.249
0.457ArgTrp: 0.457 ± 0.127
1.113ArgTyr: 1.113 ± 0.188
0.0ArgXaa: 0.0 ± 0.0
Ser
4.508SerAla: 4.508 ± 0.338
2.026SerCys: 2.026 ± 0.32
4.194SerAsp: 4.194 ± 0.4
3.681SerGlu: 3.681 ± 0.263
3.167SerPhe: 3.167 ± 0.341
4.708SerGly: 4.708 ± 0.375
2.539SerHis: 2.539 ± 0.313
4.765SerIle: 4.765 ± 0.398
4.109SerLys: 4.109 ± 0.293
8.559SerLeu: 8.559 ± 0.602
1.997SerMet: 1.997 ± 0.214
3.623SerAsn: 3.623 ± 0.341
5.25SerPro: 5.25 ± 0.637
2.939SerGln: 2.939 ± 0.347
3.281SerArg: 3.281 ± 0.373
7.704SerSer: 7.704 ± 0.626
6.42SerThr: 6.42 ± 0.549
4.765SerVal: 4.765 ± 0.357
0.942SerTrp: 0.942 ± 0.164
2.625SerTyr: 2.625 ± 0.24
0.0SerXaa: 0.0 ± 0.0
Thr
4.023ThrAla: 4.023 ± 0.339
1.912ThrCys: 1.912 ± 0.262
3.281ThrAsp: 3.281 ± 0.286
3.738ThrGlu: 3.738 ± 0.363
3.595ThrPhe: 3.595 ± 0.339
3.452ThrGly: 3.452 ± 0.297
2.14ThrHis: 2.14 ± 0.221
4.109ThrIle: 4.109 ± 0.312
3.196ThrLys: 3.196 ± 0.319
7.561ThrLeu: 7.561 ± 0.475
1.855ThrMet: 1.855 ± 0.22
2.539ThrAsn: 2.539 ± 0.295
5.906ThrPro: 5.906 ± 0.863
2.853ThrGln: 2.853 ± 0.298
2.682ThrArg: 2.682 ± 0.354
6.933ThrSer: 6.933 ± 0.458
5.592ThrThr: 5.592 ± 0.55
4.993ThrVal: 4.993 ± 0.335
0.856ThrTrp: 0.856 ± 0.158
2.625ThrTyr: 2.625 ± 0.291
0.0ThrXaa: 0.0 ± 0.0
Val
3.88ValAla: 3.88 ± 0.401
2.454ValCys: 2.454 ± 0.243
2.996ValAsp: 2.996 ± 0.281
3.31ValGlu: 3.31 ± 0.235
3.31ValPhe: 3.31 ± 0.289
2.91ValGly: 2.91 ± 0.327
1.484ValHis: 1.484 ± 0.152
2.882ValIle: 2.882 ± 0.263
2.853ValLys: 2.853 ± 0.358
5.763ValLeu: 5.763 ± 0.42
1.512ValMet: 1.512 ± 0.241
2.739ValAsn: 2.739 ± 0.306
3.966ValPro: 3.966 ± 0.371
2.197ValGln: 2.197 ± 0.251
2.482ValArg: 2.482 ± 0.312
5.221ValSer: 5.221 ± 0.33
4.508ValThr: 4.508 ± 0.337
3.595ValVal: 3.595 ± 0.302
0.599ValTrp: 0.599 ± 0.164
2.967ValTyr: 2.967 ± 0.299
0.0ValXaa: 0.0 ± 0.0
Trp
0.542TrpAla: 0.542 ± 0.123
0.228TrpCys: 0.228 ± 0.08
0.428TrpAsp: 0.428 ± 0.125
0.285TrpGlu: 0.285 ± 0.102
0.514TrpPhe: 0.514 ± 0.133
0.485TrpGly: 0.485 ± 0.111
0.342TrpHis: 0.342 ± 0.114
0.685TrpIle: 0.685 ± 0.139
0.77TrpLys: 0.77 ± 0.134
1.113TrpLeu: 1.113 ± 0.153
0.314TrpMet: 0.314 ± 0.089
0.542TrpAsn: 0.542 ± 0.102
0.742TrpPro: 0.742 ± 0.156
0.656TrpGln: 0.656 ± 0.139
0.656TrpArg: 0.656 ± 0.142
0.77TrpSer: 0.77 ± 0.147
0.856TrpThr: 0.856 ± 0.213
0.656TrpVal: 0.656 ± 0.148
0.057TrpTrp: 0.057 ± 0.036
0.314TrpTyr: 0.314 ± 0.109
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.111TyrAla: 2.111 ± 0.249
0.685TyrCys: 0.685 ± 0.157
1.512TyrAsp: 1.512 ± 0.244
1.312TyrGlu: 1.312 ± 0.182
1.712TyrPhe: 1.712 ± 0.257
1.997TyrGly: 1.997 ± 0.242
0.856TyrHis: 0.856 ± 0.147
2.482TyrIle: 2.482 ± 0.339
1.769TyrLys: 1.769 ± 0.218
2.939TyrLeu: 2.939 ± 0.307
0.942TyrMet: 0.942 ± 0.194
2.083TyrAsn: 2.083 ± 0.285
1.626TyrPro: 1.626 ± 0.253
0.999TyrGln: 0.999 ± 0.166
1.17TyrArg: 1.17 ± 0.216
3.224TyrSer: 3.224 ± 0.317
2.197TyrThr: 2.197 ± 0.247
2.996TyrVal: 2.996 ± 0.348
0.542TyrTrp: 0.542 ± 0.107
1.227TyrTyr: 1.227 ± 0.185
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 74 proteins (35050 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski