Amino acid dipeptide frequency for Murid herpesvirus 4 (MuHV-4) (Murine gammaherpesvirus 68)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
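The counting described above can be sketched in Python. This is a minimal illustration, not the Proteome-pI implementation. The function name `dipeptide_permille`, the use of overlapping windows that never span two proteins, and the normalization by the total number of counted dipeptides are all assumptions made for the example. The toy sequences are not MuHV-4 data.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues (one-letter codes)
ALPHABET = STANDARD + "X"          # X stands in for Xaa (non-standard or unknown)


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: dipeptides are overlapping windows of length 2 counted
    inside each protein, normalized by the total number of such windows.
    """
    counts = Counter()
    for seq in proteins:
        # Map every residue outside the 20 standard ones to X.
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


n_standard = len(STANDARD) ** 2        # 400 possible dipeptides
n_with_xaa = len(ALPHABET) ** 2        # 441 when Xaa is included
expected_percent = 100.0 / n_standard  # 0.25 % under a uniform model

# Toy input only: "B" is not a standard residue, so it is counted as X.
toy = dipeptide_permille(["MLLA", "ALLB"])
```

In the toy input there are six dipeptides in total (ML, LL, LA, AL, LL, LX), so `toy["LL"]` is two sixths of 1000‰.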

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
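As a small worked example of the unit conversion, the sketch below converts two values from the table (LeuLeu and TrpTrp) from per mille to fractions and percent, and compares them with the uniform expectation of 0.25%, which equals 2.5‰. The helper names are hypothetical. Comparing against the plain uniform expectation, without taking the error estimate into account, is an assumption and may differ from how the page colours its cells.

```python
UNIFORM_PERMILLE = 1000.0 / 400  # 0.25 % expressed in per mille, i.e. 2.5


def permille_to_fraction(value_permille):
    """Multiply by 10^-3, as the page instructs."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """One percent is ten per mille."""
    return value_permille / 10.0


def representation(value_permille, expected=UNIFORM_PERMILLE):
    """Classify a value relative to the uniform expectation (assumed rule)."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "as expected"


leu_leu = 11.508  # LeuLeu value from the table, in per mille
trp_trp = 0.057   # TrpTrp value from the table, in per mille

leu_leu_fraction = permille_to_fraction(leu_leu)  # about 0.011508
leu_leu_percent = permille_to_percent(leu_leu)    # about 1.1508 %
leu_leu_status = representation(leu_leu)          # "over"
trp_trp_status = representation(trp_trp)          # "under"
```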

For more information, see the sequence space article on Wikipedia.

Rows: first residue. Columns (second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 4.797 ± 0.373
AlaCys: 1.542 ± 0.181
AlaAsp: 2.627 ± 0.282
AlaGlu: 3.055 ± 0.294
AlaPhe: 2.256 ± 0.277
AlaGly: 3.227 ± 0.387
AlaHis: 1.656 ± 0.295
AlaIle: 3.712 ± 0.278
AlaLys: 2.399 ± 0.3
AlaLeu: 5.597 ± 0.465
AlaMet: 1.97 ± 0.218
AlaAsn: 2.484 ± 0.291
AlaPro: 3.455 ± 0.368
AlaGln: 2.17 ± 0.231
AlaArg: 2.227 ± 0.248
AlaSer: 4.883 ± 0.387
AlaThr: 4.683 ± 0.415
AlaVal: 4.112 ± 0.383
AlaTrp: 0.8 ± 0.16
AlaTyr: 2.17 ± 0.257
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 1.628 ± 0.279
CysCys: 0.4 ± 0.112
CysAsp: 1.314 ± 0.207
CysGlu: 1.371 ± 0.223
CysPhe: 1.428 ± 0.211
CysGly: 1.713 ± 0.246
CysHis: 0.828 ± 0.167
CysIle: 1.399 ± 0.198
CysLys: 1.314 ± 0.183
CysLeu: 3.255 ± 0.329
CysMet: 0.514 ± 0.118
CysAsn: 1.142 ± 0.215
CysPro: 1.285 ± 0.207
CysGln: 1.085 ± 0.199
CysArg: 0.885 ± 0.191
CysSer: 1.97 ± 0.245
CysThr: 1.571 ± 0.305
CysVal: 1.913 ± 0.23
CysTrp: 0.257 ± 0.094
CysTyr: 0.857 ± 0.143
CysXaa: 0.0 ± 0.0

Asp
AspAla: 2.741 ± 0.273
AspCys: 0.999 ± 0.184
AspAsp: 2.456 ± 0.328
AspGlu: 2.599 ± 0.27
AspPhe: 2.256 ± 0.26
AspGly: 2.142 ± 0.289
AspHis: 1.171 ± 0.204
AspIle: 3.627 ± 0.348
AspLys: 2.027 ± 0.293
AspLeu: 5.197 ± 0.414
AspMet: 1.456 ± 0.204
AspAsn: 2.085 ± 0.281
AspPro: 3.655 ± 0.404
AspGln: 1.428 ± 0.178
AspArg: 1.77 ± 0.236
AspSer: 3.998 ± 0.395
AspThr: 3.198 ± 0.319
AspVal: 3.37 ± 0.322
AspTrp: 0.428 ± 0.111
AspTyr: 1.799 ± 0.204
AspXaa: 0.0 ± 0.0

Glu
GluAla: 3.113 ± 0.315
GluCys: 1.371 ± 0.217
GluAsp: 2.913 ± 0.307
GluGlu: 2.998 ± 0.341
GluPhe: 1.828 ± 0.258
GluGly: 2.284 ± 0.23
GluHis: 1.371 ± 0.247
GluIle: 3.541 ± 0.392
GluLys: 2.913 ± 0.295
GluLeu: 5.083 ± 0.45
GluMet: 1.428 ± 0.179
GluAsn: 3.084 ± 0.342
GluPro: 2.456 ± 0.484
GluGln: 1.856 ± 0.169
GluArg: 1.942 ± 0.233
GluSer: 4.426 ± 0.437
GluThr: 4.255 ± 0.369
GluVal: 3.084 ± 0.303
GluTrp: 0.457 ± 0.089
GluTyr: 1.571 ± 0.223
GluXaa: 0.0 ± 0.0

Phe
PheAla: 1.942 ± 0.224
PheCys: 1.485 ± 0.237
PheAsp: 2.284 ± 0.251
PheGlu: 1.513 ± 0.205
PhePhe: 2.227 ± 0.278
PheGly: 2.027 ± 0.227
PheHis: 1.199 ± 0.147
PheIle: 2.97 ± 0.329
PheLys: 2.827 ± 0.298
PheLeu: 5.797 ± 0.487
PheMet: 1.085 ± 0.191
PheAsn: 2.427 ± 0.246
PhePro: 2.798 ± 0.328
PheGln: 1.799 ± 0.212
PheArg: 1.599 ± 0.194
PheSer: 3.312 ± 0.316
PheThr: 2.798 ± 0.288
PheVal: 2.713 ± 0.295
PheTrp: 0.371 ± 0.109
PheTyr: 1.999 ± 0.246
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 3.512 ± 0.311
GlyCys: 1.171 ± 0.225
GlyAsp: 2.17 ± 0.226
GlyGlu: 2.599 ± 0.226
GlyPhe: 2.599 ± 0.304
GlyGly: 2.998 ± 0.315
GlyHis: 1.713 ± 0.249
GlyIle: 2.856 ± 0.33
GlyLys: 2.856 ± 0.323
GlyLeu: 5.169 ± 0.39
GlyMet: 1.171 ± 0.124
GlyAsn: 2.056 ± 0.241
GlyPro: 2.741 ± 0.253
GlyGln: 2.77 ± 0.304
GlyArg: 2.684 ± 0.264
GlySer: 3.684 ± 0.367
GlyThr: 3.255 ± 0.304
GlyVal: 2.741 ± 0.271
GlyTrp: 0.543 ± 0.127
GlyTyr: 1.571 ± 0.256
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.685 ± 0.196
HisCys: 0.685 ± 0.126
HisAsp: 1.314 ± 0.166
HisGlu: 1.599 ± 0.163
HisPhe: 1.399 ± 0.193
HisGly: 1.799 ± 0.201
HisHis: 0.914 ± 0.166
HisIle: 1.713 ± 0.177
HisLys: 1.285 ± 0.204
HisLeu: 3.398 ± 0.334
HisMet: 0.6 ± 0.127
HisAsn: 1.485 ± 0.181
HisPro: 1.77 ± 0.211
HisGln: 1.199 ± 0.176
HisArg: 1.085 ± 0.193
HisSer: 1.828 ± 0.239
HisThr: 1.628 ± 0.203
HisVal: 1.999 ± 0.24
HisTrp: 0.228 ± 0.089
HisTyr: 0.8 ± 0.18
HisXaa: 0.0 ± 0.0

Ile
IleAla: 2.37 ± 0.295
IleCys: 1.571 ± 0.265
IleAsp: 2.856 ± 0.31
IleGlu: 2.741 ± 0.303
IlePhe: 2.998 ± 0.351
IleGly: 1.913 ± 0.246
IleHis: 1.571 ± 0.204
IleIle: 4.141 ± 0.441
IleLys: 3.455 ± 0.459
IleLeu: 6.625 ± 0.472
IleMet: 1.314 ± 0.175
IleAsn: 3.198 ± 0.231
IlePro: 3.598 ± 0.288
IleGln: 2.342 ± 0.34
IleArg: 2.884 ± 0.277
IleSer: 5.254 ± 0.454
IleThr: 4.483 ± 0.383
IleVal: 3.398 ± 0.323
IleTrp: 0.771 ± 0.16
IleTyr: 2.684 ± 0.303
IleXaa: 0.0 ± 0.0

Lys
LysAla: 2.913 ± 0.368
LysCys: 1.228 ± 0.185
LysAsp: 2.427 ± 0.314
LysGlu: 2.656 ± 0.266
LysPhe: 2.256 ± 0.253
LysGly: 1.828 ± 0.265
LysHis: 1.799 ± 0.22
LysIle: 3.427 ± 0.368
LysLys: 3.569 ± 0.406
LysLeu: 5.454 ± 0.442
LysMet: 1.028 ± 0.177
LysAsn: 2.741 ± 0.335
LysPro: 2.313 ± 0.264
LysGln: 1.942 ± 0.273
LysArg: 2.941 ± 0.346
LysSer: 3.512 ± 0.329
LysThr: 4.255 ± 0.388
LysVal: 2.713 ± 0.327
LysTrp: 0.343 ± 0.092
LysTyr: 2.199 ± 0.248
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 6.625 ± 0.514
LeuCys: 2.97 ± 0.407
LeuAsp: 5.683 ± 0.371
LeuGlu: 5.74 ± 0.424
LeuPhe: 4.94 ± 0.353
LeuGly: 5.197 ± 0.424
LeuHis: 3.055 ± 0.306
LeuIle: 5.711 ± 0.331
LeuLys: 5.825 ± 0.495
LeuLeu: 11.508 ± 0.647
LeuMet: 2.656 ± 0.283
LeuAsn: 4.226 ± 0.389
LeuPro: 6.539 ± 0.425
LeuGln: 3.998 ± 0.39
LeuArg: 3.912 ± 0.334
LeuSer: 8.824 ± 0.494
LeuThr: 7.996 ± 0.558
LeuVal: 5.683 ± 0.328
LeuTrp: 0.8 ± 0.148
LeuTyr: 3.398 ± 0.325
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.284 ± 0.237
MetCys: 0.942 ± 0.195
MetAsp: 1.342 ± 0.188
MetGlu: 1.456 ± 0.213
MetPhe: 1.542 ± 0.244
MetGly: 1.256 ± 0.18
MetHis: 0.571 ± 0.128
MetIle: 1.057 ± 0.23
MetLys: 0.771 ± 0.162
MetLeu: 2.656 ± 0.289
MetMet: 0.914 ± 0.159
MetAsn: 0.657 ± 0.134
MetPro: 1.028 ± 0.16
MetGln: 0.685 ± 0.161
MetArg: 1.142 ± 0.193
MetSer: 2.227 ± 0.219
MetThr: 1.399 ± 0.159
MetVal: 1.685 ± 0.182
MetTrp: 0.314 ± 0.093
MetTyr: 0.942 ± 0.15
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 1.799 ± 0.223
AsnCys: 0.971 ± 0.184
AsnAsp: 1.542 ± 0.21
AsnGlu: 1.428 ± 0.225
AsnPhe: 2.113 ± 0.245
AsnGly: 2.17 ± 0.332
AsnHis: 1.171 ± 0.169
AsnIle: 3.627 ± 0.386
AsnLys: 2.998 ± 0.344
AsnLeu: 5.197 ± 0.395
AsnMet: 1.199 ± 0.208
AsnAsn: 2.113 ± 0.267
AsnPro: 2.513 ± 0.264
AsnGln: 1.314 ± 0.235
AsnArg: 1.828 ± 0.278
AsnSer: 3.884 ± 0.353
AsnThr: 3.255 ± 0.368
AsnVal: 2.884 ± 0.275
AsnTrp: 0.571 ± 0.145
AsnTyr: 1.599 ± 0.213
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 4.141 ± 0.591
ProCys: 1.513 ± 0.192
ProAsp: 2.798 ± 0.31
ProGlu: 3.826 ± 0.488
ProPhe: 1.742 ± 0.186
ProGly: 4.226 ± 0.458
ProHis: 1.399 ± 0.189
ProIle: 3.427 ± 0.36
ProLys: 2.256 ± 0.248
ProLeu: 5.854 ± 0.526
ProMet: 1.285 ± 0.18
ProAsn: 1.713 ± 0.222
ProPro: 5.568 ± 0.96
ProGln: 2.627 ± 0.329
ProArg: 2.342 ± 0.329
ProSer: 5.34 ± 0.496
ProThr: 5.254 ± 0.781
ProVal: 4.54 ± 0.361
ProTrp: 0.8 ± 0.157
ProTyr: 1.399 ± 0.193
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 2.027 ± 0.272
GlnCys: 0.999 ± 0.182
GlnAsp: 1.856 ± 0.242
GlnGlu: 2.57 ± 0.266
GlnPhe: 1.713 ± 0.164
GlnGly: 2.085 ± 0.246
GlnHis: 1.228 ± 0.158
GlnIle: 2.113 ± 0.214
GlnLys: 2.284 ± 0.246
GlnLeu: 3.912 ± 0.459
GlnMet: 0.885 ± 0.172
GlnAsn: 1.399 ± 0.176
GlnPro: 2.199 ± 0.218
GlnGln: 1.856 ± 0.208
GlnArg: 1.628 ± 0.224
GlnSer: 2.998 ± 0.307
GlnThr: 2.399 ± 0.191
GlnVal: 2.142 ± 0.266
GlnTrp: 0.543 ± 0.106
GlnTyr: 1.142 ± 0.214
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 3.027 ± 0.309
ArgCys: 1.171 ± 0.226
ArgAsp: 2.17 ± 0.244
ArgGlu: 2.627 ± 0.263
ArgPhe: 1.485 ± 0.204
ArgGly: 2.827 ± 0.358
ArgHis: 1.485 ± 0.212
ArgIle: 2.313 ± 0.267
ArgLys: 1.828 ± 0.262
ArgLeu: 4.655 ± 0.349
ArgMet: 0.657 ± 0.167
ArgAsn: 1.828 ± 0.236
ArgPro: 2.998 ± 0.462
ArgGln: 1.428 ± 0.244
ArgArg: 2.884 ± 0.422
ArgSer: 2.541 ± 0.29
ArgThr: 2.284 ± 0.285
ArgVal: 2.399 ± 0.301
ArgTrp: 0.371 ± 0.093
ArgTyr: 1.114 ± 0.164
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 4.512 ± 0.35
SerCys: 1.97 ± 0.275
SerAsp: 4.369 ± 0.421
SerGlu: 3.998 ± 0.322
SerPhe: 3.455 ± 0.373
SerGly: 4.512 ± 0.361
SerHis: 2.342 ± 0.278
SerIle: 4.369 ± 0.312
SerLys: 4.283 ± 0.344
SerLeu: 8.424 ± 0.644
SerMet: 2.256 ± 0.248
SerAsn: 3.684 ± 0.3
SerPro: 5.511 ± 0.622
SerGln: 2.97 ± 0.224
SerArg: 3.141 ± 0.348
SerSer: 7.567 ± 0.678
SerThr: 5.997 ± 0.436
SerVal: 5.112 ± 0.409
SerTrp: 0.885 ± 0.146
SerTyr: 2.627 ± 0.268
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 4.112 ± 0.387
ThrCys: 2.027 ± 0.273
ThrAsp: 3.398 ± 0.273
ThrGlu: 3.826 ± 0.328
ThrPhe: 3.427 ± 0.351
ThrGly: 3.284 ± 0.306
ThrHis: 2.17 ± 0.233
ThrIle: 3.998 ± 0.41
ThrLys: 3.141 ± 0.325
ThrLeu: 7.168 ± 0.434
ThrMet: 1.656 ± 0.196
ThrAsn: 2.513 ± 0.307
ThrPro: 5.054 ± 0.801
ThrGln: 2.599 ± 0.207
ThrArg: 2.713 ± 0.257
ThrSer: 6.596 ± 0.469
ThrThr: 4.826 ± 0.492
ThrVal: 5.226 ± 0.393
ThrTrp: 0.828 ± 0.14
ThrTyr: 2.456 ± 0.321
ThrXaa: 0.0 ± 0.0

Val
ValAla: 3.998 ± 0.466
ValCys: 2.227 ± 0.226
ValAsp: 2.97 ± 0.279
ValGlu: 3.627 ± 0.299
ValPhe: 3.227 ± 0.306
ValGly: 2.913 ± 0.285
ValHis: 1.628 ± 0.192
ValIle: 2.913 ± 0.321
ValLys: 2.941 ± 0.339
ValLeu: 6.311 ± 0.506
ValMet: 1.428 ± 0.207
ValAsn: 2.856 ± 0.351
ValPro: 4.398 ± 0.416
ValGln: 2.17 ± 0.22
ValArg: 2.541 ± 0.274
ValSer: 5.283 ± 0.361
ValThr: 4.598 ± 0.371
ValVal: 3.455 ± 0.328
ValTrp: 0.714 ± 0.159
ValTyr: 2.684 ± 0.292
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.6 ± 0.144
TrpCys: 0.228 ± 0.082
TrpAsp: 0.371 ± 0.121
TrpGlu: 0.314 ± 0.104
TrpPhe: 0.514 ± 0.121
TrpGly: 0.514 ± 0.108
TrpHis: 0.314 ± 0.115
TrpIle: 0.657 ± 0.147
TrpLys: 0.8 ± 0.139
TrpLeu: 1.085 ± 0.171
TrpMet: 0.343 ± 0.098
TrpAsn: 0.514 ± 0.104
TrpPro: 0.6 ± 0.13
TrpGln: 0.628 ± 0.149
TrpArg: 0.543 ± 0.155
TrpSer: 0.714 ± 0.144
TrpThr: 0.828 ± 0.185
TrpVal: 0.685 ± 0.13
TrpTrp: 0.057 ± 0.038
TrpTyr: 0.286 ± 0.098
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 1.999 ± 0.267
TyrCys: 0.657 ± 0.145
TyrAsp: 1.542 ± 0.26
TyrGlu: 1.428 ± 0.223
TyrPhe: 1.77 ± 0.245
TyrGly: 1.942 ± 0.244
TyrHis: 0.828 ± 0.195
TyrIle: 2.599 ± 0.289
TyrLys: 1.799 ± 0.269
TyrLeu: 2.941 ± 0.307
TyrMet: 0.999 ± 0.179
TyrAsn: 1.97 ± 0.323
TyrPro: 1.571 ± 0.239
TyrGln: 1.114 ± 0.17
TyrArg: 1.399 ± 0.195
TyrSer: 3.141 ± 0.363
TyrThr: 2.056 ± 0.233
TyrVal: 3.027 ± 0.286
TyrTrp: 0.514 ± 0.119
TyrTyr: 1.285 ± 0.203
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 73 proteins (35020 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
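A protein-level bootstrap of this kind can be sketched as follows. Whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the 100 estimates is reported. This is only an illustration under stated assumptions: the function names are hypothetical, the sample standard deviation is assumed as the error measure (the page does not say which statistic is used), and the toy sequences are not MuHV-4 data.

```python
import random
from statistics import stdev


def pair_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide over overlapping windows."""
    hits = sum(
        1
        for seq in proteins
        for i in range(len(seq) - 1)
        if seq[i:i + 2] == pair
    )
    total = sum(max(len(seq) - 1, 0) for seq in proteins)
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Resample whole proteins with replacement and return the spread.

    Assumption: the error is the sample standard deviation of the
    resampled frequencies.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = [
        pair_permille(rng.choices(proteins, k=len(proteins)), pair)
        for _ in range(n_resamples)
    ]
    return stdev(estimates)


# Toy proteome, one-letter codes; "LL" corresponds to LeuLeu.
toy_proteins = ["MLLA", "LLLL", "AGAG", "GGLL"]
toy_error = bootstrap_error(toy_proteins, "LL")
```

Resampling at the protein level, rather than shuffling individual residues, keeps each sequence intact, so the estimate reflects how much the frequency depends on which proteins happen to be in the proteome.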

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
This proteome is also available in UniProt.
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski