Amino acid dipepetide frequency for Murine roseolovirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.955AlaAla: 2.955 ± 0.595
0.892AlaCys: 0.892 ± 0.146
1.864AlaAsp: 1.864 ± 0.218
2.142AlaGlu: 2.142 ± 0.261
1.745AlaPhe: 1.745 ± 0.196
1.646AlaGly: 1.646 ± 0.31
0.813AlaHis: 0.813 ± 0.164
2.856AlaIle: 2.856 ± 0.279
2.459AlaLys: 2.459 ± 0.213
3.768AlaLeu: 3.768 ± 0.291
0.873AlaMet: 0.873 ± 0.127
1.725AlaAsn: 1.725 ± 0.189
2.816AlaPro: 2.816 ± 0.553
1.388AlaGln: 1.388 ± 0.235
1.825AlaArg: 1.825 ± 0.32
3.253AlaSer: 3.253 ± 0.328
2.638AlaThr: 2.638 ± 0.29
2.638AlaVal: 2.638 ± 0.204
0.476AlaTrp: 0.476 ± 0.11
1.349AlaTyr: 1.349 ± 0.14
0.0AlaXaa: 0.0 ± 0.0
Cys
0.932CysAla: 0.932 ± 0.153
0.773CysCys: 0.773 ± 0.147
1.329CysAsp: 1.329 ± 0.144
1.011CysGlu: 1.011 ± 0.15
1.388CysPhe: 1.388 ± 0.164
1.21CysGly: 1.21 ± 0.144
0.674CysHis: 0.674 ± 0.106
2.122CysIle: 2.122 ± 0.198
1.388CysLys: 1.388 ± 0.195
2.34CysLeu: 2.34 ± 0.246
0.674CysMet: 0.674 ± 0.102
1.19CysAsn: 1.19 ± 0.173
1.111CysPro: 1.111 ± 0.159
0.654CysGln: 0.654 ± 0.102
0.972CysArg: 0.972 ± 0.174
1.805CysSer: 1.805 ± 0.186
1.011CysThr: 1.011 ± 0.166
1.487CysVal: 1.487 ± 0.179
0.119CysTrp: 0.119 ± 0.049
0.833CysTyr: 0.833 ± 0.157
0.0CysXaa: 0.0 ± 0.0
Asp
1.884AspAla: 1.884 ± 0.198
0.892AspCys: 0.892 ± 0.134
3.431AspAsp: 3.431 ± 0.367
4.066AspGlu: 4.066 ± 0.367
3.173AspPhe: 3.173 ± 0.267
2.003AspGly: 2.003 ± 0.352
1.21AspHis: 1.21 ± 0.14
5.275AspIle: 5.275 ± 0.377
3.193AspLys: 3.193 ± 0.272
5.553AspLeu: 5.553 ± 0.949
1.23AspMet: 1.23 ± 0.173
4.006AspAsn: 4.006 ± 0.363
2.876AspPro: 2.876 ± 0.264
1.924AspGln: 1.924 ± 0.255
1.606AspArg: 1.606 ± 0.222
4.284AspSer: 4.284 ± 0.383
2.995AspThr: 2.995 ± 0.234
2.816AspVal: 2.816 ± 0.267
0.416AspTrp: 0.416 ± 0.085
2.201AspTyr: 2.201 ± 0.215
0.0AspXaa: 0.0 ± 0.0
Glu
1.725GluAla: 1.725 ± 0.237
1.13GluCys: 1.13 ± 0.176
3.411GluAsp: 3.411 ± 0.325
4.165GluGlu: 4.165 ± 0.403
2.638GluPhe: 2.638 ± 0.223
1.864GluGly: 1.864 ± 0.296
1.329GluHis: 1.329 ± 0.166
5.176GluIle: 5.176 ± 0.408
4.443GluLys: 4.443 ± 0.339
4.998GluLeu: 4.998 ± 0.281
1.349GluMet: 1.349 ± 0.144
4.343GluAsn: 4.343 ± 0.372
2.677GluPro: 2.677 ± 0.289
1.805GluGln: 1.805 ± 0.199
1.805GluArg: 1.805 ± 0.239
5.057GluSer: 5.057 ± 0.565
3.61GluThr: 3.61 ± 0.281
1.983GluVal: 1.983 ± 0.273
0.496GluTrp: 0.496 ± 0.103
2.479GluTyr: 2.479 ± 0.258
0.0GluXaa: 0.0 ± 0.0
Phe
1.587PheAla: 1.587 ± 0.171
1.587PheCys: 1.587 ± 0.202
2.677PheAsp: 2.677 ± 0.226
2.32PheGlu: 2.32 ± 0.222
3.629PhePhe: 3.629 ± 0.331
2.261PheGly: 2.261 ± 0.249
1.091PheHis: 1.091 ± 0.146
4.799PheIle: 4.799 ± 0.459
4.343PheLys: 4.343 ± 0.262
5.652PheLeu: 5.652 ± 0.457
1.269PheMet: 1.269 ± 0.16
3.709PheAsn: 3.709 ± 0.262
2.42PhePro: 2.42 ± 0.218
1.606PheGln: 1.606 ± 0.209
1.904PheArg: 1.904 ± 0.17
3.828PheSer: 3.828 ± 0.302
3.272PheThr: 3.272 ± 0.282
2.995PheVal: 2.995 ± 0.253
0.317PheTrp: 0.317 ± 0.091
1.904PheTyr: 1.904 ± 0.207
0.0PheXaa: 0.0 ± 0.0
Gly
2.4GlyAla: 2.4 ± 0.542
0.813GlyCys: 0.813 ± 0.144
1.963GlyAsp: 1.963 ± 0.297
2.4GlyGlu: 2.4 ± 0.298
1.884GlyPhe: 1.884 ± 0.235
2.816GlyGly: 2.816 ± 0.567
1.011GlyHis: 1.011 ± 0.181
3.054GlyIle: 3.054 ± 0.271
2.201GlyLys: 2.201 ± 0.212
3.709GlyLeu: 3.709 ± 0.291
1.011GlyMet: 1.011 ± 0.173
2.102GlyAsn: 2.102 ± 0.211
2.082GlyPro: 2.082 ± 0.319
1.646GlyGln: 1.646 ± 0.255
2.479GlyArg: 2.479 ± 0.615
2.915GlySer: 2.915 ± 0.294
2.638GlyThr: 2.638 ± 0.25
2.4GlyVal: 2.4 ± 0.318
0.516GlyTrp: 0.516 ± 0.112
1.507GlyTyr: 1.507 ± 0.2
0.0GlyXaa: 0.0 ± 0.0
His
1.071HisAla: 1.071 ± 0.193
0.377HisCys: 0.377 ± 0.098
1.289HisAsp: 1.289 ± 0.174
1.15HisGlu: 1.15 ± 0.149
1.349HisPhe: 1.349 ± 0.171
1.19HisGly: 1.19 ± 0.192
0.754HisHis: 0.754 ± 0.126
2.122HisIle: 2.122 ± 0.227
1.329HisLys: 1.329 ± 0.238
2.38HisLeu: 2.38 ± 0.247
0.714HisMet: 0.714 ± 0.112
1.21HisAsn: 1.21 ± 0.151
1.428HisPro: 1.428 ± 0.199
0.654HisGln: 0.654 ± 0.127
1.349HisArg: 1.349 ± 0.206
1.587HisSer: 1.587 ± 0.284
1.269HisThr: 1.269 ± 0.18
1.368HisVal: 1.368 ± 0.192
0.099HisTrp: 0.099 ± 0.04
0.972HisTyr: 0.972 ± 0.13
0.0HisXaa: 0.0 ± 0.0
Ile
2.935IleAla: 2.935 ± 0.275
2.122IleCys: 2.122 ± 0.229
4.284IleAsp: 4.284 ± 0.336
4.165IleGlu: 4.165 ± 0.303
5.335IlePhe: 5.335 ± 0.417
3.233IleGly: 3.233 ± 0.334
2.182IleHis: 2.182 ± 0.286
8.151IleIle: 8.151 ± 0.684
6.346IleLys: 6.346 ± 0.47
7.913IleLeu: 7.913 ± 0.624
2.063IleMet: 2.063 ± 0.191
6.703IleAsn: 6.703 ± 0.523
3.907IlePro: 3.907 ± 0.269
3.074IleGln: 3.074 ± 0.301
2.539IleArg: 2.539 ± 0.276
7.1IleSer: 7.1 ± 0.475
5.196IleThr: 5.196 ± 0.432
3.61IleVal: 3.61 ± 0.291
0.615IleTrp: 0.615 ± 0.112
4.641IleTyr: 4.641 ± 0.332
0.0IleXaa: 0.0 ± 0.0
Lys
1.666LysAla: 1.666 ± 0.191
1.983LysCys: 1.983 ± 0.201
4.105LysAsp: 4.105 ± 0.34
4.125LysGlu: 4.125 ± 0.316
4.046LysPhe: 4.046 ± 0.286
1.368LysGly: 1.368 ± 0.159
1.765LysHis: 1.765 ± 0.2
6.941LysIle: 6.941 ± 0.53
6.644LysLys: 6.644 ± 0.448
6.386LysLeu: 6.386 ± 0.372
2.162LysMet: 2.162 ± 0.254
6.108LysAsn: 6.108 ± 0.538
2.082LysPro: 2.082 ± 0.214
2.558LysGln: 2.558 ± 0.251
3.134LysArg: 3.134 ± 0.241
4.998LysSer: 4.998 ± 0.648
4.343LysThr: 4.343 ± 0.306
2.558LysVal: 2.558 ± 0.228
0.535LysTrp: 0.535 ± 0.113
3.748LysTyr: 3.748 ± 0.271
0.0LysXaa: 0.0 ± 0.0
Leu
3.55LeuAla: 3.55 ± 0.245
2.618LeuCys: 2.618 ± 0.245
4.799LeuAsp: 4.799 ± 0.686
4.601LeuGlu: 4.601 ± 0.465
5.037LeuPhe: 5.037 ± 0.404
3.272LeuGly: 3.272 ± 0.284
2.499LeuHis: 2.499 ± 0.197
7.08LeuIle: 7.08 ± 0.473
7.08LeuLys: 7.08 ± 0.449
9.064LeuLeu: 9.064 ± 0.576
1.706LeuMet: 1.706 ± 0.216
5.573LeuAsn: 5.573 ± 0.3
5.573LeuPro: 5.573 ± 0.469
3.649LeuGln: 3.649 ± 0.27
3.094LeuArg: 3.094 ± 0.272
7.536LeuSer: 7.536 ± 0.448
4.641LeuThr: 4.641 ± 0.349
4.086LeuVal: 4.086 ± 0.319
0.912LeuTrp: 0.912 ± 0.149
4.641LeuTyr: 4.641 ± 0.314
0.0LeuXaa: 0.0 ± 0.0
Met
1.13MetAla: 1.13 ± 0.152
0.694MetCys: 0.694 ± 0.113
1.487MetAsp: 1.487 ± 0.169
1.448MetGlu: 1.448 ± 0.16
1.547MetPhe: 1.547 ± 0.189
0.932MetGly: 0.932 ± 0.14
0.416MetHis: 0.416 ± 0.09
1.527MetIle: 1.527 ± 0.198
1.567MetLys: 1.567 ± 0.186
2.261MetLeu: 2.261 ± 0.255
0.555MetMet: 0.555 ± 0.117
1.706MetAsn: 1.706 ± 0.177
0.873MetPro: 0.873 ± 0.126
0.813MetGln: 0.813 ± 0.122
0.615MetArg: 0.615 ± 0.122
2.063MetSer: 2.063 ± 0.18
1.487MetThr: 1.487 ± 0.161
0.952MetVal: 0.952 ± 0.144
0.377MetTrp: 0.377 ± 0.096
1.448MetTyr: 1.448 ± 0.167
0.0MetXaa: 0.0 ± 0.0
Asn
2.182AsnAla: 2.182 ± 0.241
1.309AsnCys: 1.309 ± 0.158
3.828AsnAsp: 3.828 ± 0.249
3.848AsnGlu: 3.848 ± 0.3
2.856AsnPhe: 2.856 ± 0.24
2.717AsnGly: 2.717 ± 0.323
0.972AsnHis: 0.972 ± 0.171
8.171AsnIle: 8.171 ± 0.621
5.057AsnLys: 5.057 ± 0.362
5.632AsnLeu: 5.632 ± 0.337
1.924AsnMet: 1.924 ± 0.214
5.335AsnAsn: 5.335 ± 0.38
2.578AsnPro: 2.578 ± 0.229
2.102AsnGln: 2.102 ± 0.209
2.241AsnArg: 2.241 ± 0.248
4.542AsnSer: 4.542 ± 0.443
4.324AsnThr: 4.324 ± 0.369
4.224AsnVal: 4.224 ± 0.33
0.317AsnTrp: 0.317 ± 0.088
2.995AsnTyr: 2.995 ± 0.331
0.0AsnXaa: 0.0 ± 0.0
Pro
2.459ProAla: 2.459 ± 0.464
0.833ProCys: 0.833 ± 0.16
2.4ProAsp: 2.4 ± 0.288
3.094ProGlu: 3.094 ± 0.343
2.082ProPhe: 2.082 ± 0.171
2.459ProGly: 2.459 ± 0.609
1.368ProHis: 1.368 ± 0.185
3.828ProIle: 3.828 ± 0.26
2.995ProLys: 2.995 ± 0.281
4.72ProLeu: 4.72 ± 0.436
0.932ProMet: 0.932 ± 0.118
2.618ProAsn: 2.618 ± 0.288
4.72ProPro: 4.72 ± 0.923
1.587ProGln: 1.587 ± 0.227
3.094ProArg: 3.094 ± 0.462
3.887ProSer: 3.887 ± 0.514
3.074ProThr: 3.074 ± 0.34
2.618ProVal: 2.618 ± 0.274
0.516ProTrp: 0.516 ± 0.098
1.626ProTyr: 1.626 ± 0.192
0.0ProXaa: 0.0 ± 0.0
Gln
1.19GlnAla: 1.19 ± 0.185
0.635GlnCys: 0.635 ± 0.138
2.241GlnAsp: 2.241 ± 0.295
1.765GlnGlu: 1.765 ± 0.174
2.023GlnPhe: 2.023 ± 0.289
1.23GlnGly: 1.23 ± 0.232
0.892GlnHis: 0.892 ± 0.128
2.638GlnIle: 2.638 ± 0.257
2.677GlnLys: 2.677 ± 0.292
3.015GlnLeu: 3.015 ± 0.244
0.813GlnMet: 0.813 ± 0.129
2.975GlnAsn: 2.975 ± 0.219
1.408GlnPro: 1.408 ± 0.226
1.111GlnGln: 1.111 ± 0.15
1.646GlnArg: 1.646 ± 0.231
2.658GlnSer: 2.658 ± 0.32
2.261GlnThr: 2.261 ± 0.19
1.408GlnVal: 1.408 ± 0.214
0.198GlnTrp: 0.198 ± 0.07
1.289GlnTyr: 1.289 ± 0.153
0.0GlnXaa: 0.0 ± 0.0
Arg
2.063ArgAla: 2.063 ± 0.33
0.654ArgCys: 0.654 ± 0.122
2.182ArgAsp: 2.182 ± 0.222
2.003ArgGlu: 2.003 ± 0.246
1.924ArgPhe: 1.924 ± 0.2
2.638ArgGly: 2.638 ± 0.736
1.17ArgHis: 1.17 ± 0.189
2.757ArgIle: 2.757 ± 0.213
2.519ArgLys: 2.519 ± 0.215
3.471ArgLeu: 3.471 ± 0.279
1.071ArgMet: 1.071 ± 0.141
1.864ArgAsn: 1.864 ± 0.235
2.717ArgPro: 2.717 ± 0.423
1.547ArgGln: 1.547 ± 0.191
2.638ArgArg: 2.638 ± 0.411
3.828ArgSer: 3.828 ± 0.933
2.261ArgThr: 2.261 ± 0.284
1.924ArgVal: 1.924 ± 0.238
0.337ArgTrp: 0.337 ± 0.075
1.507ArgTyr: 1.507 ± 0.194
0.0ArgXaa: 0.0 ± 0.0
Ser
3.153SerAla: 3.153 ± 0.295
1.805SerCys: 1.805 ± 0.197
4.76SerAsp: 4.76 ± 0.472
5.156SerGlu: 5.156 ± 0.481
3.867SerPhe: 3.867 ± 0.287
3.61SerGly: 3.61 ± 0.374
1.924SerHis: 1.924 ± 0.328
6.346SerIle: 6.346 ± 0.482
5.712SerLys: 5.712 ± 0.71
6.703SerLeu: 6.703 ± 0.357
2.102SerMet: 2.102 ± 0.185
4.601SerAsn: 4.601 ± 0.5
3.689SerPro: 3.689 ± 0.493
2.836SerGln: 2.836 ± 0.383
3.689SerArg: 3.689 ± 0.965
8.211SerSer: 8.211 ± 1.448
4.006SerThr: 4.006 ± 0.321
4.819SerVal: 4.819 ± 0.419
0.635SerTrp: 0.635 ± 0.098
2.499SerTyr: 2.499 ± 0.234
0.0SerXaa: 0.0 ± 0.0
Thr
3.153ThrAla: 3.153 ± 0.314
1.309ThrCys: 1.309 ± 0.167
3.59ThrAsp: 3.59 ± 0.301
3.649ThrGlu: 3.649 ± 0.258
3.094ThrPhe: 3.094 ± 0.258
2.479ThrGly: 2.479 ± 0.242
1.626ThrHis: 1.626 ± 0.15
4.522ThrIle: 4.522 ± 0.353
4.462ThrLys: 4.462 ± 0.323
4.403ThrLeu: 4.403 ± 0.317
1.19ThrMet: 1.19 ± 0.187
3.629ThrAsn: 3.629 ± 0.347
3.034ThrPro: 3.034 ± 0.327
2.241ThrGln: 2.241 ± 0.225
2.162ThrArg: 2.162 ± 0.233
4.423ThrSer: 4.423 ± 0.341
3.907ThrThr: 3.907 ± 0.38
3.629ThrVal: 3.629 ± 0.287
0.456ThrTrp: 0.456 ± 0.085
2.499ThrTyr: 2.499 ± 0.24
0.0ThrXaa: 0.0 ± 0.0
Val
2.261ValAla: 2.261 ± 0.274
1.349ValCys: 1.349 ± 0.169
2.439ValAsp: 2.439 ± 0.259
2.856ValGlu: 2.856 ± 0.273
2.876ValPhe: 2.876 ± 0.239
2.182ValGly: 2.182 ± 0.246
1.091ValHis: 1.091 ± 0.142
3.709ValIle: 3.709 ± 0.315
2.777ValLys: 2.777 ± 0.244
4.601ValLeu: 4.601 ± 0.29
0.992ValMet: 0.992 ± 0.141
3.649ValAsn: 3.649 ± 0.29
2.856ValPro: 2.856 ± 0.336
1.289ValGln: 1.289 ± 0.154
2.301ValArg: 2.301 ± 0.257
4.224ValSer: 4.224 ± 0.434
3.689ValThr: 3.689 ± 0.339
2.34ValVal: 2.34 ± 0.246
0.555ValTrp: 0.555 ± 0.107
2.499ValTyr: 2.499 ± 0.262
0.0ValXaa: 0.0 ± 0.0
Trp
0.297TrpAla: 0.297 ± 0.071
0.337TrpCys: 0.337 ± 0.06
0.476TrpAsp: 0.476 ± 0.104
0.357TrpGlu: 0.357 ± 0.075
0.416TrpPhe: 0.416 ± 0.084
0.317TrpGly: 0.317 ± 0.077
0.198TrpHis: 0.198 ± 0.055
0.555TrpIle: 0.555 ± 0.082
0.535TrpLys: 0.535 ± 0.089
0.754TrpLeu: 0.754 ± 0.124
0.238TrpMet: 0.238 ± 0.078
0.516TrpAsn: 0.516 ± 0.08
0.555TrpPro: 0.555 ± 0.144
0.357TrpGln: 0.357 ± 0.086
0.377TrpArg: 0.377 ± 0.102
0.595TrpSer: 0.595 ± 0.1
0.397TrpThr: 0.397 ± 0.093
0.575TrpVal: 0.575 ± 0.122
0.04TrpTrp: 0.04 ± 0.031
0.476TrpTyr: 0.476 ± 0.108
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.527TyrAla: 1.527 ± 0.161
0.912TyrCys: 0.912 ± 0.146
2.479TyrAsp: 2.479 ± 0.228
2.162TyrGlu: 2.162 ± 0.199
2.261TyrPhe: 2.261 ± 0.245
2.122TyrGly: 2.122 ± 0.238
0.674TyrHis: 0.674 ± 0.115
4.482TyrIle: 4.482 ± 0.386
3.53TyrLys: 3.53 ± 0.313
3.689TyrLeu: 3.689 ± 0.289
1.011TyrMet: 1.011 ± 0.142
3.59TyrAsn: 3.59 ± 0.324
1.507TyrPro: 1.507 ± 0.143
1.23TyrGln: 1.23 ± 0.218
1.606TyrArg: 1.606 ± 0.175
3.332TyrSer: 3.332 ± 0.266
2.459TyrThr: 2.459 ± 0.267
2.102TyrVal: 2.102 ± 0.221
0.436TyrTrp: 0.436 ± 0.091
1.864TyrTyr: 1.864 ± 0.187
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 128 proteins (50423 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski