Amino acid dipepetide frequency for Musca hytrovirus(isolate Musca domestica/United States/Boucias/-) (MHV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.913AlaAla: 3.913 ± 0.396
0.932AlaCys: 0.932 ± 0.153
3.274AlaAsp: 3.274 ± 0.257
2.289AlaGlu: 2.289 ± 0.213
2.209AlaPhe: 2.209 ± 0.198
1.81AlaGly: 1.81 ± 0.241
0.905AlaHis: 0.905 ± 0.181
3.726AlaIle: 3.726 ± 0.291
2.875AlaLys: 2.875 ± 0.217
3.993AlaLeu: 3.993 ± 0.34
1.331AlaMet: 1.331 ± 0.184
3.407AlaAsn: 3.407 ± 0.342
2.396AlaPro: 2.396 ± 0.35
1.757AlaGln: 1.757 ± 0.18
2.688AlaArg: 2.688 ± 0.271
3.514AlaSer: 3.514 ± 0.275
3.673AlaThr: 3.673 ± 0.288
2.955AlaVal: 2.955 ± 0.299
0.186AlaTrp: 0.186 ± 0.056
1.783AlaTyr: 1.783 ± 0.249
0.0AlaXaa: 0.0 ± 0.0
Cys
0.878CysAla: 0.878 ± 0.18
0.426CysCys: 0.426 ± 0.103
1.331CysAsp: 1.331 ± 0.19
1.011CysGlu: 1.011 ± 0.144
0.719CysPhe: 0.719 ± 0.138
1.038CysGly: 1.038 ± 0.177
0.559CysHis: 0.559 ± 0.116
1.73CysIle: 1.73 ± 0.23
0.905CysLys: 0.905 ± 0.169
1.89CysLeu: 1.89 ± 0.232
0.612CysMet: 0.612 ± 0.114
1.091CysAsn: 1.091 ± 0.205
0.985CysPro: 0.985 ± 0.189
0.799CysGln: 0.799 ± 0.158
1.437CysArg: 1.437 ± 0.198
1.517CysSer: 1.517 ± 0.173
1.517CysThr: 1.517 ± 0.227
1.358CysVal: 1.358 ± 0.184
0.0CysTrp: 0.0 ± 0.0
0.559CysTyr: 0.559 ± 0.123
0.0CysXaa: 0.0 ± 0.0
Asp
2.715AspAla: 2.715 ± 0.219
0.905AspCys: 0.905 ± 0.137
7.826AspAsp: 7.826 ± 0.81
5.27AspGlu: 5.27 ± 0.431
2.715AspPhe: 2.715 ± 0.234
3.726AspGly: 3.726 ± 0.338
0.559AspHis: 0.559 ± 0.138
4.871AspIle: 4.871 ± 0.368
3.088AspLys: 3.088 ± 0.383
4.871AspLeu: 4.871 ± 0.284
1.73AspMet: 1.73 ± 0.163
5.004AspAsn: 5.004 ± 0.405
1.57AspPro: 1.57 ± 0.201
1.704AspGln: 1.704 ± 0.34
2.901AspArg: 2.901 ± 0.308
3.62AspSer: 3.62 ± 0.281
4.312AspThr: 4.312 ± 0.333
3.567AspVal: 3.567 ± 0.298
0.479AspTrp: 0.479 ± 0.101
2.768AspTyr: 2.768 ± 0.239
0.0AspXaa: 0.0 ± 0.0
Glu
2.609GluAla: 2.609 ± 0.281
0.799GluCys: 0.799 ± 0.165
3.141GluAsp: 3.141 ± 0.339
3.247GluGlu: 3.247 ± 0.404
2.662GluPhe: 2.662 ± 0.294
1.757GluGly: 1.757 ± 0.271
1.597GluHis: 1.597 ± 0.188
3.247GluIle: 3.247 ± 0.244
2.795GluLys: 2.795 ± 0.24
5.164GluLeu: 5.164 ± 0.418
1.757GluMet: 1.757 ± 0.171
3.753GluAsn: 3.753 ± 0.316
1.89GluPro: 1.89 ± 0.312
2.183GluGln: 2.183 ± 0.282
3.54GluArg: 3.54 ± 0.289
3.274GluSer: 3.274 ± 0.357
3.647GluThr: 3.647 ± 0.28
1.996GluVal: 1.996 ± 0.225
0.399GluTrp: 0.399 ± 0.1
2.475GluTyr: 2.475 ± 0.288
0.0GluXaa: 0.0 ± 0.0
Phe
2.422PheAla: 2.422 ± 0.217
0.905PheCys: 0.905 ± 0.159
3.354PheAsp: 3.354 ± 0.275
2.875PheGlu: 2.875 ± 0.302
2.875PhePhe: 2.875 ± 0.286
2.129PheGly: 2.129 ± 0.261
1.171PheHis: 1.171 ± 0.173
3.7PheIle: 3.7 ± 0.276
2.555PheLys: 2.555 ± 0.229
3.993PheLeu: 3.993 ± 0.263
1.411PheMet: 1.411 ± 0.192
3.62PheAsn: 3.62 ± 0.375
1.411PhePro: 1.411 ± 0.182
1.517PheGln: 1.517 ± 0.175
2.821PheArg: 2.821 ± 0.283
3.141PheSer: 3.141 ± 0.305
2.422PheThr: 2.422 ± 0.269
3.647PheVal: 3.647 ± 0.288
0.453PheTrp: 0.453 ± 0.098
2.05PheTyr: 2.05 ± 0.25
0.0PheXaa: 0.0 ± 0.0
Gly
1.624GlyAla: 1.624 ± 0.209
0.932GlyCys: 0.932 ± 0.155
4.419GlyAsp: 4.419 ± 0.43
1.757GlyGlu: 1.757 ± 0.219
1.65GlyPhe: 1.65 ± 0.2
5.989GlyGly: 5.989 ± 0.572
0.878GlyHis: 0.878 ± 0.157
2.662GlyIle: 2.662 ± 0.215
1.837GlyLys: 1.837 ± 0.206
2.821GlyLeu: 2.821 ± 0.263
1.704GlyMet: 1.704 ± 0.191
2.555GlyAsn: 2.555 ± 0.284
1.011GlyPro: 1.011 ± 0.163
1.517GlyGln: 1.517 ± 0.159
2.396GlyArg: 2.396 ± 0.271
3.194GlySer: 3.194 ± 0.334
2.129GlyThr: 2.129 ± 0.268
2.662GlyVal: 2.662 ± 0.293
0.319GlyTrp: 0.319 ± 0.083
1.517GlyTyr: 1.517 ± 0.192
0.0GlyXaa: 0.0 ± 0.0
His
1.145HisAla: 1.145 ± 0.209
0.399HisCys: 0.399 ± 0.108
1.145HisAsp: 1.145 ± 0.184
1.198HisGlu: 1.198 ± 0.165
0.985HisPhe: 0.985 ± 0.169
0.905HisGly: 0.905 ± 0.174
0.799HisHis: 0.799 ± 0.178
1.863HisIle: 1.863 ± 0.254
1.011HisLys: 1.011 ± 0.188
2.502HisLeu: 2.502 ± 0.256
0.665HisMet: 0.665 ± 0.129
1.544HisAsn: 1.544 ± 0.229
0.932HisPro: 0.932 ± 0.162
0.719HisGln: 0.719 ± 0.142
1.491HisArg: 1.491 ± 0.175
1.624HisSer: 1.624 ± 0.198
1.411HisThr: 1.411 ± 0.182
1.544HisVal: 1.544 ± 0.212
0.213HisTrp: 0.213 ± 0.069
1.011HisTyr: 1.011 ± 0.143
0.0HisXaa: 0.0 ± 0.0
Ile
3.54IleAla: 3.54 ± 0.355
1.145IleCys: 1.145 ± 0.19
5.51IleAsp: 5.51 ± 0.372
4.658IleGlu: 4.658 ± 0.4
4.099IlePhe: 4.099 ± 0.374
2.742IleGly: 2.742 ± 0.268
1.704IleHis: 1.704 ± 0.235
4.605IleIle: 4.605 ± 0.354
3.141IleLys: 3.141 ± 0.275
6.415IleLeu: 6.415 ± 0.398
1.996IleMet: 1.996 ± 0.246
4.019IleAsn: 4.019 ± 0.411
2.742IlePro: 2.742 ± 0.204
2.422IleGln: 2.422 ± 0.243
3.514IleArg: 3.514 ± 0.296
4.046IleSer: 4.046 ± 0.313
3.088IleThr: 3.088 ± 0.258
5.856IleVal: 5.856 ± 0.435
0.346IleTrp: 0.346 ± 0.099
3.78IleTyr: 3.78 ± 0.269
0.0IleXaa: 0.0 ± 0.0
Lys
1.837LysAla: 1.837 ± 0.23
1.304LysCys: 1.304 ± 0.145
1.73LysAsp: 1.73 ± 0.254
1.89LysGlu: 1.89 ± 0.281
3.114LysPhe: 3.114 ± 0.327
1.251LysGly: 1.251 ± 0.168
1.118LysHis: 1.118 ± 0.158
3.886LysIle: 3.886 ± 0.325
2.502LysLys: 2.502 ± 0.351
4.765LysLeu: 4.765 ± 0.356
1.65LysMet: 1.65 ± 0.24
3.327LysAsn: 3.327 ± 0.394
1.544LysPro: 1.544 ± 0.179
2.103LysGln: 2.103 ± 0.249
3.7LysArg: 3.7 ± 0.279
3.46LysSer: 3.46 ± 0.353
3.593LysThr: 3.593 ± 0.353
2.103LysVal: 2.103 ± 0.245
0.319LysTrp: 0.319 ± 0.068
3.647LysTyr: 3.647 ± 0.418
0.0LysXaa: 0.0 ± 0.0
Leu
5.031LeuAla: 5.031 ± 0.389
2.289LeuCys: 2.289 ± 0.296
4.978LeuAsp: 4.978 ± 0.465
3.913LeuGlu: 3.913 ± 0.324
4.419LeuPhe: 4.419 ± 0.334
3.141LeuGly: 3.141 ± 0.311
2.396LeuHis: 2.396 ± 0.33
5.696LeuIle: 5.696 ± 0.465
3.886LeuLys: 3.886 ± 0.245
8.411LeuLeu: 8.411 ± 0.495
2.609LeuMet: 2.609 ± 0.272
6.122LeuAsn: 6.122 ± 0.473
4.312LeuPro: 4.312 ± 0.288
4.073LeuGln: 4.073 ± 0.389
6.282LeuArg: 6.282 ± 0.42
5.883LeuSer: 5.883 ± 0.454
5.084LeuThr: 5.084 ± 0.392
6.495LeuVal: 6.495 ± 0.419
0.878LeuTrp: 0.878 ± 0.159
4.951LeuTyr: 4.951 ± 0.439
0.0LeuXaa: 0.0 ± 0.0
Met
1.837MetAla: 1.837 ± 0.228
0.639MetCys: 0.639 ± 0.114
2.076MetAsp: 2.076 ± 0.268
1.091MetGlu: 1.091 ± 0.144
1.597MetPhe: 1.597 ± 0.211
1.304MetGly: 1.304 ± 0.155
0.719MetHis: 0.719 ± 0.147
1.118MetIle: 1.118 ± 0.188
1.437MetLys: 1.437 ± 0.217
2.768MetLeu: 2.768 ± 0.253
1.358MetMet: 1.358 ± 0.228
2.076MetAsn: 2.076 ± 0.215
1.145MetPro: 1.145 ± 0.145
0.878MetGln: 0.878 ± 0.141
1.916MetArg: 1.916 ± 0.197
2.955MetSer: 2.955 ± 0.289
1.97MetThr: 1.97 ± 0.249
1.411MetVal: 1.411 ± 0.217
0.213MetTrp: 0.213 ± 0.075
1.996MetTyr: 1.996 ± 0.228
0.0MetXaa: 0.0 ± 0.0
Asn
3.354AsnAla: 3.354 ± 0.257
1.358AsnCys: 1.358 ± 0.166
3.7AsnAsp: 3.7 ± 0.265
3.673AsnGlu: 3.673 ± 0.374
3.46AsnPhe: 3.46 ± 0.253
2.848AsnGly: 2.848 ± 0.338
1.065AsnHis: 1.065 ± 0.167
6.255AsnIle: 6.255 ± 0.352
3.354AsnLys: 3.354 ± 0.37
5.909AsnLeu: 5.909 ± 0.448
1.837AsnMet: 1.837 ± 0.214
4.259AsnAsn: 4.259 ± 0.449
1.783AsnPro: 1.783 ± 0.201
1.57AsnGln: 1.57 ± 0.177
4.206AsnArg: 4.206 ± 0.31
4.232AsnSer: 4.232 ± 0.352
3.833AsnThr: 3.833 ± 0.304
5.217AsnVal: 5.217 ± 0.39
0.346AsnTrp: 0.346 ± 0.085
3.114AsnTyr: 3.114 ± 0.252
0.0AsnXaa: 0.0 ± 0.0
Pro
2.103ProAla: 2.103 ± 0.278
0.745ProCys: 0.745 ± 0.143
2.129ProAsp: 2.129 ± 0.243
1.757ProGlu: 1.757 ± 0.199
1.57ProPhe: 1.57 ± 0.207
1.517ProGly: 1.517 ± 0.265
0.932ProHis: 0.932 ± 0.135
2.821ProIle: 2.821 ± 0.341
2.103ProLys: 2.103 ± 0.259
3.434ProLeu: 3.434 ± 0.299
0.905ProMet: 0.905 ± 0.164
2.023ProAsn: 2.023 ± 0.219
3.673ProPro: 3.673 ± 0.467
1.996ProGln: 1.996 ± 0.276
1.837ProArg: 1.837 ± 0.23
4.472ProSer: 4.472 ± 0.436
2.662ProThr: 2.662 ± 0.259
1.863ProVal: 1.863 ± 0.183
0.399ProTrp: 0.399 ± 0.084
1.704ProTyr: 1.704 ± 0.198
0.0ProXaa: 0.0 ± 0.0
Gln
1.358GlnAla: 1.358 ± 0.175
0.825GlnCys: 0.825 ± 0.112
1.251GlnAsp: 1.251 ± 0.214
1.464GlnGlu: 1.464 ± 0.262
1.597GlnPhe: 1.597 ± 0.207
0.932GlnGly: 0.932 ± 0.171
1.038GlnHis: 1.038 ± 0.172
2.316GlnIle: 2.316 ± 0.224
2.129GlnLys: 2.129 ± 0.255
3.806GlnLeu: 3.806 ± 0.306
1.411GlnMet: 1.411 ± 0.19
1.81GlnAsn: 1.81 ± 0.162
1.704GlnPro: 1.704 ± 0.197
2.635GlnGln: 2.635 ± 0.276
3.141GlnArg: 3.141 ± 0.327
2.555GlnSer: 2.555 ± 0.241
2.742GlnThr: 2.742 ± 0.249
1.73GlnVal: 1.73 ± 0.198
0.399GlnTrp: 0.399 ± 0.095
2.156GlnTyr: 2.156 ± 0.199
0.0GlnXaa: 0.0 ± 0.0
Arg
1.916ArgAla: 1.916 ± 0.211
1.358ArgCys: 1.358 ± 0.167
3.008ArgAsp: 3.008 ± 0.307
2.742ArgGlu: 2.742 ± 0.264
2.875ArgPhe: 2.875 ± 0.302
1.81ArgGly: 1.81 ± 0.236
2.209ArgHis: 2.209 ± 0.252
4.605ArgIle: 4.605 ± 0.337
3.673ArgLys: 3.673 ± 0.287
6.628ArgLeu: 6.628 ± 0.442
1.89ArgMet: 1.89 ± 0.199
4.312ArgAsn: 4.312 ± 0.278
2.263ArgPro: 2.263 ± 0.189
2.768ArgGln: 2.768 ± 0.28
5.137ArgArg: 5.137 ± 0.436
3.168ArgSer: 3.168 ± 0.295
3.647ArgThr: 3.647 ± 0.28
3.46ArgVal: 3.46 ± 0.347
0.479ArgTrp: 0.479 ± 0.11
3.194ArgTyr: 3.194 ± 0.306
0.0ArgXaa: 0.0 ± 0.0
Ser
3.86SerAla: 3.86 ± 0.335
1.198SerCys: 1.198 ± 0.183
4.046SerAsp: 4.046 ± 0.359
3.434SerGlu: 3.434 ± 0.287
2.875SerPhe: 2.875 ± 0.259
3.434SerGly: 3.434 ± 0.366
1.411SerHis: 1.411 ± 0.203
4.179SerIle: 4.179 ± 0.34
3.221SerLys: 3.221 ± 0.266
6.947SerLeu: 6.947 ± 0.405
2.236SerMet: 2.236 ± 0.258
3.993SerAsn: 3.993 ± 0.371
3.806SerPro: 3.806 ± 0.448
2.396SerGln: 2.396 ± 0.246
3.726SerArg: 3.726 ± 0.291
10.966SerSer: 10.966 ± 0.958
4.924SerThr: 4.924 ± 0.383
5.164SerVal: 5.164 ± 0.383
0.453SerTrp: 0.453 ± 0.106
2.635SerTyr: 2.635 ± 0.269
0.0SerXaa: 0.0 ± 0.0
Thr
3.034ThrAla: 3.034 ± 0.336
1.624ThrCys: 1.624 ± 0.225
3.327ThrAsp: 3.327 ± 0.369
3.247ThrGlu: 3.247 ± 0.312
3.114ThrPhe: 3.114 ± 0.337
2.529ThrGly: 2.529 ± 0.234
1.251ThrHis: 1.251 ± 0.161
4.392ThrIle: 4.392 ± 0.364
3.114ThrLys: 3.114 ± 0.309
6.042ThrLeu: 6.042 ± 0.354
2.05ThrMet: 2.05 ± 0.262
4.019ThrAsn: 4.019 ± 0.39
3.327ThrPro: 3.327 ± 0.27
2.129ThrGln: 2.129 ± 0.245
3.327ThrArg: 3.327 ± 0.277
4.738ThrSer: 4.738 ± 0.435
5.164ThrThr: 5.164 ± 0.462
3.354ThrVal: 3.354 ± 0.327
0.426ThrTrp: 0.426 ± 0.108
2.475ThrTyr: 2.475 ± 0.251
0.0ThrXaa: 0.0 ± 0.0
Val
3.86ValAla: 3.86 ± 0.315
1.331ValCys: 1.331 ± 0.191
4.818ValAsp: 4.818 ± 0.397
3.168ValGlu: 3.168 ± 0.258
3.567ValPhe: 3.567 ± 0.299
2.236ValGly: 2.236 ± 0.229
1.757ValHis: 1.757 ± 0.235
3.567ValIle: 3.567 ± 0.362
2.555ValLys: 2.555 ± 0.306
5.057ValLeu: 5.057 ± 0.356
1.464ValMet: 1.464 ± 0.176
4.285ValAsn: 4.285 ± 0.364
2.422ValPro: 2.422 ± 0.285
2.396ValGln: 2.396 ± 0.219
3.434ValArg: 3.434 ± 0.305
4.552ValSer: 4.552 ± 0.303
2.795ValThr: 2.795 ± 0.234
4.658ValVal: 4.658 ± 0.372
0.639ValTrp: 0.639 ± 0.123
3.647ValTyr: 3.647 ± 0.346
0.0ValXaa: 0.0 ± 0.0
Trp
0.186TrpAla: 0.186 ± 0.052
0.133TrpCys: 0.133 ± 0.058
0.213TrpAsp: 0.213 ± 0.072
0.293TrpGlu: 0.293 ± 0.09
0.319TrpPhe: 0.319 ± 0.088
0.213TrpGly: 0.213 ± 0.061
0.319TrpHis: 0.319 ± 0.088
0.612TrpIle: 0.612 ± 0.141
0.426TrpLys: 0.426 ± 0.108
0.745TrpLeu: 0.745 ± 0.148
0.16TrpMet: 0.16 ± 0.063
0.586TrpAsn: 0.586 ± 0.13
0.24TrpPro: 0.24 ± 0.083
0.24TrpGln: 0.24 ± 0.08
0.612TrpArg: 0.612 ± 0.127
0.745TrpSer: 0.745 ± 0.132
0.586TrpThr: 0.586 ± 0.138
0.24TrpVal: 0.24 ± 0.08
0.133TrpTrp: 0.133 ± 0.062
0.346TrpTyr: 0.346 ± 0.078
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.396TyrAla: 2.396 ± 0.248
1.118TyrCys: 1.118 ± 0.186
2.928TyrAsp: 2.928 ± 0.293
2.955TyrGlu: 2.955 ± 0.283
2.05TyrPhe: 2.05 ± 0.248
2.263TyrGly: 2.263 ± 0.28
0.719TyrHis: 0.719 ± 0.115
3.54TyrIle: 3.54 ± 0.315
2.236TyrLys: 2.236 ± 0.228
4.685TyrLeu: 4.685 ± 0.245
1.704TyrMet: 1.704 ± 0.199
3.567TyrAsn: 3.567 ± 0.263
1.491TyrPro: 1.491 ± 0.162
1.198TyrGln: 1.198 ± 0.186
3.034TyrArg: 3.034 ± 0.293
3.247TyrSer: 3.247 ± 0.263
3.487TyrThr: 3.487 ± 0.322
2.928TyrVal: 2.928 ± 0.248
0.266TyrTrp: 0.266 ± 0.1
2.076TyrTyr: 2.076 ± 0.221
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 108 proteins (37570 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski