Amino acid dipeptide frequency for Mink coronavirus strain WD1133

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequencies.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred randomly and uniformly in proteins, each of the 400 would therefore be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and underrepresented ones are marked in blue.
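The counting described above can be sketched in Python. This is only an illustrative sketch, not the database's own code: the function names are hypothetical, and it assumes that overlapping adjacent pairs are counted within each protein (never across protein boundaries), that any non-standard residue is mapped to Xaa (X), and that frequencies are normalised by the total number of counted pairs, which may differ from the normalisation used for the table.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes
ALPHABET = STANDARD + "X"          # X stands for Xaa (non-standard/unknown)


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides.

    Assumption: overlapping pairs are counted inside each sequence and the
    result is normalised by the total number of pairs counted.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_permille, expected=2.5):
    """Compare a frequency with the uniform expectation of 1/400 = 2.5 per mille."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"
```

For example, `classify(11.056)` returns `"over"` (the ValVal value in the table) and `classify(0.076)` returns `"under"` (the HisArg value).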

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions (for example, 4.695‰ = 0.004695 = 0.4695%).
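The unit conversion is simple arithmetic; a minimal Python sketch is given here for convenience. The helper names are hypothetical and are not part of any Proteome-pI tooling.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (1 per mille = 0.1 percent)."""
    return value_permille * 0.1
```

With these helpers, the uniform expectation of 2.5‰ corresponds to a fraction of 0.0025, that is 0.25%.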

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.695 ± 0.366
AlaCys: 2.348 ± 0.464
AlaAsp: 1.742 ± 0.527
AlaGlu: 2.575 ± 0.483
AlaPhe: 2.726 ± 0.344
AlaGly: 3.559 ± 0.503
AlaHis: 0.984 ± 0.125
AlaIle: 4.771 ± 0.412
AlaLys: 4.695 ± 0.802
AlaLeu: 5.528 ± 0.672
AlaMet: 0.984 ± 0.179
AlaAsn: 4.998 ± 0.364
AlaPro: 1.893 ± 0.413
AlaGln: 1.515 ± 0.467
AlaArg: 1.742 ± 0.257
AlaSer: 4.089 ± 0.525
AlaThr: 4.468 ± 0.56
AlaVal: 5.68 ± 0.637
AlaTrp: 0.53 ± 0.073
AlaTyr: 2.953 ± 0.205
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.272 ± 0.258
CysCys: 1.06 ± 0.367
CysAsp: 1.59 ± 0.464
CysGlu: 0.606 ± 0.156
CysPhe: 1.969 ± 0.226
CysGly: 2.726 ± 0.248
CysHis: 0.227 ± 0.098
CysIle: 1.515 ± 0.182
CysLys: 2.953 ± 0.574
CysLeu: 3.029 ± 0.465
CysMet: 0.303 ± 0.119
CysAsn: 2.12 ± 0.471
CysPro: 0.833 ± 0.121
CysGln: 0.227 ± 0.15
CysArg: 1.439 ± 0.171
CysSer: 2.953 ± 0.762
CysThr: 2.272 ± 0.422
CysVal: 4.014 ± 0.738
CysTrp: 0.909 ± 0.175
CysTyr: 2.878 ± 0.618
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.408 ± 0.566
AspCys: 2.423 ± 0.251
AspAsp: 2.651 ± 0.334
AspGlu: 1.893 ± 0.398
AspPhe: 3.332 ± 0.65
AspGly: 4.165 ± 0.365
AspHis: 1.212 ± 0.109
AspIle: 2.802 ± 0.658
AspLys: 2.802 ± 0.757
AspLeu: 4.468 ± 0.546
AspMet: 0.984 ± 0.313
AspAsn: 3.332 ± 0.438
AspPro: 1.742 ± 0.33
AspGln: 0.757 ± 0.198
AspArg: 1.287 ± 0.173
AspSer: 3.181 ± 0.435
AspThr: 1.969 ± 0.606
AspVal: 8.179 ± 0.729
AspTrp: 0.757 ± 0.197
AspTyr: 3.635 ± 0.32
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.878 ± 0.336
GluCys: 1.287 ± 0.261
GluAsp: 3.029 ± 0.656
GluGlu: 1.817 ± 0.309
GluPhe: 1.666 ± 0.151
GluGly: 3.105 ± 0.257
GluHis: 0.757 ± 0.248
GluIle: 2.12 ± 0.241
GluLys: 1.136 ± 0.312
GluLeu: 3.408 ± 0.459
GluMet: 0.151 ± 0.264
GluAsn: 2.045 ± 0.532
GluPro: 1.817 ± 0.22
GluGln: 2.348 ± 0.308
GluArg: 1.742 ± 0.483
GluSer: 2.802 ± 0.271
GluThr: 1.742 ± 0.372
GluVal: 3.332 ± 0.465
GluTrp: 0.379 ± 0.087
GluTyr: 1.969 ± 0.197
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.348 ± 0.213
PheCys: 2.045 ± 0.221
PheAsp: 4.241 ± 0.541
PheGlu: 2.12 ± 0.514
PhePhe: 2.726 ± 0.345
PheGly: 4.317 ± 0.427
PheHis: 0.454 ± 0.063
PheIle: 2.575 ± 0.44
PheLys: 4.392 ± 0.819
PheLeu: 3.256 ± 0.555
PheMet: 0.833 ± 0.226
PheAsn: 3.711 ± 0.595
PhePro: 1.06 ± 0.435
PheGln: 0.53 ± 0.491
PheArg: 0.909 ± 0.132
PheSer: 3.408 ± 0.678
PheThr: 2.878 ± 0.301
PheVal: 5.528 ± 0.466
PheTrp: 0.909 ± 0.154
PheTyr: 4.014 ± 0.407
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.786 ± 0.246
GlyCys: 3.181 ± 0.378
GlyAsp: 4.998 ± 0.332
GlyGlu: 1.817 ± 0.437
GlyPhe: 4.014 ± 0.385
GlyGly: 5.15 ± 0.403
GlyHis: 0.682 ± 0.093
GlyIle: 2.651 ± 0.41
GlyLys: 3.711 ± 0.549
GlyLeu: 4.619 ± 0.625
GlyMet: 1.817 ± 0.306
GlyAsn: 5.301 ± 0.813
GlyPro: 1.439 ± 0.219
GlyGln: 0.757 ± 0.331
GlyArg: 1.893 ± 0.779
GlySer: 4.695 ± 0.219
GlyThr: 4.014 ± 0.347
GlyVal: 6.437 ± 0.887
GlyTrp: 0.303 ± 0.191
GlyTyr: 4.317 ± 0.709
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.984 ± 0.303
HisCys: 0.757 ± 0.163
HisAsp: 0.984 ± 0.184
HisGlu: 0.757 ± 0.199
HisPhe: 0.984 ± 0.185
HisGly: 0.454 ± 0.063
HisHis: 0.303 ± 0.111
HisIle: 0.454 ± 0.352
HisLys: 2.045 ± 0.552
HisLeu: 2.045 ± 0.147
HisMet: 0.303 ± 0.178
HisAsn: 1.212 ± 0.276
HisPro: 0.682 ± 0.175
HisGln: 0.53 ± 0.178
HisArg: 0.076 ± 0.049
HisSer: 0.984 ± 0.246
HisThr: 1.212 ± 0.336
HisVal: 2.272 ± 0.311
HisTrp: 0.151 ± 0.262
HisTyr: 1.136 ± 0.227
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.878 ± 0.428
IleCys: 1.817 ± 0.157
IleAsp: 2.953 ± 0.252
IleGlu: 1.439 ± 0.352
IlePhe: 2.272 ± 0.207
IleGly: 3.711 ± 0.431
IleHis: 0.151 ± 0.099
IleIle: 2.802 ± 0.508
IleLys: 3.711 ± 0.573
IleLeu: 4.317 ± 0.509
IleMet: 1.59 ± 0.21
IleAsn: 3.711 ± 0.604
IlePro: 2.12 ± 0.416
IleGln: 1.287 ± 0.193
IleArg: 1.817 ± 0.157
IleSer: 3.029 ± 1.198
IleThr: 5.301 ± 0.285
IleVal: 6.74 ± 0.646
IleTrp: 0.303 ± 0.123
IleTyr: 1.742 ± 0.512
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.165 ± 0.664
LysCys: 1.969 ± 0.61
LysAsp: 2.953 ± 0.425
LysGlu: 3.408 ± 0.467
LysPhe: 3.938 ± 0.617
LysGly: 3.332 ± 0.555
LysHis: 2.651 ± 0.607
LysIle: 2.348 ± 0.374
LysLys: 1.515 ± 0.397
LysLeu: 6.285 ± 0.355
LysMet: 1.59 ± 0.238
LysAsn: 3.105 ± 0.847
LysPro: 3.786 ± 0.824
LysGln: 1.893 ± 0.576
LysArg: 1.666 ± 0.214
LysSer: 3.862 ± 0.951
LysThr: 3.181 ± 0.443
LysVal: 5.377 ± 0.611
LysTrp: 0.53 ± 0.237
LysTyr: 2.953 ± 0.384
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.165 ± 0.638
LeuCys: 3.484 ± 0.654
LeuAsp: 4.544 ± 0.388
LeuGlu: 4.392 ± 0.235
LeuPhe: 3.862 ± 0.541
LeuGly: 4.922 ± 0.482
LeuHis: 1.439 ± 0.249
LeuIle: 4.468 ± 1.273
LeuLys: 5.528 ± 0.594
LeuLeu: 8.709 ± 0.834
LeuMet: 1.893 ± 0.326
LeuAsn: 4.847 ± 1.068
LeuPro: 3.332 ± 0.872
LeuGln: 3.862 ± 0.212
LeuArg: 2.575 ± 0.452
LeuSer: 7.043 ± 0.329
LeuThr: 4.922 ± 1.034
LeuVal: 6.816 ± 0.557
LeuTrp: 1.363 ± 0.343
LeuTyr: 3.711 ± 0.388
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.06 ± 0.286
MetCys: 0.53 ± 0.252
MetAsp: 0.606 ± 0.222
MetGlu: 0.909 ± 0.173
MetPhe: 1.287 ± 0.396
MetGly: 0.984 ± 0.226
MetHis: 0.606 ± 0.184
MetIle: 2.272 ± 0.562
MetLys: 0.53 ± 0.164
MetLeu: 2.575 ± 0.3
MetMet: 0.53 ± 0.18
MetAsn: 0.682 ± 0.27
MetPro: 0.682 ± 0.15
MetGln: 0.984 ± 0.167
MetArg: 1.287 ± 0.186
MetSer: 1.136 ± 0.142
MetThr: 1.59 ± 0.359
MetVal: 1.212 ± 0.423
MetTrp: 0.076 ± 0.139
MetTyr: 1.515 ± 0.167
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.317 ± 0.974
AsnCys: 2.499 ± 0.379
AsnAsp: 3.332 ± 0.464
AsnGlu: 1.817 ± 0.338
AsnPhe: 2.423 ± 0.345
AsnGly: 6.891 ± 0.683
AsnHis: 1.287 ± 1.069
AsnIle: 3.256 ± 0.263
AsnLys: 2.878 ± 0.588
AsnLeu: 5.377 ± 0.699
AsnMet: 1.212 ± 0.161
AsnAsn: 3.938 ± 0.652
AsnPro: 0.757 ± 0.282
AsnGln: 1.287 ± 0.685
AsnArg: 1.893 ± 0.522
AsnSer: 5.301 ± 0.765
AsnThr: 3.332 ± 0.474
AsnVal: 6.967 ± 0.612
AsnTrp: 0.984 ± 0.347
AsnTyr: 2.045 ± 0.441
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.59 ± 0.242
ProCys: 0.757 ± 0.322
ProAsp: 1.515 ± 0.292
ProGlu: 1.515 ± 0.167
ProPhe: 1.969 ± 0.35
ProGly: 2.651 ± 0.243
ProHis: 0.379 ± 0.142
ProIle: 2.423 ± 0.224
ProLys: 1.969 ± 0.664
ProLeu: 3.181 ± 0.432
ProMet: 0.606 ± 0.13
ProAsn: 1.59 ± 0.167
ProPro: 1.06 ± 0.224
ProGln: 0.682 ± 0.218
ProArg: 1.136 ± 0.369
ProSer: 2.651 ± 0.772
ProThr: 2.575 ± 0.447
ProVal: 3.256 ± 0.29
ProTrp: 0.682 ± 0.149
ProTyr: 1.363 ± 0.263
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.499 ± 0.57
GlnCys: 0.379 ± 0.203
GlnAsp: 1.06 ± 0.242
GlnGlu: 0.984 ± 0.502
GlnPhe: 1.212 ± 0.184
GlnGly: 1.363 ± 0.186
GlnHis: 0.682 ± 0.215
GlnIle: 0.682 ± 0.247
GlnLys: 2.196 ± 0.334
GlnLeu: 3.484 ± 0.38
GlnMet: 0.606 ± 0.173
GlnAsn: 1.666 ± 0.374
GlnPro: 1.59 ± 0.267
GlnGln: 0.53 ± 0.148
GlnArg: 0.757 ± 0.281
GlnSer: 2.953 ± 0.268
GlnThr: 2.348 ± 0.77
GlnVal: 1.666 ± 0.465
GlnTrp: 0.303 ± 0.111
GlnTyr: 0.909 ± 0.219
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.575 ± 0.613
ArgCys: 1.666 ± 0.246
ArgAsp: 1.817 ± 0.262
ArgGlu: 0.984 ± 0.246
ArgPhe: 2.272 ± 0.254
ArgGly: 1.742 ± 0.323
ArgHis: 0.682 ± 0.215
ArgIle: 1.287 ± 0.569
ArgLys: 1.969 ± 0.368
ArgLeu: 3.332 ± 0.284
ArgMet: 0.606 ± 0.172
ArgAsn: 1.817 ± 0.281
ArgPro: 0.757 ± 0.289
ArgGln: 1.212 ± 0.474
ArgArg: 1.287 ± 0.49
ArgSer: 2.953 ± 1.764
ArgThr: 1.742 ± 0.401
ArgVal: 2.272 ± 0.245
ArgTrp: 0.454 ± 0.27
ArgTyr: 1.06 ± 0.391
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.301 ± 0.538
SerCys: 1.515 ± 0.384
SerAsp: 4.392 ± 0.823
SerGlu: 2.348 ± 0.244
SerPhe: 2.499 ± 0.428
SerGly: 4.468 ± 0.705
SerHis: 1.439 ± 0.206
SerIle: 5.225 ± 0.463
SerLys: 4.771 ± 0.839
SerLeu: 5.528 ± 0.702
SerMet: 2.196 ± 0.278
SerAsn: 4.014 ± 0.622
SerPro: 1.439 ± 0.184
SerGln: 2.045 ± 0.855
SerArg: 2.802 ± 2.069
SerSer: 5.452 ± 0.644
SerThr: 3.786 ± 1.088
SerVal: 7.119 ± 0.563
SerTrp: 0.757 ± 0.28
SerTyr: 3.559 ± 0.279
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.408 ± 0.412
ThrCys: 2.12 ± 0.38
ThrAsp: 3.105 ± 0.292
ThrGlu: 2.348 ± 0.252
ThrPhe: 3.029 ± 0.325
ThrGly: 4.165 ± 1.15
ThrHis: 0.984 ± 0.396
ThrIle: 4.771 ± 0.705
ThrLys: 3.105 ± 0.346
ThrLeu: 4.847 ± 0.818
ThrMet: 1.06 ± 0.312
ThrAsn: 3.786 ± 0.672
ThrPro: 3.029 ± 0.296
ThrGln: 1.817 ± 0.205
ThrArg: 2.651 ± 0.225
ThrSer: 3.862 ± 0.555
ThrThr: 4.847 ± 0.346
ThrVal: 6.058 ± 0.679
ThrTrp: 0.227 ± 0.127
ThrTyr: 2.499 ± 0.273
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.437 ± 0.606
ValCys: 3.938 ± 0.572
ValAsp: 4.922 ± 0.649
ValGlu: 5.15 ± 0.727
ValPhe: 4.922 ± 0.801
ValGly: 4.544 ± 0.524
ValHis: 2.272 ± 0.477
ValIle: 5.074 ± 0.609
ValLys: 6.891 ± 0.98
ValLeu: 8.557 ± 1.556
ValMet: 2.12 ± 0.472
ValAsn: 5.983 ± 0.533
ValPro: 3.711 ± 0.848
ValGln: 4.014 ± 0.537
ValArg: 3.256 ± 0.312
ValSer: 5.604 ± 0.651
ValThr: 5.301 ± 0.483
ValVal: 11.056 ± 2.084
ValTrp: 1.515 ± 0.29
ValTyr: 4.089 ± 0.737
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.379 ± 0.115
TrpCys: 0.227 ± 0.071
TrpAsp: 0.984 ± 0.185
TrpGlu: 0.454 ± 0.138
TrpPhe: 1.439 ± 0.338
TrpGly: 0.379 ± 0.142
TrpHis: 0.454 ± 0.154
TrpIle: 0.379 ± 0.166
TrpLys: 0.303 ± 0.251
TrpLeu: 1.136 ± 0.155
TrpMet: 0.076 ± 0.049
TrpAsn: 0.757 ± 0.294
TrpPro: 0.379 ± 0.286
TrpGln: 0.227 ± 0.071
TrpArg: 0.606 ± 0.177
TrpSer: 1.666 ± 0.373
TrpThr: 0.379 ± 0.203
TrpVal: 0.833 ± 0.147
TrpTrp: 0.303 ± 0.178
TrpTyr: 0.909 ± 0.127
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.332 ± 0.802
TyrCys: 1.893 ± 0.304
TyrAsp: 3.559 ± 0.84
TyrGlu: 2.196 ± 0.32
TyrPhe: 3.711 ± 0.293
TyrGly: 2.953 ± 0.299
TyrHis: 0.682 ± 0.282
TyrIle: 1.817 ± 0.29
TyrLys: 3.408 ± 0.298
TyrLeu: 2.272 ± 0.424
TyrMet: 1.515 ± 0.241
TyrAsn: 3.181 ± 0.949
TyrPro: 1.439 ± 0.356
TyrGln: 1.363 ± 0.137
TyrArg: 1.666 ± 0.392
TyrSer: 3.029 ± 0.374
TyrThr: 3.862 ± 0.37
TyrVal: 4.544 ± 0.615
TyrTrp: 0.833 ± 0.236
TyrTyr: 2.878 ± 0.376
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (13206 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
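A protein-level bootstrap of this kind can be sketched as follows. This is an illustration under stated assumptions, not the procedure used to produce the values above: whole proteins are resampled with replacement, the frequency of one dipeptide (one-letter codes) is recomputed for each of 100 replicates, and the standard deviation across replicates is reported as the error. The function names, the seed parameter and the choice of standard deviation as the error measure are all assumptions.

```python
import random
import statistics


def dipeptide_permille_single(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide over overlapping pairs."""
    count = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                count += 1
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Assumption: the error is the standard deviation of the frequency
    across the bootstrap replicates.
    """
    rng = random.Random(seed)  # fixed seed for reproducible sketches
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille_single(sample, dipeptide))
    return statistics.pstdev(estimates)
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the spread reflects how much the estimate depends on which proteins happen to be in the set. With only 9 proteins, as here, a single long or unusual protein can dominate that spread.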

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski