Amino acid dipeptide frequency for Wuhan nido-like virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins randomly and with equal probability, each of the 400 dipeptides would be present at a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
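
The arithmetic above can be checked with a short sketch. The function names and the exact threshold rule are assumptions made for illustration; the page only states that dipeptides above the expected frequency are red and those below it are blue.

```python
# Uniform expectation for dipeptide frequencies (illustrative sketch only).
N_STANDARD = 20   # standard amino acids
N_WITH_XAA = 21   # plus Xaa for non-standard or unknown residues


def uniform_expectation_permille(alphabet_size: int) -> float:
    """Expected frequency (per mille) of each dipeptide if all were equally likely."""
    return 1000.0 / (alphabet_size ** 2)


def classify(observed_permille: float, alphabet_size: int = N_STANDARD) -> str:
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    expected = uniform_expectation_permille(alphabet_size)
    if observed_permille > expected:
        return "over"    # would be marked in red
    if observed_permille < expected:
        return "under"   # would be marked in blue
    return "expected"


print(uniform_expectation_permille(N_STANDARD))            # 2.5 per mille, i.e. 0.25%
print(round(uniform_expectation_permille(N_WITH_XAA), 3))  # 2.268 per mille
print(classify(9.532))  # LysLys value from the table -> over
print(classify(0.177))  # AlaCys value from the table -> under
```

With 20 amino acids the uniform expectation is 2.5‰, so a table value such as 9.532 for LysLys is well above it, while 0.177 for AlaCys is well below.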

All values are given in per mille (‰) and must therefore be multiplied by 10⁻³ to obtain fractions.
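
As a rough illustration of how such per mille values could be obtained, the sketch below counts overlapping residue pairs in a set of sequences. It assumes one-letter amino acid codes and normalisation by the total number of dipeptides; the database's actual procedure is not described on this page, and the function name and toy sequences are hypothetical.

```python
from collections import Counter


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Overlapping pairs within one protein; pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalise by the total number of dipeptides observed.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


toy = ["MKKLA", "KKA"]  # hypothetical sequences, not taken from this proteome
freqs = dipeptide_permille(toy)
print(round(freqs["KK"], 1))         # 333.3 (2 of 6 dipeptides, in per mille)
print(round(freqs["KK"] * 1e-3, 4))  # 0.3333 (the same value as a fraction)
```

Multiplying a per mille value by 10⁻³ gives the plain fraction, as in the last line of the example.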

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
1.765AlaAla: 1.765 ± 0.962
0.177AlaCys: 0.177 ± 0.096
0.53AlaAsp: 0.53 ± 0.077
1.765AlaGlu: 1.765 ± 0.652
1.589AlaPhe: 1.589 ± 0.28
1.236AlaGly: 1.236 ± 0.367
0.883AlaHis: 0.883 ± 0.183
1.589AlaIle: 1.589 ± 0.28
2.471AlaLys: 2.471 ± 0.267
3.354AlaLeu: 3.354 ± 0.61
1.059AlaMet: 1.059 ± 0.273
2.295AlaAsn: 2.295 ± 1.208
1.059AlaPro: 1.059 ± 0.273
1.589AlaGln: 1.589 ± 0.28
2.118AlaArg: 2.118 ± 0.844
2.824AlaSer: 2.824 ± 0.655
1.589AlaThr: 1.589 ± 0.28
1.942AlaVal: 1.942 ± 0.248
0.177AlaTrp: 0.177 ± 0.096
1.236AlaTyr: 1.236 ± 0.673
0.0AlaXaa: 0.0 ± 0.0
Cys
0.706CysAla: 0.706 ± 0.385
0.0CysCys: 0.0 ± 0.0
1.236CysAsp: 1.236 ± 0.153
0.883CysGlu: 0.883 ± 0.509
0.883CysPhe: 0.883 ± 0.827
1.059CysGly: 1.059 ± 0.154
0.353CysHis: 0.353 ± 0.459
0.883CysIle: 0.883 ± 0.183
1.589CysLys: 1.589 ± 0.231
1.589CysLeu: 1.589 ± 0.448
0.706CysMet: 0.706 ± 0.385
1.412CysAsn: 1.412 ± 0.566
0.353CysPro: 0.353 ± 0.141
0.706CysGln: 0.706 ± 0.597
0.53CysArg: 0.53 ± 0.077
1.412CysSer: 1.412 ± 0.566
1.059CysThr: 1.059 ± 0.737
0.353CysVal: 0.353 ± 0.192
0.177CysTrp: 0.177 ± 0.096
0.883CysTyr: 0.883 ± 0.207
0.0CysXaa: 0.0 ± 0.0
Asp
1.765AspAla: 1.765 ± 0.291
0.53AspCys: 0.53 ± 0.499
2.824AspAsp: 2.824 ± 0.356
4.766AspGlu: 4.766 ± 1.092
3.001AspPhe: 3.001 ± 0.48
1.589AspGly: 1.589 ± 0.433
1.942AspHis: 1.942 ± 0.533
4.237AspIle: 4.237 ± 0.11
6.178AspLys: 6.178 ± 1.548
4.943AspLeu: 4.943 ± 0.714
1.412AspMet: 1.412 ± 0.622
3.707AspAsn: 3.707 ± 0.819
0.883AspPro: 0.883 ± 0.481
2.648AspGln: 2.648 ± 0.755
2.295AspArg: 2.295 ± 0.351
3.53AspSer: 3.53 ± 0.44
2.471AspThr: 2.471 ± 0.461
2.648AspVal: 2.648 ± 1.131
0.53AspTrp: 0.53 ± 0.499
3.53AspTyr: 3.53 ± 0.44
0.0AspXaa: 0.0 ± 0.0
Glu
1.765GluAla: 1.765 ± 0.22
0.883GluCys: 0.883 ± 0.509
3.354GluAsp: 3.354 ± 1.209
4.06GluGlu: 4.06 ± 0.549
3.53GluPhe: 3.53 ± 0.44
3.001GluGly: 3.001 ± 1.199
1.942GluHis: 1.942 ± 0.933
6.708GluIle: 6.708 ± 0.896
8.12GluLys: 8.12 ± 1.732
4.766GluLeu: 4.766 ± 1.671
2.471GluMet: 2.471 ± 0.734
5.296GluAsn: 5.296 ± 0.688
0.353GluPro: 0.353 ± 0.141
4.766GluGln: 4.766 ± 1.357
2.648GluArg: 2.648 ± 0.548
5.296GluSer: 5.296 ± 0.381
3.707GluThr: 3.707 ± 0.793
4.413GluVal: 4.413 ± 1.088
0.883GluTrp: 0.883 ± 0.207
2.295GluTyr: 2.295 ± 0.379
0.0GluXaa: 0.0 ± 0.0
Phe
1.589PheAla: 1.589 ± 0.866
1.236PheCys: 1.236 ± 0.153
3.001PheAsp: 3.001 ± 0.563
4.766PheGlu: 4.766 ± 0.694
3.354PhePhe: 3.354 ± 1.192
4.06PheGly: 4.06 ± 0.63
0.53PheHis: 0.53 ± 0.077
3.53PheIle: 3.53 ± 0.826
5.119PheLys: 5.119 ± 0.658
4.06PheLeu: 4.06 ± 0.896
1.942PheMet: 1.942 ± 0.561
3.883PheAsn: 3.883 ± 2.022
1.589PhePro: 1.589 ± 0.72
1.589PheGln: 1.589 ± 1.175
1.412PheArg: 1.412 ± 0.534
3.354PheSer: 3.354 ± 0.813
3.001PheThr: 3.001 ± 0.371
3.001PheVal: 3.001 ± 0.371
1.059PheTrp: 1.059 ± 0.424
1.589PheTyr: 1.589 ± 0.791
0.0PheXaa: 0.0 ± 0.0
Gly
0.883GlyAla: 0.883 ± 0.207
0.706GlyCys: 0.706 ± 0.283
1.236GlyAsp: 1.236 ± 0.65
3.001GlyGlu: 3.001 ± 0.879
3.883GlyPhe: 3.883 ± 0.886
1.942GlyGly: 1.942 ± 0.248
0.883GlyHis: 0.883 ± 0.827
4.237GlyIle: 4.237 ± 1.103
5.649GlyLys: 5.649 ± 1.013
3.707GlyLeu: 3.707 ± 0.581
1.059GlyMet: 1.059 ± 0.273
3.354GlyAsn: 3.354 ± 2.604
1.412GlyPro: 1.412 ± 0.534
2.648GlyGln: 2.648 ± 0.667
1.059GlyArg: 1.059 ± 0.367
1.942GlySer: 1.942 ± 0.533
3.001GlyThr: 3.001 ± 1.018
2.295GlyVal: 2.295 ± 0.217
0.706GlyTrp: 0.706 ± 0.102
3.177GlyTyr: 3.177 ± 0.216
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.177HisCys: 0.177 ± 0.229
1.765HisAsp: 1.765 ± 0.365
2.648HisGlu: 2.648 ± 0.548
1.765HisPhe: 1.765 ± 0.22
1.059HisGly: 1.059 ± 0.273
1.236HisHis: 1.236 ± 0.153
2.295HisIle: 2.295 ± 0.292
2.118HisLys: 2.118 ± 0.547
1.236HisLeu: 1.236 ± 0.65
0.706HisMet: 0.706 ± 0.102
0.0HisAsn: 0.0 ± 0.0
0.353HisPro: 0.353 ± 0.192
0.53HisGln: 0.53 ± 0.369
0.177HisArg: 0.177 ± 0.229
1.236HisSer: 1.236 ± 0.65
1.059HisThr: 1.059 ± 0.273
1.765HisVal: 1.765 ± 0.291
0.177HisTrp: 0.177 ± 0.229
1.236HisTyr: 1.236 ± 0.346
0.0HisXaa: 0.0 ± 0.0
Ile
2.824IleAla: 2.824 ± 0.923
1.942IleCys: 1.942 ± 0.627
6.884IleAsp: 6.884 ± 0.674
6.002IleGlu: 6.002 ± 1.388
4.943IlePhe: 4.943 ± 1.103
3.177IleGly: 3.177 ± 0.463
1.412IleHis: 1.412 ± 0.462
7.944IleIle: 7.944 ± 0.516
6.178IleLys: 6.178 ± 1.446
6.355IleLeu: 6.355 ± 1.944
1.412IleMet: 1.412 ± 0.462
7.414IleAsn: 7.414 ± 0.949
3.53IlePro: 3.53 ± 0.733
2.824IleGln: 2.824 ± 0.353
3.001IleArg: 3.001 ± 0.48
5.296IleSer: 5.296 ± 0.839
4.237IleThr: 4.237 ± 0.617
3.883IleVal: 3.883 ± 0.564
0.883IleTrp: 0.883 ± 0.183
4.943IleTyr: 4.943 ± 0.495
0.0IleXaa: 0.0 ± 0.0
Lys
2.471LysAla: 2.471 ± 0.734
1.236LysCys: 1.236 ± 0.346
6.002LysAsp: 6.002 ± 1.072
7.061LysGlu: 7.061 ± 0.742
3.53LysPhe: 3.53 ± 0.592
3.53LysGly: 3.53 ± 0.578
2.824LysHis: 2.824 ± 0.556
8.12LysIle: 8.12 ± 0.559
9.532LysLys: 9.532 ± 0.951
6.884LysLeu: 6.884 ± 1.138
1.765LysMet: 1.765 ± 0.37
7.767LysAsn: 7.767 ± 1.722
2.471LysPro: 2.471 ± 0.692
3.883LysGln: 3.883 ± 1.208
4.59LysArg: 4.59 ± 1.428
4.413LysSer: 4.413 ± 1.679
4.766LysThr: 4.766 ± 0.272
5.825LysVal: 5.825 ± 0.223
0.883LysTrp: 0.883 ± 0.183
3.354LysTyr: 3.354 ± 0.448
0.0LysXaa: 0.0 ± 0.0
Leu
3.001LeuAla: 3.001 ± 0.371
1.589LeuCys: 1.589 ± 0.28
2.824LeuAsp: 2.824 ± 0.366
5.296LeuGlu: 5.296 ± 0.695
4.06LeuPhe: 4.06 ± 0.014
3.177LeuGly: 3.177 ± 0.82
1.765LeuHis: 1.765 ± 0.652
6.884LeuIle: 6.884 ± 1.205
6.002LeuLys: 6.002 ± 0.469
7.237LeuLeu: 7.237 ± 0.894
2.648LeuMet: 2.648 ± 0.829
6.531LeuAsn: 6.531 ± 1.081
3.001LeuPro: 3.001 ± 0.758
2.824LeuGln: 2.824 ± 0.408
2.471LeuArg: 2.471 ± 0.427
7.414LeuSer: 7.414 ± 0.657
3.883LeuThr: 3.883 ± 0.564
4.943LeuVal: 4.943 ± 0.922
1.236LeuTrp: 1.236 ± 0.397
5.119LeuTyr: 5.119 ± 0.986
0.0LeuXaa: 0.0 ± 0.0
Met
0.883MetAla: 0.883 ± 0.393
0.177MetCys: 0.177 ± 0.096
1.236MetAsp: 1.236 ± 0.346
2.295MetGlu: 2.295 ± 1.25
0.706MetPhe: 0.706 ± 0.102
1.765MetGly: 1.765 ± 0.413
0.353MetHis: 0.353 ± 0.192
3.001MetIle: 3.001 ± 0.48
1.412MetLys: 1.412 ± 0.204
3.001MetLeu: 3.001 ± 1.018
1.589MetMet: 1.589 ± 0.866
1.412MetAsn: 1.412 ± 0.534
0.53MetPro: 0.53 ± 0.289
1.059MetGln: 1.059 ± 0.154
1.765MetArg: 1.765 ± 0.652
1.765MetSer: 1.765 ± 0.291
0.883MetThr: 0.883 ± 0.481
1.589MetVal: 1.589 ± 0.28
0.177MetTrp: 0.177 ± 0.096
1.059MetTyr: 1.059 ± 0.273
0.0MetXaa: 0.0 ± 0.0
Asn
2.648AsnAla: 2.648 ± 0.749
1.412AsnCys: 1.412 ± 0.278
2.648AsnAsp: 2.648 ± 0.193
6.002AsnGlu: 6.002 ± 1.19
3.883AsnPhe: 3.883 ± 0.59
5.119AsnGly: 5.119 ± 1.89
1.942AsnHis: 1.942 ± 0.455
5.296AsnIle: 5.296 ± 0.609
4.59AsnLys: 4.59 ± 0.435
7.061AsnLeu: 7.061 ± 0.865
2.295AsnMet: 2.295 ± 1.031
6.178AsnAsn: 6.178 ± 1.845
2.118AsnPro: 2.118 ± 0.241
3.707AsnGln: 3.707 ± 3.491
3.707AsnArg: 3.707 ± 1.19
5.119AsnSer: 5.119 ± 0.288
2.824AsnThr: 2.824 ± 0.148
3.354AsnVal: 3.354 ± 0.441
1.059AsnTrp: 1.059 ± 0.154
3.354AsnTyr: 3.354 ± 0.448
0.0AsnXaa: 0.0 ± 0.0
Pro
0.883ProAla: 0.883 ± 0.183
0.53ProCys: 0.53 ± 0.369
2.118ProAsp: 2.118 ± 0.547
2.471ProGlu: 2.471 ± 0.306
1.942ProPhe: 1.942 ± 1.532
0.883ProGly: 0.883 ± 0.207
0.177ProHis: 0.177 ± 0.096
3.53ProIle: 3.53 ± 0.592
1.765ProLys: 1.765 ± 0.786
2.471ProLeu: 2.471 ± 0.691
0.353ProMet: 0.353 ± 0.459
2.295ProAsn: 2.295 ± 0.947
0.706ProPro: 0.706 ± 0.283
0.353ProGln: 0.353 ± 0.454
1.236ProArg: 1.236 ± 0.367
2.295ProSer: 2.295 ± 0.638
1.059ProThr: 1.059 ± 0.367
1.765ProVal: 1.765 ± 0.291
0.706ProTrp: 0.706 ± 0.442
1.059ProTyr: 1.059 ± 0.154
0.0ProXaa: 0.0 ± 0.0
Gln
1.059GlnAla: 1.059 ± 0.273
0.353GlnCys: 0.353 ± 0.192
2.118GlnAsp: 2.118 ± 0.844
2.471GlnGlu: 2.471 ± 0.691
2.295GlnPhe: 2.295 ± 1.264
1.765GlnGly: 1.765 ± 1.272
0.177GlnHis: 0.177 ± 0.229
4.06GlnIle: 4.06 ± 0.74
4.06GlnLys: 4.06 ± 0.511
3.177GlnLeu: 3.177 ± 0.4
0.883GlnMet: 0.883 ± 0.183
3.354GlnAsn: 3.354 ± 2.082
1.412GlnPro: 1.412 ± 1.858
2.471GlnGln: 2.471 ± 2.739
2.295GlnArg: 2.295 ± 1.264
2.471GlnSer: 2.471 ± 0.994
1.942GlnThr: 1.942 ± 0.533
2.648GlnVal: 2.648 ± 0.447
0.706GlnTrp: 0.706 ± 0.385
2.471GlnTyr: 2.471 ± 0.542
0.0GlnXaa: 0.0 ± 0.0
Arg
0.706ArgAla: 0.706 ± 0.102
0.706ArgCys: 0.706 ± 0.102
2.648ArgAsp: 2.648 ± 0.62
2.471ArgGlu: 2.471 ± 0.427
3.177ArgPhe: 3.177 ± 1.155
1.236ArgGly: 1.236 ± 0.874
0.353ArgHis: 0.353 ± 0.192
3.53ArgIle: 3.53 ± 0.578
3.53ArgLys: 3.53 ± 0.733
3.177ArgLeu: 3.177 ± 0.559
0.353ArgMet: 0.353 ± 0.192
2.295ArgAsn: 2.295 ± 0.484
0.706ArgPro: 0.706 ± 0.283
1.765ArgGln: 1.765 ± 0.933
1.412ArgArg: 1.412 ± 0.389
3.883ArgSer: 3.883 ± 1.826
4.06ArgThr: 4.06 ± 0.014
2.648ArgVal: 2.648 ± 0.548
0.353ArgTrp: 0.353 ± 0.192
1.589ArgTyr: 1.589 ± 0.701
0.0ArgXaa: 0.0 ± 0.0
Ser
3.001SerAla: 3.001 ± 0.633
1.589SerCys: 1.589 ± 0.486
3.707SerAsp: 3.707 ± 0.596
3.53SerGlu: 3.53 ± 0.51
3.001SerPhe: 3.001 ± 0.502
2.471SerGly: 2.471 ± 0.732
1.412SerHis: 1.412 ± 0.278
5.296SerIle: 5.296 ± 0.572
7.237SerLys: 7.237 ± 1.141
6.002SerLeu: 6.002 ± 0.808
2.118SerMet: 2.118 ± 0.849
3.53SerAsn: 3.53 ± 1.791
2.824SerPro: 2.824 ± 0.831
2.824SerGln: 2.824 ± 1.793
3.001SerArg: 3.001 ± 1.246
2.824SerSer: 2.824 ± 0.831
2.648SerThr: 2.648 ± 0.193
3.177SerVal: 3.177 ± 0.169
1.412SerTrp: 1.412 ± 0.339
2.648SerTyr: 2.648 ± 0.386
0.0SerXaa: 0.0 ± 0.0
Thr
1.412ThrAla: 1.412 ± 0.462
1.236ThrCys: 1.236 ± 0.65
4.237ThrAsp: 4.237 ± 1.073
3.707ThrGlu: 3.707 ± 0.178
1.942ThrPhe: 1.942 ± 0.627
3.707ThrGly: 3.707 ± 0.459
0.883ThrHis: 0.883 ± 0.183
4.766ThrIle: 4.766 ± 0.399
4.413ThrLys: 4.413 ± 0.778
3.53ThrLeu: 3.53 ± 0.51
1.412ThrMet: 1.412 ± 0.278
3.883ThrAsn: 3.883 ± 0.724
0.883ThrPro: 0.883 ± 0.183
2.118ThrGln: 2.118 ± 0.309
2.295ThrArg: 2.295 ± 0.422
1.942ThrSer: 1.942 ± 1.253
3.177ThrThr: 3.177 ± 1.16
3.707ThrVal: 3.707 ± 0.596
0.53ThrTrp: 0.53 ± 0.289
2.295ThrTyr: 2.295 ± 0.379
0.0ThrXaa: 0.0 ± 0.0
Val
1.765ValAla: 1.765 ± 0.365
0.706ValCys: 0.706 ± 0.102
3.001ValAsp: 3.001 ± 0.371
4.413ValGlu: 4.413 ± 0.778
2.824ValPhe: 2.824 ± 0.366
2.824ValGly: 2.824 ± 0.148
1.236ValHis: 1.236 ± 0.673
4.413ValIle: 4.413 ± 1.088
6.355ValLys: 6.355 ± 0.803
4.237ValLeu: 4.237 ± 1.094
1.589ValMet: 1.589 ± 0.28
4.237ValAsn: 4.237 ± 1.094
2.471ValPro: 2.471 ± 0.267
1.942ValGln: 1.942 ± 0.561
1.589ValArg: 1.589 ± 0.231
2.824ValSer: 2.824 ± 0.677
4.06ValThr: 4.06 ± 0.529
3.354ValVal: 3.354 ± 1.251
0.53ValTrp: 0.53 ± 0.077
2.295ValTyr: 2.295 ± 0.64
0.0ValXaa: 0.0 ± 0.0
Trp
0.706TrpAla: 0.706 ± 0.442
0.177TrpCys: 0.177 ± 0.229
1.059TrpAsp: 1.059 ± 0.508
0.177TrpGlu: 0.177 ± 0.096
0.53TrpPhe: 0.53 ± 0.077
0.883TrpGly: 0.883 ± 0.183
0.53TrpHis: 0.53 ± 0.369
1.236TrpIle: 1.236 ± 0.367
0.883TrpLys: 0.883 ± 0.827
1.412TrpLeu: 1.412 ± 0.204
0.177TrpMet: 0.177 ± 0.096
0.883TrpAsn: 0.883 ± 0.925
0.706TrpPro: 0.706 ± 0.102
0.706TrpGln: 0.706 ± 0.102
0.706TrpArg: 0.706 ± 0.102
0.706TrpSer: 0.706 ± 0.283
0.53TrpThr: 0.53 ± 0.077
0.353TrpVal: 0.353 ± 0.141
0.177TrpTrp: 0.177 ± 0.229
0.53TrpTyr: 0.53 ± 0.289
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.412TyrAla: 1.412 ± 0.462
1.412TyrCys: 1.412 ± 0.566
3.001TyrAsp: 3.001 ± 0.728
2.471TyrGlu: 2.471 ± 0.461
2.824TyrPhe: 2.824 ± 1.294
2.295TyrGly: 2.295 ± 0.947
0.706TyrHis: 0.706 ± 0.385
3.53TyrIle: 3.53 ± 1.007
3.707TyrLys: 3.707 ± 0.581
3.707TyrLeu: 3.707 ± 0.178
0.53TyrMet: 0.53 ± 0.289
4.59TyrAsn: 4.59 ± 1.28
1.236TyrPro: 1.236 ± 0.65
1.412TyrGln: 1.412 ± 0.339
2.295TyrArg: 2.295 ± 0.422
3.707TyrSer: 3.707 ± 0.761
2.118TyrThr: 2.118 ± 0.826
3.001TyrVal: 3.001 ± 1.018
0.706TyrTrp: 0.706 ± 0.597
3.354TyrTyr: 3.354 ± 1.192
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (5666 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
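
A minimal sketch of protein-level bootstrapping is given below. It assumes that whole proteins are resampled with replacement and that the reported error is the standard deviation of the resampled frequencies; the page does not say which spread measure is used, and the function names, seed, and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_permille(sequences, pair):
    """Frequency (per mille) of one dipeptide over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Spread of a dipeptide frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_permille(sample, pair))
    # Assumption: the error is the standard deviation across resamples.
    return statistics.pstdev(estimates)


toy = ["MKKLAKK", "MSTNPKPQRK", "MKKAAL"]  # hypothetical sequences
print(pair_permille(toy, "KK"), "±", bootstrap_error(toy, "KK"))
```

With only three proteins, as in this proteome, each resample draws three proteins with replacement, so the estimated error can be large relative to the frequency itself.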

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski