Amino acid dipepetide frequency for Wencheng Sm shrew coronavirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.461AlaAla: 2.461 ± 1.472
1.723AlaCys: 1.723 ± 0.892
2.215AlaAsp: 2.215 ± 0.631
2.584AlaGlu: 2.584 ± 1.37
3.445AlaPhe: 3.445 ± 0.392
2.707AlaGly: 2.707 ± 0.887
0.615AlaHis: 0.615 ± 0.319
2.584AlaIle: 2.584 ± 1.015
3.445AlaLys: 3.445 ± 1.185
5.045AlaLeu: 5.045 ± 1.218
1.107AlaMet: 1.107 ± 0.674
4.061AlaAsn: 4.061 ± 1.319
1.23AlaPro: 1.23 ± 0.702
1.23AlaGln: 1.23 ± 0.892
1.354AlaArg: 1.354 ± 0.269
3.691AlaSer: 3.691 ± 1.178
1.846AlaThr: 1.846 ± 0.905
3.445AlaVal: 3.445 ± 0.996
0.123AlaTrp: 0.123 ± 0.479
3.076AlaTyr: 3.076 ± 1.099
0.0AlaXaa: 0.0 ± 0.0
Cys
1.107CysAla: 1.107 ± 0.401
1.846CysCys: 1.846 ± 0.489
2.092CysAsp: 2.092 ± 0.681
1.477CysGlu: 1.477 ± 0.476
2.215CysPhe: 2.215 ± 0.585
2.338CysGly: 2.338 ± 0.753
0.369CysHis: 0.369 ± 0.191
1.969CysIle: 1.969 ± 0.66
2.461CysLys: 2.461 ± 0.749
2.092CysLeu: 2.092 ± 0.858
0.123CysMet: 0.123 ± 0.064
1.969CysAsn: 1.969 ± 0.447
0.369CysPro: 0.369 ± 0.191
0.492CysGln: 0.492 ± 0.255
0.984CysArg: 0.984 ± 0.393
1.846CysSer: 1.846 ± 0.627
1.23CysThr: 1.23 ± 0.436
3.076CysVal: 3.076 ± 0.553
0.615CysTrp: 0.615 ± 0.319
2.215CysTyr: 2.215 ± 0.802
0.0CysXaa: 0.0 ± 0.0
Asp
2.584AspAla: 2.584 ± 0.441
2.092AspCys: 2.092 ± 0.792
3.076AspAsp: 3.076 ± 1.193
2.338AspGlu: 2.338 ± 0.97
4.184AspPhe: 4.184 ± 1.365
3.076AspGly: 3.076 ± 0.583
0.492AspHis: 0.492 ± 0.197
4.061AspIle: 4.061 ± 0.694
3.076AspLys: 3.076 ± 0.99
5.414AspLeu: 5.414 ± 0.416
1.107AspMet: 1.107 ± 0.578
4.553AspAsn: 4.553 ± 0.885
1.477AspPro: 1.477 ± 1.019
0.615AspGln: 0.615 ± 0.319
1.6AspArg: 1.6 ± 0.613
2.83AspSer: 2.83 ± 0.918
1.6AspThr: 1.6 ± 0.62
5.414AspVal: 5.414 ± 1.42
1.107AspTrp: 1.107 ± 0.574
3.076AspTyr: 3.076 ± 1.043
0.0AspXaa: 0.0 ± 0.0
Glu
1.969GluAla: 1.969 ± 0.66
1.354GluCys: 1.354 ± 0.591
3.322GluAsp: 3.322 ± 0.833
2.953GluGlu: 2.953 ± 0.484
3.445GluPhe: 3.445 ± 0.564
2.707GluGly: 2.707 ± 0.42
0.861GluHis: 0.861 ± 0.446
2.215GluIle: 2.215 ± 0.53
3.691GluLys: 3.691 ± 0.91
3.814GluLeu: 3.814 ± 1.192
0.615GluMet: 0.615 ± 0.209
3.322GluAsn: 3.322 ± 0.567
0.984GluPro: 0.984 ± 0.7
1.354GluGln: 1.354 ± 0.695
2.215GluArg: 2.215 ± 1.05
3.814GluSer: 3.814 ± 0.807
2.338GluThr: 2.338 ± 0.816
5.045GluVal: 5.045 ± 0.373
0.738GluTrp: 0.738 ± 0.238
1.846GluTyr: 1.846 ± 0.605
0.0GluXaa: 0.0 ± 0.0
Phe
1.6PheAla: 1.6 ± 0.575
2.338PheCys: 2.338 ± 0.5
3.199PheAsp: 3.199 ± 0.612
3.814PheGlu: 3.814 ± 0.834
2.338PhePhe: 2.338 ± 1.211
5.537PheGly: 5.537 ± 0.641
0.738PheHis: 0.738 ± 0.551
3.076PheIle: 3.076 ± 0.954
4.061PheLys: 4.061 ± 1.655
5.537PheLeu: 5.537 ± 2.173
1.846PheMet: 1.846 ± 0.489
4.307PheAsn: 4.307 ± 1.248
0.861PhePro: 0.861 ± 0.66
1.477PheGln: 1.477 ± 0.476
1.23PheArg: 1.23 ± 0.637
4.922PheSer: 4.922 ± 1.12
3.076PheThr: 3.076 ± 0.483
6.645PheVal: 6.645 ± 1.828
1.23PheTrp: 1.23 ± 0.303
3.322PheTyr: 3.322 ± 0.558
0.0PheXaa: 0.0 ± 0.0
Gly
2.338GlyAla: 2.338 ± 0.947
1.23GlyCys: 1.23 ± 0.418
3.199GlyAsp: 3.199 ± 0.612
2.215GlyGlu: 2.215 ± 0.404
4.922GlyPhe: 4.922 ± 1.153
3.691GlyGly: 3.691 ± 0.694
1.354GlyHis: 1.354 ± 1.172
3.199GlyIle: 3.199 ± 2.19
4.922GlyLys: 4.922 ± 2.269
7.629GlyLeu: 7.629 ± 0.865
1.107GlyMet: 1.107 ± 0.89
3.937GlyAsn: 3.937 ± 0.653
1.846GlyPro: 1.846 ± 2.064
1.6GlyGln: 1.6 ± 0.678
1.6GlyArg: 1.6 ± 1.062
5.168GlySer: 5.168 ± 0.75
2.215GlyThr: 2.215 ± 0.761
7.506GlyVal: 7.506 ± 1.569
0.861GlyTrp: 0.861 ± 0.553
2.461GlyTyr: 2.461 ± 1.035
0.0GlyXaa: 0.0 ± 0.0
His
0.984HisAla: 0.984 ± 0.452
0.492HisCys: 0.492 ± 0.255
0.738HisAsp: 0.738 ± 0.551
0.492HisGlu: 0.492 ± 0.407
1.23HisPhe: 1.23 ± 0.436
1.723HisGly: 1.723 ± 0.563
0.246HisHis: 0.246 ± 0.127
0.984HisIle: 0.984 ± 0.437
0.738HisLys: 0.738 ± 0.404
1.477HisLeu: 1.477 ± 1.008
0.123HisMet: 0.123 ± 0.064
1.107HisAsn: 1.107 ± 0.574
0.369HisPro: 0.369 ± 0.205
0.492HisGln: 0.492 ± 0.255
0.492HisArg: 0.492 ± 0.552
0.492HisSer: 0.492 ± 0.568
0.492HisThr: 0.492 ± 0.197
2.092HisVal: 2.092 ± 0.354
0.123HisTrp: 0.123 ± 0.064
0.246HisTyr: 0.246 ± 0.127
0.0HisXaa: 0.0 ± 0.0
Ile
2.338IleAla: 2.338 ± 0.816
1.846IleCys: 1.846 ± 0.925
1.969IleAsp: 1.969 ± 0.594
3.076IleGlu: 3.076 ± 1.461
1.846IlePhe: 1.846 ± 0.468
4.184IleGly: 4.184 ± 0.745
0.369IleHis: 0.369 ± 0.205
3.199IleIle: 3.199 ± 1.356
4.676IleLys: 4.676 ± 0.687
4.553IleLeu: 4.553 ± 1.349
1.6IleMet: 1.6 ± 0.515
3.814IleAsn: 3.814 ± 1.31
1.723IlePro: 1.723 ± 0.95
2.338IleGln: 2.338 ± 1.449
2.338IleArg: 2.338 ± 0.753
2.707IleSer: 2.707 ± 0.505
3.076IleThr: 3.076 ± 1.672
8.121IleVal: 8.121 ± 1.889
0.369IleTrp: 0.369 ± 0.205
1.846IleTyr: 1.846 ± 0.843
0.0IleXaa: 0.0 ± 0.0
Lys
2.953LysAla: 2.953 ± 1.089
3.199LysCys: 3.199 ± 0.958
5.168LysAsp: 5.168 ± 1.644
3.445LysGlu: 3.445 ± 1.238
3.568LysPhe: 3.568 ± 1.313
5.291LysGly: 5.291 ± 0.548
1.723LysHis: 1.723 ± 0.498
3.199LysIle: 3.199 ± 1.192
4.307LysLys: 4.307 ± 2.244
4.922LysLeu: 4.922 ± 1.107
1.846LysMet: 1.846 ± 0.698
5.291LysAsn: 5.291 ± 0.926
2.338LysPro: 2.338 ± 0.378
2.953LysGln: 2.953 ± 0.494
1.969LysArg: 1.969 ± 1.063
4.061LysSer: 4.061 ± 0.731
2.953LysThr: 2.953 ± 0.581
7.506LysVal: 7.506 ± 3.145
0.984LysTrp: 0.984 ± 0.759
3.937LysTyr: 3.937 ± 0.998
0.0LysXaa: 0.0 ± 0.0
Leu
4.799LeuAla: 4.799 ± 1.111
2.707LeuCys: 2.707 ± 0.505
4.676LeuAsp: 4.676 ± 0.721
4.553LeuGlu: 4.553 ± 1.065
4.799LeuPhe: 4.799 ± 0.881
4.553LeuGly: 4.553 ± 0.748
1.23LeuHis: 1.23 ± 0.46
5.414LeuIle: 5.414 ± 2.014
6.645LeuLys: 6.645 ± 1.141
8.859LeuLeu: 8.859 ± 1.364
1.969LeuMet: 1.969 ± 1.02
5.906LeuAsn: 5.906 ± 1.924
3.199LeuPro: 3.199 ± 1.64
3.322LeuGln: 3.322 ± 0.343
2.707LeuArg: 2.707 ± 0.482
5.906LeuSer: 5.906 ± 1.055
5.168LeuThr: 5.168 ± 2.196
6.521LeuVal: 6.521 ± 1.286
1.6LeuTrp: 1.6 ± 1.893
5.291LeuTyr: 5.291 ± 1.064
0.0LeuXaa: 0.0 ± 0.0
Met
1.107MetAla: 1.107 ± 0.614
0.738MetCys: 0.738 ± 0.238
0.615MetAsp: 0.615 ± 0.431
1.354MetGlu: 1.354 ± 0.591
1.23MetPhe: 1.23 ± 1.198
1.846MetGly: 1.846 ± 0.627
0.615MetHis: 0.615 ± 0.461
1.846MetIle: 1.846 ± 0.468
0.492MetLys: 0.492 ± 0.197
2.83MetLeu: 2.83 ± 0.959
0.861MetMet: 0.861 ± 0.446
1.6MetAsn: 1.6 ± 0.515
0.738MetPro: 0.738 ± 0.871
0.369MetGln: 0.369 ± 0.191
1.23MetArg: 1.23 ± 0.637
2.707MetSer: 2.707 ± 0.947
1.23MetThr: 1.23 ± 0.436
1.23MetVal: 1.23 ± 0.436
0.123MetTrp: 0.123 ± 0.064
1.107MetTyr: 1.107 ± 0.614
0.0MetXaa: 0.0 ± 0.0
Asn
3.568AsnAla: 3.568 ± 1.352
2.338AsnCys: 2.338 ± 0.753
2.953AsnAsp: 2.953 ± 0.486
1.846AsnGlu: 1.846 ± 0.275
5.168AsnPhe: 5.168 ± 1.363
5.66AsnGly: 5.66 ± 1.135
0.861AsnHis: 0.861 ± 0.279
4.307AsnIle: 4.307 ± 0.839
6.152AsnLys: 6.152 ± 1.614
5.045AsnLeu: 5.045 ± 2.154
2.092AsnMet: 2.092 ± 0.829
4.799AsnAsn: 4.799 ± 1.399
1.6AsnPro: 1.6 ± 0.425
1.354AsnGln: 1.354 ± 0.444
0.984AsnArg: 0.984 ± 0.393
6.398AsnSer: 6.398 ± 1.017
3.322AsnThr: 3.322 ± 1.289
7.875AsnVal: 7.875 ± 1.808
0.738AsnTrp: 0.738 ± 0.638
2.953AsnTyr: 2.953 ± 0.71
0.0AsnXaa: 0.0 ± 0.0
Pro
1.107ProAla: 1.107 ± 0.728
0.492ProCys: 0.492 ± 0.461
1.354ProAsp: 1.354 ± 0.548
1.354ProGlu: 1.354 ± 1.118
1.23ProPhe: 1.23 ± 0.599
1.969ProGly: 1.969 ± 0.6
0.492ProHis: 0.492 ± 0.255
2.092ProIle: 2.092 ± 1.841
1.969ProLys: 1.969 ± 1.518
3.322ProLeu: 3.322 ± 1.425
0.123ProMet: 0.123 ± 0.064
1.477ProAsn: 1.477 ± 0.571
1.354ProPro: 1.354 ± 0.509
1.23ProGln: 1.23 ± 1.16
1.107ProArg: 1.107 ± 1.235
2.092ProSer: 2.092 ± 1.854
2.215ProThr: 2.215 ± 2.458
2.584ProVal: 2.584 ± 0.523
0.246ProTrp: 0.246 ± 0.673
0.984ProTyr: 0.984 ± 0.51
0.0ProXaa: 0.0 ± 0.0
Gln
2.215GlnAla: 2.215 ± 0.358
0.246GlnCys: 0.246 ± 0.127
2.092GlnAsp: 2.092 ± 0.99
1.477GlnGlu: 1.477 ± 0.583
1.477GlnPhe: 1.477 ± 0.818
1.723GlnGly: 1.723 ± 0.25
0.492GlnHis: 0.492 ± 0.568
1.6GlnIle: 1.6 ± 0.948
1.723GlnLys: 1.723 ± 0.558
2.953GlnLeu: 2.953 ± 1.151
0.738GlnMet: 0.738 ± 0.378
1.723GlnAsn: 1.723 ± 0.835
0.615GlnPro: 0.615 ± 0.69
0.738GlnGln: 0.738 ± 0.691
0.738GlnArg: 0.738 ± 0.404
2.584GlnSer: 2.584 ± 0.57
1.723GlnThr: 1.723 ± 0.558
1.846GlnVal: 1.846 ± 0.627
0.369GlnTrp: 0.369 ± 0.191
1.354GlnTyr: 1.354 ± 1.414
0.0GlnXaa: 0.0 ± 0.0
Arg
1.6ArgAla: 1.6 ± 0.568
0.861ArgCys: 0.861 ± 0.446
1.354ArgAsp: 1.354 ± 0.68
0.738ArgGlu: 0.738 ± 0.382
2.83ArgPhe: 2.83 ± 0.471
1.6ArgGly: 1.6 ± 1.12
0.615ArgHis: 0.615 ± 0.369
1.723ArgIle: 1.723 ± 0.25
1.354ArgLys: 1.354 ± 1.118
2.953ArgLeu: 2.953 ± 0.484
0.738ArgMet: 0.738 ± 0.238
2.215ArgAsn: 2.215 ± 0.802
0.738ArgPro: 0.738 ± 0.46
0.738ArgGln: 0.738 ± 0.672
1.354ArgArg: 1.354 ± 0.614
3.076ArgSer: 3.076 ± 2.867
1.23ArgThr: 1.23 ± 0.649
3.199ArgVal: 3.199 ± 0.658
0.123ArgTrp: 0.123 ± 0.064
1.23ArgTyr: 1.23 ± 0.303
0.0ArgXaa: 0.0 ± 0.0
Ser
4.061SerAla: 4.061 ± 2.343
2.092SerCys: 2.092 ± 1.246
3.937SerAsp: 3.937 ± 0.886
3.691SerGlu: 3.691 ± 1.386
3.199SerPhe: 3.199 ± 0.778
4.43SerGly: 4.43 ± 1.895
0.984SerHis: 0.984 ± 0.562
3.814SerIle: 3.814 ± 1.506
5.783SerLys: 5.783 ± 1.126
5.783SerLeu: 5.783 ± 1.083
2.584SerMet: 2.584 ± 0.838
5.045SerAsn: 5.045 ± 1.638
1.477SerPro: 1.477 ± 1.034
2.338SerGln: 2.338 ± 0.378
2.215SerArg: 2.215 ± 2.831
3.691SerSer: 3.691 ± 1.461
2.953SerThr: 2.953 ± 0.71
7.752SerVal: 7.752 ± 1.103
1.107SerTrp: 1.107 ± 0.684
3.199SerTyr: 3.199 ± 0.674
0.0SerXaa: 0.0 ± 0.0
Thr
2.707ThrAla: 2.707 ± 0.587
0.615ThrCys: 0.615 ± 0.209
2.092ThrAsp: 2.092 ± 0.858
1.846ThrGlu: 1.846 ± 0.627
3.691ThrPhe: 3.691 ± 0.759
2.338ThrGly: 2.338 ± 1.283
1.23ThrHis: 1.23 ± 0.756
3.199ThrIle: 3.199 ± 0.674
3.568ThrLys: 3.568 ± 1.253
3.445ThrLeu: 3.445 ± 2.042
1.969ThrMet: 1.969 ± 0.651
2.461ThrAsn: 2.461 ± 1.452
1.6ThrPro: 1.6 ± 0.71
1.846ThrGln: 1.846 ± 0.389
1.6ThrArg: 1.6 ± 0.24
3.322ThrSer: 3.322 ± 0.567
3.814ThrThr: 3.814 ± 0.873
3.814ThrVal: 3.814 ± 0.966
0.246ThrTrp: 0.246 ± 0.23
1.969ThrTyr: 1.969 ± 0.594
0.0ThrXaa: 0.0 ± 0.0
Val
5.906ValAla: 5.906 ± 1.246
2.83ValCys: 2.83 ± 0.528
5.537ValAsp: 5.537 ± 2.059
5.66ValGlu: 5.66 ± 1.111
6.768ValPhe: 6.768 ± 2.019
4.307ValGly: 4.307 ± 1.394
0.984ValHis: 0.984 ± 0.393
4.307ValIle: 4.307 ± 1.087
8.736ValLys: 8.736 ± 2.45
8.613ValLeu: 8.613 ± 2.767
2.338ValMet: 2.338 ± 1.361
7.014ValAsn: 7.014 ± 1.702
4.676ValPro: 4.676 ± 2.096
2.707ValGln: 2.707 ± 0.538
2.092ValArg: 2.092 ± 1.112
7.383ValSer: 7.383 ± 0.71
4.184ValThr: 4.184 ± 0.869
11.936ValVal: 11.936 ± 3.812
0.738ValTrp: 0.738 ± 0.409
4.922ValTyr: 4.922 ± 0.943
0.0ValXaa: 0.0 ± 0.0
Trp
0.246TrpAla: 0.246 ± 0.127
0.369TrpCys: 0.369 ± 0.191
1.107TrpAsp: 1.107 ± 0.265
0.369TrpGlu: 0.369 ± 0.191
0.861TrpPhe: 0.861 ± 0.468
0.369TrpGly: 0.369 ± 0.191
0.246TrpHis: 0.246 ± 0.127
0.861TrpIle: 0.861 ± 0.279
0.615TrpLys: 0.615 ± 0.319
1.6TrpLeu: 1.6 ± 0.978
0.0TrpMet: 0.0 ± 0.0
1.6TrpAsn: 1.6 ± 0.753
0.369TrpPro: 0.369 ± 0.625
0.492TrpGln: 0.492 ± 0.461
0.369TrpArg: 0.369 ± 0.497
0.861TrpSer: 0.861 ± 1.417
0.615TrpThr: 0.615 ± 0.431
0.615TrpVal: 0.615 ± 0.848
0.246TrpTrp: 0.246 ± 0.127
0.984TrpTyr: 0.984 ± 0.831
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.338TyrAla: 2.338 ± 0.5
1.477TyrCys: 1.477 ± 0.553
3.076TyrAsp: 3.076 ± 1.36
3.322TyrGlu: 3.322 ± 0.642
2.707TyrPhe: 2.707 ± 0.712
2.953TyrGly: 2.953 ± 0.809
0.615TyrHis: 0.615 ± 0.319
2.338TyrIle: 2.338 ± 0.816
3.199TyrLys: 3.199 ± 0.851
3.937TyrLeu: 3.937 ± 1.424
0.861TyrMet: 0.861 ± 0.279
3.937TyrAsn: 3.937 ± 1.109
1.23TyrPro: 1.23 ± 0.506
0.738TyrGln: 0.738 ± 0.46
2.092TyrArg: 2.092 ± 1.826
2.584TyrSer: 2.584 ± 1.531
2.092TyrThr: 2.092 ± 0.574
5.414TyrVal: 5.414 ± 0.78
1.107TyrTrp: 1.107 ± 0.265
2.707TyrTyr: 2.707 ± 0.74
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (8128 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski