Amino acid dipeptide frequency for Alphacoronavirus BtMs-AlphaCoV/GS2013

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
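As a rough illustration of what such a calculation involves, the sketch below counts overlapping adjacent pairs in each protein and reports them in per mille. It is a minimal sketch, not the database's own pipeline: the function name `dipeptide_frequencies`, the use of one-letter codes, the choice to normalise by the total number of pairs, and the toy sequences are all assumptions made for the example.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: overlapping pairs are counted inside each protein and
    pairs never span two proteins; the result is normalised by the
    total number of pairs observed.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy sequences (hypothetical, not taken from this proteome).
freqs = dipeptide_frequencies(["MAAL", "AAV"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f}")
```

Here the two toy sequences contain five pairs in total, two of which are AA, so AA accounts for 400 per mille.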

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
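The arithmetic behind the expected value can be sketched as follows. The page does not state the exact rule used for the red and blue marking, so the simple comparison against the uniform expectation below is an assumption, as are the function names; the three example values are taken from the table on this page.

```python
STANDARD_AMINO_ACIDS = 20

def expected_uniform_per_mille(alphabet_size=STANDARD_AMINO_ACIDS):
    """Expected frequency of one dipeptide if all pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2

def classify(observed_per_mille, expected_per_mille):
    """Assumed rule: compare the observed value with the uniform expectation."""
    if observed_per_mille > expected_per_mille:
        return "over"
    if observed_per_mille < expected_per_mille:
        return "under"
    return "as expected"

expected = expected_uniform_per_mille()  # 1000 / 400 = 2.5 per mille = 0.25%
examples = {"ValVal": 9.44, "AlaAla": 5.507, "TrpCys": 0.112}
labels = {pair: classify(value, expected) for pair, value in examples.items()}
print(expected, labels)
```

With Xaa included, `expected_uniform_per_mille(21)` gives 1000 / 441, roughly 2.27‰.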

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Column order (second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.507 ± 1.822
AlaCys: 2.36 ± 0.567
AlaAsp: 2.585 ± 0.271
AlaGlu: 2.697 ± 1.364
AlaPhe: 4.833 ± 0.886
AlaGly: 3.147 ± 0.875
AlaHis: 1.124 ± 0.356
AlaIle: 4.495 ± 0.477
AlaLys: 3.372 ± 0.871
AlaLeu: 5.732 ± 1.211
AlaMet: 1.461 ± 0.214
AlaAsn: 3.147 ± 0.846
AlaPro: 2.585 ± 0.86
AlaGln: 1.911 ± 0.959
AlaArg: 2.023 ± 0.772
AlaSer: 4.833 ± 0.39
AlaThr: 4.383 ± 1.209
AlaVal: 5.844 ± 1.029
AlaTrp: 0.562 ± 0.178
AlaTyr: 2.81 ± 0.809
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.798 ± 0.317
CysCys: 1.236 ± 0.388
CysAsp: 1.686 ± 0.682
CysGlu: 1.011 ± 0.519
CysPhe: 2.248 ± 0.506
CysGly: 2.023 ± 0.638
CysHis: 0.562 ± 0.359
CysIle: 1.236 ± 0.333
CysLys: 2.023 ± 0.708
CysLeu: 1.911 ± 0.777
CysMet: 0.562 ± 0.288
CysAsn: 2.135 ± 0.678
CysPro: 0.674 ± 0.213
CysGln: 0.674 ± 0.316
CysArg: 1.124 ± 0.356
CysSer: 2.023 ± 0.43
CysThr: 2.36 ± 0.438
CysVal: 3.484 ± 0.934
CysTrp: 0.45 ± 0.231
CysTyr: 2.135 ± 1.095
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.484 ± 0.775
AspCys: 1.461 ± 0.489
AspAsp: 1.911 ± 0.379
AspGlu: 2.585 ± 0.813
AspPhe: 3.709 ± 1.151
AspGly: 4.72 ± 1.542
AspHis: 0.899 ± 0.42
AspIle: 3.484 ± 0.477
AspLys: 2.472 ± 0.37
AspLeu: 4.271 ± 0.425
AspMet: 0.899 ± 0.303
AspAsn: 2.697 ± 0.909
AspPro: 1.461 ± 0.57
AspGln: 1.236 ± 0.905
AspArg: 1.124 ± 0.209
AspSer: 3.372 ± 0.918
AspThr: 2.023 ± 0.326
AspVal: 5.282 ± 0.554
AspTrp: 0.787 ± 0.403
AspTyr: 3.259 ± 1.076
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.697 ± 0.258
GluCys: 1.124 ± 0.356
GluAsp: 3.034 ± 1.18
GluGlu: 2.472 ± 0.245
GluPhe: 3.034 ± 1.068
GluGly: 2.81 ± 1.037
GluHis: 1.349 ± 0.559
GluIle: 2.248 ± 0.945
GluLys: 2.36 ± 0.425
GluLeu: 3.372 ± 0.996
GluMet: 0.674 ± 0.392
GluAsn: 2.472 ± 0.786
GluPro: 2.135 ± 0.409
GluGln: 1.798 ± 0.789
GluArg: 1.911 ± 0.405
GluSer: 3.034 ± 0.524
GluThr: 1.236 ± 0.808
GluVal: 5.17 ± 1.204
GluTrp: 0.45 ± 0.158
GluTyr: 1.573 ± 0.333
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.608 ± 1.897
PheCys: 1.798 ± 0.606
PheAsp: 4.046 ± 0.69
PheGlu: 2.697 ± 0.725
PhePhe: 2.472 ± 0.47
PheGly: 3.484 ± 1.082
PheHis: 0.225 ± 0.177
PheIle: 1.798 ± 0.729
PheLys: 4.158 ± 1.107
PheLeu: 4.383 ± 0.977
PheMet: 1.461 ± 0.594
PheAsn: 4.383 ± 0.931
PhePro: 0.674 ± 0.316
PheGln: 1.011 ± 0.559
PheArg: 1.349 ± 0.695
PheSer: 4.608 ± 1.183
PheThr: 3.034 ± 0.677
PheVal: 5.956 ± 1.13
PheTrp: 1.349 ± 1.016
PheTyr: 3.034 ± 0.389
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.484 ± 0.934
GlyCys: 1.798 ± 0.565
GlyAsp: 4.383 ± 0.663
GlyGlu: 2.36 ± 0.239
GlyPhe: 3.933 ± 0.895
GlyGly: 4.495 ± 1.117
GlyHis: 0.45 ± 0.231
GlyIle: 3.484 ± 1.09
GlyLys: 4.608 ± 0.682
GlyLeu: 4.72 ± 0.466
GlyMet: 1.011 ± 0.519
GlyAsn: 4.158 ± 1.219
GlyPro: 1.798 ± 0.46
GlyGln: 1.911 ± 0.951
GlyArg: 1.911 ± 0.665
GlySer: 5.17 ± 0.801
GlyThr: 3.484 ± 1.207
GlyVal: 8.878 ± 1.255
GlyTrp: 0.562 ± 0.34
GlyTyr: 2.023 ± 0.709
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.686 ± 0.409
HisCys: 0.787 ± 0.403
HisAsp: 0.899 ± 0.303
HisGlu: 0.45 ± 0.231
HisPhe: 0.562 ± 0.294
HisGly: 0.45 ± 0.231
HisHis: 0.225 ± 0.434
HisIle: 0.674 ± 0.213
HisLys: 1.124 ± 0.406
HisLeu: 1.461 ± 0.611
HisMet: 0.225 ± 0.115
HisAsn: 1.461 ± 0.381
HisPro: 0.45 ± 0.231
HisGln: 0.674 ± 0.365
HisArg: 0.225 ± 0.115
HisSer: 1.349 ± 0.335
HisThr: 1.686 ± 0.535
HisVal: 1.461 ± 0.266
HisTrp: 0.225 ± 0.115
HisTyr: 0.899 ± 0.332
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.147 ± 0.84
IleCys: 0.899 ± 0.361
IleAsp: 2.248 ± 0.567
IleGlu: 1.798 ± 0.498
IlePhe: 2.36 ± 0.688
IleGly: 3.933 ± 1.006
IleHis: 1.011 ± 0.332
IleIle: 2.36 ± 0.812
IleLys: 4.158 ± 0.651
IleLeu: 4.046 ± 0.85
IleMet: 1.011 ± 0.357
IleAsn: 2.697 ± 0.909
IlePro: 2.36 ± 1.01
IleGln: 2.135 ± 1.338
IleArg: 1.236 ± 0.388
IleSer: 4.72 ± 1.001
IleThr: 4.945 ± 2.059
IleVal: 4.495 ± 0.446
IleTrp: 0.337 ± 0.173
IleTyr: 2.585 ± 1.067
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.372 ± 0.695
LysCys: 2.248 ± 0.488
LysAsp: 3.372 ± 0.558
LysGlu: 3.933 ± 0.724
LysPhe: 3.259 ± 0.728
LysGly: 3.147 ± 1.236
LysHis: 1.911 ± 0.735
LysIle: 3.259 ± 1.119
LysLys: 1.686 ± 0.478
LysLeu: 5.619 ± 1.408
LysMet: 1.011 ± 0.514
LysAsn: 2.472 ± 0.393
LysPro: 3.259 ± 0.82
LysGln: 2.697 ± 1.184
LysArg: 2.023 ± 0.723
LysSer: 3.372 ± 0.911
LysThr: 3.933 ± 0.63
LysVal: 5.732 ± 1.146
LysTrp: 0.787 ± 0.364
LysTyr: 3.372 ± 0.658
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.17 ± 0.777
LeuCys: 2.472 ± 0.869
LeuAsp: 3.372 ± 0.572
LeuGlu: 3.596 ± 0.8
LeuPhe: 3.821 ± 0.707
LeuGly: 4.72 ± 0.843
LeuHis: 2.248 ± 0.591
LeuIle: 3.034 ± 1.5
LeuLys: 5.17 ± 1.037
LeuLeu: 7.53 ± 1.918
LeuMet: 1.236 ± 0.42
LeuAsn: 5.619 ± 1.374
LeuPro: 3.484 ± 0.94
LeuGln: 3.596 ± 0.992
LeuArg: 2.472 ± 1.071
LeuSer: 6.181 ± 0.787
LeuThr: 4.271 ± 1.237
LeuVal: 6.968 ± 1.757
LeuTrp: 1.349 ± 1.097
LeuTyr: 5.282 ± 1.206
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.899 ± 0.332
MetCys: 1.349 ± 0.692
MetAsp: 0.674 ± 0.346
MetGlu: 0.562 ± 0.366
MetPhe: 1.236 ± 0.509
MetGly: 1.124 ± 0.467
MetHis: 0.45 ± 0.231
MetIle: 1.349 ± 0.347
MetLys: 0.674 ± 0.296
MetLeu: 2.248 ± 0.765
MetMet: 0.562 ± 0.39
MetAsn: 0.562 ± 0.366
MetPro: 0.787 ± 0.309
MetGln: 0.787 ± 0.505
MetArg: 0.787 ± 0.255
MetSer: 1.911 ± 0.831
MetThr: 1.011 ± 0.434
MetVal: 1.911 ± 0.47
MetTrp: 0.225 ± 0.115
MetTyr: 1.573 ± 0.428
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.147 ± 1.083
AsnCys: 2.36 ± 0.438
AsnAsp: 3.034 ± 0.489
AsnGlu: 3.147 ± 0.862
AsnPhe: 3.259 ± 0.791
AsnGly: 7.08 ± 1.076
AsnHis: 0.787 ± 0.403
AsnIle: 3.372 ± 0.728
AsnLys: 3.933 ± 0.632
AsnLeu: 3.596 ± 0.787
AsnMet: 1.011 ± 0.294
AsnAsn: 3.372 ± 0.487
AsnPro: 1.798 ± 0.949
AsnGln: 1.461 ± 1.378
AsnArg: 1.911 ± 0.435
AsnSer: 4.833 ± 1.436
AsnThr: 3.147 ± 0.31
AsnVal: 7.755 ± 2.009
AsnTrp: 0.45 ± 0.771
AsnTyr: 1.798 ± 0.633
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.372 ± 0.914
ProCys: 0.899 ± 0.298
ProAsp: 1.573 ± 0.511
ProGlu: 2.585 ± 0.45
ProPhe: 1.686 ± 0.814
ProGly: 2.248 ± 0.813
ProHis: 0.45 ± 0.303
ProIle: 1.236 ± 0.273
ProLys: 1.573 ± 0.887
ProLeu: 3.709 ± 0.72
ProMet: 0.674 ± 0.346
ProAsn: 1.573 ± 0.748
ProPro: 1.573 ± 0.431
ProGln: 0.787 ± 0.364
ProArg: 1.461 ± 0.563
ProSer: 2.585 ± 0.666
ProThr: 2.36 ± 1.402
ProVal: 3.034 ± 1.133
ProTrp: 0.674 ± 0.413
ProTyr: 1.236 ± 0.273
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.472 ± 0.597
GlnCys: 0.45 ± 0.231
GlnAsp: 1.349 ± 0.515
GlnGlu: 1.461 ± 0.479
GlnPhe: 1.236 ± 0.32
GlnGly: 1.798 ± 0.846
GlnHis: 0.225 ± 0.115
GlnIle: 1.911 ± 0.47
GlnLys: 1.686 ± 0.806
GlnLeu: 4.383 ± 2.21
GlnMet: 0.787 ± 0.307
GlnAsn: 1.911 ± 1.094
GlnPro: 1.911 ± 0.997
GlnGln: 1.349 ± 0.601
GlnArg: 1.686 ± 0.354
GlnSer: 1.573 ± 0.927
GlnThr: 1.911 ± 0.366
GlnVal: 2.023 ± 1.343
GlnTrp: 0.337 ± 0.173
GlnTyr: 1.349 ± 0.732
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.585 ± 0.927
ArgCys: 1.349 ± 0.515
ArgAsp: 1.124 ± 0.531
ArgGlu: 0.337 ± 0.412
ArgPhe: 2.135 ± 0.468
ArgGly: 2.472 ± 1.09
ArgHis: 0.45 ± 0.231
ArgIle: 1.686 ± 0.489
ArgLys: 2.472 ± 0.893
ArgLeu: 2.922 ± 0.828
ArgMet: 1.011 ± 0.362
ArgAsn: 2.585 ± 1.12
ArgPro: 0.45 ± 0.303
ArgGln: 0.899 ± 0.604
ArgArg: 0.562 ± 0.337
ArgSer: 2.36 ± 2.578
ArgThr: 2.135 ± 1.035
ArgVal: 3.034 ± 0.508
ArgTrp: 0.225 ± 0.115
ArgTyr: 1.461 ± 0.339
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.72 ± 0.657
SerCys: 2.472 ± 0.374
SerAsp: 3.147 ± 0.524
SerGlu: 2.248 ± 0.813
SerPhe: 5.282 ± 1.581
SerGly: 4.833 ± 0.655
SerHis: 1.236 ± 0.351
SerIle: 4.158 ± 2.036
SerLys: 4.046 ± 1.031
SerLeu: 4.833 ± 1.138
SerMet: 1.573 ± 0.608
SerAsn: 5.17 ± 0.971
SerPro: 1.236 ± 0.863
SerGln: 2.585 ± 2.45
SerArg: 2.922 ± 1.864
SerSer: 5.507 ± 1.092
SerThr: 4.833 ± 1.014
SerVal: 8.204 ± 1.591
SerTrp: 0.899 ± 0.658
SerTyr: 4.046 ± 0.487
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.472 ± 0.82
ThrCys: 1.573 ± 0.489
ThrAsp: 3.034 ± 0.356
ThrGlu: 2.922 ± 1.092
ThrPhe: 2.697 ± 0.696
ThrGly: 3.821 ± 1.695
ThrHis: 0.787 ± 0.507
ThrIle: 3.933 ± 0.838
ThrLys: 3.933 ± 1.079
ThrLeu: 5.057 ± 1.461
ThrMet: 1.798 ± 0.738
ThrAsn: 3.372 ± 0.484
ThrPro: 2.023 ± 0.647
ThrGln: 1.573 ± 0.474
ThrArg: 2.248 ± 0.765
ThrSer: 4.72 ± 0.875
ThrThr: 3.821 ± 0.426
ThrVal: 6.406 ± 0.961
ThrTrp: 0.562 ± 0.366
ThrTyr: 2.922 ± 1.19
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.631 ± 0.697
ValCys: 3.147 ± 0.988
ValAsp: 5.507 ± 0.997
ValGlu: 5.619 ± 0.853
ValPhe: 6.069 ± 0.873
ValGly: 4.72 ± 0.769
ValHis: 1.349 ± 0.425
ValIle: 5.956 ± 1.025
ValLys: 7.642 ± 2.256
ValLeu: 7.979 ± 1.685
ValMet: 2.135 ± 0.373
ValAsn: 6.294 ± 0.747
ValPro: 4.495 ± 2.121
ValGln: 3.484 ± 0.358
ValArg: 3.034 ± 1.027
ValSer: 7.53 ± 0.868
ValThr: 5.732 ± 0.557
ValVal: 9.44 ± 1.492
ValTrp: 1.011 ± 0.354
ValTyr: 2.36 ± 0.598
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.674 ± 1.086
TrpCys: 0.112 ± 0.058
TrpAsp: 0.787 ± 0.403
TrpGlu: 0.562 ± 0.288
TrpPhe: 0.787 ± 0.312
TrpGly: 0.225 ± 0.115
TrpHis: 0.225 ± 0.35
TrpIle: 0.674 ± 0.352
TrpLys: 0.45 ± 0.699
TrpLeu: 1.349 ± 0.472
TrpMet: 0.225 ± 0.177
TrpAsn: 1.236 ± 0.74
TrpPro: 0.45 ± 0.533
TrpGln: 0.112 ± 0.058
TrpArg: 0.562 ± 0.178
TrpSer: 1.124 ± 0.356
TrpThr: 0.45 ± 0.158
TrpVal: 0.899 ± 0.467
TrpTrp: 0.225 ± 0.115
TrpTyr: 0.674 ± 0.335
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.484 ± 1.152
TyrCys: 1.461 ± 0.624
TyrAsp: 3.372 ± 1.475
TyrGlu: 1.911 ± 0.646
TyrPhe: 1.911 ± 0.445
TyrGly: 3.372 ± 0.565
TyrHis: 1.011 ± 0.332
TyrIle: 2.135 ± 0.799
TyrLys: 2.922 ± 1.362
TyrLeu: 2.472 ± 0.327
TyrMet: 1.236 ± 0.32
TyrAsn: 3.933 ± 0.751
TyrPro: 1.686 ± 0.255
TyrGln: 1.124 ± 0.324
TyrArg: 1.686 ± 0.844
TyrSer: 3.147 ± 0.857
TyrThr: 2.922 ± 0.867
TyrVal: 4.271 ± 0.888
TyrTrp: 0.337 ± 0.322
TyrTyr: 3.259 ± 0.771
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (8899 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
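One way to read this note is sketched below: whole proteins are resampled with replacement 100 times, the frequency is recomputed for each resample, and the spread of those estimates is reported. The page does not say which statistic is used as the error, so taking the standard deviation across resamples is an assumption, as are the helper names, the fixed seed, and the toy sequences.

```python
import random
import statistics
from collections import Counter

def pair_frequency(proteins, pair):
    """Frequency of one dipeptide (per mille) over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Assumed error estimate: standard deviation over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)

# Toy sequences (hypothetical, not taken from this proteome).
toy_proteome = ["MAAL", "AAVK", "MKVL", "GAAS", "LLVV", "MAVA"]
error = bootstrap_error(toy_proteome, "AA")
print(f"AA: {pair_frequency(toy_proteome, 'AA'):.3f} ± {error:.3f}")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, which is why a proteome with only a few proteins can show comparatively large errors.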

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski