Amino acid dipepetide frequency for BtRs-BetaCoV/HuB2013

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.631AlaAla: 6.631 ± 0.757
2.072AlaCys: 2.072 ± 0.73
3.005AlaAsp: 3.005 ± 0.609
2.59AlaGlu: 2.59 ± 0.739
2.901AlaPhe: 2.901 ± 0.991
4.145AlaGly: 4.145 ± 0.99
0.933AlaHis: 0.933 ± 0.482
4.663AlaIle: 4.663 ± 0.862
3.73AlaLys: 3.73 ± 1.376
7.046AlaLeu: 7.046 ± 1.287
2.28AlaMet: 2.28 ± 0.64
3.834AlaAsn: 3.834 ± 0.899
2.487AlaPro: 2.487 ± 0.465
2.59AlaGln: 2.59 ± 0.527
3.108AlaArg: 3.108 ± 0.536
5.077AlaSer: 5.077 ± 2.245
4.455AlaThr: 4.455 ± 0.595
4.663AlaVal: 4.663 ± 1.522
1.14AlaTrp: 1.14 ± 0.471
4.041AlaTyr: 4.041 ± 0.886
0.0AlaXaa: 0.0 ± 0.0
Cys
2.28CysAla: 2.28 ± 0.543
1.761CysCys: 1.761 ± 0.489
2.28CysAsp: 2.28 ± 0.693
1.451CysGlu: 1.451 ± 0.806
1.451CysPhe: 1.451 ± 0.587
2.487CysGly: 2.487 ± 0.847
0.518CysHis: 0.518 ± 0.175
2.072CysIle: 2.072 ± 0.687
0.725CysLys: 0.725 ± 0.246
2.383CysLeu: 2.383 ± 0.826
0.622CysMet: 0.622 ± 0.207
1.451CysAsn: 1.451 ± 0.492
0.829CysPro: 0.829 ± 0.224
0.622CysGln: 0.622 ± 0.291
1.243CysArg: 1.243 ± 0.531
2.072CysSer: 2.072 ± 0.735
2.072CysThr: 2.072 ± 0.906
2.694CysVal: 2.694 ± 0.734
0.518CysTrp: 0.518 ± 0.842
1.347CysTyr: 1.347 ± 0.537
0.0CysXaa: 0.0 ± 0.0
Asp
3.73AspAla: 3.73 ± 1.329
1.14AspCys: 1.14 ± 0.456
2.487AspAsp: 2.487 ± 0.883
2.487AspGlu: 2.487 ± 0.499
3.005AspPhe: 3.005 ± 0.663
4.041AspGly: 4.041 ± 0.715
0.829AspHis: 0.829 ± 0.537
3.005AspIle: 3.005 ± 0.788
2.798AspLys: 2.798 ± 0.646
4.663AspLeu: 4.663 ± 0.966
1.347AspMet: 1.347 ± 0.558
2.798AspAsn: 2.798 ± 0.839
1.658AspPro: 1.658 ± 0.636
1.658AspGln: 1.658 ± 0.459
1.451AspArg: 1.451 ± 0.344
3.005AspSer: 3.005 ± 0.407
3.523AspThr: 3.523 ± 0.975
4.352AspVal: 4.352 ± 1.164
0.622AspTrp: 0.622 ± 0.291
3.73AspTyr: 3.73 ± 0.896
0.0AspXaa: 0.0 ± 0.0
Glu
3.108GluAla: 3.108 ± 0.964
2.072GluCys: 2.072 ± 0.723
2.487GluAsp: 2.487 ± 0.633
4.352GluGlu: 4.352 ± 1.207
1.865GluPhe: 1.865 ± 0.798
2.901GluGly: 2.901 ± 0.796
1.347GluHis: 1.347 ± 0.417
2.694GluIle: 2.694 ± 0.746
1.969GluLys: 1.969 ± 1.017
4.974GluLeu: 4.974 ± 1.573
0.933GluMet: 0.933 ± 0.353
1.969GluAsn: 1.969 ± 0.508
2.176GluPro: 2.176 ± 0.916
1.761GluGln: 1.761 ± 0.273
1.451GluArg: 1.451 ± 0.605
2.694GluSer: 2.694 ± 0.365
3.108GluThr: 3.108 ± 0.904
3.523GluVal: 3.523 ± 0.831
0.311GluTrp: 0.311 ± 0.161
1.865GluTyr: 1.865 ± 0.78
0.0GluXaa: 0.0 ± 0.0
Phe
2.694PheAla: 2.694 ± 1.227
1.865PheCys: 1.865 ± 0.482
3.212PheAsp: 3.212 ± 0.871
1.451PheGlu: 1.451 ± 0.493
1.969PhePhe: 1.969 ± 0.683
2.901PheGly: 2.901 ± 1.352
0.622PheHis: 0.622 ± 0.473
2.694PheIle: 2.694 ± 1.186
3.108PheLys: 3.108 ± 0.646
5.077PheLeu: 5.077 ± 1.395
0.933PheMet: 0.933 ± 0.314
3.419PheAsn: 3.419 ± 1.81
1.969PhePro: 1.969 ± 0.521
1.14PheGln: 1.14 ± 0.791
1.451PheArg: 1.451 ± 0.584
3.005PheSer: 3.005 ± 0.61
4.041PheThr: 4.041 ± 0.619
4.041PheVal: 4.041 ± 1.44
0.414PheTrp: 0.414 ± 0.249
2.694PheTyr: 2.694 ± 0.659
0.0PheXaa: 0.0 ± 0.0
Gly
4.766GlyAla: 4.766 ± 1.11
1.658GlyCys: 1.658 ± 0.553
3.627GlyAsp: 3.627 ± 0.621
2.072GlyGlu: 2.072 ± 0.548
3.316GlyPhe: 3.316 ± 0.606
3.937GlyGly: 3.937 ± 1.54
1.243GlyHis: 1.243 ± 0.701
3.523GlyIle: 3.523 ± 1.054
2.798GlyLys: 2.798 ± 0.948
3.627GlyLeu: 3.627 ± 0.745
1.14GlyMet: 1.14 ± 0.582
2.901GlyAsn: 2.901 ± 0.747
2.176GlyPro: 2.176 ± 1.093
2.072GlyGln: 2.072 ± 0.729
1.761GlyArg: 1.761 ± 0.526
3.523GlySer: 3.523 ± 0.592
5.595GlyThr: 5.595 ± 1.597
6.528GlyVal: 6.528 ± 1.31
0.518GlyTrp: 0.518 ± 0.433
2.901GlyTyr: 2.901 ± 0.465
0.0GlyXaa: 0.0 ± 0.0
His
1.554HisAla: 1.554 ± 0.369
0.622HisCys: 0.622 ± 0.321
1.036HisAsp: 1.036 ± 0.545
1.036HisGlu: 1.036 ± 0.386
1.451HisPhe: 1.451 ± 0.603
1.451HisGly: 1.451 ± 0.53
0.622HisHis: 0.622 ± 0.291
1.036HisIle: 1.036 ± 0.515
0.829HisLys: 0.829 ± 0.399
2.28HisLeu: 2.28 ± 0.484
0.414HisMet: 0.414 ± 0.214
0.725HisAsn: 0.725 ± 0.375
0.622HisPro: 0.622 ± 0.262
0.414HisGln: 0.414 ± 0.228
0.311HisArg: 0.311 ± 0.152
1.658HisSer: 1.658 ± 0.752
1.865HisThr: 1.865 ± 0.527
1.554HisVal: 1.554 ± 0.415
0.311HisTrp: 0.311 ± 0.303
1.036HisTyr: 1.036 ± 0.704
0.0HisXaa: 0.0 ± 0.0
Ile
3.523IleAla: 3.523 ± 1.608
1.036IleCys: 1.036 ± 0.632
2.798IleAsp: 2.798 ± 0.347
1.658IleGlu: 1.658 ± 0.405
1.761IlePhe: 1.761 ± 0.474
3.627IleGly: 3.627 ± 0.98
0.725IleHis: 0.725 ± 1.283
3.108IleIle: 3.108 ± 1.215
3.316IleLys: 3.316 ± 0.775
4.455IleLeu: 4.455 ± 1.411
1.865IleMet: 1.865 ± 0.79
3.005IleAsn: 3.005 ± 0.387
2.487IlePro: 2.487 ± 0.767
1.865IleGln: 1.865 ± 0.849
1.865IleArg: 1.865 ± 0.517
3.419IleSer: 3.419 ± 0.983
4.041IleThr: 4.041 ± 0.585
4.87IleVal: 4.87 ± 1.113
0.518IleTrp: 0.518 ± 0.455
1.658IleTyr: 1.658 ± 1.693
0.0IleXaa: 0.0 ± 0.0
Lys
2.798LysAla: 2.798 ± 1.126
1.865LysCys: 1.865 ± 0.482
2.59LysAsp: 2.59 ± 1.534
3.212LysGlu: 3.212 ± 0.792
2.59LysPhe: 2.59 ± 0.841
4.559LysGly: 4.559 ± 1.049
1.761LysHis: 1.761 ± 0.752
2.59LysIle: 2.59 ± 0.663
2.798LysLys: 2.798 ± 2.573
6.321LysLeu: 6.321 ± 0.958
1.451LysMet: 1.451 ± 0.415
2.383LysAsn: 2.383 ± 1.016
3.73LysPro: 3.73 ± 0.689
1.658LysGln: 1.658 ± 0.827
2.59LysArg: 2.59 ± 0.376
3.73LysSer: 3.73 ± 0.861
3.212LysThr: 3.212 ± 0.612
3.316LysVal: 3.316 ± 0.586
0.725LysTrp: 0.725 ± 0.246
2.072LysTyr: 2.072 ± 0.613
0.0LysXaa: 0.0 ± 0.0
Leu
6.424LeuAla: 6.424 ± 1.556
2.901LeuCys: 2.901 ± 0.578
5.077LeuAsp: 5.077 ± 1.36
4.352LeuGlu: 4.352 ± 0.913
3.523LeuPhe: 3.523 ± 1.223
5.077LeuGly: 5.077 ± 0.846
1.761LeuHis: 1.761 ± 0.459
3.937LeuIle: 3.937 ± 2.375
6.839LeuLys: 6.839 ± 1.696
10.258LeuLeu: 10.258 ± 2.966
2.487LeuMet: 2.487 ± 0.812
5.803LeuAsn: 5.803 ± 0.717
4.87LeuPro: 4.87 ± 1.264
4.455LeuGln: 4.455 ± 0.654
4.352LeuArg: 4.352 ± 0.906
6.631LeuSer: 6.631 ± 1.111
5.803LeuThr: 5.803 ± 0.795
5.906LeuVal: 5.906 ± 2.084
1.14LeuTrp: 1.14 ± 0.687
3.523LeuTyr: 3.523 ± 0.888
0.0LeuXaa: 0.0 ± 0.0
Met
1.554MetAla: 1.554 ± 0.956
0.829MetCys: 0.829 ± 0.428
1.347MetAsp: 1.347 ± 0.563
0.933MetGlu: 0.933 ± 0.87
1.14MetPhe: 1.14 ± 0.532
0.725MetGly: 0.725 ± 0.375
0.518MetHis: 0.518 ± 0.268
0.829MetIle: 0.829 ± 0.31
1.14MetLys: 1.14 ± 0.747
2.798MetLeu: 2.798 ± 0.922
0.725MetMet: 0.725 ± 0.375
0.829MetAsn: 0.829 ± 0.332
1.243MetPro: 1.243 ± 0.486
1.036MetGln: 1.036 ± 0.386
0.725MetArg: 0.725 ± 0.229
2.487MetSer: 2.487 ± 0.687
1.451MetThr: 1.451 ± 0.499
1.14MetVal: 1.14 ± 0.436
0.622MetTrp: 0.622 ± 0.68
1.451MetTyr: 1.451 ± 0.432
0.0MetXaa: 0.0 ± 0.0
Asn
4.559AsnAla: 4.559 ± 1.034
1.761AsnCys: 1.761 ± 0.459
1.658AsnAsp: 1.658 ± 0.445
1.761AsnGlu: 1.761 ± 0.461
2.176AsnPhe: 2.176 ± 2.063
4.041AsnGly: 4.041 ± 0.845
1.451AsnHis: 1.451 ± 0.493
2.28AsnIle: 2.28 ± 0.727
2.901AsnLys: 2.901 ± 0.533
4.455AsnLeu: 4.455 ± 1.048
1.347AsnMet: 1.347 ± 0.69
3.005AsnAsn: 3.005 ± 0.915
1.658AsnPro: 1.658 ± 0.565
1.658AsnGln: 1.658 ± 0.995
2.176AsnArg: 2.176 ± 0.921
3.937AsnSer: 3.937 ± 1.473
3.108AsnThr: 3.108 ± 1.03
5.077AsnVal: 5.077 ± 0.814
0.414AsnTrp: 0.414 ± 0.335
2.383AsnTyr: 2.383 ± 0.616
0.0AsnXaa: 0.0 ± 0.0
Pro
3.108ProAla: 3.108 ± 0.456
1.14ProCys: 1.14 ± 0.312
1.451ProAsp: 1.451 ± 0.457
1.865ProGlu: 1.865 ± 0.532
1.969ProPhe: 1.969 ± 0.714
2.072ProGly: 2.072 ± 0.548
0.725ProHis: 0.725 ± 0.246
2.383ProIle: 2.383 ± 0.533
3.108ProLys: 3.108 ± 1.14
4.766ProLeu: 4.766 ± 1.058
0.622ProMet: 0.622 ± 0.887
1.969ProAsn: 1.969 ± 0.503
1.451ProPro: 1.451 ± 0.331
1.761ProGln: 1.761 ± 1.853
1.554ProArg: 1.554 ± 1.108
2.59ProSer: 2.59 ± 1.212
3.316ProThr: 3.316 ± 0.411
3.523ProVal: 3.523 ± 0.785
0.311ProTrp: 0.311 ± 0.152
1.036ProTyr: 1.036 ± 0.386
0.0ProXaa: 0.0 ± 0.0
Gln
3.108GlnAla: 3.108 ± 0.682
0.933GlnCys: 0.933 ± 0.337
1.969GlnAsp: 1.969 ± 0.855
2.383GlnGlu: 2.383 ± 1.129
1.865GlnPhe: 1.865 ± 0.804
1.865GlnGly: 1.865 ± 1.815
0.933GlnHis: 0.933 ± 0.269
2.176GlnIle: 2.176 ± 1.642
1.347GlnLys: 1.347 ± 0.529
4.041GlnLeu: 4.041 ± 1.269
1.036GlnMet: 1.036 ± 0.342
1.554GlnAsn: 1.554 ± 0.489
1.969GlnPro: 1.969 ± 0.741
1.761GlnGln: 1.761 ± 0.738
1.658GlnArg: 1.658 ± 0.79
1.969GlnSer: 1.969 ± 0.551
2.487GlnThr: 2.487 ± 0.513
2.59GlnVal: 2.59 ± 0.829
0.622GlnTrp: 0.622 ± 0.35
1.243GlnTyr: 1.243 ± 0.609
0.0GlnXaa: 0.0 ± 0.0
Arg
3.316ArgAla: 3.316 ± 0.822
1.243ArgCys: 1.243 ± 0.362
1.761ArgAsp: 1.761 ± 0.633
2.28ArgGlu: 2.28 ± 0.871
1.761ArgPhe: 1.761 ± 0.544
2.59ArgGly: 2.59 ± 1.9
1.14ArgHis: 1.14 ± 0.307
1.658ArgIle: 1.658 ± 0.878
2.072ArgLys: 2.072 ± 0.474
3.108ArgLeu: 3.108 ± 0.568
0.518ArgMet: 0.518 ± 0.448
2.176ArgAsn: 2.176 ± 0.452
0.933ArgPro: 0.933 ± 0.661
1.969ArgGln: 1.969 ± 1.146
1.243ArgArg: 1.243 ± 1.173
3.212ArgSer: 3.212 ± 1.281
1.658ArgThr: 1.658 ± 0.615
3.73ArgVal: 3.73 ± 0.69
0.518ArgTrp: 0.518 ± 0.604
1.347ArgTyr: 1.347 ± 0.427
0.0ArgXaa: 0.0 ± 0.0
Ser
5.595SerAla: 5.595 ± 1.23
1.451SerCys: 1.451 ± 0.605
3.834SerAsp: 3.834 ± 0.77
3.627SerGlu: 3.627 ± 0.94
4.041SerPhe: 4.041 ± 1.182
3.523SerGly: 3.523 ± 2.12
1.761SerHis: 1.761 ± 0.555
2.694SerIle: 2.694 ± 0.646
3.316SerLys: 3.316 ± 0.62
5.803SerLeu: 5.803 ± 0.803
1.451SerMet: 1.451 ± 0.365
2.798SerAsn: 2.798 ± 1.197
2.383SerPro: 2.383 ± 1.183
2.383SerGln: 2.383 ± 0.54
2.487SerArg: 2.487 ± 2.556
4.663SerSer: 4.663 ± 1.274
5.284SerThr: 5.284 ± 1.062
5.906SerVal: 5.906 ± 1.287
1.14SerTrp: 1.14 ± 0.25
3.005SerTyr: 3.005 ± 0.621
0.0SerXaa: 0.0 ± 0.0
Thr
3.627ThrAla: 3.627 ± 1.204
2.487ThrCys: 2.487 ± 1.117
3.212ThrAsp: 3.212 ± 0.865
3.73ThrGlu: 3.73 ± 0.478
4.145ThrPhe: 4.145 ± 0.628
4.559ThrGly: 4.559 ± 0.864
1.347ThrHis: 1.347 ± 0.391
3.627ThrIle: 3.627 ± 1.208
3.523ThrLys: 3.523 ± 0.776
6.217ThrLeu: 6.217 ± 0.936
1.761ThrMet: 1.761 ± 0.622
3.316ThrAsn: 3.316 ± 0.423
2.798ThrPro: 2.798 ± 0.676
3.73ThrGln: 3.73 ± 1.35
3.108ThrArg: 3.108 ± 0.625
4.87ThrSer: 4.87 ± 1.105
5.595ThrThr: 5.595 ± 1.101
5.388ThrVal: 5.388 ± 0.621
0.518ThrTrp: 0.518 ± 0.299
2.798ThrTyr: 2.798 ± 0.496
0.0ThrXaa: 0.0 ± 0.0
Val
6.01ValAla: 6.01 ± 0.666
2.383ValCys: 2.383 ± 0.622
5.181ValAsp: 5.181 ± 1.423
3.834ValGlu: 3.834 ± 1.611
3.73ValPhe: 3.73 ± 0.628
3.005ValGly: 3.005 ± 0.882
1.243ValHis: 1.243 ± 0.448
4.352ValIle: 4.352 ± 0.892
5.181ValLys: 5.181 ± 1.178
7.875ValLeu: 7.875 ± 1.24
1.554ValMet: 1.554 ± 0.565
3.834ValAsn: 3.834 ± 1.228
3.108ValPro: 3.108 ± 0.396
3.005ValGln: 3.005 ± 0.864
2.901ValArg: 2.901 ± 0.622
4.663ValSer: 4.663 ± 0.897
6.631ValThr: 6.631 ± 0.723
6.217ValVal: 6.217 ± 1.244
0.414ValTrp: 0.414 ± 0.296
4.455ValTyr: 4.455 ± 1.068
0.0ValXaa: 0.0 ± 0.0
Trp
0.622TrpAla: 0.622 ± 0.321
0.207TrpCys: 0.207 ± 0.107
0.518TrpAsp: 0.518 ± 0.268
0.518TrpGlu: 0.518 ± 0.175
1.14TrpPhe: 1.14 ± 0.468
0.207TrpGly: 0.207 ± 0.107
0.414TrpHis: 0.414 ± 0.545
0.622TrpIle: 0.622 ± 0.316
0.518TrpLys: 0.518 ± 0.251
1.554TrpLeu: 1.554 ± 1.054
0.104TrpMet: 0.104 ± 0.054
1.347TrpAsn: 1.347 ± 0.542
0.414TrpPro: 0.414 ± 0.509
0.311TrpGln: 0.311 ± 0.255
0.311TrpArg: 0.311 ± 0.152
0.725TrpSer: 0.725 ± 0.37
0.518TrpThr: 0.518 ± 0.175
0.725TrpVal: 0.725 ± 0.393
0.104TrpTrp: 0.104 ± 0.054
0.414TrpTyr: 0.414 ± 0.296
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.383TyrAla: 2.383 ± 0.369
1.451TyrCys: 1.451 ± 0.767
2.798TyrAsp: 2.798 ± 0.656
1.969TyrGlu: 1.969 ± 0.742
2.901TyrPhe: 2.901 ± 0.502
1.969TyrGly: 1.969 ± 0.341
0.933TyrHis: 0.933 ± 0.396
1.658TyrIle: 1.658 ± 0.53
3.937TyrLys: 3.937 ± 0.639
3.419TyrLeu: 3.419 ± 1.028
0.829TyrMet: 0.829 ± 0.428
2.694TyrAsn: 2.694 ± 0.28
1.761TyrPro: 1.761 ± 0.623
1.658TyrGln: 1.658 ± 0.981
2.487TyrArg: 2.487 ± 0.553
3.212TyrSer: 3.212 ± 1.783
2.59TyrThr: 2.59 ± 0.873
3.937TyrVal: 3.937 ± 1.002
0.414TyrTrp: 0.414 ± 0.247
2.798TyrTyr: 2.798 ± 0.581
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (9652 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski