Amino acid dipeptide frequency for BtMf-AlphaCoV/JX2012

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
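As a rough illustration of what such a calculation can look like, the sketch below counts overlapping pairs of adjacent residues in each protein and reports them in per mille. The function names, the toy sequences, and the choice to pool counts over all proteins are assumptions made for this example; the database's actual counting procedure is not described here.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # the 20 standard amino acids, one-letter codes


def dipeptide_counts(sequence):
    """Count overlapping dipeptides in one protein.

    Any letter outside the 20 standard ones is mapped to X (Xaa).
    """
    cleaned = "".join(aa if aa in STANDARD else "X" for aa in sequence.upper())
    return Counter(cleaned[i:i + 2] for i in range(len(cleaned) - 1))


def dipeptide_permille(sequences):
    """Pool counts over all proteins and return frequencies in per mille."""
    total = Counter()
    for seq in sequences:
        total.update(dipeptide_counts(seq))
    n = sum(total.values())
    if n == 0:
        return {}
    return {pair: 1000.0 * count / n for pair, count in total.items()}


# Hypothetical toy sequences, not taken from the proteome above.
toy_proteome = ["MAAVL", "AVAV"]
freqs = dipeptide_permille(toy_proteome)
```

Pairs are counted within each protein only, so no dipeptide spans the boundary between two proteins in this sketch.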

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 400 equally likely, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue, in the table.
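The expected value under this uniform model, and the comparison against it, can be sketched as follows. The helper names are hypothetical, and the simple greater-than / less-than rule is an assumption; the exact criterion used to colour the table is not stated.

```python
def expected_uniform_permille(alphabet_size=20):
    """Expected dipeptide frequency (in per mille) if all pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_permille, expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected_permille:
        return "over"
    if observed_permille < expected_permille:
        return "under"
    return "as expected"


expected = expected_uniform_permille()   # 20 standard amino acids: 1000 / 400
ala_ala = classify(4.978, expected)      # AlaAla value from the table
trp_trp = classify(0.108, expected)      # TrpTrp value from the table
```

With 21 symbols (including Xaa) the same function gives 1000 / 441, roughly 2.27‰.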

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
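For readers who prefer fractions or percentages, the unit conversion is plain arithmetic; the short sketch below uses hypothetical helper names and the AlaAla value from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0


ala_ala_fraction = permille_to_fraction(4.978)
ala_ala_percent = permille_to_percent(4.978)
```

In percent, the AlaAla value can be compared directly with the 0.25% expected under the uniform model.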

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second residue: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.978 ± 0.978
AlaCys: 2.165 ± 0.914
AlaAsp: 3.03 ± 0.475
AlaGlu: 2.922 ± 0.481
AlaPhe: 4.654 ± 1.119
AlaGly: 3.139 ± 0.838
AlaHis: 1.19 ± 0.376
AlaIle: 4.978 ± 0.965
AlaLys: 3.68 ± 1.131
AlaLeu: 5.519 ± 1.271
AlaMet: 2.165 ± 0.455
AlaAsn: 3.896 ± 0.753
AlaPro: 2.165 ± 0.998
AlaGln: 1.299 ± 0.334
AlaArg: 2.922 ± 0.792
AlaSer: 5.195 ± 1.073
AlaThr: 3.788 ± 0.382
AlaVal: 5.736 ± 0.849
AlaTrp: 0.541 ± 0.319
AlaTyr: 1.948 ± 0.525
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.056 ± 0.486
CysCys: 1.082 ± 0.509
CysAsp: 2.273 ± 1.007
CysGlu: 0.433 ± 0.143
CysPhe: 2.165 ± 0.426
CysGly: 2.922 ± 0.762
CysHis: 0.649 ± 0.431
CysIle: 0.974 ± 0.354
CysLys: 1.948 ± 0.707
CysLeu: 2.489 ± 0.603
CysMet: 0.433 ± 0.47
CysAsn: 2.489 ± 0.793
CysPro: 0.649 ± 0.209
CysGln: 0.866 ± 0.609
CysArg: 1.082 ± 0.406
CysSer: 2.056 ± 0.495
CysThr: 3.03 ± 0.591
CysVal: 3.139 ± 0.822
CysTrp: 0.541 ± 0.28
CysTyr: 2.165 ± 1.119
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.463 ± 1.245
AspCys: 2.165 ± 0.68
AspAsp: 2.814 ± 0.407
AspGlu: 2.165 ± 0.548
AspPhe: 3.68 ± 0.994
AspGly: 5.519 ± 1.252
AspHis: 1.19 ± 0.523
AspIle: 2.814 ± 1.034
AspLys: 2.489 ± 0.582
AspLeu: 4.545 ± 0.715
AspMet: 1.19 ± 0.415
AspAsn: 3.571 ± 1.176
AspPro: 1.84 ± 0.656
AspGln: 1.19 ± 0.705
AspArg: 0.974 ± 0.309
AspSer: 3.571 ± 0.881
AspThr: 2.273 ± 0.523
AspVal: 5.628 ± 1.304
AspTrp: 0.541 ± 0.28
AspTyr: 3.355 ± 0.932
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.597 ± 0.575
GluCys: 1.407 ± 0.461
GluAsp: 2.273 ± 0.523
GluGlu: 2.706 ± 0.772
GluPhe: 2.273 ± 0.321
GluGly: 3.463 ± 1.384
GluHis: 1.623 ± 0.564
GluIle: 1.515 ± 0.304
GluLys: 2.597 ± 0.643
GluLeu: 3.896 ± 0.684
GluMet: 0.433 ± 0.364
GluAsn: 1.84 ± 0.316
GluPro: 1.84 ± 1.002
GluGln: 1.515 ± 0.758
GluArg: 1.299 ± 0.775
GluSer: 1.407 ± 0.865
GluThr: 1.623 ± 0.423
GluVal: 4.978 ± 1.067
GluTrp: 0.433 ± 0.364
GluTyr: 1.515 ± 0.621
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.247 ± 0.624
PheCys: 2.273 ± 0.536
PheAsp: 4.004 ± 0.717
PheGlu: 2.706 ± 0.554
PhePhe: 2.597 ± 0.907
PheGly: 4.545 ± 0.761
PheHis: 0.541 ± 0.17
PheIle: 3.788 ± 1.131
PheLys: 4.113 ± 1.415
PheLeu: 2.814 ± 0.85
PheMet: 0.866 ± 0.447
PheAsn: 5.411 ± 1.573
PhePro: 0.649 ± 0.304
PheGln: 1.299 ± 0.634
PheArg: 0.649 ± 0.863
PheSer: 4.978 ± 1.762
PheThr: 3.139 ± 0.537
PheVal: 6.818 ± 1.416
PheTrp: 0.866 ± 0.46
PheTyr: 2.814 ± 0.37
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.68 ± 1.431
GlyCys: 2.381 ± 0.76
GlyAsp: 4.654 ± 0.845
GlyGlu: 2.165 ± 0.716
GlyPhe: 4.113 ± 1.339
GlyGly: 4.545 ± 0.827
GlyHis: 0.974 ± 0.307
GlyIle: 3.139 ± 1.132
GlyLys: 3.247 ± 0.84
GlyLeu: 5.195 ± 0.473
GlyMet: 1.082 ± 0.66
GlyAsn: 3.139 ± 0.755
GlyPro: 1.299 ± 0.555
GlyGln: 1.19 ± 1.039
GlyArg: 2.489 ± 1.68
GlySer: 5.844 ± 1.735
GlyThr: 4.113 ± 0.883
GlyVal: 8.442 ± 1.504
GlyTrp: 0.649 ± 0.327
GlyTyr: 2.381 ± 1.039
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.623 ± 0.471
HisCys: 1.082 ± 0.406
HisAsp: 1.19 ± 0.615
HisGlu: 0.974 ± 0.48
HisPhe: 0.866 ± 0.46
HisGly: 0.974 ± 0.755
HisHis: 0.325 ± 0.328
HisIle: 0.758 ± 0.254
HisLys: 1.19 ± 0.615
HisLeu: 1.299 ± 1.184
HisMet: 0.216 ± 0.112
HisAsn: 1.082 ± 0.421
HisPro: 0.325 ± 0.168
HisGln: 0.758 ± 0.273
HisArg: 0.541 ± 0.433
HisSer: 1.082 ± 0.406
HisThr: 1.19 ± 0.376
HisVal: 2.056 ± 0.808
HisTrp: 0.216 ± 0.112
HisTyr: 0.758 ± 0.686
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.139 ± 0.562
IleCys: 0.974 ± 0.873
IleAsp: 2.489 ± 0.775
IleGlu: 2.056 ± 0.495
IlePhe: 3.03 ± 0.591
IleGly: 2.706 ± 0.322
IleHis: 1.082 ± 0.501
IleIle: 3.03 ± 1.851
IleLys: 3.463 ± 0.945
IleLeu: 3.896 ± 1.019
IleMet: 0.974 ± 0.701
IleAsn: 4.113 ± 1.337
IlePro: 2.814 ± 1.807
IleGln: 1.732 ± 0.716
IleArg: 1.515 ± 0.483
IleSer: 4.978 ± 1.553
IleThr: 3.896 ± 1.573
IleVal: 5.519 ± 1.19
IleTrp: 0.325 ± 0.168
IleTyr: 2.056 ± 0.569
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.896 ± 1.546
LysCys: 1.84 ± 0.786
LysAsp: 3.247 ± 1.074
LysGlu: 2.381 ± 0.33
LysPhe: 3.571 ± 0.912
LysGly: 2.814 ± 1.063
LysHis: 2.165 ± 1.119
LysIle: 2.273 ± 0.658
LysLys: 1.732 ± 1.06
LysLeu: 4.978 ± 1.369
LysMet: 1.19 ± 0.389
LysAsn: 2.489 ± 0.617
LysPro: 4.113 ± 1.216
LysGln: 2.381 ± 0.791
LysArg: 2.165 ± 0.908
LysSer: 3.463 ± 1.122
LysThr: 3.463 ± 0.627
LysVal: 4.87 ± 1.525
LysTrp: 0.541 ± 0.477
LysTyr: 3.247 ± 0.841
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.628 ± 1.612
LeuCys: 2.922 ± 0.82
LeuAsp: 3.139 ± 0.624
LeuGlu: 3.68 ± 0.521
LeuPhe: 3.896 ± 2.073
LeuGly: 4.87 ± 1.341
LeuHis: 1.515 ± 0.904
LeuIle: 3.03 ± 1.681
LeuLys: 5.844 ± 1.086
LeuLeu: 5.628 ± 1.461
LeuMet: 1.299 ± 0.438
LeuAsn: 6.061 ± 1.92
LeuPro: 4.113 ± 1.966
LeuGln: 3.03 ± 0.377
LeuArg: 3.247 ± 0.507
LeuSer: 6.818 ± 1.371
LeuThr: 5.411 ± 1.874
LeuVal: 6.818 ± 2.122
LeuTrp: 1.299 ± 1.217
LeuTyr: 4.004 ± 1.201
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.19 ± 0.731
MetCys: 1.19 ± 0.528
MetAsp: 1.082 ± 0.559
MetGlu: 0.866 ± 0.368
MetPhe: 1.732 ± 0.433
MetGly: 1.299 ± 0.625
MetHis: 0.325 ± 0.168
MetIle: 1.515 ± 0.379
MetLys: 0.108 ± 0.056
MetLeu: 2.165 ± 0.722
MetMet: 0.649 ± 0.209
MetAsn: 0.758 ± 0.37
MetPro: 0.758 ± 0.286
MetGln: 0.649 ± 0.768
MetArg: 1.082 ± 0.34
MetSer: 1.623 ± 0.419
MetThr: 1.19 ± 0.348
MetVal: 0.758 ± 0.354
MetTrp: 0.325 ± 0.484
MetTyr: 1.407 ± 0.619
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.87 ± 0.952
AsnCys: 2.273 ± 0.768
AsnAsp: 2.273 ± 0.532
AsnGlu: 2.273 ± 0.569
AsnPhe: 2.165 ± 1.15
AsnGly: 5.736 ± 1.28
AsnHis: 0.541 ± 0.456
AsnIle: 4.221 ± 0.922
AsnLys: 3.355 ± 0.693
AsnLeu: 5.087 ± 1.871
AsnMet: 1.082 ± 0.406
AsnAsn: 4.654 ± 0.836
AsnPro: 1.732 ± 0.685
AsnGln: 1.407 ± 1.626
AsnArg: 1.732 ± 0.555
AsnSer: 5.195 ± 1.561
AsnThr: 4.113 ± 0.991
AsnVal: 7.251 ± 2.047
AsnTrp: 0.649 ± 0.863
AsnTyr: 2.381 ± 0.76
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.381 ± 2.125
ProCys: 0.758 ± 0.254
ProAsp: 1.84 ± 0.394
ProGlu: 2.381 ± 1.122
ProPhe: 1.84 ± 0.679
ProGly: 1.948 ± 0.615
ProHis: 1.082 ± 1.029
ProIle: 1.623 ± 0.527
ProLys: 1.948 ± 2.37
ProLeu: 3.139 ± 1.098
ProMet: 0.325 ± 0.168
ProAsn: 1.732 ± 1.655
ProPro: 1.84 ± 0.667
ProGln: 0.758 ± 0.712
ProArg: 1.299 ± 0.708
ProSer: 3.03 ± 0.594
ProThr: 1.948 ± 0.707
ProVal: 3.896 ± 1.395
ProTrp: 0.433 ± 0.143
ProTyr: 0.758 ± 0.37
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.273 ± 0.502
GlnCys: 0.541 ± 0.28
GlnAsp: 1.082 ± 0.387
GlnGlu: 0.758 ± 0.712
GlnPhe: 1.082 ± 0.551
GlnGly: 1.299 ± 0.35
GlnHis: 0.541 ± 0.17
GlnIle: 1.407 ± 0.822
GlnLys: 1.082 ± 0.598
GlnLeu: 4.221 ± 2.149
GlnMet: 1.082 ± 0.339
GlnAsn: 1.299 ± 0.472
GlnPro: 0.974 ± 0.289
GlnGln: 1.299 ± 1.711
GlnArg: 1.299 ± 1.038
GlnSer: 2.273 ± 1.499
GlnThr: 1.732 ± 0.403
GlnVal: 1.948 ± 0.578
GlnTrp: 0.325 ± 0.135
GlnTyr: 1.19 ± 0.632
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.706 ± 0.715
ArgCys: 1.515 ± 0.408
ArgAsp: 1.299 ± 0.26
ArgGlu: 1.082 ± 0.706
ArgPhe: 2.814 ± 1.009
ArgGly: 1.299 ± 0.572
ArgHis: 0.433 ± 0.224
ArgIle: 1.515 ± 0.622
ArgLys: 1.732 ± 0.708
ArgLeu: 3.03 ± 2.118
ArgMet: 0.974 ± 0.566
ArgAsn: 2.706 ± 0.962
ArgPro: 1.19 ± 0.632
ArgGln: 0.866 ± 0.318
ArgArg: 1.082 ± 1.011
ArgSer: 1.948 ± 2.966
ArgThr: 2.056 ± 0.667
ArgVal: 3.139 ± 0.814
ArgTrp: 0.433 ± 0.358
ArgTyr: 1.515 ± 0.574
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.545 ± 0.88
SerCys: 1.407 ± 0.693
SerAsp: 4.978 ± 0.882
SerGlu: 2.814 ± 0.743
SerPhe: 4.437 ± 1.403
SerGly: 4.978 ± 0.738
SerHis: 1.19 ± 0.506
SerIle: 4.545 ± 0.808
SerLys: 4.221 ± 1.283
SerLeu: 5.844 ± 0.77
SerMet: 1.515 ± 0.427
SerAsn: 4.113 ± 2.075
SerPro: 1.623 ± 0.448
SerGln: 2.165 ± 0.967
SerArg: 1.948 ± 2.711
SerSer: 6.277 ± 1.39
SerThr: 4.654 ± 0.765
SerVal: 8.766 ± 1.112
SerTrp: 0.866 ± 0.551
SerTyr: 3.896 ± 0.915
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.463 ± 1.023
ThrCys: 1.623 ± 0.385
ThrAsp: 2.489 ± 0.681
ThrGlu: 2.165 ± 1.005
ThrPhe: 4.437 ± 0.84
ThrGly: 3.571 ± 1.079
ThrHis: 0.758 ± 0.428
ThrIle: 3.788 ± 1.485
ThrLys: 3.03 ± 0.662
ThrLeu: 5.736 ± 0.734
ThrMet: 2.056 ± 0.692
ThrAsn: 3.788 ± 0.887
ThrPro: 1.948 ± 1.034
ThrGln: 1.515 ± 0.912
ThrArg: 1.948 ± 0.7
ThrSer: 4.329 ± 1.11
ThrThr: 4.221 ± 1.317
ThrVal: 6.602 ± 1.426
ThrTrp: 0.433 ± 0.224
ThrTyr: 2.056 ± 0.477
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.602 ± 0.85
ValCys: 4.004 ± 1.0
ValAsp: 7.359 ± 0.92
ValGlu: 4.654 ± 1.01
ValPhe: 5.411 ± 0.844
ValGly: 5.628 ± 0.833
ValHis: 0.866 ± 0.286
ValIle: 6.602 ± 1.434
ValLys: 8.117 ± 2.755
ValLeu: 8.225 ± 1.766
ValMet: 1.948 ± 2.055
ValAsn: 6.061 ± 1.096
ValPro: 3.463 ± 1.177
ValGln: 2.922 ± 0.407
ValArg: 3.03 ± 0.463
ValSer: 7.143 ± 1.559
ValThr: 5.844 ± 1.628
ValVal: 8.766 ± 2.133
ValTrp: 0.758 ± 0.273
ValTyr: 2.922 ± 1.106
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.541 ± 0.682
TrpCys: 0.433 ± 0.143
TrpAsp: 0.649 ± 0.336
TrpGlu: 0.216 ± 0.112
TrpPhe: 0.758 ± 0.423
TrpGly: 0.325 ± 0.484
TrpHis: 0.325 ± 0.371
TrpIle: 0.433 ± 0.224
TrpLys: 0.433 ± 0.358
TrpLeu: 1.407 ± 0.861
TrpMet: 0.108 ± 0.056
TrpAsn: 1.082 ± 0.679
TrpPro: 0.649 ± 0.44
TrpGln: 0.108 ± 0.056
TrpArg: 0.541 ± 0.353
TrpSer: 0.974 ± 0.307
TrpThr: 0.433 ± 0.364
TrpVal: 0.758 ± 0.487
TrpTrp: 0.108 ± 0.056
TrpTyr: 0.758 ± 0.423
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.03 ± 0.686
TyrCys: 1.407 ± 0.461
TyrAsp: 2.814 ± 1.285
TyrGlu: 1.84 ± 0.474
TyrPhe: 2.597 ± 0.3
TyrGly: 3.139 ± 0.621
TyrHis: 1.19 ± 0.258
TyrIle: 1.84 ± 0.444
TyrLys: 2.489 ± 1.548
TyrLeu: 3.139 ± 0.537
TyrMet: 0.974 ± 0.382
TyrAsn: 2.706 ± 0.85
TyrPro: 0.974 ± 0.503
TyrGln: 0.758 ± 0.437
TyrArg: 2.489 ± 1.014
TyrSer: 2.597 ± 0.981
TyrThr: 1.84 ± 0.474
TyrVal: 4.545 ± 1.23
TyrTrp: 0.758 ± 0.37
TyrTyr: 3.355 ± 1.779
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (9241 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
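A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the 100 replicates is taken as the error. Using the standard deviation as the error measure, the function names, the fixed seed, and the toy sequences are all assumptions for illustration; the database's exact procedure is not given here.

```python
import random
import statistics
from collections import Counter


def pair_permille(sequences, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille, pooled over proteins."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as the proteome has, with replacement.
        resample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_permille(resample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy sequences, not taken from the proteome above.
toy_proteome = ["MAAVLAAK", "AVAVGG", "MKKLAA"]
error = bootstrap_error(toy_proteome, "AA")
```

Because whole proteins are the resampling unit, a proteome with only a few proteins (6 here) can produce errors that are large relative to the frequency itself.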

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski