Amino acid dipepetide frequency for BtMf-AlphaCoV/AH2011

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.87AlaAla: 4.87 ± 1.371
2.165AlaCys: 2.165 ± 0.929
3.139AlaAsp: 3.139 ± 0.374
2.922AlaGlu: 2.922 ± 0.442
4.762AlaPhe: 4.762 ± 1.117
3.03AlaGly: 3.03 ± 0.758
1.082AlaHis: 1.082 ± 0.397
4.437AlaIle: 4.437 ± 1.075
3.896AlaLys: 3.896 ± 1.199
5.411AlaLeu: 5.411 ± 1.085
2.273AlaMet: 2.273 ± 0.457
4.004AlaAsn: 4.004 ± 0.67
2.165AlaPro: 2.165 ± 0.74
1.19AlaGln: 1.19 ± 0.372
2.706AlaArg: 2.706 ± 0.637
5.087AlaSer: 5.087 ± 0.991
3.896AlaThr: 3.896 ± 0.427
5.952AlaVal: 5.952 ± 1.147
0.649AlaTrp: 0.649 ± 0.356
1.948AlaTyr: 1.948 ± 0.456
0.0AlaXaa: 0.0 ± 0.0
Cys
1.948CysAla: 1.948 ± 0.544
1.082CysCys: 1.082 ± 0.52
2.273CysAsp: 2.273 ± 0.849
0.433CysGlu: 0.433 ± 0.134
2.165CysPhe: 2.165 ± 0.369
3.03CysGly: 3.03 ± 0.704
0.649CysHis: 0.649 ± 0.373
1.082CysIle: 1.082 ± 0.315
1.84CysLys: 1.84 ± 0.788
2.489CysLeu: 2.489 ± 0.582
0.433CysMet: 0.433 ± 0.369
2.706CysAsn: 2.706 ± 0.788
0.649CysPro: 0.649 ± 0.196
0.758CysGln: 0.758 ± 0.447
1.19CysArg: 1.19 ± 0.351
1.84CysSer: 1.84 ± 0.495
2.922CysThr: 2.922 ± 0.454
3.139CysVal: 3.139 ± 0.784
0.541CysTrp: 0.541 ± 0.287
2.273CysTyr: 2.273 ± 1.016
0.0CysXaa: 0.0 ± 0.0
Asp
3.571AspAla: 3.571 ± 1.289
2.056AspCys: 2.056 ± 0.631
2.597AspAsp: 2.597 ± 0.407
2.273AspGlu: 2.273 ± 0.478
4.004AspPhe: 4.004 ± 1.034
5.195AspGly: 5.195 ± 1.167
1.082AspHis: 1.082 ± 0.484
3.139AspIle: 3.139 ± 1.049
2.489AspLys: 2.489 ± 0.52
4.221AspLeu: 4.221 ± 0.716
1.19AspMet: 1.19 ± 0.345
4.004AspAsn: 4.004 ± 1.172
1.84AspPro: 1.84 ± 0.542
1.082AspGln: 1.082 ± 0.521
1.082AspArg: 1.082 ± 0.393
3.139AspSer: 3.139 ± 0.778
2.381AspThr: 2.381 ± 0.745
5.411AspVal: 5.411 ± 1.191
0.541AspTrp: 0.541 ± 0.287
3.463AspTyr: 3.463 ± 0.854
0.0AspXaa: 0.0 ± 0.0
Glu
2.489GluAla: 2.489 ± 0.481
1.515GluCys: 1.515 ± 0.443
2.381GluAsp: 2.381 ± 0.57
2.706GluGlu: 2.706 ± 0.794
2.165GluPhe: 2.165 ± 0.246
3.571GluGly: 3.571 ± 1.422
1.732GluHis: 1.732 ± 0.616
1.623GluIle: 1.623 ± 0.474
2.489GluLys: 2.489 ± 0.709
4.113GluLeu: 4.113 ± 0.728
0.433GluMet: 0.433 ± 0.298
1.948GluAsn: 1.948 ± 0.273
1.84GluPro: 1.84 ± 0.644
1.515GluGln: 1.515 ± 0.597
1.299GluArg: 1.299 ± 0.723
1.299GluSer: 1.299 ± 0.666
1.515GluThr: 1.515 ± 0.289
4.87GluVal: 4.87 ± 0.67
0.433GluTrp: 0.433 ± 0.298
1.515GluTyr: 1.515 ± 0.619
0.0GluXaa: 0.0 ± 0.0
Phe
2.922PheAla: 2.922 ± 0.36
2.056PheCys: 2.056 ± 0.675
4.004PheAsp: 4.004 ± 0.646
2.706PheGlu: 2.706 ± 0.582
2.489PhePhe: 2.489 ± 0.892
4.329PheGly: 4.329 ± 0.735
0.541PheHis: 0.541 ± 0.158
3.788PheIle: 3.788 ± 0.959
4.221PheLys: 4.221 ± 1.495
3.03PheLeu: 3.03 ± 0.854
0.866PheMet: 0.866 ± 0.459
5.303PheAsn: 5.303 ± 1.212
0.649PhePro: 0.649 ± 0.231
1.299PheGln: 1.299 ± 0.635
0.649PheArg: 0.649 ± 0.862
4.87PheSer: 4.87 ± 1.683
3.139PheThr: 3.139 ± 0.598
6.926PheVal: 6.926 ± 1.162
0.974PheTrp: 0.974 ± 0.468
2.706PheTyr: 2.706 ± 0.259
0.0PheXaa: 0.0 ± 0.0
Gly
3.571GlyAla: 3.571 ± 1.356
2.381GlyCys: 2.381 ± 0.707
4.762GlyAsp: 4.762 ± 0.662
2.165GlyGlu: 2.165 ± 0.671
4.329GlyPhe: 4.329 ± 1.354
4.545GlyGly: 4.545 ± 0.84
0.974GlyHis: 0.974 ± 0.316
3.03GlyIle: 3.03 ± 1.081
3.247GlyLys: 3.247 ± 0.836
5.087GlyLeu: 5.087 ± 0.432
1.082GlyMet: 1.082 ± 0.521
3.463GlyAsn: 3.463 ± 0.906
1.299GlyPro: 1.299 ± 0.46
1.19GlyGln: 1.19 ± 1.136
2.381GlyArg: 2.381 ± 1.568
5.736GlySer: 5.736 ± 1.757
4.329GlyThr: 4.329 ± 0.959
8.442GlyVal: 8.442 ± 1.357
0.649GlyTrp: 0.649 ± 0.401
2.273GlyTyr: 2.273 ± 0.878
0.0GlyXaa: 0.0 ± 0.0
His
1.732HisAla: 1.732 ± 0.52
0.974HisCys: 0.974 ± 0.517
1.082HisAsp: 1.082 ± 0.574
0.974HisGlu: 0.974 ± 0.414
0.974HisPhe: 0.974 ± 0.533
0.974HisGly: 0.974 ± 0.528
0.216HisHis: 0.216 ± 0.156
0.758HisIle: 0.758 ± 0.241
1.19HisLys: 1.19 ± 0.632
1.407HisLeu: 1.407 ± 1.221
0.216HisMet: 0.216 ± 0.115
0.974HisAsn: 0.974 ± 0.318
0.325HisPro: 0.325 ± 0.172
0.758HisGln: 0.758 ± 0.262
0.541HisArg: 0.541 ± 0.363
1.082HisSer: 1.082 ± 0.315
1.19HisThr: 1.19 ± 0.351
2.165HisVal: 2.165 ± 0.654
0.216HisTrp: 0.216 ± 0.115
0.649HisTyr: 0.649 ± 0.531
0.0HisXaa: 0.0 ± 0.0
Ile
3.247IleAla: 3.247 ± 0.531
0.974IleCys: 0.974 ± 0.723
2.273IleAsp: 2.273 ± 0.692
1.948IleGlu: 1.948 ± 0.533
3.139IlePhe: 3.139 ± 0.546
2.814IleGly: 2.814 ± 0.455
0.974IleHis: 0.974 ± 0.414
2.814IleIle: 2.814 ± 1.741
3.355IleLys: 3.355 ± 0.884
4.221IleLeu: 4.221 ± 0.874
0.974IleMet: 0.974 ± 0.687
4.113IleAsn: 4.113 ± 1.338
2.922IlePro: 2.922 ± 1.895
1.84IleGln: 1.84 ± 0.69
1.515IleArg: 1.515 ± 0.441
5.087IleSer: 5.087 ± 1.444
3.896IleThr: 3.896 ± 1.591
5.411IleVal: 5.411 ± 0.873
0.325IleTrp: 0.325 ± 0.172
1.948IleTyr: 1.948 ± 0.533
0.0IleXaa: 0.0 ± 0.0
Lys
4.113LysAla: 4.113 ± 1.118
1.84LysCys: 1.84 ± 0.788
3.03LysAsp: 3.03 ± 1.184
2.597LysGlu: 2.597 ± 0.48
3.463LysPhe: 3.463 ± 1.165
2.922LysGly: 2.922 ± 0.784
2.381LysHis: 2.381 ± 1.264
2.273LysIle: 2.273 ± 0.672
1.84LysLys: 1.84 ± 0.718
4.978LysLeu: 4.978 ± 1.316
1.19LysMet: 1.19 ± 0.425
2.597LysAsn: 2.597 ± 0.524
4.004LysPro: 4.004 ± 0.822
2.381LysGln: 2.381 ± 0.737
2.273LysArg: 2.273 ± 0.792
3.463LysSer: 3.463 ± 0.839
3.139LysThr: 3.139 ± 0.661
5.087LysVal: 5.087 ± 1.635
0.433LysTrp: 0.433 ± 0.422
3.247LysTyr: 3.247 ± 0.623
0.0LysXaa: 0.0 ± 0.0
Leu
5.519LeuAla: 5.519 ± 1.345
3.139LeuCys: 3.139 ± 0.83
3.355LeuAsp: 3.355 ± 0.606
3.68LeuGlu: 3.68 ± 0.552
3.896LeuPhe: 3.896 ± 1.595
4.978LeuGly: 4.978 ± 1.108
1.623LeuHis: 1.623 ± 0.779
3.139LeuIle: 3.139 ± 1.247
5.844LeuLys: 5.844 ± 1.112
5.519LeuLeu: 5.519 ± 1.339
1.19LeuMet: 1.19 ± 0.398
6.169LeuAsn: 6.169 ± 1.865
4.221LeuPro: 4.221 ± 2.093
3.03LeuGln: 3.03 ± 0.458
3.03LeuArg: 3.03 ± 0.357
6.818LeuSer: 6.818 ± 1.41
5.411LeuThr: 5.411 ± 1.766
7.251LeuVal: 7.251 ± 2.523
1.299LeuTrp: 1.299 ± 1.497
4.004LeuTyr: 4.004 ± 1.042
0.0LeuXaa: 0.0 ± 0.0
Met
1.19MetAla: 1.19 ± 0.888
1.19MetCys: 1.19 ± 0.484
1.082MetAsp: 1.082 ± 0.574
0.866MetGlu: 0.866 ± 0.427
1.732MetPhe: 1.732 ± 0.287
1.299MetGly: 1.299 ± 0.457
0.325MetHis: 0.325 ± 0.172
1.407MetIle: 1.407 ± 0.275
0.108MetLys: 0.108 ± 0.057
2.165MetLeu: 2.165 ± 0.68
0.649MetMet: 0.649 ± 0.196
0.758MetAsn: 0.758 ± 0.307
0.649MetPro: 0.649 ± 0.231
0.541MetGln: 0.541 ± 0.756
1.082MetArg: 1.082 ± 0.315
1.515MetSer: 1.515 ± 0.477
1.19MetThr: 1.19 ± 0.339
0.866MetVal: 0.866 ± 0.292
0.325MetTrp: 0.325 ± 0.374
1.515MetTyr: 1.515 ± 0.668
0.108MetXaa: 0.108 ± 0.057
Asn
4.978AsnAla: 4.978 ± 1.317
2.381AsnCys: 2.381 ± 0.663
2.381AsnAsp: 2.381 ± 0.542
2.273AsnGlu: 2.273 ± 0.415
2.165AsnPhe: 2.165 ± 1.182
5.628AsnGly: 5.628 ± 0.976
0.758AsnHis: 0.758 ± 0.447
4.113AsnIle: 4.113 ± 0.773
3.571AsnLys: 3.571 ± 0.67
5.087AsnLeu: 5.087 ± 1.794
1.082AsnMet: 1.082 ± 0.397
4.113AsnAsn: 4.113 ± 0.714
1.948AsnPro: 1.948 ± 0.911
1.515AsnGln: 1.515 ± 1.272
1.732AsnArg: 1.732 ± 0.585
5.303AsnSer: 5.303 ± 1.485
4.221AsnThr: 4.221 ± 0.863
7.251AsnVal: 7.251 ± 2.236
0.649AsnTrp: 0.649 ± 0.862
2.381AsnTyr: 2.381 ± 0.701
0.0AsnXaa: 0.0 ± 0.0
Pro
2.381ProAla: 2.381 ± 1.554
0.758ProCys: 0.758 ± 0.241
1.84ProAsp: 1.84 ± 0.593
2.489ProGlu: 2.489 ± 1.083
1.623ProPhe: 1.623 ± 0.667
2.273ProGly: 2.273 ± 0.447
1.082ProHis: 1.082 ± 0.792
1.623ProIle: 1.623 ± 0.638
1.948ProLys: 1.948 ± 1.827
3.355ProLeu: 3.355 ± 1.113
0.325ProMet: 0.325 ± 0.172
1.515ProAsn: 1.515 ± 1.278
1.84ProPro: 1.84 ± 0.762
0.758ProGln: 0.758 ± 0.538
1.299ProArg: 1.299 ± 0.722
3.247ProSer: 3.247 ± 0.447
1.948ProThr: 1.948 ± 0.669
3.788ProVal: 3.788 ± 1.288
0.433ProTrp: 0.433 ± 0.134
0.758ProTyr: 0.758 ± 0.307
0.0ProXaa: 0.0 ± 0.0
Gln
2.273GlnAla: 2.273 ± 0.694
0.541GlnCys: 0.541 ± 0.287
1.19GlnAsp: 1.19 ± 0.388
0.758GlnGlu: 0.758 ± 0.538
1.082GlnPhe: 1.082 ± 0.477
1.299GlnGly: 1.299 ± 0.324
0.541GlnHis: 0.541 ± 0.158
1.407GlnIle: 1.407 ± 0.542
0.974GlnLys: 0.974 ± 0.533
4.113GlnLeu: 4.113 ± 1.821
1.19GlnMet: 1.19 ± 0.365
1.299GlnAsn: 1.299 ± 0.613
0.974GlnPro: 0.974 ± 0.26
1.19GlnGln: 1.19 ± 1.412
1.299GlnArg: 1.299 ± 0.793
2.165GlnSer: 2.165 ± 1.251
1.732GlnThr: 1.732 ± 0.412
2.056GlnVal: 2.056 ± 0.496
0.216GlnTrp: 0.216 ± 0.115
1.082GlnTyr: 1.082 ± 0.44
0.0GlnXaa: 0.0 ± 0.0
Arg
2.489ArgAla: 2.489 ± 0.701
1.515ArgCys: 1.515 ± 0.352
1.407ArgAsp: 1.407 ± 0.322
1.19ArgGlu: 1.19 ± 0.461
2.706ArgPhe: 2.706 ± 1.07
1.299ArgGly: 1.299 ± 0.547
0.325ArgHis: 0.325 ± 0.172
1.732ArgIle: 1.732 ± 0.513
1.84ArgLys: 1.84 ± 0.573
3.139ArgLeu: 3.139 ± 2.138
0.974ArgMet: 0.974 ± 0.436
2.597ArgAsn: 2.597 ± 0.846
1.082ArgPro: 1.082 ± 0.471
0.866ArgGln: 0.866 ± 0.292
0.974ArgArg: 0.974 ± 1.149
1.948ArgSer: 1.948 ± 2.443
2.165ArgThr: 2.165 ± 0.735
3.03ArgVal: 3.03 ± 0.622
0.541ArgTrp: 0.541 ± 0.26
1.407ArgTyr: 1.407 ± 0.425
0.0ArgXaa: 0.0 ± 0.0
Ser
4.437SerAla: 4.437 ± 0.657
1.407SerCys: 1.407 ± 0.699
4.978SerAsp: 4.978 ± 0.835
2.814SerGlu: 2.814 ± 0.638
4.004SerPhe: 4.004 ± 1.309
4.87SerGly: 4.87 ± 0.591
0.974SerHis: 0.974 ± 0.318
4.545SerIle: 4.545 ± 0.552
4.221SerLys: 4.221 ± 1.095
5.736SerLeu: 5.736 ± 0.693
1.407SerMet: 1.407 ± 0.369
4.329SerAsn: 4.329 ± 1.921
1.84SerPro: 1.84 ± 0.35
2.056SerGln: 2.056 ± 0.853
2.165SerArg: 2.165 ± 2.074
5.844SerSer: 5.844 ± 0.918
4.654SerThr: 4.654 ± 0.852
8.55SerVal: 8.55 ± 0.919
0.866SerTrp: 0.866 ± 0.553
3.896SerTyr: 3.896 ± 0.816
0.0SerXaa: 0.0 ± 0.0
Thr
3.463ThrAla: 3.463 ± 0.801
1.623ThrCys: 1.623 ± 0.315
2.381ThrAsp: 2.381 ± 0.577
2.165ThrGlu: 2.165 ± 0.785
4.437ThrPhe: 4.437 ± 0.719
3.788ThrGly: 3.788 ± 0.901
0.866ThrHis: 0.866 ± 0.622
3.896ThrIle: 3.896 ± 1.64
2.922ThrLys: 2.922 ± 0.664
5.736ThrLeu: 5.736 ± 0.633
1.948ThrMet: 1.948 ± 0.827
3.788ThrAsn: 3.788 ± 1.126
1.84ThrPro: 1.84 ± 0.875
1.515ThrGln: 1.515 ± 0.882
2.165ThrArg: 2.165 ± 0.477
4.329ThrSer: 4.329 ± 1.083
4.221ThrThr: 4.221 ± 1.162
6.494ThrVal: 6.494 ± 1.589
0.433ThrTrp: 0.433 ± 0.23
1.948ThrTyr: 1.948 ± 0.421
0.0ThrXaa: 0.0 ± 0.0
Val
6.71ValAla: 6.71 ± 0.847
4.004ValCys: 4.004 ± 0.977
7.035ValAsp: 7.035 ± 1.132
4.545ValGlu: 4.545 ± 0.818
5.195ValPhe: 5.195 ± 0.819
5.628ValGly: 5.628 ± 0.687
0.866ValHis: 0.866 ± 0.269
6.494ValIle: 6.494 ± 1.548
8.333ValLys: 8.333 ± 2.748
8.55ValLeu: 8.55 ± 1.794
2.056ValMet: 2.056 ± 2.671
6.061ValAsn: 6.061 ± 1.197
3.571ValPro: 3.571 ± 0.94
3.03ValGln: 3.03 ± 0.454
2.922ValArg: 2.922 ± 0.543
7.251ValSer: 7.251 ± 1.636
5.736ValThr: 5.736 ± 1.285
9.091ValVal: 9.091 ± 2.649
0.758ValTrp: 0.758 ± 0.262
3.03ValTyr: 3.03 ± 0.854
0.0ValXaa: 0.0 ± 0.0
Trp
0.541TrpAla: 0.541 ± 0.867
0.433TrpCys: 0.433 ± 0.134
0.758TrpAsp: 0.758 ± 0.402
0.216TrpGlu: 0.216 ± 0.115
0.758TrpPhe: 0.758 ± 0.332
0.325TrpGly: 0.325 ± 0.374
0.325TrpHis: 0.325 ± 0.273
0.433TrpIle: 0.433 ± 0.23
0.433TrpLys: 0.433 ± 0.264
1.407TrpLeu: 1.407 ± 1.018
0.108TrpMet: 0.108 ± 0.057
0.974TrpAsn: 0.974 ± 0.771
0.649TrpPro: 0.649 ± 0.454
0.108TrpGln: 0.108 ± 0.057
0.649TrpArg: 0.649 ± 0.231
0.974TrpSer: 0.974 ± 0.316
0.433TrpThr: 0.433 ± 0.298
0.758TrpVal: 0.758 ± 0.538
0.108TrpTrp: 0.108 ± 0.057
0.758TrpTyr: 0.758 ± 0.332
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.139TyrAla: 3.139 ± 0.555
1.407TyrCys: 1.407 ± 0.436
2.814TyrAsp: 2.814 ± 1.493
1.732TyrGlu: 1.732 ± 0.528
2.381TyrPhe: 2.381 ± 0.31
3.139TyrGly: 3.139 ± 0.598
1.082TyrHis: 1.082 ± 0.247
2.056TyrIle: 2.056 ± 0.627
2.489TyrLys: 2.489 ± 1.07
3.247TyrLeu: 3.247 ± 0.751
0.866TyrMet: 0.866 ± 0.417
2.814TyrAsn: 2.814 ± 0.84
0.974TyrPro: 0.974 ± 0.517
0.758TyrGln: 0.758 ± 0.391
2.273TyrArg: 2.273 ± 0.95
2.597TyrSer: 2.597 ± 0.937
1.84TyrThr: 1.84 ± 0.437
4.437TyrVal: 4.437 ± 0.939
0.758TyrTrp: 0.758 ± 0.307
3.355TyrTyr: 3.355 ± 1.418
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.108XaaLeu: 0.108 ± 0.057
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (9241 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski