Amino acid dipeptide frequency for Beatrice Hill virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
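As an illustration of the idea, the following minimal Python sketch counts dipeptides in a set of protein sequences. It is not the code used by Proteome-pI: the function names `dipeptide_counts` and `dipeptide_frequencies`, the one-letter sequence encoding, the use of overlapping windows, and the normalization by the total number of counted dipeptides are all assumptions made for this example.

```python
from collections import Counter


def dipeptide_counts(proteins):
    """Count overlapping adjacent residue pairs within each protein.

    Assumption: sequences are one-letter-code strings, and pairs are
    never formed across the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # zip pairs residue i with residue i + 1
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    return counts


def dipeptide_frequencies(proteins):
    """Return {dipeptide: fraction of all counted dipeptides}."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the proteome)
toy = ["MKKLA", "AKK"]
freqs = dipeptide_frequencies(toy)
```

In the toy input, the pair KK occurs twice among six counted dipeptides, so its frequency is one third.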

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
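The arithmetic behind the uniform expectation and the per mille unit can be sketched as follows. The helper names (`expected_permille`, `permille_to_fraction`, `classify`) are hypothetical, and using the uniform expectation as the threshold between over- and underrepresented dipeptides is an assumption; the exact rule behind the colouring is not stated here. The two observed values are taken from the table below.

```python
def expected_permille(n_residue_types=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / n_residue_types ** 2


def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3


def classify(observed_permille, expected=None):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if expected is None:
        expected = expected_permille()
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"


# Two entries copied from the table (values in per mille)
observed = {"LeuIle": 9.969, "AlaAla": 0.712}
labels = {pair: classify(value) for pair, value in observed.items()}
```

With 20 residue types the expectation is 2.5‰; with Xaa included as a 21st type it drops to 1000/441, roughly 2.27‰.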

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.712 ± 0.38
AlaCys: 1.187 ± 0.574
AlaAsp: 2.136 ± 0.421
AlaGlu: 0.949 ± 0.563
AlaPhe: 1.424 ± 0.614
AlaGly: 0.237 ± 0.262
AlaHis: 0.949 ± 0.481
AlaIle: 2.136 ± 0.606
AlaLys: 1.899 ± 0.952
AlaLeu: 3.798 ± 0.727
AlaMet: 1.187 ± 0.725
AlaAsn: 3.323 ± 0.477
AlaPro: 0.949 ± 0.798
AlaGln: 0.949 ± 0.363
AlaArg: 1.424 ± 0.53
AlaSer: 3.323 ± 1.095
AlaThr: 2.136 ± 0.644
AlaVal: 2.374 ± 0.446
AlaTrp: 0.475 ± 0.281
AlaTyr: 1.899 ± 1.089
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.949 ± 0.531
CysCys: 0.475 ± 0.445
CysAsp: 1.899 ± 0.579
CysGlu: 1.662 ± 0.463
CysPhe: 0.712 ± 0.409
CysGly: 0.475 ± 0.442
CysHis: 1.187 ± 0.909
CysIle: 0.949 ± 0.477
CysLys: 1.187 ± 0.576
CysLeu: 2.611 ± 0.523
CysMet: 0.0 ± 0.0
CysAsn: 1.424 ± 0.729
CysPro: 0.475 ± 0.203
CysGln: 1.424 ± 0.624
CysArg: 1.424 ± 1.07
CysSer: 1.662 ± 0.513
CysThr: 0.712 ± 0.424
CysVal: 0.949 ± 0.313
CysTrp: 0.237 ± 0.136
CysTyr: 1.662 ± 0.48
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.323 ± 0.451
AspCys: 0.712 ± 0.568
AspAsp: 5.222 ± 2.008
AspGlu: 2.374 ± 0.643
AspPhe: 2.611 ± 0.844
AspGly: 3.56 ± 0.798
AspHis: 1.187 ± 0.642
AspIle: 3.798 ± 0.766
AspLys: 2.848 ± 0.571
AspLeu: 7.596 ± 0.684
AspMet: 1.662 ± 0.432
AspAsn: 4.51 ± 0.835
AspPro: 3.56 ± 0.646
AspGln: 2.374 ± 0.996
AspArg: 2.374 ± 0.994
AspSer: 4.51 ± 0.786
AspThr: 3.086 ± 0.536
AspVal: 3.323 ± 1.14
AspTrp: 1.187 ± 0.705
AspTyr: 4.51 ± 0.771
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.187 ± 0.453
GluCys: 1.662 ± 0.519
GluAsp: 5.697 ± 1.428
GluGlu: 2.611 ± 0.473
GluPhe: 2.136 ± 0.895
GluGly: 2.374 ± 0.588
GluHis: 1.662 ± 0.626
GluIle: 4.985 ± 0.867
GluLys: 4.035 ± 0.782
GluLeu: 6.171 ± 1.518
GluMet: 1.899 ± 0.413
GluAsn: 2.611 ± 0.46
GluPro: 2.136 ± 0.492
GluGln: 1.187 ± 0.362
GluArg: 1.424 ± 0.466
GluSer: 3.086 ± 1.135
GluThr: 3.56 ± 1.057
GluVal: 3.323 ± 1.231
GluTrp: 0.237 ± 0.136
GluTyr: 0.237 ± 0.136
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.899 ± 0.309
PheCys: 0.712 ± 0.449
PheAsp: 2.374 ± 0.861
PheGlu: 2.374 ± 0.577
PhePhe: 3.086 ± 0.616
PheGly: 1.899 ± 0.56
PheHis: 0.949 ± 0.506
PheIle: 2.611 ± 0.565
PheLys: 5.934 ± 1.669
PheLeu: 2.848 ± 0.933
PheMet: 0.475 ± 0.272
PheAsn: 2.611 ± 0.997
PhePro: 1.424 ± 0.898
PheGln: 1.187 ± 0.53
PheArg: 1.662 ± 0.816
PheSer: 3.086 ± 1.047
PheThr: 2.374 ± 0.838
PheVal: 2.374 ± 0.536
PheTrp: 0.712 ± 0.323
PheTyr: 3.798 ± 1.02
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 0.949 ± 0.315
GlyCys: 0.0 ± 0.0
GlyAsp: 2.848 ± 1.182
GlyGlu: 2.848 ± 0.518
GlyPhe: 2.848 ± 0.435
GlyGly: 3.56 ± 0.725
GlyHis: 1.187 ± 0.338
GlyIle: 3.798 ± 0.626
GlyLys: 2.374 ± 1.171
GlyLeu: 7.596 ± 1.273
GlyMet: 0.712 ± 0.368
GlyAsn: 2.848 ± 0.578
GlyPro: 2.136 ± 0.772
GlyGln: 1.662 ± 0.518
GlyArg: 1.187 ± 0.456
GlySer: 4.985 ± 1.144
GlyThr: 3.323 ± 0.779
GlyVal: 2.136 ± 1.368
GlyTrp: 0.949 ± 0.439
GlyTyr: 2.374 ± 0.499
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.712 ± 0.735
HisCys: 0.712 ± 0.295
HisAsp: 1.187 ± 0.408
HisGlu: 0.712 ± 0.648
HisPhe: 1.424 ± 0.438
HisGly: 1.187 ± 0.623
HisHis: 0.712 ± 0.503
HisIle: 2.136 ± 1.347
HisLys: 2.611 ± 0.718
HisLeu: 3.086 ± 0.789
HisMet: 0.712 ± 0.347
HisAsn: 1.662 ± 0.669
HisPro: 2.136 ± 0.807
HisGln: 0.949 ± 0.509
HisArg: 1.424 ± 0.418
HisSer: 1.899 ± 0.43
HisThr: 1.187 ± 0.446
HisVal: 1.899 ± 0.43
HisTrp: 0.712 ± 0.323
HisTyr: 0.237 ± 0.136
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.136 ± 0.935
IleCys: 2.611 ± 0.926
IleAsp: 3.798 ± 1.003
IleGlu: 3.323 ± 0.773
IlePhe: 2.848 ± 0.874
IleGly: 4.985 ± 1.255
IleHis: 2.136 ± 0.496
IleIle: 6.646 ± 1.689
IleLys: 9.732 ± 1.47
IleLeu: 8.545 ± 1.194
IleMet: 1.424 ± 0.488
IleAsn: 7.596 ± 1.096
IlePro: 3.323 ± 1.028
IleGln: 2.848 ± 0.558
IleArg: 4.035 ± 0.741
IleSer: 6.646 ± 1.478
IleThr: 2.374 ± 0.92
IleVal: 2.848 ± 0.76
IleTrp: 0.712 ± 0.226
IleTyr: 3.798 ± 0.897
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.136 ± 1.023
LysCys: 1.899 ± 0.412
LysAsp: 4.51 ± 1.336
LysGlu: 4.747 ± 1.405
LysPhe: 1.899 ± 0.675
LysGly: 3.798 ± 1.23
LysHis: 1.899 ± 1.199
LysIle: 7.596 ± 2.23
LysLys: 4.51 ± 1.105
LysLeu: 8.07 ± 1.388
LysMet: 1.187 ± 0.674
LysAsn: 4.51 ± 0.868
LysPro: 2.611 ± 0.52
LysGln: 2.136 ± 1.282
LysArg: 3.56 ± 0.697
LysSer: 5.459 ± 1.976
LysThr: 2.848 ± 0.786
LysVal: 4.985 ± 1.508
LysTrp: 0.949 ± 0.514
LysTyr: 3.798 ± 0.975
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.459 ± 1.119
LeuCys: 1.899 ± 0.626
LeuAsp: 7.596 ± 1.181
LeuGlu: 5.459 ± 0.986
LeuPhe: 3.56 ± 0.979
LeuGly: 5.934 ± 1.072
LeuHis: 1.662 ± 0.397
LeuIle: 9.969 ± 1.783
LeuLys: 6.171 ± 1.323
LeuLeu: 7.358 ± 1.531
LeuMet: 3.798 ± 0.971
LeuAsn: 6.409 ± 0.901
LeuPro: 4.035 ± 0.571
LeuGln: 3.323 ± 0.746
LeuArg: 4.985 ± 1.023
LeuSer: 8.308 ± 0.896
LeuThr: 6.646 ± 1.278
LeuVal: 5.697 ± 1.177
LeuTrp: 0.475 ± 0.524
LeuTyr: 3.56 ± 0.476
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.475 ± 0.272
MetCys: 0.712 ± 0.587
MetAsp: 1.424 ± 0.514
MetGlu: 1.662 ± 0.346
MetPhe: 1.424 ± 0.729
MetGly: 0.712 ± 0.302
MetHis: 0.949 ± 0.563
MetIle: 2.848 ± 0.591
MetLys: 2.611 ± 1.248
MetLeu: 2.374 ± 0.701
MetMet: 0.475 ± 0.442
MetAsn: 1.899 ± 0.701
MetPro: 0.237 ± 0.315
MetGln: 0.0 ± 0.0
MetArg: 0.712 ± 0.226
MetSer: 1.187 ± 0.606
MetThr: 2.136 ± 0.626
MetVal: 0.712 ± 0.366
MetTrp: 0.0 ± 0.0
MetTyr: 0.475 ± 0.361
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.899 ± 0.542
AsnCys: 1.899 ± 0.657
AsnAsp: 2.848 ± 0.619
AsnGlu: 4.51 ± 1.034
AsnPhe: 3.56 ± 0.815
AsnGly: 2.611 ± 0.523
AsnHis: 1.899 ± 0.664
AsnIle: 6.171 ± 1.076
AsnLys: 3.798 ± 1.071
AsnLeu: 8.782 ± 0.728
AsnMet: 1.187 ± 0.427
AsnAsn: 6.646 ± 1.245
AsnPro: 3.323 ± 0.958
AsnGln: 1.899 ± 0.481
AsnArg: 3.086 ± 0.634
AsnSer: 2.848 ± 0.65
AsnThr: 4.51 ± 0.594
AsnVal: 2.848 ± 1.074
AsnTrp: 1.662 ± 0.553
AsnTyr: 4.51 ± 0.749
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.424 ± 0.587
ProCys: 0.0 ± 0.0
ProAsp: 2.374 ± 0.593
ProGlu: 3.086 ± 0.625
ProPhe: 1.424 ± 0.957
ProGly: 1.662 ± 1.277
ProHis: 1.424 ± 0.452
ProIle: 2.848 ± 0.645
ProLys: 3.323 ± 0.662
ProLeu: 3.086 ± 0.44
ProMet: 0.237 ± 0.136
ProAsn: 3.323 ± 0.983
ProPro: 0.949 ± 0.412
ProGln: 1.187 ± 0.471
ProArg: 0.712 ± 0.409
ProSer: 4.747 ± 0.91
ProThr: 2.374 ± 0.852
ProVal: 2.374 ± 0.522
ProTrp: 0.712 ± 0.33
ProTyr: 2.374 ± 0.451
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.949 ± 0.467
GlnCys: 0.712 ± 0.33
GlnAsp: 2.611 ± 1.161
GlnGlu: 2.136 ± 0.669
GlnPhe: 1.424 ± 0.624
GlnGly: 1.662 ± 0.6
GlnHis: 1.899 ± 0.818
GlnIle: 2.611 ± 0.516
GlnLys: 2.374 ± 0.498
GlnLeu: 2.136 ± 0.436
GlnMet: 0.475 ± 0.459
GlnAsn: 2.136 ± 0.814
GlnPro: 0.949 ± 0.547
GlnGln: 0.475 ± 0.257
GlnArg: 1.187 ± 0.397
GlnSer: 1.662 ± 0.788
GlnThr: 2.136 ± 0.614
GlnVal: 1.424 ± 0.545
GlnTrp: 0.712 ± 0.295
GlnTyr: 0.712 ± 0.563
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.611 ± 0.536
ArgCys: 0.949 ± 0.407
ArgAsp: 0.712 ± 0.268
ArgGlu: 1.899 ± 0.571
ArgPhe: 2.611 ± 0.553
ArgGly: 2.374 ± 0.546
ArgHis: 1.424 ± 0.584
ArgIle: 2.374 ± 0.314
ArgLys: 3.086 ± 1.347
ArgLeu: 3.323 ± 0.814
ArgMet: 0.712 ± 0.597
ArgAsn: 3.798 ± 0.74
ArgPro: 1.424 ± 0.347
ArgGln: 1.187 ± 0.284
ArgArg: 0.237 ± 0.136
ArgSer: 4.51 ± 0.583
ArgThr: 3.56 ± 0.465
ArgVal: 0.712 ± 0.385
ArgTrp: 0.949 ± 0.346
ArgTyr: 1.899 ± 0.767
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.848 ± 0.692
SerCys: 2.848 ± 0.747
SerAsp: 6.646 ± 1.385
SerGlu: 4.272 ± 1.828
SerPhe: 2.611 ± 0.704
SerGly: 4.747 ± 1.254
SerHis: 1.662 ± 0.436
SerIle: 6.883 ± 1.01
SerLys: 3.56 ± 1.31
SerLeu: 8.07 ± 2.019
SerMet: 1.662 ± 0.891
SerAsn: 3.798 ± 0.82
SerPro: 3.086 ± 0.704
SerGln: 3.323 ± 0.418
SerArg: 3.56 ± 1.142
SerSer: 6.171 ± 1.763
SerThr: 3.323 ± 0.883
SerVal: 2.848 ± 0.53
SerTrp: 2.136 ± 0.516
SerTyr: 2.848 ± 0.74
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.187 ± 0.71
ThrCys: 1.424 ± 0.645
ThrAsp: 3.56 ± 1.006
ThrGlu: 1.899 ± 0.763
ThrPhe: 2.136 ± 1.009
ThrGly: 3.323 ± 0.554
ThrHis: 2.374 ± 1.044
ThrIle: 4.51 ± 0.894
ThrLys: 5.222 ± 1.008
ThrLeu: 5.222 ± 0.739
ThrMet: 1.662 ± 0.42
ThrAsn: 3.086 ± 1.398
ThrPro: 1.662 ± 0.354
ThrGln: 0.949 ± 0.884
ThrArg: 3.323 ± 0.905
ThrSer: 4.747 ± 0.826
ThrThr: 2.848 ± 0.453
ThrVal: 2.611 ± 0.803
ThrTrp: 1.899 ± 0.579
ThrTyr: 1.662 ± 0.336
ThrXaa: 0.0 ± 0.0
Val
ValAla: 0.712 ± 0.226
ValCys: 0.475 ± 0.281
ValAsp: 3.323 ± 1.528
ValGlu: 2.374 ± 0.896
ValPhe: 2.136 ± 0.735
ValGly: 2.374 ± 0.615
ValHis: 0.712 ± 0.268
ValIle: 4.985 ± 1.079
ValLys: 3.56 ± 1.705
ValLeu: 5.459 ± 0.936
ValMet: 1.899 ± 0.727
ValAsn: 4.272 ± 0.792
ValPro: 1.899 ± 0.832
ValGln: 1.899 ± 0.421
ValArg: 1.187 ± 0.427
ValSer: 3.798 ± 0.667
ValThr: 4.035 ± 0.577
ValVal: 0.712 ± 0.549
ValTrp: 0.712 ± 0.385
ValTyr: 2.136 ± 0.912
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.712 ± 0.694
TrpCys: 0.0 ± 0.0
TrpAsp: 0.475 ± 0.203
TrpGlu: 0.712 ± 0.449
TrpPhe: 0.949 ± 0.531
TrpGly: 0.949 ± 0.545
TrpHis: 0.237 ± 0.136
TrpIle: 1.662 ± 0.921
TrpLys: 1.187 ± 0.478
TrpLeu: 1.424 ± 0.463
TrpMet: 0.949 ± 0.459
TrpAsn: 0.949 ± 0.545
TrpPro: 0.712 ± 0.409
TrpGln: 0.237 ± 0.136
TrpArg: 0.949 ± 0.313
TrpSer: 0.712 ± 0.226
TrpThr: 1.187 ± 0.522
TrpVal: 0.949 ± 0.563
TrpTrp: 0.237 ± 0.262
TrpTyr: 0.949 ± 0.286
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.187 ± 0.715
TyrCys: 1.187 ± 0.462
TyrAsp: 3.086 ± 0.507
TyrGlu: 2.611 ± 0.562
TyrPhe: 3.56 ± 0.568
TyrGly: 1.899 ± 0.582
TyrHis: 1.187 ± 0.441
TyrIle: 2.848 ± 0.57
TyrLys: 3.086 ± 0.42
TyrLeu: 4.747 ± 0.967
TyrMet: 0.475 ± 0.445
TyrAsn: 3.086 ± 0.531
TyrPro: 2.611 ± 0.666
TyrGln: 1.187 ± 0.462
TyrArg: 1.899 ± 1.064
TyrSer: 3.798 ± 0.831
TyrThr: 0.949 ± 0.563
TyrVal: 3.56 ± 0.796
TyrTrp: 0.475 ± 0.436
TyrTyr: 3.798 ± 1.241
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (4214 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
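A protein-level bootstrap of this kind could look like the sketch below. Only the number of replicates (100) and the resampling unit (whole proteins) come from the note above; resampling with replacement, reporting the standard deviation across replicates as the error, and the names `dipeptide_permille` and `bootstrap_error` are assumptions made for this example, and one-letter sequences are used for brevity.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins, pair):
    """Frequency of one dipeptide in per mille over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Standard deviation of the frequency across bootstrap replicates.

    Each replicate draws len(proteins) whole proteins with replacement
    (assumed procedure) and recomputes the dipeptide frequency.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_replicates):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy input (hypothetical sequences, not taken from the proteome)
toy = ["MKKLA", "AKKV", "GGS"]
kk_error = bootstrap_error(toy, "KK")
```

Because whole proteins are resampled, a dipeptide concentrated in one or two proteins gets a larger error than one spread evenly across the proteome.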

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski