Amino acid dipepetide frequency for Bahia Grande virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.807AlaAla: 2.807 ± 1.716
0.766AlaCys: 0.766 ± 0.449
2.041AlaAsp: 2.041 ± 0.536
3.317AlaGlu: 3.317 ± 1.661
1.786AlaPhe: 1.786 ± 0.628
3.572AlaGly: 3.572 ± 0.849
1.276AlaHis: 1.276 ± 0.279
4.083AlaIle: 4.083 ± 1.025
2.297AlaLys: 2.297 ± 0.373
3.828AlaLeu: 3.828 ± 1.067
1.276AlaMet: 1.276 ± 0.607
1.531AlaAsn: 1.531 ± 1.106
2.807AlaPro: 2.807 ± 0.925
1.786AlaGln: 1.786 ± 0.559
3.062AlaArg: 3.062 ± 1.0
1.786AlaSer: 1.786 ± 0.498
3.062AlaThr: 3.062 ± 0.579
1.276AlaVal: 1.276 ± 1.238
0.255AlaTrp: 0.255 ± 0.516
2.297AlaTyr: 2.297 ± 1.084
0.0AlaXaa: 0.0 ± 0.0
Cys
0.255CysAla: 0.255 ± 0.328
0.51CysCys: 0.51 ± 0.273
0.766CysAsp: 0.766 ± 0.297
0.255CysGlu: 0.255 ± 0.325
0.51CysPhe: 0.51 ± 0.299
1.021CysGly: 1.021 ± 0.383
0.0CysHis: 0.0 ± 0.0
1.531CysIle: 1.531 ± 0.813
0.51CysLys: 0.51 ± 0.519
1.531CysLeu: 1.531 ± 0.82
0.255CysMet: 0.255 ± 0.15
1.021CysAsn: 1.021 ± 0.576
0.766CysPro: 0.766 ± 0.702
0.255CysGln: 0.255 ± 0.15
0.0CysArg: 0.0 ± 0.0
2.041CysSer: 2.041 ± 0.842
0.0CysThr: 0.0 ± 0.0
0.51CysVal: 0.51 ± 0.349
0.766CysTrp: 0.766 ± 0.449
0.766CysTyr: 0.766 ± 0.378
0.0CysXaa: 0.0 ± 0.0
Asp
1.531AspAla: 1.531 ± 0.251
0.51AspCys: 0.51 ± 0.273
4.848AspAsp: 4.848 ± 1.016
5.359AspGlu: 5.359 ± 1.476
2.297AspPhe: 2.297 ± 0.319
2.807AspGly: 2.807 ± 0.665
1.531AspHis: 1.531 ± 0.366
2.041AspIle: 2.041 ± 0.465
5.869AspLys: 5.869 ± 2.256
6.379AspLeu: 6.379 ± 2.072
1.786AspMet: 1.786 ± 0.498
2.041AspAsn: 2.041 ± 0.263
3.828AspPro: 3.828 ± 0.87
2.297AspGln: 2.297 ± 0.319
1.531AspArg: 1.531 ± 0.576
2.807AspSer: 2.807 ± 0.63
2.807AspThr: 2.807 ± 1.427
2.552AspVal: 2.552 ± 0.306
0.51AspTrp: 0.51 ± 0.278
3.572AspTyr: 3.572 ± 0.971
0.0AspXaa: 0.0 ± 0.0
Glu
3.062GluAla: 3.062 ± 1.069
0.51GluCys: 0.51 ± 0.273
4.593GluAsp: 4.593 ± 1.837
6.379GluGlu: 6.379 ± 1.086
3.317GluPhe: 3.317 ± 1.064
6.379GluGly: 6.379 ± 1.272
1.276GluHis: 1.276 ± 0.849
4.083GluIle: 4.083 ± 0.617
3.828GluLys: 3.828 ± 0.915
7.91GluLeu: 7.91 ± 1.268
1.786GluMet: 1.786 ± 1.081
6.89GluAsn: 6.89 ± 0.818
2.041GluPro: 2.041 ± 0.783
2.552GluGln: 2.552 ± 1.08
4.593GluArg: 4.593 ± 1.184
4.593GluSer: 4.593 ± 0.572
3.828GluThr: 3.828 ± 1.888
6.379GluVal: 6.379 ± 1.306
1.276GluTrp: 1.276 ± 0.695
2.552GluTyr: 2.552 ± 0.522
0.0GluXaa: 0.0 ± 0.0
Phe
1.531PheAla: 1.531 ± 0.636
0.51PheCys: 0.51 ± 0.299
2.807PheAsp: 2.807 ± 0.532
2.041PheGlu: 2.041 ± 0.684
3.317PhePhe: 3.317 ± 0.871
2.297PheGly: 2.297 ± 0.785
0.766PheHis: 0.766 ± 0.297
2.297PheIle: 2.297 ± 0.634
1.276PheLys: 1.276 ± 0.634
5.359PheLeu: 5.359 ± 0.803
0.51PheMet: 0.51 ± 0.278
2.552PheAsn: 2.552 ± 1.375
2.297PhePro: 2.297 ± 0.593
2.041PheGln: 2.041 ± 0.979
2.807PheArg: 2.807 ± 0.643
5.103PheSer: 5.103 ± 1.624
1.276PheThr: 1.276 ± 0.505
3.062PheVal: 3.062 ± 0.541
0.255PheTrp: 0.255 ± 0.325
1.531PheTyr: 1.531 ± 0.82
0.0PheXaa: 0.0 ± 0.0
Gly
1.531GlyAla: 1.531 ± 0.61
1.021GlyCys: 1.021 ± 0.475
3.572GlyAsp: 3.572 ± 0.841
3.317GlyGlu: 3.317 ± 1.322
2.807GlyPhe: 2.807 ± 0.776
2.552GlyGly: 2.552 ± 1.01
1.021GlyHis: 1.021 ± 0.465
4.338GlyIle: 4.338 ± 0.535
4.083GlyLys: 4.083 ± 0.735
7.655GlyLeu: 7.655 ± 1.057
2.041GlyMet: 2.041 ± 0.705
2.552GlyAsn: 2.552 ± 0.665
1.021GlyPro: 1.021 ± 0.598
2.041GlyGln: 2.041 ± 0.728
2.807GlyArg: 2.807 ± 0.675
3.317GlySer: 3.317 ± 1.154
3.317GlyThr: 3.317 ± 0.677
3.062GlyVal: 3.062 ± 0.904
0.766GlyTrp: 0.766 ± 0.297
2.297GlyTyr: 2.297 ± 0.561
0.0GlyXaa: 0.0 ± 0.0
His
0.766HisAla: 0.766 ± 0.303
0.0HisCys: 0.0 ± 0.0
0.51HisAsp: 0.51 ± 0.441
1.531HisGlu: 1.531 ± 0.509
1.531HisPhe: 1.531 ± 0.366
0.51HisGly: 0.51 ± 0.299
1.276HisHis: 1.276 ± 0.478
1.786HisIle: 1.786 ± 0.769
2.807HisLys: 2.807 ± 0.778
2.552HisLeu: 2.552 ± 1.167
0.255HisMet: 0.255 ± 0.15
0.255HisAsn: 0.255 ± 0.325
1.276HisPro: 1.276 ± 0.551
1.276HisGln: 1.276 ± 0.699
2.041HisArg: 2.041 ± 0.513
1.276HisSer: 1.276 ± 0.569
0.766HisThr: 0.766 ± 0.706
0.255HisVal: 0.255 ± 0.382
0.255HisTrp: 0.255 ± 0.15
0.51HisTyr: 0.51 ± 0.299
0.0HisXaa: 0.0 ± 0.0
Ile
3.317IleAla: 3.317 ± 1.056
0.51IleCys: 0.51 ± 0.65
5.103IleAsp: 5.103 ± 0.972
5.614IleGlu: 5.614 ± 0.942
2.807IlePhe: 2.807 ± 0.885
4.593IleGly: 4.593 ± 0.58
1.021IleHis: 1.021 ± 0.765
5.614IleIle: 5.614 ± 1.648
6.124IleLys: 6.124 ± 1.043
6.89IleLeu: 6.89 ± 0.78
1.276IleMet: 1.276 ± 0.551
4.083IleAsn: 4.083 ± 0.782
4.338IlePro: 4.338 ± 1.685
0.766IleGln: 0.766 ± 0.303
4.083IleArg: 4.083 ± 0.882
5.103IleSer: 5.103 ± 1.57
5.614IleThr: 5.614 ± 0.5
4.593IleVal: 4.593 ± 1.317
1.531IleTrp: 1.531 ± 0.472
3.317IleTyr: 3.317 ± 1.407
0.0IleXaa: 0.0 ± 0.0
Lys
3.572LysAla: 3.572 ± 1.051
0.766LysCys: 0.766 ± 0.449
4.848LysAsp: 4.848 ± 0.566
8.165LysGlu: 8.165 ± 2.324
2.807LysPhe: 2.807 ± 0.805
4.083LysGly: 4.083 ± 1.244
2.041LysHis: 2.041 ± 0.779
5.103LysIle: 5.103 ± 0.963
5.614LysLys: 5.614 ± 1.026
5.359LysLeu: 5.359 ± 1.246
1.531LysMet: 1.531 ± 0.366
5.359LysAsn: 5.359 ± 1.029
1.786LysPro: 1.786 ± 1.048
2.041LysGln: 2.041 ± 2.532
3.828LysArg: 3.828 ± 1.006
4.848LysSer: 4.848 ± 0.495
5.869LysThr: 5.869 ± 1.799
3.828LysVal: 3.828 ± 1.434
0.766LysTrp: 0.766 ± 0.378
3.062LysTyr: 3.062 ± 0.978
0.0LysXaa: 0.0 ± 0.0
Leu
6.634LeuAla: 6.634 ± 1.204
1.786LeuCys: 1.786 ± 0.5
6.89LeuAsp: 6.89 ± 0.537
7.4LeuGlu: 7.4 ± 2.138
3.828LeuPhe: 3.828 ± 0.87
5.103LeuGly: 5.103 ± 1.438
2.041LeuHis: 2.041 ± 0.914
8.421LeuIle: 8.421 ± 0.715
5.103LeuLys: 5.103 ± 1.219
6.89LeuLeu: 6.89 ± 1.564
2.041LeuMet: 2.041 ± 0.69
4.848LeuAsn: 4.848 ± 0.639
3.317LeuPro: 3.317 ± 0.919
4.848LeuGln: 4.848 ± 1.443
5.614LeuArg: 5.614 ± 1.227
7.145LeuSer: 7.145 ± 2.021
5.614LeuThr: 5.614 ± 1.264
3.828LeuVal: 3.828 ± 0.852
1.021LeuTrp: 1.021 ± 0.389
2.297LeuTyr: 2.297 ± 0.735
0.0LeuXaa: 0.0 ± 0.0
Met
1.531MetAla: 1.531 ± 0.576
0.0MetCys: 0.0 ± 0.0
1.276MetAsp: 1.276 ± 0.401
1.276MetGlu: 1.276 ± 0.317
1.786MetPhe: 1.786 ± 1.078
1.531MetGly: 1.531 ± 0.467
0.51MetHis: 0.51 ± 0.278
2.807MetIle: 2.807 ± 0.705
1.276MetLys: 1.276 ± 0.356
1.021MetLeu: 1.021 ± 0.389
0.766MetMet: 0.766 ± 0.582
0.51MetAsn: 0.51 ± 0.299
0.766MetPro: 0.766 ± 0.582
1.021MetGln: 1.021 ± 0.913
1.021MetArg: 1.021 ± 0.556
3.062MetSer: 3.062 ± 0.708
1.276MetThr: 1.276 ± 0.849
1.786MetVal: 1.786 ± 0.769
0.0MetTrp: 0.0 ± 0.0
0.51MetTyr: 0.51 ± 0.299
0.0MetXaa: 0.0 ± 0.0
Asn
2.552AsnAla: 2.552 ± 1.283
0.766AsnCys: 0.766 ± 0.333
3.062AsnAsp: 3.062 ± 0.802
5.614AsnGlu: 5.614 ± 1.194
1.531AsnPhe: 1.531 ± 0.251
2.807AsnGly: 2.807 ± 1.641
0.51AsnHis: 0.51 ± 0.349
3.828AsnIle: 3.828 ± 1.3
5.103AsnLys: 5.103 ± 2.76
4.338AsnLeu: 4.338 ± 0.625
1.531AsnMet: 1.531 ± 0.598
3.062AsnAsn: 3.062 ± 0.855
3.317AsnPro: 3.317 ± 1.025
2.041AsnGln: 2.041 ± 0.513
2.552AsnArg: 2.552 ± 0.469
4.338AsnSer: 4.338 ± 1.585
3.317AsnThr: 3.317 ± 1.174
2.041AsnVal: 2.041 ± 1.113
1.021AsnTrp: 1.021 ± 0.547
3.062AsnTyr: 3.062 ± 0.933
0.0AsnXaa: 0.0 ± 0.0
Pro
2.041ProAla: 2.041 ± 0.514
0.255ProCys: 0.255 ± 0.15
3.062ProAsp: 3.062 ± 1.523
1.786ProGlu: 1.786 ± 0.473
0.766ProPhe: 0.766 ± 0.449
1.021ProGly: 1.021 ± 0.457
0.766ProHis: 0.766 ± 0.449
3.828ProIle: 3.828 ± 1.057
2.552ProLys: 2.552 ± 0.522
4.593ProLeu: 4.593 ± 1.034
0.766ProMet: 0.766 ± 0.449
3.572ProAsn: 3.572 ± 0.849
1.786ProPro: 1.786 ± 1.12
1.276ProGln: 1.276 ± 1.237
1.786ProArg: 1.786 ± 0.209
3.317ProSer: 3.317 ± 0.8
2.297ProThr: 2.297 ± 1.296
2.297ProVal: 2.297 ± 1.114
0.51ProTrp: 0.51 ± 0.299
2.297ProTyr: 2.297 ± 0.609
0.0ProXaa: 0.0 ± 0.0
Gln
2.552GlnAla: 2.552 ± 1.08
0.255GlnCys: 0.255 ± 0.325
1.531GlnAsp: 1.531 ± 0.587
3.317GlnGlu: 3.317 ± 1.335
2.041GlnPhe: 2.041 ± 0.447
1.786GlnGly: 1.786 ± 0.638
1.531GlnHis: 1.531 ± 0.677
2.807GlnIle: 2.807 ± 1.339
2.552GlnLys: 2.552 ± 0.818
2.807GlnLeu: 2.807 ± 0.797
1.021GlnMet: 1.021 ± 0.268
2.041GlnAsn: 2.041 ± 0.82
1.786GlnPro: 1.786 ± 1.145
1.531GlnGln: 1.531 ± 1.025
0.766GlnArg: 0.766 ± 0.448
1.531GlnSer: 1.531 ± 0.251
2.552GlnThr: 2.552 ± 1.31
0.766GlnVal: 0.766 ± 0.333
0.766GlnTrp: 0.766 ± 0.706
1.531GlnTyr: 1.531 ± 0.606
0.0GlnXaa: 0.0 ± 0.0
Arg
1.531ArgAla: 1.531 ± 0.834
1.021ArgCys: 1.021 ± 0.457
0.766ArgAsp: 0.766 ± 0.303
4.593ArgGlu: 4.593 ± 0.945
2.552ArgPhe: 2.552 ± 0.578
2.552ArgGly: 2.552 ± 0.749
2.041ArgHis: 2.041 ± 0.669
2.552ArgIle: 2.552 ± 0.996
4.083ArgLys: 4.083 ± 1.561
6.124ArgLeu: 6.124 ± 1.287
1.531ArgMet: 1.531 ± 0.57
3.062ArgAsn: 3.062 ± 0.333
1.276ArgPro: 1.276 ± 0.513
1.786ArgGln: 1.786 ± 0.725
2.552ArgArg: 2.552 ± 0.655
1.276ArgSer: 1.276 ± 0.52
3.062ArgThr: 3.062 ± 0.934
4.338ArgVal: 4.338 ± 0.88
1.531ArgTrp: 1.531 ± 0.898
2.297ArgTyr: 2.297 ± 0.319
0.0ArgXaa: 0.0 ± 0.0
Ser
3.317SerAla: 3.317 ± 0.751
0.766SerCys: 0.766 ± 0.432
5.359SerAsp: 5.359 ± 1.598
5.103SerGlu: 5.103 ± 0.683
2.807SerPhe: 2.807 ± 1.287
3.317SerGly: 3.317 ± 1.412
1.021SerHis: 1.021 ± 0.677
5.359SerIle: 5.359 ± 1.137
7.145SerLys: 7.145 ± 1.273
6.124SerLeu: 6.124 ± 1.371
1.786SerMet: 1.786 ± 0.491
3.572SerAsn: 3.572 ± 0.904
1.786SerPro: 1.786 ± 0.61
2.297SerGln: 2.297 ± 0.994
4.593SerArg: 4.593 ± 0.833
4.848SerSer: 4.848 ± 1.22
3.572SerThr: 3.572 ± 0.663
3.062SerVal: 3.062 ± 0.922
1.276SerTrp: 1.276 ± 0.748
2.552SerTyr: 2.552 ± 0.487
0.0SerXaa: 0.0 ± 0.0
Thr
2.297ThrAla: 2.297 ± 1.335
1.021ThrCys: 1.021 ± 0.658
1.276ThrAsp: 1.276 ± 0.613
3.572ThrGlu: 3.572 ± 1.218
1.021ThrPhe: 1.021 ± 0.389
3.317ThrGly: 3.317 ± 1.031
1.276ThrHis: 1.276 ± 0.279
5.869ThrIle: 5.869 ± 2.036
5.869ThrLys: 5.869 ± 1.23
4.593ThrLeu: 4.593 ± 1.148
0.766ThrMet: 0.766 ± 0.303
3.572ThrAsn: 3.572 ± 1.103
1.786ThrPro: 1.786 ± 0.793
2.297ThrGln: 2.297 ± 0.741
3.062ThrArg: 3.062 ± 1.104
5.103ThrSer: 5.103 ± 0.955
3.572ThrThr: 3.572 ± 1.202
4.083ThrVal: 4.083 ± 1.291
1.276ThrTrp: 1.276 ± 0.505
2.297ThrTyr: 2.297 ± 0.709
0.0ThrXaa: 0.0 ± 0.0
Val
3.062ValAla: 3.062 ± 1.965
1.276ValCys: 1.276 ± 0.317
2.297ValAsp: 2.297 ± 0.691
4.593ValGlu: 4.593 ± 1.411
2.552ValPhe: 2.552 ± 0.912
2.041ValGly: 2.041 ± 0.738
0.255ValHis: 0.255 ± 0.325
4.593ValIle: 4.593 ± 0.63
3.062ValLys: 3.062 ± 1.059
6.124ValLeu: 6.124 ± 1.143
1.276ValMet: 1.276 ± 0.688
2.041ValAsn: 2.041 ± 0.728
2.552ValPro: 2.552 ± 1.235
1.531ValGln: 1.531 ± 0.834
2.297ValArg: 2.297 ± 1.058
4.593ValSer: 4.593 ± 0.696
2.807ValThr: 2.807 ± 0.643
2.807ValVal: 2.807 ± 0.983
0.255ValTrp: 0.255 ± 0.15
2.297ValTyr: 2.297 ± 0.503
0.0ValXaa: 0.0 ± 0.0
Trp
0.51TrpAla: 0.51 ± 0.349
0.255TrpCys: 0.255 ± 0.15
0.51TrpAsp: 0.51 ± 0.278
1.786TrpGlu: 1.786 ± 0.773
1.531TrpPhe: 1.531 ± 0.606
1.021TrpGly: 1.021 ± 0.383
1.021TrpHis: 1.021 ± 0.383
1.276TrpIle: 1.276 ± 0.374
1.786TrpLys: 1.786 ± 0.769
0.51TrpLeu: 0.51 ± 0.273
0.51TrpMet: 0.51 ± 0.441
1.276TrpAsn: 1.276 ± 0.456
0.255TrpPro: 0.255 ± 0.15
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
0.766TrpSer: 0.766 ± 0.378
0.51TrpThr: 0.51 ± 0.273
0.766TrpVal: 0.766 ± 0.297
0.255TrpTrp: 0.255 ± 0.15
0.255TrpTyr: 0.255 ± 0.15
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.51TyrAla: 0.51 ± 0.338
0.766TyrCys: 0.766 ± 0.767
1.786TyrAsp: 1.786 ± 0.793
2.807TyrGlu: 2.807 ± 0.532
2.041TyrPhe: 2.041 ± 0.505
2.807TyrGly: 2.807 ± 0.635
0.255TyrHis: 0.255 ± 0.15
4.083TyrIle: 4.083 ± 1.787
4.338TyrLys: 4.338 ± 1.452
4.083TyrLeu: 4.083 ± 0.719
0.766TyrMet: 0.766 ± 0.494
2.552TyrAsn: 2.552 ± 0.882
1.786TyrPro: 1.786 ± 0.209
1.786TyrGln: 1.786 ± 0.681
1.531TyrArg: 1.531 ± 0.927
2.807TyrSer: 2.807 ± 0.376
2.807TyrThr: 2.807 ± 1.128
1.021TyrVal: 1.021 ± 0.389
0.51TyrTrp: 0.51 ± 0.273
0.255TyrTyr: 0.255 ± 0.15
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (3920 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski