Amino acid dipeptide frequency for Capybara microvirus Cap1_SP_175

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
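The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own code: the function names, the one-letter alphabet constant, the choice to count overlapping pairs within each protein separately, and the use of exactly 2.5‰ as the red/blue threshold are all assumptions made for the example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (hypothetical helper constant).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation for 400 equally likely dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: pairs never span two proteins, so each sequence is scanned alone.
        for a, b in zip(seq, seq[1:]):
            # Anything outside the 20 standard residues is mapped to X (Xaa).
            a = a if a in STANDARD else "X"
            b = b if b in STANDARD else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille):
    """Label a frequency relative to the uniform expectation (assumed threshold)."""
    if freq_per_mille > EXPECTED_PER_MILLE:
        return "over"   # corresponds to the red cells
    if freq_per_mille < EXPECTED_PER_MILLE:
        return "under"  # corresponds to the blue cells
    return "expected"
```

With this threshold, a value such as 6.135‰ would be labelled "over" and 0.682‰ "under".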

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
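As a small worked example of the unit conversion: a table value of 6.135‰ corresponds to a fraction of 0.006135, or 0.6135%. The helper names below are hypothetical and only illustrate the arithmetic.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (1 per mille = 0.1%)."""
    return value * 0.1
```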

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
6.135AlaAla: 6.135 ± 4.851
0.682AlaCys: 0.682 ± 0.665
5.453AlaAsp: 5.453 ± 1.055
3.408AlaGlu: 3.408 ± 1.248
1.363AlaPhe: 1.363 ± 0.855
3.408AlaGly: 3.408 ± 0.955
2.045AlaHis: 2.045 ± 0.988
5.453AlaIle: 5.453 ± 2.729
6.817AlaLys: 6.817 ± 1.797
2.727AlaLeu: 2.727 ± 1.768
1.363AlaMet: 1.363 ± 0.951
4.09AlaAsn: 4.09 ± 1.677
0.682AlaPro: 0.682 ± 0.427
2.727AlaGln: 2.727 ± 0.961
2.045AlaArg: 2.045 ± 0.718
2.727AlaSer: 2.727 ± 2.49
7.498AlaThr: 7.498 ± 2.255
3.408AlaVal: 3.408 ± 0.93
2.045AlaTrp: 2.045 ± 0.869
6.817AlaTyr: 6.817 ± 1.906
0.0AlaXaa: 0.0 ± 0.0
Cys
0.682CysAla: 0.682 ± 0.427
0.0CysCys: 0.0 ± 0.0
0.682CysAsp: 0.682 ± 0.665
0.682CysGlu: 0.682 ± 0.665
0.0CysPhe: 0.0 ± 0.0
0.682CysGly: 0.682 ± 0.665
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
1.363CysLys: 1.363 ± 0.895
1.363CysLeu: 1.363 ± 0.544
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.682CysPro: 0.682 ± 0.665
1.363CysGln: 1.363 ± 0.999
0.0CysArg: 0.0 ± 0.0
0.682CysSer: 0.682 ± 0.665
0.682CysThr: 0.682 ± 0.427
0.682CysVal: 0.682 ± 0.427
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
5.453AspAla: 5.453 ± 0.668
0.682AspCys: 0.682 ± 0.665
0.682AspAsp: 0.682 ± 0.427
6.817AspGlu: 6.817 ± 1.86
2.727AspPhe: 2.727 ± 1.184
1.363AspGly: 1.363 ± 1.331
1.363AspHis: 1.363 ± 0.855
3.408AspIle: 3.408 ± 1.434
2.727AspLys: 2.727 ± 1.912
2.045AspLeu: 2.045 ± 1.297
0.682AspMet: 0.682 ± 0.795
0.682AspAsn: 0.682 ± 0.427
2.727AspPro: 2.727 ± 1.049
0.682AspGln: 0.682 ± 0.427
1.363AspArg: 1.363 ± 0.855
1.363AspSer: 1.363 ± 1.075
2.727AspThr: 2.727 ± 1.22
0.682AspVal: 0.682 ± 0.427
2.045AspTrp: 2.045 ± 0.988
2.045AspTyr: 2.045 ± 1.282
0.0AspXaa: 0.0 ± 0.0
Glu
6.817GluAla: 6.817 ± 1.906
1.363GluCys: 1.363 ± 0.895
2.727GluAsp: 2.727 ± 0.55
6.817GluGlu: 6.817 ± 2.706
1.363GluPhe: 1.363 ± 0.767
2.727GluGly: 2.727 ± 1.348
4.09GluHis: 4.09 ± 0.973
6.135GluIle: 6.135 ± 1.441
3.408GluLys: 3.408 ± 1.935
3.408GluLeu: 3.408 ± 1.131
5.453GluMet: 5.453 ± 2.21
6.135GluAsn: 6.135 ± 2.834
0.682GluPro: 0.682 ± 0.885
2.727GluGln: 2.727 ± 1.791
4.772GluArg: 4.772 ± 1.427
6.135GluSer: 6.135 ± 3.332
5.453GluThr: 5.453 ± 2.479
2.045GluVal: 2.045 ± 1.138
2.045GluTrp: 2.045 ± 1.138
8.18GluTyr: 8.18 ± 2.03
0.0GluXaa: 0.0 ± 0.0
Phe
3.408PheAla: 3.408 ± 1.213
0.0PheCys: 0.0 ± 0.0
1.363PheAsp: 1.363 ± 0.544
2.045PheGlu: 2.045 ± 1.065
1.363PhePhe: 1.363 ± 0.544
1.363PheGly: 1.363 ± 0.855
0.682PheHis: 0.682 ± 0.427
1.363PheIle: 1.363 ± 0.544
2.727PheLys: 2.727 ± 0.974
2.045PheLeu: 2.045 ± 1.138
0.682PheMet: 0.682 ± 0.795
0.682PheAsn: 0.682 ± 0.427
0.0PhePro: 0.0 ± 0.0
0.0PheGln: 0.0 ± 0.0
2.045PheArg: 2.045 ± 0.718
1.363PheSer: 1.363 ± 0.951
4.09PheThr: 4.09 ± 1.445
0.682PheVal: 0.682 ± 0.427
0.682PheTrp: 0.682 ± 0.427
2.727PheTyr: 2.727 ± 1.049
0.0PheXaa: 0.0 ± 0.0
Gly
3.408GlyAla: 3.408 ± 1.246
0.0GlyCys: 0.0 ± 0.0
3.408GlyAsp: 3.408 ± 1.668
4.772GlyGlu: 4.772 ± 0.984
0.682GlyPhe: 0.682 ± 0.665
6.135GlyGly: 6.135 ± 3.512
1.363GlyHis: 1.363 ± 1.153
5.453GlyIle: 5.453 ± 1.719
4.772GlyLys: 4.772 ± 1.691
5.453GlyLeu: 5.453 ± 2.167
1.363GlyMet: 1.363 ± 0.944
4.09GlyAsn: 4.09 ± 1.473
0.0GlyPro: 0.0 ± 0.0
2.727GlyGln: 2.727 ± 1.327
1.363GlyArg: 1.363 ± 0.624
8.18GlySer: 8.18 ± 1.711
4.09GlyThr: 4.09 ± 0.964
2.727GlyVal: 2.727 ± 1.22
1.363GlyTrp: 1.363 ± 0.624
3.408GlyTyr: 3.408 ± 1.635
0.0GlyXaa: 0.0 ± 0.0
His
1.363HisAla: 1.363 ± 0.895
0.0HisCys: 0.0 ± 0.0
0.682HisAsp: 0.682 ± 0.427
1.363HisGlu: 1.363 ± 0.951
0.0HisPhe: 0.0 ± 0.0
0.682HisGly: 0.682 ± 0.427
0.0HisHis: 0.0 ± 0.0
1.363HisIle: 1.363 ± 0.544
2.045HisLys: 2.045 ± 0.68
0.682HisLeu: 0.682 ± 0.427
0.682HisMet: 0.682 ± 0.427
0.682HisAsn: 0.682 ± 0.427
0.0HisPro: 0.0 ± 0.0
0.682HisGln: 0.682 ± 0.427
0.682HisArg: 0.682 ± 0.795
1.363HisSer: 1.363 ± 1.007
0.682HisThr: 0.682 ± 0.427
0.0HisVal: 0.0 ± 0.0
0.682HisTrp: 0.682 ± 0.427
2.727HisTyr: 2.727 ± 0.651
0.0HisXaa: 0.0 ± 0.0
Ile
2.045IleAla: 2.045 ± 0.718
0.682IleCys: 0.682 ± 0.427
2.045IleAsp: 2.045 ± 1.282
7.498IleGlu: 7.498 ± 3.144
2.727IlePhe: 2.727 ± 1.049
5.453IleGly: 5.453 ± 2.24
0.0IleHis: 0.0 ± 0.0
4.09IleIle: 4.09 ± 1.843
8.18IleLys: 8.18 ± 3.433
4.772IleLeu: 4.772 ± 1.999
1.363IleMet: 1.363 ± 0.951
4.09IleAsn: 4.09 ± 0.973
5.453IlePro: 5.453 ± 2.956
4.09IleGln: 4.09 ± 1.387
6.817IleArg: 6.817 ± 2.799
3.408IleSer: 3.408 ± 1.995
3.408IleThr: 3.408 ± 1.6
0.682IleVal: 0.682 ± 0.713
2.727IleTrp: 2.727 ± 0.974
1.363IleTyr: 1.363 ± 0.895
0.0IleXaa: 0.0 ± 0.0
Lys
3.408LysAla: 3.408 ± 0.93
0.682LysCys: 0.682 ± 0.427
3.408LysAsp: 3.408 ± 2.398
10.225LysGlu: 10.225 ± 4.435
0.682LysPhe: 0.682 ± 0.427
6.135LysGly: 6.135 ± 2.769
0.0LysHis: 0.0 ± 0.0
7.498LysIle: 7.498 ± 2.865
6.817LysLys: 6.817 ± 3.4
4.772LysLeu: 4.772 ± 2.518
4.09LysMet: 4.09 ± 1.187
3.408LysAsn: 3.408 ± 1.531
3.408LysPro: 3.408 ± 2.439
0.682LysGln: 0.682 ± 0.665
2.727LysArg: 2.727 ± 0.871
3.408LysSer: 3.408 ± 0.93
3.408LysThr: 3.408 ± 0.941
1.363LysVal: 1.363 ± 0.855
2.045LysTrp: 2.045 ± 0.515
7.498LysTyr: 7.498 ± 3.929
0.0LysXaa: 0.0 ± 0.0
Leu
4.772LeuAla: 4.772 ± 1.599
0.0LeuCys: 0.0 ± 0.0
2.045LeuAsp: 2.045 ± 1.282
2.045LeuGlu: 2.045 ± 0.515
0.0LeuPhe: 0.0 ± 0.0
4.09LeuGly: 4.09 ± 1.043
0.0LeuHis: 0.0 ± 0.0
3.408LeuIle: 3.408 ± 1.717
6.135LeuLys: 6.135 ± 1.801
4.09LeuLeu: 4.09 ± 1.986
1.363LeuMet: 1.363 ± 1.287
5.453LeuAsn: 5.453 ± 2.269
5.453LeuPro: 5.453 ± 2.525
2.045LeuGln: 2.045 ± 0.869
1.363LeuArg: 1.363 ± 0.624
3.408LeuSer: 3.408 ± 1.48
4.772LeuThr: 4.772 ± 2.286
0.0LeuVal: 0.0 ± 0.0
0.682LeuTrp: 0.682 ± 0.427
3.408LeuTyr: 3.408 ± 1.201
0.0LeuXaa: 0.0 ± 0.0
Met
1.363MetAla: 1.363 ± 0.951
0.0MetCys: 0.0 ± 0.0
2.727MetAsp: 2.727 ± 1.709
4.09MetGlu: 4.09 ± 1.953
1.363MetPhe: 1.363 ± 0.951
2.045MetGly: 2.045 ± 0.907
0.0MetHis: 0.0 ± 0.0
2.045MetIle: 2.045 ± 1.187
4.772MetLys: 4.772 ± 1.676
1.363MetLeu: 1.363 ± 0.895
1.363MetMet: 1.363 ± 0.6
2.727MetAsn: 2.727 ± 1.768
3.408MetPro: 3.408 ± 1.462
0.0MetGln: 0.0 ± 0.0
1.363MetArg: 1.363 ± 0.767
2.045MetSer: 2.045 ± 0.515
1.363MetThr: 1.363 ± 0.788
1.363MetVal: 1.363 ± 1.331
0.682MetTrp: 0.682 ± 0.427
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.408AsnAla: 3.408 ± 1.77
1.363AsnCys: 1.363 ± 1.331
2.045AsnAsp: 2.045 ± 0.515
3.408AsnGlu: 3.408 ± 1.36
0.682AsnPhe: 0.682 ± 0.665
4.772AsnGly: 4.772 ± 1.213
1.363AsnHis: 1.363 ± 0.788
4.772AsnIle: 4.772 ± 1.373
4.09AsnLys: 4.09 ± 1.53
2.045AsnLeu: 2.045 ± 1.282
4.09AsnMet: 4.09 ± 1.513
2.045AsnAsn: 2.045 ± 0.718
0.682AsnPro: 0.682 ± 0.622
3.408AsnGln: 3.408 ± 1.883
4.772AsnArg: 4.772 ± 1.213
5.453AsnSer: 5.453 ± 2.408
1.363AsnThr: 1.363 ± 0.855
2.727AsnVal: 2.727 ± 1.248
2.045AsnTrp: 2.045 ± 0.988
2.727AsnTyr: 2.727 ± 1.089
0.0AsnXaa: 0.0 ± 0.0
Pro
1.363ProAla: 1.363 ± 0.544
0.682ProCys: 0.682 ± 0.665
0.682ProAsp: 0.682 ± 0.427
2.727ProGlu: 2.727 ± 1.358
2.727ProPhe: 2.727 ± 0.974
3.408ProGly: 3.408 ± 1.169
1.363ProHis: 1.363 ± 0.544
2.727ProIle: 2.727 ± 0.55
2.727ProLys: 2.727 ± 0.933
2.727ProLeu: 2.727 ± 1.498
0.0ProMet: 0.0 ± 0.0
2.045ProAsn: 2.045 ± 0.718
1.363ProPro: 1.363 ± 1.331
2.727ProGln: 2.727 ± 1.049
0.682ProArg: 0.682 ± 0.885
2.045ProSer: 2.045 ± 0.869
3.408ProThr: 3.408 ± 1.239
4.09ProVal: 4.09 ± 2.057
0.682ProTrp: 0.682 ± 0.427
0.682ProTyr: 0.682 ± 0.665
0.0ProXaa: 0.0 ± 0.0
Gln
4.09GlnAla: 4.09 ± 1.561
0.682GlnCys: 0.682 ± 0.885
0.682GlnAsp: 0.682 ± 0.622
2.727GlnGlu: 2.727 ± 0.998
2.727GlnPhe: 2.727 ± 0.699
2.045GlnGly: 2.045 ± 0.869
0.0GlnHis: 0.0 ± 0.0
2.727GlnIle: 2.727 ± 0.984
2.045GlnLys: 2.045 ± 1.138
2.045GlnLeu: 2.045 ± 0.869
2.727GlnMet: 2.727 ± 0.651
1.363GlnAsn: 1.363 ± 1.245
0.682GlnPro: 0.682 ± 0.427
3.408GlnGln: 3.408 ± 1.26
4.09GlnArg: 4.09 ± 0.908
2.045GlnSer: 2.045 ± 0.515
4.772GlnThr: 4.772 ± 1.331
0.682GlnVal: 0.682 ± 0.427
0.0GlnTrp: 0.0 ± 0.0
2.727GlnTyr: 2.727 ± 1.55
0.0GlnXaa: 0.0 ± 0.0
Arg
3.408ArgAla: 3.408 ± 0.905
0.0ArgCys: 0.0 ± 0.0
1.363ArgAsp: 1.363 ± 0.624
1.363ArgGlu: 1.363 ± 0.544
3.408ArgPhe: 3.408 ± 1.201
0.682ArgGly: 0.682 ± 0.427
0.0ArgHis: 0.0 ± 0.0
2.727ArgIle: 2.727 ± 1.257
4.09ArgLys: 4.09 ± 3.099
2.727ArgLeu: 2.727 ± 0.974
2.045ArgMet: 2.045 ± 1.292
0.682ArgAsn: 0.682 ± 0.665
2.045ArgPro: 2.045 ± 0.718
2.727ArgGln: 2.727 ± 1.22
2.727ArgArg: 2.727 ± 1.416
2.727ArgSer: 2.727 ± 0.961
6.135ArgThr: 6.135 ± 2.263
2.045ArgVal: 2.045 ± 0.718
0.682ArgTrp: 0.682 ± 0.713
3.408ArgTyr: 3.408 ± 0.85
0.0ArgXaa: 0.0 ± 0.0
Ser
5.453SerAla: 5.453 ± 3.699
1.363SerCys: 1.363 ± 0.855
2.045SerAsp: 2.045 ± 1.285
3.408SerGlu: 3.408 ± 0.519
1.363SerPhe: 1.363 ± 0.544
4.772SerGly: 4.772 ± 2.468
2.045SerHis: 2.045 ± 1.473
4.772SerIle: 4.772 ± 0.867
3.408SerLys: 3.408 ± 2.069
2.727SerLeu: 2.727 ± 0.55
1.363SerMet: 1.363 ± 0.769
4.09SerAsn: 4.09 ± 2.195
4.772SerPro: 4.772 ± 0.867
2.045SerGln: 2.045 ± 1.231
3.408SerArg: 3.408 ± 1.201
12.952SerSer: 12.952 ± 8.902
5.453SerThr: 5.453 ± 4.231
3.408SerVal: 3.408 ± 1.668
2.727SerTrp: 2.727 ± 2.025
2.727SerTyr: 2.727 ± 1.768
0.0SerXaa: 0.0 ± 0.0
Thr
5.453ThrAla: 5.453 ± 1.597
0.0ThrCys: 0.0 ± 0.0
3.408ThrAsp: 3.408 ± 1.736
6.817ThrGlu: 6.817 ± 2.271
0.682ThrPhe: 0.682 ± 0.427
8.18ThrGly: 8.18 ± 2.913
0.682ThrHis: 0.682 ± 0.427
4.09ThrIle: 4.09 ± 1.086
6.135ThrLys: 6.135 ± 1.88
4.09ThrLeu: 4.09 ± 0.9
0.0ThrMet: 0.0 ± 0.0
6.135ThrAsn: 6.135 ± 1.398
3.408ThrPro: 3.408 ± 0.857
2.045ThrGln: 2.045 ± 0.869
2.045ThrArg: 2.045 ± 1.282
6.135ThrSer: 6.135 ± 1.396
4.772ThrThr: 4.772 ± 1.615
4.772ThrVal: 4.772 ± 1.925
1.363ThrTrp: 1.363 ± 1.331
3.408ThrTyr: 3.408 ± 0.85
0.0ThrXaa: 0.0 ± 0.0
Val
2.045ValAla: 2.045 ± 0.515
0.0ValCys: 0.0 ± 0.0
1.363ValAsp: 1.363 ± 0.855
2.727ValGlu: 2.727 ± 0.699
0.682ValPhe: 0.682 ± 0.427
3.408ValGly: 3.408 ± 0.84
0.0ValHis: 0.0 ± 0.0
4.09ValIle: 4.09 ± 1.043
2.045ValLys: 2.045 ± 1.187
0.682ValLeu: 0.682 ± 0.427
1.363ValMet: 1.363 ± 0.855
2.727ValAsn: 2.727 ± 1.22
2.045ValPro: 2.045 ± 0.718
1.363ValGln: 1.363 ± 0.855
0.682ValArg: 0.682 ± 0.427
2.727ValSer: 2.727 ± 1.248
5.453ValThr: 5.453 ± 2.662
0.0ValVal: 0.0 ± 0.0
0.0ValTrp: 0.0 ± 0.0
1.363ValTyr: 1.363 ± 0.788
0.0ValXaa: 0.0 ± 0.0
Trp
2.727TrpAla: 2.727 ± 0.699
0.0TrpCys: 0.0 ± 0.0
1.363TrpAsp: 1.363 ± 0.895
2.727TrpGlu: 2.727 ± 0.998
0.682TrpPhe: 0.682 ± 0.427
2.045TrpGly: 2.045 ± 0.515
0.682TrpHis: 0.682 ± 0.427
1.363TrpIle: 1.363 ± 0.544
0.0TrpLys: 0.0 ± 0.0
0.0TrpLeu: 0.0 ± 0.0
0.0TrpMet: 0.0 ± 0.0
2.045TrpAsn: 2.045 ± 0.889
0.0TrpPro: 0.0 ± 0.0
1.363TrpGln: 1.363 ± 0.544
0.682TrpArg: 0.682 ± 0.713
2.045TrpSer: 2.045 ± 1.171
3.408TrpThr: 3.408 ± 1.239
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
2.045TrpTyr: 2.045 ± 0.869
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.09TyrAla: 4.09 ± 1.032
1.363TyrCys: 1.363 ± 1.331
4.09TyrAsp: 4.09 ± 3.214
6.817TyrGlu: 6.817 ± 2.376
4.09TyrPhe: 4.09 ± 1.26
1.363TyrGly: 1.363 ± 0.769
0.682TyrHis: 0.682 ± 0.665
3.408TyrIle: 3.408 ± 1.736
1.363TyrLys: 1.363 ± 1.331
4.772TyrLeu: 4.772 ± 1.089
3.408TyrMet: 3.408 ± 0.84
4.09TyrAsn: 4.09 ± 0.654
1.363TyrPro: 1.363 ± 0.855
5.453TyrGln: 5.453 ± 1.687
2.045TyrArg: 2.045 ± 0.907
4.09TyrSer: 4.09 ± 1.833
0.682TyrThr: 0.682 ± 0.713
3.408TyrVal: 3.408 ± 0.84
0.682TyrTrp: 0.682 ± 0.795
2.727TyrTyr: 2.727 ± 1.783
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1468 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
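A protein-level bootstrap can be sketched as follows. This is only an illustration under stated assumptions: the page does not say which statistic is reported after the ± sign, so the population standard deviation over the replicates is an assumption here, as are the function names, the fixed seed, and the handling of protein boundaries.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs within one protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the per-mille frequency of `pair` over protein-level resamples."""
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size fixed.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the reported error is the standard deviation of the replicates.
    return statistics.pstdev(estimates)
```

With only 6 proteins, each replicate can differ noticeably from the original set, which is consistent with the fairly large errors reported for some dipeptides in the table.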

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in UniProt.
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski