Amino acid dipeptide frequency for Bat polyomavirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
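As a rough illustration of the calculation, the sketch below counts overlapping adjacent residue pairs and reports them in per mille. It is not the database's own code: the function name `dipeptide_permille`, the one-letter toy sequences, and the choice to count pairs only within each protein are all assumptions made for the example.

```python
from collections import Counter


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over all given sequences.

    Assumption: pairs are overlapping and are counted within each
    sequence only, so no pair spans two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input in one-letter code (not the real proteome).
toy = ["MKKLA", "AKK"]
freqs = dipeptide_permille(toy)
```

Here `toy` yields six pairs in total, two of which are `KK`, so `freqs["KK"]` is one third of 1000‰.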

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins and all 20 amino acids were equally frequent, each dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
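The 0.25% figure is simply 1/400, and the same arithmetic gives roughly 0.227% for the 441-pair alphabet. The sketch below works this out in per mille and applies a red/blue rule to a table value. Which of the two baselines the table colouring uses, and how a value exactly equal to the baseline is treated, are not stated in the text, so both are assumptions here, as are the function names.

```python
def expected_permille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def colour(observed_permille, expected):
    """Assumed rule: above the baseline is red, below is blue."""
    if observed_permille > expected:
        return "red"
    if observed_permille < expected:
        return "blue"
    return "none"  # exact tie; handling is an assumption


baseline_20 = expected_permille(20)   # 1000 / 400
baseline_21 = expected_permille(21)   # 1000 / 441

ala_ala = colour(9.747, baseline_20)  # AlaAla value from the table
ala_cys = colour(0.65, baseline_20)   # AlaCys value from the table
```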

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
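For readers who prefer fractions or percentages, the unit conversion can be sketched as follows. The helper names are made up for this example; the input is the AlaAla value from the table.

```python
def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala_fraction = permille_to_fraction(9.747)  # about 0.009747
ala_ala_percent = permille_to_percent(9.747)    # about 0.9747 %
```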

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
9.747AlaAla: 9.747 ± 3.206
0.65AlaCys: 0.65 ± 0.46
3.249AlaAsp: 3.249 ± 1.207
5.198AlaGlu: 5.198 ± 2.565
3.899AlaPhe: 3.899 ± 1.589
3.899AlaGly: 3.899 ± 2.172
1.949AlaHis: 1.949 ± 0.478
2.599AlaIle: 2.599 ± 1.19
4.548AlaLys: 4.548 ± 1.25
5.848AlaLeu: 5.848 ± 1.779
0.65AlaMet: 0.65 ± 0.55
2.599AlaAsn: 2.599 ± 0.813
1.949AlaPro: 1.949 ± 1.886
2.599AlaGln: 2.599 ± 0.932
2.599AlaArg: 2.599 ± 1.603
1.3AlaSer: 1.3 ± 0.778
3.249AlaThr: 3.249 ± 2.186
5.848AlaVal: 5.848 ± 1.672
1.949AlaTrp: 1.949 ± 0.792
1.3AlaTyr: 1.3 ± 0.778
0.0AlaXaa: 0.0 ± 0.0
Cys
0.65CysAla: 0.65 ± 0.629
0.0CysCys: 0.0 ± 0.0
2.599CysAsp: 2.599 ± 1.839
0.0CysGlu: 0.0 ± 0.0
1.949CysPhe: 1.949 ± 1.985
1.949CysGly: 1.949 ± 1.086
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
3.899CysLys: 3.899 ± 1.786
1.949CysLeu: 1.949 ± 1.985
0.65CysMet: 0.65 ± 0.46
0.65CysAsn: 0.65 ± 0.46
0.65CysPro: 0.65 ± 0.46
1.3CysGln: 1.3 ± 0.92
1.3CysArg: 1.3 ± 0.97
1.3CysSer: 1.3 ± 0.92
0.65CysThr: 0.65 ± 0.46
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
1.949CysTyr: 1.949 ± 0.893
0.0CysXaa: 0.0 ± 0.0
Asp
1.949AspAla: 1.949 ± 0.792
1.949AspCys: 1.949 ± 1.082
3.249AspAsp: 3.249 ± 1.702
4.548AspGlu: 4.548 ± 1.949
3.899AspPhe: 3.899 ± 1.28
2.599AspGly: 2.599 ± 0.525
1.949AspHis: 1.949 ± 1.082
1.949AspIle: 1.949 ± 1.282
2.599AspLys: 2.599 ± 1.35
6.498AspLeu: 6.498 ± 2.166
1.3AspMet: 1.3 ± 1.136
2.599AspAsn: 2.599 ± 1.096
2.599AspPro: 2.599 ± 1.864
2.599AspGln: 2.599 ± 0.981
0.0AspArg: 0.0 ± 0.0
1.949AspSer: 1.949 ± 1.086
0.65AspThr: 0.65 ± 0.46
1.949AspVal: 1.949 ± 1.379
1.3AspTrp: 1.3 ± 1.194
2.599AspTyr: 2.599 ± 1.07
0.0AspXaa: 0.0 ± 0.0
Glu
5.848GluAla: 5.848 ± 2.063
0.65GluCys: 0.65 ± 0.46
3.249GluAsp: 3.249 ± 0.797
10.396GluGlu: 10.396 ± 2.643
3.249GluPhe: 3.249 ± 1.617
2.599GluGly: 2.599 ± 1.096
0.0GluHis: 0.0 ± 0.0
2.599GluIle: 2.599 ± 0.849
1.949GluLys: 1.949 ± 1.379
10.396GluLeu: 10.396 ± 1.962
1.949GluMet: 1.949 ± 0.975
3.899GluAsn: 3.899 ± 0.706
1.949GluPro: 1.949 ± 0.792
3.899GluGln: 3.899 ± 1.48
0.65GluArg: 0.65 ± 0.596
5.198GluSer: 5.198 ± 3.07
3.899GluThr: 3.899 ± 2.775
5.848GluVal: 5.848 ± 2.549
0.65GluTrp: 0.65 ± 0.46
0.65GluTyr: 0.65 ± 0.46
0.0GluXaa: 0.0 ± 0.0
Phe
3.249PheAla: 3.249 ± 0.797
3.899PheCys: 3.899 ± 2.164
2.599PheAsp: 2.599 ± 1.839
2.599PheGlu: 2.599 ± 1.07
1.949PhePhe: 1.949 ± 0.792
3.249PheGly: 3.249 ± 1.112
1.949PheHis: 1.949 ± 1.086
2.599PheIle: 2.599 ± 1.35
1.949PheLys: 1.949 ± 0.792
6.498PheLeu: 6.498 ± 1.102
1.3PheMet: 1.3 ± 0.92
1.949PheAsn: 1.949 ± 0.792
4.548PhePro: 4.548 ± 1.394
1.949PheGln: 1.949 ± 1.379
1.3PheArg: 1.3 ± 0.97
5.198PheSer: 5.198 ± 0.661
2.599PheThr: 2.599 ± 0.525
1.3PheVal: 1.3 ± 0.92
0.65PheTrp: 0.65 ± 0.596
1.3PheTyr: 1.3 ± 0.548
0.0PheXaa: 0.0 ± 0.0
Gly
5.198GlyAla: 5.198 ± 1.823
0.0GlyCys: 0.0 ± 0.0
3.249GlyAsp: 3.249 ± 1.961
2.599GlyGlu: 2.599 ± 0.932
1.3GlyPhe: 1.3 ± 0.548
9.097GlyGly: 9.097 ± 1.849
0.0GlyHis: 0.0 ± 0.0
4.548GlyIle: 4.548 ± 1.204
2.599GlyLys: 2.599 ± 1.35
5.198GlyLeu: 5.198 ± 1.228
1.3GlyMet: 1.3 ± 1.257
3.249GlyAsn: 3.249 ± 1.852
6.498GlyPro: 6.498 ± 2.541
2.599GlyGln: 2.599 ± 0.525
1.949GlyArg: 1.949 ± 1.789
1.949GlySer: 1.949 ± 0.792
1.949GlyThr: 1.949 ± 0.478
6.498GlyVal: 6.498 ± 3.041
0.0GlyTrp: 0.0 ± 0.0
0.65GlyTyr: 0.65 ± 0.629
0.0GlyXaa: 0.0 ± 0.0
His
0.65HisAla: 0.65 ± 0.46
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
0.65HisGlu: 0.65 ± 1.065
1.3HisPhe: 1.3 ± 0.548
0.0HisGly: 0.0 ± 0.0
0.0HisHis: 0.0 ± 0.0
0.65HisIle: 0.65 ± 0.596
0.65HisLys: 0.65 ± 0.46
1.3HisLeu: 1.3 ± 0.92
0.0HisMet: 0.0 ± 0.0
0.65HisAsn: 0.65 ± 0.46
1.3HisPro: 1.3 ± 0.97
0.65HisGln: 0.65 ± 0.629
3.249HisArg: 3.249 ± 1.128
2.599HisSer: 2.599 ± 0.525
1.949HisThr: 1.949 ± 0.893
0.0HisVal: 0.0 ± 0.0
0.65HisTrp: 0.65 ± 0.46
0.65HisTyr: 0.65 ± 0.596
0.0HisXaa: 0.0 ± 0.0
Ile
3.899IleAla: 3.899 ± 1.306
1.949IleCys: 1.949 ± 0.792
1.949IleAsp: 1.949 ± 0.8
2.599IleGlu: 2.599 ± 0.981
3.249IlePhe: 3.249 ± 1.617
2.599IleGly: 2.599 ± 1.516
0.65IleHis: 0.65 ± 0.596
1.3IleIle: 1.3 ± 0.548
0.65IleLys: 0.65 ± 0.46
4.548IleLeu: 4.548 ± 0.804
1.3IleMet: 1.3 ± 0.827
1.3IleAsn: 1.3 ± 0.535
3.249IlePro: 3.249 ± 1.617
3.899IleGln: 3.899 ± 0.957
0.65IleArg: 0.65 ± 0.629
5.848IleSer: 5.848 ± 1.786
4.548IleThr: 4.548 ± 1.387
3.249IleVal: 3.249 ± 1.03
0.65IleTrp: 0.65 ± 1.065
1.949IleTyr: 1.949 ± 0.478
0.0IleXaa: 0.0 ± 0.0
Lys
3.899LysAla: 3.899 ± 1.049
2.599LysCys: 2.599 ± 1.94
1.3LysAsp: 1.3 ± 0.548
5.198LysGlu: 5.198 ± 2.7
1.3LysPhe: 1.3 ± 0.92
3.249LysGly: 3.249 ± 1.032
1.3LysHis: 1.3 ± 0.92
5.198LysIle: 5.198 ± 2.043
7.147LysLys: 7.147 ± 1.935
1.949LysLeu: 1.949 ± 0.478
1.949LysMet: 1.949 ± 1.086
2.599LysAsn: 2.599 ± 1.096
0.65LysPro: 0.65 ± 0.596
1.949LysGln: 1.949 ± 1.086
5.848LysArg: 5.848 ± 1.779
1.3LysSer: 1.3 ± 0.92
7.147LysThr: 7.147 ± 2.254
1.949LysVal: 1.949 ± 1.082
0.0LysTrp: 0.0 ± 0.0
0.65LysTyr: 0.65 ± 0.629
0.0LysXaa: 0.0 ± 0.0
Leu
5.848LeuAla: 5.848 ± 2.398
1.949LeuCys: 1.949 ± 0.792
5.198LeuAsp: 5.198 ± 1.568
7.147LeuGlu: 7.147 ± 2.247
7.147LeuPhe: 7.147 ± 1.984
5.198LeuGly: 5.198 ± 1.663
1.3LeuHis: 1.3 ± 0.97
5.198LeuIle: 5.198 ± 0.661
3.249LeuLys: 3.249 ± 1.032
10.396LeuLeu: 10.396 ± 2.658
3.249LeuMet: 3.249 ± 1.755
8.447LeuAsn: 8.447 ± 2.768
9.097LeuPro: 9.097 ± 0.978
3.249LeuGln: 3.249 ± 1.112
3.899LeuArg: 3.899 ± 1.354
7.147LeuSer: 7.147 ± 2.704
8.447LeuThr: 8.447 ± 1.13
2.599LeuVal: 2.599 ± 0.932
0.0LeuTrp: 0.0 ± 0.0
5.848LeuTyr: 5.848 ± 1.959
0.0LeuXaa: 0.0 ± 0.0
Met
1.949MetAla: 1.949 ± 1.282
0.65MetCys: 0.65 ± 0.46
2.599MetAsp: 2.599 ± 1.888
3.249MetGlu: 3.249 ± 2.003
0.65MetPhe: 0.65 ± 0.46
1.949MetGly: 1.949 ± 0.478
0.0MetHis: 0.0 ± 0.0
0.65MetIle: 0.65 ± 0.596
1.3MetLys: 1.3 ± 0.548
3.899MetLeu: 3.899 ± 0.607
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
1.3MetPro: 1.3 ± 0.548
1.3MetGln: 1.3 ± 1.257
1.3MetArg: 1.3 ± 0.97
0.0MetSer: 0.0 ± 0.0
1.3MetThr: 1.3 ± 0.548
1.949MetVal: 1.949 ± 1.082
0.65MetTrp: 0.65 ± 0.629
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.599AsnAla: 2.599 ± 0.525
0.65AsnCys: 0.65 ± 0.46
0.0AsnAsp: 0.0 ± 0.0
3.899AsnGlu: 3.899 ± 2.172
3.899AsnPhe: 3.899 ± 1.354
1.3AsnGly: 1.3 ± 1.257
1.3AsnHis: 1.3 ± 1.139
3.249AsnIle: 3.249 ± 0.551
2.599AsnLys: 2.599 ± 1.839
4.548AsnLeu: 4.548 ± 1.198
1.3AsnMet: 1.3 ± 0.92
0.65AsnAsn: 0.65 ± 0.629
4.548AsnPro: 4.548 ± 1.601
2.599AsnGln: 2.599 ± 1.35
1.3AsnArg: 1.3 ± 0.778
3.899AsnSer: 3.899 ± 1.566
3.249AsnThr: 3.249 ± 1.601
1.949AsnVal: 1.949 ± 0.8
0.0AsnTrp: 0.0 ± 0.0
0.65AsnTyr: 0.65 ± 0.596
0.0AsnXaa: 0.0 ± 0.0
Pro
2.599ProAla: 2.599 ± 1.556
1.3ProCys: 1.3 ± 0.548
5.198ProAsp: 5.198 ± 0.661
3.899ProGlu: 3.899 ± 1.354
1.3ProPhe: 1.3 ± 0.92
5.198ProGly: 5.198 ± 2.446
0.0ProHis: 0.0 ± 0.0
3.899ProIle: 3.899 ± 1.306
5.198ProLys: 5.198 ± 1.848
3.899ProLeu: 3.899 ± 1.262
1.949ProMet: 1.949 ± 0.893
1.949ProAsn: 1.949 ± 0.792
5.848ProPro: 5.848 ± 1.353
2.599ProGln: 2.599 ± 1.603
4.548ProArg: 4.548 ± 3.137
1.3ProSer: 1.3 ± 0.535
3.899ProThr: 3.899 ± 1.786
3.899ProVal: 3.899 ± 2.172
0.65ProTrp: 0.65 ± 1.065
3.899ProTyr: 3.899 ± 1.589
0.0ProXaa: 0.0 ± 0.0
Gln
4.548GlnAla: 4.548 ± 1.198
0.65GlnCys: 0.65 ± 0.46
1.949GlnAsp: 1.949 ± 1.086
4.548GlnGlu: 4.548 ± 1.25
3.249GlnPhe: 3.249 ± 0.838
1.949GlnGly: 1.949 ± 0.478
0.0GlnHis: 0.0 ± 0.0
1.3GlnIle: 1.3 ± 0.92
1.949GlnLys: 1.949 ± 0.8
3.249GlnLeu: 3.249 ± 0.838
0.65GlnMet: 0.65 ± 0.599
2.599GlnAsn: 2.599 ± 1.19
1.3GlnPro: 1.3 ± 1.257
2.599GlnGln: 2.599 ± 1.07
5.198GlnArg: 5.198 ± 1.697
2.599GlnSer: 2.599 ± 0.525
1.3GlnThr: 1.3 ± 0.548
2.599GlnVal: 2.599 ± 1.202
1.949GlnTrp: 1.949 ± 0.478
2.599GlnTyr: 2.599 ± 0.813
0.0GlnXaa: 0.0 ± 0.0
Arg
1.3ArgAla: 1.3 ± 0.778
1.3ArgCys: 1.3 ± 2.13
1.949ArgAsp: 1.949 ± 0.478
1.3ArgGlu: 1.3 ± 0.97
2.599ArgPhe: 2.599 ± 1.19
3.249ArgGly: 3.249 ± 1.49
1.3ArgHis: 1.3 ± 0.92
1.3ArgIle: 1.3 ± 0.535
6.498ArgLys: 6.498 ± 1.353
3.249ArgLeu: 3.249 ± 2.003
0.65ArgMet: 0.65 ± 0.629
1.3ArgAsn: 1.3 ± 0.97
1.3ArgPro: 1.3 ± 0.778
2.599ArgGln: 2.599 ± 0.525
4.548ArgArg: 4.548 ± 1.267
3.899ArgSer: 3.899 ± 2.07
1.949ArgThr: 1.949 ± 1.558
5.198ArgVal: 5.198 ± 0.661
0.65ArgTrp: 0.65 ± 0.596
3.899ArgTyr: 3.899 ± 1.683
0.0ArgXaa: 0.0 ± 0.0
Ser
3.249SerAla: 3.249 ± 1.537
0.65SerCys: 0.65 ± 0.629
3.899SerAsp: 3.899 ± 0.706
1.949SerGlu: 1.949 ± 1.036
2.599SerPhe: 2.599 ± 1.35
1.949SerGly: 1.949 ± 0.8
1.949SerHis: 1.949 ± 0.8
3.249SerIle: 3.249 ± 1.49
1.3SerLys: 1.3 ± 1.257
11.696SerLeu: 11.696 ± 1.8
2.599SerMet: 2.599 ± 0.525
3.249SerAsn: 3.249 ± 0.863
1.949SerPro: 1.949 ± 1.036
3.249SerGln: 3.249 ± 1.032
1.949SerArg: 1.949 ± 0.792
1.3SerSer: 1.3 ± 1.139
3.899SerThr: 3.899 ± 0.774
4.548SerVal: 4.548 ± 1.27
2.599SerTrp: 2.599 ± 1.971
0.0SerTyr: 0.0 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
3.249ThrAla: 3.249 ± 1.537
2.599ThrCys: 2.599 ± 1.202
1.949ThrAsp: 1.949 ± 0.478
3.249ThrGlu: 3.249 ± 1.976
1.949ThrPhe: 1.949 ± 1.036
3.899ThrGly: 3.899 ± 0.907
0.65ThrHis: 0.65 ± 0.46
3.249ThrIle: 3.249 ± 2.026
1.3ThrLys: 1.3 ± 1.257
11.696ThrLeu: 11.696 ± 0.31
0.65ThrMet: 0.65 ± 0.46
0.65ThrAsn: 0.65 ± 0.629
5.198ThrPro: 5.198 ± 1.531
4.548ThrGln: 4.548 ± 2.483
2.599ThrArg: 2.599 ± 0.949
3.249ThrSer: 3.249 ± 1.128
5.848ThrThr: 5.848 ± 1.765
1.949ThrVal: 1.949 ± 1.235
1.3ThrTrp: 1.3 ± 1.194
1.949ThrTyr: 1.949 ± 0.478
0.0ThrXaa: 0.0 ± 0.0
Val
2.599ValAla: 2.599 ± 1.688
0.0ValCys: 0.0 ± 0.0
1.949ValAsp: 1.949 ± 1.379
3.899ValGlu: 3.899 ± 1.683
1.949ValPhe: 1.949 ± 0.975
2.599ValGly: 2.599 ± 1.5
1.3ValHis: 1.3 ± 0.97
3.899ValIle: 3.899 ± 1.28
3.899ValLys: 3.899 ± 1.584
5.848ValLeu: 5.848 ± 0.899
0.0ValMet: 0.0 ± 0.0
3.899ValAsn: 3.899 ± 1.643
4.548ValPro: 4.548 ± 1.409
1.3ValGln: 1.3 ± 0.92
3.899ValArg: 3.899 ± 2.07
5.198ValSer: 5.198 ± 1.25
3.249ValThr: 3.249 ± 1.49
3.249ValVal: 3.249 ± 1.601
0.65ValTrp: 0.65 ± 1.065
1.3ValTyr: 1.3 ± 0.548
0.0ValXaa: 0.0 ± 0.0
Trp
1.949TrpAla: 1.949 ± 1.036
0.0TrpCys: 0.0 ± 0.0
0.65TrpAsp: 0.65 ± 0.46
1.949TrpGlu: 1.949 ± 1.282
1.949TrpPhe: 1.949 ± 1.082
0.65TrpGly: 0.65 ± 1.065
0.0TrpHis: 0.0 ± 0.0
0.65TrpIle: 0.65 ± 0.596
1.949TrpLys: 1.949 ± 1.082
0.0TrpLeu: 0.0 ± 0.0
1.949TrpMet: 1.949 ± 2.183
0.65TrpAsn: 0.65 ± 0.46
1.3TrpPro: 1.3 ± 2.13
0.65TrpGln: 0.65 ± 0.629
0.65TrpArg: 0.65 ± 0.596
0.0TrpSer: 0.0 ± 0.0
0.0TrpThr: 0.0 ± 0.0
0.0TrpVal: 0.0 ± 0.0
0.65TrpTrp: 0.65 ± 0.46
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.65TyrAla: 0.65 ± 0.46
0.0TyrCys: 0.0 ± 0.0
2.599TyrAsp: 2.599 ± 0.932
0.65TyrGlu: 0.65 ± 0.629
3.249TyrPhe: 3.249 ± 1.207
3.249TyrGly: 3.249 ± 0.551
1.3TyrHis: 1.3 ± 0.535
1.949TyrIle: 1.949 ± 1.201
1.949TyrLys: 1.949 ± 1.082
1.949TyrLeu: 1.949 ± 0.8
0.65TyrMet: 0.65 ± 0.46
1.3TyrAsn: 1.3 ± 0.535
3.249TyrPro: 3.249 ± 2.026
0.65TyrGln: 0.65 ± 0.46
2.599TyrArg: 2.599 ± 1.5
2.599TyrSer: 2.599 ± 1.556
1.949TyrThr: 1.949 ± 1.036
0.65TyrVal: 0.65 ± 0.596
0.65TyrTrp: 0.65 ± 0.46
1.949TyrTyr: 1.949 ± 1.789
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1540 amino acids)
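
Each cell in the table above is listed as the displayed value, the dipeptide name, and then the value again with its error (for example `9.747AlaAla: 9.747 ± 3.206`). A minimal sketch for reading such lines back into numbers is given below. The regular expression and the function name `parse_cell` are assumptions based only on the line layout seen here, not on any published file format.

```python
import re

# Layout assumed from the listing: <value><Aaa><Bbb>: <value> ± <error>
CELL = re.compile(
    r"^\d+(?:\.\d+)?"
    r"(?P<first>[A-Z][a-z]{2})(?P<second>[A-Z][a-z]{2}): "
    r"(?P<value>\d+(?:\.\d+)?) ± (?P<error>\d+(?:\.\d+)?)$"
)


def parse_cell(line):
    """Return (first, second, value, error) or None for non-cell lines."""
    match = CELL.match(line.strip())
    if match is None:
        return None  # e.g. a row label such as "Ala"
    return (
        match["first"],
        match["second"],
        float(match["value"]),
        float(match["error"]),
    )


cell = parse_cell("9.747AlaAla: 9.747 ± 3.206")
label = parse_cell("Ala")
```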

Note: The error has been estimated by bootstrapping (×100) at the protein level.
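
One way to read "bootstrapping at the protein level" is that whole proteins are resampled with replacement, the frequency is recomputed for each resampled set, and the spread of those estimates is reported as the error. The sketch below follows that reading. The use of the sample standard deviation, the fixed seed, the helper names, and the toy one-letter sequences are all assumptions; the note does not specify these details.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Per mille frequency of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over resampled protein sets."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome in one-letter code.
toy = ["MKKLA", "AKK", "GGGG"]
kk_error = bootstrap_error(toy, "KK")
```

With only a handful of proteins, as in the four-protein proteome summarised here, a few resampled sets differ strongly from one another, which is one plausible reason the reported errors are large relative to the values.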

You can download the above dipeptide statistics (among other statistics for this proteome) from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski