Amino acid dipeptide frequency for Bromus-associated circular DNA virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random with equal probability in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
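The counting described above can be sketched in Python. This is only an illustration under stated assumptions, not the code used by Proteome-pI: the function name `dipeptide_permille`, the one-letter alphabet, the toy sequences, and the choices to count overlapping pairs within each protein (never across protein boundaries) and to pool all proteins before normalising are all assumptions.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids (one-letter codes); X stands for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille}, pooled over all proteins.

    Assumption: overlapping pairs are counted inside each protein only,
    and the total number of pairs is the denominator.
    """
    counts = Counter()
    for seq in proteins:
        # Map any non-standard or unknown residue to X (Xaa).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


# Uniform expectation over the 400 standard dipeptides, in per mille.
expected_uniform = 1000.0 / 400  # 2.5

# Hypothetical toy sequences, not the viral proteome.
toy = ["MAAVK", "MKRRS"]
freqs = dipeptide_permille(toy)
```

The returned dictionary always has 441 entries, so dipeptides that never occur appear explicitly with a frequency of 0.0, as in the table.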

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
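As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and compares it with the uniform expectation of 2.5‰. The helper names are hypothetical, and using exactly 2.5‰ as the threshold for over- or underrepresentation is an assumption; the page does not state the cutoff used for the red and blue marking. The two input values are taken from the table (AlaAla and AlaHis).

```python
# Uniform expectation over 400 dipeptides, in per mille (assumed threshold).
EXPECTED_PERMILLE = 1000.0 / 400  # 2.5


def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def representation(value, expected=EXPECTED_PERMILLE):
    """Label a per mille value relative to the uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


ala_ala = 9.33   # AlaAla value from the table, in per mille
ala_his = 0.848  # AlaHis value from the table, in per mille

ala_ala_fraction = permille_to_fraction(ala_ala)  # about 0.00933, i.e. 0.933%
labels = (representation(ala_ala), representation(ala_his))
```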

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.33 ± 2.287
AlaCys: 1.696 ± 1.055
AlaAsp: 4.241 ± 1.254
AlaGlu: 2.545 ± 0.665
AlaPhe: 4.241 ± 1.496
AlaGly: 5.089 ± 2.322
AlaHis: 0.848 ± 0.654
AlaIle: 1.696 ± 0.543
AlaLys: 5.937 ± 1.711
AlaLeu: 5.089 ± 0.954
AlaMet: 0.848 ± 0.603
AlaAsn: 4.241 ± 1.4
AlaPro: 3.393 ± 1.623
AlaGln: 4.241 ± 1.464
AlaArg: 5.937 ± 1.641
AlaSer: 5.089 ± 1.172
AlaThr: 5.089 ± 2.037
AlaVal: 11.026 ± 1.522
AlaTrp: 0.848 ± 0.907
AlaTyr: 5.089 ± 0.954
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.848 ± 0.907
CysCys: 0.848 ± 0.907
CysAsp: 0.848 ± 0.639
CysGlu: 3.393 ± 1.495
CysPhe: 0.848 ± 0.907
CysGly: 1.696 ± 1.814
CysHis: 0.0 ± 0.0
CysIle: 2.545 ± 1.918
CysLys: 0.848 ± 0.654
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 0.848 ± 0.639
CysPro: 1.696 ± 0.895
CysGln: 0.848 ± 0.766
CysArg: 1.696 ± 1.279
CysSer: 0.848 ± 0.907
CysThr: 0.848 ± 0.639
CysVal: 1.696 ± 0.812
CysTrp: 0.0 ± 0.0
CysTyr: 0.848 ± 0.907
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.696 ± 1.279
AspCys: 0.848 ± 0.654
AspAsp: 4.241 ± 3.197
AspGlu: 1.696 ± 1.055
AspPhe: 2.545 ± 1.49
AspGly: 3.393 ± 0.628
AspHis: 1.696 ± 1.15
AspIle: 3.393 ± 2.557
AspLys: 0.0 ± 0.0
AspLeu: 5.089 ± 1.719
AspMet: 0.0 ± 0.0
AspAsn: 0.848 ± 0.654
AspPro: 1.696 ± 0.543
AspGln: 4.241 ± 2.381
AspArg: 5.937 ± 1.778
AspSer: 2.545 ± 1.962
AspThr: 9.33 ± 1.95
AspVal: 4.241 ± 1.575
AspTrp: 0.848 ± 0.654
AspTyr: 3.393 ± 2.109
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.241 ± 0.875
GluCys: 0.0 ± 0.0
GluAsp: 1.696 ± 1.055
GluGlu: 0.0 ± 0.0
GluPhe: 1.696 ± 1.055
GluGly: 0.0 ± 0.0
GluHis: 3.393 ± 2.73
GluIle: 1.696 ± 1.055
GluLys: 0.0 ± 0.0
GluLeu: 2.545 ± 1.918
GluMet: 0.0 ± 0.0
GluAsn: 0.848 ± 0.766
GluPro: 1.696 ± 1.055
GluGln: 1.696 ± 1.055
GluArg: 6.785 ± 1.255
GluSer: 2.545 ± 0.665
GluThr: 1.696 ± 1.279
GluVal: 2.545 ± 0.99
GluTrp: 0.0 ± 0.0
GluTyr: 0.848 ± 0.654
GluXaa: 0.0 ± 0.0
Phe
PheAla: 5.089 ± 1.772
PheCys: 2.545 ± 1.614
PheAsp: 5.937 ± 1.974
PheGlu: 2.545 ± 1.918
PhePhe: 0.0 ± 0.0
PheGly: 1.696 ± 1.279
PheHis: 0.848 ± 0.654
PheIle: 1.696 ± 0.543
PheLys: 1.696 ± 0.543
PheLeu: 0.0 ± 0.0
PheMet: 0.848 ± 0.57
PheAsn: 0.848 ± 0.639
PhePro: 0.0 ± 0.0
PheGln: 2.545 ± 1.354
PheArg: 4.241 ± 1.464
PheSer: 3.393 ± 2.616
PheThr: 5.089 ± 1.63
PheVal: 2.545 ± 0.99
PheTrp: 0.848 ± 0.907
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 9.33 ± 3.648
GlyCys: 0.848 ± 0.639
GlyAsp: 6.785 ± 2.519
GlyGlu: 1.696 ± 1.814
GlyPhe: 5.937 ± 1.583
GlyGly: 8.482 ± 3.079
GlyHis: 1.696 ± 1.15
GlyIle: 3.393 ± 0.848
GlyLys: 3.393 ± 1.576
GlyLeu: 9.33 ± 2.914
GlyMet: 0.848 ± 0.654
GlyAsn: 4.241 ± 1.46
GlyPro: 3.393 ± 2.036
GlyGln: 2.545 ± 2.721
GlyArg: 5.089 ± 0.997
GlySer: 7.634 ± 2.133
GlyThr: 5.089 ± 1.888
GlyVal: 1.696 ± 0.812
GlyTrp: 0.0 ± 0.0
GlyTyr: 1.696 ± 0.833
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.545 ± 1.354
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 0.0 ± 0.0
HisPhe: 0.848 ± 0.654
HisGly: 0.848 ± 0.639
HisHis: 2.545 ± 1.962
HisIle: 3.393 ± 1.576
HisLys: 0.848 ± 0.766
HisLeu: 2.545 ± 1.26
HisMet: 0.848 ± 0.654
HisAsn: 0.848 ± 0.639
HisPro: 3.393 ± 2.73
HisGln: 0.848 ± 0.907
HisArg: 2.545 ± 2.297
HisSer: 3.393 ± 1.495
HisThr: 0.848 ± 0.654
HisVal: 2.545 ± 1.614
HisTrp: 0.848 ± 0.907
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.089 ± 2.818
IleCys: 0.0 ± 0.0
IleAsp: 1.696 ± 0.895
IleGlu: 2.545 ± 1.354
IlePhe: 0.848 ± 0.639
IleGly: 5.937 ± 3.544
IleHis: 1.696 ± 1.279
IleIle: 1.696 ± 1.279
IleLys: 0.848 ± 0.654
IleLeu: 0.0 ± 0.0
IleMet: 0.848 ± 0.615
IleAsn: 3.393 ± 1.087
IlePro: 1.696 ± 1.308
IleGln: 0.848 ± 0.654
IleArg: 5.937 ± 1.579
IleSer: 4.241 ± 1.464
IleThr: 0.848 ± 0.654
IleVal: 4.241 ± 3.002
IleTrp: 0.0 ± 0.0
IleTyr: 0.848 ± 0.654
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.696 ± 0.812
LysCys: 1.696 ± 0.895
LysAsp: 0.0 ± 0.0
LysGlu: 0.848 ± 0.639
LysPhe: 1.696 ± 1.308
LysGly: 3.393 ± 0.907
LysHis: 0.0 ± 0.0
LysIle: 0.848 ± 0.654
LysLys: 1.696 ± 0.543
LysLeu: 5.937 ± 0.839
LysMet: 0.0 ± 0.0
LysAsn: 0.848 ± 0.639
LysPro: 3.393 ± 0.848
LysGln: 0.848 ± 0.654
LysArg: 5.089 ± 2.037
LysSer: 2.545 ± 1.018
LysThr: 4.241 ± 1.496
LysVal: 1.696 ± 0.812
LysTrp: 0.848 ± 0.654
LysTyr: 1.696 ± 0.543
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.393 ± 1.1
LeuCys: 2.545 ± 1.191
LeuAsp: 4.241 ± 0.473
LeuGlu: 2.545 ± 1.354
LeuPhe: 3.393 ± 1.919
LeuGly: 4.241 ± 1.4
LeuHis: 2.545 ± 1.354
LeuIle: 4.241 ± 2.451
LeuLys: 4.241 ± 1.356
LeuLeu: 8.482 ± 1.928
LeuMet: 0.848 ± 0.639
LeuAsn: 0.848 ± 0.639
LeuPro: 5.937 ± 2.673
LeuGln: 5.089 ± 1.073
LeuArg: 3.393 ± 1.1
LeuSer: 5.937 ± 1.641
LeuThr: 4.241 ± 0.818
LeuVal: 5.089 ± 4.594
LeuTrp: 1.696 ± 1.279
LeuTyr: 4.241 ± 1.496
LeuXaa: 0.0 ± 0.0
Met
MetAla: 4.241 ± 1.291
MetCys: 0.0 ± 0.0
MetAsp: 1.696 ± 0.543
MetGlu: 0.0 ± 0.0
MetPhe: 1.696 ± 0.543
MetGly: 0.0 ± 0.0
MetHis: 1.696 ± 0.812
MetIle: 0.0 ± 0.0
MetLys: 0.0 ± 0.0
MetLeu: 0.0 ± 0.0
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 0.848 ± 0.766
MetGln: 0.0 ± 0.0
MetArg: 0.848 ± 0.654
MetSer: 0.0 ± 0.0
MetThr: 0.0 ± 0.0
MetVal: 0.848 ± 0.766
MetTrp: 0.0 ± 0.0
MetTyr: 0.848 ± 0.654
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 0.848 ± 0.654
AsnCys: 0.848 ± 0.654
AsnAsp: 3.393 ± 1.623
AsnGlu: 1.696 ± 1.308
AsnPhe: 0.0 ± 0.0
AsnGly: 3.393 ± 1.79
AsnHis: 1.696 ± 1.308
AsnIle: 1.696 ± 0.543
AsnLys: 2.545 ± 0.99
AsnLeu: 1.696 ± 0.543
AsnMet: 0.848 ± 0.654
AsnAsn: 0.0 ± 0.0
AsnPro: 2.545 ± 1.018
AsnGln: 1.696 ± 0.812
AsnArg: 0.848 ± 0.639
AsnSer: 2.545 ± 0.576
AsnThr: 3.393 ± 1.242
AsnVal: 2.545 ± 0.576
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.848 ± 0.639
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.937 ± 2.007
ProCys: 0.848 ± 0.907
ProAsp: 5.937 ± 2.673
ProGlu: 3.393 ± 2.109
ProPhe: 1.696 ± 0.543
ProGly: 10.178 ± 3.131
ProHis: 0.848 ± 0.639
ProIle: 2.545 ± 0.576
ProLys: 2.545 ± 0.99
ProLeu: 1.696 ± 1.15
ProMet: 0.0 ± 0.0
ProAsn: 1.696 ± 0.543
ProPro: 3.393 ± 2.036
ProGln: 1.696 ± 1.15
ProArg: 1.696 ± 0.543
ProSer: 2.545 ± 1.191
ProThr: 3.393 ± 1.087
ProVal: 2.545 ± 1.192
ProTrp: 0.0 ± 0.0
ProTyr: 0.848 ± 0.907
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.089 ± 2.897
GlnCys: 1.696 ± 1.055
GlnAsp: 3.393 ± 0.628
GlnGlu: 1.696 ± 0.833
GlnPhe: 2.545 ± 0.576
GlnGly: 2.545 ± 1.731
GlnHis: 1.696 ± 1.055
GlnIle: 1.696 ± 1.279
GlnLys: 0.848 ± 0.907
GlnLeu: 1.696 ± 1.279
GlnMet: 0.848 ± 0.654
GlnAsn: 1.696 ± 0.812
GlnPro: 3.393 ± 0.792
GlnGln: 1.696 ± 0.812
GlnArg: 3.393 ± 0.628
GlnSer: 5.089 ± 2.178
GlnThr: 5.089 ± 1.073
GlnVal: 2.545 ± 1.731
GlnTrp: 0.848 ± 0.639
GlnTyr: 0.848 ± 0.907
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.937 ± 1.379
ArgCys: 1.696 ± 0.895
ArgAsp: 0.0 ± 0.0
ArgGlu: 4.241 ± 0.875
ArgPhe: 3.393 ± 2.036
ArgGly: 7.634 ± 1.748
ArgHis: 2.545 ± 1.86
ArgIle: 4.241 ± 1.254
ArgLys: 1.696 ± 0.543
ArgLeu: 7.634 ± 4.039
ArgMet: 2.545 ± 1.018
ArgAsn: 3.393 ± 1.836
ArgPro: 3.393 ± 0.628
ArgGln: 0.848 ± 0.639
ArgArg: 12.723 ± 6.896
ArgSer: 10.178 ± 2.872
ArgThr: 5.937 ± 2.033
ArgVal: 4.241 ± 1.291
ArgTrp: 0.848 ± 0.654
ArgTyr: 1.696 ± 1.055
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.937 ± 2.05
SerCys: 0.848 ± 0.639
SerAsp: 4.241 ± 2.116
SerGlu: 3.393 ± 1.919
SerPhe: 2.545 ± 1.962
SerGly: 11.026 ± 4.305
SerHis: 0.0 ± 0.0
SerIle: 3.393 ± 1.836
SerLys: 1.696 ± 1.308
SerLeu: 3.393 ± 1.242
SerMet: 0.0 ± 0.0
SerAsn: 1.696 ± 1.531
SerPro: 6.785 ± 1.577
SerGln: 3.393 ± 1.1
SerArg: 5.089 ± 1.569
SerSer: 4.241 ± 2.016
SerThr: 2.545 ± 1.962
SerVal: 5.937 ± 0.839
SerTrp: 0.848 ± 0.639
SerTyr: 2.545 ± 1.018
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.634 ± 3.935
ThrCys: 1.696 ± 1.279
ThrAsp: 2.545 ± 0.99
ThrGlu: 0.848 ± 0.639
ThrPhe: 4.241 ± 1.464
ThrGly: 5.089 ± 1.073
ThrHis: 0.848 ± 0.654
ThrIle: 2.545 ± 1.018
ThrLys: 4.241 ± 1.496
ThrLeu: 7.634 ± 2.531
ThrMet: 0.848 ± 0.654
ThrAsn: 2.545 ± 1.962
ThrPro: 2.545 ± 1.018
ThrGln: 5.089 ± 1.17
ThrArg: 2.545 ± 1.436
ThrSer: 3.393 ± 2.616
ThrThr: 7.634 ± 5.886
ThrVal: 7.634 ± 2.302
ThrTrp: 0.0 ± 0.0
ThrTyr: 1.696 ± 0.543
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.393 ± 1.733
ValCys: 0.848 ± 0.907
ValAsp: 4.241 ± 1.356
ValGlu: 0.848 ± 0.639
ValPhe: 3.393 ± 1.919
ValGly: 5.937 ± 3.115
ValHis: 2.545 ± 1.436
ValIle: 1.696 ± 0.895
ValLys: 4.241 ± 2.016
ValLeu: 7.634 ± 2.0
ValMet: 0.848 ± 0.766
ValAsn: 2.545 ± 1.962
ValPro: 0.848 ± 0.907
ValGln: 6.785 ± 1.255
ValArg: 5.937 ± 3.374
ValSer: 2.545 ± 1.436
ValThr: 5.089 ± 3.924
ValVal: 7.634 ± 1.934
ValTrp: 1.696 ± 1.279
ValTyr: 3.393 ± 2.781
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.696 ± 0.543
TrpCys: 0.848 ± 0.907
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.848 ± 0.654
TrpIle: 0.0 ± 0.0
TrpLys: 0.0 ± 0.0
TrpLeu: 2.545 ± 0.99
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 1.696 ± 1.279
TrpGln: 0.0 ± 0.0
TrpArg: 0.848 ± 0.639
TrpSer: 0.0 ± 0.0
TrpThr: 0.0 ± 0.0
TrpVal: 0.848 ± 0.907
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.545 ± 1.49
TyrCys: 0.848 ± 0.639
TyrAsp: 2.545 ± 1.018
TyrGlu: 0.0 ± 0.0
TyrPhe: 0.848 ± 0.639
TyrGly: 2.545 ± 1.614
TyrHis: 1.696 ± 1.055
TyrIle: 0.848 ± 0.907
TyrLys: 1.696 ± 1.308
TyrLeu: 4.241 ± 0.808
TyrMet: 0.848 ± 0.766
TyrAsn: 1.696 ± 0.543
TyrPro: 1.696 ± 1.814
TyrGln: 3.393 ± 0.792
TyrArg: 4.241 ± 1.949
TyrSer: 0.848 ± 0.766
TyrThr: 0.848 ± 0.654
TyrVal: 0.0 ± 0.0
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1180 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
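A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration under assumptions, not the Proteome-pI implementation: the function names, the fixed random seed, the toy sequences, and the use of the population standard deviation of the 100 resampled estimates as the reported error are all hypothetical choices.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Estimate the error (in per mille) of one dipeptide frequency.

    Whole proteins are resampled with replacement, the pooled frequency is
    recomputed for each resample, and the spread of those estimates is returned.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as the proteome contains, with replacement.
        sample = [rng.choice(per_protein) for _ in per_protein]
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)


# Hypothetical toy sequences, not the viral proteome.
toy = ["MAAVK", "MKRRS"]
err_present = bootstrap_error(toy, "AA")
err_absent = bootstrap_error(toy, "WW")
```

With this scheme, a dipeptide that occurs in none of the proteins has a frequency of 0 in every resample, so its estimated error is exactly 0.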

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski