Amino acid dipeptide frequency for Hypericum japonicum associated circular DNA virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred randomly and with equal probability in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
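The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `dipeptide_permille`, the one-letter sequence input, and the choice to count overlapping pairs within each protein and normalise by the total number of pairs are all assumptions, and the normalisation actually used for the table may differ.

```python
from collections import Counter

def dipeptide_permille(sequences):
    """Map each observed dipeptide to its frequency in per mille."""
    counts = Counter()
    for seq in sequences:
        # Count overlapping pairs inside one protein; pairs never span proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Expectation if all dipeptides were equally likely.
expected_20 = 1000.0 / 20 ** 2   # 400 possibilities
expected_21 = 1000.0 / 21 ** 2   # 441 possibilities, Xaa included

# Hypothetical toy sequences, not the proteome described on this page.
toy_proteins = ["MARRA", "RRS"]
toy_freqs = dipeptide_permille(toy_proteins)
```

With 20 amino acids the uniform expectation is 2.5‰ per dipeptide; including Xaa lowers it to roughly 2.27‰.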

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
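As a small worked example of the unit conversion and of the over/under comparison described above, the following sketch converts table values from per mille to fractions and compares them with the uniform expectation of 2.5‰. The helper names and the simple greater-than or less-than rule are assumptions for illustration; the exact rule used for colouring the table is not stated on this page.

```python
EXPECTED_PERMILLE = 1000.0 / 400  # uniform expectation for 400 dipeptides

def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3

def classify(value, expected=EXPECTED_PERMILLE):
    """Label a per mille value relative to the uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"

# Values taken from the table on this page.
arg_arg = 17.127
ala_his = 0.0

arg_arg_fraction = permille_to_fraction(arg_arg)   # about 0.017127
arg_arg_fold = arg_arg / EXPECTED_PERMILLE         # about 6.85 times the expectation
labels = {"ArgArg": classify(arg_arg), "AlaHis": classify(ala_his)}
```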

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.854 ± 1.242
AlaCys: 0.951 ± 0.709
AlaAsp: 9.515 ± 3.793
AlaGlu: 6.66 ± 3.101
AlaPhe: 0.951 ± 1.012
AlaGly: 2.854 ± 1.212
AlaHis: 0.0 ± 0.0
AlaIle: 1.903 ± 1.009
AlaLys: 1.903 ± 0.713
AlaLeu: 3.806 ± 1.338
AlaMet: 0.951 ± 0.692
AlaAsn: 1.903 ± 1.015
AlaPro: 1.903 ± 1.009
AlaGln: 4.757 ± 1.865
AlaArg: 3.806 ± 1.094
AlaSer: 3.806 ± 1.094
AlaThr: 2.854 ± 1.212
AlaVal: 1.903 ± 0.713
AlaTrp: 0.951 ± 0.692
AlaTyr: 2.854 ± 1.242
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.951 ± 1.264
CysGlu: 0.0 ± 0.0
CysPhe: 0.951 ± 0.692
CysGly: 0.951 ± 0.709
CysHis: 0.951 ± 0.709
CysIle: 1.903 ± 1.418
CysLys: 0.951 ± 1.264
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 0.951 ± 0.709
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 3.806 ± 1.505
CysThr: 0.951 ± 0.829
CysVal: 0.951 ± 1.264
CysTrp: 0.0 ± 0.0
CysTyr: 1.903 ± 1.015
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.806 ± 1.892
AspCys: 0.951 ± 1.264
AspAsp: 2.854 ± 1.242
AspGlu: 3.806 ± 0.928
AspPhe: 3.806 ± 0.76
AspGly: 10.466 ± 2.438
AspHis: 0.0 ± 0.0
AspIle: 1.903 ± 1.418
AspLys: 1.903 ± 1.009
AspLeu: 2.854 ± 0.729
AspMet: 0.0 ± 0.0
AspAsn: 1.903 ± 1.227
AspPro: 2.854 ± 2.126
AspGln: 1.903 ± 1.227
AspArg: 1.903 ± 0.713
AspSer: 0.951 ± 0.709
AspThr: 2.854 ± 1.54
AspVal: 2.854 ± 1.535
AspTrp: 5.709 ± 1.37
AspTyr: 1.903 ± 0.713
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.903 ± 1.383
GluCys: 1.903 ± 1.418
GluAsp: 3.806 ± 1.279
GluGlu: 1.903 ± 2.025
GluPhe: 1.903 ± 1.418
GluGly: 2.854 ± 1.275
GluHis: 3.806 ± 1.505
GluIle: 0.951 ± 0.692
GluLys: 1.903 ± 0.713
GluLeu: 4.757 ± 3.752
GluMet: 5.709 ± 3.735
GluAsn: 0.951 ± 0.692
GluPro: 3.806 ± 1.324
GluGln: 0.951 ± 0.692
GluArg: 2.854 ± 1.275
GluSer: 1.903 ± 1.173
GluThr: 2.854 ± 1.996
GluVal: 2.854 ± 1.361
GluTrp: 3.806 ± 1.853
GluTyr: 1.903 ± 1.308
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.854 ± 1.242
PheCys: 0.0 ± 0.0
PheAsp: 0.951 ± 0.709
PheGlu: 0.951 ± 1.012
PhePhe: 1.903 ± 0.713
PheGly: 3.806 ± 1.853
PheHis: 1.903 ± 0.907
PheIle: 1.903 ± 0.907
PheLys: 0.0 ± 0.0
PheLeu: 0.951 ± 0.709
PheMet: 0.0 ± 0.0
PheAsn: 3.806 ± 1.234
PhePro: 0.951 ± 0.829
PheGln: 0.951 ± 0.692
PheArg: 4.757 ± 0.858
PheSer: 1.903 ± 1.308
PheThr: 2.854 ± 1.531
PheVal: 0.951 ± 0.709
PheTrp: 0.951 ± 0.709
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.66 ± 2.582
GlyCys: 0.0 ± 0.0
GlyAsp: 5.709 ± 1.384
GlyGlu: 1.903 ± 1.098
GlyPhe: 1.903 ± 1.418
GlyGly: 6.66 ± 3.966
GlyHis: 0.0 ± 0.0
GlyIle: 1.903 ± 1.009
GlyLys: 3.806 ± 1.892
GlyLeu: 8.563 ± 1.584
GlyMet: 2.854 ± 1.527
GlyAsn: 3.806 ± 1.425
GlyPro: 2.854 ± 1.212
GlyGln: 1.903 ± 1.098
GlyArg: 7.612 ± 3.497
GlySer: 3.806 ± 1.558
GlyThr: 2.854 ± 1.275
GlyVal: 4.757 ± 2.404
GlyTrp: 0.951 ± 1.012
GlyTyr: 4.757 ± 1.053
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.903 ± 0.907
HisCys: 0.951 ± 0.829
HisAsp: 0.951 ± 0.709
HisGlu: 2.854 ± 1.242
HisPhe: 0.951 ± 0.709
HisGly: 1.903 ± 0.907
HisHis: 0.951 ± 0.709
HisIle: 0.951 ± 1.264
HisLys: 0.0 ± 0.0
HisLeu: 3.806 ± 1.814
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 1.903 ± 1.009
HisGln: 0.0 ± 0.0
HisArg: 0.951 ± 1.012
HisSer: 2.854 ± 2.311
HisThr: 0.0 ± 0.0
HisVal: 3.806 ± 1.779
HisTrp: 0.951 ± 0.692
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 0.951 ± 0.692
IleCys: 1.903 ± 0.713
IleAsp: 0.0 ± 0.0
IleGlu: 3.806 ± 1.505
IlePhe: 1.903 ± 1.098
IleGly: 3.806 ± 1.324
IleHis: 0.951 ± 1.012
IleIle: 0.0 ± 0.0
IleLys: 3.806 ± 1.892
IleLeu: 6.66 ± 3.39
IleMet: 0.0 ± 0.0
IleAsn: 0.951 ± 0.692
IlePro: 2.854 ± 1.193
IleGln: 0.951 ± 0.829
IleArg: 3.806 ± 1.31
IleSer: 7.612 ± 3.024
IleThr: 1.903 ± 0.907
IleVal: 0.951 ± 0.709
IleTrp: 0.951 ± 0.709
IleTyr: 0.0 ± 0.0
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.854 ± 2.126
LysCys: 0.0 ± 0.0
LysAsp: 1.903 ± 1.418
LysGlu: 2.854 ± 0.943
LysPhe: 1.903 ± 0.713
LysGly: 2.854 ± 1.212
LysHis: 1.903 ± 0.713
LysIle: 0.0 ± 0.0
LysLys: 0.951 ± 0.692
LysLeu: 4.757 ± 3.02
LysMet: 1.903 ± 1.255
LysAsn: 0.951 ± 0.692
LysPro: 3.806 ± 1.425
LysGln: 0.951 ± 0.709
LysArg: 4.757 ± 1.871
LysSer: 1.903 ± 1.308
LysThr: 2.854 ± 0.943
LysVal: 0.951 ± 1.264
LysTrp: 1.903 ± 1.418
LysTyr: 1.903 ± 0.713
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.757 ± 2.503
LeuCys: 0.951 ± 0.829
LeuAsp: 9.515 ± 3.541
LeuGlu: 2.854 ± 1.275
LeuPhe: 2.854 ± 0.729
LeuGly: 5.709 ± 2.548
LeuHis: 6.66 ± 2.537
LeuIle: 4.757 ± 1.043
LeuLys: 1.903 ± 1.015
LeuLeu: 7.612 ± 3.904
LeuMet: 0.0 ± 0.0
LeuAsn: 3.806 ± 2.197
LeuPro: 4.757 ± 2.129
LeuGln: 0.951 ± 1.264
LeuArg: 5.709 ± 3.681
LeuSer: 5.709 ± 2.816
LeuThr: 5.709 ± 3.86
LeuVal: 10.466 ± 2.137
LeuTrp: 3.806 ± 2.968
LeuTyr: 2.854 ± 1.242
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.903 ± 1.015
MetCys: 0.0 ± 0.0
MetAsp: 0.951 ± 1.264
MetGlu: 3.806 ± 2.759
MetPhe: 0.0 ± 0.0
MetGly: 1.903 ± 1.098
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 2.854 ± 1.07
MetLeu: 4.757 ± 3.156
MetMet: 0.951 ± 0.692
MetAsn: 4.757 ± 2.574
MetPro: 1.903 ± 0.907
MetGln: 0.0 ± 0.0
MetArg: 2.854 ± 1.72
MetSer: 2.854 ± 1.996
MetThr: 1.903 ± 1.173
MetVal: 0.951 ± 0.692
MetTrp: 0.951 ± 1.012
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.757 ± 1.027
AsnCys: 0.951 ± 0.709
AsnAsp: 1.903 ± 1.227
AsnGlu: 3.806 ± 1.558
AsnPhe: 0.951 ± 0.709
AsnGly: 3.806 ± 1.841
AsnHis: 0.951 ± 1.012
AsnIle: 1.903 ± 1.227
AsnLys: 0.951 ± 0.692
AsnLeu: 3.806 ± 1.841
AsnMet: 0.951 ± 0.692
AsnAsn: 0.951 ± 0.692
AsnPro: 1.903 ± 1.015
AsnGln: 0.951 ± 0.692
AsnArg: 2.854 ± 2.083
AsnSer: 1.903 ± 1.227
AsnThr: 2.854 ± 1.212
AsnVal: 4.757 ± 2.737
AsnTrp: 0.951 ± 1.012
AsnTyr: 0.0 ± 0.0
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.854 ± 1.212
ProCys: 0.0 ± 0.0
ProAsp: 0.951 ± 0.709
ProGlu: 0.951 ± 0.692
ProPhe: 1.903 ± 1.009
ProGly: 1.903 ± 0.713
ProHis: 0.951 ± 0.709
ProIle: 3.806 ± 1.234
ProLys: 2.854 ± 1.361
ProLeu: 5.709 ± 3.389
ProMet: 2.854 ± 1.193
ProAsn: 4.757 ± 1.452
ProPro: 0.951 ± 0.709
ProGln: 0.951 ± 0.709
ProArg: 3.806 ± 1.094
ProSer: 4.757 ± 2.845
ProThr: 2.854 ± 1.54
ProVal: 3.806 ± 1.324
ProTrp: 0.951 ± 0.709
ProTyr: 0.951 ± 0.709
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.951 ± 0.709
GlnCys: 0.951 ± 0.709
GlnAsp: 0.951 ± 0.692
GlnGlu: 1.903 ± 1.098
GlnPhe: 0.0 ± 0.0
GlnGly: 0.951 ± 0.692
GlnHis: 0.0 ± 0.0
GlnIle: 0.0 ± 0.0
GlnLys: 0.0 ± 0.0
GlnLeu: 2.854 ± 1.242
GlnMet: 0.0 ± 0.0
GlnAsn: 0.951 ± 0.692
GlnPro: 2.854 ± 1.07
GlnGln: 0.0 ± 0.0
GlnArg: 0.951 ± 1.264
GlnSer: 3.806 ± 2.011
GlnThr: 5.709 ± 2.703
GlnVal: 0.0 ± 0.0
GlnTrp: 0.951 ± 0.692
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.854 ± 1.193
ArgCys: 1.903 ± 2.528
ArgAsp: 4.757 ± 1.452
ArgGlu: 3.806 ± 1.505
ArgPhe: 1.903 ± 1.418
ArgGly: 4.757 ± 1.679
ArgHis: 0.951 ± 0.709
ArgIle: 4.757 ± 1.027
ArgLys: 4.757 ± 1.865
ArgLeu: 6.66 ± 2.945
ArgMet: 2.854 ± 2.214
ArgAsn: 1.903 ± 1.51
ArgPro: 3.806 ± 2.231
ArgGln: 0.951 ± 0.692
ArgArg: 17.127 ± 6.145
ArgSer: 7.612 ± 2.078
ArgThr: 11.418 ± 3.984
ArgVal: 3.806 ± 2.019
ArgTrp: 0.0 ± 0.0
ArgTyr: 1.903 ± 0.713
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.806 ± 2.011
SerCys: 0.951 ± 1.264
SerAsp: 3.806 ± 1.966
SerGlu: 0.951 ± 1.264
SerPhe: 3.806 ± 2.011
SerGly: 4.757 ± 1.027
SerHis: 1.903 ± 1.308
SerIle: 8.563 ± 3.31
SerLys: 3.806 ± 1.779
SerLeu: 10.466 ± 3.696
SerMet: 4.757 ± 4.89
SerAsn: 2.854 ± 1.54
SerPro: 1.903 ± 1.564
SerGln: 2.854 ± 1.361
SerArg: 6.66 ± 3.282
SerSer: 8.563 ± 3.112
SerThr: 6.66 ± 1.503
SerVal: 1.903 ± 1.658
SerTrp: 0.951 ± 0.692
SerTyr: 3.806 ± 2.766
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 0.951 ± 0.692
ThrCys: 1.903 ± 1.098
ThrAsp: 1.903 ± 0.713
ThrGlu: 2.854 ± 1.002
ThrPhe: 1.903 ± 1.51
ThrGly: 4.757 ± 1.522
ThrHis: 0.951 ± 1.012
ThrIle: 3.806 ± 2.142
ThrLys: 2.854 ± 2.075
ThrLeu: 3.806 ± 2.107
ThrMet: 2.854 ± 1.534
ThrAsn: 2.854 ± 2.075
ThrPro: 3.806 ± 1.324
ThrGln: 2.854 ± 1.787
ThrArg: 5.709 ± 1.611
ThrSer: 8.563 ± 2.428
ThrThr: 7.612 ± 2.427
ThrVal: 2.854 ± 1.434
ThrTrp: 0.951 ± 0.709
ThrTyr: 4.757 ± 1.679
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.854 ± 0.943
ValCys: 0.0 ± 0.0
ValAsp: 1.903 ± 0.713
ValGlu: 3.806 ± 1.279
ValPhe: 2.854 ± 1.242
ValGly: 5.709 ± 2.404
ValHis: 1.903 ± 1.308
ValIle: 0.951 ± 0.829
ValLys: 4.757 ± 2.209
ValLeu: 5.709 ± 1.886
ValMet: 0.951 ± 1.264
ValAsn: 0.951 ± 0.692
ValPro: 1.903 ± 1.308
ValGln: 0.0 ± 0.0
ValArg: 4.757 ± 2.115
ValSer: 8.563 ± 3.699
ValThr: 1.903 ± 0.713
ValVal: 6.66 ± 2.647
ValTrp: 1.903 ± 1.227
ValTyr: 1.903 ± 0.713
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.903 ± 1.418
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 2.854 ± 1.787
TrpPhe: 0.0 ± 0.0
TrpGly: 0.951 ± 0.709
TrpHis: 0.951 ± 0.692
TrpIle: 2.854 ± 0.729
TrpLys: 0.0 ± 0.0
TrpLeu: 2.854 ± 1.212
TrpMet: 1.903 ± 0.907
TrpAsn: 1.903 ± 0.907
TrpPro: 0.951 ± 1.264
TrpGln: 1.903 ± 1.383
TrpArg: 3.806 ± 0.928
TrpSer: 1.903 ± 0.907
TrpThr: 0.951 ± 1.012
TrpVal: 1.903 ± 1.308
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.951 ± 0.692
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 5.709 ± 2.484
TyrCys: 0.951 ± 0.709
TyrAsp: 1.903 ± 0.713
TyrGlu: 0.951 ± 0.709
TyrPhe: 0.0 ± 0.0
TyrGly: 1.903 ± 0.713
TyrHis: 0.0 ± 0.0
TyrIle: 0.951 ± 0.692
TyrLys: 1.903 ± 0.713
TyrLeu: 0.951 ± 0.829
TyrMet: 2.854 ± 0.943
TyrAsn: 0.951 ± 0.692
TyrPro: 2.854 ± 2.075
TyrGln: 0.0 ± 0.0
TyrArg: 3.806 ± 1.094
TyrSer: 0.951 ± 1.012
TyrThr: 1.903 ± 1.383
TyrVal: 2.854 ± 1.07
TyrTrp: 0.951 ± 0.692
TyrTyr: 1.903 ± 0.713
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1052 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
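A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration, not the database's own code: the function names, the fixed seed, the toy sequences, and the use of the sample standard deviation across replicates as the reported error are all assumptions.

```python
import random
import statistics
from collections import Counter

def dipeptide_permille(sequences):
    """Map each observed dipeptide to its frequency in per mille."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Spread of one dipeptide frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        resample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_permille(resample).get(pair, 0.0))
    return statistics.stdev(estimates)

# Hypothetical toy sequences, not the proteome described on this page.
toy_proteins = ["MARRA", "RRS", "MKK"]
error_rr = bootstrap_error(toy_proteins, "RR")
error_ww = bootstrap_error(toy_proteins, "WW")  # absent pair, so no spread
```

Resampling whole proteins rather than single residues keeps each sequence intact, so a dipeptide that is concentrated in one protein shows a large error, while a dipeptide that never occurs always gets 0.0 ± 0.0.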

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in UniProt.
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021. doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski