Amino acid dipeptide frequency for Ceratocystis polonica partitivirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
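A minimal sketch of how such a frequency could be computed is shown here. It assumes one-letter residue codes (the table uses three-letter codes), counts overlapping pairs within each protein only, and normalizes by the total number of pairs; the exact normalization used by the database is not stated on this page. The function name and the toy sequences are hypothetical.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Slide a window of length 2 along the sequence (overlapping pairs).
        # Pairs are never formed across the boundary between two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Scale the fraction to per mille, the unit used in the table.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not real viral sequences.
freqs = dipeptide_frequencies(["MKLLA", "AKLL"])
```

Dipeptides that never occur are simply absent from the returned dictionary; a full 20 x 20 (or 21 x 21) table would fill those cells with 0.0.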

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random with equal probability, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
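The comparison against the uniform expectation can be sketched as follows. This is an illustration only: the function names are hypothetical, and the page does not state whether the colouring uses the 20-letter baseline (1/400) or the 21-letter one (1/441), so the alphabet size is left as a parameter.

```python
def expected_uniform_permille(alphabet_size=20):
    """Per-mille frequency of each dipeptide if all were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_permille, alphabet_size=20):
    """Label a dipeptide as over- or underrepresented versus the uniform baseline."""
    expected = expected_uniform_permille(alphabet_size)
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "expected"


# Values taken from the table: SerSer = 15.873, AlaCys = 0.756.
labels = {"SerSer": classify(15.873), "AlaCys": classify(0.756)}
```

Under the 20-letter assumption the baseline is 2.5‰, so SerSer is labelled "over" and AlaCys "under".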

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
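As a worked example of this unit conversion, using the AlaAla value listed in the table (3.779‰):

```latex
3.779\,\text{\textperthousand} = 3.779 \times 10^{-3} = 0.003779 = 0.3779\,\%
```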

For more information, see the sequence space article on Wikipedia.

Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Entries are grouped by the first residue of the dipeptide and given as frequency ± error.
Ala
AlaAla: 3.779 ± 1.607
AlaCys: 0.756 ± 0.538
AlaAsp: 3.023 ± 1.078
AlaGlu: 0.756 ± 0.538
AlaPhe: 0.756 ± 0.538
AlaGly: 3.023 ± 1.078
AlaHis: 4.535 ± 1.079
AlaIle: 2.268 ± 0.535
AlaLys: 1.512 ± 1.073
AlaLeu: 3.779 ± 0.541
AlaMet: 0.0 ± 0.0
AlaAsn: 4.535 ± 1.079
AlaPro: 6.803 ± 0.544
AlaGln: 2.268 ± 1.609
AlaArg: 6.047 ± 0.006
AlaSer: 4.535 ± 0.005
AlaThr: 6.047 ± 2.155
AlaVal: 3.023 ± 1.078
AlaTrp: 0.756 ± 0.536
AlaTyr: 3.023 ± 1.071
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.756 ± 0.536
CysCys: 0.0 ± 0.0
CysAsp: 0.756 ± 0.536
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 1.512 ± 1.076
CysLeu: 1.512 ± 1.076
CysMet: 1.512 ± 1.073
CysAsn: 0.0 ± 0.0
CysPro: 0.756 ± 0.538
CysGln: 0.756 ± 0.538
CysArg: 2.268 ± 0.535
CysSer: 1.512 ± 0.002
CysThr: 0.756 ± 0.536
CysVal: 0.756 ± 0.538
CysTrp: 0.0 ± 0.0
CysTyr: 0.756 ± 0.538
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.047 ± 3.229
AspCys: 0.0 ± 0.0
AspAsp: 3.023 ± 1.071
AspGlu: 2.268 ± 0.54
AspPhe: 5.291 ± 1.606
AspGly: 2.268 ± 0.54
AspHis: 3.023 ± 1.071
AspIle: 4.535 ± 0.005
AspLys: 1.512 ± 1.073
AspLeu: 6.047 ± 0.006
AspMet: 1.512 ± 1.073
AspAsn: 1.512 ± 1.073
AspPro: 1.512 ± 1.076
AspGln: 1.512 ± 1.073
AspArg: 0.756 ± 0.536
AspSer: 6.803 ± 2.678
AspThr: 0.756 ± 0.536
AspVal: 4.535 ± 1.079
AspTrp: 1.512 ± 1.073
AspTyr: 2.268 ± 0.535
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.291 ± 0.543
GluCys: 0.756 ± 0.536
GluAsp: 2.268 ± 0.54
GluGlu: 3.023 ± 1.071
GluPhe: 1.512 ± 1.073
GluGly: 0.756 ± 0.538
GluHis: 0.0 ± 0.0
GluIle: 3.023 ± 0.003
GluLys: 0.756 ± 0.538
GluLeu: 1.512 ± 1.073
GluMet: 0.756 ± 0.536
GluAsn: 1.512 ± 0.002
GluPro: 3.023 ± 1.078
GluGln: 0.0 ± 0.0
GluArg: 2.268 ± 0.54
GluSer: 5.291 ± 0.543
GluThr: 2.268 ± 0.535
GluVal: 1.512 ± 0.002
GluTrp: 0.756 ± 0.536
GluTyr: 1.512 ± 1.076
GluXaa: 0.0 ± 0.0
Phe
PheAla: 7.559 ± 2.14
PheCys: 3.023 ± 0.003
PheAsp: 4.535 ± 1.079
PheGlu: 3.023 ± 1.071
PhePhe: 3.023 ± 1.071
PheGly: 1.512 ± 0.002
PheHis: 3.023 ± 2.145
PheIle: 3.023 ± 1.071
PheLys: 3.023 ± 2.145
PheLeu: 6.047 ± 1.081
PheMet: 1.512 ± 1.073
PheAsn: 2.268 ± 0.54
PhePro: 3.023 ± 0.003
PheGln: 1.512 ± 1.076
PheArg: 0.0 ± 0.0
PheSer: 5.291 ± 0.543
PheThr: 3.779 ± 0.541
PheVal: 0.756 ± 0.536
PheTrp: 0.0 ± 0.0
PheTyr: 3.023 ± 2.145
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 0.756 ± 0.538
GlyCys: 0.0 ± 0.0
GlyAsp: 1.512 ± 1.073
GlyGlu: 1.512 ± 1.076
GlyPhe: 3.023 ± 1.071
GlyGly: 1.512 ± 0.002
GlyHis: 0.0 ± 0.0
GlyIle: 3.023 ± 0.003
GlyLys: 0.756 ± 0.538
GlyLeu: 5.291 ± 1.617
GlyMet: 2.268 ± 1.609
GlyAsn: 2.268 ± 1.614
GlyPro: 2.268 ± 0.535
GlyGln: 1.512 ± 0.002
GlyArg: 0.756 ± 0.538
GlySer: 3.779 ± 1.615
GlyThr: 0.756 ± 0.538
GlyVal: 3.779 ± 1.607
GlyTrp: 0.0 ± 0.0
GlyTyr: 3.023 ± 1.071
GlyXaa: 0.0 ± 0.0
His
HisAla: 3.779 ± 0.541
HisCys: 0.0 ± 0.0
HisAsp: 3.023 ± 1.071
HisGlu: 1.512 ± 0.002
HisPhe: 3.023 ± 1.071
HisGly: 1.512 ± 1.073
HisHis: 0.756 ± 0.536
HisIle: 2.268 ± 0.535
HisLys: 0.756 ± 0.538
HisLeu: 1.512 ± 0.002
HisMet: 0.756 ± 0.538
HisAsn: 3.779 ± 1.607
HisPro: 3.023 ± 0.003
HisGln: 0.756 ± 0.536
HisArg: 1.512 ± 0.002
HisSer: 5.291 ± 1.606
HisThr: 0.0 ± 0.0
HisVal: 2.268 ± 0.535
HisTrp: 0.0 ± 0.0
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.512 ± 0.002
IleCys: 0.0 ± 0.0
IleAsp: 6.803 ± 0.53
IleGlu: 4.535 ± 2.144
IlePhe: 6.047 ± 1.081
IleGly: 3.779 ± 0.533
IleHis: 0.756 ± 0.536
IleIle: 1.512 ± 0.002
IleLys: 3.023 ± 2.145
IleLeu: 6.803 ± 1.604
IleMet: 1.512 ± 0.002
IleAsn: 3.023 ± 2.145
IlePro: 4.535 ± 1.069
IleGln: 2.268 ± 1.614
IleArg: 1.512 ± 1.073
IleSer: 4.535 ± 2.153
IleThr: 2.268 ± 0.54
IleVal: 1.512 ± 0.002
IleTrp: 0.756 ± 0.538
IleTyr: 1.512 ± 0.002
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.512 ± 0.002
LysCys: 0.0 ± 0.0
LysAsp: 1.512 ± 1.073
LysGlu: 2.268 ± 0.535
LysPhe: 3.023 ± 1.071
LysGly: 0.756 ± 0.538
LysHis: 0.756 ± 0.536
LysIle: 3.779 ± 2.682
LysLys: 4.535 ± 2.144
LysLeu: 7.559 ± 1.066
LysMet: 0.0 ± 0.0
LysAsn: 0.0 ± 0.0
LysPro: 3.779 ± 0.533
LysGln: 1.512 ± 1.073
LysArg: 2.268 ± 0.535
LysSer: 3.023 ± 1.071
LysThr: 2.268 ± 0.54
LysVal: 4.535 ± 2.144
LysTrp: 0.0 ± 0.0
LysTyr: 1.512 ± 0.002
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.291 ± 0.543
LeuCys: 1.512 ± 0.002
LeuAsp: 3.023 ± 1.078
LeuGlu: 2.268 ± 0.535
LeuPhe: 4.535 ± 0.005
LeuGly: 3.779 ± 0.533
LeuHis: 3.023 ± 0.003
LeuIle: 9.07 ± 0.01
LeuLys: 4.535 ± 0.005
LeuLeu: 8.314 ± 0.546
LeuMet: 2.268 ± 0.54
LeuAsn: 3.023 ± 0.003
LeuPro: 9.826 ± 0.548
LeuGln: 4.535 ± 2.153
LeuArg: 3.779 ± 0.533
LeuSer: 7.559 ± 0.008
LeuThr: 5.291 ± 2.691
LeuVal: 7.559 ± 1.066
LeuTrp: 2.268 ± 0.535
LeuTyr: 2.268 ± 0.54
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.0 ± 0.0
MetCys: 0.756 ± 0.536
MetAsp: 0.0 ± 0.0
MetGlu: 0.756 ± 0.536
MetPhe: 2.268 ± 0.535
MetGly: 1.512 ± 1.073
MetHis: 0.0 ± 0.0
MetIle: 2.268 ± 0.535
MetLys: 0.756 ± 0.536
MetLeu: 1.512 ± 0.002
MetMet: 1.512 ± 1.073
MetAsn: 0.0 ± 0.0
MetPro: 1.512 ± 1.073
MetGln: 0.0 ± 0.0
MetArg: 2.268 ± 0.535
MetSer: 2.268 ± 1.614
MetThr: 0.0 ± 0.0
MetVal: 1.512 ± 0.002
MetTrp: 0.0 ± 0.0
MetTyr: 3.023 ± 1.078
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.291 ± 0.531
AsnCys: 1.512 ± 1.076
AsnAsp: 3.023 ± 2.145
AsnGlu: 1.512 ± 0.002
AsnPhe: 2.268 ± 0.54
AsnGly: 0.756 ± 0.538
AsnHis: 2.268 ± 1.609
AsnIle: 3.023 ± 0.003
AsnLys: 1.512 ± 1.073
AsnLeu: 5.291 ± 1.606
AsnMet: 0.756 ± 0.536
AsnAsn: 2.268 ± 0.535
AsnPro: 3.023 ± 0.003
AsnGln: 1.512 ± 0.002
AsnArg: 2.268 ± 0.54
AsnSer: 2.268 ± 1.614
AsnThr: 2.268 ± 0.535
AsnVal: 3.779 ± 0.533
AsnTrp: 0.0 ± 0.0
AsnTyr: 3.023 ± 0.003
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.756 ± 0.538
ProCys: 0.756 ± 0.538
ProAsp: 5.291 ± 0.543
ProGlu: 1.512 ± 1.073
ProPhe: 4.535 ± 0.005
ProGly: 3.023 ± 1.078
ProHis: 2.268 ± 0.54
ProIle: 3.779 ± 0.541
ProLys: 3.023 ± 1.071
ProLeu: 4.535 ± 0.005
ProMet: 1.512 ± 1.368
ProAsn: 1.512 ± 0.002
ProPro: 7.559 ± 0.008
ProGln: 3.023 ± 0.003
ProArg: 1.512 ± 1.076
ProSer: 14.361 ± 4.85
ProThr: 6.803 ± 1.604
ProVal: 2.268 ± 1.609
ProTrp: 0.756 ± 0.536
ProTyr: 5.291 ± 1.617
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.512 ± 0.002
GlnCys: 0.756 ± 0.536
GlnAsp: 0.756 ± 0.536
GlnGlu: 0.756 ± 0.538
GlnPhe: 0.756 ± 0.538
GlnGly: 0.756 ± 0.538
GlnHis: 0.756 ± 0.536
GlnIle: 2.268 ± 0.54
GlnLys: 0.756 ± 0.538
GlnLeu: 4.535 ± 1.069
GlnMet: 0.0 ± 0.0
GlnAsn: 1.512 ± 1.076
GlnPro: 3.779 ± 0.533
GlnGln: 2.268 ± 0.54
GlnArg: 0.756 ± 0.536
GlnSer: 2.268 ± 0.54
GlnThr: 4.535 ± 0.005
GlnVal: 2.268 ± 0.54
GlnTrp: 0.0 ± 0.0
GlnTyr: 2.268 ± 0.54
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.512 ± 1.073
ArgCys: 0.0 ± 0.0
ArgAsp: 3.023 ± 0.003
ArgGlu: 0.0 ± 0.0
ArgPhe: 2.268 ± 0.535
ArgGly: 1.512 ± 1.073
ArgHis: 2.268 ± 0.535
ArgIle: 2.268 ± 0.535
ArgLys: 6.047 ± 3.216
ArgLeu: 1.512 ± 1.076
ArgMet: 0.756 ± 0.538
ArgAsn: 3.779 ± 1.615
ArgPro: 4.535 ± 1.079
ArgGln: 1.512 ± 0.002
ArgArg: 0.756 ± 0.536
ArgSer: 7.559 ± 2.157
ArgThr: 3.779 ± 1.615
ArgVal: 2.268 ± 0.54
ArgTrp: 0.756 ± 0.536
ArgTyr: 2.268 ± 0.535
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.803 ± 1.619
SerCys: 0.756 ± 0.538
SerAsp: 3.779 ± 2.682
SerGlu: 2.268 ± 1.614
SerPhe: 6.047 ± 1.068
SerGly: 5.291 ± 1.617
SerHis: 5.291 ± 1.617
SerIle: 7.559 ± 0.008
SerLys: 5.291 ± 1.606
SerLeu: 8.314 ± 1.62
SerMet: 1.512 ± 0.85
SerAsn: 6.047 ± 2.142
SerPro: 3.779 ± 2.69
SerGln: 1.512 ± 0.002
SerArg: 6.047 ± 3.229
SerSer: 15.873 ± 10.223
SerThr: 6.803 ± 1.619
SerVal: 3.779 ± 1.615
SerTrp: 1.512 ± 0.002
SerTyr: 6.047 ± 2.155
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.291 ± 2.691
ThrCys: 0.756 ± 0.538
ThrAsp: 2.268 ± 0.54
ThrGlu: 5.291 ± 1.617
ThrPhe: 3.023 ± 0.003
ThrGly: 1.512 ± 1.073
ThrHis: 1.512 ± 1.076
ThrIle: 2.268 ± 0.54
ThrLys: 1.512 ± 0.002
ThrLeu: 6.047 ± 0.006
ThrMet: 1.512 ± 1.076
ThrAsn: 2.268 ± 0.54
ThrPro: 2.268 ± 0.54
ThrGln: 1.512 ± 0.002
ThrArg: 5.291 ± 0.543
ThrSer: 3.023 ± 0.003
ThrThr: 4.535 ± 2.153
ThrVal: 4.535 ± 0.005
ThrTrp: 1.512 ± 0.002
ThrTyr: 2.268 ± 0.54
ThrXaa: 0.0 ± 0.0
Val
ValAla: 0.0 ± 0.0
ValCys: 0.0 ± 0.0
ValAsp: 4.535 ± 1.069
ValGlu: 3.023 ± 1.078
ValPhe: 3.779 ± 0.533
ValGly: 1.512 ± 1.076
ValHis: 4.535 ± 3.218
ValIle: 0.756 ± 0.536
ValLys: 0.756 ± 0.536
ValLeu: 5.291 ± 1.617
ValMet: 0.756 ± 0.538
ValAsn: 5.291 ± 2.68
ValPro: 6.047 ± 0.006
ValGln: 1.512 ± 0.002
ValArg: 5.291 ± 1.606
ValSer: 3.779 ± 1.615
ValThr: 3.023 ± 2.152
ValVal: 4.535 ± 0.005
ValTrp: 0.0 ± 0.0
ValTyr: 2.268 ± 0.54
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 1.512 ± 1.073
TrpAsp: 0.756 ± 0.536
TrpGlu: 0.0 ± 0.0
TrpPhe: 2.268 ± 1.609
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.756 ± 0.536
TrpLys: 0.756 ± 0.536
TrpLeu: 2.268 ± 1.614
TrpMet: 0.0 ± 0.0
TrpAsn: 0.756 ± 0.536
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 0.0 ± 0.0
TrpSer: 1.512 ± 0.002
TrpThr: 0.0 ± 0.0
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.779 ± 0.533
TyrCys: 0.756 ± 0.538
TyrAsp: 3.779 ± 0.541
TyrGlu: 1.512 ± 0.002
TyrPhe: 1.512 ± 0.002
TyrGly: 3.023 ± 1.071
TyrHis: 0.756 ± 0.536
TyrIle: 0.756 ± 0.536
TyrLys: 2.268 ± 0.54
TyrLeu: 6.047 ± 1.081
TyrMet: 0.756 ± 0.538
TyrAsn: 2.268 ± 0.535
TyrPro: 3.779 ± 0.533
TyrGln: 3.779 ± 0.541
TyrArg: 3.023 ± 0.003
TyrSer: 3.779 ± 1.615
TyrThr: 2.268 ± 0.54
TyrVal: 1.512 ± 1.076
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.512 ± 1.073
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1324 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
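One way such an error estimate could be obtained is sketched here. It is an assumption-laden illustration rather than the database's actual procedure: whole proteins are resampled with replacement 100 times, the frequency of one dipeptide is recomputed for each replicate, and the standard deviation of those estimates is reported. The choice of standard deviation as the error measure, the one-letter codes, the seed, and all function names are assumptions.

```python
import random
import statistics


def pair_frequency(proteins, pair):
    """Per-mille frequency of one dipeptide over overlapping pairs within each protein."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Spread of the frequency across protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    # Population standard deviation of the replicate estimates.
    return statistics.pstdev(estimates)
```

With only two proteins in the proteome, each replicate can contain just one of three compositions (the first protein twice, the second twice, or one of each), so the estimated error mostly reflects how differently the two proteins use a given dipeptide.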

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski