Amino acid dipeptide frequency for Heterobasidion partitivirus 13

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
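As an illustration, dipeptide frequencies could be computed along the following lines. This is a minimal sketch, not the database's own code: the function name `dipeptide_per_mille`, the use of one-letter codes, the counting of overlapping adjacent pairs within each protein, and the mapping of every non-standard letter to X (Xaa) are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# One-letter codes in the same order as the table columns; X stands for Xaa.
ALPHABET = "ACDEFGHIKLMNPQRSTVWYX"

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all given sequences."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard letters becomes X.
        seq = "".join(c if c in ALPHABET else "X" for c in seq.upper())
        # Overlapping adjacent pairs; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }

# Made-up toy sequences, not the viral proteins.
freqs = dipeptide_per_mille(["MAAL", "AALB"])
print(len(freqs), round(freqs["AA"], 3))
```

The result always has 441 entries (21 × 21), so dipeptides that never occur show up with a frequency of 0.0, as in the table.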

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
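The arithmetic behind the expected value can be sketched as follows. The `classify` helper and the use of the uniform expectation as a hard threshold are assumptions for illustration; the page does not state the exact rule behind the red and blue marking.

```python
# Expected per-mille frequency if every dipeptide were equally likely.
EXPECTED_20 = 1000.0 / (20 * 20)   # 2.5 per mille, i.e. 0.25%
EXPECTED_21 = 1000.0 / (21 * 21)   # about 2.268 per mille when Xaa is included

def classify(freq_per_mille, expected=EXPECTED_20):
    """Label a value relative to the uniform expectation (hypothetical rule)."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"

# A few per-mille values copied from the table.
examples = {"AlaAla": 13.774, "LeuLeu": 11.019, "CysCys": 0.918, "LysLys": 0.0}
labels = {name: classify(value) for name, value in examples.items()}
print(labels)
```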

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

| First / Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 13.774 ± 8.067 | 0.918 ± 0.717 | 8.264 ± 1.073 | 5.51 ± 0.267 | 0.918 ± 0.717 | 0.918 ± 0.628 | 0.918 ± 0.717 | 1.837 ± 1.434 | 1.837 ± 0.089 | 7.346 ± 1.701 | 4.591 ± 0.623 | 6.428 ± 5.021 | 4.591 ± 0.895 | 4.591 ± 0.45 | 5.51 ± 1.612 | 3.673 ± 1.523 | 7.346 ± 0.356 | 7.346 ± 1.701 | 2.755 ± 0.806 | 3.673 ± 1.168 | 0.0 ± 0.0 |
| Cys | 1.837 ± 1.434 | 0.918 ± 0.628 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.918 ± 0.628 | 0.918 ± 0.717 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.918 ± 0.717 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.918 ± 0.717 | 0.0 ± 0.0 | 0.918 ± 0.717 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.918 ± 0.717 | 0.0 ± 0.0 |
| Asp | 5.51 ± 1.079 | 0.0 ± 0.0 | 0.918 ± 0.628 | 0.918 ± 0.628 | 3.673 ± 0.178 | 2.755 ± 0.539 | 2.755 ± 1.885 | 2.755 ± 1.885 | 2.755 ± 0.539 | 5.51 ± 0.267 | 0.0 ± 0.0 | 1.837 ± 1.257 | 3.673 ± 0.178 | 1.837 ± 1.257 | 0.918 ± 0.717 | 7.346 ± 0.99 | 2.755 ± 0.806 | 1.837 ± 1.257 | 2.755 ± 0.539 | 4.591 ± 2.241 | 0.0 ± 0.0 |
| Glu | 4.591 ± 0.45 | 0.0 ± 0.0 | 2.755 ± 0.539 | 1.837 ± 0.089 | 3.673 ± 1.523 | 1.837 ± 0.089 | 2.755 ± 0.806 | 0.918 ± 0.717 | 1.837 ± 0.089 | 0.0 ± 0.0 | 0.918 ± 0.628 | 0.0 ± 0.0 | 0.918 ± 0.628 | 1.837 ± 1.434 | 1.837 ± 0.089 | 0.918 ± 0.717 | 8.264 ± 1.618 | 0.918 ± 0.628 | 1.837 ± 0.089 | 1.837 ± 1.257 | 0.0 ± 0.0 |
| Phe | 3.673 ± 0.178 | 0.918 ± 0.717 | 4.591 ± 1.796 | 4.591 ± 0.45 | 1.837 ± 1.257 | 3.673 ± 0.178 | 1.837 ± 1.257 | 2.755 ± 0.806 | 2.755 ± 1.885 | 6.428 ± 1.707 | 0.918 ± 0.995 | 2.755 ± 0.539 | 4.591 ± 0.895 | 2.755 ± 0.806 | 3.673 ± 1.168 | 0.918 ± 0.628 | 1.837 ± 1.434 | 1.837 ± 0.089 | 2.755 ± 0.806 | 2.755 ± 1.885 | 0.0 ± 0.0 |
| Gly | 1.837 ± 0.089 | 0.0 ± 0.0 | 1.837 ± 1.257 | 0.0 ± 0.0 | 3.673 ± 1.168 | 1.837 ± 0.089 | 0.0 ± 0.0 | 2.755 ± 0.539 | 0.0 ± 0.0 | 3.673 ± 0.178 | 1.837 ± 0.089 | 1.837 ± 0.089 | 3.673 ± 2.869 | 0.918 ± 0.628 | 5.51 ± 2.958 | 2.755 ± 0.806 | 0.918 ± 0.717 | 2.755 ± 0.539 | 0.0 ± 0.0 | 2.755 ± 1.885 | 0.0 ± 0.0 |
| His | 3.673 ± 0.178 | 0.918 ± 0.717 | 0.0 ± 0.0 | 1.837 ± 1.434 | 5.51 ± 1.079 | 0.0 ± 0.0 | 0.918 ± 0.628 | 1.837 ± 0.089 | 0.918 ± 0.628 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.918 ± 0.717 | 3.673 ± 2.513 | 0.0 ± 0.0 | 2.755 ± 0.539 | 0.918 ± 0.628 | 0.918 ± 0.628 | 4.591 ± 1.796 | 0.0 ± 0.0 | 1.837 ± 0.089 | 0.0 ± 0.0 |
| Ile | 1.837 ± 0.089 | 0.918 ± 0.717 | 1.837 ± 0.089 | 0.918 ± 0.717 | 0.918 ± 0.628 | 1.837 ± 1.434 | 2.755 ± 0.806 | 2.755 ± 0.539 | 2.755 ± 1.885 | 4.591 ± 1.796 | 0.918 ± 0.717 | 0.918 ± 0.628 | 3.673 ± 0.178 | 2.755 ± 0.806 | 4.591 ± 3.141 | 8.264 ± 1.073 | 0.918 ± 0.717 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.837 ± 0.089 | 0.0 ± 0.0 |
| Lys | 4.591 ± 0.45 | 0.0 ± 0.0 | 1.837 ± 1.257 | 0.918 ± 0.717 | 4.591 ± 0.895 | 0.0 ± 0.0 | 0.918 ± 0.628 | 0.918 ± 0.628 | 0.0 ± 0.0 | 2.755 ± 0.539 | 0.918 ± 0.628 | 1.837 ± 1.257 | 2.755 ± 0.539 | 0.0 ± 0.0 | 3.673 ± 2.513 | 2.755 ± 0.539 | 1.837 ± 1.257 | 1.837 ± 1.434 | 0.0 ± 0.0 | 1.837 ± 1.257 | 0.0 ± 0.0 |
| Leu | 7.346 ± 1.701 | 0.0 ± 0.0 | 9.183 ± 1.79 | 3.673 ± 1.168 | 7.346 ± 2.335 | 4.591 ± 0.45 | 1.837 ± 1.257 | 3.673 ± 0.178 | 2.755 ± 1.885 | 11.019 ± 3.503 | 0.918 ± 0.628 | 3.673 ± 0.178 | 6.428 ± 0.361 | 0.918 ± 0.628 | 5.51 ± 1.079 | 9.183 ± 1.79 | 7.346 ± 0.99 | 3.673 ± 1.523 | 2.755 ± 1.885 | 0.918 ± 0.628 | 0.0 ± 0.0 |
| Met | 0.918 ± 0.717 | 0.918 ± 0.717 | 2.755 ± 0.539 | 0.0 ± 0.0 | 0.918 ± 0.628 | 0.918 ± 0.628 | 0.918 ± 0.628 | 0.918 ± 0.628 | 0.0 ± 0.0 | 5.51 ± 2.424 | 0.918 ± 0.628 | 0.918 ± 0.717 | 2.755 ± 0.539 | 0.0 ± 0.0 | 1.837 ± 1.434 | 3.673 ± 0.178 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.918 ± 0.628 | 0.0 ± 0.0 |
| Asn | 3.673 ± 2.869 | 0.0 ± 0.0 | 0.918 ± 0.628 | 3.673 ± 1.168 | 2.755 ± 2.152 | 3.673 ± 0.178 | 0.918 ± 0.628 | 0.0 ± 0.0 | 0.0 ± 0.0 | 3.673 ± 0.178 | 1.837 ± 1.257 | 2.755 ± 0.539 | 2.755 ± 2.152 | 0.918 ± 0.628 | 0.918 ± 0.628 | 4.591 ± 0.895 | 3.673 ± 1.523 | 5.51 ± 0.267 | 2.755 ± 1.885 | 1.837 ± 0.089 | 0.0 ± 0.0 |
| Pro | 7.346 ± 0.356 | 0.918 ± 0.717 | 2.755 ± 0.539 | 2.755 ± 2.152 | 1.837 ± 0.089 | 1.837 ± 1.434 | 1.837 ± 1.257 | 4.591 ± 0.45 | 1.837 ± 0.089 | 4.591 ± 0.45 | 1.837 ± 1.257 | 1.837 ± 0.089 | 2.755 ± 0.806 | 2.755 ± 0.539 | 2.755 ± 0.539 | 3.673 ± 0.178 | 8.264 ± 0.272 | 6.428 ± 2.33 | 0.0 ± 0.0 | 3.673 ± 0.178 | 0.0 ± 0.0 |
| Gln | 4.591 ± 2.241 | 0.0 ± 0.0 | 1.837 ± 1.257 | 0.918 ± 0.717 | 4.591 ± 1.796 | 0.918 ± 0.628 | 0.918 ± 0.628 | 2.755 ± 0.806 | 0.0 ± 0.0 | 8.264 ± 0.272 | 0.0 ± 0.0 | 2.755 ± 0.539 | 1.837 ± 0.089 | 1.837 ± 1.434 | 2.755 ± 1.885 | 0.0 ± 0.0 | 3.673 ± 1.523 | 2.755 ± 0.539 | 0.0 ± 0.0 | 1.837 ± 0.089 | 0.0 ± 0.0 |
| Arg | 5.51 ± 1.612 | 0.918 ± 0.628 | 5.51 ± 1.079 | 0.918 ± 0.628 | 2.755 ± 0.539 | 2.755 ± 1.885 | 2.755 ± 0.539 | 4.591 ± 0.895 | 1.837 ± 1.257 | 5.51 ± 1.079 | 0.918 ± 0.628 | 2.755 ± 2.152 | 2.755 ± 0.539 | 1.837 ± 0.089 | 0.918 ± 0.717 | 3.673 ± 1.168 | 6.428 ± 0.984 | 0.918 ± 0.628 | 1.837 ± 1.434 | 3.673 ± 1.168 | 0.0 ± 0.0 |
| Ser | 3.673 ± 2.869 | 0.918 ± 0.717 | 3.673 ± 0.178 | 4.591 ± 0.45 | 1.837 ± 1.257 | 2.755 ± 0.539 | 1.837 ± 1.257 | 0.918 ± 0.628 | 2.755 ± 0.806 | 5.51 ± 0.267 | 2.755 ± 0.806 | 4.591 ± 1.796 | 2.755 ± 0.539 | 7.346 ± 0.99 | 3.673 ± 0.178 | 2.755 ± 0.806 | 10.101 ± 5.199 | 3.673 ± 1.523 | 0.0 ± 0.0 | 0.918 ± 0.717 | 0.0 ± 0.0 |
| Thr | 8.264 ± 3.764 | 0.0 ± 0.0 | 4.591 ± 1.796 | 3.673 ± 1.168 | 5.51 ± 1.079 | 1.837 ± 1.434 | 3.673 ± 2.869 | 5.51 ± 1.079 | 5.51 ± 1.079 | 9.183 ± 1.79 | 0.918 ± 0.628 | 3.673 ± 0.178 | 3.673 ± 1.523 | 3.673 ± 0.178 | 3.673 ± 0.178 | 4.591 ± 2.241 | 5.51 ± 0.267 | 2.755 ± 0.806 | 0.918 ± 0.717 | 3.673 ± 0.178 | 0.0 ± 0.0 |
| Val | 4.591 ± 0.45 | 0.0 ± 0.0 | 1.837 ± 0.089 | 1.837 ± 0.089 | 2.755 ± 1.885 | 1.837 ± 1.434 | 0.0 ± 0.0 | 3.673 ± 0.178 | 2.755 ± 0.539 | 1.837 ± 1.257 | 2.755 ± 2.152 | 2.755 ± 0.806 | 4.591 ± 0.895 | 5.51 ± 0.267 | 2.755 ± 0.806 | 4.591 ± 0.895 | 6.428 ± 0.361 | 1.837 ± 1.434 | 0.0 ± 0.0 | 1.837 ± 0.089 | 0.0 ± 0.0 |
| Trp | 1.837 ± 1.434 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.837 ± 1.434 | 0.918 ± 0.628 | 0.918 ± 0.628 | 0.0 ± 0.0 | 0.918 ± 0.717 | 0.918 ± 0.628 | 0.0 ± 0.0 | 2.755 ± 0.539 | 1.837 ± 1.257 | 0.918 ± 0.717 | 1.837 ± 1.257 | 0.918 ± 0.628 | 1.837 ± 0.089 | 0.918 ± 0.717 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Tyr | 3.673 ± 0.178 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.918 ± 0.628 | 1.837 ± 1.257 | 2.755 ± 2.152 | 1.837 ± 0.089 | 1.837 ± 0.089 | 2.755 ± 0.539 | 5.51 ± 1.079 | 0.918 ± 0.628 | 1.837 ± 0.089 | 3.673 ± 1.168 | 1.837 ± 1.257 | 2.755 ± 0.539 | 1.837 ± 0.089 | 2.755 ± 0.806 | 4.591 ± 1.796 | 0.0 ± 0.0 | 6.428 ± 3.052 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 2 proteins (1090 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
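A protein-level bootstrap of this kind might look like the sketch below. It is only an illustration under stated assumptions: the helper names `pair_frequency` and `bootstrap_error` are hypothetical, the error is taken to be the standard deviation across resamples, and whole proteins are drawn with replacement. The page itself only states that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics

def pair_frequency(proteins, pair):
    """Per-mille frequency of one dipeptide over a list of sequences."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Draw whole proteins with replacement, same count as the original set.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.pstdev(estimates)

toy = ["MAALAAK", "MKTLLV"]   # made-up sequences, not the viral proteins
err = bootstrap_error(toy, "AA")
print(round(pair_frequency(toy, "AA"), 3), round(err, 3))
```

With only two proteins, each resample is one of very few possible combinations, which would be consistent with the same few error values recurring throughout the table.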

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski