Amino acid dipeptide frequency for Puccinia striiformis

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequencies, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if we count Xaa as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each one would be present with a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
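
As a minimal sketch of how such frequencies could be computed (not necessarily the procedure used by Proteome-pI), the Python below counts overlapping adjacent residue pairs within each protein, maps every non-standard letter to X (Xaa), and labels a frequency relative to the uniform expectation of 2.5‰. The function names `dipeptide_permille` and `mark`, the counting scheme, and the toy sequences are assumptions made for illustration only.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # one-letter codes of the 20 standard amino acids

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all given sequences.

    Pairs overlap and never span two proteins; any letter outside the
    20 standard codes is mapped to 'X' (Xaa). This scheme is an assumption.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

UNIFORM_PERMILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%

def mark(freq_permille, expected=UNIFORM_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"   # would be shown in red
    if freq_permille < expected:
        return "under"  # would be shown in blue
    return "expected"

toy = ["MSSLA", "SSB"]  # hypothetical sequences; 'B' is non-standard
freqs = dipeptide_permille(toy)
```

With the values from the table, `mark(5.491)` (AlaAla) gives "over" and `mark(0.329)` (CysCys) gives "under", assuming the page indeed uses the uniform 0.25% as its expected value.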

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
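
For example, the AlaAla entry of 5.491‰ corresponds to a fraction of 0.005491. The short Python sketch below makes this conversion explicit and compares the entry with the uniform expectation of 2.5‰; the helper name `permille_to_fraction` is illustrative only.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

ala_ala = permille_to_fraction(5.491)  # AlaAla entry from the table
uniform = permille_to_fraction(2.5)    # uniform expectation, 1/400
ratio = ala_ala / uniform              # enrichment relative to the uniform expectation
```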

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰) ± error, grouped by the first residue; within each group the second residue follows the order Ala, Cys, Asp, Glu, Phe, Gly, His, Ile, Lys, Leu, Met, Asn, Pro, Gln, Arg, Ser, Thr, Val, Trp, Tyr, Xaa.
Ala
AlaAla: 5.491 ± 0.047
AlaCys: 1.022 ± 0.015
AlaAsp: 3.279 ± 0.026
AlaGlu: 3.836 ± 0.033
AlaPhe: 2.351 ± 0.023
AlaGly: 4.003 ± 0.034
AlaHis: 1.807 ± 0.018
AlaIle: 3.864 ± 0.028
AlaLys: 3.612 ± 0.027
AlaLeu: 6.118 ± 0.038
AlaMet: 1.38 ± 0.016
AlaAsn: 2.904 ± 0.024
AlaPro: 3.939 ± 0.033
AlaGln: 2.966 ± 0.025
AlaArg: 3.848 ± 0.03
AlaSer: 6.29 ± 0.038
AlaThr: 4.338 ± 0.03
AlaVal: 3.437 ± 0.028
AlaTrp: 0.801 ± 0.013
AlaTyr: 1.613 ± 0.018
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.809 ± 0.012
CysCys: 0.329 ± 0.009
CysAsp: 0.735 ± 0.011
CysGlu: 0.621 ± 0.01
CysPhe: 0.606 ± 0.01
CysGly: 0.928 ± 0.017
CysHis: 0.442 ± 0.009
CysIle: 0.682 ± 0.01
CysLys: 0.756 ± 0.014
CysLeu: 1.471 ± 0.017
CysMet: 0.263 ± 0.006
CysAsn: 0.522 ± 0.01
CysPro: 0.877 ± 0.014
CysGln: 0.676 ± 0.009
CysArg: 0.808 ± 0.014
CysSer: 1.282 ± 0.017
CysThr: 0.759 ± 0.013
CysVal: 0.696 ± 0.012
CysTrp: 0.207 ± 0.007
CysTyr: 0.377 ± 0.007
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.034 ± 0.026
AspCys: 0.791 ± 0.012
AspAsp: 3.997 ± 0.041
AspGlu: 3.917 ± 0.035
AspPhe: 2.085 ± 0.022
AspGly: 3.284 ± 0.027
AspHis: 1.937 ± 0.018
AspIle: 2.639 ± 0.022
AspLys: 2.478 ± 0.022
AspLeu: 5.449 ± 0.033
AspMet: 0.957 ± 0.012
AspAsn: 2.228 ± 0.02
AspPro: 3.68 ± 0.029
AspGln: 2.998 ± 0.023
AspArg: 2.99 ± 0.025
AspSer: 4.751 ± 0.035
AspThr: 2.558 ± 0.026
AspVal: 2.681 ± 0.027
AspTrp: 0.837 ± 0.011
AspTyr: 1.432 ± 0.017
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.124 ± 0.035
GluCys: 0.611 ± 0.011
GluAsp: 3.95 ± 0.035
GluGlu: 5.446 ± 0.05
GluPhe: 2.044 ± 0.017
GluGly: 3.0 ± 0.026
GluHis: 1.353 ± 0.015
GluIle: 3.436 ± 0.027
GluLys: 3.419 ± 0.03
GluLeu: 5.527 ± 0.041
GluMet: 1.2 ± 0.013
GluAsn: 2.551 ± 0.025
GluPro: 2.789 ± 0.028
GluGln: 2.395 ± 0.022
GluArg: 3.189 ± 0.027
GluSer: 4.628 ± 0.033
GluThr: 2.998 ± 0.027
GluVal: 3.032 ± 0.026
GluTrp: 0.725 ± 0.012
GluTyr: 1.391 ± 0.016
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.137 ± 0.022
PheCys: 0.636 ± 0.011
PheAsp: 2.208 ± 0.022
PheGlu: 2.16 ± 0.021
PhePhe: 1.604 ± 0.021
PheGly: 2.383 ± 0.028
PheHis: 1.07 ± 0.015
PheIle: 2.078 ± 0.022
PheLys: 2.102 ± 0.018
PheLeu: 3.56 ± 0.025
PheMet: 0.681 ± 0.01
PheAsn: 1.802 ± 0.019
PhePro: 1.976 ± 0.022
PheGln: 1.593 ± 0.018
PheArg: 1.863 ± 0.018
PheSer: 3.177 ± 0.024
PheThr: 2.062 ± 0.018
PheVal: 1.982 ± 0.018
PheTrp: 0.495 ± 0.01
PheTyr: 0.989 ± 0.012
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.46 ± 0.034
GlyCys: 0.867 ± 0.013
GlyAsp: 2.729 ± 0.019
GlyGlu: 2.769 ± 0.024
GlyPhe: 2.451 ± 0.024
GlyGly: 4.428 ± 0.071
GlyHis: 1.555 ± 0.019
GlyIle: 3.143 ± 0.03
GlyLys: 3.224 ± 0.025
GlyLeu: 5.355 ± 0.035
GlyMet: 1.147 ± 0.017
GlyAsn: 2.455 ± 0.027
GlyPro: 2.949 ± 0.027
GlyGln: 2.356 ± 0.029
GlyArg: 3.168 ± 0.025
GlySer: 5.38 ± 0.047
GlyThr: 3.408 ± 0.03
GlyVal: 2.912 ± 0.027
GlyTrp: 0.823 ± 0.014
GlyTyr: 1.599 ± 0.022
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.551 ± 0.018
HisCys: 0.438 ± 0.009
HisAsp: 1.362 ± 0.018
HisGlu: 1.392 ± 0.016
HisPhe: 1.117 ± 0.013
HisGly: 1.427 ± 0.018
HisHis: 1.782 ± 0.032
HisIle: 1.337 ± 0.017
HisLys: 1.265 ± 0.016
HisLeu: 3.037 ± 0.026
HisMet: 0.483 ± 0.009
HisAsn: 1.244 ± 0.015
HisPro: 2.335 ± 0.025
HisGln: 2.012 ± 0.022
HisArg: 1.76 ± 0.019
HisSer: 2.811 ± 0.023
HisThr: 1.543 ± 0.02
HisVal: 1.316 ± 0.017
HisTrp: 0.368 ± 0.008
HisTyr: 0.718 ± 0.011
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.393 ± 0.027
IleCys: 0.908 ± 0.014
IleAsp: 3.548 ± 0.026
IleGlu: 3.368 ± 0.029
IlePhe: 2.086 ± 0.021
IleGly: 3.107 ± 0.025
IleHis: 1.629 ± 0.018
IleIle: 3.084 ± 0.025
IleLys: 3.359 ± 0.031
IleLeu: 5.192 ± 0.038
IleMet: 1.003 ± 0.012
IleAsn: 2.842 ± 0.027
IlePro: 3.561 ± 0.028
IleGln: 2.55 ± 0.024
IleArg: 3.036 ± 0.027
IleSer: 4.981 ± 0.033
IleThr: 3.198 ± 0.024
IleVal: 2.871 ± 0.023
IleTrp: 0.691 ± 0.012
IleTyr: 1.403 ± 0.017
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.976 ± 0.029
LysCys: 0.653 ± 0.011
LysAsp: 2.793 ± 0.024
LysGlu: 3.421 ± 0.033
LysPhe: 1.987 ± 0.02
LysGly: 2.622 ± 0.024
LysHis: 1.344 ± 0.017
LysIle: 3.38 ± 0.029
LysLys: 4.533 ± 0.041
LysLeu: 5.549 ± 0.035
LysMet: 1.201 ± 0.014
LysAsn: 2.63 ± 0.023
LysPro: 3.392 ± 0.035
LysGln: 2.319 ± 0.023
LysArg: 3.475 ± 0.026
LysSer: 4.863 ± 0.032
LysThr: 3.557 ± 0.029
LysVal: 2.739 ± 0.022
LysTrp: 0.639 ± 0.01
LysTyr: 1.38 ± 0.017
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.912 ± 0.039
LeuCys: 1.222 ± 0.016
LeuAsp: 5.531 ± 0.033
LeuGlu: 5.602 ± 0.037
LeuPhe: 3.29 ± 0.027
LeuGly: 5.197 ± 0.037
LeuHis: 2.424 ± 0.02
LeuIle: 5.422 ± 0.04
LeuLys: 5.748 ± 0.035
LeuLeu: 8.704 ± 0.051
LeuMet: 1.828 ± 0.02
LeuAsn: 4.488 ± 0.028
LeuPro: 5.979 ± 0.036
LeuGln: 3.891 ± 0.03
LeuArg: 5.127 ± 0.033
LeuSer: 8.685 ± 0.042
LeuThr: 5.439 ± 0.032
LeuVal: 5.208 ± 0.035
LeuTrp: 1.048 ± 0.014
LeuTyr: 2.133 ± 0.021
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.68 ± 0.017
MetCys: 0.258 ± 0.006
MetAsp: 1.173 ± 0.015
MetGlu: 1.134 ± 0.013
MetPhe: 0.668 ± 0.011
MetGly: 1.116 ± 0.016
MetHis: 0.381 ± 0.009
MetIle: 1.303 ± 0.015
MetLys: 1.169 ± 0.015
MetLeu: 1.543 ± 0.016
MetMet: 0.564 ± 0.012
MetAsn: 1.031 ± 0.014
MetPro: 0.931 ± 0.014
MetGln: 0.643 ± 0.012
MetArg: 0.994 ± 0.013
MetSer: 1.768 ± 0.018
MetThr: 1.185 ± 0.015
MetVal: 1.172 ± 0.015
MetTrp: 0.224 ± 0.006
MetTyr: 0.439 ± 0.009
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.421 ± 0.021
AsnCys: 0.648 ± 0.01
AsnAsp: 2.268 ± 0.019
AsnGlu: 2.402 ± 0.021
AsnPhe: 1.738 ± 0.019
AsnGly: 2.693 ± 0.031
AsnHis: 1.806 ± 0.022
AsnIle: 2.156 ± 0.016
AsnLys: 2.374 ± 0.025
AsnLeu: 4.742 ± 0.03
AsnMet: 0.8 ± 0.011
AsnAsn: 2.801 ± 0.034
AsnPro: 3.53 ± 0.027
AsnGln: 2.861 ± 0.024
AsnArg: 2.639 ± 0.02
AsnSer: 4.524 ± 0.034
AsnThr: 2.621 ± 0.022
AsnVal: 2.094 ± 0.019
AsnTrp: 0.577 ± 0.01
AsnTyr: 1.19 ± 0.014
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.639 ± 0.039
ProCys: 0.662 ± 0.013
ProAsp: 3.125 ± 0.023
ProGlu: 3.372 ± 0.026
ProPhe: 2.043 ± 0.019
ProGly: 3.369 ± 0.031
ProHis: 1.698 ± 0.018
ProIle: 3.612 ± 0.028
ProLys: 3.168 ± 0.03
ProLeu: 5.116 ± 0.036
ProMet: 1.073 ± 0.014
ProAsn: 3.036 ± 0.028
ProPro: 5.599 ± 0.05
ProGln: 2.582 ± 0.024
ProArg: 3.071 ± 0.026
ProSer: 7.505 ± 0.053
ProThr: 4.989 ± 0.033
ProVal: 3.369 ± 0.032
ProTrp: 0.585 ± 0.01
ProTyr: 1.224 ± 0.016
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.527 ± 0.025
GlnCys: 0.502 ± 0.01
GlnAsp: 2.312 ± 0.02
GlnGlu: 2.727 ± 0.025
GlnPhe: 1.521 ± 0.017
GlnGly: 2.016 ± 0.026
GlnHis: 1.427 ± 0.018
GlnIle: 2.598 ± 0.023
GlnLys: 2.424 ± 0.024
GlnLeu: 4.486 ± 0.028
GlnMet: 0.903 ± 0.013
GlnAsn: 2.038 ± 0.023
GlnPro: 3.112 ± 0.031
GlnGln: 3.127 ± 0.055
GlnArg: 2.443 ± 0.019
GlnSer: 4.421 ± 0.029
GlnThr: 2.797 ± 0.025
GlnVal: 2.359 ± 0.022
GlnTrp: 0.497 ± 0.011
GlnTyr: 0.977 ± 0.014
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.774 ± 0.026
ArgCys: 0.771 ± 0.013
ArgAsp: 2.599 ± 0.027
ArgGlu: 2.851 ± 0.027
ArgPhe: 2.128 ± 0.02
ArgGly: 2.799 ± 0.025
ArgHis: 1.475 ± 0.018
ArgIle: 3.118 ± 0.023
ArgLys: 3.483 ± 0.032
ArgLeu: 5.618 ± 0.035
ArgMet: 1.198 ± 0.014
ArgAsn: 2.371 ± 0.019
ArgPro: 3.43 ± 0.029
ArgGln: 2.515 ± 0.027
ArgArg: 4.164 ± 0.039
ArgSer: 5.275 ± 0.036
ArgThr: 3.28 ± 0.025
ArgVal: 2.772 ± 0.024
ArgTrp: 0.753 ± 0.012
ArgTyr: 1.464 ± 0.016
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.107 ± 0.039
SerCys: 1.201 ± 0.018
SerAsp: 4.919 ± 0.036
SerGlu: 4.582 ± 0.031
SerPhe: 3.373 ± 0.026
SerGly: 5.113 ± 0.037
SerHis: 2.803 ± 0.025
SerIle: 5.388 ± 0.036
SerLys: 5.137 ± 0.036
SerLeu: 8.435 ± 0.046
SerMet: 1.751 ± 0.018
SerAsn: 4.825 ± 0.034
SerPro: 6.225 ± 0.049
SerGln: 4.211 ± 0.03
SerArg: 5.21 ± 0.033
SerSer: 12.763 ± 0.096
SerThr: 7.298 ± 0.05
SerVal: 4.365 ± 0.028
SerTrp: 0.968 ± 0.013
SerTyr: 2.06 ± 0.019
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.014 ± 0.031
ThrCys: 0.936 ± 0.013
ThrAsp: 2.998 ± 0.023
ThrGlu: 3.053 ± 0.026
ThrPhe: 2.054 ± 0.019
ThrGly: 3.626 ± 0.029
ThrHis: 1.726 ± 0.019
ThrIle: 3.548 ± 0.026
ThrLys: 3.308 ± 0.028
ThrLeu: 5.362 ± 0.029
ThrMet: 1.154 ± 0.014
ThrAsn: 3.032 ± 0.027
ThrPro: 4.464 ± 0.033
ThrGln: 2.655 ± 0.022
ThrArg: 3.42 ± 0.025
ThrSer: 6.487 ± 0.038
ThrThr: 5.261 ± 0.052
ThrVal: 2.996 ± 0.024
ThrTrp: 0.76 ± 0.01
ThrTyr: 1.408 ± 0.018
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.65 ± 0.031
ValCys: 0.788 ± 0.013
ValAsp: 3.155 ± 0.023
ValGlu: 3.238 ± 0.028
ValPhe: 1.927 ± 0.018
ValGly: 2.994 ± 0.029
ValHis: 1.389 ± 0.015
ValIle: 3.006 ± 0.024
ValLys: 2.855 ± 0.022
ValLeu: 4.725 ± 0.029
ValMet: 1.034 ± 0.015
ValAsn: 2.274 ± 0.02
ValPro: 3.055 ± 0.023
ValGln: 2.153 ± 0.019
ValArg: 2.532 ± 0.02
ValSer: 4.087 ± 0.029
ValThr: 2.961 ± 0.024
ValVal: 3.174 ± 0.027
ValTrp: 0.674 ± 0.011
ValTyr: 1.314 ± 0.016
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.804 ± 0.012
TrpCys: 0.196 ± 0.006
TrpAsp: 0.698 ± 0.011
TrpGlu: 0.675 ± 0.01
TrpPhe: 0.466 ± 0.008
TrpGly: 0.618 ± 0.01
TrpHis: 0.32 ± 0.007
TrpIle: 0.763 ± 0.013
TrpLys: 0.849 ± 0.011
TrpLeu: 1.238 ± 0.015
TrpMet: 0.286 ± 0.007
TrpAsn: 0.68 ± 0.012
TrpPro: 0.53 ± 0.009
TrpGln: 0.459 ± 0.01
TrpArg: 0.699 ± 0.011
TrpSer: 1.027 ± 0.015
TrpThr: 0.766 ± 0.011
TrpVal: 0.629 ± 0.011
TrpTrp: 0.197 ± 0.006
TrpTyr: 0.318 ± 0.007
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.397 ± 0.015
TyrCys: 0.406 ± 0.009
TyrAsp: 1.344 ± 0.016
TyrGlu: 1.244 ± 0.014
TyrPhe: 1.019 ± 0.014
TyrGly: 1.453 ± 0.018
TyrHis: 0.929 ± 0.012
TyrIle: 1.252 ± 0.017
TyrLys: 1.187 ± 0.015
TyrLeu: 2.653 ± 0.025
TyrMet: 0.469 ± 0.009
TyrAsn: 1.13 ± 0.014
TyrPro: 1.486 ± 0.018
TyrGln: 1.252 ± 0.013
TyrArg: 1.401 ± 0.016
TyrSer: 2.032 ± 0.019
TyrThr: 1.245 ± 0.015
TyrVal: 1.182 ± 0.015
TyrTrp: 0.349 ± 0.008
TyrTyr: 0.778 ± 0.013
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 14973 proteins (6057031 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
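
A protein-level bootstrap of this kind can be sketched in Python as follows. This is an illustration under stated assumptions rather than the Proteome-pI implementation: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the error is taken as the standard deviation across resamples (the choice of standard deviation, the function names `pair_counts` and `bootstrap_error`, the fixed seed, and the toy sequences are all assumptions).

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Standard deviation of the per mille frequency of `pair` over
    n_boot resamples of whole proteins drawn with replacement."""
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]  # count once, reuse
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        if total:  # skip degenerate resamples that contain no dipeptides
            estimates.append(1000.0 * hits / total)
    return statistics.stdev(estimates) if len(estimates) > 1 else 0.0

toy = ["MSSLA", "SSAAS", "ALLSS", "MKT"]  # hypothetical sequences
err = bootstrap_error(toy, "SS")
```

Resampling whole proteins rather than individual dipeptides keeps the pairs within a protein together, so the estimate reflects variation between proteins.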

You can download the above dipeptide statistics (among other statistics for this proteome) from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski