Amino acid dipepetide frequency for Populus trichocarpa (Western balsam poplar) (Populus balsamifera subsp. trichocarpa)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.944AlaAla: 5.944 ± 0.023
1.257AlaCys: 1.257 ± 0.009
2.993AlaAsp: 2.993 ± 0.016
4.018AlaGlu: 4.018 ± 0.017
2.712AlaPhe: 2.712 ± 0.011
4.083AlaGly: 4.083 ± 0.016
1.262AlaHis: 1.262 ± 0.007
3.706AlaIle: 3.706 ± 0.015
3.771AlaLys: 3.771 ± 0.017
6.379AlaLeu: 6.379 ± 0.022
1.725AlaMet: 1.725 ± 0.009
2.57AlaAsn: 2.57 ± 0.011
2.723AlaPro: 2.723 ± 0.017
2.107AlaGln: 2.107 ± 0.01
3.326AlaArg: 3.326 ± 0.013
6.02AlaSer: 6.02 ± 0.021
3.502AlaThr: 3.502 ± 0.015
4.594AlaVal: 4.594 ± 0.017
0.765AlaTrp: 0.765 ± 0.006
1.758AlaTyr: 1.758 ± 0.01
0.0AlaXaa: 0.0 ± 0.0
Cys
0.991CysAla: 0.991 ± 0.008
0.589CysCys: 0.589 ± 0.007
0.89CysAsp: 0.89 ± 0.007
0.924CysGlu: 0.924 ± 0.007
0.956CysPhe: 0.956 ± 0.008
1.39CysGly: 1.39 ± 0.009
0.498CysHis: 0.498 ± 0.005
1.032CysIle: 1.032 ± 0.007
1.207CysLys: 1.207 ± 0.009
1.987CysLeu: 1.987 ± 0.01
0.467CysMet: 0.467 ± 0.005
0.888CysAsn: 0.888 ± 0.006
0.945CysPro: 0.945 ± 0.007
0.652CysGln: 0.652 ± 0.006
1.04CysArg: 1.04 ± 0.007
1.914CysSer: 1.914 ± 0.011
0.871CysThr: 0.871 ± 0.007
1.049CysVal: 1.049 ± 0.007
0.264CysTrp: 0.264 ± 0.004
0.584CysTyr: 0.584 ± 0.006
0.0CysXaa: 0.0 ± 0.0
Asp
3.369AspAla: 3.369 ± 0.013
0.993AspCys: 0.993 ± 0.006
3.543AspAsp: 3.543 ± 0.022
3.84AspGlu: 3.84 ± 0.018
2.43AspPhe: 2.43 ± 0.011
3.852AspGly: 3.852 ± 0.014
1.282AspHis: 1.282 ± 0.008
2.939AspIle: 2.939 ± 0.013
2.721AspLys: 2.721 ± 0.013
5.152AspLeu: 5.152 ± 0.017
1.327AspMet: 1.327 ± 0.008
2.118AspAsn: 2.118 ± 0.01
2.531AspPro: 2.531 ± 0.013
1.832AspGln: 1.832 ± 0.012
2.38AspArg: 2.38 ± 0.014
4.32AspSer: 4.32 ± 0.019
2.199AspThr: 2.199 ± 0.01
3.524AspVal: 3.524 ± 0.014
0.713AspTrp: 0.713 ± 0.006
1.531AspTyr: 1.531 ± 0.01
0.0AspXaa: 0.0 ± 0.0
Glu
4.558GluAla: 4.558 ± 0.02
0.918GluCys: 0.918 ± 0.007
3.907GluAsp: 3.907 ± 0.02
6.216GluGlu: 6.216 ± 0.034
2.354GluPhe: 2.354 ± 0.01
3.82GluGly: 3.82 ± 0.015
1.253GluHis: 1.253 ± 0.009
3.895GluIle: 3.895 ± 0.016
4.844GluLys: 4.844 ± 0.025
6.008GluLeu: 6.008 ± 0.023
1.813GluMet: 1.813 ± 0.01
3.282GluAsn: 3.282 ± 0.013
2.084GluPro: 2.084 ± 0.011
2.186GluGln: 2.186 ± 0.012
3.467GluArg: 3.467 ± 0.019
4.694GluSer: 4.694 ± 0.017
3.1GluThr: 3.1 ± 0.014
4.171GluVal: 4.171 ± 0.016
0.752GluTrp: 0.752 ± 0.006
1.599GluTyr: 1.599 ± 0.01
0.0GluXaa: 0.0 ± 0.0
Phe
2.403PheAla: 2.403 ± 0.011
0.964PheCys: 0.964 ± 0.008
2.326PheAsp: 2.326 ± 0.011
2.301PheGlu: 2.301 ± 0.011
2.031PhePhe: 2.031 ± 0.012
3.053PheGly: 3.053 ± 0.017
1.097PheHis: 1.097 ± 0.008
2.071PheIle: 2.071 ± 0.011
2.165PheLys: 2.165 ± 0.011
4.476PheLeu: 4.476 ± 0.018
1.008PheMet: 1.008 ± 0.008
1.825PheAsn: 1.825 ± 0.011
2.169PhePro: 2.169 ± 0.011
1.617PheGln: 1.617 ± 0.009
1.969PheArg: 1.969 ± 0.01
4.267PheSer: 4.267 ± 0.016
1.999PheThr: 1.999 ± 0.01
2.651PheVal: 2.651 ± 0.012
0.568PheTrp: 0.568 ± 0.005
1.287PheTyr: 1.287 ± 0.008
0.0PheXaa: 0.0 ± 0.0
Gly
3.734GlyAla: 3.734 ± 0.017
1.384GlyCys: 1.384 ± 0.011
3.366GlyAsp: 3.366 ± 0.013
3.678GlyGlu: 3.678 ± 0.014
3.232GlyPhe: 3.232 ± 0.015
5.359GlyGly: 5.359 ± 0.041
1.557GlyHis: 1.557 ± 0.009
3.67GlyIle: 3.67 ± 0.012
4.119GlyLys: 4.119 ± 0.015
5.958GlyLeu: 5.958 ± 0.017
1.611GlyMet: 1.611 ± 0.009
3.337GlyAsn: 3.337 ± 0.016
2.436GlyPro: 2.436 ± 0.012
2.189GlyGln: 2.189 ± 0.01
3.486GlyArg: 3.486 ± 0.015
6.169GlySer: 6.169 ± 0.025
3.259GlyThr: 3.259 ± 0.015
4.064GlyVal: 4.064 ± 0.017
0.905GlyTrp: 0.905 ± 0.008
2.043GlyTyr: 2.043 ± 0.012
0.0GlyXaa: 0.0 ± 0.0
His
1.417HisAla: 1.417 ± 0.01
0.519HisCys: 0.519 ± 0.004
1.193HisAsp: 1.193 ± 0.008
1.338HisGlu: 1.338 ± 0.009
1.092HisPhe: 1.092 ± 0.007
1.766HisGly: 1.766 ± 0.011
0.947HisHis: 0.947 ± 0.009
1.185HisIle: 1.185 ± 0.007
1.182HisLys: 1.182 ± 0.008
2.523HisLeu: 2.523 ± 0.013
0.542HisMet: 0.542 ± 0.005
0.992HisAsn: 0.992 ± 0.007
1.357HisPro: 1.357 ± 0.009
1.09HisGln: 1.09 ± 0.008
1.371HisArg: 1.371 ± 0.009
1.96HisSer: 1.96 ± 0.01
0.932HisThr: 0.932 ± 0.006
1.571HisVal: 1.571 ± 0.009
0.29HisTrp: 0.29 ± 0.004
0.682HisTyr: 0.682 ± 0.006
0.0HisXaa: 0.0 ± 0.0
Ile
3.545IleAla: 3.545 ± 0.013
1.129IleCys: 1.129 ± 0.008
2.863IleAsp: 2.863 ± 0.011
3.249IleGlu: 3.249 ± 0.013
2.323IlePhe: 2.323 ± 0.01
3.36IleGly: 3.36 ± 0.014
1.294IleHis: 1.294 ± 0.008
2.852IleIle: 2.852 ± 0.013
3.021IleLys: 3.021 ± 0.013
5.285IleLeu: 5.285 ± 0.016
1.168IleMet: 1.168 ± 0.008
2.321IleAsn: 2.321 ± 0.011
3.059IlePro: 3.059 ± 0.016
2.028IleGln: 2.028 ± 0.012
2.554IleArg: 2.554 ± 0.012
5.028IleSer: 5.028 ± 0.016
2.611IleThr: 2.611 ± 0.012
3.339IleVal: 3.339 ± 0.014
0.718IleTrp: 0.718 ± 0.006
1.505IleTyr: 1.505 ± 0.009
0.0IleXaa: 0.0 ± 0.0
Lys
3.942LysAla: 3.942 ± 0.015
0.993LysCys: 0.993 ± 0.007
3.309LysAsp: 3.309 ± 0.015
4.787LysGlu: 4.787 ± 0.021
2.153LysPhe: 2.153 ± 0.01
3.672LysGly: 3.672 ± 0.014
1.399LysHis: 1.399 ± 0.009
3.359LysIle: 3.359 ± 0.013
4.785LysLys: 4.785 ± 0.025
6.104LysLeu: 6.104 ± 0.021
1.545LysMet: 1.545 ± 0.008
2.815LysAsn: 2.815 ± 0.012
2.755LysPro: 2.755 ± 0.012
2.38LysGln: 2.38 ± 0.012
3.655LysArg: 3.655 ± 0.017
4.756LysSer: 4.756 ± 0.018
2.968LysThr: 2.968 ± 0.013
3.799LysVal: 3.799 ± 0.015
0.772LysTrp: 0.772 ± 0.006
1.583LysTyr: 1.583 ± 0.01
0.0LysXaa: 0.0 ± 0.0
Leu
6.378LeuAla: 6.378 ± 0.021
1.879LeuCys: 1.879 ± 0.01
5.239LeuAsp: 5.239 ± 0.018
6.564LeuGlu: 6.564 ± 0.023
3.882LeuPhe: 3.882 ± 0.015
5.708LeuGly: 5.708 ± 0.018
2.656LeuHis: 2.656 ± 0.013
4.642LeuIle: 4.642 ± 0.017
6.426LeuLys: 6.426 ± 0.026
10.035LeuLeu: 10.035 ± 0.03
2.165LeuMet: 2.165 ± 0.011
4.109LeuAsn: 4.109 ± 0.017
5.26LeuPro: 5.26 ± 0.019
4.55LeuGln: 4.55 ± 0.02
5.244LeuArg: 5.244 ± 0.02
9.116LeuSer: 9.116 ± 0.031
4.497LeuThr: 4.497 ± 0.017
6.338LeuVal: 6.338 ± 0.022
1.145LeuTrp: 1.145 ± 0.009
2.542LeuTyr: 2.542 ± 0.012
0.0LeuXaa: 0.0 ± 0.0
Met
2.083MetAla: 2.083 ± 0.01
0.316MetCys: 0.316 ± 0.004
1.483MetAsp: 1.483 ± 0.009
2.047MetGlu: 2.047 ± 0.011
0.829MetPhe: 0.829 ± 0.006
1.654MetGly: 1.654 ± 0.008
0.582MetHis: 0.582 ± 0.005
1.218MetIle: 1.218 ± 0.008
1.625MetLys: 1.625 ± 0.009
2.254MetLeu: 2.254 ± 0.009
0.697MetMet: 0.697 ± 0.007
1.078MetAsn: 1.078 ± 0.007
1.124MetPro: 1.124 ± 0.008
0.99MetGln: 0.99 ± 0.008
1.174MetArg: 1.174 ± 0.008
1.81MetSer: 1.81 ± 0.011
1.079MetThr: 1.079 ± 0.007
1.671MetVal: 1.671 ± 0.009
0.261MetTrp: 0.261 ± 0.003
0.561MetTyr: 0.561 ± 0.005
0.0MetXaa: 0.0 ± 0.0
Asn
2.68AsnAla: 2.68 ± 0.012
0.936AsnCys: 0.936 ± 0.007
2.192AsnAsp: 2.192 ± 0.011
2.619AsnGlu: 2.619 ± 0.012
1.951AsnPhe: 1.951 ± 0.011
3.383AsnGly: 3.383 ± 0.016
1.177AsnHis: 1.177 ± 0.008
2.505AsnIle: 2.505 ± 0.012
2.526AsnLys: 2.526 ± 0.013
4.955AsnLeu: 4.955 ± 0.023
1.168AsnMet: 1.168 ± 0.008
2.51AsnAsn: 2.51 ± 0.015
2.381AsnPro: 2.381 ± 0.013
1.881AsnGln: 1.881 ± 0.012
2.042AsnArg: 2.042 ± 0.011
4.188AsnSer: 4.188 ± 0.014
2.059AsnThr: 2.059 ± 0.01
2.724AsnVal: 2.724 ± 0.011
0.576AsnTrp: 0.576 ± 0.005
1.329AsnTyr: 1.329 ± 0.009
0.0AsnXaa: 0.0 ± 0.0
Pro
3.019ProAla: 3.019 ± 0.014
0.831ProCys: 0.831 ± 0.006
2.49ProAsp: 2.49 ± 0.012
3.119ProGlu: 3.119 ± 0.013
2.032ProPhe: 2.032 ± 0.011
2.786ProGly: 2.786 ± 0.016
1.059ProHis: 1.059 ± 0.008
2.294ProIle: 2.294 ± 0.011
2.725ProLys: 2.725 ± 0.014
4.334ProLeu: 4.334 ± 0.017
0.976ProMet: 0.976 ± 0.007
2.277ProAsn: 2.277 ± 0.011
3.81ProPro: 3.81 ± 0.034
1.854ProGln: 1.854 ± 0.011
2.359ProArg: 2.359 ± 0.011
5.297ProSer: 5.297 ± 0.021
2.517ProThr: 2.517 ± 0.013
3.111ProVal: 3.111 ± 0.015
0.584ProTrp: 0.584 ± 0.005
1.288ProTyr: 1.288 ± 0.009
0.0ProXaa: 0.0 ± 0.0
Gln
2.385GlnAla: 2.385 ± 0.012
0.616GlnCys: 0.616 ± 0.005
1.738GlnAsp: 1.738 ± 0.01
2.568GlnGlu: 2.568 ± 0.014
1.401GlnPhe: 1.401 ± 0.008
2.244GlnGly: 2.244 ± 0.012
1.003GlnHis: 1.003 ± 0.008
2.069GlnIle: 2.069 ± 0.011
2.412GlnLys: 2.412 ± 0.012
3.815GlnLeu: 3.815 ± 0.016
0.993GlnMet: 0.993 ± 0.007
1.951GlnAsn: 1.951 ± 0.012
1.819GlnPro: 1.819 ± 0.014
2.327GlnGln: 2.327 ± 0.026
2.095GlnArg: 2.095 ± 0.011
3.032GlnSer: 3.032 ± 0.014
1.756GlnThr: 1.756 ± 0.011
2.389GlnVal: 2.389 ± 0.011
0.457GlnTrp: 0.457 ± 0.005
0.928GlnTyr: 0.928 ± 0.007
0.0GlnXaa: 0.0 ± 0.0
Arg
3.085ArgAla: 3.085 ± 0.015
0.98ArgCys: 0.98 ± 0.008
2.615ArgAsp: 2.615 ± 0.015
3.398ArgGlu: 3.398 ± 0.017
2.154ArgPhe: 2.154 ± 0.009
3.217ArgGly: 3.217 ± 0.016
1.257ArgHis: 1.257 ± 0.008
2.902ArgIle: 2.902 ± 0.014
3.846ArgLys: 3.846 ± 0.018
4.964ArgLeu: 4.964 ± 0.017
1.315ArgMet: 1.315 ± 0.008
2.562ArgAsn: 2.562 ± 0.011
2.202ArgPro: 2.202 ± 0.011
1.871ArgGln: 1.871 ± 0.011
3.711ArgArg: 3.711 ± 0.019
4.328ArgSer: 4.328 ± 0.019
2.389ArgThr: 2.389 ± 0.011
3.143ArgVal: 3.143 ± 0.013
0.714ArgTrp: 0.714 ± 0.006
1.395ArgTyr: 1.395 ± 0.009
0.0ArgXaa: 0.0 ± 0.0
Ser
5.289SerAla: 5.289 ± 0.02
1.839SerCys: 1.839 ± 0.01
4.406SerAsp: 4.406 ± 0.018
4.821SerGlu: 4.821 ± 0.018
4.147SerPhe: 4.147 ± 0.015
6.025SerGly: 6.025 ± 0.021
2.108SerHis: 2.108 ± 0.011
4.753SerIle: 4.753 ± 0.017
5.159SerLys: 5.159 ± 0.018
9.273SerLeu: 9.273 ± 0.033
2.219SerMet: 2.219 ± 0.01
4.399SerAsn: 4.399 ± 0.018
4.654SerPro: 4.654 ± 0.022
3.156SerGln: 3.156 ± 0.015
4.573SerArg: 4.573 ± 0.018
11.351SerSer: 11.351 ± 0.04
4.867SerThr: 4.867 ± 0.018
5.173SerVal: 5.173 ± 0.016
1.178SerTrp: 1.178 ± 0.008
2.338SerTyr: 2.338 ± 0.011
0.0SerXaa: 0.0 ± 0.0
Thr
3.353ThrAla: 3.353 ± 0.015
0.95ThrCys: 0.95 ± 0.008
2.289ThrAsp: 2.289 ± 0.012
2.821ThrGlu: 2.821 ± 0.013
1.999ThrPhe: 1.999 ± 0.01
3.394ThrGly: 3.394 ± 0.015
1.003ThrHis: 1.003 ± 0.008
2.737ThrIle: 2.737 ± 0.013
2.644ThrLys: 2.644 ± 0.011
4.555ThrLeu: 4.555 ± 0.016
1.164ThrMet: 1.164 ± 0.008
2.126ThrAsn: 2.126 ± 0.011
2.591ThrPro: 2.591 ± 0.014
1.533ThrGln: 1.533 ± 0.009
2.422ThrArg: 2.422 ± 0.012
4.763ThrSer: 4.763 ± 0.017
3.048ThrThr: 3.048 ± 0.017
3.302ThrVal: 3.302 ± 0.016
0.632ThrTrp: 0.632 ± 0.006
1.309ThrTyr: 1.309 ± 0.008
0.0ThrXaa: 0.0 ± 0.0
Val
4.613ValAla: 4.613 ± 0.016
1.159ValCys: 1.159 ± 0.007
3.598ValAsp: 3.598 ± 0.014
4.305ValGlu: 4.305 ± 0.018
2.718ValPhe: 2.718 ± 0.011
3.978ValGly: 3.978 ± 0.015
1.5ValHis: 1.5 ± 0.008
3.289ValIle: 3.289 ± 0.013
3.865ValLys: 3.865 ± 0.014
6.31ValLeu: 6.31 ± 0.018
1.502ValMet: 1.502 ± 0.009
2.599ValAsn: 2.599 ± 0.012
3.128ValPro: 3.128 ± 0.013
2.324ValGln: 2.324 ± 0.011
2.952ValArg: 2.952 ± 0.012
5.47ValSer: 5.47 ± 0.017
3.107ValThr: 3.107 ± 0.014
4.632ValVal: 4.632 ± 0.019
0.751ValTrp: 0.751 ± 0.006
1.858ValTyr: 1.858 ± 0.01
0.0ValXaa: 0.0 ± 0.0
Trp
0.734TrpAla: 0.734 ± 0.005
0.247TrpCys: 0.247 ± 0.003
0.675TrpAsp: 0.675 ± 0.006
0.741TrpGlu: 0.741 ± 0.006
0.573TrpPhe: 0.573 ± 0.006
0.723TrpGly: 0.723 ± 0.007
0.301TrpHis: 0.301 ± 0.003
0.712TrpIle: 0.712 ± 0.006
0.986TrpLys: 0.986 ± 0.007
1.221TrpLeu: 1.221 ± 0.009
0.358TrpMet: 0.358 ± 0.004
0.709TrpAsn: 0.709 ± 0.007
0.493TrpPro: 0.493 ± 0.005
0.461TrpGln: 0.461 ± 0.005
0.804TrpArg: 0.804 ± 0.006
0.968TrpSer: 0.968 ± 0.007
0.626TrpThr: 0.626 ± 0.005
0.768TrpVal: 0.768 ± 0.006
0.234TrpTrp: 0.234 ± 0.004
0.333TrpTyr: 0.333 ± 0.004
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.694TyrAla: 1.694 ± 0.01
0.641TyrCys: 0.641 ± 0.006
1.48TyrAsp: 1.48 ± 0.01
1.568TyrGlu: 1.568 ± 0.009
1.277TyrPhe: 1.277 ± 0.009
2.087TyrGly: 2.087 ± 0.014
0.713TyrHis: 0.713 ± 0.006
1.421TyrIle: 1.421 ± 0.009
1.506TyrLys: 1.506 ± 0.01
2.785TyrLeu: 2.785 ± 0.013
0.769TyrMet: 0.769 ± 0.007
1.315TyrAsn: 1.315 ± 0.008
1.243TyrPro: 1.243 ± 0.009
0.97TyrGln: 0.97 ± 0.006
1.423TyrArg: 1.423 ± 0.009
2.276TyrSer: 2.276 ± 0.012
1.248TyrThr: 1.248 ± 0.009
1.645TyrVal: 1.645 ± 0.009
0.39TyrTrp: 0.39 ± 0.005
0.936TyrTyr: 0.936 ± 0.008
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.014XaaXaa: 0.014 ± 0.006
Statistics based on 53336 proteins (21108701 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski