Amino acid dipepetide frequency for Pyrus ussuriensis x Pyrus communis

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.404AlaAla: 6.404 ± 0.029
1.254AlaCys: 1.254 ± 0.009
3.134AlaAsp: 3.134 ± 0.015
4.186AlaGlu: 4.186 ± 0.02
2.777AlaPhe: 2.777 ± 0.014
4.111AlaGly: 4.111 ± 0.016
1.367AlaHis: 1.367 ± 0.009
3.632AlaIle: 3.632 ± 0.015
3.913AlaLys: 3.913 ± 0.017
6.646AlaLeu: 6.646 ± 0.022
1.779AlaMet: 1.779 ± 0.012
2.626AlaAsn: 2.626 ± 0.013
2.934AlaPro: 2.934 ± 0.019
2.192AlaGln: 2.192 ± 0.012
3.367AlaArg: 3.367 ± 0.015
6.0AlaSer: 6.0 ± 0.023
3.684AlaThr: 3.684 ± 0.014
4.808AlaVal: 4.808 ± 0.021
0.79AlaTrp: 0.79 ± 0.007
1.819AlaTyr: 1.819 ± 0.011
0.0AlaXaa: 0.0 ± 0.0
Cys
1.003CysAla: 1.003 ± 0.009
0.525CysCys: 0.525 ± 0.006
0.873CysAsp: 0.873 ± 0.008
0.896CysGlu: 0.896 ± 0.008
0.919CysPhe: 0.919 ± 0.008
1.391CysGly: 1.391 ± 0.011
0.475CysHis: 0.475 ± 0.006
1.014CysIle: 1.014 ± 0.008
1.182CysLys: 1.182 ± 0.01
1.866CysLeu: 1.866 ± 0.011
0.455CysMet: 0.455 ± 0.005
0.87CysAsn: 0.87 ± 0.007
0.965CysPro: 0.965 ± 0.008
0.625CysGln: 0.625 ± 0.007
1.095CysArg: 1.095 ± 0.008
1.842CysSer: 1.842 ± 0.012
0.916CysThr: 0.916 ± 0.007
1.071CysVal: 1.071 ± 0.009
0.273CysTrp: 0.273 ± 0.005
0.539CysTyr: 0.539 ± 0.005
0.0CysXaa: 0.0 ± 0.0
Asp
3.436AspAla: 3.436 ± 0.014
0.974AspCys: 0.974 ± 0.008
3.609AspAsp: 3.609 ± 0.02
3.939AspGlu: 3.939 ± 0.019
2.325AspPhe: 2.325 ± 0.011
3.826AspGly: 3.826 ± 0.016
1.31AspHis: 1.31 ± 0.011
2.897AspIle: 2.897 ± 0.014
2.683AspLys: 2.683 ± 0.015
5.069AspLeu: 5.069 ± 0.019
1.351AspMet: 1.351 ± 0.01
2.06AspAsn: 2.06 ± 0.011
2.555AspPro: 2.555 ± 0.013
1.831AspGln: 1.831 ± 0.012
2.412AspArg: 2.412 ± 0.013
4.116AspSer: 4.116 ± 0.018
2.148AspThr: 2.148 ± 0.012
3.68AspVal: 3.68 ± 0.018
0.73AspTrp: 0.73 ± 0.007
1.54AspTyr: 1.54 ± 0.008
0.0AspXaa: 0.0 ± 0.0
Glu
4.789GluAla: 4.789 ± 0.022
0.914GluCys: 0.914 ± 0.008
3.94GluAsp: 3.94 ± 0.018
6.102GluGlu: 6.102 ± 0.04
2.398GluPhe: 2.398 ± 0.012
3.828GluGly: 3.828 ± 0.018
1.242GluHis: 1.242 ± 0.009
3.65GluIle: 3.65 ± 0.019
4.645GluLys: 4.645 ± 0.024
5.994GluLeu: 5.994 ± 0.029
1.742GluMet: 1.742 ± 0.012
3.048GluAsn: 3.048 ± 0.015
2.151GluPro: 2.151 ± 0.012
2.18GluGln: 2.18 ± 0.016
3.356GluArg: 3.356 ± 0.017
4.523GluSer: 4.523 ± 0.021
3.033GluThr: 3.033 ± 0.016
4.23GluVal: 4.23 ± 0.019
0.75GluTrp: 0.75 ± 0.007
1.619GluTyr: 1.619 ± 0.011
0.0GluXaa: 0.0 ± 0.0
Phe
2.467PheAla: 2.467 ± 0.014
0.924PheCys: 0.924 ± 0.008
2.36PheAsp: 2.36 ± 0.012
2.353PheGlu: 2.353 ± 0.014
1.917PhePhe: 1.917 ± 0.013
3.115PheGly: 3.115 ± 0.015
1.112PheHis: 1.112 ± 0.008
2.002PheIle: 2.002 ± 0.011
2.131PheLys: 2.131 ± 0.011
4.289PheLeu: 4.289 ± 0.018
0.98PheMet: 0.98 ± 0.008
1.75PheAsn: 1.75 ± 0.011
2.143PhePro: 2.143 ± 0.012
1.581PheGln: 1.581 ± 0.01
2.087PheArg: 2.087 ± 0.011
4.053PheSer: 4.053 ± 0.018
2.024PheThr: 2.024 ± 0.013
2.851PheVal: 2.851 ± 0.015
0.587PheTrp: 0.587 ± 0.006
1.27PheTyr: 1.27 ± 0.009
0.0PheXaa: 0.0 ± 0.0
Gly
3.896GlyAla: 3.896 ± 0.019
1.37GlyCys: 1.37 ± 0.011
3.394GlyAsp: 3.394 ± 0.015
3.685GlyGlu: 3.685 ± 0.018
3.246GlyPhe: 3.246 ± 0.016
5.644GlyGly: 5.644 ± 0.035
1.547GlyHis: 1.547 ± 0.011
3.573GlyIle: 3.573 ± 0.016
4.105GlyLys: 4.105 ± 0.016
5.975GlyLeu: 5.975 ± 0.022
1.58GlyMet: 1.58 ± 0.011
3.174GlyAsn: 3.174 ± 0.018
2.464GlyPro: 2.464 ± 0.016
2.148GlyGln: 2.148 ± 0.012
3.703GlyArg: 3.703 ± 0.019
6.131GlySer: 6.131 ± 0.026
3.285GlyThr: 3.285 ± 0.018
4.316GlyVal: 4.316 ± 0.017
0.922GlyTrp: 0.922 ± 0.008
2.067GlyTyr: 2.067 ± 0.014
0.0GlyXaa: 0.0 ± 0.0
His
1.444HisAla: 1.444 ± 0.01
0.542HisCys: 0.542 ± 0.006
1.176HisAsp: 1.176 ± 0.009
1.34HisGlu: 1.34 ± 0.009
1.1HisPhe: 1.1 ± 0.008
1.756HisGly: 1.756 ± 0.012
1.025HisHis: 1.025 ± 0.012
1.237HisIle: 1.237 ± 0.009
1.225HisLys: 1.225 ± 0.009
2.484HisLeu: 2.484 ± 0.014
0.585HisMet: 0.585 ± 0.005
1.016HisAsn: 1.016 ± 0.009
1.39HisPro: 1.39 ± 0.01
1.078HisGln: 1.078 ± 0.009
1.39HisArg: 1.39 ± 0.009
1.964HisSer: 1.964 ± 0.013
1.018HisThr: 1.018 ± 0.008
1.564HisVal: 1.564 ± 0.01
0.313HisTrp: 0.313 ± 0.005
0.695HisTyr: 0.695 ± 0.008
0.0HisXaa: 0.0 ± 0.0
Ile
3.47IleAla: 3.47 ± 0.017
1.068IleCys: 1.068 ± 0.009
2.76IleAsp: 2.76 ± 0.013
3.079IleGlu: 3.079 ± 0.017
2.261IlePhe: 2.261 ± 0.011
3.355IleGly: 3.355 ± 0.015
1.277IleHis: 1.277 ± 0.008
2.598IleIle: 2.598 ± 0.016
2.862IleLys: 2.862 ± 0.014
5.017IleLeu: 5.017 ± 0.02
1.11IleMet: 1.11 ± 0.008
2.133IleAsn: 2.133 ± 0.012
3.029IlePro: 3.029 ± 0.02
1.945IleGln: 1.945 ± 0.012
2.592IleArg: 2.592 ± 0.014
4.744IleSer: 4.744 ± 0.019
2.538IleThr: 2.538 ± 0.011
3.446IleVal: 3.446 ± 0.017
0.716IleTrp: 0.716 ± 0.007
1.436IleTyr: 1.436 ± 0.01
0.0IleXaa: 0.0 ± 0.0
Lys
4.109LysAla: 4.109 ± 0.019
0.998LysCys: 0.998 ± 0.008
3.235LysAsp: 3.235 ± 0.016
4.561LysGlu: 4.561 ± 0.024
2.224LysPhe: 2.224 ± 0.012
3.673LysGly: 3.673 ± 0.018
1.384LysHis: 1.384 ± 0.01
3.139LysIle: 3.139 ± 0.013
4.785LysLys: 4.785 ± 0.027
6.08LysLeu: 6.08 ± 0.023
1.512LysMet: 1.512 ± 0.009
2.696LysAsn: 2.696 ± 0.014
2.826LysPro: 2.826 ± 0.017
2.291LysGln: 2.291 ± 0.013
3.558LysArg: 3.558 ± 0.017
4.691LysSer: 4.691 ± 0.019
2.921LysThr: 2.921 ± 0.013
3.931LysVal: 3.931 ± 0.02
0.808LysTrp: 0.808 ± 0.007
1.646LysTyr: 1.646 ± 0.011
0.0LysXaa: 0.0 ± 0.0
Leu
6.427LeuAla: 6.427 ± 0.022
1.897LeuCys: 1.897 ± 0.013
5.137LeuAsp: 5.137 ± 0.02
6.325LeuGlu: 6.325 ± 0.026
3.827LeuPhe: 3.827 ± 0.021
5.964LeuGly: 5.964 ± 0.02
2.651LeuHis: 2.651 ± 0.014
4.545LeuIle: 4.545 ± 0.019
6.251LeuLys: 6.251 ± 0.027
9.817LeuLeu: 9.817 ± 0.037
2.114LeuMet: 2.114 ± 0.011
3.989LeuAsn: 3.989 ± 0.02
5.316LeuPro: 5.316 ± 0.022
4.321LeuGln: 4.321 ± 0.02
5.401LeuArg: 5.401 ± 0.022
8.748LeuSer: 8.748 ± 0.032
4.575LeuThr: 4.575 ± 0.02
6.505LeuVal: 6.505 ± 0.023
1.176LeuTrp: 1.176 ± 0.009
2.494LeuTyr: 2.494 ± 0.013
0.0LeuXaa: 0.0 ± 0.0
Met
2.147MetAla: 2.147 ± 0.012
0.35MetCys: 0.35 ± 0.005
1.418MetAsp: 1.418 ± 0.01
1.985MetGlu: 1.985 ± 0.013
0.821MetPhe: 0.821 ± 0.007
1.646MetGly: 1.646 ± 0.01
0.577MetHis: 0.577 ± 0.006
1.149MetIle: 1.149 ± 0.009
1.644MetLys: 1.644 ± 0.011
2.178MetLeu: 2.178 ± 0.012
0.67MetMet: 0.67 ± 0.006
1.026MetAsn: 1.026 ± 0.008
1.113MetPro: 1.113 ± 0.009
0.946MetGln: 0.946 ± 0.008
1.209MetArg: 1.209 ± 0.009
1.765MetSer: 1.765 ± 0.011
1.064MetThr: 1.064 ± 0.008
1.696MetVal: 1.696 ± 0.011
0.278MetTrp: 0.278 ± 0.004
0.572MetTyr: 0.572 ± 0.007
0.0MetXaa: 0.0 ± 0.0
Asn
2.687AsnAla: 2.687 ± 0.014
0.895AsnCys: 0.895 ± 0.008
2.068AsnAsp: 2.068 ± 0.011
2.466AsnGlu: 2.466 ± 0.014
1.967AsnPhe: 1.967 ± 0.013
3.287AsnGly: 3.287 ± 0.016
1.128AsnHis: 1.128 ± 0.009
2.353AsnIle: 2.353 ± 0.013
2.41AsnLys: 2.41 ± 0.014
4.863AsnLeu: 4.863 ± 0.029
1.117AsnMet: 1.117 ± 0.009
2.331AsnAsn: 2.331 ± 0.017
2.45AsnPro: 2.45 ± 0.013
1.76AsnGln: 1.76 ± 0.011
2.027AsnArg: 2.027 ± 0.012
3.93AsnSer: 3.93 ± 0.018
1.964AsnThr: 1.964 ± 0.01
2.861AsnVal: 2.861 ± 0.014
0.584AsnTrp: 0.584 ± 0.006
1.342AsnTyr: 1.342 ± 0.01
0.0AsnXaa: 0.0 ± 0.0
Pro
3.113ProAla: 3.113 ± 0.017
0.78ProCys: 0.78 ± 0.007
2.459ProAsp: 2.459 ± 0.013
3.116ProGlu: 3.116 ± 0.015
1.989ProPhe: 1.989 ± 0.012
2.65ProGly: 2.65 ± 0.015
1.197ProHis: 1.197 ± 0.01
2.37ProIle: 2.37 ± 0.014
2.916ProLys: 2.916 ± 0.016
4.368ProLeu: 4.368 ± 0.019
1.032ProMet: 1.032 ± 0.009
2.447ProAsn: 2.447 ± 0.013
4.062ProPro: 4.062 ± 0.039
1.92ProGln: 1.92 ± 0.012
2.422ProArg: 2.422 ± 0.015
5.289ProSer: 5.289 ± 0.024
2.851ProThr: 2.851 ± 0.015
3.046ProVal: 3.046 ± 0.014
0.6ProTrp: 0.6 ± 0.007
1.327ProTyr: 1.327 ± 0.011
0.0ProXaa: 0.0 ± 0.0
Gln
2.396GlnAla: 2.396 ± 0.014
0.6GlnCys: 0.6 ± 0.007
1.668GlnAsp: 1.668 ± 0.01
2.393GlnGlu: 2.393 ± 0.015
1.4GlnPhe: 1.4 ± 0.009
2.203GlnGly: 2.203 ± 0.013
0.967GlnHis: 0.967 ± 0.009
2.019GlnIle: 2.019 ± 0.012
2.333GlnLys: 2.333 ± 0.013
3.755GlnLeu: 3.755 ± 0.017
1.003GlnMet: 1.003 ± 0.007
1.816GlnAsn: 1.816 ± 0.012
1.848GlnPro: 1.848 ± 0.013
2.094GlnGln: 2.094 ± 0.025
2.161GlnArg: 2.161 ± 0.013
2.904GlnSer: 2.904 ± 0.015
1.822GlnThr: 1.822 ± 0.01
2.429GlnVal: 2.429 ± 0.012
0.47GlnTrp: 0.47 ± 0.005
0.933GlnTyr: 0.933 ± 0.008
0.0GlnXaa: 0.0 ± 0.0
Arg
3.233ArgAla: 3.233 ± 0.015
0.988ArgCys: 0.988 ± 0.009
2.612ArgAsp: 2.612 ± 0.014
3.329ArgGlu: 3.329 ± 0.016
2.196ArgPhe: 2.196 ± 0.012
3.289ArgGly: 3.289 ± 0.019
1.337ArgHis: 1.337 ± 0.009
2.753ArgIle: 2.753 ± 0.012
3.743ArgLys: 3.743 ± 0.018
5.085ArgLeu: 5.085 ± 0.02
1.337ArgMet: 1.337 ± 0.009
2.45ArgAsn: 2.45 ± 0.014
2.413ArgPro: 2.413 ± 0.014
1.88ArgGln: 1.88 ± 0.01
3.907ArgArg: 3.907 ± 0.022
4.35ArgSer: 4.35 ± 0.019
2.536ArgThr: 2.536 ± 0.013
3.387ArgVal: 3.387 ± 0.015
0.768ArgTrp: 0.768 ± 0.007
1.408ArgTyr: 1.408 ± 0.01
0.0ArgXaa: 0.0 ± 0.0
Ser
5.473SerAla: 5.473 ± 0.023
1.696SerCys: 1.696 ± 0.012
4.373SerAsp: 4.373 ± 0.021
4.724SerGlu: 4.724 ± 0.023
3.92SerPhe: 3.92 ± 0.017
6.005SerGly: 6.005 ± 0.025
2.067SerHis: 2.067 ± 0.012
4.384SerIle: 4.384 ± 0.019
4.962SerLys: 4.962 ± 0.017
8.83SerLeu: 8.83 ± 0.041
2.073SerMet: 2.073 ± 0.011
4.149SerAsn: 4.149 ± 0.019
4.576SerPro: 4.576 ± 0.026
3.041SerGln: 3.041 ± 0.017
4.438SerArg: 4.438 ± 0.018
11.03SerSer: 11.03 ± 0.039
4.831SerThr: 4.831 ± 0.021
5.261SerVal: 5.261 ± 0.019
1.206SerTrp: 1.206 ± 0.01
2.271SerTyr: 2.271 ± 0.013
0.0SerXaa: 0.0 ± 0.0
Thr
3.519ThrAla: 3.519 ± 0.016
0.936ThrCys: 0.936 ± 0.008
2.255ThrAsp: 2.255 ± 0.013
2.814ThrGlu: 2.814 ± 0.016
2.094ThrPhe: 2.094 ± 0.01
3.258ThrGly: 3.258 ± 0.014
1.075ThrHis: 1.075 ± 0.008
2.707ThrIle: 2.707 ± 0.012
2.808ThrLys: 2.808 ± 0.015
4.697ThrLeu: 4.697 ± 0.018
1.187ThrMet: 1.187 ± 0.008
2.192ThrAsn: 2.192 ± 0.013
2.696ThrPro: 2.696 ± 0.016
1.628ThrGln: 1.628 ± 0.01
2.443ThrArg: 2.443 ± 0.013
4.674ThrSer: 4.674 ± 0.019
3.191ThrThr: 3.191 ± 0.02
3.329ThrVal: 3.329 ± 0.014
0.666ThrTrp: 0.666 ± 0.007
1.394ThrTyr: 1.394 ± 0.01
0.0ThrXaa: 0.0 ± 0.0
Val
4.854ValAla: 4.854 ± 0.021
1.198ValCys: 1.198 ± 0.01
3.768ValAsp: 3.768 ± 0.017
4.504ValGlu: 4.504 ± 0.018
2.79ValPhe: 2.79 ± 0.013
4.35ValGly: 4.35 ± 0.018
1.579ValHis: 1.579 ± 0.01
3.322ValIle: 3.322 ± 0.014
3.926ValLys: 3.926 ± 0.018
6.453ValLeu: 6.453 ± 0.027
1.564ValMet: 1.564 ± 0.01
2.64ValAsn: 2.64 ± 0.014
3.263ValPro: 3.263 ± 0.013
2.375ValGln: 2.375 ± 0.013
3.179ValArg: 3.179 ± 0.016
5.369ValSer: 5.369 ± 0.019
3.237ValThr: 3.237 ± 0.014
5.099ValVal: 5.099 ± 0.022
0.802ValTrp: 0.802 ± 0.007
1.91ValTyr: 1.91 ± 0.012
0.0ValXaa: 0.0 ± 0.0
Trp
0.772TrpAla: 0.772 ± 0.007
0.265TrpCys: 0.265 ± 0.004
0.711TrpAsp: 0.711 ± 0.007
0.778TrpGlu: 0.778 ± 0.008
0.569TrpPhe: 0.569 ± 0.006
0.783TrpGly: 0.783 ± 0.008
0.305TrpHis: 0.305 ± 0.004
0.695TrpIle: 0.695 ± 0.006
0.988TrpLys: 0.988 ± 0.008
1.256TrpLeu: 1.256 ± 0.009
0.342TrpMet: 0.342 ± 0.004
0.721TrpAsn: 0.721 ± 0.007
0.507TrpPro: 0.507 ± 0.006
0.441TrpGln: 0.441 ± 0.005
0.84TrpArg: 0.84 ± 0.006
0.981TrpSer: 0.981 ± 0.008
0.641TrpThr: 0.641 ± 0.006
0.883TrpVal: 0.883 ± 0.008
0.247TrpTrp: 0.247 ± 0.004
0.351TrpTyr: 0.351 ± 0.005
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.786TyrAla: 1.786 ± 0.011
0.623TyrCys: 0.623 ± 0.006
1.538TyrAsp: 1.538 ± 0.009
1.561TyrGlu: 1.561 ± 0.01
1.257TyrPhe: 1.257 ± 0.009
2.09TyrGly: 2.09 ± 0.014
0.718TyrHis: 0.718 ± 0.008
1.395TyrIle: 1.395 ± 0.01
1.557TyrLys: 1.557 ± 0.012
2.756TyrLeu: 2.756 ± 0.014
0.722TyrMet: 0.722 ± 0.007
1.356TyrAsn: 1.356 ± 0.01
1.256TyrPro: 1.256 ± 0.009
0.936TyrGln: 0.936 ± 0.008
1.415TyrArg: 1.415 ± 0.008
2.206TyrSer: 2.206 ± 0.012
1.285TyrThr: 1.285 ± 0.008
1.789TyrVal: 1.789 ± 0.01
0.39TyrTrp: 0.39 ± 0.005
0.934TyrTyr: 0.934 ± 0.008
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 41937 proteins (16655602 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski