Amino acid dipepetide frequency for Spinacia oleracea (Spinach)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.287AlaAla: 6.287 ± 0.041
1.2AlaCys: 1.2 ± 0.012
3.185AlaAsp: 3.185 ± 0.02
4.18AlaGlu: 4.18 ± 0.027
2.662AlaPhe: 2.662 ± 0.019
4.069AlaGly: 4.069 ± 0.024
1.286AlaHis: 1.286 ± 0.011
3.652AlaIle: 3.652 ± 0.022
3.88AlaLys: 3.88 ± 0.025
6.272AlaLeu: 6.272 ± 0.034
1.672AlaMet: 1.672 ± 0.014
2.563AlaAsn: 2.563 ± 0.017
2.717AlaPro: 2.717 ± 0.023
2.105AlaGln: 2.105 ± 0.018
3.204AlaArg: 3.204 ± 0.02
5.796AlaSer: 5.796 ± 0.03
3.592AlaThr: 3.592 ± 0.023
4.903AlaVal: 4.903 ± 0.031
0.72AlaTrp: 0.72 ± 0.01
1.796AlaTyr: 1.796 ± 0.014
0.002AlaXaa: 0.002 ± 0.001
Cys
0.978CysAla: 0.978 ± 0.011
0.549CysCys: 0.549 ± 0.009
0.883CysAsp: 0.883 ± 0.009
0.887CysGlu: 0.887 ± 0.011
0.868CysPhe: 0.868 ± 0.01
1.387CysGly: 1.387 ± 0.015
0.463CysHis: 0.463 ± 0.007
0.995CysIle: 0.995 ± 0.009
1.154CysLys: 1.154 ± 0.012
1.863CysLeu: 1.863 ± 0.015
0.432CysMet: 0.432 ± 0.007
0.868CysAsn: 0.868 ± 0.01
0.937CysPro: 0.937 ± 0.011
0.62CysGln: 0.62 ± 0.009
1.029CysArg: 1.029 ± 0.012
1.843CysSer: 1.843 ± 0.015
0.866CysThr: 0.866 ± 0.01
1.099CysVal: 1.099 ± 0.012
0.244CysTrp: 0.244 ± 0.005
0.566CysTyr: 0.566 ± 0.008
0.001CysXaa: 0.001 ± 0.0
Asp
3.507AspAla: 3.507 ± 0.022
0.982AspCys: 0.982 ± 0.011
4.072AspAsp: 4.072 ± 0.033
4.258AspGlu: 4.258 ± 0.026
2.38AspPhe: 2.38 ± 0.017
3.809AspGly: 3.809 ± 0.027
1.268AspHis: 1.268 ± 0.014
3.113AspIle: 3.113 ± 0.02
2.881AspLys: 2.881 ± 0.021
4.995AspLeu: 4.995 ± 0.025
1.365AspMet: 1.365 ± 0.013
2.314AspAsn: 2.314 ± 0.015
2.565AspPro: 2.565 ± 0.019
1.808AspGln: 1.808 ± 0.016
2.393AspArg: 2.393 ± 0.022
4.331AspSer: 4.331 ± 0.024
2.315AspThr: 2.315 ± 0.017
3.856AspVal: 3.856 ± 0.023
0.694AspTrp: 0.694 ± 0.008
1.631AspTyr: 1.631 ± 0.014
0.002AspXaa: 0.002 ± 0.001
Glu
4.74GluAla: 4.74 ± 0.03
0.892GluCys: 0.892 ± 0.011
4.133GluAsp: 4.133 ± 0.029
6.26GluGlu: 6.26 ± 0.049
2.407GluPhe: 2.407 ± 0.017
3.728GluGly: 3.728 ± 0.023
1.254GluHis: 1.254 ± 0.013
3.711GluIle: 3.711 ± 0.024
4.849GluLys: 4.849 ± 0.033
5.959GluLeu: 5.959 ± 0.032
1.788GluMet: 1.788 ± 0.016
3.174GluAsn: 3.174 ± 0.022
2.076GluPro: 2.076 ± 0.016
2.215GluGln: 2.215 ± 0.018
3.32GluArg: 3.32 ± 0.023
4.516GluSer: 4.516 ± 0.026
3.048GluThr: 3.048 ± 0.018
4.359GluVal: 4.359 ± 0.024
0.706GluTrp: 0.706 ± 0.009
1.695GluTyr: 1.695 ± 0.014
0.002GluXaa: 0.002 ± 0.001
Phe
2.423PheAla: 2.423 ± 0.016
0.894PheCys: 0.894 ± 0.011
2.447PheAsp: 2.447 ± 0.017
2.284PheGlu: 2.284 ± 0.017
1.891PhePhe: 1.891 ± 0.019
2.986PheGly: 2.986 ± 0.022
1.086PheHis: 1.086 ± 0.011
2.02PheIle: 2.02 ± 0.017
2.107PheLys: 2.107 ± 0.017
4.149PheLeu: 4.149 ± 0.027
0.953PheMet: 0.953 ± 0.011
1.784PheAsn: 1.784 ± 0.014
2.046PhePro: 2.046 ± 0.015
1.535PheGln: 1.535 ± 0.013
2.058PheArg: 2.058 ± 0.015
4.103PheSer: 4.103 ± 0.026
2.01PheThr: 2.01 ± 0.016
2.75PheVal: 2.75 ± 0.018
0.544PheTrp: 0.544 ± 0.009
1.252PheTyr: 1.252 ± 0.012
0.001PheXaa: 0.001 ± 0.0
Gly
3.823GlyAla: 3.823 ± 0.027
1.296GlyCys: 1.296 ± 0.014
3.473GlyAsp: 3.473 ± 0.022
3.696GlyGlu: 3.696 ± 0.023
3.102GlyPhe: 3.102 ± 0.023
6.092GlyGly: 6.092 ± 0.1
1.524GlyHis: 1.524 ± 0.017
3.585GlyIle: 3.585 ± 0.023
4.191GlyLys: 4.191 ± 0.025
5.828GlyLeu: 5.828 ± 0.029
1.5GlyMet: 1.5 ± 0.014
3.219GlyAsn: 3.219 ± 0.022
2.422GlyPro: 2.422 ± 0.019
2.039GlyGln: 2.039 ± 0.014
3.58GlyArg: 3.58 ± 0.026
5.916GlySer: 5.916 ± 0.037
3.147GlyThr: 3.147 ± 0.022
4.445GlyVal: 4.445 ± 0.034
0.891GlyTrp: 0.891 ± 0.01
2.16GlyTyr: 2.16 ± 0.02
0.003GlyXaa: 0.003 ± 0.0
His
1.333HisAla: 1.333 ± 0.013
0.517HisCys: 0.517 ± 0.007
1.171HisAsp: 1.171 ± 0.013
1.27HisGlu: 1.27 ± 0.012
1.059HisPhe: 1.059 ± 0.011
1.689HisGly: 1.689 ± 0.019
1.085HisHis: 1.085 ± 0.018
1.19HisIle: 1.19 ± 0.013
1.173HisLys: 1.173 ± 0.011
2.37HisLeu: 2.37 ± 0.018
0.554HisMet: 0.554 ± 0.009
1.038HisAsn: 1.038 ± 0.011
1.378HisPro: 1.378 ± 0.012
1.099HisGln: 1.099 ± 0.012
1.308HisArg: 1.308 ± 0.013
1.911HisSer: 1.911 ± 0.016
0.997HisThr: 0.997 ± 0.012
1.506HisVal: 1.506 ± 0.013
0.285HisTrp: 0.285 ± 0.005
0.718HisTyr: 0.718 ± 0.009
0.001HisXaa: 0.001 ± 0.0
Ile
3.454IleAla: 3.454 ± 0.024
1.106IleCys: 1.106 ± 0.011
3.0IleAsp: 3.0 ± 0.022
3.182IleGlu: 3.182 ± 0.022
2.233IlePhe: 2.233 ± 0.015
3.315IleGly: 3.315 ± 0.023
1.271IleHis: 1.271 ± 0.012
2.818IleIle: 2.818 ± 0.021
2.977IleLys: 2.977 ± 0.021
5.174IleLeu: 5.174 ± 0.03
1.143IleMet: 1.143 ± 0.011
2.263IleAsn: 2.263 ± 0.016
2.947IlePro: 2.947 ± 0.025
2.017IleGln: 2.017 ± 0.015
2.617IleArg: 2.617 ± 0.018
4.951IleSer: 4.951 ± 0.028
2.632IleThr: 2.632 ± 0.017
3.465IleVal: 3.465 ± 0.022
0.668IleTrp: 0.668 ± 0.009
1.499IleTyr: 1.499 ± 0.015
0.002IleXaa: 0.002 ± 0.0
Lys
3.98LysAla: 3.98 ± 0.024
1.001LysCys: 1.001 ± 0.011
3.407LysAsp: 3.407 ± 0.023
4.749LysGlu: 4.749 ± 0.032
2.232LysPhe: 2.232 ± 0.018
3.712LysGly: 3.712 ± 0.023
1.387LysHis: 1.387 ± 0.011
3.217LysIle: 3.217 ± 0.02
4.955LysLys: 4.955 ± 0.035
6.011LysLeu: 6.011 ± 0.029
1.567LysMet: 1.567 ± 0.013
2.716LysAsn: 2.716 ± 0.018
2.787LysPro: 2.787 ± 0.021
2.308LysGln: 2.308 ± 0.017
3.608LysArg: 3.608 ± 0.024
4.797LysSer: 4.797 ± 0.031
2.964LysThr: 2.964 ± 0.019
3.975LysVal: 3.975 ± 0.025
0.796LysTrp: 0.796 ± 0.008
1.66LysTyr: 1.66 ± 0.015
0.002LysXaa: 0.002 ± 0.0
Leu
6.231LeuAla: 6.231 ± 0.026
1.83LeuCys: 1.83 ± 0.014
5.081LeuAsp: 5.081 ± 0.028
6.196LeuGlu: 6.196 ± 0.031
3.752LeuPhe: 3.752 ± 0.024
5.69LeuGly: 5.69 ± 0.028
2.486LeuHis: 2.486 ± 0.018
4.64LeuIle: 4.64 ± 0.024
6.197LeuLys: 6.197 ± 0.03
9.736LeuLeu: 9.736 ± 0.051
2.145LeuMet: 2.145 ± 0.015
3.967LeuAsn: 3.967 ± 0.024
5.084LeuPro: 5.084 ± 0.03
4.285LeuGln: 4.285 ± 0.024
5.181LeuArg: 5.181 ± 0.029
8.669LeuSer: 8.669 ± 0.04
4.423LeuThr: 4.423 ± 0.022
6.264LeuVal: 6.264 ± 0.025
1.091LeuTrp: 1.091 ± 0.013
2.483LeuTyr: 2.483 ± 0.016
0.003LeuXaa: 0.003 ± 0.001
Met
2.008MetAla: 2.008 ± 0.015
0.325MetCys: 0.325 ± 0.005
1.393MetAsp: 1.393 ± 0.012
1.953MetGlu: 1.953 ± 0.017
0.833MetPhe: 0.833 ± 0.01
1.613MetGly: 1.613 ± 0.016
0.499MetHis: 0.499 ± 0.008
1.271MetIle: 1.271 ± 0.015
1.652MetLys: 1.652 ± 0.014
2.194MetLeu: 2.194 ± 0.016
0.742MetMet: 0.742 ± 0.011
1.049MetAsn: 1.049 ± 0.011
1.065MetPro: 1.065 ± 0.011
0.921MetGln: 0.921 ± 0.012
1.171MetArg: 1.171 ± 0.011
1.874MetSer: 1.874 ± 0.013
1.053MetThr: 1.053 ± 0.011
1.675MetVal: 1.675 ± 0.015
0.262MetTrp: 0.262 ± 0.006
0.626MetTyr: 0.626 ± 0.009
0.0MetXaa: 0.0 ± 0.0
Asn
2.62AsnAla: 2.62 ± 0.018
0.872AsnCys: 0.872 ± 0.009
2.286AsnAsp: 2.286 ± 0.017
2.547AsnGlu: 2.547 ± 0.018
1.949AsnPhe: 1.949 ± 0.016
3.345AsnGly: 3.345 ± 0.022
1.188AsnHis: 1.188 ± 0.012
2.493AsnIle: 2.493 ± 0.018
2.537AsnLys: 2.537 ± 0.02
4.537AsnLeu: 4.537 ± 0.034
1.126AsnMet: 1.126 ± 0.012
2.907AsnAsn: 2.907 ± 0.037
2.445AsnPro: 2.445 ± 0.019
1.759AsnGln: 1.759 ± 0.014
2.007AsnArg: 2.007 ± 0.015
4.136AsnSer: 4.136 ± 0.032
2.116AsnThr: 2.116 ± 0.018
2.942AsnVal: 2.942 ± 0.019
0.561AsnTrp: 0.561 ± 0.008
1.348AsnTyr: 1.348 ± 0.013
0.002AsnXaa: 0.002 ± 0.0
Pro
2.917ProAla: 2.917 ± 0.022
0.801ProCys: 0.801 ± 0.01
2.502ProAsp: 2.502 ± 0.017
3.134ProGlu: 3.134 ± 0.022
2.007ProPhe: 2.007 ± 0.017
2.713ProGly: 2.713 ± 0.02
1.084ProHis: 1.084 ± 0.012
2.332ProIle: 2.332 ± 0.016
2.74ProLys: 2.74 ± 0.022
4.323ProLeu: 4.323 ± 0.024
0.992ProMet: 0.992 ± 0.012
2.351ProAsn: 2.351 ± 0.017
4.133ProPro: 4.133 ± 0.068
1.863ProGln: 1.863 ± 0.018
2.339ProArg: 2.339 ± 0.017
5.244ProSer: 5.244 ± 0.029
2.755ProThr: 2.755 ± 0.022
3.152ProVal: 3.152 ± 0.025
0.582ProTrp: 0.582 ± 0.009
1.321ProTyr: 1.321 ± 0.013
0.001ProXaa: 0.001 ± 0.0
Gln
2.303GlnAla: 2.303 ± 0.015
0.564GlnCys: 0.564 ± 0.008
1.701GlnAsp: 1.701 ± 0.015
2.4GlnGlu: 2.4 ± 0.02
1.415GlnPhe: 1.415 ± 0.013
2.19GlnGly: 2.19 ± 0.018
0.978GlnHis: 0.978 ± 0.012
2.004GlnIle: 2.004 ± 0.016
2.421GlnLys: 2.421 ± 0.017
3.682GlnLeu: 3.682 ± 0.027
0.983GlnMet: 0.983 ± 0.012
1.821GlnAsn: 1.821 ± 0.016
1.862GlnPro: 1.862 ± 0.02
2.519GlnGln: 2.519 ± 0.046
2.065GlnArg: 2.065 ± 0.017
2.848GlnSer: 2.848 ± 0.022
1.761GlnThr: 1.761 ± 0.013
2.38GlnVal: 2.38 ± 0.015
0.458GlnTrp: 0.458 ± 0.006
0.961GlnTyr: 0.961 ± 0.01
0.001GlnXaa: 0.001 ± 0.0
Arg
3.085ArgAla: 3.085 ± 0.023
0.985ArgCys: 0.985 ± 0.013
2.626ArgAsp: 2.626 ± 0.02
3.332ArgGlu: 3.332 ± 0.024
2.108ArgPhe: 2.108 ± 0.015
3.22ArgGly: 3.22 ± 0.027
1.241ArgHis: 1.241 ± 0.013
2.775ArgIle: 2.775 ± 0.017
3.8ArgLys: 3.8 ± 0.023
4.825ArgLeu: 4.825 ± 0.023
1.281ArgMet: 1.281 ± 0.013
2.436ArgAsn: 2.436 ± 0.016
2.283ArgPro: 2.283 ± 0.016
1.81ArgGln: 1.81 ± 0.016
3.905ArgArg: 3.905 ± 0.029
4.303ArgSer: 4.303 ± 0.029
2.37ArgThr: 2.37 ± 0.018
3.387ArgVal: 3.387 ± 0.022
0.69ArgTrp: 0.69 ± 0.01
1.454ArgTyr: 1.454 ± 0.013
0.002ArgXaa: 0.002 ± 0.0
Ser
5.301SerAla: 5.301 ± 0.027
1.73SerCys: 1.73 ± 0.016
4.522SerAsp: 4.522 ± 0.026
4.868SerGlu: 4.868 ± 0.027
3.847SerPhe: 3.847 ± 0.023
5.945SerGly: 5.945 ± 0.031
1.976SerHis: 1.976 ± 0.016
4.571SerIle: 4.571 ± 0.025
5.064SerLys: 5.064 ± 0.026
8.752SerLeu: 8.752 ± 0.04
2.123SerMet: 2.123 ± 0.015
4.219SerAsn: 4.219 ± 0.025
4.767SerPro: 4.767 ± 0.038
3.019SerGln: 3.019 ± 0.022
4.393SerArg: 4.393 ± 0.028
11.378SerSer: 11.378 ± 0.069
4.798SerThr: 4.798 ± 0.029
5.371SerVal: 5.371 ± 0.025
1.131SerTrp: 1.131 ± 0.012
2.364SerTyr: 2.364 ± 0.019
0.003SerXaa: 0.003 ± 0.001
Thr
3.393ThrAla: 3.393 ± 0.023
0.908ThrCys: 0.908 ± 0.01
2.328ThrAsp: 2.328 ± 0.016
2.754ThrGlu: 2.754 ± 0.019
2.053ThrPhe: 2.053 ± 0.017
3.288ThrGly: 3.288 ± 0.024
1.075ThrHis: 1.075 ± 0.012
2.724ThrIle: 2.724 ± 0.02
2.722ThrLys: 2.722 ± 0.017
4.624ThrLeu: 4.624 ± 0.023
1.193ThrMet: 1.193 ± 0.012
2.185ThrAsn: 2.185 ± 0.016
2.8ThrPro: 2.8 ± 0.026
1.62ThrGln: 1.62 ± 0.014
2.369ThrArg: 2.369 ± 0.017
4.778ThrSer: 4.778 ± 0.025
3.431ThrThr: 3.431 ± 0.032
3.298ThrVal: 3.298 ± 0.018
0.612ThrTrp: 0.612 ± 0.009
1.405ThrTyr: 1.405 ± 0.014
0.001ThrXaa: 0.001 ± 0.0
Val
4.827ValAla: 4.827 ± 0.026
1.199ValCys: 1.199 ± 0.013
4.019ValAsp: 4.019 ± 0.025
4.534ValGlu: 4.534 ± 0.026
2.72ValPhe: 2.72 ± 0.019
4.247ValGly: 4.247 ± 0.031
1.513ValHis: 1.513 ± 0.013
3.529ValIle: 3.529 ± 0.024
4.066ValLys: 4.066 ± 0.027
6.245ValLeu: 6.245 ± 0.031
1.549ValMet: 1.549 ± 0.014
2.768ValAsn: 2.768 ± 0.017
3.196ValPro: 3.196 ± 0.022
2.353ValGln: 2.353 ± 0.019
3.141ValArg: 3.141 ± 0.02
5.481ValSer: 5.481 ± 0.026
3.316ValThr: 3.316 ± 0.021
5.259ValVal: 5.259 ± 0.036
0.758ValTrp: 0.758 ± 0.009
1.979ValTyr: 1.979 ± 0.018
0.002ValXaa: 0.002 ± 0.001
Trp
0.732TrpAla: 0.732 ± 0.009
0.231TrpCys: 0.231 ± 0.005
0.69TrpAsp: 0.69 ± 0.01
0.751TrpGlu: 0.751 ± 0.009
0.531TrpPhe: 0.531 ± 0.007
0.735TrpGly: 0.735 ± 0.01
0.278TrpHis: 0.278 ± 0.006
0.656TrpIle: 0.656 ± 0.01
0.923TrpLys: 0.923 ± 0.009
1.167TrpLeu: 1.167 ± 0.013
0.335TrpMet: 0.335 ± 0.006
0.679TrpAsn: 0.679 ± 0.01
0.467TrpPro: 0.467 ± 0.008
0.418TrpGln: 0.418 ± 0.007
0.783TrpArg: 0.783 ± 0.011
0.951TrpSer: 0.951 ± 0.01
0.598TrpThr: 0.598 ± 0.008
0.819TrpVal: 0.819 ± 0.01
0.231TrpTrp: 0.231 ± 0.006
0.337TrpTyr: 0.337 ± 0.006
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.798TyrAla: 1.798 ± 0.016
0.649TyrCys: 0.649 ± 0.008
1.619TyrAsp: 1.619 ± 0.015
1.597TyrGlu: 1.597 ± 0.016
1.273TyrPhe: 1.273 ± 0.012
2.156TyrGly: 2.156 ± 0.022
0.707TyrHis: 0.707 ± 0.009
1.435TyrIle: 1.435 ± 0.014
1.541TyrLys: 1.541 ± 0.016
2.723TyrLeu: 2.723 ± 0.021
0.735TyrMet: 0.735 ± 0.009
1.401TyrAsn: 1.401 ± 0.013
1.308TyrPro: 1.308 ± 0.014
1.003TyrGln: 1.003 ± 0.011
1.447TyrArg: 1.447 ± 0.013
2.313TyrSer: 2.313 ± 0.018
1.367TyrThr: 1.367 ± 0.016
1.796TyrVal: 1.796 ± 0.014
0.39TyrTrp: 0.39 ± 0.008
1.002TyrTyr: 1.002 ± 0.012
0.001TyrXaa: 0.001 ± 0.0
Xaa
0.002XaaAla: 0.002 ± 0.001
0.001XaaCys: 0.001 ± 0.0
0.002XaaAsp: 0.002 ± 0.0
0.001XaaGlu: 0.001 ± 0.0
0.001XaaPhe: 0.001 ± 0.0
0.002XaaGly: 0.002 ± 0.0
0.001XaaHis: 0.001 ± 0.0
0.002XaaIle: 0.002 ± 0.0
0.002XaaLys: 0.002 ± 0.001
0.003XaaLeu: 0.003 ± 0.001
0.001XaaMet: 0.001 ± 0.0
0.001XaaAsn: 0.001 ± 0.0
0.002XaaPro: 0.002 ± 0.0
0.001XaaGln: 0.001 ± 0.0
0.002XaaArg: 0.002 ± 0.0
0.002XaaSer: 0.002 ± 0.0
0.002XaaThr: 0.002 ± 0.001
0.002XaaVal: 0.002 ± 0.0
0.001XaaTrp: 0.001 ± 0.0
0.002XaaTyr: 0.002 ± 0.0
0.008XaaXaa: 0.008 ± 0.002
Statistics based on 23526 proteins (9276580 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski