Amino acid dipepetide frequency for Manihot esculenta (Cassava) (Jatropha manihot)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.083AlaAla: 6.083 ± 0.031
1.286AlaCys: 1.286 ± 0.011
3.057AlaAsp: 3.057 ± 0.015
4.209AlaGlu: 4.209 ± 0.022
2.758AlaPhe: 2.758 ± 0.014
3.946AlaGly: 3.946 ± 0.021
1.286AlaHis: 1.286 ± 0.009
3.92AlaIle: 3.92 ± 0.019
3.991AlaLys: 3.991 ± 0.02
6.508AlaLeu: 6.508 ± 0.026
1.746AlaMet: 1.746 ± 0.012
2.702AlaAsn: 2.702 ± 0.015
2.657AlaPro: 2.657 ± 0.016
2.113AlaGln: 2.113 ± 0.015
3.265AlaArg: 3.265 ± 0.016
6.087AlaSer: 6.087 ± 0.022
3.57AlaThr: 3.57 ± 0.017
4.656AlaVal: 4.656 ± 0.016
0.764AlaTrp: 0.764 ± 0.008
1.822AlaTyr: 1.822 ± 0.013
0.0AlaXaa: 0.0 ± 0.0
Cys
1.002CysAla: 1.002 ± 0.008
0.551CysCys: 0.551 ± 0.007
0.867CysAsp: 0.867 ± 0.009
0.947CysGlu: 0.947 ± 0.008
0.941CysPhe: 0.941 ± 0.007
1.377CysGly: 1.377 ± 0.011
0.479CysHis: 0.479 ± 0.005
1.076CysIle: 1.076 ± 0.009
1.258CysLys: 1.258 ± 0.012
1.918CysLeu: 1.918 ± 0.013
0.469CysMet: 0.469 ± 0.005
0.924CysAsn: 0.924 ± 0.009
0.971CysPro: 0.971 ± 0.009
0.631CysGln: 0.631 ± 0.006
1.029CysArg: 1.029 ± 0.01
1.918CysSer: 1.918 ± 0.013
0.865CysThr: 0.865 ± 0.008
1.033CysVal: 1.033 ± 0.009
0.254CysTrp: 0.254 ± 0.004
0.547CysTyr: 0.547 ± 0.006
0.0CysXaa: 0.0 ± 0.0
Asp
3.393AspAla: 3.393 ± 0.016
0.991AspCys: 0.991 ± 0.008
3.474AspAsp: 3.474 ± 0.02
3.99AspGlu: 3.99 ± 0.025
2.34AspPhe: 2.34 ± 0.013
3.677AspGly: 3.677 ± 0.017
1.245AspHis: 1.245 ± 0.01
3.084AspIle: 3.084 ± 0.016
2.828AspLys: 2.828 ± 0.016
5.059AspLeu: 5.059 ± 0.02
1.302AspMet: 1.302 ± 0.009
2.186AspAsn: 2.186 ± 0.012
2.539AspPro: 2.539 ± 0.014
1.725AspGln: 1.725 ± 0.012
2.277AspArg: 2.277 ± 0.015
4.275AspSer: 4.275 ± 0.02
2.155AspThr: 2.155 ± 0.012
3.538AspVal: 3.538 ± 0.015
0.697AspTrp: 0.697 ± 0.008
1.537AspTyr: 1.537 ± 0.009
0.0AspXaa: 0.0 ± 0.0
Glu
4.765GluAla: 4.765 ± 0.023
0.938GluCys: 0.938 ± 0.008
4.038GluAsp: 4.038 ± 0.024
6.48GluGlu: 6.48 ± 0.061
2.376GluPhe: 2.376 ± 0.012
3.739GluGly: 3.739 ± 0.017
1.266GluHis: 1.266 ± 0.012
4.092GluIle: 4.092 ± 0.032
4.94GluLys: 4.94 ± 0.028
6.096GluLeu: 6.096 ± 0.023
1.885GluMet: 1.885 ± 0.013
3.394GluAsn: 3.394 ± 0.024
2.097GluPro: 2.097 ± 0.014
2.223GluGln: 2.223 ± 0.015
3.384GluArg: 3.384 ± 0.02
4.725GluSer: 4.725 ± 0.024
3.171GluThr: 3.171 ± 0.028
4.111GluVal: 4.111 ± 0.021
0.714GluTrp: 0.714 ± 0.008
1.632GluTyr: 1.632 ± 0.011
0.0GluXaa: 0.0 ± 0.0
Phe
2.546PheAla: 2.546 ± 0.013
0.957PheCys: 0.957 ± 0.008
2.322PheAsp: 2.322 ± 0.012
2.27PheGlu: 2.27 ± 0.013
2.094PhePhe: 2.094 ± 0.013
3.028PheGly: 3.028 ± 0.018
1.149PheHis: 1.149 ± 0.008
2.197PheIle: 2.197 ± 0.015
2.144PheLys: 2.144 ± 0.013
4.469PheLeu: 4.469 ± 0.02
0.98PheMet: 0.98 ± 0.009
1.837PheAsn: 1.837 ± 0.013
2.119PhePro: 2.119 ± 0.014
1.651PheGln: 1.651 ± 0.01
1.998PheArg: 1.998 ± 0.013
4.335PheSer: 4.335 ± 0.02
2.02PheThr: 2.02 ± 0.012
2.64PheVal: 2.64 ± 0.013
0.564PheTrp: 0.564 ± 0.007
1.296PheTyr: 1.296 ± 0.011
0.0PheXaa: 0.0 ± 0.0
Gly
3.705GlyAla: 3.705 ± 0.017
1.314GlyCys: 1.314 ± 0.012
3.296GlyAsp: 3.296 ± 0.017
3.66GlyGlu: 3.66 ± 0.02
3.151GlyPhe: 3.151 ± 0.018
4.917GlyGly: 4.917 ± 0.029
1.519GlyHis: 1.519 ± 0.011
3.729GlyIle: 3.729 ± 0.019
4.117GlyLys: 4.117 ± 0.02
5.931GlyLeu: 5.931 ± 0.021
1.493GlyMet: 1.493 ± 0.011
3.265GlyAsn: 3.265 ± 0.018
2.357GlyPro: 2.357 ± 0.017
2.077GlyGln: 2.077 ± 0.013
3.452GlyArg: 3.452 ± 0.018
5.951GlySer: 5.951 ± 0.023
3.187GlyThr: 3.187 ± 0.015
4.002GlyVal: 4.002 ± 0.018
0.859GlyTrp: 0.859 ± 0.009
2.051GlyTyr: 2.051 ± 0.012
0.0GlyXaa: 0.0 ± 0.0
His
1.389HisAla: 1.389 ± 0.011
0.535HisCys: 0.535 ± 0.006
1.131HisAsp: 1.131 ± 0.009
1.386HisGlu: 1.386 ± 0.015
1.103HisPhe: 1.103 ± 0.009
1.712HisGly: 1.712 ± 0.012
0.958HisHis: 0.958 ± 0.011
1.254HisIle: 1.254 ± 0.009
1.246HisLys: 1.246 ± 0.011
2.527HisLeu: 2.527 ± 0.013
0.553HisMet: 0.553 ± 0.005
1.021HisAsn: 1.021 ± 0.009
1.345HisPro: 1.345 ± 0.011
1.105HisGln: 1.105 ± 0.011
1.366HisArg: 1.366 ± 0.011
2.014HisSer: 2.014 ± 0.012
0.946HisThr: 0.946 ± 0.009
1.501HisVal: 1.501 ± 0.01
0.297HisTrp: 0.297 ± 0.004
0.711HisTyr: 0.711 ± 0.007
0.0HisXaa: 0.0 ± 0.0
Ile
3.683IleAla: 3.683 ± 0.02
1.153IleCys: 1.153 ± 0.01
2.968IleAsp: 2.968 ± 0.014
3.331IleGlu: 3.331 ± 0.016
2.462IlePhe: 2.462 ± 0.015
3.485IleGly: 3.485 ± 0.017
1.381IleHis: 1.381 ± 0.01
3.031IleIle: 3.031 ± 0.017
3.111IleLys: 3.111 ± 0.018
5.516IleLeu: 5.516 ± 0.022
1.223IleMet: 1.223 ± 0.009
2.383IleAsn: 2.383 ± 0.015
3.102IlePro: 3.102 ± 0.02
2.084IleGln: 2.084 ± 0.012
2.612IleArg: 2.612 ± 0.015
5.238IleSer: 5.238 ± 0.023
2.64IleThr: 2.64 ± 0.016
3.505IleVal: 3.505 ± 0.017
0.738IleTrp: 0.738 ± 0.009
1.551IleTyr: 1.551 ± 0.011
0.0IleXaa: 0.0 ± 0.0
Lys
4.053LysAla: 4.053 ± 0.019
1.021LysCys: 1.021 ± 0.01
3.295LysAsp: 3.295 ± 0.018
4.977LysGlu: 4.977 ± 0.041
2.265LysPhe: 2.265 ± 0.014
3.677LysGly: 3.677 ± 0.015
1.408LysHis: 1.408 ± 0.012
3.433LysIle: 3.433 ± 0.015
4.796LysLys: 4.796 ± 0.031
6.162LysLeu: 6.162 ± 0.026
1.557LysMet: 1.557 ± 0.01
2.81LysAsn: 2.81 ± 0.015
2.716LysPro: 2.716 ± 0.014
2.402LysGln: 2.402 ± 0.014
3.572LysArg: 3.572 ± 0.018
4.745LysSer: 4.745 ± 0.021
2.834LysThr: 2.834 ± 0.015
3.792LysVal: 3.792 ± 0.019
0.789LysTrp: 0.789 ± 0.007
1.656LysTyr: 1.656 ± 0.012
0.0LysXaa: 0.0 ± 0.0
Leu
6.565LeuAla: 6.565 ± 0.025
1.91LeuCys: 1.91 ± 0.012
5.142LeuAsp: 5.142 ± 0.021
6.542LeuGlu: 6.542 ± 0.027
3.944LeuPhe: 3.944 ± 0.02
5.718LeuGly: 5.718 ± 0.024
2.721LeuHis: 2.721 ± 0.015
4.826LeuIle: 4.826 ± 0.02
6.361LeuLys: 6.361 ± 0.023
10.067LeuLeu: 10.067 ± 0.039
2.175LeuMet: 2.175 ± 0.012
4.124LeuAsn: 4.124 ± 0.017
5.219LeuPro: 5.219 ± 0.025
4.531LeuGln: 4.531 ± 0.02
5.232LeuArg: 5.232 ± 0.02
8.839LeuSer: 8.839 ± 0.032
4.367LeuThr: 4.367 ± 0.018
6.207LeuVal: 6.207 ± 0.024
1.17LeuTrp: 1.17 ± 0.008
2.533LeuTyr: 2.533 ± 0.015
0.0LeuXaa: 0.0 ± 0.0
Met
2.113MetAla: 2.113 ± 0.013
0.344MetCys: 0.344 ± 0.005
1.454MetAsp: 1.454 ± 0.01
2.065MetGlu: 2.065 ± 0.014
0.79MetPhe: 0.79 ± 0.007
1.655MetGly: 1.655 ± 0.013
0.569MetHis: 0.569 ± 0.006
1.257MetIle: 1.257 ± 0.011
1.686MetLys: 1.686 ± 0.012
2.18MetLeu: 2.18 ± 0.013
0.641MetMet: 0.641 ± 0.006
1.05MetAsn: 1.05 ± 0.008
1.077MetPro: 1.077 ± 0.009
0.968MetGln: 0.968 ± 0.01
1.18MetArg: 1.18 ± 0.009
1.744MetSer: 1.744 ± 0.012
0.995MetThr: 0.995 ± 0.008
1.635MetVal: 1.635 ± 0.011
0.264MetTrp: 0.264 ± 0.005
0.561MetTyr: 0.561 ± 0.007
0.0MetXaa: 0.0 ± 0.0
Asn
2.711AsnAla: 2.711 ± 0.012
0.977AsnCys: 0.977 ± 0.008
2.145AsnAsp: 2.145 ± 0.014
2.732AsnGlu: 2.732 ± 0.021
2.088AsnPhe: 2.088 ± 0.012
3.401AsnGly: 3.401 ± 0.016
1.16AsnHis: 1.16 ± 0.009
2.653AsnIle: 2.653 ± 0.015
2.499AsnLys: 2.499 ± 0.014
5.023AsnLeu: 5.023 ± 0.026
1.124AsnMet: 1.124 ± 0.009
2.455AsnAsn: 2.455 ± 0.017
2.389AsnPro: 2.389 ± 0.013
1.833AsnGln: 1.833 ± 0.012
2.002AsnArg: 2.002 ± 0.012
4.191AsnSer: 4.191 ± 0.021
1.959AsnThr: 1.959 ± 0.012
2.82AsnVal: 2.82 ± 0.016
0.579AsnTrp: 0.579 ± 0.006
1.349AsnTyr: 1.349 ± 0.01
0.0AsnXaa: 0.0 ± 0.0
Pro
2.993ProAla: 2.993 ± 0.015
0.794ProCys: 0.794 ± 0.007
2.421ProAsp: 2.421 ± 0.014
3.132ProGlu: 3.132 ± 0.015
1.998ProPhe: 1.998 ± 0.012
2.611ProGly: 2.611 ± 0.016
1.103ProHis: 1.103 ± 0.008
2.377ProIle: 2.377 ± 0.013
2.71ProLys: 2.71 ± 0.014
4.263ProLeu: 4.263 ± 0.016
0.974ProMet: 0.974 ± 0.008
2.315ProAsn: 2.315 ± 0.015
3.609ProPro: 3.609 ± 0.034
1.863ProGln: 1.863 ± 0.013
2.278ProArg: 2.278 ± 0.014
5.16ProSer: 5.16 ± 0.024
2.541ProThr: 2.541 ± 0.015
3.022ProVal: 3.022 ± 0.017
0.592ProTrp: 0.592 ± 0.007
1.307ProTyr: 1.307 ± 0.01
0.0ProXaa: 0.0 ± 0.0
Gln
2.438GlnAla: 2.438 ± 0.011
0.58GlnCys: 0.58 ± 0.007
1.682GlnAsp: 1.682 ± 0.01
2.604GlnGlu: 2.604 ± 0.02
1.458GlnPhe: 1.458 ± 0.009
2.146GlnGly: 2.146 ± 0.013
0.923GlnHis: 0.923 ± 0.008
2.149GlnIle: 2.149 ± 0.012
2.431GlnLys: 2.431 ± 0.014
3.824GlnLeu: 3.824 ± 0.019
1.001GlnMet: 1.001 ± 0.01
1.972GlnAsn: 1.972 ± 0.014
1.779GlnPro: 1.779 ± 0.013
2.217GlnGln: 2.217 ± 0.031
2.087GlnArg: 2.087 ± 0.013
2.942GlnSer: 2.942 ± 0.017
1.675GlnThr: 1.675 ± 0.011
2.382GlnVal: 2.382 ± 0.013
0.462GlnTrp: 0.462 ± 0.006
0.937GlnTyr: 0.937 ± 0.008
0.0GlnXaa: 0.0 ± 0.0
Arg
3.071ArgAla: 3.071 ± 0.019
0.952ArgCys: 0.952 ± 0.009
2.588ArgAsp: 2.588 ± 0.016
3.358ArgGlu: 3.358 ± 0.019
2.171ArgPhe: 2.171 ± 0.013
3.013ArgGly: 3.013 ± 0.016
1.217ArgHis: 1.217 ± 0.01
2.889ArgIle: 2.889 ± 0.013
3.759ArgLys: 3.759 ± 0.02
4.89ArgLeu: 4.89 ± 0.022
1.313ArgMet: 1.313 ± 0.01
2.447ArgAsn: 2.447 ± 0.014
2.246ArgPro: 2.246 ± 0.014
1.823ArgGln: 1.823 ± 0.013
3.726ArgArg: 3.726 ± 0.022
4.284ArgSer: 4.284 ± 0.023
2.361ArgThr: 2.361 ± 0.013
3.085ArgVal: 3.085 ± 0.017
0.714ArgTrp: 0.714 ± 0.007
1.41ArgTyr: 1.41 ± 0.011
0.0ArgXaa: 0.0 ± 0.0
Ser
5.406SerAla: 5.406 ± 0.02
1.796SerCys: 1.796 ± 0.011
4.344SerAsp: 4.344 ± 0.019
4.882SerGlu: 4.882 ± 0.021
4.169SerPhe: 4.169 ± 0.02
5.858SerGly: 5.858 ± 0.022
2.134SerHis: 2.134 ± 0.013
4.844SerIle: 4.844 ± 0.02
5.096SerLys: 5.096 ± 0.022
9.101SerLeu: 9.101 ± 0.031
2.143SerMet: 2.143 ± 0.012
4.38SerAsn: 4.38 ± 0.021
4.603SerPro: 4.603 ± 0.024
3.137SerGln: 3.137 ± 0.015
4.433SerArg: 4.433 ± 0.021
11.336SerSer: 11.336 ± 0.045
4.692SerThr: 4.692 ± 0.019
5.178SerVal: 5.178 ± 0.021
1.169SerTrp: 1.169 ± 0.01
2.361SerTyr: 2.361 ± 0.014
0.0SerXaa: 0.0 ± 0.0
Thr
3.385ThrAla: 3.385 ± 0.016
0.937ThrCys: 0.937 ± 0.008
2.241ThrAsp: 2.241 ± 0.012
2.826ThrGlu: 2.826 ± 0.021
2.018ThrPhe: 2.018 ± 0.013
3.251ThrGly: 3.251 ± 0.018
1.046ThrHis: 1.046 ± 0.009
2.775ThrIle: 2.775 ± 0.016
2.602ThrLys: 2.602 ± 0.016
4.518ThrLeu: 4.518 ± 0.017
1.096ThrMet: 1.096 ± 0.009
2.114ThrAsn: 2.114 ± 0.014
2.417ThrPro: 2.417 ± 0.015
1.572ThrGln: 1.572 ± 0.01
2.26ThrArg: 2.26 ± 0.013
4.649ThrSer: 4.649 ± 0.02
2.837ThrThr: 2.837 ± 0.018
3.259ThrVal: 3.259 ± 0.016
0.628ThrTrp: 0.628 ± 0.007
1.339ThrTyr: 1.339 ± 0.012
0.0ThrXaa: 0.0 ± 0.0
Val
4.632ValAla: 4.632 ± 0.02
1.118ValCys: 1.118 ± 0.009
3.686ValAsp: 3.686 ± 0.017
4.34ValGlu: 4.34 ± 0.023
2.628ValPhe: 2.628 ± 0.014
3.995ValGly: 3.995 ± 0.02
1.51ValHis: 1.51 ± 0.011
3.418ValIle: 3.418 ± 0.017
3.866ValLys: 3.866 ± 0.017
6.159ValLeu: 6.159 ± 0.024
1.47ValMet: 1.47 ± 0.01
2.647ValAsn: 2.647 ± 0.013
3.094ValPro: 3.094 ± 0.017
2.297ValGln: 2.297 ± 0.013
2.905ValArg: 2.905 ± 0.016
5.363ValSer: 5.363 ± 0.021
3.079ValThr: 3.079 ± 0.016
4.55ValVal: 4.55 ± 0.02
0.733ValTrp: 0.733 ± 0.008
1.902ValTyr: 1.902 ± 0.012
0.0ValXaa: 0.0 ± 0.0
Trp
0.747TrpAla: 0.747 ± 0.007
0.232TrpCys: 0.232 ± 0.004
0.689TrpAsp: 0.689 ± 0.008
0.745TrpGlu: 0.745 ± 0.008
0.556TrpPhe: 0.556 ± 0.007
0.702TrpGly: 0.702 ± 0.008
0.3TrpHis: 0.3 ± 0.005
0.72TrpIle: 0.72 ± 0.007
0.955TrpLys: 0.955 ± 0.008
1.219TrpLeu: 1.219 ± 0.009
0.339TrpMet: 0.339 ± 0.005
0.715TrpAsn: 0.715 ± 0.007
0.488TrpPro: 0.488 ± 0.007
0.459TrpGln: 0.459 ± 0.006
0.818TrpArg: 0.818 ± 0.007
0.96TrpSer: 0.96 ± 0.01
0.62TrpThr: 0.62 ± 0.007
0.788TrpVal: 0.788 ± 0.008
0.233TrpTrp: 0.233 ± 0.004
0.334TrpTyr: 0.334 ± 0.005
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.744TyrAla: 1.744 ± 0.011
0.67TyrCys: 0.67 ± 0.007
1.472TyrAsp: 1.472 ± 0.01
1.587TyrGlu: 1.587 ± 0.011
1.307TyrPhe: 1.307 ± 0.01
2.123TyrGly: 2.123 ± 0.014
0.728TyrHis: 0.728 ± 0.008
1.471TyrIle: 1.471 ± 0.011
1.561TyrLys: 1.561 ± 0.012
2.765TyrLeu: 2.765 ± 0.018
0.745TyrMet: 0.745 ± 0.007
1.349TyrAsn: 1.349 ± 0.01
1.241TyrPro: 1.241 ± 0.009
0.976TyrGln: 0.976 ± 0.008
1.444TyrArg: 1.444 ± 0.009
2.306TyrSer: 2.306 ± 0.014
1.259TyrThr: 1.259 ± 0.01
1.687TyrVal: 1.687 ± 0.011
0.4TyrTrp: 0.4 ± 0.006
0.967TyrTyr: 0.967 ± 0.009
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 37733 proteins (15226168 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski