Amino acid dipepetide frequency for Agrilus planipennis (Emerald ash borer) (Agrilus marcopoli)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.294AlaAla: 4.294 ± 0.038
1.148AlaCys: 1.148 ± 0.046
2.7AlaAsp: 2.7 ± 0.019
3.688AlaGlu: 3.688 ± 0.029
2.152AlaPhe: 2.152 ± 0.016
2.94AlaGly: 2.94 ± 0.023
1.214AlaHis: 1.214 ± 0.011
3.253AlaIle: 3.253 ± 0.02
3.553AlaLys: 3.553 ± 0.027
5.106AlaLeu: 5.106 ± 0.034
1.187AlaMet: 1.187 ± 0.014
2.669AlaAsn: 2.669 ± 0.018
2.574AlaPro: 2.574 ± 0.025
2.217AlaGln: 2.217 ± 0.018
2.506AlaArg: 2.506 ± 0.021
4.376AlaSer: 4.376 ± 0.031
3.386AlaThr: 3.386 ± 0.023
3.833AlaVal: 3.833 ± 0.025
0.494AlaTrp: 0.494 ± 0.009
1.533AlaTyr: 1.533 ± 0.013
0.002AlaXaa: 0.002 ± 0.0
Cys
1.004CysAla: 1.004 ± 0.02
0.473CysCys: 0.473 ± 0.009
1.158CysAsp: 1.158 ± 0.019
1.28CysGlu: 1.28 ± 0.024
0.827CysPhe: 0.827 ± 0.013
1.466CysGly: 1.466 ± 0.067
0.5CysHis: 0.5 ± 0.009
1.361CysIle: 1.361 ± 0.058
1.417CysLys: 1.417 ± 0.031
1.934CysLeu: 1.934 ± 0.036
0.396CysMet: 0.396 ± 0.007
1.192CysAsn: 1.192 ± 0.029
1.225CysPro: 1.225 ± 0.06
0.909CysGln: 0.909 ± 0.038
1.173CysArg: 1.173 ± 0.06
1.931CysSer: 1.931 ± 0.065
1.196CysThr: 1.196 ± 0.039
1.394CysVal: 1.394 ± 0.058
0.202CysTrp: 0.202 ± 0.004
0.642CysTyr: 0.642 ± 0.011
0.001CysXaa: 0.001 ± 0.0
Asp
2.593AspAla: 2.593 ± 0.017
1.1AspCys: 1.1 ± 0.03
3.592AspAsp: 3.592 ± 0.033
4.121AspGlu: 4.121 ± 0.03
2.286AspPhe: 2.286 ± 0.018
2.911AspGly: 2.911 ± 0.027
1.089AspHis: 1.089 ± 0.011
3.499AspIle: 3.499 ± 0.023
3.511AspLys: 3.511 ± 0.026
4.792AspLeu: 4.792 ± 0.034
1.105AspMet: 1.105 ± 0.012
2.908AspAsn: 2.908 ± 0.021
2.53AspPro: 2.53 ± 0.042
1.691AspGln: 1.691 ± 0.013
2.302AspArg: 2.302 ± 0.02
4.4AspSer: 4.4 ± 0.031
2.671AspThr: 2.671 ± 0.017
3.462AspVal: 3.462 ± 0.023
0.575AspTrp: 0.575 ± 0.009
1.882AspTyr: 1.882 ± 0.015
0.002AspXaa: 0.002 ± 0.0
Glu
3.718GluAla: 3.718 ± 0.032
1.518GluCys: 1.518 ± 0.076
4.107GluAsp: 4.107 ± 0.03
6.688GluGlu: 6.688 ± 0.079
2.224GluPhe: 2.224 ± 0.018
3.059GluGly: 3.059 ± 0.026
1.406GluHis: 1.406 ± 0.014
4.638GluIle: 4.638 ± 0.031
5.989GluLys: 5.989 ± 0.057
5.76GluLeu: 5.76 ± 0.048
1.533GluMet: 1.533 ± 0.014
4.59GluAsn: 4.59 ± 0.029
2.48GluPro: 2.48 ± 0.023
2.842GluGln: 2.842 ± 0.029
3.492GluArg: 3.492 ± 0.033
4.707GluSer: 4.707 ± 0.03
3.918GluThr: 3.918 ± 0.036
3.904GluVal: 3.904 ± 0.025
0.624GluTrp: 0.624 ± 0.009
2.022GluTyr: 2.022 ± 0.017
0.002GluXaa: 0.002 ± 0.0
Phe
2.021PheAla: 2.021 ± 0.017
0.886PheCys: 0.886 ± 0.011
2.125PheAsp: 2.125 ± 0.018
2.386PheGlu: 2.386 ± 0.019
1.656PhePhe: 1.656 ± 0.018
2.34PheGly: 2.34 ± 0.023
0.92PheHis: 0.92 ± 0.011
2.394PheIle: 2.394 ± 0.02
2.458PheLys: 2.458 ± 0.019
3.737PheLeu: 3.737 ± 0.027
0.822PheMet: 0.822 ± 0.011
2.062PheAsn: 2.062 ± 0.014
1.774PhePro: 1.774 ± 0.015
1.525PheGln: 1.525 ± 0.012
1.789PheArg: 1.789 ± 0.015
3.216PheSer: 3.216 ± 0.02
2.317PheThr: 2.317 ± 0.024
2.584PheVal: 2.584 ± 0.016
0.449PheTrp: 0.449 ± 0.008
1.386PheTyr: 1.386 ± 0.013
0.003PheXaa: 0.003 ± 0.0
Gly
2.839GlyAla: 2.839 ± 0.022
1.023GlyCys: 1.023 ± 0.025
2.855GlyAsp: 2.855 ± 0.031
3.184GlyGlu: 3.184 ± 0.03
2.287GlyPhe: 2.287 ± 0.025
3.887GlyGly: 3.887 ± 0.051
1.309GlyHis: 1.309 ± 0.015
3.324GlyIle: 3.324 ± 0.025
3.471GlyLys: 3.471 ± 0.027
4.334GlyLeu: 4.334 ± 0.033
1.082GlyMet: 1.082 ± 0.012
2.913GlyAsn: 2.913 ± 0.026
2.466GlyPro: 2.466 ± 0.037
1.969GlyGln: 1.969 ± 0.023
2.595GlyArg: 2.595 ± 0.021
4.557GlySer: 4.557 ± 0.031
3.078GlyThr: 3.078 ± 0.023
3.095GlyVal: 3.095 ± 0.022
0.606GlyTrp: 0.606 ± 0.01
1.952GlyTyr: 1.952 ± 0.023
0.001GlyXaa: 0.001 ± 0.0
His
1.105HisAla: 1.105 ± 0.01
0.576HisCys: 0.576 ± 0.009
0.992HisAsp: 0.992 ± 0.01
1.353HisGlu: 1.353 ± 0.014
1.04HisPhe: 1.04 ± 0.011
1.201HisGly: 1.201 ± 0.012
0.934HisHis: 0.934 ± 0.018
1.417HisIle: 1.417 ± 0.012
1.478HisLys: 1.478 ± 0.015
2.349HisLeu: 2.349 ± 0.017
0.597HisMet: 0.597 ± 0.008
1.219HisAsn: 1.219 ± 0.011
1.279HisPro: 1.279 ± 0.015
1.123HisGln: 1.123 ± 0.012
1.234HisArg: 1.234 ± 0.012
2.0HisSer: 2.0 ± 0.018
1.299HisThr: 1.299 ± 0.013
1.339HisVal: 1.339 ± 0.011
0.272HisTrp: 0.272 ± 0.005
0.838HisTyr: 0.838 ± 0.01
0.001HisXaa: 0.001 ± 0.0
Ile
3.311IleAla: 3.311 ± 0.021
1.473IleCys: 1.473 ± 0.043
3.166IleAsp: 3.166 ± 0.021
3.994IleGlu: 3.994 ± 0.029
2.463IlePhe: 2.463 ± 0.024
2.897IleGly: 2.897 ± 0.024
1.402IleHis: 1.402 ± 0.014
3.792IleIle: 3.792 ± 0.032
4.316IleLys: 4.316 ± 0.029
5.461IleLeu: 5.461 ± 0.034
1.176IleMet: 1.176 ± 0.013
3.485IleAsn: 3.485 ± 0.037
3.245IlePro: 3.245 ± 0.029
2.507IleGln: 2.507 ± 0.021
2.842IleArg: 2.842 ± 0.019
4.905IleSer: 4.905 ± 0.027
3.581IleThr: 3.581 ± 0.026
3.672IleVal: 3.672 ± 0.026
0.555IleTrp: 0.555 ± 0.009
1.839IleTyr: 1.839 ± 0.018
0.004IleXaa: 0.004 ± 0.001
Lys
3.498LysAla: 3.498 ± 0.026
1.602LysCys: 1.602 ± 0.045
3.76LysAsp: 3.76 ± 0.027
5.622LysGlu: 5.622 ± 0.044
2.407LysPhe: 2.407 ± 0.017
3.242LysGly: 3.242 ± 0.033
1.664LysHis: 1.664 ± 0.017
4.423LysIle: 4.423 ± 0.031
6.181LysLys: 6.181 ± 0.059
6.166LysLeu: 6.166 ± 0.046
1.549LysMet: 1.549 ± 0.014
4.098LysAsn: 4.098 ± 0.028
3.391LysPro: 3.391 ± 0.052
3.082LysGln: 3.082 ± 0.023
3.876LysArg: 3.876 ± 0.031
5.25LysSer: 5.25 ± 0.034
4.088LysThr: 4.088 ± 0.027
4.051LysVal: 4.051 ± 0.031
0.698LysTrp: 0.698 ± 0.01
2.339LysTyr: 2.339 ± 0.02
0.004LysXaa: 0.004 ± 0.001
Leu
5.123LeuAla: 5.123 ± 0.032
1.759LeuCys: 1.759 ± 0.021
4.546LeuAsp: 4.546 ± 0.029
6.406LeuGlu: 6.406 ± 0.051
3.342LeuPhe: 3.342 ± 0.029
4.289LeuGly: 4.289 ± 0.026
2.22LeuHis: 2.22 ± 0.02
4.8LeuIle: 4.8 ± 0.033
6.684LeuLys: 6.684 ± 0.043
8.474LeuLeu: 8.474 ± 0.062
1.834LeuMet: 1.834 ± 0.016
4.893LeuAsn: 4.893 ± 0.027
4.626LeuPro: 4.626 ± 0.027
4.599LeuGln: 4.599 ± 0.032
4.634LeuArg: 4.634 ± 0.029
6.971LeuSer: 6.971 ± 0.04
5.056LeuThr: 5.056 ± 0.027
4.795LeuVal: 4.795 ± 0.029
0.864LeuTrp: 0.864 ± 0.01
2.705LeuTyr: 2.705 ± 0.023
0.005LeuXaa: 0.005 ± 0.001
Met
1.36MetAla: 1.36 ± 0.013
0.423MetCys: 0.423 ± 0.007
1.176MetAsp: 1.176 ± 0.011
1.535MetGlu: 1.535 ± 0.015
0.854MetPhe: 0.854 ± 0.01
1.153MetGly: 1.153 ± 0.014
0.461MetHis: 0.461 ± 0.007
1.047MetIle: 1.047 ± 0.012
1.483MetLys: 1.483 ± 0.014
1.801MetLeu: 1.801 ± 0.018
0.516MetMet: 0.516 ± 0.008
1.054MetAsn: 1.054 ± 0.011
0.956MetPro: 0.956 ± 0.01
0.946MetGln: 0.946 ± 0.011
0.958MetArg: 0.958 ± 0.011
1.709MetSer: 1.709 ± 0.015
1.126MetThr: 1.126 ± 0.009
1.241MetVal: 1.241 ± 0.012
0.217MetTrp: 0.217 ± 0.005
0.708MetTyr: 0.708 ± 0.009
0.001MetXaa: 0.001 ± 0.0
Asn
2.965AsnAla: 2.965 ± 0.035
1.205AsnCys: 1.205 ± 0.023
2.962AsnAsp: 2.962 ± 0.022
3.758AsnGlu: 3.758 ± 0.028
2.284AsnPhe: 2.284 ± 0.022
3.104AsnGly: 3.104 ± 0.022
1.277AsnHis: 1.277 ± 0.026
3.756AsnIle: 3.756 ± 0.024
3.998AsnLys: 3.998 ± 0.027
4.957AsnLeu: 4.957 ± 0.028
1.181AsnMet: 1.181 ± 0.012
3.979AsnAsn: 3.979 ± 0.029
2.684AsnPro: 2.684 ± 0.063
2.216AsnGln: 2.216 ± 0.02
2.422AsnArg: 2.422 ± 0.016
4.969AsnSer: 4.969 ± 0.033
3.008AsnThr: 3.008 ± 0.021
3.656AsnVal: 3.656 ± 0.023
0.55AsnTrp: 0.55 ± 0.009
1.985AsnTyr: 1.985 ± 0.016
0.003AsnXaa: 0.003 ± 0.0
Pro
2.688ProAla: 2.688 ± 0.021
1.11ProCys: 1.11 ± 0.096
2.487ProAsp: 2.487 ± 0.016
3.464ProGlu: 3.464 ± 0.044
1.909ProPhe: 1.909 ± 0.027
2.913ProGly: 2.913 ± 0.061
1.175ProHis: 1.175 ± 0.013
2.795ProIle: 2.795 ± 0.026
3.33ProLys: 3.33 ± 0.036
4.187ProLeu: 4.187 ± 0.026
0.905ProMet: 0.905 ± 0.012
2.662ProAsn: 2.662 ± 0.036
4.551ProPro: 4.551 ± 0.051
2.259ProGln: 2.259 ± 0.025
2.174ProArg: 2.174 ± 0.021
4.685ProSer: 4.685 ± 0.049
3.205ProThr: 3.205 ± 0.029
3.335ProVal: 3.335 ± 0.03
0.463ProTrp: 0.463 ± 0.007
1.658ProTyr: 1.658 ± 0.017
0.002ProXaa: 0.002 ± 0.0
Gln
2.353GlnAla: 2.353 ± 0.019
0.928GlnCys: 0.928 ± 0.036
1.853GlnAsp: 1.853 ± 0.014
2.992GlnGlu: 2.992 ± 0.026
1.535GlnPhe: 1.535 ± 0.014
1.917GlnGly: 1.917 ± 0.017
1.15GlnHis: 1.15 ± 0.013
2.473GlnIle: 2.473 ± 0.019
3.105GlnLys: 3.105 ± 0.024
3.931GlnLeu: 3.931 ± 0.032
0.987GlnMet: 0.987 ± 0.011
2.667GlnAsn: 2.667 ± 0.025
2.17GlnPro: 2.17 ± 0.033
3.52GlnGln: 3.52 ± 0.087
2.247GlnArg: 2.247 ± 0.017
3.001GlnSer: 3.001 ± 0.027
2.407GlnThr: 2.407 ± 0.02
2.318GlnVal: 2.318 ± 0.018
0.443GlnTrp: 0.443 ± 0.008
1.352GlnTyr: 1.352 ± 0.014
0.001GlnXaa: 0.001 ± 0.0
Arg
2.492ArgAla: 2.492 ± 0.017
0.983ArgCys: 0.983 ± 0.022
2.447ArgAsp: 2.447 ± 0.019
3.166ArgGlu: 3.166 ± 0.025
1.791ArgPhe: 1.791 ± 0.016
2.428ArgGly: 2.428 ± 0.022
1.309ArgHis: 1.309 ± 0.013
2.845ArgIle: 2.845 ± 0.02
4.073ArgLys: 4.073 ± 0.031
4.223ArgLeu: 4.223 ± 0.032
1.045ArgMet: 1.045 ± 0.012
2.904ArgAsn: 2.904 ± 0.018
2.369ArgPro: 2.369 ± 0.036
2.08ArgGln: 2.08 ± 0.015
3.357ArgArg: 3.357 ± 0.031
3.944ArgSer: 3.944 ± 0.032
2.601ArgThr: 2.601 ± 0.017
2.514ArgVal: 2.514 ± 0.022
0.517ArgTrp: 0.517 ± 0.009
1.604ArgTyr: 1.604 ± 0.016
0.003ArgXaa: 0.003 ± 0.0
Ser
4.284SerAla: 4.284 ± 0.029
1.74SerCys: 1.74 ± 0.07
4.596SerAsp: 4.596 ± 0.029
5.289SerGlu: 5.289 ± 0.033
3.134SerPhe: 3.134 ± 0.022
4.528SerGly: 4.528 ± 0.03
1.869SerHis: 1.869 ± 0.013
4.366SerIle: 4.366 ± 0.029
5.35SerLys: 5.35 ± 0.035
7.014SerLeu: 7.014 ± 0.042
1.533SerMet: 1.533 ± 0.014
4.817SerAsn: 4.817 ± 0.03
4.914SerPro: 4.914 ± 0.054
3.406SerGln: 3.406 ± 0.027
3.774SerArg: 3.774 ± 0.03
9.091SerSer: 9.091 ± 0.066
5.17SerThr: 5.17 ± 0.03
4.927SerVal: 4.927 ± 0.028
0.783SerTrp: 0.783 ± 0.011
2.437SerTyr: 2.437 ± 0.017
0.002SerXaa: 0.002 ± 0.0
Thr
3.428ThrAla: 3.428 ± 0.022
1.302ThrCys: 1.302 ± 0.039
2.962ThrAsp: 2.962 ± 0.018
3.834ThrGlu: 3.834 ± 0.037
2.276ThrPhe: 2.276 ± 0.017
3.195ThrGly: 3.195 ± 0.026
1.247ThrHis: 1.247 ± 0.011
3.504ThrIle: 3.504 ± 0.02
3.744ThrLys: 3.744 ± 0.027
5.093ThrLeu: 5.093 ± 0.028
1.096ThrMet: 1.096 ± 0.01
3.187ThrAsn: 3.187 ± 0.019
3.432ThrPro: 3.432 ± 0.032
2.146ThrGln: 2.146 ± 0.019
2.37ThrArg: 2.37 ± 0.018
5.255ThrSer: 5.255 ± 0.032
4.294ThrThr: 4.294 ± 0.049
3.966ThrVal: 3.966 ± 0.022
0.579ThrTrp: 0.579 ± 0.009
1.733ThrTyr: 1.733 ± 0.014
0.002ThrXaa: 0.002 ± 0.0
Val
3.69ValAla: 3.69 ± 0.021
1.468ValCys: 1.468 ± 0.039
3.243ValAsp: 3.243 ± 0.019
3.897ValGlu: 3.897 ± 0.028
2.446ValPhe: 2.446 ± 0.021
2.955ValGly: 2.955 ± 0.02
1.413ValHis: 1.413 ± 0.013
3.816ValIle: 3.816 ± 0.023
3.973ValLys: 3.973 ± 0.031
5.384ValLeu: 5.384 ± 0.032
1.226ValMet: 1.226 ± 0.013
3.257ValAsn: 3.257 ± 0.03
3.391ValPro: 3.391 ± 0.036
2.551ValGln: 2.551 ± 0.017
2.737ValArg: 2.737 ± 0.02
4.714ValSer: 4.714 ± 0.021
3.857ValThr: 3.857 ± 0.025
4.111ValVal: 4.111 ± 0.024
0.622ValTrp: 0.622 ± 0.008
1.909ValTyr: 1.909 ± 0.016
0.002ValXaa: 0.002 ± 0.0
Trp
0.487TrpAla: 0.487 ± 0.007
0.206TrpCys: 0.206 ± 0.005
0.56TrpAsp: 0.56 ± 0.008
0.605TrpGlu: 0.605 ± 0.009
0.443TrpPhe: 0.443 ± 0.007
0.504TrpGly: 0.504 ± 0.009
0.226TrpHis: 0.226 ± 0.005
0.647TrpIle: 0.647 ± 0.011
0.764TrpLys: 0.764 ± 0.011
1.0TrpLeu: 1.0 ± 0.012
0.255TrpMet: 0.255 ± 0.006
0.601TrpAsn: 0.601 ± 0.009
0.394TrpPro: 0.394 ± 0.007
0.398TrpGln: 0.398 ± 0.007
0.547TrpArg: 0.547 ± 0.009
0.761TrpSer: 0.761 ± 0.011
0.567TrpThr: 0.567 ± 0.009
0.554TrpVal: 0.554 ± 0.009
0.165TrpTrp: 0.165 ± 0.005
0.347TrpTyr: 0.347 ± 0.006
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.572TyrAla: 1.572 ± 0.013
0.757TyrCys: 0.757 ± 0.011
1.732TyrAsp: 1.732 ± 0.015
1.958TyrGlu: 1.958 ± 0.013
1.489TyrPhe: 1.489 ± 0.013
1.896TyrGly: 1.896 ± 0.018
0.858TyrHis: 0.858 ± 0.009
1.929TyrIle: 1.929 ± 0.015
2.108TyrLys: 2.108 ± 0.015
2.918TyrLeu: 2.918 ± 0.021
0.7TyrMet: 0.7 ± 0.008
1.76TyrAsn: 1.76 ± 0.014
1.504TyrPro: 1.504 ± 0.016
1.364TyrGln: 1.364 ± 0.014
1.663TyrArg: 1.663 ± 0.015
2.582TyrSer: 2.582 ± 0.018
1.819TyrThr: 1.819 ± 0.014
1.91TyrVal: 1.91 ± 0.015
0.353TyrTrp: 0.353 ± 0.007
1.298TyrTyr: 1.298 ± 0.015
0.003TyrXaa: 0.003 ± 0.001
Xaa
0.002XaaAla: 0.002 ± 0.0
0.001XaaCys: 0.001 ± 0.0
0.002XaaAsp: 0.002 ± 0.0
0.002XaaGlu: 0.002 ± 0.0
0.003XaaPhe: 0.003 ± 0.001
0.001XaaGly: 0.001 ± 0.0
0.001XaaHis: 0.001 ± 0.0
0.005XaaIle: 0.005 ± 0.001
0.004XaaLys: 0.004 ± 0.001
0.004XaaLeu: 0.004 ± 0.001
0.002XaaMet: 0.002 ± 0.0
0.003XaaAsn: 0.003 ± 0.001
0.001XaaPro: 0.001 ± 0.0
0.001XaaGln: 0.001 ± 0.0
0.002XaaArg: 0.002 ± 0.0
0.002XaaSer: 0.002 ± 0.0
0.003XaaThr: 0.003 ± 0.001
0.002XaaVal: 0.002 ± 0.0
0.001XaaTrp: 0.001 ± 0.0
0.002XaaTyr: 0.002 ± 0.001
0.002XaaXaa: 0.002 ± 0.001
Statistics based on 18772 proteins (11197405 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski