Amino acid dipeptide frequency for Diabrotica virgifera virgifera (western corn rootworm)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
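As a minimal sketch of what such a calculation might look like, the snippet below counts overlapping dipeptides within each protein and reports them in per mille. The function name `dipeptide_per_mille`, the toy sequences, the use of one-letter codes, and the choice to normalize by the total number of dipeptides are assumptions made for illustration; the page does not describe the exact procedure used by Proteome-pI.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all given proteins.

    Assumption: dipeptides are overlapping windows of length 2 and are
    never counted across the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences (one-letter codes), not taken from the proteome.
toy_proteins = ["MKLLA", "MALLK"]
freqs = dipeptide_per_mille(toy_proteins)
print(freqs["LL"])  # 2 of the 8 dipeptides are LL
```

The table below uses three-letter codes (for example LeuLeu instead of LL); mapping between the two notations is left out to keep the sketch short.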

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones in blue, in the table.
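The comparison against the uniform expectation can be sketched as follows. The threshold of 1000/400 = 2.5 per mille follows from the 0.25% figure above; treating exactly this value as the cut-off for the red/blue marking is an assumption, and `classify` is a hypothetical helper name. The observed values are copied from the table on this page.

```python
# Uniform expectation over the 20 x 20 standard dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def classify(observed_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide relative to the uniform expectation."""
    if observed_per_mille > expected:
        return "over"    # corresponds to red in the original table
    if observed_per_mille < expected:
        return "under"   # corresponds to blue in the original table
    return "expected"


# Values taken from the table (per mille).
observed = {"LeuLeu": 8.002, "SerSer": 8.251, "TrpTrp": 0.145, "CysCys": 0.479}
for pair, value in observed.items():
    # The ratio shows how many times more (or less) frequent the pair is
    # than under the uniform model.
    print(pair, classify(value), round(value / EXPECTED_PER_MILLE, 2))
```

A uniform model ignores the fact that single amino acids are themselves unequally frequent, so a large part of the deviation reflects residue composition rather than a specific preference for a pair.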

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
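For readers who want fractions or percentages, the unit conversion is a one-liner. The helper names below are hypothetical, and the AlaAla value is taken from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala = 4.012  # AlaAla frequency from the table, in per mille
print(per_mille_to_fraction(ala_ala))
print(per_mille_to_percent(ala_ala))
```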

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.012 ± 0.039
AlaCys: 0.978 ± 0.015
AlaAsp: 2.801 ± 0.019
AlaGlu: 3.772 ± 0.035
AlaPhe: 2.044 ± 0.015
AlaGly: 2.866 ± 0.023
AlaHis: 1.251 ± 0.014
AlaIle: 3.171 ± 0.023
AlaLys: 3.674 ± 0.033
AlaLeu: 4.901 ± 0.03
AlaMet: 1.161 ± 0.014
AlaAsn: 2.52 ± 0.018
AlaPro: 2.445 ± 0.025
AlaGln: 2.147 ± 0.022
AlaArg: 2.347 ± 0.02
AlaSer: 3.966 ± 0.021
AlaThr: 3.257 ± 0.023
AlaVal: 3.631 ± 0.026
AlaTrp: 0.451 ± 0.007
AlaTyr: 1.539 ± 0.014
AlaXaa: 0.001 ± 0.0
Cys
CysAla: 0.961 ± 0.013
CysCys: 0.479 ± 0.008
CysAsp: 1.344 ± 0.021
CysGlu: 2.07 ± 0.034
CysPhe: 1.203 ± 0.02
CysGly: 1.183 ± 0.021
CysHis: 0.474 ± 0.007
CysIle: 1.224 ± 0.021
CysLys: 1.48 ± 0.016
CysLeu: 1.889 ± 0.02
CysMet: 0.383 ± 0.006
CysAsn: 1.145 ± 0.018
CysPro: 1.102 ± 0.028
CysGln: 0.864 ± 0.018
CysArg: 0.962 ± 0.021
CysSer: 1.8 ± 0.023
CysThr: 1.175 ± 0.018
CysVal: 1.143 ± 0.016
CysTrp: 0.185 ± 0.004
CysTyr: 0.6 ± 0.008
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.635 ± 0.021
AspCys: 1.089 ± 0.019
AspAsp: 3.682 ± 0.028
AspGlu: 4.264 ± 0.03
AspPhe: 2.31 ± 0.018
AspGly: 2.927 ± 0.03
AspHis: 1.19 ± 0.013
AspIle: 4.055 ± 0.021
AspLys: 3.696 ± 0.027
AspLeu: 5.016 ± 0.028
AspMet: 1.191 ± 0.009
AspAsn: 3.071 ± 0.028
AspPro: 2.428 ± 0.027
AspGln: 1.885 ± 0.016
AspArg: 2.347 ± 0.022
AspSer: 4.278 ± 0.031
AspThr: 2.973 ± 0.023
AspVal: 3.455 ± 0.025
AspTrp: 0.586 ± 0.01
AspTyr: 1.902 ± 0.016
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.87 ± 0.033
GluCys: 1.333 ± 0.029
GluAsp: 4.444 ± 0.026
GluGlu: 6.665 ± 0.058
GluPhe: 2.316 ± 0.013
GluGly: 2.995 ± 0.025
GluHis: 1.529 ± 0.013
GluIle: 5.302 ± 0.041
GluLys: 6.769 ± 0.054
GluLeu: 5.62 ± 0.035
GluMet: 1.561 ± 0.014
GluAsn: 4.454 ± 0.03
GluPro: 2.652 ± 0.036
GluGln: 2.803 ± 0.023
GluArg: 3.377 ± 0.033
GluSer: 4.55 ± 0.028
GluThr: 4.067 ± 0.043
GluVal: 4.124 ± 0.03
GluTrp: 0.561 ± 0.008
GluTyr: 2.143 ± 0.017
GluXaa: 0.001 ± 0.0
Phe
PheAla: 1.997 ± 0.019
PheCys: 0.861 ± 0.011
PheAsp: 2.137 ± 0.018
PheGlu: 2.401 ± 0.016
PhePhe: 1.704 ± 0.017
PheGly: 2.275 ± 0.023
PheHis: 0.911 ± 0.01
PheIle: 2.461 ± 0.02
PheLys: 3.134 ± 0.028
PheLeu: 3.699 ± 0.025
PheMet: 0.869 ± 0.009
PheAsn: 2.165 ± 0.015
PhePro: 1.634 ± 0.013
PheGln: 1.516 ± 0.013
PheArg: 1.761 ± 0.014
PheSer: 3.589 ± 0.026
PheThr: 2.427 ± 0.024
PheVal: 2.415 ± 0.017
PheTrp: 0.438 ± 0.007
PheTyr: 1.39 ± 0.013
PheXaa: 0.001 ± 0.0
Gly
GlyAla: 2.715 ± 0.027
GlyCys: 0.947 ± 0.015
GlyAsp: 2.733 ± 0.022
GlyGlu: 3.727 ± 0.04
GlyPhe: 2.143 ± 0.022
GlyGly: 3.598 ± 0.054
GlyHis: 1.364 ± 0.02
GlyIle: 3.124 ± 0.021
GlyLys: 3.551 ± 0.025
GlyLeu: 4.081 ± 0.028
GlyMet: 1.063 ± 0.012
GlyAsn: 2.879 ± 0.051
GlyPro: 2.126 ± 0.028
GlyGln: 1.958 ± 0.02
GlyArg: 2.434 ± 0.026
GlySer: 4.163 ± 0.045
GlyThr: 2.973 ± 0.027
GlyVal: 3.021 ± 0.02
GlyTrp: 0.604 ± 0.008
GlyTyr: 1.96 ± 0.02
GlyXaa: 0.001 ± 0.0
His
HisAla: 1.065 ± 0.01
HisCys: 0.522 ± 0.007
HisAsp: 1.022 ± 0.012
HisGlu: 1.312 ± 0.012
HisPhe: 1.038 ± 0.011
HisGly: 1.216 ± 0.016
HisHis: 0.805 ± 0.014
HisIle: 1.572 ± 0.012
HisLys: 1.738 ± 0.018
HisLeu: 2.828 ± 0.027
HisMet: 0.939 ± 0.02
HisAsn: 1.234 ± 0.012
HisPro: 1.173 ± 0.012
HisGln: 1.002 ± 0.01
HisArg: 1.143 ± 0.01
HisSer: 1.965 ± 0.019
HisThr: 2.161 ± 0.035
HisVal: 1.298 ± 0.011
HisTrp: 0.265 ± 0.004
HisTyr: 0.832 ± 0.009
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.15 ± 0.019
IleCys: 2.272 ± 0.034
IleAsp: 3.382 ± 0.028
IleGlu: 4.149 ± 0.029
IlePhe: 2.585 ± 0.021
IleGly: 2.797 ± 0.021
IleHis: 1.653 ± 0.017
IleIle: 3.806 ± 0.024
IleLys: 4.651 ± 0.026
IleLeu: 5.527 ± 0.033
IleMet: 1.281 ± 0.013
IleAsn: 3.405 ± 0.025
IlePro: 3.113 ± 0.021
IleGln: 2.576 ± 0.017
IleArg: 2.671 ± 0.02
IleSer: 4.867 ± 0.029
IleThr: 3.63 ± 0.023
IleVal: 3.627 ± 0.022
IleTrp: 0.565 ± 0.009
IleTyr: 1.988 ± 0.017
IleXaa: 0.001 ± 0.0
Lys
LysAla: 3.617 ± 0.023
LysCys: 2.205 ± 0.032
LysAsp: 4.091 ± 0.028
LysGlu: 5.864 ± 0.039
LysPhe: 2.588 ± 0.019
LysGly: 3.124 ± 0.035
LysHis: 1.873 ± 0.017
LysIle: 4.841 ± 0.025
LysLys: 6.825 ± 0.072
LysLeu: 6.33 ± 0.037
LysMet: 1.646 ± 0.014
LysAsn: 4.198 ± 0.025
LysPro: 3.97 ± 0.039
LysGln: 3.696 ± 0.031
LysArg: 3.88 ± 0.025
LysSer: 5.515 ± 0.035
LysThr: 4.385 ± 0.029
LysVal: 4.542 ± 0.052
LysTrp: 0.702 ± 0.01
LysTyr: 2.574 ± 0.02
LysXaa: 0.001 ± 0.0
Leu
LeuAla: 4.798 ± 0.023
LeuCys: 1.655 ± 0.014
LeuAsp: 4.827 ± 0.027
LeuGlu: 6.261 ± 0.051
LeuPhe: 3.292 ± 0.027
LeuGly: 4.229 ± 0.027
LeuHis: 2.215 ± 0.019
LeuIle: 4.774 ± 0.028
LeuLys: 7.623 ± 0.042
LeuLeu: 8.002 ± 0.045
LeuMet: 1.829 ± 0.012
LeuAsn: 4.818 ± 0.023
LeuPro: 4.325 ± 0.027
LeuGln: 4.223 ± 0.028
LeuArg: 4.662 ± 0.03
LeuSer: 6.674 ± 0.032
LeuThr: 4.946 ± 0.022
LeuVal: 4.902 ± 0.029
LeuTrp: 0.777 ± 0.009
LeuTyr: 2.641 ± 0.018
LeuXaa: 0.002 ± 0.001
Met
MetAla: 1.445 ± 0.019
MetCys: 0.424 ± 0.006
MetAsp: 1.263 ± 0.011
MetGlu: 1.677 ± 0.02
MetPhe: 0.886 ± 0.01
MetGly: 1.077 ± 0.012
MetHis: 0.491 ± 0.008
MetIle: 1.101 ± 0.011
MetLys: 1.623 ± 0.013
MetLeu: 1.788 ± 0.015
MetMet: 0.576 ± 0.01
MetAsn: 1.049 ± 0.011
MetPro: 0.925 ± 0.012
MetGln: 0.872 ± 0.011
MetArg: 1.091 ± 0.014
MetSer: 1.665 ± 0.015
MetThr: 1.189 ± 0.01
MetVal: 1.302 ± 0.015
MetTrp: 0.223 ± 0.004
MetTyr: 0.731 ± 0.008
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.599 ± 0.021
AsnCys: 1.135 ± 0.018
AsnAsp: 2.815 ± 0.021
AsnGlu: 3.627 ± 0.023
AsnPhe: 2.3 ± 0.016
AsnGly: 3.069 ± 0.055
AsnHis: 1.248 ± 0.013
AsnIle: 4.033 ± 0.026
AsnLys: 4.065 ± 0.025
AsnLeu: 5.188 ± 0.024
AsnMet: 1.24 ± 0.015
AsnAsn: 3.683 ± 0.034
AsnPro: 2.387 ± 0.024
AsnGln: 2.352 ± 0.061
AsnArg: 2.427 ± 0.018
AsnSer: 4.396 ± 0.026
AsnThr: 3.167 ± 0.027
AsnVal: 3.417 ± 0.022
AsnTrp: 0.532 ± 0.008
AsnTyr: 1.873 ± 0.016
AsnXaa: 0.001 ± 0.0
Pro
ProAla: 2.563 ± 0.022
ProCys: 0.771 ± 0.029
ProAsp: 2.568 ± 0.022
ProGlu: 3.358 ± 0.031
ProPhe: 1.856 ± 0.019
ProGly: 2.42 ± 0.036
ProHis: 1.162 ± 0.013
ProIle: 2.677 ± 0.021
ProLys: 3.373 ± 0.028
ProLeu: 3.743 ± 0.021
ProMet: 0.895 ± 0.013
ProAsn: 2.514 ± 0.024
ProPro: 3.772 ± 0.045
ProGln: 2.102 ± 0.02
ProArg: 2.041 ± 0.018
ProSer: 4.169 ± 0.034
ProThr: 3.186 ± 0.042
ProVal: 3.199 ± 0.025
ProTrp: 0.419 ± 0.006
ProTyr: 1.964 ± 0.024
ProXaa: 0.001 ± 0.0
Gln
GlnAla: 2.313 ± 0.023
GlnCys: 0.969 ± 0.022
GlnAsp: 1.968 ± 0.023
GlnGlu: 2.978 ± 0.024
GlnPhe: 2.151 ± 0.026
GlnGly: 1.873 ± 0.026
GlnHis: 1.1 ± 0.011
GlnIle: 2.499 ± 0.014
GlnLys: 3.121 ± 0.026
GlnLeu: 3.763 ± 0.022
GlnMet: 0.958 ± 0.011
GlnAsn: 2.641 ± 0.045
GlnPro: 2.012 ± 0.025
GlnGln: 2.753 ± 0.056
GlnArg: 2.154 ± 0.019
GlnSer: 2.849 ± 0.021
GlnThr: 2.316 ± 0.018
GlnVal: 2.288 ± 0.014
GlnTrp: 0.42 ± 0.007
GlnTyr: 1.388 ± 0.012
GlnXaa: 0.001 ± 0.0
Arg
ArgAla: 2.32 ± 0.017
ArgCys: 0.952 ± 0.017
ArgAsp: 2.475 ± 0.023
ArgGlu: 3.085 ± 0.023
ArgPhe: 1.838 ± 0.017
ArgGly: 2.269 ± 0.023
ArgHis: 1.418 ± 0.014
ArgIle: 2.765 ± 0.021
ArgLys: 3.909 ± 0.028
ArgLeu: 3.995 ± 0.025
ArgMet: 0.999 ± 0.013
ArgAsn: 2.683 ± 0.02
ArgPro: 2.281 ± 0.029
ArgGln: 2.124 ± 0.021
ArgArg: 3.134 ± 0.021
ArgSer: 3.663 ± 0.029
ArgThr: 2.554 ± 0.02
ArgVal: 2.707 ± 0.023
ArgTrp: 0.472 ± 0.007
ArgTyr: 1.562 ± 0.013
ArgXaa: 0.001 ± 0.0
Ser
SerAla: 4.0 ± 0.025
SerCys: 1.486 ± 0.026
SerAsp: 4.522 ± 0.029
SerGlu: 5.029 ± 0.034
SerPhe: 2.942 ± 0.019
SerGly: 4.284 ± 0.039
SerHis: 1.874 ± 0.02
SerIle: 4.402 ± 0.026
SerLys: 5.426 ± 0.034
SerLeu: 6.835 ± 0.035
SerMet: 1.468 ± 0.013
SerAsn: 4.393 ± 0.031
SerPro: 4.196 ± 0.04
SerGln: 3.31 ± 0.024
SerArg: 3.698 ± 0.029
SerSer: 8.251 ± 0.055
SerThr: 5.29 ± 0.048
SerVal: 4.55 ± 0.022
SerTrp: 0.74 ± 0.008
SerTyr: 2.356 ± 0.018
SerXaa: 0.002 ± 0.0
Thr
ThrAla: 3.306 ± 0.023
ThrCys: 1.299 ± 0.019
ThrAsp: 3.217 ± 0.034
ThrGlu: 4.267 ± 0.059
ThrPhe: 2.272 ± 0.016
ThrGly: 3.887 ± 0.032
ThrHis: 1.548 ± 0.018
ThrIle: 3.547 ± 0.022
ThrLys: 4.108 ± 0.027
ThrLeu: 5.028 ± 0.025
ThrMet: 1.131 ± 0.011
ThrAsn: 3.156 ± 0.022
ThrPro: 3.319 ± 0.029
ThrGln: 2.209 ± 0.017
ThrArg: 2.454 ± 0.022
ThrSer: 5.085 ± 0.04
ThrThr: 4.743 ± 0.104
ThrVal: 3.851 ± 0.033
ThrTrp: 0.555 ± 0.01
ThrTyr: 1.746 ± 0.014
ThrXaa: 0.001 ± 0.0
Val
ValAla: 3.575 ± 0.037
ValCys: 1.368 ± 0.015
ValAsp: 3.375 ± 0.02
ValGlu: 4.18 ± 0.035
ValPhe: 2.347 ± 0.018
ValGly: 2.805 ± 0.018
ValHis: 1.947 ± 0.024
ValIle: 3.595 ± 0.024
ValLys: 4.326 ± 0.027
ValLeu: 5.225 ± 0.03
ValMet: 1.226 ± 0.014
ValAsn: 3.107 ± 0.02
ValPro: 3.104 ± 0.025
ValGln: 2.381 ± 0.017
ValArg: 2.523 ± 0.02
ValSer: 4.465 ± 0.024
ValThr: 3.837 ± 0.033
ValVal: 4.158 ± 0.047
ValTrp: 0.572 ± 0.008
ValTyr: 1.864 ± 0.015
ValXaa: 0.001 ± 0.0
Trp
TrpAla: 0.472 ± 0.007
TrpCys: 0.183 ± 0.004
TrpAsp: 0.521 ± 0.008
TrpGlu: 0.531 ± 0.007
TrpPhe: 0.426 ± 0.007
TrpGly: 0.474 ± 0.009
TrpHis: 0.233 ± 0.006
TrpIle: 0.653 ± 0.011
TrpLys: 0.749 ± 0.009
TrpLeu: 0.95 ± 0.012
TrpMet: 0.251 ± 0.006
TrpAsn: 0.568 ± 0.009
TrpPro: 0.378 ± 0.007
TrpGln: 0.362 ± 0.006
TrpArg: 0.541 ± 0.009
TrpSer: 0.715 ± 0.009
TrpThr: 0.566 ± 0.009
TrpVal: 0.508 ± 0.008
TrpTrp: 0.145 ± 0.004
TrpTyr: 0.352 ± 0.006
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.519 ± 0.016
TyrCys: 0.741 ± 0.011
TyrAsp: 1.794 ± 0.016
TyrGlu: 1.915 ± 0.014
TyrPhe: 1.541 ± 0.016
TyrGly: 1.796 ± 0.021
TyrHis: 0.844 ± 0.011
TyrIle: 2.093 ± 0.015
TyrLys: 2.626 ± 0.026
TyrLeu: 3.066 ± 0.021
TyrMet: 0.728 ± 0.009
TyrAsn: 1.87 ± 0.015
TyrPro: 1.408 ± 0.016
TyrGln: 1.38 ± 0.014
TyrArg: 1.557 ± 0.013
TyrSer: 2.428 ± 0.017
TyrThr: 1.887 ± 0.015
TyrVal: 1.846 ± 0.015
TyrTrp: 0.366 ± 0.006
TyrTyr: 1.359 ± 0.014
TyrXaa: 0.001 ± 0.0
Xaa
XaaAla: 0.001 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.001 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.001 ± 0.0
XaaGly: 0.001 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.002 ± 0.0
XaaLys: 0.001 ± 0.0
XaaLeu: 0.002 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.002 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.001 ± 0.0
XaaArg: 0.001 ± 0.0
XaaSer: 0.001 ± 0.0
XaaThr: 0.001 ± 0.0
XaaVal: 0.001 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.001 ± 0.0
XaaXaa: 0.02 ± 0.008
Statistics based on 25502 proteins (12442332 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
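A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates serves as the error. Reporting the standard deviation of the replicates as the ± value is an assumption, as are the function names, the fixed seed, and the toy sequences; the page only states that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Proteins, not individual dipeptides, are the resampling unit.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(seq) for seq in proteins]  # count once, reuse
    estimates = []
    for _ in range(n_replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Hypothetical toy sequences (one-letter codes), not taken from the proteome.
toy_proteins = ["MKLLA", "MALLK", "GGSLL", "AKKAW"]
mean, err = bootstrap_error(toy_proteins, "LL")
print(f"LL: {mean:.3f} ± {err:.3f}")
```

Resampling whole proteins keeps dipeptides from the same sequence together, so the error reflects variation between proteins rather than treating every dipeptide as an independent observation.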

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski