Amino acid dipepetide frequency for Trachymyrmex cornetzi

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.783AlaAla: 4.783 ± 0.046
1.218AlaCys: 1.218 ± 0.022
2.772AlaAsp: 2.772 ± 0.022
3.638AlaGlu: 3.638 ± 0.027
2.112AlaPhe: 2.112 ± 0.019
3.187AlaGly: 3.187 ± 0.024
1.377AlaHis: 1.377 ± 0.013
3.677AlaIle: 3.677 ± 0.026
3.59AlaLys: 3.59 ± 0.031
5.448AlaLeu: 5.448 ± 0.039
1.443AlaMet: 1.443 ± 0.016
2.752AlaAsn: 2.752 ± 0.02
2.506AlaPro: 2.506 ± 0.025
2.163AlaGln: 2.163 ± 0.021
3.596AlaArg: 3.596 ± 0.025
4.724AlaSer: 4.724 ± 0.029
3.878AlaThr: 3.878 ± 0.032
3.863AlaVal: 3.863 ± 0.025
0.624AlaTrp: 0.624 ± 0.01
1.642AlaTyr: 1.642 ± 0.017
0.004AlaXaa: 0.004 ± 0.001
Cys
1.169CysAla: 1.169 ± 0.016
0.497CysCys: 0.497 ± 0.01
1.158CysAsp: 1.158 ± 0.018
1.209CysGlu: 1.209 ± 0.02
0.807CysPhe: 0.807 ± 0.011
1.312CysGly: 1.312 ± 0.027
0.56CysHis: 0.56 ± 0.01
1.383CysIle: 1.383 ± 0.029
1.248CysLys: 1.248 ± 0.018
1.874CysLeu: 1.874 ± 0.026
0.451CysMet: 0.451 ± 0.01
1.095CysAsn: 1.095 ± 0.018
1.013CysPro: 1.013 ± 0.032
0.768CysGln: 0.768 ± 0.016
1.233CysArg: 1.233 ± 0.031
1.662CysSer: 1.662 ± 0.029
1.213CysThr: 1.213 ± 0.022
1.281CysVal: 1.281 ± 0.023
0.24CysTrp: 0.24 ± 0.006
0.645CysTyr: 0.645 ± 0.011
0.0CysXaa: 0.0 ± 0.0
Asp
3.038AspAla: 3.038 ± 0.025
1.068AspCys: 1.068 ± 0.021
3.562AspAsp: 3.562 ± 0.034
3.97AspGlu: 3.97 ± 0.031
2.033AspPhe: 2.033 ± 0.02
2.925AspGly: 2.925 ± 0.029
1.136AspHis: 1.136 ± 0.013
3.721AspIle: 3.721 ± 0.026
3.378AspLys: 3.378 ± 0.032
4.527AspLeu: 4.527 ± 0.031
1.186AspMet: 1.186 ± 0.013
2.847AspAsn: 2.847 ± 0.025
2.165AspPro: 2.165 ± 0.027
1.535AspGln: 1.535 ± 0.016
2.805AspArg: 2.805 ± 0.028
4.197AspSer: 4.197 ± 0.036
2.985AspThr: 2.985 ± 0.023
3.562AspVal: 3.562 ± 0.023
0.591AspTrp: 0.591 ± 0.009
1.779AspTyr: 1.779 ± 0.017
0.002AspXaa: 0.002 ± 0.0
Glu
3.649GluAla: 3.649 ± 0.028
1.271GluCys: 1.271 ± 0.03
3.877GluAsp: 3.877 ± 0.034
5.98GluGlu: 5.98 ± 0.079
2.165GluPhe: 2.165 ± 0.02
2.923GluGly: 2.923 ± 0.026
1.503GluHis: 1.503 ± 0.015
4.386GluIle: 4.386 ± 0.042
5.183GluLys: 5.183 ± 0.047
5.464GluLeu: 5.464 ± 0.042
1.572GluMet: 1.572 ± 0.017
3.945GluAsn: 3.945 ± 0.031
2.212GluPro: 2.212 ± 0.025
2.592GluGln: 2.592 ± 0.024
4.357GluArg: 4.357 ± 0.036
4.464GluSer: 4.464 ± 0.037
3.845GluThr: 3.845 ± 0.036
3.437GluVal: 3.437 ± 0.03
0.677GluTrp: 0.677 ± 0.011
1.974GluTyr: 1.974 ± 0.017
0.002GluXaa: 0.002 ± 0.001
Phe
2.143PheAla: 2.143 ± 0.021
0.894PheCys: 0.894 ± 0.012
2.081PheAsp: 2.081 ± 0.02
2.172PheGlu: 2.172 ± 0.02
1.602PhePhe: 1.602 ± 0.018
2.056PheGly: 2.056 ± 0.022
1.045PheHis: 1.045 ± 0.013
2.379PheIle: 2.379 ± 0.025
2.086PheLys: 2.086 ± 0.018
3.757PheLeu: 3.757 ± 0.034
0.813PheMet: 0.813 ± 0.011
1.875PheAsn: 1.875 ± 0.023
1.679PhePro: 1.679 ± 0.014
1.403PheGln: 1.403 ± 0.015
1.995PheArg: 1.995 ± 0.019
3.02PheSer: 3.02 ± 0.026
2.145PheThr: 2.145 ± 0.021
2.408PheVal: 2.408 ± 0.02
0.426PheTrp: 0.426 ± 0.009
1.372PheTyr: 1.372 ± 0.017
0.002PheXaa: 0.002 ± 0.001
Gly
2.937GlyAla: 2.937 ± 0.023
1.012GlyCys: 1.012 ± 0.017
2.626GlyAsp: 2.626 ± 0.025
3.027GlyGlu: 3.027 ± 0.03
1.969GlyPhe: 1.969 ± 0.022
3.847GlyGly: 3.847 ± 0.049
1.395GlyHis: 1.395 ± 0.017
3.227GlyIle: 3.227 ± 0.023
3.286GlyLys: 3.286 ± 0.023
4.205GlyLeu: 4.205 ± 0.032
1.185GlyMet: 1.185 ± 0.015
2.601GlyAsn: 2.601 ± 0.027
2.183GlyPro: 2.183 ± 0.036
1.88GlyGln: 1.88 ± 0.02
3.243GlyArg: 3.243 ± 0.028
4.208GlySer: 4.208 ± 0.033
3.075GlyThr: 3.075 ± 0.028
3.029GlyVal: 3.029 ± 0.031
0.656GlyTrp: 0.656 ± 0.012
1.891GlyTyr: 1.891 ± 0.023
0.006GlyXaa: 0.006 ± 0.001
His
1.385HisAla: 1.385 ± 0.017
0.604HisCys: 0.604 ± 0.011
1.163HisAsp: 1.163 ± 0.012
1.429HisGlu: 1.429 ± 0.014
1.039HisPhe: 1.039 ± 0.012
1.351HisGly: 1.351 ± 0.014
1.018HisHis: 1.018 ± 0.02
1.567HisIle: 1.567 ± 0.017
1.388HisLys: 1.388 ± 0.017
2.354HisLeu: 2.354 ± 0.019
0.575HisMet: 0.575 ± 0.008
1.205HisAsn: 1.205 ± 0.013
1.34HisPro: 1.34 ± 0.014
1.057HisGln: 1.057 ± 0.016
1.602HisArg: 1.602 ± 0.016
1.947HisSer: 1.947 ± 0.02
1.498HisThr: 1.498 ± 0.021
1.641HisVal: 1.641 ± 0.015
0.291HisTrp: 0.291 ± 0.007
0.903HisTyr: 0.903 ± 0.013
0.001HisXaa: 0.001 ± 0.0
Ile
3.854IleAla: 3.854 ± 0.027
1.363IleCys: 1.363 ± 0.023
3.425IleAsp: 3.425 ± 0.024
4.01IleGlu: 4.01 ± 0.035
2.611IlePhe: 2.611 ± 0.026
3.036IleGly: 3.036 ± 0.025
1.572IleHis: 1.572 ± 0.022
4.024IleIle: 4.024 ± 0.035
3.966IleLys: 3.966 ± 0.033
5.854IleLeu: 5.854 ± 0.04
1.404IleMet: 1.404 ± 0.014
3.449IleAsn: 3.449 ± 0.031
2.993IlePro: 2.993 ± 0.025
2.31IleGln: 2.31 ± 0.021
3.338IleArg: 3.338 ± 0.025
4.897IleSer: 4.897 ± 0.032
3.641IleThr: 3.641 ± 0.032
3.872IleVal: 3.872 ± 0.027
0.616IleTrp: 0.616 ± 0.012
2.089IleTyr: 2.089 ± 0.021
0.002IleXaa: 0.002 ± 0.0
Lys
3.187LysAla: 3.187 ± 0.027
1.305LysCys: 1.305 ± 0.022
3.557LysAsp: 3.557 ± 0.036
4.924LysGlu: 4.924 ± 0.056
2.274LysPhe: 2.274 ± 0.019
2.628LysGly: 2.628 ± 0.025
1.591LysHis: 1.591 ± 0.015
4.323LysIle: 4.323 ± 0.031
5.476LysLys: 5.476 ± 0.063
5.84LysLeu: 5.84 ± 0.042
1.541LysMet: 1.541 ± 0.017
3.618LysAsn: 3.618 ± 0.032
2.639LysPro: 2.639 ± 0.038
2.6LysGln: 2.6 ± 0.026
4.249LysArg: 4.249 ± 0.032
4.713LysSer: 4.713 ± 0.035
3.664LysThr: 3.664 ± 0.03
3.355LysVal: 3.355 ± 0.025
0.765LysTrp: 0.765 ± 0.012
2.26LysTyr: 2.26 ± 0.019
0.003LysXaa: 0.003 ± 0.001
Leu
5.597LeuAla: 5.597 ± 0.038
1.846LeuCys: 1.846 ± 0.022
4.579LeuAsp: 4.579 ± 0.032
5.831LeuGlu: 5.831 ± 0.038
3.229LeuPhe: 3.229 ± 0.028
4.128LeuGly: 4.128 ± 0.03
2.531LeuHis: 2.531 ± 0.02
5.047LeuIle: 5.047 ± 0.035
5.967LeuLys: 5.967 ± 0.043
8.753LeuLeu: 8.753 ± 0.051
2.012LeuMet: 2.012 ± 0.018
4.461LeuAsn: 4.461 ± 0.029
4.721LeuPro: 4.721 ± 0.03
4.363LeuGln: 4.363 ± 0.031
5.605LeuArg: 5.605 ± 0.033
7.308LeuSer: 7.308 ± 0.04
5.12LeuThr: 5.12 ± 0.032
4.769LeuVal: 4.769 ± 0.032
0.911LeuTrp: 0.911 ± 0.013
2.927LeuTyr: 2.927 ± 0.024
0.006LeuXaa: 0.006 ± 0.001
Met
1.426MetAla: 1.426 ± 0.015
0.457MetCys: 0.457 ± 0.01
1.254MetAsp: 1.254 ± 0.015
1.658MetGlu: 1.658 ± 0.015
0.856MetPhe: 0.856 ± 0.013
1.056MetGly: 1.056 ± 0.016
0.566MetHis: 0.566 ± 0.009
1.346MetIle: 1.346 ± 0.014
1.567MetLys: 1.567 ± 0.016
2.08MetLeu: 2.08 ± 0.018
0.609MetMet: 0.609 ± 0.01
1.12MetAsn: 1.12 ± 0.013
1.076MetPro: 1.076 ± 0.013
1.042MetGln: 1.042 ± 0.015
1.332MetArg: 1.332 ± 0.015
1.771MetSer: 1.771 ± 0.017
1.33MetThr: 1.33 ± 0.015
1.189MetVal: 1.189 ± 0.014
0.237MetTrp: 0.237 ± 0.006
0.787MetTyr: 0.787 ± 0.012
0.001MetXaa: 0.001 ± 0.0
Asn
2.991AsnAla: 2.991 ± 0.024
1.105AsnCys: 1.105 ± 0.017
2.903AsnAsp: 2.903 ± 0.026
3.325AsnGlu: 3.325 ± 0.028
2.085AsnPhe: 2.085 ± 0.022
2.775AsnGly: 2.775 ± 0.026
1.194AsnHis: 1.194 ± 0.016
3.754AsnIle: 3.754 ± 0.027
3.349AsnLys: 3.349 ± 0.027
4.604AsnLeu: 4.604 ± 0.034
1.21AsnMet: 1.21 ± 0.013
3.469AsnAsn: 3.469 ± 0.034
2.155AsnPro: 2.155 ± 0.027
1.813AsnGln: 1.813 ± 0.02
2.613AsnArg: 2.613 ± 0.023
4.051AsnSer: 4.051 ± 0.03
3.004AsnThr: 3.004 ± 0.022
3.679AsnVal: 3.679 ± 0.027
0.514AsnTrp: 0.514 ± 0.009
1.752AsnTyr: 1.752 ± 0.018
0.002AsnXaa: 0.002 ± 0.0
Pro
2.77ProAla: 2.77 ± 0.026
0.844ProCys: 0.844 ± 0.038
2.258ProAsp: 2.258 ± 0.021
2.835ProGlu: 2.835 ± 0.032
1.656ProPhe: 1.656 ± 0.014
2.636ProGly: 2.636 ± 0.056
1.163ProHis: 1.163 ± 0.014
2.701ProIle: 2.701 ± 0.026
2.532ProLys: 2.532 ± 0.027
4.055ProLeu: 4.055 ± 0.028
0.975ProMet: 0.975 ± 0.014
2.141ProAsn: 2.141 ± 0.023
3.777ProPro: 3.777 ± 0.058
1.847ProGln: 1.847 ± 0.024
2.756ProArg: 2.756 ± 0.028
4.226ProSer: 4.226 ± 0.039
3.035ProThr: 3.035 ± 0.027
2.912ProVal: 2.912 ± 0.025
0.498ProTrp: 0.498 ± 0.009
1.488ProTyr: 1.488 ± 0.018
0.004ProXaa: 0.004 ± 0.001
Gln
2.193GlnAla: 2.193 ± 0.023
0.815GlnCys: 0.815 ± 0.018
1.883GlnAsp: 1.883 ± 0.017
2.722GlnGlu: 2.722 ± 0.027
1.365GlnPhe: 1.365 ± 0.015
1.668GlnGly: 1.668 ± 0.018
1.181GlnHis: 1.181 ± 0.017
2.316GlnIle: 2.316 ± 0.02
2.518GlnLys: 2.518 ± 0.023
3.694GlnLeu: 3.694 ± 0.029
0.948GlnMet: 0.948 ± 0.014
2.177GlnAsn: 2.177 ± 0.018
1.835GlnPro: 1.835 ± 0.026
3.111GlnGln: 3.111 ± 0.075
2.574GlnArg: 2.574 ± 0.019
2.993GlnSer: 2.993 ± 0.026
2.257GlnThr: 2.257 ± 0.024
2.061GlnVal: 2.061 ± 0.019
0.462GlnTrp: 0.462 ± 0.008
1.269GlnTyr: 1.269 ± 0.014
0.001GlnXaa: 0.001 ± 0.0
Arg
3.394ArgAla: 3.394 ± 0.027
1.347ArgCys: 1.347 ± 0.022
3.116ArgAsp: 3.116 ± 0.028
4.106ArgGlu: 4.106 ± 0.032
2.183ArgPhe: 2.183 ± 0.02
3.205ArgGly: 3.205 ± 0.033
1.636ArgHis: 1.636 ± 0.017
3.527ArgIle: 3.527 ± 0.024
4.257ArgLys: 4.257 ± 0.032
5.317ArgLeu: 5.317 ± 0.034
1.333ArgMet: 1.333 ± 0.014
3.148ArgAsn: 3.148 ± 0.025
2.513ArgPro: 2.513 ± 0.031
2.41ArgGln: 2.41 ± 0.02
4.839ArgArg: 4.839 ± 0.04
4.632ArgSer: 4.632 ± 0.036
3.203ArgThr: 3.203 ± 0.022
3.22ArgVal: 3.22 ± 0.022
0.73ArgTrp: 0.73 ± 0.011
2.002ArgTyr: 2.002 ± 0.02
0.005ArgXaa: 0.005 ± 0.001
Ser
4.528SerAla: 4.528 ± 0.029
1.58SerCys: 1.58 ± 0.025
4.22SerAsp: 4.22 ± 0.031
4.647SerGlu: 4.647 ± 0.031
2.922SerPhe: 2.922 ± 0.025
4.357SerGly: 4.357 ± 0.035
1.899SerHis: 1.899 ± 0.018
4.59SerIle: 4.59 ± 0.03
4.672SerLys: 4.672 ± 0.038
7.075SerLeu: 7.075 ± 0.038
1.746SerMet: 1.746 ± 0.015
4.236SerAsn: 4.236 ± 0.029
4.283SerPro: 4.283 ± 0.044
3.03SerGln: 3.03 ± 0.028
4.824SerArg: 4.824 ± 0.036
8.346SerSer: 8.346 ± 0.071
5.34SerThr: 5.34 ± 0.042
4.695SerVal: 4.695 ± 0.026
0.839SerTrp: 0.839 ± 0.012
2.383SerTyr: 2.383 ± 0.02
0.004SerXaa: 0.004 ± 0.001
Thr
3.715ThrAla: 3.715 ± 0.029
1.247ThrCys: 1.247 ± 0.022
3.019ThrAsp: 3.019 ± 0.024
3.643ThrGlu: 3.643 ± 0.041
2.299ThrPhe: 2.299 ± 0.022
3.175ThrGly: 3.175 ± 0.027
1.316ThrHis: 1.316 ± 0.013
3.751ThrIle: 3.751 ± 0.027
3.567ThrLys: 3.567 ± 0.034
5.429ThrLeu: 5.429 ± 0.028
1.381ThrMet: 1.381 ± 0.013
2.959ThrAsn: 2.959 ± 0.026
3.104ThrPro: 3.104 ± 0.034
2.055ThrGln: 2.055 ± 0.019
3.215ThrArg: 3.215 ± 0.023
5.301ThrSer: 5.301 ± 0.041
4.325ThrThr: 4.325 ± 0.064
3.813ThrVal: 3.813 ± 0.028
0.674ThrTrp: 0.674 ± 0.011
1.929ThrTyr: 1.929 ± 0.024
0.003ThrXaa: 0.003 ± 0.001
Val
3.886ValAla: 3.886 ± 0.026
1.369ValCys: 1.369 ± 0.027
3.194ValAsp: 3.194 ± 0.019
3.775ValGlu: 3.775 ± 0.031
2.256ValPhe: 2.256 ± 0.02
2.952ValGly: 2.952 ± 0.024
1.511ValHis: 1.511 ± 0.017
3.654ValIle: 3.654 ± 0.025
3.676ValLys: 3.676 ± 0.031
5.259ValLeu: 5.259 ± 0.031
1.315ValMet: 1.315 ± 0.012
2.895ValAsn: 2.895 ± 0.022
3.017ValPro: 3.017 ± 0.027
2.382ValGln: 2.382 ± 0.018
3.29ValArg: 3.29 ± 0.025
4.602ValSer: 4.602 ± 0.028
3.865ValThr: 3.865 ± 0.04
3.704ValVal: 3.704 ± 0.027
0.647ValTrp: 0.647 ± 0.01
1.979ValTyr: 1.979 ± 0.019
0.005ValXaa: 0.005 ± 0.001
Trp
0.558TrpAla: 0.558 ± 0.011
0.229TrpCys: 0.229 ± 0.007
0.59TrpAsp: 0.59 ± 0.01
0.65TrpGlu: 0.65 ± 0.01
0.43TrpPhe: 0.43 ± 0.009
0.506TrpGly: 0.506 ± 0.01
0.286TrpHis: 0.286 ± 0.007
0.728TrpIle: 0.728 ± 0.012
0.785TrpLys: 0.785 ± 0.014
1.114TrpLeu: 1.114 ± 0.015
0.287TrpMet: 0.287 ± 0.008
0.612TrpAsn: 0.612 ± 0.008
0.414TrpPro: 0.414 ± 0.007
0.484TrpGln: 0.484 ± 0.009
0.745TrpArg: 0.745 ± 0.011
0.807TrpSer: 0.807 ± 0.012
0.597TrpThr: 0.597 ± 0.01
0.543TrpVal: 0.543 ± 0.01
0.188TrpTrp: 0.188 ± 0.005
0.384TrpTyr: 0.384 ± 0.009
0.001TrpXaa: 0.001 ± 0.0
Tyr
1.792TyrAla: 1.792 ± 0.018
0.748TyrCys: 0.748 ± 0.012
1.774TyrAsp: 1.774 ± 0.018
1.924TyrGlu: 1.924 ± 0.017
1.461TyrPhe: 1.461 ± 0.017
1.764TyrGly: 1.764 ± 0.019
0.876TyrHis: 0.876 ± 0.011
2.22TyrIle: 2.22 ± 0.024
2.019TyrLys: 2.019 ± 0.023
3.005TyrLeu: 3.005 ± 0.025
0.772TyrMet: 0.772 ± 0.011
1.739TyrAsn: 1.739 ± 0.018
1.48TyrPro: 1.48 ± 0.022
1.197TyrGln: 1.197 ± 0.015
1.905TyrArg: 1.905 ± 0.019
2.339TyrSer: 2.339 ± 0.021
1.894TyrThr: 1.894 ± 0.019
2.197TyrVal: 2.197 ± 0.025
0.351TyrTrp: 0.351 ± 0.008
1.278TyrTyr: 1.278 ± 0.016
0.001TyrXaa: 0.001 ± 0.0
Xaa
0.003XaaAla: 0.003 ± 0.001
0.001XaaCys: 0.001 ± 0.0
0.002XaaAsp: 0.002 ± 0.001
0.002XaaGlu: 0.002 ± 0.001
0.002XaaPhe: 0.002 ± 0.0
0.003XaaGly: 0.003 ± 0.001
0.003XaaHis: 0.003 ± 0.001
0.003XaaIle: 0.003 ± 0.001
0.002XaaLys: 0.002 ± 0.001
0.005XaaLeu: 0.005 ± 0.001
0.003XaaMet: 0.003 ± 0.001
0.003XaaAsn: 0.003 ± 0.001
0.005XaaPro: 0.005 ± 0.001
0.002XaaGln: 0.002 ± 0.001
0.005XaaArg: 0.005 ± 0.001
0.004XaaSer: 0.004 ± 0.001
0.002XaaThr: 0.002 ± 0.001
0.002XaaVal: 0.002 ± 0.0
0.001XaaTrp: 0.001 ± 0.0
0.002XaaTyr: 0.002 ± 0.0
5.466XaaXaa: 5.466 ± 0.388
Statistics based on 18657 proteins (7760051 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski