Amino acid dipepetide frequency for Trichinella nativa

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.549AlaAla: 5.549 ± 0.046
1.5AlaCys: 1.5 ± 0.016
3.589AlaAsp: 3.589 ± 0.021
4.476AlaGlu: 4.476 ± 0.034
2.788AlaPhe: 2.788 ± 0.021
3.352AlaGly: 3.352 ± 0.025
1.315AlaHis: 1.315 ± 0.012
3.387AlaIle: 3.387 ± 0.024
3.61AlaLys: 3.61 ± 0.025
6.142AlaLeu: 6.142 ± 0.037
1.598AlaMet: 1.598 ± 0.019
2.995AlaAsn: 2.995 ± 0.022
2.351AlaPro: 2.351 ± 0.025
2.301AlaGln: 2.301 ± 0.019
3.018AlaArg: 3.018 ± 0.027
5.238AlaSer: 5.238 ± 0.032
3.57AlaThr: 3.57 ± 0.027
5.085AlaVal: 5.085 ± 0.028
0.697AlaTrp: 0.697 ± 0.011
1.84AlaTyr: 1.84 ± 0.015
0.001AlaXaa: 0.001 ± 0.0
Cys
1.462CysAla: 1.462 ± 0.018
0.909CysCys: 0.909 ± 0.015
1.373CysAsp: 1.373 ± 0.024
1.456CysGlu: 1.456 ± 0.022
1.292CysPhe: 1.292 ± 0.013
1.442CysGly: 1.442 ± 0.019
0.684CysHis: 0.684 ± 0.011
1.543CysIle: 1.543 ± 0.024
1.487CysLys: 1.487 ± 0.02
2.616CysLeu: 2.616 ± 0.024
0.555CysMet: 0.555 ± 0.009
1.216CysAsn: 1.216 ± 0.017
1.243CysPro: 1.243 ± 0.025
1.117CysGln: 1.117 ± 0.015
1.617CysArg: 1.617 ± 0.018
2.462CysSer: 2.462 ± 0.025
1.369CysThr: 1.369 ± 0.018
1.499CysVal: 1.499 ± 0.017
0.337CysTrp: 0.337 ± 0.007
0.766CysTyr: 0.766 ± 0.012
0.0CysXaa: 0.0 ± 0.0
Asp
3.282AspAla: 3.282 ± 0.023
1.4AspCys: 1.4 ± 0.022
4.005AspAsp: 4.005 ± 0.036
4.279AspGlu: 4.279 ± 0.03
2.385AspPhe: 2.385 ± 0.019
3.202AspGly: 3.202 ± 0.038
1.236AspHis: 1.236 ± 0.014
2.811AspIle: 2.811 ± 0.022
2.55AspLys: 2.55 ± 0.019
4.664AspLeu: 4.664 ± 0.026
1.174AspMet: 1.174 ± 0.013
2.345AspAsn: 2.345 ± 0.019
2.077AspPro: 2.077 ± 0.019
2.087AspGln: 2.087 ± 0.017
2.761AspArg: 2.761 ± 0.023
4.185AspSer: 4.185 ± 0.027
2.159AspThr: 2.159 ± 0.017
3.697AspVal: 3.697 ± 0.023
0.708AspTrp: 0.708 ± 0.011
1.653AspTyr: 1.653 ± 0.015
0.0AspXaa: 0.0 ± 0.0
Glu
3.933GluAla: 3.933 ± 0.037
1.337GluCys: 1.337 ± 0.022
3.182GluAsp: 3.182 ± 0.025
5.494GluGlu: 5.494 ± 0.05
2.455GluPhe: 2.455 ± 0.019
2.295GluGly: 2.295 ± 0.023
1.435GluHis: 1.435 ± 0.014
3.946GluIle: 3.946 ± 0.033
5.015GluLys: 5.015 ± 0.038
5.794GluLeu: 5.794 ± 0.036
1.925GluMet: 1.925 ± 0.016
4.093GluAsn: 4.093 ± 0.031
2.134GluPro: 2.134 ± 0.019
3.013GluGln: 3.013 ± 0.024
3.547GluArg: 3.547 ± 0.028
4.447GluSer: 4.447 ± 0.029
3.283GluThr: 3.283 ± 0.027
3.468GluVal: 3.468 ± 0.024
0.702GluTrp: 0.702 ± 0.012
1.748GluTyr: 1.748 ± 0.015
0.0GluXaa: 0.0 ± 0.0
Phe
2.794PheAla: 2.794 ± 0.019
1.399PheCys: 1.399 ± 0.015
2.643PheAsp: 2.643 ± 0.022
2.665PheGlu: 2.665 ± 0.019
2.232PhePhe: 2.232 ± 0.022
2.615PheGly: 2.615 ± 0.021
1.317PheHis: 1.317 ± 0.013
2.569PheIle: 2.569 ± 0.024
2.216PheLys: 2.216 ± 0.018
4.334PheLeu: 4.334 ± 0.032
0.892PheMet: 0.892 ± 0.012
2.079PheAsn: 2.079 ± 0.016
1.98PhePro: 1.98 ± 0.018
1.934PheGln: 1.934 ± 0.018
2.373PheArg: 2.373 ± 0.022
3.94PheSer: 3.94 ± 0.027
2.435PheThr: 2.435 ± 0.019
3.094PheVal: 3.094 ± 0.022
0.607PheTrp: 0.607 ± 0.009
1.698PheTyr: 1.698 ± 0.017
0.0PheXaa: 0.0 ± 0.0
Gly
2.744GlyAla: 2.744 ± 0.024
1.269GlyCys: 1.269 ± 0.016
2.626GlyAsp: 2.626 ± 0.023
2.911GlyGlu: 2.911 ± 0.026
2.257GlyPhe: 2.257 ± 0.021
3.304GlyGly: 3.304 ± 0.04
1.283GlyHis: 1.283 ± 0.014
2.908GlyIle: 2.908 ± 0.02
3.356GlyLys: 3.356 ± 0.025
4.044GlyLeu: 4.044 ± 0.032
1.268GlyMet: 1.268 ± 0.018
2.51GlyAsn: 2.51 ± 0.021
1.821GlyPro: 1.821 ± 0.03
2.077GlyGln: 2.077 ± 0.02
3.293GlyArg: 3.293 ± 0.024
4.261GlySer: 4.261 ± 0.034
2.602GlyThr: 2.602 ± 0.02
3.085GlyVal: 3.085 ± 0.022
0.651GlyTrp: 0.651 ± 0.011
1.71GlyTyr: 1.71 ± 0.022
0.001GlyXaa: 0.001 ± 0.0
His
1.437HisAla: 1.437 ± 0.014
0.89HisCys: 0.89 ± 0.013
1.047HisAsp: 1.047 ± 0.012
1.235HisGlu: 1.235 ± 0.014
1.409HisPhe: 1.409 ± 0.014
1.337HisGly: 1.337 ± 0.015
0.979HisHis: 0.979 ± 0.019
1.292HisIle: 1.292 ± 0.011
0.978HisLys: 0.978 ± 0.01
2.698HisLeu: 2.698 ± 0.02
0.556HisMet: 0.556 ± 0.01
0.966HisAsn: 0.966 ± 0.01
1.231HisPro: 1.231 ± 0.014
1.094HisGln: 1.094 ± 0.012
1.532HisArg: 1.532 ± 0.015
2.083HisSer: 2.083 ± 0.018
1.105HisThr: 1.105 ± 0.011
1.572HisVal: 1.572 ± 0.016
0.359HisTrp: 0.359 ± 0.006
0.88HisTyr: 0.88 ± 0.01
0.0HisXaa: 0.0 ± 0.0
Ile
3.683IleAla: 3.683 ± 0.022
1.699IleCys: 1.699 ± 0.02
3.026IleAsp: 3.026 ± 0.021
3.317IleGlu: 3.317 ± 0.026
2.786IlePhe: 2.786 ± 0.025
2.868IleGly: 2.868 ± 0.023
1.337IleHis: 1.337 ± 0.013
3.118IleIle: 3.118 ± 0.026
2.826IleLys: 2.826 ± 0.02
5.324IleLeu: 5.324 ± 0.029
1.189IleMet: 1.189 ± 0.011
2.455IleAsn: 2.455 ± 0.019
2.593IlePro: 2.593 ± 0.021
2.161IleGln: 2.161 ± 0.018
3.151IleArg: 3.151 ± 0.02
4.776IleSer: 4.776 ± 0.028
2.82IleThr: 2.82 ± 0.025
3.652IleVal: 3.652 ± 0.025
0.678IleTrp: 0.678 ± 0.009
1.919IleTyr: 1.919 ± 0.017
0.0IleXaa: 0.0 ± 0.0
Lys
3.577LysAla: 3.577 ± 0.023
1.503LysCys: 1.503 ± 0.022
2.654LysAsp: 2.654 ± 0.023
3.898LysGlu: 3.898 ± 0.034
2.576LysPhe: 2.576 ± 0.02
2.264LysGly: 2.264 ± 0.025
1.51LysHis: 1.51 ± 0.016
3.599LysIle: 3.599 ± 0.023
4.589LysLys: 4.589 ± 0.047
6.122LysLeu: 6.122 ± 0.037
1.747LysMet: 1.747 ± 0.018
3.186LysAsn: 3.186 ± 0.024
2.441LysPro: 2.441 ± 0.023
2.855LysGln: 2.855 ± 0.023
3.859LysArg: 3.859 ± 0.029
4.632LysSer: 4.632 ± 0.025
2.989LysThr: 2.989 ± 0.022
3.327LysVal: 3.327 ± 0.023
0.741LysTrp: 0.741 ± 0.01
1.882LysTyr: 1.882 ± 0.016
0.001LysXaa: 0.001 ± 0.0
Leu
6.023LeuAla: 6.023 ± 0.037
2.488LeuCys: 2.488 ± 0.024
4.558LeuAsp: 4.558 ± 0.03
5.77LeuGlu: 5.77 ± 0.04
4.593LeuPhe: 4.593 ± 0.037
3.826LeuGly: 3.826 ± 0.028
2.662LeuHis: 2.662 ± 0.017
5.417LeuIle: 5.417 ± 0.032
6.558LeuLys: 6.558 ± 0.038
10.599LeuLeu: 10.599 ± 0.056
2.241LeuMet: 2.241 ± 0.019
4.989LeuAsn: 4.989 ± 0.027
4.754LeuPro: 4.754 ± 0.031
4.493LeuGln: 4.493 ± 0.03
5.428LeuArg: 5.428 ± 0.036
7.685LeuSer: 7.685 ± 0.042
4.994LeuThr: 4.994 ± 0.028
5.495LeuVal: 5.495 ± 0.032
1.061LeuTrp: 1.061 ± 0.014
2.943LeuTyr: 2.943 ± 0.026
0.0LeuXaa: 0.0 ± 0.0
Met
1.628MetAla: 1.628 ± 0.016
0.487MetCys: 0.487 ± 0.007
1.283MetAsp: 1.283 ± 0.011
1.641MetGlu: 1.641 ± 0.015
1.019MetPhe: 1.019 ± 0.013
0.913MetGly: 0.913 ± 0.01
0.66MetHis: 0.66 ± 0.01
1.308MetIle: 1.308 ± 0.015
1.758MetLys: 1.758 ± 0.016
2.483MetLeu: 2.483 ± 0.021
0.675MetMet: 0.675 ± 0.01
1.322MetAsn: 1.322 ± 0.015
1.229MetPro: 1.229 ± 0.026
1.153MetGln: 1.153 ± 0.012
1.261MetArg: 1.261 ± 0.012
1.726MetSer: 1.726 ± 0.016
1.252MetThr: 1.252 ± 0.012
1.369MetVal: 1.369 ± 0.014
0.237MetTrp: 0.237 ± 0.005
0.687MetTyr: 0.687 ± 0.01
0.0MetXaa: 0.0 ± 0.0
Asn
3.258AsnAla: 3.258 ± 0.021
1.441AsnCys: 1.441 ± 0.016
2.807AsnAsp: 2.807 ± 0.026
3.255AsnGlu: 3.255 ± 0.025
2.536AsnPhe: 2.536 ± 0.022
3.026AsnGly: 3.026 ± 0.028
1.04AsnHis: 1.04 ± 0.011
2.732AsnIle: 2.732 ± 0.021
2.515AsnLys: 2.515 ± 0.021
4.526AsnLeu: 4.526 ± 0.026
1.147AsnMet: 1.147 ± 0.013
2.933AsnAsn: 2.933 ± 0.032
1.922AsnPro: 1.922 ± 0.018
1.803AsnGln: 1.803 ± 0.017
2.666AsnArg: 2.666 ± 0.02
4.307AsnSer: 4.307 ± 0.029
2.263AsnThr: 2.263 ± 0.019
3.465AsnVal: 3.465 ± 0.024
0.619AsnTrp: 0.619 ± 0.009
1.69AsnTyr: 1.69 ± 0.016
0.0AsnXaa: 0.0 ± 0.0
Pro
2.867ProAla: 2.867 ± 0.024
0.936ProCys: 0.936 ± 0.017
2.243ProAsp: 2.243 ± 0.019
2.625ProGlu: 2.625 ± 0.026
1.981ProPhe: 1.981 ± 0.019
2.383ProGly: 2.383 ± 0.06
0.922ProHis: 0.922 ± 0.012
2.143ProIle: 2.143 ± 0.017
2.365ProLys: 2.365 ± 0.018
4.071ProLeu: 4.071 ± 0.025
0.943ProMet: 0.943 ± 0.011
2.092ProAsn: 2.092 ± 0.017
2.995ProPro: 2.995 ± 0.041
1.564ProGln: 1.564 ± 0.029
1.977ProArg: 1.977 ± 0.018
3.804ProSer: 3.804 ± 0.031
2.648ProThr: 2.648 ± 0.019
3.133ProVal: 3.133 ± 0.026
0.482ProTrp: 0.482 ± 0.009
1.377ProTyr: 1.377 ± 0.015
0.0ProXaa: 0.0 ± 0.0
Gln
2.634GlnAla: 2.634 ± 0.025
1.167GlnCys: 1.167 ± 0.017
1.478GlnAsp: 1.478 ± 0.014
2.144GlnGlu: 2.144 ± 0.017
1.954GlnPhe: 1.954 ± 0.016
1.524GlnGly: 1.524 ± 0.02
1.211GlnHis: 1.211 ± 0.015
2.389GlnIle: 2.389 ± 0.017
2.499GlnLys: 2.499 ± 0.021
4.87GlnLeu: 4.87 ± 0.029
1.202GlnMet: 1.202 ± 0.014
2.093GlnAsn: 2.093 ± 0.016
1.892GlnPro: 1.892 ± 0.02
3.571GlnGln: 3.571 ± 0.077
2.674GlnArg: 2.674 ± 0.019
3.283GlnSer: 3.283 ± 0.023
2.118GlnThr: 2.118 ± 0.017
2.105GlnVal: 2.105 ± 0.019
0.597GlnTrp: 0.597 ± 0.009
1.291GlnTyr: 1.291 ± 0.013
0.0GlnXaa: 0.0 ± 0.0
Arg
3.057ArgAla: 3.057 ± 0.024
1.631ArgCys: 1.631 ± 0.021
2.455ArgAsp: 2.455 ± 0.02
3.021ArgGlu: 3.021 ± 0.026
2.579ArgPhe: 2.579 ± 0.02
2.512ArgGly: 2.512 ± 0.025
1.554ArgHis: 1.554 ± 0.014
3.269ArgIle: 3.269 ± 0.02
3.75ArgLys: 3.75 ± 0.027
5.85ArgLeu: 5.85 ± 0.036
1.422ArgMet: 1.422 ± 0.013
2.726ArgAsn: 2.726 ± 0.019
2.374ArgPro: 2.374 ± 0.021
2.621ArgGln: 2.621 ± 0.023
4.821ArgArg: 4.821 ± 0.044
4.494ArgSer: 4.494 ± 0.031
2.803ArgThr: 2.803 ± 0.022
3.016ArgVal: 3.016 ± 0.024
0.808ArgTrp: 0.808 ± 0.012
1.719ArgTyr: 1.719 ± 0.015
0.0ArgXaa: 0.0 ± 0.0
Ser
5.549SerAla: 5.549 ± 0.029
2.101SerCys: 2.101 ± 0.024
4.559SerAsp: 4.559 ± 0.028
4.864SerGlu: 4.864 ± 0.033
3.62SerPhe: 3.62 ± 0.023
4.594SerGly: 4.594 ± 0.029
1.66SerHis: 1.66 ± 0.015
4.191SerIle: 4.191 ± 0.026
4.728SerLys: 4.728 ± 0.03
7.408SerLeu: 7.408 ± 0.033
1.826SerMet: 1.826 ± 0.016
4.193SerAsn: 4.193 ± 0.03
3.561SerPro: 3.561 ± 0.033
2.865SerGln: 2.865 ± 0.021
4.344SerArg: 4.344 ± 0.034
9.633SerSer: 9.633 ± 0.088
5.264SerThr: 5.264 ± 0.032
5.674SerVal: 5.674 ± 0.031
0.933SerTrp: 0.933 ± 0.012
2.225SerTyr: 2.225 ± 0.018
0.001SerXaa: 0.001 ± 0.0
Thr
4.15ThrAla: 4.15 ± 0.031
1.334ThrCys: 1.334 ± 0.022
2.823ThrAsp: 2.823 ± 0.019
3.155ThrGlu: 3.155 ± 0.025
2.401ThrPhe: 2.401 ± 0.019
2.918ThrGly: 2.918 ± 0.024
0.955ThrHis: 0.955 ± 0.014
2.851ThrIle: 2.851 ± 0.019
2.745ThrLys: 2.745 ± 0.02
4.986ThrLeu: 4.986 ± 0.03
1.278ThrMet: 1.278 ± 0.011
2.469ThrAsn: 2.469 ± 0.019
2.375ThrPro: 2.375 ± 0.02
1.489ThrGln: 1.489 ± 0.016
2.246ThrArg: 2.246 ± 0.02
4.515ThrSer: 4.515 ± 0.028
3.866ThrThr: 3.866 ± 0.035
4.457ThrVal: 4.457 ± 0.027
0.562ThrTrp: 0.562 ± 0.01
1.386ThrTyr: 1.386 ± 0.015
0.001ThrXaa: 0.001 ± 0.0
Val
4.277ValAla: 4.277 ± 0.028
1.681ValCys: 1.681 ± 0.019
4.136ValAsp: 4.136 ± 0.026
4.681ValGlu: 4.681 ± 0.033
2.624ValPhe: 2.624 ± 0.021
3.417ValGly: 3.417 ± 0.027
1.778ValHis: 1.778 ± 0.018
3.511ValIle: 3.511 ± 0.023
3.798ValLys: 3.798 ± 0.023
5.732ValLeu: 5.732 ± 0.034
1.395ValMet: 1.395 ± 0.013
3.036ValAsn: 3.036 ± 0.02
2.745ValPro: 2.745 ± 0.019
2.768ValGln: 2.768 ± 0.021
3.354ValArg: 3.354 ± 0.025
4.759ValSer: 4.759 ± 0.028
3.28ValThr: 3.28 ± 0.022
4.631ValVal: 4.631 ± 0.031
0.742ValTrp: 0.742 ± 0.01
1.865ValTyr: 1.865 ± 0.017
0.001ValXaa: 0.001 ± 0.0
Trp
0.62TrpAla: 0.62 ± 0.01
0.283TrpCys: 0.283 ± 0.006
0.548TrpAsp: 0.548 ± 0.01
0.563TrpGlu: 0.563 ± 0.009
0.574TrpPhe: 0.574 ± 0.012
0.401TrpGly: 0.401 ± 0.008
0.323TrpHis: 0.323 ± 0.006
0.797TrpIle: 0.797 ± 0.01
0.952TrpLys: 0.952 ± 0.012
1.331TrpLeu: 1.331 ± 0.015
0.347TrpMet: 0.347 ± 0.006
0.776TrpAsn: 0.776 ± 0.009
0.512TrpPro: 0.512 ± 0.009
0.558TrpGln: 0.558 ± 0.008
0.751TrpArg: 0.751 ± 0.01
1.029TrpSer: 1.029 ± 0.013
0.696TrpThr: 0.696 ± 0.01
0.504TrpVal: 0.504 ± 0.009
0.189TrpTrp: 0.189 ± 0.005
0.413TrpTyr: 0.413 ± 0.011
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.879TyrAla: 1.879 ± 0.015
0.987TyrCys: 0.987 ± 0.015
1.626TyrAsp: 1.626 ± 0.015
1.798TyrGlu: 1.798 ± 0.015
1.752TyrPhe: 1.752 ± 0.017
1.79TyrGly: 1.79 ± 0.019
0.817TyrHis: 0.817 ± 0.01
1.597TyrIle: 1.597 ± 0.016
1.656TyrLys: 1.656 ± 0.017
2.979TyrLeu: 2.979 ± 0.026
0.711TyrMet: 0.711 ± 0.011
1.478TyrAsn: 1.478 ± 0.014
1.322TyrPro: 1.322 ± 0.015
1.175TyrGln: 1.175 ± 0.011
1.805TyrArg: 1.805 ± 0.017
2.535TyrSer: 2.535 ± 0.022
1.446TyrThr: 1.446 ± 0.016
1.882TyrVal: 1.882 ± 0.018
0.457TyrTrp: 0.457 ± 0.01
1.225TyrTyr: 1.225 ± 0.015
0.001TyrXaa: 0.001 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.001XaaAsp: 0.001 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.001XaaPhe: 0.001 ± 0.0
0.001XaaGly: 0.001 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.001XaaLeu: 0.001 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.002XaaArg: 0.002 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.001XaaTyr: 0.001 ± 0.0
0.025XaaXaa: 0.025 ± 0.005
Statistics based on 16771 proteins (8159799 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski