Amino acid dipeptide frequency for Trichinella pseudospiralis (Parasitic roundworm)

Apart from single amino acid frequencies, one can also calculate so-called amino acid dipeptide frequencies.
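As a rough illustration of what such a calculation could look like, the sketch below counts overlapping residue pairs within each protein and reports them in per mille. The function name `dipeptide_permille`, the one-letter residue codes, and the choice to normalize by the total number of pairs are assumptions made for this example; the page does not describe the exact procedure used by the database.

```python
from collections import Counter


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumption: dipeptides are overlapping pairs of adjacent residues, and
    pairs never span two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        # zip(seq, seq[1:]) yields every adjacent residue pair in one protein.
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, one-letter codes).
freqs = dipeptide_permille(["MKLLA", "LLK"])
```

In this toy input there are six pairs in total and "LL" occurs twice, so it accounts for one third of all dipeptides.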

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
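The arithmetic behind this baseline can be sketched as follows. The helper names `uniform_expectation_permille` and `classify` are hypothetical, and treating the uniform expectation as the exact threshold for the red and blue marking is an assumption; the page does not state how the coloring is decided.

```python
def uniform_expectation_permille(alphabet_size=20):
    """Expected per mille frequency of each dipeptide if all were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_permille, expected_permille):
    """Label a dipeptide relative to the uniform baseline (assumed threshold)."""
    if observed_permille > expected_permille:
        return "over"
    if observed_permille < expected_permille:
        return "under"
    return "equal"


expected_20 = uniform_expectation_permille(20)  # 400 dipeptides
expected_21 = uniform_expectation_permille(21)  # 441 dipeptides, including Xaa

# Two values taken from the table on this page.
leu_leu = classify(11.057, expected_20)
trp_trp = classify(0.183, expected_20)
```

With 20 amino acids the baseline is 2.5‰ (0.25%); with Xaa included it drops to 1000/441, roughly 2.27‰. Under this assumed rule LeuLeu is overrepresented and TrpTrp is underrepresented.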

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.406 ± 0.043
AlaCys: 1.548 ± 0.018
AlaAsp: 3.583 ± 0.024
AlaGlu: 4.526 ± 0.031
AlaPhe: 2.812 ± 0.024
AlaGly: 3.317 ± 0.029
AlaHis: 1.303 ± 0.014
AlaIle: 3.473 ± 0.025
AlaLys: 3.723 ± 0.026
AlaLeu: 6.311 ± 0.037
AlaMet: 1.585 ± 0.019
AlaAsn: 2.937 ± 0.022
AlaPro: 2.252 ± 0.024
AlaGln: 2.213 ± 0.017
AlaArg: 2.836 ± 0.023
AlaSer: 5.149 ± 0.032
AlaThr: 3.446 ± 0.024
AlaVal: 4.965 ± 0.026
AlaTrp: 0.657 ± 0.009
AlaTyr: 1.859 ± 0.021
AlaXaa: 0.001 ± 0.0
Cys
CysAla: 1.417 ± 0.014
CysCys: 0.921 ± 0.016
CysAsp: 1.399 ± 0.024
CysGlu: 1.521 ± 0.022
CysPhe: 1.334 ± 0.015
CysGly: 1.451 ± 0.018
CysHis: 0.693 ± 0.013
CysIle: 1.595 ± 0.022
CysLys: 1.55 ± 0.02
CysLeu: 2.688 ± 0.026
CysMet: 0.576 ± 0.009
CysAsn: 1.279 ± 0.016
CysPro: 1.27 ± 0.026
CysGln: 1.134 ± 0.016
CysArg: 1.577 ± 0.023
CysSer: 2.487 ± 0.026
CysThr: 1.399 ± 0.018
CysVal: 1.542 ± 0.018
CysTrp: 0.366 ± 0.009
CysTyr: 0.799 ± 0.012
CysXaa: 0.002 ± 0.0
Asp
AspAla: 3.288 ± 0.023
AspCys: 1.406 ± 0.022
AspAsp: 3.995 ± 0.04
AspGlu: 4.279 ± 0.033
AspPhe: 2.422 ± 0.023
AspGly: 3.154 ± 0.033
AspHis: 1.192 ± 0.014
AspIle: 2.798 ± 0.021
AspLys: 2.608 ± 0.021
AspLeu: 4.732 ± 0.026
AspMet: 1.191 ± 0.013
AspAsn: 2.347 ± 0.021
AspPro: 2.037 ± 0.021
AspGln: 2.079 ± 0.021
AspArg: 2.678 ± 0.02
AspSer: 4.234 ± 0.026
AspThr: 2.129 ± 0.016
AspVal: 3.742 ± 0.025
AspTrp: 0.692 ± 0.012
AspTyr: 1.664 ± 0.017
AspXaa: 0.001 ± 0.0
Glu
GluAla: 3.864 ± 0.029
GluCys: 1.43 ± 0.024
GluAsp: 3.278 ± 0.025
GluGlu: 5.691 ± 0.053
GluPhe: 2.491 ± 0.022
GluGly: 2.344 ± 0.025
GluHis: 1.458 ± 0.017
GluIle: 3.965 ± 0.031
GluLys: 5.057 ± 0.036
GluLeu: 5.986 ± 0.04
GluMet: 1.937 ± 0.019
GluAsn: 4.108 ± 0.027
GluPro: 2.083 ± 0.021
GluGln: 3.098 ± 0.027
GluArg: 3.528 ± 0.03
GluSer: 4.49 ± 0.03
GluThr: 3.275 ± 0.022
GluVal: 3.52 ± 0.027
GluTrp: 0.666 ± 0.012
GluTyr: 1.804 ± 0.017
GluXaa: 0.001 ± 0.001
Phe
PheAla: 2.812 ± 0.023
PheCys: 1.414 ± 0.018
PheAsp: 2.606 ± 0.021
PheGlu: 2.685 ± 0.018
PhePhe: 2.269 ± 0.021
PheGly: 2.57 ± 0.022
PheHis: 1.232 ± 0.015
PheIle: 2.65 ± 0.025
PheLys: 2.277 ± 0.022
PheLeu: 4.476 ± 0.037
PheMet: 0.915 ± 0.014
PheAsn: 2.13 ± 0.02
PhePro: 1.856 ± 0.019
PheGln: 1.935 ± 0.016
PheArg: 2.255 ± 0.016
PheSer: 3.9 ± 0.026
PheThr: 2.417 ± 0.02
PheVal: 3.076 ± 0.025
PheTrp: 0.618 ± 0.011
PheTyr: 1.765 ± 0.018
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.72 ± 0.024
GlyCys: 1.289 ± 0.019
GlyAsp: 2.615 ± 0.025
GlyGlu: 2.89 ± 0.028
GlyPhe: 2.303 ± 0.022
GlyGly: 3.173 ± 0.039
GlyHis: 1.228 ± 0.015
GlyIle: 3.02 ± 0.025
GlyLys: 3.377 ± 0.023
GlyLeu: 4.166 ± 0.035
GlyMet: 1.246 ± 0.015
GlyAsn: 2.507 ± 0.021
GlyPro: 1.748 ± 0.033
GlyGln: 2.069 ± 0.021
GlyArg: 3.021 ± 0.028
GlySer: 4.175 ± 0.029
GlyThr: 2.568 ± 0.02
GlyVal: 2.97 ± 0.022
GlyTrp: 0.624 ± 0.01
GlyTyr: 1.797 ± 0.022
GlyXaa: 0.004 ± 0.001
His
HisAla: 1.406 ± 0.016
HisCys: 0.912 ± 0.014
HisAsp: 1.02 ± 0.013
HisGlu: 1.199 ± 0.015
HisPhe: 1.349 ± 0.015
HisGly: 1.306 ± 0.016
HisHis: 0.911 ± 0.016
HisIle: 1.283 ± 0.016
HisLys: 0.999 ± 0.012
HisLeu: 2.692 ± 0.022
HisMet: 0.523 ± 0.01
HisAsn: 0.994 ± 0.012
HisPro: 1.199 ± 0.013
HisGln: 1.095 ± 0.014
HisArg: 1.507 ± 0.016
HisSer: 2.023 ± 0.017
HisThr: 1.018 ± 0.014
HisVal: 1.544 ± 0.015
HisTrp: 0.367 ± 0.007
HisTyr: 0.893 ± 0.011
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.806 ± 0.025
IleCys: 1.715 ± 0.02
IleAsp: 2.988 ± 0.023
IleGlu: 3.408 ± 0.024
IlePhe: 2.836 ± 0.027
IleGly: 2.923 ± 0.024
IleHis: 1.301 ± 0.015
IleIle: 3.181 ± 0.028
IleLys: 2.88 ± 0.025
IleLeu: 5.481 ± 0.039
IleMet: 1.218 ± 0.013
IleAsn: 2.52 ± 0.022
IlePro: 2.557 ± 0.021
IleGln: 2.2 ± 0.021
IleArg: 3.119 ± 0.024
IleSer: 4.699 ± 0.029
IleThr: 2.847 ± 0.025
IleVal: 3.68 ± 0.023
IleTrp: 0.701 ± 0.011
IleTyr: 1.928 ± 0.018
IleXaa: 0.001 ± 0.0
Lys
LysAla: 3.691 ± 0.028
LysCys: 1.538 ± 0.018
LysAsp: 2.727 ± 0.023
LysGlu: 4.097 ± 0.034
LysPhe: 2.513 ± 0.02
LysGly: 2.306 ± 0.028
LysHis: 1.502 ± 0.017
LysIle: 3.624 ± 0.028
LysLys: 4.536 ± 0.043
LysLeu: 6.288 ± 0.036
LysMet: 1.77 ± 0.017
LysAsn: 3.221 ± 0.024
LysPro: 2.461 ± 0.028
LysGln: 2.919 ± 0.025
LysArg: 3.836 ± 0.028
LysSer: 4.565 ± 0.032
LysThr: 2.976 ± 0.025
LysVal: 3.429 ± 0.023
LysTrp: 0.734 ± 0.012
LysTyr: 1.899 ± 0.016
LysXaa: 0.001 ± 0.0
Leu
LeuAla: 6.082 ± 0.031
LeuCys: 2.556 ± 0.026
LeuAsp: 4.67 ± 0.03
LeuGlu: 5.864 ± 0.042
LeuPhe: 4.656 ± 0.036
LeuGly: 3.915 ± 0.028
LeuHis: 2.718 ± 0.02
LeuIle: 5.499 ± 0.035
LeuLys: 6.824 ± 0.04
LeuLeu: 11.057 ± 0.061
LeuMet: 2.347 ± 0.02
LeuAsn: 5.208 ± 0.03
LeuPro: 4.819 ± 0.032
LeuGln: 4.618 ± 0.03
LeuArg: 5.403 ± 0.04
LeuSer: 7.742 ± 0.038
LeuThr: 5.031 ± 0.031
LeuVal: 5.469 ± 0.034
LeuTrp: 1.105 ± 0.013
LeuTyr: 3.018 ± 0.022
LeuXaa: 0.002 ± 0.001
Met
MetAla: 1.62 ± 0.017
MetCys: 0.516 ± 0.009
MetAsp: 1.281 ± 0.012
MetGlu: 1.637 ± 0.016
MetPhe: 1.028 ± 0.013
MetGly: 0.908 ± 0.013
MetHis: 0.666 ± 0.009
MetIle: 1.363 ± 0.015
MetLys: 1.778 ± 0.017
MetLeu: 2.546 ± 0.021
MetMet: 0.719 ± 0.01
MetAsn: 1.362 ± 0.017
MetPro: 1.146 ± 0.014
MetGln: 1.188 ± 0.016
MetArg: 1.254 ± 0.014
MetSer: 1.771 ± 0.015
MetThr: 1.282 ± 0.014
MetVal: 1.403 ± 0.015
MetTrp: 0.24 ± 0.006
MetTyr: 0.667 ± 0.01
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.328 ± 0.023
AsnCys: 1.476 ± 0.018
AsnAsp: 2.821 ± 0.024
AsnGlu: 3.322 ± 0.025
AsnPhe: 2.527 ± 0.021
AsnGly: 2.971 ± 0.024
AsnHis: 1.035 ± 0.012
AsnIle: 2.761 ± 0.024
AsnLys: 2.56 ± 0.021
AsnLeu: 4.683 ± 0.027
AsnMet: 1.148 ± 0.013
AsnAsn: 3.114 ± 0.041
AsnPro: 1.941 ± 0.019
AsnGln: 1.819 ± 0.019
AsnArg: 2.596 ± 0.021
AsnSer: 4.269 ± 0.031
AsnThr: 2.229 ± 0.019
AsnVal: 3.479 ± 0.024
AsnTrp: 0.612 ± 0.01
AsnTyr: 1.747 ± 0.019
AsnXaa: 0.001 ± 0.0
Pro
ProAla: 2.776 ± 0.025
ProCys: 0.967 ± 0.017
ProAsp: 2.208 ± 0.021
ProGlu: 2.619 ± 0.024
ProPhe: 1.936 ± 0.02
ProGly: 2.398 ± 0.064
ProHis: 0.873 ± 0.014
ProIle: 2.097 ± 0.018
ProLys: 2.337 ± 0.02
ProLeu: 3.959 ± 0.03
ProMet: 0.947 ± 0.011
ProAsn: 2.018 ± 0.016
ProPro: 2.765 ± 0.038
ProGln: 1.504 ± 0.022
ProArg: 1.83 ± 0.021
ProSer: 3.727 ± 0.03
ProThr: 2.515 ± 0.023
ProVal: 3.091 ± 0.021
ProTrp: 0.472 ± 0.009
ProTyr: 1.373 ± 0.017
ProXaa: 0.002 ± 0.0
Gln
GlnAla: 2.617 ± 0.024
GlnCys: 1.224 ± 0.019
GlnAsp: 1.519 ± 0.016
GlnGlu: 2.266 ± 0.021
GlnPhe: 1.981 ± 0.017
GlnGly: 1.48 ± 0.015
GlnHis: 1.221 ± 0.015
GlnIle: 2.411 ± 0.02
GlnLys: 2.517 ± 0.02
GlnLeu: 4.932 ± 0.033
GlnMet: 1.202 ± 0.013
GlnAsn: 2.096 ± 0.02
GlnPro: 1.874 ± 0.022
GlnGln: 3.798 ± 0.075
GlnArg: 2.584 ± 0.021
GlnSer: 3.296 ± 0.026
GlnThr: 2.075 ± 0.019
GlnVal: 2.067 ± 0.019
GlnTrp: 0.585 ± 0.009
GlnTyr: 1.3 ± 0.014
GlnXaa: 0.001 ± 0.0
Arg
ArgAla: 2.869 ± 0.022
ArgCys: 1.655 ± 0.023
ArgAsp: 2.383 ± 0.019
ArgGlu: 2.972 ± 0.022
ArgPhe: 2.489 ± 0.021
ArgGly: 2.393 ± 0.024
ArgHis: 1.493 ± 0.016
ArgIle: 3.186 ± 0.024
ArgLys: 3.633 ± 0.026
ArgLeu: 5.753 ± 0.036
ArgMet: 1.454 ± 0.015
ArgAsn: 2.679 ± 0.022
ArgPro: 2.224 ± 0.021
ArgGln: 2.52 ± 0.022
ArgArg: 4.341 ± 0.037
ArgSer: 4.246 ± 0.038
ArgThr: 2.636 ± 0.024
ArgVal: 2.882 ± 0.022
ArgTrp: 0.795 ± 0.011
ArgTyr: 1.709 ± 0.015
ArgXaa: 0.001 ± 0.0
Ser
SerAla: 5.428 ± 0.03
SerCys: 2.106 ± 0.023
SerAsp: 4.5 ± 0.03
SerGlu: 4.97 ± 0.035
SerPhe: 3.557 ± 0.027
SerGly: 4.472 ± 0.029
SerHis: 1.643 ± 0.015
SerIle: 4.159 ± 0.026
SerLys: 4.679 ± 0.027
SerLeu: 7.574 ± 0.039
SerMet: 1.861 ± 0.016
SerAsn: 4.152 ± 0.027
SerPro: 3.423 ± 0.035
SerGln: 2.82 ± 0.026
SerArg: 4.123 ± 0.031
SerSer: 9.332 ± 0.07
SerThr: 5.108 ± 0.03
SerVal: 5.716 ± 0.032
SerTrp: 0.907 ± 0.012
SerTyr: 2.206 ± 0.02
SerXaa: 0.003 ± 0.001
Thr
ThrAla: 4.115 ± 0.03
ThrCys: 1.348 ± 0.02
ThrAsp: 2.77 ± 0.023
ThrGlu: 3.142 ± 0.024
ThrPhe: 2.372 ± 0.02
ThrGly: 2.847 ± 0.021
ThrHis: 0.919 ± 0.011
ThrIle: 2.861 ± 0.021
ThrLys: 2.718 ± 0.023
ThrLeu: 4.997 ± 0.03
ThrMet: 1.24 ± 0.014
ThrAsn: 2.431 ± 0.021
ThrPro: 2.266 ± 0.022
ThrGln: 1.454 ± 0.014
ThrArg: 2.117 ± 0.019
ThrSer: 4.455 ± 0.031
ThrThr: 3.783 ± 0.039
ThrVal: 4.393 ± 0.029
ThrTrp: 0.555 ± 0.01
ThrTyr: 1.405 ± 0.015
ThrXaa: 0.001 ± 0.0
Val
ValAla: 4.185 ± 0.028
ValCys: 1.668 ± 0.025
ValAsp: 4.072 ± 0.025
ValGlu: 4.58 ± 0.029
ValPhe: 2.65 ± 0.023
ValGly: 3.354 ± 0.023
ValHis: 1.737 ± 0.014
ValIle: 3.598 ± 0.025
ValLys: 3.88 ± 0.028
ValLeu: 5.868 ± 0.034
ValMet: 1.426 ± 0.015
ValAsn: 3.092 ± 0.02
ValPro: 2.641 ± 0.022
ValGln: 2.82 ± 0.022
ValArg: 3.221 ± 0.025
ValSer: 4.695 ± 0.025
ValThr: 3.24 ± 0.023
ValVal: 4.603 ± 0.033
ValTrp: 0.722 ± 0.011
ValTyr: 1.92 ± 0.018
ValXaa: 0.002 ± 0.0
Trp
TrpAla: 0.585 ± 0.01
TrpCys: 0.293 ± 0.007
TrpAsp: 0.565 ± 0.01
TrpGlu: 0.576 ± 0.009
TrpPhe: 0.562 ± 0.01
TrpGly: 0.386 ± 0.009
TrpHis: 0.333 ± 0.006
TrpIle: 0.81 ± 0.012
TrpLys: 0.967 ± 0.012
TrpLeu: 1.317 ± 0.016
TrpMet: 0.348 ± 0.007
TrpAsn: 0.756 ± 0.01
TrpPro: 0.48 ± 0.008
TrpGln: 0.545 ± 0.01
TrpArg: 0.704 ± 0.011
TrpSer: 0.999 ± 0.014
TrpThr: 0.712 ± 0.011
TrpVal: 0.512 ± 0.009
TrpTrp: 0.183 ± 0.006
TrpTyr: 0.433 ± 0.011
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.883 ± 0.017
TyrCys: 1.016 ± 0.014
TyrAsp: 1.668 ± 0.018
TyrGlu: 1.827 ± 0.019
TyrPhe: 1.769 ± 0.02
TyrGly: 1.844 ± 0.019
TyrHis: 0.78 ± 0.011
TyrIle: 1.654 ± 0.017
TyrLys: 1.732 ± 0.02
TyrLeu: 3.093 ± 0.023
TyrMet: 0.721 ± 0.008
TyrAsn: 1.491 ± 0.017
TyrPro: 1.369 ± 0.015
TyrGln: 1.219 ± 0.012
TyrArg: 1.782 ± 0.019
TyrSer: 2.485 ± 0.021
TyrThr: 1.501 ± 0.015
TyrVal: 1.89 ± 0.019
TyrTrp: 0.463 ± 0.01
TyrTyr: 1.266 ± 0.017
TyrXaa: 0.001 ± 0.0
Xaa
XaaAla: 0.004 ± 0.001
XaaCys: 0.002 ± 0.0
XaaAsp: 0.001 ± 0.0
XaaGlu: 0.002 ± 0.0
XaaPhe: 0.001 ± 0.0
XaaGly: 0.002 ± 0.001
XaaHis: 0.0 ± 0.0
XaaIle: 0.001 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.003 ± 0.001
XaaMet: 0.001 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.003 ± 0.001
XaaGln: 0.001 ± 0.0
XaaArg: 0.002 ± 0.001
XaaSer: 0.002 ± 0.001
XaaThr: 0.001 ± 0.0
XaaVal: 0.001 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.483 ± 0.042
Statistics based on 14914 proteins (6968983 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
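One plausible reading of a protein-level bootstrap is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the standard deviation across resamples is taken as the error. The function names, the fixed seed, and the use of the sample standard deviation are assumptions for this example and not a description of the database's actual pipeline.

```python
import random
import statistics


def pair_frequency_permille(sequences, pair):
    """Per mille frequency of one dipeptide among all overlapping pairs."""
    hits = total = 0
    for seq in sequences:
        for first, second in zip(seq, seq[1:]):
            total += 1
            if first + second == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_frequency_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy proteome (hypothetical sequences, one-letter codes).
toy = ["MKLLA", "LLK", "AAGLL", "KKKA"]
error = bootstrap_error(toy, "LL")
```

Resampling proteins rather than individual residues keeps each sequence intact, so pairs that cluster in a few proteins contribute to a larger error estimate.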

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski