Amino acid dipeptide frequency for Clonorchis sinensis (Chinese liver fluke)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
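As a rough illustration of how such a frequency could be computed, the sketch below counts overlapping pairs of adjacent residues in a list of protein sequences and reports each pair in per mille. It assumes one-letter residue codes and that pairs never span two proteins. The function name `dipeptide_per_mille` and the toy sequences are hypothetical, and the exact procedure used for this page is not stated here.

```python
from collections import Counter

def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        # Slide a window of width 2 along each protein; pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (hypothetical sequences, not taken from the proteome).
toy = ["ALAL", "LLA"]
freqs = dipeptide_per_mille(toy)
print(freqs)
```

In the toy input there are five overlapping pairs in total (AL twice, LA twice, LL once), so the frequencies sum to 1000 per mille.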

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
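The comparison against the uniform expectation can be sketched as follows. The threshold used here (a plain greater-than or less-than comparison with 1/400) is an assumption, since the page does not state the exact colouring rule. The function name `classify` is hypothetical, and the example values are copied from the table.

```python
UNIFORM_400 = 1000.0 / 400   # 2.5 per mille, 20 standard residues
UNIFORM_441 = 1000.0 / 441   # about 2.268 per mille when Xaa is included

def classify(per_mille, expected=UNIFORM_400):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if per_mille > expected:
        return "over"
    if per_mille < expected:
        return "under"
    return "as expected"

# Values copied from the table on this page (per mille).
examples = {"LeuLeu": 10.2, "SerSer": 10.551, "TrpTrp": 0.202, "AlaPhe": 2.545}
labels = {pair: classify(value) for pair, value in examples.items()}
print(labels)
```

A uniform baseline ignores the fact that single amino acids are themselves unequally frequent. A baseline built from the product of the two single-residue frequencies would be a different, stricter comparison.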

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
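A small sketch of the unit conversion, using the AlaAla value from the table. The rough count at the end assumes that dipeptides are counted as overlapping windows within each protein, so that the number of windows equals residues minus proteins. That counting scheme is an assumption, not something the page states, and the helper names are hypothetical.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0

ala_ala = 5.795  # AlaAla frequency in per mille, from the table

fraction = per_mille_to_fraction(ala_ala)
percent = per_mille_to_percent(ala_ala)

# Assumed window count: every protein of length n contributes n - 1 pairs.
proteins = 14505
residues = 6939493
windows = residues - proteins
approx_count = fraction * windows

print(fraction, percent, windows, round(approx_count))
```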

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.795 ± 0.058
AlaCys: 1.421 ± 0.018
AlaAsp: 3.345 ± 0.024
AlaGlu: 4.229 ± 0.056
AlaPhe: 2.545 ± 0.021
AlaGly: 3.762 ± 0.029
AlaHis: 1.687 ± 0.016
AlaIle: 3.043 ± 0.022
AlaLys: 2.936 ± 0.025
AlaLeu: 6.251 ± 0.039
AlaMet: 1.375 ± 0.015
AlaAsn: 2.733 ± 0.023
AlaPro: 3.385 ± 0.025
AlaGln: 2.737 ± 0.025
AlaArg: 4.097 ± 0.03
AlaSer: 6.252 ± 0.035
AlaThr: 4.409 ± 0.034
AlaVal: 4.755 ± 0.031
AlaTrp: 0.804 ± 0.013
AlaTyr: 1.775 ± 0.017
AlaXaa: 0.001 ± 0.0
Cys
CysAla: 1.44 ± 0.015
CysCys: 0.643 ± 0.011
CysAsp: 1.052 ± 0.015
CysGlu: 1.148 ± 0.015
CysPhe: 0.948 ± 0.012
CysGly: 1.329 ± 0.015
CysHis: 0.665 ± 0.011
CysIle: 1.127 ± 0.015
CysLys: 0.882 ± 0.013
CysLeu: 2.597 ± 0.026
CysMet: 0.423 ± 0.009
CysAsn: 0.756 ± 0.012
CysPro: 1.419 ± 0.018
CysGln: 0.919 ± 0.015
CysArg: 1.432 ± 0.018
CysSer: 2.098 ± 0.021
CysThr: 1.304 ± 0.016
CysVal: 1.448 ± 0.018
CysTrp: 0.28 ± 0.007
CysTyr: 0.584 ± 0.011
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.336 ± 0.025
AspCys: 1.127 ± 0.013
AspAsp: 3.354 ± 0.155
AspGlu: 3.517 ± 0.05
AspPhe: 1.896 ± 0.019
AspGly: 3.055 ± 0.079
AspHis: 1.246 ± 0.014
AspIle: 2.385 ± 0.018
AspLys: 1.987 ± 0.02
AspLeu: 4.708 ± 0.03
AspMet: 0.972 ± 0.013
AspAsn: 1.745 ± 0.018
AspPro: 2.783 ± 0.021
AspGln: 1.979 ± 0.02
AspArg: 3.397 ± 0.024
AspSer: 4.508 ± 0.028
AspThr: 2.715 ± 0.019
AspVal: 3.247 ± 0.025
AspTrp: 0.722 ± 0.011
AspTyr: 1.386 ± 0.014
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.091 ± 0.059
GluCys: 1.103 ± 0.014
GluAsp: 3.05 ± 0.05
GluGlu: 4.159 ± 0.074
GluPhe: 2.131 ± 0.02
GluGly: 2.487 ± 0.023
GluHis: 1.564 ± 0.015
GluIle: 2.613 ± 0.021
GluLys: 2.94 ± 0.028
GluLeu: 5.727 ± 0.044
GluMet: 1.259 ± 0.016
GluAsn: 2.506 ± 0.021
GluPro: 2.778 ± 0.027
GluGln: 2.689 ± 0.026
GluArg: 4.036 ± 0.029
GluSer: 4.302 ± 0.029
GluThr: 3.5 ± 0.028
GluVal: 3.45 ± 0.03
GluTrp: 0.632 ± 0.01
GluTyr: 1.451 ± 0.017
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.48 ± 0.023
PheCys: 0.929 ± 0.014
PheAsp: 2.002 ± 0.02
PheGlu: 2.091 ± 0.018
PhePhe: 1.434 ± 0.017
PheGly: 2.516 ± 0.022
PheHis: 1.101 ± 0.012
PheIle: 1.804 ± 0.021
PheLys: 1.411 ± 0.015
PheLeu: 3.702 ± 0.028
PheMet: 0.701 ± 0.01
PheAsn: 1.406 ± 0.015
PhePro: 2.014 ± 0.019
PheGln: 1.571 ± 0.017
PheArg: 2.674 ± 0.023
PheSer: 3.499 ± 0.025
PheThr: 2.494 ± 0.02
PheVal: 2.607 ± 0.02
PheTrp: 0.502 ± 0.011
PheTyr: 1.148 ± 0.016
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.387 ± 0.024
GlyCys: 1.397 ± 0.017
GlyAsp: 2.75 ± 0.029
GlyGlu: 2.903 ± 0.035
GlyPhe: 2.34 ± 0.02
GlyGly: 3.566 ± 0.052
GlyHis: 1.553 ± 0.017
GlyIle: 2.688 ± 0.024
GlyLys: 2.413 ± 0.021
GlyLeu: 5.234 ± 0.03
GlyMet: 1.319 ± 0.015
GlyAsn: 2.105 ± 0.019
GlyPro: 2.876 ± 0.051
GlyGln: 2.475 ± 0.021
GlyArg: 3.782 ± 0.03
GlySer: 5.795 ± 0.04
GlyThr: 3.446 ± 0.03
GlyVal: 3.401 ± 0.029
GlyTrp: 0.895 ± 0.079
GlyTyr: 1.553 ± 0.019
GlyXaa: 0.001 ± 0.0
His
HisAla: 1.623 ± 0.016
HisCys: 0.73 ± 0.011
HisAsp: 1.053 ± 0.013
HisGlu: 1.332 ± 0.013
HisPhe: 1.135 ± 0.012
HisGly: 1.481 ± 0.015
HisHis: 0.958 ± 0.018
HisIle: 1.288 ± 0.015
HisLys: 1.155 ± 0.015
HisLeu: 3.165 ± 0.025
HisMet: 0.571 ± 0.009
HisAsn: 0.942 ± 0.012
HisPro: 1.804 ± 0.021
HisGln: 1.339 ± 0.016
HisArg: 2.195 ± 0.019
HisSer: 2.612 ± 0.025
HisThr: 1.696 ± 0.016
HisVal: 1.724 ± 0.018
HisTrp: 0.414 ± 0.007
HisTyr: 0.765 ± 0.009
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.903 ± 0.022
IleCys: 1.15 ± 0.014
IleAsp: 2.296 ± 0.018
IleGlu: 2.338 ± 0.021
IlePhe: 1.83 ± 0.021
IleGly: 2.473 ± 0.02
IleHis: 1.405 ± 0.016
IleIle: 2.308 ± 0.157
IleLys: 1.871 ± 0.019
IleLeu: 4.524 ± 0.036
IleMet: 0.849 ± 0.013
IleAsn: 1.83 ± 0.023
IlePro: 3.046 ± 0.022
IleGln: 2.113 ± 0.018
IleArg: 3.337 ± 0.023
IleSer: 4.319 ± 0.025
IleThr: 2.753 ± 0.022
IleVal: 2.755 ± 0.02
IleTrp: 0.619 ± 0.01
IleTyr: 1.371 ± 0.023
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.927 ± 0.027
LysCys: 0.919 ± 0.014
LysAsp: 1.994 ± 0.019
LysGlu: 2.546 ± 0.026
LysPhe: 1.512 ± 0.016
LysGly: 2.045 ± 0.028
LysHis: 1.373 ± 0.016
LysIle: 1.949 ± 0.019
LysLys: 2.374 ± 0.026
LysLeu: 4.5 ± 0.031
LysMet: 0.937 ± 0.011
LysAsn: 1.719 ± 0.015
LysPro: 2.813 ± 0.025
LysGln: 2.149 ± 0.019
LysArg: 3.513 ± 0.029
LysSer: 3.795 ± 0.029
LysThr: 2.724 ± 0.021
LysVal: 2.663 ± 0.024
LysTrp: 0.554 ± 0.01
LysTyr: 1.164 ± 0.014
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.658 ± 0.045
LeuCys: 2.153 ± 0.022
LeuAsp: 5.042 ± 0.031
LeuGlu: 5.677 ± 0.047
LeuPhe: 3.95 ± 0.038
LeuGly: 5.274 ± 0.03
LeuHis: 2.83 ± 0.023
LeuIle: 4.506 ± 0.034
LeuLys: 4.402 ± 0.028
LeuLeu: 10.2 ± 0.079
LeuMet: 1.902 ± 0.016
LeuAsn: 4.162 ± 0.027
LeuPro: 6.359 ± 0.036
LeuGln: 4.277 ± 0.028
LeuArg: 6.841 ± 0.037
LeuSer: 9.145 ± 0.047
LeuThr: 6.063 ± 0.032
LeuVal: 6.138 ± 0.037
LeuTrp: 1.137 ± 0.015
LeuTyr: 2.37 ± 0.022
LeuXaa: 0.001 ± 0.0
Met
MetAla: 1.579 ± 0.016
MetCys: 0.428 ± 0.008
MetAsp: 1.221 ± 0.076
MetGlu: 1.27 ± 0.013
MetPhe: 0.818 ± 0.012
MetGly: 0.964 ± 0.013
MetHis: 0.582 ± 0.009
MetIle: 0.852 ± 0.012
MetLys: 0.993 ± 0.013
MetLeu: 1.922 ± 0.018
MetMet: 0.755 ± 0.05
MetAsn: 0.913 ± 0.012
MetPro: 1.141 ± 0.014
MetGln: 0.86 ± 0.01
MetArg: 1.319 ± 0.014
MetSer: 1.66 ± 0.016
MetThr: 1.243 ± 0.014
MetVal: 1.218 ± 0.013
MetTrp: 0.208 ± 0.005
MetTyr: 0.509 ± 0.009
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.742 ± 0.022
AsnCys: 0.908 ± 0.011
AsnAsp: 1.8 ± 0.021
AsnGlu: 2.248 ± 0.022
AsnPhe: 1.527 ± 0.016
AsnGly: 2.472 ± 0.023
AsnHis: 1.116 ± 0.013
AsnIle: 1.837 ± 0.019
AsnLys: 1.693 ± 0.017
AsnLeu: 4.148 ± 0.025
AsnMet: 0.822 ± 0.012
AsnAsn: 1.562 ± 0.018
AsnPro: 2.691 ± 0.02
AsnGln: 1.79 ± 0.019
AsnArg: 2.716 ± 0.021
AsnSer: 3.763 ± 0.029
AsnThr: 2.455 ± 0.018
AsnVal: 2.491 ± 0.02
AsnTrp: 0.547 ± 0.008
AsnTyr: 1.072 ± 0.014
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.811 ± 0.026
ProCys: 1.111 ± 0.018
ProAsp: 2.983 ± 0.02
ProGlu: 3.329 ± 0.026
ProPhe: 2.012 ± 0.017
ProGly: 3.808 ± 0.061
ProHis: 1.524 ± 0.017
ProIle: 2.605 ± 0.023
ProLys: 2.602 ± 0.021
ProLeu: 5.015 ± 0.032
ProMet: 1.027 ± 0.014
ProAsn: 2.689 ± 0.021
ProPro: 4.501 ± 0.074
ProGln: 2.204 ± 0.02
ProArg: 3.393 ± 0.029
ProSer: 6.407 ± 0.058
ProThr: 4.473 ± 0.038
ProVal: 4.224 ± 0.027
ProTrp: 0.594 ± 0.01
ProTyr: 1.395 ± 0.018
ProXaa: 0.001 ± 0.0
Gln
GlnAla: 2.81 ± 0.023
GlnCys: 0.935 ± 0.014
GlnAsp: 1.564 ± 0.015
GlnGlu: 2.042 ± 0.024
GlnPhe: 1.583 ± 0.017
GlnGly: 1.709 ± 0.022
GlnHis: 1.248 ± 0.016
GlnIle: 1.992 ± 0.018
GlnLys: 1.984 ± 0.019
GlnLeu: 5.218 ± 0.037
GlnMet: 1.001 ± 0.012
GlnAsn: 1.741 ± 0.018
GlnPro: 3.128 ± 0.029
GlnGln: 2.496 ± 0.047
GlnArg: 2.872 ± 0.02
GlnSer: 3.77 ± 0.031
GlnThr: 2.909 ± 0.024
GlnVal: 2.412 ± 0.022
GlnTrp: 0.66 ± 0.009
GlnTyr: 1.025 ± 0.014
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.905 ± 0.028
ArgCys: 1.654 ± 0.02
ArgAsp: 2.789 ± 0.021
ArgGlu: 3.477 ± 0.025
ArgPhe: 2.717 ± 0.019
ArgGly: 3.406 ± 0.032
ArgHis: 2.117 ± 0.019
ArgIle: 3.459 ± 0.028
ArgLys: 3.652 ± 0.029
ArgLeu: 7.861 ± 0.043
ArgMet: 1.483 ± 0.015
ArgAsn: 2.629 ± 0.021
ArgPro: 3.63 ± 0.028
ArgGln: 3.071 ± 0.025
ArgArg: 5.536 ± 0.041
ArgSer: 5.961 ± 0.04
ArgThr: 4.131 ± 0.025
ArgVal: 3.909 ± 0.029
ArgTrp: 0.965 ± 0.014
ArgTyr: 1.738 ± 0.018
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.512 ± 0.039
SerCys: 2.091 ± 0.022
SerAsp: 4.676 ± 0.03
SerGlu: 4.997 ± 0.032
SerPhe: 3.207 ± 0.024
SerGly: 5.884 ± 0.038
SerHis: 2.391 ± 0.021
SerIle: 4.044 ± 0.033
SerLys: 3.834 ± 0.023
SerLeu: 8.381 ± 0.045
SerMet: 1.767 ± 0.017
SerAsn: 3.928 ± 0.025
SerPro: 5.697 ± 0.056
SerGln: 3.612 ± 0.035
SerArg: 6.002 ± 0.04
SerSer: 10.551 ± 0.077
SerThr: 6.617 ± 0.043
SerVal: 6.52 ± 0.035
SerTrp: 1.108 ± 0.015
SerTyr: 2.011 ± 0.017
SerXaa: 0.001 ± 0.0
Thr
ThrAla: 4.562 ± 0.031
ThrCys: 1.288 ± 0.017
ThrAsp: 3.621 ± 0.027
ThrGlu: 3.703 ± 0.03
ThrPhe: 2.257 ± 0.019
ThrGly: 3.907 ± 0.029
ThrHis: 1.693 ± 0.017
ThrIle: 2.803 ± 0.018
ThrLys: 2.773 ± 0.024
ThrLeu: 5.496 ± 0.033
ThrMet: 1.223 ± 0.015
ThrAsn: 2.742 ± 0.023
ThrPro: 3.81 ± 0.033
ThrGln: 2.563 ± 0.02
ThrArg: 3.765 ± 0.026
ThrSer: 6.415 ± 0.039
ThrThr: 4.737 ± 0.087
ThrVal: 4.354 ± 0.028
ThrTrp: 0.805 ± 0.011
ThrTyr: 1.601 ± 0.021
ThrXaa: 0.001 ± 0.0
Val
ValAla: 4.37 ± 0.026
ValCys: 1.57 ± 0.02
ValAsp: 3.578 ± 0.028
ValGlu: 3.522 ± 0.036
ValPhe: 2.585 ± 0.023
ValGly: 3.562 ± 0.03
ValHis: 1.857 ± 0.017
ValIle: 2.895 ± 0.025
ValLys: 2.626 ± 0.023
ValLeu: 6.052 ± 0.035
ValMet: 1.172 ± 0.015
ValAsn: 2.717 ± 0.023
ValPro: 3.686 ± 0.026
ValGln: 2.6 ± 0.017
ValArg: 4.342 ± 0.026
ValSer: 5.669 ± 0.037
ValThr: 4.085 ± 0.026
ValVal: 4.199 ± 0.032
ValTrp: 0.731 ± 0.012
ValTyr: 1.908 ± 0.016
ValXaa: 0.001 ± 0.0
Trp
TrpAla: 0.651 ± 0.009
TrpCys: 0.34 ± 0.008
TrpAsp: 0.615 ± 0.011
TrpGlu: 0.535 ± 0.01
TrpPhe: 0.525 ± 0.009
TrpGly: 0.553 ± 0.009
TrpHis: 0.339 ± 0.007
TrpIle: 0.741 ± 0.011
TrpLys: 0.636 ± 0.01
TrpLeu: 1.617 ± 0.02
TrpMet: 0.353 ± 0.075
TrpAsn: 0.621 ± 0.01
TrpPro: 0.685 ± 0.012
TrpGln: 0.553 ± 0.009
TrpArg: 0.953 ± 0.015
TrpSer: 1.07 ± 0.014
TrpThr: 0.804 ± 0.012
TrpVal: 0.625 ± 0.01
TrpTrp: 0.202 ± 0.006
TrpTyr: 0.325 ± 0.008
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.753 ± 0.016
TyrCys: 0.597 ± 0.009
TyrAsp: 1.281 ± 0.016
TyrGlu: 1.402 ± 0.019
TyrPhe: 1.136 ± 0.015
TyrGly: 1.626 ± 0.018
TyrHis: 0.731 ± 0.011
TyrIle: 1.15 ± 0.015
TyrLys: 1.004 ± 0.013
TyrLeu: 2.77 ± 0.03
TyrMet: 0.547 ± 0.008
TyrAsn: 0.957 ± 0.014
TyrPro: 1.46 ± 0.015
TyrGln: 1.104 ± 0.012
TyrArg: 1.887 ± 0.02
TyrSer: 2.238 ± 0.019
TyrThr: 1.56 ± 0.014
TyrVal: 1.585 ± 0.016
TyrTrp: 0.363 ± 0.008
TyrTyr: 0.843 ± 0.015
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.001 ± 0.001
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.001 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.001 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.001 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.164 ± 0.044
Statistics based on 14505 proteins (6939493 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski