Amino acid dipepetide frequency for Opisthorchis viverrini

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.834AlaAla: 5.834 ± 0.055
1.444AlaCys: 1.444 ± 0.016
3.457AlaAsp: 3.457 ± 0.028
4.174AlaGlu: 4.174 ± 0.036
2.506AlaPhe: 2.506 ± 0.02
3.756AlaGly: 3.756 ± 0.033
1.748AlaHis: 1.748 ± 0.018
3.103AlaIle: 3.103 ± 0.023
2.981AlaLys: 2.981 ± 0.023
6.221AlaLeu: 6.221 ± 0.039
1.433AlaMet: 1.433 ± 0.015
2.807AlaAsn: 2.807 ± 0.02
3.478AlaPro: 3.478 ± 0.031
2.696AlaGln: 2.696 ± 0.023
4.065AlaArg: 4.065 ± 0.026
6.287AlaSer: 6.287 ± 0.038
4.442AlaThr: 4.442 ± 0.033
4.713AlaVal: 4.713 ± 0.031
0.767AlaTrp: 0.767 ± 0.011
1.824AlaTyr: 1.824 ± 0.017
0.002AlaXaa: 0.002 ± 0.001
Cys
1.422CysAla: 1.422 ± 0.013
0.601CysCys: 0.601 ± 0.013
1.051CysAsp: 1.051 ± 0.015
1.115CysGlu: 1.115 ± 0.013
0.913CysPhe: 0.913 ± 0.011
1.34CysGly: 1.34 ± 0.016
0.629CysHis: 0.629 ± 0.01
1.067CysIle: 1.067 ± 0.012
0.838CysLys: 0.838 ± 0.012
2.459CysLeu: 2.459 ± 0.023
0.412CysMet: 0.412 ± 0.008
0.756CysAsn: 0.756 ± 0.012
1.356CysPro: 1.356 ± 0.018
0.897CysGln: 0.897 ± 0.014
1.389CysArg: 1.389 ± 0.017
2.051CysSer: 2.051 ± 0.023
1.279CysThr: 1.279 ± 0.014
1.404CysVal: 1.404 ± 0.02
0.282CysTrp: 0.282 ± 0.008
0.56CysTyr: 0.56 ± 0.011
0.0CysXaa: 0.0 ± 0.0
Asp
3.413AspAla: 3.413 ± 0.025
1.118AspCys: 1.118 ± 0.018
3.545AspAsp: 3.545 ± 0.09
3.635AspGlu: 3.635 ± 0.061
1.904AspPhe: 1.904 ± 0.017
3.109AspGly: 3.109 ± 0.04
1.23AspHis: 1.23 ± 0.016
2.437AspIle: 2.437 ± 0.021
2.068AspLys: 2.068 ± 0.023
4.85AspLeu: 4.85 ± 0.03
1.001AspMet: 1.001 ± 0.014
1.825AspAsn: 1.825 ± 0.02
2.925AspPro: 2.925 ± 0.023
2.074AspGln: 2.074 ± 0.019
3.313AspArg: 3.313 ± 0.022
4.63AspSer: 4.63 ± 0.03
2.812AspThr: 2.812 ± 0.023
3.377AspVal: 3.377 ± 0.029
0.72AspTrp: 0.72 ± 0.012
1.44AspTyr: 1.44 ± 0.016
0.001AspXaa: 0.001 ± 0.0
Glu
4.176GluAla: 4.176 ± 0.033
1.122GluCys: 1.122 ± 0.015
3.137GluAsp: 3.137 ± 0.059
4.261GluGlu: 4.261 ± 0.047
2.143GluPhe: 2.143 ± 0.021
2.481GluGly: 2.481 ± 0.023
1.567GluHis: 1.567 ± 0.016
2.638GluIle: 2.638 ± 0.022
2.996GluLys: 2.996 ± 0.031
5.905GluLeu: 5.905 ± 0.04
1.256GluMet: 1.256 ± 0.013
2.563GluAsn: 2.563 ± 0.022
2.824GluPro: 2.824 ± 0.026
2.783GluGln: 2.783 ± 0.026
3.853GluArg: 3.853 ± 0.031
4.457GluSer: 4.457 ± 0.033
3.505GluThr: 3.505 ± 0.029
3.474GluVal: 3.474 ± 0.03
0.641GluTrp: 0.641 ± 0.017
1.509GluTyr: 1.509 ± 0.019
0.0GluXaa: 0.0 ± 0.0
Phe
2.456PheAla: 2.456 ± 0.019
0.897PheCys: 0.897 ± 0.012
1.988PheAsp: 1.988 ± 0.018
2.087PheGlu: 2.087 ± 0.021
1.406PhePhe: 1.406 ± 0.018
2.456PheGly: 2.456 ± 0.021
1.123PheHis: 1.123 ± 0.012
1.826PheIle: 1.826 ± 0.018
1.411PheLys: 1.411 ± 0.015
3.652PheLeu: 3.652 ± 0.029
0.699PheMet: 0.699 ± 0.012
1.431PheAsn: 1.431 ± 0.014
2.067PhePro: 2.067 ± 0.019
1.558PheGln: 1.558 ± 0.014
2.53PheArg: 2.53 ± 0.019
3.448PheSer: 3.448 ± 0.024
2.415PheThr: 2.415 ± 0.02
2.556PheVal: 2.556 ± 0.019
0.479PheTrp: 0.479 ± 0.009
1.159PheTyr: 1.159 ± 0.013
0.0PheXaa: 0.0 ± 0.0
Gly
3.447GlyAla: 3.447 ± 0.026
1.299GlyCys: 1.299 ± 0.016
2.853GlyAsp: 2.853 ± 0.03
2.927GlyGlu: 2.927 ± 0.042
2.269GlyPhe: 2.269 ± 0.02
3.501GlyGly: 3.501 ± 0.038
1.567GlyHis: 1.567 ± 0.016
2.716GlyIle: 2.716 ± 0.025
2.457GlyLys: 2.457 ± 0.022
5.231GlyLeu: 5.231 ± 0.031
1.199GlyMet: 1.199 ± 0.015
2.114GlyAsn: 2.114 ± 0.017
2.949GlyPro: 2.949 ± 0.053
2.404GlyGln: 2.404 ± 0.023
3.769GlyArg: 3.769 ± 0.028
5.574GlySer: 5.574 ± 0.037
3.519GlyThr: 3.519 ± 0.023
3.417GlyVal: 3.417 ± 0.033
0.743GlyTrp: 0.743 ± 0.02
1.58GlyTyr: 1.58 ± 0.019
0.001GlyXaa: 0.001 ± 0.0
His
1.644HisAla: 1.644 ± 0.016
0.711HisCys: 0.711 ± 0.011
1.088HisAsp: 1.088 ± 0.013
1.358HisGlu: 1.358 ± 0.016
1.138HisPhe: 1.138 ± 0.013
1.503HisGly: 1.503 ± 0.014
0.973HisHis: 0.973 ± 0.014
1.278HisIle: 1.278 ± 0.015
1.142HisLys: 1.142 ± 0.013
3.117HisLeu: 3.117 ± 0.024
0.58HisMet: 0.58 ± 0.009
0.971HisAsn: 0.971 ± 0.012
1.874HisPro: 1.874 ± 0.019
1.332HisGln: 1.332 ± 0.014
2.04HisArg: 2.04 ± 0.018
2.739HisSer: 2.739 ± 0.029
1.729HisThr: 1.729 ± 0.017
1.708HisVal: 1.708 ± 0.016
0.391HisTrp: 0.391 ± 0.008
0.782HisTyr: 0.782 ± 0.011
0.001HisXaa: 0.001 ± 0.0
Ile
2.936IleAla: 2.936 ± 0.023
1.115IleCys: 1.115 ± 0.015
2.353IleAsp: 2.353 ± 0.019
2.417IleGlu: 2.417 ± 0.022
1.836IlePhe: 1.836 ± 0.019
2.537IleGly: 2.537 ± 0.021
1.412IleHis: 1.412 ± 0.017
2.127IleIle: 2.127 ± 0.022
1.937IleLys: 1.937 ± 0.018
4.509IleLeu: 4.509 ± 0.03
0.878IleMet: 0.878 ± 0.013
1.865IleAsn: 1.865 ± 0.022
3.041IlePro: 3.041 ± 0.025
2.171IleGln: 2.171 ± 0.019
3.289IleArg: 3.289 ± 0.026
4.23IleSer: 4.23 ± 0.029
2.79IleThr: 2.79 ± 0.022
2.814IleVal: 2.814 ± 0.023
0.592IleTrp: 0.592 ± 0.01
1.343IleTyr: 1.343 ± 0.019
0.001IleXaa: 0.001 ± 0.0
Lys
2.927LysAla: 2.927 ± 0.026
0.904LysCys: 0.904 ± 0.013
2.053LysAsp: 2.053 ± 0.019
2.651LysGlu: 2.651 ± 0.023
1.515LysPhe: 1.515 ± 0.015
1.957LysGly: 1.957 ± 0.031
1.38LysHis: 1.38 ± 0.016
2.006LysIle: 2.006 ± 0.02
2.467LysLys: 2.467 ± 0.029
4.598LysLeu: 4.598 ± 0.032
0.953LysMet: 0.953 ± 0.012
1.73LysAsn: 1.73 ± 0.018
2.891LysPro: 2.891 ± 0.024
2.175LysGln: 2.175 ± 0.019
3.533LysArg: 3.533 ± 0.03
3.83LysSer: 3.83 ± 0.028
2.71LysThr: 2.71 ± 0.019
2.563LysVal: 2.563 ± 0.022
0.527LysTrp: 0.527 ± 0.009
1.201LysTyr: 1.201 ± 0.016
0.001LysXaa: 0.001 ± 0.0
Leu
6.737LeuAla: 6.737 ± 0.039
2.159LeuCys: 2.159 ± 0.021
5.148LeuAsp: 5.148 ± 0.034
5.63LeuGlu: 5.63 ± 0.043
3.87LeuPhe: 3.87 ± 0.03
5.04LeuGly: 5.04 ± 0.032
2.833LeuHis: 2.833 ± 0.027
4.557LeuIle: 4.557 ± 0.033
4.38LeuLys: 4.38 ± 0.033
10.036LeuLeu: 10.036 ± 0.07
1.943LeuMet: 1.943 ± 0.02
4.125LeuAsn: 4.125 ± 0.024
6.121LeuPro: 6.121 ± 0.034
4.301LeuGln: 4.301 ± 0.03
6.723LeuArg: 6.723 ± 0.041
9.036LeuSer: 9.036 ± 0.048
6.009LeuThr: 6.009 ± 0.036
6.073LeuVal: 6.073 ± 0.041
1.08LeuTrp: 1.08 ± 0.013
2.34LeuTyr: 2.34 ± 0.018
0.002LeuXaa: 0.002 ± 0.001
Met
1.502MetAla: 1.502 ± 0.017
0.427MetCys: 0.427 ± 0.007
1.22MetAsp: 1.22 ± 0.021
1.332MetGlu: 1.332 ± 0.014
0.798MetPhe: 0.798 ± 0.012
1.017MetGly: 1.017 ± 0.013
0.589MetHis: 0.589 ± 0.009
0.884MetIle: 0.884 ± 0.013
1.047MetLys: 1.047 ± 0.013
1.921MetLeu: 1.921 ± 0.019
0.503MetMet: 0.503 ± 0.012
0.974MetAsn: 0.974 ± 0.011
1.106MetPro: 1.106 ± 0.014
0.856MetGln: 0.856 ± 0.012
1.314MetArg: 1.314 ± 0.015
1.7MetSer: 1.7 ± 0.015
1.265MetThr: 1.265 ± 0.016
1.231MetVal: 1.231 ± 0.012
0.218MetTrp: 0.218 ± 0.006
0.51MetTyr: 0.51 ± 0.008
0.0MetXaa: 0.0 ± 0.0
Asn
2.763AsnAla: 2.763 ± 0.023
0.889AsnCys: 0.889 ± 0.012
1.852AsnAsp: 1.852 ± 0.021
2.346AsnGlu: 2.346 ± 0.023
1.518AsnPhe: 1.518 ± 0.017
2.582AsnGly: 2.582 ± 0.025
1.112AsnHis: 1.112 ± 0.011
1.881AsnIle: 1.881 ± 0.018
1.76AsnLys: 1.76 ± 0.019
4.08AsnLeu: 4.08 ± 0.027
0.858AsnMet: 0.858 ± 0.012
1.577AsnAsn: 1.577 ± 0.02
2.644AsnPro: 2.644 ± 0.022
1.832AsnGln: 1.832 ± 0.022
2.678AsnArg: 2.678 ± 0.023
3.787AsnSer: 3.787 ± 0.031
2.518AsnThr: 2.518 ± 0.021
2.55AsnVal: 2.55 ± 0.021
0.533AsnTrp: 0.533 ± 0.008
1.097AsnTyr: 1.097 ± 0.014
0.001AsnXaa: 0.001 ± 0.0
Pro
3.806ProAla: 3.806 ± 0.028
1.07ProCys: 1.07 ± 0.015
3.107ProAsp: 3.107 ± 0.025
3.386ProGlu: 3.386 ± 0.024
2.019ProPhe: 2.019 ± 0.018
3.714ProGly: 3.714 ± 0.068
1.544ProHis: 1.544 ± 0.019
2.689ProIle: 2.689 ± 0.02
2.634ProLys: 2.634 ± 0.02
4.992ProLeu: 4.992 ± 0.032
1.1ProMet: 1.1 ± 0.013
2.7ProAsn: 2.7 ± 0.023
4.606ProPro: 4.606 ± 0.054
2.261ProGln: 2.261 ± 0.022
3.357ProArg: 3.357 ± 0.026
6.414ProSer: 6.414 ± 0.054
4.466ProThr: 4.466 ± 0.036
4.297ProVal: 4.297 ± 0.033
0.579ProTrp: 0.579 ± 0.009
1.437ProTyr: 1.437 ± 0.017
0.001ProXaa: 0.001 ± 0.0
Gln
2.845GlnAla: 2.845 ± 0.025
0.921GlnCys: 0.921 ± 0.011
1.607GlnAsp: 1.607 ± 0.016
2.089GlnGlu: 2.089 ± 0.023
1.584GlnPhe: 1.584 ± 0.016
1.669GlnGly: 1.669 ± 0.022
1.266GlnHis: 1.266 ± 0.016
2.013GlnIle: 2.013 ± 0.021
2.0GlnLys: 2.0 ± 0.022
5.231GlnLeu: 5.231 ± 0.033
1.014GlnMet: 1.014 ± 0.012
1.784GlnAsn: 1.784 ± 0.018
3.018GlnPro: 3.018 ± 0.025
2.551GlnGln: 2.551 ± 0.037
2.897GlnArg: 2.897 ± 0.025
3.857GlnSer: 3.857 ± 0.026
2.929GlnThr: 2.929 ± 0.024
2.372GlnVal: 2.372 ± 0.021
0.567GlnTrp: 0.567 ± 0.008
1.064GlnTyr: 1.064 ± 0.013
0.001GlnXaa: 0.001 ± 0.0
Arg
3.943ArgAla: 3.943 ± 0.028
1.503ArgCys: 1.503 ± 0.021
2.853ArgAsp: 2.853 ± 0.02
3.384ArgGlu: 3.384 ± 0.026
2.669ArgPhe: 2.669 ± 0.021
3.263ArgGly: 3.263 ± 0.031
2.019ArgHis: 2.019 ± 0.02
3.371ArgIle: 3.371 ± 0.025
3.466ArgLys: 3.466 ± 0.025
7.555ArgLeu: 7.555 ± 0.04
1.49ArgMet: 1.49 ± 0.016
2.577ArgAsn: 2.577 ± 0.02
3.684ArgPro: 3.684 ± 0.026
3.005ArgGln: 3.005 ± 0.024
5.5ArgArg: 5.5 ± 0.036
5.865ArgSer: 5.865 ± 0.045
4.074ArgThr: 4.074 ± 0.027
3.782ArgVal: 3.782 ± 0.025
0.894ArgTrp: 0.894 ± 0.011
1.744ArgTyr: 1.744 ± 0.015
0.002ArgXaa: 0.002 ± 0.0
Ser
6.504SerAla: 6.504 ± 0.044
1.914SerCys: 1.914 ± 0.019
4.829SerAsp: 4.829 ± 0.029
5.103SerGlu: 5.103 ± 0.033
3.165SerPhe: 3.165 ± 0.023
5.91SerGly: 5.91 ± 0.041
2.423SerHis: 2.423 ± 0.024
3.991SerIle: 3.991 ± 0.033
3.884SerLys: 3.884 ± 0.027
8.353SerLeu: 8.353 ± 0.039
1.807SerMet: 1.807 ± 0.017
3.945SerAsn: 3.945 ± 0.03
5.823SerPro: 5.823 ± 0.044
3.621SerGln: 3.621 ± 0.026
5.747SerArg: 5.747 ± 0.035
10.714SerSer: 10.714 ± 0.074
6.7SerThr: 6.7 ± 0.043
6.431SerVal: 6.431 ± 0.065
1.048SerTrp: 1.048 ± 0.013
2.023SerTyr: 2.023 ± 0.018
0.003SerXaa: 0.003 ± 0.001
Thr
4.564ThrAla: 4.564 ± 0.035
1.273ThrCys: 1.273 ± 0.016
3.604ThrAsp: 3.604 ± 0.035
3.813ThrGlu: 3.813 ± 0.034
2.248ThrPhe: 2.248 ± 0.018
4.006ThrGly: 4.006 ± 0.031
1.708ThrHis: 1.708 ± 0.019
2.761ThrIle: 2.761 ± 0.024
2.787ThrLys: 2.787 ± 0.02
5.48ThrLeu: 5.48 ± 0.032
1.254ThrMet: 1.254 ± 0.014
2.8ThrAsn: 2.8 ± 0.022
3.9ThrPro: 3.9 ± 0.04
2.571ThrGln: 2.571 ± 0.021
3.74ThrArg: 3.74 ± 0.026
6.359ThrSer: 6.359 ± 0.045
4.712ThrThr: 4.712 ± 0.046
4.388ThrVal: 4.388 ± 0.031
0.729ThrTrp: 0.729 ± 0.014
1.545ThrTyr: 1.545 ± 0.014
0.001ThrXaa: 0.001 ± 0.0
Val
4.406ValAla: 4.406 ± 0.03
1.539ValCys: 1.539 ± 0.017
3.714ValAsp: 3.714 ± 0.03
3.599ValGlu: 3.599 ± 0.03
2.5ValPhe: 2.5 ± 0.023
3.565ValGly: 3.565 ± 0.033
1.882ValHis: 1.882 ± 0.016
2.934ValIle: 2.934 ± 0.024
2.676ValLys: 2.676 ± 0.02
5.841ValLeu: 5.841 ± 0.033
1.187ValMet: 1.187 ± 0.013
2.739ValAsn: 2.739 ± 0.022
3.743ValPro: 3.743 ± 0.028
2.601ValGln: 2.601 ± 0.024
4.13ValArg: 4.13 ± 0.027
5.658ValSer: 5.658 ± 0.041
4.049ValThr: 4.049 ± 0.029
4.207ValVal: 4.207 ± 0.038
0.726ValTrp: 0.726 ± 0.012
1.905ValTyr: 1.905 ± 0.019
0.001ValXaa: 0.001 ± 0.0
Trp
0.637TrpAla: 0.637 ± 0.011
0.303TrpCys: 0.303 ± 0.007
0.617TrpAsp: 0.617 ± 0.01
0.535TrpGlu: 0.535 ± 0.009
0.516TrpPhe: 0.516 ± 0.01
0.513TrpGly: 0.513 ± 0.011
0.338TrpHis: 0.338 ± 0.007
0.736TrpIle: 0.736 ± 0.01
0.613TrpLys: 0.613 ± 0.011
1.399TrpLeu: 1.399 ± 0.018
0.287TrpMet: 0.287 ± 0.016
0.593TrpAsn: 0.593 ± 0.009
0.637TrpPro: 0.637 ± 0.012
0.478TrpGln: 0.478 ± 0.009
0.886TrpArg: 0.886 ± 0.013
1.083TrpSer: 1.083 ± 0.019
0.775TrpThr: 0.775 ± 0.012
0.616TrpVal: 0.616 ± 0.01
0.193TrpTrp: 0.193 ± 0.005
0.311TrpTyr: 0.311 ± 0.007
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.775TyrAla: 1.775 ± 0.017
0.61TyrCys: 0.61 ± 0.01
1.349TyrAsp: 1.349 ± 0.018
1.448TyrGlu: 1.448 ± 0.018
1.125TyrPhe: 1.125 ± 0.014
1.617TyrGly: 1.617 ± 0.019
0.756TyrHis: 0.756 ± 0.011
1.177TyrIle: 1.177 ± 0.016
1.029TyrLys: 1.029 ± 0.014
2.713TyrLeu: 2.713 ± 0.024
0.56TyrMet: 0.56 ± 0.009
0.981TyrAsn: 0.981 ± 0.013
1.49TyrPro: 1.49 ± 0.018
1.115TyrGln: 1.115 ± 0.014
1.885TyrArg: 1.885 ± 0.019
2.215TyrSer: 2.215 ± 0.021
1.542TyrThr: 1.542 ± 0.016
1.629TyrVal: 1.629 ± 0.017
0.36TyrTrp: 0.36 ± 0.007
0.837TyrTyr: 0.837 ± 0.014
0.001TyrXaa: 0.001 ± 0.0
Xaa
0.001XaaAla: 0.001 ± 0.0
0.001XaaCys: 0.001 ± 0.0
0.001XaaAsp: 0.001 ± 0.0
0.002XaaGlu: 0.002 ± 0.0
0.001XaaPhe: 0.001 ± 0.0
0.001XaaGly: 0.001 ± 0.0
0.001XaaHis: 0.001 ± 0.0
0.001XaaIle: 0.001 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.003XaaLeu: 0.003 ± 0.001
0.0XaaMet: 0.0 ± 0.0
0.001XaaAsn: 0.001 ± 0.0
0.002XaaPro: 0.002 ± 0.0
0.001XaaGln: 0.001 ± 0.0
0.001XaaArg: 0.001 ± 0.0
0.002XaaSer: 0.002 ± 0.001
0.001XaaThr: 0.001 ± 0.0
0.002XaaVal: 0.002 ± 0.001
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.082XaaXaa: 0.082 ± 0.026
Statistics based on 16307 proteins (7070076 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski