Amino acid dipepetide frequency for Schistosoma japonicum (Blood fluke)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.556AlaAla: 3.556 ± 0.045
1.119AlaCys: 1.119 ± 0.014
2.395AlaAsp: 2.395 ± 0.022
2.718AlaGlu: 2.718 ± 0.029
1.993AlaPhe: 1.993 ± 0.019
2.213AlaGly: 2.213 ± 0.023
1.137AlaHis: 1.137 ± 0.015
2.989AlaIle: 2.989 ± 0.02
2.644AlaLys: 2.644 ± 0.022
4.744AlaLeu: 4.744 ± 0.037
0.982AlaMet: 0.982 ± 0.011
2.715AlaAsn: 2.715 ± 0.019
1.803AlaPro: 1.803 ± 0.019
1.752AlaGln: 1.752 ± 0.017
2.322AlaArg: 2.322 ± 0.019
4.602AlaSer: 4.602 ± 0.035
2.982AlaThr: 2.982 ± 0.022
3.253AlaVal: 3.253 ± 0.025
0.482AlaTrp: 0.482 ± 0.009
1.687AlaTyr: 1.687 ± 0.015
0.0AlaXaa: 0.0 ± 0.0
Cys
0.945CysAla: 0.945 ± 0.013
0.588CysCys: 0.588 ± 0.01
1.195CysAsp: 1.195 ± 0.017
1.136CysGlu: 1.136 ± 0.017
0.941CysPhe: 0.941 ± 0.012
1.139CysGly: 1.139 ± 0.016
0.675CysHis: 0.675 ± 0.011
1.563CysIle: 1.563 ± 0.017
1.115CysLys: 1.115 ± 0.015
2.501CysLeu: 2.501 ± 0.023
0.41CysMet: 0.41 ± 0.008
1.279CysAsn: 1.279 ± 0.017
1.048CysPro: 1.048 ± 0.017
0.937CysGln: 0.937 ± 0.015
1.187CysArg: 1.187 ± 0.013
2.17CysSer: 2.17 ± 0.022
1.264CysThr: 1.264 ± 0.014
1.165CysVal: 1.165 ± 0.014
0.231CysTrp: 0.231 ± 0.006
0.691CysTyr: 0.691 ± 0.011
0.0CysXaa: 0.0 ± 0.0
Asp
2.472AspAla: 2.472 ± 0.021
1.137AspCys: 1.137 ± 0.012
3.876AspAsp: 3.876 ± 0.041
3.813AspGlu: 3.813 ± 0.028
2.112AspPhe: 2.112 ± 0.018
2.573AspGly: 2.573 ± 0.027
1.381AspHis: 1.381 ± 0.014
3.537AspIle: 3.537 ± 0.023
2.905AspLys: 2.905 ± 0.023
4.987AspLeu: 4.987 ± 0.031
1.022AspMet: 1.022 ± 0.012
3.508AspAsn: 3.508 ± 0.027
2.205AspPro: 2.205 ± 0.021
1.958AspGln: 1.958 ± 0.016
2.358AspArg: 2.358 ± 0.018
4.874AspSer: 4.874 ± 0.031
2.704AspThr: 2.704 ± 0.028
3.103AspVal: 3.103 ± 0.025
0.615AspTrp: 0.615 ± 0.01
1.974AspTyr: 1.974 ± 0.017
0.0AspXaa: 0.0 ± 0.0
Glu
3.102GluAla: 3.102 ± 0.028
1.183GluCys: 1.183 ± 0.016
2.994GluAsp: 2.994 ± 0.029
3.977GluGlu: 3.977 ± 0.04
2.246GluPhe: 2.246 ± 0.018
1.9GluGly: 1.9 ± 0.024
1.419GluHis: 1.419 ± 0.015
3.705GluIle: 3.705 ± 0.027
3.679GluLys: 3.679 ± 0.033
5.912GluLeu: 5.912 ± 0.045
1.289GluMet: 1.289 ± 0.015
3.856GluAsn: 3.856 ± 0.024
1.843GluPro: 1.843 ± 0.018
2.471GluGln: 2.471 ± 0.024
2.862GluArg: 2.862 ± 0.027
4.829GluSer: 4.829 ± 0.03
3.17GluThr: 3.17 ± 0.024
3.045GluVal: 3.045 ± 0.025
0.556GluTrp: 0.556 ± 0.009
1.905GluTyr: 1.905 ± 0.017
0.0GluXaa: 0.0 ± 0.0
Phe
1.78PheAla: 1.78 ± 0.018
0.926PheCys: 0.926 ± 0.012
2.2PheAsp: 2.2 ± 0.018
2.01PheGlu: 2.01 ± 0.016
1.383PhePhe: 1.383 ± 0.017
2.107PheGly: 2.107 ± 0.018
1.192PheHis: 1.192 ± 0.013
2.886PheIle: 2.886 ± 0.025
1.946PheLys: 1.946 ± 0.018
3.689PheLeu: 3.689 ± 0.027
0.851PheMet: 0.851 ± 0.01
2.457PheAsn: 2.457 ± 0.021
1.744PhePro: 1.744 ± 0.017
1.545PheGln: 1.545 ± 0.013
1.905PheArg: 1.905 ± 0.018
3.746PheSer: 3.746 ± 0.024
2.581PheThr: 2.581 ± 0.02
2.26PheVal: 2.26 ± 0.02
0.432PheTrp: 0.432 ± 0.008
1.405PheTyr: 1.405 ± 0.014
0.0PheXaa: 0.0 ± 0.0
Gly
2.119GlyAla: 2.119 ± 0.026
1.026GlyCys: 1.026 ± 0.016
2.424GlyAsp: 2.424 ± 0.029
2.323GlyGlu: 2.323 ± 0.043
1.971GlyPhe: 1.971 ± 0.021
2.746GlyGly: 2.746 ± 0.035
1.241GlyHis: 1.241 ± 0.013
2.975GlyIle: 2.975 ± 0.022
2.4GlyLys: 2.4 ± 0.023
4.479GlyLeu: 4.479 ± 0.034
0.867GlyMet: 0.867 ± 0.012
2.422GlyAsn: 2.422 ± 0.02
1.876GlyPro: 1.876 ± 0.056
1.776GlyGln: 1.776 ± 0.02
2.474GlyArg: 2.474 ± 0.026
4.259GlySer: 4.259 ± 0.033
2.473GlyThr: 2.473 ± 0.02
2.773GlyVal: 2.773 ± 0.027
0.54GlyTrp: 0.54 ± 0.012
1.65GlyTyr: 1.65 ± 0.018
0.0GlyXaa: 0.0 ± 0.0
His
1.153HisAla: 1.153 ± 0.012
0.673HisCys: 0.673 ± 0.009
1.326HisAsp: 1.326 ± 0.012
1.451HisGlu: 1.451 ± 0.015
1.241HisPhe: 1.241 ± 0.014
1.225HisGly: 1.225 ± 0.013
1.339HisHis: 1.339 ± 0.024
1.816HisIle: 1.816 ± 0.02
1.442HisLys: 1.442 ± 0.014
3.192HisLeu: 3.192 ± 0.023
0.546HisMet: 0.546 ± 0.009
1.741HisAsn: 1.741 ± 0.019
1.43HisPro: 1.43 ± 0.015
1.466HisGln: 1.466 ± 0.016
1.536HisArg: 1.536 ± 0.016
3.09HisSer: 3.09 ± 0.022
1.56HisThr: 1.56 ± 0.019
1.497HisVal: 1.497 ± 0.015
0.305HisTrp: 0.305 ± 0.007
1.036HisTyr: 1.036 ± 0.013
0.0HisXaa: 0.0 ± 0.0
Ile
2.888IleAla: 2.888 ± 0.024
1.515IleCys: 1.515 ± 0.017
3.593IleAsp: 3.593 ± 0.023
3.516IleGlu: 3.516 ± 0.026
2.429IlePhe: 2.429 ± 0.021
2.944IleGly: 2.944 ± 0.022
2.055IleHis: 2.055 ± 0.018
4.161IleIle: 4.161 ± 0.03
3.607IleLys: 3.607 ± 0.026
5.851IleLeu: 5.851 ± 0.038
1.253IleMet: 1.253 ± 0.013
4.531IleAsn: 4.531 ± 0.03
3.307IlePro: 3.307 ± 0.025
2.907IleGln: 2.907 ± 0.024
3.161IleArg: 3.161 ± 0.02
6.321IleSer: 6.321 ± 0.034
4.116IleThr: 4.116 ± 0.026
3.369IleVal: 3.369 ± 0.026
0.672IleTrp: 0.672 ± 0.011
2.117IleTyr: 2.117 ± 0.022
0.0IleXaa: 0.0 ± 0.0
Lys
2.588LysAla: 2.588 ± 0.019
1.35LysCys: 1.35 ± 0.019
2.55LysAsp: 2.55 ± 0.02
3.372LysGlu: 3.372 ± 0.029
2.101LysPhe: 2.101 ± 0.02
1.882LysGly: 1.882 ± 0.032
1.782LysHis: 1.782 ± 0.016
3.324LysIle: 3.324 ± 0.022
3.438LysLys: 3.438 ± 0.045
5.995LysLeu: 5.995 ± 0.038
1.256LysMet: 1.256 ± 0.015
3.056LysAsn: 3.056 ± 0.024
2.629LysPro: 2.629 ± 0.022
2.895LysGln: 2.895 ± 0.021
3.363LysArg: 3.363 ± 0.025
5.702LysSer: 5.702 ± 0.033
3.243LysThr: 3.243 ± 0.024
2.823LysVal: 2.823 ± 0.021
0.606LysTrp: 0.606 ± 0.008
1.982LysTyr: 1.982 ± 0.021
0.0LysXaa: 0.0 ± 0.0
Leu
5.101LeuAla: 5.101 ± 0.033
2.267LeuCys: 2.267 ± 0.022
4.913LeuAsp: 4.913 ± 0.032
5.189LeuGlu: 5.189 ± 0.04
3.999LeuPhe: 3.999 ± 0.028
3.967LeuGly: 3.967 ± 0.028
2.935LeuHis: 2.935 ± 0.021
6.341LeuIle: 6.341 ± 0.04
5.81LeuLys: 5.81 ± 0.033
10.343LeuLeu: 10.343 ± 0.115
2.008LeuMet: 2.008 ± 0.017
6.307LeuAsn: 6.307 ± 0.035
5.175LeuPro: 5.175 ± 0.032
4.267LeuGln: 4.267 ± 0.027
5.138LeuArg: 5.138 ± 0.028
9.822LeuSer: 9.822 ± 0.046
6.097LeuThr: 6.097 ± 0.031
5.047LeuVal: 5.047 ± 0.035
0.977LeuTrp: 0.977 ± 0.013
2.875LeuTyr: 2.875 ± 0.021
0.0LeuXaa: 0.0 ± 0.0
Met
1.053MetAla: 1.053 ± 0.013
0.4MetCys: 0.4 ± 0.008
1.125MetAsp: 1.125 ± 0.012
1.213MetGlu: 1.213 ± 0.016
0.757MetPhe: 0.757 ± 0.009
0.832MetGly: 0.832 ± 0.014
0.591MetHis: 0.591 ± 0.01
1.281MetIle: 1.281 ± 0.016
1.505MetLys: 1.505 ± 0.016
1.855MetLeu: 1.855 ± 0.018
0.505MetMet: 0.505 ± 0.016
1.718MetAsn: 1.718 ± 0.015
0.917MetPro: 0.917 ± 0.012
0.781MetGln: 0.781 ± 0.01
0.942MetArg: 0.942 ± 0.012
1.824MetSer: 1.824 ± 0.017
1.188MetThr: 1.188 ± 0.013
0.97MetVal: 0.97 ± 0.011
0.178MetTrp: 0.178 ± 0.005
0.614MetTyr: 0.614 ± 0.009
0.0MetXaa: 0.0 ± 0.0
Asn
2.753AsnAla: 2.753 ± 0.022
1.343AsnCys: 1.343 ± 0.018
3.873AsnAsp: 3.873 ± 0.027
4.141AsnGlu: 4.141 ± 0.032
2.245AsnPhe: 2.245 ± 0.021
2.81AsnGly: 2.81 ± 0.025
2.037AsnHis: 2.037 ± 0.02
4.201AsnIle: 4.201 ± 0.028
3.666AsnLys: 3.666 ± 0.026
5.938AsnLeu: 5.938 ± 0.033
1.217AsnMet: 1.217 ± 0.013
6.445AsnAsn: 6.445 ± 0.061
2.777AsnPro: 2.777 ± 0.021
3.026AsnGln: 3.026 ± 0.024
2.827AsnArg: 2.827 ± 0.021
7.019AsnSer: 7.019 ± 0.045
4.009AsnThr: 4.009 ± 0.03
3.505AsnVal: 3.505 ± 0.026
0.597AsnTrp: 0.597 ± 0.009
2.25AsnTyr: 2.25 ± 0.02
0.0AsnXaa: 0.0 ± 0.0
Pro
1.961ProAla: 1.961 ± 0.023
0.863ProCys: 0.863 ± 0.013
2.343ProAsp: 2.343 ± 0.02
2.468ProGlu: 2.468 ± 0.018
1.75ProPhe: 1.75 ± 0.017
2.396ProGly: 2.396 ± 0.064
1.161ProHis: 1.161 ± 0.015
3.134ProIle: 3.134 ± 0.024
2.331ProLys: 2.331 ± 0.018
4.097ProLeu: 4.097 ± 0.027
0.831ProMet: 0.831 ± 0.01
2.94ProAsn: 2.94 ± 0.024
3.088ProPro: 3.088 ± 0.044
1.666ProGln: 1.666 ± 0.018
1.937ProArg: 1.937 ± 0.017
4.907ProSer: 4.907 ± 0.04
3.284ProThr: 3.284 ± 0.039
3.074ProVal: 3.074 ± 0.025
0.412ProTrp: 0.412 ± 0.008
1.504ProTyr: 1.504 ± 0.015
0.0ProXaa: 0.0 ± 0.0
Gln
1.905GlnAla: 1.905 ± 0.019
0.923GlnCys: 0.923 ± 0.014
1.545GlnAsp: 1.545 ± 0.017
1.98GlnGlu: 1.98 ± 0.02
1.707GlnPhe: 1.707 ± 0.016
1.313GlnGly: 1.313 ± 0.021
1.406GlnHis: 1.406 ± 0.016
2.77GlnIle: 2.77 ± 0.021
2.291GlnLys: 2.291 ± 0.019
5.158GlnLeu: 5.158 ± 0.035
1.005GlnMet: 1.005 ± 0.012
2.662GlnAsn: 2.662 ± 0.022
2.117GlnPro: 2.117 ± 0.028
2.928GlnGln: 2.928 ± 0.042
2.194GlnArg: 2.194 ± 0.021
4.398GlnSer: 4.398 ± 0.034
2.606GlnThr: 2.606 ± 0.022
2.018GlnVal: 2.018 ± 0.02
0.448GlnTrp: 0.448 ± 0.008
1.387GlnTyr: 1.387 ± 0.017
0.0GlnXaa: 0.0 ± 0.0
Arg
2.293ArgAla: 2.293 ± 0.02
1.094ArgCys: 1.094 ± 0.016
2.246ArgAsp: 2.246 ± 0.017
2.545ArgGlu: 2.545 ± 0.025
2.061ArgPhe: 2.061 ± 0.016
2.088ArgGly: 2.088 ± 0.029
1.528ArgHis: 1.528 ± 0.016
3.373ArgIle: 3.373 ± 0.025
3.195ArgLys: 3.195 ± 0.025
5.689ArgLeu: 5.689 ± 0.038
1.01ArgMet: 1.01 ± 0.011
2.864ArgAsn: 2.864 ± 0.022
2.134ArgPro: 2.134 ± 0.019
2.321ArgGln: 2.321 ± 0.027
3.727ArgArg: 3.727 ± 0.03
4.432ArgSer: 4.432 ± 0.034
2.508ArgThr: 2.508 ± 0.019
2.57ArgVal: 2.57 ± 0.023
0.545ArgTrp: 0.545 ± 0.009
1.682ArgTyr: 1.682 ± 0.014
0.0ArgXaa: 0.0 ± 0.0
Ser
4.378SerAla: 4.378 ± 0.03
2.019SerCys: 2.019 ± 0.02
5.343SerAsp: 5.343 ± 0.031
5.236SerGlu: 5.236 ± 0.037
3.637SerPhe: 3.637 ± 0.026
4.869SerGly: 4.869 ± 0.038
2.722SerHis: 2.722 ± 0.023
6.048SerIle: 6.048 ± 0.033
5.354SerLys: 5.354 ± 0.033
9.318SerLeu: 9.318 ± 0.046
1.976SerMet: 1.976 ± 0.02
7.216SerAsn: 7.216 ± 0.043
4.628SerPro: 4.628 ± 0.036
3.932SerGln: 3.932 ± 0.029
4.534SerArg: 4.534 ± 0.034
13.633SerSer: 13.633 ± 0.124
7.227SerThr: 7.227 ± 0.05
6.024SerVal: 6.024 ± 0.031
0.846SerTrp: 0.846 ± 0.012
2.809SerTyr: 2.809 ± 0.02
0.0SerXaa: 0.0 ± 0.0
Thr
3.048ThrAla: 3.048 ± 0.023
1.311ThrCys: 1.311 ± 0.016
3.419ThrAsp: 3.419 ± 0.024
3.39ThrGlu: 3.39 ± 0.025
2.263ThrPhe: 2.263 ± 0.019
3.046ThrGly: 3.046 ± 0.025
1.497ThrHis: 1.497 ± 0.017
3.898ThrIle: 3.898 ± 0.027
3.193ThrLys: 3.193 ± 0.022
5.36ThrLeu: 5.36 ± 0.029
1.201ThrMet: 1.201 ± 0.013
4.576ThrAsn: 4.576 ± 0.029
3.018ThrPro: 3.018 ± 0.035
2.196ThrGln: 2.196 ± 0.021
2.499ThrArg: 2.499 ± 0.022
6.884ThrSer: 6.884 ± 0.044
5.588ThrThr: 5.588 ± 0.065
3.798ThrVal: 3.798 ± 0.029
0.529ThrTrp: 0.529 ± 0.008
1.87ThrTyr: 1.87 ± 0.016
0.0ThrXaa: 0.0 ± 0.0
Val
2.949ValAla: 2.949 ± 0.024
1.39ValCys: 1.39 ± 0.015
3.399ValAsp: 3.399 ± 0.026
3.13ValGlu: 3.13 ± 0.029
2.178ValPhe: 2.178 ± 0.019
2.748ValGly: 2.748 ± 0.027
1.604ValHis: 1.604 ± 0.016
3.562ValIle: 3.562 ± 0.026
3.182ValLys: 3.182 ± 0.023
4.882ValLeu: 4.882 ± 0.028
1.108ValMet: 1.108 ± 0.014
3.907ValAsn: 3.907 ± 0.027
2.416ValPro: 2.416 ± 0.021
2.164ValGln: 2.164 ± 0.017
2.591ValArg: 2.591 ± 0.023
5.147ValSer: 5.147 ± 0.035
3.564ValThr: 3.564 ± 0.027
3.406ValVal: 3.406 ± 0.028
0.563ValTrp: 0.563 ± 0.01
1.932ValTyr: 1.932 ± 0.017
0.0ValXaa: 0.0 ± 0.0
Trp
0.447TrpAla: 0.447 ± 0.008
0.259TrpCys: 0.259 ± 0.006
0.567TrpAsp: 0.567 ± 0.009
0.477TrpGlu: 0.477 ± 0.009
0.474TrpPhe: 0.474 ± 0.008
0.378TrpGly: 0.378 ± 0.009
0.266TrpHis: 0.266 ± 0.006
0.784TrpIle: 0.784 ± 0.012
0.612TrpLys: 0.612 ± 0.008
1.148TrpLeu: 1.148 ± 0.018
0.233TrpMet: 0.233 ± 0.005
0.61TrpAsn: 0.61 ± 0.009
0.46TrpPro: 0.46 ± 0.008
0.348TrpGln: 0.348 ± 0.007
0.602TrpArg: 0.602 ± 0.009
0.959TrpSer: 0.959 ± 0.011
0.543TrpThr: 0.543 ± 0.009
0.407TrpVal: 0.407 ± 0.008
0.143TrpTrp: 0.143 ± 0.004
0.337TrpTyr: 0.337 ± 0.007
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.599TyrAla: 1.599 ± 0.014
0.796TyrCys: 0.796 ± 0.013
1.789TyrAsp: 1.789 ± 0.017
1.859TyrGlu: 1.859 ± 0.018
1.556TyrPhe: 1.556 ± 0.016
1.638TyrGly: 1.638 ± 0.022
1.096TyrHis: 1.096 ± 0.012
1.963TyrIle: 1.963 ± 0.018
1.622TyrLys: 1.622 ± 0.015
3.436TyrLeu: 3.436 ± 0.026
0.678TyrMet: 0.678 ± 0.01
1.872TyrAsn: 1.872 ± 0.02
1.531TyrPro: 1.531 ± 0.015
1.424TyrGln: 1.424 ± 0.014
1.747TyrArg: 1.747 ± 0.017
3.132TyrSer: 3.132 ± 0.022
1.877TyrThr: 1.877 ± 0.017
1.716TyrVal: 1.716 ± 0.016
0.377TyrTrp: 0.377 ± 0.007
1.276TyrTyr: 1.276 ± 0.015
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 14733 proteins (7781729 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski