Amino acid dipeptide frequency for Theileria parva (causative agent of East Coast fever)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
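The following is a minimal sketch of how such a frequency could be computed. It assumes overlapping pairs are counted within each protein (never across protein boundaries) and pooled over the whole proteome. The page does not state the exact procedure, so the function name `dipeptide_frequencies`, the one-letter codes, and the toy sequences are illustrative assumptions.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: pairs are overlapping windows of length 2 inside each
    protein, pooled over all proteins.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy sequences for illustration only, not real Theileria parva proteins.
toy_proteome = ["MKLLS", "MLLK"]
freqs = dipeptide_frequencies(toy_proteome)
print(sorted(freqs.items()))
```

Because every pair is normalised by the total number of pairs, the returned values sum to 1000 ‰.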

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random and with equal probability, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
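As a rough illustration of this comparison, the sketch below computes the uniform baseline and labels a value as over- or underrepresented. It assumes the colouring threshold is exactly the uniform expectation, which the page does not state explicitly. The function names are hypothetical, and the two sample values are taken from the table on this page.

```python
def uniform_expectation_permille(alphabet_size=20):
    """Expected per-mille frequency of each dipeptide under a uniform model."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_permille, expected_permille):
    """Label a dipeptide relative to the expected frequency."""
    if observed_permille > expected_permille:
        return "over"
    if observed_permille < expected_permille:
        return "under"
    return "as expected"


expected = uniform_expectation_permille()          # 20 standard amino acids
expected_with_xaa = uniform_expectation_permille(21)

# Values in per mille taken from the table on this page.
leu_leu = classify(10.756, expected)
trp_trp = classify(0.071, expected)
print(expected, expected_with_xaa, leu_leu, trp_trp)
```

Note that a uniform baseline ignores the fact that single amino acids differ in abundance. A dipeptide such as LeuLeu may exceed 2.5 ‰ largely because leucine itself is common.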

All values are given in per mille (‰); to obtain fractions, multiply them by 10⁻³.
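For example, the unit conversion can be written as the small sketch below. The helper names are illustrative, and the sample value is the LeuLeu entry from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per-mille value to percent (divide by 10)."""
    return value_permille / 10.0


leu_leu_permille = 10.756  # LeuLeu entry from the table
fraction = permille_to_fraction(leu_leu_permille)
percent = permille_to_percent(leu_leu_permille)
print(fraction, percent)
```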

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.597 ± 0.041
AlaCys: 0.513 ± 0.019
AlaAsp: 1.782 ± 0.031
AlaGlu: 2.007 ± 0.04
AlaPhe: 1.638 ± 0.03
AlaGly: 1.609 ± 0.037
AlaHis: 0.725 ± 0.021
AlaIle: 2.253 ± 0.036
AlaLys: 2.832 ± 0.043
AlaLeu: 3.734 ± 0.046
AlaMet: 0.696 ± 0.019
AlaAsn: 2.109 ± 0.035
AlaPro: 1.359 ± 0.041
AlaGln: 1.173 ± 0.027
AlaArg: 1.441 ± 0.031
AlaSer: 2.766 ± 0.04
AlaThr: 2.169 ± 0.057
AlaVal: 2.344 ± 0.039
AlaTrp: 0.231 ± 0.012
AlaTyr: 1.234 ± 0.026
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.542 ± 0.019
CysCys: 0.444 ± 0.025
CysAsp: 0.941 ± 0.021
CysGlu: 0.901 ± 0.024
CysPhe: 0.835 ± 0.021
CysGly: 0.893 ± 0.026
CysHis: 0.358 ± 0.013
CysIle: 1.207 ± 0.029
CysLys: 1.369 ± 0.03
CysLeu: 1.693 ± 0.032
CysMet: 0.31 ± 0.013
CysAsn: 1.135 ± 0.03
CysPro: 0.574 ± 0.024
CysGln: 0.392 ± 0.015
CysArg: 0.68 ± 0.02
CysSer: 1.397 ± 0.03
CysThr: 0.877 ± 0.025
CysVal: 1.21 ± 0.029
CysTrp: 0.137 ± 0.009
CysTyr: 0.713 ± 0.02
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.759 ± 0.031
AspCys: 0.786 ± 0.021
AspAsp: 3.937 ± 0.07
AspGlu: 4.564 ± 0.065
AspPhe: 2.944 ± 0.045
AspGly: 2.591 ± 0.04
AspHis: 1.126 ± 0.025
AspIle: 3.62 ± 0.055
AspLys: 4.61 ± 0.059
AspLeu: 6.047 ± 0.059
AspMet: 1.076 ± 0.025
AspAsn: 3.807 ± 0.049
AspPro: 2.52 ± 0.043
AspGln: 1.811 ± 0.035
AspArg: 2.12 ± 0.036
AspSer: 5.477 ± 0.072
AspThr: 3.226 ± 0.052
AspVal: 3.647 ± 0.045
AspTrp: 0.427 ± 0.015
AspTyr: 2.622 ± 0.042
AspXaa: 0.001 ± 0.001
Glu
GluAla: 2.307 ± 0.044
GluCys: 0.981 ± 0.024
GluAsp: 4.184 ± 0.066
GluGlu: 5.031 ± 0.071
GluPhe: 3.157 ± 0.046
GluGly: 2.505 ± 0.045
GluHis: 1.349 ± 0.026
GluIle: 4.122 ± 0.06
GluLys: 4.802 ± 0.061
GluLeu: 6.656 ± 0.078
GluMet: 1.478 ± 0.032
GluAsn: 4.669 ± 0.057
GluPro: 2.293 ± 0.047
GluGln: 1.961 ± 0.035
GluArg: 2.571 ± 0.037
GluSer: 5.487 ± 0.068
GluThr: 3.296 ± 0.046
GluVal: 3.639 ± 0.053
GluTrp: 0.498 ± 0.018
GluTyr: 2.696 ± 0.037
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.526 ± 0.034
PheCys: 0.848 ± 0.023
PheAsp: 3.203 ± 0.047
PheGlu: 2.967 ± 0.043
PhePhe: 2.294 ± 0.043
PheGly: 2.676 ± 0.057
PheHis: 1.048 ± 0.025
PheIle: 3.361 ± 0.052
PheLys: 4.033 ± 0.056
PheLeu: 4.824 ± 0.077
PheMet: 0.98 ± 0.023
PheAsn: 3.798 ± 0.05
PhePro: 1.651 ± 0.032
PheGln: 1.378 ± 0.026
PheArg: 1.897 ± 0.033
PheSer: 4.266 ± 0.05
PheThr: 2.917 ± 0.045
PheVal: 3.098 ± 0.041
PheTrp: 0.425 ± 0.017
PheTyr: 2.342 ± 0.041
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.665 ± 0.035
GlyCys: 0.665 ± 0.021
GlyAsp: 2.778 ± 0.042
GlyGlu: 2.623 ± 0.042
GlyPhe: 2.353 ± 0.037
GlyGly: 2.363 ± 0.047
GlyHis: 0.941 ± 0.024
GlyIle: 3.021 ± 0.051
GlyLys: 3.433 ± 0.051
GlyLeu: 4.145 ± 0.069
GlyMet: 0.906 ± 0.023
GlyAsn: 2.934 ± 0.042
GlyPro: 1.516 ± 0.042
GlyGln: 1.258 ± 0.043
GlyArg: 1.849 ± 0.036
GlySer: 3.891 ± 0.056
GlyThr: 2.84 ± 0.058
GlyVal: 3.084 ± 0.05
GlyTrp: 0.363 ± 0.014
GlyTyr: 2.065 ± 0.045
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.583 ± 0.017
HisCys: 0.34 ± 0.012
HisAsp: 1.014 ± 0.026
HisGlu: 1.177 ± 0.025
HisPhe: 1.23 ± 0.028
HisGly: 0.894 ± 0.023
HisHis: 0.503 ± 0.017
HisIle: 1.515 ± 0.033
HisLys: 1.57 ± 0.029
HisLeu: 2.461 ± 0.04
HisMet: 0.463 ± 0.016
HisAsn: 1.483 ± 0.033
HisPro: 0.985 ± 0.024
HisGln: 0.682 ± 0.02
HisArg: 0.952 ± 0.026
HisSer: 1.912 ± 0.036
HisThr: 1.37 ± 0.033
HisVal: 1.225 ± 0.026
HisTrp: 0.19 ± 0.011
HisTyr: 1.048 ± 0.026
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.245 ± 0.039
IleCys: 1.228 ± 0.026
IleAsp: 3.578 ± 0.046
IleGlu: 3.567 ± 0.053
IlePhe: 3.331 ± 0.053
IleGly: 2.605 ± 0.045
IleHis: 1.501 ± 0.03
IleIle: 4.673 ± 0.081
IleLys: 5.242 ± 0.066
IleLeu: 6.933 ± 0.079
IleMet: 1.267 ± 0.027
IleAsn: 5.076 ± 0.072
IlePro: 2.913 ± 0.046
IleGln: 2.113 ± 0.043
IleArg: 2.607 ± 0.037
IleSer: 5.716 ± 0.074
IleThr: 3.88 ± 0.054
IleVal: 3.664 ± 0.048
IleTrp: 0.677 ± 0.024
IleTyr: 2.836 ± 0.039
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.643 ± 0.045
LysCys: 1.521 ± 0.036
LysAsp: 4.193 ± 0.06
LysGlu: 4.751 ± 0.064
LysPhe: 4.064 ± 0.054
LysGly: 2.982 ± 0.047
LysHis: 1.852 ± 0.031
LysIle: 5.407 ± 0.063
LysLys: 5.925 ± 0.082
LysLeu: 8.966 ± 0.089
LysMet: 1.79 ± 0.036
LysAsn: 5.242 ± 0.062
LysPro: 3.03 ± 0.053
LysGln: 2.231 ± 0.038
LysArg: 3.8 ± 0.051
LysSer: 6.795 ± 0.078
LysThr: 4.141 ± 0.052
LysVal: 4.65 ± 0.057
LysTrp: 0.654 ± 0.018
LysTyr: 3.656 ± 0.045
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.519 ± 0.049
LeuCys: 1.785 ± 0.031
LeuAsp: 5.929 ± 0.055
LeuGlu: 6.456 ± 0.069
LeuPhe: 5.221 ± 0.072
LeuGly: 4.175 ± 0.054
LeuHis: 2.18 ± 0.042
LeuIle: 6.623 ± 0.086
LeuLys: 8.307 ± 0.081
LeuLeu: 10.756 ± 0.127
LeuMet: 2.219 ± 0.034
LeuAsn: 7.691 ± 0.083
LeuPro: 3.384 ± 0.049
LeuGln: 3.158 ± 0.051
LeuArg: 4.289 ± 0.05
LeuSer: 9.098 ± 0.086
LeuThr: 5.727 ± 0.063
LeuVal: 6.175 ± 0.082
LeuTrp: 0.798 ± 0.023
LeuTyr: 4.37 ± 0.055
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.934 ± 0.023
MetCys: 0.468 ± 0.018
MetAsp: 1.389 ± 0.032
MetGlu: 1.439 ± 0.029
MetPhe: 1.041 ± 0.026
MetGly: 1.053 ± 0.027
MetHis: 0.305 ± 0.013
MetIle: 1.321 ± 0.031
MetLys: 1.571 ± 0.028
MetLeu: 1.856 ± 0.029
MetMet: 0.471 ± 0.018
MetAsn: 1.47 ± 0.035
MetPro: 0.68 ± 0.022
MetGln: 0.448 ± 0.015
MetArg: 0.84 ± 0.022
MetSer: 1.707 ± 0.035
MetThr: 1.072 ± 0.028
MetVal: 1.286 ± 0.025
MetTrp: 0.143 ± 0.008
MetTyr: 0.835 ± 0.022
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.132 ± 0.037
AsnCys: 1.244 ± 0.029
AsnAsp: 3.684 ± 0.047
AsnGlu: 4.219 ± 0.054
AsnPhe: 3.61 ± 0.053
AsnGly: 2.975 ± 0.042
AsnHis: 1.459 ± 0.031
AsnIle: 4.787 ± 0.067
AsnLys: 5.601 ± 0.067
AsnLeu: 7.402 ± 0.08
AsnMet: 1.444 ± 0.033
AsnAsn: 5.932 ± 0.09
AsnPro: 2.805 ± 0.044
AsnGln: 2.196 ± 0.034
AsnArg: 2.616 ± 0.044
AsnSer: 7.204 ± 0.093
AsnThr: 5.058 ± 0.084
AsnVal: 4.401 ± 0.061
AsnTrp: 0.569 ± 0.018
AsnTyr: 3.28 ± 0.055
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.352 ± 0.045
ProCys: 0.464 ± 0.017
ProAsp: 2.135 ± 0.039
ProGlu: 3.076 ± 0.054
ProPhe: 1.752 ± 0.033
ProGly: 1.876 ± 0.047
ProHis: 0.709 ± 0.02
ProIle: 2.442 ± 0.045
ProLys: 3.181 ± 0.057
ProLeu: 3.372 ± 0.048
ProMet: 0.722 ± 0.023
ProAsn: 2.823 ± 0.044
ProPro: 2.052 ± 0.057
ProGln: 1.77 ± 0.068
ProArg: 1.477 ± 0.037
ProSer: 3.281 ± 0.06
ProThr: 2.512 ± 0.055
ProVal: 2.633 ± 0.043
ProTrp: 0.28 ± 0.014
ProTyr: 1.545 ± 0.038
ProXaa: 0.001 ± 0.0
Gln
GlnAla: 1.143 ± 0.029
GlnCys: 0.438 ± 0.016
GlnAsp: 1.56 ± 0.035
GlnGlu: 1.859 ± 0.034
GlnPhe: 1.587 ± 0.032
GlnGly: 1.189 ± 0.038
GlnHis: 0.748 ± 0.022
GlnIle: 2.263 ± 0.04
GlnLys: 2.124 ± 0.035
GlnLeu: 3.547 ± 0.051
GlnMet: 0.782 ± 0.019
GlnAsn: 2.203 ± 0.037
GlnPro: 1.908 ± 0.098
GlnGln: 1.514 ± 0.06
GlnArg: 1.248 ± 0.028
GlnSer: 2.488 ± 0.038
GlnThr: 1.838 ± 0.037
GlnVal: 1.782 ± 0.036
GlnTrp: 0.232 ± 0.012
GlnTyr: 1.484 ± 0.045
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.501 ± 0.028
ArgCys: 0.747 ± 0.022
ArgAsp: 2.359 ± 0.047
ArgGlu: 2.547 ± 0.042
ArgPhe: 2.226 ± 0.036
ArgGly: 1.861 ± 0.035
ArgHis: 0.899 ± 0.023
ArgIle: 2.944 ± 0.039
ArgLys: 3.232 ± 0.049
ArgLeu: 4.29 ± 0.048
ArgMet: 0.925 ± 0.022
ArgAsn: 2.845 ± 0.043
ArgPro: 1.407 ± 0.031
ArgGln: 1.101 ± 0.025
ArgArg: 2.527 ± 0.045
ArgSer: 3.13 ± 0.054
ArgThr: 2.09 ± 0.037
ArgVal: 2.629 ± 0.046
ArgTrp: 0.336 ± 0.014
ArgTyr: 1.794 ± 0.03
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.896 ± 0.044
SerCys: 1.33 ± 0.03
SerAsp: 5.639 ± 0.064
SerGlu: 5.734 ± 0.064
SerPhe: 4.132 ± 0.064
SerGly: 4.465 ± 0.065
SerHis: 1.926 ± 0.039
SerIle: 5.362 ± 0.064
SerLys: 6.817 ± 0.07
SerLeu: 8.463 ± 0.078
SerMet: 1.591 ± 0.029
SerAsn: 6.324 ± 0.07
SerPro: 2.98 ± 0.049
SerGln: 2.988 ± 0.046
SerArg: 3.459 ± 0.057
SerSer: 8.35 ± 0.109
SerThr: 5.185 ± 0.069
SerVal: 5.797 ± 0.085
SerTrp: 0.573 ± 0.018
SerTyr: 3.241 ± 0.046
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.201 ± 0.049
ThrCys: 0.9 ± 0.022
ThrAsp: 3.257 ± 0.049
ThrGlu: 3.538 ± 0.046
ThrPhe: 2.757 ± 0.035
ThrGly: 2.977 ± 0.059
ThrHis: 1.569 ± 0.035
ThrIle: 3.599 ± 0.053
ThrLys: 4.237 ± 0.06
ThrLeu: 5.646 ± 0.056
ThrMet: 1.069 ± 0.027
ThrAsn: 4.409 ± 0.067
ThrPro: 3.021 ± 0.079
ThrGln: 2.33 ± 0.052
ThrArg: 2.223 ± 0.038
ThrSer: 4.91 ± 0.061
ThrThr: 4.088 ± 0.09
ThrVal: 4.004 ± 0.058
ThrTrp: 0.369 ± 0.012
ThrTyr: 1.909 ± 0.034
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.09 ± 0.04
ValCys: 1.04 ± 0.026
ValAsp: 4.191 ± 0.065
ValGlu: 4.211 ± 0.054
ValPhe: 2.852 ± 0.047
ValGly: 2.759 ± 0.045
ValHis: 1.221 ± 0.031
ValIle: 3.852 ± 0.052
ValLys: 5.192 ± 0.061
ValLeu: 6.067 ± 0.071
ValMet: 1.155 ± 0.022
ValAsn: 4.694 ± 0.079
ValPro: 2.578 ± 0.052
ValGln: 1.764 ± 0.039
ValArg: 2.515 ± 0.04
ValSer: 5.018 ± 0.056
ValThr: 3.788 ± 0.073
ValVal: 4.098 ± 0.054
ValTrp: 0.655 ± 0.023
ValTyr: 2.606 ± 0.038
ValXaa: 0.001 ± 0.001
Trp
TrpAla: 0.295 ± 0.012
TrpCys: 0.139 ± 0.01
TrpAsp: 0.488 ± 0.017
TrpGlu: 0.557 ± 0.023
TrpPhe: 0.348 ± 0.014
TrpGly: 0.297 ± 0.014
TrpHis: 0.203 ± 0.01
TrpIle: 0.563 ± 0.018
TrpLys: 0.711 ± 0.028
TrpLeu: 0.75 ± 0.022
TrpMet: 0.164 ± 0.01
TrpAsn: 0.605 ± 0.019
TrpPro: 0.235 ± 0.01
TrpGln: 0.193 ± 0.01
TrpArg: 0.395 ± 0.015
TrpSer: 0.669 ± 0.021
TrpThr: 0.466 ± 0.016
TrpVal: 0.482 ± 0.017
TrpTrp: 0.071 ± 0.007
TrpTyr: 0.324 ± 0.014
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.28 ± 0.028
TyrCys: 0.728 ± 0.024
TyrAsp: 2.477 ± 0.04
TyrGlu: 2.457 ± 0.039
TyrPhe: 2.161 ± 0.043
TyrGly: 1.95 ± 0.036
TyrHis: 0.971 ± 0.027
TyrIle: 2.716 ± 0.046
TyrLys: 3.506 ± 0.051
TyrLeu: 4.251 ± 0.052
TyrMet: 0.821 ± 0.022
TyrAsn: 3.369 ± 0.058
TyrPro: 1.563 ± 0.037
TyrGln: 1.479 ± 0.048
TyrArg: 1.883 ± 0.035
TyrSer: 3.692 ± 0.047
TyrThr: 2.471 ± 0.039
TyrVal: 2.499 ± 0.032
TyrTrp: 0.327 ± 0.015
TyrTyr: 2.256 ± 0.043
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.001 ± 0.001
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.001 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.001 ± 0.001
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4071 proteins (1895349 amino acids)
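To work with the table programmatically, entries of the form `AlaAla: 1.597 ± 0.041` can be read with a regular expression. The sketch below is one possible way to do this. The pattern and the function name `parse_entry` are assumptions for illustration and not part of Proteome-pI.

```python
import re

# Two three-letter residue codes, a colon, the value and its error.
# re.search is used so that any text before the residue pair is ignored.
ENTRY = re.compile(
    r"([A-Z][a-z]{2})([A-Z][a-z]{2}): (\d+(?:\.\d+)?) ± (\d+(?:\.\d+)?)"
)


def parse_entry(line):
    """Return (first, second, value, error) or None for non-entry lines."""
    match = ENTRY.search(line)
    if match is None:
        return None
    first, second, value, error = match.groups()
    return first, second, float(value), float(error)


entry = parse_entry("LeuLeu: 10.756 ± 0.127")
row_label = parse_entry("Leu")
print(entry, row_label)
```

Row labels such as `Leu` do not match the pattern and return `None`, so they can be skipped when iterating over the table.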

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski