Amino acid dipeptide frequency for Eimeria necatrix

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present with a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
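The arithmetic behind this expectation can be sketched in a few lines of Python. The function names `uniform_expectation` and `classify` are hypothetical, and the plain greater-than/less-than comparison is an assumption: the exact rule used to colour the table is not stated on this page.

```python
def uniform_expectation(alphabet_size: int = 20) -> float:
    """Expected dipeptide frequency (as a fraction) if every pair were equally likely."""
    return 1.0 / (alphabet_size ** 2)


def classify(observed: float, expected: float) -> str:
    """Label an observed fraction relative to the expectation (assumed rule)."""
    if observed > expected:
        return "over"    # would correspond to red in the table
    if observed < expected:
        return "under"   # would correspond to blue in the table
    return "as expected"


if __name__ == "__main__":
    for size in (20, 21):  # 20 standard amino acids, or 21 with Xaa
        expected = uniform_expectation(size)
        print(f"{size} letters: {size ** 2} dipeptides, "
              f"expected frequency {expected * 100:.4f}%")
```

With 20 letters this gives 400 dipeptides and 0.25% each; with Xaa included the expectation drops slightly, to about 0.2268%.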

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
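As an illustration of how such per-mille values can be obtained, the sketch below counts overlapping residue pairs in a list of sequences and normalises the counts. It assumes one-letter amino acid codes, counts pooled over all proteins, and pairs that never span two proteins; these are assumptions made for the example, not a description of the exact procedure used to build the table. All function names are hypothetical.

```python
from collections import Counter
from typing import Dict, Iterable


def dipeptide_counts(sequences: Iterable[str]) -> Counter:
    """Count overlapping residue pairs within each sequence (never across sequences)."""
    counts: Counter = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    return counts


def per_mille(counts: Counter) -> Dict[str, float]:
    """Convert raw pair counts to frequencies in per mille of all counted pairs."""
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def per_mille_to_fraction(value: float) -> float:
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


if __name__ == "__main__":
    toy = ["AAAL", "ALQQ"]  # toy sequences, not real proteome data
    for pair, freq in sorted(per_mille(dipeptide_counts(toy)).items()):
        print(f"{pair}: {freq:.3f} per mille")
```

For example, the AlaAla value of 72.861‰ in the table corresponds to a fraction of about 0.0729, or roughly 7.29%.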

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 72.861 ± 0.824
AlaCys: 1.673 ± 0.021
AlaAsp: 4.641 ± 0.038
AlaGlu: 7.937 ± 0.071
AlaPhe: 2.935 ± 0.029
AlaGly: 7.477 ± 0.06
AlaHis: 1.61 ± 0.021
AlaIle: 2.207 ± 0.024
AlaLys: 3.689 ± 0.048
AlaLeu: 9.517 ± 0.057
AlaMet: 1.128 ± 0.017
AlaAsn: 2.11 ± 0.023
AlaPro: 8.739 ± 0.104
AlaGln: 3.888 ± 0.036
AlaArg: 5.002 ± 0.041
AlaSer: 8.298 ± 0.055
AlaThr: 6.298 ± 0.061
AlaVal: 6.825 ± 0.046
AlaTrp: 0.814 ± 0.014
AlaTyr: 1.318 ± 0.021
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.504 ± 0.019
CysCys: 2.536 ± 0.068
CysAsp: 0.545 ± 0.011
CysGlu: 0.776 ± 0.016
CysPhe: 0.769 ± 0.015
CysGly: 1.443 ± 0.024
CysHis: 0.318 ± 0.009
CysIle: 0.616 ± 0.011
CysLys: 0.594 ± 0.015
CysLeu: 2.015 ± 0.024
CysMet: 0.398 ± 0.01
CysAsn: 0.44 ± 0.01
CysPro: 0.978 ± 0.024
CysGln: 0.493 ± 0.011
CysArg: 1.312 ± 0.019
CysSer: 2.94 ± 0.035
CysThr: 0.919 ± 0.018
CysVal: 1.068 ± 0.019
CysTrp: 0.291 ± 0.01
CysTyr: 0.317 ± 0.01
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.214 ± 0.039
AspCys: 0.811 ± 0.014
AspAsp: 1.531 ± 0.033
AspGlu: 2.524 ± 0.033
AspPhe: 1.323 ± 0.02
AspGly: 2.425 ± 0.028
AspHis: 0.49 ± 0.011
AspIle: 1.214 ± 0.019
AspLys: 1.375 ± 0.019
AspLeu: 3.306 ± 0.032
AspMet: 0.585 ± 0.012
AspAsn: 0.717 ± 0.013
AspPro: 2.228 ± 0.03
AspGln: 1.054 ± 0.016
AspArg: 1.79 ± 0.03
AspSer: 3.415 ± 0.034
AspThr: 1.809 ± 0.021
AspVal: 1.893 ± 0.025
AspTrp: 0.454 ± 0.01
AspTyr: 0.598 ± 0.012
AspXaa: 0.0 ± 0.0
Glu
GluAla: 8.681 ± 0.077
GluCys: 0.764 ± 0.021
GluAsp: 2.765 ± 0.03
GluGlu: 5.898 ± 0.061
GluPhe: 1.29 ± 0.019
GluGly: 4.789 ± 0.045
GluHis: 0.898 ± 0.017
GluIle: 1.502 ± 0.024
GluLys: 2.73 ± 0.038
GluLeu: 5.183 ± 0.046
GluMet: 0.913 ± 0.015
GluAsn: 1.323 ± 0.018
GluPro: 2.617 ± 0.028
GluGln: 4.618 ± 0.07
GluArg: 3.431 ± 0.038
GluSer: 3.351 ± 0.036
GluThr: 2.994 ± 0.035
GluVal: 3.103 ± 0.033
GluTrp: 0.596 ± 0.013
GluTyr: 0.724 ± 0.016
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.947 ± 0.03
PheCys: 0.852 ± 0.016
PheAsp: 1.099 ± 0.016
PheGlu: 1.481 ± 0.023
PhePhe: 1.209 ± 0.021
PheGly: 1.84 ± 0.027
PheHis: 0.511 ± 0.011
PheIle: 0.914 ± 0.018
PheLys: 0.992 ± 0.017
PheLeu: 2.979 ± 0.031
PheMet: 0.439 ± 0.011
PheAsn: 0.643 ± 0.014
PhePro: 1.453 ± 0.018
PheGln: 0.921 ± 0.014
PheArg: 1.541 ± 0.019
PheSer: 2.47 ± 0.03
PheThr: 1.067 ± 0.018
PheVal: 1.885 ± 0.021
PheTrp: 0.354 ± 0.01
PheTyr: 0.594 ± 0.013
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 9.157 ± 0.092
GlyCys: 1.339 ± 0.019
GlyAsp: 2.691 ± 0.034
GlyGlu: 3.693 ± 0.033
GlyPhe: 1.87 ± 0.024
GlyGly: 7.511 ± 0.101
GlyHis: 1.036 ± 0.014
GlyIle: 1.534 ± 0.025
GlyLys: 2.094 ± 0.027
GlyLeu: 5.348 ± 0.038
GlyMet: 0.75 ± 0.014
GlyAsn: 1.356 ± 0.022
GlyPro: 6.993 ± 0.134
GlyGln: 2.059 ± 0.025
GlyArg: 3.563 ± 0.033
GlySer: 6.22 ± 0.05
GlyThr: 2.719 ± 0.032
GlyVal: 3.343 ± 0.035
GlyTrp: 0.641 ± 0.013
GlyTyr: 0.889 ± 0.016
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.941 ± 0.023
HisCys: 0.488 ± 0.01
HisAsp: 0.595 ± 0.012
HisGlu: 1.137 ± 0.019
HisPhe: 0.739 ± 0.013
HisGly: 1.167 ± 0.019
HisHis: 0.675 ± 0.023
HisIle: 0.591 ± 0.013
HisLys: 0.813 ± 0.014
HisLeu: 2.254 ± 0.028
HisMet: 0.358 ± 0.011
HisAsn: 0.416 ± 0.01
HisPro: 1.272 ± 0.019
HisGln: 1.593 ± 0.031
HisArg: 1.509 ± 0.021
HisSer: 1.727 ± 0.022
HisThr: 0.807 ± 0.014
HisVal: 0.956 ± 0.014
HisTrp: 0.274 ± 0.009
HisTyr: 0.364 ± 0.01
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.115 ± 0.025
IleCys: 0.7 ± 0.012
IleAsp: 1.133 ± 0.019
IleGlu: 1.426 ± 0.023
IlePhe: 0.854 ± 0.016
IleGly: 1.319 ± 0.022
IleHis: 0.5 ± 0.01
IleIle: 0.794 ± 0.018
IleLys: 1.074 ± 0.019
IleLeu: 2.109 ± 0.028
IleMet: 0.384 ± 0.01
IleAsn: 0.704 ± 0.015
IlePro: 1.323 ± 0.018
IleGln: 1.003 ± 0.021
IleArg: 1.553 ± 0.023
IleSer: 2.135 ± 0.024
IleThr: 1.018 ± 0.019
IleVal: 1.323 ± 0.019
IleTrp: 0.261 ± 0.009
IleTyr: 0.538 ± 0.012
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.719 ± 0.041
LysCys: 0.581 ± 0.013
LysAsp: 1.524 ± 0.02
LysGlu: 3.152 ± 0.048
LysPhe: 0.771 ± 0.015
LysGly: 2.524 ± 0.036
LysHis: 0.77 ± 0.013
LysIle: 0.995 ± 0.021
LysLys: 2.268 ± 0.037
LysLeu: 3.039 ± 0.035
LysMet: 0.6 ± 0.014
LysAsn: 0.932 ± 0.017
LysPro: 1.859 ± 0.028
LysGln: 3.039 ± 0.035
LysArg: 2.758 ± 0.027
LysSer: 2.311 ± 0.029
LysThr: 1.805 ± 0.023
LysVal: 1.725 ± 0.022
LysTrp: 0.389 ± 0.009
LysTyr: 0.646 ± 0.015
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.739 ± 0.05
LeuCys: 1.918 ± 0.026
LeuAsp: 2.949 ± 0.032
LeuGlu: 5.241 ± 0.042
LeuPhe: 2.882 ± 0.036
LeuGly: 5.565 ± 0.05
LeuHis: 2.56 ± 0.028
LeuIle: 2.035 ± 0.025
LeuLys: 3.451 ± 0.038
LeuLeu: 18.381 ± 0.169
LeuMet: 1.299 ± 0.018
LeuAsn: 1.837 ± 0.024
LeuPro: 6.324 ± 0.044
LeuGln: 11.638 ± 0.126
LeuArg: 7.132 ± 0.055
LeuSer: 6.77 ± 0.044
LeuThr: 3.255 ± 0.03
LeuVal: 4.191 ± 0.04
LeuTrp: 1.16 ± 0.018
LeuTyr: 1.513 ± 0.02
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.226 ± 0.017
MetCys: 0.191 ± 0.007
MetAsp: 0.502 ± 0.012
MetGlu: 0.904 ± 0.015
MetPhe: 0.314 ± 0.009
MetGly: 0.916 ± 0.017
MetHis: 0.408 ± 0.01
MetIle: 0.298 ± 0.009
MetLys: 0.585 ± 0.013
MetLeu: 1.432 ± 0.018
MetMet: 0.241 ± 0.008
MetAsn: 0.32 ± 0.009
MetPro: 0.804 ± 0.014
MetGln: 1.256 ± 0.022
MetArg: 0.997 ± 0.016
MetSer: 0.877 ± 0.017
MetThr: 0.542 ± 0.012
MetVal: 0.552 ± 0.013
MetTrp: 0.141 ± 0.005
MetTyr: 0.246 ± 0.008
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.07 ± 0.026
AsnCys: 0.569 ± 0.011
AsnAsp: 0.747 ± 0.015
AsnGlu: 1.261 ± 0.019
AsnPhe: 0.741 ± 0.015
AsnGly: 1.473 ± 0.022
AsnHis: 0.334 ± 0.008
AsnIle: 0.717 ± 0.015
AsnLys: 1.03 ± 0.018
AsnLeu: 1.684 ± 0.024
AsnMet: 0.367 ± 0.01
AsnAsn: 0.77 ± 0.018
AsnPro: 1.325 ± 0.019
AsnGln: 0.74 ± 0.014
AsnArg: 1.168 ± 0.018
AsnSer: 2.848 ± 0.035
AsnThr: 1.012 ± 0.021
AsnVal: 1.042 ± 0.017
AsnTrp: 0.234 ± 0.007
AsnTyr: 0.421 ± 0.012
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 7.915 ± 0.08
ProCys: 0.967 ± 0.019
ProAsp: 1.74 ± 0.022
ProGlu: 3.608 ± 0.039
ProPhe: 1.83 ± 0.023
ProGly: 4.5 ± 0.07
ProHis: 1.481 ± 0.02
ProIle: 1.139 ± 0.018
ProLys: 2.064 ± 0.031
ProLeu: 6.857 ± 0.062
ProMet: 0.721 ± 0.016
ProAsn: 1.091 ± 0.017
ProPro: 7.091 ± 0.088
ProGln: 4.925 ± 0.063
ProArg: 3.779 ± 0.043
ProSer: 5.335 ± 0.041
ProThr: 2.737 ± 0.032
ProVal: 2.892 ± 0.033
ProTrp: 0.751 ± 0.015
ProTyr: 0.866 ± 0.016
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.805 ± 0.032
GlnCys: 0.619 ± 0.012
GlnAsp: 1.626 ± 0.022
GlnGlu: 4.493 ± 0.063
GlnPhe: 0.926 ± 0.016
GlnGly: 3.344 ± 0.042
GlnHis: 2.877 ± 0.043
GlnIle: 1.163 ± 0.018
GlnLys: 2.665 ± 0.034
GlnLeu: 8.957 ± 0.096
GlnMet: 0.979 ± 0.017
GlnAsn: 1.035 ± 0.018
GlnPro: 3.676 ± 0.059
GlnGln: 51.06 ± 0.725
GlnArg: 5.56 ± 0.056
GlnSer: 2.527 ± 0.032
GlnThr: 2.184 ± 0.025
GlnVal: 2.218 ± 0.028
GlnTrp: 0.63 ± 0.016
GlnTyr: 0.607 ± 0.015
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.49 ± 0.043
ArgCys: 1.463 ± 0.021
ArgAsp: 2.19 ± 0.028
ArgGlu: 3.82 ± 0.038
ArgPhe: 1.544 ± 0.021
ArgGly: 4.809 ± 0.051
ArgHis: 1.514 ± 0.021
ArgIle: 1.617 ± 0.022
ArgLys: 2.564 ± 0.029
ArgLeu: 6.327 ± 0.044
ArgMet: 0.999 ± 0.015
ArgAsn: 1.353 ± 0.018
ArgPro: 3.115 ± 0.029
ArgGln: 4.429 ± 0.049
ArgArg: 6.139 ± 0.062
ArgSer: 5.294 ± 0.052
ArgThr: 2.276 ± 0.024
ArgVal: 3.062 ± 0.029
ArgTrp: 0.752 ± 0.014
ArgTyr: 0.872 ± 0.016
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 8.211 ± 0.057
SerCys: 2.272 ± 0.033
SerAsp: 2.715 ± 0.031
SerGlu: 3.663 ± 0.039
SerPhe: 2.574 ± 0.027
SerGly: 5.657 ± 0.049
SerHis: 1.376 ± 0.02
SerIle: 1.947 ± 0.027
SerLys: 3.259 ± 0.033
SerLeu: 7.071 ± 0.044
SerMet: 0.939 ± 0.016
SerAsn: 2.877 ± 0.04
SerPro: 5.311 ± 0.048
SerGln: 2.764 ± 0.03
SerArg: 5.604 ± 0.048
SerSer: 29.558 ± 0.382
SerThr: 3.843 ± 0.048
SerVal: 3.792 ± 0.037
SerTrp: 0.86 ± 0.016
SerTyr: 1.065 ± 0.017
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.242 ± 0.065
ThrCys: 0.756 ± 0.017
ThrAsp: 1.726 ± 0.021
ThrGlu: 2.776 ± 0.042
ThrPhe: 1.109 ± 0.02
ThrGly: 2.713 ± 0.031
ThrHis: 0.764 ± 0.014
ThrIle: 1.051 ± 0.02
ThrLys: 1.524 ± 0.019
ThrLeu: 3.531 ± 0.031
ThrMet: 0.514 ± 0.012
ThrAsn: 1.018 ± 0.019
ThrPro: 3.05 ± 0.028
ThrGln: 1.878 ± 0.024
ThrArg: 2.134 ± 0.023
ThrSer: 3.287 ± 0.042
ThrThr: 2.325 ± 0.035
ThrVal: 2.291 ± 0.029
ThrTrp: 0.395 ± 0.011
ThrTyr: 0.698 ± 0.015
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.154 ± 0.051
ValCys: 1.181 ± 0.018
ValAsp: 2.051 ± 0.025
ValGlu: 2.994 ± 0.032
ValPhe: 1.651 ± 0.021
ValGly: 3.154 ± 0.031
ValHis: 1.079 ± 0.016
ValIle: 1.13 ± 0.019
ValLys: 1.603 ± 0.023
ValLeu: 5.167 ± 0.044
ValMet: 0.688 ± 0.014
ValAsn: 1.004 ± 0.017
ValPro: 2.884 ± 0.032
ValGln: 2.377 ± 0.026
ValArg: 2.665 ± 0.023
ValSer: 4.094 ± 0.034
ValThr: 1.968 ± 0.027
ValVal: 2.951 ± 0.035
ValTrp: 0.564 ± 0.012
ValTyr: 1.117 ± 0.02
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.822 ± 0.014
TrpCys: 0.205 ± 0.007
TrpAsp: 0.397 ± 0.011
TrpGlu: 0.562 ± 0.01
TrpPhe: 0.248 ± 0.008
TrpGly: 1.194 ± 0.021
TrpHis: 0.186 ± 0.008
TrpIle: 0.279 ± 0.008
TrpLys: 0.44 ± 0.011
TrpLeu: 1.182 ± 0.019
TrpMet: 0.185 ± 0.006
TrpAsn: 0.243 ± 0.008
TrpPro: 0.499 ± 0.011
TrpGln: 0.68 ± 0.015
TrpArg: 0.962 ± 0.019
TrpSer: 0.671 ± 0.012
TrpThr: 0.41 ± 0.009
TrpVal: 0.483 ± 0.01
TrpTrp: 0.174 ± 0.006
TrpTyr: 0.129 ± 0.005
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.152 ± 0.018
TyrCys: 0.385 ± 0.011
TyrAsp: 0.589 ± 0.014
TyrGlu: 0.823 ± 0.014
TyrPhe: 0.611 ± 0.014
TyrGly: 0.987 ± 0.019
TyrHis: 0.29 ± 0.009
TyrIle: 0.525 ± 0.011
TyrLys: 0.594 ± 0.011
TyrLeu: 1.505 ± 0.02
TyrMet: 0.265 ± 0.008
TyrAsn: 0.363 ± 0.009
TyrPro: 0.747 ± 0.016
TyrGln: 0.535 ± 0.013
TyrArg: 1.03 ± 0.015
TyrSer: 1.23 ± 0.02
TyrThr: 0.795 ± 0.013
TyrVal: 0.88 ± 0.016
TyrTrp: 0.217 ± 0.007
TyrTyr: 0.374 ± 0.011
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 8608 proteins (4723510 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
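A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed on each resample, and the spread of the resampled values serves as the error. The function names, the default seed, and the use of the sample standard deviation as the reported error are assumptions made for illustration; only the 100 resamples and the protein-level resampling come from the note above.

```python
import random
import statistics
from typing import List


def pair_per_mille(proteins: List[str], pair: str) -> float:
    """Pooled frequency of one dipeptide, in per mille, over the given proteins."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins: List[str], pair: str,
                    n_resamples: int = 100, seed: int = 0) -> float:
    """Standard deviation of the frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["AAAL", "ALQQ", "QQQA", "LLAA"]  # toy sequences, not real proteome data
    print(f"AA: {pair_per_mille(toy, 'AA'):.3f} ± {bootstrap_error(toy, 'AA'):.3f}")
```

Resampling whole proteins rather than individual dipeptides keeps the pairs within a protein together, so the estimate reflects variation between proteins.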

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski