Amino acid dipeptide frequency for Trypanosoma theileri

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
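
As a rough illustration of the idea, the following Python sketch counts dipeptides in a list of protein sequences and reports them in per mille. It is not the code used to build this page: the function name `dipeptide_per_mille`, the use of one-letter codes (with `X` standing in for Xaa), and the choice of overlapping windows that never span two proteins are all assumptions made for the example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all given proteins.

    Hypothetical helper for illustration only.
    """
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes 'X' (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2; windows never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Two toy sequences; 'B' is non-standard and is therefore counted as 'X'.
freqs = dipeptide_per_mille(["MKKLA", "MKB"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f} per mille")
```

The frequencies of all observed dipeptides sum to 1000 ‰ by construction.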

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
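
The arithmetic behind the 0.25% baseline can be sketched in a few lines of Python. The function names and the simple greater-than or less-than rule are assumptions for illustration; the exact threshold used for the colouring on this page is not stated beyond "more than expected" and "underrepresented". The example values are taken from the table on this page.

```python
def expected_uniform_per_mille(n_residues=20):
    """Expected frequency of each dipeptide (in per mille) if all
    n_residues**2 ordered pairs were equally likely."""
    return 1000.0 / (n_residues ** 2)


def classify(observed, expected=None):
    """Label a per-mille value relative to the uniform expectation.

    Assumption: a plain comparison against the baseline, ignoring the error.
    """
    if expected is None:
        expected = expected_uniform_per_mille()
    if observed > expected:
        return "over"
    if observed < expected:
        return "under"
    return "as expected"


# Values in per mille copied from the table on this page.
examples = {"LeuLeu": 11.13, "SerSer": 11.374, "TrpTrp": 0.196, "CysCys": 0.602}
labels = {name: classify(value) for name, value in examples.items()}

print(expected_uniform_per_mille())    # 400 possibilities
print(expected_uniform_per_mille(21))  # 441 possibilities, counting Xaa
print(labels)
```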

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Rows: first amino acid; columns: second amino acid. Each cell: frequency ± error, in ‰.

1st\2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 8.644 ± 0.075 | 1.21 ± 0.016 | 3.192 ± 0.03 | 5.278 ± 0.045 | 2.484 ± 0.021 | 3.908 ± 0.037 | 1.542 ± 0.017 | 3.165 ± 0.028 | 3.34 ± 0.036 | 7.417 ± 0.047 | 1.771 ± 0.019 | 2.456 ± 0.023 | 3.229 ± 0.033 | 2.63 ± 0.029 | 3.903 ± 0.034 | 5.324 ± 0.036 | 4.442 ± 0.032 | 5.919 ± 0.047 | 0.713 ± 0.013 | 1.587 ± 0.018 | 0.003 ± 0.001
Cys | 1.281 ± 0.014 | 0.602 ± 0.013 | 0.857 ± 0.013 | 1.028 ± 0.017 | 0.763 ± 0.012 | 1.39 ± 0.019 | 0.376 ± 0.009 | 1.016 ± 0.014 | 0.704 ± 0.012 | 1.473 ± 0.018 | 0.523 ± 0.01 | 0.669 ± 0.011 | 0.823 ± 0.014 | 0.446 ± 0.009 | 1.107 ± 0.016 | 1.457 ± 0.019 | 1.201 ± 0.016 | 1.582 ± 0.018 | 0.19 ± 0.006 | 0.461 ± 0.011 | 0.002 ± 0.001
Asp | 4.063 ± 0.029 | 0.686 ± 0.012 | 3.825 ± 0.041 | 4.088 ± 0.032 | 1.576 ± 0.018 | 3.41 ± 0.031 | 1.015 ± 0.016 | 2.66 ± 0.026 | 2.091 ± 0.02 | 3.6 ± 0.028 | 1.174 ± 0.015 | 2.593 ± 0.026 | 2.26 ± 0.03 | 1.169 ± 0.016 | 2.322 ± 0.023 | 3.683 ± 0.03 | 2.94 ± 0.024 | 4.084 ± 0.028 | 0.473 ± 0.011 | 1.274 ± 0.016 | 0.0 ± 0.0
Glu | 5.18 ± 0.047 | 1.008 ± 0.014 | 3.804 ± 0.035 | 9.719 ± 0.091 | 1.821 ± 0.02 | 4.04 ± 0.032 | 1.405 ± 0.015 | 2.847 ± 0.025 | 5.733 ± 0.045 | 6.181 ± 0.048 | 2.004 ± 0.021 | 3.528 ± 0.028 | 2.12 ± 0.022 | 2.968 ± 0.028 | 5.073 ± 0.045 | 4.966 ± 0.034 | 4.006 ± 0.034 | 4.618 ± 0.031 | 0.821 ± 0.013 | 1.696 ± 0.021 | 0.001 ± 0.001
Phe | 2.452 ± 0.026 | 0.77 ± 0.013 | 1.693 ± 0.017 | 1.686 ± 0.019 | 1.81 ± 0.027 | 1.964 ± 0.024 | 0.945 ± 0.013 | 1.827 ± 0.019 | 1.156 ± 0.017 | 3.782 ± 0.031 | 0.776 ± 0.012 | 1.284 ± 0.017 | 1.735 ± 0.022 | 1.014 ± 0.013 | 1.832 ± 0.02 | 2.97 ± 0.026 | 2.185 ± 0.019 | 2.524 ± 0.024 | 0.379 ± 0.009 | 1.03 ± 0.014 | 0.0 ± 0.0
Gly | 3.797 ± 0.038 | 1.046 ± 0.015 | 3.307 ± 0.031 | 4.044 ± 0.034 | 1.962 ± 0.022 | 5.061 ± 0.058 | 1.148 ± 0.014 | 2.928 ± 0.029 | 3.254 ± 0.032 | 4.017 ± 0.033 | 1.443 ± 0.019 | 3.08 ± 0.03 | 2.074 ± 0.023 | 1.574 ± 0.019 | 3.412 ± 0.026 | 5.031 ± 0.042 | 3.677 ± 0.028 | 4.619 ± 0.042 | 0.583 ± 0.011 | 1.449 ± 0.021 | 0.001 ± 0.0
His | 1.597 ± 0.018 | 0.509 ± 0.011 | 1.157 ± 0.014 | 1.449 ± 0.016 | 0.934 ± 0.015 | 1.389 ± 0.016 | 1.287 ± 0.025 | 1.269 ± 0.016 | 0.952 ± 0.014 | 2.244 ± 0.026 | 0.555 ± 0.01 | 1.234 ± 0.017 | 1.45 ± 0.017 | 1.076 ± 0.017 | 1.725 ± 0.02 | 2.088 ± 0.023 | 1.623 ± 0.02 | 1.689 ± 0.019 | 0.294 ± 0.008 | 0.841 ± 0.013 | 0.001 ± 0.0
Ile | 3.503 ± 0.028 | 0.818 ± 0.013 | 2.209 ± 0.022 | 2.628 ± 0.023 | 1.578 ± 0.022 | 2.39 ± 0.026 | 1.249 ± 0.015 | 2.381 ± 0.025 | 1.964 ± 0.022 | 4.315 ± 0.035 | 1.079 ± 0.015 | 2.025 ± 0.022 | 2.857 ± 0.025 | 1.728 ± 0.019 | 2.625 ± 0.024 | 3.96 ± 0.03 | 3.086 ± 0.025 | 3.273 ± 0.029 | 0.384 ± 0.008 | 1.059 ± 0.017 | 0.001 ± 0.001
Lys | 3.096 ± 0.03 | 0.756 ± 0.013 | 2.621 ± 0.025 | 5.435 ± 0.046 | 1.116 ± 0.016 | 2.906 ± 0.028 | 1.098 ± 0.015 | 1.925 ± 0.023 | 4.492 ± 0.046 | 3.849 ± 0.027 | 1.208 ± 0.015 | 2.634 ± 0.026 | 1.976 ± 0.024 | 2.2 ± 0.024 | 3.609 ± 0.03 | 3.379 ± 0.029 | 2.827 ± 0.025 | 2.774 ± 0.026 | 0.531 ± 0.011 | 1.363 ± 0.015 | 0.001 ± 0.0
Leu | 6.117 ± 0.044 | 2.06 ± 0.022 | 3.889 ± 0.033 | 5.714 ± 0.05 | 3.726 ± 0.029 | 4.206 ± 0.035 | 2.8 ± 0.024 | 3.628 ± 0.03 | 3.951 ± 0.027 | 11.13 ± 0.078 | 2.128 ± 0.022 | 3.239 ± 0.028 | 4.984 ± 0.032 | 5.167 ± 0.044 | 7.127 ± 0.044 | 7.482 ± 0.046 | 5.107 ± 0.035 | 5.475 ± 0.041 | 1.083 ± 0.015 | 2.489 ± 0.024 | 0.001 ± 0.0
Met | 1.625 ± 0.017 | 0.466 ± 0.01 | 1.314 ± 0.017 | 1.979 ± 0.019 | 0.772 ± 0.012 | 1.334 ± 0.017 | 0.586 ± 0.011 | 1.057 ± 0.014 | 1.295 ± 0.018 | 2.135 ± 0.021 | 0.999 ± 0.017 | 1.197 ± 0.017 | 1.038 ± 0.012 | 1.104 ± 0.014 | 1.802 ± 0.018 | 1.838 ± 0.021 | 1.326 ± 0.016 | 1.329 ± 0.018 | 0.291 ± 0.007 | 0.709 ± 0.012 | 0.001 ± 0.0
Asn | 3.198 ± 0.027 | 0.693 ± 0.012 | 2.776 ± 0.027 | 3.131 ± 0.028 | 1.16 ± 0.016 | 3.063 ± 0.03 | 1.075 ± 0.017 | 2.408 ± 0.024 | 2.519 ± 0.022 | 2.639 ± 0.022 | 1.016 ± 0.014 | 9.741 ± 0.186 | 1.921 ± 0.022 | 1.174 ± 0.014 | 2.118 ± 0.021 | 4.436 ± 0.042 | 4.081 ± 0.043 | 2.843 ± 0.022 | 0.348 ± 0.008 | 1.089 ± 0.014 | 0.002 ± 0.001
Pro | 3.234 ± 0.033 | 0.696 ± 0.013 | 1.859 ± 0.021 | 2.833 ± 0.028 | 1.891 ± 0.021 | 2.282 ± 0.032 | 1.395 ± 0.015 | 2.121 ± 0.023 | 1.98 ± 0.021 | 5.124 ± 0.036 | 0.961 ± 0.014 | 1.922 ± 0.018 | 4.25 ± 0.053 | 2.335 ± 0.026 | 2.576 ± 0.024 | 4.812 ± 0.038 | 3.423 ± 0.031 | 3.178 ± 0.027 | 0.452 ± 0.01 | 1.224 ± 0.013 | 0.003 ± 0.001
Gln | 2.097 ± 0.022 | 0.654 ± 0.012 | 1.463 ± 0.015 | 3.49 ± 0.028 | 1.112 ± 0.015 | 1.84 ± 0.02 | 1.352 ± 0.018 | 1.431 ± 0.017 | 2.352 ± 0.025 | 4.13 ± 0.034 | 0.962 ± 0.014 | 1.555 ± 0.019 | 1.847 ± 0.025 | 5.461 ± 0.099 | 3.618 ± 0.03 | 2.822 ± 0.026 | 2.154 ± 0.021 | 1.966 ± 0.018 | 0.493 ± 0.011 | 1.116 ± 0.015 | 0.001 ± 0.0
Arg | 4.053 ± 0.035 | 1.289 ± 0.017 | 3.103 ± 0.026 | 5.247 ± 0.046 | 2.123 ± 0.022 | 3.641 ± 0.03 | 1.797 ± 0.021 | 2.822 ± 0.024 | 3.185 ± 0.032 | 5.92 ± 0.042 | 1.532 ± 0.016 | 2.487 ± 0.025 | 2.415 ± 0.024 | 2.808 ± 0.027 | 6.07 ± 0.045 | 4.557 ± 0.031 | 3.216 ± 0.029 | 4.457 ± 0.032 | 0.796 ± 0.014 | 1.853 ± 0.019 | 0.001 ± 0.0
Ser | 5.435 ± 0.035 | 1.445 ± 0.016 | 3.687 ± 0.031 | 4.637 ± 0.035 | 2.963 ± 0.029 | 5.103 ± 0.042 | 2.139 ± 0.024 | 3.938 ± 0.03 | 3.333 ± 0.03 | 7.829 ± 0.055 | 1.819 ± 0.019 | 4.134 ± 0.036 | 4.532 ± 0.038 | 2.92 ± 0.024 | 4.408 ± 0.037 | 11.374 ± 0.108 | 6.311 ± 0.044 | 5.603 ± 0.039 | 0.758 ± 0.013 | 1.781 ± 0.02 | 0.002 ± 0.001
Thr | 5.357 ± 0.036 | 0.956 ± 0.014 | 2.699 ± 0.024 | 3.931 ± 0.032 | 1.948 ± 0.022 | 3.532 ± 0.031 | 1.522 ± 0.02 | 3.01 ± 0.026 | 2.564 ± 0.025 | 5.859 ± 0.036 | 1.404 ± 0.016 | 3.113 ± 0.033 | 4.002 ± 0.035 | 2.263 ± 0.024 | 3.26 ± 0.029 | 5.856 ± 0.046 | 8.485 ± 0.11 | 4.565 ± 0.035 | 0.54 ± 0.011 | 1.22 ± 0.016 | 0.003 ± 0.001
Val | 4.969 ± 0.034 | 1.445 ± 0.023 | 3.567 ± 0.025 | 4.944 ± 0.033 | 2.609 ± 0.026 | 3.789 ± 0.035 | 1.615 ± 0.018 | 2.871 ± 0.024 | 3.251 ± 0.03 | 6.692 ± 0.041 | 1.788 ± 0.02 | 2.807 ± 0.026 | 3.643 ± 0.032 | 2.632 ± 0.02 | 4.282 ± 0.031 | 5.496 ± 0.042 | 3.788 ± 0.031 | 5.633 ± 0.041 | 0.896 ± 0.014 | 1.841 ± 0.018 | 0.002 ± 0.001
Trp | 0.617 ± 0.011 | 0.273 ± 0.008 | 0.551 ± 0.011 | 0.72 ± 0.013 | 0.374 ± 0.008 | 0.629 ± 0.015 | 0.248 ± 0.007 | 0.514 ± 0.011 | 0.634 ± 0.012 | 0.925 ± 0.015 | 0.377 ± 0.009 | 0.516 ± 0.012 | 0.322 ± 0.008 | 0.373 ± 0.008 | 0.892 ± 0.015 | 0.799 ± 0.015 | 0.531 ± 0.01 | 0.702 ± 0.011 | 0.196 ± 0.008 | 0.341 ± 0.008 | 0.0 ± 0.0
Tyr | 1.842 ± 0.019 | 0.569 ± 0.01 | 1.413 ± 0.017 | 1.556 ± 0.017 | 1.089 ± 0.015 | 1.627 ± 0.021 | 0.767 ± 0.012 | 1.295 ± 0.017 | 1.044 ± 0.016 | 2.238 ± 0.023 | 0.677 ± 0.011 | 1.216 ± 0.016 | 1.071 ± 0.015 | 0.874 ± 0.015 | 1.61 ± 0.016 | 1.819 ± 0.02 | 1.677 ± 0.017 | 1.725 ± 0.021 | 0.312 ± 0.009 | 0.992 ± 0.016 | 0.001 ± 0.0
Xaa | 0.001 ± 0.0 | 0.001 ± 0.001 | 0.001 ± 0.0 | 0.002 ± 0.001 | 0.001 ± 0.001 | 0.001 ± 0.0 | 0.001 ± 0.0 | 0.001 ± 0.0 | 0.001 ± 0.001 | 0.002 ± 0.001 | 0.001 ± 0.001 | 0.002 ± 0.001 | 0.002 ± 0.001 | 0.001 ± 0.001 | 0.002 ± 0.001 | 0.003 ± 0.001 | 0.001 ± 0.0 | 0.003 ± 0.001 | 0.001 ± 0.0 | 0.001 ± 0.0 | 0.028 ± 0.006
Statistics based on 11236 proteins (5415872 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
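
A protein-level bootstrap of this kind might look roughly like the Python sketch below. It is only an illustration under stated assumptions: whole proteins are resampled with replacement, the statistic is recomputed for each of 100 resamples, and the sample standard deviation of those estimates is reported as the error. The page does not say which spread measure was used, and the function names, one-letter codes and fixed seed are hypothetical.

```python
import random
import statistics


def dipeptide_per_mille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    hits = total = 0
    for seq in proteins:
        # Overlapping windows of length 2 within each protein.
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the statistic over protein-level resamples.

    Assumption: the error is the sample standard deviation of the
    bootstrap estimates; the seed is fixed only for reproducibility.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, pair))
    return statistics.stdev(estimates)


toy_proteome = ["AAKK", "KKAA", "AKAK", "LLLL"]
value = dipeptide_per_mille(toy_proteome, "AA")
error = bootstrap_error(toy_proteome, "AA")
print(f"AA: {value:.3f} ± {error:.3f} per mille")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so dipeptides that co-occur within one protein stay together in every resample.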

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski