Amino acid dipepetide frequency for Terfezia boudieri ATCC MYA-4762

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.416AlaAla: 7.416 ± 0.06
0.978AlaCys: 0.978 ± 0.017
3.308AlaAsp: 3.308 ± 0.028
4.902AlaGlu: 4.902 ± 0.04
2.538AlaPhe: 2.538 ± 0.028
5.347AlaGly: 5.347 ± 0.04
1.63AlaHis: 1.63 ± 0.022
3.98AlaIle: 3.98 ± 0.03
4.043AlaLys: 4.043 ± 0.035
6.877AlaLeu: 6.877 ± 0.055
1.772AlaMet: 1.772 ± 0.022
2.599AlaAsn: 2.599 ± 0.029
4.658AlaPro: 4.658 ± 0.048
3.023AlaGln: 3.023 ± 0.035
4.534AlaArg: 4.534 ± 0.032
6.256AlaSer: 6.256 ± 0.047
4.864AlaThr: 4.864 ± 0.04
4.916AlaVal: 4.916 ± 0.035
0.984AlaTrp: 0.984 ± 0.016
1.942AlaTyr: 1.942 ± 0.019
0.0AlaXaa: 0.0 ± 0.0
Cys
0.847CysAla: 0.847 ± 0.016
0.301CysCys: 0.301 ± 0.01
0.588CysAsp: 0.588 ± 0.013
0.704CysGlu: 0.704 ± 0.011
0.539CysPhe: 0.539 ± 0.012
1.042CysGly: 1.042 ± 0.018
0.363CysHis: 0.363 ± 0.009
0.752CysIle: 0.752 ± 0.014
0.599CysLys: 0.599 ± 0.012
1.348CysLeu: 1.348 ± 0.02
0.311CysMet: 0.311 ± 0.008
0.492CysAsn: 0.492 ± 0.011
0.744CysPro: 0.744 ± 0.013
0.464CysGln: 0.464 ± 0.011
0.78CysArg: 0.78 ± 0.014
0.991CysSer: 0.991 ± 0.018
0.787CysThr: 0.787 ± 0.013
0.81CysVal: 0.81 ± 0.014
0.216CysTrp: 0.216 ± 0.008
0.4CysTyr: 0.4 ± 0.009
0.0CysXaa: 0.0 ± 0.0
Asp
3.591AspAla: 3.591 ± 0.027
0.637AspCys: 0.637 ± 0.012
3.653AspAsp: 3.653 ± 0.046
4.409AspGlu: 4.409 ± 0.046
1.883AspPhe: 1.883 ± 0.023
3.82AspGly: 3.82 ± 0.034
1.077AspHis: 1.077 ± 0.017
3.104AspIle: 3.104 ± 0.027
2.448AspLys: 2.448 ± 0.028
4.571AspLeu: 4.571 ± 0.035
1.204AspMet: 1.204 ± 0.016
1.829AspAsn: 1.829 ± 0.019
2.908AspPro: 2.908 ± 0.028
1.545AspGln: 1.545 ± 0.019
2.717AspArg: 2.717 ± 0.031
3.762AspSer: 3.762 ± 0.034
2.869AspThr: 2.869 ± 0.026
3.285AspVal: 3.285 ± 0.026
0.785AspTrp: 0.785 ± 0.014
1.505AspTyr: 1.505 ± 0.02
0.0AspXaa: 0.0 ± 0.0
Glu
5.247GluAla: 5.247 ± 0.043
0.72GluCys: 0.72 ± 0.013
4.464GluAsp: 4.464 ± 0.042
7.384GluGlu: 7.384 ± 0.089
2.076GluPhe: 2.076 ± 0.021
4.905GluGly: 4.905 ± 0.041
1.299GluHis: 1.299 ± 0.018
3.374GluIle: 3.374 ± 0.029
4.545GluLys: 4.545 ± 0.047
5.655GluLeu: 5.655 ± 0.042
1.649GluMet: 1.649 ± 0.021
2.466GluAsn: 2.466 ± 0.026
2.54GluPro: 2.54 ± 0.029
2.485GluGln: 2.485 ± 0.024
4.382GluArg: 4.382 ± 0.048
4.047GluSer: 4.047 ± 0.033
3.205GluThr: 3.205 ± 0.029
4.478GluVal: 4.478 ± 0.038
0.992GluTrp: 0.992 ± 0.016
1.816GluTyr: 1.816 ± 0.021
0.0GluXaa: 0.0 ± 0.0
Phe
2.398PheAla: 2.398 ± 0.023
0.575PheCys: 0.575 ± 0.011
1.935PheAsp: 1.935 ± 0.024
2.045PheGlu: 2.045 ± 0.021
1.453PhePhe: 1.453 ± 0.023
2.474PheGly: 2.474 ± 0.032
0.888PheHis: 0.888 ± 0.014
1.754PheIle: 1.754 ± 0.023
1.525PheLys: 1.525 ± 0.016
3.411PheLeu: 3.411 ± 0.03
0.738PheMet: 0.738 ± 0.013
1.299PheAsn: 1.299 ± 0.019
1.951PhePro: 1.951 ± 0.022
1.241PheGln: 1.241 ± 0.017
1.916PheArg: 1.916 ± 0.02
2.84PheSer: 2.84 ± 0.024
2.121PheThr: 2.121 ± 0.024
2.084PheVal: 2.084 ± 0.024
0.548PheTrp: 0.548 ± 0.011
1.022PheTyr: 1.022 ± 0.017
0.0PheXaa: 0.0 ± 0.0
Gly
4.88GlyAla: 4.88 ± 0.04
0.922GlyCys: 0.922 ± 0.017
3.655GlyAsp: 3.655 ± 0.035
4.619GlyGlu: 4.619 ± 0.04
2.52GlyPhe: 2.52 ± 0.028
7.118GlyGly: 7.118 ± 0.067
1.563GlyHis: 1.563 ± 0.02
3.642GlyIle: 3.642 ± 0.03
4.363GlyLys: 4.363 ± 0.037
5.788GlyLeu: 5.788 ± 0.038
1.754GlyMet: 1.754 ± 0.02
2.69GlyAsn: 2.69 ± 0.027
3.097GlyPro: 3.097 ± 0.035
2.338GlyGln: 2.338 ± 0.024
4.485GlyArg: 4.485 ± 0.037
5.542GlySer: 5.542 ± 0.047
3.797GlyThr: 3.797 ± 0.033
4.765GlyVal: 4.765 ± 0.033
1.208GlyTrp: 1.208 ± 0.018
2.088GlyTyr: 2.088 ± 0.027
0.0GlyXaa: 0.0 ± 0.0
His
1.611HisAla: 1.611 ± 0.019
0.354HisCys: 0.354 ± 0.009
1.087HisAsp: 1.087 ± 0.015
1.223HisGlu: 1.223 ± 0.017
0.875HisPhe: 0.875 ± 0.012
1.578HisGly: 1.578 ± 0.02
1.012HisHis: 1.012 ± 0.017
1.348HisIle: 1.348 ± 0.016
1.065HisLys: 1.065 ± 0.018
2.339HisLeu: 2.339 ± 0.022
0.526HisMet: 0.526 ± 0.011
0.914HisAsn: 0.914 ± 0.017
1.793HisPro: 1.793 ± 0.024
1.07HisGln: 1.07 ± 0.017
1.548HisArg: 1.548 ± 0.019
2.002HisSer: 2.002 ± 0.028
1.474HisThr: 1.474 ± 0.021
1.329HisVal: 1.329 ± 0.017
0.326HisTrp: 0.326 ± 0.009
0.702HisTyr: 0.702 ± 0.012
0.0HisXaa: 0.0 ± 0.0
Ile
3.819IleAla: 3.819 ± 0.031
0.805IleCys: 0.805 ± 0.014
2.736IleAsp: 2.736 ± 0.023
3.097IleGlu: 3.097 ± 0.028
1.899IlePhe: 1.899 ± 0.021
3.074IleGly: 3.074 ± 0.032
1.357IleHis: 1.357 ± 0.017
2.729IleIle: 2.729 ± 0.029
2.48IleLys: 2.48 ± 0.028
4.892IleLeu: 4.892 ± 0.041
1.075IleMet: 1.075 ± 0.016
1.893IleAsn: 1.893 ± 0.023
3.552IlePro: 3.552 ± 0.032
2.049IleGln: 2.049 ± 0.024
2.914IleArg: 2.914 ± 0.023
4.206IleSer: 4.206 ± 0.031
3.222IleThr: 3.222 ± 0.03
3.159IleVal: 3.159 ± 0.03
0.699IleTrp: 0.699 ± 0.013
1.513IleTyr: 1.513 ± 0.018
0.0IleXaa: 0.0 ± 0.0
Lys
4.408LysAla: 4.408 ± 0.04
0.624LysCys: 0.624 ± 0.012
2.803LysAsp: 2.803 ± 0.029
4.408LysGlu: 4.408 ± 0.045
1.592LysPhe: 1.592 ± 0.02
3.677LysGly: 3.677 ± 0.03
1.208LysHis: 1.208 ± 0.02
2.61LysIle: 2.61 ± 0.027
4.201LysLys: 4.201 ± 0.043
4.594LysLeu: 4.594 ± 0.035
1.16LysMet: 1.16 ± 0.016
1.911LysAsn: 1.911 ± 0.027
2.878LysPro: 2.878 ± 0.026
2.011LysGln: 2.011 ± 0.025
3.976LysArg: 3.976 ± 0.035
3.604LysSer: 3.604 ± 0.026
2.832LysThr: 2.832 ± 0.027
3.385LysVal: 3.385 ± 0.033
0.752LysTrp: 0.752 ± 0.013
1.511LysTyr: 1.511 ± 0.019
0.0LysXaa: 0.0 ± 0.0
Leu
7.023LeuAla: 7.023 ± 0.043
1.323LeuCys: 1.323 ± 0.019
4.679LeuAsp: 4.679 ± 0.035
5.901LeuGlu: 5.901 ± 0.046
3.143LeuPhe: 3.143 ± 0.028
5.868LeuGly: 5.868 ± 0.042
2.299LeuHis: 2.299 ± 0.023
3.983LeuIle: 3.983 ± 0.035
4.657LeuLys: 4.657 ± 0.033
8.674LeuLeu: 8.674 ± 0.066
1.79LeuMet: 1.79 ± 0.021
3.185LeuAsn: 3.185 ± 0.026
5.755LeuPro: 5.755 ± 0.045
3.85LeuGln: 3.85 ± 0.035
5.772LeuArg: 5.772 ± 0.037
7.16LeuSer: 7.16 ± 0.054
5.049LeuThr: 5.049 ± 0.034
5.36LeuVal: 5.36 ± 0.04
1.147LeuTrp: 1.147 ± 0.019
2.377LeuTyr: 2.377 ± 0.024
0.0LeuXaa: 0.0 ± 0.0
Met
1.945MetAla: 1.945 ± 0.022
0.275MetCys: 0.275 ± 0.008
1.277MetAsp: 1.277 ± 0.017
1.628MetGlu: 1.628 ± 0.022
0.702MetPhe: 0.702 ± 0.012
1.641MetGly: 1.641 ± 0.018
0.488MetHis: 0.488 ± 0.01
0.979MetIle: 0.979 ± 0.016
1.194MetLys: 1.194 ± 0.016
1.846MetLeu: 1.846 ± 0.02
0.544MetMet: 0.544 ± 0.012
0.819MetAsn: 0.819 ± 0.013
1.261MetPro: 1.261 ± 0.018
0.851MetGln: 0.851 ± 0.014
1.279MetArg: 1.279 ± 0.017
1.8MetSer: 1.8 ± 0.021
1.17MetThr: 1.17 ± 0.016
1.423MetVal: 1.423 ± 0.018
0.298MetTrp: 0.298 ± 0.008
0.554MetTyr: 0.554 ± 0.011
0.0MetXaa: 0.0 ± 0.0
Asn
2.681AsnAla: 2.681 ± 0.029
0.493AsnCys: 0.493 ± 0.01
1.781AsnAsp: 1.781 ± 0.022
2.094AsnGlu: 2.094 ± 0.023
1.281AsnPhe: 1.281 ± 0.017
2.701AsnGly: 2.701 ± 0.03
0.906AsnHis: 0.906 ± 0.016
2.236AsnIle: 2.236 ± 0.027
1.817AsnLys: 1.817 ± 0.028
3.369AsnLeu: 3.369 ± 0.03
0.861AsnMet: 0.861 ± 0.014
1.592AsnAsn: 1.592 ± 0.023
2.592AsnPro: 2.592 ± 0.025
1.338AsnGln: 1.338 ± 0.02
2.0AsnArg: 2.0 ± 0.022
3.029AsnSer: 3.029 ± 0.028
2.415AsnThr: 2.415 ± 0.026
2.189AsnVal: 2.189 ± 0.022
0.527AsnTrp: 0.527 ± 0.01
1.096AsnTyr: 1.096 ± 0.015
0.0AsnXaa: 0.0 ± 0.0
Pro
4.892ProAla: 4.892 ± 0.045
0.588ProCys: 0.588 ± 0.012
2.661ProAsp: 2.661 ± 0.026
3.711ProGlu: 3.711 ± 0.035
1.854ProPhe: 1.854 ± 0.02
3.852ProGly: 3.852 ± 0.038
1.549ProHis: 1.549 ± 0.021
2.912ProIle: 2.912 ± 0.028
2.944ProLys: 2.944 ± 0.026
5.096ProLeu: 5.096 ± 0.037
1.109ProMet: 1.109 ± 0.016
2.241ProAsn: 2.241 ± 0.026
6.819ProPro: 6.819 ± 0.097
2.759ProGln: 2.759 ± 0.04
3.423ProArg: 3.423 ± 0.033
5.921ProSer: 5.921 ± 0.048
4.824ProThr: 4.824 ± 0.049
3.727ProVal: 3.727 ± 0.051
0.636ProTrp: 0.636 ± 0.012
1.481ProTyr: 1.481 ± 0.019
0.0ProXaa: 0.0 ± 0.0
Gln
3.143GlnAla: 3.143 ± 0.036
0.46GlnCys: 0.46 ± 0.011
1.807GlnAsp: 1.807 ± 0.02
2.593GlnGlu: 2.593 ± 0.028
1.235GlnPhe: 1.235 ± 0.018
2.391GlnGly: 2.391 ± 0.026
1.165GlnHis: 1.165 ± 0.019
1.863GlnIle: 1.863 ± 0.022
2.159GlnLys: 2.159 ± 0.024
3.471GlnLeu: 3.471 ± 0.03
0.848GlnMet: 0.848 ± 0.014
1.529GlnAsn: 1.529 ± 0.021
2.425GlnPro: 2.425 ± 0.033
2.946GlnGln: 2.946 ± 0.067
2.55GlnArg: 2.55 ± 0.027
2.911GlnSer: 2.911 ± 0.032
2.109GlnThr: 2.109 ± 0.023
2.174GlnVal: 2.174 ± 0.025
0.516GlnTrp: 0.516 ± 0.01
1.162GlnTyr: 1.162 ± 0.02
0.0GlnXaa: 0.0 ± 0.0
Arg
4.428ArgAla: 4.428 ± 0.037
0.768ArgCys: 0.768 ± 0.014
3.093ArgAsp: 3.093 ± 0.027
4.479ArgGlu: 4.479 ± 0.05
2.021ArgPhe: 2.021 ± 0.022
4.302ArgGly: 4.302 ± 0.037
1.468ArgHis: 1.468 ± 0.02
3.105ArgIle: 3.105 ± 0.029
4.077ArgLys: 4.077 ± 0.036
5.294ArgLeu: 5.294 ± 0.037
1.418ArgMet: 1.418 ± 0.018
2.341ArgAsn: 2.341 ± 0.026
3.288ArgPro: 3.288 ± 0.031
2.454ArgGln: 2.454 ± 0.029
5.216ArgArg: 5.216 ± 0.053
4.512ArgSer: 4.512 ± 0.042
3.297ArgThr: 3.297 ± 0.029
3.609ArgVal: 3.609 ± 0.027
0.929ArgTrp: 0.929 ± 0.016
1.7ArgTyr: 1.7 ± 0.02
0.0ArgXaa: 0.0 ± 0.0
Ser
5.852SerAla: 5.852 ± 0.043
0.947SerCys: 0.947 ± 0.015
3.631SerAsp: 3.631 ± 0.033
4.138SerGlu: 4.138 ± 0.033
2.708SerPhe: 2.708 ± 0.025
5.593SerGly: 5.593 ± 0.046
1.981SerHis: 1.981 ± 0.026
4.187SerIle: 4.187 ± 0.035
3.728SerLys: 3.728 ± 0.03
6.965SerLeu: 6.965 ± 0.042
1.652SerMet: 1.652 ± 0.02
2.996SerAsn: 2.996 ± 0.031
6.033SerPro: 6.033 ± 0.061
3.021SerGln: 3.021 ± 0.03
4.866SerArg: 4.866 ± 0.043
8.623SerSer: 8.623 ± 0.083
6.012SerThr: 6.012 ± 0.043
4.285SerVal: 4.285 ± 0.036
0.969SerTrp: 0.969 ± 0.015
2.039SerTyr: 2.039 ± 0.024
0.0SerXaa: 0.0 ± 0.0
Thr
4.885ThrAla: 4.885 ± 0.039
0.769ThrCys: 0.769 ± 0.014
2.522ThrAsp: 2.522 ± 0.027
3.269ThrGlu: 3.269 ± 0.034
2.099ThrPhe: 2.099 ± 0.026
4.053ThrGly: 4.053 ± 0.038
1.451ThrHis: 1.451 ± 0.019
3.315ThrIle: 3.315 ± 0.032
2.787ThrLys: 2.787 ± 0.03
5.345ThrLeu: 5.345 ± 0.042
1.18ThrMet: 1.18 ± 0.017
2.189ThrAsn: 2.189 ± 0.024
5.027ThrPro: 5.027 ± 0.052
2.159ThrGln: 2.159 ± 0.027
3.235ThrArg: 3.235 ± 0.03
5.577ThrSer: 5.577 ± 0.04
5.079ThrThr: 5.079 ± 0.07
3.593ThrVal: 3.593 ± 0.026
0.763ThrTrp: 0.763 ± 0.014
1.661ThrTyr: 1.661 ± 0.02
0.0ThrXaa: 0.0 ± 0.0
Val
4.646ValAla: 4.646 ± 0.034
0.853ValCys: 0.853 ± 0.016
3.566ValAsp: 3.566 ± 0.028
4.549ValGlu: 4.549 ± 0.037
2.175ValPhe: 2.175 ± 0.023
4.445ValGly: 4.445 ± 0.04
1.372ValHis: 1.372 ± 0.015
3.014ValIle: 3.014 ± 0.027
3.406ValLys: 3.406 ± 0.029
5.466ValLeu: 5.466 ± 0.04
1.414ValMet: 1.414 ± 0.017
2.3ValAsn: 2.3 ± 0.027
3.582ValPro: 3.582 ± 0.043
2.284ValGln: 2.284 ± 0.025
3.569ValArg: 3.569 ± 0.029
4.383ValSer: 4.383 ± 0.032
3.417ValThr: 3.417 ± 0.033
4.54ValVal: 4.54 ± 0.038
0.891ValTrp: 0.891 ± 0.016
1.728ValTyr: 1.728 ± 0.02
0.0ValXaa: 0.0 ± 0.0
Trp
0.988TrpAla: 0.988 ± 0.016
0.21TrpCys: 0.21 ± 0.006
0.869TrpAsp: 0.869 ± 0.016
0.973TrpGlu: 0.973 ± 0.015
0.485TrpPhe: 0.485 ± 0.01
1.053TrpGly: 1.053 ± 0.015
0.313TrpHis: 0.313 ± 0.008
0.687TrpIle: 0.687 ± 0.012
0.866TrpLys: 0.866 ± 0.015
1.256TrpLeu: 1.256 ± 0.019
0.348TrpMet: 0.348 ± 0.009
0.57TrpAsn: 0.57 ± 0.012
0.505TrpPro: 0.505 ± 0.011
0.473TrpGln: 0.473 ± 0.01
1.008TrpArg: 1.008 ± 0.016
0.921TrpSer: 0.921 ± 0.015
0.723TrpThr: 0.723 ± 0.012
0.949TrpVal: 0.949 ± 0.017
0.264TrpTrp: 0.264 ± 0.009
0.392TrpTyr: 0.392 ± 0.011
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.866TyrAla: 1.866 ± 0.023
0.477TyrCys: 0.477 ± 0.011
1.487TyrAsp: 1.487 ± 0.019
1.603TyrGlu: 1.603 ± 0.019
1.138TyrPhe: 1.138 ± 0.017
1.9TyrGly: 1.9 ± 0.025
0.786TyrHis: 0.786 ± 0.016
1.594TyrIle: 1.594 ± 0.019
1.244TyrLys: 1.244 ± 0.018
2.735TyrLeu: 2.735 ± 0.031
0.621TyrMet: 0.621 ± 0.015
1.142TyrAsn: 1.142 ± 0.017
1.593TyrPro: 1.593 ± 0.02
1.097TyrGln: 1.097 ± 0.017
1.631TyrArg: 1.631 ± 0.021
2.139TyrSer: 2.139 ± 0.022
1.695TyrThr: 1.695 ± 0.019
1.541TyrVal: 1.541 ± 0.021
0.401TyrTrp: 0.401 ± 0.009
0.935TyrTyr: 0.935 ± 0.019
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.001XaaXaa: 0.001 ± 0.001
Statistics based on 12026 proteins (4669809 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski