Amino acid dipeptide frequency for Amphibacillus marinus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
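
As an illustration of what such a calculation might look like, here is a minimal Python sketch. It assumes that dipeptides are counted as overlapping adjacent pairs within each protein and that sequences use one-letter codes; the function name `dipeptide_fractions` and the toy sequences are hypothetical and not part of Proteome-pI.

```python
from collections import Counter

def dipeptide_fractions(sequences):
    """Return {dipeptide: fraction} over all overlapping residue pairs.

    Pairs are counted within each protein only, so the last residue of one
    protein is never paired with the first residue of the next (assumption).
    """
    counts = Counter()
    for seq in sequences:
        # zip(seq, seq[1:]) yields every adjacent pair: "MKV" -> MK, KV
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}

# Toy input (hypothetical sequences, not taken from the proteome)
toy = ["MKVLA", "MKKA"]
fractions = dipeptide_fractions(toy)
```

In this toy input there are seven dipeptides in total and "MK" occurs twice, so its fraction is 2/7.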

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
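
The comparison against the uniform expectation can be sketched as follows. This assumes that "expected" means the flat value of 1/400 and that the error estimate is ignored when classifying; the exact rule used for the colouring is not stated here, so `classify` is a hypothetical helper. The observed values are copied from the table.

```python
# Uniform expectation: 20 standard amino acids give 20 * 20 = 400 dipeptides
N_STANDARD = 20
expected_per_mille = 1000 / N_STANDARD**2  # 2.5 per mille, i.e. 0.25%

def classify(value_per_mille, expected=expected_per_mille):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if value_per_mille > expected:
        return "over"   # shown in red in the original table
    if value_per_mille < expected:
        return "under"  # shown in blue in the original table
    return "as expected"

# Values (per mille) copied from the table
observed = {"LeuLeu": 10.33, "AlaAla": 5.385, "CysCys": 0.102, "TrpTrp": 0.132}
labels = {pair: classify(value) for pair, value in observed.items()}
```

With these inputs LeuLeu and AlaAla fall above 2.5‰, while CysCys and TrpTrp fall below it.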

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
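
A short worked conversion, using the AlaAla value from the table (the helper name `per_mille_to_fraction` is only illustrative):

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction."""
    return value_per_mille * 1e-3

ala_ala = per_mille_to_fraction(5.385)  # AlaAla value from the table
ala_ala_percent = ala_ala * 100         # the same value expressed in percent
```

Here 5.385‰ corresponds to a fraction of about 0.005385, or roughly 0.54%.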

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.385 ± 0.098
AlaCys: 0.64 ± 0.028
AlaAsp: 4.1 ± 0.072
AlaGlu: 5.12 ± 0.086
AlaPhe: 3.391 ± 0.065
AlaGly: 5.137 ± 0.085
AlaHis: 1.441 ± 0.042
AlaIle: 6.599 ± 0.09
AlaLys: 4.451 ± 0.067
AlaLeu: 7.974 ± 0.098
AlaMet: 1.951 ± 0.051
AlaAsn: 3.597 ± 0.065
AlaPro: 2.094 ± 0.057
AlaGln: 2.928 ± 0.057
AlaArg: 2.884 ± 0.057
AlaSer: 4.207 ± 0.064
AlaThr: 4.153 ± 0.061
AlaVal: 5.166 ± 0.075
AlaTrp: 0.645 ± 0.028
AlaTyr: 2.575 ± 0.045
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.431 ± 0.023
CysCys: 0.102 ± 0.011
CysAsp: 0.349 ± 0.02
CysGlu: 0.369 ± 0.02
CysPhe: 0.309 ± 0.019
CysGly: 0.551 ± 0.026
CysHis: 0.232 ± 0.015
CysIle: 0.462 ± 0.021
CysLys: 0.292 ± 0.018
CysLeu: 0.723 ± 0.028
CysMet: 0.167 ± 0.016
CysAsn: 0.28 ± 0.018
CysPro: 0.292 ± 0.015
CysGln: 0.336 ± 0.019
CysArg: 0.257 ± 0.018
CysSer: 0.429 ± 0.023
CysThr: 0.353 ± 0.018
CysVal: 0.374 ± 0.019
CysTrp: 0.063 ± 0.008
CysTyr: 0.326 ± 0.017
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.316 ± 0.064
AspCys: 0.379 ± 0.02
AspAsp: 2.878 ± 0.066
AspGlu: 4.101 ± 0.079
AspPhe: 2.617 ± 0.06
AspGly: 3.339 ± 0.068
AspHis: 1.528 ± 0.047
AspIle: 4.004 ± 0.076
AspLys: 2.682 ± 0.054
AspLeu: 5.625 ± 0.076
AspMet: 1.264 ± 0.036
AspAsn: 2.157 ± 0.046
AspPro: 2.032 ± 0.05
AspGln: 3.804 ± 0.075
AspArg: 2.594 ± 0.046
AspSer: 2.566 ± 0.05
AspThr: 2.411 ± 0.053
AspVal: 3.624 ± 0.067
AspTrp: 0.733 ± 0.03
AspTyr: 2.498 ± 0.055
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.843 ± 0.093
GluCys: 0.282 ± 0.018
GluAsp: 3.579 ± 0.067
GluGlu: 5.556 ± 0.092
GluPhe: 2.016 ± 0.048
GluGly: 3.88 ± 0.067
GluHis: 1.616 ± 0.042
GluIle: 5.19 ± 0.076
GluLys: 4.369 ± 0.068
GluLeu: 7.36 ± 0.104
GluMet: 1.967 ± 0.048
GluAsn: 2.995 ± 0.056
GluPro: 2.111 ± 0.047
GluGln: 5.079 ± 0.097
GluArg: 3.598 ± 0.068
GluSer: 3.314 ± 0.061
GluThr: 3.715 ± 0.073
GluVal: 4.816 ± 0.077
GluTrp: 0.72 ± 0.028
GluTyr: 1.778 ± 0.051
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.073 ± 0.063
PheCys: 0.296 ± 0.018
PheAsp: 2.558 ± 0.046
PheGlu: 2.829 ± 0.059
PhePhe: 2.31 ± 0.064
PheGly: 2.971 ± 0.064
PheHis: 0.87 ± 0.032
PheIle: 4.006 ± 0.076
PheLys: 2.486 ± 0.053
PheLeu: 4.367 ± 0.083
PheMet: 1.023 ± 0.034
PheAsn: 2.431 ± 0.056
PhePro: 1.494 ± 0.041
PheGln: 1.699 ± 0.042
PheArg: 1.527 ± 0.038
PheSer: 3.36 ± 0.064
PheThr: 2.721 ± 0.057
PheVal: 3.078 ± 0.059
PheTrp: 0.429 ± 0.021
PheTyr: 1.796 ± 0.041
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.591 ± 0.081
GlyCys: 0.545 ± 0.024
GlyAsp: 3.211 ± 0.067
GlyGlu: 4.354 ± 0.074
GlyPhe: 3.306 ± 0.064
GlyGly: 4.263 ± 0.084
GlyHis: 1.404 ± 0.038
GlyIle: 5.407 ± 0.079
GlyLys: 3.8 ± 0.067
GlyLeu: 6.674 ± 0.09
GlyMet: 1.824 ± 0.049
GlyAsn: 2.545 ± 0.047
GlyPro: 1.621 ± 0.044
GlyGln: 2.664 ± 0.056
GlyArg: 2.644 ± 0.05
GlySer: 3.779 ± 0.067
GlyThr: 3.482 ± 0.062
GlyVal: 4.735 ± 0.083
GlyTrp: 0.715 ± 0.029
GlyTyr: 2.889 ± 0.051
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.539 ± 0.04
HisCys: 0.2 ± 0.016
HisAsp: 1.238 ± 0.036
HisGlu: 1.336 ± 0.04
HisPhe: 1.203 ± 0.039
HisGly: 1.363 ± 0.039
HisHis: 0.693 ± 0.033
HisIle: 1.519 ± 0.044
HisLys: 0.957 ± 0.032
HisLeu: 2.256 ± 0.055
HisMet: 0.467 ± 0.022
HisAsn: 0.984 ± 0.033
HisPro: 1.055 ± 0.034
HisGln: 1.048 ± 0.036
HisArg: 0.841 ± 0.031
HisSer: 1.172 ± 0.037
HisThr: 1.096 ± 0.034
HisVal: 1.54 ± 0.038
HisTrp: 0.224 ± 0.017
HisTyr: 1.165 ± 0.04
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.551 ± 0.094
IleCys: 0.607 ± 0.028
IleAsp: 5.0 ± 0.075
IleGlu: 6.417 ± 0.093
IlePhe: 3.234 ± 0.078
IleGly: 5.837 ± 0.085
IleHis: 1.548 ± 0.045
IleIle: 6.605 ± 0.096
IleLys: 4.882 ± 0.078
IleLeu: 6.749 ± 0.109
IleMet: 1.759 ± 0.04
IleAsn: 4.242 ± 0.065
IlePro: 3.101 ± 0.058
IleGln: 2.82 ± 0.059
IleArg: 3.054 ± 0.059
IleSer: 4.913 ± 0.075
IleThr: 4.516 ± 0.072
IleVal: 5.552 ± 0.076
IleTrp: 0.688 ± 0.028
IleTyr: 2.615 ± 0.054
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.326 ± 0.059
LysCys: 0.294 ± 0.018
LysAsp: 3.04 ± 0.059
LysGlu: 4.59 ± 0.071
LysPhe: 1.641 ± 0.041
LysGly: 3.453 ± 0.063
LysHis: 1.423 ± 0.037
LysIle: 3.833 ± 0.066
LysLys: 4.427 ± 0.081
LysLeu: 5.723 ± 0.083
LysMet: 1.584 ± 0.043
LysAsn: 2.56 ± 0.054
LysPro: 1.955 ± 0.045
LysGln: 3.851 ± 0.074
LysArg: 3.14 ± 0.057
LysSer: 2.845 ± 0.062
LysThr: 2.975 ± 0.057
LysVal: 3.875 ± 0.066
LysTrp: 0.621 ± 0.027
LysTyr: 1.828 ± 0.044
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.052 ± 0.109
LeuCys: 0.582 ± 0.024
LeuAsp: 5.293 ± 0.08
LeuGlu: 6.331 ± 0.097
LeuPhe: 4.994 ± 0.092
LeuGly: 6.021 ± 0.087
LeuHis: 1.773 ± 0.04
LeuIle: 8.551 ± 0.127
LeuLys: 5.932 ± 0.088
LeuLeu: 10.33 ± 0.144
LeuMet: 2.433 ± 0.057
LeuAsn: 4.91 ± 0.082
LeuPro: 3.859 ± 0.068
LeuGln: 3.546 ± 0.069
LeuArg: 3.608 ± 0.059
LeuSer: 7.043 ± 0.093
LeuThr: 6.845 ± 0.086
LeuVal: 6.635 ± 0.087
LeuTrp: 0.776 ± 0.033
LeuTyr: 3.31 ± 0.063
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.868 ± 0.044
MetCys: 0.116 ± 0.011
MetAsp: 1.212 ± 0.037
MetGlu: 1.467 ± 0.039
MetPhe: 0.961 ± 0.04
MetGly: 1.452 ± 0.039
MetHis: 0.5 ± 0.022
MetIle: 2.02 ± 0.047
MetLys: 1.839 ± 0.036
MetLeu: 2.599 ± 0.054
MetMet: 0.749 ± 0.031
MetAsn: 1.273 ± 0.043
MetPro: 0.906 ± 0.036
MetGln: 0.957 ± 0.034
MetArg: 1.145 ± 0.036
MetSer: 1.508 ± 0.04
MetThr: 1.609 ± 0.038
MetVal: 1.743 ± 0.043
MetTrp: 0.158 ± 0.014
MetTyr: 0.663 ± 0.027
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.808 ± 0.051
AsnCys: 0.334 ± 0.018
AsnAsp: 2.591 ± 0.055
AsnGlu: 3.564 ± 0.058
AsnPhe: 1.765 ± 0.043
AsnGly: 3.23 ± 0.058
AsnHis: 1.215 ± 0.035
AsnIle: 3.347 ± 0.067
AsnLys: 2.894 ± 0.067
AsnLeu: 4.114 ± 0.062
AsnMet: 1.156 ± 0.033
AsnAsn: 2.246 ± 0.06
AsnPro: 1.978 ± 0.039
AsnGln: 3.027 ± 0.064
AsnArg: 2.221 ± 0.045
AsnSer: 2.209 ± 0.052
AsnThr: 2.116 ± 0.047
AsnVal: 2.829 ± 0.058
AsnTrp: 0.613 ± 0.024
AsnTyr: 1.988 ± 0.045
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.367 ± 0.05
ProCys: 0.216 ± 0.017
ProAsp: 2.084 ± 0.053
ProGlu: 2.708 ± 0.057
ProPhe: 1.874 ± 0.048
ProGly: 2.064 ± 0.056
ProHis: 0.76 ± 0.027
ProIle: 3.212 ± 0.061
ProLys: 1.684 ± 0.044
ProLeu: 3.323 ± 0.059
ProMet: 0.712 ± 0.031
ProAsn: 1.807 ± 0.045
ProPro: 0.849 ± 0.037
ProGln: 1.024 ± 0.029
ProArg: 1.057 ± 0.033
ProSer: 2.16 ± 0.049
ProThr: 2.049 ± 0.045
ProVal: 2.538 ± 0.054
ProTrp: 0.328 ± 0.021
ProTyr: 1.398 ± 0.042
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.343 ± 0.076
GlnCys: 0.221 ± 0.016
GlnAsp: 2.276 ± 0.051
GlnGlu: 3.089 ± 0.064
GlnPhe: 2.115 ± 0.044
GlnGly: 2.623 ± 0.055
GlnHis: 1.08 ± 0.036
GlnIle: 3.493 ± 0.068
GlnLys: 2.219 ± 0.053
GlnLeu: 6.187 ± 0.103
GlnMet: 1.147 ± 0.031
GlnAsn: 1.642 ± 0.04
GlnPro: 1.493 ± 0.043
GlnGln: 2.684 ± 0.067
GlnArg: 1.893 ± 0.049
GlnSer: 2.637 ± 0.056
GlnThr: 2.631 ± 0.057
GlnVal: 3.275 ± 0.064
GlnTrp: 0.462 ± 0.021
GlnTyr: 1.604 ± 0.042
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.756 ± 0.059
ArgCys: 0.24 ± 0.017
ArgAsp: 2.141 ± 0.049
ArgGlu: 3.011 ± 0.06
ArgPhe: 2.009 ± 0.039
ArgGly: 2.306 ± 0.051
ArgHis: 0.893 ± 0.029
ArgIle: 3.45 ± 0.063
ArgLys: 2.68 ± 0.054
ArgLeu: 4.404 ± 0.086
ArgMet: 1.166 ± 0.035
ArgAsn: 1.817 ± 0.043
ArgPro: 1.28 ± 0.036
ArgGln: 1.938 ± 0.05
ArgArg: 1.793 ± 0.045
ArgSer: 2.284 ± 0.046
ArgThr: 2.214 ± 0.049
ArgVal: 2.732 ± 0.056
ArgTrp: 0.398 ± 0.02
ArgTyr: 1.784 ± 0.044
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.788 ± 0.071
SerCys: 0.378 ± 0.018
SerAsp: 3.035 ± 0.057
SerGlu: 3.804 ± 0.063
SerPhe: 3.313 ± 0.058
SerGly: 4.424 ± 0.068
SerHis: 1.2 ± 0.037
SerIle: 4.957 ± 0.073
SerLys: 3.19 ± 0.057
SerLeu: 5.953 ± 0.085
SerMet: 1.475 ± 0.038
SerAsn: 2.599 ± 0.052
SerPro: 1.876 ± 0.043
SerGln: 2.246 ± 0.052
SerArg: 2.208 ± 0.045
SerSer: 3.539 ± 0.067
SerThr: 3.028 ± 0.054
SerVal: 3.934 ± 0.069
SerTrp: 0.667 ± 0.025
SerTyr: 2.517 ± 0.052
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.133 ± 0.064
ThrCys: 0.36 ± 0.019
ThrAsp: 3.154 ± 0.072
ThrGlu: 3.785 ± 0.068
ThrPhe: 2.75 ± 0.052
ThrGly: 4.158 ± 0.07
ThrHis: 1.123 ± 0.037
ThrIle: 5.173 ± 0.089
ThrLys: 3.004 ± 0.05
ThrLeu: 5.449 ± 0.07
ThrMet: 1.237 ± 0.039
ThrAsn: 2.703 ± 0.054
ThrPro: 2.07 ± 0.052
ThrGln: 1.717 ± 0.042
ThrArg: 1.968 ± 0.039
ThrSer: 3.081 ± 0.059
ThrThr: 3.205 ± 0.063
ThrVal: 4.202 ± 0.076
ThrTrp: 0.483 ± 0.023
ThrTyr: 2.031 ± 0.047
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.35 ± 0.095
ValCys: 0.503 ± 0.025
ValAsp: 3.858 ± 0.066
ValGlu: 4.694 ± 0.078
ValPhe: 3.09 ± 0.064
ValGly: 4.407 ± 0.081
ValHis: 1.26 ± 0.037
ValIle: 5.809 ± 0.087
ValLys: 4.129 ± 0.071
ValLeu: 6.405 ± 0.08
ValMet: 1.679 ± 0.041
ValAsn: 3.426 ± 0.063
ValPro: 2.371 ± 0.058
ValGln: 2.272 ± 0.051
ValArg: 2.614 ± 0.055
ValSer: 4.368 ± 0.074
ValThr: 4.305 ± 0.069
ValVal: 4.834 ± 0.083
ValTrp: 0.604 ± 0.028
ValTyr: 2.275 ± 0.055
ValXaa: 0.001 ± 0.001
Trp
TrpAla: 0.605 ± 0.027
TrpCys: 0.083 ± 0.01
TrpAsp: 0.52 ± 0.024
TrpGlu: 0.561 ± 0.026
TrpPhe: 0.526 ± 0.027
TrpGly: 0.612 ± 0.026
TrpHis: 0.223 ± 0.015
TrpIle: 0.656 ± 0.026
TrpLys: 0.468 ± 0.022
TrpLeu: 1.386 ± 0.037
TrpMet: 0.216 ± 0.015
TrpAsn: 0.454 ± 0.02
TrpPro: 0.277 ± 0.019
TrpGln: 0.565 ± 0.026
TrpArg: 0.45 ± 0.024
TrpSer: 0.651 ± 0.023
TrpThr: 0.504 ± 0.02
TrpVal: 0.609 ± 0.03
TrpTrp: 0.132 ± 0.013
TrpTyr: 0.385 ± 0.018
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.313 ± 0.05
TyrCys: 0.317 ± 0.017
TyrAsp: 2.035 ± 0.043
TyrGlu: 2.095 ± 0.05
TyrPhe: 1.9 ± 0.043
TyrGly: 2.361 ± 0.047
TyrHis: 1.115 ± 0.039
TyrIle: 2.398 ± 0.047
TyrLys: 1.598 ± 0.042
TyrLeu: 4.225 ± 0.064
TyrMet: 0.77 ± 0.029
TyrAsn: 1.572 ± 0.045
TyrPro: 1.5 ± 0.042
TyrGln: 2.91 ± 0.066
TyrArg: 1.801 ± 0.046
TyrSer: 2.068 ± 0.053
TyrThr: 1.859 ± 0.046
TyrVal: 2.162 ± 0.046
TyrTrp: 0.426 ± 0.018
TyrTyr: 1.663 ± 0.046
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.001 ± 0.001
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.003 ± 0.003
Statistics based on 3270 proteins (988883 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
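
A protein-level bootstrap of this kind might be sketched as below. This is an illustration under stated assumptions, not the database's actual procedure: it assumes that whole proteins are resampled with replacement, that dipeptides are overlapping adjacent pairs within a protein, and that the reported error is the standard deviation across replicates. The function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter

def dipeptide_per_mille(sequences):
    """Dipeptide frequencies in per mille over overlapping adjacent pairs."""
    counts = Counter()
    for seq in sequences:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000 * n / total for pair, n in counts.items()}

def bootstrap_errors(sequences, n_replicates=100, seed=0):
    """Estimate per-dipeptide errors by resampling whole proteins."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    replicates = []
    for _ in range(n_replicates):
        # Draw as many proteins as in the input, with replacement
        sample = rng.choices(sequences, k=len(sequences))
        replicates.append(dipeptide_per_mille(sample))
    pairs = set().union(*replicates)
    # A dipeptide absent from a replicate contributes a frequency of 0
    return {
        pair: statistics.pstdev([rep.get(pair, 0.0) for rep in replicates])
        for pair in pairs
    }

# Toy input (hypothetical sequences, not taken from the proteome)
proteins = ["MKVLA", "MKKA", "AAKL", "LLAV"]
errors = bootstrap_errors(proteins, n_replicates=100)
```

Resampling whole proteins rather than individual residues keeps the dependence between neighbouring residues within a protein intact, which is presumably why the resampling unit is the protein.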

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski