Amino acid dipeptide frequency for Harryflintia acetispora

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteome.
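As a rough illustration of how such a frequency can be computed, the sketch below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. It is only a minimal sketch: the function name `dipeptide_per_mille`, the use of one-letter codes (the table uses three-letter codes), and the choice to map every non-standard letter to X are assumptions, not the database's actual pipeline.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


# Toy input (not real proteome data): pairs are MA, AA, AL, AA, AG.
freqs = dipeptide_per_mille(["MAAL", "AAG"])
```

On this toy input, AA accounts for 2 of the 5 pairs, so its frequency is 400‰.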

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 standard dipeptides should be present with a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
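The arithmetic behind these numbers, and the over/under comparison, can be sketched as follows. The `classify` helper is hypothetical and simply compares a value against the uniform expectation described above; the exact rule used to colour the original table may differ. The three observed values are taken from the table on this page.

```python
N_STANDARD = 20   # standard amino acids
N_WITH_XAA = 21   # standard amino acids plus Xaa

n_dipeptides = N_STANDARD ** 2        # 400 ordered pairs
n_with_xaa = N_WITH_XAA ** 2          # 441 ordered pairs
expected_per_mille = 1000.0 / n_dipeptides  # 2.5 per mille, i.e. 0.25 %


def classify(observed_per_mille, expected=expected_per_mille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected:
        return "over-represented (red)"
    if observed_per_mille < expected:
        return "under-represented (blue)"
    return "as expected"


# Frequencies in per mille, copied from the table.
observed = {"AlaAla": 11.188, "CysCys": 0.419, "AspAsp": 2.434}
labels = {name: classify(value) for name, value in observed.items()}
```

With this rule, AlaAla falls above the 2.5‰ expectation, while CysCys and AspAsp fall below it.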

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions (or divide by 10 to obtain percent).
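A short sketch of the unit conversion, using the AlaAla value from the table. The helper names are made up for illustration. The final step, which turns the fraction into an approximate pair count, rests on two assumptions that the page does not confirm: that frequencies are pooled over all overlapping pairs, and that the number of pairs equals the number of amino acids minus the number of proteins reported in the statistics line.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_per_mille / 10.0


ala_ala = 11.188                            # AlaAla frequency from the table, in per mille
fraction = per_mille_to_fraction(ala_ala)   # about 0.011188
percent = per_mille_to_percent(ala_ala)     # about 1.1188 %

# Assumption: each protein of length L contributes L - 1 overlapping pairs.
n_proteins, n_residues = 2808, 875103
n_pairs = n_residues - n_proteins
approx_count = fraction * n_pairs           # rough number of AlaAla pairs
```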

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰, ± error), grouped by first amino acid; within each group the second amino acid follows the order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 11.188 ± 0.161
AlaCys: 1.475 ± 0.046
AlaAsp: 4.471 ± 0.088
AlaGlu: 4.783 ± 0.088
AlaPhe: 3.368 ± 0.074
AlaGly: 8.664 ± 0.124
AlaHis: 1.503 ± 0.041
AlaIle: 4.994 ± 0.075
AlaLys: 4.242 ± 0.08
AlaLeu: 10.719 ± 0.157
AlaMet: 2.451 ± 0.061
AlaAsn: 2.267 ± 0.058
AlaPro: 3.772 ± 0.08
AlaGln: 3.832 ± 0.081
AlaArg: 4.918 ± 0.092
AlaSer: 4.675 ± 0.104
AlaThr: 3.672 ± 0.08
AlaVal: 7.376 ± 0.096
AlaTrp: 0.699 ± 0.029
AlaTyr: 2.501 ± 0.055
AlaXaa: 0.002 ± 0.001
Cys
CysAla: 1.839 ± 0.059
CysCys: 0.419 ± 0.026
CysAsp: 1.011 ± 0.043
CysGlu: 1.214 ± 0.043
CysPhe: 0.662 ± 0.027
CysGly: 2.027 ± 0.064
CysHis: 0.325 ± 0.022
CysIle: 0.722 ± 0.032
CysLys: 0.76 ± 0.033
CysLeu: 1.334 ± 0.046
CysMet: 0.397 ± 0.023
CysAsn: 0.491 ± 0.022
CysPro: 0.798 ± 0.038
CysGln: 0.507 ± 0.027
CysArg: 1.105 ± 0.037
CysSer: 1.0 ± 0.035
CysThr: 0.816 ± 0.034
CysVal: 1.097 ± 0.033
CysTrp: 0.145 ± 0.014
CysTyr: 0.692 ± 0.035
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.029 ± 0.075
AspCys: 0.9 ± 0.038
AspAsp: 2.434 ± 0.067
AspGlu: 4.411 ± 0.085
AspPhe: 2.548 ± 0.06
AspGly: 4.785 ± 0.108
AspHis: 0.83 ± 0.034
AspIle: 3.541 ± 0.067
AspLys: 2.479 ± 0.065
AspLeu: 4.629 ± 0.075
AspMet: 1.428 ± 0.05
AspAsn: 1.628 ± 0.046
AspPro: 2.628 ± 0.062
AspGln: 1.265 ± 0.037
AspArg: 2.721 ± 0.057
AspSer: 2.797 ± 0.069
AspThr: 2.679 ± 0.061
AspVal: 3.438 ± 0.069
AspTrp: 0.617 ± 0.03
AspTyr: 2.513 ± 0.06
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.125 ± 0.086
GluCys: 0.848 ± 0.036
GluAsp: 3.574 ± 0.072
GluGlu: 5.855 ± 0.112
GluPhe: 2.093 ± 0.049
GluGly: 5.252 ± 0.09
GluHis: 1.403 ± 0.045
GluIle: 4.339 ± 0.085
GluLys: 3.738 ± 0.077
GluLeu: 7.033 ± 0.102
GluMet: 2.045 ± 0.054
GluAsn: 2.85 ± 0.06
GluPro: 2.324 ± 0.054
GluGln: 3.061 ± 0.072
GluArg: 5.418 ± 0.123
GluSer: 3.398 ± 0.064
GluThr: 3.061 ± 0.058
GluVal: 4.172 ± 0.081
GluTrp: 0.575 ± 0.026
GluTyr: 2.58 ± 0.061
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.379 ± 0.073
PheCys: 0.9 ± 0.031
PheAsp: 2.351 ± 0.053
PheGlu: 2.667 ± 0.057
PhePhe: 1.774 ± 0.057
PheGly: 3.373 ± 0.063
PheHis: 0.729 ± 0.029
PheIle: 2.133 ± 0.056
PheLys: 1.544 ± 0.048
PheLeu: 4.105 ± 0.096
PheMet: 0.945 ± 0.032
PheAsn: 1.108 ± 0.041
PhePro: 1.572 ± 0.041
PheGln: 1.06 ± 0.033
PheArg: 1.68 ± 0.046
PheSer: 3.044 ± 0.053
PheThr: 2.355 ± 0.048
PheVal: 2.664 ± 0.054
PheTrp: 0.514 ± 0.024
PheTyr: 1.556 ± 0.046
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.491 ± 0.101
GlyCys: 1.646 ± 0.054
GlyAsp: 4.019 ± 0.074
GlyGlu: 6.72 ± 0.104
GlyPhe: 3.291 ± 0.061
GlyGly: 7.286 ± 0.112
GlyHis: 1.382 ± 0.039
GlyIle: 5.141 ± 0.084
GlyLys: 4.471 ± 0.084
GlyLeu: 7.599 ± 0.111
GlyMet: 2.57 ± 0.061
GlyAsn: 2.624 ± 0.066
GlyPro: 1.995 ± 0.047
GlyGln: 2.623 ± 0.058
GlyArg: 4.829 ± 0.072
GlySer: 4.949 ± 0.098
GlyThr: 4.274 ± 0.09
GlyVal: 6.214 ± 0.091
GlyTrp: 0.855 ± 0.035
GlyTyr: 3.386 ± 0.063
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.451 ± 0.043
HisCys: 0.322 ± 0.02
HisAsp: 0.838 ± 0.031
HisGlu: 1.09 ± 0.039
HisPhe: 0.834 ± 0.029
HisGly: 1.388 ± 0.045
HisHis: 0.376 ± 0.022
HisIle: 1.111 ± 0.036
HisLys: 0.788 ± 0.034
HisLeu: 1.684 ± 0.048
HisMet: 0.442 ± 0.02
HisAsn: 0.635 ± 0.026
HisPro: 1.025 ± 0.032
HisGln: 0.469 ± 0.024
HisArg: 1.003 ± 0.038
HisSer: 0.995 ± 0.036
HisThr: 0.987 ± 0.036
HisVal: 0.958 ± 0.032
HisTrp: 0.171 ± 0.014
HisTyr: 0.7 ± 0.031
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.477 ± 0.098
IleCys: 1.006 ± 0.036
IleAsp: 3.434 ± 0.07
IleGlu: 4.1 ± 0.068
IlePhe: 2.196 ± 0.058
IleGly: 4.765 ± 0.079
IleHis: 0.956 ± 0.034
IleIle: 3.187 ± 0.075
IleLys: 2.713 ± 0.06
IleLeu: 5.81 ± 0.114
IleMet: 1.361 ± 0.046
IleAsn: 1.898 ± 0.048
IlePro: 2.968 ± 0.061
IleGln: 1.52 ± 0.037
IleArg: 3.15 ± 0.062
IleSer: 3.851 ± 0.061
IleThr: 3.542 ± 0.083
IleVal: 4.173 ± 0.069
IleTrp: 0.477 ± 0.024
IleTyr: 1.913 ± 0.04
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.184 ± 0.079
LysCys: 0.489 ± 0.024
LysAsp: 2.427 ± 0.062
LysGlu: 4.027 ± 0.08
LysPhe: 1.296 ± 0.044
LysGly: 3.491 ± 0.063
LysHis: 0.655 ± 0.028
LysIle: 3.056 ± 0.061
LysLys: 3.362 ± 0.085
LysLeu: 4.442 ± 0.08
LysMet: 1.424 ± 0.042
LysAsn: 2.109 ± 0.05
LysPro: 1.788 ± 0.052
LysGln: 1.54 ± 0.046
LysArg: 2.872 ± 0.06
LysSer: 2.498 ± 0.056
LysThr: 2.659 ± 0.062
LysVal: 3.244 ± 0.079
LysTrp: 0.4 ± 0.021
LysTyr: 1.671 ± 0.047
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.797 ± 0.113
LeuCys: 2.695 ± 0.078
LeuAsp: 5.357 ± 0.086
LeuGlu: 6.215 ± 0.108
LeuPhe: 3.989 ± 0.086
LeuGly: 8.111 ± 0.13
LeuHis: 1.916 ± 0.051
LeuIle: 5.774 ± 0.11
LeuLys: 3.982 ± 0.069
LeuLeu: 11.499 ± 0.198
LeuMet: 2.488 ± 0.057
LeuAsn: 2.656 ± 0.057
LeuPro: 5.265 ± 0.078
LeuGln: 3.576 ± 0.077
LeuArg: 6.792 ± 0.106
LeuSer: 7.715 ± 0.11
LeuThr: 5.079 ± 0.081
LeuVal: 6.006 ± 0.094
LeuTrp: 0.931 ± 0.038
LeuTyr: 3.451 ± 0.065
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.411 ± 0.056
MetCys: 0.323 ± 0.022
MetAsp: 1.487 ± 0.045
MetGlu: 2.009 ± 0.054
MetPhe: 0.9 ± 0.041
MetGly: 2.149 ± 0.054
MetHis: 0.379 ± 0.022
MetIle: 1.664 ± 0.047
MetLys: 1.855 ± 0.052
MetLeu: 2.723 ± 0.064
MetMet: 0.728 ± 0.033
MetAsn: 1.009 ± 0.034
MetPro: 1.083 ± 0.037
MetGln: 0.879 ± 0.035
MetArg: 1.38 ± 0.038
MetSer: 1.612 ± 0.045
MetThr: 1.48 ± 0.043
MetVal: 1.864 ± 0.049
MetTrp: 0.169 ± 0.013
MetTyr: 0.61 ± 0.03
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.815 ± 0.068
AsnCys: 0.534 ± 0.028
AsnAsp: 1.562 ± 0.046
AsnGlu: 1.849 ± 0.048
AsnPhe: 1.276 ± 0.044
AsnGly: 2.56 ± 0.057
AsnHis: 0.589 ± 0.023
AsnIle: 2.218 ± 0.051
AsnLys: 1.332 ± 0.045
AsnLeu: 3.272 ± 0.06
AsnMet: 0.868 ± 0.032
AsnAsn: 1.072 ± 0.045
AsnPro: 1.929 ± 0.05
AsnGln: 0.902 ± 0.03
AsnArg: 1.742 ± 0.045
AsnSer: 1.673 ± 0.051
AsnThr: 1.83 ± 0.062
AsnVal: 2.177 ± 0.061
AsnTrp: 0.326 ± 0.022
AsnTyr: 1.19 ± 0.04
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.217 ± 0.082
ProCys: 0.608 ± 0.028
ProAsp: 2.73 ± 0.059
ProGlu: 3.318 ± 0.078
ProPhe: 1.874 ± 0.047
ProGly: 3.533 ± 0.062
ProHis: 0.752 ± 0.028
ProIle: 2.183 ± 0.049
ProLys: 1.957 ± 0.046
ProLeu: 3.893 ± 0.071
ProMet: 0.986 ± 0.033
ProAsn: 1.201 ± 0.037
ProPro: 1.684 ± 0.05
ProGln: 1.937 ± 0.042
ProArg: 1.817 ± 0.045
ProSer: 2.301 ± 0.06
ProThr: 1.857 ± 0.059
ProVal: 3.715 ± 0.066
ProTrp: 0.343 ± 0.021
ProTyr: 1.503 ± 0.046
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.105 ± 0.065
GlnCys: 0.391 ± 0.02
GlnAsp: 1.543 ± 0.037
GlnGlu: 2.634 ± 0.063
GlnPhe: 1.184 ± 0.036
GlnGly: 2.405 ± 0.056
GlnHis: 0.531 ± 0.025
GlnIle: 2.096 ± 0.051
GlnLys: 2.448 ± 0.066
GlnLeu: 3.203 ± 0.061
GlnMet: 1.119 ± 0.038
GlnAsn: 1.76 ± 0.045
GlnPro: 1.24 ± 0.038
GlnGln: 1.425 ± 0.044
GlnArg: 2.08 ± 0.055
GlnSer: 2.167 ± 0.053
GlnThr: 1.63 ± 0.045
GlnVal: 2.307 ± 0.053
GlnTrp: 0.289 ± 0.018
GlnTyr: 1.25 ± 0.038
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.049 ± 0.082
ArgCys: 0.956 ± 0.04
ArgAsp: 2.984 ± 0.064
ArgGlu: 5.269 ± 0.12
ArgPhe: 2.53 ± 0.057
ArgGly: 3.953 ± 0.083
ArgHis: 1.112 ± 0.036
ArgIle: 3.18 ± 0.067
ArgLys: 2.577 ± 0.065
ArgLeu: 6.287 ± 0.103
ArgMet: 1.7 ± 0.05
ArgAsn: 1.447 ± 0.041
ArgPro: 2.157 ± 0.05
ArgGln: 2.676 ± 0.061
ArgArg: 4.225 ± 0.104
ArgSer: 3.035 ± 0.063
ArgThr: 2.659 ± 0.058
ArgVal: 3.716 ± 0.065
ArgTrp: 0.586 ± 0.024
ArgTyr: 2.426 ± 0.061
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.489 ± 0.1
SerCys: 0.972 ± 0.041
SerAsp: 3.002 ± 0.09
SerGlu: 3.572 ± 0.065
SerPhe: 2.797 ± 0.06
SerGly: 6.02 ± 0.123
SerHis: 1.016 ± 0.031
SerIle: 3.161 ± 0.063
SerLys: 2.351 ± 0.056
SerLeu: 6.32 ± 0.103
SerMet: 1.537 ± 0.044
SerAsn: 1.61 ± 0.05
SerPro: 2.578 ± 0.053
SerGln: 2.039 ± 0.051
SerArg: 3.288 ± 0.059
SerSer: 4.06 ± 0.159
SerThr: 2.8 ± 0.072
SerVal: 4.511 ± 0.077
SerTrp: 0.562 ± 0.028
SerTyr: 2.288 ± 0.067
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.074 ± 0.107
ThrCys: 0.605 ± 0.027
ThrAsp: 2.576 ± 0.073
ThrGlu: 2.643 ± 0.06
ThrPhe: 2.015 ± 0.049
ThrGly: 4.802 ± 0.09
ThrHis: 0.792 ± 0.033
ThrIle: 3.17 ± 0.075
ThrLys: 1.903 ± 0.055
ThrLeu: 5.603 ± 0.081
ThrMet: 1.304 ± 0.039
ThrAsn: 1.388 ± 0.052
ThrPro: 2.7 ± 0.066
ThrGln: 1.356 ± 0.042
ThrArg: 2.372 ± 0.05
ThrSer: 2.635 ± 0.074
ThrThr: 2.578 ± 0.062
ThrVal: 5.007 ± 0.102
ThrTrp: 0.429 ± 0.025
ThrTyr: 1.596 ± 0.058
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.87 ± 0.092
ValCys: 1.479 ± 0.044
ValAsp: 3.702 ± 0.071
ValGlu: 4.248 ± 0.071
ValPhe: 2.882 ± 0.056
ValGly: 5.122 ± 0.079
ValHis: 1.164 ± 0.042
ValIle: 4.337 ± 0.081
ValLys: 3.216 ± 0.081
ValLeu: 7.557 ± 0.107
ValMet: 1.906 ± 0.056
ValAsn: 2.205 ± 0.06
ValPro: 3.007 ± 0.057
ValGln: 2.299 ± 0.059
ValArg: 4.2 ± 0.076
ValSer: 4.882 ± 0.078
ValThr: 4.038 ± 0.112
ValVal: 5.253 ± 0.086
ValTrp: 0.644 ± 0.026
ValTyr: 2.505 ± 0.059
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.681 ± 0.029
TrpCys: 0.178 ± 0.014
TrpAsp: 0.583 ± 0.029
TrpGlu: 0.671 ± 0.029
TrpPhe: 0.375 ± 0.019
TrpGly: 0.758 ± 0.032
TrpHis: 0.185 ± 0.014
TrpIle: 0.461 ± 0.026
TrpLys: 0.434 ± 0.023
TrpLeu: 0.952 ± 0.036
TrpMet: 0.295 ± 0.017
TrpAsn: 0.361 ± 0.019
TrpPro: 0.286 ± 0.018
TrpGln: 0.442 ± 0.026
TrpArg: 0.57 ± 0.025
TrpSer: 0.525 ± 0.025
TrpThr: 0.459 ± 0.036
TrpVal: 0.53 ± 0.026
TrpTrp: 0.118 ± 0.011
TrpTyr: 0.345 ± 0.021
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.905 ± 0.055
TyrCys: 0.614 ± 0.026
TyrAsp: 2.217 ± 0.049
TyrGlu: 2.451 ± 0.057
TyrPhe: 1.57 ± 0.044
TyrGly: 2.919 ± 0.055
TyrHis: 0.675 ± 0.031
TyrIle: 2.031 ± 0.048
TyrLys: 1.482 ± 0.06
TyrLeu: 3.632 ± 0.067
TyrMet: 0.72 ± 0.029
TyrAsn: 1.392 ± 0.04
TyrPro: 1.61 ± 0.049
TyrGln: 1.398 ± 0.044
TyrArg: 2.362 ± 0.056
TyrSer: 2.16 ± 0.06
TyrThr: 2.092 ± 0.074
TyrVal: 2.091 ± 0.053
TyrTrp: 0.358 ± 0.02
TyrTyr: 1.409 ± 0.042
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.001 ± 0.001
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.001 ± 0.001
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.005 ± 0.002
Statistics based on 2808 proteins (875103 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
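One way to read "bootstrapping at the protein level" is to resample whole proteins with replacement, recompute the frequency for each resampled proteome, and take the spread of the 100 estimates as the error. The sketch below follows that reading. The function names, the fixed seed, and the use of the sample standard deviation as the reported error are assumptions, not a description of the database's actual code.

```python
import random
from collections import Counter
from statistics import stdev


def dipeptide_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Estimate the error (in per mille) of one dipeptide frequency.

    Whole proteins are resampled with replacement, so pairs from the
    same protein always stay together.
    """
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the reported error is the standard deviation of the replicates.
    return stdev(estimates)


# Toy sequences in one-letter code (not real proteome data).
toy_proteins = ["MAAL", "AAG", "GAAAV", "MKL"]
toy_error = bootstrap_error(toy_proteins, "AA")
```

Resampling proteins rather than individual pairs keeps the dependence between neighbouring residues of the same protein intact, which is presumably why the error is estimated at that level.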

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski