Amino acid dipeptide frequency for Hymenobacter sp. PAMC 26628

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
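As a rough illustration of what such a calculation could look like, the sketch below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. The function name dipeptide_frequencies and the toy sequences are hypothetical, the sketch uses one-letter codes rather than the three-letter codes of the table, and the exact counting scheme behind the table is not stated on this page, so treat this as an assumption rather than the database's method.

```python
from collections import Counter

def dipeptide_frequencies(sequences):
    """Return dipeptide frequencies in per mille for a list of protein sequences.

    Assumption: overlapping adjacent pairs are counted within each protein
    (never across protein boundaries) and normalised by the total pair count.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (hypothetical sequences, one-letter amino acid codes)
toy = ["MAAL", "AAG"]
freqs = dipeptide_frequencies(toy)
```

With the toy input, five pairs are counted in total (MA, AA, AL from the first sequence and AA, AG from the second), so the pair AA accounts for two fifths of all observations.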

There are 400 possible dipeptides (441 if we consider Xaa as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each of the 400 standard dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and those which are underrepresented are marked in blue in the table.
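The uniform expectation can be written out as a short calculation. The sketch below derives the 2.5‰ baseline from the 400 standard dipeptides and labels a few values taken from the table; the function name classify is hypothetical, and the simple greater-than or less-than comparison is an assumption, since the exact rule used for the red and blue marking is not stated here.

```python
# Expected frequency if all 400 standard dipeptides were equally likely
N_STANDARD = 20
expected_permille = 1000.0 / (N_STANDARD ** 2)  # 2.5 per mille, i.e. 0.25%

def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"

# Values taken from the table (per mille)
examples = {"AlaAla": 18.052, "CysCys": 0.1, "AspThr": 2.388}
labels = {name: classify(value) for name, value in examples.items()}
```

Note that a uniform baseline ignores the single amino acid composition of the proteome, so a frequent residue such as Ala will tend to appear in overrepresented dipeptides under this comparison.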

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
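For example, the conversion can be sketched as follows, using the AlaAla value from the table. The helper name permille_to_fraction is hypothetical.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

ala_ala = permille_to_fraction(18.052)  # AlaAla value from the table
ala_ala_percent = ala_ala * 100         # the same value as a percentage
```

Here 18.052‰ corresponds to a fraction of about 0.018, or about 1.8% of all dipeptides.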

For more information, see the sequence space article on Wikipedia.

Dipeptide frequency ± error (‰). Rows give the first amino acid of the dipeptide, columns the second.

    | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 18.052 ± 0.193 | 0.841 ± 0.026 | 6.726 ± 0.076 | 6.056 ± 0.082 | 4.066 ± 0.064 | 10.997 ± 0.131 | 2.581 ± 0.049 | 3.896 ± 0.055 | 3.821 ± 0.071 | 14.109 ± 0.131 | 1.882 ± 0.034 | 3.567 ± 0.065 | 7.425 ± 0.118 | 6.112 ± 0.078 | 7.28 ± 0.092 | 5.287 ± 0.068 | 7.371 ± 0.094 | 8.813 ± 0.095 | 1.534 ± 0.035 | 3.625 ± 0.053 | 0.0 ± 0.0
Cys | 0.709 ± 0.024 | 0.1 ± 0.009 | 0.334 ± 0.018 | 0.311 ± 0.016 | 0.298 ± 0.015 | 0.646 ± 0.021 | 0.198 ± 0.012 | 0.27 ± 0.012 | 0.191 ± 0.012 | 0.741 ± 0.024 | 0.099 ± 0.01 | 0.228 ± 0.013 | 0.429 ± 0.023 | 0.343 ± 0.016 | 0.455 ± 0.02 | 0.37 ± 0.017 | 0.396 ± 0.016 | 0.466 ± 0.018 | 0.105 ± 0.008 | 0.272 ± 0.014 | 0.0 ± 0.0
Asp | 5.983 ± 0.08 | 0.331 ± 0.015 | 2.467 ± 0.054 | 2.825 ± 0.052 | 2.573 ± 0.045 | 4.171 ± 0.074 | 0.958 ± 0.026 | 2.006 ± 0.043 | 2.001 ± 0.046 | 5.225 ± 0.067 | 0.82 ± 0.027 | 1.841 ± 0.043 | 2.514 ± 0.044 | 1.987 ± 0.041 | 2.501 ± 0.049 | 2.279 ± 0.049 | 2.388 ± 0.04 | 3.917 ± 0.055 | 0.753 ± 0.023 | 2.198 ± 0.042 | 0.0 ± 0.0
Glu | 5.733 ± 0.075 | 0.267 ± 0.015 | 1.852 ± 0.032 | 2.446 ± 0.051 | 1.941 ± 0.044 | 3.115 ± 0.053 | 1.01 ± 0.03 | 2.381 ± 0.043 | 2.33 ± 0.049 | 5.662 ± 0.067 | 1.094 ± 0.027 | 1.727 ± 0.036 | 1.985 ± 0.043 | 2.382 ± 0.046 | 2.936 ± 0.051 | 1.864 ± 0.039 | 2.672 ± 0.05 | 3.617 ± 0.058 | 0.568 ± 0.021 | 1.476 ± 0.029 | 0.0 ± 0.0
Phe | 4.288 ± 0.059 | 0.341 ± 0.017 | 2.485 ± 0.041 | 2.004 ± 0.041 | 1.792 ± 0.044 | 3.538 ± 0.053 | 0.779 ± 0.024 | 1.613 ± 0.036 | 1.365 ± 0.035 | 3.729 ± 0.066 | 0.686 ± 0.023 | 1.581 ± 0.037 | 1.762 ± 0.033 | 1.476 ± 0.031 | 2.428 ± 0.046 | 2.479 ± 0.053 | 2.64 ± 0.038 | 2.969 ± 0.048 | 0.537 ± 0.024 | 1.502 ± 0.032 | 0.0 ± 0.0
Gly | 9.456 ± 0.092 | 0.69 ± 0.022 | 3.277 ± 0.055 | 3.394 ± 0.053 | 3.404 ± 0.048 | 6.774 ± 0.108 | 1.868 ± 0.04 | 3.501 ± 0.057 | 3.288 ± 0.054 | 9.109 ± 0.095 | 1.435 ± 0.036 | 2.639 ± 0.053 | 4.768 ± 0.068 | 4.197 ± 0.058 | 5.261 ± 0.072 | 4.159 ± 0.067 | 5.376 ± 0.089 | 5.498 ± 0.068 | 1.131 ± 0.03 | 3.084 ± 0.049 | 0.0 ± 0.0
His | 2.123 ± 0.04 | 0.203 ± 0.013 | 1.12 ± 0.03 | 1.081 ± 0.03 | 1.075 ± 0.03 | 1.683 ± 0.038 | 0.626 ± 0.027 | 0.817 ± 0.027 | 0.593 ± 0.021 | 2.545 ± 0.055 | 0.318 ± 0.016 | 0.669 ± 0.021 | 1.377 ± 0.038 | 0.933 ± 0.028 | 1.441 ± 0.036 | 0.93 ± 0.027 | 1.155 ± 0.028 | 1.345 ± 0.029 | 0.336 ± 0.016 | 0.951 ± 0.03 | 0.0 ± 0.0
Ile | 4.037 ± 0.055 | 0.347 ± 0.018 | 2.322 ± 0.044 | 2.148 ± 0.043 | 1.541 ± 0.035 | 3.331 ± 0.055 | 0.693 ± 0.024 | 1.916 ± 0.044 | 1.719 ± 0.043 | 3.299 ± 0.052 | 0.671 ± 0.022 | 1.573 ± 0.036 | 1.839 ± 0.04 | 1.468 ± 0.035 | 2.279 ± 0.041 | 2.353 ± 0.042 | 2.565 ± 0.052 | 2.708 ± 0.05 | 0.4 ± 0.02 | 1.224 ± 0.027 | 0.0 ± 0.0
Lys | 3.887 ± 0.079 | 0.187 ± 0.012 | 1.836 ± 0.049 | 1.905 ± 0.045 | 1.346 ± 0.036 | 2.632 ± 0.048 | 0.673 ± 0.025 | 1.86 ± 0.045 | 2.077 ± 0.062 | 3.743 ± 0.063 | 1.037 ± 0.029 | 1.548 ± 0.041 | 1.898 ± 0.043 | 1.659 ± 0.037 | 1.878 ± 0.038 | 1.818 ± 0.039 | 2.371 ± 0.051 | 2.467 ± 0.052 | 0.391 ± 0.015 | 1.321 ± 0.034 | 0.0 ± 0.0
Leu | 14.905 ± 0.153 | 0.739 ± 0.027 | 5.564 ± 0.066 | 4.571 ± 0.068 | 3.918 ± 0.064 | 9.248 ± 0.082 | 2.557 ± 0.049 | 3.598 ± 0.053 | 3.866 ± 0.068 | 13.403 ± 0.177 | 1.752 ± 0.039 | 3.891 ± 0.064 | 6.659 ± 0.077 | 3.809 ± 0.051 | 8.054 ± 0.097 | 5.609 ± 0.072 | 6.541 ± 0.067 | 8.148 ± 0.092 | 1.124 ± 0.033 | 2.973 ± 0.045 | 0.0 ± 0.0
Met | 1.972 ± 0.035 | 0.085 ± 0.008 | 0.744 ± 0.024 | 0.804 ± 0.025 | 0.526 ± 0.018 | 1.354 ± 0.034 | 0.358 ± 0.016 | 0.586 ± 0.02 | 0.943 ± 0.027 | 1.884 ± 0.037 | 0.334 ± 0.015 | 0.669 ± 0.026 | 1.121 ± 0.028 | 0.848 ± 0.024 | 1.103 ± 0.026 | 0.965 ± 0.025 | 1.015 ± 0.028 | 1.098 ± 0.035 | 0.139 ± 0.01 | 0.429 ± 0.02 | 0.0 ± 0.0
Asn | 3.544 ± 0.057 | 0.247 ± 0.013 | 1.695 ± 0.041 | 1.508 ± 0.037 | 1.62 ± 0.038 | 2.981 ± 0.059 | 0.664 ± 0.023 | 1.576 ± 0.038 | 1.26 ± 0.033 | 3.593 ± 0.059 | 0.593 ± 0.02 | 1.483 ± 0.047 | 2.386 ± 0.043 | 1.534 ± 0.033 | 2.019 ± 0.038 | 1.777 ± 0.044 | 2.042 ± 0.047 | 2.483 ± 0.05 | 0.482 ± 0.017 | 1.479 ± 0.035 | 0.0 ± 0.0
Pro | 9.352 ± 0.127 | 0.26 ± 0.014 | 3.431 ± 0.053 | 2.971 ± 0.052 | 1.903 ± 0.036 | 5.283 ± 0.074 | 1.075 ± 0.03 | 1.794 ± 0.034 | 1.724 ± 0.042 | 5.559 ± 0.076 | 0.777 ± 0.023 | 1.96 ± 0.042 | 2.567 ± 0.057 | 2.004 ± 0.037 | 2.88 ± 0.048 | 2.359 ± 0.038 | 3.456 ± 0.053 | 4.508 ± 0.057 | 0.627 ± 0.021 | 1.682 ± 0.035 | 0.0 ± 0.0
Gln | 5.637 ± 0.074 | 0.207 ± 0.013 | 1.763 ± 0.037 | 1.936 ± 0.037 | 1.691 ± 0.034 | 3.147 ± 0.051 | 1.193 ± 0.03 | 1.559 ± 0.035 | 1.567 ± 0.039 | 5.368 ± 0.078 | 0.828 ± 0.024 | 1.485 ± 0.034 | 2.78 ± 0.049 | 2.815 ± 0.063 | 3.267 ± 0.049 | 1.77 ± 0.036 | 2.357 ± 0.041 | 3.415 ± 0.054 | 0.549 ± 0.022 | 1.462 ± 0.036 | 0.0 ± 0.0
Arg | 6.992 ± 0.093 | 0.399 ± 0.017 | 2.779 ± 0.049 | 2.955 ± 0.048 | 2.578 ± 0.042 | 4.157 ± 0.064 | 1.589 ± 0.04 | 2.574 ± 0.047 | 1.93 ± 0.041 | 7.542 ± 0.093 | 1.16 ± 0.03 | 1.985 ± 0.041 | 3.728 ± 0.062 | 3.405 ± 0.045 | 4.834 ± 0.074 | 2.46 ± 0.04 | 3.538 ± 0.055 | 4.444 ± 0.057 | 0.911 ± 0.028 | 2.474 ± 0.041 | 0.0 ± 0.0
Ser | 5.272 ± 0.077 | 0.39 ± 0.018 | 2.165 ± 0.041 | 2.05 ± 0.038 | 2.356 ± 0.049 | 4.343 ± 0.074 | 0.917 ± 0.025 | 2.173 ± 0.045 | 1.731 ± 0.038 | 5.222 ± 0.069 | 0.831 ± 0.028 | 1.757 ± 0.041 | 2.717 ± 0.043 | 1.835 ± 0.038 | 2.702 ± 0.046 | 2.726 ± 0.051 | 3.038 ± 0.049 | 3.446 ± 0.053 | 0.626 ± 0.019 | 1.844 ± 0.046 | 0.0 ± 0.0
Thr | 7.432 ± 0.092 | 0.343 ± 0.017 | 3.265 ± 0.052 | 2.625 ± 0.047 | 2.375 ± 0.048 | 5.462 ± 0.066 | 1.115 ± 0.026 | 2.332 ± 0.046 | 2.066 ± 0.041 | 6.189 ± 0.072 | 0.82 ± 0.024 | 2.091 ± 0.041 | 3.704 ± 0.062 | 2.205 ± 0.04 | 2.967 ± 0.043 | 2.944 ± 0.051 | 3.806 ± 0.077 | 4.754 ± 0.063 | 0.733 ± 0.027 | 2.107 ± 0.05 | 0.0 ± 0.0
Val | 9.575 ± 0.097 | 0.561 ± 0.021 | 3.37 ± 0.046 | 3.416 ± 0.06 | 2.917 ± 0.052 | 5.758 ± 0.067 | 1.334 ± 0.032 | 2.618 ± 0.052 | 2.492 ± 0.051 | 8.581 ± 0.092 | 1.166 ± 0.028 | 2.492 ± 0.047 | 4.29 ± 0.053 | 3.145 ± 0.052 | 4.891 ± 0.063 | 3.73 ± 0.06 | 3.968 ± 0.07 | 6.121 ± 0.079 | 0.792 ± 0.026 | 2.121 ± 0.044 | 0.0 ± 0.0
Trp | 1.403 ± 0.038 | 0.104 ± 0.008 | 0.561 ± 0.018 | 0.525 ± 0.018 | 0.464 ± 0.021 | 0.939 ± 0.027 | 0.331 ± 0.014 | 0.33 ± 0.014 | 0.426 ± 0.02 | 1.696 ± 0.035 | 0.229 ± 0.013 | 0.452 ± 0.02 | 0.595 ± 0.024 | 0.831 ± 0.027 | 0.882 ± 0.029 | 0.525 ± 0.021 | 0.66 ± 0.021 | 0.836 ± 0.03 | 0.226 ± 0.015 | 0.403 ± 0.021 | 0.0 ± 0.0
Tyr | 3.691 ± 0.054 | 0.317 ± 0.014 | 1.978 ± 0.038 | 1.529 ± 0.033 | 1.607 ± 0.037 | 2.752 ± 0.055 | 0.803 ± 0.026 | 1.035 ± 0.027 | 1.141 ± 0.035 | 3.73 ± 0.052 | 0.444 ± 0.019 | 1.327 ± 0.041 | 1.631 ± 0.035 | 1.808 ± 0.045 | 2.377 ± 0.048 | 1.735 ± 0.035 | 1.981 ± 0.041 | 2.286 ± 0.044 | 0.456 ± 0.019 | 1.465 ± 0.036 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 4319 proteins (1435981 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
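A protein-level bootstrap of this kind could be sketched as below: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is reported. The function names, the toy sequences, the fixed seed, and the use of the standard deviation as the error measure are all assumptions made for illustration; the page does not state which statistic is reported.

```python
import random
import statistics
from collections import Counter

def pair_frequency(sequences, pair):
    """Frequency (per mille) of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(sequences, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement (protein-level bootstrap)
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)

toy = ["MAAL", "AAG", "MKAAV", "GLLA"]  # hypothetical sequences
point = pair_frequency(toy, "AA")
error = bootstrap_error(toy, "AA")
```

Resampling proteins rather than individual residues keeps each sequence intact, so pairs are never formed across protein boundaries in any replicate.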

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski