Amino acid dipeptide frequency for Saliterribacillus persicus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
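The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used to build the table: it assumes overlapping pairs are counted within each protein (never across protein boundaries), that frequencies are normalised by the total number of dipeptides, and that any letter outside the 20 standard one-letter codes is mapped to X (Xaa). The function names are hypothetical.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted inside each protein only,
    and non-standard letters are collapsed to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_per_mille, n_letters=20):
    """Compare a frequency with the uniform expectation (2.5 per mille for 400 pairs)."""
    expected = 1000.0 / n_letters ** 2
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"
```

With this baseline, a value such as LeuLeu (10.023‰) falls into the overrepresented group and CysCys (0.068‰) into the underrepresented one.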

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
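As a small worked example of the unit conversion, the helpers below turn a per mille value from the table into a plain fraction or a percentage. The function names are hypothetical and only serve the illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10.0
```

For instance, the AlaAla entry of 4.525‰ corresponds to a fraction of about 0.004525, or roughly 0.45%, which can be compared directly with the 0.25% expected for uniformly distributed dipeptides.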

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.525 ± 0.087
AlaCys: 0.57 ± 0.024
AlaAsp: 3.328 ± 0.062
AlaGlu: 4.509 ± 0.079
AlaPhe: 3.224 ± 0.062
AlaGly: 4.624 ± 0.072
AlaHis: 1.176 ± 0.036
AlaIle: 6.243 ± 0.098
AlaLys: 4.435 ± 0.065
AlaLeu: 6.809 ± 0.085
AlaMet: 1.939 ± 0.043
AlaAsn: 2.933 ± 0.054
AlaPro: 1.86 ± 0.045
AlaGln: 1.923 ± 0.049
AlaArg: 2.298 ± 0.05
AlaSer: 4.157 ± 0.064
AlaThr: 3.617 ± 0.068
AlaVal: 4.582 ± 0.078
AlaTrp: 0.676 ± 0.028
AlaTyr: 2.358 ± 0.048
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.37 ± 0.019
CysCys: 0.068 ± 0.008
CysAsp: 0.356 ± 0.016
CysGlu: 0.401 ± 0.019
CysPhe: 0.272 ± 0.016
CysGly: 0.551 ± 0.027
CysHis: 0.167 ± 0.013
CysIle: 0.453 ± 0.018
CysLys: 0.359 ± 0.017
CysLeu: 0.562 ± 0.024
CysMet: 0.155 ± 0.013
CysAsn: 0.319 ± 0.017
CysPro: 0.292 ± 0.018
CysGln: 0.213 ± 0.013
CysArg: 0.231 ± 0.014
CysSer: 0.411 ± 0.018
CysThr: 0.348 ± 0.018
CysVal: 0.375 ± 0.019
CysTrp: 0.06 ± 0.007
CysTyr: 0.242 ± 0.014
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.412 ± 0.068
AspCys: 0.31 ± 0.017
AspAsp: 2.804 ± 0.067
AspGlu: 4.695 ± 0.089
AspPhe: 2.67 ± 0.051
AspGly: 3.495 ± 0.074
AspHis: 1.222 ± 0.037
AspIle: 4.593 ± 0.069
AspLys: 3.652 ± 0.058
AspLeu: 5.144 ± 0.071
AspMet: 1.408 ± 0.04
AspAsn: 2.452 ± 0.052
AspPro: 2.132 ± 0.05
AspGln: 2.421 ± 0.052
AspArg: 2.284 ± 0.047
AspSer: 2.858 ± 0.059
AspThr: 2.59 ± 0.058
AspVal: 3.778 ± 0.066
AspTrp: 0.739 ± 0.032
AspTyr: 2.362 ± 0.043
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.791 ± 0.075
GluCys: 0.295 ± 0.016
GluAsp: 4.444 ± 0.068
GluGlu: 8.123 ± 0.134
GluPhe: 2.498 ± 0.046
GluGly: 4.187 ± 0.076
GluHis: 1.452 ± 0.041
GluIle: 6.238 ± 0.091
GluLys: 7.174 ± 0.109
GluLeu: 7.107 ± 0.092
GluMet: 2.375 ± 0.058
GluAsn: 4.641 ± 0.077
GluPro: 2.037 ± 0.06
GluGln: 3.149 ± 0.067
GluArg: 3.055 ± 0.057
GluSer: 3.852 ± 0.065
GluThr: 4.021 ± 0.064
GluVal: 5.437 ± 0.074
GluTrp: 0.91 ± 0.03
GluTyr: 2.232 ± 0.05
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.766 ± 0.061
PheCys: 0.316 ± 0.018
PheAsp: 2.676 ± 0.055
PheGlu: 2.985 ± 0.053
PhePhe: 2.6 ± 0.068
PheGly: 3.07 ± 0.069
PheHis: 1.038 ± 0.034
PheIle: 4.333 ± 0.078
PheLys: 2.667 ± 0.045
PheLeu: 4.837 ± 0.098
PheMet: 1.214 ± 0.041
PheAsn: 2.043 ± 0.044
PhePro: 1.655 ± 0.044
PheGln: 1.677 ± 0.043
PheArg: 1.573 ± 0.041
PheSer: 3.4 ± 0.057
PheThr: 2.583 ± 0.051
PheVal: 3.056 ± 0.055
PheTrp: 0.523 ± 0.022
PheTyr: 1.89 ± 0.043
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.388 ± 0.063
GlyCys: 0.536 ± 0.026
GlyAsp: 3.278 ± 0.069
GlyGlu: 4.513 ± 0.077
GlyPhe: 3.327 ± 0.058
GlyGly: 4.231 ± 0.078
GlyHis: 1.246 ± 0.041
GlyIle: 5.73 ± 0.089
GlyLys: 4.547 ± 0.069
GlyLeu: 6.172 ± 0.095
GlyMet: 2.013 ± 0.046
GlyAsn: 2.644 ± 0.06
GlyPro: 1.604 ± 0.046
GlyGln: 1.891 ± 0.046
GlyArg: 2.109 ± 0.053
GlySer: 3.741 ± 0.062
GlyThr: 3.57 ± 0.069
GlyVal: 4.59 ± 0.076
GlyTrp: 0.766 ± 0.023
GlyTyr: 2.774 ± 0.051
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.421 ± 0.035
HisCys: 0.176 ± 0.013
HisAsp: 1.057 ± 0.034
HisGlu: 1.411 ± 0.036
HisPhe: 1.098 ± 0.036
HisGly: 1.288 ± 0.034
HisHis: 0.683 ± 0.03
HisIle: 1.528 ± 0.037
HisLys: 1.01 ± 0.036
HisLeu: 2.137 ± 0.053
HisMet: 0.519 ± 0.022
HisAsn: 0.823 ± 0.028
HisPro: 1.086 ± 0.036
HisGln: 0.899 ± 0.028
HisArg: 0.867 ± 0.028
HisSer: 1.105 ± 0.036
HisThr: 1.029 ± 0.03
HisVal: 1.394 ± 0.037
HisTrp: 0.202 ± 0.013
HisTyr: 0.924 ± 0.029
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.109 ± 0.087
IleCys: 0.614 ± 0.023
IleAsp: 5.079 ± 0.06
IleGlu: 6.314 ± 0.083
IlePhe: 3.994 ± 0.075
IleGly: 6.134 ± 0.1
IleHis: 1.652 ± 0.045
IleIle: 6.814 ± 0.103
IleLys: 5.057 ± 0.078
IleLeu: 7.809 ± 0.121
IleMet: 1.82 ± 0.05
IleAsn: 3.941 ± 0.066
IlePro: 3.344 ± 0.055
IleGln: 2.885 ± 0.055
IleArg: 2.972 ± 0.058
IleSer: 5.435 ± 0.085
IleThr: 4.719 ± 0.068
IleVal: 5.447 ± 0.077
IleTrp: 0.734 ± 0.026
IleTyr: 2.758 ± 0.055
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.409 ± 0.075
LysCys: 0.308 ± 0.015
LysAsp: 4.295 ± 0.063
LysGlu: 7.431 ± 0.105
LysPhe: 1.942 ± 0.038
LysGly: 4.001 ± 0.067
LysHis: 1.467 ± 0.032
LysIle: 5.249 ± 0.073
LysLys: 6.376 ± 0.099
LysLeu: 5.843 ± 0.087
LysMet: 2.111 ± 0.044
LysAsn: 3.799 ± 0.063
LysPro: 1.986 ± 0.043
LysGln: 3.072 ± 0.061
LysArg: 3.17 ± 0.057
LysSer: 3.719 ± 0.059
LysThr: 3.408 ± 0.067
LysVal: 4.469 ± 0.075
LysTrp: 0.85 ± 0.028
LysTyr: 2.329 ± 0.056
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.804 ± 0.082
LeuCys: 0.543 ± 0.024
LeuAsp: 5.243 ± 0.074
LeuGlu: 7.217 ± 0.088
LeuPhe: 5.071 ± 0.095
LeuGly: 5.827 ± 0.088
LeuHis: 1.935 ± 0.048
LeuIle: 7.899 ± 0.121
LeuLys: 6.693 ± 0.079
LeuLeu: 10.023 ± 0.15
LeuMet: 2.439 ± 0.049
LeuAsn: 4.553 ± 0.072
LeuPro: 3.674 ± 0.067
LeuGln: 3.191 ± 0.06
LeuArg: 3.264 ± 0.065
LeuSer: 6.74 ± 0.098
LeuThr: 5.584 ± 0.078
LeuVal: 5.954 ± 0.077
LeuTrp: 0.863 ± 0.028
LeuTyr: 3.304 ± 0.062
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.798 ± 0.037
MetCys: 0.111 ± 0.01
MetAsp: 1.526 ± 0.04
MetGlu: 2.179 ± 0.045
MetPhe: 1.056 ± 0.034
MetGly: 1.561 ± 0.045
MetHis: 0.517 ± 0.023
MetIle: 2.271 ± 0.051
MetLys: 2.34 ± 0.051
MetLeu: 2.552 ± 0.052
MetMet: 0.842 ± 0.027
MetAsn: 1.476 ± 0.036
MetPro: 0.914 ± 0.036
MetGln: 1.043 ± 0.035
MetArg: 1.075 ± 0.033
MetSer: 1.525 ± 0.038
MetThr: 1.555 ± 0.031
MetVal: 1.805 ± 0.038
MetTrp: 0.213 ± 0.016
MetTyr: 0.805 ± 0.033
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.78 ± 0.056
AsnCys: 0.287 ± 0.016
AsnAsp: 2.692 ± 0.054
AsnGlu: 4.11 ± 0.071
AsnPhe: 2.003 ± 0.044
AsnGly: 3.295 ± 0.065
AsnHis: 1.187 ± 0.031
AsnIle: 3.957 ± 0.064
AsnLys: 3.474 ± 0.06
AsnLeu: 4.211 ± 0.063
AsnMet: 1.254 ± 0.034
AsnAsn: 2.628 ± 0.056
AsnPro: 2.072 ± 0.046
AsnGln: 2.345 ± 0.05
AsnArg: 2.105 ± 0.043
AsnSer: 2.476 ± 0.056
AsnThr: 2.486 ± 0.052
AsnVal: 3.0 ± 0.053
AsnTrp: 0.678 ± 0.026
AsnTyr: 1.94 ± 0.039
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.08 ± 0.045
ProCys: 0.202 ± 0.014
ProAsp: 1.922 ± 0.049
ProGlu: 2.809 ± 0.063
ProPhe: 1.848 ± 0.037
ProGly: 1.974 ± 0.046
ProHis: 0.818 ± 0.028
ProIle: 2.878 ± 0.049
ProLys: 2.08 ± 0.045
ProLeu: 3.279 ± 0.056
ProMet: 0.806 ± 0.025
ProAsn: 1.817 ± 0.042
ProPro: 0.863 ± 0.031
ProGln: 0.975 ± 0.031
ProArg: 0.969 ± 0.03
ProSer: 2.189 ± 0.056
ProThr: 2.047 ± 0.044
ProVal: 2.568 ± 0.043
ProTrp: 0.393 ± 0.02
ProTyr: 1.406 ± 0.041
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.755 ± 0.058
GlnCys: 0.146 ± 0.013
GlnAsp: 1.831 ± 0.034
GlnGlu: 3.011 ± 0.064
GlnPhe: 1.623 ± 0.037
GlnGly: 2.032 ± 0.041
GlnHis: 0.689 ± 0.024
GlnIle: 2.786 ± 0.056
GlnLys: 2.706 ± 0.048
GlnLeu: 3.99 ± 0.068
GlnMet: 1.025 ± 0.032
GlnAsn: 1.732 ± 0.038
GlnPro: 1.133 ± 0.038
GlnGln: 1.632 ± 0.042
GlnArg: 1.26 ± 0.037
GlnSer: 2.077 ± 0.047
GlnThr: 1.998 ± 0.049
GlnVal: 2.42 ± 0.042
GlnTrp: 0.387 ± 0.021
GlnTyr: 1.301 ± 0.033
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.107 ± 0.047
ArgCys: 0.208 ± 0.015
ArgAsp: 1.946 ± 0.046
ArgGlu: 2.897 ± 0.052
ArgPhe: 1.784 ± 0.045
ArgGly: 2.01 ± 0.047
ArgHis: 0.658 ± 0.021
ArgIle: 3.047 ± 0.057
ArgLys: 3.069 ± 0.056
ArgLeu: 3.61 ± 0.072
ArgMet: 1.238 ± 0.036
ArgAsn: 1.968 ± 0.048
ArgPro: 1.145 ± 0.033
ArgGln: 1.318 ± 0.037
ArgArg: 1.513 ± 0.038
ArgSer: 1.975 ± 0.047
ArgThr: 1.835 ± 0.037
ArgVal: 2.466 ± 0.05
ArgTrp: 0.394 ± 0.019
ArgTyr: 1.566 ± 0.039
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.361 ± 0.054
SerCys: 0.373 ± 0.021
SerAsp: 3.111 ± 0.057
SerGlu: 4.206 ± 0.071
SerPhe: 3.441 ± 0.065
SerGly: 4.133 ± 0.075
SerHis: 1.152 ± 0.036
SerIle: 5.398 ± 0.07
SerLys: 3.906 ± 0.067
SerLeu: 5.967 ± 0.084
SerMet: 1.708 ± 0.043
SerAsn: 3.085 ± 0.057
SerPro: 1.879 ± 0.044
SerGln: 1.883 ± 0.041
SerArg: 2.038 ± 0.046
SerSer: 3.962 ± 0.073
SerThr: 3.262 ± 0.062
SerVal: 4.003 ± 0.064
SerTrp: 0.648 ± 0.025
SerTyr: 2.479 ± 0.052
SerXaa: 0.001 ± 0.001
Thr
ThrAla: 3.585 ± 0.062
ThrCys: 0.346 ± 0.019
ThrAsp: 2.947 ± 0.057
ThrGlu: 3.835 ± 0.067
ThrPhe: 2.798 ± 0.046
ThrGly: 3.794 ± 0.064
ThrHis: 1.082 ± 0.033
ThrIle: 4.78 ± 0.068
ThrLys: 3.304 ± 0.053
ThrLeu: 5.338 ± 0.074
ThrMet: 1.256 ± 0.037
ThrAsn: 2.729 ± 0.051
ThrPro: 2.18 ± 0.048
ThrGln: 1.559 ± 0.036
ThrArg: 1.648 ± 0.036
ThrSer: 3.339 ± 0.061
ThrThr: 3.061 ± 0.062
ThrVal: 3.794 ± 0.065
ThrTrp: 0.565 ± 0.025
ThrTyr: 2.1 ± 0.049
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.62 ± 0.08
ValCys: 0.462 ± 0.022
ValAsp: 3.763 ± 0.052
ValGlu: 5.098 ± 0.075
ValPhe: 3.074 ± 0.06
ValGly: 4.395 ± 0.069
ValHis: 1.307 ± 0.041
ValIle: 5.819 ± 0.073
ValLys: 4.359 ± 0.07
ValLeu: 6.48 ± 0.087
ValMet: 1.758 ± 0.037
ValAsn: 3.139 ± 0.058
ValPro: 2.271 ± 0.046
ValGln: 2.048 ± 0.044
ValArg: 2.281 ± 0.058
ValSer: 4.31 ± 0.062
ValThr: 3.92 ± 0.057
ValVal: 4.721 ± 0.08
ValTrp: 0.625 ± 0.027
ValTyr: 2.315 ± 0.047
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.556 ± 0.025
TrpCys: 0.063 ± 0.008
TrpAsp: 0.601 ± 0.027
TrpGlu: 0.743 ± 0.028
TrpPhe: 0.553 ± 0.024
TrpGly: 0.64 ± 0.026
TrpHis: 0.216 ± 0.013
TrpIle: 0.966 ± 0.031
TrpLys: 0.826 ± 0.027
TrpLeu: 1.159 ± 0.036
TrpMet: 0.367 ± 0.021
TrpAsn: 0.609 ± 0.025
TrpPro: 0.269 ± 0.019
TrpGln: 0.442 ± 0.023
TrpArg: 0.405 ± 0.021
TrpSer: 0.63 ± 0.029
TrpThr: 0.544 ± 0.024
TrpVal: 0.671 ± 0.026
TrpTrp: 0.174 ± 0.014
TrpTyr: 0.401 ± 0.02
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.149 ± 0.045
TyrCys: 0.269 ± 0.018
TyrAsp: 2.123 ± 0.047
TyrGlu: 2.533 ± 0.049
TyrPhe: 2.028 ± 0.047
TyrGly: 2.428 ± 0.05
TyrHis: 0.993 ± 0.033
TyrIle: 2.65 ± 0.047
TyrLys: 2.197 ± 0.042
TyrLeu: 3.793 ± 0.065
TyrMet: 0.916 ± 0.032
TyrAsn: 1.648 ± 0.044
TyrPro: 1.527 ± 0.04
TyrGln: 1.964 ± 0.047
TyrArg: 1.638 ± 0.038
TyrSer: 2.095 ± 0.053
TyrThr: 1.863 ± 0.045
TyrVal: 2.234 ± 0.045
TyrTrp: 0.439 ± 0.021
TyrTyr: 1.582 ± 0.042
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.001 ± 0.001
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.001 ± 0.001
Statistics based on 3585 proteins (1071052 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
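A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration, not the database's own code: it assumes that whole proteins are resampled with replacement, that the frequency is recomputed for every replicate, and that the reported ± value is the standard deviation across replicates (the note above does not say which statistic is used). The function names and the `seed` parameter are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides inside a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Each replicate draws len(proteins) proteins with replacement, so the
    resampling unit is the protein, not the individual residue.
    """
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)
```

Resampling whole proteins rather than single dipeptides keeps the correlation between neighbouring residues within a protein intact, which is presumably why the error is estimated at that level.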

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski