Amino acid dipepetide frequency for Weissella oryzae (strain DSM 25784 / JCM 18191 / SG25)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.896AlaAla: 8.896 ± 0.462
0.173AlaCys: 0.173 ± 0.018
5.973AlaAsp: 5.973 ± 0.114
5.601AlaGlu: 5.601 ± 0.11
3.494AlaPhe: 3.494 ± 0.09
6.764AlaGly: 6.764 ± 0.124
1.351AlaHis: 1.351 ± 0.05
6.34AlaIle: 6.34 ± 0.121
6.149AlaLys: 6.149 ± 0.112
8.903AlaLeu: 8.903 ± 0.145
2.527AlaMet: 2.527 ± 0.071
4.656AlaAsn: 4.656 ± 0.19
2.352AlaPro: 2.352 ± 0.081
4.028AlaGln: 4.028 ± 0.112
3.277AlaArg: 3.277 ± 0.089
5.652AlaSer: 5.652 ± 0.437
5.577AlaThr: 5.577 ± 0.197
6.174AlaVal: 6.174 ± 0.133
0.973AlaTrp: 0.973 ± 0.041
2.822AlaTyr: 2.822 ± 0.061
0.0AlaXaa: 0.0 ± 0.0
Cys
0.145CysAla: 0.145 ± 0.016
0.016CysCys: 0.016 ± 0.006
0.08CysAsp: 0.08 ± 0.01
0.082CysGlu: 0.082 ± 0.012
0.096CysPhe: 0.096 ± 0.012
0.158CysGly: 0.158 ± 0.019
0.033CysHis: 0.033 ± 0.009
0.088CysIle: 0.088 ± 0.014
0.096CysLys: 0.096 ± 0.013
0.183CysLeu: 0.183 ± 0.019
0.031CysMet: 0.031 ± 0.007
0.065CysAsn: 0.065 ± 0.009
0.095CysPro: 0.095 ± 0.014
0.072CysGln: 0.072 ± 0.012
0.075CysArg: 0.075 ± 0.012
0.114CysSer: 0.114 ± 0.014
0.13CysThr: 0.13 ± 0.015
0.099CysVal: 0.099 ± 0.013
0.018CysTrp: 0.018 ± 0.005
0.07CysTyr: 0.07 ± 0.011
0.0CysXaa: 0.0 ± 0.0
Asp
4.581AspAla: 4.581 ± 0.115
0.099AspCys: 0.099 ± 0.013
3.422AspAsp: 3.422 ± 0.107
4.296AspGlu: 4.296 ± 0.095
2.881AspPhe: 2.881 ± 0.07
3.849AspGly: 3.849 ± 0.088
0.884AspHis: 0.884 ± 0.038
3.655AspIle: 3.655 ± 0.079
3.489AspLys: 3.489 ± 0.107
5.967AspLeu: 5.967 ± 0.106
1.531AspMet: 1.531 ± 0.053
2.931AspAsn: 2.931 ± 0.077
1.878AspPro: 1.878 ± 0.06
2.349AspGln: 2.349 ± 0.06
2.147AspArg: 2.147 ± 0.058
3.472AspSer: 3.472 ± 0.123
3.034AspThr: 3.034 ± 0.07
4.154AspVal: 4.154 ± 0.105
0.815AspTrp: 0.815 ± 0.036
2.419AspTyr: 2.419 ± 0.065
0.0AspXaa: 0.0 ± 0.0
Glu
5.166GluAla: 5.166 ± 0.105
0.08GluCys: 0.08 ± 0.012
2.703GluAsp: 2.703 ± 0.088
3.277GluGlu: 3.277 ± 0.076
2.304GluPhe: 2.304 ± 0.063
2.683GluGly: 2.683 ± 0.075
1.293GluHis: 1.293 ± 0.04
4.343GluIle: 4.343 ± 0.105
3.751GluLys: 3.751 ± 0.088
6.513GluLeu: 6.513 ± 0.137
1.715GluMet: 1.715 ± 0.048
2.926GluAsn: 2.926 ± 0.062
1.694GluPro: 1.694 ± 0.059
3.352GluGln: 3.352 ± 0.086
3.047GluArg: 3.047 ± 0.087
3.393GluSer: 3.393 ± 0.194
3.0GluThr: 3.0 ± 0.084
4.038GluVal: 4.038 ± 0.102
0.54GluTrp: 0.54 ± 0.031
1.989GluTyr: 1.989 ± 0.074
0.0GluXaa: 0.0 ± 0.0
Phe
3.82PheAla: 3.82 ± 0.094
0.09PheCys: 0.09 ± 0.014
2.889PheAsp: 2.889 ± 0.07
2.473PheGlu: 2.473 ± 0.066
1.907PhePhe: 1.907 ± 0.06
3.528PheGly: 3.528 ± 0.094
0.577PheHis: 0.577 ± 0.025
3.19PheIle: 3.19 ± 0.081
2.69PheLys: 2.69 ± 0.061
3.593PheLeu: 3.593 ± 0.093
1.226PheMet: 1.226 ± 0.053
2.367PheAsn: 2.367 ± 0.066
1.312PhePro: 1.312 ± 0.05
1.242PheGln: 1.242 ± 0.046
1.229PheArg: 1.229 ± 0.059
2.863PheSer: 2.863 ± 0.081
2.794PheThr: 2.794 ± 0.069
2.912PheVal: 2.912 ± 0.07
0.522PheTrp: 0.522 ± 0.031
1.539PheTyr: 1.539 ± 0.055
0.0PheXaa: 0.0 ± 0.0
Gly
5.308GlyAla: 5.308 ± 0.11
0.108GlyCys: 0.108 ± 0.013
3.394GlyAsp: 3.394 ± 0.071
3.472GlyGlu: 3.472 ± 0.079
2.991GlyPhe: 2.991 ± 0.071
4.234GlyGly: 4.234 ± 0.115
1.278GlyHis: 1.278 ± 0.043
5.003GlyIle: 5.003 ± 0.099
4.126GlyLys: 4.126 ± 0.094
7.054GlyLeu: 7.054 ± 0.119
1.963GlyMet: 1.963 ± 0.059
3.115GlyAsn: 3.115 ± 0.095
1.425GlyPro: 1.425 ± 0.053
3.089GlyGln: 3.089 ± 0.085
2.892GlyArg: 2.892 ± 0.089
3.953GlySer: 3.953 ± 0.101
4.164GlyThr: 4.164 ± 0.109
4.852GlyVal: 4.852 ± 0.092
0.815GlyTrp: 0.815 ± 0.041
2.696GlyTyr: 2.696 ± 0.076
0.002GlyXaa: 0.002 ± 0.002
His
1.317HisAla: 1.317 ± 0.051
0.026HisCys: 0.026 ± 0.008
1.037HisAsp: 1.037 ± 0.046
1.048HisGlu: 1.048 ± 0.04
0.861HisPhe: 0.861 ± 0.034
1.272HisGly: 1.272 ± 0.05
0.37HisHis: 0.37 ± 0.028
1.148HisIle: 1.148 ± 0.039
0.807HisLys: 0.807 ± 0.036
1.749HisLeu: 1.749 ± 0.059
0.383HisMet: 0.383 ± 0.028
0.774HisAsn: 0.774 ± 0.038
0.748HisPro: 0.748 ± 0.04
0.807HisGln: 0.807 ± 0.035
0.719HisArg: 0.719 ± 0.036
0.915HisSer: 0.915 ± 0.044
0.936HisThr: 0.936 ± 0.047
1.06HisVal: 1.06 ± 0.042
0.236HisTrp: 0.236 ± 0.021
0.699HisTyr: 0.699 ± 0.039
0.0HisXaa: 0.0 ± 0.0
Ile
6.317IleAla: 6.317 ± 0.117
0.191IleCys: 0.191 ± 0.018
4.304IleAsp: 4.304 ± 0.086
4.441IleGlu: 4.441 ± 0.116
3.187IlePhe: 3.187 ± 0.089
4.824IleGly: 4.824 ± 0.102
1.035IleHis: 1.035 ± 0.036
5.21IleIle: 5.21 ± 0.107
4.348IleLys: 4.348 ± 0.104
6.58IleLeu: 6.58 ± 0.129
1.702IleMet: 1.702 ± 0.055
3.932IleAsn: 3.932 ± 0.094
2.432IlePro: 2.432 ± 0.068
2.457IleGln: 2.457 ± 0.064
2.455IleArg: 2.455 ± 0.084
4.491IleSer: 4.491 ± 0.092
4.578IleThr: 4.578 ± 0.086
4.821IleVal: 4.821 ± 0.085
0.719IleTrp: 0.719 ± 0.034
2.181IleTyr: 2.181 ± 0.068
0.0IleXaa: 0.0 ± 0.0
Lys
5.58LysAla: 5.58 ± 0.115
0.067LysCys: 0.067 ± 0.012
3.21LysAsp: 3.21 ± 0.088
3.397LysGlu: 3.397 ± 0.077
2.175LysPhe: 2.175 ± 0.068
3.166LysGly: 3.166 ± 0.074
1.109LysHis: 1.109 ± 0.048
4.203LysIle: 4.203 ± 0.082
4.097LysLys: 4.097 ± 0.11
5.734LysLeu: 5.734 ± 0.108
1.986LysMet: 1.986 ± 0.064
3.316LysAsn: 3.316 ± 0.093
2.018LysPro: 2.018 ± 0.066
3.35LysGln: 3.35 ± 0.086
2.884LysArg: 2.884 ± 0.076
3.394LysSer: 3.394 ± 0.091
3.772LysThr: 3.772 ± 0.089
4.248LysVal: 4.248 ± 0.091
0.613LysTrp: 0.613 ± 0.033
2.313LysTyr: 2.313 ± 0.069
0.0LysXaa: 0.0 ± 0.0
Leu
11.011LeuAla: 11.011 ± 0.22
0.155LeuCys: 0.155 ± 0.019
5.518LeuAsp: 5.518 ± 0.099
4.969LeuGlu: 4.969 ± 0.099
3.953LeuPhe: 3.953 ± 0.104
6.51LeuGly: 6.51 ± 0.132
1.511LeuHis: 1.511 ± 0.053
7.069LeuIle: 7.069 ± 0.139
5.901LeuLys: 5.901 ± 0.127
9.387LeuLeu: 9.387 ± 0.188
2.488LeuMet: 2.488 ± 0.082
5.039LeuAsn: 5.039 ± 0.095
4.149LeuPro: 4.149 ± 0.101
3.875LeuGln: 3.875 ± 0.086
3.679LeuArg: 3.679 ± 0.1
6.581LeuSer: 6.581 ± 0.133
6.933LeuThr: 6.933 ± 0.103
7.121LeuVal: 7.121 ± 0.126
0.858LeuTrp: 0.858 ± 0.04
2.55LeuTyr: 2.55 ± 0.065
0.0LeuXaa: 0.0 ± 0.0
Met
2.752MetAla: 2.752 ± 0.066
0.033MetCys: 0.033 ± 0.008
1.443MetAsp: 1.443 ± 0.044
1.172MetGlu: 1.172 ± 0.046
1.007MetPhe: 1.007 ± 0.043
1.578MetGly: 1.578 ± 0.06
0.388MetHis: 0.388 ± 0.024
1.906MetIle: 1.906 ± 0.07
1.508MetLys: 1.508 ± 0.048
2.605MetLeu: 2.605 ± 0.069
0.844MetMet: 0.844 ± 0.044
1.495MetAsn: 1.495 ± 0.049
1.022MetPro: 1.022 ± 0.04
1.371MetGln: 1.371 ± 0.049
1.125MetArg: 1.125 ± 0.04
1.663MetSer: 1.663 ± 0.059
1.945MetThr: 1.945 ± 0.052
1.883MetVal: 1.883 ± 0.054
0.217MetTrp: 0.217 ± 0.019
0.722MetTyr: 0.722 ± 0.036
0.0MetXaa: 0.0 ± 0.0
Asn
3.828AsnAla: 3.828 ± 0.131
0.104AsnCys: 0.104 ± 0.013
3.014AsnAsp: 3.014 ± 0.076
3.071AsnGlu: 3.071 ± 0.083
2.297AsnPhe: 2.297 ± 0.068
3.73AsnGly: 3.73 ± 0.094
1.061AsnHis: 1.061 ± 0.04
3.264AsnIle: 3.264 ± 0.079
2.928AsnLys: 2.928 ± 0.082
4.723AsnLeu: 4.723 ± 0.092
1.43AsnMet: 1.43 ± 0.056
3.053AsnAsn: 3.053 ± 0.086
2.136AsnPro: 2.136 ± 0.062
2.76AsnGln: 2.76 ± 0.076
2.036AsnArg: 2.036 ± 0.06
3.172AsnSer: 3.172 ± 0.141
2.827AsnThr: 2.827 ± 0.088
3.518AsnVal: 3.518 ± 0.088
0.68AsnTrp: 0.68 ± 0.035
2.038AsnTyr: 2.038 ± 0.059
0.0AsnXaa: 0.0 ± 0.0
Pro
3.272ProAla: 3.272 ± 0.078
0.042ProCys: 0.042 ± 0.009
2.051ProAsp: 2.051 ± 0.068
2.274ProGlu: 2.274 ± 0.076
1.58ProPhe: 1.58 ± 0.055
1.977ProGly: 1.977 ± 0.071
0.549ProHis: 0.549 ± 0.032
2.357ProIle: 2.357 ± 0.065
1.992ProLys: 1.992 ± 0.059
3.073ProLeu: 3.073 ± 0.078
0.799ProMet: 0.799 ± 0.043
1.717ProAsn: 1.717 ± 0.051
0.412ProPro: 0.412 ± 0.027
1.257ProGln: 1.257 ± 0.055
1.061ProArg: 1.061 ± 0.047
2.033ProSer: 2.033 ± 0.06
2.173ProThr: 2.173 ± 0.066
2.675ProVal: 2.675 ± 0.075
0.386ProTrp: 0.386 ± 0.028
1.125ProTyr: 1.125 ± 0.044
0.0ProXaa: 0.0 ± 0.0
Gln
4.99GlnAla: 4.99 ± 0.12
0.065GlnCys: 0.065 ± 0.009
1.969GlnAsp: 1.969 ± 0.058
2.199GlnGlu: 2.199 ± 0.074
1.607GlnPhe: 1.607 ± 0.054
2.511GlnGly: 2.511 ± 0.084
0.786GlnHis: 0.786 ± 0.041
3.182GlnIle: 3.182 ± 0.064
2.784GlnLys: 2.784 ± 0.074
4.749GlnLeu: 4.749 ± 0.111
1.161GlnMet: 1.161 ± 0.047
2.232GlnAsn: 2.232 ± 0.07
1.591GlnPro: 1.591 ± 0.052
2.287GlnGln: 2.287 ± 0.082
2.08GlnArg: 2.08 ± 0.066
2.546GlnSer: 2.546 ± 0.077
2.776GlnThr: 2.776 ± 0.073
3.399GlnVal: 3.399 ± 0.078
0.432GlnTrp: 0.432 ± 0.027
1.526GlnTyr: 1.526 ± 0.058
0.0GlnXaa: 0.0 ± 0.0
Arg
3.114ArgAla: 3.114 ± 0.082
0.077ArgCys: 0.077 ± 0.011
2.171ArgAsp: 2.171 ± 0.066
2.515ArgGlu: 2.515 ± 0.076
1.889ArgPhe: 1.889 ± 0.055
2.343ArgGly: 2.343 ± 0.07
0.796ArgHis: 0.796 ± 0.036
2.577ArgIle: 2.577 ± 0.084
2.341ArgLys: 2.341 ± 0.074
4.371ArgLeu: 4.371 ± 0.108
1.113ArgMet: 1.113 ± 0.047
1.847ArgAsn: 1.847 ± 0.069
1.351ArgPro: 1.351 ± 0.057
2.14ArgGln: 2.14 ± 0.068
2.085ArgArg: 2.085 ± 0.085
2.155ArgSer: 2.155 ± 0.063
2.085ArgThr: 2.085 ± 0.063
2.753ArgVal: 2.753 ± 0.074
0.483ArgTrp: 0.483 ± 0.033
1.624ArgTyr: 1.624 ± 0.053
0.0ArgXaa: 0.0 ± 0.0
Ser
5.375SerAla: 5.375 ± 0.37
0.086SerCys: 0.086 ± 0.014
4.017SerAsp: 4.017 ± 0.134
3.58SerGlu: 3.58 ± 0.096
2.776SerPhe: 2.776 ± 0.072
4.47SerGly: 4.47 ± 0.104
0.954SerHis: 0.954 ± 0.041
3.937SerIle: 3.937 ± 0.1
3.702SerLys: 3.702 ± 0.092
6.322SerLeu: 6.322 ± 0.201
1.601SerMet: 1.601 ± 0.07
3.21SerAsn: 3.21 ± 0.101
1.746SerPro: 1.746 ± 0.062
2.669SerGln: 2.669 ± 0.074
2.232SerArg: 2.232 ± 0.059
4.134SerSer: 4.134 ± 0.231
3.771SerThr: 3.771 ± 0.133
3.97SerVal: 3.97 ± 0.106
0.711SerTrp: 0.711 ± 0.036
2.134SerTyr: 2.134 ± 0.078
0.002SerXaa: 0.002 ± 0.002
Thr
5.898ThrAla: 5.898 ± 0.196
0.096ThrCys: 0.096 ± 0.013
3.785ThrAsp: 3.785 ± 0.088
3.21ThrGlu: 3.21 ± 0.076
2.669ThrPhe: 2.669 ± 0.075
4.542ThrGly: 4.542 ± 0.097
0.893ThrHis: 0.893 ± 0.036
4.511ThrIle: 4.511 ± 0.093
3.565ThrLys: 3.565 ± 0.086
6.017ThrLeu: 6.017 ± 0.105
1.355ThrMet: 1.355 ± 0.056
3.401ThrAsn: 3.401 ± 0.104
2.429ThrPro: 2.429 ± 0.072
2.333ThrGln: 2.333 ± 0.06
1.963ThrArg: 1.963 ± 0.059
3.828ThrSer: 3.828 ± 0.162
4.121ThrThr: 4.121 ± 0.122
4.9ThrVal: 4.9 ± 0.11
0.712ThrTrp: 0.712 ± 0.035
1.925ThrTyr: 1.925 ± 0.063
0.0ThrXaa: 0.0 ± 0.0
Val
6.734ValAla: 6.734 ± 0.171
0.139ValCys: 0.139 ± 0.015
4.281ValAsp: 4.281 ± 0.1
4.376ValGlu: 4.376 ± 0.105
2.724ValPhe: 2.724 ± 0.064
4.806ValGly: 4.806 ± 0.112
1.073ValHis: 1.073 ± 0.042
5.318ValIle: 5.318 ± 0.1
4.26ValLys: 4.26 ± 0.089
6.609ValLeu: 6.609 ± 0.117
1.762ValMet: 1.762 ± 0.061
3.652ValAsn: 3.652 ± 0.081
2.498ValPro: 2.498 ± 0.062
2.475ValGln: 2.475 ± 0.058
2.636ValArg: 2.636 ± 0.073
4.407ValSer: 4.407 ± 0.118
4.944ValThr: 4.944 ± 0.127
5.152ValVal: 5.152 ± 0.114
0.673ValTrp: 0.673 ± 0.034
2.095ValTyr: 2.095 ± 0.054
0.0ValXaa: 0.0 ± 0.0
Trp
0.809TrpAla: 0.809 ± 0.034
0.013TrpCys: 0.013 ± 0.004
0.504TrpAsp: 0.504 ± 0.03
0.44TrpGlu: 0.44 ± 0.028
0.543TrpPhe: 0.543 ± 0.031
0.662TrpGly: 0.662 ± 0.033
0.302TrpHis: 0.302 ± 0.023
0.738TrpIle: 0.738 ± 0.039
0.377TrpLys: 0.377 ± 0.024
1.417TrpLeu: 1.417 ± 0.056
0.292TrpMet: 0.292 ± 0.024
0.481TrpAsn: 0.481 ± 0.037
0.321TrpPro: 0.321 ± 0.022
0.867TrpGln: 0.867 ± 0.038
0.606TrpArg: 0.606 ± 0.034
0.677TrpSer: 0.677 ± 0.034
0.628TrpThr: 0.628 ± 0.032
0.721TrpVal: 0.721 ± 0.037
0.152TrpTrp: 0.152 ± 0.017
0.37TrpTyr: 0.37 ± 0.027
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.77TyrAla: 2.77 ± 0.07
0.086TyrCys: 0.086 ± 0.011
2.087TyrAsp: 2.087 ± 0.073
1.911TyrGlu: 1.911 ± 0.061
1.821TyrPhe: 1.821 ± 0.059
2.33TyrGly: 2.33 ± 0.06
0.67TyrHis: 0.67 ± 0.034
2.165TyrIle: 2.165 ± 0.062
1.72TyrLys: 1.72 ± 0.059
3.71TyrLeu: 3.71 ± 0.086
0.771TyrMet: 0.771 ± 0.038
1.461TyrAsn: 1.461 ± 0.054
1.197TyrPro: 1.197 ± 0.05
2.093TyrGln: 2.093 ± 0.063
1.601TyrArg: 1.601 ± 0.056
1.955TyrSer: 1.955 ± 0.067
1.964TyrThr: 1.964 ± 0.071
2.163TyrVal: 2.163 ± 0.059
0.364TyrTrp: 0.364 ± 0.029
1.324TyrTyr: 1.324 ± 0.051
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.002XaaArg: 0.002 ± 0.002
0.0XaaSer: 0.0 ± 0.0
0.002XaaThr: 0.002 ± 0.002
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.104XaaXaa: 0.104 ± 0.079
Statistics based on 2217 proteins (613410 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski