Amino acid dipepetide frequency for Anaerococcus lactolyticus ATCC 51172

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.76AlaAla: 3.76 ± 0.115
0.738AlaCys: 0.738 ± 0.034
3.81AlaAsp: 3.81 ± 0.07
3.67AlaGlu: 3.67 ± 0.082
2.838AlaPhe: 2.838 ± 0.076
4.376AlaGly: 4.376 ± 0.1
0.865AlaHis: 0.865 ± 0.043
6.026AlaIle: 6.026 ± 0.111
5.906AlaLys: 5.906 ± 0.102
5.876AlaLeu: 5.876 ± 0.121
1.929AlaMet: 1.929 ± 0.062
3.361AlaAsn: 3.361 ± 0.072
1.418AlaPro: 1.418 ± 0.051
1.435AlaGln: 1.435 ± 0.051
2.684AlaArg: 2.684 ± 0.066
3.984AlaSer: 3.984 ± 0.082
2.929AlaThr: 2.929 ± 0.082
3.592AlaVal: 3.592 ± 0.095
0.335AlaTrp: 0.335 ± 0.027
2.794AlaTyr: 2.794 ± 0.079
0.0AlaXaa: 0.0 ± 0.0
Cys
0.445CysAla: 0.445 ± 0.026
0.097CysCys: 0.097 ± 0.014
0.568CysAsp: 0.568 ± 0.032
0.505CysGlu: 0.505 ± 0.03
0.263CysPhe: 0.263 ± 0.019
0.778CysGly: 0.778 ± 0.037
0.231CysHis: 0.231 ± 0.023
0.578CysIle: 0.578 ± 0.031
0.631CysLys: 0.631 ± 0.032
0.734CysLeu: 0.734 ± 0.033
0.165CysMet: 0.165 ± 0.016
0.317CysAsn: 0.317 ± 0.024
0.332CysPro: 0.332 ± 0.026
0.212CysGln: 0.212 ± 0.017
0.238CysArg: 0.238 ± 0.019
0.431CysSer: 0.431 ± 0.023
0.283CysThr: 0.283 ± 0.02
0.42CysVal: 0.42 ± 0.027
0.058CysTrp: 0.058 ± 0.011
0.266CysTyr: 0.266 ± 0.02
0.0CysXaa: 0.0 ± 0.0
Asp
3.183AspAla: 3.183 ± 0.076
0.486AspCys: 0.486 ± 0.028
4.113AspAsp: 4.113 ± 0.101
5.652AspGlu: 5.652 ± 0.106
4.106AspPhe: 4.106 ± 0.087
4.187AspGly: 4.187 ± 0.104
0.94AspHis: 0.94 ± 0.042
5.938AspIle: 5.938 ± 0.097
7.139AspLys: 7.139 ± 0.145
7.216AspLeu: 7.216 ± 0.137
1.717AspMet: 1.717 ± 0.055
3.15AspAsn: 3.15 ± 0.085
1.935AspPro: 1.935 ± 0.067
1.538AspGln: 1.538 ± 0.049
2.489AspArg: 2.489 ± 0.069
3.398AspSer: 3.398 ± 0.069
2.671AspThr: 2.671 ± 0.064
3.534AspVal: 3.534 ± 0.071
0.435AspTrp: 0.435 ± 0.03
3.394AspTyr: 3.394 ± 0.077
0.0AspXaa: 0.0 ± 0.0
Glu
4.707GluAla: 4.707 ± 0.097
0.36GluCys: 0.36 ± 0.027
5.39GluAsp: 5.39 ± 0.096
7.113GluGlu: 7.113 ± 0.122
3.101GluPhe: 3.101 ± 0.069
4.173GluGly: 4.173 ± 0.081
0.652GluHis: 0.652 ± 0.032
7.69GluIle: 7.69 ± 0.108
8.715GluLys: 8.715 ± 0.149
5.827GluLeu: 5.827 ± 0.119
1.794GluMet: 1.794 ± 0.051
5.941GluAsn: 5.941 ± 0.104
1.328GluPro: 1.328 ± 0.06
1.455GluGln: 1.455 ± 0.048
2.717GluArg: 2.717 ± 0.075
3.726GluSer: 3.726 ± 0.076
2.921GluThr: 2.921 ± 0.069
4.689GluVal: 4.689 ± 0.085
0.389GluTrp: 0.389 ± 0.025
2.807GluTyr: 2.807 ± 0.07
0.0GluXaa: 0.0 ± 0.0
Phe
2.967PheAla: 2.967 ± 0.077
0.395PheCys: 0.395 ± 0.023
3.578PheAsp: 3.578 ± 0.081
3.221PheGlu: 3.221 ± 0.08
2.211PhePhe: 2.211 ± 0.064
2.824PheGly: 2.824 ± 0.074
0.549PheHis: 0.549 ± 0.033
4.406PheIle: 4.406 ± 0.115
3.829PheLys: 3.829 ± 0.077
4.293PheLeu: 4.293 ± 0.102
1.248PheMet: 1.248 ± 0.048
2.327PheAsn: 2.327 ± 0.063
1.211PhePro: 1.211 ± 0.038
0.838PheGln: 0.838 ± 0.034
1.557PheArg: 1.557 ± 0.056
3.166PheSer: 3.166 ± 0.068
2.358PheThr: 2.358 ± 0.066
2.894PheVal: 2.894 ± 0.061
0.302PheTrp: 0.302 ± 0.022
2.201PheTyr: 2.201 ± 0.058
0.0PheXaa: 0.0 ± 0.0
Gly
3.967GlyAla: 3.967 ± 0.108
0.505GlyCys: 0.505 ± 0.032
3.529GlyAsp: 3.529 ± 0.09
4.38GlyGlu: 4.38 ± 0.093
3.26GlyPhe: 3.26 ± 0.066
4.118GlyGly: 4.118 ± 0.105
1.018GlyHis: 1.018 ± 0.035
5.641GlyIle: 5.641 ± 0.118
6.026GlyLys: 6.026 ± 0.106
6.344GlyLeu: 6.344 ± 0.112
1.534GlyMet: 1.534 ± 0.056
2.874GlyAsn: 2.874 ± 0.079
1.297GlyPro: 1.297 ± 0.052
1.669GlyGln: 1.669 ± 0.053
2.449GlyArg: 2.449 ± 0.08
3.72GlySer: 3.72 ± 0.086
2.927GlyThr: 2.927 ± 0.083
4.163GlyVal: 4.163 ± 0.088
0.421GlyTrp: 0.421 ± 0.029
2.801GlyTyr: 2.801 ± 0.066
0.0GlyXaa: 0.0 ± 0.0
His
0.761HisAla: 0.761 ± 0.039
0.114HisCys: 0.114 ± 0.013
0.874HisAsp: 0.874 ± 0.042
0.829HisGlu: 0.829 ± 0.039
0.666HisPhe: 0.666 ± 0.035
0.885HisGly: 0.885 ± 0.038
0.291HisHis: 0.291 ± 0.029
1.169HisIle: 1.169 ± 0.046
0.986HisLys: 0.986 ± 0.04
1.234HisLeu: 1.234 ± 0.041
0.311HisMet: 0.311 ± 0.022
0.629HisAsn: 0.629 ± 0.032
0.574HisPro: 0.574 ± 0.029
0.391HisGln: 0.391 ± 0.024
0.58HisArg: 0.58 ± 0.032
0.8HisSer: 0.8 ± 0.035
0.681HisThr: 0.681 ± 0.033
0.674HisVal: 0.674 ± 0.028
0.094HisTrp: 0.094 ± 0.013
0.534HisTyr: 0.534 ± 0.03
0.0HisXaa: 0.0 ± 0.0
Ile
5.707IleAla: 5.707 ± 0.109
0.817IleCys: 0.817 ± 0.039
6.316IleAsp: 6.316 ± 0.114
6.696IleGlu: 6.696 ± 0.102
4.492IlePhe: 4.492 ± 0.099
5.769IleGly: 5.769 ± 0.119
1.066IleHis: 1.066 ± 0.041
7.948IleIle: 7.948 ± 0.142
8.501IleLys: 8.501 ± 0.115
8.935IleLeu: 8.935 ± 0.167
2.066IleMet: 2.066 ± 0.065
5.487IleAsn: 5.487 ± 0.103
2.804IlePro: 2.804 ± 0.066
1.815IleGln: 1.815 ± 0.055
3.384IleArg: 3.384 ± 0.07
6.73IleSer: 6.73 ± 0.121
4.057IleThr: 4.057 ± 0.081
4.903IleVal: 4.903 ± 0.102
0.454IleTrp: 0.454 ± 0.029
3.598IleTyr: 3.598 ± 0.079
0.0IleXaa: 0.0 ± 0.0
Lys
6.136LysAla: 6.136 ± 0.106
0.432LysCys: 0.432 ± 0.032
7.698LysAsp: 7.698 ± 0.128
9.087LysGlu: 9.087 ± 0.176
3.327LysPhe: 3.327 ± 0.065
4.729LysGly: 4.729 ± 0.095
0.991LysHis: 0.991 ± 0.038
8.831LysIle: 8.831 ± 0.153
9.342LysLys: 9.342 ± 0.166
8.024LysLeu: 8.024 ± 0.115
2.355LysMet: 2.355 ± 0.076
6.419LysAsn: 6.419 ± 0.109
2.054LysPro: 2.054 ± 0.085
2.024LysGln: 2.024 ± 0.064
3.167LysArg: 3.167 ± 0.071
5.439LysSer: 5.439 ± 0.101
4.801LysThr: 4.801 ± 0.101
5.341LysVal: 5.341 ± 0.124
0.532LysTrp: 0.532 ± 0.034
3.903LysTyr: 3.903 ± 0.077
0.002LysXaa: 0.002 ± 0.001
Leu
7.107LeuAla: 7.107 ± 0.126
0.694LeuCys: 0.694 ± 0.032
6.524LeuAsp: 6.524 ± 0.104
6.672LeuGlu: 6.672 ± 0.119
3.717LeuPhe: 3.717 ± 0.095
6.164LeuGly: 6.164 ± 0.115
0.977LeuHis: 0.977 ± 0.037
8.544LeuIle: 8.544 ± 0.171
8.327LeuLys: 8.327 ± 0.115
7.813LeuLeu: 7.813 ± 0.161
2.372LeuMet: 2.372 ± 0.065
5.072LeuAsn: 5.072 ± 0.089
2.724LeuPro: 2.724 ± 0.071
1.8LeuGln: 1.8 ± 0.059
3.567LeuArg: 3.567 ± 0.085
6.849LeuSer: 6.849 ± 0.132
4.66LeuThr: 4.66 ± 0.084
5.856LeuVal: 5.856 ± 0.097
0.52LeuTrp: 0.52 ± 0.028
3.24LeuTyr: 3.24 ± 0.081
0.0LeuXaa: 0.0 ± 0.0
Met
2.017MetAla: 2.017 ± 0.067
0.146MetCys: 0.146 ± 0.016
1.683MetAsp: 1.683 ± 0.054
1.744MetGlu: 1.744 ± 0.05
0.861MetPhe: 0.861 ± 0.04
1.741MetGly: 1.741 ± 0.056
0.269MetHis: 0.269 ± 0.022
2.234MetIle: 2.234 ± 0.058
2.471MetLys: 2.471 ± 0.061
2.075MetLeu: 2.075 ± 0.06
0.635MetMet: 0.635 ± 0.029
1.38MetAsn: 1.38 ± 0.047
0.821MetPro: 0.821 ± 0.04
0.605MetGln: 0.605 ± 0.029
0.863MetArg: 0.863 ± 0.033
1.364MetSer: 1.364 ± 0.044
1.5MetThr: 1.5 ± 0.049
1.78MetVal: 1.78 ± 0.054
0.146MetTrp: 0.146 ± 0.016
0.72MetTyr: 0.72 ± 0.039
0.0MetXaa: 0.0 ± 0.0
Asn
2.837AsnAla: 2.837 ± 0.064
0.378AsnCys: 0.378 ± 0.024
3.044AsnAsp: 3.044 ± 0.074
4.087AsnGlu: 4.087 ± 0.089
2.877AsnPhe: 2.877 ± 0.07
3.137AsnGly: 3.137 ± 0.089
0.817AsnHis: 0.817 ± 0.039
5.607AsnIle: 5.607 ± 0.106
5.829AsnLys: 5.829 ± 0.108
5.989AsnLeu: 5.989 ± 0.096
1.343AsnMet: 1.343 ± 0.046
3.107AsnAsn: 3.107 ± 0.097
2.434AsnPro: 2.434 ± 0.07
1.777AsnGln: 1.777 ± 0.066
2.006AsnArg: 2.006 ± 0.057
3.146AsnSer: 3.146 ± 0.074
2.586AsnThr: 2.586 ± 0.07
2.907AsnVal: 2.907 ± 0.081
0.355AsnTrp: 0.355 ± 0.023
2.269AsnTyr: 2.269 ± 0.066
0.0AsnXaa: 0.0 ± 0.0
Pro
1.797ProAla: 1.797 ± 0.054
0.2ProCys: 0.2 ± 0.019
1.738ProAsp: 1.738 ± 0.071
2.167ProGlu: 2.167 ± 0.077
1.303ProPhe: 1.303 ± 0.049
1.691ProGly: 1.691 ± 0.064
0.486ProHis: 0.486 ± 0.031
2.494ProIle: 2.494 ± 0.068
2.191ProLys: 2.191 ± 0.074
2.304ProLeu: 2.304 ± 0.063
0.638ProMet: 0.638 ± 0.032
1.466ProAsn: 1.466 ± 0.057
0.56ProPro: 0.56 ± 0.032
0.732ProGln: 0.732 ± 0.031
0.981ProArg: 0.981 ± 0.039
1.846ProSer: 1.846 ± 0.058
1.563ProThr: 1.563 ± 0.061
1.897ProVal: 1.897 ± 0.062
0.198ProTrp: 0.198 ± 0.02
1.249ProTyr: 1.249 ± 0.043
0.002ProXaa: 0.002 ± 0.002
Gln
1.921GlnAla: 1.921 ± 0.058
0.117GlnCys: 0.117 ± 0.014
1.404GlnAsp: 1.404 ± 0.048
1.772GlnGlu: 1.772 ± 0.065
0.794GlnPhe: 0.794 ± 0.04
1.523GlnGly: 1.523 ± 0.049
0.223GlnHis: 0.223 ± 0.017
2.374GlnIle: 2.374 ± 0.065
2.192GlnLys: 2.192 ± 0.066
1.769GlnLeu: 1.769 ± 0.054
0.698GlnMet: 0.698 ± 0.034
1.388GlnAsn: 1.388 ± 0.043
0.514GlnPro: 0.514 ± 0.036
0.538GlnGln: 0.538 ± 0.033
1.118GlnArg: 1.118 ± 0.051
1.232GlnSer: 1.232 ± 0.046
1.248GlnThr: 1.248 ± 0.045
1.611GlnVal: 1.611 ± 0.054
0.198GlnTrp: 0.198 ± 0.017
0.832GlnTyr: 0.832 ± 0.036
0.0GlnXaa: 0.0 ± 0.0
Arg
2.392ArgAla: 2.392 ± 0.067
0.269ArgCys: 0.269 ± 0.02
2.515ArgAsp: 2.515 ± 0.059
3.19ArgGlu: 3.19 ± 0.075
1.603ArgPhe: 1.603 ± 0.045
2.004ArgGly: 2.004 ± 0.054
0.475ArgHis: 0.475 ± 0.028
3.194ArgIle: 3.194 ± 0.076
3.523ArgLys: 3.523 ± 0.08
3.7ArgLeu: 3.7 ± 0.084
0.974ArgMet: 0.974 ± 0.043
2.1ArgAsn: 2.1 ± 0.056
1.201ArgPro: 1.201 ± 0.053
1.208ArgGln: 1.208 ± 0.045
1.652ArgArg: 1.652 ± 0.062
1.991ArgSer: 1.991 ± 0.058
1.606ArgThr: 1.606 ± 0.05
2.232ArgVal: 2.232 ± 0.063
0.243ArgTrp: 0.243 ± 0.02
1.488ArgTyr: 1.488 ± 0.055
0.0ArgXaa: 0.0 ± 0.0
Ser
3.241SerAla: 3.241 ± 0.073
0.515SerCys: 0.515 ± 0.031
3.707SerAsp: 3.707 ± 0.073
3.969SerGlu: 3.969 ± 0.098
3.457SerPhe: 3.457 ± 0.081
3.873SerGly: 3.873 ± 0.089
1.031SerHis: 1.031 ± 0.042
5.336SerIle: 5.336 ± 0.104
5.547SerLys: 5.547 ± 0.083
6.812SerLeu: 6.812 ± 0.133
1.554SerMet: 1.554 ± 0.051
2.967SerAsn: 2.967 ± 0.071
1.581SerPro: 1.581 ± 0.043
1.849SerGln: 1.849 ± 0.058
2.132SerArg: 2.132 ± 0.059
3.801SerSer: 3.801 ± 0.087
2.821SerThr: 2.821 ± 0.072
3.437SerVal: 3.437 ± 0.074
0.449SerTrp: 0.449 ± 0.024
3.227SerTyr: 3.227 ± 0.074
0.002SerXaa: 0.002 ± 0.001
Thr
2.86ThrAla: 2.86 ± 0.073
0.349ThrCys: 0.349 ± 0.025
3.124ThrAsp: 3.124 ± 0.083
2.915ThrGlu: 2.915 ± 0.069
2.158ThrPhe: 2.158 ± 0.069
3.524ThrGly: 3.524 ± 0.083
0.763ThrHis: 0.763 ± 0.035
4.38ThrIle: 4.38 ± 0.084
3.603ThrLys: 3.603 ± 0.081
4.081ThrLeu: 4.081 ± 0.079
1.111ThrMet: 1.111 ± 0.043
2.598ThrAsn: 2.598 ± 0.069
1.617ThrPro: 1.617 ± 0.067
1.028ThrGln: 1.028 ± 0.041
1.791ThrArg: 1.791 ± 0.046
3.137ThrSer: 3.137 ± 0.065
2.344ThrThr: 2.344 ± 0.066
3.064ThrVal: 3.064 ± 0.096
0.323ThrTrp: 0.323 ± 0.022
2.235ThrTyr: 2.235 ± 0.06
0.0ThrXaa: 0.0 ± 0.0
Val
3.9ValAla: 3.9 ± 0.103
0.545ValCys: 0.545 ± 0.032
4.226ValAsp: 4.226 ± 0.082
4.26ValGlu: 4.26 ± 0.094
2.99ValPhe: 2.99 ± 0.07
4.155ValGly: 4.155 ± 0.094
0.755ValHis: 0.755 ± 0.039
5.201ValIle: 5.201 ± 0.1
5.069ValLys: 5.069 ± 0.1
5.389ValLeu: 5.389 ± 0.106
1.495ValMet: 1.495 ± 0.048
3.327ValAsn: 3.327 ± 0.084
1.629ValPro: 1.629 ± 0.058
1.018ValGln: 1.018 ± 0.042
2.246ValArg: 2.246 ± 0.056
3.909ValSer: 3.909 ± 0.081
2.754ValThr: 2.754 ± 0.093
3.767ValVal: 3.767 ± 0.087
0.32ValTrp: 0.32 ± 0.024
2.297ValTyr: 2.297 ± 0.06
0.002ValXaa: 0.002 ± 0.002
Trp
0.382TrpAla: 0.382 ± 0.027
0.038TrpCys: 0.038 ± 0.008
0.461TrpAsp: 0.461 ± 0.027
0.397TrpGlu: 0.397 ± 0.027
0.242TrpPhe: 0.242 ± 0.023
0.421TrpGly: 0.421 ± 0.029
0.103TrpHis: 0.103 ± 0.015
0.58TrpIle: 0.58 ± 0.034
0.468TrpLys: 0.468 ± 0.03
0.572TrpLeu: 0.572 ± 0.036
0.191TrpMet: 0.191 ± 0.018
0.309TrpAsn: 0.309 ± 0.023
0.177TrpPro: 0.177 ± 0.017
0.234TrpGln: 0.234 ± 0.021
0.22TrpArg: 0.22 ± 0.017
0.348TrpSer: 0.348 ± 0.024
0.322TrpThr: 0.322 ± 0.024
0.355TrpVal: 0.355 ± 0.026
0.068TrpTrp: 0.068 ± 0.01
0.271TrpTyr: 0.271 ± 0.022
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.241TyrAla: 2.241 ± 0.059
0.355TyrCys: 0.355 ± 0.028
2.917TyrAsp: 2.917 ± 0.074
3.169TyrGlu: 3.169 ± 0.08
2.111TyrPhe: 2.111 ± 0.065
2.566TyrGly: 2.566 ± 0.071
0.572TyrHis: 0.572 ± 0.032
3.375TyrIle: 3.375 ± 0.076
4.344TyrLys: 4.344 ± 0.086
4.209TyrLeu: 4.209 ± 0.093
0.928TyrMet: 0.928 ± 0.04
2.311TyrAsn: 2.311 ± 0.064
1.292TyrPro: 1.292 ± 0.047
1.3TyrGln: 1.3 ± 0.05
1.717TyrArg: 1.717 ± 0.053
2.294TyrSer: 2.294 ± 0.06
1.972TyrThr: 1.972 ± 0.062
2.138TyrVal: 2.138 ± 0.07
0.315TyrTrp: 0.315 ± 0.022
1.96TyrTyr: 1.96 ± 0.067
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.002XaaGly: 0.002 ± 0.002
0.002XaaHis: 0.002 ± 0.001
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.002XaaLeu: 0.002 ± 0.001
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.002XaaThr: 0.002 ± 0.002
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2252 proteins (650064 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski