Amino acid dipepetide frequency for methanotrophic endosymbiont of Bathymodiolus azoricus (Menez Gwen)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.237AlaAla: 7.237 ± 0.245
0.917AlaCys: 0.917 ± 0.066
4.594AlaAsp: 4.594 ± 0.13
5.965AlaGlu: 5.965 ± 0.161
3.196AlaPhe: 3.196 ± 0.119
5.807AlaGly: 5.807 ± 0.176
1.782AlaHis: 1.782 ± 0.068
6.303AlaIle: 6.303 ± 0.161
5.356AlaLys: 5.356 ± 0.136
8.993AlaLeu: 8.993 ± 0.224
2.479AlaMet: 2.479 ± 0.098
3.499AlaAsn: 3.499 ± 0.118
2.644AlaPro: 2.644 ± 0.117
3.768AlaGln: 3.768 ± 0.12
3.781AlaArg: 3.781 ± 0.121
4.396AlaSer: 4.396 ± 0.124
4.442AlaThr: 4.442 ± 0.135
5.38AlaVal: 5.38 ± 0.149
0.858AlaTrp: 0.858 ± 0.054
2.529AlaTyr: 2.529 ± 0.093
0.0AlaXaa: 0.0 ± 0.0
Cys
0.898CysAla: 0.898 ± 0.058
0.161CysCys: 0.161 ± 0.022
0.562CysAsp: 0.562 ± 0.043
0.5CysGlu: 0.5 ± 0.04
0.431CysPhe: 0.431 ± 0.032
0.881CysGly: 0.881 ± 0.056
0.329CysHis: 0.329 ± 0.035
0.74CysIle: 0.74 ± 0.049
0.51CysLys: 0.51 ± 0.045
1.092CysLeu: 1.092 ± 0.058
0.276CysMet: 0.276 ± 0.027
0.355CysAsn: 0.355 ± 0.041
0.421CysPro: 0.421 ± 0.04
0.516CysGln: 0.516 ± 0.042
0.516CysArg: 0.516 ± 0.047
0.783CysSer: 0.783 ± 0.058
0.677CysThr: 0.677 ± 0.051
0.562CysVal: 0.562 ± 0.051
0.076CysTrp: 0.076 ± 0.015
0.355CysTyr: 0.355 ± 0.035
0.0CysXaa: 0.0 ± 0.0
Asp
4.571AspAla: 4.571 ± 0.13
0.612AspCys: 0.612 ± 0.053
2.782AspAsp: 2.782 ± 0.115
3.735AspGlu: 3.735 ± 0.109
2.802AspPhe: 2.802 ± 0.092
3.042AspGly: 3.042 ± 0.095
1.111AspHis: 1.111 ± 0.063
4.594AspIle: 4.594 ± 0.115
3.982AspLys: 3.982 ± 0.126
5.432AspLeu: 5.432 ± 0.147
1.305AspMet: 1.305 ± 0.065
2.433AspAsn: 2.433 ± 0.092
1.989AspPro: 1.989 ± 0.081
1.789AspGln: 1.789 ± 0.075
2.341AspArg: 2.341 ± 0.104
3.18AspSer: 3.18 ± 0.1
2.802AspThr: 2.802 ± 0.092
3.436AspVal: 3.436 ± 0.109
0.802AspTrp: 0.802 ± 0.048
2.114AspTyr: 2.114 ± 0.087
0.0AspXaa: 0.0 ± 0.0
Glu
5.126GluAla: 5.126 ± 0.16
0.543GluCys: 0.543 ± 0.037
2.989GluAsp: 2.989 ± 0.101
4.1GluGlu: 4.1 ± 0.127
2.502GluPhe: 2.502 ± 0.111
3.709GluGly: 3.709 ± 0.124
1.562GluHis: 1.562 ± 0.072
4.663GluIle: 4.663 ± 0.14
4.87GluLys: 4.87 ± 0.143
6.596GluLeu: 6.596 ± 0.155
1.792GluMet: 1.792 ± 0.078
3.245GluAsn: 3.245 ± 0.096
1.858GluPro: 1.858 ± 0.091
3.785GluGln: 3.785 ± 0.112
3.466GluArg: 3.466 ± 0.125
3.486GluSer: 3.486 ± 0.105
3.229GluThr: 3.229 ± 0.101
4.262GluVal: 4.262 ± 0.118
0.727GluTrp: 0.727 ± 0.048
1.832GluTyr: 1.832 ± 0.082
0.0GluXaa: 0.0 ± 0.0
Phe
3.074PheAla: 3.074 ± 0.106
0.582PheCys: 0.582 ± 0.045
2.502PheAsp: 2.502 ± 0.092
2.321PheGlu: 2.321 ± 0.088
1.947PhePhe: 1.947 ± 0.088
2.588PheGly: 2.588 ± 0.081
0.927PheHis: 0.927 ± 0.056
3.074PheIle: 3.074 ± 0.104
2.683PheLys: 2.683 ± 0.098
3.63PheLeu: 3.63 ± 0.136
1.115PheMet: 1.115 ± 0.065
2.058PheAsn: 2.058 ± 0.087
1.634PhePro: 1.634 ± 0.077
1.47PheGln: 1.47 ± 0.066
1.615PheArg: 1.615 ± 0.071
3.15PheSer: 3.15 ± 0.096
2.252PheThr: 2.252 ± 0.095
2.394PheVal: 2.394 ± 0.087
0.411PheTrp: 0.411 ± 0.039
1.457PheTyr: 1.457 ± 0.075
0.003PheXaa: 0.003 ± 0.003
Gly
4.975GlyAla: 4.975 ± 0.158
0.885GlyCys: 0.885 ± 0.051
3.505GlyAsp: 3.505 ± 0.116
3.939GlyGlu: 3.939 ± 0.122
3.209GlyPhe: 3.209 ± 0.115
4.436GlyGly: 4.436 ± 0.162
1.417GlyHis: 1.417 ± 0.067
4.83GlyIle: 4.83 ± 0.132
4.544GlyLys: 4.544 ± 0.108
6.336GlyLeu: 6.336 ± 0.156
1.966GlyMet: 1.966 ± 0.081
2.515GlyAsn: 2.515 ± 0.092
1.365GlyPro: 1.365 ± 0.077
2.608GlyGln: 2.608 ± 0.106
3.038GlyArg: 3.038 ± 0.091
3.801GlySer: 3.801 ± 0.11
3.259GlyThr: 3.259 ± 0.101
4.745GlyVal: 4.745 ± 0.146
0.799GlyTrp: 0.799 ± 0.055
2.269GlyTyr: 2.269 ± 0.092
0.0GlyXaa: 0.0 ± 0.0
His
1.664HisAla: 1.664 ± 0.074
0.293HisCys: 0.293 ± 0.031
1.207HisAsp: 1.207 ± 0.069
1.128HisGlu: 1.128 ± 0.063
1.25HisPhe: 1.25 ± 0.068
1.493HisGly: 1.493 ± 0.067
0.575HisHis: 0.575 ± 0.039
1.647HisIle: 1.647 ± 0.081
1.302HisLys: 1.302 ± 0.066
2.078HisLeu: 2.078 ± 0.092
0.582HisMet: 0.582 ± 0.041
1.095HisAsn: 1.095 ± 0.056
0.937HisPro: 0.937 ± 0.059
0.95HisGln: 0.95 ± 0.06
1.032HisArg: 1.032 ± 0.059
1.49HisSer: 1.49 ± 0.065
1.072HisThr: 1.072 ± 0.065
1.325HisVal: 1.325 ± 0.066
0.372HisTrp: 0.372 ± 0.034
0.812HisTyr: 0.812 ± 0.045
0.0HisXaa: 0.0 ± 0.0
Ile
6.665IleAla: 6.665 ± 0.149
0.76IleCys: 0.76 ± 0.053
4.686IleAsp: 4.686 ± 0.133
5.508IleGlu: 5.508 ± 0.142
2.525IlePhe: 2.525 ± 0.104
4.114IleGly: 4.114 ± 0.13
1.397IleHis: 1.397 ± 0.063
4.988IleIle: 4.988 ± 0.142
4.886IleLys: 4.886 ± 0.145
6.156IleLeu: 6.156 ± 0.143
1.568IleMet: 1.568 ± 0.081
3.726IleAsn: 3.726 ± 0.119
3.009IlePro: 3.009 ± 0.096
2.627IleGln: 2.627 ± 0.089
3.344IleArg: 3.344 ± 0.115
5.005IleSer: 5.005 ± 0.132
4.038IleThr: 4.038 ± 0.118
4.252IleVal: 4.252 ± 0.128
0.549IleTrp: 0.549 ± 0.057
2.124IleTyr: 2.124 ± 0.088
0.0IleXaa: 0.0 ± 0.0
Lys
5.429LysAla: 5.429 ± 0.151
0.441LysCys: 0.441 ± 0.037
3.367LysAsp: 3.367 ± 0.117
4.324LysGlu: 4.324 ± 0.138
1.897LysPhe: 1.897 ± 0.088
4.021LysGly: 4.021 ± 0.117
1.532LysHis: 1.532 ± 0.072
4.988LysIle: 4.988 ± 0.14
4.876LysLys: 4.876 ± 0.158
5.597LysLeu: 5.597 ± 0.128
1.805LysMet: 1.805 ± 0.072
3.512LysAsn: 3.512 ± 0.126
2.502LysPro: 2.502 ± 0.104
3.584LysGln: 3.584 ± 0.114
3.199LysArg: 3.199 ± 0.106
3.887LysSer: 3.887 ± 0.133
3.758LysThr: 3.758 ± 0.122
4.153LysVal: 4.153 ± 0.138
0.631LysTrp: 0.631 ± 0.048
2.095LysTyr: 2.095 ± 0.08
0.0LysXaa: 0.0 ± 0.0
Leu
9.128LeuAla: 9.128 ± 0.182
1.029LeuCys: 1.029 ± 0.054
5.61LeuAsp: 5.61 ± 0.136
5.896LeuGlu: 5.896 ± 0.152
3.906LeuPhe: 3.906 ± 0.126
6.205LeuGly: 6.205 ± 0.165
2.147LeuHis: 2.147 ± 0.093
6.567LeuIle: 6.567 ± 0.178
6.3LeuLys: 6.3 ± 0.173
9.743LeuLeu: 9.743 ± 0.247
2.368LeuMet: 2.368 ± 0.093
4.564LeuAsn: 4.564 ± 0.13
4.081LeuPro: 4.081 ± 0.129
3.936LeuGln: 3.936 ± 0.121
4.521LeuArg: 4.521 ± 0.136
7.412LeuSer: 7.412 ± 0.193
5.429LeuThr: 5.429 ± 0.143
6.116LeuVal: 6.116 ± 0.162
1.003LeuTrp: 1.003 ± 0.059
2.89LeuTyr: 2.89 ± 0.097
0.0LeuXaa: 0.0 ± 0.0
Met
2.631MetAla: 2.631 ± 0.09
0.217MetCys: 0.217 ± 0.029
1.522MetAsp: 1.522 ± 0.081
1.388MetGlu: 1.388 ± 0.064
0.868MetPhe: 0.868 ± 0.058
1.91MetGly: 1.91 ± 0.082
0.523MetHis: 0.523 ± 0.043
1.861MetIle: 1.861 ± 0.076
1.44MetLys: 1.44 ± 0.079
2.548MetLeu: 2.548 ± 0.086
0.753MetMet: 0.753 ± 0.053
1.273MetAsn: 1.273 ± 0.065
1.292MetPro: 1.292 ± 0.058
1.397MetGln: 1.397 ± 0.07
1.319MetArg: 1.319 ± 0.065
1.779MetSer: 1.779 ± 0.079
1.585MetThr: 1.585 ± 0.071
1.48MetVal: 1.48 ± 0.066
0.145MetTrp: 0.145 ± 0.025
0.52MetTyr: 0.52 ± 0.044
0.0MetXaa: 0.0 ± 0.0
Asn
3.469AsnAla: 3.469 ± 0.115
0.513AsnCys: 0.513 ± 0.048
2.361AsnAsp: 2.361 ± 0.091
2.726AsnGlu: 2.726 ± 0.096
1.634AsnPhe: 1.634 ± 0.08
2.736AsnGly: 2.736 ± 0.105
0.96AsnHis: 0.96 ± 0.06
3.716AsnIle: 3.716 ± 0.123
3.206AsnLys: 3.206 ± 0.105
4.176AsnLeu: 4.176 ± 0.105
1.345AsnMet: 1.345 ± 0.085
2.558AsnAsn: 2.558 ± 0.106
2.108AsnPro: 2.108 ± 0.094
1.92AsnGln: 1.92 ± 0.085
1.947AsnArg: 1.947 ± 0.1
2.752AsnSer: 2.752 ± 0.106
2.548AsnThr: 2.548 ± 0.095
2.492AsnVal: 2.492 ± 0.1
0.556AsnTrp: 0.556 ± 0.043
1.654AsnTyr: 1.654 ± 0.073
0.0AsnXaa: 0.0 ± 0.0
Pro
3.17ProAla: 3.17 ± 0.126
0.421ProCys: 0.421 ± 0.037
2.391ProAsp: 2.391 ± 0.087
3.298ProGlu: 3.298 ± 0.107
1.476ProPhe: 1.476 ± 0.066
2.157ProGly: 2.157 ± 0.089
0.829ProHis: 0.829 ± 0.054
2.44ProIle: 2.44 ± 0.095
2.266ProLys: 2.266 ± 0.098
3.555ProLeu: 3.555 ± 0.117
0.898ProMet: 0.898 ± 0.068
1.43ProAsn: 1.43 ± 0.071
1.351ProPro: 1.351 ± 0.108
1.299ProGln: 1.299 ± 0.066
1.312ProArg: 1.312 ± 0.07
2.197ProSer: 2.197 ± 0.08
1.828ProThr: 1.828 ± 0.079
2.828ProVal: 2.828 ± 0.109
0.411ProTrp: 0.411 ± 0.039
1.227ProTyr: 1.227 ± 0.074
0.0ProXaa: 0.0 ± 0.0
Gln
4.396GlnAla: 4.396 ± 0.123
0.385GlnCys: 0.385 ± 0.034
2.006GlnAsp: 2.006 ± 0.079
2.723GlnGlu: 2.723 ± 0.087
1.805GlnPhe: 1.805 ± 0.072
3.002GlnGly: 3.002 ± 0.101
1.095GlnHis: 1.095 ± 0.068
2.463GlnIle: 2.463 ± 0.092
2.848GlnLys: 2.848 ± 0.106
4.656GlnLeu: 4.656 ± 0.152
1.049GlnMet: 1.049 ± 0.06
1.631GlnAsn: 1.631 ± 0.072
1.315GlnPro: 1.315 ± 0.07
2.532GlnGln: 2.532 ± 0.113
2.16GlnArg: 2.16 ± 0.088
2.568GlnSer: 2.568 ± 0.099
1.993GlnThr: 1.993 ± 0.088
3.009GlnVal: 3.009 ± 0.104
0.552GlnTrp: 0.552 ± 0.044
1.378GlnTyr: 1.378 ± 0.062
0.0GlnXaa: 0.0 ± 0.0
Arg
3.387ArgAla: 3.387 ± 0.114
0.441ArgCys: 0.441 ± 0.045
2.394ArgAsp: 2.394 ± 0.098
2.92ArgGlu: 2.92 ± 0.095
2.2ArgPhe: 2.2 ± 0.097
2.805ArgGly: 2.805 ± 0.096
1.134ArgHis: 1.134 ± 0.065
3.364ArgIle: 3.364 ± 0.099
3.071ArgLys: 3.071 ± 0.109
4.965ArgLeu: 4.965 ± 0.148
1.44ArgMet: 1.44 ± 0.069
1.96ArgAsn: 1.96 ± 0.085
1.519ArgPro: 1.519 ± 0.079
1.999ArgGln: 1.999 ± 0.079
2.476ArgArg: 2.476 ± 0.115
2.604ArgSer: 2.604 ± 0.088
2.2ArgThr: 2.2 ± 0.08
3.167ArgVal: 3.167 ± 0.1
0.612ArgTrp: 0.612 ± 0.048
1.845ArgTyr: 1.845 ± 0.083
0.0ArgXaa: 0.0 ± 0.0
Ser
5.212SerAla: 5.212 ± 0.13
0.71SerCys: 0.71 ± 0.051
3.423SerAsp: 3.423 ± 0.116
3.903SerGlu: 3.903 ± 0.114
2.752SerPhe: 2.752 ± 0.104
4.945SerGly: 4.945 ± 0.131
1.427SerHis: 1.427 ± 0.074
4.472SerIle: 4.472 ± 0.147
3.551SerLys: 3.551 ± 0.12
6.336SerLeu: 6.336 ± 0.153
1.615SerMet: 1.615 ± 0.073
2.703SerAsn: 2.703 ± 0.104
2.22SerPro: 2.22 ± 0.083
2.285SerGln: 2.285 ± 0.091
2.805SerArg: 2.805 ± 0.09
4.334SerSer: 4.334 ± 0.13
3.15SerThr: 3.15 ± 0.095
4.258SerVal: 4.258 ± 0.118
0.819SerTrp: 0.819 ± 0.056
2.039SerTyr: 2.039 ± 0.097
0.0SerXaa: 0.0 ± 0.0
Thr
4.344ThrAla: 4.344 ± 0.127
0.47ThrCys: 0.47 ± 0.039
2.788ThrAsp: 2.788 ± 0.099
3.518ThrGlu: 3.518 ± 0.104
1.779ThrPhe: 1.779 ± 0.079
4.061ThrGly: 4.061 ± 0.133
1.22ThrHis: 1.22 ± 0.064
3.66ThrIle: 3.66 ± 0.112
3.084ThrLys: 3.084 ± 0.099
5.817ThrLeu: 5.817 ± 0.156
1.167ThrMet: 1.167 ± 0.069
2.101ThrAsn: 2.101 ± 0.089
2.525ThrPro: 2.525 ± 0.088
2.292ThrGln: 2.292 ± 0.084
2.407ThrArg: 2.407 ± 0.084
3.025ThrSer: 3.025 ± 0.106
2.759ThrThr: 2.759 ± 0.104
3.647ThrVal: 3.647 ± 0.115
0.608ThrTrp: 0.608 ± 0.049
1.506ThrTyr: 1.506 ± 0.067
0.0ThrXaa: 0.0 ± 0.0
Val
5.274ValAla: 5.274 ± 0.148
0.684ValCys: 0.684 ± 0.051
3.857ValAsp: 3.857 ± 0.123
4.127ValGlu: 4.127 ± 0.156
2.802ValPhe: 2.802 ± 0.108
3.742ValGly: 3.742 ± 0.128
1.282ValHis: 1.282 ± 0.069
4.807ValIle: 4.807 ± 0.128
4.156ValLys: 4.156 ± 0.132
6.267ValLeu: 6.267 ± 0.173
1.897ValMet: 1.897 ± 0.069
3.084ValAsn: 3.084 ± 0.103
2.384ValPro: 2.384 ± 0.097
2.308ValGln: 2.308 ± 0.093
2.923ValArg: 2.923 ± 0.104
4.354ValSer: 4.354 ± 0.114
3.663ValThr: 3.663 ± 0.115
4.6ValVal: 4.6 ± 0.157
0.618ValTrp: 0.618 ± 0.045
1.855ValTyr: 1.855 ± 0.083
0.0ValXaa: 0.0 ± 0.0
Trp
0.687TrpAla: 0.687 ± 0.056
0.109TrpCys: 0.109 ± 0.019
0.612TrpAsp: 0.612 ± 0.046
0.625TrpGlu: 0.625 ± 0.047
0.414TrpPhe: 0.414 ± 0.041
0.664TrpGly: 0.664 ± 0.048
0.375TrpHis: 0.375 ± 0.031
0.733TrpIle: 0.733 ± 0.044
0.612TrpLys: 0.612 ± 0.043
1.319TrpLeu: 1.319 ± 0.063
0.349TrpMet: 0.349 ± 0.035
0.395TrpAsn: 0.395 ± 0.036
0.388TrpPro: 0.388 ± 0.037
0.769TrpGln: 0.769 ± 0.059
0.582TrpArg: 0.582 ± 0.046
0.71TrpSer: 0.71 ± 0.055
0.434TrpThr: 0.434 ± 0.044
0.838TrpVal: 0.838 ± 0.056
0.135TrpTrp: 0.135 ± 0.022
0.345TrpTyr: 0.345 ± 0.039
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.591TyrAla: 2.591 ± 0.09
0.47TyrCys: 0.47 ± 0.043
1.697TyrAsp: 1.697 ± 0.079
1.697TyrGlu: 1.697 ± 0.082
1.493TyrPhe: 1.493 ± 0.078
2.124TyrGly: 2.124 ± 0.086
0.677TyrHis: 0.677 ± 0.049
2.012TyrIle: 2.012 ± 0.087
2.055TyrLys: 2.055 ± 0.077
3.558TyrLeu: 3.558 ± 0.117
0.677TyrMet: 0.677 ± 0.049
1.24TyrAsn: 1.24 ± 0.065
1.24TyrPro: 1.24 ± 0.068
1.73TyrGln: 1.73 ± 0.072
1.716TyrArg: 1.716 ± 0.07
2.049TyrSer: 2.049 ± 0.086
1.664TyrThr: 1.664 ± 0.065
1.743TyrVal: 1.743 ± 0.074
0.411TyrTrp: 0.411 ± 0.039
1.18TyrTyr: 1.18 ± 0.067
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.003XaaIle: 0.003 ± 0.003
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.003XaaXaa: 0.003 ± 0.003
Statistics based on 1523 proteins (304118 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski