Amino acid dipeptide frequency for Candidatus Gastranaerophilus sp. (ex Termes propinquus)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteome.
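The counting step can be sketched as follows. This is a minimal illustration, not the pipeline used by the database: it assumes one-letter sequences, overlapping adjacent pairs that never span two proteins, and normalization by the total number of dipeptides. The function name and the toy sequences are hypothetical.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs within one protein only.
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from this proteome).
toy = ["MKLA", "MKA"]
freqs = dipeptide_per_mille(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

In the toy input, the pair MK occurs in both sequences, so it takes up two of the five counted dipeptides.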

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 standard amino acids equally likely, each dipeptide would be present at a frequency of 0.25%. As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions. On this scale the random expectation of 0.25% corresponds to 2.5‰.
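The uniform expectation and the per mille scaling can be checked with a few lines of Python. This is only a sketch: the page does not state the exact threshold used for the red/blue marking, so a plain comparison against the uniform expectation is assumed here, and the function name is hypothetical. The two example values are taken from the table below.

```python
N_STANDARD = 20  # standard amino acids, Xaa excluded
EXPECTED_PER_MILLE = 1000.0 / N_STANDARD**2  # 1/400 expressed in per mille


def classify(observed_per_mille, expected=EXPECTED_PER_MILLE):
    """Return (fraction, observed/expected ratio, label) for one dipeptide."""
    fraction = observed_per_mille * 1e-3  # per mille -> plain fraction
    ratio = observed_per_mille / expected
    if ratio > 1:
        label = "over"
    elif ratio < 1:
        label = "under"
    else:
        label = "as expected"
    return fraction, ratio, label


# Values copied from the table: AlaLeu and CysTrp.
for name, value in [("AlaLeu", 8.895), ("CysTrp", 0.058)]:
    fraction, ratio, label = classify(value)
    print(f"{name}: fraction={fraction:.6f} ratio={ratio:.3f} {label}")
```

A comparison against the product of the two single amino acid frequencies would be a stricter baseline, but those frequencies are not part of this page.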

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± error), grouped by first residue.
Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.261 ± 0.221
AlaCys: 1.15 ± 0.079
AlaAsp: 3.874 ± 0.117
AlaGlu: 4.579 ± 0.133
AlaPhe: 3.174 ± 0.119
AlaGly: 5.258 ± 0.158
AlaHis: 1.596 ± 0.068
AlaIle: 5.474 ± 0.16
AlaLys: 6.75 ± 0.153
AlaLeu: 8.895 ± 0.195
AlaMet: 1.883 ± 0.085
AlaAsn: 3.623 ± 0.112
AlaPro: 2.71 ± 0.109
AlaGln: 3.849 ± 0.135
AlaArg: 3.67 ± 0.134
AlaSer: 4.816 ± 0.141
AlaThr: 4.148 ± 0.121
AlaVal: 5.33 ± 0.154
AlaTrp: 0.453 ± 0.041
AlaTyr: 2.534 ± 0.097
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.387 ± 0.073
CysCys: 0.248 ± 0.032
CysAsp: 0.83 ± 0.049
CysGlu: 1.182 ± 0.077
CysPhe: 0.536 ± 0.045
CysGly: 1.197 ± 0.075
CysHis: 0.212 ± 0.029
CysIle: 0.83 ± 0.067
CysLys: 0.902 ± 0.057
CysLeu: 1.06 ± 0.062
CysMet: 0.262 ± 0.031
CysAsn: 0.5 ± 0.039
CysPro: 0.636 ± 0.054
CysGln: 0.32 ± 0.036
CysArg: 0.442 ± 0.043
CysSer: 0.798 ± 0.052
CysThr: 0.733 ± 0.05
CysVal: 0.78 ± 0.059
CysTrp: 0.058 ± 0.012
CysTyr: 0.392 ± 0.04
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.126 ± 0.148
AspCys: 0.661 ± 0.048
AspAsp: 2.509 ± 0.094
AspGlu: 4.622 ± 0.146
AspPhe: 3.231 ± 0.128
AspGly: 3.321 ± 0.107
AspHis: 0.521 ± 0.042
AspIle: 5.057 ± 0.129
AspLys: 4.64 ± 0.127
AspLeu: 4.759 ± 0.137
AspMet: 1.384 ± 0.07
AspAsn: 2.16 ± 0.096
AspPro: 1.775 ± 0.094
AspGln: 0.758 ± 0.052
AspArg: 1.876 ± 0.076
AspSer: 3.059 ± 0.106
AspThr: 2.983 ± 0.117
AspVal: 4.047 ± 0.112
AspTrp: 0.363 ± 0.04
AspTyr: 2.124 ± 0.08
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.614 ± 0.153
GluCys: 0.859 ± 0.062
GluAsp: 3.688 ± 0.148
GluGlu: 5.326 ± 0.178
GluPhe: 3.414 ± 0.11
GluGly: 4.104 ± 0.126
GluHis: 1.599 ± 0.08
GluIle: 6.218 ± 0.166
GluLys: 7.023 ± 0.144
GluLeu: 7.544 ± 0.176
GluMet: 1.783 ± 0.077
GluAsn: 4.331 ± 0.135
GluPro: 2.095 ± 0.091
GluGln: 2.509 ± 0.11
GluArg: 3.317 ± 0.123
GluSer: 3.799 ± 0.12
GluThr: 3.36 ± 0.104
GluVal: 4.604 ± 0.129
GluTrp: 0.406 ± 0.039
GluTyr: 2.62 ± 0.102
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.58 ± 0.121
PheCys: 0.701 ± 0.053
PheAsp: 3.188 ± 0.119
PheGlu: 3.86 ± 0.118
PhePhe: 2.041 ± 0.099
PheGly: 2.965 ± 0.093
PheHis: 0.525 ± 0.038
PheIle: 2.724 ± 0.109
PheLys: 3.339 ± 0.115
PheLeu: 4.079 ± 0.151
PheMet: 1.017 ± 0.058
PheAsn: 2.138 ± 0.086
PhePro: 1.046 ± 0.063
PheGln: 1.182 ± 0.072
PheArg: 1.452 ± 0.085
PheSer: 3.274 ± 0.125
PheThr: 2.088 ± 0.104
PheVal: 3.012 ± 0.111
PheTrp: 0.367 ± 0.04
PheTyr: 1.466 ± 0.082
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.078 ± 0.155
GlyCys: 1.15 ± 0.07
GlyAsp: 3.285 ± 0.107
GlyGlu: 4.572 ± 0.129
GlyPhe: 3.084 ± 0.102
GlyGly: 4.608 ± 0.143
GlyHis: 1.229 ± 0.072
GlyIle: 4.5 ± 0.146
GlyLys: 5.165 ± 0.141
GlyLeu: 5.919 ± 0.144
GlyMet: 1.563 ± 0.086
GlyAsn: 2.484 ± 0.132
GlyPro: 1.125 ± 0.064
GlyGln: 2.085 ± 0.098
GlyArg: 2.685 ± 0.123
GlySer: 3.767 ± 0.127
GlyThr: 3.688 ± 0.115
GlyVal: 4.902 ± 0.144
GlyTrp: 0.496 ± 0.041
GlyTyr: 2.426 ± 0.089
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.258 ± 0.064
HisCys: 0.27 ± 0.029
HisAsp: 0.744 ± 0.059
HisGlu: 0.866 ± 0.064
HisPhe: 0.906 ± 0.053
HisGly: 1.154 ± 0.064
HisHis: 0.341 ± 0.042
HisIle: 1.459 ± 0.064
HisLys: 1.283 ± 0.063
HisLeu: 1.704 ± 0.087
HisMet: 0.32 ± 0.037
HisAsn: 0.726 ± 0.047
HisPro: 0.787 ± 0.058
HisGln: 0.453 ± 0.04
HisArg: 0.6 ± 0.045
HisSer: 1.132 ± 0.064
HisThr: 0.931 ± 0.054
HisVal: 0.787 ± 0.053
HisTrp: 0.162 ± 0.027
HisTyr: 0.636 ± 0.048
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.909 ± 0.137
IleCys: 0.956 ± 0.064
IleAsp: 4.316 ± 0.142
IleGlu: 5.808 ± 0.158
IlePhe: 3.094 ± 0.106
IleGly: 4.331 ± 0.133
IleHis: 0.967 ± 0.067
IleIle: 5.334 ± 0.163
IleLys: 6.211 ± 0.148
IleLeu: 6.875 ± 0.153
IleMet: 1.56 ± 0.073
IleAsn: 3.677 ± 0.125
IlePro: 2.422 ± 0.111
IleGln: 2.131 ± 0.09
IleArg: 2.652 ± 0.096
IleSer: 4.91 ± 0.135
IleThr: 3.842 ± 0.136
IleVal: 5.287 ± 0.152
IleTrp: 0.349 ± 0.035
IleTyr: 2.056 ± 0.096
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.83 ± 0.162
LysCys: 0.906 ± 0.06
LysAsp: 4.431 ± 0.136
LysGlu: 6.189 ± 0.158
LysPhe: 3.174 ± 0.103
LysGly: 4.403 ± 0.133
LysHis: 1.456 ± 0.07
LysIle: 7.123 ± 0.161
LysLys: 7.22 ± 0.19
LysLeu: 7.163 ± 0.169
LysMet: 2.192 ± 0.091
LysAsn: 5.736 ± 0.163
LysPro: 2.836 ± 0.102
LysGln: 2.376 ± 0.084
LysArg: 3.077 ± 0.107
LysSer: 4.967 ± 0.131
LysThr: 4.892 ± 0.133
LysVal: 4.694 ± 0.133
LysTrp: 0.417 ± 0.039
LysTyr: 3.202 ± 0.113
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.666 ± 0.183
LeuCys: 1.301 ± 0.078
LeuAsp: 5.337 ± 0.134
LeuGlu: 7.695 ± 0.167
LeuPhe: 3.522 ± 0.147
LeuGly: 6.649 ± 0.179
LeuHis: 1.391 ± 0.065
LeuIle: 5.535 ± 0.147
LeuLys: 8.881 ± 0.181
LeuLeu: 7.932 ± 0.192
LeuMet: 2.196 ± 0.093
LeuAsn: 5.15 ± 0.153
LeuPro: 3.271 ± 0.111
LeuGln: 2.821 ± 0.109
LeuArg: 3.849 ± 0.118
LeuSer: 7.019 ± 0.18
LeuThr: 5.06 ± 0.154
LeuVal: 6.081 ± 0.152
LeuTrp: 0.525 ± 0.048
LeuTyr: 2.67 ± 0.093
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.959 ± 0.088
MetCys: 0.259 ± 0.025
MetAsp: 1.143 ± 0.059
MetGlu: 1.423 ± 0.074
MetPhe: 0.999 ± 0.062
MetGly: 1.66 ± 0.082
MetHis: 0.37 ± 0.035
MetIle: 1.315 ± 0.074
MetLys: 1.689 ± 0.071
MetLeu: 2.458 ± 0.093
MetMet: 0.51 ± 0.044
MetAsn: 1.103 ± 0.06
MetPro: 1.362 ± 0.072
MetGln: 1.118 ± 0.062
MetArg: 1.157 ± 0.066
MetSer: 1.653 ± 0.068
MetThr: 1.254 ± 0.063
MetVal: 1.33 ± 0.079
MetTrp: 0.122 ± 0.02
MetTyr: 0.633 ± 0.044
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.882 ± 0.125
AsnCys: 0.615 ± 0.041
AsnAsp: 2.243 ± 0.093
AsnGlu: 3.001 ± 0.11
AsnPhe: 2.706 ± 0.102
AsnGly: 2.491 ± 0.118
AsnHis: 0.6 ± 0.047
AsnIle: 4.586 ± 0.131
AsnLys: 3.792 ± 0.118
AsnLeu: 4.679 ± 0.129
AsnMet: 1.341 ± 0.075
AsnAsn: 2.351 ± 0.105
AsnPro: 2.512 ± 0.095
AsnGln: 1.247 ± 0.067
AsnArg: 1.538 ± 0.073
AsnSer: 3.36 ± 0.135
AsnThr: 2.886 ± 0.099
AsnVal: 3.465 ± 0.122
AsnTrp: 0.428 ± 0.038
AsnTyr: 1.887 ± 0.101
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.167 ± 0.096
ProCys: 0.503 ± 0.049
ProAsp: 2.2 ± 0.094
ProGlu: 3.044 ± 0.095
ProPhe: 1.592 ± 0.091
ProGly: 1.772 ± 0.082
ProHis: 0.622 ± 0.05
ProIle: 2.315 ± 0.102
ProLys: 2.67 ± 0.097
ProLeu: 3.091 ± 0.113
ProMet: 0.791 ± 0.058
ProAsn: 1.646 ± 0.073
ProPro: 1.197 ± 0.071
ProGln: 1.599 ± 0.087
ProArg: 1.143 ± 0.056
ProSer: 2.174 ± 0.08
ProThr: 1.804 ± 0.081
ProVal: 2.422 ± 0.108
ProTrp: 0.259 ± 0.032
ProTyr: 1.351 ± 0.081
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.864 ± 0.103
GlnCys: 0.295 ± 0.036
GlnAsp: 1.775 ± 0.076
GlnGlu: 2.882 ± 0.102
GlnPhe: 0.895 ± 0.062
GlnGly: 2.207 ± 0.099
GlnHis: 0.471 ± 0.04
GlnIle: 2.415 ± 0.09
GlnLys: 3.307 ± 0.103
GlnLeu: 2.53 ± 0.099
GlnMet: 0.992 ± 0.056
GlnAsn: 2.128 ± 0.102
GlnPro: 0.773 ± 0.071
GlnGln: 0.981 ± 0.064
GlnArg: 1.344 ± 0.079
GlnSer: 1.901 ± 0.09
GlnThr: 1.833 ± 0.087
GlnVal: 2.031 ± 0.097
GlnTrp: 0.173 ± 0.027
GlnTyr: 0.906 ± 0.058
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.716 ± 0.101
ArgCys: 0.385 ± 0.033
ArgAsp: 2.009 ± 0.082
ArgGlu: 3.716 ± 0.145
ArgPhe: 1.707 ± 0.075
ArgGly: 2.545 ± 0.117
ArgHis: 0.744 ± 0.056
ArgIle: 2.793 ± 0.101
ArgLys: 2.764 ± 0.116
ArgLeu: 3.867 ± 0.133
ArgMet: 0.931 ± 0.062
ArgAsn: 1.617 ± 0.076
ArgPro: 1.308 ± 0.065
ArgGln: 1.538 ± 0.078
ArgArg: 1.556 ± 0.083
ArgSer: 1.923 ± 0.085
ArgThr: 1.926 ± 0.09
ArgVal: 3.073 ± 0.118
ArgTrp: 0.298 ± 0.039
ArgTyr: 1.326 ± 0.065
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.621 ± 0.152
SerCys: 0.827 ± 0.064
SerAsp: 3.501 ± 0.105
SerGlu: 4.697 ± 0.117
SerPhe: 2.728 ± 0.094
SerGly: 5.039 ± 0.154
SerHis: 1.168 ± 0.063
SerIle: 4.137 ± 0.119
SerLys: 4.723 ± 0.114
SerLeu: 5.898 ± 0.157
SerMet: 1.287 ± 0.076
SerAsn: 2.746 ± 0.112
SerPro: 2.225 ± 0.091
SerGln: 2.185 ± 0.098
SerArg: 2.782 ± 0.103
SerSer: 4.349 ± 0.151
SerThr: 3.145 ± 0.128
SerVal: 4.219 ± 0.131
SerTrp: 0.464 ± 0.038
SerTyr: 1.959 ± 0.084
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.889 ± 0.137
ThrCys: 0.582 ± 0.049
ThrAsp: 2.484 ± 0.11
ThrGlu: 2.9 ± 0.109
ThrPhe: 2.099 ± 0.093
ThrGly: 3.921 ± 0.116
ThrHis: 0.97 ± 0.067
ThrIle: 3.925 ± 0.113
ThrLys: 3.824 ± 0.135
ThrLeu: 5.542 ± 0.158
ThrMet: 1.179 ± 0.07
ThrAsn: 2.455 ± 0.089
ThrPro: 2.44 ± 0.097
ThrGln: 1.937 ± 0.087
ThrArg: 2.243 ± 0.085
ThrSer: 3.342 ± 0.138
ThrThr: 3.077 ± 0.121
ThrVal: 4.083 ± 0.125
ThrTrp: 0.288 ± 0.032
ThrTyr: 1.646 ± 0.074
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.438 ± 0.17
ValCys: 1.028 ± 0.066
ValAsp: 3.856 ± 0.133
ValGlu: 4.971 ± 0.132
ValPhe: 3.163 ± 0.123
ValGly: 4.46 ± 0.136
ValHis: 1.129 ± 0.062
ValIle: 4.306 ± 0.144
ValLys: 5.193 ± 0.134
ValLeu: 6.757 ± 0.192
ValMet: 1.51 ± 0.084
ValAsn: 3.066 ± 0.125
ValPro: 2.401 ± 0.096
ValGln: 2.297 ± 0.097
ValArg: 2.645 ± 0.102
ValSer: 4.64 ± 0.14
ValThr: 3.116 ± 0.112
ValVal: 4.838 ± 0.159
ValTrp: 0.446 ± 0.036
ValTyr: 2.11 ± 0.095
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.532 ± 0.042
TrpCys: 0.09 ± 0.018
TrpAsp: 0.399 ± 0.036
TrpGlu: 0.539 ± 0.045
TrpPhe: 0.252 ± 0.026
TrpGly: 0.571 ± 0.041
TrpHis: 0.162 ± 0.026
TrpIle: 0.37 ± 0.038
TrpLys: 0.298 ± 0.036
TrpLeu: 0.661 ± 0.053
TrpMet: 0.129 ± 0.022
TrpAsn: 0.23 ± 0.029
TrpPro: 0.137 ± 0.023
TrpGln: 0.295 ± 0.029
TrpArg: 0.316 ± 0.04
TrpSer: 0.352 ± 0.032
TrpThr: 0.27 ± 0.028
TrpVal: 0.471 ± 0.046
TrpTrp: 0.101 ± 0.02
TrpTyr: 0.226 ± 0.033
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.25 ± 0.096
TyrCys: 0.521 ± 0.039
TyrAsp: 2.164 ± 0.096
TyrGlu: 2.34 ± 0.091
TyrPhe: 1.729 ± 0.078
TyrGly: 2.153 ± 0.09
TyrHis: 0.446 ± 0.048
TyrIle: 2.275 ± 0.088
TyrLys: 2.609 ± 0.111
TyrLeu: 3.138 ± 0.114
TyrMet: 0.686 ± 0.052
TyrAsn: 1.743 ± 0.08
TyrPro: 1.305 ± 0.07
TyrGln: 1.021 ± 0.066
TyrArg: 1.398 ± 0.067
TyrSer: 2.462 ± 0.095
TyrThr: 1.79 ± 0.081
TyrVal: 1.912 ± 0.073
TyrTrp: 0.23 ± 0.033
TyrTyr: 1.186 ± 0.071
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 1073 proteins (278237 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
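Protein-level bootstrapping can be sketched as follows. The page does not say which statistic is reported as the error, so this sketch assumes it is the standard deviation of the per mille frequency across 100 resamples of whole proteins drawn with replacement. The function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Per mille frequency of one dipeptide over a list of sequences."""
    hits = 0
    total = 0
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            total += 1
            if first + second == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy input (hypothetical sequences, not taken from this proteome).
toy = ["MKLA", "MKA", "AAKL", "LLKM"]
print(bootstrap_error(toy, "MK"))
```

Resampling whole proteins rather than individual dipeptides keeps pairs from the same protein together, so the estimate reflects variation between proteins.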

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski