Amino acid dipeptide frequency for Candidatus Aquiluna sp. IMCC13023

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides occurred with equal probability in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
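The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used to build this page: it assumes one-letter amino acid codes (the table uses three-letter codes), overlapping pairs that never span two proteins, and that any letter outside the 20 standard ones is mapped to X (Xaa). The function name `dipeptide_per_mille` and the toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all 441 possible pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard letters becomes X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found (all sequences shorter than 2)")
    alphabet = STANDARD + "X"
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(alphabet, repeat=2)}


# Toy input, not real proteome data.
toy = ["MAAL", "ALAX"]
freqs = dipeptide_per_mille(toy)

# Uniform expectation over the 400 standard dipeptides, in per mille.
uniform = 1000.0 / 400
```

With a real proteome, `proteins` would be the list of protein sequences, and each value in `freqs` could be compared against `uniform` to see which dipeptides are over- or underrepresented.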

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
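As a small worked example, the sketch below converts a few values from the table from per mille to fractions and compares them with the uniform expectation of 0.25% (2.5‰). The simple threshold rule is an assumption made for illustration; the page does not state exactly how the red and blue marking is decided. The names `per_mille_to_fraction` and `label` are hypothetical.

```python
UNIFORM_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def label(value, expected=UNIFORM_PER_MILLE):
    """Assumed rule: compare a per mille value with the uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


# Values copied from the table (per mille).
examples = {"AlaAla": 12.265, "CysCys": 0.034, "AspAsp": 2.562}

for pair, value in examples.items():
    print(pair, per_mille_to_fraction(value), label(value))
```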

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
12.265AlaAla: 12.265 ± 0.235
0.551AlaCys: 0.551 ± 0.038
5.376AlaAsp: 5.376 ± 0.123
6.746AlaGlu: 6.746 ± 0.142
3.709AlaPhe: 3.709 ± 0.117
9.399AlaGly: 9.399 ± 0.168
1.659AlaHis: 1.659 ± 0.069
6.975AlaIle: 6.975 ± 0.136
5.836AlaLys: 5.836 ± 0.142
11.609AlaLeu: 11.609 ± 0.171
2.596AlaMet: 2.596 ± 0.079
3.568AlaAsn: 3.568 ± 0.103
3.786AlaPro: 3.786 ± 0.134
3.503AlaGln: 3.503 ± 0.1
4.998AlaArg: 4.998 ± 0.103
7.114AlaSer: 7.114 ± 0.132
5.97AlaThr: 5.97 ± 0.133
8.055AlaVal: 8.055 ± 0.138
1.066AlaTrp: 1.066 ± 0.055
1.99AlaTyr: 1.99 ± 0.065
0.0AlaXaa: 0.0 ± 0.0
Cys
0.527CysAla: 0.527 ± 0.039
0.034CysCys: 0.034 ± 0.01
0.4CysAsp: 0.4 ± 0.033
0.383CysGlu: 0.383 ± 0.032
0.211CysPhe: 0.211 ± 0.023
0.577CysGly: 0.577 ± 0.044
0.108CysHis: 0.108 ± 0.017
0.244CysIle: 0.244 ± 0.023
0.199CysLys: 0.199 ± 0.022
0.455CysLeu: 0.455 ± 0.036
0.084CysMet: 0.084 ± 0.014
0.158CysAsn: 0.158 ± 0.021
0.28CysPro: 0.28 ± 0.029
0.192CysGln: 0.192 ± 0.021
0.22CysArg: 0.22 ± 0.023
0.414CysSer: 0.414 ± 0.033
0.326CysThr: 0.326 ± 0.028
0.366CysVal: 0.366 ± 0.027
0.069CysTrp: 0.069 ± 0.013
0.122CysTyr: 0.122 ± 0.015
0.0CysXaa: 0.0 ± 0.0
Asp
5.35AspAla: 5.35 ± 0.121
0.352AspCys: 0.352 ± 0.03
2.562AspAsp: 2.562 ± 0.08
3.587AspGlu: 3.587 ± 0.098
2.325AspPhe: 2.325 ± 0.073
4.138AspGly: 4.138 ± 0.107
0.946AspHis: 0.946 ± 0.052
2.948AspIle: 2.948 ± 0.089
2.088AspLys: 2.088 ± 0.07
6.346AspLeu: 6.346 ± 0.127
1.063AspMet: 1.063 ± 0.058
1.422AspAsn: 1.422 ± 0.067
3.087AspPro: 3.087 ± 0.088
2.34AspGln: 2.34 ± 0.075
2.895AspArg: 2.895 ± 0.088
3.93AspSer: 3.93 ± 0.102
2.392AspThr: 2.392 ± 0.075
4.049AspVal: 4.049 ± 0.093
0.776AspTrp: 0.776 ± 0.045
1.549AspTyr: 1.549 ± 0.061
0.0AspXaa: 0.0 ± 0.0
Glu
6.635GluAla: 6.635 ± 0.135
0.259GluCys: 0.259 ± 0.025
2.919GluAsp: 2.919 ± 0.084
3.484GluGlu: 3.484 ± 0.092
2.495GluPhe: 2.495 ± 0.077
3.87GluGly: 3.87 ± 0.11
1.118GluHis: 1.118 ± 0.057
4.741GluIle: 4.741 ± 0.115
2.936GluLys: 2.936 ± 0.096
7.89GluLeu: 7.89 ± 0.156
1.406GluMet: 1.406 ± 0.068
2.09GluAsn: 2.09 ± 0.07
2.689GluPro: 2.689 ± 0.086
2.656GluGln: 2.656 ± 0.081
3.554GluArg: 3.554 ± 0.116
3.891GluSer: 3.891 ± 0.095
3.412GluThr: 3.412 ± 0.098
5.488GluVal: 5.488 ± 0.127
0.587GluTrp: 0.587 ± 0.037
1.351GluTyr: 1.351 ± 0.058
0.0GluXaa: 0.0 ± 0.0
Phe
4.243PheAla: 4.243 ± 0.112
0.23PheCys: 0.23 ± 0.023
2.684PheAsp: 2.684 ± 0.08
2.725PheGlu: 2.725 ± 0.086
1.626PhePhe: 1.626 ± 0.078
4.112PheGly: 4.112 ± 0.113
0.57PheHis: 0.57 ± 0.037
1.983PheIle: 1.983 ± 0.071
1.152PheLys: 1.152 ± 0.047
3.508PheLeu: 3.508 ± 0.101
0.74PheMet: 0.74 ± 0.043
1.042PheAsn: 1.042 ± 0.049
1.473PhePro: 1.473 ± 0.062
1.102PheGln: 1.102 ± 0.047
1.717PheArg: 1.717 ± 0.06
2.766PheSer: 2.766 ± 0.082
1.995PheThr: 1.995 ± 0.077
3.01PheVal: 3.01 ± 0.081
0.469PheTrp: 0.469 ± 0.029
0.912PheTyr: 0.912 ± 0.057
0.0PheXaa: 0.0 ± 0.0
Gly
7.794GlyAla: 7.794 ± 0.155
0.577GlyCys: 0.577 ± 0.036
3.949GlyAsp: 3.949 ± 0.113
4.562GlyGlu: 4.562 ± 0.108
3.769GlyPhe: 3.769 ± 0.104
5.953GlyGly: 5.953 ± 0.162
1.578GlyHis: 1.578 ± 0.073
5.5GlyIle: 5.5 ± 0.115
4.049GlyLys: 4.049 ± 0.114
9.092GlyLeu: 9.092 ± 0.163
1.961GlyMet: 1.961 ± 0.073
2.426GlyAsn: 2.426 ± 0.08
2.95GlyPro: 2.95 ± 0.091
3.084GlyGln: 3.084 ± 0.083
4.037GlyArg: 4.037 ± 0.102
5.702GlySer: 5.702 ± 0.142
4.339GlyThr: 4.339 ± 0.119
6.616GlyVal: 6.616 ± 0.138
1.125GlyTrp: 1.125 ± 0.051
2.22GlyTyr: 2.22 ± 0.072
0.0GlyXaa: 0.0 ± 0.0
His
1.59HisAla: 1.59 ± 0.07
0.103HisCys: 0.103 ± 0.016
0.939HisAsp: 0.939 ± 0.058
1.099HisGlu: 1.099 ± 0.048
0.656HisPhe: 0.656 ± 0.037
1.528HisGly: 1.528 ± 0.064
0.407HisHis: 0.407 ± 0.033
0.864HisIle: 0.864 ± 0.05
0.647HisLys: 0.647 ± 0.042
1.731HisLeu: 1.731 ± 0.063
0.292HisMet: 0.292 ± 0.024
0.474HisAsn: 0.474 ± 0.03
1.066HisPro: 1.066 ± 0.06
0.577HisGln: 0.577 ± 0.043
1.02HisArg: 1.02 ± 0.049
1.173HisSer: 1.173 ± 0.056
0.785HisThr: 0.785 ± 0.043
1.188HisVal: 1.188 ± 0.055
0.216HisTrp: 0.216 ± 0.022
0.438HisTyr: 0.438 ± 0.033
0.0HisXaa: 0.0 ± 0.0
Ile
7.593IleAla: 7.593 ± 0.135
0.4IleCys: 0.4 ± 0.027
3.944IleAsp: 3.944 ± 0.107
4.385IleGlu: 4.385 ± 0.099
2.035IlePhe: 2.035 ± 0.086
5.534IleGly: 5.534 ± 0.141
0.862IleHis: 0.862 ± 0.047
3.051IleIle: 3.051 ± 0.092
2.138IleLys: 2.138 ± 0.066
4.962IleLeu: 4.962 ± 0.109
1.027IleMet: 1.027 ± 0.047
1.758IleAsn: 1.758 ± 0.073
2.902IlePro: 2.902 ± 0.088
1.861IleGln: 1.861 ± 0.072
3.329IleArg: 3.329 ± 0.094
4.717IleSer: 4.717 ± 0.11
3.669IleThr: 3.669 ± 0.109
4.492IleVal: 4.492 ± 0.096
0.75IleTrp: 0.75 ± 0.045
1.24IleTyr: 1.24 ± 0.054
0.0IleXaa: 0.0 ± 0.0
Lys
4.571LysAla: 4.571 ± 0.131
0.168LysCys: 0.168 ± 0.021
2.356LysAsp: 2.356 ± 0.079
2.62LysGlu: 2.62 ± 0.079
1.406LysPhe: 1.406 ± 0.058
2.799LysGly: 2.799 ± 0.082
0.793LysHis: 0.793 ± 0.045
2.687LysIle: 2.687 ± 0.082
2.407LysLys: 2.407 ± 0.099
4.76LysLeu: 4.76 ± 0.102
1.109LysMet: 1.109 ± 0.053
1.846LysAsn: 1.846 ± 0.066
2.268LysPro: 2.268 ± 0.078
1.616LysGln: 1.616 ± 0.068
2.589LysArg: 2.589 ± 0.084
2.981LysSer: 2.981 ± 0.082
2.689LysThr: 2.689 ± 0.081
3.391LysVal: 3.391 ± 0.101
0.46LysTrp: 0.46 ± 0.037
0.91LysTyr: 0.91 ± 0.047
0.0LysXaa: 0.0 ± 0.0
Leu
12.699LeuAla: 12.699 ± 0.195
0.481LeuCys: 0.481 ± 0.038
6.142LeuAsp: 6.142 ± 0.132
7.098LeuGlu: 7.098 ± 0.156
3.539LeuPhe: 3.539 ± 0.103
9.116LeuGly: 9.116 ± 0.185
1.518LeuHis: 1.518 ± 0.063
6.197LeuIle: 6.197 ± 0.127
4.243LeuLys: 4.243 ± 0.104
10.067LeuLeu: 10.067 ± 0.208
2.323LeuMet: 2.323 ± 0.082
3.034LeuAsn: 3.034 ± 0.087
4.626LeuPro: 4.626 ± 0.115
3.214LeuGln: 3.214 ± 0.08
5.532LeuArg: 5.532 ± 0.118
7.895LeuSer: 7.895 ± 0.167
5.85LeuThr: 5.85 ± 0.122
9.325LeuVal: 9.325 ± 0.173
1.145LeuTrp: 1.145 ± 0.053
1.753LeuTyr: 1.753 ± 0.059
0.0LeuXaa: 0.0 ± 0.0
Met
2.656MetAla: 2.656 ± 0.084
0.105MetCys: 0.105 ± 0.018
1.054MetAsp: 1.054 ± 0.057
1.111MetGlu: 1.111 ± 0.048
0.687MetPhe: 0.687 ± 0.041
1.628MetGly: 1.628 ± 0.06
0.376MetHis: 0.376 ± 0.03
1.315MetIle: 1.315 ± 0.052
0.941MetLys: 0.941 ± 0.051
2.256MetLeu: 2.256 ± 0.084
0.448MetMet: 0.448 ± 0.031
0.793MetAsn: 0.793 ± 0.046
1.121MetPro: 1.121 ± 0.053
0.864MetGln: 0.864 ± 0.043
1.298MetArg: 1.298 ± 0.054
1.652MetSer: 1.652 ± 0.059
1.559MetThr: 1.559 ± 0.056
1.76MetVal: 1.76 ± 0.062
0.211MetTrp: 0.211 ± 0.021
0.321MetTyr: 0.321 ± 0.029
0.0MetXaa: 0.0 ± 0.0
Asn
3.032AsnAla: 3.032 ± 0.099
0.218AsnCys: 0.218 ± 0.023
1.406AsnAsp: 1.406 ± 0.051
1.774AsnGlu: 1.774 ± 0.068
1.276AsnPhe: 1.276 ± 0.049
2.359AsnGly: 2.359 ± 0.071
0.572AsnHis: 0.572 ± 0.035
1.722AsnIle: 1.722 ± 0.068
1.365AsnLys: 1.365 ± 0.059
3.642AsnLeu: 3.642 ± 0.097
0.73AsnMet: 0.73 ± 0.044
1.013AsnAsn: 1.013 ± 0.046
2.15AsnPro: 2.15 ± 0.072
1.492AsnGln: 1.492 ± 0.055
1.719AsnArg: 1.719 ± 0.06
2.083AsnSer: 2.083 ± 0.069
1.592AsnThr: 1.592 ± 0.07
2.059AsnVal: 2.059 ± 0.067
0.556AsnTrp: 0.556 ± 0.036
0.898AsnTyr: 0.898 ± 0.049
0.0AsnXaa: 0.0 ± 0.0
Pro
4.361ProAla: 4.361 ± 0.12
0.163ProCys: 0.163 ± 0.021
2.668ProAsp: 2.668 ± 0.073
3.666ProGlu: 3.666 ± 0.112
1.48ProPhe: 1.48 ± 0.065
3.736ProGly: 3.736 ± 0.101
0.771ProHis: 0.771 ± 0.048
2.658ProIle: 2.658 ± 0.082
2.258ProLys: 2.258 ± 0.077
4.102ProLeu: 4.102 ± 0.097
0.903ProMet: 0.903 ± 0.047
1.679ProAsn: 1.679 ± 0.062
1.061ProPro: 1.061 ± 0.053
1.42ProGln: 1.42 ± 0.06
1.868ProArg: 1.868 ± 0.064
2.701ProSer: 2.701 ± 0.084
2.701ProThr: 2.701 ± 0.086
3.647ProVal: 3.647 ± 0.11
0.534ProTrp: 0.534 ± 0.033
0.965ProTyr: 0.965 ± 0.053
0.0ProXaa: 0.0 ± 0.0
Gln
3.736GlnAla: 3.736 ± 0.099
0.172GlnCys: 0.172 ± 0.023
1.93GlnAsp: 1.93 ± 0.059
2.232GlnGlu: 2.232 ± 0.07
1.2GlnPhe: 1.2 ± 0.056
2.416GlnGly: 2.416 ± 0.07
0.56GlnHis: 0.56 ± 0.038
2.383GlnIle: 2.383 ± 0.075
1.612GlnLys: 1.612 ± 0.065
4.394GlnLeu: 4.394 ± 0.104
0.864GlnMet: 0.864 ± 0.038
1.193GlnAsn: 1.193 ± 0.051
1.291GlnPro: 1.291 ± 0.061
1.36GlnGln: 1.36 ± 0.06
1.911GlnArg: 1.911 ± 0.073
2.203GlnSer: 2.203 ± 0.077
1.743GlnThr: 1.743 ± 0.07
3.139GlnVal: 3.139 ± 0.094
0.354GlnTrp: 0.354 ± 0.028
0.706GlnTyr: 0.706 ± 0.045
0.0GlnXaa: 0.0 ± 0.0
Arg
5.05ArgAla: 5.05 ± 0.11
0.259ArgCys: 0.259 ± 0.027
2.943ArgAsp: 2.943 ± 0.079
3.206ArgGlu: 3.206 ± 0.103
2.184ArgPhe: 2.184 ± 0.06
3.539ArgGly: 3.539 ± 0.101
0.975ArgHis: 0.975 ± 0.051
3.264ArgIle: 3.264 ± 0.095
2.435ArgLys: 2.435 ± 0.076
5.469ArgLeu: 5.469 ± 0.128
1.26ArgMet: 1.26 ± 0.055
1.667ArgAsn: 1.667 ± 0.061
2.071ArgPro: 2.071 ± 0.066
1.988ArgGln: 1.988 ± 0.069
2.943ArgArg: 2.943 ± 0.097
3.266ArgSer: 3.266 ± 0.095
2.663ArgThr: 2.663 ± 0.078
4.389ArgVal: 4.389 ± 0.103
0.711ArgTrp: 0.711 ± 0.043
1.315ArgTyr: 1.315 ± 0.061
0.0ArgXaa: 0.0 ± 0.0
Ser
6.851SerAla: 6.851 ± 0.139
0.407SerCys: 0.407 ± 0.031
3.659SerAsp: 3.659 ± 0.099
4.339SerGlu: 4.339 ± 0.124
2.73SerPhe: 2.73 ± 0.087
6.477SerGly: 6.477 ± 0.135
1.118SerHis: 1.118 ± 0.047
4.145SerIle: 4.145 ± 0.102
3.403SerLys: 3.403 ± 0.096
7.241SerLeu: 7.241 ± 0.168
1.729SerMet: 1.729 ± 0.065
2.162SerAsn: 2.162 ± 0.069
2.823SerPro: 2.823 ± 0.076
2.615SerGln: 2.615 ± 0.079
3.506SerArg: 3.506 ± 0.09
5.122SerSer: 5.122 ± 0.151
3.618SerThr: 3.618 ± 0.105
5.309SerVal: 5.309 ± 0.125
0.922SerTrp: 0.922 ± 0.045
1.791SerTyr: 1.791 ± 0.07
0.0SerXaa: 0.0 ± 0.0
Thr
5.709ThrAla: 5.709 ± 0.118
0.271ThrCys: 0.271 ± 0.027
3.067ThrAsp: 3.067 ± 0.095
3.386ThrGlu: 3.386 ± 0.097
1.959ThrPhe: 1.959 ± 0.07
5.16ThrGly: 5.16 ± 0.119
0.965ThrHis: 0.965 ± 0.05
3.077ThrIle: 3.077 ± 0.091
2.665ThrLys: 2.665 ± 0.082
5.292ThrLeu: 5.292 ± 0.111
1.116ThrMet: 1.116 ± 0.048
1.944ThrAsn: 1.944 ± 0.072
2.881ThrPro: 2.881 ± 0.081
1.655ThrGln: 1.655 ± 0.062
2.601ThrArg: 2.601 ± 0.079
3.817ThrSer: 3.817 ± 0.116
3.24ThrThr: 3.24 ± 0.12
4.394ThrVal: 4.394 ± 0.103
0.644ThrTrp: 0.644 ± 0.04
1.147ThrTyr: 1.147 ± 0.057
0.0ThrXaa: 0.0 ± 0.0
Val
8.872ValAla: 8.872 ± 0.146
0.378ValCys: 0.378 ± 0.033
4.337ValAsp: 4.337 ± 0.109
5.005ValGlu: 5.005 ± 0.116
3.182ValPhe: 3.182 ± 0.087
6.25ValGly: 6.25 ± 0.141
1.317ValHis: 1.317 ± 0.06
5.366ValIle: 5.366 ± 0.111
2.809ValLys: 2.809 ± 0.098
8.625ValLeu: 8.625 ± 0.143
1.894ValMet: 1.894 ± 0.075
2.387ValAsn: 2.387 ± 0.07
3.259ValPro: 3.259 ± 0.097
2.347ValGln: 2.347 ± 0.071
3.86ValArg: 3.86 ± 0.095
6.075ValSer: 6.075 ± 0.149
4.902ValThr: 4.902 ± 0.142
7.043ValVal: 7.043 ± 0.14
0.817ValTrp: 0.817 ± 0.044
1.545ValTyr: 1.545 ± 0.061
0.0ValXaa: 0.0 ± 0.0
Trp
1.19TrpAla: 1.19 ± 0.057
0.069TrpCys: 0.069 ± 0.014
0.599TrpAsp: 0.599 ± 0.039
0.594TrpGlu: 0.594 ± 0.036
0.591TrpPhe: 0.591 ± 0.038
0.838TrpGly: 0.838 ± 0.047
0.232TrpHis: 0.232 ± 0.025
0.656TrpIle: 0.656 ± 0.039
0.431TrpLys: 0.431 ± 0.027
1.652TrpLeu: 1.652 ± 0.072
0.287TrpMet: 0.287 ± 0.028
0.4TrpAsn: 0.4 ± 0.028
0.503TrpPro: 0.503 ± 0.035
0.515TrpGln: 0.515 ± 0.042
0.666TrpArg: 0.666 ± 0.038
0.881TrpSer: 0.881 ± 0.051
0.472TrpThr: 0.472 ± 0.031
0.927TrpVal: 0.927 ± 0.052
0.227TrpTrp: 0.227 ± 0.023
0.235TrpTyr: 0.235 ± 0.024
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.047TyrAla: 2.047 ± 0.071
0.172TyrCys: 0.172 ± 0.019
1.212TyrAsp: 1.212 ± 0.056
1.47TyrGlu: 1.47 ± 0.065
0.996TyrPhe: 0.996 ± 0.05
1.856TyrGly: 1.856 ± 0.073
0.347TyrHis: 0.347 ± 0.027
0.872TyrIle: 0.872 ± 0.048
0.812TyrLys: 0.812 ± 0.041
2.701TyrLeu: 2.701 ± 0.079
0.321TyrMet: 0.321 ± 0.032
0.603TyrAsn: 0.603 ± 0.041
1.054TyrPro: 1.054 ± 0.054
1.001TyrGln: 1.001 ± 0.049
1.336TyrArg: 1.336 ± 0.052
1.583TyrSer: 1.583 ± 0.065
1.073TyrThr: 1.073 ± 0.058
1.624TyrVal: 1.624 ± 0.06
0.328TyrTrp: 0.328 ± 0.03
0.488TyrTyr: 0.488 ± 0.037
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1364 proteins (417606 amino acids)
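
Each table row above follows the pattern "displayed value, dipeptide label, colon, value ± error", for example `12.265AlaAla: 12.265 ± 0.235`. A minimal sketch for turning such rows into structured records is shown below; the regular expression and the function name `parse_row` are assumptions made for illustration and are not part of the database's own tooling.

```python
import re

# displayed value, first residue, second residue, ": ", value, " ± ", error
ROW = re.compile(
    r"^(?P<shown>\d+(?:\.\d+)?)"
    r"(?P<first>[A-Z][a-z]{2})(?P<second>[A-Z][a-z]{2}): "
    r"(?P<value>\d+(?:\.\d+)?) ± (?P<error>\d+(?:\.\d+)?)$"
)


def parse_row(line):
    """Return (first, second, value, error) or None for non-data lines."""
    m = ROW.match(line.strip())
    if m is None:
        return None  # e.g. a row label such as "Ala"
    return (m["first"], m["second"], float(m["value"]), float(m["error"]))


print(parse_row("12.265AlaAla: 12.265 ± 0.235"))
```

Lines that carry only a row label (such as `Ala`) do not match the pattern and yield `None`, so they can be skipped when reading the table.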

Note: The error has been estimated by bootstrapping (x100) at the protein level.
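
One way such a protein-level bootstrap could look is sketched below. The details are assumptions: whole proteins are resampled with replacement, 100 replicates are drawn, and the error is taken as the standard deviation of the replicate frequencies. The page only states that bootstrapping (x100) at the protein level was used, so this is not necessarily the exact procedure. The function names and the toy sequences are hypothetical.

```python
import random
from collections import Counter
from statistics import stdev


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Std. dev. of the per mille frequency of `pair` over bootstrap replicates."""
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


# Toy input with one-letter codes, not real proteome data.
toy = ["MAAL", "ALAX", "GGGG", "ALAL"]
print(bootstrap_error(toy, "AL"))
```

Resampling proteins rather than individual dipeptides keeps the pairs from one protein together, so the estimate reflects variation between proteins.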

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski