Amino acid dipeptide frequency for Candidatus Xenolissoclinum pacificiensis L6

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
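As a minimal sketch of how such a frequency might be computed, the example below counts overlapping adjacent residue pairs. It assumes one-letter sequence codes and that pairs are counted within each protein only, never across protein boundaries. The function name `dipeptide_frequencies` and the toy sequences are illustrative and are not taken from Proteome-pI.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted within each protein only.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Hypothetical toy proteome, for illustration only.
toy = ["MKKL", "MKL"]
freqs = dipeptide_frequencies(toy)
```

By construction the returned frequencies sum to 1000‰ over all observed dipeptides.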

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and all underrepresented ones in blue, in the table.
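The uniform expectation can be checked with a short calculation. The sketch below assumes a plain comparison against the 1/400 baseline; the page does not state the exact rule behind its colouring, so `classify` is only an illustration. The three example values are taken from the table.

```python
# Uniform expectation for 400 standard dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%

def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"

# Values copied from the table (per mille).
examples = {"LeuLeu": 9.872, "AlaAla": 1.312, "TrpTrp": 0.056}
labels = {name: classify(value) for name, value in examples.items()}
```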

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
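For illustration, a table entry of the form `LeuLeu: 9.872 ± 0.238` could be converted to plain fractions as follows. The helper name `parse_cell` is hypothetical, and the entry format is assumed from the entries in the table.

```python
def parse_cell(cell):
    """Parse 'LeuLeu: 9.872 ± 0.238' into (name, fraction, error_fraction)."""
    name, rest = cell.split(":")
    value, error = (float(part) for part in rest.split("±"))
    # Per mille to fraction: multiply by 10^-3.
    return name.strip(), value * 1e-3, error * 1e-3

name, frac, err = parse_cell("LeuLeu: 9.872 ± 0.238")
```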

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.312 ± 0.079
AlaCys: 0.762 ± 0.057
AlaAsp: 2.266 ± 0.097
AlaGlu: 2.404 ± 0.091
AlaPhe: 2.144 ± 0.089
AlaGly: 2.631 ± 0.111
AlaHis: 1.152 ± 0.06
AlaIle: 4.681 ± 0.15
AlaLys: 3.292 ± 0.107
AlaLeu: 4.912 ± 0.166
AlaMet: 1.126 ± 0.064
AlaAsn: 2.385 ± 0.103
AlaPro: 1.286 ± 0.075
AlaGln: 1.787 ± 0.08
AlaArg: 2.073 ± 0.089
AlaSer: 3.604 ± 0.125
AlaThr: 2.664 ± 0.101
AlaVal: 3.24 ± 0.111
AlaTrp: 0.29 ± 0.033
AlaTyr: 1.962 ± 0.078
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.747 ± 0.051
CysCys: 0.245 ± 0.029
CysAsp: 0.892 ± 0.055
CysGlu: 0.494 ± 0.041
CysPhe: 0.799 ± 0.053
CysGly: 0.907 ± 0.074
CysHis: 0.327 ± 0.036
CysIle: 1.416 ± 0.071
CysLys: 1.022 ± 0.075
CysLeu: 1.326 ± 0.082
CysMet: 0.446 ± 0.04
CysAsn: 0.951 ± 0.066
CysPro: 0.338 ± 0.036
CysGln: 0.476 ± 0.042
CysArg: 0.568 ± 0.043
CysSer: 1.133 ± 0.073
CysThr: 0.736 ± 0.049
CysVal: 1.022 ± 0.066
CysTrp: 0.108 ± 0.024
CysTyr: 0.713 ± 0.05
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.671 ± 0.095
AspCys: 0.658 ± 0.047
AspAsp: 3.385 ± 0.143
AspGlu: 2.998 ± 0.126
AspPhe: 2.727 ± 0.109
AspGly: 2.452 ± 0.121
AspHis: 1.36 ± 0.077
AspIle: 6.714 ± 0.178
AspLys: 4.091 ± 0.149
AspLeu: 5.321 ± 0.148
AspMet: 1.75 ± 0.098
AspAsn: 3.898 ± 0.138
AspPro: 1.653 ± 0.087
AspGln: 1.995 ± 0.098
AspArg: 2.144 ± 0.103
AspSer: 3.883 ± 0.125
AspThr: 3.028 ± 0.128
AspVal: 3.99 ± 0.13
AspTrp: 0.36 ± 0.04
AspTyr: 2.333 ± 0.097
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.218 ± 0.101
GluCys: 0.773 ± 0.05
GluAsp: 3.121 ± 0.142
GluGlu: 3.589 ± 0.142
GluPhe: 2.114 ± 0.1
GluGly: 2.623 ± 0.098
GluHis: 1.323 ± 0.064
GluIle: 5.558 ± 0.151
GluLys: 4.782 ± 0.15
GluLeu: 4.366 ± 0.15
GluMet: 1.549 ± 0.09
GluAsn: 4.05 ± 0.117
GluPro: 1.144 ± 0.078
GluGln: 2.185 ± 0.12
GluArg: 2.348 ± 0.109
GluSer: 3.749 ± 0.131
GluThr: 2.185 ± 0.098
GluVal: 3.675 ± 0.153
GluTrp: 0.379 ± 0.04
GluTyr: 3.177 ± 0.134
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.497 ± 0.104
PheCys: 0.751 ± 0.054
PheAsp: 2.53 ± 0.106
PheGlu: 2.032 ± 0.087
PhePhe: 2.995 ± 0.126
PheGly: 2.307 ± 0.1
PheHis: 0.985 ± 0.059
PheIle: 4.158 ± 0.157
PheLys: 2.501 ± 0.101
PheLeu: 5.246 ± 0.158
PheMet: 1.144 ± 0.075
PheAsn: 2.367 ± 0.102
PhePro: 1.516 ± 0.086
PheGln: 1.323 ± 0.068
PheArg: 1.642 ± 0.073
PheSer: 4.603 ± 0.151
PheThr: 2.359 ± 0.107
PheVal: 2.972 ± 0.094
PheTrp: 0.331 ± 0.035
PheTyr: 1.817 ± 0.083
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.649 ± 0.113
GlyCys: 0.903 ± 0.065
GlyAsp: 2.697 ± 0.107
GlyGlu: 2.263 ± 0.105
GlyPhe: 2.556 ± 0.104
GlyGly: 3.225 ± 0.149
GlyHis: 1.196 ± 0.076
GlyIle: 5.447 ± 0.146
GlyLys: 3.634 ± 0.13
GlyLeu: 4.529 ± 0.145
GlyMet: 1.65 ± 0.091
GlyAsn: 2.645 ± 0.102
GlyPro: 0.866 ± 0.055
GlyGln: 1.646 ± 0.081
GlyArg: 2.155 ± 0.1
GlySer: 3.719 ± 0.142
GlyThr: 2.549 ± 0.095
GlyVal: 3.953 ± 0.147
GlyTrp: 0.36 ± 0.035
GlyTyr: 2.527 ± 0.095
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.17 ± 0.058
HisCys: 0.308 ± 0.034
HisAsp: 1.367 ± 0.075
HisGlu: 1.133 ± 0.072
HisPhe: 0.977 ± 0.056
HisGly: 1.159 ± 0.071
HisHis: 0.676 ± 0.055
HisIle: 2.274 ± 0.095
HisLys: 1.787 ± 0.092
HisLeu: 1.798 ± 0.077
HisMet: 0.62 ± 0.053
HisAsn: 1.865 ± 0.107
HisPro: 0.888 ± 0.059
HisGln: 0.888 ± 0.083
HisArg: 0.832 ± 0.06
HisSer: 1.683 ± 0.075
HisThr: 1.252 ± 0.082
HisVal: 1.456 ± 0.074
HisTrp: 0.104 ± 0.019
HisTyr: 0.966 ± 0.059
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.257 ± 0.159
IleCys: 1.401 ± 0.076
IleAsp: 5.603 ± 0.174
IleGlu: 5.458 ± 0.164
IlePhe: 4.429 ± 0.168
IleGly: 4.916 ± 0.162
IleHis: 2.188 ± 0.101
IleIle: 8.921 ± 0.266
IleLys: 6.346 ± 0.152
IleLeu: 9.683 ± 0.266
IleMet: 2.408 ± 0.091
IleAsn: 5.577 ± 0.187
IlePro: 3.89 ± 0.142
IleGln: 3.474 ± 0.127
IleArg: 3.667 ± 0.126
IleSer: 8.676 ± 0.204
IleThr: 5.116 ± 0.159
IleVal: 6.32 ± 0.176
IleTrp: 0.539 ± 0.045
IleTyr: 3.868 ± 0.14
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.664 ± 0.099
LysCys: 0.981 ± 0.068
LysAsp: 3.972 ± 0.134
LysGlu: 4.585 ± 0.148
LysPhe: 2.367 ± 0.083
LysGly: 3.396 ± 0.123
LysHis: 1.806 ± 0.084
LysIle: 7.494 ± 0.175
LysLys: 6.164 ± 0.203
LysLeu: 5.543 ± 0.157
LysMet: 1.884 ± 0.087
LysAsn: 5.373 ± 0.161
LysPro: 1.735 ± 0.09
LysGln: 2.501 ± 0.108
LysArg: 2.913 ± 0.107
LysSer: 5.265 ± 0.161
LysThr: 3.604 ± 0.123
LysVal: 4.555 ± 0.13
LysTrp: 0.409 ± 0.042
LysTyr: 3.834 ± 0.12
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.158 ± 0.122
LeuCys: 1.542 ± 0.078
LeuAsp: 5.228 ± 0.142
LeuGlu: 5.83 ± 0.157
LeuPhe: 4.715 ± 0.164
LeuGly: 4.473 ± 0.144
LeuHis: 2.348 ± 0.09
LeuIle: 7.074 ± 0.226
LeuLys: 6.35 ± 0.167
LeuLeu: 9.872 ± 0.238
LeuMet: 2.263 ± 0.091
LeuAsn: 5.239 ± 0.148
LeuPro: 3.348 ± 0.109
LeuGln: 4.098 ± 0.158
LeuArg: 3.593 ± 0.147
LeuSer: 9.512 ± 0.212
LeuThr: 4.076 ± 0.138
LeuVal: 5.573 ± 0.167
LeuTrp: 0.58 ± 0.048
LeuTyr: 4.362 ± 0.13
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.022 ± 0.064
MetCys: 0.42 ± 0.037
MetAsp: 1.208 ± 0.069
MetGlu: 1.352 ± 0.08
MetPhe: 1.315 ± 0.064
MetGly: 1.382 ± 0.079
MetHis: 0.646 ± 0.046
MetIle: 2.735 ± 0.102
MetLys: 1.828 ± 0.086
MetLeu: 2.898 ± 0.099
MetMet: 0.869 ± 0.068
MetAsn: 1.605 ± 0.084
MetPro: 0.862 ± 0.068
MetGln: 1.204 ± 0.065
MetArg: 1.103 ± 0.067
MetSer: 2.218 ± 0.102
MetThr: 1.345 ± 0.083
MetVal: 1.683 ± 0.08
MetTrp: 0.13 ± 0.024
MetTyr: 1.156 ± 0.072
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.905 ± 0.111
AsnCys: 0.684 ± 0.056
AsnAsp: 3.325 ± 0.114
AsnGlu: 2.422 ± 0.092
AsnPhe: 2.753 ± 0.104
AsnGly: 2.638 ± 0.12
AsnHis: 1.583 ± 0.072
AsnIle: 7.951 ± 0.203
AsnLys: 4.689 ± 0.13
AsnLeu: 5.447 ± 0.154
AsnMet: 1.88 ± 0.088
AsnAsn: 4.622 ± 0.177
AsnPro: 2.032 ± 0.09
AsnGln: 2.103 ± 0.108
AsnArg: 2.285 ± 0.085
AsnSer: 4.551 ± 0.176
AsnThr: 3.998 ± 0.151
AsnVal: 3.823 ± 0.108
AsnTrp: 0.409 ± 0.042
AsnTyr: 2.541 ± 0.125
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.245 ± 0.076
ProCys: 0.424 ± 0.036
ProAsp: 2.058 ± 0.102
ProGlu: 2.077 ± 0.102
ProPhe: 1.412 ± 0.063
ProGly: 1.754 ± 0.087
ProHis: 0.591 ± 0.051
ProIle: 2.98 ± 0.117
ProLys: 1.902 ± 0.093
ProLeu: 2.582 ± 0.097
ProMet: 0.795 ± 0.052
ProAsn: 1.78 ± 0.086
ProPro: 0.985 ± 0.076
ProGln: 0.851 ± 0.059
ProArg: 0.866 ± 0.057
ProSer: 2.382 ± 0.097
ProThr: 1.412 ± 0.09
ProVal: 2.426 ± 0.103
ProTrp: 0.182 ± 0.029
ProTyr: 1.289 ± 0.073
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.449 ± 0.082
GlnCys: 0.479 ± 0.045
GlnAsp: 2.879 ± 0.129
GlnGlu: 3.143 ± 0.125
GlnPhe: 1.107 ± 0.058
GlnGly: 1.88 ± 0.105
GlnHis: 0.933 ± 0.062
GlnIle: 2.976 ± 0.111
GlnLys: 3.221 ± 0.123
GlnLeu: 2.396 ± 0.116
GlnMet: 0.921 ± 0.051
GlnAsn: 2.675 ± 0.103
GlnPro: 0.959 ± 0.062
GlnGln: 1.802 ± 0.113
GlnArg: 1.334 ± 0.084
GlnSer: 2.445 ± 0.113
GlnThr: 1.393 ± 0.086
GlnVal: 2.318 ± 0.113
GlnTrp: 0.208 ± 0.03
GlnTyr: 1.999 ± 0.084
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.895 ± 0.09
ArgCys: 0.509 ± 0.044
ArgAsp: 2.285 ± 0.104
ArgGlu: 2.17 ± 0.089
ArgPhe: 1.824 ± 0.086
ArgGly: 1.835 ± 0.096
ArgHis: 0.71 ± 0.046
ArgIle: 3.727 ± 0.109
ArgLys: 3.244 ± 0.112
ArgLeu: 3.37 ± 0.138
ArgMet: 1.159 ± 0.067
ArgAsn: 2.534 ± 0.109
ArgPro: 0.962 ± 0.056
ArgGln: 1.185 ± 0.07
ArgArg: 1.672 ± 0.093
ArgSer: 2.965 ± 0.114
ArgThr: 1.75 ± 0.09
ArgVal: 2.504 ± 0.103
ArgTrp: 0.242 ± 0.027
ArgTyr: 1.913 ± 0.076
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.708 ± 0.113
SerCys: 1.252 ± 0.063
SerAsp: 4.522 ± 0.143
SerGlu: 4.329 ± 0.125
SerPhe: 4.002 ± 0.122
SerGly: 4.678 ± 0.165
SerHis: 1.679 ± 0.064
SerIle: 8.011 ± 0.204
SerLys: 5.15 ± 0.161
SerLeu: 8.081 ± 0.188
SerMet: 2.136 ± 0.09
SerAsn: 4.975 ± 0.177
SerPro: 2.107 ± 0.109
SerGln: 2.842 ± 0.122
SerArg: 2.879 ± 0.106
SerSer: 6.654 ± 0.168
SerThr: 3.86 ± 0.141
SerVal: 5.989 ± 0.164
SerTrp: 0.587 ± 0.053
SerTyr: 3.426 ± 0.118
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.478 ± 0.103
ThrCys: 0.784 ± 0.053
ThrAsp: 2.783 ± 0.118
ThrGlu: 2.805 ± 0.114
ThrPhe: 1.854 ± 0.094
ThrGly: 3.069 ± 0.121
ThrHis: 1.07 ± 0.061
ThrIle: 4.73 ± 0.131
ThrLys: 3.258 ± 0.101
ThrLeu: 4.934 ± 0.138
ThrMet: 1.278 ± 0.067
ThrAsn: 2.787 ± 0.122
ThrPro: 1.936 ± 0.099
ThrGln: 2.058 ± 0.117
ThrArg: 1.887 ± 0.091
ThrSer: 3.849 ± 0.134
ThrThr: 3.188 ± 0.153
ThrVal: 3.663 ± 0.111
ThrTrp: 0.282 ± 0.037
ThrTyr: 1.739 ± 0.076
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.604 ± 0.12
ValCys: 1.074 ± 0.071
ValAsp: 3.875 ± 0.124
ValGlu: 3.556 ± 0.112
ValPhe: 3.548 ± 0.126
ValGly: 3.262 ± 0.135
ValHis: 1.367 ± 0.075
ValIle: 6.138 ± 0.168
ValLys: 4.187 ± 0.116
ValLeu: 7.394 ± 0.192
ValMet: 1.813 ± 0.086
ValAsn: 3.753 ± 0.126
ValPro: 2.04 ± 0.086
ValGln: 2.144 ± 0.092
ValArg: 2.46 ± 0.098
ValSer: 5.659 ± 0.153
ValThr: 3.299 ± 0.13
ValVal: 4.812 ± 0.172
ValTrp: 0.431 ± 0.042
ValTyr: 2.556 ± 0.097
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.249 ± 0.033
TrpCys: 0.104 ± 0.018
TrpAsp: 0.305 ± 0.038
TrpGlu: 0.297 ± 0.034
TrpPhe: 0.349 ± 0.042
TrpGly: 0.294 ± 0.043
TrpHis: 0.13 ± 0.02
TrpIle: 0.583 ± 0.05
TrpLys: 0.487 ± 0.043
TrpLeu: 0.702 ± 0.05
TrpMet: 0.145 ± 0.023
TrpAsn: 0.442 ± 0.045
TrpPro: 0.134 ± 0.022
TrpGln: 0.245 ± 0.033
TrpArg: 0.308 ± 0.035
TrpSer: 0.453 ± 0.045
TrpThr: 0.208 ± 0.029
TrpVal: 0.424 ± 0.036
TrpTrp: 0.056 ± 0.015
TrpTyr: 0.342 ± 0.032
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.073 ± 0.087
TyrCys: 0.613 ± 0.045
TyrAsp: 3.11 ± 0.117
TyrGlu: 1.969 ± 0.088
TyrPhe: 2.088 ± 0.092
TyrGly: 2.289 ± 0.104
TyrHis: 1.137 ± 0.076
TyrIle: 4.169 ± 0.144
TyrLys: 3.114 ± 0.104
TyrLeu: 3.831 ± 0.13
TyrMet: 1.141 ± 0.069
TyrAsn: 3.121 ± 0.12
TyrPro: 1.312 ± 0.081
TyrGln: 1.713 ± 0.079
TyrArg: 1.724 ± 0.086
TyrSer: 3.864 ± 0.117
TyrThr: 2.411 ± 0.096
TyrVal: 2.571 ± 0.101
TyrTrp: 0.271 ± 0.034
TyrTyr: 1.943 ± 0.105
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 956 proteins (269146 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
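A protein-level bootstrap of this kind might look like the sketch below. The page only states that bootstrapping was done 100 times at the protein level. Resampling whole proteins with replacement and reporting the standard deviation of the replicate estimates as the error are assumptions made here, and `bootstrap_dipeptide` is a hypothetical helper.

```python
import random
from collections import Counter
from statistics import mean, stdev

def bootstrap_dipeptide(proteins, pair, n_boot=100, seed=0):
    """Return (mean, std dev) of a dipeptide frequency in per mille.

    Assumption: each replicate resamples whole proteins with replacement.
    """
    rng = random.Random(seed)
    # Count overlapping pairs once per protein, then resample the counts.
    per_protein = [
        Counter(seq[i:i + 2] for i in range(len(seq) - 1)) for seq in proteins
    ]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return mean(estimates), stdev(estimates)

# Hypothetical toy proteome, for illustration only.
avg, err = bootstrap_dipeptide(["MKKL", "MKL", "LLLL"], "LL")
```

Resampling at the protein level keeps each protein's dipeptides together, so the spread reflects variation between proteins and not between individual residue pairs.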

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski