Amino acid dipepetide frequency for Methyloceanibacter methanicus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
16.143AlaAla: 16.143 ± 0.243
1.163AlaCys: 1.163 ± 0.034
6.869AlaAsp: 6.869 ± 0.121
8.036AlaGlu: 8.036 ± 0.124
4.274AlaPhe: 4.274 ± 0.076
10.021AlaGly: 10.021 ± 0.136
2.304AlaHis: 2.304 ± 0.054
5.841AlaIle: 5.841 ± 0.101
4.754AlaLys: 4.754 ± 0.103
13.576AlaLeu: 13.576 ± 0.176
3.323AlaMet: 3.323 ± 0.067
2.862AlaAsn: 2.862 ± 0.065
6.135AlaPro: 6.135 ± 0.111
3.821AlaGln: 3.821 ± 0.075
8.676AlaArg: 8.676 ± 0.149
5.992AlaSer: 5.992 ± 0.099
5.706AlaThr: 5.706 ± 0.079
8.672AlaVal: 8.672 ± 0.12
1.5AlaTrp: 1.5 ± 0.045
2.751AlaTyr: 2.751 ± 0.064
0.0AlaXaa: 0.0 ± 0.0
Cys
1.102CysAla: 1.102 ± 0.044
0.137CysCys: 0.137 ± 0.013
0.546CysAsp: 0.546 ± 0.03
0.476CysGlu: 0.476 ± 0.028
0.343CysPhe: 0.343 ± 0.019
1.036CysGly: 1.036 ± 0.038
0.265CysHis: 0.265 ± 0.021
0.381CysIle: 0.381 ± 0.022
0.259CysLys: 0.259 ± 0.021
0.873CysLeu: 0.873 ± 0.036
0.194CysMet: 0.194 ± 0.016
0.267CysAsn: 0.267 ± 0.018
0.553CysPro: 0.553 ± 0.03
0.255CysGln: 0.255 ± 0.018
0.731CysArg: 0.731 ± 0.031
0.513CysSer: 0.513 ± 0.027
0.45CysThr: 0.45 ± 0.026
0.652CysVal: 0.652 ± 0.033
0.122CysTrp: 0.122 ± 0.013
0.204CysTyr: 0.204 ± 0.016
0.0CysXaa: 0.0 ± 0.0
Asp
7.37AspAla: 7.37 ± 0.128
0.523AspCys: 0.523 ± 0.026
3.604AspAsp: 3.604 ± 0.092
4.043AspGlu: 4.043 ± 0.086
2.223AspPhe: 2.223 ± 0.052
5.499AspGly: 5.499 ± 0.101
1.192AspHis: 1.192 ± 0.041
2.992AspIle: 2.992 ± 0.064
2.118AspLys: 2.118 ± 0.061
5.991AspLeu: 5.991 ± 0.113
1.371AspMet: 1.371 ± 0.039
1.296AspAsn: 1.296 ± 0.047
3.488AspPro: 3.488 ± 0.078
1.765AspGln: 1.765 ± 0.053
4.107AspArg: 4.107 ± 0.08
2.322AspSer: 2.322 ± 0.055
2.853AspThr: 2.853 ± 0.065
4.59AspVal: 4.59 ± 0.082
0.991AspTrp: 0.991 ± 0.034
1.613AspTyr: 1.613 ± 0.046
0.0AspXaa: 0.0 ± 0.0
Glu
8.335GluAla: 8.335 ± 0.144
0.408GluCys: 0.408 ± 0.024
3.649GluAsp: 3.649 ± 0.08
3.754GluGlu: 3.754 ± 0.087
1.67GluPhe: 1.67 ± 0.049
4.571GluGly: 4.571 ± 0.083
1.233GluHis: 1.233 ± 0.044
3.555GluIle: 3.555 ± 0.066
2.464GluLys: 2.464 ± 0.058
5.489GluLeu: 5.489 ± 0.092
1.405GluMet: 1.405 ± 0.044
1.595GluAsn: 1.595 ± 0.048
3.204GluPro: 3.204 ± 0.083
2.015GluGln: 2.015 ± 0.05
4.96GluArg: 4.96 ± 0.081
2.749GluSer: 2.749 ± 0.062
4.15GluThr: 4.15 ± 0.082
3.999GluVal: 3.999 ± 0.085
0.702GluTrp: 0.702 ± 0.029
1.045GluTyr: 1.045 ± 0.041
0.0GluXaa: 0.0 ± 0.0
Phe
4.521PheAla: 4.521 ± 0.091
0.394PheCys: 0.394 ± 0.023
2.642PheAsp: 2.642 ± 0.062
2.309PheGlu: 2.309 ± 0.053
1.422PhePhe: 1.422 ± 0.045
3.687PheGly: 3.687 ± 0.07
0.667PheHis: 0.667 ± 0.028
1.56PheIle: 1.56 ± 0.052
1.125PheLys: 1.125 ± 0.039
3.31PheLeu: 3.31 ± 0.076
0.765PheMet: 0.765 ± 0.034
0.956PheAsn: 0.956 ± 0.036
1.578PhePro: 1.578 ± 0.042
0.993PheGln: 0.993 ± 0.036
2.219PheArg: 2.219 ± 0.053
2.166PheSer: 2.166 ± 0.056
1.876PheThr: 1.876 ± 0.045
3.043PheVal: 3.043 ± 0.069
0.579PheTrp: 0.579 ± 0.027
0.95PheTyr: 0.95 ± 0.034
0.0PheXaa: 0.0 ± 0.0
Gly
9.365GlyAla: 9.365 ± 0.124
0.942GlyCys: 0.942 ± 0.039
4.819GlyAsp: 4.819 ± 0.088
4.967GlyGlu: 4.967 ± 0.089
3.574GlyPhe: 3.574 ± 0.066
7.21GlyGly: 7.21 ± 0.147
1.989GlyHis: 1.989 ± 0.049
4.195GlyIle: 4.195 ± 0.076
3.441GlyLys: 3.441 ± 0.081
8.974GlyLeu: 8.974 ± 0.124
2.052GlyMet: 2.052 ± 0.047
2.118GlyAsn: 2.118 ± 0.064
3.861GlyPro: 3.861 ± 0.077
2.798GlyGln: 2.798 ± 0.078
5.837GlyArg: 5.837 ± 0.095
4.547GlySer: 4.547 ± 0.082
4.921GlyThr: 4.921 ± 0.088
6.058GlyVal: 6.058 ± 0.102
1.242GlyTrp: 1.242 ± 0.041
2.261GlyTyr: 2.261 ± 0.058
0.0GlyXaa: 0.0 ± 0.0
His
2.36HisAla: 2.36 ± 0.059
0.228HisCys: 0.228 ± 0.016
1.276HisAsp: 1.276 ± 0.04
1.184HisGlu: 1.184 ± 0.043
0.865HisPhe: 0.865 ± 0.037
1.963HisGly: 1.963 ± 0.06
0.552HisHis: 0.552 ± 0.032
0.902HisIle: 0.902 ± 0.033
0.588HisLys: 0.588 ± 0.024
2.019HisLeu: 2.019 ± 0.05
0.455HisMet: 0.455 ± 0.024
0.509HisAsn: 0.509 ± 0.025
1.274HisPro: 1.274 ± 0.041
0.524HisGln: 0.524 ± 0.027
1.358HisArg: 1.358 ± 0.04
0.863HisSer: 0.863 ± 0.034
0.876HisThr: 0.876 ± 0.034
1.523HisVal: 1.523 ± 0.043
0.304HisTrp: 0.304 ± 0.018
0.578HisTyr: 0.578 ± 0.025
0.0HisXaa: 0.0 ± 0.0
Ile
6.849IleAla: 6.849 ± 0.105
0.5IleCys: 0.5 ± 0.023
3.429IleAsp: 3.429 ± 0.062
3.594IleGlu: 3.594 ± 0.061
1.577IlePhe: 1.577 ± 0.057
4.703IleGly: 4.703 ± 0.089
0.891IleHis: 0.891 ± 0.037
1.824IleIle: 1.824 ± 0.053
1.62IleLys: 1.62 ± 0.052
4.167IleLeu: 4.167 ± 0.077
0.887IleMet: 0.887 ± 0.031
1.133IleAsn: 1.133 ± 0.04
2.323IlePro: 2.323 ± 0.059
1.111IleGln: 1.111 ± 0.038
2.767IleArg: 2.767 ± 0.059
2.473IleSer: 2.473 ± 0.067
2.408IleThr: 2.408 ± 0.066
4.5IleVal: 4.5 ± 0.091
0.55IleTrp: 0.55 ± 0.026
1.099IleTyr: 1.099 ± 0.041
0.0IleXaa: 0.0 ± 0.0
Lys
4.41LysAla: 4.41 ± 0.086
0.22LysCys: 0.22 ± 0.017
2.334LysAsp: 2.334 ± 0.056
2.158LysGlu: 2.158 ± 0.054
1.002LysPhe: 1.002 ± 0.036
2.961LysGly: 2.961 ± 0.069
0.675LysHis: 0.675 ± 0.034
1.846LysIle: 1.846 ± 0.056
1.691LysLys: 1.691 ± 0.053
3.507LysLeu: 3.507 ± 0.075
0.761LysMet: 0.761 ± 0.035
1.008LysAsn: 1.008 ± 0.039
2.251LysPro: 2.251 ± 0.054
1.132LysGln: 1.132 ± 0.042
2.722LysArg: 2.722 ± 0.066
2.062LysSer: 2.062 ± 0.056
2.446LysThr: 2.446 ± 0.071
2.671LysVal: 2.671 ± 0.066
0.39LysTrp: 0.39 ± 0.027
0.726LysTyr: 0.726 ± 0.034
0.0LysXaa: 0.0 ± 0.0
Leu
13.113LeuAla: 13.113 ± 0.148
0.916LeuCys: 0.916 ± 0.037
6.066LeuAsp: 6.066 ± 0.094
5.619LeuGlu: 5.619 ± 0.092
3.657LeuPhe: 3.657 ± 0.084
8.455LeuGly: 8.455 ± 0.132
1.842LeuHis: 1.842 ± 0.045
4.934LeuIle: 4.934 ± 0.099
4.116LeuLys: 4.116 ± 0.081
9.333LeuLeu: 9.333 ± 0.149
2.126LeuMet: 2.126 ± 0.049
2.585LeuAsn: 2.585 ± 0.058
5.235LeuPro: 5.235 ± 0.09
2.763LeuGln: 2.763 ± 0.062
6.635LeuArg: 6.635 ± 0.102
6.099LeuSer: 6.099 ± 0.098
5.728LeuThr: 5.728 ± 0.085
7.443LeuVal: 7.443 ± 0.112
1.153LeuTrp: 1.153 ± 0.042
2.226LeuTyr: 2.226 ± 0.051
0.0LeuXaa: 0.0 ± 0.0
Met
2.888MetAla: 2.888 ± 0.063
0.165MetCys: 0.165 ± 0.013
1.151MetAsp: 1.151 ± 0.037
1.179MetGlu: 1.179 ± 0.04
0.736MetPhe: 0.736 ± 0.03
1.737MetGly: 1.737 ± 0.047
0.467MetHis: 0.467 ± 0.024
1.227MetIle: 1.227 ± 0.042
0.859MetLys: 0.859 ± 0.032
2.266MetLeu: 2.266 ± 0.06
0.532MetMet: 0.532 ± 0.025
0.63MetAsn: 0.63 ± 0.029
1.451MetPro: 1.451 ± 0.049
0.736MetGln: 0.736 ± 0.033
1.884MetArg: 1.884 ± 0.044
1.514MetSer: 1.514 ± 0.039
1.833MetThr: 1.833 ± 0.047
1.572MetVal: 1.572 ± 0.049
0.22MetTrp: 0.22 ± 0.017
0.304MetTyr: 0.304 ± 0.023
0.0MetXaa: 0.0 ± 0.0
Asn
3.086AsnAla: 3.086 ± 0.065
0.271AsnCys: 0.271 ± 0.018
1.487AsnAsp: 1.487 ± 0.052
1.385AsnGlu: 1.385 ± 0.049
0.926AsnPhe: 0.926 ± 0.038
2.282AsnGly: 2.282 ± 0.059
0.429AsnHis: 0.429 ± 0.02
1.215AsnIle: 1.215 ± 0.044
0.825AsnLys: 0.825 ± 0.032
2.344AsnLeu: 2.344 ± 0.058
0.637AsnMet: 0.637 ± 0.028
0.632AsnAsn: 0.632 ± 0.032
1.76AsnPro: 1.76 ± 0.047
0.714AsnGln: 0.714 ± 0.03
1.729AsnArg: 1.729 ± 0.049
1.073AsnSer: 1.073 ± 0.042
1.24AsnThr: 1.24 ± 0.043
2.076AsnVal: 2.076 ± 0.056
0.397AsnTrp: 0.397 ± 0.024
0.674AsnTyr: 0.674 ± 0.033
0.0AsnXaa: 0.0 ± 0.0
Pro
5.894ProAla: 5.894 ± 0.108
0.428ProCys: 0.428 ± 0.025
3.879ProAsp: 3.879 ± 0.072
4.182ProGlu: 4.182 ± 0.084
1.998ProPhe: 1.998 ± 0.053
4.743ProGly: 4.743 ± 0.088
1.155ProHis: 1.155 ± 0.042
2.31ProIle: 2.31 ± 0.06
2.13ProLys: 2.13 ± 0.059
4.705ProLeu: 4.705 ± 0.085
1.238ProMet: 1.238 ± 0.04
1.454ProAsn: 1.454 ± 0.047
2.771ProPro: 2.771 ± 0.079
1.752ProGln: 1.752 ± 0.061
3.26ProArg: 3.26 ± 0.077
3.025ProSer: 3.025 ± 0.062
2.534ProThr: 2.534 ± 0.057
4.164ProVal: 4.164 ± 0.084
0.686ProTrp: 0.686 ± 0.033
1.339ProTyr: 1.339 ± 0.047
0.0ProXaa: 0.0 ± 0.0
Gln
3.738GlnAla: 3.738 ± 0.075
0.229GlnCys: 0.229 ± 0.017
1.621GlnAsp: 1.621 ± 0.049
1.613GlnGlu: 1.613 ± 0.046
1.045GlnPhe: 1.045 ± 0.038
2.403GlnGly: 2.403 ± 0.065
0.627GlnHis: 0.627 ± 0.032
1.583GlnIle: 1.583 ± 0.053
1.084GlnLys: 1.084 ± 0.039
2.788GlnLeu: 2.788 ± 0.062
0.787GlnMet: 0.787 ± 0.032
0.843GlnAsn: 0.843 ± 0.036
1.616GlnPro: 1.616 ± 0.06
1.052GlnGln: 1.052 ± 0.044
2.243GlnArg: 2.243 ± 0.058
1.746GlnSer: 1.746 ± 0.049
1.785GlnThr: 1.785 ± 0.048
2.193GlnVal: 2.193 ± 0.066
0.354GlnTrp: 0.354 ± 0.021
0.639GlnTyr: 0.639 ± 0.035
0.0GlnXaa: 0.0 ± 0.0
Arg
8.025ArgAla: 8.025 ± 0.124
0.602ArgCys: 0.602 ± 0.031
4.029ArgAsp: 4.029 ± 0.058
4.073ArgGlu: 4.073 ± 0.069
2.848ArgPhe: 2.848 ± 0.066
4.864ArgGly: 4.864 ± 0.092
1.622ArgHis: 1.622 ± 0.045
3.626ArgIle: 3.626 ± 0.076
2.472ArgLys: 2.472 ± 0.062
7.711ArgLeu: 7.711 ± 0.11
1.672ArgMet: 1.672 ± 0.05
1.794ArgAsn: 1.794 ± 0.048
3.797ArgPro: 3.797 ± 0.09
2.268ArgGln: 2.268 ± 0.068
5.498ArgArg: 5.498 ± 0.108
3.686ArgSer: 3.686 ± 0.074
3.354ArgThr: 3.354 ± 0.072
4.676ArgVal: 4.676 ± 0.074
0.929ArgTrp: 0.929 ± 0.039
1.823ArgTyr: 1.823 ± 0.048
0.0ArgXaa: 0.0 ± 0.0
Ser
6.138SerAla: 6.138 ± 0.089
0.49SerCys: 0.49 ± 0.027
3.048SerAsp: 3.048 ± 0.062
3.029SerGlu: 3.029 ± 0.068
2.176SerPhe: 2.176 ± 0.051
5.254SerGly: 5.254 ± 0.098
1.056SerHis: 1.056 ± 0.044
2.383SerIle: 2.383 ± 0.058
1.836SerLys: 1.836 ± 0.062
5.241SerLeu: 5.241 ± 0.098
1.357SerMet: 1.357 ± 0.045
1.341SerAsn: 1.341 ± 0.039
2.996SerPro: 2.996 ± 0.067
1.575SerGln: 1.575 ± 0.046
3.506SerArg: 3.506 ± 0.077
2.876SerSer: 2.876 ± 0.072
2.649SerThr: 2.649 ± 0.058
3.87SerVal: 3.87 ± 0.074
0.712SerTrp: 0.712 ± 0.034
1.244SerTyr: 1.244 ± 0.039
0.0SerXaa: 0.0 ± 0.0
Thr
6.269ThrAla: 6.269 ± 0.105
0.529ThrCys: 0.529 ± 0.028
2.958ThrAsp: 2.958 ± 0.07
2.99ThrGlu: 2.99 ± 0.064
2.074ThrPhe: 2.074 ± 0.053
5.033ThrGly: 5.033 ± 0.09
1.019ThrHis: 1.019 ± 0.042
2.731ThrIle: 2.731 ± 0.058
1.867ThrLys: 1.867 ± 0.05
5.848ThrLeu: 5.848 ± 0.08
1.233ThrMet: 1.233 ± 0.044
1.276ThrAsn: 1.276 ± 0.046
3.36ThrPro: 3.36 ± 0.072
1.453ThrGln: 1.453 ± 0.042
3.347ThrArg: 3.347 ± 0.067
2.771ThrSer: 2.771 ± 0.064
2.896ThrThr: 2.896 ± 0.072
4.48ThrVal: 4.48 ± 0.082
0.645ThrTrp: 0.645 ± 0.032
1.283ThrTyr: 1.283 ± 0.048
0.0ThrXaa: 0.0 ± 0.0
Val
9.036ValAla: 9.036 ± 0.106
0.73ValCys: 0.73 ± 0.03
4.342ValAsp: 4.342 ± 0.084
4.521ValGlu: 4.521 ± 0.068
2.803ValPhe: 2.803 ± 0.062
5.593ValGly: 5.593 ± 0.091
1.4ValHis: 1.4 ± 0.041
3.722ValIle: 3.722 ± 0.074
2.541ValLys: 2.541 ± 0.059
7.973ValLeu: 7.973 ± 0.135
1.707ValMet: 1.707 ± 0.048
1.912ValAsn: 1.912 ± 0.057
4.138ValPro: 4.138 ± 0.075
1.988ValGln: 1.988 ± 0.048
4.834ValArg: 4.834 ± 0.078
4.395ValSer: 4.395 ± 0.074
4.478ValThr: 4.478 ± 0.072
6.086ValVal: 6.086 ± 0.112
0.934ValTrp: 0.934 ± 0.043
1.719ValTyr: 1.719 ± 0.05
0.0ValXaa: 0.0 ± 0.0
Trp
1.172TrpAla: 1.172 ± 0.042
0.16TrpCys: 0.16 ± 0.017
0.639TrpAsp: 0.639 ± 0.027
0.523TrpGlu: 0.523 ± 0.028
0.527TrpPhe: 0.527 ± 0.027
0.92TrpGly: 0.92 ± 0.039
0.325TrpHis: 0.325 ± 0.02
0.666TrpIle: 0.666 ± 0.029
0.451TrpLys: 0.451 ± 0.027
1.579TrpLeu: 1.579 ± 0.044
0.345TrpMet: 0.345 ± 0.024
0.364TrpAsn: 0.364 ± 0.023
0.712TrpPro: 0.712 ± 0.031
0.523TrpGln: 0.523 ± 0.028
1.199TrpArg: 1.199 ± 0.038
0.812TrpSer: 0.812 ± 0.037
0.791TrpThr: 0.791 ± 0.034
0.797TrpVal: 0.797 ± 0.033
0.232TrpTrp: 0.232 ± 0.017
0.333TrpTyr: 0.333 ± 0.023
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.604TyrAla: 2.604 ± 0.056
0.324TyrCys: 0.324 ± 0.02
1.561TyrAsp: 1.561 ± 0.054
1.319TyrGlu: 1.319 ± 0.045
1.021TyrPhe: 1.021 ± 0.043
2.238TyrGly: 2.238 ± 0.064
0.488TyrHis: 0.488 ± 0.025
0.912TyrIle: 0.912 ± 0.033
0.734TyrLys: 0.734 ± 0.032
2.348TyrLeu: 2.348 ± 0.063
0.502TyrMet: 0.502 ± 0.022
0.608TyrAsn: 0.608 ± 0.028
1.147TyrPro: 1.147 ± 0.039
0.748TyrGln: 0.748 ± 0.033
1.854TyrArg: 1.854 ± 0.045
1.097TyrSer: 1.097 ± 0.044
1.09TyrThr: 1.09 ± 0.043
1.786TyrVal: 1.786 ± 0.044
0.428TyrTrp: 0.428 ± 0.023
0.654TyrTyr: 0.654 ± 0.031
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2710 proteins (768685 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski