Amino acid dipepetide frequency for Candidatus Marinamargulisbacteria bacterium SCGC AG-410-N11

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.696AlaAla: 2.696 ± 0.104
0.583AlaCys: 0.583 ± 0.041
2.254AlaAsp: 2.254 ± 0.116
2.287AlaGlu: 2.287 ± 0.084
2.216AlaPhe: 2.216 ± 0.078
2.782AlaGly: 2.782 ± 0.111
0.869AlaHis: 0.869 ± 0.05
4.818AlaIle: 4.818 ± 0.113
3.715AlaLys: 3.715 ± 0.109
4.646AlaLeu: 4.646 ± 0.127
1.019AlaMet: 1.019 ± 0.054
2.951AlaAsn: 2.951 ± 0.093
1.377AlaPro: 1.377 ± 0.06
1.692AlaGln: 1.692 ± 0.072
1.504AlaArg: 1.504 ± 0.068
3.513AlaSer: 3.513 ± 0.084
2.969AlaThr: 2.969 ± 0.118
2.673AlaVal: 2.673 ± 0.09
0.328AlaTrp: 0.328 ± 0.027
1.514AlaTyr: 1.514 ± 0.064
0.0AlaXaa: 0.0 ± 0.0
Cys
0.414CysAla: 0.414 ± 0.024
0.202CysCys: 0.202 ± 0.023
0.573CysAsp: 0.573 ± 0.035
0.465CysGlu: 0.465 ± 0.031
0.689CysPhe: 0.689 ± 0.044
0.712CysGly: 0.712 ± 0.039
0.279CysHis: 0.279 ± 0.028
0.907CysIle: 0.907 ± 0.049
0.663CysLys: 0.663 ± 0.044
1.261CysLeu: 1.261 ± 0.055
0.193CysMet: 0.193 ± 0.021
0.603CysAsn: 0.603 ± 0.042
0.375CysPro: 0.375 ± 0.035
0.465CysGln: 0.465 ± 0.034
0.375CysArg: 0.375 ± 0.032
0.849CysSer: 0.849 ± 0.049
0.498CysThr: 0.498 ± 0.035
0.556CysVal: 0.556 ± 0.036
0.105CysTrp: 0.105 ± 0.016
0.47CysTyr: 0.47 ± 0.038
0.0CysXaa: 0.0 ± 0.0
Asp
2.319AspAla: 2.319 ± 0.1
0.592AspCys: 0.592 ± 0.038
2.921AspAsp: 2.921 ± 0.137
2.574AspGlu: 2.574 ± 0.094
2.786AspPhe: 2.786 ± 0.089
2.649AspGly: 2.649 ± 0.191
1.306AspHis: 1.306 ± 0.052
5.734AspIle: 5.734 ± 0.143
3.773AspLys: 3.773 ± 0.09
5.783AspLeu: 5.783 ± 0.136
0.995AspMet: 0.995 ± 0.051
3.428AspAsn: 3.428 ± 0.117
2.078AspPro: 2.078 ± 0.093
2.799AspGln: 2.799 ± 0.097
1.909AspArg: 1.909 ± 0.07
4.391AspSer: 4.391 ± 0.128
3.145AspThr: 3.145 ± 0.123
2.951AspVal: 2.951 ± 0.121
0.506AspTrp: 0.506 ± 0.038
2.565AspTyr: 2.565 ± 0.087
0.0AspXaa: 0.0 ± 0.0
Glu
2.417GluAla: 2.417 ± 0.101
0.504GluCys: 0.504 ± 0.033
2.872GluAsp: 2.872 ± 0.111
3.007GluGlu: 3.007 ± 0.121
2.555GluPhe: 2.555 ± 0.09
2.443GluGly: 2.443 ± 0.087
0.982GluHis: 0.982 ± 0.052
4.335GluIle: 4.335 ± 0.121
4.989GluLys: 4.989 ± 0.142
6.145GluLeu: 6.145 ± 0.13
1.042GluMet: 1.042 ± 0.05
3.475GluAsn: 3.475 ± 0.113
1.536GluPro: 1.536 ± 0.061
2.205GluGln: 2.205 ± 0.089
1.873GluArg: 1.873 ± 0.071
4.586GluSer: 4.586 ± 0.116
3.32GluThr: 3.32 ± 0.098
2.668GluVal: 2.668 ± 0.096
0.468GluTrp: 0.468 ± 0.032
1.87GluTyr: 1.87 ± 0.055
0.0GluXaa: 0.0 ± 0.0
Phe
1.697PheAla: 1.697 ± 0.072
0.804PheCys: 0.804 ± 0.045
2.909PheAsp: 2.909 ± 0.094
2.861PheGlu: 2.861 ± 0.091
2.874PhePhe: 2.874 ± 0.093
2.557PheGly: 2.557 ± 0.079
0.909PheHis: 0.909 ± 0.047
4.369PheIle: 4.369 ± 0.133
4.601PheLys: 4.601 ± 0.118
5.131PheLeu: 5.131 ± 0.133
0.952PheMet: 0.952 ± 0.049
3.816PheAsn: 3.816 ± 0.099
1.532PhePro: 1.532 ± 0.063
1.8PheGln: 1.8 ± 0.069
1.555PheArg: 1.555 ± 0.06
4.575PheSer: 4.575 ± 0.118
2.533PheThr: 2.533 ± 0.072
2.415PheVal: 2.415 ± 0.077
0.474PheTrp: 0.474 ± 0.032
2.115PheTyr: 2.115 ± 0.074
0.0PheXaa: 0.0 ± 0.0
Gly
2.894GlyAla: 2.894 ± 0.115
0.689GlyCys: 0.689 ± 0.039
2.728GlyAsp: 2.728 ± 0.125
2.304GlyGlu: 2.304 ± 0.09
2.866GlyPhe: 2.866 ± 0.094
3.258GlyGly: 3.258 ± 0.14
1.225GlyHis: 1.225 ± 0.058
4.921GlyIle: 4.921 ± 0.172
3.361GlyLys: 3.361 ± 0.093
5.283GlyLeu: 5.283 ± 0.126
1.096GlyMet: 1.096 ± 0.05
2.945GlyAsn: 2.945 ± 0.155
1.431GlyPro: 1.431 ± 0.055
1.697GlyGln: 1.697 ± 0.068
1.69GlyArg: 1.69 ± 0.076
3.837GlySer: 3.837 ± 0.109
3.211GlyThr: 3.211 ± 0.156
3.426GlyVal: 3.426 ± 0.107
0.515GlyTrp: 0.515 ± 0.031
2.126GlyTyr: 2.126 ± 0.071
0.0GlyXaa: 0.0 ± 0.0
His
0.8HisAla: 0.8 ± 0.047
0.285HisCys: 0.285 ± 0.024
0.98HisAsp: 0.98 ± 0.043
0.832HisGlu: 0.832 ± 0.048
1.158HisPhe: 1.158 ± 0.052
0.935HisGly: 0.935 ± 0.045
0.624HisHis: 0.624 ± 0.04
1.821HisIle: 1.821 ± 0.066
1.313HisLys: 1.313 ± 0.062
2.372HisLeu: 2.372 ± 0.073
0.311HisMet: 0.311 ± 0.023
1.281HisAsn: 1.281 ± 0.058
1.09HisPro: 1.09 ± 0.049
0.978HisGln: 0.978 ± 0.05
0.712HisArg: 0.712 ± 0.038
1.695HisSer: 1.695 ± 0.071
0.955HisThr: 0.955 ± 0.05
1.175HisVal: 1.175 ± 0.058
0.234HisTrp: 0.234 ± 0.023
1.032HisTyr: 1.032 ± 0.055
0.0HisXaa: 0.0 ± 0.0
Ile
4.329IleAla: 4.329 ± 0.127
1.012IleCys: 1.012 ± 0.053
5.478IleAsp: 5.478 ± 0.134
5.298IleGlu: 5.298 ± 0.12
4.138IlePhe: 4.138 ± 0.123
5.069IleGly: 5.069 ± 0.189
1.737IleHis: 1.737 ± 0.063
7.887IleIle: 7.887 ± 0.236
8.342IleLys: 8.342 ± 0.181
8.462IleLeu: 8.462 ± 0.193
1.375IleMet: 1.375 ± 0.058
7.033IleAsn: 7.033 ± 0.176
3.327IlePro: 3.327 ± 0.094
3.981IleGln: 3.981 ± 0.108
2.866IleArg: 2.866 ± 0.086
7.385IleSer: 7.385 ± 0.138
5.457IleThr: 5.457 ± 0.156
4.399IleVal: 4.399 ± 0.127
0.577IleTrp: 0.577 ± 0.033
2.771IleTyr: 2.771 ± 0.098
0.0IleXaa: 0.0 ± 0.0
Lys
3.612LysAla: 3.612 ± 0.112
0.493LysCys: 0.493 ± 0.039
4.921LysAsp: 4.921 ± 0.13
5.751LysGlu: 5.751 ± 0.128
2.758LysPhe: 2.758 ± 0.094
3.863LysGly: 3.863 ± 0.101
1.69LysHis: 1.69 ± 0.062
6.866LysIle: 6.866 ± 0.157
8.423LysLys: 8.423 ± 0.214
8.069LysLeu: 8.069 ± 0.159
1.609LysMet: 1.609 ± 0.052
5.976LysAsn: 5.976 ± 0.133
2.797LysPro: 2.797 ± 0.099
4.464LysGln: 4.464 ± 0.137
3.067LysArg: 3.067 ± 0.107
6.497LysSer: 6.497 ± 0.128
4.972LysThr: 4.972 ± 0.113
4.397LysVal: 4.397 ± 0.103
0.693LysTrp: 0.693 ± 0.037
2.675LysTyr: 2.675 ± 0.085
0.0LysXaa: 0.0 ± 0.0
Leu
5.212LeuAla: 5.212 ± 0.139
1.0LeuCys: 1.0 ± 0.044
6.047LeuAsp: 6.047 ± 0.141
5.618LeuGlu: 5.618 ± 0.148
5.744LeuPhe: 5.744 ± 0.142
5.416LeuGly: 5.416 ± 0.138
1.828LeuHis: 1.828 ± 0.076
8.895LeuIle: 8.895 ± 0.207
8.848LeuLys: 8.848 ± 0.17
9.91LeuLeu: 9.91 ± 0.232
2.102LeuMet: 2.102 ± 0.077
7.799LeuAsn: 7.799 ± 0.154
3.391LeuPro: 3.391 ± 0.097
3.338LeuGln: 3.338 ± 0.093
2.969LeuArg: 2.969 ± 0.102
9.434LeuSer: 9.434 ± 0.153
6.197LeuThr: 6.197 ± 0.137
5.206LeuVal: 5.206 ± 0.132
0.586LeuTrp: 0.586 ± 0.037
3.132LeuTyr: 3.132 ± 0.097
0.0LeuXaa: 0.0 ± 0.0
Met
1.201MetAla: 1.201 ± 0.063
0.178MetCys: 0.178 ± 0.021
0.942MetAsp: 0.942 ± 0.047
0.862MetGlu: 0.862 ± 0.044
0.849MetPhe: 0.849 ± 0.041
1.214MetGly: 1.214 ± 0.068
0.281MetHis: 0.281 ± 0.026
1.948MetIle: 1.948 ± 0.069
1.632MetLys: 1.632 ± 0.068
1.692MetLeu: 1.692 ± 0.063
0.373MetMet: 0.373 ± 0.032
1.092MetAsn: 1.092 ± 0.048
0.605MetPro: 0.605 ± 0.033
0.536MetGln: 0.536 ± 0.036
0.532MetArg: 0.532 ± 0.035
1.512MetSer: 1.512 ± 0.057
1.208MetThr: 1.208 ± 0.053
1.193MetVal: 1.193 ± 0.053
0.114MetTrp: 0.114 ± 0.017
0.489MetTyr: 0.489 ± 0.034
0.0MetXaa: 0.0 ± 0.0
Asn
2.784AsnAla: 2.784 ± 0.091
0.693AsnCys: 0.693 ± 0.045
3.471AsnAsp: 3.471 ± 0.12
3.522AsnGlu: 3.522 ± 0.104
2.831AsnPhe: 2.831 ± 0.085
3.134AsnGly: 3.134 ± 0.116
1.705AsnHis: 1.705 ± 0.075
6.909AsnIle: 6.909 ± 0.154
5.999AsnLys: 5.999 ± 0.147
6.739AsnLeu: 6.739 ± 0.148
1.175AsnMet: 1.175 ± 0.05
5.103AsnAsn: 5.103 ± 0.157
2.872AsnPro: 2.872 ± 0.098
4.329AsnGln: 4.329 ± 0.12
2.407AsnArg: 2.407 ± 0.079
5.152AsnSer: 5.152 ± 0.132
3.599AsnThr: 3.599 ± 0.119
3.312AsnVal: 3.312 ± 0.108
0.624AsnTrp: 0.624 ± 0.037
2.595AsnTyr: 2.595 ± 0.08
0.0AsnXaa: 0.0 ± 0.0
Pro
1.529ProAla: 1.529 ± 0.053
0.305ProCys: 0.305 ± 0.027
1.838ProAsp: 1.838 ± 0.061
2.076ProGlu: 2.076 ± 0.067
2.044ProPhe: 2.044 ± 0.075
1.641ProGly: 1.641 ± 0.071
0.68ProHis: 0.68 ± 0.038
3.333ProIle: 3.333 ± 0.091
2.857ProLys: 2.857 ± 0.11
3.496ProLeu: 3.496 ± 0.105
0.579ProMet: 0.579 ± 0.033
2.778ProAsn: 2.778 ± 0.085
1.06ProPro: 1.06 ± 0.059
1.098ProGln: 1.098 ± 0.048
0.864ProArg: 0.864 ± 0.044
2.791ProSer: 2.791 ± 0.1
2.188ProThr: 2.188 ± 0.093
2.1ProVal: 2.1 ± 0.071
0.283ProTrp: 0.283 ± 0.024
1.293ProTyr: 1.293 ± 0.059
0.0ProXaa: 0.0 ± 0.0
Gln
2.033GlnAla: 2.033 ± 0.075
0.414GlnCys: 0.414 ± 0.031
2.329GlnAsp: 2.329 ± 0.084
2.396GlnGlu: 2.396 ± 0.093
2.287GlnPhe: 2.287 ± 0.069
1.656GlnGly: 1.656 ± 0.066
1.015GlnHis: 1.015 ± 0.056
3.378GlnIle: 3.378 ± 0.086
3.769GlnLys: 3.769 ± 0.117
5.118GlnLeu: 5.118 ± 0.147
0.684GlnMet: 0.684 ± 0.039
2.919GlnAsn: 2.919 ± 0.109
1.233GlnPro: 1.233 ± 0.064
2.181GlnGln: 2.181 ± 0.086
1.396GlnArg: 1.396 ± 0.064
3.511GlnSer: 3.511 ± 0.09
2.359GlnThr: 2.359 ± 0.08
2.188GlnVal: 2.188 ± 0.064
0.493GlnTrp: 0.493 ± 0.041
1.566GlnTyr: 1.566 ± 0.065
0.0GlnXaa: 0.0 ± 0.0
Arg
1.514ArgAla: 1.514 ± 0.061
0.356ArgCys: 0.356 ± 0.029
1.516ArgAsp: 1.516 ± 0.07
1.675ArgGlu: 1.675 ± 0.071
1.918ArgPhe: 1.918 ± 0.064
1.555ArgGly: 1.555 ± 0.079
0.757ArgHis: 0.757 ± 0.038
2.645ArgIle: 2.645 ± 0.078
2.347ArgLys: 2.347 ± 0.078
3.861ArgLeu: 3.861 ± 0.099
0.648ArgMet: 0.648 ± 0.038
1.716ArgAsn: 1.716 ± 0.064
1.072ArgPro: 1.072 ± 0.045
1.542ArgGln: 1.542 ± 0.069
1.304ArgArg: 1.304 ± 0.066
2.471ArgSer: 2.471 ± 0.09
1.542ArgThr: 1.542 ± 0.063
2.109ArgVal: 2.109 ± 0.069
0.317ArgTrp: 0.317 ± 0.026
1.568ArgTyr: 1.568 ± 0.059
0.0ArgXaa: 0.0 ± 0.0
Ser
3.411SerAla: 3.411 ± 0.111
0.734SerCys: 0.734 ± 0.045
4.577SerAsp: 4.577 ± 0.126
4.258SerGlu: 4.258 ± 0.107
4.67SerPhe: 4.67 ± 0.12
4.116SerGly: 4.116 ± 0.123
1.63SerHis: 1.63 ± 0.072
7.797SerIle: 7.797 ± 0.138
6.787SerLys: 6.787 ± 0.136
8.531SerLeu: 8.531 ± 0.157
1.424SerMet: 1.424 ± 0.061
5.568SerAsn: 5.568 ± 0.145
2.829SerPro: 2.829 ± 0.109
3.492SerGln: 3.492 ± 0.106
2.454SerArg: 2.454 ± 0.07
6.761SerSer: 6.761 ± 0.178
4.489SerThr: 4.489 ± 0.106
4.374SerVal: 4.374 ± 0.107
0.624SerTrp: 0.624 ± 0.042
2.954SerTyr: 2.954 ± 0.087
0.0SerXaa: 0.0 ± 0.0
Thr
2.846ThrAla: 2.846 ± 0.105
0.616ThrCys: 0.616 ± 0.037
3.061ThrAsp: 3.061 ± 0.156
2.692ThrGlu: 2.692 ± 0.079
2.924ThrPhe: 2.924 ± 0.091
2.93ThrGly: 2.93 ± 0.093
1.165ThrHis: 1.165 ± 0.053
5.963ThrIle: 5.963 ± 0.15
4.552ThrLys: 4.552 ± 0.114
6.06ThrLeu: 6.06 ± 0.117
1.049ThrMet: 1.049 ± 0.045
3.805ThrAsn: 3.805 ± 0.125
2.625ThrPro: 2.625 ± 0.098
2.323ThrGln: 2.323 ± 0.084
1.624ThrArg: 1.624 ± 0.062
4.174ThrSer: 4.174 ± 0.126
3.477ThrThr: 3.477 ± 0.123
3.25ThrVal: 3.25 ± 0.116
0.416ThrTrp: 0.416 ± 0.03
1.896ThrTyr: 1.896 ± 0.062
0.0ThrXaa: 0.0 ± 0.0
Val
2.853ValAla: 2.853 ± 0.1
0.62ValCys: 0.62 ± 0.035
3.222ValAsp: 3.222 ± 0.11
2.67ValGlu: 2.67 ± 0.089
2.533ValPhe: 2.533 ± 0.067
3.284ValGly: 3.284 ± 0.156
0.897ValHis: 0.897 ± 0.044
4.764ValIle: 4.764 ± 0.13
4.112ValLys: 4.112 ± 0.101
4.923ValLeu: 4.923 ± 0.119
1.154ValMet: 1.154 ± 0.049
3.754ValAsn: 3.754 ± 0.102
2.016ValPro: 2.016 ± 0.061
1.675ValGln: 1.675 ± 0.068
1.731ValArg: 1.731 ± 0.072
4.807ValSer: 4.807 ± 0.139
3.303ValThr: 3.303 ± 0.105
3.325ValVal: 3.325 ± 0.106
0.41ValTrp: 0.41 ± 0.031
1.74ValTyr: 1.74 ± 0.065
0.0ValXaa: 0.0 ± 0.0
Trp
0.373TrpAla: 0.373 ± 0.029
0.103TrpCys: 0.103 ± 0.018
0.498TrpAsp: 0.498 ± 0.034
0.444TrpGlu: 0.444 ± 0.033
0.433TrpPhe: 0.433 ± 0.031
0.457TrpGly: 0.457 ± 0.036
0.152TrpHis: 0.152 ± 0.021
0.654TrpIle: 0.654 ± 0.038
0.633TrpLys: 0.633 ± 0.038
1.012TrpLeu: 1.012 ± 0.053
0.139TrpMet: 0.139 ± 0.019
0.534TrpAsn: 0.534 ± 0.036
0.193TrpPro: 0.193 ± 0.02
0.367TrpGln: 0.367 ± 0.035
0.27TrpArg: 0.27 ± 0.024
0.562TrpSer: 0.562 ± 0.03
0.438TrpThr: 0.438 ± 0.033
0.521TrpVal: 0.521 ± 0.039
0.079TrpTrp: 0.079 ± 0.016
0.309TrpTyr: 0.309 ± 0.026
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.471TyrAla: 1.471 ± 0.058
0.472TyrCys: 0.472 ± 0.035
2.07TyrAsp: 2.07 ± 0.061
1.701TyrGlu: 1.701 ± 0.069
2.207TyrPhe: 2.207 ± 0.072
1.83TyrGly: 1.83 ± 0.068
0.852TyrHis: 0.852 ± 0.051
2.979TyrIle: 2.979 ± 0.091
2.879TyrLys: 2.879 ± 0.093
4.174TyrLeu: 4.174 ± 0.121
0.53TyrMet: 0.53 ± 0.034
2.381TyrAsn: 2.381 ± 0.072
1.394TyrPro: 1.394 ± 0.059
1.965TyrGln: 1.965 ± 0.06
1.332TyrArg: 1.332 ± 0.051
2.975TyrSer: 2.975 ± 0.092
1.587TyrThr: 1.587 ± 0.06
1.555TyrVal: 1.555 ± 0.06
0.326TyrTrp: 0.326 ± 0.027
1.516TyrTyr: 1.516 ± 0.069
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1171 proteins (466208 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski