Amino acid dipepetide frequency for candidate division MSBL1 archaeon SCGC-AAA259E19

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.882AlaAla: 3.882 ± 0.132
0.719AlaCys: 0.719 ± 0.052
3.388AlaAsp: 3.388 ± 0.124
6.047AlaGlu: 6.047 ± 0.157
2.265AlaPhe: 2.265 ± 0.088
4.783AlaGly: 4.783 ± 0.135
1.078AlaHis: 1.078 ± 0.063
3.991AlaIle: 3.991 ± 0.114
3.769AlaLys: 3.769 ± 0.112
5.662AlaLeu: 5.662 ± 0.149
1.505AlaMet: 1.505 ± 0.074
1.716AlaAsn: 1.716 ± 0.081
2.053AlaPro: 2.053 ± 0.082
1.415AlaGln: 1.415 ± 0.062
3.763AlaArg: 3.763 ± 0.123
3.721AlaSer: 3.721 ± 0.102
2.826AlaThr: 2.826 ± 0.109
4.575AlaVal: 4.575 ± 0.141
0.699AlaTrp: 0.699 ± 0.053
1.787AlaTyr: 1.787 ± 0.076
0.0AlaXaa: 0.0 ± 0.0
Cys
0.459CysAla: 0.459 ± 0.037
0.119CysCys: 0.119 ± 0.022
0.6CysAsp: 0.6 ± 0.051
0.812CysGlu: 0.812 ± 0.052
0.423CysPhe: 0.423 ± 0.04
1.062CysGly: 1.062 ± 0.073
0.263CysHis: 0.263 ± 0.029
0.504CysIle: 0.504 ± 0.037
0.478CysLys: 0.478 ± 0.038
0.773CysLeu: 0.773 ± 0.049
0.305CysMet: 0.305 ± 0.034
0.292CysAsn: 0.292 ± 0.027
0.709CysPro: 0.709 ± 0.052
0.273CysGln: 0.273 ± 0.034
0.68CysArg: 0.68 ± 0.051
0.812CysSer: 0.812 ± 0.053
0.346CysThr: 0.346 ± 0.038
0.533CysVal: 0.533 ± 0.037
0.132CysTrp: 0.132 ± 0.023
0.302CysTyr: 0.302 ± 0.03
0.0CysXaa: 0.0 ± 0.0
Asp
3.163AspAla: 3.163 ± 0.109
0.616AspCys: 0.616 ± 0.049
2.785AspAsp: 2.785 ± 0.101
5.694AspGlu: 5.694 ± 0.15
2.785AspPhe: 2.785 ± 0.091
3.923AspGly: 3.923 ± 0.133
1.049AspHis: 1.049 ± 0.064
3.901AspIle: 3.901 ± 0.108
3.237AspLys: 3.237 ± 0.122
6.897AspLeu: 6.897 ± 0.188
1.351AspMet: 1.351 ± 0.064
1.806AspAsn: 1.806 ± 0.113
2.958AspPro: 2.958 ± 0.103
1.235AspGln: 1.235 ± 0.067
3.413AspArg: 3.413 ± 0.111
3.474AspSer: 3.474 ± 0.105
2.262AspThr: 2.262 ± 0.096
4.279AspVal: 4.279 ± 0.127
0.901AspTrp: 0.901 ± 0.058
2.149AspTyr: 2.149 ± 0.089
0.0AspXaa: 0.0 ± 0.0
Glu
5.967GluAla: 5.967 ± 0.165
0.747GluCys: 0.747 ± 0.055
6.256GluAsp: 6.256 ± 0.151
13.397GluGlu: 13.397 ± 0.299
3.359GluPhe: 3.359 ± 0.105
7.305GluGly: 7.305 ± 0.172
1.293GluHis: 1.293 ± 0.06
7.735GluIle: 7.735 ± 0.177
10.355GluLys: 10.355 ± 0.198
8.546GluLeu: 8.546 ± 0.207
2.611GluMet: 2.611 ± 0.092
5.438GluAsn: 5.438 ± 0.159
3.096GluPro: 3.096 ± 0.105
1.495GluGln: 1.495 ± 0.073
6.438GluArg: 6.438 ± 0.158
5.261GluSer: 5.261 ± 0.126
4.312GluThr: 4.312 ± 0.116
7.173GluVal: 7.173 ± 0.168
1.007GluTrp: 1.007 ± 0.061
2.579GluTyr: 2.579 ± 0.111
0.0GluXaa: 0.0 ± 0.0
Phe
2.291PheAla: 2.291 ± 0.081
0.475PheCys: 0.475 ± 0.038
2.566PheAsp: 2.566 ± 0.082
3.721PheGlu: 3.721 ± 0.117
1.841PhePhe: 1.841 ± 0.114
3.16PheGly: 3.16 ± 0.121
0.873PheHis: 0.873 ± 0.047
1.918PheIle: 1.918 ± 0.093
1.941PheLys: 1.941 ± 0.081
4.353PheLeu: 4.353 ± 0.164
0.744PheMet: 0.744 ± 0.046
1.232PheAsn: 1.232 ± 0.066
1.7PhePro: 1.7 ± 0.079
1.001PheGln: 1.001 ± 0.055
2.005PheArg: 2.005 ± 0.092
3.388PheSer: 3.388 ± 0.118
1.806PheThr: 1.806 ± 0.085
2.582PheVal: 2.582 ± 0.085
0.465PheTrp: 0.465 ± 0.039
1.267PheTyr: 1.267 ± 0.054
0.0PheXaa: 0.0 ± 0.0
Gly
4.453GlyAla: 4.453 ± 0.12
0.792GlyCys: 0.792 ± 0.05
4.1GlyAsp: 4.1 ± 0.133
7.831GlyGlu: 7.831 ± 0.182
3.06GlyPhe: 3.06 ± 0.115
6.378GlyGly: 6.378 ± 0.182
1.129GlyHis: 1.129 ± 0.069
5.569GlyIle: 5.569 ± 0.155
5.941GlyLys: 5.941 ± 0.152
6.554GlyLeu: 6.554 ± 0.174
2.005GlyMet: 2.005 ± 0.082
2.56GlyAsn: 2.56 ± 0.09
2.64GlyPro: 2.64 ± 0.096
1.517GlyGln: 1.517 ± 0.072
4.623GlyArg: 4.623 ± 0.128
4.802GlySer: 4.802 ± 0.132
3.731GlyThr: 3.731 ± 0.131
5.409GlyVal: 5.409 ± 0.145
1.046GlyTrp: 1.046 ± 0.065
2.374GlyTyr: 2.374 ± 0.087
0.0GlyXaa: 0.0 ± 0.0
His
0.927HisAla: 0.927 ± 0.05
0.279HisCys: 0.279 ± 0.028
0.959HisAsp: 0.959 ± 0.061
1.421HisGlu: 1.421 ± 0.066
0.776HisPhe: 0.776 ± 0.05
1.36HisGly: 1.36 ± 0.068
0.439HisHis: 0.439 ± 0.039
0.837HisIle: 0.837 ± 0.054
0.78HisLys: 0.78 ± 0.054
1.957HisLeu: 1.957 ± 0.085
0.327HisMet: 0.327 ± 0.032
0.571HisAsn: 0.571 ± 0.044
1.161HisPro: 1.161 ± 0.067
0.475HisGln: 0.475 ± 0.039
1.027HisArg: 1.027 ± 0.064
1.126HisSer: 1.126 ± 0.062
0.715HisThr: 0.715 ± 0.054
1.075HisVal: 1.075 ± 0.058
0.205HisTrp: 0.205 ± 0.027
0.616HisTyr: 0.616 ± 0.046
0.0HisXaa: 0.0 ± 0.0
Ile
4.427IleAla: 4.427 ± 0.128
0.757IleCys: 0.757 ± 0.05
3.827IleAsp: 3.827 ± 0.107
6.692IleGlu: 6.692 ± 0.144
2.682IlePhe: 2.682 ± 0.119
5.117IleGly: 5.117 ± 0.128
1.248IleHis: 1.248 ± 0.06
3.673IleIle: 3.673 ± 0.119
3.724IleLys: 3.724 ± 0.113
5.977IleLeu: 5.977 ± 0.173
1.116IleMet: 1.116 ± 0.06
2.082IleAsn: 2.082 ± 0.089
3.336IlePro: 3.336 ± 0.105
1.633IleGln: 1.633 ± 0.077
3.676IleArg: 3.676 ± 0.101
5.117IleSer: 5.117 ± 0.158
3.208IleThr: 3.208 ± 0.116
4.421IleVal: 4.421 ± 0.12
0.638IleTrp: 0.638 ± 0.04
1.95IleTyr: 1.95 ± 0.08
0.0IleXaa: 0.0 ± 0.0
Lys
4.186LysAla: 4.186 ± 0.135
0.642LysCys: 0.642 ± 0.05
4.013LysAsp: 4.013 ± 0.132
8.026LysGlu: 8.026 ± 0.173
2.618LysPhe: 2.618 ± 0.094
4.802LysGly: 4.802 ± 0.136
1.123LysHis: 1.123 ± 0.059
5.848LysIle: 5.848 ± 0.147
6.445LysLys: 6.445 ± 0.179
6.114LysLeu: 6.114 ± 0.145
1.585LysMet: 1.585 ± 0.072
3.327LysAsn: 3.327 ± 0.097
2.281LysPro: 2.281 ± 0.098
1.476LysGln: 1.476 ± 0.074
4.267LysArg: 4.267 ± 0.114
4.283LysSer: 4.283 ± 0.121
3.551LysThr: 3.551 ± 0.118
4.767LysVal: 4.767 ± 0.142
0.828LysTrp: 0.828 ± 0.054
2.021LysTyr: 2.021 ± 0.092
0.0LysXaa: 0.0 ± 0.0
Leu
5.951LeuAla: 5.951 ± 0.167
0.792LeuCys: 0.792 ± 0.053
5.993LeuAsp: 5.993 ± 0.138
10.07LeuGlu: 10.07 ± 0.214
3.426LeuPhe: 3.426 ± 0.142
6.936LeuGly: 6.936 ± 0.164
1.45LeuHis: 1.45 ± 0.073
5.319LeuIle: 5.319 ± 0.165
6.669LeuLys: 6.669 ± 0.169
7.972LeuLeu: 7.972 ± 0.233
1.967LeuMet: 1.967 ± 0.075
3.394LeuAsn: 3.394 ± 0.104
3.853LeuPro: 3.853 ± 0.115
2.104LeuGln: 2.104 ± 0.081
5.736LeuArg: 5.736 ± 0.155
6.583LeuSer: 6.583 ± 0.151
4.565LeuThr: 4.565 ± 0.099
5.624LeuVal: 5.624 ± 0.145
0.969LeuTrp: 0.969 ± 0.054
2.342LeuTyr: 2.342 ± 0.081
0.0LeuXaa: 0.0 ± 0.0
Met
1.578MetAla: 1.578 ± 0.072
0.176MetCys: 0.176 ± 0.024
1.402MetAsp: 1.402 ± 0.077
2.329MetGlu: 2.329 ± 0.083
0.597MetPhe: 0.597 ± 0.044
1.713MetGly: 1.713 ± 0.079
0.302MetHis: 0.302 ± 0.031
1.53MetIle: 1.53 ± 0.078
2.088MetLys: 2.088 ± 0.102
1.575MetLeu: 1.575 ± 0.087
0.516MetMet: 0.516 ± 0.039
0.978MetAsn: 0.978 ± 0.049
0.978MetPro: 0.978 ± 0.066
0.372MetGln: 0.372 ± 0.034
1.293MetArg: 1.293 ± 0.068
1.549MetSer: 1.549 ± 0.06
1.2MetThr: 1.2 ± 0.056
1.415MetVal: 1.415 ± 0.059
0.269MetTrp: 0.269 ± 0.032
0.404MetTyr: 0.404 ± 0.039
0.0MetXaa: 0.0 ± 0.0
Asn
1.967AsnAla: 1.967 ± 0.089
0.504AsnCys: 0.504 ± 0.042
1.636AsnAsp: 1.636 ± 0.072
2.624AsnGlu: 2.624 ± 0.095
1.79AsnPhe: 1.79 ± 0.085
2.345AsnGly: 2.345 ± 0.102
0.712AsnHis: 0.712 ± 0.047
2.438AsnIle: 2.438 ± 0.086
2.14AsnLys: 2.14 ± 0.082
4.34AsnLeu: 4.34 ± 0.12
0.802AsnMet: 0.802 ± 0.048
1.136AsnAsn: 1.136 ± 0.094
2.425AsnPro: 2.425 ± 0.087
1.068AsnGln: 1.068 ± 0.066
2.133AsnArg: 2.133 ± 0.071
2.496AsnSer: 2.496 ± 0.09
1.752AsnThr: 1.752 ± 0.092
2.541AsnVal: 2.541 ± 0.1
0.699AsnTrp: 0.699 ± 0.049
1.537AsnTyr: 1.537 ± 0.073
0.0AsnXaa: 0.0 ± 0.0
Pro
2.159ProAla: 2.159 ± 0.083
0.385ProCys: 0.385 ± 0.034
2.852ProAsp: 2.852 ± 0.103
4.988ProGlu: 4.988 ± 0.141
1.62ProPhe: 1.62 ± 0.077
3.349ProGly: 3.349 ± 0.103
0.934ProHis: 0.934 ± 0.063
2.509ProIle: 2.509 ± 0.102
2.692ProLys: 2.692 ± 0.084
3.529ProLeu: 3.529 ± 0.106
0.844ProMet: 0.844 ± 0.055
1.431ProAsn: 1.431 ± 0.076
2.191ProPro: 2.191 ± 0.1
1.097ProGln: 1.097 ± 0.059
2.178ProArg: 2.178 ± 0.091
3.032ProSer: 3.032 ± 0.099
2.015ProThr: 2.015 ± 0.076
2.865ProVal: 2.865 ± 0.101
0.494ProTrp: 0.494 ± 0.042
1.399ProTyr: 1.399 ± 0.058
0.0ProXaa: 0.0 ± 0.0
Gln
1.54GlnAla: 1.54 ± 0.072
0.18GlnCys: 0.18 ± 0.029
1.322GlnAsp: 1.322 ± 0.063
2.268GlnGlu: 2.268 ± 0.083
0.789GlnPhe: 0.789 ± 0.05
1.521GlnGly: 1.521 ± 0.067
0.353GlnHis: 0.353 ± 0.034
1.768GlnIle: 1.768 ± 0.079
1.89GlnLys: 1.89 ± 0.077
1.857GlnLeu: 1.857 ± 0.083
0.552GlnMet: 0.552 ± 0.045
1.014GlnAsn: 1.014 ± 0.057
0.786GlnPro: 0.786 ± 0.044
0.472GlnGln: 0.472 ± 0.042
1.28GlnArg: 1.28 ± 0.06
1.171GlnSer: 1.171 ± 0.065
1.027GlnThr: 1.027 ± 0.054
1.582GlnVal: 1.582 ± 0.071
0.218GlnTrp: 0.218 ± 0.031
0.613GlnTyr: 0.613 ± 0.043
0.0GlnXaa: 0.0 ± 0.0
Arg
3.702ArgAla: 3.702 ± 0.123
0.433ArgCys: 0.433 ± 0.037
3.009ArgAsp: 3.009 ± 0.106
6.843ArgGlu: 6.843 ± 0.164
2.169ArgPhe: 2.169 ± 0.083
4.581ArgGly: 4.581 ± 0.126
0.86ArgHis: 0.86 ± 0.059
4.186ArgIle: 4.186 ± 0.106
5.726ArgLys: 5.726 ± 0.14
4.501ArgLeu: 4.501 ± 0.125
1.46ArgMet: 1.46 ± 0.077
2.493ArgAsn: 2.493 ± 0.091
2.111ArgPro: 2.111 ± 0.082
1.222ArgGln: 1.222 ± 0.07
4.174ArgArg: 4.174 ± 0.132
3.429ArgSer: 3.429 ± 0.107
2.858ArgThr: 2.858 ± 0.106
3.737ArgVal: 3.737 ± 0.109
0.664ArgTrp: 0.664 ± 0.048
1.662ArgTyr: 1.662 ± 0.081
0.0ArgXaa: 0.0 ± 0.0
Ser
3.715SerAla: 3.715 ± 0.121
0.603SerCys: 0.603 ± 0.05
3.798SerAsp: 3.798 ± 0.096
6.58SerGlu: 6.58 ± 0.144
2.961SerPhe: 2.961 ± 0.107
5.457SerGly: 5.457 ± 0.165
1.081SerHis: 1.081 ± 0.059
4.241SerIle: 4.241 ± 0.12
4.315SerLys: 4.315 ± 0.149
6.204SerLeu: 6.204 ± 0.129
1.405SerMet: 1.405 ± 0.064
2.117SerAsn: 2.117 ± 0.08
3.218SerPro: 3.218 ± 0.107
1.748SerGln: 1.748 ± 0.069
3.875SerArg: 3.875 ± 0.141
4.687SerSer: 4.687 ± 0.163
3.006SerThr: 3.006 ± 0.114
4.247SerVal: 4.247 ± 0.104
0.747SerTrp: 0.747 ± 0.046
1.95SerTyr: 1.95 ± 0.091
0.0SerXaa: 0.0 ± 0.0
Thr
3.083ThrAla: 3.083 ± 0.112
0.497ThrCys: 0.497 ± 0.048
2.554ThrAsp: 2.554 ± 0.105
4.318ThrGlu: 4.318 ± 0.129
1.796ThrPhe: 1.796 ± 0.079
4.186ThrGly: 4.186 ± 0.125
0.85ThrHis: 0.85 ± 0.057
3.083ThrIle: 3.083 ± 0.089
2.72ThrLys: 2.72 ± 0.102
4.283ThrLeu: 4.283 ± 0.115
0.857ThrMet: 0.857 ± 0.054
1.492ThrAsn: 1.492 ± 0.072
2.323ThrPro: 2.323 ± 0.09
1.02ThrGln: 1.02 ± 0.059
2.544ThrArg: 2.544 ± 0.097
3.195ThrSer: 3.195 ± 0.124
2.48ThrThr: 2.48 ± 0.114
3.744ThrVal: 3.744 ± 0.13
0.613ThrTrp: 0.613 ± 0.046
1.45ThrTyr: 1.45 ± 0.075
0.0ThrXaa: 0.0 ± 0.0
Val
4.116ValAla: 4.116 ± 0.117
0.728ValCys: 0.728 ± 0.051
4.437ValAsp: 4.437 ± 0.15
6.949ValGlu: 6.949 ± 0.186
2.621ValPhe: 2.621 ± 0.098
5.235ValGly: 5.235 ± 0.144
1.206ValHis: 1.206 ± 0.06
3.811ValIle: 3.811 ± 0.104
4.735ValLys: 4.735 ± 0.141
6.249ValLeu: 6.249 ± 0.162
1.395ValMet: 1.395 ± 0.06
2.384ValAsn: 2.384 ± 0.081
2.996ValPro: 2.996 ± 0.098
1.444ValGln: 1.444 ± 0.064
3.965ValArg: 3.965 ± 0.118
4.793ValSer: 4.793 ± 0.122
3.41ValThr: 3.41 ± 0.112
4.674ValVal: 4.674 ± 0.147
0.68ValTrp: 0.68 ± 0.048
1.845ValTyr: 1.845 ± 0.077
0.0ValXaa: 0.0 ± 0.0
Trp
0.581TrpAla: 0.581 ± 0.045
0.077TrpCys: 0.077 ± 0.014
0.638TrpAsp: 0.638 ± 0.043
1.081TrpGlu: 1.081 ± 0.063
0.417TrpPhe: 0.417 ± 0.035
0.847TrpGly: 0.847 ± 0.053
0.186TrpHis: 0.186 ± 0.025
0.95TrpIle: 0.95 ± 0.059
0.998TrpLys: 0.998 ± 0.059
1.065TrpLeu: 1.065 ± 0.062
0.363TrpMet: 0.363 ± 0.036
0.648TrpAsn: 0.648 ± 0.049
0.302TrpPro: 0.302 ± 0.027
0.25TrpGln: 0.25 ± 0.027
0.869TrpArg: 0.869 ± 0.061
0.847TrpSer: 0.847 ± 0.053
0.664TrpThr: 0.664 ± 0.052
0.674TrpVal: 0.674 ± 0.05
0.167TrpTrp: 0.167 ± 0.023
0.318TrpTyr: 0.318 ± 0.038
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.501TyrAla: 1.501 ± 0.071
0.353TyrCys: 0.353 ± 0.032
1.745TyrAsp: 1.745 ± 0.077
2.679TyrGlu: 2.679 ± 0.105
1.335TyrPhe: 1.335 ± 0.068
2.454TyrGly: 2.454 ± 0.09
0.606TyrHis: 0.606 ± 0.04
1.492TyrIle: 1.492 ± 0.075
1.559TyrLys: 1.559 ± 0.081
3.304TyrLeu: 3.304 ± 0.115
0.555TyrMet: 0.555 ± 0.044
0.975TyrAsn: 0.975 ± 0.059
1.444TyrPro: 1.444 ± 0.069
0.882TyrGln: 0.882 ± 0.065
1.88TyrArg: 1.88 ± 0.081
2.191TyrSer: 2.191 ± 0.091
1.354TyrThr: 1.354 ± 0.074
1.758TyrVal: 1.758 ± 0.076
0.497TyrTrp: 0.497 ± 0.04
0.985TyrTyr: 0.985 ± 0.057
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1417 proteins (311721 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski