Amino acid dipepetide frequency for Candidatus Micrarchaeum acidiphilum ARMAN-2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.111AlaAla: 8.111 ± 0.234
0.635AlaCys: 0.635 ± 0.048
3.667AlaAsp: 3.667 ± 0.104
5.228AlaGlu: 5.228 ± 0.152
3.707AlaPhe: 3.707 ± 0.134
5.909AlaGly: 5.909 ± 0.148
1.218AlaHis: 1.218 ± 0.068
6.972AlaIle: 6.972 ± 0.175
5.899AlaLys: 5.899 ± 0.162
8.134AlaLeu: 8.134 ± 0.178
2.67AlaMet: 2.67 ± 0.095
3.598AlaAsn: 3.598 ± 0.16
2.476AlaPro: 2.476 ± 0.106
2.304AlaGln: 2.304 ± 0.091
3.717AlaArg: 3.717 ± 0.118
5.949AlaSer: 5.949 ± 0.186
3.865AlaThr: 3.865 ± 0.134
6.288AlaVal: 6.288 ± 0.178
0.576AlaTrp: 0.576 ± 0.05
3.391AlaTyr: 3.391 ± 0.103
0.0AlaXaa: 0.0 ± 0.0
Cys
0.635CysAla: 0.635 ± 0.055
0.066CysCys: 0.066 ± 0.013
0.382CysAsp: 0.382 ± 0.036
0.418CysGlu: 0.418 ± 0.037
0.234CysPhe: 0.234 ± 0.031
0.856CysGly: 0.856 ± 0.066
0.135CysHis: 0.135 ± 0.02
0.586CysIle: 0.586 ± 0.049
0.533CysLys: 0.533 ± 0.04
0.402CysLeu: 0.402 ± 0.041
0.194CysMet: 0.194 ± 0.026
0.382CysAsn: 0.382 ± 0.04
0.421CysPro: 0.421 ± 0.035
0.191CysGln: 0.191 ± 0.026
0.405CysArg: 0.405 ± 0.042
0.701CysSer: 0.701 ± 0.051
0.504CysThr: 0.504 ± 0.051
0.494CysVal: 0.494 ± 0.047
0.033CysTrp: 0.033 ± 0.011
0.329CysTyr: 0.329 ± 0.04
0.0CysXaa: 0.0 ± 0.0
Asp
4.168AspAla: 4.168 ± 0.145
0.362AspCys: 0.362 ± 0.037
1.755AspAsp: 1.755 ± 0.09
3.134AspGlu: 3.134 ± 0.112
2.268AspPhe: 2.268 ± 0.089
3.289AspGly: 3.289 ± 0.112
0.546AspHis: 0.546 ± 0.039
4.434AspIle: 4.434 ± 0.137
3.312AspLys: 3.312 ± 0.141
4.128AspLeu: 4.128 ± 0.118
1.501AspMet: 1.501 ± 0.072
1.616AspAsn: 1.616 ± 0.083
1.764AspPro: 1.764 ± 0.075
0.833AspGln: 0.833 ± 0.061
2.393AspArg: 2.393 ± 0.106
3.717AspSer: 3.717 ± 0.138
2.295AspThr: 2.295 ± 0.082
3.154AspVal: 3.154 ± 0.106
0.349AspTrp: 0.349 ± 0.034
2.176AspTyr: 2.176 ± 0.088
0.0AspXaa: 0.0 ± 0.0
Glu
4.77GluAla: 4.77 ± 0.143
0.425GluCys: 0.425 ± 0.036
2.805GluAsp: 2.805 ± 0.11
4.112GluGlu: 4.112 ± 0.132
2.798GluPhe: 2.798 ± 0.097
3.457GluGly: 3.457 ± 0.119
1.08GluHis: 1.08 ± 0.071
5.287GluIle: 5.287 ± 0.165
5.764GluLys: 5.764 ± 0.181
5.712GluLeu: 5.712 ± 0.171
1.867GluMet: 1.867 ± 0.072
3.088GluAsn: 3.088 ± 0.108
1.583GluPro: 1.583 ± 0.078
1.748GluGln: 1.748 ± 0.079
3.302GluArg: 3.302 ± 0.13
4.266GluSer: 4.266 ± 0.133
2.597GluThr: 2.597 ± 0.093
3.746GluVal: 3.746 ± 0.109
0.491GluTrp: 0.491 ± 0.037
2.515GluTyr: 2.515 ± 0.101
0.0GluXaa: 0.0 ± 0.0
Phe
3.453PheAla: 3.453 ± 0.136
0.26PheCys: 0.26 ± 0.03
2.35PheAsp: 2.35 ± 0.098
2.604PheGlu: 2.604 ± 0.115
1.745PhePhe: 1.745 ± 0.09
4.131PheGly: 4.131 ± 0.141
0.454PheHis: 0.454 ± 0.04
3.483PheIle: 3.483 ± 0.133
2.383PheLys: 2.383 ± 0.085
3.516PheLeu: 3.516 ± 0.121
1.241PheMet: 1.241 ± 0.071
2.008PheAsn: 2.008 ± 0.1
1.29PhePro: 1.29 ± 0.065
0.728PheGln: 0.728 ± 0.051
1.732PheArg: 1.732 ± 0.092
3.746PheSer: 3.746 ± 0.125
2.35PheThr: 2.35 ± 0.105
3.351PheVal: 3.351 ± 0.122
0.369PheTrp: 0.369 ± 0.035
1.89PheTyr: 1.89 ± 0.089
0.0PheXaa: 0.0 ± 0.0
Gly
5.84GlyAla: 5.84 ± 0.132
0.635GlyCys: 0.635 ± 0.055
2.933GlyAsp: 2.933 ± 0.104
3.605GlyGlu: 3.605 ± 0.12
3.545GlyPhe: 3.545 ± 0.116
5.373GlyGly: 5.373 ± 0.207
1.007GlyHis: 1.007 ± 0.07
7.45GlyIle: 7.45 ± 0.161
5.682GlyLys: 5.682 ± 0.147
5.975GlyLeu: 5.975 ± 0.156
2.489GlyMet: 2.489 ± 0.094
3.565GlyAsn: 3.565 ± 0.128
2.11GlyPro: 2.11 ± 0.1
1.682GlyGln: 1.682 ± 0.067
3.493GlyArg: 3.493 ± 0.148
5.837GlySer: 5.837 ± 0.162
4.349GlyThr: 4.349 ± 0.182
4.935GlyVal: 4.935 ± 0.142
0.635GlyTrp: 0.635 ± 0.055
3.355GlyTyr: 3.355 ± 0.118
0.0GlyXaa: 0.0 ± 0.0
His
1.185HisAla: 1.185 ± 0.063
0.119HisCys: 0.119 ± 0.021
0.652HisAsp: 0.652 ± 0.047
0.905HisGlu: 0.905 ± 0.056
0.639HisPhe: 0.639 ± 0.046
1.294HisGly: 1.294 ± 0.066
0.227HisHis: 0.227 ± 0.026
1.188HisIle: 1.188 ± 0.063
0.915HisLys: 0.915 ± 0.054
1.136HisLeu: 1.136 ± 0.066
0.461HisMet: 0.461 ± 0.036
0.639HisAsn: 0.639 ± 0.047
0.681HisPro: 0.681 ± 0.044
0.326HisGln: 0.326 ± 0.03
0.826HisArg: 0.826 ± 0.056
1.165HisSer: 1.165 ± 0.069
0.704HisThr: 0.704 ± 0.053
0.859HisVal: 0.859 ± 0.051
0.138HisTrp: 0.138 ± 0.025
0.688HisTyr: 0.688 ± 0.051
0.0HisXaa: 0.0 ± 0.0
Ile
7.562IleAla: 7.562 ± 0.204
0.537IleCys: 0.537 ± 0.043
4.421IleAsp: 4.421 ± 0.135
5.28IleGlu: 5.28 ± 0.162
3.671IlePhe: 3.671 ± 0.131
6.255IleGly: 6.255 ± 0.166
0.862IleHis: 0.862 ± 0.047
6.452IleIle: 6.452 ± 0.196
6.13IleLys: 6.13 ± 0.182
6.166IleLeu: 6.166 ± 0.167
1.926IleMet: 1.926 ± 0.073
4.085IleAsn: 4.085 ± 0.127
3.068IlePro: 3.068 ± 0.096
1.31IleGln: 1.31 ± 0.073
3.654IleArg: 3.654 ± 0.129
6.956IleSer: 6.956 ± 0.151
4.158IleThr: 4.158 ± 0.117
5.34IleVal: 5.34 ± 0.149
0.477IleTrp: 0.477 ± 0.042
2.953IleTyr: 2.953 ± 0.101
0.0IleXaa: 0.0 ± 0.0
Lys
5.83LysAla: 5.83 ± 0.164
0.563LysCys: 0.563 ± 0.051
3.944LysAsp: 3.944 ± 0.145
5.32LysGlu: 5.32 ± 0.175
2.769LysPhe: 2.769 ± 0.104
4.698LysGly: 4.698 ± 0.151
1.093LysHis: 1.093 ± 0.066
5.534LysIle: 5.534 ± 0.174
6.281LysLys: 6.281 ± 0.184
6.373LysLeu: 6.373 ± 0.159
2.318LysMet: 2.318 ± 0.085
3.746LysAsn: 3.746 ± 0.127
2.69LysPro: 2.69 ± 0.105
1.623LysGln: 1.623 ± 0.08
3.799LysArg: 3.799 ± 0.122
5.386LysSer: 5.386 ± 0.146
3.177LysThr: 3.177 ± 0.117
4.418LysVal: 4.418 ± 0.134
0.543LysTrp: 0.543 ± 0.044
2.854LysTyr: 2.854 ± 0.104
0.0LysXaa: 0.0 ± 0.0
Leu
7.318LeuAla: 7.318 ± 0.175
0.662LeuCys: 0.662 ± 0.045
4.431LeuAsp: 4.431 ± 0.148
5.274LeuGlu: 5.274 ± 0.149
3.848LeuPhe: 3.848 ± 0.126
6.923LeuGly: 6.923 ± 0.182
1.432LeuHis: 1.432 ± 0.062
6.479LeuIle: 6.479 ± 0.181
6.195LeuLys: 6.195 ± 0.152
8.516LeuLeu: 8.516 ± 0.252
2.258LeuMet: 2.258 ± 0.084
4.082LeuAsn: 4.082 ± 0.125
3.473LeuPro: 3.473 ± 0.126
2.136LeuGln: 2.136 ± 0.089
4.069LeuArg: 4.069 ± 0.126
7.295LeuSer: 7.295 ± 0.182
3.792LeuThr: 3.792 ± 0.128
5.695LeuVal: 5.695 ± 0.161
0.609LeuTrp: 0.609 ± 0.053
3.572LeuTyr: 3.572 ± 0.104
0.0LeuXaa: 0.0 ± 0.0
Met
2.584MetAla: 2.584 ± 0.104
0.165MetCys: 0.165 ± 0.024
1.511MetAsp: 1.511 ± 0.085
1.797MetGlu: 1.797 ± 0.076
1.083MetPhe: 1.083 ± 0.063
1.709MetGly: 1.709 ± 0.065
0.813MetHis: 0.813 ± 0.051
1.685MetIle: 1.685 ± 0.073
1.955MetLys: 1.955 ± 0.072
3.315MetLeu: 3.315 ± 0.104
0.583MetMet: 0.583 ± 0.044
1.04MetAsn: 1.04 ± 0.062
1.57MetPro: 1.57 ± 0.071
1.172MetGln: 1.172 ± 0.062
1.327MetArg: 1.327 ± 0.066
1.768MetSer: 1.768 ± 0.069
1.09MetThr: 1.09 ± 0.062
1.903MetVal: 1.903 ± 0.084
0.174MetTrp: 0.174 ± 0.024
0.862MetTyr: 0.862 ± 0.051
0.0MetXaa: 0.0 ± 0.0
Asn
4.388AsnAla: 4.388 ± 0.133
0.464AsnCys: 0.464 ± 0.048
1.89AsnAsp: 1.89 ± 0.081
2.755AsnGlu: 2.755 ± 0.095
2.084AsnPhe: 2.084 ± 0.093
3.776AsnGly: 3.776 ± 0.156
0.484AsnHis: 0.484 ± 0.038
3.703AsnIle: 3.703 ± 0.126
2.775AsnLys: 2.775 ± 0.114
3.713AsnLeu: 3.713 ± 0.117
1.313AsnMet: 1.313 ± 0.067
1.939AsnAsn: 1.939 ± 0.107
2.136AsnPro: 2.136 ± 0.111
1.053AsnGln: 1.053 ± 0.061
1.893AsnArg: 1.893 ± 0.086
4.099AsnSer: 4.099 ± 0.147
2.413AsnThr: 2.413 ± 0.118
3.368AsnVal: 3.368 ± 0.129
0.323AsnTrp: 0.323 ± 0.031
2.061AsnTyr: 2.061 ± 0.107
0.0AsnXaa: 0.0 ± 0.0
Pro
2.825ProAla: 2.825 ± 0.094
0.227ProCys: 0.227 ± 0.029
1.867ProAsp: 1.867 ± 0.086
2.69ProGlu: 2.69 ± 0.109
1.649ProPhe: 1.649 ± 0.079
2.729ProGly: 2.729 ± 0.103
0.622ProHis: 0.622 ± 0.047
2.558ProIle: 2.558 ± 0.107
2.634ProLys: 2.634 ± 0.112
2.923ProLeu: 2.923 ± 0.108
0.751ProMet: 0.751 ± 0.048
1.666ProAsn: 1.666 ± 0.094
1.294ProPro: 1.294 ± 0.08
1.146ProGln: 1.146 ± 0.065
1.35ProArg: 1.35 ± 0.068
2.94ProSer: 2.94 ± 0.143
1.886ProThr: 1.886 ± 0.086
2.568ProVal: 2.568 ± 0.1
0.3ProTrp: 0.3 ± 0.033
1.623ProTyr: 1.623 ± 0.08
0.0ProXaa: 0.0 ± 0.0
Gln
1.857GlnAla: 1.857 ± 0.078
0.168GlnCys: 0.168 ± 0.02
1.063GlnAsp: 1.063 ± 0.065
1.422GlnGlu: 1.422 ± 0.073
0.889GlnPhe: 0.889 ± 0.057
1.481GlnGly: 1.481 ± 0.082
0.398GlnHis: 0.398 ± 0.037
1.883GlnIle: 1.883 ± 0.077
2.14GlnLys: 2.14 ± 0.091
2.268GlnLeu: 2.268 ± 0.097
0.731GlnMet: 0.731 ± 0.049
1.267GlnAsn: 1.267 ± 0.07
1.014GlnPro: 1.014 ± 0.063
0.938GlnGln: 0.938 ± 0.069
1.198GlnArg: 1.198 ± 0.072
1.837GlnSer: 1.837 ± 0.083
1.205GlnThr: 1.205 ± 0.059
1.287GlnVal: 1.287 ± 0.063
0.161GlnTrp: 0.161 ± 0.02
0.922GlnTyr: 0.922 ± 0.06
0.0GlnXaa: 0.0 ± 0.0
Arg
3.397ArgAla: 3.397 ± 0.106
0.336ArgCys: 0.336 ± 0.032
2.248ArgAsp: 2.248 ± 0.11
2.825ArgGlu: 2.825 ± 0.118
1.972ArgPhe: 1.972 ± 0.089
3.17ArgGly: 3.17 ± 0.11
0.876ArgHis: 0.876 ± 0.054
3.99ArgIle: 3.99 ± 0.141
3.759ArgLys: 3.759 ± 0.128
4.55ArgLeu: 4.55 ± 0.138
1.514ArgMet: 1.514 ± 0.079
2.186ArgAsn: 2.186 ± 0.085
1.498ArgPro: 1.498 ± 0.068
1.363ArgGln: 1.363 ± 0.065
2.607ArgArg: 2.607 ± 0.117
3.104ArgSer: 3.104 ± 0.102
1.83ArgThr: 1.83 ± 0.091
2.709ArgVal: 2.709 ± 0.091
0.319ArgTrp: 0.319 ± 0.032
1.876ArgTyr: 1.876 ± 0.089
0.0ArgXaa: 0.0 ± 0.0
Ser
6.488SerAla: 6.488 ± 0.152
0.672SerCys: 0.672 ± 0.063
3.46SerAsp: 3.46 ± 0.129
4.862SerGlu: 4.862 ± 0.16
3.197SerPhe: 3.197 ± 0.115
6.923SerGly: 6.923 ± 0.222
0.922SerHis: 0.922 ± 0.058
6.719SerIle: 6.719 ± 0.16
6.156SerLys: 6.156 ± 0.174
6.074SerLeu: 6.074 ± 0.162
2.206SerMet: 2.206 ± 0.086
3.825SerAsn: 3.825 ± 0.174
2.413SerPro: 2.413 ± 0.115
1.876SerGln: 1.876 ± 0.075
3.644SerArg: 3.644 ± 0.121
6.617SerSer: 6.617 ± 0.242
4.454SerThr: 4.454 ± 0.217
4.74SerVal: 4.74 ± 0.113
0.537SerTrp: 0.537 ± 0.043
3.062SerTyr: 3.062 ± 0.105
0.0SerXaa: 0.0 ± 0.0
Thr
4.286ThrAla: 4.286 ± 0.166
0.497ThrCys: 0.497 ± 0.064
2.209ThrAsp: 2.209 ± 0.099
2.568ThrGlu: 2.568 ± 0.116
2.051ThrPhe: 2.051 ± 0.089
4.069ThrGly: 4.069 ± 0.143
0.718ThrHis: 0.718 ± 0.052
4.089ThrIle: 4.089 ± 0.126
3.029ThrLys: 3.029 ± 0.092
4.217ThrLeu: 4.217 ± 0.129
1.195ThrMet: 1.195 ± 0.063
2.377ThrAsn: 2.377 ± 0.13
2.077ThrPro: 2.077 ± 0.094
1.142ThrGln: 1.142 ± 0.062
1.913ThrArg: 1.913 ± 0.078
3.848ThrSer: 3.848 ± 0.237
3.164ThrThr: 3.164 ± 0.243
3.506ThrVal: 3.506 ± 0.119
0.323ThrTrp: 0.323 ± 0.032
2.248ThrTyr: 2.248 ± 0.121
0.0ThrXaa: 0.0 ± 0.0
Val
5.721ValAla: 5.721 ± 0.156
0.546ValCys: 0.546 ± 0.054
3.236ValAsp: 3.236 ± 0.114
4.0ValGlu: 4.0 ± 0.14
2.95ValPhe: 2.95 ± 0.11
4.494ValGly: 4.494 ± 0.13
1.103ValHis: 1.103 ± 0.063
5.129ValIle: 5.129 ± 0.144
4.395ValLys: 4.395 ± 0.139
6.284ValLeu: 6.284 ± 0.175
1.62ValMet: 1.62 ± 0.081
3.006ValAsn: 3.006 ± 0.099
2.89ValPro: 2.89 ± 0.098
1.577ValGln: 1.577 ± 0.073
2.604ValArg: 2.604 ± 0.109
5.468ValSer: 5.468 ± 0.14
3.104ValThr: 3.104 ± 0.117
4.984ValVal: 4.984 ± 0.163
0.461ValTrp: 0.461 ± 0.04
2.739ValTyr: 2.739 ± 0.099
0.0ValXaa: 0.0 ± 0.0
Trp
0.51TrpAla: 0.51 ± 0.04
0.095TrpCys: 0.095 ± 0.019
0.339TrpAsp: 0.339 ± 0.039
0.326TrpGlu: 0.326 ± 0.031
0.293TrpPhe: 0.293 ± 0.033
0.428TrpGly: 0.428 ± 0.039
0.161TrpHis: 0.161 ± 0.024
0.625TrpIle: 0.625 ± 0.039
0.494TrpLys: 0.494 ± 0.042
0.807TrpLeu: 0.807 ± 0.056
0.207TrpMet: 0.207 ± 0.031
0.395TrpAsn: 0.395 ± 0.042
0.309TrpPro: 0.309 ± 0.035
0.25TrpGln: 0.25 ± 0.029
0.369TrpArg: 0.369 ± 0.041
0.5TrpSer: 0.5 ± 0.04
0.352TrpThr: 0.352 ± 0.037
0.326TrpVal: 0.326 ± 0.032
0.072TrpTrp: 0.072 ± 0.016
0.392TrpTyr: 0.392 ± 0.034
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.384TyrAla: 3.384 ± 0.128
0.467TyrCys: 0.467 ± 0.047
2.031TyrAsp: 2.031 ± 0.081
2.288TyrGlu: 2.288 ± 0.085
1.695TyrPhe: 1.695 ± 0.093
3.532TyrGly: 3.532 ± 0.104
0.596TyrHis: 0.596 ± 0.039
3.015TyrIle: 3.015 ± 0.094
2.528TyrLys: 2.528 ± 0.1
3.852TyrLeu: 3.852 ± 0.115
1.06TyrMet: 1.06 ± 0.057
2.12TyrAsn: 2.12 ± 0.107
1.485TyrPro: 1.485 ± 0.092
0.731TyrGln: 0.731 ± 0.054
1.837TyrArg: 1.837 ± 0.084
3.648TyrSer: 3.648 ± 0.123
2.235TyrThr: 2.235 ± 0.116
2.64TyrVal: 2.64 ± 0.105
0.362TyrTrp: 0.362 ± 0.037
1.89TyrTyr: 1.89 ± 0.079
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1025 proteins (303770 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski