Amino acid dipeptide frequency for Candidatus Nitrosocaldus cavascurensis

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
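As an illustration of how such frequencies can be obtained, the following minimal Python sketch counts overlapping residue pairs within each protein and reports them in per mille. It is not the code used to build this table: the function name `dipeptide_per_mille`, the use of one-letter residue codes, the choice of overlapping pairs, and the toy sequences are all assumptions made for the example.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: dipeptides are counted as overlapping pairs of adjacent
    residues, and pairs never span two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Each sequence is scanned separately, so no pair crosses a boundary.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input (one-letter codes), not taken from the proteome.
toy_proteins = ["MAAL", "ALLK"]
toy_freqs = dipeptide_per_mille(toy_proteins)
```

For the toy input there are six pairs in total (MA, AA, AL, AL, LL, LK), so `toy_freqs["AL"]` is two sixths of 1000, and all values sum to 1000.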

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
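The unit conversion and the comparison with the random expectation can be sketched as follows. The helper `classify` and the constant `UNIFORM_PER_MILLE` are hypothetical names introduced here, and using exactly 1/400 as the threshold for over- and underrepresentation is an assumption based on the description above, not a confirmed detail of how the table was coloured.

```python
# Random expectation for 400 equally likely dipeptides: 1/400 = 0.25 % = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400


def classify(per_mille):
    """Convert a per mille value to a fraction and compare it with 1/400."""
    fraction = per_mille * 1e-3             # per mille -> fraction
    ratio = per_mille / UNIFORM_PER_MILLE   # >1 means more frequent than expected
    if ratio > 1:
        label = "over"
    elif ratio < 1:
        label = "under"
    else:
        label = "as expected"
    return fraction, ratio, label


# Two values taken from the table: AlaAla (4.482) and CysCys (0.126).
ala_ala = classify(4.482)
cys_cys = classify(0.126)
```

With these inputs AlaAla corresponds to a fraction of about 0.004482 and is classified as overrepresented, while CysCys (about 0.000126) is classified as underrepresented.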

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.482 ± 0.14
AlaCys: 0.915 ± 0.045
AlaAsp: 3.883 ± 0.1
AlaGlu: 4.525 ± 0.102
AlaPhe: 2.511 ± 0.081
AlaGly: 5.273 ± 0.134
AlaHis: 1.171 ± 0.057
AlaIle: 6.332 ± 0.146
AlaLys: 4.358 ± 0.095
AlaLeu: 8.437 ± 0.177
AlaMet: 2.83 ± 0.085
AlaAsn: 2.47 ± 0.077
AlaPro: 1.693 ± 0.068
AlaGln: 1.409 ± 0.058
AlaArg: 6.618 ± 0.129
AlaSer: 6.126 ± 0.132
AlaThr: 3.135 ± 0.088
AlaVal: 5.835 ± 0.136
AlaTrp: 0.729 ± 0.048
AlaTyr: 3.396 ± 0.096
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.507 ± 0.034
CysCys: 0.126 ± 0.015
CysAsp: 0.536 ± 0.04
CysGlu: 0.416 ± 0.026
CysPhe: 0.334 ± 0.03
CysGly: 0.696 ± 0.043
CysHis: 0.147 ± 0.018
CysIle: 1.271 ± 0.056
CysLys: 0.746 ± 0.043
CysLeu: 0.683 ± 0.036
CysMet: 0.41 ± 0.027
CysAsn: 0.672 ± 0.042
CysPro: 0.49 ± 0.037
CysGln: 0.106 ± 0.015
CysArg: 0.75 ± 0.044
CysSer: 0.969 ± 0.049
CysThr: 0.523 ± 0.037
CysVal: 0.596 ± 0.033
CysTrp: 0.098 ± 0.015
CysTyr: 0.525 ± 0.038
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.946 ± 0.137
AspCys: 0.436 ± 0.033
AspAsp: 4.421 ± 0.12
AspGlu: 5.39 ± 0.115
AspPhe: 1.336 ± 0.052
AspGly: 5.716 ± 0.17
AspHis: 0.889 ± 0.042
AspIle: 4.449 ± 0.102
AspLys: 3.127 ± 0.088
AspLeu: 4.378 ± 0.107
AspMet: 2.079 ± 0.072
AspAsn: 1.993 ± 0.079
AspPro: 2.22 ± 0.075
AspGln: 0.698 ± 0.046
AspArg: 3.406 ± 0.081
AspSer: 3.287 ± 0.096
AspThr: 2.812 ± 0.091
AspVal: 5.733 ± 0.114
AspTrp: 0.475 ± 0.033
AspTyr: 2.112 ± 0.066
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.363 ± 0.095
GluCys: 0.802 ± 0.044
GluAsp: 3.738 ± 0.091
GluGlu: 5.954 ± 0.149
GluPhe: 2.609 ± 0.08
GluGly: 4.593 ± 0.106
GluHis: 2.502 ± 0.076
GluIle: 4.137 ± 0.106
GluLys: 3.487 ± 0.116
GluLeu: 5.972 ± 0.131
GluMet: 2.298 ± 0.066
GluAsn: 1.68 ± 0.064
GluPro: 2.35 ± 0.072
GluGln: 2.259 ± 0.079
GluArg: 4.877 ± 0.112
GluSer: 3.554 ± 0.098
GluThr: 1.394 ± 0.064
GluVal: 5.284 ± 0.125
GluTrp: 0.661 ± 0.035
GluTyr: 3.181 ± 0.088
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.483 ± 0.074
PheCys: 0.323 ± 0.027
PheAsp: 1.706 ± 0.062
PheGlu: 1.743 ± 0.07
PhePhe: 1.067 ± 0.049
PheGly: 2.197 ± 0.072
PheHis: 0.54 ± 0.038
PheIle: 3.192 ± 0.088
PheLys: 2.194 ± 0.067
PheLeu: 2.522 ± 0.082
PheMet: 1.108 ± 0.052
PheAsn: 1.772 ± 0.063
PhePro: 1.078 ± 0.051
PheGln: 0.481 ± 0.031
PheArg: 1.68 ± 0.057
PheSer: 2.129 ± 0.075
PheThr: 1.865 ± 0.073
PheVal: 2.073 ± 0.071
PheTrp: 0.336 ± 0.029
PheTyr: 1.314 ± 0.049
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.226 ± 0.115
GlyCys: 0.7 ± 0.047
GlyAsp: 3.43 ± 0.118
GlyGlu: 4.044 ± 0.09
GlyPhe: 2.522 ± 0.072
GlyGly: 4.339 ± 0.151
GlyHis: 1.119 ± 0.051
GlyIle: 6.822 ± 0.113
GlyLys: 5.154 ± 0.11
GlyLeu: 5.909 ± 0.121
GlyMet: 2.94 ± 0.078
GlyAsn: 2.964 ± 0.135
GlyPro: 1.416 ± 0.062
GlyGln: 1.036 ± 0.056
GlyArg: 4.632 ± 0.092
GlySer: 5.701 ± 0.123
GlyThr: 2.843 ± 0.077
GlyVal: 5.089 ± 0.117
GlyTrp: 0.679 ± 0.039
GlyTyr: 3.528 ± 0.095
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.73 ± 0.068
HisCys: 0.247 ± 0.024
HisAsp: 1.273 ± 0.05
HisGlu: 1.223 ± 0.052
HisPhe: 0.54 ± 0.034
HisGly: 1.748 ± 0.066
HisHis: 0.403 ± 0.033
HisIle: 1.654 ± 0.058
HisLys: 0.794 ± 0.042
HisLeu: 1.633 ± 0.06
HisMet: 0.655 ± 0.037
HisAsn: 0.82 ± 0.043
HisPro: 0.928 ± 0.049
HisGln: 0.265 ± 0.027
HisArg: 1.03 ± 0.05
HisSer: 1.071 ± 0.045
HisThr: 1.058 ± 0.054
HisVal: 1.633 ± 0.061
HisTrp: 0.208 ± 0.021
HisTyr: 0.848 ± 0.046
HisXaa: 0.0 ± 0.0
Ile
IleAla: 8.185 ± 0.171
IleCys: 0.856 ± 0.046
IleAsp: 6.076 ± 0.109
IleGlu: 5.471 ± 0.135
IlePhe: 2.457 ± 0.084
IleGly: 5.635 ± 0.112
IleHis: 1.483 ± 0.062
IleIle: 6.0 ± 0.15
IleLys: 4.243 ± 0.097
IleLeu: 6.893 ± 0.138
IleMet: 2.422 ± 0.066
IleAsn: 3.437 ± 0.086
IlePro: 3.352 ± 0.089
IleGln: 1.483 ± 0.063
IleArg: 4.89 ± 0.108
IleSer: 5.406 ± 0.119
IleThr: 4.146 ± 0.091
IleVal: 7.236 ± 0.125
IleTrp: 0.637 ± 0.039
IleTyr: 2.552 ± 0.077
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.608 ± 0.108
LysCys: 0.531 ± 0.034
LysAsp: 4.64 ± 0.115
LysGlu: 4.519 ± 0.116
LysPhe: 1.253 ± 0.062
LysGly: 5.375 ± 0.116
LysHis: 1.633 ± 0.067
LysIle: 3.651 ± 0.085
LysLys: 2.609 ± 0.088
LysLeu: 3.179 ± 0.089
LysMet: 1.665 ± 0.062
LysAsn: 1.802 ± 0.066
LysPro: 2.324 ± 0.072
LysGln: 1.589 ± 0.071
LysArg: 4.274 ± 0.102
LysSer: 3.664 ± 0.097
LysThr: 1.589 ± 0.048
LysVal: 5.579 ± 0.12
LysTrp: 0.362 ± 0.03
LysTyr: 1.98 ± 0.069
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.738 ± 0.175
LeuCys: 0.911 ± 0.042
LeuAsp: 5.382 ± 0.108
LeuGlu: 6.091 ± 0.102
LeuPhe: 3.044 ± 0.103
LeuGly: 5.575 ± 0.112
LeuHis: 1.798 ± 0.062
LeuIle: 6.806 ± 0.126
LeuLys: 5.954 ± 0.116
LeuLeu: 9.517 ± 0.202
LeuMet: 2.637 ± 0.092
LeuAsn: 4.118 ± 0.1
LeuPro: 3.339 ± 0.08
LeuGln: 2.125 ± 0.067
LeuArg: 5.31 ± 0.124
LeuSer: 6.206 ± 0.121
LeuThr: 4.239 ± 0.097
LeuVal: 6.154 ± 0.107
LeuTrp: 0.737 ± 0.04
LeuTyr: 3.417 ± 0.099
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.895 ± 0.063
MetCys: 0.219 ± 0.02
MetAsp: 2.253 ± 0.079
MetGlu: 1.687 ± 0.069
MetPhe: 0.735 ± 0.04
MetGly: 2.032 ± 0.068
MetHis: 1.112 ± 0.049
MetIle: 2.511 ± 0.085
MetLys: 1.884 ± 0.065
MetLeu: 5.87 ± 0.129
MetMet: 1.271 ± 0.056
MetAsn: 1.453 ± 0.054
MetPro: 1.318 ± 0.052
MetGln: 1.164 ± 0.051
MetArg: 2.04 ± 0.069
MetSer: 2.043 ± 0.067
MetThr: 0.744 ± 0.042
MetVal: 2.873 ± 0.072
MetTrp: 0.121 ± 0.017
MetTyr: 1.128 ± 0.055
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.56 ± 0.092
AsnCys: 0.395 ± 0.032
AsnAsp: 2.619 ± 0.128
AsnGlu: 2.147 ± 0.067
AsnPhe: 1.086 ± 0.049
AsnGly: 3.4 ± 0.1
AsnHis: 0.62 ± 0.042
AsnIle: 3.45 ± 0.096
AsnLys: 2.006 ± 0.067
AsnLeu: 3.112 ± 0.087
AsnMet: 1.34 ± 0.058
AsnAsn: 2.561 ± 0.139
AsnPro: 1.915 ± 0.078
AsnGln: 0.497 ± 0.038
AsnArg: 2.403 ± 0.072
AsnSer: 3.073 ± 0.12
AsnThr: 2.073 ± 0.076
AsnVal: 3.027 ± 0.08
AsnTrp: 0.269 ± 0.027
AsnTyr: 1.563 ± 0.074
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.305 ± 0.08
ProCys: 0.36 ± 0.03
ProAsp: 2.045 ± 0.077
ProGlu: 2.281 ± 0.078
ProPhe: 1.379 ± 0.06
ProGly: 1.652 ± 0.069
ProHis: 0.566 ± 0.04
ProIle: 2.645 ± 0.079
ProLys: 1.698 ± 0.065
ProLeu: 3.5 ± 0.1
ProMet: 1.006 ± 0.045
ProAsn: 1.316 ± 0.05
ProPro: 1.134 ± 0.056
ProGln: 0.594 ± 0.034
ProArg: 1.752 ± 0.053
ProSer: 2.739 ± 0.08
ProThr: 1.85 ± 0.074
ProVal: 2.68 ± 0.082
ProTrp: 0.399 ± 0.032
ProTyr: 1.748 ± 0.06
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.44 ± 0.061
GlnCys: 0.226 ± 0.021
GlnAsp: 1.084 ± 0.053
GlnGlu: 1.626 ± 0.068
GlnPhe: 0.707 ± 0.033
GlnGly: 1.581 ± 0.058
GlnHis: 0.486 ± 0.034
GlnIle: 1.553 ± 0.066
GlnLys: 0.846 ± 0.048
GlnLeu: 1.787 ± 0.066
GlnMet: 0.722 ± 0.038
GlnAsn: 0.594 ± 0.038
GlnPro: 0.598 ± 0.035
GlnGln: 0.852 ± 0.063
GlnArg: 1.182 ± 0.048
GlnSer: 1.206 ± 0.051
GlnThr: 0.655 ± 0.047
GlnVal: 1.748 ± 0.073
GlnTrp: 0.236 ± 0.029
GlnTyr: 0.811 ± 0.044
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.285 ± 0.102
ArgCys: 0.976 ± 0.046
ArgAsp: 4.087 ± 0.102
ArgGlu: 5.061 ± 0.114
ArgPhe: 2.958 ± 0.078
ArgGly: 3.623 ± 0.089
ArgHis: 1.151 ± 0.058
ArgIle: 5.677 ± 0.122
ArgLys: 2.806 ± 0.069
ArgLeu: 7.936 ± 0.151
ArgMet: 2.394 ± 0.075
ArgAsn: 1.83 ± 0.065
ArgPro: 1.494 ± 0.064
ArgGln: 1.251 ± 0.053
ArgArg: 4.423 ± 0.11
ArgSer: 3.925 ± 0.099
ArgThr: 1.737 ± 0.073
ArgVal: 5.169 ± 0.096
ArgTrp: 0.739 ± 0.044
ArgTyr: 3.103 ± 0.077
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.068 ± 0.098
SerCys: 0.657 ± 0.035
SerAsp: 3.331 ± 0.09
SerGlu: 2.704 ± 0.073
SerPhe: 1.618 ± 0.064
SerGly: 3.814 ± 0.081
SerHis: 0.83 ± 0.038
SerIle: 8.934 ± 0.166
SerLys: 5.358 ± 0.104
SerLeu: 5.451 ± 0.12
SerMet: 3.14 ± 0.088
SerAsn: 4.439 ± 0.13
SerPro: 1.847 ± 0.064
SerGln: 0.85 ± 0.044
SerArg: 4.842 ± 0.103
SerSer: 7.427 ± 0.219
SerThr: 3.801 ± 0.095
SerVal: 4.345 ± 0.108
SerTrp: 0.514 ± 0.038
SerTyr: 2.24 ± 0.069
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.294 ± 0.075
ThrCys: 0.481 ± 0.031
ThrAsp: 2.066 ± 0.076
ThrGlu: 1.667 ± 0.068
ThrPhe: 1.648 ± 0.07
ThrGly: 3.255 ± 0.092
ThrHis: 0.815 ± 0.041
ThrIle: 4.018 ± 0.117
ThrLys: 1.689 ± 0.06
ThrLeu: 4.89 ± 0.12
ThrMet: 1.331 ± 0.062
ThrAsn: 1.915 ± 0.074
ThrPro: 1.886 ± 0.078
ThrGln: 0.948 ± 0.052
ThrArg: 2.097 ± 0.057
ThrSer: 3.216 ± 0.072
ThrThr: 2.353 ± 0.088
ThrVal: 3.346 ± 0.093
ThrTrp: 0.392 ± 0.032
ThrTyr: 1.713 ± 0.063
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.939 ± 0.116
ValCys: 0.863 ± 0.044
ValAsp: 5.369 ± 0.116
ValGlu: 6.201 ± 0.129
ValPhe: 2.359 ± 0.081
ValGly: 5.041 ± 0.121
ValHis: 1.418 ± 0.061
ValIle: 5.722 ± 0.102
ValLys: 5.308 ± 0.121
ValLeu: 6.481 ± 0.112
ValMet: 2.494 ± 0.082
ValAsn: 2.971 ± 0.08
ValPro: 2.416 ± 0.077
ValGln: 1.481 ± 0.044
ValArg: 5.228 ± 0.118
ValSer: 4.896 ± 0.108
ValThr: 3.591 ± 0.103
ValVal: 6.477 ± 0.155
ValTrp: 0.733 ± 0.038
ValTyr: 3.096 ± 0.092
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.468 ± 0.031
TrpCys: 0.115 ± 0.014
TrpAsp: 0.529 ± 0.035
TrpGlu: 0.434 ± 0.03
TrpPhe: 0.471 ± 0.035
TrpGly: 0.516 ± 0.033
TrpHis: 0.286 ± 0.023
TrpIle: 0.703 ± 0.034
TrpLys: 0.54 ± 0.035
TrpLeu: 0.924 ± 0.056
TrpMet: 0.306 ± 0.026
TrpAsn: 0.371 ± 0.031
TrpPro: 0.256 ± 0.024
TrpGln: 0.275 ± 0.027
TrpArg: 0.549 ± 0.033
TrpSer: 0.629 ± 0.036
TrpThr: 0.28 ± 0.026
TrpVal: 0.659 ± 0.04
TrpTrp: 0.134 ± 0.017
TrpTyr: 0.397 ± 0.027
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.606 ± 0.103
TyrCys: 0.462 ± 0.032
TyrAsp: 2.435 ± 0.072
TyrGlu: 2.511 ± 0.08
TyrPhe: 1.171 ± 0.065
TyrGly: 3.031 ± 0.091
TyrHis: 0.789 ± 0.041
TyrIle: 3.335 ± 0.093
TyrLys: 2.027 ± 0.065
TyrLeu: 3.027 ± 0.097
TyrMet: 1.303 ± 0.052
TyrAsn: 1.938 ± 0.072
TyrPro: 1.592 ± 0.07
TyrGln: 0.572 ± 0.037
TyrArg: 2.643 ± 0.078
TyrSer: 2.945 ± 0.085
TyrThr: 2.333 ± 0.076
TyrVal: 2.55 ± 0.078
TyrTrp: 0.379 ± 0.029
TyrTyr: 1.75 ± 0.068
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 1766 proteins (461185 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
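A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the per mille frequency is recomputed for each resample, and the spread of the 100 estimates is reported. This is a minimal sketch under stated assumptions, not the database's actual procedure: the function names, the use of the sample standard deviation as the error measure, one-letter residue codes, and the fixed seed are all choices made for illustration.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Estimate the error of one dipeptide frequency (per mille).

    Assumption: the error is the standard deviation of the frequency over
    n_boot resamples of whole proteins drawn with replacement.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        # Resample proteins, not individual residues or pairs.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)


# Hypothetical toy sequences, not taken from the proteome.
toy_error = bootstrap_error(["MAAL", "ALLK", "GGGG"], "AL")
```

Resampling whole proteins rather than individual dipeptides keeps the pairs within a protein together, so the estimate reflects variation between proteins.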

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski