Amino acid dipepetide frequency for Oblitimonas alkaliphila

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.547AlaAla: 11.547 ± 0.18
1.113AlaCys: 1.113 ± 0.038
5.141AlaAsp: 5.141 ± 0.085
7.699AlaGlu: 7.699 ± 0.122
3.47AlaPhe: 3.47 ± 0.089
7.493AlaGly: 7.493 ± 0.12
1.968AlaHis: 1.968 ± 0.06
5.964AlaIle: 5.964 ± 0.099
5.025AlaLys: 5.025 ± 0.107
12.376AlaLeu: 12.376 ± 0.163
2.672AlaMet: 2.672 ± 0.061
3.344AlaAsn: 3.344 ± 0.075
3.781AlaPro: 3.781 ± 0.077
5.85AlaGln: 5.85 ± 0.103
4.926AlaArg: 4.926 ± 0.09
5.474AlaSer: 5.474 ± 0.089
4.822AlaThr: 4.822 ± 0.087
7.029AlaVal: 7.029 ± 0.113
1.401AlaTrp: 1.401 ± 0.053
2.565AlaTyr: 2.565 ± 0.061
0.0AlaXaa: 0.0 ± 0.0
Cys
0.952CysAla: 0.952 ± 0.037
0.128CysCys: 0.128 ± 0.013
0.492CysAsp: 0.492 ± 0.028
0.559CysGlu: 0.559 ± 0.027
0.358CysPhe: 0.358 ± 0.021
0.842CysGly: 0.842 ± 0.036
0.319CysHis: 0.319 ± 0.025
0.518CysIle: 0.518 ± 0.029
0.318CysLys: 0.318 ± 0.022
1.114CysLeu: 1.114 ± 0.043
0.195CysMet: 0.195 ± 0.016
0.302CysAsn: 0.302 ± 0.022
0.483CysPro: 0.483 ± 0.028
0.53CysGln: 0.53 ± 0.031
0.493CysArg: 0.493 ± 0.026
0.663CysSer: 0.663 ± 0.03
0.453CysThr: 0.453 ± 0.03
0.664CysVal: 0.664 ± 0.033
0.147CysTrp: 0.147 ± 0.015
0.285CysTyr: 0.285 ± 0.022
0.0CysXaa: 0.0 ± 0.0
Asp
4.446AspAla: 4.446 ± 0.075
0.535AspCys: 0.535 ± 0.028
2.054AspAsp: 2.054 ± 0.057
3.166AspGlu: 3.166 ± 0.072
2.095AspPhe: 2.095 ± 0.067
3.088AspGly: 3.088 ± 0.075
1.007AspHis: 1.007 ± 0.038
2.754AspIle: 2.754 ± 0.067
2.16AspLys: 2.16 ± 0.069
5.499AspLeu: 5.499 ± 0.095
1.165AspMet: 1.165 ± 0.046
1.705AspAsn: 1.705 ± 0.049
2.259AspPro: 2.259 ± 0.058
2.499AspGln: 2.499 ± 0.055
2.308AspArg: 2.308 ± 0.056
2.794AspSer: 2.794 ± 0.062
2.252AspThr: 2.252 ± 0.059
2.971AspVal: 2.971 ± 0.075
0.82AspTrp: 0.82 ± 0.037
1.759AspTyr: 1.759 ± 0.057
0.0AspXaa: 0.0 ± 0.0
Glu
5.927GluAla: 5.927 ± 0.114
0.453GluCys: 0.453 ± 0.03
2.481GluAsp: 2.481 ± 0.059
3.35GluGlu: 3.35 ± 0.1
2.087GluPhe: 2.087 ± 0.06
3.564GluGly: 3.564 ± 0.072
1.74GluHis: 1.74 ± 0.053
3.271GluIle: 3.271 ± 0.077
2.667GluLys: 2.667 ± 0.077
7.491GluLeu: 7.491 ± 0.115
1.407GluMet: 1.407 ± 0.046
1.854GluAsn: 1.854 ± 0.051
2.303GluPro: 2.303 ± 0.065
5.854GluGln: 5.854 ± 0.12
4.052GluArg: 4.052 ± 0.088
2.655GluSer: 2.655 ± 0.067
2.729GluThr: 2.729 ± 0.065
4.556GluVal: 4.556 ± 0.098
0.796GluTrp: 0.796 ± 0.033
1.545GluTyr: 1.545 ± 0.048
0.0GluXaa: 0.0 ± 0.0
Phe
3.944PheAla: 3.944 ± 0.083
0.426PheCys: 0.426 ± 0.026
2.255PheAsp: 2.255 ± 0.061
2.118PheGlu: 2.118 ± 0.062
1.579PhePhe: 1.579 ± 0.057
2.921PheGly: 2.921 ± 0.076
0.723PheHis: 0.723 ± 0.03
2.332PheIle: 2.332 ± 0.064
1.711PheLys: 1.711 ± 0.052
3.433PheLeu: 3.433 ± 0.078
0.879PheMet: 0.879 ± 0.039
1.762PheAsn: 1.762 ± 0.053
1.463PhePro: 1.463 ± 0.049
1.278PheGln: 1.278 ± 0.04
1.538PheArg: 1.538 ± 0.056
2.959PheSer: 2.959 ± 0.069
2.173PheThr: 2.173 ± 0.051
2.396PheVal: 2.396 ± 0.063
0.569PheTrp: 0.569 ± 0.034
1.194PheTyr: 1.194 ± 0.047
0.0PheXaa: 0.0 ± 0.0
Gly
6.315GlyAla: 6.315 ± 0.111
0.894GlyCys: 0.894 ± 0.037
3.23GlyAsp: 3.23 ± 0.071
4.139GlyGlu: 4.139 ± 0.087
3.108GlyPhe: 3.108 ± 0.071
4.962GlyGly: 4.962 ± 0.122
1.644GlyHis: 1.644 ± 0.053
4.314GlyIle: 4.314 ± 0.105
3.515GlyLys: 3.515 ± 0.089
8.407GlyLeu: 8.407 ± 0.147
1.897GlyMet: 1.897 ± 0.057
2.151GlyAsn: 2.151 ± 0.072
2.035GlyPro: 2.035 ± 0.058
3.388GlyGln: 3.388 ± 0.078
3.656GlyArg: 3.656 ± 0.083
3.934GlySer: 3.934 ± 0.081
3.289GlyThr: 3.289 ± 0.088
5.562GlyVal: 5.562 ± 0.109
1.04GlyTrp: 1.04 ± 0.037
2.308GlyTyr: 2.308 ± 0.06
0.0GlyXaa: 0.0 ± 0.0
His
2.057HisAla: 2.057 ± 0.053
0.352HisCys: 0.352 ± 0.025
0.924HisAsp: 0.924 ± 0.036
1.012HisGlu: 1.012 ± 0.048
0.977HisPhe: 0.977 ± 0.042
1.539HisGly: 1.539 ± 0.052
0.542HisHis: 0.542 ± 0.033
1.126HisIle: 1.126 ± 0.046
0.829HisLys: 0.829 ± 0.033
2.687HisLeu: 2.687 ± 0.069
0.483HisMet: 0.483 ± 0.028
0.827HisAsn: 0.827 ± 0.037
1.352HisPro: 1.352 ± 0.049
1.319HisGln: 1.319 ± 0.047
1.08HisArg: 1.08 ± 0.035
1.496HisSer: 1.496 ± 0.05
1.147HisThr: 1.147 ± 0.044
1.203HisVal: 1.203 ± 0.046
0.486HisTrp: 0.486 ± 0.028
0.872HisTyr: 0.872 ± 0.04
0.0HisXaa: 0.0 ± 0.0
Ile
6.169IleAla: 6.169 ± 0.105
0.55IleCys: 0.55 ± 0.03
3.167IleAsp: 3.167 ± 0.069
4.06IleGlu: 4.06 ± 0.083
1.945IlePhe: 1.945 ± 0.06
4.226IleGly: 4.226 ± 0.096
1.049IleHis: 1.049 ± 0.041
3.063IleIle: 3.063 ± 0.077
2.69IleLys: 2.69 ± 0.079
4.916IleLeu: 4.916 ± 0.078
1.113IleMet: 1.113 ± 0.042
2.31IleAsn: 2.31 ± 0.066
2.448IlePro: 2.448 ± 0.055
2.438IleGln: 2.438 ± 0.051
2.659IleArg: 2.659 ± 0.058
3.723IleSer: 3.723 ± 0.079
3.283IleThr: 3.283 ± 0.069
3.494IleVal: 3.494 ± 0.073
0.559IleTrp: 0.559 ± 0.03
1.483IleTyr: 1.483 ± 0.046
0.0IleXaa: 0.0 ± 0.0
Lys
4.528LysAla: 4.528 ± 0.11
0.233LysCys: 0.233 ± 0.019
2.065LysAsp: 2.065 ± 0.064
2.338LysGlu: 2.338 ± 0.065
1.139LysPhe: 1.139 ± 0.045
2.822LysGly: 2.822 ± 0.083
1.064LysHis: 1.064 ± 0.044
2.313LysIle: 2.313 ± 0.069
2.133LysLys: 2.133 ± 0.081
4.439LysLeu: 4.439 ± 0.09
1.077LysMet: 1.077 ± 0.039
1.676LysAsn: 1.676 ± 0.058
2.181LysPro: 2.181 ± 0.062
3.18LysGln: 3.18 ± 0.069
2.563LysArg: 2.563 ± 0.066
2.181LysSer: 2.181 ± 0.057
2.493LysThr: 2.493 ± 0.062
3.222LysVal: 3.222 ± 0.077
0.388LysTrp: 0.388 ± 0.024
1.031LysTyr: 1.031 ± 0.046
0.0LysXaa: 0.0 ± 0.0
Leu
14.037LeuAla: 14.037 ± 0.188
1.105LeuCys: 1.105 ± 0.043
5.771LeuAsp: 5.771 ± 0.095
6.917LeuGlu: 6.917 ± 0.11
4.281LeuPhe: 4.281 ± 0.101
8.387LeuGly: 8.387 ± 0.14
2.234LeuHis: 2.234 ± 0.064
6.334LeuIle: 6.334 ± 0.114
4.965LeuLys: 4.965 ± 0.09
13.653LeuLeu: 13.653 ± 0.232
2.665LeuMet: 2.665 ± 0.066
4.156LeuAsn: 4.156 ± 0.077
6.025LeuPro: 6.025 ± 0.104
6.033LeuGln: 6.033 ± 0.12
5.749LeuArg: 5.749 ± 0.103
7.704LeuSer: 7.704 ± 0.129
6.645LeuThr: 6.645 ± 0.116
7.615LeuVal: 7.615 ± 0.135
1.362LeuTrp: 1.362 ± 0.054
2.745LeuTyr: 2.745 ± 0.06
0.0LeuXaa: 0.0 ± 0.0
Met
2.558MetAla: 2.558 ± 0.066
0.162MetCys: 0.162 ± 0.017
1.013MetAsp: 1.013 ± 0.04
0.98MetGlu: 0.98 ± 0.042
0.668MetPhe: 0.668 ± 0.03
1.745MetGly: 1.745 ± 0.063
0.548MetHis: 0.548 ± 0.032
1.203MetIle: 1.203 ± 0.047
0.921MetLys: 0.921 ± 0.037
2.797MetLeu: 2.797 ± 0.067
0.55MetMet: 0.55 ± 0.031
0.781MetAsn: 0.781 ± 0.033
1.264MetPro: 1.264 ± 0.049
1.625MetGln: 1.625 ± 0.042
1.349MetArg: 1.349 ± 0.046
1.734MetSer: 1.734 ± 0.051
1.242MetThr: 1.242 ± 0.047
1.527MetVal: 1.527 ± 0.047
0.184MetTrp: 0.184 ± 0.016
0.429MetTyr: 0.429 ± 0.028
0.0MetXaa: 0.0 ± 0.0
Asn
3.128AsnAla: 3.128 ± 0.073
0.391AsnCys: 0.391 ± 0.025
1.539AsnAsp: 1.539 ± 0.052
1.74AsnGlu: 1.74 ± 0.052
1.341AsnPhe: 1.341 ± 0.043
2.291AsnGly: 2.291 ± 0.066
0.71AsnHis: 0.71 ± 0.029
1.891AsnIle: 1.891 ± 0.063
1.597AsnLys: 1.597 ± 0.055
3.919AsnLeu: 3.919 ± 0.087
0.772AsnMet: 0.772 ± 0.037
1.245AsnAsn: 1.245 ± 0.049
2.102AsnPro: 2.102 ± 0.055
2.068AsnGln: 2.068 ± 0.07
1.708AsnArg: 1.708 ± 0.049
2.029AsnSer: 2.029 ± 0.065
1.772AsnThr: 1.772 ± 0.054
2.008AsnVal: 2.008 ± 0.05
0.517AsnTrp: 0.517 ± 0.031
1.16AsnTyr: 1.16 ± 0.046
0.0AsnXaa: 0.0 ± 0.0
Pro
4.504ProAla: 4.504 ± 0.093
0.373ProCys: 0.373 ± 0.025
2.271ProAsp: 2.271 ± 0.066
3.886ProGlu: 3.886 ± 0.08
1.688ProPhe: 1.688 ± 0.049
2.646ProGly: 2.646 ± 0.062
0.966ProHis: 0.966 ± 0.04
2.426ProIle: 2.426 ± 0.065
1.992ProLys: 1.992 ± 0.054
5.018ProLeu: 5.018 ± 0.106
1.089ProMet: 1.089 ± 0.035
1.671ProAsn: 1.671 ± 0.047
1.355ProPro: 1.355 ± 0.052
2.08ProGln: 2.08 ± 0.066
1.793ProArg: 1.793 ± 0.05
2.708ProSer: 2.708 ± 0.067
2.245ProThr: 2.245 ± 0.064
3.47ProVal: 3.47 ± 0.065
0.657ProTrp: 0.657 ± 0.033
1.285ProTyr: 1.285 ± 0.047
0.0ProXaa: 0.0 ± 0.0
Gln
7.398GlnAla: 7.398 ± 0.12
0.379GlnCys: 0.379 ± 0.028
2.023GlnAsp: 2.023 ± 0.053
2.744GlnGlu: 2.744 ± 0.07
1.775GlnPhe: 1.775 ± 0.054
4.06GlnGly: 4.06 ± 0.095
1.772GlnHis: 1.772 ± 0.058
2.695GlnIle: 2.695 ± 0.065
1.662GlnLys: 1.662 ± 0.047
8.445GlnLeu: 8.445 ± 0.164
1.117GlnMet: 1.117 ± 0.039
1.255GlnAsn: 1.255 ± 0.05
2.895GlnPro: 2.895 ± 0.086
6.272GlnGln: 6.272 ± 0.193
3.911GlnArg: 3.911 ± 0.094
2.472GlnSer: 2.472 ± 0.066
2.399GlnThr: 2.399 ± 0.054
4.298GlnVal: 4.298 ± 0.093
0.882GlnTrp: 0.882 ± 0.037
1.243GlnTyr: 1.243 ± 0.039
0.0GlnXaa: 0.0 ± 0.0
Arg
4.474ArgAla: 4.474 ± 0.088
0.514ArgCys: 0.514 ± 0.026
2.481ArgAsp: 2.481 ± 0.066
3.258ArgGlu: 3.258 ± 0.081
2.374ArgPhe: 2.374 ± 0.056
3.163ArgGly: 3.163 ± 0.072
1.275ArgHis: 1.275 ± 0.047
3.027ArgIle: 3.027 ± 0.07
2.209ArgLys: 2.209 ± 0.058
6.691ArgLeu: 6.691 ± 0.112
1.301ArgMet: 1.301 ± 0.043
1.792ArgAsn: 1.792 ± 0.053
2.054ArgPro: 2.054 ± 0.056
3.044ArgGln: 3.044 ± 0.076
2.903ArgArg: 2.903 ± 0.074
2.891ArgSer: 2.891 ± 0.069
2.206ArgThr: 2.206 ± 0.066
3.641ArgVal: 3.641 ± 0.074
0.905ArgTrp: 0.905 ± 0.036
1.961ArgTyr: 1.961 ± 0.048
0.0ArgXaa: 0.0 ± 0.0
Ser
5.777SerAla: 5.777 ± 0.11
0.553SerCys: 0.553 ± 0.031
2.534SerAsp: 2.534 ± 0.063
3.273SerGlu: 3.273 ± 0.076
2.531SerPhe: 2.531 ± 0.067
4.572SerGly: 4.572 ± 0.089
1.365SerHis: 1.365 ± 0.051
3.247SerIle: 3.247 ± 0.075
2.363SerLys: 2.363 ± 0.068
7.212SerLeu: 7.212 ± 0.122
1.448SerMet: 1.448 ± 0.052
1.964SerAsn: 1.964 ± 0.055
2.362SerPro: 2.362 ± 0.063
3.068SerGln: 3.068 ± 0.075
2.935SerArg: 2.935 ± 0.064
3.77SerSer: 3.77 ± 0.086
2.95SerThr: 2.95 ± 0.065
3.94SerVal: 3.94 ± 0.089
0.878SerTrp: 0.878 ± 0.037
1.71SerTyr: 1.71 ± 0.054
0.0SerXaa: 0.0 ± 0.0
Thr
5.326ThrAla: 5.326 ± 0.098
0.459ThrCys: 0.459 ± 0.027
2.524ThrAsp: 2.524 ± 0.059
3.27ThrGlu: 3.27 ± 0.069
1.728ThrPhe: 1.728 ± 0.057
4.009ThrGly: 4.009 ± 0.089
1.181ThrHis: 1.181 ± 0.039
2.442ThrIle: 2.442 ± 0.058
1.713ThrLys: 1.713 ± 0.057
6.563ThrLeu: 6.563 ± 0.108
0.888ThrMet: 0.888 ± 0.04
1.352ThrAsn: 1.352 ± 0.046
2.891ThrPro: 2.891 ± 0.074
2.788ThrGln: 2.788 ± 0.067
2.505ThrArg: 2.505 ± 0.063
2.619ThrSer: 2.619 ± 0.068
2.601ThrThr: 2.601 ± 0.072
3.506ThrVal: 3.506 ± 0.078
0.602ThrTrp: 0.602 ± 0.031
1.251ThrTyr: 1.251 ± 0.044
0.0ThrXaa: 0.0 ± 0.0
Val
7.046ValAla: 7.046 ± 0.132
0.704ValCys: 0.704 ± 0.034
3.541ValAsp: 3.541 ± 0.073
4.388ValGlu: 4.388 ± 0.087
2.736ValPhe: 2.736 ± 0.064
4.737ValGly: 4.737 ± 0.1
1.227ValHis: 1.227 ± 0.041
4.436ValIle: 4.436 ± 0.075
3.192ValLys: 3.192 ± 0.069
7.928ValLeu: 7.928 ± 0.135
1.693ValMet: 1.693 ± 0.043
2.47ValAsn: 2.47 ± 0.067
2.851ValPro: 2.851 ± 0.063
3.044ValGln: 3.044 ± 0.072
3.4ValArg: 3.4 ± 0.072
4.148ValSer: 4.148 ± 0.087
3.775ValThr: 3.775 ± 0.075
5.015ValVal: 5.015 ± 0.092
0.764ValTrp: 0.764 ± 0.038
1.606ValTyr: 1.606 ± 0.046
0.0ValXaa: 0.0 ± 0.0
Trp
0.985TrpAla: 0.985 ± 0.039
0.192TrpCys: 0.192 ± 0.016
0.523TrpAsp: 0.523 ± 0.029
0.544TrpGlu: 0.544 ± 0.027
0.55TrpPhe: 0.55 ± 0.029
0.744TrpGly: 0.744 ± 0.036
0.355TrpHis: 0.355 ± 0.024
0.648TrpIle: 0.648 ± 0.032
0.435TrpLys: 0.435 ± 0.027
2.358TrpLeu: 2.358 ± 0.073
0.325TrpMet: 0.325 ± 0.026
0.455TrpAsn: 0.455 ± 0.029
0.633TrpPro: 0.633 ± 0.03
1.266TrpGln: 1.266 ± 0.053
0.859TrpArg: 0.859 ± 0.04
0.781TrpSer: 0.781 ± 0.034
0.507TrpThr: 0.507 ± 0.027
0.894TrpVal: 0.894 ± 0.041
0.204TrpTrp: 0.204 ± 0.019
0.379TrpTyr: 0.379 ± 0.027
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.543TyrAla: 2.543 ± 0.058
0.3TyrCys: 0.3 ± 0.02
1.309TyrAsp: 1.309 ± 0.047
1.343TyrGlu: 1.343 ± 0.043
1.224TyrPhe: 1.224 ± 0.044
1.974TyrGly: 1.974 ± 0.057
0.621TyrHis: 0.621 ± 0.032
1.337TyrIle: 1.337 ± 0.052
0.967TyrLys: 0.967 ± 0.042
3.437TyrLeu: 3.437 ± 0.069
0.544TyrMet: 0.544 ± 0.028
0.857TyrAsn: 0.857 ± 0.04
1.385TyrPro: 1.385 ± 0.045
2.075TyrGln: 2.075 ± 0.062
1.775TyrArg: 1.775 ± 0.05
1.72TyrSer: 1.72 ± 0.043
1.234TyrThr: 1.234 ± 0.05
1.691TyrVal: 1.691 ± 0.052
0.475TyrTrp: 0.475 ± 0.026
0.848TyrTyr: 0.848 ± 0.042
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2127 proteins (673168 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski