Amino acid dipepetide frequency for Candidatus Paraburkholderia kirkii

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
15.43AlaAla: 15.43 ± 0.234
1.233AlaCys: 1.233 ± 0.046
6.681AlaAsp: 6.681 ± 0.105
6.405AlaGlu: 6.405 ± 0.113
4.191AlaPhe: 4.191 ± 0.072
9.35AlaGly: 9.35 ± 0.151
2.873AlaHis: 2.873 ± 0.068
5.398AlaIle: 5.398 ± 0.094
4.526AlaLys: 4.526 ± 0.092
13.446AlaLeu: 13.446 ± 0.174
3.245AlaMet: 3.245 ± 0.067
3.216AlaAsn: 3.216 ± 0.076
5.34AlaPro: 5.34 ± 0.092
4.906AlaGln: 4.906 ± 0.098
9.293AlaArg: 9.293 ± 0.163
6.942AlaSer: 6.942 ± 0.132
5.405AlaThr: 5.405 ± 0.099
8.242AlaVal: 8.242 ± 0.108
1.483AlaTrp: 1.483 ± 0.051
2.634AlaTyr: 2.634 ± 0.067
0.0AlaXaa: 0.0 ± 0.0
Cys
1.179CysAla: 1.179 ± 0.042
0.208CysCys: 0.208 ± 0.018
0.784CysAsp: 0.784 ± 0.038
0.562CysGlu: 0.562 ± 0.026
0.338CysPhe: 0.338 ± 0.024
1.034CysGly: 1.034 ± 0.038
0.267CysHis: 0.267 ± 0.023
0.437CysIle: 0.437 ± 0.024
0.243CysLys: 0.243 ± 0.019
0.785CysLeu: 0.785 ± 0.031
0.182CysMet: 0.182 ± 0.016
0.258CysAsn: 0.258 ± 0.02
0.527CysPro: 0.527 ± 0.027
0.223CysGln: 0.223 ± 0.018
0.675CysArg: 0.675 ± 0.036
0.621CysSer: 0.621 ± 0.031
0.518CysThr: 0.518 ± 0.03
0.719CysVal: 0.719 ± 0.034
0.109CysTrp: 0.109 ± 0.013
0.282CysTyr: 0.282 ± 0.021
0.0CysXaa: 0.0 ± 0.0
Asp
7.917AspAla: 7.917 ± 0.145
0.55AspCys: 0.55 ± 0.029
3.486AspAsp: 3.486 ± 0.082
3.873AspGlu: 3.873 ± 0.069
2.235AspPhe: 2.235 ± 0.057
4.631AspGly: 4.631 ± 0.087
1.222AspHis: 1.222 ± 0.046
2.821AspIle: 2.821 ± 0.071
1.843AspLys: 1.843 ± 0.069
5.287AspLeu: 5.287 ± 0.096
1.365AspMet: 1.365 ± 0.046
1.5AspAsn: 1.5 ± 0.049
2.883AspPro: 2.883 ± 0.067
1.643AspGln: 1.643 ± 0.051
3.613AspArg: 3.613 ± 0.076
2.526AspSer: 2.526 ± 0.059
2.854AspThr: 2.854 ± 0.071
4.536AspVal: 4.536 ± 0.083
0.981AspTrp: 0.981 ± 0.035
1.766AspTyr: 1.766 ± 0.057
0.0AspXaa: 0.0 ± 0.0
Glu
7.035GluAla: 7.035 ± 0.124
0.497GluCys: 0.497 ± 0.029
2.416GluAsp: 2.416 ± 0.061
2.909GluGlu: 2.909 ± 0.084
2.057GluPhe: 2.057 ± 0.054
3.757GluGly: 3.757 ± 0.091
1.622GluHis: 1.622 ± 0.05
3.121GluIle: 3.121 ± 0.078
2.428GluLys: 2.428 ± 0.072
5.73GluLeu: 5.73 ± 0.089
1.39GluMet: 1.39 ± 0.051
1.736GluAsn: 1.736 ± 0.057
2.573GluPro: 2.573 ± 0.071
2.332GluGln: 2.332 ± 0.065
5.216GluArg: 5.216 ± 0.095
2.945GluSer: 2.945 ± 0.069
3.157GluThr: 3.157 ± 0.076
3.947GluVal: 3.947 ± 0.096
0.794GluTrp: 0.794 ± 0.032
1.42GluTyr: 1.42 ± 0.045
0.002GluXaa: 0.002 ± 0.001
Phe
4.396PheAla: 4.396 ± 0.087
0.436PheCys: 0.436 ± 0.026
2.87PheAsp: 2.87 ± 0.073
2.351PheGlu: 2.351 ± 0.063
1.468PhePhe: 1.468 ± 0.055
3.6PheGly: 3.6 ± 0.082
0.847PheHis: 0.847 ± 0.036
1.798PheIle: 1.798 ± 0.05
1.148PheLys: 1.148 ± 0.048
2.879PheLeu: 2.879 ± 0.069
0.814PheMet: 0.814 ± 0.033
1.16PheAsn: 1.16 ± 0.04
1.578PhePro: 1.578 ± 0.056
0.99PheGln: 0.99 ± 0.036
2.244PheArg: 2.244 ± 0.06
2.223PheSer: 2.223 ± 0.061
2.086PheThr: 2.086 ± 0.064
3.059PheVal: 3.059 ± 0.067
0.417PheTrp: 0.417 ± 0.027
0.922PheTyr: 0.922 ± 0.039
0.0PheXaa: 0.0 ± 0.0
Gly
8.793GlyAla: 8.793 ± 0.162
0.862GlyCys: 0.862 ± 0.034
4.256GlyAsp: 4.256 ± 0.084
4.633GlyGlu: 4.633 ± 0.08
3.393GlyPhe: 3.393 ± 0.072
6.327GlyGly: 6.327 ± 0.132
1.896GlyHis: 1.896 ± 0.061
4.224GlyIle: 4.224 ± 0.079
3.649GlyLys: 3.649 ± 0.079
7.471GlyLeu: 7.471 ± 0.111
2.276GlyMet: 2.276 ± 0.066
2.188GlyAsn: 2.188 ± 0.058
2.612GlyPro: 2.612 ± 0.069
2.469GlyGln: 2.469 ± 0.065
5.49GlyArg: 5.49 ± 0.113
4.246GlySer: 4.246 ± 0.074
4.148GlyThr: 4.148 ± 0.087
6.461GlyVal: 6.461 ± 0.104
1.148GlyTrp: 1.148 ± 0.043
2.249GlyTyr: 2.249 ± 0.061
0.002GlyXaa: 0.002 ± 0.002
His
3.126HisAla: 3.126 ± 0.072
0.291HisCys: 0.291 ± 0.021
1.548HisAsp: 1.548 ± 0.045
1.299HisGlu: 1.299 ± 0.04
0.999HisPhe: 0.999 ± 0.038
2.274HisGly: 2.274 ± 0.064
0.6HisHis: 0.6 ± 0.032
0.972HisIle: 0.972 ± 0.038
0.677HisLys: 0.677 ± 0.033
2.166HisLeu: 2.166 ± 0.061
0.518HisMet: 0.518 ± 0.032
0.556HisAsn: 0.556 ± 0.033
1.489HisPro: 1.489 ± 0.052
0.609HisGln: 0.609 ± 0.03
1.735HisArg: 1.735 ± 0.055
1.157HisSer: 1.157 ± 0.046
1.067HisThr: 1.067 ± 0.044
1.795HisVal: 1.795 ± 0.051
0.342HisTrp: 0.342 ± 0.023
0.672HisTyr: 0.672 ± 0.029
0.0HisXaa: 0.0 ± 0.0
Ile
7.01IleAla: 7.01 ± 0.113
0.488IleCys: 0.488 ± 0.026
3.723IleAsp: 3.723 ± 0.069
3.682IleGlu: 3.682 ± 0.076
1.552IlePhe: 1.552 ± 0.053
4.841IleGly: 4.841 ± 0.088
1.016IleHis: 1.016 ± 0.04
1.816IleIle: 1.816 ± 0.062
1.724IleLys: 1.724 ± 0.055
3.543IleLeu: 3.543 ± 0.078
0.864IleMet: 0.864 ± 0.041
1.445IleAsn: 1.445 ± 0.05
2.069IlePro: 2.069 ± 0.061
1.27IleGln: 1.27 ± 0.044
2.934IleArg: 2.934 ± 0.063
2.558IleSer: 2.558 ± 0.069
2.321IleThr: 2.321 ± 0.058
4.488IleVal: 4.488 ± 0.097
0.506IleTrp: 0.506 ± 0.029
0.995IleTyr: 0.995 ± 0.042
0.0IleXaa: 0.0 ± 0.0
Lys
3.997LysAla: 3.997 ± 0.103
0.25LysCys: 0.25 ± 0.017
1.843LysAsp: 1.843 ± 0.059
1.858LysGlu: 1.858 ± 0.055
1.168LysPhe: 1.168 ± 0.047
2.475LysGly: 2.475 ± 0.066
0.861LysHis: 0.861 ± 0.038
2.009LysIle: 2.009 ± 0.069
1.76LysLys: 1.76 ± 0.06
3.92LysLeu: 3.92 ± 0.082
1.069LysMet: 1.069 ± 0.04
1.182LysAsn: 1.182 ± 0.043
2.341LysPro: 2.341 ± 0.064
1.507LysGln: 1.507 ± 0.048
3.044LysArg: 3.044 ± 0.073
2.105LysSer: 2.105 ± 0.063
2.294LysThr: 2.294 ± 0.061
2.613LysVal: 2.613 ± 0.078
0.399LysTrp: 0.399 ± 0.024
0.906LysTyr: 0.906 ± 0.039
0.0LysXaa: 0.0 ± 0.0
Leu
12.868LeuAla: 12.868 ± 0.179
0.942LeuCys: 0.942 ± 0.032
6.059LeuAsp: 6.059 ± 0.104
5.337LeuGlu: 5.337 ± 0.094
3.365LeuPhe: 3.365 ± 0.078
7.584LeuGly: 7.584 ± 0.111
2.258LeuHis: 2.258 ± 0.061
4.634LeuIle: 4.634 ± 0.095
4.128LeuLys: 4.128 ± 0.088
9.23LeuLeu: 9.23 ± 0.151
2.265LeuMet: 2.265 ± 0.059
2.848LeuAsn: 2.848 ± 0.068
5.233LeuPro: 5.233 ± 0.091
3.279LeuGln: 3.279 ± 0.073
6.924LeuArg: 6.924 ± 0.112
6.034LeuSer: 6.034 ± 0.109
5.02LeuThr: 5.02 ± 0.078
7.046LeuVal: 7.046 ± 0.12
0.966LeuTrp: 0.966 ± 0.036
2.069LeuTyr: 2.069 ± 0.062
0.0LeuXaa: 0.0 ± 0.0
Met
2.485MetAla: 2.485 ± 0.065
0.181MetCys: 0.181 ± 0.018
1.022MetAsp: 1.022 ± 0.047
1.126MetGlu: 1.126 ± 0.043
0.873MetPhe: 0.873 ± 0.042
1.471MetGly: 1.471 ± 0.051
0.606MetHis: 0.606 ± 0.033
1.188MetIle: 1.188 ± 0.043
1.179MetLys: 1.179 ± 0.04
2.856MetLeu: 2.856 ± 0.072
0.653MetMet: 0.653 ± 0.031
0.912MetAsn: 0.912 ± 0.034
1.721MetPro: 1.721 ± 0.051
1.014MetGln: 1.014 ± 0.034
2.078MetArg: 2.078 ± 0.051
1.754MetSer: 1.754 ± 0.053
1.54MetThr: 1.54 ± 0.044
1.581MetVal: 1.581 ± 0.056
0.223MetTrp: 0.223 ± 0.021
0.475MetTyr: 0.475 ± 0.031
0.0MetXaa: 0.0 ± 0.0
Asn
3.497AsnAla: 3.497 ± 0.073
0.315AsnCys: 0.315 ± 0.023
1.602AsnAsp: 1.602 ± 0.045
1.546AsnGlu: 1.546 ± 0.054
1.063AsnPhe: 1.063 ± 0.044
2.601AsnGly: 2.601 ± 0.065
0.559AsnHis: 0.559 ± 0.031
1.355AsnIle: 1.355 ± 0.047
0.924AsnLys: 0.924 ± 0.041
2.876AsnLeu: 2.876 ± 0.073
0.717AsnMet: 0.717 ± 0.033
0.853AsnAsn: 0.853 ± 0.057
1.91AsnPro: 1.91 ± 0.055
0.892AsnGln: 0.892 ± 0.036
1.967AsnArg: 1.967 ± 0.047
1.249AsnSer: 1.249 ± 0.044
1.418AsnThr: 1.418 ± 0.048
2.325AsnVal: 2.325 ± 0.061
0.467AsnTrp: 0.467 ± 0.029
0.735AsnTyr: 0.735 ± 0.034
0.0AsnXaa: 0.0 ± 0.0
Pro
5.816ProAla: 5.816 ± 0.118
0.387ProCys: 0.387 ± 0.025
3.353ProAsp: 3.353 ± 0.065
3.172ProGlu: 3.172 ± 0.077
1.92ProPhe: 1.92 ± 0.055
3.825ProGly: 3.825 ± 0.086
1.31ProHis: 1.31 ± 0.042
2.033ProIle: 2.033 ± 0.052
1.742ProLys: 1.742 ± 0.049
4.713ProLeu: 4.713 ± 0.087
1.06ProMet: 1.06 ± 0.042
1.53ProAsn: 1.53 ± 0.046
2.325ProPro: 2.325 ± 0.066
1.78ProGln: 1.78 ± 0.052
3.153ProArg: 3.153 ± 0.083
2.628ProSer: 2.628 ± 0.064
2.448ProThr: 2.448 ± 0.06
4.236ProVal: 4.236 ± 0.071
0.558ProTrp: 0.558 ± 0.029
1.219ProTyr: 1.219 ± 0.04
0.0ProXaa: 0.0 ± 0.0
Gln
4.197GlnAla: 4.197 ± 0.078
0.294GlnCys: 0.294 ± 0.021
1.429GlnAsp: 1.429 ± 0.047
1.525GlnGlu: 1.525 ± 0.051
1.18GlnPhe: 1.18 ± 0.042
2.411GlnGly: 2.411 ± 0.053
0.809GlnHis: 0.809 ± 0.029
1.946GlnIle: 1.946 ± 0.051
1.379GlnLys: 1.379 ± 0.049
3.346GlnLeu: 3.346 ± 0.082
0.998GlnMet: 0.998 ± 0.043
1.005GlnAsn: 1.005 ± 0.037
1.875GlnPro: 1.875 ± 0.051
1.769GlnGln: 1.769 ± 0.15
2.794GlnArg: 2.794 ± 0.077
1.869GlnSer: 1.869 ± 0.06
1.806GlnThr: 1.806 ± 0.053
2.426GlnVal: 2.426 ± 0.063
0.484GlnTrp: 0.484 ± 0.028
0.894GlnTyr: 0.894 ± 0.039
0.0GlnXaa: 0.0 ± 0.0
Arg
8.556ArgAla: 8.556 ± 0.145
0.668ArgCys: 0.668 ± 0.031
4.208ArgAsp: 4.208 ± 0.072
4.995ArgGlu: 4.995 ± 0.098
3.147ArgPhe: 3.147 ± 0.07
4.998ArgGly: 4.998 ± 0.089
2.209ArgHis: 2.209 ± 0.064
3.873ArgIle: 3.873 ± 0.059
2.405ArgLys: 2.405 ± 0.063
6.854ArgLeu: 6.854 ± 0.108
1.944ArgMet: 1.944 ± 0.06
2.01ArgAsn: 2.01 ± 0.057
3.079ArgPro: 3.079 ± 0.067
2.461ArgGln: 2.461 ± 0.061
6.114ArgArg: 6.114 ± 0.134
3.778ArgSer: 3.778 ± 0.091
3.593ArgThr: 3.593 ± 0.068
5.533ArgVal: 5.533 ± 0.102
1.073ArgTrp: 1.073 ± 0.045
1.991ArgTyr: 1.991 ± 0.06
0.0ArgXaa: 0.0 ± 0.0
Ser
6.36SerAla: 6.36 ± 0.092
0.595SerCys: 0.595 ± 0.032
2.957SerAsp: 2.957 ± 0.064
2.713SerGlu: 2.713 ± 0.068
2.105SerPhe: 2.105 ± 0.057
4.954SerGly: 4.954 ± 0.094
1.289SerHis: 1.289 ± 0.046
2.779SerIle: 2.779 ± 0.068
1.9SerLys: 1.9 ± 0.054
5.338SerLeu: 5.338 ± 0.109
1.394SerMet: 1.394 ± 0.051
1.771SerAsn: 1.771 ± 0.045
2.764SerPro: 2.764 ± 0.065
1.718SerGln: 1.718 ± 0.055
4.128SerArg: 4.128 ± 0.082
3.42SerSer: 3.42 ± 0.091
2.99SerThr: 2.99 ± 0.071
4.015SerVal: 4.015 ± 0.084
0.806SerTrp: 0.806 ± 0.039
1.251SerTyr: 1.251 ± 0.043
0.0SerXaa: 0.0 ± 0.0
Thr
5.285ThrAla: 5.285 ± 0.097
0.463ThrCys: 0.463 ± 0.026
2.847ThrAsp: 2.847 ± 0.062
2.413ThrGlu: 2.413 ± 0.066
1.851ThrPhe: 1.851 ± 0.048
4.551ThrGly: 4.551 ± 0.088
1.219ThrHis: 1.219 ± 0.045
2.776ThrIle: 2.776 ± 0.074
1.563ThrLys: 1.563 ± 0.048
6.037ThrLeu: 6.037 ± 0.103
1.293ThrMet: 1.293 ± 0.041
1.329ThrAsn: 1.329 ± 0.051
3.289ThrPro: 3.289 ± 0.073
1.756ThrGln: 1.756 ± 0.055
3.551ThrArg: 3.551 ± 0.076
2.681ThrSer: 2.681 ± 0.066
2.716ThrThr: 2.716 ± 0.065
4.062ThrVal: 4.062 ± 0.075
0.553ThrTrp: 0.553 ± 0.028
1.272ThrTyr: 1.272 ± 0.041
0.0ThrXaa: 0.0 ± 0.0
Val
8.607ValAla: 8.607 ± 0.127
0.814ValCys: 0.814 ± 0.035
4.502ValAsp: 4.502 ± 0.087
4.707ValGlu: 4.707 ± 0.093
2.811ValPhe: 2.811 ± 0.065
5.194ValGly: 5.194 ± 0.096
1.585ValHis: 1.585 ± 0.051
3.95ValIle: 3.95 ± 0.086
3.106ValLys: 3.106 ± 0.08
7.481ValLeu: 7.481 ± 0.123
1.986ValMet: 1.986 ± 0.056
2.285ValAsn: 2.285 ± 0.061
3.873ValPro: 3.873 ± 0.075
2.399ValGln: 2.399 ± 0.06
5.296ValArg: 5.296 ± 0.085
4.452ValSer: 4.452 ± 0.07
4.273ValThr: 4.273 ± 0.086
6.045ValVal: 6.045 ± 0.105
0.879ValTrp: 0.879 ± 0.042
1.667ValTyr: 1.667 ± 0.053
0.0ValXaa: 0.0 ± 0.0
Trp
1.097TrpAla: 1.097 ± 0.043
0.216TrpCys: 0.216 ± 0.018
0.585TrpAsp: 0.585 ± 0.029
0.484TrpGlu: 0.484 ± 0.025
0.509TrpPhe: 0.509 ± 0.028
0.725TrpGly: 0.725 ± 0.033
0.329TrpHis: 0.329 ± 0.025
0.719TrpIle: 0.719 ± 0.032
0.46TrpLys: 0.46 ± 0.026
1.748TrpLeu: 1.748 ± 0.051
0.405TrpMet: 0.405 ± 0.024
0.419TrpAsn: 0.419 ± 0.027
0.577TrpPro: 0.577 ± 0.032
0.616TrpGln: 0.616 ± 0.036
1.108TrpArg: 1.108 ± 0.042
0.743TrpSer: 0.743 ± 0.037
0.613TrpThr: 0.613 ± 0.03
0.868TrpVal: 0.868 ± 0.035
0.194TrpTrp: 0.194 ± 0.017
0.292TrpTyr: 0.292 ± 0.022
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.589TyrAla: 2.589 ± 0.063
0.261TyrCys: 0.261 ± 0.017
1.365TyrAsp: 1.365 ± 0.046
1.494TyrGlu: 1.494 ± 0.054
1.063TyrPhe: 1.063 ± 0.036
2.187TyrGly: 2.187 ± 0.056
0.518TyrHis: 0.518 ± 0.028
0.995TyrIle: 0.995 ± 0.035
0.817TyrLys: 0.817 ± 0.036
2.381TyrLeu: 2.381 ± 0.055
0.497TyrMet: 0.497 ± 0.028
0.668TyrAsn: 0.668 ± 0.034
1.151TyrPro: 1.151 ± 0.038
0.802TyrGln: 0.802 ± 0.032
2.131TyrArg: 2.131 ± 0.061
1.316TyrSer: 1.316 ± 0.044
1.24TyrThr: 1.24 ± 0.044
1.913TyrVal: 1.913 ± 0.051
0.324TyrTrp: 0.324 ± 0.021
0.656TyrTyr: 0.656 ± 0.034
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.002XaaLeu: 0.002 ± 0.002
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.002XaaArg: 0.002 ± 0.001
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2202 proteins (663521 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski