Amino acid dipeptide frequency for Bordetella hinzii

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs in the proteome.
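
As an illustration of what a dipeptide frequency is, the following minimal sketch counts overlapping pairs of adjacent residues and reports them in per mille. It is not the code used to build this page: the function name `dipeptide_frequencies`, the toy sequences, the use of one-letter codes, and the choice to normalise by the total number of dipeptides are all assumptions made for the example.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: pairs are counted with overlapping windows of length 2
    inside each protein, and never across two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input (one-letter codes), not real B. hinzii proteins.
toy = ["MAAL", "AAG"]
freqs = dipeptide_frequencies(toy)
print(freqs["AA"])  # 400.0 per mille: 2 of the 5 dipeptides are AA
```

Note that the order matters: AlaCys and CysAla are different dipeptides, which is why the table below is not symmetric.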

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with every amino acid equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
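
The expected value and the over/under comparison can be sketched as follows. This is only an illustration: the name `classify` and the rule of comparing each value directly with the uniform expectation are assumptions, and the page may apply a different threshold when colouring cells. The three example values are taken from the table below.

```python
N_STANDARD = 20  # standard amino acids, Xaa excluded

# Uniform expectation: 1 / (20 * 20) = 0.25 % = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / N_STANDARD**2


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"   # shown in red on the page
    if freq_per_mille < expected:
        return "under"  # shown in blue on the page
    return "as expected"


# Values copied from the table (per mille).
examples = {"AlaAla": 19.241, "CysCys": 0.113, "AspThr": 2.569}
for name, value in examples.items():
    print(name, value, classify(value))
```

A uniform expectation ignores the fact that single amino acids are themselves not equally frequent, so an abundant residue such as Ala tends to produce overrepresented dipeptides under this simple comparison.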

All values are given in per mille (‰) and must therefore be multiplied by 10⁻³ to obtain fractions.
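
For readers who prefer fractions or percentages, the unit conversion is a single division. The helper names below are hypothetical and exist only for this example.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10**-3)."""
    return value / 1000.0


def per_mille_to_percent(value):
    """Convert a per mille value to percent."""
    return value / 10.0


# AlaAla from the table: 19.241 per mille is roughly 1.9 % of all dipeptides.
print(per_mille_to_fraction(19.241))
print(per_mille_to_percent(19.241))
```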

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 19.241 ± 0.173
AlaCys: 1.376 ± 0.04
AlaAsp: 6.77 ± 0.076
AlaGlu: 6.851 ± 0.077
AlaPhe: 3.946 ± 0.054
AlaGly: 12.67 ± 0.151
AlaHis: 2.553 ± 0.049
AlaIle: 5.531 ± 0.057
AlaLys: 2.933 ± 0.055
AlaLeu: 15.841 ± 0.162
AlaMet: 3.675 ± 0.058
AlaAsn: 2.61 ± 0.058
AlaPro: 6.284 ± 0.084
AlaGln: 6.01 ± 0.082
AlaArg: 11.31 ± 0.116
AlaSer: 6.527 ± 0.073
AlaThr: 5.386 ± 0.076
AlaVal: 9.183 ± 0.089
AlaTrp: 2.081 ± 0.046
AlaTyr: 2.958 ± 0.045
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.151 ± 0.032
CysCys: 0.113 ± 0.009
CysAsp: 0.496 ± 0.018
CysGlu: 0.439 ± 0.018
CysPhe: 0.285 ± 0.015
CysGly: 0.933 ± 0.024
CysHis: 0.233 ± 0.012
CysIle: 0.347 ± 0.017
CysLys: 0.168 ± 0.012
CysLeu: 0.943 ± 0.029
CysMet: 0.197 ± 0.012
CysAsn: 0.201 ± 0.013
CysPro: 0.451 ± 0.02
CysGln: 0.273 ± 0.014
CysArg: 0.614 ± 0.023
CysSer: 0.403 ± 0.015
CysThr: 0.364 ± 0.016
CysVal: 0.699 ± 0.023
CysTrp: 0.11 ± 0.008
CysTyr: 0.199 ± 0.012
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.697 ± 0.077
AspCys: 0.454 ± 0.019
AspAsp: 2.659 ± 0.044
AspGlu: 3.042 ± 0.058
AspPhe: 2.044 ± 0.036
AspGly: 4.717 ± 0.075
AspHis: 1.062 ± 0.031
AspIle: 2.819 ± 0.044
AspLys: 1.486 ± 0.034
AspLeu: 5.498 ± 0.064
AspMet: 1.326 ± 0.029
AspAsn: 1.197 ± 0.032
AspPro: 3.199 ± 0.046
AspGln: 1.605 ± 0.034
AspArg: 3.481 ± 0.053
AspSer: 2.331 ± 0.049
AspThr: 2.569 ± 0.04
AspVal: 3.978 ± 0.062
AspTrp: 0.978 ± 0.027
AspTyr: 1.515 ± 0.034
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.897 ± 0.081
GluCys: 0.363 ± 0.017
GluAsp: 2.383 ± 0.047
GluGlu: 2.291 ± 0.05
GluPhe: 1.59 ± 0.035
GluGly: 3.789 ± 0.052
GluHis: 1.286 ± 0.033
GluIle: 2.681 ± 0.047
GluLys: 1.53 ± 0.041
GluLeu: 5.506 ± 0.071
GluMet: 1.148 ± 0.03
GluAsn: 1.302 ± 0.03
GluPro: 2.525 ± 0.048
GluGln: 2.497 ± 0.044
GluArg: 4.593 ± 0.073
GluSer: 2.288 ± 0.042
GluThr: 2.449 ± 0.045
GluVal: 3.831 ± 0.058
GluTrp: 0.619 ± 0.024
GluTyr: 1.06 ± 0.028
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.986 ± 0.061
PheCys: 0.35 ± 0.016
PheAsp: 2.313 ± 0.043
PheGlu: 1.877 ± 0.039
PhePhe: 1.263 ± 0.039
PheGly: 3.37 ± 0.055
PheHis: 0.667 ± 0.02
PheIle: 1.476 ± 0.029
PheLys: 1.012 ± 0.027
PheLeu: 3.062 ± 0.057
PheMet: 0.789 ± 0.028
PheAsn: 0.974 ± 0.024
PhePro: 1.486 ± 0.034
PheGln: 1.08 ± 0.031
PheArg: 1.859 ± 0.038
PheSer: 2.016 ± 0.04
PheThr: 1.656 ± 0.034
PheVal: 2.588 ± 0.048
PheTrp: 0.51 ± 0.02
PheTyr: 0.882 ± 0.027
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 9.865 ± 0.142
GlyCys: 0.805 ± 0.025
GlyAsp: 3.975 ± 0.079
GlyGlu: 4.392 ± 0.052
GlyPhe: 3.145 ± 0.053
GlyGly: 7.564 ± 0.154
GlyHis: 1.913 ± 0.042
GlyIle: 4.077 ± 0.057
GlyLys: 3.157 ± 0.052
GlyLeu: 10.341 ± 0.107
GlyMet: 2.521 ± 0.041
GlyAsn: 2.133 ± 0.062
GlyPro: 3.48 ± 0.059
GlyGln: 4.052 ± 0.061
GlyArg: 6.365 ± 0.079
GlySer: 4.587 ± 0.105
GlyThr: 4.133 ± 0.075
GlyVal: 6.767 ± 0.074
GlyTrp: 1.517 ± 0.033
GlyTyr: 2.586 ± 0.039
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.764 ± 0.05
HisCys: 0.256 ± 0.015
HisAsp: 1.141 ± 0.031
HisGlu: 1.065 ± 0.029
HisPhe: 0.759 ± 0.02
HisGly: 2.082 ± 0.039
HisHis: 0.549 ± 0.019
HisIle: 0.977 ± 0.025
HisLys: 0.461 ± 0.018
HisLeu: 2.073 ± 0.04
HisMet: 0.489 ± 0.019
HisAsn: 0.453 ± 0.02
HisPro: 1.492 ± 0.035
HisGln: 0.677 ± 0.021
HisArg: 1.412 ± 0.034
HisSer: 0.914 ± 0.025
HisThr: 0.976 ± 0.026
HisVal: 1.505 ± 0.033
HisTrp: 0.365 ± 0.017
HisTyr: 0.648 ± 0.022
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.093 ± 0.07
IleCys: 0.423 ± 0.019
IleAsp: 2.925 ± 0.053
IleGlu: 2.986 ± 0.045
IlePhe: 1.306 ± 0.033
IleGly: 4.15 ± 0.065
IleHis: 0.82 ± 0.025
IleIle: 1.77 ± 0.038
IleLys: 1.368 ± 0.033
IleLeu: 3.989 ± 0.068
IleMet: 0.927 ± 0.029
IleAsn: 1.38 ± 0.036
IlePro: 2.11 ± 0.045
IleGln: 1.33 ± 0.029
IleArg: 2.693 ± 0.042
IleSer: 2.457 ± 0.044
IleThr: 2.348 ± 0.041
IleVal: 3.621 ± 0.06
IleTrp: 0.543 ± 0.021
IleTyr: 0.981 ± 0.026
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.194 ± 0.06
LysCys: 0.121 ± 0.009
LysAsp: 1.312 ± 0.033
LysGlu: 1.244 ± 0.038
LysPhe: 0.741 ± 0.024
LysGly: 2.087 ± 0.038
LysHis: 0.531 ± 0.021
LysIle: 1.257 ± 0.032
LysLys: 1.031 ± 0.034
LysLeu: 3.01 ± 0.054
LysMet: 0.647 ± 0.021
LysAsn: 0.741 ± 0.028
LysPro: 1.8 ± 0.042
LysGln: 1.16 ± 0.028
LysArg: 2.087 ± 0.036
LysSer: 1.435 ± 0.035
LysThr: 1.696 ± 0.038
LysVal: 2.181 ± 0.044
LysTrp: 0.347 ± 0.017
LysTyr: 0.614 ± 0.022
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 17.439 ± 0.157
LeuCys: 1.01 ± 0.027
LeuAsp: 6.281 ± 0.081
LeuGlu: 4.986 ± 0.076
LeuPhe: 3.513 ± 0.061
LeuGly: 9.589 ± 0.106
LeuHis: 2.144 ± 0.042
LeuIle: 4.564 ± 0.069
LeuLys: 3.048 ± 0.05
LeuLeu: 11.605 ± 0.164
LeuMet: 2.515 ± 0.047
LeuAsn: 2.702 ± 0.046
LeuPro: 6.69 ± 0.077
LeuGln: 3.771 ± 0.054
LeuArg: 8.883 ± 0.101
LeuSer: 6.395 ± 0.067
LeuThr: 5.383 ± 0.073
LeuVal: 7.67 ± 0.082
LeuTrp: 1.274 ± 0.03
LeuTyr: 2.229 ± 0.041
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.146 ± 0.054
MetCys: 0.148 ± 0.012
MetAsp: 1.114 ± 0.032
MetGlu: 0.952 ± 0.025
MetPhe: 0.686 ± 0.023
MetGly: 1.906 ± 0.041
MetHis: 0.509 ± 0.021
MetIle: 0.972 ± 0.028
MetLys: 0.84 ± 0.025
MetLeu: 2.821 ± 0.052
MetMet: 0.618 ± 0.024
MetAsn: 0.732 ± 0.019
MetPro: 1.469 ± 0.035
MetGln: 1.134 ± 0.028
MetArg: 1.885 ± 0.035
MetSer: 1.703 ± 0.033
MetThr: 1.581 ± 0.031
MetVal: 1.687 ± 0.037
MetTrp: 0.187 ± 0.011
MetTyr: 0.403 ± 0.017
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.987 ± 0.061
AsnCys: 0.196 ± 0.012
AsnAsp: 1.192 ± 0.038
AsnGlu: 1.069 ± 0.026
AsnPhe: 0.816 ± 0.022
AsnGly: 2.088 ± 0.05
AsnHis: 0.479 ± 0.018
AsnIle: 1.231 ± 0.037
AsnLys: 0.741 ± 0.027
AsnLeu: 2.645 ± 0.043
AsnMet: 0.585 ± 0.02
AsnAsn: 0.685 ± 0.027
AsnPro: 1.708 ± 0.035
AsnGln: 0.802 ± 0.027
AsnArg: 1.642 ± 0.033
AsnSer: 1.146 ± 0.035
AsnThr: 1.382 ± 0.048
AsnVal: 1.858 ± 0.052
AsnTrp: 0.353 ± 0.015
AsnTyr: 0.671 ± 0.022
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 8.016 ± 0.101
ProCys: 0.336 ± 0.015
ProAsp: 3.437 ± 0.053
ProGlu: 3.34 ± 0.053
ProPhe: 1.71 ± 0.034
ProGly: 5.069 ± 0.067
ProHis: 1.054 ± 0.027
ProIle: 1.831 ± 0.035
ProLys: 1.257 ± 0.032
ProLeu: 5.41 ± 0.073
ProMet: 1.28 ± 0.029
ProAsn: 1.17 ± 0.028
ProPro: 2.747 ± 0.053
ProGln: 2.283 ± 0.045
ProArg: 3.475 ± 0.054
ProSer: 2.604 ± 0.042
ProThr: 2.18 ± 0.039
ProVal: 4.262 ± 0.061
ProTrp: 0.796 ± 0.026
ProTyr: 1.382 ± 0.031
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 7.27 ± 0.098
GlnCys: 0.242 ± 0.012
GlnAsp: 1.883 ± 0.04
GlnGlu: 1.716 ± 0.035
GlnPhe: 1.111 ± 0.027
GlnGly: 3.567 ± 0.055
GlnHis: 0.83 ± 0.023
GlnIle: 1.717 ± 0.039
GlnLys: 0.927 ± 0.03
GlnLeu: 3.84 ± 0.056
GlnMet: 0.922 ± 0.024
GlnAsn: 0.829 ± 0.026
GlnPro: 2.226 ± 0.032
GlnGln: 1.747 ± 0.037
GlnArg: 3.243 ± 0.052
GlnSer: 1.872 ± 0.039
GlnThr: 1.906 ± 0.036
GlnVal: 2.878 ± 0.048
GlnTrp: 0.717 ± 0.022
GlnTyr: 0.904 ± 0.025
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 9.356 ± 0.099
ArgCys: 0.573 ± 0.021
ArgAsp: 4.05 ± 0.059
ArgGlu: 4.449 ± 0.065
ArgPhe: 2.624 ± 0.04
ArgGly: 5.338 ± 0.058
ArgHis: 2.037 ± 0.043
ArgIle: 3.729 ± 0.053
ArgLys: 1.996 ± 0.041
ArgLeu: 9.059 ± 0.107
ArgMet: 1.89 ± 0.039
ArgAsn: 1.85 ± 0.033
ArgPro: 3.821 ± 0.066
ArgGln: 3.728 ± 0.054
ArgArg: 6.057 ± 0.09
ArgSer: 3.347 ± 0.043
ArgThr: 2.683 ± 0.042
ArgVal: 5.288 ± 0.062
ArgTrp: 1.15 ± 0.031
ArgTyr: 2.154 ± 0.042
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.224 ± 0.073
SerCys: 0.411 ± 0.017
SerAsp: 2.32 ± 0.043
SerGlu: 2.259 ± 0.041
SerPhe: 1.918 ± 0.036
SerGly: 5.145 ± 0.08
SerHis: 1.101 ± 0.024
SerIle: 2.254 ± 0.047
SerLys: 1.252 ± 0.031
SerLeu: 6.076 ± 0.07
SerMet: 1.296 ± 0.029
SerAsn: 1.26 ± 0.036
SerPro: 2.746 ± 0.042
SerGln: 1.916 ± 0.041
SerArg: 3.683 ± 0.045
SerSer: 2.717 ± 0.052
SerThr: 2.503 ± 0.052
SerVal: 3.979 ± 0.071
SerTrp: 0.79 ± 0.019
SerTyr: 1.291 ± 0.029
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.482 ± 0.064
ThrCys: 0.323 ± 0.014
ThrAsp: 2.333 ± 0.036
ThrGlu: 2.222 ± 0.036
ThrPhe: 1.488 ± 0.033
ThrGly: 4.459 ± 0.082
ThrHis: 1.024 ± 0.027
ThrIle: 2.008 ± 0.05
ThrLys: 0.854 ± 0.027
ThrLeu: 6.504 ± 0.094
ThrMet: 0.97 ± 0.029
ThrAsn: 0.934 ± 0.029
ThrPro: 3.296 ± 0.05
ThrGln: 1.78 ± 0.047
ThrArg: 3.412 ± 0.049
ThrSer: 2.212 ± 0.046
ThrThr: 2.274 ± 0.056
ThrVal: 3.964 ± 0.075
ThrTrp: 0.638 ± 0.024
ThrTyr: 0.999 ± 0.024
ThrXaa: 0.0 ± 0.0
Val
ValAla: 9.529 ± 0.1
ValCys: 0.741 ± 0.025
ValAsp: 3.98 ± 0.057
ValGlu: 3.71 ± 0.057
ValPhe: 2.837 ± 0.051
ValGly: 5.609 ± 0.07
ValHis: 1.423 ± 0.03
ValIle: 3.433 ± 0.053
ValLys: 2.15 ± 0.044
ValLeu: 8.683 ± 0.083
ValMet: 1.813 ± 0.032
ValAsn: 2.108 ± 0.063
ValPro: 4.008 ± 0.057
ValGln: 2.83 ± 0.041
ValArg: 5.108 ± 0.067
ValSer: 4.282 ± 0.081
ValThr: 3.717 ± 0.06
ValVal: 6.027 ± 0.071
ValTrp: 0.905 ± 0.027
ValTyr: 1.745 ± 0.041
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.352 ± 0.036
TrpCys: 0.128 ± 0.011
TrpAsp: 0.629 ± 0.022
TrpGlu: 0.544 ± 0.022
TrpPhe: 0.529 ± 0.022
TrpGly: 0.948 ± 0.029
TrpHis: 0.364 ± 0.016
TrpIle: 0.659 ± 0.018
TrpLys: 0.402 ± 0.019
TrpLeu: 2.169 ± 0.046
TrpMet: 0.429 ± 0.017
TrpAsn: 0.407 ± 0.015
TrpPro: 0.773 ± 0.022
TrpGln: 0.781 ± 0.026
TrpArg: 1.456 ± 0.033
TrpSer: 0.731 ± 0.023
TrpThr: 0.701 ± 0.023
TrpVal: 0.947 ± 0.026
TrpTrp: 0.245 ± 0.012
TrpTyr: 0.314 ± 0.015
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.045 ± 0.047
TyrCys: 0.251 ± 0.016
TyrAsp: 1.463 ± 0.034
TyrGlu: 1.194 ± 0.033
TyrPhe: 0.905 ± 0.023
TyrGly: 2.294 ± 0.043
TyrHis: 0.479 ± 0.019
TyrIle: 0.888 ± 0.024
TyrLys: 0.627 ± 0.022
TyrLeu: 2.614 ± 0.046
TyrMet: 0.455 ± 0.017
TyrAsn: 0.608 ± 0.02
TyrPro: 1.327 ± 0.031
TyrGln: 0.894 ± 0.025
TyrArg: 1.925 ± 0.043
TyrSer: 1.173 ± 0.029
TyrThr: 1.285 ± 0.03
TyrVal: 1.725 ± 0.032
TyrTrp: 0.383 ± 0.018
TyrTyr: 0.633 ± 0.026
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4557 proteins (1500318 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
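
Bootstrapping at the protein level can be sketched as below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the 100 estimates is reported. This is a minimal sketch, not the database's actual code. The function names, the fixed seed, the toy sequences, and the use of the sample standard deviation as the reported error are all assumptions.

```python
import random
import statistics


def pair_stats(seq, pair):
    """Return (occurrences of pair, total dipeptides) for one protein."""
    windows = [seq[i:i + 2] for i in range(len(seq) - 1)]
    return windows.count(pair), len(windows)


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Estimate the error (in per mille) of one dipeptide frequency.

    Assumption: the error is the standard deviation over n_boot
    resamples of whole proteins drawn with replacement.
    """
    rng = random.Random(seed)
    stats = [pair_stats(seq, pair) for seq in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(stats, k=len(stats))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)


# Hypothetical toy proteome (one-letter codes).
toy = ["MAAL", "AAG", "GGGG", "ALAL"]
print(bootstrap_error(toy, "AA"))
```

Resampling whole proteins rather than individual dipeptides keeps each protein's internal composition intact, so the estimate reflects variation between proteins.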

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski