Amino acid dipeptide frequency for Burkholderiales bacterium 8X

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
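As a rough illustration of what a dipeptide frequency is, the sketch below counts overlapping adjacent pairs within each protein and reports them in per mille. The function name, the toy sequences, and the choice to count overlapping pairs without crossing protein boundaries are assumptions made for this example; the page does not state how the database computes its values.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Dipeptides are counted as overlapping adjacent pairs inside each
    protein; no pair is formed across two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Hypothetical toy input using one-letter amino acid codes.
toy = ["MAAL", "AAG"]
freqs = dipeptide_frequencies(toy)
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

The table on this page uses three-letter codes (for example AlaAla), so a one-letter pair such as "AA" would need to be mapped accordingly before comparing.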

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
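The uniform expectation and the over/under labelling can be sketched as follows. The comparison rule (strictly above or below 2.5‰) is an assumption; the page does not say which threshold or tolerance is used for the red and blue marking. The three observed values are taken from the table on this page.

```python
N_AMINO_ACIDS = 20

# Uniform expectation: 1 / (20 * 20) of all dipeptides, in per mille.
expected_permille = 1000.0 / N_AMINO_ACIDS ** 2

def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"

# Values copied from the table (per mille).
examples = {"AlaAla": 18.957, "CysCys": 0.117, "AspAsp": 2.781}
labels = {name: classify(value) for name, value in examples.items()}
print(expected_permille, labels)
```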

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
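For example, the unit conversion can be written as below; the helper names are hypothetical, and the input value is the AlaAla entry from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0

ala_ala_fraction = permille_to_fraction(18.957)
ala_ala_percent = permille_to_percent(18.957)
print(ala_ala_fraction, ala_ala_percent)
```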

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 18.957 ± 0.149
AlaCys: 1.374 ± 0.034
AlaAsp: 6.567 ± 0.07
AlaGlu: 7.276 ± 0.076
AlaPhe: 4.272 ± 0.053
AlaGly: 11.793 ± 0.108
AlaHis: 2.491 ± 0.038
AlaIle: 5.45 ± 0.066
AlaLys: 3.712 ± 0.065
AlaLeu: 13.798 ± 0.12
AlaMet: 3.393 ± 0.05
AlaAsn: 2.927 ± 0.045
AlaPro: 6.446 ± 0.086
AlaGln: 4.78 ± 0.058
AlaArg: 9.857 ± 0.118
AlaSer: 7.86 ± 0.091
AlaThr: 6.057 ± 0.066
AlaVal: 8.824 ± 0.089
AlaTrp: 1.869 ± 0.038
AlaTyr: 2.268 ± 0.042
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.14 ± 0.03
CysCys: 0.117 ± 0.009
CysAsp: 0.553 ± 0.018
CysGlu: 0.562 ± 0.02
CysPhe: 0.313 ± 0.015
CysGly: 1.036 ± 0.03
CysHis: 0.256 ± 0.013
CysIle: 0.475 ± 0.018
CysLys: 0.216 ± 0.013
CysLeu: 0.843 ± 0.024
CysMet: 0.211 ± 0.012
CysAsn: 0.266 ± 0.015
CysPro: 0.438 ± 0.02
CysGln: 0.264 ± 0.015
CysArg: 0.761 ± 0.027
CysSer: 0.54 ± 0.02
CysThr: 0.501 ± 0.024
CysVal: 0.696 ± 0.022
CysTrp: 0.145 ± 0.01
CysTyr: 0.182 ± 0.012
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.437 ± 0.067
AspCys: 0.453 ± 0.017
AspAsp: 2.781 ± 0.05
AspGlu: 3.242 ± 0.05
AspPhe: 2.103 ± 0.036
AspGly: 4.655 ± 0.067
AspHis: 1.181 ± 0.027
AspIle: 2.305 ± 0.045
AspLys: 1.537 ± 0.038
AspLeu: 5.442 ± 0.06
AspMet: 1.095 ± 0.028
AspAsn: 1.109 ± 0.029
AspPro: 3.223 ± 0.049
AspGln: 1.677 ± 0.032
AspArg: 4.314 ± 0.055
AspSer: 2.294 ± 0.041
AspThr: 2.188 ± 0.04
AspVal: 3.68 ± 0.048
AspTrp: 0.864 ± 0.021
AspTyr: 1.239 ± 0.028
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.221 ± 0.089
GluCys: 0.377 ± 0.016
GluAsp: 2.254 ± 0.041
GluGlu: 2.505 ± 0.054
GluPhe: 1.771 ± 0.039
GluGly: 4.02 ± 0.056
GluHis: 1.264 ± 0.032
GluIle: 2.577 ± 0.048
GluLys: 1.728 ± 0.04
GluLeu: 5.906 ± 0.066
GluMet: 1.263 ± 0.026
GluAsn: 1.17 ± 0.029
GluPro: 2.893 ± 0.043
GluGln: 2.26 ± 0.043
GluArg: 5.069 ± 0.066
GluSer: 2.905 ± 0.042
GluThr: 2.442 ± 0.041
GluVal: 4.176 ± 0.056
GluTrp: 0.753 ± 0.024
GluTyr: 0.884 ± 0.023
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.388 ± 0.059
PheCys: 0.403 ± 0.018
PheAsp: 2.647 ± 0.037
PheGlu: 2.297 ± 0.033
PhePhe: 1.369 ± 0.035
PheGly: 3.543 ± 0.049
PheHis: 0.718 ± 0.022
PheIle: 1.442 ± 0.034
PheLys: 1.201 ± 0.031
PheLeu: 2.843 ± 0.044
PheMet: 0.787 ± 0.028
PheAsn: 0.997 ± 0.029
PhePro: 1.528 ± 0.028
PheGln: 0.98 ± 0.028
PheArg: 2.045 ± 0.033
PheSer: 2.078 ± 0.038
PheThr: 1.736 ± 0.04
PheVal: 2.842 ± 0.045
PheTrp: 0.485 ± 0.02
PheTyr: 0.831 ± 0.027
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 10.055 ± 0.105
GlyCys: 0.968 ± 0.023
GlyAsp: 4.132 ± 0.054
GlyGlu: 4.46 ± 0.057
GlyPhe: 3.435 ± 0.047
GlyGly: 7.943 ± 0.145
GlyHis: 1.981 ± 0.041
GlyIle: 4.148 ± 0.054
GlyLys: 3.211 ± 0.049
GlyLeu: 8.943 ± 0.082
GlyMet: 2.242 ± 0.037
GlyAsn: 2.267 ± 0.045
GlyPro: 3.649 ± 0.056
GlyGln: 3.195 ± 0.048
GlyArg: 6.759 ± 0.102
GlySer: 5.476 ± 0.073
GlyThr: 4.546 ± 0.077
GlyVal: 6.164 ± 0.067
GlyTrp: 1.461 ± 0.031
GlyTyr: 2.121 ± 0.041
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.968 ± 0.046
HisCys: 0.275 ± 0.012
HisAsp: 1.152 ± 0.03
HisGlu: 1.092 ± 0.029
HisPhe: 0.801 ± 0.025
HisGly: 2.202 ± 0.045
HisHis: 0.696 ± 0.023
HisIle: 0.789 ± 0.022
HisLys: 0.449 ± 0.016
HisLeu: 2.203 ± 0.043
HisMet: 0.429 ± 0.015
HisAsn: 0.433 ± 0.016
HisPro: 1.629 ± 0.037
HisGln: 0.724 ± 0.025
HisArg: 2.117 ± 0.058
HisSer: 1.016 ± 0.026
HisThr: 0.834 ± 0.025
HisVal: 1.523 ± 0.031
HisTrp: 0.348 ± 0.015
HisTyr: 0.517 ± 0.017
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.347 ± 0.077
IleCys: 0.44 ± 0.018
IleAsp: 3.311 ± 0.044
IleGlu: 3.409 ± 0.05
IlePhe: 1.285 ± 0.031
IleGly: 4.512 ± 0.058
IleHis: 0.824 ± 0.023
IleIle: 1.267 ± 0.036
IleLys: 1.384 ± 0.036
IleLeu: 3.067 ± 0.047
IleMet: 0.674 ± 0.025
IleAsn: 1.259 ± 0.031
IlePro: 1.919 ± 0.034
IleGln: 1.212 ± 0.031
IleArg: 2.714 ± 0.041
IleSer: 2.234 ± 0.04
IleThr: 2.153 ± 0.042
IleVal: 3.553 ± 0.051
IleTrp: 0.485 ± 0.018
IleTyr: 0.921 ± 0.027
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.861 ± 0.065
LysCys: 0.15 ± 0.01
LysAsp: 1.548 ± 0.037
LysGlu: 1.468 ± 0.038
LysPhe: 0.893 ± 0.026
LysGly: 2.337 ± 0.047
LysHis: 0.546 ± 0.019
LysIle: 1.321 ± 0.035
LysLys: 1.373 ± 0.04
LysLeu: 3.49 ± 0.055
LysMet: 0.712 ± 0.024
LysAsn: 0.851 ± 0.028
LysPro: 2.122 ± 0.04
LysGln: 1.07 ± 0.028
LysArg: 2.155 ± 0.043
LysSer: 1.812 ± 0.044
LysThr: 1.8 ± 0.039
LysVal: 2.473 ± 0.054
LysTrp: 0.362 ± 0.015
LysTyr: 0.585 ± 0.022
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 14.543 ± 0.116
LeuCys: 1.014 ± 0.028
LeuAsp: 5.889 ± 0.059
LeuGlu: 5.044 ± 0.062
LeuPhe: 3.346 ± 0.061
LeuGly: 8.697 ± 0.091
LeuHis: 2.396 ± 0.044
LeuIle: 4.012 ± 0.057
LeuLys: 3.211 ± 0.049
LeuLeu: 10.823 ± 0.118
LeuMet: 2.453 ± 0.041
LeuAsn: 2.388 ± 0.043
LeuPro: 6.173 ± 0.065
LeuGln: 4.116 ± 0.054
LeuArg: 8.17 ± 0.101
LeuSer: 5.936 ± 0.078
LeuThr: 4.551 ± 0.054
LeuVal: 7.858 ± 0.089
LeuTrp: 1.202 ± 0.032
LeuTyr: 1.852 ± 0.036
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.946 ± 0.046
MetCys: 0.173 ± 0.01
MetAsp: 0.983 ± 0.024
MetGlu: 0.94 ± 0.025
MetPhe: 0.699 ± 0.022
MetGly: 1.71 ± 0.038
MetHis: 0.514 ± 0.02
MetIle: 0.947 ± 0.029
MetLys: 1.058 ± 0.029
MetLeu: 2.649 ± 0.038
MetMet: 0.55 ± 0.023
MetAsn: 0.842 ± 0.024
MetPro: 1.605 ± 0.032
MetGln: 1.011 ± 0.02
MetArg: 1.757 ± 0.033
MetSer: 1.601 ± 0.03
MetThr: 1.519 ± 0.033
MetVal: 1.711 ± 0.041
MetTrp: 0.188 ± 0.012
MetTyr: 0.351 ± 0.015
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.183 ± 0.047
AsnCys: 0.254 ± 0.014
AsnAsp: 1.376 ± 0.036
AsnGlu: 1.212 ± 0.029
AsnPhe: 0.945 ± 0.026
AsnGly: 2.188 ± 0.056
AsnHis: 0.477 ± 0.019
AsnIle: 1.139 ± 0.026
AsnLys: 0.784 ± 0.024
AsnLeu: 2.454 ± 0.038
AsnMet: 0.536 ± 0.017
AsnAsn: 0.707 ± 0.029
AsnPro: 1.753 ± 0.035
AsnGln: 0.82 ± 0.023
AsnArg: 1.651 ± 0.036
AsnSer: 1.202 ± 0.034
AsnThr: 1.275 ± 0.033
AsnVal: 1.802 ± 0.034
AsnTrp: 0.383 ± 0.016
AsnTyr: 0.641 ± 0.019
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 7.561 ± 0.088
ProCys: 0.408 ± 0.017
ProAsp: 3.261 ± 0.048
ProGlu: 3.449 ± 0.056
ProPhe: 1.814 ± 0.037
ProGly: 5.159 ± 0.069
ProHis: 1.181 ± 0.029
ProIle: 2.217 ± 0.041
ProLys: 1.51 ± 0.035
ProLeu: 5.137 ± 0.064
ProMet: 1.356 ± 0.03
ProAsn: 1.397 ± 0.031
ProPro: 3.131 ± 0.05
ProGln: 1.935 ± 0.035
ProArg: 3.534 ± 0.061
ProSer: 3.456 ± 0.058
ProThr: 2.813 ± 0.046
ProVal: 4.257 ± 0.055
ProTrp: 0.761 ± 0.021
ProTyr: 1.106 ± 0.03
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.858 ± 0.051
GlnCys: 0.305 ± 0.015
GlnAsp: 1.564 ± 0.034
GlnGlu: 1.486 ± 0.033
GlnPhe: 1.136 ± 0.027
GlnGly: 2.978 ± 0.042
GlnHis: 0.893 ± 0.023
GlnIle: 1.564 ± 0.03
GlnLys: 1.12 ± 0.029
GlnLeu: 4.013 ± 0.053
GlnMet: 0.857 ± 0.021
GlnAsn: 0.77 ± 0.023
GlnPro: 2.354 ± 0.041
GlnGln: 1.801 ± 0.054
GlnArg: 3.312 ± 0.054
GlnSer: 1.983 ± 0.039
GlnThr: 1.586 ± 0.029
GlnVal: 3.009 ± 0.044
GlnTrp: 0.547 ± 0.019
GlnTyr: 0.65 ± 0.021
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 8.578 ± 0.097
ArgCys: 0.779 ± 0.027
ArgAsp: 3.828 ± 0.057
ArgGlu: 4.284 ± 0.062
ArgPhe: 2.994 ± 0.047
ArgGly: 5.593 ± 0.071
ArgHis: 2.214 ± 0.05
ArgIle: 3.801 ± 0.048
ArgLys: 2.251 ± 0.037
ArgLeu: 8.404 ± 0.086
ArgMet: 2.019 ± 0.035
ArgAsn: 1.923 ± 0.034
ArgPro: 4.055 ± 0.072
ArgGln: 3.162 ± 0.053
ArgArg: 6.939 ± 0.128
ArgSer: 4.574 ± 0.063
ArgThr: 3.49 ± 0.053
ArgVal: 5.068 ± 0.06
ArgTrp: 1.246 ± 0.032
ArgTyr: 1.748 ± 0.04
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.102 ± 0.094
SerCys: 0.519 ± 0.024
SerAsp: 2.704 ± 0.038
SerGlu: 2.634 ± 0.047
SerPhe: 2.24 ± 0.038
SerGly: 5.505 ± 0.078
SerHis: 1.21 ± 0.029
SerIle: 2.684 ± 0.041
SerLys: 1.661 ± 0.041
SerLeu: 5.808 ± 0.071
SerMet: 1.526 ± 0.033
SerAsn: 1.561 ± 0.032
SerPro: 3.305 ± 0.047
SerGln: 1.957 ± 0.036
SerArg: 4.204 ± 0.054
SerSer: 3.756 ± 0.06
SerThr: 3.25 ± 0.049
SerVal: 3.865 ± 0.057
SerTrp: 0.789 ± 0.023
SerTyr: 1.204 ± 0.03
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.808 ± 0.082
ThrCys: 0.412 ± 0.019
ThrAsp: 2.403 ± 0.04
ThrGlu: 2.256 ± 0.04
ThrPhe: 1.651 ± 0.031
ThrGly: 4.622 ± 0.069
ThrHis: 1.078 ± 0.024
ThrIle: 2.133 ± 0.042
ThrLys: 1.097 ± 0.034
ThrLeu: 5.675 ± 0.066
ThrMet: 1.078 ± 0.026
ThrAsn: 1.085 ± 0.031
ThrPro: 3.374 ± 0.05
ThrGln: 1.682 ± 0.037
ThrArg: 3.405 ± 0.043
ThrSer: 2.846 ± 0.052
ThrThr: 2.726 ± 0.058
ThrVal: 3.801 ± 0.055
ThrTrp: 0.634 ± 0.02
ThrTyr: 0.945 ± 0.027
ThrXaa: 0.0 ± 0.0
Val
ValAla: 9.379 ± 0.091
ValCys: 0.687 ± 0.023
ValAsp: 4.171 ± 0.056
ValGlu: 4.199 ± 0.048
ValPhe: 2.715 ± 0.042
ValGly: 5.901 ± 0.074
ValHis: 1.57 ± 0.03
ValIle: 3.25 ± 0.046
ValLys: 2.311 ± 0.044
ValLeu: 8.025 ± 0.094
ValMet: 1.785 ± 0.037
ValAsn: 1.863 ± 0.035
ValPro: 4.074 ± 0.06
ValGln: 2.69 ± 0.042
ValArg: 5.301 ± 0.061
ValSer: 4.004 ± 0.057
ValThr: 3.505 ± 0.055
ValVal: 6.254 ± 0.076
ValTrp: 0.871 ± 0.027
ValTyr: 1.419 ± 0.034
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.252 ± 0.031
TrpCys: 0.176 ± 0.01
TrpAsp: 0.571 ± 0.018
TrpGlu: 0.514 ± 0.019
TrpPhe: 0.55 ± 0.019
TrpGly: 0.903 ± 0.026
TrpHis: 0.338 ± 0.014
TrpIle: 0.705 ± 0.022
TrpLys: 0.479 ± 0.019
TrpLeu: 1.965 ± 0.038
TrpMet: 0.419 ± 0.015
TrpAsn: 0.439 ± 0.017
TrpPro: 0.724 ± 0.024
TrpGln: 0.701 ± 0.02
TrpArg: 1.301 ± 0.028
TrpSer: 0.818 ± 0.023
TrpThr: 0.725 ± 0.024
TrpVal: 0.893 ± 0.028
TrpTrp: 0.261 ± 0.013
TrpTyr: 0.281 ± 0.013
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.386 ± 0.045
TyrCys: 0.23 ± 0.012
TyrAsp: 1.123 ± 0.028
TyrGlu: 1.111 ± 0.027
TyrPhe: 0.828 ± 0.026
TyrGly: 1.859 ± 0.035
TyrHis: 0.37 ± 0.015
TyrIle: 0.747 ± 0.024
TyrLys: 0.637 ± 0.022
TyrLeu: 2.187 ± 0.036
TyrMet: 0.367 ± 0.016
TyrAsn: 0.528 ± 0.018
TyrPro: 1.048 ± 0.026
TyrGln: 0.75 ± 0.022
TyrArg: 1.583 ± 0.032
TyrSer: 1.093 ± 0.028
TyrThr: 1.014 ± 0.027
TyrVal: 1.524 ± 0.03
TyrTrp: 0.359 ± 0.018
TyrTyr: 0.525 ± 0.022
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4766 proteins (1542289 amino acids)
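The totals above allow a rough back-calculation of absolute counts from the per mille values. The sketch assumes that dipeptides are counted as overlapping pairs within each protein, so a protein of length n contributes n - 1 pairs; this counting scheme is an assumption, not something the page states, and the result is therefore only an estimate.

```python
N_PROTEINS = 4766
N_RESIDUES = 1542289

# Assumed number of dipeptide positions: one fewer than the length of
# each protein, summed over all proteins.
n_positions = N_RESIDUES - N_PROTEINS

def estimated_count(value_permille, positions=n_positions):
    """Approximate number of occurrences for a per mille frequency."""
    return value_permille * 1e-3 * positions

ala_ala_count = estimated_count(18.957)  # AlaAla value from the table
print(n_positions, round(ala_ala_count))
```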

Note: The error has been estimated by bootstrapping (×100) at the protein level.
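Bootstrapping at the protein level can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resampled values gives the error. The function names and toy sequences are hypothetical, and using the standard deviation over 100 resamples as the reported error is an assumption about how the ± values were obtained.

```python
import random
from collections import Counter
from statistics import mean, stdev

def pair_counts(seq):
    """Count overlapping adjacent pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_dipeptide(proteins, pair, n_resamples=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Each resample draws len(proteins) proteins with replacement, so the
    protein, not the single residue, is the unit being resampled.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(seq) for seq in proteins]
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        if total == 0:
            continue  # resample contained no dipeptides at all
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total)
    spread = stdev(estimates) if len(estimates) > 1 else 0.0
    return mean(estimates), spread

# Hypothetical toy proteome (one-letter codes).
toy = ["MAAL", "AAG", "MKV", "AALAA"]
mean_aa, sd_aa = bootstrap_dipeptide(toy, "AA")
print(round(mean_aa, 3), round(sd_aa, 3))
```

Fixing the seed makes the toy run reproducible; with a real proteome the per-protein counts would be computed once and reused across resamples, as done here.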

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here
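The layout of the CSV file is not shown on this page, so the sketch below instead parses table entries of the form `AlaAla: 18.957 ± 0.149` as they appear in the text above. The regular expression and function name are assumptions made for illustration.

```python
import re

# Two three-letter residue codes, a colon, a value, "±", and an error.
ENTRY = re.compile(
    r"([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*(\d+(?:\.\d+)?)\s*±\s*(\d+(?:\.\d+)?)"
)

def parse_entries(text):
    """Map (first, second) residue codes to (value, error) in per mille."""
    table = {}
    for first, second, value, error in ENTRY.findall(text):
        table[(first, second)] = (float(value), float(error))
    return table

sample = """AlaAla: 18.957 ± 0.149
AlaCys: 1.374 ± 0.034
CysXaa: 0.0 ± 0.0"""
table = parse_entries(sample)
print(table[("Ala", "Ala")])
```

Applied to the full table, summing the parsed values gives a quick consistency check, since the frequencies of all dipeptides should add up to roughly 1000‰, up to rounding.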

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski