Amino acid dipepetide frequency for Burkholderia sp. WAC0059

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
18.843AlaAla: 18.843 ± 0.189
1.316AlaCys: 1.316 ± 0.037
6.855AlaAsp: 6.855 ± 0.074
6.048AlaGlu: 6.048 ± 0.082
4.514AlaPhe: 4.514 ± 0.059
12.127AlaGly: 12.127 ± 0.145
2.995AlaHis: 2.995 ± 0.047
5.48AlaIle: 5.48 ± 0.066
3.181AlaLys: 3.181 ± 0.061
15.17AlaLeu: 15.17 ± 0.144
3.057AlaMet: 3.057 ± 0.051
3.053AlaAsn: 3.053 ± 0.055
6.371AlaPro: 6.371 ± 0.099
5.47AlaGln: 5.47 ± 0.077
10.417AlaArg: 10.417 ± 0.102
7.765AlaSer: 7.765 ± 0.1
6.243AlaThr: 6.243 ± 0.077
9.213AlaVal: 9.213 ± 0.087
1.825AlaTrp: 1.825 ± 0.036
2.545AlaTyr: 2.545 ± 0.046
0.0AlaXaa: 0.0 ± 0.0
Cys
1.222CysAla: 1.222 ± 0.027
0.128CysCys: 0.128 ± 0.008
0.486CysAsp: 0.486 ± 0.017
0.508CysGlu: 0.508 ± 0.019
0.328CysPhe: 0.328 ± 0.013
0.956CysGly: 0.956 ± 0.026
0.212CysHis: 0.212 ± 0.012
0.358CysIle: 0.358 ± 0.017
0.145CysLys: 0.145 ± 0.01
0.825CysLeu: 0.825 ± 0.025
0.181CysMet: 0.181 ± 0.012
0.221CysAsn: 0.221 ± 0.013
0.426CysPro: 0.426 ± 0.018
0.207CysGln: 0.207 ± 0.01
0.692CysArg: 0.692 ± 0.023
0.48CysSer: 0.48 ± 0.018
0.454CysThr: 0.454 ± 0.02
0.709CysVal: 0.709 ± 0.023
0.125CysTrp: 0.125 ± 0.01
0.203CysTyr: 0.203 ± 0.012
0.0CysXaa: 0.0 ± 0.0
Asp
8.381AspAla: 8.381 ± 0.086
0.442AspCys: 0.442 ± 0.019
3.36AspAsp: 3.36 ± 0.058
3.61AspGlu: 3.61 ± 0.06
1.92AspPhe: 1.92 ± 0.038
5.087AspGly: 5.087 ± 0.068
1.091AspHis: 1.091 ± 0.026
2.266AspIle: 2.266 ± 0.046
1.107AspLys: 1.107 ± 0.032
5.335AspLeu: 5.335 ± 0.058
1.033AspMet: 1.033 ± 0.029
1.127AspAsn: 1.127 ± 0.031
2.945AspPro: 2.945 ± 0.045
1.436AspGln: 1.436 ± 0.033
3.586AspArg: 3.586 ± 0.061
2.464AspSer: 2.464 ± 0.052
2.758AspThr: 2.758 ± 0.048
4.478AspVal: 4.478 ± 0.052
0.909AspTrp: 0.909 ± 0.025
1.512AspTyr: 1.512 ± 0.035
0.0AspXaa: 0.0 ± 0.0
Glu
7.277GluAla: 7.277 ± 0.086
0.369GluCys: 0.369 ± 0.018
2.12GluAsp: 2.12 ± 0.048
2.222GluGlu: 2.222 ± 0.047
1.657GluPhe: 1.657 ± 0.033
3.223GluGly: 3.223 ± 0.052
1.408GluHis: 1.408 ± 0.034
2.71GluIle: 2.71 ± 0.048
1.622GluLys: 1.622 ± 0.037
5.231GluLeu: 5.231 ± 0.074
1.111GluMet: 1.111 ± 0.032
1.406GluAsn: 1.406 ± 0.031
2.626GluPro: 2.626 ± 0.045
2.193GluGln: 2.193 ± 0.037
5.057GluArg: 5.057 ± 0.065
2.652GluSer: 2.652 ± 0.047
3.085GluThr: 3.085 ± 0.051
3.478GluVal: 3.478 ± 0.055
0.668GluTrp: 0.668 ± 0.02
1.121GluTyr: 1.121 ± 0.032
0.0GluXaa: 0.0 ± 0.0
Phe
4.796PheAla: 4.796 ± 0.06
0.412PheCys: 0.412 ± 0.016
2.657PheAsp: 2.657 ± 0.052
2.025PheGlu: 2.025 ± 0.037
1.44PhePhe: 1.44 ± 0.039
3.793PheGly: 3.793 ± 0.055
0.735PheHis: 0.735 ± 0.023
1.402PheIle: 1.402 ± 0.031
0.725PheLys: 0.725 ± 0.023
2.922PheLeu: 2.922 ± 0.056
0.751PheMet: 0.751 ± 0.02
0.989PheAsn: 0.989 ± 0.027
1.564PhePro: 1.564 ± 0.036
0.911PheGln: 0.911 ± 0.026
2.156PheArg: 2.156 ± 0.036
2.309PheSer: 2.309 ± 0.038
1.717PheThr: 1.717 ± 0.037
3.334PheVal: 3.334 ± 0.06
0.505PheTrp: 0.505 ± 0.02
0.879PheTyr: 0.879 ± 0.025
0.0PheXaa: 0.0 ± 0.0
Gly
10.144GlyAla: 10.144 ± 0.096
0.846GlyCys: 0.846 ± 0.025
4.145GlyAsp: 4.145 ± 0.064
4.672GlyGlu: 4.672 ± 0.061
3.49GlyPhe: 3.49 ± 0.052
7.506GlyGly: 7.506 ± 0.256
2.017GlyHis: 2.017 ± 0.039
4.352GlyIle: 4.352 ± 0.067
2.712GlyLys: 2.712 ± 0.055
8.63GlyLeu: 8.63 ± 0.082
2.205GlyMet: 2.205 ± 0.042
2.451GlyAsn: 2.451 ± 0.069
3.186GlyPro: 3.186 ± 0.052
2.944GlyGln: 2.944 ± 0.047
6.072GlyArg: 6.072 ± 0.083
4.953GlySer: 4.953 ± 0.128
5.269GlyThr: 5.269 ± 0.168
6.777GlyVal: 6.777 ± 0.07
1.409GlyTrp: 1.409 ± 0.031
2.511GlyTyr: 2.511 ± 0.038
0.0GlyXaa: 0.0 ± 0.0
His
3.192HisAla: 3.192 ± 0.043
0.251HisCys: 0.251 ± 0.013
1.406HisAsp: 1.406 ± 0.031
1.221HisGlu: 1.221 ± 0.029
0.918HisPhe: 0.918 ± 0.027
2.262HisGly: 2.262 ± 0.045
0.612HisHis: 0.612 ± 0.025
0.805HisIle: 0.805 ± 0.026
0.41HisLys: 0.41 ± 0.018
2.255HisLeu: 2.255 ± 0.039
0.428HisMet: 0.428 ± 0.015
0.514HisAsn: 0.514 ± 0.02
1.477HisPro: 1.477 ± 0.039
0.538HisGln: 0.538 ± 0.018
1.639HisArg: 1.639 ± 0.034
0.989HisSer: 0.989 ± 0.027
1.033HisThr: 1.033 ± 0.025
1.739HisVal: 1.739 ± 0.038
0.38HisTrp: 0.38 ± 0.017
0.686HisTyr: 0.686 ± 0.023
0.0HisXaa: 0.0 ± 0.0
Ile
6.604IleAla: 6.604 ± 0.067
0.424IleCys: 0.424 ± 0.018
3.264IleAsp: 3.264 ± 0.054
3.085IleGlu: 3.085 ± 0.044
1.324IlePhe: 1.324 ± 0.026
4.737IleGly: 4.737 ± 0.072
0.854IleHis: 0.854 ± 0.023
1.287IleIle: 1.287 ± 0.036
0.994IleLys: 0.994 ± 0.029
3.309IleLeu: 3.309 ± 0.05
0.67IleMet: 0.67 ± 0.021
1.217IleAsn: 1.217 ± 0.033
2.044IlePro: 2.044 ± 0.038
1.078IleGln: 1.078 ± 0.027
2.958IleArg: 2.958 ± 0.052
2.33IleSer: 2.33 ± 0.049
2.08IleThr: 2.08 ± 0.041
4.262IleVal: 4.262 ± 0.06
0.472IleTrp: 0.472 ± 0.019
0.925IleTyr: 0.925 ± 0.029
0.0IleXaa: 0.0 ± 0.0
Lys
3.015LysAla: 3.015 ± 0.054
0.117LysCys: 0.117 ± 0.01
1.237LysAsp: 1.237 ± 0.033
1.097LysGlu: 1.097 ± 0.031
0.72LysPhe: 0.72 ± 0.023
1.802LysGly: 1.802 ± 0.042
0.528LysHis: 0.528 ± 0.021
1.37LysIle: 1.37 ± 0.031
0.923LysLys: 0.923 ± 0.035
2.778LysLeu: 2.778 ± 0.048
0.581LysMet: 0.581 ± 0.022
0.696LysAsn: 0.696 ± 0.024
1.673LysPro: 1.673 ± 0.041
0.945LysGln: 0.945 ± 0.031
1.983LysArg: 1.983 ± 0.045
1.405LysSer: 1.405 ± 0.031
1.573LysThr: 1.573 ± 0.036
1.867LysVal: 1.867 ± 0.039
0.304LysTrp: 0.304 ± 0.016
0.572LysTyr: 0.572 ± 0.021
0.0LysXaa: 0.0 ± 0.0
Leu
15.156LeuAla: 15.156 ± 0.145
1.039LeuCys: 1.039 ± 0.026
6.397LeuAsp: 6.397 ± 0.076
5.094LeuGlu: 5.094 ± 0.07
3.552LeuPhe: 3.552 ± 0.063
8.494LeuGly: 8.494 ± 0.084
2.307LeuHis: 2.307 ± 0.036
4.235LeuIle: 4.235 ± 0.055
2.839LeuLys: 2.839 ± 0.052
10.243LeuLeu: 10.243 ± 0.107
2.088LeuMet: 2.088 ± 0.043
2.639LeuAsn: 2.639 ± 0.048
6.069LeuPro: 6.069 ± 0.072
3.233LeuGln: 3.233 ± 0.048
7.726LeuArg: 7.726 ± 0.091
6.363LeuSer: 6.363 ± 0.112
5.281LeuThr: 5.281 ± 0.065
8.113LeuVal: 8.113 ± 0.09
1.15LeuTrp: 1.15 ± 0.033
2.223LeuTyr: 2.223 ± 0.041
0.0LeuXaa: 0.0 ± 0.0
Met
2.405MetAla: 2.405 ± 0.04
0.143MetCys: 0.143 ± 0.009
0.912MetAsp: 0.912 ± 0.026
0.833MetGlu: 0.833 ± 0.024
0.698MetPhe: 0.698 ± 0.026
1.434MetGly: 1.434 ± 0.034
0.486MetHis: 0.486 ± 0.018
0.991MetIle: 0.991 ± 0.026
0.851MetLys: 0.851 ± 0.022
2.663MetLeu: 2.663 ± 0.045
0.51MetMet: 0.51 ± 0.021
0.853MetAsn: 0.853 ± 0.024
1.425MetPro: 1.425 ± 0.033
0.892MetGln: 0.892 ± 0.022
1.67MetArg: 1.67 ± 0.04
1.602MetSer: 1.602 ± 0.034
1.506MetThr: 1.506 ± 0.036
1.399MetVal: 1.399 ± 0.028
0.181MetTrp: 0.181 ± 0.011
0.343MetTyr: 0.343 ± 0.014
0.0MetXaa: 0.0 ± 0.0
Asn
3.396AsnAla: 3.396 ± 0.064
0.213AsnCys: 0.213 ± 0.013
1.366AsnAsp: 1.366 ± 0.03
1.246AsnGlu: 1.246 ± 0.031
0.936AsnPhe: 0.936 ± 0.028
2.466AsnGly: 2.466 ± 0.06
0.504AsnHis: 0.504 ± 0.02
1.033AsnIle: 1.033 ± 0.032
0.466AsnLys: 0.466 ± 0.02
2.741AsnLeu: 2.741 ± 0.043
0.505AsnMet: 0.505 ± 0.017
0.72AsnAsn: 0.72 ± 0.028
1.792AsnPro: 1.792 ± 0.039
0.783AsnGln: 0.783 ± 0.022
1.841AsnArg: 1.841 ± 0.034
1.233AsnSer: 1.233 ± 0.041
1.362AsnThr: 1.362 ± 0.036
2.2AsnVal: 2.2 ± 0.049
0.384AsnTrp: 0.384 ± 0.016
0.669AsnTyr: 0.669 ± 0.022
0.0AsnXaa: 0.0 ± 0.0
Pro
7.313ProAla: 7.313 ± 0.096
0.337ProCys: 0.337 ± 0.016
3.662ProAsp: 3.662 ± 0.05
3.019ProGlu: 3.019 ± 0.051
1.946ProPhe: 1.946 ± 0.04
4.794ProGly: 4.794 ± 0.066
1.263ProHis: 1.263 ± 0.03
1.875ProIle: 1.875 ± 0.036
1.173ProLys: 1.173 ± 0.032
5.273ProLeu: 5.273 ± 0.058
1.089ProMet: 1.089 ± 0.028
1.27ProAsn: 1.27 ± 0.031
2.834ProPro: 2.834 ± 0.063
1.881ProGln: 1.881 ± 0.038
3.187ProArg: 3.187 ± 0.049
2.825ProSer: 2.825 ± 0.045
2.514ProThr: 2.514 ± 0.043
4.514ProVal: 4.514 ± 0.061
0.702ProTrp: 0.702 ± 0.021
1.232ProTyr: 1.232 ± 0.033
0.0ProXaa: 0.0 ± 0.0
Gln
4.305GlnAla: 4.305 ± 0.06
0.254GlnCys: 0.254 ± 0.012
1.332GlnAsp: 1.332 ± 0.03
1.231GlnGlu: 1.231 ± 0.031
1.211GlnPhe: 1.211 ± 0.031
2.303GlnGly: 2.303 ± 0.041
0.899GlnHis: 0.899 ± 0.025
1.962GlnIle: 1.962 ± 0.044
0.939GlnLys: 0.939 ± 0.027
3.698GlnLeu: 3.698 ± 0.057
0.957GlnMet: 0.957 ± 0.027
0.883GlnAsn: 0.883 ± 0.027
2.063GlnPro: 2.063 ± 0.043
1.778GlnGln: 1.778 ± 0.045
2.837GlnArg: 2.837 ± 0.053
1.899GlnSer: 1.899 ± 0.036
2.152GlnThr: 2.152 ± 0.077
2.46GlnVal: 2.46 ± 0.047
0.545GlnTrp: 0.545 ± 0.019
0.915GlnTyr: 0.915 ± 0.024
0.0GlnXaa: 0.0 ± 0.0
Arg
9.2ArgAla: 9.2 ± 0.096
0.569ArgCys: 0.569 ± 0.02
3.967ArgAsp: 3.967 ± 0.059
4.694ArgGlu: 4.694 ± 0.064
3.116ArgPhe: 3.116 ± 0.052
4.959ArgGly: 4.959 ± 0.058
2.064ArgHis: 2.064 ± 0.043
3.821ArgIle: 3.821 ± 0.06
1.822ArgLys: 1.822 ± 0.039
7.971ArgLeu: 7.971 ± 0.099
1.781ArgMet: 1.781 ± 0.037
1.895ArgAsn: 1.895 ± 0.037
3.606ArgPro: 3.606 ± 0.058
2.728ArgGln: 2.728 ± 0.049
6.428ArgArg: 6.428 ± 0.092
3.584ArgSer: 3.584 ± 0.054
3.532ArgThr: 3.532 ± 0.049
5.56ArgVal: 5.56 ± 0.07
1.123ArgTrp: 1.123 ± 0.028
2.104ArgTyr: 2.104 ± 0.041
0.0ArgXaa: 0.0 ± 0.0
Ser
7.05SerAla: 7.05 ± 0.093
0.397SerCys: 0.397 ± 0.017
2.676SerAsp: 2.676 ± 0.052
2.299SerGlu: 2.299 ± 0.039
2.01SerPhe: 2.01 ± 0.034
6.293SerGly: 6.293 ± 0.221
1.193SerHis: 1.193 ± 0.031
2.59SerIle: 2.59 ± 0.038
1.299SerLys: 1.299 ± 0.03
5.889SerLeu: 5.889 ± 0.085
1.328SerMet: 1.328 ± 0.034
1.53SerAsn: 1.53 ± 0.042
2.997SerPro: 2.997 ± 0.047
1.835SerGln: 1.835 ± 0.057
3.828SerArg: 3.828 ± 0.052
3.41SerSer: 3.41 ± 0.08
3.159SerThr: 3.159 ± 0.116
4.356SerVal: 4.356 ± 0.07
0.664SerTrp: 0.664 ± 0.023
1.195SerTyr: 1.195 ± 0.026
0.0SerXaa: 0.0 ± 0.0
Thr
6.128ThrAla: 6.128 ± 0.079
0.412ThrCys: 0.412 ± 0.018
2.66ThrAsp: 2.66 ± 0.049
2.123ThrGlu: 2.123 ± 0.045
1.893ThrPhe: 1.893 ± 0.039
4.994ThrGly: 4.994 ± 0.094
1.194ThrHis: 1.194 ± 0.024
2.529ThrIle: 2.529 ± 0.067
1.083ThrLys: 1.083 ± 0.028
6.528ThrLeu: 6.528 ± 0.079
1.069ThrMet: 1.069 ± 0.028
1.239ThrAsn: 1.239 ± 0.036
3.422ThrPro: 3.422 ± 0.055
1.865ThrGln: 1.865 ± 0.061
3.506ThrArg: 3.506 ± 0.05
2.861ThrSer: 2.861 ± 0.055
3.056ThrThr: 3.056 ± 0.073
4.616ThrVal: 4.616 ± 0.132
0.67ThrTrp: 0.67 ± 0.022
1.127ThrTyr: 1.127 ± 0.026
0.0ThrXaa: 0.0 ± 0.0
Val
10.039ValAla: 10.039 ± 0.102
0.809ValCys: 0.809 ± 0.024
4.415ValAsp: 4.415 ± 0.068
4.479ValGlu: 4.479 ± 0.063
2.843ValPhe: 2.843 ± 0.049
5.935ValGly: 5.935 ± 0.072
1.526ValHis: 1.526 ± 0.036
3.416ValIle: 3.416 ± 0.053
2.061ValLys: 2.061 ± 0.046
8.281ValLeu: 8.281 ± 0.097
1.705ValMet: 1.705 ± 0.035
2.171ValAsn: 2.171 ± 0.051
4.279ValPro: 4.279 ± 0.062
2.424ValGln: 2.424 ± 0.045
5.534ValArg: 5.534 ± 0.068
4.942ValSer: 4.942 ± 0.079
4.348ValThr: 4.348 ± 0.063
6.806ValVal: 6.806 ± 0.081
0.993ValTrp: 0.993 ± 0.025
1.688ValTyr: 1.688 ± 0.036
0.0ValXaa: 0.0 ± 0.0
Trp
1.218TrpAla: 1.218 ± 0.032
0.144TrpCys: 0.144 ± 0.011
0.561TrpAsp: 0.561 ± 0.022
0.493TrpGlu: 0.493 ± 0.019
0.544TrpPhe: 0.544 ± 0.019
0.879TrpGly: 0.879 ± 0.026
0.405TrpHis: 0.405 ± 0.018
0.702TrpIle: 0.702 ± 0.022
0.37TrpLys: 0.37 ± 0.017
1.94TrpLeu: 1.94 ± 0.045
0.341TrpMet: 0.341 ± 0.017
0.402TrpAsn: 0.402 ± 0.016
0.68TrpPro: 0.68 ± 0.023
0.702TrpGln: 0.702 ± 0.024
1.338TrpArg: 1.338 ± 0.032
0.775TrpSer: 0.775 ± 0.026
0.684TrpThr: 0.684 ± 0.022
0.897TrpVal: 0.897 ± 0.022
0.206TrpTrp: 0.206 ± 0.013
0.3TrpTyr: 0.3 ± 0.015
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.803TyrAla: 2.803 ± 0.047
0.243TyrCys: 0.243 ± 0.012
1.37TyrAsp: 1.37 ± 0.031
1.235TyrGlu: 1.235 ± 0.031
0.963TyrPhe: 0.963 ± 0.029
2.247TyrGly: 2.247 ± 0.039
0.462TyrHis: 0.462 ± 0.018
0.734TyrIle: 0.734 ± 0.023
0.463TyrLys: 0.463 ± 0.02
2.561TyrLeu: 2.561 ± 0.042
0.406TyrMet: 0.406 ± 0.015
0.582TyrAsn: 0.582 ± 0.023
1.221TyrPro: 1.221 ± 0.032
0.781TyrGln: 0.781 ± 0.025
2.069TyrArg: 2.069 ± 0.038
1.158TyrSer: 1.158 ± 0.034
1.167TyrThr: 1.167 ± 0.033
1.917TyrVal: 1.917 ± 0.036
0.368TyrTrp: 0.368 ± 0.016
0.652TyrTyr: 0.652 ± 0.022
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4564 proteins (1499025 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski