Amino acid dipepetide frequency for Cerasibacillus quisquiliarum

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.295AlaAla: 4.295 ± 0.101
0.549AlaCys: 0.549 ± 0.029
2.999AlaAsp: 2.999 ± 0.067
4.036AlaGlu: 4.036 ± 0.079
3.203AlaPhe: 3.203 ± 0.079
4.372AlaGly: 4.372 ± 0.086
1.419AlaHis: 1.419 ± 0.044
6.113AlaIle: 6.113 ± 0.114
4.663AlaLys: 4.663 ± 0.086
6.92AlaLeu: 6.92 ± 0.093
1.914AlaMet: 1.914 ± 0.051
2.723AlaAsn: 2.723 ± 0.057
1.791AlaPro: 1.791 ± 0.048
2.107AlaGln: 2.107 ± 0.06
2.538AlaArg: 2.538 ± 0.068
3.504AlaSer: 3.504 ± 0.075
3.56AlaThr: 3.56 ± 0.062
4.477AlaVal: 4.477 ± 0.085
0.536AlaTrp: 0.536 ± 0.025
2.395AlaTyr: 2.395 ± 0.06
0.0AlaXaa: 0.0 ± 0.0
Cys
0.348CysAla: 0.348 ± 0.021
0.087CysCys: 0.087 ± 0.012
0.366CysAsp: 0.366 ± 0.022
0.36CysGlu: 0.36 ± 0.023
0.245CysPhe: 0.245 ± 0.019
0.555CysGly: 0.555 ± 0.029
0.24CysHis: 0.24 ± 0.02
0.433CysIle: 0.433 ± 0.023
0.347CysLys: 0.347 ± 0.022
0.547CysLeu: 0.547 ± 0.026
0.157CysMet: 0.157 ± 0.013
0.243CysAsn: 0.243 ± 0.018
0.323CysPro: 0.323 ± 0.021
0.288CysGln: 0.288 ± 0.021
0.265CysArg: 0.265 ± 0.017
0.43CysSer: 0.43 ± 0.025
0.36CysThr: 0.36 ± 0.023
0.392CysVal: 0.392 ± 0.021
0.045CysTrp: 0.045 ± 0.009
0.227CysTyr: 0.227 ± 0.015
0.0CysXaa: 0.0 ± 0.0
Asp
3.213AspAla: 3.213 ± 0.065
0.299AspCys: 0.299 ± 0.023
3.165AspAsp: 3.165 ± 0.084
4.856AspGlu: 4.856 ± 0.087
2.364AspPhe: 2.364 ± 0.055
3.306AspGly: 3.306 ± 0.085
1.289AspHis: 1.289 ± 0.043
5.03AspIle: 5.03 ± 0.08
3.572AspLys: 3.572 ± 0.077
5.107AspLeu: 5.107 ± 0.081
1.66AspMet: 1.66 ± 0.048
1.792AspAsn: 1.792 ± 0.05
1.965AspPro: 1.965 ± 0.052
1.957AspGln: 1.957 ± 0.061
2.243AspArg: 2.243 ± 0.054
2.41AspSer: 2.41 ± 0.055
2.574AspThr: 2.574 ± 0.06
4.162AspVal: 4.162 ± 0.077
0.588AspTrp: 0.588 ± 0.029
2.36AspTyr: 2.36 ± 0.059
0.0AspXaa: 0.0 ± 0.0
Glu
5.188GluAla: 5.188 ± 0.093
0.294GluCys: 0.294 ± 0.02
3.716GluAsp: 3.716 ± 0.09
6.577GluGlu: 6.577 ± 0.134
2.292GluPhe: 2.292 ± 0.057
3.844GluGly: 3.844 ± 0.08
1.772GluHis: 1.772 ± 0.043
5.607GluIle: 5.607 ± 0.092
7.08GluLys: 7.08 ± 0.127
6.667GluLeu: 6.667 ± 0.095
2.308GluMet: 2.308 ± 0.056
3.571GluAsn: 3.571 ± 0.076
2.029GluPro: 2.029 ± 0.057
3.781GluGln: 3.781 ± 0.085
3.761GluArg: 3.761 ± 0.078
3.314GluSer: 3.314 ± 0.071
4.244GluThr: 4.244 ± 0.081
4.795GluVal: 4.795 ± 0.097
0.771GluTrp: 0.771 ± 0.033
2.155GluTyr: 2.155 ± 0.045
0.0GluXaa: 0.0 ± 0.0
Phe
2.68PheAla: 2.68 ± 0.065
0.269PheCys: 0.269 ± 0.018
2.272PheAsp: 2.272 ± 0.051
2.539PheGlu: 2.539 ± 0.053
2.377PhePhe: 2.377 ± 0.071
3.117PheGly: 3.117 ± 0.072
1.019PheHis: 1.019 ± 0.032
4.68PheIle: 4.68 ± 0.116
2.703PheLys: 2.703 ± 0.058
4.519PheLeu: 4.519 ± 0.093
1.271PheMet: 1.271 ± 0.044
2.064PheAsn: 2.064 ± 0.05
1.641PhePro: 1.641 ± 0.04
1.564PheGln: 1.564 ± 0.046
1.529PheArg: 1.529 ± 0.047
3.108PheSer: 3.108 ± 0.073
2.678PheThr: 2.678 ± 0.061
3.016PheVal: 3.016 ± 0.07
0.423PheTrp: 0.423 ± 0.028
1.753PheTyr: 1.753 ± 0.056
0.0PheXaa: 0.0 ± 0.0
Gly
4.186GlyAla: 4.186 ± 0.087
0.514GlyCys: 0.514 ± 0.028
3.184GlyAsp: 3.184 ± 0.071
4.232GlyGlu: 4.232 ± 0.079
3.181GlyPhe: 3.181 ± 0.067
4.27GlyGly: 4.27 ± 0.094
1.494GlyHis: 1.494 ± 0.046
5.845GlyIle: 5.845 ± 0.094
5.073GlyLys: 5.073 ± 0.089
6.036GlyLeu: 6.036 ± 0.082
2.025GlyMet: 2.025 ± 0.049
2.382GlyAsn: 2.382 ± 0.067
1.641GlyPro: 1.641 ± 0.045
1.997GlyGln: 1.997 ± 0.059
2.552GlyArg: 2.552 ± 0.063
3.508GlySer: 3.508 ± 0.075
3.889GlyThr: 3.889 ± 0.068
4.658GlyVal: 4.658 ± 0.089
0.693GlyTrp: 0.693 ± 0.027
2.646GlyTyr: 2.646 ± 0.057
0.0GlyXaa: 0.0 ± 0.0
His
1.517HisAla: 1.517 ± 0.045
0.18HisCys: 0.18 ± 0.015
1.332HisAsp: 1.332 ± 0.041
1.508HisGlu: 1.508 ± 0.042
1.165HisPhe: 1.165 ± 0.043
1.451HisGly: 1.451 ± 0.049
0.867HisHis: 0.867 ± 0.036
2.18HisIle: 2.18 ± 0.053
1.233HisLys: 1.233 ± 0.039
2.525HisLeu: 2.525 ± 0.056
0.664HisMet: 0.664 ± 0.032
0.808HisAsn: 0.808 ± 0.037
1.248HisPro: 1.248 ± 0.045
0.925HisGln: 0.925 ± 0.033
0.973HisArg: 0.973 ± 0.036
1.162HisSer: 1.162 ± 0.04
1.217HisThr: 1.217 ± 0.043
1.926HisVal: 1.926 ± 0.045
0.211HisTrp: 0.211 ± 0.017
0.992HisTyr: 0.992 ± 0.036
0.0HisXaa: 0.0 ± 0.0
Ile
6.289IleAla: 6.289 ± 0.1
0.577IleCys: 0.577 ± 0.033
5.101IleAsp: 5.101 ± 0.081
6.278IleGlu: 6.278 ± 0.09
3.749IlePhe: 3.749 ± 0.091
6.34IleGly: 6.34 ± 0.122
2.025IleHis: 2.025 ± 0.052
7.385IleIle: 7.385 ± 0.136
5.432IleLys: 5.432 ± 0.09
7.802IleLeu: 7.802 ± 0.126
2.046IleMet: 2.046 ± 0.053
3.653IleAsn: 3.653 ± 0.068
3.614IlePro: 3.614 ± 0.064
3.239IleGln: 3.239 ± 0.065
3.325IleArg: 3.325 ± 0.063
5.186IleSer: 5.186 ± 0.089
4.623IleThr: 4.623 ± 0.078
6.251IleVal: 6.251 ± 0.098
0.678IleTrp: 0.678 ± 0.028
2.971IleTyr: 2.971 ± 0.066
0.0IleXaa: 0.0 ± 0.0
Lys
4.654LysAla: 4.654 ± 0.088
0.329LysCys: 0.329 ± 0.02
4.214LysAsp: 4.214 ± 0.076
7.111LysGlu: 7.111 ± 0.119
1.727LysPhe: 1.727 ± 0.047
4.308LysGly: 4.308 ± 0.067
1.877LysHis: 1.877 ± 0.049
5.017LysIle: 5.017 ± 0.084
6.396LysLys: 6.396 ± 0.111
5.803LysLeu: 5.803 ± 0.082
2.346LysMet: 2.346 ± 0.059
3.562LysAsn: 3.562 ± 0.075
2.278LysPro: 2.278 ± 0.062
4.171LysGln: 4.171 ± 0.069
4.013LysArg: 4.013 ± 0.086
3.553LysSer: 3.553 ± 0.071
3.982LysThr: 3.982 ± 0.067
4.351LysVal: 4.351 ± 0.076
0.777LysTrp: 0.777 ± 0.035
2.313LysTyr: 2.313 ± 0.063
0.0LysXaa: 0.0 ± 0.0
Leu
6.618LeuAla: 6.618 ± 0.094
0.533LeuCys: 0.533 ± 0.026
5.002LeuAsp: 5.002 ± 0.082
6.531LeuGlu: 6.531 ± 0.113
5.084LeuPhe: 5.084 ± 0.114
6.002LeuGly: 6.002 ± 0.113
1.995LeuHis: 1.995 ± 0.051
8.081LeuIle: 8.081 ± 0.142
6.992LeuLys: 6.992 ± 0.097
10.032LeuLeu: 10.032 ± 0.176
2.472LeuMet: 2.472 ± 0.063
4.432LeuAsn: 4.432 ± 0.068
3.838LeuPro: 3.838 ± 0.071
3.495LeuGln: 3.495 ± 0.063
3.546LeuArg: 3.546 ± 0.077
6.351LeuSer: 6.351 ± 0.101
5.883LeuThr: 5.883 ± 0.091
5.85LeuVal: 5.85 ± 0.089
0.725LeuTrp: 0.725 ± 0.035
3.148LeuTyr: 3.148 ± 0.068
0.0LeuXaa: 0.0 ± 0.0
Met
1.893MetAla: 1.893 ± 0.047
0.121MetCys: 0.121 ± 0.015
1.462MetAsp: 1.462 ± 0.047
1.836MetGlu: 1.836 ± 0.048
1.141MetPhe: 1.141 ± 0.042
1.595MetGly: 1.595 ± 0.047
0.466MetHis: 0.466 ± 0.022
2.629MetIle: 2.629 ± 0.056
2.536MetLys: 2.536 ± 0.052
2.612MetLeu: 2.612 ± 0.054
0.988MetMet: 0.988 ± 0.035
1.719MetAsn: 1.719 ± 0.05
1.011MetPro: 1.011 ± 0.038
0.911MetGln: 0.911 ± 0.035
1.219MetArg: 1.219 ± 0.04
1.772MetSer: 1.772 ± 0.045
1.98MetThr: 1.98 ± 0.052
1.709MetVal: 1.709 ± 0.049
0.193MetTrp: 0.193 ± 0.015
0.807MetTyr: 0.807 ± 0.036
0.0MetXaa: 0.0 ± 0.0
Asn
2.38AsnAla: 2.38 ± 0.061
0.287AsnCys: 0.287 ± 0.02
2.563AsnAsp: 2.563 ± 0.068
3.698AsnGlu: 3.698 ± 0.07
1.44AsnPhe: 1.44 ± 0.042
2.995AsnGly: 2.995 ± 0.073
1.308AsnHis: 1.308 ± 0.043
3.816AsnIle: 3.816 ± 0.069
3.549AsnLys: 3.549 ± 0.071
3.56AsnLeu: 3.56 ± 0.074
1.295AsnMet: 1.295 ± 0.043
2.012AsnAsn: 2.012 ± 0.06
1.953AsnPro: 1.953 ± 0.054
2.117AsnGln: 2.117 ± 0.051
2.232AsnArg: 2.232 ± 0.052
1.708AsnSer: 1.708 ± 0.051
2.121AsnThr: 2.121 ± 0.059
2.651AsnVal: 2.651 ± 0.061
0.446AsnTrp: 0.446 ± 0.024
1.461AsnTyr: 1.461 ± 0.045
0.0AsnXaa: 0.0 ± 0.0
Pro
1.921ProAla: 1.921 ± 0.05
0.207ProCys: 0.207 ± 0.017
1.949ProAsp: 1.949 ± 0.06
2.888ProGlu: 2.888 ± 0.062
1.989ProPhe: 1.989 ± 0.049
2.137ProGly: 2.137 ± 0.055
0.878ProHis: 0.878 ± 0.032
3.195ProIle: 3.195 ± 0.07
2.224ProLys: 2.224 ± 0.055
3.372ProLeu: 3.372 ± 0.057
0.802ProMet: 0.802 ± 0.037
1.767ProAsn: 1.767 ± 0.05
0.989ProPro: 0.989 ± 0.038
1.065ProGln: 1.065 ± 0.037
1.041ProArg: 1.041 ± 0.037
2.089ProSer: 2.089 ± 0.058
2.04ProThr: 2.04 ± 0.049
2.678ProVal: 2.678 ± 0.068
0.324ProTrp: 0.324 ± 0.021
1.489ProTyr: 1.489 ± 0.041
0.0ProXaa: 0.0 ± 0.0
Gln
2.892GlnAla: 2.892 ± 0.062
0.161GlnCys: 0.161 ± 0.013
1.853GlnAsp: 1.853 ± 0.054
2.768GlnGlu: 2.768 ± 0.059
1.837GlnPhe: 1.837 ± 0.048
1.931GlnGly: 1.931 ± 0.048
0.939GlnHis: 0.939 ± 0.037
3.131GlnIle: 3.131 ± 0.067
2.736GlnLys: 2.736 ± 0.06
4.581GlnLeu: 4.581 ± 0.094
1.28GlnMet: 1.28 ± 0.042
1.412GlnAsn: 1.412 ± 0.04
1.273GlnPro: 1.273 ± 0.047
1.849GlnGln: 1.849 ± 0.056
1.541GlnArg: 1.541 ± 0.046
2.328GlnSer: 2.328 ± 0.062
2.233GlnThr: 2.233 ± 0.052
2.476GlnVal: 2.476 ± 0.059
0.397GlnTrp: 0.397 ± 0.025
1.372GlnTyr: 1.372 ± 0.047
0.0GlnXaa: 0.0 ± 0.0
Arg
2.395ArgAla: 2.395 ± 0.062
0.266ArgCys: 0.266 ± 0.018
2.143ArgAsp: 2.143 ± 0.052
3.224ArgGlu: 3.224 ± 0.063
1.979ArgPhe: 1.979 ± 0.051
2.319ArgGly: 2.319 ± 0.057
1.092ArgHis: 1.092 ± 0.037
3.341ArgIle: 3.341 ± 0.07
3.464ArgLys: 3.464 ± 0.07
4.187ArgLeu: 4.187 ± 0.087
1.341ArgMet: 1.341 ± 0.044
1.777ArgAsn: 1.777 ± 0.046
1.33ArgPro: 1.33 ± 0.039
1.799ArgGln: 1.799 ± 0.051
2.006ArgArg: 2.006 ± 0.058
2.098ArgSer: 2.098 ± 0.062
2.155ArgThr: 2.155 ± 0.054
2.651ArgVal: 2.651 ± 0.065
0.37ArgTrp: 0.37 ± 0.024
1.693ArgTyr: 1.693 ± 0.046
0.0ArgXaa: 0.0 ± 0.0
Ser
3.097SerAla: 3.097 ± 0.066
0.333SerCys: 0.333 ± 0.022
2.954SerAsp: 2.954 ± 0.061
3.693SerGlu: 3.693 ± 0.073
3.099SerPhe: 3.099 ± 0.08
4.078SerGly: 4.078 ± 0.079
1.401SerHis: 1.401 ± 0.045
5.091SerIle: 5.091 ± 0.088
3.605SerLys: 3.605 ± 0.067
5.742SerLeu: 5.742 ± 0.104
1.564SerMet: 1.564 ± 0.045
2.258SerAsn: 2.258 ± 0.052
1.795SerPro: 1.795 ± 0.046
1.974SerGln: 1.974 ± 0.054
2.219SerArg: 2.219 ± 0.053
3.176SerSer: 3.176 ± 0.078
2.746SerThr: 2.746 ± 0.058
3.618SerVal: 3.618 ± 0.066
0.56SerTrp: 0.56 ± 0.029
2.219SerTyr: 2.219 ± 0.049
0.0SerXaa: 0.0 ± 0.0
Thr
3.643ThrAla: 3.643 ± 0.071
0.406ThrCys: 0.406 ± 0.023
3.075ThrAsp: 3.075 ± 0.069
3.676ThrGlu: 3.676 ± 0.073
2.841ThrPhe: 2.841 ± 0.065
4.005ThrGly: 4.005 ± 0.077
1.209ThrHis: 1.209 ± 0.037
5.259ThrIle: 5.259 ± 0.089
3.622ThrLys: 3.622 ± 0.077
5.449ThrLeu: 5.449 ± 0.091
1.38ThrMet: 1.38 ± 0.043
2.598ThrAsn: 2.598 ± 0.063
2.196ThrPro: 2.196 ± 0.058
1.438ThrGln: 1.438 ± 0.044
1.938ThrArg: 1.938 ± 0.053
3.113ThrSer: 3.113 ± 0.068
3.148ThrThr: 3.148 ± 0.069
4.225ThrVal: 4.225 ± 0.08
0.533ThrTrp: 0.533 ± 0.025
2.364ThrTyr: 2.364 ± 0.053
0.0ThrXaa: 0.0 ± 0.0
Val
4.397ValAla: 4.397 ± 0.088
0.517ValCys: 0.517 ± 0.026
3.832ValAsp: 3.832 ± 0.087
4.659ValGlu: 4.659 ± 0.08
3.115ValPhe: 3.115 ± 0.079
4.405ValGly: 4.405 ± 0.081
1.458ValHis: 1.458 ± 0.044
6.119ValIle: 6.119 ± 0.086
4.668ValLys: 4.668 ± 0.083
6.457ValLeu: 6.457 ± 0.108
1.854ValMet: 1.854 ± 0.051
2.98ValAsn: 2.98 ± 0.073
2.475ValPro: 2.475 ± 0.058
2.317ValGln: 2.317 ± 0.057
2.541ValArg: 2.541 ± 0.055
4.065ValSer: 4.065 ± 0.065
4.238ValThr: 4.238 ± 0.077
4.564ValVal: 4.564 ± 0.091
0.559ValTrp: 0.559 ± 0.029
2.338ValTyr: 2.338 ± 0.047
0.0ValXaa: 0.0 ± 0.0
Trp
0.451TrpAla: 0.451 ± 0.022
0.055TrpCys: 0.055 ± 0.007
0.533TrpAsp: 0.533 ± 0.027
0.612TrpGlu: 0.612 ± 0.028
0.488TrpPhe: 0.488 ± 0.027
0.574TrpGly: 0.574 ± 0.026
0.229TrpHis: 0.229 ± 0.018
0.852TrpIle: 0.852 ± 0.034
0.659TrpLys: 0.659 ± 0.03
1.1TrpLeu: 1.1 ± 0.044
0.296TrpMet: 0.296 ± 0.021
0.436TrpAsn: 0.436 ± 0.023
0.229TrpPro: 0.229 ± 0.018
0.352TrpGln: 0.352 ± 0.021
0.38TrpArg: 0.38 ± 0.023
0.501TrpSer: 0.501 ± 0.025
0.468TrpThr: 0.468 ± 0.026
0.641TrpVal: 0.641 ± 0.028
0.131TrpTrp: 0.131 ± 0.013
0.339TrpTyr: 0.339 ± 0.02
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.056TyrAla: 2.056 ± 0.052
0.276TyrCys: 0.276 ± 0.022
2.198TyrAsp: 2.198 ± 0.05
2.685TyrGlu: 2.685 ± 0.06
1.9TyrPhe: 1.9 ± 0.058
2.401TyrGly: 2.401 ± 0.055
1.108TyrHis: 1.108 ± 0.034
2.76TyrIle: 2.76 ± 0.058
2.227TyrLys: 2.227 ± 0.055
3.593TyrLeu: 3.593 ± 0.073
0.937TyrMet: 0.937 ± 0.033
1.406TyrAsn: 1.406 ± 0.044
1.426TyrPro: 1.426 ± 0.038
1.645TyrGln: 1.645 ± 0.038
1.769TyrArg: 1.769 ± 0.051
1.844TyrSer: 1.844 ± 0.054
1.979TyrThr: 1.979 ± 0.058
2.467TyrVal: 2.467 ± 0.055
0.366TyrTrp: 0.366 ± 0.022
1.564TyrTyr: 1.564 ± 0.042
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2778 proteins (778283 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski