Amino acid dipeptide frequency for Paenibacillus sabinae T27

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
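The calculation can be sketched as follows. This is a minimal illustration, not the code used to build this page: it assumes that dipeptides are counted as overlapping pairs within each protein, that sequences use one-letter codes (the table uses three-letter codes), and that every letter outside the 20 standard ones is mapped to X (Xaa). The function name dipeptide_frequencies is hypothetical.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumptions (not stated on the page): overlapping windows of length 2,
    no pairs spanning two proteins, non-standard letters mapped to 'X'.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Two toy sequences: 4 + 3 = 7 dipeptides in total, "MK" occurs twice.
freqs = dipeptide_frequencies(["MKKLA", "MKAA"])
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

Counting per protein keeps the last residue of one protein from pairing with the first residue of the next.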

There are 400 possibilities (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely to occur in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
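The arithmetic behind the 0.25% baseline and the per mille conversion can be sketched as below. The helper names are hypothetical, and comparing each value against the uniform baseline is an assumption made for illustration; the page does not state the exact rule used for the red and blue marking.

```python
def uniform_expectation_per_mille(n_residue_types=20):
    """Per mille frequency of each dipeptide if all pairs were equally likely."""
    return 1000.0 / (n_residue_types ** 2)


def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def compare_to_uniform(value_per_mille, n_residue_types=20):
    """Label a value relative to the uniform baseline (illustrative rule only)."""
    expected = uniform_expectation_per_mille(n_residue_types)
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"


print(uniform_expectation_per_mille())    # baseline for 20 x 20 = 400 pairs
print(uniform_expectation_per_mille(21))  # baseline for 21 x 21 = 441 pairs
print(compare_to_uniform(10.752))         # AlaAla value from the table
print(compare_to_uniform(0.116))          # CysCys value from the table
```

With 400 possible pairs the baseline is 1000/400 = 2.5‰, so the AlaAla value of 10.752‰ lies above it and the CysCys value of 0.116‰ lies below it.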

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± error), grouped by first residue. Within each group the second residue follows the order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 10.752 ± 0.148
AlaCys: 0.754 ± 0.022
AlaAsp: 4.508 ± 0.061
AlaGlu: 6.374 ± 0.074
AlaPhe: 3.476 ± 0.049
AlaGly: 8.214 ± 0.099
AlaHis: 1.386 ± 0.03
AlaIle: 5.091 ± 0.07
AlaLys: 4.35 ± 0.059
AlaLeu: 8.954 ± 0.094
AlaMet: 2.266 ± 0.042
AlaAsn: 2.427 ± 0.043
AlaPro: 3.13 ± 0.058
AlaGln: 2.645 ± 0.049
AlaArg: 3.972 ± 0.05
AlaSer: 5.423 ± 0.071
AlaThr: 3.275 ± 0.057
AlaVal: 7.433 ± 0.086
AlaTrp: 0.895 ± 0.026
AlaTyr: 2.67 ± 0.042
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.583 ± 0.019
CysCys: 0.116 ± 0.008
CysAsp: 0.392 ± 0.018
CysGlu: 0.448 ± 0.021
CysPhe: 0.307 ± 0.017
CysGly: 0.883 ± 0.031
CysHis: 0.189 ± 0.012
CysIle: 0.5 ± 0.019
CysLys: 0.305 ± 0.016
CysLeu: 0.797 ± 0.023
CysMet: 0.202 ± 0.011
CysAsn: 0.263 ± 0.015
CysPro: 0.406 ± 0.019
CysGln: 0.216 ± 0.012
CysArg: 0.54 ± 0.02
CysSer: 0.596 ± 0.02
CysThr: 0.405 ± 0.018
CysVal: 0.468 ± 0.019
CysTrp: 0.09 ± 0.008
CysTyr: 0.251 ± 0.013
CysXaa: 0.0 ± 0.0

Asp
AspAla: 3.928 ± 0.057
AspCys: 0.39 ± 0.017
AspAsp: 2.246 ± 0.041
AspGlu: 3.68 ± 0.053
AspPhe: 2.172 ± 0.039
AspGly: 4.016 ± 0.06
AspHis: 1.056 ± 0.03
AspIle: 3.624 ± 0.054
AspLys: 2.754 ± 0.049
AspLeu: 4.889 ± 0.051
AspMet: 1.31 ± 0.03
AspAsn: 1.797 ± 0.036
AspPro: 2.337 ± 0.037
AspGln: 1.566 ± 0.032
AspArg: 2.861 ± 0.048
AspSer: 2.972 ± 0.056
AspThr: 2.508 ± 0.045
AspVal: 3.278 ± 0.052
AspTrp: 0.785 ± 0.021
AspTyr: 1.994 ± 0.041
AspXaa: 0.0 ± 0.0

Glu
GluAla: 6.368 ± 0.077
GluCys: 0.426 ± 0.019
GluAsp: 3.249 ± 0.05
GluGlu: 5.804 ± 0.088
GluPhe: 2.138 ± 0.041
GluGly: 4.795 ± 0.06
GluHis: 1.471 ± 0.038
GluIle: 4.406 ± 0.059
GluLys: 3.821 ± 0.059
GluLeu: 7.393 ± 0.081
GluMet: 2.059 ± 0.045
GluAsn: 2.399 ± 0.044
GluPro: 2.34 ± 0.043
GluGln: 3.27 ± 0.054
GluArg: 4.373 ± 0.066
GluSer: 3.657 ± 0.044
GluThr: 3.386 ± 0.049
GluVal: 4.378 ± 0.058
GluTrp: 0.883 ± 0.028
GluTyr: 1.989 ± 0.043
GluXaa: 0.0 ± 0.0

Phe
PheAla: 3.406 ± 0.048
PheCys: 0.362 ± 0.015
PheAsp: 2.108 ± 0.035
PheGlu: 2.254 ± 0.042
PhePhe: 1.785 ± 0.036
PheGly: 3.451 ± 0.046
PheHis: 0.886 ± 0.026
PheIle: 2.78 ± 0.046
PheLys: 1.862 ± 0.038
PheLeu: 3.805 ± 0.062
PheMet: 1.103 ± 0.031
PheAsn: 1.518 ± 0.029
PhePro: 1.657 ± 0.036
PheGln: 1.249 ± 0.03
PheArg: 2.11 ± 0.042
PheSer: 2.796 ± 0.047
PheThr: 2.313 ± 0.043
PheVal: 2.763 ± 0.044
PheTrp: 0.529 ± 0.02
PheTyr: 1.427 ± 0.031
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 6.358 ± 0.078
GlyCys: 0.795 ± 0.027
GlyAsp: 3.723 ± 0.052
GlyGlu: 5.038 ± 0.062
GlyPhe: 3.38 ± 0.048
GlyGly: 6.558 ± 0.088
GlyHis: 1.56 ± 0.032
GlyIle: 5.969 ± 0.069
GlyLys: 4.682 ± 0.057
GlyLeu: 8.013 ± 0.087
GlyMet: 2.463 ± 0.047
GlyAsn: 2.722 ± 0.047
GlyPro: 2.262 ± 0.042
GlyGln: 2.652 ± 0.046
GlyArg: 4.269 ± 0.064
GlySer: 5.35 ± 0.075
GlyThr: 4.568 ± 0.055
GlyVal: 5.446 ± 0.06
GlyTrp: 1.089 ± 0.031
GlyTyr: 3.032 ± 0.052
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.478 ± 0.029
HisCys: 0.19 ± 0.011
HisAsp: 0.926 ± 0.028
HisGlu: 1.167 ± 0.03
HisPhe: 0.95 ± 0.026
HisGly: 1.523 ± 0.037
HisHis: 0.528 ± 0.025
HisIle: 1.304 ± 0.035
HisLys: 0.822 ± 0.027
HisLeu: 2.113 ± 0.065
HisMet: 0.501 ± 0.02
HisAsn: 0.621 ± 0.021
HisPro: 1.265 ± 0.033
HisGln: 0.64 ± 0.022
HisArg: 1.132 ± 0.028
HisSer: 1.24 ± 0.031
HisThr: 1.057 ± 0.025
HisVal: 1.331 ± 0.028
HisTrp: 0.268 ± 0.014
HisTyr: 0.813 ± 0.022
HisXaa: 0.0 ± 0.0

Ile
IleAla: 6.062 ± 0.076
IleCys: 0.619 ± 0.022
IleAsp: 3.43 ± 0.046
IleGlu: 3.997 ± 0.059
IlePhe: 2.402 ± 0.045
IleGly: 5.595 ± 0.064
IleHis: 1.38 ± 0.031
IleIle: 4.135 ± 0.065
IleLys: 2.956 ± 0.049
IleLeu: 5.879 ± 0.078
IleMet: 1.549 ± 0.038
IleAsn: 2.333 ± 0.042
IlePro: 3.174 ± 0.048
IleGln: 2.203 ± 0.041
IleArg: 3.798 ± 0.048
IleSer: 4.451 ± 0.063
IleThr: 3.647 ± 0.054
IleVal: 4.89 ± 0.065
IleTrp: 0.662 ± 0.024
IleTyr: 1.932 ± 0.041
IleXaa: 0.0 ± 0.0

Lys
LysAla: 4.634 ± 0.059
LysCys: 0.273 ± 0.015
LysAsp: 2.853 ± 0.048
LysGlu: 4.36 ± 0.062
LysPhe: 1.499 ± 0.037
LysGly: 3.78 ± 0.057
LysHis: 0.95 ± 0.026
LysIle: 3.089 ± 0.048
LysLys: 3.102 ± 0.05
LysLeu: 5.396 ± 0.063
LysMet: 1.464 ± 0.031
LysAsn: 1.941 ± 0.039
LysPro: 2.294 ± 0.038
LysGln: 2.079 ± 0.039
LysArg: 2.886 ± 0.05
LysSer: 2.994 ± 0.051
LysThr: 2.746 ± 0.045
LysVal: 3.634 ± 0.051
LysTrp: 0.692 ± 0.021
LysTyr: 1.79 ± 0.035
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 9.072 ± 0.099
LeuCys: 0.824 ± 0.024
LeuAsp: 5.161 ± 0.06
LeuGlu: 6.615 ± 0.067
LeuPhe: 4.376 ± 0.073
LeuGly: 7.498 ± 0.078
LeuHis: 2.055 ± 0.042
LeuIle: 6.575 ± 0.082
LeuLys: 5.613 ± 0.075
LeuLeu: 11.657 ± 0.123
LeuMet: 2.591 ± 0.042
LeuAsn: 3.696 ± 0.057
LeuPro: 4.864 ± 0.06
LeuGln: 3.603 ± 0.051
LeuArg: 5.523 ± 0.068
LeuSer: 7.42 ± 0.069
LeuThr: 5.697 ± 0.065
LeuVal: 6.324 ± 0.072
LeuTrp: 1.037 ± 0.024
LeuTyr: 3.205 ± 0.053
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.319 ± 0.038
MetCys: 0.16 ± 0.011
MetAsp: 1.472 ± 0.032
MetGlu: 1.797 ± 0.042
MetPhe: 0.931 ± 0.027
MetGly: 1.736 ± 0.036
MetHis: 0.446 ± 0.015
MetIle: 1.835 ± 0.037
MetLys: 1.907 ± 0.038
MetLeu: 2.898 ± 0.046
MetMet: 0.791 ± 0.025
MetAsn: 1.431 ± 0.033
MetPro: 1.099 ± 0.026
MetGln: 0.882 ± 0.024
MetArg: 1.339 ± 0.032
MetSer: 1.762 ± 0.037
MetThr: 1.602 ± 0.033
MetVal: 1.64 ± 0.031
MetTrp: 0.186 ± 0.012
MetTyr: 0.692 ± 0.022
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.788 ± 0.041
AsnCys: 0.259 ± 0.014
AsnAsp: 1.694 ± 0.037
AsnGlu: 2.285 ± 0.039
AsnPhe: 1.251 ± 0.029
AsnGly: 3.007 ± 0.054
AsnHis: 0.758 ± 0.022
AsnIle: 2.3 ± 0.047
AsnLys: 1.831 ± 0.042
AsnLeu: 3.309 ± 0.049
AsnMet: 0.945 ± 0.025
AsnAsn: 1.402 ± 0.037
AsnPro: 2.061 ± 0.043
AsnGln: 1.242 ± 0.027
AsnArg: 2.117 ± 0.04
AsnSer: 1.935 ± 0.037
AsnThr: 1.873 ± 0.043
AsnVal: 2.434 ± 0.039
AsnTrp: 0.461 ± 0.019
AsnTyr: 1.206 ± 0.033
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 3.873 ± 0.057
ProCys: 0.278 ± 0.013
ProAsp: 2.703 ± 0.051
ProGlu: 3.864 ± 0.061
ProPhe: 1.847 ± 0.042
ProGly: 3.61 ± 0.056
ProHis: 0.841 ± 0.024
ProIle: 2.199 ± 0.038
ProLys: 1.9 ± 0.039
ProLeu: 4.294 ± 0.067
ProMet: 0.906 ± 0.027
ProAsn: 1.303 ± 0.028
ProPro: 1.447 ± 0.038
ProGln: 1.423 ± 0.033
ProArg: 1.662 ± 0.032
ProSer: 2.72 ± 0.041
ProThr: 1.685 ± 0.039
ProVal: 3.611 ± 0.05
ProTrp: 0.485 ± 0.019
ProTyr: 1.586 ± 0.036
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 3.259 ± 0.05
GlnCys: 0.194 ± 0.014
GlnAsp: 1.59 ± 0.032
GlnGlu: 2.488 ± 0.048
GlnPhe: 1.328 ± 0.028
GlnGly: 2.589 ± 0.044
GlnHis: 0.709 ± 0.024
GlnIle: 2.25 ± 0.041
GlnLys: 1.776 ± 0.037
GlnLeu: 3.641 ± 0.052
GlnMet: 1.038 ± 0.027
GlnAsn: 1.187 ± 0.027
GlnPro: 1.458 ± 0.035
GlnGln: 1.53 ± 0.038
GlnArg: 1.884 ± 0.038
GlnSer: 2.175 ± 0.042
GlnThr: 1.782 ± 0.036
GlnVal: 2.124 ± 0.035
GlnTrp: 0.453 ± 0.018
GlnTyr: 1.224 ± 0.029
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 3.762 ± 0.045
ArgCys: 0.443 ± 0.019
ArgAsp: 2.553 ± 0.049
ArgGlu: 4.347 ± 0.067
ArgPhe: 2.256 ± 0.035
ArgGly: 3.505 ± 0.059
ArgHis: 1.173 ± 0.036
ArgIle: 3.776 ± 0.049
ArgLys: 3.284 ± 0.047
ArgLeu: 6.049 ± 0.079
ArgMet: 1.663 ± 0.036
ArgAsn: 1.938 ± 0.037
ArgPro: 2.012 ± 0.042
ArgGln: 2.188 ± 0.04
ArgArg: 3.481 ± 0.068
ArgSer: 3.485 ± 0.052
ArgThr: 2.795 ± 0.045
ArgVal: 3.066 ± 0.044
ArgTrp: 0.643 ± 0.024
ArgTyr: 1.941 ± 0.037
ArgXaa: 0.001 ± 0.001

Ser
SerAla: 5.664 ± 0.072
SerCys: 0.481 ± 0.019
SerAsp: 3.021 ± 0.052
SerGlu: 3.855 ± 0.055
SerPhe: 2.894 ± 0.048
SerGly: 6.236 ± 0.071
SerHis: 1.168 ± 0.03
SerIle: 4.153 ± 0.066
SerLys: 3.058 ± 0.045
SerLeu: 6.797 ± 0.075
SerMet: 1.696 ± 0.034
SerAsn: 1.906 ± 0.037
SerPro: 2.888 ± 0.054
SerGln: 1.907 ± 0.032
SerArg: 3.702 ± 0.052
SerSer: 4.387 ± 0.068
SerThr: 2.906 ± 0.053
SerVal: 4.743 ± 0.064
SerTrp: 0.772 ± 0.025
SerTyr: 2.135 ± 0.036
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 4.934 ± 0.069
ThrCys: 0.341 ± 0.015
ThrAsp: 2.674 ± 0.043
ThrGlu: 3.199 ± 0.047
ThrPhe: 2.171 ± 0.037
ThrGly: 4.999 ± 0.055
ThrHis: 0.957 ± 0.026
ThrIle: 3.221 ± 0.044
ThrLys: 2.255 ± 0.039
ThrLeu: 5.389 ± 0.072
ThrMet: 1.209 ± 0.026
ThrAsn: 1.599 ± 0.038
ThrPro: 2.593 ± 0.05
ThrGln: 1.337 ± 0.031
ThrArg: 2.354 ± 0.042
ThrSer: 3.056 ± 0.047
ThrThr: 2.512 ± 0.048
ThrVal: 4.553 ± 0.061
ThrTrp: 0.538 ± 0.02
ThrTyr: 1.628 ± 0.033
ThrXaa: 0.0 ± 0.0

Val
ValAla: 5.249 ± 0.077
ValCys: 0.651 ± 0.022
ValAsp: 3.434 ± 0.05
ValGlu: 4.113 ± 0.063
ValPhe: 2.964 ± 0.049
ValGly: 4.38 ± 0.062
ValHis: 1.4 ± 0.034
ValIle: 4.958 ± 0.061
ValLys: 3.912 ± 0.054
ValLeu: 7.504 ± 0.074
ValMet: 1.956 ± 0.039
ValAsn: 2.756 ± 0.051
ValPro: 3.12 ± 0.045
ValGln: 2.504 ± 0.04
ValArg: 3.586 ± 0.051
ValSer: 4.963 ± 0.054
ValThr: 4.316 ± 0.066
ValVal: 4.775 ± 0.063
ValTrp: 0.764 ± 0.027
ValTyr: 2.426 ± 0.038
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.824 ± 0.027
TrpCys: 0.079 ± 0.007
TrpAsp: 0.599 ± 0.021
TrpGlu: 0.738 ± 0.026
TrpPhe: 0.493 ± 0.017
TrpGly: 0.802 ± 0.025
TrpHis: 0.223 ± 0.013
TrpIle: 0.875 ± 0.024
TrpLys: 0.703 ± 0.023
TrpLeu: 1.374 ± 0.035
TrpMet: 0.415 ± 0.015
TrpAsn: 0.654 ± 0.023
TrpPro: 0.339 ± 0.014
TrpGln: 0.425 ± 0.016
TrpArg: 0.642 ± 0.023
TrpSer: 0.791 ± 0.026
TrpThr: 0.633 ± 0.021
TrpVal: 0.717 ± 0.022
TrpTrp: 0.176 ± 0.011
TrpTyr: 0.332 ± 0.017
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.682 ± 0.039
TyrCys: 0.323 ± 0.018
TyrAsp: 1.827 ± 0.038
TyrGlu: 2.182 ± 0.042
TyrPhe: 1.544 ± 0.034
TyrGly: 2.75 ± 0.051
TyrHis: 0.734 ± 0.024
TyrIle: 2.053 ± 0.038
TyrLys: 1.562 ± 0.032
TyrLeu: 3.255 ± 0.051
TyrMet: 0.799 ± 0.027
TyrAsn: 1.265 ± 0.032
TyrPro: 1.491 ± 0.032
TyrGln: 1.115 ± 0.03
TyrArg: 2.131 ± 0.043
TyrSer: 2.197 ± 0.039
TyrThr: 1.813 ± 0.033
TyrVal: 2.123 ± 0.04
TyrTrp: 0.426 ± 0.018
TyrTyr: 1.283 ± 0.03
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.001 ± 0.001
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.001 ± 0.001
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 4776 proteins (1487397 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
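A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the reported error is taken to be the standard deviation across replicates. The page does not say which spread measure is used, and the function names are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Assumption: the resampling unit is the whole protein, and the error is
    the sample standard deviation over n_boot bootstrap replicates.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy proteome in one-letter codes; real input would be thousands of proteins.
toy = ["MKKLA", "MKAA", "MAAKL", "MLLKA"]
mean, err = bootstrap_error(toy, "MK")
print(f"MK: {mean:.3f} ± {err:.3f}")
```

Resampling proteins rather than individual dipeptides keeps pairs from the same protein together, so the spread reflects variation between proteins.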

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski