Amino acid dipeptide frequency for Bacillus solimangrovi

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequencies.
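As an illustration of what a dipeptide frequency is, the following minimal sketch counts overlapping residue pairs inside each protein and normalizes by the total number of pairs. The function name, the one-letter input format and the toy sequences are assumptions made for this example; the exact counting procedure used by the database is not described on this page.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins.

    Assumption: sequences are one-letter-code strings, and pairs are
    counted with an overlapping window inside each protein only.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs; a pair never spans two different proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Made-up toy sequences, not taken from the proteome.
toy = ["MKKLA", "AAK"]
freqs = dipeptide_per_mille(toy)
```

With the toy input above there are six pairs in total, each seen once, so every observed dipeptide gets the same share and the shares sum to 1000‰.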

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
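The arithmetic behind the expected value can be sketched as follows. The function names are hypothetical, and the comparison rule (strictly above or below the uniform expectation) is an assumption; the page does not state the exact threshold used for the red and blue marking.

```python
def expected_uniform_per_mille(alphabet_size=20):
    """Expected frequency (per mille) if all dipeptides were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, alphabet_size=20):
    """Label a value relative to the uniform expectation (assumed rule)."""
    expected = expected_uniform_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


print(expected_uniform_per_mille(20))  # 2.5, i.e. 0.25%
print(expected_uniform_per_mille(21))  # about 2.27 when Xaa is included
print(classify(9.867))                 # LeuLeu value from the table: over
print(classify(0.085))                 # CysCys value from the table: under
```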

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
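For example, the unit conversion can be written as a short sketch; the helper names are hypothetical and only illustrate the scaling.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala = 4.598  # AlaAla value from the table, in per mille
fraction = per_mille_to_fraction(ala_ala)  # roughly 0.004598
percent = per_mille_to_percent(ala_ala)    # roughly 0.4598
```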

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
4.598AlaAla: 4.598 ± 0.094
0.508AlaCys: 0.508 ± 0.022
3.1AlaAsp: 3.1 ± 0.058
4.352AlaGlu: 4.352 ± 0.082
2.905AlaPhe: 2.905 ± 0.06
4.553AlaGly: 4.553 ± 0.09
1.228AlaHis: 1.228 ± 0.037
5.654AlaIle: 5.654 ± 0.081
4.272AlaLys: 4.272 ± 0.069
6.494AlaLeu: 6.494 ± 0.081
1.803AlaMet: 1.803 ± 0.042
2.889AlaAsn: 2.889 ± 0.057
1.813AlaPro: 1.813 ± 0.044
2.28AlaGln: 2.28 ± 0.048
2.328AlaArg: 2.328 ± 0.059
3.878AlaSer: 3.878 ± 0.067
3.32AlaThr: 3.32 ± 0.061
4.844AlaVal: 4.844 ± 0.08
0.524AlaTrp: 0.524 ± 0.023
2.114AlaTyr: 2.114 ± 0.048
0.0AlaXaa: 0.0 ± 0.0
Cys
0.433CysAla: 0.433 ± 0.021
0.085CysCys: 0.085 ± 0.01
0.378CysAsp: 0.378 ± 0.02
0.564CysGlu: 0.564 ± 0.025
0.292CysPhe: 0.292 ± 0.017
0.679CysGly: 0.679 ± 0.027
0.199CysHis: 0.199 ± 0.013
0.544CysIle: 0.544 ± 0.023
0.369CysLys: 0.369 ± 0.021
0.634CysLeu: 0.634 ± 0.025
0.197CysMet: 0.197 ± 0.014
0.334CysAsn: 0.334 ± 0.016
0.322CysPro: 0.322 ± 0.019
0.237CysGln: 0.237 ± 0.016
0.268CysArg: 0.268 ± 0.017
0.522CysSer: 0.522 ± 0.021
0.408CysThr: 0.408 ± 0.022
0.467CysVal: 0.467 ± 0.021
0.06CysTrp: 0.06 ± 0.007
0.282CysTyr: 0.282 ± 0.016
0.0CysXaa: 0.0 ± 0.0
Asp
3.112AspAla: 3.112 ± 0.072
0.396AspCys: 0.396 ± 0.021
2.879AspAsp: 2.879 ± 0.064
4.965AspGlu: 4.965 ± 0.095
2.341AspPhe: 2.341 ± 0.048
3.568AspGly: 3.568 ± 0.076
0.998AspHis: 0.998 ± 0.029
4.371AspIle: 4.371 ± 0.078
3.368AspLys: 3.368 ± 0.064
4.864AspLeu: 4.864 ± 0.082
1.337AspMet: 1.337 ± 0.037
2.418AspAsn: 2.418 ± 0.06
1.735AspPro: 1.735 ± 0.043
1.791AspGln: 1.791 ± 0.038
2.016AspArg: 2.016 ± 0.041
2.809AspSer: 2.809 ± 0.054
2.471AspThr: 2.471 ± 0.049
4.194AspVal: 4.194 ± 0.064
0.609AspTrp: 0.609 ± 0.027
2.202AspTyr: 2.202 ± 0.047
0.0AspXaa: 0.0 ± 0.0
Glu
5.157GluAla: 5.157 ± 0.092
0.381GluCys: 0.381 ± 0.018
3.996GluAsp: 3.996 ± 0.095
7.241GluGlu: 7.241 ± 0.113
2.575GluPhe: 2.575 ± 0.05
4.518GluGly: 4.518 ± 0.072
1.827GluHis: 1.827 ± 0.043
6.019GluIle: 6.019 ± 0.081
6.321GluLys: 6.321 ± 0.088
7.343GluLeu: 7.343 ± 0.1
2.418GluMet: 2.418 ± 0.045
3.912GluAsn: 3.912 ± 0.058
1.917GluPro: 1.917 ± 0.053
4.558GluGln: 4.558 ± 0.087
3.97GluArg: 3.97 ± 0.065
3.811GluSer: 3.811 ± 0.068
3.941GluThr: 3.941 ± 0.068
5.667GluVal: 5.667 ± 0.1
0.796GluTrp: 0.796 ± 0.029
2.346GluTyr: 2.346 ± 0.05
0.0GluXaa: 0.0 ± 0.0
Phe
2.802PheAla: 2.802 ± 0.058
0.315PheCys: 0.315 ± 0.015
2.645PheAsp: 2.645 ± 0.061
3.367PheGlu: 3.367 ± 0.057
2.249PhePhe: 2.249 ± 0.056
3.129PheGly: 3.129 ± 0.057
0.947PheHis: 0.947 ± 0.028
3.927PheIle: 3.927 ± 0.079
2.282PheLys: 2.282 ± 0.051
4.191PheLeu: 4.191 ± 0.078
1.073PheMet: 1.073 ± 0.039
2.198PheAsn: 2.198 ± 0.053
1.533PhePro: 1.533 ± 0.038
1.594PheGln: 1.594 ± 0.046
1.529PheArg: 1.529 ± 0.04
3.289PheSer: 3.289 ± 0.064
2.481PheThr: 2.481 ± 0.047
3.25PheVal: 3.25 ± 0.064
0.425PheTrp: 0.425 ± 0.021
1.672PheTyr: 1.672 ± 0.045
0.0PheXaa: 0.0 ± 0.0
Gly
4.435GlyAla: 4.435 ± 0.076
0.594GlyCys: 0.594 ± 0.026
3.201GlyAsp: 3.201 ± 0.066
4.537GlyGlu: 4.537 ± 0.072
3.402GlyPhe: 3.402 ± 0.067
4.403GlyGly: 4.403 ± 0.089
1.251GlyHis: 1.251 ± 0.034
6.002GlyIle: 6.002 ± 0.082
4.567GlyLys: 4.567 ± 0.071
6.194GlyLeu: 6.194 ± 0.096
1.971GlyMet: 1.971 ± 0.047
2.914GlyAsn: 2.914 ± 0.066
1.468GlyPro: 1.468 ± 0.037
2.105GlyGln: 2.105 ± 0.049
2.454GlyArg: 2.454 ± 0.049
3.739GlySer: 3.739 ± 0.057
3.929GlyThr: 3.929 ± 0.07
4.875GlyVal: 4.875 ± 0.066
0.718GlyTrp: 0.718 ± 0.028
2.682GlyTyr: 2.682 ± 0.052
0.0GlyXaa: 0.0 ± 0.0
His
1.266HisAla: 1.266 ± 0.042
0.203HisCys: 0.203 ± 0.014
1.109HisAsp: 1.109 ± 0.035
1.53HisGlu: 1.53 ± 0.042
0.975HisPhe: 0.975 ± 0.03
1.294HisGly: 1.294 ± 0.034
0.669HisHis: 0.669 ± 0.029
1.633HisIle: 1.633 ± 0.04
1.028HisLys: 1.028 ± 0.035
2.096HisLeu: 2.096 ± 0.043
0.561HisMet: 0.561 ± 0.021
0.933HisAsn: 0.933 ± 0.032
1.057HisPro: 1.057 ± 0.033
0.741HisGln: 0.741 ± 0.027
0.858HisArg: 0.858 ± 0.03
1.242HisSer: 1.242 ± 0.035
1.161HisThr: 1.161 ± 0.033
1.485HisVal: 1.485 ± 0.037
0.232HisTrp: 0.232 ± 0.015
0.917HisTyr: 0.917 ± 0.033
0.0HisXaa: 0.0 ± 0.0
Ile
5.616IleAla: 5.616 ± 0.091
0.706IleCys: 0.706 ± 0.027
4.888IleAsp: 4.888 ± 0.074
6.687IleGlu: 6.687 ± 0.1
3.317IlePhe: 3.317 ± 0.066
6.03IleGly: 6.03 ± 0.09
1.806IleHis: 1.806 ± 0.043
6.528IleIle: 6.528 ± 0.119
4.458IleLys: 4.458 ± 0.073
7.131IleLeu: 7.131 ± 0.107
1.86IleMet: 1.86 ± 0.044
3.829IleAsn: 3.829 ± 0.07
3.192IlePro: 3.192 ± 0.053
3.164IleGln: 3.164 ± 0.061
3.199IleArg: 3.199 ± 0.057
5.434IleSer: 5.434 ± 0.079
4.723IleThr: 4.723 ± 0.072
6.025IleVal: 6.025 ± 0.078
0.63IleTrp: 0.63 ± 0.023
2.622IleTyr: 2.622 ± 0.058
0.0IleXaa: 0.0 ± 0.0
Lys
4.04LysAla: 4.04 ± 0.082
0.437LysCys: 0.437 ± 0.022
3.576LysAsp: 3.576 ± 0.072
6.356LysGlu: 6.356 ± 0.091
2.015LysPhe: 2.015 ± 0.042
4.141LysGly: 4.141 ± 0.071
1.388LysHis: 1.388 ± 0.041
4.566LysIle: 4.566 ± 0.072
5.439LysLys: 5.439 ± 0.08
5.971LysLeu: 5.971 ± 0.081
2.136LysMet: 2.136 ± 0.052
3.165LysAsn: 3.165 ± 0.063
1.994LysPro: 1.994 ± 0.039
3.399LysGln: 3.399 ± 0.064
3.291LysArg: 3.291 ± 0.053
3.458LysSer: 3.458 ± 0.052
3.374LysThr: 3.374 ± 0.054
4.922LysVal: 4.922 ± 0.07
0.779LysTrp: 0.779 ± 0.03
2.191LysTyr: 2.191 ± 0.048
0.0LysXaa: 0.0 ± 0.0
Leu
6.367LeuAla: 6.367 ± 0.096
0.687LeuCys: 0.687 ± 0.028
4.65LeuAsp: 4.65 ± 0.072
6.744LeuGlu: 6.744 ± 0.089
4.768LeuPhe: 4.768 ± 0.079
5.942LeuGly: 5.942 ± 0.081
1.976LeuHis: 1.976 ± 0.049
7.788LeuIle: 7.788 ± 0.117
6.367LeuLys: 6.367 ± 0.09
9.867LeuLeu: 9.867 ± 0.151
2.338LeuMet: 2.338 ± 0.054
4.768LeuAsn: 4.768 ± 0.068
3.609LeuPro: 3.609 ± 0.067
3.744LeuGln: 3.744 ± 0.063
3.654LeuArg: 3.654 ± 0.058
6.902LeuSer: 6.902 ± 0.087
5.752LeuThr: 5.752 ± 0.075
6.121LeuVal: 6.121 ± 0.09
0.764LeuTrp: 0.764 ± 0.03
3.089LeuTyr: 3.089 ± 0.057
0.0LeuXaa: 0.0 ± 0.0
Met
1.728MetAla: 1.728 ± 0.043
0.156MetCys: 0.156 ± 0.012
1.27MetAsp: 1.27 ± 0.034
1.804MetGlu: 1.804 ± 0.045
1.228MetPhe: 1.228 ± 0.038
1.58MetGly: 1.58 ± 0.04
0.434MetHis: 0.434 ± 0.021
2.151MetIle: 2.151 ± 0.046
2.565MetLys: 2.565 ± 0.046
2.704MetLeu: 2.704 ± 0.057
0.937MetMet: 0.937 ± 0.028
1.772MetAsn: 1.772 ± 0.04
0.965MetPro: 0.965 ± 0.037
0.957MetGln: 0.957 ± 0.032
1.138MetArg: 1.138 ± 0.035
1.816MetSer: 1.816 ± 0.043
1.596MetThr: 1.596 ± 0.045
1.65MetVal: 1.65 ± 0.043
0.201MetTrp: 0.201 ± 0.014
0.812MetTyr: 0.812 ± 0.028
0.0MetXaa: 0.0 ± 0.0
Asn
2.791AsnAla: 2.791 ± 0.052
0.368AsnCys: 0.368 ± 0.025
2.828AsnAsp: 2.828 ± 0.061
4.506AsnGlu: 4.506 ± 0.07
1.731AsnPhe: 1.731 ± 0.038
3.566AsnGly: 3.566 ± 0.075
1.095AsnHis: 1.095 ± 0.03
3.926AsnIle: 3.926 ± 0.069
3.379AsnLys: 3.379 ± 0.061
4.066AsnLeu: 4.066 ± 0.068
1.36AsnMet: 1.36 ± 0.038
2.823AsnAsn: 2.823 ± 0.089
1.924AsnPro: 1.924 ± 0.049
1.978AsnGln: 1.978 ± 0.046
2.082AsnArg: 2.082 ± 0.047
2.673AsnSer: 2.673 ± 0.054
2.342AsnThr: 2.342 ± 0.06
3.625AsnVal: 3.625 ± 0.062
0.534AsnTrp: 0.534 ± 0.022
1.704AsnTyr: 1.704 ± 0.045
0.0AsnXaa: 0.0 ± 0.0
Pro
1.778ProAla: 1.778 ± 0.04
0.19ProCys: 0.19 ± 0.015
1.798ProAsp: 1.798 ± 0.044
2.727ProGlu: 2.727 ± 0.059
1.787ProPhe: 1.787 ± 0.043
1.804ProGly: 1.804 ± 0.045
0.756ProHis: 0.756 ± 0.03
2.974ProIle: 2.974 ± 0.053
2.029ProLys: 2.029 ± 0.048
3.37ProLeu: 3.37 ± 0.059
0.824ProMet: 0.824 ± 0.029
1.849ProAsn: 1.849 ± 0.049
0.931ProPro: 0.931 ± 0.028
1.061ProGln: 1.061 ± 0.031
1.005ProArg: 1.005 ± 0.029
2.201ProSer: 2.201 ± 0.046
2.032ProThr: 2.032 ± 0.049
2.561ProVal: 2.561 ± 0.057
0.309ProTrp: 0.309 ± 0.017
1.401ProTyr: 1.401 ± 0.036
0.0ProXaa: 0.0 ± 0.0
Gln
2.684GlnAla: 2.684 ± 0.057
0.239GlnCys: 0.239 ± 0.015
1.625GlnAsp: 1.625 ± 0.038
2.84GlnGlu: 2.84 ± 0.06
1.815GlnPhe: 1.815 ± 0.041
2.171GlnGly: 2.171 ± 0.047
0.853GlnHis: 0.853 ± 0.029
2.866GlnIle: 2.866 ± 0.057
2.553GlnLys: 2.553 ± 0.056
4.48GlnLeu: 4.48 ± 0.076
1.229GlnMet: 1.229 ± 0.036
1.701GlnAsn: 1.701 ± 0.045
1.259GlnPro: 1.259 ± 0.036
2.081GlnGln: 2.081 ± 0.061
1.577GlnArg: 1.577 ± 0.042
2.413GlnSer: 2.413 ± 0.06
2.282GlnThr: 2.282 ± 0.052
2.647GlnVal: 2.647 ± 0.064
0.418GlnTrp: 0.418 ± 0.021
1.49GlnTyr: 1.49 ± 0.039
0.0GlnXaa: 0.0 ± 0.0
Arg
2.35ArgAla: 2.35 ± 0.051
0.273ArgCys: 0.273 ± 0.016
2.024ArgAsp: 2.024 ± 0.045
3.251ArgGlu: 3.251 ± 0.075
1.922ArgPhe: 1.922 ± 0.041
2.363ArgGly: 2.363 ± 0.051
0.757ArgHis: 0.757 ± 0.026
3.168ArgIle: 3.168 ± 0.055
3.06ArgLys: 3.06 ± 0.065
3.859ArgLeu: 3.859 ± 0.066
1.256ArgMet: 1.256 ± 0.032
2.129ArgAsn: 2.129 ± 0.045
1.255ArgPro: 1.255 ± 0.034
1.509ArgGln: 1.509 ± 0.046
1.751ArgArg: 1.751 ± 0.049
2.323ArgSer: 2.323 ± 0.046
2.092ArgThr: 2.092 ± 0.046
2.648ArgVal: 2.648 ± 0.056
0.346ArgTrp: 0.346 ± 0.018
1.541ArgTyr: 1.541 ± 0.036
0.0ArgXaa: 0.0 ± 0.0
Ser
3.489SerAla: 3.489 ± 0.063
0.435SerCys: 0.435 ± 0.025
3.037SerAsp: 3.037 ± 0.056
4.408SerGlu: 4.408 ± 0.068
3.364SerPhe: 3.364 ± 0.071
4.289SerGly: 4.289 ± 0.067
1.287SerHis: 1.287 ± 0.038
5.477SerIle: 5.477 ± 0.085
3.899SerLys: 3.899 ± 0.052
6.271SerLeu: 6.271 ± 0.086
1.714SerMet: 1.714 ± 0.041
3.263SerAsn: 3.263 ± 0.064
2.011SerPro: 2.011 ± 0.046
2.056SerGln: 2.056 ± 0.053
2.133SerArg: 2.133 ± 0.048
4.233SerSer: 4.233 ± 0.084
3.23SerThr: 3.23 ± 0.058
4.232SerVal: 4.232 ± 0.072
0.578SerTrp: 0.578 ± 0.024
2.353SerTyr: 2.353 ± 0.052
0.0SerXaa: 0.0 ± 0.0
Thr
3.581ThrAla: 3.581 ± 0.063
0.356ThrCys: 0.356 ± 0.017
2.859ThrAsp: 2.859 ± 0.05
3.774ThrGlu: 3.774 ± 0.068
2.753ThrPhe: 2.753 ± 0.05
3.878ThrGly: 3.878 ± 0.064
1.041ThrHis: 1.041 ± 0.034
4.899ThrIle: 4.899 ± 0.063
3.237ThrLys: 3.237 ± 0.05
5.264ThrLeu: 5.264 ± 0.086
1.363ThrMet: 1.363 ± 0.033
2.781ThrAsn: 2.781 ± 0.053
2.175ThrPro: 2.175 ± 0.044
1.601ThrGln: 1.601 ± 0.044
1.855ThrArg: 1.855 ± 0.037
3.488ThrSer: 3.488 ± 0.063
2.99ThrThr: 2.99 ± 0.061
4.243ThrVal: 4.243 ± 0.071
0.499ThrTrp: 0.499 ± 0.027
2.047ThrTyr: 2.047 ± 0.051
0.0ThrXaa: 0.0 ± 0.0
Val
4.721ValAla: 4.721 ± 0.083
0.568ValCys: 0.568 ± 0.026
3.932ValAsp: 3.932 ± 0.07
5.483ValGlu: 5.483 ± 0.081
3.087ValPhe: 3.087 ± 0.062
4.534ValGly: 4.534 ± 0.077
1.427ValHis: 1.427 ± 0.036
5.987ValIle: 5.987 ± 0.079
4.619ValLys: 4.619 ± 0.075
6.624ValLeu: 6.624 ± 0.085
1.886ValMet: 1.886 ± 0.042
3.483ValAsn: 3.483 ± 0.063
2.621ValPro: 2.621 ± 0.055
2.649ValGln: 2.649 ± 0.046
2.798ValArg: 2.798 ± 0.056
4.727ValSer: 4.727 ± 0.075
4.24ValThr: 4.24 ± 0.074
5.18ValVal: 5.18 ± 0.093
0.612ValTrp: 0.612 ± 0.027
2.293ValTyr: 2.293 ± 0.05
0.0ValXaa: 0.0 ± 0.0
Trp
0.542TrpAla: 0.542 ± 0.025
0.076TrpCys: 0.076 ± 0.009
0.51TrpAsp: 0.51 ± 0.025
0.651TrpGlu: 0.651 ± 0.023
0.514TrpPhe: 0.514 ± 0.026
0.603TrpGly: 0.603 ± 0.03
0.197TrpHis: 0.197 ± 0.013
0.825TrpIle: 0.825 ± 0.029
0.673TrpLys: 0.673 ± 0.029
1.069TrpLeu: 1.069 ± 0.036
0.302TrpMet: 0.302 ± 0.016
0.514TrpAsn: 0.514 ± 0.023
0.216TrpPro: 0.216 ± 0.014
0.342TrpGln: 0.342 ± 0.018
0.351TrpArg: 0.351 ± 0.019
0.598TrpSer: 0.598 ± 0.025
0.475TrpThr: 0.475 ± 0.022
0.574TrpVal: 0.574 ± 0.027
0.135TrpTrp: 0.135 ± 0.012
0.372TrpTyr: 0.372 ± 0.02
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.965TyrAla: 1.965 ± 0.043
0.3TyrCys: 0.3 ± 0.019
2.138TyrAsp: 2.138 ± 0.047
2.93TyrGlu: 2.93 ± 0.055
1.858TyrPhe: 1.858 ± 0.047
2.397TyrGly: 2.397 ± 0.05
0.858TyrHis: 0.858 ± 0.028
2.539TyrIle: 2.539 ± 0.048
2.153TyrLys: 2.153 ± 0.051
3.405TyrLeu: 3.405 ± 0.066
0.897TyrMet: 0.897 ± 0.031
1.628TyrAsn: 1.628 ± 0.043
1.389TyrPro: 1.389 ± 0.041
1.375TyrGln: 1.375 ± 0.037
1.623TyrArg: 1.623 ± 0.044
2.202TyrSer: 2.202 ± 0.05
1.844TyrThr: 1.844 ± 0.044
2.26TyrVal: 2.26 ± 0.044
0.372TyrTrp: 0.372 ± 0.018
1.567TyrTyr: 1.567 ± 0.046
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
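When the table is copied as plain text, each cell appears as a displayed value fused with its label, for example "4.598AlaAla: 4.598 ± 0.094". The sketch below shows one possible way to parse such lines; the regular expression and the function name are assumptions for illustration, and the downloadable CSV file is likely the more convenient source.

```python
import re

# Displayed value, two three-letter codes, then "value ± error".
CELL = re.compile(
    r"^(?P<shown>\d+(?:\.\d+)?)"
    r"(?P<first>[A-Z][a-z]{2})(?P<second>[A-Z][a-z]{2}): "
    r"(?P<value>\d+(?:\.\d+)?) ± (?P<error>\d+(?:\.\d+)?)$"
)


def parse_cell(line):
    """Return (first, second, value, error) or None for non-cell lines."""
    match = CELL.match(line.strip())
    if match is None:
        return None  # e.g. a row label such as "Ala"
    return (
        match.group("first"),
        match.group("second"),
        float(match.group("value")),
        float(match.group("error")),
    )


cell = parse_cell("4.598AlaAla: 4.598 ± 0.094")
label = parse_cell("Ala")
```

Here `cell` holds the pair and its two numbers (both in per mille), while `label` is `None` because a bare row label does not match the cell pattern.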
Statistics based on 3530 proteins (1021236 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level
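A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the estimates is reported. This is only an illustration under stated assumptions (one-letter sequences, 100 replicates, standard deviation as the error, a fixed seed); the function names are hypothetical and the database's actual procedure may differ.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide's per mille frequency."""
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, not single residues.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Made-up toy sequences, not taken from the proteome.
mean, err = bootstrap_error(["MKKLA", "AAK", "KKKK"], "KK")
```

Resampling whole proteins rather than individual dipeptides keeps pairs from the same protein together, so the estimated error reflects variation between proteins.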

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski