Amino acid dipeptide frequency for Bacillus sp. YR335

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those which are underrepresented are marked in blue in the table.
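The counting described above can be sketched in a few lines of Python. This is only an illustration, not the pipeline used to build the table: it assumes one-letter residue codes, counts overlapping pairs within each protein, maps any non-standard letter to X (Xaa), and uses the uniform 1/400 expectation from the text as the baseline. The names `dipeptide_per_mille` and `classify` are hypothetical.

```python
from collections import Counter

# The 20 standard residues in one-letter code (assumption: the table itself
# uses three-letter codes; one-letter codes keep the sketch short).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation from the text: 1/400 = 0.25 % = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / len(STANDARD) ** 2


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Pairs are counted within a protein only, never across proteins.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_per_mille, expected=UNIFORM_PER_MILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


if __name__ == "__main__":
    freqs = dipeptide_per_mille(["AAAC", "AC"])
    for dp, f in sorted(freqs.items()):
        print(dp, round(f, 3), classify(f))
```

With the table values, `classify(4.891)` labels AlaAla as overrepresented and `classify(0.567)` labels AlaCys as underrepresented relative to 2.5‰.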

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Residue order (first residue in rows, second residue in columns): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 4.891 ± 0.069
AlaCys: 0.567 ± 0.021
AlaAsp: 3.117 ± 0.047
AlaGlu: 4.247 ± 0.064
AlaPhe: 3.095 ± 0.049
AlaGly: 4.903 ± 0.076
AlaHis: 1.158 ± 0.027
AlaIle: 6.04 ± 0.074
AlaLys: 4.565 ± 0.061
AlaLeu: 6.687 ± 0.078
AlaMet: 1.873 ± 0.037
AlaAsn: 2.939 ± 0.049
AlaPro: 1.897 ± 0.039
AlaGln: 2.091 ± 0.047
AlaArg: 2.383 ± 0.044
AlaSer: 3.937 ± 0.058
AlaThr: 3.592 ± 0.056
AlaVal: 4.859 ± 0.069
AlaTrp: 0.571 ± 0.021
AlaTyr: 2.195 ± 0.036
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.42 ± 0.018
CysCys: 0.118 ± 0.009
CysAsp: 0.415 ± 0.016
CysGlu: 0.512 ± 0.021
CysPhe: 0.354 ± 0.015
CysGly: 0.676 ± 0.022
CysHis: 0.21 ± 0.013
CysIle: 0.593 ± 0.019
CysLys: 0.417 ± 0.018
CysLeu: 0.716 ± 0.025
CysMet: 0.192 ± 0.012
CysAsn: 0.308 ± 0.015
CysPro: 0.346 ± 0.016
CysGln: 0.248 ± 0.013
CysArg: 0.267 ± 0.014
CysSer: 0.556 ± 0.022
CysThr: 0.395 ± 0.016
CysVal: 0.428 ± 0.019
CysTrp: 0.069 ± 0.007
CysTyr: 0.269 ± 0.014
CysXaa: 0.0 ± 0.0

Asp
AspAla: 3.042 ± 0.045
AspCys: 0.395 ± 0.016
AspAsp: 2.449 ± 0.053
AspGlu: 4.281 ± 0.062
AspPhe: 2.509 ± 0.039
AspGly: 3.21 ± 0.051
AspHis: 1.248 ± 0.03
AspIle: 4.101 ± 0.052
AspLys: 3.149 ± 0.052
AspLeu: 5.077 ± 0.065
AspMet: 1.295 ± 0.029
AspAsn: 1.832 ± 0.037
AspPro: 1.856 ± 0.038
AspGln: 2.102 ± 0.044
AspArg: 2.14 ± 0.042
AspSer: 2.583 ± 0.044
AspThr: 2.372 ± 0.047
AspVal: 3.845 ± 0.057
AspTrp: 0.585 ± 0.021
AspTyr: 2.148 ± 0.038
AspXaa: 0.0 ± 0.0

Glu
GluAla: 4.888 ± 0.074
GluCys: 0.431 ± 0.016
GluAsp: 3.667 ± 0.063
GluGlu: 6.622 ± 0.092
GluPhe: 2.654 ± 0.049
GluGly: 4.19 ± 0.058
GluHis: 1.48 ± 0.035
GluIle: 5.968 ± 0.068
GluLys: 6.7 ± 0.077
GluLeu: 7.219 ± 0.086
GluMet: 2.242 ± 0.034
GluAsn: 3.986 ± 0.047
GluPro: 1.838 ± 0.046
GluGln: 3.467 ± 0.065
GluArg: 3.296 ± 0.058
GluSer: 3.544 ± 0.049
GluThr: 3.859 ± 0.06
GluVal: 5.055 ± 0.061
GluTrp: 0.792 ± 0.022
GluTyr: 2.324 ± 0.043
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.967 ± 0.053
PheCys: 0.369 ± 0.016
PheAsp: 2.41 ± 0.036
PheGlu: 2.932 ± 0.044
PhePhe: 2.438 ± 0.056
PheGly: 3.381 ± 0.06
PheHis: 1.059 ± 0.028
PheIle: 4.105 ± 0.067
PheLys: 2.568 ± 0.047
PheLeu: 4.661 ± 0.082
PheMet: 1.122 ± 0.025
PheAsn: 1.998 ± 0.032
PhePro: 1.636 ± 0.034
PheGln: 1.628 ± 0.037
PheArg: 1.461 ± 0.035
PheSer: 3.354 ± 0.048
PheThr: 2.676 ± 0.042
PheVal: 3.261 ± 0.044
PheTrp: 0.469 ± 0.018
PheTyr: 1.668 ± 0.034
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 4.762 ± 0.071
GlyCys: 0.641 ± 0.023
GlyAsp: 3.081 ± 0.057
GlyGlu: 4.479 ± 0.074
GlyPhe: 3.329 ± 0.048
GlyGly: 4.7 ± 0.07
GlyHis: 1.323 ± 0.033
GlyIle: 6.186 ± 0.073
GlyLys: 4.998 ± 0.056
GlyLeu: 6.482 ± 0.073
GlyMet: 2.073 ± 0.043
GlyAsn: 2.86 ± 0.055
GlyPro: 1.765 ± 0.061
GlyGln: 2.11 ± 0.04
GlyArg: 2.514 ± 0.044
GlySer: 4.045 ± 0.064
GlyThr: 3.941 ± 0.053
GlyVal: 5.023 ± 0.068
GlyTrp: 0.767 ± 0.029
GlyTyr: 2.759 ± 0.045
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.246 ± 0.031
HisCys: 0.188 ± 0.012
HisAsp: 1.044 ± 0.028
HisGlu: 1.362 ± 0.036
HisPhe: 1.151 ± 0.033
HisGly: 1.339 ± 0.033
HisHis: 0.705 ± 0.026
HisIle: 1.511 ± 0.033
HisLys: 1.097 ± 0.03
HisLeu: 2.146 ± 0.042
HisMet: 0.48 ± 0.019
HisAsn: 0.78 ± 0.023
HisPro: 1.127 ± 0.029
HisGln: 0.858 ± 0.022
HisArg: 0.863 ± 0.027
HisSer: 1.323 ± 0.034
HisThr: 1.034 ± 0.026
HisVal: 1.398 ± 0.03
HisTrp: 0.214 ± 0.012
HisTyr: 0.91 ± 0.025
HisXaa: 0.0 ± 0.0

Ile
IleAla: 6.068 ± 0.066
IleCys: 0.725 ± 0.023
IleAsp: 4.65 ± 0.06
IleGlu: 6.261 ± 0.082
IlePhe: 3.485 ± 0.063
IleGly: 6.329 ± 0.081
IleHis: 1.743 ± 0.037
IleIle: 6.546 ± 0.094
IleLys: 5.031 ± 0.067
IleLeu: 7.482 ± 0.1
IleMet: 1.886 ± 0.037
IleAsn: 3.69 ± 0.058
IlePro: 3.422 ± 0.045
IleGln: 3.014 ± 0.05
IleArg: 3.046 ± 0.045
IleSer: 5.493 ± 0.065
IleThr: 4.586 ± 0.052
IleVal: 6.036 ± 0.081
IleTrp: 0.688 ± 0.02
IleTyr: 2.561 ± 0.043
IleXaa: 0.0 ± 0.0

Lys
LysAla: 4.463 ± 0.058
LysCys: 0.365 ± 0.016
LysAsp: 3.837 ± 0.055
LysGlu: 6.929 ± 0.089
LysPhe: 2.047 ± 0.038
LysGly: 4.481 ± 0.056
LysHis: 1.319 ± 0.035
LysIle: 5.158 ± 0.062
LysLys: 5.96 ± 0.077
LysLeu: 6.158 ± 0.057
LysMet: 2.229 ± 0.037
LysAsn: 3.722 ± 0.063
LysPro: 2.138 ± 0.042
LysGln: 3.323 ± 0.062
LysArg: 3.3 ± 0.055
LysSer: 3.784 ± 0.054
LysThr: 3.639 ± 0.045
LysVal: 4.77 ± 0.059
LysTrp: 0.806 ± 0.022
LysTyr: 2.261 ± 0.047
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 6.888 ± 0.074
LeuCys: 0.742 ± 0.025
LeuAsp: 4.839 ± 0.063
LeuGlu: 6.548 ± 0.08
LeuPhe: 4.949 ± 0.086
LeuGly: 6.46 ± 0.078
LeuHis: 1.983 ± 0.04
LeuIle: 7.707 ± 0.089
LeuLys: 6.796 ± 0.07
LeuLeu: 10.206 ± 0.13
LeuMet: 2.359 ± 0.043
LeuAsn: 4.631 ± 0.058
LeuPro: 3.879 ± 0.057
LeuGln: 3.624 ± 0.052
LeuArg: 3.384 ± 0.043
LeuSer: 6.966 ± 0.074
LeuThr: 5.783 ± 0.065
LeuVal: 6.496 ± 0.074
LeuTrp: 0.795 ± 0.028
LeuTyr: 3.076 ± 0.047
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.893 ± 0.042
MetCys: 0.159 ± 0.01
MetAsp: 1.328 ± 0.031
MetGlu: 1.801 ± 0.033
MetPhe: 1.074 ± 0.03
MetGly: 1.747 ± 0.038
MetHis: 0.397 ± 0.018
MetIle: 2.378 ± 0.044
MetLys: 2.508 ± 0.038
MetLeu: 2.522 ± 0.045
MetMet: 0.872 ± 0.03
MetAsn: 1.662 ± 0.032
MetPro: 0.931 ± 0.023
MetGln: 0.777 ± 0.022
MetArg: 0.938 ± 0.022
MetSer: 1.754 ± 0.034
MetThr: 1.64 ± 0.033
MetVal: 1.737 ± 0.037
MetTrp: 0.19 ± 0.014
MetTyr: 0.722 ± 0.018
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.69 ± 0.045
AsnCys: 0.362 ± 0.019
AsnAsp: 2.479 ± 0.044
AsnGlu: 3.969 ± 0.057
AsnPhe: 1.719 ± 0.035
AsnGly: 3.502 ± 0.056
AsnHis: 1.162 ± 0.029
AsnIle: 3.628 ± 0.049
AsnLys: 3.231 ± 0.051
AsnLeu: 4.229 ± 0.055
AsnMet: 1.259 ± 0.031
AsnAsn: 2.242 ± 0.052
AsnPro: 2.183 ± 0.047
AsnGln: 2.172 ± 0.041
AsnArg: 2.033 ± 0.04
AsnSer: 2.614 ± 0.052
AsnThr: 2.272 ± 0.045
AsnVal: 3.247 ± 0.048
AsnTrp: 0.497 ± 0.017
AsnTyr: 1.606 ± 0.037
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.037 ± 0.042
ProCys: 0.195 ± 0.012
ProAsp: 1.811 ± 0.038
ProGlu: 2.761 ± 0.047
ProPhe: 1.996 ± 0.037
ProGly: 2.15 ± 0.056
ProHis: 0.778 ± 0.023
ProIle: 3.047 ± 0.048
ProLys: 2.192 ± 0.049
ProLeu: 3.436 ± 0.053
ProMet: 0.806 ± 0.025
ProAsn: 1.754 ± 0.038
ProPro: 0.94 ± 0.029
ProGln: 1.121 ± 0.031
ProArg: 1.02 ± 0.026
ProSer: 2.31 ± 0.044
ProThr: 2.135 ± 0.04
ProVal: 2.584 ± 0.048
ProTrp: 0.37 ± 0.019
ProTyr: 1.451 ± 0.03
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 2.569 ± 0.044
GlnCys: 0.219 ± 0.012
GlnAsp: 1.636 ± 0.033
GlnGlu: 2.756 ± 0.05
GlnPhe: 1.769 ± 0.038
GlnGly: 2.233 ± 0.047
GlnHis: 0.797 ± 0.024
GlnIle: 2.838 ± 0.044
GlnLys: 2.779 ± 0.047
GlnLeu: 4.058 ± 0.069
GlnMet: 1.09 ± 0.031
GlnAsn: 1.795 ± 0.04
GlnPro: 1.204 ± 0.03
GlnGln: 1.961 ± 0.077
GlnArg: 1.475 ± 0.034
GlnSer: 2.291 ± 0.049
GlnThr: 2.013 ± 0.043
GlnVal: 2.438 ± 0.03
GlnTrp: 0.412 ± 0.017
GlnTyr: 1.317 ± 0.031
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 2.204 ± 0.045
ArgCys: 0.243 ± 0.014
ArgAsp: 1.989 ± 0.043
ArgGlu: 2.999 ± 0.049
ArgPhe: 1.832 ± 0.035
ArgGly: 2.236 ± 0.044
ArgHis: 0.752 ± 0.022
ArgIle: 3.036 ± 0.05
ArgLys: 3.115 ± 0.048
ArgLeu: 3.724 ± 0.066
ArgMet: 1.197 ± 0.027
ArgAsn: 1.95 ± 0.045
ArgPro: 1.273 ± 0.03
ArgGln: 1.423 ± 0.031
ArgArg: 1.636 ± 0.037
ArgSer: 2.117 ± 0.039
ArgThr: 1.987 ± 0.037
ArgVal: 2.516 ± 0.043
ArgTrp: 0.394 ± 0.017
ArgTyr: 1.418 ± 0.033
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 3.581 ± 0.056
SerCys: 0.413 ± 0.016
SerAsp: 2.842 ± 0.046
SerGlu: 4.072 ± 0.053
SerPhe: 3.486 ± 0.056
SerGly: 4.419 ± 0.057
SerHis: 1.23 ± 0.033
SerIle: 5.534 ± 0.06
SerLys: 4.094 ± 0.048
SerLeu: 6.436 ± 0.073
SerMet: 1.743 ± 0.042
SerAsn: 2.876 ± 0.047
SerPro: 2.115 ± 0.037
SerGln: 2.025 ± 0.04
SerArg: 2.187 ± 0.037
SerSer: 4.172 ± 0.068
SerThr: 3.381 ± 0.055
SerVal: 4.164 ± 0.058
SerTrp: 0.639 ± 0.022
SerTyr: 2.325 ± 0.041
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 3.636 ± 0.05
ThrCys: 0.365 ± 0.015
ThrAsp: 2.603 ± 0.052
ThrGlu: 3.539 ± 0.054
ThrPhe: 2.766 ± 0.047
ThrGly: 4.049 ± 0.069
ThrHis: 1.042 ± 0.029
ThrIle: 5.048 ± 0.066
ThrLys: 3.645 ± 0.051
ThrLeu: 5.398 ± 0.065
ThrMet: 1.343 ± 0.029
ThrAsn: 2.678 ± 0.045
ThrPro: 2.174 ± 0.041
ThrGln: 1.485 ± 0.035
ThrArg: 1.691 ± 0.036
ThrSer: 3.531 ± 0.053
ThrThr: 3.143 ± 0.054
ThrVal: 4.146 ± 0.061
ThrTrp: 0.546 ± 0.019
ThrTyr: 2.018 ± 0.036
ThrXaa: 0.0 ± 0.0

Val
ValAla: 4.854 ± 0.064
ValCys: 0.582 ± 0.021
ValAsp: 3.676 ± 0.056
ValGlu: 4.968 ± 0.067
ValPhe: 3.131 ± 0.048
ValGly: 4.753 ± 0.062
ValHis: 1.326 ± 0.035
ValIle: 5.972 ± 0.068
ValLys: 4.872 ± 0.067
ValLeu: 6.722 ± 0.07
ValMet: 1.811 ± 0.035
ValAsn: 3.353 ± 0.049
ValPro: 2.497 ± 0.045
ValGln: 2.376 ± 0.039
ValArg: 2.511 ± 0.04
ValSer: 4.572 ± 0.061
ValThr: 4.121 ± 0.06
ValVal: 5.001 ± 0.07
ValTrp: 0.569 ± 0.022
ValTyr: 2.249 ± 0.042
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.549 ± 0.021
TrpCys: 0.09 ± 0.009
TrpAsp: 0.481 ± 0.017
TrpGlu: 0.677 ± 0.023
TrpPhe: 0.507 ± 0.019
TrpGly: 0.67 ± 0.021
TrpHis: 0.215 ± 0.013
TrpIle: 0.833 ± 0.026
TrpLys: 0.758 ± 0.025
TrpLeu: 1.122 ± 0.03
TrpMet: 0.325 ± 0.014
TrpAsn: 0.549 ± 0.022
TrpPro: 0.214 ± 0.014
TrpGln: 0.328 ± 0.017
TrpArg: 0.377 ± 0.016
TrpSer: 0.613 ± 0.021
TrpThr: 0.463 ± 0.019
TrpVal: 0.639 ± 0.023
TrpTrp: 0.133 ± 0.01
TrpTyr: 0.334 ± 0.015
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 1.958 ± 0.039
TyrCys: 0.338 ± 0.014
TyrAsp: 1.865 ± 0.037
TyrGlu: 2.507 ± 0.041
TyrPhe: 1.872 ± 0.038
TyrGly: 2.413 ± 0.04
TyrHis: 0.848 ± 0.026
TyrIle: 2.513 ± 0.043
TyrLys: 2.217 ± 0.041
TyrLeu: 3.628 ± 0.056
TyrMet: 0.835 ± 0.024
TyrAsn: 1.492 ± 0.031
TyrPro: 1.457 ± 0.033
TyrGln: 1.518 ± 0.03
TyrArg: 1.519 ± 0.033
TyrSer: 2.176 ± 0.034
TyrThr: 1.813 ± 0.038
TyrVal: 2.271 ± 0.038
TyrTrp: 0.369 ± 0.018
TyrTyr: 1.431 ± 0.032
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 5212 proteins (1454700 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
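Bootstrapping at the protein level can be read as resampling whole proteins with replacement and recomputing the frequency each time. The sketch below follows that reading; it is an assumption about the procedure, not the database's actual code. The function names, the use of the standard deviation of the replicates as the reported error, and the fixed seed are all hypothetical choices made for illustration.

```python
import random
from collections import Counter
from statistics import stdev


def pair_counts(seq):
    """Count overlapping residue pairs within one protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Estimate the spread (per mille) of one dipeptide frequency.

    Each round draws len(proteins) proteins with replacement, so the
    resampling unit is the protein, not the individual dipeptide.
    Requires n_rounds >= 2, because a standard deviation needs two values.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)  # Counter yields 0 if absent
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


if __name__ == "__main__":
    toy = ["AAAC", "ACAC", "CCAA", "AACA"]
    print(bootstrap_error(toy, "AA"))
```

Resampling proteins rather than single dipeptides keeps the correlation between pairs inside one protein, which is presumably why the note specifies the protein level.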

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski