Amino acid dipeptide frequency for Kutzneria albida DSM 43870

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
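As an illustration, dipeptide frequencies can be counted along these lines. This is a minimal sketch, not the Proteome-pI implementation: it assumes overlapping adjacent pairs are counted within each protein (never across protein boundaries), that counts are normalised by the total number of pairs, and that any residue outside the 20 standard ones is mapped to Xaa. The names `ONE_TO_THREE` and `dipeptide_per_mille` are hypothetical.

```python
from collections import Counter

# Three-letter codes for the 20 standard residues; anything else becomes Xaa.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for one-letter sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs within a single protein only.
        for first, second in zip(seq, seq[1:]):
            name = ONE_TO_THREE.get(first, "Xaa") + ONE_TO_THREE.get(second, "Xaa")
            counts[name] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {name: 1000.0 * n / total for name, n in counts.items()}


# Toy input: two made-up sequences, not real proteome data.
toy_proteins = ["MAAL", "AAG"]
toy_freqs = dipeptide_per_mille(toy_proteins)
```

In the toy input, the pair formed by the last residue of the first sequence and the first residue of the second is deliberately not counted.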

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each one would be expected at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
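The arithmetic behind the expected value, and the over/under comparison, can be sketched as follows. This assumes the threshold is the uniform expectation over the 400 standard dipeptides described above; the function name `representation` is hypothetical, and the three example values are taken from the table.

```python
N_STANDARD = 20

# 1 / (20 * 20) = 0.25 %, i.e. 2.5 per mille.
UNIFORM_PER_MILLE = 1000 / N_STANDARD**2


def representation(observed_per_mille, expected_per_mille=UNIFORM_PER_MILLE):
    """Classify a dipeptide relative to the uniform expectation."""
    if observed_per_mille > expected_per_mille:
        return "over"   # would be marked in red
    if observed_per_mille < expected_per_mille:
        return "under"  # would be marked in blue
    return "as expected"


# Values in per mille, copied from the table.
examples = {"AlaAla": 19.576, "CysCys": 0.114, "AspThr": 3.037}
labels = {name: representation(value) for name, value in examples.items()}
```

A stricter baseline would be the product of the two single amino acid frequencies, but that is not what the text above describes.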

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
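For instance, the unit conversion for a single table entry might look like this. It is a small sketch with hypothetical helper names; the AlaAla value is taken from the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per-mille value to a plain fraction."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per-mille value to percent."""
    return value_per_mille / 10


ala_ala_per_mille = 19.576
ala_ala_fraction = per_mille_to_fraction(ala_ala_per_mille)  # about 0.0196
ala_ala_percent = per_mille_to_percent(ala_ala_per_mille)    # about 1.96 %
```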

For more information, see the sequence space article on Wikipedia.

Second residue, in order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Entries are grouped by first residue; each entry gives the dipeptide, its frequency (‰) and the error.

Ala
AlaAla: 19.576 ± 0.124
AlaCys: 1.112 ± 0.02
AlaAsp: 7.656 ± 0.049
AlaGlu: 9.403 ± 0.076
AlaPhe: 3.259 ± 0.033
AlaGly: 12.058 ± 0.079
AlaHis: 2.762 ± 0.03
AlaIle: 3.73 ± 0.046
AlaLys: 2.792 ± 0.039
AlaLeu: 14.668 ± 0.093
AlaMet: 2.474 ± 0.03
AlaAsn: 2.504 ± 0.034
AlaPro: 5.918 ± 0.044
AlaGln: 4.342 ± 0.038
AlaArg: 9.338 ± 0.071
AlaSer: 5.86 ± 0.044
AlaThr: 6.999 ± 0.055
AlaVal: 12.575 ± 0.075
AlaTrp: 1.797 ± 0.026
AlaTyr: 2.263 ± 0.03
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 1.204 ± 0.022
CysCys: 0.114 ± 0.007
CysAsp: 0.472 ± 0.012
CysGlu: 0.475 ± 0.012
CysPhe: 0.245 ± 0.008
CysGly: 1.039 ± 0.021
CysHis: 0.207 ± 0.008
CysIle: 0.141 ± 0.007
CysLys: 0.126 ± 0.007
CysLeu: 0.845 ± 0.017
CysMet: 0.136 ± 0.007
CysAsn: 0.145 ± 0.007
CysPro: 0.511 ± 0.017
CysGln: 0.256 ± 0.01
CysArg: 0.639 ± 0.017
CysSer: 0.506 ± 0.015
CysThr: 0.58 ± 0.015
CysVal: 0.737 ± 0.016
CysTrp: 0.161 ± 0.007
CysTyr: 0.2 ± 0.008
CysXaa: 0.0 ± 0.0

Asp
AspAla: 6.223 ± 0.054
AspCys: 0.481 ± 0.014
AspAsp: 2.258 ± 0.028
AspGlu: 3.452 ± 0.038
AspPhe: 1.452 ± 0.027
AspGly: 5.387 ± 0.042
AspHis: 1.348 ± 0.021
AspIle: 1.512 ± 0.024
AspLys: 1.162 ± 0.021
AspLeu: 6.843 ± 0.064
AspMet: 0.728 ± 0.016
AspAsn: 1.174 ± 0.024
AspPro: 4.375 ± 0.037
AspGln: 2.112 ± 0.032
AspArg: 4.873 ± 0.044
AspSer: 2.825 ± 0.032
AspThr: 3.037 ± 0.033
AspVal: 4.313 ± 0.043
AspTrp: 1.046 ± 0.021
AspTyr: 1.295 ± 0.024
AspXaa: 0.0 ± 0.0

Glu
GluAla: 6.11 ± 0.057
GluCys: 0.386 ± 0.012
GluAsp: 2.839 ± 0.035
GluGlu: 2.718 ± 0.039
GluPhe: 1.787 ± 0.028
GluGly: 3.458 ± 0.041
GluHis: 1.846 ± 0.027
GluIle: 1.993 ± 0.029
GluLys: 0.872 ± 0.019
GluLeu: 8.265 ± 0.075
GluMet: 0.719 ± 0.018
GluAsn: 0.865 ± 0.017
GluPro: 3.193 ± 0.035
GluGln: 3.227 ± 0.044
GluArg: 4.993 ± 0.048
GluSer: 2.456 ± 0.026
GluThr: 2.133 ± 0.028
GluVal: 5.5 ± 0.049
GluTrp: 0.688 ± 0.018
GluTyr: 0.981 ± 0.017
GluXaa: 0.0 ± 0.0

Phe
PheAla: 3.782 ± 0.036
PheCys: 0.293 ± 0.01
PheAsp: 2.11 ± 0.029
PheGlu: 1.423 ± 0.023
PhePhe: 0.837 ± 0.019
PheGly: 3.214 ± 0.034
PheHis: 0.616 ± 0.015
PheIle: 0.593 ± 0.015
PheLys: 0.391 ± 0.012
PheLeu: 2.587 ± 0.034
PheMet: 0.351 ± 0.011
PheAsn: 0.548 ± 0.014
PhePro: 1.372 ± 0.021
PheGln: 0.758 ± 0.017
PheArg: 1.728 ± 0.023
PheSer: 1.556 ± 0.029
PheThr: 2.064 ± 0.029
PheVal: 2.415 ± 0.035
PheTrp: 0.398 ± 0.013
PheTyr: 0.574 ± 0.013
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 10.001 ± 0.067
GlyCys: 0.896 ± 0.019
GlyAsp: 4.572 ± 0.043
GlyGlu: 5.066 ± 0.044
GlyPhe: 2.877 ± 0.03
GlyGly: 8.245 ± 0.086
GlyHis: 2.208 ± 0.033
GlyIle: 3.206 ± 0.034
GlyLys: 2.281 ± 0.033
GlyLeu: 9.853 ± 0.062
GlyMet: 1.967 ± 0.029
GlyAsn: 1.886 ± 0.035
GlyPro: 4.521 ± 0.041
GlyGln: 3.342 ± 0.041
GlyArg: 6.782 ± 0.05
GlySer: 5.338 ± 0.054
GlyThr: 5.822 ± 0.057
GlyVal: 8.212 ± 0.059
GlyTrp: 1.758 ± 0.027
GlyTyr: 2.347 ± 0.033
GlyXaa: 0.0 ± 0.0

His
HisAla: 2.715 ± 0.033
HisCys: 0.268 ± 0.01
HisAsp: 1.221 ± 0.022
HisGlu: 1.414 ± 0.024
HisPhe: 0.633 ± 0.016
HisGly: 2.359 ± 0.031
HisHis: 0.683 ± 0.019
HisIle: 0.582 ± 0.014
HisLys: 0.342 ± 0.011
HisLeu: 2.528 ± 0.03
HisMet: 0.303 ± 0.009
HisAsn: 0.482 ± 0.013
HisPro: 1.714 ± 0.029
HisGln: 0.761 ± 0.017
HisArg: 2.111 ± 0.031
HisSer: 1.14 ± 0.019
HisThr: 1.254 ± 0.024
HisVal: 1.94 ± 0.026
HisTrp: 0.409 ± 0.013
HisTyr: 0.531 ± 0.013
HisXaa: 0.0 ± 0.0

Ile
IleAla: 4.813 ± 0.04
IleCys: 0.291 ± 0.011
IleAsp: 2.04 ± 0.03
IleGlu: 1.89 ± 0.026
IlePhe: 0.596 ± 0.015
IleGly: 3.724 ± 0.039
IleHis: 0.525 ± 0.012
IleIle: 0.8 ± 0.02
IleLys: 0.629 ± 0.016
IleLeu: 1.998 ± 0.031
IleMet: 0.405 ± 0.012
IleAsn: 0.779 ± 0.018
IlePro: 1.683 ± 0.028
IleGln: 0.738 ± 0.019
IleArg: 2.151 ± 0.029
IleSer: 1.957 ± 0.027
IleThr: 2.428 ± 0.033
IleVal: 2.489 ± 0.035
IleTrp: 0.343 ± 0.012
IleTyr: 0.544 ± 0.014
IleXaa: 0.0 ± 0.0

Lys
LysAla: 2.445 ± 0.037
LysCys: 0.104 ± 0.005
LysAsp: 0.943 ± 0.02
LysGlu: 0.747 ± 0.019
LysPhe: 0.455 ± 0.013
LysGly: 1.358 ± 0.026
LysHis: 0.415 ± 0.013
LysIle: 0.747 ± 0.016
LysLys: 0.415 ± 0.015
LysLeu: 2.202 ± 0.035
LysMet: 0.313 ± 0.01
LysAsn: 0.358 ± 0.01
LysPro: 1.327 ± 0.026
LysGln: 0.772 ± 0.02
LysArg: 1.287 ± 0.021
LysSer: 1.05 ± 0.021
LysThr: 1.015 ± 0.019
LysVal: 1.872 ± 0.028
LysTrp: 0.267 ± 0.011
LysTyr: 0.369 ± 0.014
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 17.104 ± 0.095
LeuCys: 0.975 ± 0.021
LeuAsp: 7.219 ± 0.061
LeuGlu: 4.308 ± 0.045
LeuPhe: 2.833 ± 0.041
LeuGly: 10.136 ± 0.067
LeuHis: 2.618 ± 0.031
LeuIle: 3.27 ± 0.039
LeuLys: 1.459 ± 0.023
LeuLeu: 12.323 ± 0.09
LeuMet: 1.405 ± 0.023
LeuAsn: 1.875 ± 0.027
LeuPro: 6.876 ± 0.064
LeuGln: 2.036 ± 0.025
LeuArg: 9.479 ± 0.067
LeuSer: 6.206 ± 0.038
LeuThr: 6.984 ± 0.049
LeuVal: 10.961 ± 0.075
LeuTrp: 1.365 ± 0.025
LeuTyr: 1.756 ± 0.025
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.093 ± 0.028
MetCys: 0.127 ± 0.007
MetAsp: 0.817 ± 0.017
MetGlu: 0.521 ± 0.016
MetPhe: 0.438 ± 0.012
MetGly: 1.173 ± 0.022
MetHis: 0.347 ± 0.011
MetIle: 0.697 ± 0.016
MetLys: 0.302 ± 0.011
MetLeu: 1.725 ± 0.024
MetMet: 0.25 ± 0.01
MetAsn: 0.402 ± 0.011
MetPro: 1.039 ± 0.021
MetGln: 0.417 ± 0.012
MetArg: 1.405 ± 0.025
MetSer: 1.292 ± 0.024
MetThr: 1.42 ± 0.023
MetVal: 1.343 ± 0.018
MetTrp: 0.194 ± 0.008
MetTyr: 0.268 ± 0.011
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.324 ± 0.032
AsnCys: 0.2 ± 0.009
AsnAsp: 0.938 ± 0.018
AsnGlu: 0.908 ± 0.016
AsnPhe: 0.516 ± 0.013
AsnGly: 2.168 ± 0.035
AsnHis: 0.422 ± 0.012
AsnIle: 0.567 ± 0.015
AsnLys: 0.385 ± 0.015
AsnLeu: 1.989 ± 0.032
AsnMet: 0.284 ± 0.01
AsnAsn: 0.498 ± 0.019
AsnPro: 1.579 ± 0.027
AsnGln: 0.671 ± 0.017
AsnArg: 1.417 ± 0.026
AsnSer: 1.179 ± 0.023
AsnThr: 1.244 ± 0.025
AsnVal: 1.317 ± 0.021
AsnTrp: 0.349 ± 0.012
AsnTyr: 0.508 ± 0.015
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 7.704 ± 0.051
ProCys: 0.384 ± 0.01
ProAsp: 3.807 ± 0.037
ProGlu: 4.202 ± 0.044
ProPhe: 1.486 ± 0.022
ProGly: 6.04 ± 0.062
ProHis: 1.253 ± 0.022
ProIle: 1.605 ± 0.023
ProLys: 1.114 ± 0.02
ProLeu: 5.385 ± 0.045
ProMet: 1.026 ± 0.022
ProAsn: 1.171 ± 0.022
ProPro: 3.14 ± 0.044
ProGln: 1.872 ± 0.029
ProArg: 3.598 ± 0.035
ProSer: 3.072 ± 0.036
ProThr: 3.407 ± 0.038
ProVal: 5.832 ± 0.045
ProTrp: 0.935 ± 0.017
ProTyr: 1.07 ± 0.022
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 4.412 ± 0.039
GlnCys: 0.254 ± 0.008
GlnAsp: 1.661 ± 0.023
GlnGlu: 1.332 ± 0.019
GlnPhe: 0.919 ± 0.017
GlnGly: 2.392 ± 0.031
GlnHis: 0.872 ± 0.017
GlnIle: 1.091 ± 0.02
GlnLys: 0.448 ± 0.014
GlnLeu: 4.063 ± 0.036
GlnMet: 0.44 ± 0.014
GlnAsn: 0.544 ± 0.017
GlnPro: 2.08 ± 0.03
GlnGln: 1.59 ± 0.035
GlnArg: 3.096 ± 0.038
GlnSer: 1.502 ± 0.023
GlnThr: 1.438 ± 0.026
GlnVal: 3.512 ± 0.043
GlnTrp: 0.597 ± 0.015
GlnTyr: 0.676 ± 0.017
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 9.637 ± 0.071
ArgCys: 0.732 ± 0.017
ArgAsp: 3.854 ± 0.04
ArgGlu: 4.708 ± 0.049
ArgPhe: 2.455 ± 0.029
ArgGly: 5.767 ± 0.05
ArgHis: 2.003 ± 0.027
ArgIle: 2.939 ± 0.031
ArgLys: 1.55 ± 0.026
ArgLeu: 8.807 ± 0.07
ArgMet: 1.737 ± 0.027
ArgAsn: 1.426 ± 0.022
ArgPro: 4.47 ± 0.048
ArgGln: 2.722 ± 0.036
ArgArg: 7.136 ± 0.064
ArgSer: 4.072 ± 0.041
ArgThr: 4.698 ± 0.038
ArgVal: 6.311 ± 0.048
ArgTrp: 1.454 ± 0.023
ArgTyr: 1.854 ± 0.024
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 7.2 ± 0.049
SerCys: 0.501 ± 0.013
SerAsp: 2.387 ± 0.029
SerGlu: 2.491 ± 0.029
SerPhe: 1.593 ± 0.025
SerGly: 6.01 ± 0.051
SerHis: 1.008 ± 0.019
SerIle: 1.595 ± 0.025
SerLys: 1.022 ± 0.021
SerLeu: 5.192 ± 0.047
SerMet: 1.125 ± 0.02
SerAsn: 0.992 ± 0.022
SerPro: 3.145 ± 0.037
SerGln: 1.567 ± 0.025
SerArg: 3.696 ± 0.04
SerSer: 3.094 ± 0.037
SerThr: 3.764 ± 0.04
SerVal: 4.74 ± 0.046
SerTrp: 1.103 ± 0.019
SerTyr: 1.281 ± 0.023
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 8.44 ± 0.063
ThrCys: 0.475 ± 0.013
ThrAsp: 3.211 ± 0.033
ThrGlu: 3.61 ± 0.032
ThrPhe: 1.488 ± 0.027
ThrGly: 6.643 ± 0.051
ThrHis: 1.091 ± 0.018
ThrIle: 1.916 ± 0.026
ThrLys: 1.139 ± 0.022
ThrLeu: 5.381 ± 0.044
ThrMet: 0.914 ± 0.018
ThrAsn: 1.13 ± 0.023
ThrPro: 3.777 ± 0.041
ThrGln: 1.519 ± 0.022
ThrArg: 3.711 ± 0.04
ThrSer: 3.407 ± 0.04
ThrThr: 4.042 ± 0.076
ThrVal: 6.088 ± 0.058
ThrTrp: 0.943 ± 0.018
ThrTyr: 1.221 ± 0.028
ThrXaa: 0.0 ± 0.0

Val
ValAla: 11.286 ± 0.087
ValCys: 0.785 ± 0.015
ValAsp: 5.929 ± 0.057
ValGlu: 4.761 ± 0.047
ValPhe: 2.566 ± 0.035
ValGly: 7.043 ± 0.061
ValHis: 2.299 ± 0.031
ValIle: 2.918 ± 0.033
ValLys: 1.46 ± 0.025
ValLeu: 12.054 ± 0.087
ValMet: 1.214 ± 0.02
ValAsn: 1.859 ± 0.029
ValPro: 5.401 ± 0.052
ValGln: 2.594 ± 0.03
ValArg: 7.556 ± 0.051
ValSer: 4.909 ± 0.045
ValThr: 5.574 ± 0.051
ValVal: 9.162 ± 0.08
ValTrp: 1.131 ± 0.02
ValTyr: 1.564 ± 0.027
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.715 ± 0.024
TrpCys: 0.171 ± 0.008
TrpAsp: 0.77 ± 0.022
TrpGlu: 0.585 ± 0.013
TrpPhe: 0.533 ± 0.016
TrpGly: 1.027 ± 0.02
TrpHis: 0.426 ± 0.013
TrpIle: 0.535 ± 0.015
TrpLys: 0.263 ± 0.01
TrpLeu: 2.052 ± 0.03
TrpMet: 0.272 ± 0.01
TrpAsn: 0.387 ± 0.012
TrpPro: 0.847 ± 0.019
TrpGln: 0.813 ± 0.018
TrpArg: 1.435 ± 0.023
TrpSer: 0.993 ± 0.02
TrpThr: 1.051 ± 0.018
TrpVal: 1.11 ± 0.02
TrpTrp: 0.358 ± 0.011
TrpTyr: 0.33 ± 0.011
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.303 ± 0.03
TyrCys: 0.189 ± 0.008
TyrAsp: 1.191 ± 0.02
TyrGlu: 1.015 ± 0.019
TyrPhe: 0.645 ± 0.016
TyrGly: 1.939 ± 0.028
TyrHis: 0.439 ± 0.012
TyrIle: 0.385 ± 0.012
TyrLys: 0.309 ± 0.01
TyrLeu: 2.43 ± 0.031
TyrMet: 0.209 ± 0.009
TyrAsn: 0.436 ± 0.014
TyrPro: 1.134 ± 0.021
TyrGln: 0.808 ± 0.016
TyrArg: 1.9 ± 0.029
TyrSer: 1.091 ± 0.02
TyrThr: 1.194 ± 0.021
TyrVal: 1.636 ± 0.028
TyrTrp: 0.377 ± 0.012
TyrTyr: 0.481 ± 0.014
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 8775 proteins (2947740 amino acids)
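To work with the entries listed above programmatically, they could be parsed roughly as follows. This is a sketch that assumes each entry contains the pattern "FirstSecond: value ± error" with three-letter residue codes; the regular expression and the names `ENTRY`, `parse_entry` and `build_matrix` are hypothetical. Text before the dipeptide name is ignored, so a leading copy of the value does not matter.

```python
import re

# Two three-letter codes, a colon, the value, "±", and the error.
ENTRY = re.compile(r"([A-Z][a-z]{2})([A-Z][a-z]{2}): ([0-9.]+) ± ([0-9.]+)")


def parse_entry(line):
    """Return (first, second, value, error), or None for non-entry lines."""
    match = ENTRY.search(line)
    if match is None:
        return None
    first, second, value, error = match.groups()
    return first, second, float(value), float(error)


def build_matrix(lines):
    """Collect entries into {first: {second: (value, error)}}."""
    matrix = {}
    for line in lines:
        parsed = parse_entry(line)
        if parsed is None:
            continue  # row labels such as "Ala" are skipped
        first, second, value, error = parsed
        matrix.setdefault(first, {})[second] = (value, error)
    return matrix


sample_lines = ["Ala", "19.576AlaAla: 19.576 ± 0.124", "AlaCys: 1.112 ± 0.02"]
sample_matrix = build_matrix(sample_lines)
```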

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
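A protein-level bootstrap of this kind could be sketched as below. The details are assumptions rather than the documented Proteome-pI procedure: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the reported error is taken to be the sample standard deviation across replicates. The function names and the toy sequences are hypothetical.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency (per mille) of a one-letter dipeptide such as 'AA'."""
    hits = 0
    total = 0
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            total += 1
            if first + second == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy sequences, not real proteome data.
toy_set = ["MAAL", "AAG", "GGAV", "MKLA"]
toy_error = bootstrap_error(toy_set, "AA")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects protein-to-protein variation.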

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski