Amino acid dipepetide frequency for Aureimonas ureilytica

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
17.1AlaAla: 17.1 ± 0.169
0.973AlaCys: 0.973 ± 0.032
7.044AlaAsp: 7.044 ± 0.08
8.257AlaGlu: 8.257 ± 0.093
4.621AlaPhe: 4.621 ± 0.063
11.176AlaGly: 11.176 ± 0.108
2.271AlaHis: 2.271 ± 0.039
6.471AlaIle: 6.471 ± 0.074
3.937AlaLys: 3.937 ± 0.068
15.12AlaLeu: 15.12 ± 0.153
3.703AlaMet: 3.703 ± 0.044
2.673AlaAsn: 2.673 ± 0.056
6.006AlaPro: 6.006 ± 0.068
3.785AlaGln: 3.785 ± 0.062
9.544AlaArg: 9.544 ± 0.096
7.371AlaSer: 7.371 ± 0.083
5.787AlaThr: 5.787 ± 0.083
8.5AlaVal: 8.5 ± 0.08
1.291AlaTrp: 1.291 ± 0.032
2.494AlaTyr: 2.494 ± 0.047
0.0AlaXaa: 0.0 ± 0.0
Cys
0.831CysAla: 0.831 ± 0.024
0.084CysCys: 0.084 ± 0.008
0.52CysAsp: 0.52 ± 0.019
0.464CysGlu: 0.464 ± 0.018
0.305CysPhe: 0.305 ± 0.015
0.852CysGly: 0.852 ± 0.027
0.208CysHis: 0.208 ± 0.012
0.354CysIle: 0.354 ± 0.018
0.15CysLys: 0.15 ± 0.01
0.832CysLeu: 0.832 ± 0.025
0.119CysMet: 0.119 ± 0.01
0.165CysAsn: 0.165 ± 0.011
0.371CysPro: 0.371 ± 0.017
0.191CysGln: 0.191 ± 0.011
0.636CysArg: 0.636 ± 0.022
0.383CysSer: 0.383 ± 0.017
0.331CysThr: 0.331 ± 0.015
0.574CysVal: 0.574 ± 0.021
0.11CysTrp: 0.11 ± 0.01
0.15CysTyr: 0.15 ± 0.011
0.0CysXaa: 0.0 ± 0.0
Asp
7.233AspAla: 7.233 ± 0.09
0.424AspCys: 0.424 ± 0.021
2.831AspAsp: 2.831 ± 0.058
3.736AspGlu: 3.736 ± 0.062
2.063AspPhe: 2.063 ± 0.042
5.267AspGly: 5.267 ± 0.059
1.189AspHis: 1.189 ± 0.039
2.858AspIle: 2.858 ± 0.047
1.51AspLys: 1.51 ± 0.038
6.405AspLeu: 6.405 ± 0.089
1.315AspMet: 1.315 ± 0.032
1.09AspAsn: 1.09 ± 0.029
3.461AspPro: 3.461 ± 0.054
1.565AspGln: 1.565 ± 0.035
4.416AspArg: 4.416 ± 0.061
1.869AspSer: 1.869 ± 0.043
2.586AspThr: 2.586 ± 0.051
4.077AspVal: 4.077 ± 0.052
0.939AspTrp: 0.939 ± 0.028
1.319AspTyr: 1.319 ± 0.03
0.0AspXaa: 0.0 ± 0.0
Glu
9.072GluAla: 9.072 ± 0.086
0.321GluCys: 0.321 ± 0.016
2.878GluAsp: 2.878 ± 0.055
3.215GluGlu: 3.215 ± 0.064
1.737GluPhe: 1.737 ± 0.034
4.898GluGly: 4.898 ± 0.06
1.108GluHis: 1.108 ± 0.024
3.501GluIle: 3.501 ± 0.055
2.041GluLys: 2.041 ± 0.047
5.043GluLeu: 5.043 ± 0.067
1.475GluMet: 1.475 ± 0.033
1.504GluAsn: 1.504 ± 0.035
3.042GluPro: 3.042 ± 0.053
1.688GluGln: 1.688 ± 0.035
5.89GluArg: 5.89 ± 0.087
2.417GluSer: 2.417 ± 0.039
3.908GluThr: 3.908 ± 0.058
3.757GluVal: 3.757 ± 0.058
0.68GluTrp: 0.68 ± 0.024
0.776GluTyr: 0.776 ± 0.028
0.0GluXaa: 0.0 ± 0.0
Phe
4.892PheAla: 4.892 ± 0.066
0.378PheCys: 0.378 ± 0.017
2.65PheAsp: 2.65 ± 0.047
2.414PheGlu: 2.414 ± 0.049
1.489PhePhe: 1.489 ± 0.044
3.826PheGly: 3.826 ± 0.063
0.789PheHis: 0.789 ± 0.026
1.527PheIle: 1.527 ± 0.038
0.88PheLys: 0.88 ± 0.028
3.743PheLeu: 3.743 ± 0.066
0.749PheMet: 0.749 ± 0.023
0.923PheAsn: 0.923 ± 0.029
1.641PhePro: 1.641 ± 0.032
1.129PheGln: 1.129 ± 0.029
2.531PheArg: 2.531 ± 0.046
2.268PheSer: 2.268 ± 0.039
1.806PheThr: 1.806 ± 0.035
3.125PheVal: 3.125 ± 0.053
0.54PheTrp: 0.54 ± 0.024
0.855PheTyr: 0.855 ± 0.028
0.0PheXaa: 0.0 ± 0.0
Gly
10.133GlyAla: 10.133 ± 0.109
0.784GlyCys: 0.784 ± 0.025
4.344GlyAsp: 4.344 ± 0.072
5.512GlyGlu: 5.512 ± 0.071
3.994GlyPhe: 3.994 ± 0.059
7.568GlyGly: 7.568 ± 0.107
1.962GlyHis: 1.962 ± 0.041
4.361GlyIle: 4.361 ± 0.06
2.616GlyLys: 2.616 ± 0.055
9.637GlyLeu: 9.637 ± 0.108
2.107GlyMet: 2.107 ± 0.042
1.961GlyAsn: 1.961 ± 0.055
3.593GlyPro: 3.593 ± 0.057
2.954GlyGln: 2.954 ± 0.055
6.906GlyArg: 6.906 ± 0.076
4.941GlySer: 4.941 ± 0.084
4.809GlyThr: 4.809 ± 0.068
6.21GlyVal: 6.21 ± 0.074
1.311GlyTrp: 1.311 ± 0.034
2.051GlyTyr: 2.051 ± 0.042
0.0GlyXaa: 0.0 ± 0.0
His
2.172HisAla: 2.172 ± 0.041
0.2HisCys: 0.2 ± 0.013
1.16HisAsp: 1.16 ± 0.034
1.081HisGlu: 1.081 ± 0.031
0.832HisPhe: 0.832 ± 0.028
1.871HisGly: 1.871 ± 0.043
0.561HisHis: 0.561 ± 0.029
0.812HisIle: 0.812 ± 0.027
0.451HisLys: 0.451 ± 0.016
2.135HisLeu: 2.135 ± 0.045
0.483HisMet: 0.483 ± 0.021
0.36HisAsn: 0.36 ± 0.017
1.337HisPro: 1.337 ± 0.036
0.563HisGln: 0.563 ± 0.021
1.491HisArg: 1.491 ± 0.034
0.961HisSer: 0.961 ± 0.028
0.721HisThr: 0.721 ± 0.022
1.574HisVal: 1.574 ± 0.035
0.303HisTrp: 0.303 ± 0.017
0.497HisTyr: 0.497 ± 0.022
0.0HisXaa: 0.0 ± 0.0
Ile
6.958IleAla: 6.958 ± 0.073
0.447IleCys: 0.447 ± 0.02
3.364IleAsp: 3.364 ± 0.052
3.454IleGlu: 3.454 ± 0.054
1.623IlePhe: 1.623 ± 0.042
5.059IleGly: 5.059 ± 0.072
0.913IleHis: 0.913 ± 0.026
1.801IleIle: 1.801 ± 0.044
1.096IleLys: 1.096 ± 0.032
4.88IleLeu: 4.88 ± 0.063
0.841IleMet: 0.841 ± 0.024
1.111IleAsn: 1.111 ± 0.028
2.223IlePro: 2.223 ± 0.043
1.299IleGln: 1.299 ± 0.032
3.414IleArg: 3.414 ± 0.048
2.615IleSer: 2.615 ± 0.041
2.163IleThr: 2.163 ± 0.042
4.241IleVal: 4.241 ± 0.061
0.537IleTrp: 0.537 ± 0.018
1.032IleTyr: 1.032 ± 0.027
0.0IleXaa: 0.0 ± 0.0
Lys
4.076LysAla: 4.076 ± 0.069
0.119LysCys: 0.119 ± 0.01
1.576LysAsp: 1.576 ± 0.042
1.279LysGlu: 1.279 ± 0.04
0.667LysPhe: 0.667 ± 0.024
2.716LysGly: 2.716 ± 0.048
0.414LysHis: 0.414 ± 0.017
1.383LysIle: 1.383 ± 0.033
0.922LysLys: 0.922 ± 0.03
3.02LysLeu: 3.02 ± 0.047
0.622LysMet: 0.622 ± 0.02
0.694LysAsn: 0.694 ± 0.025
2.002LysPro: 2.002 ± 0.038
0.732LysGln: 0.732 ± 0.025
2.365LysArg: 2.365 ± 0.044
1.664LysSer: 1.664 ± 0.038
1.731LysThr: 1.731 ± 0.04
2.091LysVal: 2.091 ± 0.044
0.279LysTrp: 0.279 ± 0.015
0.425LysTyr: 0.425 ± 0.018
0.0LysXaa: 0.0 ± 0.0
Leu
14.468LeuAla: 14.468 ± 0.126
0.979LeuCys: 0.979 ± 0.029
6.269LeuAsp: 6.269 ± 0.067
5.27LeuGlu: 5.27 ± 0.069
4.018LeuPhe: 4.018 ± 0.068
8.889LeuGly: 8.889 ± 0.096
1.847LeuHis: 1.847 ± 0.038
4.472LeuIle: 4.472 ± 0.06
3.336LeuLys: 3.336 ± 0.049
10.146LeuLeu: 10.146 ± 0.141
2.271LeuMet: 2.271 ± 0.042
2.474LeuAsn: 2.474 ± 0.048
5.868LeuPro: 5.868 ± 0.073
2.912LeuGln: 2.912 ± 0.049
7.694LeuArg: 7.694 ± 0.1
7.453LeuSer: 7.453 ± 0.091
5.751LeuThr: 5.751 ± 0.068
8.176LeuVal: 8.176 ± 0.1
1.16LeuTrp: 1.16 ± 0.033
2.044LeuTyr: 2.044 ± 0.035
0.0LeuXaa: 0.0 ± 0.0
Met
3.173MetAla: 3.173 ± 0.049
0.107MetCys: 0.107 ± 0.008
1.182MetAsp: 1.182 ± 0.028
1.153MetGlu: 1.153 ± 0.03
0.612MetPhe: 0.612 ± 0.02
1.835MetGly: 1.835 ± 0.037
0.328MetHis: 0.328 ± 0.015
1.233MetIle: 1.233 ± 0.031
0.907MetLys: 0.907 ± 0.027
2.219MetLeu: 2.219 ± 0.046
0.635MetMet: 0.635 ± 0.023
0.692MetAsn: 0.692 ± 0.021
1.375MetPro: 1.375 ± 0.035
0.665MetGln: 0.665 ± 0.022
1.93MetArg: 1.93 ± 0.042
1.736MetSer: 1.736 ± 0.039
1.772MetThr: 1.772 ± 0.035
1.706MetVal: 1.706 ± 0.036
0.159MetTrp: 0.159 ± 0.012
0.24MetTyr: 0.24 ± 0.014
0.0MetXaa: 0.0 ± 0.0
Asn
3.007AsnAla: 3.007 ± 0.051
0.19AsnCys: 0.19 ± 0.012
1.279AsnAsp: 1.279 ± 0.041
1.288AsnGlu: 1.288 ± 0.03
0.894AsnPhe: 0.894 ± 0.027
2.362AsnGly: 2.362 ± 0.054
0.438AsnHis: 0.438 ± 0.02
1.123AsnIle: 1.123 ± 0.03
0.552AsnLys: 0.552 ± 0.02
2.321AsnLeu: 2.321 ± 0.044
0.551AsnMet: 0.551 ± 0.02
0.59AsnAsn: 0.59 ± 0.024
1.577AsnPro: 1.577 ± 0.036
0.709AsnGln: 0.709 ± 0.025
1.709AsnArg: 1.709 ± 0.039
1.102AsnSer: 1.102 ± 0.031
1.122AsnThr: 1.122 ± 0.036
1.722AsnVal: 1.722 ± 0.047
0.379AsnTrp: 0.379 ± 0.018
0.58AsnTyr: 0.58 ± 0.025
0.0AsnXaa: 0.0 ± 0.0
Pro
6.8ProAla: 6.8 ± 0.083
0.278ProCys: 0.278 ± 0.015
3.575ProAsp: 3.575 ± 0.049
3.575ProGlu: 3.575 ± 0.055
2.208ProPhe: 2.208 ± 0.04
4.524ProGly: 4.524 ± 0.06
1.061ProHis: 1.061 ± 0.032
2.487ProIle: 2.487 ± 0.039
1.554ProLys: 1.554 ± 0.035
5.036ProLeu: 5.036 ± 0.068
1.176ProMet: 1.176 ± 0.028
1.382ProAsn: 1.382 ± 0.031
2.442ProPro: 2.442 ± 0.055
1.534ProGln: 1.534 ± 0.036
3.094ProArg: 3.094 ± 0.053
3.136ProSer: 3.136 ± 0.049
2.513ProThr: 2.513 ± 0.047
4.212ProVal: 4.212 ± 0.056
0.605ProTrp: 0.605 ± 0.024
1.122ProTyr: 1.122 ± 0.028
0.0ProXaa: 0.0 ± 0.0
Gln
4.07GlnAla: 4.07 ± 0.069
0.159GlnCys: 0.159 ± 0.011
1.431GlnAsp: 1.431 ± 0.03
1.349GlnGlu: 1.349 ± 0.033
0.968GlnPhe: 0.968 ± 0.025
2.327GlnGly: 2.327 ± 0.048
0.534GlnHis: 0.534 ± 0.019
1.781GlnIle: 1.781 ± 0.036
1.003GlnLys: 1.003 ± 0.031
2.672GlnLeu: 2.672 ± 0.051
0.785GlnMet: 0.785 ± 0.027
0.799GlnAsn: 0.799 ± 0.029
1.838GlnPro: 1.838 ± 0.044
0.984GlnGln: 0.984 ± 0.041
2.51GlnArg: 2.51 ± 0.045
1.849GlnSer: 1.849 ± 0.044
1.835GlnThr: 1.835 ± 0.037
2.038GlnVal: 2.038 ± 0.042
0.373GlnTrp: 0.373 ± 0.017
0.525GlnTyr: 0.525 ± 0.019
0.0GlnXaa: 0.0 ± 0.0
Arg
8.753ArgAla: 8.753 ± 0.097
0.495ArgCys: 0.495 ± 0.019
4.135ArgAsp: 4.135 ± 0.058
4.83ArgGlu: 4.83 ± 0.069
3.521ArgPhe: 3.521 ± 0.057
5.042ArgGly: 5.042 ± 0.057
1.736ArgHis: 1.736 ± 0.039
4.245ArgIle: 4.245 ± 0.06
1.973ArgLys: 1.973 ± 0.038
8.92ArgLeu: 8.92 ± 0.099
1.96ArgMet: 1.96 ± 0.034
1.743ArgAsn: 1.743 ± 0.035
3.897ArgPro: 3.897 ± 0.068
2.743ArgGln: 2.743 ± 0.046
6.624ArgArg: 6.624 ± 0.082
4.133ArgSer: 4.133 ± 0.063
3.597ArgThr: 3.597 ± 0.057
5.074ArgVal: 5.074 ± 0.062
1.051ArgTrp: 1.051 ± 0.031
1.633ArgTyr: 1.633 ± 0.034
0.0ArgXaa: 0.0 ± 0.0
Ser
6.898SerAla: 6.898 ± 0.069
0.396SerCys: 0.396 ± 0.016
3.159SerAsp: 3.159 ± 0.049
3.087SerGlu: 3.087 ± 0.052
2.621SerPhe: 2.621 ± 0.04
5.865SerGly: 5.865 ± 0.075
1.081SerHis: 1.081 ± 0.028
2.812SerIle: 2.812 ± 0.05
1.411SerLys: 1.411 ± 0.041
6.064SerLeu: 6.064 ± 0.072
1.285SerMet: 1.285 ± 0.032
1.339SerAsn: 1.339 ± 0.031
2.947SerPro: 2.947 ± 0.044
1.706SerGln: 1.706 ± 0.038
3.945SerArg: 3.945 ± 0.059
3.047SerSer: 3.047 ± 0.06
2.72SerThr: 2.72 ± 0.062
4.279SerVal: 4.279 ± 0.06
0.723SerTrp: 0.723 ± 0.026
1.237SerTyr: 1.237 ± 0.03
0.0SerXaa: 0.0 ± 0.0
Thr
5.749ThrAla: 5.749 ± 0.08
0.381ThrCys: 0.381 ± 0.017
2.738ThrAsp: 2.738 ± 0.052
2.528ThrGlu: 2.528 ± 0.04
1.91ThrPhe: 1.91 ± 0.038
4.948ThrGly: 4.948 ± 0.074
1.078ThrHis: 1.078 ± 0.03
2.974ThrIle: 2.974 ± 0.051
1.44ThrLys: 1.44 ± 0.036
6.153ThrLeu: 6.153 ± 0.071
1.124ThrMet: 1.124 ± 0.031
1.213ThrAsn: 1.213 ± 0.034
3.133ThrPro: 3.133 ± 0.048
1.56ThrGln: 1.56 ± 0.039
3.371ThrArg: 3.371 ± 0.051
2.961ThrSer: 2.961 ± 0.054
2.56ThrThr: 2.56 ± 0.05
4.213ThrVal: 4.213 ± 0.066
0.58ThrTrp: 0.58 ± 0.024
1.202ThrTyr: 1.202 ± 0.029
0.0ThrXaa: 0.0 ± 0.0
Val
9.118ValAla: 9.118 ± 0.087
0.582ValCys: 0.582 ± 0.02
4.119ValAsp: 4.119 ± 0.061
4.786ValGlu: 4.786 ± 0.066
2.791ValPhe: 2.791 ± 0.05
5.806ValGly: 5.806 ± 0.073
1.322ValHis: 1.322 ± 0.032
3.525ValIle: 3.525 ± 0.053
2.058ValLys: 2.058 ± 0.046
7.534ValLeu: 7.534 ± 0.073
1.728ValMet: 1.728 ± 0.038
1.871ValAsn: 1.871 ± 0.036
3.863ValPro: 3.863 ± 0.049
1.976ValGln: 1.976 ± 0.038
5.124ValArg: 5.124 ± 0.068
4.886ValSer: 4.886 ± 0.071
4.373ValThr: 4.373 ± 0.06
5.882ValVal: 5.882 ± 0.073
0.907ValTrp: 0.907 ± 0.026
1.417ValTyr: 1.417 ± 0.03
0.0ValXaa: 0.0 ± 0.0
Trp
1.167TrpAla: 1.167 ± 0.038
0.133TrpCys: 0.133 ± 0.011
0.583TrpAsp: 0.583 ± 0.021
0.499TrpGlu: 0.499 ± 0.021
0.5TrpPhe: 0.5 ± 0.021
0.85TrpGly: 0.85 ± 0.027
0.302TrpHis: 0.302 ± 0.015
0.572TrpIle: 0.572 ± 0.02
0.415TrpLys: 0.415 ± 0.019
1.505TrpLeu: 1.505 ± 0.037
0.368TrpMet: 0.368 ± 0.018
0.415TrpAsn: 0.415 ± 0.018
0.666TrpPro: 0.666 ± 0.023
0.472TrpGln: 0.472 ± 0.02
1.18TrpArg: 1.18 ± 0.031
0.796TrpSer: 0.796 ± 0.026
0.794TrpThr: 0.794 ± 0.026
0.75TrpVal: 0.75 ± 0.027
0.207TrpTrp: 0.207 ± 0.013
0.265TrpTyr: 0.265 ± 0.013
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.455TyrAla: 2.455 ± 0.045
0.202TyrCys: 0.202 ± 0.014
1.315TyrAsp: 1.315 ± 0.032
1.173TyrGlu: 1.173 ± 0.033
0.781TyrPhe: 0.781 ± 0.024
2.071TyrGly: 2.071 ± 0.04
0.425TyrHis: 0.425 ± 0.019
0.778TyrIle: 0.778 ± 0.024
0.525TyrLys: 0.525 ± 0.021
2.112TyrLeu: 2.112 ± 0.041
0.362TyrMet: 0.362 ± 0.017
0.496TyrAsn: 0.496 ± 0.021
1.03TyrPro: 1.03 ± 0.027
0.662TyrGln: 0.662 ± 0.021
1.733TyrArg: 1.733 ± 0.035
1.035TyrSer: 1.035 ± 0.029
0.933TyrThr: 0.933 ± 0.026
1.469TyrVal: 1.469 ± 0.036
0.308TyrTrp: 0.308 ± 0.015
0.508TyrTyr: 0.508 ± 0.021
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4329 proteins (1349042 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski