Amino acid dipeptide frequency for Aurantimicrobium minutum

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
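The comparison described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the database's own code: it assumes that dipeptides are counted as overlapping residue pairs within each protein, that any non-standard letter is mapped to Xaa, and that the baseline is the uniform 1/400 expectation stated in the text. The names `dipeptide_permille`, `classify` and the toy sequences are hypothetical.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # 20 standard residues (one-letter codes)
EXPECTED_PERMILLE = 1000.0 / 400         # uniform baseline: 0.25% = 2.5 per mille


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over all sequences.

    Assumption: overlapping pairs are counted, and a pair never spans
    two proteins. Non-standard letters are mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in sequences:
        residues = [r if r in STANDARD else "X" for r in seq.upper()]
        counts.update(a + b for a, b in zip(residues, residues[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(value_permille, expected=EXPECTED_PERMILLE):
    """Label a per mille value relative to the uniform expectation."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "as expected"


# Hypothetical toy input, not real proteome data.
toy = ["MKALAAG", "MAAL"]
freqs = dipeptide_permille(toy)
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3), classify(value))
```

Applied to values from the table, `classify(13.649)` labels AlaAla as overrepresented (about 5.5 times the uniform expectation), while `classify(0.048)` labels CysCys as underrepresented.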

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions (for example, 13.649‰ = 0.013649).

For more information, see the sequence space article on Wikipedia.

first \ second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 13.649 ± 0.215 | 0.564 ± 0.033 | 5.963 ± 0.127 | 7.356 ± 0.152 | 3.792 ± 0.089 | 9.769 ± 0.149 | 2.217 ± 0.073 | 6.36 ± 0.12 | 4.441 ± 0.155 | 11.779 ± 0.181 | 2.453 ± 0.084 | 3.207 ± 0.085 | 4.729 ± 0.117 | 4.411 ± 0.099 | 5.9 ± 0.115 | 6.757 ± 0.126 | 6.777 ± 0.131 | 9.096 ± 0.152 | 1.419 ± 0.057 | 2.417 ± 0.066 | 0.0 ± 0.0
Cys | 0.645 ± 0.033 | 0.048 ± 0.01 | 0.331 ± 0.025 | 0.292 ± 0.025 | 0.167 ± 0.019 | 0.639 ± 0.036 | 0.101 ± 0.014 | 0.23 ± 0.019 | 0.121 ± 0.015 | 0.411 ± 0.031 | 0.093 ± 0.013 | 0.173 ± 0.02 | 0.298 ± 0.028 | 0.135 ± 0.018 | 0.25 ± 0.022 | 0.403 ± 0.033 | 0.371 ± 0.029 | 0.458 ± 0.037 | 0.064 ± 0.011 | 0.125 ± 0.016 | 0.0 ± 0.0
Asp | 6.366 ± 0.128 | 0.323 ± 0.025 | 2.778 ± 0.1 | 3.836 ± 0.102 | 2.101 ± 0.07 | 4.401 ± 0.107 | 1.135 ± 0.052 | 3.163 ± 0.088 | 1.933 ± 0.068 | 5.495 ± 0.113 | 1.006 ± 0.04 | 1.506 ± 0.057 | 3.137 ± 0.087 | 1.649 ± 0.064 | 2.905 ± 0.097 | 3.09 ± 0.078 | 2.679 ± 0.071 | 5.128 ± 0.077 | 0.804 ± 0.039 | 1.423 ± 0.059 | 0.0 ± 0.0
Glu | 6.884 ± 0.153 | 0.314 ± 0.025 | 3.022 ± 0.079 | 4.112 ± 0.11 | 2.163 ± 0.075 | 4.27 ± 0.094 | 1.508 ± 0.056 | 3.788 ± 0.101 | 2.677 ± 0.079 | 7.529 ± 0.136 | 1.175 ± 0.05 | 2.131 ± 0.059 | 2.542 ± 0.07 | 2.536 ± 0.06 | 3.945 ± 0.118 | 3.248 ± 0.079 | 3.407 ± 0.082 | 5.052 ± 0.116 | 0.79 ± 0.042 | 1.492 ± 0.054 | 0.0 ± 0.0
Phe | 3.987 ± 0.086 | 0.21 ± 0.023 | 2.419 ± 0.074 | 2.145 ± 0.061 | 1.351 ± 0.059 | 3.498 ± 0.089 | 0.605 ± 0.036 | 1.947 ± 0.062 | 0.941 ± 0.045 | 3.157 ± 0.091 | 0.661 ± 0.037 | 1.147 ± 0.053 | 1.54 ± 0.056 | 0.867 ± 0.043 | 1.554 ± 0.06 | 2.371 ± 0.074 | 2.52 ± 0.07 | 3.086 ± 0.088 | 0.538 ± 0.032 | 0.839 ± 0.039 | 0.0 ± 0.0
Gly | 8.182 ± 0.145 | 0.514 ± 0.032 | 4.122 ± 0.085 | 4.731 ± 0.097 | 3.441 ± 0.086 | 6.41 ± 0.157 | 1.792 ± 0.061 | 5.519 ± 0.124 | 3.612 ± 0.09 | 8.017 ± 0.155 | 1.931 ± 0.064 | 2.469 ± 0.072 | 3.195 ± 0.078 | 2.732 ± 0.077 | 4.272 ± 0.084 | 5.322 ± 0.131 | 5.126 ± 0.116 | 7.056 ± 0.121 | 1.266 ± 0.049 | 2.29 ± 0.062 | 0.0 ± 0.0
His | 2.0 ± 0.073 | 0.079 ± 0.013 | 1.238 ± 0.053 | 1.284 ± 0.065 | 0.633 ± 0.037 | 1.669 ± 0.064 | 0.51 ± 0.035 | 1.036 ± 0.05 | 0.635 ± 0.038 | 1.806 ± 0.054 | 0.437 ± 0.029 | 0.651 ± 0.038 | 1.27 ± 0.053 | 0.599 ± 0.036 | 1.173 ± 0.052 | 1.234 ± 0.05 | 1.08 ± 0.045 | 1.611 ± 0.059 | 0.306 ± 0.026 | 0.494 ± 0.032 | 0.0 ± 0.0
Ile | 7.164 ± 0.132 | 0.312 ± 0.023 | 3.798 ± 0.085 | 3.766 ± 0.091 | 1.788 ± 0.06 | 4.572 ± 0.099 | 1.026 ± 0.041 | 2.961 ± 0.087 | 1.764 ± 0.068 | 4.812 ± 0.096 | 1.03 ± 0.046 | 1.818 ± 0.061 | 3.24 ± 0.08 | 1.522 ± 0.055 | 2.859 ± 0.073 | 3.885 ± 0.089 | 4.038 ± 0.089 | 5.007 ± 0.119 | 0.647 ± 0.039 | 1.169 ± 0.047 | 0.0 ± 0.0
Lys | 4.159 ± 0.104 | 0.157 ± 0.015 | 1.939 ± 0.054 | 2.056 ± 0.071 | 1.085 ± 0.041 | 2.357 ± 0.076 | 0.768 ± 0.042 | 1.949 ± 0.055 | 2.246 ± 0.096 | 3.562 ± 0.096 | 0.945 ± 0.053 | 1.324 ± 0.053 | 2.161 ± 0.084 | 1.266 ± 0.048 | 2.238 ± 0.073 | 2.226 ± 0.071 | 2.236 ± 0.068 | 2.975 ± 0.074 | 0.468 ± 0.034 | 0.879 ± 0.044 | 0.0 ± 0.0
Leu | 12.281 ± 0.187 | 0.486 ± 0.032 | 5.719 ± 0.11 | 6.084 ± 0.114 | 3.048 ± 0.085 | 8.374 ± 0.146 | 1.748 ± 0.052 | 5.711 ± 0.121 | 3.217 ± 0.081 | 8.938 ± 0.189 | 1.992 ± 0.068 | 2.975 ± 0.082 | 4.884 ± 0.109 | 2.679 ± 0.078 | 5.511 ± 0.114 | 6.88 ± 0.137 | 6.388 ± 0.117 | 8.324 ± 0.138 | 1.214 ± 0.059 | 1.806 ± 0.062 | 0.0 ± 0.0
Met | 2.546 ± 0.085 | 0.117 ± 0.015 | 1.0 ± 0.039 | 0.927 ± 0.049 | 0.697 ± 0.039 | 1.593 ± 0.062 | 0.363 ± 0.026 | 1.026 ± 0.048 | 0.78 ± 0.042 | 1.927 ± 0.06 | 0.433 ± 0.034 | 0.726 ± 0.039 | 1.149 ± 0.048 | 0.625 ± 0.038 | 1.137 ± 0.048 | 1.899 ± 0.063 | 1.703 ± 0.047 | 1.542 ± 0.056 | 0.29 ± 0.026 | 0.389 ± 0.027 | 0.0 ± 0.0
Asn | 3.3 ± 0.077 | 0.185 ± 0.021 | 1.633 ± 0.055 | 1.742 ± 0.065 | 1.145 ± 0.049 | 2.576 ± 0.08 | 0.532 ± 0.033 | 1.802 ± 0.059 | 1.252 ± 0.052 | 2.986 ± 0.081 | 0.687 ± 0.037 | 1.085 ± 0.057 | 2.242 ± 0.068 | 0.966 ± 0.043 | 1.544 ± 0.064 | 1.889 ± 0.074 | 1.917 ± 0.073 | 2.524 ± 0.076 | 0.472 ± 0.034 | 0.827 ± 0.056 | 0.0 ± 0.0
Pro | 5.663 ± 0.131 | 0.222 ± 0.021 | 2.742 ± 0.078 | 4.253 ± 0.092 | 1.595 ± 0.058 | 4.235 ± 0.091 | 1.06 ± 0.044 | 2.405 ± 0.076 | 1.597 ± 0.065 | 4.362 ± 0.099 | 0.837 ± 0.045 | 1.443 ± 0.052 | 1.476 ± 0.06 | 1.758 ± 0.056 | 2.308 ± 0.072 | 3.056 ± 0.077 | 3.161 ± 0.099 | 4.29 ± 0.095 | 0.697 ± 0.044 | 1.095 ± 0.043 | 0.0 ± 0.0
Gln | 4.016 ± 0.094 | 0.151 ± 0.016 | 1.534 ± 0.056 | 1.861 ± 0.062 | 1.074 ± 0.044 | 2.502 ± 0.069 | 0.637 ± 0.032 | 1.812 ± 0.061 | 1.222 ± 0.053 | 3.709 ± 0.09 | 0.718 ± 0.042 | 0.949 ± 0.049 | 1.474 ± 0.061 | 1.385 ± 0.06 | 2.07 ± 0.07 | 1.75 ± 0.065 | 1.877 ± 0.066 | 2.687 ± 0.073 | 0.55 ± 0.036 | 0.687 ± 0.037 | 0.0 ± 0.0
Arg | 5.55 ± 0.121 | 0.278 ± 0.024 | 3.074 ± 0.082 | 3.844 ± 0.097 | 1.953 ± 0.07 | 3.899 ± 0.093 | 1.143 ± 0.049 | 3.346 ± 0.075 | 2.205 ± 0.06 | 5.265 ± 0.121 | 1.391 ± 0.06 | 1.76 ± 0.065 | 2.294 ± 0.086 | 1.778 ± 0.057 | 3.572 ± 0.1 | 3.223 ± 0.084 | 2.986 ± 0.087 | 4.453 ± 0.104 | 0.784 ± 0.036 | 1.349 ± 0.052 | 0.0 ± 0.0
Ser | 7.015 ± 0.129 | 0.323 ± 0.029 | 3.292 ± 0.083 | 3.677 ± 0.091 | 2.445 ± 0.076 | 5.888 ± 0.148 | 1.242 ± 0.053 | 3.381 ± 0.079 | 2.193 ± 0.071 | 6.193 ± 0.126 | 1.399 ± 0.05 | 2.016 ± 0.078 | 3.012 ± 0.082 | 2.139 ± 0.066 | 3.407 ± 0.085 | 4.677 ± 0.14 | 4.167 ± 0.126 | 5.134 ± 0.127 | 0.98 ± 0.047 | 1.5 ± 0.056 | 0.0 ± 0.0
Thr | 6.642 ± 0.116 | 0.341 ± 0.029 | 3.106 ± 0.065 | 3.578 ± 0.09 | 2.189 ± 0.081 | 5.721 ± 0.131 | 1.153 ± 0.051 | 3.544 ± 0.087 | 2.199 ± 0.07 | 5.87 ± 0.108 | 1.165 ± 0.055 | 1.929 ± 0.083 | 3.856 ± 0.108 | 2.038 ± 0.071 | 3.006 ± 0.075 | 3.997 ± 0.12 | 3.917 ± 0.117 | 5.618 ± 0.157 | 0.79 ± 0.038 | 1.365 ± 0.058 | 0.0 ± 0.0
Val | 9.293 ± 0.148 | 0.502 ± 0.03 | 5.092 ± 0.111 | 5.06 ± 0.109 | 3.056 ± 0.083 | 6.531 ± 0.138 | 1.586 ± 0.054 | 5.386 ± 0.121 | 2.758 ± 0.079 | 8.477 ± 0.14 | 1.756 ± 0.057 | 2.649 ± 0.078 | 4.098 ± 0.095 | 2.306 ± 0.077 | 4.348 ± 0.089 | 5.749 ± 0.123 | 5.755 ± 0.146 | 8.047 ± 0.162 | 0.905 ± 0.051 | 1.679 ± 0.061 | 0.0 ± 0.0
Trp | 1.351 ± 0.054 | 0.085 ± 0.013 | 0.689 ± 0.038 | 0.637 ± 0.036 | 0.645 ± 0.035 | 1.01 ± 0.048 | 0.294 ± 0.026 | 0.839 ± 0.046 | 0.504 ± 0.032 | 1.552 ± 0.062 | 0.369 ± 0.027 | 0.572 ± 0.036 | 0.554 ± 0.031 | 0.494 ± 0.031 | 0.81 ± 0.041 | 0.863 ± 0.039 | 0.704 ± 0.036 | 1.052 ± 0.046 | 0.294 ± 0.025 | 0.282 ± 0.026 | 0.0 ± 0.0
Tyr | 2.361 ± 0.075 | 0.143 ± 0.018 | 1.367 ± 0.051 | 1.345 ± 0.055 | 1.016 ± 0.046 | 2.082 ± 0.072 | 0.325 ± 0.026 | 0.974 ± 0.041 | 0.7 ± 0.042 | 2.413 ± 0.074 | 0.389 ± 0.028 | 0.78 ± 0.043 | 1.105 ± 0.051 | 0.78 ± 0.035 | 1.337 ± 0.058 | 1.562 ± 0.061 | 1.212 ± 0.058 | 1.891 ± 0.065 | 0.323 ± 0.025 | 0.506 ± 0.035 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 1574 proteins (503936 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
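A protein-level bootstrap of this kind could look roughly like the sketch below. It is an assumption-laden illustration rather than the procedure actually used by the database: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the standard deviation across resamples is taken as the error (the source does not state which spread measure is reported after the ± sign). The function names and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs within a single protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide's per mille frequency.

    Resampling is done at the protein level: each resample draws
    len(sequences) whole proteins with replacement.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # Counter yields 0 for absent pairs
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Hypothetical toy input, not real proteome data.
toy = ["MKALAAG", "MAAL", "MGGAAK", "MLLV"]
mean, sd = bootstrap_error(toy, "AA")
print(f"AA: {mean:.3f} ± {sd:.3f} per mille")
```

Resampling whole proteins rather than individual dipeptides keeps the pairs from one protein together, so the estimated spread reflects variation between proteins.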

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski