Amino acid dipeptide frequency for alpha proteobacterium HIMB114

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
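A minimal sketch of how such frequencies could be computed is given here. It is not the database's own code: the function name `dipeptide_per_mille` is hypothetical, and it assumes that dipeptides are counted with overlapping windows, never across protein boundaries, and that any non-standard letter is mapped to X (Xaa).

```python
from collections import Counter
from itertools import product

# One-letter codes for the 20 standard amino acids; "X" stands for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all sequences.

    Assumptions (not stated in the source): overlapping windows,
    no pairs spanning two proteins, non-standard letters become "X".
    """
    counts = Counter()
    for seq in proteins:
        cleaned = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }

# Toy input, not taken from the HIMB114 proteome.
freqs = dipeptide_per_mille(["MKKLLA", "MKIK"])
print(len(freqs))    # number of possible dipeptides, including Xaa
print(freqs["MK"])   # per mille frequency of Met-Lys in the toy input
```

Returning all 441 keys, including those with zero counts, mirrors the layout of the table, where unobserved dipeptides such as those containing Xaa are listed as 0.0.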

There are 400 possible dipeptides (441 if we count Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
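The uniform baseline can be illustrated with a short sketch. The `classify` helper is hypothetical, and it assumes that the red/blue marking uses exactly the uniform 1/400 baseline as its threshold, which the page does not state explicitly.

```python
# Uniform baseline: each of the 400 standard dipeptides equally likely.
EXPECTED_PER_MILLE = 1000.0 / 400   # 2.5 per mille, i.e. 0.25%

def classify(per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide frequency relative to the uniform baseline."""
    if per_mille > expected:
        return "over"    # assumed to correspond to red in the table
    if per_mille < expected:
        return "under"   # assumed to correspond to blue in the table
    return "expected"

# Values copied from the table (per mille).
observed = {"LysLys": 16.065, "AlaAla": 3.933, "CysCys": 0.156, "TrpTrp": 0.106}
for name, value in observed.items():
    # Print the label and the fold difference from the baseline.
    print(name, classify(value), f"{value / EXPECTED_PER_MILLE:.2f}x")
```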

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
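As a small illustration of the unit conversion, the following sketch turns a per mille value into a fraction and into percent. The helper names are hypothetical.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0

lys_lys = 16.065  # LysLys entry from the table, in per mille
print(per_mille_to_fraction(lys_lys))
print(per_mille_to_percent(lys_lys))
```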

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.933 ± 0.144
AlaCys: 0.641 ± 0.047
AlaAsp: 2.608 ± 0.092
AlaGlu: 2.975 ± 0.106
AlaPhe: 2.562 ± 0.093
AlaGly: 4.084 ± 0.111
AlaHis: 0.898 ± 0.055
AlaIle: 4.886 ± 0.128
AlaLys: 5.245 ± 0.12
AlaLeu: 5.318 ± 0.119
AlaMet: 1.436 ± 0.074
AlaAsn: 2.869 ± 0.086
AlaPro: 1.501 ± 0.07
AlaGln: 1.529 ± 0.067
AlaArg: 1.868 ± 0.066
AlaSer: 3.674 ± 0.103
AlaThr: 2.567 ± 0.084
AlaVal: 3.342 ± 0.124
AlaTrp: 0.531 ± 0.039
AlaTyr: 1.76 ± 0.07
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.639 ± 0.042
CysCys: 0.156 ± 0.02
CysAsp: 0.593 ± 0.044
CysGlu: 0.598 ± 0.041
CysPhe: 0.568 ± 0.034
CysGly: 0.885 ± 0.048
CysHis: 0.234 ± 0.025
CysIle: 0.82 ± 0.047
CysLys: 0.938 ± 0.045
CysLeu: 1.018 ± 0.05
CysMet: 0.161 ± 0.02
CysAsn: 0.586 ± 0.04
CysPro: 0.375 ± 0.029
CysGln: 0.231 ± 0.022
CysArg: 0.279 ± 0.026
CysSer: 0.724 ± 0.041
CysThr: 0.465 ± 0.032
CysVal: 0.616 ± 0.037
CysTrp: 0.078 ± 0.013
CysTyr: 0.337 ± 0.034
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.655 ± 0.083
AspCys: 0.521 ± 0.035
AspAsp: 2.53 ± 0.096
AspGlu: 3.651 ± 0.095
AspPhe: 3.098 ± 0.101
AspGly: 3.126 ± 0.097
AspHis: 0.986 ± 0.051
AspIle: 4.783 ± 0.124
AspLys: 5.182 ± 0.136
AspLeu: 5.283 ± 0.112
AspMet: 0.943 ± 0.052
AspAsn: 2.922 ± 0.092
AspPro: 1.841 ± 0.071
AspGln: 2.21 ± 0.079
AspArg: 1.738 ± 0.064
AspSer: 2.457 ± 0.094
AspThr: 2.344 ± 0.072
AspVal: 2.99 ± 0.103
AspTrp: 0.556 ± 0.04
AspTyr: 2.097 ± 0.075
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.286 ± 0.097
GluCys: 0.508 ± 0.033
GluAsp: 2.912 ± 0.088
GluGlu: 3.867 ± 0.126
GluPhe: 2.849 ± 0.087
GluGly: 2.879 ± 0.084
GluHis: 0.827 ± 0.039
GluIle: 6.583 ± 0.139
GluLys: 7.508 ± 0.151
GluLeu: 5.228 ± 0.128
GluMet: 1.295 ± 0.049
GluAsn: 4.536 ± 0.126
GluPro: 1.37 ± 0.069
GluGln: 1.7 ± 0.069
GluArg: 1.898 ± 0.085
GluSer: 2.874 ± 0.093
GluThr: 2.942 ± 0.086
GluVal: 3.568 ± 0.105
GluTrp: 0.556 ± 0.041
GluTyr: 1.753 ± 0.072
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.942 ± 0.098
PheCys: 0.732 ± 0.043
PheAsp: 3.078 ± 0.091
PheGlu: 3.234 ± 0.097
PhePhe: 3.794 ± 0.162
PheGly: 3.586 ± 0.102
PheHis: 0.774 ± 0.047
PheIle: 4.813 ± 0.132
PheLys: 5.379 ± 0.13
PheLeu: 5.985 ± 0.194
PheMet: 0.986 ± 0.056
PheAsn: 3.603 ± 0.106
PhePro: 1.531 ± 0.058
PheGln: 1.383 ± 0.056
PheArg: 1.509 ± 0.055
PheSer: 4.239 ± 0.119
PheThr: 2.487 ± 0.079
PheVal: 3.005 ± 0.096
PheTrp: 0.455 ± 0.039
PheTyr: 2.306 ± 0.087
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.832 ± 0.131
GlyCys: 0.79 ± 0.045
GlyAsp: 3.032 ± 0.09
GlyGlu: 3.136 ± 0.098
GlyPhe: 3.42 ± 0.098
GlyGly: 4.42 ± 0.126
GlyHis: 1.222 ± 0.059
GlyIle: 5.565 ± 0.117
GlyLys: 5.494 ± 0.134
GlyLeu: 5.776 ± 0.124
GlyMet: 1.567 ± 0.069
GlyAsn: 2.733 ± 0.096
GlyPro: 1.843 ± 0.064
GlyGln: 1.69 ± 0.082
GlyArg: 2.059 ± 0.073
GlySer: 4.395 ± 0.102
GlyThr: 3.39 ± 0.092
GlyVal: 3.885 ± 0.121
GlyTrp: 0.656 ± 0.041
GlyTyr: 2.391 ± 0.085
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.95 ± 0.05
HisCys: 0.231 ± 0.024
HisAsp: 0.852 ± 0.048
HisGlu: 0.847 ± 0.048
HisPhe: 0.797 ± 0.042
HisGly: 1.212 ± 0.063
HisHis: 0.38 ± 0.035
HisIle: 1.235 ± 0.058
HisLys: 1.333 ± 0.061
HisLeu: 1.564 ± 0.065
HisMet: 0.337 ± 0.03
HisAsn: 0.878 ± 0.05
HisPro: 0.84 ± 0.047
HisGln: 0.521 ± 0.033
HisArg: 0.518 ± 0.038
HisSer: 0.998 ± 0.053
HisThr: 0.664 ± 0.043
HisVal: 0.827 ± 0.044
HisTrp: 0.153 ± 0.019
HisTyr: 0.674 ± 0.042
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.104 ± 0.119
IleCys: 1.074 ± 0.058
IleAsp: 5.396 ± 0.119
IleGlu: 5.736 ± 0.137
IlePhe: 5.811 ± 0.172
IleGly: 5.71 ± 0.139
IleHis: 1.406 ± 0.054
IleIle: 8.831 ± 0.183
IleLys: 10.385 ± 0.191
IleLeu: 9.248 ± 0.181
IleMet: 1.634 ± 0.067
IleAsn: 6.774 ± 0.149
IlePro: 3.103 ± 0.1
IleGln: 2.449 ± 0.088
IleArg: 2.603 ± 0.089
IleSer: 7.174 ± 0.142
IleThr: 4.353 ± 0.113
IleVal: 4.868 ± 0.105
IleTrp: 0.624 ± 0.046
IleTyr: 3.236 ± 0.101
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.358 ± 0.131
LysCys: 0.739 ± 0.047
LysAsp: 5.771 ± 0.131
LysGlu: 6.724 ± 0.149
LysPhe: 5.308 ± 0.132
LysGly: 4.717 ± 0.124
LysHis: 1.315 ± 0.051
LysIle: 12.479 ± 0.248
LysLys: 16.065 ± 0.358
LysLeu: 8.745 ± 0.171
LysMet: 2.147 ± 0.073
LysAsn: 10.599 ± 0.202
LysPro: 2.665 ± 0.084
LysGln: 2.731 ± 0.085
LysArg: 3.176 ± 0.079
LysSer: 6.545 ± 0.142
LysThr: 4.931 ± 0.106
LysVal: 5.645 ± 0.129
LysTrp: 0.641 ± 0.042
LysTyr: 3.787 ± 0.112
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.293 ± 0.116
LeuCys: 0.885 ± 0.048
LeuAsp: 5.162 ± 0.116
LeuGlu: 5.62 ± 0.127
LeuPhe: 4.898 ± 0.163
LeuGly: 5.879 ± 0.119
LeuHis: 1.272 ± 0.058
LeuIle: 9.314 ± 0.213
LeuLys: 10.868 ± 0.178
LeuLeu: 7.782 ± 0.174
LeuMet: 1.971 ± 0.074
LeuAsn: 6.751 ± 0.14
LeuPro: 3.254 ± 0.116
LeuGln: 2.188 ± 0.072
LeuArg: 3.098 ± 0.1
LeuSer: 7.129 ± 0.135
LeuThr: 4.39 ± 0.126
LeuVal: 5.464 ± 0.124
LeuTrp: 0.656 ± 0.045
LeuTyr: 2.572 ± 0.073
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.375 ± 0.068
MetCys: 0.176 ± 0.019
MetAsp: 0.835 ± 0.05
MetGlu: 0.933 ± 0.051
MetPhe: 0.913 ± 0.05
MetGly: 1.418 ± 0.066
MetHis: 0.339 ± 0.03
MetIle: 1.858 ± 0.076
MetLys: 2.18 ± 0.082
MetLeu: 1.768 ± 0.085
MetMet: 0.553 ± 0.042
MetAsn: 1.277 ± 0.059
MetPro: 0.961 ± 0.045
MetGln: 0.666 ± 0.045
MetArg: 0.689 ± 0.05
MetSer: 1.858 ± 0.069
MetThr: 1.081 ± 0.06
MetVal: 1.134 ± 0.055
MetTrp: 0.156 ± 0.022
MetTyr: 0.548 ± 0.033
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.819 ± 0.08
AsnCys: 0.651 ± 0.041
AsnAsp: 3.083 ± 0.104
AsnGlu: 3.842 ± 0.105
AsnPhe: 4.647 ± 0.122
AsnGly: 2.897 ± 0.084
AsnHis: 1.064 ± 0.051
AsnIle: 7.0 ± 0.153
AsnLys: 7.873 ± 0.174
AsnLeu: 7.445 ± 0.157
AsnMet: 1.232 ± 0.056
AsnAsn: 4.853 ± 0.139
AsnPro: 2.235 ± 0.078
AsnGln: 2.361 ± 0.084
AsnArg: 1.795 ± 0.079
AsnSer: 4.262 ± 0.112
AsnThr: 2.542 ± 0.08
AsnVal: 3.058 ± 0.094
AsnTrp: 0.49 ± 0.032
AsnTyr: 2.829 ± 0.092
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.577 ± 0.076
ProCys: 0.287 ± 0.027
ProAsp: 1.818 ± 0.069
ProGlu: 2.215 ± 0.08
ProPhe: 1.765 ± 0.072
ProGly: 2.213 ± 0.081
ProHis: 0.551 ± 0.04
ProIle: 2.834 ± 0.087
ProLys: 2.972 ± 0.098
ProLeu: 2.665 ± 0.089
ProMet: 0.659 ± 0.037
ProAsn: 1.919 ± 0.072
ProPro: 0.847 ± 0.052
ProGln: 0.82 ± 0.046
ProArg: 0.988 ± 0.049
ProSer: 2.233 ± 0.07
ProThr: 1.554 ± 0.063
ProVal: 2.21 ± 0.086
ProTrp: 0.352 ± 0.033
ProTyr: 0.998 ± 0.051
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.569 ± 0.061
GlnCys: 0.272 ± 0.026
GlnAsp: 1.365 ± 0.057
GlnGlu: 1.529 ± 0.07
GlnPhe: 1.396 ± 0.061
GlnGly: 1.506 ± 0.062
GlnHis: 0.397 ± 0.034
GlnIle: 3.02 ± 0.09
GlnLys: 3.281 ± 0.095
GlnLeu: 2.469 ± 0.078
GlnMet: 0.586 ± 0.039
GlnAsn: 2.13 ± 0.07
GlnPro: 0.797 ± 0.042
GlnGln: 0.84 ± 0.059
GlnArg: 1.054 ± 0.053
GlnSer: 2.087 ± 0.078
GlnThr: 1.378 ± 0.063
GlnVal: 1.685 ± 0.065
GlnTrp: 0.206 ± 0.021
GlnTyr: 0.822 ± 0.043
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.76 ± 0.072
ArgCys: 0.332 ± 0.027
ArgAsp: 1.667 ± 0.056
ArgGlu: 1.959 ± 0.076
ArgPhe: 1.574 ± 0.07
ArgGly: 2.039 ± 0.08
ArgHis: 0.571 ± 0.041
ArgIle: 2.685 ± 0.083
ArgLys: 2.955 ± 0.088
ArgLeu: 2.899 ± 0.085
ArgMet: 0.707 ± 0.045
ArgAsn: 1.873 ± 0.075
ArgPro: 1.046 ± 0.046
ArgGln: 0.935 ± 0.053
ArgArg: 1.25 ± 0.066
ArgSer: 2.351 ± 0.089
ArgThr: 1.358 ± 0.069
ArgVal: 2.032 ± 0.073
ArgTrp: 0.292 ± 0.028
ArgTyr: 1.104 ± 0.049
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.676 ± 0.116
SerCys: 0.651 ± 0.041
SerAsp: 3.47 ± 0.097
SerGlu: 3.84 ± 0.117
SerPhe: 4.026 ± 0.105
SerGly: 4.669 ± 0.122
SerHis: 1.099 ± 0.054
SerIle: 6.191 ± 0.152
SerLys: 7.652 ± 0.157
SerLeu: 6.663 ± 0.135
SerMet: 1.501 ± 0.059
SerAsn: 4.247 ± 0.111
SerPro: 2.034 ± 0.07
SerGln: 1.921 ± 0.07
SerArg: 2.175 ± 0.075
SerSer: 4.981 ± 0.139
SerThr: 2.962 ± 0.112
SerVal: 3.679 ± 0.088
SerTrp: 0.576 ± 0.038
SerTyr: 2.306 ± 0.077
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.731 ± 0.103
ThrCys: 0.508 ± 0.036
ThrAsp: 2.426 ± 0.088
ThrGlu: 2.442 ± 0.079
ThrPhe: 2.452 ± 0.084
ThrGly: 3.513 ± 0.108
ThrHis: 0.797 ± 0.046
ThrIle: 4.3 ± 0.1
ThrLys: 4.355 ± 0.114
ThrLeu: 4.441 ± 0.115
ThrMet: 0.893 ± 0.045
ThrAsn: 2.731 ± 0.086
ThrPro: 1.826 ± 0.062
ThrGln: 1.209 ± 0.057
ThrArg: 1.551 ± 0.066
ThrSer: 3.146 ± 0.099
ThrThr: 2.268 ± 0.084
ThrVal: 2.844 ± 0.101
ThrTrp: 0.36 ± 0.034
ThrTyr: 1.514 ± 0.061
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.566 ± 0.111
ValCys: 0.629 ± 0.042
ValAsp: 3.027 ± 0.095
ValGlu: 3.344 ± 0.107
ValPhe: 3.07 ± 0.1
ValGly: 3.925 ± 0.112
ValHis: 0.855 ± 0.055
ValIle: 5.112 ± 0.102
ValLys: 5.067 ± 0.122
ValLeu: 5.454 ± 0.124
ValMet: 1.38 ± 0.062
ValAsn: 3.271 ± 0.1
ValPro: 1.914 ± 0.074
ValGln: 1.431 ± 0.05
ValArg: 1.707 ± 0.071
ValSer: 4.101 ± 0.109
ValThr: 3.005 ± 0.099
ValVal: 3.488 ± 0.097
ValTrp: 0.493 ± 0.032
ValTyr: 1.68 ± 0.067
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.432 ± 0.031
TrpCys: 0.121 ± 0.017
TrpAsp: 0.395 ± 0.031
TrpGlu: 0.392 ± 0.034
TrpPhe: 0.485 ± 0.037
TrpGly: 0.558 ± 0.041
TrpHis: 0.181 ± 0.022
TrpIle: 0.676 ± 0.041
TrpLys: 0.8 ± 0.043
TrpLeu: 0.832 ± 0.047
TrpMet: 0.191 ± 0.018
TrpAsn: 0.513 ± 0.036
TrpPro: 0.287 ± 0.031
TrpGln: 0.241 ± 0.021
TrpArg: 0.382 ± 0.031
TrpSer: 0.634 ± 0.044
TrpThr: 0.367 ± 0.032
TrpVal: 0.465 ± 0.037
TrpTrp: 0.106 ± 0.016
TrpTyr: 0.267 ± 0.029
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.73 ± 0.072
TyrCys: 0.4 ± 0.035
TyrAsp: 1.888 ± 0.064
TyrGlu: 2.057 ± 0.078
TyrPhe: 2.386 ± 0.082
TyrGly: 2.059 ± 0.073
TyrHis: 0.644 ± 0.034
TyrIle: 2.535 ± 0.102
TyrLys: 3.558 ± 0.095
TyrLeu: 3.862 ± 0.106
TyrMet: 0.538 ± 0.039
TyrAsn: 1.914 ± 0.065
TyrPro: 1.207 ± 0.053
TyrGln: 1.333 ± 0.059
TyrArg: 1.051 ± 0.059
TyrSer: 2.457 ± 0.075
TyrThr: 1.257 ± 0.053
TyrVal: 1.715 ± 0.062
TyrTrp: 0.39 ± 0.03
TyrTyr: 1.242 ± 0.059
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 1321 proteins (397694 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
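A protein-level bootstrap of this kind might look like the following sketch. It is an illustration rather than the database's actual procedure: the function names are hypothetical, the reported error is assumed to be the standard deviation across replicates, and dipeptides are assumed to be counted with overlapping windows within each protein.

```python
import random
import statistics
from collections import Counter

def per_mille(proteins, dipeptide):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0

def bootstrap_error(proteins, dipeptide, rounds=100, seed=0):
    """Standard deviation of the frequency over resampled proteomes.

    Each round draws len(proteins) whole proteins with replacement,
    so the resampling unit is the protein, not the residue.
    """
    rng = random.Random(seed)
    estimates = [
        per_mille(rng.choices(proteins, k=len(proteins)), dipeptide)
        for _ in range(rounds)
    ]
    return statistics.stdev(estimates)

# Toy sequences, not taken from the HIMB114 proteome.
toy = ["MKKLLA", "MKIKK", "MAKKGS", "MLLKKA"]
print(per_mille(toy, "KK"), "±", bootstrap_error(toy, "KK"))
```

Resampling whole proteins keeps the residues of each protein together, so the estimate reflects variation between proteins rather than between individual residue pairs.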

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski