Amino acid dipeptide frequency for bacterium B13(2017)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
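As a minimal sketch of how such frequencies could be computed (not necessarily the procedure used to build this table), the function below counts overlapping residue pairs in one-letter protein sequences. The function name, the mapping of every non-standard letter to X (Xaa), and the choice not to count pairs across protein boundaries are assumptions made for illustration.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_frequencies(proteins):
    """Return {dipeptide: fraction} over all overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: any letter outside the standard 20 is treated as X (Xaa).
        residues = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(a + b for a, b in zip(residues, residues[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from this proteome).
freqs = dipeptide_frequencies(["MKKLA", "MKUA"])
print(sorted(freqs.items()))
```

The returned values are fractions that sum to 1; multiplying by 1000 gives per mille.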

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present at a frequency of 0.25% (1/400). This is not the case in nature, so for better visibility the dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
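The arithmetic behind the 0.25% baseline can be sketched as follows. The helper name `classify` and the simple above/below threshold are assumptions for illustration; the exact rule used to colour the table is not stated here. The three example values are taken from the table and are in per mille, the unit the table uses.

```python
N_STANDARD = 20

# Uniform expectation: 1 of 400 dipeptides, i.e. 0.25% or 2.5 per mille.
expected_per_mille = 1000 / N_STANDARD**2
# With Xaa as a 21st residue there are 441 pairs, about 2.27 per mille each.
expected_per_mille_with_xaa = 1000 / (N_STANDARD + 1) ** 2


def classify(observed_per_mille, expected=expected_per_mille):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected:
        return "over-represented"
    if observed_per_mille < expected:
        return "under-represented"
    return "as expected"


# Values copied from the table (per mille).
examples = {"LysLys": 7.741, "CysCys": 0.057, "AlaAla": 2.564}
labels = {pair: classify(value) for pair, value in examples.items()}
print(expected_per_mille, labels)
```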

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
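As a small worked example of this unit conversion, the snippet below turns one table value (LeuLys, 8.032‰) into a fraction and a percentage. The function names are hypothetical helpers, not part of any published tool.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10


leu_lys = 8.032  # LeuLys frequency from the table, in per mille
print(per_mille_to_fraction(leu_lys), per_mille_to_percent(leu_lys))
```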

For more information, see the sequence space article on Wikipedia.

Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Entries are grouped by first residue; each line gives dipeptide: frequency (‰) ± error.

Ala
AlaAla: 2.564 ± 0.178
AlaCys: 0.364 ± 0.053
AlaAsp: 2.82 ± 0.14
AlaGlu: 3.187 ± 0.208
AlaPhe: 2.3 ± 0.116
AlaGly: 2.996 ± 0.187
AlaHis: 0.792 ± 0.073
AlaIle: 4.282 ± 0.19
AlaLys: 3.953 ± 0.21
AlaLeu: 5.062 ± 0.206
AlaMet: 1.159 ± 0.076
AlaAsn: 2.648 ± 0.104
AlaPro: 1.075 ± 0.069
AlaGln: 1.599 ± 0.105
AlaArg: 2.112 ± 0.154
AlaSer: 3.298 ± 0.115
AlaThr: 2.522 ± 0.133
AlaVal: 2.487 ± 0.107
AlaTrp: 0.429 ± 0.047
AlaTyr: 2.097 ± 0.102
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.295 ± 0.039
CysCys: 0.057 ± 0.019
CysAsp: 0.417 ± 0.053
CysGlu: 0.52 ± 0.061
CysPhe: 0.352 ± 0.056
CysGly: 0.474 ± 0.072
CysHis: 0.13 ± 0.026
CysIle: 0.559 ± 0.081
CysLys: 0.444 ± 0.054
CysLeu: 0.547 ± 0.078
CysMet: 0.149 ± 0.026
CysAsn: 0.36 ± 0.061
CysPro: 0.272 ± 0.046
CysGln: 0.18 ± 0.03
CysArg: 0.241 ± 0.041
CysSer: 0.432 ± 0.06
CysThr: 0.295 ± 0.054
CysVal: 0.318 ± 0.052
CysTrp: 0.046 ± 0.014
CysTyr: 0.279 ± 0.049
CysXaa: 0.0 ± 0.0

Asp
AspAla: 3.187 ± 0.16
AspCys: 0.36 ± 0.044
AspAsp: 3.773 ± 0.183
AspGlu: 5.384 ± 0.21
AspPhe: 3.578 ± 0.131
AspGly: 5.227 ± 0.395
AspHis: 0.903 ± 0.068
AspIle: 5.663 ± 0.172
AspLys: 4.098 ± 0.206
AspLeu: 6.333 ± 0.191
AspMet: 1.228 ± 0.064
AspAsn: 3.765 ± 0.187
AspPro: 2.537 ± 0.12
AspGln: 2.078 ± 0.125
AspArg: 2.277 ± 0.143
AspSer: 4.481 ± 0.174
AspThr: 2.946 ± 0.157
AspVal: 3.291 ± 0.172
AspTrp: 0.777 ± 0.047
AspTyr: 3.141 ± 0.175
AspXaa: 0.0 ± 0.0

Glu
GluAla: 3.849 ± 0.259
GluCys: 0.329 ± 0.041
GluAsp: 4.657 ± 0.193
GluGlu: 6.995 ± 0.326
GluPhe: 3.75 ± 0.151
GluGly: 4.733 ± 0.237
GluHis: 1.071 ± 0.057
GluIle: 8.093 ± 0.355
GluLys: 7.224 ± 0.448
GluLeu: 7.201 ± 0.31
GluMet: 1.825 ± 0.119
GluAsn: 5.981 ± 0.269
GluPro: 1.343 ± 0.081
GluGln: 2.556 ± 0.184
GluArg: 3.245 ± 0.217
GluSer: 5.013 ± 0.24
GluThr: 4.917 ± 0.344
GluVal: 4.446 ± 0.154
GluTrp: 0.781 ± 0.061
GluTyr: 3.803 ± 0.2
GluXaa: 0.004 ± 0.004

Phe
PheAla: 2.089 ± 0.108
PheCys: 0.452 ± 0.073
PheAsp: 4.048 ± 0.224
PheGlu: 3.849 ± 0.131
PhePhe: 2.024 ± 0.171
PheGly: 2.877 ± 0.147
PheHis: 0.8 ± 0.078
PheIle: 3.957 ± 0.242
PheLys: 3.524 ± 0.127
PheLeu: 3.635 ± 0.221
PheMet: 0.788 ± 0.063
PheAsn: 3.589 ± 0.165
PhePro: 1.194 ± 0.129
PheGln: 1.496 ± 0.071
PheArg: 1.649 ± 0.119
PheSer: 3.696 ± 0.153
PheThr: 2.483 ± 0.106
PheVal: 2.369 ± 0.097
PheTrp: 0.444 ± 0.046
PheTyr: 2.066 ± 0.109
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 3.318 ± 0.158
GlyCys: 0.39 ± 0.051
GlyAsp: 3.834 ± 0.231
GlyGlu: 5.407 ± 0.245
GlyPhe: 3.073 ± 0.152
GlyGly: 4.006 ± 0.217
GlyHis: 0.918 ± 0.063
GlyIle: 5.277 ± 0.219
GlyLys: 5.093 ± 0.221
GlyLeu: 4.68 ± 0.171
GlyMet: 1.286 ± 0.071
GlyAsn: 4.309 ± 0.263
GlyPro: 0.922 ± 0.097
GlyGln: 2.147 ± 0.169
GlyArg: 2.759 ± 0.16
GlySer: 3.983 ± 0.201
GlyThr: 3.344 ± 0.157
GlyVal: 3.677 ± 0.151
GlyTrp: 0.601 ± 0.066
GlyTyr: 2.74 ± 0.144
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.677 ± 0.061
HisCys: 0.149 ± 0.032
HisAsp: 0.907 ± 0.076
HisGlu: 1.24 ± 0.085
HisPhe: 0.849 ± 0.064
HisGly: 0.995 ± 0.062
HisHis: 0.318 ± 0.039
HisIle: 1.205 ± 0.078
HisLys: 0.915 ± 0.071
HisLeu: 1.496 ± 0.105
HisMet: 0.21 ± 0.037
HisAsn: 0.849 ± 0.054
HisPro: 0.62 ± 0.083
HisGln: 0.44 ± 0.042
HisArg: 0.628 ± 0.062
HisSer: 1.098 ± 0.086
HisThr: 0.838 ± 0.07
HisVal: 0.827 ± 0.064
HisTrp: 0.172 ± 0.026
HisTyr: 0.892 ± 0.063
HisXaa: 0.0 ± 0.0

Ile
IleAla: 4.167 ± 0.163
IleCys: 0.62 ± 0.092
IleAsp: 6.176 ± 0.145
IleGlu: 7.182 ± 0.318
IlePhe: 3.914 ± 0.244
IleGly: 4.726 ± 0.18
IleHis: 1.37 ± 0.083
IleIle: 7.928 ± 0.41
IleLys: 7.634 ± 0.389
IleLeu: 7.446 ± 0.269
IleMet: 1.576 ± 0.096
IleAsn: 6.578 ± 0.253
IlePro: 2.87 ± 0.237
IleGln: 3.161 ± 0.109
IleArg: 3.413 ± 0.166
IleSer: 6.823 ± 0.19
IleThr: 5.2 ± 0.191
IleVal: 4.266 ± 0.154
IleTrp: 0.57 ± 0.052
IleTyr: 3.436 ± 0.196
IleXaa: 0.0 ± 0.0

Lys
LysAla: 4.094 ± 0.29
LysCys: 0.409 ± 0.058
LysAsp: 5.403 ± 0.248
LysGlu: 7.584 ± 0.508
LysPhe: 2.778 ± 0.113
LysGly: 4.4 ± 0.189
LysHis: 1.194 ± 0.084
LysIle: 7.68 ± 0.376
LysLys: 7.741 ± 0.541
LysLeu: 6.501 ± 0.346
LysMet: 1.584 ± 0.097
LysAsn: 5.464 ± 0.335
LysPro: 1.73 ± 0.151
LysGln: 2.721 ± 0.161
LysArg: 3.019 ± 0.159
LysSer: 4.833 ± 0.206
LysThr: 5.223 ± 0.25
LysVal: 4.53 ± 0.188
LysTrp: 0.765 ± 0.063
LysTyr: 3.363 ± 0.183
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 4.343 ± 0.183
LeuCys: 0.597 ± 0.087
LeuAsp: 5.411 ± 0.188
LeuGlu: 7.266 ± 0.301
LeuPhe: 3.945 ± 0.237
LeuGly: 5.12 ± 0.163
LeuHis: 1.312 ± 0.088
LeuIle: 7.125 ± 0.291
LeuLys: 8.032 ± 0.368
LeuLeu: 7.446 ± 0.281
LeuMet: 1.695 ± 0.112
LeuAsn: 6.731 ± 0.182
LeuPro: 2.629 ± 0.172
LeuGln: 2.877 ± 0.161
LeuArg: 3.451 ± 0.187
LeuSer: 7.209 ± 0.172
LeuThr: 5.02 ± 0.133
LeuVal: 4.083 ± 0.14
LeuTrp: 0.624 ± 0.06
LeuTyr: 2.713 ± 0.134
LeuXaa: 0.004 ± 0.004

Met
MetAla: 1.006 ± 0.076
MetCys: 0.08 ± 0.018
MetAsp: 1.343 ± 0.085
MetGlu: 1.492 ± 0.08
MetPhe: 0.719 ± 0.047
MetGly: 1.148 ± 0.073
MetHis: 0.314 ± 0.035
MetIle: 1.71 ± 0.099
MetLys: 1.722 ± 0.108
MetLeu: 1.642 ± 0.102
MetMet: 0.463 ± 0.051
MetAsn: 1.469 ± 0.079
MetPro: 0.478 ± 0.05
MetGln: 0.819 ± 0.076
MetArg: 0.796 ± 0.058
MetSer: 1.416 ± 0.089
MetThr: 1.163 ± 0.077
MetVal: 1.033 ± 0.072
MetTrp: 0.168 ± 0.025
MetTyr: 0.559 ± 0.048
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.662 ± 0.181
AsnCys: 0.49 ± 0.082
AsnAsp: 3.788 ± 0.174
AsnGlu: 5.108 ± 0.245
AsnPhe: 3.268 ± 0.166
AsnGly: 4.661 ± 0.255
AsnHis: 1.018 ± 0.086
AsnIle: 6.562 ± 0.238
AsnLys: 4.806 ± 0.216
AsnLeu: 6.011 ± 0.151
AsnMet: 1.19 ± 0.091
AsnAsn: 4.5 ± 0.2
AsnPro: 2.15 ± 0.1
AsnGln: 2.476 ± 0.111
AsnArg: 2.365 ± 0.111
AsnSer: 4.117 ± 0.161
AsnThr: 3.053 ± 0.147
AsnVal: 3.379 ± 0.132
AsnTrp: 0.842 ± 0.059
AsnTyr: 3.712 ± 0.257
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 1.029 ± 0.096
ProCys: 0.138 ± 0.032
ProAsp: 1.986 ± 0.127
ProGlu: 2.545 ± 0.092
ProPhe: 1.358 ± 0.131
ProGly: 1.775 ± 0.142
ProHis: 0.386 ± 0.05
ProIle: 2.097 ± 0.205
ProLys: 2.005 ± 0.186
ProLeu: 2.476 ± 0.169
ProMet: 0.375 ± 0.042
ProAsn: 1.446 ± 0.12
ProPro: 0.98 ± 0.104
ProGln: 0.96 ± 0.072
ProArg: 0.781 ± 0.079
ProSer: 1.948 ± 0.127
ProThr: 1.144 ± 0.086
ProVal: 1.592 ± 0.152
ProTrp: 0.302 ± 0.038
ProTyr: 0.922 ± 0.086
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 1.898 ± 0.159
GlnCys: 0.157 ± 0.029
GlnAsp: 1.841 ± 0.085
GlnGlu: 2.732 ± 0.127
GlnPhe: 1.492 ± 0.08
GlnGly: 1.772 ± 0.122
GlnHis: 0.375 ± 0.039
GlnIle: 3.455 ± 0.138
GlnLys: 3.149 ± 0.165
GlnLeu: 2.824 ± 0.166
GlnMet: 0.903 ± 0.09
GlnAsn: 2.227 ± 0.098
GlnPro: 0.608 ± 0.047
GlnGln: 1.171 ± 0.128
GlnArg: 1.469 ± 0.11
GlnSer: 1.94 ± 0.119
GlnThr: 2.017 ± 0.134
GlnVal: 2.074 ± 0.106
GlnTrp: 0.256 ± 0.035
GlnTyr: 1.408 ± 0.108
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 1.925 ± 0.152
ArgCys: 0.18 ± 0.034
ArgAsp: 2.204 ± 0.105
ArgGlu: 3.306 ± 0.239
ArgPhe: 1.833 ± 0.094
ArgGly: 2.319 ± 0.116
ArgHis: 0.551 ± 0.056
ArgIle: 3.635 ± 0.142
ArgLys: 2.958 ± 0.142
ArgLeu: 3.536 ± 0.182
ArgMet: 0.88 ± 0.073
ArgAsn: 2.491 ± 0.114
ArgPro: 0.811 ± 0.077
ArgGln: 1.221 ± 0.117
ArgArg: 1.867 ± 0.205
ArgSer: 2.51 ± 0.138
ArgThr: 2.07 ± 0.139
ArgVal: 2.261 ± 0.135
ArgTrp: 0.333 ± 0.048
ArgTyr: 1.676 ± 0.106
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 2.962 ± 0.124
SerCys: 0.482 ± 0.064
SerAsp: 4.404 ± 0.177
SerGlu: 5.694 ± 0.314
SerPhe: 3.792 ± 0.143
SerGly: 4.699 ± 0.188
SerHis: 1.056 ± 0.079
SerIle: 6.05 ± 0.159
SerLys: 5.698 ± 0.257
SerLeu: 6.482 ± 0.154
SerMet: 1.343 ± 0.085
SerAsn: 4.198 ± 0.13
SerPro: 1.772 ± 0.142
SerGln: 2.51 ± 0.148
SerArg: 2.472 ± 0.126
SerSer: 4.997 ± 0.171
SerThr: 3.536 ± 0.186
SerVal: 3.601 ± 0.123
SerTrp: 0.723 ± 0.072
SerTyr: 3.57 ± 0.227
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 2.319 ± 0.127
ThrCys: 0.329 ± 0.047
ThrAsp: 3.26 ± 0.179
ThrGlu: 4.186 ± 0.286
ThrPhe: 2.881 ± 0.168
ThrGly: 3.96 ± 0.172
ThrHis: 1.098 ± 0.073
ThrIle: 5.506 ± 0.22
ThrLys: 3.914 ± 0.235
ThrLeu: 5.162 ± 0.138
ThrMet: 1.029 ± 0.078
ThrAsn: 3.122 ± 0.142
ThrPro: 1.576 ± 0.112
ThrGln: 1.921 ± 0.134
ThrArg: 1.974 ± 0.106
ThrSer: 3.999 ± 0.197
ThrThr: 3.555 ± 0.295
ThrVal: 2.545 ± 0.157
ThrTrp: 0.716 ± 0.058
ThrTyr: 3.195 ± 0.28
ThrXaa: 0.0 ± 0.0

Val
ValAla: 2.2 ± 0.106
ValCys: 0.413 ± 0.057
ValAsp: 3.7 ± 0.154
ValGlu: 4.002 ± 0.158
ValPhe: 2.518 ± 0.112
ValGly: 2.56 ± 0.117
ValHis: 0.846 ± 0.064
ValIle: 4.699 ± 0.139
ValLys: 4.052 ± 0.185
ValLeu: 4.86 ± 0.143
ValMet: 1.033 ± 0.072
ValAsn: 3.214 ± 0.099
ValPro: 1.328 ± 0.108
ValGln: 1.569 ± 0.099
ValArg: 1.963 ± 0.119
ValSer: 4.075 ± 0.166
ValThr: 3.585 ± 0.208
ValVal: 2.694 ± 0.115
ValTrp: 0.505 ± 0.049
ValTyr: 2.082 ± 0.124
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.436 ± 0.038
TrpCys: 0.099 ± 0.023
TrpAsp: 0.842 ± 0.066
TrpGlu: 0.689 ± 0.054
TrpPhe: 0.417 ± 0.039
TrpGly: 0.685 ± 0.055
TrpHis: 0.138 ± 0.029
TrpIle: 0.716 ± 0.075
TrpLys: 0.704 ± 0.058
TrpLeu: 0.784 ± 0.061
TrpMet: 0.207 ± 0.034
TrpAsn: 0.742 ± 0.053
TrpPro: 0.195 ± 0.028
TrpGln: 0.39 ± 0.046
TrpArg: 0.318 ± 0.031
TrpSer: 0.693 ± 0.054
TrpThr: 0.639 ± 0.069
TrpVal: 0.425 ± 0.047
TrpTrp: 0.092 ± 0.018
TrpTyr: 0.333 ± 0.043
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 1.726 ± 0.086
TyrCys: 0.272 ± 0.044
TyrAsp: 4.213 ± 0.369
TyrGlu: 3.448 ± 0.165
TyrPhe: 2.189 ± 0.121
TyrGly: 2.629 ± 0.143
TyrHis: 0.735 ± 0.056
TyrIle: 2.858 ± 0.118
TyrLys: 3.352 ± 0.192
TyrLeu: 3.505 ± 0.167
TyrMet: 0.612 ± 0.052
TyrAsn: 3.471 ± 0.297
TyrPro: 1.156 ± 0.13
TyrGln: 1.477 ± 0.074
TyrArg: 1.768 ± 0.083
TyrSer: 3.482 ± 0.172
TyrThr: 2.698 ± 0.219
TyrVal: 2.001 ± 0.099
TyrTrp: 0.398 ± 0.039
TyrTyr: 2.215 ± 0.123
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.004 ± 0.004
XaaMet: 0.0 ± 0.0
XaaAsn: 0.004 ± 0.004
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.004 ± 0.004

Statistics based on 465 proteins (261342 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
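A protein-level bootstrap of this kind can be sketched as follows. This is an illustration under stated assumptions, not the database's actual code: whole proteins are resampled with replacement, the frequency of one dipeptide is recomputed for each resample, and the standard deviation of the resampled values is reported as the error. The function names, the fixed seed, and the use of the standard deviation as the error measure are assumptions.

```python
import random
from collections import Counter
from statistics import mean, stdev


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_per_mille(proteins, pair, n_resamples=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [pair_counts(seq) for seq in proteins]
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement (the "protein level").
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # Counter returns 0 if absent
        estimates.append(1000 * hits / total if total else 0.0)
    return mean(estimates), stdev(estimates)


# Toy input (hypothetical sequences, not taken from this proteome).
toy = ["MKKLAKK", "MAAKL", "KKKK", "MLLA"]
kk_mean, kk_error = bootstrap_per_mille(toy, "KK")
print(round(kk_mean, 3), round(kk_error, 3))
```

Resampling proteins rather than individual residues keeps each protein's internal composition intact, so the error reflects variation between proteins.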

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski