Amino acid dipepetide frequency for Dickeya virus Limestone

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.977AlaAla: 5.977 ± 0.481
0.763AlaCys: 0.763 ± 0.12
4.493AlaAsp: 4.493 ± 0.328
4.493AlaGlu: 4.493 ± 0.367
2.649AlaPhe: 2.649 ± 0.242
4.854AlaGly: 4.854 ± 0.354
1.568AlaHis: 1.568 ± 0.173
4.409AlaIle: 4.409 ± 0.3
4.451AlaLys: 4.451 ± 0.292
5.829AlaLeu: 5.829 ± 0.423
1.759AlaMet: 1.759 ± 0.163
2.734AlaAsn: 2.734 ± 0.263
2.565AlaPro: 2.565 ± 0.25
2.628AlaGln: 2.628 ± 0.224
3.179AlaArg: 3.179 ± 0.215
4.048AlaSer: 4.048 ± 0.356
4.027AlaThr: 4.027 ± 0.306
5.129AlaVal: 5.129 ± 0.35
1.123AlaTrp: 1.123 ± 0.147
2.374AlaTyr: 2.374 ± 0.247
0.0AlaXaa: 0.0 ± 0.0
Cys
0.699CysAla: 0.699 ± 0.133
0.233CysCys: 0.233 ± 0.076
0.657CysAsp: 0.657 ± 0.118
0.805CysGlu: 0.805 ± 0.133
0.36CysPhe: 0.36 ± 0.081
0.805CysGly: 0.805 ± 0.152
0.445CysHis: 0.445 ± 0.118
0.784CysIle: 0.784 ± 0.12
0.805CysLys: 0.805 ± 0.148
0.848CysLeu: 0.848 ± 0.147
0.403CysMet: 0.403 ± 0.095
0.636CysAsn: 0.636 ± 0.117
0.657CysPro: 0.657 ± 0.129
0.36CysGln: 0.36 ± 0.075
0.657CysArg: 0.657 ± 0.11
1.06CysSer: 1.06 ± 0.161
0.572CysThr: 0.572 ± 0.109
1.145CysVal: 1.145 ± 0.168
0.106CysTrp: 0.106 ± 0.049
0.318CysTyr: 0.318 ± 0.086
0.0CysXaa: 0.0 ± 0.0
Asp
4.281AspAla: 4.281 ± 0.369
0.445AspCys: 0.445 ± 0.104
3.963AspAsp: 3.963 ± 0.378
4.472AspGlu: 4.472 ± 0.332
3.179AspPhe: 3.179 ± 0.255
4.621AspGly: 4.621 ± 0.306
1.251AspHis: 1.251 ± 0.147
4.303AspIle: 4.303 ± 0.309
3.879AspLys: 3.879 ± 0.265
5.32AspLeu: 5.32 ± 0.299
2.077AspMet: 2.077 ± 0.225
2.713AspAsn: 2.713 ± 0.237
3.052AspPro: 3.052 ± 0.29
2.014AspGln: 2.014 ± 0.214
2.543AspArg: 2.543 ± 0.212
3.328AspSer: 3.328 ± 0.267
3.54AspThr: 3.54 ± 0.291
4.345AspVal: 4.345 ± 0.332
0.975AspTrp: 0.975 ± 0.148
3.264AspTyr: 3.264 ± 0.3
0.0AspXaa: 0.0 ± 0.0
Glu
4.621GluAla: 4.621 ± 0.382
0.827GluCys: 0.827 ± 0.145
4.112GluAsp: 4.112 ± 0.304
4.917GluGlu: 4.917 ± 0.385
3.37GluPhe: 3.37 ± 0.257
4.112GluGly: 4.112 ± 0.314
1.611GluHis: 1.611 ± 0.207
3.624GluIle: 3.624 ± 0.241
3.794GluLys: 3.794 ± 0.36
5.956GluLeu: 5.956 ± 0.346
2.416GluMet: 2.416 ± 0.205
2.883GluAsn: 2.883 ± 0.222
2.014GluPro: 2.014 ± 0.185
2.543GluGln: 2.543 ± 0.222
3.942GluArg: 3.942 ± 0.309
3.561GluSer: 3.561 ± 0.288
3.752GluThr: 3.752 ± 0.296
4.663GluVal: 4.663 ± 0.328
1.229GluTrp: 1.229 ± 0.149
2.819GluTyr: 2.819 ± 0.229
0.0GluXaa: 0.0 ± 0.0
Phe
2.692PheAla: 2.692 ± 0.248
0.487PheCys: 0.487 ± 0.114
2.586PheAsp: 2.586 ± 0.262
3.264PheGlu: 3.264 ± 0.267
1.696PhePhe: 1.696 ± 0.205
3.052PheGly: 3.052 ± 0.242
1.017PheHis: 1.017 ± 0.131
2.755PheIle: 2.755 ± 0.231
2.671PheLys: 2.671 ± 0.238
3.158PheLeu: 3.158 ± 0.265
1.335PheMet: 1.335 ± 0.154
2.692PheAsn: 2.692 ± 0.247
1.611PhePro: 1.611 ± 0.186
1.611PheGln: 1.611 ± 0.166
2.119PheArg: 2.119 ± 0.201
2.946PheSer: 2.946 ± 0.243
3.01PheThr: 3.01 ± 0.27
2.925PheVal: 2.925 ± 0.23
0.678PheTrp: 0.678 ± 0.135
1.462PheTyr: 1.462 ± 0.147
0.0PheXaa: 0.0 ± 0.0
Gly
3.942GlyAla: 3.942 ± 0.315
1.06GlyCys: 1.06 ± 0.177
3.646GlyAsp: 3.646 ± 0.251
4.324GlyGlu: 4.324 ± 0.28
2.755GlyPhe: 2.755 ± 0.292
4.811GlyGly: 4.811 ± 0.372
1.251GlyHis: 1.251 ± 0.156
4.769GlyIle: 4.769 ± 0.334
5.193GlyLys: 5.193 ± 0.299
5.235GlyLeu: 5.235 ± 0.245
1.992GlyMet: 1.992 ± 0.155
3.349GlyAsn: 3.349 ± 0.255
1.251GlyPro: 1.251 ± 0.144
2.374GlyGln: 2.374 ± 0.218
2.84GlyArg: 2.84 ± 0.27
4.154GlySer: 4.154 ± 0.334
3.879GlyThr: 3.879 ± 0.322
5.659GlyVal: 5.659 ± 0.37
1.251GlyTrp: 1.251 ± 0.18
2.395GlyTyr: 2.395 ± 0.205
0.0GlyXaa: 0.0 ± 0.0
His
1.06HisAla: 1.06 ± 0.158
0.339HisCys: 0.339 ± 0.102
1.187HisAsp: 1.187 ± 0.179
0.848HisGlu: 0.848 ± 0.136
0.975HisPhe: 0.975 ± 0.133
1.462HisGly: 1.462 ± 0.19
0.657HisHis: 0.657 ± 0.119
1.462HisIle: 1.462 ± 0.172
1.59HisLys: 1.59 ± 0.223
1.611HisLeu: 1.611 ± 0.169
0.53HisMet: 0.53 ± 0.095
0.89HisAsn: 0.89 ± 0.142
1.145HisPro: 1.145 ± 0.16
0.678HisGln: 0.678 ± 0.126
1.145HisArg: 1.145 ± 0.165
1.123HisSer: 1.123 ± 0.196
1.102HisThr: 1.102 ± 0.154
1.187HisVal: 1.187 ± 0.189
0.212HisTrp: 0.212 ± 0.078
0.848HisTyr: 0.848 ± 0.131
0.0HisXaa: 0.0 ± 0.0
Ile
3.667IleAla: 3.667 ± 0.296
0.89IleCys: 0.89 ± 0.146
4.472IleAsp: 4.472 ± 0.332
4.663IleGlu: 4.663 ± 0.283
1.484IlePhe: 1.484 ± 0.185
3.582IleGly: 3.582 ± 0.249
1.229IleHis: 1.229 ± 0.169
3.158IleIle: 3.158 ± 0.287
4.069IleLys: 4.069 ± 0.286
4.303IleLeu: 4.303 ± 0.28
1.42IleMet: 1.42 ± 0.182
3.2IleAsn: 3.2 ± 0.283
3.264IlePro: 3.264 ± 0.219
2.946IleGln: 2.946 ± 0.278
2.904IleArg: 2.904 ± 0.235
3.306IleSer: 3.306 ± 0.285
4.387IleThr: 4.387 ± 0.338
3.921IleVal: 3.921 ± 0.232
0.636IleTrp: 0.636 ± 0.12
1.844IleTyr: 1.844 ± 0.211
0.0IleXaa: 0.0 ± 0.0
Lys
4.26LysAla: 4.26 ± 0.315
0.636LysCys: 0.636 ± 0.165
4.536LysAsp: 4.536 ± 0.295
4.854LysGlu: 4.854 ± 0.386
3.476LysPhe: 3.476 ± 0.243
4.027LysGly: 4.027 ± 0.338
1.123LysHis: 1.123 ± 0.153
3.667LysIle: 3.667 ± 0.235
4.366LysLys: 4.366 ± 0.38
5.617LysLeu: 5.617 ± 0.332
2.692LysMet: 2.692 ± 0.232
2.861LysAsn: 2.861 ± 0.222
2.649LysPro: 2.649 ± 0.255
2.416LysGln: 2.416 ± 0.22
3.646LysArg: 3.646 ± 0.309
4.006LysSer: 4.006 ± 0.287
3.603LysThr: 3.603 ± 0.313
4.642LysVal: 4.642 ± 0.292
0.954LysTrp: 0.954 ± 0.127
2.692LysTyr: 2.692 ± 0.271
0.0LysXaa: 0.0 ± 0.0
Leu
6.337LeuAla: 6.337 ± 0.327
0.869LeuCys: 0.869 ± 0.117
5.426LeuAsp: 5.426 ± 0.341
5.002LeuGlu: 5.002 ± 0.357
3.158LeuPhe: 3.158 ± 0.228
4.917LeuGly: 4.917 ± 0.298
1.293LeuHis: 1.293 ± 0.164
4.154LeuIle: 4.154 ± 0.336
6.189LeuLys: 6.189 ± 0.386
6.74LeuLeu: 6.74 ± 0.4
1.992LeuMet: 1.992 ± 0.188
4.536LeuAsn: 4.536 ± 0.317
3.688LeuPro: 3.688 ± 0.27
2.967LeuGln: 2.967 ± 0.249
3.709LeuArg: 3.709 ± 0.284
5.553LeuSer: 5.553 ± 0.283
5.087LeuThr: 5.087 ± 0.35
5.68LeuVal: 5.68 ± 0.339
0.869LeuTrp: 0.869 ± 0.147
2.798LeuTyr: 2.798 ± 0.242
0.0LeuXaa: 0.0 ± 0.0
Met
2.416MetAla: 2.416 ± 0.261
0.382MetCys: 0.382 ± 0.105
1.547MetAsp: 1.547 ± 0.181
1.484MetGlu: 1.484 ± 0.191
1.611MetPhe: 1.611 ± 0.202
1.717MetGly: 1.717 ± 0.188
0.466MetHis: 0.466 ± 0.108
1.505MetIle: 1.505 ± 0.199
2.522MetLys: 2.522 ± 0.221
2.48MetLeu: 2.48 ± 0.205
1.017MetMet: 1.017 ± 0.131
1.484MetAsn: 1.484 ± 0.158
1.123MetPro: 1.123 ± 0.139
1.081MetGln: 1.081 ± 0.15
1.802MetArg: 1.802 ± 0.177
2.247MetSer: 2.247 ± 0.196
1.568MetThr: 1.568 ± 0.175
1.886MetVal: 1.886 ± 0.213
0.445MetTrp: 0.445 ± 0.108
0.827MetTyr: 0.827 ± 0.129
0.0MetXaa: 0.0 ± 0.0
Asn
3.773AsnAla: 3.773 ± 0.284
0.678AsnCys: 0.678 ± 0.13
2.48AsnAsp: 2.48 ± 0.211
2.437AsnGlu: 2.437 ± 0.248
2.268AsnPhe: 2.268 ± 0.233
4.175AsnGly: 4.175 ± 0.275
0.869AsnHis: 0.869 ± 0.129
2.967AsnIle: 2.967 ± 0.251
2.946AsnLys: 2.946 ± 0.229
3.815AsnLeu: 3.815 ± 0.252
1.484AsnMet: 1.484 ± 0.195
2.713AsnAsn: 2.713 ± 0.256
2.798AsnPro: 2.798 ± 0.244
1.78AsnGln: 1.78 ± 0.173
2.798AsnArg: 2.798 ± 0.254
2.755AsnSer: 2.755 ± 0.248
2.84AsnThr: 2.84 ± 0.256
3.476AsnVal: 3.476 ± 0.279
0.636AsnTrp: 0.636 ± 0.115
1.717AsnTyr: 1.717 ± 0.24
0.0AsnXaa: 0.0 ± 0.0
Pro
2.755ProAla: 2.755 ± 0.223
0.403ProCys: 0.403 ± 0.083
3.031ProAsp: 3.031 ± 0.218
3.497ProGlu: 3.497 ± 0.312
1.823ProPhe: 1.823 ± 0.243
2.459ProGly: 2.459 ± 0.215
0.678ProHis: 0.678 ± 0.122
1.886ProIle: 1.886 ± 0.196
2.247ProLys: 2.247 ± 0.201
3.052ProLeu: 3.052 ± 0.245
1.208ProMet: 1.208 ± 0.141
1.908ProAsn: 1.908 ± 0.19
1.102ProPro: 1.102 ± 0.189
1.441ProGln: 1.441 ± 0.194
1.844ProArg: 1.844 ± 0.216
2.883ProSer: 2.883 ± 0.231
2.522ProThr: 2.522 ± 0.251
3.031ProVal: 3.031 ± 0.258
0.615ProTrp: 0.615 ± 0.112
1.293ProTyr: 1.293 ± 0.192
0.0ProXaa: 0.0 ± 0.0
Gln
2.692GlnAla: 2.692 ± 0.254
0.403GlnCys: 0.403 ± 0.087
2.48GlnAsp: 2.48 ± 0.188
2.31GlnGlu: 2.31 ± 0.289
2.014GlnPhe: 2.014 ± 0.239
2.247GlnGly: 2.247 ± 0.209
0.742GlnHis: 0.742 ± 0.107
2.204GlnIle: 2.204 ± 0.188
2.501GlnLys: 2.501 ± 0.215
2.904GlnLeu: 2.904 ± 0.23
1.166GlnMet: 1.166 ± 0.137
1.653GlnAsn: 1.653 ± 0.206
1.293GlnPro: 1.293 ± 0.155
1.95GlnGln: 1.95 ± 0.291
1.759GlnArg: 1.759 ± 0.186
2.247GlnSer: 2.247 ± 0.23
2.395GlnThr: 2.395 ± 0.214
2.289GlnVal: 2.289 ± 0.199
0.424GlnTrp: 0.424 ± 0.085
1.632GlnTyr: 1.632 ± 0.156
0.0GlnXaa: 0.0 ± 0.0
Arg
3.137ArgAla: 3.137 ± 0.256
0.763ArgCys: 0.763 ± 0.127
3.094ArgAsp: 3.094 ± 0.275
3.285ArgGlu: 3.285 ± 0.283
2.459ArgPhe: 2.459 ± 0.225
2.819ArgGly: 2.819 ± 0.214
1.166ArgHis: 1.166 ± 0.146
3.306ArgIle: 3.306 ± 0.258
3.412ArgLys: 3.412 ± 0.27
4.684ArgLeu: 4.684 ± 0.347
1.929ArgMet: 1.929 ± 0.209
2.353ArgAsn: 2.353 ± 0.21
1.611ArgPro: 1.611 ± 0.198
1.886ArgGln: 1.886 ± 0.166
2.84ArgArg: 2.84 ± 0.26
3.031ArgSer: 3.031 ± 0.219
2.459ArgThr: 2.459 ± 0.205
3.222ArgVal: 3.222 ± 0.292
0.615ArgTrp: 0.615 ± 0.117
2.225ArgTyr: 2.225 ± 0.241
0.0ArgXaa: 0.0 ± 0.0
Ser
4.006SerAla: 4.006 ± 0.296
0.657SerCys: 0.657 ± 0.174
3.624SerAsp: 3.624 ± 0.268
3.857SerGlu: 3.857 ± 0.27
2.798SerPhe: 2.798 ± 0.254
4.726SerGly: 4.726 ± 0.368
1.229SerHis: 1.229 ± 0.177
3.921SerIle: 3.921 ± 0.275
4.387SerLys: 4.387 ± 0.34
5.002SerLeu: 5.002 ± 0.365
1.59SerMet: 1.59 ± 0.188
3.752SerAsn: 3.752 ± 0.291
2.331SerPro: 2.331 ± 0.232
2.162SerGln: 2.162 ± 0.246
2.904SerArg: 2.904 ± 0.216
4.642SerSer: 4.642 ± 0.432
3.306SerThr: 3.306 ± 0.315
4.175SerVal: 4.175 ± 0.3
0.678SerTrp: 0.678 ± 0.111
2.459SerTyr: 2.459 ± 0.256
0.0SerXaa: 0.0 ± 0.0
Thr
4.303ThrAla: 4.303 ± 0.345
0.615ThrCys: 0.615 ± 0.113
3.349ThrAsp: 3.349 ± 0.247
4.197ThrGlu: 4.197 ± 0.303
2.671ThrPhe: 2.671 ± 0.224
4.345ThrGly: 4.345 ± 0.329
0.869ThrHis: 0.869 ± 0.147
3.688ThrIle: 3.688 ± 0.323
3.963ThrLys: 3.963 ± 0.251
4.769ThrLeu: 4.769 ± 0.355
1.187ThrMet: 1.187 ± 0.166
2.883ThrAsn: 2.883 ± 0.277
3.37ThrPro: 3.37 ± 0.245
1.95ThrGln: 1.95 ± 0.215
3.179ThrArg: 3.179 ± 0.29
3.412ThrSer: 3.412 ± 0.279
3.667ThrThr: 3.667 ± 0.356
4.387ThrVal: 4.387 ± 0.382
0.784ThrTrp: 0.784 ± 0.151
1.696ThrTyr: 1.696 ± 0.214
0.0ThrXaa: 0.0 ± 0.0
Val
4.811ValAla: 4.811 ± 0.346
0.933ValCys: 0.933 ± 0.146
5.193ValAsp: 5.193 ± 0.309
4.642ValGlu: 4.642 ± 0.339
2.883ValPhe: 2.883 ± 0.241
4.218ValGly: 4.218 ± 0.31
1.251ValHis: 1.251 ± 0.149
4.006ValIle: 4.006 ± 0.242
4.917ValLys: 4.917 ± 0.35
5.256ValLeu: 5.256 ± 0.351
2.035ValMet: 2.035 ± 0.189
3.349ValAsn: 3.349 ± 0.303
2.437ValPro: 2.437 ± 0.25
2.734ValGln: 2.734 ± 0.254
3.328ValArg: 3.328 ± 0.242
4.854ValSer: 4.854 ± 0.349
5.108ValThr: 5.108 ± 0.363
5.998ValVal: 5.998 ± 0.402
1.102ValTrp: 1.102 ± 0.147
2.967ValTyr: 2.967 ± 0.236
0.0ValXaa: 0.0 ± 0.0
Trp
0.933TrpAla: 0.933 ± 0.139
0.276TrpCys: 0.276 ± 0.075
0.89TrpAsp: 0.89 ± 0.15
1.208TrpGlu: 1.208 ± 0.141
0.615TrpPhe: 0.615 ± 0.117
0.721TrpGly: 0.721 ± 0.109
0.276TrpHis: 0.276 ± 0.074
0.699TrpIle: 0.699 ± 0.114
0.954TrpLys: 0.954 ± 0.168
1.547TrpLeu: 1.547 ± 0.162
0.36TrpMet: 0.36 ± 0.092
0.742TrpAsn: 0.742 ± 0.108
0.339TrpPro: 0.339 ± 0.093
0.318TrpGln: 0.318 ± 0.069
1.06TrpArg: 1.06 ± 0.132
0.699TrpSer: 0.699 ± 0.145
0.636TrpThr: 0.636 ± 0.107
1.145TrpVal: 1.145 ± 0.124
0.17TrpTrp: 0.17 ± 0.058
0.509TrpTyr: 0.509 ± 0.115
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.692TyrAla: 2.692 ± 0.222
0.699TyrCys: 0.699 ± 0.129
2.798TyrAsp: 2.798 ± 0.234
2.204TyrGlu: 2.204 ± 0.194
1.526TyrPhe: 1.526 ± 0.18
2.459TyrGly: 2.459 ± 0.22
1.06TyrHis: 1.06 ± 0.173
2.077TyrIle: 2.077 ± 0.205
1.865TyrLys: 1.865 ± 0.186
2.84TyrLeu: 2.84 ± 0.233
0.996TyrMet: 0.996 ± 0.137
2.077TyrAsn: 2.077 ± 0.218
1.399TyrPro: 1.399 ± 0.153
1.462TyrGln: 1.462 ± 0.139
2.098TyrArg: 2.098 ± 0.208
2.353TyrSer: 2.353 ± 0.198
1.823TyrThr: 1.823 ± 0.235
3.116TyrVal: 3.116 ± 0.208
0.593TyrTrp: 0.593 ± 0.11
1.251TyrTyr: 1.251 ± 0.152
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 201 proteins (47182 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski