Amino acid dipeptide frequency for Acidovorax phage ACP17

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
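The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the database's own pipeline: the function names are hypothetical, sequences use one-letter codes rather than the three-letter codes shown in the table, dipeptides are counted as overlapping pairs within each protein, and frequencies are normalised by the total number of dipeptides counted (the database may normalise differently).

```python
from collections import Counter

# Uniform expectation for 400 dipeptides, in per mille (1/400 = 0.25% = 2.5 per mille).
UNIFORM_EXPECTATION = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted inside each protein only,
    so no dipeptide spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_per_mille):
    """Label a frequency relative to the uniform expectation (hypothetical helper)."""
    if freq_per_mille > UNIFORM_EXPECTATION:
        return "over"   # would be marked red
    if freq_per_mille < UNIFORM_EXPECTATION:
        return "under"  # would be marked blue
    return "expected"


# Toy input, not taken from the phage proteome.
toy = ["MAAK", "AAL"]
freqs = dipeptide_per_mille(toy)
```

With values from the table, `classify(10.722)` (AlaAla) gives "over" and `classify(0.887)` (AlaCys) gives "under".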

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
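As a small worked example of the unit conversion, the AlaAla value from the table can be turned into a fraction and a percentage, and compared with the uniform expectation of 2.5‰. The helper name is hypothetical and only serves the illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


ala_ala = 10.722                           # AlaAla frequency from the table, in per mille
fraction = per_mille_to_fraction(ala_ala)  # about 0.010722
percent = fraction * 100                   # about 1.0722 %
fold_vs_uniform = ala_ala / 2.5            # about 4.29 times the uniform expectation
```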

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 10.722 ± 0.758
AlaCys: 0.887 ± 0.141
AlaAsp: 6.0 ± 0.447
AlaGlu: 5.938 ± 0.452
AlaPhe: 3.856 ± 0.304
AlaGly: 7.34 ± 0.56
AlaHis: 1.979 ± 0.227
AlaIle: 4.722 ± 0.309
AlaLys: 5.278 ± 0.436
AlaLeu: 8.021 ± 0.425
AlaMet: 3.031 ± 0.266
AlaAsn: 3.608 ± 0.327
AlaPro: 3.155 ± 0.309
AlaGln: 3.588 ± 0.284
AlaArg: 5.093 ± 0.348
AlaSer: 5.691 ± 0.442
AlaThr: 5.093 ± 0.299
AlaVal: 6.804 ± 0.437
AlaTrp: 1.299 ± 0.163
AlaTyr: 3.217 ± 0.229
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.155 ± 0.195
CysCys: 0.268 ± 0.08
CysAsp: 0.928 ± 0.149
CysGlu: 0.804 ± 0.144
CysPhe: 0.619 ± 0.112
CysGly: 1.01 ± 0.16
CysHis: 0.268 ± 0.076
CysIle: 0.454 ± 0.09
CysLys: 0.701 ± 0.122
CysLeu: 0.619 ± 0.119
CysMet: 0.351 ± 0.09
CysAsn: 0.227 ± 0.056
CysPro: 0.515 ± 0.101
CysGln: 0.454 ± 0.101
CysArg: 0.639 ± 0.132
CysSer: 0.722 ± 0.143
CysThr: 0.577 ± 0.103
CysVal: 0.804 ± 0.144
CysTrp: 0.165 ± 0.055
CysTyr: 0.289 ± 0.089
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.598 ± 0.432
AspCys: 0.577 ± 0.101
AspAsp: 3.711 ± 0.368
AspGlu: 4.268 ± 0.309
AspPhe: 3.134 ± 0.233
AspGly: 4.763 ± 0.349
AspHis: 1.217 ± 0.166
AspIle: 3.402 ± 0.238
AspLys: 2.722 ± 0.229
AspLeu: 5.093 ± 0.342
AspMet: 1.608 ± 0.213
AspAsn: 2.351 ± 0.2
AspPro: 3.072 ± 0.211
AspGln: 1.876 ± 0.207
AspArg: 2.969 ± 0.243
AspSer: 3.753 ± 0.363
AspThr: 2.866 ± 0.258
AspVal: 3.959 ± 0.307
AspTrp: 1.093 ± 0.157
AspTyr: 2.577 ± 0.196
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.196 ± 0.481
GluCys: 0.763 ± 0.122
GluAsp: 3.382 ± 0.273
GluGlu: 4.681 ± 0.383
GluPhe: 2.99 ± 0.278
GluGly: 4.268 ± 0.304
GluHis: 1.402 ± 0.21
GluIle: 3.67 ± 0.33
GluLys: 3.835 ± 0.304
GluLeu: 5.443 ± 0.339
GluMet: 2.495 ± 0.201
GluAsn: 2.247 ± 0.237
GluPro: 2.392 ± 0.242
GluGln: 1.979 ± 0.204
GluArg: 3.711 ± 0.33
GluSer: 3.588 ± 0.262
GluThr: 3.876 ± 0.25
GluVal: 5.196 ± 0.374
GluTrp: 1.175 ± 0.17
GluTyr: 2.309 ± 0.215
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.443 ± 0.303
PheCys: 0.763 ± 0.134
PheAsp: 3.443 ± 0.311
PheGlu: 3.443 ± 0.278
PhePhe: 2.289 ± 0.225
PheGly: 2.804 ± 0.228
PheHis: 1.031 ± 0.154
PheIle: 1.794 ± 0.157
PheLys: 2.557 ± 0.27
PheLeu: 2.619 ± 0.225
PheMet: 1.155 ± 0.19
PheAsn: 1.856 ± 0.19
PhePro: 1.67 ± 0.187
PheGln: 1.588 ± 0.19
PheArg: 2.371 ± 0.202
PheSer: 2.68 ± 0.229
PheThr: 2.289 ± 0.216
PheVal: 3.237 ± 0.241
PheTrp: 0.763 ± 0.132
PheTyr: 1.856 ± 0.21
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.041 ± 0.468
GlyCys: 1.031 ± 0.164
GlyAsp: 4.103 ± 0.289
GlyGlu: 4.825 ± 0.428
GlyPhe: 3.546 ± 0.268
GlyGly: 5.217 ± 0.422
GlyHis: 1.381 ± 0.165
GlyIle: 3.67 ± 0.246
GlyLys: 4.474 ± 0.319
GlyLeu: 5.835 ± 0.351
GlyMet: 1.773 ± 0.2
GlyAsn: 2.866 ± 0.28
GlyPro: 1.918 ± 0.226
GlyGln: 2.247 ± 0.214
GlyArg: 3.897 ± 0.263
GlySer: 4.681 ± 0.343
GlyThr: 5.608 ± 0.451
GlyVal: 4.949 ± 0.369
GlyTrp: 1.402 ± 0.169
GlyTyr: 2.907 ± 0.234
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.794 ± 0.207
HisCys: 0.454 ± 0.096
HisAsp: 1.155 ± 0.174
HisGlu: 1.113 ± 0.148
HisPhe: 1.052 ± 0.143
HisGly: 1.217 ± 0.145
HisHis: 0.454 ± 0.122
HisIle: 0.99 ± 0.14
HisLys: 0.784 ± 0.12
HisLeu: 1.485 ± 0.208
HisMet: 0.598 ± 0.113
HisAsn: 0.66 ± 0.12
HisPro: 1.134 ± 0.159
HisGln: 0.639 ± 0.125
HisArg: 1.402 ± 0.156
HisSer: 1.175 ± 0.172
HisThr: 1.072 ± 0.156
HisVal: 1.217 ± 0.15
HisTrp: 0.186 ± 0.068
HisTyr: 0.866 ± 0.156
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.196 ± 0.326
IleCys: 0.536 ± 0.098
IleAsp: 3.608 ± 0.254
IleGlu: 3.753 ± 0.303
IlePhe: 1.732 ± 0.17
IleGly: 3.34 ± 0.278
IleHis: 1.196 ± 0.169
IleIle: 2.124 ± 0.222
IleLys: 2.763 ± 0.231
IleLeu: 3.505 ± 0.287
IleMet: 1.052 ± 0.124
IleAsn: 2.206 ± 0.228
IlePro: 2.557 ± 0.251
IleGln: 1.876 ± 0.18
IleArg: 3.031 ± 0.238
IleSer: 2.577 ± 0.226
IleThr: 2.845 ± 0.2
IleVal: 4.041 ± 0.33
IleTrp: 0.619 ± 0.122
IleTyr: 2.062 ± 0.182
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.732 ± 0.395
LysCys: 0.577 ± 0.106
LysAsp: 2.949 ± 0.286
LysGlu: 3.67 ± 0.336
LysPhe: 2.412 ± 0.21
LysGly: 3.258 ± 0.3
LysHis: 1.155 ± 0.175
LysIle: 2.928 ± 0.267
LysLys: 3.835 ± 0.309
LysLeu: 4.681 ± 0.323
LysMet: 2.351 ± 0.199
LysAsn: 2.433 ± 0.256
LysPro: 2.412 ± 0.271
LysGln: 1.938 ± 0.199
LysArg: 2.866 ± 0.237
LysSer: 3.732 ± 0.325
LysThr: 3.773 ± 0.34
LysVal: 4.33 ± 0.211
LysTrp: 0.928 ± 0.138
LysTyr: 1.608 ± 0.179
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.423 ± 0.415
LeuCys: 0.866 ± 0.142
LeuAsp: 5.258 ± 0.347
LeuGlu: 6.083 ± 0.461
LeuPhe: 2.99 ± 0.244
LeuGly: 5.547 ± 0.363
LeuHis: 1.691 ± 0.168
LeuIle: 3.093 ± 0.286
LeuLys: 4.66 ± 0.337
LeuLeu: 5.753 ± 0.381
LeuMet: 1.794 ± 0.218
LeuAsn: 3.959 ± 0.273
LeuPro: 3.196 ± 0.27
LeuGln: 2.969 ± 0.242
LeuArg: 4.371 ± 0.297
LeuSer: 4.557 ± 0.369
LeuThr: 4.103 ± 0.283
LeuVal: 5.175 ± 0.308
LeuTrp: 0.866 ± 0.126
LeuTyr: 2.309 ± 0.229
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.598 ± 0.198
MetCys: 0.124 ± 0.053
MetAsp: 1.67 ± 0.164
MetGlu: 1.835 ± 0.221
MetPhe: 0.887 ± 0.139
MetGly: 1.814 ± 0.193
MetHis: 0.639 ± 0.116
MetIle: 1.217 ± 0.156
MetLys: 2.083 ± 0.218
MetLeu: 2.103 ± 0.196
MetMet: 0.598 ± 0.12
MetAsn: 1.196 ± 0.171
MetPro: 1.031 ± 0.153
MetGln: 0.948 ± 0.144
MetArg: 1.814 ± 0.182
MetSer: 2.103 ± 0.2
MetThr: 2.351 ± 0.215
MetVal: 1.67 ± 0.171
MetTrp: 0.309 ± 0.085
MetTyr: 0.742 ± 0.109
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.979 ± 0.345
AsnCys: 0.536 ± 0.1
AsnAsp: 2.412 ± 0.241
AsnGlu: 2.289 ± 0.212
AsnPhe: 1.794 ± 0.223
AsnGly: 3.732 ± 0.332
AsnHis: 0.701 ± 0.118
AsnIle: 2.103 ± 0.247
AsnLys: 2.021 ± 0.204
AsnLeu: 3.072 ± 0.248
AsnMet: 0.887 ± 0.127
AsnAsn: 1.402 ± 0.185
AsnPro: 2.722 ± 0.232
AsnGln: 1.443 ± 0.186
AsnArg: 2.186 ± 0.221
AsnSer: 2.309 ± 0.224
AsnThr: 2.083 ± 0.245
AsnVal: 2.928 ± 0.262
AsnTrp: 0.68 ± 0.102
AsnTyr: 1.258 ± 0.185
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.629 ± 0.324
ProCys: 0.454 ± 0.106
ProAsp: 2.412 ± 0.23
ProGlu: 3.237 ± 0.279
ProPhe: 1.773 ± 0.174
ProGly: 3.629 ± 0.291
ProHis: 0.598 ± 0.11
ProIle: 1.526 ± 0.197
ProLys: 2.907 ± 0.32
ProLeu: 2.495 ± 0.233
ProMet: 1.01 ± 0.147
ProAsn: 1.856 ± 0.209
ProPro: 1.835 ± 0.188
ProGln: 1.381 ± 0.17
ProArg: 2.206 ± 0.248
ProSer: 3.278 ± 0.269
ProThr: 2.598 ± 0.23
ProVal: 2.907 ± 0.258
ProTrp: 0.887 ± 0.145
ProTyr: 1.732 ± 0.203
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.299 ± 0.285
GlnCys: 0.495 ± 0.096
GlnAsp: 1.732 ± 0.177
GlnGlu: 2.577 ± 0.285
GlnPhe: 1.464 ± 0.167
GlnGly: 2.371 ± 0.186
GlnHis: 0.66 ± 0.117
GlnIle: 1.897 ± 0.176
GlnLys: 1.979 ± 0.218
GlnLeu: 2.866 ± 0.224
GlnMet: 1.113 ± 0.147
GlnAsn: 1.34 ± 0.171
GlnPro: 1.505 ± 0.178
GlnGln: 1.175 ± 0.155
GlnArg: 1.753 ± 0.201
GlnSer: 2.021 ± 0.178
GlnThr: 1.918 ± 0.205
GlnVal: 2.722 ± 0.204
GlnTrp: 0.66 ± 0.118
GlnTyr: 0.948 ± 0.153
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.34 ± 0.36
ArgCys: 0.68 ± 0.124
ArgAsp: 3.278 ± 0.317
ArgGlu: 3.794 ± 0.314
ArgPhe: 2.165 ± 0.205
ArgGly: 3.34 ± 0.307
ArgHis: 0.969 ± 0.157
ArgIle: 3.753 ± 0.255
ArgLys: 2.99 ± 0.254
ArgLeu: 4.33 ± 0.296
ArgMet: 1.588 ± 0.203
ArgAsn: 1.979 ± 0.212
ArgPro: 1.918 ± 0.236
ArgGln: 1.876 ± 0.215
ArgArg: 2.887 ± 0.269
ArgSer: 3.175 ± 0.265
ArgThr: 2.639 ± 0.219
ArgVal: 4.598 ± 0.318
ArgTrp: 0.887 ± 0.137
ArgTyr: 1.979 ± 0.211
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.588 ± 0.364
SerCys: 0.495 ± 0.122
SerAsp: 3.588 ± 0.29
SerGlu: 3.485 ± 0.236
SerPhe: 2.536 ± 0.226
SerGly: 5.052 ± 0.375
SerHis: 0.907 ± 0.143
SerIle: 3.938 ± 0.329
SerLys: 3.34 ± 0.254
SerLeu: 4.701 ± 0.357
SerMet: 1.65 ± 0.189
SerAsn: 2.722 ± 0.197
SerPro: 2.742 ± 0.27
SerGln: 2.289 ± 0.208
SerArg: 3.217 ± 0.273
SerSer: 4.021 ± 0.32
SerThr: 3.629 ± 0.245
SerVal: 4.392 ± 0.284
SerTrp: 1.175 ± 0.165
SerTyr: 2.206 ± 0.225
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.34 ± 0.363
ThrCys: 0.598 ± 0.113
ThrAsp: 2.804 ± 0.261
ThrGlu: 2.804 ± 0.258
ThrPhe: 2.701 ± 0.254
ThrGly: 5.505 ± 0.397
ThrHis: 0.969 ± 0.14
ThrIle: 2.845 ± 0.256
ThrLys: 3.031 ± 0.25
ThrLeu: 4.516 ± 0.27
ThrMet: 1.361 ± 0.165
ThrAsn: 2.206 ± 0.295
ThrPro: 3.196 ± 0.232
ThrGln: 1.897 ± 0.228
ThrArg: 3.031 ± 0.222
ThrSer: 3.691 ± 0.334
ThrThr: 3.794 ± 0.34
ThrVal: 5.175 ± 0.327
ThrTrp: 1.031 ± 0.145
ThrTyr: 2.021 ± 0.204
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.186 ± 0.368
ValCys: 0.742 ± 0.124
ValAsp: 5.732 ± 0.402
ValGlu: 4.866 ± 0.301
ValPhe: 3.278 ± 0.282
ValGly: 4.784 ± 0.337
ValHis: 1.278 ± 0.157
ValIle: 3.979 ± 0.298
ValLys: 4.289 ± 0.282
ValLeu: 5.505 ± 0.306
ValMet: 1.856 ± 0.186
ValAsn: 2.969 ± 0.278
ValPro: 3.629 ± 0.272
ValGln: 2.763 ± 0.273
ValArg: 4.103 ± 0.267
ValSer: 4.845 ± 0.311
ValThr: 4.206 ± 0.407
ValVal: 6.144 ± 0.348
ValTrp: 0.99 ± 0.125
ValTyr: 2.227 ± 0.199
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.196 ± 0.195
TrpCys: 0.247 ± 0.072
TrpAsp: 1.134 ± 0.151
TrpGlu: 1.031 ± 0.14
TrpPhe: 0.763 ± 0.112
TrpGly: 1.072 ± 0.133
TrpHis: 0.33 ± 0.097
TrpIle: 0.701 ± 0.118
TrpLys: 1.155 ± 0.137
TrpLeu: 1.34 ± 0.188
TrpMet: 0.474 ± 0.107
TrpAsn: 0.804 ± 0.135
TrpPro: 0.392 ± 0.09
TrpGln: 0.598 ± 0.115
TrpArg: 0.969 ± 0.141
TrpSer: 0.722 ± 0.115
TrpThr: 1.072 ± 0.135
TrpVal: 1.237 ± 0.162
TrpTrp: 0.227 ± 0.075
TrpTyr: 0.495 ± 0.093
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.866 ± 0.24
TyrCys: 0.474 ± 0.083
TyrAsp: 2.309 ± 0.231
TyrGlu: 2.103 ± 0.193
TyrPhe: 1.67 ± 0.208
TyrGly: 2.309 ± 0.249
TyrHis: 0.619 ± 0.103
TyrIle: 1.814 ± 0.226
TyrLys: 2.165 ± 0.224
TyrLeu: 2.701 ± 0.191
TyrMet: 0.969 ± 0.153
TyrAsn: 1.65 ± 0.173
TyrPro: 1.361 ± 0.181
TyrGln: 0.969 ± 0.149
TyrArg: 1.65 ± 0.189
TyrSer: 2.351 ± 0.228
TyrThr: 2.124 ± 0.235
TyrVal: 2.887 ± 0.269
TyrTrp: 0.619 ± 0.119
TyrTyr: 1.175 ± 0.15
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 253 proteins (48500 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
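A protein-level bootstrap of this kind could look roughly like the sketch below. It is an illustration only: the function names are hypothetical, sequences use one-letter codes, and it assumes that the reported error is the standard deviation of the frequency over the resamples, which the page does not state explicitly.

```python
import random
import statistics


def dipeptide_freq(proteins, dipeptide):
    """Per-mille frequency of one dipeptide over a list of sequences."""
    count = 0
    total = 0
    for seq in proteins:
        n_pairs = max(len(seq) - 1, 0)
        total += n_pairs
        count += sum(1 for i in range(n_pairs) if seq[i:i + 2] == dipeptide)
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Resample whole proteins with replacement and return the spread.

    Assumption: the error is the sample standard deviation of the
    frequency across the resampled proteomes.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, dipeptide))
    return statistics.stdev(estimates)


# Toy input, not taken from the phage proteome.
toy = ["MAAKL", "AALV", "MKKAA", "GAVLA"]
err_aa = bootstrap_error(toy, "AA")
err_xx = bootstrap_error(toy, "XX")
```

A dipeptide that never occurs has zero frequency in every resample, so its estimated error is 0.0, in line with the `0.0 ± 0.0` entries for Xaa in the table.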

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski