Amino acid dipeptide frequency for Phaeobacter phage MD18

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
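A minimal sketch of how such frequencies could be computed and compared with the uniform expectation. It assumes one-letter protein sequences as input and normalisation by the total number of overlapping residue pairs; the database may normalise differently. The names `dipeptide_per_mille` and `classify` are hypothetical, introduced only for illustration.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes
UNIFORM_PER_MILLE = 1000.0 / 400      # 2.5 per mille if all 400 dipeptides were equally likely


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for the 400 standard dipeptides."""
    counts = Counter()
    for seq in sequences:
        # Slide a window of two residues within each protein;
        # pairs never span two different proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())  # assumption: normalise by all observed pairs
    if total == 0:
        raise ValueError("no dipeptides found in the input sequences")
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(AMINO_ACIDS, repeat=2)
    }


def classify(freq, expected=UNIFORM_PER_MILLE):
    """Label a per mille frequency relative to the uniform expectation."""
    if freq > expected:
        return "over"
    if freq < expected:
        return "under"
    return "expected"


# Toy input, not real proteome data.
freqs = dipeptide_per_mille(["MKTAYIAKQR", "AAAG"])
print(classify(freqs["AA"]))  # prints "over"
```

Pairs containing non-standard residues are counted in the total but not reported, so the 400 reported values sum to 1000‰ only when the input contains standard residues alone.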

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
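As a small illustration of the unit conversion, the sketch below turns a per mille value from the table into a fraction and a percentage. The function names are hypothetical.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1 per mille = 0.1%)."""
    return value * 0.1


# AlaAla is listed at 11.719 per mille in the table.
print(per_mille_to_fraction(11.719))
print(per_mille_to_percent(11.719))
```

For AlaAla this gives a fraction of about 0.011719, or roughly 1.17%.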

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 11.719 ± 1.111
AlaCys: 1.04 ± 0.168
AlaAsp: 6.19 ± 0.372
AlaGlu: 7.632 ± 0.45
AlaPhe: 3.757 ± 0.296
AlaGly: 6.639 ± 0.498
AlaHis: 2.174 ± 0.206
AlaIle: 4.277 ± 0.356
AlaLys: 4.749 ± 0.407
AlaLeu: 7.75 ± 0.474
AlaMet: 4.371 ± 0.403
AlaAsn: 3.45 ± 0.269
AlaPro: 3.568 ± 0.383
AlaGln: 3.355 ± 0.362
AlaArg: 6.261 ± 0.385
AlaSer: 6.521 ± 0.488
AlaThr: 5.387 ± 0.385
AlaVal: 6.592 ± 0.447
AlaTrp: 1.276 ± 0.179
AlaTyr: 3.001 ± 0.289
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.803 ± 0.127
CysCys: 0.189 ± 0.081
CysAsp: 0.732 ± 0.118
CysGlu: 0.992 ± 0.157
CysPhe: 0.307 ± 0.083
CysGly: 0.898 ± 0.179
CysHis: 0.331 ± 0.085
CysIle: 0.331 ± 0.093
CysLys: 0.402 ± 0.1
CysLeu: 0.685 ± 0.14
CysMet: 0.284 ± 0.086
CysAsn: 0.284 ± 0.073
CysPro: 0.851 ± 0.168
CysGln: 0.213 ± 0.073
CysArg: 0.78 ± 0.135
CysSer: 0.709 ± 0.131
CysThr: 0.614 ± 0.119
CysVal: 0.473 ± 0.1
CysTrp: 0.142 ± 0.078
CysTyr: 0.307 ± 0.084
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.608 ± 0.379
AspCys: 0.78 ± 0.131
AspAsp: 4.253 ± 0.38
AspGlu: 4.584 ± 0.359
AspPhe: 2.457 ± 0.25
AspGly: 6.568 ± 0.384
AspHis: 1.796 ± 0.199
AspIle: 3.355 ± 0.261
AspLys: 2.315 ± 0.235
AspLeu: 5.151 ± 0.355
AspMet: 2.079 ± 0.201
AspAsn: 2.079 ± 0.197
AspPro: 3.308 ± 0.304
AspGln: 1.748 ± 0.22
AspArg: 4.347 ± 0.291
AspSer: 2.646 ± 0.255
AspThr: 3.686 ± 0.247
AspVal: 5.174 ± 0.449
AspTrp: 1.678 ± 0.211
AspTyr: 2.197 ± 0.244
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.962 ± 0.517
GluCys: 0.685 ± 0.151
GluAsp: 5.009 ± 0.439
GluGlu: 6.19 ± 0.43
GluPhe: 3.237 ± 0.245
GluGly: 6.167 ± 0.366
GluHis: 1.748 ± 0.207
GluIle: 4.347 ± 0.303
GluLys: 3.45 ± 0.301
GluLeu: 6.592 ± 0.42
GluMet: 2.292 ± 0.204
GluAsn: 2.575 ± 0.234
GluPro: 2.575 ± 0.301
GluGln: 2.41 ± 0.256
GluArg: 4.466 ± 0.335
GluSer: 2.788 ± 0.26
GluThr: 3.969 ± 0.28
GluVal: 5.482 ± 0.368
GluTrp: 1.465 ± 0.163
GluTyr: 1.654 ± 0.231
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.568 ± 0.247
PheCys: 0.473 ± 0.102
PheAsp: 3.686 ± 0.264
PheGlu: 2.953 ± 0.238
PhePhe: 1.559 ± 0.199
PheGly: 2.575 ± 0.276
PheHis: 0.662 ± 0.125
PheIle: 1.937 ± 0.25
PheLys: 1.583 ± 0.197
PheLeu: 2.315 ± 0.197
PheMet: 0.945 ± 0.143
PheAsn: 1.441 ± 0.167
PhePro: 1.89 ± 0.191
PheGln: 0.851 ± 0.117
PheArg: 3.001 ± 0.268
PheSer: 2.552 ± 0.277
PheThr: 2.504 ± 0.289
PheVal: 2.339 ± 0.221
PheTrp: 0.638 ± 0.12
PheTyr: 1.205 ± 0.19
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.49 ± 0.554
GlyCys: 0.709 ± 0.157
GlyAsp: 5.293 ± 0.348
GlyGlu: 5.482 ± 0.353
GlyPhe: 2.977 ± 0.281
GlyGly: 6.805 ± 0.622
GlyHis: 1.678 ± 0.203
GlyIle: 3.331 ± 0.279
GlyLys: 4.158 ± 0.269
GlyLeu: 5.103 ± 0.362
GlyMet: 2.221 ± 0.202
GlyAsn: 2.41 ± 0.266
GlyPro: 2.528 ± 0.271
GlyGln: 2.67 ± 0.236
GlyArg: 4.017 ± 0.317
GlySer: 5.103 ± 0.446
GlyThr: 5.552 ± 0.407
GlyVal: 6.356 ± 0.378
GlyTrp: 1.725 ± 0.195
GlyTyr: 2.835 ± 0.252
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.245 ± 0.227
HisCys: 0.331 ± 0.087
HisAsp: 1.418 ± 0.187
HisGlu: 1.347 ± 0.169
HisPhe: 1.087 ± 0.178
HisGly: 1.796 ± 0.182
HisHis: 0.52 ± 0.108
HisIle: 1.323 ± 0.179
HisLys: 0.921 ± 0.161
HisLeu: 1.772 ± 0.239
HisMet: 0.662 ± 0.116
HisAsn: 0.567 ± 0.113
HisPro: 1.063 ± 0.163
HisGln: 0.709 ± 0.128
HisArg: 1.654 ± 0.22
HisSer: 1.441 ± 0.145
HisThr: 0.992 ± 0.152
HisVal: 1.512 ± 0.205
HisTrp: 0.52 ± 0.122
HisTyr: 0.709 ± 0.122
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.82 ± 0.418
IleCys: 0.638 ± 0.139
IleAsp: 3.922 ± 0.341
IleGlu: 4.56 ± 0.318
IlePhe: 1.394 ± 0.168
IleGly: 3.78 ± 0.25
IleHis: 1.087 ± 0.134
IleIle: 2.386 ± 0.24
IleLys: 2.41 ± 0.201
IleLeu: 3.237 ± 0.251
IleMet: 0.992 ± 0.137
IleAsn: 1.748 ± 0.187
IlePro: 2.646 ± 0.234
IleGln: 1.607 ± 0.207
IleArg: 3.095 ± 0.267
IleSer: 2.764 ± 0.244
IleThr: 3.072 ± 0.264
IleVal: 2.977 ± 0.217
IleTrp: 0.496 ± 0.118
IleTyr: 1.276 ± 0.163
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.702 ± 0.313
LysCys: 0.425 ± 0.101
LysAsp: 2.93 ± 0.299
LysGlu: 3.662 ± 0.269
LysPhe: 1.607 ± 0.214
LysGly: 3.308 ± 0.3
LysHis: 1.205 ± 0.176
LysIle: 1.89 ± 0.192
LysLys: 2.552 ± 0.288
LysLeu: 3.591 ± 0.282
LysMet: 1.441 ± 0.17
LysAsn: 1.63 ± 0.21
LysPro: 1.867 ± 0.227
LysGln: 1.418 ± 0.201
LysArg: 3.591 ± 0.317
LysSer: 2.197 ± 0.234
LysThr: 3.19 ± 0.292
LysVal: 3.851 ± 0.308
LysTrp: 0.756 ± 0.135
LysTyr: 1.181 ± 0.149
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.592 ± 0.489
LeuCys: 0.78 ± 0.137
LeuAsp: 5.033 ± 0.4
LeuGlu: 5.458 ± 0.376
LeuPhe: 2.717 ± 0.268
LeuGly: 5.151 ± 0.387
LeuHis: 1.678 ± 0.214
LeuIle: 3.922 ± 0.291
LeuLys: 2.883 ± 0.317
LeuLeu: 4.867 ± 0.391
LeuMet: 2.008 ± 0.168
LeuAsn: 2.812 ± 0.238
LeuPro: 3.355 ± 0.268
LeuGln: 2.032 ± 0.227
LeuArg: 5.718 ± 0.4
LeuSer: 5.363 ± 0.345
LeuThr: 4.56 ± 0.314
LeuVal: 4.962 ± 0.329
LeuTrp: 1.04 ± 0.143
LeuTyr: 1.937 ± 0.214
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.284 ± 0.251
MetCys: 0.236 ± 0.061
MetAsp: 1.89 ± 0.208
MetGlu: 2.245 ± 0.215
MetPhe: 1.229 ± 0.16
MetGly: 2.315 ± 0.243
MetHis: 0.614 ± 0.098
MetIle: 1.441 ± 0.2
MetLys: 1.512 ± 0.196
MetLeu: 1.654 ± 0.176
MetMet: 0.945 ± 0.131
MetAsn: 0.921 ± 0.13
MetPro: 1.583 ± 0.159
MetGln: 0.78 ± 0.138
MetArg: 2.15 ± 0.237
MetSer: 2.197 ± 0.229
MetThr: 1.914 ± 0.196
MetVal: 1.725 ± 0.184
MetTrp: 0.354 ± 0.077
MetTyr: 0.685 ± 0.123
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.993 ± 0.34
AsnCys: 0.354 ± 0.092
AsnAsp: 1.796 ± 0.177
AsnGlu: 2.339 ± 0.228
AsnPhe: 1.276 ± 0.16
AsnGly: 2.906 ± 0.308
AsnHis: 0.709 ± 0.11
AsnIle: 1.607 ± 0.193
AsnLys: 1.299 ± 0.158
AsnLeu: 2.363 ± 0.216
AsnMet: 1.016 ± 0.168
AsnAsn: 1.323 ± 0.182
AsnPro: 2.315 ± 0.202
AsnGln: 0.945 ± 0.165
AsnArg: 2.174 ± 0.216
AsnSer: 1.961 ± 0.213
AsnThr: 1.772 ± 0.209
AsnVal: 2.528 ± 0.244
AsnTrp: 0.709 ± 0.12
AsnTyr: 0.992 ± 0.145
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.922 ± 0.329
ProCys: 0.425 ± 0.103
ProAsp: 3.568 ± 0.309
ProGlu: 3.875 ± 0.281
ProPhe: 2.032 ± 0.22
ProGly: 3.52 ± 0.275
ProHis: 0.969 ± 0.15
ProIle: 2.245 ± 0.193
ProLys: 2.008 ± 0.224
ProLeu: 2.457 ± 0.264
ProMet: 1.299 ± 0.166
ProAsn: 1.559 ± 0.173
ProPro: 1.701 ± 0.229
ProGln: 0.851 ± 0.135
ProArg: 2.575 ± 0.302
ProSer: 2.41 ± 0.29
ProThr: 3.119 ± 0.272
ProVal: 4.277 ± 0.353
ProTrp: 0.567 ± 0.111
ProTyr: 1.347 ± 0.19
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.095 ± 0.392
GlnCys: 0.095 ± 0.049
GlnAsp: 1.299 ± 0.19
GlnGlu: 1.465 ± 0.181
GlnPhe: 1.512 ± 0.202
GlnGly: 2.15 ± 0.205
GlnHis: 0.685 ± 0.152
GlnIle: 1.772 ± 0.243
GlnLys: 1.725 ± 0.203
GlnLeu: 1.914 ± 0.213
GlnMet: 0.969 ± 0.155
GlnAsn: 0.969 ± 0.154
GlnPro: 1.418 ± 0.251
GlnGln: 1.229 ± 0.242
GlnArg: 2.339 ± 0.241
GlnSer: 1.748 ± 0.206
GlnThr: 1.678 ± 0.231
GlnVal: 1.819 ± 0.224
GlnTrp: 0.591 ± 0.135
GlnTyr: 1.016 ± 0.178
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.592 ± 0.471
ArgCys: 0.591 ± 0.13
ArgAsp: 4.253 ± 0.324
ArgGlu: 4.702 ± 0.353
ArgPhe: 2.694 ± 0.253
ArgGly: 5.269 ± 0.443
ArgHis: 1.772 ± 0.283
ArgIle: 3.379 ± 0.321
ArgLys: 3.78 ± 0.29
ArgLeu: 5.198 ± 0.362
ArgMet: 1.819 ± 0.204
ArgAsn: 2.315 ± 0.21
ArgPro: 2.552 ± 0.289
ArgGln: 2.032 ± 0.201
ArgArg: 4.607 ± 0.318
ArgSer: 3.402 ± 0.253
ArgThr: 3.142 ± 0.229
ArgVal: 4.678 ± 0.418
ArgTrp: 1.158 ± 0.186
ArgTyr: 1.796 ± 0.225
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.812 ± 0.562
SerCys: 0.496 ± 0.114
SerAsp: 4.064 ± 0.292
SerGlu: 4.158 ± 0.3
SerPhe: 2.339 ± 0.242
SerGly: 5.552 ± 0.401
SerHis: 1.158 ± 0.139
SerIle: 2.741 ± 0.276
SerLys: 2.623 ± 0.25
SerLeu: 4.277 ± 0.351
SerMet: 1.583 ± 0.2
SerAsn: 2.174 ± 0.235
SerPro: 2.504 ± 0.268
SerGln: 1.725 ± 0.25
SerArg: 3.308 ± 0.276
SerSer: 3.52 ± 0.384
SerThr: 2.953 ± 0.343
SerVal: 4.347 ± 0.349
SerTrp: 1.016 ± 0.163
SerTyr: 1.748 ± 0.199
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.552 ± 0.418
ThrCys: 0.732 ± 0.133
ThrAsp: 3.733 ± 0.293
ThrGlu: 3.804 ± 0.312
ThrPhe: 2.552 ± 0.209
ThrGly: 5.08 ± 0.429
ThrHis: 1.063 ± 0.155
ThrIle: 3.662 ± 0.36
ThrLys: 2.599 ± 0.263
ThrLeu: 4.796 ± 0.344
ThrMet: 1.748 ± 0.185
ThrAsn: 1.843 ± 0.231
ThrPro: 3.639 ± 0.286
ThrGln: 1.583 ± 0.208
ThrArg: 3.331 ± 0.283
ThrSer: 3.402 ± 0.289
ThrThr: 2.835 ± 0.275
ThrVal: 4.395 ± 0.393
ThrTrp: 0.614 ± 0.136
ThrTyr: 1.678 ± 0.208
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.671 ± 0.433
ValCys: 0.685 ± 0.148
ValAsp: 4.702 ± 0.318
ValGlu: 6.545 ± 0.441
ValPhe: 2.764 ± 0.272
ValGly: 4.395 ± 0.352
ValHis: 1.418 ± 0.178
ValIle: 3.095 ± 0.33
ValLys: 3.497 ± 0.294
ValLeu: 5.718 ± 0.38
ValMet: 1.937 ± 0.203
ValAsn: 2.835 ± 0.251
ValPro: 3.544 ± 0.281
ValGln: 2.126 ± 0.215
ValArg: 5.08 ± 0.347
ValSer: 4.938 ± 0.353
ValThr: 4.725 ± 0.375
ValVal: 4.867 ± 0.422
ValTrp: 1.181 ± 0.187
ValTyr: 2.315 ± 0.244
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.512 ± 0.214
TrpCys: 0.213 ± 0.081
TrpAsp: 1.583 ± 0.226
TrpGlu: 1.158 ± 0.165
TrpPhe: 0.473 ± 0.108
TrpGly: 1.063 ± 0.158
TrpHis: 0.52 ± 0.115
TrpIle: 0.78 ± 0.15
TrpLys: 1.158 ± 0.169
TrpLeu: 1.299 ± 0.185
TrpMet: 0.402 ± 0.087
TrpAsn: 0.591 ± 0.124
TrpPro: 0.402 ± 0.092
TrpGln: 0.402 ± 0.096
TrpArg: 1.063 ± 0.152
TrpSer: 0.969 ± 0.148
TrpThr: 1.016 ± 0.188
TrpVal: 1.37 ± 0.2
TrpTrp: 0.284 ± 0.079
TrpTyr: 0.449 ± 0.102
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.764 ± 0.262
TyrCys: 0.449 ± 0.096
TyrAsp: 2.457 ± 0.183
TyrGlu: 1.914 ± 0.212
TyrPhe: 0.709 ± 0.114
TyrGly: 2.504 ± 0.246
TyrHis: 0.803 ± 0.148
TyrIle: 1.087 ± 0.178
TyrLys: 1.394 ± 0.198
TyrLeu: 2.126 ± 0.243
TyrMet: 0.638 ± 0.118
TyrAsn: 0.851 ± 0.134
TyrPro: 1.418 ± 0.2
TyrGln: 0.803 ± 0.125
TyrArg: 2.174 ± 0.235
TyrSer: 1.465 ± 0.195
TyrThr: 1.961 ± 0.24
TyrVal: 2.315 ± 0.257
TyrTrp: 0.496 ± 0.107
TyrTyr: 0.803 ± 0.139
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 229 proteins (42325 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
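A minimal sketch of a protein-level bootstrap of this kind, under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the error is taken as the standard deviation across resamples. The source gives only the number of resamples (100) and the resampling level; the estimator, the normalisation and the function names here are assumptions for illustration.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_dipeptide(sequences, dipeptide, n_resamples=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille."""
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size fixed.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the reported error is the spread of the resampled estimates.
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input, not real proteome data.
mean, error = bootstrap_dipeptide(["MKTAYIAKQR", "AAAG", "GAAK"], "AA")
print(f"AA: {mean:.3f} ± {error:.3f}")
```

Resampling proteins rather than individual residue pairs keeps the pairs within each protein together, so the error reflects variation between proteins.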

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski