Amino acid dipeptide frequency for Pseudoalteromonas phage H101

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
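As a rough illustration of what such a calculation involves, the Python sketch below counts overlapping pairs of adjacent residues in a set of sequences. It assumes one-letter amino acid codes and normalizes by the total number of pairs; the function names are hypothetical, and the exact normalization used for the table on this page is not stated here.

```python
from collections import Counter


def dipeptide_counts(proteins):
    """Count overlapping adjacent residue pairs within each protein."""
    counts = Counter()
    for seq in proteins:
        # Each sequence is scanned separately, so no pair spans two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    return counts


def dipeptide_frequencies(proteins):
    """Return each observed pair's share of all pairs (assumed normalization)."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}


# Toy input, not real phage proteins.
toy = ["MKKLA", "AKK"]
freqs = dipeptide_frequencies(toy)
```

In the toy input, the pair KK occurs twice among six pairs in total, so its frequency is 1/3.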

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with all 20 amino acids equally likely, each dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
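The arithmetic behind the expected value can be sketched in a few lines of Python. The comparison rule below is an assumption made for illustration: it simply checks a value against the uniform expectation of 1/400, and the page does not state the exact threshold behind its red and blue marking. The function name is hypothetical; the two example values are taken from the table.

```python
# Uniform expectation: 20 x 20 = 400 equally likely dipeptides.
N_DIPEPTIDES = 20 * 20
EXPECTED_PER_MILLE = 1000 / N_DIPEPTIDES  # 2.5 per mille, i.e. 0.25%


def representation(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Classify a frequency relative to the uniform expectation (assumed rule)."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


leu_leu = representation(7.534)  # LeuLeu value from the table
pro_gly = representation(0.08)   # ProGly value from the table
```

Under this simple rule LeuLeu (7.534‰) counts as overrepresented and ProGly (0.08‰) as underrepresented. A stricter analysis would compare each dipeptide with the product of its two single amino acid frequencies rather than with a uniform 2.5‰.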

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions (for example, 2.5‰ = 0.0025 = 0.25%).

For more information, see the sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide, columns the second. Each cell shows the frequency ± error in ‰.

| First / Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 4.112 ± 0.544 | 0.451 ± 0.118 | 3.82 ± 0.36 | 4.404 ± 0.42 | 1.83 ± 0.244 | 3.634 ± 0.389 | 0.875 ± 0.177 | 3.634 ± 0.309 | 4.059 ± 0.407 | 4.695 ± 0.409 | 1.406 ± 0.207 | 2.281 ± 0.257 | 1.353 ± 0.198 | 2.334 ± 0.288 | 2.228 ± 0.233 | 3.183 ± 0.341 | 3.183 ± 0.351 | 2.971 ± 0.299 | 0.637 ± 0.111 | 2.547 ± 0.247 | 0.0 ± 0.0 |
| Cys | 0.557 ± 0.122 | 0.292 ± 0.087 | 0.849 ± 0.145 | 1.088 ± 0.163 | 0.504 ± 0.132 | 1.406 ± 0.173 | 0.371 ± 0.096 | 1.088 ± 0.166 | 1.406 ± 0.22 | 1.114 ± 0.18 | 0.265 ± 0.084 | 0.69 ± 0.133 | 0.557 ± 0.168 | 0.477 ± 0.092 | 0.61 ± 0.157 | 1.035 ± 0.205 | 0.61 ± 0.11 | 0.849 ± 0.153 | 0.133 ± 0.073 | 0.928 ± 0.182 | 0.0 ± 0.0 |
| Asp | 3.422 ± 0.381 | 1.008 ± 0.177 | 3.581 ± 0.341 | 4.908 ± 0.387 | 3.236 ± 0.324 | 4.297 ± 0.424 | 1.22 ± 0.198 | 4.748 ± 0.417 | 5.491 ± 0.377 | 5.359 ± 0.382 | 1.883 ± 0.229 | 4.828 ± 0.339 | 1.432 ± 0.164 | 1.804 ± 0.228 | 2.255 ± 0.232 | 4.35 ± 0.383 | 3.953 ± 0.342 | 4.51 ± 0.3 | 1.22 ± 0.183 | 4.032 ± 0.32 | 0.0 ± 0.0 |
| Glu | 4.138 ± 0.465 | 0.955 ± 0.15 | 6.446 ± 0.474 | 6.871 ± 0.525 | 2.918 ± 0.26 | 5.571 ± 0.367 | 1.432 ± 0.209 | 4.165 ± 0.286 | 4.43 ± 0.414 | 6.711 ± 0.517 | 2.202 ± 0.276 | 3.793 ± 0.34 | 1.432 ± 0.208 | 3.157 ± 0.346 | 2.626 ± 0.268 | 4.536 ± 0.339 | 3.82 ± 0.307 | 5.942 ± 0.376 | 1.804 ± 0.219 | 3.767 ± 0.355 | 0.0 ± 0.0 |
| Phe | 2.069 ± 0.199 | 0.61 ± 0.119 | 3.024 ± 0.268 | 3.236 ± 0.264 | 1.618 ± 0.209 | 3.024 ± 0.308 | 0.584 ± 0.133 | 2.679 ± 0.271 | 3.608 ± 0.326 | 2.573 ± 0.27 | 1.194 ± 0.221 | 2.706 ± 0.267 | 1.141 ± 0.188 | 1.008 ± 0.156 | 1.459 ± 0.183 | 2.891 ± 0.288 | 2.573 ± 0.244 | 2.441 ± 0.251 | 0.531 ± 0.139 | 1.936 ± 0.21 | 0.0 ± 0.0 |
| Gly | 3.13 ± 0.398 | 1.194 ± 0.182 | 4.112 ± 0.302 | 5.04 ± 0.376 | 2.494 ± 0.238 | 4.616 ± 0.516 | 1.141 ± 0.181 | 4.112 ± 0.33 | 5.624 ± 0.397 | 5.014 ± 0.354 | 1.459 ± 0.221 | 3.846 ± 0.285 | 0.239 ± 0.071 | 2.175 ± 0.29 | 2.387 ± 0.283 | 4.908 ± 0.494 | 3.714 ± 0.373 | 5.597 ± 0.388 | 1.061 ± 0.182 | 4.138 ± 0.357 | 0.0 ± 0.0 |
| His | 0.716 ± 0.151 | 0.531 ± 0.117 | 1.061 ± 0.185 | 1.114 ± 0.16 | 0.796 ± 0.148 | 1.141 ± 0.16 | 0.371 ± 0.105 | 1.353 ± 0.172 | 1.486 ± 0.172 | 1.618 ± 0.198 | 0.637 ± 0.119 | 1.247 ± 0.21 | 0.849 ± 0.132 | 0.398 ± 0.097 | 1.008 ± 0.192 | 1.035 ± 0.15 | 1.061 ± 0.193 | 0.796 ± 0.138 | 0.159 ± 0.073 | 0.663 ± 0.13 | 0.0 ± 0.0 |
| Ile | 3.846 ± 0.38 | 0.822 ± 0.138 | 4.669 ± 0.398 | 4.669 ± 0.371 | 1.99 ± 0.248 | 2.971 ± 0.291 | 0.982 ± 0.162 | 3.581 ± 0.326 | 5.305 ± 0.462 | 3.873 ± 0.36 | 1.114 ± 0.151 | 3.767 ± 0.345 | 1.963 ± 0.252 | 1.963 ± 0.215 | 2.467 ± 0.251 | 4.404 ± 0.355 | 4.881 ± 0.346 | 3.793 ± 0.302 | 0.822 ± 0.138 | 1.963 ± 0.228 | 0.0 ± 0.0 |
| Lys | 4.695 ± 0.46 | 0.902 ± 0.198 | 5.836 ± 0.399 | 6.552 ± 0.52 | 2.971 ± 0.336 | 5.385 ± 0.436 | 1.645 ± 0.224 | 3.687 ± 0.379 | 5.093 ± 0.465 | 6.048 ± 0.343 | 1.777 ± 0.21 | 2.918 ± 0.235 | 2.6 ± 0.324 | 2.865 ± 0.348 | 3.74 ± 0.451 | 5.067 ± 0.389 | 4.377 ± 0.333 | 5.889 ± 0.373 | 0.716 ± 0.146 | 3.289 ± 0.294 | 0.0 ± 0.0 |
| Leu | 4.987 ± 0.389 | 1.512 ± 0.193 | 5.863 ± 0.429 | 7.03 ± 0.453 | 3.13 ± 0.301 | 5.146 ± 0.4 | 1.406 ± 0.215 | 4.563 ± 0.33 | 6.313 ± 0.467 | 7.534 ± 0.607 | 2.387 ± 0.268 | 4.165 ± 0.317 | 2.626 ± 0.286 | 3.289 ± 0.272 | 3.289 ± 0.285 | 6.287 ± 0.401 | 5.305 ± 0.325 | 5.146 ± 0.306 | 0.875 ± 0.147 | 3.502 ± 0.309 | 0.0 ± 0.0 |
| Met | 1.141 ± 0.163 | 0.239 ± 0.077 | 1.141 ± 0.168 | 1.777 ± 0.236 | 1.141 ± 0.132 | 1.141 ± 0.147 | 0.424 ± 0.094 | 1.883 ± 0.219 | 2.52 ± 0.335 | 2.494 ± 0.276 | 0.424 ± 0.102 | 1.379 ± 0.2 | 0.584 ± 0.122 | 1.167 ± 0.189 | 0.796 ± 0.156 | 1.698 ± 0.199 | 1.353 ± 0.154 | 1.539 ± 0.214 | 0.292 ± 0.079 | 1.008 ± 0.169 | 0.0 ± 0.0 |
| Asn | 2.334 ± 0.253 | 0.849 ± 0.173 | 2.52 ± 0.258 | 3.024 ± 0.285 | 2.096 ± 0.23 | 3.979 ± 0.401 | 1.114 ± 0.168 | 3.846 ± 0.411 | 5.093 ± 0.413 | 4.775 ± 0.339 | 1.326 ± 0.171 | 3.528 ± 0.412 | 2.361 ± 0.254 | 1.592 ± 0.232 | 2.043 ± 0.22 | 3.793 ± 0.282 | 4.138 ± 0.36 | 3.183 ± 0.307 | 0.716 ± 0.114 | 2.494 ± 0.281 | 0.0 ± 0.0 |
| Pro | 1.83 ± 0.25 | 0.584 ± 0.142 | 2.441 ± 0.275 | 2.971 ± 0.272 | 1.3 ± 0.171 | 0.08 ± 0.047 | 0.584 ± 0.1 | 1.141 ± 0.157 | 2.202 ± 0.278 | 2.653 ± 0.244 | 0.477 ± 0.098 | 1.512 ± 0.24 | 0.822 ± 0.17 | 0.796 ± 0.13 | 0.982 ± 0.165 | 2.175 ± 0.227 | 2.043 ± 0.225 | 2.52 ± 0.246 | 0.212 ± 0.069 | 1.486 ± 0.214 | 0.0 ± 0.0 |
| Gln | 1.963 ± 0.227 | 0.398 ± 0.108 | 2.281 ± 0.259 | 3.687 ± 0.378 | 1.592 ± 0.226 | 2.785 ± 0.315 | 0.504 ± 0.09 | 1.91 ± 0.227 | 1.671 ± 0.282 | 3.263 ± 0.282 | 1.008 ± 0.161 | 1.22 ± 0.188 | 0.928 ± 0.147 | 1.91 ± 0.274 | 1.751 ± 0.253 | 2.149 ± 0.264 | 1.724 ± 0.238 | 2.175 ± 0.23 | 0.716 ± 0.163 | 1.671 ± 0.23 | 0.0 ± 0.0 |
| Arg | 1.592 ± 0.22 | 0.769 ± 0.148 | 2.069 ± 0.184 | 3.183 ± 0.32 | 1.565 ± 0.205 | 2.732 ± 0.255 | 0.504 ± 0.108 | 2.414 ± 0.252 | 3.077 ± 0.341 | 3.369 ± 0.278 | 0.743 ± 0.161 | 2.122 ± 0.231 | 1.114 ± 0.15 | 1.512 ± 0.22 | 1.326 ± 0.202 | 2.626 ± 0.222 | 1.883 ± 0.202 | 3.634 ± 0.322 | 0.424 ± 0.105 | 1.751 ± 0.204 | 0.0 ± 0.0 |
| Ser | 3.634 ± 0.306 | 1.114 ± 0.143 | 5.093 ± 0.37 | 3.555 ± 0.329 | 3.183 ± 0.265 | 5.12 ± 0.463 | 1.008 ± 0.156 | 3.634 ± 0.288 | 4.616 ± 0.338 | 6.897 ± 0.487 | 1.459 ± 0.212 | 3.395 ± 0.315 | 1.857 ± 0.234 | 2.467 ± 0.275 | 2.308 ± 0.235 | 5.04 ± 0.581 | 3.82 ± 0.334 | 4.828 ± 0.438 | 1.061 ± 0.142 | 3.74 ± 0.327 | 0.0 ± 0.0 |
| Thr | 2.998 ± 0.356 | 0.61 ± 0.131 | 3.369 ± 0.367 | 3.077 ± 0.279 | 2.759 ± 0.286 | 4.748 ± 0.402 | 1.379 ± 0.194 | 3.263 ± 0.325 | 3.767 ± 0.298 | 5.412 ± 0.319 | 1.379 ± 0.167 | 3.687 ± 0.31 | 3.263 ± 0.31 | 2.043 ± 0.219 | 2.281 ± 0.265 | 4.138 ± 0.394 | 3.422 ± 0.434 | 4.006 ± 0.309 | 0.982 ± 0.172 | 3.104 ± 0.321 | 0.0 ± 0.0 |
| Val | 3.846 ± 0.304 | 0.955 ± 0.173 | 5.305 ± 0.397 | 5.571 ± 0.417 | 3.236 ± 0.303 | 4.404 ± 0.388 | 1.432 ± 0.24 | 4.589 ± 0.311 | 4.801 ± 0.314 | 4.934 ± 0.316 | 1.671 ± 0.205 | 3.979 ± 0.3 | 1.698 ± 0.22 | 2.096 ± 0.273 | 2.228 ± 0.249 | 4.536 ± 0.361 | 4.483 ± 0.332 | 5.491 ± 0.356 | 1.167 ± 0.147 | 3.422 ± 0.312 | 0.0 ± 0.0 |
| Trp | 0.584 ± 0.124 | 0.318 ± 0.092 | 1.194 ± 0.166 | 1.088 ± 0.161 | 0.716 ± 0.134 | 0.716 ± 0.157 | 0.212 ± 0.075 | 0.849 ± 0.162 | 1.194 ± 0.191 | 1.512 ± 0.184 | 0.318 ± 0.1 | 0.822 ± 0.138 | 0.212 ± 0.066 | 0.663 ± 0.14 | 0.504 ± 0.101 | 0.663 ± 0.135 | 0.61 ± 0.117 | 1.247 ± 0.176 | 0.265 ± 0.083 | 0.849 ± 0.145 | 0.0 ± 0.0 |
| Tyr | 2.043 ± 0.229 | 0.716 ± 0.141 | 2.891 ± 0.272 | 3.873 ± 0.371 | 1.83 ± 0.214 | 3.104 ± 0.292 | 0.849 ± 0.143 | 2.706 ± 0.281 | 4.297 ± 0.382 | 4.51 ± 0.333 | 0.982 ± 0.16 | 2.918 ± 0.26 | 1.777 ± 0.225 | 1.592 ± 0.192 | 2.122 ± 0.275 | 3.316 ± 0.31 | 2.732 ± 0.256 | 3.289 ± 0.287 | 0.743 ± 0.13 | 2.414 ± 0.235 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
Statistics based on 228 proteins (37698 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
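One way to read "bootstrapping at the protein level" is that whole proteins are resampled with replacement and the statistic is recomputed on each resample. The Python sketch below follows that reading. It is an illustration under stated assumptions, not the database's actual procedure: the function names are hypothetical, one-letter codes are assumed, and reporting the standard deviation across replicates as the error is an assumption.

```python
import random
import statistics


def pair_frequency(proteins, pair):
    """Per mille frequency of `pair` among all adjacent residue pairs."""
    total = 0
    hits = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Spread of the frequency across protein-level resamples (assumed: std dev)."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)


# Toy input, not real phage proteins.
toy = ["MKKLA", "AKK", "GGSKK", "MLLLA"]
error = bootstrap_error(toy, "KK")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects how much the statistic depends on which proteins happen to be in the proteome.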

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944

Contact: Lukasz P. Kozlowski