Amino acid dipepetide frequency for Pectobacterium phage DU_PP_V

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.994AlaAla: 6.994 ± 0.899
0.471AlaCys: 0.471 ± 0.138
3.696AlaAsp: 3.696 ± 0.323
4.856AlaGlu: 4.856 ± 0.495
2.319AlaPhe: 2.319 ± 0.313
4.457AlaGly: 4.457 ± 0.434
1.268AlaHis: 1.268 ± 0.194
5.146AlaIle: 5.146 ± 0.482
4.856AlaLys: 4.856 ± 0.576
6.849AlaLeu: 6.849 ± 0.573
1.703AlaMet: 1.703 ± 0.262
4.059AlaAsn: 4.059 ± 0.452
1.739AlaPro: 1.739 ± 0.24
3.08AlaGln: 3.08 ± 0.437
3.551AlaArg: 3.551 ± 0.305
4.602AlaSer: 4.602 ± 0.46
4.639AlaThr: 4.639 ± 0.625
4.566AlaVal: 4.566 ± 0.501
0.616AlaTrp: 0.616 ± 0.144
2.355AlaTyr: 2.355 ± 0.341
0.0AlaXaa: 0.0 ± 0.0
Cys
0.362CysAla: 0.362 ± 0.113
0.109CysCys: 0.109 ± 0.059
0.399CysAsp: 0.399 ± 0.108
0.544CysGlu: 0.544 ± 0.161
0.652CysPhe: 0.652 ± 0.137
0.942CysGly: 0.942 ± 0.225
0.29CysHis: 0.29 ± 0.098
0.797CysIle: 0.797 ± 0.18
0.689CysLys: 0.689 ± 0.159
0.833CysLeu: 0.833 ± 0.175
0.326CysMet: 0.326 ± 0.101
0.399CysAsn: 0.399 ± 0.119
0.507CysPro: 0.507 ± 0.181
0.435CysGln: 0.435 ± 0.142
0.507CysArg: 0.507 ± 0.137
0.616CysSer: 0.616 ± 0.133
0.652CysThr: 0.652 ± 0.158
0.833CysVal: 0.833 ± 0.184
0.181CysTrp: 0.181 ± 0.087
0.29CysTyr: 0.29 ± 0.107
0.0CysXaa: 0.0 ± 0.0
Asp
4.349AspAla: 4.349 ± 0.38
0.362AspCys: 0.362 ± 0.132
2.138AspAsp: 2.138 ± 0.328
3.805AspGlu: 3.805 ± 0.407
2.319AspPhe: 2.319 ± 0.277
3.37AspGly: 3.37 ± 0.354
0.797AspHis: 0.797 ± 0.181
4.566AspIle: 4.566 ± 0.403
3.515AspLys: 3.515 ± 0.379
5.689AspLeu: 5.689 ± 0.486
2.029AspMet: 2.029 ± 0.315
2.827AspAsn: 2.827 ± 0.384
2.319AspPro: 2.319 ± 0.257
1.594AspGln: 1.594 ± 0.238
2.754AspArg: 2.754 ± 0.327
4.131AspSer: 4.131 ± 0.414
4.457AspThr: 4.457 ± 0.379
3.588AspVal: 3.588 ± 0.431
1.087AspTrp: 1.087 ± 0.227
3.443AspTyr: 3.443 ± 0.345
0.0AspXaa: 0.0 ± 0.0
Glu
5.617GluAla: 5.617 ± 0.554
0.725GluCys: 0.725 ± 0.199
4.204GluAsp: 4.204 ± 0.374
5.436GluGlu: 5.436 ± 0.54
2.754GluPhe: 2.754 ± 0.321
3.624GluGly: 3.624 ± 0.313
1.594GluHis: 1.594 ± 0.246
4.856GluIle: 4.856 ± 0.446
4.204GluLys: 4.204 ± 0.41
7.755GluLeu: 7.755 ± 0.589
1.921GluMet: 1.921 ± 0.249
3.261GluAsn: 3.261 ± 0.41
1.232GluPro: 1.232 ± 0.204
2.79GluGln: 2.79 ± 0.472
2.355GluArg: 2.355 ± 0.253
4.602GluSer: 4.602 ± 0.453
3.261GluThr: 3.261 ± 0.366
4.965GluVal: 4.965 ± 0.457
0.797GluTrp: 0.797 ± 0.142
3.878GluTyr: 3.878 ± 0.427
0.0GluXaa: 0.0 ± 0.0
Phe
2.319PheAla: 2.319 ± 0.301
0.544PheCys: 0.544 ± 0.149
2.428PheAsp: 2.428 ± 0.315
2.682PheGlu: 2.682 ± 0.332
1.413PhePhe: 1.413 ± 0.245
2.428PheGly: 2.428 ± 0.34
1.051PheHis: 1.051 ± 0.224
3.298PheIle: 3.298 ± 0.358
2.645PheLys: 2.645 ± 0.35
3.37PheLeu: 3.37 ± 0.452
0.725PheMet: 0.725 ± 0.157
2.827PheAsn: 2.827 ± 0.322
1.341PhePro: 1.341 ± 0.238
0.906PheGln: 0.906 ± 0.184
1.486PheArg: 1.486 ± 0.269
2.935PheSer: 2.935 ± 0.379
2.935PheThr: 2.935 ± 0.379
2.718PheVal: 2.718 ± 0.327
0.435PheTrp: 0.435 ± 0.122
1.594PheTyr: 1.594 ± 0.231
0.0PheXaa: 0.0 ± 0.0
Gly
4.24GlyAla: 4.24 ± 0.406
0.978GlyCys: 0.978 ± 0.234
3.805GlyAsp: 3.805 ± 0.377
3.769GlyGlu: 3.769 ± 0.335
3.261GlyPhe: 3.261 ± 0.316
2.682GlyGly: 2.682 ± 0.288
1.087GlyHis: 1.087 ± 0.207
4.602GlyIle: 4.602 ± 0.437
5.001GlyLys: 5.001 ± 0.474
5.073GlyLeu: 5.073 ± 0.47
1.268GlyMet: 1.268 ± 0.243
3.08GlyAsn: 3.08 ± 0.348
0.87GlyPro: 0.87 ± 0.213
2.211GlyGln: 2.211 ± 0.26
2.863GlyArg: 2.863 ± 0.289
4.602GlySer: 4.602 ± 0.416
4.131GlyThr: 4.131 ± 0.511
5.037GlyVal: 5.037 ± 0.413
1.087GlyTrp: 1.087 ± 0.203
2.972GlyTyr: 2.972 ± 0.378
0.0GlyXaa: 0.0 ± 0.0
His
1.051HisAla: 1.051 ± 0.199
0.326HisCys: 0.326 ± 0.109
1.015HisAsp: 1.015 ± 0.265
0.725HisGlu: 0.725 ± 0.157
0.544HisPhe: 0.544 ± 0.168
1.015HisGly: 1.015 ± 0.19
0.254HisHis: 0.254 ± 0.097
1.413HisIle: 1.413 ± 0.289
1.268HisLys: 1.268 ± 0.199
1.631HisLeu: 1.631 ± 0.245
0.399HisMet: 0.399 ± 0.126
0.978HisAsn: 0.978 ± 0.239
0.761HisPro: 0.761 ± 0.208
0.471HisGln: 0.471 ± 0.128
0.942HisArg: 0.942 ± 0.171
1.196HisSer: 1.196 ± 0.206
0.978HisThr: 0.978 ± 0.191
1.196HisVal: 1.196 ± 0.256
0.036HisTrp: 0.036 ± 0.032
0.761HisTyr: 0.761 ± 0.156
0.0HisXaa: 0.0 ± 0.0
Ile
4.639IleAla: 4.639 ± 0.437
0.689IleCys: 0.689 ± 0.144
4.167IleAsp: 4.167 ± 0.362
4.892IleGlu: 4.892 ± 0.377
2.682IlePhe: 2.682 ± 0.332
3.334IleGly: 3.334 ± 0.332
1.051IleHis: 1.051 ± 0.244
3.624IleIle: 3.624 ± 0.352
4.131IleLys: 4.131 ± 0.527
5.689IleLeu: 5.689 ± 0.528
1.196IleMet: 1.196 ± 0.246
3.878IleAsn: 3.878 ± 0.364
3.406IlePro: 3.406 ± 0.331
2.645IleGln: 2.645 ± 0.307
3.08IleArg: 3.08 ± 0.372
4.928IleSer: 4.928 ± 0.417
5.11IleThr: 5.11 ± 0.563
3.406IleVal: 3.406 ± 0.285
0.507IleTrp: 0.507 ± 0.133
2.283IleTyr: 2.283 ± 0.31
0.0IleXaa: 0.0 ± 0.0
Lys
5.798LysAla: 5.798 ± 0.572
0.689LysCys: 0.689 ± 0.159
4.131LysAsp: 4.131 ± 0.357
5.001LysGlu: 5.001 ± 0.491
2.464LysPhe: 2.464 ± 0.316
3.551LysGly: 3.551 ± 0.392
0.906LysHis: 0.906 ± 0.158
3.406LysIle: 3.406 ± 0.376
4.059LysLys: 4.059 ± 0.471
6.197LysLeu: 6.197 ± 0.627
2.247LysMet: 2.247 ± 0.243
2.573LysAsn: 2.573 ± 0.285
2.392LysPro: 2.392 ± 0.344
2.283LysGln: 2.283 ± 0.29
2.972LysArg: 2.972 ± 0.322
4.167LysSer: 4.167 ± 0.523
3.261LysThr: 3.261 ± 0.308
4.53LysVal: 4.53 ± 0.525
0.725LysTrp: 0.725 ± 0.173
2.5LysTyr: 2.5 ± 0.323
0.0LysXaa: 0.0 ± 0.0
Leu
6.849LeuAla: 6.849 ± 0.696
1.087LeuCys: 1.087 ± 0.24
6.777LeuAsp: 6.777 ± 0.54
7.211LeuGlu: 7.211 ± 0.553
2.827LeuPhe: 2.827 ± 0.343
6.414LeuGly: 6.414 ± 0.646
1.305LeuHis: 1.305 ± 0.216
4.53LeuIle: 4.53 ± 0.426
5.907LeuLys: 5.907 ± 0.559
6.994LeuLeu: 6.994 ± 0.651
1.921LeuMet: 1.921 ± 0.23
4.783LeuAsn: 4.783 ± 0.422
3.95LeuPro: 3.95 ± 0.454
3.298LeuGln: 3.298 ± 0.469
3.986LeuArg: 3.986 ± 0.386
6.632LeuSer: 6.632 ± 0.434
4.892LeuThr: 4.892 ± 0.398
6.378LeuVal: 6.378 ± 0.537
0.797LeuTrp: 0.797 ± 0.19
2.754LeuTyr: 2.754 ± 0.296
0.0LeuXaa: 0.0 ± 0.0
Met
1.486MetAla: 1.486 ± 0.215
0.181MetCys: 0.181 ± 0.071
1.268MetAsp: 1.268 ± 0.222
1.45MetGlu: 1.45 ± 0.242
1.087MetPhe: 1.087 ± 0.21
1.486MetGly: 1.486 ± 0.188
0.507MetHis: 0.507 ± 0.137
1.522MetIle: 1.522 ± 0.228
1.377MetLys: 1.377 ± 0.24
2.138MetLeu: 2.138 ± 0.3
0.435MetMet: 0.435 ± 0.122
0.906MetAsn: 0.906 ± 0.189
0.761MetPro: 0.761 ± 0.157
1.123MetGln: 1.123 ± 0.276
1.377MetArg: 1.377 ± 0.257
2.211MetSer: 2.211 ± 0.288
1.884MetThr: 1.884 ± 0.263
1.594MetVal: 1.594 ± 0.229
0.109MetTrp: 0.109 ± 0.062
0.942MetTyr: 0.942 ± 0.154
0.0MetXaa: 0.0 ± 0.0
Asn
3.261AsnAla: 3.261 ± 0.386
0.29AsnCys: 0.29 ± 0.098
2.029AsnAsp: 2.029 ± 0.243
2.863AsnGlu: 2.863 ± 0.303
1.776AsnPhe: 1.776 ± 0.268
3.914AsnGly: 3.914 ± 0.423
1.015AsnHis: 1.015 ± 0.218
3.588AsnIle: 3.588 ± 0.436
3.443AsnLys: 3.443 ± 0.37
4.602AsnLeu: 4.602 ± 0.444
1.522AsnMet: 1.522 ± 0.252
3.008AsnAsn: 3.008 ± 0.333
2.464AsnPro: 2.464 ± 0.342
1.667AsnGln: 1.667 ± 0.256
2.609AsnArg: 2.609 ± 0.353
4.385AsnSer: 4.385 ± 0.47
2.935AsnThr: 2.935 ± 0.335
2.899AsnVal: 2.899 ± 0.327
1.015AsnTrp: 1.015 ± 0.213
1.993AsnTyr: 1.993 ± 0.277
0.0AsnXaa: 0.0 ± 0.0
Pro
2.645ProAla: 2.645 ± 0.332
0.254ProCys: 0.254 ± 0.112
1.921ProAsp: 1.921 ± 0.266
3.153ProGlu: 3.153 ± 0.363
1.812ProPhe: 1.812 ± 0.31
1.667ProGly: 1.667 ± 0.234
0.725ProHis: 0.725 ± 0.17
2.355ProIle: 2.355 ± 0.269
2.029ProLys: 2.029 ± 0.304
2.428ProLeu: 2.428 ± 0.249
0.544ProMet: 0.544 ± 0.139
2.102ProAsn: 2.102 ± 0.307
1.16ProPro: 1.16 ± 0.267
0.833ProGln: 0.833 ± 0.157
1.812ProArg: 1.812 ± 0.263
2.754ProSer: 2.754 ± 0.344
2.682ProThr: 2.682 ± 0.315
2.573ProVal: 2.573 ± 0.36
0.399ProTrp: 0.399 ± 0.119
1.667ProTyr: 1.667 ± 0.248
0.0ProXaa: 0.0 ± 0.0
Gln
2.645GlnAla: 2.645 ± 0.432
0.399GlnCys: 0.399 ± 0.114
2.102GlnAsp: 2.102 ± 0.29
2.754GlnGlu: 2.754 ± 0.365
1.268GlnPhe: 1.268 ± 0.231
2.645GlnGly: 2.645 ± 0.326
0.471GlnHis: 0.471 ± 0.131
1.486GlnIle: 1.486 ± 0.216
1.957GlnLys: 1.957 ± 0.291
3.66GlnLeu: 3.66 ± 0.505
0.616GlnMet: 0.616 ± 0.132
1.377GlnAsn: 1.377 ± 0.239
0.797GlnPro: 0.797 ± 0.168
1.776GlnGln: 1.776 ± 0.386
1.631GlnArg: 1.631 ± 0.26
2.682GlnSer: 2.682 ± 0.358
1.848GlnThr: 1.848 ± 0.246
2.428GlnVal: 2.428 ± 0.304
0.652GlnTrp: 0.652 ± 0.173
1.812GlnTyr: 1.812 ± 0.289
0.0GlnXaa: 0.0 ± 0.0
Arg
3.588ArgAla: 3.588 ± 0.414
0.145ArgCys: 0.145 ± 0.08
2.863ArgAsp: 2.863 ± 0.308
3.153ArgGlu: 3.153 ± 0.357
1.667ArgPhe: 1.667 ± 0.251
3.334ArgGly: 3.334 ± 0.374
0.471ArgHis: 0.471 ± 0.117
2.863ArgIle: 2.863 ± 0.328
2.754ArgLys: 2.754 ± 0.294
4.095ArgLeu: 4.095 ± 0.421
1.268ArgMet: 1.268 ± 0.22
2.718ArgAsn: 2.718 ± 0.316
1.305ArgPro: 1.305 ± 0.226
1.341ArgGln: 1.341 ± 0.241
2.645ArgArg: 2.645 ± 0.358
2.682ArgSer: 2.682 ± 0.296
2.537ArgThr: 2.537 ± 0.276
4.022ArgVal: 4.022 ± 0.325
0.435ArgTrp: 0.435 ± 0.128
1.921ArgTyr: 1.921 ± 0.329
0.0ArgXaa: 0.0 ± 0.0
Ser
4.457SerAla: 4.457 ± 0.519
0.942SerCys: 0.942 ± 0.171
4.204SerAsp: 4.204 ± 0.405
4.965SerGlu: 4.965 ± 0.482
3.733SerPhe: 3.733 ± 0.469
5.798SerGly: 5.798 ± 0.564
1.051SerHis: 1.051 ± 0.214
5.363SerIle: 5.363 ± 0.446
5.001SerLys: 5.001 ± 0.537
6.342SerLeu: 6.342 ± 0.378
1.884SerMet: 1.884 ± 0.282
3.334SerAsn: 3.334 ± 0.363
2.211SerPro: 2.211 ± 0.253
1.993SerGln: 1.993 ± 0.276
3.515SerArg: 3.515 ± 0.316
5.581SerSer: 5.581 ± 0.593
4.965SerThr: 4.965 ± 0.383
5.544SerVal: 5.544 ± 0.436
1.051SerTrp: 1.051 ± 0.207
2.319SerTyr: 2.319 ± 0.266
0.0SerXaa: 0.0 ± 0.0
Thr
4.022ThrAla: 4.022 ± 0.446
0.58ThrCys: 0.58 ± 0.148
3.878ThrAsp: 3.878 ± 0.418
4.204ThrGlu: 4.204 ± 0.337
2.392ThrPhe: 2.392 ± 0.222
5.037ThrGly: 5.037 ± 0.582
0.797ThrHis: 0.797 ± 0.167
4.675ThrIle: 4.675 ± 0.495
3.986ThrLys: 3.986 ± 0.381
5.291ThrLeu: 5.291 ± 0.537
1.051ThrMet: 1.051 ± 0.183
3.696ThrAsn: 3.696 ± 0.389
2.863ThrPro: 2.863 ± 0.396
2.138ThrGln: 2.138 ± 0.271
2.79ThrArg: 2.79 ± 0.345
4.385ThrSer: 4.385 ± 0.488
3.878ThrThr: 3.878 ± 0.484
4.53ThrVal: 4.53 ± 0.538
0.544ThrTrp: 0.544 ± 0.15
2.283ThrTyr: 2.283 ± 0.285
0.0ThrXaa: 0.0 ± 0.0
Val
4.494ValAla: 4.494 ± 0.491
0.689ValCys: 0.689 ± 0.17
4.385ValAsp: 4.385 ± 0.474
4.747ValGlu: 4.747 ± 0.394
3.044ValPhe: 3.044 ± 0.334
3.696ValGly: 3.696 ± 0.434
1.305ValHis: 1.305 ± 0.231
4.131ValIle: 4.131 ± 0.396
3.986ValLys: 3.986 ± 0.34
6.124ValLeu: 6.124 ± 0.447
1.594ValMet: 1.594 ± 0.249
2.935ValAsn: 2.935 ± 0.373
3.225ValPro: 3.225 ± 0.377
2.609ValGln: 2.609 ± 0.36
2.537ValArg: 2.537 ± 0.352
6.45ValSer: 6.45 ± 0.493
4.675ValThr: 4.675 ± 0.491
4.82ValVal: 4.82 ± 0.542
0.689ValTrp: 0.689 ± 0.165
2.863ValTyr: 2.863 ± 0.313
0.0ValXaa: 0.0 ± 0.0
Trp
0.544TrpAla: 0.544 ± 0.157
0.181TrpCys: 0.181 ± 0.101
1.015TrpAsp: 1.015 ± 0.186
0.906TrpGlu: 0.906 ± 0.179
0.399TrpPhe: 0.399 ± 0.104
0.906TrpGly: 0.906 ± 0.226
0.181TrpHis: 0.181 ± 0.08
0.652TrpIle: 0.652 ± 0.163
0.652TrpLys: 0.652 ± 0.153
1.268TrpLeu: 1.268 ± 0.192
0.29TrpMet: 0.29 ± 0.098
0.652TrpAsn: 0.652 ± 0.151
0.29TrpPro: 0.29 ± 0.116
0.326TrpGln: 0.326 ± 0.116
0.29TrpArg: 0.29 ± 0.095
0.87TrpSer: 0.87 ± 0.209
0.833TrpThr: 0.833 ± 0.179
0.978TrpVal: 0.978 ± 0.197
0.109TrpTrp: 0.109 ± 0.066
0.58TrpTyr: 0.58 ± 0.169
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.464TyrAla: 2.464 ± 0.254
0.725TyrCys: 0.725 ± 0.158
2.718TyrAsp: 2.718 ± 0.319
2.754TyrGlu: 2.754 ± 0.4
1.667TyrPhe: 1.667 ± 0.266
2.319TyrGly: 2.319 ± 0.343
0.833TyrHis: 0.833 ± 0.161
2.464TyrIle: 2.464 ± 0.321
2.392TyrLys: 2.392 ± 0.32
3.261TyrLeu: 3.261 ± 0.286
0.87TyrMet: 0.87 ± 0.197
1.884TyrAsn: 1.884 ± 0.225
1.921TyrPro: 1.921 ± 0.283
1.486TyrGln: 1.486 ± 0.245
2.102TyrArg: 2.102 ± 0.313
3.841TyrSer: 3.841 ± 0.421
2.464TyrThr: 2.464 ± 0.317
2.392TyrVal: 2.392 ± 0.28
0.652TyrTrp: 0.652 ± 0.148
1.848TyrTyr: 1.848 ± 0.302
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 127 proteins (27596 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski