Amino acid dipepetide frequency for Stenotrophomonas phage Pokken

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.928AlaAla: 9.928 ± 1.131
0.841AlaCys: 0.841 ± 0.275
5.637AlaAsp: 5.637 ± 0.71
6.436AlaGlu: 6.436 ± 0.617
2.272AlaPhe: 2.272 ± 0.278
7.236AlaGly: 7.236 ± 0.715
1.346AlaHis: 1.346 ± 0.246
4.081AlaIle: 4.081 ± 0.509
5.721AlaLys: 5.721 ± 0.586
8.035AlaLeu: 8.035 ± 1.057
2.819AlaMet: 2.819 ± 0.309
4.291AlaAsn: 4.291 ± 0.443
4.417AlaPro: 4.417 ± 0.535
5.216AlaGln: 5.216 ± 0.86
4.922AlaArg: 4.922 ± 0.589
5.259AlaSer: 5.259 ± 0.393
5.553AlaThr: 5.553 ± 0.618
6.058AlaVal: 6.058 ± 0.521
1.01AlaTrp: 1.01 ± 0.196
2.819AlaTyr: 2.819 ± 0.336
0.0AlaXaa: 0.0 ± 0.0
Cys
0.463CysAla: 0.463 ± 0.162
0.168CysCys: 0.168 ± 0.089
0.379CysAsp: 0.379 ± 0.136
0.505CysGlu: 0.505 ± 0.183
0.21CysPhe: 0.21 ± 0.107
0.589CysGly: 0.589 ± 0.191
0.168CysHis: 0.168 ± 0.102
0.421CysIle: 0.421 ± 0.155
0.421CysLys: 0.421 ± 0.141
0.421CysLeu: 0.421 ± 0.168
0.21CysMet: 0.21 ± 0.106
0.589CysAsn: 0.589 ± 0.213
0.294CysPro: 0.294 ± 0.126
0.337CysGln: 0.337 ± 0.146
0.21CysArg: 0.21 ± 0.121
0.547CysSer: 0.547 ± 0.181
0.379CysThr: 0.379 ± 0.155
0.631CysVal: 0.631 ± 0.184
0.168CysTrp: 0.168 ± 0.096
0.21CysTyr: 0.21 ± 0.103
0.0CysXaa: 0.0 ± 0.0
Asp
6.268AspAla: 6.268 ± 0.782
0.631AspCys: 0.631 ± 0.2
3.113AspAsp: 3.113 ± 0.346
3.786AspGlu: 3.786 ± 0.345
2.23AspPhe: 2.23 ± 0.25
4.88AspGly: 4.88 ± 0.468
1.472AspHis: 1.472 ± 0.346
3.071AspIle: 3.071 ± 0.436
3.155AspLys: 3.155 ± 0.355
5.974AspLeu: 5.974 ± 0.394
1.136AspMet: 1.136 ± 0.188
1.935AspAsn: 1.935 ± 0.333
3.66AspPro: 3.66 ± 0.512
2.734AspGln: 2.734 ± 0.352
3.786AspArg: 3.786 ± 0.353
3.197AspSer: 3.197 ± 0.48
3.534AspThr: 3.534 ± 0.556
3.45AspVal: 3.45 ± 0.31
1.01AspTrp: 1.01 ± 0.223
2.314AspTyr: 2.314 ± 0.313
0.0AspXaa: 0.0 ± 0.0
Glu
6.184GluAla: 6.184 ± 0.689
0.379GluCys: 0.379 ± 0.147
3.45GluAsp: 3.45 ± 0.393
4.291GluGlu: 4.291 ± 0.528
2.692GluPhe: 2.692 ± 0.342
4.375GluGly: 4.375 ± 0.348
1.346GluHis: 1.346 ± 0.237
3.786GluIle: 3.786 ± 0.503
2.734GluLys: 2.734 ± 0.386
5.048GluLeu: 5.048 ± 0.666
2.019GluMet: 2.019 ± 0.275
2.188GluAsn: 2.188 ± 0.278
2.356GluPro: 2.356 ± 0.349
3.45GluGln: 3.45 ± 0.379
2.482GluArg: 2.482 ± 0.402
3.576GluSer: 3.576 ± 0.36
3.45GluThr: 3.45 ± 0.356
4.838GluVal: 4.838 ± 0.403
0.883GluTrp: 0.883 ± 0.201
1.725GluTyr: 1.725 ± 0.264
0.0GluXaa: 0.0 ± 0.0
Phe
2.524PheAla: 2.524 ± 0.326
0.252PheCys: 0.252 ± 0.108
2.734PheAsp: 2.734 ± 0.329
1.977PheGlu: 1.977 ± 0.355
1.136PhePhe: 1.136 ± 0.226
3.113PheGly: 3.113 ± 0.425
0.757PheHis: 0.757 ± 0.203
1.767PheIle: 1.767 ± 0.27
2.145PheLys: 2.145 ± 0.305
2.524PheLeu: 2.524 ± 0.317
1.557PheMet: 1.557 ± 0.281
2.482PheAsn: 2.482 ± 0.316
1.22PhePro: 1.22 ± 0.198
1.557PheGln: 1.557 ± 0.201
1.977PheArg: 1.977 ± 0.27
2.061PheSer: 2.061 ± 0.244
1.893PheThr: 1.893 ± 0.235
2.061PheVal: 2.061 ± 0.257
0.337PheTrp: 0.337 ± 0.124
1.514PheTyr: 1.514 ± 0.24
0.0PheXaa: 0.0 ± 0.0
Gly
6.31GlyAla: 6.31 ± 0.573
0.463GlyCys: 0.463 ± 0.149
4.249GlyAsp: 4.249 ± 0.457
3.996GlyGlu: 3.996 ± 0.396
3.239GlyPhe: 3.239 ± 0.409
4.459GlyGly: 4.459 ± 0.7
1.22GlyHis: 1.22 ± 0.27
3.408GlyIle: 3.408 ± 0.349
3.912GlyLys: 3.912 ± 0.56
5.932GlyLeu: 5.932 ± 0.744
2.145GlyMet: 2.145 ± 0.283
3.828GlyAsn: 3.828 ± 0.549
2.23GlyPro: 2.23 ± 0.28
3.323GlyGln: 3.323 ± 0.376
3.323GlyArg: 3.323 ± 0.594
4.964GlySer: 4.964 ± 0.476
4.67GlyThr: 4.67 ± 0.422
4.712GlyVal: 4.712 ± 0.46
1.346GlyTrp: 1.346 ± 0.318
2.776GlyTyr: 2.776 ± 0.31
0.0GlyXaa: 0.0 ± 0.0
His
1.641HisAla: 1.641 ± 0.26
0.168HisCys: 0.168 ± 0.083
1.514HisAsp: 1.514 ± 0.29
0.841HisGlu: 0.841 ± 0.165
0.631HisPhe: 0.631 ± 0.175
1.683HisGly: 1.683 ± 0.327
0.589HisHis: 0.589 ± 0.208
0.631HisIle: 0.631 ± 0.232
1.346HisLys: 1.346 ± 0.326
2.019HisLeu: 2.019 ± 0.377
0.421HisMet: 0.421 ± 0.143
0.841HisAsn: 0.841 ± 0.199
1.178HisPro: 1.178 ± 0.334
0.715HisGln: 0.715 ± 0.178
1.262HisArg: 1.262 ± 0.241
1.22HisSer: 1.22 ± 0.249
0.841HisThr: 0.841 ± 0.182
1.22HisVal: 1.22 ± 0.252
0.463HisTrp: 0.463 ± 0.125
0.715HisTyr: 0.715 ± 0.166
0.0HisXaa: 0.0 ± 0.0
Ile
4.123IleAla: 4.123 ± 0.36
0.379IleCys: 0.379 ± 0.146
4.67IleAsp: 4.67 ± 0.393
3.87IleGlu: 3.87 ± 0.435
1.472IlePhe: 1.472 ± 0.233
3.954IleGly: 3.954 ± 0.366
0.925IleHis: 0.925 ± 0.21
3.239IleIle: 3.239 ± 0.537
3.197IleLys: 3.197 ± 0.464
3.87IleLeu: 3.87 ± 0.348
1.262IleMet: 1.262 ± 0.217
2.65IleAsn: 2.65 ± 0.294
2.44IlePro: 2.44 ± 0.448
2.23IleGln: 2.23 ± 0.24
3.45IleArg: 3.45 ± 0.424
2.734IleSer: 2.734 ± 0.319
4.417IleThr: 4.417 ± 0.523
2.65IleVal: 2.65 ± 0.383
0.715IleTrp: 0.715 ± 0.217
1.935IleTyr: 1.935 ± 0.301
0.0IleXaa: 0.0 ± 0.0
Lys
5.216LysAla: 5.216 ± 0.533
0.252LysCys: 0.252 ± 0.125
3.197LysAsp: 3.197 ± 0.387
2.987LysGlu: 2.987 ± 0.271
1.641LysPhe: 1.641 ± 0.259
3.323LysGly: 3.323 ± 0.37
1.052LysHis: 1.052 ± 0.206
3.576LysIle: 3.576 ± 0.261
4.796LysLys: 4.796 ± 0.488
5.006LysLeu: 5.006 ± 0.6
1.472LysMet: 1.472 ± 0.26
2.903LysAsn: 2.903 ± 0.348
2.524LysPro: 2.524 ± 0.444
3.071LysGln: 3.071 ± 0.434
3.155LysArg: 3.155 ± 0.564
2.734LysSer: 2.734 ± 0.305
4.375LysThr: 4.375 ± 0.434
3.996LysVal: 3.996 ± 0.49
0.757LysTrp: 0.757 ± 0.227
1.599LysTyr: 1.599 ± 0.284
0.0LysXaa: 0.0 ± 0.0
Leu
7.488LeuAla: 7.488 ± 0.799
0.715LeuCys: 0.715 ± 0.205
5.343LeuAsp: 5.343 ± 0.416
5.132LeuGlu: 5.132 ± 0.397
2.903LeuPhe: 2.903 ± 0.316
4.922LeuGly: 4.922 ± 0.377
1.767LeuHis: 1.767 ± 0.228
4.838LeuIle: 4.838 ± 0.576
4.585LeuLys: 4.585 ± 0.385
6.436LeuLeu: 6.436 ± 0.634
2.482LeuMet: 2.482 ± 0.297
4.712LeuAsn: 4.712 ± 0.564
3.66LeuPro: 3.66 ± 0.31
3.365LeuGln: 3.365 ± 0.272
4.501LeuArg: 4.501 ± 0.476
4.922LeuSer: 4.922 ± 0.429
6.563LeuThr: 6.563 ± 0.569
5.259LeuVal: 5.259 ± 0.497
1.01LeuTrp: 1.01 ± 0.257
2.145LeuTyr: 2.145 ± 0.304
0.0LeuXaa: 0.0 ± 0.0
Met
3.534MetAla: 3.534 ± 0.373
0.294MetCys: 0.294 ± 0.104
1.767MetAsp: 1.767 ± 0.278
1.599MetGlu: 1.599 ± 0.241
1.01MetPhe: 1.01 ± 0.193
1.472MetGly: 1.472 ± 0.202
0.294MetHis: 0.294 ± 0.091
1.43MetIle: 1.43 ± 0.264
1.683MetLys: 1.683 ± 0.229
2.398MetLeu: 2.398 ± 0.331
0.589MetMet: 0.589 ± 0.173
1.304MetAsn: 1.304 ± 0.216
1.472MetPro: 1.472 ± 0.243
1.262MetGln: 1.262 ± 0.291
1.767MetArg: 1.767 ± 0.215
2.356MetSer: 2.356 ± 0.325
1.851MetThr: 1.851 ± 0.278
1.767MetVal: 1.767 ± 0.296
0.294MetTrp: 0.294 ± 0.101
0.631MetTyr: 0.631 ± 0.15
0.0MetXaa: 0.0 ± 0.0
Asn
4.459AsnAla: 4.459 ± 0.682
0.421AsnCys: 0.421 ± 0.152
2.524AsnAsp: 2.524 ± 0.361
3.239AsnGlu: 3.239 ± 0.368
1.851AsnPhe: 1.851 ± 0.242
3.744AsnGly: 3.744 ± 0.382
1.01AsnHis: 1.01 ± 0.205
1.935AsnIle: 1.935 ± 0.263
3.029AsnLys: 3.029 ± 0.33
3.702AsnLeu: 3.702 ± 0.386
1.725AsnMet: 1.725 ± 0.26
2.272AsnAsn: 2.272 ± 0.292
3.113AsnPro: 3.113 ± 0.352
2.903AsnGln: 2.903 ± 0.372
2.819AsnArg: 2.819 ± 0.32
3.071AsnSer: 3.071 ± 0.347
2.524AsnThr: 2.524 ± 0.431
2.692AsnVal: 2.692 ± 0.332
0.673AsnTrp: 0.673 ± 0.167
1.725AsnTyr: 1.725 ± 0.222
0.0AsnXaa: 0.0 ± 0.0
Pro
3.996ProAla: 3.996 ± 0.341
0.294ProCys: 0.294 ± 0.113
2.945ProAsp: 2.945 ± 0.438
3.45ProGlu: 3.45 ± 0.419
1.472ProPhe: 1.472 ± 0.261
2.987ProGly: 2.987 ± 0.485
0.883ProHis: 0.883 ± 0.202
2.356ProIle: 2.356 ± 0.325
2.23ProLys: 2.23 ± 0.316
3.239ProLeu: 3.239 ± 0.345
1.388ProMet: 1.388 ± 0.227
2.903ProAsn: 2.903 ± 0.301
1.178ProPro: 1.178 ± 0.338
1.641ProGln: 1.641 ± 0.344
1.599ProArg: 1.599 ± 0.343
2.566ProSer: 2.566 ± 0.414
3.45ProThr: 3.45 ± 0.374
3.281ProVal: 3.281 ± 0.387
0.673ProTrp: 0.673 ± 0.157
1.599ProTyr: 1.599 ± 0.272
0.0ProXaa: 0.0 ± 0.0
Gln
4.459GlnAla: 4.459 ± 0.634
0.042GlnCys: 0.042 ± 0.045
1.893GlnAsp: 1.893 ± 0.3
2.819GlnGlu: 2.819 ± 0.375
2.356GlnPhe: 2.356 ± 0.387
3.071GlnGly: 3.071 ± 0.585
1.346GlnHis: 1.346 ± 0.252
2.566GlnIle: 2.566 ± 0.32
2.65GlnLys: 2.65 ± 0.309
4.964GlnLeu: 4.964 ± 0.456
1.683GlnMet: 1.683 ± 0.257
2.103GlnAsn: 2.103 ± 0.286
1.641GlnPro: 1.641 ± 0.289
2.524GlnGln: 2.524 ± 0.461
2.188GlnArg: 2.188 ± 0.352
3.029GlnSer: 3.029 ± 0.402
2.945GlnThr: 2.945 ± 0.337
2.987GlnVal: 2.987 ± 0.272
0.421GlnTrp: 0.421 ± 0.103
1.388GlnTyr: 1.388 ± 0.251
0.0GlnXaa: 0.0 ± 0.0
Arg
4.039ArgAla: 4.039 ± 0.573
0.421ArgCys: 0.421 ± 0.169
2.861ArgAsp: 2.861 ± 0.325
2.945ArgGlu: 2.945 ± 0.462
2.188ArgPhe: 2.188 ± 0.369
3.618ArgGly: 3.618 ± 0.341
1.136ArgHis: 1.136 ± 0.243
2.819ArgIle: 2.819 ± 0.306
3.197ArgLys: 3.197 ± 0.347
4.754ArgLeu: 4.754 ± 0.446
1.136ArgMet: 1.136 ± 0.212
2.145ArgAsn: 2.145 ± 0.292
1.767ArgPro: 1.767 ± 0.303
2.608ArgGln: 2.608 ± 0.417
2.314ArgArg: 2.314 ± 0.344
3.744ArgSer: 3.744 ± 0.56
2.734ArgThr: 2.734 ± 0.342
3.534ArgVal: 3.534 ± 0.436
0.883ArgTrp: 0.883 ± 0.198
1.725ArgTyr: 1.725 ± 0.305
0.0ArgXaa: 0.0 ± 0.0
Ser
5.132SerAla: 5.132 ± 0.475
0.252SerCys: 0.252 ± 0.115
3.239SerAsp: 3.239 ± 0.37
3.66SerGlu: 3.66 ± 0.353
2.145SerPhe: 2.145 ± 0.261
4.838SerGly: 4.838 ± 0.552
0.925SerHis: 0.925 ± 0.21
3.912SerIle: 3.912 ± 0.411
3.155SerLys: 3.155 ± 0.371
5.637SerLeu: 5.637 ± 0.737
2.145SerMet: 2.145 ± 0.362
2.608SerAsn: 2.608 ± 0.244
2.861SerPro: 2.861 ± 0.297
2.272SerGln: 2.272 ± 0.239
2.734SerArg: 2.734 ± 0.274
3.492SerSer: 3.492 ± 0.554
3.828SerThr: 3.828 ± 0.406
3.576SerVal: 3.576 ± 0.368
0.715SerTrp: 0.715 ± 0.176
2.145SerTyr: 2.145 ± 0.259
0.0SerXaa: 0.0 ± 0.0
Thr
6.31ThrAla: 6.31 ± 0.66
0.379ThrCys: 0.379 ± 0.147
3.954ThrAsp: 3.954 ± 0.383
3.365ThrGlu: 3.365 ± 0.328
2.188ThrPhe: 2.188 ± 0.3
4.375ThrGly: 4.375 ± 0.379
1.388ThrHis: 1.388 ± 0.246
4.207ThrIle: 4.207 ± 0.489
2.987ThrLys: 2.987 ± 0.375
4.796ThrLeu: 4.796 ± 0.357
1.599ThrMet: 1.599 ± 0.268
3.492ThrAsn: 3.492 ± 0.399
3.45ThrPro: 3.45 ± 0.334
2.987ThrGln: 2.987 ± 0.322
2.524ThrArg: 2.524 ± 0.48
3.618ThrSer: 3.618 ± 0.44
4.417ThrThr: 4.417 ± 0.629
5.174ThrVal: 5.174 ± 0.476
0.841ThrTrp: 0.841 ± 0.135
2.314ThrTyr: 2.314 ± 0.447
0.0ThrXaa: 0.0 ± 0.0
Val
7.362ValAla: 7.362 ± 0.782
0.379ValCys: 0.379 ± 0.167
4.796ValAsp: 4.796 ± 0.447
3.996ValGlu: 3.996 ± 0.384
2.061ValPhe: 2.061 ± 0.254
4.501ValGly: 4.501 ± 0.477
1.22ValHis: 1.22 ± 0.236
3.66ValIle: 3.66 ± 0.333
3.786ValLys: 3.786 ± 0.419
4.922ValLeu: 4.922 ± 0.522
1.641ValMet: 1.641 ± 0.268
3.113ValAsn: 3.113 ± 0.351
3.239ValPro: 3.239 ± 0.342
2.44ValGln: 2.44 ± 0.312
2.987ValArg: 2.987 ± 0.301
3.954ValSer: 3.954 ± 0.435
4.796ValThr: 4.796 ± 0.543
4.459ValVal: 4.459 ± 0.469
0.968ValTrp: 0.968 ± 0.235
1.599ValTyr: 1.599 ± 0.282
0.0ValXaa: 0.0 ± 0.0
Trp
1.178TrpAla: 1.178 ± 0.173
0.126TrpCys: 0.126 ± 0.086
1.094TrpAsp: 1.094 ± 0.28
0.799TrpGlu: 0.799 ± 0.211
0.883TrpPhe: 0.883 ± 0.237
0.883TrpGly: 0.883 ± 0.184
0.505TrpHis: 0.505 ± 0.223
0.631TrpIle: 0.631 ± 0.142
0.968TrpLys: 0.968 ± 0.177
1.136TrpLeu: 1.136 ± 0.259
0.463TrpMet: 0.463 ± 0.117
0.968TrpAsn: 0.968 ± 0.228
0.294TrpPro: 0.294 ± 0.092
0.547TrpGln: 0.547 ± 0.144
0.589TrpArg: 0.589 ± 0.171
0.673TrpSer: 0.673 ± 0.139
0.463TrpThr: 0.463 ± 0.142
0.968TrpVal: 0.968 ± 0.154
0.084TrpTrp: 0.084 ± 0.062
0.547TrpTyr: 0.547 ± 0.148
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.281TyrAla: 3.281 ± 0.358
0.463TyrCys: 0.463 ± 0.157
1.977TyrAsp: 1.977 ± 0.266
1.557TyrGlu: 1.557 ± 0.227
1.01TyrPhe: 1.01 ± 0.245
2.482TyrGly: 2.482 ± 0.294
0.589TyrHis: 0.589 ± 0.186
1.851TyrIle: 1.851 ± 0.249
1.935TyrLys: 1.935 ± 0.318
1.935TyrLeu: 1.935 ± 0.279
0.841TyrMet: 0.841 ± 0.228
2.103TyrAsn: 2.103 ± 0.29
1.136TyrPro: 1.136 ± 0.239
1.935TyrGln: 1.935 ± 0.294
1.977TyrArg: 1.977 ± 0.315
1.557TyrSer: 1.557 ± 0.249
1.683TyrThr: 1.683 ± 0.296
2.566TyrVal: 2.566 ± 0.312
0.547TyrTrp: 0.547 ± 0.127
0.883TyrTyr: 0.883 ± 0.227
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 93 proteins (23772 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski