Amino acid dipepetide frequency for Gordonia phage Phistory

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
13.347AlaAla: 13.347 ± 1.181
0.827AlaCys: 0.827 ± 0.245
6.437AlaAsp: 6.437 ± 0.664
8.268AlaGlu: 8.268 ± 0.967
2.598AlaPhe: 2.598 ± 0.737
9.862AlaGly: 9.862 ± 0.82
1.595AlaHis: 1.595 ± 0.299
6.142AlaIle: 6.142 ± 0.713
3.839AlaLys: 3.839 ± 0.513
9.036AlaLeu: 9.036 ± 0.675
3.78AlaMet: 3.78 ± 0.523
2.953AlaAsn: 2.953 ± 0.449
4.725AlaPro: 4.725 ± 0.472
4.488AlaGln: 4.488 ± 0.618
7.146AlaArg: 7.146 ± 0.706
6.142AlaSer: 6.142 ± 0.508
5.965AlaThr: 5.965 ± 0.759
8.091AlaVal: 8.091 ± 1.384
2.067AlaTrp: 2.067 ± 0.339
2.244AlaTyr: 2.244 ± 0.353
0.0AlaXaa: 0.0 ± 0.0
Cys
0.768CysAla: 0.768 ± 0.227
0.236CysCys: 0.236 ± 0.138
0.945CysAsp: 0.945 ± 0.231
0.886CysGlu: 0.886 ± 0.273
0.059CysPhe: 0.059 ± 0.061
1.417CysGly: 1.417 ± 0.445
0.472CysHis: 0.472 ± 0.171
0.354CysIle: 0.354 ± 0.138
0.472CysLys: 0.472 ± 0.18
0.827CysLeu: 0.827 ± 0.261
0.177CysMet: 0.177 ± 0.096
0.472CysAsn: 0.472 ± 0.208
0.886CysPro: 0.886 ± 0.262
0.591CysGln: 0.591 ± 0.216
1.004CysArg: 1.004 ± 0.299
0.768CysSer: 0.768 ± 0.428
0.591CysThr: 0.591 ± 0.185
1.004CysVal: 1.004 ± 0.277
0.295CysTrp: 0.295 ± 0.136
0.177CysTyr: 0.177 ± 0.102
0.0CysXaa: 0.0 ± 0.0
Asp
6.851AspAla: 6.851 ± 0.694
0.591AspCys: 0.591 ± 0.205
5.02AspAsp: 5.02 ± 0.669
4.193AspGlu: 4.193 ± 0.59
1.476AspPhe: 1.476 ± 0.266
6.142AspGly: 6.142 ± 0.717
1.181AspHis: 1.181 ± 0.239
2.717AspIle: 2.717 ± 0.427
2.362AspLys: 2.362 ± 0.344
5.61AspLeu: 5.61 ± 0.569
1.122AspMet: 1.122 ± 0.272
1.24AspAsn: 1.24 ± 0.27
4.075AspPro: 4.075 ± 0.427
1.595AspGln: 1.595 ± 0.339
4.843AspArg: 4.843 ± 0.573
3.543AspSer: 3.543 ± 0.489
3.425AspThr: 3.425 ± 0.436
4.665AspVal: 4.665 ± 0.505
1.654AspTrp: 1.654 ± 0.288
1.772AspTyr: 1.772 ± 0.299
0.0AspXaa: 0.0 ± 0.0
Glu
6.732GluAla: 6.732 ± 0.683
0.532GluCys: 0.532 ± 0.202
2.953GluAsp: 2.953 ± 0.43
2.835GluGlu: 2.835 ± 0.532
1.831GluPhe: 1.831 ± 0.291
3.957GluGly: 3.957 ± 0.478
1.831GluHis: 1.831 ± 0.3
2.598GluIle: 2.598 ± 0.395
2.717GluLys: 2.717 ± 0.422
6.024GluLeu: 6.024 ± 0.651
1.358GluMet: 1.358 ± 0.302
0.886GluAsn: 0.886 ± 0.209
3.484GluPro: 3.484 ± 0.636
2.776GluGln: 2.776 ± 0.382
4.902GluArg: 4.902 ± 0.702
3.543GluSer: 3.543 ± 0.536
3.366GluThr: 3.366 ± 0.413
4.311GluVal: 4.311 ± 0.518
1.595GluTrp: 1.595 ± 0.388
0.945GluTyr: 0.945 ± 0.259
0.0GluXaa: 0.0 ± 0.0
Phe
2.598PheAla: 2.598 ± 0.404
0.354PheCys: 0.354 ± 0.149
1.476PheAsp: 1.476 ± 0.318
1.063PheGlu: 1.063 ± 0.257
0.768PhePhe: 0.768 ± 0.187
3.071PheGly: 3.071 ± 0.448
0.591PheHis: 0.591 ± 0.167
0.945PheIle: 0.945 ± 0.252
1.181PheLys: 1.181 ± 0.257
1.417PheLeu: 1.417 ± 0.316
0.827PheMet: 0.827 ± 0.264
0.945PheAsn: 0.945 ± 0.258
1.24PhePro: 1.24 ± 0.253
1.063PheGln: 1.063 ± 0.207
1.89PheArg: 1.89 ± 0.31
1.772PheSer: 1.772 ± 0.302
1.772PheThr: 1.772 ± 0.263
1.772PheVal: 1.772 ± 0.375
0.532PheTrp: 0.532 ± 0.2
0.945PheTyr: 0.945 ± 0.245
0.0PheXaa: 0.0 ± 0.0
Gly
8.091GlyAla: 8.091 ± 1.062
0.591GlyCys: 0.591 ± 0.234
4.961GlyAsp: 4.961 ± 0.64
4.075GlyGlu: 4.075 ± 0.56
3.13GlyPhe: 3.13 ± 0.53
6.732GlyGly: 6.732 ± 1.228
1.949GlyHis: 1.949 ± 0.398
4.252GlyIle: 4.252 ± 0.555
3.189GlyLys: 3.189 ± 0.402
6.732GlyLeu: 6.732 ± 0.629
2.126GlyMet: 2.126 ± 0.407
2.126GlyAsn: 2.126 ± 0.401
3.484GlyPro: 3.484 ± 0.421
2.835GlyGln: 2.835 ± 0.355
6.496GlyArg: 6.496 ± 0.473
4.725GlySer: 4.725 ± 0.57
5.669GlyThr: 5.669 ± 0.57
7.323GlyVal: 7.323 ± 0.748
2.658GlyTrp: 2.658 ± 0.342
2.126GlyTyr: 2.126 ± 0.359
0.0GlyXaa: 0.0 ± 0.0
His
2.008HisAla: 2.008 ± 0.365
0.532HisCys: 0.532 ± 0.204
1.181HisAsp: 1.181 ± 0.224
1.299HisGlu: 1.299 ± 0.376
0.532HisPhe: 0.532 ± 0.156
1.949HisGly: 1.949 ± 0.28
0.65HisHis: 0.65 ± 0.183
1.181HisIle: 1.181 ± 0.314
0.532HisLys: 0.532 ± 0.173
1.949HisLeu: 1.949 ± 0.387
0.886HisMet: 0.886 ± 0.226
0.709HisAsn: 0.709 ± 0.196
1.122HisPro: 1.122 ± 0.217
0.827HisGln: 0.827 ± 0.236
1.831HisArg: 1.831 ± 0.36
0.886HisSer: 0.886 ± 0.205
1.535HisThr: 1.535 ± 0.277
2.008HisVal: 2.008 ± 0.305
0.65HisTrp: 0.65 ± 0.269
0.709HisTyr: 0.709 ± 0.228
0.0HisXaa: 0.0 ± 0.0
Ile
6.201IleAla: 6.201 ± 0.598
0.532IleCys: 0.532 ± 0.233
3.898IleAsp: 3.898 ± 0.561
3.248IleGlu: 3.248 ± 0.433
0.945IlePhe: 0.945 ± 0.333
4.547IleGly: 4.547 ± 0.629
0.945IleHis: 0.945 ± 0.238
1.595IleIle: 1.595 ± 0.392
2.126IleLys: 2.126 ± 0.582
3.307IleLeu: 3.307 ± 0.414
0.532IleMet: 0.532 ± 0.191
1.476IleAsn: 1.476 ± 0.309
2.658IlePro: 2.658 ± 0.309
1.535IleGln: 1.535 ± 0.295
3.484IleArg: 3.484 ± 0.391
1.949IleSer: 1.949 ± 0.329
3.189IleThr: 3.189 ± 0.491
3.484IleVal: 3.484 ± 0.474
0.709IleTrp: 0.709 ± 0.191
0.886IleTyr: 0.886 ± 0.257
0.0IleXaa: 0.0 ± 0.0
Lys
4.429LysAla: 4.429 ± 0.55
0.413LysCys: 0.413 ± 0.169
2.362LysAsp: 2.362 ± 0.558
1.772LysGlu: 1.772 ± 0.344
1.358LysPhe: 1.358 ± 0.267
3.071LysGly: 3.071 ± 0.385
0.945LysHis: 0.945 ± 0.202
1.417LysIle: 1.417 ± 0.23
2.067LysLys: 2.067 ± 0.468
3.13LysLeu: 3.13 ± 0.319
0.65LysMet: 0.65 ± 0.205
0.886LysAsn: 0.886 ± 0.283
1.358LysPro: 1.358 ± 0.3
1.299LysGln: 1.299 ± 0.315
2.421LysArg: 2.421 ± 0.423
2.067LysSer: 2.067 ± 0.348
2.598LysThr: 2.598 ± 0.362
2.658LysVal: 2.658 ± 0.406
0.886LysTrp: 0.886 ± 0.201
0.709LysTyr: 0.709 ± 0.168
0.0LysXaa: 0.0 ± 0.0
Leu
10.099LeuAla: 10.099 ± 0.87
0.827LeuCys: 0.827 ± 0.256
4.784LeuAsp: 4.784 ± 0.596
4.488LeuGlu: 4.488 ± 0.663
1.654LeuPhe: 1.654 ± 0.31
6.26LeuGly: 6.26 ± 0.699
1.595LeuHis: 1.595 ± 0.334
3.543LeuIle: 3.543 ± 0.475
2.185LeuLys: 2.185 ± 0.368
5.492LeuLeu: 5.492 ± 0.589
1.299LeuMet: 1.299 ± 0.264
1.89LeuAsn: 1.89 ± 0.355
4.665LeuPro: 4.665 ± 0.538
2.48LeuGln: 2.48 ± 0.384
6.024LeuArg: 6.024 ± 0.613
4.902LeuSer: 4.902 ± 0.492
5.728LeuThr: 5.728 ± 0.613
6.201LeuVal: 6.201 ± 0.646
1.713LeuTrp: 1.713 ± 0.403
1.476LeuTyr: 1.476 ± 0.269
0.0LeuXaa: 0.0 ± 0.0
Met
2.776MetAla: 2.776 ± 0.465
0.354MetCys: 0.354 ± 0.139
1.122MetAsp: 1.122 ± 0.267
0.472MetGlu: 0.472 ± 0.161
0.413MetPhe: 0.413 ± 0.133
1.535MetGly: 1.535 ± 0.36
0.413MetHis: 0.413 ± 0.173
0.886MetIle: 0.886 ± 0.228
1.24MetLys: 1.24 ± 0.292
1.24MetLeu: 1.24 ± 0.271
0.354MetMet: 0.354 ± 0.114
0.945MetAsn: 0.945 ± 0.198
1.595MetPro: 1.595 ± 0.325
0.591MetGln: 0.591 ± 0.158
1.535MetArg: 1.535 ± 0.414
2.894MetSer: 2.894 ± 0.342
2.776MetThr: 2.776 ± 0.394
1.476MetVal: 1.476 ± 0.282
0.532MetTrp: 0.532 ± 0.187
0.413MetTyr: 0.413 ± 0.15
0.0MetXaa: 0.0 ± 0.0
Asn
3.839AsnAla: 3.839 ± 0.594
0.532AsnCys: 0.532 ± 0.211
2.067AsnAsp: 2.067 ± 0.348
1.299AsnGlu: 1.299 ± 0.29
0.472AsnPhe: 0.472 ± 0.157
2.894AsnGly: 2.894 ± 0.467
0.709AsnHis: 0.709 ± 0.195
1.358AsnIle: 1.358 ± 0.362
0.591AsnLys: 0.591 ± 0.171
2.303AsnLeu: 2.303 ± 0.385
0.295AsnMet: 0.295 ± 0.1
0.532AsnAsn: 0.532 ± 0.198
1.772AsnPro: 1.772 ± 0.247
0.886AsnGln: 0.886 ± 0.231
2.303AsnArg: 2.303 ± 0.428
1.24AsnSer: 1.24 ± 0.256
1.417AsnThr: 1.417 ± 0.238
2.126AsnVal: 2.126 ± 0.381
0.295AsnTrp: 0.295 ± 0.106
0.827AsnTyr: 0.827 ± 0.217
0.0AsnXaa: 0.0 ± 0.0
Pro
5.197ProAla: 5.197 ± 0.447
0.827ProCys: 0.827 ± 0.239
3.898ProAsp: 3.898 ± 0.491
3.839ProGlu: 3.839 ± 0.47
1.358ProPhe: 1.358 ± 0.313
4.016ProGly: 4.016 ± 0.462
1.004ProHis: 1.004 ± 0.228
2.776ProIle: 2.776 ± 0.396
3.012ProLys: 3.012 ± 0.426
3.189ProLeu: 3.189 ± 0.396
1.24ProMet: 1.24 ± 0.247
1.713ProAsn: 1.713 ± 0.366
2.776ProPro: 2.776 ± 0.573
1.535ProGln: 1.535 ± 0.315
3.13ProArg: 3.13 ± 0.507
3.839ProSer: 3.839 ± 0.452
3.543ProThr: 3.543 ± 0.458
4.311ProVal: 4.311 ± 0.473
0.945ProTrp: 0.945 ± 0.226
1.24ProTyr: 1.24 ± 0.313
0.0ProXaa: 0.0 ± 0.0
Gln
3.484GlnAla: 3.484 ± 0.573
0.709GlnCys: 0.709 ± 0.2
1.772GlnAsp: 1.772 ± 0.287
2.008GlnGlu: 2.008 ± 0.305
1.181GlnPhe: 1.181 ± 0.224
1.713GlnGly: 1.713 ± 0.368
0.768GlnHis: 0.768 ± 0.235
2.008GlnIle: 2.008 ± 0.297
0.768GlnLys: 0.768 ± 0.191
2.776GlnLeu: 2.776 ± 0.412
1.358GlnMet: 1.358 ± 0.243
1.181GlnAsn: 1.181 ± 0.248
1.713GlnPro: 1.713 ± 0.275
1.713GlnGln: 1.713 ± 0.308
3.248GlnArg: 3.248 ± 0.394
2.421GlnSer: 2.421 ± 0.395
2.126GlnThr: 2.126 ± 0.388
2.421GlnVal: 2.421 ± 0.35
0.886GlnTrp: 0.886 ± 0.215
0.591GlnTyr: 0.591 ± 0.178
0.0GlnXaa: 0.0 ± 0.0
Arg
7.5ArgAla: 7.5 ± 0.616
0.945ArgCys: 0.945 ± 0.253
5.079ArgAsp: 5.079 ± 0.521
4.606ArgGlu: 4.606 ± 0.567
1.772ArgPhe: 1.772 ± 0.273
4.843ArgGly: 4.843 ± 0.544
2.421ArgHis: 2.421 ± 0.435
4.429ArgIle: 4.429 ± 0.516
2.48ArgLys: 2.48 ± 0.412
5.965ArgLeu: 5.965 ± 0.63
2.008ArgMet: 2.008 ± 0.305
2.421ArgAsn: 2.421 ± 0.397
3.189ArgPro: 3.189 ± 0.583
2.717ArgGln: 2.717 ± 0.41
6.851ArgArg: 6.851 ± 0.963
4.016ArgSer: 4.016 ± 0.601
3.721ArgThr: 3.721 ± 0.441
5.079ArgVal: 5.079 ± 0.657
2.067ArgTrp: 2.067 ± 0.374
2.244ArgTyr: 2.244 ± 0.341
0.0ArgXaa: 0.0 ± 0.0
Ser
6.555SerAla: 6.555 ± 0.571
0.768SerCys: 0.768 ± 0.249
3.602SerAsp: 3.602 ± 0.517
3.957SerGlu: 3.957 ± 0.497
1.299SerPhe: 1.299 ± 0.259
6.91SerGly: 6.91 ± 0.73
1.654SerHis: 1.654 ± 0.302
2.185SerIle: 2.185 ± 0.367
1.89SerLys: 1.89 ± 0.411
4.547SerLeu: 4.547 ± 0.498
1.949SerMet: 1.949 ± 0.375
1.417SerAsn: 1.417 ± 0.33
2.539SerPro: 2.539 ± 0.321
1.299SerGln: 1.299 ± 0.325
4.547SerArg: 4.547 ± 0.642
4.016SerSer: 4.016 ± 0.535
3.898SerThr: 3.898 ± 0.596
4.311SerVal: 4.311 ± 0.652
1.24SerTrp: 1.24 ± 0.222
1.299SerTyr: 1.299 ± 0.297
0.0SerXaa: 0.0 ± 0.0
Thr
7.264ThrAla: 7.264 ± 0.604
0.827ThrCys: 0.827 ± 0.344
3.721ThrAsp: 3.721 ± 0.513
3.13ThrGlu: 3.13 ± 0.485
1.949ThrPhe: 1.949 ± 0.301
5.079ThrGly: 5.079 ± 0.586
2.067ThrHis: 2.067 ± 0.365
4.134ThrIle: 4.134 ± 0.562
1.476ThrLys: 1.476 ± 0.31
5.847ThrLeu: 5.847 ± 0.516
0.591ThrMet: 0.591 ± 0.17
1.949ThrAsn: 1.949 ± 0.327
5.079ThrPro: 5.079 ± 0.481
2.185ThrGln: 2.185 ± 0.391
4.134ThrArg: 4.134 ± 0.596
3.602ThrSer: 3.602 ± 0.48
4.37ThrThr: 4.37 ± 0.588
5.197ThrVal: 5.197 ± 0.625
1.24ThrTrp: 1.24 ± 0.308
1.358ThrTyr: 1.358 ± 0.306
0.0ThrXaa: 0.0 ± 0.0
Val
7.677ValAla: 7.677 ± 0.731
1.063ValCys: 1.063 ± 0.385
4.725ValAsp: 4.725 ± 0.557
5.315ValGlu: 5.315 ± 0.644
2.185ValPhe: 2.185 ± 0.327
5.492ValGly: 5.492 ± 0.653
1.299ValHis: 1.299 ± 0.252
2.776ValIle: 2.776 ± 0.485
2.658ValLys: 2.658 ± 0.468
5.138ValLeu: 5.138 ± 0.477
1.713ValMet: 1.713 ± 0.364
2.303ValAsn: 2.303 ± 0.392
4.311ValPro: 4.311 ± 0.488
2.894ValGln: 2.894 ± 0.399
5.138ValArg: 5.138 ± 0.639
5.138ValSer: 5.138 ± 0.674
5.315ValThr: 5.315 ± 0.754
6.319ValVal: 6.319 ± 0.902
2.658ValTrp: 2.658 ± 0.423
1.772ValTyr: 1.772 ± 0.416
0.0ValXaa: 0.0 ± 0.0
Trp
2.185TrpAla: 2.185 ± 0.443
0.591TrpCys: 0.591 ± 0.175
1.713TrpAsp: 1.713 ± 0.342
1.181TrpGlu: 1.181 ± 0.208
0.472TrpPhe: 0.472 ± 0.162
1.358TrpGly: 1.358 ± 0.275
0.413TrpHis: 0.413 ± 0.167
1.417TrpIle: 1.417 ± 0.259
0.709TrpLys: 0.709 ± 0.212
1.831TrpLeu: 1.831 ± 0.394
0.827TrpMet: 0.827 ± 0.216
1.181TrpAsn: 1.181 ± 0.331
1.24TrpPro: 1.24 ± 0.259
1.004TrpGln: 1.004 ± 0.225
1.417TrpArg: 1.417 ± 0.276
1.063TrpSer: 1.063 ± 0.249
2.126TrpThr: 2.126 ± 0.316
1.535TrpVal: 1.535 ± 0.343
0.472TrpTrp: 0.472 ± 0.185
0.768TrpTyr: 0.768 ± 0.197
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.949TyrAla: 1.949 ± 0.34
0.354TyrCys: 0.354 ± 0.152
2.362TyrAsp: 2.362 ± 0.397
1.24TyrGlu: 1.24 ± 0.309
0.768TyrPhe: 0.768 ± 0.18
2.421TyrGly: 2.421 ± 0.364
0.472TyrHis: 0.472 ± 0.15
0.532TyrIle: 0.532 ± 0.208
1.004TyrLys: 1.004 ± 0.263
1.004TyrLeu: 1.004 ± 0.288
0.413TyrMet: 0.413 ± 0.139
0.532TyrAsn: 0.532 ± 0.153
1.476TyrPro: 1.476 ± 0.286
0.532TyrGln: 0.532 ± 0.197
2.008TyrArg: 2.008 ± 0.342
1.358TyrSer: 1.358 ± 0.274
2.067TyrThr: 2.067 ± 0.349
1.595TyrVal: 1.595 ± 0.246
0.413TyrTrp: 0.413 ± 0.176
0.413TyrTyr: 0.413 ± 0.143
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 106 proteins (16934 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski