Amino acid dipepetide frequency for Flavobacterium phage fF4

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.199AlaAla: 0.199 ± 0.128
0.496AlaCys: 0.496 ± 0.21
3.873AlaAsp: 3.873 ± 0.878
2.78AlaGlu: 2.78 ± 0.613
2.979AlaPhe: 2.979 ± 0.469
3.475AlaGly: 3.475 ± 1.0
1.092AlaHis: 1.092 ± 0.352
4.667AlaIle: 4.667 ± 0.649
5.362AlaLys: 5.362 ± 0.767
6.951AlaLeu: 6.951 ± 0.866
1.986AlaMet: 1.986 ± 0.432
2.979AlaAsn: 2.979 ± 0.46
1.39AlaPro: 1.39 ± 0.366
3.177AlaGln: 3.177 ± 0.483
1.787AlaArg: 1.787 ± 0.378
5.858AlaSer: 5.858 ± 0.966
3.873AlaThr: 3.873 ± 0.564
3.674AlaVal: 3.674 ± 0.508
0.496AlaTrp: 0.496 ± 0.184
3.078AlaTyr: 3.078 ± 0.627
0.0AlaXaa: 0.0 ± 0.0
Cys
0.496CysAla: 0.496 ± 0.218
0.199CysCys: 0.199 ± 0.144
0.298CysAsp: 0.298 ± 0.149
1.092CysGlu: 1.092 ± 0.361
0.596CysPhe: 0.596 ± 0.22
0.199CysGly: 0.199 ± 0.134
0.199CysHis: 0.199 ± 0.144
0.298CysIle: 0.298 ± 0.155
1.092CysLys: 1.092 ± 0.294
0.894CysLeu: 0.894 ± 0.339
0.199CysMet: 0.199 ± 0.143
0.596CysAsn: 0.596 ± 0.224
0.298CysPro: 0.298 ± 0.19
0.596CysGln: 0.596 ± 0.214
0.695CysArg: 0.695 ± 0.232
0.695CysSer: 0.695 ± 0.272
0.596CysThr: 0.596 ± 0.229
0.695CysVal: 0.695 ± 0.215
0.099CysTrp: 0.099 ± 0.106
0.099CysTyr: 0.099 ± 0.104
0.0CysXaa: 0.0 ± 0.0
Asp
2.88AspAla: 2.88 ± 0.493
0.794AspCys: 0.794 ± 0.239
2.88AspAsp: 2.88 ± 0.491
2.78AspGlu: 2.78 ± 0.491
3.674AspPhe: 3.674 ± 0.702
3.773AspGly: 3.773 ± 0.611
0.596AspHis: 0.596 ± 0.206
4.369AspIle: 4.369 ± 0.757
5.858AspLys: 5.858 ± 0.664
5.461AspLeu: 5.461 ± 0.689
1.291AspMet: 1.291 ± 0.326
2.979AspAsn: 2.979 ± 0.671
1.589AspPro: 1.589 ± 0.311
1.489AspGln: 1.489 ± 0.415
1.092AspArg: 1.092 ± 0.296
2.979AspSer: 2.979 ± 0.513
3.674AspThr: 3.674 ± 0.537
3.376AspVal: 3.376 ± 0.557
1.192AspTrp: 1.192 ± 0.392
1.787AspTyr: 1.787 ± 0.472
0.0AspXaa: 0.0 ± 0.0
Glu
4.468GluAla: 4.468 ± 0.579
0.794GluCys: 0.794 ± 0.24
2.284GluAsp: 2.284 ± 0.476
5.064GluGlu: 5.064 ± 0.864
3.475GluPhe: 3.475 ± 0.692
3.575GluGly: 3.575 ± 0.599
1.589GluHis: 1.589 ± 0.487
4.965GluIle: 4.965 ± 0.649
6.256GluLys: 6.256 ± 0.821
7.646GluLeu: 7.646 ± 0.899
1.39GluMet: 1.39 ± 0.291
4.369GluAsn: 4.369 ± 0.543
1.489GluPro: 1.489 ± 0.311
2.88GluGln: 2.88 ± 0.636
2.582GluArg: 2.582 ± 0.492
3.475GluSer: 3.475 ± 0.559
4.071GluThr: 4.071 ± 0.751
4.965GluVal: 4.965 ± 0.573
0.596GluTrp: 0.596 ± 0.232
2.383GluTyr: 2.383 ± 0.378
0.0GluXaa: 0.0 ± 0.0
Phe
1.986PheAla: 1.986 ± 0.414
0.496PheCys: 0.496 ± 0.207
3.674PheAsp: 3.674 ± 0.614
4.865PheGlu: 4.865 ± 0.755
0.894PhePhe: 0.894 ± 0.382
2.383PheGly: 2.383 ± 0.5
0.496PheHis: 0.496 ± 0.327
2.085PheIle: 2.085 ± 0.472
5.461PheLys: 5.461 ± 0.612
3.376PheLeu: 3.376 ± 0.651
1.589PheMet: 1.589 ± 0.459
3.078PheAsn: 3.078 ± 0.604
1.887PhePro: 1.887 ± 0.363
1.589PheGln: 1.589 ± 0.368
1.092PheArg: 1.092 ± 0.268
3.674PheSer: 3.674 ± 0.548
2.383PheThr: 2.383 ± 0.464
3.376PheVal: 3.376 ± 0.606
0.298PheTrp: 0.298 ± 0.191
2.482PheTyr: 2.482 ± 0.645
0.0PheXaa: 0.0 ± 0.0
Gly
3.873GlyAla: 3.873 ± 0.912
0.496GlyCys: 0.496 ± 0.242
3.277GlyAsp: 3.277 ± 0.627
3.873GlyGlu: 3.873 ± 0.684
3.972GlyPhe: 3.972 ± 0.436
2.184GlyGly: 2.184 ± 0.593
0.298GlyHis: 0.298 ± 0.183
5.064GlyIle: 5.064 ± 0.867
5.163GlyLys: 5.163 ± 0.881
5.66GlyLeu: 5.66 ± 0.829
1.092GlyMet: 1.092 ± 0.31
2.78GlyAsn: 2.78 ± 0.725
0.099GlyPro: 0.099 ± 0.095
2.085GlyGln: 2.085 ± 0.449
1.589GlyArg: 1.589 ± 0.396
3.376GlySer: 3.376 ± 0.503
3.177GlyThr: 3.177 ± 0.48
3.873GlyVal: 3.873 ± 0.656
0.496GlyTrp: 0.496 ± 0.223
2.582GlyTyr: 2.582 ± 0.441
0.0GlyXaa: 0.0 ± 0.0
His
0.596HisAla: 0.596 ± 0.193
0.298HisCys: 0.298 ± 0.15
0.397HisAsp: 0.397 ± 0.203
0.397HisGlu: 0.397 ± 0.174
0.993HisPhe: 0.993 ± 0.289
1.092HisGly: 1.092 ± 0.287
0.0HisHis: 0.0 ± 0.0
1.192HisIle: 1.192 ± 0.351
1.688HisLys: 1.688 ± 0.329
1.39HisLeu: 1.39 ± 0.375
0.199HisMet: 0.199 ± 0.136
0.993HisAsn: 0.993 ± 0.319
0.596HisPro: 0.596 ± 0.253
0.993HisGln: 0.993 ± 0.272
0.695HisArg: 0.695 ± 0.323
0.596HisSer: 0.596 ± 0.271
1.887HisThr: 1.887 ± 0.375
0.596HisVal: 0.596 ± 0.238
0.496HisTrp: 0.496 ± 0.199
0.496HisTyr: 0.496 ± 0.189
0.0HisXaa: 0.0 ± 0.0
Ile
4.071IleAla: 4.071 ± 0.688
0.496IleCys: 0.496 ± 0.214
4.865IleAsp: 4.865 ± 0.738
7.348IleGlu: 7.348 ± 0.983
2.184IlePhe: 2.184 ± 0.464
4.468IleGly: 4.468 ± 0.896
1.092IleHis: 1.092 ± 0.282
6.057IleIle: 6.057 ± 0.849
5.958IleLys: 5.958 ± 0.747
4.27IleLeu: 4.27 ± 0.659
1.192IleMet: 1.192 ± 0.404
6.057IleAsn: 6.057 ± 0.557
2.284IlePro: 2.284 ± 0.503
2.979IleGln: 2.979 ± 0.419
3.376IleArg: 3.376 ± 0.645
3.773IleSer: 3.773 ± 0.577
4.766IleThr: 4.766 ± 0.663
4.468IleVal: 4.468 ± 0.718
0.894IleTrp: 0.894 ± 0.262
2.582IleTyr: 2.582 ± 0.543
0.0IleXaa: 0.0 ± 0.0
Lys
6.355LysAla: 6.355 ± 0.763
0.596LysCys: 0.596 ± 0.257
4.369LysAsp: 4.369 ± 1.004
7.546LysGlu: 7.546 ± 0.788
3.972LysPhe: 3.972 ± 0.57
5.66LysGly: 5.66 ± 0.689
1.192LysHis: 1.192 ± 0.284
7.05LysIle: 7.05 ± 0.817
6.951LysLys: 6.951 ± 0.715
9.83LysLeu: 9.83 ± 1.143
2.979LysMet: 2.979 ± 0.414
5.163LysAsn: 5.163 ± 0.862
3.177LysPro: 3.177 ± 0.646
3.972LysGln: 3.972 ± 0.624
4.766LysArg: 4.766 ± 0.717
5.561LysSer: 5.561 ± 0.915
7.149LysThr: 7.149 ± 0.672
6.355LysVal: 6.355 ± 0.695
0.496LysTrp: 0.496 ± 0.214
2.88LysTyr: 2.88 ± 0.604
0.0LysXaa: 0.0 ± 0.0
Leu
6.156LeuAla: 6.156 ± 0.623
0.695LeuCys: 0.695 ± 0.268
4.965LeuAsp: 4.965 ± 0.728
7.05LeuGlu: 7.05 ± 0.824
5.561LeuPhe: 5.561 ± 0.699
4.667LeuGly: 4.667 ± 0.701
0.894LeuHis: 0.894 ± 0.303
6.355LeuIle: 6.355 ± 0.976
8.937LeuLys: 8.937 ± 1.09
9.83LeuLeu: 9.83 ± 1.05
1.887LeuMet: 1.887 ± 0.374
6.951LeuAsn: 6.951 ± 0.778
3.277LeuPro: 3.277 ± 0.514
3.277LeuGln: 3.277 ± 0.587
3.277LeuArg: 3.277 ± 0.562
5.561LeuSer: 5.561 ± 0.659
5.064LeuThr: 5.064 ± 0.854
4.766LeuVal: 4.766 ± 0.702
1.092LeuTrp: 1.092 ± 0.366
2.979LeuTyr: 2.979 ± 0.489
0.0LeuXaa: 0.0 ± 0.0
Met
2.582MetAla: 2.582 ± 0.48
0.0MetCys: 0.0 ± 0.0
1.489MetAsp: 1.489 ± 0.39
1.986MetGlu: 1.986 ± 0.55
0.695MetPhe: 0.695 ± 0.241
1.489MetGly: 1.489 ± 0.393
0.298MetHis: 0.298 ± 0.165
1.489MetIle: 1.489 ± 0.361
1.787MetLys: 1.787 ± 0.377
1.986MetLeu: 1.986 ± 0.341
0.695MetMet: 0.695 ± 0.231
1.589MetAsn: 1.589 ± 0.331
0.695MetPro: 0.695 ± 0.319
1.688MetGln: 1.688 ± 0.544
0.894MetArg: 0.894 ± 0.324
1.589MetSer: 1.589 ± 0.421
1.291MetThr: 1.291 ± 0.335
1.589MetVal: 1.589 ± 0.392
0.199MetTrp: 0.199 ± 0.142
0.596MetTyr: 0.596 ± 0.29
0.0MetXaa: 0.0 ± 0.0
Asn
3.277AsnAla: 3.277 ± 0.594
0.794AsnCys: 0.794 ± 0.304
3.575AsnAsp: 3.575 ± 0.514
3.773AsnGlu: 3.773 ± 0.71
1.887AsnPhe: 1.887 ± 0.408
3.575AsnGly: 3.575 ± 0.667
1.787AsnHis: 1.787 ± 0.346
4.27AsnIle: 4.27 ± 0.774
6.553AsnLys: 6.553 ± 0.68
5.461AsnLeu: 5.461 ± 0.743
1.489AsnMet: 1.489 ± 0.312
4.369AsnAsn: 4.369 ± 0.67
2.383AsnPro: 2.383 ± 0.395
3.773AsnGln: 3.773 ± 0.561
1.986AsnArg: 1.986 ± 0.397
3.475AsnSer: 3.475 ± 0.6
3.773AsnThr: 3.773 ± 0.564
3.873AsnVal: 3.873 ± 0.58
1.092AsnTrp: 1.092 ± 0.312
3.277AsnTyr: 3.277 ± 0.552
0.0AsnXaa: 0.0 ± 0.0
Pro
1.688ProAla: 1.688 ± 0.421
0.298ProCys: 0.298 ± 0.146
2.085ProAsp: 2.085 ± 0.402
1.39ProGlu: 1.39 ± 0.337
1.192ProPhe: 1.192 ± 0.4
0.199ProGly: 0.199 ± 0.122
0.596ProHis: 0.596 ± 0.227
2.184ProIle: 2.184 ± 0.551
3.674ProLys: 3.674 ± 0.475
3.773ProLeu: 3.773 ± 0.607
0.794ProMet: 0.794 ± 0.217
1.887ProAsn: 1.887 ± 0.441
0.596ProPro: 0.596 ± 0.265
1.291ProGln: 1.291 ± 0.309
0.596ProArg: 0.596 ± 0.232
1.589ProSer: 1.589 ± 0.409
1.787ProThr: 1.787 ± 0.42
1.39ProVal: 1.39 ± 0.336
0.397ProTrp: 0.397 ± 0.2
0.695ProTyr: 0.695 ± 0.269
0.0ProXaa: 0.0 ± 0.0
Gln
2.681GlnAla: 2.681 ± 0.566
0.099GlnCys: 0.099 ± 0.115
1.688GlnAsp: 1.688 ± 0.417
3.277GlnGlu: 3.277 ± 0.508
1.986GlnPhe: 1.986 ± 0.489
2.88GlnGly: 2.88 ± 0.504
0.993GlnHis: 0.993 ± 0.301
2.681GlnIle: 2.681 ± 0.532
3.773GlnLys: 3.773 ± 0.601
4.17GlnLeu: 4.17 ± 0.543
0.894GlnMet: 0.894 ± 0.275
2.681GlnAsn: 2.681 ± 0.362
0.894GlnPro: 0.894 ± 0.312
1.39GlnGln: 1.39 ± 0.334
1.787GlnArg: 1.787 ± 0.471
3.376GlnSer: 3.376 ± 0.573
2.184GlnThr: 2.184 ± 0.482
1.688GlnVal: 1.688 ± 0.425
1.192GlnTrp: 1.192 ± 0.297
1.489GlnTyr: 1.489 ± 0.518
0.0GlnXaa: 0.0 ± 0.0
Arg
1.39ArgAla: 1.39 ± 0.406
0.397ArgCys: 0.397 ± 0.202
0.993ArgAsp: 0.993 ± 0.307
1.589ArgGlu: 1.589 ± 0.338
1.688ArgPhe: 1.688 ± 0.486
1.589ArgGly: 1.589 ± 0.414
0.596ArgHis: 0.596 ± 0.196
3.277ArgIle: 3.277 ± 0.631
4.766ArgLys: 4.766 ± 0.74
3.873ArgLeu: 3.873 ± 0.714
0.993ArgMet: 0.993 ± 0.295
2.482ArgAsn: 2.482 ± 0.425
0.695ArgPro: 0.695 ± 0.298
1.39ArgGln: 1.39 ± 0.427
1.192ArgArg: 1.192 ± 0.489
2.284ArgSer: 2.284 ± 0.435
2.284ArgThr: 2.284 ± 0.367
1.589ArgVal: 1.589 ± 0.403
0.596ArgTrp: 0.596 ± 0.264
1.39ArgTyr: 1.39 ± 0.345
0.0ArgXaa: 0.0 ± 0.0
Ser
5.263SerAla: 5.263 ± 0.782
0.596SerCys: 0.596 ± 0.23
3.972SerAsp: 3.972 ± 0.445
3.575SerGlu: 3.575 ± 0.496
2.582SerPhe: 2.582 ± 0.409
4.568SerGly: 4.568 ± 0.657
0.993SerHis: 0.993 ± 0.333
4.865SerIle: 4.865 ± 0.598
6.653SerLys: 6.653 ± 0.766
5.362SerLeu: 5.362 ± 0.764
1.39SerMet: 1.39 ± 0.323
3.475SerAsn: 3.475 ± 0.643
1.489SerPro: 1.489 ± 0.331
3.277SerGln: 3.277 ± 0.549
1.887SerArg: 1.887 ± 0.497
3.972SerSer: 3.972 ± 0.776
2.085SerThr: 2.085 ± 0.448
3.873SerVal: 3.873 ± 0.69
0.794SerTrp: 0.794 ± 0.254
2.582SerTyr: 2.582 ± 0.52
0.0SerXaa: 0.0 ± 0.0
Thr
4.865ThrAla: 4.865 ± 0.602
0.695ThrCys: 0.695 ± 0.24
3.873ThrAsp: 3.873 ± 0.777
3.277ThrGlu: 3.277 ± 0.479
2.582ThrPhe: 2.582 ± 0.479
4.17ThrGly: 4.17 ± 0.635
0.596ThrHis: 0.596 ± 0.208
5.263ThrIle: 5.263 ± 0.644
4.369ThrLys: 4.369 ± 0.881
3.773ThrLeu: 3.773 ± 0.576
1.589ThrMet: 1.589 ± 0.371
3.873ThrAsn: 3.873 ± 0.548
2.085ThrPro: 2.085 ± 0.369
1.787ThrGln: 1.787 ± 0.427
1.489ThrArg: 1.489 ± 0.373
3.873ThrSer: 3.873 ± 0.534
3.277ThrThr: 3.277 ± 0.686
4.568ThrVal: 4.568 ± 0.662
0.596ThrTrp: 0.596 ± 0.223
1.589ThrTyr: 1.589 ± 0.424
0.0ThrXaa: 0.0 ± 0.0
Val
3.773ValAla: 3.773 ± 0.653
0.794ValCys: 0.794 ± 0.319
3.475ValAsp: 3.475 ± 0.596
3.873ValGlu: 3.873 ± 0.627
3.575ValPhe: 3.575 ± 0.617
2.979ValGly: 2.979 ± 0.562
1.291ValHis: 1.291 ± 0.437
3.873ValIle: 3.873 ± 0.658
6.256ValLys: 6.256 ± 0.776
4.667ValLeu: 4.667 ± 0.772
1.589ValMet: 1.589 ± 0.437
4.17ValAsn: 4.17 ± 0.45
2.284ValPro: 2.284 ± 0.467
1.589ValGln: 1.589 ± 0.362
2.482ValArg: 2.482 ± 0.453
4.766ValSer: 4.766 ± 0.782
2.681ValThr: 2.681 ± 0.502
4.071ValVal: 4.071 ± 0.673
0.397ValTrp: 0.397 ± 0.175
2.582ValTyr: 2.582 ± 0.528
0.0ValXaa: 0.0 ± 0.0
Trp
1.192TrpAla: 1.192 ± 0.308
0.397TrpCys: 0.397 ± 0.18
0.794TrpAsp: 0.794 ± 0.336
0.298TrpGlu: 0.298 ± 0.152
1.291TrpPhe: 1.291 ± 0.264
0.397TrpGly: 0.397 ± 0.188
0.298TrpHis: 0.298 ± 0.154
0.397TrpIle: 0.397 ± 0.157
0.993TrpLys: 0.993 ± 0.333
1.489TrpLeu: 1.489 ± 0.329
0.298TrpMet: 0.298 ± 0.172
0.496TrpAsn: 0.496 ± 0.205
0.0TrpPro: 0.0 ± 0.0
0.496TrpGln: 0.496 ± 0.238
0.496TrpArg: 0.496 ± 0.183
0.794TrpSer: 0.794 ± 0.274
0.496TrpThr: 0.496 ± 0.198
0.496TrpVal: 0.496 ± 0.213
0.199TrpTrp: 0.199 ± 0.148
0.894TrpTyr: 0.894 ± 0.31
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.482TyrAla: 2.482 ± 0.428
0.496TyrCys: 0.496 ± 0.208
1.688TyrAsp: 1.688 ± 0.318
2.482TyrGlu: 2.482 ± 0.451
1.39TyrPhe: 1.39 ± 0.332
1.688TyrGly: 1.688 ± 0.383
0.695TyrHis: 0.695 ± 0.257
2.78TyrIle: 2.78 ± 0.492
4.468TyrLys: 4.468 ± 0.603
3.376TyrLeu: 3.376 ± 0.511
0.993TyrMet: 0.993 ± 0.289
3.475TyrAsn: 3.475 ± 0.662
0.894TyrPro: 0.894 ± 0.331
2.085TyrGln: 2.085 ± 0.553
1.291TyrArg: 1.291 ± 0.381
1.986TyrSer: 1.986 ± 0.416
1.688TyrThr: 1.688 ± 0.497
1.887TyrVal: 1.887 ± 0.409
0.596TyrTrp: 0.596 ± 0.253
1.589TyrTyr: 1.589 ± 0.346
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 50 proteins (10072 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski