Amino acid dipepetide frequency for Vibrio phage JSF35

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.385AlaAla: 9.385 ± 1.214
0.695AlaCys: 0.695 ± 0.285
4.866AlaAsp: 4.866 ± 0.774
5.909AlaGlu: 5.909 ± 0.953
2.694AlaPhe: 2.694 ± 0.467
5.648AlaGly: 5.648 ± 0.866
1.13AlaHis: 1.13 ± 0.247
5.388AlaIle: 5.388 ± 0.683
6.778AlaLys: 6.778 ± 0.72
7.386AlaLeu: 7.386 ± 0.898
3.128AlaMet: 3.128 ± 0.645
4.258AlaAsn: 4.258 ± 0.579
2.346AlaPro: 2.346 ± 0.398
3.91AlaGln: 3.91 ± 0.698
3.65AlaArg: 3.65 ± 0.476
5.214AlaSer: 5.214 ± 0.648
3.823AlaThr: 3.823 ± 0.814
5.648AlaVal: 5.648 ± 0.68
1.303AlaTrp: 1.303 ± 0.506
3.823AlaTyr: 3.823 ± 0.65
0.0AlaXaa: 0.0 ± 0.0
Cys
0.956CysAla: 0.956 ± 0.304
0.174CysCys: 0.174 ± 0.111
0.521CysAsp: 0.521 ± 0.233
0.782CysGlu: 0.782 ± 0.274
0.782CysPhe: 0.782 ± 0.292
0.695CysGly: 0.695 ± 0.237
0.521CysHis: 0.521 ± 0.175
0.521CysIle: 0.521 ± 0.234
0.348CysLys: 0.348 ± 0.219
0.695CysLeu: 0.695 ± 0.293
0.348CysMet: 0.348 ± 0.208
0.087CysAsn: 0.087 ± 0.101
0.608CysPro: 0.608 ± 0.208
0.348CysGln: 0.348 ± 0.173
0.782CysArg: 0.782 ± 0.367
0.261CysSer: 0.261 ± 0.155
0.174CysThr: 0.174 ± 0.131
0.608CysVal: 0.608 ± 0.214
0.087CysTrp: 0.087 ± 0.105
0.174CysTyr: 0.174 ± 0.18
0.0CysXaa: 0.0 ± 0.0
Asp
5.388AspAla: 5.388 ± 0.893
0.869AspCys: 0.869 ± 0.306
3.91AspAsp: 3.91 ± 0.779
3.91AspGlu: 3.91 ± 0.647
2.694AspPhe: 2.694 ± 0.537
4.345AspGly: 4.345 ± 0.537
0.782AspHis: 0.782 ± 0.238
3.737AspIle: 3.737 ± 0.541
3.91AspLys: 3.91 ± 0.682
3.997AspLeu: 3.997 ± 0.537
1.651AspMet: 1.651 ± 0.354
2.172AspAsn: 2.172 ± 0.437
2.607AspPro: 2.607 ± 0.594
1.477AspGln: 1.477 ± 0.386
2.346AspArg: 2.346 ± 0.462
3.65AspSer: 3.65 ± 0.672
4.171AspThr: 4.171 ± 0.656
4.866AspVal: 4.866 ± 0.787
1.13AspTrp: 1.13 ± 0.371
2.694AspTyr: 2.694 ± 0.629
0.0AspXaa: 0.0 ± 0.0
Glu
7.994GluAla: 7.994 ± 0.902
0.782GluCys: 0.782 ± 0.249
5.648GluAsp: 5.648 ± 0.858
6.343GluGlu: 6.343 ± 0.835
2.346GluPhe: 2.346 ± 0.316
5.301GluGly: 5.301 ± 0.717
1.564GluHis: 1.564 ± 0.466
4.084GluIle: 4.084 ± 0.679
3.563GluLys: 3.563 ± 0.463
6.257GluLeu: 6.257 ± 0.848
2.868GluMet: 2.868 ± 0.569
2.52GluAsn: 2.52 ± 0.376
1.651GluPro: 1.651 ± 0.382
3.737GluGln: 3.737 ± 0.494
3.91GluArg: 3.91 ± 0.48
4.779GluSer: 4.779 ± 0.892
3.041GluThr: 3.041 ± 0.448
4.605GluVal: 4.605 ± 0.689
1.217GluTrp: 1.217 ± 0.296
2.954GluTyr: 2.954 ± 0.446
0.0GluXaa: 0.0 ± 0.0
Phe
2.433PheAla: 2.433 ± 0.477
0.608PheCys: 0.608 ± 0.219
2.781PheAsp: 2.781 ± 0.528
3.041PheGlu: 3.041 ± 0.483
1.303PhePhe: 1.303 ± 0.434
3.128PheGly: 3.128 ± 0.568
0.695PheHis: 0.695 ± 0.303
2.259PheIle: 2.259 ± 0.492
3.128PheLys: 3.128 ± 0.584
3.389PheLeu: 3.389 ± 0.598
1.477PheMet: 1.477 ± 0.409
2.781PheAsn: 2.781 ± 0.575
1.043PhePro: 1.043 ± 0.311
1.39PheGln: 1.39 ± 0.284
1.651PheArg: 1.651 ± 0.334
2.781PheSer: 2.781 ± 0.463
2.52PheThr: 2.52 ± 0.513
2.52PheVal: 2.52 ± 0.575
0.261PheTrp: 0.261 ± 0.16
1.477PheTyr: 1.477 ± 0.308
0.0PheXaa: 0.0 ± 0.0
Gly
6.343GlyAla: 6.343 ± 0.957
0.434GlyCys: 0.434 ± 0.222
4.084GlyAsp: 4.084 ± 0.629
5.127GlyGlu: 5.127 ± 0.654
2.52GlyPhe: 2.52 ± 0.385
5.214GlyGly: 5.214 ± 0.648
1.477GlyHis: 1.477 ± 0.311
3.302GlyIle: 3.302 ± 0.481
5.301GlyLys: 5.301 ± 0.911
6.604GlyLeu: 6.604 ± 0.969
2.433GlyMet: 2.433 ± 0.462
3.215GlyAsn: 3.215 ± 0.497
0.087GlyPro: 0.087 ± 0.083
2.781GlyGln: 2.781 ± 0.477
3.823GlyArg: 3.823 ± 0.431
3.823GlySer: 3.823 ± 0.66
3.91GlyThr: 3.91 ± 0.565
4.258GlyVal: 4.258 ± 0.688
0.782GlyTrp: 0.782 ± 0.267
3.302GlyTyr: 3.302 ± 0.428
0.0GlyXaa: 0.0 ± 0.0
His
1.217HisAla: 1.217 ± 0.297
0.261HisCys: 0.261 ± 0.169
1.13HisAsp: 1.13 ± 0.303
1.477HisGlu: 1.477 ± 0.324
1.39HisPhe: 1.39 ± 0.302
0.956HisGly: 0.956 ± 0.279
0.174HisHis: 0.174 ± 0.108
1.217HisIle: 1.217 ± 0.252
0.782HisLys: 0.782 ± 0.276
1.477HisLeu: 1.477 ± 0.388
0.434HisMet: 0.434 ± 0.228
0.695HisAsn: 0.695 ± 0.257
0.782HisPro: 0.782 ± 0.282
0.348HisGln: 0.348 ± 0.158
1.043HisArg: 1.043 ± 0.375
1.13HisSer: 1.13 ± 0.342
0.695HisThr: 0.695 ± 0.319
1.564HisVal: 1.564 ± 0.34
0.608HisTrp: 0.608 ± 0.229
0.434HisTyr: 0.434 ± 0.215
0.0HisXaa: 0.0 ± 0.0
Ile
4.084IleAla: 4.084 ± 0.547
0.782IleCys: 0.782 ± 0.334
2.868IleAsp: 2.868 ± 0.492
4.084IleGlu: 4.084 ± 0.668
1.13IlePhe: 1.13 ± 0.326
3.389IleGly: 3.389 ± 0.468
0.869IleHis: 0.869 ± 0.277
2.868IleIle: 2.868 ± 0.579
4.953IleLys: 4.953 ± 0.636
4.605IleLeu: 4.605 ± 0.616
1.303IleMet: 1.303 ± 0.321
2.868IleAsn: 2.868 ± 0.489
2.607IlePro: 2.607 ± 0.463
2.433IleGln: 2.433 ± 0.491
3.302IleArg: 3.302 ± 0.506
2.954IleSer: 2.954 ± 0.514
3.215IleThr: 3.215 ± 0.53
4.084IleVal: 4.084 ± 0.596
0.521IleTrp: 0.521 ± 0.197
1.39IleTyr: 1.39 ± 0.295
0.0IleXaa: 0.0 ± 0.0
Lys
6.517LysAla: 6.517 ± 0.798
0.695LysCys: 0.695 ± 0.278
4.258LysAsp: 4.258 ± 0.692
4.692LysGlu: 4.692 ± 0.67
2.954LysPhe: 2.954 ± 0.494
4.692LysGly: 4.692 ± 0.653
1.39LysHis: 1.39 ± 0.459
3.041LysIle: 3.041 ± 0.387
5.388LysLys: 5.388 ± 0.837
5.735LysLeu: 5.735 ± 0.842
2.346LysMet: 2.346 ± 0.418
2.346LysAsn: 2.346 ± 0.504
3.128LysPro: 3.128 ± 0.765
3.476LysGln: 3.476 ± 0.653
3.997LysArg: 3.997 ± 0.656
4.084LysSer: 4.084 ± 0.632
3.997LysThr: 3.997 ± 0.508
4.432LysVal: 4.432 ± 0.781
0.956LysTrp: 0.956 ± 0.365
2.781LysTyr: 2.781 ± 0.563
0.0LysXaa: 0.0 ± 0.0
Leu
6.865LeuAla: 6.865 ± 0.769
0.608LeuCys: 0.608 ± 0.272
4.432LeuAsp: 4.432 ± 0.735
6.865LeuGlu: 6.865 ± 0.685
2.52LeuPhe: 2.52 ± 0.495
5.127LeuGly: 5.127 ± 0.795
1.39LeuHis: 1.39 ± 0.323
5.127LeuIle: 5.127 ± 0.546
7.734LeuLys: 7.734 ± 0.822
5.648LeuLeu: 5.648 ± 0.797
2.346LeuMet: 2.346 ± 0.384
4.692LeuAsn: 4.692 ± 0.68
2.52LeuPro: 2.52 ± 0.405
3.91LeuGln: 3.91 ± 0.52
5.127LeuArg: 5.127 ± 0.601
4.258LeuSer: 4.258 ± 0.511
4.866LeuThr: 4.866 ± 0.732
4.432LeuVal: 4.432 ± 0.473
1.043LeuTrp: 1.043 ± 0.28
2.781LeuTyr: 2.781 ± 0.464
0.0LeuXaa: 0.0 ± 0.0
Met
3.128MetAla: 3.128 ± 0.49
0.348MetCys: 0.348 ± 0.164
1.217MetAsp: 1.217 ± 0.336
2.433MetGlu: 2.433 ± 0.392
1.043MetPhe: 1.043 ± 0.325
1.912MetGly: 1.912 ± 0.532
0.348MetHis: 0.348 ± 0.184
1.303MetIle: 1.303 ± 0.251
1.825MetLys: 1.825 ± 0.461
1.912MetLeu: 1.912 ± 0.366
0.348MetMet: 0.348 ± 0.162
1.477MetAsn: 1.477 ± 0.357
1.651MetPro: 1.651 ± 0.407
1.564MetGln: 1.564 ± 0.41
1.217MetArg: 1.217 ± 0.358
1.912MetSer: 1.912 ± 0.364
2.694MetThr: 2.694 ± 0.603
2.172MetVal: 2.172 ± 0.361
0.174MetTrp: 0.174 ± 0.108
0.348MetTyr: 0.348 ± 0.166
0.0MetXaa: 0.0 ± 0.0
Asn
3.476AsnAla: 3.476 ± 0.674
0.434AsnCys: 0.434 ± 0.193
2.868AsnAsp: 2.868 ± 0.499
2.607AsnGlu: 2.607 ± 0.502
1.825AsnPhe: 1.825 ± 0.362
3.563AsnGly: 3.563 ± 0.656
1.043AsnHis: 1.043 ± 0.298
2.259AsnIle: 2.259 ± 0.527
2.607AsnLys: 2.607 ± 0.455
4.345AsnLeu: 4.345 ± 0.62
1.564AsnMet: 1.564 ± 0.394
2.172AsnAsn: 2.172 ± 0.33
2.954AsnPro: 2.954 ± 0.427
1.564AsnGln: 1.564 ± 0.426
2.259AsnArg: 2.259 ± 0.524
2.52AsnSer: 2.52 ± 0.611
2.346AsnThr: 2.346 ± 0.375
3.563AsnVal: 3.563 ± 0.422
0.521AsnTrp: 0.521 ± 0.244
1.564AsnTyr: 1.564 ± 0.344
0.0AsnXaa: 0.0 ± 0.0
Pro
2.954ProAla: 2.954 ± 0.61
0.261ProCys: 0.261 ± 0.134
2.694ProAsp: 2.694 ± 0.568
4.345ProGlu: 4.345 ± 0.55
1.477ProPhe: 1.477 ± 0.387
0.0ProGly: 0.0 ± 0.0
0.869ProHis: 0.869 ± 0.195
1.217ProIle: 1.217 ± 0.293
1.825ProLys: 1.825 ± 0.428
2.52ProLeu: 2.52 ± 0.405
0.869ProMet: 0.869 ± 0.292
2.607ProAsn: 2.607 ± 0.537
0.695ProPro: 0.695 ± 0.215
1.564ProGln: 1.564 ± 0.482
1.13ProArg: 1.13 ± 0.302
3.128ProSer: 3.128 ± 0.467
1.825ProThr: 1.825 ± 0.421
3.737ProVal: 3.737 ± 0.478
0.608ProTrp: 0.608 ± 0.262
1.13ProTyr: 1.13 ± 0.381
0.0ProXaa: 0.0 ± 0.0
Gln
5.301GlnAla: 5.301 ± 0.979
0.261GlnCys: 0.261 ± 0.148
2.433GlnAsp: 2.433 ± 0.575
3.302GlnGlu: 3.302 ± 0.668
1.825GlnPhe: 1.825 ± 0.358
2.607GlnGly: 2.607 ± 0.461
0.521GlnHis: 0.521 ± 0.194
2.172GlnIle: 2.172 ± 0.429
2.259GlnLys: 2.259 ± 0.46
4.171GlnLeu: 4.171 ± 0.638
0.782GlnMet: 0.782 ± 0.267
0.956GlnAsn: 0.956 ± 0.295
0.956GlnPro: 0.956 ± 0.253
2.086GlnGln: 2.086 ± 0.574
2.086GlnArg: 2.086 ± 0.379
2.781GlnSer: 2.781 ± 0.431
2.346GlnThr: 2.346 ± 0.502
3.65GlnVal: 3.65 ± 0.56
0.608GlnTrp: 0.608 ± 0.247
1.303GlnTyr: 1.303 ± 0.294
0.0GlnXaa: 0.0 ± 0.0
Arg
3.997ArgAla: 3.997 ± 0.495
0.521ArgCys: 0.521 ± 0.265
3.215ArgAsp: 3.215 ± 0.474
3.476ArgGlu: 3.476 ± 0.634
2.433ArgPhe: 2.433 ± 0.393
3.563ArgGly: 3.563 ± 0.563
0.434ArgHis: 0.434 ± 0.207
3.041ArgIle: 3.041 ± 0.583
3.737ArgLys: 3.737 ± 0.638
3.737ArgLeu: 3.737 ± 0.611
1.043ArgMet: 1.043 ± 0.289
2.433ArgAsn: 2.433 ± 0.456
2.172ArgPro: 2.172 ± 0.343
2.433ArgGln: 2.433 ± 0.494
2.346ArgArg: 2.346 ± 0.377
2.954ArgSer: 2.954 ± 0.544
2.607ArgThr: 2.607 ± 0.51
3.215ArgVal: 3.215 ± 0.577
0.782ArgTrp: 0.782 ± 0.233
1.564ArgTyr: 1.564 ± 0.367
0.0ArgXaa: 0.0 ± 0.0
Ser
4.519SerAla: 4.519 ± 0.645
0.608SerCys: 0.608 ± 0.244
3.91SerAsp: 3.91 ± 0.609
3.041SerGlu: 3.041 ± 0.605
3.041SerPhe: 3.041 ± 0.455
5.996SerGly: 5.996 ± 0.736
0.869SerHis: 0.869 ± 0.299
3.65SerIle: 3.65 ± 0.526
4.432SerLys: 4.432 ± 0.757
5.735SerLeu: 5.735 ± 1.007
1.651SerMet: 1.651 ± 0.408
2.607SerAsn: 2.607 ± 0.547
1.912SerPro: 1.912 ± 0.396
1.825SerGln: 1.825 ± 0.31
2.172SerArg: 2.172 ± 0.341
3.65SerSer: 3.65 ± 0.748
2.868SerThr: 2.868 ± 0.496
2.868SerVal: 2.868 ± 0.385
0.348SerTrp: 0.348 ± 0.129
2.694SerTyr: 2.694 ± 0.428
0.0SerXaa: 0.0 ± 0.0
Thr
3.215ThrAla: 3.215 ± 0.668
0.348ThrCys: 0.348 ± 0.156
2.694ThrAsp: 2.694 ± 0.432
3.997ThrGlu: 3.997 ± 0.543
3.823ThrPhe: 3.823 ± 0.702
5.735ThrGly: 5.735 ± 0.665
1.043ThrHis: 1.043 ± 0.271
3.563ThrIle: 3.563 ± 0.62
3.389ThrLys: 3.389 ± 0.684
5.127ThrLeu: 5.127 ± 0.676
1.217ThrMet: 1.217 ± 0.299
2.259ThrAsn: 2.259 ± 0.511
2.781ThrPro: 2.781 ± 0.42
2.52ThrGln: 2.52 ± 0.505
2.607ThrArg: 2.607 ± 0.371
2.433ThrSer: 2.433 ± 0.464
3.737ThrThr: 3.737 ± 0.514
3.215ThrVal: 3.215 ± 0.488
0.608ThrTrp: 0.608 ± 0.307
1.999ThrTyr: 1.999 ± 0.311
0.0ThrXaa: 0.0 ± 0.0
Val
5.388ValAla: 5.388 ± 0.604
0.174ValCys: 0.174 ± 0.127
3.563ValAsp: 3.563 ± 0.568
6.343ValGlu: 6.343 ± 0.816
2.52ValPhe: 2.52 ± 0.445
3.476ValGly: 3.476 ± 0.573
1.564ValHis: 1.564 ± 0.337
2.954ValIle: 2.954 ± 0.516
5.735ValLys: 5.735 ± 0.783
5.214ValLeu: 5.214 ± 0.527
1.825ValMet: 1.825 ± 0.471
3.563ValAsn: 3.563 ± 0.653
2.781ValPro: 2.781 ± 0.473
2.433ValGln: 2.433 ± 0.405
3.389ValArg: 3.389 ± 0.532
3.91ValSer: 3.91 ± 0.637
4.432ValThr: 4.432 ± 0.679
4.345ValVal: 4.345 ± 0.705
0.608ValTrp: 0.608 ± 0.197
2.694ValTyr: 2.694 ± 0.415
0.0ValXaa: 0.0 ± 0.0
Trp
0.956TrpAla: 0.956 ± 0.276
0.087TrpCys: 0.087 ± 0.087
0.782TrpAsp: 0.782 ± 0.281
0.782TrpGlu: 0.782 ± 0.222
0.434TrpPhe: 0.434 ± 0.203
0.434TrpGly: 0.434 ± 0.169
0.434TrpHis: 0.434 ± 0.229
0.608TrpIle: 0.608 ± 0.291
1.217TrpLys: 1.217 ± 0.328
1.13TrpLeu: 1.13 ± 0.335
0.087TrpMet: 0.087 ± 0.078
0.695TrpAsn: 0.695 ± 0.28
0.261TrpPro: 0.261 ± 0.147
0.434TrpGln: 0.434 ± 0.16
0.782TrpArg: 0.782 ± 0.221
0.695TrpSer: 0.695 ± 0.231
1.217TrpThr: 1.217 ± 0.315
0.869TrpVal: 0.869 ± 0.339
0.174TrpTrp: 0.174 ± 0.106
0.695TrpTyr: 0.695 ± 0.315
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.868TyrAla: 2.868 ± 0.533
0.521TyrCys: 0.521 ± 0.227
2.433TyrAsp: 2.433 ± 0.353
2.694TyrGlu: 2.694 ± 0.473
1.912TyrPhe: 1.912 ± 0.471
3.041TyrGly: 3.041 ± 0.459
0.695TyrHis: 0.695 ± 0.263
1.999TyrIle: 1.999 ± 0.472
1.999TyrLys: 1.999 ± 0.382
2.607TyrLeu: 2.607 ± 0.507
1.217TyrMet: 1.217 ± 0.287
1.651TyrAsn: 1.651 ± 0.368
1.912TyrPro: 1.912 ± 0.391
2.172TyrGln: 2.172 ± 0.444
2.086TyrArg: 2.086 ± 0.322
1.477TyrSer: 1.477 ± 0.33
1.825TyrThr: 1.825 ± 0.493
2.259TyrVal: 2.259 ± 0.522
0.434TyrTrp: 0.434 ± 0.188
0.434TyrTyr: 0.434 ± 0.192
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 48 proteins (11509 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski