Amino acid dipeptide frequency for Vibrio phage jenny 12G5

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely to occur in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
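As a rough illustration of the calculation described above, the following Python sketch counts overlapping dipeptides within each protein and compares the resulting per mille frequencies with the uniform expectation of 1/400. The function name `dipeptide_frequencies`, the toy sequences, the mapping of non-standard letters to `X`, and the normalisation by the total number of dipeptide windows are all assumptions made for illustration; the database may count or normalise differently.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes
UNIFORM_PER_MILLE = 1000 / 400     # 2.5 per mille if all 400 dipeptides were equally likely


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins.

    Dipeptides are counted with a sliding window of width 2 inside each
    protein, so a pair never spans two different proteins. Any letter
    outside the 20 standard ones is mapped to 'X' (an assumption).
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000 * n / total for pair, n in counts.items()}


# Toy input, not real phage proteins.
freqs = dipeptide_frequencies(["MKKLA", "MKAL"])
for pair, value in sorted(freqs.items()):
    label = "over" if value > UNIFORM_PER_MILLE else "under"
    print(f"{pair}: {value:.1f} per mille ({label}-represented vs. uniform)")
```

With only two short toy sequences every observed pair lands far above 2.5‰; the comparison becomes meaningful only for a whole proteome.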

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
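The unit conversion can be sketched in a few lines of Python. The helper names below are hypothetical; the AlaAla value is taken from the table, and the ratio to the uniform expectation (2.5‰) is only one simple way to express over- or underrepresentation.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10


ala_ala = 6.128       # AlaAla frequency from the table, in per mille
uniform = 1000 / 400  # uniform expectation for 400 dipeptides, in per mille

print(per_mille_to_fraction(ala_ala))  # fraction of all dipeptides
print(per_mille_to_percent(ala_ala))   # same value in percent
print(ala_ala / uniform)               # fold difference relative to uniform
```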

For more information, see the sequence space article on Wikipedia.

Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.128 ± 1.349
AlaCys: 1.062 ± 0.322
AlaAsp: 3.105 ± 0.678
AlaGlu: 5.475 ± 0.682
AlaPhe: 1.961 ± 0.383
AlaGly: 3.922 ± 0.773
AlaHis: 0.817 ± 0.257
AlaIle: 5.148 ± 0.675
AlaLys: 6.619 ± 1.149
AlaLeu: 6.374 ± 0.743
AlaMet: 1.471 ± 0.319
AlaAsn: 3.677 ± 0.447
AlaPro: 2.533 ± 0.527
AlaGln: 3.187 ± 0.539
AlaArg: 3.105 ± 0.537
AlaSer: 4.576 ± 0.7
AlaThr: 5.72 ± 0.912
AlaVal: 5.148 ± 0.583
AlaTrp: 0.899 ± 0.233
AlaTyr: 2.86 ± 0.542
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.817 ± 0.287
CysCys: 0.163 ± 0.11
CysAsp: 0.327 ± 0.175
CysGlu: 1.471 ± 0.354
CysPhe: 1.062 ± 0.37
CysGly: 0.817 ± 0.324
CysHis: 0.409 ± 0.175
CysIle: 0.817 ± 0.253
CysLys: 1.144 ± 0.316
CysLeu: 1.062 ± 0.384
CysMet: 0.327 ± 0.161
CysAsn: 0.899 ± 0.272
CysPro: 0.49 ± 0.176
CysGln: 0.409 ± 0.191
CysArg: 0.899 ± 0.219
CysSer: 1.144 ± 0.287
CysThr: 0.817 ± 0.321
CysVal: 1.062 ± 0.278
CysTrp: 0.327 ± 0.184
CysTyr: 0.409 ± 0.182
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.922 ± 0.644
AspCys: 0.981 ± 0.289
AspAsp: 3.677 ± 0.508
AspGlu: 4.739 ± 0.572
AspPhe: 2.37 ± 0.458
AspGly: 5.066 ± 0.831
AspHis: 1.062 ± 0.305
AspIle: 4.086 ± 0.542
AspLys: 5.066 ± 0.555
AspLeu: 4.739 ± 0.573
AspMet: 1.716 ± 0.341
AspAsn: 3.514 ± 0.453
AspPro: 1.307 ± 0.327
AspGln: 1.879 ± 0.406
AspArg: 1.879 ± 0.33
AspSer: 5.148 ± 0.522
AspThr: 3.187 ± 0.609
AspVal: 3.677 ± 0.501
AspTrp: 1.226 ± 0.328
AspTyr: 2.86 ± 0.594
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.619 ± 0.776
GluCys: 1.144 ± 0.296
GluAsp: 4.167 ± 0.514
GluGlu: 3.922 ± 0.533
GluPhe: 3.187 ± 0.553
GluGly: 3.514 ± 0.521
GluHis: 1.389 ± 0.286
GluIle: 4.331 ± 0.612
GluLys: 4.249 ± 0.69
GluLeu: 8.335 ± 0.886
GluMet: 2.778 ± 0.664
GluAsn: 3.84 ± 0.526
GluPro: 1.307 ± 0.301
GluGln: 2.86 ± 0.75
GluArg: 3.84 ± 0.575
GluSer: 4.331 ± 0.495
GluThr: 3.595 ± 0.727
GluVal: 4.984 ± 0.664
GluTrp: 2.043 ± 0.516
GluTyr: 2.533 ± 0.431
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.716 ± 0.371
PheCys: 0.572 ± 0.2
PheAsp: 2.778 ± 0.46
PheGlu: 3.105 ± 0.64
PhePhe: 0.981 ± 0.266
PheGly: 2.206 ± 0.319
PheHis: 0.817 ± 0.267
PheIle: 2.125 ± 0.436
PheLys: 3.105 ± 0.402
PheLeu: 2.451 ± 0.395
PheMet: 1.062 ± 0.33
PheAsn: 3.023 ± 0.538
PhePro: 0.735 ± 0.274
PheGln: 1.144 ± 0.301
PheArg: 1.062 ± 0.259
PheSer: 2.697 ± 0.508
PheThr: 2.37 ± 0.448
PheVal: 1.879 ± 0.408
PheTrp: 0.49 ± 0.203
PheTyr: 1.716 ± 0.351
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.984 ± 0.765
GlyCys: 1.226 ± 0.45
GlyAsp: 3.432 ± 0.595
GlyGlu: 4.086 ± 0.447
GlyPhe: 2.697 ± 0.555
GlyGly: 4.739 ± 0.697
GlyHis: 0.981 ± 0.244
GlyIle: 3.677 ± 0.647
GlyLys: 5.311 ± 0.65
GlyLeu: 4.658 ± 0.643
GlyMet: 2.125 ± 0.434
GlyAsn: 3.514 ± 0.729
GlyPro: 0.899 ± 0.275
GlyGln: 2.37 ± 0.513
GlyArg: 2.697 ± 0.414
GlySer: 4.739 ± 0.648
GlyThr: 4.249 ± 0.621
GlyVal: 5.311 ± 0.564
GlyTrp: 1.062 ± 0.294
GlyTyr: 2.615 ± 0.485
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.634 ± 0.37
HisCys: 0.409 ± 0.167
HisAsp: 0.981 ± 0.253
HisGlu: 1.553 ± 0.35
HisPhe: 0.981 ± 0.251
HisGly: 1.471 ± 0.353
HisHis: 0.409 ± 0.18
HisIle: 0.654 ± 0.198
HisLys: 1.062 ± 0.3
HisLeu: 0.49 ± 0.159
HisMet: 0.327 ± 0.13
HisAsn: 0.572 ± 0.231
HisPro: 0.49 ± 0.222
HisGln: 0.899 ± 0.256
HisArg: 1.144 ± 0.292
HisSer: 1.553 ± 0.325
HisThr: 0.817 ± 0.251
HisVal: 1.144 ± 0.26
HisTrp: 0.572 ± 0.221
HisTyr: 0.572 ± 0.244
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.658 ± 0.568
IleCys: 1.144 ± 0.328
IleAsp: 6.21 ± 0.631
IleGlu: 5.393 ± 0.623
IlePhe: 0.981 ± 0.263
IleGly: 4.331 ± 0.817
IleHis: 0.899 ± 0.268
IleIle: 2.778 ± 0.534
IleLys: 5.72 ± 0.683
IleLeu: 2.778 ± 0.402
IleMet: 0.572 ± 0.228
IleAsn: 3.677 ± 0.439
IlePro: 1.961 ± 0.367
IleGln: 1.798 ± 0.351
IleArg: 3.023 ± 0.479
IleSer: 3.35 ± 0.526
IleThr: 3.105 ± 0.571
IleVal: 3.187 ± 0.455
IleTrp: 0.654 ± 0.218
IleTyr: 1.634 ± 0.354
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.965 ± 0.859
LysCys: 1.062 ± 0.386
LysAsp: 4.004 ± 0.575
LysGlu: 4.984 ± 0.798
LysPhe: 2.778 ± 0.596
LysGly: 4.821 ± 0.518
LysHis: 1.553 ± 0.342
LysIle: 4.249 ± 0.42
LysLys: 5.802 ± 0.869
LysLeu: 5.148 ± 0.864
LysMet: 2.451 ± 0.477
LysAsn: 2.697 ± 0.519
LysPro: 3.105 ± 0.505
LysGln: 3.432 ± 0.718
LysArg: 4.739 ± 0.829
LysSer: 3.35 ± 0.566
LysThr: 4.331 ± 0.6
LysVal: 5.148 ± 0.711
LysTrp: 1.144 ± 0.314
LysTyr: 2.697 ± 0.498
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.066 ± 0.719
LeuCys: 1.144 ± 0.302
LeuAsp: 4.658 ± 0.484
LeuGlu: 5.148 ± 0.703
LeuPhe: 2.533 ± 0.391
LeuGly: 4.821 ± 0.702
LeuHis: 0.981 ± 0.242
LeuIle: 5.066 ± 0.532
LeuLys: 4.658 ± 0.709
LeuLeu: 4.658 ± 0.609
LeuMet: 1.716 ± 0.415
LeuAsn: 4.494 ± 0.62
LeuPro: 2.942 ± 0.508
LeuGln: 2.778 ± 0.467
LeuArg: 3.595 ± 0.597
LeuSer: 5.311 ± 0.619
LeuThr: 5.965 ± 0.777
LeuVal: 4.903 ± 0.643
LeuTrp: 0.49 ± 0.205
LeuTyr: 2.206 ± 0.517
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.451 ± 0.531
MetCys: 0.163 ± 0.113
MetAsp: 1.226 ± 0.283
MetGlu: 1.226 ± 0.311
MetPhe: 0.899 ± 0.295
MetGly: 1.716 ± 0.36
MetHis: 0.327 ± 0.158
MetIle: 1.634 ± 0.389
MetLys: 1.961 ± 0.427
MetLeu: 1.389 ± 0.363
MetMet: 0.981 ± 0.312
MetAsn: 1.634 ± 0.365
MetPro: 0.981 ± 0.31
MetGln: 1.144 ± 0.315
MetArg: 1.471 ± 0.343
MetSer: 3.269 ± 0.546
MetThr: 1.961 ± 0.437
MetVal: 1.879 ± 0.447
MetTrp: 0.409 ± 0.153
MetTyr: 0.572 ± 0.199
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.249 ± 0.553
AsnCys: 0.735 ± 0.243
AsnAsp: 2.942 ± 0.556
AsnGlu: 4.249 ± 0.585
AsnPhe: 1.634 ± 0.355
AsnGly: 5.066 ± 0.561
AsnHis: 1.471 ± 0.285
AsnIle: 2.206 ± 0.504
AsnLys: 4.576 ± 0.746
AsnLeu: 3.432 ± 0.53
AsnMet: 2.125 ± 0.403
AsnAsn: 4.004 ± 0.607
AsnPro: 3.269 ± 0.494
AsnGln: 1.798 ± 0.371
AsnArg: 2.533 ± 0.457
AsnSer: 3.922 ± 0.612
AsnThr: 2.778 ± 0.424
AsnVal: 3.432 ± 0.556
AsnTrp: 0.735 ± 0.244
AsnTyr: 1.879 ± 0.367
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.288 ± 0.398
ProCys: 0.409 ± 0.214
ProAsp: 1.716 ± 0.403
ProGlu: 3.269 ± 0.56
ProPhe: 0.899 ± 0.21
ProGly: 1.062 ± 0.349
ProHis: 0.49 ± 0.155
ProIle: 1.634 ± 0.484
ProLys: 2.37 ± 0.43
ProLeu: 2.288 ± 0.407
ProMet: 0.899 ± 0.274
ProAsn: 2.125 ± 0.488
ProPro: 0.817 ± 0.239
ProGln: 1.307 ± 0.44
ProArg: 0.981 ± 0.332
ProSer: 2.615 ± 0.404
ProThr: 2.37 ± 0.52
ProVal: 2.206 ± 0.441
ProTrp: 0.082 ± 0.075
ProTyr: 1.389 ± 0.307
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.187 ± 0.524
GlnCys: 0.572 ± 0.184
GlnAsp: 1.716 ± 0.37
GlnGlu: 2.697 ± 0.42
GlnPhe: 1.553 ± 0.428
GlnGly: 2.778 ± 0.396
GlnHis: 0.49 ± 0.24
GlnIle: 2.451 ± 0.46
GlnLys: 1.307 ± 0.367
GlnLeu: 3.269 ± 0.571
GlnMet: 1.226 ± 0.346
GlnAsn: 1.879 ± 0.41
GlnPro: 1.716 ± 0.341
GlnGln: 3.595 ± 1.366
GlnArg: 1.634 ± 0.331
GlnSer: 3.595 ± 0.492
GlnThr: 2.37 ± 0.411
GlnVal: 2.778 ± 0.449
GlnTrp: 0.817 ± 0.237
GlnTyr: 1.553 ± 0.314
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.697 ± 0.473
ArgCys: 0.572 ± 0.274
ArgAsp: 2.615 ± 0.492
ArgGlu: 3.105 ± 0.503
ArgPhe: 1.961 ± 0.387
ArgGly: 2.206 ± 0.457
ArgHis: 1.144 ± 0.31
ArgIle: 2.942 ± 0.51
ArgLys: 3.595 ± 0.862
ArgLeu: 3.84 ± 0.506
ArgMet: 0.899 ± 0.288
ArgAsn: 2.206 ± 0.379
ArgPro: 1.389 ± 0.362
ArgGln: 1.879 ± 0.436
ArgArg: 1.307 ± 0.298
ArgSer: 3.105 ± 0.419
ArgThr: 2.533 ± 0.449
ArgVal: 3.84 ± 0.591
ArgTrp: 0.409 ± 0.154
ArgTyr: 1.553 ± 0.338
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.167 ± 0.446
SerCys: 0.735 ± 0.235
SerAsp: 4.576 ± 0.685
SerGlu: 5.311 ± 0.8
SerPhe: 2.942 ± 0.437
SerGly: 4.412 ± 0.571
SerHis: 1.553 ± 0.309
SerIle: 4.412 ± 0.55
SerLys: 5.393 ± 0.819
SerLeu: 5.883 ± 0.588
SerMet: 1.961 ± 0.425
SerAsn: 4.249 ± 0.479
SerPro: 2.043 ± 0.376
SerGln: 3.514 ± 0.524
SerArg: 3.105 ± 0.451
SerSer: 4.331 ± 0.572
SerThr: 4.004 ± 0.672
SerVal: 4.576 ± 0.681
SerTrp: 0.981 ± 0.301
SerTyr: 3.514 ± 0.605
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.658 ± 0.702
ThrCys: 0.572 ± 0.217
ThrAsp: 4.249 ± 0.742
ThrGlu: 4.412 ± 0.727
ThrPhe: 1.879 ± 0.612
ThrGly: 4.821 ± 0.816
ThrHis: 1.144 ± 0.283
ThrIle: 3.922 ± 0.508
ThrLys: 3.595 ± 0.415
ThrLeu: 4.494 ± 0.665
ThrMet: 1.961 ± 0.385
ThrAsn: 3.84 ± 0.591
ThrPro: 2.37 ± 0.505
ThrGln: 2.37 ± 0.432
ThrArg: 2.043 ± 0.369
ThrSer: 4.739 ± 0.685
ThrThr: 3.432 ± 0.68
ThrVal: 4.167 ± 0.549
ThrTrp: 0.572 ± 0.194
ThrTyr: 2.533 ± 0.544
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.066 ± 0.864
ValCys: 0.981 ± 0.321
ValAsp: 5.393 ± 0.696
ValGlu: 4.576 ± 0.697
ValPhe: 2.125 ± 0.372
ValGly: 4.167 ± 0.61
ValHis: 0.572 ± 0.212
ValIle: 3.105 ± 0.531
ValLys: 4.984 ± 0.745
ValLeu: 3.84 ± 0.531
ValMet: 1.798 ± 0.434
ValAsn: 4.903 ± 0.665
ValPro: 1.798 ± 0.336
ValGln: 2.125 ± 0.384
ValArg: 2.697 ± 0.419
ValSer: 5.965 ± 0.949
ValThr: 5.23 ± 0.922
ValVal: 4.331 ± 0.564
ValTrp: 0.409 ± 0.172
ValTyr: 2.942 ± 0.536
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.735 ± 0.195
TrpCys: 0.327 ± 0.166
TrpAsp: 0.981 ± 0.3
TrpGlu: 1.062 ± 0.291
TrpPhe: 0.572 ± 0.308
TrpGly: 0.817 ± 0.274
TrpHis: 0.654 ± 0.23
TrpIle: 0.49 ± 0.237
TrpLys: 0.981 ± 0.263
TrpLeu: 1.471 ± 0.38
TrpMet: 0.327 ± 0.194
TrpAsn: 0.49 ± 0.218
TrpPro: 0.163 ± 0.107
TrpGln: 0.817 ± 0.228
TrpArg: 0.899 ± 0.349
TrpSer: 0.981 ± 0.276
TrpThr: 0.572 ± 0.242
TrpVal: 1.062 ± 0.34
TrpTrp: 0.163 ± 0.101
TrpTyr: 0.245 ± 0.115
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.778 ± 0.471
TyrCys: 0.899 ± 0.274
TyrAsp: 3.35 ± 0.531
TyrGlu: 3.187 ± 0.493
TyrPhe: 2.043 ± 0.358
TyrGly: 2.615 ± 0.394
TyrHis: 0.572 ± 0.192
TyrIle: 2.125 ± 0.498
TyrLys: 1.961 ± 0.356
TyrLeu: 2.533 ± 0.439
TyrMet: 0.49 ± 0.263
TyrAsn: 1.798 ± 0.352
TyrPro: 0.899 ± 0.261
TyrGln: 1.879 ± 0.381
TyrArg: 1.144 ± 0.279
TyrSer: 2.942 ± 0.575
TyrThr: 2.37 ± 0.404
TyrVal: 2.37 ± 0.525
TyrTrp: 0.327 ± 0.152
TyrTyr: 1.634 ± 0.331
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 74 proteins (12239 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
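A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicate values is reported as the error. The function names, the fixed seed, the toy sequences, and the use of the sample standard deviation as the error measure are assumptions for illustration; the note above does not specify how the reported ± values are derived from the replicates.

```python
import random
import statistics


def pair_frequency(proteins, pair):
    """Frequency of one dipeptide, in per mille, over a list of sequences."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency across bootstrap replicates.

    Each replicate draws len(proteins) whole proteins with replacement,
    so the resampling unit is the protein, not the individual residue.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)


# Toy input, not real phage proteins.
toy = ["MKKLA", "MKAL", "AKKA"]
print(f"KK: {pair_frequency(toy, 'KK'):.1f} ± {bootstrap_error(toy, 'KK'):.1f} per mille")
```

Resampling whole proteins rather than single residues keeps each sequence intact, so the estimate reflects variation between proteins in the proteome.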

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski