Amino acid dipeptide frequency for Weissella phage phiYS61

Apart from single amino acid frequencies, one can also calculate so-called amino acid dipeptide frequencies.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
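The counting described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the code used by Proteome-pI: the function name `dipeptide_frequencies`, the one-letter alphabet, the toy sequences, and the choice to normalise by the total number of overlapping pairs are all assumptions, and the database may normalise differently.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code; anything else is mapped to "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_frequencies(proteins):
    """Return {dipeptide: fraction} for all 441 pairs over the 21-letter alphabet.

    Overlapping pairs are counted within each protein only,
    never across the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X.
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Hypothetical toy sequences, not taken from the phiYS61 proteome.
toy = ["MAGGA", "MKAG"]
freqs = dipeptide_frequencies(toy)

# Uniform expectation over the 400 standard dipeptides: 1/400 = 0.25%.
uniform_expectation = 1 / 400
```

With real data, `proteins` would be the list of protein sequences of the proteome, and each entry of `freqs` could be compared against `uniform_expectation`.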

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
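As a small worked example of the unit conversion, the sketch below turns per mille values into fractions and compares them with the uniform expectation of 2.5‰ (1/400). The helper names are hypothetical, and the simple greater-than or less-than comparison is an assumption; the page does not state the exact rule it uses for the red and blue marking.

```python
# Uniform expectation for one of the 400 standard dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000 / 400  # 2.5


def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def classify(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"


# Three values copied from the table for illustration.
examples = {"AlaGly": 7.989, "CysCys": 0.0, "GluPro": 2.596}
labels = {name: classify(value) for name, value in examples.items()}
fractions = {name: per_mille_to_fraction(value) for name, value in examples.items()}
```

For instance, AlaGly at 7.989‰ corresponds to a fraction of about 0.008, roughly three times the uniform expectation.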

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.2 ± 0.124
AlaCys: 0.0 ± 0.0
AlaAsp: 4.194 ± 0.681
AlaGlu: 5.193 ± 0.694
AlaPhe: 2.896 ± 0.518
AlaGly: 7.989 ± 1.523
AlaHis: 1.398 ± 0.39
AlaIle: 5.592 ± 0.659
AlaLys: 5.293 ± 0.777
AlaLeu: 6.291 ± 0.741
AlaMet: 2.796 ± 0.516
AlaAsn: 5.392 ± 0.529
AlaPro: 3.196 ± 0.777
AlaGln: 2.996 ± 0.477
AlaArg: 2.896 ± 0.685
AlaSer: 5.392 ± 0.751
AlaThr: 5.492 ± 0.962
AlaVal: 5.093 ± 0.769
AlaTrp: 0.899 ± 0.262
AlaTyr: 2.796 ± 0.562
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.1 ± 0.096
CysCys: 0.0 ± 0.0
CysAsp: 0.3 ± 0.177
CysGlu: 0.0 ± 0.0
CysPhe: 0.1 ± 0.102
CysGly: 0.3 ± 0.259
CysHis: 0.1 ± 0.096
CysIle: 0.1 ± 0.102
CysLys: 0.2 ± 0.118
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 0.1 ± 0.104
CysPro: 0.0 ± 0.0
CysGln: 0.1 ± 0.083
CysArg: 0.1 ± 0.083
CysSer: 0.0 ± 0.0
CysThr: 0.1 ± 0.113
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.1 ± 0.105
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.494 ± 0.75
AspCys: 0.1 ± 0.096
AspAsp: 2.796 ± 0.596
AspGlu: 3.395 ± 0.695
AspPhe: 2.796 ± 0.63
AspGly: 4.294 ± 0.72
AspHis: 1.098 ± 0.248
AspIle: 3.695 ± 0.521
AspLys: 3.994 ± 0.642
AspLeu: 4.693 ± 0.69
AspMet: 1.797 ± 0.386
AspAsn: 4.494 ± 0.506
AspPro: 4.094 ± 0.856
AspGln: 1.897 ± 0.464
AspArg: 1.797 ± 0.367
AspSer: 3.495 ± 0.672
AspThr: 5.293 ± 0.717
AspVal: 4.594 ± 0.691
AspTrp: 1.897 ± 0.399
AspTyr: 2.696 ± 0.431
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.89 ± 0.825
GluCys: 0.1 ± 0.113
GluAsp: 4.194 ± 0.631
GluGlu: 5.992 ± 0.883
GluPhe: 3.695 ± 0.515
GluGly: 4.793 ± 0.762
GluHis: 0.799 ± 0.309
GluIle: 2.696 ± 0.651
GluLys: 3.295 ± 0.689
GluLeu: 6.091 ± 0.994
GluMet: 2.297 ± 0.458
GluAsn: 2.197 ± 0.624
GluPro: 2.596 ± 0.605
GluGln: 3.795 ± 0.594
GluArg: 2.596 ± 0.426
GluSer: 2.796 ± 0.513
GluThr: 4.394 ± 0.757
GluVal: 4.793 ± 0.728
GluTrp: 1.098 ± 0.395
GluTyr: 2.596 ± 0.611
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.096 ± 0.539
PheCys: 0.0 ± 0.0
PheAsp: 3.895 ± 0.487
PheGlu: 3.196 ± 0.566
PhePhe: 1.897 ± 0.387
PheGly: 3.395 ± 0.566
PheHis: 1.198 ± 0.409
PheIle: 2.097 ± 0.554
PheLys: 3.295 ± 0.559
PheLeu: 1.797 ± 0.472
PheMet: 1.797 ± 0.414
PheAsn: 2.796 ± 0.545
PhePro: 1.098 ± 0.33
PheGln: 1.897 ± 0.349
PheArg: 1.098 ± 0.351
PheSer: 2.497 ± 0.453
PheThr: 2.996 ± 0.55
PheVal: 2.796 ± 0.612
PheTrp: 0.499 ± 0.22
PheTyr: 1.298 ± 0.465
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.291 ± 1.073
GlyCys: 0.0 ± 0.0
GlyAsp: 5.293 ± 1.428
GlyGlu: 3.595 ± 0.638
GlyPhe: 3.295 ± 0.665
GlyGly: 8.189 ± 1.834
GlyHis: 0.699 ± 0.225
GlyIle: 3.495 ± 0.546
GlyLys: 5.392 ± 0.767
GlyLeu: 5.392 ± 0.94
GlyMet: 1.598 ± 0.466
GlyAsn: 4.094 ± 0.668
GlyPro: 3.795 ± 1.946
GlyGln: 2.397 ± 0.377
GlyArg: 2.796 ± 0.631
GlySer: 4.394 ± 0.76
GlyThr: 5.592 ± 0.822
GlyVal: 4.893 ± 0.667
GlyTrp: 1.098 ± 0.39
GlyTyr: 4.094 ± 0.625
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.298 ± 0.388
HisCys: 0.0 ± 0.0
HisAsp: 1.797 ± 0.415
HisGlu: 1.098 ± 0.255
HisPhe: 0.599 ± 0.22
HisGly: 1.498 ± 0.463
HisHis: 0.399 ± 0.2
HisIle: 1.098 ± 0.373
HisLys: 0.999 ± 0.313
HisLeu: 1.598 ± 0.324
HisMet: 0.2 ± 0.145
HisAsn: 1.298 ± 0.222
HisPro: 0.999 ± 0.333
HisGln: 0.2 ± 0.149
HisArg: 0.0 ± 0.0
HisSer: 1.498 ± 0.42
HisThr: 1.398 ± 0.354
HisVal: 0.999 ± 0.335
HisTrp: 0.499 ± 0.192
HisTyr: 0.399 ± 0.176
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.395 ± 0.563
IleCys: 0.1 ± 0.083
IleAsp: 4.094 ± 0.735
IleGlu: 5.093 ± 0.633
IlePhe: 1.797 ± 0.42
IleGly: 3.495 ± 0.491
IleHis: 0.999 ± 0.286
IleIle: 2.696 ± 0.532
IleLys: 5.093 ± 0.895
IleLeu: 3.495 ± 0.677
IleMet: 0.999 ± 0.309
IleAsn: 3.395 ± 0.47
IlePro: 2.197 ± 0.417
IleGln: 2.596 ± 0.492
IleArg: 2.097 ± 0.48
IleSer: 2.297 ± 0.524
IleThr: 3.695 ± 0.498
IleVal: 3.395 ± 0.535
IleTrp: 0.3 ± 0.153
IleTyr: 2.397 ± 0.562
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.993 ± 0.833
LysCys: 0.0 ± 0.0
LysAsp: 2.896 ± 0.661
LysGlu: 5.492 ± 0.985
LysPhe: 2.696 ± 0.709
LysGly: 5.992 ± 1.083
LysHis: 1.298 ± 0.444
LysIle: 2.097 ± 0.599
LysLys: 4.494 ± 1.063
LysLeu: 4.893 ± 0.735
LysMet: 1.897 ± 0.484
LysAsn: 3.695 ± 0.593
LysPro: 2.097 ± 0.541
LysGln: 3.295 ± 0.54
LysArg: 2.796 ± 0.613
LysSer: 4.194 ± 0.68
LysThr: 4.194 ± 0.562
LysVal: 4.793 ± 0.657
LysTrp: 1.098 ± 0.404
LysTyr: 2.996 ± 0.428
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.892 ± 0.757
LeuCys: 0.499 ± 0.222
LeuAsp: 5.193 ± 0.676
LeuGlu: 3.795 ± 0.605
LeuPhe: 3.096 ± 0.665
LeuGly: 4.094 ± 0.727
LeuHis: 1.398 ± 0.348
LeuIle: 3.096 ± 0.541
LeuLys: 4.194 ± 0.839
LeuLeu: 3.895 ± 0.776
LeuMet: 2.696 ± 0.591
LeuAsn: 4.094 ± 0.579
LeuPro: 2.796 ± 0.486
LeuGln: 3.395 ± 0.631
LeuArg: 3.196 ± 0.631
LeuSer: 4.893 ± 0.623
LeuThr: 5.093 ± 0.627
LeuVal: 5.592 ± 0.642
LeuTrp: 0.799 ± 0.395
LeuTyr: 3.495 ± 0.715
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.696 ± 0.455
MetCys: 0.0 ± 0.0
MetAsp: 1.098 ± 0.357
MetGlu: 1.198 ± 0.339
MetPhe: 1.797 ± 0.446
MetGly: 1.198 ± 0.42
MetHis: 0.499 ± 0.262
MetIle: 1.398 ± 0.325
MetLys: 1.598 ± 0.356
MetLeu: 1.797 ± 0.44
MetMet: 0.499 ± 0.265
MetAsn: 2.197 ± 0.377
MetPro: 1.398 ± 0.399
MetGln: 1.598 ± 0.374
MetArg: 1.098 ± 0.333
MetSer: 1.997 ± 0.406
MetThr: 1.897 ± 0.442
MetVal: 1.498 ± 0.361
MetTrp: 0.399 ± 0.266
MetTyr: 1.098 ± 0.234
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.892 ± 0.856
AsnCys: 0.2 ± 0.133
AsnAsp: 3.395 ± 0.674
AsnGlu: 3.495 ± 0.657
AsnPhe: 2.297 ± 0.456
AsnGly: 4.394 ± 0.647
AsnHis: 1.198 ± 0.37
AsnIle: 3.395 ± 0.462
AsnLys: 4.893 ± 0.61
AsnLeu: 4.893 ± 0.735
AsnMet: 1.698 ± 0.328
AsnAsn: 3.994 ± 0.788
AsnPro: 3.994 ± 0.759
AsnGln: 2.996 ± 0.725
AsnArg: 2.896 ± 0.566
AsnSer: 2.197 ± 0.537
AsnThr: 3.795 ± 0.509
AsnVal: 3.595 ± 0.615
AsnTrp: 0.899 ± 0.297
AsnTyr: 2.297 ± 0.424
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.695 ± 0.63
ProCys: 0.1 ± 0.102
ProAsp: 2.696 ± 0.624
ProGlu: 3.196 ± 0.525
ProPhe: 1.298 ± 0.499
ProGly: 3.295 ± 0.885
ProHis: 0.599 ± 0.225
ProIle: 2.097 ± 0.545
ProLys: 2.696 ± 0.783
ProLeu: 2.497 ± 0.469
ProMet: 0.599 ± 0.238
ProAsn: 1.797 ± 0.378
ProPro: 0.699 ± 0.231
ProGln: 1.997 ± 0.911
ProArg: 1.098 ± 0.326
ProSer: 2.896 ± 0.517
ProThr: 2.497 ± 0.542
ProVal: 3.196 ± 0.604
ProTrp: 0.799 ± 0.262
ProTyr: 2.596 ± 0.474
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.394 ± 0.635
GlnCys: 0.0 ± 0.0
GlnAsp: 1.698 ± 0.384
GlnGlu: 2.097 ± 0.477
GlnPhe: 2.896 ± 0.701
GlnGly: 3.795 ± 1.166
GlnHis: 0.999 ± 0.309
GlnIle: 2.696 ± 0.473
GlnLys: 3.196 ± 0.655
GlnLeu: 3.096 ± 0.469
GlnMet: 1.098 ± 0.287
GlnAsn: 3.495 ± 0.674
GlnPro: 0.799 ± 0.332
GlnGln: 1.498 ± 0.338
GlnArg: 1.698 ± 0.35
GlnSer: 2.497 ± 0.609
GlnThr: 2.397 ± 0.53
GlnVal: 2.996 ± 0.467
GlnTrp: 0.599 ± 0.236
GlnTyr: 1.698 ± 0.385
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.696 ± 0.457
ArgCys: 0.0 ± 0.0
ArgAsp: 3.595 ± 0.511
ArgGlu: 2.696 ± 0.619
ArgPhe: 1.797 ± 0.428
ArgGly: 2.696 ± 0.575
ArgHis: 0.599 ± 0.222
ArgIle: 2.397 ± 0.578
ArgLys: 2.297 ± 0.586
ArgLeu: 3.295 ± 0.72
ArgMet: 1.098 ± 0.368
ArgAsn: 2.596 ± 0.481
ArgPro: 1.198 ± 0.288
ArgGln: 1.098 ± 0.277
ArgArg: 1.498 ± 0.285
ArgSer: 1.997 ± 0.52
ArgThr: 1.797 ± 0.524
ArgVal: 3.495 ± 0.534
ArgTrp: 0.499 ± 0.278
ArgTyr: 1.897 ± 0.465
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.592 ± 0.949
SerCys: 0.0 ± 0.0
SerAsp: 3.895 ± 0.685
SerGlu: 3.895 ± 0.644
SerPhe: 2.197 ± 0.46
SerGly: 3.895 ± 0.887
SerHis: 1.398 ± 0.36
SerIle: 4.094 ± 0.635
SerLys: 4.094 ± 0.723
SerLeu: 3.395 ± 0.413
SerMet: 1.298 ± 0.318
SerAsn: 4.094 ± 0.652
SerPro: 1.598 ± 0.323
SerGln: 3.595 ± 0.641
SerArg: 2.896 ± 0.659
SerSer: 4.194 ± 1.14
SerThr: 3.595 ± 0.638
SerVal: 3.895 ± 0.715
SerTrp: 1.198 ± 0.327
SerTyr: 1.997 ± 0.445
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.193 ± 0.87
ThrCys: 0.3 ± 0.194
ThrAsp: 4.693 ± 0.642
ThrGlu: 3.795 ± 0.601
ThrPhe: 2.297 ± 0.474
ThrGly: 5.492 ± 0.686
ThrHis: 0.999 ± 0.45
ThrIle: 3.295 ± 0.82
ThrLys: 3.895 ± 0.59
ThrLeu: 4.993 ± 0.555
ThrMet: 1.897 ± 0.388
ThrAsn: 4.194 ± 0.734
ThrPro: 4.394 ± 0.796
ThrGln: 3.096 ± 0.579
ThrArg: 3.196 ± 0.506
ThrSer: 4.494 ± 0.586
ThrThr: 3.096 ± 0.579
ThrVal: 4.394 ± 0.703
ThrTrp: 0.599 ± 0.194
ThrTyr: 2.197 ± 0.4
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.193 ± 0.754
ValCys: 0.1 ± 0.083
ValAsp: 4.594 ± 0.537
ValGlu: 5.792 ± 1.063
ValPhe: 2.297 ± 0.482
ValGly: 3.495 ± 0.456
ValHis: 1.198 ± 0.376
ValIle: 5.193 ± 0.77
ValLys: 3.895 ± 0.502
ValLeu: 3.395 ± 0.691
ValMet: 1.298 ± 0.382
ValAsn: 4.194 ± 0.824
ValPro: 1.897 ± 0.543
ValGln: 2.896 ± 0.557
ValArg: 2.996 ± 0.527
ValSer: 5.293 ± 0.816
ValThr: 4.993 ± 0.886
ValVal: 4.394 ± 0.788
ValTrp: 1.198 ± 0.335
ValTyr: 2.497 ± 0.527
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.098 ± 0.343
TrpCys: 0.0 ± 0.0
TrpAsp: 0.3 ± 0.148
TrpGlu: 1.698 ± 0.561
TrpPhe: 0.799 ± 0.256
TrpGly: 0.899 ± 0.373
TrpHis: 0.1 ± 0.085
TrpIle: 0.699 ± 0.23
TrpLys: 0.499 ± 0.214
TrpLeu: 1.897 ± 0.453
TrpMet: 0.1 ± 0.104
TrpAsn: 1.198 ± 0.37
TrpPro: 0.1 ± 0.115
TrpGln: 1.298 ± 0.299
TrpArg: 0.799 ± 0.301
TrpSer: 1.598 ± 0.37
TrpThr: 0.799 ± 0.358
TrpVal: 0.599 ± 0.204
TrpTrp: 0.3 ± 0.165
TrpTyr: 0.799 ± 0.255
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.896 ± 0.396
TyrCys: 0.2 ± 0.149
TyrAsp: 2.796 ± 0.497
TyrGlu: 2.996 ± 0.462
TyrPhe: 2.097 ± 0.552
TyrGly: 3.196 ± 0.603
TyrHis: 0.899 ± 0.248
TyrIle: 1.698 ± 0.532
TyrLys: 2.596 ± 0.38
TyrLeu: 3.795 ± 0.444
TyrMet: 1.198 ± 0.443
TyrAsn: 3.096 ± 0.404
TyrPro: 1.398 ± 0.4
TyrGln: 0.999 ± 0.265
TyrArg: 1.698 ± 0.43
TyrSer: 2.197 ± 0.472
TyrThr: 3.395 ± 0.527
TyrVal: 1.797 ± 0.448
TyrTrp: 0.899 ± 0.296
TyrTyr: 1.398 ± 0.403
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 48 proteins (10015 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
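A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration under stated assumptions, not the Proteome-pI implementation: whole proteins are resampled with replacement, the statistic is recomputed for each of 100 replicates, and the sample standard deviation of the replicates is taken as the error. The function names, the toy sequences, the fixed seed, and the use of the standard deviation are all assumptions.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide in per mille over all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Estimate the error by resampling whole proteins with replacement."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


# Hypothetical toy sequences in one-letter code, not the phiYS61 proteome.
toy = ["MAGGA", "MKAG", "MAGAGK", "MKKLV"]
value = dipeptide_per_mille(toy, "AG")
error = bootstrap_error(toy, "AG")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the spread reflects how much the statistic depends on which proteins happen to be in the proteome.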

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski