Amino acid dipepetide frequency for Yersinia phage YeP4

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
13.063AlaAla: 13.063 ± 1.74
0.373AlaCys: 0.373 ± 0.191
6.718AlaAsp: 6.718 ± 0.778
8.211AlaGlu: 8.211 ± 0.921
4.292AlaPhe: 4.292 ± 0.574
8.305AlaGly: 8.305 ± 0.888
2.706AlaHis: 2.706 ± 0.55
5.879AlaIle: 5.879 ± 0.627
6.065AlaLys: 6.065 ± 0.898
9.798AlaLeu: 9.798 ± 1.037
1.773AlaMet: 1.773 ± 0.41
5.225AlaAsn: 5.225 ± 0.732
3.452AlaPro: 3.452 ± 0.663
5.505AlaGln: 5.505 ± 1.087
4.852AlaArg: 4.852 ± 0.619
6.438AlaSer: 6.438 ± 0.622
5.879AlaThr: 5.879 ± 0.681
6.998AlaVal: 6.998 ± 1.15
1.493AlaTrp: 1.493 ± 0.302
3.546AlaTyr: 3.546 ± 0.574
0.0AlaXaa: 0.0 ± 0.0
Cys
0.84CysAla: 0.84 ± 0.322
0.0CysCys: 0.0 ± 0.0
0.56CysAsp: 0.56 ± 0.264
0.653CysGlu: 0.653 ± 0.265
0.28CysPhe: 0.28 ± 0.176
0.746CysGly: 0.746 ± 0.336
0.0CysHis: 0.0 ± 0.0
0.373CysIle: 0.373 ± 0.175
1.306CysLys: 1.306 ± 0.379
0.84CysLeu: 0.84 ± 0.287
0.187CysMet: 0.187 ± 0.132
0.467CysAsn: 0.467 ± 0.224
0.373CysPro: 0.373 ± 0.241
0.093CysGln: 0.093 ± 0.104
0.373CysArg: 0.373 ± 0.224
0.467CysSer: 0.467 ± 0.21
0.28CysThr: 0.28 ± 0.174
0.653CysVal: 0.653 ± 0.226
0.093CysTrp: 0.093 ± 0.107
0.56CysTyr: 0.56 ± 0.269
0.0CysXaa: 0.0 ± 0.0
Asp
5.972AspAla: 5.972 ± 0.805
0.746AspCys: 0.746 ± 0.345
2.893AspAsp: 2.893 ± 0.469
4.479AspGlu: 4.479 ± 0.726
2.426AspPhe: 2.426 ± 0.529
4.759AspGly: 4.759 ± 0.673
1.586AspHis: 1.586 ± 0.42
2.613AspIle: 2.613 ± 0.44
3.732AspLys: 3.732 ± 0.736
4.292AspLeu: 4.292 ± 0.754
1.96AspMet: 1.96 ± 0.503
2.146AspAsn: 2.146 ± 0.456
2.706AspPro: 2.706 ± 0.547
2.613AspGln: 2.613 ± 0.344
2.706AspArg: 2.706 ± 0.486
4.759AspSer: 4.759 ± 0.533
3.732AspThr: 3.732 ± 0.562
4.106AspVal: 4.106 ± 0.487
0.653AspTrp: 0.653 ± 0.222
2.146AspTyr: 2.146 ± 0.505
0.0AspXaa: 0.0 ± 0.0
Glu
7.371GluAla: 7.371 ± 0.949
1.026GluCys: 1.026 ± 0.259
3.452GluAsp: 3.452 ± 0.743
2.986GluGlu: 2.986 ± 0.702
2.239GluPhe: 2.239 ± 0.436
3.732GluGly: 3.732 ± 0.518
0.933GluHis: 0.933 ± 0.238
3.452GluIle: 3.452 ± 0.621
3.732GluLys: 3.732 ± 0.561
5.785GluLeu: 5.785 ± 0.953
2.519GluMet: 2.519 ± 0.442
2.706GluAsn: 2.706 ± 0.423
2.239GluPro: 2.239 ± 0.426
3.266GluGln: 3.266 ± 0.535
3.919GluArg: 3.919 ± 0.75
3.919GluSer: 3.919 ± 0.522
3.266GluThr: 3.266 ± 0.62
3.079GluVal: 3.079 ± 0.591
1.4GluTrp: 1.4 ± 0.458
1.586GluTyr: 1.586 ± 0.328
0.0GluXaa: 0.0 ± 0.0
Phe
2.893PheAla: 2.893 ± 0.435
0.933PheCys: 0.933 ± 0.347
2.613PheAsp: 2.613 ± 0.438
1.586PheGlu: 1.586 ± 0.308
0.56PhePhe: 0.56 ± 0.21
3.173PheGly: 3.173 ± 0.451
0.467PheHis: 0.467 ± 0.166
2.893PheIle: 2.893 ± 0.534
1.866PheLys: 1.866 ± 0.447
2.426PheLeu: 2.426 ± 0.376
0.746PheMet: 0.746 ± 0.225
2.333PheAsn: 2.333 ± 0.499
1.12PhePro: 1.12 ± 0.389
0.746PheGln: 0.746 ± 0.258
1.866PheArg: 1.866 ± 0.32
1.493PheSer: 1.493 ± 0.394
2.706PheThr: 2.706 ± 0.581
1.866PheVal: 1.866 ± 0.422
0.56PheTrp: 0.56 ± 0.197
1.026PheTyr: 1.026 ± 0.287
0.0PheXaa: 0.0 ± 0.0
Gly
9.051GlyAla: 9.051 ± 1.347
0.84GlyCys: 0.84 ± 0.287
4.012GlyAsp: 4.012 ± 0.52
3.919GlyGlu: 3.919 ± 0.621
2.426GlyPhe: 2.426 ± 0.454
6.905GlyGly: 6.905 ± 0.864
1.026GlyHis: 1.026 ± 0.329
3.546GlyIle: 3.546 ± 0.503
3.826GlyLys: 3.826 ± 0.614
4.479GlyLeu: 4.479 ± 0.672
2.146GlyMet: 2.146 ± 0.548
3.919GlyAsn: 3.919 ± 0.656
1.96GlyPro: 1.96 ± 0.439
2.799GlyGln: 2.799 ± 0.517
3.732GlyArg: 3.732 ± 0.656
4.292GlySer: 4.292 ± 0.673
5.599GlyThr: 5.599 ± 0.656
5.599GlyVal: 5.599 ± 0.876
1.12GlyTrp: 1.12 ± 0.287
3.173GlyTyr: 3.173 ± 0.643
0.0GlyXaa: 0.0 ± 0.0
His
1.68HisAla: 1.68 ± 0.373
0.187HisCys: 0.187 ± 0.126
1.68HisAsp: 1.68 ± 0.392
1.586HisGlu: 1.586 ± 0.407
1.213HisPhe: 1.213 ± 0.42
0.653HisGly: 0.653 ± 0.21
0.093HisHis: 0.093 ± 0.077
1.026HisIle: 1.026 ± 0.274
0.746HisLys: 0.746 ± 0.217
1.493HisLeu: 1.493 ± 0.485
0.28HisMet: 0.28 ± 0.14
0.373HisAsn: 0.373 ± 0.197
1.213HisPro: 1.213 ± 0.368
0.187HisGln: 0.187 ± 0.127
1.493HisArg: 1.493 ± 0.306
1.12HisSer: 1.12 ± 0.328
0.84HisThr: 0.84 ± 0.294
1.4HisVal: 1.4 ± 0.309
0.187HisTrp: 0.187 ± 0.124
0.373HisTyr: 0.373 ± 0.146
0.0HisXaa: 0.0 ± 0.0
Ile
6.625IleAla: 6.625 ± 0.727
0.187IleCys: 0.187 ± 0.14
3.359IleAsp: 3.359 ± 0.611
4.386IleGlu: 4.386 ± 0.802
1.12IlePhe: 1.12 ± 0.282
4.012IleGly: 4.012 ± 0.659
1.213IleHis: 1.213 ± 0.267
2.613IleIle: 2.613 ± 0.504
2.519IleLys: 2.519 ± 0.536
3.452IleLeu: 3.452 ± 0.469
0.56IleMet: 0.56 ± 0.183
4.012IleAsn: 4.012 ± 0.553
2.426IlePro: 2.426 ± 0.419
2.799IleGln: 2.799 ± 0.566
3.173IleArg: 3.173 ± 0.47
3.452IleSer: 3.452 ± 0.711
3.546IleThr: 3.546 ± 0.591
2.986IleVal: 2.986 ± 0.536
0.84IleTrp: 0.84 ± 0.3
1.026IleTyr: 1.026 ± 0.277
0.0IleXaa: 0.0 ± 0.0
Lys
6.532LysAla: 6.532 ± 0.776
0.933LysCys: 0.933 ± 0.34
3.266LysAsp: 3.266 ± 0.654
2.893LysGlu: 2.893 ± 0.524
1.493LysPhe: 1.493 ± 0.346
3.452LysGly: 3.452 ± 0.561
0.467LysHis: 0.467 ± 0.252
2.986LysIle: 2.986 ± 0.632
3.639LysLys: 3.639 ± 0.636
5.972LysLeu: 5.972 ± 0.784
0.933LysMet: 0.933 ± 0.213
2.239LysAsn: 2.239 ± 0.43
2.613LysPro: 2.613 ± 0.463
2.426LysGln: 2.426 ± 0.504
3.546LysArg: 3.546 ± 0.62
2.893LysSer: 2.893 ± 0.411
3.452LysThr: 3.452 ± 0.791
3.359LysVal: 3.359 ± 0.603
0.84LysTrp: 0.84 ± 0.256
1.493LysTyr: 1.493 ± 0.459
0.0LysXaa: 0.0 ± 0.0
Leu
10.451LeuAla: 10.451 ± 0.922
0.373LeuCys: 0.373 ± 0.209
5.039LeuAsp: 5.039 ± 0.596
4.386LeuGlu: 4.386 ± 0.606
2.146LeuPhe: 2.146 ± 0.425
5.599LeuGly: 5.599 ± 0.708
1.493LeuHis: 1.493 ± 0.316
4.386LeuIle: 4.386 ± 0.645
4.292LeuLys: 4.292 ± 0.857
6.345LeuLeu: 6.345 ± 0.656
1.773LeuMet: 1.773 ± 0.463
3.639LeuAsn: 3.639 ± 0.577
3.639LeuPro: 3.639 ± 0.723
3.919LeuGln: 3.919 ± 0.583
5.319LeuArg: 5.319 ± 0.861
6.252LeuSer: 6.252 ± 0.691
7.092LeuThr: 7.092 ± 0.795
5.039LeuVal: 5.039 ± 0.795
1.213LeuTrp: 1.213 ± 0.355
2.053LeuTyr: 2.053 ± 0.441
0.0LeuXaa: 0.0 ± 0.0
Met
3.079MetAla: 3.079 ± 0.449
0.093MetCys: 0.093 ± 0.1
1.12MetAsp: 1.12 ± 0.313
1.026MetGlu: 1.026 ± 0.262
0.653MetPhe: 0.653 ± 0.264
1.493MetGly: 1.493 ± 0.394
0.56MetHis: 0.56 ± 0.374
0.746MetIle: 0.746 ± 0.279
1.493MetLys: 1.493 ± 0.4
1.68MetLeu: 1.68 ± 0.382
0.467MetMet: 0.467 ± 0.199
0.653MetAsn: 0.653 ± 0.218
1.586MetPro: 1.586 ± 0.292
1.4MetGln: 1.4 ± 0.29
2.053MetArg: 2.053 ± 0.455
1.213MetSer: 1.213 ± 0.239
1.493MetThr: 1.493 ± 0.394
0.84MetVal: 0.84 ± 0.288
0.56MetTrp: 0.56 ± 0.184
0.28MetTyr: 0.28 ± 0.161
0.0MetXaa: 0.0 ± 0.0
Asn
5.319AsnAla: 5.319 ± 0.673
0.467AsnCys: 0.467 ± 0.234
2.799AsnAsp: 2.799 ± 0.436
2.426AsnGlu: 2.426 ± 0.369
1.493AsnPhe: 1.493 ± 0.433
3.826AsnGly: 3.826 ± 0.947
0.56AsnHis: 0.56 ± 0.197
2.799AsnIle: 2.799 ± 0.559
2.799AsnLys: 2.799 ± 0.561
3.079AsnLeu: 3.079 ± 0.601
0.84AsnMet: 0.84 ± 0.293
1.866AsnAsn: 1.866 ± 0.478
2.053AsnPro: 2.053 ± 0.433
2.426AsnGln: 2.426 ± 0.409
2.613AsnArg: 2.613 ± 0.47
3.359AsnSer: 3.359 ± 0.458
2.519AsnThr: 2.519 ± 0.509
1.866AsnVal: 1.866 ± 0.534
1.12AsnTrp: 1.12 ± 0.33
1.213AsnTyr: 1.213 ± 0.268
0.0AsnXaa: 0.0 ± 0.0
Pro
3.639ProAla: 3.639 ± 0.576
0.0ProCys: 0.0 ± 0.0
3.639ProAsp: 3.639 ± 0.785
3.732ProGlu: 3.732 ± 0.492
1.306ProPhe: 1.306 ± 0.392
2.799ProGly: 2.799 ± 0.618
0.56ProHis: 0.56 ± 0.183
1.586ProIle: 1.586 ± 0.37
2.146ProLys: 2.146 ± 0.466
3.826ProLeu: 3.826 ± 0.631
0.56ProMet: 0.56 ± 0.271
0.933ProAsn: 0.933 ± 0.269
0.56ProPro: 0.56 ± 0.197
1.68ProGln: 1.68 ± 0.39
2.893ProArg: 2.893 ± 0.576
2.519ProSer: 2.519 ± 0.628
2.426ProThr: 2.426 ± 0.61
3.732ProVal: 3.732 ± 0.465
0.187ProTrp: 0.187 ± 0.13
1.306ProTyr: 1.306 ± 0.468
0.0ProXaa: 0.0 ± 0.0
Gln
4.945GlnAla: 4.945 ± 0.987
0.187GlnCys: 0.187 ± 0.127
2.613GlnAsp: 2.613 ± 0.458
2.799GlnGlu: 2.799 ± 0.385
2.146GlnPhe: 2.146 ± 0.49
2.613GlnGly: 2.613 ± 0.611
0.933GlnHis: 0.933 ± 0.308
2.893GlnIle: 2.893 ± 0.526
2.333GlnLys: 2.333 ± 0.481
3.732GlnLeu: 3.732 ± 0.6
1.773GlnMet: 1.773 ± 0.492
1.96GlnAsn: 1.96 ± 0.491
1.4GlnPro: 1.4 ± 0.323
3.266GlnGln: 3.266 ± 0.626
2.893GlnArg: 2.893 ± 0.531
1.213GlnSer: 1.213 ± 0.274
2.426GlnThr: 2.426 ± 0.623
3.079GlnVal: 3.079 ± 0.637
0.56GlnTrp: 0.56 ± 0.244
0.84GlnTyr: 0.84 ± 0.277
0.0GlnXaa: 0.0 ± 0.0
Arg
5.505ArgAla: 5.505 ± 0.528
0.56ArgCys: 0.56 ± 0.228
4.106ArgAsp: 4.106 ± 0.538
3.919ArgGlu: 3.919 ± 0.758
1.96ArgPhe: 1.96 ± 0.397
2.519ArgGly: 2.519 ± 0.696
0.653ArgHis: 0.653 ± 0.206
3.359ArgIle: 3.359 ± 0.631
3.173ArgLys: 3.173 ± 0.47
6.532ArgLeu: 6.532 ± 0.728
1.213ArgMet: 1.213 ± 0.312
2.146ArgAsn: 2.146 ± 0.406
1.306ArgPro: 1.306 ± 0.251
2.053ArgGln: 2.053 ± 0.462
2.799ArgArg: 2.799 ± 0.381
3.266ArgSer: 3.266 ± 0.573
3.266ArgThr: 3.266 ± 0.526
3.919ArgVal: 3.919 ± 0.511
1.4ArgTrp: 1.4 ± 0.343
1.773ArgTyr: 1.773 ± 0.46
0.0ArgXaa: 0.0 ± 0.0
Ser
6.438SerAla: 6.438 ± 0.766
0.093SerCys: 0.093 ± 0.085
3.266SerAsp: 3.266 ± 0.5
3.826SerGlu: 3.826 ± 0.573
2.426SerPhe: 2.426 ± 0.376
6.252SerGly: 6.252 ± 1.083
0.84SerHis: 0.84 ± 0.244
3.079SerIle: 3.079 ± 0.646
2.333SerLys: 2.333 ± 0.424
5.039SerLeu: 5.039 ± 0.706
1.026SerMet: 1.026 ± 0.3
2.426SerAsn: 2.426 ± 0.633
2.799SerPro: 2.799 ± 0.548
2.146SerGln: 2.146 ± 0.48
2.893SerArg: 2.893 ± 0.41
2.519SerSer: 2.519 ± 0.593
3.639SerThr: 3.639 ± 0.496
3.826SerVal: 3.826 ± 0.585
0.56SerTrp: 0.56 ± 0.214
1.586SerTyr: 1.586 ± 0.371
0.0SerXaa: 0.0 ± 0.0
Thr
6.905ThrAla: 6.905 ± 0.999
0.467ThrCys: 0.467 ± 0.212
3.546ThrAsp: 3.546 ± 0.381
3.173ThrGlu: 3.173 ± 0.511
2.426ThrPhe: 2.426 ± 0.399
5.879ThrGly: 5.879 ± 0.957
1.4ThrHis: 1.4 ± 0.37
3.826ThrIle: 3.826 ± 0.565
3.452ThrLys: 3.452 ± 0.702
6.158ThrLeu: 6.158 ± 0.762
1.213ThrMet: 1.213 ± 0.321
2.519ThrAsn: 2.519 ± 0.603
3.266ThrPro: 3.266 ± 0.497
2.519ThrGln: 2.519 ± 0.514
2.426ThrArg: 2.426 ± 0.568
2.799ThrSer: 2.799 ± 0.646
4.572ThrThr: 4.572 ± 0.793
4.665ThrVal: 4.665 ± 0.851
0.653ThrTrp: 0.653 ± 0.2
1.213ThrTyr: 1.213 ± 0.366
0.0ThrXaa: 0.0 ± 0.0
Val
6.718ValAla: 6.718 ± 0.573
0.84ValCys: 0.84 ± 0.3
3.079ValAsp: 3.079 ± 0.624
5.225ValGlu: 5.225 ± 0.587
1.96ValPhe: 1.96 ± 0.32
4.665ValGly: 4.665 ± 0.909
1.773ValHis: 1.773 ± 0.475
4.106ValIle: 4.106 ± 0.623
3.732ValLys: 3.732 ± 0.669
5.785ValLeu: 5.785 ± 0.713
1.493ValMet: 1.493 ± 0.378
3.546ValAsn: 3.546 ± 0.485
2.893ValPro: 2.893 ± 0.492
2.613ValGln: 2.613 ± 0.417
2.706ValArg: 2.706 ± 0.523
2.519ValSer: 2.519 ± 0.459
4.012ValThr: 4.012 ± 0.734
5.132ValVal: 5.132 ± 0.892
1.306ValTrp: 1.306 ± 0.329
1.306ValTyr: 1.306 ± 0.521
0.0ValXaa: 0.0 ± 0.0
Trp
1.493TrpAla: 1.493 ± 0.412
0.467TrpCys: 0.467 ± 0.257
1.306TrpAsp: 1.306 ± 0.297
0.373TrpGlu: 0.373 ± 0.174
0.746TrpPhe: 0.746 ± 0.286
0.933TrpGly: 0.933 ± 0.218
0.093TrpHis: 0.093 ± 0.085
0.84TrpIle: 0.84 ± 0.263
0.653TrpLys: 0.653 ± 0.292
1.68TrpLeu: 1.68 ± 0.408
0.653TrpMet: 0.653 ± 0.223
0.84TrpAsn: 0.84 ± 0.238
0.653TrpPro: 0.653 ± 0.247
0.653TrpGln: 0.653 ± 0.217
0.84TrpArg: 0.84 ± 0.288
0.933TrpSer: 0.933 ± 0.313
0.84TrpThr: 0.84 ± 0.282
1.213TrpVal: 1.213 ± 0.415
0.373TrpTrp: 0.373 ± 0.178
0.187TrpTyr: 0.187 ± 0.108
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.053TyrAla: 2.053 ± 0.547
0.373TyrCys: 0.373 ± 0.178
1.96TyrAsp: 1.96 ± 0.391
1.026TyrGlu: 1.026 ± 0.246
0.56TyrPhe: 0.56 ± 0.228
2.053TyrGly: 2.053 ± 0.364
0.467TyrHis: 0.467 ± 0.196
1.306TyrIle: 1.306 ± 0.374
1.586TyrLys: 1.586 ± 0.363
2.053TyrLeu: 2.053 ± 0.387
0.28TyrMet: 0.28 ± 0.147
1.493TyrAsn: 1.493 ± 0.515
1.773TyrPro: 1.773 ± 0.495
1.493TyrGln: 1.493 ± 0.379
2.333TyrArg: 2.333 ± 0.457
1.586TyrSer: 1.586 ± 0.301
1.493TyrThr: 1.493 ± 0.364
2.053TyrVal: 2.053 ± 0.637
0.653TyrTrp: 0.653 ± 0.283
0.56TyrTyr: 0.56 ± 0.244
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 39 proteins (10718 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski