Amino acid dipeptide frequency for Pseudomonas phage phiYY

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs in the proteome.
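
As a rough illustration of how such frequencies can be obtained, the sketch below counts overlapping residue pairs within each protein and expresses them in per mille. The function name, the one-letter input alphabet, and the normalisation by the total number of counted pairs are assumptions made for this example; the database may normalise differently.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return per-mille frequencies of overlapping dipeptides.

    `proteins` is an iterable of one-letter amino acid sequences
    (hypothetical input format). Pairs never span two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Sliding window of width 2: "MKV" yields "MK" and "KV".
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalise by the total number of counted pairs.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input: "MKAA" gives MK, KA, AA and "AAL" gives AA, AL (5 pairs).
freqs = dipeptide_frequencies(["MKAA", "AAL"])
print(freqs["AA"])  # 400.0
```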

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
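
The uniform expectation can be checked with a few lines of arithmetic. The sketch below assumes that a dipeptide counts as overrepresented whenever its frequency exceeds 1/400; whether the page applies exactly this threshold, or also takes the error estimate into account, is not stated, so `classify` is purely illustrative.

```python
STANDARD_AMINO_ACIDS = 20

# 1 / 400 expressed in per mille: 2.5 (equivalently 0.25 %).
EXPECTED_PERMILLE = 1000.0 / STANDARD_AMINO_ACIDS ** 2


def classify(freq_permille, expected=EXPECTED_PERMILLE):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


print(EXPECTED_PERMILLE)  # 2.5
print(classify(14.747))   # AlaAla in the table: over
print(classify(0.702))    # CysCys in the table: under
```

With Xaa included as a 21st symbol, the same calculation gives 1000 / 441, roughly 2.27‰.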

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
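
For example, the conversion from per mille to a fraction or a percentage can be written as follows (the helper names are made up for this illustration):

```python
def permille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0


# AlaAla is listed as 14.747 per mille in the table.
print(round(permille_to_fraction(14.747), 6))  # 0.014747
print(round(permille_to_percent(14.747), 4))   # 1.4747
```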

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰, grouped by first residue. Each entry reads dipeptide: frequency ± error.
Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 14.747 ± 2.774
AlaCys: 1.17 ± 0.557
AlaAsp: 3.979 ± 0.901
AlaGlu: 7.022 ± 1.07
AlaPhe: 4.213 ± 1.269
AlaGly: 8.895 ± 1.658
AlaHis: 1.873 ± 0.628
AlaIle: 6.086 ± 1.291
AlaLys: 6.788 ± 1.012
AlaLeu: 12.64 ± 1.614
AlaMet: 4.448 ± 0.854
AlaAsn: 3.979 ± 1.142
AlaPro: 4.448 ± 0.972
AlaGln: 4.448 ± 1.038
AlaArg: 5.852 ± 1.126
AlaSer: 8.193 ± 1.698
AlaThr: 7.257 ± 1.632
AlaVal: 10.534 ± 2.075
AlaTrp: 2.809 ± 1.326
AlaTyr: 2.341 ± 0.812
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 1.404 ± 0.715
CysCys: 0.702 ± 0.55
CysAsp: 0.468 ± 0.307
CysGlu: 0.0 ± 0.0
CysPhe: 0.936 ± 0.616
CysGly: 0.468 ± 0.323
CysHis: 0.234 ± 0.261
CysIle: 0.468 ± 0.257
CysLys: 0.0 ± 0.0
CysLeu: 0.0 ± 0.0
CysMet: 0.468 ± 0.348
CysAsn: 0.0 ± 0.0
CysPro: 0.468 ± 0.419
CysGln: 0.0 ± 0.0
CysArg: 0.234 ± 0.234
CysSer: 0.936 ± 0.54
CysThr: 0.702 ± 0.354
CysVal: 0.234 ± 0.267
CysTrp: 0.0 ± 0.0
CysTyr: 0.702 ± 0.471
CysXaa: 0.0 ± 0.0

Asp
AspAla: 3.745 ± 1.034
AspCys: 0.234 ± 0.333
AspAsp: 3.043 ± 0.857
AspGlu: 4.448 ± 1.06
AspPhe: 1.639 ± 0.588
AspGly: 3.979 ± 0.82
AspHis: 1.639 ± 0.982
AspIle: 1.639 ± 0.646
AspLys: 3.043 ± 0.868
AspLeu: 3.511 ± 0.69
AspMet: 0.468 ± 0.323
AspAsn: 0.702 ± 0.341
AspPro: 3.979 ± 1.011
AspGln: 1.404 ± 0.759
AspArg: 2.341 ± 0.653
AspSer: 3.511 ± 0.826
AspThr: 2.107 ± 0.76
AspVal: 3.511 ± 1.029
AspTrp: 0.936 ± 0.622
AspTyr: 1.639 ± 0.684
AspXaa: 0.0 ± 0.0

Glu
GluAla: 7.022 ± 1.301
GluCys: 0.468 ± 0.477
GluAsp: 1.17 ± 0.437
GluGlu: 3.745 ± 1.458
GluPhe: 1.873 ± 0.746
GluGly: 5.15 ± 0.956
GluHis: 1.17 ± 0.47
GluIle: 2.341 ± 0.785
GluLys: 2.107 ± 0.744
GluLeu: 6.086 ± 1.529
GluMet: 1.873 ± 0.667
GluAsn: 1.404 ± 0.597
GluPro: 2.575 ± 0.785
GluGln: 1.873 ± 0.69
GluArg: 2.809 ± 1.096
GluSer: 4.682 ± 1.423
GluThr: 3.043 ± 0.702
GluVal: 4.448 ± 0.92
GluTrp: 0.702 ± 0.457
GluTyr: 1.404 ± 0.705
GluXaa: 0.0 ± 0.0

Phe
PheAla: 3.745 ± 0.776
PheCys: 0.468 ± 0.419
PheAsp: 1.873 ± 0.778
PheGlu: 1.639 ± 0.793
PhePhe: 2.107 ± 1.078
PheGly: 2.575 ± 0.704
PheHis: 0.234 ± 0.332
PheIle: 1.639 ± 0.57
PheLys: 0.936 ± 0.551
PheLeu: 3.277 ± 0.941
PheMet: 0.936 ± 0.369
PheAsn: 0.936 ± 0.583
PhePro: 1.404 ± 0.526
PheGln: 0.234 ± 0.19
PheArg: 1.17 ± 0.612
PheSer: 4.213 ± 1.202
PheThr: 1.639 ± 0.782
PheVal: 3.277 ± 0.731
PheTrp: 0.234 ± 0.21
PheTyr: 1.17 ± 0.394
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 8.661 ± 1.502
GlyCys: 0.468 ± 0.305
GlyAsp: 2.575 ± 0.733
GlyGlu: 4.213 ± 0.865
GlyPhe: 1.404 ± 0.592
GlyGly: 5.384 ± 1.023
GlyHis: 1.404 ± 0.499
GlyIle: 4.213 ± 0.871
GlyLys: 3.979 ± 1.055
GlyLeu: 7.257 ± 1.519
GlyMet: 2.575 ± 0.73
GlyAsn: 2.341 ± 0.879
GlyPro: 3.277 ± 0.973
GlyGln: 1.17 ± 0.586
GlyArg: 5.15 ± 1.081
GlySer: 5.852 ± 1.182
GlyThr: 3.745 ± 0.937
GlyVal: 5.618 ± 1.451
GlyTrp: 1.404 ± 0.618
GlyTyr: 3.043 ± 0.974
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.404 ± 0.448
HisCys: 0.0 ± 0.0
HisAsp: 1.404 ± 0.435
HisGlu: 1.404 ± 0.503
HisPhe: 0.468 ± 0.392
HisGly: 0.936 ± 0.447
HisHis: 0.234 ± 0.21
HisIle: 0.936 ± 0.346
HisLys: 0.468 ± 0.405
HisLeu: 2.107 ± 0.535
HisMet: 0.468 ± 0.332
HisAsn: 0.234 ± 0.251
HisPro: 0.936 ± 0.738
HisGln: 0.702 ± 0.404
HisArg: 0.936 ± 0.415
HisSer: 1.17 ± 0.616
HisThr: 2.107 ± 0.878
HisVal: 1.873 ± 0.743
HisTrp: 0.0 ± 0.0
HisTyr: 0.702 ± 0.335
HisXaa: 0.0 ± 0.0

Ile
IleAla: 5.15 ± 1.233
IleCys: 0.702 ± 0.466
IleAsp: 3.043 ± 1.044
IleGlu: 2.107 ± 0.906
IlePhe: 1.17 ± 0.573
IleGly: 3.043 ± 0.926
IleHis: 0.702 ± 0.378
IleIle: 1.17 ± 0.815
IleLys: 1.639 ± 0.778
IleLeu: 3.745 ± 0.839
IleMet: 1.404 ± 0.522
IleAsn: 2.107 ± 1.013
IlePro: 3.745 ± 0.781
IleGln: 0.702 ± 0.396
IleArg: 3.979 ± 0.801
IleSer: 4.916 ± 1.129
IleThr: 1.639 ± 0.624
IleVal: 3.511 ± 1.083
IleTrp: 0.234 ± 0.224
IleTyr: 0.936 ± 0.395
IleXaa: 0.0 ± 0.0

Lys
LysAla: 5.384 ± 1.512
LysCys: 0.234 ± 0.225
LysAsp: 2.341 ± 0.608
LysGlu: 2.575 ± 0.759
LysPhe: 0.936 ± 0.453
LysGly: 3.511 ± 0.972
LysHis: 0.936 ± 0.31
LysIle: 0.468 ± 0.375
LysLys: 3.043 ± 1.226
LysLeu: 4.916 ± 1.424
LysMet: 1.17 ± 0.688
LysAsn: 1.404 ± 0.679
LysPro: 2.575 ± 0.956
LysGln: 2.107 ± 0.829
LysArg: 2.809 ± 0.765
LysSer: 2.341 ± 0.559
LysThr: 2.341 ± 0.588
LysVal: 3.277 ± 0.709
LysTrp: 0.0 ± 0.0
LysTyr: 0.234 ± 0.21
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 11.236 ± 1.499
LeuCys: 0.234 ± 0.333
LeuAsp: 3.277 ± 0.769
LeuGlu: 3.511 ± 0.925
LeuPhe: 2.341 ± 0.788
LeuGly: 7.491 ± 1.292
LeuHis: 1.639 ± 0.565
LeuIle: 5.618 ± 0.787
LeuLys: 3.745 ± 1.34
LeuLeu: 7.959 ± 1.271
LeuMet: 3.511 ± 0.711
LeuAsn: 2.575 ± 0.782
LeuPro: 5.15 ± 0.881
LeuGln: 3.745 ± 1.27
LeuArg: 5.384 ± 1.017
LeuSer: 7.959 ± 1.155
LeuThr: 6.32 ± 0.907
LeuVal: 7.725 ± 1.718
LeuTrp: 1.17 ± 0.361
LeuTyr: 2.575 ± 0.494
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.575 ± 0.837
MetCys: 0.0 ± 0.0
MetAsp: 1.873 ± 0.676
MetGlu: 1.17 ± 0.491
MetPhe: 1.17 ± 0.475
MetGly: 2.107 ± 0.736
MetHis: 0.468 ± 0.321
MetIle: 1.639 ± 0.637
MetLys: 0.936 ± 0.413
MetLeu: 3.277 ± 0.867
MetMet: 1.17 ± 0.568
MetAsn: 0.702 ± 0.383
MetPro: 2.107 ± 0.781
MetGln: 1.17 ± 0.559
MetArg: 2.107 ± 0.514
MetSer: 2.341 ± 0.723
MetThr: 3.277 ± 0.757
MetVal: 3.043 ± 0.691
MetTrp: 0.234 ± 0.19
MetTyr: 0.702 ± 0.656
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 4.448 ± 1.109
AsnCys: 0.234 ± 0.264
AsnAsp: 2.107 ± 0.649
AsnGlu: 2.341 ± 0.964
AsnPhe: 1.404 ± 0.679
AsnGly: 1.639 ± 0.588
AsnHis: 0.0 ± 0.0
AsnIle: 1.17 ± 0.635
AsnLys: 0.468 ± 0.265
AsnLeu: 3.979 ± 1.224
AsnMet: 1.639 ± 0.603
AsnAsn: 1.639 ± 1.008
AsnPro: 1.873 ± 0.378
AsnGln: 0.468 ± 0.266
AsnArg: 2.107 ± 0.883
AsnSer: 2.107 ± 0.578
AsnThr: 2.809 ± 0.701
AsnVal: 2.107 ± 0.696
AsnTrp: 0.234 ± 0.19
AsnTyr: 0.234 ± 0.29
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 5.384 ± 1.546
ProCys: 1.17 ± 0.596
ProAsp: 2.107 ± 0.587
ProGlu: 1.639 ± 0.506
ProPhe: 2.341 ± 0.827
ProGly: 4.448 ± 1.023
ProHis: 0.468 ± 0.272
ProIle: 1.639 ± 0.664
ProLys: 2.107 ± 0.707
ProLeu: 4.448 ± 1.148
ProMet: 2.575 ± 0.729
ProAsn: 1.17 ± 0.574
ProPro: 1.639 ± 0.707
ProGln: 0.936 ± 0.442
ProArg: 2.575 ± 0.924
ProSer: 6.788 ± 1.197
ProThr: 2.575 ± 0.531
ProVal: 4.916 ± 0.831
ProTrp: 0.234 ± 0.285
ProTyr: 1.639 ± 0.856
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 4.682 ± 1.354
GlnCys: 0.0 ± 0.0
GlnAsp: 1.17 ± 0.487
GlnGlu: 1.639 ± 0.765
GlnPhe: 0.936 ± 0.536
GlnGly: 2.341 ± 0.638
GlnHis: 1.17 ± 0.59
GlnIle: 1.404 ± 0.602
GlnLys: 0.702 ± 0.38
GlnLeu: 3.979 ± 0.654
GlnMet: 1.17 ± 0.489
GlnAsn: 0.0 ± 0.0
GlnPro: 1.17 ± 0.61
GlnGln: 0.936 ± 0.596
GlnArg: 2.575 ± 0.836
GlnSer: 1.873 ± 0.727
GlnThr: 1.17 ± 0.741
GlnVal: 0.234 ± 0.225
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.17 ± 0.511
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 8.895 ± 1.424
ArgCys: 0.234 ± 0.251
ArgAsp: 3.043 ± 1.369
ArgGlu: 3.979 ± 1.299
ArgPhe: 2.809 ± 0.837
ArgGly: 2.809 ± 0.599
ArgHis: 1.404 ± 0.596
ArgIle: 3.043 ± 0.703
ArgLys: 1.404 ± 0.511
ArgLeu: 5.384 ± 1.527
ArgMet: 1.17 ± 0.51
ArgAsn: 3.277 ± 0.714
ArgPro: 2.107 ± 0.885
ArgGln: 2.809 ± 1.084
ArgArg: 3.277 ± 0.891
ArgSer: 3.043 ± 0.841
ArgThr: 3.511 ± 0.935
ArgVal: 4.916 ± 1.021
ArgTrp: 1.404 ± 0.941
ArgTyr: 1.404 ± 0.54
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 9.831 ± 1.338
SerCys: 0.234 ± 0.261
SerAsp: 4.448 ± 0.894
SerGlu: 3.979 ± 0.842
SerPhe: 3.511 ± 0.841
SerGly: 4.682 ± 1.384
SerHis: 1.873 ± 0.634
SerIle: 2.575 ± 1.076
SerLys: 3.979 ± 0.885
SerLeu: 5.618 ± 1.313
SerMet: 1.873 ± 0.946
SerAsn: 3.979 ± 1.308
SerPro: 4.682 ± 1.108
SerGln: 1.639 ± 0.71
SerArg: 4.448 ± 1.039
SerSer: 6.554 ± 1.51
SerThr: 3.745 ± 0.739
SerVal: 5.618 ± 1.425
SerTrp: 1.404 ± 0.76
SerTyr: 2.341 ± 0.746
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 7.725 ± 1.292
ThrCys: 0.468 ± 0.353
ThrAsp: 2.575 ± 1.016
ThrGlu: 3.277 ± 1.368
ThrPhe: 1.17 ± 0.668
ThrGly: 4.916 ± 1.133
ThrHis: 1.404 ± 0.57
ThrIle: 3.043 ± 0.848
ThrLys: 2.107 ± 0.665
ThrLeu: 5.384 ± 1.178
ThrMet: 2.341 ± 0.566
ThrAsn: 1.17 ± 0.578
ThrPro: 1.639 ± 0.68
ThrGln: 1.404 ± 0.619
ThrArg: 4.448 ± 1.434
ThrSer: 3.511 ± 0.893
ThrThr: 4.448 ± 1.037
ThrVal: 4.916 ± 1.001
ThrTrp: 0.936 ± 0.345
ThrTyr: 0.468 ± 0.419
ThrXaa: 0.0 ± 0.0

Val
ValAla: 11.938 ± 2.304
ValCys: 0.702 ± 0.428
ValAsp: 4.448 ± 0.919
ValGlu: 4.916 ± 0.966
ValPhe: 2.107 ± 0.65
ValGly: 5.15 ± 1.014
ValHis: 0.936 ± 0.376
ValIle: 4.916 ± 1.529
ValLys: 3.745 ± 1.202
ValLeu: 6.32 ± 1.283
ValMet: 1.873 ± 0.618
ValAsn: 2.575 ± 0.996
ValPro: 4.682 ± 1.154
ValGln: 1.17 ± 0.576
ValArg: 4.448 ± 0.856
ValSer: 5.618 ± 1.161
ValThr: 2.809 ± 1.018
ValVal: 6.788 ± 1.689
ValTrp: 1.404 ± 0.745
ValTyr: 2.809 ± 0.661
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.404 ± 0.911
TrpCys: 0.0 ± 0.0
TrpAsp: 1.404 ± 0.428
TrpGlu: 0.468 ± 0.274
TrpPhe: 0.234 ± 0.225
TrpGly: 1.639 ± 0.637
TrpHis: 0.468 ± 0.321
TrpIle: 0.702 ± 0.308
TrpLys: 0.702 ± 0.426
TrpLeu: 1.17 ± 0.469
TrpMet: 0.0 ± 0.237
TrpAsn: 0.936 ± 0.408
TrpPro: 1.404 ± 0.538
TrpGln: 0.468 ± 0.337
TrpArg: 0.702 ± 0.312
TrpSer: 0.234 ± 0.21
TrpThr: 0.234 ± 0.333
TrpVal: 0.702 ± 0.299
TrpTrp: 0.468 ± 0.268
TrpTyr: 0.702 ± 0.361
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.341 ± 0.855
TyrCys: 0.702 ± 0.375
TyrAsp: 1.17 ± 0.642
TyrGlu: 1.873 ± 0.552
TyrPhe: 0.936 ± 0.6
TyrGly: 2.575 ± 0.789
TyrHis: 0.234 ± 0.267
TyrIle: 0.936 ± 0.499
TyrLys: 0.702 ± 0.308
TyrLeu: 2.107 ± 0.673
TyrMet: 0.702 ± 0.471
TyrAsn: 1.873 ± 0.913
TyrPro: 0.936 ± 0.381
TyrGln: 0.936 ± 0.418
TyrArg: 2.575 ± 0.766
TyrSer: 1.17 ± 0.498
TyrThr: 1.873 ± 0.639
TyrVal: 2.107 ± 0.636
TyrTrp: 0.468 ± 0.266
TyrTyr: 0.936 ± 0.404
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 18 proteins (4273 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
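
A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement 100 times, the frequency is recomputed for each resample, and the spread of those estimates is reported. Using the sample standard deviation as the error, the fixed seed, and all function names are assumptions for illustration; the source only states that bootstrapping (×100) at the protein level was used.

```python
import random
from collections import Counter
from statistics import stdev


def dipeptide_permille(proteins):
    """Per-mille frequencies of overlapping dipeptides (one-letter codes)."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of one dipeptide's frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample).get(pair, 0.0))
    # Assumption: report the sample standard deviation as the error.
    return stdev(estimates)


toy_proteome = ["MKAAL", "AALGG", "GGAAK", "MKVLS"]
error = bootstrap_error(toy_proteome, "AA")
print(error)
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.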

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski