Amino acid dipeptide frequency for Streptococcus satellite phage Javan329

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
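The counting described above can be sketched as follows. This is a minimal illustration, not the method used by the database: the function names (`normalize`, `dipeptide_permille`, `classify`) are hypothetical, pairs are assumed to be counted within each protein (never across protein boundaries), and any letter outside the 20 standard one-letter codes is assumed to map to X (Xaa).

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # 20 standard amino acids, one-letter codes
ALPHABET = STANDARD + "X"           # X stands for Xaa (non-standard/unknown)
EXPECTED_PERMILLE = 1000.0 / 400    # uniform expectation over 400 dipeptides = 2.5

def normalize(seq):
    """Upper-case a sequence and map every non-standard letter to X (assumption)."""
    return "".join(c if c in STANDARD else "X" for c in seq.upper())

def dipeptide_permille(proteins):
    """Return the frequency of all 441 ordered dipeptides in per mille."""
    counts = Counter()
    for seq in proteins:
        seq = normalize(seq)
        # Overlapping adjacent pairs inside one protein only.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }

def classify(freq_permille):
    """Label a frequency relative to the uniform expectation of 2.5 per mille."""
    if freq_permille > EXPECTED_PERMILLE:
        return "over"
    if freq_permille < EXPECTED_PERMILLE:
        return "under"
    return "expected"
```

With this convention, a table value such as 3.774 would be labelled "over" and 0.629 "under", which mirrors the red/blue marking described above.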

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
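The unit conversion can be written out explicitly. The helper names below are hypothetical and only illustrate the arithmetic: a per mille value times 10^-3 gives a fraction, and divided by 10 gives a percentage.

```python
def permille_to_fraction(value):
    """Convert per mille to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def permille_to_percent(value):
    """Convert per mille to percent (1% = 10 per mille)."""
    return value / 10.0

def fraction_to_permille(value):
    """Convert a plain fraction to per mille."""
    return value * 1000.0

# Example: the GluLys entry of the table, 10.377 per mille.
glu_lys_fraction = permille_to_fraction(10.377)
glu_lys_percent = permille_to_percent(10.377)

# Uniform expectation for 400 dipeptides, expressed in per mille.
uniform_permille = fraction_to_permille(1 / 400)
```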

For more information see sequence space article on Wikipedia.

Columns (second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.629 ± 0.461
AlaCys: 0.314 ± 0.305
AlaAsp: 2.83 ± 1.061
AlaGlu: 3.774 ± 0.912
AlaPhe: 2.83 ± 0.643
AlaGly: 1.887 ± 0.682
AlaHis: 0.629 ± 0.372
AlaIle: 3.459 ± 1.393
AlaLys: 5.975 ± 1.72
AlaLeu: 3.774 ± 1.231
AlaMet: 1.887 ± 0.723
AlaAsn: 2.516 ± 0.819
AlaPro: 0.943 ± 0.526
AlaGln: 2.201 ± 0.855
AlaArg: 2.83 ± 1.079
AlaSer: 1.887 ± 1.01
AlaThr: 5.346 ± 1.193
AlaVal: 1.572 ± 0.682
AlaTrp: 0.0 ± 0.0
AlaTyr: 2.201 ± 0.754
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.629 ± 0.458
CysCys: 0.0 ± 0.0
CysAsp: 0.314 ± 0.354
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 0.629 ± 0.395
CysHis: 0.0 ± 0.0
CysIle: 0.314 ± 0.294
CysLys: 0.629 ± 0.443
CysLeu: 0.943 ± 0.646
CysMet: 0.314 ± 0.337
CysAsn: 0.0 ± 0.0
CysPro: 0.629 ± 0.441
CysGln: 0.0 ± 0.0
CysArg: 0.314 ± 0.251
CysSer: 0.314 ± 0.305
CysThr: 0.314 ± 0.312
CysVal: 0.314 ± 0.354
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.258 ± 0.577
AspCys: 0.629 ± 0.376
AspAsp: 5.66 ± 1.482
AspGlu: 4.403 ± 1.473
AspPhe: 3.774 ± 1.084
AspGly: 4.088 ± 0.775
AspHis: 0.629 ± 0.376
AspIle: 6.289 ± 1.124
AspLys: 5.975 ± 1.202
AspLeu: 6.918 ± 1.551
AspMet: 0.629 ± 0.47
AspAsn: 4.717 ± 1.195
AspPro: 0.629 ± 0.519
AspGln: 1.258 ± 0.439
AspArg: 2.201 ± 0.732
AspSer: 5.975 ± 1.198
AspThr: 3.459 ± 0.819
AspVal: 3.774 ± 0.749
AspTrp: 0.314 ± 0.354
AspTyr: 2.201 ± 0.96
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.83 ± 0.823
GluCys: 0.943 ± 0.514
GluAsp: 5.975 ± 1.624
GluGlu: 5.031 ± 1.341
GluPhe: 4.403 ± 0.843
GluGly: 3.459 ± 1.187
GluHis: 2.201 ± 0.894
GluIle: 9.434 ± 1.76
GluLys: 10.377 ± 1.494
GluLeu: 9.748 ± 2.084
GluMet: 1.258 ± 0.627
GluAsn: 4.717 ± 0.879
GluPro: 1.572 ± 0.721
GluGln: 3.774 ± 0.903
GluArg: 2.201 ± 0.791
GluSer: 5.346 ± 1.064
GluThr: 5.66 ± 1.096
GluVal: 2.201 ± 0.933
GluTrp: 0.629 ± 0.365
GluTyr: 3.774 ± 1.4
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.258 ± 0.501
PheCys: 0.314 ± 0.354
PheAsp: 4.088 ± 1.179
PheGlu: 3.145 ± 1.203
PhePhe: 2.516 ± 0.83
PheGly: 1.887 ± 0.657
PheHis: 1.258 ± 0.602
PheIle: 2.516 ± 0.625
PheLys: 5.66 ± 1.206
PheLeu: 4.088 ± 1.129
PheMet: 0.629 ± 0.396
PheAsn: 1.258 ± 0.795
PhePro: 0.943 ± 0.622
PheGln: 0.943 ± 0.448
PheArg: 1.572 ± 0.485
PheSer: 4.717 ± 1.103
PheThr: 2.516 ± 0.928
PheVal: 1.887 ± 1.06
PheTrp: 0.943 ± 0.458
PheTyr: 0.943 ± 0.417
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.516 ± 0.815
GlyCys: 0.314 ± 0.312
GlyAsp: 0.943 ± 0.614
GlyGlu: 2.83 ± 0.997
GlyPhe: 2.201 ± 0.707
GlyGly: 2.201 ± 1.273
GlyHis: 1.887 ± 0.574
GlyIle: 2.83 ± 0.776
GlyLys: 5.346 ± 0.915
GlyLeu: 4.403 ± 1.251
GlyMet: 1.887 ± 0.737
GlyAsn: 2.83 ± 0.893
GlyPro: 0.0 ± 0.0
GlyGln: 1.258 ± 0.899
GlyArg: 2.516 ± 0.639
GlySer: 3.145 ± 1.428
GlyThr: 2.83 ± 1.008
GlyVal: 2.516 ± 0.655
GlyTrp: 0.314 ± 0.354
GlyTyr: 4.717 ± 1.628
GlyXaa: 0.0 ± 0.0
His
HisAla: 3.145 ± 1.172
HisCys: 0.0 ± 0.0
HisAsp: 1.258 ± 0.606
HisGlu: 0.629 ± 0.421
HisPhe: 0.314 ± 0.233
HisGly: 0.314 ± 0.28
HisHis: 0.0 ± 0.0
HisIle: 1.258 ± 0.436
HisLys: 1.887 ± 0.627
HisLeu: 2.516 ± 0.939
HisMet: 0.629 ± 0.461
HisAsn: 0.629 ± 0.385
HisPro: 0.629 ± 0.39
HisGln: 0.943 ± 0.517
HisArg: 0.0 ± 0.0
HisSer: 1.887 ± 0.755
HisThr: 1.258 ± 0.726
HisVal: 0.0 ± 0.0
HisTrp: 0.0 ± 0.0
HisTyr: 0.943 ± 0.536
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.516 ± 0.912
IleCys: 0.0 ± 0.0
IleAsp: 5.031 ± 0.941
IleGlu: 5.975 ± 1.836
IlePhe: 1.887 ± 0.847
IleGly: 2.516 ± 0.556
IleHis: 1.258 ± 0.587
IleIle: 5.975 ± 1.378
IleLys: 7.233 ± 1.575
IleLeu: 9.119 ± 1.172
IleMet: 0.0 ± 0.0
IleAsn: 6.604 ± 1.164
IlePro: 2.516 ± 0.761
IleGln: 3.145 ± 0.871
IleArg: 3.459 ± 0.86
IleSer: 6.289 ± 1.161
IleThr: 5.346 ± 1.112
IleVal: 1.887 ± 0.606
IleTrp: 0.0 ± 0.0
IleTyr: 3.774 ± 0.983
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.774 ± 1.058
LysCys: 0.943 ± 0.538
LysAsp: 5.975 ± 1.63
LysGlu: 12.579 ± 1.546
LysPhe: 3.774 ± 1.36
LysGly: 6.289 ± 1.611
LysHis: 2.201 ± 0.8
LysIle: 9.119 ± 1.437
LysLys: 9.434 ± 1.916
LysLeu: 5.66 ± 1.201
LysMet: 2.201 ± 0.817
LysAsn: 7.547 ± 1.339
LysPro: 2.83 ± 1.015
LysGln: 3.774 ± 1.231
LysArg: 5.975 ± 1.54
LysSer: 5.975 ± 1.36
LysThr: 7.233 ± 1.678
LysVal: 4.717 ± 1.146
LysTrp: 0.943 ± 0.438
LysTyr: 4.403 ± 0.916
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.604 ± 1.579
LeuCys: 0.629 ± 0.401
LeuAsp: 8.176 ± 1.472
LeuGlu: 9.748 ± 1.935
LeuPhe: 3.774 ± 0.915
LeuGly: 4.717 ± 0.953
LeuHis: 1.258 ± 0.977
LeuIle: 5.346 ± 1.254
LeuLys: 8.805 ± 1.387
LeuLeu: 7.862 ± 1.492
LeuMet: 3.145 ± 0.745
LeuAsn: 5.346 ± 1.362
LeuPro: 2.201 ± 0.723
LeuGln: 2.201 ± 0.784
LeuArg: 3.774 ± 0.745
LeuSer: 5.975 ± 1.278
LeuThr: 5.66 ± 1.461
LeuVal: 6.604 ± 1.366
LeuTrp: 0.314 ± 0.294
LeuTyr: 3.774 ± 1.13
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.258 ± 0.589
MetCys: 0.314 ± 0.28
MetAsp: 2.516 ± 0.704
MetGlu: 1.887 ± 0.622
MetPhe: 0.943 ± 0.513
MetGly: 0.314 ± 0.38
MetHis: 0.0 ± 0.0
MetIle: 0.314 ± 0.364
MetLys: 2.516 ± 0.633
MetLeu: 1.887 ± 0.514
MetMet: 0.314 ± 0.284
MetAsn: 0.943 ± 0.564
MetPro: 0.629 ± 0.382
MetGln: 0.943 ± 0.511
MetArg: 0.943 ± 0.474
MetSer: 1.572 ± 0.686
MetThr: 2.516 ± 1.278
MetVal: 1.258 ± 0.597
MetTrp: 0.0 ± 0.0
MetTyr: 0.629 ± 0.407
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.145 ± 0.97
AsnCys: 0.314 ± 0.312
AsnAsp: 1.572 ± 0.708
AsnGlu: 3.774 ± 1.082
AsnPhe: 2.83 ± 0.726
AsnGly: 3.145 ± 1.238
AsnHis: 1.887 ± 0.796
AsnIle: 2.516 ± 0.788
AsnLys: 5.66 ± 1.425
AsnLeu: 5.031 ± 1.131
AsnMet: 1.887 ± 0.906
AsnAsn: 3.145 ± 0.988
AsnPro: 2.201 ± 0.756
AsnGln: 2.83 ± 0.901
AsnArg: 2.201 ± 0.586
AsnSer: 3.459 ± 1.166
AsnThr: 5.031 ± 1.633
AsnVal: 1.887 ± 0.832
AsnTrp: 0.629 ± 0.364
AsnTyr: 4.403 ± 1.226
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.943 ± 0.417
ProCys: 0.0 ± 0.0
ProAsp: 1.887 ± 0.74
ProGlu: 2.83 ± 1.006
ProPhe: 1.572 ± 0.551
ProGly: 0.314 ± 0.294
ProHis: 0.314 ± 0.305
ProIle: 1.258 ± 0.569
ProLys: 3.459 ± 0.9
ProLeu: 0.943 ± 0.513
ProMet: 0.314 ± 0.364
ProAsn: 0.0 ± 0.0
ProPro: 0.629 ± 0.44
ProGln: 0.314 ± 0.337
ProArg: 1.258 ± 0.534
ProSer: 0.629 ± 0.421
ProThr: 1.258 ± 0.694
ProVal: 1.572 ± 0.521
ProTrp: 0.314 ± 0.28
ProTyr: 1.572 ± 0.55
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.088 ± 1.026
GlnCys: 0.0 ± 0.0
GlnAsp: 2.83 ± 0.909
GlnGlu: 2.83 ± 0.876
GlnPhe: 1.572 ± 0.717
GlnGly: 0.629 ± 0.434
GlnHis: 0.629 ± 0.376
GlnIle: 2.516 ± 0.381
GlnLys: 3.459 ± 0.976
GlnLeu: 1.258 ± 0.555
GlnMet: 0.314 ± 0.354
GlnAsn: 1.887 ± 0.756
GlnPro: 0.943 ± 0.504
GlnGln: 0.629 ± 0.452
GlnArg: 1.572 ± 0.783
GlnSer: 1.887 ± 0.674
GlnThr: 1.572 ± 0.982
GlnVal: 3.459 ± 0.905
GlnTrp: 0.629 ± 0.42
GlnTyr: 1.258 ± 0.649
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.201 ± 0.774
ArgCys: 0.0 ± 0.0
ArgAsp: 2.83 ± 0.739
ArgGlu: 3.459 ± 0.967
ArgPhe: 1.887 ± 0.746
ArgGly: 2.201 ± 0.814
ArgHis: 0.943 ± 0.456
ArgIle: 2.83 ± 0.739
ArgLys: 5.66 ± 0.95
ArgLeu: 5.031 ± 1.291
ArgMet: 1.258 ± 0.517
ArgAsn: 3.145 ± 0.699
ArgPro: 0.943 ± 0.552
ArgGln: 2.201 ± 0.559
ArgArg: 2.516 ± 0.852
ArgSer: 1.572 ± 0.852
ArgThr: 2.516 ± 0.78
ArgVal: 0.629 ± 0.365
ArgTrp: 0.314 ± 0.251
ArgTyr: 2.516 ± 0.943
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.459 ± 1.777
SerCys: 0.314 ± 0.294
SerAsp: 3.774 ± 1.03
SerGlu: 9.748 ± 2.013
SerPhe: 2.201 ± 1.085
SerGly: 3.774 ± 0.903
SerHis: 1.258 ± 0.782
SerIle: 4.403 ± 0.923
SerLys: 6.604 ± 1.767
SerLeu: 8.176 ± 1.489
SerMet: 0.629 ± 0.4
SerAsn: 2.83 ± 1.025
SerPro: 1.258 ± 0.502
SerGln: 1.572 ± 0.866
SerArg: 1.572 ± 0.504
SerSer: 5.031 ± 1.539
SerThr: 4.717 ± 1.232
SerVal: 2.201 ± 0.763
SerTrp: 0.629 ± 0.446
SerTyr: 1.887 ± 0.705
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.145 ± 1.055
ThrCys: 0.314 ± 0.28
ThrAsp: 1.258 ± 0.499
ThrGlu: 3.774 ± 1.093
ThrPhe: 3.145 ± 1.131
ThrGly: 5.346 ± 1.08
ThrHis: 1.258 ± 0.479
ThrIle: 6.604 ± 1.55
ThrLys: 5.346 ± 1.571
ThrLeu: 6.918 ± 1.03
ThrMet: 1.887 ± 0.698
ThrAsn: 3.145 ± 0.794
ThrPro: 1.258 ± 0.718
ThrGln: 2.201 ± 1.213
ThrArg: 3.459 ± 1.294
ThrSer: 3.145 ± 1.35
ThrThr: 3.774 ± 1.121
ThrVal: 5.66 ± 1.18
ThrTrp: 0.943 ± 0.428
ThrTyr: 3.459 ± 1.542
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.201 ± 0.807
ValCys: 0.314 ± 0.354
ValAsp: 4.403 ± 0.905
ValGlu: 3.145 ± 0.986
ValPhe: 0.943 ± 0.555
ValGly: 2.201 ± 0.657
ValHis: 0.0 ± 0.0
ValIle: 4.403 ± 1.053
ValLys: 5.975 ± 1.473
ValLeu: 3.459 ± 1.019
ValMet: 0.629 ± 0.421
ValAsn: 4.088 ± 0.832
ValPro: 0.629 ± 0.422
ValGln: 1.572 ± 0.861
ValArg: 1.887 ± 0.679
ValSer: 3.774 ± 0.821
ValThr: 2.201 ± 0.737
ValVal: 3.145 ± 0.707
ValTrp: 0.0 ± 0.0
ValTyr: 2.516 ± 0.93
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.258 ± 0.51
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 2.201 ± 0.748
TrpPhe: 0.314 ± 0.28
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.943 ± 0.495
TrpLys: 0.314 ± 0.294
TrpLeu: 0.943 ± 0.686
TrpMet: 0.0 ± 0.0
TrpAsn: 0.314 ± 0.251
TrpPro: 0.0 ± 0.0
TrpGln: 0.314 ± 0.312
TrpArg: 0.0 ± 0.0
TrpSer: 0.314 ± 0.312
TrpThr: 0.0 ± 0.0
TrpVal: 0.314 ± 0.251
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.629 ± 0.468
TyrCys: 0.0 ± 0.0
TyrAsp: 3.459 ± 0.948
TyrGlu: 4.088 ± 0.809
TyrPhe: 2.201 ± 0.871
TyrGly: 1.572 ± 0.484
TyrHis: 0.629 ± 0.372
TyrIle: 2.201 ± 0.706
TyrLys: 5.031 ± 1.454
TyrLeu: 7.233 ± 1.466
TyrMet: 1.572 ± 0.643
TyrAsn: 1.887 ± 0.728
TyrPro: 0.0 ± 0.0
TyrGln: 1.887 ± 0.704
TyrArg: 4.403 ± 1.331
TyrSer: 3.145 ± 1.089
TyrThr: 2.83 ± 0.669
TyrVal: 1.887 ± 0.717
TyrTrp: 0.314 ± 0.354
TyrTyr: 0.943 ± 0.495
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 18 proteins (3181 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
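A protein-level bootstrap of this kind can be sketched as below. This is an illustration under stated assumptions, not the database's actual procedure: whole proteins are resampled with replacement, the frequency of one dipeptide is recomputed for each of 100 replicates, and the error is taken to be the sample standard deviation of those replicates (the page does not say which spread measure is reported). The function names and the `seed` parameter are hypothetical.

```python
import random
import statistics

def pair_permille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes, e.g. "EK") in per mille."""
    hits = 0
    total = 0
    for seq in proteins:
        # Overlapping adjacent pairs inside one protein only (assumption).
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the dipeptide frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the proteome, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, so the spread reflects how much the estimate depends on which proteins happen to be in the set. With only 18 proteins, relatively large errors for rare dipeptides are to be expected.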

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski