Amino acid dipepetide frequency for Streptococcus satellite phage Javan114

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.978AlaAla: 2.978 ± 1.135
0.893AlaCys: 0.893 ± 0.402
5.658AlaAsp: 5.658 ± 1.46
6.552AlaGlu: 6.552 ± 1.588
3.574AlaPhe: 3.574 ± 0.881
4.169AlaGly: 4.169 ± 1.092
0.0AlaHis: 0.0 ± 0.0
4.467AlaIle: 4.467 ± 0.915
3.276AlaLys: 3.276 ± 0.933
5.956AlaLeu: 5.956 ± 1.057
2.085AlaMet: 2.085 ± 0.685
3.276AlaAsn: 3.276 ± 0.783
2.68AlaPro: 2.68 ± 0.703
3.574AlaGln: 3.574 ± 1.176
3.276AlaArg: 3.276 ± 0.765
3.871AlaSer: 3.871 ± 0.967
5.063AlaThr: 5.063 ± 1.531
3.871AlaVal: 3.871 ± 1.059
0.0AlaTrp: 0.0 ± 0.0
1.191AlaTyr: 1.191 ± 0.438
0.0AlaXaa: 0.0 ± 0.0
Cys
0.298CysAla: 0.298 ± 0.306
0.0CysCys: 0.0 ± 0.0
0.298CysAsp: 0.298 ± 0.291
0.298CysGlu: 0.298 ± 0.306
0.596CysPhe: 0.596 ± 0.479
0.596CysGly: 0.596 ± 0.532
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.596CysLys: 0.596 ± 0.318
0.596CysLeu: 0.596 ± 0.409
0.0CysMet: 0.0 ± 0.0
0.893CysAsn: 0.893 ± 0.558
0.596CysPro: 0.596 ± 0.409
0.596CysGln: 0.596 ± 0.532
0.596CysArg: 0.596 ± 0.384
0.298CysSer: 0.298 ± 0.368
0.596CysThr: 0.596 ± 0.355
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.489AspAla: 1.489 ± 0.618
0.596AspCys: 0.596 ± 0.393
1.787AspAsp: 1.787 ± 0.552
3.871AspGlu: 3.871 ± 0.982
3.574AspPhe: 3.574 ± 0.961
3.871AspGly: 3.871 ± 0.925
0.893AspHis: 0.893 ± 0.436
4.467AspIle: 4.467 ± 0.764
2.68AspLys: 2.68 ± 0.729
5.36AspLeu: 5.36 ± 1.361
1.191AspMet: 1.191 ± 0.486
2.085AspAsn: 2.085 ± 0.963
1.489AspPro: 1.489 ± 0.539
1.489AspGln: 1.489 ± 0.594
2.382AspArg: 2.382 ± 0.749
2.978AspSer: 2.978 ± 0.701
3.871AspThr: 3.871 ± 1.222
2.68AspVal: 2.68 ± 0.799
0.298AspTrp: 0.298 ± 0.243
3.574AspTyr: 3.574 ± 1.0
0.0AspXaa: 0.0 ± 0.0
Glu
7.445GluAla: 7.445 ± 1.543
0.298GluCys: 0.298 ± 0.306
3.871GluAsp: 3.871 ± 0.955
5.36GluGlu: 5.36 ± 1.363
1.489GluPhe: 1.489 ± 0.596
2.978GluGly: 2.978 ± 1.0
0.893GluHis: 0.893 ± 0.51
6.254GluIle: 6.254 ± 1.383
8.636GluLys: 8.636 ± 1.839
10.125GluLeu: 10.125 ± 1.879
3.574GluMet: 3.574 ± 0.946
4.765GluAsn: 4.765 ± 1.231
4.765GluPro: 4.765 ± 1.274
1.191GluGln: 1.191 ± 0.594
5.063GluArg: 5.063 ± 1.645
4.765GluSer: 4.765 ± 1.019
3.574GluThr: 3.574 ± 0.881
2.382GluVal: 2.382 ± 0.727
0.893GluTrp: 0.893 ± 0.489
4.467GluTyr: 4.467 ± 1.322
0.0GluXaa: 0.0 ± 0.0
Phe
2.085PheAla: 2.085 ± 0.742
0.596PheCys: 0.596 ± 0.354
4.765PheAsp: 4.765 ± 1.19
4.467PheGlu: 4.467 ± 1.106
2.085PhePhe: 2.085 ± 1.202
2.085PheGly: 2.085 ± 1.035
0.893PheHis: 0.893 ± 0.514
3.276PheIle: 3.276 ± 0.839
3.276PheLys: 3.276 ± 1.118
2.978PheLeu: 2.978 ± 0.793
0.893PheMet: 0.893 ± 0.528
3.276PheAsn: 3.276 ± 1.056
0.596PhePro: 0.596 ± 0.374
1.489PheGln: 1.489 ± 0.671
2.382PheArg: 2.382 ± 0.691
2.382PheSer: 2.382 ± 0.808
2.085PheThr: 2.085 ± 0.748
2.68PheVal: 2.68 ± 0.742
0.596PheTrp: 0.596 ± 0.339
0.893PheTyr: 0.893 ± 0.389
0.0PheXaa: 0.0 ± 0.0
Gly
3.276GlyAla: 3.276 ± 0.894
0.596GlyCys: 0.596 ± 0.384
2.085GlyAsp: 2.085 ± 1.005
2.382GlyGlu: 2.382 ± 0.737
1.489GlyPhe: 1.489 ± 0.738
2.382GlyGly: 2.382 ± 0.858
0.596GlyHis: 0.596 ± 0.355
5.36GlyIle: 5.36 ± 1.1
5.063GlyLys: 5.063 ± 0.749
5.063GlyLeu: 5.063 ± 1.221
1.191GlyMet: 1.191 ± 0.718
2.382GlyAsn: 2.382 ± 0.904
0.298GlyPro: 0.298 ± 0.286
1.489GlyGln: 1.489 ± 0.589
3.574GlyArg: 3.574 ± 1.018
4.765GlySer: 4.765 ± 1.336
2.68GlyThr: 2.68 ± 0.608
2.978GlyVal: 2.978 ± 1.338
1.191GlyTrp: 1.191 ± 0.583
2.68GlyTyr: 2.68 ± 1.006
0.0GlyXaa: 0.0 ± 0.0
His
2.085HisAla: 2.085 ± 1.085
0.0HisCys: 0.0 ± 0.0
0.298HisAsp: 0.298 ± 0.306
1.489HisGlu: 1.489 ± 0.602
1.191HisPhe: 1.191 ± 0.43
1.191HisGly: 1.191 ± 0.495
0.298HisHis: 0.298 ± 0.306
0.298HisIle: 0.298 ± 0.308
1.191HisLys: 1.191 ± 0.514
2.978HisLeu: 2.978 ± 0.823
0.596HisMet: 0.596 ± 0.509
0.298HisAsn: 0.298 ± 0.243
0.893HisPro: 0.893 ± 0.48
0.893HisGln: 0.893 ± 0.516
0.0HisArg: 0.0 ± 0.0
1.191HisSer: 1.191 ± 0.441
1.489HisThr: 1.489 ± 0.569
0.298HisVal: 0.298 ± 0.255
0.0HisTrp: 0.0 ± 0.0
1.191HisTyr: 1.191 ± 0.483
0.0HisXaa: 0.0 ± 0.0
Ile
6.552IleAla: 6.552 ± 1.201
0.0IleCys: 0.0 ± 0.0
4.169IleAsp: 4.169 ± 0.633
5.956IleGlu: 5.956 ± 1.28
3.276IlePhe: 3.276 ± 1.007
2.085IleGly: 2.085 ± 0.758
1.489IleHis: 1.489 ± 0.624
4.765IleIle: 4.765 ± 0.896
5.36IleLys: 5.36 ± 0.821
3.574IleLeu: 3.574 ± 0.802
1.191IleMet: 1.191 ± 0.579
4.467IleAsn: 4.467 ± 0.936
3.871IlePro: 3.871 ± 1.088
2.085IleGln: 2.085 ± 1.111
2.68IleArg: 2.68 ± 0.996
5.956IleSer: 5.956 ± 1.913
5.063IleThr: 5.063 ± 0.823
3.276IleVal: 3.276 ± 0.844
0.596IleTrp: 0.596 ± 0.382
2.085IleTyr: 2.085 ± 0.628
0.0IleXaa: 0.0 ± 0.0
Lys
5.956LysAla: 5.956 ± 1.836
0.596LysCys: 0.596 ± 0.443
2.382LysAsp: 2.382 ± 0.78
8.934LysGlu: 8.934 ± 1.546
2.382LysPhe: 2.382 ± 0.84
3.871LysGly: 3.871 ± 1.006
2.382LysHis: 2.382 ± 0.991
6.849LysIle: 6.849 ± 1.43
5.36LysLys: 5.36 ± 1.182
8.636LysLeu: 8.636 ± 1.274
2.085LysMet: 2.085 ± 0.928
4.169LysAsn: 4.169 ± 1.05
4.467LysPro: 4.467 ± 1.296
3.276LysGln: 3.276 ± 1.159
4.467LysArg: 4.467 ± 1.394
4.467LysSer: 4.467 ± 0.972
5.36LysThr: 5.36 ± 0.882
4.467LysVal: 4.467 ± 0.894
0.298LysTrp: 0.298 ± 0.291
2.085LysTyr: 2.085 ± 0.747
0.0LysXaa: 0.0 ± 0.0
Leu
5.658LeuAla: 5.658 ± 1.587
0.596LeuCys: 0.596 ± 0.44
6.254LeuAsp: 6.254 ± 0.994
10.423LeuGlu: 10.423 ± 1.798
3.574LeuPhe: 3.574 ± 1.053
4.765LeuGly: 4.765 ± 1.123
0.596LeuHis: 0.596 ± 0.382
4.765LeuIle: 4.765 ± 1.035
6.552LeuLys: 6.552 ± 1.425
10.721LeuLeu: 10.721 ± 1.764
2.085LeuMet: 2.085 ± 0.822
3.574LeuAsn: 3.574 ± 0.762
3.276LeuPro: 3.276 ± 0.974
2.978LeuGln: 2.978 ± 0.722
4.467LeuArg: 4.467 ± 1.189
5.658LeuSer: 5.658 ± 1.088
7.147LeuThr: 7.147 ± 1.162
6.552LeuVal: 6.552 ± 1.04
1.191LeuTrp: 1.191 ± 0.457
4.467LeuTyr: 4.467 ± 1.004
0.0LeuXaa: 0.0 ± 0.0
Met
2.68MetAla: 2.68 ± 0.88
0.0MetCys: 0.0 ± 0.0
1.489MetAsp: 1.489 ± 0.572
1.489MetGlu: 1.489 ± 0.483
0.893MetPhe: 0.893 ± 0.455
1.191MetGly: 1.191 ± 0.524
0.298MetHis: 0.298 ± 0.255
1.489MetIle: 1.489 ± 0.733
2.382MetLys: 2.382 ± 0.718
1.489MetLeu: 1.489 ± 0.597
0.0MetMet: 0.0 ± 0.0
1.489MetAsn: 1.489 ± 0.749
0.596MetPro: 0.596 ± 0.329
1.191MetGln: 1.191 ± 0.621
2.085MetArg: 2.085 ± 0.705
0.298MetSer: 0.298 ± 0.243
2.382MetThr: 2.382 ± 0.493
1.489MetVal: 1.489 ± 0.551
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.68AsnAla: 2.68 ± 0.79
0.0AsnCys: 0.0 ± 0.0
3.276AsnAsp: 3.276 ± 1.069
2.68AsnGlu: 2.68 ± 0.942
1.489AsnPhe: 1.489 ± 0.713
4.467AsnGly: 4.467 ± 0.873
1.191AsnHis: 1.191 ± 0.659
2.68AsnIle: 2.68 ± 1.228
3.871AsnLys: 3.871 ± 1.116
5.063AsnLeu: 5.063 ± 1.61
1.191AsnMet: 1.191 ± 0.711
2.978AsnAsn: 2.978 ± 1.007
2.382AsnPro: 2.382 ± 0.74
1.489AsnGln: 1.489 ± 0.469
2.978AsnArg: 2.978 ± 1.27
3.574AsnSer: 3.574 ± 0.984
2.085AsnThr: 2.085 ± 0.972
2.978AsnVal: 2.978 ± 0.915
0.298AsnTrp: 0.298 ± 0.368
2.68AsnTyr: 2.68 ± 0.809
0.0AsnXaa: 0.0 ± 0.0
Pro
2.085ProAla: 2.085 ± 0.58
0.0ProCys: 0.0 ± 0.0
0.596ProAsp: 0.596 ± 0.329
3.276ProGlu: 3.276 ± 1.124
2.68ProPhe: 2.68 ± 0.804
1.191ProGly: 1.191 ± 0.588
0.596ProHis: 0.596 ± 0.394
1.191ProIle: 1.191 ± 0.627
5.36ProLys: 5.36 ± 1.163
2.382ProLeu: 2.382 ± 0.665
0.893ProMet: 0.893 ± 0.516
2.382ProAsn: 2.382 ± 1.192
1.489ProPro: 1.489 ± 0.577
1.489ProGln: 1.489 ± 0.748
3.276ProArg: 3.276 ± 0.859
2.382ProSer: 2.382 ± 0.674
2.382ProThr: 2.382 ± 0.686
4.169ProVal: 4.169 ± 0.73
0.298ProTrp: 0.298 ± 0.243
1.489ProTyr: 1.489 ± 0.74
0.0ProXaa: 0.0 ± 0.0
Gln
4.169GlnAla: 4.169 ± 1.516
0.0GlnCys: 0.0 ± 0.0
1.191GlnAsp: 1.191 ± 0.428
3.574GlnGlu: 3.574 ± 0.778
2.68GlnPhe: 2.68 ± 1.229
1.489GlnGly: 1.489 ± 0.642
0.596GlnHis: 0.596 ± 0.397
2.085GlnIle: 2.085 ± 0.648
2.978GlnLys: 2.978 ± 1.668
3.574GlnLeu: 3.574 ± 0.855
0.0GlnMet: 0.0 ± 0.0
2.382GlnAsn: 2.382 ± 0.836
0.893GlnPro: 0.893 ± 0.451
2.382GlnGln: 2.382 ± 0.909
2.382GlnArg: 2.382 ± 0.567
2.085GlnSer: 2.085 ± 0.553
1.191GlnThr: 1.191 ± 0.495
1.191GlnVal: 1.191 ± 0.701
0.0GlnTrp: 0.0 ± 0.0
1.191GlnTyr: 1.191 ± 0.482
0.0GlnXaa: 0.0 ± 0.0
Arg
3.276ArgAla: 3.276 ± 1.11
0.596ArgCys: 0.596 ± 0.384
2.382ArgAsp: 2.382 ± 0.87
4.765ArgGlu: 4.765 ± 1.138
2.68ArgPhe: 2.68 ± 0.657
2.382ArgGly: 2.382 ± 0.936
1.191ArgHis: 1.191 ± 0.509
4.467ArgIle: 4.467 ± 0.918
5.36ArgLys: 5.36 ± 1.309
5.063ArgLeu: 5.063 ± 1.753
1.191ArgMet: 1.191 ± 0.525
2.382ArgAsn: 2.382 ± 0.752
2.68ArgPro: 2.68 ± 0.881
2.382ArgGln: 2.382 ± 1.051
2.68ArgArg: 2.68 ± 0.648
2.68ArgSer: 2.68 ± 0.885
3.276ArgThr: 3.276 ± 0.74
3.276ArgVal: 3.276 ± 0.932
0.298ArgTrp: 0.298 ± 0.287
2.68ArgTyr: 2.68 ± 1.015
0.0ArgXaa: 0.0 ± 0.0
Ser
3.276SerAla: 3.276 ± 0.909
0.596SerCys: 0.596 ± 0.44
2.68SerAsp: 2.68 ± 0.709
4.467SerGlu: 4.467 ± 1.169
1.489SerPhe: 1.489 ± 0.551
3.574SerGly: 3.574 ± 0.939
2.382SerHis: 2.382 ± 0.955
3.871SerIle: 3.871 ± 1.238
5.956SerLys: 5.956 ± 1.451
6.552SerLeu: 6.552 ± 1.38
1.191SerMet: 1.191 ± 0.645
3.574SerAsn: 3.574 ± 1.034
2.085SerPro: 2.085 ± 0.818
0.893SerGln: 0.893 ± 0.65
2.68SerArg: 2.68 ± 0.819
3.871SerSer: 3.871 ± 1.152
4.169SerThr: 4.169 ± 1.266
3.276SerVal: 3.276 ± 1.159
0.596SerTrp: 0.596 ± 0.395
4.169SerTyr: 4.169 ± 1.248
0.0SerXaa: 0.0 ± 0.0
Thr
4.467ThrAla: 4.467 ± 1.243
0.0ThrCys: 0.0 ± 0.0
2.68ThrAsp: 2.68 ± 0.66
6.254ThrGlu: 6.254 ± 1.433
5.36ThrPhe: 5.36 ± 1.42
3.871ThrGly: 3.871 ± 0.893
1.489ThrHis: 1.489 ± 0.417
4.765ThrIle: 4.765 ± 1.423
4.467ThrLys: 4.467 ± 1.056
6.552ThrLeu: 6.552 ± 1.318
0.893ThrMet: 0.893 ± 0.368
1.787ThrAsn: 1.787 ± 0.975
1.787ThrPro: 1.787 ± 0.813
1.787ThrGln: 1.787 ± 0.823
3.276ThrArg: 3.276 ± 0.792
2.978ThrSer: 2.978 ± 0.866
2.085ThrThr: 2.085 ± 1.005
4.169ThrVal: 4.169 ± 1.48
0.596ThrTrp: 0.596 ± 0.345
1.787ThrTyr: 1.787 ± 0.717
0.0ThrXaa: 0.0 ± 0.0
Val
4.765ValAla: 4.765 ± 1.015
1.191ValCys: 1.191 ± 0.503
2.68ValAsp: 2.68 ± 1.151
3.276ValGlu: 3.276 ± 0.959
1.489ValPhe: 1.489 ± 0.548
2.68ValGly: 2.68 ± 0.578
1.191ValHis: 1.191 ± 0.859
2.382ValIle: 2.382 ± 0.722
4.765ValLys: 4.765 ± 1.412
3.574ValLeu: 3.574 ± 0.883
0.596ValMet: 0.596 ± 0.481
2.978ValAsn: 2.978 ± 0.732
2.68ValPro: 2.68 ± 0.939
1.191ValGln: 1.191 ± 0.65
3.574ValArg: 3.574 ± 0.862
4.169ValSer: 4.169 ± 1.135
4.467ValThr: 4.467 ± 1.083
5.063ValVal: 5.063 ± 1.372
0.0ValTrp: 0.0 ± 0.0
3.574ValTyr: 3.574 ± 1.361
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.893TrpGlu: 0.893 ± 0.531
0.0TrpPhe: 0.0 ± 0.0
0.596TrpGly: 0.596 ± 0.509
0.298TrpHis: 0.298 ± 0.243
1.191TrpIle: 1.191 ± 0.513
0.298TrpLys: 0.298 ± 0.255
1.191TrpLeu: 1.191 ± 0.409
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.596TrpPro: 0.596 ± 0.405
0.298TrpGln: 0.298 ± 0.243
0.596TrpArg: 0.596 ± 0.395
0.893TrpSer: 0.893 ± 0.439
0.0TrpThr: 0.0 ± 0.0
0.596TrpVal: 0.596 ± 0.362
0.0TrpTrp: 0.0 ± 0.0
0.298TrpTyr: 0.298 ± 0.243
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.191TyrAla: 1.191 ± 0.535
0.298TyrCys: 0.298 ± 0.291
2.085TyrAsp: 2.085 ± 0.776
2.382TyrGlu: 2.382 ± 0.885
1.787TyrPhe: 1.787 ± 0.882
2.085TyrGly: 2.085 ± 0.636
0.596TyrHis: 0.596 ± 0.329
3.871TyrIle: 3.871 ± 1.0
5.063TyrLys: 5.063 ± 1.278
3.871TyrLeu: 3.871 ± 0.577
1.787TyrMet: 1.787 ± 0.653
0.893TyrAsn: 0.893 ± 0.413
1.489TyrPro: 1.489 ± 0.859
3.871TyrGln: 3.871 ± 0.859
3.276TyrArg: 3.276 ± 1.054
2.085TyrSer: 2.085 ± 0.851
2.085TyrThr: 2.085 ± 0.539
1.191TyrVal: 1.191 ± 0.441
0.596TyrTrp: 0.596 ± 0.35
2.68TyrTyr: 2.68 ± 1.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 17 proteins (3359 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski