Amino acid dipepetide frequency for Streptococcus satellite phage Javan727

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.779AlaAla: 0.779 ± 0.372
0.779AlaCys: 0.779 ± 0.543
2.857AlaAsp: 2.857 ± 0.8
5.974AlaGlu: 5.974 ± 1.058
2.857AlaPhe: 2.857 ± 0.96
3.117AlaGly: 3.117 ± 0.674
0.519AlaHis: 0.519 ± 0.374
4.416AlaIle: 4.416 ± 0.964
4.935AlaLys: 4.935 ± 0.876
5.195AlaLeu: 5.195 ± 0.799
1.558AlaMet: 1.558 ± 0.827
2.338AlaAsn: 2.338 ± 0.767
1.558AlaPro: 1.558 ± 0.537
2.078AlaGln: 2.078 ± 0.732
3.896AlaArg: 3.896 ± 1.161
1.818AlaSer: 1.818 ± 0.594
3.636AlaThr: 3.636 ± 1.173
3.896AlaVal: 3.896 ± 0.705
0.26AlaTrp: 0.26 ± 0.255
1.299AlaTyr: 1.299 ± 0.491
0.0AlaXaa: 0.0 ± 0.0
Cys
0.779CysAla: 0.779 ± 0.456
0.0CysCys: 0.0 ± 0.0
1.039CysAsp: 1.039 ± 0.436
0.0CysGlu: 0.0 ± 0.0
0.26CysPhe: 0.26 ± 0.255
0.779CysGly: 0.779 ± 0.61
0.26CysHis: 0.26 ± 0.207
0.0CysIle: 0.0 ± 0.0
0.519CysLys: 0.519 ± 0.363
1.558CysLeu: 1.558 ± 0.858
0.26CysMet: 0.26 ± 0.296
0.519CysAsn: 0.519 ± 0.493
0.519CysPro: 0.519 ± 0.369
0.519CysGln: 0.519 ± 0.367
0.519CysArg: 0.519 ± 0.342
0.0CysSer: 0.0 ± 0.0
0.0CysThr: 0.0 ± 0.0
0.26CysVal: 0.26 ± 0.255
0.0CysTrp: 0.0 ± 0.0
0.519CysTyr: 0.519 ± 0.347
0.0CysXaa: 0.0 ± 0.0
Asp
0.519AspAla: 0.519 ± 0.306
1.039AspCys: 1.039 ± 0.537
4.156AspAsp: 4.156 ± 0.812
6.234AspGlu: 6.234 ± 1.346
4.156AspPhe: 4.156 ± 1.127
3.896AspGly: 3.896 ± 1.057
0.0AspHis: 0.0 ± 0.0
5.195AspIle: 5.195 ± 1.088
4.935AspLys: 4.935 ± 1.009
7.013AspLeu: 7.013 ± 1.551
2.338AspMet: 2.338 ± 0.511
2.597AspAsn: 2.597 ± 0.942
1.299AspPro: 1.299 ± 0.573
0.519AspGln: 0.519 ± 0.373
2.597AspArg: 2.597 ± 0.856
2.338AspSer: 2.338 ± 0.848
2.857AspThr: 2.857 ± 0.577
2.857AspVal: 2.857 ± 0.778
0.519AspTrp: 0.519 ± 0.33
3.896AspTyr: 3.896 ± 1.342
0.0AspXaa: 0.0 ± 0.0
Glu
4.156GluAla: 4.156 ± 0.99
0.779GluCys: 0.779 ± 0.375
2.857GluAsp: 2.857 ± 1.006
5.455GluGlu: 5.455 ± 1.073
2.338GluPhe: 2.338 ± 0.853
2.338GluGly: 2.338 ± 0.631
1.558GluHis: 1.558 ± 0.609
7.532GluIle: 7.532 ± 1.658
6.753GluLys: 6.753 ± 0.863
9.87GluLeu: 9.87 ± 1.974
3.117GluMet: 3.117 ± 0.723
5.974GluAsn: 5.974 ± 1.37
3.117GluPro: 3.117 ± 0.781
4.156GluGln: 4.156 ± 0.978
4.675GluArg: 4.675 ± 0.913
5.714GluSer: 5.714 ± 1.145
4.935GluThr: 4.935 ± 1.196
5.455GluVal: 5.455 ± 1.466
0.779GluTrp: 0.779 ± 0.531
3.636GluTyr: 3.636 ± 0.973
0.0GluXaa: 0.0 ± 0.0
Phe
1.558PheAla: 1.558 ± 0.705
0.0PheCys: 0.0 ± 0.0
3.377PheAsp: 3.377 ± 0.743
2.857PheGlu: 2.857 ± 0.908
1.039PhePhe: 1.039 ± 0.431
2.857PheGly: 2.857 ± 0.905
0.519PheHis: 0.519 ± 0.302
2.857PheIle: 2.857 ± 0.818
3.636PheLys: 3.636 ± 0.772
4.675PheLeu: 4.675 ± 0.971
0.779PheMet: 0.779 ± 0.412
3.377PheAsn: 3.377 ± 0.983
0.779PhePro: 0.779 ± 0.374
1.558PheGln: 1.558 ± 0.555
1.818PheArg: 1.818 ± 0.692
2.338PheSer: 2.338 ± 0.633
2.597PheThr: 2.597 ± 0.888
1.039PheVal: 1.039 ± 0.445
0.779PheTrp: 0.779 ± 0.417
2.078PheTyr: 2.078 ± 0.586
0.0PheXaa: 0.0 ± 0.0
Gly
1.039GlyAla: 1.039 ± 0.605
0.26GlyCys: 0.26 ± 0.207
3.117GlyAsp: 3.117 ± 0.896
4.156GlyGlu: 4.156 ± 0.816
1.818GlyPhe: 1.818 ± 0.578
2.597GlyGly: 2.597 ± 0.878
0.519GlyHis: 0.519 ± 0.345
3.377GlyIle: 3.377 ± 0.788
5.195GlyLys: 5.195 ± 1.058
5.974GlyLeu: 5.974 ± 1.079
1.558GlyMet: 1.558 ± 0.644
2.857GlyAsn: 2.857 ± 0.938
0.26GlyPro: 0.26 ± 0.207
2.338GlyGln: 2.338 ± 0.655
3.117GlyArg: 3.117 ± 0.681
2.078GlySer: 2.078 ± 0.71
3.636GlyThr: 3.636 ± 0.941
3.117GlyVal: 3.117 ± 0.977
0.779GlyTrp: 0.779 ± 0.532
3.117GlyTyr: 3.117 ± 0.883
0.0GlyXaa: 0.0 ± 0.0
His
1.818HisAla: 1.818 ± 0.709
0.0HisCys: 0.0 ± 0.0
0.519HisAsp: 0.519 ± 0.349
0.779HisGlu: 0.779 ± 0.515
1.039HisPhe: 1.039 ± 0.564
1.299HisGly: 1.299 ± 0.574
0.26HisHis: 0.26 ± 0.255
1.039HisIle: 1.039 ± 0.553
1.039HisLys: 1.039 ± 0.585
1.039HisLeu: 1.039 ± 0.438
0.0HisMet: 0.0 ± 0.0
1.558HisAsn: 1.558 ± 0.788
0.0HisPro: 0.0 ± 0.0
0.779HisGln: 0.779 ± 0.465
1.039HisArg: 1.039 ± 0.611
0.779HisSer: 0.779 ± 0.359
1.299HisThr: 1.299 ± 0.616
0.0HisVal: 0.0 ± 0.0
0.0HisTrp: 0.0 ± 0.0
1.039HisTyr: 1.039 ± 0.456
0.0HisXaa: 0.0 ± 0.0
Ile
5.714IleAla: 5.714 ± 1.123
0.26IleCys: 0.26 ± 0.268
5.974IleAsp: 5.974 ± 2.021
5.714IleGlu: 5.714 ± 1.261
3.117IlePhe: 3.117 ± 0.944
2.857IleGly: 2.857 ± 0.957
1.039IleHis: 1.039 ± 0.607
3.636IleIle: 3.636 ± 0.746
5.714IleLys: 5.714 ± 1.14
3.896IleLeu: 3.896 ± 0.735
0.519IleMet: 0.519 ± 0.33
2.078IleAsn: 2.078 ± 0.959
3.377IlePro: 3.377 ± 0.84
2.857IleGln: 2.857 ± 0.783
3.896IleArg: 3.896 ± 1.067
5.195IleSer: 5.195 ± 0.878
3.896IleThr: 3.896 ± 0.854
3.117IleVal: 3.117 ± 0.666
0.26IleTrp: 0.26 ± 0.282
2.597IleTyr: 2.597 ± 0.707
0.0IleXaa: 0.0 ± 0.0
Lys
7.532LysAla: 7.532 ± 1.329
0.26LysCys: 0.26 ± 0.255
2.597LysAsp: 2.597 ± 0.947
8.571LysGlu: 8.571 ± 1.316
2.857LysPhe: 2.857 ± 0.873
4.416LysGly: 4.416 ± 1.078
4.675LysHis: 4.675 ± 1.147
4.935LysIle: 4.935 ± 1.044
6.494LysLys: 6.494 ± 1.375
6.753LysLeu: 6.753 ± 1.087
1.558LysMet: 1.558 ± 0.857
4.416LysAsn: 4.416 ± 1.111
3.636LysPro: 3.636 ± 1.345
4.416LysGln: 4.416 ± 0.99
7.273LysArg: 7.273 ± 1.361
5.974LysSer: 5.974 ± 0.903
6.753LysThr: 6.753 ± 1.232
4.416LysVal: 4.416 ± 0.809
1.299LysTrp: 1.299 ± 0.477
2.338LysTyr: 2.338 ± 0.869
0.0LysXaa: 0.0 ± 0.0
Leu
5.974LeuAla: 5.974 ± 1.211
1.039LeuCys: 1.039 ± 0.458
9.61LeuAsp: 9.61 ± 1.756
10.39LeuGlu: 10.39 ± 1.677
3.377LeuPhe: 3.377 ± 0.987
3.636LeuGly: 3.636 ± 0.867
0.779LeuHis: 0.779 ± 0.419
4.156LeuIle: 4.156 ± 0.961
8.831LeuLys: 8.831 ± 1.45
8.831LeuLeu: 8.831 ± 1.615
2.078LeuMet: 2.078 ± 0.693
7.273LeuAsn: 7.273 ± 1.454
3.377LeuPro: 3.377 ± 1.035
3.377LeuGln: 3.377 ± 0.985
3.636LeuArg: 3.636 ± 0.999
5.974LeuSer: 5.974 ± 0.741
8.052LeuThr: 8.052 ± 1.105
5.195LeuVal: 5.195 ± 0.732
0.0LeuTrp: 0.0 ± 0.0
3.377LeuTyr: 3.377 ± 0.706
0.0LeuXaa: 0.0 ± 0.0
Met
1.818MetAla: 1.818 ± 0.725
0.0MetCys: 0.0 ± 0.0
1.299MetAsp: 1.299 ± 0.551
3.636MetGlu: 3.636 ± 0.98
0.519MetPhe: 0.519 ± 0.433
2.857MetGly: 2.857 ± 0.912
0.26MetHis: 0.26 ± 0.271
1.558MetIle: 1.558 ± 0.585
2.338MetLys: 2.338 ± 0.906
1.558MetLeu: 1.558 ± 0.581
1.299MetMet: 1.299 ± 0.489
2.338MetAsn: 2.338 ± 0.874
0.519MetPro: 0.519 ± 0.333
1.299MetGln: 1.299 ± 0.656
1.299MetArg: 1.299 ± 0.66
1.818MetSer: 1.818 ± 0.711
3.377MetThr: 3.377 ± 1.184
0.779MetVal: 0.779 ± 0.37
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.338AsnAla: 2.338 ± 0.608
0.519AsnCys: 0.519 ± 0.551
3.896AsnAsp: 3.896 ± 1.133
2.857AsnGlu: 2.857 ± 0.791
2.338AsnPhe: 2.338 ± 0.775
4.675AsnGly: 4.675 ± 0.877
1.299AsnHis: 1.299 ± 0.662
3.636AsnIle: 3.636 ± 1.195
5.714AsnLys: 5.714 ± 0.67
6.234AsnLeu: 6.234 ± 1.313
2.857AsnMet: 2.857 ± 1.061
2.597AsnAsn: 2.597 ± 0.825
2.857AsnPro: 2.857 ± 0.721
2.338AsnGln: 2.338 ± 0.619
1.299AsnArg: 1.299 ± 0.589
2.857AsnSer: 2.857 ± 0.79
2.857AsnThr: 2.857 ± 0.834
2.338AsnVal: 2.338 ± 0.627
0.519AsnTrp: 0.519 ± 0.291
2.338AsnTyr: 2.338 ± 0.843
0.0AsnXaa: 0.0 ± 0.0
Pro
2.078ProAla: 2.078 ± 0.561
0.26ProCys: 0.26 ± 0.276
1.818ProAsp: 1.818 ± 0.727
2.597ProGlu: 2.597 ± 1.047
2.078ProPhe: 2.078 ± 0.951
1.039ProGly: 1.039 ± 0.647
0.0ProHis: 0.0 ± 0.0
1.558ProIle: 1.558 ± 0.53
3.896ProLys: 3.896 ± 0.966
1.818ProLeu: 1.818 ± 0.648
0.779ProMet: 0.779 ± 0.433
2.338ProAsn: 2.338 ± 0.901
1.558ProPro: 1.558 ± 0.663
1.558ProGln: 1.558 ± 0.685
4.156ProArg: 4.156 ± 0.749
1.558ProSer: 1.558 ± 0.631
1.039ProThr: 1.039 ± 0.443
1.818ProVal: 1.818 ± 0.546
0.26ProTrp: 0.26 ± 0.246
1.558ProTyr: 1.558 ± 0.436
0.0ProXaa: 0.0 ± 0.0
Gln
2.857GlnAla: 2.857 ± 1.006
0.0GlnCys: 0.0 ± 0.0
2.078GlnAsp: 2.078 ± 0.845
4.156GlnGlu: 4.156 ± 0.744
0.779GlnPhe: 0.779 ± 0.401
2.078GlnGly: 2.078 ± 0.711
0.0GlnHis: 0.0 ± 0.0
2.338GlnIle: 2.338 ± 0.637
4.416GlnLys: 4.416 ± 0.848
4.935GlnLeu: 4.935 ± 1.131
0.26GlnMet: 0.26 ± 0.257
2.597GlnAsn: 2.597 ± 0.864
1.039GlnPro: 1.039 ± 0.562
2.078GlnGln: 2.078 ± 0.635
2.078GlnArg: 2.078 ± 0.761
1.818GlnSer: 1.818 ± 0.573
2.078GlnThr: 2.078 ± 0.629
2.078GlnVal: 2.078 ± 0.681
0.779GlnTrp: 0.779 ± 0.383
1.299GlnTyr: 1.299 ± 0.544
0.0GlnXaa: 0.0 ± 0.0
Arg
3.377ArgAla: 3.377 ± 0.902
0.519ArgCys: 0.519 ± 0.393
2.857ArgAsp: 2.857 ± 0.996
6.234ArgGlu: 6.234 ± 1.02
3.377ArgPhe: 3.377 ± 0.911
3.117ArgGly: 3.117 ± 0.85
0.779ArgHis: 0.779 ± 0.314
4.156ArgIle: 4.156 ± 0.842
3.896ArgLys: 3.896 ± 0.834
6.234ArgLeu: 6.234 ± 1.473
1.558ArgMet: 1.558 ± 0.705
3.117ArgAsn: 3.117 ± 1.066
1.039ArgPro: 1.039 ± 0.48
2.338ArgGln: 2.338 ± 0.773
1.818ArgArg: 1.818 ± 0.862
1.039ArgSer: 1.039 ± 0.445
3.117ArgThr: 3.117 ± 0.7
2.857ArgVal: 2.857 ± 0.729
0.26ArgTrp: 0.26 ± 0.255
3.377ArgTyr: 3.377 ± 1.135
0.0ArgXaa: 0.0 ± 0.0
Ser
2.597SerAla: 2.597 ± 0.736
0.519SerCys: 0.519 ± 0.342
2.078SerAsp: 2.078 ± 0.554
4.675SerGlu: 4.675 ± 0.999
2.597SerPhe: 2.597 ± 0.888
2.857SerGly: 2.857 ± 0.602
0.519SerHis: 0.519 ± 0.321
4.675SerIle: 4.675 ± 0.698
4.156SerLys: 4.156 ± 0.734
5.195SerLeu: 5.195 ± 1.213
2.338SerMet: 2.338 ± 0.79
4.156SerAsn: 4.156 ± 1.242
1.558SerPro: 1.558 ± 0.631
2.078SerGln: 2.078 ± 0.619
2.597SerArg: 2.597 ± 1.11
2.597SerSer: 2.597 ± 0.958
2.857SerThr: 2.857 ± 0.862
0.779SerVal: 0.779 ± 0.369
0.519SerTrp: 0.519 ± 0.313
1.558SerTyr: 1.558 ± 0.743
0.0SerXaa: 0.0 ± 0.0
Thr
2.597ThrAla: 2.597 ± 0.82
0.779ThrCys: 0.779 ± 0.467
3.117ThrAsp: 3.117 ± 0.7
3.117ThrGlu: 3.117 ± 0.661
3.117ThrPhe: 3.117 ± 1.624
1.818ThrGly: 1.818 ± 0.691
1.299ThrHis: 1.299 ± 0.563
4.416ThrIle: 4.416 ± 0.938
7.792ThrLys: 7.792 ± 1.701
7.532ThrLeu: 7.532 ± 1.019
2.597ThrMet: 2.597 ± 0.861
1.299ThrAsn: 1.299 ± 0.611
2.857ThrPro: 2.857 ± 1.283
2.078ThrGln: 2.078 ± 0.682
2.857ThrArg: 2.857 ± 0.77
2.857ThrSer: 2.857 ± 0.655
4.416ThrThr: 4.416 ± 1.338
3.636ThrVal: 3.636 ± 1.237
0.519ThrTrp: 0.519 ± 0.362
5.195ThrTyr: 5.195 ± 1.416
0.0ThrXaa: 0.0 ± 0.0
Val
3.377ValAla: 3.377 ± 1.024
0.26ValCys: 0.26 ± 0.246
2.597ValAsp: 2.597 ± 0.674
4.675ValGlu: 4.675 ± 1.137
1.039ValPhe: 1.039 ± 0.38
2.338ValGly: 2.338 ± 0.856
0.0ValHis: 0.0 ± 0.0
3.117ValIle: 3.117 ± 0.625
5.195ValLys: 5.195 ± 1.009
4.935ValLeu: 4.935 ± 0.878
1.039ValMet: 1.039 ± 0.614
3.117ValAsn: 3.117 ± 0.878
2.078ValPro: 2.078 ± 0.781
1.039ValGln: 1.039 ± 0.447
2.597ValArg: 2.597 ± 1.069
1.558ValSer: 1.558 ± 0.779
4.156ValThr: 4.156 ± 0.793
3.636ValVal: 3.636 ± 1.014
0.519ValTrp: 0.519 ± 0.42
2.857ValTyr: 2.857 ± 0.834
0.0ValXaa: 0.0 ± 0.0
Trp
0.519TrpAla: 0.519 ± 0.326
0.0TrpCys: 0.0 ± 0.0
0.26TrpAsp: 0.26 ± 0.255
1.299TrpGlu: 1.299 ± 0.531
0.0TrpPhe: 0.0 ± 0.0
0.519TrpGly: 0.519 ± 0.42
0.26TrpHis: 0.26 ± 0.246
0.26TrpIle: 0.26 ± 0.246
0.519TrpLys: 0.519 ± 0.414
0.779TrpLeu: 0.779 ± 0.4
0.0TrpMet: 0.0 ± 0.0
0.26TrpAsn: 0.26 ± 0.254
0.26TrpPro: 0.26 ± 0.246
0.519TrpGln: 0.519 ± 0.339
0.519TrpArg: 0.519 ± 0.336
1.039TrpSer: 1.039 ± 0.474
0.26TrpThr: 0.26 ± 0.24
0.779TrpVal: 0.779 ± 0.335
0.0TrpTrp: 0.0 ± 0.0
0.519TrpTyr: 0.519 ± 0.336
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.338TyrAla: 2.338 ± 0.739
1.039TyrCys: 1.039 ± 0.452
2.857TyrAsp: 2.857 ± 0.622
1.818TyrGlu: 1.818 ± 0.814
1.818TyrPhe: 1.818 ± 0.808
1.818TyrGly: 1.818 ± 0.863
0.779TyrHis: 0.779 ± 0.391
2.857TyrIle: 2.857 ± 0.629
5.455TyrLys: 5.455 ± 1.616
4.935TyrLeu: 4.935 ± 1.074
1.818TyrMet: 1.818 ± 0.624
1.558TyrAsn: 1.558 ± 0.511
2.078TyrPro: 2.078 ± 0.891
1.818TyrGln: 1.818 ± 0.705
3.377TyrArg: 3.377 ± 0.861
1.558TyrSer: 1.558 ± 0.489
2.078TyrThr: 2.078 ± 0.575
2.078TyrVal: 2.078 ± 0.75
0.519TyrTrp: 0.519 ± 0.339
1.558TyrTyr: 1.558 ± 0.658
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 26 proteins (3851 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski