Amino acid dipeptide frequency for Streptococcus satellite phage Javan154

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins and all amino acids were equally common, each of the 400 dipeptides would be expected at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
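The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own code: the function name, the toy sequences and the use of one-letter residue codes are assumptions made for the example. Dividing by the total number of residues (rather than by the number of pairs) is also an assumption, chosen because the smallest non-zero value in the table, 0.265‰, matches 1/3773.

```python
from collections import Counter

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for overlapping residue pairs."""
    counts = Counter()
    total_residues = 0
    for seq in proteins:
        total_residues += len(seq)
        # Overlapping pairs inside one protein; pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    # Assumption: normalise by the total residue count, not by the pair count.
    return {pair: 1000.0 * n / total_residues for pair, n in counts.items()}

# Uniform expectation if all 400 standard dipeptides were equally likely.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%

# Hypothetical toy sequences (one-letter codes), not Javan154 data.
toy = ["MKELLK", "AKEL"]
freqs = dipeptide_per_mille(toy)

# Dipeptides above the uniform expectation (the ones the page would mark in red).
over = sorted(pair for pair, f in freqs.items() if f > EXPECTED_PER_MILLE)
print(over)
```

With such a tiny input every observed pair exceeds the 2.5‰ expectation; on a real proteome the comparison only becomes informative once thousands of residues are counted.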

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions. On this scale, the random expectation of 0.25% corresponds to 2.5‰.

For more information, see the sequence space article on Wikipedia.
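As a small worked example of the unit conversion, the sketch below turns a per mille value into a plain fraction and then into an approximate occurrence count. The helper names are hypothetical, and using the 3773 residues reported at the bottom of the page as the denominator is an assumption about how the values were normalised.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille / 1000.0

def approx_count(value_per_mille, total_residues=3773):
    """Estimate the underlying occurrence count, assuming residue-count normalisation."""
    return round(per_mille_to_fraction(value_per_mille) * total_residues)

# GluLeu is listed at 10.074 per mille, the smallest non-zero cells at 0.265.
print(per_mille_to_fraction(10.074))
print(approx_count(10.074), approx_count(0.265))
```

Under that assumption, every non-zero cell in the table corresponds to a whole number of occurrences, which is why the same values (0.265, 0.53, 0.795, ...) recur so often.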

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.0 ± 0.0
AlaCys: 0.795 ± 0.384
AlaAsp: 4.242 ± 1.18
AlaGlu: 5.567 ± 0.828
AlaPhe: 3.446 ± 0.792
AlaGly: 0.795 ± 0.55
AlaHis: 0.53 ± 0.5
AlaIle: 6.363 ± 1.215
AlaLys: 5.037 ± 1.253
AlaLeu: 5.037 ± 1.085
AlaMet: 0.795 ± 0.451
AlaAsn: 4.507 ± 0.688
AlaPro: 0.795 ± 0.382
AlaGln: 2.651 ± 0.589
AlaArg: 2.121 ± 0.61
AlaSer: 3.181 ± 1.135
AlaThr: 4.242 ± 0.72
AlaVal: 3.712 ± 0.942
AlaTrp: 0.53 ± 0.47
AlaTyr: 2.121 ± 0.65
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.53 ± 0.314
CysCys: 0.265 ± 0.263
CysAsp: 0.795 ± 0.427
CysGlu: 0.265 ± 0.263
CysPhe: 0.265 ± 0.25
CysGly: 1.06 ± 0.583
CysHis: 0.265 ± 0.271
CysIle: 0.265 ± 0.233
CysLys: 0.265 ± 0.25
CysLeu: 1.06 ± 0.451
CysMet: 0.0 ± 0.0
CysAsn: 0.53 ± 0.372
CysPro: 0.265 ± 0.263
CysGln: 0.265 ± 0.263
CysArg: 0.795 ± 0.441
CysSer: 0.53 ± 0.338
CysThr: 0.0 ± 0.0
CysVal: 0.265 ± 0.233
CysTrp: 0.0 ± 0.0
CysTyr: 0.53 ± 0.363
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.06 ± 0.443
AspCys: 1.856 ± 0.824
AspAsp: 2.916 ± 0.875
AspGlu: 3.977 ± 1.246
AspPhe: 2.651 ± 0.849
AspGly: 2.651 ± 0.879
AspHis: 0.795 ± 0.414
AspIle: 5.302 ± 0.876
AspLys: 5.302 ± 1.604
AspLeu: 5.037 ± 0.856
AspMet: 0.795 ± 0.65
AspAsn: 2.651 ± 0.71
AspPro: 0.53 ± 0.334
AspGln: 2.121 ± 0.623
AspArg: 3.181 ± 0.782
AspSer: 3.446 ± 1.149
AspThr: 3.446 ± 0.796
AspVal: 1.326 ± 0.572
AspTrp: 0.265 ± 0.235
AspTyr: 5.302 ± 1.393
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.832 ± 1.207
GluCys: 1.06 ± 0.556
GluAsp: 4.242 ± 1.136
GluGlu: 4.772 ± 1.054
GluPhe: 3.181 ± 1.311
GluGly: 1.856 ± 0.614
GluHis: 1.591 ± 0.581
GluIle: 6.893 ± 1.187
GluLys: 7.158 ± 0.935
GluLeu: 10.074 ± 1.725
GluMet: 1.856 ± 0.763
GluAsn: 2.651 ± 0.956
GluPro: 1.326 ± 0.704
GluGln: 4.507 ± 1.423
GluArg: 4.242 ± 1.028
GluSer: 2.121 ± 0.596
GluThr: 5.302 ± 1.072
GluVal: 3.712 ± 1.045
GluTrp: 0.795 ± 0.407
GluTyr: 3.446 ± 0.825
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.651 ± 0.682
PheCys: 0.0 ± 0.0
PheAsp: 2.386 ± 0.663
PheGlu: 3.181 ± 0.768
PhePhe: 1.06 ± 0.472
PheGly: 2.121 ± 0.606
PheHis: 1.591 ± 0.464
PheIle: 3.181 ± 0.785
PheLys: 3.712 ± 0.862
PheLeu: 2.916 ± 0.892
PheMet: 0.53 ± 0.348
PheAsn: 3.446 ± 0.824
PhePro: 1.06 ± 0.574
PheGln: 0.795 ± 0.393
PheArg: 2.121 ± 0.565
PheSer: 2.651 ± 0.8
PheThr: 3.446 ± 0.987
PheVal: 2.386 ± 0.613
PheTrp: 0.0 ± 0.0
PheTyr: 1.326 ± 0.519
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.121 ± 1.015
GlyCys: 0.53 ± 0.323
GlyAsp: 1.856 ± 0.881
GlyGlu: 2.916 ± 0.897
GlyPhe: 2.121 ± 0.632
GlyGly: 3.181 ± 1.097
GlyHis: 1.06 ± 0.617
GlyIle: 3.712 ± 0.783
GlyLys: 4.772 ± 1.372
GlyLeu: 5.567 ± 1.486
GlyMet: 1.06 ± 0.393
GlyAsn: 2.386 ± 0.96
GlyPro: 0.0 ± 0.0
GlyGln: 1.591 ± 0.596
GlyArg: 1.591 ± 0.475
GlySer: 1.856 ± 0.834
GlyThr: 3.181 ± 0.953
GlyVal: 1.591 ± 0.736
GlyTrp: 0.53 ± 0.466
GlyTyr: 3.446 ± 1.14
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.591 ± 0.759
HisCys: 0.265 ± 0.233
HisAsp: 0.0 ± 0.0
HisGlu: 0.265 ± 0.268
HisPhe: 0.265 ± 0.235
HisGly: 0.795 ± 0.441
HisHis: 0.0 ± 0.0
HisIle: 2.386 ± 0.619
HisLys: 1.06 ± 0.568
HisLeu: 1.326 ± 0.672
HisMet: 0.0 ± 0.0
HisAsn: 1.06 ± 0.711
HisPro: 1.06 ± 0.705
HisGln: 1.06 ± 0.647
HisArg: 1.06 ± 0.56
HisSer: 0.265 ± 0.25
HisThr: 1.326 ± 0.609
HisVal: 0.53 ± 0.406
HisTrp: 0.265 ± 0.263
HisTyr: 1.856 ± 0.762
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.567 ± 1.269
IleCys: 0.53 ± 0.341
IleAsp: 6.098 ± 1.021
IleGlu: 6.363 ± 1.572
IlePhe: 3.446 ± 0.929
IleGly: 1.856 ± 0.532
IleHis: 1.06 ± 0.617
IleIle: 4.772 ± 0.962
IleLys: 9.014 ± 1.631
IleLeu: 4.772 ± 0.805
IleMet: 0.53 ± 0.341
IleAsn: 3.977 ± 1.047
IlePro: 2.651 ± 0.948
IleGln: 2.386 ± 0.695
IleArg: 2.651 ± 0.509
IleSer: 4.507 ± 1.274
IleThr: 5.037 ± 1.016
IleVal: 2.916 ± 0.725
IleTrp: 0.0 ± 0.0
IleTyr: 3.181 ± 0.648
IleXaa: 0.0 ± 0.0
Lys
LysAla: 9.544 ± 1.735
LysCys: 0.0 ± 0.0
LysAsp: 2.651 ± 0.882
LysGlu: 11.135 ± 1.297
LysPhe: 2.916 ± 0.647
LysGly: 4.242 ± 1.149
LysHis: 1.856 ± 0.679
LysIle: 4.242 ± 1.103
LysLys: 6.893 ± 1.368
LysLeu: 7.953 ± 1.519
LysMet: 3.181 ± 0.938
LysAsn: 6.098 ± 0.998
LysPro: 3.977 ± 1.291
LysGln: 4.772 ± 1.113
LysArg: 6.363 ± 1.083
LysSer: 4.772 ± 0.961
LysThr: 5.567 ± 1.417
LysVal: 6.363 ± 1.137
LysTrp: 1.326 ± 0.622
LysTyr: 3.181 ± 0.92
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.302 ± 0.976
LeuCys: 0.53 ± 0.325
LeuAsp: 7.423 ± 1.098
LeuGlu: 9.809 ± 1.779
LeuPhe: 3.712 ± 1.069
LeuGly: 5.567 ± 1.038
LeuHis: 0.795 ± 0.424
LeuIle: 8.218 ± 1.322
LeuLys: 8.484 ± 1.496
LeuLeu: 9.809 ± 2.092
LeuMet: 1.856 ± 0.599
LeuAsn: 5.567 ± 1.327
LeuPro: 3.977 ± 1.142
LeuGln: 2.651 ± 0.801
LeuArg: 1.856 ± 0.629
LeuSer: 7.423 ± 1.428
LeuThr: 5.567 ± 1.08
LeuVal: 3.977 ± 1.175
LeuTrp: 1.06 ± 0.406
LeuTyr: 3.181 ± 0.877
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.06 ± 0.385
MetCys: 0.0 ± 0.0
MetAsp: 0.53 ± 0.373
MetGlu: 1.06 ± 0.392
MetPhe: 0.53 ± 0.407
MetGly: 0.265 ± 0.297
MetHis: 0.0 ± 0.0
MetIle: 1.06 ± 0.452
MetLys: 2.386 ± 0.657
MetLeu: 3.181 ± 1.11
MetMet: 0.265 ± 0.259
MetAsn: 1.856 ± 0.588
MetPro: 0.0 ± 0.0
MetGln: 0.265 ± 0.259
MetArg: 1.06 ± 0.501
MetSer: 1.856 ± 0.641
MetThr: 2.916 ± 1.178
MetVal: 0.265 ± 0.271
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.977 ± 0.835
AsnCys: 0.0 ± 0.0
AsnAsp: 2.916 ± 0.963
AsnGlu: 2.386 ± 0.93
AsnPhe: 1.591 ± 0.563
AsnGly: 3.446 ± 0.963
AsnHis: 1.856 ± 0.526
AsnIle: 3.712 ± 1.16
AsnLys: 6.363 ± 1.394
AsnLeu: 4.242 ± 0.873
AsnMet: 1.591 ± 0.821
AsnAsn: 2.121 ± 0.574
AsnPro: 2.916 ± 0.718
AsnGln: 5.037 ± 1.251
AsnArg: 3.712 ± 0.759
AsnSer: 2.121 ± 0.609
AsnThr: 3.181 ± 1.056
AsnVal: 3.181 ± 0.893
AsnTrp: 0.53 ± 0.357
AsnTyr: 3.181 ± 0.871
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.06 ± 0.54
ProCys: 0.265 ± 0.235
ProAsp: 0.53 ± 0.382
ProGlu: 3.446 ± 0.968
ProPhe: 1.06 ± 0.552
ProGly: 0.795 ± 0.457
ProHis: 0.53 ± 0.292
ProIle: 2.386 ± 0.759
ProLys: 4.242 ± 1.038
ProLeu: 2.121 ± 0.737
ProMet: 0.53 ± 0.31
ProAsn: 2.121 ± 0.92
ProPro: 0.795 ± 0.387
ProGln: 0.795 ± 0.544
ProArg: 2.386 ± 0.858
ProSer: 2.121 ± 0.569
ProThr: 2.651 ± 0.622
ProVal: 2.386 ± 0.662
ProTrp: 0.265 ± 0.233
ProTyr: 1.591 ± 0.663
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.651 ± 0.76
GlnCys: 0.265 ± 0.267
GlnAsp: 1.326 ± 0.687
GlnGlu: 2.651 ± 0.684
GlnPhe: 1.326 ± 0.571
GlnGly: 3.712 ± 0.92
GlnHis: 0.795 ± 0.471
GlnIle: 2.651 ± 0.759
GlnLys: 3.712 ± 0.859
GlnLeu: 5.832 ± 0.826
GlnMet: 0.53 ± 0.411
GlnAsn: 2.386 ± 0.838
GlnPro: 2.386 ± 0.855
GlnGln: 2.121 ± 0.718
GlnArg: 2.916 ± 0.698
GlnSer: 2.121 ± 0.782
GlnThr: 2.386 ± 0.779
GlnVal: 3.977 ± 0.64
GlnTrp: 0.53 ± 0.469
GlnTyr: 1.591 ± 0.818
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.06 ± 0.563
ArgCys: 0.53 ± 0.325
ArgAsp: 2.916 ± 0.792
ArgGlu: 2.386 ± 0.705
ArgPhe: 2.121 ± 0.625
ArgGly: 2.121 ± 0.769
ArgHis: 1.326 ± 0.415
ArgIle: 2.651 ± 0.99
ArgLys: 5.302 ± 1.129
ArgLeu: 5.832 ± 0.929
ArgMet: 0.795 ± 0.426
ArgAsn: 3.446 ± 1.032
ArgPro: 1.591 ± 0.713
ArgGln: 3.181 ± 0.67
ArgArg: 2.121 ± 0.666
ArgSer: 2.651 ± 1.098
ArgThr: 1.856 ± 0.533
ArgVal: 4.242 ± 0.701
ArgTrp: 0.795 ± 0.468
ArgTyr: 3.181 ± 0.773
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.651 ± 0.778
SerCys: 0.265 ± 0.263
SerAsp: 5.832 ± 1.04
SerGlu: 3.181 ± 1.225
SerPhe: 1.856 ± 0.637
SerGly: 2.121 ± 0.833
SerHis: 0.53 ± 0.38
SerIle: 3.181 ± 1.243
SerLys: 6.098 ± 1.159
SerLeu: 5.302 ± 1.107
SerMet: 1.06 ± 0.424
SerAsn: 2.386 ± 0.744
SerPro: 2.121 ± 0.604
SerGln: 3.446 ± 1.122
SerArg: 2.386 ± 0.869
SerSer: 3.181 ± 0.637
SerThr: 3.712 ± 1.018
SerVal: 3.181 ± 1.114
SerTrp: 0.53 ± 0.329
SerTyr: 2.916 ± 0.88
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.507 ± 1.491
ThrCys: 0.0 ± 0.0
ThrAsp: 2.121 ± 0.745
ThrGlu: 3.446 ± 0.673
ThrPhe: 2.651 ± 1.558
ThrGly: 4.242 ± 0.688
ThrHis: 0.265 ± 0.25
ThrIle: 4.242 ± 1.358
ThrLys: 6.098 ± 1.213
ThrLeu: 6.628 ± 1.531
ThrMet: 0.53 ± 0.34
ThrAsn: 2.651 ± 1.171
ThrPro: 3.712 ± 1.396
ThrGln: 3.181 ± 0.773
ThrArg: 3.446 ± 0.83
ThrSer: 3.977 ± 1.225
ThrThr: 3.181 ± 1.156
ThrVal: 3.712 ± 0.995
ThrTrp: 0.53 ± 0.367
ThrTyr: 4.772 ± 1.133
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.446 ± 0.775
ValCys: 0.53 ± 0.381
ValAsp: 2.386 ± 0.957
ValGlu: 3.712 ± 0.873
ValPhe: 3.181 ± 0.737
ValGly: 2.386 ± 0.49
ValHis: 0.265 ± 0.263
ValIle: 3.712 ± 0.803
ValLys: 5.567 ± 0.736
ValLeu: 5.567 ± 1.037
ValMet: 1.06 ± 0.464
ValAsn: 3.181 ± 0.893
ValPro: 1.856 ± 0.649
ValGln: 1.326 ± 0.614
ValArg: 1.591 ± 0.547
ValSer: 4.242 ± 0.95
ValThr: 4.507 ± 1.133
ValVal: 3.181 ± 0.925
ValTrp: 0.53 ± 0.466
ValTyr: 1.326 ± 0.765
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.265 ± 0.269
TrpCys: 0.0 ± 0.0
TrpAsp: 1.326 ± 0.589
TrpGlu: 0.795 ± 0.422
TrpPhe: 0.265 ± 0.233
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.795 ± 0.481
TrpLys: 0.53 ± 0.339
TrpLeu: 1.326 ± 0.449
TrpMet: 0.0 ± 0.0
TrpAsn: 0.265 ± 0.269
TrpPro: 0.0 ± 0.0
TrpGln: 0.53 ± 0.363
TrpArg: 0.265 ± 0.263
TrpSer: 1.06 ± 0.39
TrpThr: 0.265 ± 0.233
TrpVal: 1.06 ± 0.484
TrpTrp: 0.265 ± 0.269
TrpTyr: 0.265 ± 0.25
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.591 ± 0.664
TyrCys: 0.53 ± 0.33
TyrAsp: 2.121 ± 0.662
TyrGlu: 5.302 ± 1.049
TyrPhe: 2.916 ± 0.694
TyrGly: 1.856 ± 0.73
TyrHis: 1.326 ± 0.553
TyrIle: 1.591 ± 0.7
TyrLys: 5.037 ± 1.081
TyrLeu: 3.181 ± 0.593
TyrMet: 1.326 ± 0.59
TyrAsn: 4.772 ± 0.828
TyrPro: 1.06 ± 0.784
TyrGln: 3.446 ± 0.979
TyrArg: 4.242 ± 1.021
TyrSer: 1.856 ± 0.593
TyrThr: 2.121 ± 0.604
TyrVal: 1.591 ± 0.65
TyrTrp: 0.53 ± 0.527
TyrTyr: 3.181 ± 0.934
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 25 proteins (3773 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
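Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the 100 estimates is reported as the error. This is only an illustration under stated assumptions. The function names, the toy sequences, the fixed seed and the choice of the sample standard deviation as the error measure are all hypothetical; the page does not say which estimator the database uses.

```python
import random
import statistics

def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide in per mille, normalised by total residues (assumption)."""
    total = sum(len(seq) for seq in proteins)
    hits = sum(
        1
        for seq in proteins
        for i in range(len(seq) - 1)
        if seq[i:i + 2] == pair
    )
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size fixed.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)

# Hypothetical toy proteome (one-letter codes), not Javan154 data.
toy = ["MKELLK", "AKEL", "MSTKEV", "GGLLKE"]
print(pair_per_mille(toy, "KE"), bootstrap_error(toy, "KE"))
```

A dipeptide that never occurs gets a frequency of 0 in every resample, which is consistent with the "0.0 ± 0.0" entries in the table.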

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski