Amino acid dipeptide frequencies for Streptococcus satellite phage Javan375

Apart from single amino acid frequencies, one can also calculate so-called amino acid dipeptide frequencies.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
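The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline: the function names are hypothetical, and normalizing by the total number of dipeptides is an assumption (the source does not state which denominator is used). Any letter outside the 20 standard one-letter codes is mapped to X (Xaa).

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all given proteins.

    Assumption: frequencies are normalized by the total dipeptide count.
    """
    counts = Counter()
    for seq in proteins:
        # Map every non-standard or unknown letter to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Count overlapping pairs; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def expected_permille(n_letters=20):
    """Uniform expectation: 1000 / 400 = 2.5 per mille for 20 letters."""
    return 1000.0 / n_letters ** 2


def classify(freq_permille, n_letters=20):
    """Label a frequency relative to the uniform expectation."""
    expected = expected_permille(n_letters)
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


if __name__ == "__main__":
    freqs = dipeptide_permille(["MKELLK", "MKEB"])  # toy sequences, B -> X
    for dp, f in sorted(freqs.items()):
        print(dp, round(f, 3), classify(f))
```

With this labeling, a table value such as AlaAsp (3.394‰) would count as overrepresented and AlaCys (0.339‰) as underrepresented relative to 2.5‰.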

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
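As a small worked example of the unit conversion, the snippet below turns a per mille value into a fraction and a percentage. The helper names are illustrative only; the input is the AlaAsp entry from the table (3.394 ± 1.036‰).

```python
def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


if __name__ == "__main__":
    ala_asp, ala_asp_err = 3.394, 1.036  # AlaAsp entry, in per mille
    print(permille_to_fraction(ala_asp))  # about 0.003394
    print(permille_to_percent(ala_asp))   # about 0.3394 (%)
    print(permille_to_fraction(ala_asp_err))  # error converts the same way
```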

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
0.0AlaAla: 0.0 ± 0.0
0.339AlaCys: 0.339 ± 0.298
3.394AlaAsp: 3.394 ± 1.036
5.431AlaGlu: 5.431 ± 1.582
3.394AlaPhe: 3.394 ± 0.973
1.697AlaGly: 1.697 ± 0.816
1.358AlaHis: 1.358 ± 0.705
3.734AlaIle: 3.734 ± 1.101
4.073AlaLys: 4.073 ± 0.859
4.413AlaLeu: 4.413 ± 0.989
1.697AlaMet: 1.697 ± 1.03
3.394AlaAsn: 3.394 ± 0.739
1.018AlaPro: 1.018 ± 0.422
2.716AlaGln: 2.716 ± 0.955
3.055AlaArg: 3.055 ± 1.067
2.716AlaSer: 2.716 ± 0.656
2.037AlaThr: 2.037 ± 0.66
3.394AlaVal: 3.394 ± 0.939
1.018AlaTrp: 1.018 ± 0.407
3.734AlaTyr: 3.734 ± 0.998
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.339CysCys: 0.339 ± 0.269
1.018CysAsp: 1.018 ± 0.394
0.0CysGlu: 0.0 ± 0.0
0.679CysPhe: 0.679 ± 0.439
0.339CysGly: 0.339 ± 0.269
0.0CysHis: 0.0 ± 0.0
0.339CysIle: 0.339 ± 0.269
0.0CysLys: 0.0 ± 0.0
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.679CysAsn: 0.679 ± 0.375
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.339CysArg: 0.339 ± 0.312
0.0CysSer: 0.0 ± 0.0
0.339CysThr: 0.339 ± 0.313
0.339CysVal: 0.339 ± 0.298
0.0CysTrp: 0.0 ± 0.0
1.018CysTyr: 1.018 ± 0.531
0.0CysXaa: 0.0 ± 0.0
Asp
1.358AspAla: 1.358 ± 0.618
1.018AspCys: 1.018 ± 0.457
2.376AspAsp: 2.376 ± 0.933
3.394AspGlu: 3.394 ± 1.064
4.073AspPhe: 4.073 ± 1.103
3.394AspGly: 3.394 ± 0.903
0.679AspHis: 0.679 ± 0.498
4.413AspIle: 4.413 ± 1.477
4.752AspLys: 4.752 ± 0.778
7.807AspLeu: 7.807 ± 1.285
0.679AspMet: 0.679 ± 0.459
4.413AspAsn: 4.413 ± 1.316
2.376AspPro: 2.376 ± 0.672
0.339AspGln: 0.339 ± 0.298
3.394AspArg: 3.394 ± 0.934
3.055AspSer: 3.055 ± 0.628
1.018AspThr: 1.018 ± 0.564
1.358AspVal: 1.358 ± 0.541
0.0AspTrp: 0.0 ± 0.0
6.11AspTyr: 6.11 ± 1.09
0.0AspXaa: 0.0 ± 0.0
Glu
4.073GluAla: 4.073 ± 1.017
0.0GluCys: 0.0 ± 0.0
5.092GluAsp: 5.092 ± 1.105
8.826GluGlu: 8.826 ± 2.179
3.055GluPhe: 3.055 ± 0.902
1.358GluGly: 1.358 ± 0.665
1.358GluHis: 1.358 ± 0.399
5.431GluIle: 5.431 ± 1.303
9.844GluLys: 9.844 ± 1.545
9.844GluLeu: 9.844 ± 1.31
1.697GluMet: 1.697 ± 0.65
5.092GluAsn: 5.092 ± 1.556
2.376GluPro: 2.376 ± 0.943
6.11GluGln: 6.11 ± 1.627
4.073GluArg: 4.073 ± 0.996
7.128GluSer: 7.128 ± 0.899
5.771GluThr: 5.771 ± 1.23
4.752GluVal: 4.752 ± 1.065
1.358GluTrp: 1.358 ± 0.608
2.037GluTyr: 2.037 ± 0.861
0.0GluXaa: 0.0 ± 0.0
Phe
1.697PheAla: 1.697 ± 0.688
0.0PheCys: 0.0 ± 0.0
2.037PheAsp: 2.037 ± 0.69
2.716PheGlu: 2.716 ± 1.17
1.697PhePhe: 1.697 ± 0.509
2.716PheGly: 2.716 ± 0.646
1.018PheHis: 1.018 ± 0.501
4.073PheIle: 4.073 ± 1.311
3.734PheLys: 3.734 ± 0.887
5.771PheLeu: 5.771 ± 1.382
1.358PheMet: 1.358 ± 0.672
1.697PheAsn: 1.697 ± 0.476
1.018PhePro: 1.018 ± 0.564
2.037PheGln: 2.037 ± 0.742
1.358PheArg: 1.358 ± 0.712
2.716PheSer: 2.716 ± 1.071
1.358PheThr: 1.358 ± 0.548
4.073PheVal: 4.073 ± 1.111
0.339PheTrp: 0.339 ± 0.326
2.716PheTyr: 2.716 ± 0.762
0.0PheXaa: 0.0 ± 0.0
Gly
2.376GlyAla: 2.376 ± 1.83
0.679GlyCys: 0.679 ± 0.409
1.697GlyAsp: 1.697 ± 0.57
2.037GlyGlu: 2.037 ± 0.629
4.073GlyPhe: 4.073 ± 0.909
4.073GlyGly: 4.073 ± 1.188
0.679GlyHis: 0.679 ± 0.38
3.734GlyIle: 3.734 ± 0.783
3.394GlyLys: 3.394 ± 1.223
4.752GlyLeu: 4.752 ± 1.142
1.358GlyMet: 1.358 ± 0.641
1.018GlyAsn: 1.018 ± 0.488
0.679GlyPro: 0.679 ± 0.443
2.037GlyGln: 2.037 ± 0.713
0.339GlyArg: 0.339 ± 0.316
2.716GlySer: 2.716 ± 0.745
1.697GlyThr: 1.697 ± 0.572
3.055GlyVal: 3.055 ± 1.28
0.0GlyTrp: 0.0 ± 0.0
2.376GlyTyr: 2.376 ± 0.85
0.0GlyXaa: 0.0 ± 0.0
His
0.679HisAla: 0.679 ± 0.409
0.339HisCys: 0.339 ± 0.336
1.018HisAsp: 1.018 ± 0.507
1.358HisGlu: 1.358 ± 0.608
1.018HisPhe: 1.018 ± 0.684
2.037HisGly: 2.037 ± 0.7
0.339HisHis: 0.339 ± 0.313
1.018HisIle: 1.018 ± 0.407
0.679HisLys: 0.679 ± 0.373
1.018HisLeu: 1.018 ± 0.407
0.0HisMet: 0.0 ± 0.0
1.018HisAsn: 1.018 ± 0.653
0.679HisPro: 0.679 ± 0.37
0.679HisGln: 0.679 ± 0.451
1.358HisArg: 1.358 ± 0.558
1.018HisSer: 1.018 ± 0.456
2.037HisThr: 2.037 ± 0.924
1.018HisVal: 1.018 ± 0.404
0.0HisTrp: 0.0 ± 0.0
0.679HisTyr: 0.679 ± 0.425
0.0HisXaa: 0.0 ± 0.0
Ile
3.394IleAla: 3.394 ± 0.868
0.679IleCys: 0.679 ± 0.375
6.449IleAsp: 6.449 ± 1.567
8.147IleGlu: 8.147 ± 2.016
1.697IlePhe: 1.697 ± 0.741
2.716IleGly: 2.716 ± 0.736
0.679IleHis: 0.679 ± 0.625
6.449IleIle: 6.449 ± 1.349
7.807IleLys: 7.807 ± 1.526
8.147IleLeu: 8.147 ± 1.059
1.358IleMet: 1.358 ± 0.565
3.394IleAsn: 3.394 ± 0.832
1.358IlePro: 1.358 ± 0.738
2.376IleGln: 2.376 ± 0.747
2.037IleArg: 2.037 ± 0.8
6.789IleSer: 6.789 ± 2.164
5.092IleThr: 5.092 ± 1.469
4.413IleVal: 4.413 ± 0.917
0.0IleTrp: 0.0 ± 0.0
2.716IleTyr: 2.716 ± 1.148
0.0IleXaa: 0.0 ± 0.0
Lys
4.073LysAla: 4.073 ± 0.881
0.0LysCys: 0.0 ± 0.0
4.073LysAsp: 4.073 ± 0.801
10.862LysGlu: 10.862 ± 1.958
1.697LysPhe: 1.697 ± 0.572
2.376LysGly: 2.376 ± 0.702
2.716LysHis: 2.716 ± 1.098
6.449LysIle: 6.449 ± 1.523
8.147LysLys: 8.147 ± 1.377
7.468LysLeu: 7.468 ± 1.328
3.055LysMet: 3.055 ± 1.446
4.752LysAsn: 4.752 ± 1.428
3.055LysPro: 3.055 ± 0.857
4.413LysGln: 4.413 ± 1.39
6.11LysArg: 6.11 ± 1.129
7.128LysSer: 7.128 ± 1.176
8.147LysThr: 8.147 ± 2.443
5.431LysVal: 5.431 ± 1.466
0.339LysTrp: 0.339 ± 0.417
3.055LysTyr: 3.055 ± 1.363
0.0LysXaa: 0.0 ± 0.0
Leu
8.147LeuAla: 8.147 ± 1.484
0.339LeuCys: 0.339 ± 0.298
6.11LeuAsp: 6.11 ± 1.51
10.523LeuGlu: 10.523 ± 2.082
3.734LeuPhe: 3.734 ± 0.923
4.073LeuGly: 4.073 ± 0.913
0.679LeuHis: 0.679 ± 0.409
6.449LeuIle: 6.449 ± 1.677
9.844LeuLys: 9.844 ± 1.07
9.844LeuLeu: 9.844 ± 1.901
2.037LeuMet: 2.037 ± 0.735
5.771LeuAsn: 5.771 ± 1.215
3.394LeuPro: 3.394 ± 1.647
4.073LeuGln: 4.073 ± 0.708
3.394LeuArg: 3.394 ± 1.286
8.486LeuSer: 8.486 ± 1.63
4.752LeuThr: 4.752 ± 0.94
4.073LeuVal: 4.073 ± 1.424
0.339LeuTrp: 0.339 ± 0.312
3.734LeuTyr: 3.734 ± 0.983
0.0LeuXaa: 0.0 ± 0.0
Met
1.697MetAla: 1.697 ± 0.635
0.339MetCys: 0.339 ± 0.387
1.697MetAsp: 1.697 ± 0.631
2.716MetGlu: 2.716 ± 0.755
0.679MetPhe: 0.679 ± 0.509
0.339MetGly: 0.339 ± 0.298
0.0MetHis: 0.0 ± 0.0
2.037MetIle: 2.037 ± 0.769
1.697MetLys: 1.697 ± 0.725
1.358MetLeu: 1.358 ± 0.755
0.0MetMet: 0.0 ± 0.0
2.376MetAsn: 2.376 ± 1.002
0.0MetPro: 0.0 ± 0.0
1.697MetGln: 1.697 ± 0.455
1.358MetArg: 1.358 ± 0.626
1.358MetSer: 1.358 ± 0.57
3.055MetThr: 3.055 ± 1.302
0.679MetVal: 0.679 ± 0.627
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.394AsnAla: 3.394 ± 0.935
0.0AsnCys: 0.0 ± 0.0
3.055AsnAsp: 3.055 ± 1.025
3.734AsnGlu: 3.734 ± 0.771
2.037AsnPhe: 2.037 ± 0.613
5.092AsnGly: 5.092 ± 1.306
1.358AsnHis: 1.358 ± 0.758
3.734AsnIle: 3.734 ± 1.298
5.431AsnLys: 5.431 ± 0.868
5.431AsnLeu: 5.431 ± 0.939
2.037AsnMet: 2.037 ± 0.811
4.413AsnAsn: 4.413 ± 0.829
2.037AsnPro: 2.037 ± 0.533
2.716AsnGln: 2.716 ± 0.741
2.037AsnArg: 2.037 ± 0.607
3.055AsnSer: 3.055 ± 1.185
3.394AsnThr: 3.394 ± 1.054
2.037AsnVal: 2.037 ± 0.966
0.679AsnTrp: 0.679 ± 0.396
0.679AsnTyr: 0.679 ± 0.409
0.0AsnXaa: 0.0 ± 0.0
Pro
1.358ProAla: 1.358 ± 0.7
0.339ProCys: 0.339 ± 0.269
1.697ProAsp: 1.697 ± 0.787
2.376ProGlu: 2.376 ± 0.856
1.018ProPhe: 1.018 ± 0.677
0.339ProGly: 0.339 ± 0.336
0.0ProHis: 0.0 ± 0.0
1.697ProIle: 1.697 ± 0.716
2.716ProLys: 2.716 ± 0.708
3.394ProLeu: 3.394 ± 0.865
0.0ProMet: 0.0 ± 0.0
3.055ProAsn: 3.055 ± 1.015
0.679ProPro: 0.679 ± 0.449
0.679ProGln: 0.679 ± 0.493
1.018ProArg: 1.018 ± 0.489
1.018ProSer: 1.018 ± 0.601
2.376ProThr: 2.376 ± 0.879
0.679ProVal: 0.679 ± 0.393
0.679ProTrp: 0.679 ± 0.403
2.376ProTyr: 2.376 ± 0.706
0.0ProXaa: 0.0 ± 0.0
Gln
4.073GlnAla: 4.073 ± 1.478
0.0GlnCys: 0.0 ± 0.0
1.358GlnAsp: 1.358 ± 0.714
4.413GlnGlu: 4.413 ± 0.731
1.358GlnPhe: 1.358 ± 0.672
1.358GlnGly: 1.358 ± 0.695
1.018GlnHis: 1.018 ± 0.393
2.716GlnIle: 2.716 ± 0.835
5.092GlnLys: 5.092 ± 0.882
4.073GlnLeu: 4.073 ± 1.315
1.358GlnMet: 1.358 ± 0.695
2.376GlnAsn: 2.376 ± 0.808
0.679GlnPro: 0.679 ± 0.5
2.037GlnGln: 2.037 ± 0.706
0.679GlnArg: 0.679 ± 0.42
1.018GlnSer: 1.018 ± 0.479
2.376GlnThr: 2.376 ± 0.745
3.394GlnVal: 3.394 ± 1.095
0.339GlnTrp: 0.339 ± 0.336
2.716GlnTyr: 2.716 ± 0.832
0.0GlnXaa: 0.0 ± 0.0
Arg
2.716ArgAla: 2.716 ± 0.633
0.0ArgCys: 0.0 ± 0.0
2.037ArgAsp: 2.037 ± 0.643
3.055ArgGlu: 3.055 ± 0.82
2.376ArgPhe: 2.376 ± 0.739
0.679ArgGly: 0.679 ± 0.396
1.358ArgHis: 1.358 ± 0.547
3.055ArgIle: 3.055 ± 1.349
5.431ArgLys: 5.431 ± 1.187
4.752ArgLeu: 4.752 ± 1.386
0.679ArgMet: 0.679 ± 0.409
2.376ArgAsn: 2.376 ± 0.805
1.018ArgPro: 1.018 ± 0.752
1.358ArgGln: 1.358 ± 0.714
2.037ArgArg: 2.037 ± 0.698
1.358ArgSer: 1.358 ± 0.589
3.055ArgThr: 3.055 ± 0.836
2.716ArgVal: 2.716 ± 0.848
0.339ArgTrp: 0.339 ± 0.32
3.055ArgTyr: 3.055 ± 0.84
0.0ArgXaa: 0.0 ± 0.0
Ser
4.413SerAla: 4.413 ± 1.051
0.339SerCys: 0.339 ± 0.269
4.752SerAsp: 4.752 ± 0.706
7.128SerGlu: 7.128 ± 1.925
2.376SerPhe: 2.376 ± 0.732
1.358SerGly: 1.358 ± 0.66
1.018SerHis: 1.018 ± 0.436
5.771SerIle: 5.771 ± 0.914
7.128SerLys: 7.128 ± 1.108
6.789SerLeu: 6.789 ± 1.863
1.358SerMet: 1.358 ± 0.453
2.376SerAsn: 2.376 ± 0.602
2.376SerPro: 2.376 ± 0.79
2.376SerGln: 2.376 ± 1.087
2.037SerArg: 2.037 ± 0.884
2.716SerSer: 2.716 ± 0.771
3.734SerThr: 3.734 ± 0.883
3.394SerVal: 3.394 ± 1.028
0.339SerTrp: 0.339 ± 0.326
3.055SerTyr: 3.055 ± 0.873
0.0SerXaa: 0.0 ± 0.0
Thr
2.716ThrAla: 2.716 ± 0.691
0.0ThrCys: 0.0 ± 0.0
3.734ThrAsp: 3.734 ± 1.354
3.734ThrGlu: 3.734 ± 0.877
3.394ThrPhe: 3.394 ± 0.986
2.716ThrGly: 2.716 ± 0.779
1.697ThrHis: 1.697 ± 0.598
6.789ThrIle: 6.789 ± 1.396
4.413ThrLys: 4.413 ± 1.02
6.11ThrLeu: 6.11 ± 1.184
1.018ThrMet: 1.018 ± 0.576
2.716ThrAsn: 2.716 ± 1.023
1.358ThrPro: 1.358 ± 0.605
2.376ThrGln: 2.376 ± 1.012
4.413ThrArg: 4.413 ± 1.102
3.394ThrSer: 3.394 ± 0.804
3.394ThrThr: 3.394 ± 0.767
4.073ThrVal: 4.073 ± 1.416
0.0ThrTrp: 0.0 ± 0.0
1.697ThrTyr: 1.697 ± 0.708
0.0ThrXaa: 0.0 ± 0.0
Val
4.073ValAla: 4.073 ± 1.64
0.339ValCys: 0.339 ± 0.269
2.037ValAsp: 2.037 ± 1.153
3.394ValGlu: 3.394 ± 1.368
2.716ValPhe: 2.716 ± 0.864
3.734ValGly: 3.734 ± 1.195
0.679ValHis: 0.679 ± 0.392
3.055ValIle: 3.055 ± 0.873
4.413ValLys: 4.413 ± 0.832
4.073ValLeu: 4.073 ± 1.19
1.697ValMet: 1.697 ± 0.559
2.376ValAsn: 2.376 ± 0.839
2.376ValPro: 2.376 ± 0.687
1.358ValGln: 1.358 ± 0.756
1.697ValArg: 1.697 ± 0.731
5.092ValSer: 5.092 ± 1.557
4.413ValThr: 4.413 ± 1.171
3.734ValVal: 3.734 ± 1.38
1.018ValTrp: 1.018 ± 0.539
1.358ValTyr: 1.358 ± 0.577
0.0ValXaa: 0.0 ± 0.0
Trp
0.679TrpAla: 0.679 ± 0.443
0.0TrpCys: 0.0 ± 0.0
0.679TrpAsp: 0.679 ± 0.493
2.376TrpGlu: 2.376 ± 0.741
0.339TrpPhe: 0.339 ± 0.326
0.339TrpGly: 0.339 ± 0.316
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.679TrpLys: 0.679 ± 0.373
0.679TrpLeu: 0.679 ± 0.493
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.679TrpArg: 0.679 ± 0.439
0.679TrpSer: 0.679 ± 0.373
0.0TrpThr: 0.0 ± 0.0
0.339TrpVal: 0.339 ± 0.269
0.339TrpTrp: 0.339 ± 0.312
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.697TyrAla: 1.697 ± 0.721
0.339TyrCys: 0.339 ± 0.298
1.697TyrAsp: 1.697 ± 0.684
3.055TyrGlu: 3.055 ± 1.063
2.716TyrPhe: 2.716 ± 0.589
2.376TyrGly: 2.376 ± 0.742
1.358TyrHis: 1.358 ± 0.8
4.752TyrIle: 4.752 ± 1.258
4.073TyrLys: 4.073 ± 1.572
4.073TyrLeu: 4.073 ± 1.046
1.358TyrMet: 1.358 ± 0.578
3.055TyrAsn: 3.055 ± 0.762
1.018TyrPro: 1.018 ± 0.407
3.055TyrGln: 3.055 ± 1.243
2.037TyrArg: 2.037 ± 1.004
3.394TyrSer: 3.394 ± 1.168
1.697TyrThr: 1.697 ± 0.549
0.679TyrVal: 0.679 ± 0.53
0.679TyrTrp: 0.679 ± 0.379
1.018TyrTyr: 1.018 ± 0.553
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 19 proteins (2947 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
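A protein-level bootstrap of this kind could look roughly like the sketch below. It is an assumption-laden illustration rather than the database's actual procedure: the source only states that bootstrapping was done 100 times at the protein level, so resampling whole proteins with replacement and reporting the standard deviation of the replicate estimates are both assumptions, and all function names are hypothetical.

```python
import random
import statistics


def dipeptide_permille(proteins, dipeptide):
    """Per mille frequency of one dipeptide across a list of sequences."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Assumption: the error is the standard deviation over the replicates.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw whole proteins with replacement, same number as the original set.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, dipeptide))
    return statistics.pstdev(estimates)


if __name__ == "__main__":
    toy = ["MKELLKE", "MKKAEL", "MAELKE"]  # toy sequences, not real data
    print(dipeptide_permille(toy, "KE"), bootstrap_error(toy, "KE"))
```

Resampling whole proteins rather than individual dipeptides keeps pairs from the same protein together, so the estimate reflects variation between proteins in a small proteome such as this one (19 proteins).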

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski