Amino acid dipeptide frequency for Streptococcus satellite phage Javan356

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
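As a rough illustration of what such a calculation involves, the sketch below counts ordered pairs of adjacent residues with a sliding window and reports them in per mille. The function name `dipeptide_frequencies`, the one-letter input sequences, and the choice not to count pairs that span two different proteins are assumptions of this sketch; the exact counting and normalisation used by the database may differ.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return per-mille frequencies of ordered adjacent residue pairs.

    `proteins` is an iterable of one-letter amino acid sequences.
    Pairs are counted with a sliding window of width 2 inside each
    protein; pairs spanning two proteins are not counted (assumption).
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences, not taken from the Javan356 proteome.
toy_proteins = ["MKEKL", "KEK"]
freqs = dipeptide_frequencies(toy_proteins)
```

Note that the order matters: "KE" (LysGlu) and "EK" (GluLys) are counted separately, which is why the table has a separate entry for each.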

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each one would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue, for better visibility.
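The comparison against the uniform expectation can be sketched as follows. The page does not state the exact rule behind its colouring, so the plain greater-than / less-than test against 1/400, and the names `EXPECTED_PER_MILLE` and `classify`, are assumptions made for illustration only. The three sample values are copied from the table.

```python
# 20 x 20 standard dipeptides -> each expected at 1/400 = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"    # would be marked in red
    if freq_per_mille < expected:
        return "under"   # would be marked in blue
    return "expected"


# Values in per mille, copied from the table.
observed = {"LysGlu": 14.366, "AlaAla": 0.399, "TrpTrp": 0.0}
labels = {pair: classify(value) for pair, value in observed.items()}
```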

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
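For readers who want fractions or percentages instead, the unit conversion is a single multiplication. The helper names below are hypothetical; the sample value is the LysGlu entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0


lys_glu = 14.366  # LysGlu, in per mille, copied from the table
fraction = per_mille_to_fraction(lys_glu)
percent = per_mille_to_percent(lys_glu)
```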

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.399 ± 0.304
AlaCys: 0.399 ± 0.391
AlaAsp: 2.394 ± 0.925
AlaGlu: 2.793 ± 0.863
AlaPhe: 2.394 ± 0.985
AlaGly: 1.596 ± 0.715
AlaHis: 0.399 ± 0.391
AlaIle: 1.995 ± 0.886
AlaLys: 6.784 ± 1.613
AlaLeu: 4.389 ± 1.674
AlaMet: 1.596 ± 0.942
AlaAsn: 3.591 ± 0.912
AlaPro: 0.399 ± 0.363
AlaGln: 0.798 ± 0.502
AlaArg: 3.192 ± 0.88
AlaSer: 1.995 ± 1.258
AlaThr: 3.192 ± 0.745
AlaVal: 2.394 ± 0.774
AlaTrp: 0.0 ± 0.0
AlaTyr: 3.192 ± 1.036
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.399 ± 0.391
CysCys: 0.0 ± 0.0
CysAsp: 0.798 ± 0.46
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 1.197 ± 0.597
CysHis: 0.0 ± 0.0
CysIle: 0.399 ± 0.304
CysLys: 0.798 ± 0.508
CysLeu: 1.197 ± 0.683
CysMet: 0.399 ± 0.374
CysAsn: 0.0 ± 0.0
CysPro: 0.399 ± 0.413
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 0.399 ± 0.391
CysThr: 0.798 ± 0.468
CysVal: 0.399 ± 0.413
CysTrp: 0.0 ± 0.0
CysTyr: 0.399 ± 0.391
CysXaa: 0.0 ± 0.0
Asp
AspAla: 0.399 ± 0.31
AspCys: 0.798 ± 0.47
AspAsp: 3.99 ± 1.236
AspGlu: 1.995 ± 0.546
AspPhe: 4.389 ± 1.498
AspGly: 4.389 ± 1.04
AspHis: 0.399 ± 0.33
AspIle: 7.582 ± 2.088
AspLys: 9.178 ± 1.792
AspLeu: 7.981 ± 1.558
AspMet: 1.197 ± 0.645
AspAsn: 3.99 ± 0.993
AspPro: 0.399 ± 0.427
AspGln: 0.798 ± 0.448
AspArg: 1.596 ± 0.795
AspSer: 3.99 ± 0.962
AspThr: 3.192 ± 1.172
AspVal: 3.99 ± 1.215
AspTrp: 0.399 ± 0.413
AspTyr: 3.192 ± 1.013
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.394 ± 0.94
GluCys: 1.197 ± 0.534
GluAsp: 6.385 ± 1.478
GluGlu: 3.99 ± 1.079
GluPhe: 3.99 ± 0.901
GluGly: 2.793 ± 1.119
GluHis: 1.197 ± 0.521
GluIle: 7.183 ± 1.713
GluLys: 12.37 ± 2.288
GluLeu: 7.981 ± 1.954
GluMet: 1.596 ± 0.641
GluAsn: 4.389 ± 1.398
GluPro: 0.798 ± 0.502
GluGln: 4.389 ± 1.109
GluArg: 2.793 ± 1.073
GluSer: 3.591 ± 1.098
GluThr: 5.188 ± 1.615
GluVal: 2.394 ± 1.03
GluTrp: 0.798 ± 0.428
GluTyr: 3.99 ± 1.131
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.197 ± 0.655
PheCys: 0.399 ± 0.413
PheAsp: 4.389 ± 0.966
PheGlu: 4.789 ± 1.075
PhePhe: 2.793 ± 0.917
PheGly: 2.394 ± 0.766
PheHis: 0.399 ± 0.33
PheIle: 1.995 ± 0.913
PheLys: 5.188 ± 1.843
PheLeu: 5.188 ± 1.366
PheMet: 0.798 ± 0.512
PheAsn: 1.596 ± 0.921
PhePro: 0.798 ± 0.607
PheGln: 0.798 ± 0.529
PheArg: 0.798 ± 0.473
PheSer: 4.789 ± 1.32
PheThr: 2.394 ± 0.775
PheVal: 3.591 ± 1.128
PheTrp: 0.798 ± 0.428
PheTyr: 0.798 ± 0.523
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.596 ± 0.978
GlyCys: 0.399 ± 0.33
GlyAsp: 1.596 ± 0.817
GlyGlu: 1.596 ± 0.978
GlyPhe: 2.793 ± 0.891
GlyGly: 3.192 ± 1.698
GlyHis: 0.399 ± 0.33
GlyIle: 2.394 ± 0.736
GlyLys: 6.784 ± 2.157
GlyLeu: 4.389 ± 1.371
GlyMet: 0.798 ± 0.509
GlyAsn: 2.793 ± 1.053
GlyPro: 0.0 ± 0.0
GlyGln: 1.197 ± 0.576
GlyArg: 0.798 ± 0.473
GlySer: 3.99 ± 1.737
GlyThr: 2.793 ± 0.94
GlyVal: 2.394 ± 0.885
GlyTrp: 0.798 ± 0.568
GlyTyr: 4.389 ± 1.324
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.793 ± 1.317
HisCys: 0.0 ± 0.0
HisAsp: 0.399 ± 0.391
HisGlu: 1.197 ± 0.609
HisPhe: 0.0 ± 0.0
HisGly: 0.0 ± 0.0
HisHis: 0.399 ± 0.363
HisIle: 0.798 ± 0.448
HisLys: 1.995 ± 0.835
HisLeu: 1.995 ± 1.054
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 0.399 ± 0.464
HisGln: 1.596 ± 0.863
HisArg: 0.399 ± 0.391
HisSer: 1.197 ± 0.788
HisThr: 2.793 ± 0.934
HisVal: 0.399 ± 0.37
HisTrp: 0.0 ± 0.0
HisTyr: 0.798 ± 0.666
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.995 ± 0.647
IleCys: 0.399 ± 0.391
IleAsp: 6.385 ± 1.41
IleGlu: 3.591 ± 1.466
IlePhe: 2.793 ± 0.958
IleGly: 2.793 ± 0.712
IleHis: 1.197 ± 0.567
IleIle: 4.789 ± 1.411
IleLys: 8.38 ± 1.411
IleLeu: 6.784 ± 0.827
IleMet: 0.399 ± 0.493
IleAsn: 5.587 ± 1.37
IlePro: 2.394 ± 0.6
IleGln: 3.99 ± 1.236
IleArg: 3.192 ± 1.134
IleSer: 5.986 ± 1.389
IleThr: 3.192 ± 1.366
IleVal: 2.394 ± 1.16
IleTrp: 0.399 ± 0.391
IleTyr: 2.394 ± 0.67
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.587 ± 1.639
LysCys: 0.399 ± 0.417
LysAsp: 6.385 ± 1.26
LysGlu: 14.366 ± 2.424
LysPhe: 2.793 ± 1.094
LysGly: 5.188 ± 1.764
LysHis: 3.591 ± 0.821
LysIle: 9.976 ± 2.016
LysLys: 9.577 ± 1.902
LysLeu: 5.587 ± 0.994
LysMet: 3.99 ± 1.473
LysAsn: 6.385 ± 1.867
LysPro: 1.995 ± 0.82
LysGln: 7.183 ± 1.661
LysArg: 7.981 ± 1.164
LysSer: 6.385 ± 1.745
LysThr: 8.38 ± 2.001
LysVal: 6.784 ± 1.227
LysTrp: 0.798 ± 0.528
LysTyr: 4.389 ± 1.262
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.188 ± 1.143
LeuCys: 0.798 ± 0.494
LeuAsp: 7.582 ± 1.629
LeuGlu: 11.173 ± 1.91
LeuPhe: 4.789 ± 1.182
LeuGly: 6.385 ± 1.038
LeuHis: 2.394 ± 1.205
LeuIle: 6.784 ± 1.584
LeuLys: 9.577 ± 1.647
LeuLeu: 5.188 ± 1.18
LeuMet: 2.394 ± 0.724
LeuAsn: 4.789 ± 1.676
LeuPro: 1.995 ± 0.781
LeuGln: 3.99 ± 0.731
LeuArg: 2.394 ± 0.973
LeuSer: 5.986 ± 1.03
LeuThr: 5.587 ± 1.331
LeuVal: 6.385 ± 1.603
LeuTrp: 0.399 ± 0.304
LeuTyr: 1.596 ± 0.674
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.197 ± 0.621
MetCys: 0.0 ± 0.0
MetAsp: 3.591 ± 1.117
MetGlu: 2.394 ± 1.164
MetPhe: 0.798 ± 0.567
MetGly: 0.399 ± 0.516
MetHis: 0.0 ± 0.0
MetIle: 0.798 ± 0.582
MetLys: 1.995 ± 0.609
MetLeu: 1.995 ± 0.596
MetMet: 0.399 ± 0.304
MetAsn: 1.197 ± 0.696
MetPro: 0.399 ± 0.304
MetGln: 0.798 ± 0.6
MetArg: 1.995 ± 0.713
MetSer: 0.399 ± 0.391
MetThr: 2.793 ± 1.43
MetVal: 1.596 ± 0.828
MetTrp: 0.0 ± 0.0
MetTyr: 0.399 ± 0.391
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.99 ± 0.762
AsnCys: 0.399 ± 0.33
AsnAsp: 1.995 ± 0.747
AsnGlu: 1.995 ± 0.93
AsnPhe: 3.99 ± 0.766
AsnGly: 3.192 ± 1.229
AsnHis: 1.995 ± 0.574
AsnIle: 2.394 ± 1.142
AsnLys: 5.188 ± 1.268
AsnLeu: 7.183 ± 1.417
AsnMet: 1.197 ± 0.963
AsnAsn: 3.591 ± 1.008
AsnPro: 2.793 ± 1.169
AsnGln: 2.793 ± 1.095
AsnArg: 1.596 ± 0.863
AsnSer: 2.394 ± 1.033
AsnThr: 5.587 ± 1.531
AsnVal: 1.995 ± 1.29
AsnTrp: 0.0 ± 0.0
AsnTyr: 3.99 ± 0.995
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.399 ± 0.33
ProCys: 0.0 ± 0.0
ProAsp: 0.798 ± 0.402
ProGlu: 1.995 ± 0.751
ProPhe: 0.798 ± 0.402
ProGly: 0.399 ± 0.304
ProHis: 0.0 ± 0.0
ProIle: 2.394 ± 0.917
ProLys: 3.591 ± 0.882
ProLeu: 0.399 ± 0.31
ProMet: 0.0 ± 0.0
ProAsn: 1.197 ± 0.593
ProPro: 0.399 ± 0.415
ProGln: 1.197 ± 0.889
ProArg: 0.798 ± 0.523
ProSer: 1.596 ± 0.894
ProThr: 0.399 ± 0.304
ProVal: 0.798 ± 0.473
ProTrp: 0.0 ± 0.0
ProTyr: 0.798 ± 0.468
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.188 ± 1.466
GlnCys: 0.798 ± 0.508
GlnAsp: 1.995 ± 0.818
GlnGlu: 3.99 ± 0.984
GlnPhe: 1.197 ± 0.9
GlnGly: 0.399 ± 0.374
GlnHis: 1.197 ± 0.724
GlnIle: 1.596 ± 0.48
GlnLys: 3.99 ± 0.988
GlnLeu: 2.394 ± 0.883
GlnMet: 0.798 ± 0.809
GlnAsn: 1.596 ± 0.779
GlnPro: 0.0 ± 0.0
GlnGln: 1.596 ± 0.693
GlnArg: 2.394 ± 0.863
GlnSer: 1.197 ± 0.615
GlnThr: 3.591 ± 0.941
GlnVal: 4.389 ± 1.037
GlnTrp: 1.197 ± 0.664
GlnTyr: 1.596 ± 0.554
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.596 ± 0.725
ArgCys: 0.0 ± 0.0
ArgAsp: 3.192 ± 0.863
ArgGlu: 3.591 ± 1.013
ArgPhe: 2.793 ± 1.096
ArgGly: 1.995 ± 0.831
ArgHis: 0.798 ± 0.448
ArgIle: 1.995 ± 0.791
ArgLys: 8.38 ± 1.308
ArgLeu: 3.99 ± 1.02
ArgMet: 0.399 ± 0.304
ArgAsn: 3.591 ± 0.796
ArgPro: 0.399 ± 0.304
ArgGln: 1.596 ± 0.69
ArgArg: 1.596 ± 0.779
ArgSer: 2.394 ± 1.005
ArgThr: 2.394 ± 1.054
ArgVal: 0.798 ± 0.659
ArgTrp: 0.0 ± 0.0
ArgTyr: 1.995 ± 0.637
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 1.197 ± 0.651
SerCys: 0.0 ± 0.0
SerAsp: 3.591 ± 0.784
SerGlu: 6.784 ± 1.575
SerPhe: 3.192 ± 1.018
SerGly: 3.192 ± 1.268
SerHis: 1.995 ± 0.897
SerIle: 2.793 ± 1.103
SerLys: 7.183 ± 2.064
SerLeu: 6.385 ± 1.779
SerMet: 2.793 ± 0.894
SerAsn: 1.995 ± 0.84
SerPro: 1.995 ± 0.683
SerGln: 1.596 ± 0.819
SerArg: 1.197 ± 0.557
SerSer: 4.389 ± 1.142
SerThr: 3.591 ± 0.836
SerVal: 3.591 ± 1.218
SerTrp: 0.798 ± 0.537
SerTyr: 2.793 ± 0.874
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.389 ± 1.451
ThrCys: 0.399 ± 0.304
ThrAsp: 1.197 ± 0.689
ThrGlu: 3.99 ± 1.275
ThrPhe: 2.394 ± 0.766
ThrGly: 4.389 ± 1.062
ThrHis: 0.798 ± 0.523
ThrIle: 5.188 ± 1.492
ThrLys: 5.587 ± 1.268
ThrLeu: 9.178 ± 1.366
ThrMet: 1.197 ± 0.615
ThrAsn: 3.99 ± 1.044
ThrPro: 0.798 ± 0.473
ThrGln: 2.394 ± 1.142
ThrArg: 4.389 ± 1.625
ThrSer: 3.591 ± 1.276
ThrThr: 3.192 ± 0.935
ThrVal: 3.99 ± 1.361
ThrTrp: 0.798 ± 0.441
ThrTyr: 2.793 ± 1.67
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.995 ± 0.888
ValCys: 0.399 ± 0.413
ValAsp: 4.789 ± 1.161
ValGlu: 4.789 ± 1.292
ValPhe: 1.197 ± 0.569
ValGly: 0.399 ± 0.363
ValHis: 0.0 ± 0.0
ValIle: 3.591 ± 1.227
ValLys: 6.385 ± 1.403
ValLeu: 5.587 ± 1.16
ValMet: 0.798 ± 0.478
ValAsn: 4.389 ± 1.419
ValPro: 1.197 ± 0.706
ValGln: 1.596 ± 0.615
ValArg: 1.596 ± 0.618
ValSer: 3.99 ± 1.381
ValThr: 3.99 ± 1.047
ValVal: 2.793 ± 0.897
ValTrp: 0.0 ± 0.0
ValTyr: 3.591 ± 0.86
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.798 ± 0.473
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 1.596 ± 0.954
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.798 ± 0.488
TrpLys: 0.399 ± 0.304
TrpLeu: 1.197 ± 0.667
TrpMet: 0.798 ± 0.62
TrpAsn: 0.399 ± 0.37
TrpPro: 0.0 ± 0.0
TrpGln: 0.399 ± 0.33
TrpArg: 0.0 ± 0.0
TrpSer: 0.399 ± 0.33
TrpThr: 0.0 ± 0.0
TrpVal: 0.798 ± 0.51
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.197 ± 0.856
TyrCys: 0.798 ± 0.782
TyrAsp: 2.394 ± 0.74
TyrGlu: 2.793 ± 0.865
TyrPhe: 2.793 ± 0.67
TyrGly: 0.798 ± 0.473
TyrHis: 0.0 ± 0.0
TyrIle: 3.591 ± 0.999
TyrLys: 4.389 ± 1.276
TyrLeu: 6.385 ± 0.934
TyrMet: 1.197 ± 0.711
TyrAsn: 3.192 ± 0.743
TyrPro: 0.399 ± 0.391
TyrGln: 2.394 ± 0.687
TyrArg: 4.789 ± 1.384
TyrSer: 2.394 ± 0.958
TyrThr: 1.596 ± 0.693
TyrVal: 1.197 ± 0.794
TyrTrp: 0.399 ± 0.413
TyrTyr: 1.995 ± 1.243
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 15 proteins (2507 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
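One way to picture a protein-level bootstrap is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed on each resample, and the spread of the 100 estimates is reported. Taking the sample standard deviation as the error, the function names, the fixed seed, and the toy sequences are all assumptions of this sketch; the page does not describe the exact procedure.

```python
import random
import statistics


def pair_frequency(proteins, pair):
    """Per-mille frequency of one dipeptide over a list of sequences."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_resamples):
        # Draw as many whole proteins as in the original set, with replacement.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy sequences (one-letter codes), not the Javan356 proteome.
toy_proteins = ["MKEKL", "KEKEA", "MLLKE", "AAKL"]
error = bootstrap_error(toy_proteins, "KE")
```

Under this scheme a dipeptide that occurs in none of the proteins has the same estimate (zero) in every resample, which is consistent with the 0.0 ± 0.0 entries in the table.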

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski