Amino acid dipeptide frequency for Streptococcus satellite phage Javan606

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
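As an illustration, dipeptide frequencies could be computed along the following lines. This is a minimal sketch, not the code used by Proteome-pI: the function name `dipeptide_permille`, the one-letter input alphabet, and the choice to count overlapping pairs within each protein and normalize by the total number of pairs are all assumptions.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else is mapped to X (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumption: overlapping pairs are counted inside each protein (never across
    two proteins) and normalized by the total number of pairs.
    """
    counts = Counter()
    for seq in proteins:
        # Replace non-standard or unknown residues with X.
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping adjacent pairs: positions (0,1), (1,2), ...
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not the Javan606 proteome).
toy_freqs = dipeptide_permille(["MKKL", "AKKA"])
```

For the two toy sequences there are six pairs in total, two of which are `KK`, so `toy_freqs["KK"]` is one third of 1000.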

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 amino acids equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
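The uniform expectation and the over/under-representation call can be sketched as follows. The helper names are hypothetical, and the plain comparison against the uniform expectation (with no tolerance and no use of the error estimate) is an assumption; the page does not state the exact rule behind the colouring.

```python
def expected_uniform_permille(n_letters=20):
    """Expected per-mille frequency of one dipeptide if all pairs were equally likely."""
    return 1000.0 / (n_letters ** 2)


def classify(observed_permille, n_letters=20):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    expected = expected_uniform_permille(n_letters)
    if observed_permille > expected:
        return "over"   # would be shown in red
    if observed_permille < expected:
        return "under"  # would be shown in blue
    return "expected"


uniform_20 = expected_uniform_permille(20)  # 400 possible dipeptides
uniform_21 = expected_uniform_permille(21)  # 441 possible dipeptides, Xaa included
lyslys = classify(11.984)  # LysLys value from the table
alacys = classify(0.444)   # AlaCys value from the table
```

With 20 letters the expectation is 2.5‰, so LysLys (11.984‰) comes out as over-represented and AlaCys (0.444‰) as under-represented.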

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.0 ± 0.0
AlaCys: 0.444 ± 0.361
AlaAsp: 4.882 ± 1.338
AlaGlu: 3.107 ± 1.0
AlaPhe: 2.219 ± 1.166
AlaGly: 3.107 ± 1.085
AlaHis: 0.0 ± 0.0
AlaIle: 4.882 ± 1.107
AlaLys: 3.995 ± 1.484
AlaLeu: 3.995 ± 1.743
AlaMet: 0.888 ± 0.738
AlaAsn: 5.77 ± 1.821
AlaPro: 1.332 ± 0.614
AlaGln: 3.551 ± 1.305
AlaArg: 4.439 ± 1.092
AlaSer: 3.551 ± 1.093
AlaThr: 3.551 ± 1.107
AlaVal: 2.663 ± 0.815
AlaTrp: 0.444 ± 0.386
AlaTyr: 3.107 ± 0.941
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.888 ± 0.926
CysGlu: 0.0 ± 0.0
CysPhe: 0.444 ± 0.565
CysGly: 0.444 ± 0.361
CysHis: 0.444 ± 0.34
CysIle: 0.0 ± 0.0
CysLys: 0.444 ± 0.361
CysLeu: 0.444 ± 0.34
CysMet: 0.0 ± 0.0
CysAsn: 0.444 ± 0.361
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.444 ± 0.386
CysSer: 0.444 ± 0.588
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.444 ± 0.516
CysXaa: 0.0 ± 0.0
Asp
AspAla: 0.0 ± 0.0
AspCys: 0.444 ± 0.516
AspAsp: 2.219 ± 0.87
AspGlu: 5.326 ± 1.541
AspPhe: 3.107 ± 0.915
AspGly: 3.107 ± 1.356
AspHis: 0.888 ± 0.745
AspIle: 5.77 ± 2.043
AspLys: 5.326 ± 1.525
AspLeu: 3.995 ± 1.437
AspMet: 1.775 ± 1.17
AspAsn: 5.326 ± 1.811
AspPro: 1.332 ± 0.708
AspGln: 0.0 ± 0.0
AspArg: 1.332 ± 0.757
AspSer: 4.439 ± 1.552
AspThr: 2.663 ± 0.9
AspVal: 3.551 ± 1.138
AspTrp: 0.888 ± 0.722
AspTyr: 4.882 ± 2.001
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.77 ± 1.312
GluCys: 0.444 ± 0.555
GluAsp: 3.107 ± 1.372
GluGlu: 4.439 ± 1.319
GluPhe: 2.663 ± 1.067
GluGly: 3.107 ± 1.026
GluHis: 0.888 ± 0.735
GluIle: 3.107 ± 0.732
GluLys: 6.214 ± 1.834
GluLeu: 11.54 ± 1.442
GluMet: 1.332 ± 0.718
GluAsn: 3.107 ± 1.098
GluPro: 2.219 ± 1.217
GluGln: 4.439 ± 2.555
GluArg: 3.107 ± 1.45
GluSer: 3.995 ± 1.085
GluThr: 6.214 ± 1.882
GluVal: 3.107 ± 1.465
GluTrp: 0.444 ± 0.485
GluTyr: 3.107 ± 1.421
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.888 ± 0.448
PheCys: 0.0 ± 0.0
PheAsp: 3.551 ± 0.975
PheGlu: 3.551 ± 1.838
PhePhe: 0.0 ± 0.0
PheGly: 2.219 ± 0.938
PheHis: 0.444 ± 0.386
PheIle: 2.219 ± 0.953
PheLys: 3.551 ± 0.988
PheLeu: 4.882 ± 0.729
PheMet: 0.444 ± 0.634
PheAsn: 3.995 ± 1.999
PhePro: 0.0 ± 0.0
PheGln: 0.888 ± 0.539
PheArg: 1.775 ± 0.847
PheSer: 1.775 ± 0.735
PheThr: 1.332 ± 0.882
PheVal: 0.888 ± 0.683
PheTrp: 0.444 ± 0.361
PheTyr: 1.332 ± 0.553
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.219 ± 0.815
GlyCys: 0.888 ± 0.626
GlyAsp: 3.551 ± 1.757
GlyGlu: 3.995 ± 0.807
GlyPhe: 3.107 ± 0.822
GlyGly: 1.775 ± 0.65
GlyHis: 0.444 ± 0.386
GlyIle: 4.439 ± 1.218
GlyLys: 2.663 ± 1.357
GlyLeu: 5.77 ± 1.903
GlyMet: 0.444 ± 0.392
GlyAsn: 3.995 ± 1.763
GlyPro: 0.0 ± 0.0
GlyGln: 1.332 ± 0.726
GlyArg: 1.775 ± 0.648
GlySer: 1.332 ± 0.877
GlyThr: 3.551 ± 1.175
GlyVal: 6.214 ± 1.939
GlyTrp: 0.888 ± 0.722
GlyTyr: 6.214 ± 2.25
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.663 ± 1.065
HisCys: 0.444 ± 0.34
HisAsp: 0.444 ± 0.485
HisGlu: 2.219 ± 0.814
HisPhe: 1.332 ± 0.72
HisGly: 0.888 ± 0.449
HisHis: 0.0 ± 0.0
HisIle: 0.444 ± 0.386
HisLys: 0.888 ± 0.594
HisLeu: 0.888 ± 0.449
HisMet: 0.0 ± 0.0
HisAsn: 0.888 ± 0.817
HisPro: 0.444 ± 0.548
HisGln: 0.888 ± 0.69
HisArg: 0.444 ± 0.386
HisSer: 0.888 ± 0.594
HisThr: 2.219 ± 0.711
HisVal: 0.444 ± 0.386
HisTrp: 0.0 ± 0.0
HisTyr: 1.332 ± 0.704
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.551 ± 1.361
IleCys: 0.444 ± 0.565
IleAsp: 4.882 ± 1.447
IleGlu: 4.439 ± 0.995
IlePhe: 1.775 ± 0.731
IleGly: 2.663 ± 0.75
IleHis: 2.219 ± 0.865
IleIle: 3.107 ± 1.395
IleLys: 3.995 ± 0.888
IleLeu: 5.326 ± 1.297
IleMet: 1.332 ± 0.557
IleAsn: 3.107 ± 1.079
IlePro: 3.995 ± 1.899
IleGln: 2.219 ± 0.731
IleArg: 0.888 ± 0.68
IleSer: 2.663 ± 1.055
IleThr: 4.882 ± 1.083
IleVal: 3.551 ± 1.355
IleTrp: 0.888 ± 1.13
IleTyr: 3.995 ± 0.99
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.326 ± 1.744
LysCys: 0.0 ± 0.0
LysAsp: 3.551 ± 1.353
LysGlu: 7.545 ± 2.106
LysPhe: 3.551 ± 1.408
LysGly: 5.77 ± 1.076
LysHis: 3.551 ± 1.163
LysIle: 4.882 ± 1.16
LysLys: 11.984 ± 3.382
LysLeu: 9.321 ± 2.074
LysMet: 1.775 ± 0.711
LysAsn: 6.214 ± 1.397
LysPro: 3.995 ± 1.659
LysGln: 4.439 ± 1.244
LysArg: 7.989 ± 1.595
LysSer: 4.439 ± 1.237
LysThr: 3.995 ± 0.79
LysVal: 4.439 ± 1.53
LysTrp: 0.888 ± 0.687
LysTyr: 2.663 ± 0.676
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.433 ± 1.898
LeuCys: 0.444 ± 0.361
LeuAsp: 7.102 ± 1.598
LeuGlu: 8.877 ± 3.008
LeuPhe: 1.775 ± 0.881
LeuGly: 7.545 ± 1.926
LeuHis: 1.775 ± 0.775
LeuIle: 4.882 ± 1.055
LeuLys: 11.096 ± 2.285
LeuLeu: 8.877 ± 1.665
LeuMet: 2.219 ± 0.999
LeuAsn: 4.439 ± 1.846
LeuPro: 3.551 ± 1.107
LeuGln: 3.995 ± 1.326
LeuArg: 3.107 ± 1.792
LeuSer: 6.214 ± 1.249
LeuThr: 3.995 ± 1.458
LeuVal: 4.882 ± 1.365
LeuTrp: 0.888 ± 0.539
LeuTyr: 2.663 ± 0.867
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.332 ± 0.771
MetCys: 0.0 ± 0.0
MetAsp: 1.775 ± 0.7
MetGlu: 1.332 ± 0.862
MetPhe: 0.0 ± 0.0
MetGly: 0.888 ± 0.515
MetHis: 0.0 ± 0.0
MetIle: 0.888 ± 0.735
MetLys: 2.219 ± 1.334
MetLeu: 1.332 ± 1.034
MetMet: 0.888 ± 0.419
MetAsn: 1.775 ± 0.633
MetPro: 0.444 ± 0.34
MetGln: 0.444 ± 0.462
MetArg: 0.444 ± 0.34
MetSer: 1.775 ± 1.178
MetThr: 2.663 ± 1.072
MetVal: 2.219 ± 1.373
MetTrp: 0.444 ± 0.516
MetTyr: 0.888 ± 0.623
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.995 ± 1.316
AsnCys: 0.0 ± 0.0
AsnAsp: 3.551 ± 1.507
AsnGlu: 4.882 ± 1.077
AsnPhe: 0.444 ± 0.555
AsnGly: 5.326 ± 1.402
AsnHis: 0.888 ± 0.594
AsnIle: 2.663 ± 1.36
AsnLys: 6.658 ± 1.411
AsnLeu: 5.326 ± 1.507
AsnMet: 2.219 ± 0.971
AsnAsn: 4.439 ± 1.093
AsnPro: 3.551 ± 1.187
AsnGln: 0.888 ± 0.479
AsnArg: 1.775 ± 0.756
AsnSer: 6.214 ± 2.374
AsnThr: 5.326 ± 1.533
AsnVal: 3.551 ± 1.022
AsnTrp: 0.0 ± 0.0
AsnTyr: 3.551 ± 0.878
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.663 ± 0.741
ProCys: 0.0 ± 0.0
ProAsp: 1.332 ± 0.687
ProGlu: 1.332 ± 0.885
ProPhe: 2.663 ± 1.209
ProGly: 0.444 ± 0.485
ProHis: 0.444 ± 0.34
ProIle: 2.219 ± 0.771
ProLys: 4.439 ± 2.254
ProLeu: 2.219 ± 0.759
ProMet: 0.444 ± 0.462
ProAsn: 3.551 ± 2.41
ProPro: 2.663 ± 1.17
ProGln: 0.888 ± 0.758
ProArg: 1.775 ± 1.0
ProSer: 1.775 ± 1.012
ProThr: 1.775 ± 0.751
ProVal: 2.663 ± 1.128
ProTrp: 0.444 ± 0.361
ProTyr: 0.888 ± 0.449
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.107 ± 1.425
GlnCys: 0.0 ± 0.0
GlnAsp: 1.775 ± 1.218
GlnGlu: 5.326 ± 1.152
GlnPhe: 1.775 ± 0.828
GlnGly: 2.219 ± 0.908
GlnHis: 0.888 ± 0.773
GlnIle: 3.551 ± 1.26
GlnLys: 4.439 ± 1.504
GlnLeu: 1.332 ± 0.656
GlnMet: 0.444 ± 0.565
GlnAsn: 0.444 ± 0.361
GlnPro: 1.332 ± 0.838
GlnGln: 2.219 ± 1.144
GlnArg: 1.332 ± 0.745
GlnSer: 3.107 ± 1.065
GlnThr: 3.107 ± 1.176
GlnVal: 1.775 ± 0.637
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.332 ± 0.764
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.663 ± 1.332
ArgCys: 0.0 ± 0.0
ArgAsp: 1.775 ± 0.775
ArgGlu: 1.775 ± 0.925
ArgPhe: 0.444 ± 0.34
ArgGly: 1.775 ± 0.91
ArgHis: 0.888 ± 0.626
ArgIle: 3.107 ± 1.543
ArgLys: 2.219 ± 0.871
ArgLeu: 3.551 ± 0.902
ArgMet: 0.444 ± 0.496
ArgAsn: 3.995 ± 0.947
ArgPro: 1.775 ± 0.896
ArgGln: 3.551 ± 1.87
ArgArg: 1.332 ± 0.637
ArgSer: 0.444 ± 0.361
ArgThr: 3.551 ± 1.219
ArgVal: 4.439 ± 1.399
ArgTrp: 0.888 ± 0.621
ArgTyr: 2.663 ± 0.952
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.107 ± 1.119
SerCys: 0.0 ± 0.0
SerAsp: 4.439 ± 0.926
SerGlu: 3.107 ± 1.626
SerPhe: 1.332 ± 0.812
SerGly: 2.663 ± 0.854
SerHis: 0.444 ± 0.386
SerIle: 3.551 ± 1.218
SerLys: 7.102 ± 1.451
SerLeu: 5.77 ± 1.449
SerMet: 0.888 ± 0.419
SerAsn: 4.882 ± 1.936
SerPro: 1.332 ± 1.014
SerGln: 3.107 ± 1.303
SerArg: 1.332 ± 0.72
SerSer: 3.551 ± 2.333
SerThr: 4.439 ± 1.798
SerVal: 3.107 ± 1.536
SerTrp: 1.775 ± 1.356
SerTyr: 2.219 ± 1.31
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.551 ± 0.862
ThrCys: 0.0 ± 0.0
ThrAsp: 2.663 ± 1.559
ThrGlu: 1.775 ± 1.094
ThrPhe: 3.995 ± 2.139
ThrGly: 3.551 ± 1.473
ThrHis: 1.775 ± 0.828
ThrIle: 3.551 ± 1.849
ThrLys: 7.989 ± 1.697
ThrLeu: 7.102 ± 1.634
ThrMet: 3.551 ± 1.069
ThrAsn: 2.663 ± 1.179
ThrPro: 2.219 ± 0.971
ThrGln: 2.663 ± 1.677
ThrArg: 3.995 ± 1.48
ThrSer: 3.551 ± 1.408
ThrThr: 3.995 ± 1.417
ThrVal: 7.545 ± 1.271
ThrTrp: 0.444 ± 0.34
ThrTyr: 2.663 ± 1.026
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.326 ± 1.711
ValCys: 0.888 ± 0.657
ValAsp: 2.219 ± 0.884
ValGlu: 1.775 ± 0.953
ValPhe: 1.775 ± 0.789
ValGly: 3.551 ± 1.144
ValHis: 0.888 ± 0.449
ValIle: 3.107 ± 0.821
ValLys: 5.77 ± 1.441
ValLeu: 8.877 ± 1.558
ValMet: 0.444 ± 0.524
ValAsn: 2.663 ± 0.903
ValPro: 2.663 ± 1.013
ValGln: 1.332 ± 0.749
ValArg: 0.444 ± 0.361
ValSer: 4.439 ± 1.55
ValThr: 9.765 ± 2.063
ValVal: 4.882 ± 1.277
ValTrp: 0.0 ± 0.0
ValTyr: 1.332 ± 0.965
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 0.444 ± 0.386
TrpGlu: 0.888 ± 0.636
TrpPhe: 0.0 ± 0.0
TrpGly: 0.444 ± 0.485
TrpHis: 0.888 ± 0.515
TrpIle: 0.444 ± 0.361
TrpLys: 1.332 ± 0.652
TrpLeu: 1.775 ± 1.314
TrpMet: 0.444 ± 0.516
TrpAsn: 0.888 ± 0.652
TrpPro: 0.0 ± 0.0
TrpGln: 0.444 ± 0.361
TrpArg: 0.444 ± 0.361
TrpSer: 0.888 ± 0.449
TrpThr: 0.0 ± 0.0
TrpVal: 0.0 ± 0.0
TrpTrp: 0.444 ± 0.386
TrpTyr: 0.888 ± 0.515
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.332 ± 0.891
TyrCys: 0.444 ± 0.34
TyrAsp: 2.663 ± 0.786
TyrGlu: 6.658 ± 2.198
TyrPhe: 1.775 ± 1.094
TyrGly: 3.107 ± 0.913
TyrHis: 0.0 ± 0.0
TyrIle: 3.107 ± 1.001
TyrLys: 3.995 ± 1.684
TyrLeu: 4.439 ± 1.165
TyrMet: 1.332 ± 0.829
TyrAsn: 2.663 ± 1.375
TyrPro: 1.775 ± 0.76
TyrGln: 2.219 ± 1.123
TyrArg: 3.107 ± 1.024
TyrSer: 2.663 ± 1.073
TyrThr: 2.219 ± 0.919
TyrVal: 2.219 ± 1.143
TyrTrp: 0.444 ± 0.453
TyrTyr: 2.219 ± 1.004
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 14 proteins (2254 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
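A protein-level bootstrap of this kind could look like the sketch below. It is an illustration under stated assumptions rather than the Proteome-pI implementation: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread is reported as the population standard deviation of the replicates. The function names, the fixed seed, and the choice of standard deviation as the error measure are all hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Per-mille frequency of one dipeptide (one-letter code) in a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs within each protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Spread of the frequency over n_boot protein-level resamples.

    Assumption: the error is the population standard deviation of the replicates.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)


# Toy input (hypothetical sequences, not the Javan606 proteome).
toy_proteins = ["MKKLAV", "AKKAGG", "GGGGKL", "MLLKKE"]
kk_error = bootstrap_error(toy_proteins, "KK")
```

Resampling at the protein level, rather than shuffling individual residues, keeps each protein's internal composition intact, so a dipeptide concentrated in one or two proteins gets a correspondingly large error.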

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski