Amino acid dipeptide frequency for Streptococcus satellite phage Javan24

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred uniformly at random in proteins, each one would therefore be present at a frequency of 0.25% (2.5‰), or about 0.23% (2.27‰) when Xaa is included. This is not the case in nature, so for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
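The counting described above can be sketched in a few lines of Python. This is only a minimal illustration, not the database's own code: the function names, the toy sequences, and the choice to normalize by the total number of dipeptides are all assumptions made for the example.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes


def dipeptide_counts(proteins):
    """Count overlapping dipeptides inside each protein, never across proteins."""
    counts = Counter()
    for seq in proteins:
        # Anything outside the 20 standard residues is mapped to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    return counts


def per_mille(counts):
    """Convert raw counts to per mille (assumed normalization: all dipeptides)."""
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def representation(freq_per_mille, expected_per_mille):
    """Label a dipeptide relative to the uniform expectation."""
    if freq_per_mille > expected_per_mille:
        return "over"
    if freq_per_mille < expected_per_mille:
        return "under"
    return "as expected"


uniform_400 = 1000.0 / 400  # 2.5 per mille, 20 x 20 dipeptides
uniform_441 = 1000.0 / 441  # about 2.27 per mille, 21 x 21 with Xaa

toy_proteins = ["MKKLAE", "MKBLA"]  # hypothetical sequences; B becomes X
counts = dipeptide_counts(toy_proteins)
freqs = per_mille(counts)

# Values taken from the table: GluLeu 9.793 and AlaAla 0.272 (per mille).
print(representation(9.793, uniform_400), representation(0.272, uniform_400))
```

With the uniform expectation of 2.5‰, GluLeu (9.793‰) would be classed as overrepresented and AlaAla (0.272‰) as underrepresented.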

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
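As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and then into an approximate count. The helper name is hypothetical, and multiplying by the total of 3677 amino acids reported in the statistics line is an assumption about how the values were normalized, not something the page states.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


TOTAL_RESIDUES = 3677  # from the page's statistics line

glu_leu_fraction = per_mille_to_fraction(9.793)  # GluLeu value from the table
# Assumed normalization by total residue count; the result is close to an integer.
implied_count = glu_leu_fraction * TOTAL_RESIDUES

print(f"{glu_leu_fraction:.6f}", round(implied_count))
```

That the implied count lands close to a whole number is consistent with this normalization, but it does not prove it.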

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.272 ± 0.248
AlaCys: 1.36 ± 0.462
AlaAsp: 4.353 ± 0.997
AlaGlu: 4.353 ± 1.202
AlaPhe: 2.72 ± 0.674
AlaGly: 1.632 ± 0.701
AlaHis: 0.272 ± 0.281
AlaIle: 5.713 ± 1.032
AlaLys: 4.625 ± 1.035
AlaLeu: 4.897 ± 1.071
AlaMet: 2.176 ± 0.768
AlaAsn: 4.081 ± 0.824
AlaPro: 2.176 ± 0.641
AlaGln: 1.088 ± 0.506
AlaArg: 2.448 ± 0.83
AlaSer: 2.176 ± 0.687
AlaThr: 4.625 ± 1.08
AlaVal: 2.448 ± 0.773
AlaTrp: 1.088 ± 0.468
AlaTyr: 2.72 ± 0.877
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.544 ± 0.338
CysGlu: 0.544 ± 0.395
CysPhe: 0.0 ± 0.0
CysGly: 0.544 ± 0.578
CysHis: 0.272 ± 0.261
CysIle: 0.0 ± 0.0
CysLys: 0.272 ± 0.236
CysLeu: 0.816 ± 0.493
CysMet: 0.544 ± 0.431
CysAsn: 1.088 ± 0.5
CysPro: 0.272 ± 0.289
CysGln: 0.544 ± 0.578
CysArg: 0.816 ± 0.466
CysSer: 0.544 ± 0.41
CysThr: 0.0 ± 0.0
CysVal: 0.272 ± 0.261
CysTrp: 0.272 ± 0.304
CysTyr: 0.544 ± 0.494
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.36 ± 0.591
AspCys: 1.36 ± 0.796
AspAsp: 2.992 ± 0.891
AspGlu: 4.081 ± 0.947
AspPhe: 5.169 ± 1.178
AspGly: 1.904 ± 0.599
AspHis: 0.0 ± 0.0
AspIle: 6.801 ± 1.549
AspLys: 6.257 ± 1.46
AspLeu: 5.985 ± 1.061
AspMet: 0.816 ± 0.447
AspAsn: 3.264 ± 0.991
AspPro: 1.36 ± 0.567
AspGln: 0.544 ± 0.429
AspArg: 3.264 ± 0.743
AspSer: 3.536 ± 0.901
AspThr: 2.72 ± 0.862
AspVal: 2.448 ± 0.873
AspTrp: 0.272 ± 0.236
AspTyr: 4.353 ± 1.277
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.345 ± 1.0
GluCys: 0.272 ± 0.289
GluAsp: 4.897 ± 1.372
GluGlu: 4.625 ± 0.957
GluPhe: 2.72 ± 0.626
GluGly: 3.536 ± 0.944
GluHis: 0.816 ± 0.382
GluIle: 4.353 ± 1.152
GluLys: 6.257 ± 0.916
GluLeu: 9.793 ± 1.208
GluMet: 1.904 ± 0.57
GluAsn: 4.625 ± 1.166
GluPro: 1.36 ± 0.446
GluGln: 4.081 ± 1.428
GluArg: 2.992 ± 0.701
GluSer: 3.264 ± 0.888
GluThr: 4.353 ± 0.776
GluVal: 2.448 ± 1.094
GluTrp: 0.816 ± 0.45
GluTyr: 4.625 ± 1.04
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.904 ± 0.518
PheCys: 0.816 ± 0.487
PheAsp: 2.992 ± 1.043
PheGlu: 4.081 ± 0.71
PhePhe: 1.632 ± 0.657
PheGly: 1.904 ± 0.662
PheHis: 1.088 ± 0.446
PheIle: 5.985 ± 1.168
PheLys: 4.081 ± 0.851
PheLeu: 5.169 ± 1.02
PheMet: 0.272 ± 0.278
PheAsn: 4.625 ± 0.844
PhePro: 1.36 ± 0.816
PheGln: 1.36 ± 0.548
PheArg: 1.904 ± 0.716
PheSer: 4.353 ± 0.942
PheThr: 2.72 ± 0.911
PheVal: 0.816 ± 0.556
PheTrp: 0.272 ± 0.236
PheTyr: 2.448 ± 0.772
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.992 ± 0.971
GlyCys: 0.544 ± 0.396
GlyAsp: 2.992 ± 1.06
GlyGlu: 2.992 ± 0.824
GlyPhe: 3.536 ± 0.85
GlyGly: 1.36 ± 0.568
GlyHis: 0.272 ± 0.262
GlyIle: 3.536 ± 0.953
GlyLys: 4.081 ± 0.895
GlyLeu: 5.441 ± 1.396
GlyMet: 0.816 ± 0.432
GlyAsn: 2.448 ± 0.877
GlyPro: 1.088 ± 0.587
GlyGln: 0.544 ± 0.499
GlyArg: 1.904 ± 0.643
GlySer: 1.904 ± 1.051
GlyThr: 2.992 ± 0.75
GlyVal: 2.992 ± 0.917
GlyTrp: 0.544 ± 0.472
GlyTyr: 2.72 ± 0.949
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.36 ± 0.629
HisCys: 0.0 ± 0.0
HisAsp: 0.544 ± 0.397
HisGlu: 0.816 ± 0.521
HisPhe: 1.36 ± 0.532
HisGly: 1.088 ± 0.499
HisHis: 0.272 ± 0.248
HisIle: 1.36 ± 0.673
HisLys: 2.176 ± 0.785
HisLeu: 0.816 ± 0.345
HisMet: 0.0 ± 0.0
HisAsn: 0.544 ± 0.349
HisPro: 0.272 ± 0.302
HisGln: 1.088 ± 0.551
HisArg: 0.272 ± 0.31
HisSer: 1.088 ± 0.395
HisThr: 1.36 ± 0.627
HisVal: 0.816 ± 0.399
HisTrp: 0.544 ± 0.578
HisTyr: 1.904 ± 0.624
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.441 ± 1.289
IleCys: 0.544 ± 0.365
IleAsp: 5.169 ± 1.053
IleGlu: 6.801 ± 1.556
IlePhe: 4.625 ± 0.828
IleGly: 3.264 ± 0.835
IleHis: 0.816 ± 0.587
IleIle: 5.713 ± 1.463
IleLys: 8.161 ± 1.661
IleLeu: 4.353 ± 0.945
IleMet: 2.176 ± 0.671
IleAsn: 4.081 ± 1.325
IlePro: 3.808 ± 0.915
IleGln: 1.904 ± 0.824
IleArg: 2.176 ± 0.748
IleSer: 3.264 ± 1.199
IleThr: 5.985 ± 1.061
IleVal: 3.536 ± 0.816
IleTrp: 0.0 ± 0.0
IleTyr: 4.625 ± 0.951
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.617 ± 1.405
LysCys: 0.0 ± 0.0
LysAsp: 5.441 ± 1.375
LysGlu: 7.345 ± 1.488
LysPhe: 3.536 ± 0.894
LysGly: 5.713 ± 1.256
LysHis: 3.264 ± 0.756
LysIle: 5.985 ± 0.809
LysLys: 8.977 ± 1.631
LysLeu: 7.073 ± 1.246
LysMet: 1.904 ± 0.777
LysAsn: 4.897 ± 1.046
LysPro: 4.625 ± 1.46
LysGln: 3.808 ± 1.13
LysArg: 4.081 ± 1.136
LysSer: 5.441 ± 1.549
LysThr: 4.353 ± 0.832
LysVal: 6.257 ± 1.255
LysTrp: 0.544 ± 0.397
LysTyr: 5.713 ± 1.246
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.441 ± 1.632
LeuCys: 0.544 ± 0.425
LeuAsp: 7.889 ± 1.408
LeuGlu: 9.249 ± 2.163
LeuPhe: 5.713 ± 1.082
LeuGly: 4.625 ± 1.079
LeuHis: 1.36 ± 0.629
LeuIle: 5.441 ± 1.262
LeuLys: 8.705 ± 1.647
LeuLeu: 7.889 ± 1.347
LeuMet: 2.992 ± 0.889
LeuAsn: 7.073 ± 1.485
LeuPro: 3.536 ± 1.037
LeuGln: 2.72 ± 0.716
LeuArg: 3.808 ± 1.257
LeuSer: 6.257 ± 0.992
LeuThr: 3.264 ± 0.816
LeuVal: 4.353 ± 0.992
LeuTrp: 1.088 ± 0.408
LeuTyr: 4.353 ± 0.867
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.904 ± 0.82
MetCys: 0.0 ± 0.0
MetAsp: 1.088 ± 0.555
MetGlu: 1.632 ± 0.571
MetPhe: 0.544 ± 0.373
MetGly: 1.088 ± 0.428
MetHis: 0.0 ± 0.0
MetIle: 1.088 ± 0.652
MetLys: 2.992 ± 0.798
MetLeu: 2.992 ± 0.889
MetMet: 0.0 ± 0.0
MetAsn: 1.36 ± 0.633
MetPro: 0.544 ± 0.391
MetGln: 0.272 ± 0.262
MetArg: 1.088 ± 0.513
MetSer: 0.544 ± 0.384
MetThr: 1.904 ± 0.708
MetVal: 0.544 ± 0.333
MetTrp: 0.0 ± 0.0
MetTyr: 0.272 ± 0.247
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.081 ± 0.675
AsnCys: 0.272 ± 0.247
AsnAsp: 2.72 ± 0.946
AsnGlu: 4.353 ± 0.967
AsnPhe: 2.176 ± 0.77
AsnGly: 4.353 ± 1.286
AsnHis: 1.088 ± 0.514
AsnIle: 5.441 ± 1.358
AsnLys: 4.625 ± 1.029
AsnLeu: 5.985 ± 1.397
AsnMet: 1.088 ± 0.439
AsnAsn: 1.904 ± 0.527
AsnPro: 3.536 ± 0.597
AsnGln: 3.536 ± 1.134
AsnArg: 3.808 ± 0.73
AsnSer: 2.176 ± 0.508
AsnThr: 2.448 ± 1.132
AsnVal: 2.72 ± 0.875
AsnTrp: 0.544 ± 0.4
AsnTyr: 2.448 ± 0.738
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.36 ± 0.472
ProCys: 0.272 ± 0.26
ProAsp: 1.904 ± 0.608
ProGlu: 2.72 ± 1.086
ProPhe: 1.904 ± 0.691
ProGly: 0.816 ± 0.474
ProHis: 0.272 ± 0.247
ProIle: 1.088 ± 0.615
ProLys: 5.169 ± 1.119
ProLeu: 3.536 ± 0.827
ProMet: 0.272 ± 0.294
ProAsn: 2.72 ± 1.114
ProPro: 1.088 ± 0.434
ProGln: 1.904 ± 0.893
ProArg: 2.176 ± 0.812
ProSer: 1.904 ± 0.608
ProThr: 2.176 ± 0.623
ProVal: 1.904 ± 0.77
ProTrp: 0.272 ± 0.236
ProTyr: 1.088 ± 0.485
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.992 ± 0.616
GlnCys: 0.0 ± 0.0
GlnAsp: 1.36 ± 0.666
GlnGlu: 3.808 ± 1.169
GlnPhe: 0.816 ± 0.43
GlnGly: 1.36 ± 0.805
GlnHis: 0.544 ± 0.352
GlnIle: 2.72 ± 0.834
GlnLys: 3.264 ± 0.744
GlnLeu: 3.536 ± 0.975
GlnMet: 0.272 ± 0.258
GlnAsn: 2.448 ± 0.742
GlnPro: 0.544 ± 0.374
GlnGln: 1.632 ± 0.543
GlnArg: 2.176 ± 0.634
GlnSer: 2.176 ± 0.756
GlnThr: 1.904 ± 0.684
GlnVal: 2.72 ± 0.745
GlnTrp: 0.544 ± 0.351
GlnTyr: 1.632 ± 0.822
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.632 ± 0.598
ArgCys: 0.272 ± 0.289
ArgAsp: 2.448 ± 0.773
ArgGlu: 3.808 ± 1.186
ArgPhe: 2.176 ± 0.624
ArgGly: 2.448 ± 0.967
ArgHis: 1.904 ± 0.814
ArgIle: 4.353 ± 0.921
ArgLys: 2.448 ± 0.853
ArgLeu: 4.353 ± 0.92
ArgMet: 0.272 ± 0.322
ArgAsn: 2.72 ± 0.773
ArgPro: 1.088 ± 0.532
ArgGln: 1.904 ± 0.602
ArgArg: 1.904 ± 0.724
ArgSer: 1.904 ± 0.638
ArgThr: 3.536 ± 1.25
ArgVal: 3.808 ± 0.852
ArgTrp: 0.272 ± 0.329
ArgTyr: 2.992 ± 0.988
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 1.632 ± 0.724
SerCys: 0.272 ± 0.289
SerAsp: 5.169 ± 1.085
SerGlu: 4.081 ± 1.068
SerPhe: 1.632 ± 0.469
SerGly: 1.632 ± 0.547
SerHis: 0.816 ± 0.467
SerIle: 5.169 ± 1.11
SerLys: 5.441 ± 1.012
SerLeu: 5.985 ± 0.584
SerMet: 0.816 ± 0.449
SerAsn: 2.448 ± 0.646
SerPro: 0.816 ± 0.362
SerGln: 2.992 ± 1.223
SerArg: 1.632 ± 0.602
SerSer: 2.992 ± 0.902
SerThr: 3.536 ± 0.911
SerVal: 2.448 ± 0.766
SerTrp: 1.088 ± 0.546
SerTyr: 3.264 ± 0.985
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.992 ± 0.969
ThrCys: 0.272 ± 0.247
ThrAsp: 1.36 ± 0.433
ThrGlu: 3.264 ± 0.798
ThrPhe: 4.353 ± 1.38
ThrGly: 3.808 ± 1.034
ThrHis: 2.176 ± 0.673
ThrIle: 4.081 ± 1.005
ThrLys: 6.529 ± 1.369
ThrLeu: 7.073 ± 1.362
ThrMet: 1.088 ± 0.519
ThrAsn: 1.904 ± 0.928
ThrPro: 2.448 ± 0.572
ThrGln: 2.448 ± 0.931
ThrArg: 3.808 ± 0.819
ThrSer: 1.904 ± 0.74
ThrThr: 4.081 ± 1.401
ThrVal: 3.536 ± 1.372
ThrTrp: 0.272 ± 0.242
ThrTyr: 2.176 ± 0.844
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.72 ± 0.745
ValCys: 0.272 ± 0.236
ValAsp: 2.448 ± 0.754
ValGlu: 2.992 ± 1.057
ValPhe: 2.72 ± 0.769
ValGly: 1.904 ± 0.655
ValHis: 0.816 ± 0.541
ValIle: 4.081 ± 0.946
ValLys: 4.897 ± 1.018
ValLeu: 5.169 ± 1.257
ValMet: 0.816 ± 0.488
ValAsn: 4.081 ± 0.844
ValPro: 1.904 ± 0.975
ValGln: 1.632 ± 0.675
ValArg: 1.36 ± 0.563
ValSer: 3.808 ± 0.979
ValThr: 4.353 ± 1.233
ValVal: 3.264 ± 0.671
ValTrp: 0.272 ± 0.294
ValTyr: 1.088 ± 0.57
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 0.816 ± 0.475
TrpGlu: 1.088 ± 0.561
TrpPhe: 0.0 ± 0.0
TrpGly: 0.272 ± 0.26
TrpHis: 0.272 ± 0.236
TrpIle: 0.544 ± 0.379
TrpLys: 1.088 ± 0.535
TrpLeu: 1.36 ± 0.468
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.272 ± 0.236
TrpGln: 0.272 ± 0.236
TrpArg: 0.816 ± 0.367
TrpSer: 0.544 ± 0.299
TrpThr: 0.544 ± 0.41
TrpVal: 0.816 ± 0.392
TrpTrp: 0.272 ± 0.262
TrpTyr: 0.272 ± 0.236
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.448 ± 0.851
TyrCys: 0.816 ± 0.384
TyrAsp: 1.36 ± 0.493
TyrGlu: 2.448 ± 0.786
TyrPhe: 2.448 ± 0.529
TyrGly: 2.448 ± 1.008
TyrHis: 1.36 ± 0.46
TyrIle: 2.992 ± 0.775
TyrLys: 6.529 ± 1.347
TyrLeu: 4.081 ± 0.92
TyrMet: 1.36 ± 0.672
TyrAsn: 2.992 ± 0.842
TyrPro: 2.448 ± 0.735
TyrGln: 2.176 ± 0.741
TyrArg: 3.808 ± 1.143
TyrSer: 4.081 ± 1.159
TyrThr: 2.72 ± 0.787
TyrVal: 2.448 ± 0.793
TyrTrp: 0.272 ± 0.289
TyrTyr: 1.904 ± 1.002
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 23 proteins (3677 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
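One way such a protein-level bootstrap could work is sketched below. It is an illustration only: the function names, the toy sequences, the fixed seed, and the use of the standard deviation across resamples as the reported error are assumptions, since the page does not describe the exact procedure.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, pair):
    """Per mille frequency of one dipeptide over a list of protein sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Resample whole proteins with replacement and return the spread."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, pair))
    # Assumed error measure: sample standard deviation over the resamples.
    return statistics.stdev(estimates)


toy_proteins = ["MKKLAE", "MKTLA", "AEEKLL"]  # hypothetical sequences
mk_error = bootstrap_error(toy_proteins, "MK")
ww_error = bootstrap_error(toy_proteins, "WW")  # pair absent from every protein
print(mk_error, ww_error)
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins. A dipeptide that never occurs gets an error of 0.0 in every resample, which matches the "0.0 ± 0.0" entries in the table.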

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski