Amino acid dipeptide frequency for Streptococcus satellite phage Javan607

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
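As a rough illustration of what a dipeptide frequency is, the following minimal sketch counts overlapping residue pairs in a list of protein sequences and reports them in per mille. The function name, the toy sequences, and the counting convention (pairs never span two proteins, normalization by the total number of pairs) are assumptions made for this example; the page does not state the exact procedure used by the database.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences (one-letter codes), not taken from Javan607.
toy = ["MKKLLE", "MKE"]
freqs = dipeptide_frequencies(toy)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

The one-letter codes used here map onto the three-letter names in the table (for example, `MK` corresponds to MetLys).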

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be expected at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰) and must therefore be multiplied by 10⁻³ to obtain plain fractions.
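The arithmetic behind the expected value and the unit conversion can be sketched as follows. The function names and the simple greater-than/less-than rule are assumptions for illustration; the page does not say which threshold the database uses when colouring cells.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(value_per_mille, alphabet_size=20):
    """Label a table value relative to the uniform expectation (assumed rule)."""
    expected = expected_per_mille(alphabet_size)
    if value_per_mille > expected:
        return "over"    # would correspond to red on the page
    if value_per_mille < expected:
        return "under"   # would correspond to blue on the page
    return "expected"


glu_leu = 11.42             # GluLeu value from the table, in per mille
fraction = glu_leu * 1e-3   # the same value as a plain fraction

print(expected_per_mille(20))   # 400 dipeptides
print(expected_per_mille(21))   # 441 dipeptides when Xaa is included
print(classify(glu_leu), fraction)
```

With 20 amino acids the expectation is 2.5‰; including Xaa lowers it slightly, to 1000/441‰.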

For more information, see the sequence space article on Wikipedia.

Residue order (first and second position): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Each entry: dipeptide: frequency ± error (‰)

Ala
AlaAla: 0.714 ± 0.777
AlaCys: 0.714 ± 0.436
AlaAsp: 3.926 ± 1.044
AlaGlu: 3.212 ± 0.883
AlaPhe: 1.784 ± 0.818
AlaGly: 3.212 ± 1.015
AlaHis: 0.357 ± 0.369
AlaIle: 6.067 ± 1.116
AlaLys: 4.64 ± 1.418
AlaLeu: 3.569 ± 1.358
AlaMet: 1.071 ± 0.691
AlaAsn: 5.353 ± 1.55
AlaPro: 1.784 ± 0.718
AlaGln: 3.212 ± 1.174
AlaArg: 3.569 ± 0.824
AlaSer: 3.212 ± 1.083
AlaThr: 3.212 ± 0.919
AlaVal: 2.498 ± 0.671
AlaTrp: 0.357 ± 0.309
AlaTyr: 2.498 ± 0.802
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.714 ± 0.546
CysGlu: 0.357 ± 0.417
CysPhe: 0.357 ± 0.333
CysGly: 0.714 ± 0.427
CysHis: 0.357 ± 0.312
CysIle: 0.0 ± 0.0
CysLys: 0.357 ± 0.304
CysLeu: 0.357 ± 0.312
CysMet: 0.0 ± 0.0
CysAsn: 0.357 ± 0.304
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.357 ± 0.309
CysSer: 0.714 ± 0.571
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.714 ± 0.527
CysXaa: 0.0 ± 0.0

Asp
AspAla: 1.428 ± 1.186
AspCys: 0.357 ± 0.389
AspAsp: 3.212 ± 1.213
AspGlu: 7.852 ± 1.551
AspPhe: 3.926 ± 1.135
AspGly: 3.926 ± 1.336
AspHis: 0.357 ± 0.42
AspIle: 5.353 ± 1.425
AspLys: 6.424 ± 1.159
AspLeu: 4.64 ± 1.471
AspMet: 1.784 ± 0.668
AspAsn: 3.926 ± 1.035
AspPro: 1.071 ± 0.692
AspGln: 0.357 ± 0.321
AspArg: 1.428 ± 0.613
AspSer: 4.283 ± 1.208
AspThr: 3.926 ± 1.196
AspVal: 3.212 ± 0.572
AspTrp: 0.714 ± 0.607
AspTyr: 5.353 ± 1.534
AspXaa: 0.0 ± 0.0

Glu
GluAla: 6.781 ± 1.494
GluCys: 0.0 ± 0.0
GluAsp: 3.212 ± 0.906
GluGlu: 4.996 ± 1.404
GluPhe: 4.283 ± 1.052
GluGly: 2.498 ± 0.994
GluHis: 0.714 ± 0.462
GluIle: 5.353 ± 1.539
GluLys: 6.424 ± 1.513
GluLeu: 11.42 ± 2.079
GluMet: 1.428 ± 0.612
GluAsn: 3.926 ± 0.998
GluPro: 2.141 ± 0.998
GluGln: 2.498 ± 1.569
GluArg: 2.141 ± 1.029
GluSer: 3.926 ± 1.055
GluThr: 5.353 ± 1.464
GluVal: 2.855 ± 1.261
GluTrp: 0.357 ± 0.376
GluTyr: 5.353 ± 1.377
GluXaa: 0.0 ± 0.0

Phe
PheAla: 1.071 ± 0.499
PheCys: 0.0 ± 0.0
PheAsp: 3.926 ± 0.912
PheGlu: 5.71 ± 1.578
PhePhe: 0.714 ± 0.439
PheGly: 1.428 ± 0.646
PheHis: 0.357 ± 0.309
PheIle: 2.498 ± 0.769
PheLys: 4.283 ± 1.017
PheLeu: 5.71 ± 1.3
PheMet: 0.357 ± 0.359
PheAsn: 4.64 ± 1.56
PhePro: 0.0 ± 0.0
PheGln: 1.784 ± 0.964
PheArg: 1.784 ± 0.761
PheSer: 1.428 ± 0.567
PheThr: 1.428 ± 0.725
PheVal: 1.071 ± 0.589
PheTrp: 1.071 ± 0.482
PheTyr: 1.071 ± 0.495
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 1.784 ± 0.685
GlyCys: 0.714 ± 0.525
GlyAsp: 4.283 ± 1.515
GlyGlu: 2.855 ± 0.93
GlyPhe: 2.855 ± 0.846
GlyGly: 2.141 ± 0.879
GlyHis: 0.357 ± 0.309
GlyIle: 4.283 ± 0.998
GlyLys: 5.71 ± 1.583
GlyLeu: 3.569 ± 1.26
GlyMet: 0.0 ± 0.0
GlyAsn: 3.926 ± 1.616
GlyPro: 0.0 ± 0.0
GlyGln: 1.784 ± 0.77
GlyArg: 1.428 ± 0.658
GlySer: 1.784 ± 0.824
GlyThr: 2.855 ± 1.3
GlyVal: 5.353 ± 1.339
GlyTrp: 1.071 ± 0.668
GlyTyr: 4.283 ± 1.537
GlyXaa: 0.0 ± 0.0

His
HisAla: 2.141 ± 0.884
HisCys: 0.357 ± 0.312
HisAsp: 1.071 ± 0.784
HisGlu: 2.498 ± 1.12
HisPhe: 1.071 ± 0.597
HisGly: 0.714 ± 0.389
HisHis: 0.0 ± 0.0
HisIle: 0.714 ± 0.417
HisLys: 1.071 ± 0.604
HisLeu: 1.428 ± 0.481
HisMet: 0.0 ± 0.0
HisAsn: 0.714 ± 0.512
HisPro: 0.0 ± 0.0
HisGln: 0.714 ± 0.496
HisArg: 0.714 ± 0.417
HisSer: 0.714 ± 0.47
HisThr: 1.784 ± 0.697
HisVal: 0.357 ± 0.309
HisTrp: 0.0 ± 0.0
HisTyr: 0.357 ± 0.309
HisXaa: 0.0 ± 0.0

Ile
IleAla: 4.283 ± 1.225
IleCys: 0.714 ± 0.512
IleAsp: 4.64 ± 1.425
IleGlu: 5.353 ± 1.239
IlePhe: 1.784 ± 0.904
IleGly: 2.141 ± 0.599
IleHis: 1.784 ± 0.708
IleIle: 2.498 ± 0.961
IleLys: 6.067 ± 1.215
IleLeu: 4.996 ± 1.319
IleMet: 0.714 ± 0.418
IleAsn: 4.283 ± 1.788
IlePro: 3.212 ± 1.621
IleGln: 1.784 ± 0.649
IleArg: 1.428 ± 0.741
IleSer: 3.569 ± 1.123
IleThr: 4.283 ± 0.902
IleVal: 4.996 ± 1.776
IleTrp: 1.071 ± 0.742
IleTyr: 4.996 ± 1.075
IleXaa: 0.0 ± 0.0

Lys
LysAla: 4.64 ± 1.53
LysCys: 0.357 ± 0.369
LysAsp: 4.64 ± 1.002
LysGlu: 8.922 ± 2.027
LysPhe: 3.212 ± 1.008
LysGly: 5.353 ± 1.386
LysHis: 2.855 ± 0.831
LysIle: 5.353 ± 1.024
LysLys: 9.636 ± 2.635
LysLeu: 8.208 ± 1.895
LysMet: 3.569 ± 1.181
LysAsn: 5.353 ± 1.159
LysPro: 3.926 ± 1.478
LysGln: 4.283 ± 0.966
LysArg: 6.424 ± 1.346
LysSer: 4.64 ± 0.878
LysThr: 4.996 ± 1.211
LysVal: 4.64 ± 1.103
LysTrp: 0.714 ± 0.574
LysTyr: 3.926 ± 1.034
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 6.424 ± 1.165
LeuCys: 0.714 ± 0.486
LeuAsp: 7.852 ± 1.45
LeuGlu: 8.208 ± 2.122
LeuPhe: 2.141 ± 0.827
LeuGly: 6.424 ± 1.587
LeuHis: 1.784 ± 0.775
LeuIle: 3.569 ± 0.756
LeuLys: 7.495 ± 1.194
LeuLeu: 8.565 ± 2.005
LeuMet: 1.071 ± 0.625
LeuAsn: 4.64 ± 1.612
LeuPro: 3.569 ± 1.03
LeuGln: 2.855 ± 1.178
LeuArg: 2.141 ± 1.0
LeuSer: 7.852 ± 1.679
LeuThr: 3.926 ± 1.169
LeuVal: 3.569 ± 1.253
LeuTrp: 1.071 ± 0.602
LeuTyr: 2.855 ± 0.861
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.071 ± 0.729
MetCys: 0.0 ± 0.0
MetAsp: 2.141 ± 0.749
MetGlu: 2.141 ± 0.813
MetPhe: 0.0 ± 0.0
MetGly: 0.714 ± 0.392
MetHis: 0.357 ± 0.369
MetIle: 1.428 ± 0.606
MetLys: 1.428 ± 1.014
MetLeu: 1.428 ± 0.81
MetMet: 1.071 ± 0.597
MetAsn: 1.428 ± 0.553
MetPro: 0.714 ± 0.48
MetGln: 0.357 ± 0.353
MetArg: 0.714 ± 0.495
MetSer: 2.855 ± 0.951
MetThr: 1.784 ± 0.689
MetVal: 2.141 ± 1.138
MetTrp: 0.714 ± 0.639
MetTyr: 0.714 ± 0.396
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.569 ± 1.068
AsnCys: 0.0 ± 0.0
AsnAsp: 2.498 ± 1.109
AsnGlu: 4.64 ± 0.872
AsnPhe: 1.428 ± 0.713
AsnGly: 6.781 ± 1.317
AsnHis: 1.071 ± 0.596
AsnIle: 3.569 ± 1.258
AsnLys: 6.424 ± 1.265
AsnLeu: 3.569 ± 1.446
AsnMet: 3.569 ± 1.224
AsnAsn: 3.926 ± 1.109
AsnPro: 4.283 ± 0.938
AsnGln: 0.714 ± 0.454
AsnArg: 1.071 ± 0.523
AsnSer: 5.71 ± 1.798
AsnThr: 3.569 ± 1.141
AsnVal: 3.926 ± 1.113
AsnTrp: 0.357 ± 0.321
AsnTyr: 2.855 ± 0.923
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.855 ± 0.703
ProCys: 0.0 ± 0.0
ProAsp: 1.428 ± 0.634
ProGlu: 0.714 ± 0.451
ProPhe: 2.855 ± 1.173
ProGly: 0.357 ± 0.376
ProHis: 0.357 ± 0.312
ProIle: 3.212 ± 1.035
ProLys: 4.64 ± 1.924
ProLeu: 2.498 ± 0.573
ProMet: 0.357 ± 0.353
ProAsn: 3.212 ± 1.98
ProPro: 2.141 ± 0.861
ProGln: 1.071 ± 0.713
ProArg: 1.428 ± 0.922
ProSer: 1.428 ± 0.757
ProThr: 1.428 ± 0.612
ProVal: 2.141 ± 1.069
ProTrp: 0.357 ± 0.304
ProTyr: 1.071 ± 0.45
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 2.498 ± 1.133
GlnCys: 0.0 ± 0.0
GlnAsp: 2.498 ± 1.152
GlnGlu: 5.353 ± 1.295
GlnPhe: 2.141 ± 0.722
GlnGly: 1.784 ± 0.824
GlnHis: 0.714 ± 0.618
GlnIle: 2.855 ± 1.021
GlnLys: 1.784 ± 0.682
GlnLeu: 2.498 ± 1.057
GlnMet: 0.714 ± 0.49
GlnAsn: 0.714 ± 0.452
GlnPro: 1.428 ± 0.675
GlnGln: 1.071 ± 0.568
GlnArg: 1.071 ± 0.592
GlnSer: 2.855 ± 0.938
GlnThr: 2.498 ± 0.933
GlnVal: 1.428 ± 0.573
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.714 ± 0.47
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 2.855 ± 1.232
ArgCys: 0.0 ± 0.0
ArgAsp: 1.784 ± 0.738
ArgGlu: 1.428 ± 0.726
ArgPhe: 0.357 ± 0.312
ArgGly: 1.784 ± 0.778
ArgHis: 0.714 ± 0.525
ArgIle: 2.855 ± 1.139
ArgLys: 3.212 ± 0.889
ArgLeu: 2.141 ± 0.721
ArgMet: 0.0 ± 0.0
ArgAsn: 3.569 ± 1.03
ArgPro: 1.428 ± 0.867
ArgGln: 1.784 ± 0.929
ArgArg: 0.714 ± 0.45
ArgSer: 0.714 ± 0.418
ArgThr: 2.498 ± 0.988
ArgVal: 3.569 ± 1.044
ArgTrp: 0.714 ± 0.474
ArgTyr: 3.569 ± 0.938
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 2.141 ± 0.997
SerCys: 0.0 ± 0.0
SerAsp: 4.64 ± 0.769
SerGlu: 3.212 ± 1.209
SerPhe: 2.141 ± 0.919
SerGly: 3.569 ± 0.894
SerHis: 0.714 ± 0.417
SerIle: 3.926 ± 1.16
SerLys: 8.565 ± 2.033
SerLeu: 4.283 ± 0.954
SerMet: 1.428 ± 0.576
SerAsn: 4.283 ± 1.049
SerPro: 1.784 ± 0.904
SerGln: 2.855 ± 1.326
SerArg: 2.141 ± 0.896
SerSer: 5.353 ± 2.543
SerThr: 4.283 ± 1.472
SerVal: 4.283 ± 1.408
SerTrp: 1.784 ± 0.976
SerTyr: 4.283 ± 1.702
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 3.569 ± 1.251
ThrCys: 0.0 ± 0.0
ThrAsp: 3.569 ± 1.497
ThrGlu: 1.428 ± 0.675
ThrPhe: 4.283 ± 1.804
ThrGly: 2.141 ± 1.034
ThrHis: 1.071 ± 0.597
ThrIle: 2.855 ± 1.219
ThrLys: 6.781 ± 1.488
ThrLeu: 6.424 ± 1.361
ThrMet: 2.498 ± 0.751
ThrAsn: 3.212 ± 1.074
ThrPro: 2.855 ± 1.029
ThrGln: 2.141 ± 1.302
ThrArg: 3.212 ± 1.084
ThrSer: 4.996 ± 2.339
ThrThr: 3.212 ± 1.211
ThrVal: 6.781 ± 1.423
ThrTrp: 0.357 ± 0.312
ThrTyr: 2.498 ± 0.944
ThrXaa: 0.0 ± 0.0

Val
ValAla: 5.353 ± 1.369
ValCys: 0.714 ± 0.463
ValAsp: 2.855 ± 0.985
ValGlu: 1.784 ± 0.885
ValPhe: 2.498 ± 0.647
ValGly: 2.141 ± 0.77
ValHis: 1.071 ± 0.541
ValIle: 4.64 ± 1.042
ValLys: 4.64 ± 1.233
ValLeu: 6.424 ± 1.729
ValMet: 1.071 ± 0.605
ValAsn: 2.498 ± 0.85
ValPro: 1.784 ± 0.901
ValGln: 2.141 ± 0.816
ValArg: 1.071 ± 0.602
ValSer: 4.283 ± 1.326
ValThr: 9.636 ± 1.649
ValVal: 5.71 ± 1.54
ValTrp: 0.0 ± 0.0
ValTyr: 1.784 ± 0.801
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.357 ± 0.321
TrpCys: 0.0 ± 0.0
TrpAsp: 0.714 ± 0.409
TrpGlu: 0.714 ± 0.442
TrpPhe: 0.357 ± 0.389
TrpGly: 0.357 ± 0.376
TrpHis: 0.714 ± 0.392
TrpIle: 0.714 ± 0.454
TrpLys: 1.071 ± 0.462
TrpLeu: 1.428 ± 0.779
TrpMet: 0.357 ± 0.389
TrpAsn: 0.357 ± 0.333
TrpPro: 0.0 ± 0.0
TrpGln: 0.357 ± 0.304
TrpArg: 0.357 ± 0.304
TrpSer: 1.071 ± 0.45
TrpThr: 0.357 ± 0.432
TrpVal: 0.714 ± 0.593
TrpTrp: 0.357 ± 0.309
TrpTyr: 1.071 ± 0.442
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 1.428 ± 0.634
TyrCys: 0.714 ± 0.517
TyrAsp: 3.926 ± 1.192
TyrGlu: 4.283 ± 1.411
TyrPhe: 2.141 ± 1.26
TyrGly: 2.855 ± 0.878
TyrHis: 0.714 ± 0.439
TyrIle: 3.212 ± 1.179
TyrLys: 4.996 ± 1.556
TyrLeu: 3.569 ± 0.98
TyrMet: 1.784 ± 0.803
TyrAsn: 3.212 ± 1.332
TyrPro: 1.428 ± 0.646
TyrGln: 2.855 ± 0.965
TyrArg: 2.498 ± 0.988
TyrSer: 4.283 ± 1.963
TyrThr: 2.855 ± 0.726
TyrVal: 2.855 ± 1.063
TyrTrp: 0.357 ± 0.321
TyrTyr: 2.141 ± 0.859
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 18 proteins (2803 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
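Bootstrapping at the protein level means that whole proteins, not individual residues, are resampled with replacement. A minimal sketch of the idea follows, assuming the reported error is the standard deviation of the frequency across resamples; the function names, the fixed seed, the toy sequences, and the choice of `statistics.pstdev` are assumptions for illustration and are not confirmed by the page.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Frequency of one dipeptide in per mille over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.pstdev(estimates)


# Hypothetical toy proteome (one-letter codes), not taken from Javan607.
toy = ["MKKLLE", "MKE", "MLLKE"]
print(f"KK: {pair_frequency(toy, 'KK'):.3f} ± {bootstrap_error(toy, 'KK'):.3f}")
```

Under this scheme a dipeptide that never occurs gets an error of exactly 0.0, which matches the `0.0 ± 0.0` entries in the table.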

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski