Amino acid dipepetide frequency for Streptococcus satellite phage Javan622

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.878AlaAla: 0.878 ± 0.572
1.17AlaCys: 1.17 ± 0.674
4.974AlaAsp: 4.974 ± 0.854
4.096AlaGlu: 4.096 ± 1.704
1.755AlaPhe: 1.755 ± 0.657
2.341AlaGly: 2.341 ± 0.732
0.293AlaHis: 0.293 ± 0.287
3.218AlaIle: 3.218 ± 0.946
5.559AlaLys: 5.559 ± 1.729
6.729AlaLeu: 6.729 ± 2.617
1.463AlaMet: 1.463 ± 0.481
3.511AlaAsn: 3.511 ± 0.811
0.0AlaPro: 0.0 ± 0.0
1.463AlaGln: 1.463 ± 0.745
2.926AlaArg: 2.926 ± 0.973
3.803AlaSer: 3.803 ± 0.952
2.633AlaThr: 2.633 ± 0.772
2.341AlaVal: 2.341 ± 0.603
0.878AlaTrp: 0.878 ± 0.495
2.341AlaTyr: 2.341 ± 0.767
0.0AlaXaa: 0.0 ± 0.0
Cys
0.293CysAla: 0.293 ± 0.332
0.293CysCys: 0.293 ± 0.298
0.293CysAsp: 0.293 ± 0.332
0.585CysGlu: 0.585 ± 0.468
0.585CysPhe: 0.585 ± 0.34
0.0CysGly: 0.0 ± 0.0
0.293CysHis: 0.293 ± 0.313
0.878CysIle: 0.878 ± 0.684
0.0CysLys: 0.0 ± 0.0
0.585CysLeu: 0.585 ± 0.364
0.293CysMet: 0.293 ± 0.316
0.293CysAsn: 0.293 ± 0.228
0.0CysPro: 0.0 ± 0.0
0.293CysGln: 0.293 ± 0.298
0.585CysArg: 0.585 ± 0.364
0.293CysSer: 0.293 ± 0.265
0.0CysThr: 0.0 ± 0.0
0.293CysVal: 0.293 ± 0.228
0.0CysTrp: 0.0 ± 0.0
0.878CysTyr: 0.878 ± 0.417
0.0CysXaa: 0.0 ± 0.0
Asp
2.633AspAla: 2.633 ± 1.069
0.293AspCys: 0.293 ± 0.298
4.681AspAsp: 4.681 ± 1.174
4.681AspGlu: 4.681 ± 1.362
3.218AspPhe: 3.218 ± 0.736
2.048AspGly: 2.048 ± 0.915
0.0AspHis: 0.0 ± 0.0
6.729AspIle: 6.729 ± 1.773
7.607AspLys: 7.607 ± 2.408
6.144AspLeu: 6.144 ± 1.799
1.17AspMet: 1.17 ± 0.555
5.266AspAsn: 5.266 ± 1.044
0.878AspPro: 0.878 ± 0.43
2.341AspGln: 2.341 ± 0.656
1.755AspArg: 1.755 ± 0.614
2.926AspSer: 2.926 ± 0.959
1.17AspThr: 1.17 ± 0.425
4.681AspVal: 4.681 ± 0.926
0.585AspTrp: 0.585 ± 0.405
4.096AspTyr: 4.096 ± 0.961
0.0AspXaa: 0.0 ± 0.0
Glu
2.048GluAla: 2.048 ± 0.662
0.585GluCys: 0.585 ± 0.348
5.266GluAsp: 5.266 ± 1.242
6.144GluGlu: 6.144 ± 1.671
4.389GluPhe: 4.389 ± 1.04
1.755GluGly: 1.755 ± 0.892
1.17GluHis: 1.17 ± 0.598
5.559GluIle: 5.559 ± 1.22
6.144GluLys: 6.144 ± 1.728
13.458GluLeu: 13.458 ± 1.324
2.341GluMet: 2.341 ± 0.835
5.559GluAsn: 5.559 ± 1.817
2.341GluPro: 2.341 ± 0.683
1.755GluGln: 1.755 ± 0.619
6.144GluArg: 6.144 ± 1.96
3.803GluSer: 3.803 ± 0.891
4.681GluThr: 4.681 ± 1.153
4.681GluVal: 4.681 ± 1.071
0.585GluTrp: 0.585 ± 0.392
3.511GluTyr: 3.511 ± 1.066
0.0GluXaa: 0.0 ± 0.0
Phe
2.926PheAla: 2.926 ± 1.047
0.293PheCys: 0.293 ± 0.298
3.511PheAsp: 3.511 ± 1.093
4.681PheGlu: 4.681 ± 1.191
2.341PhePhe: 2.341 ± 0.802
2.926PheGly: 2.926 ± 0.574
1.17PheHis: 1.17 ± 0.427
3.511PheIle: 3.511 ± 1.116
2.633PheLys: 2.633 ± 0.633
3.511PheLeu: 3.511 ± 0.924
0.293PheMet: 0.293 ± 0.265
1.463PheAsn: 1.463 ± 0.609
0.878PhePro: 0.878 ± 0.441
0.585PheGln: 0.585 ± 0.361
2.341PheArg: 2.341 ± 1.113
1.755PheSer: 1.755 ± 0.678
1.755PheThr: 1.755 ± 0.549
3.218PheVal: 3.218 ± 0.821
0.878PheTrp: 0.878 ± 0.382
0.878PheTyr: 0.878 ± 0.582
0.0PheXaa: 0.0 ± 0.0
Gly
4.096GlyAla: 4.096 ± 0.851
0.878GlyCys: 0.878 ± 0.507
1.755GlyAsp: 1.755 ± 0.531
2.633GlyGlu: 2.633 ± 0.911
1.463GlyPhe: 1.463 ± 0.817
1.755GlyGly: 1.755 ± 0.52
0.293GlyHis: 0.293 ± 0.282
4.096GlyIle: 4.096 ± 0.735
4.974GlyLys: 4.974 ± 1.193
4.096GlyLeu: 4.096 ± 0.851
1.463GlyMet: 1.463 ± 0.564
1.463GlyAsn: 1.463 ± 0.677
0.0GlyPro: 0.0 ± 0.0
1.463GlyGln: 1.463 ± 0.568
0.878GlyArg: 0.878 ± 0.382
2.341GlySer: 2.341 ± 0.636
2.926GlyThr: 2.926 ± 0.921
3.511GlyVal: 3.511 ± 0.872
0.585GlyTrp: 0.585 ± 0.369
1.755GlyTyr: 1.755 ± 0.525
0.0GlyXaa: 0.0 ± 0.0
His
1.17HisAla: 1.17 ± 0.86
0.0HisCys: 0.0 ± 0.0
0.878HisAsp: 0.878 ± 0.439
0.293HisGlu: 0.293 ± 0.228
1.463HisPhe: 1.463 ± 0.775
0.293HisGly: 0.293 ± 0.282
0.585HisHis: 0.585 ± 0.504
2.633HisIle: 2.633 ± 1.005
1.17HisLys: 1.17 ± 0.475
1.463HisLeu: 1.463 ± 0.572
0.293HisMet: 0.293 ± 0.326
1.17HisAsn: 1.17 ± 0.534
0.293HisPro: 0.293 ± 0.298
0.585HisGln: 0.585 ± 0.373
0.878HisArg: 0.878 ± 0.517
1.17HisSer: 1.17 ± 0.679
0.878HisThr: 0.878 ± 0.603
0.585HisVal: 0.585 ± 0.53
0.293HisTrp: 0.293 ± 0.268
0.585HisTyr: 0.585 ± 0.359
0.0HisXaa: 0.0 ± 0.0
Ile
4.681IleAla: 4.681 ± 1.318
0.0IleCys: 0.0 ± 0.0
5.851IleAsp: 5.851 ± 1.678
6.729IleGlu: 6.729 ± 2.191
4.096IlePhe: 4.096 ± 1.097
3.511IleGly: 3.511 ± 0.948
2.048IleHis: 2.048 ± 0.649
2.633IleIle: 2.633 ± 0.847
8.192IleLys: 8.192 ± 1.511
6.144IleLeu: 6.144 ± 1.444
0.878IleMet: 0.878 ± 0.438
4.681IleAsn: 4.681 ± 1.155
1.755IlePro: 1.755 ± 0.623
5.266IleGln: 5.266 ± 1.087
2.341IleArg: 2.341 ± 0.51
7.022IleSer: 7.022 ± 1.616
2.926IleThr: 2.926 ± 0.943
3.511IleVal: 3.511 ± 0.926
0.0IleTrp: 0.0 ± 0.0
1.755IleTyr: 1.755 ± 0.628
0.0IleXaa: 0.0 ± 0.0
Lys
5.266LysAla: 5.266 ± 1.168
0.0LysCys: 0.0 ± 0.0
3.218LysAsp: 3.218 ± 1.131
11.41LysGlu: 11.41 ± 1.853
2.633LysPhe: 2.633 ± 1.013
2.341LysGly: 2.341 ± 0.747
1.755LysHis: 1.755 ± 0.735
5.559LysIle: 5.559 ± 1.141
10.24LysLys: 10.24 ± 1.162
9.362LysLeu: 9.362 ± 1.781
3.511LysMet: 3.511 ± 1.186
5.851LysAsn: 5.851 ± 1.207
2.633LysPro: 2.633 ± 0.676
5.266LysGln: 5.266 ± 1.034
5.559LysArg: 5.559 ± 1.172
6.729LysSer: 6.729 ± 1.494
6.437LysThr: 6.437 ± 1.187
4.974LysVal: 4.974 ± 1.047
0.585LysTrp: 0.585 ± 0.338
4.389LysTyr: 4.389 ± 1.076
0.0LysXaa: 0.0 ± 0.0
Leu
6.437LeuAla: 6.437 ± 1.251
0.878LeuCys: 0.878 ± 0.398
6.729LeuAsp: 6.729 ± 1.22
7.314LeuGlu: 7.314 ± 1.715
5.851LeuPhe: 5.851 ± 1.467
4.974LeuGly: 4.974 ± 1.205
1.17LeuHis: 1.17 ± 0.49
8.484LeuIle: 8.484 ± 1.881
9.362LeuLys: 9.362 ± 1.625
11.703LeuLeu: 11.703 ± 2.373
1.755LeuMet: 1.755 ± 0.816
7.022LeuAsn: 7.022 ± 1.677
2.048LeuPro: 2.048 ± 0.565
7.607LeuGln: 7.607 ± 1.489
3.803LeuArg: 3.803 ± 0.885
7.022LeuSer: 7.022 ± 1.341
5.851LeuThr: 5.851 ± 1.575
4.389LeuVal: 4.389 ± 1.405
0.585LeuTrp: 0.585 ± 0.338
2.926LeuTyr: 2.926 ± 0.896
0.0LeuXaa: 0.0 ± 0.0
Met
1.17MetAla: 1.17 ± 0.446
0.0MetCys: 0.0 ± 0.0
1.463MetAsp: 1.463 ± 0.516
2.926MetGlu: 2.926 ± 0.923
0.585MetPhe: 0.585 ± 0.325
0.585MetGly: 0.585 ± 0.33
0.293MetHis: 0.293 ± 0.287
0.878MetIle: 0.878 ± 0.54
2.341MetLys: 2.341 ± 0.749
2.633MetLeu: 2.633 ± 0.893
0.293MetMet: 0.293 ± 0.228
1.755MetAsn: 1.755 ± 0.726
0.878MetPro: 0.878 ± 0.547
0.878MetGln: 0.878 ± 0.47
1.755MetArg: 1.755 ± 0.706
0.293MetSer: 0.293 ± 0.298
1.755MetThr: 1.755 ± 0.797
1.17MetVal: 1.17 ± 0.578
0.585MetTrp: 0.585 ± 0.419
0.585MetTyr: 0.585 ± 0.53
0.0MetXaa: 0.0 ± 0.0
Asn
3.511AsnAla: 3.511 ± 1.008
0.0AsnCys: 0.0 ± 0.0
3.218AsnAsp: 3.218 ± 0.861
7.899AsnGlu: 7.899 ± 1.629
2.341AsnPhe: 2.341 ± 0.839
5.266AsnGly: 5.266 ± 0.857
1.463AsnHis: 1.463 ± 0.578
3.218AsnIle: 3.218 ± 1.114
7.022AsnLys: 7.022 ± 1.727
6.437AsnLeu: 6.437 ± 1.734
0.293AsnMet: 0.293 ± 0.257
3.803AsnAsn: 3.803 ± 1.175
1.463AsnPro: 1.463 ± 0.46
2.633AsnGln: 2.633 ± 0.841
2.633AsnArg: 2.633 ± 0.891
2.633AsnSer: 2.633 ± 0.74
2.926AsnThr: 2.926 ± 0.9
3.511AsnVal: 3.511 ± 1.094
0.0AsnTrp: 0.0 ± 0.0
4.389AsnTyr: 4.389 ± 0.699
0.0AsnXaa: 0.0 ± 0.0
Pro
0.878ProAla: 0.878 ± 0.538
0.0ProCys: 0.0 ± 0.0
2.048ProAsp: 2.048 ± 0.566
1.463ProGlu: 1.463 ± 0.658
1.463ProPhe: 1.463 ± 0.483
0.0ProGly: 0.0 ± 0.0
0.585ProHis: 0.585 ± 0.354
2.048ProIle: 2.048 ± 0.614
1.755ProLys: 1.755 ± 0.604
2.341ProLeu: 2.341 ± 0.878
0.293ProMet: 0.293 ± 0.257
2.633ProAsn: 2.633 ± 0.749
0.878ProPro: 0.878 ± 0.413
0.293ProGln: 0.293 ± 0.228
2.048ProArg: 2.048 ± 0.598
0.585ProSer: 0.585 ± 0.365
1.755ProThr: 1.755 ± 0.612
0.878ProVal: 0.878 ± 0.495
0.0ProTrp: 0.0 ± 0.0
0.878ProTyr: 0.878 ± 0.489
0.0ProXaa: 0.0 ± 0.0
Gln
3.803GlnAla: 3.803 ± 1.038
0.0GlnCys: 0.0 ± 0.0
1.755GlnAsp: 1.755 ± 0.779
3.511GlnGlu: 3.511 ± 0.89
0.585GlnPhe: 0.585 ± 0.456
2.341GlnGly: 2.341 ± 0.78
0.293GlnHis: 0.293 ± 0.282
3.218GlnIle: 3.218 ± 1.066
3.803GlnLys: 3.803 ± 0.813
4.096GlnLeu: 4.096 ± 1.265
0.878GlnMet: 0.878 ± 0.584
3.511GlnAsn: 3.511 ± 0.976
0.878GlnPro: 0.878 ± 0.495
1.755GlnGln: 1.755 ± 0.763
1.17GlnArg: 1.17 ± 0.503
2.048GlnSer: 2.048 ± 0.805
2.341GlnThr: 2.341 ± 1.167
2.926GlnVal: 2.926 ± 0.816
0.585GlnTrp: 0.585 ± 0.375
2.341GlnTyr: 2.341 ± 0.686
0.0GlnXaa: 0.0 ± 0.0
Arg
2.926ArgAla: 2.926 ± 0.867
0.293ArgCys: 0.293 ± 0.298
2.341ArgAsp: 2.341 ± 0.785
4.974ArgGlu: 4.974 ± 1.287
1.755ArgPhe: 1.755 ± 0.733
0.585ArgGly: 0.585 ± 0.332
1.755ArgHis: 1.755 ± 0.504
4.681ArgIle: 4.681 ± 1.24
5.266ArgLys: 5.266 ± 1.227
4.974ArgLeu: 4.974 ± 0.967
0.878ArgMet: 0.878 ± 0.634
3.511ArgAsn: 3.511 ± 0.853
0.878ArgPro: 0.878 ± 0.491
3.218ArgGln: 3.218 ± 0.947
2.341ArgArg: 2.341 ± 0.729
0.878ArgSer: 0.878 ± 0.454
3.511ArgThr: 3.511 ± 0.994
2.048ArgVal: 2.048 ± 0.774
1.463ArgTrp: 1.463 ± 0.583
2.341ArgTyr: 2.341 ± 1.181
0.0ArgXaa: 0.0 ± 0.0
Ser
2.926SerAla: 2.926 ± 0.83
0.585SerCys: 0.585 ± 0.34
4.389SerAsp: 4.389 ± 0.963
4.681SerGlu: 4.681 ± 0.701
1.17SerPhe: 1.17 ± 0.548
2.926SerGly: 2.926 ± 0.941
0.878SerHis: 0.878 ± 0.367
5.851SerIle: 5.851 ± 1.136
5.266SerLys: 5.266 ± 1.338
4.096SerLeu: 4.096 ± 1.249
1.755SerMet: 1.755 ± 0.633
2.341SerAsn: 2.341 ± 0.72
1.755SerPro: 1.755 ± 0.658
2.048SerGln: 2.048 ± 0.803
2.633SerArg: 2.633 ± 0.682
2.341SerSer: 2.341 ± 1.042
3.218SerThr: 3.218 ± 1.274
3.218SerVal: 3.218 ± 0.919
1.17SerTrp: 1.17 ± 0.736
3.218SerTyr: 3.218 ± 0.877
0.0SerXaa: 0.0 ± 0.0
Thr
1.755ThrAla: 1.755 ± 0.83
0.0ThrCys: 0.0 ± 0.0
3.511ThrAsp: 3.511 ± 0.913
3.511ThrGlu: 3.511 ± 0.903
2.048ThrPhe: 2.048 ± 0.757
2.633ThrGly: 2.633 ± 0.575
1.463ThrHis: 1.463 ± 0.708
4.096ThrIle: 4.096 ± 0.985
2.633ThrLys: 2.633 ± 0.903
4.389ThrLeu: 4.389 ± 0.874
1.463ThrMet: 1.463 ± 0.59
4.681ThrAsn: 4.681 ± 1.089
1.755ThrPro: 1.755 ± 0.605
1.17ThrGln: 1.17 ± 0.478
3.511ThrArg: 3.511 ± 0.765
2.341ThrSer: 2.341 ± 0.963
1.463ThrThr: 1.463 ± 0.698
4.096ThrVal: 4.096 ± 1.423
0.0ThrTrp: 0.0 ± 0.0
3.511ThrTyr: 3.511 ± 0.807
0.0ThrXaa: 0.0 ± 0.0
Val
2.633ValAla: 2.633 ± 0.821
0.585ValCys: 0.585 ± 0.38
3.218ValAsp: 3.218 ± 0.973
2.926ValGlu: 2.926 ± 0.817
1.755ValPhe: 1.755 ± 0.615
2.341ValGly: 2.341 ± 0.762
0.878ValHis: 0.878 ± 0.372
4.096ValIle: 4.096 ± 1.135
6.729ValLys: 6.729 ± 1.507
7.314ValLeu: 7.314 ± 0.896
1.17ValMet: 1.17 ± 0.812
3.218ValAsn: 3.218 ± 0.796
2.341ValPro: 2.341 ± 1.034
2.341ValGln: 2.341 ± 0.545
2.048ValArg: 2.048 ± 0.632
3.803ValSer: 3.803 ± 1.109
2.633ValThr: 2.633 ± 0.812
2.633ValVal: 2.633 ± 0.978
0.585ValTrp: 0.585 ± 0.325
0.878ValTyr: 0.878 ± 0.623
0.0ValXaa: 0.0 ± 0.0
Trp
0.585TrpAla: 0.585 ± 0.458
0.293TrpCys: 0.293 ± 0.298
0.585TrpAsp: 0.585 ± 0.338
0.878TrpGlu: 0.878 ± 0.461
0.585TrpPhe: 0.585 ± 0.37
0.585TrpGly: 0.585 ± 0.38
0.0TrpHis: 0.0 ± 0.0
0.585TrpIle: 0.585 ± 0.369
0.293TrpLys: 0.293 ± 0.268
0.878TrpLeu: 0.878 ± 0.56
0.585TrpMet: 0.585 ± 0.53
0.585TrpAsn: 0.585 ± 0.369
0.293TrpPro: 0.293 ± 0.313
0.0TrpGln: 0.0 ± 0.0
0.878TrpArg: 0.878 ± 0.382
0.878TrpSer: 0.878 ± 0.367
0.293TrpThr: 0.293 ± 0.265
0.0TrpVal: 0.0 ± 0.0
0.293TrpTrp: 0.293 ± 0.282
1.17TrpTyr: 1.17 ± 0.437
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.755TyrAla: 1.755 ± 0.714
0.585TyrCys: 0.585 ± 0.389
3.218TyrAsp: 3.218 ± 0.54
0.878TyrGlu: 0.878 ± 0.47
1.463TyrPhe: 1.463 ± 0.529
3.218TyrGly: 3.218 ± 0.831
0.293TyrHis: 0.293 ± 0.265
2.048TyrIle: 2.048 ± 0.607
6.729TyrLys: 6.729 ± 1.663
5.559TyrLeu: 5.559 ± 1.334
1.463TyrMet: 1.463 ± 0.523
2.341TyrAsn: 2.341 ± 0.874
0.878TyrPro: 0.878 ± 0.387
0.878TyrGln: 0.878 ± 0.443
4.389TyrArg: 4.389 ± 1.139
3.803TyrSer: 3.803 ± 1.128
0.878TyrThr: 0.878 ± 0.508
1.463TyrVal: 1.463 ± 0.779
0.585TyrTrp: 0.585 ± 0.369
1.755TyrTyr: 1.755 ± 0.589
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 18 proteins (3419 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski