Amino acid dipepetide frequency for Streptococcus satellite phage Javan304

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.349AlaAla: 0.349 ± 0.312
0.349AlaCys: 0.349 ± 0.294
4.193AlaAsp: 4.193 ± 0.934
4.193AlaGlu: 4.193 ± 1.26
4.193AlaPhe: 4.193 ± 1.155
3.145AlaGly: 3.145 ± 0.716
0.699AlaHis: 0.699 ± 0.541
6.988AlaIle: 6.988 ± 1.514
3.843AlaLys: 3.843 ± 1.315
5.59AlaLeu: 5.59 ± 1.723
1.747AlaMet: 1.747 ± 0.728
2.096AlaAsn: 2.096 ± 0.816
1.048AlaPro: 1.048 ± 0.671
3.494AlaGln: 3.494 ± 1.074
1.398AlaArg: 1.398 ± 0.551
4.892AlaSer: 4.892 ± 1.618
2.096AlaThr: 2.096 ± 0.519
3.494AlaVal: 3.494 ± 0.864
0.0AlaTrp: 0.0 ± 0.0
3.145AlaTyr: 3.145 ± 0.839
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.349CysGlu: 0.349 ± 0.359
0.0CysPhe: 0.0 ± 0.0
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.0CysLys: 0.0 ± 0.0
0.699CysLeu: 0.699 ± 0.45
0.0CysMet: 0.0 ± 0.0
0.699CysAsn: 0.699 ± 0.449
0.0CysPro: 0.0 ± 0.0
1.398CysGln: 1.398 ± 0.908
0.349CysArg: 0.349 ± 0.294
0.0CysSer: 0.0 ± 0.0
0.0CysThr: 0.0 ± 0.0
0.349CysVal: 0.349 ± 0.295
0.0CysTrp: 0.0 ± 0.0
0.349CysTyr: 0.349 ± 0.284
0.0CysXaa: 0.0 ± 0.0
Asp
1.398AspAla: 1.398 ± 0.598
0.699AspCys: 0.699 ± 0.382
3.145AspAsp: 3.145 ± 0.652
5.241AspGlu: 5.241 ± 1.558
2.446AspPhe: 2.446 ± 0.666
2.096AspGly: 2.096 ± 0.929
1.048AspHis: 1.048 ± 0.676
5.94AspIle: 5.94 ± 1.32
8.036AspLys: 8.036 ± 2.158
6.988AspLeu: 6.988 ± 1.351
0.699AspMet: 0.699 ± 0.53
4.193AspAsn: 4.193 ± 0.925
0.349AspPro: 0.349 ± 0.295
1.398AspGln: 1.398 ± 0.48
3.145AspArg: 3.145 ± 0.843
3.494AspSer: 3.494 ± 0.937
2.446AspThr: 2.446 ± 0.872
3.843AspVal: 3.843 ± 0.974
0.0AspTrp: 0.0 ± 0.0
2.795AspTyr: 2.795 ± 0.687
0.0AspXaa: 0.0 ± 0.0
Glu
4.193GluAla: 4.193 ± 1.103
0.349GluCys: 0.349 ± 0.301
4.892GluAsp: 4.892 ± 1.548
4.892GluGlu: 4.892 ± 1.335
2.795GluPhe: 2.795 ± 0.934
2.446GluGly: 2.446 ± 1.059
1.747GluHis: 1.747 ± 0.751
7.338GluIle: 7.338 ± 1.725
9.085GluLys: 9.085 ± 2.222
12.579GluLeu: 12.579 ± 2.35
1.747GluMet: 1.747 ± 0.829
7.338GluAsn: 7.338 ± 1.655
2.096GluPro: 2.096 ± 0.952
2.096GluGln: 2.096 ± 0.665
3.494GluArg: 3.494 ± 1.068
3.494GluSer: 3.494 ± 0.965
3.843GluThr: 3.843 ± 0.761
5.241GluVal: 5.241 ± 1.597
1.398GluTrp: 1.398 ± 0.448
5.59GluTyr: 5.59 ± 1.424
0.0GluXaa: 0.0 ± 0.0
Phe
1.048PheAla: 1.048 ± 0.687
0.0PheCys: 0.0 ± 0.0
3.145PheAsp: 3.145 ± 1.155
3.145PheGlu: 3.145 ± 1.078
1.398PhePhe: 1.398 ± 0.726
2.795PheGly: 2.795 ± 0.961
1.747PheHis: 1.747 ± 0.557
1.398PheIle: 1.398 ± 0.7
3.843PheLys: 3.843 ± 1.305
4.892PheLeu: 4.892 ± 1.129
1.048PheMet: 1.048 ± 0.674
3.494PheAsn: 3.494 ± 1.111
0.699PhePro: 0.699 ± 0.352
0.699PheGln: 0.699 ± 0.391
2.446PheArg: 2.446 ± 0.847
5.241PheSer: 5.241 ± 0.94
2.795PheThr: 2.795 ± 0.795
2.795PheVal: 2.795 ± 1.042
0.349PheTrp: 0.349 ± 0.301
1.398PheTyr: 1.398 ± 0.711
0.0PheXaa: 0.0 ± 0.0
Gly
2.446GlyAla: 2.446 ± 0.746
0.699GlyCys: 0.699 ± 0.431
3.145GlyAsp: 3.145 ± 0.833
2.096GlyGlu: 2.096 ± 0.812
2.446GlyPhe: 2.446 ± 0.613
0.699GlyGly: 0.699 ± 0.395
0.699GlyHis: 0.699 ± 0.472
2.096GlyIle: 2.096 ± 0.923
5.241GlyLys: 5.241 ± 1.823
5.94GlyLeu: 5.94 ± 1.509
1.398GlyMet: 1.398 ± 0.583
1.048GlyAsn: 1.048 ± 0.42
0.0GlyPro: 0.0 ± 0.0
1.747GlyGln: 1.747 ± 0.806
2.096GlyArg: 2.096 ± 0.626
1.747GlySer: 1.747 ± 0.738
2.096GlyThr: 2.096 ± 0.851
3.494GlyVal: 3.494 ± 1.21
1.048GlyTrp: 1.048 ± 0.718
3.145GlyTyr: 3.145 ± 0.807
0.0GlyXaa: 0.0 ± 0.0
His
0.699HisAla: 0.699 ± 0.462
0.0HisCys: 0.0 ± 0.0
1.048HisAsp: 1.048 ± 0.496
0.349HisGlu: 0.349 ± 0.301
1.398HisPhe: 1.398 ± 0.607
1.048HisGly: 1.048 ± 0.566
0.349HisHis: 0.349 ± 0.295
0.699HisIle: 0.699 ± 0.352
1.398HisLys: 1.398 ± 0.654
2.096HisLeu: 2.096 ± 0.781
0.0HisMet: 0.0 ± 0.0
2.096HisAsn: 2.096 ± 0.732
1.747HisPro: 1.747 ± 0.544
0.699HisGln: 0.699 ± 0.478
0.349HisArg: 0.349 ± 0.362
1.048HisSer: 1.048 ± 0.533
1.747HisThr: 1.747 ± 0.969
0.0HisVal: 0.0 ± 0.0
0.0HisTrp: 0.0 ± 0.0
0.349HisTyr: 0.349 ± 0.37
0.0HisXaa: 0.0 ± 0.0
Ile
3.843IleAla: 3.843 ± 1.099
0.699IleCys: 0.699 ± 0.555
4.892IleAsp: 4.892 ± 1.057
6.289IleGlu: 6.289 ± 1.369
3.494IlePhe: 3.494 ± 1.009
2.446IleGly: 2.446 ± 1.027
0.699IleHis: 0.699 ± 0.41
5.241IleIle: 5.241 ± 1.343
7.687IleLys: 7.687 ± 1.568
4.892IleLeu: 4.892 ± 1.195
1.398IleMet: 1.398 ± 0.55
3.843IleAsn: 3.843 ± 1.073
2.096IlePro: 2.096 ± 0.762
4.542IleGln: 4.542 ± 0.996
2.096IleArg: 2.096 ± 0.944
5.59IleSer: 5.59 ± 1.144
5.59IleThr: 5.59 ± 1.051
3.145IleVal: 3.145 ± 0.86
0.0IleTrp: 0.0 ± 0.0
2.446IleTyr: 2.446 ± 1.327
0.0IleXaa: 0.0 ± 0.0
Lys
8.386LysAla: 8.386 ± 1.194
0.349LysCys: 0.349 ± 0.295
6.289LysAsp: 6.289 ± 0.814
10.482LysGlu: 10.482 ± 1.584
0.699LysPhe: 0.699 ± 0.52
5.94LysGly: 5.94 ± 1.113
1.747LysHis: 1.747 ± 0.649
7.687LysIle: 7.687 ± 1.898
7.687LysLys: 7.687 ± 1.349
9.434LysLeu: 9.434 ± 1.823
2.096LysMet: 2.096 ± 0.52
5.241LysAsn: 5.241 ± 1.061
4.542LysPro: 4.542 ± 1.548
4.193LysGln: 4.193 ± 1.138
4.542LysArg: 4.542 ± 0.722
6.639LysSer: 6.639 ± 1.105
6.289LysThr: 6.289 ± 1.31
4.542LysVal: 4.542 ± 0.886
0.699LysTrp: 0.699 ± 0.308
2.795LysTyr: 2.795 ± 0.849
0.0LysXaa: 0.0 ± 0.0
Leu
7.687LeuAla: 7.687 ± 1.829
0.349LeuCys: 0.349 ± 0.295
7.687LeuAsp: 7.687 ± 2.071
8.735LeuGlu: 8.735 ± 2.064
4.193LeuPhe: 4.193 ± 1.557
5.241LeuGly: 5.241 ± 0.877
2.446LeuHis: 2.446 ± 0.577
6.639LeuIle: 6.639 ± 1.263
11.181LeuLys: 11.181 ± 1.659
11.181LeuLeu: 11.181 ± 2.809
1.747LeuMet: 1.747 ± 0.544
4.892LeuAsn: 4.892 ± 1.216
2.446LeuPro: 2.446 ± 1.076
3.843LeuGln: 3.843 ± 0.891
4.892LeuArg: 4.892 ± 1.1
4.193LeuSer: 4.193 ± 1.62
5.59LeuThr: 5.59 ± 1.339
3.843LeuVal: 3.843 ± 0.969
0.699LeuTrp: 0.699 ± 0.42
4.542LeuTyr: 4.542 ± 1.148
0.0LeuXaa: 0.0 ± 0.0
Met
2.096MetAla: 2.096 ± 0.994
0.0MetCys: 0.0 ± 0.0
1.747MetAsp: 1.747 ± 0.655
1.398MetGlu: 1.398 ± 0.65
1.048MetPhe: 1.048 ± 0.56
0.349MetGly: 0.349 ± 0.378
0.349MetHis: 0.349 ± 0.369
0.349MetIle: 0.349 ± 0.284
1.048MetLys: 1.048 ± 0.44
1.048MetLeu: 1.048 ± 0.507
0.699MetMet: 0.699 ± 0.479
2.795MetAsn: 2.795 ± 0.691
0.699MetPro: 0.699 ± 0.443
0.349MetGln: 0.349 ± 0.346
1.398MetArg: 1.398 ± 0.708
1.048MetSer: 1.048 ± 0.556
2.446MetThr: 2.446 ± 0.837
0.699MetVal: 0.699 ± 0.454
0.0MetTrp: 0.0 ± 0.0
0.349MetTyr: 0.349 ± 0.303
0.0MetXaa: 0.0 ± 0.0
Asn
4.542AsnAla: 4.542 ± 1.75
0.0AsnCys: 0.0 ± 0.0
3.494AsnAsp: 3.494 ± 0.777
5.241AsnGlu: 5.241 ± 1.641
2.096AsnPhe: 2.096 ± 0.841
4.542AsnGly: 4.542 ± 1.194
1.398AsnHis: 1.398 ± 0.63
3.494AsnIle: 3.494 ± 0.924
6.988AsnLys: 6.988 ± 1.9
6.988AsnLeu: 6.988 ± 0.956
1.398AsnMet: 1.398 ± 0.543
3.494AsnAsn: 3.494 ± 0.7
1.747AsnPro: 1.747 ± 0.638
1.398AsnGln: 1.398 ± 0.475
3.494AsnArg: 3.494 ± 0.832
2.795AsnSer: 2.795 ± 0.777
3.494AsnThr: 3.494 ± 1.056
3.494AsnVal: 3.494 ± 0.967
1.398AsnTrp: 1.398 ± 0.732
2.795AsnTyr: 2.795 ± 1.423
0.0AsnXaa: 0.0 ± 0.0
Pro
2.096ProAla: 2.096 ± 0.766
0.0ProCys: 0.0 ± 0.0
2.446ProAsp: 2.446 ± 0.784
3.145ProGlu: 3.145 ± 1.365
1.048ProPhe: 1.048 ± 0.64
0.0ProGly: 0.0 ± 0.0
0.0ProHis: 0.0 ± 0.0
2.096ProIle: 2.096 ± 0.644
2.096ProLys: 2.096 ± 0.624
2.446ProLeu: 2.446 ± 1.133
0.349ProMet: 0.349 ± 0.309
1.747ProAsn: 1.747 ± 0.768
0.0ProPro: 0.0 ± 0.0
0.349ProGln: 0.349 ± 0.301
1.398ProArg: 1.398 ± 0.581
1.048ProSer: 1.048 ± 0.484
2.446ProThr: 2.446 ± 1.033
1.398ProVal: 1.398 ± 0.772
0.0ProTrp: 0.0 ± 0.0
1.048ProTyr: 1.048 ± 0.649
0.0ProXaa: 0.0 ± 0.0
Gln
2.795GlnAla: 2.795 ± 1.175
0.349GlnCys: 0.349 ± 0.295
2.096GlnAsp: 2.096 ± 0.704
5.241GlnGlu: 5.241 ± 1.292
1.747GlnPhe: 1.747 ± 0.547
1.747GlnGly: 1.747 ± 0.692
0.349GlnHis: 0.349 ± 0.295
3.145GlnIle: 3.145 ± 0.775
4.193GlnLys: 4.193 ± 0.876
3.494GlnLeu: 3.494 ± 0.863
1.048GlnMet: 1.048 ± 0.605
2.446GlnAsn: 2.446 ± 0.843
0.0GlnPro: 0.0 ± 0.0
4.892GlnGln: 4.892 ± 1.226
2.795GlnArg: 2.795 ± 0.825
2.096GlnSer: 2.096 ± 0.824
2.096GlnThr: 2.096 ± 0.78
2.096GlnVal: 2.096 ± 1.016
0.0GlnTrp: 0.0 ± 0.0
2.096GlnTyr: 2.096 ± 1.081
0.0GlnXaa: 0.0 ± 0.0
Arg
3.145ArgAla: 3.145 ± 1.336
0.0ArgCys: 0.0 ± 0.0
1.747ArgAsp: 1.747 ± 0.573
5.94ArgGlu: 5.94 ± 1.754
2.795ArgPhe: 2.795 ± 1.355
1.747ArgGly: 1.747 ± 0.958
0.699ArgHis: 0.699 ± 0.308
2.795ArgIle: 2.795 ± 1.004
5.59ArgLys: 5.59 ± 0.789
5.241ArgLeu: 5.241 ± 0.898
1.398ArgMet: 1.398 ± 0.553
3.145ArgAsn: 3.145 ± 1.131
1.747ArgPro: 1.747 ± 1.504
2.795ArgGln: 2.795 ± 0.624
0.349ArgArg: 0.349 ± 0.301
1.048ArgSer: 1.048 ± 0.551
2.096ArgThr: 2.096 ± 0.865
1.398ArgVal: 1.398 ± 0.54
0.349ArgTrp: 0.349 ± 0.328
3.145ArgTyr: 3.145 ± 0.806
0.0ArgXaa: 0.0 ± 0.0
Ser
2.446SerAla: 2.446 ± 1.055
0.0SerCys: 0.0 ± 0.0
1.747SerAsp: 1.747 ± 0.524
4.892SerGlu: 4.892 ± 1.481
4.193SerPhe: 4.193 ± 1.228
3.145SerGly: 3.145 ± 1.235
0.349SerHis: 0.349 ± 0.328
4.193SerIle: 4.193 ± 1.285
4.542SerLys: 4.542 ± 0.975
3.843SerLeu: 3.843 ± 1.085
1.398SerMet: 1.398 ± 0.676
5.59SerAsn: 5.59 ± 1.133
3.145SerPro: 3.145 ± 0.678
2.795SerGln: 2.795 ± 0.905
2.446SerArg: 2.446 ± 1.136
3.145SerSer: 3.145 ± 0.773
3.145SerThr: 3.145 ± 0.824
3.145SerVal: 3.145 ± 0.903
0.349SerTrp: 0.349 ± 0.294
1.398SerTyr: 1.398 ± 0.502
0.0SerXaa: 0.0 ± 0.0
Thr
3.494ThrAla: 3.494 ± 0.696
0.0ThrCys: 0.0 ± 0.0
1.747ThrAsp: 1.747 ± 0.815
5.94ThrGlu: 5.94 ± 1.284
2.446ThrPhe: 2.446 ± 0.684
3.145ThrGly: 3.145 ± 1.029
0.699ThrHis: 0.699 ± 0.371
3.494ThrIle: 3.494 ± 0.761
4.892ThrLys: 4.892 ± 1.184
5.59ThrLeu: 5.59 ± 1.037
0.699ThrMet: 0.699 ± 0.431
2.795ThrAsn: 2.795 ± 1.007
1.747ThrPro: 1.747 ± 0.575
4.193ThrGln: 4.193 ± 1.277
2.795ThrArg: 2.795 ± 0.997
2.446ThrSer: 2.446 ± 0.781
3.843ThrThr: 3.843 ± 1.424
3.843ThrVal: 3.843 ± 1.242
0.349ThrTrp: 0.349 ± 0.378
1.747ThrTyr: 1.747 ± 0.82
0.0ThrXaa: 0.0 ± 0.0
Val
3.494ValAla: 3.494 ± 1.021
0.0ValCys: 0.0 ± 0.0
3.494ValAsp: 3.494 ± 1.521
4.193ValGlu: 4.193 ± 1.434
3.145ValPhe: 3.145 ± 0.956
1.398ValGly: 1.398 ± 0.559
0.699ValHis: 0.699 ± 0.472
4.193ValIle: 4.193 ± 1.174
5.59ValLys: 5.59 ± 1.171
3.843ValLeu: 3.843 ± 1.075
0.349ValMet: 0.349 ± 0.334
3.843ValAsn: 3.843 ± 0.688
0.699ValPro: 0.699 ± 0.427
2.096ValGln: 2.096 ± 0.757
3.145ValArg: 3.145 ± 0.833
2.096ValSer: 2.096 ± 0.9
2.446ValThr: 2.446 ± 1.026
1.398ValVal: 1.398 ± 0.743
1.398ValTrp: 1.398 ± 0.44
2.446ValTyr: 2.446 ± 1.004
0.0ValXaa: 0.0 ± 0.0
Trp
0.699TrpAla: 0.699 ± 0.587
0.0TrpCys: 0.0 ± 0.0
0.699TrpAsp: 0.699 ± 0.454
0.699TrpGlu: 0.699 ± 0.459
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.349TrpHis: 0.349 ± 0.294
1.048TrpIle: 1.048 ± 0.473
1.747TrpLys: 1.747 ± 0.539
1.048TrpLeu: 1.048 ± 0.63
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.699TrpGln: 0.699 ± 0.539
0.699TrpArg: 0.699 ± 0.528
0.699TrpSer: 0.699 ± 0.479
0.0TrpThr: 0.0 ± 0.0
0.349TrpVal: 0.349 ± 0.301
0.349TrpTrp: 0.349 ± 0.294
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.096TyrAla: 2.096 ± 0.768
0.349TyrCys: 0.349 ± 0.295
1.747TyrAsp: 1.747 ± 0.784
4.193TyrGlu: 4.193 ± 0.674
2.795TyrPhe: 2.795 ± 1.002
1.398TyrGly: 1.398 ± 0.554
1.048TyrHis: 1.048 ± 0.611
2.096TyrIle: 2.096 ± 0.776
5.94TyrLys: 5.94 ± 1.651
4.193TyrLeu: 4.193 ± 1.267
0.349TyrMet: 0.349 ± 0.359
3.494TyrAsn: 3.494 ± 1.015
0.349TyrPro: 0.349 ± 0.301
1.048TyrGln: 1.048 ± 0.512
3.843TyrArg: 3.843 ± 1.174
3.145TyrSer: 3.145 ± 0.922
1.398TyrThr: 1.398 ± 0.429
1.398TyrVal: 1.398 ± 0.596
0.699TyrTrp: 0.699 ± 0.422
1.048TyrTyr: 1.048 ± 0.53
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 18 proteins (2863 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski