Amino acid dipeptide frequency for Streptococcus satellite phage Javan204

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred uniformly at random in proteins, each of the 400 would be present at a frequency of 0.25%. This is not the case in nature, so for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
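The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own code: the function names `dipeptide_per_mille` and `classify` are hypothetical, and it assumes that pairs are counted as overlapping windows within each protein, that any non-standard letter is treated as Xaa, and that frequencies are normalized by the total number of pairs. The normalization actually used for the table is not stated on this page.

```python
from collections import Counter
from itertools import product

# Three-letter codes in the table's residue order; "X" stands for Xaa.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe", "G": "Gly",
    "H": "His", "I": "Ile", "K": "Lys", "L": "Leu", "M": "Met", "N": "Asn",
    "P": "Pro", "Q": "Gln", "R": "Arg", "S": "Ser", "T": "Thr", "V": "Val",
    "W": "Trp", "Y": "Tyr", "X": "Xaa",
}
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
# Uniform expectation over 400 dipeptides: 0.25% = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / len(STANDARD) ** 2


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides."""
    counts = Counter()
    for seq in proteins:
        # Assumption: every non-standard letter is mapped to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs; pairs never span two proteins (an assumption).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    freqs = {}
    for a, b in product(STANDARD + "X", repeat=2):
        name = ONE_TO_THREE[a] + ONE_TO_THREE[b]
        freqs[name] = 1000.0 * counts[a + b] / total if total else 0.0
    return freqs


def classify(freq_per_mille):
    """Label a frequency relative to the uniform 2.5 per mille expectation."""
    if freq_per_mille > UNIFORM_PER_MILLE:
        return "over"
    if freq_per_mille < UNIFORM_PER_MILLE:
        return "under"
    return "expected"
```

With these definitions, a table value such as GluLeu at 14.279‰ would be classified as overrepresented and AlaCys at 0.461‰ as underrepresented, matching the red and blue marking described above.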

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions. In these units the uniform expectation of 0.25% corresponds to 2.5‰.

For more information, see the sequence space article on Wikipedia.

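Dipeptides are grouped by their first residue; within each group the second residue runs in the order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa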
Ala
AlaAla: 0.0 ± 0.0
AlaCys: 0.461 ± 0.352
AlaAsp: 5.988 ± 1.95
AlaGlu: 3.224 ± 1.663
AlaPhe: 3.224 ± 1.006
AlaGly: 1.382 ± 1.163
AlaHis: 0.0 ± 0.0
AlaIle: 2.764 ± 1.356
AlaLys: 3.685 ± 1.09
AlaLeu: 4.606 ± 1.207
AlaMet: 1.842 ± 1.098
AlaAsn: 3.685 ± 1.116
AlaPro: 1.382 ± 0.737
AlaGln: 0.921 ± 0.734
AlaArg: 2.303 ± 1.032
AlaSer: 3.685 ± 1.361
AlaThr: 2.764 ± 1.452
AlaVal: 3.224 ± 1.287
AlaTrp: 0.0 ± 0.0
AlaTyr: 3.685 ± 1.422
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.461 ± 0.463
CysCys: 0.0 ± 0.0
CysAsp: 0.461 ± 0.498
CysGlu: 0.921 ± 1.139
CysPhe: 0.461 ± 0.388
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 0.921 ± 0.512
CysLys: 0.0 ± 0.0
CysLeu: 0.0 ± 0.0
CysMet: 1.382 ± 0.538
CysAsn: 0.921 ± 0.704
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 0.461 ± 0.352
CysThr: 0.461 ± 0.352
CysVal: 0.461 ± 0.632
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.842 ± 0.735
AspCys: 0.461 ± 0.388
AspAsp: 4.146 ± 1.069
AspGlu: 3.685 ± 1.212
AspPhe: 1.842 ± 1.103
AspGly: 0.461 ± 0.666
AspHis: 0.461 ± 0.587
AspIle: 4.606 ± 1.183
AspLys: 5.067 ± 1.344
AspLeu: 3.685 ± 1.235
AspMet: 3.685 ± 1.577
AspAsn: 4.606 ± 1.232
AspPro: 0.921 ± 0.614
AspGln: 0.0 ± 0.0
AspArg: 0.921 ± 0.557
AspSer: 3.224 ± 1.014
AspThr: 3.685 ± 1.255
AspVal: 4.146 ± 1.082
AspTrp: 0.461 ± 0.352
AspTyr: 4.146 ± 1.611
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.527 ± 1.675
GluCys: 0.921 ± 0.677
GluAsp: 2.303 ± 1.069
GluGlu: 6.909 ± 2.204
GluPhe: 4.146 ± 1.435
GluGly: 1.842 ± 0.903
GluHis: 2.303 ± 0.812
GluIle: 8.291 ± 1.763
GluLys: 6.909 ± 1.955
GluLeu: 14.279 ± 2.248
GluMet: 1.842 ± 1.449
GluAsn: 5.067 ± 2.097
GluPro: 3.685 ± 1.778
GluGln: 6.449 ± 2.215
GluArg: 3.685 ± 1.41
GluSer: 2.303 ± 1.023
GluThr: 4.606 ± 0.972
GluVal: 4.606 ± 2.116
GluTrp: 1.382 ± 0.548
GluTyr: 1.842 ± 1.025
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.0 ± 0.0
PheCys: 0.0 ± 0.0
PheAsp: 3.224 ± 1.208
PheGlu: 4.606 ± 1.315
PhePhe: 1.382 ± 0.9
PheGly: 2.303 ± 0.528
PheHis: 0.461 ± 0.388
PheIle: 1.842 ± 0.863
PheLys: 5.527 ± 1.315
PheLeu: 0.921 ± 0.512
PheMet: 0.461 ± 0.388
PheAsn: 3.685 ± 1.249
PhePro: 0.461 ± 0.463
PheGln: 0.921 ± 0.557
PheArg: 2.303 ± 0.774
PheSer: 5.067 ± 1.499
PheThr: 2.303 ± 0.783
PheVal: 2.303 ± 0.641
PheTrp: 0.0 ± 0.0
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 0.461 ± 0.587
GlyCys: 0.0 ± 0.0
GlyAsp: 2.303 ± 0.648
GlyGlu: 3.685 ± 0.822
GlyPhe: 3.685 ± 0.912
GlyGly: 1.382 ± 1.056
GlyHis: 1.382 ± 0.761
GlyIle: 2.764 ± 0.952
GlyLys: 5.988 ± 2.292
GlyLeu: 4.606 ± 1.261
GlyMet: 0.921 ± 0.909
GlyAsn: 4.146 ± 1.456
GlyPro: 0.461 ± 0.498
GlyGln: 0.461 ± 0.498
GlyArg: 1.842 ± 0.575
GlySer: 1.382 ± 0.747
GlyThr: 2.303 ± 0.738
GlyVal: 1.842 ± 1.025
GlyTrp: 0.921 ± 0.704
GlyTyr: 2.303 ± 0.911
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.842 ± 1.121
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 1.382 ± 0.742
HisPhe: 0.461 ± 0.666
HisGly: 0.921 ± 0.543
HisHis: 0.0 ± 0.0
HisIle: 0.461 ± 0.352
HisLys: 1.382 ± 0.965
HisLeu: 2.303 ± 1.126
HisMet: 0.0 ± 0.0
HisAsn: 0.461 ± 0.457
HisPro: 0.921 ± 0.557
HisGln: 0.461 ± 0.457
HisArg: 1.382 ± 0.543
HisSer: 1.382 ± 0.548
HisThr: 1.842 ± 1.017
HisVal: 0.461 ± 0.587
HisTrp: 0.0 ± 0.0
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.909 ± 1.845
IleCys: 0.0 ± 0.0
IleAsp: 4.146 ± 1.353
IleGlu: 5.527 ± 1.858
IlePhe: 1.842 ± 0.866
IleGly: 3.685 ± 1.423
IleHis: 0.0 ± 0.0
IleIle: 3.685 ± 0.862
IleLys: 9.212 ± 2.077
IleLeu: 5.067 ± 1.012
IleMet: 0.921 ± 0.717
IleAsn: 4.606 ± 1.125
IlePro: 1.842 ± 1.104
IleGln: 1.382 ± 0.958
IleArg: 1.842 ± 0.789
IleSer: 4.146 ± 1.418
IleThr: 3.685 ± 0.858
IleVal: 2.303 ± 0.952
IleTrp: 0.461 ± 0.352
IleTyr: 5.988 ± 1.177
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.988 ± 1.335
LysCys: 0.0 ± 0.0
LysAsp: 5.988 ± 1.086
LysGlu: 12.897 ± 1.306
LysPhe: 2.764 ± 1.273
LysGly: 3.685 ± 1.364
LysHis: 2.303 ± 0.747
LysIle: 4.146 ± 1.325
LysLys: 9.212 ± 2.455
LysLeu: 6.909 ± 2.362
LysMet: 2.303 ± 1.4
LysAsn: 5.988 ± 2.231
LysPro: 4.146 ± 1.376
LysGln: 6.449 ± 1.425
LysArg: 5.988 ± 1.449
LysSer: 5.527 ± 1.216
LysThr: 7.37 ± 1.636
LysVal: 6.449 ± 2.002
LysTrp: 0.0 ± 0.0
LysTyr: 3.685 ± 0.631
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.606 ± 1.895
LeuCys: 1.842 ± 1.061
LeuAsp: 4.606 ± 1.445
LeuGlu: 11.976 ± 3.441
LeuPhe: 1.842 ± 1.129
LeuGly: 4.606 ± 1.033
LeuHis: 0.921 ± 0.673
LeuIle: 5.988 ± 1.901
LeuLys: 10.134 ± 2.972
LeuLeu: 9.673 ± 1.964
LeuMet: 2.764 ± 1.052
LeuAsn: 5.527 ± 1.944
LeuPro: 5.527 ± 1.505
LeuGln: 2.764 ± 1.084
LeuArg: 5.067 ± 1.245
LeuSer: 5.988 ± 2.067
LeuThr: 6.449 ± 1.615
LeuVal: 6.909 ± 2.395
LeuTrp: 1.382 ± 0.664
LeuTyr: 4.146 ± 1.093
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.764 ± 1.349
MetCys: 0.0 ± 0.0
MetAsp: 0.461 ± 0.388
MetGlu: 2.303 ± 1.083
MetPhe: 0.461 ± 0.352
MetGly: 0.921 ± 0.622
MetHis: 0.0 ± 0.0
MetIle: 1.842 ± 1.039
MetLys: 1.382 ± 0.548
MetLeu: 4.606 ± 1.955
MetMet: 0.461 ± 0.457
MetAsn: 3.224 ± 1.023
MetPro: 0.0 ± 0.0
MetGln: 0.921 ± 0.631
MetArg: 0.461 ± 0.601
MetSer: 0.461 ± 0.463
MetThr: 2.764 ± 1.227
MetVal: 2.764 ± 1.389
MetTrp: 0.0 ± 0.0
MetTyr: 0.461 ± 0.463
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.764 ± 1.217
AsnCys: 0.461 ± 0.498
AsnAsp: 3.685 ± 1.041
AsnGlu: 6.449 ± 2.025
AsnPhe: 2.303 ± 0.716
AsnGly: 8.752 ± 1.294
AsnHis: 1.382 ± 0.664
AsnIle: 4.146 ± 0.898
AsnLys: 8.291 ± 1.82
AsnLeu: 5.067 ± 1.175
AsnMet: 1.382 ± 0.75
AsnAsn: 0.921 ± 0.512
AsnPro: 2.303 ± 1.312
AsnGln: 0.461 ± 0.601
AsnArg: 2.764 ± 1.501
AsnSer: 2.303 ± 1.068
AsnThr: 4.146 ± 1.818
AsnVal: 2.303 ± 1.613
AsnTrp: 0.921 ± 0.96
AsnTyr: 2.303 ± 0.865
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.382 ± 0.989
ProCys: 0.461 ± 0.57
ProAsp: 0.921 ± 0.704
ProGlu: 4.146 ± 1.655
ProPhe: 1.842 ± 1.05
ProGly: 0.0 ± 0.0
ProHis: 0.0 ± 0.0
ProIle: 1.842 ± 0.855
ProLys: 3.685 ± 1.078
ProLeu: 1.382 ± 0.747
ProMet: 0.921 ± 0.591
ProAsn: 2.764 ± 0.787
ProPro: 0.461 ± 0.352
ProGln: 1.382 ± 0.517
ProArg: 2.764 ± 1.562
ProSer: 2.764 ± 1.372
ProThr: 1.842 ± 0.724
ProVal: 3.224 ± 1.072
ProTrp: 0.0 ± 0.0
ProTyr: 0.921 ± 0.801
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.303 ± 0.926
GlnCys: 0.461 ± 0.388
GlnAsp: 1.382 ± 1.01
GlnGlu: 2.764 ± 1.675
GlnPhe: 0.921 ± 0.414
GlnGly: 0.461 ± 0.463
GlnHis: 1.842 ± 0.925
GlnIle: 2.764 ± 1.009
GlnLys: 3.685 ± 1.296
GlnLeu: 7.83 ± 1.306
GlnMet: 0.461 ± 0.601
GlnAsn: 0.461 ± 0.498
GlnPro: 0.921 ± 0.704
GlnGln: 2.303 ± 1.045
GlnArg: 1.382 ± 0.794
GlnSer: 1.382 ± 0.735
GlnThr: 1.842 ± 0.828
GlnVal: 1.842 ± 0.739
GlnTrp: 0.921 ± 0.557
GlnTyr: 0.921 ± 0.414
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.842 ± 0.863
ArgCys: 0.0 ± 0.0
ArgAsp: 1.382 ± 0.721
ArgGlu: 2.764 ± 0.97
ArgPhe: 2.303 ± 1.0
ArgGly: 1.842 ± 0.739
ArgHis: 1.382 ± 0.737
ArgIle: 5.067 ± 1.265
ArgLys: 4.606 ± 0.809
ArgLeu: 5.527 ± 1.177
ArgMet: 1.382 ± 0.817
ArgAsn: 0.921 ± 0.926
ArgPro: 1.382 ± 0.842
ArgGln: 2.764 ± 1.227
ArgArg: 2.764 ± 1.511
ArgSer: 1.382 ± 0.805
ArgThr: 3.224 ± 1.157
ArgVal: 1.842 ± 0.735
ArgTrp: 0.0 ± 0.0
ArgTyr: 3.224 ± 1.18
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 1.842 ± 0.705
SerCys: 0.0 ± 0.0
SerAsp: 3.685 ± 1.629
SerGlu: 5.988 ± 1.692
SerPhe: 2.764 ± 0.964
SerGly: 2.764 ± 1.002
SerHis: 0.461 ± 0.388
SerIle: 4.146 ± 1.429
SerLys: 6.449 ± 0.943
SerLeu: 5.527 ± 1.634
SerMet: 0.461 ± 0.558
SerAsn: 4.606 ± 1.012
SerPro: 2.303 ± 1.099
SerGln: 2.764 ± 0.973
SerArg: 1.842 ± 0.692
SerSer: 2.764 ± 0.634
SerThr: 3.685 ± 1.31
SerVal: 2.764 ± 0.634
SerTrp: 0.0 ± 0.0
SerTyr: 1.842 ± 0.992
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.224 ± 1.428
ThrCys: 0.921 ± 0.569
ThrAsp: 4.146 ± 1.328
ThrGlu: 3.224 ± 1.16
ThrPhe: 2.303 ± 0.848
ThrGly: 3.224 ± 1.269
ThrHis: 1.382 ± 0.926
ThrIle: 5.067 ± 1.575
ThrLys: 5.527 ± 1.794
ThrLeu: 8.291 ± 1.727
ThrMet: 1.842 ± 0.699
ThrAsn: 3.224 ± 1.439
ThrPro: 1.382 ± 0.83
ThrGln: 1.382 ± 0.805
ThrArg: 2.303 ± 0.959
ThrSer: 2.303 ± 0.848
ThrThr: 3.224 ± 0.891
ThrVal: 6.449 ± 2.029
ThrTrp: 1.382 ± 0.847
ThrTyr: 1.842 ± 0.921
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.224 ± 0.987
ValCys: 0.0 ± 0.0
ValAsp: 0.921 ± 0.966
ValGlu: 2.764 ± 1.097
ValPhe: 2.303 ± 1.056
ValGly: 2.764 ± 1.24
ValHis: 0.461 ± 0.463
ValIle: 3.685 ± 1.646
ValLys: 6.909 ± 1.998
ValLeu: 6.909 ± 2.772
ValMet: 2.303 ± 0.954
ValAsn: 5.527 ± 1.693
ValPro: 2.303 ± 0.89
ValGln: 2.303 ± 0.906
ValArg: 2.303 ± 1.415
ValSer: 5.527 ± 2.943
ValThr: 4.606 ± 1.287
ValVal: 2.303 ± 0.954
ValTrp: 0.921 ± 0.769
ValTyr: 2.764 ± 1.223
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.921 ± 0.769
TrpCys: 0.461 ± 0.352
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.461 ± 0.498
TrpPhe: 0.461 ± 0.352
TrpGly: 0.0 ± 0.0
TrpHis: 0.461 ± 0.352
TrpIle: 0.0 ± 0.0
TrpLys: 0.0 ± 0.0
TrpLeu: 1.842 ± 0.867
TrpMet: 0.0 ± 0.0
TrpAsn: 0.461 ± 0.457
TrpPro: 0.0 ± 0.0
TrpGln: 0.461 ± 0.352
TrpArg: 0.461 ± 0.352
TrpSer: 0.921 ± 0.591
TrpThr: 0.0 ± 0.0
TrpVal: 1.382 ± 0.736
TrpTrp: 0.461 ± 0.457
TrpTyr: 0.461 ± 0.352
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.921 ± 0.414
TyrCys: 0.461 ± 0.463
TyrAsp: 2.303 ± 1.46
TyrGlu: 3.224 ± 1.096
TyrPhe: 0.461 ± 0.498
TyrGly: 2.303 ± 1.538
TyrHis: 0.461 ± 0.388
TyrIle: 3.224 ± 0.971
TyrLys: 4.146 ± 1.578
TyrLeu: 3.685 ± 0.957
TyrMet: 0.461 ± 0.601
TyrAsn: 2.303 ± 1.1
TyrPro: 2.303 ± 0.881
TyrGln: 2.303 ± 0.824
TyrArg: 3.224 ± 1.001
TyrSer: 3.685 ± 1.29
TyrThr: 1.842 ± 0.741
TyrVal: 3.224 ± 1.39
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.921 ± 0.646
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 13 proteins (2172 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
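Protein-level bootstrapping of this kind can be sketched as follows. This is an illustration under stated assumptions rather than the database's actual procedure: the function names `pair_counts` and `bootstrap_error` are hypothetical, whole proteins are resampled with replacement, and the reported error is taken to be the standard deviation of the frequency over the resamples. The page does not say which spread statistic was used.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs within one sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Estimate the spread of one dipeptide frequency, in per mille.

    Each round draws len(proteins) whole proteins with replacement and
    recomputes the frequency; the population standard deviation over
    all rounds is returned (an assumed choice of error measure).
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)
```

Here `dipeptide` is given in one-letter form, for example `bootstrap_error(proteins, "EL")` for GluLeu. With only 13 proteins in this proteome, resampling at the protein level yields fairly wide errors, which is in line with the ± values listed above.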

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski