Amino acid dipeptide frequency for Streptococcus satellite phage Javan203

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
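As a rough illustration of what a dipeptide frequency is, the sketch below counts overlapping residue pairs in a list of protein sequences and reports them in per mille. The function name `dipeptide_frequencies`, the one-letter input sequences, and the choice to count pairs only within each protein (never across protein boundaries) are assumptions made for this example; the page does not describe the exact procedure used.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: pairs are counted with a sliding window of width 2 inside
    each protein and are not allowed to span two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Every position except the last one starts one dipeptide.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input in one-letter code (not taken from this proteome).
freqs = dipeptide_frequencies(["MKLLE", "MKE"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

The two toy sequences contain six dipeptides in total, two of which are MK, so MK accounts for one third of all pairs.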

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
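The expected value quoted above follows directly from the number of possible pairs. The sketch below reproduces that arithmetic and labels a few values from the table relative to the uniform expectation. The names `EXPECTED_PERMILLE` and `classify` are hypothetical, and the plain greater-than or less-than comparison is an assumption; the page does not state the exact rule behind the red and blue marking.

```python
N_STANDARD = 20  # standard amino acids; use 21 to include Xaa

# Uniform expectation: 1000 / 400 = 2.5 per mille, i.e. 0.25%.
EXPECTED_PERMILLE = 1000 / N_STANDARD ** 2


def classify(observed_permille, expected=EXPECTED_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"


# Values copied from the table on this page.
for name, value in [("LeuLys", 14.595), ("AlaGly", 2.516), ("AlaCys", 0.503)]:
    print(name, value, classify(value))
```

With 21 symbols the uniform expectation drops to 1000 / 441, roughly 2.27‰.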

All values are given in per mille (‰) and must therefore be multiplied by 10⁻³ to obtain fractions.
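A small worked conversion may help when comparing the per mille values with the 0.25% expectation. The helper names below are hypothetical and the input is the LeuLys value from the table.

```python
def permille_to_fraction(value):
    """Convert per mille to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert per mille to percent (divide by 10)."""
    return value / 10


leu_lys = 14.595  # LeuLys frequency from the table, in per mille
print(permille_to_fraction(leu_lys))
print(permille_to_percent(leu_lys))
```

So 14.595‰ corresponds to a fraction of about 0.0146, or about 1.46%.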

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.0 ± 0.0
AlaCys: 0.503 ± 0.481
AlaAsp: 4.026 ± 1.105
AlaGlu: 4.529 ± 1.219
AlaPhe: 1.51 ± 0.829
AlaGly: 2.516 ± 1.145
AlaHis: 0.503 ± 0.605
AlaIle: 5.536 ± 1.486
AlaLys: 3.523 ± 1.37
AlaLeu: 6.039 ± 1.562
AlaMet: 2.516 ± 1.226
AlaAsn: 2.013 ± 1.292
AlaPro: 0.503 ± 0.43
AlaGln: 2.013 ± 0.924
AlaArg: 2.516 ± 0.961
AlaSer: 2.516 ± 1.137
AlaThr: 5.033 ± 1.882
AlaVal: 3.02 ± 1.278
AlaTrp: 0.503 ± 0.512
AlaTyr: 4.026 ± 1.174
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.503 ± 0.43
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.503 ± 0.512
CysPhe: 0.0 ± 0.0
CysGly: 0.503 ± 0.462
CysHis: 0.503 ± 0.452
CysIle: 1.007 ± 0.569
CysLys: 0.503 ± 0.512
CysLeu: 1.007 ± 0.575
CysMet: 0.0 ± 0.0
CysAsn: 0.503 ± 0.481
CysPro: 1.007 ± 0.676
CysGln: 1.51 ± 1.539
CysArg: 1.007 ± 1.026
CysSer: 0.0 ± 0.0
CysThr: 0.503 ± 0.452
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 0.503 ± 0.622
AspCys: 2.013 ± 0.777
AspAsp: 2.516 ± 1.131
AspGlu: 2.013 ± 1.018
AspPhe: 3.523 ± 1.075
AspGly: 2.516 ± 1.098
AspHis: 0.0 ± 0.0
AspIle: 5.033 ± 1.171
AspLys: 8.052 ± 1.818
AspLeu: 5.536 ± 1.601
AspMet: 1.51 ± 0.808
AspAsn: 2.516 ± 1.202
AspPro: 1.51 ± 0.848
AspGln: 1.007 ± 0.594
AspArg: 1.51 ± 1.005
AspSer: 2.516 ± 1.839
AspThr: 1.51 ± 0.778
AspVal: 2.516 ± 0.783
AspTrp: 0.503 ± 0.598
AspTyr: 4.529 ± 1.073
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.543 ± 1.976
GluCys: 0.503 ± 0.43
GluAsp: 4.529 ± 2.543
GluGlu: 4.529 ± 2.025
GluPhe: 4.026 ± 1.24
GluGly: 2.516 ± 1.324
GluHis: 0.503 ± 0.462
GluIle: 9.059 ± 2.11
GluLys: 5.536 ± 1.634
GluLeu: 11.575 ± 1.582
GluMet: 2.516 ± 1.311
GluAsn: 7.046 ± 1.953
GluPro: 1.007 ± 0.596
GluGln: 3.02 ± 1.023
GluArg: 5.033 ± 1.351
GluSer: 4.529 ± 1.629
GluThr: 2.013 ± 0.797
GluVal: 4.026 ± 1.3
GluTrp: 0.503 ± 0.437
GluTyr: 1.51 ± 0.869
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.007 ± 0.7
PheCys: 0.503 ± 0.452
PheAsp: 3.523 ± 1.436
PheGlu: 3.523 ± 1.3
PhePhe: 3.523 ± 0.949
PheGly: 2.516 ± 0.838
PheHis: 1.51 ± 0.783
PheIle: 2.516 ± 0.842
PheLys: 1.51 ± 0.819
PheLeu: 5.033 ± 1.1
PheMet: 1.007 ± 0.705
PheAsn: 1.51 ± 1.027
PhePro: 0.503 ± 0.513
PheGln: 0.503 ± 0.51
PheArg: 1.51 ± 0.657
PheSer: 4.026 ± 1.227
PheThr: 3.523 ± 1.433
PheVal: 1.51 ± 0.933
PheTrp: 1.007 ± 0.641
PheTyr: 4.026 ± 2.073
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.026 ± 1.614
GlyCys: 0.0 ± 0.0
GlyAsp: 1.007 ± 0.598
GlyGlu: 3.02 ± 1.201
GlyPhe: 2.516 ± 1.159
GlyGly: 3.02 ± 1.027
GlyHis: 0.503 ± 0.481
GlyIle: 4.529 ± 1.284
GlyLys: 1.007 ± 0.606
GlyLeu: 4.026 ± 1.135
GlyMet: 0.503 ± 0.529
GlyAsn: 1.51 ± 0.775
GlyPro: 0.503 ± 0.462
GlyGln: 0.503 ± 0.51
GlyArg: 2.013 ± 1.114
GlySer: 2.516 ± 1.18
GlyThr: 2.516 ± 1.309
GlyVal: 3.523 ± 1.093
GlyTrp: 0.503 ± 0.598
GlyTyr: 5.536 ± 1.312
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.007 ± 0.962
HisCys: 0.0 ± 0.0
HisAsp: 1.007 ± 0.667
HisGlu: 0.503 ± 0.452
HisPhe: 0.503 ± 0.481
HisGly: 0.503 ± 0.481
HisHis: 0.0 ± 0.0
HisIle: 0.503 ± 0.605
HisLys: 2.516 ± 0.977
HisLeu: 2.013 ± 0.698
HisMet: 0.0 ± 0.0
HisAsn: 2.516 ± 1.009
HisPro: 0.0 ± 0.0
HisGln: 0.0 ± 0.0
HisArg: 0.503 ± 0.481
HisSer: 0.503 ± 0.481
HisThr: 1.007 ± 0.657
HisVal: 0.503 ± 0.605
HisTrp: 0.0 ± 0.0
HisTyr: 0.503 ± 0.513
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.523 ± 1.343
IleCys: 0.503 ± 0.452
IleAsp: 4.529 ± 1.386
IleGlu: 7.046 ± 1.955
IlePhe: 2.516 ± 1.195
IleGly: 0.503 ± 0.462
IleHis: 1.007 ± 0.647
IleIle: 3.02 ± 1.303
IleLys: 8.556 ± 1.927
IleLeu: 5.536 ± 1.811
IleMet: 1.51 ± 0.835
IleAsn: 5.536 ± 1.215
IlePro: 2.013 ± 1.009
IleGln: 2.516 ± 1.052
IleArg: 3.523 ± 1.257
IleSer: 4.026 ± 1.269
IleThr: 2.516 ± 0.743
IleVal: 3.02 ± 1.469
IleTrp: 1.51 ± 0.808
IleTyr: 6.039 ± 1.733
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.543 ± 2.815
LysCys: 1.007 ± 0.594
LysAsp: 2.516 ± 1.215
LysGlu: 12.079 ± 1.842
LysPhe: 2.013 ± 0.934
LysGly: 7.046 ± 2.395
LysHis: 1.51 ± 0.707
LysIle: 6.039 ± 1.869
LysLys: 10.065 ± 2.18
LysLeu: 10.065 ± 1.581
LysMet: 3.02 ± 1.186
LysAsn: 6.543 ± 1.755
LysPro: 3.02 ± 0.859
LysGln: 4.026 ± 1.403
LysArg: 7.549 ± 2.113
LysSer: 3.523 ± 1.213
LysThr: 6.039 ± 1.516
LysVal: 4.026 ± 1.004
LysTrp: 1.51 ± 0.704
LysTyr: 4.529 ± 1.216
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.039 ± 1.473
LeuCys: 1.007 ± 0.598
LeuAsp: 8.556 ± 1.553
LeuGlu: 8.052 ± 1.302
LeuPhe: 5.536 ± 1.845
LeuGly: 6.039 ± 1.58
LeuHis: 1.51 ± 0.933
LeuIle: 5.033 ± 1.54
LeuLys: 14.595 ± 1.775
LeuLeu: 10.065 ± 2.436
LeuMet: 3.523 ± 1.307
LeuAsn: 6.543 ± 2.346
LeuPro: 3.02 ± 0.891
LeuGln: 3.523 ± 1.68
LeuArg: 3.523 ± 1.572
LeuSer: 4.529 ± 1.635
LeuThr: 6.543 ± 1.766
LeuVal: 4.026 ± 1.852
LeuTrp: 0.503 ± 0.513
LeuTyr: 2.516 ± 0.913
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.02 ± 1.344
MetCys: 0.503 ± 0.513
MetAsp: 1.51 ± 0.911
MetGlu: 1.51 ± 1.095
MetPhe: 1.51 ± 0.959
MetGly: 0.503 ± 0.51
MetHis: 0.0 ± 0.0
MetIle: 1.007 ± 0.924
MetLys: 2.516 ± 1.001
MetLeu: 3.523 ± 1.152
MetMet: 0.0 ± 0.0
MetAsn: 2.516 ± 0.778
MetPro: 0.503 ± 0.462
MetGln: 0.503 ± 0.437
MetArg: 1.007 ± 0.58
MetSer: 1.51 ± 0.836
MetThr: 2.013 ± 0.896
MetVal: 2.516 ± 0.941
MetTrp: 0.0 ± 0.0
MetTyr: 0.503 ± 0.529
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.529 ± 1.578
AsnCys: 0.0 ± 0.0
AsnAsp: 2.516 ± 1.33
AsnGlu: 3.02 ± 1.007
AsnPhe: 3.02 ± 1.155
AsnGly: 2.013 ± 0.811
AsnHis: 1.007 ± 0.703
AsnIle: 2.516 ± 1.327
AsnLys: 4.026 ± 1.341
AsnLeu: 8.556 ± 2.135
AsnMet: 2.516 ± 1.203
AsnAsn: 4.529 ± 1.217
AsnPro: 2.516 ± 1.275
AsnGln: 2.516 ± 1.477
AsnArg: 3.02 ± 0.816
AsnSer: 3.02 ± 1.013
AsnThr: 4.529 ± 2.001
AsnVal: 4.026 ± 1.597
AsnTrp: 1.007 ± 0.704
AsnTyr: 2.013 ± 0.868
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.007 ± 0.615
ProCys: 0.0 ± 0.0
ProAsp: 1.007 ± 0.676
ProGlu: 2.013 ± 1.027
ProPhe: 1.51 ± 1.103
ProGly: 0.0 ± 0.0
ProHis: 0.503 ± 0.516
ProIle: 1.007 ± 0.655
ProLys: 5.536 ± 1.244
ProLeu: 1.51 ± 0.948
ProMet: 1.007 ± 0.676
ProAsn: 2.013 ± 0.815
ProPro: 0.503 ± 0.437
ProGln: 0.503 ± 0.462
ProArg: 2.013 ± 0.686
ProSer: 0.0 ± 0.0
ProThr: 2.013 ± 1.081
ProVal: 1.51 ± 0.781
ProTrp: 0.0 ± 0.0
ProTyr: 1.007 ± 0.58
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.516 ± 1.211
GlnCys: 0.503 ± 0.462
GlnAsp: 3.02 ± 1.053
GlnGlu: 4.026 ± 1.395
GlnPhe: 0.503 ± 0.437
GlnGly: 2.013 ± 0.964
GlnHis: 1.007 ± 0.626
GlnIle: 3.02 ± 1.493
GlnLys: 5.033 ± 1.1
GlnLeu: 3.523 ± 1.156
GlnMet: 0.0 ± 0.0
GlnAsn: 1.007 ± 0.704
GlnPro: 0.503 ± 0.481
GlnGln: 4.026 ± 1.776
GlnArg: 1.007 ± 0.677
GlnSer: 4.529 ± 1.572
GlnThr: 2.013 ± 0.856
GlnVal: 2.013 ± 1.033
GlnTrp: 0.0 ± 0.0
GlnTyr: 2.516 ± 0.785
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.013 ± 0.8
ArgCys: 1.007 ± 1.026
ArgAsp: 2.013 ± 0.975
ArgGlu: 3.02 ± 0.981
ArgPhe: 1.007 ± 0.682
ArgGly: 3.523 ± 1.164
ArgHis: 0.503 ± 0.481
ArgIle: 4.529 ± 1.119
ArgLys: 5.033 ± 1.334
ArgLeu: 7.549 ± 2.102
ArgMet: 0.503 ± 0.651
ArgAsn: 3.02 ± 1.846
ArgPro: 2.516 ± 1.05
ArgGln: 3.02 ± 1.027
ArgArg: 3.523 ± 1.536
ArgSer: 1.007 ± 0.682
ArgThr: 1.007 ± 0.749
ArgVal: 2.516 ± 1.309
ArgTrp: 1.007 ± 0.691
ArgTyr: 2.516 ± 1.39
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 1.51 ± 0.684
SerCys: 0.0 ± 0.0
SerAsp: 3.523 ± 1.285
SerGlu: 5.536 ± 1.504
SerPhe: 2.516 ± 1.259
SerGly: 1.007 ± 0.691
SerHis: 0.0 ± 0.0
SerIle: 4.026 ± 1.869
SerLys: 3.523 ± 0.918
SerLeu: 5.033 ± 1.337
SerMet: 1.51 ± 0.781
SerAsn: 1.007 ± 0.674
SerPro: 0.503 ± 0.462
SerGln: 4.529 ± 2.188
SerArg: 3.02 ± 1.297
SerSer: 1.51 ± 0.923
SerThr: 2.013 ± 0.776
SerVal: 3.02 ± 1.537
SerTrp: 0.0 ± 0.0
SerTyr: 5.033 ± 1.327
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.02 ± 0.992
ThrCys: 0.0 ± 0.0
ThrAsp: 2.516 ± 1.016
ThrGlu: 6.039 ± 1.99
ThrPhe: 1.007 ± 0.695
ThrGly: 2.516 ± 1.228
ThrHis: 1.007 ± 0.692
ThrIle: 4.529 ± 1.318
ThrLys: 8.556 ± 1.739
ThrLeu: 4.026 ± 0.99
ThrMet: 1.007 ± 0.575
ThrAsn: 2.013 ± 0.934
ThrPro: 3.02 ± 1.22
ThrGln: 2.013 ± 0.972
ThrArg: 0.503 ± 0.512
ThrSer: 1.007 ± 0.698
ThrThr: 2.516 ± 1.182
ThrVal: 4.529 ± 1.098
ThrTrp: 1.007 ± 0.905
ThrTyr: 3.02 ± 0.816
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.51 ± 0.788
ValCys: 0.0 ± 0.0
ValAsp: 0.503 ± 0.462
ValGlu: 4.026 ± 1.913
ValPhe: 5.033 ± 1.422
ValGly: 1.51 ± 0.792
ValHis: 1.007 ± 0.626
ValIle: 3.02 ± 1.169
ValLys: 6.039 ± 2.147
ValLeu: 4.529 ± 1.412
ValMet: 1.51 ± 1.136
ValAsn: 3.02 ± 1.726
ValPro: 1.007 ± 0.712
ValGln: 2.516 ± 1.069
ValArg: 3.02 ± 1.222
ValSer: 3.02 ± 0.928
ValThr: 3.02 ± 0.886
ValVal: 3.02 ± 1.512
ValTrp: 0.0 ± 0.0
ValTyr: 4.026 ± 1.179
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.013 ± 0.845
TrpCys: 0.0 ± 0.0
TrpAsp: 1.007 ± 0.712
TrpGlu: 2.013 ± 0.887
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.503 ± 0.452
TrpIle: 0.503 ± 0.452
TrpLys: 0.0 ± 0.0
TrpLeu: 1.007 ± 0.676
TrpMet: 0.0 ± 0.0
TrpAsn: 1.007 ± 0.691
TrpPro: 0.0 ± 0.0
TrpGln: 0.503 ± 0.512
TrpArg: 0.503 ± 0.516
TrpSer: 0.503 ± 0.437
TrpThr: 0.0 ± 0.0
TrpVal: 0.503 ± 0.516
TrpTrp: 1.007 ± 0.594
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.516 ± 1.554
TyrCys: 1.007 ± 0.682
TyrAsp: 1.51 ± 0.729
TyrGlu: 3.523 ± 1.362
TyrPhe: 2.516 ± 1.344
TyrGly: 2.516 ± 0.986
TyrHis: 1.007 ± 0.667
TyrIle: 2.516 ± 1.384
TyrLys: 6.543 ± 1.711
TyrLeu: 4.529 ± 1.433
TyrMet: 2.013 ± 1.195
TyrAsn: 4.529 ± 1.229
TyrPro: 0.503 ± 0.516
TyrGln: 4.026 ± 1.072
TyrArg: 5.033 ± 1.822
TyrSer: 4.026 ± 1.108
TyrThr: 3.523 ± 1.286
TyrVal: 1.51 ± 0.816
TyrTrp: 0.503 ± 0.513
TyrTyr: 3.02 ± 1.561
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 16 proteins (1988 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
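One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequencies are recomputed for each replicate, and the spread across replicates is reported as the error. Everything specific here is an assumption made for illustration, including the function names, the use of the sample standard deviation, the fixed seed, and the counting of pairs within each protein only. The page states only that 100 bootstrap rounds at the protein level were used.

```python
import random
from collections import Counter
from statistics import stdev


def dipeptide_permille(proteins):
    """Per-mille frequencies of overlapping dipeptides (within proteins)."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return {p: 1000.0 * n / total for p, n in counts.items()} if total else {}


def bootstrap_error(proteins, n_rounds=100, seed=0):
    """Estimate per-dipeptide error by resampling whole proteins.

    Requires n_rounds >= 2 so that a standard deviation is defined.
    """
    rng = random.Random(seed)
    replicates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        resampled = [rng.choice(proteins) for _ in proteins]
        replicates.append(dipeptide_permille(resampled))
    pairs = set().union(*replicates)
    # A dipeptide missing from a replicate has frequency 0 there.
    return {p: stdev([r.get(p, 0.0) for r in replicates]) for p in pairs}


# Hypothetical toy proteome in one-letter code.
errors = bootstrap_error(["MKLLE", "MKE", "MLLKK"])
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects how much the result depends on which proteins happen to be in the set.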

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski