Amino acid dipeptide frequency for Streptococcus satellite phage Javan409

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
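As a rough illustration of what a dipeptide frequency is, the sketch below counts overlapping residue pairs within each protein and reports them in per mille. The function name `dipeptide_frequencies`, the toy sequences, the one-letter residue codes, and the normalisation by the total number of pairs are all assumptions made for this example; the source does not state how its values were normalised.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return per-mille frequencies of overlapping dipeptides.

    `proteins` is an iterable of one-letter amino acid strings.
    Pairs are counted inside each protein only, never across two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2 along the sequence.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalise by the total number of counted pairs.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input, not taken from the Javan409 proteome.
toy = ["MKKLE", "MKE"]
freqs = dipeptide_frequencies(toy)
```

For the toy input, six pairs are counted in total (MK, KK, KL, LE from the first sequence and MK, KE from the second), so MK accounts for two of the six.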

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
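The comparison against the uniform expectation can be sketched as follows. The baseline of 1/400 is the one described above; the function name `classify`, the use of a strict comparison with no tolerance, and the choice of example entries (values copied from the table) are assumptions for illustration, and the exact rule used to colour the original table is not stated in the source.

```python
# Uniform expectation for 400 dipeptides, expressed in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"   # would be marked red
    if freq_per_mille < expected:
        return "under"  # would be marked blue
    return "as expected"


# A few values taken from the table (per mille).
examples = {"LysGlu": 14.189, "LeuLeu": 13.006, "AlaThr": 2.66, "CysCys": 0.0}
labels = {name: classify(value) for name, value in examples.items()}
```

With Xaa included, the baseline would instead be 1000/441, roughly 2.27‰; pass it as the `expected` argument to use that convention.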

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
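As a worked example of the unit conversion, taking the LysGlu entry from the table (the symbol f is introduced here only for illustration):

```latex
\[
f_{\mathrm{LysGlu}} = 14.189 \times 10^{-3} = 0.014189 \approx 1.42\%
\]
```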

For more information, see the sequence space article on Wikipedia.

Dipeptide frequency ± error (‰), grouped by first residue; second residues in the order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.0 ± 0.0
AlaCys: 0.887 ± 0.424
AlaAsp: 3.252 ± 0.893
AlaGlu: 5.025 ± 1.422
AlaPhe: 1.774 ± 0.901
AlaGly: 1.478 ± 0.653
AlaHis: 0.296 ± 0.351
AlaIle: 6.503 ± 1.454
AlaLys: 4.138 ± 0.833
AlaLeu: 5.321 ± 1.216
AlaMet: 1.478 ± 0.425
AlaAsn: 1.774 ± 0.6
AlaPro: 2.069 ± 1.033
AlaGln: 2.365 ± 0.728
AlaArg: 1.478 ± 0.564
AlaSer: 2.365 ± 0.858
AlaThr: 2.66 ± 0.924
AlaVal: 2.069 ± 0.666
AlaTrp: 0.887 ± 0.435
AlaTyr: 3.547 ± 0.84
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.182 ± 0.651
CysCys: 0.0 ± 0.0
CysAsp: 0.887 ± 0.504
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 0.0 ± 0.0
CysHis: 0.296 ± 0.224
CysIle: 0.887 ± 0.466
CysLys: 0.0 ± 0.0
CysLeu: 1.478 ± 0.784
CysMet: 0.296 ± 0.334
CysAsn: 0.591 ± 0.448
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 0.296 ± 0.316
CysTrp: 0.0 ± 0.0
CysTyr: 0.296 ± 0.304
CysXaa: 0.0 ± 0.0
Asp
AspAla: 0.591 ± 0.46
AspCys: 0.887 ± 0.846
AspAsp: 7.685 ± 2.514
AspGlu: 4.138 ± 1.153
AspPhe: 4.434 ± 1.273
AspGly: 2.66 ± 0.666
AspHis: 0.887 ± 0.593
AspIle: 6.208 ± 0.622
AspLys: 6.503 ± 1.235
AspLeu: 7.981 ± 1.367
AspMet: 2.956 ± 0.954
AspAsn: 3.252 ± 0.826
AspPro: 0.591 ± 0.448
AspGln: 1.182 ± 0.456
AspArg: 1.478 ± 0.443
AspSer: 3.843 ± 1.914
AspThr: 4.73 ± 1.698
AspVal: 1.182 ± 0.487
AspTrp: 0.0 ± 0.0
AspTyr: 5.912 ± 1.362
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.39 ± 1.607
GluCys: 0.591 ± 0.414
GluAsp: 3.547 ± 1.158
GluGlu: 6.208 ± 2.451
GluPhe: 2.66 ± 0.932
GluGly: 1.182 ± 0.534
GluHis: 2.365 ± 0.981
GluIle: 5.321 ± 1.971
GluLys: 5.912 ± 1.34
GluLeu: 10.641 ± 2.132
GluMet: 1.478 ± 0.512
GluAsn: 6.799 ± 1.769
GluPro: 4.73 ± 1.201
GluGln: 4.138 ± 1.232
GluArg: 4.138 ± 0.838
GluSer: 3.252 ± 1.103
GluThr: 6.503 ± 1.169
GluVal: 5.616 ± 1.439
GluTrp: 0.591 ± 0.366
GluTyr: 5.616 ± 1.399
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.182 ± 0.723
PheCys: 0.0 ± 0.0
PheAsp: 3.547 ± 0.906
PheGlu: 4.138 ± 1.445
PhePhe: 0.887 ± 0.526
PheGly: 1.774 ± 0.598
PheHis: 0.591 ± 0.382
PheIle: 4.138 ± 1.233
PheLys: 5.321 ± 1.437
PheLeu: 2.66 ± 0.848
PheMet: 0.591 ± 0.434
PheAsn: 3.843 ± 0.861
PhePro: 0.296 ± 0.288
PheGln: 1.478 ± 0.789
PheArg: 1.478 ± 0.778
PheSer: 1.774 ± 0.558
PheThr: 2.069 ± 0.972
PheVal: 0.296 ± 0.288
PheTrp: 0.296 ± 0.316
PheTyr: 2.956 ± 1.041
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.365 ± 0.849
GlyCys: 0.296 ± 0.316
GlyAsp: 2.365 ± 0.932
GlyGlu: 1.478 ± 0.436
GlyPhe: 2.365 ± 0.736
GlyGly: 1.182 ± 0.492
GlyHis: 0.296 ± 0.25
GlyIle: 2.66 ± 0.569
GlyLys: 5.616 ± 1.501
GlyLeu: 4.434 ± 1.252
GlyMet: 1.774 ± 0.663
GlyAsn: 4.434 ± 0.786
GlyPro: 0.296 ± 0.318
GlyGln: 1.774 ± 0.743
GlyArg: 0.887 ± 0.453
GlySer: 2.069 ± 0.78
GlyThr: 2.365 ± 0.605
GlyVal: 1.182 ± 0.603
GlyTrp: 1.182 ± 0.687
GlyTyr: 2.956 ± 0.723
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.365 ± 0.964
HisCys: 0.296 ± 0.224
HisAsp: 0.296 ± 0.316
HisGlu: 1.182 ± 0.61
HisPhe: 1.478 ± 0.515
HisGly: 0.296 ± 0.28
HisHis: 0.296 ± 0.282
HisIle: 0.887 ± 0.413
HisLys: 0.887 ± 0.511
HisLeu: 2.956 ± 0.785
HisMet: 0.0 ± 0.0
HisAsn: 0.591 ± 0.4
HisPro: 0.0 ± 0.0
HisGln: 0.296 ± 0.353
HisArg: 0.0 ± 0.0
HisSer: 1.478 ± 0.597
HisThr: 1.182 ± 0.582
HisVal: 0.296 ± 0.224
HisTrp: 0.0 ± 0.0
HisTyr: 0.591 ± 0.328
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.138 ± 0.968
IleCys: 0.296 ± 0.316
IleAsp: 4.434 ± 0.947
IleGlu: 7.094 ± 1.512
IlePhe: 2.365 ± 0.882
IleGly: 4.434 ± 1.115
IleHis: 0.887 ± 0.436
IleIle: 6.503 ± 1.267
IleLys: 7.685 ± 1.472
IleLeu: 6.503 ± 1.763
IleMet: 1.478 ± 0.599
IleAsn: 5.025 ± 1.271
IlePro: 3.252 ± 0.979
IleGln: 3.547 ± 0.661
IleArg: 2.365 ± 0.562
IleSer: 7.685 ± 1.144
IleThr: 4.138 ± 0.806
IleVal: 4.138 ± 1.229
IleTrp: 0.296 ± 0.224
IleTyr: 3.252 ± 0.863
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.616 ± 1.394
LysCys: 0.0 ± 0.0
LysAsp: 4.138 ± 0.917
LysGlu: 14.189 ± 2.79
LysPhe: 2.66 ± 1.249
LysGly: 2.365 ± 0.664
LysHis: 2.66 ± 0.654
LysIle: 6.799 ± 1.832
LysLys: 10.641 ± 2.578
LysLeu: 8.277 ± 1.558
LysMet: 4.138 ± 1.077
LysAsn: 4.73 ± 0.975
LysPro: 3.252 ± 1.04
LysGln: 2.66 ± 0.745
LysArg: 5.321 ± 1.392
LysSer: 5.616 ± 1.283
LysThr: 5.616 ± 1.395
LysVal: 3.843 ± 1.074
LysTrp: 0.591 ± 0.374
LysTyr: 4.138 ± 0.865
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.208 ± 1.365
LeuCys: 0.591 ± 0.448
LeuAsp: 12.119 ± 2.085
LeuGlu: 9.459 ± 1.89
LeuPhe: 2.069 ± 0.892
LeuGly: 6.799 ± 1.296
LeuHis: 0.887 ± 0.394
LeuIle: 7.39 ± 1.348
LeuLys: 6.503 ± 1.302
LeuLeu: 13.006 ± 1.775
LeuMet: 2.365 ± 0.773
LeuAsn: 7.39 ± 1.847
LeuPro: 3.547 ± 1.147
LeuGln: 3.547 ± 0.865
LeuArg: 3.252 ± 1.007
LeuSer: 4.138 ± 1.026
LeuThr: 7.981 ± 1.579
LeuVal: 2.956 ± 0.91
LeuTrp: 0.591 ± 0.328
LeuTyr: 3.252 ± 0.634
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.66 ± 1.008
MetCys: 0.296 ± 0.318
MetAsp: 0.887 ± 0.408
MetGlu: 1.774 ± 0.737
MetPhe: 0.591 ± 0.328
MetGly: 0.887 ± 0.555
MetHis: 0.0 ± 0.0
MetIle: 1.182 ± 0.56
MetLys: 5.025 ± 1.1
MetLeu: 2.365 ± 0.851
MetMet: 0.0 ± 0.0
MetAsn: 2.956 ± 0.961
MetPro: 0.296 ± 0.224
MetGln: 0.296 ± 0.224
MetArg: 2.069 ± 0.759
MetSer: 1.478 ± 0.477
MetThr: 1.478 ± 0.598
MetVal: 2.069 ± 0.74
MetTrp: 0.0 ± 0.0
MetTyr: 0.296 ± 0.266
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.843 ± 0.912
AsnCys: 0.887 ± 0.451
AsnAsp: 5.321 ± 1.17
AsnGlu: 8.277 ± 1.577
AsnPhe: 2.66 ± 0.835
AsnGly: 3.547 ± 1.199
AsnHis: 1.182 ± 0.456
AsnIle: 4.73 ± 1.524
AsnLys: 6.208 ± 1.14
AsnLeu: 5.912 ± 1.132
AsnMet: 1.478 ± 0.941
AsnAsn: 2.365 ± 0.689
AsnPro: 2.069 ± 0.65
AsnGln: 1.774 ± 0.869
AsnArg: 1.774 ± 0.606
AsnSer: 3.252 ± 1.126
AsnThr: 2.956 ± 1.094
AsnVal: 2.069 ± 0.602
AsnTrp: 0.591 ± 0.408
AsnTyr: 4.434 ± 1.2
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.887 ± 0.344
ProCys: 0.0 ± 0.0
ProAsp: 2.365 ± 0.88
ProGlu: 3.252 ± 0.886
ProPhe: 1.774 ± 0.699
ProGly: 0.296 ± 0.295
ProHis: 0.296 ± 0.353
ProIle: 1.478 ± 0.537
ProLys: 4.434 ± 1.049
ProLeu: 2.365 ± 0.692
ProMet: 1.478 ± 0.706
ProAsn: 2.365 ± 0.774
ProPro: 1.478 ± 0.561
ProGln: 0.887 ± 0.713
ProArg: 2.365 ± 0.906
ProSer: 0.887 ± 0.49
ProThr: 1.182 ± 0.709
ProVal: 1.478 ± 0.614
ProTrp: 0.0 ± 0.0
ProTyr: 2.365 ± 1.213
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.843 ± 1.124
GlnCys: 0.591 ± 0.68
GlnAsp: 0.887 ± 0.461
GlnGlu: 2.069 ± 0.796
GlnPhe: 1.182 ± 0.549
GlnGly: 1.182 ± 0.523
GlnHis: 0.296 ± 0.28
GlnIle: 2.956 ± 0.802
GlnLys: 1.774 ± 0.677
GlnLeu: 4.138 ± 1.365
GlnMet: 0.591 ± 0.419
GlnAsn: 0.591 ± 0.411
GlnPro: 1.182 ± 0.7
GlnGln: 1.182 ± 0.468
GlnArg: 1.774 ± 0.68
GlnSer: 1.478 ± 0.868
GlnThr: 1.478 ± 0.544
GlnVal: 2.069 ± 0.838
GlnTrp: 0.296 ± 0.304
GlnTyr: 2.069 ± 0.672
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 0.591 ± 0.339
ArgCys: 0.0 ± 0.0
ArgAsp: 2.66 ± 0.862
ArgGlu: 3.547 ± 0.801
ArgPhe: 0.887 ± 0.486
ArgGly: 2.956 ± 0.828
ArgHis: 0.591 ± 0.5
ArgIle: 2.365 ± 0.813
ArgLys: 5.912 ± 1.163
ArgLeu: 6.799 ± 1.348
ArgMet: 0.296 ± 0.308
ArgAsn: 1.182 ± 0.545
ArgPro: 0.887 ± 0.511
ArgGln: 0.591 ± 0.363
ArgArg: 0.296 ± 0.224
ArgSer: 1.182 ± 0.582
ArgThr: 2.365 ± 0.69
ArgVal: 3.547 ± 0.854
ArgTrp: 0.296 ± 0.33
ArgTyr: 0.591 ± 0.448
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 1.478 ± 0.672
SerCys: 0.0 ± 0.0
SerAsp: 5.912 ± 1.351
SerGlu: 4.138 ± 0.873
SerPhe: 2.365 ± 0.987
SerGly: 2.66 ± 0.639
SerHis: 1.478 ± 0.588
SerIle: 4.138 ± 1.194
SerLys: 4.73 ± 1.268
SerLeu: 3.843 ± 0.67
SerMet: 0.887 ± 0.686
SerAsn: 5.321 ± 1.314
SerPro: 1.478 ± 0.481
SerGln: 2.365 ± 0.551
SerArg: 1.182 ± 0.429
SerSer: 1.774 ± 0.699
SerThr: 1.478 ± 0.868
SerVal: 2.66 ± 0.95
SerTrp: 0.296 ± 0.33
SerTyr: 4.434 ± 1.35
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.774 ± 0.637
ThrCys: 0.296 ± 0.224
ThrAsp: 1.774 ± 0.945
ThrGlu: 5.616 ± 1.302
ThrPhe: 5.321 ± 1.202
ThrGly: 3.252 ± 0.81
ThrHis: 0.591 ± 0.382
ThrIle: 5.616 ± 1.558
ThrLys: 3.843 ± 1.164
ThrLeu: 8.277 ± 1.534
ThrMet: 1.478 ± 0.607
ThrAsn: 2.956 ± 1.083
ThrPro: 2.956 ± 0.882
ThrGln: 1.774 ± 0.801
ThrArg: 2.66 ± 0.782
ThrSer: 1.478 ± 0.588
ThrThr: 3.547 ± 0.933
ThrVal: 4.434 ± 1.229
ThrTrp: 0.591 ± 0.385
ThrTyr: 2.66 ± 0.875
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.365 ± 0.823
ValCys: 0.0 ± 0.0
ValAsp: 2.069 ± 0.686
ValGlu: 1.182 ± 0.722
ValPhe: 1.774 ± 0.716
ValGly: 2.069 ± 0.753
ValHis: 0.0 ± 0.0
ValIle: 3.843 ± 1.107
ValLys: 5.912 ± 0.967
ValLeu: 2.66 ± 0.886
ValMet: 1.182 ± 0.565
ValAsn: 5.025 ± 1.048
ValPro: 2.069 ± 0.926
ValGln: 0.296 ± 0.318
ValArg: 0.591 ± 0.422
ValSer: 4.73 ± 1.077
ValThr: 5.321 ± 1.444
ValVal: 2.66 ± 0.676
ValTrp: 0.0 ± 0.0
ValTyr: 1.478 ± 0.72
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 0.296 ± 0.25
TrpGlu: 1.182 ± 0.628
TrpPhe: 0.591 ± 0.448
TrpGly: 0.591 ± 0.36
TrpHis: 0.296 ± 0.224
TrpIle: 1.182 ± 0.547
TrpLys: 0.296 ± 0.25
TrpLeu: 0.591 ± 0.459
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.296 ± 0.224
TrpArg: 0.296 ± 0.224
TrpSer: 0.591 ± 0.327
TrpThr: 0.0 ± 0.0
TrpVal: 0.296 ± 0.316
TrpTrp: 0.296 ± 0.25
TrpTyr: 0.296 ± 0.224
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.887 ± 0.457
TyrCys: 0.296 ± 0.288
TyrAsp: 2.66 ± 1.321
TyrGlu: 4.73 ± 1.279
TyrPhe: 1.774 ± 0.558
TyrGly: 2.365 ± 0.951
TyrHis: 1.182 ± 0.581
TyrIle: 4.73 ± 0.972
TyrLys: 5.912 ± 1.656
TyrLeu: 3.843 ± 0.911
TyrMet: 2.365 ± 1.044
TyrAsn: 4.138 ± 1.05
TyrPro: 1.182 ± 0.605
TyrGln: 1.182 ± 0.541
TyrArg: 4.138 ± 1.079
TyrSer: 3.547 ± 0.986
TyrThr: 3.843 ± 0.993
TyrVal: 2.069 ± 0.653
TyrTrp: 0.296 ± 0.318
TyrTyr: 1.478 ± 0.692
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 24 proteins (3384 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
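A minimal sketch of what a protein-level bootstrap could look like is given below. Only the number of resamples (100) and the resampling unit (whole proteins) come from the note above; the function names `pair_frequency` and `bootstrap_error`, the fixed seed, the toy sequences, and the use of the sample standard deviation as the reported error are assumptions, since the source does not describe the procedure in more detail.

```python
import random
from statistics import stdev


def pair_frequency(proteins, pair):
    """Per-mille frequency of one dipeptide among all overlapping pairs."""
    count = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                count += 1
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the frequency over resampled proteomes.

    Each resample draws len(proteins) whole proteins with replacement,
    so residues within a protein are never shuffled or split.
    """
    rng = random.Random(seed)  # fixed seed only for reproducibility
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    # Assumption: report the sample standard deviation as the error.
    return stdev(estimates)


# Hypothetical toy proteome in one-letter code.
toy = ["MKKLE", "MKE", "LLKKA"]
kk_error = bootstrap_error(toy, "KK")
```

A dipeptide that never occurs has the same frequency (zero) in every resample, so its estimated error is zero as well, which is consistent with the `0.0 ± 0.0` entries in the table.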

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski