Amino acid dipepetide frequency for Salmonella phage astrithr

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.683AlaAla: 4.683 ± 1.764
0.0AlaCys: 0.0 ± 0.0
6.061AlaAsp: 6.061 ± 1.231
2.755AlaGlu: 2.755 ± 1.02
3.306AlaPhe: 3.306 ± 1.058
4.132AlaGly: 4.132 ± 1.062
0.0AlaHis: 0.0 ± 0.0
4.408AlaIle: 4.408 ± 1.024
6.061AlaLys: 6.061 ± 1.36
6.887AlaLeu: 6.887 ± 1.274
1.653AlaMet: 1.653 ± 0.919
4.132AlaAsn: 4.132 ± 1.122
1.928AlaPro: 1.928 ± 0.967
2.755AlaGln: 2.755 ± 1.029
2.755AlaArg: 2.755 ± 1.063
3.306AlaSer: 3.306 ± 0.84
4.132AlaThr: 4.132 ± 1.682
6.336AlaVal: 6.336 ± 1.512
0.826AlaTrp: 0.826 ± 0.485
2.755AlaTyr: 2.755 ± 1.082
0.0AlaXaa: 0.0 ± 0.0
Cys
0.551CysAla: 0.551 ± 0.767
0.0CysCys: 0.0 ± 0.0
0.826CysAsp: 0.826 ± 0.51
1.102CysGlu: 1.102 ± 0.717
0.0CysPhe: 0.0 ± 0.0
0.275CysGly: 0.275 ± 0.244
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.275CysLys: 0.275 ± 0.256
0.551CysLeu: 0.551 ± 0.42
0.0CysMet: 0.0 ± 0.0
0.551CysAsn: 0.551 ± 0.366
0.275CysPro: 0.275 ± 0.256
0.0CysGln: 0.0 ± 0.0
0.826CysArg: 0.826 ± 0.473
0.826CysSer: 0.826 ± 0.536
0.551CysThr: 0.551 ± 0.614
0.551CysVal: 0.551 ± 0.308
0.0CysTrp: 0.0 ± 0.0
0.551CysTyr: 0.551 ± 0.285
0.0CysXaa: 0.0 ± 0.0
Asp
4.959AspAla: 4.959 ± 1.046
0.275AspCys: 0.275 ± 0.232
2.479AspAsp: 2.479 ± 0.949
3.581AspGlu: 3.581 ± 1.06
1.377AspPhe: 1.377 ± 0.464
5.51AspGly: 5.51 ± 1.174
0.275AspHis: 0.275 ± 0.383
6.061AspIle: 6.061 ± 1.132
4.408AspLys: 4.408 ± 0.72
6.887AspLeu: 6.887 ± 1.276
1.653AspMet: 1.653 ± 0.519
5.234AspAsn: 5.234 ± 1.718
1.377AspPro: 1.377 ± 1.029
1.377AspGln: 1.377 ± 0.503
1.377AspArg: 1.377 ± 1.019
6.061AspSer: 6.061 ± 1.1
4.132AspThr: 4.132 ± 1.222
4.132AspVal: 4.132 ± 0.661
1.377AspTrp: 1.377 ± 0.626
3.306AspTyr: 3.306 ± 0.883
0.0AspXaa: 0.0 ± 0.0
Glu
2.204GluAla: 2.204 ± 0.707
0.0GluCys: 0.0 ± 0.0
2.755GluAsp: 2.755 ± 0.801
1.102GluGlu: 1.102 ± 0.555
3.581GluPhe: 3.581 ± 1.42
4.132GluGly: 4.132 ± 1.046
1.653GluHis: 1.653 ± 0.807
4.683GluIle: 4.683 ± 1.106
3.581GluLys: 3.581 ± 0.958
5.51GluLeu: 5.51 ± 1.118
1.928GluMet: 1.928 ± 0.941
7.438GluAsn: 7.438 ± 0.842
0.551GluPro: 0.551 ± 0.412
2.755GluGln: 2.755 ± 1.252
3.857GluArg: 3.857 ± 0.791
4.959GluSer: 4.959 ± 1.343
5.785GluThr: 5.785 ± 1.179
4.132GluVal: 4.132 ± 1.143
1.102GluTrp: 1.102 ± 0.703
3.306GluTyr: 3.306 ± 0.882
0.0GluXaa: 0.0 ± 0.0
Phe
2.479PheAla: 2.479 ± 0.652
0.0PheCys: 0.0 ± 0.0
3.857PheAsp: 3.857 ± 0.633
1.928PheGlu: 1.928 ± 0.54
2.204PhePhe: 2.204 ± 0.626
2.204PheGly: 2.204 ± 0.562
0.275PheHis: 0.275 ± 0.255
3.03PheIle: 3.03 ± 1.279
2.204PheLys: 2.204 ± 1.074
1.377PheLeu: 1.377 ± 0.789
1.928PheMet: 1.928 ± 0.821
4.683PheAsn: 4.683 ± 1.314
1.653PhePro: 1.653 ± 0.684
0.551PheGln: 0.551 ± 0.308
0.826PheArg: 0.826 ± 0.435
2.479PheSer: 2.479 ± 0.507
5.234PheThr: 5.234 ± 1.196
1.928PheVal: 1.928 ± 0.668
0.0PheTrp: 0.0 ± 0.0
3.03PheTyr: 3.03 ± 0.806
0.0PheXaa: 0.0 ± 0.0
Gly
3.306GlyAla: 3.306 ± 0.727
0.0GlyCys: 0.0 ± 0.0
4.132GlyAsp: 4.132 ± 1.029
6.612GlyGlu: 6.612 ± 1.012
2.755GlyPhe: 2.755 ± 0.685
4.959GlyGly: 4.959 ± 1.271
0.551GlyHis: 0.551 ± 0.336
4.408GlyIle: 4.408 ± 0.972
3.857GlyLys: 3.857 ± 0.877
4.132GlyLeu: 4.132 ± 1.156
2.204GlyMet: 2.204 ± 0.858
4.408GlyAsn: 4.408 ± 0.898
0.0GlyPro: 0.0 ± 0.0
1.653GlyGln: 1.653 ± 0.709
2.755GlyArg: 2.755 ± 1.098
3.857GlySer: 3.857 ± 1.067
4.132GlyThr: 4.132 ± 1.06
7.713GlyVal: 7.713 ± 1.21
0.551GlyTrp: 0.551 ± 0.425
2.755GlyTyr: 2.755 ± 1.202
0.0GlyXaa: 0.0 ± 0.0
His
0.551HisAla: 0.551 ± 0.431
0.0HisCys: 0.0 ± 0.0
1.102HisAsp: 1.102 ± 0.482
1.653HisGlu: 1.653 ± 0.806
0.826HisPhe: 0.826 ± 0.473
0.551HisGly: 0.551 ± 0.464
0.275HisHis: 0.275 ± 0.244
1.653HisIle: 1.653 ± 0.767
0.551HisLys: 0.551 ± 0.489
1.377HisLeu: 1.377 ± 0.509
0.0HisMet: 0.0 ± 0.0
1.928HisAsn: 1.928 ± 0.72
0.275HisPro: 0.275 ± 0.232
0.551HisGln: 0.551 ± 0.489
0.0HisArg: 0.0 ± 0.0
0.826HisSer: 0.826 ± 0.57
0.551HisThr: 0.551 ± 0.489
0.275HisVal: 0.275 ± 0.383
0.0HisTrp: 0.0 ± 0.0
0.826HisTyr: 0.826 ± 0.421
0.0HisXaa: 0.0 ± 0.0
Ile
3.857IleAla: 3.857 ± 1.171
0.826IleCys: 0.826 ± 0.528
5.234IleAsp: 5.234 ± 1.115
4.408IleGlu: 4.408 ± 1.464
2.479IlePhe: 2.479 ± 0.978
3.581IleGly: 3.581 ± 0.846
1.928IleHis: 1.928 ± 0.58
3.581IleIle: 3.581 ± 0.882
7.713IleLys: 7.713 ± 1.75
3.306IleLeu: 3.306 ± 0.953
1.102IleMet: 1.102 ± 0.532
5.785IleAsn: 5.785 ± 1.313
3.03IlePro: 3.03 ± 0.931
1.928IleGln: 1.928 ± 0.779
1.653IleArg: 1.653 ± 0.757
5.234IleSer: 5.234 ± 1.541
7.163IleThr: 7.163 ± 1.807
2.755IleVal: 2.755 ± 0.794
0.275IleTrp: 0.275 ± 0.244
2.755IleTyr: 2.755 ± 0.914
0.0IleXaa: 0.0 ± 0.0
Lys
6.061LysAla: 6.061 ± 1.035
1.377LysCys: 1.377 ± 0.714
3.03LysAsp: 3.03 ± 0.812
5.51LysGlu: 5.51 ± 1.053
3.857LysPhe: 3.857 ± 0.964
3.857LysGly: 3.857 ± 1.358
1.102LysHis: 1.102 ± 0.633
4.132LysIle: 4.132 ± 1.303
2.755LysLys: 2.755 ± 0.716
7.163LysLeu: 7.163 ± 1.007
2.755LysMet: 2.755 ± 0.707
3.581LysAsn: 3.581 ± 1.002
3.03LysPro: 3.03 ± 0.864
2.204LysGln: 2.204 ± 0.813
4.408LysArg: 4.408 ± 1.311
3.857LysSer: 3.857 ± 0.59
6.612LysThr: 6.612 ± 1.412
4.683LysVal: 4.683 ± 1.486
0.275LysTrp: 0.275 ± 0.302
3.857LysTyr: 3.857 ± 1.419
0.0LysXaa: 0.0 ± 0.0
Leu
6.336LeuAla: 6.336 ± 1.678
0.826LeuCys: 0.826 ± 0.495
5.785LeuAsp: 5.785 ± 0.912
3.306LeuGlu: 3.306 ± 0.722
2.204LeuPhe: 2.204 ± 0.429
4.683LeuGly: 4.683 ± 0.959
0.551LeuHis: 0.551 ± 0.285
5.234LeuIle: 5.234 ± 1.091
4.959LeuLys: 4.959 ± 1.262
5.234LeuLeu: 5.234 ± 1.015
1.928LeuMet: 1.928 ± 1.017
5.785LeuAsn: 5.785 ± 1.23
2.479LeuPro: 2.479 ± 0.674
3.03LeuGln: 3.03 ± 0.759
1.377LeuArg: 1.377 ± 0.536
8.54LeuSer: 8.54 ± 1.344
5.785LeuThr: 5.785 ± 1.728
4.683LeuVal: 4.683 ± 0.676
0.826LeuTrp: 0.826 ± 0.512
3.581LeuTyr: 3.581 ± 0.861
0.0LeuXaa: 0.0 ± 0.0
Met
1.377MetAla: 1.377 ± 0.586
0.275MetCys: 0.275 ± 0.244
1.928MetAsp: 1.928 ± 0.701
3.03MetGlu: 3.03 ± 0.978
2.204MetPhe: 2.204 ± 0.857
1.928MetGly: 1.928 ± 0.804
0.275MetHis: 0.275 ± 0.383
1.102MetIle: 1.102 ± 0.526
3.581MetLys: 3.581 ± 0.741
2.479MetLeu: 2.479 ± 0.592
1.102MetMet: 1.102 ± 0.587
1.653MetAsn: 1.653 ± 0.978
0.551MetPro: 0.551 ± 0.557
1.102MetGln: 1.102 ± 0.358
1.377MetArg: 1.377 ± 0.572
1.928MetSer: 1.928 ± 0.963
2.204MetThr: 2.204 ± 0.884
1.102MetVal: 1.102 ± 0.439
0.551MetTrp: 0.551 ± 0.366
1.377MetTyr: 1.377 ± 0.887
0.0MetXaa: 0.0 ± 0.0
Asn
3.306AsnAla: 3.306 ± 1.012
0.275AsnCys: 0.275 ± 0.383
4.959AsnAsp: 4.959 ± 1.08
4.683AsnGlu: 4.683 ± 0.876
2.479AsnPhe: 2.479 ± 0.644
5.785AsnGly: 5.785 ± 1.497
1.377AsnHis: 1.377 ± 0.49
5.785AsnIle: 5.785 ± 1.846
8.264AsnLys: 8.264 ± 1.632
3.581AsnLeu: 3.581 ± 1.294
2.479AsnMet: 2.479 ± 0.932
4.408AsnAsn: 4.408 ± 1.076
1.653AsnPro: 1.653 ± 0.628
2.204AsnGln: 2.204 ± 0.86
1.653AsnArg: 1.653 ± 0.445
3.03AsnSer: 3.03 ± 1.035
3.581AsnThr: 3.581 ± 0.661
4.132AsnVal: 4.132 ± 1.063
0.826AsnTrp: 0.826 ± 0.462
2.755AsnTyr: 2.755 ± 0.845
0.0AsnXaa: 0.0 ± 0.0
Pro
4.132ProAla: 4.132 ± 0.982
0.275ProCys: 0.275 ± 0.307
2.755ProAsp: 2.755 ± 1.51
2.479ProGlu: 2.479 ± 0.543
1.653ProPhe: 1.653 ± 0.694
0.0ProGly: 0.0 ± 0.0
0.551ProHis: 0.551 ± 0.31
1.653ProIle: 1.653 ± 0.799
1.653ProLys: 1.653 ± 0.58
1.377ProLeu: 1.377 ± 0.613
0.826ProMet: 0.826 ± 0.334
1.377ProAsn: 1.377 ± 0.639
0.551ProPro: 0.551 ± 0.605
0.551ProGln: 0.551 ± 0.366
0.275ProArg: 0.275 ± 0.276
1.377ProSer: 1.377 ± 0.522
0.826ProThr: 0.826 ± 0.347
1.102ProVal: 1.102 ± 0.49
0.275ProTrp: 0.275 ± 0.244
2.755ProTyr: 2.755 ± 0.539
0.0ProXaa: 0.0 ± 0.0
Gln
3.857GlnAla: 3.857 ± 1.604
0.826GlnCys: 0.826 ± 0.495
2.204GlnAsp: 2.204 ± 0.771
2.755GlnGlu: 2.755 ± 1.001
1.653GlnPhe: 1.653 ± 0.674
1.653GlnGly: 1.653 ± 0.635
0.551GlnHis: 0.551 ± 0.366
1.653GlnIle: 1.653 ± 0.576
1.928GlnLys: 1.928 ± 0.896
3.03GlnLeu: 3.03 ± 0.943
1.653GlnMet: 1.653 ± 0.772
0.551GlnAsn: 0.551 ± 0.4
0.826GlnPro: 0.826 ± 0.365
1.377GlnGln: 1.377 ± 1.122
0.826GlnArg: 0.826 ± 0.499
1.377GlnSer: 1.377 ± 0.719
1.928GlnThr: 1.928 ± 0.832
1.653GlnVal: 1.653 ± 0.623
0.0GlnTrp: 0.0 ± 0.0
1.653GlnTyr: 1.653 ± 0.596
0.0GlnXaa: 0.0 ± 0.0
Arg
1.102ArgAla: 1.102 ± 0.567
0.551ArgCys: 0.551 ± 0.308
1.653ArgAsp: 1.653 ± 0.599
2.755ArgGlu: 2.755 ± 0.825
1.928ArgPhe: 1.928 ± 0.587
1.653ArgGly: 1.653 ± 0.622
0.275ArgHis: 0.275 ± 0.244
2.479ArgIle: 2.479 ± 0.894
2.755ArgLys: 2.755 ± 0.999
3.857ArgLeu: 3.857 ± 1.118
0.551ArgMet: 0.551 ± 0.37
1.928ArgAsn: 1.928 ± 0.666
0.275ArgPro: 0.275 ± 0.244
1.928ArgGln: 1.928 ± 0.749
1.928ArgArg: 1.928 ± 0.899
1.102ArgSer: 1.102 ± 0.686
1.377ArgThr: 1.377 ± 0.484
2.479ArgVal: 2.479 ± 0.796
0.275ArgTrp: 0.275 ± 0.256
0.826ArgTyr: 0.826 ± 0.525
0.0ArgXaa: 0.0 ± 0.0
Ser
4.132SerAla: 4.132 ± 1.248
0.275SerCys: 0.275 ± 0.276
4.959SerAsp: 4.959 ± 1.487
4.959SerGlu: 4.959 ± 1.529
0.826SerPhe: 0.826 ± 0.589
6.061SerGly: 6.061 ± 2.197
1.377SerHis: 1.377 ± 0.431
4.683SerIle: 4.683 ± 0.806
5.785SerLys: 5.785 ± 1.366
4.408SerLeu: 4.408 ± 0.776
2.755SerMet: 2.755 ± 0.951
2.479SerAsn: 2.479 ± 0.54
2.479SerPro: 2.479 ± 0.381
1.102SerGln: 1.102 ± 0.448
1.102SerArg: 1.102 ± 0.511
4.683SerSer: 4.683 ± 0.936
3.581SerThr: 3.581 ± 1.083
6.336SerVal: 6.336 ± 1.516
0.551SerTrp: 0.551 ± 0.502
1.928SerTyr: 1.928 ± 1.024
0.0SerXaa: 0.0 ± 0.0
Thr
7.713ThrAla: 7.713 ± 1.726
0.0ThrCys: 0.0 ± 0.0
4.408ThrAsp: 4.408 ± 0.865
3.306ThrGlu: 3.306 ± 0.944
1.377ThrPhe: 1.377 ± 0.324
6.061ThrGly: 6.061 ± 1.224
0.826ThrHis: 0.826 ± 0.733
4.132ThrIle: 4.132 ± 0.969
4.132ThrLys: 4.132 ± 1.051
7.713ThrLeu: 7.713 ± 1.67
1.928ThrMet: 1.928 ± 0.967
2.755ThrAsn: 2.755 ± 0.442
1.928ThrPro: 1.928 ± 0.549
2.204ThrGln: 2.204 ± 0.872
1.102ThrArg: 1.102 ± 0.576
3.857ThrSer: 3.857 ± 0.6
5.234ThrThr: 5.234 ± 1.249
7.438ThrVal: 7.438 ± 1.147
0.826ThrTrp: 0.826 ± 0.372
3.581ThrTyr: 3.581 ± 0.979
0.0ThrXaa: 0.0 ± 0.0
Val
4.132ValAla: 4.132 ± 1.111
0.551ValCys: 0.551 ± 0.422
4.959ValAsp: 4.959 ± 1.034
5.51ValGlu: 5.51 ± 1.082
3.306ValPhe: 3.306 ± 0.761
3.857ValGly: 3.857 ± 1.057
1.377ValHis: 1.377 ± 0.609
5.785ValIle: 5.785 ± 1.548
6.061ValLys: 6.061 ± 1.339
4.408ValLeu: 4.408 ± 1.032
2.204ValMet: 2.204 ± 0.581
3.581ValAsn: 3.581 ± 0.719
1.928ValPro: 1.928 ± 0.906
1.377ValGln: 1.377 ± 0.586
2.204ValArg: 2.204 ± 0.762
4.408ValSer: 4.408 ± 1.168
4.959ValThr: 4.959 ± 1.436
5.234ValVal: 5.234 ± 1.444
0.826ValTrp: 0.826 ± 0.443
4.132ValTyr: 4.132 ± 0.796
0.0ValXaa: 0.0 ± 0.0
Trp
0.275TrpAla: 0.275 ± 0.278
0.275TrpCys: 0.275 ± 0.383
1.377TrpAsp: 1.377 ± 0.727
1.653TrpGlu: 1.653 ± 0.645
0.826TrpPhe: 0.826 ± 0.547
0.826TrpGly: 0.826 ± 0.443
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.551TrpLys: 0.551 ± 0.379
0.551TrpLeu: 0.551 ± 0.355
0.551TrpMet: 0.551 ± 0.37
0.275TrpAsn: 0.275 ± 0.232
0.551TrpPro: 0.551 ± 0.481
1.102TrpGln: 1.102 ± 0.551
0.0TrpArg: 0.0 ± 0.0
0.275TrpSer: 0.275 ± 0.244
0.0TrpThr: 0.0 ± 0.0
0.275TrpVal: 0.275 ± 0.244
0.0TrpTrp: 0.0 ± 0.0
0.551TrpTyr: 0.551 ± 0.367
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.857TyrAla: 3.857 ± 1.198
1.102TyrCys: 1.102 ± 0.619
1.377TyrAsp: 1.377 ± 0.39
2.204TyrGlu: 2.204 ± 1.776
3.03TyrPhe: 3.03 ± 1.058
3.03TyrGly: 3.03 ± 0.695
0.551TyrHis: 0.551 ± 0.489
4.132TyrIle: 4.132 ± 1.355
3.03TyrLys: 3.03 ± 0.782
3.306TyrLeu: 3.306 ± 1.059
1.653TyrMet: 1.653 ± 0.493
4.683TyrAsn: 4.683 ± 1.237
1.102TyrPro: 1.102 ± 0.717
2.204TyrGln: 2.204 ± 0.973
1.377TyrArg: 1.377 ± 0.53
2.479TyrSer: 2.479 ± 1.116
2.755TyrThr: 2.755 ± 1.322
3.857TyrVal: 3.857 ± 1.333
0.551TyrTrp: 0.551 ± 0.382
1.928TyrTyr: 1.928 ± 0.862
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 15 proteins (3631 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski