Amino acid dipeptide frequency for Stenotrophomonas phage SMA7

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
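As a minimal sketch of how such frequencies could be computed, the snippet below counts overlapping adjacent residue pairs within each protein and reports them in per mille. The function name `dipeptide_permille`, the use of one-letter codes, and the toy sequences are assumptions for illustration; the page does not describe the exact counting procedure used by the database.

```python
from collections import Counter


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over all adjacent residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Overlapping windows of length 2; pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Made-up toy sequences (one-letter codes), not phage SMA7 data.
toy = ["AAGA", "GAA"]
freqs = dipeptide_permille(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

The pairs are ordered, so AG and GA are counted separately, which matches the separate AlaGly and GlyAla entries in the table.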

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
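The expected value under the uniform assumption is simple arithmetic, sketched below. The helper names and the plain greater-than/less-than comparison are assumptions; the page does not state which threshold it uses for the red and blue marking.

```python
def expected_permille(n_residue_types=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / n_residue_types ** 2


def classify(observed_permille, n_residue_types=20):
    """Label a value relative to the uniform expectation (assumed rule)."""
    expected = expected_permille(n_residue_types)
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"


print(expected_permille(20))   # 400 possible dipeptides
print(expected_permille(21))   # 441 possible dipeptides when Xaa is included
print(classify(20.937))        # AlaAla value from the table
print(classify(0.551))         # AlaCys value from the table
```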

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
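For example, the unit conversion can be written as follows; the function names are illustrative only.

```python
import math


def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala = 20.937  # AlaAla value from the table, in per mille
print(permille_to_fraction(ala_ala))
print(permille_to_percent(ala_ala))
```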

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 20.937 ± 2.134
AlaCys: 0.551 ± 0.501
AlaAsp: 7.163 ± 2.593
AlaGlu: 2.755 ± 2.181
AlaPhe: 1.102 ± 0.77
AlaGly: 11.57 ± 2.99
AlaHis: 3.306 ± 1.318
AlaIle: 4.959 ± 1.658
AlaLys: 7.163 ± 1.277
AlaLeu: 13.223 ± 3.609
AlaMet: 4.959 ± 2.462
AlaAsn: 2.755 ± 1.676
AlaPro: 3.857 ± 1.56
AlaGln: 8.264 ± 2.281
AlaArg: 10.468 ± 4.711
AlaSer: 7.163 ± 1.515
AlaThr: 6.061 ± 1.416
AlaVal: 11.57 ± 3.107
AlaTrp: 1.653 ± 1.317
AlaTyr: 1.653 ± 0.95
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 3.306 ± 1.151
CysCys: 0.0 ± 0.0
CysAsp: 0.551 ± 0.44
CysGlu: 0.0 ± 0.0
CysPhe: 0.551 ± 0.501
CysGly: 2.204 ± 1.255
CysHis: 0.0 ± 0.0
CysIle: 0.551 ± 0.588
CysLys: 1.102 ± 0.566
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 1.102 ± 0.878
CysGln: 0.551 ± 0.776
CysArg: 1.653 ± 0.891
CysSer: 0.0 ± 0.0
CysThr: 1.102 ± 0.878
CysVal: 2.755 ± 0.806
CysTrp: 0.551 ± 0.475
CysTyr: 0.551 ± 0.501
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.755 ± 0.936
AspCys: 0.551 ± 0.44
AspAsp: 4.408 ± 3.362
AspGlu: 3.857 ± 1.487
AspPhe: 2.755 ± 2.802
AspGly: 6.061 ± 1.723
AspHis: 0.0 ± 0.0
AspIle: 1.653 ± 0.551
AspLys: 2.755 ± 0.903
AspLeu: 3.857 ± 1.486
AspMet: 2.755 ± 1.176
AspAsn: 2.204 ± 1.313
AspPro: 2.204 ± 0.949
AspGln: 1.653 ± 0.63
AspArg: 3.857 ± 0.967
AspSer: 2.204 ± 0.993
AspThr: 2.204 ± 0.794
AspVal: 2.755 ± 2.056
AspTrp: 1.653 ± 0.851
AspTyr: 1.653 ± 1.031
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.408 ± 1.285
GluCys: 2.204 ± 2.478
GluAsp: 1.653 ± 1.763
GluGlu: 2.755 ± 1.252
GluPhe: 3.306 ± 1.447
GluGly: 2.204 ± 0.909
GluHis: 1.102 ± 0.97
GluIle: 2.204 ± 1.0
GluLys: 3.306 ± 1.357
GluLeu: 6.061 ± 2.051
GluMet: 0.551 ± 0.501
GluAsn: 1.102 ± 0.72
GluPro: 1.653 ± 1.609
GluGln: 2.755 ± 0.777
GluArg: 4.408 ± 2.77
GluSer: 3.306 ± 1.292
GluThr: 2.755 ± 1.717
GluVal: 6.061 ± 2.005
GluTrp: 0.551 ± 0.619
GluTyr: 1.653 ± 0.706
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.204 ± 0.931
PheCys: 1.653 ± 0.918
PheAsp: 1.102 ± 0.72
PheGlu: 1.102 ± 0.627
PhePhe: 2.755 ± 1.252
PheGly: 2.755 ± 1.378
PheHis: 0.0 ± 0.0
PheIle: 1.653 ± 1.363
PheLys: 1.102 ± 0.95
PheLeu: 3.306 ± 1.3
PheMet: 1.102 ± 0.696
PheAsn: 0.0 ± 0.0
PhePro: 1.102 ± 0.881
PheGln: 1.102 ± 1.101
PheArg: 3.306 ± 1.226
PheSer: 3.857 ± 1.288
PheThr: 2.755 ± 1.974
PheVal: 1.102 ± 0.77
PheTrp: 0.0 ± 0.0
PheTyr: 1.102 ± 0.566
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.713 ± 1.752
GlyCys: 0.551 ± 0.475
GlyAsp: 4.959 ± 2.77
GlyGlu: 6.061 ± 1.548
GlyPhe: 1.653 ± 1.064
GlyGly: 7.713 ± 1.989
GlyHis: 2.755 ± 1.252
GlyIle: 2.204 ± 0.689
GlyLys: 2.755 ± 1.433
GlyLeu: 6.061 ± 1.639
GlyMet: 4.408 ± 1.685
GlyAsn: 1.102 ± 0.808
GlyPro: 3.857 ± 2.103
GlyGln: 3.306 ± 0.984
GlyArg: 4.959 ± 2.108
GlySer: 3.857 ± 1.552
GlyThr: 6.061 ± 1.711
GlyVal: 6.612 ± 1.785
GlyTrp: 1.653 ± 1.026
GlyTyr: 1.653 ± 1.62
GlyXaa: 0.0 ± 0.0
His
HisAla: 3.857 ± 1.235
HisCys: 0.551 ± 0.501
HisAsp: 0.551 ± 0.588
HisGlu: 1.653 ± 1.321
HisPhe: 0.0 ± 0.0
HisGly: 1.653 ± 0.935
HisHis: 1.102 ± 0.586
HisIle: 2.204 ± 1.294
HisLys: 0.0 ± 0.0
HisLeu: 1.102 ± 0.688
HisMet: 0.551 ± 0.501
HisAsn: 0.0 ± 0.0
HisPro: 0.551 ± 0.501
HisGln: 0.551 ± 0.501
HisArg: 0.551 ± 0.44
HisSer: 1.102 ± 1.001
HisThr: 0.0 ± 0.0
HisVal: 1.102 ± 0.881
HisTrp: 0.551 ± 0.475
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.612 ± 1.223
IleCys: 0.551 ± 0.701
IleAsp: 2.755 ± 1.383
IleGlu: 4.408 ± 2.264
IlePhe: 1.102 ± 0.669
IleGly: 2.204 ± 1.387
IleHis: 0.551 ± 0.701
IleIle: 2.204 ± 1.294
IleLys: 1.653 ± 0.706
IleLeu: 3.857 ± 0.844
IleMet: 0.551 ± 0.701
IleAsn: 0.551 ± 0.619
IlePro: 1.102 ± 0.688
IleGln: 0.551 ± 0.701
IleArg: 4.408 ± 1.904
IleSer: 2.204 ± 1.298
IleThr: 1.653 ± 1.09
IleVal: 7.163 ± 2.505
IleTrp: 0.551 ± 0.44
IleTyr: 0.0 ± 0.0
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.51 ± 0.972
LysCys: 0.551 ± 0.475
LysAsp: 0.551 ± 0.44
LysGlu: 3.306 ± 1.228
LysPhe: 1.102 ± 0.72
LysGly: 4.408 ± 1.211
LysHis: 1.102 ± 0.881
LysIle: 2.204 ± 0.794
LysLys: 1.653 ± 0.947
LysLeu: 3.306 ± 1.206
LysMet: 0.0 ± 0.0
LysAsn: 1.102 ± 0.566
LysPro: 0.0 ± 0.0
LysGln: 1.653 ± 0.753
LysArg: 3.306 ± 1.55
LysSer: 2.755 ± 1.383
LysThr: 2.755 ± 1.221
LysVal: 1.653 ± 0.758
LysTrp: 1.102 ± 0.949
LysTyr: 1.102 ± 0.881
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 10.468 ± 4.592
LeuCys: 0.551 ± 0.501
LeuAsp: 8.264 ± 2.704
LeuGlu: 3.306 ± 1.417
LeuPhe: 1.102 ± 1.101
LeuGly: 5.51 ± 1.708
LeuHis: 1.653 ± 0.908
LeuIle: 4.959 ± 1.411
LeuLys: 0.551 ± 0.44
LeuLeu: 3.857 ± 1.257
LeuMet: 1.653 ± 1.027
LeuAsn: 2.755 ± 1.109
LeuPro: 6.612 ± 1.28
LeuGln: 2.204 ± 1.407
LeuArg: 8.264 ± 2.631
LeuSer: 5.51 ± 1.932
LeuThr: 4.959 ± 2.213
LeuVal: 8.264 ± 3.391
LeuTrp: 3.306 ± 1.61
LeuTyr: 1.102 ± 0.566
LeuXaa: 0.0 ± 0.0
Met
MetAla: 4.959 ± 1.683
MetCys: 0.0 ± 0.0
MetAsp: 0.551 ± 0.588
MetGlu: 0.551 ± 0.619
MetPhe: 1.102 ± 0.699
MetGly: 5.51 ± 3.898
MetHis: 0.0 ± 0.0
MetIle: 2.204 ± 0.851
MetLys: 1.102 ± 0.767
MetLeu: 2.755 ± 1.33
MetMet: 0.551 ± 0.619
MetAsn: 0.0 ± 0.0
MetPro: 1.102 ± 0.699
MetGln: 0.551 ± 0.776
MetArg: 0.551 ± 0.501
MetSer: 0.551 ± 0.701
MetThr: 1.102 ± 0.952
MetVal: 1.102 ± 0.566
MetTrp: 0.551 ± 0.475
MetTyr: 0.551 ± 0.701
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.204 ± 0.833
AsnCys: 0.551 ± 0.475
AsnAsp: 1.102 ± 0.68
AsnGlu: 2.204 ± 1.351
AsnPhe: 1.653 ± 0.551
AsnGly: 1.653 ± 0.935
AsnHis: 0.551 ± 0.67
AsnIle: 0.0 ± 0.0
AsnLys: 0.551 ± 0.619
AsnLeu: 2.204 ± 1.122
AsnMet: 0.0 ± 0.0
AsnAsn: 0.0 ± 0.0
AsnPro: 0.551 ± 0.475
AsnGln: 0.0 ± 0.0
AsnArg: 2.204 ± 1.387
AsnSer: 0.551 ± 0.475
AsnThr: 1.102 ± 0.881
AsnVal: 1.653 ± 1.233
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.0 ± 0.0
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 9.917 ± 1.91
ProCys: 0.0 ± 0.0
ProAsp: 3.306 ± 1.301
ProGlu: 6.612 ± 1.525
ProPhe: 1.653 ± 0.903
ProGly: 3.857 ± 1.51
ProHis: 1.102 ± 0.586
ProIle: 1.102 ± 1.001
ProLys: 1.653 ± 0.947
ProLeu: 3.306 ± 0.881
ProMet: 0.0 ± 0.0
ProAsn: 0.0 ± 0.0
ProPro: 1.653 ± 0.997
ProGln: 1.102 ± 0.781
ProArg: 3.857 ± 1.592
ProSer: 2.204 ± 0.845
ProThr: 3.306 ± 0.779
ProVal: 2.755 ± 0.902
ProTrp: 2.204 ± 1.004
ProTyr: 1.102 ± 0.692
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.408 ± 1.627
GlnCys: 0.551 ± 0.727
GlnAsp: 2.204 ± 1.115
GlnGlu: 0.551 ± 0.619
GlnPhe: 0.551 ± 0.475
GlnGly: 1.653 ± 0.947
GlnHis: 0.0 ± 0.0
GlnIle: 2.755 ± 2.018
GlnLys: 2.204 ± 0.872
GlnLeu: 2.204 ± 1.008
GlnMet: 0.0 ± 0.0
GlnAsn: 0.0 ± 0.0
GlnPro: 4.959 ± 1.638
GlnGln: 0.0 ± 0.0
GlnArg: 3.857 ± 1.471
GlnSer: 1.653 ± 0.758
GlnThr: 1.653 ± 1.136
GlnVal: 2.204 ± 0.734
GlnTrp: 1.653 ± 1.104
GlnTyr: 1.102 ± 0.566
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 11.019 ± 1.944
ArgCys: 1.653 ± 0.905
ArgAsp: 2.755 ± 1.083
ArgGlu: 2.755 ± 0.784
ArgPhe: 2.755 ± 1.206
ArgGly: 4.408 ± 1.39
ArgHis: 1.653 ± 0.896
ArgIle: 3.857 ± 2.072
ArgLys: 2.755 ± 0.735
ArgLeu: 6.612 ± 1.857
ArgMet: 2.204 ± 1.102
ArgAsn: 2.755 ± 1.249
ArgPro: 5.51 ± 2.308
ArgGln: 2.755 ± 1.47
ArgArg: 8.264 ± 3.556
ArgSer: 4.959 ± 2.541
ArgThr: 3.306 ± 1.757
ArgVal: 5.51 ± 2.064
ArgTrp: 2.204 ± 1.496
ArgTyr: 1.102 ± 0.566
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.959 ± 2.203
SerCys: 1.653 ± 0.905
SerAsp: 2.204 ± 0.974
SerGlu: 1.653 ± 0.947
SerPhe: 4.408 ± 1.386
SerGly: 3.857 ± 1.076
SerHis: 0.0 ± 0.0
SerIle: 2.204 ± 0.657
SerLys: 2.204 ± 0.667
SerLeu: 4.959 ± 1.249
SerMet: 1.102 ± 0.783
SerAsn: 1.653 ± 0.947
SerPro: 2.755 ± 1.401
SerGln: 2.755 ± 1.151
SerArg: 3.306 ± 2.471
SerSer: 5.51 ± 3.301
SerThr: 5.51 ± 2.853
SerVal: 4.408 ± 1.398
SerTrp: 0.551 ± 0.67
SerTyr: 1.102 ± 0.566
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 11.019 ± 3.607
ThrCys: 0.0 ± 0.0
ThrAsp: 2.204 ± 1.256
ThrGlu: 3.306 ± 1.456
ThrPhe: 0.551 ± 0.44
ThrGly: 5.51 ± 1.448
ThrHis: 0.551 ± 0.501
ThrIle: 0.551 ± 0.619
ThrLys: 2.204 ± 1.294
ThrLeu: 6.612 ± 1.43
ThrMet: 0.551 ± 0.648
ThrAsn: 1.102 ± 0.68
ThrPro: 3.857 ± 1.51
ThrGln: 2.204 ± 1.038
ThrArg: 3.857 ± 0.963
ThrSer: 2.755 ± 1.052
ThrThr: 4.408 ± 1.645
ThrVal: 4.959 ± 1.693
ThrTrp: 2.204 ± 0.833
ThrTyr: 1.102 ± 0.566
ThrXaa: 0.0 ± 0.0
Val
ValAla: 9.917 ± 3.055
ValCys: 2.204 ± 0.854
ValAsp: 3.857 ± 1.162
ValGlu: 3.857 ± 1.073
ValPhe: 2.755 ± 1.766
ValGly: 6.061 ± 1.137
ValHis: 1.102 ± 0.586
ValIle: 3.857 ± 1.227
ValLys: 2.755 ± 1.224
ValLeu: 8.815 ± 3.398
ValMet: 2.755 ± 1.611
ValAsn: 1.102 ± 0.944
ValPro: 3.306 ± 1.487
ValGln: 1.653 ± 0.896
ValArg: 6.061 ± 1.41
ValSer: 4.408 ± 2.599
ValThr: 6.612 ± 2.46
ValVal: 7.713 ± 2.537
ValTrp: 0.551 ± 0.501
ValTyr: 2.204 ± 0.912
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 3.306 ± 1.859
TrpCys: 2.204 ± 1.679
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.551 ± 0.44
TrpPhe: 2.204 ± 1.1
TrpGly: 0.0 ± 0.0
TrpHis: 0.551 ± 0.701
TrpIle: 1.102 ± 0.795
TrpLys: 1.102 ± 0.566
TrpLeu: 1.653 ± 0.976
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 2.204 ± 0.971
TrpGln: 0.0 ± 0.0
TrpArg: 2.204 ± 1.54
TrpSer: 1.653 ± 0.778
TrpThr: 1.102 ± 1.34
TrpVal: 1.102 ± 0.949
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.102 ± 0.566
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.653 ± 0.72
TyrCys: 0.0 ± 0.0
TyrAsp: 1.653 ± 0.63
TyrGlu: 2.755 ± 1.12
TyrPhe: 0.0 ± 0.0
TyrGly: 1.102 ± 0.566
TyrHis: 0.551 ± 0.44
TyrIle: 2.204 ± 0.911
TyrLys: 0.551 ± 0.44
TyrLeu: 1.102 ± 0.77
TyrMet: 1.102 ± 0.861
TyrAsn: 0.551 ± 0.501
TyrPro: 2.755 ± 1.357
TyrGln: 0.551 ± 0.619
TyrArg: 0.0 ± 0.0
TyrSer: 0.551 ± 0.44
TyrThr: 1.102 ± 0.566
TyrVal: 1.102 ± 0.772
TyrTrp: 0.551 ± 0.44
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (1816 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
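One way such a protein-level bootstrap could look is sketched below: whole proteins are resampled with replacement, the dipeptide frequencies are recomputed for each replicate, and the spread across replicates is taken as the error. The function names, the fixed seed, the use of the population standard deviation, and the toy sequences are all assumptions; the database's exact procedure is not described on this page.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(sequences):
    """Per mille frequency of each adjacent residue pair (overlapping windows)."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_sd(sequences, n_replicates=100, seed=0):
    """Standard deviation of each dipeptide frequency across bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    replicates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        resampled = [rng.choice(sequences) for _ in sequences]
        replicates.append(dipeptide_permille(resampled))
    pairs = set().union(*replicates)
    # A dipeptide absent from a replicate contributes a frequency of 0.
    return {
        pair: statistics.pstdev([rep.get(pair, 0.0) for rep in replicates])
        for pair in pairs
    }


# Made-up toy proteome (one-letter codes), not phage SMA7 data.
toy = ["MAAGL", "MKAAV", "MGLAA"]
errors = bootstrap_sd(toy)
for pair in sorted(errors):
    print(pair, round(errors[pair], 3))
```

Resampling proteins rather than individual residues keeps each protein's composition intact within a replicate, which is presumably why a small proteome of 10 proteins yields relatively large errors.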

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski