Amino acid dipeptide frequency for Enterococcus phage heks

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
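As a minimal sketch of such a calculation (not necessarily how Proteome-pI computes it), dipeptide frequencies can be obtained by counting overlapping residue pairs within each protein and dividing by the total number of pairs. The function name `dipeptide_frequencies`, the use of one-letter codes, and the mapping of every non-standard letter to `X` (Xaa) are assumptions made for illustration.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins.

    Pairs overlap and never span two proteins. Any letter outside the
    20 standard amino acids is mapped to 'X' (an assumption of this sketch).
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from this proteome).
freqs = dipeptide_frequencies(["MKKLE", "MKE"])
```

In the toy input, the pair `MK` occurs twice among six pairs in total, so its frequency is one third, or about 333‰.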

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
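The arithmetic behind this baseline can be sketched as follows. The helper names `expected_uniform_per_mille` and `classify` are hypothetical, and the comparison against a uniform baseline is an assumption about how over- and underrepresentation is decided; the page does not spell out the exact rule used for the colours.

```python
def expected_uniform_per_mille(n_amino_acids=20):
    """Expected frequency (per mille) of one dipeptide if all were equally likely."""
    return 1000.0 / (n_amino_acids ** 2)


def classify(observed_per_mille, expected_per_mille):
    """Label a dipeptide relative to the expected frequency (assumed rule)."""
    if observed_per_mille > expected_per_mille:
        return "over"   # would be shown in red
    if observed_per_mille < expected_per_mille:
        return "under"  # would be shown in blue
    return "as expected"


expected = expected_uniform_per_mille()        # 1000 / 400 = 2.5 per mille
expected_with_xaa = expected_uniform_per_mille(21)  # 1000 / 441

# Two values taken from the table: GluLeu (10.014) and CysCys (0.0).
glu_leu = classify(10.014, expected)
cys_cys = classify(0.0, expected)
```

With these inputs GluLeu is classified as overrepresented and CysCys as underrepresented.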

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.417 ± 0.224
AlaCys: 0.417 ± 0.225
AlaAsp: 2.921 ± 0.535
AlaGlu: 4.089 ± 0.66
AlaPhe: 2.253 ± 0.431
AlaGly: 3.338 ± 0.6
AlaHis: 0.918 ± 0.294
AlaIle: 5.424 ± 0.868
AlaLys: 5.508 ± 0.574
AlaLeu: 4.339 ± 0.655
AlaMet: 2.67 ± 0.557
AlaAsn: 3.588 ± 0.521
AlaPro: 2.17 ± 0.493
AlaGln: 2.42 ± 0.55
AlaArg: 2.337 ± 0.405
AlaSer: 2.921 ± 0.697
AlaThr: 4.339 ± 0.726
AlaVal: 3.922 ± 0.587
AlaTrp: 0.584 ± 0.195
AlaTyr: 3.088 ± 0.421
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.417 ± 0.228
CysCys: 0.0 ± 0.0
CysAsp: 0.668 ± 0.234
CysGlu: 0.668 ± 0.26
CysPhe: 0.0 ± 0.0
CysGly: 0.668 ± 0.287
CysHis: 0.25 ± 0.15
CysIle: 0.25 ± 0.188
CysLys: 1.085 ± 0.333
CysLeu: 0.584 ± 0.261
CysMet: 0.167 ± 0.133
CysAsn: 0.751 ± 0.28
CysPro: 0.0 ± 0.0
CysGln: 0.167 ± 0.11
CysArg: 0.334 ± 0.17
CysSer: 0.751 ± 0.29
CysThr: 0.668 ± 0.27
CysVal: 0.334 ± 0.168
CysTrp: 0.083 ± 0.093
CysTyr: 0.167 ± 0.126
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.505 ± 0.59
AspCys: 0.417 ± 0.17
AspAsp: 2.921 ± 0.54
AspGlu: 5.424 ± 0.731
AspPhe: 2.837 ± 0.551
AspGly: 5.341 ± 0.6
AspHis: 0.501 ± 0.271
AspIle: 4.423 ± 0.639
AspLys: 5.341 ± 0.824
AspLeu: 5.925 ± 0.728
AspMet: 2.17 ± 0.378
AspAsn: 4.339 ± 0.604
AspPro: 1.752 ± 0.369
AspGln: 1.168 ± 0.335
AspArg: 2.587 ± 0.486
AspSer: 2.587 ± 0.41
AspThr: 2.837 ± 0.559
AspVal: 4.339 ± 0.598
AspTrp: 1.001 ± 0.259
AspTyr: 2.837 ± 0.471
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.423 ± 0.732
GluCys: 0.751 ± 0.267
GluAsp: 4.924 ± 0.755
GluGlu: 7.01 ± 1.157
GluPhe: 3.839 ± 0.619
GluGly: 3.171 ± 0.514
GluHis: 1.085 ± 0.272
GluIle: 4.423 ± 0.563
GluLys: 5.842 ± 0.594
GluLeu: 10.014 ± 1.105
GluMet: 3.088 ± 0.601
GluAsn: 4.339 ± 0.556
GluPro: 2.42 ± 0.57
GluGln: 2.837 ± 0.619
GluArg: 4.423 ± 0.666
GluSer: 3.839 ± 0.578
GluThr: 4.506 ± 0.627
GluVal: 6.426 ± 1.152
GluTrp: 1.252 ± 0.41
GluTyr: 4.84 ± 0.886
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.252 ± 0.302
PheCys: 0.25 ± 0.137
PheAsp: 2.587 ± 0.544
PheGlu: 2.837 ± 0.546
PhePhe: 0.918 ± 0.28
PheGly: 3.255 ± 0.556
PheHis: 0.167 ± 0.121
PheIle: 3.338 ± 0.613
PheLys: 3.839 ± 0.664
PheLeu: 2.337 ± 0.478
PheMet: 0.835 ± 0.305
PheAsn: 3.088 ± 0.49
PhePro: 0.584 ± 0.213
PheGln: 1.502 ± 0.433
PheArg: 1.502 ± 0.345
PheSer: 2.337 ± 0.552
PheThr: 3.755 ± 0.641
PheVal: 2.67 ± 0.453
PheTrp: 0.334 ± 0.179
PheTyr: 1.419 ± 0.364
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.922 ± 1.034
GlyCys: 0.584 ± 0.239
GlyAsp: 3.755 ± 0.608
GlyGlu: 4.089 ± 0.508
GlyPhe: 3.171 ± 0.46
GlyGly: 4.006 ± 0.768
GlyHis: 1.085 ± 0.28
GlyIle: 5.424 ± 0.928
GlyLys: 5.341 ± 0.762
GlyLeu: 5.925 ± 0.817
GlyMet: 1.669 ± 0.362
GlyAsn: 3.755 ± 0.448
GlyPro: 0.668 ± 0.304
GlyGln: 2.253 ± 0.393
GlyArg: 2.17 ± 0.444
GlySer: 3.004 ± 0.545
GlyThr: 4.423 ± 0.854
GlyVal: 4.423 ± 0.8
GlyTrp: 1.001 ± 0.275
GlyTyr: 2.504 ± 0.457
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.001 ± 0.278
HisCys: 0.334 ± 0.181
HisAsp: 0.584 ± 0.229
HisGlu: 1.335 ± 0.33
HisPhe: 0.835 ± 0.353
HisGly: 1.252 ± 0.363
HisHis: 0.25 ± 0.135
HisIle: 0.918 ± 0.244
HisLys: 1.586 ± 0.36
HisLeu: 1.085 ± 0.403
HisMet: 0.334 ± 0.174
HisAsn: 1.085 ± 0.331
HisPro: 0.584 ± 0.227
HisGln: 0.334 ± 0.225
HisArg: 0.668 ± 0.219
HisSer: 0.584 ± 0.219
HisThr: 0.918 ± 0.356
HisVal: 0.334 ± 0.156
HisTrp: 0.25 ± 0.132
HisTyr: 1.085 ± 0.317
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.839 ± 0.613
IleCys: 0.501 ± 0.201
IleAsp: 5.257 ± 0.579
IleGlu: 6.342 ± 0.963
IlePhe: 1.752 ± 0.397
IleGly: 3.922 ± 0.69
IleHis: 1.252 ± 0.336
IleIle: 4.757 ± 0.64
IleLys: 5.675 ± 0.795
IleLeu: 5.424 ± 0.645
IleMet: 1.419 ± 0.41
IleAsn: 4.59 ± 0.787
IlePro: 2.837 ± 0.435
IleGln: 3.004 ± 0.385
IleArg: 1.669 ± 0.372
IleSer: 4.423 ± 0.677
IleThr: 3.338 ± 0.516
IleVal: 4.006 ± 0.686
IleTrp: 0.835 ± 0.23
IleTyr: 1.919 ± 0.377
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.342 ± 0.808
LysCys: 0.584 ± 0.27
LysAsp: 5.174 ± 0.716
LysGlu: 9.347 ± 1.445
LysPhe: 3.004 ± 0.392
LysGly: 5.424 ± 0.699
LysHis: 1.419 ± 0.42
LysIle: 3.505 ± 0.596
LysLys: 7.511 ± 0.783
LysLeu: 6.092 ± 0.878
LysMet: 3.422 ± 0.542
LysAsn: 5.257 ± 0.537
LysPro: 2.837 ± 0.537
LysGln: 3.255 ± 0.556
LysArg: 4.256 ± 0.489
LysSer: 3.588 ± 0.564
LysThr: 4.924 ± 0.567
LysVal: 5.925 ± 0.529
LysTrp: 1.085 ± 0.345
LysTyr: 3.505 ± 0.483
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.091 ± 0.871
LeuCys: 0.668 ± 0.259
LeuAsp: 7.093 ± 0.791
LeuGlu: 8.178 ± 0.843
LeuPhe: 2.837 ± 0.443
LeuGly: 5.341 ± 0.908
LeuHis: 0.751 ± 0.288
LeuIle: 4.256 ± 0.765
LeuLys: 8.262 ± 0.767
LeuLeu: 6.676 ± 1.064
LeuMet: 2.253 ± 0.504
LeuAsn: 6.676 ± 0.695
LeuPro: 3.088 ± 0.559
LeuGln: 3.755 ± 0.556
LeuArg: 2.754 ± 0.567
LeuSer: 3.839 ± 0.635
LeuThr: 5.007 ± 0.629
LeuVal: 5.758 ± 0.658
LeuTrp: 1.001 ± 0.268
LeuTyr: 2.42 ± 0.551
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.919 ± 0.443
MetCys: 0.334 ± 0.179
MetAsp: 1.836 ± 0.425
MetGlu: 2.587 ± 0.566
MetPhe: 1.001 ± 0.263
MetGly: 1.669 ± 0.546
MetHis: 0.417 ± 0.205
MetIle: 2.086 ± 0.484
MetLys: 2.337 ± 0.476
MetLeu: 3.255 ± 0.742
MetMet: 0.918 ± 0.267
MetAsn: 2.003 ± 0.442
MetPro: 0.668 ± 0.294
MetGln: 1.252 ± 0.352
MetArg: 1.335 ± 0.361
MetSer: 1.502 ± 0.294
MetThr: 1.586 ± 0.365
MetVal: 1.335 ± 0.37
MetTrp: 0.584 ± 0.284
MetTyr: 1.335 ± 0.428
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.423 ± 0.882
AsnCys: 0.25 ± 0.132
AsnAsp: 3.505 ± 0.573
AsnGlu: 5.424 ± 0.661
AsnPhe: 1.836 ± 0.458
AsnGly: 6.259 ± 0.892
AsnHis: 0.751 ± 0.27
AsnIle: 4.173 ± 0.689
AsnLys: 5.925 ± 0.771
AsnLeu: 5.424 ± 0.679
AsnMet: 2.337 ± 0.477
AsnAsn: 3.922 ± 0.634
AsnPro: 1.919 ± 0.351
AsnGln: 1.669 ± 0.283
AsnArg: 1.252 ± 0.302
AsnSer: 3.004 ± 0.567
AsnThr: 4.506 ± 0.753
AsnVal: 3.839 ± 0.511
AsnTrp: 0.835 ± 0.311
AsnTyr: 3.171 ± 0.603
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.919 ± 0.48
ProCys: 0.167 ± 0.114
ProAsp: 1.586 ± 0.338
ProGlu: 3.255 ± 0.476
ProPhe: 1.168 ± 0.286
ProGly: 0.167 ± 0.126
ProHis: 0.25 ± 0.143
ProIle: 1.669 ± 0.364
ProLys: 3.171 ± 0.638
ProLeu: 3.088 ± 0.507
ProMet: 1.168 ± 0.304
ProAsn: 1.752 ± 0.485
ProPro: 0.584 ± 0.22
ProGln: 1.419 ± 0.43
ProArg: 0.501 ± 0.183
ProSer: 1.502 ± 0.401
ProThr: 1.586 ± 0.332
ProVal: 1.836 ± 0.386
ProTrp: 0.25 ± 0.136
ProTyr: 1.502 ± 0.33
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.921 ± 0.68
GlnCys: 0.668 ± 0.266
GlnAsp: 1.752 ± 0.374
GlnGlu: 3.004 ± 0.616
GlnPhe: 1.502 ± 0.344
GlnGly: 1.752 ± 0.363
GlnHis: 0.835 ± 0.262
GlnIle: 2.587 ± 0.499
GlnLys: 2.086 ± 0.473
GlnLeu: 3.422 ± 0.418
GlnMet: 0.584 ± 0.215
GlnAsn: 2.253 ± 0.372
GlnPro: 1.001 ± 0.267
GlnGln: 2.003 ± 0.383
GlnArg: 2.17 ± 0.358
GlnSer: 2.337 ± 0.433
GlnThr: 2.086 ± 0.445
GlnVal: 2.42 ± 0.495
GlnTrp: 0.501 ± 0.36
GlnTyr: 2.003 ± 0.401
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.586 ± 0.377
ArgCys: 0.501 ± 0.209
ArgAsp: 2.337 ± 0.439
ArgGlu: 2.337 ± 0.468
ArgPhe: 2.17 ± 0.312
ArgGly: 1.669 ± 0.409
ArgHis: 0.918 ± 0.306
ArgIle: 2.587 ± 0.446
ArgLys: 2.921 ± 0.715
ArgLeu: 3.422 ± 0.649
ArgMet: 1.252 ± 0.327
ArgAsn: 2.42 ± 0.461
ArgPro: 1.085 ± 0.195
ArgGln: 1.919 ± 0.457
ArgArg: 1.252 ± 0.256
ArgSer: 1.919 ± 0.551
ArgThr: 1.919 ± 0.46
ArgVal: 2.337 ± 0.482
ArgTrp: 0.334 ± 0.177
ArgTyr: 2.003 ± 0.412
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.088 ± 0.56
SerCys: 0.083 ± 0.09
SerAsp: 3.505 ± 0.617
SerGlu: 3.922 ± 0.611
SerPhe: 2.587 ± 0.387
SerGly: 4.673 ± 0.759
SerHis: 1.752 ± 0.423
SerIle: 3.839 ± 0.887
SerLys: 4.339 ± 0.673
SerLeu: 3.672 ± 0.748
SerMet: 1.419 ± 0.363
SerAsn: 2.754 ± 0.539
SerPro: 1.085 ± 0.288
SerGln: 1.919 ± 0.518
SerArg: 1.419 ± 0.45
SerSer: 2.587 ± 0.691
SerThr: 3.588 ± 0.719
SerVal: 3.255 ± 0.419
SerTrp: 0.668 ± 0.266
SerTyr: 2.086 ± 0.689
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.422 ± 0.576
ThrCys: 0.25 ± 0.152
ThrAsp: 3.171 ± 0.574
ThrGlu: 4.59 ± 0.772
ThrPhe: 2.17 ± 0.464
ThrGly: 4.59 ± 0.648
ThrHis: 1.001 ± 0.34
ThrIle: 4.59 ± 0.76
ThrLys: 5.424 ± 0.79
ThrLeu: 4.84 ± 0.699
ThrMet: 1.335 ± 0.254
ThrAsn: 3.171 ± 0.483
ThrPro: 2.17 ± 0.5
ThrGln: 3.338 ± 0.498
ThrArg: 1.919 ± 0.375
ThrSer: 2.754 ± 0.392
ThrThr: 4.506 ± 0.927
ThrVal: 4.339 ± 0.746
ThrTrp: 0.835 ± 0.368
ThrTyr: 2.253 ± 0.533
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.257 ± 0.524
ValCys: 0.501 ± 0.211
ValAsp: 3.922 ± 0.492
ValGlu: 4.59 ± 0.687
ValPhe: 2.837 ± 0.448
ValGly: 3.505 ± 0.538
ValHis: 0.835 ± 0.35
ValIle: 3.839 ± 0.673
ValLys: 5.424 ± 0.818
ValLeu: 5.758 ± 0.688
ValMet: 1.586 ± 0.38
ValAsn: 4.84 ± 0.909
ValPro: 1.836 ± 0.335
ValGln: 2.003 ± 0.629
ValArg: 2.587 ± 0.468
ValSer: 5.508 ± 0.682
ValThr: 3.171 ± 0.535
ValVal: 4.924 ± 0.804
ValTrp: 1.001 ± 0.363
ValTyr: 2.086 ± 0.484
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.417 ± 0.182
TrpCys: 0.167 ± 0.15
TrpAsp: 1.001 ± 0.488
TrpGlu: 1.168 ± 0.314
TrpPhe: 0.751 ± 0.267
TrpGly: 1.001 ± 0.34
TrpHis: 0.25 ± 0.138
TrpIle: 0.417 ± 0.218
TrpLys: 1.419 ± 0.337
TrpLeu: 1.252 ± 0.299
TrpMet: 0.083 ± 0.09
TrpAsn: 0.417 ± 0.16
TrpPro: 0.0 ± 0.0
TrpGln: 0.417 ± 0.189
TrpArg: 0.835 ± 0.3
TrpSer: 0.835 ± 0.242
TrpThr: 0.751 ± 0.247
TrpVal: 1.419 ± 0.317
TrpTrp: 0.334 ± 0.151
TrpTyr: 0.25 ± 0.139
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.504 ± 0.461
TyrCys: 0.584 ± 0.237
TyrAsp: 3.755 ± 0.652
TyrGlu: 3.255 ± 0.501
TyrPhe: 1.335 ± 0.327
TyrGly: 2.17 ± 0.431
TyrHis: 0.668 ± 0.263
TyrIle: 4.256 ± 0.605
TyrLys: 3.505 ± 0.57
TyrLeu: 3.171 ± 0.486
TyrMet: 1.168 ± 0.3
TyrAsn: 3.422 ± 0.584
TyrPro: 1.252 ± 0.296
TyrGln: 1.252 ± 0.377
TyrArg: 0.918 ± 0.381
TyrSer: 2.42 ± 0.56
TyrThr: 2.253 ± 0.56
TyrVal: 2.17 ± 0.411
TyrTrp: 0.417 ± 0.19
TyrTyr: 2.086 ± 0.498
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 64 proteins (11984 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
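A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the resulting estimates is reported as the error. This is only an illustration under stated assumptions; the function names, the fixed seed, and the use of the sample standard deviation as the error measure are not taken from the source.

```python
import random
import statistics


def pair_frequency(proteins, pair):
    """Frequency (per mille) of one dipeptide over a list of sequences."""
    total = 0
    hits = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Each round draws len(proteins) whole proteins with replacement.
    n_rounds=100 mirrors the "x100" in the note; the seed is an assumption
    added only to make the sketch reproducible.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)


# Toy input (hypothetical sequences, not taken from this proteome).
toy = ["MKKLE", "MKE", "AAAA"]
error_mk = bootstrap_error(toy, "MK")
error_absent = bootstrap_error(toy, "WW")
```

Resampling whole proteins rather than individual dipeptides keeps pairs from the same protein together, so the estimate reflects variation between proteins.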

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski