Amino acid dipepetide frequency for Enterobacteria phage P4 (Bacteriophage P4)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
13.359AlaAla: 13.359 ± 2.778
1.908AlaCys: 1.908 ± 0.741
4.135AlaAsp: 4.135 ± 1.06
5.407AlaGlu: 5.407 ± 0.795
2.226AlaPhe: 2.226 ± 0.359
6.361AlaGly: 6.361 ± 1.59
1.59AlaHis: 1.59 ± 1.054
6.679AlaIle: 6.679 ± 1.489
4.135AlaLys: 4.135 ± 1.239
10.178AlaLeu: 10.178 ± 1.493
2.863AlaMet: 2.863 ± 0.737
2.226AlaAsn: 2.226 ± 0.786
4.135AlaPro: 4.135 ± 0.867
3.817AlaGln: 3.817 ± 1.342
7.952AlaArg: 7.952 ± 1.825
4.771AlaSer: 4.771 ± 0.892
5.407AlaThr: 5.407 ± 0.931
6.361AlaVal: 6.361 ± 1.382
1.272AlaTrp: 1.272 ± 0.788
3.181AlaTyr: 3.181 ± 1.015
0.0AlaXaa: 0.0 ± 0.0
Cys
1.908CysAla: 1.908 ± 1.042
0.318CysCys: 0.318 ± 0.381
1.272CysAsp: 1.272 ± 0.512
1.272CysGlu: 1.272 ± 0.468
0.0CysPhe: 0.0 ± 0.0
2.226CysGly: 2.226 ± 1.075
0.318CysHis: 0.318 ± 0.242
0.318CysIle: 0.318 ± 0.306
0.318CysLys: 0.318 ± 0.381
0.318CysLeu: 0.318 ± 0.344
0.318CysMet: 0.318 ± 0.306
0.318CysAsn: 0.318 ± 0.242
1.272CysPro: 1.272 ± 0.438
0.636CysGln: 0.636 ± 0.345
1.59CysArg: 1.59 ± 0.813
1.59CysSer: 1.59 ± 0.874
0.318CysThr: 0.318 ± 0.381
0.318CysVal: 0.318 ± 0.306
0.0CysTrp: 0.0 ± 0.0
0.318CysTyr: 0.318 ± 0.306
0.0CysXaa: 0.0 ± 0.0
Asp
5.407AspAla: 5.407 ± 1.761
0.318AspCys: 0.318 ± 0.344
2.545AspAsp: 2.545 ± 0.536
3.817AspGlu: 3.817 ± 0.671
3.181AspPhe: 3.181 ± 0.824
3.817AspGly: 3.817 ± 0.944
1.59AspHis: 1.59 ± 0.764
3.817AspIle: 3.817 ± 1.536
2.226AspLys: 2.226 ± 0.947
3.181AspLeu: 3.181 ± 1.09
0.636AspMet: 0.636 ± 0.509
1.908AspAsn: 1.908 ± 0.858
3.181AspPro: 3.181 ± 1.295
0.636AspGln: 0.636 ± 0.37
3.499AspArg: 3.499 ± 1.195
4.771AspSer: 4.771 ± 1.531
1.908AspThr: 1.908 ± 0.673
3.817AspVal: 3.817 ± 0.958
1.272AspTrp: 1.272 ± 0.716
1.272AspTyr: 1.272 ± 0.595
0.0AspXaa: 0.0 ± 0.0
Glu
6.679GluAla: 6.679 ± 0.982
0.318GluCys: 0.318 ± 0.242
2.545GluAsp: 2.545 ± 0.751
2.226GluGlu: 2.226 ± 0.895
3.181GluPhe: 3.181 ± 1.045
2.226GluGly: 2.226 ± 0.871
1.908GluHis: 1.908 ± 0.75
2.545GluIle: 2.545 ± 0.93
5.407GluLys: 5.407 ± 1.346
9.542GluLeu: 9.542 ± 2.197
1.59GluMet: 1.59 ± 0.889
4.135GluAsn: 4.135 ± 1.469
0.636GluPro: 0.636 ± 0.451
1.908GluGln: 1.908 ± 0.775
5.407GluArg: 5.407 ± 1.372
3.499GluSer: 3.499 ± 1.045
3.181GluThr: 3.181 ± 1.304
4.453GluVal: 4.453 ± 1.228
0.954GluTrp: 0.954 ± 0.499
2.226GluTyr: 2.226 ± 0.853
0.0GluXaa: 0.0 ± 0.0
Phe
0.954PheAla: 0.954 ± 0.426
0.636PheCys: 0.636 ± 0.572
2.863PheAsp: 2.863 ± 0.553
2.863PheGlu: 2.863 ± 1.411
0.318PhePhe: 0.318 ± 0.381
1.272PheGly: 1.272 ± 0.73
0.318PheHis: 0.318 ± 0.242
1.908PheIle: 1.908 ± 0.863
2.226PheLys: 2.226 ± 1.199
2.545PheLeu: 2.545 ± 0.78
0.954PheMet: 0.954 ± 0.464
0.0PheAsn: 0.0 ± 0.0
2.863PhePro: 2.863 ± 0.872
0.636PheGln: 0.636 ± 0.333
1.908PheArg: 1.908 ± 0.431
3.181PheSer: 3.181 ± 0.798
2.226PheThr: 2.226 ± 0.681
2.226PheVal: 2.226 ± 1.151
0.318PheTrp: 0.318 ± 0.242
0.636PheTyr: 0.636 ± 0.501
0.0PheXaa: 0.0 ± 0.0
Gly
2.545GlyAla: 2.545 ± 0.818
0.318GlyCys: 0.318 ± 0.381
5.407GlyAsp: 5.407 ± 1.399
4.135GlyGlu: 4.135 ± 1.215
3.499GlyPhe: 3.499 ± 0.739
4.453GlyGly: 4.453 ± 1.658
1.272GlyHis: 1.272 ± 0.432
2.545GlyIle: 2.545 ± 0.934
2.226GlyLys: 2.226 ± 0.63
5.725GlyLeu: 5.725 ± 1.184
2.545GlyMet: 2.545 ± 1.024
1.59GlyAsn: 1.59 ± 0.477
0.636GlyPro: 0.636 ± 0.372
2.226GlyGln: 2.226 ± 0.808
4.453GlyArg: 4.453 ± 0.945
3.181GlySer: 3.181 ± 1.023
2.863GlyThr: 2.863 ± 1.048
3.499GlyVal: 3.499 ± 1.239
0.954GlyTrp: 0.954 ± 0.715
3.181GlyTyr: 3.181 ± 0.645
0.0GlyXaa: 0.0 ± 0.0
His
2.226HisAla: 2.226 ± 0.576
0.318HisCys: 0.318 ± 0.286
1.272HisAsp: 1.272 ± 0.59
0.954HisGlu: 0.954 ± 0.612
0.636HisPhe: 0.636 ± 0.381
1.59HisGly: 1.59 ± 0.596
1.272HisHis: 1.272 ± 0.701
2.226HisIle: 2.226 ± 0.602
0.636HisLys: 0.636 ± 0.458
3.499HisLeu: 3.499 ± 1.04
0.318HisMet: 0.318 ± 0.436
0.318HisAsn: 0.318 ± 0.321
1.272HisPro: 1.272 ± 0.736
1.272HisGln: 1.272 ± 0.527
1.908HisArg: 1.908 ± 0.523
1.272HisSer: 1.272 ± 0.434
1.908HisThr: 1.908 ± 0.522
0.954HisVal: 0.954 ± 0.542
0.636HisTrp: 0.636 ± 0.485
1.272HisTyr: 1.272 ± 0.411
0.0HisXaa: 0.0 ± 0.0
Ile
6.043IleAla: 6.043 ± 1.086
0.636IleCys: 0.636 ± 0.611
2.545IleAsp: 2.545 ± 1.018
3.817IleGlu: 3.817 ± 1.222
1.59IlePhe: 1.59 ± 0.645
2.863IleGly: 2.863 ± 0.957
1.272IleHis: 1.272 ± 0.406
3.499IleIle: 3.499 ± 0.754
3.181IleLys: 3.181 ± 1.095
4.453IleLeu: 4.453 ± 1.067
1.59IleMet: 1.59 ± 0.464
2.863IleAsn: 2.863 ± 1.269
4.135IlePro: 4.135 ± 0.945
2.226IleGln: 2.226 ± 0.507
3.817IleArg: 3.817 ± 1.116
3.817IleSer: 3.817 ± 0.987
2.226IleThr: 2.226 ± 0.67
2.545IleVal: 2.545 ± 0.673
0.0IleTrp: 0.0 ± 0.0
2.545IleTyr: 2.545 ± 0.917
0.0IleXaa: 0.0 ± 0.0
Lys
5.407LysAla: 5.407 ± 1.508
0.318LysCys: 0.318 ± 0.242
1.908LysAsp: 1.908 ± 0.511
3.499LysGlu: 3.499 ± 1.358
1.908LysPhe: 1.908 ± 1.044
4.135LysGly: 4.135 ± 0.698
0.636LysHis: 0.636 ± 0.321
3.181LysIle: 3.181 ± 1.572
2.863LysLys: 2.863 ± 0.982
2.545LysLeu: 2.545 ± 0.665
2.226LysMet: 2.226 ± 1.085
2.545LysAsn: 2.545 ± 0.466
0.318LysPro: 0.318 ± 0.255
2.226LysGln: 2.226 ± 0.928
5.089LysArg: 5.089 ± 1.335
3.499LysSer: 3.499 ± 1.302
2.545LysThr: 2.545 ± 0.907
3.817LysVal: 3.817 ± 0.916
1.59LysTrp: 1.59 ± 0.603
2.545LysTyr: 2.545 ± 0.96
0.0LysXaa: 0.0 ± 0.0
Leu
12.405LeuAla: 12.405 ± 1.532
1.272LeuCys: 1.272 ± 0.523
2.863LeuAsp: 2.863 ± 0.597
6.361LeuGlu: 6.361 ± 1.483
1.59LeuPhe: 1.59 ± 0.7
3.181LeuGly: 3.181 ± 0.611
2.545LeuHis: 2.545 ± 1.132
4.135LeuIle: 4.135 ± 1.05
7.634LeuLys: 7.634 ± 1.739
6.679LeuLeu: 6.679 ± 1.266
3.181LeuMet: 3.181 ± 0.718
6.361LeuAsn: 6.361 ± 1.614
5.407LeuPro: 5.407 ± 1.456
3.181LeuGln: 3.181 ± 0.623
5.407LeuArg: 5.407 ± 1.291
9.542LeuSer: 9.542 ± 1.703
5.407LeuThr: 5.407 ± 1.103
4.771LeuVal: 4.771 ± 1.006
0.954LeuTrp: 0.954 ± 0.526
2.545LeuTyr: 2.545 ± 0.549
0.0LeuXaa: 0.0 ± 0.0
Met
3.181MetAla: 3.181 ± 0.853
0.318MetCys: 0.318 ± 0.255
1.272MetAsp: 1.272 ± 0.853
1.908MetGlu: 1.908 ± 0.839
0.636MetPhe: 0.636 ± 0.334
1.272MetGly: 1.272 ± 0.414
0.636MetHis: 0.636 ± 0.516
1.908MetIle: 1.908 ± 0.927
2.545MetLys: 2.545 ± 0.952
2.226MetLeu: 2.226 ± 0.72
0.636MetMet: 0.636 ± 0.4
1.272MetAsn: 1.272 ± 0.553
1.272MetPro: 1.272 ± 0.605
1.908MetGln: 1.908 ± 0.768
1.59MetArg: 1.59 ± 0.73
1.908MetSer: 1.908 ± 0.552
0.636MetThr: 0.636 ± 0.372
1.59MetVal: 1.59 ± 0.745
0.0MetTrp: 0.0 ± 0.0
0.318MetTyr: 0.318 ± 0.242
0.0MetXaa: 0.0 ± 0.0
Asn
3.181AsnAla: 3.181 ± 0.898
0.318AsnCys: 0.318 ± 0.328
2.545AsnAsp: 2.545 ± 0.762
2.545AsnGlu: 2.545 ± 1.302
0.636AsnPhe: 0.636 ± 0.454
2.545AsnGly: 2.545 ± 1.007
0.636AsnHis: 0.636 ± 0.642
2.226AsnIle: 2.226 ± 0.895
3.817AsnLys: 3.817 ± 1.388
2.863AsnLeu: 2.863 ± 1.067
0.954AsnMet: 0.954 ± 0.742
2.545AsnAsn: 2.545 ± 0.584
1.59AsnPro: 1.59 ± 0.483
2.863AsnGln: 2.863 ± 0.757
2.226AsnArg: 2.226 ± 0.54
3.817AsnSer: 3.817 ± 1.089
1.272AsnThr: 1.272 ± 0.595
2.226AsnVal: 2.226 ± 0.62
0.636AsnTrp: 0.636 ± 0.432
1.59AsnTyr: 1.59 ± 0.468
0.0AsnXaa: 0.0 ± 0.0
Pro
5.407ProAla: 5.407 ± 1.274
0.636ProCys: 0.636 ± 0.368
3.181ProAsp: 3.181 ± 1.557
4.453ProGlu: 4.453 ± 1.134
0.954ProPhe: 0.954 ± 0.457
2.226ProGly: 2.226 ± 0.503
1.908ProHis: 1.908 ± 0.521
0.318ProIle: 0.318 ± 0.255
1.908ProLys: 1.908 ± 0.551
2.863ProLeu: 2.863 ± 0.889
0.636ProMet: 0.636 ± 0.485
0.0ProAsn: 0.0 ± 0.0
2.863ProPro: 2.863 ± 1.132
3.499ProGln: 3.499 ± 1.134
2.545ProArg: 2.545 ± 0.731
3.181ProSer: 3.181 ± 0.651
0.954ProThr: 0.954 ± 0.509
5.407ProVal: 5.407 ± 1.446
0.318ProTrp: 0.318 ± 0.306
0.636ProTyr: 0.636 ± 0.612
0.0ProXaa: 0.0 ± 0.0
Gln
3.181GlnAla: 3.181 ± 0.808
0.636GlnCys: 0.636 ± 0.381
1.59GlnAsp: 1.59 ± 0.548
4.135GlnGlu: 4.135 ± 1.363
0.0GlnPhe: 0.0 ± 0.0
1.272GlnGly: 1.272 ± 0.643
1.272GlnHis: 1.272 ± 0.9
2.545GlnIle: 2.545 ± 0.608
2.863GlnLys: 2.863 ± 0.605
3.499GlnLeu: 3.499 ± 0.805
0.954GlnMet: 0.954 ± 0.464
3.181GlnAsn: 3.181 ± 1.037
1.272GlnPro: 1.272 ± 0.493
2.545GlnGln: 2.545 ± 0.849
2.863GlnArg: 2.863 ± 0.877
1.59GlnSer: 1.59 ± 0.56
1.59GlnThr: 1.59 ± 0.738
2.226GlnVal: 2.226 ± 0.53
0.636GlnTrp: 0.636 ± 0.413
0.954GlnTyr: 0.954 ± 0.353
0.0GlnXaa: 0.0 ± 0.0
Arg
7.316ArgAla: 7.316 ± 1.252
0.636ArgCys: 0.636 ± 0.495
4.135ArgAsp: 4.135 ± 1.022
6.997ArgGlu: 6.997 ± 1.333
3.499ArgPhe: 3.499 ± 0.768
2.863ArgGly: 2.863 ± 0.741
2.863ArgHis: 2.863 ± 0.758
2.863ArgIle: 2.863 ± 0.701
4.453ArgLys: 4.453 ± 1.285
6.997ArgLeu: 6.997 ± 1.75
1.272ArgMet: 1.272 ± 0.449
5.089ArgAsn: 5.089 ± 1.002
2.545ArgPro: 2.545 ± 0.816
2.863ArgGln: 2.863 ± 1.235
5.089ArgArg: 5.089 ± 1.17
3.817ArgSer: 3.817 ± 0.952
4.771ArgThr: 4.771 ± 1.025
3.499ArgVal: 3.499 ± 1.112
0.954ArgTrp: 0.954 ± 0.526
3.181ArgTyr: 3.181 ± 0.671
0.0ArgXaa: 0.0 ± 0.0
Ser
5.725SerAla: 5.725 ± 1.148
0.954SerCys: 0.954 ± 0.797
5.089SerAsp: 5.089 ± 0.906
5.407SerGlu: 5.407 ± 1.337
1.908SerPhe: 1.908 ± 0.854
6.043SerGly: 6.043 ± 1.386
2.545SerHis: 2.545 ± 0.704
4.771SerIle: 4.771 ± 1.492
1.59SerLys: 1.59 ± 1.114
8.906SerLeu: 8.906 ± 1.494
0.954SerMet: 0.954 ± 0.468
2.863SerAsn: 2.863 ± 1.322
3.181SerPro: 3.181 ± 1.051
1.272SerGln: 1.272 ± 0.521
6.997SerArg: 6.997 ± 1.608
3.817SerSer: 3.817 ± 0.984
1.908SerThr: 1.908 ± 0.569
3.499SerVal: 3.499 ± 0.801
1.272SerTrp: 1.272 ± 0.564
2.863SerTyr: 2.863 ± 0.732
0.0SerXaa: 0.0 ± 0.0
Thr
5.089ThrAla: 5.089 ± 1.434
1.272ThrCys: 1.272 ± 0.46
1.908ThrAsp: 1.908 ± 0.52
1.272ThrGlu: 1.272 ± 0.576
1.908ThrPhe: 1.908 ± 0.551
4.135ThrGly: 4.135 ± 1.763
1.59ThrHis: 1.59 ± 0.781
3.181ThrIle: 3.181 ± 1.246
2.226ThrLys: 2.226 ± 0.965
5.089ThrLeu: 5.089 ± 2.044
0.636ThrMet: 0.636 ± 0.321
0.954ThrAsn: 0.954 ± 0.399
2.863ThrPro: 2.863 ± 0.886
1.59ThrGln: 1.59 ± 0.628
2.226ThrArg: 2.226 ± 0.809
4.453ThrSer: 4.453 ± 1.061
2.545ThrThr: 2.545 ± 0.685
3.181ThrVal: 3.181 ± 0.764
0.318ThrTrp: 0.318 ± 0.242
0.318ThrTyr: 0.318 ± 0.255
0.0ThrXaa: 0.0 ± 0.0
Val
6.043ValAla: 6.043 ± 0.97
2.226ValCys: 2.226 ± 0.768
3.181ValAsp: 3.181 ± 1.211
1.272ValGlu: 1.272 ± 0.456
2.226ValPhe: 2.226 ± 0.688
1.59ValGly: 1.59 ± 0.852
0.318ValHis: 0.318 ± 0.242
2.863ValIle: 2.863 ± 0.946
1.59ValLys: 1.59 ± 0.801
6.043ValLeu: 6.043 ± 1.186
3.181ValMet: 3.181 ± 0.585
2.226ValAsn: 2.226 ± 0.593
1.59ValPro: 1.59 ± 0.459
2.226ValGln: 2.226 ± 0.95
4.771ValArg: 4.771 ± 1.014
6.361ValSer: 6.361 ± 1.278
4.771ValThr: 4.771 ± 1.153
4.771ValVal: 4.771 ± 0.972
1.908ValTrp: 1.908 ± 0.697
1.272ValTyr: 1.272 ± 0.673
0.0ValXaa: 0.0 ± 0.0
Trp
0.636TrpAla: 0.636 ± 0.4
0.318TrpCys: 0.318 ± 0.306
0.318TrpAsp: 0.318 ± 0.255
0.318TrpGlu: 0.318 ± 0.306
0.318TrpPhe: 0.318 ± 0.321
0.318TrpGly: 0.318 ± 0.286
0.318TrpHis: 0.318 ± 0.255
0.954TrpIle: 0.954 ± 0.527
0.318TrpLys: 0.318 ± 0.242
3.181TrpLeu: 3.181 ± 0.992
0.636TrpMet: 0.636 ± 0.363
0.636TrpAsn: 0.636 ± 0.334
1.272TrpPro: 1.272 ± 0.411
0.954TrpGln: 0.954 ± 0.418
2.226TrpArg: 2.226 ± 0.69
1.272TrpSer: 1.272 ± 0.501
0.0TrpThr: 0.0 ± 0.0
0.0TrpVal: 0.0 ± 0.0
0.318TrpTrp: 0.318 ± 0.255
0.636TrpTyr: 0.636 ± 0.294
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.272TyrAla: 1.272 ± 0.424
1.272TyrCys: 1.272 ± 1.089
2.226TyrAsp: 2.226 ± 0.651
1.908TyrGlu: 1.908 ± 0.696
0.636TyrPhe: 0.636 ± 0.572
2.863TyrGly: 2.863 ± 0.83
1.272TyrHis: 1.272 ± 0.684
3.181TyrIle: 3.181 ± 1.143
0.636TyrLys: 0.636 ± 0.38
5.089TyrLeu: 5.089 ± 0.953
0.636TyrMet: 0.636 ± 0.485
0.0TyrAsn: 0.0 ± 0.0
1.59TyrPro: 1.59 ± 0.73
0.318TyrGln: 0.318 ± 0.286
4.135TyrArg: 4.135 ± 1.247
2.226TyrSer: 2.226 ± 0.739
0.318TyrThr: 0.318 ± 0.255
1.272TyrVal: 1.272 ± 0.615
0.636TyrTrp: 0.636 ± 0.509
0.318TyrTyr: 0.318 ± 0.286
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 13 proteins (3145 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski