Amino acid dipepetide frequency for Pseudomonas phage Pf3 (Bacteriophage Pf3)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.958AlaAla: 9.958 ± 2.24
0.524AlaCys: 0.524 ± 0.46
4.717AlaAsp: 4.717 ± 0.925
3.145AlaGlu: 3.145 ± 1.342
4.717AlaPhe: 4.717 ± 1.626
5.765AlaGly: 5.765 ± 2.344
0.524AlaHis: 0.524 ± 0.46
5.241AlaIle: 5.241 ± 1.297
4.193AlaLys: 4.193 ± 1.835
9.958AlaLeu: 9.958 ± 2.959
2.621AlaMet: 2.621 ± 1.38
1.572AlaAsn: 1.572 ± 0.84
3.669AlaPro: 3.669 ± 1.565
3.145AlaGln: 3.145 ± 1.358
3.145AlaArg: 3.145 ± 1.885
9.434AlaSer: 9.434 ± 1.573
3.669AlaThr: 3.669 ± 1.032
6.813AlaVal: 6.813 ± 2.21
0.524AlaTrp: 0.524 ± 0.87
2.096AlaTyr: 2.096 ± 1.262
0.0AlaXaa: 0.0 ± 0.0
Cys
0.524CysAla: 0.524 ± 0.499
0.0CysCys: 0.0 ± 0.0
1.572CysAsp: 1.572 ± 0.553
0.524CysGlu: 0.524 ± 0.411
1.048CysPhe: 1.048 ± 0.704
0.524CysGly: 0.524 ± 0.425
1.048CysHis: 1.048 ± 1.185
0.0CysIle: 0.0 ± 0.0
0.0CysLys: 0.0 ± 0.0
2.096CysLeu: 2.096 ± 0.741
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
1.048CysPro: 1.048 ± 0.517
1.572CysGln: 1.572 ± 0.905
0.0CysArg: 0.0 ± 0.0
1.048CysSer: 1.048 ± 0.534
0.0CysThr: 0.0 ± 0.0
1.048CysVal: 1.048 ± 0.938
0.0CysTrp: 0.0 ± 0.0
0.524CysTyr: 0.524 ± 0.499
0.0CysXaa: 0.0 ± 0.0
Asp
4.717AspAla: 4.717 ± 1.629
0.524AspCys: 0.524 ± 0.499
4.193AspAsp: 4.193 ± 0.778
4.193AspGlu: 4.193 ± 0.729
3.145AspPhe: 3.145 ± 0.924
3.145AspGly: 3.145 ± 1.29
0.0AspHis: 0.0 ± 0.0
4.193AspIle: 4.193 ± 1.532
1.572AspLys: 1.572 ± 0.905
6.813AspLeu: 6.813 ± 1.986
0.524AspMet: 0.524 ± 0.411
1.048AspAsn: 1.048 ± 0.534
1.048AspPro: 1.048 ± 0.823
1.048AspGln: 1.048 ± 0.534
2.096AspArg: 2.096 ± 0.769
3.145AspSer: 3.145 ± 1.052
2.096AspThr: 2.096 ± 0.941
4.193AspVal: 4.193 ± 1.527
3.669AspTrp: 3.669 ± 1.259
0.524AspTyr: 0.524 ± 0.499
0.0AspXaa: 0.0 ± 0.0
Glu
5.765GluAla: 5.765 ± 1.388
0.0GluCys: 0.0 ± 0.0
1.048GluAsp: 1.048 ± 0.669
1.048GluGlu: 1.048 ± 0.823
2.096GluPhe: 2.096 ± 0.833
3.145GluGly: 3.145 ± 1.067
1.572GluHis: 1.572 ± 0.893
1.572GluIle: 1.572 ± 0.462
1.572GluLys: 1.572 ± 0.462
3.669GluLeu: 3.669 ± 0.916
0.524GluMet: 0.524 ± 0.499
2.621GluAsn: 2.621 ± 0.877
4.193GluPro: 4.193 ± 2.248
1.048GluGln: 1.048 ± 0.527
3.145GluArg: 3.145 ± 1.192
3.145GluSer: 3.145 ± 1.789
3.669GluThr: 3.669 ± 1.348
3.669GluVal: 3.669 ± 1.74
3.669GluTrp: 3.669 ± 1.687
1.572GluTyr: 1.572 ± 1.234
0.0GluXaa: 0.0 ± 0.0
Phe
4.717PheAla: 4.717 ± 1.446
2.096PheCys: 2.096 ± 1.229
4.193PheAsp: 4.193 ± 0.958
1.572PheGlu: 1.572 ± 0.789
3.145PhePhe: 3.145 ± 1.091
1.572PheGly: 1.572 ± 0.853
0.524PheHis: 0.524 ± 0.566
0.524PheIle: 0.524 ± 0.46
2.621PheLys: 2.621 ± 1.317
6.289PheLeu: 6.289 ± 1.529
3.145PheMet: 3.145 ± 0.879
1.572PheAsn: 1.572 ± 0.485
2.096PhePro: 2.096 ± 0.661
1.572PheGln: 1.572 ± 0.764
1.048PheArg: 1.048 ± 0.51
2.096PheSer: 2.096 ± 0.728
4.193PheThr: 4.193 ± 1.334
3.669PheVal: 3.669 ± 1.965
0.0PheTrp: 0.0 ± 0.0
1.572PheTyr: 1.572 ± 0.863
0.0PheXaa: 0.0 ± 0.0
Gly
4.717GlyAla: 4.717 ± 0.905
0.524GlyCys: 0.524 ± 0.46
4.193GlyAsp: 4.193 ± 1.48
1.048GlyGlu: 1.048 ± 0.517
4.717GlyPhe: 4.717 ± 0.863
4.193GlyGly: 4.193 ± 1.266
1.048GlyHis: 1.048 ± 0.66
4.717GlyIle: 4.717 ± 1.634
1.572GlyLys: 1.572 ± 0.553
8.91GlyLeu: 8.91 ± 2.149
1.572GlyMet: 1.572 ± 0.892
2.096GlyAsn: 2.096 ± 0.973
2.096GlyPro: 2.096 ± 0.728
2.621GlyGln: 2.621 ± 0.975
4.717GlyArg: 4.717 ± 1.069
7.862GlySer: 7.862 ± 1.031
3.145GlyThr: 3.145 ± 0.861
6.289GlyVal: 6.289 ± 2.095
1.572GlyTrp: 1.572 ± 0.929
2.096GlyTyr: 2.096 ± 0.94
0.0GlyXaa: 0.0 ± 0.0
His
2.621HisAla: 2.621 ± 0.987
1.048HisCys: 1.048 ± 0.782
0.524HisAsp: 0.524 ± 0.46
1.048HisGlu: 1.048 ± 0.892
0.524HisPhe: 0.524 ± 0.46
1.572HisGly: 1.572 ± 0.916
1.572HisHis: 1.572 ± 0.893
1.048HisIle: 1.048 ± 0.919
0.524HisLys: 0.524 ± 0.46
3.145HisLeu: 3.145 ± 1.434
0.524HisMet: 0.524 ± 0.564
0.524HisAsn: 0.524 ± 0.566
0.524HisPro: 0.524 ± 0.566
0.0HisGln: 0.0 ± 0.0
1.572HisArg: 1.572 ± 0.863
0.0HisSer: 0.0 ± 0.0
1.572HisThr: 1.572 ± 1.045
1.048HisVal: 1.048 ± 0.527
0.524HisTrp: 0.524 ± 0.499
1.048HisTyr: 1.048 ± 0.517
0.0HisXaa: 0.0 ± 0.0
Ile
3.145IleAla: 3.145 ± 1.037
0.524IleCys: 0.524 ± 0.566
4.193IleAsp: 4.193 ± 1.66
5.241IleGlu: 5.241 ± 0.993
1.572IlePhe: 1.572 ± 0.754
4.717IleGly: 4.717 ± 1.372
1.572IleHis: 1.572 ± 0.863
2.096IleIle: 2.096 ± 0.925
0.524IleLys: 0.524 ± 0.843
3.145IleLeu: 3.145 ± 1.339
0.524IleMet: 0.524 ± 0.499
2.096IleAsn: 2.096 ± 1.104
3.145IlePro: 3.145 ± 0.917
2.621IleGln: 2.621 ± 1.443
4.193IleArg: 4.193 ± 1.443
4.193IleSer: 4.193 ± 1.457
5.241IleThr: 5.241 ± 2.031
4.717IleVal: 4.717 ± 1.078
0.0IleTrp: 0.0 ± 0.0
0.524IleTyr: 0.524 ± 0.411
0.0IleXaa: 0.0 ± 0.0
Lys
3.669LysAla: 3.669 ± 1.725
0.0LysCys: 0.0 ± 0.0
1.048LysAsp: 1.048 ± 0.483
2.621LysGlu: 2.621 ± 0.926
1.572LysPhe: 1.572 ± 1.379
1.048LysGly: 1.048 ± 0.669
1.572LysHis: 1.572 ± 0.796
1.572LysIle: 1.572 ± 0.658
3.145LysLys: 3.145 ± 1.098
2.096LysLeu: 2.096 ± 1.085
1.048LysMet: 1.048 ± 0.843
0.524LysAsn: 0.524 ± 0.46
2.096LysPro: 2.096 ± 0.661
0.524LysGln: 0.524 ± 0.425
2.621LysArg: 2.621 ± 1.223
3.145LysSer: 3.145 ± 1.244
1.048LysThr: 1.048 ± 0.66
1.572LysVal: 1.572 ± 0.929
0.0LysTrp: 0.0 ± 0.0
1.572LysTyr: 1.572 ± 0.863
0.0LysXaa: 0.0 ± 0.0
Leu
7.338LeuAla: 7.338 ± 2.462
0.524LeuCys: 0.524 ± 0.566
4.717LeuAsp: 4.717 ± 1.989
4.193LeuGlu: 4.193 ± 1.863
4.717LeuPhe: 4.717 ± 2.095
8.386LeuGly: 8.386 ± 1.602
2.096LeuHis: 2.096 ± 0.77
2.621LeuIle: 2.621 ± 0.697
2.621LeuLys: 2.621 ± 1.499
12.055LeuLeu: 12.055 ± 5.637
2.096LeuMet: 2.096 ± 0.735
3.669LeuAsn: 3.669 ± 1.135
5.241LeuPro: 5.241 ± 0.911
3.145LeuGln: 3.145 ± 1.64
3.669LeuArg: 3.669 ± 1.27
12.579LeuSer: 12.579 ± 1.871
6.813LeuThr: 6.813 ± 2.774
8.91LeuVal: 8.91 ± 2.206
0.524LeuTrp: 0.524 ± 0.46
3.145LeuTyr: 3.145 ± 1.974
0.0LeuXaa: 0.0 ± 0.0
Met
1.572MetAla: 1.572 ± 0.797
0.0MetCys: 0.0 ± 0.0
1.048MetAsp: 1.048 ± 0.534
1.048MetGlu: 1.048 ± 0.527
0.0MetPhe: 0.0 ± 0.0
1.572MetGly: 1.572 ± 0.669
1.048MetHis: 1.048 ± 0.66
1.572MetIle: 1.572 ± 0.719
0.524MetLys: 0.524 ± 0.639
1.572MetLeu: 1.572 ± 1.066
0.0MetMet: 0.0 ± 0.0
0.524MetAsn: 0.524 ± 0.639
2.096MetPro: 2.096 ± 0.711
0.524MetGln: 0.524 ± 0.843
1.572MetArg: 1.572 ± 1.498
2.096MetSer: 2.096 ± 1.044
0.524MetThr: 0.524 ± 0.425
1.572MetVal: 1.572 ± 0.691
0.524MetTrp: 0.524 ± 0.499
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.621AsnAla: 2.621 ± 0.91
0.524AsnCys: 0.524 ± 0.499
1.572AsnAsp: 1.572 ± 0.898
3.145AsnGlu: 3.145 ± 1.033
0.524AsnPhe: 0.524 ± 0.425
2.096AsnGly: 2.096 ± 0.661
1.048AsnHis: 1.048 ± 0.51
3.145AsnIle: 3.145 ± 1.251
0.524AsnLys: 0.524 ± 0.411
1.572AsnLeu: 1.572 ± 0.797
0.0AsnMet: 0.0 ± 0.0
1.572AsnAsn: 1.572 ± 0.908
2.096AsnPro: 2.096 ± 0.844
1.048AsnGln: 1.048 ± 0.534
2.096AsnArg: 2.096 ± 1.112
3.145AsnSer: 3.145 ± 1.102
1.572AsnThr: 1.572 ± 0.853
2.621AsnVal: 2.621 ± 1.217
1.048AsnTrp: 1.048 ± 0.851
0.524AsnTyr: 0.524 ± 0.411
0.0AsnXaa: 0.0 ± 0.0
Pro
3.145ProAla: 3.145 ± 0.905
0.524ProCys: 0.524 ± 0.425
4.193ProAsp: 4.193 ± 1.657
5.765ProGlu: 5.765 ± 1.965
1.048ProPhe: 1.048 ± 0.736
3.669ProGly: 3.669 ± 0.694
2.096ProHis: 2.096 ± 0.784
2.621ProIle: 2.621 ± 1.065
1.048ProLys: 1.048 ± 0.919
3.669ProLeu: 3.669 ± 0.75
0.524ProMet: 0.524 ± 0.67
3.145ProAsn: 3.145 ± 1.462
3.145ProPro: 3.145 ± 1.096
0.524ProGln: 0.524 ± 0.499
3.145ProArg: 3.145 ± 0.63
5.765ProSer: 5.765 ± 1.759
1.572ProThr: 1.572 ± 0.485
7.338ProVal: 7.338 ± 2.141
0.0ProTrp: 0.0 ± 0.0
3.145ProTyr: 3.145 ± 1.221
0.0ProXaa: 0.0 ± 0.0
Gln
3.669GlnAla: 3.669 ± 1.38
1.048GlnCys: 1.048 ± 0.768
0.524GlnAsp: 0.524 ± 0.411
1.048GlnGlu: 1.048 ± 0.669
1.048GlnPhe: 1.048 ± 1.179
3.669GlnGly: 3.669 ± 0.916
0.0GlnHis: 0.0 ± 0.0
1.572GlnIle: 1.572 ± 1.889
0.524GlnLys: 0.524 ± 0.499
3.669GlnLeu: 3.669 ± 1.742
0.0GlnMet: 0.0 ± 0.0
0.0GlnAsn: 0.0 ± 0.0
1.572GlnPro: 1.572 ± 0.805
2.621GlnGln: 2.621 ± 1.187
2.621GlnArg: 2.621 ± 1.147
2.621GlnSer: 2.621 ± 1.357
1.048GlnThr: 1.048 ± 0.534
2.621GlnVal: 2.621 ± 1.097
1.572GlnTrp: 1.572 ± 0.553
2.096GlnTyr: 2.096 ± 0.8
0.0GlnXaa: 0.0 ± 0.0
Arg
4.193ArgAla: 4.193 ± 1.294
1.048ArgCys: 1.048 ± 0.66
2.621ArgAsp: 2.621 ± 0.576
0.0ArgGlu: 0.0 ± 0.0
1.048ArgPhe: 1.048 ± 0.735
3.669ArgGly: 3.669 ± 1.319
1.048ArgHis: 1.048 ± 0.66
3.669ArgIle: 3.669 ± 1.565
1.572ArgLys: 1.572 ± 0.893
5.241ArgLeu: 5.241 ± 1.275
1.572ArgMet: 1.572 ± 1.324
1.048ArgAsn: 1.048 ± 0.517
4.193ArgPro: 4.193 ± 1.074
2.096ArgGln: 2.096 ± 0.995
3.669ArgArg: 3.669 ± 1.876
6.813ArgSer: 6.813 ± 2.007
2.096ArgThr: 2.096 ± 1.068
4.193ArgVal: 4.193 ± 1.914
1.572ArgTrp: 1.572 ± 1.214
0.524ArgTyr: 0.524 ± 0.411
0.0ArgXaa: 0.0 ± 0.0
Ser
8.91SerAla: 8.91 ± 1.347
0.524SerCys: 0.524 ± 0.411
4.717SerAsp: 4.717 ± 1.265
3.145SerGlu: 3.145 ± 0.692
6.289SerPhe: 6.289 ± 2.309
8.91SerGly: 8.91 ± 2.573
1.048SerHis: 1.048 ± 0.66
6.813SerIle: 6.813 ± 1.802
3.145SerLys: 3.145 ± 0.695
6.813SerLeu: 6.813 ± 2.557
1.572SerMet: 1.572 ± 0.84
2.621SerAsn: 2.621 ± 1.337
5.241SerPro: 5.241 ± 1.004
2.621SerGln: 2.621 ± 0.877
4.193SerArg: 4.193 ± 1.554
6.813SerSer: 6.813 ± 2.212
4.193SerThr: 4.193 ± 0.944
9.958SerVal: 9.958 ± 2.562
1.048SerTrp: 1.048 ± 0.517
2.621SerTyr: 2.621 ± 1.42
0.0SerXaa: 0.0 ± 0.0
Thr
5.765ThrAla: 5.765 ± 1.544
0.0ThrCys: 0.0 ± 0.0
3.145ThrAsp: 3.145 ± 1.326
3.145ThrGlu: 3.145 ± 1.707
3.145ThrPhe: 3.145 ± 1.46
4.193ThrGly: 4.193 ± 1.491
1.048ThrHis: 1.048 ± 0.919
2.621ThrIle: 2.621 ± 0.99
0.524ThrLys: 0.524 ± 0.411
5.765ThrLeu: 5.765 ± 1.061
0.524ThrMet: 0.524 ± 0.635
4.717ThrAsn: 4.717 ± 1.173
3.669ThrPro: 3.669 ± 1.438
1.048ThrGln: 1.048 ± 0.483
0.524ThrArg: 0.524 ± 0.67
4.193ThrSer: 4.193 ± 0.91
5.241ThrThr: 5.241 ± 3.089
5.241ThrVal: 5.241 ± 1.396
0.524ThrTrp: 0.524 ± 0.411
1.048ThrTyr: 1.048 ± 0.724
0.0ThrXaa: 0.0 ± 0.0
Val
6.289ValAla: 6.289 ± 2.02
2.621ValCys: 2.621 ± 1.909
3.145ValAsp: 3.145 ± 1.298
3.145ValGlu: 3.145 ± 0.917
3.669ValPhe: 3.669 ± 1.255
6.289ValGly: 6.289 ± 2.599
1.572ValHis: 1.572 ± 0.719
6.289ValIle: 6.289 ± 1.645
4.717ValLys: 4.717 ± 0.846
11.006ValLeu: 11.006 ± 2.426
1.048ValMet: 1.048 ± 0.483
2.096ValAsn: 2.096 ± 1.151
7.862ValPro: 7.862 ± 1.185
1.048ValGln: 1.048 ± 1.358
3.669ValArg: 3.669 ± 1.143
7.862ValSer: 7.862 ± 2.122
6.289ValThr: 6.289 ± 2.504
7.338ValVal: 7.338 ± 1.815
1.048ValTrp: 1.048 ± 0.637
1.572ValTyr: 1.572 ± 0.485
0.0ValXaa: 0.0 ± 0.0
Trp
1.048TrpAla: 1.048 ± 0.883
0.524TrpCys: 0.524 ± 0.499
0.0TrpAsp: 0.0 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
1.572TrpPhe: 1.572 ± 0.789
1.572TrpGly: 1.572 ± 0.969
0.0TrpHis: 0.0 ± 0.0
0.524TrpIle: 0.524 ± 0.843
0.524TrpLys: 0.524 ± 0.46
0.0TrpLeu: 0.0 ± 0.0
1.048TrpMet: 1.048 ± 0.735
0.0TrpAsn: 0.0 ± 0.0
0.524TrpPro: 0.524 ± 0.499
1.572TrpGln: 1.572 ± 0.929
2.621TrpArg: 2.621 ± 0.777
3.145TrpSer: 3.145 ± 0.752
0.524TrpThr: 0.524 ± 0.411
2.096TrpVal: 2.096 ± 0.905
0.0TrpTrp: 0.0 ± 0.0
1.572TrpTyr: 1.572 ± 0.789
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.048TyrAla: 1.048 ± 0.841
0.0TyrCys: 0.0 ± 0.0
0.524TyrAsp: 0.524 ± 0.46
3.669TyrGlu: 3.669 ± 0.756
2.621TyrPhe: 2.621 ± 1.009
0.0TyrGly: 0.0 ± 0.0
0.524TyrHis: 0.524 ± 0.566
1.572TyrIle: 1.572 ± 0.969
1.048TyrLys: 1.048 ± 0.66
2.096TyrLeu: 2.096 ± 0.833
0.0TyrMet: 0.0 ± 0.0
1.048TyrAsn: 1.048 ± 0.517
0.524TyrPro: 0.524 ± 0.46
3.145TyrGln: 3.145 ± 1.238
1.572TyrArg: 1.572 ± 0.694
2.096TyrSer: 2.096 ± 1.229
1.572TyrThr: 1.572 ± 0.694
3.669TyrVal: 3.669 ± 1.533
1.048TyrTrp: 1.048 ± 0.823
1.048TyrTyr: 1.048 ± 0.883
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (1909 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski