Amino acid dipepetide frequency for Pseudomonas phage Pf1 (Bacteriophage Pf1)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.66AlaAla: 11.66 ± 3.231
2.332AlaCys: 2.332 ± 1.05
4.664AlaAsp: 4.664 ± 1.503
2.332AlaGlu: 2.332 ± 0.572
3.887AlaPhe: 3.887 ± 1.041
5.83AlaGly: 5.83 ± 1.649
1.555AlaHis: 1.555 ± 1.003
4.275AlaIle: 4.275 ± 1.013
5.052AlaLys: 5.052 ± 1.266
8.939AlaLeu: 8.939 ± 2.472
1.943AlaMet: 1.943 ± 1.049
1.943AlaAsn: 1.943 ± 0.751
7.384AlaPro: 7.384 ± 3.227
4.275AlaGln: 4.275 ± 1.979
8.162AlaArg: 8.162 ± 2.244
8.162AlaSer: 8.162 ± 1.078
4.664AlaThr: 4.664 ± 0.751
7.384AlaVal: 7.384 ± 1.736
1.943AlaTrp: 1.943 ± 0.964
2.332AlaTyr: 2.332 ± 0.974
0.0AlaXaa: 0.0 ± 0.0
Cys
2.721CysAla: 2.721 ± 0.963
0.0CysCys: 0.0 ± 0.0
1.555CysAsp: 1.555 ± 0.841
3.498CysGlu: 3.498 ± 1.277
0.0CysPhe: 0.0 ± 0.0
1.166CysGly: 1.166 ± 0.426
0.0CysHis: 0.0 ± 0.0
0.777CysIle: 0.777 ± 0.508
1.166CysLys: 1.166 ± 0.497
1.166CysLeu: 1.166 ± 0.549
0.0CysMet: 0.0 ± 0.0
0.389CysAsn: 0.389 ± 0.328
1.166CysPro: 1.166 ± 0.53
0.777CysGln: 0.777 ± 0.58
2.332CysArg: 2.332 ± 1.003
1.943CysSer: 1.943 ± 0.894
1.166CysThr: 1.166 ± 0.704
1.555CysVal: 1.555 ± 0.853
0.389CysTrp: 0.389 ± 0.425
0.777CysTyr: 0.777 ± 0.657
0.0CysXaa: 0.0 ± 0.0
Asp
4.275AspAla: 4.275 ± 1.626
1.166AspCys: 1.166 ± 0.885
3.109AspAsp: 3.109 ± 0.884
0.777AspGlu: 0.777 ± 0.468
2.721AspPhe: 2.721 ± 1.296
8.55AspGly: 8.55 ± 2.698
0.389AspHis: 0.389 ± 0.303
2.332AspIle: 2.332 ± 1.076
1.555AspLys: 1.555 ± 0.999
5.83AspLeu: 5.83 ± 1.688
1.166AspMet: 1.166 ± 0.623
0.389AspAsn: 0.389 ± 0.328
7.384AspPro: 7.384 ± 2.552
2.332AspGln: 2.332 ± 1.233
3.109AspArg: 3.109 ± 0.7
3.109AspSer: 3.109 ± 0.896
2.721AspThr: 2.721 ± 0.859
1.555AspVal: 1.555 ± 0.728
0.777AspTrp: 0.777 ± 0.402
1.555AspTyr: 1.555 ± 0.539
0.0AspXaa: 0.0 ± 0.0
Glu
3.887GluAla: 3.887 ± 0.959
1.943GluCys: 1.943 ± 0.492
2.721GluAsp: 2.721 ± 1.202
3.109GluGlu: 3.109 ± 1.381
1.943GluPhe: 1.943 ± 1.14
4.275GluGly: 4.275 ± 1.526
0.389GluHis: 0.389 ± 0.401
1.555GluIle: 1.555 ± 0.919
1.943GluLys: 1.943 ± 0.841
4.664GluLeu: 4.664 ± 1.123
1.166GluMet: 1.166 ± 0.647
1.555GluAsn: 1.555 ± 0.713
1.166GluPro: 1.166 ± 0.71
1.166GluGln: 1.166 ± 0.838
3.887GluArg: 3.887 ± 1.399
3.109GluSer: 3.109 ± 0.658
3.498GluThr: 3.498 ± 1.355
2.721GluVal: 2.721 ± 0.974
0.777GluTrp: 0.777 ± 0.502
1.943GluTyr: 1.943 ± 0.713
0.0GluXaa: 0.0 ± 0.0
Phe
5.441PheAla: 5.441 ± 1.755
1.555PheCys: 1.555 ± 0.921
2.332PheAsp: 2.332 ± 0.673
1.166PheGlu: 1.166 ± 0.842
1.943PhePhe: 1.943 ± 1.437
2.332PheGly: 2.332 ± 0.946
0.777PheHis: 0.777 ± 0.464
0.0PheIle: 0.0 ± 0.0
0.389PheLys: 0.389 ± 0.401
2.332PheLeu: 2.332 ± 0.744
1.166PheMet: 1.166 ± 0.533
0.389PheAsn: 0.389 ± 0.335
1.555PhePro: 1.555 ± 0.748
1.555PheGln: 1.555 ± 0.786
3.109PheArg: 3.109 ± 0.658
2.332PheSer: 2.332 ± 0.699
1.166PheThr: 1.166 ± 0.537
2.721PheVal: 2.721 ± 0.835
0.777PheTrp: 0.777 ± 0.606
0.389PheTyr: 0.389 ± 0.401
0.0PheXaa: 0.0 ± 0.0
Gly
5.441GlyAla: 5.441 ± 1.07
1.166GlyCys: 1.166 ± 0.426
8.55GlyAsp: 8.55 ± 3.197
3.109GlyGlu: 3.109 ± 0.661
4.275GlyPhe: 4.275 ± 0.995
11.66GlyGly: 11.66 ± 4.759
0.777GlyHis: 0.777 ± 0.642
2.332GlyIle: 2.332 ± 0.636
4.275GlyLys: 4.275 ± 1.371
6.996GlyLeu: 6.996 ± 2.14
1.943GlyMet: 1.943 ± 0.754
3.887GlyAsn: 3.887 ± 2.124
4.275GlyPro: 4.275 ± 1.773
4.275GlyGln: 4.275 ± 1.059
8.55GlyArg: 8.55 ± 1.668
6.607GlySer: 6.607 ± 1.343
3.109GlyThr: 3.109 ± 1.906
4.275GlyVal: 4.275 ± 1.661
2.721GlyTrp: 2.721 ± 0.818
2.721GlyTyr: 2.721 ± 1.019
0.0GlyXaa: 0.0 ± 0.0
His
2.721HisAla: 2.721 ± 1.105
0.777HisCys: 0.777 ± 0.606
0.777HisAsp: 0.777 ± 0.515
0.389HisGlu: 0.389 ± 0.401
1.166HisPhe: 1.166 ± 0.576
2.332HisGly: 2.332 ± 1.038
0.777HisHis: 0.777 ± 0.417
1.166HisIle: 1.166 ± 0.602
1.166HisLys: 1.166 ± 0.602
1.943HisLeu: 1.943 ± 0.789
0.0HisMet: 0.0 ± 0.0
0.777HisAsn: 0.777 ± 0.468
1.166HisPro: 1.166 ± 0.536
0.777HisGln: 0.777 ± 0.566
1.943HisArg: 1.943 ± 0.828
1.166HisSer: 1.166 ± 0.444
0.777HisThr: 0.777 ± 0.803
0.777HisVal: 0.777 ± 0.468
0.389HisTrp: 0.389 ± 0.468
1.166HisTyr: 1.166 ± 0.755
0.0HisXaa: 0.0 ± 0.0
Ile
4.664IleAla: 4.664 ± 1.364
0.777IleCys: 0.777 ± 0.472
1.943IleAsp: 1.943 ± 1.088
3.498IleGlu: 3.498 ± 2.096
1.166IlePhe: 1.166 ± 0.611
4.275IleGly: 4.275 ± 1.586
1.166IleHis: 1.166 ± 0.647
1.943IleIle: 1.943 ± 1.229
2.721IleLys: 2.721 ± 0.996
4.664IleLeu: 4.664 ± 0.98
0.777IleMet: 0.777 ± 0.503
0.389IleAsn: 0.389 ± 0.335
2.332IlePro: 2.332 ± 0.908
0.777IleGln: 0.777 ± 0.548
3.887IleArg: 3.887 ± 1.654
2.721IleSer: 2.721 ± 0.888
1.943IleThr: 1.943 ± 0.818
2.332IleVal: 2.332 ± 0.893
1.166IleTrp: 1.166 ± 0.756
1.943IleTyr: 1.943 ± 0.76
0.0IleXaa: 0.0 ± 0.0
Lys
5.052LysAla: 5.052 ± 1.45
0.389LysCys: 0.389 ± 0.455
1.943LysAsp: 1.943 ± 0.848
2.721LysGlu: 2.721 ± 1.503
0.777LysPhe: 0.777 ± 0.646
6.218LysGly: 6.218 ± 1.692
1.555LysHis: 1.555 ± 0.668
1.166LysIle: 1.166 ± 0.537
1.555LysLys: 1.555 ± 0.64
2.332LysLeu: 2.332 ± 1.191
1.555LysMet: 1.555 ± 0.7
0.777LysAsn: 0.777 ± 0.412
1.166LysPro: 1.166 ± 0.672
2.332LysGln: 2.332 ± 0.928
0.777LysArg: 0.777 ± 0.468
3.498LysSer: 3.498 ± 0.933
5.052LysThr: 5.052 ± 0.96
3.109LysVal: 3.109 ± 1.517
0.0LysTrp: 0.0 ± 0.0
0.777LysTyr: 0.777 ± 0.611
0.0LysXaa: 0.0 ± 0.0
Leu
8.55LeuAla: 8.55 ± 2.223
3.498LeuCys: 3.498 ± 1.231
4.664LeuAsp: 4.664 ± 0.909
6.607LeuGlu: 6.607 ± 2.202
3.498LeuPhe: 3.498 ± 0.941
3.887LeuGly: 3.887 ± 1.7
2.332LeuHis: 2.332 ± 1.134
7.384LeuIle: 7.384 ± 1.434
3.498LeuLys: 3.498 ± 1.306
10.494LeuLeu: 10.494 ± 2.788
0.777LeuMet: 0.777 ± 0.421
3.109LeuAsn: 3.109 ± 0.854
4.664LeuPro: 4.664 ± 1.881
3.109LeuGln: 3.109 ± 1.529
6.607LeuArg: 6.607 ± 1.285
5.441LeuSer: 5.441 ± 2.132
4.275LeuThr: 4.275 ± 1.66
5.052LeuVal: 5.052 ± 1.026
0.777LeuTrp: 0.777 ± 0.491
1.166LeuTyr: 1.166 ± 0.444
0.0LeuXaa: 0.0 ± 0.0
Met
2.721MetAla: 2.721 ± 0.818
0.389MetCys: 0.389 ± 0.303
0.389MetAsp: 0.389 ± 0.303
0.777MetGlu: 0.777 ± 0.575
1.555MetPhe: 1.555 ± 0.725
1.943MetGly: 1.943 ± 0.699
0.389MetHis: 0.389 ± 0.303
0.389MetIle: 0.389 ± 0.468
1.943MetLys: 1.943 ± 1.236
1.943MetLeu: 1.943 ± 0.846
0.389MetMet: 0.389 ± 0.401
1.166MetAsn: 1.166 ± 0.642
0.389MetPro: 0.389 ± 0.401
0.0MetGln: 0.0 ± 0.0
0.777MetArg: 0.777 ± 0.468
1.943MetSer: 1.943 ± 0.753
2.332MetThr: 2.332 ± 1.166
0.0MetVal: 0.0 ± 0.0
0.0MetTrp: 0.0 ± 0.0
1.555MetTyr: 1.555 ± 0.924
0.0MetXaa: 0.0 ± 0.0
Asn
2.721AsnAla: 2.721 ± 0.889
0.777AsnCys: 0.777 ± 0.421
0.777AsnAsp: 0.777 ± 0.657
0.0AsnGlu: 0.0 ± 0.0
0.0AsnPhe: 0.0 ± 0.0
5.441AsnGly: 5.441 ± 2.006
1.555AsnHis: 1.555 ± 0.916
1.555AsnIle: 1.555 ± 0.617
0.389AsnLys: 0.389 ± 0.303
2.332AsnLeu: 2.332 ± 0.773
0.389AsnMet: 0.389 ± 0.416
1.166AsnAsn: 1.166 ± 0.668
2.721AsnPro: 2.721 ± 0.925
0.389AsnGln: 0.389 ± 0.303
1.166AsnArg: 1.166 ± 0.576
0.777AsnSer: 0.777 ± 0.402
1.943AsnThr: 1.943 ± 0.806
0.777AsnVal: 0.777 ± 0.402
0.389AsnTrp: 0.389 ± 0.335
1.555AsnTyr: 1.555 ± 0.997
0.0AsnXaa: 0.0 ± 0.0
Pro
9.716ProAla: 9.716 ± 3.399
0.777ProCys: 0.777 ± 0.515
3.887ProAsp: 3.887 ± 1.25
3.498ProGlu: 3.498 ± 1.28
1.943ProPhe: 1.943 ± 0.715
6.218ProGly: 6.218 ± 1.508
0.777ProHis: 0.777 ± 0.606
1.943ProIle: 1.943 ± 0.84
2.721ProLys: 2.721 ± 1.023
5.441ProLeu: 5.441 ± 1.747
0.777ProMet: 0.777 ± 0.515
2.332ProAsn: 2.332 ± 0.919
4.275ProPro: 4.275 ± 1.413
1.943ProGln: 1.943 ± 1.595
1.943ProArg: 1.943 ± 0.736
3.498ProSer: 3.498 ± 1.608
6.607ProThr: 6.607 ± 2.039
3.887ProVal: 3.887 ± 1.102
0.777ProTrp: 0.777 ± 0.476
1.166ProTyr: 1.166 ± 0.885
0.0ProXaa: 0.0 ± 0.0
Gln
5.052GlnAla: 5.052 ± 1.972
0.389GlnCys: 0.389 ± 0.328
2.721GlnAsp: 2.721 ± 1.326
0.389GlnGlu: 0.389 ± 0.455
1.166GlnPhe: 1.166 ± 0.909
3.498GlnGly: 3.498 ± 1.293
0.389GlnHis: 0.389 ± 0.335
1.555GlnIle: 1.555 ± 0.861
0.389GlnLys: 0.389 ± 0.335
3.887GlnLeu: 3.887 ± 1.276
1.943GlnMet: 1.943 ± 0.924
0.777GlnAsn: 0.777 ± 0.502
1.555GlnPro: 1.555 ± 1.135
1.166GlnGln: 1.166 ± 0.678
3.498GlnArg: 3.498 ± 2.29
2.721GlnSer: 2.721 ± 0.645
2.332GlnThr: 2.332 ± 0.815
3.109GlnVal: 3.109 ± 1.142
1.166GlnTrp: 1.166 ± 0.929
0.389GlnTyr: 0.389 ± 0.303
0.0GlnXaa: 0.0 ± 0.0
Arg
3.498ArgAla: 3.498 ± 1.333
1.943ArgCys: 1.943 ± 1.257
3.109ArgAsp: 3.109 ± 0.915
3.498ArgGlu: 3.498 ± 1.354
0.777ArgPhe: 0.777 ± 0.515
4.275ArgGly: 4.275 ± 1.414
1.943ArgHis: 1.943 ± 0.962
5.441ArgIle: 5.441 ± 1.469
4.664ArgLys: 4.664 ± 1.056
5.052ArgLeu: 5.052 ± 3.207
1.555ArgMet: 1.555 ± 0.723
1.166ArgAsn: 1.166 ± 0.535
6.218ArgPro: 6.218 ± 2.089
5.441ArgGln: 5.441 ± 1.693
6.607ArgArg: 6.607 ± 1.72
4.275ArgSer: 4.275 ± 1.008
2.332ArgThr: 2.332 ± 0.95
5.052ArgVal: 5.052 ± 1.36
1.943ArgTrp: 1.943 ± 1.124
3.887ArgTyr: 3.887 ± 1.569
0.0ArgXaa: 0.0 ± 0.0
Ser
4.664SerAla: 4.664 ± 1.167
1.555SerCys: 1.555 ± 0.955
1.166SerAsp: 1.166 ± 0.668
1.943SerGlu: 1.943 ± 0.623
2.332SerPhe: 2.332 ± 1.02
8.55SerGly: 8.55 ± 1.56
1.166SerHis: 1.166 ± 0.602
3.498SerIle: 3.498 ± 0.908
0.777SerLys: 0.777 ± 0.534
7.773SerLeu: 7.773 ± 1.374
1.555SerMet: 1.555 ± 1.053
0.389SerAsn: 0.389 ± 0.335
6.218SerPro: 6.218 ± 0.866
1.166SerGln: 1.166 ± 0.599
3.887SerArg: 3.887 ± 0.864
2.332SerSer: 2.332 ± 1.273
3.498SerThr: 3.498 ± 1.168
5.052SerVal: 5.052 ± 1.35
1.943SerTrp: 1.943 ± 0.743
2.721SerTyr: 2.721 ± 1.039
0.0SerXaa: 0.0 ± 0.0
Thr
4.664ThrAla: 4.664 ± 0.917
1.943ThrCys: 1.943 ± 1.035
3.887ThrAsp: 3.887 ± 1.262
1.555ThrGlu: 1.555 ± 0.688
1.943ThrPhe: 1.943 ± 0.823
6.218ThrGly: 6.218 ± 1.888
1.555ThrHis: 1.555 ± 1.13
2.721ThrIle: 2.721 ± 1.27
3.109ThrLys: 3.109 ± 0.938
3.887ThrLeu: 3.887 ± 0.549
0.777ThrMet: 0.777 ± 0.803
1.555ThrAsn: 1.555 ± 0.505
5.83ThrPro: 5.83 ± 1.514
1.943ThrGln: 1.943 ± 1.372
3.109ThrArg: 3.109 ± 1.081
3.109ThrSer: 3.109 ± 1.195
1.943ThrThr: 1.943 ± 1.267
3.109ThrVal: 3.109 ± 0.995
1.555ThrTrp: 1.555 ± 0.622
1.943ThrTyr: 1.943 ± 0.597
0.0ThrXaa: 0.0 ± 0.0
Val
6.218ValAla: 6.218 ± 1.589
0.777ValCys: 0.777 ± 0.639
3.498ValAsp: 3.498 ± 1.168
4.664ValGlu: 4.664 ± 1.4
1.943ValPhe: 1.943 ± 0.782
2.721ValGly: 2.721 ± 1.008
2.332ValHis: 2.332 ± 1.276
2.721ValIle: 2.721 ± 1.055
2.721ValLys: 2.721 ± 1.29
4.664ValLeu: 4.664 ± 1.574
1.166ValMet: 1.166 ± 0.538
1.943ValAsn: 1.943 ± 0.792
3.498ValPro: 3.498 ± 1.237
1.943ValGln: 1.943 ± 1.043
5.83ValArg: 5.83 ± 1.388
1.943ValSer: 1.943 ± 0.842
3.498ValThr: 3.498 ± 1.172
5.441ValVal: 5.441 ± 1.694
1.555ValTrp: 1.555 ± 0.6
1.555ValTyr: 1.555 ± 0.635
0.0ValXaa: 0.0 ± 0.0
Trp
1.555TrpAla: 1.555 ± 0.574
0.0TrpCys: 0.0 ± 0.0
1.166TrpAsp: 1.166 ± 0.528
1.943TrpGlu: 1.943 ± 0.703
0.389TrpPhe: 0.389 ± 0.335
0.777TrpGly: 0.777 ± 0.493
1.555TrpHis: 1.555 ± 0.673
0.777TrpIle: 0.777 ± 0.599
0.777TrpLys: 0.777 ± 0.402
1.555TrpLeu: 1.555 ± 1.099
0.0TrpMet: 0.0 ± 0.0
0.777TrpAsn: 0.777 ± 0.421
0.389TrpPro: 0.389 ± 0.482
0.777TrpGln: 0.777 ± 0.524
1.166TrpArg: 1.166 ± 0.693
1.943TrpSer: 1.943 ± 0.858
1.943TrpThr: 1.943 ± 0.644
1.555TrpVal: 1.555 ± 0.882
0.0TrpTrp: 0.0 ± 0.0
0.777TrpTyr: 0.777 ± 0.468
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.498TyrAla: 3.498 ± 0.898
0.389TyrCys: 0.389 ± 0.328
1.943TyrAsp: 1.943 ± 0.79
1.943TyrGlu: 1.943 ± 0.789
0.0TyrPhe: 0.0 ± 0.0
0.777TyrGly: 0.777 ± 0.421
0.777TyrHis: 0.777 ± 0.412
1.555TyrIle: 1.555 ± 0.725
1.943TyrLys: 1.943 ± 1.329
3.498TyrLeu: 3.498 ± 1.09
1.555TyrMet: 1.555 ± 0.67
1.943TyrAsn: 1.943 ± 0.959
0.777TyrPro: 0.777 ± 0.468
2.332TyrGln: 2.332 ± 0.75
2.332TyrArg: 2.332 ± 0.827
1.555TyrSer: 1.555 ± 0.833
1.555TyrThr: 1.555 ± 0.79
1.166TyrVal: 1.166 ± 0.724
0.777TyrTrp: 0.777 ± 0.468
0.777TyrTyr: 0.777 ± 0.468
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 14 proteins (2574 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski