Amino acid dipepetide frequency for Human papillomavirus type 48

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.483AlaAla: 5.483 ± 1.178
1.687AlaCys: 1.687 ± 0.898
3.796AlaAsp: 3.796 ± 1.224
3.374AlaGlu: 3.374 ± 0.853
2.531AlaPhe: 2.531 ± 0.391
3.374AlaGly: 3.374 ± 1.163
0.422AlaHis: 0.422 ± 0.385
2.952AlaIle: 2.952 ± 1.418
2.531AlaLys: 2.531 ± 0.761
5.061AlaLeu: 5.061 ± 0.95
0.422AlaMet: 0.422 ± 0.371
2.109AlaAsn: 2.109 ± 0.971
3.796AlaPro: 3.796 ± 1.539
1.687AlaGln: 1.687 ± 0.788
4.218AlaArg: 4.218 ± 1.259
4.639AlaSer: 4.639 ± 1.268
5.061AlaThr: 5.061 ± 1.4
2.531AlaVal: 2.531 ± 0.998
0.0AlaTrp: 0.0 ± 0.0
2.531AlaTyr: 2.531 ± 0.596
0.0AlaXaa: 0.0 ± 0.0
Cys
0.422CysAla: 0.422 ± 0.385
1.265CysCys: 1.265 ± 0.824
0.0CysAsp: 0.0 ± 0.0
1.265CysGlu: 1.265 ± 1.114
2.952CysPhe: 2.952 ± 1.538
0.422CysGly: 0.422 ± 0.368
0.422CysHis: 0.422 ± 0.368
2.109CysIle: 2.109 ± 1.115
2.952CysLys: 2.952 ± 1.219
2.109CysLeu: 2.109 ± 0.834
0.0CysMet: 0.0 ± 0.0
0.844CysAsn: 0.844 ± 0.392
1.265CysPro: 1.265 ± 0.791
0.422CysGln: 0.422 ± 0.513
0.844CysArg: 0.844 ± 0.643
0.844CysSer: 0.844 ± 0.511
2.531CysThr: 2.531 ± 1.308
0.422CysVal: 0.422 ± 0.371
1.687CysTrp: 1.687 ± 0.7
1.687CysTyr: 1.687 ± 0.489
0.0CysXaa: 0.0 ± 0.0
Asp
4.218AspAla: 4.218 ± 1.587
0.844AspCys: 0.844 ± 0.362
4.218AspAsp: 4.218 ± 1.595
4.218AspGlu: 4.218 ± 0.956
4.218AspPhe: 4.218 ± 0.389
1.687AspGly: 1.687 ± 0.587
1.687AspHis: 1.687 ± 0.961
5.061AspIle: 5.061 ± 2.136
3.374AspLys: 3.374 ± 1.393
7.17AspLeu: 7.17 ± 1.775
1.265AspMet: 1.265 ± 0.743
3.374AspAsn: 3.374 ± 0.853
5.061AspPro: 5.061 ± 2.043
1.265AspGln: 1.265 ± 0.555
1.265AspArg: 1.265 ± 0.563
6.748AspSer: 6.748 ± 0.637
5.483AspThr: 5.483 ± 1.209
3.374AspVal: 3.374 ± 1.22
0.422AspTrp: 0.422 ± 0.322
2.109AspTyr: 2.109 ± 1.424
0.0AspXaa: 0.0 ± 0.0
Glu
2.952GluAla: 2.952 ± 0.997
0.844GluCys: 0.844 ± 0.743
3.796GluAsp: 3.796 ± 1.126
7.17GluGlu: 7.17 ± 1.546
3.374GluPhe: 3.374 ± 1.136
2.531GluGly: 2.531 ± 1.135
1.265GluHis: 1.265 ± 0.75
2.109GluIle: 2.109 ± 0.372
2.109GluLys: 2.109 ± 1.124
5.905GluLeu: 5.905 ± 1.574
1.265GluMet: 1.265 ± 1.114
5.905GluAsn: 5.905 ± 1.641
3.796GluPro: 3.796 ± 0.665
2.531GluGln: 2.531 ± 0.688
3.374GluArg: 3.374 ± 1.449
6.326GluSer: 6.326 ± 2.461
3.796GluThr: 3.796 ± 0.968
0.844GluVal: 0.844 ± 0.472
1.265GluTrp: 1.265 ± 0.66
1.687GluTyr: 1.687 ± 1.098
0.0GluXaa: 0.0 ± 0.0
Phe
4.218PheAla: 4.218 ± 1.674
2.531PheCys: 2.531 ± 1.342
3.374PheAsp: 3.374 ± 0.419
2.531PheGlu: 2.531 ± 1.203
2.952PhePhe: 2.952 ± 1.023
2.952PheGly: 2.952 ± 0.96
0.422PheHis: 0.422 ± 0.513
2.109PheIle: 2.109 ± 0.895
4.218PheLys: 4.218 ± 2.027
6.748PheLeu: 6.748 ± 1.315
0.422PheMet: 0.422 ± 0.322
2.531PheAsn: 2.531 ± 0.87
2.531PhePro: 2.531 ± 0.546
1.265PheGln: 1.265 ± 0.496
1.265PheArg: 1.265 ± 0.341
2.952PheSer: 2.952 ± 0.428
2.952PheThr: 2.952 ± 0.55
2.109PheVal: 2.109 ± 0.815
1.265PheTrp: 1.265 ± 0.66
2.109PheTyr: 2.109 ± 0.66
0.0PheXaa: 0.0 ± 0.0
Gly
2.109GlyAla: 2.109 ± 0.488
1.265GlyCys: 1.265 ± 0.585
3.796GlyAsp: 3.796 ± 1.419
3.374GlyGlu: 3.374 ± 1.042
0.422GlyPhe: 0.422 ± 0.385
3.374GlyGly: 3.374 ± 1.862
0.844GlyHis: 0.844 ± 0.471
2.952GlyIle: 2.952 ± 0.759
2.531GlyLys: 2.531 ± 0.845
5.061GlyLeu: 5.061 ± 1.498
0.0GlyMet: 0.0 ± 0.0
2.952GlyAsn: 2.952 ± 0.935
3.796GlyPro: 3.796 ± 1.318
1.687GlyGln: 1.687 ± 0.507
3.374GlyArg: 3.374 ± 1.083
6.748GlySer: 6.748 ± 2.043
2.952GlyThr: 2.952 ± 1.068
3.796GlyVal: 3.796 ± 0.943
0.0GlyTrp: 0.0 ± 0.0
1.265GlyTyr: 1.265 ± 0.496
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.422HisCys: 0.422 ± 0.513
0.422HisAsp: 0.422 ± 0.445
0.422HisGlu: 0.422 ± 0.513
0.0HisPhe: 0.0 ± 0.0
0.422HisGly: 0.422 ± 0.419
0.0HisHis: 0.0 ± 0.0
1.687HisIle: 1.687 ± 0.753
1.265HisLys: 1.265 ± 0.75
2.531HisLeu: 2.531 ± 1.361
0.844HisMet: 0.844 ± 0.466
2.531HisAsn: 2.531 ± 0.704
1.687HisPro: 1.687 ± 0.936
0.844HisGln: 0.844 ± 0.602
0.0HisArg: 0.0 ± 0.0
0.0HisSer: 0.0 ± 0.0
0.844HisThr: 0.844 ± 0.471
1.265HisVal: 1.265 ± 0.341
0.0HisTrp: 0.0 ± 0.0
0.422HisTyr: 0.422 ± 0.322
0.0HisXaa: 0.0 ± 0.0
Ile
2.952IleAla: 2.952 ± 1.418
0.844IleCys: 0.844 ± 0.392
2.531IleAsp: 2.531 ± 0.763
2.952IleGlu: 2.952 ± 1.109
1.265IlePhe: 1.265 ± 0.84
4.639IleGly: 4.639 ± 1.553
0.0IleHis: 0.0 ± 0.0
1.687IleIle: 1.687 ± 1.006
0.844IleLys: 0.844 ± 0.392
2.531IleLeu: 2.531 ± 0.682
0.0IleMet: 0.0 ± 0.0
2.531IleAsn: 2.531 ± 0.886
3.374IlePro: 3.374 ± 1.25
4.639IleGln: 4.639 ± 0.729
2.952IleArg: 2.952 ± 1.445
6.326IleSer: 6.326 ± 1.183
5.061IleThr: 5.061 ± 0.992
4.639IleVal: 4.639 ± 0.782
0.844IleTrp: 0.844 ± 0.601
1.687IleTyr: 1.687 ± 0.587
0.0IleXaa: 0.0 ± 0.0
Lys
2.109LysAla: 2.109 ± 0.877
2.952LysCys: 2.952 ± 1.023
3.374LysAsp: 3.374 ± 1.106
3.796LysGlu: 3.796 ± 1.229
2.952LysPhe: 2.952 ± 0.851
1.687LysGly: 1.687 ± 0.627
0.422LysHis: 0.422 ± 0.371
1.687LysIle: 1.687 ± 0.535
2.531LysLys: 2.531 ± 0.821
6.326LysLeu: 6.326 ± 2.021
0.844LysMet: 0.844 ± 0.392
2.531LysAsn: 2.531 ± 0.439
0.844LysPro: 0.844 ± 0.471
2.109LysGln: 2.109 ± 0.66
4.218LysArg: 4.218 ± 0.6
4.218LysSer: 4.218 ± 1.879
3.796LysThr: 3.796 ± 1.141
3.374LysVal: 3.374 ± 0.738
1.265LysTrp: 1.265 ± 0.893
2.952LysTyr: 2.952 ± 0.95
0.0LysXaa: 0.0 ± 0.0
Leu
4.639LeuAla: 4.639 ± 1.297
2.109LeuCys: 2.109 ± 0.663
6.748LeuAsp: 6.748 ± 1.901
6.326LeuGlu: 6.326 ± 0.771
7.17LeuPhe: 7.17 ± 0.745
5.061LeuGly: 5.061 ± 2.011
0.844LeuHis: 0.844 ± 0.534
3.796LeuIle: 3.796 ± 0.89
6.326LeuLys: 6.326 ± 1.918
9.701LeuLeu: 9.701 ± 1.826
1.265LeuMet: 1.265 ± 0.513
4.639LeuAsn: 4.639 ± 0.598
5.061LeuPro: 5.061 ± 0.783
5.483LeuGln: 5.483 ± 1.614
5.061LeuArg: 5.061 ± 0.766
5.483LeuSer: 5.483 ± 1.161
4.218LeuThr: 4.218 ± 0.831
8.013LeuVal: 8.013 ± 0.89
0.844LeuTrp: 0.844 ± 0.472
2.952LeuTyr: 2.952 ± 1.518
0.0LeuXaa: 0.0 ± 0.0
Met
1.265MetAla: 1.265 ± 0.682
0.844MetCys: 0.844 ± 0.743
0.844MetAsp: 0.844 ± 0.471
1.265MetGlu: 1.265 ± 0.756
0.422MetPhe: 0.422 ± 0.371
0.844MetGly: 0.844 ± 0.77
0.0MetHis: 0.0 ± 0.0
0.844MetIle: 0.844 ± 0.743
0.422MetLys: 0.422 ± 0.371
0.422MetLeu: 0.422 ± 0.371
0.0MetMet: 0.0 ± 0.0
0.422MetAsn: 0.422 ± 0.385
0.422MetPro: 0.422 ± 0.371
1.265MetGln: 1.265 ± 0.381
0.422MetArg: 0.422 ± 0.368
1.687MetSer: 1.687 ± 0.723
1.265MetThr: 1.265 ± 0.682
0.844MetVal: 0.844 ± 0.743
0.0MetTrp: 0.0 ± 0.0
0.422MetTyr: 0.422 ± 0.371
0.0MetXaa: 0.0 ± 0.0
Asn
2.109AsnAla: 2.109 ± 0.856
1.687AsnCys: 1.687 ± 0.727
4.218AsnAsp: 4.218 ± 1.225
0.844AsnGlu: 0.844 ± 0.392
2.952AsnPhe: 2.952 ± 0.722
4.218AsnGly: 4.218 ± 1.559
0.0AsnHis: 0.0 ± 0.0
2.952AsnIle: 2.952 ± 1.079
3.374AsnLys: 3.374 ± 0.49
3.796AsnLeu: 3.796 ± 0.434
0.844AsnMet: 0.844 ± 0.602
2.952AsnAsn: 2.952 ± 0.879
2.952AsnPro: 2.952 ± 1.309
1.265AsnGln: 1.265 ± 0.66
2.531AsnArg: 2.531 ± 0.923
5.061AsnSer: 5.061 ± 1.342
4.218AsnThr: 4.218 ± 1.032
3.796AsnVal: 3.796 ± 0.516
1.265AsnTrp: 1.265 ± 0.57
1.265AsnTyr: 1.265 ± 0.341
0.0AsnXaa: 0.0 ± 0.0
Pro
5.483ProAla: 5.483 ± 2.604
0.422ProCys: 0.422 ± 0.385
5.061ProAsp: 5.061 ± 1.259
2.952ProGlu: 2.952 ± 0.801
2.109ProPhe: 2.109 ± 0.703
2.109ProGly: 2.109 ± 0.814
0.844ProHis: 0.844 ± 0.521
2.531ProIle: 2.531 ± 1.417
3.374ProLys: 3.374 ± 0.696
6.326ProLeu: 6.326 ± 1.436
0.844ProMet: 0.844 ± 0.743
2.531ProAsn: 2.531 ± 0.391
4.639ProPro: 4.639 ± 1.883
0.844ProGln: 0.844 ± 0.617
4.639ProArg: 4.639 ± 1.48
4.639ProSer: 4.639 ± 1.079
5.061ProThr: 5.061 ± 1.604
1.265ProVal: 1.265 ± 0.709
0.0ProTrp: 0.0 ± 0.0
2.109ProTyr: 2.109 ± 1.122
0.0ProXaa: 0.0 ± 0.0
Gln
0.422GlnAla: 0.422 ± 0.322
0.422GlnCys: 0.422 ± 0.385
2.531GlnAsp: 2.531 ± 0.923
2.952GlnGlu: 2.952 ± 1.235
3.374GlnPhe: 3.374 ± 1.124
2.109GlnGly: 2.109 ± 0.588
0.844GlnHis: 0.844 ± 0.577
1.687GlnIle: 1.687 ± 0.79
0.844GlnLys: 0.844 ± 0.602
5.483GlnLeu: 5.483 ± 1.148
0.422GlnMet: 0.422 ± 0.385
2.531GlnAsn: 2.531 ± 0.656
1.687GlnPro: 1.687 ± 1.239
1.687GlnGln: 1.687 ± 0.961
2.109GlnArg: 2.109 ± 1.028
1.265GlnSer: 1.265 ± 0.496
2.952GlnThr: 2.952 ± 1.199
2.109GlnVal: 2.109 ± 0.943
0.422GlnTrp: 0.422 ± 0.371
1.265GlnTyr: 1.265 ± 0.797
0.0GlnXaa: 0.0 ± 0.0
Arg
4.218ArgAla: 4.218 ± 1.37
2.109ArgCys: 2.109 ± 0.601
2.952ArgAsp: 2.952 ± 0.455
3.796ArgGlu: 3.796 ± 1.149
1.687ArgPhe: 1.687 ± 0.788
2.952ArgGly: 2.952 ± 1.148
2.109ArgHis: 2.109 ± 1.004
1.687ArgIle: 1.687 ± 0.828
3.796ArgLys: 3.796 ± 0.871
6.748ArgLeu: 6.748 ± 1.458
1.265ArgMet: 1.265 ± 0.533
3.374ArgAsn: 3.374 ± 1.653
3.374ArgPro: 3.374 ± 1.372
1.687ArgGln: 1.687 ± 0.503
6.326ArgArg: 6.326 ± 2.417
3.796ArgSer: 3.796 ± 0.573
1.265ArgThr: 1.265 ± 0.797
2.531ArgVal: 2.531 ± 1.102
0.422ArgTrp: 0.422 ± 0.419
0.422ArgTyr: 0.422 ± 0.385
0.0ArgXaa: 0.0 ± 0.0
Ser
5.483SerAla: 5.483 ± 1.188
0.0SerCys: 0.0 ± 0.0
6.326SerAsp: 6.326 ± 0.552
5.061SerGlu: 5.061 ± 2.138
4.639SerPhe: 4.639 ± 1.41
4.218SerGly: 4.218 ± 1.629
1.687SerHis: 1.687 ± 1.096
4.218SerIle: 4.218 ± 0.884
1.687SerLys: 1.687 ± 0.734
5.905SerLeu: 5.905 ± 1.18
2.531SerMet: 2.531 ± 1.038
2.109SerAsn: 2.109 ± 1.457
2.531SerPro: 2.531 ± 1.215
2.531SerGln: 2.531 ± 0.993
6.326SerArg: 6.326 ± 2.114
6.748SerSer: 6.748 ± 3.299
6.326SerThr: 6.326 ± 1.502
5.061SerVal: 5.061 ± 0.941
1.265SerTrp: 1.265 ± 1.114
2.531SerTyr: 2.531 ± 0.929
0.0SerXaa: 0.0 ± 0.0
Thr
5.061ThrAla: 5.061 ± 1.349
2.531ThrCys: 2.531 ± 1.289
5.905ThrAsp: 5.905 ± 1.222
6.748ThrGlu: 6.748 ± 1.44
2.109ThrPhe: 2.109 ± 0.905
3.374ThrGly: 3.374 ± 1.264
1.265ThrHis: 1.265 ± 0.563
7.17ThrIle: 7.17 ± 1.491
4.639ThrLys: 4.639 ± 1.175
4.218ThrLeu: 4.218 ± 2.246
0.422ThrMet: 0.422 ± 0.371
2.952ThrAsn: 2.952 ± 0.637
6.748ThrPro: 6.748 ± 1.259
1.265ThrGln: 1.265 ± 0.699
2.109ThrArg: 2.109 ± 0.42
3.374ThrSer: 3.374 ± 1.942
4.218ThrThr: 4.218 ± 1.703
4.639ThrVal: 4.639 ± 0.574
0.422ThrTrp: 0.422 ± 0.513
0.844ThrTyr: 0.844 ± 0.392
0.0ThrXaa: 0.0 ± 0.0
Val
2.531ValAla: 2.531 ± 0.483
0.844ValCys: 0.844 ± 1.025
4.639ValAsp: 4.639 ± 0.979
2.531ValGlu: 2.531 ± 1.206
2.531ValPhe: 2.531 ± 1.44
3.374ValGly: 3.374 ± 1.802
1.687ValHis: 1.687 ± 0.567
2.531ValIle: 2.531 ± 0.43
2.952ValLys: 2.952 ± 0.455
5.061ValLeu: 5.061 ± 1.166
0.422ValMet: 0.422 ± 0.385
3.374ValAsn: 3.374 ± 0.696
3.374ValPro: 3.374 ± 1.599
2.952ValGln: 2.952 ± 1.137
2.109ValArg: 2.109 ± 0.592
3.796ValSer: 3.796 ± 1.033
4.218ValThr: 4.218 ± 1.106
1.687ValVal: 1.687 ± 1.006
1.265ValTrp: 1.265 ± 0.806
2.531ValTyr: 2.531 ± 1.44
0.0ValXaa: 0.0 ± 0.0
Trp
0.422TrpAla: 0.422 ± 0.371
0.422TrpCys: 0.422 ± 0.371
1.265TrpAsp: 1.265 ± 0.555
0.0TrpGlu: 0.0 ± 0.0
0.844TrpPhe: 0.844 ± 0.478
0.844TrpGly: 0.844 ± 0.534
0.422TrpHis: 0.422 ± 0.419
0.844TrpIle: 0.844 ± 0.743
1.687TrpLys: 1.687 ± 0.7
1.687TrpLeu: 1.687 ± 0.239
0.0TrpMet: 0.0 ± 0.0
0.422TrpAsn: 0.422 ± 0.371
0.422TrpPro: 0.422 ± 0.385
0.422TrpGln: 0.422 ± 0.385
1.265TrpArg: 1.265 ± 1.06
0.422TrpSer: 0.422 ± 0.385
1.687TrpThr: 1.687 ± 0.7
0.844TrpVal: 0.844 ± 0.743
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.109TyrAla: 2.109 ± 0.778
0.422TyrCys: 0.422 ± 0.513
1.265TyrAsp: 1.265 ± 0.416
1.265TyrGlu: 1.265 ± 0.722
2.952TyrPhe: 2.952 ± 0.85
2.109TyrGly: 2.109 ± 0.694
0.422TyrHis: 0.422 ± 0.513
1.687TyrIle: 1.687 ± 1.036
2.531TyrLys: 2.531 ± 0.439
3.374TyrLeu: 3.374 ± 0.994
0.422TyrMet: 0.422 ± 0.322
1.687TyrAsn: 1.687 ± 0.749
0.422TyrPro: 0.422 ± 0.322
1.265TyrGln: 1.265 ± 0.381
2.109TyrArg: 2.109 ± 0.835
2.109TyrSer: 2.109 ± 0.815
2.109TyrThr: 2.109 ± 0.871
1.265TyrVal: 1.265 ± 0.381
1.265TyrTrp: 1.265 ± 0.752
2.109TyrTyr: 2.109 ± 1.162
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2372 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski