Amino acid dipepetide frequency for Human papillomavirus 204

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.481AlaAla: 5.481 ± 1.629
0.0AlaCys: 0.0 ± 0.0
2.108AlaAsp: 2.108 ± 0.905
3.794AlaGlu: 3.794 ± 1.152
3.794AlaPhe: 3.794 ± 1.133
2.108AlaGly: 2.108 ± 0.724
0.422AlaHis: 0.422 ± 0.511
1.265AlaIle: 1.265 ± 0.454
3.373AlaLys: 3.373 ± 1.491
3.373AlaLeu: 3.373 ± 1.525
1.265AlaMet: 1.265 ± 0.656
1.265AlaAsn: 1.265 ± 0.454
5.059AlaPro: 5.059 ± 1.701
3.794AlaGln: 3.794 ± 2.097
5.481AlaArg: 5.481 ± 2.643
3.373AlaSer: 3.373 ± 1.32
4.216AlaThr: 4.216 ± 0.732
2.108AlaVal: 2.108 ± 0.916
0.843AlaTrp: 0.843 ± 0.399
1.265AlaTyr: 1.265 ± 0.712
0.0AlaXaa: 0.0 ± 0.0
Cys
2.951CysAla: 2.951 ± 2.018
1.265CysCys: 1.265 ± 0.957
0.422CysAsp: 0.422 ± 0.349
0.843CysGlu: 0.843 ± 0.729
0.422CysPhe: 0.422 ± 0.349
0.422CysGly: 0.422 ± 0.349
0.422CysHis: 0.422 ± 0.803
0.0CysIle: 0.0 ± 0.0
1.265CysLys: 1.265 ± 0.378
2.53CysLeu: 2.53 ± 1.551
0.422CysMet: 0.422 ± 0.349
0.843CysAsn: 0.843 ± 0.811
1.686CysPro: 1.686 ± 0.596
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
1.265CysSer: 1.265 ± 2.409
0.843CysThr: 0.843 ± 0.699
0.0CysVal: 0.0 ± 0.0
0.422CysTrp: 0.422 ± 0.349
0.843CysTyr: 0.843 ± 1.088
0.0CysXaa: 0.0 ± 0.0
Asp
0.843AspAla: 0.843 ± 0.399
1.265AspCys: 1.265 ± 0.667
2.108AspAsp: 2.108 ± 1.302
5.481AspGlu: 5.481 ± 1.076
3.373AspPhe: 3.373 ± 1.049
3.373AspGly: 3.373 ± 1.048
1.686AspHis: 1.686 ± 0.704
7.167AspIle: 7.167 ± 2.268
2.53AspLys: 2.53 ± 0.841
7.589AspLeu: 7.589 ± 2.291
0.843AspMet: 0.843 ± 0.399
5.059AspAsn: 5.059 ± 1.363
2.951AspPro: 2.951 ± 1.183
1.265AspGln: 1.265 ± 0.454
0.843AspArg: 0.843 ± 0.736
5.481AspSer: 5.481 ± 1.41
2.951AspThr: 2.951 ± 1.385
2.951AspVal: 2.951 ± 0.697
1.686AspTrp: 1.686 ± 0.687
1.265AspTyr: 1.265 ± 0.378
0.0AspXaa: 0.0 ± 0.0
Glu
2.53GluAla: 2.53 ± 1.171
0.843GluCys: 0.843 ± 0.699
4.637GluAsp: 4.637 ± 1.925
5.481GluGlu: 5.481 ± 4.371
1.686GluPhe: 1.686 ± 0.5
1.686GluGly: 1.686 ± 0.896
1.265GluHis: 1.265 ± 0.454
2.108GluIle: 2.108 ± 0.681
2.108GluLys: 2.108 ± 0.903
6.745GluLeu: 6.745 ± 4.836
2.108GluMet: 2.108 ± 1.348
5.481GluAsn: 5.481 ± 1.343
2.951GluPro: 2.951 ± 0.634
5.481GluGln: 5.481 ± 1.52
3.373GluArg: 3.373 ± 1.312
6.324GluSer: 6.324 ± 1.981
4.637GluThr: 4.637 ± 1.812
2.108GluVal: 2.108 ± 1.168
0.422GluTrp: 0.422 ± 0.349
1.265GluTyr: 1.265 ± 0.786
0.0GluXaa: 0.0 ± 0.0
Phe
3.373PheAla: 3.373 ± 1.308
0.843PheCys: 0.843 ± 0.811
3.794PheAsp: 3.794 ± 0.88
3.373PheGlu: 3.373 ± 1.987
2.108PhePhe: 2.108 ± 0.521
2.108PheGly: 2.108 ± 0.521
1.265PheHis: 1.265 ± 0.461
4.216PheIle: 4.216 ± 0.815
2.951PheLys: 2.951 ± 1.606
4.637PheLeu: 4.637 ± 1.851
0.843PheMet: 0.843 ± 0.461
2.53PheAsn: 2.53 ± 1.366
1.265PhePro: 1.265 ± 0.378
0.422PheGln: 0.422 ± 0.349
2.108PheArg: 2.108 ± 0.687
2.108PheSer: 2.108 ± 1.099
2.108PheThr: 2.108 ± 0.801
2.53PheVal: 2.53 ± 0.832
1.686PheTrp: 1.686 ± 0.969
3.794PheTyr: 3.794 ± 1.328
0.0PheXaa: 0.0 ± 0.0
Gly
1.265GlyAla: 1.265 ± 0.728
0.843GlyCys: 0.843 ± 0.829
4.637GlyAsp: 4.637 ± 1.258
4.216GlyGlu: 4.216 ± 1.59
2.108GlyPhe: 2.108 ± 0.998
5.481GlyGly: 5.481 ± 2.447
0.422GlyHis: 0.422 ± 0.368
2.951GlyIle: 2.951 ± 1.314
3.794GlyLys: 3.794 ± 1.45
2.53GlyLeu: 2.53 ± 1.21
0.843GlyMet: 0.843 ± 0.83
4.637GlyAsn: 4.637 ± 1.012
2.108GlyPro: 2.108 ± 1.02
3.373GlyGln: 3.373 ± 0.858
2.108GlyArg: 2.108 ± 1.411
4.637GlySer: 4.637 ± 1.302
4.216GlyThr: 4.216 ± 1.158
4.637GlyVal: 4.637 ± 1.589
0.422GlyTrp: 0.422 ± 0.349
1.265GlyTyr: 1.265 ± 0.461
0.0GlyXaa: 0.0 ± 0.0
His
0.843HisAla: 0.843 ± 0.736
0.422HisCys: 0.422 ± 0.623
0.843HisAsp: 0.843 ± 0.425
0.422HisGlu: 0.422 ± 0.374
2.108HisPhe: 2.108 ± 0.888
0.0HisGly: 0.0 ± 0.0
0.422HisHis: 0.422 ± 0.415
0.422HisIle: 0.422 ± 0.368
1.686HisLys: 1.686 ± 0.984
1.265HisLeu: 1.265 ± 0.958
0.422HisMet: 0.422 ± 0.482
0.0HisAsn: 0.0 ± 0.0
0.843HisPro: 0.843 ± 0.425
0.843HisGln: 0.843 ± 0.699
1.265HisArg: 1.265 ± 0.958
2.108HisSer: 2.108 ± 1.02
1.265HisThr: 1.265 ± 0.811
2.108HisVal: 2.108 ± 0.743
1.265HisTrp: 1.265 ± 0.854
0.843HisTyr: 0.843 ± 0.466
0.0HisXaa: 0.0 ± 0.0
Ile
3.794IleAla: 3.794 ± 1.436
0.0IleCys: 0.0 ± 0.0
3.373IleAsp: 3.373 ± 1.391
4.216IleGlu: 4.216 ± 0.735
3.794IlePhe: 3.794 ± 1.627
2.951IleGly: 2.951 ± 1.803
1.686IleHis: 1.686 ± 0.5
2.951IleIle: 2.951 ± 0.835
2.108IleLys: 2.108 ± 0.9
3.794IleLeu: 3.794 ± 1.069
0.843IleMet: 0.843 ± 0.592
1.265IleAsn: 1.265 ± 0.547
4.216IlePro: 4.216 ± 2.379
1.686IleGln: 1.686 ± 0.956
1.686IleArg: 1.686 ± 1.219
6.324IleSer: 6.324 ± 2.238
3.794IleThr: 3.794 ± 1.273
2.951IleVal: 2.951 ± 0.896
0.843IleTrp: 0.843 ± 0.839
1.686IleTyr: 1.686 ± 0.754
0.0IleXaa: 0.0 ± 0.0
Lys
2.53LysAla: 2.53 ± 0.918
2.108LysCys: 2.108 ± 0.786
1.686LysAsp: 1.686 ± 0.754
2.53LysGlu: 2.53 ± 0.88
2.951LysPhe: 2.951 ± 1.411
3.373LysGly: 3.373 ± 1.557
1.265LysHis: 1.265 ± 1.048
3.794LysIle: 3.794 ± 0.816
2.951LysLys: 2.951 ± 1.679
4.216LysLeu: 4.216 ± 1.729
0.843LysMet: 0.843 ± 0.399
1.686LysAsn: 1.686 ± 0.969
1.686LysPro: 1.686 ± 1.031
3.794LysGln: 3.794 ± 1.565
3.794LysArg: 3.794 ± 1.273
4.216LysSer: 4.216 ± 2.267
1.686LysThr: 1.686 ± 0.873
1.686LysVal: 1.686 ± 0.796
0.0LysTrp: 0.0 ± 0.0
1.686LysTyr: 1.686 ± 0.683
0.0LysXaa: 0.0 ± 0.0
Leu
4.637LeuAla: 4.637 ± 1.422
2.951LeuCys: 2.951 ± 1.6
6.324LeuAsp: 6.324 ± 1.601
5.481LeuGlu: 5.481 ± 1.871
5.902LeuPhe: 5.902 ± 2.119
6.745LeuGly: 6.745 ± 2.878
1.686LeuHis: 1.686 ± 0.799
3.373LeuIle: 3.373 ± 1.476
4.637LeuLys: 4.637 ± 1.902
10.54LeuLeu: 10.54 ± 3.303
2.108LeuMet: 2.108 ± 1.028
2.53LeuAsn: 2.53 ± 1.012
3.794LeuPro: 3.794 ± 1.328
7.589LeuGln: 7.589 ± 1.227
5.902LeuArg: 5.902 ± 1.2
8.853LeuSer: 8.853 ± 1.769
5.059LeuThr: 5.059 ± 1.713
5.902LeuVal: 5.902 ± 1.69
0.0LeuTrp: 0.0 ± 0.0
1.265LeuTyr: 1.265 ± 0.728
0.0LeuXaa: 0.0 ± 0.0
Met
2.108MetAla: 2.108 ± 0.937
0.0MetCys: 0.0 ± 0.0
0.843MetAsp: 0.843 ± 0.425
0.843MetGlu: 0.843 ± 0.83
0.422MetPhe: 0.422 ± 0.349
0.422MetGly: 0.422 ± 0.374
0.0MetHis: 0.0 ± 0.0
0.422MetIle: 0.422 ± 0.349
0.843MetLys: 0.843 ± 0.52
1.686MetLeu: 1.686 ± 0.664
0.422MetMet: 0.422 ± 0.368
0.843MetAsn: 0.843 ± 0.425
0.0MetPro: 0.0 ± 0.0
0.843MetGln: 0.843 ± 0.51
0.843MetArg: 0.843 ± 0.633
1.265MetSer: 1.265 ± 0.653
0.843MetThr: 0.843 ± 0.699
1.686MetVal: 1.686 ± 0.63
0.0MetTrp: 0.0 ± 0.0
0.422MetTyr: 0.422 ± 0.511
0.0MetXaa: 0.0 ± 0.0
Asn
2.108AsnAla: 2.108 ± 1.348
0.843AsnCys: 0.843 ± 0.448
1.686AsnAsp: 1.686 ± 0.898
2.53AsnGlu: 2.53 ± 0.904
1.265AsnPhe: 1.265 ± 0.776
2.108AsnGly: 2.108 ± 0.907
0.422AsnHis: 0.422 ± 0.415
2.53AsnIle: 2.53 ± 1.495
2.108AsnLys: 2.108 ± 0.687
4.216AsnLeu: 4.216 ± 1.607
0.0AsnMet: 0.0 ± 0.0
2.53AsnAsn: 2.53 ± 1.19
4.216AsnPro: 4.216 ± 1.392
3.373AsnGln: 3.373 ± 1.895
3.794AsnArg: 3.794 ± 1.48
4.637AsnSer: 4.637 ± 1.657
3.373AsnThr: 3.373 ± 1.023
5.902AsnVal: 5.902 ± 1.294
0.422AsnTrp: 0.422 ± 0.349
0.843AsnTyr: 0.843 ± 0.399
0.0AsnXaa: 0.0 ± 0.0
Pro
5.059ProAla: 5.059 ± 1.558
0.843ProCys: 0.843 ± 1.088
3.794ProAsp: 3.794 ± 1.471
3.373ProGlu: 3.373 ± 0.529
1.686ProPhe: 1.686 ± 0.906
2.53ProGly: 2.53 ± 1.504
0.422ProHis: 0.422 ± 0.374
2.53ProIle: 2.53 ± 0.978
4.216ProLys: 4.216 ± 1.013
5.902ProLeu: 5.902 ± 0.992
0.422ProMet: 0.422 ± 0.374
3.373ProAsn: 3.373 ± 1.358
5.059ProPro: 5.059 ± 1.62
1.265ProGln: 1.265 ± 0.454
3.373ProArg: 3.373 ± 1.2
4.637ProSer: 4.637 ± 1.981
5.059ProThr: 5.059 ± 2.432
2.53ProVal: 2.53 ± 1.802
0.422ProTrp: 0.422 ± 0.415
2.53ProTyr: 2.53 ± 1.485
0.0ProXaa: 0.0 ± 0.0
Gln
2.108GlnAla: 2.108 ± 0.796
1.265GlnCys: 1.265 ± 0.811
1.686GlnAsp: 1.686 ± 1.347
5.481GlnGlu: 5.481 ± 1.28
0.843GlnPhe: 0.843 ± 0.466
1.686GlnGly: 1.686 ± 0.683
0.422GlnHis: 0.422 ± 0.415
3.373GlnIle: 3.373 ± 1.45
0.843GlnLys: 0.843 ± 0.699
5.059GlnLeu: 5.059 ± 1.128
0.422GlnMet: 0.422 ± 0.349
3.794GlnAsn: 3.794 ± 1.446
3.794GlnPro: 3.794 ± 0.912
3.794GlnGln: 3.794 ± 2.139
2.53GlnArg: 2.53 ± 1.522
3.373GlnSer: 3.373 ± 0.903
4.216GlnThr: 4.216 ± 1.374
2.951GlnVal: 2.951 ± 0.634
0.422GlnTrp: 0.422 ± 0.349
2.108GlnTyr: 2.108 ± 0.724
0.0GlnXaa: 0.0 ± 0.0
Arg
3.794ArgAla: 3.794 ± 0.995
0.843ArgCys: 0.843 ± 0.829
2.108ArgAsp: 2.108 ± 1.136
3.373ArgGlu: 3.373 ± 0.885
2.951ArgPhe: 2.951 ± 1.291
2.951ArgGly: 2.951 ± 1.394
2.108ArgHis: 2.108 ± 0.839
0.843ArgIle: 0.843 ± 0.448
3.794ArgLys: 3.794 ± 0.73
7.589ArgLeu: 7.589 ± 1.831
0.843ArgMet: 0.843 ± 0.51
2.53ArgAsn: 2.53 ± 0.723
4.216ArgPro: 4.216 ± 1.209
1.265ArgGln: 1.265 ± 0.778
5.902ArgArg: 5.902 ± 1.708
3.373ArgSer: 3.373 ± 1.394
4.216ArgThr: 4.216 ± 1.824
4.637ArgVal: 4.637 ± 0.803
0.843ArgTrp: 0.843 ± 0.811
2.53ArgTyr: 2.53 ± 1.034
0.0ArgXaa: 0.0 ± 0.0
Ser
2.53SerAla: 2.53 ± 1.919
0.0SerCys: 0.0 ± 0.0
6.324SerAsp: 6.324 ± 1.773
2.53SerGlu: 2.53 ± 0.781
4.216SerPhe: 4.216 ± 1.717
5.481SerGly: 5.481 ± 1.736
0.843SerHis: 0.843 ± 0.466
5.059SerIle: 5.059 ± 1.156
4.216SerLys: 4.216 ± 1.604
9.275SerLeu: 9.275 ± 2.07
0.843SerMet: 0.843 ± 0.685
3.373SerAsn: 3.373 ± 1.195
5.902SerPro: 5.902 ± 2.111
3.373SerGln: 3.373 ± 1.253
7.589SerArg: 7.589 ± 2.009
5.902SerSer: 5.902 ± 2.182
5.059SerThr: 5.059 ± 1.867
5.902SerVal: 5.902 ± 1.402
0.0SerTrp: 0.0 ± 0.0
1.686SerTyr: 1.686 ± 0.326
0.0SerXaa: 0.0 ± 0.0
Thr
3.373ThrAla: 3.373 ± 0.767
2.108ThrCys: 2.108 ± 1.486
7.589ThrAsp: 7.589 ± 1.636
3.794ThrGlu: 3.794 ± 1.9
3.373ThrPhe: 3.373 ± 0.732
3.373ThrGly: 3.373 ± 1.168
1.265ThrHis: 1.265 ± 0.663
5.059ThrIle: 5.059 ± 2.227
1.686ThrLys: 1.686 ± 0.683
5.059ThrLeu: 5.059 ± 1.415
0.422ThrMet: 0.422 ± 0.349
3.794ThrAsn: 3.794 ± 1.103
3.373ThrPro: 3.373 ± 1.86
2.53ThrGln: 2.53 ± 0.595
3.373ThrArg: 3.373 ± 0.909
4.216ThrSer: 4.216 ± 1.28
5.059ThrThr: 5.059 ± 1.851
5.902ThrVal: 5.902 ± 2.079
0.422ThrTrp: 0.422 ± 0.415
1.686ThrTyr: 1.686 ± 0.326
0.0ThrXaa: 0.0 ± 0.0
Val
2.951ValAla: 2.951 ± 1.068
0.422ValCys: 0.422 ± 0.349
5.481ValAsp: 5.481 ± 1.514
4.216ValGlu: 4.216 ± 1.393
2.53ValPhe: 2.53 ± 0.986
6.324ValGly: 6.324 ± 0.977
1.686ValHis: 1.686 ± 0.956
3.373ValIle: 3.373 ± 1.002
2.108ValLys: 2.108 ± 0.587
3.794ValLeu: 3.794 ± 1.543
0.422ValMet: 0.422 ± 0.349
1.265ValAsn: 1.265 ± 0.701
3.794ValPro: 3.794 ± 1.357
2.53ValGln: 2.53 ± 0.783
2.951ValArg: 2.951 ± 0.709
6.324ValSer: 6.324 ± 1.9
5.481ValThr: 5.481 ± 1.528
0.843ValVal: 0.843 ± 0.7
1.265ValTrp: 1.265 ± 0.786
1.686ValTyr: 1.686 ± 0.754
0.0ValXaa: 0.0 ± 0.0
Trp
0.843TrpAla: 0.843 ± 0.399
0.0TrpCys: 0.0 ± 0.0
0.422TrpAsp: 0.422 ± 0.349
0.0TrpGlu: 0.0 ± 0.0
0.422TrpPhe: 0.422 ± 0.349
0.0TrpGly: 0.0 ± 0.0
0.843TrpHis: 0.843 ± 0.83
0.843TrpIle: 0.843 ± 0.699
0.843TrpLys: 0.843 ± 0.811
2.108TrpLeu: 2.108 ± 0.747
0.0TrpMet: 0.0 ± 0.0
0.422TrpAsn: 0.422 ± 0.368
0.0TrpPro: 0.0 ± 0.0
1.265TrpGln: 1.265 ± 0.653
1.686TrpArg: 1.686 ± 0.933
0.0TrpSer: 0.0 ± 0.0
1.265TrpThr: 1.265 ± 0.854
0.843TrpVal: 0.843 ± 0.466
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.265TyrAla: 1.265 ± 0.712
0.0TyrCys: 0.0 ± 0.0
0.843TyrAsp: 0.843 ± 0.448
1.265TyrGlu: 1.265 ± 1.092
2.53TyrPhe: 2.53 ± 0.481
3.373TyrGly: 3.373 ± 0.82
0.843TyrHis: 0.843 ± 0.425
1.686TyrIle: 1.686 ± 0.326
0.422TyrLys: 0.422 ± 0.349
3.373TyrLeu: 3.373 ± 0.988
0.0TyrMet: 0.0 ± 0.0
1.686TyrAsn: 1.686 ± 0.68
2.108TyrPro: 2.108 ± 1.109
1.686TyrGln: 1.686 ± 0.683
2.108TyrArg: 2.108 ± 1.273
1.686TyrSer: 1.686 ± 0.68
2.108TyrThr: 2.108 ± 1.128
1.265TyrVal: 1.265 ± 0.683
0.422TyrTrp: 0.422 ± 0.368
2.108TyrTyr: 2.108 ± 0.844
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2373 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski