Amino acid dipepetide frequency for Human papillomavirus 9

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.705AlaAla: 1.705 ± 0.216
0.853AlaCys: 0.853 ± 0.766
3.836AlaAsp: 3.836 ± 0.893
3.41AlaGlu: 3.41 ± 1.151
3.836AlaPhe: 3.836 ± 0.925
2.131AlaGly: 2.131 ± 0.886
0.426AlaHis: 0.426 ± 0.362
1.705AlaIle: 1.705 ± 0.593
3.836AlaLys: 3.836 ± 1.108
2.131AlaLeu: 2.131 ± 0.388
0.426AlaMet: 0.426 ± 0.383
4.263AlaAsn: 4.263 ± 1.769
1.279AlaPro: 1.279 ± 0.383
3.836AlaGln: 3.836 ± 0.908
5.541AlaArg: 5.541 ± 1.401
2.558AlaSer: 2.558 ± 0.852
5.115AlaThr: 5.115 ± 1.299
5.968AlaVal: 5.968 ± 1.121
0.426AlaTrp: 0.426 ± 0.362
3.41AlaTyr: 3.41 ± 1.459
0.0AlaXaa: 0.0 ± 0.0
Cys
1.705CysAla: 1.705 ± 1.118
0.853CysCys: 0.853 ± 0.723
0.426CysAsp: 0.426 ± 0.341
0.426CysGlu: 0.426 ± 0.383
1.279CysPhe: 1.279 ± 0.599
1.279CysGly: 1.279 ± 1.678
0.0CysHis: 0.0 ± 0.0
0.853CysIle: 0.853 ± 0.621
2.131CysLys: 2.131 ± 0.678
2.558CysLeu: 2.558 ± 1.233
0.0CysMet: 0.0 ± 0.0
0.426CysAsn: 0.426 ± 0.496
1.705CysPro: 1.705 ± 0.855
0.853CysGln: 0.853 ± 0.723
2.558CysArg: 2.558 ± 1.358
1.705CysSer: 1.705 ± 1.118
0.426CysThr: 0.426 ± 0.362
0.0CysVal: 0.0 ± 0.0
0.426CysTrp: 0.426 ± 0.362
1.705CysTyr: 1.705 ± 1.053
0.0CysXaa: 0.0 ± 0.0
Asp
4.689AspAla: 4.689 ± 1.708
0.0AspCys: 0.0 ± 0.0
2.131AspAsp: 2.131 ± 0.916
4.263AspGlu: 4.263 ± 1.076
1.705AspPhe: 1.705 ± 0.64
4.263AspGly: 4.263 ± 1.011
2.131AspHis: 2.131 ± 0.369
5.541AspIle: 5.541 ± 0.701
1.705AspLys: 1.705 ± 0.704
8.951AspLeu: 8.951 ± 1.582
1.279AspMet: 1.279 ± 0.72
2.131AspAsn: 2.131 ± 0.501
3.836AspPro: 3.836 ± 0.981
2.558AspGln: 2.558 ± 1.373
2.558AspArg: 2.558 ± 0.904
3.41AspSer: 3.41 ± 0.645
4.689AspThr: 4.689 ± 1.384
3.41AspVal: 3.41 ± 1.446
1.279AspTrp: 1.279 ± 0.402
0.853AspTyr: 0.853 ± 0.452
0.0AspXaa: 0.0 ± 0.0
Glu
3.41GluAla: 3.41 ± 1.378
2.131GluCys: 2.131 ± 1.565
5.541GluAsp: 5.541 ± 0.71
5.115GluGlu: 5.115 ± 2.781
1.279GluPhe: 1.279 ± 0.534
5.541GluGly: 5.541 ± 1.85
1.705GluHis: 1.705 ± 0.716
2.131GluIle: 2.131 ± 1.517
2.131GluLys: 2.131 ± 0.792
5.115GluLeu: 5.115 ± 3.062
2.131GluMet: 2.131 ± 0.958
1.279GluAsn: 1.279 ± 0.739
2.984GluPro: 2.984 ± 0.938
3.836GluGln: 3.836 ± 1.193
2.984GluArg: 2.984 ± 1.258
5.115GluSer: 5.115 ± 2.272
5.541GluThr: 5.541 ± 1.157
4.689GluVal: 4.689 ± 0.775
0.426GluTrp: 0.426 ± 0.362
2.558GluTyr: 2.558 ± 1.44
0.0GluXaa: 0.0 ± 0.0
Phe
3.836PheAla: 3.836 ± 1.359
0.853PheCys: 0.853 ± 0.565
3.836PheAsp: 3.836 ± 0.636
4.263PheGlu: 4.263 ± 1.709
2.558PhePhe: 2.558 ± 0.993
1.705PheGly: 1.705 ± 0.216
0.426PheHis: 0.426 ± 0.362
2.131PheIle: 2.131 ± 0.707
2.131PheLys: 2.131 ± 1.054
3.836PheLeu: 3.836 ± 0.944
0.0PheMet: 0.0 ± 0.0
3.41PheAsn: 3.41 ± 0.893
1.705PhePro: 1.705 ± 0.864
2.131PheGln: 2.131 ± 0.642
0.853PheArg: 0.853 ± 0.464
0.426PheSer: 0.426 ± 0.399
1.279PheThr: 1.279 ± 0.599
0.853PheVal: 0.853 ± 0.766
1.705PheTrp: 1.705 ± 0.844
1.705PheTyr: 1.705 ± 0.804
0.0PheXaa: 0.0 ± 0.0
Gly
3.41GlyAla: 3.41 ± 0.977
1.705GlyCys: 1.705 ± 1.405
5.115GlyAsp: 5.115 ± 0.974
6.394GlyGlu: 6.394 ± 2.282
1.279GlyPhe: 1.279 ± 0.72
5.115GlyGly: 5.115 ± 2.324
2.131GlyHis: 2.131 ± 1.224
2.558GlyIle: 2.558 ± 0.952
2.558GlyLys: 2.558 ± 1.233
4.263GlyLeu: 4.263 ± 0.81
0.426GlyMet: 0.426 ± 0.341
3.41GlyAsn: 3.41 ± 1.171
3.41GlyPro: 3.41 ± 1.342
2.131GlyGln: 2.131 ± 0.489
6.394GlyArg: 6.394 ± 2.674
4.263GlySer: 4.263 ± 1.099
6.394GlyThr: 6.394 ± 1.442
3.836GlyVal: 3.836 ± 1.308
0.0GlyTrp: 0.0 ± 0.0
1.705GlyTyr: 1.705 ± 1.157
0.0GlyXaa: 0.0 ± 0.0
His
0.426HisAla: 0.426 ± 0.383
1.279HisCys: 1.279 ± 0.993
0.426HisAsp: 0.426 ± 0.383
0.853HisGlu: 0.853 ± 0.803
0.853HisPhe: 0.853 ± 0.562
1.279HisGly: 1.279 ± 0.685
0.426HisHis: 0.426 ± 0.341
0.853HisIle: 0.853 ± 0.799
1.279HisLys: 1.279 ± 0.809
0.0HisLeu: 0.0 ± 0.0
0.0HisMet: 0.0 ± 0.0
1.279HisAsn: 1.279 ± 0.739
2.131HisPro: 2.131 ± 1.193
0.426HisGln: 0.426 ± 0.362
1.279HisArg: 1.279 ± 0.685
1.705HisSer: 1.705 ± 0.592
0.853HisThr: 0.853 ± 0.452
1.705HisVal: 1.705 ± 0.614
0.853HisTrp: 0.853 ± 0.464
1.705HisTyr: 1.705 ± 0.608
0.0HisXaa: 0.0 ± 0.0
Ile
2.558IleAla: 2.558 ± 0.917
0.426IleCys: 0.426 ± 0.496
4.263IleAsp: 4.263 ± 0.993
4.689IleGlu: 4.689 ± 1.032
1.705IlePhe: 1.705 ± 0.909
2.984IleGly: 2.984 ± 1.221
1.279IleHis: 1.279 ± 0.749
2.131IleIle: 2.131 ± 1.517
0.426IleLys: 0.426 ± 0.383
3.41IleLeu: 3.41 ± 0.842
0.426IleMet: 0.426 ± 0.496
3.41IleAsn: 3.41 ± 0.751
2.558IlePro: 2.558 ± 1.306
1.705IleGln: 1.705 ± 0.919
2.558IleArg: 2.558 ± 0.791
4.263IleSer: 4.263 ± 1.362
0.426IleThr: 0.426 ± 0.362
4.263IleVal: 4.263 ± 1.513
1.279IleTrp: 1.279 ± 0.59
1.279IleTyr: 1.279 ± 0.452
0.0IleXaa: 0.0 ± 0.0
Lys
2.984LysAla: 2.984 ± 0.507
1.279LysCys: 1.279 ± 0.672
3.41LysAsp: 3.41 ± 1.254
3.836LysGlu: 3.836 ± 1.525
2.558LysPhe: 2.558 ± 1.285
2.984LysGly: 2.984 ± 1.375
1.279LysHis: 1.279 ± 0.685
0.853LysIle: 0.853 ± 0.781
2.131LysLys: 2.131 ± 1.316
3.836LysLeu: 3.836 ± 2.146
0.0LysMet: 0.0 ± 0.0
0.853LysAsn: 0.853 ± 0.464
1.705LysPro: 1.705 ± 1.118
1.279LysGln: 1.279 ± 0.686
4.689LysArg: 4.689 ± 0.934
3.41LysSer: 3.41 ± 2.026
1.705LysThr: 1.705 ± 0.674
4.263LysVal: 4.263 ± 1.999
0.0LysTrp: 0.0 ± 0.0
2.984LysTyr: 2.984 ± 0.715
0.0LysXaa: 0.0 ± 0.0
Leu
5.115LeuAla: 5.115 ± 1.788
2.131LeuCys: 2.131 ± 1.166
5.541LeuAsp: 5.541 ± 1.587
4.263LeuGlu: 4.263 ± 1.04
3.836LeuPhe: 3.836 ± 1.562
5.968LeuGly: 5.968 ± 1.698
2.131LeuHis: 2.131 ± 0.876
4.263LeuIle: 4.263 ± 1.877
1.705LeuLys: 1.705 ± 1.019
10.656LeuLeu: 10.656 ± 2.873
2.558LeuMet: 2.558 ± 0.883
0.426LeuAsn: 0.426 ± 0.496
2.984LeuPro: 2.984 ± 1.431
7.673LeuGln: 7.673 ± 0.923
3.836LeuArg: 3.836 ± 1.355
8.525LeuSer: 8.525 ± 2.061
5.968LeuThr: 5.968 ± 1.848
5.541LeuVal: 5.541 ± 1.987
1.279LeuTrp: 1.279 ± 0.72
2.131LeuTyr: 2.131 ± 0.897
0.0LeuXaa: 0.0 ± 0.0
Met
0.853MetAla: 0.853 ± 0.422
0.426MetCys: 0.426 ± 0.362
1.279MetAsp: 1.279 ± 0.672
0.426MetGlu: 0.426 ± 0.341
1.279MetPhe: 1.279 ± 0.72
0.0MetGly: 0.0 ± 0.0
0.0MetHis: 0.0 ± 0.0
1.279MetIle: 1.279 ± 0.865
1.705MetLys: 1.705 ± 1.019
0.853MetLeu: 0.853 ± 0.402
0.0MetMet: 0.0 ± 0.0
0.853MetAsn: 0.853 ± 0.422
0.426MetPro: 0.426 ± 0.362
0.426MetGln: 0.426 ± 0.399
1.279MetArg: 1.279 ± 0.383
3.41MetSer: 3.41 ± 1.019
0.0MetThr: 0.0 ± 0.0
1.279MetVal: 1.279 ± 0.383
0.0MetTrp: 0.0 ± 0.0
0.426MetTyr: 0.426 ± 0.496
0.0MetXaa: 0.0 ± 0.0
Asn
3.41AsnAla: 3.41 ± 1.145
0.853AsnCys: 0.853 ± 0.723
1.705AsnAsp: 1.705 ± 0.216
0.426AsnGlu: 0.426 ± 0.362
2.558AsnPhe: 2.558 ± 0.89
2.131AsnGly: 2.131 ± 0.916
0.853AsnHis: 0.853 ± 0.723
3.41AsnIle: 3.41 ± 0.847
3.836AsnLys: 3.836 ± 1.108
0.426AsnLeu: 0.426 ± 0.753
0.0AsnMet: 0.0 ± 0.0
2.131AsnAsn: 2.131 ± 1.441
2.131AsnPro: 2.131 ± 0.703
1.705AsnGln: 1.705 ± 1.531
1.705AsnArg: 1.705 ± 0.83
3.41AsnSer: 3.41 ± 1.055
2.558AsnThr: 2.558 ± 1.478
2.558AsnVal: 2.558 ± 1.207
0.0AsnTrp: 0.0 ± 0.0
1.279AsnTyr: 1.279 ± 0.402
0.0AsnXaa: 0.0 ± 0.0
Pro
3.836ProAla: 3.836 ± 0.858
1.279ProCys: 1.279 ± 0.599
5.115ProAsp: 5.115 ± 1.641
2.558ProGlu: 2.558 ± 1.357
0.853ProPhe: 0.853 ± 0.766
1.279ProGly: 1.279 ± 1.198
0.426ProHis: 0.426 ± 0.399
2.131ProIle: 2.131 ± 0.741
2.984ProLys: 2.984 ± 1.373
5.541ProLeu: 5.541 ± 1.057
1.279ProMet: 1.279 ± 0.749
0.853ProAsn: 0.853 ± 0.422
6.82ProPro: 6.82 ± 1.192
2.558ProGln: 2.558 ± 1.242
2.131ProArg: 2.131 ± 1.563
2.558ProSer: 2.558 ± 2.397
6.82ProThr: 6.82 ± 2.086
3.836ProVal: 3.836 ± 1.624
0.426ProTrp: 0.426 ± 0.341
0.853ProTyr: 0.853 ± 0.761
0.0ProXaa: 0.0 ± 0.0
Gln
1.279GlnAla: 1.279 ± 0.686
1.279GlnCys: 1.279 ± 0.748
1.705GlnAsp: 1.705 ± 0.568
3.836GlnGlu: 3.836 ± 1.696
3.836GlnPhe: 3.836 ± 1.194
3.836GlnGly: 3.836 ± 0.659
0.853GlnHis: 0.853 ± 0.723
3.41GlnIle: 3.41 ± 1.138
0.853GlnLys: 0.853 ± 0.402
6.82GlnLeu: 6.82 ± 1.555
0.853GlnMet: 0.853 ± 0.422
1.705GlnAsn: 1.705 ± 0.608
1.705GlnPro: 1.705 ± 0.784
2.558GlnGln: 2.558 ± 0.911
4.689GlnArg: 4.689 ± 1.214
3.41GlnSer: 3.41 ± 0.836
3.41GlnThr: 3.41 ± 0.708
2.558GlnVal: 2.558 ± 0.992
0.426GlnTrp: 0.426 ± 0.362
2.984GlnTyr: 2.984 ± 0.885
0.0GlnXaa: 0.0 ± 0.0
Arg
4.689ArgAla: 4.689 ± 1.36
0.426ArgCys: 0.426 ± 0.496
2.558ArgAsp: 2.558 ± 0.992
2.984ArgGlu: 2.984 ± 0.507
2.558ArgPhe: 2.558 ± 0.528
6.82ArgGly: 6.82 ± 2.371
2.131ArgHis: 2.131 ± 1.045
0.853ArgIle: 0.853 ± 0.766
4.689ArgLys: 4.689 ± 0.933
7.673ArgLeu: 7.673 ± 1.857
0.426ArgMet: 0.426 ± 0.341
1.705ArgAsn: 1.705 ± 0.832
2.984ArgPro: 2.984 ± 1.489
4.263ArgGln: 4.263 ± 1.476
6.82ArgArg: 6.82 ± 2.176
7.246ArgSer: 7.246 ± 4.056
5.968ArgThr: 5.968 ± 1.538
3.41ArgVal: 3.41 ± 1.006
0.0ArgTrp: 0.0 ± 0.0
2.558ArgTyr: 2.558 ± 0.528
0.0ArgXaa: 0.0 ± 0.0
Ser
3.41SerAla: 3.41 ± 0.895
0.853SerCys: 0.853 ± 0.6
4.263SerAsp: 4.263 ± 2.161
3.836SerGlu: 3.836 ± 1.621
3.41SerPhe: 3.41 ± 1.009
6.394SerGly: 6.394 ± 2.283
0.426SerHis: 0.426 ± 0.341
1.705SerIle: 1.705 ± 1.12
2.131SerLys: 2.131 ± 0.789
7.246SerLeu: 7.246 ± 1.341
1.279SerMet: 1.279 ± 1.085
2.558SerAsn: 2.558 ± 1.709
3.836SerPro: 3.836 ± 1.516
2.558SerGln: 2.558 ± 0.459
8.951SerArg: 8.951 ± 3.687
6.394SerSer: 6.394 ± 1.825
7.246SerThr: 7.246 ± 2.253
4.689SerVal: 4.689 ± 0.97
1.279SerTrp: 1.279 ± 0.59
1.705SerTyr: 1.705 ± 0.823
0.0SerXaa: 0.0 ± 0.0
Thr
2.984ThrAla: 2.984 ± 0.977
1.705ThrCys: 1.705 ± 0.475
4.263ThrAsp: 4.263 ± 1.554
5.968ThrGlu: 5.968 ± 1.929
0.853ThrPhe: 0.853 ± 0.723
3.41ThrGly: 3.41 ± 1.485
0.853ThrHis: 0.853 ± 0.565
4.263ThrIle: 4.263 ± 1.521
2.558ThrLys: 2.558 ± 1.219
3.836ThrLeu: 3.836 ± 1.484
2.984ThrMet: 2.984 ± 1.117
2.131ThrAsn: 2.131 ± 0.867
5.541ThrPro: 5.541 ± 3.23
2.984ThrGln: 2.984 ± 1.694
3.836ThrArg: 3.836 ± 0.954
6.394ThrSer: 6.394 ± 1.973
4.263ThrThr: 4.263 ± 1.715
8.099ThrVal: 8.099 ± 1.295
1.279ThrTrp: 1.279 ± 0.652
1.279ThrTyr: 1.279 ± 0.718
0.0ThrXaa: 0.0 ± 0.0
Val
2.984ValAla: 2.984 ± 1.295
2.558ValCys: 2.558 ± 1.437
4.263ValAsp: 4.263 ± 1.809
5.541ValGlu: 5.541 ± 1.779
1.705ValPhe: 1.705 ± 0.671
6.82ValGly: 6.82 ± 1.385
1.279ValHis: 1.279 ± 0.452
2.558ValIle: 2.558 ± 0.958
3.41ValLys: 3.41 ± 1.24
3.836ValLeu: 3.836 ± 1.844
0.853ValMet: 0.853 ± 0.672
2.558ValAsn: 2.558 ± 0.459
4.263ValPro: 4.263 ± 1.222
4.689ValGln: 4.689 ± 1.36
5.541ValArg: 5.541 ± 1.22
3.41ValSer: 3.41 ± 1.187
3.836ValThr: 3.836 ± 1.604
2.984ValVal: 2.984 ± 1.613
0.853ValTrp: 0.853 ± 0.464
2.984ValTyr: 2.984 ± 1.076
0.0ValXaa: 0.0 ± 0.0
Trp
0.426TrpAla: 0.426 ± 0.383
0.0TrpCys: 0.0 ± 0.0
0.426TrpAsp: 0.426 ± 0.383
1.279TrpGlu: 1.279 ± 0.652
0.0TrpPhe: 0.0 ± 0.0
0.426TrpGly: 0.426 ± 0.383
0.0TrpHis: 0.0 ± 0.0
0.853TrpIle: 0.853 ± 0.723
1.279TrpLys: 1.279 ± 0.599
2.131TrpLeu: 2.131 ± 1.08
0.0TrpMet: 0.0 ± 0.0
0.426TrpAsn: 0.426 ± 0.341
0.0TrpPro: 0.0 ± 0.0
0.853TrpGln: 0.853 ± 0.464
0.0TrpArg: 0.0 ± 0.0
1.279TrpSer: 1.279 ± 0.652
1.279TrpThr: 1.279 ± 0.59
0.853TrpVal: 0.853 ± 0.402
0.0TrpTrp: 0.0 ± 0.0
0.426TrpTyr: 0.426 ± 0.362
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.984TyrAla: 2.984 ± 0.973
0.426TyrCys: 0.426 ± 0.496
1.279TyrAsp: 1.279 ± 0.402
1.279TyrGlu: 1.279 ± 1.083
1.705TyrPhe: 1.705 ± 0.78
2.131TyrGly: 2.131 ± 0.761
0.853TyrHis: 0.853 ± 0.454
2.558TyrIle: 2.558 ± 0.719
2.558TyrLys: 2.558 ± 1.591
3.41TyrLeu: 3.41 ± 0.815
0.853TyrMet: 0.853 ± 0.723
1.279TyrAsn: 1.279 ± 0.739
2.131TyrPro: 2.131 ± 0.953
2.984TyrGln: 2.984 ± 0.981
2.558TyrArg: 2.558 ± 0.903
1.279TyrSer: 1.279 ± 0.452
1.705TyrThr: 1.705 ± 1.157
2.558TyrVal: 2.558 ± 0.752
0.0TyrTrp: 0.0 ± 0.0
3.41TyrTyr: 3.41 ± 1.477
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2347 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski