Amino acid dipepetide frequency for Human papillomavirus type 16

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.771AlaAla: 5.771 ± 1.285
1.539AlaCys: 1.539 ± 0.763
3.463AlaAsp: 3.463 ± 0.956
2.693AlaGlu: 2.693 ± 1.052
2.693AlaPhe: 2.693 ± 0.899
4.232AlaGly: 4.232 ± 1.262
1.154AlaHis: 1.154 ± 0.698
3.848AlaIle: 3.848 ± 1.026
3.848AlaLys: 3.848 ± 1.467
3.463AlaLeu: 3.463 ± 1.157
1.539AlaMet: 1.539 ± 0.441
3.463AlaAsn: 3.463 ± 1.146
3.463AlaPro: 3.463 ± 1.292
1.539AlaGln: 1.539 ± 0.681
1.539AlaArg: 1.539 ± 0.621
3.078AlaSer: 3.078 ± 1.018
3.848AlaThr: 3.848 ± 0.827
4.232AlaVal: 4.232 ± 1.283
0.0AlaTrp: 0.0 ± 0.0
1.924AlaTyr: 1.924 ± 0.641
0.0AlaXaa: 0.0 ± 0.0
Cys
1.154CysAla: 1.154 ± 0.409
1.539CysCys: 1.539 ± 0.882
1.924CysAsp: 1.924 ± 1.147
0.385CysGlu: 0.385 ± 0.322
1.539CysPhe: 1.539 ± 1.325
0.385CysGly: 0.385 ± 0.304
0.77CysHis: 0.77 ± 0.633
2.693CysIle: 2.693 ± 1.095
2.309CysLys: 2.309 ± 1.139
2.309CysLeu: 2.309 ± 0.887
0.77CysMet: 0.77 ± 0.552
1.539CysAsn: 1.539 ± 0.752
2.309CysPro: 2.309 ± 0.51
1.924CysGln: 1.924 ± 0.706
0.77CysArg: 0.77 ± 0.552
1.539CysSer: 1.539 ± 0.981
2.693CysThr: 2.693 ± 0.959
3.078CysVal: 3.078 ± 1.17
0.77CysTrp: 0.77 ± 0.34
1.154CysTyr: 1.154 ± 0.927
0.0CysXaa: 0.0 ± 0.0
Asp
3.848AspAla: 3.848 ± 0.878
1.154AspCys: 1.154 ± 0.536
3.463AspAsp: 3.463 ± 1.177
1.924AspGlu: 1.924 ± 0.906
2.309AspPhe: 2.309 ± 0.833
3.078AspGly: 3.078 ± 0.996
0.385AspHis: 0.385 ± 0.322
4.232AspIle: 4.232 ± 1.38
1.924AspLys: 1.924 ± 1.094
4.232AspLeu: 4.232 ± 1.61
0.385AspMet: 0.385 ± 0.359
3.078AspAsn: 3.078 ± 1.187
4.232AspPro: 4.232 ± 1.685
2.309AspGln: 2.309 ± 1.041
0.77AspArg: 0.77 ± 0.504
6.156AspSer: 6.156 ± 1.933
6.541AspThr: 6.541 ± 1.568
2.309AspVal: 2.309 ± 0.889
1.154AspTrp: 1.154 ± 0.536
1.924AspTyr: 1.924 ± 0.812
0.0AspXaa: 0.0 ± 0.0
Glu
1.539GluAla: 1.539 ± 0.558
1.154GluCys: 1.154 ± 0.594
3.463GluAsp: 3.463 ± 1.494
4.232GluGlu: 4.232 ± 1.017
0.77GluPhe: 0.77 ± 0.608
2.693GluGly: 2.693 ± 1.072
0.385GluHis: 0.385 ± 0.359
2.309GluIle: 2.309 ± 0.761
3.078GluLys: 3.078 ± 1.134
3.078GluLeu: 3.078 ± 0.939
0.385GluMet: 0.385 ± 0.322
2.693GluAsn: 2.693 ± 1.045
1.924GluPro: 1.924 ± 0.596
1.539GluGln: 1.539 ± 0.672
0.77GluArg: 0.77 ± 0.552
0.385GluSer: 0.385 ± 0.304
6.541GluThr: 6.541 ± 1.427
4.232GluVal: 4.232 ± 1.258
1.154GluTrp: 1.154 ± 0.6
1.924GluTyr: 1.924 ± 1.22
0.0GluXaa: 0.0 ± 0.0
Phe
0.385PheAla: 0.385 ± 0.5
0.77PheCys: 0.77 ± 0.578
0.77PheAsp: 0.77 ± 0.41
0.77PheGlu: 0.77 ± 0.608
1.539PhePhe: 1.539 ± 0.585
2.693PheGly: 2.693 ± 0.864
0.385PheHis: 0.385 ± 0.5
2.309PheIle: 2.309 ± 0.93
3.078PheLys: 3.078 ± 0.995
4.617PheLeu: 4.617 ± 1.568
1.154PheMet: 1.154 ± 0.6
1.924PheAsn: 1.924 ± 0.662
2.309PhePro: 2.309 ± 1.26
0.77PheGln: 0.77 ± 0.555
1.154PheArg: 1.154 ± 0.749
2.693PheSer: 2.693 ± 0.883
2.309PheThr: 2.309 ± 0.671
2.693PheVal: 2.693 ± 0.954
0.77PheTrp: 0.77 ± 0.34
1.924PheTyr: 1.924 ± 0.558
0.0PheXaa: 0.0 ± 0.0
Gly
2.309GlyAla: 2.309 ± 0.939
1.924GlyCys: 1.924 ± 0.8
4.617GlyAsp: 4.617 ± 0.777
2.309GlyGlu: 2.309 ± 1.021
2.309GlyPhe: 2.309 ± 0.873
4.232GlyGly: 4.232 ± 1.524
2.309GlyHis: 2.309 ± 1.031
3.463GlyIle: 3.463 ± 0.913
2.309GlyLys: 2.309 ± 0.671
3.463GlyLeu: 3.463 ± 1.022
1.154GlyMet: 1.154 ± 0.912
3.078GlyAsn: 3.078 ± 0.662
1.154GlyPro: 1.154 ± 0.775
1.924GlyGln: 1.924 ± 0.705
3.078GlyArg: 3.078 ± 0.985
4.617GlySer: 4.617 ± 1.276
5.002GlyThr: 5.002 ± 0.812
2.693GlyVal: 2.693 ± 0.894
0.385GlyTrp: 0.385 ± 0.304
1.539GlyTyr: 1.539 ± 0.698
0.0GlyXaa: 0.0 ± 0.0
His
1.924HisAla: 1.924 ± 0.491
0.77HisCys: 0.77 ± 0.633
0.385HisAsp: 0.385 ± 0.5
1.539HisGlu: 1.539 ± 0.704
0.77HisPhe: 0.77 ± 0.397
1.154HisGly: 1.154 ± 0.603
0.0HisHis: 0.0 ± 0.0
1.539HisIle: 1.539 ± 0.972
1.924HisLys: 1.924 ± 1.256
3.078HisLeu: 3.078 ± 1.52
0.385HisMet: 0.385 ± 0.322
2.693HisAsn: 2.693 ± 0.656
2.309HisPro: 2.309 ± 0.862
0.77HisGln: 0.77 ± 0.565
2.309HisArg: 2.309 ± 0.738
0.77HisSer: 0.77 ± 0.34
3.078HisThr: 3.078 ± 1.326
0.385HisVal: 0.385 ± 0.421
1.154HisTrp: 1.154 ± 0.645
2.309HisTyr: 2.309 ± 0.717
0.0HisXaa: 0.0 ± 0.0
Ile
2.693IleAla: 2.693 ± 1.012
3.463IleCys: 3.463 ± 1.129
2.309IleAsp: 2.309 ± 1.014
1.924IleGlu: 1.924 ± 0.89
1.154IlePhe: 1.154 ± 0.867
2.309IleGly: 2.309 ± 0.939
1.539IleHis: 1.539 ± 0.945
2.693IleIle: 2.693 ± 1.338
2.309IleLys: 2.309 ± 0.933
5.002IleLeu: 5.002 ± 1.114
0.385IleMet: 0.385 ± 0.575
3.848IleAsn: 3.848 ± 1.158
5.771IlePro: 5.771 ± 3.167
1.924IleGln: 1.924 ± 0.556
3.078IleArg: 3.078 ± 1.331
3.848IleSer: 3.848 ± 1.142
4.232IleThr: 4.232 ± 1.661
6.541IleVal: 6.541 ± 1.516
0.0IleTrp: 0.0 ± 0.0
1.924IleTyr: 1.924 ± 0.987
0.0IleXaa: 0.0 ± 0.0
Lys
3.078LysAla: 3.078 ± 0.914
2.693LysCys: 2.693 ± 1.027
1.154LysAsp: 1.154 ± 0.538
1.924LysGlu: 1.924 ± 0.947
2.693LysPhe: 2.693 ± 1.215
2.309LysGly: 2.309 ± 1.29
3.848LysHis: 3.848 ± 1.668
3.078LysIle: 3.078 ± 0.772
4.232LysLys: 4.232 ± 0.99
4.232LysLeu: 4.232 ± 1.08
0.385LysMet: 0.385 ± 0.359
2.309LysAsn: 2.309 ± 0.918
3.078LysPro: 3.078 ± 1.813
3.848LysGln: 3.848 ± 1.477
5.002LysArg: 5.002 ± 1.302
3.848LysSer: 3.848 ± 1.408
3.078LysThr: 3.078 ± 1.123
2.309LysVal: 2.309 ± 1.07
0.77LysTrp: 0.77 ± 0.633
2.693LysTyr: 2.693 ± 0.884
0.0LysXaa: 0.0 ± 0.0
Leu
4.617LeuAla: 4.617 ± 0.678
5.002LeuCys: 5.002 ± 2.19
3.848LeuAsp: 3.848 ± 0.909
3.463LeuGlu: 3.463 ± 1.518
2.693LeuPhe: 2.693 ± 1.067
3.848LeuGly: 3.848 ± 1.266
3.848LeuHis: 3.848 ± 1.145
3.848LeuIle: 3.848 ± 1.844
6.156LeuLys: 6.156 ± 1.788
9.619LeuLeu: 9.619 ± 3.913
1.924LeuMet: 1.924 ± 0.733
1.924LeuAsn: 1.924 ± 0.659
2.309LeuPro: 2.309 ± 0.843
8.08LeuGln: 8.08 ± 1.412
4.617LeuArg: 4.617 ± 0.384
5.002LeuSer: 5.002 ± 1.578
6.156LeuThr: 6.156 ± 1.514
4.617LeuVal: 4.617 ± 1.685
0.77LeuTrp: 0.77 ± 0.59
5.387LeuTyr: 5.387 ± 1.028
0.0LeuXaa: 0.0 ± 0.0
Met
1.154MetAla: 1.154 ± 0.793
0.77MetCys: 0.77 ± 0.608
1.539MetAsp: 1.539 ± 0.571
0.385MetGlu: 0.385 ± 0.322
1.154MetPhe: 1.154 ± 0.506
1.154MetGly: 1.154 ± 0.536
1.154MetHis: 1.154 ± 0.636
0.385MetIle: 0.385 ± 0.304
0.385MetLys: 0.385 ± 0.304
1.924MetLeu: 1.924 ± 0.95
0.385MetMet: 0.385 ± 0.304
0.385MetAsn: 0.385 ± 0.359
0.0MetPro: 0.0 ± 0.0
0.0MetGln: 0.0 ± 0.0
0.77MetArg: 0.77 ± 0.446
3.463MetSer: 3.463 ± 0.921
0.77MetThr: 0.77 ± 0.59
1.924MetVal: 1.924 ± 0.947
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.309AsnAla: 2.309 ± 1.072
1.539AsnCys: 1.539 ± 0.82
3.848AsnAsp: 3.848 ± 2.037
1.924AsnGlu: 1.924 ± 0.826
0.77AsnPhe: 0.77 ± 0.718
1.924AsnGly: 1.924 ± 0.733
1.154AsnHis: 1.154 ± 0.796
3.848AsnIle: 3.848 ± 1.123
3.848AsnLys: 3.848 ± 1.512
1.924AsnLeu: 1.924 ± 0.772
0.385AsnMet: 0.385 ± 0.359
2.693AsnAsn: 2.693 ± 1.103
3.463AsnPro: 3.463 ± 0.794
0.385AsnGln: 0.385 ± 0.359
1.539AsnArg: 1.539 ± 0.681
3.848AsnSer: 3.848 ± 1.099
5.771AsnThr: 5.771 ± 1.132
2.309AsnVal: 2.309 ± 0.771
0.77AsnTrp: 0.77 ± 0.41
1.154AsnTyr: 1.154 ± 0.335
0.0AsnXaa: 0.0 ± 0.0
Pro
5.387ProAla: 5.387 ± 2.083
1.924ProCys: 1.924 ± 0.556
5.002ProAsp: 5.002 ± 2.023
2.309ProGlu: 2.309 ± 1.015
1.154ProPhe: 1.154 ± 0.619
1.539ProGly: 1.539 ± 0.842
0.0ProHis: 0.0 ± 0.0
4.232ProIle: 4.232 ± 1.369
3.848ProLys: 3.848 ± 1.036
7.695ProLeu: 7.695 ± 1.781
1.539ProMet: 1.539 ± 0.641
1.539ProAsn: 1.539 ± 0.571
5.387ProPro: 5.387 ± 1.972
1.154ProGln: 1.154 ± 0.975
1.924ProArg: 1.924 ± 0.986
5.002ProSer: 5.002 ± 2.756
5.387ProThr: 5.387 ± 1.802
2.693ProVal: 2.693 ± 1.228
0.385ProTrp: 0.385 ± 0.455
2.309ProTyr: 2.309 ± 0.843
0.0ProXaa: 0.0 ± 0.0
Gln
3.463GlnAla: 3.463 ± 0.924
0.0GlnCys: 0.0 ± 0.0
1.924GlnAsp: 1.924 ± 0.886
0.77GlnGlu: 0.77 ± 0.552
2.309GlnPhe: 2.309 ± 1.125
1.539GlnGly: 1.539 ± 0.803
1.539GlnHis: 1.539 ± 0.711
1.154GlnIle: 1.154 ± 0.335
1.539GlnLys: 1.539 ± 1.078
4.232GlnLeu: 4.232 ± 1.345
1.539GlnMet: 1.539 ± 0.803
0.385GlnAsn: 0.385 ± 0.304
1.924GlnPro: 1.924 ± 0.747
1.924GlnGln: 1.924 ± 1.032
3.848GlnArg: 3.848 ± 1.593
1.924GlnSer: 1.924 ± 1.167
3.463GlnThr: 3.463 ± 0.671
3.078GlnVal: 3.078 ± 1.24
1.154GlnTrp: 1.154 ± 0.736
3.078GlnTyr: 3.078 ± 0.692
0.0GlnXaa: 0.0 ± 0.0
Arg
2.309ArgAla: 2.309 ± 0.747
1.924ArgCys: 1.924 ± 1.372
3.078ArgAsp: 3.078 ± 1.738
2.309ArgGlu: 2.309 ± 0.997
2.309ArgPhe: 2.309 ± 0.887
0.77ArgGly: 0.77 ± 0.555
3.078ArgHis: 3.078 ± 0.927
1.539ArgIle: 1.539 ± 0.628
3.078ArgLys: 3.078 ± 1.145
4.617ArgLeu: 4.617 ± 0.926
0.0ArgMet: 0.0 ± 0.0
0.385ArgAsn: 0.385 ± 0.304
4.232ArgPro: 4.232 ± 1.279
1.154ArgGln: 1.154 ± 0.6
2.693ArgArg: 2.693 ± 1.013
2.693ArgSer: 2.693 ± 0.656
4.232ArgThr: 4.232 ± 1.496
0.77ArgVal: 0.77 ± 0.34
0.77ArgTrp: 0.77 ± 0.552
1.924ArgTyr: 1.924 ± 0.669
0.0ArgXaa: 0.0 ± 0.0
Ser
3.848SerAla: 3.848 ± 1.162
0.77SerCys: 0.77 ± 0.748
2.693SerAsp: 2.693 ± 0.812
5.771SerGlu: 5.771 ± 1.273
2.309SerPhe: 2.309 ± 1.014
6.156SerGly: 6.156 ± 1.838
1.539SerHis: 1.539 ± 0.596
4.232SerIle: 4.232 ± 1.747
2.309SerLys: 2.309 ± 0.759
6.156SerLeu: 6.156 ± 0.938
2.309SerMet: 2.309 ± 0.84
5.002SerAsn: 5.002 ± 1.534
3.463SerPro: 3.463 ± 0.644
4.232SerGln: 4.232 ± 1.29
4.232SerArg: 4.232 ± 1.205
7.311SerSer: 7.311 ± 2.379
10.773SerThr: 10.773 ± 2.764
3.463SerVal: 3.463 ± 1.258
0.385SerTrp: 0.385 ± 0.304
0.385SerTyr: 0.385 ± 0.341
0.0SerXaa: 0.0 ± 0.0
Thr
6.541ThrAla: 6.541 ± 1.332
1.154ThrCys: 1.154 ± 0.674
3.463ThrAsp: 3.463 ± 1.551
3.078ThrGlu: 3.078 ± 0.984
2.693ThrPhe: 2.693 ± 0.894
7.311ThrGly: 7.311 ± 1.863
3.078ThrHis: 3.078 ± 1.133
5.771ThrIle: 5.771 ± 1.836
3.078ThrLys: 3.078 ± 1.522
10.389ThrLeu: 10.389 ± 2.2
1.154ThrMet: 1.154 ± 0.537
4.232ThrAsn: 4.232 ± 1.165
7.695ThrPro: 7.695 ± 1.882
3.848ThrGln: 3.848 ± 0.558
1.539ThrArg: 1.539 ± 0.686
8.08ThrSer: 8.08 ± 3.173
10.004ThrThr: 10.004 ± 3.182
5.771ThrVal: 5.771 ± 1.077
1.539ThrTrp: 1.539 ± 0.746
3.463ThrTyr: 3.463 ± 1.216
0.0ThrXaa: 0.0 ± 0.0
Val
2.309ValAla: 2.309 ± 0.861
1.539ValCys: 1.539 ± 0.751
5.387ValAsp: 5.387 ± 0.699
3.848ValGlu: 3.848 ± 1.314
1.539ValPhe: 1.539 ± 0.377
2.309ValGly: 2.309 ± 1.732
1.924ValHis: 1.924 ± 1.038
1.539ValIle: 1.539 ± 0.542
2.693ValLys: 2.693 ± 1.146
3.078ValLeu: 3.078 ± 1.332
0.77ValMet: 0.77 ± 0.34
1.539ValAsn: 1.539 ± 0.681
3.848ValPro: 3.848 ± 1.48
2.309ValGln: 2.309 ± 1.036
1.539ValArg: 1.539 ± 0.482
9.234ValSer: 9.234 ± 1.427
4.617ValThr: 4.617 ± 1.406
3.078ValVal: 3.078 ± 0.804
0.77ValTrp: 0.77 ± 0.457
3.463ValTyr: 3.463 ± 1.67
0.0ValXaa: 0.0 ± 0.0
Trp
1.154TrpAla: 1.154 ± 0.558
0.385TrpCys: 0.385 ± 0.304
0.0TrpAsp: 0.0 ± 0.0
0.77TrpGlu: 0.77 ± 0.457
0.385TrpPhe: 0.385 ± 0.304
1.154TrpGly: 1.154 ± 0.63
0.77TrpHis: 0.77 ± 0.633
1.154TrpIle: 1.154 ± 0.684
1.539TrpLys: 1.539 ± 0.711
1.154TrpLeu: 1.154 ± 0.63
0.0TrpMet: 0.0 ± 0.0
0.77TrpAsn: 0.77 ± 0.34
0.77TrpPro: 0.77 ± 0.531
0.77TrpGln: 0.77 ± 0.633
0.0TrpArg: 0.0 ± 0.0
0.385TrpSer: 0.385 ± 0.304
2.309TrpThr: 2.309 ± 1.239
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.385TrpTyr: 0.385 ± 0.304
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.539TyrAla: 1.539 ± 0.412
1.539TyrCys: 1.539 ± 1.063
3.078TyrAsp: 3.078 ± 0.86
1.924TyrGlu: 1.924 ± 0.734
1.924TyrPhe: 1.924 ± 0.849
3.078TyrGly: 3.078 ± 0.939
0.385TyrHis: 0.385 ± 0.359
3.463TyrIle: 3.463 ± 1.204
3.078TyrLys: 3.078 ± 0.919
3.463TyrLeu: 3.463 ± 0.987
0.77TyrMet: 0.77 ± 0.522
1.924TyrAsn: 1.924 ± 0.638
0.77TyrPro: 0.77 ± 0.585
0.385TyrGln: 0.385 ± 0.304
2.693TyrArg: 2.693 ± 1.131
3.463TyrSer: 3.463 ± 1.185
3.078TyrThr: 3.078 ± 1.14
1.154TyrVal: 1.154 ± 0.409
1.154TyrTrp: 1.154 ± 0.409
2.693TyrTyr: 2.693 ± 1.281
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (2600 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski