Amino acid dipeptide frequency for Human papillomavirus 126

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
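The counting step can be illustrated with a minimal Python sketch. It is not the database's own code: the function name `dipeptide_per_mille`, the use of one-letter residue codes, and the choice to count pairs within each protein separately are all assumptions made for illustration, and the sequences are made up.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Hypothetical helper: pairs are counted in overlapping windows of
    length 2 and never span two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Made-up toy sequences (one-letter codes), not HPV 126 proteins.
toy = ["MKLLA", "MKA"]
freqs = dipeptide_per_mille(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

Because every window is counted once, the frequencies of all observed dipeptides sum to 1000‰.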

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred at random in proteins, each one would therefore be expected at a frequency of 0.25% (1/400, i.e. 2.5‰). This is not the case in nature, so for better visibility, dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
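The arithmetic behind this expectation can be sketched in a few lines of Python. The function names are hypothetical, and comparing each value against the uniform expectation is only an assumption about how over- and underrepresentation might be decided; the page does not state the exact threshold it uses, and the sketch ignores the error estimates.

```python
def uniform_expectation_per_mille(alphabet_size=20):
    """Expected per-mille frequency if all ordered pairs were equally likely."""
    n_pairs = alphabet_size ** 2  # 400 for 20 residues, 441 with Xaa
    return 1000.0 / n_pairs


def classify(observed_per_mille, alphabet_size=20):
    """Label a value relative to the uniform expectation (assumed rule)."""
    expected = uniform_expectation_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


print(uniform_expectation_per_mille(20))  # 2.5
print(classify(10.254))  # LeuSer value from the table
print(classify(0.41))    # AlaHis value from the table
```

With 21 symbols the expectation drops to 1000/441, roughly 2.27‰, so the choice of alphabet slightly shifts which borderline values count as overrepresented.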

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
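As a small illustration of this conversion, the following Python sketch parses one table entry of the form `AlaAla: 3.692 ± 2.604` and rescales both the value and its error to plain fractions. The regular expression and the function name `parse_cell` are assumptions based on how the entries appear on this page, not a documented format.

```python
import re

# Two three-letter residue codes, then "value ± error" (assumed layout).
CELL = re.compile(r"([A-Z][a-z]{2})([A-Z][a-z]{2}): ([\d.]+) ± ([\d.]+)")


def parse_cell(text):
    """Return (first, second, value, error) with per mille scaled by 1e-3."""
    match = CELL.search(text)
    if match is None:
        raise ValueError(f"unrecognised table entry: {text!r}")
    first, second, value, error = match.groups()
    return first, second, float(value) * 1e-3, float(error) * 1e-3


first, second, value, error = parse_cell("3.692AlaAla: 3.692 ± 2.604")
print(first, second, value, error)
```

Using `search` rather than `match` lets the pattern skip the leading copy of the value that precedes each pair name in the table.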

For more information, see the "Sequence space" article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
3.692AlaAla: 3.692 ± 2.604
1.641AlaCys: 1.641 ± 1.281
5.332AlaAsp: 5.332 ± 1.548
2.051AlaGlu: 2.051 ± 1.0
3.692AlaPhe: 3.692 ± 1.095
1.231AlaGly: 1.231 ± 1.12
0.41AlaHis: 0.41 ± 0.361
3.692AlaIle: 3.692 ± 0.846
2.871AlaLys: 2.871 ± 1.123
6.153AlaLeu: 6.153 ± 2.337
0.41AlaMet: 0.41 ± 0.361
2.051AlaAsn: 2.051 ± 0.726
3.692AlaPro: 3.692 ± 1.043
3.692AlaGln: 3.692 ± 1.633
3.692AlaArg: 3.692 ± 1.182
3.281AlaSer: 3.281 ± 1.222
3.692AlaThr: 3.692 ± 1.073
2.051AlaVal: 2.051 ± 0.916
0.0AlaTrp: 0.0 ± 0.0
1.231AlaTyr: 1.231 ± 0.647
0.0AlaXaa: 0.0 ± 0.0
Cys
1.231CysAla: 1.231 ± 0.532
0.82CysCys: 0.82 ± 0.525
1.641CysAsp: 1.641 ± 0.865
2.051CysGlu: 2.051 ± 0.803
1.231CysPhe: 1.231 ± 0.88
0.41CysGly: 0.41 ± 0.417
0.41CysHis: 0.41 ± 0.417
2.461CysIle: 2.461 ± 1.721
2.461CysLys: 2.461 ± 1.203
3.281CysLeu: 3.281 ± 2.917
0.0CysMet: 0.0 ± 0.0
0.82CysAsn: 0.82 ± 0.616
2.051CysPro: 2.051 ± 0.898
0.0CysGln: 0.0 ± 0.0
0.82CysArg: 0.82 ± 0.525
2.051CysSer: 2.051 ± 1.251
1.231CysThr: 1.231 ± 1.149
0.0CysVal: 0.0 ± 0.0
1.231CysTrp: 1.231 ± 0.404
0.82CysTyr: 0.82 ± 0.525
0.0CysXaa: 0.0 ± 0.0
Asp
4.512AspAla: 4.512 ± 0.99
2.051AspCys: 2.051 ± 0.69
3.692AspAsp: 3.692 ± 2.611
5.742AspGlu: 5.742 ± 1.211
2.051AspPhe: 2.051 ± 0.946
3.281AspGly: 3.281 ± 0.687
0.41AspHis: 0.41 ± 0.361
5.332AspIle: 5.332 ± 2.056
2.051AspLys: 2.051 ± 0.401
8.614AspLeu: 8.614 ± 0.949
0.82AspMet: 0.82 ± 0.578
2.461AspAsn: 2.461 ± 0.807
3.281AspPro: 3.281 ± 1.011
1.231AspGln: 1.231 ± 0.53
2.461AspArg: 2.461 ± 1.09
4.102AspSer: 4.102 ± 0.904
6.973AspThr: 6.973 ± 1.676
5.332AspVal: 5.332 ± 1.786
0.41AspTrp: 0.41 ± 0.338
0.82AspTyr: 0.82 ± 0.693
0.0AspXaa: 0.0 ± 0.0
Glu
3.281GluAla: 3.281 ± 0.981
2.051GluCys: 2.051 ± 1.367
6.153GluAsp: 6.153 ± 1.048
7.383GluGlu: 7.383 ± 4.32
1.641GluPhe: 1.641 ± 1.078
3.692GluGly: 3.692 ± 1.395
0.82GluHis: 0.82 ± 0.415
3.692GluIle: 3.692 ± 1.068
2.871GluLys: 2.871 ± 1.052
6.153GluLeu: 6.153 ± 1.369
0.82GluMet: 0.82 ± 0.415
4.512GluAsn: 4.512 ± 1.215
2.871GluPro: 2.871 ± 0.799
2.871GluGln: 2.871 ± 1.342
3.692GluArg: 3.692 ± 0.584
4.102GluSer: 4.102 ± 0.843
4.102GluThr: 4.102 ± 0.836
4.512GluVal: 4.512 ± 1.516
1.231GluTrp: 1.231 ± 0.404
1.231GluTyr: 1.231 ± 0.71
0.0GluXaa: 0.0 ± 0.0
Phe
1.641PheAla: 1.641 ± 0.669
2.871PheCys: 2.871 ± 1.399
2.871PheAsp: 2.871 ± 0.868
5.332PheGlu: 5.332 ± 1.985
2.871PhePhe: 2.871 ± 0.987
2.461PheGly: 2.461 ± 1.084
0.41PheHis: 0.41 ± 0.361
4.922PheIle: 4.922 ± 1.288
2.871PheLys: 2.871 ± 1.478
4.102PheLeu: 4.102 ± 1.974
1.641PheMet: 1.641 ± 0.565
2.461PheAsn: 2.461 ± 0.686
1.641PhePro: 1.641 ± 0.516
1.231PheGln: 1.231 ± 0.778
2.461PheArg: 2.461 ± 0.663
2.051PheSer: 2.051 ± 0.703
0.82PheThr: 0.82 ± 0.693
2.051PheVal: 2.051 ± 0.52
1.231PheTrp: 1.231 ± 0.685
1.231PheTyr: 1.231 ± 0.805
0.0PheXaa: 0.0 ± 0.0
Gly
3.281GlyAla: 3.281 ± 1.219
0.41GlyCys: 0.41 ± 0.361
2.871GlyAsp: 2.871 ± 1.235
2.461GlyGlu: 2.461 ± 0.838
1.231GlyPhe: 1.231 ± 0.356
3.281GlyGly: 3.281 ± 1.081
1.231GlyHis: 1.231 ± 0.783
4.922GlyIle: 4.922 ± 1.199
3.281GlyLys: 3.281 ± 1.043
4.512GlyLeu: 4.512 ± 2.108
0.41GlyMet: 0.41 ± 0.373
3.692GlyAsn: 3.692 ± 1.199
2.871GlyPro: 2.871 ± 1.667
2.461GlyGln: 2.461 ± 0.962
2.461GlyArg: 2.461 ± 1.018
4.922GlySer: 4.922 ± 1.473
4.922GlyThr: 4.922 ± 1.205
2.051GlyVal: 2.051 ± 0.91
0.0GlyTrp: 0.0 ± 0.0
1.231GlyTyr: 1.231 ± 0.76
0.0GlyXaa: 0.0 ± 0.0
His
1.641HisAla: 1.641 ± 0.516
0.82HisCys: 0.82 ± 0.675
0.41HisAsp: 0.41 ± 0.338
0.0HisGlu: 0.0 ± 0.0
1.231HisPhe: 1.231 ± 1.084
0.82HisGly: 0.82 ± 0.66
0.0HisHis: 0.0 ± 0.0
0.82HisIle: 0.82 ± 0.477
0.82HisLys: 0.82 ± 0.373
0.41HisLeu: 0.41 ± 0.417
0.82HisMet: 0.82 ± 0.618
0.82HisAsn: 0.82 ± 0.401
2.871HisPro: 2.871 ± 0.817
0.41HisGln: 0.41 ± 0.338
1.641HisArg: 1.641 ± 0.669
1.641HisSer: 1.641 ± 0.571
1.231HisThr: 1.231 ± 0.71
0.41HisVal: 0.41 ± 0.373
0.82HisTrp: 0.82 ± 0.513
0.41HisTyr: 0.41 ± 0.346
0.0HisXaa: 0.0 ± 0.0
Ile
2.871IleAla: 2.871 ± 1.187
2.871IleCys: 2.871 ± 1.208
6.563IleAsp: 6.563 ± 1.89
3.281IleGlu: 3.281 ± 0.768
1.231IlePhe: 1.231 ± 0.553
4.102IleGly: 4.102 ± 2.332
0.82IleHis: 0.82 ± 0.401
2.871IleIle: 2.871 ± 0.797
2.051IleLys: 2.051 ± 0.986
4.922IleLeu: 4.922 ± 1.386
0.82IleMet: 0.82 ± 0.625
2.051IleAsn: 2.051 ± 0.827
4.102IlePro: 4.102 ± 1.764
2.051IleGln: 2.051 ± 0.52
1.641IleArg: 1.641 ± 1.235
4.922IleSer: 4.922 ± 1.174
4.922IleThr: 4.922 ± 1.572
4.102IleVal: 4.102 ± 1.93
0.0IleTrp: 0.0 ± 0.0
2.461IleTyr: 2.461 ± 0.519
0.0IleXaa: 0.0 ± 0.0
Lys
0.41LysAla: 0.41 ± 0.338
2.461LysCys: 2.461 ± 1.351
4.922LysAsp: 4.922 ± 1.485
4.102LysGlu: 4.102 ± 1.402
2.871LysPhe: 2.871 ± 1.257
2.871LysGly: 2.871 ± 0.974
2.051LysHis: 2.051 ± 0.991
1.641LysIle: 1.641 ± 0.636
4.922LysLys: 4.922 ± 4.005
7.383LysLeu: 7.383 ± 3.728
1.641LysMet: 1.641 ± 1.154
2.051LysAsn: 2.051 ± 0.62
1.641LysPro: 1.641 ± 0.857
2.051LysGln: 2.051 ± 0.726
4.512LysArg: 4.512 ± 0.878
4.922LysSer: 4.922 ± 1.971
1.231LysThr: 1.231 ± 0.887
2.871LysVal: 2.871 ± 0.816
0.82LysTrp: 0.82 ± 0.477
2.461LysTyr: 2.461 ± 0.878
0.0LysXaa: 0.0 ± 0.0
Leu
4.922LeuAla: 4.922 ± 1.553
1.231LeuCys: 1.231 ± 0.726
6.563LeuAsp: 6.563 ± 0.556
4.512LeuGlu: 4.512 ± 0.912
5.332LeuPhe: 5.332 ± 1.587
4.102LeuGly: 4.102 ± 1.446
2.051LeuHis: 2.051 ± 0.66
5.742LeuIle: 5.742 ± 1.339
9.024LeuLys: 9.024 ± 3.514
9.844LeuLeu: 9.844 ± 3.924
1.641LeuMet: 1.641 ± 1.026
3.281LeuAsn: 3.281 ± 1.147
6.973LeuPro: 6.973 ± 2.539
4.922LeuGln: 4.922 ± 1.808
4.102LeuArg: 4.102 ± 0.73
10.254LeuSer: 10.254 ± 2.062
3.692LeuThr: 3.692 ± 1.626
3.281LeuVal: 3.281 ± 1.428
0.41LeuTrp: 0.41 ± 0.417
4.102LeuTyr: 4.102 ± 0.969
0.0LeuXaa: 0.0 ± 0.0
Met
1.641MetAla: 1.641 ± 0.551
0.0MetCys: 0.0 ± 0.0
0.82MetAsp: 0.82 ± 0.72
0.82MetGlu: 0.82 ± 0.633
1.641MetPhe: 1.641 ± 0.609
1.641MetGly: 1.641 ± 0.992
0.0MetHis: 0.0 ± 0.0
0.41MetIle: 0.41 ± 0.346
1.231MetLys: 1.231 ± 0.565
0.82MetLeu: 0.82 ± 0.628
0.41MetMet: 0.41 ± 0.581
1.641MetAsn: 1.641 ± 0.559
0.82MetPro: 0.82 ± 0.723
0.41MetGln: 0.41 ± 0.373
0.82MetArg: 0.82 ± 0.401
1.641MetSer: 1.641 ± 0.559
1.641MetThr: 1.641 ± 0.595
1.231MetVal: 1.231 ± 0.622
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.871AsnAla: 2.871 ± 0.539
0.82AsnCys: 0.82 ± 0.373
1.231AsnAsp: 1.231 ± 0.753
3.281AsnGlu: 3.281 ± 1.176
2.051AsnPhe: 2.051 ± 1.062
2.461AsnGly: 2.461 ± 0.658
0.82AsnHis: 0.82 ± 0.373
3.692AsnIle: 3.692 ± 1.096
3.692AsnLys: 3.692 ± 1.098
3.692AsnLeu: 3.692 ± 1.125
1.231AsnMet: 1.231 ± 0.658
2.871AsnAsn: 2.871 ± 0.827
2.461AsnPro: 2.461 ± 0.611
1.641AsnGln: 1.641 ± 0.669
4.512AsnArg: 4.512 ± 0.82
3.692AsnSer: 3.692 ± 1.139
1.641AsnThr: 1.641 ± 1.019
4.102AsnVal: 4.102 ± 1.433
1.231AsnTrp: 1.231 ± 0.641
1.231AsnTyr: 1.231 ± 0.787
0.0AsnXaa: 0.0 ± 0.0
Pro
4.102ProAla: 4.102 ± 1.62
1.231ProCys: 1.231 ± 0.53
3.281ProAsp: 3.281 ± 1.17
4.102ProGlu: 4.102 ± 0.731
2.461ProPhe: 2.461 ± 0.611
2.871ProGly: 2.871 ± 1.307
0.41ProHis: 0.41 ± 0.57
3.692ProIle: 3.692 ± 1.533
3.692ProLys: 3.692 ± 0.941
4.512ProLeu: 4.512 ± 1.441
1.641ProMet: 1.641 ± 0.585
2.461ProAsn: 2.461 ± 0.604
9.024ProPro: 9.024 ± 2.242
0.82ProGln: 0.82 ± 0.401
2.871ProArg: 2.871 ± 1.088
6.153ProSer: 6.153 ± 2.281
5.332ProThr: 5.332 ± 2.273
1.231ProVal: 1.231 ± 0.563
0.0ProTrp: 0.0 ± 0.0
1.641ProTyr: 1.641 ± 0.656
0.0ProXaa: 0.0 ± 0.0
Gln
0.82GlnAla: 0.82 ± 0.415
1.231GlnCys: 1.231 ± 0.778
1.231GlnAsp: 1.231 ± 0.473
2.871GlnGlu: 2.871 ± 0.929
2.461GlnPhe: 2.461 ± 0.966
2.461GlnGly: 2.461 ± 0.953
0.82GlnHis: 0.82 ± 0.401
1.231GlnIle: 1.231 ± 0.636
0.82GlnLys: 0.82 ± 0.723
5.332GlnLeu: 5.332 ± 1.028
1.641GlnMet: 1.641 ± 0.721
2.871GlnAsn: 2.871 ± 1.371
0.82GlnPro: 0.82 ± 0.401
2.461GlnGln: 2.461 ± 0.659
2.051GlnArg: 2.051 ± 1.156
0.0GlnSer: 0.0 ± 0.0
2.461GlnThr: 2.461 ± 0.947
2.461GlnVal: 2.461 ± 0.997
0.82GlnTrp: 0.82 ± 0.373
1.231GlnTyr: 1.231 ± 0.751
0.0GlnXaa: 0.0 ± 0.0
Arg
4.512ArgAla: 4.512 ± 1.959
1.231ArgCys: 1.231 ± 0.885
3.281ArgAsp: 3.281 ± 0.894
3.281ArgGlu: 3.281 ± 0.597
2.461ArgPhe: 2.461 ± 1.09
3.281ArgGly: 3.281 ± 0.861
1.641ArgHis: 1.641 ± 1.107
2.051ArgIle: 2.051 ± 0.72
4.512ArgLys: 4.512 ± 1.121
7.383ArgLeu: 7.383 ± 1.829
0.41ArgMet: 0.41 ± 0.361
2.871ArgAsn: 2.871 ± 1.284
3.281ArgPro: 3.281 ± 2.034
1.231ArgGln: 1.231 ± 0.88
8.203ArgArg: 8.203 ± 2.566
3.281ArgSer: 3.281 ± 0.79
2.461ArgThr: 2.461 ± 0.755
2.461ArgVal: 2.461 ± 0.952
0.0ArgTrp: 0.0 ± 0.0
1.231ArgTyr: 1.231 ± 0.582
0.0ArgXaa: 0.0 ± 0.0
Ser
4.922SerAla: 4.922 ± 1.021
1.231SerCys: 1.231 ± 0.641
3.281SerAsp: 3.281 ± 1.473
3.692SerGlu: 3.692 ± 0.401
4.102SerPhe: 4.102 ± 1.779
3.692SerGly: 3.692 ± 0.814
1.641SerHis: 1.641 ± 0.619
3.281SerIle: 3.281 ± 0.558
2.871SerLys: 2.871 ± 1.544
8.614SerLeu: 8.614 ± 1.758
1.231SerMet: 1.231 ± 0.572
4.102SerAsn: 4.102 ± 2.089
5.332SerPro: 5.332 ± 1.11
2.871SerGln: 2.871 ± 1.212
3.692SerArg: 3.692 ± 0.75
4.922SerSer: 4.922 ± 0.819
7.383SerThr: 7.383 ± 2.228
5.742SerVal: 5.742 ± 1.623
0.82SerTrp: 0.82 ± 0.525
0.82SerTyr: 0.82 ± 0.616
0.0SerXaa: 0.0 ± 0.0
Thr
2.871ThrAla: 2.871 ± 0.949
0.41ThrCys: 0.41 ± 0.361
4.922ThrAsp: 4.922 ± 0.975
6.563ThrGlu: 6.563 ± 0.962
2.871ThrPhe: 2.871 ± 0.54
3.692ThrGly: 3.692 ± 1.063
2.051ThrHis: 2.051 ± 0.52
3.692ThrIle: 3.692 ± 1.619
1.641ThrLys: 1.641 ± 0.74
2.461ThrLeu: 2.461 ± 1.501
0.82ThrMet: 0.82 ± 1.14
2.051ThrAsn: 2.051 ± 0.52
4.922ThrPro: 4.922 ± 1.301
1.231ThrGln: 1.231 ± 0.404
4.102ThrArg: 4.102 ± 1.375
6.973ThrSer: 6.973 ± 2.338
4.512ThrThr: 4.512 ± 1.653
4.512ThrVal: 4.512 ± 0.724
2.051ThrTrp: 2.051 ± 0.401
1.641ThrTyr: 1.641 ± 1.019
0.0ThrXaa: 0.0 ± 0.0
Val
2.871ValAla: 2.871 ± 0.724
0.41ValCys: 0.41 ± 0.581
4.102ValAsp: 4.102 ± 0.511
4.922ValGlu: 4.922 ± 0.595
1.641ValPhe: 1.641 ± 0.83
3.281ValGly: 3.281 ± 1.414
1.641ValHis: 1.641 ± 0.595
2.871ValIle: 2.871 ± 1.216
1.641ValLys: 1.641 ± 0.56
3.281ValLeu: 3.281 ± 1.12
0.0ValMet: 0.0 ± 0.0
4.102ValAsn: 4.102 ± 1.073
2.051ValPro: 2.051 ± 0.66
2.871ValGln: 2.871 ± 0.709
2.051ValArg: 2.051 ± 0.841
4.512ValSer: 4.512 ± 0.751
3.692ValThr: 3.692 ± 1.261
1.641ValVal: 1.641 ± 0.746
1.231ValTrp: 1.231 ± 0.585
3.692ValTyr: 3.692 ± 1.352
0.0ValXaa: 0.0 ± 0.0
Trp
0.82TrpAla: 0.82 ± 0.415
0.0TrpCys: 0.0 ± 0.0
0.41TrpAsp: 0.41 ± 0.361
0.0TrpGlu: 0.0 ± 0.0
0.41TrpPhe: 0.41 ± 0.346
0.41TrpGly: 0.41 ± 0.361
0.41TrpHis: 0.41 ± 0.346
0.0TrpIle: 0.0 ± 0.0
1.231TrpLys: 1.231 ± 0.778
2.051TrpLeu: 2.051 ± 0.991
0.82TrpMet: 0.82 ± 0.578
0.0TrpAsn: 0.0 ± 0.0
0.41TrpPro: 0.41 ± 0.361
0.41TrpGln: 0.41 ± 0.361
1.641TrpArg: 1.641 ± 0.814
0.0TrpSer: 0.0 ± 0.0
1.641TrpThr: 1.641 ± 0.636
1.641TrpVal: 1.641 ± 0.928
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.641TyrAla: 1.641 ± 0.669
1.231TyrCys: 1.231 ± 1.205
2.051TyrAsp: 2.051 ± 0.774
1.231TyrGlu: 1.231 ± 0.533
3.692TyrPhe: 3.692 ± 0.974
2.461TyrGly: 2.461 ± 0.914
0.0TyrHis: 0.0 ± 0.0
1.641TyrIle: 1.641 ± 1.034
3.281TyrLys: 3.281 ± 1.104
2.461TyrLeu: 2.461 ± 1.083
0.0TyrMet: 0.0 ± 0.0
2.051TyrAsn: 2.051 ± 0.752
0.41TyrPro: 0.41 ± 0.361
1.231TyrGln: 1.231 ± 0.751
1.641TyrArg: 1.641 ± 0.817
1.231TyrSer: 1.231 ± 0.647
0.41TyrThr: 0.41 ± 0.346
0.82TyrVal: 0.82 ± 0.421
0.0TyrTrp: 0.0 ± 0.0
2.871TyrTyr: 2.871 ± 0.824
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2439 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
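A protein-level bootstrap of this kind might look like the Python sketch below. It is an assumption-laden illustration rather than the database's procedure: the helper names, the fixed random seed, the one-letter toy sequences, and the use of the standard deviation of the resampled estimates as the reported error are all choices made here.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins):
    """Per-mille frequency of each dipeptide, counted within proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of one dipeptide's frequency over resampled proteomes.

    Each round draws len(proteins) whole proteins with replacement,
    so the resampling unit is the protein, not the residue.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample).get(pair, 0.0))
    return statistics.stdev(estimates)


# Made-up toy sequences, not HPV 126 proteins.
toy = ["MKLLA", "MKA", "LLLK"]
print(bootstrap_error(toy, "LL"))
```

With only a handful of proteins, as in a small viral proteome, each resample can differ substantially from the original set, which is consistent with errors that are large relative to the values themselves.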

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski