Amino acid dipeptide frequency for Human papillomavirus 42

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, the frequency of each ordered pair of adjacent amino acids.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
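The expected-frequency argument above can be illustrated with a minimal Python sketch. It is not the Proteome-pI implementation: the function names (`dipeptide_permille`, `expected_permille`, `classify`), the use of one-letter codes with `X` standing in for Xaa, and the choice to count pairs only within each protein are all assumptions made for illustration.

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code; everything else is pooled as "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs."""
    counts = Counter()
    for seq in sequences:
        # Map non-standard or unknown residues to X.
        clean = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Assumption: overlapping pairs are counted within a protein only,
        # never across the boundary between two proteins.
        counts.update(clean[i:i + 2] for i in range(len(clean) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    alphabet = STANDARD + "X"
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(alphabet, repeat=2)}


def expected_permille(n_residues=20):
    """Uniform expectation: 1 / n_residues**2, expressed in per mille."""
    return 1000.0 / n_residues ** 2


def classify(value_permille, n_residues=20):
    """Label a frequency relative to the uniform expectation."""
    expected = expected_permille(n_residues)
    if value_permille > expected:
        return "over"   # would be marked red in the table
    if value_permille < expected:
        return "under"  # would be marked blue in the table
    return "as expected"


if __name__ == "__main__":
    toy = ["AAAC", "ACB"]  # toy sequences, not HPV 42 proteins
    freqs = dipeptide_permille(toy)
    print(expected_permille(), classify(freqs["AA"]), classify(freqs["WW"]))
```

With 20 residues `expected_permille()` gives 2.5‰, which is the 0.25% quoted above; passing `n_residues=21` gives the corresponding value when Xaa is included.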

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
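As a small worked example of this unit conversion, the sketch below turns a per mille value into a fraction and into a percentage. The helper names are hypothetical; the value 3.733 is the AlaAla entry from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (1 per mille = 0.1 %)."""
    return value_permille * 0.1


if __name__ == "__main__":
    ala_ala = 3.733  # AlaAla frequency in per mille, taken from the table
    print(round(permille_to_fraction(ala_ala), 6))  # fraction of all dipeptides
    print(round(permille_to_percent(ala_ala), 4))   # same value in percent
```

On this scale the uniform expectation of 0.25% corresponds to 2.5‰, so AlaAla at 3.733‰ lies above it.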

For more information, see the sequence space article on Wikipedia.

Rows: first residue; columns: second residue (Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa)
Ala
AlaAla: 3.733 ± 1.202
AlaCys: 1.866 ± 0.871
AlaAsp: 4.106 ± 1.275
AlaGlu: 3.359 ± 0.776
AlaPhe: 2.613 ± 1.328
AlaGly: 2.24 ± 0.614
AlaHis: 1.12 ± 0.604
AlaIle: 3.359 ± 0.758
AlaLys: 4.106 ± 1.25
AlaLeu: 3.733 ± 0.841
AlaMet: 0.747 ± 0.388
AlaAsn: 1.493 ± 0.634
AlaPro: 2.24 ± 0.982
AlaGln: 2.986 ± 0.943
AlaArg: 1.866 ± 0.717
AlaSer: 5.972 ± 1.536
AlaThr: 3.733 ± 1.081
AlaVal: 2.24 ± 0.906
AlaTrp: 0.747 ± 0.385
AlaTyr: 2.613 ± 0.425
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.24 ± 1.742
CysCys: 0.0 ± 0.0
CysAsp: 0.373 ± 0.427
CysGlu: 0.373 ± 0.473
CysPhe: 0.747 ± 0.385
CysGly: 3.359 ± 1.476
CysHis: 0.373 ± 0.358
CysIle: 1.12 ± 0.701
CysLys: 2.613 ± 0.902
CysLeu: 3.733 ± 1.53
CysMet: 0.0 ± 0.0
CysAsn: 1.12 ± 0.799
CysPro: 1.866 ± 0.709
CysGln: 1.866 ± 0.944
CysArg: 1.493 ± 0.661
CysSer: 1.866 ± 0.754
CysThr: 4.853 ± 1.629
CysVal: 2.986 ± 1.311
CysTrp: 0.747 ± 0.349
CysTyr: 0.747 ± 0.946
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.613 ± 0.552
AspCys: 2.613 ± 1.015
AspAsp: 2.613 ± 1.228
AspGlu: 2.986 ± 0.782
AspPhe: 2.613 ± 0.719
AspGly: 2.986 ± 0.903
AspHis: 0.373 ± 0.354
AspIle: 5.226 ± 2.55
AspLys: 1.866 ± 1.126
AspLeu: 3.359 ± 0.729
AspMet: 0.373 ± 0.326
AspAsn: 3.359 ± 0.646
AspPro: 3.359 ± 1.168
AspGln: 1.493 ± 0.537
AspArg: 0.747 ± 0.565
AspSer: 5.599 ± 2.191
AspThr: 5.972 ± 1.193
AspVal: 1.866 ± 1.045
AspTrp: 0.747 ± 0.565
AspTyr: 1.493 ± 0.648
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.359 ± 1.034
GluCys: 0.0 ± 0.0
GluAsp: 2.613 ± 1.038
GluGlu: 3.733 ± 0.601
GluPhe: 1.493 ± 1.052
GluGly: 1.866 ± 1.008
GluHis: 1.866 ± 0.436
GluIle: 1.493 ± 0.76
GluLys: 1.493 ± 1.109
GluLeu: 4.106 ± 1.104
GluMet: 1.12 ± 0.483
GluAsn: 4.853 ± 1.43
GluPro: 1.493 ± 0.328
GluGln: 1.866 ± 0.947
GluArg: 1.12 ± 1.03
GluSer: 1.12 ± 0.614
GluThr: 3.733 ± 1.203
GluVal: 4.106 ± 0.972
GluTrp: 0.747 ± 0.349
GluTyr: 1.493 ± 0.919
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.493 ± 0.661
PheCys: 0.747 ± 0.526
PheAsp: 1.493 ± 0.764
PheGlu: 0.747 ± 0.414
PhePhe: 2.613 ± 0.949
PheGly: 3.359 ± 0.988
PheHis: 0.373 ± 0.354
PheIle: 2.986 ± 0.654
PheLys: 2.986 ± 1.015
PheLeu: 4.106 ± 0.811
PheMet: 0.747 ± 0.414
PheAsn: 2.24 ± 0.632
PhePro: 2.24 ± 0.727
PheGln: 1.866 ± 0.82
PheArg: 1.866 ± 0.528
PheSer: 0.747 ± 0.394
PheThr: 1.866 ± 0.682
PheVal: 1.866 ± 0.714
PheTrp: 1.866 ± 0.613
PheTyr: 0.747 ± 0.457
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.733 ± 0.955
GlyCys: 0.747 ± 0.349
GlyAsp: 3.733 ± 1.265
GlyGlu: 1.493 ± 0.735
GlyPhe: 1.866 ± 0.548
GlyGly: 2.986 ± 1.268
GlyHis: 2.24 ± 1.042
GlyIle: 3.733 ± 0.925
GlyLys: 2.613 ± 0.902
GlyLeu: 6.346 ± 0.849
GlyMet: 0.747 ± 0.565
GlyAsn: 2.24 ± 0.657
GlyPro: 2.613 ± 1.643
GlyGln: 2.613 ± 0.606
GlyArg: 2.613 ± 0.948
GlySer: 5.226 ± 0.723
GlyThr: 7.839 ± 1.255
GlyVal: 3.733 ± 0.792
GlyTrp: 0.747 ± 0.414
GlyTyr: 1.866 ± 0.52
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.986 ± 0.856
HisCys: 1.866 ± 0.836
HisAsp: 0.747 ± 0.425
HisGlu: 0.373 ± 0.518
HisPhe: 1.12 ± 0.575
HisGly: 1.866 ± 1.008
HisHis: 1.493 ± 0.895
HisIle: 1.12 ± 0.557
HisLys: 0.747 ± 0.349
HisLeu: 4.479 ± 2.411
HisMet: 0.373 ± 0.358
HisAsn: 1.493 ± 0.919
HisPro: 2.613 ± 1.175
HisGln: 0.373 ± 0.358
HisArg: 1.12 ± 0.587
HisSer: 3.733 ± 1.077
HisThr: 1.866 ± 0.584
HisVal: 0.747 ± 0.425
HisTrp: 0.747 ± 0.457
HisTyr: 1.866 ± 0.819
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.24 ± 1.228
IleCys: 0.747 ± 0.555
IleAsp: 3.359 ± 0.827
IleGlu: 1.493 ± 0.647
IlePhe: 0.747 ± 0.457
IleGly: 3.733 ± 0.59
IleHis: 3.733 ± 1.395
IleIle: 1.12 ± 0.712
IleLys: 1.493 ± 0.571
IleLeu: 2.986 ± 1.092
IleMet: 0.747 ± 0.452
IleAsn: 0.747 ± 0.414
IlePro: 6.719 ± 1.868
IleGln: 2.986 ± 0.856
IleArg: 2.613 ± 0.829
IleSer: 3.359 ± 0.671
IleThr: 4.479 ± 1.395
IleVal: 4.479 ± 1.12
IleTrp: 0.373 ± 0.501
IleTyr: 1.866 ± 0.436
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.733 ± 0.793
LysCys: 2.986 ± 1.178
LysAsp: 1.866 ± 1.076
LysGlu: 2.24 ± 1.059
LysPhe: 2.24 ± 1.048
LysGly: 0.747 ± 0.349
LysHis: 1.493 ± 0.661
LysIle: 2.613 ± 0.425
LysLys: 3.359 ± 1.054
LysLeu: 2.986 ± 0.805
LysMet: 0.747 ± 0.609
LysAsn: 1.866 ± 0.82
LysPro: 2.24 ± 0.712
LysGln: 2.986 ± 0.856
LysArg: 4.853 ± 1.533
LysSer: 3.733 ± 1.412
LysThr: 2.24 ± 0.788
LysVal: 2.24 ± 1.214
LysTrp: 0.373 ± 0.283
LysTyr: 1.493 ± 0.531
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 2.986 ± 1.055
LeuCys: 4.479 ± 1.904
LeuAsp: 5.972 ± 1.789
LeuGlu: 4.853 ± 1.21
LeuPhe: 4.853 ± 0.892
LeuGly: 5.226 ± 1.432
LeuHis: 2.24 ± 0.682
LeuIle: 3.359 ± 1.681
LeuLys: 3.733 ± 0.948
LeuLeu: 10.452 ± 3.198
LeuMet: 0.373 ± 0.593
LeuAsn: 2.24 ± 0.697
LeuPro: 2.24 ± 1.013
LeuGln: 8.585 ± 2.195
LeuArg: 3.733 ± 1.338
LeuSer: 5.226 ± 1.703
LeuThr: 7.092 ± 1.639
LeuVal: 4.479 ± 0.551
LeuTrp: 0.747 ± 0.385
LeuTyr: 4.479 ± 0.828
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.12 ± 0.604
MetCys: 0.373 ± 0.283
MetAsp: 1.12 ± 0.674
MetGlu: 0.747 ± 0.715
MetPhe: 0.747 ± 0.652
MetGly: 1.12 ± 0.573
MetHis: 0.747 ± 0.425
MetIle: 0.0 ± 0.0
MetLys: 0.747 ± 0.513
MetLeu: 2.24 ± 0.571
MetMet: 0.0 ± 0.0
MetAsn: 0.373 ± 0.326
MetPro: 0.747 ± 0.708
MetGln: 0.373 ± 0.354
MetArg: 0.747 ± 0.517
MetSer: 3.733 ± 0.68
MetThr: 0.373 ± 0.326
MetVal: 2.24 ± 0.848
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.24 ± 0.812
AsnCys: 2.613 ± 1.436
AsnAsp: 2.613 ± 0.81
AsnGlu: 1.12 ± 0.573
AsnPhe: 1.493 ± 0.328
AsnGly: 1.866 ± 0.626
AsnHis: 2.24 ± 1.014
AsnIle: 2.24 ± 0.59
AsnLys: 3.359 ± 1.548
AsnLeu: 3.733 ± 0.93
AsnMet: 1.12 ± 0.709
AsnAsn: 0.747 ± 0.652
AsnPro: 4.479 ± 0.852
AsnGln: 1.12 ± 0.407
AsnArg: 1.493 ± 0.699
AsnSer: 2.986 ± 0.798
AsnThr: 3.359 ± 1.32
AsnVal: 2.24 ± 0.534
AsnTrp: 0.373 ± 0.283
AsnTyr: 0.747 ± 0.495
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.479 ± 2.325
ProCys: 0.747 ± 0.565
ProAsp: 2.986 ± 1.714
ProGlu: 1.493 ± 0.919
ProPhe: 1.12 ± 0.716
ProGly: 2.24 ± 0.534
ProHis: 1.12 ± 1.004
ProIle: 2.986 ± 0.991
ProLys: 1.866 ± 0.51
ProLeu: 6.719 ± 1.188
ProMet: 1.493 ± 1.108
ProAsn: 2.613 ± 0.906
ProPro: 7.092 ± 1.665
ProGln: 1.866 ± 1.019
ProArg: 2.613 ± 0.715
ProSer: 8.212 ± 2.896
ProThr: 4.479 ± 1.224
ProVal: 3.733 ± 0.732
ProTrp: 1.12 ± 0.55
ProTyr: 2.613 ± 0.846
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.733 ± 1.056
GlnCys: 1.866 ± 1.15
GlnAsp: 1.12 ± 0.607
GlnGlu: 1.866 ± 0.935
GlnPhe: 3.359 ± 0.816
GlnGly: 3.359 ± 1.22
GlnHis: 2.24 ± 1.046
GlnIle: 1.493 ± 0.987
GlnLys: 1.12 ± 0.753
GlnLeu: 3.359 ± 1.106
GlnMet: 1.493 ± 0.605
GlnAsn: 0.373 ± 0.427
GlnPro: 1.866 ± 0.689
GlnGln: 3.359 ± 0.583
GlnArg: 4.853 ± 0.993
GlnSer: 3.359 ± 0.97
GlnThr: 4.853 ± 1.861
GlnVal: 2.986 ± 1.255
GlnTrp: 1.866 ± 0.62
GlnTyr: 0.747 ± 0.495
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.24 ± 0.614
ArgCys: 1.493 ± 0.958
ArgAsp: 1.866 ± 0.763
ArgGlu: 1.12 ± 0.709
ArgPhe: 1.866 ± 0.575
ArgGly: 1.866 ± 0.652
ArgHis: 2.613 ± 1.331
ArgIle: 0.747 ± 0.566
ArgLys: 3.733 ± 0.632
ArgLeu: 6.719 ± 1.077
ArgMet: 1.12 ± 0.407
ArgAsn: 3.359 ± 1.489
ArgPro: 2.986 ± 1.33
ArgGln: 1.493 ± 0.571
ArgArg: 4.479 ± 1.758
ArgSer: 2.24 ± 0.727
ArgThr: 3.359 ± 0.603
ArgVal: 3.359 ± 1.272
ArgTrp: 0.0 ± 0.0
ArgTyr: 2.24 ± 0.795
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.599 ± 1.697
SerCys: 2.24 ± 0.781
SerAsp: 4.106 ± 1.112
SerGlu: 3.733 ± 0.618
SerPhe: 2.24 ± 0.882
SerGly: 5.972 ± 1.405
SerHis: 1.866 ± 0.52
SerIle: 6.346 ± 1.121
SerLys: 2.613 ± 0.728
SerLeu: 2.986 ± 1.42
SerMet: 2.986 ± 0.806
SerAsn: 3.359 ± 0.832
SerPro: 4.853 ± 1.293
SerGln: 3.359 ± 1.262
SerArg: 2.986 ± 1.105
SerSer: 7.092 ± 1.532
SerThr: 10.825 ± 2.426
SerVal: 5.599 ± 1.442
SerTrp: 0.747 ± 0.712
SerTyr: 2.24 ± 0.78
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.359 ± 1.285
ThrCys: 2.986 ± 1.263
ThrAsp: 4.479 ± 1.07
ThrGlu: 3.733 ± 0.905
ThrPhe: 1.12 ± 0.716
ThrGly: 6.346 ± 2.08
ThrHis: 2.24 ± 0.752
ThrIle: 2.613 ± 1.277
ThrLys: 2.986 ± 1.219
ThrLeu: 7.839 ± 1.332
ThrMet: 0.373 ± 0.283
ThrAsn: 5.599 ± 1.039
ThrPro: 6.719 ± 1.682
ThrGln: 4.106 ± 1.465
ThrArg: 2.986 ± 0.838
ThrSer: 8.959 ± 2.644
ThrThr: 11.571 ± 3.132
ThrVal: 7.839 ± 1.893
ThrTrp: 1.493 ± 0.728
ThrTyr: 2.24 ± 0.539
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.493 ± 0.634
ValCys: 2.24 ± 1.32
ValAsp: 4.479 ± 0.813
ValGlu: 4.479 ± 1.177
ValPhe: 1.12 ± 0.674
ValGly: 4.853 ± 2.238
ValHis: 1.866 ± 0.741
ValIle: 4.479 ± 1.203
ValLys: 2.613 ± 0.995
ValLeu: 2.986 ± 1.511
ValMet: 1.866 ± 0.706
ValAsn: 2.24 ± 0.74
ValPro: 3.359 ± 0.857
ValGln: 4.853 ± 1.212
ValArg: 3.733 ± 0.648
ValSer: 6.346 ± 1.377
ValThr: 3.359 ± 1.049
ValVal: 5.226 ± 1.104
ValTrp: 1.866 ± 1.089
ValTyr: 0.747 ± 0.349
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.12 ± 0.456
TrpCys: 0.747 ± 0.586
TrpAsp: 0.747 ± 0.708
TrpGlu: 0.747 ± 0.457
TrpPhe: 1.12 ± 0.546
TrpGly: 1.493 ± 0.561
TrpHis: 0.747 ± 0.513
TrpIle: 1.12 ± 0.628
TrpLys: 1.493 ± 0.818
TrpLeu: 1.866 ± 0.574
TrpMet: 0.0 ± 0.0
TrpAsn: 0.373 ± 0.326
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 1.12 ± 0.565
TrpSer: 0.373 ± 0.283
TrpThr: 1.866 ± 0.911
TrpVal: 0.373 ± 0.283
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.747 ± 0.765
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.747 ± 0.526
TyrCys: 1.12 ± 0.767
TyrAsp: 1.866 ± 0.364
TyrGlu: 2.986 ± 0.914
TyrPhe: 1.866 ± 0.992
TyrGly: 2.24 ± 0.813
TyrHis: 0.747 ± 0.555
TyrIle: 1.866 ± 1.006
TyrLys: 1.493 ± 0.537
TyrLeu: 2.613 ± 0.996
TyrMet: 0.747 ± 0.513
TyrAsn: 1.493 ± 0.73
TyrPro: 1.866 ± 1.12
TyrGln: 1.12 ± 0.575
TyrArg: 1.866 ± 0.879
TyrSer: 1.866 ± 0.626
TyrThr: 1.866 ± 0.677
TyrVal: 1.866 ± 0.71
TyrTrp: 0.747 ± 0.349
TyrTyr: 1.866 ± 0.981
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (2680 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
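A protein-level bootstrap of this kind could look roughly like the Python sketch below. It is only an illustration under stated assumptions: whole proteins are resampled with replacement, the error is taken as the sample standard deviation across resamples, and the names `pair_counts` and `bootstrap_error`, the `seed` parameter, and the one-letter input format are hypothetical rather than taken from Proteome-pI.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Spread (in per mille) of one dipeptide's frequency over resampled proteomes."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the reported error is the standard deviation of the estimates.
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy_proteome = ["MAAL", "MLLT", "MSTT", "MAST"]  # toy data, not HPV 42
    print(bootstrap_error(toy_proteome, "LL"))
```

Resampling proteins rather than individual residues keeps each protein's composition intact. A dipeptide that never occurs has a frequency of zero in every resample, which is consistent with the `0.0 ± 0.0` entries in the table.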

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski