Amino acid dipeptide frequency for Human papillomavirus 24

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins with equal probability, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
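The counting described above can be sketched in Python as follows. This is a minimal illustration, not the database's actual pipeline: the function names, the choice to count pairs only within each protein, and the use of the uniform expectation (2.5‰) as the over/under threshold are all assumptions of this sketch.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"          # X stands for Xaa (non-standard or unknown)

# Expected frequency if all 400 standard dipeptides were equally likely.
EXPECTED_UNIFORM = 1000.0 / 400    # 2.5 per mille, i.e. 0.25 %


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Assumption: overlapping pairs are counted within each protein only,
    never across protein boundaries.
    """
    counts = Counter()
    for seq in sequences:
        # Map anything outside the 20 standard residues to X (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_permille, expected=EXPECTED_UNIFORM):
    """Label a frequency relative to the (assumed) uniform expectation."""
    if freq_permille > expected:
        return "over"    # would be shown in red
    if freq_permille < expected:
        return "under"   # would be shown in blue
    return "expected"
```

For example, `classify(11.687)` labels the LeuLeu value from the table as overrepresented, while `classify(0.39)` labels a value such as CysGlu as underrepresented.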

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
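As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction, a percentage, and a fold change relative to the uniform expectation of 2.5‰. The helper names are hypothetical and chosen only for illustration.

```python
UNIFORM_PERMILLE = 1000.0 / 400  # 2.5 per mille for 400 equally likely dipeptides


def permille_to_fraction(value):
    """Convert per mille to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert per mille to percent (1 per mille = 0.1 %)."""
    return value * 0.1


def fold_vs_uniform(value):
    """How many times more (or less) frequent than the uniform expectation."""
    return value / UNIFORM_PERMILLE
```

With the LeuLeu entry (11.687‰), `permille_to_fraction` gives roughly 0.0117 and `fold_vs_uniform` gives roughly 4.7, i.e. LeuLeu occurs several times more often than a uniform distribution would predict.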

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.116 ± 1.081
AlaCys: 0.779 ± 0.51
AlaAsp: 4.285 ± 0.47
AlaGlu: 2.727 ± 0.965
AlaPhe: 4.675 ± 1.561
AlaGly: 3.896 ± 1.712
AlaHis: 1.169 ± 0.548
AlaIle: 1.169 ± 0.331
AlaLys: 2.727 ± 0.707
AlaLeu: 3.896 ± 1.017
AlaMet: 1.169 ± 0.567
AlaAsn: 3.506 ± 1.436
AlaPro: 5.454 ± 1.349
AlaGln: 2.337 ± 0.897
AlaArg: 2.337 ± 0.705
AlaSer: 2.727 ± 0.715
AlaThr: 6.623 ± 1.118
AlaVal: 1.558 ± 0.417
AlaTrp: 0.779 ± 0.58
AlaTyr: 1.169 ± 0.533
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.169 ± 0.573
CysCys: 1.948 ± 1.16
CysAsp: 0.779 ± 0.35
CysGlu: 0.39 ± 0.49
CysPhe: 1.169 ± 0.352
CysGly: 1.169 ± 1.131
CysHis: 0.39 ± 0.448
CysIle: 0.39 ± 0.293
CysLys: 2.337 ± 0.94
CysLeu: 2.337 ± 1.3
CysMet: 0.0 ± 0.0
CysAsn: 0.779 ± 0.496
CysPro: 1.558 ± 0.69
CysGln: 0.0 ± 0.0
CysArg: 2.337 ± 1.202
CysSer: 1.558 ± 0.7
CysThr: 0.0 ± 0.0
CysVal: 1.169 ± 0.899
CysTrp: 0.779 ± 0.346
CysTyr: 0.39 ± 0.293
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.896 ± 0.93
AspCys: 1.558 ± 0.826
AspAsp: 5.454 ± 1.312
AspGlu: 2.337 ± 0.425
AspPhe: 3.116 ± 1.192
AspGly: 3.116 ± 1.029
AspHis: 0.39 ± 0.29
AspIle: 3.896 ± 1.285
AspLys: 1.169 ± 0.411
AspLeu: 8.181 ± 1.818
AspMet: 1.169 ± 0.573
AspAsn: 3.506 ± 0.732
AspPro: 4.285 ± 0.973
AspGln: 1.948 ± 0.597
AspArg: 0.779 ± 0.587
AspSer: 3.116 ± 0.701
AspThr: 5.843 ± 0.924
AspVal: 3.896 ± 0.639
AspTrp: 0.779 ± 0.35
AspTyr: 0.779 ± 0.506
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.896 ± 1.075
GluCys: 0.39 ± 0.29
GluAsp: 3.116 ± 0.997
GluGlu: 6.623 ± 1.653
GluPhe: 1.558 ± 0.629
GluGly: 4.285 ± 2.434
GluHis: 1.558 ± 0.454
GluIle: 3.116 ± 1.319
GluLys: 1.558 ± 0.476
GluLeu: 7.012 ± 1.798
GluMet: 0.779 ± 0.58
GluAsn: 4.285 ± 0.92
GluPro: 3.116 ± 1.113
GluGln: 5.454 ± 1.129
GluArg: 2.337 ± 1.245
GluSer: 6.233 ± 1.39
GluThr: 4.285 ± 0.698
GluVal: 4.675 ± 1.516
GluTrp: 0.779 ± 0.346
GluTyr: 1.558 ± 1.174
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.727 ± 0.571
PheCys: 0.39 ± 0.448
PheAsp: 3.116 ± 0.492
PheGlu: 3.896 ± 1.724
PhePhe: 1.948 ± 0.779
PheGly: 1.169 ± 0.482
PheHis: 0.779 ± 0.549
PheIle: 1.948 ± 0.59
PheLys: 3.116 ± 1.4
PheLeu: 3.506 ± 0.569
PheMet: 0.0 ± 0.0
PheAsn: 1.169 ± 0.88
PhePro: 1.948 ± 0.757
PheGln: 1.169 ± 0.567
PheArg: 3.116 ± 0.6
PheSer: 1.558 ± 0.654
PheThr: 1.169 ± 0.559
PheVal: 3.506 ± 1.074
PheTrp: 1.558 ± 0.693
PheTyr: 1.558 ± 0.797
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.337 ± 0.936
GlyCys: 1.558 ± 0.789
GlyAsp: 4.675 ± 0.95
GlyGlu: 5.454 ± 1.44
GlyPhe: 1.169 ± 0.411
GlyGly: 4.285 ± 1.647
GlyHis: 2.727 ± 0.817
GlyIle: 2.337 ± 0.622
GlyLys: 3.506 ± 1.144
GlyLeu: 3.116 ± 0.973
GlyMet: 0.0 ± 0.0
GlyAsn: 2.727 ± 0.622
GlyPro: 2.337 ± 1.13
GlyGln: 3.506 ± 0.524
GlyArg: 7.791 ± 2.307
GlySer: 5.843 ± 1.204
GlyThr: 6.233 ± 1.76
GlyVal: 5.064 ± 0.754
GlyTrp: 0.779 ± 0.895
GlyTyr: 1.948 ± 0.963
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.39 ± 0.293
HisCys: 1.169 ± 0.712
HisAsp: 0.0 ± 0.0
HisGlu: 0.39 ± 0.29
HisPhe: 1.948 ± 0.354
HisGly: 2.727 ± 1.145
HisHis: 0.779 ± 0.606
HisIle: 0.39 ± 0.347
HisLys: 1.169 ± 0.678
HisLeu: 0.779 ± 0.58
HisMet: 0.0 ± 0.0
HisAsn: 1.948 ± 0.728
HisPro: 2.337 ± 0.881
HisGln: 0.39 ± 0.29
HisArg: 0.39 ± 0.293
HisSer: 1.169 ± 0.641
HisThr: 0.779 ± 0.406
HisVal: 0.39 ± 0.293
HisTrp: 0.779 ± 0.389
HisTyr: 1.169 ± 0.601
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.337 ± 1.282
IleCys: 0.779 ± 0.517
IleAsp: 1.558 ± 0.66
IleGlu: 5.064 ± 1.751
IlePhe: 0.39 ± 0.293
IleGly: 4.675 ± 1.301
IleHis: 0.779 ± 0.356
IleIle: 2.727 ± 0.718
IleLys: 0.779 ± 0.346
IleLeu: 2.727 ± 0.891
IleMet: 0.779 ± 0.542
IleAsn: 3.116 ± 0.503
IlePro: 2.337 ± 0.897
IleGln: 1.948 ± 0.847
IleArg: 2.337 ± 0.911
IleSer: 3.896 ± 1.055
IleThr: 1.169 ± 0.339
IleVal: 2.727 ± 0.928
IleTrp: 0.779 ± 0.496
IleTyr: 3.116 ± 0.885
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.896 ± 1.473
LysCys: 0.779 ± 0.51
LysAsp: 1.558 ± 0.635
LysGlu: 2.727 ± 0.815
LysPhe: 1.558 ± 0.693
LysGly: 3.506 ± 0.587
LysHis: 0.779 ± 0.35
LysIle: 1.558 ± 0.449
LysLys: 2.337 ± 0.866
LysLeu: 5.454 ± 1.045
LysMet: 0.39 ± 0.293
LysAsn: 2.337 ± 0.804
LysPro: 1.948 ± 0.912
LysGln: 3.116 ± 0.71
LysArg: 5.064 ± 0.403
LysSer: 1.558 ± 0.819
LysThr: 1.948 ± 0.623
LysVal: 3.506 ± 1.632
LysTrp: 0.779 ± 0.35
LysTyr: 1.948 ± 0.494
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.506 ± 0.542
LeuCys: 2.337 ± 1.194
LeuAsp: 6.623 ± 1.176
LeuGlu: 6.623 ± 1.425
LeuPhe: 4.675 ± 1.062
LeuGly: 5.843 ± 1.588
LeuHis: 2.337 ± 1.07
LeuIle: 3.896 ± 1.141
LeuLys: 4.285 ± 0.593
LeuLeu: 11.687 ± 1.74
LeuMet: 1.948 ± 0.563
LeuAsn: 1.558 ± 0.983
LeuPro: 3.896 ± 1.009
LeuGln: 8.181 ± 1.518
LeuArg: 3.116 ± 1.386
LeuSer: 7.012 ± 2.062
LeuThr: 5.064 ± 1.312
LeuVal: 5.843 ± 0.634
LeuTrp: 0.39 ± 0.29
LeuTyr: 1.558 ± 0.7
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.558 ± 0.653
MetCys: 0.39 ± 0.403
MetAsp: 0.779 ± 0.35
MetGlu: 0.779 ± 0.406
MetPhe: 0.779 ± 0.346
MetGly: 0.0 ± 0.0
MetHis: 0.0 ± 0.0
MetIle: 0.39 ± 0.49
MetLys: 0.39 ± 0.29
MetLeu: 0.39 ± 0.29
MetMet: 0.39 ± 0.403
MetAsn: 0.779 ± 0.389
MetPro: 0.39 ± 0.403
MetGln: 0.39 ± 0.293
MetArg: 0.779 ± 0.58
MetSer: 3.116 ± 1.663
MetThr: 0.0 ± 0.0
MetVal: 1.558 ± 0.471
MetTrp: 0.39 ± 0.316
MetTyr: 0.39 ± 0.293
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.116 ± 1.583
AsnCys: 0.39 ± 0.29
AsnAsp: 3.896 ± 1.475
AsnGlu: 1.948 ± 0.744
AsnPhe: 1.558 ± 0.672
AsnGly: 2.337 ± 0.823
AsnHis: 0.779 ± 0.633
AsnIle: 3.896 ± 1.027
AsnLys: 2.727 ± 0.408
AsnLeu: 2.337 ± 0.681
AsnMet: 0.39 ± 0.29
AsnAsn: 1.558 ± 0.842
AsnPro: 3.116 ± 1.24
AsnGln: 1.558 ± 0.672
AsnArg: 3.896 ± 1.185
AsnSer: 3.896 ± 0.656
AsnThr: 3.116 ± 0.755
AsnVal: 1.558 ± 0.878
AsnTrp: 0.0 ± 0.0
AsnTyr: 1.558 ± 0.804
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.675 ± 1.787
ProCys: 1.558 ± 0.672
ProAsp: 5.064 ± 1.142
ProGlu: 5.064 ± 1.358
ProPhe: 0.779 ± 0.346
ProGly: 4.285 ± 1.542
ProHis: 0.779 ± 0.806
ProIle: 2.337 ± 0.425
ProLys: 3.506 ± 1.128
ProLeu: 6.623 ± 1.745
ProMet: 0.779 ± 0.58
ProAsn: 2.337 ± 0.771
ProPro: 9.739 ± 4.704
ProGln: 3.116 ± 1.791
ProArg: 3.116 ± 1.346
ProSer: 2.727 ± 1.287
ProThr: 4.285 ± 1.872
ProVal: 3.896 ± 1.255
ProTrp: 0.0 ± 0.0
ProTyr: 1.558 ± 0.878
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.896 ± 0.765
GlnCys: 0.779 ± 0.517
GlnAsp: 3.506 ± 1.386
GlnGlu: 1.948 ± 1.21
GlnPhe: 0.779 ± 0.536
GlnGly: 3.116 ± 0.472
GlnHis: 1.558 ± 0.449
GlnIle: 3.506 ± 0.928
GlnLys: 1.948 ± 0.83
GlnLeu: 4.285 ± 1.036
GlnMet: 1.558 ± 0.451
GlnAsn: 1.558 ± 0.546
GlnPro: 3.506 ± 1.466
GlnGln: 3.506 ± 0.939
GlnArg: 3.506 ± 1.232
GlnSer: 3.896 ± 0.603
GlnThr: 2.727 ± 0.923
GlnVal: 1.558 ± 0.968
GlnTrp: 1.558 ± 0.7
GlnTyr: 1.169 ± 0.339
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.506 ± 1.19
ArgCys: 3.116 ± 0.877
ArgAsp: 1.558 ± 1.01
ArgGlu: 3.896 ± 0.832
ArgPhe: 3.506 ± 0.714
ArgGly: 6.623 ± 3.013
ArgHis: 0.779 ± 0.587
ArgIle: 0.39 ± 0.316
ArgLys: 3.896 ± 0.4
ArgLeu: 7.402 ± 1.166
ArgMet: 0.0 ± 0.0
ArgAsn: 2.337 ± 0.622
ArgPro: 1.948 ± 0.85
ArgGln: 3.506 ± 0.587
ArgArg: 7.402 ± 2.775
ArgSer: 8.181 ± 2.782
ArgThr: 3.896 ± 1.49
ArgVal: 1.948 ± 0.91
ArgTrp: 0.0 ± 0.0
ArgTyr: 3.506 ± 0.729
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.116 ± 1.419
SerCys: 0.39 ± 0.293
SerAsp: 5.454 ± 1.253
SerGlu: 3.116 ± 0.604
SerPhe: 4.285 ± 0.783
SerGly: 7.012 ± 1.657
SerHis: 0.779 ± 0.506
SerIle: 2.337 ± 0.425
SerLys: 3.896 ± 0.4
SerLeu: 7.402 ± 1.25
SerMet: 1.558 ± 0.514
SerAsn: 3.506 ± 1.837
SerPro: 4.285 ± 1.589
SerGln: 2.337 ± 0.936
SerArg: 7.402 ± 3.296
SerSer: 10.518 ± 3.465
SerThr: 8.96 ± 1.837
SerVal: 3.116 ± 1.319
SerTrp: 1.169 ± 0.601
SerTyr: 1.948 ± 0.673
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.169 ± 0.494
ThrCys: 1.558 ± 0.592
ThrAsp: 3.506 ± 0.739
ThrGlu: 5.843 ± 0.997
ThrPhe: 2.337 ± 0.477
ThrGly: 3.896 ± 1.044
ThrHis: 0.779 ± 0.506
ThrIle: 2.337 ± 1.647
ThrLys: 2.727 ± 0.481
ThrLeu: 4.285 ± 1.181
ThrMet: 1.558 ± 0.542
ThrAsn: 2.727 ± 0.9
ThrPro: 7.402 ± 1.995
ThrGln: 3.896 ± 1.548
ThrArg: 3.506 ± 0.983
ThrSer: 6.623 ± 2.024
ThrThr: 1.948 ± 0.899
ThrVal: 5.454 ± 0.733
ThrTrp: 0.39 ± 0.403
ThrTyr: 1.558 ± 0.524
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.454 ± 0.616
ValCys: 0.0 ± 0.0
ValAsp: 3.116 ± 1.035
ValGlu: 5.454 ± 0.828
ValPhe: 1.948 ± 0.727
ValGly: 2.337 ± 0.654
ValHis: 1.169 ± 1.042
ValIle: 4.285 ± 0.962
ValLys: 0.39 ± 0.29
ValLeu: 4.285 ± 1.52
ValMet: 0.0 ± 0.0
ValAsn: 2.727 ± 1.053
ValPro: 4.285 ± 1.797
ValGln: 1.948 ± 0.59
ValArg: 5.454 ± 1.73
ValSer: 6.233 ± 1.359
ValThr: 3.116 ± 0.457
ValVal: 3.116 ± 0.896
ValTrp: 0.779 ± 0.587
ValTyr: 2.337 ± 1.442
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.779 ± 0.346
TrpCys: 0.39 ± 0.29
TrpAsp: 0.0 ± 0.0
TrpGlu: 1.558 ± 0.607
TrpPhe: 0.0 ± 0.0
TrpGly: 0.39 ± 0.293
TrpHis: 0.0 ± 0.0
TrpIle: 1.169 ± 0.869
TrpLys: 1.558 ± 0.876
TrpLeu: 1.558 ± 0.693
TrpMet: 0.0 ± 0.0
TrpAsn: 0.39 ± 0.293
TrpPro: 0.0 ± 0.0
TrpGln: 1.169 ± 0.646
TrpArg: 0.779 ± 0.506
TrpSer: 1.169 ± 0.601
TrpThr: 0.39 ± 0.316
TrpVal: 1.169 ± 0.559
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.39 ± 0.29
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.558 ± 0.546
TyrCys: 0.39 ± 0.448
TyrAsp: 1.169 ± 0.352
TyrGlu: 0.779 ± 0.389
TyrPhe: 1.169 ± 0.712
TyrGly: 2.727 ± 0.504
TyrHis: 0.779 ± 0.399
TyrIle: 1.948 ± 0.9
TyrLys: 2.727 ± 0.642
TyrLeu: 3.116 ± 0.592
TyrMet: 0.779 ± 0.35
TyrAsn: 0.779 ± 0.587
TyrPro: 2.337 ± 0.853
TyrGln: 0.779 ± 0.346
TyrArg: 1.948 ± 0.435
TyrSer: 1.558 ± 0.587
TyrThr: 1.948 ± 0.668
TyrVal: 2.727 ± 1.239
TyrTrp: 0.39 ± 0.403
TyrTyr: 2.727 ± 1.149
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2568 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
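A protein-level bootstrap of this kind can be sketched as follows. The source states only that bootstrapping was done 100 times at the protein level; resampling whole proteins with replacement and reporting the standard deviation of the replicate estimates are assumptions of this sketch, as are the function names.

```python
import random
import statistics


def dipeptide_permille(sequences, dipeptide):
    """Frequency (per mille) of one dipeptide over all overlapping pairs."""
    hits = 0
    total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, dipeptide, n_replicates=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Each replicate draws len(sequences) proteins with replacement, so whole
    proteins (not individual residues) are the resampling unit.
    Assumption: the reported error is the standard deviation over replicates.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_replicates):
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_permille(sample, dipeptide))
    return statistics.stdev(estimates)
```

Because only 7 proteins are resampled here, a dipeptide concentrated in one protein can show a large error relative to its mean, which is consistent with entries such as ProPro (9.739 ± 4.704).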

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski