Amino acid dipeptide frequency for Human papillomavirus 43

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility the dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
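As an illustration of the calculation described above, the following is a minimal Python sketch, not the database's own code. It assumes sequences are given in one-letter amino acid codes (the table uses three-letter codes), that overlapping adjacent pairs are counted within each protein only, and that the names `dipeptide_permille`, `classify` and `EXPECTED_PERMILLE` are hypothetical helpers introduced here. The threshold of 2.5‰ is simply 1/400 expressed in per mille.

```python
from collections import Counter

# Uniform expectation for 400 dipeptides: 1/400 = 0.25 % = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs; pairs spanning two proteins are not
        # counted (an assumption of this sketch).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > EXPECTED_PERMILLE:
        return "over"   # would be marked red
    if freq_permille < EXPECTED_PERMILLE:
        return "under"  # would be marked blue
    return "expected"


if __name__ == "__main__":
    toy = ["MKKLA", "AKK"]  # made-up sequences, not from this proteome
    for pair, freq in sorted(dipeptide_permille(toy).items()):
        print(pair, round(freq, 3), classify(freq))
```

With real data the list passed to `dipeptide_permille` would hold the proteome's protein sequences; the toy input here only shows the mechanics.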

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
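The unit conversion can be written out as a short Python sketch. The function names are hypothetical; the example value 9.987‰ is the LeuLeu entry from the table.

```python
def permille_to_fraction(value_permille):
    """Convert per mille to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert per mille to percent (divide by 10)."""
    return value_permille / 10.0


if __name__ == "__main__":
    leu_leu = 9.987  # LeuLeu frequency from the table, in per mille
    print(permille_to_fraction(leu_leu))
    print(permille_to_percent(leu_leu))
```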

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.222 ± 1.018
AlaCys: 1.611 ± 0.789
AlaAsp: 4.51 ± 1.209
AlaGlu: 2.577 ± 1.016
AlaPhe: 1.933 ± 0.639
AlaGly: 4.51 ± 0.928
AlaHis: 0.644 ± 0.622
AlaIle: 2.255 ± 0.561
AlaLys: 1.611 ± 0.771
AlaLeu: 4.51 ± 1.688
AlaMet: 2.255 ± 0.471
AlaAsn: 1.611 ± 0.67
AlaPro: 4.188 ± 0.952
AlaGln: 3.544 ± 0.868
AlaArg: 3.222 ± 1.652
AlaSer: 7.41 ± 2.208
AlaThr: 2.899 ± 1.1
AlaVal: 1.933 ± 0.619
AlaTrp: 0.0 ± 0.0
AlaTyr: 1.289 ± 0.604
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.289 ± 0.332
CysCys: 0.644 ± 0.541
CysAsp: 0.644 ± 0.637
CysGlu: 0.0 ± 0.0
CysPhe: 2.255 ± 0.767
CysGly: 1.611 ± 0.652
CysHis: 0.322 ± 0.404
CysIle: 3.222 ± 1.036
CysLys: 1.289 ± 0.774
CysLeu: 3.544 ± 2.144
CysMet: 0.966 ± 0.731
CysAsn: 2.255 ± 0.732
CysPro: 1.933 ± 0.535
CysGln: 1.933 ± 0.917
CysArg: 0.644 ± 0.54
CysSer: 1.289 ± 0.689
CysThr: 2.577 ± 0.817
CysVal: 0.966 ± 0.498
CysTrp: 0.644 ± 0.428
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.289 ± 1.081
AspCys: 1.611 ± 0.607
AspAsp: 0.966 ± 0.369
AspGlu: 3.222 ± 1.184
AspPhe: 1.933 ± 0.324
AspGly: 1.611 ± 0.565
AspHis: 0.966 ± 0.701
AspIle: 5.477 ± 1.877
AspLys: 1.611 ± 1.351
AspLeu: 3.544 ± 0.695
AspMet: 2.577 ± 0.717
AspAsn: 3.222 ± 0.89
AspPro: 4.51 ± 1.534
AspGln: 0.966 ± 0.596
AspArg: 0.644 ± 0.54
AspSer: 5.799 ± 1.535
AspThr: 6.765 ± 1.018
AspVal: 2.899 ± 0.48
AspTrp: 1.289 ± 0.541
AspTyr: 2.255 ± 0.787
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.222 ± 1.448
GluCys: 1.611 ± 1.095
GluAsp: 5.477 ± 1.63
GluGlu: 3.866 ± 1.156
GluPhe: 0.322 ± 0.377
GluGly: 1.611 ± 0.884
GluHis: 0.966 ± 0.571
GluIle: 1.289 ± 0.352
GluLys: 1.611 ± 0.903
GluLeu: 3.866 ± 0.949
GluMet: 0.322 ± 0.27
GluAsn: 2.577 ± 0.802
GluPro: 2.255 ± 0.617
GluGln: 3.222 ± 1.167
GluArg: 1.289 ± 0.499
GluSer: 1.933 ± 0.826
GluThr: 2.255 ± 0.975
GluVal: 5.155 ± 0.988
GluTrp: 0.644 ± 0.54
GluTyr: 3.544 ± 1.197
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.255 ± 0.866
PheCys: 0.322 ± 0.404
PheAsp: 1.611 ± 0.879
PheGlu: 1.289 ± 0.916
PhePhe: 4.188 ± 1.128
PheGly: 4.51 ± 1.267
PheHis: 2.255 ± 0.759
PheIle: 1.933 ± 0.623
PheLys: 2.255 ± 0.645
PheLeu: 6.121 ± 1.49
PheMet: 0.322 ± 0.298
PheAsn: 1.611 ± 0.673
PhePro: 3.222 ± 0.81
PheGln: 0.644 ± 0.356
PheArg: 1.933 ± 0.492
PheSer: 3.544 ± 1.013
PheThr: 1.611 ± 1.177
PheVal: 2.899 ± 1.147
PheTrp: 1.289 ± 0.541
PheTyr: 1.289 ± 0.531
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.577 ± 0.625
GlyCys: 2.577 ± 0.854
GlyAsp: 2.899 ± 1.108
GlyGlu: 1.611 ± 0.607
GlyPhe: 3.222 ± 0.864
GlyGly: 3.222 ± 1.835
GlyHis: 2.899 ± 1.258
GlyIle: 4.188 ± 1.235
GlyLys: 3.544 ± 0.742
GlyLeu: 2.255 ± 0.547
GlyMet: 1.289 ± 0.541
GlyAsn: 2.899 ± 1.063
GlyPro: 2.255 ± 0.454
GlyGln: 3.544 ± 1.199
GlyArg: 3.866 ± 1.232
GlySer: 4.188 ± 0.795
GlyThr: 4.832 ± 1.619
GlyVal: 5.155 ± 1.514
GlyTrp: 0.322 ± 0.27
GlyTyr: 3.544 ± 1.085
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.322 ± 0.27
HisCys: 0.966 ± 0.428
HisAsp: 0.0 ± 0.0
HisGlu: 0.322 ± 0.298
HisPhe: 2.255 ± 0.708
HisGly: 1.611 ± 0.745
HisHis: 0.322 ± 0.377
HisIle: 1.933 ± 0.662
HisLys: 1.289 ± 0.87
HisLeu: 1.611 ± 0.944
HisMet: 0.966 ± 0.81
HisAsn: 1.289 ± 0.492
HisPro: 2.577 ± 0.717
HisGln: 0.322 ± 0.377
HisArg: 0.966 ± 0.571
HisSer: 1.933 ± 0.772
HisThr: 2.899 ± 0.853
HisVal: 1.933 ± 0.534
HisTrp: 0.322 ± 0.377
HisTyr: 0.966 ± 0.701
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.866 ± 1.065
IleCys: 1.611 ± 0.641
IleAsp: 0.966 ± 0.81
IleGlu: 2.255 ± 0.467
IlePhe: 2.899 ± 1.462
IleGly: 2.577 ± 0.728
IleHis: 2.255 ± 0.767
IleIle: 3.222 ± 0.827
IleLys: 2.255 ± 0.999
IleLeu: 2.577 ± 1.159
IleMet: 0.322 ± 0.479
IleAsn: 1.289 ± 0.352
IlePro: 3.866 ± 2.025
IleGln: 2.255 ± 0.46
IleArg: 1.933 ± 1.058
IleSer: 4.51 ± 1.286
IleThr: 4.188 ± 0.832
IleVal: 6.121 ± 1.916
IleTrp: 0.322 ± 0.295
IleTyr: 2.255 ± 0.616
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.188 ± 0.944
LysCys: 2.255 ± 0.989
LysAsp: 0.966 ± 0.533
LysGlu: 2.255 ± 0.593
LysPhe: 3.544 ± 1.388
LysGly: 0.644 ± 0.428
LysHis: 2.577 ± 1.757
LysIle: 2.899 ± 0.89
LysLys: 3.866 ± 0.94
LysLeu: 3.866 ± 1.57
LysMet: 0.322 ± 0.27
LysAsn: 1.933 ± 0.739
LysPro: 2.255 ± 0.736
LysGln: 1.933 ± 0.655
LysArg: 5.477 ± 0.682
LysSer: 3.544 ± 1.176
LysThr: 2.899 ± 0.94
LysVal: 3.866 ± 0.746
LysTrp: 1.289 ± 0.556
LysTyr: 3.866 ± 1.018
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.51 ± 1.239
LeuCys: 4.51 ± 1.368
LeuAsp: 3.544 ± 0.879
LeuGlu: 5.155 ± 1.942
LeuPhe: 4.832 ± 1.253
LeuGly: 5.799 ± 0.809
LeuHis: 3.222 ± 0.803
LeuIle: 2.577 ± 1.48
LeuLys: 5.155 ± 0.837
LeuLeu: 9.987 ± 3.373
LeuMet: 0.644 ± 0.411
LeuAsn: 5.155 ± 0.543
LeuPro: 2.577 ± 1.072
LeuGln: 6.765 ± 1.25
LeuArg: 3.544 ± 1.501
LeuSer: 3.866 ± 1.116
LeuThr: 6.765 ± 0.803
LeuVal: 4.51 ± 1.19
LeuTrp: 2.255 ± 0.822
LeuTyr: 2.577 ± 1.032
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.933 ± 0.691
MetCys: 1.611 ± 0.824
MetAsp: 2.899 ± 0.938
MetGlu: 0.644 ± 0.574
MetPhe: 0.644 ± 0.356
MetGly: 2.577 ± 0.585
MetHis: 0.966 ± 0.52
MetIle: 0.0 ± 0.0
MetLys: 0.644 ± 0.54
MetLeu: 1.611 ± 1.077
MetMet: 0.644 ± 0.711
MetAsn: 0.0 ± 0.0
MetPro: 0.0 ± 0.0
MetGln: 1.289 ± 0.499
MetArg: 0.322 ± 0.27
MetSer: 1.289 ± 0.757
MetThr: 0.966 ± 0.509
MetVal: 1.933 ± 0.726
MetTrp: 0.644 ± 0.356
MetTyr: 0.966 ± 0.376
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.155 ± 0.982
AsnCys: 0.644 ± 0.523
AsnAsp: 1.933 ± 0.726
AsnGlu: 0.322 ± 0.397
AsnPhe: 0.644 ± 0.356
AsnGly: 2.899 ± 1.193
AsnHis: 0.0 ± 0.0
AsnIle: 0.966 ± 0.451
AsnLys: 5.477 ± 1.685
AsnLeu: 3.866 ± 0.887
AsnMet: 0.644 ± 0.379
AsnAsn: 1.933 ± 0.34
AsnPro: 3.222 ± 0.482
AsnGln: 0.644 ± 0.356
AsnArg: 1.289 ± 0.332
AsnSer: 2.577 ± 0.45
AsnThr: 2.899 ± 0.582
AsnVal: 1.933 ± 0.642
AsnTrp: 0.322 ± 0.27
AsnTyr: 0.322 ± 0.298
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.544 ± 0.95
ProCys: 0.966 ± 0.436
ProAsp: 3.222 ± 0.634
ProGlu: 1.933 ± 0.65
ProPhe: 1.289 ± 0.764
ProGly: 2.577 ± 0.694
ProHis: 0.322 ± 0.377
ProIle: 3.866 ± 1.18
ProLys: 4.51 ± 1.042
ProLeu: 6.765 ± 1.206
ProMet: 0.966 ± 0.376
ProAsn: 1.289 ± 0.352
ProPro: 5.477 ± 1.232
ProGln: 1.289 ± 0.884
ProArg: 3.222 ± 1.232
ProSer: 6.443 ± 1.692
ProThr: 6.121 ± 2.416
ProVal: 2.899 ± 0.897
ProTrp: 0.322 ± 0.377
ProTyr: 3.866 ± 1.215
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.899 ± 0.908
GlnCys: 0.644 ± 0.53
GlnAsp: 2.255 ± 0.626
GlnGlu: 1.611 ± 1.059
GlnPhe: 2.577 ± 0.93
GlnGly: 2.899 ± 0.886
GlnHis: 0.966 ± 0.535
GlnIle: 2.255 ± 0.83
GlnLys: 3.222 ± 1.353
GlnLeu: 5.155 ± 1.449
GlnMet: 1.289 ± 0.332
GlnAsn: 0.966 ± 0.585
GlnPro: 2.899 ± 1.162
GlnGln: 1.933 ± 0.938
GlnArg: 3.222 ± 0.424
GlnSer: 1.289 ± 0.889
GlnThr: 3.544 ± 1.319
GlnVal: 1.611 ± 0.884
GlnTrp: 1.611 ± 0.828
GlnTyr: 0.966 ± 0.509
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.289 ± 0.492
ArgCys: 0.322 ± 0.404
ArgAsp: 2.577 ± 1.207
ArgGlu: 2.899 ± 0.912
ArgPhe: 1.933 ± 0.803
ArgGly: 3.544 ± 1.164
ArgHis: 1.933 ± 0.709
ArgIle: 1.289 ± 0.734
ArgLys: 2.899 ± 0.48
ArgLeu: 6.443 ± 1.176
ArgMet: 0.322 ± 0.346
ArgAsn: 0.322 ± 0.27
ArgPro: 2.255 ± 0.942
ArgGln: 1.611 ± 0.771
ArgArg: 2.899 ± 0.691
ArgSer: 5.477 ± 1.106
ArgThr: 3.222 ± 0.617
ArgVal: 3.222 ± 0.86
ArgTrp: 0.644 ± 0.374
ArgTyr: 1.933 ± 1.123
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.866 ± 1.211
SerCys: 2.255 ± 1.138
SerAsp: 6.121 ± 1.587
SerGlu: 4.188 ± 0.574
SerPhe: 1.933 ± 0.929
SerGly: 6.121 ± 1.676
SerHis: 1.611 ± 0.867
SerIle: 4.188 ± 1.451
SerLys: 3.544 ± 1.277
SerLeu: 6.443 ± 0.775
SerMet: 0.966 ± 0.571
SerAsn: 2.577 ± 0.576
SerPro: 5.155 ± 0.733
SerGln: 2.899 ± 0.791
SerArg: 3.866 ± 1.121
SerSer: 7.732 ± 1.174
SerThr: 8.698 ± 1.319
SerVal: 3.544 ± 1.199
SerTrp: 0.322 ± 0.311
SerTyr: 1.933 ± 0.776
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.222 ± 1.145
ThrCys: 1.933 ± 0.789
ThrAsp: 4.832 ± 1.445
ThrGlu: 5.477 ± 1.233
ThrPhe: 3.222 ± 1.051
ThrGly: 4.832 ± 1.412
ThrHis: 0.966 ± 0.894
ThrIle: 2.577 ± 0.958
ThrLys: 1.933 ± 0.713
ThrLeu: 8.054 ± 2.337
ThrMet: 2.577 ± 0.516
ThrAsn: 3.222 ± 0.568
ThrPro: 3.544 ± 0.851
ThrGln: 4.51 ± 1.064
ThrArg: 2.577 ± 0.61
ThrSer: 6.443 ± 1.346
ThrThr: 6.765 ± 1.536
ThrVal: 8.376 ± 1.15
ThrTrp: 0.966 ± 0.907
ThrTyr: 3.544 ± 0.794
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.899 ± 0.619
ValCys: 0.966 ± 0.7
ValAsp: 3.544 ± 0.724
ValGlu: 6.121 ± 0.984
ValPhe: 2.899 ± 1.271
ValGly: 3.866 ± 1.076
ValHis: 0.966 ± 0.894
ValIle: 2.899 ± 0.605
ValLys: 3.222 ± 0.774
ValLeu: 1.611 ± 0.744
ValMet: 2.577 ± 0.616
ValAsn: 2.255 ± 0.665
ValPro: 7.088 ± 1.348
ValGln: 3.222 ± 0.905
ValArg: 2.255 ± 1.022
ValSer: 6.765 ± 2.448
ValThr: 6.443 ± 1.298
ValVal: 4.51 ± 0.875
ValTrp: 1.933 ± 0.973
ValTyr: 1.933 ± 0.586
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.289 ± 0.399
TrpCys: 0.322 ± 0.311
TrpAsp: 0.644 ± 0.356
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.644 ± 0.54
TrpGly: 1.611 ± 0.673
TrpHis: 0.644 ± 0.434
TrpIle: 1.289 ± 0.541
TrpLys: 1.289 ± 0.857
TrpLeu: 1.933 ± 0.925
TrpMet: 0.322 ± 0.442
TrpAsn: 1.289 ± 0.332
TrpPro: 0.322 ± 0.401
TrpGln: 0.0 ± 0.0
TrpArg: 1.933 ± 0.851
TrpSer: 0.322 ± 0.377
TrpThr: 1.289 ± 0.829
TrpVal: 0.644 ± 0.54
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.899 ± 0.934
TyrCys: 0.644 ± 0.541
TyrAsp: 3.544 ± 0.704
TyrGlu: 1.289 ± 1.062
TyrPhe: 1.933 ± 0.752
TyrGly: 2.255 ± 0.621
TyrHis: 0.0 ± 0.0
TyrIle: 3.222 ± 1.012
TyrLys: 3.222 ± 0.711
TyrLeu: 4.188 ± 1.045
TyrMet: 0.966 ± 0.586
TyrAsn: 0.0 ± 0.0
TyrPro: 1.289 ± 0.698
TyrGln: 1.289 ± 0.352
TyrArg: 2.255 ± 0.787
TyrSer: 1.611 ± 0.749
TyrThr: 1.933 ± 0.534
TyrVal: 3.866 ± 0.848
TyrTrp: 0.644 ± 0.356
TyrTyr: 1.933 ± 0.612
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (3105 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
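A protein-level bootstrap of this kind can be sketched in Python as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each of 100 replicates, and the sample standard deviation of the replicates is reported as the error. The source does not say which spread statistic is used, and the function names and one-letter sequence encoding are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs, counted within each protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Spread of the frequency when whole proteins are resampled."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    # Assumption: the error is the sample standard deviation.
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["MKKLA", "AKK", "LLLL"]  # made-up sequences
    print(pair_permille(toy, "KK"), bootstrap_error(toy, "KK"))
```

Under this scheme a dipeptide that never occurs gets a frequency of 0 in every replicate, which matches the 0.0 ± 0.0 entries in the table.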

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski