Amino acid dipeptide frequency for Human papillomavirus 11

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
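As an illustration of the calculation described above, here is a minimal Python sketch that counts overlapping dipeptides in a set of protein sequences, reports them in per mille, and compares each value with the uniform expectation of 1000/400 = 2.5‰. The function names, the one-letter alphabet, and the toy sequences are assumptions made for this example. The exact counting convention used by the database (for instance, how protein boundaries are handled) is not stated on this page, so the sketch may not reproduce the table values exactly.

```python
from collections import Counter
from itertools import product

# One-letter codes for the 20 standard amino acids.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all given sequences."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs within one protein; pairs never span two proteins
        # (an assumption of this sketch).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Uniform expectation over the 400 standard dipeptides, in per mille.
EXPECTED = 1000.0 / len(STANDARD) ** 2


def classify(freq, expected=EXPECTED):
    """Label a per-mille frequency relative to the uniform expectation."""
    if freq > expected:
        return "over"
    if freq < expected:
        return "under"
    return "expected"


toy = ["MALLLA", "MKT"]  # made-up sequences, for illustration only
freqs = dipeptide_per_mille(toy)
print(len(freqs), EXPECTED, classify(freqs["LL"]))
```

With the values from the table, classify(10.939) labels LeuLeu as overrepresented and classify(0.377) labels AlaTrp as underrepresented.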

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
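The unit conversion is simple arithmetic. A small Python sketch (the helper names are hypothetical, chosen for this example) shows it:

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0


# AlaAla is listed as 4.527 per mille in the table.
print(per_mille_to_fraction(4.527), per_mille_to_percent(4.527))
```

In the same units, the uniform expectation of 0.25% corresponds to 2.5‰.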

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.527 ± 1.685
AlaCys: 2.641 ± 1.266
AlaAsp: 3.772 ± 1.571
AlaGlu: 3.395 ± 0.719
AlaPhe: 3.772 ± 1.574
AlaGly: 1.886 ± 0.919
AlaHis: 0.754 ± 0.638
AlaIle: 5.281 ± 1.248
AlaLys: 2.641 ± 1.561
AlaLeu: 3.395 ± 1.014
AlaMet: 0.754 ± 0.423
AlaAsn: 1.509 ± 0.708
AlaPro: 6.035 ± 1.545
AlaGln: 3.018 ± 0.84
AlaArg: 4.149 ± 0.938
AlaSer: 4.904 ± 1.084
AlaThr: 4.149 ± 0.527
AlaVal: 3.018 ± 1.226
AlaTrp: 0.377 ± 0.319
AlaTyr: 2.263 ± 1.33
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.641 ± 0.881
CysCys: 1.132 ± 1.13
CysAsp: 0.377 ± 0.514
CysGlu: 1.132 ± 0.503
CysPhe: 2.263 ± 1.085
CysGly: 1.132 ± 0.632
CysHis: 1.509 ± 0.895
CysIle: 1.132 ± 0.577
CysLys: 1.886 ± 0.604
CysLeu: 1.509 ± 0.978
CysMet: 1.132 ± 0.478
CysAsn: 0.0 ± 0.0
CysPro: 1.886 ± 0.641
CysGln: 0.754 ± 0.449
CysArg: 1.132 ± 0.608
CysSer: 1.509 ± 0.638
CysThr: 2.263 ± 0.679
CysVal: 2.263 ± 0.923
CysTrp: 1.132 ± 0.478
CysTyr: 1.509 ± 0.936
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.149 ± 1.548
AspCys: 1.132 ± 0.6
AspAsp: 2.641 ± 1.147
AspGlu: 1.509 ± 0.745
AspPhe: 1.509 ± 0.734
AspGly: 3.395 ± 1.044
AspHis: 0.0 ± 0.0
AspIle: 5.658 ± 1.768
AspLys: 1.509 ± 1.167
AspLeu: 3.018 ± 1.267
AspMet: 1.132 ± 0.676
AspAsn: 3.395 ± 1.074
AspPro: 4.149 ± 1.582
AspGln: 1.509 ± 0.567
AspArg: 2.263 ± 1.139
AspSer: 7.167 ± 1.969
AspThr: 4.149 ± 0.899
AspVal: 2.263 ± 1.372
AspTrp: 1.132 ± 0.368
AspTyr: 1.886 ± 1.414
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.281 ± 1.826
GluCys: 0.377 ± 0.514
GluAsp: 6.035 ± 1.049
GluGlu: 5.281 ± 1.345
GluPhe: 0.754 ± 0.381
GluGly: 1.886 ± 0.913
GluHis: 2.641 ± 1.048
GluIle: 1.886 ± 0.915
GluLys: 3.018 ± 1.083
GluLeu: 3.772 ± 0.558
GluMet: 1.509 ± 0.859
GluAsn: 2.263 ± 0.711
GluPro: 3.395 ± 0.77
GluGln: 1.509 ± 0.75
GluArg: 0.754 ± 0.638
GluSer: 3.018 ± 1.036
GluThr: 1.132 ± 0.503
GluVal: 4.527 ± 0.925
GluTrp: 0.377 ± 0.319
GluTyr: 1.132 ± 0.402
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.263 ± 1.217
PheCys: 0.754 ± 0.453
PheAsp: 2.641 ± 0.56
PheGlu: 1.886 ± 0.959
PhePhe: 2.263 ± 0.806
PheGly: 1.886 ± 0.663
PheHis: 0.0 ± 0.0
PheIle: 1.886 ± 0.545
PheLys: 2.641 ± 1.181
PheLeu: 3.772 ± 0.962
PheMet: 1.509 ± 0.484
PheAsn: 2.263 ± 0.916
PhePro: 2.641 ± 0.823
PheGln: 1.509 ± 0.607
PheArg: 1.509 ± 0.708
PheSer: 1.132 ± 0.686
PheThr: 2.263 ± 1.321
PheVal: 2.263 ± 0.777
PheTrp: 0.754 ± 0.381
PheTyr: 1.509 ± 0.826
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.509 ± 1.002
GlyCys: 1.509 ± 0.536
GlyAsp: 4.527 ± 1.652
GlyGlu: 1.886 ± 0.663
GlyPhe: 1.886 ± 0.338
GlyGly: 3.395 ± 1.29
GlyHis: 1.886 ± 0.989
GlyIle: 2.641 ± 0.704
GlyLys: 1.886 ± 0.856
GlyLeu: 3.772 ± 0.945
GlyMet: 0.754 ± 0.381
GlyAsn: 3.018 ± 0.868
GlyPro: 2.641 ± 1.238
GlyGln: 2.263 ± 0.595
GlyArg: 2.641 ± 1.019
GlySer: 4.904 ± 1.463
GlyThr: 6.035 ± 1.368
GlyVal: 3.018 ± 0.593
GlyTrp: 0.377 ± 0.319
GlyTyr: 2.641 ± 1.017
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.132 ± 0.6
HisCys: 0.754 ± 0.684
HisAsp: 0.0 ± 0.0
HisGlu: 0.377 ± 0.349
HisPhe: 1.132 ± 0.478
HisGly: 2.263 ± 1.266
HisHis: 0.377 ± 0.319
HisIle: 3.395 ± 1.123
HisLys: 2.641 ± 1.674
HisLeu: 2.641 ± 1.459
HisMet: 0.377 ± 0.319
HisAsn: 1.132 ± 0.429
HisPro: 1.509 ± 0.851
HisGln: 1.132 ± 0.536
HisArg: 1.509 ± 0.669
HisSer: 2.263 ± 0.867
HisThr: 2.641 ± 0.828
HisVal: 1.132 ± 0.57
HisTrp: 1.886 ± 1.131
HisTyr: 1.886 ± 0.798
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.395 ± 1.186
IleCys: 1.886 ± 0.704
IleAsp: 1.886 ± 0.894
IleGlu: 3.772 ± 1.683
IlePhe: 0.754 ± 0.733
IleGly: 1.886 ± 0.958
IleHis: 2.263 ± 0.621
IleIle: 1.132 ± 0.858
IleLys: 3.018 ± 1.379
IleLeu: 4.149 ± 2.132
IleMet: 0.754 ± 0.429
IleAsn: 1.509 ± 0.661
IlePro: 3.772 ± 1.867
IleGln: 1.886 ± 0.545
IleArg: 4.149 ± 1.257
IleSer: 3.395 ± 1.834
IleThr: 4.527 ± 1.813
IleVal: 5.281 ± 1.837
IleTrp: 0.0 ± 0.0
IleTyr: 2.263 ± 0.903
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.509 ± 0.567
LysCys: 3.018 ± 1.43
LysAsp: 2.263 ± 0.849
LysGlu: 2.641 ± 1.328
LysPhe: 2.263 ± 0.951
LysGly: 1.886 ± 0.94
LysHis: 3.395 ± 1.358
LysIle: 1.509 ± 0.597
LysLys: 2.263 ± 0.87
LysLeu: 3.018 ± 1.544
LysMet: 0.754 ± 0.383
LysAsn: 1.509 ± 0.597
LysPro: 3.018 ± 1.07
LysGln: 2.263 ± 0.827
LysArg: 4.527 ± 1.047
LysSer: 2.263 ± 0.967
LysThr: 2.641 ± 1.204
LysVal: 5.658 ± 0.941
LysTrp: 0.754 ± 0.446
LysTyr: 4.149 ± 1.326
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 1.886 ± 0.733
LeuCys: 3.395 ± 1.739
LeuAsp: 3.772 ± 0.868
LeuGlu: 3.772 ± 1.425
LeuPhe: 4.527 ± 1.211
LeuGly: 4.904 ± 0.968
LeuHis: 5.281 ± 1.328
LeuIle: 4.904 ± 1.97
LeuLys: 4.527 ± 1.252
LeuLeu: 10.939 ± 3.722
LeuMet: 1.509 ± 0.653
LeuAsn: 4.149 ± 1.268
LeuPro: 2.263 ± 1.079
LeuGln: 7.544 ± 2.009
LeuArg: 1.132 ± 0.492
LeuSer: 4.904 ± 1.095
LeuThr: 7.544 ± 2.259
LeuVal: 4.527 ± 1.563
LeuTrp: 1.132 ± 0.709
LeuTyr: 2.641 ± 0.849
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.886 ± 0.8
MetCys: 0.377 ± 0.319
MetAsp: 1.132 ± 0.368
MetGlu: 1.886 ± 1.239
MetPhe: 0.377 ± 0.367
MetGly: 0.754 ± 0.437
MetHis: 1.132 ± 0.86
MetIle: 0.0 ± 0.0
MetLys: 0.754 ± 0.408
MetLeu: 0.754 ± 0.525
MetMet: 0.0 ± 0.0
MetAsn: 1.132 ± 0.852
MetPro: 0.0 ± 0.0
MetGln: 0.754 ± 0.429
MetArg: 0.754 ± 0.381
MetSer: 2.641 ± 0.912
MetThr: 0.377 ± 0.367
MetVal: 2.641 ± 1.188
MetTrp: 0.754 ± 0.449
MetTyr: 0.377 ± 0.349
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.149 ± 1.337
AsnCys: 1.886 ± 1.014
AsnAsp: 2.263 ± 0.884
AsnGlu: 0.754 ± 0.453
AsnPhe: 1.509 ± 0.807
AsnGly: 2.263 ± 1.063
AsnHis: 1.509 ± 0.297
AsnIle: 3.395 ± 1.094
AsnLys: 3.395 ± 1.667
AsnLeu: 1.886 ± 0.936
AsnMet: 0.754 ± 0.538
AsnAsn: 3.395 ± 1.595
AsnPro: 2.641 ± 0.939
AsnGln: 0.754 ± 0.865
AsnArg: 1.886 ± 1.073
AsnSer: 3.395 ± 1.118
AsnThr: 2.641 ± 0.929
AsnVal: 1.509 ± 0.536
AsnTrp: 0.754 ± 0.638
AsnTyr: 1.132 ± 0.655
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.527 ± 1.96
ProCys: 0.754 ± 0.469
ProAsp: 3.772 ± 1.588
ProGlu: 2.641 ± 0.671
ProPhe: 2.641 ± 1.234
ProGly: 0.754 ± 0.381
ProHis: 1.132 ± 0.739
ProIle: 2.263 ± 0.737
ProLys: 3.395 ± 0.676
ProLeu: 8.676 ± 2.282
ProMet: 1.132 ± 0.601
ProAsn: 1.886 ± 0.937
ProPro: 9.053 ± 2.084
ProGln: 0.754 ± 0.697
ProArg: 3.018 ± 1.287
ProSer: 3.772 ± 1.532
ProThr: 6.035 ± 2.059
ProVal: 4.904 ± 2.901
ProTrp: 0.754 ± 0.586
ProTyr: 1.886 ± 1.374
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.886 ± 0.809
GlnCys: 1.132 ± 0.715
GlnAsp: 3.772 ± 1.3
GlnGlu: 0.754 ± 0.638
GlnPhe: 3.395 ± 0.58
GlnGly: 2.263 ± 0.797
GlnHis: 1.509 ± 1.189
GlnIle: 1.886 ± 0.902
GlnLys: 1.509 ± 0.638
GlnLeu: 4.149 ± 1.544
GlnMet: 1.886 ± 0.941
GlnAsn: 1.509 ± 0.541
GlnPro: 3.018 ± 1.183
GlnGln: 1.509 ± 0.694
GlnArg: 3.395 ± 1.076
GlnSer: 2.641 ± 0.974
GlnThr: 2.641 ± 0.578
GlnVal: 1.509 ± 0.656
GlnTrp: 1.509 ± 0.923
GlnTyr: 1.132 ± 0.578
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.527 ± 1.32
ArgCys: 1.886 ± 0.885
ArgAsp: 1.132 ± 0.653
ArgGlu: 1.886 ± 0.959
ArgPhe: 1.132 ± 0.524
ArgGly: 3.018 ± 0.471
ArgHis: 3.018 ± 0.771
ArgIle: 1.132 ± 1.046
ArgLys: 3.772 ± 0.711
ArgLeu: 6.035 ± 1.234
ArgMet: 0.0 ± 0.0
ArgAsn: 2.263 ± 0.866
ArgPro: 3.018 ± 1.156
ArgGln: 1.886 ± 0.797
ArgArg: 4.149 ± 2.038
ArgSer: 2.641 ± 0.73
ArgThr: 3.018 ± 1.524
ArgVal: 1.509 ± 0.61
ArgTrp: 0.0 ± 0.0
ArgTyr: 1.132 ± 0.569
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.281 ± 1.631
SerCys: 0.754 ± 0.638
SerAsp: 4.149 ± 1.273
SerGlu: 4.904 ± 1.05
SerPhe: 0.754 ± 0.381
SerGly: 6.035 ± 1.94
SerHis: 1.132 ± 0.636
SerIle: 5.281 ± 0.545
SerLys: 2.263 ± 1.015
SerLeu: 5.281 ± 0.629
SerMet: 0.754 ± 0.697
SerAsn: 4.149 ± 1.645
SerPro: 3.772 ± 0.836
SerGln: 1.886 ± 0.685
SerArg: 2.263 ± 0.943
SerSer: 8.299 ± 1.932
SerThr: 8.676 ± 2.051
SerVal: 7.544 ± 1.344
SerTrp: 0.754 ± 0.437
SerTyr: 1.132 ± 0.431
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.641 ± 0.975
ThrCys: 3.018 ± 0.802
ThrAsp: 3.018 ± 0.609
ThrGlu: 2.263 ± 0.736
ThrPhe: 2.263 ± 1.093
ThrGly: 4.904 ± 1.7
ThrHis: 0.754 ± 0.437
ThrIle: 3.018 ± 1.337
ThrLys: 2.263 ± 0.828
ThrLeu: 7.544 ± 1.566
ThrMet: 0.377 ± 0.367
ThrAsn: 3.772 ± 1.165
ThrPro: 6.413 ± 1.897
ThrGln: 5.658 ± 1.193
ThrArg: 1.886 ± 0.809
ThrSer: 7.167 ± 2.045
ThrThr: 10.562 ± 2.759
ThrVal: 8.676 ± 1.977
ThrTrp: 1.509 ± 0.734
ThrTyr: 3.772 ± 0.992
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.527 ± 0.813
ValCys: 1.886 ± 1.126
ValAsp: 4.904 ± 1.32
ValGlu: 6.79 ± 1.776
ValPhe: 1.886 ± 1.088
ValGly: 4.527 ± 1.756
ValHis: 0.754 ± 0.585
ValIle: 3.018 ± 0.755
ValLys: 3.018 ± 1.058
ValLeu: 4.527 ± 1.564
ValMet: 1.132 ± 0.674
ValAsn: 2.263 ± 0.951
ValPro: 3.395 ± 1.173
ValGln: 5.281 ± 1.12
ValArg: 2.263 ± 0.737
ValSer: 7.167 ± 2.079
ValThr: 6.035 ± 1.792
ValVal: 6.035 ± 1.651
ValTrp: 1.509 ± 0.755
ValTyr: 2.263 ± 1.012
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.509 ± 0.492
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.754 ± 0.449
TrpPhe: 1.132 ± 0.645
TrpGly: 1.886 ± 0.728
TrpHis: 0.754 ± 0.527
TrpIle: 0.754 ± 0.638
TrpLys: 1.886 ± 0.697
TrpLeu: 3.018 ± 1.195
TrpMet: 0.0 ± 0.0
TrpAsn: 0.377 ± 0.367
TrpPro: 0.377 ± 0.43
TrpGln: 0.0 ± 0.0
TrpArg: 0.754 ± 0.549
TrpSer: 0.0 ± 0.0
TrpThr: 1.886 ± 0.885
TrpVal: 0.754 ± 0.429
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.377 ± 0.349
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.018 ± 1.695
TyrCys: 0.0 ± 0.0
TyrAsp: 1.509 ± 0.541
TyrGlu: 2.263 ± 0.894
TyrPhe: 1.132 ± 0.858
TyrGly: 2.263 ± 1.052
TyrHis: 0.377 ± 0.367
TyrIle: 1.509 ± 1.033
TyrLys: 3.018 ± 0.908
TyrLeu: 3.395 ± 0.637
TyrMet: 1.886 ± 1.008
TyrAsn: 0.754 ± 0.527
TyrPro: 0.754 ± 0.546
TyrGln: 1.132 ± 0.653
TyrArg: 3.018 ± 1.107
TyrSer: 1.886 ± 0.338
TyrThr: 2.263 ± 0.82
TyrVal: 4.149 ± 1.511
TyrTrp: 0.754 ± 0.381
TyrTyr: 1.132 ± 0.732
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (2652 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
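One way such an error estimate could be obtained is sketched below in Python: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread of the replicates is reported. This is only an illustration under stated assumptions. The page does not describe the exact procedure, so treating the ± value as the standard deviation across replicates, as well as the function names and the toy sequences, are assumptions of this sketch.

```python
import random
import statistics


def dipeptide_freq(proteins, dipeptide):
    """Per-mille frequency of one dipeptide across a list of sequences."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide  # True counts as 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    if not proteins:
        return 0.0
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    replicates = []
    for _ in range(n_boot):
        # Draw whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(dipeptide_freq(sample, dipeptide))
    return statistics.pstdev(replicates)


toy = ["MALLLA", "MKT", "LLLL"]  # made-up sequences, for illustration only
print(dipeptide_freq(toy, "LL"), bootstrap_error(toy, "LL"))
```

Resampling at the protein level, rather than resampling individual dipeptides, keeps each protein's composition intact, so the estimate reflects variation between proteins.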

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski