Amino acid dipeptide frequency for Human papillomavirus 158

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
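
As a minimal sketch of how such frequencies could be computed, the snippet below counts overlapping residue pairs within each protein and reports them in per mille. The function name `dipeptide_frequencies` and the toy sequences are hypothetical, one-letter codes are used instead of the three-letter codes of the table, and it is an assumption that pairs are not counted across protein boundaries; the database's own procedure may differ.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return dipeptide frequencies in per mille for a list of protein sequences."""
    counts = Counter()
    for seq in sequences:
        # Slide a window of length 2 along each protein; pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences, not taken from the proteome described here.
toy_proteins = ["MKKLA", "MKA"]
freqs = dipeptide_frequencies(toy_proteins)
```

Here the toy input yields six dipeptides in total, two of which are `MK`, so `freqs["MK"]` is one third of 1000‰.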

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins (with all amino acids equally likely), each dipeptide would be present with a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
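
The expectation above can be written out as a short sketch. The three example values are taken from the table below; the `classify` helper and the plain comparison against the uniform expectation are assumptions made for illustration, since the page does not state the exact rule used for the red and blue marking.

```python
N_STANDARD = 20  # standard amino acids; use 21 to include Xaa (441 pairs)

# Uniform expectation in per mille: 1000 / 400 = 2.5
uniform_expected = 1000.0 / (N_STANDARD ** 2)


def classify(observed_permille, expected_permille=uniform_expected):
    """Label a dipeptide relative to the uniform expectation (hypothetical rule)."""
    if observed_permille > expected_permille:
        return "over"
    if observed_permille < expected_permille:
        return "under"
    return "as expected"


# Observed frequencies (per mille) copied from the table.
examples = {"LeuLeu": 9.304, "AlaCys": 0.443, "TrpTrp": 0.0}
labels = {pair: classify(value) for pair, value in examples.items()}
```

With these inputs, LeuLeu is labelled as overrepresented, while AlaCys and TrpTrp fall below the 2.5‰ expectation.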

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
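
As a small worked example of this conversion, the sketch below turns a per mille value into a fraction and a percentage. The helper names are hypothetical; the input 3.101 is the AlaAla value from the table.

```python
import math


def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0


ala_ala_fraction = permille_to_fraction(3.101)  # about 0.003101
ala_ala_percent = permille_to_percent(3.101)    # about 0.3101 %
```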

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.101 ± 0.889
AlaCys: 0.443 ± 0.508
AlaAsp: 5.76 ± 1.602
AlaGlu: 4.431 ± 1.495
AlaPhe: 2.658 ± 0.865
AlaGly: 2.658 ± 1.051
AlaHis: 0.443 ± 0.355
AlaIle: 2.215 ± 0.592
AlaLys: 2.215 ± 0.859
AlaLeu: 5.317 ± 1.839
AlaMet: 1.772 ± 0.204
AlaAsn: 3.101 ± 1.173
AlaPro: 1.329 ± 0.385
AlaGln: 3.545 ± 1.258
AlaArg: 3.545 ± 0.805
AlaSer: 2.658 ± 0.816
AlaThr: 3.545 ± 0.607
AlaVal: 4.874 ± 1.081
AlaTrp: 0.443 ± 0.448
AlaTyr: 2.215 ± 1.105
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.772 ± 0.807
CysCys: 2.658 ± 1.775
CysAsp: 0.443 ± 0.355
CysGlu: 1.772 ± 0.746
CysPhe: 0.443 ± 0.354
CysGly: 0.0 ± 0.0
CysHis: 0.443 ± 0.508
CysIle: 0.886 ± 1.017
CysLys: 1.772 ± 0.453
CysLeu: 1.772 ± 0.913
CysMet: 0.443 ± 0.508
CysAsn: 0.886 ± 0.707
CysPro: 1.329 ± 0.757
CysGln: 0.443 ± 0.48
CysArg: 1.329 ± 1.012
CysSer: 0.443 ± 0.354
CysThr: 1.329 ± 0.654
CysVal: 0.443 ± 0.48
CysTrp: 1.329 ± 0.426
CysTyr: 1.772 ± 0.818
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.988 ± 1.677
AspCys: 1.329 ± 0.367
AspAsp: 3.101 ± 0.876
AspGlu: 3.101 ± 1.324
AspPhe: 3.101 ± 1.607
AspGly: 2.215 ± 0.682
AspHis: 2.215 ± 0.808
AspIle: 3.988 ± 2.708
AspLys: 2.658 ± 1.219
AspLeu: 6.646 ± 2.376
AspMet: 0.443 ± 0.355
AspAsn: 2.658 ± 0.443
AspPro: 5.76 ± 1.839
AspGln: 3.101 ± 1.199
AspArg: 2.215 ± 0.796
AspSer: 4.874 ± 1.074
AspThr: 4.431 ± 0.523
AspVal: 4.874 ± 0.848
AspTrp: 2.215 ± 1.315
AspTyr: 1.772 ± 0.715
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.545 ± 0.607
GluCys: 0.886 ± 0.707
GluAsp: 4.431 ± 0.879
GluGlu: 7.089 ± 2.263
GluPhe: 1.329 ± 0.611
GluGly: 0.886 ± 0.389
GluHis: 1.329 ± 0.774
GluIle: 3.101 ± 0.792
GluLys: 2.658 ± 0.812
GluLeu: 3.545 ± 0.966
GluMet: 2.215 ± 0.872
GluAsn: 5.317 ± 0.424
GluPro: 2.215 ± 1.24
GluGln: 2.215 ± 0.334
GluArg: 3.101 ± 1.418
GluSer: 4.874 ± 1.299
GluThr: 5.317 ± 1.1
GluVal: 3.545 ± 0.777
GluTrp: 0.443 ± 0.354
GluTyr: 1.772 ± 0.975
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.772 ± 0.573
PheCys: 1.329 ± 1.012
PheAsp: 2.215 ± 0.966
PheGlu: 4.874 ± 1.645
PhePhe: 2.658 ± 1.086
PheGly: 3.545 ± 0.533
PheHis: 0.886 ± 0.625
PheIle: 2.658 ± 0.941
PheLys: 3.101 ± 1.623
PheLeu: 4.431 ± 1.14
PheMet: 0.443 ± 0.388
PheAsn: 3.101 ± 0.888
PhePro: 1.329 ± 0.426
PheGln: 2.215 ± 0.571
PheArg: 1.772 ± 0.746
PheSer: 1.772 ± 0.779
PheThr: 3.101 ± 0.389
PheVal: 3.545 ± 0.668
PheTrp: 0.886 ± 0.389
PheTyr: 1.772 ± 0.453
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.329 ± 0.611
GlyCys: 1.772 ± 0.913
GlyAsp: 5.76 ± 1.564
GlyGlu: 2.658 ± 1.271
GlyPhe: 1.772 ± 0.568
GlyGly: 4.431 ± 1.591
GlyHis: 0.886 ± 0.443
GlyIle: 4.431 ± 1.973
GlyLys: 3.545 ± 1.602
GlyLeu: 3.545 ± 1.61
GlyMet: 0.443 ± 0.354
GlyAsn: 2.215 ± 0.334
GlyPro: 2.215 ± 0.45
GlyGln: 2.658 ± 1.831
GlyArg: 4.874 ± 1.146
GlySer: 6.203 ± 2.377
GlyThr: 7.532 ± 1.565
GlyVal: 1.329 ± 0.385
GlyTrp: 0.0 ± 0.0
GlyTyr: 0.886 ± 0.478
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.215 ± 0.808
HisCys: 0.443 ± 0.354
HisAsp: 0.0 ± 0.0
HisGlu: 0.886 ± 0.523
HisPhe: 1.329 ± 0.47
HisGly: 0.443 ± 0.354
HisHis: 0.0 ± 0.0
HisIle: 1.772 ± 0.91
HisLys: 1.772 ± 0.582
HisLeu: 2.215 ± 0.974
HisMet: 0.443 ± 0.318
HisAsn: 1.329 ± 0.59
HisPro: 1.772 ± 0.886
HisGln: 0.443 ± 0.354
HisArg: 0.443 ± 0.448
HisSer: 1.772 ± 0.625
HisThr: 0.443 ± 0.448
HisVal: 0.886 ± 0.389
HisTrp: 0.886 ± 0.389
HisTyr: 0.443 ± 0.448
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.101 ± 1.04
IleCys: 0.443 ± 0.355
IleAsp: 3.988 ± 1.474
IleGlu: 3.101 ± 0.882
IlePhe: 1.329 ± 0.59
IleGly: 4.431 ± 2.101
IleHis: 1.329 ± 0.367
IleIle: 4.874 ± 1.208
IleLys: 0.886 ± 0.434
IleLeu: 3.545 ± 0.607
IleMet: 0.443 ± 0.448
IleAsn: 3.101 ± 0.547
IlePro: 4.874 ± 2.521
IleGln: 1.329 ± 0.426
IleArg: 1.772 ± 1.082
IleSer: 3.545 ± 0.913
IleThr: 4.431 ± 0.966
IleVal: 2.658 ± 0.764
IleTrp: 0.443 ± 0.448
IleTyr: 1.329 ± 0.611
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.101 ± 1.63
LysCys: 2.658 ± 1.187
LysAsp: 1.329 ± 0.654
LysGlu: 1.329 ± 0.693
LysPhe: 1.772 ± 1.087
LysGly: 2.658 ± 0.918
LysHis: 1.772 ± 1.257
LysIle: 2.215 ± 0.872
LysLys: 3.101 ± 0.615
LysLeu: 4.431 ± 1.08
LysMet: 0.886 ± 1.023
LysAsn: 3.101 ± 1.178
LysPro: 0.886 ± 0.389
LysGln: 4.874 ± 1.986
LysArg: 4.874 ± 1.345
LysSer: 4.431 ± 1.616
LysThr: 1.329 ± 0.654
LysVal: 2.658 ± 0.764
LysTrp: 1.772 ± 0.664
LysTyr: 1.329 ± 0.789
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.874 ± 0.976
LeuCys: 1.772 ± 1.141
LeuAsp: 5.317 ± 1.494
LeuGlu: 5.76 ± 1.424
LeuPhe: 6.203 ± 1.391
LeuGly: 5.76 ± 3.104
LeuHis: 1.329 ± 1.061
LeuIle: 3.101 ± 0.684
LeuLys: 7.975 ± 2.463
LeuLeu: 9.304 ± 3.399
LeuMet: 0.443 ± 0.434
LeuAsn: 3.545 ± 1.043
LeuPro: 4.874 ± 1.401
LeuGln: 7.532 ± 2.027
LeuArg: 3.988 ± 0.703
LeuSer: 7.089 ± 1.753
LeuThr: 3.545 ± 1.2
LeuVal: 5.76 ± 1.662
LeuTrp: 0.0 ± 0.0
LeuTyr: 3.101 ± 0.389
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.772 ± 0.799
MetCys: 0.443 ± 0.355
MetAsp: 0.0 ± 0.0
MetGlu: 0.886 ± 0.523
MetPhe: 0.886 ± 0.562
MetGly: 0.443 ± 0.354
MetHis: 0.0 ± 0.0
MetIle: 0.443 ± 0.354
MetLys: 0.443 ± 0.354
MetLeu: 0.443 ± 0.354
MetMet: 0.443 ± 0.347
MetAsn: 1.329 ± 0.703
MetPro: 0.443 ± 0.354
MetGln: 0.0 ± 0.0
MetArg: 0.443 ± 0.48
MetSer: 1.329 ± 0.426
MetThr: 1.329 ± 0.743
MetVal: 1.772 ± 0.582
MetTrp: 0.0 ± 0.0
MetTyr: 1.329 ± 0.426
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.658 ± 0.971
AsnCys: 1.329 ± 1.061
AsnAsp: 3.101 ± 0.913
AsnGlu: 1.772 ± 0.564
AsnPhe: 2.215 ± 1.019
AsnGly: 3.988 ± 1.061
AsnHis: 1.772 ± 1.257
AsnIle: 3.545 ± 1.607
AsnLys: 1.772 ± 0.204
AsnLeu: 4.431 ± 0.964
AsnMet: 0.0 ± 0.0
AsnAsn: 1.772 ± 0.568
AsnPro: 2.215 ± 1.174
AsnGln: 1.772 ± 0.577
AsnArg: 2.658 ± 0.879
AsnSer: 3.101 ± 0.617
AsnThr: 3.988 ± 1.411
AsnVal: 4.431 ± 1.434
AsnTrp: 1.329 ± 0.907
AsnTyr: 2.658 ± 0.443
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.658 ± 1.051
ProCys: 1.329 ± 1.083
ProAsp: 5.317 ± 2.041
ProGlu: 2.658 ± 0.771
ProPhe: 2.215 ± 1.01
ProGly: 1.772 ± 0.879
ProHis: 0.443 ± 0.388
ProIle: 3.545 ± 0.668
ProLys: 4.431 ± 0.986
ProLeu: 4.874 ± 0.501
ProMet: 0.443 ± 0.448
ProAsn: 1.329 ± 0.367
ProPro: 6.646 ± 1.646
ProGln: 3.101 ± 0.711
ProArg: 1.329 ± 0.665
ProSer: 5.317 ± 1.968
ProThr: 4.431 ± 1.573
ProVal: 2.658 ± 1.33
ProTrp: 0.0 ± 0.0
ProTyr: 1.772 ± 1.06
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.772 ± 0.625
GlnCys: 1.329 ± 1.115
GlnAsp: 3.101 ± 1.039
GlnGlu: 2.215 ± 1.236
GlnPhe: 2.215 ± 0.872
GlnGly: 3.101 ± 1.178
GlnHis: 0.886 ± 0.434
GlnIle: 1.329 ± 0.385
GlnLys: 0.443 ± 0.355
GlnLeu: 6.203 ± 1.452
GlnMet: 0.443 ± 0.42
GlnAsn: 2.658 ± 0.397
GlnPro: 1.329 ± 0.426
GlnGln: 3.545 ± 1.291
GlnArg: 1.329 ± 0.385
GlnSer: 3.545 ± 0.777
GlnThr: 3.545 ± 1.384
GlnVal: 4.431 ± 1.378
GlnTrp: 0.443 ± 0.354
GlnTyr: 1.772 ± 1.022
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.431 ± 0.512
ArgCys: 1.329 ± 0.59
ArgAsp: 2.658 ± 0.478
ArgGlu: 2.658 ± 0.972
ArgPhe: 3.988 ± 1.52
ArgGly: 4.431 ± 1.366
ArgHis: 1.772 ± 0.638
ArgIle: 1.772 ± 0.568
ArgLys: 4.431 ± 1.373
ArgLeu: 3.545 ± 1.221
ArgMet: 0.443 ± 0.354
ArgAsn: 2.215 ± 1.03
ArgPro: 3.101 ± 0.611
ArgGln: 1.772 ± 0.573
ArgArg: 8.861 ± 2.046
ArgSer: 2.658 ± 0.651
ArgThr: 2.215 ± 0.79
ArgVal: 3.988 ± 1.021
ArgTrp: 0.0 ± 0.0
ArgTyr: 0.886 ± 0.478
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.874 ± 1.296
SerCys: 0.443 ± 0.354
SerAsp: 3.988 ± 1.336
SerGlu: 5.317 ± 1.205
SerPhe: 3.545 ± 1.176
SerGly: 6.646 ± 1.404
SerHis: 1.329 ± 0.747
SerIle: 1.772 ± 1.114
SerLys: 3.101 ± 1.214
SerLeu: 10.191 ± 3.474
SerMet: 1.329 ± 0.682
SerAsn: 4.431 ± 2.081
SerPro: 2.658 ± 0.771
SerGln: 1.772 ± 0.981
SerArg: 6.203 ± 1.485
SerSer: 7.975 ± 1.527
SerThr: 4.874 ± 1.791
SerVal: 5.76 ± 2.161
SerTrp: 0.0 ± 0.0
SerTyr: 1.772 ± 0.568
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.658 ± 1.185
ThrCys: 0.886 ± 0.692
ThrAsp: 3.988 ± 0.737
ThrGlu: 3.101 ± 1.2
ThrPhe: 2.658 ± 0.933
ThrGly: 4.431 ± 0.891
ThrHis: 0.886 ± 0.389
ThrIle: 3.545 ± 2.127
ThrLys: 0.886 ± 0.562
ThrLeu: 6.646 ± 2.021
ThrMet: 1.329 ± 0.69
ThrAsn: 3.101 ± 0.504
ThrPro: 8.861 ± 1.314
ThrGln: 0.443 ± 0.355
ThrArg: 3.101 ± 0.765
ThrSer: 7.089 ± 1.827
ThrThr: 4.431 ± 1.844
ThrVal: 4.431 ± 0.901
ThrTrp: 0.886 ± 0.456
ThrTyr: 1.329 ± 0.682
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.545 ± 1.058
ValCys: 0.443 ± 0.354
ValAsp: 6.203 ± 2.126
ValGlu: 4.431 ± 1.873
ValPhe: 3.545 ± 1.058
ValGly: 4.431 ± 1.962
ValHis: 1.329 ± 0.385
ValIle: 3.545 ± 1.28
ValLys: 2.215 ± 0.741
ValLeu: 5.317 ± 1.175
ValMet: 0.886 ± 0.71
ValAsn: 3.101 ± 1.623
ValPro: 3.545 ± 1.4
ValGln: 3.545 ± 0.873
ValArg: 3.101 ± 0.504
ValSer: 7.089 ± 1.448
ValThr: 2.215 ± 0.334
ValVal: 4.431 ± 0.901
ValTrp: 1.329 ± 0.787
ValTyr: 1.772 ± 1.084
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.443 ± 0.354
TrpCys: 0.0 ± 0.0
TrpAsp: 0.443 ± 0.355
TrpGlu: 0.443 ± 0.448
TrpPhe: 0.886 ± 0.707
TrpGly: 0.443 ± 0.355
TrpHis: 0.443 ± 0.448
TrpIle: 1.329 ± 1.061
TrpLys: 1.329 ± 0.654
TrpLeu: 1.772 ± 0.577
TrpMet: 0.0 ± 0.0
TrpAsn: 0.886 ± 0.897
TrpPro: 0.443 ± 0.355
TrpGln: 0.443 ± 0.355
TrpArg: 1.329 ± 1.182
TrpSer: 0.443 ± 0.355
TrpThr: 1.329 ± 0.832
TrpVal: 0.443 ± 0.448
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.443 ± 0.448
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.658 ± 0.558
TyrCys: 0.443 ± 0.508
TyrAsp: 3.545 ± 1.079
TyrGlu: 1.329 ± 0.47
TyrPhe: 2.658 ± 1.073
TyrGly: 1.772 ± 0.683
TyrHis: 0.443 ± 0.48
TyrIle: 0.886 ± 0.707
TyrLys: 2.215 ± 0.682
TyrLeu: 3.101 ± 1.162
TyrMet: 0.443 ± 0.355
TyrAsn: 1.329 ± 0.426
TyrPro: 0.443 ± 0.388
TyrGln: 0.886 ± 0.523
TyrArg: 0.886 ± 0.71
TyrSer: 1.772 ± 0.715
TyrThr: 1.329 ± 0.747
TyrVal: 3.101 ± 1.637
TyrTrp: 0.886 ± 0.523
TyrTyr: 3.101 ± 1.502
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2258 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
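
A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is reported. The function names, the toy sequences, the fixed seed, and the use of the standard deviation as the reported error are all assumptions; the page only states that 100 bootstrap replicates at the protein level were used.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(sequences, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins (not residues) with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_permille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy sequences, not taken from the proteome described here.
toy_proteins = ["MKKLA", "MKA", "AKKM"]
kk_error = bootstrap_error(toy_proteins, "KK")
```

Resampling at the protein level keeps each protein intact, so the estimate reflects how much the statistic depends on which proteins happen to be in a small proteome.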

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski