Amino acid dipeptide frequency for Human papillomavirus type 26

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
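As a minimal sketch of how such a frequency could be computed (not necessarily the method used for this table), the Python function below counts overlapping adjacent residue pairs within each protein and reports them in per mille. The function name `dipeptide_per_mille`, the one-letter input alphabet, and the choice to map any non-standard letter to X are assumptions made for illustration.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # the 20 standard amino acids, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all given sequences.

    Pairs are overlapping and never span two proteins. Any letter outside
    the 20 standard ones is mapped to 'X' (assumption, mirroring Xaa).
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # zip pairs each residue with its successor: "ACD" -> AC, CD
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


freqs = dipeptide_per_mille(["MAAT", "AAK"])  # toy sequences, not real HPV26 data
print(freqs["AA"])  # 2 of the 5 pairs are AA -> 400.0
```

Counting pairs per protein, rather than on one concatenated string, avoids creating artificial dipeptides at the boundary between two proteins.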

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each one would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
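The uniform expectation and the red/blue marking can be illustrated with a short Python sketch. The names `expected_per_mille` and `classify` are hypothetical, and the plain comparison against the uniform value is an assumption; the page does not state the exact rule used for colouring.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(value_per_mille, alphabet_size=20):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    expected = expected_per_mille(alphabet_size)
    if value_per_mille > expected:
        return "over"   # would be marked red
    if value_per_mille < expected:
        return "under"  # would be marked blue
    return "expected"


print(expected_per_mille())  # 2.5 per mille, i.e. 0.25%
print(classify(10.71))       # SerThr value from the table -> over
print(classify(0.446))       # AlaAsn value from the table -> under
```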

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.016 ± 0.975
AlaCys: 1.339 ± 0.778
AlaAsp: 3.124 ± 0.66
AlaGlu: 3.57 ± 0.719
AlaPhe: 2.231 ± 1.652
AlaGly: 4.462 ± 1.381
AlaHis: 0.0 ± 0.0
AlaIle: 2.677 ± 0.73
AlaLys: 3.124 ± 1.546
AlaLeu: 4.462 ± 0.679
AlaMet: 0.892 ± 0.451
AlaAsn: 0.446 ± 0.33
AlaPro: 3.124 ± 1.8
AlaGln: 3.124 ± 1.161
AlaArg: 1.339 ± 0.695
AlaSer: 4.462 ± 1.892
AlaThr: 6.247 ± 0.756
AlaVal: 4.016 ± 0.865
AlaTrp: 0.892 ± 0.815
AlaTyr: 3.57 ± 1.317
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.231 ± 0.932
CysCys: 1.785 ± 0.735
CysAsp: 0.446 ± 0.716
CysGlu: 1.339 ± 0.895
CysPhe: 0.446 ± 0.407
CysGly: 0.446 ± 0.407
CysHis: 0.892 ± 1.045
CysIle: 1.785 ± 0.588
CysLys: 3.57 ± 1.505
CysLeu: 1.785 ± 0.921
CysMet: 1.339 ± 0.74
CysAsn: 2.231 ± 1.239
CysPro: 1.785 ± 0.298
CysGln: 2.231 ± 0.889
CysArg: 0.0 ± 0.0
CysSer: 2.677 ± 1.316
CysThr: 1.339 ± 0.694
CysVal: 2.231 ± 1.374
CysTrp: 1.339 ± 0.694
CysTyr: 0.892 ± 1.174
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.57 ± 1.382
AspCys: 1.785 ± 0.588
AspAsp: 2.677 ± 1.596
AspGlu: 2.677 ± 1.333
AspPhe: 1.339 ± 0.492
AspGly: 1.339 ± 0.391
AspHis: 0.446 ± 0.33
AspIle: 5.801 ± 2.247
AspLys: 2.231 ± 1.237
AspLeu: 4.016 ± 1.204
AspMet: 0.446 ± 0.405
AspAsn: 3.57 ± 0.94
AspPro: 3.124 ± 1.246
AspGln: 1.785 ± 0.962
AspArg: 0.892 ± 0.74
AspSer: 4.462 ± 1.956
AspThr: 7.14 ± 2.139
AspVal: 4.462 ± 1.144
AspTrp: 0.892 ± 0.451
AspTyr: 4.016 ± 1.884
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.57 ± 1.788
GluCys: 0.446 ± 0.407
GluAsp: 5.355 ± 1.894
GluGlu: 4.016 ± 1.135
GluPhe: 0.446 ± 0.407
GluGly: 2.677 ± 0.907
GluHis: 1.339 ± 0.391
GluIle: 3.57 ± 0.777
GluLys: 2.677 ± 1.45
GluLeu: 4.016 ± 0.583
GluMet: 0.446 ± 0.407
GluAsn: 2.677 ± 0.915
GluPro: 4.462 ± 1.546
GluGln: 2.231 ± 1.594
GluArg: 0.892 ± 0.719
GluSer: 1.785 ± 0.567
GluThr: 4.016 ± 1.71
GluVal: 2.231 ± 0.591
GluTrp: 0.446 ± 0.33
GluTyr: 1.339 ± 0.921
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.677 ± 0.681
PheCys: 0.0 ± 0.0
PheAsp: 3.57 ± 1.141
PheGlu: 1.339 ± 0.948
PhePhe: 1.785 ± 0.682
PheGly: 1.785 ± 1.174
PheHis: 0.446 ± 0.716
PheIle: 3.57 ± 0.975
PheLys: 4.016 ± 1.284
PheLeu: 4.016 ± 1.223
PheMet: 0.892 ± 0.451
PheAsn: 1.339 ± 0.813
PhePro: 1.339 ± 0.679
PheGln: 1.785 ± 0.903
PheArg: 0.446 ± 0.405
PheSer: 3.124 ± 0.982
PheThr: 1.339 ± 0.391
PheVal: 1.785 ± 0.733
PheTrp: 0.892 ± 0.451
PheTyr: 1.785 ± 0.686
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.124 ± 1.031
GlyCys: 0.892 ± 0.451
GlyAsp: 3.57 ± 0.901
GlyGlu: 2.231 ± 0.749
GlyPhe: 1.785 ± 0.636
GlyGly: 4.462 ± 2.086
GlyHis: 2.231 ± 1.083
GlyIle: 4.909 ± 1.912
GlyLys: 1.339 ± 0.431
GlyLeu: 4.462 ± 0.671
GlyMet: 0.0 ± 0.0
GlyAsn: 3.57 ± 0.541
GlyPro: 2.231 ± 0.85
GlyGln: 3.124 ± 1.156
GlyArg: 3.124 ± 0.929
GlySer: 4.909 ± 1.673
GlyThr: 6.693 ± 2.29
GlyVal: 2.231 ± 0.85
GlyTrp: 0.892 ± 0.418
GlyTyr: 1.339 ± 0.636
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.446 ± 0.407
HisAsp: 0.892 ± 0.411
HisGlu: 1.339 ± 1.408
HisPhe: 0.892 ± 0.451
HisGly: 1.339 ± 0.685
HisHis: 0.446 ± 0.407
HisIle: 1.339 ± 0.431
HisLys: 0.892 ± 0.518
HisLeu: 1.785 ± 1.183
HisMet: 0.0 ± 0.0
HisAsn: 0.892 ± 0.451
HisPro: 1.339 ± 0.813
HisGln: 1.339 ± 0.655
HisArg: 1.785 ± 0.812
HisSer: 2.231 ± 0.736
HisThr: 1.339 ± 0.729
HisVal: 1.785 ± 0.566
HisTrp: 0.892 ± 0.499
HisTyr: 0.892 ± 0.411
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.231 ± 0.761
IleCys: 2.231 ± 0.807
IleAsp: 5.355 ± 0.999
IleGlu: 4.462 ± 1.594
IlePhe: 2.231 ± 0.821
IleGly: 4.016 ± 1.318
IleHis: 1.785 ± 0.733
IleIle: 3.57 ± 1.407
IleLys: 2.231 ± 1.105
IleLeu: 1.785 ± 0.729
IleMet: 0.446 ± 0.407
IleAsn: 2.231 ± 0.934
IlePro: 4.909 ± 1.956
IleGln: 3.124 ± 1.307
IleArg: 1.339 ± 0.778
IleSer: 4.462 ± 1.01
IleThr: 4.016 ± 0.967
IleVal: 4.462 ± 1.252
IleTrp: 0.0 ± 0.0
IleTyr: 3.57 ± 0.788
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.016 ± 0.781
LysCys: 1.339 ± 0.634
LysAsp: 0.446 ± 0.33
LysGlu: 3.57 ± 1.129
LysPhe: 3.57 ± 2.01
LysGly: 3.57 ± 1.439
LysHis: 2.231 ± 1.022
LysIle: 3.124 ± 0.76
LysLys: 1.785 ± 1.013
LysLeu: 3.124 ± 1.128
LysMet: 1.339 ± 0.75
LysAsn: 1.785 ± 0.903
LysPro: 0.892 ± 0.81
LysGln: 4.462 ± 0.84
LysArg: 5.801 ± 0.936
LysSer: 3.124 ± 1.59
LysThr: 3.57 ± 0.69
LysVal: 1.785 ± 0.682
LysTrp: 0.0 ± 0.0
LysTyr: 2.677 ± 0.915
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 1.785 ± 0.547
LeuCys: 3.124 ± 1.993
LeuAsp: 4.909 ± 0.915
LeuGlu: 4.016 ± 0.976
LeuPhe: 2.677 ± 1.44
LeuGly: 5.801 ± 1.716
LeuHis: 2.231 ± 0.85
LeuIle: 3.57 ± 0.944
LeuLys: 4.016 ± 1.423
LeuLeu: 4.462 ± 2.521
LeuMet: 2.231 ± 0.595
LeuAsn: 1.339 ± 0.785
LeuPro: 4.462 ± 1.559
LeuGln: 6.693 ± 1.715
LeuArg: 3.124 ± 0.987
LeuSer: 3.57 ± 0.975
LeuThr: 6.247 ± 1.025
LeuVal: 3.124 ± 0.978
LeuTrp: 0.892 ± 0.453
LeuTyr: 4.462 ± 0.949
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.892 ± 0.81
MetCys: 0.892 ± 0.58
MetAsp: 1.339 ± 0.434
MetGlu: 1.339 ± 0.834
MetPhe: 0.892 ± 0.719
MetGly: 0.0 ± 0.0
MetHis: 0.446 ± 0.587
MetIle: 0.446 ± 0.405
MetLys: 0.446 ± 0.389
MetLeu: 1.785 ± 0.659
MetMet: 0.0 ± 0.0
MetAsn: 0.446 ± 0.405
MetPro: 0.892 ± 0.744
MetGln: 0.892 ± 0.418
MetArg: 0.892 ± 0.733
MetSer: 2.231 ± 0.393
MetThr: 0.892 ± 0.418
MetVal: 1.339 ± 0.391
MetTrp: 0.0 ± 0.0
MetTyr: 0.446 ± 0.407
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.124 ± 1.128
AsnCys: 1.785 ± 1.128
AsnAsp: 0.446 ± 0.407
AsnGlu: 0.446 ± 0.33
AsnPhe: 2.677 ± 0.503
AsnGly: 2.677 ± 0.878
AsnHis: 0.446 ± 0.407
AsnIle: 2.231 ± 1.195
AsnLys: 3.124 ± 1.989
AsnLeu: 3.124 ± 1.317
AsnMet: 0.892 ± 0.733
AsnAsn: 2.231 ± 1.074
AsnPro: 3.124 ± 1.069
AsnGln: 1.785 ± 0.71
AsnArg: 0.892 ± 0.451
AsnSer: 5.801 ± 1.818
AsnThr: 5.355 ± 1.113
AsnVal: 3.124 ± 0.616
AsnTrp: 0.892 ± 0.661
AsnTyr: 0.446 ± 0.389
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.231 ± 1.471
ProCys: 0.446 ± 0.33
ProAsp: 4.462 ± 1.515
ProGlu: 2.231 ± 1.07
ProPhe: 1.785 ± 0.903
ProGly: 2.231 ± 1.102
ProHis: 0.0 ± 0.0
ProIle: 3.57 ± 2.611
ProLys: 3.57 ± 1.275
ProLeu: 7.14 ± 1.825
ProMet: 0.446 ± 0.389
ProAsn: 1.785 ± 0.565
ProPro: 6.693 ± 2.338
ProGln: 2.231 ± 0.512
ProArg: 3.124 ± 2.392
ProSer: 4.462 ± 1.953
ProThr: 4.909 ± 2.639
ProVal: 4.016 ± 0.781
ProTrp: 0.0 ± 0.0
ProTyr: 2.677 ± 0.84
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.909 ± 1.2
GlnCys: 2.231 ± 1.512
GlnAsp: 0.446 ± 0.33
GlnGlu: 2.231 ± 0.929
GlnPhe: 2.677 ± 1.093
GlnGly: 2.231 ± 0.75
GlnHis: 0.892 ± 0.661
GlnIle: 1.785 ± 0.731
GlnLys: 2.677 ± 0.562
GlnLeu: 4.909 ± 2.159
GlnMet: 0.892 ± 1.494
GlnAsn: 2.231 ± 1.031
GlnPro: 2.231 ± 0.512
GlnGln: 3.124 ± 1.099
GlnArg: 3.57 ± 1.423
GlnSer: 3.57 ± 1.16
GlnThr: 2.677 ± 0.944
GlnVal: 4.016 ± 1.596
GlnTrp: 2.677 ± 1.365
GlnTyr: 2.231 ± 0.85
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.231 ± 0.692
ArgCys: 2.231 ± 2.331
ArgAsp: 2.231 ± 1.234
ArgGlu: 1.785 ± 0.926
ArgPhe: 2.231 ± 0.744
ArgGly: 1.339 ± 0.791
ArgHis: 1.339 ± 0.791
ArgIle: 0.892 ± 1.045
ArgLys: 3.57 ± 1.147
ArgLeu: 3.57 ± 0.596
ArgMet: 0.892 ± 0.778
ArgAsn: 1.785 ± 0.588
ArgPro: 3.124 ± 1.452
ArgGln: 2.231 ± 1.478
ArgArg: 4.462 ± 1.829
ArgSer: 2.677 ± 0.851
ArgThr: 2.231 ± 0.688
ArgVal: 3.57 ± 0.947
ArgTrp: 0.446 ± 0.407
ArgTyr: 2.231 ± 1.173
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.909 ± 2.528
SerCys: 0.446 ± 0.33
SerAsp: 6.693 ± 1.619
SerGlu: 0.892 ± 0.661
SerPhe: 2.677 ± 1.272
SerGly: 5.801 ± 2.266
SerHis: 0.892 ± 0.451
SerIle: 6.247 ± 1.446
SerLys: 2.677 ± 0.851
SerLeu: 4.909 ± 1.568
SerMet: 1.339 ± 0.791
SerAsn: 5.801 ± 2.377
SerPro: 2.231 ± 1.143
SerGln: 3.124 ± 1.918
SerArg: 3.124 ± 0.749
SerSer: 9.817 ± 2.678
SerThr: 10.71 ± 3.337
SerVal: 4.909 ± 1.867
SerTrp: 0.446 ± 0.33
SerTyr: 1.785 ± 0.706
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.124 ± 1.013
ThrCys: 4.909 ± 0.735
ThrAsp: 5.355 ± 0.754
ThrGlu: 4.462 ± 1.717
ThrPhe: 2.677 ± 1.033
ThrGly: 6.247 ± 1.099
ThrHis: 1.785 ± 0.821
ThrIle: 2.231 ± 0.786
ThrLys: 3.124 ± 1.414
ThrLeu: 6.693 ± 2.478
ThrMet: 1.785 ± 0.731
ThrAsn: 4.909 ± 1.244
ThrPro: 5.355 ± 1.725
ThrGln: 5.355 ± 1.489
ThrArg: 2.677 ± 0.782
ThrSer: 8.032 ± 2.822
ThrThr: 9.817 ± 2.869
ThrVal: 7.586 ± 3.189
ThrTrp: 1.339 ± 0.431
ThrTyr: 1.785 ± 0.565
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.124 ± 1.416
ValCys: 2.677 ± 1.31
ValAsp: 4.909 ± 0.897
ValGlu: 2.677 ± 1.143
ValPhe: 2.677 ± 0.715
ValGly: 2.231 ± 1.083
ValHis: 2.677 ± 1.186
ValIle: 3.57 ± 1.246
ValLys: 1.785 ± 0.566
ValLeu: 2.231 ± 0.954
ValMet: 0.892 ± 0.616
ValAsn: 3.124 ± 1.64
ValPro: 4.909 ± 0.695
ValGln: 2.677 ± 1.485
ValArg: 3.57 ± 1.141
ValSer: 4.016 ± 1.173
ValThr: 7.586 ± 1.545
ValVal: 3.124 ± 1.285
ValTrp: 0.446 ± 0.405
ValTyr: 4.016 ± 2.588
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.785 ± 0.686
TrpCys: 0.446 ± 0.407
TrpAsp: 0.446 ± 0.405
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.446 ± 0.33
TrpGly: 1.339 ± 0.791
TrpHis: 0.446 ± 0.407
TrpIle: 0.446 ± 0.33
TrpLys: 2.231 ± 1.035
TrpLeu: 1.785 ± 0.968
TrpMet: 0.0 ± 0.0
TrpAsn: 0.446 ± 0.405
TrpPro: 0.892 ± 0.778
TrpGln: 0.446 ± 0.407
TrpArg: 0.892 ± 0.719
TrpSer: 0.892 ± 0.411
TrpThr: 0.892 ± 0.815
TrpVal: 0.446 ± 0.33
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.57 ± 0.971
TyrCys: 1.339 ± 0.785
TyrAsp: 0.0 ± 0.0
TyrGlu: 4.016 ± 1.522
TyrPhe: 1.785 ± 0.565
TyrGly: 3.124 ± 1.173
TyrHis: 0.892 ± 0.411
TyrIle: 3.124 ± 0.9
TyrLys: 3.124 ± 0.906
TyrLeu: 2.677 ± 0.782
TyrMet: 0.892 ± 0.492
TyrAsn: 1.785 ± 0.926
TyrPro: 0.892 ± 0.499
TyrGln: 0.892 ± 0.453
TyrArg: 3.124 ± 1.5
TyrSer: 3.124 ± 1.288
TyrThr: 2.231 ± 0.933
TyrVal: 2.677 ± 1.053
TyrTrp: 0.892 ± 0.499
TyrTyr: 4.016 ± 1.35
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2242 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
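A protein-level bootstrap can be sketched as follows: proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread across resamples serves as the error estimate. This is an illustration under assumptions, not the database's actual code; the function names, the reading of "x100" as 100 resamples, the fixed random seed, and the use of the standard deviation as the reported error are all hypothetical.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (two one-letter codes) in per mille."""
    hits = total = 0
    for seq in proteins:
        for a, b in zip(seq, seq[1:]):  # overlapping pairs within one protein
            total += 1
            hits += (a + b == pair)
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # seeded so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # draw as many proteins as in the original set, with replacement
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(estimates)


toy = ["MSTTA", "MAST", "STSTK"]  # toy sequences, not real HPV26 proteins
print(pair_per_mille(toy, "ST"), "±", bootstrap_error(toy, "ST"))
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, which is why a proteome with only a few proteins can show comparatively large errors.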

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski