Amino acid dipepetide frequency for human papillomavirus 86

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.665AlaAla: 4.665 ± 1.475
1.696AlaCys: 1.696 ± 0.993
3.393AlaAsp: 3.393 ± 1.308
4.241AlaGlu: 4.241 ± 1.195
1.696AlaPhe: 1.696 ± 0.599
3.817AlaGly: 3.817 ± 0.976
1.272AlaHis: 1.272 ± 0.735
2.969AlaIle: 2.969 ± 0.739
2.969AlaLys: 2.969 ± 1.206
3.393AlaLeu: 3.393 ± 1.163
2.12AlaMet: 2.12 ± 0.971
0.848AlaAsn: 0.848 ± 0.407
4.665AlaPro: 4.665 ± 1.174
2.545AlaGln: 2.545 ± 1.161
5.089AlaArg: 5.089 ± 1.103
6.785AlaSer: 6.785 ± 1.046
7.209AlaThr: 7.209 ± 2.094
2.12AlaVal: 2.12 ± 0.925
0.424AlaTrp: 0.424 ± 0.326
1.696AlaTyr: 1.696 ± 1.29
0.0AlaXaa: 0.0 ± 0.0
Cys
2.12CysAla: 2.12 ± 0.966
0.0CysCys: 0.0 ± 0.0
1.272CysAsp: 1.272 ± 1.453
0.848CysGlu: 0.848 ± 0.715
0.424CysPhe: 0.424 ± 0.535
1.696CysGly: 1.696 ± 1.572
0.848CysHis: 0.848 ± 1.071
1.272CysIle: 1.272 ± 1.141
2.12CysLys: 2.12 ± 1.122
2.969CysLeu: 2.969 ± 0.87
1.272CysMet: 1.272 ± 0.58
0.424CysAsn: 0.424 ± 0.33
2.12CysPro: 2.12 ± 0.703
1.696CysGln: 1.696 ± 0.568
2.545CysArg: 2.545 ± 2.261
0.848CysSer: 0.848 ± 0.629
0.424CysThr: 0.424 ± 0.806
0.424CysVal: 0.424 ± 0.33
1.696CysTrp: 1.696 ± 0.568
0.424CysTyr: 0.424 ± 0.437
0.0CysXaa: 0.0 ± 0.0
Asp
3.817AspAla: 3.817 ± 1.601
1.272AspCys: 1.272 ± 0.943
2.969AspAsp: 2.969 ± 0.997
2.545AspGlu: 2.545 ± 0.509
2.12AspPhe: 2.12 ± 0.478
3.817AspGly: 3.817 ± 0.947
0.424AspHis: 0.424 ± 0.326
4.241AspIle: 4.241 ± 1.802
2.969AspLys: 2.969 ± 1.589
4.241AspLeu: 4.241 ± 0.675
1.272AspMet: 1.272 ± 0.412
3.393AspAsn: 3.393 ± 0.67
3.393AspPro: 3.393 ± 1.017
1.272AspGln: 1.272 ± 0.77
2.12AspArg: 2.12 ± 0.445
6.361AspSer: 6.361 ± 1.156
7.634AspThr: 7.634 ± 1.32
2.545AspVal: 2.545 ± 1.07
0.848AspTrp: 0.848 ± 0.661
1.272AspTyr: 1.272 ± 0.707
0.0AspXaa: 0.0 ± 0.0
Glu
4.241GluAla: 4.241 ± 1.646
0.848GluCys: 0.848 ± 1.247
5.089GluAsp: 5.089 ± 1.127
5.937GluGlu: 5.937 ± 2.862
0.848GluPhe: 0.848 ± 0.428
2.12GluGly: 2.12 ± 0.626
2.545GluHis: 2.545 ± 0.823
2.12GluIle: 2.12 ± 0.783
2.12GluLys: 2.12 ± 1.272
4.241GluLeu: 4.241 ± 2.482
0.424GluMet: 0.424 ± 0.437
1.696GluAsn: 1.696 ± 1.039
4.665GluPro: 4.665 ± 0.939
2.545GluGln: 2.545 ± 0.828
1.272GluArg: 1.272 ± 0.412
1.696GluSer: 1.696 ± 0.941
2.969GluThr: 2.969 ± 0.778
3.817GluVal: 3.817 ± 0.744
1.272GluTrp: 1.272 ± 0.645
2.969GluTyr: 2.969 ± 0.968
0.0GluXaa: 0.0 ± 0.0
Phe
1.272PheAla: 1.272 ± 0.616
1.272PheCys: 1.272 ± 0.58
1.272PheAsp: 1.272 ± 0.616
1.696PheGlu: 1.696 ± 0.673
2.12PhePhe: 2.12 ± 0.712
2.969PheGly: 2.969 ± 0.719
0.424PheHis: 0.424 ± 0.535
1.272PheIle: 1.272 ± 0.412
3.393PheLys: 3.393 ± 1.453
3.817PheLeu: 3.817 ± 0.847
0.848PheMet: 0.848 ± 0.661
1.696PheAsn: 1.696 ± 1.461
1.696PhePro: 1.696 ± 0.507
1.272PheGln: 1.272 ± 0.645
0.848PheArg: 0.848 ± 0.407
0.848PheSer: 0.848 ± 0.391
2.969PheThr: 2.969 ± 1.471
3.817PheVal: 3.817 ± 0.707
0.848PheTrp: 0.848 ± 0.407
1.272PheTyr: 1.272 ± 0.373
0.0PheXaa: 0.0 ± 0.0
Gly
5.089GlyAla: 5.089 ± 0.937
2.12GlyCys: 2.12 ± 1.21
5.513GlyAsp: 5.513 ± 1.238
3.817GlyGlu: 3.817 ± 1.311
1.696GlyPhe: 1.696 ± 0.867
7.634GlyGly: 7.634 ± 2.398
3.393GlyHis: 3.393 ± 0.862
2.12GlyIle: 2.12 ± 1.215
2.969GlyLys: 2.969 ± 0.82
3.393GlyLeu: 3.393 ± 0.596
0.848GlyMet: 0.848 ± 0.407
1.696GlyAsn: 1.696 ± 1.322
2.545GlyPro: 2.545 ± 0.887
3.817GlyGln: 3.817 ± 1.011
2.969GlyArg: 2.969 ± 0.824
5.513GlySer: 5.513 ± 1.276
6.361GlyThr: 6.361 ± 1.925
4.665GlyVal: 4.665 ± 1.293
0.424GlyTrp: 0.424 ± 0.33
2.12GlyTyr: 2.12 ± 0.709
0.0GlyXaa: 0.0 ± 0.0
His
3.393HisAla: 3.393 ± 0.855
0.424HisCys: 0.424 ± 0.73
0.848HisAsp: 0.848 ± 0.391
0.848HisGlu: 0.848 ± 0.532
1.696HisPhe: 1.696 ± 0.507
1.696HisGly: 1.696 ± 1.492
0.848HisHis: 0.848 ± 0.428
1.696HisIle: 1.696 ± 0.599
1.696HisLys: 1.696 ± 0.911
2.12HisLeu: 2.12 ± 1.157
0.424HisMet: 0.424 ± 0.535
0.848HisAsn: 0.848 ± 0.407
1.272HisPro: 1.272 ± 0.707
0.424HisGln: 0.424 ± 0.437
0.848HisArg: 0.848 ± 0.391
2.545HisSer: 2.545 ± 2.334
1.696HisThr: 1.696 ± 0.684
1.696HisVal: 1.696 ± 0.358
1.272HisTrp: 1.272 ± 0.707
1.272HisTyr: 1.272 ± 0.46
0.0HisXaa: 0.0 ± 0.0
Ile
2.12IleAla: 2.12 ± 0.781
1.696IleCys: 1.696 ± 0.797
1.272IleAsp: 1.272 ± 0.646
2.545IleGlu: 2.545 ± 1.173
2.12IlePhe: 2.12 ± 1.144
3.393IleGly: 3.393 ± 0.889
0.848IleHis: 0.848 ± 0.875
1.272IleIle: 1.272 ± 0.766
0.848IleLys: 0.848 ± 0.715
1.696IleLeu: 1.696 ± 0.673
1.272IleMet: 1.272 ± 0.845
0.848IleAsn: 0.848 ± 0.653
2.969IlePro: 2.969 ± 0.854
1.696IleGln: 1.696 ± 0.358
0.848IleArg: 0.848 ± 0.678
2.545IleSer: 2.545 ± 0.673
3.817IleThr: 3.817 ± 0.904
2.969IleVal: 2.969 ± 0.596
0.0IleTrp: 0.0 ± 0.0
1.696IleTyr: 1.696 ± 1.063
0.0IleXaa: 0.0 ± 0.0
Lys
3.393LysAla: 3.393 ± 1.269
2.969LysCys: 2.969 ± 1.358
2.12LysAsp: 2.12 ± 0.613
2.969LysGlu: 2.969 ± 1.118
2.12LysPhe: 2.12 ± 1.084
3.393LysGly: 3.393 ± 1.685
2.12LysHis: 2.12 ± 0.871
1.696LysIle: 1.696 ± 0.911
2.12LysLys: 2.12 ± 0.861
3.817LysLeu: 3.817 ± 1.156
0.424LysMet: 0.424 ± 0.336
1.272LysAsn: 1.272 ± 0.6
2.12LysPro: 2.12 ± 1.289
2.12LysGln: 2.12 ± 0.757
3.817LysArg: 3.817 ± 0.865
2.969LysSer: 2.969 ± 1.483
1.696LysThr: 1.696 ± 0.551
3.817LysVal: 3.817 ± 1.189
0.424LysTrp: 0.424 ± 0.326
1.272LysTyr: 1.272 ± 0.412
0.0LysXaa: 0.0 ± 0.0
Leu
3.393LeuAla: 3.393 ± 1.043
2.545LeuCys: 2.545 ± 2.467
6.785LeuAsp: 6.785 ± 1.023
2.12LeuGlu: 2.12 ± 1.356
4.665LeuPhe: 4.665 ± 0.945
4.665LeuGly: 4.665 ± 1.149
3.817LeuHis: 3.817 ± 1.709
0.848LeuIle: 0.848 ± 0.407
5.513LeuLys: 5.513 ± 1.179
9.33LeuLeu: 9.33 ± 2.972
1.272LeuMet: 1.272 ± 0.814
2.545LeuAsn: 2.545 ± 0.999
2.969LeuPro: 2.969 ± 1.21
8.058LeuGln: 8.058 ± 1.483
5.513LeuArg: 5.513 ± 1.775
6.785LeuSer: 6.785 ± 1.307
4.241LeuThr: 4.241 ± 1.058
3.817LeuVal: 3.817 ± 1.603
0.848LeuTrp: 0.848 ± 0.505
4.665LeuTyr: 4.665 ± 0.965
0.0LeuXaa: 0.0 ± 0.0
Met
1.696MetAla: 1.696 ± 0.797
0.424MetCys: 0.424 ± 0.437
0.848MetAsp: 0.848 ± 0.505
0.848MetGlu: 0.848 ± 0.428
0.848MetPhe: 0.848 ± 0.731
1.696MetGly: 1.696 ± 0.941
0.848MetHis: 0.848 ± 1.247
0.424MetIle: 0.424 ± 0.33
0.0MetLys: 0.0 ± 0.0
0.848MetLeu: 0.848 ± 0.661
0.0MetMet: 0.0 ± 0.0
0.424MetAsn: 0.424 ± 0.365
0.848MetPro: 0.848 ± 0.584
0.848MetGln: 0.848 ± 0.391
0.424MetArg: 0.424 ± 0.326
2.545MetSer: 2.545 ± 1.107
0.0MetThr: 0.0 ± 0.0
4.665MetVal: 4.665 ± 1.379
1.272MetTrp: 1.272 ± 1.097
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
1.272AsnAla: 1.272 ± 0.646
0.848AsnCys: 0.848 ± 0.551
0.0AsnAsp: 0.0 ± 0.0
2.969AsnGlu: 2.969 ± 1.239
1.272AsnPhe: 1.272 ± 0.707
0.848AsnGly: 0.848 ± 0.661
0.0AsnHis: 0.0 ± 0.0
2.545AsnIle: 2.545 ± 0.999
2.12AsnLys: 2.12 ± 1.392
2.545AsnLeu: 2.545 ± 0.877
0.424AsnMet: 0.424 ± 0.437
2.969AsnAsn: 2.969 ± 0.915
1.696AsnPro: 1.696 ± 0.714
0.424AsnGln: 0.424 ± 0.365
1.696AsnArg: 1.696 ± 0.913
2.969AsnSer: 2.969 ± 1.124
2.545AsnThr: 2.545 ± 1.107
2.545AsnVal: 2.545 ± 0.873
0.424AsnTrp: 0.424 ± 0.33
0.424AsnTyr: 0.424 ± 0.326
0.0AsnXaa: 0.0 ± 0.0
Pro
4.241ProAla: 4.241 ± 1.65
0.424ProCys: 0.424 ± 0.33
6.361ProAsp: 6.361 ± 1.167
2.12ProGlu: 2.12 ± 0.847
0.424ProPhe: 0.424 ± 0.33
4.241ProGly: 4.241 ± 0.885
0.848ProHis: 0.848 ± 0.551
2.12ProIle: 2.12 ± 1.278
3.817ProLys: 3.817 ± 0.864
8.482ProLeu: 8.482 ± 1.774
0.424ProMet: 0.424 ± 0.546
2.12ProAsn: 2.12 ± 0.547
8.482ProPro: 8.482 ± 5.83
2.12ProGln: 2.12 ± 1.28
4.241ProArg: 4.241 ± 2.105
5.513ProSer: 5.513 ± 2.449
3.817ProThr: 3.817 ± 1.392
5.513ProVal: 5.513 ± 1.472
0.424ProTrp: 0.424 ± 0.437
2.12ProTyr: 2.12 ± 1.4
0.0ProXaa: 0.0 ± 0.0
Gln
3.393GlnAla: 3.393 ± 1.159
0.424GlnCys: 0.424 ± 0.33
3.393GlnAsp: 3.393 ± 0.723
2.545GlnGlu: 2.545 ± 0.823
2.969GlnPhe: 2.969 ± 1.059
3.393GlnGly: 3.393 ± 0.815
0.0GlnHis: 0.0 ± 0.0
0.0GlnIle: 0.0 ± 0.0
2.12GlnLys: 2.12 ± 1.084
5.513GlnLeu: 5.513 ± 2.876
0.848GlnMet: 0.848 ± 0.731
0.848GlnAsn: 0.848 ± 0.505
2.969GlnPro: 2.969 ± 0.881
4.241GlnGln: 4.241 ± 1.405
2.545GlnArg: 2.545 ± 0.965
2.545GlnSer: 2.545 ± 1.743
2.545GlnThr: 2.545 ± 0.877
3.817GlnVal: 3.817 ± 0.99
1.696GlnTrp: 1.696 ± 0.691
1.272GlnTyr: 1.272 ± 0.872
0.0GlnXaa: 0.0 ± 0.0
Arg
3.393ArgAla: 3.393 ± 1.002
2.12ArgCys: 2.12 ± 2.038
1.696ArgAsp: 1.696 ± 0.995
2.545ArgGlu: 2.545 ± 1.165
1.272ArgPhe: 1.272 ± 0.645
2.969ArgGly: 2.969 ± 1.178
2.545ArgHis: 2.545 ± 1.009
0.848ArgIle: 0.848 ± 0.653
4.665ArgLys: 4.665 ± 0.788
7.209ArgLeu: 7.209 ± 1.587
1.272ArgMet: 1.272 ± 0.991
0.0ArgAsn: 0.0 ± 0.0
4.665ArgPro: 4.665 ± 1.301
2.969ArgGln: 2.969 ± 1.041
5.513ArgArg: 5.513 ± 1.881
3.393ArgSer: 3.393 ± 1.461
2.545ArgThr: 2.545 ± 0.827
3.817ArgVal: 3.817 ± 0.951
0.424ArgTrp: 0.424 ± 0.437
2.969ArgTyr: 2.969 ± 0.844
0.0ArgXaa: 0.0 ± 0.0
Ser
3.817SerAla: 3.817 ± 1.411
1.272SerCys: 1.272 ± 0.627
3.817SerAsp: 3.817 ± 0.821
4.665SerGlu: 4.665 ± 1.733
4.665SerPhe: 4.665 ± 1.158
7.209SerGly: 7.209 ± 2.576
2.12SerHis: 2.12 ± 0.918
2.545SerIle: 2.545 ± 0.914
1.272SerLys: 1.272 ± 1.096
6.361SerLeu: 6.361 ± 1.371
2.545SerMet: 2.545 ± 0.642
2.969SerAsn: 2.969 ± 0.87
3.393SerPro: 3.393 ± 1.247
1.696SerGln: 1.696 ± 0.938
5.937SerArg: 5.937 ± 1.258
11.874SerSer: 11.874 ± 3.26
8.906SerThr: 8.906 ± 2.456
3.393SerVal: 3.393 ± 1.322
1.696SerTrp: 1.696 ± 0.934
1.696SerTyr: 1.696 ± 0.828
0.0SerXaa: 0.0 ± 0.0
Thr
4.665ThrAla: 4.665 ± 1.006
2.545ThrCys: 2.545 ± 0.59
2.969ThrAsp: 2.969 ± 0.906
3.817ThrGlu: 3.817 ± 1.021
1.272ThrPhe: 1.272 ± 0.64
5.513ThrGly: 5.513 ± 1.859
1.272ThrHis: 1.272 ± 0.766
2.545ThrIle: 2.545 ± 0.644
1.696ThrLys: 1.696 ± 0.746
5.513ThrLeu: 5.513 ± 2.589
2.12ThrMet: 2.12 ± 1.815
2.12ThrAsn: 2.12 ± 1.096
7.209ThrPro: 7.209 ± 1.234
5.089ThrGln: 5.089 ± 1.508
2.545ThrArg: 2.545 ± 0.553
6.785ThrSer: 6.785 ± 1.648
5.089ThrThr: 5.089 ± 1.655
7.634ThrVal: 7.634 ± 2.708
0.848ThrTrp: 0.848 ± 0.875
2.12ThrTyr: 2.12 ± 0.712
0.0ThrXaa: 0.0 ± 0.0
Val
2.969ValAla: 2.969 ± 1.368
2.12ValCys: 2.12 ± 1.052
5.937ValAsp: 5.937 ± 1.01
2.545ValGlu: 2.545 ± 1.016
2.545ValPhe: 2.545 ± 0.912
3.393ValGly: 3.393 ± 1.199
2.545ValHis: 2.545 ± 1.042
2.545ValIle: 2.545 ± 0.916
1.696ValLys: 1.696 ± 0.808
4.241ValLeu: 4.241 ± 1.747
0.848ValMet: 0.848 ± 0.407
2.12ValAsn: 2.12 ± 0.923
8.058ValPro: 8.058 ± 2.545
3.393ValGln: 3.393 ± 0.988
3.817ValArg: 3.817 ± 0.929
7.209ValSer: 7.209 ± 2.328
5.513ValThr: 5.513 ± 1.508
5.513ValVal: 5.513 ± 1.249
0.848ValTrp: 0.848 ± 0.629
2.545ValTyr: 2.545 ± 0.912
0.0ValXaa: 0.0 ± 0.0
Trp
1.696TrpAla: 1.696 ± 0.571
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
1.696TrpGlu: 1.696 ± 0.939
0.848TrpPhe: 0.848 ± 0.407
1.272TrpGly: 1.272 ± 0.412
0.424TrpHis: 0.424 ± 0.437
0.848TrpIle: 0.848 ± 0.661
1.272TrpLys: 1.272 ± 0.6
0.848TrpLeu: 0.848 ± 0.407
0.424TrpMet: 0.424 ± 0.338
0.848TrpAsn: 0.848 ± 0.505
0.424TrpPro: 0.424 ± 0.535
0.424TrpGln: 0.424 ± 0.437
1.272TrpArg: 1.272 ± 0.412
0.424TrpSer: 0.424 ± 0.437
1.696TrpThr: 1.696 ± 1.189
1.272TrpVal: 1.272 ± 0.991
0.0TrpTrp: 0.0 ± 0.0
0.848TrpTyr: 0.848 ± 0.428
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.969TyrAla: 2.969 ± 0.906
0.424TyrCys: 0.424 ± 0.437
2.969TyrAsp: 2.969 ± 0.852
2.12TyrGlu: 2.12 ± 0.871
0.0TyrPhe: 0.0 ± 0.0
2.969TyrGly: 2.969 ± 0.968
0.424TyrHis: 0.424 ± 0.326
2.545TyrIle: 2.545 ± 0.964
1.272TyrLys: 1.272 ± 0.645
3.817TyrLeu: 3.817 ± 1.652
0.0TyrMet: 0.0 ± 0.0
0.848TyrAsn: 0.848 ± 0.731
2.12TyrPro: 2.12 ± 1.594
0.424TyrGln: 0.424 ± 0.365
2.969TyrArg: 2.969 ± 0.781
1.696TyrSer: 1.696 ± 1.145
1.696TyrThr: 1.696 ± 0.808
2.545TyrVal: 2.545 ± 0.784
0.848TyrTrp: 0.848 ± 0.407
2.969TyrTyr: 2.969 ± 0.997
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2359 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski