Amino acid dipepetide frequency for Human papillomavirus 201

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.049AlaAla: 4.049 ± 1.636
0.405AlaCys: 0.405 ± 0.557
5.263AlaAsp: 5.263 ± 1.64
2.834AlaGlu: 2.834 ± 0.762
3.644AlaPhe: 3.644 ± 1.148
2.429AlaGly: 2.429 ± 0.827
0.81AlaHis: 0.81 ± 0.45
2.834AlaIle: 2.834 ± 0.823
2.834AlaLys: 2.834 ± 1.174
6.073AlaLeu: 6.073 ± 1.54
0.81AlaMet: 0.81 ± 0.423
0.81AlaAsn: 0.81 ± 0.413
2.834AlaPro: 2.834 ± 1.017
2.429AlaGln: 2.429 ± 0.724
2.834AlaArg: 2.834 ± 0.944
3.239AlaSer: 3.239 ± 1.11
4.858AlaThr: 4.858 ± 1.308
2.024AlaVal: 2.024 ± 0.547
0.405AlaTrp: 0.405 ± 0.329
1.215AlaTyr: 1.215 ± 0.637
0.0AlaXaa: 0.0 ± 0.0
Cys
0.405CysAla: 0.405 ± 0.341
2.429CysCys: 2.429 ± 2.36
0.405CysAsp: 0.405 ± 0.329
0.81CysGlu: 0.81 ± 0.658
1.215CysPhe: 1.215 ± 0.637
0.405CysGly: 0.405 ± 0.644
0.405CysHis: 0.405 ± 0.644
1.619CysIle: 1.619 ± 0.865
2.429CysLys: 2.429 ± 1.602
1.215CysLeu: 1.215 ± 0.801
1.215CysMet: 1.215 ± 0.792
0.81CysAsn: 0.81 ± 0.658
1.215CysPro: 1.215 ± 0.842
0.81CysGln: 0.81 ± 0.57
1.215CysArg: 1.215 ± 1.11
1.215CysSer: 1.215 ± 0.802
2.834CysThr: 2.834 ± 0.956
2.024CysVal: 2.024 ± 1.932
0.405CysTrp: 0.405 ± 0.341
0.81CysTyr: 0.81 ± 1.114
0.0CysXaa: 0.0 ± 0.0
Asp
2.429AspAla: 2.429 ± 1.256
1.619AspCys: 1.619 ± 1.154
3.644AspAsp: 3.644 ± 1.393
3.644AspGlu: 3.644 ± 2.234
2.429AspPhe: 2.429 ± 0.97
2.429AspGly: 2.429 ± 1.185
0.405AspHis: 0.405 ± 0.39
4.453AspIle: 4.453 ± 1.129
0.405AspLys: 0.405 ± 0.329
5.668AspLeu: 5.668 ± 2.095
0.405AspMet: 0.405 ± 0.341
3.644AspAsn: 3.644 ± 1.135
4.049AspPro: 4.049 ± 1.292
5.668AspGln: 5.668 ± 0.745
1.215AspArg: 1.215 ± 0.662
4.049AspSer: 4.049 ± 1.338
4.049AspThr: 4.049 ± 1.317
4.858AspVal: 4.858 ± 1.718
1.215AspTrp: 1.215 ± 0.67
2.024AspTyr: 2.024 ± 1.057
0.0AspXaa: 0.0 ± 0.0
Glu
4.049GluAla: 4.049 ± 1.298
1.215GluCys: 1.215 ± 0.987
2.429GluAsp: 2.429 ± 0.859
10.931GluGlu: 10.931 ± 4.57
2.429GluPhe: 2.429 ± 0.721
2.834GluGly: 2.834 ± 1.021
0.81GluHis: 0.81 ± 0.596
3.239GluIle: 3.239 ± 0.856
2.429GluLys: 2.429 ± 0.903
6.073GluLeu: 6.073 ± 1.712
1.619GluMet: 1.619 ± 0.827
4.453GluAsn: 4.453 ± 1.723
4.858GluPro: 4.858 ± 1.018
2.429GluGln: 2.429 ± 1.383
2.834GluArg: 2.834 ± 1.317
4.858GluSer: 4.858 ± 1.285
6.073GluThr: 6.073 ± 1.785
2.429GluVal: 2.429 ± 0.519
0.405GluTrp: 0.405 ± 0.39
2.834GluTyr: 2.834 ± 0.979
0.0GluXaa: 0.0 ± 0.0
Phe
1.215PheAla: 1.215 ± 0.465
0.81PheCys: 0.81 ± 0.978
4.453PheAsp: 4.453 ± 1.158
2.834PheGlu: 2.834 ± 1.175
3.239PhePhe: 3.239 ± 1.41
4.453PheGly: 4.453 ± 1.018
1.619PheHis: 1.619 ± 0.758
2.024PheIle: 2.024 ± 0.547
3.644PheLys: 3.644 ± 1.311
4.049PheLeu: 4.049 ± 1.252
0.405PheMet: 0.405 ± 0.308
1.215PheAsn: 1.215 ± 0.719
2.834PhePro: 2.834 ± 0.696
2.024PheGln: 2.024 ± 0.643
2.024PheArg: 2.024 ± 0.701
2.834PheSer: 2.834 ± 0.641
3.644PheThr: 3.644 ± 1.125
2.834PheVal: 2.834 ± 0.931
1.215PheTrp: 1.215 ± 0.687
2.024PheTyr: 2.024 ± 0.957
0.0PheXaa: 0.0 ± 0.0
Gly
2.429GlyAla: 2.429 ± 0.768
0.81GlyCys: 0.81 ± 0.668
3.239GlyAsp: 3.239 ± 1.194
3.644GlyGlu: 3.644 ± 0.791
0.81GlyPhe: 0.81 ± 0.78
3.644GlyGly: 3.644 ± 1.686
2.024GlyHis: 2.024 ± 1.002
6.478GlyIle: 6.478 ± 1.427
4.049GlyLys: 4.049 ± 1.059
4.453GlyLeu: 4.453 ± 1.598
0.0GlyMet: 0.0 ± 0.0
4.049GlyAsn: 4.049 ± 1.523
1.619GlyPro: 1.619 ± 0.652
2.429GlyGln: 2.429 ± 1.465
3.239GlyArg: 3.239 ± 1.447
4.453GlySer: 4.453 ± 0.913
5.668GlyThr: 5.668 ± 2.127
1.619GlyVal: 1.619 ± 1.509
0.0GlyTrp: 0.0 ± 0.0
1.215GlyTyr: 1.215 ± 0.596
0.0GlyXaa: 0.0 ± 0.0
His
0.405HisAla: 0.405 ± 0.341
0.81HisCys: 0.81 ± 0.6
0.405HisAsp: 0.405 ± 0.552
1.619HisGlu: 1.619 ± 1.018
1.215HisPhe: 1.215 ± 0.729
1.619HisGly: 1.619 ± 0.653
0.81HisHis: 0.81 ± 0.861
1.619HisIle: 1.619 ± 0.777
0.405HisLys: 0.405 ± 0.39
2.024HisLeu: 2.024 ± 1.042
0.405HisMet: 0.405 ± 0.329
0.405HisAsn: 0.405 ± 0.341
1.619HisPro: 1.619 ± 0.901
1.619HisGln: 1.619 ± 1.648
1.619HisArg: 1.619 ± 1.094
1.215HisSer: 1.215 ± 0.384
1.215HisThr: 1.215 ± 1.061
0.405HisVal: 0.405 ± 0.341
0.81HisTrp: 0.81 ± 0.582
1.619HisTyr: 1.619 ± 1.104
0.0HisXaa: 0.0 ± 0.0
Ile
3.644IleAla: 3.644 ± 1.75
0.81IleCys: 0.81 ± 0.681
5.263IleAsp: 5.263 ± 1.505
3.239IleGlu: 3.239 ± 1.182
2.429IlePhe: 2.429 ± 0.791
2.834IleGly: 2.834 ± 1.451
0.81IleHis: 0.81 ± 0.861
4.858IleIle: 4.858 ± 1.048
1.215IleLys: 1.215 ± 0.566
4.453IleLeu: 4.453 ± 1.472
0.0IleMet: 0.0 ± 0.0
4.453IleAsn: 4.453 ± 0.881
3.239IlePro: 3.239 ± 2.099
0.81IleGln: 0.81 ± 0.681
4.049IleArg: 4.049 ± 1.27
5.668IleSer: 5.668 ± 2.095
4.049IleThr: 4.049 ± 1.899
2.834IleVal: 2.834 ± 1.229
0.405IleTrp: 0.405 ± 0.557
2.024IleTyr: 2.024 ± 0.527
0.0IleXaa: 0.0 ± 0.0
Lys
1.215LysAla: 1.215 ± 0.384
2.429LysCys: 2.429 ± 1.108
2.429LysAsp: 2.429 ± 0.792
2.429LysGlu: 2.429 ± 1.684
2.429LysPhe: 2.429 ± 1.339
3.239LysGly: 3.239 ± 1.11
1.619LysHis: 1.619 ± 0.652
2.834LysIle: 2.834 ± 0.774
2.024LysLys: 2.024 ± 1.021
3.239LysLeu: 3.239 ± 1.605
1.619LysMet: 1.619 ± 0.811
2.024LysAsn: 2.024 ± 0.647
1.215LysPro: 1.215 ± 1.112
4.858LysGln: 4.858 ± 1.589
5.263LysArg: 5.263 ± 1.213
3.239LysSer: 3.239 ± 1.153
4.049LysThr: 4.049 ± 1.072
2.834LysVal: 2.834 ± 0.91
1.215LysTrp: 1.215 ± 0.465
2.429LysTyr: 2.429 ± 1.048
0.0LysXaa: 0.0 ± 0.0
Leu
3.644LeuAla: 3.644 ± 1.053
1.619LeuCys: 1.619 ± 0.84
4.453LeuAsp: 4.453 ± 2.102
6.478LeuGlu: 6.478 ± 2.512
4.858LeuPhe: 4.858 ± 1.418
5.668LeuGly: 5.668 ± 2.244
1.215LeuHis: 1.215 ± 0.667
4.049LeuIle: 4.049 ± 1.163
6.073LeuLys: 6.073 ± 1.944
7.287LeuLeu: 7.287 ± 2.272
1.619LeuMet: 1.619 ± 0.722
2.834LeuAsn: 2.834 ± 0.775
5.668LeuPro: 5.668 ± 2.095
7.692LeuGln: 7.692 ± 1.525
4.858LeuArg: 4.858 ± 0.869
7.692LeuSer: 7.692 ± 1.661
4.049LeuThr: 4.049 ± 0.998
4.858LeuVal: 4.858 ± 1.507
0.405LeuTrp: 0.405 ± 0.341
2.834LeuTyr: 2.834 ± 0.747
0.0LeuXaa: 0.0 ± 0.0
Met
0.81MetAla: 0.81 ± 0.417
0.0MetCys: 0.0 ± 0.0
1.215MetAsp: 1.215 ± 0.792
0.405MetGlu: 0.405 ± 0.39
2.024MetPhe: 2.024 ± 0.676
1.619MetGly: 1.619 ± 0.535
0.0MetHis: 0.0 ± 0.0
1.215MetIle: 1.215 ± 0.802
0.81MetLys: 0.81 ± 0.6
0.81MetLeu: 0.81 ± 0.637
0.0MetMet: 0.0 ± 0.0
1.215MetAsn: 1.215 ± 0.423
0.81MetPro: 0.81 ± 0.383
0.405MetGln: 0.405 ± 0.39
0.81MetArg: 0.81 ± 0.413
1.619MetSer: 1.619 ± 0.969
0.405MetThr: 0.405 ± 0.329
0.405MetVal: 0.405 ± 0.329
0.405MetTrp: 0.405 ± 0.329
0.405MetTyr: 0.405 ± 0.39
0.0MetXaa: 0.0 ± 0.0
Asn
2.834AsnAla: 2.834 ± 0.758
1.619AsnCys: 1.619 ± 0.487
2.024AsnAsp: 2.024 ± 0.809
2.429AsnGlu: 2.429 ± 0.899
2.834AsnPhe: 2.834 ± 1.316
2.429AsnGly: 2.429 ± 0.753
1.215AsnHis: 1.215 ± 0.67
4.049AsnIle: 4.049 ± 1.853
3.644AsnLys: 3.644 ± 1.045
3.239AsnLeu: 3.239 ± 1.341
0.81AsnMet: 0.81 ± 0.658
3.239AsnAsn: 3.239 ± 1.799
3.239AsnPro: 3.239 ± 1.522
2.834AsnGln: 2.834 ± 1.061
2.024AsnArg: 2.024 ± 0.809
2.429AsnSer: 2.429 ± 0.466
4.049AsnThr: 4.049 ± 1.37
2.834AsnVal: 2.834 ± 0.919
1.619AsnTrp: 1.619 ± 0.926
1.215AsnTyr: 1.215 ± 0.384
0.0AsnXaa: 0.0 ± 0.0
Pro
4.049ProAla: 4.049 ± 1.25
0.405ProCys: 0.405 ± 0.557
5.263ProAsp: 5.263 ± 1.679
6.073ProGlu: 6.073 ± 1.845
0.81ProPhe: 0.81 ± 0.473
1.215ProGly: 1.215 ± 0.842
1.215ProHis: 1.215 ± 0.596
2.429ProIle: 2.429 ± 1.649
4.453ProLys: 4.453 ± 2.142
6.883ProLeu: 6.883 ± 2.609
0.81ProMet: 0.81 ± 0.417
2.429ProAsn: 2.429 ± 0.971
7.287ProPro: 7.287 ± 2.128
1.215ProGln: 1.215 ± 0.647
3.239ProArg: 3.239 ± 0.962
5.263ProSer: 5.263 ± 1.77
3.644ProThr: 3.644 ± 0.949
4.049ProVal: 4.049 ± 0.652
0.0ProTrp: 0.0 ± 0.0
1.215ProTyr: 1.215 ± 1.022
0.0ProXaa: 0.0 ± 0.0
Gln
2.429GlnAla: 2.429 ± 0.724
1.215GlnCys: 1.215 ± 0.541
4.453GlnAsp: 4.453 ± 1.58
2.024GlnGlu: 2.024 ± 1.258
2.834GlnPhe: 2.834 ± 0.893
2.834GlnGly: 2.834 ± 0.596
0.81GlnHis: 0.81 ± 0.6
2.429GlnIle: 2.429 ± 0.866
0.81GlnLys: 0.81 ± 0.582
6.478GlnLeu: 6.478 ± 1.083
1.619GlnMet: 1.619 ± 0.958
2.024GlnAsn: 2.024 ± 0.828
2.834GlnPro: 2.834 ± 0.99
2.834GlnGln: 2.834 ± 1.069
3.239GlnArg: 3.239 ± 1.386
3.239GlnSer: 3.239 ± 0.752
2.024GlnThr: 2.024 ± 0.765
1.619GlnVal: 1.619 ± 0.784
0.81GlnTrp: 0.81 ± 0.607
1.215GlnTyr: 1.215 ± 0.664
0.0GlnXaa: 0.0 ± 0.0
Arg
4.858ArgAla: 4.858 ± 1.265
3.239ArgCys: 3.239 ± 1.787
2.429ArgAsp: 2.429 ± 0.791
3.239ArgGlu: 3.239 ± 1.353
1.619ArgPhe: 1.619 ± 0.664
2.429ArgGly: 2.429 ± 1.546
4.049ArgHis: 4.049 ± 1.874
2.024ArgIle: 2.024 ± 1.444
4.049ArgLys: 4.049 ± 1.118
4.858ArgLeu: 4.858 ± 0.779
0.81ArgMet: 0.81 ± 0.637
1.619ArgAsn: 1.619 ± 0.535
4.049ArgPro: 4.049 ± 1.667
3.239ArgGln: 3.239 ± 0.708
6.073ArgArg: 6.073 ± 2.817
3.239ArgSer: 3.239 ± 0.676
1.619ArgThr: 1.619 ± 0.731
2.834ArgVal: 2.834 ± 0.728
0.0ArgTrp: 0.0 ± 0.0
3.239ArgTyr: 3.239 ± 1.572
0.0ArgXaa: 0.0 ± 0.0
Ser
5.263SerAla: 5.263 ± 1.993
1.619SerCys: 1.619 ± 0.834
0.81SerAsp: 0.81 ± 0.383
5.263SerGlu: 5.263 ± 1.434
6.073SerPhe: 6.073 ± 2.104
6.883SerGly: 6.883 ± 1.588
1.215SerHis: 1.215 ± 0.566
2.024SerIle: 2.024 ± 0.625
4.453SerLys: 4.453 ± 1.612
6.883SerLeu: 6.883 ± 1.381
1.215SerMet: 1.215 ± 0.624
4.858SerAsn: 4.858 ± 1.35
4.858SerPro: 4.858 ± 2.071
2.024SerGln: 2.024 ± 0.809
6.073SerArg: 6.073 ± 2.218
5.668SerSer: 5.668 ± 2.168
3.239SerThr: 3.239 ± 1.141
4.858SerVal: 4.858 ± 0.996
0.405SerTrp: 0.405 ± 0.329
1.215SerTyr: 1.215 ± 0.4
0.0SerXaa: 0.0 ± 0.0
Thr
2.834ThrAla: 2.834 ± 0.742
0.81ThrCys: 0.81 ± 0.648
3.644ThrAsp: 3.644 ± 1.681
6.478ThrGlu: 6.478 ± 1.491
3.644ThrPhe: 3.644 ± 1.121
4.453ThrGly: 4.453 ± 1.726
1.215ThrHis: 1.215 ± 0.465
4.453ThrIle: 4.453 ± 1.606
1.215ThrLys: 1.215 ± 0.662
6.478ThrLeu: 6.478 ± 1.045
0.405ThrMet: 0.405 ± 0.329
2.834ThrAsn: 2.834 ± 1.226
4.858ThrPro: 4.858 ± 1.424
1.215ThrGln: 1.215 ± 0.624
3.239ThrArg: 3.239 ± 0.802
7.692ThrSer: 7.692 ± 2.158
4.049ThrThr: 4.049 ± 1.314
4.453ThrVal: 4.453 ± 1.379
0.405ThrTrp: 0.405 ± 0.39
0.81ThrTyr: 0.81 ± 0.423
0.0ThrXaa: 0.0 ± 0.0
Val
3.644ValAla: 3.644 ± 1.395
1.215ValCys: 1.215 ± 0.848
4.453ValAsp: 4.453 ± 1.775
3.239ValGlu: 3.239 ± 1.318
2.024ValPhe: 2.024 ± 0.454
3.644ValGly: 3.644 ± 1.071
1.215ValHis: 1.215 ± 0.625
2.429ValIle: 2.429 ± 1.71
3.239ValLys: 3.239 ± 0.699
4.049ValLeu: 4.049 ± 1.032
0.0ValMet: 0.0 ± 0.0
3.644ValAsn: 3.644 ± 1.562
3.239ValPro: 3.239 ± 0.484
1.619ValGln: 1.619 ± 0.704
2.024ValArg: 2.024 ± 1.33
5.263ValSer: 5.263 ± 0.808
2.834ValThr: 2.834 ± 0.747
2.024ValVal: 2.024 ± 0.857
1.215ValTrp: 1.215 ± 0.739
1.215ValTyr: 1.215 ± 0.687
0.0ValXaa: 0.0 ± 0.0
Trp
0.81TrpAla: 0.81 ± 0.607
0.0TrpCys: 0.0 ± 0.0
0.81TrpAsp: 0.81 ± 0.681
0.81TrpGlu: 0.81 ± 0.423
0.405TrpPhe: 0.405 ± 0.39
0.0TrpGly: 0.0 ± 0.0
0.405TrpHis: 0.405 ± 0.39
0.405TrpIle: 0.405 ± 0.329
0.81TrpLys: 0.81 ± 0.658
1.215TrpLeu: 1.215 ± 0.423
0.0TrpMet: 0.0 ± 0.0
0.405TrpAsn: 0.405 ± 0.39
0.405TrpPro: 0.405 ± 0.341
0.81TrpGln: 0.81 ± 0.417
2.024TrpArg: 2.024 ± 1.117
0.0TrpSer: 0.0 ± 0.0
1.215TrpThr: 1.215 ± 0.739
1.215TrpVal: 1.215 ± 0.637
0.0TrpTrp: 0.0 ± 0.0
0.405TrpTyr: 0.405 ± 0.329
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.619TyrAla: 1.619 ± 0.574
0.81TyrCys: 0.81 ± 1.114
1.215TyrAsp: 1.215 ± 0.732
0.81TyrGlu: 0.81 ± 0.423
2.429TyrPhe: 2.429 ± 1.269
0.81TyrGly: 0.81 ± 0.596
0.0TyrHis: 0.0 ± 0.0
1.619TyrIle: 1.619 ± 0.834
3.644TyrLys: 3.644 ± 1.255
2.834TyrLeu: 2.834 ± 0.77
1.215TyrMet: 1.215 ± 0.792
3.644TyrAsn: 3.644 ± 0.791
1.215TyrPro: 1.215 ± 0.801
0.81TyrGln: 0.81 ± 0.423
1.619TyrArg: 1.619 ± 0.729
2.024TyrSer: 2.024 ± 0.892
1.619TyrThr: 1.619 ± 0.731
1.215TyrVal: 1.215 ± 0.67
0.81TyrTrp: 0.81 ± 0.423
1.619TyrTyr: 1.619 ± 0.901
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2471 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski