Amino acid dipeptide frequency for Human papillomavirus 107

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred in proteins randomly and with equal probability, each one would be present with a frequency of 0.25% (1/400, i.e. 2.5‰). This is not the case in nature, so for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
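
As a rough illustration of how such a table could be computed, the Python sketch below counts overlapping residue pairs in a list of one-letter protein sequences and reports each of the 441 dipeptides in per mille. The function names, the one-letter input format, the mapping of every non-standard letter to X (Xaa), and the choice to normalize by the total number of dipeptide windows are assumptions made for this example; the exact procedure used by Proteome-pI is not stated here and may differ.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes
ALPHABET = AMINO_ACIDS + "X"          # X stands for Xaa (assumption of this sketch)

# Uniform expectation: 1/400 of all dipeptides, i.e. 2.5 per mille (0.25 %).
EXPECTED_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumptions: dipeptides are overlapping windows of two residues,
    they never span two proteins, and the denominator is the total
    number of such windows.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X.
        seq = "".join(c if c in AMINO_ACIDS else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_per_mille):
    """Label a frequency relative to the uniform 2.5 per mille expectation."""
    if freq_per_mille > EXPECTED_PER_MILLE:
        return "over"    # would be marked red
    if freq_per_mille < EXPECTED_PER_MILLE:
        return "under"   # would be marked blue
    return "expected"
```

With the toy input `["AAAC", "AC"]` there are four dipeptide windows (AA, AA, AC, AC), so AA and AC each come out at 500‰ and every other dipeptide at 0‰.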

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
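
The unit conversion can be written out as a short Python sketch. The two helper names are hypothetical and exist only for this example; the value 6.277 is the AlaAla entry from the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to percent (1 % = 10 per mille)."""
    return value_per_mille / 10.0


ala_ala = 6.277  # AlaAla frequency in per mille, taken from the table
print(per_mille_to_fraction(ala_ala))  # approximately 0.006277
print(per_mille_to_percent(2.5))       # the uniform expectation, 0.25 %
```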

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
6.277AlaAla: 6.277 ± 3.322
0.392AlaCys: 0.392 ± 0.344
3.531AlaAsp: 3.531 ± 0.673
2.354AlaGlu: 2.354 ± 0.806
2.354AlaPhe: 2.354 ± 0.911
3.531AlaGly: 3.531 ± 0.57
0.392AlaHis: 0.392 ± 0.428
2.354AlaIle: 2.354 ± 0.668
2.746AlaLys: 2.746 ± 0.652
5.492AlaLeu: 5.492 ± 1.49
0.392AlaMet: 0.392 ± 0.344
2.354AlaAsn: 2.354 ± 0.904
3.138AlaPro: 3.138 ± 0.793
3.531AlaGln: 3.531 ± 1.224
3.923AlaArg: 3.923 ± 0.943
3.138AlaSer: 3.138 ± 1.04
5.1AlaThr: 5.1 ± 1.373
1.962AlaVal: 1.962 ± 1.378
0.0AlaTrp: 0.0 ± 0.0
1.569AlaTyr: 1.569 ± 0.819
0.0AlaXaa: 0.0 ± 0.0
Cys
1.569CysAla: 1.569 ± 1.03
1.177CysCys: 1.177 ± 1.09
0.392CysAsp: 0.392 ± 0.344
0.392CysGlu: 0.392 ± 0.549
1.569CysPhe: 1.569 ± 0.636
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
1.962CysLys: 1.962 ± 1.004
1.177CysLeu: 1.177 ± 0.7
0.392CysMet: 0.392 ± 0.325
0.392CysAsn: 0.392 ± 0.325
2.354CysPro: 2.354 ± 1.293
0.392CysGln: 0.392 ± 0.325
1.177CysArg: 1.177 ± 0.856
1.569CysSer: 1.569 ± 0.982
0.785CysThr: 0.785 ± 0.412
0.785CysVal: 0.785 ± 0.588
0.785CysTrp: 0.785 ± 0.65
1.177CysTyr: 1.177 ± 0.629
0.0CysXaa: 0.0 ± 0.0
Asp
4.708AspAla: 4.708 ± 1.176
0.392AspCys: 0.392 ± 0.325
2.746AspAsp: 2.746 ± 1.173
1.177AspGlu: 1.177 ± 0.452
1.177AspPhe: 1.177 ± 0.573
3.531AspGly: 3.531 ± 1.249
0.392AspHis: 0.392 ± 0.344
4.708AspIle: 4.708 ± 1.2
1.962AspLys: 1.962 ± 0.507
6.669AspLeu: 6.669 ± 2.152
1.962AspMet: 1.962 ± 1.077
2.746AspAsn: 2.746 ± 0.862
4.315AspPro: 4.315 ± 1.363
1.962AspGln: 1.962 ± 0.702
3.138AspArg: 3.138 ± 1.003
3.923AspSer: 3.923 ± 1.573
5.492AspThr: 5.492 ± 1.272
2.746AspVal: 2.746 ± 1.474
1.962AspTrp: 1.962 ± 0.903
1.569AspTyr: 1.569 ± 0.788
0.0AspXaa: 0.0 ± 0.0
Glu
5.1GluAla: 5.1 ± 1.243
0.392GluCys: 0.392 ± 0.325
4.708GluAsp: 4.708 ± 0.74
7.846GluGlu: 7.846 ± 3.233
2.354GluPhe: 2.354 ± 1.032
5.1GluGly: 5.1 ± 2.766
1.962GluHis: 1.962 ± 0.647
3.531GluIle: 3.531 ± 1.594
2.354GluLys: 2.354 ± 0.796
5.1GluLeu: 5.1 ± 2.59
1.177GluMet: 1.177 ± 0.587
3.531GluAsn: 3.531 ± 0.887
2.354GluPro: 2.354 ± 0.953
2.746GluGln: 2.746 ± 0.959
6.277GluArg: 6.277 ± 1.983
3.531GluSer: 3.531 ± 2.019
5.885GluThr: 5.885 ± 1.853
6.277GluVal: 6.277 ± 1.478
0.0GluTrp: 0.0 ± 0.0
1.569GluTyr: 1.569 ± 1.374
0.0GluXaa: 0.0 ± 0.0
Phe
2.354PheAla: 2.354 ± 0.979
0.785PheCys: 0.785 ± 1.098
3.138PheAsp: 3.138 ± 0.62
3.531PheGlu: 3.531 ± 1.506
3.138PhePhe: 3.138 ± 0.877
3.138PheGly: 3.138 ± 1.079
0.785PheHis: 0.785 ± 0.65
2.354PheIle: 2.354 ± 1.121
1.962PheLys: 1.962 ± 1.175
4.315PheLeu: 4.315 ± 1.289
0.392PheMet: 0.392 ± 0.428
2.354PheAsn: 2.354 ± 0.91
1.569PhePro: 1.569 ± 0.952
1.962PheGln: 1.962 ± 0.743
1.569PheArg: 1.569 ± 0.679
2.746PheSer: 2.746 ± 0.879
2.354PheThr: 2.354 ± 1.135
1.962PheVal: 1.962 ± 1.077
1.569PheTrp: 1.569 ± 0.823
1.569PheTyr: 1.569 ± 0.27
0.0PheXaa: 0.0 ± 0.0
Gly
2.746GlyAla: 2.746 ± 0.8
1.962GlyCys: 1.962 ± 0.778
3.923GlyAsp: 3.923 ± 1.16
5.885GlyGlu: 5.885 ± 1.806
1.569GlyPhe: 1.569 ± 0.72
5.885GlyGly: 5.885 ± 1.891
2.746GlyHis: 2.746 ± 1.102
3.138GlyIle: 3.138 ± 1.009
4.708GlyLys: 4.708 ± 1.406
2.354GlyLeu: 2.354 ± 0.805
0.392GlyMet: 0.392 ± 0.375
6.277GlyAsn: 6.277 ± 1.907
2.354GlyPro: 2.354 ± 0.975
2.746GlyGln: 2.746 ± 0.604
7.846GlyArg: 7.846 ± 4.021
3.138GlySer: 3.138 ± 0.766
3.923GlyThr: 3.923 ± 0.938
2.746GlyVal: 2.746 ± 0.69
0.0GlyTrp: 0.0 ± 0.0
1.962GlyTyr: 1.962 ± 1.079
0.0GlyXaa: 0.0 ± 0.0
His
0.392HisAla: 0.392 ± 0.344
1.569HisCys: 1.569 ± 0.982
0.392HisAsp: 0.392 ± 0.344
0.392HisGlu: 0.392 ± 0.549
1.962HisPhe: 1.962 ± 1.048
0.785HisGly: 0.785 ± 0.63
0.0HisHis: 0.0 ± 0.0
0.392HisIle: 0.392 ± 0.325
1.569HisLys: 1.569 ± 0.721
1.569HisLeu: 1.569 ± 0.742
0.0HisMet: 0.0 ± 0.0
0.785HisAsn: 0.785 ± 0.417
2.354HisPro: 2.354 ± 1.145
1.177HisGln: 1.177 ± 0.644
0.392HisArg: 0.392 ± 0.325
1.569HisSer: 1.569 ± 0.535
0.785HisThr: 0.785 ± 0.65
1.569HisVal: 1.569 ± 0.721
0.785HisTrp: 0.785 ± 0.428
1.569HisTyr: 1.569 ± 0.484
0.0HisXaa: 0.0 ± 0.0
Ile
2.746IleAla: 2.746 ± 0.791
0.392IleCys: 0.392 ± 0.581
3.923IleAsp: 3.923 ± 1.677
6.277IleGlu: 6.277 ± 1.876
0.785IlePhe: 0.785 ± 0.417
3.531IleGly: 3.531 ± 1.095
0.785IleHis: 0.785 ± 0.389
1.177IleIle: 1.177 ± 1.08
1.177IleLys: 1.177 ± 0.661
3.923IleLeu: 3.923 ± 0.819
0.785IleMet: 0.785 ± 0.598
3.138IleAsn: 3.138 ± 1.172
2.746IlePro: 2.746 ± 1.643
0.785IleGln: 0.785 ± 0.72
3.531IleArg: 3.531 ± 1.78
3.923IleSer: 3.923 ± 1.031
1.177IleThr: 1.177 ± 0.776
2.354IleVal: 2.354 ± 1.412
1.177IleTrp: 1.177 ± 0.644
2.746IleTyr: 2.746 ± 0.836
0.0IleXaa: 0.0 ± 0.0
Lys
1.962LysAla: 1.962 ± 0.823
1.177LysCys: 1.177 ± 0.726
2.746LysAsp: 2.746 ± 0.509
2.746LysGlu: 2.746 ± 0.96
3.531LysPhe: 3.531 ± 1.375
4.315LysGly: 4.315 ± 1.103
1.962LysHis: 1.962 ± 0.81
1.569LysIle: 1.569 ± 0.777
1.569LysLys: 1.569 ± 0.757
4.315LysLeu: 4.315 ± 0.674
0.0LysMet: 0.0 ± 0.0
1.962LysAsn: 1.962 ± 1.553
1.177LysPro: 1.177 ± 0.823
1.569LysGln: 1.569 ± 0.584
5.492LysArg: 5.492 ± 0.84
3.923LysSer: 3.923 ± 1.877
2.354LysThr: 2.354 ± 0.717
3.923LysVal: 3.923 ± 1.839
0.392LysTrp: 0.392 ± 0.428
2.354LysTyr: 2.354 ± 0.904
0.0LysXaa: 0.0 ± 0.0
Leu
3.923LeuAla: 3.923 ± 1.595
1.962LeuCys: 1.962 ± 0.722
4.708LeuAsp: 4.708 ± 1.164
7.062LeuGlu: 7.062 ± 1.446
3.138LeuPhe: 3.138 ± 1.331
4.708LeuGly: 4.708 ± 1.859
1.177LeuHis: 1.177 ± 0.697
1.962LeuIle: 1.962 ± 0.967
4.315LeuLys: 4.315 ± 1.362
11.377LeuLeu: 11.377 ± 2.85
3.531LeuMet: 3.531 ± 1.313
1.569LeuAsn: 1.569 ± 0.78
4.315LeuPro: 4.315 ± 1.454
9.023LeuGln: 9.023 ± 1.768
3.923LeuArg: 3.923 ± 0.954
6.277LeuSer: 6.277 ± 2.341
5.1LeuThr: 5.1 ± 1.817
5.1LeuVal: 5.1 ± 1.256
1.569LeuTrp: 1.569 ± 0.679
1.962LeuTyr: 1.962 ± 1.181
0.0LeuXaa: 0.0 ± 0.0
Met
0.785MetAla: 0.785 ± 0.417
0.392MetCys: 0.392 ± 0.325
0.785MetAsp: 0.785 ± 0.659
1.177MetGlu: 1.177 ± 0.736
1.962MetPhe: 1.962 ± 0.779
0.785MetGly: 0.785 ± 0.457
0.785MetHis: 0.785 ± 0.65
1.962MetIle: 1.962 ± 0.732
0.392MetLys: 0.392 ± 0.375
2.354MetLeu: 2.354 ± 1.158
0.392MetMet: 0.392 ± 0.428
1.177MetAsn: 1.177 ± 0.657
0.0MetPro: 0.0 ± 0.0
0.392MetGln: 0.392 ± 0.325
0.785MetArg: 0.785 ± 0.457
2.354MetSer: 2.354 ± 1.315
0.785MetThr: 0.785 ± 0.412
1.177MetVal: 1.177 ± 0.661
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
1.177AsnAla: 1.177 ± 0.657
1.177AsnCys: 1.177 ± 0.644
1.569AsnAsp: 1.569 ± 0.721
2.746AsnGlu: 2.746 ± 1.268
1.962AsnPhe: 1.962 ± 0.81
3.531AsnGly: 3.531 ± 1.568
0.392AsnHis: 0.392 ± 0.549
2.746AsnIle: 2.746 ± 1.448
3.531AsnLys: 3.531 ± 1.228
2.354AsnLeu: 2.354 ± 1.677
0.392AsnMet: 0.392 ± 0.325
1.962AsnAsn: 1.962 ± 1.077
1.962AsnPro: 1.962 ± 0.93
2.354AsnGln: 2.354 ± 1.171
3.531AsnArg: 3.531 ± 1.431
4.708AsnSer: 4.708 ± 2.123
2.354AsnThr: 2.354 ± 0.823
1.569AsnVal: 1.569 ± 0.911
0.785AsnTrp: 0.785 ± 0.659
1.962AsnTyr: 1.962 ± 1.256
0.0AsnXaa: 0.0 ± 0.0
Pro
2.746ProAla: 2.746 ± 0.728
1.569ProCys: 1.569 ± 0.917
6.669ProAsp: 6.669 ± 2.004
4.315ProGlu: 4.315 ± 2.136
1.177ProPhe: 1.177 ± 0.685
1.962ProGly: 1.962 ± 1.394
0.785ProHis: 0.785 ± 0.566
3.138ProIle: 3.138 ± 2.174
3.138ProLys: 3.138 ± 1.412
5.885ProLeu: 5.885 ± 1.37
0.392ProMet: 0.392 ± 0.325
1.177ProAsn: 1.177 ± 0.657
6.669ProPro: 6.669 ± 2.283
3.923ProGln: 3.923 ± 1.093
3.138ProArg: 3.138 ± 1.423
5.492ProSer: 5.492 ± 1.943
3.923ProThr: 3.923 ± 1.29
5.1ProVal: 5.1 ± 2.01
0.0ProTrp: 0.0 ± 0.0
0.785ProTyr: 0.785 ± 0.616
0.0ProXaa: 0.0 ± 0.0
Gln
2.354GlnAla: 2.354 ± 0.768
1.177GlnCys: 1.177 ± 0.638
4.315GlnAsp: 4.315 ± 0.883
2.746GlnGlu: 2.746 ± 1.367
3.138GlnPhe: 3.138 ± 1.171
3.531GlnGly: 3.531 ± 0.647
0.785GlnHis: 0.785 ± 0.588
3.138GlnIle: 3.138 ± 0.494
1.177GlnLys: 1.177 ± 0.736
5.492GlnLeu: 5.492 ± 0.997
1.962GlnMet: 1.962 ± 0.773
2.746GlnAsn: 2.746 ± 1.052
3.531GlnPro: 3.531 ± 0.565
3.138GlnGln: 3.138 ± 0.955
1.962GlnArg: 1.962 ± 1.401
3.138GlnSer: 3.138 ± 0.828
3.138GlnThr: 3.138 ± 0.82
3.923GlnVal: 3.923 ± 1.059
0.392GlnTrp: 0.392 ± 0.325
1.962GlnTyr: 1.962 ± 0.725
0.0GlnXaa: 0.0 ± 0.0
Arg
5.492ArgAla: 5.492 ± 1.415
0.785ArgCys: 0.785 ± 0.71
1.569ArgAsp: 1.569 ± 1.01
6.669ArgGlu: 6.669 ± 2.338
3.138ArgPhe: 3.138 ± 1.07
4.315ArgGly: 4.315 ± 1.124
2.746ArgHis: 2.746 ± 1.247
1.177ArgIle: 1.177 ± 0.644
3.923ArgLys: 3.923 ± 0.958
6.669ArgLeu: 6.669 ± 1.241
1.569ArgMet: 1.569 ± 0.721
1.962ArgAsn: 1.962 ± 0.81
3.138ArgPro: 3.138 ± 1.298
4.315ArgGln: 4.315 ± 1.083
4.315ArgArg: 4.315 ± 1.742
7.062ArgSer: 7.062 ± 4.395
5.492ArgThr: 5.492 ± 1.331
2.746ArgVal: 2.746 ± 0.791
0.0ArgTrp: 0.0 ± 0.0
1.569ArgTyr: 1.569 ± 0.543
0.0ArgXaa: 0.0 ± 0.0
Ser
3.138SerAla: 3.138 ± 0.757
1.177SerCys: 1.177 ± 0.561
5.1SerAsp: 5.1 ± 1.654
5.1SerGlu: 5.1 ± 1.587
3.138SerPhe: 3.138 ± 0.691
5.885SerGly: 5.885 ± 1.117
0.392SerHis: 0.392 ± 0.325
3.138SerIle: 3.138 ± 0.757
2.746SerLys: 2.746 ± 0.488
6.669SerLeu: 6.669 ± 1.737
1.177SerMet: 1.177 ± 0.975
2.354SerAsn: 2.354 ± 1.322
5.1SerPro: 5.1 ± 1.392
3.138SerGln: 3.138 ± 0.691
8.239SerArg: 8.239 ± 3.292
6.277SerSer: 6.277 ± 1.371
5.492SerThr: 5.492 ± 2.042
5.1SerVal: 5.1 ± 1.643
0.785SerTrp: 0.785 ± 0.428
0.785SerTyr: 0.785 ± 0.465
0.0SerXaa: 0.0 ± 0.0
Thr
1.569ThrAla: 1.569 ± 0.757
0.785ThrCys: 0.785 ± 0.389
3.923ThrAsp: 3.923 ± 0.803
3.531ThrGlu: 3.531 ± 0.687
1.962ThrPhe: 1.962 ± 0.967
4.708ThrGly: 4.708 ± 1.696
1.177ThrHis: 1.177 ± 0.858
3.531ThrIle: 3.531 ± 1.277
3.923ThrLys: 3.923 ± 1.574
3.531ThrLeu: 3.531 ± 0.759
1.569ThrMet: 1.569 ± 0.775
1.962ThrAsn: 1.962 ± 1.333
7.062ThrPro: 7.062 ± 3.118
3.531ThrGln: 3.531 ± 1.323
4.315ThrArg: 4.315 ± 1.068
5.492ThrSer: 5.492 ± 2.483
5.1ThrThr: 5.1 ± 1.581
4.708ThrVal: 4.708 ± 1.299
0.785ThrTrp: 0.785 ± 0.749
0.785ThrTyr: 0.785 ± 0.412
0.0ThrXaa: 0.0 ± 0.0
Val
3.138ValAla: 3.138 ± 1.346
0.392ValCys: 0.392 ± 0.325
2.354ValAsp: 2.354 ± 0.556
5.885ValGlu: 5.885 ± 1.366
2.746ValPhe: 2.746 ± 1.289
4.315ValGly: 4.315 ± 0.791
1.569ValHis: 1.569 ± 0.617
3.531ValIle: 3.531 ± 1.921
1.962ValLys: 1.962 ± 0.732
4.315ValLeu: 4.315 ± 0.849
0.392ValMet: 0.392 ± 0.428
1.962ValAsn: 1.962 ± 0.507
5.1ValPro: 5.1 ± 1.343
4.315ValGln: 4.315 ± 0.904
3.531ValArg: 3.531 ± 1.06
3.923ValSer: 3.923 ± 1.467
3.923ValThr: 3.923 ± 1.453
2.746ValVal: 2.746 ± 1.161
0.785ValTrp: 0.785 ± 0.465
1.962ValTyr: 1.962 ± 0.979
0.0ValXaa: 0.0 ± 0.0
Trp
0.785TrpAla: 0.785 ± 0.412
0.0TrpCys: 0.0 ± 0.0
0.392TrpAsp: 0.392 ± 0.344
0.785TrpGlu: 0.785 ± 0.537
0.392TrpPhe: 0.392 ± 0.325
0.392TrpGly: 0.392 ± 0.344
0.392TrpHis: 0.392 ± 0.549
1.569TrpIle: 1.569 ± 0.856
0.785TrpLys: 0.785 ± 0.588
1.177TrpLeu: 1.177 ± 0.685
0.785TrpMet: 0.785 ± 0.457
0.785TrpAsn: 0.785 ± 0.428
0.0TrpPro: 0.0 ± 0.0
1.177TrpGln: 1.177 ± 0.771
0.392TrpArg: 0.392 ± 0.344
1.569TrpSer: 1.569 ± 1.086
0.392TrpThr: 0.392 ± 0.325
0.392TrpVal: 0.392 ± 0.325
0.0TrpTrp: 0.0 ± 0.0
0.392TrpTyr: 0.392 ± 0.325
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.569TyrAla: 1.569 ± 0.584
0.392TyrCys: 0.392 ± 0.325
0.392TyrAsp: 0.392 ± 0.344
1.177TyrGlu: 1.177 ± 0.858
2.354TyrPhe: 2.354 ± 1.089
2.354TyrGly: 2.354 ± 0.633
0.785TyrHis: 0.785 ± 0.417
1.962TyrIle: 1.962 ± 0.434
3.138TyrLys: 3.138 ± 1.045
1.962TyrLeu: 1.962 ± 0.684
0.392TyrMet: 0.392 ± 0.375
1.569TyrAsn: 1.569 ± 0.464
2.746TyrPro: 2.746 ± 0.96
1.569TyrGln: 1.569 ± 0.679
1.177TyrArg: 1.177 ± 0.452
1.177TyrSer: 1.177 ± 0.676
0.785TyrThr: 0.785 ± 0.408
1.962TyrVal: 1.962 ± 0.807
0.785TyrTrp: 0.785 ± 0.514
1.569TyrTyr: 1.569 ± 0.638
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2550 amino acids)

Note: The error was estimated by bootstrapping (100×) at the protein level.
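
A protein-level bootstrap of this kind might look like the Python sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting values is taken as the error. The function names, the fixed seed, the use of the standard deviation as the reported error, and the normalization by the number of dipeptide windows are all assumptions of this example, not details confirmed by the source.

```python
import random
import statistics


def dipeptide_frequency(proteins, dipeptide):
    """Frequency of one dipeptide in per mille, using overlapping windows."""
    count = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                count += 1
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Each replicate draws len(proteins) proteins with replacement, so the
    protein (not the residue) is the resampling unit.
    """
    rng = random.Random(seed)  # seeded only to make the sketch reproducible
    estimates = []
    for _ in range(n_replicates):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_frequency(sample, dipeptide))
    # Assumption: the error is the standard deviation across replicates.
    return statistics.pstdev(estimates)
```

A dipeptide that never occurs in any protein has a frequency of 0 in every replicate, so its estimated error is also 0, which is consistent with the 0.0 ± 0.0 entries in the table.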

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski