Amino acid dipeptide frequency for Human papillomavirus type 63

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
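A minimal sketch of how such a frequency could be computed. It assumes that overlapping adjacent pairs are counted within each protein and normalised by the total number of pairs; the page does not state the exact counting procedure, and the function name and toy sequences are hypothetical. One-letter codes are used for brevity, whereas the table uses three-letter codes.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping adjacent pairs are counted, and a pair never
    spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Made-up toy sequences, not taken from the proteome.
toy = ["MAAS", "ASL"]
freqs = dipeptide_per_mille(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

The toy input yields five pairs in total (MA, AA, AS, AS, SL), so AS accounts for two fifths of them.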

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins and all amino acids were equally frequent, each dipeptide would be present at a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
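The expected value quoted above follows from simple arithmetic, sketched here under the assumption of a uniform distribution over all ordered pairs (the function name is hypothetical):

```python
def expected_uniform_percent(alphabet_size):
    """Expected frequency (in percent) of one dipeptide if every ordered
    pair drawn from the alphabet were equally likely."""
    n_dipeptides = alphabet_size ** 2  # 20 -> 400, 21 -> 441
    return 100.0 / n_dipeptides


print(expected_uniform_percent(20))            # 0.25
print(round(expected_uniform_percent(21), 3))  # 0.227
```

With Xaa included as a 21st symbol, the uniform expectation drops slightly, to about 0.227%.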

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
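As an illustration of the unit conversion, the sketch below turns a table value into a fraction and compares it with the uniform expectation of 1/400. The comparison rule is an assumption; the page does not state the exact threshold used for the red and blue marking, and the helper names are hypothetical.

```python
EXPECTED_FRACTION = 1.0 / 400.0  # uniform expectation for 20 amino acids


def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille / 1000.0


def classify(value_per_mille, expected=EXPECTED_FRACTION):
    """Label a table value relative to the uniform expectation (assumed rule)."""
    fraction = per_mille_to_fraction(value_per_mille)
    if fraction > expected:
        return "over"
    if fraction < expected:
        return "under"
    return "as expected"


print(per_mille_to_fraction(3.49))  # AlaAla as a fraction
print(classify(10.081))             # ArgArg
print(classify(0.388))              # CysAsp
```

In these units the uniform expectation corresponds to 2.5‰, so ArgArg (10.081‰) lies well above it and CysAsp (0.388‰) well below.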

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.49 ± 0.849
AlaCys: 1.163 ± 0.578
AlaAsp: 2.326 ± 0.937
AlaGlu: 4.653 ± 0.675
AlaPhe: 4.265 ± 0.773
AlaGly: 3.102 ± 0.833
AlaHis: 0.0 ± 0.0
AlaIle: 1.939 ± 1.035
AlaLys: 3.102 ± 0.906
AlaLeu: 2.326 ± 1.404
AlaMet: 0.775 ± 0.64
AlaAsn: 1.939 ± 0.923
AlaPro: 4.265 ± 1.59
AlaGln: 2.326 ± 0.758
AlaArg: 5.816 ± 1.976
AlaSer: 7.755 ± 2.222
AlaThr: 5.041 ± 1.437
AlaVal: 3.102 ± 0.496
AlaTrp: 0.775 ± 0.572
AlaTyr: 1.551 ± 0.771
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.551 ± 0.755
CysCys: 1.163 ± 1.016
CysAsp: 0.388 ± 0.522
CysGlu: 0.0 ± 0.0
CysPhe: 0.388 ± 0.286
CysGly: 0.388 ± 0.522
CysHis: 0.0 ± 0.0
CysIle: 0.388 ± 0.286
CysLys: 1.163 ± 0.673
CysLeu: 2.714 ± 1.095
CysMet: 0.388 ± 0.286
CysAsn: 1.163 ± 0.679
CysPro: 1.939 ± 0.7
CysGln: 0.775 ± 0.534
CysArg: 0.775 ± 0.534
CysSer: 1.163 ± 0.679
CysThr: 1.551 ± 0.753
CysVal: 1.163 ± 0.758
CysTrp: 0.775 ± 0.333
CysTyr: 0.388 ± 0.522
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.551 ± 0.922
AspCys: 1.163 ± 0.521
AspAsp: 2.714 ± 1.285
AspGlu: 4.653 ± 2.058
AspPhe: 2.326 ± 0.77
AspGly: 3.49 ± 0.593
AspHis: 0.388 ± 0.401
AspIle: 8.53 ± 3.295
AspLys: 2.326 ± 0.805
AspLeu: 7.755 ± 2.621
AspMet: 1.163 ± 0.649
AspAsn: 4.653 ± 1.226
AspPro: 3.49 ± 1.215
AspGln: 1.939 ± 0.491
AspArg: 2.714 ± 1.015
AspSer: 3.102 ± 0.762
AspThr: 4.265 ± 1.222
AspVal: 3.877 ± 0.735
AspTrp: 1.551 ± 0.886
AspTyr: 2.326 ± 0.956
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.102 ± 0.88
GluCys: 0.388 ± 0.286
GluAsp: 4.653 ± 0.822
GluGlu: 5.041 ± 2.596
GluPhe: 1.939 ± 0.834
GluGly: 1.939 ± 0.843
GluHis: 2.326 ± 0.711
GluIle: 1.551 ± 0.488
GluLys: 3.877 ± 1.151
GluLeu: 5.041 ± 1.237
GluMet: 1.163 ± 0.643
GluAsn: 3.102 ± 0.994
GluPro: 1.939 ± 0.807
GluGln: 5.428 ± 1.037
GluArg: 3.102 ± 0.615
GluSer: 5.816 ± 1.099
GluThr: 4.265 ± 0.87
GluVal: 4.653 ± 0.959
GluTrp: 0.0 ± 0.0
GluTyr: 1.551 ± 0.838
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.551 ± 0.841
PheCys: 0.775 ± 0.534
PheAsp: 3.877 ± 1.389
PheGlu: 3.877 ± 0.984
PhePhe: 1.163 ± 0.606
PheGly: 0.775 ± 0.674
PheHis: 0.775 ± 0.453
PheIle: 4.265 ± 0.885
PheLys: 2.326 ± 1.111
PheLeu: 5.041 ± 0.698
PheMet: 0.775 ± 0.534
PheAsn: 0.775 ± 0.333
PhePro: 1.163 ± 0.606
PheGln: 1.551 ± 0.547
PheArg: 2.326 ± 1.047
PheSer: 0.388 ± 0.286
PheThr: 1.939 ± 0.418
PheVal: 1.163 ± 0.84
PheTrp: 1.163 ± 0.521
PheTyr: 1.939 ± 0.702
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.939 ± 0.596
GlyCys: 0.388 ± 0.337
GlyAsp: 4.265 ± 1.05
GlyGlu: 5.428 ± 1.478
GlyPhe: 0.388 ± 0.286
GlyGly: 5.041 ± 1.439
GlyHis: 1.163 ± 0.562
GlyIle: 3.877 ± 1.126
GlyLys: 2.326 ± 1.085
GlyLeu: 3.877 ± 1.183
GlyMet: 0.0 ± 0.0
GlyAsn: 1.551 ± 0.665
GlyPro: 3.49 ± 1.257
GlyGln: 4.265 ± 1.031
GlyArg: 4.653 ± 1.373
GlySer: 5.816 ± 2.112
GlyThr: 2.714 ± 0.744
GlyVal: 3.49 ± 0.957
GlyTrp: 0.388 ± 0.358
GlyTyr: 0.775 ± 0.387
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.939 ± 1.025
HisCys: 0.388 ± 0.286
HisAsp: 0.0 ± 0.0
HisGlu: 0.0 ± 0.0
HisPhe: 0.775 ± 0.387
HisGly: 0.775 ± 0.465
HisHis: 0.775 ± 0.539
HisIle: 1.551 ± 0.58
HisLys: 0.775 ± 0.534
HisLeu: 1.939 ± 0.834
HisMet: 0.388 ± 0.496
HisAsn: 1.163 ± 0.562
HisPro: 2.326 ± 0.637
HisGln: 1.163 ± 0.638
HisArg: 0.0 ± 0.0
HisSer: 1.551 ± 0.622
HisThr: 0.775 ± 0.801
HisVal: 1.551 ± 0.74
HisTrp: 1.163 ± 0.562
HisTyr: 1.163 ± 0.597
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.49 ± 1.234
IleCys: 1.551 ± 0.461
IleAsp: 2.714 ± 1.236
IleGlu: 5.428 ± 1.575
IlePhe: 2.714 ± 1.073
IleGly: 3.102 ± 1.093
IleHis: 0.775 ± 0.45
IleIle: 2.326 ± 0.755
IleLys: 1.551 ± 0.461
IleLeu: 3.49 ± 0.816
IleMet: 1.939 ± 0.8
IleAsn: 2.326 ± 0.831
IlePro: 3.49 ± 1.985
IleGln: 1.163 ± 0.509
IleArg: 0.388 ± 0.522
IleSer: 6.592 ± 1.536
IleThr: 2.714 ± 1.023
IleVal: 4.653 ± 1.34
IleTrp: 0.388 ± 0.522
IleTyr: 2.326 ± 0.82
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.816 ± 1.467
LysCys: 3.102 ± 0.978
LysAsp: 2.714 ± 0.902
LysGlu: 1.163 ± 0.47
LysPhe: 2.714 ± 1.146
LysGly: 3.877 ± 2.016
LysHis: 1.163 ± 0.858
LysIle: 1.551 ± 0.589
LysLys: 1.939 ± 0.89
LysLeu: 2.714 ± 1.656
LysMet: 0.388 ± 0.337
LysAsn: 1.939 ± 1.149
LysPro: 1.163 ± 0.858
LysGln: 1.551 ± 0.63
LysArg: 4.265 ± 0.585
LysSer: 3.102 ± 1.29
LysThr: 1.939 ± 0.805
LysVal: 3.102 ± 0.833
LysTrp: 0.0 ± 0.0
LysTyr: 1.551 ± 0.665
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.428 ± 1.444
LeuCys: 1.939 ± 1.176
LeuAsp: 5.428 ± 0.789
LeuGlu: 4.653 ± 1.385
LeuPhe: 3.877 ± 0.804
LeuGly: 7.755 ± 2.078
LeuHis: 1.939 ± 0.709
LeuIle: 3.102 ± 0.615
LeuLys: 3.49 ± 1.28
LeuLeu: 7.367 ± 2.059
LeuMet: 0.388 ± 0.286
LeuAsn: 4.265 ± 1.017
LeuPro: 2.326 ± 0.952
LeuGln: 6.204 ± 1.332
LeuArg: 9.306 ± 2.12
LeuSer: 5.041 ± 0.932
LeuThr: 5.041 ± 1.978
LeuVal: 5.041 ± 1.151
LeuTrp: 0.0 ± 0.0
LeuTyr: 3.102 ± 1.121
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.551 ± 0.771
MetCys: 0.0 ± 0.0
MetAsp: 1.551 ± 0.828
MetGlu: 0.388 ± 0.401
MetPhe: 1.163 ± 0.606
MetGly: 0.775 ± 0.405
MetHis: 0.388 ± 0.286
MetIle: 0.775 ± 0.572
MetLys: 0.388 ± 0.42
MetLeu: 1.551 ± 0.709
MetMet: 0.0 ± 0.0
MetAsn: 0.388 ± 0.337
MetPro: 0.388 ± 0.286
MetGln: 0.388 ± 0.401
MetArg: 0.775 ± 0.398
MetSer: 2.714 ± 1.284
MetThr: 1.551 ± 0.543
MetVal: 1.163 ± 0.709
MetTrp: 0.0 ± 0.0
MetTyr: 1.163 ± 0.355
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.714 ± 0.842
AsnCys: 1.163 ± 0.594
AsnAsp: 1.551 ± 0.501
AsnGlu: 1.551 ± 0.665
AsnPhe: 1.551 ± 1.093
AsnGly: 2.326 ± 0.857
AsnHis: 0.388 ± 0.286
AsnIle: 2.714 ± 0.801
AsnLys: 3.49 ± 0.693
AsnLeu: 1.163 ± 0.42
AsnMet: 0.388 ± 0.337
AsnAsn: 3.102 ± 0.589
AsnPro: 3.49 ± 0.7
AsnGln: 1.939 ± 0.783
AsnArg: 4.653 ± 1.345
AsnSer: 4.653 ± 1.506
AsnThr: 3.49 ± 1.817
AsnVal: 2.326 ± 0.868
AsnTrp: 0.388 ± 0.286
AsnTyr: 1.551 ± 0.547
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.979 ± 1.522
ProCys: 0.775 ± 0.781
ProAsp: 3.102 ± 0.914
ProGlu: 3.102 ± 1.588
ProPhe: 1.551 ± 0.69
ProGly: 1.163 ± 0.509
ProHis: 0.388 ± 0.358
ProIle: 3.877 ± 1.259
ProLys: 4.653 ± 1.562
ProLeu: 7.367 ± 1.474
ProMet: 0.388 ± 0.358
ProAsn: 2.714 ± 0.915
ProPro: 6.979 ± 1.259
ProGln: 2.714 ± 1.023
ProArg: 2.326 ± 0.831
ProSer: 3.102 ± 0.94
ProThr: 6.592 ± 2.423
ProVal: 2.326 ± 1.188
ProTrp: 0.0 ± 0.0
ProTyr: 1.939 ± 0.906
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.163 ± 0.521
GlnCys: 0.775 ± 0.534
GlnAsp: 3.102 ± 0.998
GlnGlu: 5.816 ± 1.097
GlnPhe: 1.163 ± 0.606
GlnGly: 1.551 ± 0.461
GlnHis: 0.0 ± 0.0
GlnIle: 2.326 ± 0.725
GlnLys: 0.388 ± 0.286
GlnLeu: 5.041 ± 0.906
GlnMet: 1.163 ± 0.606
GlnAsn: 3.102 ± 1.02
GlnPro: 4.265 ± 1.316
GlnGln: 1.551 ± 1.144
GlnArg: 3.102 ± 0.969
GlnSer: 4.265 ± 0.79
GlnThr: 3.102 ± 0.621
GlnVal: 2.326 ± 0.82
GlnTrp: 1.163 ± 0.806
GlnTyr: 1.939 ± 0.727
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.265 ± 0.649
ArgCys: 0.775 ± 0.612
ArgAsp: 5.428 ± 1.743
ArgGlu: 2.326 ± 0.907
ArgPhe: 5.428 ± 1.543
ArgGly: 3.102 ± 0.763
ArgHis: 3.49 ± 1.634
ArgIle: 2.326 ± 0.818
ArgLys: 5.041 ± 1.085
ArgLeu: 7.367 ± 1.541
ArgMet: 1.939 ± 0.306
ArgAsn: 1.551 ± 0.593
ArgPro: 4.265 ± 1.991
ArgGln: 0.775 ± 0.465
ArgArg: 10.081 ± 2.504
ArgSer: 6.979 ± 1.187
ArgThr: 1.939 ± 0.596
ArgVal: 3.877 ± 0.833
ArgTrp: 0.388 ± 0.401
ArgTyr: 2.714 ± 1.006
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.428 ± 0.926
SerCys: 0.0 ± 0.0
SerAsp: 7.367 ± 1.049
SerGlu: 4.265 ± 1.077
SerPhe: 2.714 ± 0.791
SerGly: 5.428 ± 0.883
SerHis: 0.775 ± 0.387
SerIle: 2.714 ± 0.413
SerLys: 1.163 ± 0.806
SerLeu: 10.081 ± 1.77
SerMet: 2.714 ± 1.354
SerAsn: 3.49 ± 1.083
SerPro: 7.755 ± 2.842
SerGln: 4.265 ± 1.1
SerArg: 5.428 ± 1.265
SerSer: 6.592 ± 1.497
SerThr: 6.592 ± 2.121
SerVal: 4.653 ± 1.358
SerTrp: 0.388 ± 0.337
SerTyr: 1.163 ± 0.6
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.714 ± 0.88
ThrCys: 1.551 ± 0.837
ThrAsp: 5.428 ± 0.953
ThrGlu: 2.714 ± 0.413
ThrPhe: 0.775 ± 0.398
ThrGly: 2.326 ± 0.545
ThrHis: 0.775 ± 0.674
ThrIle: 3.49 ± 0.965
ThrLys: 3.102 ± 0.907
ThrLeu: 3.49 ± 1.176
ThrMet: 0.775 ± 0.674
ThrAsn: 2.326 ± 0.808
ThrPro: 3.877 ± 1.789
ThrGln: 4.265 ± 0.858
ThrArg: 5.816 ± 0.651
ThrSer: 5.041 ± 0.933
ThrThr: 4.653 ± 1.385
ThrVal: 7.367 ± 1.862
ThrTrp: 1.163 ± 0.521
ThrTyr: 1.551 ± 0.589
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.49 ± 1.286
ValCys: 1.163 ± 0.56
ValAsp: 4.265 ± 1.446
ValGlu: 3.102 ± 0.854
ValPhe: 1.939 ± 0.689
ValGly: 5.041 ± 1.277
ValHis: 3.102 ± 1.155
ValIle: 3.49 ± 0.615
ValLys: 2.326 ± 1.018
ValLeu: 3.102 ± 1.041
ValMet: 0.388 ± 0.401
ValAsn: 2.714 ± 0.557
ValPro: 4.265 ± 1.215
ValGln: 3.49 ± 0.831
ValArg: 4.653 ± 1.364
ValSer: 6.592 ± 2.451
ValThr: 3.102 ± 0.949
ValVal: 4.265 ± 1.406
ValTrp: 1.551 ± 0.612
ValTyr: 0.775 ± 0.387
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.388 ± 0.286
TrpCys: 0.0 ± 0.0
TrpAsp: 0.775 ± 0.333
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.0 ± 0.0
TrpGly: 1.163 ± 0.521
TrpHis: 1.163 ± 0.657
TrpIle: 0.388 ± 0.286
TrpLys: 0.775 ± 0.572
TrpLeu: 1.551 ± 0.612
TrpMet: 0.775 ± 0.572
TrpAsn: 0.388 ± 0.401
TrpPro: 0.0 ± 0.0
TrpGln: 0.388 ± 0.337
TrpArg: 1.551 ± 0.859
TrpSer: 1.163 ± 0.42
TrpThr: 0.0 ± 0.0
TrpVal: 1.163 ± 0.521
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.775 ± 0.572
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.163 ± 0.633
TyrCys: 0.0 ± 0.0
TyrAsp: 2.326 ± 0.616
TyrGlu: 2.714 ± 0.884
TyrPhe: 0.775 ± 0.612
TyrGly: 2.714 ± 0.8
TyrHis: 0.388 ± 0.337
TyrIle: 2.326 ± 0.851
TyrLys: 1.551 ± 0.771
TyrLeu: 2.714 ± 0.65
TyrMet: 0.775 ± 0.387
TyrAsn: 1.939 ± 0.706
TyrPro: 1.163 ± 0.629
TyrGln: 1.163 ± 0.63
TyrArg: 2.326 ± 0.686
TyrSer: 1.551 ± 0.879
TyrThr: 1.939 ± 0.418
TyrVal: 1.551 ± 0.547
TyrTrp: 1.163 ± 0.521
TyrTyr: 2.326 ± 0.772
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (2580 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
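A minimal sketch of what a protein-level bootstrap could look like: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the 100 estimates is reported. The counting rule, the use of the population standard deviation, the seed, and all function names are assumptions; the page only states that bootstrapping (×100) was done at the protein level.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(estimates)


# Made-up toy sequences (one-letter codes), not taken from the proteome.
toy = ["MAAS", "ASL", "MKKL"]
print(bootstrap_error(toy, "AS"))
print(bootstrap_error(toy, "WW"))  # absent everywhere, so the error is 0.0
```

Resampling whole proteins rather than individual residues keeps each sequence intact, which is why a dipeptide that occurs in none of the proteins gets an error of exactly 0.0, as in the table.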

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski