Amino acid dipepetide frequency for Human papillomavirus 30

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.235AlaAla: 2.235 ± 0.988
2.682AlaCys: 2.682 ± 0.956
4.47AlaAsp: 4.47 ± 1.503
1.788AlaGlu: 1.788 ± 1.07
3.129AlaPhe: 3.129 ± 0.867
3.576AlaGly: 3.576 ± 2.048
0.894AlaHis: 0.894 ± 0.843
1.788AlaIle: 1.788 ± 1.074
3.576AlaLys: 3.576 ± 1.291
3.576AlaLeu: 3.576 ± 1.328
2.235AlaMet: 2.235 ± 0.891
2.682AlaAsn: 2.682 ± 1.331
3.576AlaPro: 3.576 ± 1.901
3.576AlaGln: 3.576 ± 1.751
3.576AlaArg: 3.576 ± 1.634
3.576AlaSer: 3.576 ± 1.132
4.917AlaThr: 4.917 ± 0.769
5.364AlaVal: 5.364 ± 1.409
0.0AlaTrp: 0.0 ± 0.0
1.341AlaTyr: 1.341 ± 1.111
0.0AlaXaa: 0.0 ± 0.0
Cys
1.341CysAla: 1.341 ± 1.107
0.894CysCys: 0.894 ± 0.579
0.0CysAsp: 0.0 ± 0.0
1.341CysGlu: 1.341 ± 1.179
1.788CysPhe: 1.788 ± 0.895
1.341CysGly: 1.341 ± 0.646
0.447CysHis: 0.447 ± 0.338
0.447CysIle: 0.447 ± 0.37
2.682CysLys: 2.682 ± 1.084
3.129CysLeu: 3.129 ± 1.642
0.447CysMet: 0.447 ± 0.338
0.447CysAsn: 0.447 ± 0.537
2.235CysPro: 2.235 ± 0.508
2.682CysGln: 2.682 ± 1.288
1.788CysArg: 1.788 ± 0.71
0.894CysSer: 0.894 ± 0.675
1.341CysThr: 1.341 ± 0.676
1.788CysVal: 1.788 ± 0.625
1.341CysTrp: 1.341 ± 0.623
2.235CysTyr: 2.235 ± 1.403
0.0CysXaa: 0.0 ± 0.0
Asp
3.576AspAla: 3.576 ± 0.614
1.788AspCys: 1.788 ± 0.568
2.235AspAsp: 2.235 ± 0.863
4.917AspGlu: 4.917 ± 1.731
1.788AspPhe: 1.788 ± 0.269
4.023AspGly: 4.023 ± 0.94
0.447AspHis: 0.447 ± 0.459
4.917AspIle: 4.917 ± 1.679
1.341AspLys: 1.341 ± 0.405
3.129AspLeu: 3.129 ± 1.661
1.341AspMet: 1.341 ± 0.416
4.023AspAsn: 4.023 ± 0.868
3.129AspPro: 3.129 ± 1.614
0.894AspGln: 0.894 ± 0.741
0.894AspArg: 0.894 ± 0.675
5.811AspSer: 5.811 ± 1.695
4.47AspThr: 4.47 ± 1.013
4.917AspVal: 4.917 ± 1.164
0.894AspTrp: 0.894 ± 0.675
0.894AspTyr: 0.894 ± 0.501
0.0AspXaa: 0.0 ± 0.0
Glu
3.576GluAla: 3.576 ± 0.931
0.0GluCys: 0.0 ± 0.0
4.917GluAsp: 4.917 ± 1.126
4.47GluGlu: 4.47 ± 1.925
0.0GluPhe: 0.0 ± 0.0
2.682GluGly: 2.682 ± 1.627
0.894GluHis: 0.894 ± 0.465
2.235GluIle: 2.235 ± 0.942
1.341GluLys: 1.341 ± 0.709
4.917GluLeu: 4.917 ± 2.2
0.447GluMet: 0.447 ± 0.422
3.129GluAsn: 3.129 ± 1.423
1.788GluPro: 1.788 ± 1.109
2.682GluGln: 2.682 ± 1.371
1.341GluArg: 1.341 ± 0.623
3.129GluSer: 3.129 ± 1.301
5.364GluThr: 5.364 ± 1.218
4.47GluVal: 4.47 ± 1.782
0.894GluTrp: 0.894 ± 0.447
3.576GluTyr: 3.576 ± 1.843
0.0GluXaa: 0.0 ± 0.0
Phe
2.235PheAla: 2.235 ± 0.66
0.447PheCys: 0.447 ± 0.537
2.235PheAsp: 2.235 ± 0.741
2.682PheGlu: 2.682 ± 0.733
1.341PhePhe: 1.341 ± 0.662
2.235PheGly: 2.235 ± 0.719
0.894PheHis: 0.894 ± 0.607
0.447PheIle: 0.447 ± 0.338
2.682PheLys: 2.682 ± 0.921
5.811PheLeu: 5.811 ± 1.578
0.0PheMet: 0.0 ± 0.326
2.235PheAsn: 2.235 ± 1.42
1.788PhePro: 1.788 ± 0.639
2.235PheGln: 2.235 ± 0.719
0.894PheArg: 0.894 ± 0.428
2.235PheSer: 2.235 ± 1.093
1.788PheThr: 1.788 ± 1.687
2.235PheVal: 2.235 ± 0.469
0.894PheTrp: 0.894 ± 0.428
2.235PheTyr: 2.235 ± 0.925
0.0PheXaa: 0.0 ± 0.0
Gly
4.47GlyAla: 4.47 ± 1.014
0.894GlyCys: 0.894 ± 0.428
3.576GlyAsp: 3.576 ± 1.291
3.129GlyGlu: 3.129 ± 0.451
1.341GlyPhe: 1.341 ± 0.405
4.023GlyGly: 4.023 ± 1.269
1.788GlyHis: 1.788 ± 0.681
2.235GlyIle: 2.235 ± 0.352
4.023GlyLys: 4.023 ± 1.236
2.682GlyLeu: 2.682 ± 1.105
0.447GlyMet: 0.447 ± 0.338
4.023GlyAsn: 4.023 ± 0.768
1.788GlyPro: 1.788 ± 1.192
0.447GlyGln: 0.447 ± 0.37
1.341GlyArg: 1.341 ± 0.727
4.47GlySer: 4.47 ± 1.523
5.811GlyThr: 5.811 ± 2.349
5.364GlyVal: 5.364 ± 1.198
1.341GlyTrp: 1.341 ± 0.416
1.788GlyTyr: 1.788 ± 0.791
0.0GlyXaa: 0.0 ± 0.0
His
1.341HisAla: 1.341 ± 0.416
0.894HisCys: 0.894 ± 0.882
1.788HisAsp: 1.788 ± 0.564
0.447HisGlu: 0.447 ± 0.459
0.894HisPhe: 0.894 ± 0.44
1.788HisGly: 1.788 ± 0.909
0.447HisHis: 0.447 ± 0.537
1.788HisIle: 1.788 ± 0.564
0.894HisLys: 0.894 ± 0.447
1.788HisLeu: 1.788 ± 1.018
0.894HisMet: 0.894 ± 0.918
1.341HisAsn: 1.341 ± 0.676
0.894HisPro: 0.894 ± 0.639
0.447HisGln: 0.447 ± 0.459
1.341HisArg: 1.341 ± 0.854
1.788HisSer: 1.788 ± 0.954
2.682HisThr: 2.682 ± 0.984
2.235HisVal: 2.235 ± 0.761
0.894HisTrp: 0.894 ± 0.501
1.788HisTyr: 1.788 ± 0.484
0.0HisXaa: 0.0 ± 0.0
Ile
2.235IleAla: 2.235 ± 1.093
1.341IleCys: 1.341 ± 0.676
1.788IleAsp: 1.788 ± 1.011
3.129IleGlu: 3.129 ± 1.488
0.894IlePhe: 0.894 ± 0.741
4.023IleGly: 4.023 ± 1.993
0.894IleHis: 0.894 ± 0.44
2.682IleIle: 2.682 ± 1.627
2.682IleLys: 2.682 ± 0.9
2.235IleLeu: 2.235 ± 0.889
0.447IleMet: 0.447 ± 0.338
1.341IleAsn: 1.341 ± 0.635
2.682IlePro: 2.682 ± 1.22
3.129IleGln: 3.129 ± 1.462
1.788IleArg: 1.788 ± 1.214
5.811IleSer: 5.811 ± 1.596
3.576IleThr: 3.576 ± 0.681
3.576IleVal: 3.576 ± 1.488
0.0IleTrp: 0.0 ± 0.0
2.235IleTyr: 2.235 ± 0.948
0.0IleXaa: 0.0 ± 0.0
Lys
2.682LysAla: 2.682 ± 1.696
1.341LysCys: 1.341 ± 0.646
2.682LysAsp: 2.682 ± 1.288
2.682LysGlu: 2.682 ± 1.317
2.235LysPhe: 2.235 ± 1.118
2.235LysGly: 2.235 ± 1.366
2.235LysHis: 2.235 ± 0.788
4.023LysIle: 4.023 ± 1.524
3.576LysLys: 3.576 ± 1.968
2.682LysLeu: 2.682 ± 1.142
0.894LysMet: 0.894 ± 0.428
0.447LysAsn: 0.447 ± 0.338
1.341LysPro: 1.341 ± 0.727
4.47LysGln: 4.47 ± 0.497
5.364LysArg: 5.364 ± 1.038
4.023LysSer: 4.023 ± 1.762
3.129LysThr: 3.129 ± 0.936
4.47LysVal: 4.47 ± 1.28
0.0LysTrp: 0.0 ± 0.0
3.129LysTyr: 3.129 ± 0.747
0.0LysXaa: 0.0 ± 0.0
Leu
3.576LeuAla: 3.576 ± 0.951
3.129LeuCys: 3.129 ± 1.334
7.152LeuAsp: 7.152 ± 1.216
3.576LeuGlu: 3.576 ± 1.404
4.47LeuPhe: 4.47 ± 1.053
4.917LeuGly: 4.917 ± 1.164
3.576LeuHis: 3.576 ± 1.144
2.235LeuIle: 2.235 ± 0.881
3.576LeuLys: 3.576 ± 1.342
4.023LeuLeu: 4.023 ± 1.562
1.341LeuMet: 1.341 ± 0.585
2.235LeuAsn: 2.235 ± 0.948
2.682LeuPro: 2.682 ± 1.453
6.258LeuGln: 6.258 ± 2.149
4.47LeuArg: 4.47 ± 1.088
4.47LeuSer: 4.47 ± 1.054
2.235LeuThr: 2.235 ± 1.083
5.364LeuVal: 5.364 ± 1.504
0.447LeuTrp: 0.447 ± 0.422
4.023LeuTyr: 4.023 ± 0.793
0.0LeuXaa: 0.0 ± 0.0
Met
2.235MetAla: 2.235 ± 0.767
0.447MetCys: 0.447 ± 0.338
0.894MetAsp: 0.894 ± 0.428
0.447MetGlu: 0.447 ± 0.459
0.894MetPhe: 0.894 ± 0.428
1.341MetGly: 1.341 ± 1.121
0.447MetHis: 0.447 ± 0.578
0.0MetIle: 0.0 ± 0.0
0.447MetLys: 0.447 ± 0.338
1.788MetLeu: 1.788 ± 1.011
0.447MetMet: 0.447 ± 0.408
0.894MetAsn: 0.894 ± 0.428
0.447MetPro: 0.447 ± 0.338
1.788MetGln: 1.788 ± 0.708
0.894MetArg: 0.894 ± 0.44
1.341MetSer: 1.341 ± 0.676
0.894MetThr: 0.894 ± 0.501
1.788MetVal: 1.788 ± 0.639
0.894MetTrp: 0.894 ± 0.501
0.447MetTyr: 0.447 ± 0.338
0.0MetXaa: 0.0 ± 0.0
Asn
1.341AsnAla: 1.341 ± 0.662
0.447AsnCys: 0.447 ± 0.338
1.788AsnAsp: 1.788 ± 0.895
1.341AsnGlu: 1.341 ± 0.841
1.788AsnPhe: 1.788 ± 1.102
1.341AsnGly: 1.341 ± 0.416
0.447AsnHis: 0.447 ± 0.338
3.129AsnIle: 3.129 ± 1.023
2.235AsnLys: 2.235 ± 0.977
2.682AsnLeu: 2.682 ± 0.775
1.788AsnMet: 1.788 ± 0.568
3.576AsnAsn: 3.576 ± 0.918
3.576AsnPro: 3.576 ± 1.167
1.341AsnGln: 1.341 ± 0.726
3.129AsnArg: 3.129 ± 1.621
4.47AsnSer: 4.47 ± 1.394
6.258AsnThr: 6.258 ± 2.269
4.023AsnVal: 4.023 ± 1.115
0.447AsnTrp: 0.447 ± 0.338
0.447AsnTyr: 0.447 ± 0.537
0.0AsnXaa: 0.0 ± 0.0
Pro
3.576ProAla: 3.576 ± 1.468
0.894ProCys: 0.894 ± 0.765
3.576ProAsp: 3.576 ± 1.311
2.682ProGlu: 2.682 ± 0.752
1.788ProPhe: 1.788 ± 0.954
1.341ProGly: 1.341 ± 0.446
0.0ProHis: 0.0 ± 0.0
4.023ProIle: 4.023 ± 1.792
2.235ProLys: 2.235 ± 0.708
6.705ProLeu: 6.705 ± 1.764
0.447ProMet: 0.447 ± 0.422
1.788ProAsn: 1.788 ± 0.269
6.258ProPro: 6.258 ± 2.015
2.235ProGln: 2.235 ± 1.301
1.341ProArg: 1.341 ± 0.688
6.705ProSer: 6.705 ± 2.297
5.811ProThr: 5.811 ± 2.97
4.917ProVal: 4.917 ± 1.322
0.0ProTrp: 0.0 ± 0.0
2.235ProTyr: 2.235 ± 0.884
0.0ProXaa: 0.0 ± 0.0
Gln
3.129GlnAla: 3.129 ± 1.269
3.129GlnCys: 3.129 ± 1.539
2.682GlnAsp: 2.682 ± 1.284
2.682GlnGlu: 2.682 ± 0.937
2.682GlnPhe: 2.682 ± 1.353
1.788GlnGly: 1.788 ± 0.856
0.894GlnHis: 0.894 ± 0.579
2.235GlnIle: 2.235 ± 0.863
1.341GlnLys: 1.341 ± 0.75
6.705GlnLeu: 6.705 ± 2.15
2.235GlnMet: 2.235 ± 0.638
0.894GlnAsn: 0.894 ± 0.447
3.129GlnPro: 3.129 ± 0.794
5.364GlnGln: 5.364 ± 2.528
2.235GlnArg: 2.235 ± 0.634
1.341GlnSer: 1.341 ± 0.75
4.917GlnThr: 4.917 ± 1.107
2.235GlnVal: 2.235 ± 0.634
1.788GlnTrp: 1.788 ± 0.564
1.341GlnTyr: 1.341 ± 0.726
0.0GlnXaa: 0.0 ± 0.0
Arg
4.023ArgAla: 4.023 ± 1.357
2.235ArgCys: 2.235 ± 1.372
0.894ArgAsp: 0.894 ± 0.741
2.682ArgGlu: 2.682 ± 0.921
0.894ArgPhe: 0.894 ± 0.575
1.341ArgGly: 1.341 ± 0.405
3.129ArgHis: 3.129 ± 1.293
2.235ArgIle: 2.235 ± 0.645
3.576ArgLys: 3.576 ± 0.844
4.917ArgLeu: 4.917 ± 0.612
0.894ArgMet: 0.894 ± 0.606
0.894ArgAsn: 0.894 ± 0.428
3.576ArgPro: 3.576 ± 0.965
2.235ArgGln: 2.235 ± 1.406
5.364ArgArg: 5.364 ± 3.088
2.235ArgSer: 2.235 ± 0.469
4.47ArgThr: 4.47 ± 0.965
1.788ArgVal: 1.788 ± 0.681
0.0ArgTrp: 0.0 ± 0.0
2.235ArgTyr: 2.235 ± 0.836
0.0ArgXaa: 0.0 ± 0.0
Ser
5.811SerAla: 5.811 ± 2.599
0.447SerCys: 0.447 ± 0.338
3.129SerAsp: 3.129 ± 1.495
3.129SerGlu: 3.129 ± 1.132
3.129SerPhe: 3.129 ± 1.458
4.917SerGly: 4.917 ± 2.014
1.788SerHis: 1.788 ± 0.564
3.129SerIle: 3.129 ± 0.637
3.576SerLys: 3.576 ± 1.311
4.917SerLeu: 4.917 ± 1.476
1.341SerMet: 1.341 ± 0.726
3.129SerAsn: 3.129 ± 1.888
4.917SerPro: 4.917 ± 0.645
2.235SerGln: 2.235 ± 0.719
4.023SerArg: 4.023 ± 1.07
9.388SerSer: 9.388 ± 2.036
8.494SerThr: 8.494 ± 2.394
4.023SerVal: 4.023 ± 0.79
0.0SerTrp: 0.0 ± 0.0
1.788SerTyr: 1.788 ± 0.564
0.0SerXaa: 0.0 ± 0.0
Thr
3.576ThrAla: 3.576 ± 0.617
3.576ThrCys: 3.576 ± 0.614
4.47ThrAsp: 4.47 ± 0.705
4.917ThrGlu: 4.917 ± 2.082
3.129ThrPhe: 3.129 ± 1.981
5.364ThrGly: 5.364 ± 2.09
1.788ThrHis: 1.788 ± 0.744
2.682ThrIle: 2.682 ± 0.699
4.023ThrLys: 4.023 ± 1.24
6.258ThrLeu: 6.258 ± 1.065
1.341ThrMet: 1.341 ± 0.727
5.364ThrAsn: 5.364 ± 1.091
7.152ThrPro: 7.152 ± 1.073
4.023ThrGln: 4.023 ± 0.8
2.682ThrArg: 2.682 ± 1.259
4.023ThrSer: 4.023 ± 0.986
9.835ThrThr: 9.835 ± 3.213
3.129ThrVal: 3.129 ± 1.209
2.235ThrTrp: 2.235 ± 0.645
2.682ThrTyr: 2.682 ± 1.147
0.0ThrXaa: 0.0 ± 0.0
Val
4.023ValAla: 4.023 ± 1.143
2.682ValCys: 2.682 ± 1.179
5.364ValAsp: 5.364 ± 0.706
4.023ValGlu: 4.023 ± 1.141
2.235ValPhe: 2.235 ± 1.405
2.235ValGly: 2.235 ± 0.977
3.576ValHis: 3.576 ± 1.885
3.129ValIle: 3.129 ± 1.5
3.576ValLys: 3.576 ± 0.632
1.788ValLeu: 1.788 ± 0.744
0.894ValMet: 0.894 ± 0.619
4.023ValAsn: 4.023 ± 0.845
6.258ValPro: 6.258 ± 1.293
4.023ValGln: 4.023 ± 2.117
3.129ValArg: 3.129 ± 0.791
4.917ValSer: 4.917 ± 1.679
3.576ValThr: 3.576 ± 1.495
7.599ValVal: 7.599 ± 2.899
1.788ValTrp: 1.788 ± 1.002
4.917ValTyr: 4.917 ± 2.031
0.0ValXaa: 0.0 ± 0.0
Trp
1.341TrpAla: 1.341 ± 0.726
0.0TrpCys: 0.0 ± 0.0
0.894TrpAsp: 0.894 ± 0.465
0.447TrpGlu: 0.447 ± 0.459
1.341TrpPhe: 1.341 ± 0.416
1.341TrpGly: 1.341 ± 0.405
0.447TrpHis: 0.447 ± 0.459
1.341TrpIle: 1.341 ± 1.013
0.894TrpLys: 0.894 ± 0.447
0.894TrpLeu: 0.894 ± 0.428
0.0TrpMet: 0.0 ± 0.0
0.447TrpAsn: 0.447 ± 0.37
0.894TrpPro: 0.894 ± 0.843
0.447TrpGln: 0.447 ± 0.459
1.788TrpArg: 1.788 ± 0.793
0.447TrpSer: 0.447 ± 0.338
1.788TrpThr: 1.788 ± 1.491
0.447TrpVal: 0.447 ± 0.459
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.235TyrAla: 2.235 ± 0.852
1.341TyrCys: 1.341 ± 1.075
0.894TyrAsp: 0.894 ± 0.843
1.341TyrGlu: 1.341 ± 0.689
2.235TyrPhe: 2.235 ± 0.761
3.129TyrGly: 3.129 ± 1.192
1.341TyrHis: 1.341 ± 0.416
1.788TyrIle: 1.788 ± 0.609
5.364TyrLys: 5.364 ± 2.867
3.129TyrLeu: 3.129 ± 1.402
0.447TyrMet: 0.447 ± 0.338
2.682TyrAsn: 2.682 ± 1.695
0.447TyrPro: 0.447 ± 0.37
2.235TyrGln: 2.235 ± 0.758
2.235TyrArg: 2.235 ± 1.243
2.235TyrSer: 2.235 ± 1.028
0.894TyrThr: 0.894 ± 0.447
4.023TyrVal: 4.023 ± 1.05
1.341TyrTrp: 1.341 ± 0.499
3.576TyrTyr: 3.576 ± 0.976
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2238 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski