Amino acid dipepetide frequency for human papillomavirus 71

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.515AlaAla: 5.515 ± 1.866
2.97AlaCys: 2.97 ± 1.528
4.667AlaAsp: 4.667 ± 1.34
2.546AlaGlu: 2.546 ± 1.166
2.546AlaPhe: 2.546 ± 0.836
6.788AlaGly: 6.788 ± 1.052
2.121AlaHis: 2.121 ± 1.078
2.121AlaIle: 2.121 ± 0.878
4.243AlaLys: 4.243 ± 1.925
6.788AlaLeu: 6.788 ± 1.161
1.697AlaMet: 1.697 ± 0.587
2.121AlaAsn: 2.121 ± 1.012
4.243AlaPro: 4.243 ± 1.709
1.697AlaGln: 1.697 ± 0.568
3.394AlaArg: 3.394 ± 1.32
9.758AlaSer: 9.758 ± 2.326
3.394AlaThr: 3.394 ± 0.829
2.121AlaVal: 2.121 ± 1.017
0.0AlaTrp: 0.0 ± 0.0
2.97AlaTyr: 2.97 ± 0.869
0.0AlaXaa: 0.0 ± 0.0
Cys
1.697CysAla: 1.697 ± 0.654
0.849CysCys: 0.849 ± 0.687
0.424CysAsp: 0.424 ± 0.394
1.273CysGlu: 1.273 ± 0.706
0.424CysPhe: 0.424 ± 0.348
0.849CysGly: 0.849 ± 0.648
0.849CysHis: 0.849 ± 0.687
1.273CysIle: 1.273 ± 0.52
2.97CysLys: 2.97 ± 1.248
2.546CysLeu: 2.546 ± 1.186
0.424CysMet: 0.424 ± 0.348
0.849CysAsn: 0.849 ± 0.638
1.697CysPro: 1.697 ± 0.49
1.697CysGln: 1.697 ± 0.585
1.697CysArg: 1.697 ± 0.977
1.273CysSer: 1.273 ± 0.52
1.697CysThr: 1.697 ± 1.026
2.121CysVal: 2.121 ± 0.65
2.546CysTrp: 2.546 ± 1.02
0.849CysTyr: 0.849 ± 0.638
0.0CysXaa: 0.0 ± 0.0
Asp
4.243AspAla: 4.243 ± 1.261
1.697AspCys: 1.697 ± 1.139
2.97AspAsp: 2.97 ± 1.592
3.394AspGlu: 3.394 ± 1.634
1.273AspPhe: 1.273 ± 0.398
4.667AspGly: 4.667 ± 1.154
1.273AspHis: 1.273 ± 0.771
4.667AspIle: 4.667 ± 1.742
0.849AspLys: 0.849 ± 0.432
3.818AspLeu: 3.818 ± 1.593
0.849AspMet: 0.849 ± 0.389
2.546AspAsn: 2.546 ± 0.436
2.546AspPro: 2.546 ± 1.105
1.697AspGln: 1.697 ± 1.139
2.121AspArg: 2.121 ± 0.878
5.515AspSer: 5.515 ± 2.422
8.061AspThr: 8.061 ± 1.145
4.243AspVal: 4.243 ± 1.136
1.273AspTrp: 1.273 ± 0.642
2.121AspTyr: 2.121 ± 1.424
0.0AspXaa: 0.0 ± 0.0
Glu
5.091GluAla: 5.091 ± 1.737
0.424GluCys: 0.424 ± 0.348
3.394GluAsp: 3.394 ± 0.965
4.243GluGlu: 4.243 ± 1.654
0.849GluPhe: 0.849 ± 0.389
2.97GluGly: 2.97 ± 0.736
2.121GluHis: 2.121 ± 1.057
2.121GluIle: 2.121 ± 0.85
2.121GluLys: 2.121 ± 1.268
3.818GluLeu: 3.818 ± 0.828
0.424GluMet: 0.424 ± 0.394
0.849GluAsn: 0.849 ± 0.458
2.97GluPro: 2.97 ± 1.405
4.667GluGln: 4.667 ± 1.516
2.121GluArg: 2.121 ± 0.622
2.546GluSer: 2.546 ± 0.671
3.818GluThr: 3.818 ± 1.478
4.667GluVal: 4.667 ± 0.839
0.424GluTrp: 0.424 ± 0.348
1.697GluTyr: 1.697 ± 0.575
0.0GluXaa: 0.0 ± 0.0
Phe
3.818PheAla: 3.818 ± 1.689
0.849PheCys: 0.849 ± 0.49
1.697PheAsp: 1.697 ± 0.68
1.273PheGlu: 1.273 ± 0.749
2.121PhePhe: 2.121 ± 0.747
2.546PheGly: 2.546 ± 1.333
0.424PheHis: 0.424 ± 0.471
2.121PheIle: 2.121 ± 0.863
1.697PheLys: 1.697 ± 1.011
4.667PheLeu: 4.667 ± 1.187
0.849PheMet: 0.849 ± 0.478
0.849PheAsn: 0.849 ± 0.725
2.121PhePro: 2.121 ± 0.683
1.273PheGln: 1.273 ± 0.642
2.121PheArg: 2.121 ± 0.91
2.546PheSer: 2.546 ± 0.93
1.697PheThr: 1.697 ± 1.116
1.697PheVal: 1.697 ± 0.243
0.849PheTrp: 0.849 ± 0.389
1.697PheTyr: 1.697 ± 0.627
0.0PheXaa: 0.0 ± 0.0
Gly
4.243GlyAla: 4.243 ± 1.094
0.424GlyCys: 0.424 ± 0.363
5.515GlyAsp: 5.515 ± 1.554
4.243GlyGlu: 4.243 ± 1.034
1.697GlyPhe: 1.697 ± 0.873
4.243GlyGly: 4.243 ± 1.476
2.546GlyHis: 2.546 ± 0.74
5.091GlyIle: 5.091 ± 1.78
2.121GlyLys: 2.121 ± 0.547
3.394GlyLeu: 3.394 ± 0.965
0.424GlyMet: 0.424 ± 0.363
2.97GlyAsn: 2.97 ± 1.204
3.818GlyPro: 3.818 ± 1.889
2.121GlyGln: 2.121 ± 0.57
3.394GlyArg: 3.394 ± 0.717
3.818GlySer: 3.818 ± 1.746
9.334GlyThr: 9.334 ± 2.385
3.394GlyVal: 3.394 ± 1.459
0.424GlyTrp: 0.424 ± 0.348
2.546GlyTyr: 2.546 ± 0.617
0.0GlyXaa: 0.0 ± 0.0
His
1.273HisAla: 1.273 ± 0.405
0.424HisCys: 0.424 ± 0.348
1.697HisAsp: 1.697 ± 0.673
1.273HisGlu: 1.273 ± 0.616
1.273HisPhe: 1.273 ± 0.398
2.121HisGly: 2.121 ± 0.819
0.424HisHis: 0.424 ± 0.616
1.273HisIle: 1.273 ± 0.405
1.273HisLys: 1.273 ± 1.05
2.121HisLeu: 2.121 ± 1.501
0.0HisMet: 0.0 ± 0.0
0.424HisAsn: 0.424 ± 0.393
2.121HisPro: 2.121 ± 0.841
1.273HisGln: 1.273 ± 0.405
0.849HisArg: 0.849 ± 0.432
1.273HisSer: 1.273 ± 1.21
1.273HisThr: 1.273 ± 0.831
1.697HisVal: 1.697 ± 1.561
1.697HisTrp: 1.697 ± 0.938
0.849HisTyr: 0.849 ± 0.432
0.0HisXaa: 0.0 ± 0.0
Ile
2.121IleAla: 2.121 ± 0.9
1.697IleCys: 1.697 ± 0.752
2.546IleAsp: 2.546 ± 1.356
2.546IleGlu: 2.546 ± 0.811
2.97IlePhe: 2.97 ± 1.095
2.546IleGly: 2.546 ± 1.105
0.849IleHis: 0.849 ± 0.455
2.121IleIle: 2.121 ± 0.707
0.849IleLys: 0.849 ± 0.788
2.121IleLeu: 2.121 ± 0.906
0.424IleMet: 0.424 ± 0.363
1.273IleAsn: 1.273 ± 0.398
2.97IlePro: 2.97 ± 1.356
2.546IleGln: 2.546 ± 0.53
2.121IleArg: 2.121 ± 0.704
2.121IleSer: 2.121 ± 1.082
3.394IleThr: 3.394 ± 0.932
4.667IleVal: 4.667 ± 1.227
0.424IleTrp: 0.424 ± 0.394
2.121IleTyr: 2.121 ± 1.493
0.0IleXaa: 0.0 ± 0.0
Lys
3.394LysAla: 3.394 ± 2.199
2.546LysCys: 2.546 ± 1.472
2.121LysAsp: 2.121 ± 0.65
2.546LysGlu: 2.546 ± 1.126
2.121LysPhe: 2.121 ± 1.003
3.818LysGly: 3.818 ± 0.759
1.697LysHis: 1.697 ± 1.017
1.697LysIle: 1.697 ± 0.655
2.121LysLys: 2.121 ± 0.52
2.121LysLeu: 2.121 ± 0.4
0.0LysMet: 0.0 ± 0.0
2.121LysAsn: 2.121 ± 1.003
2.546LysPro: 2.546 ± 1.354
1.273LysGln: 1.273 ± 0.45
4.667LysArg: 4.667 ± 1.087
2.121LysSer: 2.121 ± 0.897
2.97LysThr: 2.97 ± 1.209
2.546LysVal: 2.546 ± 1.091
0.424LysTrp: 0.424 ± 0.471
1.697LysTyr: 1.697 ± 0.721
0.0LysXaa: 0.0 ± 0.0
Leu
6.364LeuAla: 6.364 ± 2.003
2.121LeuCys: 2.121 ± 1.304
4.243LeuAsp: 4.243 ± 1.158
3.818LeuGlu: 3.818 ± 1.197
4.243LeuPhe: 4.243 ± 1.38
8.061LeuGly: 8.061 ± 1.149
2.121LeuHis: 2.121 ± 0.396
2.121LeuIle: 2.121 ± 1.321
4.243LeuLys: 4.243 ± 1.093
7.637LeuLeu: 7.637 ± 2.176
1.273LeuMet: 1.273 ± 0.606
3.394LeuAsn: 3.394 ± 1.157
2.121LeuPro: 2.121 ± 1.397
8.485LeuGln: 8.485 ± 1.796
7.213LeuArg: 7.213 ± 0.927
4.243LeuSer: 4.243 ± 1.501
4.243LeuThr: 4.243 ± 1.896
3.394LeuVal: 3.394 ± 0.807
1.273LeuTrp: 1.273 ± 0.393
4.243LeuTyr: 4.243 ± 0.689
0.0LeuXaa: 0.0 ± 0.0
Met
1.697MetAla: 1.697 ± 1.0
0.849MetCys: 0.849 ± 0.45
1.697MetAsp: 1.697 ± 1.079
1.697MetGlu: 1.697 ± 0.585
0.849MetPhe: 0.849 ± 0.725
0.424MetGly: 0.424 ± 0.348
0.424MetHis: 0.424 ± 0.616
0.0MetIle: 0.0 ± 0.0
0.0MetLys: 0.0 ± 0.0
1.697MetLeu: 1.697 ± 1.0
0.424MetMet: 0.424 ± 0.616
0.0MetAsn: 0.0 ± 0.0
0.849MetPro: 0.849 ± 0.786
0.424MetGln: 0.424 ± 0.393
0.424MetArg: 0.424 ± 0.475
2.546MetSer: 2.546 ± 0.844
0.0MetThr: 0.0 ± 0.0
2.121MetVal: 2.121 ± 0.681
0.424MetTrp: 0.424 ± 0.394
0.424MetTyr: 0.424 ± 0.348
0.0MetXaa: 0.0 ± 0.0
Asn
2.121AsnAla: 2.121 ± 1.289
0.849AsnCys: 0.849 ± 0.696
2.121AsnAsp: 2.121 ± 0.863
1.697AsnGlu: 1.697 ± 1.115
1.273AsnPhe: 1.273 ± 0.398
0.849AsnGly: 0.849 ± 0.389
0.0AsnHis: 0.0 ± 0.0
0.849AsnIle: 0.849 ± 0.389
2.546AsnLys: 2.546 ± 0.798
2.121AsnLeu: 2.121 ± 1.271
0.849AsnMet: 0.849 ± 0.389
0.849AsnAsn: 0.849 ± 0.595
2.121AsnPro: 2.121 ± 0.808
2.121AsnGln: 2.121 ± 0.942
2.97AsnArg: 2.97 ± 1.096
2.97AsnSer: 2.97 ± 1.189
2.97AsnThr: 2.97 ± 0.863
1.697AsnVal: 1.697 ± 0.639
0.424AsnTrp: 0.424 ± 0.348
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
5.091ProAla: 5.091 ± 1.982
0.849ProCys: 0.849 ± 0.595
4.243ProAsp: 4.243 ± 1.743
1.697ProGlu: 1.697 ± 0.681
1.697ProPhe: 1.697 ± 0.55
2.546ProGly: 2.546 ± 0.53
0.0ProHis: 0.0 ± 0.0
3.394ProIle: 3.394 ± 1.962
3.818ProLys: 3.818 ± 1.101
8.061ProLeu: 8.061 ± 2.013
1.273ProMet: 1.273 ± 0.486
1.697ProAsn: 1.697 ± 0.536
8.061ProPro: 8.061 ± 2.658
1.697ProGln: 1.697 ± 0.673
2.546ProArg: 2.546 ± 1.287
5.515ProSer: 5.515 ± 2.564
6.364ProThr: 6.364 ± 2.366
3.394ProVal: 3.394 ± 1.452
0.0ProTrp: 0.0 ± 0.0
2.121ProTyr: 2.121 ± 0.992
0.0ProXaa: 0.0 ± 0.0
Gln
3.394GlnAla: 3.394 ± 0.623
1.697GlnCys: 1.697 ± 1.109
3.394GlnAsp: 3.394 ± 1.189
2.97GlnGlu: 2.97 ± 1.247
2.97GlnPhe: 2.97 ± 0.773
1.273GlnGly: 1.273 ± 0.666
0.849GlnHis: 0.849 ± 0.45
1.273GlnIle: 1.273 ± 0.405
2.121GlnLys: 2.121 ± 0.511
6.788GlnLeu: 6.788 ± 1.704
1.273GlnMet: 1.273 ± 0.666
0.849GlnAsn: 0.849 ± 0.455
4.243GlnPro: 4.243 ± 1.013
4.667GlnGln: 4.667 ± 1.544
2.546GlnArg: 2.546 ± 1.075
0.849GlnSer: 0.849 ± 0.478
2.546GlnThr: 2.546 ± 0.796
3.394GlnVal: 3.394 ± 1.346
0.849GlnTrp: 0.849 ± 0.696
0.849GlnTyr: 0.849 ± 0.595
0.0GlnXaa: 0.0 ± 0.0
Arg
5.515ArgAla: 5.515 ± 1.172
1.697ArgCys: 1.697 ± 0.758
2.121ArgAsp: 2.121 ± 0.763
1.697ArgGlu: 1.697 ± 0.685
2.121ArgPhe: 2.121 ± 0.683
1.697ArgGly: 1.697 ± 0.861
2.97ArgHis: 2.97 ± 0.795
0.849ArgIle: 0.849 ± 0.595
3.818ArgLys: 3.818 ± 1.126
6.788ArgLeu: 6.788 ± 0.986
0.849ArgMet: 0.849 ± 0.705
0.849ArgAsn: 0.849 ± 0.49
3.818ArgPro: 3.818 ± 1.589
3.394ArgGln: 3.394 ± 1.184
2.97ArgArg: 2.97 ± 0.985
3.394ArgSer: 3.394 ± 1.277
4.667ArgThr: 4.667 ± 1.383
5.515ArgVal: 5.515 ± 1.887
1.697ArgTrp: 1.697 ± 0.673
1.697ArgTyr: 1.697 ± 0.878
0.0ArgXaa: 0.0 ± 0.0
Ser
3.394SerAla: 3.394 ± 0.787
1.273SerCys: 1.273 ± 0.398
4.243SerAsp: 4.243 ± 1.53
1.273SerGlu: 1.273 ± 0.666
2.121SerPhe: 2.121 ± 1.049
5.515SerGly: 5.515 ± 1.952
1.273SerHis: 1.273 ± 0.405
2.97SerIle: 2.97 ± 1.393
2.121SerLys: 2.121 ± 0.734
4.243SerLeu: 4.243 ± 1.844
2.97SerMet: 2.97 ± 1.154
2.546SerAsn: 2.546 ± 1.22
3.394SerPro: 3.394 ± 0.671
2.121SerGln: 2.121 ± 0.675
5.515SerArg: 5.515 ± 1.921
8.91SerSer: 8.91 ± 1.706
11.031SerThr: 11.031 ± 2.737
3.818SerVal: 3.818 ± 1.389
0.849SerTrp: 0.849 ± 0.643
2.546SerTyr: 2.546 ± 1.25
0.0SerXaa: 0.0 ± 0.0
Thr
3.818ThrAla: 3.818 ± 1.837
3.394ThrCys: 3.394 ± 0.548
2.97ThrAsp: 2.97 ± 0.696
5.515ThrGlu: 5.515 ± 1.231
2.546ThrPhe: 2.546 ± 0.53
8.91ThrGly: 8.91 ± 2.491
0.849ThrHis: 0.849 ± 0.455
2.121ThrIle: 2.121 ± 0.719
1.697ThrLys: 1.697 ± 1.295
7.637ThrLeu: 7.637 ± 2.33
0.849ThrMet: 0.849 ± 0.45
5.091ThrAsn: 5.091 ± 1.002
7.637ThrPro: 7.637 ± 1.979
3.818ThrGln: 3.818 ± 1.182
3.394ThrArg: 3.394 ± 0.821
5.091ThrSer: 5.091 ± 2.142
5.94ThrThr: 5.94 ± 1.294
8.061ThrVal: 8.061 ± 0.968
0.849ThrTrp: 0.849 ± 0.788
1.273ThrTyr: 1.273 ± 0.752
0.0ThrXaa: 0.0 ± 0.0
Val
3.394ValAla: 3.394 ± 1.05
2.121ValCys: 2.121 ± 0.806
6.364ValAsp: 6.364 ± 0.851
4.667ValGlu: 4.667 ± 1.542
1.273ValPhe: 1.273 ± 0.398
2.546ValGly: 2.546 ± 0.61
2.121ValHis: 2.121 ± 1.107
4.243ValIle: 4.243 ± 0.82
1.697ValLys: 1.697 ± 0.957
3.818ValLeu: 3.818 ± 1.744
0.849ValMet: 0.849 ± 0.676
0.849ValAsn: 0.849 ± 0.458
4.667ValPro: 4.667 ± 0.787
1.697ValGln: 1.697 ± 0.603
3.818ValArg: 3.818 ± 0.788
5.94ValSer: 5.94 ± 2.23
6.788ValThr: 6.788 ± 1.921
6.364ValVal: 6.364 ± 1.272
0.849ValTrp: 0.849 ± 0.595
2.546ValTyr: 2.546 ± 0.939
0.0ValXaa: 0.0 ± 0.0
Trp
2.546TrpAla: 2.546 ± 0.733
0.0TrpCys: 0.0 ± 0.0
0.424TrpAsp: 0.424 ± 0.394
1.697TrpGlu: 1.697 ± 1.139
1.273TrpPhe: 1.273 ± 0.706
1.273TrpGly: 1.273 ± 0.398
1.273TrpHis: 1.273 ± 1.072
0.849TrpIle: 0.849 ± 0.696
1.697TrpLys: 1.697 ± 0.732
0.849TrpLeu: 0.849 ± 0.389
0.0TrpMet: 0.0 ± 0.0
0.424TrpAsn: 0.424 ± 0.363
0.0TrpPro: 0.0 ± 0.0
0.849TrpGln: 0.849 ± 0.638
2.121TrpArg: 2.121 ± 0.946
0.424TrpSer: 0.424 ± 0.348
0.424TrpThr: 0.424 ± 0.394
0.424TrpVal: 0.424 ± 0.348
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.97TyrAla: 2.97 ± 1.035
1.273TyrCys: 1.273 ± 0.922
2.97TyrAsp: 2.97 ± 1.066
1.273TyrGlu: 1.273 ± 0.405
1.273TyrPhe: 1.273 ± 0.398
2.121TyrGly: 2.121 ± 0.897
0.424TyrHis: 0.424 ± 0.393
1.273TyrIle: 1.273 ± 0.749
2.546TyrLys: 2.546 ± 0.708
3.818TyrLeu: 3.818 ± 1.4
0.849TyrMet: 0.849 ± 0.45
0.849TyrAsn: 0.849 ± 0.725
2.121TyrPro: 2.121 ± 0.641
1.273TyrGln: 1.273 ± 0.796
2.121TyrArg: 2.121 ± 0.728
1.273TyrSer: 1.273 ± 0.753
0.849TyrThr: 0.849 ± 0.788
1.697TyrVal: 1.697 ± 0.55
1.273TyrTrp: 1.273 ± 0.52
1.697TyrTyr: 1.697 ± 0.68
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2358 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski