Amino acid dipeptide frequency for Human papillomavirus 179

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred with equal probability in proteins, each would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all over-represented dipeptides are marked in red, and those that are under-represented are marked in blue in the table.
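The calculation described above can be sketched as follows. This is a minimal illustration, not the database's own code: the function name `dipeptide_frequencies`, the one-letter alphabet, and the choice to count overlapping pairs within each protein (never across protein boundaries) are assumptions made for the example.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes
ALPHABET = AMINO_ACIDS + "X"          # X stands for Xaa (non-standard or unknown)

# Expectation if all 400 standard dipeptides were equally likely, in per mille.
UNIFORM_EXPECTATION = 1000.0 / 400    # 2.5


def dipeptide_frequencies(proteins):
    """Return the per-mille frequency of all 441 dipeptides (hypothetical helper)."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2, counted inside each protein only (assumption).
        for first, second in zip(seq, seq[1:]):
            a = first if first in AMINO_ACIDS else "X"
            b = second if second in AMINO_ACIDS else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


# Toy input, not real viral sequences.
toy_proteins = ["MAAL", "AALK"]
freqs = dipeptide_frequencies(toy_proteins)

# Dipeptides above / below the uniform expectation (red / blue in the table).
over = sorted(dp for dp, f in freqs.items() if f > UNIFORM_EXPECTATION)
under = sorted(dp for dp, f in freqs.items() if f < UNIFORM_EXPECTATION)
```

With only six dipeptides in the toy input, every observed dipeptide lands far above 2.5‰; on a real proteome the split between the two groups is more informative.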

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
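As a small worked example of the unit conversion, the sketch below turns a tabulated per-mille value into a fraction and a percentage, and compares it with the 2.5‰ uniform expectation. The helper names are hypothetical; the value 6.206 is the AlaAla entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala = 6.206                    # AlaAla frequency from the table, in per mille
uniform = 1000.0 / 400             # 2.5 per mille if all 400 dipeptides were equally likely

fraction = per_mille_to_fraction(ala_ala)   # about 0.006206
percent = per_mille_to_percent(ala_ala)     # about 0.6206
fold_vs_uniform = ala_ala / uniform         # about 2.48 times the uniform expectation
```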

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide and columns the second; each cell is the frequency ± error in ‰.

| | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
| Ala | 6.206 ± 3.074 | 1.33 ± 0.84 | 5.762 ± 0.772 | 4.433 ± 1.568 | 4.433 ± 1.142 | 1.773 ± 1.105 | 1.33 ± 0.394 | 2.216 ± 0.684 | 3.989 ± 1.353 | 7.092 ± 1.897 | 0.887 ± 0.67 | 2.216 ± 1.202 | 2.216 ± 0.648 | 2.216 ± 0.974 | 3.989 ± 1.784 | 4.433 ± 1.064 | 1.33 ± 1.434 | 2.216 ± 1.003 | 0.443 ± 0.342 | 0.887 ± 0.683 | 0.0 ± 0.0 |
| Cys | 2.216 ± 1.379 | 1.773 ± 1.022 | 0.887 ± 0.511 | 1.33 ± 0.679 | 0.887 ± 0.511 | 0.887 ± 0.728 | 0.0 ± 0.0 | 2.216 ± 1.077 | 1.33 ± 0.837 | 0.887 ± 0.866 | 0.0 ± 0.0 | 0.887 ± 0.422 | 1.773 ± 0.639 | 0.887 ± 0.728 | 1.33 ± 0.528 | 1.33 ± 0.84 | 1.33 ± 0.729 | 0.887 ± 0.422 | 0.887 ± 0.518 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Asp | 3.546 ± 1.936 | 2.216 ± 0.829 | 3.546 ± 1.055 | 3.989 ± 2.302 | 2.66 ± 1.211 | 0.887 ± 0.422 | 1.33 ± 0.468 | 9.309 ± 3.44 | 1.33 ± 0.468 | 7.535 ± 1.57 | 1.773 ± 1.025 | 3.103 ± 0.693 | 5.762 ± 1.443 | 2.66 ± 1.263 | 2.66 ± 0.708 | 2.66 ± 0.944 | 4.876 ± 1.209 | 6.649 ± 2.231 | 0.0 ± 0.0 | 1.773 ± 1.219 | 0.0 ± 0.0 |
| Glu | 3.103 ± 1.005 | 1.773 ± 0.714 | 5.762 ± 0.76 | 6.649 ± 1.496 | 2.66 ± 1.05 | 2.66 ± 1.035 | 1.33 ± 0.468 | 3.989 ± 1.505 | 1.773 ± 1.155 | 3.546 ± 1.259 | 0.443 ± 0.342 | 2.216 ± 0.965 | 2.216 ± 0.644 | 1.33 ± 0.603 | 4.433 ± 1.949 | 4.433 ± 1.002 | 2.216 ± 0.366 | 2.66 ± 1.09 | 1.33 ± 1.025 | 3.103 ± 1.349 | 0.0 ± 0.0 |
| Phe | 2.66 ± 1.04 | 0.887 ± 0.578 | 3.103 ± 0.818 | 3.546 ± 1.318 | 3.989 ± 1.131 | 4.433 ± 1.464 | 1.773 ± 1.072 | 3.546 ± 1.328 | 4.433 ± 1.845 | 3.989 ± 1.527 | 0.443 ± 0.342 | 2.66 ± 1.27 | 0.887 ± 0.422 | 0.443 ± 0.467 | 3.103 ± 0.721 | 1.33 ± 0.761 | 3.103 ± 0.634 | 3.103 ± 1.388 | 0.887 ± 0.422 | 1.33 ± 0.528 | 0.0 ± 0.0 |
| Gly | 2.66 ± 0.973 | 0.887 ± 0.713 | 5.319 ± 1.337 | 3.546 ± 0.419 | 1.33 ± 0.394 | 3.989 ± 2.119 | 0.887 ± 0.713 | 3.989 ± 1.449 | 3.546 ± 0.954 | 3.103 ± 1.79 | 0.443 ± 0.356 | 4.433 ± 1.104 | 4.433 ± 1.379 | 1.773 ± 0.644 | 2.66 ± 0.641 | 4.876 ± 1.037 | 3.546 ± 1.628 | 0.887 ± 0.773 | 0.0 ± 0.0 | 1.773 ± 0.546 | 0.0 ± 0.0 |
| His | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.443 ± 0.342 | 1.33 ± 0.697 | 0.887 ± 0.785 | 0.887 ± 0.773 | 0.443 ± 0.467 | 1.773 ± 0.676 | 0.887 ± 0.485 | 2.66 ± 0.641 | 0.443 ± 0.342 | 1.773 ± 0.635 | 0.887 ± 0.713 | 0.443 ± 0.342 | 0.443 ± 0.467 | 2.216 ± 0.609 | 2.216 ± 0.785 | 0.443 ± 0.342 | 1.33 ± 0.468 | 0.443 ± 0.356 | 0.0 ± 0.0 |
| Ile | 3.103 ± 1.115 | 1.33 ± 1.434 | 3.546 ± 1.313 | 3.546 ± 2.469 | 3.103 ± 1.889 | 4.876 ± 1.519 | 1.33 ± 0.394 | 5.762 ± 1.856 | 2.216 ± 0.863 | 3.546 ± 1.11 | 0.443 ± 0.342 | 5.319 ± 2.282 | 2.66 ± 1.269 | 2.66 ± 1.0 | 1.33 ± 1.177 | 4.433 ± 0.763 | 5.319 ± 3.122 | 4.433 ± 1.117 | 0.443 ± 0.467 | 1.33 ± 0.756 | 0.0 ± 0.0 |
| Lys | 3.546 ± 0.767 | 2.216 ± 1.002 | 1.33 ± 0.702 | 1.773 ± 0.633 | 2.66 ± 1.0 | 3.989 ± 1.05 | 1.773 ± 0.97 | 1.33 ± 0.729 | 2.216 ± 0.708 | 2.216 ± 0.862 | 1.33 ± 0.468 | 1.33 ± 0.378 | 2.216 ± 1.161 | 0.887 ± 0.422 | 5.319 ± 0.922 | 5.319 ± 1.779 | 1.773 ± 0.843 | 3.989 ± 1.076 | 0.887 ± 0.518 | 2.216 ± 0.592 | 0.0 ± 0.0 |
| Leu | 5.319 ± 1.309 | 1.773 ± 0.9 | 6.649 ± 1.695 | 2.66 ± 1.925 | 5.319 ± 1.099 | 6.206 ± 1.869 | 1.33 ± 0.707 | 2.216 ± 0.778 | 4.876 ± 1.181 | 7.535 ± 2.817 | 1.33 ± 0.945 | 5.319 ± 1.066 | 5.762 ± 1.257 | 5.319 ± 1.48 | 4.876 ± 0.928 | 4.433 ± 0.76 | 6.649 ± 0.893 | 5.762 ± 0.805 | 0.443 ± 0.356 | 5.762 ± 1.24 | 0.0 ± 0.0 |
| Met | 1.33 ± 0.394 | 0.0 ± 0.0 | 0.443 ± 0.342 | 0.887 ± 0.578 | 0.443 ± 0.342 | 0.887 ± 0.422 | 0.0 ± 0.0 | 0.443 ± 0.342 | 0.443 ± 0.75 | 1.773 ± 0.782 | 1.33 ± 0.889 | 1.773 ± 0.676 | 0.887 ± 0.683 | 0.887 ± 0.446 | 0.887 ± 0.683 | 2.216 ± 1.102 | 0.0 ± 0.0 | 0.443 ± 0.342 | 0.0 ± 0.0 | 1.33 ± 0.697 | 0.0 ± 0.0 |
| Asn | 2.66 ± 0.944 | 2.216 ± 1.149 | 3.989 ± 0.537 | 3.546 ± 0.579 | 1.773 ± 0.803 | 2.216 ± 0.974 | 1.33 ± 1.025 | 2.66 ± 0.788 | 2.66 ± 0.965 | 4.876 ± 1.542 | 0.887 ± 0.422 | 3.546 ± 0.42 | 5.319 ± 2.29 | 3.103 ± 0.609 | 2.66 ± 0.472 | 4.876 ± 1.393 | 3.103 ± 0.763 | 4.876 ± 1.403 | 0.887 ± 0.511 | 0.443 ± 0.467 | 0.0 ± 0.0 |
| Pro | 3.103 ± 0.739 | 0.443 ± 0.356 | 5.319 ± 1.683 | 3.989 ± 1.691 | 0.887 ± 0.579 | 0.887 ± 0.469 | 0.443 ± 0.393 | 3.989 ± 2.283 | 4.876 ± 1.469 | 5.319 ± 1.434 | 0.887 ± 0.42 | 3.103 ± 1.194 | 5.762 ± 1.577 | 1.33 ± 0.697 | 2.66 ± 0.435 | 3.103 ± 1.619 | 4.876 ± 1.69 | 3.989 ± 1.297 | 0.443 ± 0.467 | 2.66 ± 1.715 | 0.0 ± 0.0 |
| Gln | 2.216 ± 0.96 | 0.443 ± 0.342 | 1.773 ± 0.21 | 1.33 ± 0.85 | 2.216 ± 1.708 | 2.216 ± 0.446 | 0.443 ± 0.75 | 3.103 ± 1.007 | 2.216 ± 0.366 | 3.989 ± 1.638 | 1.33 ± 1.025 | 1.773 ± 0.21 | 1.33 ± 0.761 | 2.66 ± 1.158 | 2.66 ± 0.592 | 2.216 ± 1.214 | 1.773 ± 0.623 | 1.773 ± 0.546 | 1.33 ± 0.889 | 1.33 ± 0.756 | 0.0 ± 0.0 |
| Arg | 4.876 ± 0.766 | 1.773 ± 0.938 | 3.546 ± 1.476 | 2.216 ± 1.149 | 3.546 ± 1.501 | 3.989 ± 1.204 | 3.103 ± 0.963 | 1.33 ± 0.87 | 2.216 ± 1.017 | 9.309 ± 1.313 | 0.887 ± 0.766 | 3.989 ± 1.134 | 3.103 ± 0.781 | 2.66 ± 1.158 | 7.092 ± 4.098 | 3.989 ± 1.657 | 1.33 ± 0.837 | 2.216 ± 0.797 | 0.443 ± 0.356 | 1.773 ± 0.635 | 0.0 ± 0.0 |
| Ser | 2.66 ± 0.841 | 0.887 ± 0.728 | 3.546 ± 1.098 | 3.989 ± 1.807 | 1.773 ± 0.984 | 2.66 ± 0.965 | 1.33 ± 0.679 | 4.876 ± 2.639 | 3.103 ± 1.106 | 7.979 ± 2.487 | 1.33 ± 0.716 | 5.319 ± 1.407 | 4.876 ± 2.17 | 2.66 ± 0.936 | 6.206 ± 2.703 | 3.989 ± 2.456 | 4.876 ± 0.915 | 4.433 ± 0.838 | 0.0 ± 0.0 | 2.216 ± 0.884 | 0.0 ± 0.0 |
| Thr | 3.989 ± 1.302 | 0.887 ± 0.511 | 5.762 ± 0.879 | 2.216 ± 0.422 | 3.103 ± 1.69 | 5.319 ± 1.173 | 0.443 ± 0.467 | 3.546 ± 0.627 | 1.773 ± 0.491 | 6.206 ± 1.072 | 0.0 ± 0.0 | 2.66 ± 0.5 | 2.216 ± 0.745 | 2.66 ± 0.787 | 3.989 ± 0.849 | 6.206 ± 2.588 | 9.752 ± 3.208 | 3.546 ± 0.924 | 0.443 ± 0.342 | 1.33 ± 0.702 | 0.0 ± 0.0 |
| Val | 3.989 ± 1.762 | 0.443 ± 0.473 | 4.876 ± 0.827 | 4.433 ± 1.009 | 3.103 ± 1.17 | 2.66 ± 0.842 | 0.887 ± 0.42 | 2.216 ± 1.398 | 0.887 ± 0.469 | 4.433 ± 1.056 | 1.773 ± 0.628 | 3.989 ± 0.94 | 4.433 ± 1.585 | 2.216 ± 1.049 | 2.66 ± 1.026 | 3.989 ± 0.959 | 4.433 ± 1.138 | 3.103 ± 0.806 | 0.443 ± 0.356 | 2.216 ± 1.166 | 0.0 ± 0.0 |
| Trp | 0.887 ± 0.485 | 0.0 ± 0.0 | 0.887 ± 0.713 | 0.443 ± 0.467 | 0.443 ± 0.342 | 0.0 ± 0.0 | 0.443 ± 0.467 | 0.443 ± 0.342 | 0.887 ± 0.683 | 1.33 ± 0.468 | 0.0 ± 0.0 | 0.443 ± 0.356 | 0.443 ± 0.356 | 0.443 ± 0.342 | 1.33 ± 0.723 | 0.443 ± 0.356 | 0.887 ± 0.934 | 0.887 ± 0.683 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Tyr | 1.33 ± 0.468 | 0.443 ± 0.473 | 1.773 ± 0.546 | 1.773 ± 0.782 | 4.433 ± 1.415 | 2.66 ± 0.936 | 0.0 ± 0.0 | 1.773 ± 0.843 | 2.66 ± 1.271 | 2.66 ± 0.932 | 0.0 ± 0.0 | 1.773 ± 0.21 | 0.887 ± 0.713 | 0.887 ± 0.422 | 3.103 ± 0.701 | 2.216 ± 0.775 | 2.66 ± 1.098 | 0.887 ± 0.683 | 0.0 ± 0.0 | 2.66 ± 1.241 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 6 proteins (2257 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
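A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread across replicates is reported. This is only an illustration under stated assumptions: the function names, the fixed seed, and the use of the sample standard deviation as the error measure are not taken from the source, which states only that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics
from collections import Counter


def per_mille(proteins):
    """Per-mille frequency of each dipeptide observed in the given sequences."""
    counts = Counter(a + b for seq in proteins for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Spread of one dipeptide's frequency over protein-level bootstrap replicates.

    Assumption: the error is the sample standard deviation across replicates.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    samples = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        resampled = rng.choices(proteins, k=len(proteins))
        samples.append(per_mille(resampled).get(dipeptide, 0.0))
    return statistics.stdev(samples)


# Toy input, not real viral sequences.
toy_proteins = ["MAAL", "AALK", "MKKL"]
error_aa = bootstrap_error(toy_proteins, "AA")
error_ww = bootstrap_error(toy_proteins, "WW")  # never observed, so no spread
```

A dipeptide that never occurs has a frequency of zero in every replicate, which is why such entries appear as 0.0 ± 0.0 in the table.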

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski