Amino acid dipeptide frequency for Human papillomavirus 165

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each of the 400 would be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
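The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used to build the table: the function names `dipeptide_per_mille` and `classify`, the toy sequences, and the use of the uniform 2.5‰ expectation as the over/under threshold are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for the 400 standard pairs."""
    counts = Counter()
    for seq in sequences:
        # Overlapping windows of length 2; a pair never spans two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {a + b: 0.0 for a, b in product(AMINO_ACIDS, repeat=2)}
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(AMINO_ACIDS, repeat=2)}


def classify(freq, expected=1000.0 / 400):
    """Compare a per-mille frequency with the uniform expectation (assumed threshold)."""
    if freq > expected:
        return "over"
    if freq < expected:
        return "under"
    return "expected"


# Hypothetical toy input, not taken from the proteome.
toy = ["MKLLA", "AKLL"]
freqs = dipeptide_per_mille(toy)
print(round(freqs["KL"], 3), classify(freqs["KL"]))
```

With real data, `sequences` would hold one string per protein; residues outside the 20 standard letters would have to be mapped to an extra symbol to reproduce the 441-cell variant.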

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
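As a small worked example of the unit conversion, the following sketch turns a per-mille value from the table into a fraction and a percentage. The helper names are hypothetical and exist only for illustration.

```python
def per_mille_to_fraction(value):
    """Per mille to plain fraction: multiply by 10^-3."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Per mille to percent: divide by 10."""
    return value / 10.0


# AlaAla is listed as 7.654 per mille in the table.
ala_ala_fraction = per_mille_to_fraction(7.654)
ala_ala_percent = per_mille_to_percent(7.654)
# The uniform expectation of 0.25% corresponds to 2.5 per mille.
uniform_percent = per_mille_to_percent(2.5)
```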

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.654 ± 1.781
AlaCys: 1.351 ± 1.165
AlaAsp: 4.052 ± 1.648
AlaGlu: 5.853 ± 0.657
AlaPhe: 4.502 ± 0.989
AlaGly: 2.701 ± 1.09
AlaHis: 0.9 ± 0.534
AlaIle: 3.152 ± 1.124
AlaLys: 0.45 ± 0.482
AlaLeu: 3.602 ± 0.761
AlaMet: 0.45 ± 0.392
AlaAsn: 2.701 ± 0.889
AlaPro: 4.052 ± 2.874
AlaGln: 2.251 ± 0.433
AlaArg: 2.701 ± 1.787
AlaSer: 3.602 ± 1.151
AlaThr: 3.602 ± 0.733
AlaVal: 4.953 ± 1.9
AlaTrp: 0.0 ± 0.0
AlaTyr: 1.351 ± 0.388
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.45 ± 0.553
CysCys: 1.801 ± 1.239
CysAsp: 0.45 ± 0.392
CysGlu: 2.251 ± 1.311
CysPhe: 1.351 ± 0.863
CysGly: 0.45 ± 0.553
CysHis: 0.45 ± 0.553
CysIle: 0.9 ± 0.783
CysLys: 1.801 ± 1.328
CysLeu: 1.351 ± 1.641
CysMet: 0.0 ± 0.0
CysAsn: 1.801 ± 0.752
CysPro: 1.801 ± 0.878
CysGln: 0.9 ± 0.466
CysArg: 0.45 ± 0.547
CysSer: 1.351 ± 1.12
CysThr: 1.801 ± 0.892
CysVal: 1.801 ± 1.025
CysTrp: 1.351 ± 0.758
CysTyr: 0.45 ± 0.547
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.303 ± 1.258
AspCys: 2.251 ± 1.154
AspAsp: 3.152 ± 0.679
AspGlu: 3.152 ± 0.818
AspPhe: 1.801 ± 0.586
AspGly: 2.251 ± 0.957
AspHis: 0.9 ± 0.466
AspIle: 8.104 ± 3.289
AspLys: 3.152 ± 1.24
AspLeu: 3.602 ± 1.246
AspMet: 0.9 ± 0.698
AspAsn: 3.152 ± 1.405
AspPro: 4.052 ± 1.738
AspGln: 0.0 ± 0.0
AspArg: 0.9 ± 0.534
AspSer: 7.204 ± 2.607
AspThr: 5.403 ± 0.783
AspVal: 5.403 ± 2.092
AspTrp: 0.45 ± 0.392
AspTyr: 1.801 ± 0.624
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.801 ± 0.931
GluCys: 0.9 ± 0.783
GluAsp: 4.953 ± 1.291
GluGlu: 6.303 ± 1.669
GluPhe: 4.502 ± 1.762
GluGly: 3.602 ± 0.75
GluHis: 1.351 ± 1.248
GluIle: 4.052 ± 1.493
GluLys: 0.9 ± 0.678
GluLeu: 3.602 ± 1.118
GluMet: 0.45 ± 0.392
GluAsn: 5.403 ± 0.841
GluPro: 4.953 ± 1.111
GluGln: 2.251 ± 1.072
GluArg: 3.602 ± 1.026
GluSer: 4.953 ± 1.817
GluThr: 5.403 ± 1.474
GluVal: 2.251 ± 0.773
GluTrp: 0.45 ± 0.349
GluTyr: 2.251 ± 1.472
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.152 ± 0.745
PheCys: 2.701 ± 1.908
PheAsp: 2.701 ± 0.572
PheGlu: 3.602 ± 1.869
PhePhe: 2.701 ± 0.819
PheGly: 1.801 ± 0.82
PheHis: 1.801 ± 1.173
PheIle: 2.701 ± 1.229
PheLys: 4.052 ± 2.064
PheLeu: 4.502 ± 2.004
PheMet: 0.45 ± 0.427
PheAsn: 2.251 ± 1.335
PhePro: 1.801 ± 0.632
PheGln: 2.701 ± 1.087
PheArg: 2.251 ± 0.83
PheSer: 2.251 ± 0.712
PheThr: 2.251 ± 1.358
PheVal: 2.701 ± 0.969
PheTrp: 1.351 ± 0.682
PheTyr: 2.251 ± 1.192
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.602 ± 2.046
GlyCys: 0.9 ± 0.534
GlyAsp: 4.953 ± 1.572
GlyGlu: 5.403 ± 1.279
GlyPhe: 1.351 ± 0.416
GlyGly: 2.701 ± 1.337
GlyHis: 0.9 ± 0.534
GlyIle: 2.701 ± 0.932
GlyLys: 3.152 ± 1.011
GlyLeu: 3.152 ± 1.204
GlyMet: 0.9 ± 0.624
GlyAsn: 3.602 ± 1.003
GlyPro: 2.701 ± 0.581
GlyGln: 1.351 ± 0.416
GlyArg: 2.701 ± 0.878
GlySer: 2.701 ± 0.763
GlyThr: 5.853 ± 2.269
GlyVal: 1.351 ± 0.388
GlyTrp: 0.0 ± 0.0
GlyTyr: 2.251 ± 0.423
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.9 ± 0.698
HisCys: 0.45 ± 0.547
HisAsp: 0.9 ± 0.686
HisGlu: 0.9 ± 0.574
HisPhe: 1.351 ± 0.718
HisGly: 1.351 ± 0.727
HisHis: 0.45 ± 0.482
HisIle: 1.801 ± 0.636
HisLys: 1.351 ± 1.041
HisLeu: 0.9 ± 0.637
HisMet: 0.9 ± 0.546
HisAsn: 0.0 ± 0.0
HisPro: 0.45 ± 0.349
HisGln: 0.9 ± 0.441
HisArg: 0.9 ± 0.443
HisSer: 1.801 ± 0.931
HisThr: 1.801 ± 0.242
HisVal: 0.9 ± 0.466
HisTrp: 1.351 ± 1.041
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.152 ± 1.124
IleCys: 0.45 ± 0.349
IleAsp: 1.351 ± 0.779
IleGlu: 5.403 ± 1.162
IlePhe: 2.701 ± 1.215
IleGly: 4.052 ± 1.297
IleHis: 0.0 ± 0.0
IleIle: 4.052 ± 1.335
IleLys: 1.351 ± 0.416
IleLeu: 4.052 ± 1.143
IleMet: 0.45 ± 0.428
IleAsn: 2.701 ± 1.49
IlePro: 3.602 ± 1.977
IleGln: 0.9 ± 0.698
IleArg: 3.602 ± 1.14
IleSer: 4.052 ± 1.684
IleThr: 2.701 ± 1.498
IleVal: 6.754 ± 0.949
IleTrp: 0.45 ± 0.547
IleTyr: 2.251 ± 0.93
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.152 ± 1.05
LysCys: 1.351 ± 0.435
LysAsp: 2.251 ± 1.107
LysGlu: 3.152 ± 1.724
LysPhe: 3.152 ± 1.043
LysGly: 2.251 ± 0.423
LysHis: 1.801 ± 0.805
LysIle: 1.801 ± 0.586
LysLys: 2.251 ± 1.3
LysLeu: 2.701 ± 1.041
LysMet: 1.801 ± 0.607
LysAsn: 4.052 ± 1.607
LysPro: 1.351 ± 0.729
LysGln: 2.701 ± 0.701
LysArg: 4.502 ± 0.914
LysSer: 2.701 ± 1.429
LysThr: 2.251 ± 0.359
LysVal: 4.502 ± 1.307
LysTrp: 0.0 ± 0.0
LysTyr: 2.251 ± 1.089
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.403 ± 1.241
LeuCys: 1.801 ± 1.072
LeuAsp: 4.953 ± 0.859
LeuGlu: 5.853 ± 1.494
LeuPhe: 5.853 ± 1.035
LeuGly: 5.403 ± 1.761
LeuHis: 1.801 ± 1.069
LeuIle: 4.502 ± 0.337
LeuLys: 3.602 ± 0.827
LeuLeu: 6.303 ± 1.665
LeuMet: 0.45 ± 0.43
LeuAsn: 4.052 ± 0.696
LeuPro: 4.953 ± 1.021
LeuGln: 5.403 ± 1.847
LeuArg: 3.602 ± 1.688
LeuSer: 5.853 ± 2.278
LeuThr: 4.953 ± 1.373
LeuVal: 4.953 ± 1.356
LeuTrp: 0.45 ± 0.349
LeuTyr: 4.953 ± 0.431
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.45 ± 0.392
MetCys: 0.0 ± 0.0
MetAsp: 0.45 ± 0.428
MetGlu: 0.9 ± 0.678
MetPhe: 0.45 ± 0.392
MetGly: 1.351 ± 0.758
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 0.0 ± 0.0
MetLeu: 0.9 ± 0.466
MetMet: 0.0 ± 0.0
MetAsn: 1.351 ± 0.693
MetPro: 0.9 ± 0.443
MetGln: 0.45 ± 0.482
MetArg: 0.9 ± 0.64
MetSer: 0.9 ± 0.534
MetThr: 1.351 ± 0.435
MetVal: 1.351 ± 1.175
MetTrp: 0.45 ± 0.349
MetTyr: 0.45 ± 0.392
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.251 ± 1.958
AsnCys: 1.351 ± 1.041
AsnAsp: 3.602 ± 1.078
AsnGlu: 1.351 ± 0.749
AsnPhe: 3.152 ± 0.97
AsnGly: 3.152 ± 1.6
AsnHis: 0.45 ± 0.482
AsnIle: 1.801 ± 1.122
AsnLys: 3.602 ± 0.827
AsnLeu: 4.502 ± 1.377
AsnMet: 0.9 ± 0.732
AsnAsn: 3.602 ± 1.764
AsnPro: 3.152 ± 1.331
AsnGln: 0.45 ± 0.392
AsnArg: 2.701 ± 0.577
AsnSer: 4.502 ± 1.901
AsnThr: 5.403 ± 0.772
AsnVal: 6.754 ± 1.234
AsnTrp: 0.9 ± 0.466
AsnTyr: 1.351 ± 0.705
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.052 ± 1.99
ProCys: 1.351 ± 0.758
ProAsp: 4.953 ± 1.091
ProGlu: 3.152 ± 1.575
ProPhe: 1.801 ± 0.622
ProGly: 1.801 ± 0.711
ProHis: 0.9 ± 0.467
ProIle: 4.052 ± 2.372
ProLys: 3.152 ± 0.559
ProLeu: 6.754 ± 1.95
ProMet: 0.45 ± 0.392
ProAsn: 3.602 ± 1.003
ProPro: 4.052 ± 1.612
ProGln: 1.801 ± 0.632
ProArg: 3.152 ± 0.685
ProSer: 4.052 ± 1.799
ProThr: 4.052 ± 1.642
ProVal: 3.602 ± 1.232
ProTrp: 0.45 ± 0.482
ProTyr: 1.801 ± 1.06
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.45 ± 0.349
GlnCys: 0.9 ± 1.094
GlnAsp: 3.152 ± 1.224
GlnGlu: 2.251 ± 1.3
GlnPhe: 0.9 ± 0.443
GlnGly: 2.701 ± 1.202
GlnHis: 0.9 ± 0.585
GlnIle: 0.45 ± 0.349
GlnLys: 0.9 ± 0.574
GlnLeu: 4.052 ± 1.341
GlnMet: 0.9 ± 0.466
GlnAsn: 1.801 ± 0.242
GlnPro: 1.801 ± 0.915
GlnGln: 2.701 ± 0.711
GlnArg: 1.801 ± 0.922
GlnSer: 0.9 ± 0.636
GlnThr: 2.251 ± 0.892
GlnVal: 4.502 ± 1.376
GlnTrp: 1.801 ± 1.051
GlnTyr: 2.251 ± 0.902
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.801 ± 0.915
ArgCys: 0.9 ± 0.664
ArgAsp: 1.801 ± 0.915
ArgGlu: 1.351 ± 0.758
ArgPhe: 1.801 ± 0.711
ArgGly: 3.602 ± 1.29
ArgHis: 0.9 ± 0.698
ArgIle: 0.9 ± 0.441
ArgLys: 5.853 ± 1.01
ArgLeu: 9.005 ± 1.721
ArgMet: 0.9 ± 0.466
ArgAsn: 2.701 ± 1.008
ArgPro: 2.701 ± 1.49
ArgGln: 2.701 ± 1.168
ArgArg: 6.303 ± 2.857
ArgSer: 3.152 ± 1.046
ArgThr: 2.701 ± 0.763
ArgVal: 3.152 ± 0.772
ArgTrp: 0.0 ± 0.0
ArgTyr: 0.9 ± 0.678
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.953 ± 1.599
SerCys: 0.45 ± 0.392
SerAsp: 3.152 ± 0.775
SerGlu: 4.502 ± 1.734
SerPhe: 3.152 ± 1.178
SerGly: 3.602 ± 0.996
SerHis: 1.801 ± 1.051
SerIle: 2.251 ± 1.19
SerLys: 2.251 ± 0.957
SerLeu: 10.356 ± 1.219
SerMet: 0.9 ± 0.783
SerAsn: 4.052 ± 2.506
SerPro: 4.052 ± 1.124
SerGln: 3.152 ± 1.045
SerArg: 2.701 ± 1.294
SerSer: 6.303 ± 2.191
SerThr: 7.204 ± 1.868
SerVal: 4.502 ± 1.322
SerTrp: 0.9 ± 0.441
SerTyr: 0.9 ± 0.857
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.602 ± 1.822
ThrCys: 1.801 ± 1.281
ThrAsp: 5.853 ± 1.81
ThrGlu: 5.403 ± 1.36
ThrPhe: 1.801 ± 0.59
ThrGly: 3.152 ± 0.79
ThrHis: 1.801 ± 0.705
ThrIle: 4.953 ± 2.687
ThrLys: 3.602 ± 1.354
ThrLeu: 7.204 ± 2.668
ThrMet: 0.45 ± 0.392
ThrAsn: 4.502 ± 1.329
ThrPro: 6.303 ± 1.258
ThrGln: 3.152 ± 1.155
ThrArg: 3.152 ± 0.745
ThrSer: 4.052 ± 1.659
ThrThr: 4.953 ± 0.743
ThrVal: 5.853 ± 1.687
ThrTrp: 0.45 ± 0.428
ThrTyr: 1.801 ± 1.122
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.602 ± 1.288
ValCys: 1.351 ± 1.256
ValAsp: 8.104 ± 1.4
ValGlu: 2.251 ± 1.271
ValPhe: 3.602 ± 0.916
ValGly: 3.152 ± 0.432
ValHis: 1.351 ± 0.416
ValIle: 3.602 ± 1.152
ValLys: 4.052 ± 1.703
ValLeu: 4.502 ± 1.968
ValMet: 0.45 ± 0.349
ValAsn: 1.801 ± 0.569
ValPro: 4.502 ± 1.156
ValGln: 1.801 ± 0.838
ValArg: 4.502 ± 1.422
ValSer: 8.104 ± 1.708
ValThr: 6.754 ± 1.097
ValVal: 1.351 ± 0.659
ValTrp: 0.9 ± 0.534
ValTyr: 2.701 ± 0.87
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.351 ± 0.718
TrpCys: 0.0 ± 0.0
TrpAsp: 1.351 ± 0.763
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.45 ± 0.482
TrpGly: 0.45 ± 0.349
TrpHis: 0.45 ± 0.482
TrpIle: 1.351 ± 1.175
TrpLys: 1.351 ± 0.682
TrpLeu: 1.351 ± 0.693
TrpMet: 0.0 ± 0.0
TrpAsn: 0.9 ± 0.534
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 0.45 ± 0.547
TrpSer: 0.9 ± 0.534
TrpThr: 1.351 ± 0.435
TrpVal: 0.9 ± 0.678
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.45 ± 0.392
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.801 ± 0.798
TyrCys: 0.9 ± 0.636
TyrAsp: 1.801 ± 0.59
TyrGlu: 0.9 ± 0.664
TyrPhe: 4.052 ± 1.033
TyrGly: 2.701 ± 0.701
TyrHis: 0.45 ± 0.428
TyrIle: 0.9 ± 0.443
TyrLys: 3.152 ± 0.95
TyrLeu: 2.701 ± 0.986
TyrMet: 0.45 ± 0.392
TyrAsn: 0.9 ± 0.783
TyrPro: 1.801 ± 1.176
TyrGln: 1.801 ± 0.622
TyrArg: 2.251 ± 0.765
TyrSer: 1.801 ± 0.586
TyrThr: 1.801 ± 0.805
TyrVal: 0.9 ± 0.441
TyrTrp: 1.351 ± 0.763
TyrTyr: 2.701 ± 1.009
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2222 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
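A protein-level bootstrap of this kind can be sketched as follows. This is an illustration under stated assumptions rather than the database's actual procedure: the function name `bootstrap_error`, the fixed random seed, and the use of the sample standard deviation across replicates as the reported error are all choices made for this example.

```python
import random
import statistics
from collections import Counter


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Estimate the spread of one dipeptide's per-mille frequency.

    Each round resamples whole proteins with replacement, so the
    resampling unit is the protein, not the individual residue pair.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        counts = Counter()
        for seq in sample:
            counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
        total = sum(counts.values())
        estimates.append(1000.0 * counts[dipeptide] / total if total else 0.0)
    # Assumption: the error is the standard deviation over the replicates.
    return statistics.stdev(estimates)


# Hypothetical toy proteome, not taken from the database.
toy_proteins = ["MKLLAAK", "AKLLMK", "LLKKAA"]
error_kl = bootstrap_error(toy_proteins, "KL")
```

With only a handful of proteins, as in a small viral proteome, the replicates differ mainly in which proteins are drawn more than once, which is why the errors can be large relative to the frequencies themselves.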

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski