Amino acid dipepetide frequency for Human papillomavirus 178

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.35AlaAla: 7.35 ± 2.185
1.633AlaCys: 1.633 ± 0.633
3.267AlaAsp: 3.267 ± 0.719
4.9AlaGlu: 4.9 ± 0.882
2.45AlaPhe: 2.45 ± 0.906
2.45AlaGly: 2.45 ± 0.895
0.817AlaHis: 0.817 ± 0.704
3.267AlaIle: 3.267 ± 1.642
2.45AlaLys: 2.45 ± 0.731
5.308AlaLeu: 5.308 ± 1.993
0.0AlaMet: 0.0 ± 0.0
1.633AlaAsn: 1.633 ± 1.113
4.492AlaPro: 4.492 ± 0.836
3.267AlaGln: 3.267 ± 1.096
2.042AlaArg: 2.042 ± 0.691
2.858AlaSer: 2.858 ± 0.59
5.308AlaThr: 5.308 ± 1.318
3.675AlaVal: 3.675 ± 1.57
0.408AlaTrp: 0.408 ± 0.377
2.042AlaTyr: 2.042 ± 0.984
0.0AlaXaa: 0.0 ± 0.0
Cys
0.408CysAla: 0.408 ± 0.352
0.817CysCys: 0.817 ± 0.945
1.633CysAsp: 1.633 ± 0.722
1.633CysGlu: 1.633 ± 0.919
1.633CysPhe: 1.633 ± 0.728
0.408CysGly: 0.408 ± 0.472
0.408CysHis: 0.408 ± 0.317
2.45CysIle: 2.45 ± 1.004
1.633CysLys: 1.633 ± 0.56
1.633CysLeu: 1.633 ± 1.019
0.408CysMet: 0.408 ± 0.432
2.042CysAsn: 2.042 ± 0.81
1.633CysPro: 1.633 ± 0.802
0.408CysGln: 0.408 ± 0.464
1.225CysArg: 1.225 ± 1.017
0.817CysSer: 0.817 ± 0.495
0.817CysThr: 0.817 ± 0.509
1.225CysVal: 1.225 ± 0.89
1.225CysTrp: 1.225 ± 0.448
0.408CysTyr: 0.408 ± 0.432
0.0CysXaa: 0.0 ± 0.0
Asp
5.717AspAla: 5.717 ± 1.885
2.042AspCys: 2.042 ± 1.268
5.308AspAsp: 5.308 ± 1.611
3.675AspGlu: 3.675 ± 0.903
4.492AspPhe: 4.492 ± 1.786
1.633AspGly: 1.633 ± 0.469
0.408AspHis: 0.408 ± 0.317
6.942AspIle: 6.942 ± 0.989
2.042AspLys: 2.042 ± 0.901
2.45AspLeu: 2.45 ± 1.036
1.633AspMet: 1.633 ± 0.802
4.492AspAsn: 4.492 ± 0.703
3.267AspPro: 3.267 ± 1.354
2.858AspGln: 2.858 ± 0.929
2.858AspArg: 2.858 ± 0.574
7.758AspSer: 7.758 ± 1.607
2.45AspThr: 2.45 ± 0.767
4.492AspVal: 4.492 ± 1.865
1.633AspTrp: 1.633 ± 0.97
1.633AspTyr: 1.633 ± 0.665
0.0AspXaa: 0.0 ± 0.0
Glu
3.675GluAla: 3.675 ± 1.145
1.225GluCys: 1.225 ± 0.888
2.858GluAsp: 2.858 ± 0.685
8.167GluGlu: 8.167 ± 5.075
2.042GluPhe: 2.042 ± 0.456
2.042GluGly: 2.042 ± 0.789
0.408GluHis: 0.408 ± 0.472
2.858GluIle: 2.858 ± 0.668
2.042GluLys: 2.042 ± 1.037
6.942GluLeu: 6.942 ± 1.505
0.408GluMet: 0.408 ± 0.317
4.492GluAsn: 4.492 ± 1.373
2.042GluPro: 2.042 ± 1.35
3.675GluGln: 3.675 ± 0.885
3.267GluArg: 3.267 ± 1.292
3.267GluSer: 3.267 ± 0.91
4.083GluThr: 4.083 ± 0.633
3.675GluVal: 3.675 ± 1.345
0.408GluTrp: 0.408 ± 0.369
2.45GluTyr: 2.45 ± 0.887
0.0GluXaa: 0.0 ± 0.0
Phe
1.633PheAla: 1.633 ± 0.641
1.633PheCys: 1.633 ± 0.98
4.083PheAsp: 4.083 ± 1.2
2.858PheGlu: 2.858 ± 1.142
2.45PhePhe: 2.45 ± 1.212
2.45PheGly: 2.45 ± 0.917
0.817PheHis: 0.817 ± 0.58
0.817PheIle: 0.817 ± 0.456
4.083PheLys: 4.083 ± 2.135
4.9PheLeu: 4.9 ± 0.976
0.817PheMet: 0.817 ± 0.527
3.675PheAsn: 3.675 ± 1.399
2.45PhePro: 2.45 ± 0.802
4.9PheGln: 4.9 ± 1.14
1.225PheArg: 1.225 ± 0.493
2.042PheSer: 2.042 ± 0.7
2.042PheThr: 2.042 ± 0.717
3.267PheVal: 3.267 ± 1.066
0.408PheTrp: 0.408 ± 0.352
2.45PheTyr: 2.45 ± 0.886
0.0PheXaa: 0.0 ± 0.0
Gly
4.9GlyAla: 4.9 ± 1.952
0.408GlyCys: 0.408 ± 0.352
2.042GlyAsp: 2.042 ± 0.965
2.042GlyGlu: 2.042 ± 1.168
0.408GlyPhe: 0.408 ± 0.352
5.717GlyGly: 5.717 ± 2.728
1.225GlyHis: 1.225 ± 0.6
3.267GlyIle: 3.267 ± 0.655
3.267GlyLys: 3.267 ± 0.591
4.492GlyLeu: 4.492 ± 1.228
0.0GlyMet: 0.0 ± 0.0
2.45GlyAsn: 2.45 ± 1.212
2.042GlyPro: 2.042 ± 0.433
1.633GlyGln: 1.633 ± 1.07
5.717GlyArg: 5.717 ± 1.807
4.492GlySer: 4.492 ± 1.601
5.308GlyThr: 5.308 ± 1.435
4.492GlyVal: 4.492 ± 1.121
0.0GlyTrp: 0.0 ± 0.0
0.408GlyTyr: 0.408 ± 0.369
0.0GlyXaa: 0.0 ± 0.0
His
0.408HisAla: 0.408 ± 0.317
0.408HisCys: 0.408 ± 0.317
0.408HisAsp: 0.408 ± 0.464
0.408HisGlu: 0.408 ± 0.317
1.633HisPhe: 1.633 ± 1.094
0.408HisGly: 0.408 ± 0.472
0.0HisHis: 0.0 ± 0.0
0.817HisIle: 0.817 ± 0.704
0.817HisLys: 0.817 ± 0.456
1.633HisLeu: 1.633 ± 0.842
0.408HisMet: 0.408 ± 0.408
1.225HisAsn: 1.225 ± 0.396
2.042HisPro: 2.042 ± 0.69
0.817HisGln: 0.817 ± 0.404
1.225HisArg: 1.225 ± 0.409
1.225HisSer: 1.225 ± 0.396
0.408HisThr: 0.408 ± 0.377
1.225HisVal: 1.225 ± 0.448
0.817HisTrp: 0.817 ± 0.603
0.817HisTyr: 0.817 ± 0.485
0.0HisXaa: 0.0 ± 0.0
Ile
4.083IleAla: 4.083 ± 0.903
1.225IleCys: 1.225 ± 0.493
2.858IleAsp: 2.858 ± 0.973
3.675IleGlu: 3.675 ± 0.932
2.45IlePhe: 2.45 ± 0.59
5.308IleGly: 5.308 ± 2.112
0.0IleHis: 0.0 ± 0.0
3.267IleIle: 3.267 ± 1.6
1.633IleLys: 1.633 ± 1.199
4.083IleLeu: 4.083 ± 0.47
1.225IleMet: 1.225 ± 0.744
3.675IleAsn: 3.675 ± 1.19
4.083IlePro: 4.083 ± 1.896
4.083IleGln: 4.083 ± 1.055
2.858IleArg: 2.858 ± 1.148
3.675IleSer: 3.675 ± 1.107
2.858IleThr: 2.858 ± 0.897
1.225IleVal: 1.225 ± 0.307
0.817IleTrp: 0.817 ± 0.456
1.225IleTyr: 1.225 ± 0.409
0.0IleXaa: 0.0 ± 0.0
Lys
2.45LysAla: 2.45 ± 0.894
1.633LysCys: 1.633 ± 0.637
2.042LysAsp: 2.042 ± 0.821
3.675LysGlu: 3.675 ± 1.525
2.45LysPhe: 2.45 ± 1.271
3.267LysGly: 3.267 ± 0.762
2.042LysHis: 2.042 ± 0.962
3.675LysIle: 3.675 ± 1.176
2.45LysLys: 2.45 ± 1.243
4.492LysLeu: 4.492 ± 1.493
1.225LysMet: 1.225 ± 0.663
2.042LysAsn: 2.042 ± 1.162
0.408LysPro: 0.408 ± 0.317
3.267LysGln: 3.267 ± 0.632
5.308LysArg: 5.308 ± 1.096
5.308LysSer: 5.308 ± 2.526
2.042LysThr: 2.042 ± 0.792
2.45LysVal: 2.45 ± 0.543
0.408LysTrp: 0.408 ± 0.464
2.858LysTyr: 2.858 ± 1.108
0.0LysXaa: 0.0 ± 0.0
Leu
4.492LeuAla: 4.492 ± 0.558
1.633LeuCys: 1.633 ± 1.122
6.533LeuAsp: 6.533 ± 0.776
4.492LeuGlu: 4.492 ± 1.4
5.717LeuPhe: 5.717 ± 1.71
3.267LeuGly: 3.267 ± 1.767
2.042LeuHis: 2.042 ± 1.15
4.492LeuIle: 4.492 ± 1.366
4.083LeuLys: 4.083 ± 1.19
8.167LeuLeu: 8.167 ± 1.924
0.817LeuMet: 0.817 ± 0.554
2.45LeuAsn: 2.45 ± 0.692
4.492LeuPro: 4.492 ± 1.067
6.125LeuGln: 6.125 ± 1.491
2.45LeuArg: 2.45 ± 0.886
5.717LeuSer: 5.717 ± 0.461
6.533LeuThr: 6.533 ± 0.88
6.533LeuVal: 6.533 ± 0.954
0.817LeuTrp: 0.817 ± 0.404
4.083LeuTyr: 4.083 ± 0.655
0.0LeuXaa: 0.0 ± 0.0
Met
2.45MetAla: 2.45 ± 0.884
0.817MetCys: 0.817 ± 0.404
0.817MetAsp: 0.817 ± 0.443
0.408MetGlu: 0.408 ± 0.317
0.408MetPhe: 0.408 ± 0.317
0.408MetGly: 0.408 ± 0.432
0.0MetHis: 0.0 ± 0.0
0.408MetIle: 0.408 ± 0.464
0.817MetLys: 0.817 ± 0.495
1.225MetLeu: 1.225 ± 0.602
0.0MetMet: 0.0 ± 0.0
0.408MetAsn: 0.408 ± 0.352
0.408MetPro: 0.408 ± 0.369
1.225MetGln: 1.225 ± 0.703
0.817MetArg: 0.817 ± 0.633
0.408MetSer: 0.408 ± 0.317
1.633MetThr: 1.633 ± 0.56
1.225MetVal: 1.225 ± 0.448
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
1.225AsnAla: 1.225 ± 0.448
1.225AsnCys: 1.225 ± 0.915
2.858AsnAsp: 2.858 ± 1.143
2.45AsnGlu: 2.45 ± 0.922
2.042AsnPhe: 2.042 ± 0.456
2.042AsnGly: 2.042 ± 0.766
0.408AsnHis: 0.408 ± 0.317
2.042AsnIle: 2.042 ± 0.387
3.675AsnLys: 3.675 ± 0.952
4.492AsnLeu: 4.492 ± 1.203
1.225AsnMet: 1.225 ± 0.621
1.633AsnAsn: 1.633 ± 0.665
2.042AsnPro: 2.042 ± 0.884
3.267AsnGln: 3.267 ± 0.805
2.858AsnArg: 2.858 ± 1.206
1.633AsnSer: 1.633 ± 0.241
3.675AsnThr: 3.675 ± 1.399
3.675AsnVal: 3.675 ± 0.471
1.225AsnTrp: 1.225 ± 0.689
1.633AsnTyr: 1.633 ± 0.779
0.0AsnXaa: 0.0 ± 0.0
Pro
2.45ProAla: 2.45 ± 0.976
1.225ProCys: 1.225 ± 0.661
5.308ProAsp: 5.308 ± 1.631
2.45ProGlu: 2.45 ± 0.77
1.633ProPhe: 1.633 ± 0.448
2.45ProGly: 2.45 ± 1.054
0.408ProHis: 0.408 ± 0.377
3.675ProIle: 3.675 ± 1.054
4.492ProLys: 4.492 ± 0.747
6.533ProLeu: 6.533 ± 1.114
0.0ProMet: 0.0 ± 0.0
2.858ProAsn: 2.858 ± 0.911
7.35ProPro: 7.35 ± 2.039
4.9ProGln: 4.9 ± 1.12
1.633ProArg: 1.633 ± 0.241
2.858ProSer: 2.858 ± 1.017
2.45ProThr: 2.45 ± 0.976
6.125ProVal: 6.125 ± 2.456
0.0ProTrp: 0.0 ± 0.0
2.042ProTyr: 2.042 ± 0.96
0.0ProXaa: 0.0 ± 0.0
Gln
2.858GlnAla: 2.858 ± 0.777
2.45GlnCys: 2.45 ± 0.88
4.9GlnAsp: 4.9 ± 0.855
1.633GlnGlu: 1.633 ± 0.532
4.083GlnPhe: 4.083 ± 0.904
2.45GlnGly: 2.45 ± 0.82
1.225GlnHis: 1.225 ± 0.548
3.267GlnIle: 3.267 ± 1.006
2.042GlnLys: 2.042 ± 1.125
5.308GlnLeu: 5.308 ± 2.262
1.225GlnMet: 1.225 ± 0.744
2.858GlnAsn: 2.858 ± 1.272
3.675GlnPro: 3.675 ± 1.19
1.633GlnGln: 1.633 ± 1.049
2.042GlnArg: 2.042 ± 1.204
4.492GlnSer: 4.492 ± 0.929
1.633GlnThr: 1.633 ± 0.86
2.858GlnVal: 2.858 ± 0.616
1.225GlnTrp: 1.225 ± 0.635
2.45GlnTyr: 2.45 ± 0.778
0.0GlnXaa: 0.0 ± 0.0
Arg
3.675ArgAla: 3.675 ± 1.404
0.817ArgCys: 0.817 ± 0.864
2.042ArgAsp: 2.042 ± 0.815
4.492ArgGlu: 4.492 ± 0.837
2.858ArgPhe: 2.858 ± 1.117
2.858ArgGly: 2.858 ± 0.582
1.633ArgHis: 1.633 ± 0.665
1.225ArgIle: 1.225 ± 0.646
5.308ArgLys: 5.308 ± 1.062
5.717ArgLeu: 5.717 ± 0.744
0.408ArgMet: 0.408 ± 0.377
0.817ArgAsn: 0.817 ± 0.411
2.858ArgPro: 2.858 ± 1.045
2.858ArgGln: 2.858 ± 0.592
6.533ArgArg: 6.533 ± 1.577
4.083ArgSer: 4.083 ± 0.701
3.267ArgThr: 3.267 ± 0.68
4.492ArgVal: 4.492 ± 1.688
0.0ArgTrp: 0.0 ± 0.0
1.225ArgTyr: 1.225 ± 0.706
0.0ArgXaa: 0.0 ± 0.0
Ser
3.267SerAla: 3.267 ± 1.442
0.817SerCys: 0.817 ± 0.495
6.125SerAsp: 6.125 ± 1.614
2.858SerGlu: 2.858 ± 1.134
3.675SerPhe: 3.675 ± 1.107
6.942SerGly: 6.942 ± 1.815
1.225SerHis: 1.225 ± 0.628
2.042SerIle: 2.042 ± 0.884
2.45SerLys: 2.45 ± 0.982
4.492SerLeu: 4.492 ± 0.721
0.817SerMet: 0.817 ± 0.468
3.267SerAsn: 3.267 ± 1.94
4.9SerPro: 4.9 ± 1.636
2.45SerGln: 2.45 ± 1.032
4.083SerArg: 4.083 ± 1.045
4.492SerSer: 4.492 ± 1.547
7.758SerThr: 7.758 ± 1.924
4.083SerVal: 4.083 ± 0.866
0.0SerTrp: 0.0 ± 0.0
2.45SerTyr: 2.45 ± 0.665
0.0SerXaa: 0.0 ± 0.0
Thr
2.042ThrAla: 2.042 ± 0.643
0.408ThrCys: 0.408 ± 0.352
4.9ThrAsp: 4.9 ± 0.731
4.9ThrGlu: 4.9 ± 1.073
2.858ThrPhe: 2.858 ± 1.054
4.083ThrGly: 4.083 ± 1.818
1.633ThrHis: 1.633 ± 0.886
4.083ThrIle: 4.083 ± 1.011
3.267ThrLys: 3.267 ± 1.062
5.308ThrLeu: 5.308 ± 1.425
0.817ThrMet: 0.817 ± 0.633
2.858ThrAsn: 2.858 ± 1.497
6.125ThrPro: 6.125 ± 1.703
2.45ThrGln: 2.45 ± 0.477
2.858ThrArg: 2.858 ± 1.453
4.492ThrSer: 4.492 ± 1.124
2.858ThrThr: 2.858 ± 0.848
4.492ThrVal: 4.492 ± 1.149
0.817ThrTrp: 0.817 ± 0.456
1.225ThrTyr: 1.225 ± 0.557
0.0ThrXaa: 0.0 ± 0.0
Val
3.675ValAla: 3.675 ± 0.745
1.633ValCys: 1.633 ± 1.316
7.758ValAsp: 7.758 ± 1.881
4.492ValGlu: 4.492 ± 0.658
1.225ValPhe: 1.225 ± 0.557
2.858ValGly: 2.858 ± 1.046
1.633ValHis: 1.633 ± 0.905
3.675ValIle: 3.675 ± 1.552
1.633ValLys: 1.633 ± 0.637
4.083ValLeu: 4.083 ± 0.892
0.817ValMet: 0.817 ± 0.404
0.817ValAsn: 0.817 ± 0.633
4.492ValPro: 4.492 ± 1.841
2.858ValGln: 2.858 ± 0.967
4.9ValArg: 4.9 ± 1.14
6.942ValSer: 6.942 ± 1.185
4.083ValThr: 4.083 ± 1.11
2.858ValVal: 2.858 ± 1.679
2.042ValTrp: 2.042 ± 1.098
3.675ValTyr: 3.675 ± 0.564
0.0ValXaa: 0.0 ± 0.0
Trp
0.408TrpAla: 0.408 ± 0.317
0.0TrpCys: 0.0 ± 0.0
0.817TrpAsp: 0.817 ± 0.443
0.0TrpGlu: 0.0 ± 0.0
0.817TrpPhe: 0.817 ± 0.456
0.817TrpGly: 0.817 ± 0.485
0.817TrpHis: 0.817 ± 0.456
0.817TrpIle: 0.817 ± 0.633
1.633TrpLys: 1.633 ± 1.019
0.817TrpLeu: 0.817 ± 0.443
0.408TrpMet: 0.408 ± 0.333
0.817TrpAsn: 0.817 ± 0.755
0.0TrpPro: 0.0 ± 0.0
0.817TrpGln: 0.817 ± 0.704
0.817TrpArg: 0.817 ± 0.443
0.408TrpSer: 0.408 ± 0.377
1.225TrpThr: 1.225 ± 0.448
1.225TrpVal: 1.225 ± 0.689
0.0TrpTrp: 0.0 ± 0.0
0.408TrpTyr: 0.408 ± 0.317
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.45TyrAla: 2.45 ± 1.055
0.817TyrCys: 0.817 ± 0.509
1.225TyrAsp: 1.225 ± 0.557
0.817TyrGlu: 0.817 ± 0.509
3.675TyrPhe: 3.675 ± 1.15
2.45TyrGly: 2.45 ± 0.707
0.0TyrHis: 0.0 ± 0.0
1.225TyrIle: 1.225 ± 0.599
3.675TyrLys: 3.675 ± 1.052
2.858TyrLeu: 2.858 ± 0.423
0.817TyrMet: 0.817 ± 0.633
1.225TyrAsn: 1.225 ± 0.409
2.042TyrPro: 2.042 ± 0.562
0.817TyrGln: 0.817 ± 0.443
2.45TyrArg: 2.45 ± 1.242
1.225TyrSer: 1.225 ± 0.646
2.042TyrThr: 2.042 ± 0.623
2.858TyrVal: 2.858 ± 1.215
0.817TyrTrp: 0.817 ± 0.443
1.633TyrTyr: 1.633 ± 1.094
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2450 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski