Amino acid dipeptide frequency for Human papillomavirus type 41

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
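As a rough illustration of what a dipeptide frequency is, the sketch below counts overlapping pairs of adjacent residues and reports them in per mille. The function name, the toy sequences, and the choice to count pairs within each protein only are assumptions made for the example. The page does not describe the exact counting or normalisation behind the table, and the sketch uses one-letter codes rather than the three-letter codes shown in the table.

```python
from collections import Counter


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: pairs are counted inside each protein and never span
    two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        # zip(seq, seq[1:]) yields every overlapping pair of neighbours.
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Made-up sequences, not taken from the proteome described here.
toy_proteome = ["MKKLA", "MKA"]
freqs = dipeptide_permille(toy_proteome)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

With real data, the list would hold one string per protein of the proteome.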

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random with all amino acids equally likely, each dipeptide would be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
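The uniform expectation can be checked with a few lines of arithmetic: 1/400 is 0.25%, or 2.5‰. The sketch below compares a handful of values copied from the table with that baseline. The simple greater-than/less-than rule and the names used are assumptions for illustration, since the page does not state the exact rule behind the red and blue marking.

```python
# Uniform expectation for the 20 x 20 standard dipeptides, in per mille.
EXPECTED_PERMILLE = 1000.0 / 400  # 2.5


def classify(freq_permille, expected=EXPECTED_PERMILLE):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"


# Values copied from the table on this page (per mille).
examples = {"ArgArg": 8.683, "AlaAla": 3.98, "TrpTrp": 0.362, "GlyPhe": 0.0}
labels = {pair: classify(value) for pair, value in examples.items()}
for pair, label in labels.items():
    print(pair, examples[pair], label)
```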

All values are given in per mille (‰), so they must be multiplied by 10⁻³ to obtain fractions (for example, 3.98‰ = 0.00398).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.98 ± 0.61
AlaCys: 1.085 ± 0.967
AlaAsp: 4.342 ± 1.05
AlaGlu: 5.065 ± 1.541
AlaPhe: 3.618 ± 1.338
AlaGly: 3.256 ± 0.879
AlaHis: 1.085 ± 0.626
AlaIle: 2.894 ± 1.223
AlaLys: 3.256 ± 1.041
AlaLeu: 5.789 ± 1.138
AlaMet: 2.894 ± 1.365
AlaAsn: 2.171 ± 0.699
AlaPro: 3.618 ± 1.042
AlaGln: 1.447 ± 0.49
AlaArg: 4.342 ± 1.6
AlaSer: 4.342 ± 1.322
AlaThr: 2.533 ± 1.254
AlaVal: 2.894 ± 0.574
AlaTrp: 0.362 ± 0.473
AlaTyr: 1.085 ± 0.419
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.085 ± 0.526
CysCys: 1.447 ± 0.82
CysAsp: 0.724 ± 0.543
CysGlu: 0.724 ± 0.483
CysPhe: 0.724 ± 0.712
CysGly: 1.085 ± 0.702
CysHis: 0.724 ± 0.657
CysIle: 1.447 ± 1.165
CysLys: 1.447 ± 0.613
CysLeu: 2.894 ± 1.401
CysMet: 0.362 ± 0.415
CysAsn: 0.362 ± 0.672
CysPro: 1.447 ± 0.787
CysGln: 1.085 ± 0.891
CysArg: 0.362 ± 0.473
CysSer: 1.447 ± 0.909
CysThr: 1.085 ± 0.886
CysVal: 1.809 ± 1.275
CysTrp: 0.0 ± 0.0
CysTyr: 0.362 ± 0.415
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.065 ± 1.625
AspCys: 0.362 ± 0.312
AspAsp: 4.703 ± 0.767
AspGlu: 3.618 ± 1.332
AspPhe: 0.362 ± 0.373
AspGly: 4.703 ± 1.619
AspHis: 2.171 ± 0.838
AspIle: 7.236 ± 2.098
AspLys: 1.809 ± 1.015
AspLeu: 6.151 ± 1.31
AspMet: 0.724 ± 0.424
AspAsn: 2.533 ± 1.126
AspPro: 3.618 ± 1.066
AspGln: 1.809 ± 0.614
AspArg: 2.171 ± 1.114
AspSer: 4.342 ± 0.884
AspThr: 4.342 ± 1.061
AspVal: 2.533 ± 1.445
AspTrp: 0.362 ± 0.332
AspTyr: 1.447 ± 0.47
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.533 ± 0.97
GluCys: 0.362 ± 0.287
GluAsp: 4.703 ± 0.921
GluGlu: 6.151 ± 1.442
GluPhe: 2.171 ± 0.661
GluGly: 4.703 ± 1.053
GluHis: 1.085 ± 0.624
GluIle: 3.256 ± 1.175
GluLys: 2.171 ± 1.155
GluLeu: 3.618 ± 0.98
GluMet: 0.362 ± 0.373
GluAsn: 5.065 ± 1.109
GluPro: 2.533 ± 1.179
GluGln: 3.98 ± 1.686
GluArg: 4.342 ± 1.8
GluSer: 5.065 ± 1.802
GluThr: 3.618 ± 1.761
GluVal: 4.342 ± 0.909
GluTrp: 0.362 ± 0.373
GluTyr: 1.809 ± 0.932
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.447 ± 0.58
PheCys: 1.085 ± 0.565
PheAsp: 2.894 ± 0.617
PheGlu: 2.171 ± 0.591
PhePhe: 3.256 ± 0.876
PheGly: 0.362 ± 0.312
PheHis: 1.085 ± 0.86
PheIle: 2.171 ± 0.696
PheLys: 1.447 ± 0.882
PheLeu: 5.789 ± 1.675
PheMet: 1.447 ± 0.624
PheAsn: 1.447 ± 0.958
PhePro: 1.809 ± 0.599
PheGln: 1.809 ± 0.912
PheArg: 1.809 ± 0.525
PheSer: 2.171 ± 0.939
PheThr: 2.533 ± 0.847
PheVal: 2.171 ± 0.896
PheTrp: 1.809 ± 0.932
PheTyr: 1.447 ± 0.958
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.98 ± 0.981
GlyCys: 0.362 ± 0.481
GlyAsp: 3.98 ± 1.544
GlyGlu: 5.789 ± 1.406
GlyPhe: 0.0 ± 0.0
GlyGly: 6.512 ± 3.045
GlyHis: 2.171 ± 1.073
GlyIle: 6.151 ± 1.555
GlyLys: 2.171 ± 0.922
GlyLeu: 3.98 ± 1.178
GlyMet: 0.362 ± 0.615
GlyAsn: 4.703 ± 1.023
GlyPro: 3.98 ± 0.907
GlyGln: 2.894 ± 0.644
GlyArg: 5.065 ± 2.365
GlySer: 4.703 ± 1.24
GlyThr: 5.789 ± 1.363
GlyVal: 3.618 ± 0.7
GlyTrp: 0.362 ± 0.373
GlyTyr: 0.724 ± 0.573
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.533 ± 0.899
HisCys: 0.0 ± 0.0
HisAsp: 0.362 ± 0.287
HisGlu: 1.085 ± 0.523
HisPhe: 2.533 ± 1.103
HisGly: 0.724 ± 0.573
HisHis: 0.362 ± 0.287
HisIle: 1.085 ± 0.41
HisLys: 0.0 ± 0.0
HisLeu: 1.809 ± 0.836
HisMet: 1.085 ± 0.579
HisAsn: 1.085 ± 0.647
HisPro: 1.085 ± 0.58
HisGln: 0.724 ± 0.684
HisArg: 2.894 ± 1.224
HisSer: 1.085 ± 0.898
HisThr: 1.447 ± 0.773
HisVal: 0.362 ± 0.287
HisTrp: 0.362 ± 0.287
HisTyr: 1.809 ± 0.613
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.447 ± 1.033
IleCys: 0.724 ± 0.483
IleAsp: 4.342 ± 1.469
IleGlu: 4.703 ± 1.835
IlePhe: 0.724 ± 0.405
IleGly: 3.618 ± 1.306
IleHis: 1.085 ± 0.476
IleIle: 3.618 ± 1.029
IleLys: 1.447 ± 0.601
IleLeu: 6.151 ± 0.952
IleMet: 2.533 ± 0.664
IleAsn: 0.362 ± 0.332
IlePro: 2.894 ± 1.425
IleGln: 2.171 ± 0.47
IleArg: 4.703 ± 1.372
IleSer: 3.98 ± 1.425
IleThr: 3.618 ± 0.702
IleVal: 4.342 ± 1.403
IleTrp: 0.724 ± 0.535
IleTyr: 1.447 ± 0.682
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.256 ± 0.847
LysCys: 1.085 ± 0.842
LysAsp: 2.533 ± 1.051
LysGlu: 2.171 ± 0.827
LysPhe: 3.98 ± 1.344
LysGly: 2.171 ± 1.339
LysHis: 0.724 ± 0.573
LysIle: 0.724 ± 0.535
LysLys: 1.809 ± 0.782
LysLeu: 2.533 ± 1.405
LysMet: 1.809 ± 0.47
LysAsn: 1.809 ± 0.792
LysPro: 1.447 ± 0.633
LysGln: 1.809 ± 1.418
LysArg: 5.065 ± 1.122
LysSer: 3.98 ± 1.556
LysThr: 2.533 ± 1.047
LysVal: 1.809 ± 1.058
LysTrp: 0.724 ± 0.747
LysTyr: 1.447 ± 0.333
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.151 ± 1.323
LeuCys: 2.533 ± 1.373
LeuAsp: 5.065 ± 0.942
LeuGlu: 5.427 ± 1.28
LeuPhe: 3.98 ± 1.576
LeuGly: 5.427 ± 1.622
LeuHis: 2.171 ± 0.662
LeuIle: 2.894 ± 1.138
LeuLys: 3.98 ± 0.978
LeuLeu: 7.959 ± 2.071
LeuMet: 1.085 ± 0.832
LeuAsn: 3.256 ± 1.632
LeuPro: 3.618 ± 1.015
LeuGln: 5.427 ± 1.289
LeuArg: 7.598 ± 1.761
LeuSer: 6.151 ± 2.096
LeuThr: 5.789 ± 1.771
LeuVal: 6.151 ± 1.592
LeuTrp: 1.447 ± 0.794
LeuTyr: 4.342 ± 0.863
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.894 ± 1.301
MetCys: 0.362 ± 0.287
MetAsp: 1.085 ± 0.672
MetGlu: 1.809 ± 0.954
MetPhe: 0.724 ± 0.424
MetGly: 0.362 ± 0.287
MetHis: 0.362 ± 0.287
MetIle: 1.085 ± 0.647
MetLys: 1.085 ± 1.109
MetLeu: 2.171 ± 0.9
MetMet: 0.362 ± 0.473
MetAsn: 0.0 ± 0.0
MetPro: 1.085 ± 0.645
MetGln: 1.085 ± 0.419
MetArg: 1.809 ± 0.91
MetSer: 1.447 ± 0.58
MetThr: 1.447 ± 0.621
MetVal: 1.085 ± 0.666
MetTrp: 0.0 ± 0.0
MetTyr: 1.447 ± 0.867
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.98 ± 1.319
AsnCys: 0.724 ± 0.538
AsnAsp: 2.171 ± 0.729
AsnGlu: 2.894 ± 1.036
AsnPhe: 1.809 ± 0.702
AsnGly: 3.256 ± 0.891
AsnHis: 0.362 ± 0.672
AsnIle: 2.171 ± 0.927
AsnLys: 3.256 ± 1.403
AsnLeu: 2.894 ± 1.044
AsnMet: 0.724 ± 0.573
AsnAsn: 2.533 ± 0.947
AsnPro: 3.618 ± 1.571
AsnGln: 1.447 ± 0.833
AsnArg: 1.809 ± 0.836
AsnSer: 3.256 ± 1.172
AsnThr: 1.809 ± 0.888
AsnVal: 1.809 ± 0.657
AsnTrp: 0.362 ± 0.287
AsnTyr: 0.0 ± 0.0
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.256 ± 0.864
ProCys: 0.724 ± 0.661
ProAsp: 1.447 ± 0.613
ProGlu: 3.98 ± 0.912
ProPhe: 3.98 ± 1.024
ProGly: 1.809 ± 0.651
ProHis: 0.724 ± 0.83
ProIle: 2.894 ± 1.781
ProLys: 3.618 ± 1.196
ProLeu: 6.151 ± 1.581
ProMet: 1.447 ± 0.762
ProAsn: 2.171 ± 1.234
ProPro: 3.256 ± 0.944
ProGln: 1.447 ± 1.022
ProArg: 3.256 ± 1.435
ProSer: 6.512 ± 2.588
ProThr: 3.618 ± 1.421
ProVal: 2.171 ± 1.805
ProTrp: 0.0 ± 0.0
ProTyr: 2.171 ± 0.927
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.447 ± 1.078
GlnCys: 2.171 ± 1.225
GlnAsp: 1.085 ± 0.604
GlnGlu: 2.894 ± 1.13
GlnPhe: 1.085 ± 0.647
GlnGly: 2.894 ± 1.155
GlnHis: 0.724 ± 0.375
GlnIle: 2.171 ± 1.169
GlnLys: 1.809 ± 1.127
GlnLeu: 3.256 ± 1.495
GlnMet: 1.447 ± 0.763
GlnAsn: 1.085 ± 0.646
GlnPro: 2.533 ± 0.548
GlnGln: 3.256 ± 0.968
GlnArg: 4.342 ± 1.235
GlnSer: 3.618 ± 1.866
GlnThr: 3.256 ± 1.128
GlnVal: 1.447 ± 1.078
GlnTrp: 1.085 ± 0.643
GlnTyr: 1.447 ± 0.893
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.98 ± 1.043
ArgCys: 3.618 ± 1.624
ArgAsp: 2.171 ± 0.802
ArgGlu: 3.618 ± 1.793
ArgPhe: 3.256 ± 0.811
ArgGly: 7.236 ± 1.326
ArgHis: 2.894 ± 0.792
ArgIle: 1.085 ± 1.123
ArgLys: 4.703 ± 1.056
ArgLeu: 6.874 ± 1.299
ArgMet: 1.809 ± 0.752
ArgAsn: 3.256 ± 2.16
ArgPro: 5.427 ± 2.356
ArgGln: 2.533 ± 0.836
ArgArg: 8.683 ± 3.216
ArgSer: 2.533 ± 0.952
ArgThr: 4.703 ± 0.728
ArgVal: 5.789 ± 1.853
ArgTrp: 0.362 ± 0.408
ArgTyr: 2.894 ± 0.716
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.894 ± 0.94
SerCys: 1.447 ± 1.272
SerAsp: 7.598 ± 1.396
SerGlu: 2.171 ± 0.981
SerPhe: 1.085 ± 0.86
SerGly: 6.512 ± 2.154
SerHis: 0.724 ± 0.482
SerIle: 3.618 ± 1.324
SerLys: 1.447 ± 0.729
SerLeu: 7.598 ± 1.086
SerMet: 0.724 ± 0.355
SerAsn: 2.533 ± 0.803
SerPro: 3.98 ± 1.306
SerGln: 3.618 ± 1.057
SerArg: 6.874 ± 1.356
SerSer: 6.151 ± 1.731
SerThr: 5.427 ± 1.74
SerVal: 7.598 ± 1.992
SerTrp: 0.724 ± 0.475
SerTyr: 1.447 ± 0.749
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.618 ± 0.942
ThrCys: 1.085 ± 0.493
ThrAsp: 5.789 ± 1.386
ThrGlu: 3.256 ± 1.394
ThrPhe: 2.894 ± 0.666
ThrGly: 5.427 ± 0.763
ThrHis: 0.724 ± 0.355
ThrIle: 4.342 ± 1.539
ThrLys: 2.894 ± 1.328
ThrLeu: 4.342 ± 0.889
ThrMet: 1.085 ± 0.514
ThrAsn: 1.085 ± 0.647
ThrPro: 4.342 ± 0.975
ThrGln: 2.533 ± 1.141
ThrArg: 5.065 ± 1.344
ThrSer: 4.342 ± 1.15
ThrThr: 4.703 ± 1.476
ThrVal: 3.618 ± 1.088
ThrTrp: 0.724 ± 0.433
ThrTyr: 1.085 ± 0.526
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.342 ± 0.934
ValCys: 1.085 ± 1.144
ValAsp: 3.618 ± 1.202
ValGlu: 2.894 ± 1.043
ValPhe: 2.171 ± 1.187
ValGly: 4.703 ± 1.551
ValHis: 1.809 ± 0.875
ValIle: 1.085 ± 0.909
ValLys: 1.809 ± 0.6
ValLeu: 5.789 ± 1.21
ValMet: 0.0 ± 0.0
ValAsn: 2.171 ± 0.748
ValPro: 3.618 ± 1.347
ValGln: 2.533 ± 1.396
ValArg: 4.342 ± 1.178
ValSer: 6.874 ± 1.047
ValThr: 3.618 ± 1.443
ValVal: 2.894 ± 0.724
ValTrp: 1.447 ± 0.877
ValTyr: 2.533 ± 0.921
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.362 ± 0.287
TrpCys: 0.0 ± 0.0
TrpAsp: 0.362 ± 0.332
TrpGlu: 0.362 ± 0.332
TrpPhe: 0.362 ± 0.287
TrpGly: 0.724 ± 0.483
TrpHis: 1.085 ± 0.775
TrpIle: 0.0 ± 0.0
TrpLys: 1.085 ± 0.833
TrpLeu: 1.085 ± 0.647
TrpMet: 0.362 ± 0.473
TrpAsn: 0.724 ± 0.433
TrpPro: 0.0 ± 0.0
TrpGln: 0.362 ± 0.461
TrpArg: 1.809 ± 0.941
TrpSer: 1.085 ± 0.89
TrpThr: 0.362 ± 0.373
TrpVal: 0.724 ± 0.433
TrpTrp: 0.362 ± 0.473
TrpTyr: 0.724 ± 0.747
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.809 ± 0.613
TyrCys: 0.362 ± 0.415
TyrAsp: 1.085 ± 0.705
TyrGlu: 1.447 ± 0.57
TyrPhe: 1.447 ± 0.531
TyrGly: 1.809 ± 0.491
TyrHis: 0.362 ± 0.287
TyrIle: 4.703 ± 1.361
TyrLys: 2.171 ± 0.599
TyrLeu: 3.618 ± 1.009
TyrMet: 0.724 ± 0.551
TyrAsn: 2.533 ± 1.034
TyrPro: 1.085 ± 0.493
TyrGln: 1.085 ± 0.604
TyrArg: 1.085 ± 0.663
TyrSer: 1.085 ± 0.408
TyrThr: 0.724 ± 0.573
TyrVal: 2.171 ± 0.838
TyrTrp: 0.362 ± 0.332
TyrTyr: 2.894 ± 1.238
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 11 proteins (2765 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
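A protein-level bootstrap of this kind could be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the resampled values serves as the error. The function names, the toy sequences, the fixed seed, and the use of the sample standard deviation as the spread measure are assumptions for illustration. The page does not say which statistic is reported after the ± sign.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over a list of sequences."""
    hits = 0
    total = 0
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            total += 1
            if first + second == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_sd(proteins, pair, n_resamples=100, seed=0):
    """Spread of the frequency across resamples drawn at the protein level."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Made-up sequences, not taken from the proteome described here.
toy_proteome = ["MKKLA", "MKA", "MALKK"]
print(bootstrap_sd(toy_proteome, "MK"))
print(bootstrap_sd(toy_proteome, "WW"))  # pair never occurs, so no spread
```

A dipeptide that is absent from every protein has zero frequency in every resample, so its estimated error is zero as well.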

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski