Amino acid dipepetide frequency for human papillomavirus 118

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.37AlaAla: 3.37 ± 0.651
1.264AlaCys: 1.264 ± 0.767
3.37AlaAsp: 3.37 ± 0.646
4.634AlaGlu: 4.634 ± 1.386
2.949AlaPhe: 2.949 ± 1.709
4.212AlaGly: 4.212 ± 1.284
1.264AlaHis: 1.264 ± 0.353
2.949AlaIle: 2.949 ± 0.657
2.949AlaLys: 2.949 ± 0.83
3.791AlaLeu: 3.791 ± 1.091
0.842AlaMet: 0.842 ± 0.434
1.685AlaAsn: 1.685 ± 0.209
3.791AlaPro: 3.791 ± 0.802
2.527AlaGln: 2.527 ± 1.11
4.212AlaArg: 4.212 ± 0.81
4.634AlaSer: 4.634 ± 1.662
5.055AlaThr: 5.055 ± 1.707
4.212AlaVal: 4.212 ± 1.178
0.421AlaTrp: 0.421 ± 0.389
1.264AlaTyr: 1.264 ± 0.763
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
1.685CysCys: 1.685 ± 1.388
0.842CysAsp: 0.842 ± 0.662
0.421CysGlu: 0.421 ± 0.389
1.264CysPhe: 1.264 ± 0.353
1.264CysGly: 1.264 ± 1.371
0.421CysHis: 0.421 ± 0.597
1.685CysIle: 1.685 ± 1.055
3.37CysLys: 3.37 ± 1.379
0.842CysLeu: 0.842 ± 0.862
0.0CysMet: 0.0 ± 0.0
0.842CysAsn: 0.842 ± 1.194
1.685CysPro: 1.685 ± 0.958
0.0CysGln: 0.0 ± 0.0
0.842CysArg: 0.842 ± 0.862
1.685CysSer: 1.685 ± 1.046
1.685CysThr: 1.685 ± 1.055
1.264CysVal: 1.264 ± 0.674
0.421CysTrp: 0.421 ± 0.389
0.842CysTyr: 0.842 ± 0.734
0.0CysXaa: 0.0 ± 0.0
Asp
5.476AspAla: 5.476 ± 0.843
0.842AspCys: 0.842 ± 0.779
3.791AspAsp: 3.791 ± 1.21
2.106AspGlu: 2.106 ± 1.308
2.527AspPhe: 2.527 ± 0.937
2.106AspGly: 2.106 ± 0.877
0.842AspHis: 0.842 ± 0.482
5.476AspIle: 5.476 ± 1.692
1.685AspLys: 1.685 ± 1.015
5.897AspLeu: 5.897 ± 1.541
1.685AspMet: 1.685 ± 0.663
4.634AspAsn: 4.634 ± 0.513
3.791AspPro: 3.791 ± 1.177
1.685AspGln: 1.685 ± 0.544
2.527AspArg: 2.527 ± 1.248
4.212AspSer: 4.212 ± 1.023
4.212AspThr: 4.212 ± 1.583
3.37AspVal: 3.37 ± 0.646
0.842AspTrp: 0.842 ± 0.434
0.0AspTyr: 0.0 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
3.791GluAla: 3.791 ± 1.087
0.842GluCys: 0.842 ± 0.779
4.634GluAsp: 4.634 ± 1.541
7.582GluGlu: 7.582 ± 2.603
1.264GluPhe: 1.264 ± 0.67
2.106GluGly: 2.106 ± 1.241
1.685GluHis: 1.685 ± 0.557
3.37GluIle: 3.37 ± 1.024
2.106GluLys: 2.106 ± 0.692
5.897GluLeu: 5.897 ± 2.319
0.0GluMet: 0.0 ± 0.0
2.106GluAsn: 2.106 ± 0.382
2.949GluPro: 2.949 ± 1.361
4.634GluGln: 4.634 ± 1.164
3.37GluArg: 3.37 ± 1.134
5.055GluSer: 5.055 ± 1.566
2.949GluThr: 2.949 ± 1.271
5.476GluVal: 5.476 ± 0.701
0.842GluTrp: 0.842 ± 0.4
2.106GluTyr: 2.106 ± 1.213
0.0GluXaa: 0.0 ± 0.0
Phe
2.527PheAla: 2.527 ± 1.472
1.264PheCys: 1.264 ± 1.35
2.949PheAsp: 2.949 ± 0.542
4.212PheGlu: 4.212 ± 2.312
2.106PhePhe: 2.106 ± 0.775
1.685PheGly: 1.685 ± 0.872
0.421PheHis: 0.421 ± 0.597
2.106PheIle: 2.106 ± 1.171
3.37PheLys: 3.37 ± 2.002
3.37PheLeu: 3.37 ± 0.985
0.0PheMet: 0.0 ± 0.0
2.527PheAsn: 2.527 ± 1.99
2.106PhePro: 2.106 ± 0.961
2.106PheGln: 2.106 ± 0.77
1.264PheArg: 1.264 ± 0.794
2.949PheSer: 2.949 ± 0.914
1.685PheThr: 1.685 ± 1.055
2.527PheVal: 2.527 ± 0.482
1.685PheTrp: 1.685 ± 0.868
2.106PheTyr: 2.106 ± 0.968
0.0PheXaa: 0.0 ± 0.0
Gly
2.527GlyAla: 2.527 ± 0.748
2.106GlyCys: 2.106 ± 0.797
2.949GlyAsp: 2.949 ± 0.653
2.106GlyGlu: 2.106 ± 0.842
1.264GlyPhe: 1.264 ± 0.674
3.791GlyGly: 3.791 ± 1.35
2.949GlyHis: 2.949 ± 1.099
2.949GlyIle: 2.949 ± 1.506
3.791GlyLys: 3.791 ± 1.76
2.527GlyLeu: 2.527 ± 0.825
0.0GlyMet: 0.0 ± 0.0
5.055GlyAsn: 5.055 ± 2.321
2.949GlyPro: 2.949 ± 1.079
3.791GlyGln: 3.791 ± 0.821
6.74GlyArg: 6.74 ± 2.932
3.791GlySer: 3.791 ± 0.797
4.634GlyThr: 4.634 ± 1.997
1.264GlyVal: 1.264 ± 0.369
0.421GlyTrp: 0.421 ± 0.597
2.106GlyTyr: 2.106 ± 0.948
0.0GlyXaa: 0.0 ± 0.0
His
0.421HisAla: 0.421 ± 0.426
0.842HisCys: 0.842 ± 0.7
1.685HisAsp: 1.685 ± 0.677
0.0HisGlu: 0.0 ± 0.0
1.264HisPhe: 1.264 ± 0.763
1.685HisGly: 1.685 ± 0.882
0.421HisHis: 0.421 ± 0.597
2.106HisIle: 2.106 ± 0.715
1.264HisLys: 1.264 ± 0.689
1.264HisLeu: 1.264 ± 0.419
0.0HisMet: 0.0 ± 0.0
1.685HisAsn: 1.685 ± 0.615
2.106HisPro: 2.106 ± 0.719
0.421HisGln: 0.421 ± 0.389
0.842HisArg: 0.842 ± 0.682
2.106HisSer: 2.106 ± 1.241
0.842HisThr: 0.842 ± 0.482
0.421HisVal: 0.421 ± 0.426
1.685HisTrp: 1.685 ± 0.598
0.842HisTyr: 0.842 ± 0.4
0.0HisXaa: 0.0 ± 0.0
Ile
2.527IleAla: 2.527 ± 0.968
0.842IleCys: 0.842 ± 0.682
3.791IleAsp: 3.791 ± 0.693
3.791IleGlu: 3.791 ± 1.482
1.264IlePhe: 1.264 ± 0.369
5.476IleGly: 5.476 ± 1.775
1.685IleHis: 1.685 ± 0.557
2.527IleIle: 2.527 ± 1.25
1.264IleLys: 1.264 ± 0.689
4.634IleLeu: 4.634 ± 0.758
0.842IleMet: 0.842 ± 0.4
3.37IleAsn: 3.37 ± 0.795
3.37IlePro: 3.37 ± 1.662
0.842IleGln: 0.842 ± 0.851
1.264IleArg: 1.264 ± 0.978
2.949IleSer: 2.949 ± 1.301
2.949IleThr: 2.949 ± 0.723
3.37IleVal: 3.37 ± 1.05
0.421IleTrp: 0.421 ± 0.597
2.949IleTyr: 2.949 ± 0.914
0.0IleXaa: 0.0 ± 0.0
Lys
3.791LysAla: 3.791 ± 1.537
0.421LysCys: 0.421 ± 0.385
2.527LysAsp: 2.527 ± 1.107
2.106LysGlu: 2.106 ± 1.069
2.949LysPhe: 2.949 ± 1.462
2.949LysGly: 2.949 ± 1.781
0.842LysHis: 0.842 ± 0.779
2.106LysIle: 2.106 ± 1.034
1.264LysLys: 1.264 ± 1.156
2.527LysLeu: 2.527 ± 1.175
1.685LysMet: 1.685 ± 0.914
2.949LysAsn: 2.949 ± 1.211
0.842LysPro: 0.842 ± 0.851
1.685LysGln: 1.685 ± 0.608
5.476LysArg: 5.476 ± 1.224
5.897LysSer: 5.897 ± 1.827
1.264LysThr: 1.264 ± 0.658
3.37LysVal: 3.37 ± 1.048
0.0LysTrp: 0.0 ± 0.0
3.791LysTyr: 3.791 ± 0.676
0.0LysXaa: 0.0 ± 0.0
Leu
5.476LeuAla: 5.476 ± 1.745
2.949LeuCys: 2.949 ± 1.78
6.318LeuAsp: 6.318 ± 1.588
6.318LeuGlu: 6.318 ± 2.198
5.055LeuPhe: 5.055 ± 1.112
4.634LeuGly: 4.634 ± 2.013
1.685LeuHis: 1.685 ± 1.127
3.37LeuIle: 3.37 ± 1.418
4.212LeuLys: 4.212 ± 0.926
8.846LeuLeu: 8.846 ± 3.055
2.106LeuMet: 2.106 ± 0.703
0.842LeuAsn: 0.842 ± 0.508
2.106LeuPro: 2.106 ± 0.997
6.318LeuGln: 6.318 ± 1.199
4.212LeuArg: 4.212 ± 1.001
5.055LeuSer: 5.055 ± 1.959
4.212LeuThr: 4.212 ± 1.947
3.37LeuVal: 3.37 ± 1.222
0.842LeuTrp: 0.842 ± 0.434
1.264LeuTyr: 1.264 ± 0.689
0.0LeuXaa: 0.0 ± 0.0
Met
1.685MetAla: 1.685 ± 0.868
0.0MetCys: 0.0 ± 0.0
0.421MetAsp: 0.421 ± 0.389
0.842MetGlu: 0.842 ± 0.662
0.842MetPhe: 0.842 ± 0.434
0.421MetGly: 0.421 ± 0.336
0.421MetHis: 0.421 ± 0.385
1.264MetIle: 1.264 ± 0.706
0.0MetLys: 0.0 ± 0.0
0.421MetLeu: 0.421 ± 0.385
0.421MetMet: 0.421 ± 0.385
0.842MetAsn: 0.842 ± 0.508
0.0MetPro: 0.0 ± 0.0
0.421MetGln: 0.421 ± 0.426
0.842MetArg: 0.842 ± 0.7
2.106MetSer: 2.106 ± 1.424
0.0MetThr: 0.0 ± 0.0
2.106MetVal: 2.106 ± 0.81
0.421MetTrp: 0.421 ± 0.385
0.842MetTyr: 0.842 ± 0.508
0.0MetXaa: 0.0 ± 0.0
Asn
2.527AsnAla: 2.527 ± 0.85
0.421AsnCys: 0.421 ± 0.389
2.106AsnAsp: 2.106 ± 1.171
3.791AsnGlu: 3.791 ± 1.149
2.527AsnPhe: 2.527 ± 0.767
1.685AsnGly: 1.685 ± 1.162
0.421AsnHis: 0.421 ± 0.385
3.37AsnIle: 3.37 ± 0.752
3.791AsnLys: 3.791 ± 0.736
1.264AsnLeu: 1.264 ± 0.754
0.0AsnMet: 0.0 ± 0.0
1.264AsnAsn: 1.264 ± 0.767
3.37AsnPro: 3.37 ± 1.735
2.106AsnGln: 2.106 ± 1.034
4.634AsnArg: 4.634 ± 1.377
1.264AsnSer: 1.264 ± 0.706
6.318AsnThr: 6.318 ± 1.159
2.106AsnVal: 2.106 ± 0.498
0.0AsnTrp: 0.0 ± 0.0
1.264AsnTyr: 1.264 ± 0.706
0.0AsnXaa: 0.0 ± 0.0
Pro
4.212ProAla: 4.212 ± 1.7
2.527ProCys: 2.527 ± 1.709
3.791ProAsp: 3.791 ± 2.026
3.791ProGlu: 3.791 ± 1.437
1.685ProPhe: 1.685 ± 0.608
2.527ProGly: 2.527 ± 1.177
0.0ProHis: 0.0 ± 0.0
2.949ProIle: 2.949 ± 1.284
2.527ProLys: 2.527 ± 0.707
4.212ProLeu: 4.212 ± 1.332
0.421ProMet: 0.421 ± 0.597
2.106ProAsn: 2.106 ± 0.382
5.897ProPro: 5.897 ± 0.965
2.949ProGln: 2.949 ± 1.069
3.791ProArg: 3.791 ± 1.483
4.212ProSer: 4.212 ± 2.255
5.897ProThr: 5.897 ± 1.04
4.634ProVal: 4.634 ± 1.251
0.421ProTrp: 0.421 ± 0.385
1.264ProTyr: 1.264 ± 1.032
0.0ProXaa: 0.0 ± 0.0
Gln
3.37GlnAla: 3.37 ± 0.929
0.842GlnCys: 0.842 ± 0.434
2.949GlnAsp: 2.949 ± 1.204
2.106GlnGlu: 2.106 ± 1.504
1.685GlnPhe: 1.685 ± 0.854
3.37GlnGly: 3.37 ± 0.646
2.106GlnHis: 2.106 ± 0.523
2.527GlnIle: 2.527 ± 0.471
2.949GlnLys: 2.949 ± 0.911
6.74GlnLeu: 6.74 ± 1.937
1.685GlnMet: 1.685 ± 0.89
1.685GlnAsn: 1.685 ± 1.206
2.527GlnPro: 2.527 ± 0.592
3.791GlnGln: 3.791 ± 1.99
2.527GlnArg: 2.527 ± 0.638
1.685GlnSer: 1.685 ± 0.598
2.527GlnThr: 2.527 ± 0.996
1.685GlnVal: 1.685 ± 1.344
0.421GlnTrp: 0.421 ± 0.389
2.527GlnTyr: 2.527 ± 0.979
0.0GlnXaa: 0.0 ± 0.0
Arg
4.212ArgAla: 4.212 ± 1.555
2.106ArgCys: 2.106 ± 0.964
2.527ArgAsp: 2.527 ± 1.049
2.106ArgGlu: 2.106 ± 1.106
3.37ArgPhe: 3.37 ± 0.419
4.212ArgGly: 4.212 ± 1.14
2.106ArgHis: 2.106 ± 1.029
1.685ArgIle: 1.685 ± 0.854
4.634ArgLys: 4.634 ± 1.283
7.161ArgLeu: 7.161 ± 1.328
0.421ArgMet: 0.421 ± 0.385
2.527ArgAsn: 2.527 ± 0.603
3.37ArgPro: 3.37 ± 0.861
3.791ArgGln: 3.791 ± 1.16
9.267ArgArg: 9.267 ± 5.018
9.267ArgSer: 9.267 ± 4.927
4.212ArgThr: 4.212 ± 2.222
3.791ArgVal: 3.791 ± 1.91
0.0ArgTrp: 0.0 ± 0.0
2.106ArgTyr: 2.106 ± 0.382
0.0ArgXaa: 0.0 ± 0.0
Ser
3.37SerAla: 3.37 ± 1.298
0.421SerCys: 0.421 ± 0.385
5.476SerAsp: 5.476 ± 2.061
4.212SerGlu: 4.212 ± 1.444
2.949SerPhe: 2.949 ± 0.507
5.897SerGly: 5.897 ± 1.211
1.264SerHis: 1.264 ± 1.168
2.106SerIle: 2.106 ± 0.77
4.212SerLys: 4.212 ± 1.893
7.582SerLeu: 7.582 ± 1.019
1.264SerMet: 1.264 ± 0.767
0.842SerAsn: 0.842 ± 0.779
5.476SerPro: 5.476 ± 3.067
2.949SerGln: 2.949 ± 1.328
9.267SerArg: 9.267 ± 3.337
11.373SerSer: 11.373 ± 5.105
8.425SerThr: 8.425 ± 3.242
3.37SerVal: 3.37 ± 1.276
1.264SerTrp: 1.264 ± 0.689
1.264SerTyr: 1.264 ± 0.805
0.0SerXaa: 0.0 ± 0.0
Thr
3.37ThrAla: 3.37 ± 1.298
1.264ThrCys: 1.264 ± 0.682
3.37ThrAsp: 3.37 ± 1.009
3.791ThrGlu: 3.791 ± 0.927
3.791ThrPhe: 3.791 ± 0.958
3.37ThrGly: 3.37 ± 1.072
0.842ThrHis: 0.842 ± 0.672
2.106ThrIle: 2.106 ± 1.68
0.842ThrLys: 0.842 ± 0.77
4.212ThrLeu: 4.212 ± 1.537
1.264ThrMet: 1.264 ± 0.689
4.634ThrAsn: 4.634 ± 0.613
6.74ThrPro: 6.74 ± 2.893
2.106ThrGln: 2.106 ± 1.1
5.055ThrArg: 5.055 ± 1.382
6.74ThrSer: 6.74 ± 1.883
5.476ThrThr: 5.476 ± 2.379
5.897ThrVal: 5.897 ± 0.819
1.264ThrTrp: 1.264 ± 0.682
2.527ThrTyr: 2.527 ± 0.592
0.0ThrXaa: 0.0 ± 0.0
Val
4.212ValAla: 4.212 ± 0.968
0.0ValCys: 0.0 ± 0.0
2.949ValAsp: 2.949 ± 1.269
5.476ValGlu: 5.476 ± 1.088
2.949ValPhe: 2.949 ± 1.269
2.949ValGly: 2.949 ± 0.683
0.842ValHis: 0.842 ± 0.442
2.949ValIle: 2.949 ± 1.1
1.264ValLys: 1.264 ± 0.701
3.37ValLeu: 3.37 ± 0.996
0.421ValMet: 0.421 ± 0.385
2.949ValAsn: 2.949 ± 0.937
3.791ValPro: 3.791 ± 0.655
3.37ValGln: 3.37 ± 0.889
5.476ValArg: 5.476 ± 2.147
5.897ValSer: 5.897 ± 2.151
4.212ValThr: 4.212 ± 1.497
2.527ValVal: 2.527 ± 1.464
1.264ValTrp: 1.264 ± 0.767
2.106ValTyr: 2.106 ± 1.573
0.0ValXaa: 0.0 ± 0.0
Trp
1.264TrpAla: 1.264 ± 0.706
0.421TrpCys: 0.421 ± 0.389
0.421TrpAsp: 0.421 ± 0.426
0.421TrpGlu: 0.421 ± 0.385
0.0TrpPhe: 0.0 ± 0.0
0.421TrpGly: 0.421 ± 0.426
0.0TrpHis: 0.0 ± 0.0
0.842TrpIle: 0.842 ± 0.779
1.685TrpLys: 1.685 ± 1.191
1.685TrpLeu: 1.685 ± 1.055
0.0TrpMet: 0.0 ± 0.0
0.842TrpAsn: 0.842 ± 0.434
0.0TrpPro: 0.0 ± 0.0
2.106TrpGln: 2.106 ± 1.276
0.0TrpArg: 0.0 ± 0.0
0.842TrpSer: 0.842 ± 0.4
0.421TrpThr: 0.421 ± 0.385
0.842TrpVal: 0.842 ± 0.4
0.0TrpTrp: 0.0 ± 0.0
0.421TrpTyr: 0.421 ± 0.389
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.264TyrAla: 1.264 ± 0.767
0.0TyrCys: 0.0 ± 0.0
0.842TyrAsp: 0.842 ± 0.508
2.949TyrGlu: 2.949 ± 0.869
1.264TyrPhe: 1.264 ± 0.968
2.949TyrGly: 2.949 ± 0.825
1.685TyrHis: 1.685 ± 0.615
1.685TyrIle: 1.685 ± 0.788
1.685TyrLys: 1.685 ± 0.882
2.949TyrLeu: 2.949 ± 0.747
0.842TyrMet: 0.842 ± 0.4
0.842TyrAsn: 0.842 ± 0.508
2.949TyrPro: 2.949 ± 0.621
2.106TyrGln: 2.106 ± 0.382
1.264TyrArg: 1.264 ± 0.495
1.264TyrSer: 1.264 ± 0.664
1.685TyrThr: 1.685 ± 0.209
3.37TyrVal: 3.37 ± 0.646
0.0TyrTrp: 0.0 ± 0.0
2.949TyrTyr: 2.949 ± 0.914
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2375 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski