Amino acid dipepetide frequency for Human papillomavirus 73

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.527AlaAla: 2.527 ± 1.103
1.264AlaCys: 1.264 ± 0.848
2.949AlaAsp: 2.949 ± 0.958
2.527AlaGlu: 2.527 ± 0.895
2.527AlaPhe: 2.527 ± 1.181
2.527AlaGly: 2.527 ± 1.013
0.421AlaHis: 0.421 ± 0.314
3.791AlaIle: 3.791 ± 0.821
4.212AlaLys: 4.212 ± 0.884
5.055AlaLeu: 5.055 ± 1.417
0.421AlaMet: 0.421 ± 0.391
1.685AlaAsn: 1.685 ± 0.629
2.527AlaPro: 2.527 ± 1.184
2.527AlaGln: 2.527 ± 1.169
1.264AlaArg: 1.264 ± 0.461
2.527AlaSer: 2.527 ± 1.469
4.212AlaThr: 4.212 ± 1.274
2.106AlaVal: 2.106 ± 0.985
0.0AlaTrp: 0.0 ± 0.0
0.421AlaTyr: 0.421 ± 0.314
0.0AlaXaa: 0.0 ± 0.0
Cys
1.264CysAla: 1.264 ± 0.595
0.421CysCys: 0.421 ± 0.403
1.264CysAsp: 1.264 ± 0.582
1.264CysGlu: 1.264 ± 0.739
0.842CysPhe: 0.842 ± 0.806
1.264CysGly: 1.264 ± 0.582
0.421CysHis: 0.421 ± 0.457
2.106CysIle: 2.106 ± 0.872
1.685CysLys: 1.685 ± 0.721
3.37CysLeu: 3.37 ± 1.771
0.842CysMet: 0.842 ± 0.385
0.842CysAsn: 0.842 ± 0.532
2.106CysPro: 2.106 ± 0.634
2.949CysGln: 2.949 ± 1.426
0.842CysArg: 0.842 ± 0.385
0.842CysSer: 0.842 ± 0.778
4.212CysThr: 4.212 ± 1.867
2.527CysVal: 2.527 ± 1.364
1.685CysTrp: 1.685 ± 0.529
0.842CysTyr: 0.842 ± 1.367
0.0CysXaa: 0.0 ± 0.0
Asp
2.949AspAla: 2.949 ± 1.07
2.527AspCys: 2.527 ± 0.82
3.791AspAsp: 3.791 ± 1.549
3.791AspGlu: 3.791 ± 1.549
1.685AspPhe: 1.685 ± 0.481
3.37AspGly: 3.37 ± 0.852
1.264AspHis: 1.264 ± 0.775
5.476AspIle: 5.476 ± 2.015
3.37AspLys: 3.37 ± 1.054
4.634AspLeu: 4.634 ± 1.825
0.842AspMet: 0.842 ± 0.385
3.37AspAsn: 3.37 ± 1.083
2.949AspPro: 2.949 ± 0.874
0.842AspGln: 0.842 ± 0.523
0.842AspArg: 0.842 ± 0.644
5.055AspSer: 5.055 ± 0.843
6.74AspThr: 6.74 ± 1.667
3.37AspVal: 3.37 ± 1.853
0.421AspTrp: 0.421 ± 0.314
1.264AspTyr: 1.264 ± 0.831
0.0AspXaa: 0.0 ± 0.0
Glu
1.264GluAla: 1.264 ± 0.687
1.264GluCys: 1.264 ± 0.925
2.106GluAsp: 2.106 ± 1.255
4.634GluGlu: 4.634 ± 0.742
1.685GluPhe: 1.685 ± 0.556
2.106GluGly: 2.106 ± 0.779
0.421GluHis: 0.421 ± 0.391
2.527GluIle: 2.527 ± 0.828
2.106GluLys: 2.106 ± 1.127
2.527GluLeu: 2.527 ± 1.154
1.685GluMet: 1.685 ± 0.678
2.949GluAsn: 2.949 ± 0.687
0.842GluPro: 0.842 ± 0.399
1.264GluGln: 1.264 ± 0.516
2.949GluArg: 2.949 ± 0.904
4.634GluSer: 4.634 ± 1.546
4.212GluThr: 4.212 ± 0.502
4.212GluVal: 4.212 ± 1.278
1.264GluTrp: 1.264 ± 0.41
1.264GluTyr: 1.264 ± 0.808
0.0GluXaa: 0.0 ± 0.0
Phe
0.421PheAla: 0.421 ± 0.42
1.264PheCys: 1.264 ± 1.349
2.527PheAsp: 2.527 ± 0.617
0.842PheGlu: 0.842 ± 0.46
1.264PhePhe: 1.264 ± 0.656
2.527PheGly: 2.527 ± 0.948
0.421PheHis: 0.421 ± 0.42
2.527PheIle: 2.527 ± 1.08
3.37PheLys: 3.37 ± 0.614
5.897PheLeu: 5.897 ± 1.333
0.842PheMet: 0.842 ± 0.385
1.264PheAsn: 1.264 ± 0.71
2.106PhePro: 2.106 ± 1.072
0.842PheGln: 0.842 ± 0.437
0.842PheArg: 0.842 ± 0.523
2.106PheSer: 2.106 ± 0.856
2.949PheThr: 2.949 ± 1.209
1.685PheVal: 1.685 ± 0.891
1.264PheTrp: 1.264 ± 0.41
2.527PheTyr: 2.527 ± 1.139
0.0PheXaa: 0.0 ± 0.0
Gly
1.264GlyAla: 1.264 ± 0.389
1.264GlyCys: 1.264 ± 0.585
4.634GlyAsp: 4.634 ± 1.777
0.421GlyGlu: 0.421 ± 0.391
2.106GlyPhe: 2.106 ± 0.95
2.949GlyGly: 2.949 ± 0.954
0.421GlyHis: 0.421 ± 0.391
4.212GlyIle: 4.212 ± 0.706
2.527GlyLys: 2.527 ± 0.94
4.634GlyLeu: 4.634 ± 1.447
1.264GlyMet: 1.264 ± 0.71
4.212GlyAsn: 4.212 ± 1.948
2.949GlyPro: 2.949 ± 1.434
2.106GlyGln: 2.106 ± 1.064
4.634GlyArg: 4.634 ± 1.346
5.476GlySer: 5.476 ± 2.223
4.212GlyThr: 4.212 ± 1.603
3.791GlyVal: 3.791 ± 0.932
0.842GlyTrp: 0.842 ± 0.399
2.106GlyTyr: 2.106 ± 1.035
0.0GlyXaa: 0.0 ± 0.0
His
0.842HisAla: 0.842 ± 0.747
0.842HisCys: 0.842 ± 0.519
0.842HisAsp: 0.842 ± 0.536
0.421HisGlu: 0.421 ± 0.403
1.264HisPhe: 1.264 ± 0.687
0.842HisGly: 0.842 ± 0.858
0.421HisHis: 0.421 ± 0.457
2.106HisIle: 2.106 ± 0.844
1.264HisLys: 1.264 ± 0.739
2.527HisLeu: 2.527 ± 1.368
0.421HisMet: 0.421 ± 0.403
2.106HisAsn: 2.106 ± 0.834
1.685HisPro: 1.685 ± 0.67
0.421HisGln: 0.421 ± 0.42
0.842HisArg: 0.842 ± 0.46
2.527HisSer: 2.527 ± 1.064
0.842HisThr: 0.842 ± 0.806
1.685HisVal: 1.685 ± 1.164
1.264HisTrp: 1.264 ± 0.848
0.842HisTyr: 0.842 ± 0.659
0.0HisXaa: 0.0 ± 0.0
Ile
2.949IleAla: 2.949 ± 0.581
1.264IleCys: 1.264 ± 0.41
3.791IleAsp: 3.791 ± 0.69
2.949IleGlu: 2.949 ± 1.245
0.842IlePhe: 0.842 ± 0.685
1.685IleGly: 1.685 ± 1.076
2.106IleHis: 2.106 ± 0.602
1.685IleIle: 1.685 ± 1.235
2.106IleLys: 2.106 ± 0.939
2.949IleLeu: 2.949 ± 0.877
0.421IleMet: 0.421 ± 0.473
1.264IleAsn: 1.264 ± 0.599
6.74IlePro: 6.74 ± 2.188
2.106IleGln: 2.106 ± 0.815
2.527IleArg: 2.527 ± 1.352
5.055IleSer: 5.055 ± 1.436
4.634IleThr: 4.634 ± 1.914
8.003IleVal: 8.003 ± 1.795
0.421IleTrp: 0.421 ± 0.403
2.949IleTyr: 2.949 ± 0.828
0.0IleXaa: 0.0 ± 0.0
Lys
3.37LysAla: 3.37 ± 1.001
2.949LysCys: 2.949 ± 1.185
2.949LysAsp: 2.949 ± 0.852
2.527LysGlu: 2.527 ± 1.42
2.949LysPhe: 2.949 ± 1.156
2.527LysGly: 2.527 ± 1.475
1.264LysHis: 1.264 ± 0.582
2.106LysIle: 2.106 ± 1.061
2.527LysLys: 2.527 ± 0.978
3.37LysLeu: 3.37 ± 0.858
0.421LysMet: 0.421 ± 0.314
2.527LysAsn: 2.527 ± 0.52
2.106LysPro: 2.106 ± 1.012
3.791LysGln: 3.791 ± 1.177
7.582LysArg: 7.582 ± 1.37
2.106LysSer: 2.106 ± 1.149
3.37LysThr: 3.37 ± 0.59
4.212LysVal: 4.212 ± 1.022
0.0LysTrp: 0.0 ± 0.0
3.37LysTyr: 3.37 ± 1.028
0.0LysXaa: 0.0 ± 0.0
Leu
3.37LeuAla: 3.37 ± 1.252
4.212LeuCys: 4.212 ± 2.504
6.318LeuAsp: 6.318 ± 1.708
4.212LeuGlu: 4.212 ± 1.387
3.791LeuPhe: 3.791 ± 0.789
5.055LeuGly: 5.055 ± 1.856
4.212LeuHis: 4.212 ± 1.942
2.527LeuIle: 2.527 ± 1.328
6.318LeuLys: 6.318 ± 0.946
8.003LeuLeu: 8.003 ± 2.284
1.685LeuMet: 1.685 ± 1.537
1.264LeuAsn: 1.264 ± 0.595
1.685LeuPro: 1.685 ± 0.874
9.267LeuGln: 9.267 ± 1.185
3.37LeuArg: 3.37 ± 1.107
5.476LeuSer: 5.476 ± 1.753
6.318LeuThr: 6.318 ± 1.335
5.055LeuVal: 5.055 ± 1.974
0.421LeuTrp: 0.421 ± 0.403
5.476LeuTyr: 5.476 ± 1.416
0.0LeuXaa: 0.0 ± 0.0
Met
2.106MetAla: 2.106 ± 0.911
0.421MetCys: 0.421 ± 0.314
0.842MetAsp: 0.842 ± 0.385
0.842MetGlu: 0.842 ± 0.399
0.842MetPhe: 0.842 ± 0.523
1.685MetGly: 1.685 ± 1.058
0.421MetHis: 0.421 ± 0.683
1.685MetIle: 1.685 ± 0.879
0.421MetLys: 0.421 ± 0.314
1.264MetLeu: 1.264 ± 0.658
0.421MetMet: 0.421 ± 0.403
0.842MetAsn: 0.842 ± 0.782
0.421MetPro: 0.421 ± 0.314
1.264MetGln: 1.264 ± 0.775
0.842MetArg: 0.842 ± 0.437
1.685MetSer: 1.685 ± 0.509
0.421MetThr: 0.421 ± 0.391
0.421MetVal: 0.421 ± 0.391
0.842MetTrp: 0.842 ± 0.782
0.421MetTyr: 0.421 ± 0.314
0.0MetXaa: 0.0 ± 0.0
Asn
3.791AsnAla: 3.791 ± 1.525
0.842AsnCys: 0.842 ± 0.644
2.949AsnAsp: 2.949 ± 1.684
1.685AsnGlu: 1.685 ± 0.864
1.264AsnPhe: 1.264 ± 1.173
1.685AsnGly: 1.685 ± 0.546
0.0AsnHis: 0.0 ± 0.0
2.949AsnIle: 2.949 ± 1.014
2.949AsnLys: 2.949 ± 1.523
2.527AsnLeu: 2.527 ± 1.184
0.421AsnMet: 0.421 ± 0.314
1.264AsnAsn: 1.264 ± 0.41
4.212AsnPro: 4.212 ± 1.104
1.264AsnGln: 1.264 ± 0.41
0.842AsnArg: 0.842 ± 0.385
4.634AsnSer: 4.634 ± 1.125
4.634AsnThr: 4.634 ± 1.625
4.212AsnVal: 4.212 ± 1.927
1.264AsnTrp: 1.264 ± 0.943
0.421AsnTyr: 0.421 ± 0.314
0.0AsnXaa: 0.0 ± 0.0
Pro
2.949ProAla: 2.949 ± 2.364
1.264ProCys: 1.264 ± 0.516
4.634ProAsp: 4.634 ± 1.451
1.264ProGlu: 1.264 ± 0.595
2.527ProPhe: 2.527 ± 0.828
2.949ProGly: 2.949 ± 1.19
0.0ProHis: 0.0 ± 0.0
5.055ProIle: 5.055 ± 1.447
3.791ProLys: 3.791 ± 1.537
7.582ProLeu: 7.582 ± 1.879
0.421ProMet: 0.421 ± 0.662
1.685ProAsn: 1.685 ± 0.762
7.582ProPro: 7.582 ± 2.501
2.527ProGln: 2.527 ± 0.873
1.264ProArg: 1.264 ± 0.943
6.74ProSer: 6.74 ± 2.461
4.634ProThr: 4.634 ± 1.852
2.527ProVal: 2.527 ± 1.411
0.842ProTrp: 0.842 ± 0.906
2.949ProTyr: 2.949 ± 1.522
0.0ProXaa: 0.0 ± 0.0
Gln
4.634GlnAla: 4.634 ± 1.065
2.106GlnCys: 2.106 ± 1.4
2.527GlnAsp: 2.527 ± 0.512
0.842GlnGlu: 0.842 ± 0.629
2.106GlnPhe: 2.106 ± 0.686
0.842GlnGly: 0.842 ± 0.385
2.106GlnHis: 2.106 ± 0.719
2.527GlnIle: 2.527 ± 0.788
2.106GlnLys: 2.106 ± 0.979
3.791GlnLeu: 3.791 ± 1.591
1.685GlnMet: 1.685 ± 0.586
2.106GlnAsn: 2.106 ± 0.939
3.791GlnPro: 3.791 ± 1.772
2.949GlnGln: 2.949 ± 1.05
2.106GlnArg: 2.106 ± 0.788
2.106GlnSer: 2.106 ± 1.026
3.37GlnThr: 3.37 ± 0.533
1.264GlnVal: 1.264 ± 0.416
1.264GlnTrp: 1.264 ± 0.767
1.264GlnTyr: 1.264 ± 0.406
0.0GlnXaa: 0.0 ± 0.0
Arg
2.527ArgAla: 2.527 ± 1.311
2.106ArgCys: 2.106 ± 1.425
1.264ArgAsp: 1.264 ± 0.416
2.527ArgGlu: 2.527 ± 0.978
1.264ArgPhe: 1.264 ± 0.582
1.264ArgGly: 1.264 ± 0.61
1.685ArgHis: 1.685 ± 0.694
1.685ArgIle: 1.685 ± 0.555
5.476ArgLys: 5.476 ± 1.524
5.897ArgLeu: 5.897 ± 1.335
0.421ArgMet: 0.421 ± 0.314
1.264ArgAsn: 1.264 ± 0.748
4.212ArgPro: 4.212 ± 1.223
2.949ArgGln: 2.949 ± 1.154
1.685ArgArg: 1.685 ± 0.481
2.949ArgSer: 2.949 ± 0.494
2.106ArgThr: 2.106 ± 0.607
1.685ArgVal: 1.685 ± 0.555
0.0ArgTrp: 0.0 ± 0.0
2.527ArgTyr: 2.527 ± 1.133
0.0ArgXaa: 0.0 ± 0.0
Ser
3.791SerAla: 3.791 ± 1.379
1.264SerCys: 1.264 ± 0.831
4.212SerAsp: 4.212 ± 1.508
2.106SerGlu: 2.106 ± 1.151
1.685SerPhe: 1.685 ± 0.874
8.003SerGly: 8.003 ± 2.455
1.685SerHis: 1.685 ± 0.805
4.634SerIle: 4.634 ± 2.522
3.791SerLys: 3.791 ± 0.873
7.582SerLeu: 7.582 ± 2.148
2.106SerMet: 2.106 ± 1.049
5.055SerAsn: 5.055 ± 1.659
2.949SerPro: 2.949 ± 1.569
2.949SerGln: 2.949 ± 1.17
2.527SerArg: 2.527 ± 0.978
6.74SerSer: 6.74 ± 1.689
7.582SerThr: 7.582 ± 3.019
5.897SerVal: 5.897 ± 2.092
0.421SerTrp: 0.421 ± 0.403
2.106SerTyr: 2.106 ± 1.231
0.0SerXaa: 0.0 ± 0.0
Thr
1.264ThrAla: 1.264 ± 0.725
2.527ThrCys: 2.527 ± 0.643
5.476ThrAsp: 5.476 ± 1.04
5.055ThrGlu: 5.055 ± 1.013
2.527ThrPhe: 2.527 ± 1.183
5.897ThrGly: 5.897 ± 1.419
2.949ThrHis: 2.949 ± 1.201
2.527ThrIle: 2.527 ± 1.033
0.842ThrLys: 0.842 ± 0.644
5.897ThrLeu: 5.897 ± 2.505
0.842ThrMet: 0.842 ± 0.619
5.055ThrAsn: 5.055 ± 1.188
8.846ThrPro: 8.846 ± 2.798
2.949ThrGln: 2.949 ± 0.531
3.791ThrArg: 3.791 ± 1.211
6.74ThrSer: 6.74 ± 1.591
12.216ThrThr: 12.216 ± 4.748
5.055ThrVal: 5.055 ± 0.773
1.685ThrTrp: 1.685 ± 0.864
3.791ThrTyr: 3.791 ± 1.096
0.0ThrXaa: 0.0 ± 0.0
Val
0.842ValAla: 0.842 ± 0.46
2.949ValCys: 2.949 ± 1.623
2.949ValAsp: 2.949 ± 0.851
4.212ValGlu: 4.212 ± 1.414
5.055ValPhe: 5.055 ± 1.699
3.791ValGly: 3.791 ± 1.186
2.106ValHis: 2.106 ± 1.367
2.527ValIle: 2.527 ± 0.892
2.949ValLys: 2.949 ± 0.955
5.897ValLeu: 5.897 ± 2.09
1.264ValMet: 1.264 ± 0.496
2.949ValAsn: 2.949 ± 0.59
3.791ValPro: 3.791 ± 0.929
1.264ValGln: 1.264 ± 0.943
2.106ValArg: 2.106 ± 0.708
7.582ValSer: 7.582 ± 2.073
6.318ValThr: 6.318 ± 2.169
3.791ValVal: 3.791 ± 0.857
0.421ValTrp: 0.421 ± 0.391
2.949ValTyr: 2.949 ± 1.36
0.0ValXaa: 0.0 ± 0.0
Trp
1.264TrpAla: 1.264 ± 0.41
0.0TrpCys: 0.0 ± 0.0
0.421TrpAsp: 0.421 ± 0.391
1.264TrpGlu: 1.264 ± 0.739
0.842TrpPhe: 0.842 ± 0.385
0.842TrpGly: 0.842 ± 0.385
0.842TrpHis: 0.842 ± 0.523
0.842TrpIle: 0.842 ± 0.629
1.685TrpLys: 1.685 ± 0.864
1.264TrpLeu: 1.264 ± 0.672
0.0TrpMet: 0.0 ± 0.0
0.421TrpAsn: 0.421 ± 0.391
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.842TrpArg: 0.842 ± 0.512
0.842TrpSer: 0.842 ± 0.519
2.106TrpThr: 2.106 ± 1.194
0.842TrpVal: 0.842 ± 0.399
0.0TrpTrp: 0.0 ± 0.0
0.842TrpTyr: 0.842 ± 0.399
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.106TyrAla: 2.106 ± 1.073
0.842TyrCys: 0.842 ± 0.399
2.106TyrAsp: 2.106 ± 0.767
2.527TyrGlu: 2.527 ± 1.277
0.421TyrPhe: 0.421 ± 0.391
4.212TyrGly: 4.212 ± 1.15
0.842TyrHis: 0.842 ± 0.685
3.37TyrIle: 3.37 ± 1.325
2.527TyrLys: 2.527 ± 0.872
3.37TyrLeu: 3.37 ± 1.215
1.264TyrMet: 1.264 ± 0.393
1.685TyrAsn: 1.685 ± 1.221
1.685TyrPro: 1.685 ± 1.1
1.264TyrGln: 1.264 ± 0.943
3.791TyrArg: 3.791 ± 1.802
1.264TyrSer: 1.264 ± 0.588
0.842TyrThr: 0.842 ± 0.46
3.37TyrVal: 3.37 ± 0.741
0.842TyrTrp: 0.842 ± 0.385
2.106TyrTyr: 2.106 ± 0.904
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (2375 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski