Amino acid dipepetide frequency for Human papillomavirus 121

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.326AlaAla: 3.326 ± 1.007
0.832AlaCys: 0.832 ± 0.62
2.495AlaAsp: 2.495 ± 0.848
4.574AlaGlu: 4.574 ± 1.514
2.911AlaPhe: 2.911 ± 0.945
2.495AlaGly: 2.495 ± 0.775
1.663AlaHis: 1.663 ± 0.95
4.574AlaIle: 4.574 ± 1.137
2.495AlaLys: 2.495 ± 1.029
4.99AlaLeu: 4.99 ± 0.925
0.416AlaMet: 0.416 ± 0.375
1.663AlaAsn: 1.663 ± 0.621
2.079AlaPro: 2.079 ± 0.763
3.326AlaGln: 3.326 ± 1.535
2.911AlaArg: 2.911 ± 0.613
4.574AlaSer: 4.574 ± 0.616
2.911AlaThr: 2.911 ± 1.076
1.663AlaVal: 1.663 ± 0.652
0.832AlaTrp: 0.832 ± 0.456
2.495AlaTyr: 2.495 ± 1.014
0.0AlaXaa: 0.0 ± 0.0
Cys
1.663CysAla: 1.663 ± 0.988
1.247CysCys: 1.247 ± 0.776
0.832CysAsp: 0.832 ± 0.689
0.0CysGlu: 0.0 ± 0.0
0.416CysPhe: 0.416 ± 0.422
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
1.247CysIle: 1.247 ± 0.638
2.495CysLys: 2.495 ± 0.707
1.663CysLeu: 1.663 ± 1.825
0.416CysMet: 0.416 ± 0.634
1.247CysAsn: 1.247 ± 0.776
1.247CysPro: 1.247 ± 0.578
0.0CysGln: 0.0 ± 0.0
2.495CysArg: 2.495 ± 1.935
1.663CysSer: 1.663 ± 1.239
2.495CysThr: 2.495 ± 1.502
0.832CysVal: 0.832 ± 0.706
0.832CysTrp: 0.832 ± 0.456
0.832CysTyr: 0.832 ± 0.706
0.0CysXaa: 0.0 ± 0.0
Asp
2.911AspAla: 2.911 ± 0.843
1.663AspCys: 1.663 ± 0.666
3.742AspAsp: 3.742 ± 0.938
3.742AspGlu: 3.742 ± 2.364
4.574AspPhe: 4.574 ± 1.003
1.663AspGly: 1.663 ± 0.995
0.832AspHis: 0.832 ± 0.432
6.237AspIle: 6.237 ± 0.85
1.663AspLys: 1.663 ± 0.921
6.653AspLeu: 6.653 ± 2.213
0.832AspMet: 0.832 ± 0.432
4.158AspAsn: 4.158 ± 1.002
4.574AspPro: 4.574 ± 1.265
1.247AspGln: 1.247 ± 0.726
1.663AspArg: 1.663 ± 0.652
4.574AspSer: 4.574 ± 1.526
3.742AspThr: 3.742 ± 0.988
5.821AspVal: 5.821 ± 1.785
1.247AspTrp: 1.247 ± 0.685
1.663AspTyr: 1.663 ± 1.327
0.0AspXaa: 0.0 ± 0.0
Glu
4.574GluAla: 4.574 ± 1.416
0.416GluCys: 0.416 ± 0.344
6.653GluAsp: 6.653 ± 1.562
8.316GluGlu: 8.316 ± 2.048
2.079GluPhe: 2.079 ± 0.691
2.495GluGly: 2.495 ± 0.871
1.663GluHis: 1.663 ± 0.76
2.911GluIle: 2.911 ± 0.747
2.495GluLys: 2.495 ± 1.18
6.237GluLeu: 6.237 ± 1.531
2.079GluMet: 2.079 ± 1.317
2.911GluAsn: 2.911 ± 0.868
2.495GluPro: 2.495 ± 0.871
4.99GluGln: 4.99 ± 1.273
1.663GluArg: 1.663 ± 0.76
3.742GluSer: 3.742 ± 0.714
2.911GluThr: 2.911 ± 1.409
2.079GluVal: 2.079 ± 1.083
0.832GluTrp: 0.832 ± 0.456
1.247GluTyr: 1.247 ± 0.638
0.0GluXaa: 0.0 ± 0.0
Phe
0.832PheAla: 0.832 ± 0.417
1.663PheCys: 1.663 ± 1.825
3.326PheAsp: 3.326 ± 1.206
4.158PheGlu: 4.158 ± 1.352
1.663PhePhe: 1.663 ± 0.603
2.911PheGly: 2.911 ± 0.496
0.832PheHis: 0.832 ± 0.461
3.742PheIle: 3.742 ± 1.083
3.742PheLys: 3.742 ± 1.456
4.158PheLeu: 4.158 ± 0.821
0.0PheMet: 0.0 ± 0.0
2.495PheAsn: 2.495 ± 1.807
0.416PhePro: 0.416 ± 0.332
2.079PheGln: 2.079 ± 0.894
1.247PheArg: 1.247 ± 0.956
1.663PheSer: 1.663 ± 0.608
2.079PheThr: 2.079 ± 0.691
3.326PheVal: 3.326 ± 0.946
0.832PheTrp: 0.832 ± 0.432
2.495PheTyr: 2.495 ± 0.886
0.0PheXaa: 0.0 ± 0.0
Gly
2.495GlyAla: 2.495 ± 0.666
1.247GlyCys: 1.247 ± 0.411
5.821GlyAsp: 5.821 ± 2.007
3.742GlyGlu: 3.742 ± 1.018
1.247GlyPhe: 1.247 ± 0.698
2.495GlyGly: 2.495 ± 1.168
2.079GlyHis: 2.079 ± 0.83
4.158GlyIle: 4.158 ± 0.604
2.911GlyLys: 2.911 ± 1.247
3.326GlyLeu: 3.326 ± 0.709
0.0GlyMet: 0.0 ± 0.0
4.574GlyAsn: 4.574 ± 1.123
4.574GlyPro: 4.574 ± 1.425
1.247GlyGln: 1.247 ± 0.654
2.911GlyArg: 2.911 ± 1.102
5.405GlySer: 5.405 ± 1.528
5.821GlyThr: 5.821 ± 1.424
3.326GlyVal: 3.326 ± 1.062
0.416GlyTrp: 0.416 ± 0.422
0.832GlyTyr: 0.832 ± 0.716
0.0GlyXaa: 0.0 ± 0.0
His
0.416HisAla: 0.416 ± 0.375
0.0HisCys: 0.0 ± 0.0
0.832HisAsp: 0.832 ± 0.689
0.0HisGlu: 0.0 ± 0.0
2.079HisPhe: 2.079 ± 1.079
0.0HisGly: 0.0 ± 0.0
0.0HisHis: 0.0 ± 0.0
2.079HisIle: 2.079 ± 0.809
2.079HisLys: 2.079 ± 1.017
1.247HisLeu: 1.247 ± 0.81
0.416HisMet: 0.416 ± 0.344
1.247HisAsn: 1.247 ± 0.411
1.663HisPro: 1.663 ± 0.834
0.832HisGln: 0.832 ± 0.456
1.247HisArg: 1.247 ± 0.411
1.663HisSer: 1.663 ± 0.621
1.663HisThr: 1.663 ± 0.619
0.832HisVal: 0.832 ± 0.461
0.832HisTrp: 0.832 ± 0.461
1.663HisTyr: 1.663 ± 0.87
0.0HisXaa: 0.0 ± 0.0
Ile
2.911IleAla: 2.911 ± 1.412
0.832IleCys: 0.832 ± 0.629
7.069IleAsp: 7.069 ± 2.337
4.574IleGlu: 4.574 ± 1.464
0.416IlePhe: 0.416 ± 0.332
3.326IleGly: 3.326 ± 0.743
0.416IleHis: 0.416 ± 0.332
4.574IleIle: 4.574 ± 2.219
1.247IleLys: 1.247 ± 0.66
4.99IleLeu: 4.99 ± 1.128
1.663IleMet: 1.663 ± 0.82
0.832IleAsn: 0.832 ± 0.446
5.405IlePro: 5.405 ± 1.881
2.079IleGln: 2.079 ± 0.603
3.326IleArg: 3.326 ± 1.67
3.326IleSer: 3.326 ± 1.014
4.158IleThr: 4.158 ± 1.066
4.158IleVal: 4.158 ± 1.401
0.832IleTrp: 0.832 ± 0.689
1.663IleTyr: 1.663 ± 0.652
0.0IleXaa: 0.0 ± 0.0
Lys
3.326LysAla: 3.326 ± 0.767
1.663LysCys: 1.663 ± 0.988
2.079LysAsp: 2.079 ± 1.544
3.326LysGlu: 3.326 ± 0.948
1.663LysPhe: 1.663 ± 0.864
2.911LysGly: 2.911 ± 0.554
3.742LysHis: 3.742 ± 0.674
1.247LysIle: 1.247 ± 0.667
2.495LysLys: 2.495 ± 0.821
4.99LysLeu: 4.99 ± 1.572
0.416LysMet: 0.416 ± 0.375
2.495LysAsn: 2.495 ± 0.748
2.495LysPro: 2.495 ± 0.944
3.326LysGln: 3.326 ± 0.86
5.821LysArg: 5.821 ± 1.359
3.326LysSer: 3.326 ± 1.523
2.911LysThr: 2.911 ± 0.995
4.99LysVal: 4.99 ± 1.285
1.247LysTrp: 1.247 ± 0.685
2.911LysTyr: 2.911 ± 1.126
0.0LysXaa: 0.0 ± 0.0
Leu
6.653LeuAla: 6.653 ± 1.247
1.247LeuCys: 1.247 ± 0.902
4.574LeuAsp: 4.574 ± 0.705
4.99LeuGlu: 4.99 ± 1.13
6.237LeuPhe: 6.237 ± 1.281
7.484LeuGly: 7.484 ± 3.379
1.247LeuHis: 1.247 ± 0.722
2.495LeuIle: 2.495 ± 1.225
4.99LeuLys: 4.99 ± 1.513
10.811LeuLeu: 10.811 ± 3.366
1.247LeuMet: 1.247 ± 0.408
3.326LeuAsn: 3.326 ± 0.53
4.574LeuPro: 4.574 ± 2.527
5.821LeuGln: 5.821 ± 2.006
3.326LeuArg: 3.326 ± 1.305
7.9LeuSer: 7.9 ± 2.159
4.99LeuThr: 4.99 ± 1.077
4.574LeuVal: 4.574 ± 1.275
1.247LeuTrp: 1.247 ± 0.935
4.158LeuTyr: 4.158 ± 0.737
0.0LeuXaa: 0.0 ± 0.0
Met
0.416MetAla: 0.416 ± 0.375
1.247MetCys: 1.247 ± 1.033
0.416MetAsp: 0.416 ± 0.634
0.416MetGlu: 0.416 ± 0.422
0.416MetPhe: 0.416 ± 0.375
1.247MetGly: 1.247 ± 0.731
0.0MetHis: 0.0 ± 0.0
0.416MetIle: 0.416 ± 0.555
0.416MetLys: 0.416 ± 0.344
1.247MetLeu: 1.247 ± 0.411
0.0MetMet: 0.0 ± 0.0
1.247MetAsn: 1.247 ± 0.388
1.663MetPro: 1.663 ± 0.784
0.416MetGln: 0.416 ± 0.375
1.663MetArg: 1.663 ± 1.239
0.832MetSer: 0.832 ± 0.432
1.663MetThr: 1.663 ± 0.641
0.832MetVal: 0.832 ± 0.689
0.416MetTrp: 0.416 ± 0.422
0.416MetTyr: 0.416 ± 0.422
0.0MetXaa: 0.0 ± 0.0
Asn
3.326AsnAla: 3.326 ± 1.081
1.663AsnCys: 1.663 ± 0.61
1.663AsnAsp: 1.663 ± 1.377
1.663AsnGlu: 1.663 ± 0.984
2.911AsnPhe: 2.911 ± 1.188
4.574AsnGly: 4.574 ± 2.019
0.416AsnHis: 0.416 ± 0.344
2.495AsnIle: 2.495 ± 1.295
3.742AsnLys: 3.742 ± 1.209
1.663AsnLeu: 1.663 ± 0.455
0.832AsnMet: 0.832 ± 0.461
3.326AsnAsn: 3.326 ± 1.727
3.326AsnPro: 3.326 ± 1.673
2.495AsnGln: 2.495 ± 0.94
2.079AsnArg: 2.079 ± 0.834
4.158AsnSer: 4.158 ± 1.702
3.742AsnThr: 3.742 ± 0.794
1.247AsnVal: 1.247 ± 0.654
0.832AsnTrp: 0.832 ± 0.62
0.416AsnTyr: 0.416 ± 0.344
0.0AsnXaa: 0.0 ± 0.0
Pro
4.158ProAla: 4.158 ± 1.611
1.247ProCys: 1.247 ± 0.935
4.99ProAsp: 4.99 ± 1.622
2.495ProGlu: 2.495 ± 1.192
2.495ProPhe: 2.495 ± 0.811
4.99ProGly: 4.99 ± 1.677
0.0ProHis: 0.0 ± 0.0
2.495ProIle: 2.495 ± 0.739
4.99ProLys: 4.99 ± 0.84
7.9ProLeu: 7.9 ± 2.863
0.416ProMet: 0.416 ± 0.634
1.247ProAsn: 1.247 ± 0.731
4.574ProPro: 4.574 ± 1.572
2.495ProGln: 2.495 ± 0.643
2.079ProArg: 2.079 ± 0.61
4.574ProSer: 4.574 ± 1.444
4.99ProThr: 4.99 ± 2.284
1.663ProVal: 1.663 ± 0.787
0.416ProTrp: 0.416 ± 0.422
2.079ProTyr: 2.079 ± 1.154
0.0ProXaa: 0.0 ± 0.0
Gln
1.663GlnAla: 1.663 ± 0.747
1.663GlnCys: 1.663 ± 0.757
2.495GlnAsp: 2.495 ± 0.853
2.079GlnGlu: 2.079 ± 1.356
2.079GlnPhe: 2.079 ± 0.744
2.079GlnGly: 2.079 ± 0.899
1.247GlnHis: 1.247 ± 1.274
2.079GlnIle: 2.079 ± 0.72
1.663GlnLys: 1.663 ± 0.666
5.821GlnLeu: 5.821 ± 1.763
1.247GlnMet: 1.247 ± 0.648
3.326GlnAsn: 3.326 ± 1.681
2.495GlnPro: 2.495 ± 0.753
4.158GlnGln: 4.158 ± 1.222
2.495GlnArg: 2.495 ± 1.016
2.495GlnSer: 2.495 ± 0.748
4.158GlnThr: 4.158 ± 0.889
3.326GlnVal: 3.326 ± 1.241
0.832GlnTrp: 0.832 ± 0.399
0.832GlnTyr: 0.832 ± 0.749
0.0GlnXaa: 0.0 ± 0.0
Arg
1.663ArgAla: 1.663 ± 0.65
1.247ArgCys: 1.247 ± 0.858
2.495ArgAsp: 2.495 ± 0.871
2.911ArgGlu: 2.911 ± 0.974
3.742ArgPhe: 3.742 ± 1.076
3.742ArgGly: 3.742 ± 1.867
1.247ArgHis: 1.247 ± 0.726
2.079ArgIle: 2.079 ± 0.791
4.574ArgLys: 4.574 ± 1.065
6.237ArgLeu: 6.237 ± 1.495
0.832ArgMet: 0.832 ± 0.845
0.832ArgAsn: 0.832 ± 0.716
2.911ArgPro: 2.911 ± 1.015
3.326ArgGln: 3.326 ± 1.562
5.405ArgArg: 5.405 ± 2.409
2.911ArgSer: 2.911 ± 0.554
2.911ArgThr: 2.911 ± 1.209
3.742ArgVal: 3.742 ± 0.895
0.0ArgTrp: 0.0 ± 0.0
0.416ArgTyr: 0.416 ± 0.503
0.0ArgXaa: 0.0 ± 0.0
Ser
2.911SerAla: 2.911 ± 0.989
1.663SerCys: 1.663 ± 1.825
3.742SerAsp: 3.742 ± 1.333
5.405SerGlu: 5.405 ± 1.235
3.326SerPhe: 3.326 ± 1.257
4.574SerGly: 4.574 ± 1.317
2.495SerHis: 2.495 ± 0.821
5.405SerIle: 5.405 ± 1.593
3.742SerLys: 3.742 ± 1.96
9.148SerLeu: 9.148 ± 1.85
0.416SerMet: 0.416 ± 0.375
3.326SerAsn: 3.326 ± 1.523
4.158SerPro: 4.158 ± 2.054
1.663SerGln: 1.663 ± 0.691
3.742SerArg: 3.742 ± 1.072
7.069SerSer: 7.069 ± 2.728
4.158SerThr: 4.158 ± 1.304
4.158SerVal: 4.158 ± 1.052
0.416SerTrp: 0.416 ± 0.344
2.079SerTyr: 2.079 ± 0.902
0.0SerXaa: 0.0 ± 0.0
Thr
2.495ThrAla: 2.495 ± 0.804
0.416ThrCys: 0.416 ± 0.375
3.742ThrAsp: 3.742 ± 1.496
5.405ThrGlu: 5.405 ± 1.127
0.832ThrPhe: 0.832 ± 1.269
7.069ThrGly: 7.069 ± 1.673
0.832ThrHis: 0.832 ± 0.417
4.158ThrIle: 4.158 ± 2.144
3.326ThrLys: 3.326 ± 1.382
3.742ThrLeu: 3.742 ± 1.306
1.663ThrMet: 1.663 ± 0.603
2.911ThrAsn: 2.911 ± 0.927
6.237ThrPro: 6.237 ± 2.067
3.326ThrGln: 3.326 ± 1.11
3.326ThrArg: 3.326 ± 1.111
6.653ThrSer: 6.653 ± 1.949
2.911ThrThr: 2.911 ± 1.487
4.99ThrVal: 4.99 ± 1.141
0.416ThrTrp: 0.416 ± 0.422
1.663ThrTyr: 1.663 ± 0.995
0.0ThrXaa: 0.0 ± 0.0
Val
3.742ValAla: 3.742 ± 0.657
0.832ValCys: 0.832 ± 1.11
4.158ValAsp: 4.158 ± 1.053
2.495ValGlu: 2.495 ± 0.927
1.663ValPhe: 1.663 ± 0.272
2.079ValGly: 2.079 ± 0.776
1.663ValHis: 1.663 ± 0.578
2.911ValIle: 2.911 ± 0.52
3.742ValLys: 3.742 ± 0.896
2.495ValLeu: 2.495 ± 1.259
0.832ValMet: 0.832 ± 0.637
2.495ValAsn: 2.495 ± 0.821
3.326ValPro: 3.326 ± 1.663
3.742ValGln: 3.742 ± 1.321
3.326ValArg: 3.326 ± 1.858
4.99ValSer: 4.99 ± 1.329
5.821ValThr: 5.821 ± 1.035
4.158ValVal: 4.158 ± 1.374
0.416ValTrp: 0.416 ± 0.375
3.326ValTyr: 3.326 ± 1.4
0.0ValXaa: 0.0 ± 0.0
Trp
0.416TrpAla: 0.416 ± 0.344
0.0TrpCys: 0.0 ± 0.0
0.416TrpAsp: 0.416 ± 0.375
0.832TrpGlu: 0.832 ± 0.845
0.0TrpPhe: 0.0 ± 0.0
0.416TrpGly: 0.416 ± 0.503
0.416TrpHis: 0.416 ± 0.422
1.247TrpIle: 1.247 ± 1.033
2.495TrpLys: 2.495 ± 1.035
1.247TrpLeu: 1.247 ± 0.388
0.416TrpMet: 0.416 ± 0.375
1.247TrpAsn: 1.247 ± 0.726
0.416TrpPro: 0.416 ± 0.375
0.0TrpGln: 0.0 ± 0.0
1.247TrpArg: 1.247 ± 0.698
0.832TrpSer: 0.832 ± 0.456
0.0TrpThr: 0.0 ± 0.0
1.247TrpVal: 1.247 ± 0.411
0.0TrpTrp: 0.0 ± 0.0
0.416TrpTyr: 0.416 ± 0.344
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.326TyrAla: 3.326 ± 1.154
0.832TyrCys: 0.832 ± 1.269
1.663TyrAsp: 1.663 ± 0.979
1.663TyrGlu: 1.663 ± 0.621
2.911TyrPhe: 2.911 ± 1.126
2.079TyrGly: 2.079 ± 0.802
0.0TyrHis: 0.0 ± 0.0
1.663TyrIle: 1.663 ± 0.911
2.079TyrLys: 2.079 ± 1.092
3.326TyrLeu: 3.326 ± 0.998
0.832TyrMet: 0.832 ± 0.689
1.663TyrAsn: 1.663 ± 0.65
1.663TyrPro: 1.663 ± 0.747
1.663TyrGln: 1.663 ± 0.554
1.247TyrArg: 1.247 ± 0.885
1.247TyrSer: 1.247 ± 0.435
2.079TyrThr: 2.079 ± 0.489
1.247TyrVal: 1.247 ± 0.714
0.416TyrTrp: 0.416 ± 0.375
2.495TyrTyr: 2.495 ± 2.017
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2406 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski