Amino acid dipepetide frequency for Human papillomavirus 39

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.237AlaAla: 4.237 ± 1.161
1.541AlaCys: 1.541 ± 1.098
2.696AlaAsp: 2.696 ± 1.022
3.467AlaGlu: 3.467 ± 1.373
3.082AlaPhe: 3.082 ± 1.397
1.926AlaGly: 1.926 ± 0.667
0.385AlaHis: 0.385 ± 0.365
4.622AlaIle: 4.622 ± 1.142
1.541AlaLys: 1.541 ± 0.713
3.852AlaLeu: 3.852 ± 1.38
2.696AlaMet: 2.696 ± 1.046
2.311AlaAsn: 2.311 ± 0.721
2.311AlaPro: 2.311 ± 1.063
3.467AlaGln: 3.467 ± 1.359
2.696AlaArg: 2.696 ± 0.915
3.082AlaSer: 3.082 ± 1.454
6.934AlaThr: 6.934 ± 1.305
2.696AlaVal: 2.696 ± 0.743
0.0AlaTrp: 0.0 ± 0.0
1.156AlaTyr: 1.156 ± 0.873
0.0AlaXaa: 0.0 ± 0.0
Cys
2.311CysAla: 2.311 ± 0.87
1.156CysCys: 1.156 ± 1.104
0.385CysAsp: 0.385 ± 0.33
0.0CysGlu: 0.0 ± 0.0
0.77CysPhe: 0.77 ± 0.435
1.156CysGly: 1.156 ± 0.473
0.385CysHis: 0.385 ± 0.533
1.541CysIle: 1.541 ± 0.924
2.696CysLys: 2.696 ± 0.827
1.541CysLeu: 1.541 ± 0.648
1.156CysMet: 1.156 ± 0.755
1.926CysAsn: 1.926 ± 0.906
3.467CysPro: 3.467 ± 1.213
1.541CysGln: 1.541 ± 0.752
1.156CysArg: 1.156 ± 0.981
1.541CysSer: 1.541 ± 0.743
2.696CysThr: 2.696 ± 0.873
2.696CysVal: 2.696 ± 0.835
1.156CysTrp: 1.156 ± 0.544
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.852AspAla: 3.852 ± 1.044
1.156AspCys: 1.156 ± 0.667
2.311AspAsp: 2.311 ± 1.334
3.852AspGlu: 3.852 ± 2.443
1.541AspPhe: 1.541 ± 0.567
5.008AspGly: 5.008 ± 0.809
1.541AspHis: 1.541 ± 0.961
5.008AspIle: 5.008 ± 1.597
3.467AspLys: 3.467 ± 1.206
4.622AspLeu: 4.622 ± 1.482
1.156AspMet: 1.156 ± 0.775
2.696AspAsn: 2.696 ± 0.853
2.311AspPro: 2.311 ± 0.905
1.156AspGln: 1.156 ± 0.449
1.541AspArg: 1.541 ± 0.946
6.549AspSer: 6.549 ± 1.141
7.319AspThr: 7.319 ± 0.861
1.926AspVal: 1.926 ± 1.026
0.77AspTrp: 0.77 ± 0.66
1.541AspTyr: 1.541 ± 1.083
0.0AspXaa: 0.0 ± 0.0
Glu
1.156GluAla: 1.156 ± 0.538
0.77GluCys: 0.77 ± 0.631
3.082GluAsp: 3.082 ± 1.077
1.926GluGlu: 1.926 ± 0.495
1.156GluPhe: 1.156 ± 0.755
2.696GluGly: 2.696 ± 1.51
1.156GluHis: 1.156 ± 0.449
2.311GluIle: 2.311 ± 1.542
1.541GluLys: 1.541 ± 0.752
3.852GluLeu: 3.852 ± 1.478
0.0GluMet: 0.0 ± 0.0
3.082GluAsn: 3.082 ± 0.845
5.393GluPro: 5.393 ± 1.968
1.156GluGln: 1.156 ± 0.723
1.541GluArg: 1.541 ± 0.794
3.467GluSer: 3.467 ± 0.737
3.082GluThr: 3.082 ± 0.713
5.008GluVal: 5.008 ± 1.088
0.77GluTrp: 0.77 ± 0.435
1.926GluTyr: 1.926 ± 0.847
0.0GluXaa: 0.0 ± 0.0
Phe
1.541PheAla: 1.541 ± 0.726
0.77PheCys: 0.77 ± 0.426
2.696PheAsp: 2.696 ± 1.398
1.926PheGlu: 1.926 ± 0.794
2.311PhePhe: 2.311 ± 1.46
1.156PheGly: 1.156 ± 0.775
0.77PheHis: 0.77 ± 0.979
1.926PheIle: 1.926 ± 0.826
3.082PheLys: 3.082 ± 1.54
5.393PheLeu: 5.393 ± 1.242
0.385PheMet: 0.385 ± 0.533
1.156PheAsn: 1.156 ± 0.344
1.156PhePro: 1.156 ± 0.667
0.385PheGln: 0.385 ± 0.368
0.77PheArg: 0.77 ± 0.426
2.696PheSer: 2.696 ± 1.171
1.926PheThr: 1.926 ± 0.649
2.696PheVal: 2.696 ± 2.454
1.156PheTrp: 1.156 ± 0.724
0.77PheTyr: 0.77 ± 0.571
0.0PheXaa: 0.0 ± 0.0
Gly
1.541GlyAla: 1.541 ± 0.664
1.156GlyCys: 1.156 ± 0.449
5.393GlyAsp: 5.393 ± 2.113
1.926GlyGlu: 1.926 ± 0.625
1.156GlyPhe: 1.156 ± 0.538
2.696GlyGly: 2.696 ± 0.864
1.926GlyHis: 1.926 ± 1.121
4.622GlyIle: 4.622 ± 0.832
3.852GlyLys: 3.852 ± 0.961
2.311GlyLeu: 2.311 ± 0.475
1.156GlyMet: 1.156 ± 0.746
1.926GlyAsn: 1.926 ± 0.754
2.696GlyPro: 2.696 ± 1.226
1.156GlyGln: 1.156 ± 0.544
2.311GlyArg: 2.311 ± 0.905
4.622GlySer: 4.622 ± 0.717
9.245GlyThr: 9.245 ± 4.155
2.696GlyVal: 2.696 ± 0.854
0.385GlyTrp: 0.385 ± 0.33
3.082GlyTyr: 3.082 ± 0.879
0.0GlyXaa: 0.0 ± 0.0
His
1.156HisAla: 1.156 ± 0.87
0.385HisCys: 0.385 ± 0.369
0.385HisAsp: 0.385 ± 0.365
0.385HisGlu: 0.385 ± 0.533
1.541HisPhe: 1.541 ± 0.962
1.541HisGly: 1.541 ± 0.8
0.0HisHis: 0.0 ± 0.0
0.385HisIle: 0.385 ± 0.33
1.156HisLys: 1.156 ± 0.796
2.311HisLeu: 2.311 ± 1.279
0.385HisMet: 0.385 ± 0.33
1.926HisAsn: 1.926 ± 0.825
1.541HisPro: 1.541 ± 0.589
1.541HisGln: 1.541 ± 1.114
1.156HisArg: 1.156 ± 1.009
1.926HisSer: 1.926 ± 1.143
1.541HisThr: 1.541 ± 0.929
1.926HisVal: 1.926 ± 0.752
1.156HisTrp: 1.156 ± 0.563
1.156HisTyr: 1.156 ± 0.47
0.0HisXaa: 0.0 ± 0.0
Ile
2.696IleAla: 2.696 ± 1.35
1.926IleCys: 1.926 ± 1.221
3.082IleAsp: 3.082 ± 0.845
2.696IleGlu: 2.696 ± 0.832
1.926IlePhe: 1.926 ± 0.55
1.926IleGly: 1.926 ± 1.031
1.541IleHis: 1.541 ± 0.779
1.926IleIle: 1.926 ± 1.038
1.541IleLys: 1.541 ± 0.713
3.852IleLeu: 3.852 ± 1.672
0.385IleMet: 0.385 ± 0.75
1.541IleAsn: 1.541 ± 0.87
5.008IlePro: 5.008 ± 1.711
2.696IleGln: 2.696 ± 1.121
2.696IleArg: 2.696 ± 1.193
5.008IleSer: 5.008 ± 1.537
3.467IleThr: 3.467 ± 1.767
2.696IleVal: 2.696 ± 0.968
0.77IleTrp: 0.77 ± 0.503
2.696IleTyr: 2.696 ± 1.133
0.0IleXaa: 0.0 ± 0.0
Lys
1.926LysAla: 1.926 ± 0.536
3.852LysCys: 3.852 ± 1.605
1.926LysAsp: 1.926 ± 1.018
1.541LysGlu: 1.541 ± 0.64
3.852LysPhe: 3.852 ± 1.587
3.467LysGly: 3.467 ± 0.826
0.77LysHis: 0.77 ± 0.426
2.311LysIle: 2.311 ± 1.201
3.082LysLys: 3.082 ± 0.741
2.311LysLeu: 2.311 ± 1.633
0.385LysMet: 0.385 ± 0.348
3.082LysAsn: 3.082 ± 1.249
1.541LysPro: 1.541 ± 0.997
3.467LysGln: 3.467 ± 0.574
5.008LysArg: 5.008 ± 0.871
2.696LysSer: 2.696 ± 1.529
2.696LysThr: 2.696 ± 1.206
2.311LysVal: 2.311 ± 0.852
1.156LysTrp: 1.156 ± 0.848
3.082LysTyr: 3.082 ± 0.983
0.0LysXaa: 0.0 ± 0.0
Leu
3.082LeuAla: 3.082 ± 1.229
2.311LeuCys: 2.311 ± 1.166
6.934LeuAsp: 6.934 ± 0.654
3.467LeuGlu: 3.467 ± 1.251
3.082LeuPhe: 3.082 ± 1.329
4.237LeuGly: 4.237 ± 1.917
2.696LeuHis: 2.696 ± 0.612
2.696LeuIle: 2.696 ± 1.299
5.393LeuLys: 5.393 ± 1.401
7.319LeuLeu: 7.319 ± 3.539
2.311LeuMet: 2.311 ± 0.935
2.311LeuAsn: 2.311 ± 0.95
3.852LeuPro: 3.852 ± 2.256
8.089LeuGln: 8.089 ± 1.698
4.237LeuArg: 4.237 ± 1.179
3.852LeuSer: 3.852 ± 1.096
5.393LeuThr: 5.393 ± 0.63
4.622LeuVal: 4.622 ± 1.648
0.77LeuTrp: 0.77 ± 0.738
3.852LeuTyr: 3.852 ± 1.094
0.0LeuXaa: 0.0 ± 0.0
Met
2.311MetAla: 2.311 ± 0.745
1.156MetCys: 1.156 ± 0.572
1.156MetAsp: 1.156 ± 0.988
1.156MetGlu: 1.156 ± 0.47
0.385MetPhe: 0.385 ± 0.368
0.0MetGly: 0.0 ± 0.0
0.77MetHis: 0.77 ± 0.435
1.156MetIle: 1.156 ± 1.559
1.156MetLys: 1.156 ± 0.736
1.926MetLeu: 1.926 ± 1.649
0.385MetMet: 0.385 ± 0.369
1.541MetAsn: 1.541 ± 0.919
0.385MetPro: 0.385 ± 0.365
0.385MetGln: 0.385 ± 0.33
0.77MetArg: 0.77 ± 0.549
1.926MetSer: 1.926 ± 1.07
0.77MetThr: 0.77 ± 0.503
2.311MetVal: 2.311 ± 0.905
0.77MetTrp: 0.77 ± 0.832
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.082AsnAla: 3.082 ± 1.403
0.77AsnCys: 0.77 ± 0.66
0.77AsnAsp: 0.77 ± 0.66
1.541AsnGlu: 1.541 ± 0.846
1.156AsnPhe: 1.156 ± 0.667
2.696AsnGly: 2.696 ± 0.825
0.385AsnHis: 0.385 ± 0.533
3.082AsnIle: 3.082 ± 1.377
1.926AsnLys: 1.926 ± 1.129
0.77AsnLeu: 0.77 ± 0.855
0.77AsnMet: 0.77 ± 0.4
3.467AsnAsn: 3.467 ± 0.871
3.467AsnPro: 3.467 ± 0.499
1.156AsnGln: 1.156 ± 0.641
2.311AsnArg: 2.311 ± 1.041
2.696AsnSer: 2.696 ± 0.698
8.86AsnThr: 8.86 ± 1.768
2.311AsnVal: 2.311 ± 1.176
0.77AsnTrp: 0.77 ± 0.426
2.311AsnTyr: 2.311 ± 0.871
0.0AsnXaa: 0.0 ± 0.0
Pro
5.008ProAla: 5.008 ± 1.926
0.77ProCys: 0.77 ± 0.503
5.008ProAsp: 5.008 ± 1.216
1.541ProGlu: 1.541 ± 0.852
1.156ProPhe: 1.156 ± 0.344
0.77ProGly: 0.77 ± 0.392
0.385ProHis: 0.385 ± 0.61
2.696ProIle: 2.696 ± 0.73
2.696ProLys: 2.696 ± 0.943
7.704ProLeu: 7.704 ± 2.623
0.77ProMet: 0.77 ± 0.819
1.926ProAsn: 1.926 ± 0.661
5.778ProPro: 5.778 ± 1.451
1.541ProGln: 1.541 ± 1.038
1.926ProArg: 1.926 ± 0.763
7.704ProSer: 7.704 ± 3.121
5.008ProThr: 5.008 ± 1.259
3.852ProVal: 3.852 ± 1.693
0.77ProTrp: 0.77 ± 0.838
3.082ProTyr: 3.082 ± 0.979
0.0ProXaa: 0.0 ± 0.0
Gln
3.082GlnAla: 3.082 ± 1.464
1.926GlnCys: 1.926 ± 1.026
3.467GlnAsp: 3.467 ± 1.029
1.541GlnGlu: 1.541 ± 0.764
1.541GlnPhe: 1.541 ± 0.852
2.311GlnGly: 2.311 ± 0.767
0.77GlnHis: 0.77 ± 0.549
1.541GlnIle: 1.541 ± 0.72
3.082GlnLys: 3.082 ± 1.353
5.393GlnLeu: 5.393 ± 3.472
1.926GlnMet: 1.926 ± 0.739
0.77GlnAsn: 0.77 ± 0.66
1.926GlnPro: 1.926 ± 0.843
1.926GlnGln: 1.926 ± 0.953
2.696GlnArg: 2.696 ± 0.84
3.467GlnSer: 3.467 ± 1.748
2.696GlnThr: 2.696 ± 0.636
1.926GlnVal: 1.926 ± 0.795
1.156GlnTrp: 1.156 ± 0.636
0.385GlnTyr: 0.385 ± 0.368
0.0GlnXaa: 0.0 ± 0.0
Arg
2.311ArgAla: 2.311 ± 0.899
1.156ArgCys: 1.156 ± 0.981
2.311ArgAsp: 2.311 ± 1.062
3.852ArgGlu: 3.852 ± 1.527
1.156ArgPhe: 1.156 ± 1.033
3.082ArgGly: 3.082 ± 0.857
1.541ArgHis: 1.541 ± 0.997
0.77ArgIle: 0.77 ± 0.4
4.237ArgLys: 4.237 ± 1.088
5.778ArgLeu: 5.778 ± 1.412
0.385ArgMet: 0.385 ± 0.369
1.156ArgAsn: 1.156 ± 0.989
4.622ArgPro: 4.622 ± 1.037
2.696ArgGln: 2.696 ± 1.048
5.778ArgArg: 5.778 ± 2.831
1.541ArgSer: 1.541 ± 0.852
2.311ArgThr: 2.311 ± 0.773
2.696ArgVal: 2.696 ± 1.214
0.385ArgTrp: 0.385 ± 0.33
3.852ArgTyr: 3.852 ± 1.067
0.0ArgXaa: 0.0 ± 0.0
Ser
3.082SerAla: 3.082 ± 1.549
2.696SerCys: 2.696 ± 1.092
5.393SerAsp: 5.393 ± 0.995
1.926SerGlu: 1.926 ± 0.612
1.156SerPhe: 1.156 ± 0.636
5.393SerGly: 5.393 ± 2.358
1.156SerHis: 1.156 ± 0.636
3.467SerIle: 3.467 ± 1.624
2.311SerLys: 2.311 ± 0.812
6.163SerLeu: 6.163 ± 1.05
1.926SerMet: 1.926 ± 1.018
4.622SerAsn: 4.622 ± 1.545
2.696SerPro: 2.696 ± 1.107
2.311SerGln: 2.311 ± 0.748
4.237SerArg: 4.237 ± 1.255
8.475SerSer: 8.475 ± 2.813
9.245SerThr: 9.245 ± 1.884
7.704SerVal: 7.704 ± 1.183
0.385SerTrp: 0.385 ± 0.365
1.926SerTyr: 1.926 ± 0.592
0.0SerXaa: 0.0 ± 0.0
Thr
3.467ThrAla: 3.467 ± 1.062
2.311ThrCys: 2.311 ± 0.859
5.008ThrAsp: 5.008 ± 1.848
6.549ThrGlu: 6.549 ± 3.179
1.926ThrPhe: 1.926 ± 1.01
8.475ThrGly: 8.475 ± 2.55
2.696ThrHis: 2.696 ± 1.2
4.622ThrIle: 4.622 ± 0.963
3.082ThrLys: 3.082 ± 0.866
8.089ThrLeu: 8.089 ± 1.945
1.541ThrMet: 1.541 ± 0.345
2.696ThrAsn: 2.696 ± 0.854
6.163ThrPro: 6.163 ± 2.402
4.237ThrGln: 4.237 ± 0.566
4.237ThrArg: 4.237 ± 0.656
7.704ThrSer: 7.704 ± 3.016
11.941ThrThr: 11.941 ± 3.679
7.319ThrVal: 7.319 ± 1.436
1.156ThrTrp: 1.156 ± 0.473
3.467ThrTyr: 3.467 ± 1.209
0.0ThrXaa: 0.0 ± 0.0
Val
3.467ValAla: 3.467 ± 0.841
1.541ValCys: 1.541 ± 1.83
4.622ValAsp: 4.622 ± 1.197
3.467ValGlu: 3.467 ± 0.696
3.467ValPhe: 3.467 ± 2.865
3.467ValGly: 3.467 ± 1.957
2.696ValHis: 2.696 ± 1.07
1.926ValIle: 1.926 ± 1.01
1.156ValLys: 1.156 ± 0.473
2.696ValLeu: 2.696 ± 0.957
0.77ValMet: 0.77 ± 0.527
1.926ValAsn: 1.926 ± 0.741
4.622ValPro: 4.622 ± 1.027
3.082ValGln: 3.082 ± 1.543
3.082ValArg: 3.082 ± 0.945
5.393ValSer: 5.393 ± 1.337
6.934ValThr: 6.934 ± 1.446
4.237ValVal: 4.237 ± 1.34
1.541ValTrp: 1.541 ± 1.579
4.237ValTyr: 4.237 ± 2.056
0.0ValXaa: 0.0 ± 0.0
Trp
0.77TrpAla: 0.77 ± 0.426
1.156TrpCys: 1.156 ± 0.61
0.0TrpAsp: 0.0 ± 0.0
0.385TrpGlu: 0.385 ± 0.369
0.77TrpPhe: 0.77 ± 0.748
1.156TrpGly: 1.156 ± 0.748
1.541TrpHis: 1.541 ± 0.779
1.926TrpIle: 1.926 ± 0.804
1.541TrpLys: 1.541 ± 0.78
1.156TrpLeu: 1.156 ± 0.775
0.0TrpMet: 0.0 ± 0.0
1.541TrpAsn: 1.541 ± 1.003
0.385TrpPro: 0.385 ± 0.33
0.0TrpGln: 0.0 ± 0.0
0.77TrpArg: 0.77 ± 0.426
0.77TrpSer: 0.77 ± 0.45
1.541TrpThr: 1.541 ± 0.919
0.385TrpVal: 0.385 ± 0.33
0.0TrpTrp: 0.0 ± 0.0
0.77TrpTyr: 0.77 ± 0.435
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.622TyrAla: 4.622 ± 1.065
0.77TyrCys: 0.77 ± 0.549
2.696TyrAsp: 2.696 ± 1.367
1.926TyrGlu: 1.926 ± 1.056
1.541TyrPhe: 1.541 ± 0.567
2.696TyrGly: 2.696 ± 1.254
0.385TyrHis: 0.385 ± 0.365
1.541TyrIle: 1.541 ± 0.958
1.926TyrLys: 1.926 ± 0.65
3.467TyrLeu: 3.467 ± 0.965
1.541TyrMet: 1.541 ± 0.72
2.696TyrAsn: 2.696 ± 0.986
0.77TyrPro: 0.77 ± 0.645
1.541TyrGln: 1.541 ± 0.583
2.696TyrArg: 2.696 ± 0.552
1.156TyrSer: 1.156 ± 0.58
3.082TyrThr: 3.082 ± 1.43
2.696TyrVal: 2.696 ± 0.779
1.541TyrTrp: 1.541 ± 0.703
3.467TyrTyr: 3.467 ± 1.424
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (2597 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski