Amino acid dipepetide frequency for human papillomavirus 87

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.276AlaAla: 5.276 ± 1.2
1.623AlaCys: 1.623 ± 0.875
3.247AlaAsp: 3.247 ± 1.849
1.623AlaGlu: 1.623 ± 0.643
1.623AlaPhe: 1.623 ± 0.704
2.841AlaGly: 2.841 ± 1.281
1.218AlaHis: 1.218 ± 0.731
2.841AlaIle: 2.841 ± 0.73
3.247AlaLys: 3.247 ± 1.427
4.87AlaLeu: 4.87 ± 1.404
1.623AlaMet: 1.623 ± 0.713
2.029AlaAsn: 2.029 ± 0.792
4.87AlaPro: 4.87 ± 1.891
2.029AlaGln: 2.029 ± 0.846
5.682AlaArg: 5.682 ± 1.437
5.682AlaSer: 5.682 ± 1.23
5.276AlaThr: 5.276 ± 1.125
2.435AlaVal: 2.435 ± 0.97
0.406AlaTrp: 0.406 ± 0.358
1.623AlaTyr: 1.623 ± 0.882
0.0AlaXaa: 0.0 ± 0.0
Cys
2.029CysAla: 2.029 ± 0.586
0.406CysCys: 0.406 ± 0.514
0.812CysAsp: 0.812 ± 0.574
0.406CysGlu: 0.406 ± 0.395
0.406CysPhe: 0.406 ± 0.395
2.029CysGly: 2.029 ± 1.338
0.406CysHis: 0.406 ± 0.514
1.218CysIle: 1.218 ± 0.601
2.841CysLys: 2.841 ± 1.165
2.435CysLeu: 2.435 ± 1.159
1.623CysMet: 1.623 ± 0.748
0.812CysAsn: 0.812 ± 0.546
2.841CysPro: 2.841 ± 0.832
2.029CysGln: 2.029 ± 0.752
1.623CysArg: 1.623 ± 1.11
0.406CysSer: 0.406 ± 0.514
1.623CysThr: 1.623 ± 0.577
1.623CysVal: 1.623 ± 0.724
1.623CysTrp: 1.623 ± 0.53
0.406CysTyr: 0.406 ± 0.395
0.0CysXaa: 0.0 ± 0.0
Asp
4.058AspAla: 4.058 ± 1.584
1.623AspCys: 1.623 ± 0.776
3.247AspAsp: 3.247 ± 0.758
2.029AspGlu: 2.029 ± 0.586
2.029AspPhe: 2.029 ± 0.399
2.841AspGly: 2.841 ± 0.72
0.812AspHis: 0.812 ± 0.578
3.653AspIle: 3.653 ± 2.066
2.435AspLys: 2.435 ± 1.452
4.464AspLeu: 4.464 ± 0.763
1.218AspMet: 1.218 ± 0.403
3.653AspAsn: 3.653 ± 0.941
3.247AspPro: 3.247 ± 1.017
1.218AspGln: 1.218 ± 0.646
2.029AspArg: 2.029 ± 0.547
5.276AspSer: 5.276 ± 0.771
6.088AspThr: 6.088 ± 1.269
3.247AspVal: 3.247 ± 1.149
1.218AspTrp: 1.218 ± 0.94
1.623AspTyr: 1.623 ± 0.598
0.0AspXaa: 0.0 ± 0.0
Glu
3.653GluAla: 3.653 ± 1.341
0.406GluCys: 0.406 ± 0.567
3.247GluAsp: 3.247 ± 0.753
7.305GluGlu: 7.305 ± 2.344
0.812GluPhe: 0.812 ± 0.427
1.218GluGly: 1.218 ± 0.442
1.623GluHis: 1.623 ± 0.476
1.218GluIle: 1.218 ± 0.646
2.841GluLys: 2.841 ± 1.471
3.653GluLeu: 3.653 ± 1.957
0.406GluMet: 0.406 ± 0.395
1.623GluAsn: 1.623 ± 0.938
3.247GluPro: 3.247 ± 0.917
2.435GluGln: 2.435 ± 0.775
0.812GluArg: 0.812 ± 0.37
2.841GluSer: 2.841 ± 0.664
2.841GluThr: 2.841 ± 0.881
4.87GluVal: 4.87 ± 1.127
0.812GluTrp: 0.812 ± 0.37
2.841GluTyr: 2.841 ± 1.146
0.0GluXaa: 0.0 ± 0.0
Phe
1.623PheAla: 1.623 ± 0.559
1.218PheCys: 1.218 ± 0.526
0.812PheAsp: 0.812 ± 0.631
2.029PheGlu: 2.029 ± 0.985
2.029PhePhe: 2.029 ± 0.757
2.841PheGly: 2.841 ± 0.803
0.812PheHis: 0.812 ± 0.565
1.218PheIle: 1.218 ± 0.6
2.029PheLys: 2.029 ± 0.813
5.276PheLeu: 5.276 ± 1.439
0.812PheMet: 0.812 ± 0.627
1.218PheAsn: 1.218 ± 0.999
2.029PhePro: 2.029 ± 0.657
1.218PheGln: 1.218 ± 0.631
2.029PheArg: 2.029 ± 0.773
0.812PheSer: 0.812 ± 0.426
2.029PheThr: 2.029 ± 0.82
2.435PheVal: 2.435 ± 0.884
0.812PheTrp: 0.812 ± 0.37
1.218PheTyr: 1.218 ± 0.398
0.0PheXaa: 0.0 ± 0.0
Gly
4.058GlyAla: 4.058 ± 0.816
2.029GlyCys: 2.029 ± 0.772
6.088GlyAsp: 6.088 ± 1.21
3.247GlyGlu: 3.247 ± 1.363
1.218GlyPhe: 1.218 ± 0.794
5.682GlyGly: 5.682 ± 1.329
3.247GlyHis: 3.247 ± 0.832
2.029GlyIle: 2.029 ± 0.409
2.029GlyLys: 2.029 ± 0.906
4.058GlyLeu: 4.058 ± 0.695
0.812GlyMet: 0.812 ± 0.37
1.218GlyAsn: 1.218 ± 0.94
3.247GlyPro: 3.247 ± 1.261
4.058GlyGln: 4.058 ± 0.93
2.841GlyArg: 2.841 ± 0.586
2.841GlySer: 2.841 ± 0.834
7.711GlyThr: 7.711 ± 1.973
3.653GlyVal: 3.653 ± 1.004
0.0GlyTrp: 0.0 ± 0.0
2.435GlyTyr: 2.435 ± 0.494
0.0GlyXaa: 0.0 ± 0.0
His
2.435HisAla: 2.435 ± 1.156
0.812HisCys: 0.812 ± 0.574
0.812HisAsp: 0.812 ± 0.426
0.406HisGlu: 0.406 ± 0.395
1.623HisPhe: 1.623 ± 0.704
1.623HisGly: 1.623 ± 1.922
0.406HisHis: 0.406 ± 0.313
2.435HisIle: 2.435 ± 0.723
1.218HisLys: 1.218 ± 0.727
1.623HisLeu: 1.623 ± 0.905
0.812HisMet: 0.812 ± 0.631
0.812HisAsn: 0.812 ± 0.421
2.029HisPro: 2.029 ± 0.59
0.812HisGln: 0.812 ± 0.789
1.218HisArg: 1.218 ± 0.903
1.218HisSer: 1.218 ± 0.775
2.029HisThr: 2.029 ± 0.615
1.623HisVal: 1.623 ± 0.336
1.218HisTrp: 1.218 ± 0.625
1.623HisTyr: 1.623 ± 0.336
0.0HisXaa: 0.0 ± 0.0
Ile
1.623IleAla: 1.623 ± 0.759
1.623IleCys: 1.623 ± 0.691
2.029IleAsp: 2.029 ± 0.469
2.435IleGlu: 2.435 ± 1.279
2.029IlePhe: 2.029 ± 1.07
4.058IleGly: 4.058 ± 1.426
1.218IleHis: 1.218 ± 1.334
3.653IleIle: 3.653 ± 1.454
0.812IleLys: 0.812 ± 0.574
2.841IleLeu: 2.841 ± 0.862
0.406IleMet: 0.406 ± 0.568
0.812IleAsn: 0.812 ± 0.715
3.247IlePro: 3.247 ± 0.916
2.435IleGln: 2.435 ± 0.536
0.812IleArg: 0.812 ± 0.631
3.653IleSer: 3.653 ± 0.615
2.841IleThr: 2.841 ± 0.629
2.841IleVal: 2.841 ± 0.648
0.0IleTrp: 0.0 ± 0.0
3.247IleTyr: 3.247 ± 1.325
0.0IleXaa: 0.0 ± 0.0
Lys
3.247LysAla: 3.247 ± 1.886
2.029LysCys: 2.029 ± 0.936
2.029LysAsp: 2.029 ± 0.772
3.247LysGlu: 3.247 ± 1.339
1.623LysPhe: 1.623 ± 0.938
3.247LysGly: 3.247 ± 1.259
1.623LysHis: 1.623 ± 0.612
0.812LysIle: 0.812 ± 0.546
2.841LysLys: 2.841 ± 0.95
3.247LysLeu: 3.247 ± 1.139
0.812LysMet: 0.812 ± 0.624
0.406LysAsn: 0.406 ± 0.395
2.029LysPro: 2.029 ± 1.154
1.623LysGln: 1.623 ± 0.706
4.87LysArg: 4.87 ± 0.844
3.653LysSer: 3.653 ± 1.689
2.029LysThr: 2.029 ± 0.875
3.653LysVal: 3.653 ± 1.222
1.218LysTrp: 1.218 ± 0.799
2.841LysTyr: 2.841 ± 0.92
0.0LysXaa: 0.0 ± 0.0
Leu
5.276LeuAla: 5.276 ± 1.425
2.435LeuCys: 2.435 ± 2.083
7.305LeuAsp: 7.305 ± 1.262
2.029LeuGlu: 2.029 ± 1.265
5.276LeuPhe: 5.276 ± 1.449
3.247LeuGly: 3.247 ± 0.862
4.058LeuHis: 4.058 ± 1.068
2.841LeuIle: 2.841 ± 1.19
4.87LeuLys: 4.87 ± 1.549
9.74LeuLeu: 9.74 ± 3.049
1.218LeuMet: 1.218 ± 0.824
1.623LeuAsn: 1.623 ± 0.612
2.841LeuPro: 2.841 ± 1.277
8.523LeuGln: 8.523 ± 1.982
5.682LeuArg: 5.682 ± 1.962
6.494LeuSer: 6.494 ± 1.05
4.464LeuThr: 4.464 ± 1.146
3.653LeuVal: 3.653 ± 1.125
1.218LeuTrp: 1.218 ± 0.693
5.276LeuTyr: 5.276 ± 0.994
0.0LeuXaa: 0.0 ± 0.0
Met
2.029MetAla: 2.029 ± 0.976
1.218MetCys: 1.218 ± 0.625
1.218MetAsp: 1.218 ± 0.637
0.812MetGlu: 0.812 ± 0.427
1.218MetPhe: 1.218 ± 0.631
1.218MetGly: 1.218 ± 0.554
0.406MetHis: 0.406 ± 0.567
0.812MetIle: 0.812 ± 0.546
0.0MetLys: 0.0 ± 0.0
2.029MetLeu: 2.029 ± 0.627
0.406MetMet: 0.406 ± 0.568
0.812MetAsn: 0.812 ± 0.609
0.812MetPro: 0.812 ± 0.565
0.812MetGln: 0.812 ± 0.426
0.406MetArg: 0.406 ± 0.358
2.435MetSer: 2.435 ± 1.184
0.812MetThr: 0.812 ± 0.574
3.653MetVal: 3.653 ± 1.331
1.218MetTrp: 1.218 ± 0.975
0.406MetTyr: 0.406 ± 0.358
0.0MetXaa: 0.0 ± 0.0
Asn
0.812AsnAla: 0.812 ± 0.627
1.218AsnCys: 1.218 ± 0.601
0.0AsnAsp: 0.0 ± 0.0
2.029AsnGlu: 2.029 ± 1.18
1.218AsnPhe: 1.218 ± 0.669
2.841AsnGly: 2.841 ± 1.121
0.0AsnHis: 0.0 ± 0.0
2.435AsnIle: 2.435 ± 0.975
2.841AsnLys: 2.841 ± 1.169
2.841AsnLeu: 2.841 ± 1.308
0.812AsnMet: 0.812 ± 0.693
2.841AsnAsn: 2.841 ± 1.43
1.623AsnPro: 1.623 ± 0.673
0.406AsnGln: 0.406 ± 0.333
2.029AsnArg: 2.029 ± 0.629
2.029AsnSer: 2.029 ± 0.469
3.653AsnThr: 3.653 ± 1.266
2.435AsnVal: 2.435 ± 0.783
0.812AsnTrp: 0.812 ± 0.574
0.406AsnTyr: 0.406 ± 0.358
0.0AsnXaa: 0.0 ± 0.0
Pro
3.247ProAla: 3.247 ± 1.648
0.406ProCys: 0.406 ± 0.313
4.87ProAsp: 4.87 ± 1.262
2.029ProGlu: 2.029 ± 0.657
2.029ProPhe: 2.029 ± 0.755
2.841ProGly: 2.841 ± 0.482
0.812ProHis: 0.812 ± 0.37
2.841ProIle: 2.841 ± 0.947
4.058ProLys: 4.058 ± 0.918
8.117ProLeu: 8.117 ± 1.226
0.812ProMet: 0.812 ± 0.715
2.435ProAsn: 2.435 ± 0.858
7.305ProPro: 7.305 ± 2.88
1.218ProGln: 1.218 ± 0.442
3.653ProArg: 3.653 ± 1.719
6.088ProSer: 6.088 ± 2.726
4.87ProThr: 4.87 ± 1.538
4.87ProVal: 4.87 ± 0.959
1.218ProTrp: 1.218 ± 0.685
3.247ProTyr: 3.247 ± 1.576
0.0ProXaa: 0.0 ± 0.0
Gln
2.029GlnAla: 2.029 ± 0.792
1.218GlnCys: 1.218 ± 0.601
2.841GlnAsp: 2.841 ± 1.166
2.029GlnGlu: 2.029 ± 0.583
2.435GlnPhe: 2.435 ± 0.891
4.058GlnGly: 4.058 ± 1.23
0.0GlnHis: 0.0 ± 0.0
0.406GlnIle: 0.406 ± 0.395
1.623GlnLys: 1.623 ± 1.059
6.088GlnLeu: 6.088 ± 2.495
1.218GlnMet: 1.218 ± 0.999
1.623GlnAsn: 1.623 ± 0.748
2.841GlnPro: 2.841 ± 0.482
3.247GlnGln: 3.247 ± 1.163
2.841GlnArg: 2.841 ± 0.616
2.029GlnSer: 2.029 ± 1.015
3.653GlnThr: 3.653 ± 0.963
3.653GlnVal: 3.653 ± 1.038
1.218GlnTrp: 1.218 ± 0.637
0.812GlnTyr: 0.812 ± 0.455
0.0GlnXaa: 0.0 ± 0.0
Arg
2.841ArgAla: 2.841 ± 0.898
2.029ArgCys: 2.029 ± 1.808
1.623ArgAsp: 1.623 ± 1.127
2.841ArgGlu: 2.841 ± 1.616
0.812ArgPhe: 0.812 ± 0.546
2.841ArgGly: 2.841 ± 1.021
3.247ArgHis: 3.247 ± 1.028
0.812ArgIle: 0.812 ± 0.715
4.058ArgLys: 4.058 ± 0.6
8.117ArgLeu: 8.117 ± 1.629
1.218ArgMet: 1.218 ± 0.94
0.0ArgAsn: 0.0 ± 0.0
5.682ArgPro: 5.682 ± 0.964
2.841ArgGln: 2.841 ± 0.908
6.088ArgArg: 6.088 ± 1.85
2.841ArgSer: 2.841 ± 1.195
2.841ArgThr: 2.841 ± 0.776
4.464ArgVal: 4.464 ± 1.133
0.406ArgTrp: 0.406 ± 0.395
3.247ArgTyr: 3.247 ± 0.569
0.0ArgXaa: 0.0 ± 0.0
Ser
3.247SerAla: 3.247 ± 1.404
2.029SerCys: 2.029 ± 0.937
4.464SerAsp: 4.464 ± 1.392
4.058SerGlu: 4.058 ± 1.356
2.841SerPhe: 2.841 ± 1.207
6.088SerGly: 6.088 ± 2.093
1.218SerHis: 1.218 ± 0.442
2.435SerIle: 2.435 ± 1.326
1.218SerLys: 1.218 ± 0.999
5.276SerLeu: 5.276 ± 1.174
2.029SerMet: 2.029 ± 0.565
3.653SerAsn: 3.653 ± 1.356
2.841SerPro: 2.841 ± 0.865
1.623SerGln: 1.623 ± 0.56
5.276SerArg: 5.276 ± 1.9
8.523SerSer: 8.523 ± 3.227
9.334SerThr: 9.334 ± 2.727
5.276SerVal: 5.276 ± 1.495
0.812SerTrp: 0.812 ± 0.427
1.623SerTyr: 1.623 ± 0.752
0.0SerXaa: 0.0 ± 0.0
Thr
5.682ThrAla: 5.682 ± 1.851
2.841ThrCys: 2.841 ± 0.674
3.653ThrAsp: 3.653 ± 1.02
4.464ThrGlu: 4.464 ± 0.864
1.218ThrPhe: 1.218 ± 1.073
5.682ThrGly: 5.682 ± 2.057
2.029ThrHis: 2.029 ± 1.163
3.247ThrIle: 3.247 ± 1.109
3.247ThrLys: 3.247 ± 1.045
4.464ThrLeu: 4.464 ± 1.358
3.653ThrMet: 3.653 ± 1.615
2.435ThrAsn: 2.435 ± 0.886
6.899ThrPro: 6.899 ± 1.194
5.276ThrGln: 5.276 ± 1.767
2.841ThrArg: 2.841 ± 1.01
6.088ThrSer: 6.088 ± 1.293
6.088ThrThr: 6.088 ± 1.837
6.899ThrVal: 6.899 ± 1.531
0.812ThrTrp: 0.812 ± 0.789
1.218ThrTyr: 1.218 ± 0.631
0.0ThrXaa: 0.0 ± 0.0
Val
3.247ValAla: 3.247 ± 1.058
2.435ValCys: 2.435 ± 1.011
4.87ValAsp: 4.87 ± 0.739
2.435ValGlu: 2.435 ± 0.713
2.029ValPhe: 2.029 ± 0.844
3.247ValGly: 3.247 ± 1.103
2.435ValHis: 2.435 ± 0.81
2.029ValIle: 2.029 ± 0.755
2.029ValLys: 2.029 ± 0.935
4.058ValLeu: 4.058 ± 1.684
1.623ValMet: 1.623 ± 0.529
3.247ValAsn: 3.247 ± 1.054
6.899ValPro: 6.899 ± 2.376
2.841ValGln: 2.841 ± 0.801
3.653ValArg: 3.653 ± 1.141
7.711ValSer: 7.711 ± 1.792
6.088ValThr: 6.088 ± 2.308
4.87ValVal: 4.87 ± 1.353
0.812ValTrp: 0.812 ± 0.561
2.435ValTyr: 2.435 ± 0.875
0.0ValXaa: 0.0 ± 0.0
Trp
1.623TrpAla: 1.623 ± 0.581
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
1.623TrpGlu: 1.623 ± 0.875
0.812TrpPhe: 0.812 ± 0.666
0.812TrpGly: 0.812 ± 0.427
0.0TrpHis: 0.0 ± 0.0
0.812TrpIle: 0.812 ± 0.627
1.218TrpLys: 1.218 ± 0.645
1.218TrpLeu: 1.218 ± 0.403
0.406TrpMet: 0.406 ± 0.323
1.218TrpAsn: 1.218 ± 0.685
0.812TrpPro: 0.812 ± 0.561
0.406TrpGln: 0.406 ± 0.395
1.623TrpArg: 1.623 ± 0.53
0.406TrpSer: 0.406 ± 0.333
2.029TrpThr: 2.029 ± 1.43
0.812TrpVal: 0.812 ± 0.627
0.0TrpTrp: 0.0 ± 0.0
0.812TrpTyr: 0.812 ± 0.427
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.029TyrAla: 2.029 ± 0.844
0.0TyrCys: 0.0 ± 0.0
2.841TyrAsp: 2.841 ± 0.893
1.623TyrGlu: 1.623 ± 0.858
1.218TyrPhe: 1.218 ± 0.999
3.247TyrGly: 3.247 ± 1.091
1.218TyrHis: 1.218 ± 0.78
4.87TyrIle: 4.87 ± 1.014
1.623TyrLys: 1.623 ± 0.56
4.058TyrLeu: 4.058 ± 1.79
0.406TyrMet: 0.406 ± 0.313
0.812TyrAsn: 0.812 ± 0.666
1.623TyrPro: 1.623 ± 1.219
0.812TyrGln: 0.812 ± 0.37
2.841TyrArg: 2.841 ± 0.759
2.841TyrSer: 2.841 ± 1.357
2.435TyrThr: 2.435 ± 0.943
2.029TyrVal: 2.029 ± 0.8
0.812TyrTrp: 0.812 ± 0.37
2.841TyrTyr: 2.841 ± 0.872
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2465 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski