Amino acid dipepetide frequency for Human papillomavirus 33

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.337AlaAla: 6.337 ± 1.182
1.584AlaCys: 1.584 ± 0.725
7.921AlaAsp: 7.921 ± 1.589
1.98AlaGlu: 1.98 ± 0.878
3.168AlaPhe: 3.168 ± 0.761
3.168AlaGly: 3.168 ± 1.017
0.0AlaHis: 0.0 ± 0.0
2.376AlaIle: 2.376 ± 0.804
3.96AlaLys: 3.96 ± 0.984
4.752AlaLeu: 4.752 ± 1.551
1.188AlaMet: 1.188 ± 0.402
1.98AlaAsn: 1.98 ± 0.958
3.168AlaPro: 3.168 ± 1.224
3.168AlaGln: 3.168 ± 1.669
2.376AlaArg: 2.376 ± 0.879
2.376AlaSer: 2.376 ± 0.922
3.564AlaThr: 3.564 ± 1.393
1.98AlaVal: 1.98 ± 0.638
0.396AlaTrp: 0.396 ± 0.543
1.188AlaTyr: 1.188 ± 0.99
0.0AlaXaa: 0.0 ± 0.0
Cys
2.772CysAla: 2.772 ± 1.044
0.396CysCys: 0.396 ± 0.548
0.396CysAsp: 0.396 ± 0.33
1.98CysGlu: 1.98 ± 1.079
0.792CysPhe: 0.792 ± 0.673
1.188CysGly: 1.188 ± 0.615
0.792CysHis: 0.792 ± 0.655
1.188CysIle: 1.188 ± 0.705
2.772CysLys: 2.772 ± 1.189
2.772CysLeu: 2.772 ± 0.975
1.188CysMet: 1.188 ± 0.615
0.396CysAsn: 0.396 ± 0.548
2.376CysPro: 2.376 ± 0.547
1.188CysGln: 1.188 ± 1.002
0.792CysArg: 0.792 ± 0.66
2.376CysSer: 2.376 ± 1.035
3.168CysThr: 3.168 ± 0.755
2.376CysVal: 2.376 ± 0.942
0.792CysTrp: 0.792 ± 0.574
0.792CysTyr: 0.792 ± 0.82
0.0CysXaa: 0.0 ± 0.0
Asp
0.792AspAla: 0.792 ± 0.44
0.792AspCys: 0.792 ± 0.368
4.752AspAsp: 4.752 ± 1.947
3.168AspGlu: 3.168 ± 1.695
1.584AspPhe: 1.584 ± 0.726
2.376AspGly: 2.376 ± 0.746
0.792AspHis: 0.792 ± 0.576
3.168AspIle: 3.168 ± 0.775
1.584AspLys: 1.584 ± 0.88
7.525AspLeu: 7.525 ± 1.847
0.396AspMet: 0.396 ± 0.333
5.941AspAsn: 5.941 ± 2.381
3.96AspPro: 3.96 ± 1.068
0.792AspGln: 0.792 ± 0.425
1.584AspArg: 1.584 ± 1.032
5.149AspSer: 5.149 ± 2.282
8.317AspThr: 8.317 ± 1.971
3.96AspVal: 3.96 ± 1.694
0.792AspTrp: 0.792 ± 0.368
2.376AspTyr: 2.376 ± 0.857
0.0AspXaa: 0.0 ± 0.0
Glu
3.564GluAla: 3.564 ± 0.645
1.584GluCys: 1.584 ± 0.507
5.149GluAsp: 5.149 ± 1.459
2.772GluGlu: 2.772 ± 0.682
1.188GluPhe: 1.188 ± 0.99
1.98GluGly: 1.98 ± 0.732
0.792GluHis: 0.792 ± 0.449
2.376GluIle: 2.376 ± 0.757
3.168GluLys: 3.168 ± 1.015
3.564GluLeu: 3.564 ± 0.695
0.792GluMet: 0.792 ± 0.561
3.168GluAsn: 3.168 ± 1.823
1.98GluPro: 1.98 ± 0.538
2.772GluGln: 2.772 ± 1.263
0.396GluArg: 0.396 ± 0.33
3.168GluSer: 3.168 ± 0.627
4.752GluThr: 4.752 ± 1.188
3.168GluVal: 3.168 ± 0.86
0.396GluTrp: 0.396 ± 0.33
1.98GluTyr: 1.98 ± 1.198
0.0GluXaa: 0.0 ± 0.0
Phe
1.584PheAla: 1.584 ± 0.899
1.188PheCys: 1.188 ± 0.964
2.772PheAsp: 2.772 ± 1.2
1.188PheGlu: 1.188 ± 0.664
3.564PhePhe: 3.564 ± 1.374
2.376PheGly: 2.376 ± 0.945
0.792PheHis: 0.792 ± 0.717
1.98PheIle: 1.98 ± 0.908
3.96PheLys: 3.96 ± 1.481
5.149PheLeu: 5.149 ± 1.024
1.188PheMet: 1.188 ± 0.628
1.584PheAsn: 1.584 ± 0.737
2.772PhePro: 2.772 ± 1.159
1.584PheGln: 1.584 ± 0.608
1.188PheArg: 1.188 ± 0.615
2.376PheSer: 2.376 ± 0.701
1.188PheThr: 1.188 ± 0.633
4.752PheVal: 4.752 ± 2.019
0.792PheTrp: 0.792 ± 0.368
0.396PheTyr: 0.396 ± 0.333
0.0PheXaa: 0.0 ± 0.0
Gly
2.376GlyAla: 2.376 ± 0.942
1.584GlyCys: 1.584 ± 0.737
1.98GlyAsp: 1.98 ± 0.958
3.564GlyGlu: 3.564 ± 0.861
2.772GlyPhe: 2.772 ± 0.979
1.98GlyGly: 1.98 ± 0.789
1.98GlyHis: 1.98 ± 1.109
4.356GlyIle: 4.356 ± 1.045
1.98GlyLys: 1.98 ± 0.56
3.564GlyLeu: 3.564 ± 1.564
1.584GlyMet: 1.584 ± 0.954
3.564GlyAsn: 3.564 ± 1.592
1.188GlyPro: 1.188 ± 0.633
2.376GlyGln: 2.376 ± 0.656
1.98GlyArg: 1.98 ± 1.108
3.96GlySer: 3.96 ± 1.862
4.752GlyThr: 4.752 ± 0.783
1.584GlyVal: 1.584 ± 0.618
0.396GlyTrp: 0.396 ± 0.33
1.188GlyTyr: 1.188 ± 0.664
0.0GlyXaa: 0.0 ± 0.0
His
0.792HisAla: 0.792 ± 0.625
0.0HisCys: 0.0 ± 0.0
0.792HisAsp: 0.792 ± 0.508
0.0HisGlu: 0.0 ± 0.0
1.188HisPhe: 1.188 ± 0.4
0.396HisGly: 0.396 ± 0.33
0.396HisHis: 0.396 ± 0.535
0.0HisIle: 0.0 ± 0.0
1.584HisLys: 1.584 ± 0.946
2.376HisLeu: 2.376 ± 0.983
0.396HisMet: 0.396 ± 0.543
1.584HisAsn: 1.584 ± 1.004
1.584HisPro: 1.584 ± 0.849
0.396HisGln: 0.396 ± 0.403
0.792HisArg: 0.792 ± 0.399
2.376HisSer: 2.376 ± 1.328
1.584HisThr: 1.584 ± 0.993
1.584HisVal: 1.584 ± 0.827
1.584HisTrp: 1.584 ± 0.906
1.188HisTyr: 1.188 ± 0.506
0.0HisXaa: 0.0 ± 0.0
Ile
1.188IleAla: 1.188 ± 0.633
1.98IleCys: 1.98 ± 1.06
2.772IleAsp: 2.772 ± 1.878
4.752IleGlu: 4.752 ± 1.071
1.584IlePhe: 1.584 ± 1.027
2.772IleGly: 2.772 ± 0.89
1.584IleHis: 1.584 ± 0.608
2.772IleIle: 2.772 ± 0.594
1.188IleLys: 1.188 ± 0.62
3.96IleLeu: 3.96 ± 1.053
0.396IleMet: 0.396 ± 0.33
2.376IleAsn: 2.376 ± 0.723
4.356IlePro: 4.356 ± 1.93
3.564IleGln: 3.564 ± 0.922
1.98IleArg: 1.98 ± 0.453
8.713IleSer: 8.713 ± 2.071
1.98IleThr: 1.98 ± 0.637
4.752IleVal: 4.752 ± 1.673
0.0IleTrp: 0.0 ± 0.0
0.792IleTyr: 0.792 ± 0.449
0.0IleXaa: 0.0 ± 0.0
Lys
2.376LysAla: 2.376 ± 0.664
1.98LysCys: 1.98 ± 0.887
0.396LysAsp: 0.396 ± 0.33
4.356LysGlu: 4.356 ± 1.04
2.772LysPhe: 2.772 ± 1.321
2.376LysGly: 2.376 ± 0.59
0.396LysHis: 0.396 ± 0.33
3.168LysIle: 3.168 ± 1.164
4.356LysLys: 4.356 ± 1.443
4.752LysLeu: 4.752 ± 1.12
1.188LysMet: 1.188 ± 0.593
3.96LysAsn: 3.96 ± 1.191
3.564LysPro: 3.564 ± 1.876
3.168LysGln: 3.168 ± 1.112
4.752LysArg: 4.752 ± 0.823
3.168LysSer: 3.168 ± 1.775
3.168LysThr: 3.168 ± 1.015
3.564LysVal: 3.564 ± 1.229
0.396LysTrp: 0.396 ± 0.624
2.772LysTyr: 2.772 ± 1.079
0.0LysXaa: 0.0 ± 0.0
Leu
1.188LeuAla: 1.188 ± 0.386
4.752LeuCys: 4.752 ± 2.243
4.356LeuAsp: 4.356 ± 1.137
3.96LeuGlu: 3.96 ± 1.121
3.168LeuPhe: 3.168 ± 1.182
4.356LeuGly: 4.356 ± 1.425
3.168LeuHis: 3.168 ± 1.211
4.356LeuIle: 4.356 ± 1.22
8.713LeuLys: 8.713 ± 1.999
9.109LeuLeu: 9.109 ± 3.561
1.98LeuMet: 1.98 ± 1.112
3.168LeuAsn: 3.168 ± 0.848
1.584LeuPro: 1.584 ± 0.851
7.921LeuGln: 7.921 ± 1.657
3.96LeuArg: 3.96 ± 0.847
4.356LeuSer: 4.356 ± 1.24
5.941LeuThr: 5.941 ± 1.79
2.376LeuVal: 2.376 ± 1.191
0.396LeuTrp: 0.396 ± 0.543
5.545LeuTyr: 5.545 ± 1.297
0.0LeuXaa: 0.0 ± 0.0
Met
1.584MetAla: 1.584 ± 0.896
0.792MetCys: 0.792 ± 0.586
1.584MetAsp: 1.584 ± 0.704
1.584MetGlu: 1.584 ± 0.88
1.584MetPhe: 1.584 ± 0.869
1.188MetGly: 1.188 ± 0.623
0.0MetHis: 0.0 ± 0.0
0.792MetIle: 0.792 ± 0.586
0.0MetLys: 0.0 ± 0.0
0.396MetLeu: 0.396 ± 0.33
0.396MetMet: 0.396 ± 0.543
0.396MetAsn: 0.396 ± 0.333
0.0MetPro: 0.0 ± 0.0
0.396MetGln: 0.396 ± 0.309
1.188MetArg: 1.188 ± 0.574
3.168MetSer: 3.168 ± 0.903
2.376MetThr: 2.376 ± 1.214
2.376MetVal: 2.376 ± 0.84
0.396MetTrp: 0.396 ± 0.403
0.396MetTyr: 0.396 ± 0.403
0.0MetXaa: 0.0 ± 0.0
Asn
2.772AsnAla: 2.772 ± 1.105
1.188AsnCys: 1.188 ± 0.958
3.96AsnAsp: 3.96 ± 0.899
1.98AsnGlu: 1.98 ± 0.614
0.792AsnPhe: 0.792 ± 0.625
2.376AsnGly: 2.376 ± 0.759
0.396AsnHis: 0.396 ± 0.33
4.356AsnIle: 4.356 ± 2.155
4.752AsnLys: 4.752 ± 1.526
0.792AsnLeu: 0.792 ± 0.66
0.396AsnMet: 0.396 ± 0.333
0.396AsnAsn: 0.396 ± 0.333
3.564AsnPro: 3.564 ± 1.021
1.188AsnGln: 1.188 ± 0.682
2.772AsnArg: 2.772 ± 1.179
3.564AsnSer: 3.564 ± 1.501
4.356AsnThr: 4.356 ± 1.197
3.564AsnVal: 3.564 ± 1.166
1.188AsnTrp: 1.188 ± 0.665
0.396AsnTyr: 0.396 ± 0.478
0.0AsnXaa: 0.0 ± 0.0
Pro
5.941ProAla: 5.941 ± 2.228
0.396ProCys: 0.396 ± 0.33
4.356ProAsp: 4.356 ± 2.215
2.376ProGlu: 2.376 ± 1.028
2.772ProPhe: 2.772 ± 1.125
1.188ProGly: 1.188 ± 0.698
0.396ProHis: 0.396 ± 0.309
3.96ProIle: 3.96 ± 1.235
2.772ProLys: 2.772 ± 0.799
8.713ProLeu: 8.713 ± 2.259
1.188ProMet: 1.188 ± 0.717
1.188ProAsn: 1.188 ± 0.597
7.525ProPro: 7.525 ± 1.72
1.584ProGln: 1.584 ± 1.196
0.396ProArg: 0.396 ± 0.478
4.752ProSer: 4.752 ± 1.496
5.941ProThr: 5.941 ± 2.224
2.376ProVal: 2.376 ± 0.975
0.396ProTrp: 0.396 ± 0.535
2.772ProTyr: 2.772 ± 0.762
0.0ProXaa: 0.0 ± 0.0
Gln
3.564GlnAla: 3.564 ± 0.959
0.792GlnCys: 0.792 ± 0.52
1.188GlnAsp: 1.188 ± 0.577
1.98GlnGlu: 1.98 ± 0.811
1.98GlnPhe: 1.98 ± 0.909
1.188GlnGly: 1.188 ± 0.386
2.376GlnHis: 2.376 ± 0.885
2.772GlnIle: 2.772 ± 1.592
0.792GlnLys: 0.792 ± 0.425
3.96GlnLeu: 3.96 ± 1.322
3.168GlnMet: 3.168 ± 1.502
0.792GlnAsn: 0.792 ± 0.66
3.168GlnPro: 3.168 ± 0.723
5.149GlnGln: 5.149 ± 2.448
2.772GlnArg: 2.772 ± 1.526
1.98GlnSer: 1.98 ± 0.585
5.149GlnThr: 5.149 ± 2.821
2.772GlnVal: 2.772 ± 1.072
1.188GlnTrp: 1.188 ± 0.665
1.584GlnTyr: 1.584 ± 0.816
0.0GlnXaa: 0.0 ± 0.0
Arg
2.772ArgAla: 2.772 ± 1.05
1.584ArgCys: 1.584 ± 1.041
0.0ArgAsp: 0.0 ± 0.0
1.584ArgGlu: 1.584 ± 1.147
2.376ArgPhe: 2.376 ± 0.934
1.188ArgGly: 1.188 ± 0.595
2.772ArgHis: 2.772 ± 1.024
0.396ArgIle: 0.396 ± 0.309
3.168ArgLys: 3.168 ± 1.195
4.356ArgLeu: 4.356 ± 0.683
0.396ArgMet: 0.396 ± 0.403
0.792ArgAsn: 0.792 ± 0.399
5.545ArgPro: 5.545 ± 1.48
0.396ArgGln: 0.396 ± 0.535
5.149ArgArg: 5.149 ± 1.623
3.168ArgSer: 3.168 ± 1.063
6.337ArgThr: 6.337 ± 1.406
1.584ArgVal: 1.584 ± 0.849
0.792ArgTrp: 0.792 ± 0.52
1.188ArgTyr: 1.188 ± 0.628
0.0ArgXaa: 0.0 ± 0.0
Ser
3.564SerAla: 3.564 ± 1.109
2.376SerCys: 2.376 ± 1.591
4.752SerAsp: 4.752 ± 1.587
3.168SerGlu: 3.168 ± 1.294
2.376SerPhe: 2.376 ± 0.947
5.941SerGly: 5.941 ± 1.712
0.792SerHis: 0.792 ± 0.44
5.941SerIle: 5.941 ± 1.539
3.564SerLys: 3.564 ± 1.386
5.149SerLeu: 5.149 ± 0.899
2.376SerMet: 2.376 ± 0.602
6.337SerAsn: 6.337 ± 2.323
3.96SerPro: 3.96 ± 2.031
3.564SerGln: 3.564 ± 1.27
4.356SerArg: 4.356 ± 1.337
9.109SerSer: 9.109 ± 2.636
9.109SerThr: 9.109 ± 2.533
2.376SerVal: 2.376 ± 0.749
0.396SerTrp: 0.396 ± 0.33
1.98SerTyr: 1.98 ± 1.207
0.0SerXaa: 0.0 ± 0.0
Thr
7.525ThrAla: 7.525 ± 2.814
4.356ThrCys: 4.356 ± 1.495
5.941ThrAsp: 5.941 ± 1.243
3.564ThrGlu: 3.564 ± 1.072
2.772ThrPhe: 2.772 ± 0.754
5.545ThrGly: 5.545 ± 0.925
0.792ThrHis: 0.792 ± 0.399
3.168ThrIle: 3.168 ± 1.508
1.98ThrLys: 1.98 ± 1.048
5.941ThrLeu: 5.941 ± 1.53
0.792ThrMet: 0.792 ± 0.806
5.149ThrAsn: 5.149 ± 1.278
4.752ThrPro: 4.752 ± 1.764
3.564ThrGln: 3.564 ± 1.06
2.376ThrArg: 2.376 ± 0.975
10.297ThrSer: 10.297 ± 2.658
5.149ThrThr: 5.149 ± 1.46
9.901ThrVal: 9.901 ± 2.532
1.188ThrTrp: 1.188 ± 0.628
3.168ThrTyr: 3.168 ± 0.952
0.0ThrXaa: 0.0 ± 0.0
Val
2.772ValAla: 2.772 ± 0.478
1.98ValCys: 1.98 ± 1.013
4.356ValAsp: 4.356 ± 1.136
2.376ValGlu: 2.376 ± 0.852
2.772ValPhe: 2.772 ± 1.278
3.564ValGly: 3.564 ± 1.06
1.584ValHis: 1.584 ± 0.939
2.376ValIle: 2.376 ± 1.05
2.772ValLys: 2.772 ± 0.521
4.356ValLeu: 4.356 ± 2.098
0.792ValMet: 0.792 ± 0.425
1.188ValAsn: 1.188 ± 1.079
3.96ValPro: 3.96 ± 1.882
3.96ValGln: 3.96 ± 1.583
2.376ValArg: 2.376 ± 0.815
3.96ValSer: 3.96 ± 1.785
7.525ValThr: 7.525 ± 1.972
3.564ValVal: 3.564 ± 0.875
1.188ValTrp: 1.188 ± 0.681
2.772ValTyr: 2.772 ± 1.288
0.0ValXaa: 0.0 ± 0.0
Trp
1.188TrpAla: 1.188 ± 0.536
0.792TrpCys: 0.792 ± 0.66
0.0TrpAsp: 0.0 ± 0.0
0.792TrpGlu: 0.792 ± 0.449
0.792TrpPhe: 0.792 ± 0.66
1.188TrpGly: 1.188 ± 0.681
0.792TrpHis: 0.792 ± 0.701
0.792TrpIle: 0.792 ± 0.66
1.188TrpLys: 1.188 ± 0.628
1.584TrpLeu: 1.584 ± 0.604
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.396TrpPro: 0.396 ± 0.33
0.396TrpGln: 0.396 ± 0.333
1.188TrpArg: 1.188 ± 0.536
0.0TrpSer: 0.0 ± 0.0
1.98TrpThr: 1.98 ± 1.048
0.396TrpVal: 0.396 ± 0.543
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.98TyrAla: 1.98 ± 0.659
0.396TyrCys: 0.396 ± 0.548
2.376TyrAsp: 2.376 ± 0.587
1.584TyrGlu: 1.584 ± 0.821
2.376TyrPhe: 2.376 ± 0.827
2.376TyrGly: 2.376 ± 0.852
0.0TyrHis: 0.0 ± 0.0
3.168TyrIle: 3.168 ± 1.24
2.376TyrLys: 2.376 ± 0.59
1.98TyrLeu: 1.98 ± 1.176
0.396TyrMet: 0.396 ± 0.33
0.792TyrAsn: 0.792 ± 0.574
1.584TyrPro: 1.584 ± 1.016
1.188TyrGln: 1.188 ± 0.633
3.168TyrArg: 3.168 ± 1.355
3.168TyrSer: 3.168 ± 1.456
1.584TyrThr: 1.584 ± 0.698
1.188TyrVal: 1.188 ± 0.642
0.792TyrTrp: 0.792 ± 0.368
2.772TyrTyr: 2.772 ± 0.628
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (2526 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski