Amino acid dipepetide frequency for Human papillomavirus type 53

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.255AlaAla: 1.255 ± 0.743
2.092AlaCys: 2.092 ± 0.82
2.929AlaAsp: 2.929 ± 0.958
2.092AlaGlu: 2.092 ± 0.704
2.929AlaPhe: 2.929 ± 1.057
1.255AlaGly: 1.255 ± 0.736
0.837AlaHis: 0.837 ± 0.635
3.347AlaIle: 3.347 ± 0.949
1.674AlaLys: 1.674 ± 0.943
3.766AlaLeu: 3.766 ± 0.835
0.837AlaMet: 0.837 ± 0.396
1.674AlaAsn: 1.674 ± 0.686
4.184AlaPro: 4.184 ± 1.663
3.347AlaGln: 3.347 ± 1.412
2.929AlaArg: 2.929 ± 1.451
3.766AlaSer: 3.766 ± 1.103
4.603AlaThr: 4.603 ± 0.79
4.184AlaVal: 4.184 ± 0.876
0.0AlaTrp: 0.0 ± 0.0
2.092AlaTyr: 2.092 ± 1.161
0.0AlaXaa: 0.0 ± 0.0
Cys
1.255CysAla: 1.255 ± 0.646
0.837CysCys: 0.837 ± 0.628
0.418CysAsp: 0.418 ± 0.449
1.255CysGlu: 1.255 ± 0.825
1.674CysPhe: 1.674 ± 0.842
1.674CysGly: 1.674 ± 0.749
0.837CysHis: 0.837 ± 0.628
0.837CysIle: 0.837 ± 0.55
4.184CysLys: 4.184 ± 1.403
2.51CysLeu: 2.51 ± 1.069
0.418CysMet: 0.418 ± 0.33
0.418CysAsn: 0.418 ± 0.385
2.51CysPro: 2.51 ± 0.676
2.092CysGln: 2.092 ± 0.717
2.092CysArg: 2.092 ± 1.006
1.674CysSer: 1.674 ± 1.012
0.418CysThr: 0.418 ± 0.33
2.51CysVal: 2.51 ± 1.036
1.255CysTrp: 1.255 ± 0.534
0.837CysTyr: 0.837 ± 0.735
0.0CysXaa: 0.0 ± 0.0
Asp
1.674AspAla: 1.674 ± 0.794
1.674AspCys: 1.674 ± 0.566
2.092AspAsp: 2.092 ± 0.806
5.021AspGlu: 5.021 ± 2.501
0.418AspPhe: 0.418 ± 0.379
5.021AspGly: 5.021 ± 1.146
0.837AspHis: 0.837 ± 0.717
4.603AspIle: 4.603 ± 1.145
1.255AspLys: 1.255 ± 0.366
2.929AspLeu: 2.929 ± 1.158
1.255AspMet: 1.255 ± 0.405
3.766AspAsn: 3.766 ± 0.9
3.766AspPro: 3.766 ± 1.767
1.255AspGln: 1.255 ± 1.122
1.255AspArg: 1.255 ± 0.73
4.603AspSer: 4.603 ± 1.504
3.766AspThr: 3.766 ± 1.562
5.858AspVal: 5.858 ± 2.073
1.255AspTrp: 1.255 ± 0.627
1.255AspTyr: 1.255 ± 0.604
0.0AspXaa: 0.0 ± 0.0
Glu
5.021GluAla: 5.021 ± 1.523
0.0GluCys: 0.0 ± 0.0
5.021GluAsp: 5.021 ± 1.723
3.347GluGlu: 3.347 ± 0.921
0.418GluPhe: 0.418 ± 0.33
1.255GluGly: 1.255 ± 0.651
0.418GluHis: 0.418 ± 0.374
2.51GluIle: 2.51 ± 0.879
1.674GluLys: 1.674 ± 0.841
4.184GluLeu: 4.184 ± 1.852
0.418GluMet: 0.418 ± 0.379
2.929GluAsn: 2.929 ± 1.218
1.674GluPro: 1.674 ± 0.935
3.347GluGln: 3.347 ± 1.666
2.092GluArg: 2.092 ± 0.933
5.439GluSer: 5.439 ± 0.848
4.603GluThr: 4.603 ± 1.319
4.603GluVal: 4.603 ± 1.106
0.837GluTrp: 0.837 ± 0.421
2.092GluTyr: 2.092 ± 1.033
0.0GluXaa: 0.0 ± 0.0
Phe
1.674PheAla: 1.674 ± 0.811
0.837PheCys: 0.837 ± 0.897
1.674PheAsp: 1.674 ± 0.67
2.929PheGlu: 2.929 ± 0.904
1.674PhePhe: 1.674 ± 0.532
2.51PheGly: 2.51 ± 0.731
1.674PheHis: 1.674 ± 1.012
1.255PheIle: 1.255 ± 0.754
2.929PheLys: 2.929 ± 1.161
4.603PheLeu: 4.603 ± 1.441
0.418PheMet: 0.418 ± 0.309
1.674PheAsn: 1.674 ± 1.496
1.255PhePro: 1.255 ± 0.366
1.674PheGln: 1.674 ± 0.708
0.837PheArg: 0.837 ± 0.476
2.51PheSer: 2.51 ± 1.335
1.674PheThr: 1.674 ± 1.09
1.674PheVal: 1.674 ± 0.708
0.837PheTrp: 0.837 ± 0.397
1.674PheTyr: 1.674 ± 0.769
0.0PheXaa: 0.0 ± 0.0
Gly
2.092GlyAla: 2.092 ± 0.804
1.674GlyCys: 1.674 ± 0.501
2.929GlyAsp: 2.929 ± 1.381
2.51GlyGlu: 2.51 ± 0.662
1.255GlyPhe: 1.255 ± 0.366
3.766GlyGly: 3.766 ± 1.491
2.929GlyHis: 2.929 ± 1.701
2.092GlyIle: 2.092 ± 0.381
2.929GlyLys: 2.929 ± 0.641
2.51GlyLeu: 2.51 ± 1.35
0.418GlyMet: 0.418 ± 0.33
2.929GlyAsn: 2.929 ± 0.891
1.674GlyPro: 1.674 ± 1.068
0.837GlyGln: 0.837 ± 0.476
2.092GlyArg: 2.092 ± 1.064
6.695GlySer: 6.695 ± 1.568
6.276GlyThr: 6.276 ± 2.473
5.439GlyVal: 5.439 ± 1.311
0.837GlyTrp: 0.837 ± 0.421
1.255GlyTyr: 1.255 ± 0.561
0.0GlyXaa: 0.0 ± 0.0
His
1.674HisAla: 1.674 ± 0.794
0.418HisCys: 0.418 ± 0.449
1.255HisAsp: 1.255 ± 0.412
0.418HisGlu: 0.418 ± 0.576
0.837HisPhe: 0.837 ± 0.397
1.255HisGly: 1.255 ± 0.835
0.418HisHis: 0.418 ± 0.508
1.255HisIle: 1.255 ± 1.156
2.929HisLys: 2.929 ± 1.395
1.255HisLeu: 1.255 ± 0.94
1.674HisMet: 1.674 ± 0.781
0.837HisAsn: 0.837 ± 0.397
2.092HisPro: 2.092 ± 1.077
0.837HisGln: 0.837 ± 0.556
0.837HisArg: 0.837 ± 0.757
2.929HisSer: 2.929 ± 1.441
0.837HisThr: 0.837 ± 0.565
1.674HisVal: 1.674 ± 0.93
0.837HisTrp: 0.837 ± 0.476
1.674HisTyr: 1.674 ± 0.709
0.0HisXaa: 0.0 ± 0.0
Ile
2.092IleAla: 2.092 ± 0.524
0.837IleCys: 0.837 ± 0.397
2.929IleAsp: 2.929 ± 1.166
4.184IleGlu: 4.184 ± 1.382
2.092IlePhe: 2.092 ± 0.663
3.347IleGly: 3.347 ± 1.656
0.837IleHis: 0.837 ± 0.421
2.51IleIle: 2.51 ± 1.188
2.929IleLys: 2.929 ± 0.811
2.51IleLeu: 2.51 ± 0.903
0.418IleMet: 0.418 ± 0.33
2.51IleAsn: 2.51 ± 0.507
4.184IlePro: 4.184 ± 1.451
2.092IleGln: 2.092 ± 0.567
2.092IleArg: 2.092 ± 1.13
5.439IleSer: 5.439 ± 1.825
3.347IleThr: 3.347 ± 0.903
4.184IleVal: 4.184 ± 1.011
0.418IleTrp: 0.418 ± 0.508
2.929IleTyr: 2.929 ± 1.194
0.0IleXaa: 0.0 ± 0.0
Lys
4.184LysAla: 4.184 ± 1.856
2.51LysCys: 2.51 ± 1.258
2.929LysAsp: 2.929 ± 1.008
1.255LysGlu: 1.255 ± 0.821
2.51LysPhe: 2.51 ± 1.01
1.674LysGly: 1.674 ± 0.952
2.092LysHis: 2.092 ± 1.046
2.929LysIle: 2.929 ± 0.702
3.347LysLys: 3.347 ± 1.684
3.766LysLeu: 3.766 ± 1.607
1.255LysMet: 1.255 ± 0.673
0.418LysAsn: 0.418 ± 0.33
2.092LysPro: 2.092 ± 0.622
5.021LysGln: 5.021 ± 1.068
5.858LysArg: 5.858 ± 0.818
3.347LysSer: 3.347 ± 1.275
4.603LysThr: 4.603 ± 1.323
3.347LysVal: 3.347 ± 0.815
0.418LysTrp: 0.418 ± 0.385
2.51LysTyr: 2.51 ± 0.975
0.0LysXaa: 0.0 ± 0.0
Leu
2.092LeuAla: 2.092 ± 0.967
2.929LeuCys: 2.929 ± 1.223
6.276LeuAsp: 6.276 ± 1.395
3.347LeuGlu: 3.347 ± 1.5
4.184LeuPhe: 4.184 ± 1.269
4.603LeuGly: 4.603 ± 0.647
3.347LeuHis: 3.347 ± 1.007
4.184LeuIle: 4.184 ± 1.279
3.766LeuLys: 3.766 ± 1.134
5.021LeuLeu: 5.021 ± 1.517
1.255LeuMet: 1.255 ± 0.863
2.51LeuAsn: 2.51 ± 0.945
3.347LeuPro: 3.347 ± 1.143
7.113LeuGln: 7.113 ± 1.921
5.439LeuArg: 5.439 ± 1.388
3.766LeuSer: 3.766 ± 1.239
3.347LeuThr: 3.347 ± 1.756
4.184LeuVal: 4.184 ± 1.339
0.418LeuTrp: 0.418 ± 0.379
2.51LeuTyr: 2.51 ± 0.961
0.0LeuXaa: 0.0 ± 0.0
Met
1.255MetAla: 1.255 ± 0.697
0.418MetCys: 0.418 ± 0.33
0.837MetAsp: 0.837 ± 0.508
0.837MetGlu: 0.837 ± 0.77
0.0MetPhe: 0.0 ± 0.0
1.255MetGly: 1.255 ± 0.94
0.418MetHis: 0.418 ± 0.576
0.418MetIle: 0.418 ± 0.374
0.418MetLys: 0.418 ± 0.33
1.674MetLeu: 1.674 ± 1.256
0.418MetMet: 0.418 ± 0.359
0.837MetAsn: 0.837 ± 0.397
0.418MetPro: 0.418 ± 0.33
2.51MetGln: 2.51 ± 0.86
0.837MetArg: 0.837 ± 0.635
1.255MetSer: 1.255 ± 0.697
1.674MetThr: 1.674 ± 0.882
1.255MetVal: 1.255 ± 0.366
1.255MetTrp: 1.255 ± 0.604
0.418MetTyr: 0.418 ± 0.33
0.0MetXaa: 0.0 ± 0.0
Asn
1.674AsnAla: 1.674 ± 0.842
0.837AsnCys: 0.837 ± 0.508
1.674AsnAsp: 1.674 ± 0.92
1.255AsnGlu: 1.255 ± 0.412
2.092AsnPhe: 2.092 ± 0.402
1.674AsnGly: 1.674 ± 0.794
0.0AsnHis: 0.0 ± 0.0
3.347AsnIle: 3.347 ± 0.994
1.255AsnLys: 1.255 ± 0.736
0.837AsnLeu: 0.837 ± 0.397
2.51AsnMet: 2.51 ± 0.766
3.347AsnAsn: 3.347 ± 1.783
2.929AsnPro: 2.929 ± 1.113
1.674AsnGln: 1.674 ± 0.501
2.092AsnArg: 2.092 ± 0.82
4.603AsnSer: 4.603 ± 1.478
5.858AsnThr: 5.858 ± 1.308
3.347AsnVal: 3.347 ± 0.994
0.418AsnTrp: 0.418 ± 0.33
0.418AsnTyr: 0.418 ± 0.385
0.0AsnXaa: 0.0 ± 0.0
Pro
2.51ProAla: 2.51 ± 1.335
1.255ProCys: 1.255 ± 0.94
4.603ProAsp: 4.603 ± 1.549
3.347ProGlu: 3.347 ± 1.046
1.674ProPhe: 1.674 ± 0.815
1.674ProGly: 1.674 ± 0.686
0.837ProHis: 0.837 ± 1.017
4.603ProIle: 4.603 ± 1.933
2.51ProLys: 2.51 ± 0.662
7.531ProLeu: 7.531 ± 1.897
0.837ProMet: 0.837 ± 0.407
2.092ProAsn: 2.092 ± 0.622
8.787ProPro: 8.787 ± 3.023
2.929ProGln: 2.929 ± 2.125
1.255ProArg: 1.255 ± 0.765
4.184ProSer: 4.184 ± 1.68
8.787ProThr: 8.787 ± 3.486
5.021ProVal: 5.021 ± 0.594
0.418ProTrp: 0.418 ± 0.508
2.092ProTyr: 2.092 ± 0.984
0.0ProXaa: 0.0 ± 0.0
Gln
2.092GlnAla: 2.092 ± 1.374
2.51GlnCys: 2.51 ± 1.408
2.092GlnAsp: 2.092 ± 0.996
2.929GlnGlu: 2.929 ± 1.001
2.092GlnPhe: 2.092 ± 0.996
2.929GlnGly: 2.929 ± 0.548
2.092GlnHis: 2.092 ± 0.871
2.51GlnIle: 2.51 ± 0.657
2.092GlnLys: 2.092 ± 0.561
7.113GlnLeu: 7.113 ± 1.915
1.255GlnMet: 1.255 ± 0.821
1.255GlnAsn: 1.255 ± 0.627
3.347GlnPro: 3.347 ± 0.771
4.184GlnGln: 4.184 ± 2.304
1.255GlnArg: 1.255 ± 0.781
3.347GlnSer: 3.347 ± 1.087
5.021GlnThr: 5.021 ± 1.809
2.092GlnVal: 2.092 ± 0.561
1.255GlnTrp: 1.255 ± 0.637
1.674GlnTyr: 1.674 ± 0.848
0.0GlnXaa: 0.0 ± 0.0
Arg
2.51ArgAla: 2.51 ± 0.617
1.255ArgCys: 1.255 ± 1.073
2.51ArgAsp: 2.51 ± 0.877
2.929ArgGlu: 2.929 ± 0.904
1.674ArgPhe: 1.674 ± 0.867
1.255ArgGly: 1.255 ± 0.366
2.092ArgHis: 2.092 ± 1.049
1.674ArgIle: 1.674 ± 0.842
4.184ArgLys: 4.184 ± 0.722
6.276ArgLeu: 6.276 ± 0.69
0.837ArgMet: 0.837 ± 1.059
1.255ArgAsn: 1.255 ± 0.405
3.766ArgPro: 3.766 ± 1.795
1.674ArgGln: 1.674 ± 0.842
5.021ArgArg: 5.021 ± 2.37
2.51ArgSer: 2.51 ± 0.853
3.766ArgThr: 3.766 ± 0.583
2.092ArgVal: 2.092 ± 0.865
0.0ArgTrp: 0.0 ± 0.0
2.51ArgTyr: 2.51 ± 1.133
0.0ArgXaa: 0.0 ± 0.0
Ser
5.021SerAla: 5.021 ± 2.806
1.255SerCys: 1.255 ± 0.534
2.929SerAsp: 2.929 ± 1.234
2.51SerGlu: 2.51 ± 0.902
2.092SerPhe: 2.092 ± 0.678
7.113SerGly: 7.113 ± 2.753
1.674SerHis: 1.674 ± 0.693
2.929SerIle: 2.929 ± 0.676
5.439SerLys: 5.439 ± 2.656
5.021SerLeu: 5.021 ± 0.864
1.674SerMet: 1.674 ± 1.105
4.184SerAsn: 4.184 ± 1.538
5.021SerPro: 5.021 ± 1.521
1.255SerGln: 1.255 ± 0.754
3.766SerArg: 3.766 ± 1.158
7.531SerSer: 7.531 ± 2.61
10.46SerThr: 10.46 ± 2.434
5.021SerVal: 5.021 ± 0.829
0.418SerTrp: 0.418 ± 0.379
2.092SerTyr: 2.092 ± 0.742
0.0SerXaa: 0.0 ± 0.0
Thr
4.603ThrAla: 4.603 ± 1.049
3.766ThrCys: 3.766 ± 0.583
4.603ThrAsp: 4.603 ± 1.176
5.439ThrGlu: 5.439 ± 1.467
3.766ThrPhe: 3.766 ± 2.04
4.603ThrGly: 4.603 ± 2.391
1.255ThrHis: 1.255 ± 0.603
1.674ThrIle: 1.674 ± 0.532
4.184ThrLys: 4.184 ± 1.167
5.858ThrLeu: 5.858 ± 0.858
1.255ThrMet: 1.255 ± 0.366
3.347ThrAsn: 3.347 ± 0.963
7.531ThrPro: 7.531 ± 2.316
5.858ThrGln: 5.858 ± 0.844
3.347ThrArg: 3.347 ± 1.15
5.439ThrSer: 5.439 ± 1.986
10.042ThrThr: 10.042 ± 2.993
5.021ThrVal: 5.021 ± 1.145
2.092ThrTrp: 2.092 ± 0.723
2.51ThrTyr: 2.51 ± 1.154
0.0ThrXaa: 0.0 ± 0.0
Val
3.766ValAla: 3.766 ± 1.207
2.929ValCys: 2.929 ± 1.15
4.603ValAsp: 4.603 ± 0.716
3.347ValGlu: 3.347 ± 0.989
1.674ValPhe: 1.674 ± 0.875
2.51ValGly: 2.51 ± 1.377
2.092ValHis: 2.092 ± 1.223
5.021ValIle: 5.021 ± 1.053
3.766ValLys: 3.766 ± 0.818
2.092ValLeu: 2.092 ± 0.723
0.418ValMet: 0.418 ± 0.558
3.347ValAsn: 3.347 ± 1.085
5.858ValPro: 5.858 ± 1.559
4.184ValGln: 4.184 ± 1.879
3.347ValArg: 3.347 ± 0.778
6.276ValSer: 6.276 ± 1.688
3.766ValThr: 3.766 ± 1.643
5.439ValVal: 5.439 ± 1.722
1.674ValTrp: 1.674 ± 0.952
4.184ValTyr: 4.184 ± 1.208
0.0ValXaa: 0.0 ± 0.0
Trp
1.674TrpAla: 1.674 ± 0.485
0.418TrpCys: 0.418 ± 0.385
0.418TrpAsp: 0.418 ± 0.379
1.255TrpGlu: 1.255 ± 0.626
1.255TrpPhe: 1.255 ± 0.405
0.837TrpGly: 0.837 ± 0.407
0.0TrpHis: 0.0 ± 0.0
1.255TrpIle: 1.255 ± 0.99
1.674TrpLys: 1.674 ± 0.842
0.837TrpLeu: 0.837 ± 0.397
0.0TrpMet: 0.0 ± 0.0
0.837TrpAsn: 0.837 ± 0.748
0.837TrpPro: 0.837 ± 0.757
0.0TrpGln: 0.0 ± 0.0
1.255TrpArg: 1.255 ± 0.534
0.418TrpSer: 0.418 ± 0.33
1.674TrpThr: 1.674 ± 0.841
0.418TrpVal: 0.418 ± 0.385
0.0TrpTrp: 0.0 ± 0.0
0.418TrpTyr: 0.418 ± 0.385
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.674TyrAla: 1.674 ± 0.67
1.255TyrCys: 1.255 ± 0.737
0.837TyrAsp: 0.837 ± 0.757
1.674TyrGlu: 1.674 ± 0.737
1.674TyrPhe: 1.674 ± 0.815
2.929TyrGly: 2.929 ± 1.03
1.255TyrHis: 1.255 ± 0.405
2.929TyrIle: 2.929 ± 0.856
2.929TyrLys: 2.929 ± 0.784
3.347TyrLeu: 3.347 ± 1.487
0.418TyrMet: 0.418 ± 0.508
1.674TyrAsn: 1.674 ± 0.952
1.674TyrPro: 1.674 ± 0.875
1.255TyrGln: 1.255 ± 0.637
2.092TyrArg: 2.092 ± 1.105
1.674TyrSer: 1.674 ± 0.867
1.674TyrThr: 1.674 ± 1.098
3.347TyrVal: 3.347 ± 1.336
0.837TyrTrp: 0.837 ± 0.476
3.766TyrTyr: 3.766 ± 1.659
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2391 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski