Amino acid dipepetide frequency for Human papillomavirus type 54

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.91AlaAla: 5.91 ± 1.604
2.111AlaCys: 2.111 ± 1.294
5.065AlaAsp: 5.065 ± 1.233
2.533AlaGlu: 2.533 ± 0.981
2.955AlaPhe: 2.955 ± 1.306
2.111AlaGly: 2.111 ± 1.429
0.422AlaHis: 0.422 ± 0.348
2.533AlaIle: 2.533 ± 0.689
3.377AlaLys: 3.377 ± 1.129
8.02AlaLeu: 8.02 ± 2.188
2.955AlaMet: 2.955 ± 0.607
1.266AlaAsn: 1.266 ± 0.487
3.799AlaPro: 3.799 ± 1.459
2.111AlaGln: 2.111 ± 0.862
4.221AlaArg: 4.221 ± 0.779
5.488AlaSer: 5.488 ± 0.796
5.065AlaThr: 5.065 ± 1.317
4.221AlaVal: 4.221 ± 0.666
0.0AlaTrp: 0.0 ± 0.0
1.266AlaTyr: 1.266 ± 0.673
0.0AlaXaa: 0.0 ± 0.0
Cys
2.111CysAla: 2.111 ± 1.434
0.422CysCys: 0.422 ± 0.556
1.266CysAsp: 1.266 ± 0.838
0.0CysGlu: 0.0 ± 0.0
1.688CysPhe: 1.688 ± 0.942
1.688CysGly: 1.688 ± 0.965
0.422CysHis: 0.422 ± 0.605
1.688CysIle: 1.688 ± 0.472
2.111CysLys: 2.111 ± 1.396
2.955CysLeu: 2.955 ± 1.8
0.0CysMet: 0.0 ± 0.0
1.266CysAsn: 1.266 ± 1.159
1.688CysPro: 1.688 ± 0.552
1.688CysGln: 1.688 ± 0.472
1.266CysArg: 1.266 ± 0.778
0.422CysSer: 0.422 ± 0.414
2.955CysThr: 2.955 ± 1.276
2.533CysVal: 2.533 ± 0.99
1.688CysTrp: 1.688 ± 1.137
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.533AspAla: 2.533 ± 1.188
0.844AspCys: 0.844 ± 0.369
2.533AspAsp: 2.533 ± 1.074
2.111AspGlu: 2.111 ± 0.938
2.955AspPhe: 2.955 ± 0.553
3.377AspGly: 3.377 ± 0.684
0.844AspHis: 0.844 ± 0.39
3.377AspIle: 3.377 ± 0.692
1.266AspLys: 1.266 ± 0.778
6.754AspLeu: 6.754 ± 1.705
0.844AspMet: 0.844 ± 0.369
4.643AspAsn: 4.643 ± 1.557
4.221AspPro: 4.221 ± 1.133
2.955AspGln: 2.955 ± 0.683
2.111AspArg: 2.111 ± 0.656
4.643AspSer: 4.643 ± 1.333
3.799AspThr: 3.799 ± 1.018
5.488AspVal: 5.488 ± 1.378
0.844AspTrp: 0.844 ± 0.369
1.266AspTyr: 1.266 ± 0.905
0.0AspXaa: 0.0 ± 0.0
Glu
2.533GluAla: 2.533 ± 0.964
0.844GluCys: 0.844 ± 0.696
3.799GluAsp: 3.799 ± 1.769
5.065GluGlu: 5.065 ± 1.601
0.0GluPhe: 0.0 ± 0.0
3.377GluGly: 3.377 ± 1.055
1.688GluHis: 1.688 ± 1.166
2.533GluIle: 2.533 ± 1.142
1.688GluLys: 1.688 ± 0.567
2.955GluLeu: 2.955 ± 0.766
0.422GluMet: 0.422 ± 0.414
3.799GluAsn: 3.799 ± 0.952
4.643GluPro: 4.643 ± 0.851
2.533GluGln: 2.533 ± 0.981
1.688GluArg: 1.688 ± 0.728
0.844GluSer: 0.844 ± 0.531
4.221GluThr: 4.221 ± 0.971
1.688GluVal: 1.688 ± 0.609
0.844GluTrp: 0.844 ± 0.608
2.955GluTyr: 2.955 ± 1.214
0.0GluXaa: 0.0 ± 0.0
Phe
1.266PheAla: 1.266 ± 0.674
0.844PheCys: 0.844 ± 0.593
2.533PheAsp: 2.533 ± 1.264
1.266PheGlu: 1.266 ± 0.748
2.533PhePhe: 2.533 ± 0.71
2.955PheGly: 2.955 ± 0.983
0.422PheHis: 0.422 ± 0.605
2.533PheIle: 2.533 ± 0.638
4.221PheLys: 4.221 ± 1.337
4.221PheLeu: 4.221 ± 0.801
1.266PheMet: 1.266 ± 0.607
1.266PheAsn: 1.266 ± 1.108
2.533PhePro: 2.533 ± 0.604
1.266PheGln: 1.266 ± 0.652
2.111PheArg: 2.111 ± 0.502
1.266PheSer: 1.266 ± 0.457
0.0PheThr: 0.0 ± 0.0
2.111PheVal: 2.111 ± 0.68
0.844PheTrp: 0.844 ± 0.369
0.844PheTyr: 0.844 ± 0.438
0.0PheXaa: 0.0 ± 0.0
Gly
2.955GlyAla: 2.955 ± 0.98
1.688GlyCys: 1.688 ± 0.728
3.799GlyAsp: 3.799 ± 0.765
2.955GlyGlu: 2.955 ± 1.101
2.111GlyPhe: 2.111 ± 0.639
2.955GlyGly: 2.955 ± 1.621
2.955GlyHis: 2.955 ± 0.647
3.799GlyIle: 3.799 ± 0.729
2.111GlyLys: 2.111 ± 0.939
4.643GlyLeu: 4.643 ± 1.435
1.688GlyMet: 1.688 ± 0.831
3.799GlyAsn: 3.799 ± 1.022
1.266GlyPro: 1.266 ± 0.571
2.533GlyGln: 2.533 ± 0.833
4.221GlyArg: 4.221 ± 0.636
2.533GlySer: 2.533 ± 0.566
6.754GlyThr: 6.754 ± 1.423
2.533GlyVal: 2.533 ± 0.567
0.844GlyTrp: 0.844 ± 0.475
2.111GlyTyr: 2.111 ± 0.744
0.0GlyXaa: 0.0 ± 0.0
His
1.266HisAla: 1.266 ± 0.591
1.688HisCys: 1.688 ± 1.25
0.422HisAsp: 0.422 ± 0.348
0.0HisGlu: 0.0 ± 0.0
1.688HisPhe: 1.688 ± 0.701
2.111HisGly: 2.111 ± 1.381
0.422HisHis: 0.422 ± 0.415
1.688HisIle: 1.688 ± 0.599
1.266HisLys: 1.266 ± 0.973
1.688HisLeu: 1.688 ± 1.216
0.422HisMet: 0.422 ± 0.369
1.688HisAsn: 1.688 ± 0.877
1.688HisPro: 1.688 ± 0.876
1.266HisGln: 1.266 ± 0.937
1.688HisArg: 1.688 ± 0.661
1.688HisSer: 1.688 ± 0.599
3.377HisThr: 3.377 ± 1.005
2.111HisVal: 2.111 ± 0.66
1.266HisTrp: 1.266 ± 0.78
0.422HisTyr: 0.422 ± 0.304
0.0HisXaa: 0.0 ± 0.0
Ile
3.799IleAla: 3.799 ± 1.393
0.844IleCys: 0.844 ± 0.739
2.111IleAsp: 2.111 ± 1.223
2.111IleGlu: 2.111 ± 0.854
3.377IlePhe: 3.377 ± 0.657
2.111IleGly: 2.111 ± 0.744
1.266IleHis: 1.266 ± 0.361
2.111IleIle: 2.111 ± 0.637
1.266IleLys: 1.266 ± 0.591
2.533IleLeu: 2.533 ± 0.829
1.266IleMet: 1.266 ± 0.538
1.266IleAsn: 1.266 ± 0.571
5.91IlePro: 5.91 ± 1.97
1.266IleGln: 1.266 ± 0.402
3.377IleArg: 3.377 ± 1.134
3.377IleSer: 3.377 ± 1.514
2.955IleThr: 2.955 ± 0.782
3.377IleVal: 3.377 ± 1.313
0.422IleTrp: 0.422 ± 0.415
4.221IleTyr: 4.221 ± 0.856
0.0IleXaa: 0.0 ± 0.0
Lys
3.799LysAla: 3.799 ± 1.256
2.955LysCys: 2.955 ± 1.299
2.533LysAsp: 2.533 ± 0.941
2.111LysGlu: 2.111 ± 0.923
1.266LysPhe: 1.266 ± 1.108
2.533LysGly: 2.533 ± 1.25
2.111LysHis: 2.111 ± 0.919
0.844LysIle: 0.844 ± 0.593
1.266LysLys: 1.266 ± 0.684
1.688LysLeu: 1.688 ± 0.554
1.266LysMet: 1.266 ± 0.566
2.111LysAsn: 2.111 ± 0.798
2.955LysPro: 2.955 ± 2.534
2.533LysGln: 2.533 ± 1.128
5.488LysArg: 5.488 ± 0.904
2.111LysSer: 2.111 ± 0.906
4.221LysThr: 4.221 ± 1.365
4.221LysVal: 4.221 ± 1.509
0.0LysTrp: 0.0 ± 0.0
2.533LysTyr: 2.533 ± 0.804
0.0LysXaa: 0.0 ± 0.0
Leu
5.91LeuAla: 5.91 ± 1.249
3.799LeuCys: 3.799 ± 2.875
5.065LeuAsp: 5.065 ± 0.877
6.754LeuGlu: 6.754 ± 1.408
2.111LeuPhe: 2.111 ± 0.601
6.332LeuGly: 6.332 ± 1.044
5.065LeuHis: 5.065 ± 1.926
2.533LeuIle: 2.533 ± 1.214
3.799LeuLys: 3.799 ± 0.623
6.754LeuLeu: 6.754 ± 1.799
0.844LeuMet: 0.844 ± 0.683
2.111LeuAsn: 2.111 ± 0.81
2.111LeuPro: 2.111 ± 1.118
8.02LeuGln: 8.02 ± 1.622
2.533LeuArg: 2.533 ± 0.919
4.643LeuSer: 4.643 ± 1.468
3.799LeuThr: 3.799 ± 0.767
2.955LeuVal: 2.955 ± 0.655
0.422LeuTrp: 0.422 ± 0.605
5.065LeuTyr: 5.065 ± 0.804
0.0LeuXaa: 0.0 ± 0.0
Met
1.688MetAla: 1.688 ± 0.618
0.844MetCys: 0.844 ± 0.608
3.799MetAsp: 3.799 ± 0.779
0.422MetGlu: 0.422 ± 0.414
0.844MetPhe: 0.844 ± 0.369
2.533MetGly: 2.533 ± 1.098
0.844MetHis: 0.844 ± 0.921
0.844MetIle: 0.844 ± 0.593
0.0MetLys: 0.0 ± 0.0
1.688MetLeu: 1.688 ± 0.831
0.0MetMet: 0.0 ± 0.0
0.422MetAsn: 0.422 ± 0.369
0.0MetPro: 0.0 ± 0.0
2.111MetGln: 2.111 ± 0.808
0.422MetArg: 0.422 ± 0.304
2.111MetSer: 2.111 ± 0.974
0.422MetThr: 0.422 ± 0.304
2.533MetVal: 2.533 ± 1.133
1.266MetTrp: 1.266 ± 0.554
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.799AsnAla: 3.799 ± 1.153
1.266AsnCys: 1.266 ± 0.681
1.688AsnAsp: 1.688 ± 0.554
0.844AsnGlu: 0.844 ± 0.469
1.266AsnPhe: 1.266 ± 0.673
2.955AsnGly: 2.955 ± 1.053
0.422AsnHis: 0.422 ± 0.415
2.533AsnIle: 2.533 ± 0.828
3.377AsnLys: 3.377 ± 1.604
1.266AsnLeu: 1.266 ± 1.108
0.422AsnMet: 0.422 ± 0.415
2.955AsnAsn: 2.955 ± 1.338
2.955AsnPro: 2.955 ± 0.935
1.266AsnGln: 1.266 ± 0.402
2.533AsnArg: 2.533 ± 0.657
3.377AsnSer: 3.377 ± 0.833
4.221AsnThr: 4.221 ± 0.898
1.266AsnVal: 1.266 ± 0.965
0.422AsnTrp: 0.422 ± 0.304
1.266AsnTyr: 1.266 ± 0.723
0.0AsnXaa: 0.0 ± 0.0
Pro
5.488ProAla: 5.488 ± 2.194
1.266ProCys: 1.266 ± 0.657
3.377ProAsp: 3.377 ± 1.118
2.533ProGlu: 2.533 ± 0.814
1.688ProPhe: 1.688 ± 0.716
1.266ProGly: 1.266 ± 0.487
2.111ProHis: 2.111 ± 1.229
4.643ProIle: 4.643 ± 2.201
3.377ProLys: 3.377 ± 1.134
5.488ProLeu: 5.488 ± 1.694
1.688ProMet: 1.688 ± 1.017
2.533ProAsn: 2.533 ± 0.901
8.02ProPro: 8.02 ± 1.52
1.688ProGln: 1.688 ± 0.854
3.799ProArg: 3.799 ± 1.158
6.754ProSer: 6.754 ± 3.161
3.799ProThr: 3.799 ± 1.3
4.221ProVal: 4.221 ± 1.137
0.844ProTrp: 0.844 ± 0.513
2.533ProTyr: 2.533 ± 1.006
0.0ProXaa: 0.0 ± 0.0
Gln
3.377GlnAla: 3.377 ± 0.954
0.844GlnCys: 0.844 ± 0.604
3.377GlnAsp: 3.377 ± 1.057
1.688GlnGlu: 1.688 ± 0.905
2.111GlnPhe: 2.111 ± 0.651
2.533GlnGly: 2.533 ± 0.68
0.844GlnHis: 0.844 ± 0.475
2.533GlnIle: 2.533 ± 0.687
2.533GlnLys: 2.533 ± 0.792
4.221GlnLeu: 4.221 ± 0.708
1.688GlnMet: 1.688 ± 0.504
1.266GlnAsn: 1.266 ± 0.554
3.799GlnPro: 3.799 ± 1.236
2.955GlnGln: 2.955 ± 1.101
2.111GlnArg: 2.111 ± 0.985
1.688GlnSer: 1.688 ± 0.852
5.065GlnThr: 5.065 ± 1.173
2.111GlnVal: 2.111 ± 0.583
2.111GlnTrp: 2.111 ± 0.919
2.533GlnTyr: 2.533 ± 0.87
0.0GlnXaa: 0.0 ± 0.0
Arg
5.91ArgAla: 5.91 ± 1.401
2.111ArgCys: 2.111 ± 1.258
0.422ArgAsp: 0.422 ± 0.605
2.533ArgGlu: 2.533 ± 1.285
2.955ArgPhe: 2.955 ± 0.705
2.111ArgGly: 2.111 ± 0.835
3.799ArgHis: 3.799 ± 1.773
0.844ArgIle: 0.844 ± 0.829
4.221ArgLys: 4.221 ± 0.786
5.91ArgLeu: 5.91 ± 1.194
0.844ArgMet: 0.844 ± 0.428
0.422ArgAsn: 0.422 ± 0.304
5.488ArgPro: 5.488 ± 2.043
2.111ArgGln: 2.111 ± 1.134
4.643ArgArg: 4.643 ± 1.229
1.688ArgSer: 1.688 ± 0.504
2.533ArgThr: 2.533 ± 0.529
4.221ArgVal: 4.221 ± 0.977
1.688ArgTrp: 1.688 ± 0.644
1.266ArgTyr: 1.266 ± 0.607
0.0ArgXaa: 0.0 ± 0.0
Ser
1.688SerAla: 1.688 ± 0.736
0.0SerCys: 0.0 ± 0.0
4.643SerAsp: 4.643 ± 0.963
2.111SerGlu: 2.111 ± 1.02
2.955SerPhe: 2.955 ± 1.026
5.065SerGly: 5.065 ± 1.304
1.266SerHis: 1.266 ± 0.674
5.065SerIle: 5.065 ± 1.455
3.377SerLys: 3.377 ± 1.217
5.488SerLeu: 5.488 ± 0.496
2.111SerMet: 2.111 ± 0.745
4.221SerAsn: 4.221 ± 2.09
2.533SerPro: 2.533 ± 0.953
2.955SerGln: 2.955 ± 0.828
3.377SerArg: 3.377 ± 0.858
10.553SerSer: 10.553 ± 3.096
8.864SerThr: 8.864 ± 3.033
2.533SerVal: 2.533 ± 0.903
0.422SerTrp: 0.422 ± 0.304
1.688SerTyr: 1.688 ± 0.508
0.0SerXaa: 0.0 ± 0.0
Thr
4.643ThrAla: 4.643 ± 1.47
2.111ThrCys: 2.111 ± 0.723
5.065ThrAsp: 5.065 ± 1.692
3.799ThrGlu: 3.799 ± 0.908
0.844ThrPhe: 0.844 ± 0.39
5.065ThrGly: 5.065 ± 1.03
0.422ThrHis: 0.422 ± 0.556
2.955ThrIle: 2.955 ± 1.205
2.955ThrLys: 2.955 ± 1.178
6.754ThrLeu: 6.754 ± 1.672
1.688ThrMet: 1.688 ± 0.554
1.266ThrAsn: 1.266 ± 0.673
8.442ThrPro: 8.442 ± 2.207
3.377ThrGln: 3.377 ± 1.049
3.377ThrArg: 3.377 ± 0.717
7.598ThrSer: 7.598 ± 2.586
8.02ThrThr: 8.02 ± 2.974
6.754ThrVal: 6.754 ± 1.401
0.422ThrTrp: 0.422 ± 0.414
1.688ThrTyr: 1.688 ± 0.677
0.0ThrXaa: 0.0 ± 0.0
Val
2.533ValAla: 2.533 ± 0.69
2.955ValCys: 2.955 ± 1.834
3.799ValAsp: 3.799 ± 0.677
7.598ValGlu: 7.598 ± 2.023
1.688ValPhe: 1.688 ± 0.852
2.533ValGly: 2.533 ± 1.036
1.266ValHis: 1.266 ± 0.687
1.688ValIle: 1.688 ± 0.644
1.688ValLys: 1.688 ± 0.831
2.955ValLeu: 2.955 ± 1.589
1.266ValMet: 1.266 ± 0.78
1.688ValAsn: 1.688 ± 0.504
4.221ValPro: 4.221 ± 1.324
4.643ValGln: 4.643 ± 0.952
3.799ValArg: 3.799 ± 0.923
5.91ValSer: 5.91 ± 1.107
4.643ValThr: 4.643 ± 1.228
3.799ValVal: 3.799 ± 1.44
1.266ValTrp: 1.266 ± 0.83
3.377ValTyr: 3.377 ± 1.045
0.0ValXaa: 0.0 ± 0.0
Trp
2.533TrpAla: 2.533 ± 0.932
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.844TrpGlu: 0.844 ± 0.696
0.422TrpPhe: 0.422 ± 0.304
1.688TrpGly: 1.688 ± 0.329
0.422TrpHis: 0.422 ± 0.414
0.844TrpIle: 0.844 ± 0.469
2.111TrpLys: 2.111 ± 1.182
2.111TrpLeu: 2.111 ± 0.831
0.422TrpMet: 0.422 ± 0.415
1.266TrpAsn: 1.266 ± 0.402
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
1.266TrpArg: 1.266 ± 0.591
0.422TrpSer: 0.422 ± 0.304
1.266TrpThr: 1.266 ± 0.78
0.844TrpVal: 0.844 ± 0.469
0.0TrpTrp: 0.0 ± 0.0
0.422TrpTyr: 0.422 ± 0.304
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.688TyrAla: 1.688 ± 0.508
0.422TyrCys: 0.422 ± 0.556
2.111TyrAsp: 2.111 ± 0.655
0.844TyrGlu: 0.844 ± 0.495
0.844TyrPhe: 0.844 ± 0.438
2.955TyrGly: 2.955 ± 0.84
0.844TyrHis: 0.844 ± 0.369
3.377TyrIle: 3.377 ± 1.256
2.533TyrLys: 2.533 ± 1.554
2.955TyrLeu: 2.955 ± 1.071
1.266TyrMet: 1.266 ± 0.681
0.844TyrAsn: 0.844 ± 0.675
0.844TyrPro: 0.844 ± 0.569
2.111TyrGln: 2.111 ± 0.502
1.688TyrArg: 1.688 ± 0.787
3.799TyrSer: 3.799 ± 1.156
1.266TyrThr: 1.266 ± 1.043
3.799TyrVal: 3.799 ± 0.714
1.266TyrTrp: 1.266 ± 0.591
1.688TyrTyr: 1.688 ± 1.166
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2370 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski