Amino acid dipeptide frequency for Human papillomavirus type 203

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
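As a rough illustration of what such a calculation involves, the sketch below counts overlapping adjacent pairs in protein sequences given in one-letter code and divides by the total number of pairs. The function names and the toy sequences are hypothetical. Counting pairs only within each protein, and normalizing by the pair total, are assumptions; the exact procedure behind the table is not described on this page.

```python
from collections import Counter


def dipeptide_counts(sequences):
    """Count overlapping adjacent residue pairs within each sequence."""
    counts = Counter()
    for seq in sequences:
        # A sequence of length n contributes n - 1 overlapping pairs.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    return counts


def dipeptide_frequencies(sequences):
    """Return each observed pair's share of all counted pairs (0..1)."""
    counts = dipeptide_counts(sequences)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}


# Hypothetical toy input, not taken from the proteome on this page.
toy = ["MKKLA", "AKK"]
freqs = dipeptide_frequencies(toy)
for pair, value in sorted(freqs.items()):
    print(pair, round(1000 * value, 3), "per mille")
```

Pairs that never occur are simply absent from the returned dictionary; a full 20 x 20 (or 21 x 21) table would fill them in with zero.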

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred in proteins with equal probability, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
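The arithmetic behind the expected value can be sketched as follows. The names are hypothetical, and comparing each observed value against the uniform expectation is an assumption about how the red/blue marking is derived; the page does not spell out the exact rule.

```python
STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes


def uniform_expectation_permille(alphabet):
    """Expected per-mille frequency if every ordered pair were equally likely."""
    n_pairs = len(alphabet) ** 2
    return 1000.0 / n_pairs


def classify(observed_permille, expected_permille):
    """Label a value relative to the expectation (assumed marking rule)."""
    if observed_permille > expected_permille:
        return "over"    # would be marked red
    if observed_permille < expected_permille:
        return "under"   # would be marked blue
    return "expected"


expected_20 = uniform_expectation_permille(STANDARD)        # 400 pairs
expected_21 = uniform_expectation_permille(STANDARD + "X")  # 441 pairs, with Xaa

# Two values taken from the table: SerSer and AlaHis.
print(expected_20, round(expected_21, 3))
print(classify(10.131, expected_20), classify(0.422, expected_20))
```

With 20 amino acids the expectation is 2.5‰; including Xaa lowers it slightly, to about 2.27‰.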

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
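For readers who prefer fractions or percentages, the conversion is a single multiplication. This is a minimal sketch with hypothetical helper names; the example value is the SerSer entry from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per-mille value to percent (1 per mille = 0.1 percent)."""
    return value_permille * 0.1


ser_ser = 10.131  # SerSer frequency in per mille, from the table
print(round(permille_to_fraction(ser_ser), 6))  # about 0.010131
print(round(permille_to_percent(ser_ser), 4))   # about 1.0131 percent
```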

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.955 ± 1.652
AlaCys: 1.266 ± 1.018
AlaAsp: 5.488 ± 1.393
AlaGlu: 3.377 ± 1.764
AlaPhe: 4.221 ± 0.936
AlaGly: 2.111 ± 1.226
AlaHis: 0.422 ± 0.353
AlaIle: 2.111 ± 0.713
AlaLys: 2.111 ± 0.719
AlaLeu: 5.488 ± 0.932
AlaMet: 0.844 ± 0.392
AlaAsn: 2.533 ± 0.555
AlaPro: 3.799 ± 1.057
AlaGln: 2.111 ± 0.49
AlaArg: 4.643 ± 1.273
AlaSer: 1.266 ± 0.456
AlaThr: 4.643 ± 0.916
AlaVal: 2.111 ± 0.799
AlaTrp: 0.844 ± 0.463
AlaTyr: 2.533 ± 1.017
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.266 ± 0.801
CysCys: 1.266 ± 0.801
CysAsp: 2.111 ± 1.208
CysGlu: 1.688 ± 0.775
CysPhe: 0.844 ± 0.798
CysGly: 0.0 ± 0.0
CysHis: 0.422 ± 0.36
CysIle: 1.266 ± 1.213
CysLys: 1.688 ± 0.634
CysLeu: 3.377 ± 3.374
CysMet: 1.266 ± 0.801
CysAsn: 0.422 ± 0.718
CysPro: 1.266 ± 0.796
CysGln: 0.422 ± 0.318
CysArg: 1.266 ± 1.341
CysSer: 2.955 ± 2.226
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 0.844 ± 0.392
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.643 ± 1.273
AspCys: 2.111 ± 0.917
AspAsp: 2.955 ± 1.007
AspGlu: 5.488 ± 2.536
AspPhe: 4.221 ± 1.269
AspGly: 2.955 ± 0.822
AspHis: 0.844 ± 0.67
AspIle: 5.488 ± 1.95
AspLys: 2.955 ± 0.745
AspLeu: 4.221 ± 1.878
AspMet: 0.422 ± 0.353
AspAsn: 2.955 ± 0.797
AspPro: 4.221 ± 1.405
AspGln: 0.844 ± 0.441
AspArg: 0.844 ± 0.707
AspSer: 4.221 ± 1.731
AspThr: 5.065 ± 1.304
AspVal: 6.332 ± 1.847
AspTrp: 1.266 ± 1.081
AspTyr: 1.688 ± 0.307
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.799 ± 1.082
GluCys: 0.0 ± 0.0
GluAsp: 5.91 ± 1.166
GluGlu: 5.488 ± 1.571
GluPhe: 1.266 ± 0.369
GluGly: 4.221 ± 1.6
GluHis: 0.844 ± 0.447
GluIle: 1.688 ± 0.64
GluLys: 2.533 ± 0.863
GluLeu: 7.598 ± 1.75
GluMet: 1.266 ± 0.573
GluAsn: 5.065 ± 0.84
GluPro: 2.533 ± 0.852
GluGln: 2.533 ± 0.566
GluArg: 2.955 ± 1.106
GluSer: 5.065 ± 1.749
GluThr: 4.221 ± 1.061
GluVal: 2.533 ± 2.301
GluTrp: 0.844 ± 0.757
GluTyr: 2.533 ± 0.863
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.533 ± 1.215
PheCys: 0.844 ± 0.67
PheAsp: 2.533 ± 0.831
PheGlu: 2.533 ± 1.138
PhePhe: 2.111 ± 1.313
PheGly: 3.377 ± 1.281
PheHis: 0.422 ± 0.718
PheIle: 2.533 ± 0.836
PheLys: 1.688 ± 0.896
PheLeu: 4.643 ± 0.895
PheMet: 0.422 ± 0.36
PheAsn: 2.955 ± 0.899
PhePro: 0.844 ± 0.392
PheGln: 2.955 ± 1.298
PheArg: 3.799 ± 1.281
PheSer: 2.533 ± 0.619
PheThr: 0.844 ± 0.707
PheVal: 3.799 ± 0.861
PheTrp: 1.266 ± 0.664
PheTyr: 1.266 ± 0.456
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 0.422 ± 0.353
GlyCys: 0.844 ± 0.751
GlyAsp: 3.799 ± 1.251
GlyGlu: 4.221 ± 1.22
GlyPhe: 0.844 ± 0.392
GlyGly: 4.643 ± 2.285
GlyHis: 0.844 ± 0.707
GlyIle: 5.065 ± 1.277
GlyLys: 1.266 ± 0.635
GlyLeu: 5.065 ± 1.538
GlyMet: 0.0 ± 0.0
GlyAsn: 4.221 ± 0.941
GlyPro: 2.533 ± 1.208
GlyGln: 2.533 ± 1.255
GlyArg: 4.221 ± 1.19
GlySer: 5.065 ± 1.479
GlyThr: 5.91 ± 3.123
GlyVal: 3.377 ± 1.459
GlyTrp: 0.0 ± 0.0
GlyTyr: 1.688 ± 0.675
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.422 ± 0.353
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 1.266 ± 0.595
HisPhe: 2.111 ± 0.481
HisGly: 0.422 ± 0.592
HisHis: 0.422 ± 0.718
HisIle: 1.688 ± 1.183
HisLys: 0.422 ± 0.367
HisLeu: 1.688 ± 0.947
HisMet: 0.0 ± 0.0
HisAsn: 1.266 ± 0.66
HisPro: 1.266 ± 0.741
HisGln: 0.844 ± 0.72
HisArg: 1.688 ± 0.64
HisSer: 1.266 ± 0.66
HisThr: 1.266 ± 0.72
HisVal: 0.422 ± 0.592
HisTrp: 1.266 ± 0.787
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.688 ± 0.634
IleCys: 0.844 ± 0.392
IleAsp: 2.955 ± 1.296
IleGlu: 4.643 ± 1.184
IlePhe: 1.688 ± 0.307
IleGly: 3.377 ± 1.506
IleHis: 1.266 ± 1.08
IleIle: 4.643 ± 2.455
IleLys: 3.377 ± 2.677
IleLeu: 3.799 ± 1.985
IleMet: 1.266 ± 0.923
IleAsn: 2.111 ± 0.692
IlePro: 1.688 ± 0.571
IleGln: 0.844 ± 0.392
IleArg: 1.266 ± 0.679
IleSer: 4.221 ± 0.627
IleThr: 2.533 ± 1.321
IleVal: 4.221 ± 1.208
IleTrp: 0.0 ± 0.0
IleTyr: 2.533 ± 0.918
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.266 ± 0.654
LysCys: 1.688 ± 1.362
LysAsp: 2.111 ± 0.967
LysGlu: 2.955 ± 0.991
LysPhe: 3.799 ± 1.584
LysGly: 2.955 ± 1.073
LysHis: 2.111 ± 1.418
LysIle: 2.111 ± 1.124
LysLys: 2.111 ± 1.01
LysLeu: 5.91 ± 1.898
LysMet: 1.688 ± 1.436
LysAsn: 2.955 ± 1.16
LysPro: 0.422 ± 0.36
LysGln: 1.688 ± 0.634
LysArg: 3.799 ± 1.167
LysSer: 3.377 ± 1.197
LysThr: 2.955 ± 1.065
LysVal: 2.955 ± 0.73
LysTrp: 0.844 ± 0.78
LysTyr: 2.533 ± 0.818
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.332 ± 0.933
LeuCys: 4.221 ± 4.228
LeuAsp: 6.332 ± 3.97
LeuGlu: 5.488 ± 1.412
LeuPhe: 5.065 ± 0.942
LeuGly: 6.332 ± 2.038
LeuHis: 1.688 ± 0.752
LeuIle: 2.533 ± 0.619
LeuLys: 5.91 ± 2.176
LeuLeu: 6.332 ± 2.548
LeuMet: 0.844 ± 0.447
LeuAsn: 4.221 ± 0.619
LeuPro: 6.332 ± 1.314
LeuGln: 6.754 ± 1.44
LeuArg: 4.221 ± 0.836
LeuSer: 5.91 ± 1.146
LeuThr: 8.02 ± 0.985
LeuVal: 4.643 ± 1.432
LeuTrp: 0.844 ± 0.78
LeuTyr: 3.377 ± 1.025
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.844 ± 0.809
MetCys: 0.844 ± 0.392
MetAsp: 0.0 ± 0.0
MetGlu: 1.266 ± 0.721
MetPhe: 1.688 ± 0.995
MetGly: 0.844 ± 0.392
MetHis: 0.844 ± 0.735
MetIle: 0.422 ± 0.592
MetLys: 1.266 ± 1.158
MetLeu: 0.422 ± 0.36
MetMet: 2.533 ± 2.893
MetAsn: 1.688 ± 0.975
MetPro: 0.844 ± 0.774
MetGln: 0.0 ± 0.0
MetArg: 0.422 ± 0.752
MetSer: 1.688 ± 0.64
MetThr: 0.422 ± 0.36
MetVal: 1.266 ± 1.081
MetTrp: 0.0 ± 0.0
MetTyr: 1.266 ± 0.946
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.799 ± 1.218
AsnCys: 0.422 ± 0.318
AsnAsp: 1.688 ± 0.307
AsnGlu: 1.266 ± 1.081
AsnPhe: 1.688 ± 0.975
AsnGly: 2.955 ± 0.97
AsnHis: 0.844 ± 0.441
AsnIle: 1.688 ± 0.895
AsnLys: 2.111 ± 1.171
AsnLeu: 6.332 ± 1.051
AsnMet: 1.266 ± 0.963
AsnAsn: 2.955 ± 0.807
AsnPro: 4.221 ± 1.257
AsnGln: 2.111 ± 0.737
AsnArg: 4.221 ± 1.853
AsnSer: 5.91 ± 1.187
AsnThr: 2.955 ± 1.423
AsnVal: 4.643 ± 1.127
AsnTrp: 0.422 ± 0.353
AsnTyr: 0.422 ± 0.36
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.065 ± 1.173
ProCys: 0.844 ± 0.392
ProAsp: 5.065 ± 1.662
ProGlu: 2.111 ± 0.713
ProPhe: 1.266 ± 0.629
ProGly: 1.688 ± 0.969
ProHis: 0.0 ± 0.0
ProIle: 0.0 ± 0.0
ProLys: 4.643 ± 1.216
ProLeu: 5.91 ± 1.592
ProMet: 0.844 ± 0.463
ProAsn: 2.533 ± 1.481
ProPro: 5.065 ± 1.446
ProGln: 4.221 ± 1.209
ProArg: 2.955 ± 1.224
ProSer: 3.377 ± 0.794
ProThr: 4.643 ± 1.719
ProVal: 3.799 ± 0.784
ProTrp: 0.0 ± 0.0
ProTyr: 0.844 ± 0.707
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.844 ± 0.392
GlnCys: 1.688 ± 0.81
GlnAsp: 3.377 ± 0.897
GlnGlu: 2.111 ± 0.793
GlnPhe: 1.688 ± 0.791
GlnGly: 0.844 ± 0.441
GlnHis: 1.266 ± 1.226
GlnIle: 2.533 ± 0.939
GlnLys: 1.266 ± 0.416
GlnLeu: 5.91 ± 1.302
GlnMet: 0.844 ± 0.757
GlnAsn: 1.266 ± 0.741
GlnPro: 3.799 ± 1.148
GlnGln: 2.533 ± 1.061
GlnArg: 2.111 ± 0.814
GlnSer: 2.955 ± 0.905
GlnThr: 1.688 ± 0.68
GlnVal: 2.111 ± 0.466
GlnTrp: 0.422 ± 0.36
GlnTyr: 1.266 ± 0.416
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.065 ± 0.91
ArgCys: 2.111 ± 1.289
ArgAsp: 2.111 ± 1.095
ArgGlu: 2.533 ± 1.048
ArgPhe: 2.955 ± 1.717
ArgGly: 3.377 ± 0.711
ArgHis: 1.688 ± 1.108
ArgIle: 1.266 ± 0.721
ArgLys: 3.799 ± 1.084
ArgLeu: 7.176 ± 2.466
ArgMet: 0.422 ± 0.563
ArgAsn: 2.111 ± 0.656
ArgPro: 3.377 ± 1.255
ArgGln: 0.422 ± 0.353
ArgArg: 7.176 ± 2.488
ArgSer: 3.377 ± 1.239
ArgThr: 2.955 ± 1.062
ArgVal: 5.065 ± 1.631
ArgTrp: 0.422 ± 0.367
ArgTyr: 1.688 ± 0.62
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.955 ± 0.953
SerCys: 2.533 ± 1.654
SerAsp: 6.332 ± 1.276
SerGlu: 4.221 ± 1.306
SerPhe: 2.111 ± 0.692
SerGly: 6.332 ± 1.125
SerHis: 1.688 ± 0.745
SerIle: 2.955 ± 0.82
SerLys: 2.111 ± 0.953
SerLeu: 6.754 ± 1.38
SerMet: 0.422 ± 0.36
SerAsn: 5.065 ± 0.966
SerPro: 3.377 ± 1.15
SerGln: 2.111 ± 0.808
SerArg: 3.799 ± 0.75
SerSer: 10.131 ± 2.184
SerThr: 5.91 ± 2.376
SerVal: 5.488 ± 1.601
SerTrp: 1.266 ± 0.755
SerTyr: 2.111 ± 0.936
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.377 ± 0.956
ThrCys: 0.844 ± 0.751
ThrAsp: 3.377 ± 1.076
ThrGlu: 6.332 ± 1.323
ThrPhe: 2.111 ± 0.737
ThrGly: 4.643 ± 1.922
ThrHis: 0.422 ± 0.36
ThrIle: 5.488 ± 0.949
ThrLys: 1.266 ± 0.744
ThrLeu: 5.065 ± 1.408
ThrMet: 1.266 ± 0.946
ThrAsn: 2.111 ± 1.197
ThrPro: 2.955 ± 1.224
ThrGln: 1.266 ± 0.648
ThrArg: 4.221 ± 1.126
ThrSer: 5.488 ± 2.026
ThrThr: 5.065 ± 1.346
ThrVal: 7.176 ± 1.197
ThrTrp: 0.0 ± 0.0
ThrTyr: 2.955 ± 0.765
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.643 ± 0.746
ValCys: 0.0 ± 0.0
ValAsp: 4.221 ± 1.374
ValGlu: 4.221 ± 1.101
ValPhe: 2.111 ± 0.676
ValGly: 4.221 ± 1.143
ValHis: 0.844 ± 0.418
ValIle: 2.111 ± 0.714
ValLys: 3.377 ± 0.794
ValLeu: 4.643 ± 1.524
ValMet: 2.111 ± 0.899
ValAsn: 3.377 ± 1.282
ValPro: 4.643 ± 1.43
ValGln: 3.799 ± 1.167
ValArg: 3.377 ± 1.457
ValSer: 7.176 ± 0.617
ValThr: 4.643 ± 1.154
ValVal: 7.176 ± 1.161
ValTrp: 0.844 ± 0.392
ValTyr: 2.111 ± 0.838
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 0.844 ± 0.441
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.422 ± 0.36
TrpGly: 0.422 ± 0.367
TrpHis: 0.422 ± 0.367
TrpIle: 1.688 ± 0.858
TrpLys: 1.688 ± 1.549
TrpLeu: 1.266 ± 0.664
TrpMet: 0.422 ± 0.36
TrpAsn: 0.844 ± 0.707
TrpPro: 0.422 ± 0.353
TrpGln: 0.0 ± 0.0
TrpArg: 1.266 ± 0.787
TrpSer: 0.0 ± 0.0
TrpThr: 0.422 ± 0.367
TrpVal: 1.266 ± 0.416
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.422 ± 0.36
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.955 ± 0.658
TyrCys: 0.0 ± 0.0
TyrAsp: 2.533 ± 0.566
TyrGlu: 1.688 ± 0.928
TyrPhe: 1.688 ± 0.857
TyrGly: 0.844 ± 0.441
TyrHis: 0.422 ± 0.36
TyrIle: 1.688 ± 1.213
TyrLys: 4.643 ± 1.969
TyrLeu: 3.799 ± 0.869
TyrMet: 0.0 ± 0.0
TyrAsn: 1.688 ± 0.977
TyrPro: 1.266 ± 0.645
TyrGln: 2.533 ± 0.566
TyrArg: 0.844 ± 0.751
TyrSer: 1.688 ± 0.558
TyrThr: 1.266 ± 0.72
TyrVal: 1.266 ± 0.741
TyrTrp: 0.422 ± 0.353
TyrTyr: 2.533 ± 1.887
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2370 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
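One plausible reading of "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the standard deviation over 100 resamples is reported as the error. The function names, the toy sequences, the fixed seed, and the use of the sample standard deviation are all assumptions; the page does not describe the exact procedure.

```python
import random
import statistics
from collections import Counter


def pair_frequency_permille(proteins, pair):
    """Per-mille frequency of one dipeptide among all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency_permille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome in one-letter code, not the data from this page.
toy_proteome = ["MKKLA", "AKKS", "MSSSL"]
print(pair_frequency_permille(toy_proteome, "KK"))
print(bootstrap_error(toy_proteome, "KK"))
```

Under this scheme a dipeptide that occurs in no protein always gets an error of exactly zero, which matches the `0.0 ± 0.0` entries in the table.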

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski