Amino acid dipepetide frequency for human papillomavirus 150

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.351AlaAla: 2.351 ± 0.925
1.567AlaCys: 1.567 ± 0.691
3.527AlaAsp: 3.527 ± 0.983
4.702AlaGlu: 4.702 ± 0.875
1.959AlaPhe: 1.959 ± 1.207
3.918AlaGly: 3.918 ± 3.634
1.176AlaHis: 1.176 ± 1.001
1.959AlaIle: 1.959 ± 0.665
1.959AlaLys: 1.959 ± 0.995
3.527AlaLeu: 3.527 ± 1.193
1.176AlaMet: 1.176 ± 0.382
1.959AlaAsn: 1.959 ± 0.918
3.527AlaPro: 3.527 ± 1.002
1.959AlaGln: 1.959 ± 1.089
5.878AlaArg: 5.878 ± 2.396
3.527AlaSer: 3.527 ± 1.055
3.135AlaThr: 3.135 ± 0.886
4.31AlaVal: 4.31 ± 1.472
0.784AlaTrp: 0.784 ± 0.409
1.567AlaTyr: 1.567 ± 0.721
0.0AlaXaa: 0.0 ± 0.0
Cys
1.176CysAla: 1.176 ± 0.58
1.567CysCys: 1.567 ± 1.542
0.392CysAsp: 0.392 ± 0.457
0.392CysGlu: 0.392 ± 0.457
1.176CysPhe: 1.176 ± 0.382
1.176CysGly: 1.176 ± 0.866
0.0CysHis: 0.0 ± 0.0
1.176CysIle: 1.176 ± 0.617
1.959CysLys: 1.959 ± 0.808
1.959CysLeu: 1.959 ± 1.091
0.392CysMet: 0.392 ± 0.334
0.784CysAsn: 0.784 ± 0.746
1.959CysPro: 1.959 ± 0.673
0.0CysGln: 0.0 ± 0.0
2.351CysArg: 2.351 ± 1.153
2.351CysSer: 2.351 ± 1.729
0.392CysThr: 0.392 ± 0.334
0.784CysVal: 0.784 ± 0.619
0.784CysTrp: 0.784 ± 0.35
1.176CysTyr: 1.176 ± 0.635
0.0CysXaa: 0.0 ± 0.0
Asp
4.702AspAla: 4.702 ± 1.391
1.176AspCys: 1.176 ± 1.001
2.743AspAsp: 2.743 ± 1.055
1.959AspGlu: 1.959 ± 0.778
0.784AspPhe: 0.784 ± 0.418
3.527AspGly: 3.527 ± 1.073
0.392AspHis: 0.392 ± 0.334
3.918AspIle: 3.918 ± 0.722
1.959AspLys: 1.959 ± 0.918
7.445AspLeu: 7.445 ± 2.165
0.784AspMet: 0.784 ± 0.35
3.135AspAsn: 3.135 ± 0.846
3.918AspPro: 3.918 ± 1.19
3.527AspGln: 3.527 ± 1.093
2.351AspArg: 2.351 ± 0.855
1.959AspSer: 1.959 ± 0.666
4.702AspThr: 4.702 ± 1.878
4.31AspVal: 4.31 ± 1.171
0.784AspTrp: 0.784 ± 0.409
1.959AspTyr: 1.959 ± 0.684
0.0AspXaa: 0.0 ± 0.0
Glu
5.094GluAla: 5.094 ± 1.476
0.784GluCys: 0.784 ± 0.667
5.094GluAsp: 5.094 ± 1.042
6.27GluGlu: 6.27 ± 1.673
1.567GluPhe: 1.567 ± 0.526
4.31GluGly: 4.31 ± 1.344
1.176GluHis: 1.176 ± 0.334
3.918GluIle: 3.918 ± 1.902
3.135GluLys: 3.135 ± 1.437
7.053GluLeu: 7.053 ± 1.932
0.392GluMet: 0.392 ± 0.334
4.702GluAsn: 4.702 ± 1.635
3.918GluPro: 3.918 ± 1.032
3.918GluGln: 3.918 ± 0.897
3.527GluArg: 3.527 ± 1.955
6.27GluSer: 6.27 ± 2.038
4.702GluThr: 4.702 ± 1.412
6.27GluVal: 6.27 ± 1.482
1.567GluTrp: 1.567 ± 0.55
1.567GluTyr: 1.567 ± 1.134
0.0GluXaa: 0.0 ± 0.0
Phe
2.351PheAla: 2.351 ± 1.29
1.959PheCys: 1.959 ± 0.887
1.959PheAsp: 1.959 ± 0.787
2.743PheGlu: 2.743 ± 1.108
1.567PhePhe: 1.567 ± 0.58
3.527PheGly: 3.527 ± 0.876
1.567PheHis: 1.567 ± 0.965
1.959PheIle: 1.959 ± 0.776
2.351PheLys: 2.351 ± 0.973
5.486PheLeu: 5.486 ± 1.093
0.392PheMet: 0.392 ± 0.334
1.176PheAsn: 1.176 ± 0.741
1.567PhePro: 1.567 ± 0.521
1.959PheGln: 1.959 ± 0.658
3.135PheArg: 3.135 ± 0.758
1.959PheSer: 1.959 ± 0.479
0.392PheThr: 0.392 ± 0.347
1.567PheVal: 1.567 ± 0.784
1.176PheTrp: 1.176 ± 0.617
1.176PheTyr: 1.176 ± 1.085
0.0PheXaa: 0.0 ± 0.0
Gly
3.527GlyAla: 3.527 ± 2.157
1.959GlyCys: 1.959 ± 0.642
3.918GlyAsp: 3.918 ± 1.313
3.918GlyGlu: 3.918 ± 0.883
1.567GlyPhe: 1.567 ± 0.551
6.661GlyGly: 6.661 ± 2.182
2.351GlyHis: 2.351 ± 0.881
2.743GlyIle: 2.743 ± 0.591
4.702GlyLys: 4.702 ± 2.161
3.135GlyLeu: 3.135 ± 1.025
0.392GlyMet: 0.392 ± 0.351
3.527GlyAsn: 3.527 ± 1.502
3.527GlyPro: 3.527 ± 1.279
2.743GlyGln: 2.743 ± 1.096
6.27GlyArg: 6.27 ± 2.77
6.661GlySer: 6.661 ± 1.285
5.094GlyThr: 5.094 ± 1.382
3.527GlyVal: 3.527 ± 0.629
0.0GlyTrp: 0.0 ± 0.0
3.135GlyTyr: 3.135 ± 1.01
0.0GlyXaa: 0.0 ± 0.0
His
0.392HisAla: 0.392 ± 0.351
1.567HisCys: 1.567 ± 0.923
0.0HisAsp: 0.0 ± 0.0
1.176HisGlu: 1.176 ± 0.514
0.784HisPhe: 0.784 ± 0.418
1.567HisGly: 1.567 ± 1.02
0.784HisHis: 0.784 ± 0.667
1.567HisIle: 1.567 ± 0.592
1.959HisLys: 1.959 ± 1.324
1.567HisLeu: 1.567 ± 0.54
0.392HisMet: 0.392 ± 0.351
1.567HisAsn: 1.567 ± 0.54
1.959HisPro: 1.959 ± 0.791
0.0HisGln: 0.0 ± 0.0
0.0HisArg: 0.0 ± 0.0
1.567HisSer: 1.567 ± 0.648
1.176HisThr: 1.176 ± 0.662
0.784HisVal: 0.784 ± 0.35
1.176HisTrp: 1.176 ± 0.382
1.567HisTyr: 1.567 ± 0.963
0.0HisXaa: 0.0 ± 0.0
Ile
2.743IleAla: 2.743 ± 0.846
1.176IleCys: 1.176 ± 0.69
3.135IleAsp: 3.135 ± 0.682
3.527IleGlu: 3.527 ± 1.266
1.959IlePhe: 1.959 ± 0.479
2.351IleGly: 2.351 ± 0.946
1.176IleHis: 1.176 ± 0.334
2.351IleIle: 2.351 ± 0.79
1.959IleLys: 1.959 ± 0.709
5.094IleLeu: 5.094 ± 0.597
1.567IleMet: 1.567 ± 0.957
2.351IleAsn: 2.351 ± 1.133
3.135IlePro: 3.135 ± 1.679
1.176IleGln: 1.176 ± 0.662
2.351IleArg: 2.351 ± 0.998
4.702IleSer: 4.702 ± 1.37
1.567IleThr: 1.567 ± 0.965
3.135IleVal: 3.135 ± 0.938
0.784IleTrp: 0.784 ± 0.576
2.743IleTyr: 2.743 ± 1.057
0.0IleXaa: 0.0 ± 0.0
Lys
3.135LysAla: 3.135 ± 0.668
0.784LysCys: 0.784 ± 0.409
2.351LysAsp: 2.351 ± 1.471
3.135LysGlu: 3.135 ± 1.275
2.743LysPhe: 2.743 ± 1.093
3.527LysGly: 3.527 ± 1.124
1.567LysHis: 1.567 ± 1.002
1.176LysIle: 1.176 ± 0.866
1.959LysLys: 1.959 ± 0.816
3.918LysLeu: 3.918 ± 1.439
1.176LysMet: 1.176 ± 0.627
2.351LysAsn: 2.351 ± 1.001
1.567LysPro: 1.567 ± 1.374
2.351LysGln: 2.351 ± 0.697
5.094LysArg: 5.094 ± 1.167
3.918LysSer: 3.918 ± 1.588
2.743LysThr: 2.743 ± 1.309
2.351LysVal: 2.351 ± 1.425
0.784LysTrp: 0.784 ± 0.517
2.351LysTyr: 2.351 ± 1.121
0.0LysXaa: 0.0 ± 0.0
Leu
4.31LeuAla: 4.31 ± 1.245
2.743LeuCys: 2.743 ± 1.096
5.878LeuAsp: 5.878 ± 1.467
9.796LeuGlu: 9.796 ± 1.852
3.135LeuPhe: 3.135 ± 0.651
5.878LeuGly: 5.878 ± 2.109
3.135LeuHis: 3.135 ± 1.185
3.527LeuIle: 3.527 ± 1.292
5.878LeuLys: 5.878 ± 0.899
11.364LeuLeu: 11.364 ± 3.622
1.176LeuMet: 1.176 ± 0.428
2.351LeuAsn: 2.351 ± 0.819
3.527LeuPro: 3.527 ± 1.055
5.486LeuGln: 5.486 ± 0.608
3.135LeuArg: 3.135 ± 0.973
8.621LeuSer: 8.621 ± 2.689
4.702LeuThr: 4.702 ± 1.163
3.527LeuVal: 3.527 ± 1.032
0.784LeuTrp: 0.784 ± 0.35
2.351LeuTyr: 2.351 ± 0.733
0.0LeuXaa: 0.0 ± 0.0
Met
2.351MetAla: 2.351 ± 0.46
0.0MetCys: 0.0 ± 0.0
1.176MetAsp: 1.176 ± 0.582
1.176MetGlu: 1.176 ± 0.382
1.959MetPhe: 1.959 ± 0.907
0.0MetGly: 0.0 ± 0.0
0.0MetHis: 0.0 ± 0.0
1.176MetIle: 1.176 ± 0.58
0.0MetLys: 0.0 ± 0.0
1.959MetLeu: 1.959 ± 0.816
0.0MetMet: 0.0 ± 0.0
0.784MetAsn: 0.784 ± 0.35
0.0MetPro: 0.0 ± 0.0
0.392MetGln: 0.392 ± 0.334
1.176MetArg: 1.176 ± 0.682
1.567MetSer: 1.567 ± 0.888
0.0MetThr: 0.0 ± 0.0
0.392MetVal: 0.392 ± 0.334
0.0MetTrp: 0.0 ± 0.0
0.392MetTyr: 0.392 ± 0.334
0.0MetXaa: 0.0 ± 0.0
Asn
2.743AsnAla: 2.743 ± 0.965
0.784AsnCys: 0.784 ± 0.409
1.567AsnAsp: 1.567 ± 0.652
2.743AsnGlu: 2.743 ± 1.86
3.135AsnPhe: 3.135 ± 1.216
3.135AsnGly: 3.135 ± 1.422
0.392AsnHis: 0.392 ± 0.351
3.135AsnIle: 3.135 ± 0.932
1.959AsnLys: 1.959 ± 0.69
3.918AsnLeu: 3.918 ± 1.826
0.784AsnMet: 0.784 ± 0.408
1.959AsnAsn: 1.959 ± 0.999
3.135AsnPro: 3.135 ± 1.41
3.135AsnGln: 3.135 ± 0.963
1.959AsnArg: 1.959 ± 0.862
2.351AsnSer: 2.351 ± 0.788
2.743AsnThr: 2.743 ± 1.8
1.567AsnVal: 1.567 ± 0.603
0.392AsnTrp: 0.392 ± 0.334
1.959AsnTyr: 1.959 ± 0.645
0.0AsnXaa: 0.0 ± 0.0
Pro
2.743ProAla: 2.743 ± 0.946
1.176ProCys: 1.176 ± 0.784
6.27ProAsp: 6.27 ± 2.426
5.486ProGlu: 5.486 ± 1.409
1.176ProPhe: 1.176 ± 0.826
1.959ProGly: 1.959 ± 1.001
0.0ProHis: 0.0 ± 0.0
1.959ProIle: 1.959 ± 0.714
2.743ProLys: 2.743 ± 0.883
4.702ProLeu: 4.702 ± 1.198
1.176ProMet: 1.176 ± 1.001
2.351ProAsn: 2.351 ± 0.658
7.053ProPro: 7.053 ± 1.896
1.567ProGln: 1.567 ± 0.724
3.135ProArg: 3.135 ± 1.384
6.661ProSer: 6.661 ± 2.508
5.486ProThr: 5.486 ± 2.316
3.527ProVal: 3.527 ± 1.226
0.392ProTrp: 0.392 ± 0.347
0.784ProTyr: 0.784 ± 0.46
0.0ProXaa: 0.0 ± 0.0
Gln
2.743GlnAla: 2.743 ± 0.845
0.392GlnCys: 0.392 ± 0.351
2.743GlnAsp: 2.743 ± 0.597
3.135GlnGlu: 3.135 ± 1.706
3.527GlnPhe: 3.527 ± 0.891
1.567GlnGly: 1.567 ± 0.521
0.784GlnHis: 0.784 ± 0.667
4.31GlnIle: 4.31 ± 0.932
1.567GlnLys: 1.567 ± 0.72
3.527GlnLeu: 3.527 ± 0.866
1.567GlnMet: 1.567 ± 0.616
1.567GlnAsn: 1.567 ± 0.609
1.567GlnPro: 1.567 ± 0.664
2.351GlnGln: 2.351 ± 1.303
2.743GlnArg: 2.743 ± 1.629
2.351GlnSer: 2.351 ± 0.971
0.784GlnThr: 0.784 ± 0.408
3.135GlnVal: 3.135 ± 1.239
0.784GlnTrp: 0.784 ± 0.409
1.176GlnTyr: 1.176 ± 0.692
0.0GlnXaa: 0.0 ± 0.0
Arg
3.918ArgAla: 3.918 ± 0.78
1.959ArgCys: 1.959 ± 1.074
3.135ArgAsp: 3.135 ± 1.096
4.31ArgGlu: 4.31 ± 0.563
3.918ArgPhe: 3.918 ± 1.396
8.229ArgGly: 8.229 ± 3.085
2.743ArgHis: 2.743 ± 1.26
1.176ArgIle: 1.176 ± 0.635
5.094ArgLys: 5.094 ± 1.273
6.661ArgLeu: 6.661 ± 1.709
0.392ArgMet: 0.392 ± 0.433
0.784ArgAsn: 0.784 ± 0.35
3.135ArgPro: 3.135 ± 0.801
2.743ArgGln: 2.743 ± 0.815
7.053ArgArg: 7.053 ± 2.576
6.661ArgSer: 6.661 ± 4.348
3.135ArgThr: 3.135 ± 1.581
3.918ArgVal: 3.918 ± 1.19
0.0ArgTrp: 0.0 ± 0.0
1.567ArgTyr: 1.567 ± 0.68
0.0ArgXaa: 0.0 ± 0.0
Ser
1.959SerAla: 1.959 ± 0.787
0.392SerCys: 0.392 ± 0.351
4.702SerAsp: 4.702 ± 1.577
5.878SerGlu: 5.878 ± 1.734
3.135SerPhe: 3.135 ± 0.552
7.445SerGly: 7.445 ± 1.817
0.784SerHis: 0.784 ± 0.746
2.743SerIle: 2.743 ± 1.12
3.135SerLys: 3.135 ± 1.559
7.445SerLeu: 7.445 ± 0.631
0.784SerMet: 0.784 ± 0.667
4.31SerAsn: 4.31 ± 1.606
6.661SerPro: 6.661 ± 1.735
2.351SerGln: 2.351 ± 0.602
10.188SerArg: 10.188 ± 3.416
7.445SerSer: 7.445 ± 2.899
5.878SerThr: 5.878 ± 1.474
3.918SerVal: 3.918 ± 1.029
1.176SerTrp: 1.176 ± 0.566
0.784SerTyr: 0.784 ± 0.416
0.0SerXaa: 0.0 ± 0.0
Thr
1.567ThrAla: 1.567 ± 1.005
1.176ThrCys: 1.176 ± 0.334
2.351ThrAsp: 2.351 ± 0.629
6.27ThrGlu: 6.27 ± 0.966
2.743ThrPhe: 2.743 ± 1.248
4.702ThrGly: 4.702 ± 1.538
0.392ThrHis: 0.392 ± 0.334
3.918ThrIle: 3.918 ± 1.428
1.567ThrLys: 1.567 ± 0.779
4.31ThrLeu: 4.31 ± 1.777
0.784ThrMet: 0.784 ± 0.35
2.351ThrAsn: 2.351 ± 0.596
5.094ThrPro: 5.094 ± 2.461
1.176ThrGln: 1.176 ± 1.029
3.135ThrArg: 3.135 ± 0.855
4.702ThrSer: 4.702 ± 1.884
4.702ThrThr: 4.702 ± 1.226
4.702ThrVal: 4.702 ± 0.848
0.392ThrTrp: 0.392 ± 0.347
1.176ThrTyr: 1.176 ± 0.692
0.0ThrXaa: 0.0 ± 0.0
Val
3.527ValAla: 3.527 ± 1.171
0.392ValCys: 0.392 ± 0.517
3.527ValAsp: 3.527 ± 0.743
5.878ValGlu: 5.878 ± 1.633
1.176ValPhe: 1.176 ± 0.334
3.918ValGly: 3.918 ± 1.25
1.567ValHis: 1.567 ± 0.58
2.743ValIle: 2.743 ± 0.964
1.567ValLys: 1.567 ± 0.521
3.135ValLeu: 3.135 ± 0.873
0.392ValMet: 0.392 ± 0.334
2.743ValAsn: 2.743 ± 1.206
3.918ValPro: 3.918 ± 1.19
3.918ValGln: 3.918 ± 0.699
5.094ValArg: 5.094 ± 0.936
5.486ValSer: 5.486 ± 0.983
3.527ValThr: 3.527 ± 1.385
3.527ValVal: 3.527 ± 1.449
0.392ValTrp: 0.392 ± 0.351
1.567ValTyr: 1.567 ± 0.7
0.0ValXaa: 0.0 ± 0.0
Trp
0.784TrpAla: 0.784 ± 0.35
0.0TrpCys: 0.0 ± 0.0
0.784TrpAsp: 0.784 ± 0.702
0.784TrpGlu: 0.784 ± 0.538
0.392TrpPhe: 0.392 ± 0.334
0.392TrpGly: 0.392 ± 0.351
0.0TrpHis: 0.0 ± 0.0
0.784TrpIle: 0.784 ± 0.667
1.959TrpLys: 1.959 ± 1.089
1.567TrpLeu: 1.567 ± 0.7
0.392TrpMet: 0.392 ± 0.334
0.392TrpAsn: 0.392 ± 0.334
0.0TrpPro: 0.0 ± 0.0
1.176TrpGln: 1.176 ± 0.605
0.0TrpArg: 0.0 ± 0.0
1.176TrpSer: 1.176 ± 0.682
0.784TrpThr: 0.784 ± 0.695
1.176TrpVal: 1.176 ± 0.661
0.0TrpTrp: 0.0 ± 0.0
0.392TrpTyr: 0.392 ± 0.334
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.351TyrAla: 2.351 ± 0.48
0.392TyrCys: 0.392 ± 0.517
0.784TyrAsp: 0.784 ± 0.409
1.959TyrGlu: 1.959 ± 1.081
1.567TyrPhe: 1.567 ± 0.919
1.567TyrGly: 1.567 ± 0.54
1.567TyrHis: 1.567 ± 0.685
2.743TyrIle: 2.743 ± 0.837
1.567TyrLys: 1.567 ± 0.737
3.527TyrLeu: 3.527 ± 0.917
0.0TyrMet: 0.0 ± 0.0
2.351TyrAsn: 2.351 ± 0.596
1.176TyrPro: 1.176 ± 0.334
0.392TyrGln: 0.392 ± 0.466
2.351TyrArg: 2.351 ± 0.502
1.176TyrSer: 1.176 ± 0.445
1.567TyrThr: 1.567 ± 0.992
1.567TyrVal: 1.567 ± 0.264
0.784TyrTrp: 0.784 ± 0.517
2.351TyrTyr: 2.351 ± 0.855
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2553 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski