Amino acid dipepetide frequency for Human papillomavirus 184

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.927AlaAla: 3.927 ± 0.742
0.436AlaCys: 0.436 ± 0.504
3.49AlaAsp: 3.49 ± 1.053
3.49AlaGlu: 3.49 ± 1.79
2.182AlaPhe: 2.182 ± 0.717
2.618AlaGly: 2.618 ± 0.84
1.309AlaHis: 1.309 ± 0.696
2.618AlaIle: 2.618 ± 0.928
3.49AlaLys: 3.49 ± 0.944
6.108AlaLeu: 6.108 ± 1.1
1.309AlaMet: 1.309 ± 0.804
3.927AlaAsn: 3.927 ± 1.255
3.054AlaPro: 3.054 ± 0.722
2.182AlaGln: 2.182 ± 0.743
3.927AlaArg: 3.927 ± 1.308
3.054AlaSer: 3.054 ± 1.395
3.49AlaThr: 3.49 ± 0.607
3.49AlaVal: 3.49 ± 1.171
0.0AlaTrp: 0.0 ± 0.0
1.745AlaTyr: 1.745 ± 0.557
0.0AlaXaa: 0.0 ± 0.0
Cys
2.618CysAla: 2.618 ± 1.316
1.745CysCys: 1.745 ± 1.149
1.745CysAsp: 1.745 ± 0.959
1.745CysGlu: 1.745 ± 0.802
1.745CysPhe: 1.745 ± 0.792
0.436CysGly: 0.436 ± 0.56
0.0CysHis: 0.0 ± 0.0
0.436CysIle: 0.436 ± 0.419
2.618CysLys: 2.618 ± 1.33
3.054CysLeu: 3.054 ± 2.642
0.0CysMet: 0.0 ± 0.0
0.436CysAsn: 0.436 ± 0.363
1.745CysPro: 1.745 ± 0.732
0.0CysGln: 0.0 ± 0.0
0.436CysArg: 0.436 ± 0.504
0.873CysSer: 0.873 ± 0.574
0.436CysThr: 0.436 ± 0.363
0.873CysVal: 0.873 ± 0.574
0.873CysTrp: 0.873 ± 0.465
0.873CysTyr: 0.873 ± 0.574
0.0CysXaa: 0.0 ± 0.0
Asp
2.182AspAla: 2.182 ± 0.656
2.182AspCys: 2.182 ± 1.052
3.49AspAsp: 3.49 ± 0.945
4.799AspGlu: 4.799 ± 1.125
3.49AspPhe: 3.49 ± 1.765
1.745AspGly: 1.745 ± 0.697
0.873AspHis: 0.873 ± 0.725
3.49AspIle: 3.49 ± 1.728
3.49AspLys: 3.49 ± 1.839
6.545AspLeu: 6.545 ± 1.746
1.745AspMet: 1.745 ± 0.802
4.799AspAsn: 4.799 ± 0.984
4.799AspPro: 4.799 ± 1.982
0.0AspGln: 0.0 ± 0.0
2.182AspArg: 2.182 ± 0.362
5.672AspSer: 5.672 ± 1.313
3.49AspThr: 3.49 ± 0.711
6.108AspVal: 6.108 ± 1.881
0.436AspTrp: 0.436 ± 0.363
1.309AspTyr: 1.309 ± 0.419
0.0AspXaa: 0.0 ± 0.0
Glu
2.618GluAla: 2.618 ± 1.396
1.309GluCys: 1.309 ± 1.088
5.236GluAsp: 5.236 ± 1.96
5.236GluGlu: 5.236 ± 2.131
1.309GluPhe: 1.309 ± 1.105
3.054GluGly: 3.054 ± 0.662
0.436GluHis: 0.436 ± 0.419
2.618GluIle: 2.618 ± 1.363
2.618GluLys: 2.618 ± 1.162
6.108GluLeu: 6.108 ± 1.122
0.436GluMet: 0.436 ± 0.363
6.981GluAsn: 6.981 ± 1.922
2.182GluPro: 2.182 ± 0.999
3.054GluGln: 3.054 ± 0.704
2.618GluArg: 2.618 ± 1.441
4.363GluSer: 4.363 ± 1.393
5.236GluThr: 5.236 ± 1.107
3.927GluVal: 3.927 ± 2.029
0.873GluTrp: 0.873 ± 0.725
1.309GluTyr: 1.309 ± 0.826
0.0GluXaa: 0.0 ± 0.0
Phe
2.618PheAla: 2.618 ± 0.987
0.873PheCys: 0.873 ± 1.009
3.054PheAsp: 3.054 ± 1.354
3.054PheGlu: 3.054 ± 1.263
2.618PhePhe: 2.618 ± 0.569
3.49PheGly: 3.49 ± 0.934
0.873PheHis: 0.873 ± 0.465
2.618PheIle: 2.618 ± 0.957
4.363PheLys: 4.363 ± 2.268
3.054PheLeu: 3.054 ± 0.368
2.182PheMet: 2.182 ± 1.385
2.182PheAsn: 2.182 ± 1.191
3.054PhePro: 3.054 ± 0.463
2.618PheGln: 2.618 ± 0.824
2.182PheArg: 2.182 ± 1.112
3.49PheSer: 3.49 ± 0.919
0.436PheThr: 0.436 ± 0.363
1.745PheVal: 1.745 ± 0.504
0.873PheTrp: 0.873 ± 0.401
2.618PheTyr: 2.618 ± 1.075
0.0PheXaa: 0.0 ± 0.0
Gly
1.745GlyAla: 1.745 ± 1.168
0.436GlyCys: 0.436 ± 0.419
4.799GlyAsp: 4.799 ± 1.313
3.927GlyGlu: 3.927 ± 1.066
2.182GlyPhe: 2.182 ± 0.59
3.49GlyGly: 3.49 ± 1.631
1.745GlyHis: 1.745 ± 0.835
3.054GlyIle: 3.054 ± 0.368
3.49GlyLys: 3.49 ± 1.174
4.363GlyLeu: 4.363 ± 1.165
0.873GlyMet: 0.873 ± 0.696
3.927GlyAsn: 3.927 ± 0.634
2.182GlyPro: 2.182 ± 0.813
1.309GlyGln: 1.309 ± 0.753
3.054GlyArg: 3.054 ± 0.722
4.363GlySer: 4.363 ± 1.116
5.672GlyThr: 5.672 ± 1.206
2.182GlyVal: 2.182 ± 1.202
0.0GlyTrp: 0.0 ± 0.0
1.745GlyTyr: 1.745 ± 0.663
0.0GlyXaa: 0.0 ± 0.0
His
0.436HisAla: 0.436 ± 0.368
0.436HisCys: 0.436 ± 0.419
0.436HisAsp: 0.436 ± 0.56
0.436HisGlu: 0.436 ± 0.348
1.745HisPhe: 1.745 ± 0.522
1.309HisGly: 1.309 ± 0.758
0.0HisHis: 0.0 ± 0.0
0.436HisIle: 0.436 ± 0.419
1.745HisLys: 1.745 ± 0.658
0.436HisLeu: 0.436 ± 0.368
0.436HisMet: 0.436 ± 0.5
0.873HisAsn: 0.873 ± 0.725
1.309HisPro: 1.309 ± 0.777
0.436HisGln: 0.436 ± 0.363
0.436HisArg: 0.436 ± 0.348
0.873HisSer: 0.873 ± 0.6
1.745HisThr: 1.745 ± 0.658
0.436HisVal: 0.436 ± 0.363
1.309HisTrp: 1.309 ± 0.608
0.436HisTyr: 0.436 ± 0.348
0.0HisXaa: 0.0 ± 0.0
Ile
3.49IleAla: 3.49 ± 1.339
0.436IleCys: 0.436 ± 0.363
3.49IleAsp: 3.49 ± 0.762
4.363IleGlu: 4.363 ± 1.523
0.873IlePhe: 0.873 ± 0.411
4.363IleGly: 4.363 ± 1.582
0.873IleHis: 0.873 ± 0.411
1.745IleIle: 1.745 ± 1.029
0.436IleLys: 0.436 ± 0.368
4.799IleLeu: 4.799 ± 1.468
0.873IleMet: 0.873 ± 0.725
3.054IleAsn: 3.054 ± 1.212
3.49IlePro: 3.49 ± 1.937
2.618IleGln: 2.618 ± 0.939
0.873IleArg: 0.873 ± 0.6
4.799IleSer: 4.799 ± 1.058
3.054IleThr: 3.054 ± 1.573
3.054IleVal: 3.054 ± 1.07
0.436IleTrp: 0.436 ± 0.363
1.309IleTyr: 1.309 ± 0.443
0.0IleXaa: 0.0 ± 0.0
Lys
3.054LysAla: 3.054 ± 0.596
2.182LysCys: 2.182 ± 0.717
2.182LysAsp: 2.182 ± 0.647
3.054LysGlu: 3.054 ± 1.065
3.054LysPhe: 3.054 ± 1.214
1.309LysGly: 1.309 ± 0.285
1.745LysHis: 1.745 ± 0.658
0.0LysIle: 0.0 ± 0.0
2.182LysLys: 2.182 ± 0.834
3.927LysLeu: 3.927 ± 1.274
0.436LysMet: 0.436 ± 0.401
3.054LysAsn: 3.054 ± 1.329
2.618LysPro: 2.618 ± 0.957
2.618LysGln: 2.618 ± 0.55
5.672LysArg: 5.672 ± 1.039
5.236LysSer: 5.236 ± 2.337
3.49LysThr: 3.49 ± 1.171
4.799LysVal: 4.799 ± 1.076
0.0LysTrp: 0.0 ± 0.0
4.363LysTyr: 4.363 ± 1.18
0.0LysXaa: 0.0 ± 0.0
Leu
4.363LeuAla: 4.363 ± 0.557
0.873LeuCys: 0.873 ± 0.686
5.236LeuAsp: 5.236 ± 1.757
5.236LeuGlu: 5.236 ± 1.148
5.672LeuPhe: 5.672 ± 1.548
6.108LeuGly: 6.108 ± 1.665
1.745LeuHis: 1.745 ± 0.889
3.927LeuIle: 3.927 ± 0.673
4.799LeuLys: 4.799 ± 1.602
5.236LeuLeu: 5.236 ± 1.319
1.745LeuMet: 1.745 ± 0.862
1.745LeuAsn: 1.745 ± 0.522
5.236LeuPro: 5.236 ± 1.035
9.599LeuGln: 9.599 ± 1.944
3.927LeuArg: 3.927 ± 1.75
6.108LeuSer: 6.108 ± 1.46
6.981LeuThr: 6.981 ± 0.783
6.545LeuVal: 6.545 ± 1.765
1.745LeuTrp: 1.745 ± 0.897
5.236LeuTyr: 5.236 ± 1.707
0.0LeuXaa: 0.0 ± 0.0
Met
0.873MetAla: 0.873 ± 0.639
0.873MetCys: 0.873 ± 0.401
2.182MetAsp: 2.182 ± 1.109
0.436MetGlu: 0.436 ± 0.363
0.436MetPhe: 0.436 ± 0.419
0.873MetGly: 0.873 ± 0.725
0.0MetHis: 0.0 ± 0.0
0.873MetIle: 0.873 ± 0.6
0.873MetLys: 0.873 ± 0.725
1.309MetLeu: 1.309 ± 0.453
0.0MetMet: 0.0 ± 0.0
1.309MetAsn: 1.309 ± 0.419
0.873MetPro: 0.873 ± 0.476
1.745MetGln: 1.745 ± 0.623
0.873MetArg: 0.873 ± 0.401
1.309MetSer: 1.309 ± 0.419
1.309MetThr: 1.309 ± 0.758
0.873MetVal: 0.873 ± 0.465
0.0MetTrp: 0.0 ± 0.0
1.745MetTyr: 1.745 ± 0.663
0.0MetXaa: 0.0 ± 0.0
Asn
3.49AsnAla: 3.49 ± 1.013
1.309AsnCys: 1.309 ± 0.57
2.182AsnAsp: 2.182 ± 1.167
3.054AsnGlu: 3.054 ± 0.821
1.745AsnPhe: 1.745 ± 0.901
2.182AsnGly: 2.182 ± 0.813
0.0AsnHis: 0.0 ± 0.0
4.363AsnIle: 4.363 ± 0.824
2.182AsnLys: 2.182 ± 0.59
4.363AsnLeu: 4.363 ± 0.887
1.309AsnMet: 1.309 ± 0.682
3.49AsnAsn: 3.49 ± 1.605
3.49AsnPro: 3.49 ± 1.435
2.182AsnGln: 2.182 ± 1.282
2.182AsnArg: 2.182 ± 1.071
3.054AsnSer: 3.054 ± 1.085
3.054AsnThr: 3.054 ± 1.07
3.49AsnVal: 3.49 ± 0.517
1.309AsnTrp: 1.309 ± 0.419
0.873AsnTyr: 0.873 ± 0.639
0.0AsnXaa: 0.0 ± 0.0
Pro
7.417ProAla: 7.417 ± 2.137
0.873ProCys: 0.873 ± 0.574
5.672ProAsp: 5.672 ± 2.234
2.618ProGlu: 2.618 ± 0.849
1.745ProPhe: 1.745 ± 0.82
1.745ProGly: 1.745 ± 0.732
0.436ProHis: 0.436 ± 0.363
3.49ProIle: 3.49 ± 1.507
3.927ProLys: 3.927 ± 0.696
7.417ProLeu: 7.417 ± 2.425
0.436ProMet: 0.436 ± 0.419
1.309ProAsn: 1.309 ± 0.736
5.672ProPro: 5.672 ± 0.808
2.182ProGln: 2.182 ± 0.49
0.873ProArg: 0.873 ± 0.401
5.236ProSer: 5.236 ± 1.752
3.927ProThr: 3.927 ± 1.662
1.309ProVal: 1.309 ± 0.826
0.0ProTrp: 0.0 ± 0.0
3.054ProTyr: 3.054 ± 1.468
0.0ProXaa: 0.0 ± 0.0
Gln
1.309GlnAla: 1.309 ± 0.712
0.0GlnCys: 0.0 ± 0.0
3.054GlnAsp: 3.054 ± 0.596
4.363GlnGlu: 4.363 ± 0.603
3.49GlnPhe: 3.49 ± 0.838
2.182GlnGly: 2.182 ± 0.743
0.436GlnHis: 0.436 ± 0.363
2.618GlnIle: 2.618 ± 1.024
1.745GlnLys: 1.745 ± 0.822
6.108GlnLeu: 6.108 ± 1.154
2.618GlnMet: 2.618 ± 1.033
1.309GlnAsn: 1.309 ± 0.989
3.49GlnPro: 3.49 ± 1.262
1.745GlnGln: 1.745 ± 0.678
1.309GlnArg: 1.309 ± 0.723
1.745GlnSer: 1.745 ± 0.732
2.182GlnThr: 2.182 ± 0.962
1.745GlnVal: 1.745 ± 1.113
0.873GlnTrp: 0.873 ± 0.465
1.309GlnTyr: 1.309 ± 0.419
0.0GlnXaa: 0.0 ± 0.0
Arg
2.182ArgAla: 2.182 ± 0.9
1.745ArgCys: 1.745 ± 1.217
2.182ArgAsp: 2.182 ± 0.706
2.182ArgGlu: 2.182 ± 0.78
1.745ArgPhe: 1.745 ± 0.608
3.49ArgGly: 3.49 ± 1.512
1.745ArgHis: 1.745 ± 0.697
0.873ArgIle: 0.873 ± 0.574
4.363ArgLys: 4.363 ± 1.197
6.545ArgLeu: 6.545 ± 1.232
1.309ArgMet: 1.309 ± 0.72
2.182ArgAsn: 2.182 ± 0.804
3.49ArgPro: 3.49 ± 2.045
1.745ArgGln: 1.745 ± 0.696
3.927ArgArg: 3.927 ± 1.51
4.799ArgSer: 4.799 ± 1.638
2.618ArgThr: 2.618 ± 1.446
2.182ArgVal: 2.182 ± 1.202
0.873ArgTrp: 0.873 ± 0.696
3.927ArgTyr: 3.927 ± 0.671
0.0ArgXaa: 0.0 ± 0.0
Ser
3.054SerAla: 3.054 ± 0.871
2.182SerCys: 2.182 ± 1.245
3.49SerAsp: 3.49 ± 1.065
3.927SerGlu: 3.927 ± 0.488
3.927SerPhe: 3.927 ± 1.414
3.927SerGly: 3.927 ± 1.179
1.745SerHis: 1.745 ± 1.127
3.927SerIle: 3.927 ± 0.821
3.49SerLys: 3.49 ± 1.013
8.726SerLeu: 8.726 ± 0.776
0.873SerMet: 0.873 ± 0.476
3.054SerAsn: 3.054 ± 1.68
2.618SerPro: 2.618 ± 1.007
3.49SerGln: 3.49 ± 1.315
3.927SerArg: 3.927 ± 1.433
5.672SerSer: 5.672 ± 1.582
8.29SerThr: 8.29 ± 2.086
4.363SerVal: 4.363 ± 1.543
0.436SerTrp: 0.436 ± 0.348
2.182SerTyr: 2.182 ± 1.365
0.0SerXaa: 0.0 ± 0.0
Thr
6.108ThrAla: 6.108 ± 2.724
0.436ThrCys: 0.436 ± 0.419
4.799ThrAsp: 4.799 ± 0.78
2.182ThrGlu: 2.182 ± 0.505
2.618ThrPhe: 2.618 ± 0.52
4.799ThrGly: 4.799 ± 1.845
0.0ThrHis: 0.0 ± 0.0
3.49ThrIle: 3.49 ± 0.756
2.182ThrLys: 2.182 ± 0.929
5.236ThrLeu: 5.236 ± 0.687
0.436ThrMet: 0.436 ± 0.348
2.618ThrAsn: 2.618 ± 0.615
3.927ThrPro: 3.927 ± 0.895
1.745ThrGln: 1.745 ± 0.886
5.672ThrArg: 5.672 ± 1.17
4.363ThrSer: 4.363 ± 0.91
5.672ThrThr: 5.672 ± 2.479
8.29ThrVal: 8.29 ± 1.663
0.436ThrTrp: 0.436 ± 0.363
3.49ThrTyr: 3.49 ± 0.854
0.0ThrXaa: 0.0 ± 0.0
Val
1.745ValAla: 1.745 ± 0.822
1.745ValCys: 1.745 ± 0.608
3.49ValAsp: 3.49 ± 1.073
4.363ValGlu: 4.363 ± 1.639
3.054ValPhe: 3.054 ± 1.108
4.799ValGly: 4.799 ± 1.662
1.309ValHis: 1.309 ± 0.443
4.363ValIle: 4.363 ± 1.36
1.745ValLys: 1.745 ± 0.802
3.927ValLeu: 3.927 ± 1.044
0.873ValMet: 0.873 ± 0.838
1.309ValAsn: 1.309 ± 0.682
3.927ValPro: 3.927 ± 1.501
2.182ValGln: 2.182 ± 0.9
6.545ValArg: 6.545 ± 1.761
5.236ValSer: 5.236 ± 2.048
4.363ValThr: 4.363 ± 0.915
2.618ValVal: 2.618 ± 0.839
1.309ValTrp: 1.309 ± 0.608
2.618ValTyr: 2.618 ± 0.777
0.0ValXaa: 0.0 ± 0.0
Trp
0.436TrpAla: 0.436 ± 0.348
0.0TrpCys: 0.0 ± 0.0
0.873TrpAsp: 0.873 ± 0.476
0.436TrpGlu: 0.436 ± 0.348
0.436TrpPhe: 0.436 ± 0.363
0.436TrpGly: 0.436 ± 0.419
0.436TrpHis: 0.436 ± 0.348
0.873TrpIle: 0.873 ± 0.725
2.182TrpLys: 2.182 ± 0.992
1.309TrpLeu: 1.309 ± 0.581
0.436TrpMet: 0.436 ± 0.419
0.436TrpAsn: 0.436 ± 0.419
0.0TrpPro: 0.0 ± 0.0
0.873TrpGln: 0.873 ± 0.401
1.309TrpArg: 1.309 ± 0.748
0.873TrpSer: 0.873 ± 0.696
0.436TrpThr: 0.436 ± 0.348
0.436TrpVal: 0.436 ± 0.363
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.182TyrAla: 2.182 ± 0.49
2.618TyrCys: 2.618 ± 1.723
1.745TyrAsp: 1.745 ± 0.682
2.618TyrGlu: 2.618 ± 0.55
3.927TyrPhe: 3.927 ± 1.529
2.182TyrGly: 2.182 ± 0.743
0.0TyrHis: 0.0 ± 0.0
2.618TyrIle: 2.618 ± 1.206
3.49TyrLys: 3.49 ± 0.799
3.49TyrLeu: 3.49 ± 0.371
0.0TyrMet: 0.0 ± 0.0
1.309TyrAsn: 1.309 ± 0.736
2.182TyrPro: 2.182 ± 0.812
1.309TyrGln: 1.309 ± 0.419
2.182TyrArg: 2.182 ± 0.49
2.182TyrSer: 2.182 ± 1.104
2.618TyrThr: 2.618 ± 0.615
3.054TyrVal: 3.054 ± 0.569
0.436TyrTrp: 0.436 ± 0.419
2.618TyrTyr: 2.618 ± 1.08
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2293 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski