Amino acid dipepetide frequency for human papillomavirus 108

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.53AlaAla: 6.53 ± 2.921
1.399AlaCys: 1.399 ± 0.793
5.131AlaAsp: 5.131 ± 1.28
4.664AlaGlu: 4.664 ± 1.356
2.332AlaPhe: 2.332 ± 0.956
1.866AlaGly: 1.866 ± 1.187
0.0AlaHis: 0.0 ± 0.0
2.332AlaIle: 2.332 ± 0.561
3.731AlaLys: 3.731 ± 0.788
4.664AlaLeu: 4.664 ± 0.868
1.399AlaMet: 1.399 ± 1.054
0.466AlaAsn: 0.466 ± 0.355
2.332AlaPro: 2.332 ± 0.798
2.799AlaGln: 2.799 ± 0.589
2.799AlaArg: 2.799 ± 1.57
7.463AlaSer: 7.463 ± 1.113
4.664AlaThr: 4.664 ± 0.856
2.799AlaVal: 2.799 ± 1.48
0.466AlaTrp: 0.466 ± 0.355
1.399AlaTyr: 1.399 ± 0.634
0.0AlaXaa: 0.0 ± 0.0
Cys
1.399CysAla: 1.399 ± 0.928
0.466CysCys: 0.466 ± 0.355
1.399CysAsp: 1.399 ± 0.634
0.466CysGlu: 0.466 ± 0.395
0.933CysPhe: 0.933 ± 0.71
1.866CysGly: 1.866 ± 2.496
0.933CysHis: 0.933 ± 0.853
0.933CysIle: 0.933 ± 0.71
1.866CysLys: 1.866 ± 0.697
1.399CysLeu: 1.399 ± 1.066
0.466CysMet: 0.466 ± 0.846
0.466CysAsn: 0.466 ± 0.355
2.332CysPro: 2.332 ± 1.668
0.0CysGln: 0.0 ± 0.0
0.466CysArg: 0.466 ± 0.355
0.0CysSer: 0.0 ± 0.0
0.466CysThr: 0.466 ± 0.395
1.866CysVal: 1.866 ± 1.037
1.866CysTrp: 1.866 ± 0.697
0.466CysTyr: 0.466 ± 0.355
0.0CysXaa: 0.0 ± 0.0
Asp
3.731AspAla: 3.731 ± 0.889
0.933AspCys: 0.933 ± 0.71
1.866AspAsp: 1.866 ± 0.595
1.866AspGlu: 1.866 ± 1.228
1.866AspPhe: 1.866 ± 0.604
2.799AspGly: 2.799 ± 0.844
0.466AspHis: 0.466 ± 0.395
5.131AspIle: 5.131 ± 1.917
3.265AspLys: 3.265 ± 1.255
6.53AspLeu: 6.53 ± 1.831
0.466AspMet: 0.466 ± 0.395
2.332AspAsn: 2.332 ± 0.758
6.53AspPro: 6.53 ± 1.995
0.0AspGln: 0.0 ± 0.0
0.0AspArg: 0.0 ± 0.0
6.063AspSer: 6.063 ± 0.557
7.929AspThr: 7.929 ± 1.774
4.664AspVal: 4.664 ± 1.075
1.866AspTrp: 1.866 ± 0.95
0.466AspTyr: 0.466 ± 0.395
0.0AspXaa: 0.0 ± 0.0
Glu
5.131GluAla: 5.131 ± 2.615
1.399GluCys: 1.399 ± 0.996
2.799GluAsp: 2.799 ± 0.95
5.597GluGlu: 5.597 ± 2.343
0.933GluPhe: 0.933 ± 0.391
4.664GluGly: 4.664 ± 1.283
0.466GluHis: 0.466 ± 0.395
0.0GluIle: 0.0 ± 0.0
1.866GluLys: 1.866 ± 0.917
3.265GluLeu: 3.265 ± 0.823
1.399GluMet: 1.399 ± 1.066
4.198GluAsn: 4.198 ± 1.158
3.265GluPro: 3.265 ± 0.655
5.597GluGln: 5.597 ± 2.413
3.265GluArg: 3.265 ± 1.339
6.063GluSer: 6.063 ± 1.319
4.198GluThr: 4.198 ± 1.299
3.265GluVal: 3.265 ± 1.456
0.0GluTrp: 0.0 ± 0.0
0.466GluTyr: 0.466 ± 0.355
0.0GluXaa: 0.0 ± 0.0
Phe
2.332PheAla: 2.332 ± 0.423
0.933PheCys: 0.933 ± 0.71
1.399PheAsp: 1.399 ± 0.795
5.131PheGlu: 5.131 ± 1.575
2.799PhePhe: 2.799 ± 1.401
3.265PheGly: 3.265 ± 0.983
0.466PheHis: 0.466 ± 0.395
3.265PheIle: 3.265 ± 1.117
2.332PheLys: 2.332 ± 0.758
2.799PheLeu: 2.799 ± 0.844
0.933PheMet: 0.933 ± 0.43
0.933PheAsn: 0.933 ± 0.391
1.866PhePro: 1.866 ± 0.551
2.332PheGln: 2.332 ± 1.164
0.466PheArg: 0.466 ± 0.395
0.0PheSer: 0.0 ± 0.0
2.332PheThr: 2.332 ± 0.493
2.799PheVal: 2.799 ± 1.095
0.933PheTrp: 0.933 ± 0.391
0.933PheTyr: 0.933 ± 0.445
0.0PheXaa: 0.0 ± 0.0
Gly
3.731GlyAla: 3.731 ± 1.413
0.466GlyCys: 0.466 ± 0.395
2.799GlyAsp: 2.799 ± 1.113
4.198GlyGlu: 4.198 ± 1.042
1.866GlyPhe: 1.866 ± 1.129
4.198GlyGly: 4.198 ± 2.071
4.198GlyHis: 4.198 ± 0.999
2.332GlyIle: 2.332 ± 0.982
3.265GlyLys: 3.265 ± 0.764
6.53GlyLeu: 6.53 ± 1.55
0.0GlyMet: 0.0 ± 0.0
2.332GlyAsn: 2.332 ± 0.833
3.265GlyPro: 3.265 ± 0.642
2.332GlyGln: 2.332 ± 0.968
2.799GlyArg: 2.799 ± 1.095
3.731GlySer: 3.731 ± 0.944
8.396GlyThr: 8.396 ± 1.757
5.131GlyVal: 5.131 ± 1.063
0.0GlyTrp: 0.0 ± 0.0
1.866GlyTyr: 1.866 ± 0.551
0.0GlyXaa: 0.0 ± 0.0
His
3.265HisAla: 3.265 ± 0.679
0.933HisCys: 0.933 ± 0.445
0.0HisAsp: 0.0 ± 0.0
0.933HisGlu: 0.933 ± 0.43
0.466HisPhe: 0.466 ± 0.395
0.466HisGly: 0.466 ± 0.42
0.0HisHis: 0.0 ± 0.0
0.466HisIle: 0.466 ± 0.395
3.265HisLys: 3.265 ± 1.504
0.0HisLeu: 0.0 ± 0.0
0.466HisMet: 0.466 ± 0.772
0.0HisAsn: 0.0 ± 0.0
2.332HisPro: 2.332 ± 1.115
0.466HisGln: 0.466 ± 0.355
1.399HisArg: 1.399 ± 1.261
3.265HisSer: 3.265 ± 1.221
0.933HisThr: 0.933 ± 0.457
1.399HisVal: 1.399 ± 0.708
0.466HisTrp: 0.466 ± 0.395
1.866HisTyr: 1.866 ± 0.788
0.0HisXaa: 0.0 ± 0.0
Ile
0.466IleAla: 0.466 ± 0.372
0.466IleCys: 0.466 ± 0.395
3.265IleAsp: 3.265 ± 0.849
1.866IleGlu: 1.866 ± 1.084
1.399IlePhe: 1.399 ± 0.424
6.063IleGly: 6.063 ± 2.213
0.466IleHis: 0.466 ± 0.372
2.799IleIle: 2.799 ± 0.955
1.399IleLys: 1.399 ± 0.793
4.664IleLeu: 4.664 ± 0.885
0.466IleMet: 0.466 ± 0.42
2.332IleAsn: 2.332 ± 0.991
2.799IlePro: 2.799 ± 0.967
2.799IleGln: 2.799 ± 0.697
1.866IleArg: 1.866 ± 1.009
4.664IleSer: 4.664 ± 1.801
2.332IleThr: 2.332 ± 0.703
4.198IleVal: 4.198 ± 1.573
0.933IleTrp: 0.933 ± 0.71
1.866IleTyr: 1.866 ± 0.202
0.0IleXaa: 0.0 ± 0.0
Lys
2.799LysAla: 2.799 ± 1.551
2.332LysCys: 2.332 ± 1.186
1.866LysAsp: 1.866 ± 0.854
2.799LysGlu: 2.799 ± 0.893
0.933LysPhe: 0.933 ± 0.391
4.198LysGly: 4.198 ± 2.003
1.866LysHis: 1.866 ± 0.749
2.332LysIle: 2.332 ± 0.638
2.799LysLys: 2.799 ± 1.335
3.731LysLeu: 3.731 ± 1.087
1.399LysMet: 1.399 ± 0.668
0.933LysAsn: 0.933 ± 0.71
0.933LysPro: 0.933 ± 0.391
1.866LysGln: 1.866 ± 0.873
6.996LysArg: 6.996 ± 1.269
2.799LysSer: 2.799 ± 1.656
3.731LysThr: 3.731 ± 1.15
2.799LysVal: 2.799 ± 0.907
1.399LysTrp: 1.399 ± 0.478
3.265LysTyr: 3.265 ± 1.165
0.0LysXaa: 0.0 ± 0.0
Leu
6.063LeuAla: 6.063 ± 1.219
3.265LeuCys: 3.265 ± 2.438
5.131LeuAsp: 5.131 ± 0.667
4.664LeuGlu: 4.664 ± 1.588
4.198LeuPhe: 4.198 ± 1.452
4.664LeuGly: 4.664 ± 1.114
1.866LeuHis: 1.866 ± 0.965
3.731LeuIle: 3.731 ± 1.52
5.131LeuLys: 5.131 ± 1.945
5.131LeuLeu: 5.131 ± 1.115
0.466LeuMet: 0.466 ± 0.372
4.198LeuAsn: 4.198 ± 1.127
5.131LeuPro: 5.131 ± 1.04
5.597LeuGln: 5.597 ± 1.158
3.265LeuArg: 3.265 ± 0.766
4.198LeuSer: 4.198 ± 1.823
4.664LeuThr: 4.664 ± 1.729
4.664LeuVal: 4.664 ± 1.435
0.933LeuTrp: 0.933 ± 0.549
3.731LeuTyr: 3.731 ± 2.031
0.0LeuXaa: 0.0 ± 0.0
Met
0.0MetAla: 0.0 ± 0.0
0.933MetCys: 0.933 ± 0.391
1.399MetAsp: 1.399 ± 0.347
0.933MetGlu: 0.933 ± 0.43
0.933MetPhe: 0.933 ± 0.391
0.466MetGly: 0.466 ± 0.355
0.466MetHis: 0.466 ± 0.42
0.0MetIle: 0.0 ± 0.0
0.0MetLys: 0.0 ± 0.0
1.866MetLeu: 1.866 ± 0.979
0.466MetMet: 0.466 ± 0.42
0.0MetAsn: 0.0 ± 0.0
0.933MetPro: 0.933 ± 0.43
2.799MetGln: 2.799 ± 0.971
1.866MetArg: 1.866 ± 1.004
1.399MetSer: 1.399 ± 0.7
0.0MetThr: 0.0 ± 0.0
1.399MetVal: 1.399 ± 0.634
0.0MetTrp: 0.0 ± 0.0
0.933MetTyr: 0.933 ± 0.853
0.0MetXaa: 0.0 ± 0.0
Asn
1.866AsnAla: 1.866 ± 0.604
0.466AsnCys: 0.466 ± 0.355
0.933AsnAsp: 0.933 ± 0.789
2.799AsnGlu: 2.799 ± 1.656
2.799AsnPhe: 2.799 ± 0.87
2.332AsnGly: 2.332 ± 1.072
0.466AsnHis: 0.466 ± 0.355
3.731AsnIle: 3.731 ± 1.347
2.332AsnLys: 2.332 ± 0.493
1.399AsnLeu: 1.399 ± 0.928
0.0AsnMet: 0.0 ± 0.0
2.799AsnAsn: 2.799 ± 1.267
2.332AsnPro: 2.332 ± 0.638
0.933AsnGln: 0.933 ± 0.391
3.265AsnArg: 3.265 ± 0.477
1.866AsnSer: 1.866 ± 0.551
3.265AsnThr: 3.265 ± 0.55
4.198AsnVal: 4.198 ± 1.089
0.0AsnTrp: 0.0 ± 0.0
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
2.799ProAla: 2.799 ± 1.51
1.399ProCys: 1.399 ± 0.7
6.063ProAsp: 6.063 ± 1.982
2.799ProGlu: 2.799 ± 1.211
1.866ProPhe: 1.866 ± 0.587
1.399ProGly: 1.399 ± 0.743
1.866ProHis: 1.866 ± 1.037
2.799ProIle: 2.799 ± 1.293
3.731ProLys: 3.731 ± 1.121
6.53ProLeu: 6.53 ± 2.239
2.332ProMet: 2.332 ± 0.859
2.332ProAsn: 2.332 ± 1.051
6.996ProPro: 6.996 ± 1.478
1.399ProGln: 1.399 ± 0.478
4.664ProArg: 4.664 ± 1.388
7.463ProSer: 7.463 ± 0.901
3.731ProThr: 3.731 ± 1.782
3.731ProVal: 3.731 ± 1.637
0.466ProTrp: 0.466 ± 0.42
3.265ProTyr: 3.265 ± 1.542
0.0ProXaa: 0.0 ± 0.0
Gln
2.799GlnAla: 2.799 ± 0.589
0.466GlnCys: 0.466 ± 0.355
2.799GlnAsp: 2.799 ± 0.934
3.265GlnGlu: 3.265 ± 0.84
3.265GlnPhe: 3.265 ± 1.135
2.332GlnGly: 2.332 ± 0.703
0.466GlnHis: 0.466 ± 0.372
1.866GlnIle: 1.866 ± 0.587
0.466GlnLys: 0.466 ± 0.395
3.731GlnLeu: 3.731 ± 1.721
1.399GlnMet: 1.399 ± 0.773
1.866GlnAsn: 1.866 ± 1.068
4.664GlnPro: 4.664 ± 1.884
3.731GlnGln: 3.731 ± 1.1
3.731GlnArg: 3.731 ± 2.189
0.933GlnSer: 0.933 ± 0.445
2.799GlnThr: 2.799 ± 0.87
3.265GlnVal: 3.265 ± 0.879
0.933GlnTrp: 0.933 ± 0.71
2.799GlnTyr: 2.799 ± 0.848
0.0GlnXaa: 0.0 ± 0.0
Arg
2.332ArgAla: 2.332 ± 0.942
0.933ArgCys: 0.933 ± 0.549
3.265ArgAsp: 3.265 ± 0.809
1.399ArgGlu: 1.399 ± 0.983
1.866ArgPhe: 1.866 ± 0.748
4.198ArgGly: 4.198 ± 1.565
5.131ArgHis: 5.131 ± 1.288
0.933ArgIle: 0.933 ± 0.457
3.731ArgLys: 3.731 ± 1.5
8.396ArgLeu: 8.396 ± 1.174
0.466ArgMet: 0.466 ± 0.395
2.332ArgAsn: 2.332 ± 1.045
3.731ArgPro: 3.731 ± 1.605
2.332ArgGln: 2.332 ± 0.325
7.463ArgArg: 7.463 ± 2.949
3.265ArgSer: 3.265 ± 0.983
2.332ArgThr: 2.332 ± 0.955
3.731ArgVal: 3.731 ± 1.254
0.466ArgTrp: 0.466 ± 0.42
1.866ArgTyr: 1.866 ± 0.86
0.0ArgXaa: 0.0 ± 0.0
Ser
3.731SerAla: 3.731 ± 0.785
0.933SerCys: 0.933 ± 0.43
4.664SerAsp: 4.664 ± 0.648
3.265SerGlu: 3.265 ± 1.04
2.332SerPhe: 2.332 ± 0.694
5.597SerGly: 5.597 ± 1.398
0.933SerHis: 0.933 ± 0.549
4.664SerIle: 4.664 ± 1.537
2.799SerLys: 2.799 ± 1.153
7.463SerLeu: 7.463 ± 1.11
1.399SerMet: 1.399 ± 1.017
6.063SerAsn: 6.063 ± 3.116
4.664SerPro: 4.664 ± 0.65
4.198SerGln: 4.198 ± 0.646
4.664SerArg: 4.664 ± 1.486
10.728SerSer: 10.728 ± 3.267
6.53SerThr: 6.53 ± 2.233
6.063SerVal: 6.063 ± 0.713
0.933SerTrp: 0.933 ± 0.71
1.399SerTyr: 1.399 ± 0.347
0.0SerXaa: 0.0 ± 0.0
Thr
3.265ThrAla: 3.265 ± 0.983
1.399ThrCys: 1.399 ± 0.347
4.664ThrAsp: 4.664 ± 1.553
5.131ThrGlu: 5.131 ± 1.364
4.664ThrPhe: 4.664 ± 1.735
6.53ThrGly: 6.53 ± 0.931
0.933ThrHis: 0.933 ± 0.43
3.731ThrIle: 3.731 ± 1.402
3.731ThrLys: 3.731 ± 1.087
3.731ThrLeu: 3.731 ± 1.07
0.933ThrMet: 0.933 ± 0.391
0.466ThrAsn: 0.466 ± 0.355
5.131ThrPro: 5.131 ± 2.145
1.866ThrGln: 1.866 ± 0.558
3.731ThrArg: 3.731 ± 1.19
9.795ThrSer: 9.795 ± 2.336
4.664ThrThr: 4.664 ± 1.486
4.664ThrVal: 4.664 ± 0.7
0.466ThrTrp: 0.466 ± 0.42
0.933ThrTyr: 0.933 ± 0.436
0.0ThrXaa: 0.0 ± 0.0
Val
5.131ValAla: 5.131 ± 0.654
0.933ValCys: 0.933 ± 1.693
6.996ValAsp: 6.996 ± 0.906
4.198ValGlu: 4.198 ± 0.801
0.933ValPhe: 0.933 ± 0.789
2.799ValGly: 2.799 ± 0.87
0.933ValHis: 0.933 ± 0.445
3.265ValIle: 3.265 ± 0.912
2.332ValLys: 2.332 ± 0.758
5.131ValLeu: 5.131 ± 1.049
1.399ValMet: 1.399 ± 0.375
1.866ValAsn: 1.866 ± 0.748
6.063ValPro: 6.063 ± 2.465
3.731ValGln: 3.731 ± 1.034
3.265ValArg: 3.265 ± 1.302
7.929ValSer: 7.929 ± 2.02
4.664ValThr: 4.664 ± 1.937
6.063ValVal: 6.063 ± 1.56
0.933ValTrp: 0.933 ± 0.549
2.332ValTyr: 2.332 ± 0.694
0.0ValXaa: 0.0 ± 0.0
Trp
0.466TrpAla: 0.466 ± 0.355
0.0TrpCys: 0.0 ± 0.0
0.933TrpAsp: 0.933 ± 0.549
0.466TrpGlu: 0.466 ± 0.355
0.0TrpPhe: 0.0 ± 0.0
1.866TrpGly: 1.866 ± 0.749
0.466TrpHis: 0.466 ± 0.42
1.399TrpIle: 1.399 ± 0.347
1.399TrpLys: 1.399 ± 0.668
1.866TrpLeu: 1.866 ± 0.95
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.933TrpPro: 0.933 ± 0.391
0.933TrpGln: 0.933 ± 0.549
1.399TrpArg: 1.399 ± 0.858
0.466TrpSer: 0.466 ± 0.355
0.466TrpThr: 0.466 ± 0.395
0.933TrpVal: 0.933 ± 0.43
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.866TyrAla: 1.866 ± 0.551
0.0TyrCys: 0.0 ± 0.0
1.399TyrAsp: 1.399 ± 0.388
1.399TyrGlu: 1.399 ± 0.424
1.866TyrPhe: 1.866 ± 0.748
1.866TyrGly: 1.866 ± 0.202
0.0TyrHis: 0.0 ± 0.0
1.399TyrIle: 1.399 ± 0.478
2.332TyrLys: 2.332 ± 1.287
1.866TyrLeu: 1.866 ± 0.551
0.466TyrMet: 0.466 ± 0.355
1.866TyrAsn: 1.866 ± 0.595
1.399TyrPro: 1.399 ± 0.894
1.866TyrGln: 1.866 ± 0.979
2.799TyrArg: 2.799 ± 0.775
0.933TyrSer: 0.933 ± 0.43
2.332TyrThr: 2.332 ± 0.914
3.265TyrVal: 3.265 ± 0.916
0.933TyrTrp: 0.933 ± 0.549
2.332TyrTyr: 2.332 ± 0.847
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (2145 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski