Amino acid dipeptide frequency for Human papillomavirus 120

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
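The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's actual pipeline: the toy sequences, the function name `dipeptide_per_mille`, and the choice to count overlapping pairs within each protein only (never across protein boundaries) are all assumptions made for the example.

```python
from collections import Counter

# Hypothetical toy sequences in one-letter code; NOT the HPV 120 proteome.
proteins = ["MKKLLA", "MALLKK"]


def dipeptide_per_mille(seqs):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs."""
    counts = Counter()
    for seq in seqs:
        # Slide a window of length 2 along each protein (assumption:
        # pairs spanning two different proteins are not counted).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


# Uniform expectation for the 400 standard dipeptides: 1/400 = 2.5 per mille.
EXPECTED = 1000.0 / 400

freqs = dipeptide_per_mille(proteins)
for dp, f in sorted(freqs.items()):
    # Mirrors the red/blue marking: above or below the uniform expectation.
    label = "over" if f > EXPECTED else "under"
    print(f"{dp}: {f:.1f} per mille ({label}represented)")
```

With only a handful of residues every observed dipeptide is far above 2.5‰, so the comparison becomes meaningful only for a whole proteome.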

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
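As a quick worked conversion, the sketch below turns one table value into a plain fraction and a percentage. The helper name `per_mille_to_fraction` is hypothetical; the AlaSer value is taken from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


ala_ser = 6.475  # AlaSer frequency from the table, in per mille
fraction = per_mille_to_fraction(ala_ser)
percent = fraction * 100

print(f"{ala_ser} per mille = {fraction:.6f} = {percent:.4f}%")
```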

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.238 ± 1.296
AlaCys: 1.619 ± 0.719
AlaAsp: 4.047 ± 1.436
AlaGlu: 2.428 ± 0.962
AlaPhe: 2.428 ± 0.51
AlaGly: 1.214 ± 0.424
AlaHis: 0.0 ± 0.0
AlaIle: 3.238 ± 0.875
AlaLys: 4.047 ± 1.519
AlaLeu: 3.642 ± 1.048
AlaMet: 0.809 ± 0.593
AlaAsn: 2.023 ± 0.893
AlaPro: 3.642 ± 0.853
AlaGln: 4.047 ± 0.878
AlaArg: 2.428 ± 0.762
AlaSer: 6.475 ± 1.244
AlaThr: 4.047 ± 1.327
AlaVal: 4.047 ± 0.747
AlaTrp: 0.405 ± 0.326
AlaTyr: 2.023 ± 0.922
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.214 ± 0.553
CysCys: 1.619 ± 1.106
CysAsp: 0.405 ± 0.633
CysGlu: 0.405 ± 0.326
CysPhe: 1.214 ± 0.379
CysGly: 0.405 ± 0.633
CysHis: 0.405 ± 0.633
CysIle: 1.214 ± 0.588
CysLys: 2.833 ± 1.088
CysLeu: 1.214 ± 0.988
CysMet: 0.405 ± 0.539
CysAsn: 0.405 ± 0.326
CysPro: 2.023 ± 0.857
CysGln: 1.214 ± 0.767
CysArg: 1.214 ± 0.971
CysSer: 2.023 ± 1.339
CysThr: 0.809 ± 0.422
CysVal: 0.809 ± 0.519
CysTrp: 0.405 ± 0.326
CysTyr: 1.214 ± 0.528
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.666 ± 1.293
AspCys: 0.809 ± 0.356
AspAsp: 4.047 ± 2.052
AspGlu: 2.428 ± 0.548
AspPhe: 1.619 ± 0.938
AspGly: 3.642 ± 1.437
AspHis: 2.023 ± 0.593
AspIle: 6.07 ± 1.632
AspLys: 3.238 ± 0.643
AspLeu: 6.07 ± 1.275
AspMet: 2.428 ± 0.951
AspAsn: 2.833 ± 0.942
AspPro: 3.642 ± 1.819
AspGln: 1.619 ± 0.747
AspArg: 1.619 ± 0.6
AspSer: 5.666 ± 2.04
AspThr: 6.07 ± 1.466
AspVal: 2.833 ± 1.356
AspTrp: 0.405 ± 0.348
AspTyr: 2.023 ± 0.719
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.452 ± 0.599
GluCys: 0.809 ± 0.653
GluAsp: 5.666 ± 0.598
GluGlu: 7.285 ± 4.007
GluPhe: 2.023 ± 0.867
GluGly: 5.261 ± 1.885
GluHis: 0.405 ± 0.35
GluIle: 1.619 ± 0.763
GluLys: 4.047 ± 2.102
GluLeu: 4.856 ± 1.66
GluMet: 1.214 ± 0.728
GluAsn: 2.428 ± 1.001
GluPro: 2.833 ± 0.965
GluGln: 4.047 ± 0.781
GluArg: 3.238 ± 1.318
GluSer: 4.452 ± 1.586
GluThr: 4.856 ± 1.944
GluVal: 5.666 ± 0.826
GluTrp: 0.405 ± 0.326
GluTyr: 3.238 ± 1.416
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.214 ± 0.553
PheCys: 0.809 ± 0.519
PheAsp: 5.666 ± 0.788
PheGlu: 3.642 ± 1.568
PhePhe: 1.619 ± 0.276
PheGly: 2.023 ± 0.68
PheHis: 0.405 ± 0.485
PheIle: 1.619 ± 0.844
PheLys: 2.428 ± 1.177
PheLeu: 4.047 ± 1.323
PheMet: 0.0 ± 0.0
PheAsn: 1.619 ± 0.591
PhePro: 4.452 ± 1.188
PheGln: 2.428 ± 0.664
PheArg: 2.023 ± 0.498
PheSer: 0.809 ± 0.412
PheThr: 1.619 ± 0.628
PheVal: 2.833 ± 1.116
PheTrp: 1.619 ± 0.844
PheTyr: 1.214 ± 0.615
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.238 ± 0.797
GlyCys: 2.023 ± 0.855
GlyAsp: 2.833 ± 1.071
GlyGlu: 3.642 ± 1.448
GlyPhe: 0.809 ± 0.647
GlyGly: 8.499 ± 3.658
GlyHis: 2.833 ± 1.327
GlyIle: 3.238 ± 1.056
GlyLys: 3.642 ± 2.039
GlyLeu: 4.047 ± 1.054
GlyMet: 0.0 ± 0.0
GlyAsn: 4.452 ± 1.672
GlyPro: 4.047 ± 1.547
GlyGln: 3.642 ± 0.441
GlyArg: 4.856 ± 1.744
GlySer: 3.642 ± 0.871
GlyThr: 3.238 ± 1.098
GlyVal: 2.833 ± 0.776
GlyTrp: 0.0 ± 0.0
GlyTyr: 0.809 ± 0.457
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.214 ± 0.424
HisCys: 1.619 ± 1.042
HisAsp: 0.809 ± 0.647
HisGlu: 0.405 ± 0.633
HisPhe: 0.405 ± 0.35
HisGly: 0.405 ± 0.539
HisHis: 0.809 ± 0.566
HisIle: 0.809 ± 0.39
HisLys: 1.619 ± 0.746
HisLeu: 0.405 ± 0.539
HisMet: 0.0 ± 0.0
HisAsn: 0.809 ± 0.412
HisPro: 2.023 ± 0.789
HisGln: 0.405 ± 0.326
HisArg: 0.405 ± 0.326
HisSer: 0.809 ± 0.39
HisThr: 0.0 ± 0.0
HisVal: 2.428 ± 0.536
HisTrp: 1.214 ± 0.52
HisTyr: 0.809 ± 0.422
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.833 ± 0.97
IleCys: 0.809 ± 0.97
IleAsp: 4.047 ± 1.28
IleGlu: 4.856 ± 2.586
IlePhe: 1.619 ± 0.659
IleGly: 4.856 ± 1.511
IleHis: 0.405 ± 0.326
IleIle: 4.047 ± 1.323
IleLys: 2.833 ± 0.594
IleLeu: 4.047 ± 0.643
IleMet: 0.0 ± 0.0
IleAsn: 2.428 ± 0.764
IlePro: 2.023 ± 1.026
IleGln: 2.833 ± 1.409
IleArg: 1.214 ± 1.275
IleSer: 3.642 ± 1.523
IleThr: 1.619 ± 0.521
IleVal: 4.856 ± 1.261
IleTrp: 1.214 ± 0.52
IleTyr: 2.833 ± 0.768
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.238 ± 0.465
LysCys: 1.619 ± 0.903
LysAsp: 1.619 ± 1.013
LysGlu: 3.238 ± 1.187
LysPhe: 3.642 ± 1.313
LysGly: 3.238 ± 0.644
LysHis: 1.619 ± 0.882
LysIle: 3.238 ± 0.615
LysLys: 4.047 ± 1.353
LysLeu: 4.856 ± 1.91
LysMet: 0.0 ± 0.0
LysAsn: 2.428 ± 0.485
LysPro: 2.023 ± 2.11
LysGln: 1.214 ± 0.655
LysArg: 6.07 ± 0.383
LysSer: 4.047 ± 1.415
LysThr: 2.428 ± 0.859
LysVal: 2.023 ± 0.623
LysTrp: 0.405 ± 0.539
LysTyr: 3.238 ± 1.181
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.261 ± 0.847
LeuCys: 1.619 ± 0.844
LeuAsp: 6.07 ± 1.369
LeuGlu: 8.903 ± 1.837
LeuPhe: 4.452 ± 1.133
LeuGly: 4.856 ± 1.449
LeuHis: 2.023 ± 0.629
LeuIle: 4.047 ± 1.478
LeuLys: 4.452 ± 0.878
LeuLeu: 8.903 ± 2.024
LeuMet: 2.023 ± 0.507
LeuAsn: 1.214 ± 0.767
LeuPro: 2.428 ± 1.006
LeuGln: 5.261 ± 1.194
LeuArg: 5.261 ± 0.954
LeuSer: 6.475 ± 1.954
LeuThr: 4.856 ± 1.328
LeuVal: 4.047 ± 1.573
LeuTrp: 1.214 ± 0.678
LeuTyr: 1.214 ± 0.371
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.0 ± 0.0
MetCys: 0.0 ± 0.0
MetAsp: 1.619 ± 0.521
MetGlu: 0.405 ± 0.348
MetPhe: 1.619 ± 0.844
MetGly: 0.0 ± 0.0
MetHis: 0.0 ± 0.0
MetIle: 1.214 ± 0.781
MetLys: 2.023 ± 0.854
MetLeu: 1.214 ± 0.622
MetMet: 0.405 ± 0.539
MetAsn: 1.619 ± 0.596
MetPro: 0.405 ± 0.348
MetGln: 0.809 ± 1.078
MetArg: 0.809 ± 0.422
MetSer: 1.619 ± 0.982
MetThr: 0.809 ± 0.647
MetVal: 1.214 ± 0.527
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.833 ± 0.79
AsnCys: 0.809 ± 0.631
AsnAsp: 2.428 ± 0.857
AsnGlu: 2.023 ± 0.854
AsnPhe: 2.023 ± 0.66
AsnGly: 2.428 ± 1.09
AsnHis: 0.809 ± 0.59
AsnIle: 2.833 ± 1.232
AsnLys: 2.428 ± 0.951
AsnLeu: 1.619 ± 0.693
AsnMet: 0.405 ± 0.35
AsnAsn: 2.833 ± 0.79
AsnPro: 3.238 ± 1.098
AsnGln: 1.214 ± 0.715
AsnArg: 2.023 ± 0.473
AsnSer: 4.047 ± 0.758
AsnThr: 2.428 ± 0.847
AsnVal: 2.428 ± 0.808
AsnTrp: 0.405 ± 0.323
AsnTyr: 1.619 ± 0.599
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.047 ± 1.442
ProCys: 1.619 ± 0.648
ProAsp: 5.261 ± 2.275
ProGlu: 4.856 ± 1.883
ProPhe: 2.833 ± 0.97
ProGly: 2.428 ± 2.035
ProHis: 0.0 ± 0.0
ProIle: 2.428 ± 1.576
ProLys: 3.238 ± 1.11
ProLeu: 6.07 ± 0.824
ProMet: 0.405 ± 0.326
ProAsn: 2.023 ± 0.6
ProPro: 9.713 ± 5.455
ProGln: 1.214 ± 1.172
ProArg: 2.023 ± 0.946
ProSer: 4.856 ± 1.662
ProThr: 5.261 ± 1.934
ProVal: 5.261 ± 1.241
ProTrp: 0.405 ± 0.348
ProTyr: 1.619 ± 0.65
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.214 ± 1.221
GlnCys: 0.809 ± 0.422
GlnAsp: 2.023 ± 0.725
GlnGlu: 4.047 ± 0.517
GlnPhe: 2.833 ± 0.811
GlnGly: 2.023 ± 0.99
GlnHis: 0.809 ± 0.653
GlnIle: 4.856 ± 0.859
GlnLys: 1.214 ± 0.691
GlnLeu: 6.475 ± 1.432
GlnMet: 2.023 ± 0.683
GlnAsn: 2.023 ± 0.6
GlnPro: 2.428 ± 0.991
GlnGln: 4.047 ± 1.098
GlnArg: 3.642 ± 0.885
GlnSer: 1.619 ± 0.747
GlnThr: 1.619 ± 0.882
GlnVal: 2.428 ± 0.569
GlnTrp: 0.809 ± 0.653
GlnTyr: 3.238 ± 0.825
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.047 ± 0.921
ArgCys: 1.214 ± 0.988
ArgAsp: 3.238 ± 1.206
ArgGlu: 5.261 ± 1.216
ArgPhe: 3.642 ± 1.098
ArgGly: 3.642 ± 1.462
ArgHis: 2.023 ± 0.609
ArgIle: 2.023 ± 0.504
ArgLys: 3.642 ± 1.307
ArgLeu: 6.475 ± 0.707
ArgMet: 1.214 ± 0.623
ArgAsn: 2.023 ± 0.538
ArgPro: 3.238 ± 1.527
ArgGln: 2.833 ± 1.084
ArgArg: 6.07 ± 1.841
ArgSer: 5.666 ± 2.682
ArgThr: 2.833 ± 0.818
ArgVal: 3.238 ± 1.545
ArgTrp: 0.405 ± 0.348
ArgTyr: 1.619 ± 0.672
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.642 ± 0.932
SerCys: 0.405 ± 0.633
SerAsp: 4.856 ± 1.869
SerGlu: 3.642 ± 0.887
SerPhe: 3.642 ± 1.146
SerGly: 4.856 ± 1.173
SerHis: 0.0 ± 0.0
SerIle: 2.428 ± 0.935
SerLys: 2.428 ± 1.158
SerLeu: 8.903 ± 1.186
SerMet: 1.619 ± 1.305
SerAsn: 4.047 ± 0.897
SerPro: 4.047 ± 1.546
SerGln: 2.833 ± 0.569
SerArg: 7.285 ± 2.628
SerSer: 4.047 ± 2.219
SerThr: 4.856 ± 1.491
SerVal: 2.833 ± 0.636
SerTrp: 1.214 ± 0.588
SerTyr: 2.428 ± 0.507
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.619 ± 1.049
ThrCys: 0.809 ± 0.39
ThrAsp: 4.047 ± 0.637
ThrGlu: 3.642 ± 2.389
ThrPhe: 1.214 ± 0.382
ThrGly: 3.642 ± 1.514
ThrHis: 0.0 ± 0.0
ThrIle: 2.428 ± 1.387
ThrLys: 2.428 ± 0.831
ThrLeu: 3.238 ± 1.056
ThrMet: 1.214 ± 0.682
ThrAsn: 1.214 ± 0.424
ThrPro: 6.88 ± 1.824
ThrGln: 3.238 ± 0.732
ThrArg: 5.261 ± 1.655
ThrSer: 3.642 ± 0.946
ThrThr: 3.238 ± 1.383
ThrVal: 7.689 ± 1.665
ThrTrp: 0.809 ± 0.695
ThrTyr: 1.619 ± 0.719
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.238 ± 0.49
ValCys: 1.619 ± 0.819
ValAsp: 4.856 ± 1.491
ValGlu: 5.261 ± 1.125
ValPhe: 2.428 ± 1.004
ValGly: 4.856 ± 1.405
ValHis: 2.428 ± 0.979
ValIle: 2.833 ± 1.16
ValLys: 1.214 ± 0.788
ValLeu: 3.238 ± 1.188
ValMet: 0.405 ± 0.354
ValAsn: 1.214 ± 0.655
ValPro: 4.452 ± 0.677
ValGln: 4.047 ± 1.181
ValArg: 6.07 ± 1.426
ValSer: 4.452 ± 1.003
ValThr: 3.642 ± 1.131
ValVal: 2.428 ± 1.309
ValTrp: 0.809 ± 0.412
ValTyr: 2.428 ± 0.554
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.214 ± 0.682
TrpCys: 0.405 ± 0.326
TrpAsp: 0.405 ± 0.323
TrpGlu: 1.214 ± 0.93
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 1.214 ± 0.979
TrpLys: 0.809 ± 0.548
TrpLeu: 1.214 ± 0.678
TrpMet: 0.405 ± 0.539
TrpAsn: 0.809 ± 0.647
TrpPro: 0.0 ± 0.0
TrpGln: 1.214 ± 0.682
TrpArg: 0.405 ± 0.348
TrpSer: 1.214 ± 1.043
TrpThr: 0.809 ± 0.548
TrpVal: 1.214 ± 0.588
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.405 ± 0.326
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.428 ± 0.646
TyrCys: 0.405 ± 0.485
TyrAsp: 0.809 ± 0.39
TyrGlu: 1.214 ± 0.52
TyrPhe: 2.023 ± 1.045
TyrGly: 4.047 ± 0.761
TyrHis: 0.405 ± 0.323
TyrIle: 1.619 ± 0.713
TyrLys: 1.619 ± 1.042
TyrLeu: 4.047 ± 1.231
TyrMet: 0.405 ± 0.323
TyrAsn: 2.428 ± 0.764
TyrPro: 2.023 ± 0.725
TyrGln: 2.023 ± 1.027
TyrArg: 2.428 ± 0.84
TyrSer: 0.809 ± 0.386
TyrThr: 2.833 ± 0.773
TyrVal: 1.214 ± 0.663
TyrTrp: 0.809 ± 0.593
TyrTyr: 2.428 ± 1.017
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2472 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
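One way to read this note is sketched below: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. This is only an illustration under stated assumptions. The page does not say which spread measure is used (the standard deviation here is a guess), and the toy sequences and the function names `frequency_per_mille` and `bootstrap_error` are hypothetical.

```python
import random
import statistics

# Hypothetical toy sequences in one-letter code; NOT the HPV 120 proteome.
proteins = ["MKKLLA", "MALLKK", "MLLPPG", "MKPLLS"]


def frequency_per_mille(seqs, dipeptide):
    """Frequency of one dipeptide, in per mille, over all overlapping pairs."""
    hits = 0
    total = 0
    for seq in seqs:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(seqs, dipeptide, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(seqs, k=len(seqs))
        estimates.append(frequency_per_mille(sample, dipeptide))
    # Assumption: the reported error is the standard deviation of the estimates.
    return statistics.stdev(estimates)


mean = frequency_per_mille(proteins, "LL")
err = bootstrap_error(proteins, "LL")
print(f"LL: {mean:.3f} ± {err:.3f}")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins. With only 7 proteins in this proteome, errors of this kind can be large relative to the mean.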

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski