Amino acid dipepetide frequency for Human papillomavirus 82

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.774AlaAla: 3.774 ± 1.41
1.258AlaCys: 1.258 ± 1.129
2.516AlaAsp: 2.516 ± 0.985
2.516AlaGlu: 2.516 ± 0.544
3.774AlaPhe: 3.774 ± 0.743
4.612AlaGly: 4.612 ± 0.959
0.839AlaHis: 0.839 ± 0.451
1.677AlaIle: 1.677 ± 0.638
5.031AlaLys: 5.031 ± 2.032
4.193AlaLeu: 4.193 ± 1.155
1.677AlaMet: 1.677 ± 0.599
2.516AlaAsn: 2.516 ± 0.892
5.031AlaPro: 5.031 ± 2.521
4.193AlaGln: 4.193 ± 1.185
3.354AlaArg: 3.354 ± 1.275
2.516AlaSer: 2.516 ± 0.916
5.87AlaThr: 5.87 ± 1.41
4.193AlaVal: 4.193 ± 0.913
0.419AlaTrp: 0.419 ± 0.318
1.258AlaTyr: 1.258 ± 0.947
0.0AlaXaa: 0.0 ± 0.0
Cys
2.935CysAla: 2.935 ± 0.986
1.258CysCys: 1.258 ± 0.917
1.258CysAsp: 1.258 ± 0.677
0.419CysGlu: 0.419 ± 0.516
1.677CysPhe: 1.677 ± 1.871
1.258CysGly: 1.258 ± 0.78
1.258CysHis: 1.258 ± 0.653
0.839CysIle: 0.839 ± 0.388
2.516CysLys: 2.516 ± 1.311
3.354CysLeu: 3.354 ± 2.142
0.419CysMet: 0.419 ± 0.318
0.419CysAsn: 0.419 ± 0.516
2.516CysPro: 2.516 ± 0.78
1.677CysGln: 1.677 ± 0.914
1.258CysArg: 1.258 ± 1.235
2.096CysSer: 2.096 ± 1.373
0.839CysThr: 0.839 ± 0.636
2.935CysVal: 2.935 ± 1.333
1.258CysTrp: 1.258 ± 0.394
1.677CysTyr: 1.677 ± 1.355
0.0CysXaa: 0.0 ± 0.0
Asp
1.677AspAla: 1.677 ± 0.776
1.258AspCys: 1.258 ± 0.394
2.516AspAsp: 2.516 ± 1.167
2.096AspGlu: 2.096 ± 1.321
1.677AspPhe: 1.677 ± 0.67
2.516AspGly: 2.516 ± 0.678
0.839AspHis: 0.839 ± 0.451
4.193AspIle: 4.193 ± 1.545
3.354AspLys: 3.354 ± 0.865
3.354AspLeu: 3.354 ± 1.42
0.419AspMet: 0.419 ± 0.316
2.935AspAsn: 2.935 ± 0.693
3.354AspPro: 3.354 ± 1.245
2.096AspGln: 2.096 ± 0.754
1.677AspArg: 1.677 ± 0.599
7.128AspSer: 7.128 ± 1.678
6.289AspThr: 6.289 ± 1.053
3.774AspVal: 3.774 ± 0.734
0.839AspTrp: 0.839 ± 0.636
1.677AspTyr: 1.677 ± 0.71
0.0AspXaa: 0.0 ± 0.0
Glu
3.774GluAla: 3.774 ± 1.334
0.419GluCys: 0.419 ± 0.389
5.031GluAsp: 5.031 ± 1.175
5.031GluGlu: 5.031 ± 1.114
0.0GluPhe: 0.0 ± 0.0
1.677GluGly: 1.677 ± 1.003
0.419GluHis: 0.419 ± 0.316
3.774GluIle: 3.774 ± 0.809
0.839GluLys: 0.839 ± 0.669
5.87GluLeu: 5.87 ± 1.881
0.839GluMet: 0.839 ± 0.493
2.935GluAsn: 2.935 ± 1.059
3.774GluPro: 3.774 ± 2.157
1.677GluGln: 1.677 ± 0.807
1.677GluArg: 1.677 ± 0.574
0.839GluSer: 0.839 ± 0.678
3.354GluThr: 3.354 ± 1.283
2.516GluVal: 2.516 ± 0.909
0.419GluTrp: 0.419 ± 0.318
2.516GluTyr: 2.516 ± 1.217
0.0GluXaa: 0.0 ± 0.0
Phe
3.354PheAla: 3.354 ± 0.95
0.419PheCys: 0.419 ± 0.318
2.935PheAsp: 2.935 ± 0.954
2.096PheGlu: 2.096 ± 1.024
0.839PhePhe: 0.839 ± 0.388
1.677PheGly: 1.677 ± 0.92
0.839PheHis: 0.839 ± 0.586
4.193PheIle: 4.193 ± 0.8
3.354PheLys: 3.354 ± 1.414
5.031PheLeu: 5.031 ± 1.908
1.258PheMet: 1.258 ± 0.561
1.258PheAsn: 1.258 ± 0.947
0.419PhePro: 0.419 ± 0.318
1.677PheGln: 1.677 ± 0.87
0.419PheArg: 0.419 ± 0.316
3.354PheSer: 3.354 ± 1.433
2.516PheThr: 2.516 ± 0.889
2.516PheVal: 2.516 ± 1.568
1.258PheTrp: 1.258 ± 0.575
2.516PheTyr: 2.516 ± 1.14
0.0PheXaa: 0.0 ± 0.0
Gly
3.354GlyAla: 3.354 ± 0.809
2.096GlyCys: 2.096 ± 1.003
3.354GlyAsp: 3.354 ± 1.436
2.516GlyGlu: 2.516 ± 0.49
1.258GlyPhe: 1.258 ± 0.8
4.612GlyGly: 4.612 ± 1.954
2.516GlyHis: 2.516 ± 1.015
3.774GlyIle: 3.774 ± 0.84
2.096GlyLys: 2.096 ± 0.682
2.935GlyLeu: 2.935 ± 1.257
0.0GlyMet: 0.0 ± 0.0
4.193GlyAsn: 4.193 ± 1.332
2.096GlyPro: 2.096 ± 0.575
2.096GlyGln: 2.096 ± 0.717
2.516GlyArg: 2.516 ± 1.158
3.774GlySer: 3.774 ± 1.298
7.547GlyThr: 7.547 ± 2.33
2.935GlyVal: 2.935 ± 0.957
0.419GlyTrp: 0.419 ± 0.318
1.258GlyTyr: 1.258 ± 0.686
0.0GlyXaa: 0.0 ± 0.0
His
0.419HisAla: 0.419 ± 0.391
1.258HisCys: 1.258 ± 0.904
0.839HisAsp: 0.839 ± 0.365
0.839HisGlu: 0.839 ± 1.031
0.419HisPhe: 0.419 ± 0.318
1.677HisGly: 1.677 ± 0.528
0.419HisHis: 0.419 ± 0.582
0.839HisIle: 0.839 ± 0.365
0.839HisLys: 0.839 ± 0.388
2.096HisLeu: 2.096 ± 0.84
0.419HisMet: 0.419 ± 0.389
1.677HisAsn: 1.677 ± 0.698
1.258HisPro: 1.258 ± 0.808
1.258HisGln: 1.258 ± 1.166
2.516HisArg: 2.516 ± 0.682
1.258HisSer: 1.258 ± 0.953
1.258HisThr: 1.258 ± 0.8
0.419HisVal: 0.419 ± 0.391
0.839HisTrp: 0.839 ± 0.391
1.258HisTyr: 1.258 ± 0.383
0.0HisXaa: 0.0 ± 0.0
Ile
3.774IleAla: 3.774 ± 0.966
2.096IleCys: 2.096 ± 1.156
3.354IleAsp: 3.354 ± 1.403
2.935IleGlu: 2.935 ± 1.316
2.935IlePhe: 2.935 ± 1.434
2.096IleGly: 2.096 ± 1.138
0.839IleHis: 0.839 ± 0.493
2.935IleIle: 2.935 ± 1.433
1.258IleLys: 1.258 ± 0.722
1.677IleLeu: 1.677 ± 0.688
0.839IleMet: 0.839 ± 0.391
1.677IleAsn: 1.677 ± 0.556
5.451IlePro: 5.451 ± 2.386
2.516IleGln: 2.516 ± 1.039
2.935IleArg: 2.935 ± 1.334
4.193IleSer: 4.193 ± 1.232
5.451IleThr: 5.451 ± 1.017
4.193IleVal: 4.193 ± 0.593
0.0IleTrp: 0.0 ± 0.0
2.096IleTyr: 2.096 ± 0.756
0.0IleXaa: 0.0 ± 0.0
Lys
2.935LysAla: 2.935 ± 0.707
2.096LysCys: 2.096 ± 0.956
2.516LysAsp: 2.516 ± 0.871
4.612LysGlu: 4.612 ± 1.029
2.935LysPhe: 2.935 ± 1.032
2.096LysGly: 2.096 ± 0.581
1.677LysHis: 1.677 ± 0.902
1.258LysIle: 1.258 ± 0.458
2.935LysLys: 2.935 ± 1.634
2.096LysLeu: 2.096 ± 0.688
0.839LysMet: 0.839 ± 0.591
1.677LysAsn: 1.677 ± 0.776
2.516LysPro: 2.516 ± 0.894
2.516LysGln: 2.516 ± 0.531
3.774LysArg: 3.774 ± 0.738
3.354LysSer: 3.354 ± 1.109
2.935LysThr: 2.935 ± 1.219
3.774LysVal: 3.774 ± 0.545
0.0LysTrp: 0.0 ± 0.0
2.516LysTyr: 2.516 ± 0.9
0.0LysXaa: 0.0 ± 0.0
Leu
2.935LeuAla: 2.935 ± 0.717
3.354LeuCys: 3.354 ± 2.255
4.193LeuAsp: 4.193 ± 0.715
4.193LeuGlu: 4.193 ± 0.819
2.935LeuPhe: 2.935 ± 0.826
5.451LeuGly: 5.451 ± 1.403
2.516LeuHis: 2.516 ± 0.82
2.935LeuIle: 2.935 ± 1.422
5.451LeuLys: 5.451 ± 1.715
8.805LeuLeu: 8.805 ± 4.16
1.677LeuMet: 1.677 ± 0.727
3.774LeuAsn: 3.774 ± 1.402
2.935LeuPro: 2.935 ± 1.345
6.709LeuGln: 6.709 ± 2.022
3.774LeuArg: 3.774 ± 0.81
5.031LeuSer: 5.031 ± 1.325
3.774LeuThr: 3.774 ± 1.287
4.612LeuVal: 4.612 ± 1.256
0.839LeuTrp: 0.839 ± 0.391
2.935LeuTyr: 2.935 ± 1.118
0.0LeuXaa: 0.0 ± 0.0
Met
1.677MetAla: 1.677 ± 0.553
0.839MetCys: 0.839 ± 0.451
1.677MetAsp: 1.677 ± 0.553
1.258MetGlu: 1.258 ± 0.798
2.935MetPhe: 2.935 ± 1.166
0.0MetGly: 0.0 ± 0.0
0.839MetHis: 0.839 ± 0.669
0.419MetIle: 0.419 ± 0.316
0.0MetLys: 0.0 ± 0.0
0.839MetLeu: 0.839 ± 0.68
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
0.0MetPro: 0.0 ± 0.0
1.258MetGln: 1.258 ± 0.8
1.258MetArg: 1.258 ± 1.254
1.677MetSer: 1.677 ± 0.926
0.839MetThr: 0.839 ± 0.451
1.677MetVal: 1.677 ± 0.528
0.0MetTrp: 0.0 ± 0.0
1.677MetTyr: 1.677 ± 1.003
0.0MetXaa: 0.0 ± 0.0
Asn
2.096AsnAla: 2.096 ± 1.231
1.677AsnCys: 1.677 ± 0.746
2.516AsnAsp: 2.516 ± 1.01
1.677AsnGlu: 1.677 ± 0.7
2.096AsnPhe: 2.096 ± 1.003
1.677AsnGly: 1.677 ± 0.776
0.419AsnHis: 0.419 ± 0.389
2.935AsnIle: 2.935 ± 0.959
3.774AsnLys: 3.774 ± 1.62
3.354AsnLeu: 3.354 ± 0.785
1.677AsnMet: 1.677 ± 1.186
1.677AsnAsn: 1.677 ± 0.856
2.935AsnPro: 2.935 ± 1.019
0.839AsnGln: 0.839 ± 0.388
0.419AsnArg: 0.419 ± 0.318
3.354AsnSer: 3.354 ± 1.069
5.451AsnThr: 5.451 ± 1.98
4.612AsnVal: 4.612 ± 1.042
0.839AsnTrp: 0.839 ± 0.636
0.839AsnTyr: 0.839 ± 0.493
0.0AsnXaa: 0.0 ± 0.0
Pro
5.031ProAla: 5.031 ± 2.302
0.839ProCys: 0.839 ± 0.623
5.451ProAsp: 5.451 ± 1.895
2.096ProGlu: 2.096 ± 1.033
2.516ProPhe: 2.516 ± 1.371
1.677ProGly: 1.677 ± 0.67
0.839ProHis: 0.839 ± 0.365
2.935ProIle: 2.935 ± 1.651
3.354ProLys: 3.354 ± 0.955
7.966ProLeu: 7.966 ± 1.277
1.677ProMet: 1.677 ± 0.967
2.096ProAsn: 2.096 ± 0.678
6.709ProPro: 6.709 ± 2.348
3.774ProGln: 3.774 ± 1.687
4.193ProArg: 4.193 ± 2.9
5.031ProSer: 5.031 ± 1.995
4.612ProThr: 4.612 ± 3.283
1.677ProVal: 1.677 ± 0.6
1.258ProTrp: 1.258 ± 0.963
1.258ProTyr: 1.258 ± 0.787
0.0ProXaa: 0.0 ± 0.0
Gln
4.193GlnAla: 4.193 ± 1.499
1.258GlnCys: 1.258 ± 0.816
1.258GlnAsp: 1.258 ± 0.717
0.839GlnGlu: 0.839 ± 0.451
2.096GlnPhe: 2.096 ± 0.963
2.935GlnGly: 2.935 ± 0.689
2.096GlnHis: 2.096 ± 1.134
2.935GlnIle: 2.935 ± 0.756
2.096GlnLys: 2.096 ± 1.114
5.87GlnLeu: 5.87 ± 1.733
1.677GlnMet: 1.677 ± 1.013
0.839GlnAsn: 0.839 ± 0.636
3.354GlnPro: 3.354 ± 1.303
2.516GlnGln: 2.516 ± 0.871
2.935GlnArg: 2.935 ± 1.439
2.516GlnSer: 2.516 ± 0.697
3.774GlnThr: 3.774 ± 0.705
2.935GlnVal: 2.935 ± 1.01
1.677GlnTrp: 1.677 ± 0.302
1.677GlnTyr: 1.677 ± 0.687
0.0GlnXaa: 0.0 ± 0.0
Arg
3.354ArgAla: 3.354 ± 1.11
2.096ArgCys: 2.096 ± 1.308
2.096ArgAsp: 2.096 ± 0.965
2.516ArgGlu: 2.516 ± 1.021
3.354ArgPhe: 3.354 ± 0.677
1.677ArgGly: 1.677 ± 0.922
2.096ArgHis: 2.096 ± 1.194
2.516ArgIle: 2.516 ± 1.616
3.774ArgLys: 3.774 ± 0.738
4.612ArgLeu: 4.612 ± 0.819
0.0ArgMet: 0.0 ± 0.0
1.258ArgAsn: 1.258 ± 0.78
3.354ArgPro: 3.354 ± 1.507
2.096ArgGln: 2.096 ± 1.022
2.935ArgArg: 2.935 ± 1.344
3.354ArgSer: 3.354 ± 1.107
2.935ArgThr: 2.935 ± 0.86
2.096ArgVal: 2.096 ± 1.0
0.839ArgTrp: 0.839 ± 0.669
2.096ArgTyr: 2.096 ± 0.971
0.0ArgXaa: 0.0 ± 0.0
Ser
5.451SerAla: 5.451 ± 1.87
0.839SerCys: 0.839 ± 0.388
3.774SerAsp: 3.774 ± 1.265
1.677SerGlu: 1.677 ± 0.746
1.258SerPhe: 1.258 ± 0.561
5.451SerGly: 5.451 ± 2.256
0.419SerHis: 0.419 ± 0.318
3.774SerIle: 3.774 ± 1.283
1.258SerLys: 1.258 ± 0.458
5.031SerLeu: 5.031 ± 0.977
1.677SerMet: 1.677 ± 0.722
4.193SerAsn: 4.193 ± 2.015
5.031SerPro: 5.031 ± 1.462
3.774SerGln: 3.774 ± 1.615
4.193SerArg: 4.193 ± 1.185
10.901SerSer: 10.901 ± 2.584
10.063SerThr: 10.063 ± 2.917
4.193SerVal: 4.193 ± 1.244
0.0SerTrp: 0.0 ± 0.0
2.935SerTyr: 2.935 ± 0.85
0.0SerXaa: 0.0 ± 0.0
Thr
2.935ThrAla: 2.935 ± 1.175
4.612ThrCys: 4.612 ± 1.053
2.096ThrAsp: 2.096 ± 0.678
3.354ThrGlu: 3.354 ± 0.677
4.193ThrPhe: 4.193 ± 1.311
7.966ThrGly: 7.966 ± 1.753
0.419ThrHis: 0.419 ± 0.391
3.774ThrIle: 3.774 ± 0.822
2.096ThrLys: 2.096 ± 0.954
4.612ThrLeu: 4.612 ± 1.17
2.096ThrMet: 2.096 ± 0.786
5.87ThrAsn: 5.87 ± 0.476
9.224ThrPro: 9.224 ± 2.996
5.451ThrGln: 5.451 ± 1.37
2.516ThrArg: 2.516 ± 0.852
6.709ThrSer: 6.709 ± 2.609
6.709ThrThr: 6.709 ± 2.174
6.289ThrVal: 6.289 ± 2.501
0.839ThrTrp: 0.839 ± 0.451
3.354ThrTyr: 3.354 ± 1.03
0.0ThrXaa: 0.0 ± 0.0
Val
4.193ValAla: 4.193 ± 1.226
3.354ValCys: 3.354 ± 1.852
5.451ValAsp: 5.451 ± 0.969
3.354ValGlu: 3.354 ± 0.866
2.516ValPhe: 2.516 ± 1.08
2.096ValGly: 2.096 ± 1.222
1.258ValHis: 1.258 ± 0.677
2.516ValIle: 2.516 ± 0.544
2.935ValLys: 2.935 ± 0.707
3.354ValLeu: 3.354 ± 1.659
0.839ValMet: 0.839 ± 0.838
2.516ValAsn: 2.516 ± 0.678
3.774ValPro: 3.774 ± 0.96
2.935ValGln: 2.935 ± 1.364
2.096ValArg: 2.096 ± 0.577
4.612ValSer: 4.612 ± 1.134
5.87ValThr: 5.87 ± 2.733
3.774ValVal: 3.774 ± 0.828
0.839ValTrp: 0.839 ± 0.466
4.193ValTyr: 4.193 ± 2.264
0.0ValXaa: 0.0 ± 0.0
Trp
0.839TrpAla: 0.839 ± 0.388
0.419TrpCys: 0.419 ± 0.635
0.0TrpAsp: 0.0 ± 0.0
0.419TrpGlu: 0.419 ± 0.389
0.839TrpPhe: 0.839 ± 0.623
1.258TrpGly: 1.258 ± 0.394
0.419TrpHis: 0.419 ± 0.389
0.839TrpIle: 0.839 ± 0.636
0.419TrpLys: 0.419 ± 0.318
0.839TrpLeu: 0.839 ± 0.388
0.0TrpMet: 0.0 ± 0.0
1.258TrpAsn: 1.258 ± 0.947
0.419TrpPro: 0.419 ± 0.391
0.0TrpGln: 0.0 ± 0.0
0.839TrpArg: 0.839 ± 0.388
0.839TrpSer: 0.839 ± 0.365
2.935TrpThr: 2.935 ± 1.714
0.839TrpVal: 0.839 ± 0.451
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.516TyrAla: 2.516 ± 0.594
0.419TyrCys: 0.419 ± 0.516
0.839TyrAsp: 0.839 ± 0.466
2.935TyrGlu: 2.935 ± 1.186
1.677TyrPhe: 1.677 ± 0.528
2.935TyrGly: 2.935 ± 1.119
0.839TyrHis: 0.839 ± 0.365
4.193TyrIle: 4.193 ± 1.196
1.677TyrLys: 1.677 ± 0.68
2.935TyrLeu: 2.935 ± 0.967
0.419TyrMet: 0.419 ± 0.389
2.096TyrAsn: 2.096 ± 0.435
0.839TyrPro: 0.839 ± 0.388
0.419TyrGln: 0.419 ± 0.318
3.774TyrArg: 3.774 ± 1.25
3.354TyrSer: 3.354 ± 1.446
2.096TyrThr: 2.096 ± 1.489
2.935TyrVal: 2.935 ± 1.437
0.839TyrTrp: 0.839 ± 0.391
3.774TyrTyr: 3.774 ± 0.704
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (2386 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski