Amino acid dipeptide frequency for Human papillomavirus 40

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 standard dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
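As a rough illustration of how such frequencies can be computed, the Python sketch below counts overlapping residue pairs within each protein, reports them in per mille, and compares a value with the uniform expectation of 1/400. The function names, the one-letter alphabet, the mapping of unknown residues to X, and the use of overlapping windows are assumptions made for this example; the database's actual procedure may differ.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids (one-letter codes); 'X' stands in for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"

# Uniform expectation over the 400 standard dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: any non-standard or unknown residue is collapsed to 'X'.
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"   # would be marked red in the table
    if freq_per_mille < expected:
        return "under"  # would be marked blue in the table
    return "expected"
```

For example, `classify(6.667)` labels the AlaAla value from the table as overrepresented, while `classify(0.417)` labels AlaTrp as underrepresented.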

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
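The unit conversion can be written out as a short Python sketch; the helper names are hypothetical and chosen only for this example.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0
```

With these helpers, the uniform expectation of 2.5‰ converts to 0.25%, and the AlaAla value of 6.667‰ corresponds to a fraction of about 0.006667.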

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
6.667AlaAla: 6.667 ± 1.285
1.667AlaCys: 1.667 ± 0.768
5.0AlaAsp: 5.0 ± 1.727
4.167AlaGlu: 4.167 ± 0.976
2.5AlaPhe: 2.5 ± 1.294
4.583AlaGly: 4.583 ± 1.782
0.833AlaHis: 0.833 ± 0.477
3.333AlaIle: 3.333 ± 1.387
2.917AlaLys: 2.917 ± 1.329
3.75AlaLeu: 3.75 ± 1.646
2.083AlaMet: 2.083 ± 1.033
2.083AlaAsn: 2.083 ± 0.575
4.583AlaPro: 4.583 ± 1.002
2.083AlaGln: 2.083 ± 0.659
3.333AlaArg: 3.333 ± 1.627
3.333AlaSer: 3.333 ± 1.348
5.0AlaThr: 5.0 ± 1.657
3.333AlaVal: 3.333 ± 1.708
0.417AlaTrp: 0.417 ± 0.4
2.5AlaTyr: 2.5 ± 0.621
0.0AlaXaa: 0.0 ± 0.0
Cys
1.667CysAla: 1.667 ± 0.653
0.417CysCys: 0.417 ± 0.575
0.833CysAsp: 0.833 ± 0.594
0.833CysGlu: 0.833 ± 0.582
1.667CysPhe: 1.667 ± 0.674
0.833CysGly: 0.833 ± 0.713
0.417CysHis: 0.417 ± 0.575
0.833CysIle: 0.833 ± 0.582
2.5CysLys: 2.5 ± 1.243
2.5CysLeu: 2.5 ± 1.909
0.833CysMet: 0.833 ± 1.077
2.083CysAsn: 2.083 ± 1.077
2.5CysPro: 2.5 ± 0.84
1.667CysGln: 1.667 ± 0.902
0.833CysArg: 0.833 ± 0.605
2.083CysSer: 2.083 ± 0.866
1.667CysThr: 1.667 ± 0.675
1.667CysVal: 1.667 ± 0.956
0.833CysTrp: 0.833 ± 0.594
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
5.0AspAla: 5.0 ± 1.203
1.667AspCys: 1.667 ± 0.566
3.75AspAsp: 3.75 ± 1.511
3.75AspGlu: 3.75 ± 1.632
2.083AspPhe: 2.083 ± 0.802
2.917AspGly: 2.917 ± 1.435
0.833AspHis: 0.833 ± 0.577
3.75AspIle: 3.75 ± 1.431
1.25AspLys: 1.25 ± 0.548
4.167AspLeu: 4.167 ± 0.993
1.667AspMet: 1.667 ± 0.541
1.25AspAsn: 1.25 ± 0.607
4.167AspPro: 4.167 ± 1.515
1.667AspGln: 1.667 ± 0.786
1.25AspArg: 1.25 ± 0.726
5.417AspSer: 5.417 ± 1.706
7.5AspThr: 7.5 ± 1.301
4.583AspVal: 4.583 ± 1.354
1.25AspTrp: 1.25 ± 0.585
2.083AspTyr: 2.083 ± 0.761
0.0AspXaa: 0.0 ± 0.0
Glu
2.5GluAla: 2.5 ± 0.855
0.417GluCys: 0.417 ± 0.291
5.833GluAsp: 5.833 ± 2.45
4.167GluGlu: 4.167 ± 1.265
0.833GluPhe: 0.833 ± 0.403
1.667GluGly: 1.667 ± 0.543
1.667GluHis: 1.667 ± 0.938
1.25GluIle: 1.25 ± 0.649
1.25GluLys: 1.25 ± 1.132
5.833GluLeu: 5.833 ± 1.284
1.25GluMet: 1.25 ± 0.567
2.5GluAsn: 2.5 ± 0.928
2.083GluPro: 2.083 ± 0.737
4.167GluGln: 4.167 ± 1.724
1.667GluArg: 1.667 ± 1.015
3.333GluSer: 3.333 ± 1.729
5.0GluThr: 5.0 ± 1.233
3.75GluVal: 3.75 ± 0.976
0.833GluTrp: 0.833 ± 0.582
2.083GluTyr: 2.083 ± 1.248
0.0GluXaa: 0.0 ± 0.0
Phe
1.667PheAla: 1.667 ± 0.566
1.25PheCys: 1.25 ± 0.955
2.083PheAsp: 2.083 ± 0.379
1.667PheGlu: 1.667 ± 0.584
2.917PhePhe: 2.917 ± 0.935
3.333PheGly: 3.333 ± 0.906
0.833PheHis: 0.833 ± 0.605
1.25PheIle: 1.25 ± 0.58
4.167PheLys: 4.167 ± 0.993
2.917PheLeu: 2.917 ± 0.87
0.833PheMet: 0.833 ± 0.388
1.667PheAsn: 1.667 ± 1.562
2.083PhePro: 2.083 ± 0.555
1.25PheGln: 1.25 ± 0.545
2.5PheArg: 2.5 ± 0.975
2.083PheSer: 2.083 ± 0.859
0.417PheThr: 0.417 ± 0.32
2.083PheVal: 2.083 ± 0.723
1.25PheTrp: 1.25 ± 0.585
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
1.667GlyAla: 1.667 ± 0.584
1.25GlyCys: 1.25 ± 0.385
4.167GlyAsp: 4.167 ± 1.523
2.917GlyGlu: 2.917 ± 0.718
2.083GlyPhe: 2.083 ± 0.778
6.25GlyGly: 6.25 ± 1.875
3.333GlyHis: 3.333 ± 1.569
1.667GlyIle: 1.667 ± 0.316
4.167GlyLys: 4.167 ± 1.437
2.917GlyLeu: 2.917 ± 1.008
2.083GlyMet: 2.083 ± 0.691
2.083GlyAsn: 2.083 ± 0.962
2.5GlyPro: 2.5 ± 0.546
2.5GlyGln: 2.5 ± 0.557
3.333GlyArg: 3.333 ± 1.334
5.417GlySer: 5.417 ± 0.745
7.917GlyThr: 7.917 ± 2.113
3.333GlyVal: 3.333 ± 1.348
0.417GlyTrp: 0.417 ± 0.291
2.083GlyTyr: 2.083 ± 0.905
0.0GlyXaa: 0.0 ± 0.0
His
2.5HisAla: 2.5 ± 0.819
1.667HisCys: 1.667 ± 1.141
0.417HisAsp: 0.417 ± 0.32
0.417HisGlu: 0.417 ± 0.547
1.667HisPhe: 1.667 ± 0.541
2.5HisGly: 2.5 ± 0.649
0.417HisHis: 0.417 ± 0.32
1.667HisIle: 1.667 ± 0.885
1.25HisLys: 1.25 ± 0.583
1.667HisLeu: 1.667 ± 1.305
0.417HisMet: 0.417 ± 0.291
0.833HisAsn: 0.833 ± 0.42
2.083HisPro: 2.083 ± 1.028
1.25HisGln: 1.25 ± 0.475
1.667HisArg: 1.667 ± 0.568
2.5HisSer: 2.5 ± 0.887
2.5HisThr: 2.5 ± 0.986
2.083HisVal: 2.083 ± 0.763
0.417HisTrp: 0.417 ± 0.4
0.833HisTyr: 0.833 ± 0.382
0.0HisXaa: 0.0 ± 0.0
Ile
2.5IleAla: 2.5 ± 1.21
1.25IleCys: 1.25 ± 0.385
3.333IleAsp: 3.333 ± 1.675
2.5IleGlu: 2.5 ± 0.84
0.833IlePhe: 0.833 ± 0.781
2.5IleGly: 2.5 ± 0.997
1.25IleHis: 1.25 ± 0.636
2.5IleIle: 2.5 ± 0.84
1.667IleLys: 1.667 ± 0.764
2.5IleLeu: 2.5 ± 0.949
0.0IleMet: 0.0 ± 0.0
0.417IleAsn: 0.417 ± 0.291
3.333IlePro: 3.333 ± 1.843
2.083IleGln: 2.083 ± 0.573
1.667IleArg: 1.667 ± 0.619
2.917IleSer: 2.917 ± 0.938
5.0IleThr: 5.0 ± 1.392
5.833IleVal: 5.833 ± 1.39
0.0IleTrp: 0.0 ± 0.0
0.833IleTyr: 0.833 ± 0.487
0.0IleXaa: 0.0 ± 0.0
Lys
3.333LysAla: 3.333 ± 0.744
2.917LysCys: 2.917 ± 1.349
1.667LysAsp: 1.667 ± 0.721
1.667LysGlu: 1.667 ± 0.674
1.667LysPhe: 1.667 ± 1.111
2.083LysGly: 2.083 ± 1.279
2.083LysHis: 2.083 ± 0.899
0.833LysIle: 0.833 ± 0.487
3.75LysLys: 3.75 ± 1.233
1.667LysLeu: 1.667 ± 0.951
0.833LysMet: 0.833 ± 0.56
1.25LysAsn: 1.25 ± 0.548
2.083LysPro: 2.083 ± 0.795
1.667LysGln: 1.667 ± 0.654
5.0LysArg: 5.0 ± 0.898
3.75LysSer: 3.75 ± 1.027
3.75LysThr: 3.75 ± 2.489
3.75LysVal: 3.75 ± 0.607
1.25LysTrp: 1.25 ± 0.607
2.5LysTyr: 2.5 ± 0.798
0.0LysXaa: 0.0 ± 0.0
Leu
5.833LeuAla: 5.833 ± 1.696
3.75LeuCys: 3.75 ± 1.432
5.417LeuAsp: 5.417 ± 1.252
4.167LeuGlu: 4.167 ± 1.004
3.333LeuPhe: 3.333 ± 0.906
5.417LeuGly: 5.417 ± 1.734
5.417LeuHis: 5.417 ± 1.887
2.083LeuIle: 2.083 ± 1.255
4.583LeuLys: 4.583 ± 1.045
10.0LeuLeu: 10.0 ± 4.327
2.083LeuMet: 2.083 ± 0.658
2.917LeuAsn: 2.917 ± 1.167
2.917LeuPro: 2.917 ± 1.018
7.5LeuGln: 7.5 ± 1.746
2.083LeuArg: 2.083 ± 0.677
4.167LeuSer: 4.167 ± 0.742
6.25LeuThr: 6.25 ± 1.002
3.75LeuVal: 3.75 ± 0.917
2.5LeuTrp: 2.5 ± 1.292
5.417LeuTyr: 5.417 ± 1.294
0.0LeuXaa: 0.0 ± 0.0
Met
1.667MetAla: 1.667 ± 0.716
0.833MetCys: 0.833 ± 0.382
2.5MetAsp: 2.5 ± 1.048
0.833MetGlu: 0.833 ± 0.713
0.417MetPhe: 0.417 ± 0.39
2.083MetGly: 2.083 ± 0.783
1.25MetHis: 1.25 ± 0.61
0.417MetIle: 0.417 ± 0.291
0.0MetLys: 0.0 ± 0.0
2.917MetLeu: 2.917 ± 1.178
0.417MetMet: 0.417 ± 0.375
0.0MetAsn: 0.0 ± 0.0
0.0MetPro: 0.0 ± 0.0
1.25MetGln: 1.25 ± 0.385
0.833MetArg: 0.833 ± 0.582
1.25MetSer: 1.25 ± 0.737
0.417MetThr: 0.417 ± 0.39
1.667MetVal: 1.667 ± 0.541
0.417MetTrp: 0.417 ± 0.39
0.417MetTyr: 0.417 ± 0.32
0.0MetXaa: 0.0 ± 0.0
Asn
3.333AsnAla: 3.333 ± 0.956
2.083AsnCys: 2.083 ± 1.238
0.417AsnAsp: 0.417 ± 0.291
1.25AsnGlu: 1.25 ± 0.632
1.25AsnPhe: 1.25 ± 0.847
2.917AsnGly: 2.917 ± 0.817
0.833AsnHis: 0.833 ± 0.791
1.25AsnIle: 1.25 ± 0.737
2.5AsnLys: 2.5 ± 1.925
2.5AsnLeu: 2.5 ± 1.024
0.0AsnMet: 0.0 ± 0.0
1.667AsnAsn: 1.667 ± 0.675
2.083AsnPro: 2.083 ± 0.976
2.083AsnGln: 2.083 ± 0.795
0.833AsnArg: 0.833 ± 0.403
3.75AsnSer: 3.75 ± 1.322
1.25AsnThr: 1.25 ± 0.706
1.667AsnVal: 1.667 ± 0.316
0.417AsnTrp: 0.417 ± 0.291
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
5.0ProAla: 5.0 ± 2.273
0.417ProCys: 0.417 ± 0.39
3.75ProAsp: 3.75 ± 1.023
2.917ProGlu: 2.917 ± 1.171
0.833ProPhe: 0.833 ± 0.582
1.667ProGly: 1.667 ± 0.541
1.25ProHis: 1.25 ± 0.764
3.75ProIle: 3.75 ± 0.883
3.75ProLys: 3.75 ± 0.848
6.667ProLeu: 6.667 ± 1.773
0.0ProMet: 0.0 ± 0.0
1.25ProAsn: 1.25 ± 0.545
7.5ProPro: 7.5 ± 1.43
0.833ProGln: 0.833 ± 0.512
4.167ProArg: 4.167 ± 0.931
5.417ProSer: 5.417 ± 1.894
5.833ProThr: 5.833 ± 1.683
4.583ProVal: 4.583 ± 1.782
0.417ProTrp: 0.417 ± 0.4
2.917ProTyr: 2.917 ± 1.788
0.0ProXaa: 0.0 ± 0.0
Gln
3.333GlnAla: 3.333 ± 0.923
0.833GlnCys: 0.833 ± 0.87
2.5GlnAsp: 2.5 ± 1.024
0.833GlnGlu: 0.833 ± 0.801
2.917GlnPhe: 2.917 ± 0.714
1.667GlnGly: 1.667 ± 0.478
0.833GlnHis: 0.833 ± 0.382
2.083GlnIle: 2.083 ± 0.616
0.833GlnLys: 0.833 ± 0.403
5.833GlnLeu: 5.833 ± 1.842
1.25GlnMet: 1.25 ± 0.801
0.833GlnAsn: 0.833 ± 0.801
2.5GlnPro: 2.5 ± 1.078
2.5GlnGln: 2.5 ± 1.227
4.583GlnArg: 4.583 ± 0.588
2.5GlnSer: 2.5 ± 0.792
3.75GlnThr: 3.75 ± 0.854
3.75GlnVal: 3.75 ± 1.001
0.833GlnTrp: 0.833 ± 0.582
2.083GlnTyr: 2.083 ± 0.555
0.0GlnXaa: 0.0 ± 0.0
Arg
2.917ArgAla: 2.917 ± 0.938
1.667ArgCys: 1.667 ± 1.913
2.083ArgAsp: 2.083 ± 0.555
2.917ArgGlu: 2.917 ± 1.353
2.5ArgPhe: 2.5 ± 0.928
2.083ArgGly: 2.083 ± 0.859
1.25ArgHis: 1.25 ± 0.833
1.667ArgIle: 1.667 ± 1.008
3.75ArgLys: 3.75 ± 0.942
6.667ArgLeu: 6.667 ± 0.849
0.833ArgMet: 0.833 ± 0.398
2.083ArgAsn: 2.083 ± 1.061
4.167ArgPro: 4.167 ± 1.708
2.083ArgGln: 2.083 ± 1.058
4.583ArgArg: 4.583 ± 1.494
1.667ArgSer: 1.667 ± 0.849
3.75ArgThr: 3.75 ± 1.098
3.333ArgVal: 3.333 ± 1.088
0.833ArgTrp: 0.833 ± 0.801
2.083ArgTyr: 2.083 ± 0.906
0.0ArgXaa: 0.0 ± 0.0
Ser
3.75SerAla: 3.75 ± 1.079
1.25SerCys: 1.25 ± 0.873
4.167SerAsp: 4.167 ± 0.92
3.333SerGlu: 3.333 ± 1.213
1.25SerPhe: 1.25 ± 0.733
6.25SerGly: 6.25 ± 1.942
2.083SerHis: 2.083 ± 0.736
4.583SerIle: 4.583 ± 1.689
2.917SerLys: 2.917 ± 0.627
7.917SerLeu: 7.917 ± 1.445
1.25SerMet: 1.25 ± 0.607
3.333SerAsn: 3.333 ± 0.863
5.0SerPro: 5.0 ± 1.287
2.083SerGln: 2.083 ± 0.718
4.583SerArg: 4.583 ± 1.093
7.5SerSer: 7.5 ± 2.05
5.833SerThr: 5.833 ± 1.689
2.917SerVal: 2.917 ± 0.83
0.417SerTrp: 0.417 ± 0.575
1.25SerTyr: 1.25 ± 0.656
0.0SerXaa: 0.0 ± 0.0
Thr
3.333ThrAla: 3.333 ± 1.939
2.5ThrCys: 2.5 ± 0.964
6.25ThrAsp: 6.25 ± 0.901
4.583ThrGlu: 4.583 ± 0.811
1.25ThrPhe: 1.25 ± 0.706
5.0ThrGly: 5.0 ± 1.427
0.833ThrHis: 0.833 ± 0.388
4.167ThrIle: 4.167 ± 0.624
1.25ThrLys: 1.25 ± 0.646
10.0ThrLeu: 10.0 ± 2.948
1.25ThrMet: 1.25 ± 0.634
2.083ThrAsn: 2.083 ± 0.757
7.917ThrPro: 7.917 ± 2.431
4.583ThrGln: 4.583 ± 1.247
2.5ThrArg: 2.5 ± 0.971
6.25ThrSer: 6.25 ± 1.842
7.5ThrThr: 7.5 ± 2.638
7.5ThrVal: 7.5 ± 1.835
1.25ThrTrp: 1.25 ± 0.841
2.5ThrTyr: 2.5 ± 1.199
0.0ThrXaa: 0.0 ± 0.0
Val
3.75ValAla: 3.75 ± 1.434
0.833ValCys: 0.833 ± 1.094
4.583ValAsp: 4.583 ± 1.274
7.5ValGlu: 7.5 ± 1.246
4.583ValPhe: 4.583 ± 1.342
4.167ValGly: 4.167 ± 0.936
2.917ValHis: 2.917 ± 1.411
2.917ValIle: 2.917 ± 0.627
1.667ValLys: 1.667 ± 0.91
4.167ValLeu: 4.167 ± 2.037
0.833ValMet: 0.833 ± 0.408
2.083ValAsn: 2.083 ± 0.803
4.583ValPro: 4.583 ± 1.354
3.333ValGln: 3.333 ± 0.844
4.583ValArg: 4.583 ± 1.206
5.833ValSer: 5.833 ± 1.497
3.75ValThr: 3.75 ± 1.239
7.083ValVal: 7.083 ± 1.756
0.833ValTrp: 0.833 ± 0.61
2.083ValTyr: 2.083 ± 0.379
0.0ValXaa: 0.0 ± 0.0
Trp
1.25TrpAla: 1.25 ± 0.528
0.0TrpCys: 0.0 ± 0.0
0.417TrpAsp: 0.417 ± 0.39
0.0TrpGlu: 0.0 ± 0.0
0.833TrpPhe: 0.833 ± 0.582
1.667TrpGly: 1.667 ± 0.541
0.0TrpHis: 0.0 ± 0.0
1.25TrpIle: 1.25 ± 0.585
1.25TrpLys: 1.25 ± 0.656
1.25TrpLeu: 1.25 ± 0.548
0.833TrpMet: 0.833 ± 1.067
0.833TrpAsn: 0.833 ± 0.403
0.0TrpPro: 0.0 ± 0.0
0.417TrpGln: 0.417 ± 0.39
0.833TrpArg: 0.833 ± 0.713
0.0TrpSer: 0.0 ± 0.0
3.333TrpThr: 3.333 ± 2.072
0.833TrpVal: 0.833 ± 0.513
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.5TyrAla: 2.5 ± 1.167
0.417TyrCys: 0.417 ± 0.575
0.833TyrAsp: 0.833 ± 0.388
1.667TyrGlu: 1.667 ± 0.91
0.833TyrPhe: 0.833 ± 0.781
2.5TyrGly: 2.5 ± 0.798
0.417TyrHis: 0.417 ± 0.39
1.667TyrIle: 1.667 ± 0.675
2.083TyrLys: 2.083 ± 1.033
3.75TyrLeu: 3.75 ± 1.079
0.417TyrMet: 0.417 ± 0.291
0.833TyrAsn: 0.833 ± 0.487
0.833TyrPro: 0.833 ± 0.547
1.25TyrGln: 1.25 ± 0.607
2.083TyrArg: 2.083 ± 0.718
2.5TyrSer: 2.5 ± 0.549
2.083TyrThr: 2.083 ± 1.31
4.583TyrVal: 4.583 ± 0.922
0.417TyrTrp: 0.417 ± 0.39
3.333TyrTyr: 3.333 ± 0.632
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (2401 amino acids)
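In the listing above, each data row carries the cell value immediately followed by the dipeptide name and its value ± error (for example `6.667AlaAla: 6.667 ± 1.285`). The Python sketch below shows one way to read such rows into structured records; the regular expression and the function name are assumptions based only on the row layout visible here, not on the format of the downloadable CSV file.

```python
import re

# Layout assumed from the rows above: <value><First><Second>: <value> ± <error>
ROW = re.compile(
    r"^(?P<cell>\d+(?:\.\d+)?)"
    r"(?P<first>[A-Z][a-z]{2})(?P<second>[A-Z][a-z]{2}): "
    r"(?P<mean>\d+(?:\.\d+)?) ± (?P<err>\d+(?:\.\d+)?)$"
)


def parse_row(line):
    """Return (first, second, mean, error) or None for non-data lines."""
    match = ROW.match(line.strip())
    if match is None:
        # Row labels such as 'Ala' and the header line do not match.
        return None
    return (
        match["first"],
        match["second"],
        float(match["mean"]),
        float(match["err"]),
    )
```

Lines that hold only a row label, such as `Ala`, return `None` and can simply be skipped.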

Note: The error was estimated by bootstrapping (×100) at the protein level.
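A protein-level bootstrap of this kind might look like the Python sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting estimates is taken as the error. The function name, the fixed seed, and the use of the population standard deviation are assumptions for illustration; the source states only that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics
from collections import Counter


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Estimate the spread (per mille) of one dipeptide's frequency."""
    rng = random.Random(seed)  # fixed seed only to keep the example reproducible
    # Count overlapping pairs once per protein; resampling reuses these counts.
    per_protein = [
        Counter(seq[i:i + 2] for i in range(len(seq) - 1)) for seq in proteins
    ]
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the reported error is a standard deviation of the estimates.
    return statistics.pstdev(estimates)
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins in a small proteome such as this one.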

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski