Amino acid dipeptide frequency for Human papillomavirus 22

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.
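As a minimal sketch of how such frequencies could be computed, the snippet below counts overlapping residue pairs in a list of one-letter protein sequences and reports them in per mille. The function name `dipeptide_per_mille`, the toy sequences, and the choice to count pairs only within each protein are assumptions made for illustration; the exact counting convention used for this table is not stated here.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to 'X' (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Count overlapping pairs; in this sketch pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from this proteome).
freqs = dipeptide_per_mille(["MAAK", "AAC"])
```

Here the toy input contains five overlapping pairs in total, two of which are `AA`, so `freqs["AA"]` is two fifths of 1000‰.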

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each would be present at a frequency of 0.25% (1/400, or 2.5‰). This is not the case in nature, so for better visibility, dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
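The arithmetic behind the expected value can be sketched as follows. The helper names `expected_per_mille` and `classify` are hypothetical, and whether the page's colouring uses the 20-letter or the 21-letter alphabet as its baseline is not stated, so the alphabet size is left as a parameter.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(freq_per_mille, alphabet_size=20):
    """Label a frequency as 'over', 'under', or 'expected' relative to the uniform baseline."""
    expected = expected_per_mille(alphabet_size)
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# Values taken from the table: LeuLeu = 9.862, TrpTrp = 0.0 (both in per mille).
leu_leu_label = classify(9.862)
trp_trp_label = classify(0.0)
```

With 20 amino acids the baseline is 2.5‰, and with Xaa included it drops to about 2.27‰ (1000/441). This simple comparison ignores the ± error reported for each value.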

All values are given in per mille (‰) and must therefore be multiplied by 10^-3 to obtain fractions.
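For illustration, the unit conversion can be written as two small helpers; the function names are hypothetical and the example value is the LeuLeu entry from the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_per_mille / 10.0


leu_leu_fraction = per_mille_to_fraction(9.862)  # LeuLeu from the table
leu_leu_percent = per_mille_to_percent(9.862)
```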

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.945 ± 1.283
AlaCys: 1.183 ± 0.512
AlaAsp: 3.156 ± 0.899
AlaGlu: 3.55 ± 1.001
AlaPhe: 2.367 ± 1.13
AlaGly: 1.183 ± 0.432
AlaHis: 0.394 ± 0.335
AlaIle: 2.761 ± 0.605
AlaLys: 3.55 ± 1.124
AlaLeu: 3.55 ± 0.873
AlaMet: 0.789 ± 0.448
AlaAsn: 1.972 ± 0.76
AlaPro: 3.55 ± 0.948
AlaGln: 3.156 ± 0.806
AlaArg: 4.734 ± 1.427
AlaSer: 5.523 ± 0.357
AlaThr: 3.55 ± 1.16
AlaVal: 1.972 ± 0.607
AlaTrp: 0.394 ± 0.335
AlaTyr: 1.972 ± 1.0
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.183 ± 0.512
CysCys: 1.578 ± 1.011
CysAsp: 0.789 ± 0.875
CysGlu: 0.789 ± 0.361
CysPhe: 1.183 ± 0.354
CysGly: 0.789 ± 0.528
CysHis: 0.394 ± 0.437
CysIle: 0.789 ± 0.361
CysLys: 1.972 ± 0.644
CysLeu: 1.972 ± 0.592
CysMet: 0.394 ± 0.506
CysAsn: 0.394 ± 0.335
CysPro: 1.578 ± 0.766
CysGln: 1.183 ± 0.728
CysArg: 1.578 ± 1.061
CysSer: 2.367 ± 1.517
CysThr: 1.183 ± 1.006
CysVal: 0.0 ± 0.0
CysTrp: 0.394 ± 0.335
CysTyr: 1.183 ± 0.63
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.101 ± 1.664
AspCys: 0.394 ± 0.318
AspAsp: 4.339 ± 1.565
AspGlu: 2.761 ± 0.55
AspPhe: 1.972 ± 0.771
AspGly: 3.156 ± 0.97
AspHis: 1.183 ± 0.572
AspIle: 4.734 ± 1.328
AspLys: 2.761 ± 0.955
AspLeu: 5.523 ± 1.031
AspMet: 2.367 ± 1.053
AspAsn: 2.761 ± 1.124
AspPro: 3.945 ± 1.532
AspGln: 2.367 ± 1.235
AspArg: 2.761 ± 1.016
AspSer: 4.339 ± 1.201
AspThr: 5.128 ± 0.72
AspVal: 2.761 ± 1.041
AspTrp: 0.0 ± 0.0
AspTyr: 1.972 ± 0.694
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.578 ± 0.708
GluCys: 1.578 ± 1.341
GluAsp: 3.156 ± 0.757
GluGlu: 7.89 ± 2.976
GluPhe: 1.972 ± 0.555
GluGly: 4.734 ± 1.305
GluHis: 0.789 ± 0.412
GluIle: 2.367 ± 0.903
GluLys: 3.945 ± 1.302
GluLeu: 5.917 ± 2.006
GluMet: 1.183 ± 0.702
GluAsn: 2.761 ± 0.946
GluPro: 3.156 ± 1.263
GluGln: 3.55 ± 0.863
GluArg: 4.339 ± 1.35
GluSer: 6.706 ± 2.444
GluThr: 4.339 ± 1.701
GluVal: 5.523 ± 0.837
GluTrp: 0.789 ± 0.387
GluTyr: 1.972 ± 1.045
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.183 ± 0.512
PheCys: 0.789 ± 0.58
PheAsp: 5.128 ± 0.718
PheGlu: 3.55 ± 1.518
PhePhe: 1.578 ± 0.539
PheGly: 1.972 ± 0.607
PheHis: 0.789 ± 0.556
PheIle: 1.972 ± 0.492
PheLys: 1.972 ± 1.016
PheLeu: 3.55 ± 1.295
PheMet: 1.183 ± 0.512
PheAsn: 1.578 ± 1.01
PhePro: 3.156 ± 1.357
PheGln: 2.367 ± 0.538
PheArg: 2.761 ± 0.489
PheSer: 2.367 ± 0.702
PheThr: 0.789 ± 0.411
PheVal: 2.367 ± 0.953
PheTrp: 1.578 ± 0.722
PheTyr: 0.394 ± 0.318
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.55 ± 1.275
GlyCys: 2.367 ± 0.907
GlyAsp: 3.945 ± 1.166
GlyGlu: 5.128 ± 2.467
GlyPhe: 0.789 ± 0.754
GlyGly: 6.706 ± 2.41
GlyHis: 3.945 ± 1.015
GlyIle: 1.972 ± 0.772
GlyLys: 2.761 ± 0.879
GlyLeu: 3.945 ± 1.146
GlyMet: 0.0 ± 0.0
GlyAsn: 1.972 ± 0.607
GlyPro: 5.128 ± 2.147
GlyGln: 2.761 ± 0.605
GlyArg: 5.917 ± 2.02
GlySer: 2.761 ± 0.489
GlyThr: 3.55 ± 0.685
GlyVal: 5.523 ± 1.203
GlyTrp: 0.394 ± 0.335
GlyTyr: 0.789 ± 0.411
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.183 ± 0.432
HisCys: 1.578 ± 0.827
HisAsp: 1.578 ± 1.01
HisGlu: 0.789 ± 0.688
HisPhe: 1.183 ± 0.565
HisGly: 0.789 ± 0.448
HisHis: 1.183 ± 0.789
HisIle: 0.394 ± 0.335
HisLys: 1.578 ± 0.643
HisLeu: 0.394 ± 0.506
HisMet: 0.0 ± 0.0
HisAsn: 1.972 ± 0.666
HisPro: 2.367 ± 1.165
HisGln: 0.394 ± 0.335
HisArg: 0.789 ± 0.387
HisSer: 1.578 ± 0.632
HisThr: 0.789 ± 0.635
HisVal: 1.578 ± 0.64
HisTrp: 1.972 ± 0.826
HisTyr: 0.394 ± 0.335
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.972 ± 0.76
IleCys: 1.183 ± 0.512
IleAsp: 2.367 ± 1.045
IleGlu: 4.734 ± 1.714
IlePhe: 0.394 ± 0.323
IleGly: 3.945 ± 1.154
IleHis: 1.183 ± 0.586
IleIle: 2.761 ± 1.107
IleLys: 0.789 ± 0.578
IleLeu: 5.917 ± 1.318
IleMet: 0.789 ± 0.615
IleAsn: 1.972 ± 0.656
IlePro: 2.761 ± 1.405
IleGln: 1.972 ± 0.76
IleArg: 2.367 ± 0.82
IleSer: 4.734 ± 1.032
IleThr: 1.183 ± 0.412
IleVal: 3.156 ± 0.94
IleTrp: 0.394 ± 0.454
IleTyr: 3.156 ± 0.828
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.55 ± 0.615
LysCys: 1.183 ± 0.679
LysAsp: 2.367 ± 0.984
LysGlu: 1.183 ± 0.586
LysPhe: 3.55 ± 1.661
LysGly: 3.55 ± 0.791
LysHis: 1.578 ± 0.733
LysIle: 3.156 ± 0.912
LysLys: 3.55 ± 0.948
LysLeu: 5.917 ± 1.885
LysMet: 0.0 ± 0.0
LysAsn: 1.578 ± 0.642
LysPro: 1.183 ± 1.518
LysGln: 1.578 ± 0.701
LysArg: 4.734 ± 0.802
LysSer: 3.55 ± 1.329
LysThr: 2.761 ± 1.042
LysVal: 1.578 ± 0.592
LysTrp: 0.394 ± 0.506
LysTyr: 2.761 ± 0.83
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.945 ± 1.524
LeuCys: 2.761 ± 1.122
LeuAsp: 6.706 ± 1.511
LeuGlu: 7.495 ± 1.182
LeuPhe: 3.55 ± 0.969
LeuGly: 5.523 ± 1.635
LeuHis: 1.578 ± 0.435
LeuIle: 3.945 ± 1.451
LeuLys: 5.128 ± 1.404
LeuLeu: 9.862 ± 2.125
LeuMet: 1.183 ± 0.575
LeuAsn: 1.578 ± 1.144
LeuPro: 5.128 ± 1.732
LeuGln: 5.523 ± 1.289
LeuArg: 5.128 ± 1.377
LeuSer: 5.523 ± 1.167
LeuThr: 6.312 ± 1.297
LeuVal: 4.339 ± 1.335
LeuTrp: 1.578 ± 0.722
LeuTyr: 1.972 ± 0.662
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.789 ± 0.528
MetCys: 0.0 ± 0.0
MetAsp: 0.789 ± 0.387
MetGlu: 0.394 ± 0.318
MetPhe: 1.578 ± 1.01
MetGly: 0.789 ± 0.556
MetHis: 0.0 ± 0.0
MetIle: 1.578 ± 1.063
MetLys: 1.578 ± 0.96
MetLeu: 1.578 ± 1.144
MetMet: 0.0 ± 0.0
MetAsn: 1.183 ± 0.586
MetPro: 0.0 ± 0.0
MetGln: 0.394 ± 0.454
MetArg: 1.578 ± 0.642
MetSer: 2.367 ± 0.812
MetThr: 1.183 ± 0.658
MetVal: 1.183 ± 0.789
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.183 ± 0.325
AsnCys: 0.394 ± 0.335
AsnAsp: 1.578 ± 0.708
AsnGlu: 1.972 ± 0.814
AsnPhe: 1.578 ± 0.722
AsnGly: 3.156 ± 0.836
AsnHis: 0.789 ± 0.361
AsnIle: 2.367 ± 0.869
AsnLys: 2.761 ± 0.747
AsnLeu: 0.789 ± 0.645
AsnMet: 0.394 ± 0.335
AsnAsn: 1.578 ± 1.01
AsnPro: 2.761 ± 1.096
AsnGln: 1.183 ± 0.867
AsnArg: 1.578 ± 0.817
AsnSer: 3.156 ± 1.354
AsnThr: 2.367 ± 1.052
AsnVal: 2.761 ± 0.65
AsnTrp: 0.394 ± 0.377
AsnTyr: 0.789 ± 0.618
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.945 ± 0.753
ProCys: 1.183 ± 0.512
ProAsp: 4.734 ± 1.975
ProGlu: 4.339 ± 1.524
ProPhe: 2.367 ± 0.697
ProGly: 2.367 ± 1.209
ProHis: 0.789 ± 1.012
ProIle: 2.761 ± 1.581
ProLys: 2.761 ± 0.65
ProLeu: 7.101 ± 1.106
ProMet: 0.394 ± 0.335
ProAsn: 1.578 ± 0.708
ProPro: 7.495 ± 2.518
ProGln: 1.972 ± 1.448
ProArg: 1.972 ± 0.651
ProSer: 4.734 ± 2.949
ProThr: 7.101 ± 2.171
ProVal: 5.523 ± 1.588
ProTrp: 0.394 ± 0.318
ProTyr: 1.183 ± 0.516
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.789 ± 0.528
GlnCys: 0.789 ± 0.361
GlnAsp: 1.972 ± 0.603
GlnGlu: 3.156 ± 0.856
GlnPhe: 3.55 ± 0.869
GlnGly: 3.156 ± 1.533
GlnHis: 1.578 ± 0.695
GlnIle: 1.972 ± 0.406
GlnLys: 1.183 ± 0.698
GlnLeu: 5.523 ± 1.584
GlnMet: 2.367 ± 0.903
GlnAsn: 1.578 ± 0.549
GlnPro: 1.972 ± 0.896
GlnGln: 4.339 ± 1.588
GlnArg: 3.156 ± 1.098
GlnSer: 2.761 ± 0.818
GlnThr: 2.367 ± 0.875
GlnVal: 3.945 ± 0.641
GlnTrp: 0.394 ± 0.335
GlnTyr: 3.55 ± 0.873
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.339 ± 1.173
ArgCys: 1.578 ± 0.966
ArgAsp: 3.156 ± 1.22
ArgGlu: 5.523 ± 1.346
ArgPhe: 1.972 ± 1.096
ArgGly: 3.156 ± 1.134
ArgHis: 2.367 ± 0.832
ArgIle: 1.972 ± 0.473
ArgLys: 3.945 ± 1.018
ArgLeu: 5.523 ± 1.071
ArgMet: 0.789 ± 0.593
ArgAsn: 1.972 ± 0.864
ArgPro: 2.367 ± 1.204
ArgGln: 3.55 ± 1.454
ArgArg: 7.495 ± 2.106
ArgSer: 7.89 ± 3.013
ArgThr: 3.55 ± 0.965
ArgVal: 3.55 ± 1.891
ArgTrp: 0.0 ± 0.0
ArgTyr: 3.156 ± 0.727
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.945 ± 0.687
SerCys: 0.394 ± 0.377
SerAsp: 5.917 ± 1.543
SerGlu: 2.761 ± 0.724
SerPhe: 3.55 ± 0.685
SerGly: 7.495 ± 1.05
SerHis: 0.394 ± 0.335
SerIle: 2.761 ± 0.715
SerLys: 2.761 ± 1.092
SerLeu: 7.89 ± 1.084
SerMet: 2.761 ± 1.903
SerAsn: 2.367 ± 1.177
SerPro: 6.312 ± 2.392
SerGln: 3.156 ± 0.819
SerArg: 7.101 ± 2.32
SerSer: 3.55 ± 0.874
SerThr: 4.734 ± 2.26
SerVal: 5.128 ± 1.288
SerTrp: 1.183 ± 0.623
SerTyr: 1.578 ± 0.911
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.972 ± 0.603
ThrCys: 1.183 ± 0.533
ThrAsp: 5.917 ± 0.796
ThrGlu: 3.55 ± 0.758
ThrPhe: 2.761 ± 0.65
ThrGly: 3.156 ± 0.733
ThrHis: 0.789 ± 0.411
ThrIle: 3.55 ± 1.387
ThrLys: 1.578 ± 0.688
ThrLeu: 4.734 ± 1.592
ThrMet: 0.789 ± 0.361
ThrAsn: 0.789 ± 0.406
ThrPro: 7.101 ± 2.202
ThrGln: 3.55 ± 0.55
ThrArg: 4.339 ± 1.231
ThrSer: 5.128 ± 1.589
ThrThr: 3.156 ± 1.134
ThrVal: 5.917 ± 1.694
ThrTrp: 0.394 ± 0.318
ThrTyr: 1.972 ± 0.712
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.761 ± 1.112
ValCys: 0.789 ± 0.361
ValAsp: 3.156 ± 0.554
ValGlu: 6.312 ± 1.376
ValPhe: 3.156 ± 1.391
ValGly: 5.917 ± 0.908
ValHis: 1.972 ± 0.656
ValIle: 2.367 ± 0.942
ValLys: 2.367 ± 0.721
ValLeu: 3.156 ± 1.324
ValMet: 0.789 ± 0.387
ValAsn: 2.367 ± 0.469
ValPro: 3.156 ± 0.857
ValGln: 3.945 ± 0.859
ValArg: 3.945 ± 1.321
ValSer: 4.734 ± 1.42
ValThr: 4.734 ± 1.807
ValVal: 3.156 ± 1.366
ValTrp: 0.789 ± 0.448
ValTyr: 2.367 ± 0.522
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.183 ± 0.586
TrpCys: 0.0 ± 0.0
TrpAsp: 0.394 ± 0.377
TrpGlu: 1.183 ± 0.734
TrpPhe: 0.394 ± 0.335
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.789 ± 0.67
TrpLys: 0.789 ± 0.556
TrpLeu: 1.578 ± 0.722
TrpMet: 0.0 ± 0.0
TrpAsn: 0.789 ± 0.754
TrpPro: 0.0 ± 0.0
TrpGln: 1.183 ± 0.586
TrpArg: 0.0 ± 0.0
TrpSer: 1.183 ± 0.953
TrpThr: 1.578 ± 0.635
TrpVal: 0.789 ± 0.387
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.394 ± 0.335
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.156 ± 1.072
TyrCys: 0.789 ± 0.639
TyrAsp: 1.183 ± 0.325
TyrGlu: 1.183 ± 0.412
TyrPhe: 1.578 ± 0.62
TyrGly: 2.367 ± 1.001
TyrHis: 0.789 ± 0.406
TyrIle: 2.761 ± 1.041
TyrLys: 1.972 ± 0.78
TyrLeu: 4.339 ± 1.122
TyrMet: 0.394 ± 0.454
TyrAsn: 0.789 ± 0.412
TyrPro: 1.578 ± 0.571
TyrGln: 2.367 ± 0.692
TyrArg: 1.578 ± 0.896
TyrSer: 0.789 ± 0.528
TyrThr: 1.972 ± 0.719
TyrVal: 1.183 ± 0.586
TyrTrp: 0.789 ± 0.618
TyrTyr: 3.945 ± 1.342
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2536 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
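A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is reported as the error. The function names, the toy sequences, the fixed seed, and the use of the sample standard deviation are assumptions; the database's exact procedure is not described here.

```python
import random
from collections import Counter
from statistics import stdev


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (per mille) over overlapping pairs within each protein."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Spread of the dipeptide frequency across protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the proteome size fixed.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return stdev(estimates)


# Toy input (hypothetical sequences, not taken from this proteome).
toy_proteome = ["MAAK", "AAC", "MKKL"]
aa_error = bootstrap_error(toy_proteome, "AA")
```

A dipeptide that never occurs has the same frequency (zero) in every replicate, which matches the `0.0 ± 0.0` entries in the table.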

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski