Amino acid dipeptide frequency for Human papillomavirus 93

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
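As a minimal sketch of how such a table could be computed, the Python example below counts overlapping residue pairs, reports them in per mille, and labels each value relative to the uniform expectation of 0.25% (2.5‰). The function names, the toy sequences, and the choice not to count pairs spanning two proteins are assumptions made for illustration; this is not the Proteome-pI implementation.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes
EXPECTED_PER_MILLE = 1000.0 / 400  # uniform expectation: 0.25% = 2.5 per mille


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard letters to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins (assumption).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


def classify(per_mille):
    """Label a frequency relative to the uniform expectation of 2.5 per mille."""
    if per_mille > EXPECTED_PER_MILLE:
        return "over"
    if per_mille < EXPECTED_PER_MILLE:
        return "under"
    return "expected"


def to_fraction(per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return per_mille * 1e-3


toy = ["MKLLA", "LLKB"]  # hypothetical sequences, not HPV93 data
freqs = dipeptide_per_mille(toy)
```

With the values from the table, `classify(10.891)` (LeuLeu) yields "over" and `classify(2.334)` (AlaAla) yields "under".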

For more information, see the sequence space article on Wikipedia.

Amino acid order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Entries are listed as dipeptide: frequency ± error (‰), grouped by the first amino acid.

Ala
AlaAla: 2.334 ± 1.045
AlaCys: 0.389 ± 0.315
AlaAsp: 3.501 ± 0.747
AlaGlu: 3.112 ± 1.474
AlaPhe: 3.501 ± 1.278
AlaGly: 2.334 ± 0.895
AlaHis: 1.167 ± 0.387
AlaIle: 1.945 ± 0.62
AlaLys: 3.501 ± 1.13
AlaLeu: 3.501 ± 0.848
AlaMet: 1.556 ± 0.879
AlaAsn: 2.723 ± 1.426
AlaPro: 4.278 ± 1.293
AlaGln: 1.945 ± 0.873
AlaArg: 3.89 ± 1.331
AlaSer: 4.278 ± 1.305
AlaThr: 5.445 ± 1.267
AlaVal: 2.334 ± 0.87
AlaTrp: 0.778 ± 0.593
AlaTyr: 1.556 ± 0.479
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.389 ± 0.316
CysCys: 1.556 ± 1.22
CysAsp: 0.778 ± 0.384
CysGlu: 0.778 ± 0.657
CysPhe: 1.556 ± 0.616
CysGly: 1.167 ± 1.317
CysHis: 0.0 ± 0.0
CysIle: 0.389 ± 0.399
CysLys: 2.723 ± 0.734
CysLeu: 1.945 ± 1.158
CysMet: 0.389 ± 0.399
CysAsn: 0.778 ± 0.486
CysPro: 1.556 ± 0.807
CysGln: 0.0 ± 0.0
CysArg: 1.945 ± 1.323
CysSer: 0.778 ± 0.593
CysThr: 0.778 ± 0.402
CysVal: 0.778 ± 0.384
CysTrp: 0.778 ± 0.384
CysTyr: 0.389 ± 0.315
CysXaa: 0.0 ± 0.0

Asp
AspAla: 3.112 ± 0.715
AspCys: 0.389 ± 0.297
AspAsp: 4.667 ± 0.963
AspGlu: 3.112 ± 1.342
AspPhe: 3.112 ± 1.176
AspGly: 3.112 ± 0.833
AspHis: 0.389 ± 0.316
AspIle: 4.278 ± 1.287
AspLys: 0.778 ± 0.402
AspLeu: 7.001 ± 2.185
AspMet: 1.167 ± 0.637
AspAsn: 3.89 ± 0.922
AspPro: 3.89 ± 0.907
AspGln: 4.278 ± 1.97
AspArg: 1.556 ± 0.803
AspSer: 3.112 ± 0.95
AspThr: 3.501 ± 0.93
AspVal: 4.278 ± 0.74
AspTrp: 1.167 ± 0.59
AspTyr: 1.167 ± 0.563
AspXaa: 0.0 ± 0.0

Glu
GluAla: 4.667 ± 1.808
GluCys: 0.778 ± 0.593
GluAsp: 5.834 ± 1.625
GluGlu: 5.056 ± 2.436
GluPhe: 2.334 ± 1.003
GluGly: 5.834 ± 4.254
GluHis: 1.167 ± 0.396
GluIle: 2.723 ± 0.986
GluLys: 2.334 ± 1.144
GluLeu: 6.612 ± 2.079
GluMet: 0.778 ± 0.593
GluAsn: 2.723 ± 0.997
GluPro: 2.334 ± 1.481
GluGln: 5.056 ± 1.712
GluArg: 2.334 ± 0.959
GluSer: 5.834 ± 2.007
GluThr: 4.278 ± 1.402
GluVal: 6.223 ± 0.737
GluTrp: 0.389 ± 0.315
GluTyr: 1.556 ± 1.262
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.723 ± 0.973
PheCys: 0.778 ± 0.74
PheAsp: 3.112 ± 0.656
PheGlu: 2.723 ± 1.37
PhePhe: 1.945 ± 0.838
PheGly: 1.556 ± 0.729
PheHis: 0.0 ± 0.0
PheIle: 2.334 ± 0.71
PheLys: 2.723 ± 0.938
PheLeu: 3.112 ± 0.475
PheMet: 0.0 ± 0.0
PheAsn: 0.778 ± 0.631
PhePro: 2.334 ± 0.683
PheGln: 1.556 ± 0.814
PheArg: 2.334 ± 0.671
PheSer: 1.945 ± 0.759
PheThr: 1.556 ± 0.858
PheVal: 2.334 ± 0.791
PheTrp: 1.556 ± 0.768
PheTyr: 2.334 ± 0.583
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 2.334 ± 0.725
GlyCys: 1.167 ± 0.743
GlyAsp: 4.278 ± 0.915
GlyGlu: 5.834 ± 2.799
GlyPhe: 0.389 ± 0.315
GlyGly: 4.667 ± 2.872
GlyHis: 3.112 ± 1.32
GlyIle: 2.723 ± 0.633
GlyLys: 3.89 ± 0.843
GlyLeu: 2.723 ± 0.626
GlyMet: 0.389 ± 0.316
GlyAsn: 2.723 ± 1.072
GlyPro: 2.723 ± 0.96
GlyGln: 2.723 ± 0.736
GlyArg: 9.335 ± 3.878
GlySer: 5.834 ± 1.772
GlyThr: 4.667 ± 1.501
GlyVal: 3.89 ± 0.62
GlyTrp: 0.778 ± 0.798
GlyTyr: 1.945 ± 0.99
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.778 ± 0.384
HisCys: 0.778 ± 0.486
HisAsp: 0.0 ± 0.0
HisGlu: 0.389 ± 0.316
HisPhe: 1.945 ± 0.442
HisGly: 1.167 ± 0.861
HisHis: 0.0 ± 0.0
HisIle: 0.0 ± 0.0
HisLys: 0.778 ± 0.563
HisLeu: 0.778 ± 0.593
HisMet: 0.0 ± 0.0
HisAsn: 1.556 ± 0.476
HisPro: 2.334 ± 1.148
HisGln: 0.389 ± 0.297
HisArg: 0.778 ± 0.384
HisSer: 0.389 ± 0.297
HisThr: 2.723 ± 1.246
HisVal: 0.778 ± 0.384
HisTrp: 1.167 ± 0.396
HisTyr: 0.778 ± 0.368
HisXaa: 0.0 ± 0.0

Ile
IleAla: 2.723 ± 1.007
IleCys: 1.556 ± 0.736
IleAsp: 1.167 ± 0.68
IleGlu: 3.89 ± 1.53
IlePhe: 1.167 ± 0.701
IleGly: 3.89 ± 1.185
IleHis: 0.778 ± 0.369
IleIle: 1.556 ± 0.706
IleLys: 1.167 ± 0.59
IleLeu: 2.334 ± 0.454
IleMet: 0.389 ± 0.355
IleAsn: 2.723 ± 0.784
IlePro: 1.945 ± 1.103
IleGln: 2.334 ± 0.983
IleArg: 2.723 ± 0.421
IleSer: 3.89 ± 1.31
IleThr: 1.167 ± 0.518
IleVal: 3.501 ± 1.344
IleTrp: 0.778 ± 0.486
IleTyr: 2.723 ± 0.963
IleXaa: 0.0 ± 0.0

Lys
LysAla: 2.334 ± 0.792
LysCys: 1.167 ± 0.65
LysAsp: 1.556 ± 0.533
LysGlu: 5.056 ± 1.541
LysPhe: 2.723 ± 0.951
LysGly: 4.278 ± 1.082
LysHis: 1.167 ± 0.619
LysIle: 1.945 ± 1.014
LysLys: 3.89 ± 1.689
LysLeu: 4.278 ± 1.366
LysMet: 0.778 ± 0.658
LysAsn: 1.945 ± 0.573
LysPro: 2.334 ± 1.98
LysGln: 2.334 ± 0.951
LysArg: 5.056 ± 0.898
LysSer: 3.501 ± 1.651
LysThr: 1.167 ± 0.619
LysVal: 3.501 ± 1.018
LysTrp: 0.778 ± 0.368
LysTyr: 1.945 ± 0.881
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 4.278 ± 1.677
LeuCys: 2.723 ± 1.149
LeuAsp: 6.223 ± 0.628
LeuGlu: 7.001 ± 1.5
LeuPhe: 3.89 ± 1.053
LeuGly: 5.834 ± 1.661
LeuHis: 1.556 ± 0.828
LeuIle: 3.89 ± 1.161
LeuLys: 3.112 ± 0.898
LeuLeu: 10.891 ± 2.569
LeuMet: 2.334 ± 0.702
LeuAsn: 3.112 ± 1.149
LeuPro: 4.278 ± 1.692
LeuGln: 6.612 ± 1.295
LeuArg: 3.89 ± 1.198
LeuSer: 5.834 ± 1.83
LeuThr: 4.667 ± 1.196
LeuVal: 5.834 ± 0.779
LeuTrp: 0.389 ± 0.297
LeuTyr: 1.945 ± 0.697
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.556 ± 0.694
MetCys: 0.0 ± 0.0
MetAsp: 0.389 ± 0.297
MetGlu: 0.389 ± 0.316
MetPhe: 0.389 ± 0.315
MetGly: 0.0 ± 0.0
MetHis: 0.0 ± 0.0
MetIle: 0.389 ± 0.632
MetLys: 0.778 ± 0.486
MetLeu: 0.778 ± 0.368
MetMet: 0.389 ± 0.316
MetAsn: 0.778 ± 0.402
MetPro: 0.0 ± 0.0
MetGln: 1.167 ± 0.563
MetArg: 0.778 ± 0.593
MetSer: 3.112 ± 1.757
MetThr: 0.389 ± 0.355
MetVal: 1.945 ± 0.745
MetTrp: 0.389 ± 0.316
MetTyr: 0.778 ± 0.384
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.334 ± 0.683
AsnCys: 1.167 ± 1.273
AsnAsp: 3.89 ± 1.753
AsnGlu: 1.556 ± 0.684
AsnPhe: 1.556 ± 0.707
AsnGly: 2.334 ± 0.943
AsnHis: 0.778 ± 0.632
AsnIle: 1.945 ± 0.545
AsnLys: 3.112 ± 0.654
AsnLeu: 4.278 ± 1.207
AsnMet: 0.389 ± 0.549
AsnAsn: 2.723 ± 1.11
AsnPro: 3.89 ± 1.325
AsnGln: 1.556 ± 0.719
AsnArg: 3.501 ± 0.997
AsnSer: 3.112 ± 0.91
AsnThr: 3.112 ± 1.012
AsnVal: 2.723 ± 0.81
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.778 ± 0.481
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 3.501 ± 1.74
ProCys: 1.556 ± 0.707
ProAsp: 4.667 ± 1.074
ProGlu: 4.667 ± 0.982
ProPhe: 0.778 ± 0.384
ProGly: 3.112 ± 1.114
ProHis: 0.389 ± 0.549
ProIle: 0.778 ± 0.394
ProLys: 3.112 ± 1.302
ProLeu: 7.39 ± 2.648
ProMet: 1.167 ± 0.89
ProAsn: 2.334 ± 0.793
ProPro: 8.946 ± 4.757
ProGln: 3.112 ± 2.135
ProArg: 3.112 ± 1.589
ProSer: 3.89 ± 2.058
ProThr: 5.445 ± 1.788
ProVal: 5.056 ± 1.857
ProTrp: 0.0 ± 0.0
ProTyr: 1.167 ± 0.946
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 1.945 ± 0.689
GlnCys: 0.778 ± 0.481
GlnAsp: 3.112 ± 1.268
GlnGlu: 1.556 ± 0.522
GlnPhe: 0.778 ± 0.599
GlnGly: 3.112 ± 0.802
GlnHis: 1.945 ± 0.89
GlnIle: 4.278 ± 0.96
GlnLys: 2.334 ± 1.149
GlnLeu: 5.445 ± 1.129
GlnMet: 1.556 ± 0.728
GlnAsn: 3.112 ± 1.345
GlnPro: 2.334 ± 0.845
GlnGln: 2.723 ± 0.97
GlnArg: 2.334 ± 0.946
GlnSer: 4.667 ± 0.831
GlnThr: 3.112 ± 0.849
GlnVal: 1.945 ± 0.751
GlnTrp: 0.778 ± 0.368
GlnTyr: 1.945 ± 0.652
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 3.112 ± 0.86
ArgCys: 2.334 ± 1.422
ArgAsp: 2.723 ± 0.992
ArgGlu: 3.501 ± 1.237
ArgPhe: 2.334 ± 0.561
ArgGly: 7.001 ± 2.895
ArgHis: 1.556 ± 0.707
ArgIle: 0.778 ± 0.65
ArgLys: 4.278 ± 0.787
ArgLeu: 6.612 ± 1.237
ArgMet: 0.0 ± 0.0
ArgAsn: 2.334 ± 0.653
ArgPro: 1.945 ± 1.13
ArgGln: 3.112 ± 0.988
ArgArg: 7.001 ± 2.919
ArgSer: 10.502 ± 3.684
ArgThr: 3.89 ± 1.23
ArgVal: 3.112 ± 1.176
ArgTrp: 0.389 ± 0.316
ArgTyr: 2.334 ± 0.561
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 5.445 ± 1.36
SerCys: 0.389 ± 0.315
SerAsp: 5.056 ± 1.023
SerGlu: 5.056 ± 0.75
SerPhe: 4.278 ± 1.215
SerGly: 7.779 ± 1.495
SerHis: 1.167 ± 0.517
SerIle: 3.112 ± 0.809
SerLys: 4.278 ± 0.645
SerLeu: 6.612 ± 1.659
SerMet: 1.167 ± 0.61
SerAsn: 1.556 ± 1.187
SerPro: 4.667 ± 1.519
SerGln: 2.334 ± 1.038
SerArg: 7.779 ± 3.237
SerSer: 10.891 ± 4.384
SerThr: 7.001 ± 1.869
SerVal: 2.723 ± 0.906
SerTrp: 0.778 ± 0.368
SerTyr: 1.167 ± 0.661
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 1.945 ± 0.851
ThrCys: 1.556 ± 0.597
ThrAsp: 2.334 ± 0.877
ThrGlu: 7.001 ± 2.116
ThrPhe: 2.334 ± 0.683
ThrGly: 3.112 ± 0.907
ThrHis: 0.389 ± 0.297
ThrIle: 2.723 ± 1.369
ThrLys: 1.556 ± 0.51
ThrLeu: 3.89 ± 1.348
ThrMet: 1.167 ± 0.59
ThrAsn: 3.501 ± 0.488
ThrPro: 7.39 ± 2.627
ThrGln: 3.89 ± 0.934
ThrArg: 3.501 ± 0.883
ThrSer: 3.89 ± 1.912
ThrThr: 3.112 ± 1.164
ThrVal: 5.445 ± 1.061
ThrTrp: 0.778 ± 0.552
ThrTyr: 3.112 ± 0.842
ThrXaa: 0.0 ± 0.0

Val
ValAla: 5.445 ± 1.135
ValCys: 0.0 ± 0.0
ValAsp: 3.112 ± 0.968
ValGlu: 5.834 ± 1.355
ValPhe: 1.945 ± 0.682
ValGly: 3.112 ± 1.119
ValHis: 1.167 ± 0.661
ValIle: 3.89 ± 1.146
ValLys: 2.334 ± 1.12
ValLeu: 3.89 ± 1.032
ValMet: 0.0 ± 0.0
ValAsn: 3.89 ± 0.615
ValPro: 5.056 ± 1.384
ValGln: 2.334 ± 0.775
ValArg: 5.056 ± 1.645
ValSer: 5.056 ± 0.885
ValThr: 3.112 ± 1.141
ValVal: 2.723 ± 0.955
ValTrp: 0.778 ± 0.631
ValTyr: 3.112 ± 1.503
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.556 ± 0.707
TrpCys: 0.389 ± 0.297
TrpAsp: 0.389 ± 0.315
TrpGlu: 0.778 ± 0.552
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.778 ± 0.593
TrpLys: 1.945 ± 1.075
TrpLeu: 1.945 ± 0.975
TrpMet: 0.0 ± 0.0
TrpAsn: 0.389 ± 0.315
TrpPro: 0.0 ± 0.0
TrpGln: 0.778 ± 0.402
TrpArg: 0.389 ± 0.316
TrpSer: 1.167 ± 0.619
TrpThr: 0.778 ± 0.368
TrpVal: 1.167 ± 0.59
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.389 ± 0.297
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 1.556 ± 0.614
TyrCys: 0.0 ± 0.0
TyrAsp: 1.167 ± 0.428
TyrGlu: 1.556 ± 0.542
TyrPhe: 0.778 ± 0.402
TyrGly: 2.723 ± 0.625
TyrHis: 0.778 ± 0.394
TyrIle: 1.945 ± 0.649
TyrLys: 3.501 ± 1.101
TyrLeu: 4.278 ± 0.952
TyrMet: 0.389 ± 0.297
TyrAsn: 1.167 ± 0.946
TyrPro: 1.945 ± 0.915
TyrGln: 1.556 ± 0.614
TyrArg: 1.556 ± 0.561
TyrSer: 1.556 ± 0.788
TyrThr: 2.723 ± 1.042
TyrVal: 1.556 ± 0.675
TyrTrp: 0.389 ± 0.549
TyrTyr: 2.334 ± 1.021
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 7 proteins (2572 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
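A protein-level bootstrap of this kind could be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the estimates is reported. The page does not state which spread measure is used, so the standard deviation here, as well as the function names and the fixed seed, are assumptions for illustration only.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (two-letter string) in per mille."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2 within each protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


toy = ["MKLLA", "LLKA", "AAKL"]  # hypothetical sequences, not HPV93 data
err_ll = bootstrap_error(toy, "LL")
err_ww = bootstrap_error(toy, "WW")
```

A dipeptide that never occurs in any protein has zero frequency in every resample, so its estimated error is also zero, which is consistent with the 0.0 ± 0.0 entries in the table.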

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski