Amino acid dipepetide frequency for human papillomavirus 81

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.907AlaAla: 7.907 ± 2.117
2.081AlaCys: 2.081 ± 1.611
5.826AlaAsp: 5.826 ± 0.854
2.497AlaGlu: 2.497 ± 0.875
2.497AlaPhe: 2.497 ± 1.075
2.913AlaGly: 2.913 ± 0.905
1.665AlaHis: 1.665 ± 0.887
3.745AlaIle: 3.745 ± 0.849
2.081AlaLys: 2.081 ± 0.783
4.578AlaLeu: 4.578 ± 1.322
2.081AlaMet: 2.081 ± 0.601
2.081AlaAsn: 2.081 ± 0.783
5.826AlaPro: 5.826 ± 1.839
3.745AlaGln: 3.745 ± 0.976
4.994AlaArg: 4.994 ± 0.9
7.907AlaSer: 7.907 ± 1.95
5.826AlaThr: 5.826 ± 1.71
4.161AlaVal: 4.161 ± 1.052
0.416AlaTrp: 0.416 ± 0.363
2.497AlaTyr: 2.497 ± 1.107
0.0AlaXaa: 0.0 ± 0.0
Cys
2.497CysAla: 2.497 ± 0.723
0.832CysCys: 0.832 ± 0.731
0.0CysAsp: 0.0 ± 0.0
1.248CysGlu: 1.248 ± 0.844
0.416CysPhe: 0.416 ± 0.531
1.665CysGly: 1.665 ± 0.798
1.665CysHis: 1.665 ± 1.194
2.081CysIle: 2.081 ± 0.865
3.329CysLys: 3.329 ± 1.372
2.081CysLeu: 2.081 ± 1.049
1.665CysMet: 1.665 ± 0.619
0.832CysAsn: 0.832 ± 0.459
2.081CysPro: 2.081 ± 0.608
1.248CysGln: 1.248 ± 0.396
2.081CysArg: 2.081 ± 1.064
0.0CysSer: 0.0 ± 0.0
1.665CysThr: 1.665 ± 0.811
0.416CysVal: 0.416 ± 0.316
1.248CysTrp: 1.248 ± 0.396
0.416CysTyr: 0.416 ± 0.531
0.0CysXaa: 0.0 ± 0.0
Asp
4.994AspAla: 4.994 ± 1.304
1.248AspCys: 1.248 ± 0.65
3.745AspAsp: 3.745 ± 0.929
1.665AspGlu: 1.665 ± 0.705
1.665AspPhe: 1.665 ± 0.597
5.41AspGly: 5.41 ± 1.326
0.832AspHis: 0.832 ± 0.631
4.161AspIle: 4.161 ± 1.712
1.248AspLys: 1.248 ± 0.355
4.994AspLeu: 4.994 ± 1.618
1.665AspMet: 1.665 ± 0.889
2.081AspAsn: 2.081 ± 0.608
4.161AspPro: 4.161 ± 1.844
1.665AspGln: 1.665 ± 0.27
2.913AspArg: 2.913 ± 0.879
3.745AspSer: 3.745 ± 1.236
5.826AspThr: 5.826 ± 1.662
3.329AspVal: 3.329 ± 1.365
1.248AspTrp: 1.248 ± 0.625
0.832AspTyr: 0.832 ± 0.717
0.0AspXaa: 0.0 ± 0.0
Glu
3.745GluAla: 3.745 ± 0.632
1.248GluCys: 1.248 ± 0.741
3.745GluAsp: 3.745 ± 1.342
8.323GluGlu: 8.323 ± 2.729
1.248GluPhe: 1.248 ± 0.691
2.081GluGly: 2.081 ± 0.79
1.248GluHis: 1.248 ± 0.456
1.665GluIle: 1.665 ± 1.031
2.081GluLys: 2.081 ± 0.842
5.41GluLeu: 5.41 ± 2.561
0.416GluMet: 0.416 ± 0.359
2.497GluAsn: 2.497 ± 0.875
3.329GluPro: 3.329 ± 0.851
3.329GluGln: 3.329 ± 1.5
0.832GluArg: 0.832 ± 0.4
1.665GluSer: 1.665 ± 0.709
3.329GluThr: 3.329 ± 0.66
3.745GluVal: 3.745 ± 1.068
0.832GluTrp: 0.832 ± 0.531
2.913GluTyr: 2.913 ± 0.941
0.0GluXaa: 0.0 ± 0.0
Phe
2.913PheAla: 2.913 ± 1.039
1.665PheCys: 1.665 ± 0.692
1.248PheAsp: 1.248 ± 0.715
0.832PheGlu: 0.832 ± 0.726
2.081PhePhe: 2.081 ± 0.711
2.497PheGly: 2.497 ± 0.905
0.416PheHis: 0.416 ± 0.531
0.832PheIle: 0.832 ± 0.4
3.329PheLys: 3.329 ± 1.245
4.578PheLeu: 4.578 ± 0.65
2.081PheMet: 2.081 ± 1.183
1.665PheAsn: 1.665 ± 1.026
1.665PhePro: 1.665 ± 1.031
1.248PheGln: 1.248 ± 0.625
0.832PheArg: 0.832 ± 0.4
0.832PheSer: 0.832 ± 0.631
2.081PheThr: 2.081 ± 1.379
2.497PheVal: 2.497 ± 0.774
0.832PheTrp: 0.832 ± 0.4
1.248PheTyr: 1.248 ± 0.698
0.0PheXaa: 0.0 ± 0.0
Gly
4.994GlyAla: 4.994 ± 0.555
1.248GlyCys: 1.248 ± 0.637
5.41GlyAsp: 5.41 ± 1.836
4.994GlyGlu: 4.994 ± 0.889
1.248GlyPhe: 1.248 ± 0.756
4.578GlyGly: 4.578 ± 1.305
4.578GlyHis: 4.578 ± 1.027
2.497GlyIle: 2.497 ± 0.567
2.913GlyLys: 2.913 ± 0.421
2.913GlyLeu: 2.913 ± 0.977
0.416GlyMet: 0.416 ± 0.316
2.081GlyAsn: 2.081 ± 0.79
3.329GlyPro: 3.329 ± 1.398
2.913GlyGln: 2.913 ± 0.561
2.081GlyArg: 2.081 ± 0.79
3.745GlySer: 3.745 ± 1.742
7.907GlyThr: 7.907 ± 2.106
2.913GlyVal: 2.913 ± 0.611
0.832GlyTrp: 0.832 ± 0.571
2.913GlyTyr: 2.913 ± 0.504
0.0GlyXaa: 0.0 ± 0.0
His
2.081HisAla: 2.081 ± 0.842
1.665HisCys: 1.665 ± 0.805
0.832HisAsp: 0.832 ± 0.387
1.665HisGlu: 1.665 ± 1.005
1.665HisPhe: 1.665 ± 0.464
2.081HisGly: 2.081 ± 0.763
0.416HisHis: 0.416 ± 0.493
1.248HisIle: 1.248 ± 0.399
1.248HisLys: 1.248 ± 0.738
2.913HisLeu: 2.913 ± 1.059
0.0HisMet: 0.0 ± 0.0
1.248HisAsn: 1.248 ± 0.73
2.497HisPro: 2.497 ± 0.957
0.832HisGln: 0.832 ± 0.76
2.497HisArg: 2.497 ± 0.725
1.665HisSer: 1.665 ± 0.541
1.665HisThr: 1.665 ± 0.641
0.832HisVal: 0.832 ± 0.475
1.665HisTrp: 1.665 ± 1.114
1.248HisTyr: 1.248 ± 0.606
0.0HisXaa: 0.0 ± 0.0
Ile
1.248IleAla: 1.248 ± 0.617
1.665IleCys: 1.665 ± 1.156
2.913IleAsp: 2.913 ± 0.502
2.081IleGlu: 2.081 ± 0.651
1.665IlePhe: 1.665 ± 0.821
4.161IleGly: 4.161 ± 1.538
2.081IleHis: 2.081 ± 0.642
1.665IleIle: 1.665 ± 0.947
2.081IleLys: 2.081 ± 0.85
1.665IleLeu: 1.665 ± 1.349
0.416IleMet: 0.416 ± 0.359
0.416IleAsn: 0.416 ± 0.316
2.913IlePro: 2.913 ± 1.184
2.081IleGln: 2.081 ± 0.7
1.665IleArg: 1.665 ± 1.08
2.913IleSer: 2.913 ± 0.839
3.745IleThr: 3.745 ± 0.932
3.745IleVal: 3.745 ± 1.56
0.832IleTrp: 0.832 ± 0.76
2.497IleTyr: 2.497 ± 1.001
0.0IleXaa: 0.0 ± 0.0
Lys
3.329LysAla: 3.329 ± 1.086
2.913LysCys: 2.913 ± 1.075
2.081LysAsp: 2.081 ± 0.741
2.497LysGlu: 2.497 ± 1.474
2.081LysPhe: 2.081 ± 0.7
2.913LysGly: 2.913 ± 1.396
2.081LysHis: 2.081 ± 0.7
2.081LysIle: 2.081 ± 0.515
2.497LysLys: 2.497 ± 1.201
2.497LysLeu: 2.497 ± 1.144
1.248LysMet: 1.248 ± 0.391
0.832LysAsn: 0.832 ± 0.566
1.665LysPro: 1.665 ± 0.692
1.248LysGln: 1.248 ± 0.396
4.161LysArg: 4.161 ± 1.02
3.329LysSer: 3.329 ± 1.697
1.665LysThr: 1.665 ± 0.887
3.745LysVal: 3.745 ± 0.943
0.832LysTrp: 0.832 ± 0.475
2.497LysTyr: 2.497 ± 0.823
0.0LysXaa: 0.0 ± 0.0
Leu
5.826LeuAla: 5.826 ± 1.321
2.081LeuCys: 2.081 ± 1.083
7.491LeuAsp: 7.491 ± 1.127
2.913LeuGlu: 2.913 ± 1.082
4.161LeuPhe: 4.161 ± 0.654
3.329LeuGly: 3.329 ± 0.434
3.329LeuHis: 3.329 ± 1.029
3.745LeuIle: 3.745 ± 1.295
4.578LeuLys: 4.578 ± 1.864
8.739LeuLeu: 8.739 ± 2.374
1.665LeuMet: 1.665 ± 0.872
1.665LeuAsn: 1.665 ± 0.615
2.913LeuPro: 2.913 ± 1.703
6.242LeuGln: 6.242 ± 1.774
5.41LeuArg: 5.41 ± 0.98
5.826LeuSer: 5.826 ± 1.122
4.161LeuThr: 4.161 ± 2.018
3.329LeuVal: 3.329 ± 1.683
0.416LeuTrp: 0.416 ± 0.616
5.41LeuTyr: 5.41 ± 1.016
0.0LeuXaa: 0.0 ± 0.0
Met
3.329MetAla: 3.329 ± 0.944
0.832MetCys: 0.832 ± 0.399
0.832MetAsp: 0.832 ± 0.646
0.416MetGlu: 0.416 ± 0.38
1.248MetPhe: 1.248 ± 1.076
1.665MetGly: 1.665 ± 0.619
0.416MetHis: 0.416 ± 0.512
0.0MetIle: 0.0 ± 0.0
0.416MetLys: 0.416 ± 0.359
0.832MetLeu: 0.832 ± 0.631
0.0MetMet: 0.0 ± 0.0
0.832MetAsn: 0.832 ± 0.59
1.248MetPro: 1.248 ± 0.681
0.832MetGln: 0.832 ± 0.399
0.0MetArg: 0.0 ± 0.0
2.081MetSer: 2.081 ± 0.578
0.416MetThr: 0.416 ± 0.316
3.745MetVal: 3.745 ± 0.921
1.248MetTrp: 1.248 ± 0.396
0.832MetTyr: 0.832 ± 0.646
0.0MetXaa: 0.0 ± 0.0
Asn
2.497AsnAla: 2.497 ± 1.222
0.416AsnCys: 0.416 ± 0.316
0.0AsnAsp: 0.0 ± 0.0
1.665AsnGlu: 1.665 ± 0.642
2.081AsnPhe: 2.081 ± 1.086
1.248AsnGly: 1.248 ± 0.625
0.416AsnHis: 0.416 ± 0.493
1.248AsnIle: 1.248 ± 0.396
3.329AsnLys: 3.329 ± 1.835
2.497AsnLeu: 2.497 ± 0.761
0.416AsnMet: 0.416 ± 0.359
1.248AsnAsn: 1.248 ± 0.396
2.913AsnPro: 2.913 ± 0.675
0.416AsnGln: 0.416 ± 0.493
1.665AsnArg: 1.665 ± 0.642
0.832AsnSer: 0.832 ± 0.72
4.161AsnThr: 4.161 ± 1.125
2.081AsnVal: 2.081 ± 0.578
1.248AsnTrp: 1.248 ± 0.617
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
6.658ProAla: 6.658 ± 1.616
2.081ProCys: 2.081 ± 0.914
4.578ProAsp: 4.578 ± 1.151
2.497ProGlu: 2.497 ± 0.633
0.832ProPhe: 0.832 ± 0.531
2.913ProGly: 2.913 ± 0.859
0.832ProHis: 0.832 ± 0.76
2.081ProIle: 2.081 ± 0.922
3.745ProLys: 3.745 ± 0.72
7.907ProLeu: 7.907 ± 1.545
0.832ProMet: 0.832 ± 0.726
2.081ProAsn: 2.081 ± 0.439
7.907ProPro: 7.907 ± 2.4
0.832ProGln: 0.832 ± 0.546
2.081ProArg: 2.081 ± 1.434
5.826ProSer: 5.826 ± 3.253
4.994ProThr: 4.994 ± 1.331
4.578ProVal: 4.578 ± 1.648
0.416ProTrp: 0.416 ± 0.38
2.497ProTyr: 2.497 ± 1.383
0.0ProXaa: 0.0 ± 0.0
Gln
3.745GlnAla: 3.745 ± 1.905
0.416GlnCys: 0.416 ± 0.493
1.665GlnAsp: 1.665 ± 0.55
3.745GlnGlu: 3.745 ± 1.286
2.497GlnPhe: 2.497 ± 0.875
2.081GlnGly: 2.081 ± 0.748
0.832GlnHis: 0.832 ± 0.475
0.832GlnIle: 0.832 ± 0.682
1.665GlnLys: 1.665 ± 0.594
5.41GlnLeu: 5.41 ± 1.85
1.248GlnMet: 1.248 ± 0.691
1.248GlnAsn: 1.248 ± 0.396
2.913GlnPro: 2.913 ± 0.776
2.913GlnGln: 2.913 ± 1.273
2.913GlnArg: 2.913 ± 1.019
2.081GlnSer: 2.081 ± 0.839
2.913GlnThr: 2.913 ± 1.385
2.497GlnVal: 2.497 ± 0.799
1.665GlnTrp: 1.665 ± 0.887
1.248GlnTyr: 1.248 ± 0.664
0.0GlnXaa: 0.0 ± 0.0
Arg
3.745ArgAla: 3.745 ± 0.825
0.832ArgCys: 0.832 ± 0.531
2.081ArgAsp: 2.081 ± 0.423
2.081ArgGlu: 2.081 ± 1.611
1.665ArgPhe: 1.665 ± 0.906
2.913ArgGly: 2.913 ± 0.885
1.665ArgHis: 1.665 ± 1.026
1.248ArgIle: 1.248 ± 0.698
3.329ArgLys: 3.329 ± 1.122
6.242ArgLeu: 6.242 ± 1.135
1.665ArgMet: 1.665 ± 0.587
1.248ArgAsn: 1.248 ± 0.611
3.745ArgPro: 3.745 ± 1.09
2.913ArgGln: 2.913 ± 1.098
3.329ArgArg: 3.329 ± 0.993
4.994ArgSer: 4.994 ± 1.078
3.329ArgThr: 3.329 ± 1.263
3.329ArgVal: 3.329 ± 1.156
0.416ArgTrp: 0.416 ± 0.531
1.248ArgTyr: 1.248 ± 0.611
0.0ArgXaa: 0.0 ± 0.0
Ser
2.497SerAla: 2.497 ± 0.955
1.665SerCys: 1.665 ± 0.669
3.329SerAsp: 3.329 ± 0.707
3.329SerGlu: 3.329 ± 1.004
3.329SerPhe: 3.329 ± 0.852
7.074SerGly: 7.074 ± 2.323
1.665SerHis: 1.665 ± 0.651
2.081SerIle: 2.081 ± 1.065
1.248SerLys: 1.248 ± 0.396
5.41SerLeu: 5.41 ± 1.087
1.665SerMet: 1.665 ± 0.55
1.665SerAsn: 1.665 ± 0.27
3.329SerPro: 3.329 ± 0.741
1.248SerGln: 1.248 ± 0.625
4.578SerArg: 4.578 ± 1.51
7.907SerSer: 7.907 ± 2.873
8.323SerThr: 8.323 ± 1.684
5.826SerVal: 5.826 ± 2.393
0.832SerTrp: 0.832 ± 0.631
2.497SerTyr: 2.497 ± 0.834
0.0SerXaa: 0.0 ± 0.0
Thr
4.578ThrAla: 4.578 ± 0.649
2.913ThrCys: 2.913 ± 0.671
4.578ThrAsp: 4.578 ± 1.338
3.745ThrGlu: 3.745 ± 0.856
1.665ThrPhe: 1.665 ± 0.597
7.074ThrGly: 7.074 ± 2.597
1.665ThrHis: 1.665 ± 0.817
4.161ThrIle: 4.161 ± 1.359
1.248ThrLys: 1.248 ± 0.78
7.907ThrLeu: 7.907 ± 2.054
0.832ThrMet: 0.832 ± 0.4
2.497ThrAsn: 2.497 ± 0.686
5.41ThrPro: 5.41 ± 1.632
4.578ThrGln: 4.578 ± 1.205
2.497ThrArg: 2.497 ± 0.82
6.242ThrSer: 6.242 ± 1.065
5.826ThrThr: 5.826 ± 1.689
7.491ThrVal: 7.491 ± 0.928
0.832ThrTrp: 0.832 ± 0.617
0.832ThrTyr: 0.832 ± 0.566
0.0ThrXaa: 0.0 ± 0.0
Val
2.081ValAla: 2.081 ± 0.651
1.248ValCys: 1.248 ± 0.766
4.578ValAsp: 4.578 ± 1.174
4.994ValGlu: 4.994 ± 1.506
2.081ValPhe: 2.081 ± 0.838
4.161ValGly: 4.161 ± 0.565
2.497ValHis: 2.497 ± 1.004
3.329ValIle: 3.329 ± 0.56
0.832ValLys: 0.832 ± 0.459
2.497ValLeu: 2.497 ± 1.348
1.248ValMet: 1.248 ± 0.776
2.081ValAsn: 2.081 ± 1.091
5.41ValPro: 5.41 ± 1.315
4.161ValGln: 4.161 ± 1.39
4.578ValArg: 4.578 ± 1.326
7.074ValSer: 7.074 ± 1.916
6.242ValThr: 6.242 ± 1.847
6.658ValVal: 6.658 ± 2.038
0.832ValTrp: 0.832 ± 0.566
2.913ValTyr: 2.913 ± 0.956
0.0ValXaa: 0.0 ± 0.0
Trp
2.497TrpAla: 2.497 ± 0.727
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.416TrpGlu: 0.416 ± 0.38
1.248TrpPhe: 1.248 ± 0.647
1.248TrpGly: 1.248 ± 0.713
1.248TrpHis: 1.248 ± 1.14
0.832TrpIle: 0.832 ± 0.631
1.248TrpLys: 1.248 ± 0.611
1.248TrpLeu: 1.248 ± 0.625
0.832TrpMet: 0.832 ± 0.632
0.832TrpAsn: 0.832 ± 0.459
0.416TrpPro: 0.416 ± 0.531
0.416TrpGln: 0.416 ± 0.531
0.832TrpArg: 0.832 ± 0.566
0.0TrpSer: 0.0 ± 0.0
1.665TrpThr: 1.665 ± 0.887
1.665TrpVal: 1.665 ± 0.887
0.0TrpTrp: 0.0 ± 0.0
0.832TrpTyr: 0.832 ± 0.399
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.161TyrAla: 4.161 ± 0.683
0.416TyrCys: 0.416 ± 0.359
2.497TyrAsp: 2.497 ± 0.661
2.081TyrGlu: 2.081 ± 1.022
0.0TyrPhe: 0.0 ± 0.0
2.497TyrGly: 2.497 ± 0.966
0.416TyrHis: 0.416 ± 0.363
2.913TyrIle: 2.913 ± 0.76
3.745TyrLys: 3.745 ± 0.998
3.329TyrLeu: 3.329 ± 1.217
0.416TyrMet: 0.416 ± 0.359
1.248TyrAsn: 1.248 ± 0.691
2.497TyrPro: 2.497 ± 0.999
1.665TyrGln: 1.665 ± 0.464
2.081TyrArg: 2.081 ± 0.758
0.832TyrSer: 0.832 ± 0.641
0.832TyrThr: 0.832 ± 0.726
2.913TyrVal: 2.913 ± 0.955
0.832TyrTrp: 0.832 ± 0.4
1.665TyrTyr: 1.665 ± 0.665
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (2404 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski