Amino acid dipeptide frequency for Human papillomavirus 197

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
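As an illustration, dipeptide frequencies can be computed by sliding a window of length two along each protein sequence. The sketch below is not the database's own code: the function name dipeptide_frequencies is hypothetical, sequences use one-letter codes (the table uses three-letter codes), and dividing by the total number of dipeptides is an assumption about the normalization.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: frequencies are normalized by the total number of
    overlapping dipeptides; the database may use another denominator.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input: the pairs are MA, AA, AK, AA, AL, so AA is 2 of 5.
freqs = dipeptide_frequencies(["MAAK", "AAL"])
print(freqs["AA"])
```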

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
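The expected value and the over/under comparison can be sketched as follows. The function names are hypothetical, and it is an assumption that the comparison is made against the uniform baseline of 20 amino acids; the page does not state which baseline the colouring uses.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, alphabet_size=20):
    """Compare an observed frequency with the uniform expectation."""
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"   # would be marked in red
    if observed_per_mille < expected:
        return "under"  # would be marked in blue
    return "as expected"


print(expected_per_mille())     # 20 amino acids, 400 dipeptides
print(expected_per_mille(21))   # with Xaa, 441 dipeptides
print(classify(9.002))          # LeuLeu value from the table
print(classify(0.0))            # TrpTrp value from the table
```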

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
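A minimal sketch of the conversion, with hypothetical function names. The second function assumes that the denominator is the total of 2445 amino acids reported in the statistics line below the table; this is an inference (it makes the smallest non-zero value, 0.409‰, correspond to about one occurrence), not something the page states.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction."""
    return value_per_mille * 1e-3


def approximate_count(value_per_mille, total=2445):
    """Estimate the number of occurrences behind a table value.

    Assumption: `total` is the denominator used for the table.
    """
    return round(per_mille_to_fraction(value_per_mille) * total)


print(per_mille_to_fraction(5.728))  # AlaAla as a fraction
print(approximate_count(5.728))      # estimated AlaAla occurrences
print(approximate_count(0.409))      # smallest non-zero table value
```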

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.728 ± 1.946
AlaCys: 2.455 ± 1.236
AlaAsp: 3.273 ± 0.535
AlaGlu: 3.682 ± 1.438
AlaPhe: 4.501 ± 1.51
AlaGly: 2.455 ± 0.996
AlaHis: 1.227 ± 0.709
AlaIle: 3.273 ± 1.062
AlaLys: 2.455 ± 0.807
AlaLeu: 3.682 ± 1.507
AlaMet: 0.0 ± 0.0
AlaAsn: 3.273 ± 0.689
AlaPro: 3.682 ± 1.789
AlaGln: 3.273 ± 0.745
AlaArg: 1.637 ± 0.73
AlaSer: 4.501 ± 0.875
AlaThr: 3.273 ± 0.768
AlaVal: 3.273 ± 1.173
AlaTrp: 1.227 ± 1.053
AlaTyr: 1.637 ± 0.622
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.818 ± 0.712
CysCys: 0.818 ± 0.697
CysAsp: 2.046 ± 0.84
CysGlu: 0.818 ± 0.697
CysPhe: 1.227 ± 0.645
CysGly: 0.818 ± 0.568
CysHis: 0.409 ± 0.487
CysIle: 2.046 ± 1.443
CysLys: 1.637 ± 0.56
CysLeu: 2.455 ± 1.593
CysMet: 0.409 ± 0.487
CysAsn: 2.046 ± 0.868
CysPro: 1.637 ± 0.736
CysGln: 0.0 ± 0.0
CysArg: 0.818 ± 0.663
CysSer: 1.637 ± 0.806
CysThr: 1.227 ± 0.806
CysVal: 1.227 ± 0.903
CysTrp: 1.227 ± 0.434
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.91 ± 1.344
AspCys: 2.046 ± 1.36
AspAsp: 6.137 ± 1.805
AspGlu: 4.91 ± 1.817
AspPhe: 3.273 ± 1.247
AspGly: 2.455 ± 0.89
AspHis: 1.227 ± 0.635
AspIle: 4.501 ± 1.178
AspLys: 1.227 ± 0.623
AspLeu: 2.864 ± 1.709
AspMet: 0.818 ± 0.369
AspAsn: 4.501 ± 0.904
AspPro: 5.319 ± 1.883
AspGln: 2.046 ± 0.901
AspArg: 3.273 ± 1.347
AspSer: 7.774 ± 1.42
AspThr: 3.273 ± 2.24
AspVal: 5.319 ± 2.719
AspTrp: 0.409 ± 0.349
AspTyr: 2.046 ± 0.647
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.455 ± 1.386
GluCys: 0.818 ± 0.369
GluAsp: 4.092 ± 1.096
GluGlu: 6.547 ± 2.998
GluPhe: 2.046 ± 0.733
GluGly: 1.637 ± 0.267
GluHis: 1.227 ± 1.078
GluIle: 2.864 ± 0.981
GluLys: 2.864 ± 1.092
GluLeu: 4.092 ± 0.76
GluMet: 0.818 ± 0.369
GluAsn: 4.501 ± 1.009
GluPro: 2.046 ± 1.042
GluGln: 3.273 ± 0.874
GluArg: 2.046 ± 0.746
GluSer: 5.319 ± 1.049
GluThr: 5.319 ± 1.217
GluVal: 4.092 ± 0.659
GluTrp: 1.637 ± 0.267
GluTyr: 1.637 ± 0.637
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.455 ± 1.199
PheCys: 1.637 ± 1.021
PheAsp: 2.455 ± 0.809
PheGlu: 3.682 ± 1.381
PhePhe: 1.227 ± 0.635
PheGly: 0.818 ± 0.369
PheHis: 1.227 ± 0.529
PheIle: 1.637 ± 0.56
PheLys: 4.092 ± 1.477
PheLeu: 4.501 ± 1.317
PheMet: 0.409 ± 0.356
PheAsn: 2.046 ± 0.999
PhePro: 2.046 ± 1.033
PheGln: 4.092 ± 1.335
PheArg: 1.637 ± 0.989
PheSer: 0.818 ± 0.712
PheThr: 1.637 ± 0.54
PheVal: 2.864 ± 0.721
PheTrp: 0.818 ± 0.369
PheTyr: 2.864 ± 0.901
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.501 ± 1.178
GlyCys: 1.227 ± 0.692
GlyAsp: 5.728 ± 2.057
GlyGlu: 4.91 ± 1.594
GlyPhe: 0.409 ± 0.356
GlyGly: 4.501 ± 2.162
GlyHis: 1.227 ± 0.692
GlyIle: 3.682 ± 1.233
GlyLys: 2.864 ± 0.811
GlyLeu: 3.682 ± 0.808
GlyMet: 0.818 ± 0.462
GlyAsn: 2.455 ± 1.106
GlyPro: 2.046 ± 0.475
GlyGln: 2.455 ± 0.919
GlyArg: 4.501 ± 1.657
GlySer: 4.092 ± 2.01
GlyThr: 3.682 ± 2.017
GlyVal: 3.273 ± 0.55
GlyTrp: 0.0 ± 0.0
GlyTyr: 1.227 ± 0.722
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 1.637 ± 0.87
HisAsp: 0.818 ± 0.802
HisGlu: 1.637 ± 0.63
HisPhe: 0.818 ± 0.444
HisGly: 1.227 ± 1.109
HisHis: 0.0 ± 0.0
HisIle: 2.455 ± 0.855
HisLys: 0.818 ± 0.636
HisLeu: 1.227 ± 0.748
HisMet: 0.0 ± 0.0
HisAsn: 1.227 ± 0.593
HisPro: 1.227 ± 0.743
HisGln: 0.818 ± 0.568
HisArg: 1.637 ± 0.563
HisSer: 0.409 ± 0.349
HisThr: 0.409 ± 0.393
HisVal: 1.637 ± 0.615
HisTrp: 0.409 ± 0.356
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.455 ± 0.959
IleCys: 0.818 ± 0.712
IleAsp: 4.91 ± 2.376
IleGlu: 4.092 ± 1.018
IlePhe: 2.864 ± 1.003
IleGly: 4.501 ± 2.138
IleHis: 0.818 ± 0.636
IleIle: 4.91 ± 0.76
IleLys: 0.0 ± 0.0
IleLeu: 4.91 ± 1.555
IleMet: 0.409 ± 0.557
IleAsn: 2.455 ± 0.94
IlePro: 3.682 ± 2.549
IleGln: 2.455 ± 0.929
IleArg: 4.092 ± 1.85
IleSer: 3.273 ± 0.692
IleThr: 3.273 ± 0.745
IleVal: 3.682 ± 1.493
IleTrp: 0.0 ± 0.0
IleTyr: 3.273 ± 0.575
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.046 ± 0.631
LysCys: 1.637 ± 0.615
LysAsp: 2.046 ± 0.831
LysGlu: 3.682 ± 1.339
LysPhe: 1.637 ± 0.738
LysGly: 1.637 ± 1.105
LysHis: 1.637 ± 1.03
LysIle: 2.046 ± 0.859
LysLys: 3.273 ± 0.975
LysLeu: 5.728 ± 1.808
LysMet: 1.637 ± 0.723
LysAsn: 2.455 ± 0.708
LysPro: 1.227 ± 0.645
LysGln: 3.273 ± 1.098
LysArg: 6.137 ± 1.106
LysSer: 2.864 ± 1.331
LysThr: 2.046 ± 0.909
LysVal: 3.682 ± 0.818
LysTrp: 0.818 ± 0.462
LysTyr: 2.864 ± 1.376
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.273 ± 0.613
LeuCys: 1.227 ± 0.589
LeuAsp: 5.319 ± 0.991
LeuGlu: 6.137 ± 0.902
LeuPhe: 6.956 ± 1.726
LeuGly: 6.547 ± 2.385
LeuHis: 1.637 ± 0.826
LeuIle: 4.092 ± 1.193
LeuLys: 5.319 ± 1.416
LeuLeu: 9.002 ± 1.858
LeuMet: 1.227 ± 0.594
LeuAsn: 2.046 ± 0.786
LeuPro: 5.319 ± 1.062
LeuGln: 7.365 ± 1.485
LeuArg: 2.046 ± 1.369
LeuSer: 5.319 ± 1.276
LeuThr: 6.137 ± 0.871
LeuVal: 5.319 ± 2.052
LeuTrp: 0.818 ± 0.443
LeuTyr: 4.501 ± 0.838
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.455 ± 0.905
MetCys: 0.818 ± 0.369
MetAsp: 0.409 ± 0.557
MetGlu: 1.227 ± 0.581
MetPhe: 0.0 ± 0.0
MetGly: 0.409 ± 0.349
MetHis: 0.818 ± 0.802
MetIle: 0.0 ± 0.0
MetLys: 1.227 ± 1.07
MetLeu: 1.227 ± 0.718
MetMet: 0.0 ± 0.0
MetAsn: 0.818 ± 0.369
MetPro: 0.409 ± 0.349
MetGln: 0.818 ± 0.568
MetArg: 0.818 ± 0.697
MetSer: 0.818 ± 0.422
MetThr: 0.409 ± 0.356
MetVal: 0.818 ± 0.369
MetTrp: 0.409 ± 0.393
MetTyr: 1.227 ± 0.434
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.273 ± 1.138
AsnCys: 1.637 ± 1.084
AsnAsp: 3.273 ± 0.965
AsnGlu: 3.273 ± 0.874
AsnPhe: 1.637 ± 1.192
AsnGly: 1.637 ± 1.084
AsnHis: 0.409 ± 0.349
AsnIle: 2.455 ± 0.679
AsnLys: 4.092 ± 1.174
AsnLeu: 4.91 ± 0.68
AsnMet: 1.227 ± 0.725
AsnAsn: 3.682 ± 1.384
AsnPro: 2.455 ± 1.454
AsnGln: 2.864 ± 1.077
AsnArg: 2.455 ± 0.803
AsnSer: 2.864 ± 0.748
AsnThr: 4.092 ± 1.34
AsnVal: 3.682 ± 0.976
AsnTrp: 0.818 ± 0.462
AsnTyr: 2.046 ± 0.436
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.864 ± 1.127
ProCys: 0.409 ± 0.487
ProAsp: 3.273 ± 1.181
ProGlu: 2.864 ± 1.205
ProPhe: 0.409 ± 0.393
ProGly: 2.864 ± 1.718
ProHis: 0.0 ± 0.0
ProIle: 3.682 ± 1.138
ProLys: 4.91 ± 1.004
ProLeu: 8.183 ± 1.592
ProMet: 0.409 ± 0.379
ProAsn: 2.046 ± 0.872
ProPro: 7.774 ± 4.172
ProGln: 2.864 ± 1.327
ProArg: 2.046 ± 0.815
ProSer: 3.682 ± 2.083
ProThr: 3.273 ± 1.367
ProVal: 2.864 ± 1.117
ProTrp: 0.0 ± 0.0
ProTyr: 2.864 ± 1.105
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.682 ± 0.934
GlnCys: 1.637 ± 0.724
GlnAsp: 2.455 ± 0.466
GlnGlu: 0.409 ± 0.393
GlnPhe: 3.682 ± 0.886
GlnGly: 2.864 ± 0.718
GlnHis: 1.227 ± 0.831
GlnIle: 2.455 ± 0.509
GlnLys: 1.637 ± 0.615
GlnLeu: 6.137 ± 2.319
GlnMet: 2.455 ± 1.505
GlnAsn: 3.682 ± 1.689
GlnPro: 2.455 ± 0.574
GlnGln: 2.455 ± 0.668
GlnArg: 2.455 ± 0.788
GlnSer: 3.682 ± 0.483
GlnThr: 2.455 ± 1.038
GlnVal: 1.637 ± 0.738
GlnTrp: 0.818 ± 0.645
GlnTyr: 1.637 ± 1.03
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.682 ± 1.506
ArgCys: 0.818 ± 0.663
ArgAsp: 3.273 ± 1.06
ArgGlu: 2.046 ± 0.576
ArgPhe: 2.046 ± 1.133
ArgGly: 4.092 ± 1.297
ArgHis: 2.046 ± 0.475
ArgIle: 2.046 ± 0.733
ArgLys: 4.092 ± 0.727
ArgLeu: 5.728 ± 1.306
ArgMet: 0.409 ± 0.393
ArgAsn: 4.092 ± 1.209
ArgPro: 2.455 ± 1.256
ArgGln: 1.637 ± 0.605
ArgArg: 7.365 ± 2.313
ArgSer: 4.092 ± 0.65
ArgThr: 3.273 ± 0.915
ArgVal: 3.682 ± 0.919
ArgTrp: 0.0 ± 0.0
ArgTyr: 0.409 ± 0.393
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.864 ± 0.979
SerCys: 0.409 ± 0.487
SerAsp: 4.092 ± 1.554
SerGlu: 2.046 ± 0.985
SerPhe: 3.682 ± 0.765
SerGly: 5.728 ± 0.987
SerHis: 1.227 ± 0.434
SerIle: 3.682 ± 1.571
SerLys: 2.046 ± 0.909
SerLeu: 4.501 ± 1.675
SerMet: 1.227 ± 0.629
SerAsn: 4.092 ± 2.065
SerPro: 3.273 ± 1.599
SerGln: 2.864 ± 0.753
SerArg: 4.092 ± 1.079
SerSer: 4.092 ± 1.267
SerThr: 7.365 ± 2.585
SerVal: 5.728 ± 0.766
SerTrp: 0.818 ± 0.462
SerTyr: 1.637 ± 0.54
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.092 ± 1.244
ThrCys: 0.409 ± 0.349
ThrAsp: 5.319 ± 1.677
ThrGlu: 2.864 ± 1.003
ThrPhe: 2.046 ± 1.36
ThrGly: 6.137 ± 2.352
ThrHis: 0.409 ± 0.393
ThrIle: 5.319 ± 2.08
ThrLys: 3.273 ± 1.58
ThrLeu: 7.365 ± 1.948
ThrMet: 1.227 ± 0.761
ThrAsn: 2.046 ± 0.947
ThrPro: 4.092 ± 1.817
ThrGln: 2.864 ± 0.965
ThrArg: 2.455 ± 0.778
ThrSer: 4.092 ± 1.301
ThrThr: 4.092 ± 1.581
ThrVal: 3.682 ± 1.339
ThrTrp: 0.818 ± 0.462
ThrTyr: 2.046 ± 0.475
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.319 ± 1.126
ValCys: 1.637 ± 0.921
ValAsp: 6.956 ± 3.127
ValGlu: 3.682 ± 1.296
ValPhe: 0.818 ± 0.568
ValGly: 2.455 ± 1.63
ValHis: 1.227 ± 0.748
ValIle: 4.092 ± 1.842
ValLys: 2.455 ± 0.977
ValLeu: 4.092 ± 1.126
ValMet: 0.409 ± 0.356
ValAsn: 1.637 ± 0.622
ValPro: 4.501 ± 1.293
ValGln: 3.273 ± 1.546
ValArg: 3.682 ± 0.876
ValSer: 4.501 ± 0.758
ValThr: 5.728 ± 1.578
ValVal: 2.046 ± 0.936
ValTrp: 1.227 ± 0.704
ValTyr: 1.637 ± 0.741
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 0.818 ± 0.491
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.818 ± 0.785
TrpGly: 1.227 ± 0.593
TrpHis: 0.409 ± 0.393
TrpIle: 0.818 ± 0.697
TrpLys: 1.637 ± 0.722
TrpLeu: 1.637 ± 0.63
TrpMet: 0.409 ± 0.348
TrpAsn: 0.818 ± 0.491
TrpPro: 0.0 ± 0.0
TrpGln: 0.409 ± 0.356
TrpArg: 1.227 ± 0.704
TrpSer: 0.818 ± 0.462
TrpThr: 0.818 ± 0.491
TrpVal: 0.818 ± 0.462
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.409 ± 0.349
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.637 ± 0.615
TyrCys: 1.227 ± 0.593
TyrAsp: 1.637 ± 0.845
TyrGlu: 0.818 ± 0.697
TyrPhe: 2.864 ± 1.254
TyrGly: 3.273 ± 0.859
TyrHis: 0.0 ± 0.0
TyrIle: 0.818 ± 0.627
TyrLys: 2.455 ± 0.903
TyrLeu: 3.682 ± 1.216
TyrMet: 0.409 ± 0.393
TyrAsn: 2.864 ± 0.597
TyrPro: 1.637 ± 0.876
TyrGln: 0.818 ± 0.491
TyrArg: 2.864 ± 1.13
TyrSer: 1.227 ± 0.459
TyrThr: 2.864 ± 1.175
TyrVal: 2.046 ± 0.947
TyrTrp: 0.818 ± 0.491
TyrTyr: 2.046 ± 1.166
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2445 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
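Bootstrapping at the protein level means resampling whole proteins with replacement and recomputing the statistic for each resample. The sketch below illustrates that idea only: the function names, the one-letter sequences, the per-dipeptide normalization, and the use of the standard deviation as the error measure are all assumptions, since the page does not describe the exact procedure.

```python
import random
from collections import Counter
from statistics import stdev


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the result is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return stdev(estimates)


toy_proteome = ["MAAK", "AAL", "GGAAV"]  # made-up sequences
print(bootstrap_error(toy_proteome, "AA"))
```

With only 7 proteins in this proteome, resamples can differ considerably from one another, which is consistent with the fairly large errors reported for many dipeptides.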

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in UniProt.
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski