Amino acid dipepetide frequency for human papillomavirus 74

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.027AlaAla: 6.027 ± 1.028
1.205AlaCys: 1.205 ± 0.779
3.214AlaAsp: 3.214 ± 1.163
3.214AlaGlu: 3.214 ± 0.696
3.214AlaPhe: 3.214 ± 1.43
2.009AlaGly: 2.009 ± 0.378
1.205AlaHis: 1.205 ± 0.647
4.419AlaIle: 4.419 ± 0.6
2.411AlaLys: 2.411 ± 0.808
4.821AlaLeu: 4.821 ± 0.959
1.205AlaMet: 1.205 ± 0.709
3.616AlaAsn: 3.616 ± 1.403
4.821AlaPro: 4.821 ± 1.604
2.411AlaGln: 2.411 ± 0.414
3.616AlaArg: 3.616 ± 0.647
4.419AlaSer: 4.419 ± 0.751
4.821AlaThr: 4.821 ± 1.191
2.812AlaVal: 2.812 ± 0.557
0.402AlaTrp: 0.402 ± 0.296
2.009AlaTyr: 2.009 ± 0.65
0.0AlaXaa: 0.0 ± 0.0
Cys
2.009CysAla: 2.009 ± 0.85
0.804CysCys: 0.804 ± 0.944
0.804CysAsp: 0.804 ± 0.592
0.402CysGlu: 0.402 ± 0.296
2.009CysPhe: 2.009 ± 0.779
0.804CysGly: 0.804 ± 0.592
0.804CysHis: 0.804 ± 0.947
1.205CysIle: 1.205 ± 0.729
2.812CysLys: 2.812 ± 0.468
1.205CysLeu: 1.205 ± 1.014
1.205CysMet: 1.205 ± 0.535
2.009CysAsn: 2.009 ± 1.079
2.411CysPro: 2.411 ± 0.8
1.205CysGln: 1.205 ± 0.399
0.804CysArg: 0.804 ± 0.621
2.009CysSer: 2.009 ± 0.551
1.205CysThr: 1.205 ± 0.521
3.214CysVal: 3.214 ± 1.429
1.205CysTrp: 1.205 ± 0.535
1.607CysTyr: 1.607 ± 0.812
0.0CysXaa: 0.0 ± 0.0
Asp
3.214AspAla: 3.214 ± 1.163
2.411AspCys: 2.411 ± 0.786
2.009AspAsp: 2.009 ± 0.774
2.812AspGlu: 2.812 ± 1.376
0.804AspPhe: 0.804 ± 0.533
2.411AspGly: 2.411 ± 0.677
0.804AspHis: 0.804 ± 0.45
5.223AspIle: 5.223 ± 1.798
0.804AspLys: 0.804 ± 0.461
3.214AspLeu: 3.214 ± 1.045
1.607AspMet: 1.607 ± 0.536
3.616AspAsn: 3.616 ± 0.833
3.616AspPro: 3.616 ± 1.249
1.205AspGln: 1.205 ± 0.58
1.607AspArg: 1.607 ± 0.845
2.812AspSer: 2.812 ± 0.821
5.625AspThr: 5.625 ± 0.907
4.018AspVal: 4.018 ± 0.821
0.804AspTrp: 0.804 ± 0.375
2.009AspTyr: 2.009 ± 0.879
0.0AspXaa: 0.0 ± 0.0
Glu
3.214GluAla: 3.214 ± 1.087
0.0GluCys: 0.0 ± 0.0
4.821GluAsp: 4.821 ± 1.302
4.821GluGlu: 4.821 ± 1.459
0.804GluPhe: 0.804 ± 0.375
2.411GluGly: 2.411 ± 0.801
1.607GluHis: 1.607 ± 0.536
2.411GluIle: 2.411 ± 1.014
2.009GluLys: 2.009 ± 0.915
3.616GluLeu: 3.616 ± 0.861
1.607GluMet: 1.607 ± 1.072
1.607GluAsn: 1.607 ± 0.523
2.009GluPro: 2.009 ± 0.619
2.411GluGln: 2.411 ± 0.839
0.0GluArg: 0.0 ± 0.0
3.214GluSer: 3.214 ± 0.936
4.018GluThr: 4.018 ± 1.228
4.419GluVal: 4.419 ± 0.645
0.402GluTrp: 0.402 ± 0.438
0.402GluTyr: 0.402 ± 0.438
0.0GluXaa: 0.0 ± 0.0
Phe
2.812PheAla: 2.812 ± 1.161
0.804PheCys: 0.804 ± 0.524
2.411PheAsp: 2.411 ± 0.774
1.205PheGlu: 1.205 ± 0.385
2.812PhePhe: 2.812 ± 0.925
1.607PheGly: 1.607 ± 0.627
0.0PheHis: 0.0 ± 0.0
3.616PheIle: 3.616 ± 0.647
2.411PheLys: 2.411 ± 1.025
4.018PheLeu: 4.018 ± 1.067
0.402PheMet: 0.402 ± 0.337
1.607PheAsn: 1.607 ± 1.072
2.411PhePro: 2.411 ± 0.707
2.009PheGln: 2.009 ± 0.922
2.411PheArg: 2.411 ± 0.407
2.009PheSer: 2.009 ± 0.613
1.607PheThr: 1.607 ± 0.746
1.205PheVal: 1.205 ± 0.892
1.205PheTrp: 1.205 ± 0.654
1.607PheTyr: 1.607 ± 0.697
0.0PheXaa: 0.0 ± 0.0
Gly
0.804GlyAla: 0.804 ± 0.433
1.205GlyCys: 1.205 ± 0.583
4.419GlyAsp: 4.419 ± 0.772
2.009GlyGlu: 2.009 ± 0.684
0.804GlyPhe: 0.804 ± 0.433
4.018GlyGly: 4.018 ± 1.535
2.009GlyHis: 2.009 ± 0.457
3.214GlyIle: 3.214 ± 1.347
3.616GlyLys: 3.616 ± 0.965
4.419GlyLeu: 4.419 ± 1.154
1.205GlyMet: 1.205 ± 0.838
4.018GlyAsn: 4.018 ± 1.066
2.812GlyPro: 2.812 ± 1.201
2.812GlyGln: 2.812 ± 0.574
3.214GlyArg: 3.214 ± 0.605
3.214GlySer: 3.214 ± 1.005
6.428GlyThr: 6.428 ± 1.266
1.205GlyVal: 1.205 ± 0.387
0.402GlyTrp: 0.402 ± 0.296
2.411GlyTyr: 2.411 ± 0.64
0.0GlyXaa: 0.0 ± 0.0
His
1.607HisAla: 1.607 ± 0.622
0.402HisCys: 0.402 ± 0.474
0.402HisAsp: 0.402 ± 0.438
0.804HisGlu: 0.804 ± 0.877
2.009HisPhe: 2.009 ± 0.68
2.009HisGly: 2.009 ± 0.574
1.205HisHis: 1.205 ± 0.399
2.009HisIle: 2.009 ± 0.796
1.607HisLys: 1.607 ± 0.91
1.607HisLeu: 1.607 ± 1.146
0.402HisMet: 0.402 ± 0.296
3.214HisAsn: 3.214 ± 1.174
1.205HisPro: 1.205 ± 0.697
0.804HisGln: 0.804 ± 0.373
0.804HisArg: 0.804 ± 0.622
0.804HisSer: 0.804 ± 0.373
2.812HisThr: 2.812 ± 1.283
2.009HisVal: 2.009 ± 0.674
1.607HisTrp: 1.607 ± 0.75
1.607HisTyr: 1.607 ± 0.865
0.0HisXaa: 0.0 ± 0.0
Ile
4.018IleAla: 4.018 ± 1.519
1.607IleCys: 1.607 ± 0.759
2.812IleAsp: 2.812 ± 1.533
2.411IleGlu: 2.411 ± 0.787
0.804IlePhe: 0.804 ± 0.683
2.812IleGly: 2.812 ± 1.198
1.607IleHis: 1.607 ± 0.627
2.411IleIle: 2.411 ± 1.289
3.214IleLys: 3.214 ± 1.036
5.223IleLeu: 5.223 ± 2.078
0.804IleMet: 0.804 ± 0.447
1.205IleAsn: 1.205 ± 0.798
4.419IlePro: 4.419 ± 1.246
3.214IleGln: 3.214 ± 0.844
1.607IleArg: 1.607 ± 0.719
3.616IleSer: 3.616 ± 0.732
6.027IleThr: 6.027 ± 1.43
5.625IleVal: 5.625 ± 1.569
0.0IleTrp: 0.0 ± 0.0
1.607IleTyr: 1.607 ± 0.622
0.0IleXaa: 0.0 ± 0.0
Lys
2.009LysAla: 2.009 ± 0.963
2.411LysCys: 2.411 ± 1.025
2.812LysAsp: 2.812 ± 1.365
1.607LysGlu: 1.607 ± 0.731
2.411LysPhe: 2.411 ± 0.877
2.009LysGly: 2.009 ± 0.963
4.419LysHis: 4.419 ± 1.853
0.804LysIle: 0.804 ± 0.373
2.411LysLys: 2.411 ± 0.91
3.616LysLeu: 3.616 ± 0.901
0.402LysMet: 0.402 ± 0.296
2.411LysAsn: 2.411 ± 1.212
2.411LysPro: 2.411 ± 1.311
3.214LysGln: 3.214 ± 1.505
3.214LysArg: 3.214 ± 0.711
2.812LysSer: 2.812 ± 1.026
2.009LysThr: 2.009 ± 0.887
4.419LysVal: 4.419 ± 1.065
0.402LysTrp: 0.402 ± 0.337
3.616LysTyr: 3.616 ± 1.325
0.0LysXaa: 0.0 ± 0.0
Leu
4.018LeuAla: 4.018 ± 0.922
4.821LeuCys: 4.821 ± 1.748
5.625LeuAsp: 5.625 ± 0.675
5.223LeuGlu: 5.223 ± 1.282
3.616LeuPhe: 3.616 ± 1.006
5.223LeuGly: 5.223 ± 0.685
6.428LeuHis: 6.428 ± 1.792
5.223LeuIle: 5.223 ± 1.836
4.821LeuLys: 4.821 ± 0.855
10.848LeuLeu: 10.848 ± 2.829
0.804LeuMet: 0.804 ± 0.554
1.607LeuAsn: 1.607 ± 0.845
3.616LeuPro: 3.616 ± 1.058
7.232LeuGln: 7.232 ± 2.05
1.607LeuArg: 1.607 ± 1.018
4.821LeuSer: 4.821 ± 1.053
4.018LeuThr: 4.018 ± 1.372
4.419LeuVal: 4.419 ± 1.158
0.804LeuTrp: 0.804 ± 0.533
4.821LeuTyr: 4.821 ± 1.412
0.0LeuXaa: 0.0 ± 0.0
Met
1.607MetAla: 1.607 ± 0.627
0.402MetCys: 0.402 ± 0.296
1.205MetAsp: 1.205 ± 0.504
2.411MetGlu: 2.411 ± 1.474
1.607MetPhe: 1.607 ± 0.865
0.804MetGly: 0.804 ± 0.592
1.205MetHis: 1.205 ± 0.77
0.0MetIle: 0.0 ± 0.0
0.804MetLys: 0.804 ± 0.592
0.804MetLeu: 0.804 ± 0.592
0.0MetMet: 0.0 ± 0.0
1.205MetAsn: 1.205 ± 0.654
0.0MetPro: 0.0 ± 0.0
0.804MetGln: 0.804 ± 0.877
0.804MetArg: 0.804 ± 0.375
1.205MetSer: 1.205 ± 0.623
0.804MetThr: 0.804 ± 0.461
2.411MetVal: 2.411 ± 1.126
1.205MetTrp: 1.205 ± 0.832
0.402MetTyr: 0.402 ± 0.448
0.0MetXaa: 0.0 ± 0.0
Asn
3.616AsnAla: 3.616 ± 1.666
1.607AsnCys: 1.607 ± 0.9
0.402AsnAsp: 0.402 ± 0.296
1.205AsnGlu: 1.205 ± 0.612
1.607AsnPhe: 1.607 ± 0.759
2.812AsnGly: 2.812 ± 0.615
0.402AsnHis: 0.402 ± 0.438
3.616AsnIle: 3.616 ± 1.27
2.812AsnLys: 2.812 ± 1.167
3.616AsnLeu: 3.616 ± 1.121
1.607AsnMet: 1.607 ± 0.651
3.616AsnAsn: 3.616 ± 3.03
4.419AsnPro: 4.419 ± 1.511
1.607AsnGln: 1.607 ± 0.729
1.607AsnArg: 1.607 ± 0.731
3.616AsnSer: 3.616 ± 1.454
4.821AsnThr: 4.821 ± 1.729
0.804AsnVal: 0.804 ± 0.683
0.804AsnTrp: 0.804 ± 0.592
0.402AsnTyr: 0.402 ± 0.474
0.0AsnXaa: 0.0 ± 0.0
Pro
4.821ProAla: 4.821 ± 1.771
0.804ProCys: 0.804 ± 0.491
4.821ProAsp: 4.821 ± 1.616
2.411ProGlu: 2.411 ± 1.096
2.411ProPhe: 2.411 ± 0.797
2.411ProGly: 2.411 ± 1.003
0.0ProHis: 0.0 ± 0.0
3.214ProIle: 3.214 ± 0.729
2.812ProLys: 2.812 ± 0.591
8.437ProLeu: 8.437 ± 2.184
0.804ProMet: 0.804 ± 0.896
2.009ProAsn: 2.009 ± 0.879
10.044ProPro: 10.044 ± 3.758
1.205ProGln: 1.205 ± 0.62
2.009ProArg: 2.009 ± 0.756
4.821ProSer: 4.821 ± 2.301
6.83ProThr: 6.83 ± 2.443
4.419ProVal: 4.419 ± 1.956
0.804ProTrp: 0.804 ± 0.592
2.411ProTyr: 2.411 ± 0.966
0.0ProXaa: 0.0 ± 0.0
Gln
2.812GlnAla: 2.812 ± 0.794
1.205GlnCys: 1.205 ± 0.689
3.214GlnAsp: 3.214 ± 0.795
0.804GlnGlu: 0.804 ± 0.45
3.616GlnPhe: 3.616 ± 0.638
2.812GlnGly: 2.812 ± 1.155
1.205GlnHis: 1.205 ± 0.648
2.009GlnIle: 2.009 ± 0.878
1.205GlnLys: 1.205 ± 1.024
4.821GlnLeu: 4.821 ± 1.686
2.009GlnMet: 2.009 ± 1.113
1.205GlnAsn: 1.205 ± 0.887
3.214GlnPro: 3.214 ± 1.02
2.411GlnGln: 2.411 ± 1.025
2.411GlnArg: 2.411 ± 0.742
3.214GlnSer: 3.214 ± 0.864
4.419GlnThr: 4.419 ± 0.863
2.812GlnVal: 2.812 ± 0.625
1.607GlnTrp: 1.607 ± 0.865
0.804GlnTyr: 0.804 ± 0.683
0.0GlnXaa: 0.0 ± 0.0
Arg
3.214ArgAla: 3.214 ± 0.773
2.009ArgCys: 2.009 ± 1.096
0.804ArgAsp: 0.804 ± 0.45
0.804ArgGlu: 0.804 ± 0.592
0.804ArgPhe: 0.804 ± 0.552
2.812ArgGly: 2.812 ± 0.849
2.009ArgHis: 2.009 ± 1.065
1.607ArgIle: 1.607 ± 0.843
3.616ArgLys: 3.616 ± 1.099
5.223ArgLeu: 5.223 ± 0.745
0.804ArgMet: 0.804 ± 0.382
1.205ArgAsn: 1.205 ± 0.583
3.616ArgPro: 3.616 ± 1.475
1.607ArgGln: 1.607 ± 0.9
2.411ArgArg: 2.411 ± 1.448
4.419ArgSer: 4.419 ± 0.907
1.205ArgThr: 1.205 ± 0.387
2.009ArgVal: 2.009 ± 0.933
0.0ArgTrp: 0.0 ± 0.0
0.804ArgTyr: 0.804 ± 0.461
0.0ArgXaa: 0.0 ± 0.0
Ser
6.428SerAla: 6.428 ± 1.892
1.607SerCys: 1.607 ± 1.047
2.411SerAsp: 2.411 ± 0.774
3.214SerGlu: 3.214 ± 1.011
1.607SerPhe: 1.607 ± 0.775
4.821SerGly: 4.821 ± 1.974
1.607SerHis: 1.607 ± 0.746
4.018SerIle: 4.018 ± 1.504
2.009SerLys: 2.009 ± 0.761
6.428SerLeu: 6.428 ± 1.16
0.402SerMet: 0.402 ± 0.337
3.616SerAsn: 3.616 ± 1.454
3.616SerPro: 3.616 ± 0.727
2.411SerGln: 2.411 ± 0.485
3.214SerArg: 3.214 ± 1.074
9.642SerSer: 9.642 ± 2.49
8.437SerThr: 8.437 ± 2.283
4.419SerVal: 4.419 ± 0.765
0.402SerTrp: 0.402 ± 0.337
2.009SerTyr: 2.009 ± 0.779
0.0SerXaa: 0.0 ± 0.0
Thr
2.411ThrAla: 2.411 ± 0.544
2.411ThrCys: 2.411 ± 0.8
3.616ThrAsp: 3.616 ± 0.954
2.009ThrGlu: 2.009 ± 0.865
2.411ThrPhe: 2.411 ± 0.849
4.821ThrGly: 4.821 ± 1.342
0.804ThrHis: 0.804 ± 0.373
3.616ThrIle: 3.616 ± 1.545
2.812ThrLys: 2.812 ± 0.769
6.428ThrLeu: 6.428 ± 2.946
1.205ThrMet: 1.205 ± 0.492
3.616ThrAsn: 3.616 ± 2.393
6.83ThrPro: 6.83 ± 2.994
4.821ThrGln: 4.821 ± 1.509
4.018ThrArg: 4.018 ± 0.685
6.83ThrSer: 6.83 ± 1.915
12.053ThrThr: 12.053 ± 3.101
10.044ThrVal: 10.044 ± 1.348
1.607ThrTrp: 1.607 ± 0.771
2.812ThrTyr: 2.812 ± 0.704
0.0ThrXaa: 0.0 ± 0.0
Val
3.214ValAla: 3.214 ± 1.318
3.214ValCys: 3.214 ± 1.985
3.616ValAsp: 3.616 ± 0.952
6.83ValGlu: 6.83 ± 1.304
2.812ValPhe: 2.812 ± 0.888
3.616ValGly: 3.616 ± 1.443
0.804ValHis: 0.804 ± 0.373
2.411ValIle: 2.411 ± 0.569
2.411ValLys: 2.411 ± 0.736
5.223ValLeu: 5.223 ± 1.441
1.607ValMet: 1.607 ± 0.296
2.812ValAsn: 2.812 ± 1.301
4.018ValPro: 4.018 ± 1.254
5.223ValGln: 5.223 ± 1.022
2.812ValArg: 2.812 ± 0.596
7.232ValSer: 7.232 ± 1.829
4.018ValThr: 4.018 ± 1.037
3.214ValVal: 3.214 ± 0.58
0.402ValTrp: 0.402 ± 0.341
2.812ValTyr: 2.812 ± 0.933
0.0ValXaa: 0.0 ± 0.0
Trp
0.804TrpAla: 0.804 ± 0.375
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.804TrpGlu: 0.804 ± 0.461
0.804TrpPhe: 0.804 ± 0.592
2.009TrpGly: 2.009 ± 0.766
0.402TrpHis: 0.402 ± 0.438
0.804TrpIle: 0.804 ± 0.592
2.009TrpLys: 2.009 ± 0.948
2.411TrpLeu: 2.411 ± 0.902
0.0TrpMet: 0.0 ± 0.0
0.402TrpAsn: 0.402 ± 0.341
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.402TrpArg: 0.402 ± 0.341
0.0TrpSer: 0.0 ± 0.0
2.009TrpThr: 2.009 ± 1.254
1.205TrpVal: 1.205 ± 0.623
0.0TrpTrp: 0.0 ± 0.0
0.402TrpTyr: 0.402 ± 0.337
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.214TyrAla: 3.214 ± 0.811
0.804TyrCys: 0.804 ± 0.877
1.205TyrAsp: 1.205 ± 0.623
0.804TyrGlu: 0.804 ± 0.447
1.205TyrPhe: 1.205 ± 0.703
2.009TyrGly: 2.009 ± 0.779
0.402TyrHis: 0.402 ± 0.341
3.214TyrIle: 3.214 ± 0.773
2.812TyrLys: 2.812 ± 0.801
3.616TyrLeu: 3.616 ± 0.855
1.205TyrMet: 1.205 ± 0.399
0.402TyrAsn: 0.402 ± 0.438
2.411TyrPro: 2.411 ± 0.82
1.205TyrGln: 1.205 ± 0.605
2.411TyrArg: 2.411 ± 0.798
1.607TyrSer: 1.607 ± 0.733
2.009TyrThr: 2.009 ± 1.041
3.616TyrVal: 3.616 ± 0.851
0.402TyrTrp: 0.402 ± 0.296
0.804TyrTyr: 0.804 ± 0.877
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (2490 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski