Amino acid dipeptide frequency for Human papillomavirus 135

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins uniformly at random, each amino acid dipeptide would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
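As a minimal sketch of how such frequencies could be computed, the Python snippet below counts overlapping dipeptides and compares each frequency with the uniform expectation of 1/400. The function names, the choice to count pairs only within each protein, and the normalization by the total number of counted pairs are assumptions made for illustration; the page does not describe the exact procedure used to build the table.

```python
from collections import Counter

# One-letter to three-letter codes for the 20 standard amino acids.
# Anything else is mapped to Xaa, the catch-all 21st symbol.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}

# Uniform expectation: 1/400 = 0.25% = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400


def dipeptide_frequencies(proteins):
    """Return per-mille frequencies of overlapping dipeptides.

    Assumption: pairs are counted within each protein (never across
    protein boundaries) and normalized by the total number of pairs.
    """
    counts = Counter()
    for seq in proteins:
        codes = [ONE_TO_THREE.get(aa, "Xaa") for aa in seq.upper()]
        counts.update(a + b for a, b in zip(codes, codes[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille, expected=UNIFORM_PER_MILLE):
    """Label a frequency relative to the expected value."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"
```

For example, `classify(5.392)` labels the AlaAla value from the table as overrepresented relative to the uniform expectation, while `classify(0.415)` labels AlaHis as underrepresented.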

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
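The unit conversion can be illustrated with a few small helpers. This is only a sketch; the function names are hypothetical and not part of Proteome-pI.

```python
def per_mille_to_fraction(value_per_mille: float) -> float:
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille: float) -> float:
    """Convert a per-mille value to percent (1 percent = 10 per mille)."""
    return value_per_mille / 10.0


def percent_to_per_mille(value_percent: float) -> float:
    """Convert a percent value to per mille."""
    return value_percent * 10.0
```

With these, the AlaAla entry of 5.392‰ corresponds to a fraction of 0.005392, or about 0.54%, and the uniform expectation of 0.25% corresponds to 2.5‰ in the units of the table.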

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.392 ± 2.976
AlaCys: 1.244 ± 0.971
AlaAsp: 5.392 ± 1.466
AlaGlu: 2.903 ± 1.509
AlaPhe: 2.074 ± 0.444
AlaGly: 0.83 ± 0.396
AlaHis: 0.415 ± 0.355
AlaIle: 1.244 ± 0.385
AlaLys: 3.733 ± 1.159
AlaLeu: 6.221 ± 1.902
AlaMet: 0.83 ± 0.711
AlaAsn: 2.489 ± 0.556
AlaPro: 2.074 ± 0.933
AlaGln: 2.489 ± 1.188
AlaArg: 2.489 ± 1.002
AlaSer: 5.392 ± 0.913
AlaThr: 2.903 ± 0.697
AlaVal: 2.903 ± 0.929
AlaTrp: 0.83 ± 0.435
AlaTyr: 2.074 ± 0.367
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.659 ± 1.372
CysCys: 1.659 ± 1.009
CysAsp: 1.659 ± 1.009
CysGlu: 1.244 ± 0.63
CysPhe: 0.83 ± 0.592
CysGly: 0.415 ± 0.393
CysHis: 0.415 ± 0.324
CysIle: 2.074 ± 1.107
CysLys: 0.83 ± 0.561
CysLeu: 0.83 ± 0.592
CysMet: 0.0 ± 0.0
CysAsn: 0.415 ± 0.355
CysPro: 1.244 ± 0.697
CysGln: 0.415 ± 0.393
CysArg: 0.83 ± 0.544
CysSer: 2.074 ± 1.107
CysThr: 0.83 ± 0.648
CysVal: 1.659 ± 0.547
CysTrp: 0.83 ± 0.436
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.733 ± 0.782
AspCys: 1.659 ± 0.891
AspAsp: 4.977 ± 1.625
AspGlu: 5.392 ± 1.596
AspPhe: 3.733 ± 1.561
AspGly: 2.489 ± 1.085
AspHis: 1.659 ± 0.673
AspIle: 6.636 ± 1.544
AspLys: 2.074 ± 1.07
AspLeu: 6.636 ± 1.729
AspMet: 2.074 ± 0.91
AspAsn: 2.903 ± 0.421
AspPro: 5.807 ± 0.797
AspGln: 2.074 ± 0.97
AspArg: 3.318 ± 0.536
AspSer: 1.659 ± 1.098
AspThr: 6.221 ± 1.202
AspVal: 6.636 ± 1.743
AspTrp: 0.83 ± 0.648
AspTyr: 0.415 ± 0.355
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.074 ± 0.801
GluCys: 1.244 ± 0.732
GluAsp: 4.562 ± 1.246
GluGlu: 8.295 ± 1.983
GluPhe: 2.489 ± 0.98
GluGly: 2.903 ± 1.436
GluHis: 1.244 ± 0.63
GluIle: 2.074 ± 1.248
GluLys: 2.489 ± 1.33
GluLeu: 4.562 ± 1.391
GluMet: 0.415 ± 0.324
GluAsn: 4.562 ± 2.536
GluPro: 3.318 ± 0.937
GluGln: 2.074 ± 0.651
GluArg: 2.074 ± 0.506
GluSer: 2.074 ± 0.763
GluThr: 2.903 ± 0.437
GluVal: 2.489 ± 0.785
GluTrp: 0.83 ± 0.435
GluTyr: 2.903 ± 1.3
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.83 ± 0.561
PheCys: 0.83 ± 0.561
PheAsp: 2.074 ± 0.706
PheGlu: 2.489 ± 0.898
PhePhe: 4.148 ± 1.288
PheGly: 4.148 ± 1.538
PheHis: 0.83 ± 0.752
PheIle: 1.244 ± 0.604
PheLys: 4.562 ± 1.571
PheLeu: 4.148 ± 1.474
PheMet: 0.415 ± 0.346
PheAsn: 4.977 ± 1.673
PhePro: 2.489 ± 0.881
PheGln: 2.489 ± 0.754
PheArg: 1.244 ± 0.493
PheSer: 2.074 ± 0.813
PheThr: 1.244 ± 0.604
PheVal: 3.318 ± 1.489
PheTrp: 0.83 ± 0.396
PheTyr: 2.903 ± 1.333
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.318 ± 1.128
GlyCys: 0.415 ± 0.355
GlyAsp: 5.392 ± 1.338
GlyGlu: 3.733 ± 0.857
GlyPhe: 1.659 ± 0.215
GlyGly: 4.977 ± 1.947
GlyHis: 1.659 ± 0.8
GlyIle: 3.318 ± 1.124
GlyLys: 4.562 ± 0.692
GlyLeu: 2.903 ± 1.118
GlyMet: 0.415 ± 0.355
GlyAsn: 2.074 ± 0.978
GlyPro: 3.733 ± 1.188
GlyGln: 1.659 ± 1.052
GlyArg: 3.318 ± 0.674
GlySer: 4.562 ± 0.951
GlyThr: 4.148 ± 1.075
GlyVal: 1.659 ± 0.717
GlyTrp: 0.0 ± 0.0
GlyTyr: 1.244 ± 0.604
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.0 ± 0.0
HisAsp: 0.83 ± 0.435
HisGlu: 0.83 ± 0.648
HisPhe: 0.0 ± 0.0
HisGly: 0.83 ± 0.396
HisHis: 0.83 ± 0.493
HisIle: 1.244 ± 0.679
HisLys: 2.074 ± 1.08
HisLeu: 2.074 ± 0.835
HisMet: 0.415 ± 0.324
HisAsn: 2.489 ± 0.865
HisPro: 2.074 ± 1.101
HisGln: 0.83 ± 0.509
HisArg: 1.659 ± 1.279
HisSer: 1.244 ± 0.564
HisThr: 1.244 ± 0.346
HisVal: 0.83 ± 0.396
HisTrp: 1.244 ± 0.591
HisTyr: 0.415 ± 0.366
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.318 ± 1.166
IleCys: 0.83 ± 0.493
IleAsp: 2.903 ± 0.687
IleGlu: 2.903 ± 0.965
IlePhe: 2.489 ± 0.985
IleGly: 3.733 ± 1.34
IleHis: 0.83 ± 0.542
IleIle: 2.903 ± 0.898
IleLys: 0.83 ± 0.752
IleLeu: 4.148 ± 0.955
IleMet: 0.415 ± 0.324
IleAsn: 4.562 ± 1.335
IlePro: 2.903 ± 0.967
IleGln: 1.244 ± 0.778
IleArg: 3.318 ± 1.35
IleSer: 5.807 ± 1.037
IleThr: 4.977 ± 1.594
IleVal: 1.659 ± 0.921
IleTrp: 0.415 ± 0.485
IleTyr: 0.83 ± 0.436
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.903 ± 0.87
LysCys: 2.074 ± 0.98
LysAsp: 1.244 ± 0.493
LysGlu: 2.074 ± 0.97
LysPhe: 3.733 ± 1.548
LysGly: 3.733 ± 1.234
LysHis: 1.244 ± 0.668
LysIle: 3.318 ± 0.992
LysLys: 4.148 ± 1.424
LysLeu: 3.733 ± 1.309
LysMet: 1.659 ± 0.885
LysAsn: 2.903 ± 0.85
LysPro: 1.659 ± 0.791
LysGln: 1.244 ± 0.63
LysArg: 5.807 ± 0.767
LysSer: 3.733 ± 1.6
LysThr: 1.659 ± 0.891
LysVal: 4.977 ± 0.929
LysTrp: 1.244 ± 0.625
LysTyr: 2.074 ± 0.602
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.221 ± 0.768
LeuCys: 2.074 ± 1.089
LeuAsp: 9.125 ± 2.201
LeuGlu: 2.489 ± 0.594
LeuPhe: 4.562 ± 1.132
LeuGly: 5.807 ± 1.846
LeuHis: 2.903 ± 0.64
LeuIle: 2.489 ± 0.479
LeuLys: 5.807 ± 1.446
LeuLeu: 7.881 ± 2.61
LeuMet: 1.659 ± 0.827
LeuAsn: 1.659 ± 0.603
LeuPro: 4.562 ± 1.325
LeuGln: 5.807 ± 1.271
LeuArg: 3.733 ± 0.944
LeuSer: 6.221 ± 1.19
LeuThr: 4.562 ± 0.885
LeuVal: 5.807 ± 1.404
LeuTrp: 0.0 ± 0.0
LeuTyr: 6.221 ± 1.142
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.83 ± 0.396
MetCys: 0.0 ± 0.0
MetAsp: 0.0 ± 0.0
MetGlu: 0.83 ± 0.561
MetPhe: 0.415 ± 0.324
MetGly: 0.83 ± 0.396
MetHis: 0.0 ± 0.0
MetIle: 0.415 ± 0.376
MetLys: 0.83 ± 0.522
MetLeu: 1.659 ± 0.215
MetMet: 0.83 ± 0.435
MetAsn: 2.074 ± 0.933
MetPro: 0.415 ± 0.324
MetGln: 0.83 ± 0.604
MetArg: 0.415 ± 0.324
MetSer: 2.489 ± 1.188
MetThr: 0.415 ± 0.324
MetVal: 0.83 ± 0.648
MetTrp: 0.0 ± 0.0
MetTyr: 0.83 ± 0.435
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.318 ± 0.952
AsnCys: 1.244 ± 0.53
AsnAsp: 2.074 ± 0.806
AsnGlu: 1.244 ± 0.346
AsnPhe: 1.659 ± 0.791
AsnGly: 2.489 ± 0.965
AsnHis: 0.83 ± 0.648
AsnIle: 2.903 ± 0.773
AsnLys: 3.318 ± 0.905
AsnLeu: 4.562 ± 0.763
AsnMet: 0.415 ± 0.324
AsnAsn: 3.318 ± 1.584
AsnPro: 4.562 ± 1.526
AsnGln: 3.318 ± 1.323
AsnArg: 2.489 ± 0.52
AsnSer: 3.318 ± 0.982
AsnThr: 3.318 ± 1.284
AsnVal: 5.392 ± 0.85
AsnTrp: 0.83 ± 0.396
AsnTyr: 0.83 ± 0.542
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.659 ± 0.215
ProCys: 0.415 ± 0.355
ProAsp: 5.392 ± 1.24
ProGlu: 4.562 ± 1.663
ProPhe: 1.659 ± 0.891
ProGly: 1.244 ± 0.677
ProHis: 1.244 ± 1.033
ProIle: 2.489 ± 0.998
ProLys: 2.489 ± 0.775
ProLeu: 7.881 ± 2.08
ProMet: 0.415 ± 0.372
ProAsn: 2.074 ± 0.634
ProPro: 7.881 ± 1.827
ProGln: 2.074 ± 0.707
ProArg: 4.562 ± 2.123
ProSer: 6.636 ± 2.191
ProThr: 5.807 ± 1.902
ProVal: 1.244 ± 0.672
ProTrp: 0.415 ± 0.376
ProTyr: 3.318 ± 1.717
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.074 ± 0.706
GlnCys: 0.83 ± 0.648
GlnAsp: 3.318 ± 0.815
GlnGlu: 2.074 ± 1.227
GlnPhe: 1.659 ± 0.967
GlnGly: 1.659 ± 0.215
GlnHis: 0.415 ± 0.376
GlnIle: 3.318 ± 0.566
GlnLys: 2.074 ± 0.634
GlnLeu: 4.148 ± 1.723
GlnMet: 1.244 ± 0.971
GlnAsn: 2.074 ± 1.361
GlnPro: 2.903 ± 1.784
GlnGln: 3.733 ± 0.784
GlnArg: 2.074 ± 0.388
GlnSer: 1.659 ± 0.92
GlnThr: 2.489 ± 0.634
GlnVal: 2.903 ± 1.145
GlnTrp: 1.659 ± 0.87
GlnTyr: 0.83 ± 0.711
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.148 ± 0.671
ArgCys: 2.074 ± 1.338
ArgAsp: 1.659 ± 0.743
ArgGlu: 2.489 ± 1.33
ArgPhe: 2.489 ± 1.026
ArgGly: 4.977 ± 1.508
ArgHis: 2.074 ± 0.706
ArgIle: 1.659 ± 0.592
ArgLys: 3.318 ± 0.896
ArgLeu: 7.051 ± 0.97
ArgMet: 0.83 ± 0.435
ArgAsn: 1.659 ± 0.642
ArgPro: 3.318 ± 0.905
ArgGln: 2.074 ± 0.986
ArgArg: 3.733 ± 2.164
ArgSer: 3.318 ± 1.12
ArgThr: 4.562 ± 1.195
ArgVal: 2.074 ± 1.025
ArgTrp: 0.0 ± 0.0
ArgTyr: 2.489 ± 0.809
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.733 ± 0.829
SerCys: 0.415 ± 0.393
SerAsp: 3.318 ± 0.96
SerGlu: 3.733 ± 0.765
SerPhe: 3.733 ± 1.197
SerGly: 4.148 ± 0.758
SerHis: 2.074 ± 0.444
SerIle: 3.733 ± 1.561
SerLys: 4.148 ± 0.691
SerLeu: 7.466 ± 1.038
SerMet: 0.83 ± 0.648
SerAsn: 4.562 ± 1.264
SerPro: 4.562 ± 0.946
SerGln: 3.733 ± 1.4
SerArg: 4.977 ± 1.502
SerSer: 5.392 ± 2.119
SerThr: 4.562 ± 0.686
SerVal: 5.807 ± 0.569
SerTrp: 0.0 ± 0.0
SerTyr: 2.489 ± 1.159
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.148 ± 0.571
ThrCys: 1.244 ± 0.535
ThrAsp: 7.881 ± 0.911
ThrGlu: 2.074 ± 0.706
ThrPhe: 2.489 ± 1.278
ThrGly: 4.148 ± 0.887
ThrHis: 0.0 ± 0.0
ThrIle: 3.733 ± 0.801
ThrLys: 2.489 ± 0.632
ThrLeu: 4.977 ± 1.46
ThrMet: 0.0 ± 0.0
ThrAsn: 1.659 ± 0.731
ThrPro: 3.318 ± 0.929
ThrGln: 2.489 ± 0.492
ThrArg: 4.148 ± 0.741
ThrSer: 6.636 ± 1.053
ThrThr: 7.881 ± 2.704
ThrVal: 5.392 ± 1.673
ThrTrp: 0.415 ± 0.324
ThrTyr: 2.489 ± 0.836
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.489 ± 1.685
ValCys: 0.83 ± 0.592
ValAsp: 5.807 ± 0.892
ValGlu: 3.318 ± 0.566
ValPhe: 3.318 ± 1.128
ValGly: 2.903 ± 1.017
ValHis: 2.074 ± 0.834
ValIle: 4.562 ± 1.324
ValLys: 3.733 ± 0.751
ValLeu: 4.562 ± 0.573
ValMet: 1.659 ± 0.599
ValAsn: 1.659 ± 0.515
ValPro: 4.977 ± 1.729
ValGln: 2.489 ± 1.207
ValArg: 1.659 ± 0.845
ValSer: 4.977 ± 1.794
ValThr: 4.148 ± 1.046
ValVal: 3.733 ± 1.019
ValTrp: 1.244 ± 0.826
ValTyr: 2.074 ± 0.762
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.415 ± 0.324
TrpCys: 0.0 ± 0.0
TrpAsp: 1.244 ± 0.625
TrpGlu: 0.83 ± 0.752
TrpPhe: 0.415 ± 0.324
TrpGly: 0.415 ± 0.485
TrpHis: 0.415 ± 0.376
TrpIle: 0.0 ± 0.0
TrpLys: 0.83 ± 0.648
TrpLeu: 1.244 ± 0.405
TrpMet: 0.0 ± 0.0
TrpAsn: 0.415 ± 0.355
TrpPro: 0.415 ± 0.355
TrpGln: 0.415 ± 0.324
TrpArg: 1.659 ± 0.952
TrpSer: 0.415 ± 0.355
TrpThr: 0.83 ± 0.752
TrpVal: 1.659 ± 0.92
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.659 ± 0.491
TyrCys: 0.83 ± 0.592
TyrAsp: 2.074 ± 0.367
TyrGlu: 2.074 ± 0.506
TyrPhe: 3.733 ± 0.974
TyrGly: 2.074 ± 0.761
TyrHis: 0.0 ± 0.0
TyrIle: 1.659 ± 0.792
TyrLys: 1.244 ± 0.63
TyrLeu: 2.903 ± 1.016
TyrMet: 0.0 ± 0.0
TyrAsn: 2.903 ± 0.783
TyrPro: 1.244 ± 0.697
TyrGln: 1.659 ± 0.531
TyrArg: 2.489 ± 1.092
TyrSer: 3.733 ± 0.884
TyrThr: 2.903 ± 0.547
TyrVal: 1.244 ± 0.732
TyrTrp: 0.0 ± 0.0
TyrTyr: 2.489 ± 1.026
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2412 amino acids)

Note: The errors were estimated by bootstrapping (100 replicates) at the protein level.
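One plausible reading of a protein-level bootstrap is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicate values is reported as the error. The function names, the use of the sample standard deviation, the fixed seed, and the one-letter pair notation (for example "AA" for AlaAla) are assumptions made for illustration; the page does not specify these details.

```python
import random
from collections import Counter
from statistics import stdev


def pair_frequency(proteins, pair):
    """Per-mille frequency of one dipeptide (one-letter codes, e.g. "AA").

    Assumption: overlapping pairs are counted within each protein and
    normalized by the total number of pairs in the sample.
    """
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins."""
    if n_replicates < 2:
        raise ValueError("need at least two replicates")
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    values = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = rng.choices(proteins, k=len(proteins))
        values.append(pair_frequency(sample, pair))
    return stdev(values)
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the replicate-to-replicate variation reflects differences in composition between proteins. With only 7 proteins in this proteome, such estimates would be expected to be fairly coarse.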

The dipeptide statistics above, along with other statistics for this proteome, can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski