Amino acid dipepetide frequency for Human papillomavirus 35

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.885AlaAla: 3.885 ± 1.384
1.943AlaCys: 1.943 ± 0.57
3.108AlaAsp: 3.108 ± 1.189
3.108AlaGlu: 3.108 ± 1.182
2.72AlaPhe: 2.72 ± 1.46
3.497AlaGly: 3.497 ± 1.186
0.389AlaHis: 0.389 ± 0.458
4.274AlaIle: 4.274 ± 1.179
3.497AlaLys: 3.497 ± 1.465
4.274AlaLeu: 4.274 ± 1.677
2.331AlaMet: 2.331 ± 0.782
1.943AlaAsn: 1.943 ± 0.673
3.497AlaPro: 3.497 ± 1.162
3.108AlaGln: 3.108 ± 1.222
1.166AlaArg: 1.166 ± 0.455
3.497AlaSer: 3.497 ± 0.489
3.885AlaThr: 3.885 ± 1.144
4.662AlaVal: 4.662 ± 0.938
0.0AlaTrp: 0.0 ± 0.0
2.72AlaTyr: 2.72 ± 0.986
0.0AlaXaa: 0.0 ± 0.0
Cys
1.554CysAla: 1.554 ± 0.96
0.389CysCys: 0.389 ± 0.563
1.166CysAsp: 1.166 ± 0.738
0.777CysGlu: 0.777 ± 0.561
1.554CysPhe: 1.554 ± 1.364
0.777CysGly: 0.777 ± 0.539
0.389CysHis: 0.389 ± 0.464
2.72CysIle: 2.72 ± 1.272
1.943CysLys: 1.943 ± 0.988
3.885CysLeu: 3.885 ± 1.376
0.777CysMet: 0.777 ± 0.929
1.554CysAsn: 1.554 ± 0.941
2.331CysPro: 2.331 ± 0.618
1.166CysGln: 1.166 ± 0.551
1.166CysArg: 1.166 ± 0.586
3.885CysSer: 3.885 ± 0.998
2.72CysThr: 2.72 ± 1.205
2.331CysVal: 2.331 ± 1.206
1.166CysTrp: 1.166 ± 0.547
0.777CysTyr: 0.777 ± 0.803
0.0CysXaa: 0.0 ± 0.0
Asp
3.497AspAla: 3.497 ± 1.327
1.554AspCys: 1.554 ± 0.755
3.108AspAsp: 3.108 ± 1.724
1.943AspGlu: 1.943 ± 1.159
3.108AspPhe: 3.108 ± 0.812
3.497AspGly: 3.497 ± 0.9
0.389AspHis: 0.389 ± 0.35
5.051AspIle: 5.051 ± 2.11
3.885AspLys: 3.885 ± 1.873
4.662AspLeu: 4.662 ± 2.409
1.554AspMet: 1.554 ± 0.761
1.943AspAsn: 1.943 ± 0.699
3.885AspPro: 3.885 ± 1.032
1.166AspGln: 1.166 ± 0.336
2.331AspArg: 2.331 ± 1.148
5.051AspSer: 5.051 ± 0.738
5.439AspThr: 5.439 ± 1.909
2.331AspVal: 2.331 ± 0.92
1.166AspTrp: 1.166 ± 0.586
1.166AspTyr: 1.166 ± 0.907
0.0AspXaa: 0.0 ± 0.0
Glu
2.331GluAla: 2.331 ± 1.032
1.554GluCys: 1.554 ± 0.895
4.662GluAsp: 4.662 ± 1.052
5.051GluGlu: 5.051 ± 1.808
0.389GluPhe: 0.389 ± 0.317
3.108GluGly: 3.108 ± 1.202
1.166GluHis: 1.166 ± 0.455
2.331GluIle: 2.331 ± 0.8
2.72GluLys: 2.72 ± 1.522
4.662GluLeu: 4.662 ± 1.36
1.166GluMet: 1.166 ± 0.674
1.554GluAsn: 1.554 ± 0.862
0.777GluPro: 0.777 ± 0.652
1.554GluGln: 1.554 ± 0.646
0.777GluArg: 0.777 ± 0.534
1.943GluSer: 1.943 ± 0.682
7.77GluThr: 7.77 ± 2.574
3.497GluVal: 3.497 ± 1.324
0.389GluTrp: 0.389 ± 0.317
2.72GluTyr: 2.72 ± 1.159
0.0GluXaa: 0.0 ± 0.0
Phe
0.777PheAla: 0.777 ± 0.519
0.777PheCys: 0.777 ± 0.915
1.554PheAsp: 1.554 ± 0.645
1.166PheGlu: 1.166 ± 0.528
1.943PhePhe: 1.943 ± 0.618
3.108PheGly: 3.108 ± 0.923
1.166PheHis: 1.166 ± 0.745
1.943PheIle: 1.943 ± 0.618
2.72PheLys: 2.72 ± 1.42
3.497PheLeu: 3.497 ± 1.342
0.777PheMet: 0.777 ± 0.686
1.554PheAsn: 1.554 ± 0.605
1.943PhePro: 1.943 ± 1.047
0.777PheGln: 0.777 ± 0.576
1.166PheArg: 1.166 ± 0.742
1.554PheSer: 1.554 ± 0.321
1.943PheThr: 1.943 ± 0.618
2.331PheVal: 2.331 ± 0.65
0.777PheTrp: 0.777 ± 0.379
1.943PheTyr: 1.943 ± 0.711
0.0PheXaa: 0.0 ± 0.0
Gly
1.943GlyAla: 1.943 ± 0.618
1.554GlyCys: 1.554 ± 0.559
3.885GlyAsp: 3.885 ± 0.928
3.497GlyGlu: 3.497 ± 1.408
1.166GlyPhe: 1.166 ± 0.732
2.72GlyGly: 2.72 ± 1.176
1.943GlyHis: 1.943 ± 0.953
3.108GlyIle: 3.108 ± 0.622
3.108GlyLys: 3.108 ± 0.626
3.108GlyLeu: 3.108 ± 1.012
1.166GlyMet: 1.166 ± 0.62
2.331GlyAsn: 2.331 ± 0.804
1.554GlyPro: 1.554 ± 1.161
1.943GlyGln: 1.943 ± 0.712
1.943GlyArg: 1.943 ± 1.075
3.885GlySer: 3.885 ± 1.131
6.993GlyThr: 6.993 ± 1.692
3.885GlyVal: 3.885 ± 1.301
0.389GlyTrp: 0.389 ± 0.317
1.943GlyTyr: 1.943 ± 0.749
0.0GlyXaa: 0.0 ± 0.0
His
1.943HisAla: 1.943 ± 0.575
0.0HisCys: 0.0 ± 0.0
1.554HisAsp: 1.554 ± 0.688
1.166HisGlu: 1.166 ± 0.521
1.166HisPhe: 1.166 ± 0.62
1.554HisGly: 1.554 ± 0.559
0.0HisHis: 0.0 ± 0.0
1.554HisIle: 1.554 ± 0.835
1.943HisLys: 1.943 ± 0.708
2.72HisLeu: 2.72 ± 1.435
0.0HisMet: 0.0 ± 0.0
1.554HisAsn: 1.554 ± 0.453
1.554HisPro: 1.554 ± 0.853
0.777HisGln: 0.777 ± 0.7
0.389HisArg: 0.389 ± 0.343
1.554HisSer: 1.554 ± 0.676
1.554HisThr: 1.554 ± 0.811
0.777HisVal: 0.777 ± 0.468
0.389HisTrp: 0.389 ± 0.381
1.554HisTyr: 1.554 ± 0.727
0.0HisXaa: 0.0 ± 0.0
Ile
3.885IleAla: 3.885 ± 0.99
2.331IleCys: 2.331 ± 0.992
2.331IleAsp: 2.331 ± 1.276
2.331IleGlu: 2.331 ± 0.932
1.554IlePhe: 1.554 ± 0.605
2.331IleGly: 2.331 ± 1.058
1.166IleHis: 1.166 ± 0.547
1.943IleIle: 1.943 ± 1.027
1.943IleLys: 1.943 ± 0.688
4.662IleLeu: 4.662 ± 1.031
0.777IleMet: 0.777 ± 0.463
1.166IleAsn: 1.166 ± 0.684
4.274IlePro: 4.274 ± 2.716
3.108IleGln: 3.108 ± 2.191
2.331IleArg: 2.331 ± 0.869
5.051IleSer: 5.051 ± 1.317
4.662IleThr: 4.662 ± 1.804
5.051IleVal: 5.051 ± 1.641
0.389IleTrp: 0.389 ± 0.35
2.72IleTyr: 2.72 ± 0.71
0.0IleXaa: 0.0 ± 0.0
Lys
5.051LysAla: 5.051 ± 1.579
3.108LysCys: 3.108 ± 1.279
1.943LysAsp: 1.943 ± 0.62
1.943LysGlu: 1.943 ± 0.934
2.72LysPhe: 2.72 ± 0.986
3.885LysGly: 3.885 ± 1.516
0.777LysHis: 0.777 ± 0.433
3.108LysIle: 3.108 ± 0.767
3.497LysLys: 3.497 ± 1.258
3.885LysLeu: 3.885 ± 0.742
0.389LysMet: 0.389 ± 0.381
3.497LysAsn: 3.497 ± 1.653
3.497LysPro: 3.497 ± 1.847
3.497LysGln: 3.497 ± 1.86
6.993LysArg: 6.993 ± 1.482
3.497LysSer: 3.497 ± 1.757
2.331LysThr: 2.331 ± 0.923
2.331LysVal: 2.331 ± 0.975
0.389LysTrp: 0.389 ± 0.397
3.497LysTyr: 3.497 ± 0.838
0.0LysXaa: 0.0 ± 0.0
Leu
2.72LeuAla: 2.72 ± 0.967
5.439LeuCys: 5.439 ± 3.381
3.885LeuAsp: 3.885 ± 0.94
4.274LeuGlu: 4.274 ± 1.946
2.72LeuPhe: 2.72 ± 1.3
3.108LeuGly: 3.108 ± 1.18
4.662LeuHis: 4.662 ± 1.523
4.274LeuIle: 4.274 ± 1.356
7.77LeuLys: 7.77 ± 1.428
8.159LeuLeu: 8.159 ± 3.986
1.554LeuMet: 1.554 ± 0.754
3.885LeuAsn: 3.885 ± 0.542
3.108LeuPro: 3.108 ± 0.835
6.216LeuGln: 6.216 ± 1.403
4.662LeuArg: 4.662 ± 1.32
6.216LeuSer: 6.216 ± 2.076
3.885LeuThr: 3.885 ± 1.417
2.331LeuVal: 2.331 ± 0.806
1.166LeuTrp: 1.166 ± 0.977
4.662LeuTyr: 4.662 ± 0.906
0.0LeuXaa: 0.0 ± 0.0
Met
1.554MetAla: 1.554 ± 0.609
0.389MetCys: 0.389 ± 0.317
1.554MetAsp: 1.554 ± 0.853
0.389MetGlu: 0.389 ± 0.35
1.166MetPhe: 1.166 ± 0.552
1.166MetGly: 1.166 ± 0.619
1.554MetHis: 1.554 ± 0.636
0.777MetIle: 0.777 ± 0.915
0.389MetLys: 0.389 ± 0.464
1.943MetLeu: 1.943 ± 0.827
0.389MetMet: 0.389 ± 0.35
0.389MetAsn: 0.389 ± 0.381
0.0MetPro: 0.0 ± 0.0
0.389MetGln: 0.389 ± 0.35
0.389MetArg: 0.389 ± 0.343
3.108MetSer: 3.108 ± 0.844
1.166MetThr: 1.166 ± 0.336
2.331MetVal: 2.331 ± 1.137
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.72AsnAla: 2.72 ± 1.408
0.777AsnCys: 0.777 ± 0.534
3.885AsnAsp: 3.885 ± 1.208
1.554AsnGlu: 1.554 ± 0.684
1.554AsnPhe: 1.554 ± 1.05
1.166AsnGly: 1.166 ± 0.586
1.166AsnHis: 1.166 ± 0.818
3.885AsnIle: 3.885 ± 0.884
4.274AsnLys: 4.274 ± 1.889
0.389AsnLeu: 0.389 ± 0.317
0.389AsnMet: 0.389 ± 0.381
1.943AsnAsn: 1.943 ± 0.618
3.108AsnPro: 3.108 ± 0.8
0.777AsnGln: 0.777 ± 0.762
1.166AsnArg: 1.166 ± 0.691
3.497AsnSer: 3.497 ± 0.932
6.993AsnThr: 6.993 ± 2.332
0.0AsnVal: 0.0 ± 0.0
0.777AsnTrp: 0.777 ± 0.433
1.943AsnTyr: 1.943 ± 0.852
0.0AsnXaa: 0.0 ± 0.0
Pro
6.216ProAla: 6.216 ± 1.764
0.777ProCys: 0.777 ± 0.379
3.885ProAsp: 3.885 ± 1.595
1.166ProGlu: 1.166 ± 0.619
0.777ProPhe: 0.777 ± 0.634
0.777ProGly: 0.777 ± 0.652
0.389ProHis: 0.389 ± 0.343
3.885ProIle: 3.885 ± 1.481
2.72ProLys: 2.72 ± 0.998
6.605ProLeu: 6.605 ± 1.896
0.389ProMet: 0.389 ± 0.343
1.943ProAsn: 1.943 ± 0.673
5.439ProPro: 5.439 ± 1.994
0.777ProGln: 0.777 ± 0.55
1.554ProArg: 1.554 ± 0.929
4.662ProSer: 4.662 ± 2.033
5.439ProThr: 5.439 ± 1.868
2.72ProVal: 2.72 ± 0.829
0.777ProTrp: 0.777 ± 0.955
3.885ProTyr: 3.885 ± 0.925
0.0ProXaa: 0.0 ± 0.0
Gln
3.497GlnAla: 3.497 ± 0.986
1.166GlnCys: 1.166 ± 0.745
3.108GlnAsp: 3.108 ± 0.973
1.166GlnGlu: 1.166 ± 0.546
1.554GlnPhe: 1.554 ± 0.696
1.554GlnGly: 1.554 ± 0.64
0.777GlnHis: 0.777 ± 0.388
2.72GlnIle: 2.72 ± 0.516
1.943GlnLys: 1.943 ± 0.826
4.662GlnLeu: 4.662 ± 0.84
0.777GlnMet: 0.777 ± 0.571
1.166GlnAsn: 1.166 ± 0.541
1.943GlnPro: 1.943 ± 0.553
1.554GlnGln: 1.554 ± 0.905
3.497GlnArg: 3.497 ± 1.15
0.389GlnSer: 0.389 ± 0.563
1.554GlnThr: 1.554 ± 0.676
2.72GlnVal: 2.72 ± 0.814
0.777GlnTrp: 0.777 ± 0.634
1.554GlnTyr: 1.554 ± 0.479
0.0GlnXaa: 0.0 ± 0.0
Arg
2.331ArgAla: 2.331 ± 1.001
1.943ArgCys: 1.943 ± 0.988
1.943ArgAsp: 1.943 ± 1.005
2.72ArgGlu: 2.72 ± 0.989
1.166ArgPhe: 1.166 ± 0.745
2.331ArgGly: 2.331 ± 0.964
2.331ArgHis: 2.331 ± 0.654
0.389ArgIle: 0.389 ± 0.343
2.331ArgLys: 2.331 ± 0.781
6.216ArgLeu: 6.216 ± 1.657
0.0ArgMet: 0.0 ± 0.0
0.389ArgAsn: 0.389 ± 0.317
3.108ArgPro: 3.108 ± 0.996
0.777ArgGln: 0.777 ± 0.503
3.497ArgArg: 3.497 ± 0.796
3.108ArgSer: 3.108 ± 0.999
2.72ArgThr: 2.72 ± 1.055
3.497ArgVal: 3.497 ± 1.1
1.943ArgTrp: 1.943 ± 0.988
2.72ArgTyr: 2.72 ± 0.743
0.0ArgXaa: 0.0 ± 0.0
Ser
3.885SerAla: 3.885 ± 0.677
2.331SerCys: 2.331 ± 1.062
3.497SerAsp: 3.497 ± 1.03
3.497SerGlu: 3.497 ± 1.465
1.943SerPhe: 1.943 ± 0.76
5.828SerGly: 5.828 ± 2.125
0.389SerHis: 0.389 ± 0.317
5.439SerIle: 5.439 ± 1.701
3.497SerLys: 3.497 ± 0.522
4.662SerLeu: 4.662 ± 1.535
2.72SerMet: 2.72 ± 1.216
5.828SerAsn: 5.828 ± 1.427
1.943SerPro: 1.943 ± 0.893
2.331SerGln: 2.331 ± 0.701
3.108SerArg: 3.108 ± 0.941
9.324SerSer: 9.324 ± 1.791
11.655SerThr: 11.655 ± 2.78
5.828SerVal: 5.828 ± 1.12
0.389SerTrp: 0.389 ± 0.317
1.554SerTyr: 1.554 ± 0.526
0.0SerXaa: 0.0 ± 0.0
Thr
4.274ThrAla: 4.274 ± 0.785
3.108ThrCys: 3.108 ± 0.647
3.885ThrAsp: 3.885 ± 0.726
7.382ThrGlu: 7.382 ± 2.786
2.331ThrPhe: 2.331 ± 0.653
4.274ThrGly: 4.274 ± 0.977
2.72ThrHis: 2.72 ± 0.996
2.72ThrIle: 2.72 ± 1.694
2.72ThrLys: 2.72 ± 1.055
7.77ThrLeu: 7.77 ± 1.852
1.554ThrMet: 1.554 ± 0.645
5.439ThrAsn: 5.439 ± 1.62
6.605ThrPro: 6.605 ± 1.602
3.108ThrGln: 3.108 ± 1.293
2.72ThrArg: 2.72 ± 0.648
8.159ThrSer: 8.159 ± 2.109
11.655ThrThr: 11.655 ± 3.876
8.936ThrVal: 8.936 ± 1.985
1.554ThrTrp: 1.554 ± 0.82
2.72ThrTyr: 2.72 ± 1.081
0.0ThrXaa: 0.0 ± 0.0
Val
2.331ValAla: 2.331 ± 0.97
1.943ValCys: 1.943 ± 0.935
4.662ValAsp: 4.662 ± 1.05
4.662ValGlu: 4.662 ± 1.077
2.331ValPhe: 2.331 ± 0.707
3.497ValGly: 3.497 ± 1.71
1.943ValHis: 1.943 ± 1.147
1.166ValIle: 1.166 ± 0.466
3.885ValLys: 3.885 ± 0.729
4.274ValLeu: 4.274 ± 1.682
0.777ValMet: 0.777 ± 0.543
1.166ValAsn: 1.166 ± 0.674
5.051ValPro: 5.051 ± 1.667
2.72ValGln: 2.72 ± 1.044
1.943ValArg: 1.943 ± 0.575
6.993ValSer: 6.993 ± 1.501
6.216ValThr: 6.216 ± 1.937
5.439ValVal: 5.439 ± 1.704
0.389ValTrp: 0.389 ± 0.381
3.108ValTyr: 3.108 ± 1.774
0.0ValXaa: 0.0 ± 0.0
Trp
1.554TrpAla: 1.554 ± 0.676
0.777TrpCys: 0.777 ± 0.634
0.0TrpAsp: 0.0 ± 0.0
0.777TrpGlu: 0.777 ± 0.437
0.777TrpPhe: 0.777 ± 0.634
0.777TrpGly: 0.777 ± 0.379
0.0TrpHis: 0.0 ± 0.0
0.777TrpIle: 0.777 ± 0.634
1.554TrpLys: 1.554 ± 0.661
0.777TrpLeu: 0.777 ± 0.379
0.389TrpMet: 0.389 ± 0.458
0.389TrpAsn: 0.389 ± 0.381
0.389TrpPro: 0.389 ± 0.317
0.0TrpGln: 0.0 ± 0.0
1.554TrpArg: 1.554 ± 0.496
0.389TrpSer: 0.389 ± 0.381
2.331TrpThr: 2.331 ± 1.189
0.389TrpVal: 0.389 ± 0.458
0.0TrpTrp: 0.0 ± 0.0
0.389TrpTyr: 0.389 ± 0.464
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.554TyrAla: 1.554 ± 0.495
1.166TyrCys: 1.166 ± 0.782
3.108TyrAsp: 3.108 ± 0.905
1.554TyrGlu: 1.554 ± 0.912
0.777TyrPhe: 0.777 ± 0.426
3.108TyrGly: 3.108 ± 0.916
0.389TyrHis: 0.389 ± 0.381
1.943TyrIle: 1.943 ± 0.688
3.497TyrLys: 3.497 ± 0.956
3.885TyrLeu: 3.885 ± 1.363
0.777TyrMet: 0.777 ± 0.539
2.331TyrAsn: 2.331 ± 1.47
0.777TyrPro: 0.777 ± 0.639
2.331TyrGln: 2.331 ± 1.043
3.108TyrArg: 3.108 ± 1.367
3.885TyrSer: 3.885 ± 1.568
2.72TyrThr: 2.72 ± 1.089
3.497TyrVal: 3.497 ± 0.935
1.166TyrTrp: 1.166 ± 0.393
2.331TyrTyr: 2.331 ± 0.91
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (2575 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski