Amino acid dipepetide frequency for Human papillomavirus 148

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.07AlaAla: 5.07 ± 0.834
0.845AlaCys: 0.845 ± 1.035
3.802AlaAsp: 3.802 ± 1.601
4.225AlaGlu: 4.225 ± 1.174
4.225AlaPhe: 4.225 ± 1.119
3.38AlaGly: 3.38 ± 1.424
0.422AlaHis: 0.422 ± 0.319
2.112AlaIle: 2.112 ± 0.697
3.38AlaLys: 3.38 ± 0.827
4.225AlaLeu: 4.225 ± 0.813
0.845AlaMet: 0.845 ± 0.585
2.957AlaAsn: 2.957 ± 0.946
2.957AlaPro: 2.957 ± 2.035
2.112AlaGln: 2.112 ± 0.412
3.38AlaArg: 3.38 ± 1.171
2.957AlaSer: 2.957 ± 0.955
4.225AlaThr: 4.225 ± 1.054
5.07AlaVal: 5.07 ± 1.56
0.845AlaTrp: 0.845 ± 0.437
1.69AlaTyr: 1.69 ± 0.611
0.0AlaXaa: 0.0 ± 0.0
Cys
1.69CysAla: 1.69 ± 0.811
1.69CysCys: 1.69 ± 0.938
0.845CysAsp: 0.845 ± 0.585
0.845CysGlu: 0.845 ± 0.609
2.535CysPhe: 2.535 ± 1.011
0.845CysGly: 0.845 ± 0.609
1.267CysHis: 1.267 ± 0.643
1.267CysIle: 1.267 ± 0.567
2.112CysLys: 2.112 ± 1.216
2.957CysLeu: 2.957 ± 1.504
0.0CysMet: 0.0 ± 0.0
1.267CysAsn: 1.267 ± 0.59
0.845CysPro: 0.845 ± 0.616
0.0CysGln: 0.0 ± 0.0
0.845CysArg: 0.845 ± 1.035
2.535CysSer: 2.535 ± 1.512
0.422CysThr: 0.422 ± 0.292
0.845CysVal: 0.845 ± 0.609
1.69CysTrp: 1.69 ± 0.611
0.422CysTyr: 0.422 ± 0.518
0.0CysXaa: 0.0 ± 0.0
Asp
5.492AspAla: 5.492 ± 1.534
2.535AspCys: 2.535 ± 0.799
4.225AspAsp: 4.225 ± 1.608
2.535AspGlu: 2.535 ± 0.729
2.957AspPhe: 2.957 ± 1.56
2.957AspGly: 2.957 ± 1.096
1.69AspHis: 1.69 ± 0.774
8.027AspIle: 8.027 ± 2.574
0.845AspLys: 0.845 ± 0.549
5.492AspLeu: 5.492 ± 0.646
0.845AspMet: 0.845 ± 0.638
2.535AspAsn: 2.535 ± 0.734
3.802AspPro: 3.802 ± 1.285
1.267AspGln: 1.267 ± 0.67
2.112AspArg: 2.112 ± 0.887
5.915AspSer: 5.915 ± 1.616
4.225AspThr: 4.225 ± 0.69
5.915AspVal: 5.915 ± 1.277
0.422AspTrp: 0.422 ± 0.319
3.802AspTyr: 3.802 ± 1.097
0.0AspXaa: 0.0 ± 0.0
Glu
2.112GluAla: 2.112 ± 0.842
1.267GluCys: 1.267 ± 0.535
5.07GluAsp: 5.07 ± 0.95
5.915GluGlu: 5.915 ± 1.619
2.957GluPhe: 2.957 ± 1.19
2.535GluGly: 2.535 ± 0.886
1.267GluHis: 1.267 ± 0.711
2.957GluIle: 2.957 ± 0.484
2.957GluLys: 2.957 ± 0.684
4.647GluLeu: 4.647 ± 1.539
1.267GluMet: 1.267 ± 0.366
4.647GluAsn: 4.647 ± 0.988
2.957GluPro: 2.957 ± 1.308
2.535GluGln: 2.535 ± 0.835
3.802GluArg: 3.802 ± 0.526
2.957GluSer: 2.957 ± 1.106
4.647GluThr: 4.647 ± 0.985
2.535GluVal: 2.535 ± 0.946
0.845GluTrp: 0.845 ± 0.437
1.69GluTyr: 1.69 ± 0.598
0.0GluXaa: 0.0 ± 0.0
Phe
2.957PheAla: 2.957 ± 0.656
1.267PheCys: 1.267 ± 0.711
5.07PheAsp: 5.07 ± 1.325
3.38PheGlu: 3.38 ± 1.181
2.957PhePhe: 2.957 ± 1.092
2.112PheGly: 2.112 ± 0.863
0.845PheHis: 0.845 ± 0.615
1.69PheIle: 1.69 ± 0.659
3.802PheLys: 3.802 ± 1.598
3.802PheLeu: 3.802 ± 0.604
0.422PheMet: 0.422 ± 0.292
2.957PheAsn: 2.957 ± 1.157
3.38PhePro: 3.38 ± 0.838
0.845PheGln: 0.845 ± 0.437
2.535PheArg: 2.535 ± 1.18
2.535PheSer: 2.535 ± 0.461
3.802PheThr: 3.802 ± 0.653
1.267PheVal: 1.267 ± 0.612
0.845PheTrp: 0.845 ± 0.329
2.535PheTyr: 2.535 ± 0.659
0.0PheXaa: 0.0 ± 0.0
Gly
2.957GlyAla: 2.957 ± 0.887
1.267GlyCys: 1.267 ± 0.771
4.225GlyAsp: 4.225 ± 1.036
4.225GlyGlu: 4.225 ± 0.976
1.69GlyPhe: 1.69 ± 0.432
2.112GlyGly: 2.112 ± 1.332
0.422GlyHis: 0.422 ± 0.319
2.957GlyIle: 2.957 ± 0.933
2.535GlyLys: 2.535 ± 0.988
4.225GlyLeu: 4.225 ± 1.524
0.422GlyMet: 0.422 ± 0.292
2.957GlyAsn: 2.957 ± 0.484
2.535GlyPro: 2.535 ± 0.766
2.535GlyGln: 2.535 ± 1.077
2.535GlyArg: 2.535 ± 0.899
4.225GlySer: 4.225 ± 1.22
4.647GlyThr: 4.647 ± 1.39
2.112GlyVal: 2.112 ± 1.332
0.422GlyTrp: 0.422 ± 0.292
1.69GlyTyr: 1.69 ± 0.942
0.0GlyXaa: 0.0 ± 0.0
His
0.845HisAla: 0.845 ± 0.638
1.267HisCys: 1.267 ± 0.682
0.845HisAsp: 0.845 ± 0.381
0.422HisGlu: 0.422 ± 0.565
0.845HisPhe: 0.845 ± 0.381
0.845HisGly: 0.845 ± 0.767
0.0HisHis: 0.0 ± 0.257
2.957HisIle: 2.957 ± 0.621
2.112HisLys: 2.112 ± 0.682
2.535HisLeu: 2.535 ± 0.809
0.422HisMet: 0.422 ± 0.376
0.422HisAsn: 0.422 ± 0.292
1.267HisPro: 1.267 ± 0.728
0.845HisGln: 0.845 ± 0.564
0.422HisArg: 0.422 ± 0.357
0.422HisSer: 0.422 ± 0.351
0.845HisThr: 0.845 ± 0.598
0.422HisVal: 0.422 ± 0.292
0.0HisTrp: 0.0 ± 0.0
0.422HisTyr: 0.422 ± 0.351
0.0HisXaa: 0.0 ± 0.0
Ile
2.957IleAla: 2.957 ± 1.357
1.267IleCys: 1.267 ± 0.643
3.38IleAsp: 3.38 ± 1.524
3.38IleGlu: 3.38 ± 0.838
2.112IlePhe: 2.112 ± 0.87
3.38IleGly: 3.38 ± 1.197
0.0IleHis: 0.0 ± 0.0
2.535IleIle: 2.535 ± 0.441
2.535IleLys: 2.535 ± 0.631
1.69IleLeu: 1.69 ± 0.576
0.0IleMet: 0.0 ± 0.0
3.802IleAsn: 3.802 ± 0.526
3.802IlePro: 3.802 ± 1.529
1.267IleGln: 1.267 ± 0.778
2.535IleArg: 2.535 ± 1.354
5.915IleSer: 5.915 ± 1.174
5.492IleThr: 5.492 ± 1.11
3.802IleVal: 3.802 ± 0.851
0.422IleTrp: 0.422 ± 0.319
1.267IleTyr: 1.267 ± 0.463
0.0IleXaa: 0.0 ± 0.0
Lys
1.267LysAla: 1.267 ± 0.656
2.535LysCys: 2.535 ± 0.849
1.267LysAsp: 1.267 ± 0.65
3.802LysGlu: 3.802 ± 0.926
2.957LysPhe: 2.957 ± 1.325
0.845LysGly: 0.845 ± 0.437
0.845LysHis: 0.845 ± 0.456
0.845LysIle: 0.845 ± 0.6
2.112LysLys: 2.112 ± 0.548
4.647LysLeu: 4.647 ± 0.941
1.267LysMet: 1.267 ± 0.777
2.112LysAsn: 2.112 ± 0.839
2.112LysPro: 2.112 ± 0.636
2.957LysGln: 2.957 ± 1.02
6.337LysArg: 6.337 ± 1.469
4.647LysSer: 4.647 ± 1.692
2.957LysThr: 2.957 ± 1.034
3.802LysVal: 3.802 ± 0.604
0.422LysTrp: 0.422 ± 0.565
2.957LysTyr: 2.957 ± 0.456
0.0LysXaa: 0.0 ± 0.0
Leu
3.802LeuAla: 3.802 ± 0.448
2.112LeuCys: 2.112 ± 1.17
5.492LeuAsp: 5.492 ± 1.562
6.76LeuGlu: 6.76 ± 1.071
3.802LeuPhe: 3.802 ± 1.337
5.07LeuGly: 5.07 ± 1.957
2.112LeuHis: 2.112 ± 1.157
5.915LeuIle: 5.915 ± 1.773
5.915LeuLys: 5.915 ± 0.869
7.605LeuLeu: 7.605 ± 2.459
0.422LeuMet: 0.422 ± 0.446
2.957LeuAsn: 2.957 ± 0.94
5.07LeuPro: 5.07 ± 2.196
5.915LeuGln: 5.915 ± 1.36
5.07LeuArg: 5.07 ± 1.824
5.915LeuSer: 5.915 ± 1.402
4.225LeuThr: 4.225 ± 1.086
4.647LeuVal: 4.647 ± 1.531
0.845LeuTrp: 0.845 ± 0.638
5.07LeuTyr: 5.07 ± 0.842
0.0LeuXaa: 0.0 ± 0.0
Met
0.0MetAla: 0.0 ± 0.0
0.0MetCys: 0.0 ± 0.0
0.422MetAsp: 0.422 ± 0.319
1.69MetGlu: 1.69 ± 0.983
0.845MetPhe: 0.845 ± 0.585
1.267MetGly: 1.267 ± 0.535
0.0MetHis: 0.0 ± 0.0
0.845MetIle: 0.845 ± 0.585
0.422MetLys: 0.422 ± 0.351
1.69MetLeu: 1.69 ± 0.916
0.0MetMet: 0.0 ± 0.0
0.845MetAsn: 0.845 ± 0.329
0.422MetPro: 0.422 ± 0.292
1.69MetGln: 1.69 ± 0.603
0.422MetArg: 0.422 ± 0.292
0.845MetSer: 0.845 ± 0.329
1.267MetThr: 1.267 ± 0.728
2.535MetVal: 2.535 ± 1.07
0.0MetTrp: 0.0 ± 0.0
0.422MetTyr: 0.422 ± 0.351
0.0MetXaa: 0.0 ± 0.0
Asn
3.802AsnAla: 3.802 ± 0.855
0.845AsnCys: 0.845 ± 1.035
3.802AsnAsp: 3.802 ± 0.526
2.112AsnGlu: 2.112 ± 0.697
1.267AsnPhe: 1.267 ± 0.579
2.112AsnGly: 2.112 ± 0.738
0.845AsnHis: 0.845 ± 0.485
3.802AsnIle: 3.802 ± 1.489
2.535AsnLys: 2.535 ± 1.043
4.225AsnLeu: 4.225 ± 0.644
2.112AsnMet: 2.112 ± 0.794
1.69AsnAsn: 1.69 ± 0.432
2.535AsnPro: 2.535 ± 0.874
2.535AsnGln: 2.535 ± 0.934
2.112AsnArg: 2.112 ± 0.624
4.647AsnSer: 4.647 ± 1.361
4.647AsnThr: 4.647 ± 1.115
2.957AsnVal: 2.957 ± 0.663
0.422AsnTrp: 0.422 ± 0.292
0.422AsnTyr: 0.422 ± 0.518
0.0AsnXaa: 0.0 ± 0.0
Pro
4.225ProAla: 4.225 ± 1.434
0.845ProCys: 0.845 ± 0.329
7.182ProAsp: 7.182 ± 2.33
3.38ProGlu: 3.38 ± 0.727
1.267ProPhe: 1.267 ± 0.732
0.845ProGly: 0.845 ± 0.714
1.267ProHis: 1.267 ± 0.764
1.267ProIle: 1.267 ± 0.379
2.957ProLys: 2.957 ± 0.656
5.915ProLeu: 5.915 ± 1.511
0.422ProMet: 0.422 ± 0.292
2.112ProAsn: 2.112 ± 0.777
7.605ProPro: 7.605 ± 2.961
2.535ProGln: 2.535 ± 0.553
4.225ProArg: 4.225 ± 1.547
4.225ProSer: 4.225 ± 2.664
5.492ProThr: 5.492 ± 1.845
2.957ProVal: 2.957 ± 1.849
0.422ProTrp: 0.422 ± 0.351
2.112ProTyr: 2.112 ± 1.324
0.0ProXaa: 0.0 ± 0.0
Gln
1.69GlnAla: 1.69 ± 0.576
1.267GlnCys: 1.267 ± 0.839
2.112GlnAsp: 2.112 ± 0.933
3.38GlnGlu: 3.38 ± 1.645
2.112GlnPhe: 2.112 ± 0.584
3.802GlnGly: 3.802 ± 0.809
2.112GlnHis: 2.112 ± 1.047
1.267GlnIle: 1.267 ± 0.535
1.69GlnLys: 1.69 ± 0.549
3.38GlnLeu: 3.38 ± 1.066
1.267GlnMet: 1.267 ± 0.656
0.845GlnAsn: 0.845 ± 0.714
2.957GlnPro: 2.957 ± 0.85
2.535GlnGln: 2.535 ± 0.92
2.535GlnArg: 2.535 ± 0.537
1.69GlnSer: 1.69 ± 0.611
2.112GlnThr: 2.112 ± 1.105
4.647GlnVal: 4.647 ± 0.974
1.267GlnTrp: 1.267 ± 0.656
0.845GlnTyr: 0.845 ± 0.329
0.0GlnXaa: 0.0 ± 0.0
Arg
3.802ArgAla: 3.802 ± 1.167
1.267ArgCys: 1.267 ± 0.65
2.112ArgAsp: 2.112 ± 0.923
2.112ArgGlu: 2.112 ± 0.89
2.112ArgPhe: 2.112 ± 0.794
2.112ArgGly: 2.112 ± 1.174
1.267ArgHis: 1.267 ± 0.593
1.69ArgIle: 1.69 ± 0.772
5.07ArgLys: 5.07 ± 1.21
6.76ArgLeu: 6.76 ± 1.099
0.845ArgMet: 0.845 ± 0.474
3.802ArgAsn: 3.802 ± 1.638
3.38ArgPro: 3.38 ± 1.437
2.112ArgGln: 2.112 ± 0.933
6.337ArgArg: 6.337 ± 2.473
2.957ArgSer: 2.957 ± 0.687
3.38ArgThr: 3.38 ± 1.289
3.38ArgVal: 3.38 ± 1.506
0.422ArgTrp: 0.422 ± 0.518
1.69ArgTyr: 1.69 ± 1.101
0.0ArgXaa: 0.0 ± 0.0
Ser
5.07SerAla: 5.07 ± 1.89
0.845SerCys: 0.845 ± 0.549
4.225SerAsp: 4.225 ± 1.29
3.38SerGlu: 3.38 ± 0.801
4.225SerPhe: 4.225 ± 0.871
3.802SerGly: 3.802 ± 0.682
1.267SerHis: 1.267 ± 0.737
2.957SerIle: 2.957 ± 0.79
1.69SerLys: 1.69 ± 1.169
8.027SerLeu: 8.027 ± 1.725
0.845SerMet: 0.845 ± 0.585
5.07SerAsn: 5.07 ± 2.372
3.802SerPro: 3.802 ± 0.998
3.38SerGln: 3.38 ± 0.94
3.38SerArg: 3.38 ± 1.016
5.492SerSer: 5.492 ± 1.198
7.182SerThr: 7.182 ± 1.72
4.225SerVal: 4.225 ± 1.601
0.0SerTrp: 0.0 ± 0.0
2.112SerTyr: 2.112 ± 1.191
0.0SerXaa: 0.0 ± 0.0
Thr
3.38ThrAla: 3.38 ± 0.76
1.267ThrCys: 1.267 ± 0.768
6.337ThrAsp: 6.337 ± 1.087
4.225ThrGlu: 4.225 ± 0.792
2.112ThrPhe: 2.112 ± 1.074
5.07ThrGly: 5.07 ± 1.451
1.267ThrHis: 1.267 ± 0.629
3.38ThrIle: 3.38 ± 1.488
2.535ThrLys: 2.535 ± 0.768
8.027ThrLeu: 8.027 ± 1.591
1.267ThrMet: 1.267 ± 0.72
3.38ThrAsn: 3.38 ± 0.572
5.492ThrPro: 5.492 ± 2.564
2.535ThrGln: 2.535 ± 1.011
2.957ThrArg: 2.957 ± 1.21
4.647ThrSer: 4.647 ± 1.434
4.225ThrThr: 4.225 ± 0.799
6.337ThrVal: 6.337 ± 1.046
0.422ThrTrp: 0.422 ± 0.357
2.957ThrTyr: 2.957 ± 0.695
0.0ThrXaa: 0.0 ± 0.0
Val
3.802ValAla: 3.802 ± 1.134
1.267ValCys: 1.267 ± 1.221
5.492ValAsp: 5.492 ± 1.254
2.112ValGlu: 2.112 ± 0.968
3.802ValPhe: 3.802 ± 1.345
4.225ValGly: 4.225 ± 0.647
0.845ValHis: 0.845 ± 0.476
3.38ValIle: 3.38 ± 1.859
2.535ValLys: 2.535 ± 1.07
4.647ValLeu: 4.647 ± 1.166
0.845ValMet: 0.845 ± 0.383
2.535ValAsn: 2.535 ± 0.553
4.225ValPro: 4.225 ± 0.727
3.38ValGln: 3.38 ± 1.715
2.112ValArg: 2.112 ± 0.59
5.915ValSer: 5.915 ± 0.868
4.647ValThr: 4.647 ± 1.61
4.647ValVal: 4.647 ± 2.462
0.845ValTrp: 0.845 ± 0.615
1.69ValTyr: 1.69 ± 0.773
0.0ValXaa: 0.0 ± 0.0
Trp
1.69TrpAla: 1.69 ± 0.432
0.422TrpCys: 0.422 ± 0.292
1.267TrpAsp: 1.267 ± 0.771
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
0.845TrpGly: 0.845 ± 0.6
0.0TrpHis: 0.0 ± 0.0
0.422TrpIle: 0.422 ± 0.292
0.845TrpLys: 0.845 ± 0.549
1.267TrpLeu: 1.267 ± 0.579
0.845TrpMet: 0.845 ± 0.437
0.845TrpAsn: 0.845 ± 0.638
0.422TrpPro: 0.422 ± 0.319
0.845TrpGln: 0.845 ± 0.474
0.422TrpArg: 0.422 ± 0.518
0.422TrpSer: 0.422 ± 0.351
1.267TrpThr: 1.267 ± 0.656
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.112TyrAla: 2.112 ± 0.564
0.845TyrCys: 0.845 ± 0.558
0.422TyrAsp: 0.422 ± 0.292
1.69TyrGlu: 1.69 ± 0.549
4.225TyrPhe: 4.225 ± 1.019
2.535TyrGly: 2.535 ± 0.659
0.422TyrHis: 0.422 ± 0.357
1.267TyrIle: 1.267 ± 0.632
2.535TyrLys: 2.535 ± 0.628
2.957TyrLeu: 2.957 ± 0.63
0.845TyrMet: 0.845 ± 0.549
1.69TyrAsn: 1.69 ± 0.675
1.69TyrPro: 1.69 ± 0.547
1.69TyrGln: 1.69 ± 0.782
2.535TyrArg: 2.535 ± 0.554
2.112TyrSer: 2.112 ± 0.413
2.112TyrThr: 2.112 ± 0.499
0.845TyrVal: 0.845 ± 0.329
1.267TyrTrp: 1.267 ± 0.728
2.535TyrTyr: 2.535 ± 0.653
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2368 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski