Amino acid dipepetide frequency for human papillomavirus 164

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.095AlaAla: 4.095 ± 1.168
0.819AlaCys: 0.819 ± 0.477
4.505AlaAsp: 4.505 ± 0.808
4.095AlaGlu: 4.095 ± 1.112
3.276AlaPhe: 3.276 ± 1.061
2.048AlaGly: 2.048 ± 0.475
0.819AlaHis: 0.819 ± 0.406
4.914AlaIle: 4.914 ± 1.457
2.867AlaLys: 2.867 ± 0.865
4.505AlaLeu: 4.505 ± 0.835
0.41AlaMet: 0.41 ± 0.323
2.048AlaAsn: 2.048 ± 0.566
1.638AlaPro: 1.638 ± 0.545
1.229AlaGln: 1.229 ± 0.679
2.457AlaArg: 2.457 ± 0.5
4.505AlaSer: 4.505 ± 1.622
4.505AlaThr: 4.505 ± 0.623
5.324AlaVal: 5.324 ± 1.238
0.41AlaTrp: 0.41 ± 0.483
2.457AlaTyr: 2.457 ± 0.413
0.0AlaXaa: 0.0 ± 0.0
Cys
2.048CysAla: 2.048 ± 1.368
1.638CysCys: 1.638 ± 1.38
0.819CysAsp: 0.819 ± 0.682
0.0CysGlu: 0.0 ± 0.0
2.457CysPhe: 2.457 ± 1.116
0.41CysGly: 0.41 ± 0.581
0.41CysHis: 0.41 ± 0.489
1.229CysIle: 1.229 ± 1.316
2.048CysLys: 2.048 ± 0.764
1.229CysLeu: 1.229 ± 1.241
1.229CysMet: 1.229 ± 0.496
2.048CysAsn: 2.048 ± 1.054
1.229CysPro: 1.229 ± 0.681
0.819CysGln: 0.819 ± 0.436
1.638CysArg: 1.638 ± 1.38
1.229CysSer: 1.229 ± 0.716
1.229CysThr: 1.229 ± 0.716
0.819CysVal: 0.819 ± 0.477
1.638CysTrp: 1.638 ± 0.591
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.638AspAla: 1.638 ± 0.638
2.457AspCys: 2.457 ± 0.792
4.914AspAsp: 4.914 ± 1.534
4.914AspGlu: 4.914 ± 0.934
1.638AspPhe: 1.638 ± 0.527
2.457AspGly: 2.457 ± 1.219
1.229AspHis: 1.229 ± 0.671
6.143AspIle: 6.143 ± 1.555
1.638AspLys: 1.638 ± 0.706
6.143AspLeu: 6.143 ± 1.359
1.229AspMet: 1.229 ± 0.677
1.638AspAsn: 1.638 ± 1.107
6.552AspPro: 6.552 ± 2.363
1.638AspGln: 1.638 ± 0.615
2.867AspArg: 2.867 ± 1.298
5.324AspSer: 5.324 ± 1.26
4.095AspThr: 4.095 ± 0.925
4.505AspVal: 4.505 ± 1.208
0.0AspTrp: 0.0 ± 0.0
3.276AspTyr: 3.276 ± 1.122
0.0AspXaa: 0.0 ± 0.0
Glu
4.914GluAla: 4.914 ± 2.054
0.819GluCys: 0.819 ± 0.556
3.276GluAsp: 3.276 ± 1.128
9.009GluGlu: 9.009 ± 2.559
3.276GluPhe: 3.276 ± 1.059
2.048GluGly: 2.048 ± 0.991
0.819GluHis: 0.819 ± 0.406
4.095GluIle: 4.095 ± 1.4
1.229GluLys: 1.229 ± 0.91
6.143GluLeu: 6.143 ± 0.967
0.0GluMet: 0.0 ± 0.0
5.733GluAsn: 5.733 ± 1.474
1.638GluPro: 1.638 ± 0.581
3.686GluGln: 3.686 ± 1.336
4.095GluArg: 4.095 ± 1.383
3.686GluSer: 3.686 ± 1.135
4.914GluThr: 4.914 ± 1.101
3.276GluVal: 3.276 ± 1.172
0.819GluTrp: 0.819 ± 0.487
0.819GluTyr: 0.819 ± 0.967
0.0GluXaa: 0.0 ± 0.0
Phe
2.867PheAla: 2.867 ± 0.68
1.229PheCys: 1.229 ± 0.651
3.276PheAsp: 3.276 ± 1.282
5.324PheGlu: 5.324 ± 1.948
2.867PhePhe: 2.867 ± 1.46
1.229PheGly: 1.229 ± 0.699
0.0PheHis: 0.0 ± 0.0
1.229PheIle: 1.229 ± 0.569
2.867PheLys: 2.867 ± 0.813
4.095PheLeu: 4.095 ± 1.872
0.0PheMet: 0.0 ± 0.301
2.457PheAsn: 2.457 ± 0.641
2.048PhePro: 2.048 ± 0.693
2.867PheGln: 2.867 ± 1.167
2.048PheArg: 2.048 ± 0.41
3.276PheSer: 3.276 ± 0.98
2.867PheThr: 2.867 ± 1.105
2.867PheVal: 2.867 ± 0.621
1.638PheTrp: 1.638 ± 0.825
2.048PheTyr: 2.048 ± 0.41
0.0PheXaa: 0.0 ± 0.0
Gly
1.638GlyAla: 1.638 ± 0.843
1.229GlyCys: 1.229 ± 0.738
3.276GlyAsp: 3.276 ± 1.042
4.095GlyGlu: 4.095 ± 1.073
0.819GlyPhe: 0.819 ± 0.422
2.048GlyGly: 2.048 ± 0.692
2.048GlyHis: 2.048 ± 0.711
2.048GlyIle: 2.048 ± 0.692
2.867GlyLys: 2.867 ± 0.86
4.914GlyLeu: 4.914 ± 0.756
0.0GlyMet: 0.0 ± 0.0
2.457GlyAsn: 2.457 ± 1.014
2.048GlyPro: 2.048 ± 0.665
2.457GlyGln: 2.457 ± 0.924
3.686GlyArg: 3.686 ± 1.103
5.324GlySer: 5.324 ± 1.096
5.324GlyThr: 5.324 ± 1.394
2.048GlyVal: 2.048 ± 0.475
0.41GlyTrp: 0.41 ± 0.323
0.819GlyTyr: 0.819 ± 0.496
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.41HisCys: 0.41 ± 0.489
1.229HisAsp: 1.229 ± 0.438
0.41HisGlu: 0.41 ± 0.489
1.229HisPhe: 1.229 ± 0.681
0.819HisGly: 0.819 ± 0.496
0.41HisHis: 0.41 ± 0.554
0.41HisIle: 0.41 ± 0.489
1.638HisLys: 1.638 ± 0.975
2.867HisLeu: 2.867 ± 1.014
0.41HisMet: 0.41 ± 0.483
0.819HisAsn: 0.819 ± 0.477
1.638HisPro: 1.638 ± 0.843
0.0HisGln: 0.0 ± 0.0
0.41HisArg: 0.41 ± 0.483
1.229HisSer: 1.229 ± 0.569
0.0HisThr: 0.0 ± 0.0
1.229HisVal: 1.229 ± 0.65
0.41HisTrp: 0.41 ± 0.341
2.048HisTyr: 2.048 ± 0.603
0.0HisXaa: 0.0 ± 0.0
Ile
4.914IleAla: 4.914 ± 2.384
0.41IleCys: 0.41 ± 0.341
3.276IleAsp: 3.276 ± 0.986
4.914IleGlu: 4.914 ± 1.545
2.867IlePhe: 2.867 ± 1.039
2.457IleGly: 2.457 ± 1.08
0.819IleHis: 0.819 ± 0.496
2.457IleIle: 2.457 ± 1.026
2.457IleLys: 2.457 ± 0.739
6.143IleLeu: 6.143 ± 1.406
1.229IleMet: 1.229 ± 1.062
2.867IleAsn: 2.867 ± 1.298
2.867IlePro: 2.867 ± 1.284
2.048IleGln: 2.048 ± 1.043
1.638IleArg: 1.638 ± 0.78
4.095IleSer: 4.095 ± 1.6
2.457IleThr: 2.457 ± 1.001
2.457IleVal: 2.457 ± 1.398
0.0IleTrp: 0.0 ± 0.0
1.229IleTyr: 1.229 ± 0.438
0.0IleXaa: 0.0 ± 0.0
Lys
3.276LysAla: 3.276 ± 0.796
2.048LysCys: 2.048 ± 0.877
1.638LysAsp: 1.638 ± 0.813
3.686LysGlu: 3.686 ± 1.475
1.229LysPhe: 1.229 ± 0.677
2.457LysGly: 2.457 ± 0.91
2.867LysHis: 2.867 ± 1.248
3.276LysIle: 3.276 ± 1.157
1.638LysLys: 1.638 ± 0.591
3.686LysLeu: 3.686 ± 0.872
2.048LysMet: 2.048 ± 0.731
2.048LysAsn: 2.048 ± 1.283
2.457LysPro: 2.457 ± 1.052
2.048LysGln: 2.048 ± 0.682
5.324LysArg: 5.324 ± 1.407
2.457LysSer: 2.457 ± 1.564
2.867LysThr: 2.867 ± 0.774
3.276LysVal: 3.276 ± 1.023
0.41LysTrp: 0.41 ± 0.341
2.867LysTyr: 2.867 ± 0.968
0.0LysXaa: 0.0 ± 0.0
Leu
4.095LeuAla: 4.095 ± 1.23
2.457LeuCys: 2.457 ± 1.087
5.324LeuAsp: 5.324 ± 1.072
5.733LeuGlu: 5.733 ± 1.907
5.733LeuPhe: 5.733 ± 1.535
6.143LeuGly: 6.143 ± 1.378
3.276LeuHis: 3.276 ± 1.558
4.505LeuIle: 4.505 ± 0.96
4.095LeuLys: 4.095 ± 1.19
9.828LeuLeu: 9.828 ± 2.527
1.638LeuMet: 1.638 ± 0.336
5.733LeuAsn: 5.733 ± 1.008
6.552LeuPro: 6.552 ± 1.789
6.962LeuGln: 6.962 ± 2.067
5.324LeuArg: 5.324 ± 1.813
5.733LeuSer: 5.733 ± 2.217
6.962LeuThr: 6.962 ± 1.591
3.686LeuVal: 3.686 ± 0.798
0.819LeuTrp: 0.819 ± 0.406
2.867LeuTyr: 2.867 ± 0.793
0.0LeuXaa: 0.0 ± 0.0
Met
1.229MetAla: 1.229 ± 0.651
0.41MetCys: 0.41 ± 0.341
2.048MetAsp: 2.048 ± 0.824
0.819MetGlu: 0.819 ± 0.617
0.41MetPhe: 0.41 ± 0.323
0.41MetGly: 0.41 ± 0.483
0.0MetHis: 0.0 ± 0.0
0.819MetIle: 0.819 ± 0.667
1.229MetLys: 1.229 ± 1.099
0.819MetLeu: 0.819 ± 0.493
0.0MetMet: 0.0 ± 0.0
1.229MetAsn: 1.229 ± 0.629
0.819MetPro: 0.819 ± 0.477
0.41MetGln: 0.41 ± 0.483
0.0MetArg: 0.0 ± 0.0
0.819MetSer: 0.819 ± 0.406
0.41MetThr: 0.41 ± 0.341
2.048MetVal: 2.048 ± 0.909
0.0MetTrp: 0.0 ± 0.0
0.41MetTyr: 0.41 ± 0.353
0.0MetXaa: 0.0 ± 0.0
Asn
3.276AsnAla: 3.276 ± 1.032
2.048AsnCys: 2.048 ± 1.435
1.638AsnAsp: 1.638 ± 0.319
1.229AsnGlu: 1.229 ± 0.659
2.457AsnPhe: 2.457 ± 0.994
2.457AsnGly: 2.457 ± 0.753
0.41AsnHis: 0.41 ± 0.554
4.095AsnIle: 4.095 ± 1.571
2.867AsnLys: 2.867 ± 0.917
2.867AsnLeu: 2.867 ± 1.422
0.819AsnMet: 0.819 ± 0.914
3.686AsnAsn: 3.686 ± 1.305
3.276AsnPro: 3.276 ± 1.474
0.819AsnGln: 0.819 ± 0.477
2.457AsnArg: 2.457 ± 0.584
3.686AsnSer: 3.686 ± 0.62
4.914AsnThr: 4.914 ± 1.092
6.552AsnVal: 6.552 ± 2.039
1.638AsnTrp: 1.638 ± 0.828
1.229AsnTyr: 1.229 ± 0.677
0.0AsnXaa: 0.0 ± 0.0
Pro
4.505ProAla: 4.505 ± 1.986
1.229ProCys: 1.229 ± 0.496
6.962ProAsp: 6.962 ± 2.722
1.638ProGlu: 1.638 ± 0.574
1.229ProPhe: 1.229 ± 0.58
0.819ProGly: 0.819 ± 0.599
0.41ProHis: 0.41 ± 0.554
3.276ProIle: 3.276 ± 2.011
5.324ProLys: 5.324 ± 1.434
6.962ProLeu: 6.962 ± 2.972
0.0ProMet: 0.0 ± 0.0
1.638ProAsn: 1.638 ± 0.992
6.143ProPro: 6.143 ± 1.858
2.457ProGln: 2.457 ± 1.252
3.276ProArg: 3.276 ± 1.88
4.505ProSer: 4.505 ± 1.73
4.914ProThr: 4.914 ± 1.046
4.095ProVal: 4.095 ± 1.39
0.0ProTrp: 0.0 ± 0.0
2.457ProTyr: 2.457 ± 1.194
0.0ProXaa: 0.0 ± 0.0
Gln
1.638GlnAla: 1.638 ± 0.615
1.229GlnCys: 1.229 ± 0.681
4.505GlnAsp: 4.505 ± 1.105
2.048GlnGlu: 2.048 ± 1.409
0.819GlnPhe: 0.819 ± 0.422
4.095GlnGly: 4.095 ± 1.325
0.819GlnHis: 0.819 ± 0.556
2.048GlnIle: 2.048 ± 0.936
0.819GlnLys: 0.819 ± 0.493
5.733GlnLeu: 5.733 ± 2.069
0.41GlnMet: 0.41 ± 0.323
1.229GlnAsn: 1.229 ± 0.963
4.505GlnPro: 4.505 ± 1.807
3.686GlnGln: 3.686 ± 1.228
3.276GlnArg: 3.276 ± 1.659
2.867GlnSer: 2.867 ± 1.117
1.229GlnThr: 1.229 ± 0.438
1.229GlnVal: 1.229 ± 0.433
1.229GlnTrp: 1.229 ± 0.606
2.048GlnTyr: 2.048 ± 0.991
0.0GlnXaa: 0.0 ± 0.0
Arg
3.686ArgAla: 3.686 ± 1.286
1.638ArgCys: 1.638 ± 1.119
2.048ArgAsp: 2.048 ± 1.179
2.048ArgGlu: 2.048 ± 0.381
3.276ArgPhe: 3.276 ± 1.653
1.638ArgGly: 1.638 ± 0.609
1.229ArgHis: 1.229 ± 0.697
1.638ArgIle: 1.638 ± 1.054
4.095ArgLys: 4.095 ± 0.834
6.143ArgLeu: 6.143 ± 1.567
0.819ArgMet: 0.819 ± 0.496
4.505ArgAsn: 4.505 ± 1.498
4.505ArgPro: 4.505 ± 2.456
2.867ArgGln: 2.867 ± 1.063
6.143ArgArg: 6.143 ± 2.174
4.095ArgSer: 4.095 ± 1.052
1.229ArgThr: 1.229 ± 0.659
4.095ArgVal: 4.095 ± 1.219
0.819ArgTrp: 0.819 ± 0.703
1.638ArgTyr: 1.638 ± 0.638
0.0ArgXaa: 0.0 ± 0.0
Ser
4.095SerAla: 4.095 ± 1.329
0.0SerCys: 0.0 ± 0.0
2.867SerAsp: 2.867 ± 0.918
5.324SerGlu: 5.324 ± 1.274
2.867SerPhe: 2.867 ± 1.274
7.781SerGly: 7.781 ± 1.988
0.819SerHis: 0.819 ± 0.646
2.457SerIle: 2.457 ± 1.023
4.095SerLys: 4.095 ± 1.281
8.19SerLeu: 8.19 ± 1.947
1.229SerMet: 1.229 ± 0.572
4.095SerAsn: 4.095 ± 2.485
4.095SerPro: 4.095 ± 1.921
3.686SerGln: 3.686 ± 0.587
4.095SerArg: 4.095 ± 1.406
6.962SerSer: 6.962 ± 3.603
6.143SerThr: 6.143 ± 1.745
2.457SerVal: 2.457 ± 0.83
0.41SerTrp: 0.41 ± 0.323
1.638SerTyr: 1.638 ± 0.647
0.0SerXaa: 0.0 ± 0.0
Thr
2.048ThrAla: 2.048 ± 0.684
1.638ThrCys: 1.638 ± 0.57
4.914ThrAsp: 4.914 ± 1.567
3.276ThrGlu: 3.276 ± 0.707
2.457ThrPhe: 2.457 ± 0.819
3.686ThrGly: 3.686 ± 1.217
0.0ThrHis: 0.0 ± 0.0
3.276ThrIle: 3.276 ± 2.011
2.048ThrLys: 2.048 ± 0.798
6.552ThrLeu: 6.552 ± 2.425
0.0ThrMet: 0.0 ± 0.0
3.686ThrAsn: 3.686 ± 0.968
2.867ThrPro: 2.867 ± 1.398
3.276ThrGln: 3.276 ± 2.162
4.505ThrArg: 4.505 ± 1.247
5.733ThrSer: 5.733 ± 2.424
2.867ThrThr: 2.867 ± 1.702
6.552ThrVal: 6.552 ± 1.64
0.819ThrTrp: 0.819 ± 0.487
2.048ThrTyr: 2.048 ± 0.89
0.0ThrXaa: 0.0 ± 0.0
Val
3.276ValAla: 3.276 ± 1.353
1.638ValCys: 1.638 ± 1.324
6.143ValAsp: 6.143 ± 1.32
3.276ValGlu: 3.276 ± 1.552
4.914ValPhe: 4.914 ± 0.899
4.095ValGly: 4.095 ± 1.149
1.229ValHis: 1.229 ± 0.459
2.457ValIle: 2.457 ± 0.867
2.457ValLys: 2.457 ± 1.212
4.095ValLeu: 4.095 ± 1.355
0.819ValMet: 0.819 ± 0.493
2.457ValAsn: 2.457 ± 1.06
5.324ValPro: 5.324 ± 1.161
2.048ValGln: 2.048 ± 1.035
3.276ValArg: 3.276 ± 1.633
6.552ValSer: 6.552 ± 1.398
2.867ValThr: 2.867 ± 1.224
2.457ValVal: 2.457 ± 0.819
0.819ValTrp: 0.819 ± 0.493
0.819ValTyr: 0.819 ± 0.496
0.0ValXaa: 0.0 ± 0.0
Trp
0.819TrpAla: 0.819 ± 0.646
0.0TrpCys: 0.0 ± 0.0
0.819TrpAsp: 0.819 ± 0.493
0.41TrpGlu: 0.41 ± 0.554
1.229TrpPhe: 1.229 ± 0.915
0.41TrpGly: 0.41 ± 0.323
0.0TrpHis: 0.0 ± 0.0
0.819TrpIle: 0.819 ± 0.406
1.638TrpLys: 1.638 ± 0.591
1.638TrpLeu: 1.638 ± 0.591
0.0TrpMet: 0.0 ± 0.0
0.41TrpAsn: 0.41 ± 0.341
0.819TrpPro: 0.819 ± 0.682
1.638TrpGln: 1.638 ± 0.828
0.819TrpArg: 0.819 ± 0.692
0.0TrpSer: 0.0 ± 0.0
0.819TrpThr: 0.819 ± 0.967
0.819TrpVal: 0.819 ± 0.637
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.867TyrAla: 2.867 ± 0.839
0.819TyrCys: 0.819 ± 0.836
0.819TyrAsp: 0.819 ± 0.585
2.048TyrGlu: 2.048 ± 0.469
2.457TyrPhe: 2.457 ± 0.973
2.048TyrGly: 2.048 ± 0.861
0.0TyrHis: 0.0 ± 0.0
0.41TyrIle: 0.41 ± 0.341
3.686TyrLys: 3.686 ± 1.01
4.914TyrLeu: 4.914 ± 1.099
1.638TyrMet: 1.638 ± 0.926
1.638TyrAsn: 1.638 ± 0.319
0.819TyrPro: 0.819 ± 0.406
0.819TyrGln: 0.819 ± 0.703
0.819TyrArg: 0.819 ± 0.682
1.229TyrSer: 1.229 ± 1.06
1.638TyrThr: 1.638 ± 0.821
1.229TyrVal: 1.229 ± 0.438
0.819TyrTrp: 0.819 ± 0.493
2.867TyrTyr: 2.867 ± 1.114
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2443 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski