Amino acid dipeptide frequency for Human papillomavirus type 161

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
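As a minimal sketch of how such frequencies could be computed, overlapping pairs of adjacent residues can be counted within each protein and divided by the total number of pairs. The function name `dipeptide_frequencies`, the toy sequences, the use of one-letter codes, and the choice not to count pairs across protein boundaries are all illustrative assumptions, not the Proteome-pI implementation.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Overlapping pairs are counted within each protein only; pairs that
    would span two proteins are not counted (an assumption of this sketch).
    """
    counts = Counter()
    for seq in proteins:
        # zip(seq, seq[1:]) yields every overlapping pair of adjacent residues
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input: "MAAL" gives MA, AA, AL and "AAK" gives AA, AK (5 pairs in total)
freqs = dipeptide_frequencies(["MAAL", "AAK"])
```

Here `freqs["AA"]` is the share of all counted pairs that are Ala-Ala, expressed in per mille.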

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones in blue, in the table.

All values are given in per mille (‰) and must therefore be multiplied by 10⁻³ to obtain fractions.
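The arithmetic above can be sketched as follows. The helper names are hypothetical, and the simple greater-than or less-than comparison against the uniform expectation is an assumption; the exact rule used for the red and blue marking is not stated here.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2

def to_fraction(value_per_mille):
    """Convert a per mille table value to a plain fraction."""
    return value_per_mille * 1e-3

def classify(value_per_mille, alphabet_size=20):
    """Label a table value relative to the uniform expectation (assumed rule)."""
    expected = expected_per_mille(alphabet_size)
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"

# 20 amino acids: 1000 / 400 = 2.5 per mille, i.e. 0.25%
uniform_20 = expected_per_mille(20)
# 21 symbols including Xaa: 1000 / 441 per mille
uniform_21 = expected_per_mille(21)
ser_leu = classify(9.325)   # SerLeu value from the table
ala_trp = classify(0.444)   # AlaTrp value from the table
```

With this rule, SerLeu (9.325‰) lies above the 2.5‰ expectation and AlaTrp (0.444‰) lies below it.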

For more information, see the sequence space article on Wikipedia.

Values are listed as frequency (‰) ± error, grouped by the first residue of the dipeptide. Within each group, the second residue follows the order Ala, Cys, Asp, Glu, Phe, Gly, His, Ile, Lys, Leu, Met, Asn, Pro, Gln, Arg, Ser, Thr, Val, Trp, Tyr, Xaa.

Ala
AlaAla: 5.329 ± 0.809
AlaCys: 1.332 ± 0.93
AlaAsp: 5.773 ± 1.0
AlaGlu: 6.661 ± 1.956
AlaPhe: 3.552 ± 1.208
AlaGly: 2.664 ± 0.81
AlaHis: 0.888 ± 0.743
AlaIle: 3.108 ± 0.639
AlaLys: 2.664 ± 0.93
AlaLeu: 7.993 ± 2.784
AlaMet: 0.0 ± 0.0
AlaAsn: 1.332 ± 0.448
AlaPro: 1.332 ± 0.907
AlaGln: 0.888 ± 0.439
AlaArg: 3.108 ± 0.965
AlaSer: 3.996 ± 1.312
AlaThr: 3.108 ± 0.674
AlaVal: 2.664 ± 0.956
AlaTrp: 0.444 ± 0.363
AlaTyr: 1.332 ± 0.649
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.22 ± 1.452
CysCys: 1.776 ± 1.359
CysAsp: 2.22 ± 1.174
CysGlu: 0.444 ± 0.569
CysPhe: 0.888 ± 0.377
CysGly: 0.444 ± 0.569
CysHis: 0.0 ± 0.0
CysIle: 2.664 ± 1.859
CysLys: 1.332 ± 0.618
CysLeu: 1.776 ± 1.082
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 2.664 ± 1.481
CysGln: 0.444 ± 0.363
CysArg: 0.888 ± 0.907
CysSer: 3.552 ± 1.415
CysThr: 0.888 ± 0.439
CysVal: 1.332 ± 0.778
CysTrp: 0.888 ± 0.438
CysTyr: 1.332 ± 0.561
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.996 ± 1.153
AspCys: 2.664 ± 0.755
AspAsp: 3.108 ± 0.763
AspGlu: 4.885 ± 0.805
AspPhe: 1.776 ± 0.822
AspGly: 2.664 ± 1.228
AspHis: 0.888 ± 0.438
AspIle: 6.217 ± 1.858
AspLys: 0.888 ± 0.437
AspLeu: 6.661 ± 1.6
AspMet: 0.888 ± 0.439
AspAsn: 3.552 ± 0.97
AspPro: 6.661 ± 1.381
AspGln: 1.776 ± 0.217
AspArg: 3.552 ± 1.049
AspSer: 4.885 ± 1.016
AspThr: 4.885 ± 0.818
AspVal: 5.773 ± 1.974
AspTrp: 1.776 ± 0.578
AspTyr: 2.22 ± 0.463
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.108 ± 1.514
GluCys: 0.888 ± 0.726
GluAsp: 2.664 ± 0.845
GluGlu: 5.329 ± 2.016
GluPhe: 2.22 ± 0.607
GluGly: 3.552 ± 0.753
GluHis: 0.444 ± 0.418
GluIle: 3.552 ± 1.468
GluLys: 2.664 ± 1.478
GluLeu: 4.885 ± 2.144
GluMet: 1.332 ± 0.636
GluAsn: 1.776 ± 0.65
GluPro: 4.885 ± 1.441
GluGln: 3.552 ± 1.089
GluArg: 2.664 ± 1.029
GluSer: 6.217 ± 1.193
GluThr: 3.996 ± 0.815
GluVal: 3.552 ± 1.293
GluTrp: 0.444 ± 0.363
GluTyr: 2.22 ± 0.757
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.996 ± 0.947
PheCys: 1.332 ± 0.93
PheAsp: 3.108 ± 0.987
PheGlu: 3.108 ± 1.898
PhePhe: 3.108 ± 1.872
PheGly: 2.22 ± 0.412
PheHis: 1.776 ± 0.927
PheIle: 3.552 ± 0.544
PheLys: 0.888 ± 0.437
PheLeu: 4.44 ± 0.733
PheMet: 0.888 ± 0.374
PheAsn: 1.776 ± 0.879
PhePro: 2.22 ± 0.366
PheGln: 2.22 ± 0.578
PheArg: 2.664 ± 0.959
PheSer: 2.22 ± 0.81
PheThr: 2.22 ± 0.933
PheVal: 2.22 ± 0.703
PheTrp: 1.332 ± 0.728
PheTyr: 2.22 ± 0.463
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.664 ± 0.424
GlyCys: 0.888 ± 0.743
GlyAsp: 6.217 ± 1.018
GlyGlu: 3.552 ± 1.302
GlyPhe: 1.332 ± 0.33
GlyGly: 4.885 ± 2.752
GlyHis: 2.22 ± 0.973
GlyIle: 2.22 ± 0.652
GlyLys: 1.776 ± 1.046
GlyLeu: 3.996 ± 1.652
GlyMet: 0.444 ± 0.418
GlyAsn: 4.885 ± 0.633
GlyPro: 4.44 ± 1.771
GlyGln: 1.332 ± 0.771
GlyArg: 3.996 ± 1.315
GlySer: 3.552 ± 1.215
GlyThr: 3.996 ± 0.369
GlyVal: 2.664 ± 1.291
GlyTrp: 0.0 ± 0.0
GlyTyr: 0.444 ± 0.355
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 1.332 ± 1.089
HisAsp: 0.888 ± 0.437
HisGlu: 0.0 ± 0.0
HisPhe: 1.332 ± 0.601
HisGly: 0.444 ± 0.569
HisHis: 0.444 ± 0.569
HisIle: 1.776 ± 0.822
HisLys: 0.888 ± 0.437
HisLeu: 0.0 ± 0.0
HisMet: 0.444 ± 0.372
HisAsn: 0.444 ± 0.372
HisPro: 2.22 ± 1.058
HisGln: 0.888 ± 0.439
HisArg: 1.776 ± 0.791
HisSer: 1.776 ± 0.753
HisThr: 0.444 ± 0.372
HisVal: 1.332 ± 0.448
HisTrp: 0.444 ± 0.418
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.108 ± 0.813
IleCys: 0.888 ± 0.665
IleAsp: 4.44 ± 1.147
IleGlu: 4.885 ± 0.674
IlePhe: 3.108 ± 0.549
IleGly: 2.664 ± 1.352
IleHis: 0.0 ± 0.0
IleIle: 3.108 ± 1.893
IleLys: 1.776 ± 0.578
IleLeu: 1.332 ± 0.673
IleMet: 0.444 ± 0.355
IleAsn: 3.552 ± 0.768
IlePro: 4.885 ± 2.442
IleGln: 1.332 ± 0.33
IleArg: 3.108 ± 1.323
IleSer: 2.22 ± 0.783
IleThr: 4.885 ± 1.208
IleVal: 4.44 ± 1.144
IleTrp: 0.0 ± 0.0
IleTyr: 2.22 ± 0.948
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.664 ± 0.926
LysCys: 2.22 ± 0.665
LysAsp: 1.776 ± 0.879
LysGlu: 1.776 ± 0.647
LysPhe: 2.664 ± 1.025
LysGly: 3.108 ± 1.376
LysHis: 1.332 ± 1.089
LysIle: 2.664 ± 0.907
LysLys: 1.776 ± 0.873
LysLeu: 2.664 ± 0.981
LysMet: 2.22 ± 1.121
LysAsn: 2.664 ± 1.158
LysPro: 1.332 ± 1.115
LysGln: 1.776 ± 0.217
LysArg: 5.329 ± 0.775
LysSer: 4.885 ± 2.2
LysThr: 2.664 ± 0.93
LysVal: 3.552 ± 1.481
LysTrp: 0.888 ± 0.412
LysTyr: 4.44 ± 1.89
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.885 ± 1.311
LeuCys: 3.552 ± 2.097
LeuAsp: 6.217 ± 2.072
LeuGlu: 2.22 ± 1.35
LeuPhe: 3.996 ± 1.016
LeuGly: 5.773 ± 2.026
LeuHis: 1.776 ± 0.753
LeuIle: 2.664 ± 1.431
LeuLys: 5.773 ± 1.481
LeuLeu: 7.549 ± 1.756
LeuMet: 2.22 ± 1.06
LeuAsn: 3.996 ± 0.812
LeuPro: 2.664 ± 1.227
LeuGln: 6.217 ± 1.678
LeuArg: 3.996 ± 1.657
LeuSer: 5.329 ± 1.727
LeuThr: 3.996 ± 1.103
LeuVal: 5.329 ± 1.243
LeuTrp: 0.0 ± 0.0
LeuTyr: 5.773 ± 1.319
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.332 ± 0.476
MetCys: 0.0 ± 0.0
MetAsp: 0.888 ± 0.726
MetGlu: 1.776 ± 0.217
MetPhe: 1.776 ± 1.027
MetGly: 0.888 ± 0.726
MetHis: 0.444 ± 0.355
MetIle: 0.0 ± 0.0
MetLys: 0.0 ± 0.0
MetLeu: 0.888 ± 0.437
MetMet: 0.0 ± 0.0
MetAsn: 0.888 ± 0.439
MetPro: 0.444 ± 0.363
MetGln: 0.444 ± 0.372
MetArg: 1.776 ± 0.789
MetSer: 2.664 ± 1.431
MetThr: 1.332 ± 0.362
MetVal: 0.888 ± 0.726
MetTrp: 0.0 ± 0.0
MetTyr: 0.444 ± 0.355
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.664 ± 1.312
AsnCys: 0.888 ± 0.586
AsnAsp: 2.664 ± 0.493
AsnGlu: 3.108 ± 1.192
AsnPhe: 0.888 ± 0.743
AsnGly: 1.332 ± 0.697
AsnHis: 0.444 ± 0.363
AsnIle: 2.664 ± 0.408
AsnLys: 5.773 ± 1.431
AsnLeu: 3.108 ± 1.302
AsnMet: 0.444 ± 0.363
AsnAsn: 3.108 ± 1.233
AsnPro: 3.996 ± 2.019
AsnGln: 1.332 ± 1.115
AsnArg: 2.22 ± 0.691
AsnSer: 2.664 ± 1.746
AsnThr: 3.108 ± 1.157
AsnVal: 4.44 ± 0.823
AsnTrp: 2.22 ± 1.091
AsnTyr: 2.22 ± 0.616
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.996 ± 2.252
ProCys: 0.888 ± 0.907
ProAsp: 6.661 ± 2.269
ProGlu: 3.552 ± 0.884
ProPhe: 2.664 ± 0.424
ProGly: 1.332 ± 0.673
ProHis: 0.444 ± 0.355
ProIle: 3.996 ± 2.388
ProLys: 4.44 ± 0.927
ProLeu: 4.885 ± 0.973
ProMet: 0.444 ± 0.372
ProAsn: 2.664 ± 0.697
ProPro: 5.773 ± 1.507
ProGln: 1.332 ± 0.33
ProArg: 2.22 ± 0.694
ProSer: 4.885 ± 1.196
ProThr: 4.885 ± 1.9
ProVal: 3.552 ± 2.377
ProTrp: 0.888 ± 0.438
ProTyr: 2.664 ± 0.579
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.664 ± 0.76
GlnCys: 0.888 ± 0.377
GlnAsp: 2.22 ± 0.352
GlnGlu: 1.776 ± 0.771
GlnPhe: 2.22 ± 0.652
GlnGly: 2.664 ± 0.493
GlnHis: 0.888 ± 0.437
GlnIle: 2.664 ± 0.796
GlnLys: 0.444 ± 0.418
GlnLeu: 3.996 ± 1.359
GlnMet: 0.888 ± 0.62
GlnAsn: 0.888 ± 0.438
GlnPro: 1.776 ± 0.217
GlnGln: 1.332 ± 0.686
GlnArg: 1.332 ± 0.473
GlnSer: 0.888 ± 0.438
GlnThr: 2.22 ± 0.832
GlnVal: 3.996 ± 1.766
GlnTrp: 0.888 ± 0.437
GlnTyr: 0.444 ± 0.363
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.664 ± 1.346
ArgCys: 1.776 ± 0.927
ArgAsp: 4.44 ± 1.373
ArgGlu: 1.776 ± 0.217
ArgPhe: 1.776 ± 0.791
ArgGly: 6.217 ± 2.126
ArgHis: 1.332 ± 0.697
ArgIle: 1.332 ± 0.362
ArgLys: 7.105 ± 1.596
ArgLeu: 6.217 ± 0.661
ArgMet: 0.888 ± 0.581
ArgAsn: 2.22 ± 0.955
ArgPro: 3.552 ± 1.449
ArgGln: 0.888 ± 0.439
ArgArg: 4.44 ± 1.721
ArgSer: 4.885 ± 1.776
ArgThr: 1.776 ± 0.65
ArgVal: 3.552 ± 1.853
ArgTrp: 0.444 ± 0.418
ArgTyr: 0.888 ± 0.438
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.44 ± 1.415
SerCys: 0.888 ± 0.439
SerAsp: 3.108 ± 0.754
SerGlu: 2.22 ± 0.843
SerPhe: 3.108 ± 1.562
SerGly: 5.329 ± 0.687
SerHis: 1.776 ± 0.217
SerIle: 4.885 ± 1.066
SerLys: 3.552 ± 0.836
SerLeu: 9.325 ± 2.371
SerMet: 1.332 ± 0.715
SerAsn: 4.44 ± 2.203
SerPro: 4.44 ± 0.928
SerGln: 2.664 ± 0.777
SerArg: 4.885 ± 1.591
SerSer: 6.217 ± 2.485
SerThr: 6.661 ± 1.823
SerVal: 5.329 ± 0.545
SerTrp: 0.444 ± 0.372
SerTyr: 0.888 ± 0.377
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.552 ± 0.948
ThrCys: 0.888 ± 0.437
ThrAsp: 6.217 ± 0.899
ThrGlu: 6.661 ± 1.837
ThrPhe: 3.996 ± 1.087
ThrGly: 3.552 ± 0.78
ThrHis: 0.888 ± 0.412
ThrIle: 1.332 ± 0.636
ThrLys: 1.332 ± 0.728
ThrLeu: 4.44 ± 0.717
ThrMet: 0.444 ± 0.363
ThrAsn: 3.552 ± 1.062
ThrPro: 2.664 ± 0.493
ThrGln: 0.888 ± 0.412
ThrArg: 2.22 ± 1.333
ThrSer: 5.329 ± 1.563
ThrThr: 6.661 ± 2.996
ThrVal: 6.661 ± 1.598
ThrTrp: 1.332 ± 0.627
ThrTyr: 1.776 ± 0.875
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.22 ± 0.662
ValCys: 1.776 ± 1.124
ValAsp: 3.996 ± 0.97
ValGlu: 3.996 ± 1.145
ValPhe: 3.108 ± 0.803
ValGly: 3.552 ± 1.3
ValHis: 0.444 ± 0.355
ValIle: 3.108 ± 0.763
ValLys: 4.44 ± 1.278
ValLeu: 4.885 ± 1.288
ValMet: 1.776 ± 0.85
ValAsn: 3.996 ± 0.505
ValPro: 3.996 ± 0.668
ValGln: 3.108 ± 1.346
ValArg: 3.108 ± 0.805
ValSer: 7.549 ± 0.774
ValThr: 4.885 ± 0.51
ValVal: 3.108 ± 1.755
ValTrp: 2.22 ± 0.978
ValTyr: 1.332 ± 0.627
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.332 ± 0.33
TrpCys: 0.0 ± 0.0
TrpAsp: 1.776 ± 0.876
TrpGlu: 0.0 ± 0.0
TrpPhe: 1.332 ± 0.448
TrpGly: 0.888 ± 0.743
TrpHis: 0.444 ± 0.418
TrpIle: 0.888 ± 0.437
TrpLys: 1.332 ± 0.728
TrpLeu: 1.332 ± 0.715
TrpMet: 0.444 ± 0.372
TrpAsn: 0.0 ± 0.0
TrpPro: 0.444 ± 0.372
TrpGln: 0.888 ± 0.726
TrpArg: 2.22 ± 1.475
TrpSer: 0.0 ± 0.0
TrpThr: 0.888 ± 0.835
TrpVal: 0.888 ± 0.437
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.332 ± 0.362
TyrCys: 0.888 ± 0.541
TyrAsp: 1.332 ± 0.618
TyrGlu: 2.22 ± 0.843
TyrPhe: 2.664 ± 0.431
TyrGly: 1.332 ± 0.448
TyrHis: 0.0 ± 0.0
TyrIle: 0.0 ± 0.0
TyrLys: 3.552 ± 1.021
TyrLeu: 3.552 ± 0.836
TyrMet: 0.888 ± 0.726
TyrAsn: 3.552 ± 0.996
TyrPro: 1.776 ± 0.712
TyrGln: 2.22 ± 0.463
TyrArg: 2.664 ± 0.632
TyrSer: 2.22 ± 0.888
TyrThr: 0.888 ± 0.412
TyrVal: 1.332 ± 0.362
TyrTrp: 0.444 ± 0.372
TyrTyr: 1.332 ± 0.771
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2253 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
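A protein-level bootstrap of this kind might look like the sketch below: whole proteins are resampled with replacement, the frequency is recomputed on each resample, and the spread of the resulting estimates is reported. The function names, the toy sequences, the fixed seed, and the use of the standard deviation as the reported error are assumptions of this sketch; the source states only that bootstrapping was done 100 times at the protein level.

```python
import random
import statistics
from collections import Counter

def pair_frequency(proteins, pair):
    """Frequency of one dipeptide in per mille across a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Count overlapping adjacent pairs within each protein
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the frequency estimate under protein-level resampling."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    # Standard deviation of the estimates (assumed error definition)
    return statistics.pstdev(estimates)

toy_proteome = ["MAAL", "AAK", "MKKL"]
error_aa = bootstrap_error(toy_proteome, "AA")
```

With only a handful of proteins, as in this proteome, each resample can differ substantially from the original set, which is consistent with the relatively large errors in the table.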

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski