Amino acid dipeptide frequency for Human papillomavirus 95

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
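The counting described above can be sketched in Python. This is only an illustration under stated assumptions, not the database's own procedure: sequences are given in one-letter codes (the table uses three-letter codes), any residue outside the 20 standard ones is mapped to X (Xaa), dipeptides are counted as overlapping pairs that never span two proteins, and counts are normalized by the total number of dipeptides. The function names and the toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard one-letter codes
ALPHABET = STANDARD + "X"          # X (Xaa) stands for anything else


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: non-standard or unknown residues are mapped to X.
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        raise ValueError("need at least one sequence of length >= 2")
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(ALPHABET, repeat=2)}


def classify(freq, expected=1000.0 / 400):
    """Compare a per-mille frequency with the uniform expectation (2.5)."""
    if freq > expected:
        return "over"
    if freq < expected:
        return "under"
    return "expected"


# Hypothetical toy sequences, not taken from the proteome.
toy = ["MAAGLLK", "MKLLAAU"]
freqs = dipeptide_per_mille(toy)
print(round(freqs["AA"], 1), classify(freqs["AA"]))
```

With such a helper, a table value like 3.271 (AlaAla) would be classified as "over" and 0.409 (AlaHis) as "under" relative to the 2.5‰ uniform expectation.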

All values are given in per mille (‰) and must therefore be multiplied by 10⁻³ to obtain fractions.
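As a small worked example of this unit conversion, the sketch below turns a per-mille value into a plain fraction and into a percentage, using the AlaAla entry from the table (3.271‰). The function names are illustrative only.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (1 percent = 10 per mille)."""
    return value / 10.0


ala_ala = 3.271  # AlaAla frequency from the table, in per mille
print(f"{per_mille_to_fraction(ala_ala):.6f}")  # fraction of all dipeptides
print(f"{per_mille_to_percent(ala_ala):.4f}")   # the same value in percent

# Uniform expectation over the 400 standard dipeptides, in per mille.
uniform_per_mille = 1000.0 / 400
print(uniform_per_mille)
```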

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.271 ± 1.238
AlaCys: 1.226 ± 0.514
AlaAsp: 4.088 ± 1.166
AlaGlu: 4.497 ± 0.702
AlaPhe: 1.635 ± 0.619
AlaGly: 2.044 ± 0.727
AlaHis: 0.409 ± 0.328
AlaIle: 3.679 ± 1.088
AlaLys: 3.271 ± 0.664
AlaLeu: 6.95 ± 1.923
AlaMet: 0.409 ± 0.366
AlaAsn: 3.271 ± 1.291
AlaPro: 2.862 ± 0.566
AlaGln: 3.271 ± 1.132
AlaArg: 4.088 ± 1.188
AlaSer: 1.635 ± 0.995
AlaThr: 4.088 ± 1.006
AlaVal: 2.453 ± 1.152
AlaTrp: 0.0 ± 0.0
AlaTyr: 2.044 ± 0.911
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.409 ± 0.515
CysCys: 2.044 ± 1.507
CysAsp: 0.818 ± 0.409
CysGlu: 1.226 ± 0.801
CysPhe: 0.818 ± 0.491
CysGly: 0.409 ± 0.425
CysHis: 0.409 ± 0.578
CysIle: 2.453 ± 1.095
CysLys: 0.409 ± 0.328
CysLeu: 2.044 ± 1.763
CysMet: 0.0 ± 0.0
CysAsn: 0.409 ± 0.343
CysPro: 1.226 ± 0.657
CysGln: 0.818 ± 0.422
CysArg: 1.635 ± 1.238
CysSer: 2.453 ± 1.234
CysThr: 0.818 ± 0.45
CysVal: 0.818 ± 0.807
CysTrp: 1.226 ± 0.442
CysTyr: 1.635 ± 1.164
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.541 ± 1.712
AspCys: 1.226 ± 0.405
AspAsp: 4.497 ± 1.189
AspGlu: 4.088 ± 1.729
AspPhe: 2.862 ± 1.301
AspGly: 2.862 ± 1.569
AspHis: 0.818 ± 0.482
AspIle: 6.132 ± 2.25
AspLys: 2.862 ± 1.439
AspLeu: 6.541 ± 2.807
AspMet: 0.409 ± 0.328
AspAsn: 4.497 ± 1.806
AspPro: 4.088 ± 1.518
AspGln: 2.453 ± 0.512
AspArg: 1.635 ± 0.542
AspSer: 3.271 ± 0.733
AspThr: 3.271 ± 1.327
AspVal: 3.679 ± 1.313
AspTrp: 1.226 ± 1.03
AspTyr: 1.635 ± 0.888
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.453 ± 0.932
GluCys: 1.635 ± 1.373
GluAsp: 5.315 ± 1.072
GluGlu: 9.812 ± 5.042
GluPhe: 2.453 ± 0.541
GluGly: 3.271 ± 0.617
GluHis: 0.409 ± 0.328
GluIle: 3.271 ± 1.133
GluLys: 2.044 ± 1.26
GluLeu: 4.906 ± 0.978
GluMet: 1.226 ± 0.68
GluAsn: 4.497 ± 1.089
GluPro: 2.862 ± 0.784
GluGln: 2.862 ± 0.723
GluArg: 2.862 ± 1.423
GluSer: 4.497 ± 1.102
GluThr: 2.862 ± 0.594
GluVal: 3.679 ± 1.168
GluTrp: 0.409 ± 0.343
GluTyr: 1.635 ± 0.91
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.226 ± 0.72
PheCys: 1.226 ± 0.738
PheAsp: 3.271 ± 1.495
PheGlu: 4.088 ± 0.75
PhePhe: 3.271 ± 0.987
PheGly: 3.271 ± 0.883
PheHis: 0.818 ± 0.619
PheIle: 1.226 ± 0.985
PheLys: 2.862 ± 1.109
PheLeu: 3.679 ± 1.862
PheMet: 0.409 ± 0.314
PheAsn: 1.226 ± 0.985
PhePro: 2.044 ± 0.407
PheGln: 2.044 ± 0.628
PheArg: 1.635 ± 0.704
PheSer: 2.862 ± 0.821
PheThr: 2.453 ± 0.998
PheVal: 3.271 ± 0.741
PheTrp: 0.818 ± 0.409
PheTyr: 2.044 ± 0.664
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.635 ± 0.534
GlyCys: 0.818 ± 0.549
GlyAsp: 4.906 ± 1.214
GlyGlu: 4.497 ± 1.639
GlyPhe: 2.453 ± 0.541
GlyGly: 3.679 ± 1.693
GlyHis: 1.226 ± 0.707
GlyIle: 4.088 ± 1.264
GlyLys: 2.453 ± 0.952
GlyLeu: 6.132 ± 2.741
GlyMet: 0.0 ± 0.0
GlyAsn: 1.635 ± 0.797
GlyPro: 3.271 ± 0.958
GlyGln: 4.088 ± 1.572
GlyArg: 5.315 ± 1.086
GlySer: 7.768 ± 1.853
GlyThr: 6.541 ± 2.347
GlyVal: 1.635 ± 0.986
GlyTrp: 0.0 ± 0.0
GlyTyr: 2.044 ± 0.984
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.226 ± 0.442
HisCys: 0.818 ± 0.651
HisAsp: 0.409 ± 0.578
HisGlu: 1.226 ± 0.744
HisPhe: 2.044 ± 0.598
HisGly: 0.409 ± 0.343
HisHis: 0.409 ± 0.425
HisIle: 2.044 ± 0.671
HisLys: 0.818 ± 0.45
HisLeu: 1.226 ± 0.405
HisMet: 0.0 ± 0.0
HisAsn: 0.409 ± 0.328
HisPro: 2.044 ± 0.917
HisGln: 0.818 ± 0.651
HisArg: 0.818 ± 0.45
HisSer: 1.635 ± 0.605
HisThr: 0.409 ± 0.425
HisVal: 0.409 ± 0.343
HisTrp: 0.409 ± 0.328
HisTyr: 0.818 ± 0.45
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.497 ± 1.111
IleCys: 1.226 ± 0.514
IleAsp: 4.497 ± 1.591
IleGlu: 3.679 ± 1.588
IlePhe: 2.044 ± 0.752
IleGly: 5.724 ± 2.827
IleHis: 0.818 ± 0.687
IleIle: 1.635 ± 0.846
IleLys: 1.635 ± 1.128
IleLeu: 3.271 ± 1.136
IleMet: 0.818 ± 0.849
IleAsn: 2.044 ± 0.454
IlePro: 2.453 ± 1.328
IleGln: 3.679 ± 1.139
IleArg: 1.635 ± 1.267
IleSer: 3.679 ± 1.05
IleThr: 3.679 ± 1.281
IleVal: 3.679 ± 1.689
IleTrp: 0.409 ± 0.425
IleTyr: 1.635 ± 0.846
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.635 ± 0.955
LysCys: 1.226 ± 0.387
LysAsp: 2.453 ± 1.092
LysGlu: 2.044 ± 1.232
LysPhe: 1.226 ± 0.646
LysGly: 3.679 ± 1.855
LysHis: 2.044 ± 0.911
LysIle: 1.635 ± 0.91
LysLys: 2.453 ± 1.005
LysLeu: 4.497 ± 2.001
LysMet: 1.226 ± 0.622
LysAsn: 0.818 ± 0.422
LysPro: 1.635 ± 1.156
LysGln: 3.679 ± 1.223
LysArg: 5.724 ± 1.252
LysSer: 5.315 ± 3.187
LysThr: 1.635 ± 0.751
LysVal: 3.271 ± 0.766
LysTrp: 0.818 ± 0.619
LysTyr: 1.635 ± 0.629
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.497 ± 1.418
LeuCys: 2.044 ± 0.854
LeuAsp: 6.132 ± 1.981
LeuGlu: 2.862 ± 0.917
LeuPhe: 4.906 ± 0.783
LeuGly: 6.541 ± 2.313
LeuHis: 1.635 ± 1.292
LeuIle: 4.906 ± 0.959
LeuLys: 6.95 ± 2.418
LeuLeu: 9.812 ± 2.589
LeuMet: 1.226 ± 0.688
LeuAsn: 2.862 ± 0.769
LeuPro: 4.906 ± 1.117
LeuGln: 6.132 ± 1.966
LeuArg: 4.497 ± 0.861
LeuSer: 7.768 ± 1.167
LeuThr: 4.906 ± 1.61
LeuVal: 4.497 ± 1.822
LeuTrp: 0.818 ± 0.769
LeuTyr: 6.541 ± 1.048
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.635 ± 0.467
MetCys: 0.409 ± 0.328
MetAsp: 0.409 ± 0.343
MetGlu: 0.818 ± 0.482
MetPhe: 0.409 ± 0.328
MetGly: 0.409 ± 0.328
MetHis: 0.409 ± 0.425
MetIle: 0.409 ± 0.425
MetLys: 0.0 ± 0.0
MetLeu: 2.044 ± 0.939
MetMet: 0.0 ± 0.0
MetAsn: 0.818 ± 0.423
MetPro: 0.409 ± 0.343
MetGln: 0.0 ± 0.0
MetArg: 0.818 ± 0.579
MetSer: 2.044 ± 0.911
MetThr: 0.409 ± 0.328
MetVal: 1.635 ± 0.797
MetTrp: 0.0 ± 0.0
MetTyr: 0.818 ± 0.687
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.453 ± 1.014
AsnCys: 1.226 ± 0.738
AsnAsp: 0.409 ± 0.328
AsnGlu: 0.818 ± 0.45
AsnPhe: 1.635 ± 0.955
AsnGly: 2.862 ± 1.047
AsnHis: 1.635 ± 0.821
AsnIle: 2.044 ± 1.326
AsnLys: 1.226 ± 0.405
AsnLeu: 3.271 ± 1.269
AsnMet: 0.0 ± 0.0
AsnAsn: 3.679 ± 1.314
AsnPro: 3.271 ± 0.954
AsnGln: 1.635 ± 0.788
AsnArg: 3.271 ± 1.21
AsnSer: 3.679 ± 1.162
AsnThr: 4.497 ± 1.347
AsnVal: 2.453 ± 0.725
AsnTrp: 1.635 ± 0.765
AsnTyr: 2.044 ± 1.825
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.906 ± 1.634
ProCys: 0.409 ± 0.549
ProAsp: 4.088 ± 1.659
ProGlu: 3.679 ± 1.044
ProPhe: 2.453 ± 0.77
ProGly: 2.044 ± 1.177
ProHis: 0.818 ± 0.423
ProIle: 1.226 ± 0.405
ProLys: 2.862 ± 0.964
ProLeu: 5.724 ± 1.643
ProMet: 1.226 ± 0.679
ProAsn: 2.044 ± 0.885
ProPro: 9.812 ± 3.552
ProGln: 3.271 ± 0.734
ProArg: 1.635 ± 0.837
ProSer: 6.132 ± 2.193
ProThr: 3.679 ± 1.674
ProVal: 3.271 ± 1.295
ProTrp: 0.409 ± 0.425
ProTyr: 1.226 ± 0.707
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.044 ± 0.756
GlnCys: 1.226 ± 1.267
GlnAsp: 3.271 ± 0.497
GlnGlu: 3.271 ± 1.024
GlnPhe: 2.044 ± 0.759
GlnGly: 2.453 ± 0.838
GlnHis: 0.818 ± 0.422
GlnIle: 2.862 ± 0.947
GlnLys: 1.226 ± 0.835
GlnLeu: 6.95 ± 1.34
GlnMet: 1.226 ± 1.192
GlnAsn: 1.635 ± 0.636
GlnPro: 2.044 ± 0.598
GlnGln: 2.453 ± 1.265
GlnArg: 1.226 ± 0.637
GlnSer: 3.271 ± 0.77
GlnThr: 5.315 ± 1.342
GlnVal: 2.862 ± 0.922
GlnTrp: 0.409 ± 0.343
GlnTyr: 1.635 ± 0.955
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.088 ± 1.078
ArgCys: 1.226 ± 0.514
ArgAsp: 2.044 ± 1.115
ArgGlu: 3.679 ± 0.803
ArgPhe: 2.862 ± 1.696
ArgGly: 7.359 ± 2.515
ArgHis: 1.226 ± 0.707
ArgIle: 2.044 ± 1.171
ArgLys: 4.088 ± 0.655
ArgLeu: 5.315 ± 1.342
ArgMet: 1.635 ± 0.803
ArgAsn: 3.679 ± 1.571
ArgPro: 2.862 ± 1.486
ArgGln: 2.453 ± 0.85
ArgArg: 6.132 ± 1.321
ArgSer: 2.044 ± 0.746
ArgThr: 0.818 ± 0.687
ArgVal: 4.088 ± 1.183
ArgTrp: 0.409 ± 0.366
ArgTyr: 1.226 ± 0.805
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.271 ± 0.698
SerCys: 1.226 ± 0.66
SerAsp: 4.906 ± 1.702
SerGlu: 4.497 ± 1.383
SerPhe: 3.271 ± 1.204
SerGly: 8.177 ± 3.328
SerHis: 1.635 ± 0.843
SerIle: 2.862 ± 1.447
SerLys: 2.453 ± 0.96
SerLeu: 8.177 ± 1.561
SerMet: 0.409 ± 0.343
SerAsn: 3.679 ± 0.986
SerPro: 2.862 ± 1.182
SerGln: 4.088 ± 1.508
SerArg: 7.768 ± 1.387
SerSer: 7.359 ± 1.893
SerThr: 6.541 ± 2.057
SerVal: 3.679 ± 0.711
SerTrp: 0.818 ± 0.409
SerTyr: 1.635 ± 0.986
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.453 ± 0.541
ThrCys: 0.409 ± 0.515
ThrAsp: 4.497 ± 0.847
ThrGlu: 3.679 ± 1.376
ThrPhe: 1.635 ± 0.977
ThrGly: 4.088 ± 1.13
ThrHis: 0.818 ± 0.482
ThrIle: 5.315 ± 3.442
ThrLys: 2.862 ± 1.412
ThrLeu: 5.315 ± 1.403
ThrMet: 1.635 ± 0.629
ThrAsn: 2.044 ± 0.844
ThrPro: 6.541 ± 1.792
ThrGln: 2.044 ± 0.476
ThrArg: 2.453 ± 0.809
ThrSer: 6.541 ± 1.209
ThrThr: 4.088 ± 1.14
ThrVal: 3.679 ± 0.834
ThrTrp: 0.409 ± 0.343
ThrTyr: 2.453 ± 0.541
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.862 ± 0.597
ValCys: 1.635 ± 1.264
ValAsp: 5.315 ± 1.668
ValGlu: 3.271 ± 1.007
ValPhe: 2.044 ± 1.128
ValGly: 2.862 ± 1.407
ValHis: 1.226 ± 0.612
ValIle: 1.635 ± 0.605
ValLys: 4.497 ± 0.943
ValLeu: 4.088 ± 0.953
ValMet: 0.818 ± 0.657
ValAsn: 1.226 ± 0.68
ValPro: 4.088 ± 1.681
ValGln: 1.226 ± 0.676
ValArg: 3.271 ± 0.83
ValSer: 5.724 ± 1.255
ValThr: 2.044 ± 0.955
ValVal: 4.088 ± 1.257
ValTrp: 1.226 ± 0.847
ValTyr: 2.044 ± 0.651
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.409 ± 0.343
TrpCys: 0.0 ± 0.0
TrpAsp: 1.226 ± 0.805
TrpGlu: 0.409 ± 0.425
TrpPhe: 0.409 ± 0.425
TrpGly: 0.0 ± 0.0
TrpHis: 0.409 ± 0.425
TrpIle: 0.818 ± 0.687
TrpLys: 0.818 ± 0.687
TrpLeu: 1.635 ± 0.605
TrpMet: 0.0 ± 0.0
TrpAsn: 0.409 ± 0.425
TrpPro: 0.409 ± 0.328
TrpGln: 0.409 ± 0.328
TrpArg: 1.226 ± 0.785
TrpSer: 0.0 ± 0.0
TrpThr: 2.044 ± 1.155
TrpVal: 0.818 ± 0.45
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.818 ± 0.687
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.679 ± 1.213
TyrCys: 0.818 ± 1.098
TyrAsp: 2.044 ± 0.407
TyrGlu: 0.818 ± 0.651
TyrPhe: 3.271 ± 1.086
TyrGly: 2.044 ± 0.91
TyrHis: 0.409 ± 0.343
TyrIle: 2.453 ± 0.978
TyrLys: 2.862 ± 1.397
TyrLeu: 2.862 ± 0.774
TyrMet: 0.818 ± 0.463
TyrAsn: 2.862 ± 0.9
TyrPro: 1.226 ± 0.866
TyrGln: 1.226 ± 0.442
TyrArg: 1.635 ± 0.629
TyrSer: 1.635 ± 0.583
TyrThr: 2.862 ± 0.808
TyrVal: 1.226 ± 0.676
TyrTrp: 0.818 ± 0.482
TyrTyr: 1.635 ± 0.904
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2447 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
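A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration, not the database's actual code: it assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each replicate, and that the reported error is the standard deviation across replicates. The function names, the one-letter sequences, and the fixed seed are hypothetical.

```python
import random
import statistics


def dipeptide_per_mille(sequences, dipeptide):
    """Per-mille frequency of one dipeptide across a list of sequences."""
    hits = 0
    total = 0
    for seq in sequences:
        # Overlapping windows of length 2 within each protein.
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, dipeptide, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


# Hypothetical toy proteome in one-letter codes.
toy = ["AAKAA", "GLLK", "MKVAA"]
print(bootstrap_error(toy, "AA"))
print(bootstrap_error(toy, "WW"))  # never observed, so every replicate is 0
```

Under these assumptions, a dipeptide that never occurs in any protein has a frequency of zero in every replicate, which is consistent with the "0.0 ± 0.0" entries in the table.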

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski