Amino acid dipepetide frequency for Human papillomavirus sp.

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.357AlaAla: 3.357 ± 1.044
0.839AlaCys: 0.839 ± 0.641
3.777AlaAsp: 3.777 ± 0.784
2.098AlaGlu: 2.098 ± 0.908
2.937AlaPhe: 2.937 ± 1.024
2.518AlaGly: 2.518 ± 0.73
0.42AlaHis: 0.42 ± 0.443
2.098AlaIle: 2.098 ± 0.425
2.098AlaLys: 2.098 ± 1.42
4.196AlaLeu: 4.196 ± 2.091
1.259AlaMet: 1.259 ± 0.625
1.679AlaAsn: 1.679 ± 1.136
4.196AlaPro: 4.196 ± 1.248
3.357AlaGln: 3.357 ± 1.944
4.196AlaArg: 4.196 ± 2.029
3.357AlaSer: 3.357 ± 0.879
5.036AlaThr: 5.036 ± 1.291
1.679AlaVal: 1.679 ± 0.589
0.42AlaTrp: 0.42 ± 0.342
1.259AlaTyr: 1.259 ± 0.693
0.0AlaXaa: 0.0 ± 0.0
Cys
2.098CysAla: 2.098 ± 1.12
0.839CysCys: 0.839 ± 0.747
0.42CysAsp: 0.42 ± 0.321
1.259CysGlu: 1.259 ± 0.519
0.839CysPhe: 0.839 ± 0.532
0.42CysGly: 0.42 ± 0.321
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
1.259CysLys: 1.259 ± 0.365
2.098CysLeu: 2.098 ± 0.98
0.42CysMet: 0.42 ± 0.321
0.42CysAsn: 0.42 ± 0.709
1.679CysPro: 1.679 ± 0.589
0.42CysGln: 0.42 ± 0.709
0.42CysArg: 0.42 ± 0.342
2.098CysSer: 2.098 ± 2.937
0.839CysThr: 0.839 ± 0.641
0.839CysVal: 0.839 ± 0.548
0.42CysTrp: 0.42 ± 0.321
0.839CysTyr: 0.839 ± 0.858
0.0CysXaa: 0.0 ± 0.0
Asp
1.259AspAla: 1.259 ± 0.622
1.259AspCys: 1.259 ± 0.519
2.518AspAsp: 2.518 ± 0.85
5.875AspGlu: 5.875 ± 1.293
2.937AspPhe: 2.937 ± 0.771
2.098AspGly: 2.098 ± 0.818
0.839AspHis: 0.839 ± 1.417
4.196AspIle: 4.196 ± 0.857
2.098AspLys: 2.098 ± 1.315
8.393AspLeu: 8.393 ± 1.813
0.42AspMet: 0.42 ± 0.342
6.295AspAsn: 6.295 ± 1.277
3.777AspPro: 3.777 ± 1.326
2.518AspGln: 2.518 ± 0.676
1.679AspArg: 1.679 ± 0.939
3.777AspSer: 3.777 ± 1.264
3.777AspThr: 3.777 ± 1.618
2.937AspVal: 2.937 ± 0.934
1.679AspTrp: 1.679 ± 0.691
1.259AspTyr: 1.259 ± 0.365
0.0AspXaa: 0.0 ± 0.0
Glu
2.937GluAla: 2.937 ± 1.518
0.839GluCys: 0.839 ± 0.747
4.196GluAsp: 4.196 ± 0.92
7.134GluGlu: 7.134 ± 3.071
2.518GluPhe: 2.518 ± 0.616
1.259GluGly: 1.259 ± 0.754
2.098GluHis: 2.098 ± 0.781
3.357GluIle: 3.357 ± 1.523
2.098GluLys: 2.098 ± 0.824
7.134GluLeu: 7.134 ± 3.518
2.098GluMet: 2.098 ± 1.2
5.875GluAsn: 5.875 ± 0.979
2.937GluPro: 2.937 ± 0.504
2.937GluGln: 2.937 ± 0.715
2.937GluArg: 2.937 ± 1.0
5.455GluSer: 5.455 ± 1.5
4.616GluThr: 4.616 ± 1.529
3.777GluVal: 3.777 ± 0.981
0.42GluTrp: 0.42 ± 0.321
2.098GluTyr: 2.098 ± 1.063
0.0GluXaa: 0.0 ± 0.0
Phe
2.098PheAla: 2.098 ± 0.665
2.098PheCys: 2.098 ± 1.029
3.777PheAsp: 3.777 ± 0.94
1.679PheGlu: 1.679 ± 0.835
1.679PhePhe: 1.679 ± 0.589
2.937PheGly: 2.937 ± 0.608
0.839PheHis: 0.839 ± 0.703
3.777PheIle: 3.777 ± 1.486
2.518PheLys: 2.518 ± 1.25
4.196PheLeu: 4.196 ± 1.723
0.839PheMet: 0.839 ± 0.419
2.937PheAsn: 2.937 ± 0.915
1.679PhePro: 1.679 ± 0.589
1.679PheGln: 1.679 ± 0.912
3.357PheArg: 3.357 ± 0.755
3.357PheSer: 3.357 ± 0.921
0.839PheThr: 0.839 ± 0.417
2.098PheVal: 2.098 ± 0.425
1.679PheTrp: 1.679 ± 0.911
3.357PheTyr: 3.357 ± 1.211
0.0PheXaa: 0.0 ± 0.0
Gly
2.098GlyAla: 2.098 ± 0.676
0.839GlyCys: 0.839 ± 0.75
4.196GlyAsp: 4.196 ± 1.003
3.777GlyGlu: 3.777 ± 1.911
2.098GlyPhe: 2.098 ± 0.891
5.036GlyGly: 5.036 ± 1.935
0.839GlyHis: 0.839 ± 0.386
3.357GlyIle: 3.357 ± 1.119
3.357GlyLys: 3.357 ± 0.698
2.518GlyLeu: 2.518 ± 1.142
0.42GlyMet: 0.42 ± 0.403
3.357GlyAsn: 3.357 ± 0.831
2.518GlyPro: 2.518 ± 1.095
3.777GlyGln: 3.777 ± 0.995
1.679GlyArg: 1.679 ± 1.013
4.616GlySer: 4.616 ± 1.903
4.196GlyThr: 4.196 ± 1.097
4.196GlyVal: 4.196 ± 1.448
0.839GlyTrp: 0.839 ± 0.641
1.259GlyTyr: 1.259 ± 0.828
0.0GlyXaa: 0.0 ± 0.0
His
0.839HisAla: 0.839 ± 0.684
0.42HisCys: 0.42 ± 0.443
1.259HisAsp: 1.259 ± 0.474
0.42HisGlu: 0.42 ± 0.351
1.679HisPhe: 1.679 ± 0.808
0.42HisGly: 0.42 ± 0.709
0.42HisHis: 0.42 ± 0.403
0.839HisIle: 0.839 ± 0.716
2.098HisLys: 2.098 ± 0.952
0.42HisLeu: 0.42 ± 0.321
0.0HisMet: 0.0 ± 0.468
0.42HisAsn: 0.42 ± 0.403
0.839HisPro: 0.839 ± 0.416
0.839HisGln: 0.839 ± 0.641
0.839HisArg: 0.839 ± 0.532
1.679HisSer: 1.679 ± 0.66
2.518HisThr: 2.518 ± 0.845
2.937HisVal: 2.937 ± 0.914
1.259HisTrp: 1.259 ± 0.796
1.259HisTyr: 1.259 ± 0.755
0.0HisXaa: 0.0 ± 0.0
Ile
2.098IleAla: 2.098 ± 0.917
0.0IleCys: 0.0 ± 0.0
6.295IleAsp: 6.295 ± 1.509
3.357IleGlu: 3.357 ± 1.114
2.098IlePhe: 2.098 ± 1.07
2.098IleGly: 2.098 ± 1.055
2.518IleHis: 2.518 ± 0.616
3.777IleIle: 3.777 ± 1.194
2.098IleLys: 2.098 ± 0.6
3.777IleLeu: 3.777 ± 1.194
1.259IleMet: 1.259 ± 0.933
0.42IleAsn: 0.42 ± 0.443
3.357IlePro: 3.357 ± 2.399
2.518IleGln: 2.518 ± 0.722
2.098IleArg: 2.098 ± 1.03
7.134IleSer: 7.134 ± 1.764
4.196IleThr: 4.196 ± 1.133
2.937IleVal: 2.937 ± 0.622
0.839IleTrp: 0.839 ± 0.753
1.679IleTyr: 1.679 ± 0.775
0.0IleXaa: 0.0 ± 0.0
Lys
3.777LysAla: 3.777 ± 0.654
2.098LysCys: 2.098 ± 0.743
2.098LysAsp: 2.098 ± 0.752
2.098LysGlu: 2.098 ± 0.632
2.937LysPhe: 2.937 ± 1.36
4.616LysGly: 4.616 ± 0.901
2.098LysHis: 2.098 ± 1.2
3.777LysIle: 3.777 ± 0.757
3.777LysLys: 3.777 ± 1.426
5.455LysLeu: 5.455 ± 1.905
0.42LysMet: 0.42 ± 0.342
3.357LysAsn: 3.357 ± 1.129
1.679LysPro: 1.679 ± 0.837
2.937LysGln: 2.937 ± 1.459
3.777LysArg: 3.777 ± 1.095
2.098LysSer: 2.098 ± 1.603
2.098LysThr: 2.098 ± 0.733
1.259LysVal: 1.259 ± 0.784
0.0LysTrp: 0.0 ± 0.0
1.679LysTyr: 1.679 ± 0.606
0.0LysXaa: 0.0 ± 0.0
Leu
5.036LeuAla: 5.036 ± 1.289
2.098LeuCys: 2.098 ± 1.326
4.196LeuAsp: 4.196 ± 1.225
5.875LeuGlu: 5.875 ± 1.875
6.295LeuPhe: 6.295 ± 2.021
6.714LeuGly: 6.714 ± 2.471
2.098LeuHis: 2.098 ± 0.712
4.196LeuIle: 4.196 ± 0.946
5.455LeuLys: 5.455 ± 2.321
10.491LeuLeu: 10.491 ± 4.244
1.259LeuMet: 1.259 ± 0.866
2.937LeuAsn: 2.937 ± 1.111
3.357LeuPro: 3.357 ± 1.082
7.554LeuGln: 7.554 ± 1.352
5.455LeuArg: 5.455 ± 1.189
7.554LeuSer: 7.554 ± 1.367
5.875LeuThr: 5.875 ± 2.34
6.295LeuVal: 6.295 ± 1.589
0.0LeuTrp: 0.0 ± 0.0
2.098LeuTyr: 2.098 ± 0.953
0.0LeuXaa: 0.0 ± 0.0
Met
2.518MetAla: 2.518 ± 1.103
0.42MetCys: 0.42 ± 0.478
0.0MetAsp: 0.0 ± 0.0
0.42MetGlu: 0.42 ± 0.403
0.42MetPhe: 0.42 ± 0.321
0.839MetGly: 0.839 ± 0.703
0.0MetHis: 0.0 ± 0.0
1.259MetIle: 1.259 ± 0.391
0.42MetLys: 0.42 ± 0.321
1.679MetLeu: 1.679 ± 0.571
0.0MetMet: 0.0 ± 0.0
0.839MetAsn: 0.839 ± 0.416
0.0MetPro: 0.0 ± 0.0
0.42MetGln: 0.42 ± 0.403
0.839MetArg: 0.839 ± 0.485
1.259MetSer: 1.259 ± 0.625
0.0MetThr: 0.0 ± 0.0
1.679MetVal: 1.679 ± 0.691
0.0MetTrp: 0.0 ± 0.0
0.42MetTyr: 0.42 ± 0.478
0.0MetXaa: 0.0 ± 0.0
Asn
2.518AsnAla: 2.518 ± 0.893
0.839AsnCys: 0.839 ± 0.392
2.098AsnAsp: 2.098 ± 0.806
3.357AsnGlu: 3.357 ± 1.021
1.259AsnPhe: 1.259 ± 0.885
2.518AsnGly: 2.518 ± 0.601
0.42AsnHis: 0.42 ± 0.321
4.616AsnIle: 4.616 ± 1.923
2.098AsnLys: 2.098 ± 0.998
4.616AsnLeu: 4.616 ± 1.661
0.0AsnMet: 0.0 ± 0.0
2.937AsnAsn: 2.937 ± 0.979
2.937AsnPro: 2.937 ± 1.111
2.518AsnGln: 2.518 ± 1.41
3.777AsnArg: 3.777 ± 1.059
3.357AsnSer: 3.357 ± 1.094
4.616AsnThr: 4.616 ± 1.12
4.196AsnVal: 4.196 ± 1.52
0.42AsnTrp: 0.42 ± 0.321
1.259AsnTyr: 1.259 ± 0.519
0.0AsnXaa: 0.0 ± 0.0
Pro
3.357ProAla: 3.357 ± 1.198
0.42ProCys: 0.42 ± 0.709
2.937ProAsp: 2.937 ± 1.171
4.616ProGlu: 4.616 ± 1.232
1.679ProPhe: 1.679 ± 0.837
2.098ProGly: 2.098 ± 0.619
0.42ProHis: 0.42 ± 0.351
1.679ProIle: 1.679 ± 0.433
4.616ProLys: 4.616 ± 1.099
6.295ProLeu: 6.295 ± 1.047
0.839ProMet: 0.839 ± 0.703
2.518ProAsn: 2.518 ± 0.787
4.616ProPro: 4.616 ± 1.651
2.098ProGln: 2.098 ± 0.689
3.777ProArg: 3.777 ± 1.214
5.875ProSer: 5.875 ± 2.002
5.875ProThr: 5.875 ± 1.95
2.098ProVal: 2.098 ± 1.756
0.0ProTrp: 0.0 ± 0.0
2.098ProTyr: 2.098 ± 1.302
0.0ProXaa: 0.0 ± 0.0
Gln
2.098GlnAla: 2.098 ± 0.674
0.839GlnCys: 0.839 ± 0.485
1.679GlnAsp: 1.679 ± 0.775
5.455GlnGlu: 5.455 ± 1.17
2.098GlnPhe: 2.098 ± 1.236
1.259GlnGly: 1.259 ± 0.391
0.42GlnHis: 0.42 ± 0.403
3.357GlnIle: 3.357 ± 0.76
0.839GlnLys: 0.839 ± 0.386
5.455GlnLeu: 5.455 ± 0.968
0.839GlnMet: 0.839 ± 0.455
2.098GlnAsn: 2.098 ± 0.945
4.196GlnPro: 4.196 ± 0.979
4.196GlnGln: 4.196 ± 1.442
2.937GlnArg: 2.937 ± 1.506
5.036GlnSer: 5.036 ± 0.916
2.518GlnThr: 2.518 ± 0.512
2.518GlnVal: 2.518 ± 1.023
0.42GlnTrp: 0.42 ± 0.321
1.679GlnTyr: 1.679 ± 0.603
0.0GlnXaa: 0.0 ± 0.0
Arg
4.616ArgAla: 4.616 ± 1.019
0.839ArgCys: 0.839 ± 0.75
1.679ArgAsp: 1.679 ± 0.922
3.357ArgGlu: 3.357 ± 0.598
3.357ArgPhe: 3.357 ± 0.76
3.357ArgGly: 3.357 ± 1.611
1.259ArgHis: 1.259 ± 0.705
0.839ArgIle: 0.839 ± 0.392
5.455ArgLys: 5.455 ± 0.759
6.295ArgLeu: 6.295 ± 1.742
0.42ArgMet: 0.42 ± 0.403
2.937ArgAsn: 2.937 ± 0.694
3.777ArgPro: 3.777 ± 0.757
1.259ArgGln: 1.259 ± 0.885
5.455ArgArg: 5.455 ± 1.367
4.196ArgSer: 4.196 ± 1.077
2.518ArgThr: 2.518 ± 1.13
4.616ArgVal: 4.616 ± 1.035
0.839ArgTrp: 0.839 ± 0.747
2.518ArgTyr: 2.518 ± 1.177
0.0ArgXaa: 0.0 ± 0.0
Ser
2.937SerAla: 2.937 ± 1.229
0.42SerCys: 0.42 ± 0.443
6.714SerAsp: 6.714 ± 1.207
5.036SerGlu: 5.036 ± 1.735
2.098SerPhe: 2.098 ± 0.993
4.616SerGly: 4.616 ± 1.615
0.839SerHis: 0.839 ± 0.392
3.357SerIle: 3.357 ± 0.688
4.616SerLys: 4.616 ± 0.945
9.232SerLeu: 9.232 ± 1.573
0.839SerMet: 0.839 ± 0.641
3.777SerAsn: 3.777 ± 1.087
4.196SerPro: 4.196 ± 1.675
2.098SerGln: 2.098 ± 0.676
6.714SerArg: 6.714 ± 1.302
4.616SerSer: 4.616 ± 1.657
5.875SerThr: 5.875 ± 2.682
5.455SerVal: 5.455 ± 1.073
0.42SerTrp: 0.42 ± 0.403
2.098SerTyr: 2.098 ± 0.752
0.0SerXaa: 0.0 ± 0.0
Thr
2.518ThrAla: 2.518 ± 0.79
1.259ThrCys: 1.259 ± 0.705
4.616ThrAsp: 4.616 ± 1.283
2.937ThrGlu: 2.937 ± 1.1
5.875ThrPhe: 5.875 ± 1.035
3.777ThrGly: 3.777 ± 1.483
1.259ThrHis: 1.259 ± 0.63
4.616ThrIle: 4.616 ± 2.217
3.357ThrLys: 3.357 ± 1.022
4.616ThrLeu: 4.616 ± 0.85
0.42ThrMet: 0.42 ± 0.351
2.518ThrAsn: 2.518 ± 1.249
5.455ThrPro: 5.455 ± 2.551
1.679ThrGln: 1.679 ± 0.305
3.777ThrArg: 3.777 ± 1.145
3.777ThrSer: 3.777 ± 1.832
5.036ThrThr: 5.036 ± 1.866
7.554ThrVal: 7.554 ± 1.369
0.839ThrTrp: 0.839 ± 0.417
1.259ThrTyr: 1.259 ± 0.474
0.0ThrXaa: 0.0 ± 0.0
Val
1.259ValAla: 1.259 ± 0.519
0.839ValCys: 0.839 ± 0.485
4.616ValAsp: 4.616 ± 1.242
5.455ValGlu: 5.455 ± 1.213
2.518ValPhe: 2.518 ± 0.96
6.295ValGly: 6.295 ± 1.586
2.518ValHis: 2.518 ± 1.467
3.357ValIle: 3.357 ± 1.938
2.518ValLys: 2.518 ± 1.099
2.937ValLeu: 2.937 ± 0.929
0.839ValMet: 0.839 ± 0.618
2.518ValAsn: 2.518 ± 1.021
4.196ValPro: 4.196 ± 1.614
5.036ValGln: 5.036 ± 1.301
2.518ValArg: 2.518 ± 0.853
4.616ValSer: 4.616 ± 0.945
4.196ValThr: 4.196 ± 1.129
0.839ValVal: 0.839 ± 0.386
1.259ValTrp: 1.259 ± 0.705
2.098ValTyr: 2.098 ± 0.615
0.0ValXaa: 0.0 ± 0.0
Trp
1.259TrpAla: 1.259 ± 0.519
0.0TrpCys: 0.0 ± 0.0
0.42TrpAsp: 0.42 ± 0.321
0.0TrpGlu: 0.0 ± 0.0
0.42TrpPhe: 0.42 ± 0.321
0.42TrpGly: 0.42 ± 0.321
1.259TrpHis: 1.259 ± 0.755
0.839TrpIle: 0.839 ± 0.641
0.839TrpLys: 0.839 ± 0.747
1.679TrpLeu: 1.679 ± 0.773
0.0TrpMet: 0.0 ± 0.0
0.42TrpAsn: 0.42 ± 0.342
0.0TrpPro: 0.0 ± 0.0
0.839TrpGln: 0.839 ± 0.386
1.679TrpArg: 1.679 ± 0.881
0.42TrpSer: 0.42 ± 0.403
1.259TrpThr: 1.259 ± 0.796
0.839TrpVal: 0.839 ± 0.417
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.098TyrAla: 2.098 ± 0.807
0.42TyrCys: 0.42 ± 0.342
2.098TyrAsp: 2.098 ± 0.665
2.518TyrGlu: 2.518 ± 1.467
1.679TyrPhe: 1.679 ± 0.694
1.679TyrGly: 1.679 ± 0.305
1.259TyrHis: 1.259 ± 0.474
1.259TyrIle: 1.259 ± 0.365
0.42TyrLys: 0.42 ± 0.321
3.357TyrLeu: 3.357 ± 1.129
0.42TyrMet: 0.42 ± 0.359
2.098TyrAsn: 2.098 ± 0.796
2.518TyrPro: 2.518 ± 1.034
1.259TyrGln: 1.259 ± 0.705
1.679TyrArg: 1.679 ± 0.922
1.679TyrSer: 1.679 ± 0.799
1.259TyrThr: 1.259 ± 0.978
1.679TyrVal: 1.679 ± 0.922
0.839TyrTrp: 0.839 ± 0.548
2.518TyrTyr: 2.518 ± 0.601
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2384 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski