Amino acid dipeptide frequency for Human papillomavirus type 90

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility, dipeptides that occur more often than expected are marked in red and underrepresented ones in blue in the table.
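As a rough illustration of how such frequencies could be computed, the sketch below counts overlapping adjacent residue pairs and reports them in per mille. It is not the database's own code: the function name `dipeptide_per_mille`, the toy sequences, the mapping of every non-standard letter to X (Xaa), and the choice not to count pairs across protein boundaries are all assumptions made for this example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes


def dipeptide_per_mille(proteins, alphabet=STANDARD + "X"):
    """Return {dipeptide: frequency in per mille} over all given proteins."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping adjacent pairs; pairs never span two proteins.
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


toy = ["MAAGT", "MAGU"]  # made-up sequences; U is a non-standard residue
freqs = dipeptide_per_mille(toy)

# Uniform expectation over the 400 standard dipeptides, in per mille.
expected = 1000.0 / 400
```

With 21 symbols the result always has 441 entries, most of them zero for a small proteome, which matches the shape of the table on this page.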

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
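The unit conversion can be sketched as follows, using two values from the table (AlaAla and AlaHis). The helper names are hypothetical, and the comparison against a flat 2.5‰ threshold (1000/400) is only an assumption about how over- and underrepresentation might be decided; the page does not state the exact rule behind the colouring.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_per_mille / 10.0


def compare_to_uniform(value_per_mille, n_dipeptides=400):
    """Label a value relative to the uniform expectation of 1000/n per mille."""
    expected = 1000.0 / n_dipeptides
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"


ala_ala = 4.691  # AlaAla, taken from the table
ala_his = 0.426  # AlaHis, taken from the table

ala_ala_fraction = per_mille_to_fraction(ala_ala)
ala_ala_label = compare_to_uniform(ala_ala)
ala_his_label = compare_to_uniform(ala_his)
```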

For more information, see the sequence space article on Wikipedia.

Rows: first residue; columns (second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 4.691 ± 1.194
AlaCys: 3.412 ± 1.27
AlaAsp: 2.985 ± 1.523
AlaGlu: 2.132 ± 0.619
AlaPhe: 2.559 ± 1.121
AlaGly: 6.397 ± 0.88
AlaHis: 0.426 ± 0.313
AlaIle: 2.985 ± 0.668
AlaLys: 4.691 ± 1.809
AlaLeu: 5.544 ± 0.962
AlaMet: 1.706 ± 0.583
AlaAsn: 1.279 ± 0.71
AlaPro: 5.544 ± 1.033
AlaGln: 3.838 ± 1.118
AlaArg: 2.985 ± 0.843
AlaSer: 6.823 ± 1.716
AlaThr: 4.264 ± 0.623
AlaVal: 2.985 ± 0.889
AlaTrp: 0.0 ± 0.0
AlaTyr: 2.985 ± 0.657
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 1.706 ± 0.669
CysCys: 1.706 ± 1.558
CysAsp: 0.426 ± 0.313
CysGlu: 0.0 ± 0.0
CysPhe: 0.426 ± 0.313
CysGly: 0.853 ± 0.489
CysHis: 0.426 ± 0.412
CysIle: 0.426 ± 0.313
CysLys: 2.132 ± 0.728
CysLeu: 2.559 ± 0.963
CysMet: 1.279 ± 0.71
CysAsn: 1.279 ± 0.719
CysPro: 2.559 ± 0.893
CysGln: 1.706 ± 0.871
CysArg: 1.706 ± 0.806
CysSer: 0.853 ± 0.555
CysThr: 1.706 ± 0.669
CysVal: 2.132 ± 0.912
CysTrp: 2.559 ± 0.846
CysTyr: 0.853 ± 0.555
CysXaa: 0.0 ± 0.0

Asp
AspAla: 5.544 ± 1.478
AspCys: 0.426 ± 0.326
AspAsp: 2.985 ± 1.026
AspGlu: 2.985 ± 1.083
AspPhe: 1.706 ± 0.494
AspGly: 4.691 ± 1.341
AspHis: 0.853 ± 0.388
AspIle: 5.544 ± 1.42
AspLys: 0.853 ± 0.419
AspLeu: 4.264 ± 1.173
AspMet: 0.853 ± 0.356
AspAsn: 2.985 ± 0.471
AspPro: 3.412 ± 0.861
AspGln: 3.838 ± 1.521
AspArg: 1.706 ± 0.821
AspSer: 6.397 ± 1.897
AspThr: 7.249 ± 1.472
AspVal: 2.985 ± 0.412
AspTrp: 1.279 ± 0.587
AspTyr: 1.279 ± 0.514
AspXaa: 0.0 ± 0.0

Glu
GluAla: 3.412 ± 0.999
GluCys: 0.426 ± 0.313
GluAsp: 4.691 ± 1.3
GluGlu: 6.823 ± 2.44
GluPhe: 0.853 ± 0.356
GluGly: 2.559 ± 1.112
GluHis: 1.279 ± 0.645
GluIle: 1.279 ± 0.627
GluLys: 1.279 ± 0.901
GluLeu: 5.117 ± 1.362
GluMet: 1.279 ± 0.836
GluAsn: 1.279 ± 0.779
GluPro: 4.264 ± 1.424
GluGln: 2.985 ± 1.108
GluArg: 1.279 ± 0.511
GluSer: 1.706 ± 0.669
GluThr: 2.985 ± 0.734
GluVal: 3.838 ± 0.569
GluTrp: 0.853 ± 0.627
GluTyr: 1.279 ± 0.511
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.985 ± 0.992
PheCys: 1.279 ± 0.511
PheAsp: 1.706 ± 0.838
PheGlu: 1.279 ± 0.712
PhePhe: 1.706 ± 0.579
PheGly: 2.132 ± 0.772
PheHis: 0.426 ± 0.412
PheIle: 2.559 ± 0.78
PheLys: 1.706 ± 0.871
PheLeu: 5.544 ± 0.849
PheMet: 0.426 ± 0.313
PheAsn: 0.426 ± 0.326
PhePro: 2.132 ± 0.584
PheGln: 1.279 ± 0.587
PheArg: 1.279 ± 0.511
PheSer: 1.279 ± 0.484
PheThr: 1.706 ± 0.63
PheVal: 2.132 ± 0.654
PheTrp: 1.279 ± 0.587
PheTyr: 0.853 ± 0.365
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 3.412 ± 0.901
GlyCys: 0.426 ± 0.326
GlyAsp: 8.102 ± 1.784
GlyGlu: 3.838 ± 0.786
GlyPhe: 1.279 ± 0.592
GlyGly: 3.838 ± 1.461
GlyHis: 2.985 ± 0.819
GlyIle: 1.279 ± 0.321
GlyLys: 2.559 ± 0.868
GlyLeu: 4.264 ± 0.613
GlyMet: 1.706 ± 0.515
GlyAsn: 3.838 ± 0.793
GlyPro: 2.559 ± 0.793
GlyGln: 3.412 ± 0.879
GlyArg: 2.559 ± 0.786
GlySer: 4.691 ± 2.031
GlyThr: 8.955 ± 1.653
GlyVal: 5.117 ± 1.279
GlyTrp: 0.853 ± 0.61
GlyTyr: 2.985 ± 1.049
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.279 ± 0.719
HisCys: 0.853 ± 0.566
HisAsp: 1.279 ± 0.56
HisGlu: 0.853 ± 0.388
HisPhe: 1.706 ± 0.452
HisGly: 2.559 ± 1.046
HisHis: 0.426 ± 0.313
HisIle: 0.853 ± 0.566
HisLys: 2.132 ± 1.435
HisLeu: 1.279 ± 0.914
HisMet: 0.426 ± 0.313
HisAsn: 0.0 ± 0.0
HisPro: 2.985 ± 0.412
HisGln: 0.853 ± 0.479
HisArg: 1.706 ± 1.047
HisSer: 0.853 ± 0.61
HisThr: 1.279 ± 1.189
HisVal: 2.559 ± 1.295
HisTrp: 0.853 ± 0.538
HisTyr: 0.426 ± 0.313
HisXaa: 0.0 ± 0.0

Ile
IleAla: 1.706 ± 0.769
IleCys: 1.279 ± 0.607
IleAsp: 2.132 ± 1.17
IleGlu: 2.559 ± 0.561
IlePhe: 1.279 ± 0.694
IleGly: 2.985 ± 1.072
IleHis: 0.426 ± 0.357
IleIle: 1.706 ± 0.579
IleLys: 0.426 ± 0.326
IleLeu: 2.132 ± 0.864
IleMet: 0.853 ± 0.856
IleAsn: 1.279 ± 0.645
IlePro: 2.985 ± 1.051
IleGln: 1.706 ± 0.515
IleArg: 2.559 ± 0.664
IleSer: 3.838 ± 1.474
IleThr: 2.559 ± 0.756
IleVal: 3.838 ± 0.951
IleTrp: 0.0 ± 0.0
IleTyr: 2.559 ± 0.561
IleXaa: 0.0 ± 0.0

Lys
LysAla: 2.985 ± 1.493
LysCys: 2.559 ± 1.408
LysAsp: 2.559 ± 0.782
LysGlu: 1.279 ± 0.879
LysPhe: 2.132 ± 0.654
LysGly: 2.132 ± 0.654
LysHis: 1.279 ± 0.496
LysIle: 2.132 ± 0.953
LysLys: 2.132 ± 0.641
LysLeu: 2.132 ± 0.708
LysMet: 0.0 ± 0.0
LysAsn: 1.706 ± 0.739
LysPro: 2.559 ± 1.524
LysGln: 1.706 ± 1.075
LysArg: 5.97 ± 1.139
LysSer: 2.985 ± 1.17
LysThr: 2.559 ± 0.695
LysVal: 2.985 ± 1.172
LysTrp: 0.0 ± 0.0
LysTyr: 1.279 ± 0.587
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 5.97 ± 1.515
LeuCys: 2.985 ± 1.478
LeuAsp: 5.97 ± 1.263
LeuGlu: 4.691 ± 0.798
LeuPhe: 3.412 ± 0.586
LeuGly: 6.397 ± 0.921
LeuHis: 2.985 ± 1.155
LeuIle: 2.985 ± 1.073
LeuLys: 5.117 ± 1.063
LeuLeu: 8.102 ± 1.428
LeuMet: 1.706 ± 0.708
LeuAsn: 2.132 ± 0.789
LeuPro: 3.838 ± 1.409
LeuGln: 7.249 ± 1.525
LeuArg: 4.264 ± 1.318
LeuSer: 5.117 ± 0.912
LeuThr: 3.838 ± 1.63
LeuVal: 3.838 ± 1.536
LeuTrp: 1.279 ± 0.496
LeuTyr: 5.117 ± 0.769
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.132 ± 0.97
MetCys: 0.853 ± 0.388
MetAsp: 0.426 ± 0.326
MetGlu: 0.853 ± 0.388
MetPhe: 0.853 ± 0.652
MetGly: 1.279 ± 0.587
MetHis: 1.706 ± 0.911
MetIle: 0.426 ± 0.326
MetLys: 0.0 ± 0.0
MetLeu: 1.706 ± 0.776
MetMet: 0.426 ± 0.565
MetAsn: 0.426 ± 0.326
MetPro: 0.0 ± 0.0
MetGln: 1.279 ± 0.729
MetArg: 0.426 ± 0.326
MetSer: 1.279 ± 0.587
MetThr: 2.132 ± 0.569
MetVal: 1.706 ± 0.871
MetTrp: 0.853 ± 0.538
MetTyr: 0.426 ± 0.428
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.559 ± 1.013
AsnCys: 0.853 ± 0.627
AsnAsp: 0.853 ± 0.489
AsnGlu: 0.853 ± 0.689
AsnPhe: 1.279 ± 0.607
AsnGly: 2.132 ± 0.584
AsnHis: 0.426 ± 0.428
AsnIle: 2.132 ± 0.834
AsnLys: 2.985 ± 1.542
AsnLeu: 1.706 ± 0.906
AsnMet: 0.853 ± 0.652
AsnAsn: 0.853 ± 0.356
AsnPro: 3.412 ± 0.729
AsnGln: 1.279 ± 0.587
AsnArg: 2.559 ± 0.905
AsnSer: 0.853 ± 0.627
AsnThr: 2.985 ± 0.942
AsnVal: 1.706 ± 0.66
AsnTrp: 0.853 ± 0.627
AsnTyr: 0.0 ± 0.0
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 5.97 ± 1.098
ProCys: 0.853 ± 0.519
ProAsp: 5.97 ± 1.932
ProGlu: 1.279 ± 0.647
ProPhe: 0.853 ± 0.489
ProGly: 2.985 ± 1.072
ProHis: 1.706 ± 0.897
ProIle: 2.985 ± 0.889
ProLys: 2.559 ± 0.561
ProLeu: 9.382 ± 1.569
ProMet: 0.426 ± 0.481
ProAsn: 1.706 ± 0.647
ProPro: 8.102 ± 1.921
ProGln: 1.279 ± 0.674
ProArg: 2.985 ± 1.868
ProSer: 5.117 ± 2.19
ProThr: 7.249 ± 3.513
ProVal: 4.691 ± 1.716
ProTrp: 0.426 ± 0.428
ProTyr: 2.985 ± 1.41
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 3.412 ± 0.79
GlnCys: 1.706 ± 0.787
GlnAsp: 4.264 ± 0.72
GlnGlu: 3.412 ± 1.323
GlnPhe: 2.132 ± 0.778
GlnGly: 1.279 ± 0.612
GlnHis: 1.706 ± 0.776
GlnIle: 0.853 ± 0.388
GlnLys: 1.706 ± 0.739
GlnLeu: 5.97 ± 1.623
GlnMet: 1.279 ± 0.607
GlnAsn: 0.0 ± 0.0
GlnPro: 3.838 ± 1.56
GlnGln: 2.132 ± 0.984
GlnArg: 3.838 ± 1.295
GlnSer: 2.132 ± 0.729
GlnThr: 3.412 ± 0.586
GlnVal: 4.264 ± 1.082
GlnTrp: 0.853 ± 0.419
GlnTyr: 1.279 ± 0.511
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 3.412 ± 0.651
ArgCys: 1.279 ± 0.952
ArgAsp: 1.706 ± 0.773
ArgGlu: 1.706 ± 1.04
ArgPhe: 3.838 ± 0.481
ArgGly: 3.412 ± 0.724
ArgHis: 3.412 ± 1.092
ArgIle: 1.279 ± 0.667
ArgLys: 3.412 ± 0.783
ArgLeu: 8.102 ± 1.316
ArgMet: 0.0 ± 0.0
ArgAsn: 0.853 ± 0.489
ArgPro: 3.838 ± 1.129
ArgGln: 2.985 ± 1.144
ArgArg: 3.412 ± 1.878
ArgSer: 4.264 ± 1.153
ArgThr: 2.985 ± 0.965
ArgVal: 2.985 ± 1.208
ArgTrp: 0.853 ± 0.388
ArgTyr: 2.559 ± 0.781
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 4.691 ± 1.469
SerCys: 0.853 ± 0.565
SerAsp: 4.264 ± 1.248
SerGlu: 3.838 ± 1.312
SerPhe: 1.279 ± 0.532
SerGly: 6.823 ± 1.024
SerHis: 1.279 ± 0.712
SerIle: 2.132 ± 1.177
SerLys: 1.706 ± 0.672
SerLeu: 4.264 ± 1.153
SerMet: 0.853 ± 0.634
SerAsn: 3.412 ± 0.822
SerPro: 2.985 ± 0.577
SerGln: 2.559 ± 0.752
SerArg: 4.691 ± 0.825
SerSer: 8.529 ± 1.664
SerThr: 9.382 ± 2.692
SerVal: 4.691 ± 2.194
SerTrp: 0.0 ± 0.0
SerTyr: 1.706 ± 0.983
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 4.264 ± 1.6
ThrCys: 3.412 ± 0.738
ThrAsp: 3.412 ± 1.413
ThrGlu: 4.691 ± 0.632
ThrPhe: 2.559 ± 0.456
ThrGly: 7.676 ± 2.278
ThrHis: 0.853 ± 0.856
ThrIle: 1.706 ± 0.672
ThrLys: 1.706 ± 0.956
ThrLeu: 7.249 ± 1.644
ThrMet: 2.559 ± 0.846
ThrAsn: 4.691 ± 1.211
ThrPro: 8.529 ± 4.326
ThrGln: 2.985 ± 0.73
ThrArg: 3.412 ± 1.2
ThrSer: 5.117 ± 1.426
ThrThr: 5.97 ± 1.203
ThrVal: 6.397 ± 1.154
ThrTrp: 1.279 ± 1.285
ThrTyr: 2.985 ± 0.719
ThrXaa: 0.0 ± 0.0

Val
ValAla: 2.985 ± 1.453
ValCys: 1.279 ± 0.748
ValAsp: 5.117 ± 1.223
ValGlu: 3.412 ± 1.308
ValPhe: 2.132 ± 1.01
ValGly: 3.838 ± 0.683
ValHis: 1.706 ± 0.867
ValIle: 3.412 ± 0.876
ValLys: 1.279 ± 0.56
ValLeu: 3.838 ± 1.132
ValMet: 1.279 ± 0.581
ValAsn: 0.853 ± 0.419
ValPro: 5.117 ± 1.193
ValGln: 3.838 ± 0.793
ValArg: 4.691 ± 0.625
ValSer: 5.97 ± 1.839
ValThr: 7.676 ± 1.936
ValVal: 6.397 ± 1.025
ValTrp: 1.279 ± 0.514
ValTyr: 1.706 ± 0.703
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 2.132 ± 0.827
TrpCys: 0.0 ± 0.0
TrpAsp: 0.426 ± 0.428
TrpGlu: 0.853 ± 0.538
TrpPhe: 0.426 ± 0.313
TrpGly: 1.706 ± 0.271
TrpHis: 0.853 ± 0.628
TrpIle: 0.853 ± 0.627
TrpLys: 2.132 ± 0.569
TrpLeu: 0.853 ± 0.356
TrpMet: 0.0 ± 0.0
TrpAsn: 0.426 ± 0.326
TrpPro: 0.426 ± 0.357
TrpGln: 0.426 ± 0.412
TrpArg: 2.132 ± 1.004
TrpSer: 0.853 ± 0.388
TrpThr: 0.853 ± 0.856
TrpVal: 0.853 ± 0.627
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.426 ± 0.313
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.985 ± 0.808
TyrCys: 0.853 ± 0.489
TyrAsp: 2.559 ± 0.512
TyrGlu: 2.132 ± 0.782
TyrPhe: 1.706 ± 0.271
TyrGly: 2.985 ± 0.888
TyrHis: 0.0 ± 0.0
TyrIle: 1.279 ± 0.382
TyrLys: 2.559 ± 0.693
TyrLeu: 2.985 ± 0.667
TyrMet: 0.853 ± 0.538
TyrAsn: 1.706 ± 0.906
TyrPro: 0.853 ± 0.631
TyrGln: 2.132 ± 0.944
TyrArg: 2.132 ± 0.727
TyrSer: 1.279 ± 0.785
TyrThr: 1.706 ± 1.164
TyrVal: 1.706 ± 0.773
TyrTrp: 1.279 ± 0.511
TyrTyr: 1.279 ± 0.592
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2346 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski