Amino acid dipepetide frequency for Human papillomavirus type 2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.898AlaAla: 4.898 ± 0.64
2.449AlaCys: 2.449 ± 0.758
5.306AlaAsp: 5.306 ± 1.438
1.633AlaGlu: 1.633 ± 0.603
2.857AlaPhe: 2.857 ± 0.93
4.082AlaGly: 4.082 ± 1.331
0.816AlaHis: 0.816 ± 0.416
2.857AlaIle: 2.857 ± 0.471
2.449AlaLys: 2.449 ± 0.966
4.49AlaLeu: 4.49 ± 1.053
1.633AlaMet: 1.633 ± 0.575
2.449AlaAsn: 2.449 ± 0.966
4.082AlaPro: 4.082 ± 2.22
2.857AlaGln: 2.857 ± 1.009
2.041AlaArg: 2.041 ± 0.783
6.939AlaSer: 6.939 ± 1.425
5.306AlaThr: 5.306 ± 1.131
3.673AlaVal: 3.673 ± 0.599
0.408AlaTrp: 0.408 ± 0.398
2.041AlaTyr: 2.041 ± 0.716
0.0AlaXaa: 0.0 ± 0.0
Cys
2.449CysAla: 2.449 ± 0.978
0.408CysCys: 0.408 ± 0.607
0.408CysAsp: 0.408 ± 0.607
0.0CysGlu: 0.0 ± 0.0
0.816CysPhe: 0.816 ± 0.442
2.449CysGly: 2.449 ± 1.705
0.408CysHis: 0.408 ± 0.532
0.816CysIle: 0.816 ± 0.686
2.857CysLys: 2.857 ± 1.48
2.041CysLeu: 2.041 ± 1.458
0.816CysMet: 0.816 ± 0.545
0.408CysAsn: 0.408 ± 0.367
2.449CysPro: 2.449 ± 0.861
1.633CysGln: 1.633 ± 0.696
0.816CysArg: 0.816 ± 0.686
0.816CysSer: 0.816 ± 0.442
1.633CysThr: 1.633 ± 0.86
1.224CysVal: 1.224 ± 0.544
1.224CysTrp: 1.224 ± 0.413
1.224CysTyr: 1.224 ± 1.117
0.0CysXaa: 0.0 ± 0.0
Asp
3.673AspAla: 3.673 ± 1.629
0.408AspCys: 0.408 ± 0.367
2.449AspAsp: 2.449 ± 0.937
4.898AspGlu: 4.898 ± 1.784
1.633AspPhe: 1.633 ± 0.253
4.082AspGly: 4.082 ± 1.269
0.408AspHis: 0.408 ± 0.343
4.49AspIle: 4.49 ± 1.876
0.816AspLys: 0.816 ± 0.454
6.122AspLeu: 6.122 ± 1.443
1.224AspMet: 1.224 ± 0.436
1.633AspAsn: 1.633 ± 0.782
4.082AspPro: 4.082 ± 1.587
1.224AspGln: 1.224 ± 0.413
1.224AspArg: 1.224 ± 0.683
6.122AspSer: 6.122 ± 1.596
5.714AspThr: 5.714 ± 2.064
4.898AspVal: 4.898 ± 1.191
1.224AspTrp: 1.224 ± 0.669
1.224AspTyr: 1.224 ± 0.719
0.0AspXaa: 0.0 ± 0.0
Glu
3.265GluAla: 3.265 ± 0.935
1.224GluCys: 1.224 ± 1.052
5.306GluAsp: 5.306 ± 2.171
7.347GluGlu: 7.347 ± 2.464
0.816GluPhe: 0.816 ± 0.442
1.633GluGly: 1.633 ± 0.987
0.408GluHis: 0.408 ± 0.367
2.041GluIle: 2.041 ± 1.127
1.633GluLys: 1.633 ± 0.723
4.082GluLeu: 4.082 ± 1.33
0.816GluMet: 0.816 ± 0.454
2.449GluAsn: 2.449 ± 0.836
5.714GluPro: 5.714 ± 1.983
4.49GluGln: 4.49 ± 1.289
1.633GluArg: 1.633 ± 1.074
2.449GluSer: 2.449 ± 0.825
4.082GluThr: 4.082 ± 1.098
3.265GluVal: 3.265 ± 0.849
0.408GluTrp: 0.408 ± 0.343
1.633GluTyr: 1.633 ± 0.751
0.0GluXaa: 0.0 ± 0.0
Phe
2.041PheAla: 2.041 ± 0.682
0.408PheCys: 0.408 ± 0.343
1.633PheAsp: 1.633 ± 0.959
1.224PheGlu: 1.224 ± 0.65
2.449PhePhe: 2.449 ± 0.947
3.265PheGly: 3.265 ± 0.813
1.633PheHis: 1.633 ± 0.511
2.041PheIle: 2.041 ± 0.488
2.449PheLys: 2.449 ± 1.206
5.714PheLeu: 5.714 ± 1.083
0.816PheMet: 0.816 ± 0.545
1.224PheAsn: 1.224 ± 0.719
1.224PhePro: 1.224 ± 0.544
1.224PheGln: 1.224 ± 0.372
1.633PheArg: 1.633 ± 0.987
1.224PheSer: 1.224 ± 0.719
2.041PheThr: 2.041 ± 0.69
3.265PheVal: 3.265 ± 0.978
0.816PheTrp: 0.816 ± 0.416
1.633PheTyr: 1.633 ± 0.782
0.0PheXaa: 0.0 ± 0.0
Gly
4.082GlyAla: 4.082 ± 1.119
1.224GlyCys: 1.224 ± 0.413
6.122GlyAsp: 6.122 ± 0.944
3.673GlyGlu: 3.673 ± 1.354
2.449GlyPhe: 2.449 ± 0.547
5.306GlyGly: 5.306 ± 1.165
3.265GlyHis: 3.265 ± 0.72
1.224GlyIle: 1.224 ± 0.871
2.449GlyLys: 2.449 ± 1.071
3.673GlyLeu: 3.673 ± 0.634
2.041GlyMet: 2.041 ± 0.408
3.265GlyAsn: 3.265 ± 1.13
2.449GlyPro: 2.449 ± 1.273
2.857GlyGln: 2.857 ± 1.183
4.49GlyArg: 4.49 ± 1.237
6.531GlySer: 6.531 ± 1.845
6.939GlyThr: 6.939 ± 1.965
5.306GlyVal: 5.306 ± 0.924
0.408GlyTrp: 0.408 ± 0.343
2.041GlyTyr: 2.041 ± 0.443
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
1.224HisCys: 1.224 ± 1.38
1.224HisAsp: 1.224 ± 0.399
0.408HisGlu: 0.408 ± 0.345
2.041HisPhe: 2.041 ± 0.884
2.041HisGly: 2.041 ± 0.56
0.408HisHis: 0.408 ± 0.398
0.408HisIle: 0.408 ± 0.343
1.633HisLys: 1.633 ± 0.853
1.224HisLeu: 1.224 ± 0.66
1.224HisMet: 1.224 ± 0.544
1.224HisAsn: 1.224 ± 0.841
2.857HisPro: 2.857 ± 0.602
0.816HisGln: 0.816 ± 0.624
0.408HisArg: 0.408 ± 0.345
0.408HisSer: 0.408 ± 0.343
2.041HisThr: 2.041 ± 0.601
2.041HisVal: 2.041 ± 0.488
1.633HisTrp: 1.633 ± 0.89
1.633HisTyr: 1.633 ± 0.703
0.0HisXaa: 0.0 ± 0.0
Ile
0.816IleAla: 0.816 ± 0.649
0.816IleCys: 0.816 ± 0.6
1.633IleAsp: 1.633 ± 0.962
2.449IleGlu: 2.449 ± 0.471
1.633IlePhe: 1.633 ± 0.835
2.449IleGly: 2.449 ± 1.208
2.041IleHis: 2.041 ± 0.875
1.633IleIle: 1.633 ± 0.968
1.633IleLys: 1.633 ± 0.826
1.224IleLeu: 1.224 ± 0.69
0.816IleMet: 0.816 ± 0.428
0.816IleAsn: 0.816 ± 0.391
4.898IlePro: 4.898 ± 1.377
0.816IleGln: 0.816 ± 0.686
0.816IleArg: 0.816 ± 0.454
3.673IleSer: 3.673 ± 0.654
3.265IleThr: 3.265 ± 0.716
4.49IleVal: 4.49 ± 1.648
0.408IleTrp: 0.408 ± 0.532
1.224IleTyr: 1.224 ± 0.768
0.0IleXaa: 0.0 ± 0.0
Lys
2.449LysAla: 2.449 ± 1.071
2.449LysCys: 2.449 ± 1.082
2.041LysAsp: 2.041 ± 1.031
1.633LysGlu: 1.633 ± 1.583
2.041LysPhe: 2.041 ± 0.727
2.041LysGly: 2.041 ± 0.752
1.633LysHis: 1.633 ± 0.884
1.224LysIle: 1.224 ± 0.372
2.449LysLys: 2.449 ± 1.071
2.449LysLeu: 2.449 ± 0.829
0.816LysMet: 0.816 ± 0.733
1.633LysAsn: 1.633 ± 1.372
2.449LysPro: 2.449 ± 0.962
1.224LysGln: 1.224 ± 0.685
6.531LysArg: 6.531 ± 1.06
2.449LysSer: 2.449 ± 1.249
2.449LysThr: 2.449 ± 0.919
2.449LysVal: 2.449 ± 1.307
0.816LysTrp: 0.816 ± 0.557
1.224LysTyr: 1.224 ± 0.669
0.0LysXaa: 0.0 ± 0.0
Leu
3.673LeuAla: 3.673 ± 1.005
3.265LeuCys: 3.265 ± 1.55
5.714LeuAsp: 5.714 ± 1.159
4.49LeuGlu: 4.49 ± 1.816
4.898LeuPhe: 4.898 ± 0.9
3.673LeuGly: 3.673 ± 0.944
1.633LeuHis: 1.633 ± 1.1
2.857LeuIle: 2.857 ± 0.913
4.082LeuLys: 4.082 ± 1.543
8.98LeuLeu: 8.98 ± 1.972
1.633LeuMet: 1.633 ± 0.562
2.449LeuAsn: 2.449 ± 1.19
3.265LeuPro: 3.265 ± 1.379
5.306LeuGln: 5.306 ± 2.181
7.755LeuArg: 7.755 ± 0.985
4.49LeuSer: 4.49 ± 1.286
6.531LeuThr: 6.531 ± 1.408
3.673LeuVal: 3.673 ± 1.499
1.633LeuTrp: 1.633 ± 0.665
5.714LeuTyr: 5.714 ± 1.045
0.0LeuXaa: 0.0 ± 0.0
Met
1.224MetAla: 1.224 ± 1.1
0.816MetCys: 0.816 ± 0.416
1.224MetAsp: 1.224 ± 1.1
2.041MetGlu: 2.041 ± 0.417
0.816MetPhe: 0.816 ± 0.733
1.633MetGly: 1.633 ± 0.486
0.816MetHis: 0.816 ± 0.852
0.816MetIle: 0.816 ± 0.442
0.0MetLys: 0.0 ± 0.0
2.041MetLeu: 2.041 ± 0.878
0.0MetMet: 0.0 ± 0.0
0.408MetAsn: 0.408 ± 0.532
0.816MetPro: 0.816 ± 0.428
0.816MetGln: 0.816 ± 0.454
0.408MetArg: 0.408 ± 0.532
3.673MetSer: 3.673 ± 0.879
0.408MetThr: 0.408 ± 0.398
2.041MetVal: 2.041 ± 1.06
0.408MetTrp: 0.408 ± 0.398
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
1.633AsnAla: 1.633 ± 0.574
2.041AsnCys: 2.041 ± 0.962
0.816AsnAsp: 0.816 ± 0.416
0.408AsnGlu: 0.408 ± 0.367
0.816AsnPhe: 0.816 ± 0.733
2.041AsnGly: 2.041 ± 0.727
0.816AsnHis: 0.816 ± 0.788
1.633AsnIle: 1.633 ± 0.991
3.673AsnLys: 3.673 ± 1.142
0.816AsnLeu: 0.816 ± 0.686
0.816AsnMet: 0.816 ± 0.416
1.633AsnAsn: 1.633 ± 0.781
2.041AsnPro: 2.041 ± 0.83
0.816AsnGln: 0.816 ± 0.557
4.082AsnArg: 4.082 ± 0.83
1.224AsnSer: 1.224 ± 1.029
2.857AsnThr: 2.857 ± 0.905
2.041AsnVal: 2.041 ± 0.981
0.408AsnTrp: 0.408 ± 0.343
0.816AsnTyr: 0.816 ± 0.461
0.0AsnXaa: 0.0 ± 0.0
Pro
5.714ProAla: 5.714 ± 2.105
0.408ProCys: 0.408 ± 0.532
5.714ProAsp: 5.714 ± 2.01
2.857ProGlu: 2.857 ± 1.151
2.041ProPhe: 2.041 ± 0.948
4.49ProGly: 4.49 ± 1.365
1.633ProHis: 1.633 ± 0.737
2.857ProIle: 2.857 ± 0.471
3.265ProLys: 3.265 ± 0.656
8.571ProLeu: 8.571 ± 1.596
0.408ProMet: 0.408 ± 0.345
1.633ProAsn: 1.633 ± 0.856
8.98ProPro: 8.98 ± 2.117
3.265ProGln: 3.265 ± 1.432
4.49ProArg: 4.49 ± 1.847
3.673ProSer: 3.673 ± 1.723
5.306ProThr: 5.306 ± 2.039
4.898ProVal: 4.898 ± 1.649
0.0ProTrp: 0.0 ± 0.0
2.041ProTyr: 2.041 ± 1.833
0.0ProXaa: 0.0 ± 0.0
Gln
4.082GlnAla: 4.082 ± 1.723
0.408GlnCys: 0.408 ± 0.343
1.633GlnAsp: 1.633 ± 0.603
2.041GlnGlu: 2.041 ± 1.107
2.857GlnPhe: 2.857 ± 1.184
1.633GlnGly: 1.633 ± 0.978
0.816GlnHis: 0.816 ± 0.454
0.816GlnIle: 0.816 ± 0.454
0.816GlnLys: 0.816 ± 0.461
4.898GlnLeu: 4.898 ± 1.463
1.633GlnMet: 1.633 ± 0.833
0.816GlnAsn: 0.816 ± 0.557
2.449GlnPro: 2.449 ± 0.93
2.449GlnGln: 2.449 ± 0.914
2.857GlnArg: 2.857 ± 0.905
2.857GlnSer: 2.857 ± 1.258
3.265GlnThr: 3.265 ± 0.634
2.857GlnVal: 2.857 ± 0.96
0.408GlnTrp: 0.408 ± 0.343
1.633GlnTyr: 1.633 ± 0.683
0.0GlnXaa: 0.0 ± 0.0
Arg
4.49ArgAla: 4.49 ± 1.246
1.633ArgCys: 1.633 ± 1.112
1.633ArgAsp: 1.633 ± 0.695
2.449ArgGlu: 2.449 ± 0.872
1.633ArgPhe: 1.633 ± 0.778
4.082ArgGly: 4.082 ± 1.175
2.041ArgHis: 2.041 ± 0.809
0.0ArgIle: 0.0 ± 0.0
3.673ArgLys: 3.673 ± 0.958
6.122ArgLeu: 6.122 ± 0.682
0.0ArgMet: 0.0 ± 0.346
0.816ArgAsn: 0.816 ± 0.545
4.898ArgPro: 4.898 ± 1.774
2.857ArgGln: 2.857 ± 1.259
4.49ArgArg: 4.49 ± 1.662
3.265ArgSer: 3.265 ± 0.999
4.898ArgThr: 4.898 ± 1.592
6.122ArgVal: 6.122 ± 2.0
1.224ArgTrp: 1.224 ± 0.595
1.224ArgTyr: 1.224 ± 0.413
0.0ArgXaa: 0.0 ± 0.0
Ser
4.898SerAla: 4.898 ± 2.044
1.633SerCys: 1.633 ± 0.991
3.673SerAsp: 3.673 ± 0.931
4.49SerGlu: 4.49 ± 1.198
1.224SerPhe: 1.224 ± 0.372
6.531SerGly: 6.531 ± 2.343
1.633SerHis: 1.633 ± 0.584
3.265SerIle: 3.265 ± 0.85
3.673SerLys: 3.673 ± 1.512
5.306SerLeu: 5.306 ± 1.425
2.857SerMet: 2.857 ± 1.505
2.449SerAsn: 2.449 ± 0.825
4.082SerPro: 4.082 ± 1.284
2.041SerGln: 2.041 ± 1.049
3.673SerArg: 3.673 ± 1.154
9.796SerSer: 9.796 ± 1.352
6.939SerThr: 6.939 ± 2.04
6.122SerVal: 6.122 ± 1.676
2.041SerTrp: 2.041 ± 0.903
2.041SerTyr: 2.041 ± 1.016
0.0SerXaa: 0.0 ± 0.0
Thr
4.49ThrAla: 4.49 ± 0.849
2.857ThrCys: 2.857 ± 1.108
4.898ThrAsp: 4.898 ± 0.933
5.306ThrGlu: 5.306 ± 0.747
2.857ThrPhe: 2.857 ± 1.183
6.939ThrGly: 6.939 ± 1.914
0.408ThrHis: 0.408 ± 0.343
2.449ThrIle: 2.449 ± 1.072
0.0ThrLys: 0.0 ± 0.0
9.388ThrLeu: 9.388 ± 2.64
0.408ThrMet: 0.408 ± 0.343
3.673ThrAsn: 3.673 ± 2.09
7.347ThrPro: 7.347 ± 2.427
2.449ThrGln: 2.449 ± 0.825
3.265ThrArg: 3.265 ± 1.287
6.122ThrSer: 6.122 ± 1.752
4.49ThrThr: 4.49 ± 1.509
6.939ThrVal: 6.939 ± 0.447
1.224ThrTrp: 1.224 ± 0.98
0.816ThrTyr: 0.816 ± 0.461
0.0ThrXaa: 0.0 ± 0.0
Val
4.082ValAla: 4.082 ± 0.996
0.816ValCys: 0.816 ± 0.624
3.265ValAsp: 3.265 ± 0.785
5.306ValGlu: 5.306 ± 0.636
2.857ValPhe: 2.857 ± 0.905
5.714ValGly: 5.714 ± 0.718
2.041ValHis: 2.041 ± 0.783
2.449ValIle: 2.449 ± 0.931
1.633ValLys: 1.633 ± 0.253
4.082ValLeu: 4.082 ± 1.789
1.633ValMet: 1.633 ± 0.828
0.816ValAsn: 0.816 ± 0.461
5.714ValPro: 5.714 ± 1.077
3.673ValGln: 3.673 ± 1.245
4.49ValArg: 4.49 ± 0.801
9.388ValSer: 9.388 ± 1.835
5.306ValThr: 5.306 ± 1.568
3.673ValVal: 3.673 ± 1.269
0.816ValTrp: 0.816 ± 1.063
4.082ValTyr: 4.082 ± 1.451
0.0ValXaa: 0.0 ± 0.0
Trp
1.633TrpAla: 1.633 ± 0.89
0.408TrpCys: 0.408 ± 0.532
0.816TrpAsp: 0.816 ± 0.461
0.816TrpGlu: 0.816 ± 0.649
0.408TrpPhe: 0.408 ± 0.343
2.041TrpGly: 2.041 ± 0.554
1.224TrpHis: 1.224 ± 0.936
0.816TrpIle: 0.816 ± 0.686
1.633TrpLys: 1.633 ± 0.987
1.224TrpLeu: 1.224 ± 0.574
0.0TrpMet: 0.0 ± 0.0
0.408TrpAsn: 0.408 ± 0.367
0.408TrpPro: 0.408 ± 0.345
0.0TrpGln: 0.0 ± 0.0
0.816TrpArg: 0.816 ± 0.6
1.633TrpSer: 1.633 ± 0.724
0.816TrpThr: 0.816 ± 0.649
0.408TrpVal: 0.408 ± 0.343
0.0TrpTrp: 0.0 ± 0.0
0.816TrpTyr: 0.816 ± 0.442
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.673TyrAla: 3.673 ± 1.722
0.0TyrCys: 0.0 ± 0.0
2.041TyrAsp: 2.041 ± 0.725
1.224TyrGlu: 1.224 ± 0.399
0.816TyrPhe: 0.816 ± 0.391
3.673TyrGly: 3.673 ± 1.307
0.816TyrHis: 0.816 ± 0.428
2.857TyrIle: 2.857 ± 0.982
1.633TyrLys: 1.633 ± 0.833
3.265TyrLeu: 3.265 ± 1.254
0.408TyrMet: 0.408 ± 0.398
1.224TyrAsn: 1.224 ± 0.413
2.041TyrPro: 2.041 ± 0.913
0.408TyrGln: 0.408 ± 0.367
2.041TyrArg: 2.041 ± 0.769
1.633TyrSer: 1.633 ± 0.629
2.041TyrThr: 2.041 ± 0.811
2.449TyrVal: 2.449 ± 0.931
0.816TyrTrp: 0.816 ± 0.416
2.449TyrTyr: 2.449 ± 0.825
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2451 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski