Amino acid dipepetide frequency for Human papillomavirus type 128

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.066AlaAla: 2.066 ± 0.975
0.413AlaCys: 0.413 ± 0.487
2.893AlaAsp: 2.893 ± 0.667
4.959AlaGlu: 4.959 ± 1.541
3.719AlaPhe: 3.719 ± 1.693
2.479AlaGly: 2.479 ± 1.386
0.826AlaHis: 0.826 ± 0.537
2.066AlaIle: 2.066 ± 0.699
2.479AlaLys: 2.479 ± 0.821
3.719AlaLeu: 3.719 ± 0.619
0.0AlaMet: 0.0 ± 0.0
2.479AlaAsn: 2.479 ± 0.885
3.306AlaPro: 3.306 ± 0.891
1.653AlaGln: 1.653 ± 0.58
2.066AlaArg: 2.066 ± 0.621
4.959AlaSer: 4.959 ± 1.384
3.719AlaThr: 3.719 ± 1.017
4.545AlaVal: 4.545 ± 1.502
0.413AlaTrp: 0.413 ± 0.348
0.413AlaTyr: 0.413 ± 0.448
0.0AlaXaa: 0.0 ± 0.0
Cys
0.826CysAla: 0.826 ± 0.403
2.066CysCys: 2.066 ± 1.2
1.24CysAsp: 1.24 ± 0.556
1.24CysGlu: 1.24 ± 0.806
1.24CysPhe: 1.24 ± 1.011
0.0CysGly: 0.0 ± 0.0
0.413CysHis: 0.413 ± 0.348
1.653CysIle: 1.653 ± 0.863
1.653CysLys: 1.653 ± 0.787
1.653CysLeu: 1.653 ± 1.948
0.826CysMet: 0.826 ± 0.594
1.653CysAsn: 1.653 ± 1.094
1.24CysPro: 1.24 ± 0.8
1.24CysGln: 1.24 ± 0.997
0.826CysArg: 0.826 ± 0.672
0.826CysSer: 0.826 ± 0.607
1.653CysThr: 1.653 ± 1.144
0.413CysVal: 0.413 ± 0.348
0.826CysTrp: 0.826 ± 0.515
0.826CysTyr: 0.826 ± 0.493
0.0CysXaa: 0.0 ± 0.0
Asp
3.306AspAla: 3.306 ± 0.611
1.653AspCys: 1.653 ± 0.591
6.198AspAsp: 6.198 ± 1.655
6.612AspGlu: 6.612 ± 0.725
3.306AspPhe: 3.306 ± 0.745
2.066AspGly: 2.066 ± 1.059
0.826AspHis: 0.826 ± 0.687
4.132AspIle: 4.132 ± 0.804
1.653AspLys: 1.653 ± 1.076
7.025AspLeu: 7.025 ± 1.745
0.826AspMet: 0.826 ± 0.555
2.479AspAsn: 2.479 ± 0.943
3.719AspPro: 3.719 ± 0.474
4.132AspGln: 4.132 ± 0.63
2.479AspArg: 2.479 ± 0.542
4.959AspSer: 4.959 ± 0.504
3.306AspThr: 3.306 ± 0.807
4.132AspVal: 4.132 ± 0.992
1.24AspTrp: 1.24 ± 1.044
1.24AspTyr: 1.24 ± 0.917
0.0AspXaa: 0.0 ± 0.0
Glu
3.306GluAla: 3.306 ± 1.122
1.24GluCys: 1.24 ± 0.843
4.545GluAsp: 4.545 ± 1.491
6.198GluGlu: 6.198 ± 1.666
0.826GluPhe: 0.826 ± 0.515
4.132GluGly: 4.132 ± 1.129
0.826GluHis: 0.826 ± 0.537
2.479GluIle: 2.479 ± 0.933
4.959GluLys: 4.959 ± 1.01
7.438GluLeu: 7.438 ± 1.573
0.413GluMet: 0.413 ± 0.357
5.372GluAsn: 5.372 ± 1.242
2.479GluPro: 2.479 ± 1.359
2.893GluGln: 2.893 ± 0.833
1.653GluArg: 1.653 ± 1.012
4.545GluSer: 4.545 ± 1.01
2.893GluThr: 2.893 ± 0.851
2.479GluVal: 2.479 ± 0.741
0.413GluTrp: 0.413 ± 0.348
1.653GluTyr: 1.653 ± 0.744
0.0GluXaa: 0.0 ± 0.0
Phe
4.132PheAla: 4.132 ± 1.688
1.653PheCys: 1.653 ± 1.07
3.306PheAsp: 3.306 ± 0.934
3.719PheGlu: 3.719 ± 0.845
2.479PhePhe: 2.479 ± 1.021
4.959PheGly: 4.959 ± 1.257
0.413PheHis: 0.413 ± 0.348
2.479PheIle: 2.479 ± 0.48
2.479PheLys: 2.479 ± 0.912
4.959PheLeu: 4.959 ± 1.618
0.413PheMet: 0.413 ± 0.348
2.066PheAsn: 2.066 ± 0.694
1.653PhePro: 1.653 ± 0.507
1.653PheGln: 1.653 ± 1.01
2.066PheArg: 2.066 ± 0.385
2.893PheSer: 2.893 ± 1.258
2.066PheThr: 2.066 ± 0.694
2.479PheVal: 2.479 ± 0.734
1.24PheTrp: 1.24 ± 0.686
3.719PheTyr: 3.719 ± 0.707
0.0PheXaa: 0.0 ± 0.0
Gly
3.306GlyAla: 3.306 ± 0.599
0.413GlyCys: 0.413 ± 0.357
4.959GlyAsp: 4.959 ± 1.091
1.24GlyGlu: 1.24 ± 0.681
0.413GlyPhe: 0.413 ± 0.357
4.545GlyGly: 4.545 ± 2.43
2.066GlyHis: 2.066 ± 0.724
2.479GlyIle: 2.479 ± 1.104
4.545GlyLys: 4.545 ± 0.815
4.545GlyLeu: 4.545 ± 1.138
0.826GlyMet: 0.826 ± 0.566
2.066GlyAsn: 2.066 ± 1.016
2.066GlyPro: 2.066 ± 0.659
2.893GlyGln: 2.893 ± 0.673
4.545GlyArg: 4.545 ± 1.629
4.132GlySer: 4.132 ± 1.242
5.785GlyThr: 5.785 ± 1.592
1.653GlyVal: 1.653 ± 0.591
0.0GlyTrp: 0.0 ± 0.0
1.653GlyTyr: 1.653 ± 0.987
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
1.24HisCys: 1.24 ± 0.797
0.413HisAsp: 0.413 ± 0.348
0.413HisGlu: 0.413 ± 0.348
0.826HisPhe: 0.826 ± 0.403
1.24HisGly: 1.24 ± 0.643
0.0HisHis: 0.0 ± 0.0
0.826HisIle: 0.826 ± 0.403
0.826HisLys: 0.826 ± 0.43
2.066HisLeu: 2.066 ± 1.099
0.413HisMet: 0.413 ± 0.348
0.413HisAsn: 0.413 ± 0.448
1.24HisPro: 1.24 ± 0.68
1.653HisGln: 1.653 ± 1.278
1.24HisArg: 1.24 ± 0.642
1.24HisSer: 1.24 ± 0.589
0.826HisThr: 0.826 ± 0.714
0.413HisVal: 0.413 ± 0.357
1.24HisTrp: 1.24 ± 0.626
1.653HisTyr: 1.653 ± 0.862
0.0HisXaa: 0.0 ± 0.0
Ile
1.24IleAla: 1.24 ± 0.641
1.24IleCys: 1.24 ± 0.599
5.372IleAsp: 5.372 ± 1.35
3.306IleGlu: 3.306 ± 1.637
2.479IlePhe: 2.479 ± 1.109
2.066IleGly: 2.066 ± 0.694
0.0IleHis: 0.0 ± 0.0
2.893IleIle: 2.893 ± 0.982
0.0IleLys: 0.0 ± 0.0
4.959IleLeu: 4.959 ± 0.996
0.0IleMet: 0.0 ± 0.0
5.372IleAsn: 5.372 ± 1.78
2.479IlePro: 2.479 ± 1.349
2.893IleGln: 2.893 ± 0.911
2.479IleArg: 2.479 ± 1.072
4.959IleSer: 4.959 ± 1.408
2.479IleThr: 2.479 ± 0.907
2.893IleVal: 2.893 ± 0.831
0.0IleTrp: 0.0 ± 0.0
2.066IleTyr: 2.066 ± 0.724
0.0IleXaa: 0.0 ± 0.0
Lys
1.24LysAla: 1.24 ± 0.367
2.066LysCys: 2.066 ± 0.656
0.826LysAsp: 0.826 ± 0.43
2.893LysGlu: 2.893 ± 1.209
2.066LysPhe: 2.066 ± 0.761
2.066LysGly: 2.066 ± 0.621
1.24LysHis: 1.24 ± 0.673
2.479LysIle: 2.479 ± 1.164
3.719LysLys: 3.719 ± 0.616
5.372LysLeu: 5.372 ± 0.918
0.826LysMet: 0.826 ± 0.683
3.306LysAsn: 3.306 ± 1.392
1.653LysPro: 1.653 ± 0.731
3.306LysGln: 3.306 ± 0.703
4.959LysArg: 4.959 ± 0.665
5.372LysSer: 5.372 ± 1.991
4.132LysThr: 4.132 ± 1.036
3.719LysVal: 3.719 ± 1.063
1.653LysTrp: 1.653 ± 0.991
2.066LysTyr: 2.066 ± 0.95
0.0LysXaa: 0.0 ± 0.0
Leu
4.959LeuAla: 4.959 ± 1.187
1.24LeuCys: 1.24 ± 0.698
7.025LeuAsp: 7.025 ± 1.858
4.959LeuGlu: 4.959 ± 1.584
5.372LeuPhe: 5.372 ± 1.368
4.545LeuGly: 4.545 ± 1.87
2.893LeuHis: 2.893 ± 1.198
4.132LeuIle: 4.132 ± 1.063
5.785LeuLys: 5.785 ± 1.353
8.678LeuLeu: 8.678 ± 1.715
3.306LeuMet: 3.306 ± 1.101
5.372LeuAsn: 5.372 ± 1.254
6.198LeuPro: 6.198 ± 1.368
5.372LeuGln: 5.372 ± 1.144
2.479LeuArg: 2.479 ± 1.0
9.091LeuSer: 9.091 ± 1.629
4.959LeuThr: 4.959 ± 0.862
5.372LeuVal: 5.372 ± 1.657
0.413LeuTrp: 0.413 ± 0.448
5.372LeuTyr: 5.372 ± 1.029
0.0LeuXaa: 0.0 ± 0.0
Met
1.24MetAla: 1.24 ± 0.676
0.413MetCys: 0.413 ± 0.357
0.413MetAsp: 0.413 ± 0.437
0.0MetGlu: 0.0 ± 0.0
0.826MetPhe: 0.826 ± 0.493
0.826MetGly: 0.826 ± 0.696
0.0MetHis: 0.0 ± 0.0
0.413MetIle: 0.413 ± 0.348
0.826MetLys: 0.826 ± 0.594
1.653MetLeu: 1.653 ± 0.755
0.0MetMet: 0.0 ± 0.0
2.066MetAsn: 2.066 ± 0.95
0.826MetPro: 0.826 ± 0.696
0.826MetGln: 0.826 ± 0.493
1.24MetArg: 1.24 ± 0.556
2.479MetSer: 2.479 ± 1.043
0.826MetThr: 0.826 ± 0.493
1.653MetVal: 1.653 ± 1.036
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.479AsnAla: 2.479 ± 0.886
1.653AsnCys: 1.653 ± 1.144
3.719AsnAsp: 3.719 ± 1.743
4.132AsnGlu: 4.132 ± 1.062
3.719AsnPhe: 3.719 ± 1.005
2.893AsnGly: 2.893 ± 1.057
0.826AsnHis: 0.826 ± 0.696
2.066AsnIle: 2.066 ± 1.059
2.893AsnLys: 2.893 ± 1.219
5.785AsnLeu: 5.785 ± 1.703
2.066AsnMet: 2.066 ± 0.385
2.893AsnAsn: 2.893 ± 0.738
3.719AsnPro: 3.719 ± 1.633
3.719AsnGln: 3.719 ± 1.219
2.479AsnArg: 2.479 ± 0.557
3.306AsnSer: 3.306 ± 1.106
2.066AsnThr: 2.066 ± 0.481
3.306AsnVal: 3.306 ± 1.315
1.24AsnTrp: 1.24 ± 0.411
1.653AsnTyr: 1.653 ± 0.649
0.0AsnXaa: 0.0 ± 0.0
Pro
3.719ProAla: 3.719 ± 0.922
0.413ProCys: 0.413 ± 0.357
4.132ProAsp: 4.132 ± 0.952
4.132ProGlu: 4.132 ± 1.33
1.653ProPhe: 1.653 ± 1.111
0.826ProGly: 0.826 ± 0.537
1.24ProHis: 1.24 ± 1.312
2.066ProIle: 2.066 ± 0.675
3.306ProLys: 3.306 ± 0.599
6.612ProLeu: 6.612 ± 2.258
0.0ProMet: 0.0 ± 0.0
3.719ProAsn: 3.719 ± 1.138
5.785ProPro: 5.785 ± 1.193
2.066ProGln: 2.066 ± 0.95
2.479ProArg: 2.479 ± 0.77
4.545ProSer: 4.545 ± 2.152
3.719ProThr: 3.719 ± 1.091
2.479ProVal: 2.479 ± 0.51
0.0ProTrp: 0.0 ± 0.0
2.479ProTyr: 2.479 ± 1.248
0.0ProXaa: 0.0 ± 0.0
Gln
2.479GlnAla: 2.479 ± 0.675
0.413GlnCys: 0.413 ± 0.487
3.306GlnAsp: 3.306 ± 1.166
1.653GlnGlu: 1.653 ± 0.981
2.066GlnPhe: 2.066 ± 0.71
2.893GlnGly: 2.893 ± 0.869
1.653GlnHis: 1.653 ± 0.587
2.893GlnIle: 2.893 ± 0.767
1.653GlnLys: 1.653 ± 0.868
4.545GlnLeu: 4.545 ± 0.666
2.893GlnMet: 2.893 ± 1.609
3.306GlnAsn: 3.306 ± 1.134
2.066GlnPro: 2.066 ± 0.399
2.066GlnGln: 2.066 ± 1.081
1.653GlnArg: 1.653 ± 1.036
3.306GlnSer: 3.306 ± 1.089
2.479GlnThr: 2.479 ± 0.761
4.132GlnVal: 4.132 ± 0.925
1.24GlnTrp: 1.24 ± 1.044
1.653GlnTyr: 1.653 ± 0.806
0.0GlnXaa: 0.0 ± 0.0
Arg
3.306ArgAla: 3.306 ± 0.875
2.893ArgCys: 2.893 ± 1.515
4.132ArgAsp: 4.132 ± 1.387
2.066ArgGlu: 2.066 ± 1.111
3.306ArgPhe: 3.306 ± 0.65
2.066ArgGly: 2.066 ± 0.667
2.066ArgHis: 2.066 ± 1.391
1.653ArgIle: 1.653 ± 0.58
4.959ArgLys: 4.959 ± 1.031
6.198ArgLeu: 6.198 ± 1.162
0.413ArgMet: 0.413 ± 0.357
2.066ArgAsn: 2.066 ± 1.53
2.893ArgPro: 2.893 ± 1.014
1.653ArgGln: 1.653 ± 0.618
5.372ArgArg: 5.372 ± 2.034
3.306ArgSer: 3.306 ± 0.672
0.826ArgThr: 0.826 ± 0.411
2.893ArgVal: 2.893 ± 1.043
1.24ArgTrp: 1.24 ± 0.735
0.826ArgTyr: 0.826 ± 0.43
0.0ArgXaa: 0.0 ± 0.0
Ser
4.545SerAla: 4.545 ± 1.049
0.413SerCys: 0.413 ± 0.344
3.719SerAsp: 3.719 ± 1.559
3.719SerGlu: 3.719 ± 1.464
5.372SerPhe: 5.372 ± 1.923
5.372SerGly: 5.372 ± 1.281
0.826SerHis: 0.826 ± 0.714
3.306SerIle: 3.306 ± 1.692
3.306SerLys: 3.306 ± 0.907
10.744SerLeu: 10.744 ± 1.83
1.24SerMet: 1.24 ± 0.669
2.479SerAsn: 2.479 ± 2.088
5.785SerPro: 5.785 ± 1.952
2.479SerGln: 2.479 ± 0.919
4.959SerArg: 4.959 ± 2.379
7.438SerSer: 7.438 ± 4.443
5.785SerThr: 5.785 ± 1.86
6.198SerVal: 6.198 ± 0.894
0.0SerTrp: 0.0 ± 0.0
2.066SerTyr: 2.066 ± 0.824
0.0SerXaa: 0.0 ± 0.0
Thr
4.545ThrAla: 4.545 ± 1.471
0.826ThrCys: 0.826 ± 0.561
3.719ThrAsp: 3.719 ± 1.546
4.545ThrGlu: 4.545 ± 1.17
3.306ThrPhe: 3.306 ± 0.649
4.132ThrGly: 4.132 ± 0.85
0.413ThrHis: 0.413 ± 0.344
4.132ThrIle: 4.132 ± 1.593
2.066ThrLys: 2.066 ± 0.689
4.132ThrLeu: 4.132 ± 0.89
0.0ThrMet: 0.0 ± 0.0
3.306ThrAsn: 3.306 ± 1.022
2.479ThrPro: 2.479 ± 0.741
2.479ThrGln: 2.479 ± 0.542
4.959ThrArg: 4.959 ± 1.61
6.198ThrSer: 6.198 ± 1.466
4.959ThrThr: 4.959 ± 2.165
3.719ThrVal: 3.719 ± 0.799
0.413ThrTrp: 0.413 ± 0.348
2.066ThrTyr: 2.066 ± 1.017
0.0ThrXaa: 0.0 ± 0.0
Val
0.413ValAla: 0.413 ± 0.448
0.826ValCys: 0.826 ± 0.558
2.893ValAsp: 2.893 ± 0.736
2.893ValGlu: 2.893 ± 0.738
3.719ValPhe: 3.719 ± 0.755
4.132ValGly: 4.132 ± 1.416
1.24ValHis: 1.24 ± 0.806
4.545ValIle: 4.545 ± 1.888
3.719ValLys: 3.719 ± 0.741
2.893ValLeu: 2.893 ± 0.565
0.826ValMet: 0.826 ± 0.714
2.479ValAsn: 2.479 ± 0.932
4.959ValPro: 4.959 ± 2.063
2.479ValGln: 2.479 ± 1.349
2.479ValArg: 2.479 ± 1.234
4.959ValSer: 4.959 ± 0.965
4.545ValThr: 4.545 ± 0.953
2.893ValVal: 2.893 ± 0.641
0.413ValTrp: 0.413 ± 0.357
3.306ValTyr: 3.306 ± 0.973
0.0ValXaa: 0.0 ± 0.0
Trp
0.413TrpAla: 0.413 ± 0.348
0.0TrpCys: 0.0 ± 0.0
1.653TrpAsp: 1.653 ± 0.868
0.826TrpGlu: 0.826 ± 0.625
0.413TrpPhe: 0.413 ± 0.348
0.0TrpGly: 0.0 ± 0.0
0.413TrpHis: 0.413 ± 0.448
1.653TrpIle: 1.653 ± 0.938
1.24TrpLys: 1.24 ± 0.673
1.653TrpLeu: 1.653 ± 0.822
0.0TrpMet: 0.0 ± 0.0
0.413TrpAsn: 0.413 ± 0.357
0.413TrpPro: 0.413 ± 0.357
0.826TrpGln: 0.826 ± 0.411
0.826TrpArg: 0.826 ± 0.607
0.0TrpSer: 0.0 ± 0.0
1.24TrpThr: 1.24 ± 0.806
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.413TrpTyr: 0.413 ± 0.487
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.653TyrAla: 1.653 ± 0.595
1.24TyrCys: 1.24 ± 1.461
0.826TyrAsp: 0.826 ± 0.493
1.24TyrGlu: 1.24 ± 1.006
3.719TyrPhe: 3.719 ± 0.474
2.893TyrGly: 2.893 ± 0.839
0.0TyrHis: 0.0 ± 0.0
0.826TyrIle: 0.826 ± 0.411
3.306TyrLys: 3.306 ± 0.661
2.893TyrLeu: 2.893 ± 0.727
0.826TyrMet: 0.826 ± 0.496
3.306TyrAsn: 3.306 ± 0.808
0.413TyrPro: 0.413 ± 0.344
2.066TyrGln: 2.066 ± 0.824
2.893TyrArg: 2.893 ± 0.579
1.24TyrSer: 1.24 ± 0.723
4.132TyrThr: 4.132 ± 1.089
1.24TyrVal: 1.24 ± 0.411
0.413TyrTrp: 0.413 ± 0.357
0.413TyrTyr: 0.413 ± 0.448
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2421 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski