Amino acid dipepetide frequency for Human papillomavirus 21

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.557AlaAla: 2.557 ± 0.945
0.73AlaCys: 0.73 ± 0.537
4.383AlaAsp: 4.383 ± 1.0
2.922AlaGlu: 2.922 ± 0.601
2.191AlaPhe: 2.191 ± 1.104
2.557AlaGly: 2.557 ± 1.048
0.73AlaHis: 0.73 ± 0.426
2.191AlaIle: 2.191 ± 0.853
2.922AlaLys: 2.922 ± 0.875
3.652AlaLeu: 3.652 ± 1.116
1.826AlaMet: 1.826 ± 1.162
2.191AlaAsn: 2.191 ± 0.585
3.287AlaPro: 3.287 ± 0.55
2.557AlaGln: 2.557 ± 1.564
2.557AlaArg: 2.557 ± 0.907
2.191AlaSer: 2.191 ± 0.755
4.018AlaThr: 4.018 ± 1.456
4.018AlaVal: 4.018 ± 0.794
1.096AlaTrp: 1.096 ± 0.576
2.557AlaTyr: 2.557 ± 0.595
0.0AlaXaa: 0.0 ± 0.0
Cys
1.096CysAla: 1.096 ± 0.837
1.461CysCys: 1.461 ± 0.981
0.365CysAsp: 0.365 ± 0.303
0.73CysGlu: 0.73 ± 0.622
1.096CysPhe: 1.096 ± 0.415
1.461CysGly: 1.461 ± 1.049
0.0CysHis: 0.0 ± 0.0
0.73CysIle: 0.73 ± 0.606
3.287CysLys: 3.287 ± 1.086
1.096CysLeu: 1.096 ± 0.619
0.365CysMet: 0.365 ± 0.425
0.73CysAsn: 0.73 ± 0.518
1.826CysPro: 1.826 ± 0.838
0.365CysGln: 0.365 ± 0.447
1.461CysArg: 1.461 ± 0.976
0.73CysSer: 0.73 ± 0.606
0.365CysThr: 0.365 ± 0.303
0.365CysVal: 0.365 ± 0.425
0.73CysTrp: 0.73 ± 0.42
0.73CysTyr: 0.73 ± 0.537
0.0CysXaa: 0.0 ± 0.0
Asp
1.096AspAla: 1.096 ± 0.438
0.73AspCys: 0.73 ± 0.518
4.383AspAsp: 4.383 ± 1.34
3.287AspGlu: 3.287 ± 0.888
2.557AspPhe: 2.557 ± 1.631
2.191AspGly: 2.191 ± 1.214
0.73AspHis: 0.73 ± 0.538
6.209AspIle: 6.209 ± 1.598
2.557AspLys: 2.557 ± 1.028
4.748AspLeu: 4.748 ± 1.275
1.826AspMet: 1.826 ± 0.817
3.287AspAsn: 3.287 ± 0.957
4.018AspPro: 4.018 ± 1.506
2.557AspGln: 2.557 ± 1.007
3.287AspArg: 3.287 ± 0.638
6.209AspSer: 6.209 ± 1.167
4.748AspThr: 4.748 ± 1.945
4.018AspVal: 4.018 ± 1.604
0.73AspTrp: 0.73 ± 0.606
0.73AspTyr: 0.73 ± 0.497
0.0AspXaa: 0.0 ± 0.0
Glu
2.922GluAla: 2.922 ± 1.034
1.461GluCys: 1.461 ± 0.976
4.018GluAsp: 4.018 ± 1.402
6.939GluGlu: 6.939 ± 2.763
1.826GluPhe: 1.826 ± 1.236
4.748GluGly: 4.748 ± 2.514
1.461GluHis: 1.461 ± 0.322
2.191GluIle: 2.191 ± 0.943
1.096GluLys: 1.096 ± 0.898
6.574GluLeu: 6.574 ± 2.692
0.365GluMet: 0.365 ± 0.303
4.018GluAsn: 4.018 ± 0.52
2.922GluPro: 2.922 ± 1.424
2.557GluGln: 2.557 ± 0.725
2.191GluArg: 2.191 ± 1.197
6.209GluSer: 6.209 ± 1.221
2.557GluThr: 2.557 ± 1.083
5.113GluVal: 5.113 ± 1.411
0.73GluTrp: 0.73 ± 0.42
1.826GluTyr: 1.826 ± 1.173
0.0GluXaa: 0.0 ± 0.0
Phe
2.557PheAla: 2.557 ± 0.93
0.73PheCys: 0.73 ± 0.622
3.652PheAsp: 3.652 ± 1.004
2.922PheGlu: 2.922 ± 1.425
2.557PhePhe: 2.557 ± 0.631
0.73PheGly: 0.73 ± 0.426
0.73PheHis: 0.73 ± 0.518
1.461PheIle: 1.461 ± 0.386
2.191PheLys: 2.191 ± 0.691
3.287PheLeu: 3.287 ± 1.439
0.73PheMet: 0.73 ± 0.513
2.922PheAsn: 2.922 ± 1.195
1.826PhePro: 1.826 ± 0.816
2.557PheGln: 2.557 ± 0.84
1.826PheArg: 1.826 ± 1.326
1.826PheSer: 1.826 ± 0.797
1.096PheThr: 1.096 ± 0.909
2.191PheVal: 2.191 ± 1.055
1.461PheTrp: 1.461 ± 0.839
2.191PheTyr: 2.191 ± 0.6
0.0PheXaa: 0.0 ± 0.0
Gly
1.826GlyAla: 1.826 ± 0.858
0.73GlyCys: 0.73 ± 0.594
4.018GlyAsp: 4.018 ± 1.746
4.018GlyGlu: 4.018 ± 1.113
1.096GlyPhe: 1.096 ± 0.62
5.844GlyGly: 5.844 ± 2.376
4.383GlyHis: 4.383 ± 2.081
3.287GlyIle: 3.287 ± 0.811
4.383GlyLys: 4.383 ± 0.502
2.922GlyLeu: 2.922 ± 0.612
0.365GlyMet: 0.365 ± 0.333
2.922GlyAsn: 2.922 ± 0.998
5.478GlyPro: 5.478 ± 2.612
2.922GlyGln: 2.922 ± 0.731
6.939GlyArg: 6.939 ± 2.41
6.574GlySer: 6.574 ± 1.487
4.748GlyThr: 4.748 ± 1.683
2.191GlyVal: 2.191 ± 0.827
0.0GlyTrp: 0.0 ± 0.0
1.826GlyTyr: 1.826 ± 0.954
0.0GlyXaa: 0.0 ± 0.0
His
0.365HisAla: 0.365 ± 0.39
0.73HisCys: 0.73 ± 0.518
1.096HisAsp: 1.096 ± 0.507
1.096HisGlu: 1.096 ± 0.896
1.826HisPhe: 1.826 ± 0.856
0.365HisGly: 0.365 ± 0.473
0.0HisHis: 0.0 ± 0.0
0.365HisIle: 0.365 ± 0.359
1.461HisLys: 1.461 ± 0.692
1.461HisLeu: 1.461 ± 0.948
0.0HisMet: 0.0 ± 0.0
1.826HisAsn: 1.826 ± 0.522
2.557HisPro: 2.557 ± 1.236
1.461HisGln: 1.461 ± 0.629
0.365HisArg: 0.365 ± 0.39
1.461HisSer: 1.461 ± 0.851
1.096HisThr: 1.096 ± 0.425
1.096HisVal: 1.096 ± 0.62
1.461HisTrp: 1.461 ± 0.625
0.73HisTyr: 0.73 ± 0.36
0.0HisXaa: 0.0 ± 0.0
Ile
2.557IleAla: 2.557 ± 1.237
1.096IleCys: 1.096 ± 0.887
2.191IleAsp: 2.191 ± 0.759
3.652IleGlu: 3.652 ± 1.284
1.096IlePhe: 1.096 ± 0.704
4.383IleGly: 4.383 ± 1.08
2.191IleHis: 2.191 ± 0.903
2.557IleIle: 2.557 ± 1.172
2.557IleLys: 2.557 ± 1.183
2.922IleLeu: 2.922 ± 0.383
0.365IleMet: 0.365 ± 0.303
2.557IleAsn: 2.557 ± 0.957
2.922IlePro: 2.922 ± 1.404
1.461IleGln: 1.461 ± 0.631
2.557IleArg: 2.557 ± 1.168
2.557IleSer: 2.557 ± 0.409
2.191IleThr: 2.191 ± 0.839
2.191IleVal: 2.191 ± 0.801
0.365IleTrp: 0.365 ± 0.303
2.922IleTyr: 2.922 ± 0.84
0.0IleXaa: 0.0 ± 0.0
Lys
2.922LysAla: 2.922 ± 1.311
0.73LysCys: 0.73 ± 0.513
1.461LysAsp: 1.461 ± 0.716
4.748LysGlu: 4.748 ± 1.267
2.557LysPhe: 2.557 ± 0.929
4.748LysGly: 4.748 ± 1.291
1.461LysHis: 1.461 ± 0.981
2.191LysIle: 2.191 ± 0.918
1.461LysLys: 1.461 ± 0.597
4.383LysLeu: 4.383 ± 1.59
0.365LysMet: 0.365 ± 0.39
1.826LysAsn: 1.826 ± 1.5
0.365LysPro: 0.365 ± 0.447
2.557LysGln: 2.557 ± 0.409
4.748LysArg: 4.748 ± 0.785
4.018LysSer: 4.018 ± 1.81
2.557LysThr: 2.557 ± 0.817
4.018LysVal: 4.018 ± 1.826
0.0LysTrp: 0.0 ± 0.0
1.826LysTyr: 1.826 ± 0.532
0.0LysXaa: 0.0 ± 0.0
Leu
2.922LeuAla: 2.922 ± 0.778
2.191LeuCys: 2.191 ± 1.032
6.209LeuAsp: 6.209 ± 1.215
6.209LeuGlu: 6.209 ± 2.556
5.478LeuPhe: 5.478 ± 1.537
5.113LeuGly: 5.113 ± 1.384
1.096LeuHis: 1.096 ± 0.646
3.652LeuIle: 3.652 ± 0.586
2.557LeuLys: 2.557 ± 0.746
9.131LeuLeu: 9.131 ± 2.933
2.557LeuMet: 2.557 ± 0.88
1.826LeuAsn: 1.826 ± 0.608
2.922LeuPro: 2.922 ± 1.2
6.939LeuGln: 6.939 ± 1.795
3.652LeuArg: 3.652 ± 1.235
6.209LeuSer: 6.209 ± 0.961
5.478LeuThr: 5.478 ± 1.885
4.018LeuVal: 4.018 ± 1.415
0.73LeuTrp: 0.73 ± 0.518
0.73LeuTyr: 0.73 ± 0.666
0.0LeuXaa: 0.0 ± 0.0
Met
2.191MetAla: 2.191 ± 0.763
0.0MetCys: 0.0 ± 0.0
0.365MetAsp: 0.365 ± 0.303
0.365MetGlu: 0.365 ± 0.333
1.461MetPhe: 1.461 ± 0.859
0.365MetGly: 0.365 ± 0.303
0.0MetHis: 0.0 ± 0.0
0.73MetIle: 0.73 ± 0.69
1.826MetLys: 1.826 ± 0.889
2.191MetLeu: 2.191 ± 0.975
1.096MetMet: 1.096 ± 0.903
1.096MetAsn: 1.096 ± 0.796
0.0MetPro: 0.0 ± 0.0
1.461MetGln: 1.461 ± 0.454
0.0MetArg: 0.0 ± 0.0
2.557MetSer: 2.557 ± 1.747
0.73MetThr: 0.73 ± 0.497
0.73MetVal: 0.73 ± 0.606
0.365MetTrp: 0.365 ± 0.333
0.365MetTyr: 0.365 ± 0.39
0.0MetXaa: 0.0 ± 0.0
Asn
3.652AsnAla: 3.652 ± 1.235
0.73AsnCys: 0.73 ± 0.532
2.191AsnAsp: 2.191 ± 0.915
3.287AsnGlu: 3.287 ± 1.204
1.826AsnPhe: 1.826 ± 1.051
2.922AsnGly: 2.922 ± 1.169
0.73AsnHis: 0.73 ± 0.666
2.191AsnIle: 2.191 ± 0.655
1.826AsnLys: 1.826 ± 0.242
1.461AsnLeu: 1.461 ± 0.677
0.73AsnMet: 0.73 ± 0.714
1.461AsnAsn: 1.461 ± 0.88
4.383AsnPro: 4.383 ± 2.149
1.826AsnGln: 1.826 ± 0.817
2.557AsnArg: 2.557 ± 0.817
2.557AsnSer: 2.557 ± 1.439
4.383AsnThr: 4.383 ± 1.222
3.652AsnVal: 3.652 ± 0.422
0.0AsnTrp: 0.0 ± 0.0
1.461AsnTyr: 1.461 ± 0.554
0.0AsnXaa: 0.0 ± 0.0
Pro
4.383ProAla: 4.383 ± 0.86
2.191ProCys: 2.191 ± 0.9
6.209ProAsp: 6.209 ± 1.628
4.748ProGlu: 4.748 ± 1.807
0.73ProPhe: 0.73 ± 0.42
3.652ProGly: 3.652 ± 2.437
0.365ProHis: 0.365 ± 0.447
1.826ProIle: 1.826 ± 0.736
4.018ProLys: 4.018 ± 1.253
4.383ProLeu: 4.383 ± 1.258
1.461ProMet: 1.461 ± 0.88
2.557ProAsn: 2.557 ± 0.903
17.166ProPro: 17.166 ± 11.231
1.826ProGln: 1.826 ± 0.798
2.557ProArg: 2.557 ± 0.935
5.844ProSer: 5.844 ± 1.831
4.018ProThr: 4.018 ± 2.358
4.383ProVal: 4.383 ± 1.243
0.365ProTrp: 0.365 ± 0.333
1.096ProTyr: 1.096 ± 0.751
0.0ProXaa: 0.0 ± 0.0
Gln
3.287GlnAla: 3.287 ± 0.935
1.096GlnCys: 1.096 ± 0.507
2.557GlnAsp: 2.557 ± 0.925
2.191GlnGlu: 2.191 ± 1.047
1.826GlnPhe: 1.826 ± 0.941
1.826GlnGly: 1.826 ± 0.242
0.73GlnHis: 0.73 ± 0.42
2.191GlnIle: 2.191 ± 0.549
1.461GlnLys: 1.461 ± 0.721
6.209GlnLeu: 6.209 ± 1.233
1.096GlnMet: 1.096 ± 0.506
1.826GlnAsn: 1.826 ± 1.138
2.922GlnPro: 2.922 ± 1.08
3.652GlnGln: 3.652 ± 1.226
2.922GlnArg: 2.922 ± 0.783
2.922GlnSer: 2.922 ± 1.12
3.287GlnThr: 3.287 ± 0.814
2.557GlnVal: 2.557 ± 1.093
0.73GlnTrp: 0.73 ± 0.36
2.922GlnTyr: 2.922 ± 0.889
0.0GlnXaa: 0.0 ± 0.0
Arg
1.826ArgAla: 1.826 ± 0.532
0.73ArgCys: 0.73 ± 0.537
2.557ArgAsp: 2.557 ± 1.597
1.461ArgGlu: 1.461 ± 0.839
1.826ArgPhe: 1.826 ± 0.532
8.4ArgGly: 8.4 ± 1.963
1.826ArgHis: 1.826 ± 0.821
1.826ArgIle: 1.826 ± 0.812
4.018ArgLys: 4.018 ± 0.821
6.939ArgLeu: 6.939 ± 1.185
0.365ArgMet: 0.365 ± 0.333
2.557ArgAsn: 2.557 ± 0.771
4.748ArgPro: 4.748 ± 2.116
1.461ArgGln: 1.461 ± 0.625
6.209ArgArg: 6.209 ± 2.814
9.131ArgSer: 9.131 ± 5.934
3.652ArgThr: 3.652 ± 1.632
3.652ArgVal: 3.652 ± 1.565
0.0ArgTrp: 0.0 ± 0.0
2.922ArgTyr: 2.922 ± 0.875
0.0ArgXaa: 0.0 ± 0.0
Ser
5.478SerAla: 5.478 ± 1.242
0.73SerCys: 0.73 ± 0.779
4.383SerAsp: 4.383 ± 0.574
4.383SerGlu: 4.383 ± 1.63
4.018SerPhe: 4.018 ± 0.806
5.478SerGly: 5.478 ± 1.765
0.365SerHis: 0.365 ± 0.447
2.191SerIle: 2.191 ± 0.844
3.652SerLys: 3.652 ± 1.284
7.305SerLeu: 7.305 ± 1.481
1.826SerMet: 1.826 ± 0.87
1.461SerAsn: 1.461 ± 0.847
4.383SerPro: 4.383 ± 1.849
3.287SerGln: 3.287 ± 0.989
10.957SerArg: 10.957 ± 5.797
8.766SerSer: 8.766 ± 3.328
11.322SerThr: 11.322 ± 3.493
2.922SerVal: 2.922 ± 1.23
0.73SerTrp: 0.73 ± 0.36
2.191SerTyr: 2.191 ± 0.658
0.0SerXaa: 0.0 ± 0.0
Thr
2.922ThrAla: 2.922 ± 0.836
1.826ThrCys: 1.826 ± 0.685
4.383ThrAsp: 4.383 ± 0.651
2.557ThrGlu: 2.557 ± 0.702
2.191ThrPhe: 2.191 ± 1.219
4.383ThrGly: 4.383 ± 1.938
1.096ThrHis: 1.096 ± 0.621
4.383ThrIle: 4.383 ± 2.305
1.826ThrLys: 1.826 ± 0.242
3.287ThrLeu: 3.287 ± 0.526
1.461ThrMet: 1.461 ± 0.863
3.287ThrAsn: 3.287 ± 0.682
6.209ThrPro: 6.209 ± 2.557
2.922ThrGln: 2.922 ± 1.114
5.478ThrArg: 5.478 ± 1.421
6.574ThrSer: 6.574 ± 2.771
8.035ThrThr: 8.035 ± 3.507
4.018ThrVal: 4.018 ± 1.109
1.096ThrTrp: 1.096 ± 0.958
1.461ThrTyr: 1.461 ± 0.717
0.0ThrXaa: 0.0 ± 0.0
Val
3.652ValAla: 3.652 ± 1.146
0.73ValCys: 0.73 ± 0.85
4.018ValAsp: 4.018 ± 0.846
4.018ValGlu: 4.018 ± 1.665
1.461ValPhe: 1.461 ± 0.531
4.748ValGly: 4.748 ± 0.902
1.826ValHis: 1.826 ± 1.199
2.191ValIle: 2.191 ± 1.483
1.096ValLys: 1.096 ± 0.607
3.287ValLeu: 3.287 ± 0.838
0.0ValMet: 0.0 ± 0.0
4.383ValAsn: 4.383 ± 1.253
4.018ValPro: 4.018 ± 1.127
4.018ValGln: 4.018 ± 1.448
3.652ValArg: 3.652 ± 1.085
5.844ValSer: 5.844 ± 2.123
3.652ValThr: 3.652 ± 1.441
2.191ValVal: 2.191 ± 0.788
0.73ValTrp: 0.73 ± 0.779
2.191ValTyr: 2.191 ± 1.456
0.0ValXaa: 0.0 ± 0.0
Trp
0.73TrpAla: 0.73 ± 0.42
0.365TrpCys: 0.365 ± 0.303
0.73TrpAsp: 0.73 ± 0.779
0.73TrpGlu: 0.73 ± 0.513
0.365TrpPhe: 0.365 ± 0.39
0.0TrpGly: 0.0 ± 0.0
0.365TrpHis: 0.365 ± 0.333
1.461TrpIle: 1.461 ± 1.212
1.096TrpLys: 1.096 ± 0.898
1.096TrpLeu: 1.096 ± 0.62
0.365TrpMet: 0.365 ± 0.447
0.365TrpAsn: 0.365 ± 0.303
0.0TrpPro: 0.0 ± 0.0
1.096TrpGln: 1.096 ± 0.796
0.365TrpArg: 0.365 ± 0.333
1.096TrpSer: 1.096 ± 0.624
0.365TrpThr: 0.365 ± 0.447
0.73TrpVal: 0.73 ± 0.36
0.0TrpTrp: 0.0 ± 0.0
0.365TrpTyr: 0.365 ± 0.303
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.191TyrAla: 2.191 ± 0.83
0.0TyrCys: 0.0 ± 0.0
0.73TyrAsp: 0.73 ± 0.47
1.461TyrGlu: 1.461 ± 0.896
1.461TyrPhe: 1.461 ± 0.94
2.557TyrGly: 2.557 ± 0.65
0.73TyrHis: 0.73 ± 0.403
1.461TyrIle: 1.461 ± 0.756
3.652TyrLys: 3.652 ± 1.555
3.287TyrLeu: 3.287 ± 0.987
0.365TyrMet: 0.365 ± 0.303
0.73TyrAsn: 0.73 ± 0.537
1.826TyrPro: 1.826 ± 0.542
1.096TyrGln: 1.096 ± 0.576
1.826TyrArg: 1.826 ± 0.542
2.557TyrSer: 2.557 ± 1.078
1.096TyrThr: 1.096 ± 0.578
3.652TyrVal: 3.652 ± 1.516
0.365TyrTrp: 0.365 ± 0.447
2.191TyrTyr: 2.191 ± 0.877
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2739 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski