Amino acid dipepetide frequency for Human papillomavirus type 144

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.161AlaAla: 4.161 ± 2.187
0.832AlaCys: 0.832 ± 0.685
3.745AlaAsp: 3.745 ± 1.173
4.578AlaGlu: 4.578 ± 1.888
2.497AlaPhe: 2.497 ± 0.951
1.665AlaGly: 1.665 ± 0.81
1.248AlaHis: 1.248 ± 0.464
4.994AlaIle: 4.994 ± 0.888
3.329AlaLys: 3.329 ± 1.5
4.994AlaLeu: 4.994 ± 1.834
0.832AlaMet: 0.832 ± 0.39
1.665AlaAsn: 1.665 ± 0.99
2.913AlaPro: 2.913 ± 0.869
2.497AlaGln: 2.497 ± 1.094
5.826AlaArg: 5.826 ± 2.027
4.161AlaSer: 4.161 ± 1.534
2.913AlaThr: 2.913 ± 0.552
2.497AlaVal: 2.497 ± 1.422
0.832AlaTrp: 0.832 ± 0.409
0.832AlaTyr: 0.832 ± 0.709
0.0AlaXaa: 0.0 ± 0.0
Cys
1.248CysAla: 1.248 ± 1.118
1.248CysCys: 1.248 ± 1.13
1.665CysAsp: 1.665 ± 0.78
0.832CysGlu: 0.832 ± 0.844
1.248CysPhe: 1.248 ± 0.893
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
2.497CysIle: 2.497 ± 1.471
1.248CysLys: 1.248 ± 0.605
0.832CysLeu: 0.832 ± 0.685
0.832CysMet: 0.832 ± 0.645
0.832CysAsn: 0.832 ± 0.409
1.665CysPro: 1.665 ± 0.917
0.416CysGln: 0.416 ± 0.354
0.416CysArg: 0.416 ± 0.534
1.665CysSer: 1.665 ± 1.196
1.248CysThr: 1.248 ± 0.605
0.0CysVal: 0.0 ± 0.0
0.832CysTrp: 0.832 ± 0.466
0.416CysTyr: 0.416 ± 0.534
0.0CysXaa: 0.0 ± 0.0
Asp
4.578AspAla: 4.578 ± 0.888
0.832AspCys: 0.832 ± 0.39
4.578AspAsp: 4.578 ± 1.038
4.161AspGlu: 4.161 ± 1.307
3.329AspPhe: 3.329 ± 0.874
4.994AspGly: 4.994 ± 1.426
0.832AspHis: 0.832 ± 0.621
7.491AspIle: 7.491 ± 2.11
1.248AspLys: 1.248 ± 0.875
4.578AspLeu: 4.578 ± 1.932
1.248AspMet: 1.248 ± 0.768
4.578AspAsn: 4.578 ± 1.299
4.578AspPro: 4.578 ± 1.01
1.248AspGln: 1.248 ± 1.11
1.665AspArg: 1.665 ± 0.629
4.578AspSer: 4.578 ± 1.19
5.41AspThr: 5.41 ± 1.163
6.242AspVal: 6.242 ± 2.15
0.832AspTrp: 0.832 ± 0.709
0.832AspTyr: 0.832 ± 0.587
0.0AspXaa: 0.0 ± 0.0
Glu
3.329GluAla: 3.329 ± 1.316
1.665GluCys: 1.665 ± 0.65
5.826GluAsp: 5.826 ± 0.81
8.323GluGlu: 8.323 ± 2.756
0.832GluPhe: 0.832 ± 0.709
2.497GluGly: 2.497 ± 1.329
1.665GluHis: 1.665 ± 0.569
2.497GluIle: 2.497 ± 0.58
1.665GluLys: 1.665 ± 1.073
6.658GluLeu: 6.658 ± 1.088
1.248GluMet: 1.248 ± 0.616
4.994GluAsn: 4.994 ± 1.782
1.665GluPro: 1.665 ± 1.246
5.41GluGln: 5.41 ± 0.516
4.161GluArg: 4.161 ± 1.528
3.329GluSer: 3.329 ± 1.244
1.665GluThr: 1.665 ± 0.696
2.497GluVal: 2.497 ± 1.016
0.832GluTrp: 0.832 ± 0.39
1.665GluTyr: 1.665 ± 0.841
0.0GluXaa: 0.0 ± 0.0
Phe
1.665PheAla: 1.665 ± 0.84
0.832PheCys: 0.832 ± 0.645
3.745PheAsp: 3.745 ± 0.874
2.081PheGlu: 2.081 ± 0.644
3.745PhePhe: 3.745 ± 0.815
2.497PheGly: 2.497 ± 0.478
2.081PheHis: 2.081 ± 0.928
2.081PheIle: 2.081 ± 0.416
2.913PheLys: 2.913 ± 1.182
6.242PheLeu: 6.242 ± 2.068
0.832PheMet: 0.832 ± 0.402
1.665PheAsn: 1.665 ± 0.844
2.497PhePro: 2.497 ± 1.188
2.497PheGln: 2.497 ± 1.066
1.665PheArg: 1.665 ± 0.782
3.329PheSer: 3.329 ± 1.06
1.665PheThr: 1.665 ± 0.712
4.161PheVal: 4.161 ± 1.123
1.665PheTrp: 1.665 ± 0.78
1.248PheTyr: 1.248 ± 0.693
0.0PheXaa: 0.0 ± 0.0
Gly
2.913GlyAla: 2.913 ± 0.682
0.832GlyCys: 0.832 ± 0.713
5.41GlyAsp: 5.41 ± 1.234
2.497GlyGlu: 2.497 ± 0.495
0.416GlyPhe: 0.416 ± 0.37
3.745GlyGly: 3.745 ± 1.317
1.665GlyHis: 1.665 ± 0.841
3.329GlyIle: 3.329 ± 0.924
3.329GlyLys: 3.329 ± 0.423
6.242GlyLeu: 6.242 ± 1.087
0.416GlyMet: 0.416 ± 0.343
3.329GlyAsn: 3.329 ± 1.06
2.081GlyPro: 2.081 ± 0.408
3.329GlyGln: 3.329 ± 1.225
2.913GlyArg: 2.913 ± 0.827
2.913GlySer: 2.913 ± 0.682
5.41GlyThr: 5.41 ± 2.635
1.665GlyVal: 1.665 ± 0.701
0.0GlyTrp: 0.0 ± 0.0
1.248GlyTyr: 1.248 ± 0.62
0.0GlyXaa: 0.0 ± 0.0
His
0.832HisAla: 0.832 ± 0.74
0.416HisCys: 0.416 ± 0.354
0.416HisAsp: 0.416 ± 0.343
1.248HisGlu: 1.248 ± 0.647
2.913HisPhe: 2.913 ± 1.15
1.248HisGly: 1.248 ± 0.668
0.0HisHis: 0.0 ± 0.0
1.665HisIle: 1.665 ± 0.749
0.832HisLys: 0.832 ± 0.587
1.665HisLeu: 1.665 ± 0.622
0.832HisMet: 0.832 ± 0.685
0.0HisAsn: 0.0 ± 0.0
2.081HisPro: 2.081 ± 1.013
1.248HisGln: 1.248 ± 0.736
0.832HisArg: 0.832 ± 0.409
2.081HisSer: 2.081 ± 0.856
1.248HisThr: 1.248 ± 0.75
0.416HisVal: 0.416 ± 0.37
0.832HisTrp: 0.832 ± 0.589
0.416HisTyr: 0.416 ± 0.534
0.0HisXaa: 0.0 ± 0.0
Ile
1.665IleAla: 1.665 ± 0.864
1.248IleCys: 1.248 ± 0.605
6.242IleAsp: 6.242 ± 1.696
3.745IleGlu: 3.745 ± 1.045
3.329IlePhe: 3.329 ± 1.275
2.497IleGly: 2.497 ± 1.429
1.665IleHis: 1.665 ± 0.917
3.329IleIle: 3.329 ± 0.747
2.497IleLys: 2.497 ± 0.817
3.745IleLeu: 3.745 ± 0.857
0.416IleMet: 0.416 ± 0.447
1.665IleAsn: 1.665 ± 0.817
3.329IlePro: 3.329 ± 1.252
1.665IleGln: 1.665 ± 0.53
3.745IleArg: 3.745 ± 1.534
5.41IleSer: 5.41 ± 1.65
3.329IleThr: 3.329 ± 0.938
2.913IleVal: 2.913 ± 0.869
0.832IleTrp: 0.832 ± 0.645
1.665IleTyr: 1.665 ± 1.113
0.0IleXaa: 0.0 ± 0.0
Lys
1.248LysAla: 1.248 ± 0.851
1.665LysCys: 1.665 ± 1.011
2.081LysAsp: 2.081 ± 0.621
3.745LysGlu: 3.745 ± 1.034
1.248LysPhe: 1.248 ± 0.464
2.081LysGly: 2.081 ± 0.732
0.832LysHis: 0.832 ± 0.516
2.081LysIle: 2.081 ± 0.65
4.161LysLys: 4.161 ± 0.928
4.161LysLeu: 4.161 ± 0.805
0.832LysMet: 0.832 ± 0.39
2.497LysAsn: 2.497 ± 0.937
2.497LysPro: 2.497 ± 1.063
4.161LysGln: 4.161 ± 1.046
5.41LysArg: 5.41 ± 0.607
3.745LysSer: 3.745 ± 2.274
2.081LysThr: 2.081 ± 0.416
2.913LysVal: 2.913 ± 0.9
0.416LysTrp: 0.416 ± 0.37
3.329LysTyr: 3.329 ± 0.977
0.0LysXaa: 0.0 ± 0.0
Leu
6.658LeuAla: 6.658 ± 1.727
0.416LeuCys: 0.416 ± 0.534
6.242LeuAsp: 6.242 ± 2.112
4.994LeuGlu: 4.994 ± 1.177
7.491LeuPhe: 7.491 ± 1.324
6.658LeuGly: 6.658 ± 1.603
1.248LeuHis: 1.248 ± 0.666
4.161LeuIle: 4.161 ± 0.808
4.994LeuLys: 4.994 ± 1.644
9.571LeuLeu: 9.571 ± 3.454
1.665LeuMet: 1.665 ± 0.793
2.913LeuAsn: 2.913 ± 1.647
4.578LeuPro: 4.578 ± 1.019
7.074LeuGln: 7.074 ± 1.906
2.913LeuArg: 2.913 ± 0.999
6.242LeuSer: 6.242 ± 1.348
4.161LeuThr: 4.161 ± 0.916
7.491LeuVal: 7.491 ± 0.943
0.0LeuTrp: 0.0 ± 0.0
2.913LeuTyr: 2.913 ± 0.755
0.0LeuXaa: 0.0 ± 0.0
Met
2.913MetAla: 2.913 ± 0.752
0.416MetCys: 0.416 ± 0.37
1.248MetAsp: 1.248 ± 0.565
0.832MetGlu: 0.832 ± 0.587
0.416MetPhe: 0.416 ± 0.343
0.832MetGly: 0.832 ± 0.39
0.416MetHis: 0.416 ± 0.447
0.416MetIle: 0.416 ± 0.37
0.416MetLys: 0.416 ± 0.343
1.248MetLeu: 1.248 ± 0.647
0.832MetMet: 0.832 ± 0.516
0.832MetAsn: 0.832 ± 0.39
0.416MetPro: 0.416 ± 0.354
0.416MetGln: 0.416 ± 0.534
1.665MetArg: 1.665 ± 0.72
1.248MetSer: 1.248 ± 0.396
1.665MetThr: 1.665 ± 0.648
0.416MetVal: 0.416 ± 0.354
0.416MetTrp: 0.416 ± 0.343
1.248MetTyr: 1.248 ± 0.685
0.0MetXaa: 0.0 ± 0.0
Asn
1.665AsnAla: 1.665 ± 0.78
1.665AsnCys: 1.665 ± 0.696
2.081AsnAsp: 2.081 ± 0.707
1.248AsnGlu: 1.248 ± 0.399
4.578AsnPhe: 4.578 ± 1.224
1.665AsnGly: 1.665 ± 0.814
0.416AsnHis: 0.416 ± 0.354
2.081AsnIle: 2.081 ± 1.521
2.081AsnLys: 2.081 ± 0.732
3.745AsnLeu: 3.745 ± 1.455
0.832AsnMet: 0.832 ± 0.709
3.329AsnAsn: 3.329 ± 1.041
2.913AsnPro: 2.913 ± 0.839
1.665AsnGln: 1.665 ± 0.498
2.081AsnArg: 2.081 ± 0.978
3.329AsnSer: 3.329 ± 1.468
4.578AsnThr: 4.578 ± 0.8
3.745AsnVal: 3.745 ± 1.358
1.248AsnTrp: 1.248 ± 0.685
0.832AsnTyr: 0.832 ± 0.466
0.0AsnXaa: 0.0 ± 0.0
Pro
4.578ProAla: 4.578 ± 1.67
0.0ProCys: 0.0 ± 0.0
2.913ProAsp: 2.913 ± 1.352
3.745ProGlu: 3.745 ± 0.763
2.913ProPhe: 2.913 ± 0.884
1.248ProGly: 1.248 ± 0.748
0.0ProHis: 0.0 ± 0.0
2.081ProIle: 2.081 ± 0.979
4.161ProLys: 4.161 ± 1.057
7.074ProLeu: 7.074 ± 1.428
0.416ProMet: 0.416 ± 0.343
2.913ProAsn: 2.913 ± 1.071
7.491ProPro: 7.491 ± 1.278
1.248ProGln: 1.248 ± 0.33
4.161ProArg: 4.161 ± 1.894
6.658ProSer: 6.658 ± 2.48
4.161ProThr: 4.161 ± 1.495
3.329ProVal: 3.329 ± 1.518
0.0ProTrp: 0.0 ± 0.0
2.081ProTyr: 2.081 ± 1.034
0.0ProXaa: 0.0 ± 0.0
Gln
1.248GlnAla: 1.248 ± 0.396
1.665GlnCys: 1.665 ± 1.069
3.329GlnAsp: 3.329 ± 1.164
3.745GlnGlu: 3.745 ± 0.839
2.497GlnPhe: 2.497 ± 0.495
2.913GlnGly: 2.913 ± 0.611
0.416GlnHis: 0.416 ± 0.354
0.0GlnIle: 0.0 ± 0.0
2.497GlnLys: 2.497 ± 1.265
5.826GlnLeu: 5.826 ± 1.557
0.832GlnMet: 0.832 ± 0.868
0.832GlnAsn: 0.832 ± 0.434
3.745GlnPro: 3.745 ± 0.977
3.329GlnGln: 3.329 ± 0.683
2.081GlnArg: 2.081 ± 0.841
3.745GlnSer: 3.745 ± 2.252
2.913GlnThr: 2.913 ± 1.147
4.578GlnVal: 4.578 ± 1.1
1.248GlnTrp: 1.248 ± 0.396
1.665GlnTyr: 1.665 ± 0.569
0.0GlnXaa: 0.0 ± 0.0
Arg
4.161ArgAla: 4.161 ± 0.72
2.081ArgCys: 2.081 ± 1.583
2.913ArgAsp: 2.913 ± 0.892
3.329ArgGlu: 3.329 ± 1.67
2.081ArgPhe: 2.081 ± 1.131
3.745ArgGly: 3.745 ± 1.642
2.497ArgHis: 2.497 ± 0.858
1.665ArgIle: 1.665 ± 0.821
3.745ArgLys: 3.745 ± 0.538
6.242ArgLeu: 6.242 ± 1.09
0.832ArgMet: 0.832 ± 0.509
2.081ArgAsn: 2.081 ± 0.714
4.161ArgPro: 4.161 ± 0.936
2.913ArgGln: 2.913 ± 0.441
6.658ArgArg: 6.658 ± 2.464
3.745ArgSer: 3.745 ± 1.086
2.913ArgThr: 2.913 ± 1.631
4.161ArgVal: 4.161 ± 1.45
0.416ArgTrp: 0.416 ± 0.447
1.665ArgTyr: 1.665 ± 1.136
0.0ArgXaa: 0.0 ± 0.0
Ser
4.161SerAla: 4.161 ± 1.611
1.665SerCys: 1.665 ± 1.106
4.578SerAsp: 4.578 ± 1.749
1.248SerGlu: 1.248 ± 0.685
2.497SerPhe: 2.497 ± 0.507
5.826SerGly: 5.826 ± 0.869
2.913SerHis: 2.913 ± 0.954
4.578SerIle: 4.578 ± 1.441
1.248SerLys: 1.248 ± 0.396
6.242SerLeu: 6.242 ± 1.189
2.081SerMet: 2.081 ± 0.715
3.745SerAsn: 3.745 ± 2.724
2.913SerPro: 2.913 ± 0.884
4.161SerGln: 4.161 ± 1.119
4.994SerArg: 4.994 ± 1.436
7.074SerSer: 7.074 ± 1.395
7.491SerThr: 7.491 ± 2.022
4.578SerVal: 4.578 ± 1.058
0.416SerTrp: 0.416 ± 0.343
1.665SerTyr: 1.665 ± 0.651
0.0SerXaa: 0.0 ± 0.0
Thr
2.913ThrAla: 2.913 ± 1.205
0.416ThrCys: 0.416 ± 0.37
4.161ThrAsp: 4.161 ± 1.182
5.41ThrGlu: 5.41 ± 1.075
2.081ThrPhe: 2.081 ± 0.964
4.578ThrGly: 4.578 ± 1.543
0.416ThrHis: 0.416 ± 0.37
2.913ThrIle: 2.913 ± 1.447
1.665ThrLys: 1.665 ± 0.629
5.826ThrLeu: 5.826 ± 2.396
0.832ThrMet: 0.832 ± 0.709
2.913ThrAsn: 2.913 ± 0.755
5.41ThrPro: 5.41 ± 2.59
1.665ThrGln: 1.665 ± 0.569
4.578ThrArg: 4.578 ± 0.837
4.161ThrSer: 4.161 ± 1.44
3.329ThrThr: 3.329 ± 1.244
4.994ThrVal: 4.994 ± 1.212
0.416ThrTrp: 0.416 ± 0.354
2.081ThrTyr: 2.081 ± 0.992
0.0ThrXaa: 0.0 ± 0.0
Val
4.161ValAla: 4.161 ± 0.716
0.416ValCys: 0.416 ± 0.57
5.41ValAsp: 5.41 ± 1.973
4.161ValGlu: 4.161 ± 0.645
2.081ValPhe: 2.081 ± 0.644
2.913ValGly: 2.913 ± 0.669
2.081ValHis: 2.081 ± 0.644
3.745ValIle: 3.745 ± 0.856
3.745ValLys: 3.745 ± 0.6
5.826ValLeu: 5.826 ± 0.806
1.248ValMet: 1.248 ± 0.985
2.913ValAsn: 2.913 ± 1.146
4.578ValPro: 4.578 ± 1.626
1.665ValGln: 1.665 ± 1.032
2.913ValArg: 2.913 ± 1.21
5.826ValSer: 5.826 ± 1.252
2.081ValThr: 2.081 ± 0.493
2.081ValVal: 2.081 ± 1.076
0.416ValTrp: 0.416 ± 0.37
1.248ValTyr: 1.248 ± 0.399
0.0ValXaa: 0.0 ± 0.0
Trp
0.832TrpAla: 0.832 ± 0.39
0.0TrpCys: 0.0 ± 0.0
0.416TrpAsp: 0.416 ± 0.37
0.416TrpGlu: 0.416 ± 0.447
0.416TrpPhe: 0.416 ± 0.354
0.832TrpGly: 0.832 ± 0.39
0.832TrpHis: 0.832 ± 0.685
0.416TrpIle: 0.416 ± 0.354
2.081TrpLys: 2.081 ± 1.008
0.832TrpLeu: 0.832 ± 0.709
0.0TrpMet: 0.0 ± 0.0
0.416TrpAsn: 0.416 ± 0.343
0.416TrpPro: 0.416 ± 0.37
0.832TrpGln: 0.832 ± 0.39
1.248TrpArg: 1.248 ± 1.106
0.0TrpSer: 0.0 ± 0.0
1.248TrpThr: 1.248 ± 0.729
0.832TrpVal: 0.832 ± 0.409
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.081TyrAla: 2.081 ± 0.788
0.832TyrCys: 0.832 ± 1.068
0.832TyrAsp: 0.832 ± 0.409
2.081TyrGlu: 2.081 ± 0.756
2.497TyrPhe: 2.497 ± 0.833
1.248TyrGly: 1.248 ± 0.468
0.0TyrHis: 0.0 ± 0.0
2.913TyrIle: 2.913 ± 1.034
3.329TyrLys: 3.329 ± 0.748
0.832TyrLeu: 0.832 ± 0.39
0.832TyrMet: 0.832 ± 0.74
1.665TyrAsn: 1.665 ± 1.013
1.248TyrPro: 1.248 ± 0.693
1.665TyrGln: 1.665 ± 0.253
2.081TyrArg: 2.081 ± 0.492
0.832TyrSer: 0.832 ± 0.737
1.248TyrThr: 1.248 ± 0.729
0.416TyrVal: 0.416 ± 0.354
0.416TyrTrp: 0.416 ± 0.343
1.248TyrTyr: 1.248 ± 0.712
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2404 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski