Amino acid dipeptide frequency for Human papillomavirus 98

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred randomly and uniformly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
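The calculation described above can be sketched in Python. This is a minimal illustration, not the database's own pipeline: it assumes one-letter amino acid codes, counts overlapping pairs within each protein, and the function names and toy sequences are hypothetical.

```python
from collections import Counter

# Uniform expectation: 1 of 400 standard dipeptides, in per mille (‰).
EXPECTED_PER_MILLE = 1000 / 400  # 2.5 ‰, i.e. 0.25 %


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in ‰} for a list of protein sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; a pair never spans two proteins
        # (an assumption of this sketch).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {dp: 1000 * n / total for dp, n in counts.items()}


def classify(freq_per_mille):
    """Label a frequency relative to the uniform 2.5 ‰ expectation."""
    if freq_per_mille > EXPECTED_PER_MILLE:
        return "over"
    if freq_per_mille < EXPECTED_PER_MILLE:
        return "under"
    return "expected"


if __name__ == "__main__":
    toy = ["AAAA", "AC"]  # hypothetical sequences, not HPV98 proteins
    print(dipeptide_per_mille(toy))
    print(classify(3.034), classify(0.758))
```

With this labelling, a table value such as 3.034‰ would count as overrepresented and 0.758‰ as underrepresented.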

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.034 ± 1.396
AlaCys: 0.758 ± 0.602
AlaAsp: 4.171 ± 0.531
AlaGlu: 3.413 ± 1.116
AlaPhe: 3.792 ± 1.33
AlaGly: 3.413 ± 1.421
AlaHis: 0.758 ± 0.414
AlaIle: 0.758 ± 0.409
AlaLys: 2.655 ± 1.151
AlaLeu: 2.655 ± 1.171
AlaMet: 1.517 ± 0.847
AlaAsn: 2.275 ± 1.366
AlaPro: 4.93 ± 0.881
AlaGln: 3.792 ± 1.51
AlaArg: 2.655 ± 0.644
AlaSer: 3.413 ± 0.902
AlaThr: 6.826 ± 1.941
AlaVal: 3.034 ± 1.013
AlaTrp: 0.379 ± 0.299
AlaTyr: 1.517 ± 0.752
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.138 ± 0.662
CysCys: 1.138 ± 0.828
CysAsp: 0.758 ± 0.409
CysGlu: 0.379 ± 0.494
CysPhe: 1.138 ± 0.434
CysGly: 1.896 ± 1.416
CysHis: 0.379 ± 0.422
CysIle: 1.896 ± 1.232
CysLys: 2.275 ± 0.626
CysLeu: 2.275 ± 1.166
CysMet: 0.0 ± 0.0
CysAsn: 0.758 ± 0.458
CysPro: 1.517 ± 0.798
CysGln: 0.0 ± 0.0
CysArg: 1.517 ± 0.958
CysSer: 0.379 ± 0.299
CysThr: 0.0 ± 0.0
CysVal: 0.379 ± 0.422
CysTrp: 0.758 ± 0.376
CysTyr: 0.379 ± 0.35
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.551 ± 1.333
AspCys: 1.517 ± 0.774
AspAsp: 4.551 ± 0.761
AspGlu: 2.655 ± 0.742
AspPhe: 3.413 ± 0.747
AspGly: 4.171 ± 1.071
AspHis: 1.138 ± 0.662
AspIle: 5.688 ± 1.456
AspLys: 2.275 ± 0.679
AspLeu: 6.447 ± 2.064
AspMet: 0.758 ± 0.699
AspAsn: 3.792 ± 0.85
AspPro: 4.93 ± 1.798
AspGln: 2.655 ± 1.033
AspArg: 1.138 ± 0.668
AspSer: 2.275 ± 0.554
AspThr: 3.034 ± 0.93
AspVal: 4.93 ± 1.674
AspTrp: 0.758 ± 0.409
AspTyr: 0.758 ± 0.376
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.171 ± 1.389
GluCys: 1.138 ± 0.898
GluAsp: 4.551 ± 1.707
GluGlu: 5.688 ± 1.69
GluPhe: 1.138 ± 0.649
GluGly: 4.551 ± 2.611
GluHis: 0.758 ± 0.455
GluIle: 2.655 ± 0.73
GluLys: 1.517 ± 0.818
GluLeu: 6.447 ± 1.907
GluMet: 0.758 ± 0.599
GluAsn: 3.413 ± 1.178
GluPro: 4.171 ± 1.473
GluGln: 4.93 ± 1.087
GluArg: 2.655 ± 1.123
GluSer: 4.551 ± 1.821
GluThr: 3.413 ± 0.608
GluVal: 4.551 ± 1.556
GluTrp: 0.758 ± 0.376
GluTyr: 1.138 ± 1.049
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.655 ± 0.702
PheCys: 0.379 ± 0.422
PheAsp: 3.034 ± 0.911
PheGlu: 3.034 ± 1.496
PhePhe: 1.896 ± 0.897
PheGly: 0.758 ± 0.414
PheHis: 0.758 ± 0.537
PheIle: 1.896 ± 0.657
PheLys: 2.275 ± 0.956
PheLeu: 3.413 ± 0.65
PheMet: 0.0 ± 0.0
PheAsn: 0.758 ± 0.699
PhePro: 1.896 ± 0.83
PheGln: 1.138 ± 0.583
PheArg: 3.413 ± 0.607
PheSer: 2.275 ± 0.639
PheThr: 2.275 ± 0.701
PheVal: 2.275 ± 0.748
PheTrp: 1.517 ± 0.752
PheTyr: 1.517 ± 0.811
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.413 ± 0.958
GlyCys: 1.138 ± 0.837
GlyAsp: 6.068 ± 1.296
GlyGlu: 3.413 ± 1.057
GlyPhe: 0.758 ± 0.455
GlyGly: 4.93 ± 2.14
GlyHis: 3.034 ± 1.17
GlyIle: 1.517 ± 0.776
GlyLys: 3.792 ± 1.112
GlyLeu: 2.655 ± 0.381
GlyMet: 0.0 ± 0.0
GlyAsn: 1.896 ± 0.897
GlyPro: 2.275 ± 0.81
GlyGln: 4.171 ± 1.171
GlyArg: 9.101 ± 3.121
GlySer: 7.205 ± 1.745
GlyThr: 5.688 ± 1.525
GlyVal: 3.792 ± 0.818
GlyTrp: 0.758 ± 0.844
GlyTyr: 2.275 ± 0.868
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.379 ± 0.35
HisCys: 0.758 ± 0.626
HisAsp: 0.379 ± 0.357
HisGlu: 0.379 ± 0.299
HisPhe: 1.896 ± 0.444
HisGly: 1.517 ± 1.075
HisHis: 0.758 ± 0.602
HisIle: 0.0 ± 0.0
HisLys: 1.138 ± 0.631
HisLeu: 0.758 ± 0.599
HisMet: 0.0 ± 0.0
HisAsn: 1.517 ± 0.865
HisPro: 2.275 ± 1.033
HisGln: 0.758 ± 0.474
HisArg: 0.758 ± 0.508
HisSer: 1.517 ± 0.794
HisThr: 1.517 ± 0.595
HisVal: 1.138 ± 0.372
HisTrp: 1.138 ± 0.434
HisTyr: 0.758 ± 0.409
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.896 ± 1.022
IleCys: 1.138 ± 0.622
IleAsp: 2.275 ± 1.358
IleGlu: 3.413 ± 1.341
IlePhe: 0.758 ± 0.591
IleGly: 4.551 ± 1.416
IleHis: 1.138 ± 0.713
IleIle: 3.413 ± 1.942
IleLys: 0.758 ± 0.376
IleLeu: 3.034 ± 0.908
IleMet: 1.138 ± 0.911
IleAsn: 2.275 ± 0.549
IlePro: 2.655 ± 1.034
IleGln: 2.655 ± 1.45
IleArg: 3.034 ± 0.861
IleSer: 4.171 ± 1.247
IleThr: 1.517 ± 0.446
IleVal: 3.413 ± 0.787
IleTrp: 1.138 ± 0.583
IleTyr: 2.275 ± 0.579
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.034 ± 0.967
LysCys: 0.0 ± 0.0
LysAsp: 1.517 ± 0.847
LysGlu: 4.171 ± 1.425
LysPhe: 1.896 ± 0.934
LysGly: 3.792 ± 1.169
LysHis: 0.758 ± 0.409
LysIle: 1.138 ± 0.535
LysLys: 1.896 ± 0.634
LysLeu: 4.551 ± 1.595
LysMet: 0.379 ± 0.35
LysAsn: 3.413 ± 0.828
LysPro: 1.896 ± 1.318
LysGln: 2.655 ± 0.524
LysArg: 3.792 ± 0.297
LysSer: 1.896 ± 0.83
LysThr: 2.275 ± 1.03
LysVal: 3.034 ± 1.96
LysTrp: 0.758 ± 0.409
LysTyr: 2.275 ± 0.549
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.792 ± 0.99
LeuCys: 1.896 ± 0.733
LeuAsp: 5.309 ± 1.353
LeuGlu: 6.826 ± 1.192
LeuPhe: 3.792 ± 1.128
LeuGly: 6.068 ± 1.718
LeuHis: 2.655 ± 1.158
LeuIle: 3.792 ± 1.312
LeuLys: 2.655 ± 0.836
LeuLeu: 10.618 ± 2.171
LeuMet: 2.655 ± 0.718
LeuAsn: 1.138 ± 0.866
LeuPro: 3.792 ± 1.709
LeuGln: 6.826 ± 1.498
LeuArg: 3.792 ± 1.776
LeuSer: 6.447 ± 2.037
LeuThr: 4.551 ± 0.849
LeuVal: 6.068 ± 0.755
LeuTrp: 0.379 ± 0.299
LeuTyr: 1.896 ± 1.082
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.138 ± 0.468
MetCys: 0.379 ± 0.436
MetAsp: 0.758 ± 0.409
MetGlu: 1.138 ± 0.383
MetPhe: 0.379 ± 0.35
MetGly: 0.0 ± 0.0
MetHis: 0.0 ± 0.0
MetIle: 0.379 ± 0.494
MetLys: 0.379 ± 0.299
MetLeu: 0.379 ± 0.299
MetMet: 0.758 ± 0.871
MetAsn: 0.379 ± 0.35
MetPro: 0.758 ± 0.871
MetGln: 0.758 ± 0.376
MetArg: 0.758 ± 0.599
MetSer: 2.655 ± 1.423
MetThr: 0.758 ± 0.478
MetVal: 1.896 ± 0.552
MetTrp: 0.758 ± 0.478
MetTyr: 1.138 ± 0.468
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.413 ± 1.281
AsnCys: 0.379 ± 0.299
AsnAsp: 3.413 ± 1.669
AsnGlu: 1.138 ± 0.662
AsnPhe: 1.517 ± 0.688
AsnGly: 2.655 ± 1.198
AsnHis: 0.758 ± 0.68
AsnIle: 2.655 ± 0.96
AsnLys: 3.034 ± 0.396
AsnLeu: 1.896 ± 0.627
AsnMet: 0.379 ± 0.357
AsnAsn: 1.896 ± 0.973
AsnPro: 3.792 ± 1.329
AsnGln: 1.517 ± 0.688
AsnArg: 2.655 ± 0.816
AsnSer: 3.034 ± 0.998
AsnThr: 2.655 ± 0.813
AsnVal: 1.896 ± 0.945
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.379 ± 0.299
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.171 ± 1.81
ProCys: 1.896 ± 0.723
ProAsp: 4.551 ± 1.24
ProGlu: 5.309 ± 1.652
ProPhe: 1.517 ± 0.679
ProGly: 3.792 ± 1.429
ProHis: 0.758 ± 0.871
ProIle: 2.655 ± 1.69
ProLys: 3.413 ± 0.723
ProLeu: 7.205 ± 1.909
ProMet: 1.138 ± 0.898
ProAsn: 1.896 ± 0.877
ProPro: 9.101 ± 4.551
ProGln: 3.034 ± 2.329
ProArg: 2.655 ± 1.121
ProSer: 3.792 ± 0.689
ProThr: 3.034 ± 1.168
ProVal: 4.551 ± 1.5
ProTrp: 0.0 ± 0.0
ProTyr: 0.758 ± 0.699
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.413 ± 0.604
GlnCys: 1.138 ± 0.622
GlnAsp: 4.551 ± 1.581
GlnGlu: 3.034 ± 1.037
GlnPhe: 0.379 ± 0.299
GlnGly: 3.034 ± 0.629
GlnHis: 1.517 ± 0.513
GlnIle: 4.551 ± 0.84
GlnLys: 1.517 ± 0.818
GlnLeu: 4.551 ± 1.017
GlnMet: 1.517 ± 0.685
GlnAsn: 1.517 ± 0.674
GlnPro: 3.034 ± 1.276
GlnGln: 3.413 ± 2.079
GlnArg: 3.034 ± 1.43
GlnSer: 2.655 ± 0.893
GlnThr: 4.171 ± 0.915
GlnVal: 1.896 ± 1.068
GlnTrp: 1.138 ± 0.69
GlnTyr: 2.275 ± 0.483
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.792 ± 0.772
ArgCys: 2.275 ± 0.778
ArgAsp: 2.275 ± 1.297
ArgGlu: 2.655 ± 0.729
ArgPhe: 2.655 ± 0.644
ArgGly: 4.551 ± 1.931
ArgHis: 1.138 ± 0.737
ArgIle: 0.758 ± 0.586
ArgLys: 3.413 ± 1.187
ArgLeu: 7.584 ± 1.024
ArgMet: 0.379 ± 0.34
ArgAsn: 2.275 ± 0.787
ArgPro: 2.275 ± 0.895
ArgGln: 3.413 ± 1.277
ArgArg: 9.86 ± 4.678
ArgSer: 12.514 ± 4.667
ArgThr: 3.413 ± 1.21
ArgVal: 3.034 ± 0.814
ArgTrp: 0.0 ± 0.0
ArgTyr: 3.034 ± 0.777
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.792 ± 1.303
SerCys: 0.758 ± 0.508
SerAsp: 6.068 ± 1.183
SerGlu: 3.413 ± 0.864
SerPhe: 3.792 ± 1.062
SerGly: 7.964 ± 1.611
SerHis: 0.758 ± 0.409
SerIle: 3.792 ± 0.771
SerLys: 3.413 ± 0.575
SerLeu: 7.584 ± 1.679
SerMet: 1.517 ± 0.708
SerAsn: 2.275 ± 1.796
SerPro: 4.551 ± 1.632
SerGln: 2.655 ± 1.035
SerArg: 9.101 ± 4.173
SerSer: 9.48 ± 2.766
SerThr: 7.205 ± 1.469
SerVal: 3.413 ± 1.515
SerTrp: 1.138 ± 0.631
SerTyr: 2.275 ± 1.115
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.138 ± 0.566
ThrCys: 1.517 ± 0.624
ThrAsp: 2.655 ± 0.875
ThrGlu: 4.93 ± 1.255
ThrPhe: 3.413 ± 1.216
ThrGly: 3.413 ± 1.272
ThrHis: 1.138 ± 0.798
ThrIle: 2.275 ± 1.125
ThrLys: 2.655 ± 0.902
ThrLeu: 3.413 ± 0.881
ThrMet: 1.138 ± 0.631
ThrAsn: 2.655 ± 0.889
ThrPro: 4.93 ± 1.91
ThrGln: 3.034 ± 1.626
ThrArg: 3.792 ± 1.124
ThrSer: 7.205 ± 2.913
ThrThr: 3.413 ± 1.116
ThrVal: 6.068 ± 1.411
ThrTrp: 0.379 ± 0.436
ThrTyr: 2.275 ± 0.561
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.551 ± 1.051
ValCys: 0.0 ± 0.0
ValAsp: 2.655 ± 0.873
ValGlu: 5.688 ± 1.002
ValPhe: 2.275 ± 0.788
ValGly: 3.034 ± 1.031
ValHis: 0.379 ± 0.357
ValIle: 4.171 ± 1.368
ValLys: 0.758 ± 0.376
ValLeu: 4.93 ± 0.983
ValMet: 0.0 ± 0.0
ValAsn: 3.413 ± 0.752
ValPro: 4.551 ± 1.342
ValGln: 2.655 ± 0.963
ValArg: 5.688 ± 1.814
ValSer: 6.826 ± 1.642
ValThr: 3.792 ± 1.227
ValVal: 3.413 ± 1.094
ValTrp: 0.758 ± 0.699
ValTyr: 3.413 ± 1.11
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.758 ± 0.376
TrpCys: 0.379 ± 0.299
TrpAsp: 0.379 ± 0.35
TrpGlu: 1.138 ± 0.659
TrpPhe: 0.0 ± 0.0
TrpGly: 0.379 ± 0.35
TrpHis: 0.0 ± 0.0
TrpIle: 0.758 ± 0.599
TrpLys: 1.517 ± 0.911
TrpLeu: 1.517 ± 0.752
TrpMet: 0.379 ± 0.436
TrpAsn: 0.379 ± 0.35
TrpPro: 0.0 ± 0.0
TrpGln: 1.138 ± 0.724
TrpArg: 0.379 ± 0.436
TrpSer: 0.758 ± 0.409
TrpThr: 0.758 ± 0.409
TrpVal: 1.517 ± 0.899
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.758 ± 0.474
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.517 ± 0.674
TyrCys: 0.379 ± 0.299
TyrAsp: 1.517 ± 0.287
TyrGlu: 1.138 ± 0.434
TyrPhe: 0.758 ± 0.508
TyrGly: 3.413 ± 0.686
TyrHis: 0.758 ± 0.405
TyrIle: 1.896 ± 1.018
TyrLys: 3.792 ± 1.261
TyrLeu: 3.034 ± 1.05
TyrMet: 0.758 ± 0.409
TyrAsn: 1.138 ± 1.049
TyrPro: 1.896 ± 0.947
TyrGln: 1.138 ± 0.434
TyrArg: 1.517 ± 0.811
TyrSer: 2.275 ± 0.957
TyrThr: 1.138 ± 0.841
TyrVal: 2.655 ± 1.363
TyrTrp: 0.379 ± 0.436
TyrTyr: 3.034 ± 1.071
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2638 amino acids)
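Each table entry follows the pattern "pair: value ± error", with three-letter residue codes. As a minimal sketch for working with these entries programmatically, the hypothetical helper below extracts the two residues, the frequency and the error from one line. It uses a regular-expression search, so any text preceding the pair name is ignored.

```python
import re

# Two three-letter residue codes, a colon, then "value ± error".
ENTRY = re.compile(
    r"([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*([\d.]+)\s*±\s*([\d.]+)"
)


def parse_entry(line):
    """Return (first, second, per_mille, error), or None if no entry is found."""
    match = ENTRY.search(line)
    if match is None:
        return None  # e.g. a row label such as "Ala"
    first, second, value, error = match.groups()
    return first, second, float(value), float(error)


if __name__ == "__main__":
    print(parse_entry("ArgSer: 12.514 ± 4.667"))
    print(parse_entry("Ala"))
```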

Note: The error has been estimated by bootstrapping (x100) at the protein level.
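A protein-level bootstrap can be sketched as follows. This is one plausible reading of the note, not a confirmed description of the database's procedure: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the standard deviation across resamples is taken as the error. The function name, the one-letter sequence encoding and the default of 100 resamples are assumptions of this sketch.

```python
import random
from collections import Counter
from statistics import stdev


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Estimate the error (in ‰) of one dipeptide frequency by resampling proteins."""
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        counts = Counter()
        for seq in sample:
            counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
        total = sum(counts.values())
        estimates.append(1000 * counts[dipeptide] / total if total else 0.0)
    # Spread of the resampled frequencies serves as the error estimate.
    return stdev(estimates)


if __name__ == "__main__":
    toy = ["AAAA", "ACAC", "CCCC"]  # hypothetical sequences
    print(bootstrap_error(toy, "AA"))
```

With only a few proteins, as here, the resampled sets differ strongly from one another, which is consistent with the fairly large errors relative to the values in the table.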

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski