Amino acid dipepetide frequency for Human papillomavirus 29

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.517AlaAla: 3.517 ± 1.471
3.517AlaCys: 3.517 ± 1.128
5.08AlaAsp: 5.08 ± 2.061
3.126AlaGlu: 3.126 ± 0.43
3.908AlaPhe: 3.908 ± 0.624
3.517AlaGly: 3.517 ± 1.074
1.563AlaHis: 1.563 ± 0.523
2.735AlaIle: 2.735 ± 0.813
3.908AlaLys: 3.908 ± 1.224
4.689AlaLeu: 4.689 ± 0.897
1.172AlaMet: 1.172 ± 0.595
1.954AlaAsn: 1.954 ± 0.77
3.908AlaPro: 3.908 ± 1.117
1.954AlaGln: 1.954 ± 0.668
5.471AlaArg: 5.471 ± 1.009
2.735AlaSer: 2.735 ± 1.024
4.299AlaThr: 4.299 ± 1.335
2.735AlaVal: 2.735 ± 0.727
0.0AlaTrp: 0.0 ± 0.0
1.954AlaTyr: 1.954 ± 0.608
0.0AlaXaa: 0.0 ± 0.0
Cys
3.517CysAla: 3.517 ± 1.28
0.0CysCys: 0.0 ± 0.0
0.782CysAsp: 0.782 ± 0.616
1.563CysGlu: 1.563 ± 0.681
1.172CysPhe: 1.172 ± 0.748
0.782CysGly: 0.782 ± 0.413
0.782CysHis: 0.782 ± 0.585
1.563CysIle: 1.563 ± 0.604
1.954CysLys: 1.954 ± 0.749
1.172CysLeu: 1.172 ± 0.844
0.782CysMet: 0.782 ± 0.481
0.391CysAsn: 0.391 ± 0.458
1.954CysPro: 1.954 ± 0.63
1.954CysGln: 1.954 ± 0.701
0.782CysArg: 0.782 ± 0.481
1.172CysSer: 1.172 ± 0.533
2.345CysThr: 2.345 ± 0.806
1.954CysVal: 1.954 ± 1.252
1.563CysTrp: 1.563 ± 0.556
0.391CysTyr: 0.391 ± 0.416
0.0CysXaa: 0.0 ± 0.0
Asp
3.517AspAla: 3.517 ± 1.111
2.735AspCys: 2.735 ± 1.226
3.126AspAsp: 3.126 ± 1.66
3.908AspGlu: 3.908 ± 1.668
1.172AspPhe: 1.172 ± 0.364
3.908AspGly: 3.908 ± 1.16
0.391AspHis: 0.391 ± 0.311
3.908AspIle: 3.908 ± 1.708
1.563AspLys: 1.563 ± 0.927
3.126AspLeu: 3.126 ± 1.403
1.172AspMet: 1.172 ± 0.364
2.345AspAsn: 2.345 ± 0.789
5.471AspPro: 5.471 ± 1.564
1.954AspGln: 1.954 ± 0.852
2.735AspArg: 2.735 ± 1.084
4.689AspSer: 4.689 ± 1.415
4.689AspThr: 4.689 ± 1.236
4.299AspVal: 4.299 ± 1.136
1.172AspTrp: 1.172 ± 0.584
1.563AspTyr: 1.563 ± 1.014
0.0AspXaa: 0.0 ± 0.0
Glu
4.689GluAla: 4.689 ± 1.06
0.391GluCys: 0.391 ± 0.389
5.471GluAsp: 5.471 ± 0.86
6.252GluGlu: 6.252 ± 1.884
1.172GluPhe: 1.172 ± 0.57
4.689GluGly: 4.689 ± 1.628
0.782GluHis: 0.782 ± 0.369
1.563GluIle: 1.563 ± 1.055
1.563GluLys: 1.563 ± 0.833
4.689GluLeu: 4.689 ± 1.301
1.172GluMet: 1.172 ± 0.739
2.735GluAsn: 2.735 ± 0.813
4.689GluPro: 4.689 ± 1.351
1.563GluGln: 1.563 ± 0.691
3.126GluArg: 3.126 ± 0.588
3.126GluSer: 3.126 ± 0.549
3.908GluThr: 3.908 ± 0.889
2.735GluVal: 2.735 ± 1.052
0.782GluTrp: 0.782 ± 0.621
1.563GluTyr: 1.563 ± 0.738
0.0GluXaa: 0.0 ± 0.0
Phe
1.563PheAla: 1.563 ± 0.588
1.172PheCys: 1.172 ± 0.844
0.782PheAsp: 0.782 ± 0.458
1.563PheGlu: 1.563 ± 1.033
1.563PhePhe: 1.563 ± 0.979
1.563PheGly: 1.563 ± 0.738
0.391PheHis: 0.391 ± 0.389
1.172PheIle: 1.172 ± 0.584
1.172PheLys: 1.172 ± 0.932
5.08PheLeu: 5.08 ± 0.609
1.954PheMet: 1.954 ± 0.594
1.172PheAsn: 1.172 ± 0.694
0.782PhePro: 0.782 ± 0.429
1.954PheGln: 1.954 ± 0.8
1.954PheArg: 1.954 ± 0.577
2.345PheSer: 2.345 ± 1.226
1.954PheThr: 1.954 ± 0.965
3.126PheVal: 3.126 ± 1.321
0.782PheTrp: 0.782 ± 0.369
0.782PheTyr: 0.782 ± 0.413
0.0PheXaa: 0.0 ± 0.0
Gly
3.126GlyAla: 3.126 ± 1.12
0.782GlyCys: 0.782 ± 0.369
5.862GlyAsp: 5.862 ± 0.791
5.08GlyGlu: 5.08 ± 2.353
0.782GlyPhe: 0.782 ± 0.734
4.689GlyGly: 4.689 ± 1.854
3.126GlyHis: 3.126 ± 1.093
2.345GlyIle: 2.345 ± 0.464
3.126GlyLys: 3.126 ± 0.871
5.471GlyLeu: 5.471 ± 1.193
1.954GlyMet: 1.954 ± 0.77
1.954GlyAsn: 1.954 ± 0.664
6.643GlyPro: 6.643 ± 2.827
3.908GlyGln: 3.908 ± 0.567
3.517GlyArg: 3.517 ± 1.321
3.126GlySer: 3.126 ± 0.873
7.034GlyThr: 7.034 ± 4.162
4.689GlyVal: 4.689 ± 0.946
0.391GlyTrp: 0.391 ± 0.311
2.735GlyTyr: 2.735 ± 0.892
0.0GlyXaa: 0.0 ± 0.0
His
0.782HisAla: 0.782 ± 0.369
0.0HisCys: 0.0 ± 0.0
0.782HisAsp: 0.782 ± 0.409
1.172HisGlu: 1.172 ± 0.555
1.563HisPhe: 1.563 ± 0.86
1.172HisGly: 1.172 ± 0.584
0.391HisHis: 0.391 ± 0.389
0.782HisIle: 0.782 ± 0.369
1.563HisLys: 1.563 ± 1.07
1.172HisLeu: 1.172 ± 0.781
1.172HisMet: 1.172 ± 0.57
1.563HisAsn: 1.563 ± 0.899
1.563HisPro: 1.563 ± 0.839
1.172HisGln: 1.172 ± 0.892
1.563HisArg: 1.563 ± 0.748
1.172HisSer: 1.172 ± 0.738
1.954HisThr: 1.954 ± 0.382
1.172HisVal: 1.172 ± 1.166
0.782HisTrp: 0.782 ± 0.5
1.563HisTyr: 1.563 ± 0.631
0.0HisXaa: 0.0 ± 0.0
Ile
1.172IleAla: 1.172 ± 0.651
0.782IleCys: 0.782 ± 0.704
1.563IleAsp: 1.563 ± 0.745
2.735IleGlu: 2.735 ± 0.571
1.563IlePhe: 1.563 ± 0.746
1.954IleGly: 1.954 ± 0.544
1.172IleHis: 1.172 ± 0.892
1.172IleIle: 1.172 ± 0.734
1.563IleLys: 1.563 ± 0.64
3.517IleLeu: 3.517 ± 0.915
0.391IleMet: 0.391 ± 0.352
0.391IleAsn: 0.391 ± 0.367
4.689IlePro: 4.689 ± 1.69
0.782IleGln: 0.782 ± 0.621
1.563IleArg: 1.563 ± 0.919
2.345IleSer: 2.345 ± 0.773
3.126IleThr: 3.126 ± 1.163
2.735IleVal: 2.735 ± 0.919
1.172IleTrp: 1.172 ± 0.892
1.954IleTyr: 1.954 ± 1.036
0.0IleXaa: 0.0 ± 0.0
Lys
3.908LysAla: 3.908 ± 1.689
1.563LysCys: 1.563 ± 0.776
3.126LysAsp: 3.126 ± 0.851
1.954LysGlu: 1.954 ± 0.632
2.735LysPhe: 2.735 ± 0.668
3.126LysGly: 3.126 ± 0.864
1.172LysHis: 1.172 ± 0.643
2.735LysIle: 2.735 ± 1.146
2.345LysLys: 2.345 ± 0.827
2.735LysLeu: 2.735 ± 0.668
0.782LysMet: 0.782 ± 0.359
0.782LysAsn: 0.782 ± 0.481
2.345LysPro: 2.345 ± 1.138
1.563LysGln: 1.563 ± 0.727
6.252LysArg: 6.252 ± 0.722
2.735LysSer: 2.735 ± 1.153
1.172LysThr: 1.172 ± 0.594
3.908LysVal: 3.908 ± 1.391
0.0LysTrp: 0.0 ± 0.0
1.954LysTyr: 1.954 ± 0.758
0.0LysXaa: 0.0 ± 0.0
Leu
3.908LeuAla: 3.908 ± 0.624
2.735LeuCys: 2.735 ± 1.016
3.908LeuAsp: 3.908 ± 0.941
4.689LeuGlu: 4.689 ± 1.282
1.954LeuPhe: 1.954 ± 0.549
5.862LeuGly: 5.862 ± 0.808
1.563LeuHis: 1.563 ± 0.76
1.954LeuIle: 1.954 ± 1.252
3.908LeuLys: 3.908 ± 0.938
7.425LeuLeu: 7.425 ± 2.011
1.563LeuMet: 1.563 ± 0.435
1.954LeuAsn: 1.954 ± 0.765
3.517LeuPro: 3.517 ± 1.277
6.643LeuGln: 6.643 ± 1.16
5.08LeuArg: 5.08 ± 1.256
5.471LeuSer: 5.471 ± 1.56
4.689LeuThr: 4.689 ± 1.464
3.517LeuVal: 3.517 ± 1.051
1.563LeuTrp: 1.563 ± 0.325
5.862LeuTyr: 5.862 ± 1.718
0.0LeuXaa: 0.0 ± 0.0
Met
2.735MetAla: 2.735 ± 0.934
1.563MetCys: 1.563 ± 0.706
1.172MetAsp: 1.172 ± 0.717
0.391MetGlu: 0.391 ± 0.389
0.391MetPhe: 0.391 ± 0.352
1.563MetGly: 1.563 ± 0.64
0.782MetHis: 0.782 ± 0.531
0.0MetIle: 0.0 ± 0.0
0.0MetLys: 0.0 ± 0.0
0.782MetLeu: 0.782 ± 0.621
0.0MetMet: 0.0 ± 0.0
0.391MetAsn: 0.391 ± 0.352
0.0MetPro: 0.0 ± 0.0
0.782MetGln: 0.782 ± 0.616
0.782MetArg: 0.782 ± 0.458
3.126MetSer: 3.126 ± 1.01
1.172MetThr: 1.172 ± 0.43
2.735MetVal: 2.735 ± 1.047
0.391MetTrp: 0.391 ± 0.389
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
1.172AsnAla: 1.172 ± 0.619
0.391AsnCys: 0.391 ± 0.311
1.172AsnAsp: 1.172 ± 0.535
1.563AsnGlu: 1.563 ± 0.919
1.172AsnPhe: 1.172 ± 0.694
3.517AsnGly: 3.517 ± 0.942
0.0AsnHis: 0.0 ± 0.0
2.345AsnIle: 2.345 ± 0.903
2.345AsnLys: 2.345 ± 1.666
1.172AsnLeu: 1.172 ± 0.824
0.391AsnMet: 0.391 ± 0.352
1.172AsnAsn: 1.172 ± 0.694
2.735AsnPro: 2.735 ± 0.455
1.563AsnGln: 1.563 ± 0.528
2.345AsnArg: 2.345 ± 0.827
3.126AsnSer: 3.126 ± 0.491
1.172AsnThr: 1.172 ± 0.57
3.517AsnVal: 3.517 ± 1.016
0.391AsnTrp: 0.391 ± 0.311
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
5.08ProAla: 5.08 ± 1.851
0.782ProCys: 0.782 ± 0.621
4.689ProAsp: 4.689 ± 1.168
4.299ProGlu: 4.299 ± 0.974
1.954ProPhe: 1.954 ± 0.827
3.908ProGly: 3.908 ± 1.151
0.391ProHis: 0.391 ± 0.352
1.563ProIle: 1.563 ± 0.817
3.517ProLys: 3.517 ± 0.595
8.597ProLeu: 8.597 ± 0.984
0.782ProMet: 0.782 ± 0.429
1.172ProAsn: 1.172 ± 0.647
6.252ProPro: 6.252 ± 2.307
1.172ProGln: 1.172 ± 0.62
2.735ProArg: 2.735 ± 1.614
7.034ProSer: 7.034 ± 1.693
7.034ProThr: 7.034 ± 2.236
7.034ProVal: 7.034 ± 2.761
0.391ProTrp: 0.391 ± 0.389
2.735ProTyr: 2.735 ± 0.993
0.0ProXaa: 0.0 ± 0.0
Gln
4.299GlnAla: 4.299 ± 1.707
0.782GlnCys: 0.782 ± 0.481
1.563GlnAsp: 1.563 ± 0.669
3.517GlnGlu: 3.517 ± 1.472
1.563GlnPhe: 1.563 ± 0.738
1.172GlnGly: 1.172 ± 0.584
0.782GlnHis: 0.782 ± 0.616
1.172GlnIle: 1.172 ± 0.773
1.172GlnLys: 1.172 ± 0.364
3.517GlnLeu: 3.517 ± 0.996
0.0GlnMet: 0.0 ± 0.0
3.517GlnAsn: 3.517 ± 1.492
2.345GlnPro: 2.345 ± 0.808
11.723GlnGln: 11.723 ± 7.614
1.172GlnArg: 1.172 ± 0.57
2.735GlnSer: 2.735 ± 0.929
2.345GlnThr: 2.345 ± 0.368
4.299GlnVal: 4.299 ± 1.022
2.345GlnTrp: 2.345 ± 0.774
1.172GlnTyr: 1.172 ± 0.492
0.0GlnXaa: 0.0 ± 0.0
Arg
3.517ArgAla: 3.517 ± 0.709
2.345ArgCys: 2.345 ± 1.3
2.735ArgAsp: 2.735 ± 1.343
2.735ArgGlu: 2.735 ± 1.161
1.563ArgPhe: 1.563 ± 0.588
3.517ArgGly: 3.517 ± 0.943
2.345ArgHis: 2.345 ± 0.671
1.563ArgIle: 1.563 ± 0.699
4.299ArgLys: 4.299 ± 0.764
7.425ArgLeu: 7.425 ± 1.147
0.782ArgMet: 0.782 ± 0.45
1.172ArgAsn: 1.172 ± 0.689
3.126ArgPro: 3.126 ± 1.961
1.172ArgGln: 1.172 ± 0.932
7.034ArgArg: 7.034 ± 1.353
4.299ArgSer: 4.299 ± 1.031
5.08ArgThr: 5.08 ± 0.833
4.689ArgVal: 4.689 ± 1.987
1.563ArgTrp: 1.563 ± 0.722
2.345ArgTyr: 2.345 ± 0.725
0.0ArgXaa: 0.0 ± 0.0
Ser
5.471SerAla: 5.471 ± 1.912
1.954SerCys: 1.954 ± 0.977
4.299SerAsp: 4.299 ± 0.771
2.735SerGlu: 2.735 ± 1.484
0.782SerPhe: 0.782 ± 0.409
6.643SerGly: 6.643 ± 2.256
1.954SerHis: 1.954 ± 0.892
2.345SerIle: 2.345 ± 0.68
1.563SerLys: 1.563 ± 0.738
4.689SerLeu: 4.689 ± 0.742
1.954SerMet: 1.954 ± 0.635
3.908SerAsn: 3.908 ± 1.57
4.689SerPro: 4.689 ± 0.975
3.126SerGln: 3.126 ± 0.985
4.689SerArg: 4.689 ± 1.202
11.723SerSer: 11.723 ± 4.607
7.816SerThr: 7.816 ± 1.964
3.126SerVal: 3.126 ± 1.359
0.0SerTrp: 0.0 ± 0.0
1.954SerTyr: 1.954 ± 0.809
0.0SerXaa: 0.0 ± 0.0
Thr
1.172ThrAla: 1.172 ± 0.364
1.563ThrCys: 1.563 ± 0.859
1.563ThrAsp: 1.563 ± 0.86
3.517ThrGlu: 3.517 ± 1.092
1.954ThrPhe: 1.954 ± 1.412
8.206ThrGly: 8.206 ± 2.134
1.954ThrHis: 1.954 ± 0.815
2.345ThrIle: 2.345 ± 0.977
1.954ThrLys: 1.954 ± 0.594
6.252ThrLeu: 6.252 ± 1.212
1.172ThrMet: 1.172 ± 0.584
2.345ThrAsn: 2.345 ± 0.73
7.816ThrPro: 7.816 ± 2.455
5.471ThrGln: 5.471 ± 2.174
5.08ThrArg: 5.08 ± 1.574
8.206ThrSer: 8.206 ± 1.583
6.643ThrThr: 6.643 ± 2.193
5.862ThrVal: 5.862 ± 1.479
0.782ThrTrp: 0.782 ± 0.616
1.563ThrTyr: 1.563 ± 0.588
0.0ThrXaa: 0.0 ± 0.0
Val
3.908ValAla: 3.908 ± 1.629
1.954ValCys: 1.954 ± 1.016
6.643ValAsp: 6.643 ± 1.19
3.908ValGlu: 3.908 ± 1.035
3.908ValPhe: 3.908 ± 0.567
4.689ValGly: 4.689 ± 1.325
3.126ValHis: 3.126 ± 1.083
3.908ValIle: 3.908 ± 1.195
4.299ValLys: 4.299 ± 1.669
2.345ValLeu: 2.345 ± 0.889
0.782ValMet: 0.782 ± 0.612
0.782ValAsn: 0.782 ± 0.369
6.643ValPro: 6.643 ± 0.992
1.954ValGln: 1.954 ± 1.178
3.126ValArg: 3.126 ± 0.675
3.517ValSer: 3.517 ± 1.693
6.252ValThr: 6.252 ± 1.885
5.08ValVal: 5.08 ± 1.319
0.782ValTrp: 0.782 ± 0.518
3.126ValTyr: 3.126 ± 0.905
0.0ValXaa: 0.0 ± 0.0
Trp
1.563TrpAla: 1.563 ± 0.738
0.0TrpCys: 0.0 ± 0.0
0.782TrpAsp: 0.782 ± 0.5
0.391TrpGlu: 0.391 ± 0.389
0.391TrpPhe: 0.391 ± 0.311
1.563TrpGly: 1.563 ± 0.523
0.0TrpHis: 0.0 ± 0.0
0.782TrpIle: 0.782 ± 0.621
2.345TrpLys: 2.345 ± 1.051
0.782TrpLeu: 0.782 ± 0.369
0.0TrpMet: 0.0 ± 0.0
0.391TrpAsn: 0.391 ± 0.352
0.782TrpPro: 0.782 ± 0.734
0.391TrpGln: 0.391 ± 0.416
1.563TrpArg: 1.563 ± 0.754
1.563TrpSer: 1.563 ± 0.683
0.782TrpThr: 0.782 ± 0.567
0.782TrpVal: 0.782 ± 0.413
0.0TrpTrp: 0.0 ± 0.0
1.563TrpTyr: 1.563 ± 0.814
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.126TyrAla: 3.126 ± 0.527
1.563TyrCys: 1.563 ± 0.683
2.345TyrAsp: 2.345 ± 0.64
1.563TyrGlu: 1.563 ± 0.682
1.563TyrPhe: 1.563 ± 0.817
4.299TyrGly: 4.299 ± 0.743
0.391TyrHis: 0.391 ± 0.367
0.391TyrIle: 0.391 ± 0.367
2.735TyrLys: 2.735 ± 0.936
3.517TyrLeu: 3.517 ± 1.44
0.391TyrMet: 0.391 ± 0.416
1.172TyrAsn: 1.172 ± 0.651
1.172TyrPro: 1.172 ± 0.707
0.391TyrGln: 0.391 ± 0.311
2.735TyrArg: 2.735 ± 1.404
1.172TyrSer: 1.172 ± 0.782
1.954TyrThr: 1.954 ± 0.843
2.735TyrVal: 2.735 ± 0.646
1.563TyrTrp: 1.563 ± 0.833
1.954TyrTyr: 1.954 ± 0.914
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (2560 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski