Amino acid dipepetide frequency for Human papillomavirus 44

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.329AlaAla: 6.329 ± 1.249
0.844AlaCys: 0.844 ± 0.59
4.641AlaAsp: 4.641 ± 1.439
3.797AlaGlu: 3.797 ± 0.541
3.376AlaPhe: 3.376 ± 1.713
3.376AlaGly: 3.376 ± 1.299
1.266AlaHis: 1.266 ± 0.715
4.641AlaIle: 4.641 ± 0.696
2.954AlaLys: 2.954 ± 1.06
4.641AlaLeu: 4.641 ± 1.419
1.266AlaMet: 1.266 ± 0.765
1.688AlaAsn: 1.688 ± 0.554
2.954AlaPro: 2.954 ± 1.004
2.532AlaGln: 2.532 ± 0.451
4.641AlaArg: 4.641 ± 0.986
5.907AlaSer: 5.907 ± 1.779
5.485AlaThr: 5.485 ± 1.074
3.376AlaVal: 3.376 ± 0.703
0.844AlaTrp: 0.844 ± 0.552
2.11AlaTyr: 2.11 ± 0.672
0.0AlaXaa: 0.0 ± 0.0
Cys
2.11CysAla: 2.11 ± 0.864
0.0CysCys: 0.0 ± 0.0
0.422CysAsp: 0.422 ± 0.343
0.844CysGlu: 0.844 ± 0.687
1.688CysPhe: 1.688 ± 0.776
1.266CysGly: 1.266 ± 0.756
0.844CysHis: 0.844 ± 0.984
2.11CysIle: 2.11 ± 0.764
2.954CysLys: 2.954 ± 0.476
1.688CysLeu: 1.688 ± 0.967
0.844CysMet: 0.844 ± 0.589
2.11CysAsn: 2.11 ± 0.688
2.11CysPro: 2.11 ± 0.636
1.266CysGln: 1.266 ± 0.364
0.422CysArg: 0.422 ± 0.492
2.532CysSer: 2.532 ± 0.852
1.688CysThr: 1.688 ± 0.614
2.11CysVal: 2.11 ± 1.28
1.266CysTrp: 1.266 ± 0.576
0.844CysTyr: 0.844 ± 0.589
0.0CysXaa: 0.0 ± 0.0
Asp
2.954AspAla: 2.954 ± 0.671
2.532AspCys: 2.532 ± 0.855
2.11AspAsp: 2.11 ± 0.807
2.532AspGlu: 2.532 ± 1.222
0.844AspPhe: 0.844 ± 0.687
2.11AspGly: 2.11 ± 0.859
0.422AspHis: 0.422 ± 0.343
5.485AspIle: 5.485 ± 1.993
1.688AspLys: 1.688 ± 0.554
2.532AspLeu: 2.532 ± 1.076
1.688AspMet: 1.688 ± 0.518
3.376AspAsn: 3.376 ± 0.906
4.641AspPro: 4.641 ± 1.911
1.266AspGln: 1.266 ± 0.609
2.11AspArg: 2.11 ± 1.24
2.954AspSer: 2.954 ± 0.914
5.063AspThr: 5.063 ± 0.497
5.063AspVal: 5.063 ± 1.319
0.422AspTrp: 0.422 ± 0.343
1.688AspTyr: 1.688 ± 1.019
0.0AspXaa: 0.0 ± 0.0
Glu
3.376GluAla: 3.376 ± 1.808
1.266GluCys: 1.266 ± 0.622
6.329GluAsp: 6.329 ± 2.022
4.219GluGlu: 4.219 ± 1.316
0.844GluPhe: 0.844 ± 0.359
1.266GluGly: 1.266 ± 0.665
1.266GluHis: 1.266 ± 0.346
2.11GluIle: 2.11 ± 1.061
2.532GluLys: 2.532 ± 0.886
4.219GluLeu: 4.219 ± 0.691
1.266GluMet: 1.266 ± 0.805
1.688GluAsn: 1.688 ± 0.582
2.954GluPro: 2.954 ± 0.995
3.797GluGln: 3.797 ± 1.01
0.422GluArg: 0.422 ± 0.343
1.688GluSer: 1.688 ± 0.795
5.063GluThr: 5.063 ± 1.565
3.797GluVal: 3.797 ± 1.053
0.422GluTrp: 0.422 ± 0.343
0.844GluTyr: 0.844 ± 0.434
0.0GluXaa: 0.0 ± 0.0
Phe
4.641PheAla: 4.641 ± 1.71
1.266PheCys: 1.266 ± 0.828
2.954PheAsp: 2.954 ± 0.635
1.688PheGlu: 1.688 ± 0.581
2.11PhePhe: 2.11 ± 0.732
1.688PheGly: 1.688 ± 0.572
0.422PheHis: 0.422 ± 0.492
2.11PheIle: 2.11 ± 0.895
2.954PheLys: 2.954 ± 0.923
3.376PheLeu: 3.376 ± 1.258
0.844PheMet: 0.844 ± 0.455
1.688PheAsn: 1.688 ± 1.159
1.688PhePro: 1.688 ± 0.777
1.688PheGln: 1.688 ± 0.572
1.688PheArg: 1.688 ± 1.123
1.266PheSer: 1.266 ± 0.752
0.422PheThr: 0.422 ± 0.343
1.266PheVal: 1.266 ± 0.535
1.266PheTrp: 1.266 ± 0.633
1.266PheTyr: 1.266 ± 0.752
0.0PheXaa: 0.0 ± 0.0
Gly
1.688GlyAla: 1.688 ± 1.098
1.266GlyCys: 1.266 ± 0.568
2.954GlyAsp: 2.954 ± 0.634
2.11GlyGlu: 2.11 ± 0.596
1.266GlyPhe: 1.266 ± 0.731
3.797GlyGly: 3.797 ± 1.507
2.532GlyHis: 2.532 ± 1.32
2.532GlyIle: 2.532 ± 0.967
2.532GlyLys: 2.532 ± 0.683
5.063GlyLeu: 5.063 ± 0.608
0.844GlyMet: 0.844 ± 0.434
4.219GlyAsn: 4.219 ± 0.797
2.532GlyPro: 2.532 ± 0.768
2.532GlyGln: 2.532 ± 0.345
4.219GlyArg: 4.219 ± 0.962
2.954GlySer: 2.954 ± 0.935
7.173GlyThr: 7.173 ± 1.34
2.532GlyVal: 2.532 ± 0.787
0.422GlyTrp: 0.422 ± 0.343
1.688GlyTyr: 1.688 ± 0.582
0.0GlyXaa: 0.0 ± 0.0
His
1.266HisAla: 1.266 ± 0.535
1.266HisCys: 1.266 ± 0.721
0.0HisAsp: 0.0 ± 0.0
0.422HisGlu: 0.422 ± 0.375
2.11HisPhe: 2.11 ± 0.699
1.688HisGly: 1.688 ± 0.745
0.844HisHis: 0.844 ± 0.534
2.954HisIle: 2.954 ± 1.274
1.688HisLys: 1.688 ± 0.904
1.266HisLeu: 1.266 ± 0.758
0.422HisMet: 0.422 ± 0.343
2.11HisAsn: 2.11 ± 0.699
1.688HisPro: 1.688 ± 0.773
1.266HisGln: 1.266 ± 0.44
0.844HisArg: 0.844 ± 0.486
1.688HisSer: 1.688 ± 0.581
2.11HisThr: 2.11 ± 0.636
1.266HisVal: 1.266 ± 0.682
0.844HisTrp: 0.844 ± 0.51
0.844HisTyr: 0.844 ± 0.687
0.0HisXaa: 0.0 ± 0.0
Ile
2.532IleAla: 2.532 ± 0.936
2.11IleCys: 2.11 ± 1.145
2.532IleAsp: 2.532 ± 1.319
2.532IleGlu: 2.532 ± 0.597
0.422IlePhe: 0.422 ± 0.375
3.376IleGly: 3.376 ± 1.756
1.266IleHis: 1.266 ± 0.827
2.954IleIle: 2.954 ± 1.487
2.954IleLys: 2.954 ± 1.306
5.063IleLeu: 5.063 ± 2.06
0.422IleMet: 0.422 ± 0.406
0.422IleAsn: 0.422 ± 0.375
3.797IlePro: 3.797 ± 1.622
2.954IleGln: 2.954 ± 1.203
2.532IleArg: 2.532 ± 0.451
4.641IleSer: 4.641 ± 1.552
3.797IleThr: 3.797 ± 1.057
4.641IleVal: 4.641 ± 1.967
0.0IleTrp: 0.0 ± 0.0
2.11IleTyr: 2.11 ± 1.035
0.0IleXaa: 0.0 ± 0.0
Lys
2.532LysAla: 2.532 ± 0.92
2.11LysCys: 2.11 ± 1.045
1.688LysAsp: 1.688 ± 0.761
2.11LysGlu: 2.11 ± 0.688
3.376LysPhe: 3.376 ± 1.058
2.532LysGly: 2.532 ± 1.314
2.11LysHis: 2.11 ± 0.789
0.422LysIle: 0.422 ± 0.343
2.11LysLys: 2.11 ± 0.766
2.954LysLeu: 2.954 ± 0.798
1.266LysMet: 1.266 ± 0.697
2.532LysAsn: 2.532 ± 0.906
2.11LysPro: 2.11 ± 0.851
3.797LysGln: 3.797 ± 1.643
4.219LysArg: 4.219 ± 0.808
2.532LysSer: 2.532 ± 1.007
3.797LysThr: 3.797 ± 0.929
5.485LysVal: 5.485 ± 1.416
0.844LysTrp: 0.844 ± 0.535
3.797LysTyr: 3.797 ± 1.144
0.0LysXaa: 0.0 ± 0.0
Leu
4.219LeuAla: 4.219 ± 1.367
4.641LeuCys: 4.641 ± 2.052
6.329LeuAsp: 6.329 ± 1.17
3.797LeuGlu: 3.797 ± 1.457
3.376LeuPhe: 3.376 ± 0.641
4.219LeuGly: 4.219 ± 0.991
5.485LeuHis: 5.485 ± 1.656
4.219LeuIle: 4.219 ± 1.758
2.954LeuLys: 2.954 ± 0.914
7.595LeuLeu: 7.595 ± 2.11
0.844LeuMet: 0.844 ± 0.441
2.954LeuAsn: 2.954 ± 1.125
3.376LeuPro: 3.376 ± 1.372
6.329LeuGln: 6.329 ± 2.255
2.11LeuArg: 2.11 ± 0.686
4.219LeuSer: 4.219 ± 1.132
3.376LeuThr: 3.376 ± 1.219
5.907LeuVal: 5.907 ± 1.5
0.0LeuTrp: 0.0 ± 0.0
4.219LeuTyr: 4.219 ± 1.072
0.0LeuXaa: 0.0 ± 0.0
Met
2.11MetAla: 2.11 ± 0.596
0.422MetCys: 0.422 ± 0.343
1.266MetAsp: 1.266 ± 0.346
2.11MetGlu: 2.11 ± 1.213
1.266MetPhe: 1.266 ± 0.731
0.844MetGly: 0.844 ± 0.54
0.844MetHis: 0.844 ± 0.631
0.0MetIle: 0.0 ± 0.0
0.844MetLys: 0.844 ± 0.687
0.422MetLeu: 0.422 ± 0.343
0.422MetMet: 0.422 ± 0.343
1.266MetAsn: 1.266 ± 0.633
0.0MetPro: 0.0 ± 0.0
0.844MetGln: 0.844 ± 0.883
0.844MetArg: 0.844 ± 0.359
0.844MetSer: 0.844 ± 0.434
1.266MetThr: 1.266 ± 0.364
2.532MetVal: 2.532 ± 1.053
1.266MetTrp: 1.266 ± 0.364
0.844MetTyr: 0.844 ± 0.486
0.0MetXaa: 0.0 ± 0.0
Asn
5.063AsnAla: 5.063 ± 1.539
1.266AsnCys: 1.266 ± 0.646
0.844AsnAsp: 0.844 ± 0.552
0.844AsnGlu: 0.844 ± 0.534
1.688AsnPhe: 1.688 ± 0.865
2.11AsnGly: 2.11 ± 0.859
0.422AsnHis: 0.422 ± 0.442
3.376AsnIle: 3.376 ± 1.45
3.797AsnLys: 3.797 ± 1.643
1.266AsnLeu: 1.266 ± 0.828
1.266AsnMet: 1.266 ± 0.568
2.532AsnAsn: 2.532 ± 1.764
2.532AsnPro: 2.532 ± 0.962
1.688AsnGln: 1.688 ± 0.761
1.266AsnArg: 1.266 ± 0.633
5.063AsnSer: 5.063 ± 2.144
5.063AsnThr: 5.063 ± 0.846
0.844AsnVal: 0.844 ± 0.723
0.844AsnTrp: 0.844 ± 0.687
1.266AsnTyr: 1.266 ± 0.776
0.0AsnXaa: 0.0 ± 0.0
Pro
6.329ProAla: 6.329 ± 3.264
0.422ProCys: 0.422 ± 0.442
4.219ProAsp: 4.219 ± 1.75
3.376ProGlu: 3.376 ± 1.408
2.532ProPhe: 2.532 ± 0.88
1.688ProGly: 1.688 ± 0.627
0.844ProHis: 0.844 ± 0.812
2.954ProIle: 2.954 ± 0.977
3.797ProLys: 3.797 ± 0.765
7.173ProLeu: 7.173 ± 1.982
1.266ProMet: 1.266 ± 0.924
2.532ProAsn: 2.532 ± 1.239
12.236ProPro: 12.236 ± 4.052
0.844ProGln: 0.844 ± 0.486
2.11ProArg: 2.11 ± 0.678
4.219ProSer: 4.219 ± 2.203
3.376ProThr: 3.376 ± 1.383
3.797ProVal: 3.797 ± 1.445
1.266ProTrp: 1.266 ± 0.559
2.532ProTyr: 2.532 ± 1.012
0.0ProXaa: 0.0 ± 0.0
Gln
3.797GlnAla: 3.797 ± 1.043
0.844GlnCys: 0.844 ± 0.636
3.797GlnAsp: 3.797 ± 0.927
1.266GlnGlu: 1.266 ± 0.805
2.532GlnPhe: 2.532 ± 0.856
2.11GlnGly: 2.11 ± 0.819
0.844GlnHis: 0.844 ± 0.434
2.11GlnIle: 2.11 ± 0.674
1.266GlnLys: 1.266 ± 1.085
5.485GlnLeu: 5.485 ± 2.037
1.688GlnMet: 1.688 ± 1.019
0.844GlnAsn: 0.844 ± 0.687
3.376GlnPro: 3.376 ± 0.81
1.688GlnGln: 1.688 ± 0.867
2.532GlnArg: 2.532 ± 0.895
3.376GlnSer: 3.376 ± 0.942
4.219GlnThr: 4.219 ± 0.832
2.532GlnVal: 2.532 ± 0.484
2.11GlnTrp: 2.11 ± 0.766
1.688GlnTyr: 1.688 ± 1.447
0.0GlnXaa: 0.0 ± 0.0
Arg
2.532ArgAla: 2.532 ± 0.868
1.688ArgCys: 1.688 ± 1.177
0.844ArgAsp: 0.844 ± 0.434
0.844ArgGlu: 0.844 ± 0.51
1.266ArgPhe: 1.266 ± 0.59
2.11ArgGly: 2.11 ± 0.821
2.532ArgHis: 2.532 ± 1.154
1.266ArgIle: 1.266 ± 1.126
4.641ArgLys: 4.641 ± 1.244
5.063ArgLeu: 5.063 ± 0.774
0.422ArgMet: 0.422 ± 0.343
2.11ArgAsn: 2.11 ± 1.045
5.063ArgPro: 5.063 ± 1.168
1.688ArgGln: 1.688 ± 0.939
2.954ArgArg: 2.954 ± 1.475
2.954ArgSer: 2.954 ± 0.874
1.688ArgThr: 1.688 ± 0.494
2.954ArgVal: 2.954 ± 0.874
0.422ArgTrp: 0.422 ± 0.442
1.688ArgTyr: 1.688 ± 0.902
0.0ArgXaa: 0.0 ± 0.0
Ser
3.797SerAla: 3.797 ± 1.745
1.266SerCys: 1.266 ± 0.828
2.954SerAsp: 2.954 ± 1.129
4.641SerGlu: 4.641 ± 1.374
1.688SerPhe: 1.688 ± 0.822
5.907SerGly: 5.907 ± 1.917
2.11SerHis: 2.11 ± 0.734
4.219SerIle: 4.219 ± 1.291
2.11SerLys: 2.11 ± 0.893
5.485SerLeu: 5.485 ± 1.333
0.844SerMet: 0.844 ± 0.416
3.797SerAsn: 3.797 ± 1.547
3.797SerPro: 3.797 ± 0.718
2.11SerGln: 2.11 ± 0.464
3.797SerArg: 3.797 ± 0.77
12.236SerSer: 12.236 ± 2.503
7.173SerThr: 7.173 ± 3.23
4.641SerVal: 4.641 ± 1.573
0.0SerTrp: 0.0 ± 0.0
2.11SerTyr: 2.11 ± 0.624
0.0SerXaa: 0.0 ± 0.0
Thr
4.641ThrAla: 4.641 ± 0.936
3.376ThrCys: 3.376 ± 1.087
2.532ThrAsp: 2.532 ± 0.467
2.532ThrGlu: 2.532 ± 1.154
1.266ThrPhe: 1.266 ± 0.715
6.329ThrGly: 6.329 ± 1.616
1.266ThrHis: 1.266 ± 0.665
3.797ThrIle: 3.797 ± 0.723
2.11ThrLys: 2.11 ± 0.908
7.595ThrLeu: 7.595 ± 2.57
1.266ThrMet: 1.266 ± 0.514
3.797ThrAsn: 3.797 ± 1.467
7.173ThrPro: 7.173 ± 2.513
4.641ThrGln: 4.641 ± 1.773
2.532ThrArg: 2.532 ± 0.906
6.751ThrSer: 6.751 ± 1.39
10.127ThrThr: 10.127 ± 2.214
7.173ThrVal: 7.173 ± 1.157
1.266ThrTrp: 1.266 ± 0.667
2.954ThrTyr: 2.954 ± 1.087
0.0ThrXaa: 0.0 ± 0.0
Val
3.376ValAla: 3.376 ± 0.862
1.688ValCys: 1.688 ± 1.45
3.797ValAsp: 3.797 ± 1.331
7.595ValGlu: 7.595 ± 2.334
2.532ValPhe: 2.532 ± 1.1
4.219ValGly: 4.219 ± 2.095
0.844ValHis: 0.844 ± 0.434
2.532ValIle: 2.532 ± 0.822
2.954ValLys: 2.954 ± 1.06
3.797ValLeu: 3.797 ± 1.342
0.844ValMet: 0.844 ± 0.366
2.11ValAsn: 2.11 ± 0.845
3.797ValPro: 3.797 ± 1.274
5.907ValGln: 5.907 ± 0.816
2.954ValArg: 2.954 ± 0.497
7.173ValSer: 7.173 ± 1.837
8.017ValThr: 8.017 ± 1.365
2.954ValVal: 2.954 ± 0.81
0.844ValTrp: 0.844 ± 0.51
1.688ValTyr: 1.688 ± 1.01
0.0ValXaa: 0.0 ± 0.0
Trp
0.844TrpAla: 0.844 ± 0.359
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.844TrpGlu: 0.844 ± 0.51
0.844TrpPhe: 0.844 ± 0.687
2.11TrpGly: 2.11 ± 0.732
0.0TrpHis: 0.0 ± 0.0
0.844TrpIle: 0.844 ± 0.687
2.11TrpLys: 2.11 ± 0.953
2.11TrpLeu: 2.11 ± 0.766
0.0TrpMet: 0.0 ± 0.0
0.422TrpAsn: 0.422 ± 0.362
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.844TrpArg: 0.844 ± 0.51
0.0TrpSer: 0.0 ± 0.0
2.11TrpThr: 2.11 ± 1.113
1.266TrpVal: 1.266 ± 0.646
0.422TrpTrp: 0.422 ± 0.442
0.422TrpTyr: 0.422 ± 0.375
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.954TyrAla: 2.954 ± 1.031
0.844TyrCys: 0.844 ± 0.883
1.266TyrAsp: 1.266 ± 0.602
1.266TyrGlu: 1.266 ± 0.906
1.266TyrPhe: 1.266 ± 0.731
2.11TyrGly: 2.11 ± 0.715
0.422TyrHis: 0.422 ± 0.362
0.422TyrIle: 0.422 ± 0.375
3.376TyrLys: 3.376 ± 1.095
3.376TyrLeu: 3.376 ± 0.946
2.11TyrMet: 2.11 ± 0.624
0.844TyrAsn: 0.844 ± 0.54
1.688TyrPro: 1.688 ± 0.802
1.266TyrGln: 1.266 ± 0.827
1.688TyrArg: 1.688 ± 0.865
1.688TyrSer: 1.688 ± 0.795
2.11TyrThr: 2.11 ± 0.511
5.485TyrVal: 5.485 ± 1.473
0.422TyrTrp: 0.422 ± 0.343
1.266TyrTyr: 1.266 ± 0.882
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2371 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski