Amino acid dipeptide frequency for Human papillomavirus KC5

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
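The calculation described above can be sketched in Python. This is only an illustration under stated assumptions: the page does not say how dipeptides are counted, so the sketch assumes overlapping pairs counted within each protein and normalised by the total number of pairs. The function names `dipeptide_permille` and `classify`, and the use of the uniform 2.5‰ value as the red/blue threshold, are hypothetical choices made for this example.

```python
from collections import Counter

# The 20 standard amino acids (one-letter codes); anything else is mapped to X (Xaa).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

# Uniform expectation: 1 of 400 dipeptides, expressed in per mille.
EXPECTED_PERMILLE = 1000.0 / 400  # 2.5


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumption: overlapping pairs are counted inside each protein only,
    so no pair spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(value_permille, expected=EXPECTED_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if value_permille > expected:
        return "over"    # would be marked in red
    if value_permille < expected:
        return "under"   # would be marked in blue
    return "as expected"
```

For example, `classify(8.887)` (GluGlu in the table) gives `"over"`, while `classify(0.423)` (AlaCys) gives `"under"`.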

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
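As a small worked example of the unit conversion, the following Python sketch converts a per mille value to a fraction and to a percentage. The helper names are hypothetical and used only for illustration.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille / 1000.0


def permille_to_percent(value_permille):
    """Convert a per mille value to a percentage (1 percent = 10 per mille)."""
    return value_permille / 10.0
```

With these helpers, the AlaAla value of 5.501‰ corresponds to a fraction of about 0.005501, and the uniform expectation of 2.5‰ corresponds to 0.25%.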

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.501 ± 1.122
AlaCys: 0.423 ± 0.519
AlaAsp: 2.962 ± 0.871
AlaGlu: 7.194 ± 1.945
AlaPhe: 3.386 ± 0.738
AlaGly: 2.962 ± 1.361
AlaHis: 1.693 ± 0.9
AlaIle: 2.539 ± 0.672
AlaLys: 1.693 ± 0.621
AlaLeu: 4.232 ± 2.019
AlaMet: 0.423 ± 0.377
AlaAsn: 2.962 ± 0.454
AlaPro: 2.116 ± 1.534
AlaGln: 1.27 ± 0.713
AlaArg: 3.386 ± 0.684
AlaSer: 5.925 ± 0.959
AlaThr: 4.232 ± 1.508
AlaVal: 2.539 ± 0.541
AlaTrp: 0.423 ± 0.393
AlaTyr: 2.116 ± 0.804
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.423 ± 0.514
CysCys: 0.846 ± 0.753
CysAsp: 0.423 ± 0.377
CysGlu: 1.27 ± 1.13
CysPhe: 2.539 ± 1.33
CysGly: 0.423 ± 0.377
CysHis: 0.423 ± 0.519
CysIle: 1.27 ± 0.762
CysLys: 2.539 ± 2.016
CysLeu: 2.962 ± 1.964
CysMet: 0.423 ± 0.385
CysAsn: 0.846 ± 0.57
CysPro: 1.27 ± 0.732
CysGln: 0.0 ± 0.0
CysArg: 1.693 ± 0.572
CysSer: 1.693 ± 0.662
CysThr: 0.846 ± 0.579
CysVal: 2.116 ± 0.809
CysTrp: 0.846 ± 0.463
CysTyr: 1.27 ± 1.279
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.232 ± 1.275
AspCys: 2.116 ± 0.86
AspAsp: 2.962 ± 1.083
AspGlu: 4.655 ± 1.816
AspPhe: 2.539 ± 1.119
AspGly: 0.846 ± 0.463
AspHis: 1.27 ± 0.732
AspIle: 5.078 ± 3.052
AspLys: 1.27 ± 0.74
AspLeu: 5.925 ± 0.769
AspMet: 1.27 ± 0.723
AspAsn: 2.962 ± 1.193
AspPro: 4.232 ± 1.603
AspGln: 0.846 ± 0.723
AspArg: 2.116 ± 0.466
AspSer: 5.501 ± 1.486
AspThr: 3.386 ± 0.686
AspVal: 4.655 ± 1.881
AspTrp: 0.423 ± 0.377
AspTyr: 3.386 ± 1.202
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.962 ± 0.941
GluCys: 1.693 ± 1.074
GluAsp: 5.078 ± 1.282
GluGlu: 8.887 ± 3.323
GluPhe: 4.655 ± 1.074
GluGly: 2.539 ± 0.986
GluHis: 0.846 ± 0.57
GluIle: 2.116 ± 0.55
GluLys: 2.962 ± 0.704
GluLeu: 6.348 ± 1.411
GluMet: 0.423 ± 0.377
GluAsn: 6.348 ± 1.517
GluPro: 2.539 ± 1.304
GluGln: 3.386 ± 1.476
GluArg: 3.386 ± 1.816
GluSer: 4.655 ± 1.61
GluThr: 5.925 ± 1.615
GluVal: 2.116 ± 0.91
GluTrp: 0.846 ± 0.753
GluTyr: 2.962 ± 0.618
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.655 ± 0.627
PheCys: 2.116 ± 1.205
PheAsp: 2.116 ± 0.738
PheGlu: 2.539 ± 1.341
PhePhe: 2.962 ± 1.018
PheGly: 1.693 ± 0.772
PheHis: 1.27 ± 1.15
PheIle: 2.116 ± 0.725
PheLys: 5.501 ± 2.677
PheLeu: 5.925 ± 2.224
PheMet: 1.27 ± 0.439
PheAsn: 2.539 ± 0.962
PhePro: 1.693 ± 0.674
PheGln: 0.846 ± 0.463
PheArg: 2.962 ± 1.066
PheSer: 2.116 ± 1.105
PheThr: 2.116 ± 0.545
PheVal: 4.232 ± 2.227
PheTrp: 1.27 ± 0.74
PheTyr: 1.693 ± 0.829
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 0.846 ± 0.492
GlyCys: 0.846 ± 0.723
GlyAsp: 4.232 ± 1.332
GlyGlu: 2.539 ± 1.209
GlyPhe: 1.693 ± 0.662
GlyGly: 4.655 ± 2.2
GlyHis: 1.693 ± 0.997
GlyIle: 4.655 ± 0.561
GlyLys: 2.539 ± 0.783
GlyLeu: 2.539 ± 0.603
GlyMet: 0.846 ± 0.671
GlyAsn: 2.962 ± 1.349
GlyPro: 3.809 ± 0.887
GlyGln: 1.693 ± 1.159
GlyArg: 2.962 ± 0.815
GlySer: 5.501 ± 1.541
GlyThr: 4.655 ± 1.47
GlyVal: 2.539 ± 1.193
GlyTrp: 0.0 ± 0.0
GlyTyr: 0.846 ± 0.505
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.846 ± 0.417
HisCys: 1.27 ± 1.034
HisAsp: 0.0 ± 0.0
HisGlu: 1.27 ± 1.05
HisPhe: 0.846 ± 0.384
HisGly: 0.423 ± 0.514
HisHis: 0.423 ± 0.519
HisIle: 1.693 ± 0.662
HisLys: 1.27 ± 0.665
HisLeu: 2.539 ± 1.482
HisMet: 0.846 ± 0.947
HisAsn: 0.846 ± 0.384
HisPro: 3.386 ± 1.596
HisGln: 0.846 ± 0.451
HisArg: 0.846 ± 0.492
HisSer: 1.693 ± 1.121
HisThr: 0.423 ± 0.377
HisVal: 0.846 ± 0.417
HisTrp: 0.846 ± 0.685
HisTyr: 0.423 ± 0.519
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.116 ± 1.359
IleCys: 1.27 ± 0.405
IleAsp: 2.539 ± 0.863
IleGlu: 5.501 ± 1.623
IlePhe: 1.27 ± 0.656
IleGly: 4.655 ± 1.586
IleHis: 0.846 ± 0.77
IleIle: 2.962 ± 1.004
IleLys: 3.809 ± 1.468
IleLeu: 6.771 ± 1.814
IleMet: 0.423 ± 0.385
IleAsn: 2.539 ± 0.904
IlePro: 3.386 ± 1.304
IleGln: 3.386 ± 0.947
IleArg: 1.693 ± 1.25
IleSer: 3.386 ± 1.09
IleThr: 3.809 ± 1.232
IleVal: 4.232 ± 1.457
IleTrp: 0.846 ± 0.57
IleTyr: 1.27 ± 0.405
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.116 ± 0.776
LysCys: 1.27 ± 0.665
LysAsp: 1.693 ± 1.34
LysGlu: 3.809 ± 1.483
LysPhe: 2.539 ± 1.524
LysGly: 2.116 ± 0.778
LysHis: 1.693 ± 0.987
LysIle: 1.27 ± 0.643
LysLys: 3.386 ± 0.8
LysLeu: 6.771 ± 2.49
LysMet: 0.846 ± 0.466
LysAsn: 3.386 ± 1.228
LysPro: 4.232 ± 1.577
LysGln: 3.809 ± 0.777
LysArg: 4.232 ± 1.489
LysSer: 3.386 ± 1.324
LysThr: 2.539 ± 1.04
LysVal: 3.386 ± 1.263
LysTrp: 0.846 ± 0.804
LysTyr: 2.962 ± 1.231
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.771 ± 1.421
LeuCys: 1.693 ± 0.915
LeuAsp: 5.925 ± 0.769
LeuGlu: 7.194 ± 1.96
LeuPhe: 5.925 ± 1.257
LeuGly: 4.655 ± 2.15
LeuHis: 2.962 ± 0.716
LeuIle: 4.655 ± 1.885
LeuLys: 4.655 ± 1.219
LeuLeu: 5.501 ± 2.038
LeuMet: 1.27 ± 0.733
LeuAsn: 3.386 ± 0.768
LeuPro: 5.078 ± 1.824
LeuGln: 5.501 ± 1.961
LeuArg: 3.386 ± 1.579
LeuSer: 5.925 ± 1.993
LeuThr: 2.962 ± 0.875
LeuVal: 5.078 ± 1.141
LeuTrp: 0.846 ± 0.723
LeuTyr: 4.232 ± 0.954
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.423 ± 0.377
MetCys: 0.0 ± 0.0
MetAsp: 0.423 ± 0.377
MetGlu: 0.423 ± 0.519
MetPhe: 2.116 ± 1.203
MetGly: 1.27 ± 0.74
MetHis: 0.0 ± 0.0
MetIle: 0.423 ± 0.377
MetLys: 0.846 ± 0.716
MetLeu: 0.423 ± 0.377
MetMet: 0.0 ± 0.0
MetAsn: 1.27 ± 0.74
MetPro: 1.693 ± 0.769
MetGln: 0.846 ± 0.451
MetArg: 0.846 ± 0.561
MetSer: 1.693 ± 0.292
MetThr: 0.423 ± 0.361
MetVal: 0.846 ± 0.753
MetTrp: 0.0 ± 0.0
MetTyr: 0.846 ± 0.463
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.809 ± 1.348
AsnCys: 1.693 ± 0.572
AsnAsp: 3.809 ± 0.765
AsnGlu: 3.386 ± 0.791
AsnPhe: 1.693 ± 1.068
AsnGly: 1.693 ± 0.997
AsnHis: 0.423 ± 0.393
AsnIle: 3.809 ± 1.869
AsnLys: 3.809 ± 1.244
AsnLeu: 2.539 ± 1.36
AsnMet: 0.846 ± 0.753
AsnAsn: 2.116 ± 0.973
AsnPro: 1.693 ± 1.068
AsnGln: 1.27 ± 0.74
AsnArg: 2.962 ± 0.752
AsnSer: 1.693 ± 0.662
AsnThr: 5.501 ± 1.324
AsnVal: 3.809 ± 1.207
AsnTrp: 1.27 ± 0.405
AsnTyr: 0.423 ± 0.514
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.078 ± 2.156
ProCys: 0.846 ± 0.463
ProAsp: 5.078 ± 1.737
ProGlu: 3.386 ± 1.629
ProPhe: 0.423 ± 0.694
ProGly: 1.693 ± 0.661
ProHis: 0.846 ± 1.389
ProIle: 3.809 ± 2.695
ProLys: 3.809 ± 0.913
ProLeu: 5.501 ± 0.812
ProMet: 0.846 ± 0.753
ProAsn: 2.539 ± 1.362
ProPro: 8.041 ± 3.165
ProGln: 2.539 ± 1.17
ProArg: 2.539 ± 1.468
ProSer: 5.078 ± 1.489
ProThr: 4.655 ± 1.633
ProVal: 3.809 ± 1.226
ProTrp: 0.0 ± 0.0
ProTyr: 2.539 ± 1.347
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.846 ± 0.703
GlnCys: 1.27 ± 1.035
GlnAsp: 1.27 ± 0.732
GlnGlu: 1.693 ± 0.651
GlnPhe: 1.27 ± 0.527
GlnGly: 2.962 ± 1.006
GlnHis: 1.27 ± 0.762
GlnIle: 2.539 ± 1.047
GlnLys: 1.27 ± 0.752
GlnLeu: 5.078 ± 1.351
GlnMet: 0.846 ± 0.463
GlnAsn: 0.846 ± 0.492
GlnPro: 1.693 ± 0.645
GlnGln: 1.693 ± 1.074
GlnArg: 2.962 ± 0.674
GlnSer: 2.539 ± 1.153
GlnThr: 1.693 ± 0.775
GlnVal: 2.962 ± 0.88
GlnTrp: 1.27 ± 0.732
GlnTyr: 1.27 ± 1.084
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.386 ± 0.647
ArgCys: 2.116 ± 1.58
ArgAsp: 1.27 ± 0.671
ArgGlu: 2.962 ± 0.815
ArgPhe: 2.539 ± 1.084
ArgGly: 2.962 ± 1.035
ArgHis: 2.116 ± 0.997
ArgIle: 1.693 ± 0.736
ArgLys: 4.655 ± 1.35
ArgLeu: 8.464 ± 1.445
ArgMet: 0.846 ± 0.723
ArgAsn: 2.116 ± 1.31
ArgPro: 2.539 ± 1.839
ArgGln: 1.27 ± 0.671
ArgArg: 4.232 ± 1.808
ArgSer: 4.232 ± 0.72
ArgThr: 2.539 ± 0.587
ArgVal: 4.232 ± 0.883
ArgTrp: 0.0 ± 0.0
ArgTyr: 1.27 ± 0.725
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.655 ± 1.625
SerCys: 0.0 ± 0.0
SerAsp: 5.501 ± 1.129
SerGlu: 4.655 ± 0.917
SerPhe: 4.655 ± 1.021
SerGly: 6.771 ± 1.235
SerHis: 1.693 ± 1.09
SerIle: 3.809 ± 1.184
SerLys: 4.655 ± 1.36
SerLeu: 5.501 ± 1.364
SerMet: 1.27 ± 1.13
SerAsn: 3.809 ± 1.902
SerPro: 4.655 ± 1.228
SerGln: 1.693 ± 0.638
SerArg: 4.232 ± 1.036
SerSer: 5.078 ± 0.978
SerThr: 4.655 ± 1.904
SerVal: 2.962 ± 0.674
SerTrp: 0.846 ± 0.753
SerTyr: 0.423 ± 0.377
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.386 ± 1.154
ThrCys: 1.693 ± 1.125
ThrAsp: 5.501 ± 1.141
ThrGlu: 2.962 ± 1.668
ThrPhe: 2.116 ± 0.466
ThrGly: 3.386 ± 1.304
ThrHis: 0.846 ± 0.753
ThrIle: 5.501 ± 3.14
ThrLys: 1.27 ± 0.771
ThrLeu: 4.232 ± 1.568
ThrMet: 0.846 ± 0.463
ThrAsn: 2.539 ± 0.586
ThrPro: 3.809 ± 1.723
ThrGln: 3.386 ± 0.719
ThrArg: 3.809 ± 1.186
ThrSer: 6.771 ± 2.937
ThrThr: 5.501 ± 2.255
ThrVal: 5.501 ± 1.264
ThrTrp: 0.423 ± 0.393
ThrTyr: 1.693 ± 0.693
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.962 ± 0.618
ValCys: 0.846 ± 0.825
ValAsp: 5.078 ± 1.052
ValGlu: 3.386 ± 1.158
ValPhe: 4.232 ± 1.757
ValGly: 3.809 ± 0.912
ValHis: 0.423 ± 0.377
ValIle: 3.809 ± 1.58
ValLys: 3.809 ± 1.69
ValLeu: 3.809 ± 1.265
ValMet: 0.846 ± 0.723
ValAsn: 1.27 ± 0.806
ValPro: 5.078 ± 1.045
ValGln: 1.27 ± 0.819
ValArg: 4.655 ± 1.635
ValSer: 3.809 ± 1.146
ValThr: 5.925 ± 1.823
ValVal: 1.693 ± 1.051
ValTrp: 0.423 ± 0.361
ValTyr: 2.116 ± 0.82
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.27 ± 0.762
TrpCys: 0.423 ± 0.377
TrpAsp: 1.693 ± 1.09
TrpGlu: 0.846 ± 0.716
TrpPhe: 0.423 ± 0.377
TrpGly: 0.423 ± 0.361
TrpHis: 0.846 ± 0.417
TrpIle: 2.116 ± 1.094
TrpLys: 0.846 ± 0.579
TrpLeu: 1.27 ± 0.74
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.423 ± 0.361
TrpGln: 0.0 ± 0.0
TrpArg: 0.846 ± 0.57
TrpSer: 0.0 ± 0.0
TrpThr: 1.27 ± 0.758
TrpVal: 0.423 ± 0.393
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.693 ± 0.638
TyrCys: 1.693 ± 1.37
TyrAsp: 2.116 ± 1.005
TyrGlu: 2.116 ± 0.871
TyrPhe: 3.809 ± 1.039
TyrGly: 2.116 ± 0.725
TyrHis: 0.423 ± 0.361
TyrIle: 0.846 ± 0.505
TyrLys: 1.693 ± 0.697
TyrLeu: 1.27 ± 0.74
TyrMet: 0.0 ± 0.0
TyrAsn: 2.962 ± 0.981
TyrPro: 1.693 ± 0.433
TyrGln: 1.693 ± 0.768
TyrArg: 1.693 ± 0.693
TyrSer: 0.846 ± 0.608
TyrThr: 2.116 ± 1.075
TyrVal: 1.27 ± 0.762
TyrTrp: 1.693 ± 0.786
TyrTyr: 2.116 ± 0.763
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2364 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
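A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration, not the database's actual procedure: it assumes that each replicate resamples whole proteins with replacement, that the dipeptide frequency is recomputed per replicate, and that the reported error is the standard deviation across replicates. The names `dipeptide_permille` and `bootstrap_error`, and the fixed seed, are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins):
    """Frequencies (per mille) of overlapping dipeptides counted within each protein."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Estimate the spread of one dipeptide frequency by resampling proteins.

    Assumption: the error is the population standard deviation of the
    frequency over the bootstrap replicates.
    """
    rng = random.Random(seed)
    values = []
    for _ in range(n_replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        values.append(dipeptide_permille(sample).get(dipeptide, 0.0))
    return statistics.pstdev(values)
```

Because whole proteins are resampled, a dipeptide concentrated in a single protein gets a large error relative to its frequency, which matters for a small proteome such as this one with only 7 proteins.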

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski