Amino acid dipepetide frequency for Bos taurus papillomavirus 17

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.43AlaAla: 0.43 ± 0.351
0.43AlaCys: 0.43 ± 0.513
2.149AlaAsp: 2.149 ± 0.327
5.587AlaGlu: 5.587 ± 1.784
2.149AlaPhe: 2.149 ± 1.172
1.289AlaGly: 1.289 ± 0.701
0.43AlaHis: 0.43 ± 0.411
1.719AlaIle: 1.719 ± 0.214
5.587AlaLys: 5.587 ± 1.378
3.008AlaLeu: 3.008 ± 1.152
1.719AlaMet: 1.719 ± 0.536
0.859AlaAsn: 0.859 ± 0.435
3.868AlaPro: 3.868 ± 0.867
3.008AlaGln: 3.008 ± 1.074
3.008AlaArg: 3.008 ± 0.684
4.727AlaSer: 4.727 ± 0.747
4.297AlaThr: 4.297 ± 1.479
3.438AlaVal: 3.438 ± 1.371
0.859AlaTrp: 0.859 ± 0.379
1.719AlaTyr: 1.719 ± 0.786
0.0AlaXaa: 0.0 ± 0.0
Cys
1.289CysAla: 1.289 ± 0.853
1.289CysCys: 1.289 ± 0.853
0.43CysAsp: 0.43 ± 0.337
1.719CysGlu: 1.719 ± 0.781
1.719CysPhe: 1.719 ± 0.755
0.859CysGly: 0.859 ± 1.072
0.0CysHis: 0.0 ± 0.0
1.289CysIle: 1.289 ± 0.611
2.149CysLys: 2.149 ± 1.178
2.578CysLeu: 2.578 ± 1.606
1.289CysMet: 1.289 ± 1.538
0.43CysAsn: 0.43 ± 0.536
1.719CysPro: 1.719 ± 1.022
0.859CysGln: 0.859 ± 0.748
0.43CysArg: 0.43 ± 0.374
2.149CysSer: 2.149 ± 1.018
1.289CysThr: 1.289 ± 0.612
1.289CysVal: 1.289 ± 0.806
0.43CysTrp: 0.43 ± 0.411
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.149AspAla: 2.149 ± 0.552
2.149AspCys: 2.149 ± 1.07
3.868AspAsp: 3.868 ± 1.354
3.438AspGlu: 3.438 ± 0.908
3.438AspPhe: 3.438 ± 1.078
3.008AspGly: 3.008 ± 0.684
0.0AspHis: 0.0 ± 0.0
4.297AspIle: 4.297 ± 1.013
3.008AspLys: 3.008 ± 1.087
6.446AspLeu: 6.446 ± 1.932
0.43AspMet: 0.43 ± 0.351
3.868AspAsn: 3.868 ± 0.666
2.578AspPro: 2.578 ± 0.821
1.719AspGln: 1.719 ± 0.606
2.578AspArg: 2.578 ± 1.183
2.578AspSer: 2.578 ± 0.858
2.578AspThr: 2.578 ± 0.778
3.438AspVal: 3.438 ± 1.223
1.719AspTrp: 1.719 ± 0.912
1.289AspTyr: 1.289 ± 0.389
0.0AspXaa: 0.0 ± 0.0
Glu
3.868GluAla: 3.868 ± 1.447
1.719GluCys: 1.719 ± 1.347
6.446GluAsp: 6.446 ± 1.761
5.157GluGlu: 5.157 ± 1.629
3.438GluPhe: 3.438 ± 1.044
3.008GluGly: 3.008 ± 0.487
1.289GluHis: 1.289 ± 0.388
2.149GluIle: 2.149 ± 1.359
1.289GluLys: 1.289 ± 1.056
3.868GluLeu: 3.868 ± 1.1
1.289GluMet: 1.289 ± 1.01
3.868GluAsn: 3.868 ± 1.477
2.149GluPro: 2.149 ± 0.822
4.727GluGln: 4.727 ± 0.546
1.719GluArg: 1.719 ± 0.536
3.438GluSer: 3.438 ± 1.044
6.016GluThr: 6.016 ± 1.056
6.876GluVal: 6.876 ± 1.7
0.0GluTrp: 0.0 ± 0.0
1.289GluTyr: 1.289 ± 1.058
0.0GluXaa: 0.0 ± 0.0
Phe
2.578PheAla: 2.578 ± 0.776
1.289PheCys: 1.289 ± 1.085
3.868PheAsp: 3.868 ± 0.753
2.578PheGlu: 2.578 ± 0.761
3.008PhePhe: 3.008 ± 1.029
1.719PheGly: 1.719 ± 0.909
0.43PheHis: 0.43 ± 0.536
2.149PheIle: 2.149 ± 0.648
2.578PheLys: 2.578 ± 0.918
5.157PheLeu: 5.157 ± 0.983
0.0PheMet: 0.0 ± 0.0
3.868PheAsn: 3.868 ± 0.949
2.149PhePro: 2.149 ± 0.807
2.578PheGln: 2.578 ± 0.416
2.149PheArg: 2.149 ± 0.648
5.157PheSer: 5.157 ± 1.389
3.438PheThr: 3.438 ± 0.964
3.008PheVal: 3.008 ± 1.265
1.719PheTrp: 1.719 ± 0.912
2.578PheTyr: 2.578 ± 0.451
0.0PheXaa: 0.0 ± 0.0
Gly
2.149GlyAla: 2.149 ± 1.381
3.008GlyCys: 3.008 ± 1.316
1.289GlyAsp: 1.289 ± 0.388
3.868GlyGlu: 3.868 ± 1.187
2.578GlyPhe: 2.578 ± 0.523
3.868GlyGly: 3.868 ± 1.247
1.289GlyHis: 1.289 ± 0.611
3.438GlyIle: 3.438 ± 0.738
3.868GlyLys: 3.868 ± 1.257
3.008GlyLeu: 3.008 ± 0.487
0.43GlyMet: 0.43 ± 0.552
6.446GlyAsn: 6.446 ± 1.907
3.438GlyPro: 3.438 ± 1.102
2.578GlyGln: 2.578 ± 0.919
3.868GlyArg: 3.868 ± 1.705
5.587GlySer: 5.587 ± 1.302
3.868GlyThr: 3.868 ± 1.588
3.438GlyVal: 3.438 ± 1.606
0.0GlyTrp: 0.0 ± 0.0
1.719GlyTyr: 1.719 ± 0.6
0.0GlyXaa: 0.0 ± 0.0
His
0.43HisAla: 0.43 ± 0.374
0.0HisCys: 0.0 ± 0.0
0.43HisAsp: 0.43 ± 0.337
0.43HisGlu: 0.43 ± 0.374
1.719HisPhe: 1.719 ± 0.957
1.719HisGly: 1.719 ± 0.214
0.0HisHis: 0.0 ± 0.0
0.0HisIle: 0.0 ± 0.0
2.149HisLys: 2.149 ± 1.068
1.289HisLeu: 1.289 ± 0.674
0.0HisMet: 0.0 ± 0.0
1.289HisAsn: 1.289 ± 0.389
2.149HisPro: 2.149 ± 0.726
0.859HisGln: 0.859 ± 0.659
1.719HisArg: 1.719 ± 1.164
1.719HisSer: 1.719 ± 0.214
0.43HisThr: 0.43 ± 0.351
1.289HisVal: 1.289 ± 0.776
1.289HisTrp: 1.289 ± 0.388
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
4.297IleAla: 4.297 ± 1.453
0.859IleCys: 0.859 ± 0.617
2.149IleAsp: 2.149 ± 0.903
3.008IleGlu: 3.008 ± 0.67
2.149IlePhe: 2.149 ± 0.966
4.297IleGly: 4.297 ± 1.557
1.719IleHis: 1.719 ± 0.786
3.438IleIle: 3.438 ± 0.393
1.289IleLys: 1.289 ± 0.388
2.578IleLeu: 2.578 ± 1.063
0.43IleMet: 0.43 ± 0.337
1.719IleAsn: 1.719 ± 0.593
2.578IlePro: 2.578 ± 1.465
2.149IleGln: 2.149 ± 0.642
3.008IleArg: 3.008 ± 0.993
5.157IleSer: 5.157 ± 1.743
2.149IleThr: 2.149 ± 1.231
3.008IleVal: 3.008 ± 0.832
0.43IleTrp: 0.43 ± 0.513
1.289IleTyr: 1.289 ± 0.776
0.0IleXaa: 0.0 ± 0.0
Lys
2.578LysAla: 2.578 ± 1.115
1.719LysCys: 1.719 ± 0.575
1.719LysAsp: 1.719 ± 0.606
2.578LysGlu: 2.578 ± 0.821
5.587LysPhe: 5.587 ± 1.765
1.719LysGly: 1.719 ± 0.748
3.008LysHis: 3.008 ± 1.143
0.859LysIle: 0.859 ± 0.604
0.43LysLys: 0.43 ± 0.411
4.727LysLeu: 4.727 ± 1.118
1.289LysMet: 1.289 ± 0.611
2.578LysAsn: 2.578 ± 1.305
2.578LysPro: 2.578 ± 0.705
1.289LysGln: 1.289 ± 0.611
4.727LysArg: 4.727 ± 1.698
3.868LysSer: 3.868 ± 1.619
2.578LysThr: 2.578 ± 1.137
3.868LysVal: 3.868 ± 1.252
0.43LysTrp: 0.43 ± 0.351
2.578LysTyr: 2.578 ± 0.629
0.0LysXaa: 0.0 ± 0.0
Leu
3.868LeuAla: 3.868 ± 0.912
0.859LeuCys: 0.859 ± 0.617
2.578LeuAsp: 2.578 ± 1.136
4.297LeuGlu: 4.297 ± 0.863
3.438LeuPhe: 3.438 ± 0.646
7.735LeuGly: 7.735 ± 1.487
1.289LeuHis: 1.289 ± 0.675
5.587LeuIle: 5.587 ± 1.45
3.868LeuLys: 3.868 ± 1.276
4.727LeuLeu: 4.727 ± 1.188
3.008LeuMet: 3.008 ± 1.098
4.727LeuAsn: 4.727 ± 1.618
5.157LeuPro: 5.157 ± 1.746
5.587LeuGln: 5.587 ± 1.222
2.149LeuArg: 2.149 ± 1.2
6.446LeuSer: 6.446 ± 1.606
5.157LeuThr: 5.157 ± 1.626
3.868LeuVal: 3.868 ± 1.59
0.859LeuTrp: 0.859 ± 0.748
3.008LeuTyr: 3.008 ± 0.633
0.0LeuXaa: 0.0 ± 0.0
Met
1.719MetAla: 1.719 ± 0.804
0.43MetCys: 0.43 ± 0.337
0.859MetAsp: 0.859 ± 0.435
1.289MetGlu: 1.289 ± 0.389
1.289MetPhe: 1.289 ± 0.388
0.859MetGly: 0.859 ± 0.618
0.43MetHis: 0.43 ± 0.351
1.719MetIle: 1.719 ± 0.528
0.0MetLys: 0.0 ± 0.0
1.719MetLeu: 1.719 ± 0.536
0.0MetMet: 0.0 ± 0.0
0.859MetAsn: 0.859 ± 0.49
0.859MetPro: 0.859 ± 0.618
0.43MetGln: 0.43 ± 0.337
1.289MetArg: 1.289 ± 0.831
1.289MetSer: 1.289 ± 0.388
0.859MetThr: 0.859 ± 0.435
0.859MetVal: 0.859 ± 0.443
0.43MetTrp: 0.43 ± 0.374
0.43MetTyr: 0.43 ± 0.513
0.0MetXaa: 0.0 ± 0.0
Asn
2.578AsnAla: 2.578 ± 0.882
0.859AsnCys: 0.859 ± 0.615
3.008AsnAsp: 3.008 ± 1.126
3.008AsnGlu: 3.008 ± 1.001
1.289AsnPhe: 1.289 ± 0.677
2.578AsnGly: 2.578 ± 0.909
0.43AsnHis: 0.43 ± 0.337
3.008AsnIle: 3.008 ± 0.621
3.438AsnLys: 3.438 ± 0.876
6.016AsnLeu: 6.016 ± 1.647
0.43AsnMet: 0.43 ± 0.374
2.149AsnAsn: 2.149 ± 0.778
3.868AsnPro: 3.868 ± 1.47
2.149AsnGln: 2.149 ± 0.96
3.438AsnArg: 3.438 ± 0.738
5.157AsnSer: 5.157 ± 1.381
4.727AsnThr: 4.727 ± 1.022
2.578AsnVal: 2.578 ± 0.451
1.289AsnTrp: 1.289 ± 0.638
1.719AsnTyr: 1.719 ± 1.022
0.0AsnXaa: 0.0 ± 0.0
Pro
3.438ProAla: 3.438 ± 0.67
1.289ProCys: 1.289 ± 0.864
5.157ProAsp: 5.157 ± 1.653
7.306ProGlu: 7.306 ± 1.758
3.008ProPhe: 3.008 ± 0.621
4.297ProGly: 4.297 ± 1.748
0.859ProHis: 0.859 ± 0.379
1.289ProIle: 1.289 ± 0.389
2.149ProLys: 2.149 ± 0.956
5.157ProLeu: 5.157 ± 1.125
0.859ProMet: 0.859 ± 0.617
3.008ProAsn: 3.008 ± 0.487
6.446ProPro: 6.446 ± 1.386
0.859ProGln: 0.859 ± 0.49
1.719ProArg: 1.719 ± 0.575
5.587ProSer: 5.587 ± 0.936
3.438ProThr: 3.438 ± 2.056
2.578ProVal: 2.578 ± 1.14
0.43ProTrp: 0.43 ± 0.411
2.149ProTyr: 2.149 ± 1.143
0.0ProXaa: 0.0 ± 0.0
Gln
3.008GlnAla: 3.008 ± 1.887
0.43GlnCys: 0.43 ± 0.536
2.578GlnAsp: 2.578 ± 1.09
1.719GlnGlu: 1.719 ± 0.786
2.149GlnPhe: 2.149 ± 0.327
2.578GlnGly: 2.578 ± 0.821
0.859GlnHis: 0.859 ± 0.455
2.578GlnIle: 2.578 ± 0.912
1.289GlnLys: 1.289 ± 0.352
3.868GlnLeu: 3.868 ± 0.839
1.719GlnMet: 1.719 ± 1.039
3.008GlnAsn: 3.008 ± 0.697
3.008GlnPro: 3.008 ± 0.918
2.578GlnGln: 2.578 ± 0.629
3.438GlnArg: 3.438 ± 1.877
4.727GlnSer: 4.727 ± 2.146
2.578GlnThr: 2.578 ± 0.641
1.289GlnVal: 1.289 ± 0.675
0.859GlnTrp: 0.859 ± 0.435
1.289GlnTyr: 1.289 ± 1.123
0.0GlnXaa: 0.0 ± 0.0
Arg
2.578ArgAla: 2.578 ± 0.759
1.719ArgCys: 1.719 ± 1.063
1.719ArgAsp: 1.719 ± 0.789
3.438ArgGlu: 3.438 ± 1.822
2.149ArgPhe: 2.149 ± 0.327
3.868ArgGly: 3.868 ± 1.532
1.719ArgHis: 1.719 ± 0.98
3.438ArgIle: 3.438 ± 1.013
5.157ArgLys: 5.157 ± 1.201
5.587ArgLeu: 5.587 ± 1.462
1.289ArgMet: 1.289 ± 0.648
2.149ArgAsn: 2.149 ± 0.822
4.297ArgPro: 4.297 ± 1.105
1.289ArgGln: 1.289 ± 1.233
4.727ArgArg: 4.727 ± 2.494
10.314ArgSer: 10.314 ± 4.566
3.868ArgThr: 3.868 ± 0.902
2.149ArgVal: 2.149 ± 0.944
0.859ArgTrp: 0.859 ± 0.822
0.43ArgTyr: 0.43 ± 0.351
0.0ArgXaa: 0.0 ± 0.0
Ser
5.157SerAla: 5.157 ± 1.406
1.719SerCys: 1.719 ± 1.007
5.587SerAsp: 5.587 ± 0.428
4.727SerGlu: 4.727 ± 1.579
2.578SerPhe: 2.578 ± 0.946
6.876SerGly: 6.876 ± 0.574
1.289SerHis: 1.289 ± 0.388
2.149SerIle: 2.149 ± 1.145
4.727SerLys: 4.727 ± 1.44
6.446SerLeu: 6.446 ± 1.335
1.719SerMet: 1.719 ± 0.569
5.587SerAsn: 5.587 ± 1.506
5.157SerPro: 5.157 ± 1.696
3.438SerGln: 3.438 ± 1.463
12.033SerArg: 12.033 ± 6.191
11.173SerSer: 11.173 ± 3.393
3.868SerThr: 3.868 ± 1.709
4.727SerVal: 4.727 ± 1.037
1.289SerTrp: 1.289 ± 0.389
1.719SerTyr: 1.719 ± 0.655
0.0SerXaa: 0.0 ± 0.0
Thr
2.149ThrAla: 2.149 ± 0.977
1.289ThrCys: 1.289 ± 0.601
4.297ThrAsp: 4.297 ± 0.813
3.868ThrGlu: 3.868 ± 0.912
6.016ThrPhe: 6.016 ± 1.337
4.297ThrGly: 4.297 ± 1.843
2.149ThrHis: 2.149 ± 0.327
3.438ThrIle: 3.438 ± 2.055
1.289ThrLys: 1.289 ± 0.652
3.868ThrLeu: 3.868 ± 0.674
0.859ThrMet: 0.859 ± 0.379
1.289ThrAsn: 1.289 ± 0.611
4.297ThrPro: 4.297 ± 1.658
3.438ThrGln: 3.438 ± 0.428
3.438ThrArg: 3.438 ± 1.195
5.157ThrSer: 5.157 ± 2.448
3.438ThrThr: 3.438 ± 0.798
5.157ThrVal: 5.157 ± 1.696
1.289ThrTrp: 1.289 ± 1.01
0.859ThrTyr: 0.859 ± 0.673
0.0ThrXaa: 0.0 ± 0.0
Val
3.868ValAla: 3.868 ± 1.282
1.289ValCys: 1.289 ± 0.674
4.297ValAsp: 4.297 ± 0.798
2.149ValGlu: 2.149 ± 0.726
3.008ValPhe: 3.008 ± 1.177
2.578ValGly: 2.578 ± 1.022
0.859ValHis: 0.859 ± 0.401
2.578ValIle: 2.578 ± 1.349
2.578ValLys: 2.578 ± 0.821
3.008ValLeu: 3.008 ± 0.714
0.43ValMet: 0.43 ± 0.351
3.438ValAsn: 3.438 ± 0.916
3.008ValPro: 3.008 ± 0.856
5.587ValGln: 5.587 ± 1.521
3.868ValArg: 3.868 ± 0.883
5.587ValSer: 5.587 ± 0.75
3.868ValThr: 3.868 ± 1.065
5.157ValVal: 5.157 ± 1.732
1.289ValTrp: 1.289 ± 0.823
2.578ValTyr: 2.578 ± 0.593
0.0ValXaa: 0.0 ± 0.0
Trp
0.43TrpAla: 0.43 ± 0.337
0.0TrpCys: 0.0 ± 0.0
0.43TrpAsp: 0.43 ± 0.374
1.719TrpGlu: 1.719 ± 1.178
0.43TrpPhe: 0.43 ± 0.374
1.719TrpGly: 1.719 ± 0.626
0.43TrpHis: 0.43 ± 0.411
2.149TrpIle: 2.149 ± 0.327
1.289TrpLys: 1.289 ± 1.01
1.719TrpLeu: 1.719 ± 0.912
0.0TrpMet: 0.0 ± 0.0
0.859TrpAsn: 0.859 ± 0.379
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
1.719TrpArg: 1.719 ± 1.235
0.43TrpSer: 0.43 ± 0.411
1.289TrpThr: 1.289 ± 0.388
1.289TrpVal: 1.289 ± 0.611
0.43TrpTrp: 0.43 ± 0.337
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.719TyrAla: 1.719 ± 0.781
0.859TyrCys: 0.859 ± 0.604
1.719TyrAsp: 1.719 ± 0.692
1.719TyrGlu: 1.719 ± 0.853
0.859TyrPhe: 0.859 ± 0.435
1.289TyrGly: 1.289 ± 0.612
0.43TyrHis: 0.43 ± 0.374
0.43TyrIle: 0.43 ± 0.351
2.578TyrLys: 2.578 ± 0.733
3.438TyrLeu: 3.438 ± 0.611
0.0TyrMet: 0.0 ± 0.0
1.719TyrAsn: 1.719 ± 0.593
1.719TyrPro: 1.719 ± 0.734
0.43TyrGln: 0.43 ± 0.337
1.719TyrArg: 1.719 ± 1.106
1.719TyrSer: 1.719 ± 0.734
2.149TyrThr: 2.149 ± 0.433
1.719TyrVal: 1.719 ± 0.692
0.43TyrTrp: 0.43 ± 0.374
2.578TyrTyr: 2.578 ± 1.009
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2328 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski