Amino acid dipeptide frequency for Capra hircus papillomavirus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
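
As an illustration of the idea, the following is a minimal sketch of how dipeptide frequencies could be counted. The function name `dipeptide_permille` and the toy sequences are hypothetical. The sketch assumes overlapping pairs counted within each protein and normalization by the total number of pairs; the exact counting procedure used for the table on this page is not stated here and may differ.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Map any non-standard or unknown residue to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs inside one protein (assumption: no pairs across proteins).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not real papillomavirus sequences.
freqs = dipeptide_permille(["MAAK", "AAXA"])
```

Here `freqs["AA"]` is the per mille share of the Ala-Ala pair among all six pairs found in the two toy sequences.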

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility the dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
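
The arithmetic behind the expected value can be sketched as follows. The helper names `expected_permille` and `classify` are hypothetical, and using the uniform expectation as the red/blue threshold is an assumption; the page does not state the exact cutoff it applies.

```python
def expected_permille(n_residue_types=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / n_residue_types ** 2


def classify(observed_permille, expected):
    """Label a dipeptide relative to the (assumed) uniform expectation."""
    if observed_permille > expected:
        return "over"   # would be marked red
    if observed_permille < expected:
        return "under"  # would be marked blue
    return "expected"


uniform_20 = expected_permille(20)  # 400 possible dipeptides
uniform_21 = expected_permille(21)  # 441 possible dipeptides when Xaa is included
label_ala_ala = classify(3.448, uniform_20)  # AlaAla value from the table
label_ala_cys = classify(0.766, uniform_20)  # AlaCys value from the table
```

With 20 residue types the expectation is 2.5‰; with Xaa included it drops to 1000/441, roughly 2.27‰.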

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Dipeptide frequency ± error, in ‰, grouped by first residue. Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 3.448 ± 1.552
AlaCys: 0.766 ± 0.565
AlaAsp: 2.299 ± 0.754
AlaGlu: 3.831 ± 1.569
AlaPhe: 1.149 ± 0.551
AlaGly: 2.299 ± 0.913
AlaHis: 0.766 ± 0.494
AlaIle: 1.916 ± 0.619
AlaLys: 1.916 ± 0.387
AlaLeu: 4.215 ± 1.145
AlaMet: 1.533 ± 0.487
AlaAsn: 1.149 ± 0.547
AlaPro: 4.215 ± 1.149
AlaGln: 3.065 ± 1.58
AlaArg: 4.981 ± 1.273
AlaSer: 7.28 ± 1.748
AlaThr: 3.831 ± 1.001
AlaVal: 4.215 ± 1.475
AlaTrp: 0.383 ± 0.282
AlaTyr: 0.766 ± 0.404
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 1.533 ± 0.743
CysCys: 0.383 ± 0.282
CysAsp: 1.533 ± 0.757
CysGlu: 0.0 ± 0.0
CysPhe: 1.149 ± 0.559
CysGly: 1.533 ± 1.017
CysHis: 0.383 ± 0.282
CysIle: 1.533 ± 0.793
CysLys: 1.149 ± 0.552
CysLeu: 1.533 ± 0.904
CysMet: 0.0 ± 0.0
CysAsn: 1.533 ± 0.661
CysPro: 1.149 ± 0.704
CysGln: 0.383 ± 0.282
CysArg: 1.916 ± 1.975
CysSer: 1.149 ± 1.022
CysThr: 0.766 ± 0.531
CysVal: 0.766 ± 0.564
CysTrp: 1.533 ± 0.596
CysTyr: 1.149 ± 0.523
CysXaa: 0.0 ± 0.0

Asp
AspAla: 3.065 ± 0.843
AspCys: 1.533 ± 0.501
AspAsp: 4.215 ± 1.781
AspGlu: 3.448 ± 1.219
AspPhe: 2.682 ± 0.86
AspGly: 1.533 ± 0.726
AspHis: 1.149 ± 0.601
AspIle: 4.598 ± 1.193
AspLys: 1.916 ± 0.7
AspLeu: 5.747 ± 1.698
AspMet: 1.149 ± 0.553
AspAsn: 0.766 ± 0.358
AspPro: 7.28 ± 3.575
AspGln: 3.831 ± 1.34
AspArg: 1.916 ± 0.907
AspSer: 8.046 ± 1.678
AspThr: 4.215 ± 0.784
AspVal: 2.682 ± 0.828
AspTrp: 0.766 ± 0.404
AspTyr: 0.383 ± 0.453
AspXaa: 0.0 ± 0.0

Glu
GluAla: 2.682 ± 0.544
GluCys: 1.149 ± 0.552
GluAsp: 5.747 ± 1.825
GluGlu: 9.195 ± 3.177
GluPhe: 1.533 ± 0.574
GluGly: 2.299 ± 1.112
GluHis: 1.533 ± 0.665
GluIle: 1.149 ± 0.525
GluLys: 3.065 ± 1.216
GluLeu: 4.981 ± 1.026
GluMet: 0.383 ± 0.282
GluAsn: 3.831 ± 0.915
GluPro: 1.916 ± 1.055
GluGln: 1.149 ± 0.523
GluArg: 3.448 ± 0.67
GluSer: 1.149 ± 0.477
GluThr: 4.598 ± 0.967
GluVal: 4.598 ± 1.085
GluTrp: 0.383 ± 0.282
GluTyr: 1.916 ± 0.669
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.299 ± 1.382
PheCys: 1.916 ± 1.902
PheAsp: 2.682 ± 0.506
PheGlu: 3.831 ± 0.796
PhePhe: 1.533 ± 0.808
PheGly: 1.149 ± 0.698
PheHis: 0.383 ± 0.453
PheIle: 2.299 ± 0.982
PheLys: 3.065 ± 1.121
PheLeu: 4.598 ± 1.041
PheMet: 0.766 ± 0.404
PheAsn: 1.916 ± 0.861
PhePro: 1.533 ± 0.461
PheGln: 1.916 ± 0.773
PheArg: 1.916 ± 0.511
PheSer: 3.065 ± 1.036
PheThr: 1.533 ± 0.808
PheVal: 1.916 ± 1.348
PheTrp: 1.149 ± 0.601
PheTyr: 1.533 ± 0.808
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 4.215 ± 0.612
GlyCys: 0.383 ± 0.353
GlyAsp: 0.766 ± 0.521
GlyGlu: 4.215 ± 2.455
GlyPhe: 1.916 ± 0.59
GlyGly: 8.046 ± 2.8
GlyHis: 2.299 ± 0.426
GlyIle: 2.682 ± 0.642
GlyLys: 2.682 ± 1.072
GlyLeu: 4.215 ± 1.419
GlyMet: 1.149 ± 0.559
GlyAsn: 2.682 ± 1.194
GlyPro: 4.215 ± 2.13
GlyGln: 3.448 ± 1.481
GlyArg: 6.897 ± 2.152
GlySer: 9.195 ± 2.572
GlyThr: 3.448 ± 1.355
GlyVal: 4.215 ± 0.884
GlyTrp: 0.0 ± 0.0
GlyTyr: 1.533 ± 0.904
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.0 ± 0.0
HisCys: 1.149 ± 0.707
HisAsp: 0.383 ± 0.453
HisGlu: 0.383 ± 0.342
HisPhe: 0.766 ± 0.564
HisGly: 1.916 ± 0.655
HisHis: 0.383 ± 0.342
HisIle: 1.533 ± 0.8
HisLys: 1.149 ± 0.547
HisLeu: 1.916 ± 0.963
HisMet: 0.0 ± 0.0
HisAsn: 1.916 ± 0.387
HisPro: 1.533 ± 1.309
HisGln: 1.916 ± 1.313
HisArg: 1.533 ± 0.844
HisSer: 2.299 ± 0.918
HisThr: 1.149 ± 0.673
HisVal: 2.299 ± 0.546
HisTrp: 0.0 ± 0.0
HisTyr: 1.533 ± 0.8
HisXaa: 0.0 ± 0.0

Ile
IleAla: 2.682 ± 0.627
IleCys: 1.533 ± 0.674
IleAsp: 3.065 ± 1.289
IleGlu: 4.215 ± 1.114
IlePhe: 1.916 ± 0.561
IleGly: 4.215 ± 1.72
IleHis: 0.766 ± 0.439
IleIle: 1.533 ± 0.657
IleLys: 1.916 ± 0.869
IleLeu: 4.215 ± 0.698
IleMet: 0.383 ± 0.325
IleAsn: 0.766 ± 0.4
IlePro: 1.916 ± 0.628
IleGln: 2.299 ± 0.657
IleArg: 1.149 ± 0.496
IleSer: 3.448 ± 1.069
IleThr: 1.916 ± 0.966
IleVal: 3.065 ± 1.137
IleTrp: 0.0 ± 0.0
IleTyr: 1.533 ± 0.54
IleXaa: 0.0 ± 0.0

Lys
LysAla: 1.916 ± 0.811
LysCys: 1.149 ± 0.559
LysAsp: 1.916 ± 0.761
LysGlu: 1.533 ± 0.637
LysPhe: 3.065 ± 1.793
LysGly: 3.448 ± 0.966
LysHis: 1.916 ± 0.744
LysIle: 1.149 ± 0.601
LysLys: 3.831 ± 0.839
LysLeu: 5.364 ± 1.777
LysMet: 0.766 ± 0.404
LysAsn: 1.916 ± 0.753
LysPro: 1.916 ± 0.657
LysGln: 2.682 ± 0.724
LysArg: 4.598 ± 0.955
LysSer: 4.215 ± 1.418
LysThr: 1.916 ± 0.894
LysVal: 0.766 ± 0.706
LysTrp: 1.533 ± 0.625
LysTyr: 2.299 ± 0.977
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 4.598 ± 1.026
LeuCys: 0.766 ± 0.502
LeuAsp: 8.429 ± 2.048
LeuGlu: 4.598 ± 0.785
LeuPhe: 3.831 ± 0.9
LeuGly: 8.046 ± 1.992
LeuHis: 4.215 ± 1.416
LeuIle: 1.149 ± 0.37
LeuLys: 4.981 ± 0.758
LeuLeu: 9.195 ± 2.182
LeuMet: 1.149 ± 0.705
LeuAsn: 3.448 ± 0.881
LeuPro: 4.215 ± 1.216
LeuGln: 3.831 ± 0.885
LeuArg: 4.215 ± 1.531
LeuSer: 8.429 ± 1.851
LeuThr: 4.598 ± 0.892
LeuVal: 4.981 ± 1.195
LeuTrp: 0.383 ± 0.331
LeuTyr: 2.682 ± 0.997
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.149 ± 0.704
MetCys: 0.766 ± 0.494
MetAsp: 0.383 ± 0.353
MetGlu: 0.766 ± 0.357
MetPhe: 0.766 ± 0.404
MetGly: 0.383 ± 0.353
MetHis: 0.0 ± 0.0
MetIle: 1.149 ± 0.718
MetLys: 0.766 ± 0.502
MetLeu: 1.149 ± 0.547
MetMet: 0.383 ± 0.41
MetAsn: 0.766 ± 0.404
MetPro: 0.0 ± 0.0
MetGln: 0.766 ± 0.404
MetArg: 0.0 ± 0.0
MetSer: 0.766 ± 0.357
MetThr: 2.299 ± 0.722
MetVal: 1.149 ± 0.551
MetTrp: 0.0 ± 0.0
MetTyr: 0.383 ± 0.342
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.065 ± 1.248
AsnCys: 0.383 ± 0.282
AsnAsp: 1.916 ± 0.657
AsnGlu: 0.383 ± 0.331
AsnPhe: 2.682 ± 0.906
AsnGly: 2.299 ± 0.657
AsnHis: 0.383 ± 0.282
AsnIle: 2.299 ± 0.35
AsnLys: 2.299 ± 1.296
AsnLeu: 2.682 ± 0.812
AsnMet: 0.383 ± 0.353
AsnAsn: 1.916 ± 0.717
AsnPro: 3.448 ± 0.589
AsnGln: 1.916 ± 0.387
AsnArg: 2.299 ± 0.541
AsnSer: 3.448 ± 0.854
AsnThr: 3.065 ± 0.976
AsnVal: 2.299 ± 0.497
AsnTrp: 1.149 ± 0.846
AsnTyr: 1.533 ± 1.128
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 3.448 ± 1.266
ProCys: 1.916 ± 1.095
ProAsp: 4.598 ± 1.501
ProGlu: 3.448 ± 0.95
ProPhe: 1.533 ± 0.286
ProGly: 4.598 ± 1.404
ProHis: 1.533 ± 0.438
ProIle: 2.299 ± 0.958
ProLys: 3.448 ± 1.132
ProLeu: 4.215 ± 1.482
ProMet: 0.766 ± 0.696
ProAsn: 1.916 ± 0.387
ProPro: 10.728 ± 5.502
ProGln: 4.598 ± 3.491
ProArg: 5.364 ± 2.797
ProSer: 6.13 ± 2.296
ProThr: 3.065 ± 1.533
ProVal: 4.215 ± 1.397
ProTrp: 0.766 ± 0.502
ProTyr: 1.916 ± 1.058
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 2.682 ± 0.986
GlnCys: 0.0 ± 0.0
GlnAsp: 3.448 ± 1.218
GlnGlu: 2.299 ± 0.799
GlnPhe: 2.299 ± 0.92
GlnGly: 3.065 ± 1.112
GlnHis: 1.533 ± 0.681
GlnIle: 1.916 ± 0.958
GlnLys: 0.766 ± 0.703
GlnLeu: 5.747 ± 1.08
GlnMet: 0.383 ± 0.353
GlnAsn: 2.299 ± 0.983
GlnPro: 4.981 ± 1.665
GlnGln: 3.448 ± 1.214
GlnArg: 3.065 ± 0.675
GlnSer: 3.831 ± 1.721
GlnThr: 1.916 ± 0.561
GlnVal: 1.533 ± 0.942
GlnTrp: 0.766 ± 0.358
GlnTyr: 1.916 ± 0.958
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 5.364 ± 1.348
ArgCys: 1.916 ± 0.803
ArgAsp: 5.747 ± 2.741
ArgGlu: 1.916 ± 0.26
ArgPhe: 0.766 ± 0.547
ArgGly: 7.28 ± 2.403
ArgHis: 2.299 ± 0.886
ArgIle: 1.916 ± 1.04
ArgLys: 5.747 ± 1.03
ArgLeu: 8.046 ± 1.79
ArgMet: 0.766 ± 0.414
ArgAsn: 1.149 ± 0.501
ArgPro: 4.981 ± 1.741
ArgGln: 3.065 ± 0.837
ArgArg: 5.747 ± 2.748
ArgSer: 7.28 ± 4.822
ArgThr: 1.916 ± 0.761
ArgVal: 3.448 ± 1.126
ArgTrp: 0.766 ± 0.502
ArgTyr: 1.149 ± 0.565
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 4.215 ± 1.217
SerCys: 0.383 ± 0.353
SerAsp: 5.364 ± 1.393
SerGlu: 3.831 ± 1.041
SerPhe: 5.364 ± 0.852
SerGly: 7.28 ± 2.176
SerHis: 1.533 ± 0.962
SerIle: 5.364 ± 2.465
SerLys: 3.831 ± 0.96
SerLeu: 5.747 ± 1.23
SerMet: 2.299 ± 0.636
SerAsn: 4.598 ± 1.618
SerPro: 4.981 ± 1.65
SerGln: 4.981 ± 1.75
SerArg: 8.046 ± 3.089
SerSer: 10.728 ± 4.814
SerThr: 6.13 ± 2.422
SerVal: 4.598 ± 0.765
SerTrp: 1.149 ± 0.771
SerTyr: 2.682 ± 0.85
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 2.682 ± 1.222
ThrCys: 1.533 ± 0.831
ThrAsp: 3.065 ± 1.257
ThrGlu: 2.682 ± 0.818
ThrPhe: 3.065 ± 1.112
ThrGly: 3.831 ± 1.272
ThrHis: 0.383 ± 0.331
ThrIle: 3.065 ± 1.439
ThrLys: 0.766 ± 0.706
ThrLeu: 3.831 ± 0.77
ThrMet: 0.766 ± 0.564
ThrAsn: 2.299 ± 0.886
ThrPro: 6.13 ± 1.688
ThrGln: 0.383 ± 0.342
ThrArg: 4.215 ± 0.657
ThrSer: 6.13 ± 1.146
ThrThr: 3.831 ± 0.723
ThrVal: 4.981 ± 0.862
ThrTrp: 0.383 ± 0.342
ThrTyr: 1.533 ± 0.596
ThrXaa: 0.0 ± 0.0

Val
ValAla: 2.299 ± 0.776
ValCys: 1.533 ± 0.879
ValAsp: 3.448 ± 0.777
ValGlu: 3.065 ± 1.267
ValPhe: 3.065 ± 0.396
ValGly: 3.065 ± 1.358
ValHis: 1.533 ± 0.613
ValIle: 3.831 ± 1.453
ValLys: 2.682 ± 0.85
ValLeu: 4.981 ± 1.495
ValMet: 0.383 ± 0.283
ValAsn: 2.299 ± 0.614
ValPro: 4.598 ± 1.082
ValGln: 3.065 ± 1.13
ValArg: 6.513 ± 1.491
ValSer: 3.448 ± 1.266
ValThr: 3.448 ± 1.432
ValVal: 3.831 ± 1.364
ValTrp: 0.383 ± 0.353
ValTyr: 1.533 ± 0.734
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.149 ± 0.517
TrpCys: 0.383 ± 0.342
TrpAsp: 0.383 ± 0.342
TrpGlu: 1.149 ± 0.673
TrpPhe: 0.766 ± 0.439
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.383 ± 0.282
TrpLys: 1.149 ± 0.601
TrpLeu: 2.299 ± 1.141
TrpMet: 0.0 ± 0.0
TrpAsn: 1.149 ± 1.059
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 1.533 ± 1.06
TrpSer: 0.766 ± 0.358
TrpThr: 0.766 ± 0.414
TrpVal: 1.149 ± 0.547
TrpTrp: 0.383 ± 0.486
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 1.533 ± 0.99
TyrCys: 1.533 ± 1.141
TyrAsp: 1.533 ± 0.737
TyrGlu: 1.916 ± 0.865
TyrPhe: 1.149 ± 0.807
TyrGly: 1.916 ± 0.521
TyrHis: 0.766 ± 0.414
TyrIle: 0.766 ± 0.502
TyrLys: 0.766 ± 0.565
TyrLeu: 3.065 ± 1.043
TyrMet: 0.0 ± 0.0
TyrAsn: 1.533 ± 0.501
TyrPro: 1.149 ± 0.875
TyrGln: 1.149 ± 0.517
TyrArg: 1.916 ± 0.762
TyrSer: 2.299 ± 0.832
TyrThr: 1.149 ± 0.698
TyrVal: 2.299 ± 0.972
TyrTrp: 1.533 ± 1.015
TyrTyr: 3.831 ± 0.966
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 7 proteins (2611 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
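
A protein-level bootstrap of this kind can be sketched as below. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the sample standard deviation of those estimates is taken as the error. The function names, the fixed seed, and the toy sequences are hypothetical, and the database's actual procedure may differ in its details.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide among all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the proteome, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy sequences, not the real proteome.
proteins = ["MAAKAA", "MKKLSS", "MAASPP"]
err = bootstrap_error(proteins, "AA")
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, so the error reflects how much the estimate depends on which proteins are present.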

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski