Amino acid dipeptide frequency for Canis familiaris papillomavirus 9

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
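As a minimal sketch of such a calculation, the Python snippet below counts overlapping adjacent pairs in each protein and reports them in per mille. The function name `dipeptide_frequencies`, the one-letter sequence input, and the choice to normalize by the total number of counted pairs are assumptions made for illustration; the exact procedure used by the database is not stated on this page.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Hypothetical helper: sequences are assumed to be one-letter strings,
    and frequencies are normalized by the total number of counted pairs.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs; a pair never spans two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input: "MAAL" gives MA, AA, AL and "AAG" gives AA, AG (5 pairs in total).
freqs = dipeptide_frequencies(["MAAL", "AAG"])
print(freqs["AA"])  # AA is 2 of 5 pairs, i.e. 400 per mille
```

Counting within each protein separately avoids creating artificial pairs at the boundary between two unrelated sequences.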

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
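The expectation above can be checked with a few lines of Python. This is only a sketch: the helper `classify` and the rule of comparing each value directly against the uniform expectation are assumptions, since the page does not state the exact threshold used for the red and blue marking. The three example values are taken from the table.

```python
N_STANDARD = 20  # standard amino acids
N_WITH_XAA = 21  # including Xaa

# Uniform expectation in per mille: 1000 / number of possible dipeptides.
expected_permille = 1000.0 / N_STANDARD ** 2            # 2.5 (= 0.25%)
expected_permille_with_xaa = 1000.0 / N_WITH_XAA ** 2   # about 2.27

def classify(observed_permille, expected=expected_permille):
    """Hypothetical rule: compare an observed value with the uniform expectation."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"

# Values copied from the table (per mille).
examples = {"LeuLeu": 10.461, "AlaAla": 5.812, "CysCys": 0.775}
labels = {pair: classify(value) for pair, value in examples.items()}
print(labels)
```

With this simple rule, LeuLeu and AlaAla come out as overrepresented and CysCys as underrepresented.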

All values are given in per mille (‰); to convert them to fractions, multiply by 10⁻³.
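The unit conversion can be sketched as follows. The function names are hypothetical and the example value is the AlaAla entry from the table.

```python
def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def permille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0

ala_ala = 5.812  # AlaAla frequency from the table, in per mille
fraction = permille_to_fraction(ala_ala)  # about 0.005812
percent = permille_to_percent(ala_ala)    # about 0.5812
print(fraction, percent)
```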

For more information, see the article on sequence space on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.812 ± 2.322
AlaCys: 1.937 ± 1.347
AlaAsp: 4.262 ± 1.0
AlaGlu: 2.712 ± 0.889
AlaPhe: 3.874 ± 1.159
AlaGly: 6.587 ± 2.623
AlaHis: 1.162 ± 0.386
AlaIle: 1.162 ± 0.374
AlaLys: 2.712 ± 1.622
AlaLeu: 5.812 ± 2.016
AlaMet: 1.162 ± 0.343
AlaAsn: 0.775 ± 0.567
AlaPro: 5.037 ± 1.312
AlaGln: 2.712 ± 0.84
AlaArg: 2.712 ± 0.743
AlaSer: 5.037 ± 0.799
AlaThr: 3.874 ± 1.142
AlaVal: 4.649 ± 1.932
AlaTrp: 0.387 ± 0.284
AlaTyr: 2.325 ± 1.368
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.937 ± 1.5
CysCys: 0.775 ± 0.654
CysAsp: 0.775 ± 0.479
CysGlu: 1.162 ± 0.481
CysPhe: 0.775 ± 0.656
CysGly: 1.162 ± 1.057
CysHis: 0.775 ± 0.396
CysIle: 1.162 ± 0.558
CysLys: 0.387 ± 0.319
CysLeu: 1.162 ± 0.667
CysMet: 0.0 ± 0.0
CysAsn: 1.937 ± 0.899
CysPro: 2.325 ± 0.725
CysGln: 0.0 ± 0.0
CysArg: 0.775 ± 0.654
CysSer: 2.325 ± 0.596
CysThr: 2.325 ± 1.477
CysVal: 1.55 ± 1.196
CysTrp: 0.387 ± 0.329
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.649 ± 1.21
AspCys: 1.937 ± 0.774
AspAsp: 3.874 ± 0.982
AspGlu: 2.325 ± 1.055
AspPhe: 1.937 ± 1.207
AspGly: 3.1 ± 0.674
AspHis: 0.775 ± 0.567
AspIle: 1.937 ± 0.869
AspLys: 1.937 ± 0.975
AspLeu: 5.812 ± 1.832
AspMet: 1.55 ± 0.533
AspAsn: 3.487 ± 0.672
AspPro: 4.262 ± 0.838
AspGln: 3.1 ± 0.659
AspArg: 1.55 ± 0.463
AspSer: 5.037 ± 1.959
AspThr: 4.649 ± 1.254
AspVal: 5.037 ± 2.027
AspTrp: 1.55 ± 0.752
AspTyr: 1.55 ± 0.557
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.712 ± 1.051
GluCys: 0.387 ± 0.284
GluAsp: 3.874 ± 1.477
GluGlu: 6.587 ± 2.218
GluPhe: 2.325 ± 0.862
GluGly: 4.262 ± 1.458
GluHis: 1.162 ± 0.792
GluIle: 1.162 ± 0.621
GluLys: 1.937 ± 0.895
GluLeu: 4.649 ± 1.238
GluMet: 0.387 ± 0.328
GluAsn: 3.487 ± 0.874
GluPro: 3.1 ± 0.795
GluGln: 2.325 ± 0.765
GluArg: 4.262 ± 1.672
GluSer: 4.262 ± 1.572
GluThr: 2.325 ± 0.951
GluVal: 6.199 ± 1.598
GluTrp: 0.775 ± 0.567
GluTyr: 0.775 ± 0.359
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.712 ± 0.88
PheCys: 2.325 ± 0.843
PheAsp: 1.55 ± 0.806
PheGlu: 2.712 ± 1.072
PhePhe: 3.1 ± 1.087
PheGly: 3.1 ± 0.809
PheHis: 0.387 ± 0.319
PheIle: 2.325 ± 0.8
PheLys: 2.325 ± 1.129
PheLeu: 1.55 ± 0.589
PheMet: 0.775 ± 0.381
PheAsn: 0.775 ± 0.359
PhePro: 2.325 ± 0.428
PheGln: 1.162 ± 0.515
PheArg: 3.1 ± 0.61
PheSer: 1.55 ± 0.952
PheThr: 1.937 ± 0.626
PheVal: 1.55 ± 0.792
PheTrp: 1.55 ± 0.533
PheTyr: 0.775 ± 0.453
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.361 ± 1.434
GlyCys: 0.387 ± 0.329
GlyAsp: 5.037 ± 1.658
GlyGlu: 5.812 ± 1.633
GlyPhe: 2.712 ± 0.918
GlyGly: 9.299 ± 3.238
GlyHis: 1.162 ± 0.374
GlyIle: 4.262 ± 1.131
GlyLys: 2.325 ± 0.599
GlyLeu: 4.262 ± 1.36
GlyMet: 0.775 ± 0.451
GlyAsn: 1.55 ± 0.809
GlyPro: 6.974 ± 2.537
GlyGln: 1.937 ± 0.717
GlyArg: 5.424 ± 1.447
GlySer: 5.424 ± 1.889
GlyThr: 7.749 ± 1.84
GlyVal: 3.874 ± 1.26
GlyTrp: 0.387 ± 0.446
GlyTyr: 2.325 ± 0.939
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.937 ± 0.54
HisCys: 0.0 ± 0.0
HisAsp: 0.387 ± 0.329
HisGlu: 1.162 ± 0.638
HisPhe: 1.55 ± 0.571
HisGly: 1.55 ± 0.736
HisHis: 1.162 ± 0.572
HisIle: 0.387 ± 0.284
HisLys: 0.775 ± 0.567
HisLeu: 0.775 ± 0.359
HisMet: 0.0 ± 0.0
HisAsn: 0.387 ± 0.284
HisPro: 2.325 ± 0.872
HisGln: 0.775 ± 0.387
HisArg: 0.775 ± 0.893
HisSer: 1.55 ± 0.643
HisThr: 1.937 ± 0.684
HisVal: 1.162 ± 0.386
HisTrp: 1.162 ± 0.65
HisTyr: 1.162 ± 0.595
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.55 ± 0.816
IleCys: 1.937 ± 0.584
IleAsp: 0.775 ± 0.366
IleGlu: 2.325 ± 1.171
IlePhe: 1.162 ± 0.636
IleGly: 4.649 ± 0.942
IleHis: 1.162 ± 0.745
IleIle: 0.775 ± 0.394
IleLys: 0.0 ± 0.0
IleLeu: 2.712 ± 0.792
IleMet: 0.387 ± 0.284
IleAsn: 0.387 ± 0.319
IlePro: 1.55 ± 0.557
IleGln: 1.937 ± 0.409
IleArg: 2.712 ± 0.528
IleSer: 3.487 ± 1.195
IleThr: 2.712 ± 1.271
IleVal: 3.487 ± 1.13
IleTrp: 0.0 ± 0.0
IleTyr: 0.775 ± 0.387
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.712 ± 0.829
LysCys: 0.775 ± 0.656
LysAsp: 1.937 ± 0.567
LysGlu: 1.162 ± 0.65
LysPhe: 0.775 ± 0.359
LysGly: 1.162 ± 0.649
LysHis: 2.325 ± 1.047
LysIle: 1.937 ± 0.667
LysLys: 3.487 ± 1.27
LysLeu: 1.937 ± 1.042
LysMet: 0.387 ± 0.615
LysAsn: 1.55 ± 0.653
LysPro: 0.775 ± 0.556
LysGln: 1.937 ± 0.657
LysArg: 3.874 ± 1.583
LysSer: 1.937 ± 1.115
LysThr: 2.325 ± 0.772
LysVal: 3.487 ± 1.412
LysTrp: 0.0 ± 0.0
LysTyr: 3.1 ± 1.225
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.649 ± 0.806
LeuCys: 2.325 ± 1.472
LeuAsp: 6.974 ± 1.168
LeuGlu: 7.361 ± 0.698
LeuPhe: 2.325 ± 0.662
LeuGly: 7.361 ± 2.423
LeuHis: 1.55 ± 0.272
LeuIle: 1.55 ± 0.719
LeuLys: 3.1 ± 1.0
LeuLeu: 10.461 ± 2.483
LeuMet: 1.162 ± 0.564
LeuAsn: 1.55 ± 0.719
LeuPro: 6.587 ± 2.158
LeuGln: 5.037 ± 1.037
LeuArg: 7.361 ± 1.102
LeuSer: 6.199 ± 1.711
LeuThr: 5.037 ± 1.584
LeuVal: 3.874 ± 1.241
LeuTrp: 1.937 ± 0.753
LeuTyr: 2.325 ± 0.468
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.387 ± 0.284
MetCys: 0.387 ± 0.328
MetAsp: 1.162 ± 0.511
MetGlu: 1.162 ± 0.386
MetPhe: 0.387 ± 0.329
MetGly: 0.387 ± 0.284
MetHis: 0.387 ± 0.328
MetIle: 0.387 ± 0.456
MetLys: 0.387 ± 0.284
MetLeu: 1.162 ± 0.374
MetMet: 0.775 ± 0.35
MetAsn: 0.387 ± 0.446
MetPro: 0.775 ± 0.639
MetGln: 0.775 ± 0.387
MetArg: 0.387 ± 0.328
MetSer: 2.325 ± 0.791
MetThr: 1.55 ± 0.589
MetVal: 1.162 ± 0.572
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.55 ± 0.847
AsnCys: 0.387 ± 0.328
AsnAsp: 2.325 ± 1.114
AsnGlu: 1.162 ± 0.558
AsnPhe: 1.162 ± 0.343
AsnGly: 2.325 ± 0.428
AsnHis: 0.387 ± 0.328
AsnIle: 1.55 ± 0.409
AsnLys: 0.775 ± 0.366
AsnLeu: 4.262 ± 1.46
AsnMet: 0.387 ± 0.329
AsnAsn: 1.55 ± 0.409
AsnPro: 2.325 ± 0.892
AsnGln: 1.937 ± 1.002
AsnArg: 2.325 ± 0.66
AsnSer: 2.712 ± 0.509
AsnThr: 2.325 ± 0.566
AsnVal: 2.325 ± 0.823
AsnTrp: 0.387 ± 0.284
AsnTyr: 0.0 ± 0.0
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.199 ± 1.897
ProCys: 2.325 ± 1.492
ProAsp: 4.649 ± 0.791
ProGlu: 4.262 ± 0.882
ProPhe: 2.325 ± 0.549
ProGly: 5.812 ± 2.262
ProHis: 1.937 ± 1.115
ProIle: 1.162 ± 0.374
ProLys: 3.1 ± 1.152
ProLeu: 8.136 ± 2.468
ProMet: 0.775 ± 0.387
ProAsn: 2.712 ± 1.512
ProPro: 10.461 ± 2.594
ProGln: 1.162 ± 0.893
ProArg: 6.199 ± 2.764
ProSer: 5.812 ± 1.736
ProThr: 5.037 ± 1.791
ProVal: 6.199 ± 1.996
ProTrp: 0.387 ± 0.394
ProTyr: 1.162 ± 0.621
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.487 ± 0.961
GlnCys: 0.0 ± 0.0
GlnAsp: 2.325 ± 0.973
GlnGlu: 2.712 ± 0.81
GlnPhe: 1.162 ± 0.628
GlnGly: 1.162 ± 0.374
GlnHis: 1.937 ± 0.608
GlnIle: 1.937 ± 0.292
GlnLys: 1.937 ± 0.968
GlnLeu: 3.874 ± 0.81
GlnMet: 0.775 ± 0.477
GlnAsn: 1.162 ± 0.728
GlnPro: 2.712 ± 0.961
GlnGln: 3.487 ± 0.773
GlnArg: 4.262 ± 0.717
GlnSer: 1.937 ± 0.473
GlnThr: 2.325 ± 0.566
GlnVal: 3.487 ± 0.973
GlnTrp: 1.162 ± 0.659
GlnTyr: 1.55 ± 0.605
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.262 ± 1.229
ArgCys: 1.937 ± 0.923
ArgAsp: 2.325 ± 0.701
ArgGlu: 3.487 ± 0.626
ArgPhe: 4.262 ± 0.807
ArgGly: 5.424 ± 1.137
ArgHis: 2.325 ± 0.884
ArgIle: 3.1 ± 1.018
ArgLys: 3.487 ± 1.039
ArgLeu: 7.749 ± 0.801
ArgMet: 0.387 ± 0.284
ArgAsn: 1.937 ± 0.621
ArgPro: 5.037 ± 2.376
ArgGln: 2.325 ± 0.919
ArgArg: 8.136 ± 3.008
ArgSer: 5.424 ± 1.671
ArgThr: 3.487 ± 0.948
ArgVal: 4.262 ± 1.721
ArgTrp: 1.162 ± 0.481
ArgTyr: 2.712 ± 0.663
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.487 ± 1.456
SerCys: 0.0 ± 0.0
SerAsp: 5.424 ± 1.134
SerGlu: 4.649 ± 1.283
SerPhe: 1.937 ± 0.738
SerGly: 7.749 ± 2.213
SerHis: 0.387 ± 0.329
SerIle: 2.325 ± 0.509
SerLys: 2.712 ± 1.56
SerLeu: 8.136 ± 1.635
SerMet: 1.55 ± 0.406
SerAsn: 4.262 ± 0.565
SerPro: 7.749 ± 2.661
SerGln: 4.649 ± 0.709
SerArg: 3.874 ± 1.49
SerSer: 5.424 ± 1.81
SerThr: 3.874 ± 1.189
SerVal: 3.874 ± 1.367
SerTrp: 0.775 ± 0.441
SerTyr: 1.162 ± 0.558
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.874 ± 1.727
ThrCys: 2.325 ± 1.049
ThrAsp: 3.874 ± 0.621
ThrGlu: 1.937 ± 0.657
ThrPhe: 1.162 ± 0.498
ThrGly: 4.262 ± 1.258
ThrHis: 0.775 ± 0.381
ThrIle: 1.55 ± 0.533
ThrLys: 2.325 ± 0.891
ThrLeu: 5.424 ± 0.549
ThrMet: 1.55 ± 0.825
ThrAsn: 2.325 ± 0.865
ThrPro: 6.974 ± 2.949
ThrGln: 3.1 ± 1.21
ThrArg: 5.424 ± 1.007
ThrSer: 7.361 ± 1.556
ThrThr: 4.262 ± 1.394
ThrVal: 6.587 ± 2.024
ThrTrp: 1.55 ± 0.967
ThrTyr: 2.325 ± 1.289
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.1 ± 0.897
ValCys: 0.0 ± 0.0
ValAsp: 5.037 ± 1.811
ValGlu: 3.1 ± 0.663
ValPhe: 2.325 ± 0.955
ValGly: 6.587 ± 1.355
ValHis: 0.775 ± 0.366
ValIle: 4.262 ± 1.196
ValLys: 2.325 ± 0.823
ValLeu: 6.587 ± 1.051
ValMet: 1.162 ± 0.613
ValAsn: 0.775 ± 0.359
ValPro: 6.587 ± 2.416
ValGln: 3.487 ± 1.202
ValArg: 5.424 ± 1.85
ValSer: 4.262 ± 1.405
ValThr: 6.974 ± 1.729
ValVal: 3.1 ± 1.256
ValTrp: 1.162 ± 0.746
ValTyr: 1.937 ± 0.496
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.775 ± 0.387
TrpCys: 0.387 ± 0.456
TrpAsp: 1.55 ± 0.818
TrpGlu: 0.775 ± 0.453
TrpPhe: 0.775 ± 0.504
TrpGly: 0.775 ± 0.658
TrpHis: 0.0 ± 0.0
TrpIle: 0.387 ± 0.284
TrpLys: 1.162 ± 0.667
TrpLeu: 1.55 ± 0.719
TrpMet: 0.0 ± 0.0
TrpAsn: 1.162 ± 0.649
TrpPro: 0.387 ± 0.446
TrpGln: 0.775 ± 0.656
TrpArg: 1.162 ± 0.481
TrpSer: 1.162 ± 0.48
TrpThr: 0.775 ± 0.387
TrpVal: 1.162 ± 0.558
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.775 ± 0.387
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.55 ± 0.847
TyrCys: 1.162 ± 0.658
TyrAsp: 1.937 ± 0.84
TyrGlu: 0.387 ± 0.328
TyrPhe: 1.55 ± 0.806
TyrGly: 2.325 ± 0.652
TyrHis: 0.387 ± 0.284
TyrIle: 1.162 ± 0.558
TyrLys: 0.775 ± 0.366
TyrLeu: 2.712 ± 0.961
TyrMet: 0.0 ± 0.0
TyrAsn: 0.0 ± 0.0
TyrPro: 1.55 ± 0.912
TyrGln: 0.775 ± 0.567
TyrArg: 3.487 ± 1.118
TyrSer: 0.775 ± 0.359
TyrThr: 3.487 ± 0.975
TyrVal: 1.937 ± 0.496
TyrTrp: 0.775 ± 0.396
TyrTyr: 1.162 ± 0.374
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2582 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
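A protein-level bootstrap of this kind could look roughly like the Python sketch below. It is an illustration under stated assumptions, not the database's actual procedure: the function names, the one-letter sequence input, the fixed seed, and the use of the sample standard deviation as the error measure are all hypothetical choices.

```python
import random
import statistics

def dipeptide_permille(proteins, pair):
    """Frequency of one dipeptide (per mille) over a list of sequences."""
    count = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                count += 1
    return 1000.0 * count / total if total else 0.0

def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Resample whole proteins with replacement and return the spread.

    Assumption: the error is the sample standard deviation of the
    frequency across replicates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_replicates):
        # Each replicate draws as many proteins as the original set contains.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(dipeptide_permille(sample, pair))
    return statistics.stdev(estimates)

toy_proteins = ["MAALGG", "MKAAV", "MGGLL", "MAAAA"]
error = bootstrap_error(toy_proteins, "AA")
print(round(error, 3))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins in the proteome.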

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski