Amino acid dipepetide frequency for Canis familiaris papillomavirus 16

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.776AlaAla: 3.776 ± 0.773
0.755AlaCys: 0.755 ± 0.631
3.776AlaAsp: 3.776 ± 0.905
3.021AlaGlu: 3.021 ± 1.197
3.399AlaPhe: 3.399 ± 0.838
7.175AlaGly: 7.175 ± 2.039
1.511AlaHis: 1.511 ± 0.605
3.399AlaIle: 3.399 ± 0.918
3.776AlaLys: 3.776 ± 1.072
6.798AlaLeu: 6.798 ± 1.343
1.511AlaMet: 1.511 ± 0.259
1.133AlaAsn: 1.133 ± 0.416
3.776AlaPro: 3.776 ± 1.397
1.511AlaGln: 1.511 ± 0.843
4.532AlaArg: 4.532 ± 1.53
4.154AlaSer: 4.154 ± 0.549
4.909AlaThr: 4.909 ± 1.065
2.266AlaVal: 2.266 ± 0.434
0.755AlaTrp: 0.755 ± 0.615
1.133AlaTyr: 1.133 ± 0.63
0.0AlaXaa: 0.0 ± 0.0
Cys
2.266CysAla: 2.266 ± 1.252
1.133CysCys: 1.133 ± 1.257
0.755CysAsp: 0.755 ± 0.505
0.755CysGlu: 0.755 ± 0.574
0.755CysPhe: 0.755 ± 0.364
0.755CysGly: 0.755 ± 0.868
0.378CysHis: 0.378 ± 0.365
0.755CysIle: 0.755 ± 0.574
0.755CysLys: 0.755 ± 0.396
2.266CysLeu: 2.266 ± 0.891
0.0CysMet: 0.0 ± 0.0
1.888CysAsn: 1.888 ± 0.953
2.266CysPro: 2.266 ± 0.705
0.0CysGln: 0.0 ± 0.0
0.755CysArg: 0.755 ± 0.636
1.511CysSer: 1.511 ± 1.229
1.511CysThr: 1.511 ± 1.413
1.511CysVal: 1.511 ± 0.485
0.378CysTrp: 0.378 ± 0.365
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
5.287AspAla: 5.287 ± 1.288
0.755AspCys: 0.755 ± 0.364
4.532AspAsp: 4.532 ± 1.771
2.644AspGlu: 2.644 ± 1.52
1.888AspPhe: 1.888 ± 1.12
3.021AspGly: 3.021 ± 1.024
0.755AspHis: 0.755 ± 0.409
3.776AspIle: 3.776 ± 1.276
1.133AspLys: 1.133 ± 0.592
5.665AspLeu: 5.665 ± 1.302
1.133AspMet: 1.133 ± 0.544
2.644AspAsn: 2.644 ± 0.589
3.399AspPro: 3.399 ± 0.983
2.266AspGln: 2.266 ± 0.634
3.021AspArg: 3.021 ± 0.521
8.686AspSer: 8.686 ± 1.665
5.665AspThr: 5.665 ± 1.414
3.776AspVal: 3.776 ± 1.721
1.888AspTrp: 1.888 ± 0.895
1.511AspTyr: 1.511 ± 0.548
0.0AspXaa: 0.0 ± 0.0
Glu
2.266GluAla: 2.266 ± 0.562
0.755GluCys: 0.755 ± 0.615
5.287GluAsp: 5.287 ± 1.376
7.931GluGlu: 7.931 ± 2.689
2.266GluPhe: 2.266 ± 0.938
5.665GluGly: 5.665 ± 1.679
1.511GluHis: 1.511 ± 0.563
0.755GluIle: 0.755 ± 0.617
0.755GluLys: 0.755 ± 0.45
3.399GluLeu: 3.399 ± 1.283
0.755GluMet: 0.755 ± 0.409
1.511GluAsn: 1.511 ± 0.625
4.532GluPro: 4.532 ± 1.738
1.133GluGln: 1.133 ± 0.616
3.399GluArg: 3.399 ± 1.658
3.776GluSer: 3.776 ± 0.982
3.021GluThr: 3.021 ± 0.618
4.909GluVal: 4.909 ± 1.834
0.755GluTrp: 0.755 ± 0.378
0.755GluTyr: 0.755 ± 0.364
0.0GluXaa: 0.0 ± 0.0
Phe
1.511PheAla: 1.511 ± 0.716
1.133PheCys: 1.133 ± 0.57
2.266PheAsp: 2.266 ± 1.239
3.399PheGlu: 3.399 ± 1.843
3.776PhePhe: 3.776 ± 1.878
2.644PheGly: 2.644 ± 0.813
0.755PheHis: 0.755 ± 0.409
1.511PheIle: 1.511 ± 0.5
3.776PheLys: 3.776 ± 1.902
3.776PheLeu: 3.776 ± 1.319
0.0PheMet: 0.0 ± 0.0
1.511PheAsn: 1.511 ± 0.605
1.511PhePro: 1.511 ± 0.605
0.378PheGln: 0.378 ± 0.365
2.266PheArg: 2.266 ± 0.528
1.888PheSer: 1.888 ± 1.007
1.511PheThr: 1.511 ± 0.645
1.888PheVal: 1.888 ± 0.88
1.888PheTrp: 1.888 ± 0.559
0.755PheTyr: 0.755 ± 0.772
0.0PheXaa: 0.0 ± 0.0
Gly
6.042GlyAla: 6.042 ± 1.652
1.133GlyCys: 1.133 ± 0.626
6.42GlyAsp: 6.42 ± 2.047
5.287GlyGlu: 5.287 ± 0.826
1.888GlyPhe: 1.888 ± 1.058
7.553GlyGly: 7.553 ± 2.194
1.888GlyHis: 1.888 ± 0.417
5.287GlyIle: 5.287 ± 1.223
2.266GlyLys: 2.266 ± 0.763
4.532GlyLeu: 4.532 ± 1.325
1.133GlyMet: 1.133 ± 0.601
2.644GlyAsn: 2.644 ± 0.741
4.909GlyPro: 4.909 ± 1.909
1.888GlyGln: 1.888 ± 0.859
3.021GlyArg: 3.021 ± 0.733
4.909GlySer: 4.909 ± 1.483
4.909GlyThr: 4.909 ± 1.54
4.154GlyVal: 4.154 ± 1.847
0.755GlyTrp: 0.755 ± 0.439
1.133GlyTyr: 1.133 ± 0.627
0.0GlyXaa: 0.0 ± 0.0
His
1.511HisAla: 1.511 ± 0.75
0.378HisCys: 0.378 ± 0.426
1.133HisAsp: 1.133 ± 0.437
0.378HisGlu: 0.378 ± 0.359
1.511HisPhe: 1.511 ± 0.259
0.755HisGly: 0.755 ± 0.505
0.755HisHis: 0.755 ± 0.378
0.755HisIle: 0.755 ± 0.402
1.888HisLys: 1.888 ± 0.853
1.133HisLeu: 1.133 ± 0.749
0.0HisMet: 0.0 ± 0.0
1.133HisAsn: 1.133 ± 0.524
1.511HisPro: 1.511 ± 0.414
1.511HisGln: 1.511 ± 0.791
1.888HisArg: 1.888 ± 0.906
2.266HisSer: 2.266 ± 1.167
1.133HisThr: 1.133 ± 0.669
1.888HisVal: 1.888 ± 1.21
0.755HisTrp: 0.755 ± 0.421
1.133HisTyr: 1.133 ± 0.861
0.0HisXaa: 0.0 ± 0.0
Ile
1.133IleAla: 1.133 ± 0.612
1.511IleCys: 1.511 ± 0.727
1.888IleAsp: 1.888 ± 0.711
3.021IleGlu: 3.021 ± 1.304
1.133IlePhe: 1.133 ± 0.701
2.644IleGly: 2.644 ± 0.905
0.755IleHis: 0.755 ± 0.574
1.888IleIle: 1.888 ± 0.711
0.378IleLys: 0.378 ± 0.287
2.266IleLeu: 2.266 ± 0.761
0.755IleMet: 0.755 ± 0.574
2.266IleAsn: 2.266 ± 0.637
4.154IlePro: 4.154 ± 1.949
1.511IleGln: 1.511 ± 0.802
1.888IleArg: 1.888 ± 0.875
3.021IleSer: 3.021 ± 0.723
3.399IleThr: 3.399 ± 1.043
4.909IleVal: 4.909 ± 0.945
0.378IleTrp: 0.378 ± 0.359
0.378IleTyr: 0.378 ± 0.607
0.0IleXaa: 0.0 ± 0.0
Lys
2.644LysAla: 2.644 ± 0.45
1.133LysCys: 1.133 ± 0.716
3.399LysAsp: 3.399 ± 1.341
1.511LysGlu: 1.511 ± 1.035
1.133LysPhe: 1.133 ± 0.669
2.644LysGly: 2.644 ± 1.24
1.511LysHis: 1.511 ± 1.148
0.755LysIle: 0.755 ± 0.505
2.644LysLys: 2.644 ± 0.715
2.266LysLeu: 2.266 ± 0.824
0.378LysMet: 0.378 ± 0.365
0.378LysAsn: 0.378 ± 0.607
0.755LysPro: 0.755 ± 0.868
3.021LysGln: 3.021 ± 0.974
6.042LysArg: 6.042 ± 1.504
4.909LysSer: 4.909 ± 1.952
1.511LysThr: 1.511 ± 0.625
2.266LysVal: 2.266 ± 1.113
0.378LysTrp: 0.378 ± 0.359
1.511LysTyr: 1.511 ± 1.014
0.0LysXaa: 0.0 ± 0.0
Leu
5.287LeuAla: 5.287 ± 1.2
2.266LeuCys: 2.266 ± 1.678
7.931LeuAsp: 7.931 ± 1.166
2.644LeuGlu: 2.644 ± 0.733
3.399LeuPhe: 3.399 ± 1.615
6.798LeuGly: 6.798 ± 1.762
3.399LeuHis: 3.399 ± 1.365
3.021LeuIle: 3.021 ± 1.699
3.399LeuLys: 3.399 ± 0.881
10.952LeuLeu: 10.952 ± 2.176
2.644LeuMet: 2.644 ± 0.775
2.266LeuAsn: 2.266 ± 0.97
5.287LeuPro: 5.287 ± 1.404
5.665LeuGln: 5.665 ± 0.863
5.665LeuArg: 5.665 ± 1.066
6.42LeuSer: 6.42 ± 1.886
6.42LeuThr: 6.42 ± 1.273
1.888LeuVal: 1.888 ± 0.51
0.755LeuTrp: 0.755 ± 0.505
1.511LeuTyr: 1.511 ± 0.945
0.0LeuXaa: 0.0 ± 0.0
Met
1.888MetAla: 1.888 ± 0.679
0.0MetCys: 0.0 ± 0.0
1.133MetAsp: 1.133 ± 0.544
1.511MetGlu: 1.511 ± 0.259
1.133MetPhe: 1.133 ± 0.367
1.133MetGly: 1.133 ± 0.512
0.0MetHis: 0.0 ± 0.0
0.755MetIle: 0.755 ± 0.632
0.755MetLys: 0.755 ± 0.574
0.755MetLeu: 0.755 ± 0.378
0.378MetMet: 0.378 ± 0.412
0.0MetAsn: 0.0 ± 0.0
0.378MetPro: 0.378 ± 0.365
0.755MetGln: 0.755 ± 0.411
1.133MetArg: 1.133 ± 0.612
1.511MetSer: 1.511 ± 0.727
1.133MetThr: 1.133 ± 0.749
1.133MetVal: 1.133 ± 0.596
0.378MetTrp: 0.378 ± 0.359
0.378MetTyr: 0.378 ± 0.308
0.0MetXaa: 0.0 ± 0.0
Asn
2.644AsnAla: 2.644 ± 1.468
1.511AsnCys: 1.511 ± 0.855
1.511AsnAsp: 1.511 ± 0.79
1.133AsnGlu: 1.133 ± 0.544
1.888AsnPhe: 1.888 ± 1.12
2.266AsnGly: 2.266 ± 0.733
1.133AsnHis: 1.133 ± 0.544
0.755AsnIle: 0.755 ± 0.574
0.755AsnLys: 0.755 ± 0.396
3.021AsnLeu: 3.021 ± 0.914
1.133AsnMet: 1.133 ± 0.544
2.266AsnAsn: 2.266 ± 0.481
3.776AsnPro: 3.776 ± 1.473
0.755AsnGln: 0.755 ± 0.439
3.776AsnArg: 3.776 ± 1.791
2.266AsnSer: 2.266 ± 0.481
3.021AsnThr: 3.021 ± 0.747
2.266AsnVal: 2.266 ± 0.589
0.378AsnTrp: 0.378 ± 0.287
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
7.553ProAla: 7.553 ± 2.04
1.133ProCys: 1.133 ± 0.743
4.909ProAsp: 4.909 ± 1.081
4.532ProGlu: 4.532 ± 1.11
2.266ProPhe: 2.266 ± 0.669
5.287ProGly: 5.287 ± 1.682
1.133ProHis: 1.133 ± 0.928
1.888ProIle: 1.888 ± 0.98
3.399ProLys: 3.399 ± 1.426
9.063ProLeu: 9.063 ± 2.21
0.378ProMet: 0.378 ± 0.287
1.511ProAsn: 1.511 ± 1.038
12.462ProPro: 12.462 ± 2.288
1.133ProGln: 1.133 ± 0.768
3.399ProArg: 3.399 ± 1.634
5.287ProSer: 5.287 ± 1.387
4.532ProThr: 4.532 ± 1.267
6.42ProVal: 6.42 ± 1.792
0.378ProTrp: 0.378 ± 0.607
2.266ProTyr: 2.266 ± 0.749
0.0ProXaa: 0.0 ± 0.0
Gln
3.021GlnAla: 3.021 ± 1.073
1.133GlnCys: 1.133 ± 0.861
1.511GlnAsp: 1.511 ± 0.825
1.511GlnGlu: 1.511 ± 0.259
1.511GlnPhe: 1.511 ± 0.5
2.266GlnGly: 2.266 ± 0.355
1.511GlnHis: 1.511 ± 0.467
2.266GlnIle: 2.266 ± 0.752
3.399GlnLys: 3.399 ± 0.84
3.776GlnLeu: 3.776 ± 0.682
0.755GlnMet: 0.755 ± 0.689
2.266GlnAsn: 2.266 ± 1.29
2.266GlnPro: 2.266 ± 0.647
1.511GlnGln: 1.511 ± 0.763
1.133GlnArg: 1.133 ± 0.367
2.266GlnSer: 2.266 ± 0.817
3.021GlnThr: 3.021 ± 0.99
3.021GlnVal: 3.021 ± 0.849
0.755GlnTrp: 0.755 ± 0.615
1.133GlnTyr: 1.133 ± 0.367
0.0GlnXaa: 0.0 ± 0.0
Arg
3.776ArgAla: 3.776 ± 0.975
1.888ArgCys: 1.888 ± 1.128
2.644ArgAsp: 2.644 ± 0.831
2.644ArgGlu: 2.644 ± 0.555
2.644ArgPhe: 2.644 ± 0.636
4.154ArgGly: 4.154 ± 0.889
1.888ArgHis: 1.888 ± 0.808
1.888ArgIle: 1.888 ± 0.698
4.532ArgLys: 4.532 ± 1.447
6.042ArgLeu: 6.042 ± 1.671
0.755ArgMet: 0.755 ± 0.396
1.888ArgAsn: 1.888 ± 0.51
4.909ArgPro: 4.909 ± 1.86
1.888ArgGln: 1.888 ± 0.635
8.308ArgArg: 8.308 ± 3.109
2.266ArgSer: 2.266 ± 0.696
6.798ArgThr: 6.798 ± 1.053
6.042ArgVal: 6.042 ± 1.12
2.266ArgTrp: 2.266 ± 0.706
3.021ArgTyr: 3.021 ± 1.197
0.0ArgXaa: 0.0 ± 0.0
Ser
4.154SerAla: 4.154 ± 0.873
0.378SerCys: 0.378 ± 0.287
4.909SerAsp: 4.909 ± 1.044
4.154SerGlu: 4.154 ± 1.144
1.511SerPhe: 1.511 ± 0.599
5.665SerGly: 5.665 ± 0.963
0.755SerHis: 0.755 ± 0.378
2.644SerIle: 2.644 ± 1.068
1.511SerLys: 1.511 ± 0.716
5.287SerLeu: 5.287 ± 0.998
1.511SerMet: 1.511 ± 0.461
4.532SerAsn: 4.532 ± 1.104
7.175SerPro: 7.175 ± 2.655
6.798SerGln: 6.798 ± 0.971
5.287SerArg: 5.287 ± 0.892
6.042SerSer: 6.042 ± 2.074
7.553SerThr: 7.553 ± 2.141
4.532SerVal: 4.532 ± 0.956
0.755SerTrp: 0.755 ± 0.401
0.378SerTyr: 0.378 ± 0.47
0.0SerXaa: 0.0 ± 0.0
Thr
3.021ThrAla: 3.021 ± 1.123
1.888ThrCys: 1.888 ± 0.628
3.399ThrAsp: 3.399 ± 0.755
5.287ThrGlu: 5.287 ± 0.76
1.511ThrPhe: 1.511 ± 0.548
4.532ThrGly: 4.532 ± 1.151
1.133ThrHis: 1.133 ± 0.437
3.021ThrIle: 3.021 ± 1.209
1.888ThrLys: 1.888 ± 0.874
8.308ThrLeu: 8.308 ± 1.705
0.755ThrMet: 0.755 ± 0.411
3.021ThrAsn: 3.021 ± 0.773
8.686ThrPro: 8.686 ± 2.313
2.644ThrGln: 2.644 ± 0.678
6.42ThrArg: 6.42 ± 0.87
5.665ThrSer: 5.665 ± 1.038
4.909ThrThr: 4.909 ± 1.729
4.532ThrVal: 4.532 ± 1.298
1.133ThrTrp: 1.133 ± 0.61
1.888ThrTyr: 1.888 ± 0.76
0.0ThrXaa: 0.0 ± 0.0
Val
2.644ValAla: 2.644 ± 1.018
0.755ValCys: 0.755 ± 1.214
4.909ValAsp: 4.909 ± 1.768
1.888ValGlu: 1.888 ± 0.614
1.888ValPhe: 1.888 ± 0.848
4.909ValGly: 4.909 ± 1.388
1.888ValHis: 1.888 ± 0.867
3.021ValIle: 3.021 ± 0.754
1.888ValLys: 1.888 ± 0.636
4.532ValLeu: 4.532 ± 1.131
1.511ValMet: 1.511 ± 0.501
1.511ValAsn: 1.511 ± 1.014
5.665ValPro: 5.665 ± 1.156
3.776ValGln: 3.776 ± 1.605
5.665ValArg: 5.665 ± 1.332
6.042ValSer: 6.042 ± 1.607
5.665ValThr: 5.665 ± 1.426
4.909ValVal: 4.909 ± 1.295
0.755ValTrp: 0.755 ± 0.729
2.644ValTyr: 2.644 ± 0.555
0.0ValXaa: 0.0 ± 0.0
Trp
1.133TrpAla: 1.133 ± 0.367
0.378TrpCys: 0.378 ± 0.47
0.755TrpAsp: 0.755 ± 0.411
0.755TrpGlu: 0.755 ± 0.723
0.378TrpPhe: 0.378 ± 0.359
0.755TrpGly: 0.755 ± 0.631
0.0TrpHis: 0.0 ± 0.0
0.755TrpIle: 0.755 ± 0.378
1.133TrpLys: 1.133 ± 0.743
1.511TrpLeu: 1.511 ± 0.727
0.378TrpMet: 0.378 ± 0.365
1.511TrpAsn: 1.511 ± 0.75
0.378TrpPro: 0.378 ± 0.302
0.755TrpGln: 0.755 ± 0.717
0.755TrpArg: 0.755 ± 0.615
1.511TrpSer: 1.511 ± 0.749
1.511TrpThr: 1.511 ± 1.14
1.133TrpVal: 1.133 ± 0.544
0.0TrpTrp: 0.0 ± 0.0
0.755TrpTyr: 0.755 ± 0.411
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.511TyrAla: 1.511 ± 0.507
0.378TyrCys: 0.378 ± 0.47
0.378TyrAsp: 0.378 ± 0.365
1.133TyrGlu: 1.133 ± 0.367
2.266TyrPhe: 2.266 ± 0.844
1.133TyrGly: 1.133 ± 0.443
0.755TyrHis: 0.755 ± 0.411
0.378TyrIle: 0.378 ± 0.287
0.755TyrLys: 0.755 ± 0.378
2.266TyrLeu: 2.266 ± 0.624
0.378TyrMet: 0.378 ± 0.287
0.378TyrAsn: 0.378 ± 0.607
1.133TyrPro: 1.133 ± 0.609
1.133TyrGln: 1.133 ± 0.63
1.888TyrArg: 1.888 ± 0.76
1.133TyrSer: 1.133 ± 0.416
1.511TyrThr: 1.511 ± 0.58
3.021TyrVal: 3.021 ± 0.818
0.755TyrTrp: 0.755 ± 0.421
1.133TyrTyr: 1.133 ± 0.705
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (2649 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski