Amino acid dipeptide frequency for Canine papillomavirus 21

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
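As a rough illustration of what such a calculation involves, the sketch below counts overlapping residue pairs and reports them in per mille. It is not the database's own code: the function name `dipeptide_permille`, the use of one-letter codes, the mapping of every non-standard letter to X (Xaa), and the normalization by the total number of dipeptide windows are all assumptions, so results may differ slightly from the values tabulated on this page.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille}, pooled over all proteins.

    Hypothetical helper: normalization by the number of dipeptide
    windows is an assumption, not the documented method.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard letters to X (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (not real viral sequences): "B" is non-standard and becomes X.
freqs = dipeptide_permille(["MAAK", "AAB"])
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

In the toy input, AA occurs in two of the five windows, so it is reported as 400 ‰.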

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (1/400). Since this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
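The arithmetic behind the expected value can be sketched as follows: 1/400 is 0.25%, or 2.5 ‰, and with Xaa included 1/441 is roughly 2.27 ‰. The helper names `expected_permille` and `classify` are hypothetical, and comparing against the plain uniform expectation is an assumption about how the red and blue marking is decided.

```python
def expected_permille(n_residue_types=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / n_residue_types ** 2

def classify(observed_permille, expected):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected:
        return "over"      # would be marked red
    if observed_permille < expected:
        return "under"     # would be marked blue
    return "expected"

expected_20 = expected_permille(20)   # 400 dipeptides
expected_21 = expected_permille(21)   # 441 dipeptides, counting Xaa

# Two values taken from the table on this page.
print("AlaAla", classify(5.945, expected_20))
print("CysHis", classify(0.396, expected_20))
```

Under this rule AlaAla (5.945 ‰) counts as overrepresented and CysHis (0.396 ‰) as underrepresented.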

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰, listed as frequency ± error and grouped by first residue. Within each group the second residue follows the order Ala, Cys, Asp, Glu, Phe, Gly, His, Ile, Lys, Leu, Met, Asn, Pro, Gln, Arg, Ser, Thr, Val, Trp, Tyr, Xaa.

Ala
AlaAla: 5.945 ± 2.558
AlaCys: 2.378 ± 1.751
AlaAsp: 3.171 ± 1.219
AlaGlu: 3.964 ± 1.22
AlaPhe: 3.964 ± 0.947
AlaGly: 4.36 ± 1.095
AlaHis: 1.585 ± 0.941
AlaIle: 2.774 ± 1.507
AlaLys: 3.964 ± 1.24
AlaLeu: 6.342 ± 1.586
AlaMet: 1.189 ± 0.357
AlaAsn: 1.585 ± 0.573
AlaPro: 3.964 ± 1.23
AlaGln: 2.378 ± 1.332
AlaArg: 5.549 ± 1.795
AlaSer: 4.36 ± 1.398
AlaThr: 4.36 ± 0.885
AlaVal: 1.982 ± 0.623
AlaTrp: 0.793 ± 0.373
AlaTyr: 0.793 ± 0.414
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 1.585 ± 0.83
CysCys: 1.585 ± 0.664
CysAsp: 0.793 ± 0.415
CysGlu: 0.793 ± 0.373
CysPhe: 1.982 ± 0.834
CysGly: 2.378 ± 1.89
CysHis: 0.396 ± 0.379
CysIle: 2.774 ± 2.196
CysLys: 2.378 ± 0.84
CysLeu: 2.378 ± 1.279
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.793 ± 0.651
CysGln: 1.585 ± 0.303
CysArg: 1.982 ± 0.736
CysSer: 2.378 ± 1.29
CysThr: 1.585 ± 1.158
CysVal: 1.982 ± 1.148
CysTrp: 0.793 ± 0.555
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0

Asp
AspAla: 2.774 ± 0.936
AspCys: 2.378 ± 1.026
AspAsp: 7.531 ± 3.25
AspGlu: 5.945 ± 1.373
AspPhe: 2.774 ± 0.792
AspGly: 5.153 ± 1.781
AspHis: 0.0 ± 0.0
AspIle: 2.378 ± 1.2
AspLys: 1.982 ± 0.38
AspLeu: 4.756 ± 1.721
AspMet: 0.0 ± 0.0
AspAsn: 1.585 ± 0.573
AspPro: 5.945 ± 1.431
AspGln: 2.774 ± 1.276
AspArg: 1.585 ± 0.548
AspSer: 4.756 ± 1.596
AspThr: 3.964 ± 1.614
AspVal: 3.964 ± 1.024
AspTrp: 1.982 ± 0.997
AspTyr: 0.793 ± 0.447
AspXaa: 0.0 ± 0.0

Glu
GluAla: 3.567 ± 1.343
GluCys: 1.585 ± 0.659
GluAsp: 7.134 ± 1.506
GluGlu: 3.964 ± 1.35
GluPhe: 1.585 ± 0.632
GluGly: 4.756 ± 1.695
GluHis: 1.189 ± 0.728
GluIle: 0.793 ± 0.415
GluLys: 1.585 ± 1.018
GluLeu: 4.756 ± 1.666
GluMet: 0.793 ± 0.667
GluAsn: 2.774 ± 0.881
GluPro: 3.171 ± 1.071
GluGln: 2.378 ± 0.64
GluArg: 3.964 ± 1.368
GluSer: 5.153 ± 0.922
GluThr: 2.774 ± 1.242
GluVal: 3.171 ± 1.128
GluTrp: 0.396 ± 0.379
GluTyr: 1.189 ± 0.679
GluXaa: 0.0 ± 0.0

Phe
PheAla: 1.189 ± 0.732
PheCys: 1.585 ± 0.664
PheAsp: 3.567 ± 1.481
PheGlu: 3.567 ± 0.954
PhePhe: 2.774 ± 1.295
PheGly: 3.567 ± 0.954
PheHis: 0.793 ± 0.415
PheIle: 1.585 ± 0.92
PheLys: 2.774 ± 0.843
PheLeu: 3.567 ± 1.715
PheMet: 0.396 ± 0.441
PheAsn: 1.585 ± 0.548
PhePro: 0.793 ± 0.373
PheGln: 1.189 ± 0.772
PheArg: 1.982 ± 0.471
PheSer: 2.378 ± 0.78
PheThr: 1.982 ± 0.851
PheVal: 2.774 ± 1.006
PheTrp: 1.189 ± 0.667
PheTyr: 1.585 ± 0.632
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 3.567 ± 0.644
GlyCys: 0.793 ± 0.421
GlyAsp: 7.134 ± 1.561
GlyGlu: 5.153 ± 1.771
GlyPhe: 1.982 ± 0.861
GlyGly: 9.116 ± 3.665
GlyHis: 2.378 ± 1.15
GlyIle: 2.378 ± 0.981
GlyLys: 3.964 ± 1.117
GlyLeu: 5.945 ± 1.495
GlyMet: 1.982 ± 0.685
GlyAsn: 3.171 ± 0.999
GlyPro: 4.36 ± 1.32
GlyGln: 1.982 ± 0.635
GlyArg: 9.116 ± 5.38
GlySer: 5.153 ± 1.663
GlyThr: 3.171 ± 1.26
GlyVal: 2.774 ± 0.929
GlyTrp: 0.396 ± 0.334
GlyTyr: 0.793 ± 0.447
GlyXaa: 0.0 ± 0.0

His
HisAla: 2.378 ± 0.942
HisCys: 0.793 ± 0.414
HisAsp: 0.793 ± 0.373
HisGlu: 0.793 ± 0.414
HisPhe: 1.982 ± 0.709
HisGly: 2.378 ± 1.215
HisHis: 0.793 ± 0.559
HisIle: 1.189 ± 0.655
HisLys: 1.585 ± 0.986
HisLeu: 1.585 ± 0.731
HisMet: 0.0 ± 0.0
HisAsn: 0.793 ± 0.373
HisPro: 1.982 ± 1.248
HisGln: 1.189 ± 0.598
HisArg: 0.396 ± 0.321
HisSer: 1.982 ± 1.019
HisThr: 0.793 ± 0.571
HisVal: 1.189 ± 0.428
HisTrp: 0.396 ± 0.326
HisTyr: 1.585 ± 0.608
HisXaa: 0.0 ± 0.0

Ile
IleAla: 2.774 ± 1.11
IleCys: 0.793 ± 0.614
IleAsp: 1.585 ± 0.518
IleGlu: 1.585 ± 0.8
IlePhe: 0.793 ± 0.559
IleGly: 2.774 ± 0.598
IleHis: 0.793 ± 0.415
IleIle: 1.585 ± 0.759
IleLys: 1.189 ± 0.518
IleLeu: 3.171 ± 0.997
IleMet: 0.793 ± 0.687
IleAsn: 1.585 ± 0.83
IlePro: 3.567 ± 2.227
IleGln: 0.793 ± 0.667
IleArg: 0.396 ± 0.511
IleSer: 1.982 ± 0.903
IleThr: 0.793 ± 0.4
IleVal: 4.756 ± 1.698
IleTrp: 0.396 ± 0.511
IleTyr: 1.585 ± 0.573
IleXaa: 0.0 ± 0.0

Lys
LysAla: 3.964 ± 1.139
LysCys: 1.585 ± 0.882
LysAsp: 0.793 ± 0.415
LysGlu: 2.774 ± 1.582
LysPhe: 1.585 ± 0.573
LysGly: 3.964 ± 1.122
LysHis: 1.585 ± 0.986
LysIle: 2.378 ± 0.872
LysLys: 3.171 ± 0.721
LysLeu: 1.982 ± 0.889
LysMet: 1.982 ± 0.836
LysAsn: 1.982 ± 0.788
LysPro: 1.585 ± 0.789
LysGln: 1.982 ± 0.38
LysArg: 5.549 ± 1.241
LysSer: 5.549 ± 3.777
LysThr: 1.585 ± 0.828
LysVal: 3.964 ± 0.891
LysTrp: 0.396 ± 0.326
LysTyr: 1.189 ± 0.766
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 4.36 ± 1.191
LeuCys: 2.378 ± 1.055
LeuAsp: 6.342 ± 2.101
LeuGlu: 5.153 ± 0.971
LeuPhe: 5.549 ± 0.902
LeuGly: 6.738 ± 1.754
LeuHis: 2.378 ± 0.724
LeuIle: 1.585 ± 0.988
LeuLys: 3.964 ± 1.088
LeuLeu: 8.72 ± 2.12
LeuMet: 1.982 ± 1.166
LeuAsn: 1.189 ± 0.532
LeuPro: 2.774 ± 0.889
LeuGln: 5.549 ± 1.834
LeuArg: 3.171 ± 1.312
LeuSer: 7.134 ± 1.56
LeuThr: 5.153 ± 1.731
LeuVal: 4.756 ± 1.412
LeuTrp: 0.0 ± 0.0
LeuTyr: 2.774 ± 0.811
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.189 ± 0.56
MetCys: 0.793 ± 0.421
MetAsp: 0.793 ± 0.421
MetGlu: 0.396 ± 0.379
MetPhe: 0.793 ± 0.415
MetGly: 0.396 ± 0.334
MetHis: 0.0 ± 0.0
MetIle: 0.793 ± 0.64
MetLys: 0.793 ± 0.667
MetLeu: 2.378 ± 1.146
MetMet: 0.793 ± 0.373
MetAsn: 1.982 ± 1.628
MetPro: 1.189 ± 0.598
MetGln: 1.585 ± 0.609
MetArg: 2.774 ± 1.461
MetSer: 2.378 ± 0.914
MetThr: 1.189 ± 0.671
MetVal: 1.189 ± 0.722
MetTrp: 0.793 ± 0.687
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.567 ± 1.082
AsnCys: 1.585 ± 1.02
AsnAsp: 1.982 ± 0.63
AsnGlu: 2.774 ± 1.174
AsnPhe: 2.774 ± 1.088
AsnGly: 0.793 ± 0.415
AsnHis: 0.0 ± 0.0
AsnIle: 1.982 ± 0.675
AsnLys: 1.982 ± 0.724
AsnLeu: 2.774 ± 0.797
AsnMet: 1.189 ± 0.736
AsnAsn: 1.585 ± 0.939
AsnPro: 3.567 ± 1.492
AsnGln: 1.189 ± 0.667
AsnArg: 2.378 ± 1.335
AsnSer: 2.774 ± 1.241
AsnThr: 1.982 ± 0.894
AsnVal: 0.793 ± 0.415
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.396 ± 0.326
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 3.964 ± 0.941
ProCys: 1.585 ± 0.882
ProAsp: 5.945 ± 1.895
ProGlu: 1.189 ± 0.518
ProPhe: 1.982 ± 0.617
ProGly: 3.964 ± 1.564
ProHis: 1.982 ± 0.549
ProIle: 0.793 ± 0.4
ProLys: 2.774 ± 1.18
ProLeu: 5.549 ± 2.013
ProMet: 0.396 ± 0.334
ProAsn: 0.793 ± 0.651
ProPro: 9.512 ± 2.779
ProGln: 2.378 ± 0.549
ProArg: 4.756 ± 2.208
ProSer: 4.756 ± 1.562
ProThr: 5.153 ± 1.569
ProVal: 5.945 ± 2.023
ProTrp: 0.793 ± 0.758
ProTyr: 3.171 ± 1.336
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 2.774 ± 0.486
GlnCys: 0.793 ± 0.667
GlnAsp: 1.189 ± 0.732
GlnGlu: 2.378 ± 1.348
GlnPhe: 1.189 ± 0.518
GlnGly: 2.774 ± 0.881
GlnHis: 1.189 ± 0.428
GlnIle: 1.189 ± 0.68
GlnLys: 1.585 ± 0.609
GlnLeu: 5.549 ± 1.363
GlnMet: 1.189 ± 0.667
GlnAsn: 2.774 ± 0.511
GlnPro: 1.982 ± 0.811
GlnGln: 1.189 ± 0.754
GlnArg: 2.378 ± 0.787
GlnSer: 1.982 ± 0.536
GlnThr: 1.982 ± 1.276
GlnVal: 1.585 ± 0.608
GlnTrp: 0.793 ± 0.667
GlnTyr: 1.585 ± 0.608
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 6.738 ± 1.676
ArgCys: 1.189 ± 0.777
ArgAsp: 2.774 ± 1.049
ArgGlu: 2.774 ± 1.209
ArgPhe: 1.982 ± 0.804
ArgGly: 6.738 ± 5.691
ArgHis: 3.964 ± 0.611
ArgIle: 1.189 ± 0.68
ArgLys: 3.171 ± 0.897
ArgLeu: 4.756 ± 0.84
ArgMet: 2.378 ± 1.232
ArgAsn: 1.982 ± 0.843
ArgPro: 5.549 ± 2.101
ArgGln: 2.378 ± 1.34
ArgArg: 9.909 ± 5.479
ArgSer: 6.738 ± 1.031
ArgThr: 4.756 ± 2.583
ArgVal: 2.378 ± 1.003
ArgTrp: 1.189 ± 0.732
ArgTyr: 1.189 ± 0.732
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 5.153 ± 1.97
SerCys: 1.982 ± 1.157
SerAsp: 3.567 ± 1.014
SerGlu: 4.36 ± 0.71
SerPhe: 3.567 ± 1.296
SerGly: 5.549 ± 1.332
SerHis: 2.378 ± 0.83
SerIle: 2.378 ± 1.113
SerLys: 3.567 ± 1.385
SerLeu: 6.738 ± 1.426
SerMet: 2.378 ± 1.26
SerAsn: 3.567 ± 0.84
SerPro: 4.36 ± 1.545
SerGln: 1.982 ± 0.471
SerArg: 4.756 ± 0.768
SerSer: 9.116 ± 1.212
SerThr: 7.927 ± 2.262
SerVal: 4.756 ± 1.14
SerTrp: 0.793 ± 0.373
SerTyr: 0.793 ± 0.447
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 3.567 ± 0.984
ThrCys: 1.982 ± 0.676
ThrAsp: 1.982 ± 0.947
ThrGlu: 2.774 ± 1.125
ThrPhe: 1.189 ± 0.696
ThrGly: 3.567 ± 1.732
ThrHis: 1.982 ± 1.055
ThrIle: 2.378 ± 1.003
ThrLys: 2.378 ± 1.079
ThrLeu: 5.153 ± 1.109
ThrMet: 3.171 ± 1.391
ThrAsn: 3.171 ± 1.031
ThrPro: 5.153 ± 1.442
ThrGln: 2.774 ± 0.824
ThrArg: 5.945 ± 2.503
ThrSer: 5.549 ± 1.327
ThrThr: 4.756 ± 1.584
ThrVal: 3.964 ± 1.402
ThrTrp: 0.396 ± 0.379
ThrTyr: 2.378 ± 0.921
ThrXaa: 0.0 ± 0.0

Val
ValAla: 3.567 ± 0.73
ValCys: 1.189 ± 0.772
ValAsp: 4.36 ± 0.734
ValGlu: 3.964 ± 1.483
ValPhe: 0.793 ± 0.415
ValGly: 4.756 ± 0.988
ValHis: 0.793 ± 0.642
ValIle: 2.378 ± 1.303
ValLys: 4.36 ± 0.873
ValLeu: 2.378 ± 0.873
ValMet: 1.189 ± 0.651
ValAsn: 1.982 ± 0.549
ValPro: 4.756 ± 0.883
ValGln: 1.585 ± 0.548
ValArg: 2.378 ± 1.028
ValSer: 4.36 ± 1.018
ValThr: 7.531 ± 1.994
ValVal: 5.945 ± 1.727
ValTrp: 1.189 ± 0.357
ValTyr: 1.189 ± 0.428
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.396 ± 0.334
TrpCys: 0.396 ± 0.379
TrpAsp: 0.396 ± 0.326
TrpGlu: 0.396 ± 0.379
TrpPhe: 0.396 ± 0.379
TrpGly: 0.396 ± 0.334
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 1.189 ± 0.56
TrpLeu: 1.585 ± 0.986
TrpMet: 0.0 ± 0.0
TrpAsn: 1.189 ± 0.651
TrpPro: 0.793 ± 0.421
TrpGln: 0.793 ± 0.421
TrpArg: 1.585 ± 1.179
TrpSer: 0.396 ± 0.326
TrpThr: 1.585 ± 0.812
TrpVal: 1.585 ± 0.548
TrpTrp: 0.396 ± 0.379
TrpTyr: 0.396 ± 0.334
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.378 ± 0.978
TyrCys: 0.793 ± 0.651
TyrAsp: 1.189 ± 0.428
TyrGlu: 1.982 ± 1.487
TyrPhe: 1.585 ± 0.638
TyrGly: 1.189 ± 0.752
TyrHis: 0.396 ± 0.321
TyrIle: 1.189 ± 0.429
TyrLys: 1.189 ± 0.39
TyrLeu: 1.585 ± 0.986
TyrMet: 0.0 ± 0.0
TyrAsn: 0.793 ± 0.651
TyrPro: 1.585 ± 0.648
TyrGln: 0.396 ± 0.334
TyrArg: 3.171 ± 0.607
TyrSer: 0.793 ± 0.421
TyrThr: 1.189 ± 0.585
TyrVal: 1.189 ± 0.357
TyrTrp: 0.793 ± 0.421
TyrTyr: 1.982 ± 1.454
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 7 proteins (2524 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
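One way such an error estimate could be obtained is sketched below: whole proteins are resampled with replacement 100 times, the frequency is recomputed for each resample, and the spread of those estimates is reported. The page only states that bootstrapping was done 100 times at the protein level, so the use of the standard deviation as the error, the fixed seed, and the helper names `pair_permille` and `bootstrap_error` are assumptions.

```python
import random
import statistics

def pair_permille(proteins, pair):
    """Frequency of one dipeptide (per mille), pooled over the given proteins."""
    hits = total = 0
    for seq in proteins:
        windows = [seq[i:i + 2] for i in range(len(seq) - 1)]
        hits += windows.count(pair)
        total += len(windows)
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency when whole proteins are resampled with replacement."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample at the protein level: draw len(proteins) proteins with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    # Assumption: the reported error is the standard deviation of the estimates.
    return statistics.pstdev(estimates)

# Toy sequences for illustration only (not the real proteome).
proteins = ["MAAKL", "GGRRS", "AALLP"]
err = bootstrap_error(proteins, "AA")
print(round(pair_permille(proteins, "AA"), 3), "±", round(err, 3))
```

A dipeptide that never occurs gives 0.0 in every resample and therefore an error of 0.0, which is consistent with the "0.0 ± 0.0" entries in the table.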

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski