Amino acid dipepetide frequency for Canis familiaris papillomavirus 4

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.324AlaAla: 6.324 ± 1.628
0.422AlaCys: 0.422 ± 0.513
5.902AlaAsp: 5.902 ± 1.423
2.108AlaGlu: 2.108 ± 1.011
5.481AlaPhe: 5.481 ± 1.009
5.059AlaGly: 5.059 ± 0.995
1.265AlaHis: 1.265 ± 0.388
3.794AlaIle: 3.794 ± 0.854
2.951AlaLys: 2.951 ± 0.867
4.637AlaLeu: 4.637 ± 2.154
1.265AlaMet: 1.265 ± 0.691
1.686AlaAsn: 1.686 ± 0.668
3.794AlaPro: 3.794 ± 1.037
1.265AlaGln: 1.265 ± 1.085
6.745AlaArg: 6.745 ± 2.444
2.951AlaSer: 2.951 ± 0.916
6.745AlaThr: 6.745 ± 1.205
2.951AlaVal: 2.951 ± 1.309
0.843AlaTrp: 0.843 ± 0.437
1.686AlaTyr: 1.686 ± 0.874
0.0AlaXaa: 0.0 ± 0.0
Cys
2.53CysAla: 2.53 ± 1.618
0.843CysCys: 0.843 ± 0.743
1.686CysAsp: 1.686 ± 1.129
0.422CysGlu: 0.422 ± 0.357
0.0CysPhe: 0.0 ± 0.0
0.843CysGly: 0.843 ± 0.587
0.422CysHis: 0.422 ± 0.4
1.265CysIle: 1.265 ± 0.636
1.686CysLys: 1.686 ± 0.631
2.108CysLeu: 2.108 ± 0.841
0.0CysMet: 0.0 ± 0.0
0.843CysAsn: 0.843 ± 0.395
3.373CysPro: 3.373 ± 1.039
0.0CysGln: 0.0 ± 0.0
0.843CysArg: 0.843 ± 0.654
0.843CysSer: 0.843 ± 0.714
2.53CysThr: 2.53 ± 1.62
1.265CysVal: 1.265 ± 0.677
0.422CysTrp: 0.422 ± 0.4
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
6.324AspAla: 6.324 ± 0.919
1.686AspCys: 1.686 ± 0.722
3.794AspAsp: 3.794 ± 0.76
3.794AspGlu: 3.794 ± 1.89
2.53AspPhe: 2.53 ± 1.009
2.108AspGly: 2.108 ± 0.712
0.422AspHis: 0.422 ± 0.357
3.794AspIle: 3.794 ± 1.066
2.53AspLys: 2.53 ± 1.499
6.745AspLeu: 6.745 ± 1.12
1.686AspMet: 1.686 ± 0.601
3.373AspAsn: 3.373 ± 1.009
4.216AspPro: 4.216 ± 1.343
1.686AspGln: 1.686 ± 0.544
4.216AspArg: 4.216 ± 1.545
6.324AspSer: 6.324 ± 2.09
5.059AspThr: 5.059 ± 1.355
3.794AspVal: 3.794 ± 1.141
1.265AspTrp: 1.265 ± 0.811
2.951AspTyr: 2.951 ± 0.834
0.0AspXaa: 0.0 ± 0.0
Glu
3.794GluAla: 3.794 ± 0.745
0.422GluCys: 0.422 ± 0.4
5.481GluAsp: 5.481 ± 1.292
7.589GluGlu: 7.589 ± 2.928
1.265GluPhe: 1.265 ± 0.388
9.275GluGly: 9.275 ± 2.693
1.265GluHis: 1.265 ± 0.563
1.686GluIle: 1.686 ± 0.628
2.53GluLys: 2.53 ± 0.441
2.951GluLeu: 2.951 ± 1.014
0.0GluMet: 0.0 ± 0.0
2.108GluAsn: 2.108 ± 0.737
2.951GluPro: 2.951 ± 0.676
2.53GluGln: 2.53 ± 0.887
4.216GluArg: 4.216 ± 1.744
2.108GluSer: 2.108 ± 0.667
2.53GluThr: 2.53 ± 0.844
3.373GluVal: 3.373 ± 0.758
1.265GluTrp: 1.265 ± 0.699
0.843GluTyr: 0.843 ± 0.714
0.0GluXaa: 0.0 ± 0.0
Phe
2.951PheAla: 2.951 ± 1.424
1.265PheCys: 1.265 ± 0.615
2.951PheAsp: 2.951 ± 1.15
2.951PheGlu: 2.951 ± 1.279
2.53PhePhe: 2.53 ± 1.515
3.794PheGly: 3.794 ± 0.522
0.0PheHis: 0.0 ± 0.0
1.686PheIle: 1.686 ± 0.545
2.108PheLys: 2.108 ± 0.935
4.637PheLeu: 4.637 ± 0.965
0.0PheMet: 0.0 ± 0.0
1.265PheAsn: 1.265 ± 0.757
1.265PhePro: 1.265 ± 0.379
1.265PheGln: 1.265 ± 0.725
2.53PheArg: 2.53 ± 0.797
1.686PheSer: 1.686 ± 1.13
2.53PheThr: 2.53 ± 1.779
1.265PheVal: 1.265 ± 1.085
1.265PheTrp: 1.265 ± 0.379
0.422PheTyr: 0.422 ± 0.377
0.0PheXaa: 0.0 ± 0.0
Gly
2.951GlyAla: 2.951 ± 0.885
2.53GlyCys: 2.53 ± 1.055
7.589GlyAsp: 7.589 ± 1.926
8.432GlyGlu: 8.432 ± 1.189
1.265GlyPhe: 1.265 ± 0.574
8.432GlyGly: 8.432 ± 3.194
1.265GlyHis: 1.265 ± 0.476
4.216GlyIle: 4.216 ± 1.079
2.951GlyLys: 2.951 ± 0.793
5.481GlyLeu: 5.481 ± 1.464
2.108GlyMet: 2.108 ± 0.435
2.951GlyAsn: 2.951 ± 0.673
5.481GlyPro: 5.481 ± 1.996
2.951GlyGln: 2.951 ± 1.303
4.216GlyArg: 4.216 ± 1.791
4.216GlySer: 4.216 ± 1.178
6.324GlyThr: 6.324 ± 1.659
4.637GlyVal: 4.637 ± 2.231
0.0GlyTrp: 0.0 ± 0.0
2.53GlyTyr: 2.53 ± 0.776
0.0GlyXaa: 0.0 ± 0.0
His
0.843HisAla: 0.843 ± 0.587
0.422HisCys: 0.422 ± 0.513
0.843HisAsp: 0.843 ± 0.448
0.0HisGlu: 0.0 ± 0.0
1.686HisPhe: 1.686 ± 0.631
0.843HisGly: 0.843 ± 0.743
0.422HisHis: 0.422 ± 0.357
0.422HisIle: 0.422 ± 0.377
1.265HisLys: 1.265 ± 0.811
1.265HisLeu: 1.265 ± 0.379
0.0HisMet: 0.0 ± 0.0
0.422HisAsn: 0.422 ± 0.403
2.951HisPro: 2.951 ± 1.525
1.265HisGln: 1.265 ± 0.636
1.686HisArg: 1.686 ± 0.568
1.265HisSer: 1.265 ± 0.476
2.53HisThr: 2.53 ± 0.662
1.265HisVal: 1.265 ± 0.73
0.843HisTrp: 0.843 ± 0.496
1.265HisTyr: 1.265 ± 0.636
0.0HisXaa: 0.0 ± 0.0
Ile
1.686IleAla: 1.686 ± 0.779
2.108IleCys: 2.108 ± 0.627
2.951IleAsp: 2.951 ± 1.234
2.53IleGlu: 2.53 ± 1.367
2.108IlePhe: 2.108 ± 1.073
2.951IleGly: 2.951 ± 0.836
1.265IleHis: 1.265 ± 0.913
0.843IleIle: 0.843 ± 0.434
0.843IleLys: 0.843 ± 0.799
3.373IleLeu: 3.373 ± 0.657
1.686IleMet: 1.686 ± 0.981
0.843IleAsn: 0.843 ± 0.448
3.373IlePro: 3.373 ± 0.886
2.53IleGln: 2.53 ± 0.884
2.108IleArg: 2.108 ± 1.271
2.108IleSer: 2.108 ± 0.336
2.951IleThr: 2.951 ± 1.64
2.951IleVal: 2.951 ± 0.953
0.843IleTrp: 0.843 ± 0.493
0.422IleTyr: 0.422 ± 0.377
0.0IleXaa: 0.0 ± 0.0
Lys
1.686LysAla: 1.686 ± 0.568
0.422LysCys: 0.422 ± 0.403
2.53LysAsp: 2.53 ± 1.142
2.108LysGlu: 2.108 ± 1.17
1.265LysPhe: 1.265 ± 0.615
2.951LysGly: 2.951 ± 1.221
1.686LysHis: 1.686 ± 0.568
3.373LysIle: 3.373 ± 0.657
2.108LysLys: 2.108 ± 1.011
1.686LysLeu: 1.686 ± 0.861
0.422LysMet: 0.422 ± 0.739
0.843LysAsn: 0.843 ± 0.437
0.422LysPro: 0.422 ± 0.514
2.951LysGln: 2.951 ± 1.144
5.481LysArg: 5.481 ± 1.664
2.951LysSer: 2.951 ± 1.114
2.951LysThr: 2.951 ± 1.146
4.637LysVal: 4.637 ± 1.528
0.422LysTrp: 0.422 ± 0.403
2.108LysTyr: 2.108 ± 0.839
0.0LysXaa: 0.0 ± 0.0
Leu
6.745LeuAla: 6.745 ± 2.13
4.216LeuCys: 4.216 ± 2.403
5.902LeuAsp: 5.902 ± 0.74
4.216LeuGlu: 4.216 ± 1.561
2.951LeuPhe: 2.951 ± 1.273
7.589LeuGly: 7.589 ± 2.493
2.951LeuHis: 2.951 ± 0.926
0.422LeuIle: 0.422 ± 0.4
5.059LeuLys: 5.059 ± 1.559
4.216LeuLeu: 4.216 ± 1.851
2.53LeuMet: 2.53 ± 1.637
2.951LeuAsn: 2.951 ± 1.516
3.794LeuPro: 3.794 ± 0.834
4.637LeuGln: 4.637 ± 0.965
3.794LeuArg: 3.794 ± 1.12
7.589LeuSer: 7.589 ± 0.619
3.794LeuThr: 3.794 ± 0.654
4.216LeuVal: 4.216 ± 0.761
2.108LeuTrp: 2.108 ± 0.932
2.951LeuTyr: 2.951 ± 0.472
0.0LeuXaa: 0.0 ± 0.0
Met
1.265MetAla: 1.265 ± 0.93
0.422MetCys: 0.422 ± 0.4
1.686MetAsp: 1.686 ± 0.779
1.265MetGlu: 1.265 ± 0.394
1.265MetPhe: 1.265 ± 0.691
0.422MetGly: 0.422 ± 0.377
0.422MetHis: 0.422 ± 0.403
0.843MetIle: 0.843 ± 0.587
0.422MetLys: 0.422 ± 0.357
0.843MetLeu: 0.843 ± 0.714
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
0.422MetPro: 0.422 ± 0.357
0.422MetGln: 0.422 ± 0.357
1.686MetArg: 1.686 ± 1.428
1.265MetSer: 1.265 ± 1.071
1.686MetThr: 1.686 ± 0.727
1.686MetVal: 1.686 ± 1.022
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.53AsnAla: 2.53 ± 0.947
0.422AsnCys: 0.422 ± 0.514
1.265AsnAsp: 1.265 ± 0.811
2.53AsnGlu: 2.53 ± 0.758
1.265AsnPhe: 1.265 ± 0.636
3.794AsnGly: 3.794 ± 0.916
0.0AsnHis: 0.0 ± 0.0
1.686AsnIle: 1.686 ± 0.545
2.53AsnLys: 2.53 ± 0.88
2.108AsnLeu: 2.108 ± 0.808
0.843AsnMet: 0.843 ± 0.448
0.843AsnAsn: 0.843 ± 0.496
2.951AsnPro: 2.951 ± 0.997
1.265AsnGln: 1.265 ± 0.757
2.108AsnArg: 2.108 ± 0.336
1.265AsnSer: 1.265 ± 0.691
2.951AsnThr: 2.951 ± 1.059
0.843AsnVal: 0.843 ± 0.395
0.422AsnTrp: 0.422 ± 0.357
0.422AsnTyr: 0.422 ± 0.357
0.0AsnXaa: 0.0 ± 0.0
Pro
7.589ProAla: 7.589 ± 2.7
1.686ProCys: 1.686 ± 0.918
2.53ProAsp: 2.53 ± 1.398
4.637ProGlu: 4.637 ± 0.822
2.108ProPhe: 2.108 ± 0.797
4.637ProGly: 4.637 ± 1.489
0.843ProHis: 0.843 ± 0.627
1.265ProIle: 1.265 ± 0.677
2.108ProLys: 2.108 ± 1.347
7.589ProLeu: 7.589 ± 1.105
0.422ProMet: 0.422 ± 0.482
2.108ProAsn: 2.108 ± 1.17
5.902ProPro: 5.902 ± 0.989
0.843ProGln: 0.843 ± 0.805
5.902ProArg: 5.902 ± 2.021
4.637ProSer: 4.637 ± 1.832
1.686ProThr: 1.686 ± 0.854
5.902ProVal: 5.902 ± 2.232
0.843ProTrp: 0.843 ± 0.627
0.843ProTyr: 0.843 ± 0.448
0.0ProXaa: 0.0 ± 0.0
Gln
1.686GlnAla: 1.686 ± 0.205
0.422GlnCys: 0.422 ± 0.357
2.951GlnAsp: 2.951 ± 1.201
0.843GlnGlu: 0.843 ± 0.395
1.265GlnPhe: 1.265 ± 0.388
2.951GlnGly: 2.951 ± 1.303
1.265GlnHis: 1.265 ± 0.636
1.686GlnIle: 1.686 ± 1.009
2.108GlnLys: 2.108 ± 1.124
4.216GlnLeu: 4.216 ± 0.715
0.0GlnMet: 0.0 ± 0.0
1.265GlnAsn: 1.265 ± 0.379
2.53GlnPro: 2.53 ± 1.625
1.686GlnGln: 1.686 ± 0.953
2.53GlnArg: 2.53 ± 0.586
1.686GlnSer: 1.686 ± 1.172
2.951GlnThr: 2.951 ± 1.468
2.951GlnVal: 2.951 ± 0.479
1.265GlnTrp: 1.265 ± 0.574
2.108GlnTyr: 2.108 ± 0.72
0.0GlnXaa: 0.0 ± 0.0
Arg
5.902ArgAla: 5.902 ± 0.967
1.265ArgCys: 1.265 ± 0.811
2.951ArgAsp: 2.951 ± 0.662
2.951ArgGlu: 2.951 ± 0.969
3.794ArgPhe: 3.794 ± 1.267
4.637ArgGly: 4.637 ± 1.153
3.373ArgHis: 3.373 ± 1.092
3.373ArgIle: 3.373 ± 1.037
4.637ArgLys: 4.637 ± 1.171
7.589ArgLeu: 7.589 ± 1.361
0.422ArgMet: 0.422 ± 0.4
2.108ArgAsn: 2.108 ± 0.969
2.53ArgPro: 2.53 ± 1.132
0.843ArgGln: 0.843 ± 0.395
9.275ArgArg: 9.275 ± 3.393
5.481ArgSer: 5.481 ± 1.209
4.637ArgThr: 4.637 ± 1.077
5.059ArgVal: 5.059 ± 0.946
2.108ArgTrp: 2.108 ± 0.737
2.108ArgTyr: 2.108 ± 0.663
0.0ArgXaa: 0.0 ± 0.0
Ser
2.951SerAla: 2.951 ± 1.29
0.422SerCys: 0.422 ± 0.514
5.481SerAsp: 5.481 ± 0.4
4.216SerGlu: 4.216 ± 0.752
2.108SerPhe: 2.108 ± 1.044
7.167SerGly: 7.167 ± 1.623
0.843SerHis: 0.843 ± 0.493
3.373SerIle: 3.373 ± 0.657
0.0SerLys: 0.0 ± 0.0
6.324SerLeu: 6.324 ± 1.243
0.422SerMet: 0.422 ± 0.4
4.637SerAsn: 4.637 ± 0.715
5.481SerPro: 5.481 ± 0.837
4.216SerGln: 4.216 ± 1.472
3.373SerArg: 3.373 ± 1.053
6.745SerSer: 6.745 ± 3.215
5.902SerThr: 5.902 ± 2.053
6.324SerVal: 6.324 ± 1.882
0.422SerTrp: 0.422 ± 0.513
0.0SerTyr: 0.0 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
5.059ThrAla: 5.059 ± 1.698
2.108ThrCys: 2.108 ± 0.969
1.686ThrAsp: 1.686 ± 0.205
3.794ThrGlu: 3.794 ± 0.764
1.686ThrPhe: 1.686 ± 0.544
5.059ThrGly: 5.059 ± 0.482
1.265ThrHis: 1.265 ± 0.659
1.265ThrIle: 1.265 ± 0.699
2.108ThrLys: 2.108 ± 1.099
7.589ThrLeu: 7.589 ± 2.617
2.108ThrMet: 2.108 ± 1.363
2.108ThrAsn: 2.108 ± 1.099
5.059ThrPro: 5.059 ± 0.844
3.373ThrGln: 3.373 ± 1.416
5.059ThrArg: 5.059 ± 1.05
8.853ThrSer: 8.853 ± 1.025
3.373ThrThr: 3.373 ± 1.395
5.481ThrVal: 5.481 ± 1.952
1.686ThrTrp: 1.686 ± 0.668
0.843ThrTyr: 0.843 ± 0.587
0.0ThrXaa: 0.0 ± 0.0
Val
2.108ValAla: 2.108 ± 1.117
0.0ValCys: 0.0 ± 0.0
7.589ValAsp: 7.589 ± 2.332
2.53ValGlu: 2.53 ± 0.93
1.686ValPhe: 1.686 ± 0.545
3.794ValGly: 3.794 ± 1.106
1.265ValHis: 1.265 ± 0.388
3.373ValIle: 3.373 ± 0.758
2.951ValLys: 2.951 ± 1.978
6.324ValLeu: 6.324 ± 1.559
0.422ValMet: 0.422 ± 0.357
0.422ValAsn: 0.422 ± 0.357
5.902ValPro: 5.902 ± 2.498
3.794ValGln: 3.794 ± 1.245
5.481ValArg: 5.481 ± 1.948
5.481ValSer: 5.481 ± 0.44
6.324ValThr: 6.324 ± 0.807
5.059ValVal: 5.059 ± 1.297
0.843ValTrp: 0.843 ± 0.799
2.53ValTyr: 2.53 ± 0.952
0.0ValXaa: 0.0 ± 0.0
Trp
0.843TrpAla: 0.843 ± 0.395
0.422TrpCys: 0.422 ± 0.514
1.265TrpAsp: 1.265 ± 0.636
0.843TrpGlu: 0.843 ± 0.627
0.843TrpPhe: 0.843 ± 0.627
0.843TrpGly: 0.843 ± 0.799
0.0TrpHis: 0.0 ± 0.0
1.265TrpIle: 1.265 ± 0.73
1.686TrpLys: 1.686 ± 0.722
1.686TrpLeu: 1.686 ± 0.874
0.422TrpMet: 0.422 ± 0.513
1.265TrpAsn: 1.265 ± 0.806
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
1.265TrpArg: 1.265 ± 0.574
2.108TrpSer: 2.108 ± 0.395
1.265TrpThr: 1.265 ± 1.208
1.265TrpVal: 1.265 ± 0.691
0.0TrpTrp: 0.0 ± 0.0
0.843TrpTyr: 0.843 ± 0.395
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.108TyrAla: 2.108 ± 0.903
0.422TyrCys: 0.422 ± 0.357
1.265TyrAsp: 1.265 ± 0.388
0.422TyrGlu: 0.422 ± 0.403
1.265TyrPhe: 1.265 ± 0.476
3.373TyrGly: 3.373 ± 0.873
1.265TyrHis: 1.265 ± 0.394
0.843TyrIle: 0.843 ± 0.799
0.843TyrLys: 0.843 ± 0.714
1.265TyrLeu: 1.265 ± 0.691
0.843TyrMet: 0.843 ± 0.714
0.0TyrAsn: 0.0 ± 0.0
1.686TyrPro: 1.686 ± 0.7
0.843TyrGln: 0.843 ± 0.714
2.53TyrArg: 2.53 ± 0.561
0.843TyrSer: 0.843 ± 0.448
0.843TyrThr: 0.843 ± 0.496
2.951TyrVal: 2.951 ± 0.622
1.265TyrTrp: 1.265 ± 0.379
0.843TyrTyr: 0.843 ± 0.805
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2373 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski