Amino acid dipepetide frequency for Canis familiaris papillomavirus 15

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.203AlaAla: 8.203 ± 2.376
1.231AlaCys: 1.231 ± 0.531
2.871AlaAsp: 2.871 ± 0.717
5.742AlaGlu: 5.742 ± 1.394
2.871AlaPhe: 2.871 ± 0.992
4.922AlaGly: 4.922 ± 2.335
0.41AlaHis: 0.41 ± 0.327
2.051AlaIle: 2.051 ± 0.758
3.281AlaLys: 3.281 ± 1.175
4.102AlaLeu: 4.102 ± 1.459
2.461AlaMet: 2.461 ± 0.679
2.461AlaAsn: 2.461 ± 0.944
5.332AlaPro: 5.332 ± 1.002
2.461AlaGln: 2.461 ± 1.084
5.742AlaArg: 5.742 ± 1.517
2.871AlaSer: 2.871 ± 0.946
4.922AlaThr: 4.922 ± 1.107
5.332AlaVal: 5.332 ± 1.165
0.41AlaTrp: 0.41 ± 0.334
1.231AlaTyr: 1.231 ± 0.31
0.0AlaXaa: 0.0 ± 0.0
Cys
2.051CysAla: 2.051 ± 0.969
0.82CysCys: 0.82 ± 0.636
1.641CysAsp: 1.641 ± 1.027
0.82CysGlu: 0.82 ± 0.561
0.82CysPhe: 0.82 ± 0.372
2.461CysGly: 2.461 ± 0.936
0.82CysHis: 0.82 ± 0.647
2.051CysIle: 2.051 ± 0.856
1.231CysLys: 1.231 ± 0.605
1.231CysLeu: 1.231 ± 1.026
0.82CysMet: 0.82 ± 0.372
0.82CysAsn: 0.82 ± 0.357
2.461CysPro: 2.461 ± 1.212
0.41CysGln: 0.41 ± 0.334
1.641CysArg: 1.641 ± 0.981
1.231CysSer: 1.231 ± 0.603
0.0CysThr: 0.0 ± 0.0
1.231CysVal: 1.231 ± 0.651
0.41CysTrp: 0.41 ± 0.351
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
6.153AspAla: 6.153 ± 1.408
1.231AspCys: 1.231 ± 0.623
2.871AspAsp: 2.871 ± 1.449
3.692AspGlu: 3.692 ± 1.876
2.051AspPhe: 2.051 ± 0.681
6.563AspGly: 6.563 ± 0.975
0.82AspHis: 0.82 ± 0.372
3.692AspIle: 3.692 ± 1.442
0.82AspLys: 0.82 ± 0.541
5.332AspLeu: 5.332 ± 1.406
0.82AspMet: 0.82 ± 0.357
2.871AspAsn: 2.871 ± 0.471
6.153AspPro: 6.153 ± 1.351
1.231AspGln: 1.231 ± 0.624
2.871AspArg: 2.871 ± 0.72
5.332AspSer: 5.332 ± 1.042
3.692AspThr: 3.692 ± 0.787
3.692AspVal: 3.692 ± 1.374
1.231AspTrp: 1.231 ± 0.772
2.461AspTyr: 2.461 ± 0.62
0.0AspXaa: 0.0 ± 0.0
Glu
5.332GluAla: 5.332 ± 1.233
2.051GluCys: 2.051 ± 0.969
6.563GluAsp: 6.563 ± 1.849
5.742GluGlu: 5.742 ± 2.398
2.051GluPhe: 2.051 ± 1.031
4.102GluGly: 4.102 ± 2.083
1.231GluHis: 1.231 ± 1.118
1.641GluIle: 1.641 ± 0.727
1.231GluLys: 1.231 ± 1.118
5.742GluLeu: 5.742 ± 1.003
0.82GluMet: 0.82 ± 0.561
2.461GluAsn: 2.461 ± 0.854
4.102GluPro: 4.102 ± 1.937
2.871GluGln: 2.871 ± 1.163
3.281GluArg: 3.281 ± 0.72
2.871GluSer: 2.871 ± 1.113
3.692GluThr: 3.692 ± 0.903
3.281GluVal: 3.281 ± 1.127
0.82GluTrp: 0.82 ± 0.668
0.82GluTyr: 0.82 ± 0.357
0.0GluXaa: 0.0 ± 0.0
Phe
2.461PheAla: 2.461 ± 0.854
1.231PheCys: 1.231 ± 1.032
3.281PheAsp: 3.281 ± 1.42
2.051PheGlu: 2.051 ± 0.821
3.692PhePhe: 3.692 ± 0.993
2.461PheGly: 2.461 ± 0.808
0.41PheHis: 0.41 ± 0.327
0.82PheIle: 0.82 ± 0.357
3.281PheLys: 3.281 ± 1.348
3.692PheLeu: 3.692 ± 1.142
1.231PheMet: 1.231 ± 0.341
1.231PheAsn: 1.231 ± 0.596
1.231PhePro: 1.231 ± 0.603
0.82PheGln: 0.82 ± 0.372
3.281PheArg: 3.281 ± 0.589
1.231PheSer: 1.231 ± 0.684
2.051PheThr: 2.051 ± 0.923
0.82PheVal: 0.82 ± 0.391
1.231PheTrp: 1.231 ± 0.337
1.231PheTyr: 1.231 ± 0.63
0.0PheXaa: 0.0 ± 0.0
Gly
4.102GlyAla: 4.102 ± 1.588
0.82GlyCys: 0.82 ± 0.647
6.153GlyAsp: 6.153 ± 1.132
7.793GlyGlu: 7.793 ± 1.097
2.461GlyPhe: 2.461 ± 0.979
11.485GlyGly: 11.485 ± 3.779
2.051GlyHis: 2.051 ± 0.919
3.692GlyIle: 3.692 ± 1.506
1.231GlyLys: 1.231 ± 0.858
3.692GlyLeu: 3.692 ± 1.205
0.82GlyMet: 0.82 ± 0.494
2.461GlyAsn: 2.461 ± 0.998
4.102GlyPro: 4.102 ± 1.066
3.281GlyGln: 3.281 ± 0.921
5.742GlyArg: 5.742 ± 2.387
6.153GlySer: 6.153 ± 1.222
3.692GlyThr: 3.692 ± 0.703
7.383GlyVal: 7.383 ± 1.381
0.0GlyTrp: 0.0 ± 0.0
1.641GlyTyr: 1.641 ± 0.793
0.0GlyXaa: 0.0 ± 0.0
His
1.231HisAla: 1.231 ± 0.434
0.41HisCys: 0.41 ± 0.544
0.82HisAsp: 0.82 ± 0.396
1.231HisGlu: 1.231 ± 0.656
0.82HisPhe: 0.82 ± 0.391
2.461HisGly: 2.461 ± 1.15
1.231HisHis: 1.231 ± 0.623
1.641HisIle: 1.641 ± 1.132
1.641HisLys: 1.641 ± 1.336
1.641HisLeu: 1.641 ± 0.506
0.0HisMet: 0.0 ± 0.0
0.41HisAsn: 0.41 ± 0.327
1.641HisPro: 1.641 ± 0.663
0.0HisGln: 0.0 ± 0.0
0.41HisArg: 0.41 ± 0.333
0.41HisSer: 0.41 ± 0.327
1.641HisThr: 1.641 ± 0.729
1.231HisVal: 1.231 ± 0.605
0.82HisTrp: 0.82 ± 0.44
1.231HisTyr: 1.231 ± 0.624
0.0HisXaa: 0.0 ± 0.0
Ile
1.231IleAla: 1.231 ± 0.548
0.82IleCys: 0.82 ± 0.357
1.231IleAsp: 1.231 ± 0.605
3.281IleGlu: 3.281 ± 1.094
1.231IlePhe: 1.231 ± 0.434
2.461IleGly: 2.461 ± 1.372
0.82IleHis: 0.82 ± 0.566
0.41IleIle: 0.41 ± 0.351
0.82IleLys: 0.82 ± 0.566
4.102IleLeu: 4.102 ± 0.928
0.82IleMet: 0.82 ± 0.372
2.461IleAsn: 2.461 ± 0.843
2.051IlePro: 2.051 ± 0.622
2.461IleGln: 2.461 ± 0.835
0.82IleArg: 0.82 ± 1.088
4.102IleSer: 4.102 ± 1.336
2.461IleThr: 2.461 ± 0.923
2.461IleVal: 2.461 ± 0.681
0.41IleTrp: 0.41 ± 0.333
1.231IleTyr: 1.231 ± 0.651
0.0IleXaa: 0.0 ± 0.0
Lys
2.461LysAla: 2.461 ± 0.829
3.281LysCys: 3.281 ± 1.151
1.641LysAsp: 1.641 ± 0.814
2.051LysGlu: 2.051 ± 1.204
3.692LysPhe: 3.692 ± 1.284
2.051LysGly: 2.051 ± 0.752
1.231LysHis: 1.231 ± 0.624
1.641LysIle: 1.641 ± 0.63
3.281LysLys: 3.281 ± 1.756
1.641LysLeu: 1.641 ± 1.293
0.82LysMet: 0.82 ± 0.67
0.41LysAsn: 0.41 ± 0.334
1.231LysPro: 1.231 ± 0.596
1.641LysGln: 1.641 ± 0.512
4.512LysArg: 4.512 ± 0.867
4.102LysSer: 4.102 ± 2.034
1.641LysThr: 1.641 ± 0.53
2.051LysVal: 2.051 ± 0.923
0.0LysTrp: 0.0 ± 0.0
0.82LysTyr: 0.82 ± 0.701
0.0LysXaa: 0.0 ± 0.0
Leu
4.922LeuAla: 4.922 ± 0.995
2.051LeuCys: 2.051 ± 0.897
4.102LeuAsp: 4.102 ± 1.148
3.281LeuGlu: 3.281 ± 1.225
2.871LeuPhe: 2.871 ± 1.113
6.153LeuGly: 6.153 ± 0.905
2.461LeuHis: 2.461 ± 0.463
1.231LeuIle: 1.231 ± 0.31
4.102LeuLys: 4.102 ± 1.789
8.614LeuLeu: 8.614 ± 2.21
0.82LeuMet: 0.82 ± 0.668
3.692LeuAsn: 3.692 ± 0.939
4.922LeuPro: 4.922 ± 1.524
5.742LeuGln: 5.742 ± 0.977
4.922LeuArg: 4.922 ± 0.913
6.563LeuSer: 6.563 ± 1.19
6.563LeuThr: 6.563 ± 1.197
5.332LeuVal: 5.332 ± 1.263
1.641LeuTrp: 1.641 ± 0.567
2.871LeuTyr: 2.871 ± 0.572
0.0LeuXaa: 0.0 ± 0.0
Met
2.051MetAla: 2.051 ± 0.844
0.0MetCys: 0.0 ± 0.0
1.231MetAsp: 1.231 ± 0.667
1.231MetGlu: 1.231 ± 0.531
1.231MetPhe: 1.231 ± 0.624
0.0MetGly: 0.0 ± 0.0
0.0MetHis: 0.0 ± 0.0
0.82MetIle: 0.82 ± 0.661
0.82MetLys: 0.82 ± 0.668
2.871MetLeu: 2.871 ± 1.545
0.0MetMet: 0.0 ± 0.0
1.231MetAsn: 1.231 ± 0.624
0.0MetPro: 0.0 ± 0.0
1.641MetGln: 1.641 ± 0.927
0.41MetArg: 0.41 ± 0.334
2.871MetSer: 2.871 ± 1.054
0.41MetThr: 0.41 ± 0.327
1.231MetVal: 1.231 ± 0.656
0.0MetTrp: 0.0 ± 0.0
0.41MetTyr: 0.41 ± 0.334
0.0MetXaa: 0.0 ± 0.0
Asn
1.641AsnAla: 1.641 ± 0.744
1.231AsnCys: 1.231 ± 0.624
1.231AsnAsp: 1.231 ± 0.531
0.41AsnGlu: 0.41 ± 0.334
1.231AsnPhe: 1.231 ± 0.603
1.641AsnGly: 1.641 ± 1.047
0.82AsnHis: 0.82 ± 0.357
2.051AsnIle: 2.051 ± 1.031
2.871AsnLys: 2.871 ± 1.521
2.871AsnLeu: 2.871 ± 0.842
0.82AsnMet: 0.82 ± 0.357
1.641AsnAsn: 1.641 ± 0.947
3.281AsnPro: 3.281 ± 0.617
0.82AsnGln: 0.82 ± 0.357
4.102AsnArg: 4.102 ± 1.237
2.461AsnSer: 2.461 ± 1.191
2.871AsnThr: 2.871 ± 1.298
0.82AsnVal: 0.82 ± 0.561
0.41AsnTrp: 0.41 ± 0.334
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
5.332ProAla: 5.332 ± 2.438
2.051ProCys: 2.051 ± 0.774
4.102ProAsp: 4.102 ± 1.09
3.281ProGlu: 3.281 ± 0.916
0.82ProPhe: 0.82 ± 0.396
5.332ProGly: 5.332 ± 1.978
1.641ProHis: 1.641 ± 0.734
2.461ProIle: 2.461 ± 0.576
2.871ProLys: 2.871 ± 1.267
7.793ProLeu: 7.793 ± 1.679
0.41ProMet: 0.41 ± 0.327
2.461ProAsn: 2.461 ± 0.854
11.485ProPro: 11.485 ± 3.002
2.871ProGln: 2.871 ± 1.13
6.563ProArg: 6.563 ± 2.015
6.563ProSer: 6.563 ± 2.535
2.461ProThr: 2.461 ± 0.511
8.614ProVal: 8.614 ± 2.372
0.0ProTrp: 0.0 ± 0.0
2.461ProTyr: 2.461 ± 1.488
0.0ProXaa: 0.0 ± 0.0
Gln
1.641GlnAla: 1.641 ± 0.963
0.0GlnCys: 0.0 ± 0.0
1.641GlnAsp: 1.641 ± 0.734
2.871GlnGlu: 2.871 ± 0.758
1.641GlnPhe: 1.641 ± 0.528
2.461GlnGly: 2.461 ± 0.494
1.231GlnHis: 1.231 ± 0.548
1.641GlnIle: 1.641 ± 0.578
2.461GlnLys: 2.461 ± 0.654
4.102GlnLeu: 4.102 ± 1.785
1.231GlnMet: 1.231 ± 0.337
0.41GlnAsn: 0.41 ± 0.351
3.281GlnPro: 3.281 ± 0.642
2.051GlnGln: 2.051 ± 0.684
1.231GlnArg: 1.231 ± 0.697
3.281GlnSer: 3.281 ± 1.126
3.281GlnThr: 3.281 ± 0.496
4.512GlnVal: 4.512 ± 0.873
1.641GlnTrp: 1.641 ± 0.216
1.231GlnTyr: 1.231 ± 0.337
0.0GlnXaa: 0.0 ± 0.0
Arg
4.922ArgAla: 4.922 ± 1.238
1.641ArgCys: 1.641 ± 1.091
2.461ArgAsp: 2.461 ± 0.398
3.281ArgGlu: 3.281 ± 1.259
2.461ArgPhe: 2.461 ± 1.067
6.563ArgGly: 6.563 ± 1.105
2.051ArgHis: 2.051 ± 0.583
1.231ArgIle: 1.231 ± 0.562
4.102ArgLys: 4.102 ± 0.68
8.203ArgLeu: 8.203 ± 0.932
0.41ArgMet: 0.41 ± 0.334
2.051ArgAsn: 2.051 ± 0.807
6.973ArgPro: 6.973 ± 2.734
2.051ArgGln: 2.051 ± 0.894
8.614ArgArg: 8.614 ± 2.947
2.871ArgSer: 2.871 ± 0.418
4.102ArgThr: 4.102 ± 1.333
4.922ArgVal: 4.922 ± 0.789
1.231ArgTrp: 1.231 ± 0.845
2.871ArgTyr: 2.871 ± 1.181
0.0ArgXaa: 0.0 ± 0.0
Ser
4.922SerAla: 4.922 ± 0.537
0.41SerCys: 0.41 ± 0.544
7.383SerAsp: 7.383 ± 0.941
3.281SerGlu: 3.281 ± 1.047
2.051SerPhe: 2.051 ± 0.923
7.793SerGly: 7.793 ± 1.453
1.641SerHis: 1.641 ± 0.963
2.461SerIle: 2.461 ± 0.466
0.82SerLys: 0.82 ± 0.561
4.922SerLeu: 4.922 ± 1.241
2.051SerMet: 2.051 ± 0.923
1.231SerAsn: 1.231 ± 0.656
5.332SerPro: 5.332 ± 2.274
4.922SerGln: 4.922 ± 0.642
5.332SerArg: 5.332 ± 1.225
7.793SerSer: 7.793 ± 1.792
3.281SerThr: 3.281 ± 1.413
4.922SerVal: 4.922 ± 0.886
0.82SerTrp: 0.82 ± 0.654
1.231SerTyr: 1.231 ± 0.667
0.0SerXaa: 0.0 ± 0.0
Thr
3.281ThrAla: 3.281 ± 1.062
1.641ThrCys: 1.641 ± 0.679
3.692ThrAsp: 3.692 ± 0.612
2.871ThrGlu: 2.871 ± 0.85
1.641ThrPhe: 1.641 ± 0.61
2.461ThrGly: 2.461 ± 0.463
0.0ThrHis: 0.0 ± 0.0
1.641ThrIle: 1.641 ± 0.528
0.82ThrLys: 0.82 ± 0.357
4.922ThrLeu: 4.922 ± 1.537
0.82ThrMet: 0.82 ± 0.567
1.231ThrAsn: 1.231 ± 0.596
8.203ThrPro: 8.203 ± 1.891
3.692ThrGln: 3.692 ± 1.268
4.922ThrArg: 4.922 ± 1.583
3.692ThrSer: 3.692 ± 1.703
3.692ThrThr: 3.692 ± 1.104
4.922ThrVal: 4.922 ± 0.886
2.051ThrTrp: 2.051 ± 1.337
1.231ThrTyr: 1.231 ± 0.337
0.0ThrXaa: 0.0 ± 0.0
Val
3.692ValAla: 3.692 ± 1.304
1.231ValCys: 1.231 ± 0.614
6.973ValAsp: 6.973 ± 2.348
4.512ValGlu: 4.512 ± 0.366
2.051ValPhe: 2.051 ± 0.68
4.922ValGly: 4.922 ± 1.319
1.231ValHis: 1.231 ± 0.656
2.461ValIle: 2.461 ± 0.745
2.871ValLys: 2.871 ± 1.155
4.922ValLeu: 4.922 ± 1.157
2.051ValMet: 2.051 ± 1.217
2.051ValAsn: 2.051 ± 0.68
6.563ValPro: 6.563 ± 1.51
1.231ValGln: 1.231 ± 0.31
4.512ValArg: 4.512 ± 1.064
6.563ValSer: 6.563 ± 1.17
3.692ValThr: 3.692 ± 0.612
4.512ValVal: 4.512 ± 0.865
0.82ValTrp: 0.82 ± 0.357
2.461ValTyr: 2.461 ± 0.784
0.0ValXaa: 0.0 ± 0.0
Trp
2.051TrpAla: 2.051 ± 0.708
0.0TrpCys: 0.0 ± 0.0
0.82TrpAsp: 0.82 ± 0.372
1.231TrpGlu: 1.231 ± 0.552
0.82TrpPhe: 0.82 ± 0.372
0.41TrpGly: 0.41 ± 0.327
0.0TrpHis: 0.0 ± 0.0
0.82TrpIle: 0.82 ± 0.668
1.231TrpLys: 1.231 ± 0.531
1.641TrpLeu: 1.641 ± 0.714
0.41TrpMet: 0.41 ± 0.334
0.41TrpAsn: 0.41 ± 0.351
0.0TrpPro: 0.0 ± 0.0
0.41TrpGln: 0.41 ± 0.333
2.051TrpArg: 2.051 ± 0.969
0.82TrpSer: 0.82 ± 0.428
1.231TrpThr: 1.231 ± 0.697
0.41TrpVal: 0.41 ± 0.334
0.0TrpTrp: 0.0 ± 0.0
0.41TrpTyr: 0.41 ± 0.333
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.231TyrAla: 1.231 ± 0.337
0.82TyrCys: 0.82 ± 0.661
2.871TyrAsp: 2.871 ± 0.573
2.051TyrGlu: 2.051 ± 0.968
1.231TyrPhe: 1.231 ± 0.624
2.461TyrGly: 2.461 ± 0.936
0.82TyrHis: 0.82 ± 0.372
1.231TyrIle: 1.231 ± 0.645
0.41TyrLys: 0.41 ± 0.334
0.82TyrLeu: 0.82 ± 0.668
0.41TyrMet: 0.41 ± 0.334
1.231TyrAsn: 1.231 ± 0.722
1.231TyrPro: 1.231 ± 0.434
1.231TyrGln: 1.231 ± 0.624
2.051TyrArg: 2.051 ± 0.781
0.82TyrSer: 0.82 ± 0.654
2.051TyrThr: 2.051 ± 0.469
1.641TyrVal: 1.641 ± 0.58
1.231TyrTrp: 1.231 ± 0.337
1.231TyrTyr: 1.231 ± 0.694
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2439 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski