Amino acid dipepetide frequency for Canis familiaris papillomavirus 2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.906AlaAla: 2.906 ± 0.905
1.453AlaCys: 1.453 ± 1.036
3.996AlaAsp: 3.996 ± 2.119
1.453AlaGlu: 1.453 ± 0.572
4.359AlaPhe: 4.359 ± 1.342
3.996AlaGly: 3.996 ± 1.347
0.726AlaHis: 0.726 ± 0.507
1.816AlaIle: 1.816 ± 0.637
5.085AlaLys: 5.085 ± 0.903
4.722AlaLeu: 4.722 ± 0.824
1.09AlaMet: 1.09 ± 0.698
1.816AlaAsn: 1.816 ± 0.949
4.722AlaPro: 4.722 ± 0.848
1.453AlaGln: 1.453 ± 0.379
2.179AlaArg: 2.179 ± 0.888
5.812AlaSer: 5.812 ± 1.784
3.269AlaThr: 3.269 ± 0.792
2.179AlaVal: 2.179 ± 0.527
0.726AlaTrp: 0.726 ± 0.4
1.09AlaTyr: 1.09 ± 0.666
0.0AlaXaa: 0.0 ± 0.0
Cys
1.09CysAla: 1.09 ± 0.562
1.09CysCys: 1.09 ± 0.743
1.453CysAsp: 1.453 ± 0.683
0.0CysGlu: 0.0 ± 0.0
2.543CysPhe: 2.543 ± 1.508
0.363CysGly: 0.363 ± 0.423
0.726CysHis: 0.726 ± 0.885
0.726CysIle: 0.726 ± 0.43
1.453CysLys: 1.453 ± 0.724
3.269CysLeu: 3.269 ± 1.898
0.0CysMet: 0.0 ± 0.0
1.816CysAsn: 1.816 ± 0.888
1.816CysPro: 1.816 ± 0.887
0.726CysGln: 0.726 ± 0.373
1.09CysArg: 1.09 ± 0.569
2.543CysSer: 2.543 ± 1.068
2.543CysThr: 2.543 ± 1.417
1.816CysVal: 1.816 ± 2.057
0.363CysTrp: 0.363 ± 0.308
1.453CysTyr: 1.453 ± 0.586
0.0CysXaa: 0.0 ± 0.0
Asp
2.906AspAla: 2.906 ± 1.026
2.543AspCys: 2.543 ± 1.153
2.543AspAsp: 2.543 ± 1.027
7.265AspGlu: 7.265 ± 2.793
1.816AspPhe: 1.816 ± 0.665
5.085AspGly: 5.085 ± 1.33
0.726AspHis: 0.726 ± 0.4
2.906AspIle: 2.906 ± 0.701
1.09AspLys: 1.09 ± 0.49
5.449AspLeu: 5.449 ± 1.401
0.726AspMet: 0.726 ± 0.362
1.816AspAsn: 1.816 ± 0.667
4.722AspPro: 4.722 ± 1.418
2.179AspGln: 2.179 ± 0.772
1.816AspArg: 1.816 ± 0.936
4.722AspSer: 4.722 ± 0.506
2.543AspThr: 2.543 ± 0.611
3.269AspVal: 3.269 ± 0.946
0.363AspTrp: 0.363 ± 0.331
1.09AspTyr: 1.09 ± 0.341
0.0AspXaa: 0.0 ± 0.0
Glu
3.269GluAla: 3.269 ± 1.208
0.363GluCys: 0.363 ± 0.308
6.538GluAsp: 6.538 ± 1.912
5.449GluGlu: 5.449 ± 1.642
2.906GluPhe: 2.906 ± 1.479
5.085GluGly: 5.085 ± 2.281
1.453GluHis: 1.453 ± 0.827
1.816GluIle: 1.816 ± 0.882
2.179GluLys: 2.179 ± 0.904
3.632GluLeu: 3.632 ± 1.389
0.726GluMet: 0.726 ± 0.362
3.269GluAsn: 3.269 ± 0.613
5.812GluPro: 5.812 ± 1.419
1.816GluGln: 1.816 ± 0.535
2.179GluArg: 2.179 ± 0.832
6.538GluSer: 6.538 ± 0.937
4.359GluThr: 4.359 ± 1.409
3.269GluVal: 3.269 ± 1.107
0.726GluTrp: 0.726 ± 0.4
0.363GluTyr: 0.363 ± 0.318
0.0GluXaa: 0.0 ± 0.0
Phe
2.179PheAla: 2.179 ± 0.614
2.543PheCys: 2.543 ± 1.046
3.632PheAsp: 3.632 ± 1.006
3.996PheGlu: 3.996 ± 1.552
2.543PhePhe: 2.543 ± 0.79
3.269PheGly: 3.269 ± 0.782
0.363PheHis: 0.363 ± 0.331
1.09PheIle: 1.09 ± 0.87
2.179PheLys: 2.179 ± 1.188
4.722PheLeu: 4.722 ± 1.179
0.726PheMet: 0.726 ± 0.823
1.09PheAsn: 1.09 ± 0.644
2.906PhePro: 2.906 ± 0.753
1.816PheGln: 1.816 ± 0.693
0.726PheArg: 0.726 ± 0.362
3.269PheSer: 3.269 ± 0.73
1.816PheThr: 1.816 ± 0.637
2.543PheVal: 2.543 ± 1.155
1.09PheTrp: 1.09 ± 0.595
1.09PheTyr: 1.09 ± 0.341
0.0PheXaa: 0.0 ± 0.0
Gly
3.632GlyAla: 3.632 ± 1.424
2.543GlyCys: 2.543 ± 0.782
4.359GlyAsp: 4.359 ± 0.88
6.175GlyGlu: 6.175 ± 1.702
2.543GlyPhe: 2.543 ± 0.573
10.897GlyGly: 10.897 ± 2.607
1.453GlyHis: 1.453 ± 0.564
2.543GlyIle: 2.543 ± 0.638
2.179GlyLys: 2.179 ± 0.897
5.449GlyLeu: 5.449 ± 1.476
0.726GlyMet: 0.726 ± 0.4
4.359GlyAsn: 4.359 ± 0.918
5.449GlyPro: 5.449 ± 3.055
1.816GlyGln: 1.816 ± 0.879
8.718GlyArg: 8.718 ± 3.867
9.807GlySer: 9.807 ± 3.101
4.722GlyThr: 4.722 ± 1.233
4.359GlyVal: 4.359 ± 0.928
0.363GlyTrp: 0.363 ± 0.331
1.09GlyTyr: 1.09 ± 0.629
0.0GlyXaa: 0.0 ± 0.0
His
1.09HisAla: 1.09 ± 0.743
0.363HisCys: 0.363 ± 0.331
0.363HisAsp: 0.363 ± 0.331
0.363HisGlu: 0.363 ± 0.331
1.09HisPhe: 1.09 ± 0.576
1.09HisGly: 1.09 ± 0.618
1.09HisHis: 1.09 ± 0.632
0.363HisIle: 0.363 ± 0.32
1.09HisLys: 1.09 ± 0.555
1.09HisLeu: 1.09 ± 0.449
0.363HisMet: 0.363 ± 0.318
0.726HisAsn: 0.726 ± 0.491
1.09HisPro: 1.09 ± 0.961
0.726HisGln: 0.726 ± 0.663
1.09HisArg: 1.09 ± 0.462
2.543HisSer: 2.543 ± 1.17
1.453HisThr: 1.453 ± 0.597
1.453HisVal: 1.453 ± 0.52
0.363HisTrp: 0.363 ± 0.318
1.816HisTyr: 1.816 ± 0.671
0.0HisXaa: 0.0 ± 0.0
Ile
1.453IleAla: 1.453 ± 0.757
0.726IleCys: 0.726 ± 0.885
3.632IleAsp: 3.632 ± 1.255
3.269IleGlu: 3.269 ± 0.729
1.453IlePhe: 1.453 ± 0.726
1.816IleGly: 1.816 ± 1.017
0.726IleHis: 0.726 ± 0.43
2.906IleIle: 2.906 ± 1.289
1.453IleLys: 1.453 ± 0.532
3.269IleLeu: 3.269 ± 0.895
1.09IleMet: 1.09 ± 0.928
1.453IleAsn: 1.453 ± 0.751
3.269IlePro: 3.269 ± 1.486
1.453IleGln: 1.453 ± 0.569
3.632IleArg: 3.632 ± 1.249
1.816IleSer: 1.816 ± 0.705
1.816IleThr: 1.816 ± 1.048
3.632IleVal: 3.632 ± 2.204
0.363IleTrp: 0.363 ± 0.318
1.816IleTyr: 1.816 ± 0.456
0.0IleXaa: 0.0 ± 0.0
Lys
3.632LysAla: 3.632 ± 0.852
0.726LysCys: 0.726 ± 0.529
1.09LysAsp: 1.09 ± 0.654
1.09LysGlu: 1.09 ± 0.469
1.453LysPhe: 1.453 ± 0.617
2.179LysGly: 2.179 ± 0.718
1.816LysHis: 1.816 ± 1.025
1.453LysIle: 1.453 ± 0.567
3.269LysLys: 3.269 ± 1.224
3.996LysLeu: 3.996 ± 1.427
0.726LysMet: 0.726 ± 0.663
2.543LysAsn: 2.543 ± 0.678
2.179LysPro: 2.179 ± 1.092
1.816LysGln: 1.816 ± 0.845
5.085LysArg: 5.085 ± 1.335
3.269LysSer: 3.269 ± 1.332
1.453LysThr: 1.453 ± 0.746
3.996LysVal: 3.996 ± 1.329
0.363LysTrp: 0.363 ± 0.318
2.543LysTyr: 2.543 ± 1.153
0.0LysXaa: 0.0 ± 0.0
Leu
5.449LeuAla: 5.449 ± 1.517
2.543LeuCys: 2.543 ± 1.345
3.632LeuAsp: 3.632 ± 0.817
6.538LeuGlu: 6.538 ± 1.794
5.812LeuPhe: 5.812 ± 1.304
8.355LeuGly: 8.355 ± 1.45
2.906LeuHis: 2.906 ± 1.051
3.269LeuIle: 3.269 ± 0.825
3.269LeuLys: 3.269 ± 1.056
8.355LeuLeu: 8.355 ± 1.698
1.453LeuMet: 1.453 ± 0.453
2.543LeuAsn: 2.543 ± 0.879
5.085LeuPro: 5.085 ± 1.312
4.359LeuGln: 4.359 ± 0.637
5.085LeuArg: 5.085 ± 0.867
7.265LeuSer: 7.265 ± 1.076
5.812LeuThr: 5.812 ± 1.549
5.812LeuVal: 5.812 ± 0.97
0.363LeuTrp: 0.363 ± 0.308
2.179LeuTyr: 2.179 ± 0.639
0.0LeuXaa: 0.0 ± 0.0
Met
1.453MetAla: 1.453 ± 0.539
1.09MetCys: 1.09 ± 0.711
1.09MetAsp: 1.09 ± 0.595
0.0MetGlu: 0.0 ± 0.0
1.453MetPhe: 1.453 ± 0.919
0.363MetGly: 0.363 ± 0.423
0.0MetHis: 0.0 ± 0.0
0.363MetIle: 0.363 ± 0.423
0.363MetLys: 0.363 ± 0.411
3.632MetLeu: 3.632 ± 1.449
0.726MetMet: 0.726 ± 0.847
0.726MetAsn: 0.726 ± 0.636
0.363MetPro: 0.363 ± 0.32
0.0MetGln: 0.0 ± 0.0
2.179MetArg: 2.179 ± 0.561
1.453MetSer: 1.453 ± 0.646
0.726MetThr: 0.726 ± 0.636
0.726MetVal: 0.726 ± 0.663
0.363MetTrp: 0.363 ± 0.308
0.363MetTyr: 0.363 ± 0.394
0.0MetXaa: 0.0 ± 0.0
Asn
2.179AsnAla: 2.179 ± 0.905
0.726AsnCys: 0.726 ± 0.501
1.816AsnAsp: 1.816 ± 0.704
2.543AsnGlu: 2.543 ± 1.159
2.543AsnPhe: 2.543 ± 1.519
1.453AsnGly: 1.453 ± 0.794
0.0AsnHis: 0.0 ± 0.0
2.543AsnIle: 2.543 ± 1.041
2.179AsnLys: 2.179 ± 0.704
2.543AsnLeu: 2.543 ± 1.017
0.726AsnMet: 0.726 ± 0.663
1.816AsnAsn: 1.816 ± 1.59
3.632AsnPro: 3.632 ± 1.851
2.179AsnGln: 2.179 ± 0.584
3.996AsnArg: 3.996 ± 0.501
2.543AsnSer: 2.543 ± 1.17
3.269AsnThr: 3.269 ± 1.587
1.816AsnVal: 1.816 ± 0.704
0.363AsnTrp: 0.363 ± 0.318
1.453AsnTyr: 1.453 ± 0.566
0.0AsnXaa: 0.0 ± 0.0
Pro
4.722ProAla: 4.722 ± 2.506
1.453ProCys: 1.453 ± 0.913
3.996ProAsp: 3.996 ± 1.67
5.449ProGlu: 5.449 ± 2.928
1.09ProPhe: 1.09 ± 0.896
5.449ProGly: 5.449 ± 2.552
1.09ProHis: 1.09 ± 0.341
2.179ProIle: 2.179 ± 0.561
5.085ProLys: 5.085 ± 0.72
6.902ProLeu: 6.902 ± 2.02
0.363ProMet: 0.363 ± 0.331
1.453ProAsn: 1.453 ± 0.638
11.624ProPro: 11.624 ± 6.921
2.906ProGln: 2.906 ± 1.305
5.449ProArg: 5.449 ± 0.988
6.538ProSer: 6.538 ± 1.512
5.085ProThr: 5.085 ± 1.913
2.906ProVal: 2.906 ± 0.757
0.726ProTrp: 0.726 ± 0.4
2.906ProTyr: 2.906 ± 1.082
0.0ProXaa: 0.0 ± 0.0
Gln
3.269GlnAla: 3.269 ± 0.723
0.726GlnCys: 0.726 ± 0.4
1.453GlnAsp: 1.453 ± 0.617
2.543GlnGlu: 2.543 ± 0.761
1.816GlnPhe: 1.816 ± 0.737
2.906GlnGly: 2.906 ± 0.639
0.726GlnHis: 0.726 ± 0.507
1.09GlnIle: 1.09 ± 0.376
1.09GlnLys: 1.09 ± 0.451
3.269GlnLeu: 3.269 ± 1.127
1.453GlnMet: 1.453 ± 0.945
2.543GlnAsn: 2.543 ± 0.801
1.816GlnPro: 1.816 ± 0.522
1.816GlnGln: 1.816 ± 0.961
2.179GlnArg: 2.179 ± 0.979
0.726GlnSer: 0.726 ± 0.392
1.09GlnThr: 1.09 ± 0.373
1.816GlnVal: 1.816 ± 0.489
2.179GlnTrp: 2.179 ± 0.93
1.453GlnTyr: 1.453 ± 0.665
0.0GlnXaa: 0.0 ± 0.0
Arg
3.269ArgAla: 3.269 ± 0.807
1.453ArgCys: 1.453 ± 0.692
1.453ArgAsp: 1.453 ± 0.579
3.632ArgGlu: 3.632 ± 0.944
1.816ArgPhe: 1.816 ± 0.565
9.081ArgGly: 9.081 ± 3.036
2.179ArgHis: 2.179 ± 0.897
1.816ArgIle: 1.816 ± 0.855
2.906ArgLys: 2.906 ± 0.738
6.175ArgLeu: 6.175 ± 1.19
2.179ArgMet: 2.179 ± 1.296
3.269ArgAsn: 3.269 ± 0.679
5.812ArgPro: 5.812 ± 1.304
1.453ArgGln: 1.453 ± 0.566
6.175ArgArg: 6.175 ± 1.756
5.449ArgSer: 5.449 ± 1.131
1.816ArgThr: 1.816 ± 0.79
2.906ArgVal: 2.906 ± 1.225
0.726ArgTrp: 0.726 ± 0.558
2.179ArgTyr: 2.179 ± 0.845
0.0ArgXaa: 0.0 ± 0.0
Ser
3.632SerAla: 3.632 ± 1.581
1.09SerCys: 1.09 ± 0.507
5.449SerAsp: 5.449 ± 0.689
3.269SerGlu: 3.269 ± 0.919
1.816SerPhe: 1.816 ± 0.671
10.534SerGly: 10.534 ± 2.557
1.09SerHis: 1.09 ± 0.629
2.906SerIle: 2.906 ± 0.955
3.996SerLys: 3.996 ± 1.292
9.807SerLeu: 9.807 ± 1.162
1.453SerMet: 1.453 ± 0.692
3.632SerAsn: 3.632 ± 1.447
5.812SerPro: 5.812 ± 1.555
2.543SerGln: 2.543 ± 0.573
4.359SerArg: 4.359 ± 1.583
9.444SerSer: 9.444 ± 4.508
7.265SerThr: 7.265 ± 2.678
5.085SerVal: 5.085 ± 1.561
1.453SerTrp: 1.453 ± 0.757
0.726SerTyr: 0.726 ± 0.373
0.0SerXaa: 0.0 ± 0.0
Thr
3.996ThrAla: 3.996 ± 1.086
2.543ThrCys: 2.543 ± 1.003
3.632ThrAsp: 3.632 ± 1.681
3.632ThrGlu: 3.632 ± 0.799
2.179ThrPhe: 2.179 ± 0.654
4.722ThrGly: 4.722 ± 0.623
2.179ThrHis: 2.179 ± 0.899
4.359ThrIle: 4.359 ± 1.284
0.726ThrLys: 0.726 ± 0.362
3.996ThrLeu: 3.996 ± 1.323
1.453ThrMet: 1.453 ± 0.739
1.816ThrAsn: 1.816 ± 1.098
5.085ThrPro: 5.085 ± 1.024
1.816ThrGln: 1.816 ± 1.007
2.906ThrArg: 2.906 ± 1.011
5.449ThrSer: 5.449 ± 1.358
1.816ThrThr: 1.816 ± 1.018
4.722ThrVal: 4.722 ± 1.17
0.726ThrTrp: 0.726 ± 0.353
0.363ThrTyr: 0.363 ± 0.32
0.0ThrXaa: 0.0 ± 0.0
Val
3.269ValAla: 3.269 ± 1.239
2.179ValCys: 2.179 ± 0.846
2.906ValAsp: 2.906 ± 0.554
2.906ValGlu: 2.906 ± 0.971
2.543ValPhe: 2.543 ± 0.891
4.359ValGly: 4.359 ± 1.446
0.363ValHis: 0.363 ± 0.331
4.359ValIle: 4.359 ± 1.879
1.816ValLys: 1.816 ± 0.686
5.812ValLeu: 5.812 ± 1.461
0.726ValMet: 0.726 ± 0.494
1.816ValAsn: 1.816 ± 1.251
4.722ValPro: 4.722 ± 1.499
1.453ValGln: 1.453 ± 0.564
3.632ValArg: 3.632 ± 1.049
5.085ValSer: 5.085 ± 1.759
4.359ValThr: 4.359 ± 1.506
3.632ValVal: 3.632 ± 1.119
1.09ValTrp: 1.09 ± 0.507
1.09ValTyr: 1.09 ± 0.644
0.0ValXaa: 0.0 ± 0.0
Trp
0.726TrpAla: 0.726 ± 0.4
0.0TrpCys: 0.0 ± 0.0
0.726TrpAsp: 0.726 ± 0.505
1.09TrpGlu: 1.09 ± 0.923
0.726TrpPhe: 0.726 ± 0.663
0.726TrpGly: 0.726 ± 0.558
0.0TrpHis: 0.0 ± 0.0
1.453TrpIle: 1.453 ± 0.566
0.726TrpLys: 0.726 ± 0.4
1.09TrpLeu: 1.09 ± 0.994
0.0TrpMet: 0.0 ± 0.0
1.09TrpAsn: 1.09 ± 0.711
0.726TrpPro: 0.726 ± 0.43
0.726TrpGln: 0.726 ± 0.615
1.453TrpArg: 1.453 ± 0.913
0.726TrpSer: 0.726 ± 0.615
1.453TrpThr: 1.453 ± 0.58
0.363TrpVal: 0.363 ± 0.331
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.726TyrAla: 0.726 ± 0.4
0.726TyrCys: 0.726 ± 0.362
1.816TyrAsp: 1.816 ± 0.456
1.09TyrGlu: 1.09 ± 0.406
1.09TyrPhe: 1.09 ± 0.523
1.453TyrGly: 1.453 ± 0.588
0.0TyrHis: 0.0 ± 0.0
1.09TyrIle: 1.09 ± 0.534
2.543TyrLys: 2.543 ± 0.771
2.906TyrLeu: 2.906 ± 1.389
0.363TyrMet: 0.363 ± 0.394
0.726TyrAsn: 0.726 ± 0.43
0.726TyrPro: 0.726 ± 0.353
2.906TyrGln: 2.906 ± 0.745
1.816TyrArg: 1.816 ± 0.711
0.363TyrSer: 0.363 ± 0.308
1.453TyrThr: 1.453 ± 0.637
2.179TyrVal: 2.179 ± 0.871
1.09TyrTrp: 1.09 ± 0.569
1.453TyrTyr: 1.453 ± 0.784
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (2754 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski