Amino acid dipepetide frequency for Canis familiaris papillomavirus 7

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.612AlaAla: 4.612 ± 0.748
0.384AlaCys: 0.384 ± 0.318
3.459AlaAsp: 3.459 ± 1.14
2.69AlaGlu: 2.69 ± 0.913
4.228AlaPhe: 4.228 ± 1.166
2.69AlaGly: 2.69 ± 0.705
1.153AlaHis: 1.153 ± 0.658
2.306AlaIle: 2.306 ± 0.679
2.306AlaLys: 2.306 ± 0.718
3.459AlaLeu: 3.459 ± 0.682
0.769AlaMet: 0.769 ± 0.401
1.153AlaAsn: 1.153 ± 0.446
3.075AlaPro: 3.075 ± 1.12
1.922AlaGln: 1.922 ± 0.699
3.075AlaArg: 3.075 ± 1.17
6.533AlaSer: 6.533 ± 1.997
4.612AlaThr: 4.612 ± 0.824
4.228AlaVal: 4.228 ± 1.077
0.384AlaTrp: 0.384 ± 0.315
0.384AlaTyr: 0.384 ± 0.315
0.0AlaXaa: 0.0 ± 0.0
Cys
1.537CysAla: 1.537 ± 0.81
2.69CysCys: 2.69 ± 2.063
1.153CysAsp: 1.153 ± 0.698
1.153CysGlu: 1.153 ± 0.983
1.153CysPhe: 1.153 ± 0.38
1.153CysGly: 1.153 ± 0.492
0.769CysHis: 0.769 ± 0.668
1.153CysIle: 1.153 ± 1.005
3.075CysLys: 3.075 ± 1.285
3.843CysLeu: 3.843 ± 2.248
0.384CysMet: 0.384 ± 0.494
0.384CysAsn: 0.384 ± 0.696
1.153CysPro: 1.153 ± 0.823
0.384CysGln: 0.384 ± 0.315
1.153CysArg: 1.153 ± 0.414
1.537CysSer: 1.537 ± 0.716
1.153CysThr: 1.153 ± 0.38
0.0CysVal: 0.0 ± 0.0
0.769CysTrp: 0.769 ± 0.552
0.769CysTyr: 0.769 ± 0.703
0.0CysXaa: 0.0 ± 0.0
Asp
2.306AspAla: 2.306 ± 1.222
2.69AspCys: 2.69 ± 1.101
4.228AspAsp: 4.228 ± 1.924
4.612AspGlu: 4.612 ± 1.987
2.306AspPhe: 2.306 ± 0.886
6.918AspGly: 6.918 ± 1.543
0.0AspHis: 0.0 ± 0.0
1.537AspIle: 1.537 ± 0.907
2.306AspLys: 2.306 ± 1.146
5.38AspLeu: 5.38 ± 1.678
0.769AspMet: 0.769 ± 0.383
1.922AspAsn: 1.922 ± 0.914
4.612AspPro: 4.612 ± 1.597
2.69AspGln: 2.69 ± 0.695
2.69AspArg: 2.69 ± 0.869
4.996AspSer: 4.996 ± 1.248
3.843AspThr: 3.843 ± 0.934
3.459AspVal: 3.459 ± 0.908
0.769AspTrp: 0.769 ± 0.631
0.384AspTyr: 0.384 ± 0.318
0.0AspXaa: 0.0 ± 0.0
Glu
3.843GluAla: 3.843 ± 1.239
0.384GluCys: 0.384 ± 0.315
7.686GluAsp: 7.686 ± 3.473
7.686GluGlu: 7.686 ± 3.285
1.922GluPhe: 1.922 ± 0.497
6.533GluGly: 6.533 ± 2.348
1.922GluHis: 1.922 ± 0.661
2.69GluIle: 2.69 ± 1.096
3.459GluLys: 3.459 ± 1.589
2.306GluLeu: 2.306 ± 0.865
0.384GluMet: 0.384 ± 0.318
2.306GluAsn: 2.306 ± 0.761
5.765GluPro: 5.765 ± 2.063
1.153GluGln: 1.153 ± 0.459
2.69GluArg: 2.69 ± 0.687
6.149GluSer: 6.149 ± 1.566
3.843GluThr: 3.843 ± 1.1
2.69GluVal: 2.69 ± 1.24
0.384GluTrp: 0.384 ± 0.318
1.537GluTyr: 1.537 ± 0.628
0.0GluXaa: 0.0 ± 0.0
Phe
1.922PheAla: 1.922 ± 0.766
1.537PheCys: 1.537 ± 0.795
3.843PheAsp: 3.843 ± 0.624
2.69PheGlu: 2.69 ± 1.488
2.306PhePhe: 2.306 ± 1.229
2.306PheGly: 2.306 ± 0.963
1.537PheHis: 1.537 ± 0.735
2.306PheIle: 2.306 ± 0.779
1.153PheLys: 1.153 ± 0.633
3.843PheLeu: 3.843 ± 0.86
0.384PheMet: 0.384 ± 0.494
1.153PheAsn: 1.153 ± 0.67
3.075PhePro: 3.075 ± 0.644
1.537PheGln: 1.537 ± 0.986
1.153PheArg: 1.153 ± 0.38
2.306PheSer: 2.306 ± 0.775
2.306PheThr: 2.306 ± 0.895
1.922PheVal: 1.922 ± 1.011
1.537PheTrp: 1.537 ± 0.604
1.537PheTyr: 1.537 ± 0.597
0.0PheXaa: 0.0 ± 0.0
Gly
3.843GlyAla: 3.843 ± 1.008
1.922GlyCys: 1.922 ± 0.917
5.765GlyAsp: 5.765 ± 0.867
6.918GlyGlu: 6.918 ± 2.125
2.306GlyPhe: 2.306 ± 0.759
9.608GlyGly: 9.608 ± 2.988
1.537GlyHis: 1.537 ± 1.575
1.922GlyIle: 1.922 ± 0.819
3.459GlyLys: 3.459 ± 0.585
5.765GlyLeu: 5.765 ± 0.929
0.769GlyMet: 0.769 ± 0.389
3.459GlyAsn: 3.459 ± 1.095
3.459GlyPro: 3.459 ± 0.63
5.38GlyGln: 5.38 ± 0.793
9.608GlyArg: 9.608 ± 3.554
8.455GlySer: 8.455 ± 1.992
5.765GlyThr: 5.765 ± 2.339
2.69GlyVal: 2.69 ± 0.892
0.384GlyTrp: 0.384 ± 0.315
2.306GlyTyr: 2.306 ± 0.561
0.0GlyXaa: 0.0 ± 0.0
His
0.769HisAla: 0.769 ± 0.389
1.537HisCys: 1.537 ± 0.735
0.384HisAsp: 0.384 ± 0.315
0.384HisGlu: 0.384 ± 0.696
0.384HisPhe: 0.384 ± 0.315
1.537HisGly: 1.537 ± 0.751
0.384HisHis: 0.384 ± 0.315
0.384HisIle: 0.384 ± 0.326
0.384HisLys: 0.384 ± 0.326
0.769HisLeu: 0.769 ± 0.383
0.769HisMet: 0.769 ± 0.383
0.0HisAsn: 0.0 ± 0.0
1.922HisPro: 1.922 ± 1.14
1.922HisGln: 1.922 ± 1.219
1.537HisArg: 1.537 ± 0.801
3.075HisSer: 3.075 ± 1.519
1.153HisThr: 1.153 ± 0.685
1.153HisVal: 1.153 ± 0.414
0.769HisTrp: 0.769 ± 0.589
1.922HisTyr: 1.922 ± 0.705
0.0HisXaa: 0.0 ± 0.0
Ile
0.769IleAla: 0.769 ± 0.389
0.384IleCys: 0.384 ± 0.494
3.075IleAsp: 3.075 ± 1.045
3.843IleGlu: 3.843 ± 1.151
3.459IlePhe: 3.459 ± 0.967
2.306IleGly: 2.306 ± 0.903
0.769IleHis: 0.769 ± 0.652
2.69IleIle: 2.69 ± 1.146
1.922IleLys: 1.922 ± 1.052
3.843IleLeu: 3.843 ± 1.169
0.384IleMet: 0.384 ± 0.384
1.537IleAsn: 1.537 ± 0.985
2.69IlePro: 2.69 ± 1.29
0.384IleGln: 0.384 ± 0.326
2.306IleArg: 2.306 ± 0.556
2.306IleSer: 2.306 ± 1.258
0.384IleThr: 0.384 ± 0.338
3.075IleVal: 3.075 ± 1.291
0.384IleTrp: 0.384 ± 0.338
1.153IleTyr: 1.153 ± 0.388
0.0IleXaa: 0.0 ± 0.0
Lys
1.922LysAla: 1.922 ± 0.898
0.384LysCys: 0.384 ± 0.318
1.537LysAsp: 1.537 ± 0.802
2.306LysGlu: 2.306 ± 1.49
2.306LysPhe: 2.306 ± 0.934
2.306LysGly: 2.306 ± 0.625
1.153LysHis: 1.153 ± 0.946
1.153LysIle: 1.153 ± 0.811
3.075LysLys: 3.075 ± 1.018
4.228LysLeu: 4.228 ± 1.339
1.153LysMet: 1.153 ± 0.612
2.306LysAsn: 2.306 ± 0.74
1.922LysPro: 1.922 ± 0.882
2.306LysGln: 2.306 ± 0.891
6.149LysArg: 6.149 ± 1.355
2.306LysSer: 2.306 ± 0.723
1.537LysThr: 1.537 ± 0.579
2.69LysVal: 2.69 ± 0.951
1.153LysTrp: 1.153 ± 0.459
2.69LysTyr: 2.69 ± 1.034
0.0LysXaa: 0.0 ± 0.0
Leu
2.69LeuAla: 2.69 ± 0.678
1.922LeuCys: 1.922 ± 1.843
4.996LeuAsp: 4.996 ± 1.528
4.996LeuGlu: 4.996 ± 1.236
4.228LeuPhe: 4.228 ± 1.476
6.918LeuGly: 6.918 ± 0.959
3.075LeuHis: 3.075 ± 0.654
3.075LeuIle: 3.075 ± 1.104
3.075LeuLys: 3.075 ± 1.061
9.608LeuLeu: 9.608 ± 1.966
1.537LeuMet: 1.537 ± 0.642
3.075LeuAsn: 3.075 ± 0.89
3.843LeuPro: 3.843 ± 0.904
6.918LeuGln: 6.918 ± 2.182
4.228LeuArg: 4.228 ± 1.459
8.839LeuSer: 8.839 ± 2.231
5.38LeuThr: 5.38 ± 1.889
6.149LeuVal: 6.149 ± 0.921
0.769LeuTrp: 0.769 ± 0.389
2.306LeuTyr: 2.306 ± 0.429
0.0LeuXaa: 0.0 ± 0.0
Met
1.153MetAla: 1.153 ± 0.569
1.537MetCys: 1.537 ± 0.793
1.153MetAsp: 1.153 ± 1.014
0.384MetGlu: 0.384 ± 0.318
0.769MetPhe: 0.769 ± 0.383
0.384MetGly: 0.384 ± 0.315
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
0.384MetLys: 0.384 ± 0.338
2.69MetLeu: 2.69 ± 0.914
0.769MetMet: 0.769 ± 0.389
0.769MetAsn: 0.769 ± 0.383
0.0MetPro: 0.0 ± 0.0
0.0MetGln: 0.0 ± 0.0
2.306MetArg: 2.306 ± 0.786
0.769MetSer: 0.769 ± 0.401
1.922MetThr: 1.922 ± 0.693
0.769MetVal: 0.769 ± 0.631
0.384MetTrp: 0.384 ± 0.318
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
1.922AsnAla: 1.922 ± 0.728
0.0AsnCys: 0.0 ± 0.0
1.537AsnAsp: 1.537 ± 0.571
3.459AsnGlu: 3.459 ± 1.261
1.537AsnPhe: 1.537 ± 0.597
1.922AsnGly: 1.922 ± 0.855
0.0AsnHis: 0.0 ± 0.0
2.306AsnIle: 2.306 ± 0.949
1.153AsnLys: 1.153 ± 0.676
2.69AsnLeu: 2.69 ± 1.466
1.153AsnMet: 1.153 ± 0.65
1.922AsnAsn: 1.922 ± 1.69
4.612AsnPro: 4.612 ± 1.424
2.306AsnGln: 2.306 ± 1.028
2.69AsnArg: 2.69 ± 0.493
2.69AsnSer: 2.69 ± 0.866
2.69AsnThr: 2.69 ± 1.266
1.153AsnVal: 1.153 ± 0.614
0.769AsnTrp: 0.769 ± 0.589
1.537AsnTyr: 1.537 ± 0.962
0.0AsnXaa: 0.0 ± 0.0
Pro
6.149ProAla: 6.149 ± 2.346
1.537ProCys: 1.537 ± 0.586
4.996ProAsp: 4.996 ± 1.741
4.612ProGlu: 4.612 ± 1.948
0.769ProPhe: 0.769 ± 0.639
3.459ProGly: 3.459 ± 1.205
1.153ProHis: 1.153 ± 0.796
1.153ProIle: 1.153 ± 0.388
4.228ProLys: 4.228 ± 1.513
5.38ProLeu: 5.38 ± 1.68
1.153ProMet: 1.153 ± 0.569
2.69ProAsn: 2.69 ± 0.475
7.686ProPro: 7.686 ± 2.147
2.306ProGln: 2.306 ± 1.224
5.38ProArg: 5.38 ± 1.526
6.149ProSer: 6.149 ± 1.349
3.843ProThr: 3.843 ± 0.977
3.459ProVal: 3.459 ± 1.05
0.384ProTrp: 0.384 ± 0.318
3.075ProTyr: 3.075 ± 0.948
0.0ProXaa: 0.0 ± 0.0
Gln
2.69GlnAla: 2.69 ± 0.964
1.153GlnCys: 1.153 ± 0.492
1.537GlnAsp: 1.537 ± 0.403
3.459GlnGlu: 3.459 ± 0.828
2.306GlnPhe: 2.306 ± 1.109
6.149GlnGly: 6.149 ± 0.784
0.384GlnHis: 0.384 ± 0.315
1.153GlnIle: 1.153 ± 0.388
2.306GlnLys: 2.306 ± 0.775
4.996GlnLeu: 4.996 ± 1.394
1.153GlnMet: 1.153 ± 0.65
2.69GlnAsn: 2.69 ± 0.767
1.153GlnPro: 1.153 ± 0.808
2.306GlnGln: 2.306 ± 1.229
2.306GlnArg: 2.306 ± 1.217
2.306GlnSer: 2.306 ± 0.605
0.769GlnThr: 0.769 ± 0.589
3.459GlnVal: 3.459 ± 0.703
0.769GlnTrp: 0.769 ± 0.631
1.922GlnTyr: 1.922 ± 1.288
0.0GlnXaa: 0.0 ± 0.0
Arg
4.996ArgAla: 4.996 ± 1.19
1.537ArgCys: 1.537 ± 0.994
1.537ArgAsp: 1.537 ± 1.03
3.075ArgGlu: 3.075 ± 0.851
1.922ArgPhe: 1.922 ± 0.507
9.992ArgGly: 9.992 ± 4.498
2.306ArgHis: 2.306 ± 0.77
2.306ArgIle: 2.306 ± 1.072
3.075ArgLys: 3.075 ± 0.82
6.918ArgLeu: 6.918 ± 1.057
1.153ArgMet: 1.153 ± 0.496
2.306ArgAsn: 2.306 ± 0.626
5.38ArgPro: 5.38 ± 1.267
4.228ArgGln: 4.228 ± 1.352
6.918ArgArg: 6.918 ± 2.377
4.612ArgSer: 4.612 ± 1.149
1.537ArgThr: 1.537 ± 0.907
2.306ArgVal: 2.306 ± 1.041
0.0ArgTrp: 0.0 ± 0.0
1.153ArgTyr: 1.153 ± 0.717
0.0ArgXaa: 0.0 ± 0.0
Ser
2.69SerAla: 2.69 ± 0.976
1.537SerCys: 1.537 ± 0.706
4.996SerAsp: 4.996 ± 1.374
4.612SerGlu: 4.612 ± 1.817
2.69SerPhe: 2.69 ± 0.79
7.686SerGly: 7.686 ± 2.946
2.306SerHis: 2.306 ± 0.917
4.612SerIle: 4.612 ± 1.594
3.843SerLys: 3.843 ± 1.206
9.992SerLeu: 9.992 ± 2.207
0.0SerMet: 0.0 ± 0.0
4.228SerAsn: 4.228 ± 1.171
8.455SerPro: 8.455 ± 2.336
2.69SerGln: 2.69 ± 0.917
4.612SerArg: 4.612 ± 1.551
9.992SerSer: 9.992 ± 1.689
7.302SerThr: 7.302 ± 2.577
5.38SerVal: 5.38 ± 1.514
1.153SerTrp: 1.153 ± 0.637
0.769SerTyr: 0.769 ± 0.389
0.0SerXaa: 0.0 ± 0.0
Thr
4.228ThrAla: 4.228 ± 1.222
2.306ThrCys: 2.306 ± 1.23
1.537ThrAsp: 1.537 ± 0.299
4.612ThrGlu: 4.612 ± 0.674
1.537ThrPhe: 1.537 ± 0.57
5.38ThrGly: 5.38 ± 1.322
0.769ThrHis: 0.769 ± 0.74
3.075ThrIle: 3.075 ± 0.792
1.153ThrLys: 1.153 ± 0.614
2.69ThrLeu: 2.69 ± 0.933
1.153ThrMet: 1.153 ± 0.67
2.69ThrAsn: 2.69 ± 0.841
7.302ThrPro: 7.302 ± 1.973
2.306ThrGln: 2.306 ± 0.601
1.922ThrArg: 1.922 ± 0.675
8.839ThrSer: 8.839 ± 2.452
1.922ThrThr: 1.922 ± 0.758
4.228ThrVal: 4.228 ± 1.047
0.0ThrTrp: 0.0 ± 0.0
0.769ThrTyr: 0.769 ± 0.631
0.0ThrXaa: 0.0 ± 0.0
Val
2.69ValAla: 2.69 ± 0.878
1.153ValCys: 1.153 ± 0.638
3.459ValAsp: 3.459 ± 0.569
1.537ValGlu: 1.537 ± 0.375
2.69ValPhe: 2.69 ± 1.331
4.996ValGly: 4.996 ± 1.546
1.537ValHis: 1.537 ± 0.811
2.69ValIle: 2.69 ± 1.248
1.537ValLys: 1.537 ± 0.765
5.38ValLeu: 5.38 ± 2.024
0.384ValMet: 0.384 ± 0.318
1.922ValAsn: 1.922 ± 0.925
2.69ValPro: 2.69 ± 0.837
2.306ValGln: 2.306 ± 0.967
2.306ValArg: 2.306 ± 1.783
6.149ValSer: 6.149 ± 1.713
3.843ValThr: 3.843 ± 2.134
4.612ValVal: 4.612 ± 1.286
1.153ValTrp: 1.153 ± 0.569
2.306ValTyr: 2.306 ± 1.148
0.0ValXaa: 0.0 ± 0.0
Trp
0.769TrpAla: 0.769 ± 0.389
0.384TrpCys: 0.384 ± 0.494
0.384TrpAsp: 0.384 ± 0.338
0.769TrpGlu: 0.769 ± 0.471
0.769TrpPhe: 0.769 ± 0.631
0.769TrpGly: 0.769 ± 0.471
0.0TrpHis: 0.0 ± 0.0
0.769TrpIle: 0.769 ± 0.401
0.769TrpLys: 0.769 ± 0.389
1.153TrpLeu: 1.153 ± 0.946
0.384TrpMet: 0.384 ± 0.338
0.769TrpAsn: 0.769 ± 0.676
0.384TrpPro: 0.384 ± 0.338
0.384TrpGln: 0.384 ± 0.494
1.153TrpArg: 1.153 ± 0.998
0.769TrpSer: 0.769 ± 0.401
1.537TrpThr: 1.537 ± 0.586
0.769TrpVal: 0.769 ± 0.631
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.922TyrAla: 1.922 ± 0.694
0.384TyrCys: 0.384 ± 0.338
1.153TyrAsp: 1.153 ± 0.38
1.537TyrGlu: 1.537 ± 0.614
0.769TyrPhe: 0.769 ± 0.409
2.306TyrGly: 2.306 ± 1.282
0.384TyrHis: 0.384 ± 0.315
0.384TyrIle: 0.384 ± 0.326
2.306TyrLys: 2.306 ± 0.899
2.69TyrLeu: 2.69 ± 1.019
1.153TyrMet: 1.153 ± 0.437
0.769TyrAsn: 0.769 ± 0.409
0.384TyrPro: 0.384 ± 0.315
1.537TyrGln: 1.537 ± 0.571
3.075TyrArg: 3.075 ± 1.515
0.769TyrSer: 0.769 ± 0.383
3.075TyrThr: 3.075 ± 0.975
1.153TyrVal: 1.153 ± 0.65
0.769TyrTrp: 0.769 ± 0.445
2.306TyrTyr: 2.306 ± 0.954
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2603 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski