Amino acid dipepetide frequency for Equus caballus papillomavirus 1 (strain Olson) (EcPV-1)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.905AlaAla: 5.905 ± 1.618
1.265AlaCys: 1.265 ± 1.03
4.639AlaAsp: 4.639 ± 1.165
2.109AlaGlu: 2.109 ± 1.33
2.531AlaPhe: 2.531 ± 0.55
3.374AlaGly: 3.374 ± 0.572
0.0AlaHis: 0.0 ± 0.0
2.531AlaIle: 2.531 ± 0.722
2.109AlaLys: 2.109 ± 0.781
6.326AlaLeu: 6.326 ± 1.992
1.265AlaMet: 1.265 ± 0.679
1.265AlaAsn: 1.265 ± 0.455
6.326AlaPro: 6.326 ± 1.649
4.218AlaGln: 4.218 ± 1.306
3.374AlaArg: 3.374 ± 1.225
2.952AlaSer: 2.952 ± 0.545
5.061AlaThr: 5.061 ± 1.11
6.748AlaVal: 6.748 ± 1.273
0.844AlaTrp: 0.844 ± 0.427
1.687AlaTyr: 1.687 ± 0.997
0.0AlaXaa: 0.0 ± 0.0
Cys
0.844CysAla: 0.844 ± 1.158
0.844CysCys: 0.844 ± 0.586
1.265CysAsp: 1.265 ± 0.614
0.0CysGlu: 0.0 ± 0.0
0.844CysPhe: 0.844 ± 0.596
1.265CysGly: 1.265 ± 0.852
0.422CysHis: 0.422 ± 0.488
0.0CysIle: 0.0 ± 0.0
1.687CysLys: 1.687 ± 0.869
0.844CysLeu: 0.844 ± 0.438
0.844CysMet: 0.844 ± 0.596
0.422CysAsn: 0.422 ± 0.488
2.531CysPro: 2.531 ± 0.903
0.422CysGln: 0.422 ± 0.62
4.218CysArg: 4.218 ± 1.22
2.109CysSer: 2.109 ± 1.139
1.265CysThr: 1.265 ± 0.688
0.844CysVal: 0.844 ± 0.857
0.422CysTrp: 0.422 ± 0.488
1.265CysTyr: 1.265 ± 0.906
0.0CysXaa: 0.0 ± 0.0
Asp
6.326AspAla: 6.326 ± 1.619
2.109AspCys: 2.109 ± 0.386
2.531AspAsp: 2.531 ± 1.679
3.374AspGlu: 3.374 ± 1.445
0.844AspPhe: 0.844 ± 0.427
4.218AspGly: 4.218 ± 1.199
0.422AspHis: 0.422 ± 0.409
1.265AspIle: 1.265 ± 0.741
2.109AspLys: 2.109 ± 0.541
6.748AspLeu: 6.748 ± 1.567
1.687AspMet: 1.687 ± 0.595
1.265AspAsn: 1.265 ± 0.386
4.639AspPro: 4.639 ± 1.195
1.687AspGln: 1.687 ± 0.858
1.687AspArg: 1.687 ± 0.779
2.952AspSer: 2.952 ± 0.545
4.218AspThr: 4.218 ± 1.601
4.218AspVal: 4.218 ± 1.774
1.265AspTrp: 1.265 ± 0.77
1.687AspTyr: 1.687 ± 0.509
0.0AspXaa: 0.0 ± 0.0
Glu
5.061GluAla: 5.061 ± 1.753
0.422GluCys: 0.422 ± 0.488
4.218GluAsp: 4.218 ± 1.437
7.17GluGlu: 7.17 ± 2.325
0.844GluPhe: 0.844 ± 0.593
5.061GluGly: 5.061 ± 1.756
1.265GluHis: 1.265 ± 0.634
2.952GluIle: 2.952 ± 0.99
2.531GluLys: 2.531 ± 1.025
7.17GluLeu: 7.17 ± 1.651
0.422GluMet: 0.422 ± 0.356
2.531GluAsn: 2.531 ± 0.909
4.218GluPro: 4.218 ± 0.605
2.952GluGln: 2.952 ± 0.604
1.687GluArg: 1.687 ± 1.202
3.796GluSer: 3.796 ± 1.358
2.531GluThr: 2.531 ± 1.115
5.905GluVal: 5.905 ± 2.185
1.687GluTrp: 1.687 ± 1.027
0.422GluTyr: 0.422 ± 0.356
0.0GluXaa: 0.0 ± 0.0
Phe
2.109PheAla: 2.109 ± 1.079
0.844PheCys: 0.844 ± 0.586
3.796PheAsp: 3.796 ± 1.799
3.374PheGlu: 3.374 ± 0.422
0.844PhePhe: 0.844 ± 0.427
2.109PheGly: 2.109 ± 0.902
1.265PheHis: 1.265 ± 0.386
1.265PheIle: 1.265 ± 0.429
0.844PheLys: 0.844 ± 0.711
2.952PheLeu: 2.952 ± 1.584
0.0PheMet: 0.0 ± 0.0
2.952PheAsn: 2.952 ± 1.492
2.109PhePro: 2.109 ± 0.75
1.687PheGln: 1.687 ± 0.57
1.687PheArg: 1.687 ± 0.854
2.531PheSer: 2.531 ± 0.903
1.687PheThr: 1.687 ± 0.288
2.109PheVal: 2.109 ± 0.541
1.687PheTrp: 1.687 ± 0.73
0.422PheTyr: 0.422 ± 0.395
0.0PheXaa: 0.0 ± 0.0
Gly
5.483GlyAla: 5.483 ± 1.534
1.687GlyCys: 1.687 ± 1.179
6.748GlyAsp: 6.748 ± 1.198
2.952GlyGlu: 2.952 ± 0.915
0.844GlyPhe: 0.844 ± 0.444
9.701GlyGly: 9.701 ± 3.938
2.531GlyHis: 2.531 ± 1.095
3.796GlyIle: 3.796 ± 1.817
2.109GlyLys: 2.109 ± 1.108
7.17GlyLeu: 7.17 ± 1.642
0.422GlyMet: 0.422 ± 0.322
3.796GlyAsn: 3.796 ± 0.945
4.639GlyPro: 4.639 ± 2.314
2.952GlyGln: 2.952 ± 1.245
6.748GlyArg: 6.748 ± 2.638
6.748GlySer: 6.748 ± 1.205
5.483GlyThr: 5.483 ± 1.141
4.218GlyVal: 4.218 ± 1.048
0.422GlyTrp: 0.422 ± 0.395
0.422GlyTyr: 0.422 ± 0.397
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.422HisCys: 0.422 ± 0.356
0.422HisAsp: 0.422 ± 0.488
1.265HisGlu: 1.265 ± 0.852
1.687HisPhe: 1.687 ± 1.0
2.531HisGly: 2.531 ± 1.444
0.0HisHis: 0.0 ± 0.0
0.844HisIle: 0.844 ± 0.484
1.687HisLys: 1.687 ± 0.857
1.687HisLeu: 1.687 ± 1.239
0.844HisMet: 0.844 ± 0.531
0.422HisAsn: 0.422 ± 0.356
1.265HisPro: 1.265 ± 0.741
0.844HisGln: 0.844 ± 0.427
0.844HisArg: 0.844 ± 0.484
1.687HisSer: 1.687 ± 0.563
0.844HisThr: 0.844 ± 0.444
1.687HisVal: 1.687 ± 0.997
0.844HisTrp: 0.844 ± 0.712
0.422HisTyr: 0.422 ± 0.397
0.0HisXaa: 0.0 ± 0.0
Ile
1.687IleAla: 1.687 ± 0.595
1.687IleCys: 1.687 ± 1.202
2.109IleAsp: 2.109 ± 0.771
2.952IleGlu: 2.952 ± 1.316
1.265IlePhe: 1.265 ± 0.483
2.952IleGly: 2.952 ± 1.422
1.265IleHis: 1.265 ± 0.688
0.844IleIle: 0.844 ± 0.794
1.265IleLys: 1.265 ± 0.679
3.374IleLeu: 3.374 ± 0.776
0.0IleMet: 0.0 ± 0.385
2.531IleAsn: 2.531 ± 0.766
1.687IlePro: 1.687 ± 0.69
1.265IleGln: 1.265 ± 0.97
1.265IleArg: 1.265 ± 1.067
2.109IleSer: 2.109 ± 1.15
2.531IleThr: 2.531 ± 0.903
2.531IleVal: 2.531 ± 0.747
0.422IleTrp: 0.422 ± 0.356
1.687IleTyr: 1.687 ± 0.777
0.0IleXaa: 0.0 ± 0.0
Lys
3.374LysAla: 3.374 ± 1.238
1.687LysCys: 1.687 ± 0.62
1.687LysAsp: 1.687 ± 0.62
1.687LysGlu: 1.687 ± 0.563
0.844LysPhe: 0.844 ± 0.427
2.531LysGly: 2.531 ± 1.203
0.422LysHis: 0.422 ± 0.356
0.422LysIle: 0.422 ± 0.356
2.952LysLys: 2.952 ± 1.519
3.374LysLeu: 3.374 ± 0.589
0.422LysMet: 0.422 ± 0.395
0.844LysAsn: 0.844 ± 0.596
2.109LysPro: 2.109 ± 0.92
2.109LysGln: 2.109 ± 0.754
4.218LysArg: 4.218 ± 1.158
2.109LysSer: 2.109 ± 1.078
2.109LysThr: 2.109 ± 0.781
3.796LysVal: 3.796 ± 0.993
0.0LysTrp: 0.0 ± 0.0
1.687LysTyr: 1.687 ± 0.742
0.0LysXaa: 0.0 ± 0.0
Leu
2.952LeuAla: 2.952 ± 1.414
2.109LeuCys: 2.109 ± 1.675
6.748LeuAsp: 6.748 ± 0.896
3.374LeuGlu: 3.374 ± 1.427
5.483LeuPhe: 5.483 ± 1.832
10.544LeuGly: 10.544 ± 2.535
2.109LeuHis: 2.109 ± 0.708
3.796LeuIle: 3.796 ± 1.649
3.374LeuLys: 3.374 ± 0.907
7.592LeuLeu: 7.592 ± 1.925
2.531LeuMet: 2.531 ± 1.315
2.109LeuAsn: 2.109 ± 0.894
4.639LeuPro: 4.639 ± 1.326
6.748LeuGln: 6.748 ± 1.585
4.639LeuArg: 4.639 ± 1.782
8.013LeuSer: 8.013 ± 1.179
8.013LeuThr: 8.013 ± 2.059
2.952LeuVal: 2.952 ± 1.181
0.844LeuTrp: 0.844 ± 0.438
2.531LeuTyr: 2.531 ± 0.673
0.0LeuXaa: 0.0 ± 0.0
Met
1.687MetAla: 1.687 ± 1.19
0.422MetCys: 0.422 ± 0.409
1.265MetAsp: 1.265 ± 0.718
1.265MetGlu: 1.265 ± 0.863
0.422MetPhe: 0.422 ± 0.395
0.0MetGly: 0.0 ± 0.0
0.422MetHis: 0.422 ± 0.356
0.422MetIle: 0.422 ± 0.579
0.422MetLys: 0.422 ± 0.356
1.687MetLeu: 1.687 ± 1.161
0.844MetMet: 0.844 ± 0.531
0.844MetAsn: 0.844 ± 0.438
0.422MetPro: 0.422 ± 0.356
0.0MetGln: 0.0 ± 0.0
0.844MetArg: 0.844 ± 0.427
2.109MetSer: 2.109 ± 1.15
0.844MetThr: 0.844 ± 0.444
2.952MetVal: 2.952 ± 0.9
0.844MetTrp: 0.844 ± 0.593
0.844MetTyr: 0.844 ± 0.458
0.0MetXaa: 0.0 ± 0.0
Asn
3.374AsnAla: 3.374 ± 1.104
1.265AsnCys: 1.265 ± 0.614
0.844AsnAsp: 0.844 ± 0.438
1.687AsnGlu: 1.687 ± 1.161
1.265AsnPhe: 1.265 ± 0.58
1.687AsnGly: 1.687 ± 1.014
0.422AsnHis: 0.422 ± 0.356
1.265AsnIle: 1.265 ± 0.718
1.687AsnLys: 1.687 ± 0.633
3.374AsnLeu: 3.374 ± 0.859
0.422AsnMet: 0.422 ± 0.44
1.687AsnAsn: 1.687 ± 1.108
3.374AsnPro: 3.374 ± 0.991
3.374AsnGln: 3.374 ± 1.354
2.952AsnArg: 2.952 ± 0.903
3.374AsnSer: 3.374 ± 1.03
1.265AsnThr: 1.265 ± 0.741
1.687AsnVal: 1.687 ± 0.73
1.265AsnTrp: 1.265 ± 0.685
0.422AsnTyr: 0.422 ± 0.356
0.0AsnXaa: 0.0 ± 0.0
Pro
5.061ProAla: 5.061 ± 0.956
0.0ProCys: 0.0 ± 0.0
3.374ProAsp: 3.374 ± 1.487
4.218ProGlu: 4.218 ± 0.84
2.109ProPhe: 2.109 ± 0.711
4.639ProGly: 4.639 ± 1.286
0.844ProHis: 0.844 ± 0.484
2.952ProIle: 2.952 ± 0.994
2.531ProLys: 2.531 ± 0.736
5.061ProLeu: 5.061 ± 1.3
0.844ProMet: 0.844 ± 0.596
2.952ProAsn: 2.952 ± 0.847
8.435ProPro: 8.435 ± 1.615
3.796ProGln: 3.796 ± 1.894
4.639ProArg: 4.639 ± 1.299
6.748ProSer: 6.748 ± 3.163
2.531ProThr: 2.531 ± 0.684
7.17ProVal: 7.17 ± 2.177
1.265ProTrp: 1.265 ± 0.745
2.109ProTyr: 2.109 ± 1.106
0.0ProXaa: 0.0 ± 0.0
Gln
3.796GlnAla: 3.796 ± 1.563
2.109GlnCys: 2.109 ± 1.377
2.109GlnAsp: 2.109 ± 1.014
5.483GlnGlu: 5.483 ± 1.438
2.531GlnPhe: 2.531 ± 1.28
2.952GlnGly: 2.952 ± 0.557
0.844GlnHis: 0.844 ± 0.711
2.531GlnIle: 2.531 ± 0.722
1.687GlnLys: 1.687 ± 0.629
2.952GlnLeu: 2.952 ± 1.407
1.265GlnMet: 1.265 ± 0.625
1.687GlnAsn: 1.687 ± 1.108
1.687GlnPro: 1.687 ± 0.987
2.531GlnGln: 2.531 ± 1.295
2.109GlnArg: 2.109 ± 1.635
3.374GlnSer: 3.374 ± 1.383
1.687GlnThr: 1.687 ± 0.62
1.687GlnVal: 1.687 ± 0.595
1.265GlnTrp: 1.265 ± 0.718
0.844GlnTyr: 0.844 ± 0.753
0.0GlnXaa: 0.0 ± 0.0
Arg
2.952ArgAla: 2.952 ± 0.761
2.109ArgCys: 2.109 ± 1.115
1.687ArgAsp: 1.687 ± 0.742
4.639ArgGlu: 4.639 ± 1.015
2.109ArgPhe: 2.109 ± 1.478
5.061ArgGly: 5.061 ± 1.571
2.952ArgHis: 2.952 ± 1.227
0.0ArgIle: 0.0 ± 0.0
4.218ArgLys: 4.218 ± 0.803
8.013ArgLeu: 8.013 ± 3.509
0.844ArgMet: 0.844 ± 0.561
1.687ArgAsn: 1.687 ± 0.57
6.326ArgPro: 6.326 ± 2.36
2.531ArgGln: 2.531 ± 0.736
7.592ArgArg: 7.592 ± 3.274
4.639ArgSer: 4.639 ± 0.991
3.796ArgThr: 3.796 ± 0.747
4.639ArgVal: 4.639 ± 1.354
1.265ArgTrp: 1.265 ± 0.77
3.796ArgTyr: 3.796 ± 0.868
0.0ArgXaa: 0.0 ± 0.0
Ser
4.639SerAla: 4.639 ± 1.848
0.422SerCys: 0.422 ± 0.488
4.639SerAsp: 4.639 ± 1.174
6.326SerGlu: 6.326 ± 1.202
3.796SerPhe: 3.796 ± 1.271
7.592SerGly: 7.592 ± 3.594
1.687SerHis: 1.687 ± 0.622
3.796SerIle: 3.796 ± 0.711
1.687SerLys: 1.687 ± 0.563
8.013SerLeu: 8.013 ± 1.57
1.265SerMet: 1.265 ± 0.647
2.952SerAsn: 2.952 ± 1.214
5.483SerPro: 5.483 ± 2.029
2.109SerGln: 2.109 ± 1.123
2.531SerArg: 2.531 ± 1.115
5.483SerSer: 5.483 ± 2.104
2.952SerThr: 2.952 ± 1.241
5.061SerVal: 5.061 ± 1.367
1.265SerTrp: 1.265 ± 0.685
2.531SerTyr: 2.531 ± 1.183
0.0SerXaa: 0.0 ± 0.0
Thr
1.687ThrAla: 1.687 ± 0.729
0.422ThrCys: 0.422 ± 0.397
2.109ThrAsp: 2.109 ± 0.621
2.531ThrGlu: 2.531 ± 0.839
2.952ThrPhe: 2.952 ± 1.008
6.748ThrGly: 6.748 ± 0.829
0.844ThrHis: 0.844 ± 0.458
1.687ThrIle: 1.687 ± 1.1
1.687ThrLys: 1.687 ± 1.131
3.796ThrLeu: 3.796 ± 1.336
2.109ThrMet: 2.109 ± 0.711
2.531ThrAsn: 2.531 ± 0.798
5.483ThrPro: 5.483 ± 0.432
0.422ThrGln: 0.422 ± 0.62
5.483ThrArg: 5.483 ± 1.281
5.061ThrSer: 5.061 ± 1.609
3.796ThrThr: 3.796 ± 0.991
4.639ThrVal: 4.639 ± 0.912
1.265ThrTrp: 1.265 ± 0.804
2.531ThrTyr: 2.531 ± 0.903
0.0ThrXaa: 0.0 ± 0.0
Val
4.639ValAla: 4.639 ± 1.668
0.844ValCys: 0.844 ± 1.158
3.374ValAsp: 3.374 ± 0.776
4.639ValGlu: 4.639 ± 1.454
2.952ValPhe: 2.952 ± 1.174
3.374ValGly: 3.374 ± 1.14
1.265ValHis: 1.265 ± 0.429
3.796ValIle: 3.796 ± 1.22
2.531ValLys: 2.531 ± 1.359
4.218ValLeu: 4.218 ± 0.924
0.422ValMet: 0.422 ± 0.397
1.687ValAsn: 1.687 ± 1.036
4.639ValPro: 4.639 ± 1.639
2.531ValGln: 2.531 ± 0.544
7.592ValArg: 7.592 ± 2.597
6.748ValSer: 6.748 ± 2.19
5.061ValThr: 5.061 ± 1.113
4.639ValVal: 4.639 ± 1.25
1.265ValTrp: 1.265 ± 0.58
2.952ValTyr: 2.952 ± 0.856
0.0ValXaa: 0.0 ± 0.0
Trp
0.844TrpAla: 0.844 ± 0.427
0.844TrpCys: 0.844 ± 0.602
0.422TrpAsp: 0.422 ± 0.356
1.687TrpGlu: 1.687 ± 0.777
0.844TrpPhe: 0.844 ± 0.711
1.687TrpGly: 1.687 ± 0.73
0.0TrpHis: 0.0 ± 0.0
0.844TrpIle: 0.844 ± 0.711
0.0TrpLys: 0.0 ± 0.0
3.374TrpLeu: 3.374 ± 0.766
0.422TrpMet: 0.422 ± 0.409
1.265TrpAsn: 1.265 ± 0.386
0.0TrpPro: 0.0 ± 0.0
1.265TrpGln: 1.265 ± 0.605
2.531TrpArg: 2.531 ± 1.019
0.844TrpSer: 0.844 ± 0.565
0.844TrpThr: 0.844 ± 0.819
0.844TrpVal: 0.844 ± 0.438
0.0TrpTrp: 0.0 ± 0.0
0.844TrpTyr: 0.844 ± 0.438
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.109TyrAla: 2.109 ± 0.707
0.844TyrCys: 0.844 ± 0.427
1.265TyrAsp: 1.265 ± 0.455
2.109TyrGlu: 2.109 ± 0.935
1.265TyrPhe: 1.265 ± 0.429
0.844TyrGly: 0.844 ± 0.596
1.265TyrHis: 1.265 ± 0.758
0.844TyrIle: 0.844 ± 0.789
1.265TyrLys: 1.265 ± 0.741
3.374TyrLeu: 3.374 ± 0.912
1.265TyrMet: 1.265 ± 0.779
1.265TyrAsn: 1.265 ± 0.777
1.265TyrPro: 1.265 ± 0.744
1.687TyrGln: 1.687 ± 1.0
4.218TyrArg: 4.218 ± 1.036
0.844TyrSer: 0.844 ± 0.484
1.265TyrThr: 1.265 ± 0.429
0.844TyrVal: 0.844 ± 0.531
1.265TyrTrp: 1.265 ± 0.455
2.531TyrTyr: 2.531 ± 1.514
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2372 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski