Amino acid dipepetide frequency for Macaca mulatta papillomavirus 3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.857AlaAla: 8.857 ± 1.542
2.531AlaCys: 2.531 ± 0.842
3.796AlaAsp: 3.796 ± 1.16
3.796AlaGlu: 3.796 ± 1.447
2.109AlaPhe: 2.109 ± 0.888
7.592AlaGly: 7.592 ± 0.693
0.422AlaHis: 0.422 ± 0.331
2.952AlaIle: 2.952 ± 0.487
5.905AlaLys: 5.905 ± 0.961
6.326AlaLeu: 6.326 ± 1.533
0.844AlaMet: 0.844 ± 0.587
2.952AlaAsn: 2.952 ± 0.662
6.326AlaPro: 6.326 ± 1.938
2.952AlaGln: 2.952 ± 0.921
2.531AlaArg: 2.531 ± 0.491
2.952AlaSer: 2.952 ± 0.926
5.483AlaThr: 5.483 ± 1.554
8.013AlaVal: 8.013 ± 1.569
0.422AlaTrp: 0.422 ± 0.468
3.374AlaTyr: 3.374 ± 1.56
0.0AlaXaa: 0.0 ± 0.0
Cys
2.109CysAla: 2.109 ± 1.12
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.422CysGlu: 0.422 ± 0.468
0.844CysPhe: 0.844 ± 0.409
1.265CysGly: 1.265 ± 0.985
0.844CysHis: 0.844 ± 1.095
1.687CysIle: 1.687 ± 0.929
2.952CysLys: 2.952 ± 1.209
1.265CysLeu: 1.265 ± 1.126
0.422CysMet: 0.422 ± 0.548
0.844CysAsn: 0.844 ± 0.531
1.687CysPro: 1.687 ± 0.614
2.531CysGln: 2.531 ± 1.238
0.422CysArg: 0.422 ± 0.331
2.952CysSer: 2.952 ± 1.186
0.844CysThr: 0.844 ± 0.662
1.687CysVal: 1.687 ± 0.585
1.687CysTrp: 1.687 ± 0.585
0.844CysTyr: 0.844 ± 0.734
0.0CysXaa: 0.0 ± 0.0
Asp
6.748AspAla: 6.748 ± 1.685
1.687AspCys: 1.687 ± 0.874
4.218AspAsp: 4.218 ± 1.81
2.109AspGlu: 2.109 ± 1.225
1.687AspPhe: 1.687 ± 0.608
4.639AspGly: 4.639 ± 1.291
0.844AspHis: 0.844 ± 0.409
3.374AspIle: 3.374 ± 1.726
1.687AspLys: 1.687 ± 1.539
5.061AspLeu: 5.061 ± 0.724
0.844AspMet: 0.844 ± 0.377
2.531AspAsn: 2.531 ± 0.452
5.483AspPro: 5.483 ± 1.185
1.687AspGln: 1.687 ± 1.042
2.109AspArg: 2.109 ± 0.66
5.483AspSer: 5.483 ± 1.697
7.592AspThr: 7.592 ± 1.639
2.531AspVal: 2.531 ± 1.014
1.265AspTrp: 1.265 ± 0.627
1.265AspTyr: 1.265 ± 0.695
0.0AspXaa: 0.0 ± 0.0
Glu
3.374GluAla: 3.374 ± 1.226
0.422GluCys: 0.422 ± 0.508
5.061GluAsp: 5.061 ± 1.012
5.061GluGlu: 5.061 ± 1.531
1.687GluPhe: 1.687 ± 0.585
5.061GluGly: 5.061 ± 1.444
0.844GluHis: 0.844 ± 0.377
0.422GluIle: 0.422 ± 0.331
1.265GluLys: 1.265 ± 0.735
2.109GluLeu: 2.109 ± 0.962
1.265GluMet: 1.265 ± 0.579
0.422GluAsn: 0.422 ± 0.333
5.061GluPro: 5.061 ± 2.345
3.796GluGln: 3.796 ± 1.079
0.844GluArg: 0.844 ± 0.71
2.952GluSer: 2.952 ± 0.966
2.109GluThr: 2.109 ± 0.697
5.905GluVal: 5.905 ± 1.405
0.844GluTrp: 0.844 ± 0.662
0.844GluTyr: 0.844 ± 0.377
0.0GluXaa: 0.0 ± 0.0
Phe
3.374PheAla: 3.374 ± 0.907
0.422PheCys: 0.422 ± 0.548
1.265PheAsp: 1.265 ± 0.933
1.265PheGlu: 1.265 ± 0.669
1.265PhePhe: 1.265 ± 0.629
4.218PheGly: 4.218 ± 1.182
0.0PheHis: 0.0 ± 0.0
1.687PheIle: 1.687 ± 0.754
2.109PheLys: 2.109 ± 1.246
5.905PheLeu: 5.905 ± 1.874
0.844PheMet: 0.844 ± 0.662
0.844PheAsn: 0.844 ± 0.665
1.265PhePro: 1.265 ± 0.411
0.844PheGln: 0.844 ± 0.665
2.109PheArg: 2.109 ± 0.642
2.109PheSer: 2.109 ± 0.912
1.265PheThr: 1.265 ± 0.993
0.844PheVal: 0.844 ± 0.556
0.844PheTrp: 0.844 ± 0.377
1.265PheTyr: 1.265 ± 0.933
0.0PheXaa: 0.0 ± 0.0
Gly
5.061GlyAla: 5.061 ± 1.063
0.844GlyCys: 0.844 ± 0.521
6.748GlyAsp: 6.748 ± 2.327
3.796GlyGlu: 3.796 ± 0.762
1.687GlyPhe: 1.687 ± 0.608
3.374GlyGly: 3.374 ± 1.383
3.796GlyHis: 3.796 ± 1.624
2.531GlyIle: 2.531 ± 0.724
3.374GlyLys: 3.374 ± 0.595
5.483GlyLeu: 5.483 ± 2.456
0.844GlyMet: 0.844 ± 0.582
3.374GlyAsn: 3.374 ± 0.886
2.952GlyPro: 2.952 ± 1.102
2.952GlyGln: 2.952 ± 0.628
4.218GlyArg: 4.218 ± 0.871
7.17GlySer: 7.17 ± 2.206
5.483GlyThr: 5.483 ± 2.801
3.796GlyVal: 3.796 ± 0.638
0.844GlyTrp: 0.844 ± 0.543
1.687GlyTyr: 1.687 ± 0.573
0.0GlyXaa: 0.0 ± 0.0
His
1.687HisAla: 1.687 ± 0.683
0.422HisCys: 0.422 ± 0.508
0.844HisAsp: 0.844 ± 0.411
0.422HisGlu: 0.422 ± 0.331
1.687HisPhe: 1.687 ± 0.481
2.952HisGly: 2.952 ± 1.103
0.0HisHis: 0.0 ± 0.0
0.0HisIle: 0.0 ± 0.0
2.109HisLys: 2.109 ± 1.169
1.265HisLeu: 1.265 ± 1.117
1.265HisMet: 1.265 ± 0.759
0.422HisAsn: 0.422 ± 0.333
2.531HisPro: 2.531 ± 1.062
1.265HisGln: 1.265 ± 0.815
0.422HisArg: 0.422 ± 0.331
1.687HisSer: 1.687 ± 0.573
1.265HisThr: 1.265 ± 0.509
1.687HisVal: 1.687 ± 1.145
0.844HisTrp: 0.844 ± 0.521
1.265HisTyr: 1.265 ± 0.372
0.0HisXaa: 0.0 ± 0.0
Ile
2.952IleAla: 2.952 ± 0.453
1.265IleCys: 1.265 ± 0.998
2.109IleAsp: 2.109 ± 1.289
2.952IleGlu: 2.952 ± 0.66
1.265IlePhe: 1.265 ± 0.701
2.531IleGly: 2.531 ± 0.99
0.844IleHis: 0.844 ± 0.521
1.265IleIle: 1.265 ± 0.411
0.422IleLys: 0.422 ± 0.331
1.687IleLeu: 1.687 ± 0.615
0.0IleMet: 0.0 ± 0.0
0.0IleAsn: 0.0 ± 0.0
2.952IlePro: 2.952 ± 0.725
0.844IleGln: 0.844 ± 0.377
1.265IleArg: 1.265 ± 0.664
4.218IleSer: 4.218 ± 1.258
2.531IleThr: 2.531 ± 0.615
4.218IleVal: 4.218 ± 2.155
0.422IleTrp: 0.422 ± 0.468
2.952IleTyr: 2.952 ± 0.779
0.0IleXaa: 0.0 ± 0.0
Lys
4.218LysAla: 4.218 ± 1.371
1.687LysCys: 1.687 ± 0.85
2.109LysAsp: 2.109 ± 0.629
2.531LysGlu: 2.531 ± 1.065
2.952LysPhe: 2.952 ± 1.117
2.109LysGly: 2.109 ± 0.803
2.109LysHis: 2.109 ± 0.986
2.952LysIle: 2.952 ± 0.817
2.109LysLys: 2.109 ± 0.655
1.687LysLeu: 1.687 ± 0.481
0.422LysMet: 0.422 ± 0.333
0.844LysAsn: 0.844 ± 0.624
2.109LysPro: 2.109 ± 0.896
2.531LysGln: 2.531 ± 0.686
5.061LysArg: 5.061 ± 1.157
3.796LysSer: 3.796 ± 1.013
0.844LysThr: 0.844 ± 0.662
3.374LysVal: 3.374 ± 1.236
0.422LysTrp: 0.422 ± 0.468
1.687LysTyr: 1.687 ± 0.718
0.0LysXaa: 0.0 ± 0.0
Leu
3.796LeuAla: 3.796 ± 1.047
3.374LeuCys: 3.374 ± 1.544
7.17LeuAsp: 7.17 ± 0.783
3.796LeuGlu: 3.796 ± 1.543
3.796LeuPhe: 3.796 ± 0.94
5.483LeuGly: 5.483 ± 0.98
2.531LeuHis: 2.531 ± 0.906
2.952LeuIle: 2.952 ± 1.234
4.218LeuLys: 4.218 ± 1.217
8.857LeuLeu: 8.857 ± 2.139
1.687LeuMet: 1.687 ± 0.651
2.952LeuAsn: 2.952 ± 1.113
2.109LeuPro: 2.109 ± 1.112
6.748LeuGln: 6.748 ± 1.545
6.326LeuArg: 6.326 ± 1.762
5.483LeuSer: 5.483 ± 1.847
4.639LeuThr: 4.639 ± 0.555
2.531LeuVal: 2.531 ± 0.693
1.687LeuTrp: 1.687 ± 0.282
5.483LeuTyr: 5.483 ± 1.338
0.0LeuXaa: 0.0 ± 0.0
Met
1.687MetAla: 1.687 ± 0.689
0.422MetCys: 0.422 ± 0.331
0.844MetAsp: 0.844 ± 0.665
1.265MetGlu: 1.265 ± 0.684
1.265MetPhe: 1.265 ± 0.629
0.422MetGly: 0.422 ± 0.331
2.109MetHis: 2.109 ± 0.841
0.0MetIle: 0.0 ± 0.0
0.0MetLys: 0.0 ± 0.0
1.687MetLeu: 1.687 ± 0.972
0.0MetMet: 0.0 ± 0.0
0.844MetAsn: 0.844 ± 0.377
0.0MetPro: 0.0 ± 0.0
0.844MetGln: 0.844 ± 0.377
0.0MetArg: 0.0 ± 0.0
1.687MetSer: 1.687 ± 0.929
1.687MetThr: 1.687 ± 0.642
2.952MetVal: 2.952 ± 1.55
0.422MetTrp: 0.422 ± 0.468
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.952AsnAla: 2.952 ± 1.348
0.422AsnCys: 0.422 ± 0.331
1.265AsnAsp: 1.265 ± 0.761
0.844AsnGlu: 0.844 ± 0.669
1.265AsnPhe: 1.265 ± 0.697
1.687AsnGly: 1.687 ± 0.874
0.422AsnHis: 0.422 ± 0.331
1.265AsnIle: 1.265 ± 0.836
2.109AsnLys: 2.109 ± 1.325
1.265AsnLeu: 1.265 ± 0.567
1.687AsnMet: 1.687 ± 0.581
0.844AsnAsn: 0.844 ± 0.665
2.531AsnPro: 2.531 ± 0.783
0.844AsnGln: 0.844 ± 0.377
1.265AsnArg: 1.265 ± 0.627
2.109AsnSer: 2.109 ± 0.435
2.109AsnThr: 2.109 ± 0.608
2.109AsnVal: 2.109 ± 0.661
0.844AsnTrp: 0.844 ± 0.624
0.844AsnTyr: 0.844 ± 0.624
0.0AsnXaa: 0.0 ± 0.0
Pro
5.061ProAla: 5.061 ± 1.369
1.265ProCys: 1.265 ± 0.377
6.748ProAsp: 6.748 ± 1.067
2.952ProGlu: 2.952 ± 1.025
0.844ProPhe: 0.844 ± 0.624
3.796ProGly: 3.796 ± 1.247
0.422ProHis: 0.422 ± 0.468
2.109ProIle: 2.109 ± 1.261
3.796ProLys: 3.796 ± 0.467
6.748ProLeu: 6.748 ± 1.61
0.422ProMet: 0.422 ± 0.331
2.109ProAsn: 2.109 ± 0.954
6.748ProPro: 6.748 ± 1.758
1.687ProGln: 1.687 ± 0.66
3.374ProArg: 3.374 ± 0.768
5.061ProSer: 5.061 ± 0.961
5.905ProThr: 5.905 ± 2.324
5.905ProVal: 5.905 ± 2.073
0.422ProTrp: 0.422 ± 0.479
2.952ProTyr: 2.952 ± 0.847
0.0ProXaa: 0.0 ± 0.0
Gln
4.218GlnAla: 4.218 ± 1.143
0.844GlnCys: 0.844 ± 0.624
2.952GlnAsp: 2.952 ± 0.919
3.374GlnGlu: 3.374 ± 1.111
2.952GlnPhe: 2.952 ± 0.8
2.109GlnGly: 2.109 ± 0.487
0.0GlnHis: 0.0 ± 0.0
2.109GlnIle: 2.109 ± 0.629
0.422GlnLys: 0.422 ± 0.333
5.483GlnLeu: 5.483 ± 1.532
0.422GlnMet: 0.422 ± 0.333
0.422GlnAsn: 0.422 ± 0.331
3.374GlnPro: 3.374 ± 0.886
3.374GlnGln: 3.374 ± 0.694
2.952GlnArg: 2.952 ± 1.581
2.109GlnSer: 2.109 ± 0.72
3.796GlnThr: 3.796 ± 1.219
3.374GlnVal: 3.374 ± 0.639
0.422GlnTrp: 0.422 ± 0.331
1.687GlnTyr: 1.687 ± 0.996
0.0GlnXaa: 0.0 ± 0.0
Arg
5.061ArgAla: 5.061 ± 1.447
2.952ArgCys: 2.952 ± 1.743
1.265ArgAsp: 1.265 ± 0.933
1.265ArgGlu: 1.265 ± 0.933
1.265ArgPhe: 1.265 ± 0.411
2.109ArgGly: 2.109 ± 0.492
2.531ArgHis: 2.531 ± 0.693
1.265ArgIle: 1.265 ± 0.372
4.639ArgLys: 4.639 ± 0.469
7.17ArgLeu: 7.17 ± 0.681
1.687ArgMet: 1.687 ± 0.624
0.844ArgAsn: 0.844 ± 0.662
5.061ArgPro: 5.061 ± 0.752
1.687ArgGln: 1.687 ± 1.107
6.748ArgArg: 6.748 ± 2.345
3.796ArgSer: 3.796 ± 0.505
2.109ArgThr: 2.109 ± 0.938
3.374ArgVal: 3.374 ± 1.458
1.687ArgTrp: 1.687 ± 0.902
1.687ArgTyr: 1.687 ± 0.992
0.0ArgXaa: 0.0 ± 0.0
Ser
6.326SerAla: 6.326 ± 1.505
1.265SerCys: 1.265 ± 0.693
6.326SerAsp: 6.326 ± 1.592
2.531SerGlu: 2.531 ± 1.071
1.687SerPhe: 1.687 ± 0.972
6.748SerGly: 6.748 ± 2.256
0.844SerHis: 0.844 ± 0.662
3.374SerIle: 3.374 ± 1.29
3.374SerLys: 3.374 ± 1.001
6.748SerLeu: 6.748 ± 2.189
1.687SerMet: 1.687 ± 0.581
2.952SerAsn: 2.952 ± 1.55
4.639SerPro: 4.639 ± 0.783
2.109SerGln: 2.109 ± 0.616
6.326SerArg: 6.326 ± 0.74
13.075SerSer: 13.075 ± 3.139
7.17SerThr: 7.17 ± 2.027
2.109SerVal: 2.109 ± 0.836
0.422SerTrp: 0.422 ± 0.468
2.109SerTyr: 2.109 ± 0.655
0.0SerXaa: 0.0 ± 0.0
Thr
3.796ThrAla: 3.796 ± 1.027
1.687ThrCys: 1.687 ± 0.726
1.687ThrAsp: 1.687 ± 0.498
4.218ThrGlu: 4.218 ± 0.836
1.265ThrPhe: 1.265 ± 0.695
5.061ThrGly: 5.061 ± 1.531
1.265ThrHis: 1.265 ± 0.693
1.687ThrIle: 1.687 ± 0.761
1.687ThrLys: 1.687 ± 1.042
7.592ThrLeu: 7.592 ± 2.139
1.687ThrMet: 1.687 ± 0.754
1.687ThrAsn: 1.687 ± 0.877
5.483ThrPro: 5.483 ± 1.631
3.796ThrGln: 3.796 ± 0.77
2.952ThrArg: 2.952 ± 1.15
6.326ThrSer: 6.326 ± 2.057
4.218ThrThr: 4.218 ± 1.684
6.748ThrVal: 6.748 ± 1.328
0.844ThrTrp: 0.844 ± 0.409
1.265ThrTyr: 1.265 ± 0.629
0.0ThrXaa: 0.0 ± 0.0
Val
5.483ValAla: 5.483 ± 1.626
2.952ValCys: 2.952 ± 0.974
3.796ValAsp: 3.796 ± 0.686
5.061ValGlu: 5.061 ± 1.386
2.531ValPhe: 2.531 ± 0.491
5.483ValGly: 5.483 ± 1.591
2.531ValHis: 2.531 ± 1.586
2.109ValIle: 2.109 ± 0.655
1.265ValLys: 1.265 ± 0.579
4.218ValLeu: 4.218 ± 1.039
1.265ValMet: 1.265 ± 0.627
2.109ValAsn: 2.109 ± 0.608
5.483ValPro: 5.483 ± 0.985
4.639ValGln: 4.639 ± 1.102
4.218ValArg: 4.218 ± 1.355
5.061ValSer: 5.061 ± 1.279
3.796ValThr: 3.796 ± 1.223
4.639ValVal: 4.639 ± 0.973
0.844ValTrp: 0.844 ± 0.556
1.265ValTyr: 1.265 ± 0.629
0.0ValXaa: 0.0 ± 0.0
Trp
0.844TrpAla: 0.844 ± 0.665
0.0TrpCys: 0.0 ± 0.0
1.687TrpAsp: 1.687 ± 1.085
0.422TrpGlu: 0.422 ± 0.468
0.422TrpPhe: 0.422 ± 0.331
1.265TrpGly: 1.265 ± 0.411
0.422TrpHis: 0.422 ± 0.479
0.844TrpIle: 0.844 ± 0.662
0.844TrpLys: 0.844 ± 0.409
0.844TrpLeu: 0.844 ± 0.377
0.0TrpMet: 0.0 ± 0.0
1.265TrpAsn: 1.265 ± 0.567
0.422TrpPro: 0.422 ± 0.331
0.422TrpGln: 0.422 ± 0.468
2.531TrpArg: 2.531 ± 1.091
1.265TrpSer: 1.265 ± 0.815
1.265TrpThr: 1.265 ± 0.964
0.422TrpVal: 0.422 ± 0.331
0.0TrpTrp: 0.0 ± 0.0
0.844TrpTyr: 0.844 ± 0.409
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.531TyrAla: 2.531 ± 0.868
0.844TyrCys: 0.844 ± 0.409
2.952TyrAsp: 2.952 ± 0.886
1.265TyrGlu: 1.265 ± 0.815
1.265TyrPhe: 1.265 ± 0.411
2.531TyrGly: 2.531 ± 0.554
1.265TyrHis: 1.265 ± 0.535
2.109TyrIle: 2.109 ± 0.987
1.265TyrLys: 1.265 ± 0.684
4.218TyrLeu: 4.218 ± 1.308
0.422TyrMet: 0.422 ± 0.468
0.422TyrAsn: 0.422 ± 0.333
2.109TyrPro: 2.109 ± 1.27
0.844TyrGln: 0.844 ± 0.662
2.531TyrArg: 2.531 ± 0.843
2.531TyrSer: 2.531 ± 1.181
0.844TyrThr: 0.844 ± 0.377
2.531TyrVal: 2.531 ± 1.034
0.844TyrTrp: 0.844 ± 0.377
2.109TyrTyr: 2.109 ± 0.821
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2372 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski