Amino acid dipepetide frequency for Eidolon helvum papillomavirus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.628AlaAla: 5.628 ± 1.374
0.866AlaCys: 0.866 ± 1.377
4.329AlaAsp: 4.329 ± 1.464
5.195AlaGlu: 5.195 ± 1.224
3.896AlaPhe: 3.896 ± 0.793
3.896AlaGly: 3.896 ± 1.238
0.433AlaHis: 0.433 ± 0.334
2.165AlaIle: 2.165 ± 1.073
2.165AlaLys: 2.165 ± 0.858
4.329AlaLeu: 4.329 ± 1.436
1.299AlaMet: 1.299 ± 1.002
1.732AlaAsn: 1.732 ± 0.715
3.896AlaPro: 3.896 ± 1.096
2.165AlaGln: 2.165 ± 0.693
3.463AlaArg: 3.463 ± 1.285
5.195AlaSer: 5.195 ± 0.619
3.463AlaThr: 3.463 ± 1.066
3.896AlaVal: 3.896 ± 1.835
0.0AlaTrp: 0.0 ± 0.0
1.299AlaTyr: 1.299 ± 0.828
0.0AlaXaa: 0.0 ± 0.0
Cys
1.299CysAla: 1.299 ± 1.327
0.433CysCys: 0.433 ± 0.334
0.0CysAsp: 0.0 ± 0.0
0.866CysGlu: 0.866 ± 0.668
0.866CysPhe: 0.866 ± 0.668
1.299CysGly: 1.299 ± 1.176
0.433CysHis: 0.433 ± 0.688
1.732CysIle: 1.732 ± 1.403
0.866CysLys: 0.866 ± 0.472
1.732CysLeu: 1.732 ± 1.185
0.0CysMet: 0.0 ± 0.0
0.433CysAsn: 0.433 ± 0.688
1.732CysPro: 1.732 ± 0.914
0.866CysGln: 0.866 ± 1.377
1.299CysArg: 1.299 ± 1.237
0.866CysSer: 0.866 ± 0.668
3.03CysThr: 3.03 ± 1.059
0.433CysVal: 0.433 ± 0.334
0.0CysTrp: 0.0 ± 0.0
0.433CysTyr: 0.433 ± 0.677
0.0CysXaa: 0.0 ± 0.0
Asp
2.597AspAla: 2.597 ± 0.532
0.433AspCys: 0.433 ± 0.334
3.03AspAsp: 3.03 ± 1.511
4.329AspGlu: 4.329 ± 1.137
1.732AspPhe: 1.732 ± 0.557
3.463AspGly: 3.463 ± 0.952
0.0AspHis: 0.0 ± 0.0
3.03AspIle: 3.03 ± 1.216
2.597AspLys: 2.597 ± 0.989
5.628AspLeu: 5.628 ± 1.99
1.299AspMet: 1.299 ± 0.748
2.597AspAsn: 2.597 ± 0.955
5.195AspPro: 5.195 ± 1.617
3.463AspGln: 3.463 ± 1.066
3.463AspArg: 3.463 ± 1.338
3.463AspSer: 3.463 ± 0.637
6.061AspThr: 6.061 ± 0.99
5.628AspVal: 5.628 ± 2.28
0.866AspTrp: 0.866 ± 0.484
2.597AspTyr: 2.597 ± 0.864
0.0AspXaa: 0.0 ± 0.0
Glu
6.494GluAla: 6.494 ± 1.532
2.165GluCys: 2.165 ± 1.061
7.792GluAsp: 7.792 ± 1.135
6.494GluGlu: 6.494 ± 2.4
1.299GluPhe: 1.299 ± 1.217
2.597GluGly: 2.597 ± 0.989
1.732GluHis: 1.732 ± 0.715
4.762GluIle: 4.762 ± 1.505
3.896GluLys: 3.896 ± 2.827
6.061GluLeu: 6.061 ± 1.666
1.299GluMet: 1.299 ± 0.656
4.329GluAsn: 4.329 ± 1.078
2.165GluPro: 2.165 ± 1.076
2.165GluGln: 2.165 ± 0.779
2.165GluArg: 2.165 ± 1.517
3.896GluSer: 3.896 ± 1.457
5.195GluThr: 5.195 ± 2.075
3.896GluVal: 3.896 ± 1.403
0.866GluTrp: 0.866 ± 0.668
2.165GluTyr: 2.165 ± 1.012
0.0GluXaa: 0.0 ± 0.0
Phe
2.165PheAla: 2.165 ± 0.424
2.165PheCys: 2.165 ± 1.761
2.165PheAsp: 2.165 ± 0.97
6.061PheGlu: 6.061 ± 1.995
1.299PhePhe: 1.299 ± 0.377
3.03PheGly: 3.03 ± 1.363
0.0PheHis: 0.0 ± 0.0
2.597PheIle: 2.597 ± 0.91
0.433PheLys: 0.433 ± 0.334
6.061PheLeu: 6.061 ± 1.569
0.433PheMet: 0.433 ± 0.334
3.03PheAsn: 3.03 ± 1.277
0.866PhePro: 0.866 ± 0.421
2.165PheGln: 2.165 ± 1.253
1.299PheArg: 1.299 ± 0.647
0.866PheSer: 0.866 ± 0.421
2.165PheThr: 2.165 ± 0.829
3.463PheVal: 3.463 ± 1.756
2.165PheTrp: 2.165 ± 0.903
1.732PheTyr: 1.732 ± 1.136
0.0PheXaa: 0.0 ± 0.0
Gly
4.762GlyAla: 4.762 ± 1.017
1.299GlyCys: 1.299 ± 0.725
3.463GlyAsp: 3.463 ± 1.682
3.463GlyGlu: 3.463 ± 1.491
0.433GlyPhe: 0.433 ± 0.418
6.926GlyGly: 6.926 ± 2.894
1.299GlyHis: 1.299 ± 1.195
3.896GlyIle: 3.896 ± 1.461
0.866GlyLys: 0.866 ± 0.668
6.061GlyLeu: 6.061 ± 1.335
0.433GlyMet: 0.433 ± 0.69
3.896GlyAsn: 3.896 ± 1.155
3.03GlyPro: 3.03 ± 1.89
2.165GlyGln: 2.165 ± 1.058
5.628GlyArg: 5.628 ± 0.924
7.792GlySer: 7.792 ± 1.747
4.329GlyThr: 4.329 ± 3.019
5.195GlyVal: 5.195 ± 1.177
0.866GlyTrp: 0.866 ± 0.668
0.0GlyTyr: 0.0 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
0.433HisAla: 0.433 ± 0.398
0.433HisCys: 0.433 ± 0.677
0.433HisAsp: 0.433 ± 0.406
0.866HisGlu: 0.866 ± 0.42
2.165HisPhe: 2.165 ± 0.748
0.433HisGly: 0.433 ± 0.406
0.0HisHis: 0.0 ± 0.0
0.433HisIle: 0.433 ± 0.334
1.732HisLys: 1.732 ± 1.302
1.732HisLeu: 1.732 ± 0.648
0.433HisMet: 0.433 ± 0.398
0.0HisAsn: 0.0 ± 0.0
2.165HisPro: 2.165 ± 0.786
1.732HisGln: 1.732 ± 0.967
0.866HisArg: 0.866 ± 0.484
0.866HisSer: 0.866 ± 0.484
1.299HisThr: 1.299 ± 0.377
2.597HisVal: 2.597 ± 1.113
0.433HisTrp: 0.433 ± 0.398
1.732HisTyr: 1.732 ± 0.967
0.0HisXaa: 0.0 ± 0.0
Ile
2.165IleAla: 2.165 ± 0.874
0.866IleCys: 0.866 ± 0.702
3.896IleAsp: 3.896 ± 1.07
5.195IleGlu: 5.195 ± 1.937
2.165IlePhe: 2.165 ± 0.693
4.329IleGly: 4.329 ± 1.971
0.0IleHis: 0.0 ± 0.0
3.463IleIle: 3.463 ± 1.066
2.165IleLys: 2.165 ± 1.212
3.896IleLeu: 3.896 ± 0.827
1.732IleMet: 1.732 ± 0.931
0.433IleAsn: 0.433 ± 0.406
3.463IlePro: 3.463 ± 2.243
1.732IleGln: 1.732 ± 0.805
1.732IleArg: 1.732 ± 0.769
0.866IleSer: 0.866 ± 0.421
3.896IleThr: 3.896 ± 1.682
3.03IleVal: 3.03 ± 1.566
0.0IleTrp: 0.0 ± 0.0
2.165IleTyr: 2.165 ± 1.15
0.0IleXaa: 0.0 ± 0.0
Lys
1.732LysAla: 1.732 ± 0.967
1.732LysCys: 1.732 ± 0.715
2.597LysAsp: 2.597 ± 1.246
1.732LysGlu: 1.732 ± 0.909
3.03LysPhe: 3.03 ± 1.27
0.866LysGly: 0.866 ± 0.835
3.03LysHis: 3.03 ± 0.828
2.165LysIle: 2.165 ± 0.693
3.896LysLys: 3.896 ± 1.726
4.762LysLeu: 4.762 ± 1.718
0.866LysMet: 0.866 ± 0.42
2.165LysAsn: 2.165 ± 0.866
3.463LysPro: 3.463 ± 0.916
2.165LysGln: 2.165 ± 1.325
5.628LysArg: 5.628 ± 0.973
3.03LysSer: 3.03 ± 1.211
1.299LysThr: 1.299 ± 0.767
3.463LysVal: 3.463 ± 1.066
0.0LysTrp: 0.0 ± 0.0
2.597LysTyr: 2.597 ± 0.962
0.0LysXaa: 0.0 ± 0.0
Leu
7.359LeuAla: 7.359 ± 1.326
1.732LeuCys: 1.732 ± 1.143
5.628LeuAsp: 5.628 ± 1.92
3.463LeuGlu: 3.463 ± 1.269
6.926LeuPhe: 6.926 ± 2.68
4.762LeuGly: 4.762 ± 0.876
2.597LeuHis: 2.597 ± 1.09
1.299LeuIle: 1.299 ± 0.462
6.061LeuLys: 6.061 ± 1.011
5.195LeuLeu: 5.195 ± 2.59
1.732LeuMet: 1.732 ± 1.237
1.299LeuAsn: 1.299 ± 1.237
4.329LeuPro: 4.329 ± 1.583
3.896LeuGln: 3.896 ± 0.976
3.896LeuArg: 3.896 ± 2.154
4.329LeuSer: 4.329 ± 0.883
6.926LeuThr: 6.926 ± 1.829
3.03LeuVal: 3.03 ± 1.173
0.866LeuTrp: 0.866 ± 0.498
6.061LeuTyr: 6.061 ± 1.246
0.0LeuXaa: 0.0 ± 0.0
Met
1.732MetAla: 1.732 ± 0.774
0.0MetCys: 0.0 ± 0.0
1.299MetAsp: 1.299 ± 0.748
0.866MetGlu: 0.866 ± 0.835
1.299MetPhe: 1.299 ± 0.773
0.0MetGly: 0.0 ± 0.0
1.299MetHis: 1.299 ± 0.462
0.433MetIle: 0.433 ± 0.334
0.866MetLys: 0.866 ± 0.42
0.866MetLeu: 0.866 ± 0.668
0.433MetMet: 0.433 ± 0.398
0.866MetAsn: 0.866 ± 0.668
1.299MetPro: 1.299 ± 0.696
0.866MetGln: 0.866 ± 0.668
0.866MetArg: 0.866 ± 0.748
0.866MetSer: 0.866 ± 0.691
1.299MetThr: 1.299 ± 0.377
1.299MetVal: 1.299 ± 0.643
0.0MetTrp: 0.0 ± 0.0
0.433MetTyr: 0.433 ± 0.418
0.0MetXaa: 0.0 ± 0.0
Asn
1.732AsnAla: 1.732 ± 0.805
0.866AsnCys: 0.866 ± 0.691
2.597AsnAsp: 2.597 ± 1.364
2.597AsnGlu: 2.597 ± 1.056
1.299AsnPhe: 1.299 ± 0.767
2.597AsnGly: 2.597 ± 0.609
0.433AsnHis: 0.433 ± 0.334
1.732AsnIle: 1.732 ± 0.533
0.866AsnLys: 0.866 ± 0.797
3.463AsnLeu: 3.463 ± 0.974
0.0AsnMet: 0.0 ± 0.0
2.165AsnAsn: 2.165 ± 1.012
2.597AsnPro: 2.597 ± 0.752
1.732AsnGln: 1.732 ± 1.122
3.896AsnArg: 3.896 ± 1.364
2.165AsnSer: 2.165 ± 0.903
3.896AsnThr: 3.896 ± 1.218
1.732AsnVal: 1.732 ± 0.805
0.866AsnTrp: 0.866 ± 0.788
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
5.628ProAla: 5.628 ± 1.507
0.866ProCys: 0.866 ± 0.702
4.762ProAsp: 4.762 ± 1.18
3.463ProGlu: 3.463 ± 0.891
2.165ProPhe: 2.165 ± 1.034
3.463ProGly: 3.463 ± 1.14
0.0ProHis: 0.0 ± 0.0
3.896ProIle: 3.896 ± 1.79
4.762ProLys: 4.762 ± 1.302
6.494ProLeu: 6.494 ± 1.439
0.433ProMet: 0.433 ± 0.334
0.866ProAsn: 0.866 ± 0.797
6.494ProPro: 6.494 ± 1.863
3.463ProGln: 3.463 ± 0.963
2.165ProArg: 2.165 ± 0.882
5.195ProSer: 5.195 ± 1.974
4.329ProThr: 4.329 ± 2.508
2.165ProVal: 2.165 ± 1.533
0.0ProTrp: 0.0 ± 0.0
2.597ProTyr: 2.597 ± 1.546
0.0ProXaa: 0.0 ± 0.0
Gln
1.732GlnAla: 1.732 ± 0.985
0.0GlnCys: 0.0 ± 0.0
0.866GlnAsp: 0.866 ± 0.472
3.463GlnGlu: 3.463 ± 1.594
1.299GlnPhe: 1.299 ± 0.767
3.463GlnGly: 3.463 ± 1.494
1.299GlnHis: 1.299 ± 0.635
1.732GlnIle: 1.732 ± 0.725
2.165GlnLys: 2.165 ± 1.373
4.762GlnLeu: 4.762 ± 1.206
3.03GlnMet: 3.03 ± 1.456
0.433GlnAsn: 0.433 ± 0.398
2.165GlnPro: 2.165 ± 0.6
3.03GlnGln: 3.03 ± 1.2
2.597GlnArg: 2.597 ± 0.651
2.597GlnSer: 2.597 ± 1.047
3.03GlnThr: 3.03 ± 0.909
3.03GlnVal: 3.03 ± 0.973
1.299GlnTrp: 1.299 ± 0.643
1.732GlnTyr: 1.732 ± 0.944
0.0GlnXaa: 0.0 ± 0.0
Arg
3.463ArgAla: 3.463 ± 0.354
0.866ArgCys: 0.866 ± 0.702
0.866ArgAsp: 0.866 ± 0.5
5.195ArgGlu: 5.195 ± 0.899
3.03ArgPhe: 3.03 ± 1.211
5.628ArgGly: 5.628 ± 1.552
1.732ArgHis: 1.732 ± 0.995
2.165ArgIle: 2.165 ± 1.317
4.762ArgLys: 4.762 ± 1.09
4.762ArgLeu: 4.762 ± 1.568
0.433ArgMet: 0.433 ± 0.334
1.299ArgAsn: 1.299 ± 0.719
3.896ArgPro: 3.896 ± 0.924
1.299ArgGln: 1.299 ± 0.719
7.792ArgArg: 7.792 ± 3.733
3.896ArgSer: 3.896 ± 0.804
5.628ArgThr: 5.628 ± 2.038
6.926ArgVal: 6.926 ± 1.432
1.299ArgTrp: 1.299 ± 0.647
2.165ArgTyr: 2.165 ± 0.621
0.0ArgXaa: 0.0 ± 0.0
Ser
1.732SerAla: 1.732 ± 0.755
0.433SerCys: 0.433 ± 0.677
4.762SerAsp: 4.762 ± 1.689
4.762SerGlu: 4.762 ± 0.586
3.03SerPhe: 3.03 ± 1.16
5.628SerGly: 5.628 ± 1.959
0.866SerHis: 0.866 ± 0.421
3.03SerIle: 3.03 ± 1.846
3.896SerLys: 3.896 ± 0.881
3.463SerLeu: 3.463 ± 1.325
0.433SerMet: 0.433 ± 0.398
2.597SerAsn: 2.597 ± 0.532
4.329SerPro: 4.329 ± 1.663
4.762SerGln: 4.762 ± 1.847
4.762SerArg: 4.762 ± 1.082
3.463SerSer: 3.463 ± 0.963
6.061SerThr: 6.061 ± 1.744
3.896SerVal: 3.896 ± 1.54
1.299SerTrp: 1.299 ± 0.84
0.866SerTyr: 0.866 ± 0.5
0.0SerXaa: 0.0 ± 0.0
Thr
2.597ThrAla: 2.597 ± 1.047
1.732ThrCys: 1.732 ± 0.802
4.762ThrAsp: 4.762 ± 0.9
5.628ThrGlu: 5.628 ± 1.524
3.463ThrPhe: 3.463 ± 1.321
7.359ThrGly: 7.359 ± 2.321
0.866ThrHis: 0.866 ± 0.5
4.329ThrIle: 4.329 ± 1.386
2.165ThrLys: 2.165 ± 0.779
5.195ThrLeu: 5.195 ± 1.654
1.299ThrMet: 1.299 ± 0.706
3.03ThrAsn: 3.03 ± 1.008
8.225ThrPro: 8.225 ± 2.046
2.597ThrGln: 2.597 ± 1.068
6.926ThrArg: 6.926 ± 0.626
4.329ThrSer: 4.329 ± 1.868
6.061ThrThr: 6.061 ± 1.682
3.03ThrVal: 3.03 ± 1.452
0.433ThrTrp: 0.433 ± 0.418
1.732ThrTyr: 1.732 ± 0.628
0.0ThrXaa: 0.0 ± 0.0
Val
3.463ValAla: 3.463 ± 2.356
1.299ValCys: 1.299 ± 0.462
4.762ValAsp: 4.762 ± 1.023
4.329ValGlu: 4.329 ± 0.902
1.732ValPhe: 1.732 ± 1.122
2.597ValGly: 2.597 ± 1.178
3.463ValHis: 3.463 ± 1.146
1.732ValIle: 1.732 ± 0.661
3.463ValLys: 3.463 ± 0.969
1.732ValLeu: 1.732 ± 0.718
0.433ValMet: 0.433 ± 0.418
4.329ValAsn: 4.329 ± 1.324
2.165ValPro: 2.165 ± 0.6
2.165ValGln: 2.165 ± 1.136
5.628ValArg: 5.628 ± 2.1
8.225ValSer: 8.225 ± 1.988
5.628ValThr: 5.628 ± 0.582
4.762ValVal: 4.762 ± 1.949
0.866ValTrp: 0.866 ± 0.797
2.165ValTyr: 2.165 ± 0.97
0.0ValXaa: 0.0 ± 0.0
Trp
0.866TrpAla: 0.866 ± 0.42
0.0TrpCys: 0.0 ± 0.0
0.866TrpAsp: 0.866 ± 0.42
0.866TrpGlu: 0.866 ± 0.484
0.433TrpPhe: 0.433 ± 0.334
1.732TrpGly: 1.732 ± 0.755
0.433TrpHis: 0.433 ± 0.398
0.866TrpIle: 0.866 ± 0.668
0.433TrpLys: 0.433 ± 0.334
1.732TrpLeu: 1.732 ± 0.939
0.0TrpMet: 0.0 ± 0.0
0.866TrpAsn: 0.866 ± 0.835
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.866TrpArg: 0.866 ± 0.484
0.433TrpSer: 0.433 ± 0.334
0.866TrpThr: 0.866 ± 0.484
0.866TrpVal: 0.866 ± 0.797
0.0TrpTrp: 0.0 ± 0.0
0.433TrpTyr: 0.433 ± 0.688
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.732TyrAla: 1.732 ± 0.651
0.433TyrCys: 0.433 ± 0.334
2.597TyrAsp: 2.597 ± 1.184
2.597TyrGlu: 2.597 ± 0.927
2.597TyrPhe: 2.597 ± 0.633
2.165TyrGly: 2.165 ± 0.621
0.866TyrHis: 0.866 ± 0.758
1.732TyrIle: 1.732 ± 0.716
2.165TyrLys: 2.165 ± 1.081
3.463TyrLeu: 3.463 ± 0.64
0.433TyrMet: 0.433 ± 0.406
0.866TyrAsn: 0.866 ± 0.421
1.732TyrPro: 1.732 ± 0.725
1.299TyrGln: 1.299 ± 0.773
2.165TyrArg: 2.165 ± 0.866
1.732TyrSer: 1.732 ± 0.915
1.299TyrThr: 1.299 ± 0.471
2.597TyrVal: 2.597 ± 1.139
0.433TyrTrp: 0.433 ± 0.398
1.299TyrTyr: 1.299 ± 0.84
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2311 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski