Amino acid dipeptide frequency for Eidolon helvum papillomavirus 2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random with equal probability, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
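As a rough illustration of how such frequencies can be obtained, the Python sketch below counts overlapping residue pairs in a list of protein sequences and reports them in per mille. The function names, the pooling across proteins, and the normalisation by the total number of counted dipeptides are assumptions made for this example; the exact procedure used to build the table is not described on this page.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"            # X stands for Xaa (non-standard or unknown)

# Expected frequency if all 400 standard dipeptides were equally likely.
UNIFORM_PERMILLE = 1000.0 / 400      # 2.5 per mille, i.e. 0.25 %


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille}, pooled over all proteins.

    Assumption: overlapping windows of length 2 are counted within each
    protein (never across protein boundaries) and divided by the total
    number of windows.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(ALPHABET, repeat=2)}


def label(freq_permille, expected=UNIFORM_PERMILLE):
    """Classify a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"
```

With this convention, a table value such as 3.692‰ would be labelled "over" and 1.231‰ "under", which mirrors the red and blue marking described above.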

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
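The unit conversion can be written out as a small Python sketch; the helper names are hypothetical and only serve to make the arithmetic explicit.

```python
def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per-mille value to percent (1 % = 10 per mille)."""
    return value_permille / 10.0


# Example with a value of 3.692 per mille:
fraction = permille_to_fraction(3.692)   # about 0.003692
percent = permille_to_percent(3.692)     # about 0.3692 %
```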

For more information, see the sequence space article on Wikipedia.

Dipeptide frequency ± error, in ‰. Rows give the first residue of the dipeptide, columns the second.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 3.692 ± 0.743 | 1.231 ± 0.852 | 4.102 ± 0.603 | 6.153 ± 1.423 | 2.051 ± 0.97 | 5.332 ± 0.987 | 1.231 ± 0.549 | 2.461 ± 0.759 | 2.871 ± 1.59 | 6.973 ± 1.446 | 1.231 ± 1.113 | 0.82 ± 0.405 | 3.692 ± 1.885 | 4.102 ± 1.903 | 3.692 ± 1.29 | 4.102 ± 1.61 | 2.871 ± 1.147 | 4.922 ± 1.135 | 1.231 ± 1.113 | 2.871 ± 1.361 | 0.0 ± 0.0
Cys | 1.231 ± 0.771 | 0.82 ± 0.929 | 0.41 ± 0.371 | 0.82 ± 0.405 | 0.0 ± 0.0 | 1.641 ± 1.136 | 0.0 ± 0.0 | 0.82 ± 0.742 | 1.641 ± 0.637 | 0.82 ± 0.501 | 2.051 ± 0.952 | 0.82 ± 0.596 | 1.641 ± 0.506 | 0.41 ± 0.379 | 0.82 ± 0.665 | 1.641 ± 0.685 | 1.231 ± 0.498 | 2.871 ± 2.083 | 0.41 ± 0.388 | 1.231 ± 0.581 | 0.0 ± 0.0
Asp | 4.102 ± 1.255 | 1.231 ± 0.453 | 2.871 ± 0.765 | 4.102 ± 1.848 | 1.231 ± 0.262 | 4.102 ± 0.848 | 1.231 ± 1.092 | 4.512 ± 1.542 | 4.512 ± 2.007 | 4.922 ± 1.246 | 0.41 ± 0.379 | 2.051 ± 0.814 | 5.332 ± 1.844 | 1.231 ± 0.749 | 0.41 ± 0.465 | 5.332 ± 1.715 | 5.332 ± 1.28 | 6.563 ± 3.13 | 1.641 ± 1.008 | 0.82 ± 0.377 | 0.0 ± 0.0
Glu | 5.332 ± 1.316 | 0.0 ± 0.0 | 6.153 ± 2.209 | 6.973 ± 1.429 | 2.461 ± 0.623 | 6.973 ± 2.375 | 0.82 ± 0.429 | 1.641 ± 0.506 | 1.231 ± 0.498 | 4.102 ± 1.031 | 0.82 ± 0.482 | 1.231 ± 0.453 | 4.512 ± 1.896 | 3.281 ± 1.122 | 5.332 ± 1.993 | 4.102 ± 1.184 | 5.742 ± 1.731 | 3.281 ± 1.099 | 0.0 ± 0.0 | 2.051 ± 0.889 | 0.0 ± 0.0
Phe | 2.461 ± 0.542 | 1.231 ± 0.749 | 1.641 ± 0.685 | 0.82 ± 0.49 | 2.461 ± 0.542 | 2.871 ± 1.363 | 0.0 ± 0.0 | 1.641 ± 0.785 | 1.231 ± 0.682 | 2.461 ± 0.454 | 0.41 ± 0.388 | 0.82 ± 0.49 | 1.231 ± 0.262 | 1.231 ± 0.682 | 2.461 ± 1.138 | 2.051 ± 0.595 | 4.102 ± 1.225 | 3.692 ± 1.136 | 1.231 ± 0.702 | 1.231 ± 0.791 | 0.0 ± 0.0
Gly | 3.692 ± 1.253 | 1.641 ± 0.874 | 2.051 ± 0.577 | 4.512 ± 1.359 | 2.461 ± 1.155 | 8.203 ± 3.514 | 0.82 ± 0.377 | 4.102 ± 0.797 | 0.82 ± 0.501 | 5.742 ± 0.697 | 1.231 ± 0.673 | 4.512 ± 1.231 | 6.563 ± 2.155 | 4.512 ± 2.483 | 6.973 ± 1.978 | 7.383 ± 2.72 | 6.563 ± 0.925 | 4.102 ± 0.885 | 0.41 ± 0.364 | 0.82 ± 0.478 | 0.0 ± 0.0
His | 0.41 ± 0.371 | 1.231 ± 0.581 | 1.231 ± 0.631 | 0.41 ± 0.371 | 0.41 ± 0.371 | 1.641 ± 0.686 | 0.41 ± 0.364 | 0.41 ± 0.379 | 0.41 ± 0.371 | 2.051 ± 0.935 | 0.82 ± 0.742 | 0.0 ± 0.0 | 1.231 ± 0.578 | 1.231 ± 0.639 | 2.461 ± 0.781 | 1.641 ± 1.059 | 0.82 ± 0.377 | 1.641 ± 1.034 | 0.41 ± 0.371 | 1.231 ± 0.262 | 0.0 ± 0.0
Ile | 2.051 ± 1.047 | 1.231 ± 0.791 | 3.281 ± 2.069 | 3.281 ± 0.649 | 0.41 ± 0.465 | 3.281 ± 1.1 | 2.051 ± 0.59 | 0.82 ± 0.377 | 1.641 ± 0.685 | 3.692 ± 0.52 | 0.0 ± 0.0 | 1.231 ± 0.511 | 4.512 ± 1.634 | 0.82 ± 0.554 | 1.231 ± 0.262 | 3.692 ± 2.326 | 3.692 ± 1.067 | 2.051 ± 0.771 | 0.0 ± 0.0 | 1.641 ± 0.764 | 0.0 ± 0.0
Lys | 1.641 ± 0.776 | 2.051 ± 0.77 | 2.461 ± 1.457 | 2.051 ± 0.941 | 0.82 ± 0.405 | 1.641 ± 1.07 | 2.051 ± 1.374 | 0.41 ± 0.388 | 2.461 ± 1.246 | 2.051 ± 0.78 | 0.41 ± 0.388 | 1.641 ± 1.07 | 0.82 ± 0.405 | 2.051 ± 0.521 | 4.512 ± 0.796 | 2.871 ± 1.356 | 2.461 ± 0.979 | 2.051 ± 0.577 | 0.0 ± 0.0 | 2.051 ± 0.506 | 0.0 ± 0.0
Leu | 4.922 ± 1.682 | 2.871 ± 1.58 | 5.742 ± 0.632 | 5.332 ± 1.734 | 4.102 ± 0.615 | 4.102 ± 0.586 | 2.461 ± 0.591 | 2.051 ± 0.891 | 4.922 ± 1.487 | 6.563 ± 1.478 | 1.641 ± 0.954 | 3.692 ± 1.8 | 6.563 ± 0.919 | 4.102 ± 1.363 | 4.102 ± 1.344 | 4.102 ± 0.586 | 4.102 ± 0.673 | 3.692 ± 1.283 | 0.82 ± 0.759 | 2.461 ± 0.64 | 0.0 ± 0.0
Met | 2.871 ± 1.248 | 0.41 ± 0.426 | 0.82 ± 0.405 | 0.41 ± 0.364 | 0.82 ± 0.429 | 0.82 ± 0.777 | 0.0 ± 0.0 | 0.41 ± 0.371 | 0.0 ± 0.0 | 2.051 ± 0.946 | 0.0 ± 0.0 | 0.82 ± 0.554 | 1.641 ± 1.148 | 1.641 ± 1.019 | 2.461 ± 0.697 | 0.41 ± 0.371 | 1.641 ± 0.615 | 2.461 ± 1.097 | 0.41 ± 0.388 | 0.82 ± 0.408 | 0.0 ± 0.0
Asn | 3.281 ± 1.547 | 1.231 ± 0.42 | 0.82 ± 0.581 | 1.231 ± 0.524 | 2.461 ± 1.28 | 0.82 ± 0.405 | 0.82 ± 0.581 | 2.461 ± 0.587 | 1.641 ± 0.506 | 2.461 ± 1.042 | 0.41 ± 0.379 | 1.641 ± 1.554 | 3.281 ± 1.135 | 0.82 ± 0.777 | 2.051 ± 0.778 | 1.231 ± 0.702 | 3.692 ± 1.001 | 0.82 ± 0.777 | 0.0 ± 0.0 | 0.82 ± 0.377 | 0.0 ± 0.0
Pro | 6.563 ± 1.573 | 2.051 ± 0.891 | 5.332 ± 1.006 | 2.871 ± 0.834 | 2.051 ± 1.032 | 3.692 ± 0.99 | 0.41 ± 0.379 | 0.41 ± 0.379 | 2.461 ± 1.553 | 6.563 ± 1.98 | 1.231 ± 0.549 | 2.461 ± 1.403 | 4.512 ± 1.507 | 2.871 ± 1.346 | 3.692 ± 1.099 | 3.281 ± 1.652 | 5.742 ± 2.312 | 7.793 ± 2.571 | 1.231 ± 0.703 | 2.461 ± 0.795 | 0.0 ± 0.0
Gln | 3.281 ± 1.561 | 0.41 ± 0.426 | 2.871 ± 1.457 | 3.692 ± 1.104 | 0.82 ± 0.429 | 3.281 ± 1.145 | 1.641 ± 1.484 | 2.871 ± 0.431 | 1.231 ± 0.262 | 3.281 ± 1.563 | 0.82 ± 0.493 | 0.41 ± 0.379 | 2.461 ± 0.839 | 2.051 ± 0.454 | 3.692 ± 1.401 | 1.231 ± 0.855 | 2.871 ± 1.279 | 1.641 ± 0.817 | 1.231 ± 0.714 | 2.461 ± 1.054 | 0.0 ± 0.0
Arg | 4.102 ± 1.471 | 0.82 ± 0.405 | 4.102 ± 1.415 | 4.102 ± 1.189 | 3.281 ± 0.846 | 6.973 ± 2.742 | 2.461 ± 0.865 | 2.051 ± 0.577 | 4.102 ± 0.921 | 6.973 ± 2.189 | 1.231 ± 0.558 | 2.461 ± 0.881 | 4.102 ± 1.296 | 1.231 ± 0.682 | 8.203 ± 2.366 | 4.922 ± 1.427 | 3.281 ± 0.857 | 5.332 ± 2.819 | 0.82 ± 0.759 | 2.871 ± 1.309 | 0.0 ± 0.0
Ser | 4.102 ± 0.908 | 0.82 ± 0.665 | 4.922 ± 1.777 | 4.922 ± 2.34 | 3.692 ± 1.423 | 6.153 ± 2.332 | 0.82 ± 0.405 | 4.102 ± 1.813 | 2.051 ± 0.595 | 2.461 ± 1.036 | 2.871 ± 0.759 | 1.641 ± 0.747 | 2.051 ± 1.047 | 2.051 ± 0.686 | 5.332 ± 1.282 | 3.692 ± 1.007 | 8.614 ± 1.26 | 4.102 ± 0.724 | 0.41 ± 0.364 | 0.82 ± 0.742 | 0.0 ± 0.0
Thr | 3.692 ± 2.402 | 1.231 ± 0.459 | 4.922 ± 2.139 | 6.563 ± 0.664 | 2.461 ± 0.527 | 5.742 ± 2.36 | 0.41 ± 0.371 | 3.692 ± 1.515 | 0.41 ± 0.388 | 4.922 ± 1.251 | 2.461 ± 0.723 | 3.692 ± 0.456 | 6.973 ± 2.37 | 3.692 ± 0.886 | 4.922 ± 1.766 | 5.742 ± 0.718 | 6.563 ± 1.291 | 5.742 ± 1.771 | 0.41 ± 0.379 | 0.82 ± 0.377 | 0.0 ± 0.0
Val | 5.742 ± 1.514 | 0.82 ± 0.5 | 4.922 ± 2.114 | 6.153 ± 1.785 | 1.641 ± 0.27 | 7.793 ± 2.577 | 1.641 ± 0.686 | 3.692 ± 1.023 | 1.641 ± 1.102 | 5.332 ± 1.481 | 0.82 ± 0.405 | 0.41 ± 0.371 | 3.281 ± 1.136 | 4.512 ± 0.704 | 7.383 ± 1.544 | 5.742 ± 0.942 | 2.871 ± 1.255 | 5.332 ± 1.984 | 0.41 ± 0.388 | 2.871 ± 0.996 | 0.0 ± 0.0
Trp | 1.641 ± 0.615 | 0.0 ± 0.0 | 0.41 ± 0.388 | 0.82 ± 0.554 | 0.0 ± 0.0 | 0.82 ± 0.429 | 0.0 ± 0.0 | 1.231 ± 0.673 | 1.231 ± 0.682 | 2.461 ± 1.225 | 0.41 ± 0.379 | 0.0 ± 0.0 | 0.82 ± 0.728 | 0.41 ± 0.388 | 0.41 ± 0.388 | 0.0 ± 0.0 | 1.231 ± 0.733 | 1.231 ± 0.714 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Tyr | 0.82 ± 0.501 | 0.0 ± 0.0 | 2.871 ± 0.72 | 1.231 ± 0.953 | 1.641 ± 0.785 | 1.231 ± 0.548 | 0.82 ± 0.377 | 0.82 ± 0.429 | 0.0 ± 0.0 | 2.461 ± 0.715 | 1.231 ± 0.695 | 1.641 ± 0.764 | 2.461 ± 0.76 | 0.0 ± 0.0 | 3.281 ± 0.351 | 2.461 ± 0.409 | 1.641 ± 0.27 | 3.692 ± 1.376 | 2.051 ± 0.907 | 3.692 ± 1.539 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 6 proteins (2439 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
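A protein-level bootstrap of this kind can be sketched in Python as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the error is taken as the standard deviation over the replicates. The function names, the fixed seed, and the choice of the standard deviation as the error measure are assumptions, not details given on this page.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins, dipeptide):
    """Frequency of one dipeptide in per mille, pooled over the given proteins."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2 within each protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Standard deviation of the frequency over n_boot protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    replicates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(dipeptide_permille(sample, dipeptide))
    return statistics.pstdev(replicates)
```

A dipeptide that never occurs in any protein gets a frequency of 0 in every replicate, so its estimated error is 0 as well, consistent with the 0.0 ± 0.0 entries in the table.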

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski