Amino acid dipeptide frequency for Erethizon dorsatum papillomavirus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
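As a minimal sketch of how such frequencies can be computed (the function name and the toy sequences are illustrative assumptions, not the Proteome-pI implementation), the following Python counts overlapping dipeptides within each protein and compares them with the uniform expectation of 1/400:

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all given sequences.

    Dipeptides are counted as overlapping pairs within each protein;
    pairs are never formed across protein boundaries (an assumption).
    Any non-standard letter is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Uniform expectation: 1 / 400 = 0.25 % = 2.5 per mille
EXPECTED_PER_MILLE = 1000.0 / len(STANDARD) ** 2

toy = ["MAAKL", "AAGX"]  # hypothetical sequences, not from this proteome
freqs = dipeptide_per_mille(toy)
# Dipeptides that are more frequent than the uniform expectation
over = sorted(p for p, f in freqs.items() if f > EXPECTED_PER_MILLE)
```

With only a handful of residues every observed dipeptide exceeds the uniform expectation, so the comparison becomes meaningful only for whole proteomes.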

All values are given in per mille (‰) and must therefore be multiplied by 10⁻³ to obtain fractions.
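For instance, the AlaAla entry of 4.555‰ corresponds to a fraction of 0.004555. A small Python sketch of the conversion, with a hypothetical helper name, also shows how an entry can be compared with the uniform expectation of 2.5‰:

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


ala_ala = per_mille_to_fraction(4.555)  # AlaAla value from the table
uniform = per_mille_to_fraction(2.5)    # uniform expectation, 1/400
ratio = ala_ala / uniform               # observed / expected, above 1 here
```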

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
4.555AlaAla: 4.555 ± 1.036
0.828AlaCys: 0.828 ± 0.636
3.727AlaAsp: 3.727 ± 1.385
4.969AlaGlu: 4.969 ± 1.485
4.141AlaPhe: 4.141 ± 1.195
2.899AlaGly: 2.899 ± 0.797
0.828AlaHis: 0.828 ± 0.609
1.656AlaIle: 1.656 ± 0.656
2.07AlaLys: 2.07 ± 0.87
3.727AlaLeu: 3.727 ± 1.724
0.414AlaMet: 0.414 ± 0.354
1.242AlaAsn: 1.242 ± 0.651
4.555AlaPro: 4.555 ± 1.823
3.313AlaGln: 3.313 ± 1.012
6.625AlaArg: 6.625 ± 1.918
4.141AlaSer: 4.141 ± 0.765
3.727AlaThr: 3.727 ± 0.913
3.727AlaVal: 3.727 ± 1.258
0.414AlaTrp: 0.414 ± 0.357
1.656AlaTyr: 1.656 ± 0.807
0.0AlaXaa: 0.0 ± 0.0
Cys
1.242CysAla: 1.242 ± 0.576
0.414CysCys: 0.414 ± 0.354
0.828CysAsp: 0.828 ± 0.636
0.828CysGlu: 0.828 ± 0.459
0.414CysPhe: 0.414 ± 0.55
1.242CysGly: 1.242 ± 0.611
0.0CysHis: 0.0 ± 0.0
0.414CysIle: 0.414 ± 0.55
2.899CysLys: 2.899 ± 0.989
2.484CysLeu: 2.484 ± 2.049
0.414CysMet: 0.414 ± 0.527
1.242CysAsn: 1.242 ± 0.87
1.656CysPro: 1.656 ± 0.656
0.0CysGln: 0.0 ± 0.0
1.656CysArg: 1.656 ± 0.639
2.899CysSer: 2.899 ± 2.027
2.899CysThr: 2.899 ± 2.439
1.242CysVal: 1.242 ± 0.968
0.414CysTrp: 0.414 ± 0.357
0.828CysTyr: 0.828 ± 0.439
0.0CysXaa: 0.0 ± 0.0
Asp
4.141AspAla: 4.141 ± 1.511
1.656AspCys: 1.656 ± 1.166
2.484AspAsp: 2.484 ± 0.886
3.727AspGlu: 3.727 ± 0.978
1.242AspPhe: 1.242 ± 0.697
2.484AspGly: 2.484 ± 0.435
2.07AspHis: 2.07 ± 0.617
7.867AspIle: 7.867 ± 2.295
2.484AspLys: 2.484 ± 0.886
6.211AspLeu: 6.211 ± 1.375
1.656AspMet: 1.656 ± 1.015
2.07AspAsn: 2.07 ± 0.74
5.797AspPro: 5.797 ± 1.113
2.07AspGln: 2.07 ± 0.719
3.727AspArg: 3.727 ± 0.956
2.484AspSer: 2.484 ± 0.615
4.555AspThr: 4.555 ± 1.529
4.555AspVal: 4.555 ± 1.383
0.414AspTrp: 0.414 ± 0.395
1.656AspTyr: 1.656 ± 0.306
0.0AspXaa: 0.0 ± 0.0
Glu
2.899GluAla: 2.899 ± 1.228
0.828GluCys: 0.828 ± 0.439
7.039GluAsp: 7.039 ± 2.234
8.282GluGlu: 8.282 ± 2.61
2.484GluPhe: 2.484 ± 1.534
4.141GluGly: 4.141 ± 1.744
2.07GluHis: 2.07 ± 0.469
0.828GluIle: 0.828 ± 0.376
3.313GluLys: 3.313 ± 1.459
3.727GluLeu: 3.727 ± 1.611
1.242GluMet: 1.242 ± 0.684
3.727GluAsn: 3.727 ± 0.735
1.242GluPro: 1.242 ± 0.637
2.484GluGln: 2.484 ± 0.761
3.727GluArg: 3.727 ± 1.068
2.484GluSer: 2.484 ± 0.673
6.211GluThr: 6.211 ± 1.665
2.899GluVal: 2.899 ± 1.463
0.414GluTrp: 0.414 ± 0.354
1.242GluTyr: 1.242 ± 0.467
0.0GluXaa: 0.0 ± 0.0
Phe
1.242PheAla: 1.242 ± 0.612
0.414PheCys: 0.414 ± 0.563
1.656PheAsp: 1.656 ± 0.587
2.484PheGlu: 2.484 ± 0.743
1.656PhePhe: 1.656 ± 0.656
2.484PheGly: 2.484 ± 1.351
0.414PheHis: 0.414 ± 0.395
0.828PheIle: 0.828 ± 0.441
2.07PheLys: 2.07 ± 1.049
4.141PheLeu: 4.141 ± 1.208
1.242PheMet: 1.242 ± 0.978
4.141PheAsn: 4.141 ± 1.763
1.242PhePro: 1.242 ± 0.494
2.899PheGln: 2.899 ± 1.18
1.656PheArg: 1.656 ± 0.7
1.242PheSer: 1.242 ± 0.494
1.242PheThr: 1.242 ± 0.417
3.313PheVal: 3.313 ± 1.113
2.07PheTrp: 2.07 ± 1.354
2.07PheTyr: 2.07 ± 0.84
0.0PheXaa: 0.0 ± 0.0
Gly
2.07GlyAla: 2.07 ± 0.615
2.484GlyCys: 2.484 ± 0.927
4.969GlyAsp: 4.969 ± 1.2
5.383GlyGlu: 5.383 ± 0.794
0.414GlyPhe: 0.414 ± 0.321
6.625GlyGly: 6.625 ± 2.006
2.484GlyHis: 2.484 ± 1.251
4.969GlyIle: 4.969 ± 0.627
2.07GlyLys: 2.07 ± 0.965
4.555GlyLeu: 4.555 ± 1.567
0.414GlyMet: 0.414 ± 0.354
1.242GlyAsn: 1.242 ± 0.398
3.313GlyPro: 3.313 ± 1.418
2.899GlyGln: 2.899 ± 1.101
6.211GlyArg: 6.211 ± 2.2
7.039GlySer: 7.039 ± 1.906
6.211GlyThr: 6.211 ± 0.821
4.141GlyVal: 4.141 ± 1.257
0.0GlyTrp: 0.0 ± 0.0
0.414GlyTyr: 0.414 ± 0.55
0.0GlyXaa: 0.0 ± 0.0
His
1.242HisAla: 1.242 ± 0.417
0.828HisCys: 0.828 ± 0.586
1.242HisAsp: 1.242 ± 1.119
0.828HisGlu: 0.828 ± 0.439
1.656HisPhe: 1.656 ± 0.597
1.656HisGly: 1.656 ± 0.727
1.656HisHis: 1.656 ± 1.076
1.656HisIle: 1.656 ± 1.251
0.828HisLys: 0.828 ± 0.439
1.656HisLeu: 1.656 ± 1.015
0.828HisMet: 0.828 ± 0.586
1.242HisAsn: 1.242 ± 0.789
2.07HisPro: 2.07 ± 0.949
0.414HisGln: 0.414 ± 0.395
2.899HisArg: 2.899 ± 0.985
1.242HisSer: 1.242 ± 0.669
2.484HisThr: 2.484 ± 0.435
1.242HisVal: 1.242 ± 0.386
0.414HisTrp: 0.414 ± 0.357
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
2.899IleAla: 2.899 ± 0.99
0.828IleCys: 0.828 ± 0.551
3.313IleAsp: 3.313 ± 1.139
4.141IleGlu: 4.141 ± 1.604
0.414IlePhe: 0.414 ± 0.357
4.969IleGly: 4.969 ± 1.692
1.242IleHis: 1.242 ± 0.398
2.899IleIle: 2.899 ± 0.988
2.07IleLys: 2.07 ± 1.028
3.727IleLeu: 3.727 ± 1.379
1.242IleMet: 1.242 ± 0.908
1.242IleAsn: 1.242 ± 0.779
2.07IlePro: 2.07 ± 0.815
2.484IleGln: 2.484 ± 0.882
1.242IleArg: 1.242 ± 0.651
3.727IleSer: 3.727 ± 0.662
3.313IleThr: 3.313 ± 1.354
4.141IleVal: 4.141 ± 1.537
0.0IleTrp: 0.0 ± 0.0
1.656IleTyr: 1.656 ± 1.187
0.0IleXaa: 0.0 ± 0.0
Lys
2.07LysAla: 2.07 ± 0.931
1.656LysCys: 1.656 ± 0.629
2.07LysAsp: 2.07 ± 1.073
1.242LysGlu: 1.242 ± 1.0
1.656LysPhe: 1.656 ± 0.845
1.656LysGly: 1.656 ± 0.918
1.656LysHis: 1.656 ± 0.806
2.484LysIle: 2.484 ± 1.119
3.727LysLys: 3.727 ± 1.605
2.484LysLeu: 2.484 ± 1.686
1.242LysMet: 1.242 ± 0.576
1.242LysAsn: 1.242 ± 0.693
2.07LysPro: 2.07 ± 1.142
0.828LysGln: 0.828 ± 0.422
4.141LysArg: 4.141 ± 0.895
2.484LysSer: 2.484 ± 1.386
1.242LysThr: 1.242 ± 0.417
3.313LysVal: 3.313 ± 1.434
0.414LysTrp: 0.414 ± 0.321
3.727LysTyr: 3.727 ± 1.582
0.0LysXaa: 0.0 ± 0.0
Leu
3.727LeuAla: 3.727 ± 1.111
2.899LeuCys: 2.899 ± 1.872
6.211LeuAsp: 6.211 ± 1.043
7.453LeuGlu: 7.453 ± 1.362
5.383LeuPhe: 5.383 ± 1.437
4.555LeuGly: 4.555 ± 1.231
1.242LeuHis: 1.242 ± 0.655
2.484LeuIle: 2.484 ± 0.903
3.727LeuLys: 3.727 ± 1.415
9.524LeuLeu: 9.524 ± 4.061
2.899LeuMet: 2.899 ± 1.351
1.656LeuAsn: 1.656 ± 1.113
5.797LeuPro: 5.797 ± 2.355
6.211LeuGln: 6.211 ± 1.298
7.453LeuArg: 7.453 ± 1.641
6.211LeuSer: 6.211 ± 0.941
4.555LeuThr: 4.555 ± 0.89
2.484LeuVal: 2.484 ± 1.361
1.242LeuTrp: 1.242 ± 0.729
4.141LeuTyr: 4.141 ± 0.914
0.0LeuXaa: 0.0 ± 0.0
Met
2.07MetAla: 2.07 ± 1.035
0.414MetCys: 0.414 ± 0.563
1.242MetAsp: 1.242 ± 0.398
1.656MetGlu: 1.656 ± 0.591
1.656MetPhe: 1.656 ± 0.7
0.828MetGly: 0.828 ± 0.599
0.414MetHis: 0.414 ± 0.55
0.828MetIle: 0.828 ± 0.707
0.0MetLys: 0.0 ± 0.0
0.828MetLeu: 0.828 ± 0.636
0.828MetMet: 0.828 ± 0.599
0.414MetAsn: 0.414 ± 0.354
0.414MetPro: 0.414 ± 0.527
0.414MetGln: 0.414 ± 0.395
1.656MetArg: 1.656 ± 1.009
1.242MetSer: 1.242 ± 0.693
0.0MetThr: 0.0 ± 0.0
2.07MetVal: 2.07 ± 0.712
0.414MetTrp: 0.414 ± 0.395
0.828MetTyr: 0.828 ± 0.707
0.0MetXaa: 0.0 ± 0.0
Asn
2.899AsnAla: 2.899 ± 1.508
0.0AsnCys: 0.0 ± 0.0
2.899AsnAsp: 2.899 ± 1.213
2.07AsnGlu: 2.07 ± 0.647
1.656AsnPhe: 1.656 ± 1.024
2.07AsnGly: 2.07 ± 0.667
0.0AsnHis: 0.0 ± 0.0
0.828AsnIle: 0.828 ± 0.446
1.656AsnLys: 1.656 ± 0.61
1.656AsnLeu: 1.656 ± 0.845
0.0AsnMet: 0.0 ± 0.0
2.484AsnAsn: 2.484 ± 1.012
2.899AsnPro: 2.899 ± 0.824
0.828AsnGln: 0.828 ± 0.636
2.899AsnArg: 2.899 ± 1.756
1.242AsnSer: 1.242 ± 0.757
2.899AsnThr: 2.899 ± 1.043
2.899AsnVal: 2.899 ± 1.275
0.828AsnTrp: 0.828 ± 0.439
1.242AsnTyr: 1.242 ± 0.398
0.0AsnXaa: 0.0 ± 0.0
Pro
5.797ProAla: 5.797 ± 1.667
0.414ProCys: 0.414 ± 0.55
4.555ProAsp: 4.555 ± 1.43
1.242ProGlu: 1.242 ± 0.467
2.484ProPhe: 2.484 ± 0.807
2.899ProGly: 2.899 ± 0.943
1.242ProHis: 1.242 ± 0.576
2.07ProIle: 2.07 ± 1.194
2.484ProLys: 2.484 ± 1.385
7.453ProLeu: 7.453 ± 1.436
0.0ProMet: 0.0 ± 0.0
2.484ProAsn: 2.484 ± 0.926
8.282ProPro: 8.282 ± 2.867
3.727ProGln: 3.727 ± 1.448
3.727ProArg: 3.727 ± 1.783
4.555ProSer: 4.555 ± 2.092
5.797ProThr: 5.797 ± 2.385
3.313ProVal: 3.313 ± 0.736
1.242ProTrp: 1.242 ± 0.467
3.313ProTyr: 3.313 ± 1.706
0.0ProXaa: 0.0 ± 0.0
Gln
3.313GlnAla: 3.313 ± 0.736
1.242GlnCys: 1.242 ± 1.061
2.07GlnAsp: 2.07 ± 0.818
2.899GlnGlu: 2.899 ± 1.93
1.656GlnPhe: 1.656 ± 0.557
3.727GlnGly: 3.727 ± 1.177
1.656GlnHis: 1.656 ± 1.468
0.414GlnIle: 0.414 ± 0.354
1.242GlnLys: 1.242 ± 0.692
6.625GlnLeu: 6.625 ± 1.078
0.414GlnMet: 0.414 ± 0.354
0.828GlnAsn: 0.828 ± 0.441
2.07GlnPro: 2.07 ± 0.859
2.899GlnGln: 2.899 ± 1.249
4.141GlnArg: 4.141 ± 1.52
0.828GlnSer: 0.828 ± 0.714
2.899GlnThr: 2.899 ± 0.784
2.07GlnVal: 2.07 ± 1.098
0.828GlnTrp: 0.828 ± 0.707
2.484GlnTyr: 2.484 ± 0.508
0.0GlnXaa: 0.0 ± 0.0
Arg
4.555ArgAla: 4.555 ± 1.269
3.313ArgCys: 3.313 ± 1.173
2.07ArgAsp: 2.07 ± 0.736
2.484ArgGlu: 2.484 ± 0.752
4.141ArgPhe: 4.141 ± 1.464
5.797ArgGly: 5.797 ± 2.017
3.313ArgHis: 3.313 ± 1.125
2.484ArgIle: 2.484 ± 0.968
3.727ArgLys: 3.727 ± 0.708
6.211ArgLeu: 6.211 ± 0.777
1.242ArgMet: 1.242 ± 0.564
1.656ArgAsn: 1.656 ± 0.809
7.867ArgPro: 7.867 ± 2.74
2.899ArgGln: 2.899 ± 1.214
6.625ArgArg: 6.625 ± 1.959
7.867ArgSer: 7.867 ± 2.522
3.727ArgThr: 3.727 ± 1.006
1.656ArgVal: 1.656 ± 0.557
0.0ArgTrp: 0.0 ± 0.0
1.242ArgTyr: 1.242 ± 0.417
0.0ArgXaa: 0.0 ± 0.0
Ser
4.555SerAla: 4.555 ± 1.591
0.414SerCys: 0.414 ± 0.527
4.555SerAsp: 4.555 ± 0.606
4.141SerGlu: 4.141 ± 1.268
3.727SerPhe: 3.727 ± 1.549
4.555SerGly: 4.555 ± 0.534
1.656SerHis: 1.656 ± 0.577
7.039SerIle: 7.039 ± 1.895
1.242SerLys: 1.242 ± 0.417
8.282SerLeu: 8.282 ± 0.995
0.0SerMet: 0.0 ± 0.0
0.828SerAsn: 0.828 ± 0.439
3.313SerPro: 3.313 ± 1.388
2.07SerGln: 2.07 ± 0.598
3.313SerArg: 3.313 ± 1.488
5.797SerSer: 5.797 ± 0.835
5.797SerThr: 5.797 ± 1.772
2.899SerVal: 2.899 ± 1.35
0.828SerTrp: 0.828 ± 0.636
1.656SerTyr: 1.656 ± 0.984
0.0SerXaa: 0.0 ± 0.0
Thr
3.727ThrAla: 3.727 ± 0.919
2.07ThrCys: 2.07 ± 1.298
5.797ThrAsp: 5.797 ± 1.86
4.555ThrGlu: 4.555 ± 1.412
1.656ThrPhe: 1.656 ± 1.152
6.211ThrGly: 6.211 ± 1.165
0.828ThrHis: 0.828 ± 0.503
1.242ThrIle: 1.242 ± 0.386
2.484ThrLys: 2.484 ± 1.012
6.211ThrLeu: 6.211 ± 2.043
1.656ThrMet: 1.656 ± 0.622
2.07ThrAsn: 2.07 ± 0.74
7.867ThrPro: 7.867 ± 1.721
3.313ThrGln: 3.313 ± 0.747
4.141ThrArg: 4.141 ± 0.933
4.555ThrSer: 4.555 ± 1.728
5.383ThrThr: 5.383 ± 1.076
5.797ThrVal: 5.797 ± 2.534
0.0ThrTrp: 0.0 ± 0.0
1.656ThrTyr: 1.656 ± 1.125
0.0ThrXaa: 0.0 ± 0.0
Val
2.484ValAla: 2.484 ± 0.969
2.07ValCys: 2.07 ± 1.15
4.555ValAsp: 4.555 ± 0.804
2.484ValGlu: 2.484 ± 0.612
0.828ValPhe: 0.828 ± 0.636
5.797ValGly: 5.797 ± 1.726
2.07ValHis: 2.07 ± 0.719
2.899ValIle: 2.899 ± 1.213
2.07ValLys: 2.07 ± 1.354
5.383ValLeu: 5.383 ± 1.709
0.828ValMet: 0.828 ± 0.79
3.313ValAsn: 3.313 ± 0.911
3.727ValPro: 3.727 ± 1.435
2.899ValGln: 2.899 ± 1.005
3.313ValArg: 3.313 ± 1.145
4.141ValSer: 4.141 ± 1.172
4.555ValThr: 4.555 ± 1.324
2.484ValVal: 2.484 ± 0.743
0.828ValTrp: 0.828 ± 0.586
2.899ValTyr: 2.899 ± 0.755
0.0ValXaa: 0.0 ± 0.0
Trp
1.242TrpAla: 1.242 ± 0.693
0.414TrpCys: 0.414 ± 0.357
0.828TrpAsp: 0.828 ± 0.714
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
0.414TrpGly: 0.414 ± 0.321
1.242TrpHis: 1.242 ± 0.721
0.414TrpIle: 0.414 ± 0.55
0.414TrpLys: 0.414 ± 0.354
1.242TrpLeu: 1.242 ± 0.697
0.414TrpMet: 0.414 ± 0.395
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
1.242TrpArg: 1.242 ± 0.692
0.828TrpSer: 0.828 ± 0.734
0.828TrpThr: 0.828 ± 0.79
0.828TrpVal: 0.828 ± 0.376
0.0TrpTrp: 0.0 ± 0.0
0.828TrpTyr: 0.828 ± 0.707
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.484TyrAla: 2.484 ± 0.785
0.828TyrCys: 0.828 ± 0.503
1.242TyrAsp: 1.242 ± 0.963
0.414TyrGlu: 0.414 ± 0.395
1.656TyrPhe: 1.656 ± 1.009
2.899TyrGly: 2.899 ± 0.78
0.0TyrHis: 0.0 ± 0.0
3.313TyrIle: 3.313 ± 1.193
0.828TyrLys: 0.828 ± 0.707
3.727TyrLeu: 3.727 ± 1.068
0.828TyrMet: 0.828 ± 0.707
0.828TyrAsn: 0.828 ± 0.642
0.828TyrPro: 0.828 ± 0.441
1.656TyrGln: 1.656 ± 0.61
2.484TyrArg: 2.484 ± 0.972
1.656TyrSer: 1.656 ± 0.807
2.899TyrThr: 2.899 ± 0.561
4.555TyrVal: 4.555 ± 1.081
0.414TyrTrp: 0.414 ± 0.357
2.07TyrTyr: 2.07 ± 0.869
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2416 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
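The exact bootstrap procedure is not described on this page. The following Python is a sketch under the assumption that whole proteins are resampled with replacement 100 times and that the error is the standard deviation of the recomputed frequency; the function names and toy sequences are hypothetical.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide over a list of sequences."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(estimates)


toy = ["MAAKL", "AAGAA", "MKLLV"]  # hypothetical sequences
point = pair_per_mille(toy, "AA")
error = bootstrap_error(toy, "AA")
```

Under this scheme a dipeptide that never occurs gets an error of exactly zero, consistent with the 0.0 ± 0.0 entries in the table.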

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski