Amino acid dipepetide frequency for Equus caballus papillomavirus 8

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.689AlaAla: 4.689 ± 1.148
1.954AlaCys: 1.954 ± 0.74
3.126AlaAsp: 3.126 ± 1.139
5.08AlaGlu: 5.08 ± 1.665
3.517AlaPhe: 3.517 ± 1.308
5.471AlaGly: 5.471 ± 2.177
1.563AlaHis: 1.563 ± 0.557
1.172AlaIle: 1.172 ± 0.529
3.908AlaLys: 3.908 ± 1.646
4.299AlaLeu: 4.299 ± 1.025
2.345AlaMet: 2.345 ± 0.48
2.735AlaAsn: 2.735 ± 0.565
5.08AlaPro: 5.08 ± 2.551
3.908AlaGln: 3.908 ± 1.171
3.908AlaArg: 3.908 ± 0.827
3.517AlaSer: 3.517 ± 1.419
2.735AlaThr: 2.735 ± 0.837
6.252AlaVal: 6.252 ± 1.806
1.172AlaTrp: 1.172 ± 0.621
1.563AlaTyr: 1.563 ± 0.615
0.0AlaXaa: 0.0 ± 0.0
Cys
1.172CysAla: 1.172 ± 0.956
0.391CysCys: 0.391 ± 0.523
0.782CysAsp: 0.782 ± 0.491
0.391CysGlu: 0.391 ± 0.31
0.782CysPhe: 0.782 ± 0.34
1.172CysGly: 1.172 ± 0.529
0.0CysHis: 0.0 ± 0.0
0.782CysIle: 0.782 ± 1.046
1.563CysLys: 1.563 ± 0.683
1.954CysLeu: 1.954 ± 1.316
0.782CysMet: 0.782 ± 0.34
0.391CysAsn: 0.391 ± 0.317
1.954CysPro: 1.954 ± 0.734
0.782CysGln: 0.782 ± 0.619
2.735CysArg: 2.735 ± 1.611
2.345CysSer: 2.345 ± 0.922
1.172CysThr: 1.172 ± 0.579
0.782CysVal: 0.782 ± 0.688
0.782CysTrp: 0.782 ± 0.434
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
6.252AspAla: 6.252 ± 1.18
1.563AspCys: 1.563 ± 0.503
2.735AspAsp: 2.735 ± 1.363
2.345AspGlu: 2.345 ± 1.058
1.954AspPhe: 1.954 ± 1.104
2.735AspGly: 2.735 ± 0.582
1.172AspHis: 1.172 ± 0.95
0.782AspIle: 0.782 ± 0.559
1.172AspLys: 1.172 ± 0.44
5.471AspLeu: 5.471 ± 1.489
1.563AspMet: 1.563 ± 0.869
2.735AspAsn: 2.735 ± 1.53
5.08AspPro: 5.08 ± 1.496
2.345AspGln: 2.345 ± 1.025
1.172AspArg: 1.172 ± 0.565
5.471AspSer: 5.471 ± 1.559
2.345AspThr: 2.345 ± 0.894
4.299AspVal: 4.299 ± 1.542
1.172AspTrp: 1.172 ± 0.579
0.0AspTyr: 0.0 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
1.954GluAla: 1.954 ± 0.766
0.391GluCys: 0.391 ± 0.523
3.126GluAsp: 3.126 ± 1.379
4.689GluGlu: 4.689 ± 3.406
0.782GluPhe: 0.782 ± 0.432
7.425GluGly: 7.425 ± 2.387
1.954GluHis: 1.954 ± 0.388
2.345GluIle: 2.345 ± 1.262
1.172GluLys: 1.172 ± 0.631
5.08GluLeu: 5.08 ± 0.951
2.345GluMet: 2.345 ± 0.684
1.172GluAsn: 1.172 ± 0.379
7.425GluPro: 7.425 ± 4.281
2.735GluGln: 2.735 ± 0.791
5.471GluArg: 5.471 ± 2.805
2.735GluSer: 2.735 ± 0.613
2.735GluThr: 2.735 ± 0.76
4.299GluVal: 4.299 ± 1.276
0.391GluTrp: 0.391 ± 0.31
1.172GluTyr: 1.172 ± 0.44
0.0GluXaa: 0.0 ± 0.0
Phe
2.345PheAla: 2.345 ± 1.137
1.172PheCys: 1.172 ± 0.631
1.954PheAsp: 1.954 ± 0.736
1.563PheGlu: 1.563 ± 0.879
1.563PhePhe: 1.563 ± 0.581
2.735PheGly: 2.735 ± 1.362
0.391PheHis: 0.391 ± 0.317
3.126PheIle: 3.126 ± 1.055
2.345PheLys: 2.345 ± 1.158
3.517PheLeu: 3.517 ± 0.666
0.391PheMet: 0.391 ± 0.31
1.954PheAsn: 1.954 ± 1.323
1.563PhePro: 1.563 ± 0.664
1.172PheGln: 1.172 ± 0.799
3.126PheArg: 3.126 ± 0.474
1.954PheSer: 1.954 ± 0.84
1.563PheThr: 1.563 ± 0.856
1.954PheVal: 1.954 ± 0.388
1.172PheTrp: 1.172 ± 0.776
2.345PheTyr: 2.345 ± 0.686
0.0PheXaa: 0.0 ± 0.0
Gly
5.862GlyAla: 5.862 ± 0.978
1.954GlyCys: 1.954 ± 1.149
5.862GlyAsp: 5.862 ± 1.83
4.689GlyGlu: 4.689 ± 1.395
1.172GlyPhe: 1.172 ± 0.698
10.942GlyGly: 10.942 ± 4.027
4.299GlyHis: 4.299 ± 1.035
2.345GlyIle: 2.345 ± 0.734
1.954GlyLys: 1.954 ± 0.837
4.689GlyLeu: 4.689 ± 1.477
0.391GlyMet: 0.391 ± 0.317
1.172GlyAsn: 1.172 ± 0.568
10.16GlyPro: 10.16 ± 3.571
4.689GlyGln: 4.689 ± 1.982
5.862GlyArg: 5.862 ± 2.417
10.551GlySer: 10.551 ± 2.609
4.689GlyThr: 4.689 ± 1.147
5.08GlyVal: 5.08 ± 1.317
1.563GlyTrp: 1.563 ± 0.892
1.563GlyTyr: 1.563 ± 0.721
0.0GlyXaa: 0.0 ± 0.0
His
1.563HisAla: 1.563 ± 0.849
0.782HisCys: 0.782 ± 0.491
0.391HisAsp: 0.391 ± 0.317
0.391HisGlu: 0.391 ± 0.478
1.563HisPhe: 1.563 ± 0.495
0.782HisGly: 0.782 ± 0.581
0.391HisHis: 0.391 ± 0.31
1.563HisIle: 1.563 ± 0.615
1.172HisLys: 1.172 ± 0.631
0.782HisLeu: 0.782 ± 0.428
1.172HisMet: 1.172 ± 0.347
0.782HisAsn: 0.782 ± 0.362
1.954HisPro: 1.954 ± 0.46
1.172HisGln: 1.172 ± 0.631
1.172HisArg: 1.172 ± 0.579
1.954HisSer: 1.954 ± 0.303
1.563HisThr: 1.563 ± 0.503
1.954HisVal: 1.954 ± 1.088
1.172HisTrp: 1.172 ± 0.347
0.391HisTyr: 0.391 ± 0.317
0.0HisXaa: 0.0 ± 0.0
Ile
3.517IleAla: 3.517 ± 1.429
0.391IleCys: 0.391 ± 0.478
1.172IleAsp: 1.172 ± 0.631
2.735IleGlu: 2.735 ± 0.673
1.172IlePhe: 1.172 ± 0.649
2.735IleGly: 2.735 ± 1.308
0.782IleHis: 0.782 ± 0.434
0.782IleIle: 0.782 ± 0.491
0.782IleLys: 0.782 ± 0.688
1.563IleLeu: 1.563 ± 0.819
1.563IleMet: 1.563 ± 1.21
0.782IleAsn: 0.782 ± 0.581
1.563IlePro: 1.563 ± 0.856
1.563IleGln: 1.563 ± 0.852
0.782IleArg: 0.782 ± 0.581
3.126IleSer: 3.126 ± 0.873
2.735IleThr: 2.735 ± 1.083
0.391IleVal: 0.391 ± 0.478
0.782IleTrp: 0.782 ± 0.619
0.782IleTyr: 0.782 ± 0.432
0.0IleXaa: 0.0 ± 0.0
Lys
0.782LysAla: 0.782 ± 0.34
1.172LysCys: 1.172 ± 0.631
0.391LysAsp: 0.391 ± 0.317
1.563LysGlu: 1.563 ± 0.679
1.954LysPhe: 1.954 ± 0.767
1.563LysGly: 1.563 ± 0.607
1.172LysHis: 1.172 ± 0.95
1.172LysIle: 1.172 ± 0.631
3.126LysLys: 3.126 ± 1.359
1.954LysLeu: 1.954 ± 0.723
0.391LysMet: 0.391 ± 0.344
1.563LysAsn: 1.563 ± 0.503
2.735LysPro: 2.735 ± 1.964
1.954LysGln: 1.954 ± 1.134
4.689LysArg: 4.689 ± 1.031
3.126LysSer: 3.126 ± 1.243
2.735LysThr: 2.735 ± 0.748
2.345LysVal: 2.345 ± 1.262
0.0LysTrp: 0.0 ± 0.0
0.391LysTyr: 0.391 ± 0.344
0.0LysXaa: 0.0 ± 0.0
Leu
4.689LeuAla: 4.689 ± 1.338
1.172LeuCys: 1.172 ± 0.565
5.08LeuAsp: 5.08 ± 1.125
3.126LeuGlu: 3.126 ± 1.107
5.08LeuPhe: 5.08 ± 1.099
10.942LeuGly: 10.942 ± 2.53
1.954LeuHis: 1.954 ± 0.715
1.563LeuIle: 1.563 ± 0.869
2.345LeuLys: 2.345 ± 1.137
7.816LeuLeu: 7.816 ± 1.085
0.0LeuMet: 0.0 ± 0.0
1.954LeuAsn: 1.954 ± 0.406
4.689LeuPro: 4.689 ± 1.593
5.471LeuGln: 5.471 ± 0.842
7.425LeuArg: 7.425 ± 1.043
7.425LeuSer: 7.425 ± 2.341
5.08LeuThr: 5.08 ± 1.011
5.08LeuVal: 5.08 ± 1.881
0.391LeuTrp: 0.391 ± 0.317
3.126LeuTyr: 3.126 ± 0.886
0.0LeuXaa: 0.0 ± 0.0
Met
0.782MetAla: 0.782 ± 0.633
0.0MetCys: 0.0 ± 0.0
1.172MetAsp: 1.172 ± 0.717
0.782MetGlu: 0.782 ± 0.559
1.172MetPhe: 1.172 ± 0.44
0.782MetGly: 0.782 ± 0.34
0.0MetHis: 0.0 ± 0.0
0.782MetIle: 0.782 ± 0.688
0.0MetLys: 0.0 ± 0.0
2.345MetLeu: 2.345 ± 1.019
0.0MetMet: 0.0 ± 0.0
0.391MetAsn: 0.391 ± 0.31
0.391MetPro: 0.391 ± 0.317
0.0MetGln: 0.0 ± 0.0
1.172MetArg: 1.172 ± 0.674
1.954MetSer: 1.954 ± 1.266
1.954MetThr: 1.954 ± 0.46
1.172MetVal: 1.172 ± 0.579
0.391MetTrp: 0.391 ± 0.31
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.345AsnAla: 2.345 ± 1.059
0.782AsnCys: 0.782 ± 0.34
1.563AsnAsp: 1.563 ± 0.664
0.782AsnGlu: 0.782 ± 0.491
0.782AsnPhe: 0.782 ± 0.633
0.782AsnGly: 0.782 ± 0.362
0.0AsnHis: 0.0 ± 0.0
1.172AsnIle: 1.172 ± 0.621
1.563AsnLys: 1.563 ± 0.664
1.563AsnLeu: 1.563 ± 0.751
0.391AsnMet: 0.391 ± 0.344
1.954AsnAsn: 1.954 ± 0.772
3.908AsnPro: 3.908 ± 0.955
2.345AsnGln: 2.345 ± 0.94
1.954AsnArg: 1.954 ± 1.287
1.954AsnSer: 1.954 ± 0.715
1.954AsnThr: 1.954 ± 0.748
2.735AsnVal: 2.735 ± 1.336
0.782AsnTrp: 0.782 ± 0.619
1.563AsnTyr: 1.563 ± 0.834
0.0AsnXaa: 0.0 ± 0.0
Pro
5.471ProAla: 5.471 ± 1.775
0.782ProCys: 0.782 ± 0.491
5.862ProAsp: 5.862 ± 1.094
9.379ProGlu: 9.379 ± 4.094
0.0ProPhe: 0.0 ± 0.0
10.16ProGly: 10.16 ± 4.843
0.782ProHis: 0.782 ± 0.34
3.126ProIle: 3.126 ± 0.636
2.345ProLys: 2.345 ± 0.986
6.643ProLeu: 6.643 ± 0.946
0.391ProMet: 0.391 ± 0.294
4.299ProAsn: 4.299 ± 1.356
7.425ProPro: 7.425 ± 0.872
8.988ProGln: 8.988 ± 5.117
5.862ProArg: 5.862 ± 1.373
5.862ProSer: 5.862 ± 1.834
1.172ProThr: 1.172 ± 0.799
3.517ProVal: 3.517 ± 0.593
0.782ProTrp: 0.782 ± 0.619
2.345ProTyr: 2.345 ± 1.235
0.0ProXaa: 0.0 ± 0.0
Gln
3.908GlnAla: 3.908 ± 1.049
1.563GlnCys: 1.563 ± 0.667
1.954GlnAsp: 1.954 ± 0.949
3.126GlnGlu: 3.126 ± 1.289
2.735GlnPhe: 2.735 ± 0.81
1.172GlnGly: 1.172 ± 0.929
0.391GlnHis: 0.391 ± 0.31
0.391GlnIle: 0.391 ± 0.31
0.782GlnLys: 0.782 ± 0.434
5.471GlnLeu: 5.471 ± 1.374
0.0GlnMet: 0.0 ± 0.0
1.172GlnAsn: 1.172 ± 0.44
8.206GlnPro: 8.206 ± 4.756
3.126GlnGln: 3.126 ± 0.773
3.908GlnArg: 3.908 ± 0.971
3.908GlnSer: 3.908 ± 1.831
1.954GlnThr: 1.954 ± 0.558
3.517GlnVal: 3.517 ± 0.593
0.391GlnTrp: 0.391 ± 0.317
1.172GlnTyr: 1.172 ± 0.649
0.0GlnXaa: 0.0 ± 0.0
Arg
6.643ArgAla: 6.643 ± 2.028
1.563ArgCys: 1.563 ± 0.692
1.563ArgAsp: 1.563 ± 1.186
2.735ArgGlu: 2.735 ± 1.379
1.954ArgPhe: 1.954 ± 0.548
8.206ArgGly: 8.206 ± 1.994
2.735ArgHis: 2.735 ± 0.956
1.172ArgIle: 1.172 ± 0.543
2.735ArgLys: 2.735 ± 0.821
9.769ArgLeu: 9.769 ± 0.56
0.0ArgMet: 0.0 ± 0.0
1.954ArgAsn: 1.954 ± 1.189
5.471ArgPro: 5.471 ± 1.752
2.735ArgGln: 2.735 ± 0.707
5.08ArgArg: 5.08 ± 0.664
6.252ArgSer: 6.252 ± 1.821
3.908ArgThr: 3.908 ± 1.2
6.252ArgVal: 6.252 ± 0.972
1.954ArgTrp: 1.954 ± 0.949
1.172ArgTyr: 1.172 ± 0.579
0.0ArgXaa: 0.0 ± 0.0
Ser
5.862SerAla: 5.862 ± 2.447
1.172SerCys: 1.172 ± 0.883
3.517SerAsp: 3.517 ± 1.498
3.908SerGlu: 3.908 ± 0.955
3.517SerPhe: 3.517 ± 1.279
10.16SerGly: 10.16 ± 2.108
0.782SerHis: 0.782 ± 0.432
2.345SerIle: 2.345 ± 0.88
2.735SerLys: 2.735 ± 0.877
8.988SerLeu: 8.988 ± 1.604
0.782SerMet: 0.782 ± 0.633
3.908SerAsn: 3.908 ± 1.354
5.471SerPro: 5.471 ± 1.04
1.563SerGln: 1.563 ± 0.498
5.08SerArg: 5.08 ± 0.825
6.252SerSer: 6.252 ± 1.67
5.862SerThr: 5.862 ± 1.799
6.252SerVal: 6.252 ± 0.958
1.563SerTrp: 1.563 ± 0.498
3.126SerTyr: 3.126 ± 1.878
0.0SerXaa: 0.0 ± 0.0
Thr
2.735ThrAla: 2.735 ± 0.854
1.563ThrCys: 1.563 ± 0.615
4.299ThrAsp: 4.299 ± 0.917
2.735ThrGlu: 2.735 ± 1.189
2.345ThrPhe: 2.345 ± 0.922
3.517ThrGly: 3.517 ± 1.29
1.172ThrHis: 1.172 ± 0.347
0.782ThrIle: 0.782 ± 0.434
1.172ThrLys: 1.172 ± 1.032
5.08ThrLeu: 5.08 ± 0.953
0.391ThrMet: 0.391 ± 0.396
0.391ThrAsn: 0.391 ± 0.31
5.862ThrPro: 5.862 ± 1.337
0.782ThrGln: 0.782 ± 0.458
5.08ThrArg: 5.08 ± 1.24
5.862ThrSer: 5.862 ± 1.345
2.735ThrThr: 2.735 ± 1.129
3.908ThrVal: 3.908 ± 1.539
1.172ThrTrp: 1.172 ± 0.929
2.735ThrTyr: 2.735 ± 0.81
0.0ThrXaa: 0.0 ± 0.0
Val
5.471ValAla: 5.471 ± 1.429
1.563ValCys: 1.563 ± 0.553
4.299ValAsp: 4.299 ± 0.755
4.689ValGlu: 4.689 ± 1.393
4.299ValPhe: 4.299 ± 1.966
6.252ValGly: 6.252 ± 1.742
2.345ValHis: 2.345 ± 1.051
2.735ValIle: 2.735 ± 1.342
2.735ValLys: 2.735 ± 0.74
3.126ValLeu: 3.126 ± 0.41
1.172ValMet: 1.172 ± 0.579
0.782ValAsn: 0.782 ± 0.428
3.126ValPro: 3.126 ± 0.722
2.735ValGln: 2.735 ± 0.98
4.299ValArg: 4.299 ± 0.733
5.08ValSer: 5.08 ± 1.269
4.689ValThr: 4.689 ± 2.343
2.345ValVal: 2.345 ± 1.098
1.172ValTrp: 1.172 ± 0.44
1.172ValTyr: 1.172 ± 0.347
0.0ValXaa: 0.0 ± 0.0
Trp
1.172TrpAla: 1.172 ± 0.678
0.391TrpCys: 0.391 ± 0.31
1.563TrpAsp: 1.563 ± 0.495
1.563TrpGlu: 1.563 ± 0.602
0.391TrpPhe: 0.391 ± 0.411
1.172TrpGly: 1.172 ± 0.568
0.391TrpHis: 0.391 ± 0.411
1.172TrpIle: 1.172 ± 0.631
0.391TrpLys: 0.391 ± 0.317
1.954TrpLeu: 1.954 ± 0.949
0.0TrpMet: 0.0 ± 0.0
0.391TrpAsn: 0.391 ± 0.344
1.563TrpPro: 1.563 ± 1.239
0.0TrpGln: 0.0 ± 0.0
2.345TrpArg: 2.345 ± 1.182
1.172TrpSer: 1.172 ± 0.929
1.563TrpThr: 1.563 ± 0.495
1.172TrpVal: 1.172 ± 0.346
0.782TrpTrp: 0.782 ± 0.458
0.391TrpTyr: 0.391 ± 0.317
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.782TyrAla: 0.782 ± 0.581
0.391TyrCys: 0.391 ± 0.317
1.954TyrAsp: 1.954 ± 0.388
3.126TyrGlu: 3.126 ± 1.031
1.172TyrPhe: 1.172 ± 0.565
0.782TyrGly: 0.782 ± 0.458
0.391TyrHis: 0.391 ± 0.411
0.391TyrIle: 0.391 ± 0.344
0.782TyrLys: 0.782 ± 0.633
2.345TyrLeu: 2.345 ± 0.416
0.782TyrMet: 0.782 ± 0.34
0.391TyrAsn: 0.391 ± 0.344
1.563TyrPro: 1.563 ± 0.683
0.782TyrGln: 0.782 ± 0.362
2.735TyrArg: 2.735 ± 0.9
2.345TyrSer: 2.345 ± 1.073
1.563TyrThr: 1.563 ± 0.856
0.782TyrVal: 0.782 ± 0.458
1.954TyrTrp: 1.954 ± 0.917
0.782TyrTyr: 0.782 ± 0.639
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2560 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski