Amino acid dipepetide frequency for Equus asinus papillomavirus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.894AlaAla: 10.894 ± 0.775
2.27AlaCys: 2.27 ± 0.952
4.993AlaAsp: 4.993 ± 1.734
6.809AlaGlu: 6.809 ± 1.564
3.177AlaPhe: 3.177 ± 0.549
5.901AlaGly: 5.901 ± 1.36
0.454AlaHis: 0.454 ± 0.405
1.816AlaIle: 1.816 ± 0.712
4.085AlaLys: 4.085 ± 0.845
6.355AlaLeu: 6.355 ± 1.823
2.27AlaMet: 2.27 ± 1.127
4.993AlaAsn: 4.993 ± 0.609
4.993AlaPro: 4.993 ± 0.854
3.177AlaGln: 3.177 ± 1.037
2.27AlaArg: 2.27 ± 0.839
6.809AlaSer: 6.809 ± 1.266
2.724AlaThr: 2.724 ± 0.821
6.355AlaVal: 6.355 ± 1.303
0.454AlaTrp: 0.454 ± 0.344
1.816AlaTyr: 1.816 ± 0.561
0.0AlaXaa: 0.0 ± 0.0
Cys
0.454CysAla: 0.454 ± 0.462
0.454CysCys: 0.454 ± 0.473
0.908CysAsp: 0.908 ± 0.488
1.816CysGlu: 1.816 ± 0.725
0.454CysPhe: 0.454 ± 0.344
0.908CysGly: 0.908 ± 0.675
0.0CysHis: 0.0 ± 0.0
0.908CysIle: 0.908 ± 0.689
2.724CysLys: 2.724 ± 0.452
0.454CysLeu: 0.454 ± 0.344
0.454CysMet: 0.454 ± 0.344
0.454CysAsn: 0.454 ± 0.462
3.177CysPro: 3.177 ± 0.814
0.0CysGln: 0.0 ± 0.0
1.362CysArg: 1.362 ± 0.457
2.27CysSer: 2.27 ± 1.305
0.454CysThr: 0.454 ± 0.462
0.454CysVal: 0.454 ± 0.473
0.454CysTrp: 0.454 ± 0.462
0.908CysTyr: 0.908 ± 0.675
0.0CysXaa: 0.0 ± 0.0
Asp
3.177AspAla: 3.177 ± 0.611
1.362AspCys: 1.362 ± 0.688
3.631AspAsp: 3.631 ± 1.393
4.539AspGlu: 4.539 ± 1.166
3.177AspPhe: 3.177 ± 1.423
4.085AspGly: 4.085 ± 1.327
0.908AspHis: 0.908 ± 0.689
3.631AspIle: 3.631 ± 1.642
1.362AspLys: 1.362 ± 0.857
8.171AspLeu: 8.171 ± 2.018
0.454AspMet: 0.454 ± 0.344
2.724AspAsn: 2.724 ± 0.452
3.631AspPro: 3.631 ± 0.551
1.816AspGln: 1.816 ± 0.589
2.27AspArg: 2.27 ± 0.839
4.085AspSer: 4.085 ± 1.479
3.631AspThr: 3.631 ± 1.658
3.631AspVal: 3.631 ± 1.775
1.816AspTrp: 1.816 ± 1.014
1.362AspTyr: 1.362 ± 0.857
0.0AspXaa: 0.0 ± 0.0
Glu
4.539GluAla: 4.539 ± 1.979
0.908GluCys: 0.908 ± 0.689
6.355GluAsp: 6.355 ± 1.81
5.447GluGlu: 5.447 ± 1.906
1.362GluPhe: 1.362 ± 0.361
5.447GluGly: 5.447 ± 1.489
0.454GluHis: 0.454 ± 0.405
2.724GluIle: 2.724 ± 0.436
0.454GluLys: 0.454 ± 0.453
4.993GluLeu: 4.993 ± 0.988
0.0GluMet: 0.0 ± 0.0
3.631GluAsn: 3.631 ± 1.146
2.724GluPro: 2.724 ± 0.543
3.631GluGln: 3.631 ± 1.373
6.809GluArg: 6.809 ± 0.732
1.816GluSer: 1.816 ± 0.927
5.447GluThr: 5.447 ± 2.029
4.539GluVal: 4.539 ± 0.489
0.454GluTrp: 0.454 ± 0.473
3.177GluTyr: 3.177 ± 1.421
0.0GluXaa: 0.0 ± 0.0
Phe
3.177PheAla: 3.177 ± 0.686
0.908PheCys: 0.908 ± 0.488
2.27PheAsp: 2.27 ± 0.936
2.27PheGlu: 2.27 ± 0.696
0.908PhePhe: 0.908 ± 0.47
1.362PheGly: 1.362 ± 0.8
0.454PheHis: 0.454 ± 0.453
0.908PheIle: 0.908 ± 0.689
3.631PheLys: 3.631 ± 1.809
4.993PheLeu: 4.993 ± 1.105
1.362PheMet: 1.362 ± 0.692
1.816PheAsn: 1.816 ± 1.288
0.908PhePro: 0.908 ± 0.466
1.816PheGln: 1.816 ± 0.782
1.816PheArg: 1.816 ± 0.606
2.27PheSer: 2.27 ± 1.338
2.724PheThr: 2.724 ± 0.665
1.816PheVal: 1.816 ± 0.854
2.724PheTrp: 2.724 ± 0.959
2.724PheTyr: 2.724 ± 0.665
0.0PheXaa: 0.0 ± 0.0
Gly
4.539GlyAla: 4.539 ± 1.085
1.362GlyCys: 1.362 ± 1.071
2.724GlyAsp: 2.724 ± 0.929
5.901GlyGlu: 5.901 ± 1.898
1.816GlyPhe: 1.816 ± 1.292
7.263GlyGly: 7.263 ± 1.107
2.724GlyHis: 2.724 ± 0.452
3.631GlyIle: 3.631 ± 0.851
2.724GlyLys: 2.724 ± 1.26
4.993GlyLeu: 4.993 ± 0.302
0.454GlyMet: 0.454 ± 0.453
4.085GlyAsn: 4.085 ± 0.871
4.085GlyPro: 4.085 ± 1.099
2.724GlyGln: 2.724 ± 0.722
7.717GlyArg: 7.717 ± 1.943
5.447GlySer: 5.447 ± 2.36
3.177GlyThr: 3.177 ± 0.868
3.631GlyVal: 3.631 ± 1.636
0.454GlyTrp: 0.454 ± 0.462
1.816GlyTyr: 1.816 ± 0.927
0.0GlyXaa: 0.0 ± 0.0
His
1.816HisAla: 1.816 ± 0.676
0.454HisCys: 0.454 ± 0.344
1.362HisAsp: 1.362 ± 0.615
0.454HisGlu: 0.454 ± 0.405
0.908HisPhe: 0.908 ± 0.466
1.362HisGly: 1.362 ± 0.927
1.816HisHis: 1.816 ± 0.947
0.908HisIle: 0.908 ± 0.81
0.454HisLys: 0.454 ± 0.344
2.724HisLeu: 2.724 ± 1.121
0.908HisMet: 0.908 ± 0.386
0.0HisAsn: 0.0 ± 0.0
1.362HisPro: 1.362 ± 0.698
0.908HisGln: 0.908 ± 0.689
3.177HisArg: 3.177 ± 1.977
1.362HisSer: 1.362 ± 0.433
0.0HisThr: 0.0 ± 0.0
0.908HisVal: 0.908 ± 0.427
0.908HisTrp: 0.908 ± 0.466
0.454HisTyr: 0.454 ± 0.405
0.0HisXaa: 0.0 ± 0.0
Ile
2.724IleAla: 2.724 ± 1.67
0.454IleCys: 0.454 ± 0.453
1.816IleAsp: 1.816 ± 0.589
4.993IleGlu: 4.993 ± 0.658
2.27IlePhe: 2.27 ± 0.486
3.177IleGly: 3.177 ± 1.747
0.454IleHis: 0.454 ± 0.453
1.362IleIle: 1.362 ± 0.433
0.908IleLys: 0.908 ± 0.555
2.27IleLeu: 2.27 ± 0.402
0.454IleMet: 0.454 ± 0.344
1.362IleAsn: 1.362 ± 0.782
2.27IlePro: 2.27 ± 1.443
0.908IleGln: 0.908 ± 0.689
0.908IleArg: 0.908 ± 0.488
2.724IleSer: 2.724 ± 1.352
2.27IleThr: 2.27 ± 0.402
3.177IleVal: 3.177 ± 1.125
0.0IleTrp: 0.0 ± 0.0
0.908IleTyr: 0.908 ± 0.683
0.0IleXaa: 0.0 ± 0.0
Lys
2.27LysAla: 2.27 ± 0.643
0.454LysCys: 0.454 ± 0.462
2.27LysAsp: 2.27 ± 1.253
1.816LysGlu: 1.816 ± 0.94
1.816LysPhe: 1.816 ± 1.288
0.908LysGly: 0.908 ± 0.466
1.362LysHis: 1.362 ± 0.688
0.908LysIle: 0.908 ± 0.488
1.816LysLys: 1.816 ± 0.782
2.724LysLeu: 2.724 ± 1.155
1.816LysMet: 1.816 ± 0.783
3.177LysAsn: 3.177 ± 1.186
0.908LysPro: 0.908 ± 0.47
1.362LysGln: 1.362 ± 0.433
3.177LysArg: 3.177 ± 0.879
3.177LysSer: 3.177 ± 1.189
1.362LysThr: 1.362 ± 0.615
4.539LysVal: 4.539 ± 1.886
1.362LysTrp: 1.362 ± 0.433
1.362LysTyr: 1.362 ± 0.677
0.0LysXaa: 0.0 ± 0.0
Leu
8.171LeuAla: 8.171 ± 2.298
3.177LeuCys: 3.177 ± 1.351
9.532LeuAsp: 9.532 ± 1.118
3.177LeuGlu: 3.177 ± 0.879
4.993LeuPhe: 4.993 ± 1.897
6.355LeuGly: 6.355 ± 0.839
2.27LeuHis: 2.27 ± 1.269
1.816LeuIle: 1.816 ± 0.854
4.539LeuLys: 4.539 ± 1.376
4.085LeuLeu: 4.085 ± 1.235
0.454LeuMet: 0.454 ± 0.405
0.908LeuAsn: 0.908 ± 0.47
4.085LeuPro: 4.085 ± 2.005
4.993LeuGln: 4.993 ± 1.402
4.539LeuArg: 4.539 ± 1.319
8.625LeuSer: 8.625 ± 2.432
6.355LeuThr: 6.355 ± 2.471
6.355LeuVal: 6.355 ± 0.943
0.454LeuTrp: 0.454 ± 0.344
3.631LeuTyr: 3.631 ± 1.217
0.0LeuXaa: 0.0 ± 0.0
Met
1.816MetAla: 1.816 ± 1.316
0.0MetCys: 0.0 ± 0.0
0.908MetAsp: 0.908 ± 0.464
0.908MetGlu: 0.908 ± 0.81
0.454MetPhe: 0.454 ± 0.462
0.908MetGly: 0.908 ± 0.47
0.908MetHis: 0.908 ± 0.488
0.454MetIle: 0.454 ± 0.344
0.0MetLys: 0.0 ± 0.0
1.816MetLeu: 1.816 ± 1.014
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
1.362MetPro: 1.362 ± 0.433
0.454MetGln: 0.454 ± 0.344
1.362MetArg: 1.362 ± 0.709
0.0MetSer: 0.0 ± 0.0
1.816MetThr: 1.816 ± 0.787
1.816MetVal: 1.816 ± 0.625
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.631AsnAla: 3.631 ± 1.38
0.454AsnCys: 0.454 ± 0.344
1.816AsnAsp: 1.816 ± 0.145
0.908AsnGlu: 0.908 ± 0.464
1.816AsnPhe: 1.816 ± 0.589
1.816AsnGly: 1.816 ± 1.063
0.0AsnHis: 0.0 ± 0.0
0.454AsnIle: 0.454 ± 0.344
1.816AsnLys: 1.816 ± 1.478
3.631AsnLeu: 3.631 ± 0.653
0.454AsnMet: 0.454 ± 0.344
2.724AsnAsn: 2.724 ± 1.807
5.901AsnPro: 5.901 ± 1.197
1.362AsnGln: 1.362 ± 0.457
4.539AsnArg: 4.539 ± 1.046
1.362AsnSer: 1.362 ± 0.457
4.539AsnThr: 4.539 ± 2.099
0.908AsnVal: 0.908 ± 0.464
0.908AsnTrp: 0.908 ± 0.689
0.908AsnTyr: 0.908 ± 0.47
0.0AsnXaa: 0.0 ± 0.0
Pro
9.986ProAla: 9.986 ± 2.565
0.454ProCys: 0.454 ± 0.462
4.539ProAsp: 4.539 ± 0.586
4.539ProGlu: 4.539 ± 0.395
2.27ProPhe: 2.27 ± 0.303
4.085ProGly: 4.085 ± 0.826
0.0ProHis: 0.0 ± 0.0
0.908ProIle: 0.908 ± 0.81
2.27ProLys: 2.27 ± 0.804
8.625ProLeu: 8.625 ± 1.189
1.816ProMet: 1.816 ± 0.625
2.27ProAsn: 2.27 ± 1.731
6.809ProPro: 6.809 ± 1.882
2.27ProGln: 2.27 ± 1.216
3.177ProArg: 3.177 ± 0.547
5.901ProSer: 5.901 ± 1.12
3.631ProThr: 3.631 ± 1.329
3.177ProVal: 3.177 ± 0.547
0.454ProTrp: 0.454 ± 0.462
2.724ProTyr: 2.724 ± 0.452
0.0ProXaa: 0.0 ± 0.0
Gln
4.539GlnAla: 4.539 ± 1.166
0.454GlnCys: 0.454 ± 0.344
1.362GlnAsp: 1.362 ± 0.822
3.177GlnGlu: 3.177 ± 1.217
0.908GlnPhe: 0.908 ± 0.464
4.539GlnGly: 4.539 ± 1.001
0.0GlnHis: 0.0 ± 0.0
1.816GlnIle: 1.816 ± 0.782
0.908GlnLys: 0.908 ± 0.923
2.724GlnLeu: 2.724 ± 1.202
0.0GlnMet: 0.0 ± 0.0
2.27GlnAsn: 2.27 ± 1.127
0.454GlnPro: 0.454 ± 0.453
2.724GlnGln: 2.724 ± 0.632
1.816GlnArg: 1.816 ± 0.738
3.177GlnSer: 3.177 ± 0.767
1.362GlnThr: 1.362 ± 0.902
4.085GlnVal: 4.085 ± 0.61
0.454GlnTrp: 0.454 ± 0.344
0.908GlnTyr: 0.908 ± 0.907
0.0GlnXaa: 0.0 ± 0.0
Arg
5.447ArgAla: 5.447 ± 1.618
2.27ArgCys: 2.27 ± 0.956
2.724ArgAsp: 2.724 ± 0.997
2.27ArgGlu: 2.27 ± 0.964
1.816ArgPhe: 1.816 ± 0.625
3.631ArgGly: 3.631 ± 0.913
3.631ArgHis: 3.631 ± 1.24
2.724ArgIle: 2.724 ± 1.143
3.177ArgLys: 3.177 ± 1.135
7.717ArgLeu: 7.717 ± 1.427
0.0ArgMet: 0.0 ± 0.0
3.177ArgAsn: 3.177 ± 1.023
5.447ArgPro: 5.447 ± 0.436
1.362ArgGln: 1.362 ± 0.907
5.447ArgArg: 5.447 ± 1.041
6.355ArgSer: 6.355 ± 0.951
3.177ArgThr: 3.177 ± 0.547
3.631ArgVal: 3.631 ± 1.247
0.454ArgTrp: 0.454 ± 0.344
2.27ArgTyr: 2.27 ± 0.581
0.0ArgXaa: 0.0 ± 0.0
Ser
6.355SerAla: 6.355 ± 1.569
1.362SerCys: 1.362 ± 0.861
4.085SerAsp: 4.085 ± 1.198
3.177SerGlu: 3.177 ± 1.261
4.085SerPhe: 4.085 ± 1.071
5.901SerGly: 5.901 ± 1.827
1.816SerHis: 1.816 ± 0.927
2.724SerIle: 2.724 ± 0.722
1.362SerLys: 1.362 ± 0.857
6.809SerLeu: 6.809 ± 1.512
0.454SerMet: 0.454 ± 0.405
1.362SerAsn: 1.362 ± 0.677
6.355SerPro: 6.355 ± 1.551
3.177SerGln: 3.177 ± 1.514
5.901SerArg: 5.901 ± 0.868
6.355SerSer: 6.355 ± 2.534
8.171SerThr: 8.171 ± 2.653
4.085SerVal: 4.085 ± 1.218
0.454SerTrp: 0.454 ± 0.462
1.816SerTyr: 1.816 ± 0.927
0.0SerXaa: 0.0 ± 0.0
Thr
4.539ThrAla: 4.539 ± 1.928
0.454ThrCys: 0.454 ± 0.405
3.177ThrAsp: 3.177 ± 0.961
4.539ThrGlu: 4.539 ± 1.712
2.724ThrPhe: 2.724 ± 1.0
6.355ThrGly: 6.355 ± 2.028
1.816ThrHis: 1.816 ± 0.625
1.816ThrIle: 1.816 ± 0.94
1.362ThrLys: 1.362 ± 0.716
6.809ThrLeu: 6.809 ± 1.658
1.816ThrMet: 1.816 ± 0.729
1.816ThrAsn: 1.816 ± 0.725
6.355ThrPro: 6.355 ± 1.146
0.908ThrGln: 0.908 ± 0.464
4.539ThrArg: 4.539 ± 1.377
4.085ThrSer: 4.085 ± 1.199
4.085ThrThr: 4.085 ± 1.245
3.177ThrVal: 3.177 ± 0.547
1.816ThrTrp: 1.816 ± 0.803
0.0ThrTyr: 0.0 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
0.908ValAla: 0.908 ± 0.614
1.362ValCys: 1.362 ± 0.677
2.724ValAsp: 2.724 ± 0.849
4.539ValGlu: 4.539 ± 0.898
2.724ValPhe: 2.724 ± 0.452
4.085ValGly: 4.085 ± 1.178
2.27ValHis: 2.27 ± 1.162
4.085ValIle: 4.085 ± 1.099
3.177ValLys: 3.177 ± 0.698
4.539ValLeu: 4.539 ± 1.475
0.908ValMet: 0.908 ± 0.923
2.27ValAsn: 2.27 ± 1.375
5.447ValPro: 5.447 ± 1.776
2.27ValGln: 2.27 ± 0.956
2.724ValArg: 2.724 ± 0.406
7.263ValSer: 7.263 ± 1.773
4.539ValThr: 4.539 ± 1.287
3.177ValVal: 3.177 ± 1.198
1.816ValTrp: 1.816 ± 1.316
1.816ValTyr: 1.816 ± 1.18
0.0ValXaa: 0.0 ± 0.0
Trp
2.27TrpAla: 2.27 ± 0.696
0.0TrpCys: 0.0 ± 0.0
1.362TrpAsp: 1.362 ± 0.914
0.908TrpGlu: 0.908 ± 0.555
0.454TrpPhe: 0.454 ± 0.344
1.362TrpGly: 1.362 ± 0.688
0.908TrpHis: 0.908 ± 0.555
0.0TrpIle: 0.0 ± 0.0
0.454TrpLys: 0.454 ± 0.344
2.724TrpLeu: 2.724 ± 1.376
0.0TrpMet: 0.0 ± 0.0
0.454TrpAsn: 0.454 ± 0.344
1.362TrpPro: 1.362 ± 0.569
0.0TrpGln: 0.0 ± 0.0
0.908TrpArg: 0.908 ± 0.923
0.454TrpSer: 0.454 ± 0.405
1.362TrpThr: 1.362 ± 0.914
0.454TrpVal: 0.454 ± 0.344
0.0TrpTrp: 0.0 ± 0.0
1.362TrpTyr: 1.362 ± 0.861
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.362TyrAla: 1.362 ± 0.466
0.454TyrCys: 0.454 ± 0.473
0.908TyrAsp: 0.908 ± 0.464
2.724TyrGlu: 2.724 ± 0.931
2.724TyrPhe: 2.724 ± 0.941
1.816TyrGly: 1.816 ± 0.606
0.454TyrHis: 0.454 ± 0.462
1.816TyrIle: 1.816 ± 0.589
1.362TyrLys: 1.362 ± 0.615
1.362TyrLeu: 1.362 ± 0.361
0.454TyrMet: 0.454 ± 0.344
0.454TyrAsn: 0.454 ± 0.405
2.27TyrPro: 2.27 ± 0.696
1.816TyrGln: 1.816 ± 1.288
2.27TyrArg: 2.27 ± 1.194
2.27TyrSer: 2.27 ± 1.112
1.362TyrThr: 1.362 ± 0.914
2.724TyrVal: 2.724 ± 0.632
1.362TyrTrp: 1.362 ± 0.782
0.454TyrTyr: 0.454 ± 0.344
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (2204 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski