Amino acid dipeptide frequency for Bos taurus papillomavirus 18

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
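As a rough illustration of what such a calculation could look like, here is a minimal Python sketch. It assumes overlapping windows of two residues that never span two proteins, one-letter amino acid codes (the table uses three-letter codes), and that any non-standard letter is mapped to X (Xaa). The function name `dipeptide_per_mille` and the toy sequences are hypothetical, and this is not necessarily how the values on this page were produced.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes 'X'.
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2, counted within each protein only.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not real viral proteins.
freqs = dipeptide_per_mille(["MAAK", "AAB"])
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

Counting within each protein separately is a design choice; concatenating the sequences first would add one artificial dipeptide at every protein boundary.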

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one should be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
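The arithmetic behind the expected value, and the over/under comparison, can be sketched in a few lines of Python. The uniform baseline of 1/400 comes from the text above. The exact threshold used for the red/blue colouring on the page is not stated, so the `classify` helper below is only an assumption for illustration.

```python
N_STANDARD = 20

# Uniform expectation in per mille for 20 x 20 and 21 x 21 dipeptides.
uniform_400 = 1000.0 / N_STANDARD ** 2          # 2.5 per mille = 0.25%
uniform_441 = 1000.0 / (N_STANDARD + 1) ** 2    # about 2.268 per mille


def classify(value_per_mille, expected=uniform_400):
    """Label a value relative to the uniform expectation (hypothetical rule)."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"


# Three values taken from the table on this page.
examples = {"LeuLeu": 9.162, "AlaCys": 0.873, "AlaPro": 2.618}
labels = {name: classify(value) for name, value in examples.items()}
print(uniform_400, round(uniform_441, 3), labels)
```

Under this baseline LeuLeu (9.162‰) is overrepresented and AlaCys (0.873‰) is underrepresented, while AlaPro (2.618‰) sits only slightly above the expectation.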

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
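A short Python sketch of the unit conversion follows. The denominator of 2292 dipeptides (2293 amino acids minus 1) is an inference from the numbers on this page, since the table values look like multiples of about 0.436‰; the page itself does not state it, so treat the back-calculated counts as approximate.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent."""
    return value / 10.0


# Assumption: total number of dipeptides inferred from 2293 amino acids.
N_DIPEPTIDES = 2293 - 1


def approx_count(value, n_dipeptides=N_DIPEPTIDES):
    """Back-calculate an approximate raw count from a per mille value."""
    return round(per_mille_to_fraction(value) * n_dipeptides)


print(per_mille_to_fraction(3.49))   # AlaAla as a fraction
print(per_mille_to_percent(3.49))    # AlaAla in percent
print(approx_count(0.436), approx_count(3.49), approx_count(9.162))
```

Under that assumption, 0.436‰ corresponds to a single occurrence, 3.49‰ (AlaAla) to about 8, and 9.162‰ (LeuLeu) to about 21.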

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.49 ± 1.136
AlaCys: 0.873 ± 0.482
AlaAsp: 6.108 ± 0.845
AlaGlu: 2.618 ± 1.19
AlaPhe: 3.054 ± 0.894
AlaGly: 3.927 ± 0.77
AlaHis: 0.873 ± 0.386
AlaIle: 1.745 ± 0.538
AlaLys: 3.054 ± 1.235
AlaLeu: 2.182 ± 0.751
AlaMet: 0.0 ± 0.0
AlaAsn: 3.054 ± 0.862
AlaPro: 2.618 ± 0.886
AlaGln: 0.873 ± 0.889
AlaArg: 4.363 ± 2.535
AlaSer: 3.927 ± 1.385
AlaThr: 5.672 ± 1.437
AlaVal: 5.236 ± 0.687
AlaTrp: 0.873 ± 0.599
AlaTyr: 1.309 ± 0.572
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.309 ± 0.649
CysCys: 0.873 ± 0.482
CysAsp: 0.436 ± 0.3
CysGlu: 0.873 ± 0.386
CysPhe: 1.309 ± 0.936
CysGly: 0.873 ± 0.569
CysHis: 0.436 ± 0.528
CysIle: 1.309 ± 1.358
CysLys: 3.054 ± 1.01
CysLeu: 2.618 ± 1.087
CysMet: 0.873 ± 0.482
CysAsn: 0.0 ± 0.0
CysPro: 1.745 ± 0.2
CysGln: 0.873 ± 0.532
CysArg: 0.873 ± 0.482
CysSer: 0.436 ± 0.453
CysThr: 1.309 ± 0.682
CysVal: 0.873 ± 0.482
CysTrp: 0.436 ± 0.3
CysTyr: 0.436 ± 0.453
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.927 ± 0.817
AspCys: 1.745 ± 0.485
AspAsp: 3.49 ± 1.073
AspGlu: 3.927 ± 1.142
AspPhe: 2.618 ± 1.11
AspGly: 4.363 ± 1.488
AspHis: 0.873 ± 0.431
AspIle: 3.927 ± 1.597
AspLys: 2.182 ± 0.436
AspLeu: 5.236 ± 1.471
AspMet: 2.618 ± 1.664
AspAsn: 0.873 ± 0.386
AspPro: 3.927 ± 0.907
AspGln: 1.309 ± 0.672
AspArg: 2.182 ± 1.325
AspSer: 3.927 ± 1.084
AspThr: 3.49 ± 0.647
AspVal: 2.618 ± 0.484
AspTrp: 0.873 ± 0.482
AspTyr: 3.054 ± 0.833
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.054 ± 0.986
GluCys: 1.309 ± 0.682
GluAsp: 5.672 ± 0.754
GluGlu: 6.981 ± 1.799
GluPhe: 1.745 ± 0.751
GluGly: 1.745 ± 0.2
GluHis: 0.873 ± 0.386
GluIle: 4.363 ± 1.373
GluLys: 1.309 ± 0.594
GluLeu: 6.108 ± 1.101
GluMet: 1.745 ± 0.921
GluAsn: 4.363 ± 0.434
GluPro: 4.799 ± 1.352
GluGln: 3.49 ± 1.091
GluArg: 3.49 ± 0.546
GluSer: 4.363 ± 0.791
GluThr: 7.853 ± 2.264
GluVal: 3.49 ± 1.327
GluTrp: 1.309 ± 0.88
GluTyr: 2.182 ± 0.717
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.182 ± 0.794
PheCys: 0.436 ± 0.453
PheAsp: 2.618 ± 0.609
PheGlu: 3.054 ± 1.813
PhePhe: 1.745 ± 0.608
PheGly: 2.182 ± 1.563
PheHis: 2.182 ± 0.735
PheIle: 3.054 ± 1.011
PheLys: 3.054 ± 1.419
PheLeu: 4.363 ± 0.56
PheMet: 0.873 ± 0.386
PheAsn: 4.363 ± 1.384
PhePro: 1.309 ± 0.462
PheGln: 1.309 ± 0.358
PheArg: 1.309 ± 0.511
PheSer: 3.49 ± 0.683
PheThr: 1.745 ± 0.2
PheVal: 1.309 ± 0.358
PheTrp: 1.309 ± 0.668
PheTyr: 2.182 ± 0.996
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.49 ± 1.442
GlyCys: 0.436 ± 0.444
GlyAsp: 4.363 ± 0.773
GlyGlu: 3.49 ± 0.698
GlyPhe: 0.0 ± 0.0
GlyGly: 6.545 ± 2.656
GlyHis: 1.309 ± 0.653
GlyIle: 2.618 ± 0.961
GlyLys: 0.873 ± 0.386
GlyLeu: 3.927 ± 0.462
GlyMet: 0.873 ± 0.691
GlyAsn: 3.927 ± 0.74
GlyPro: 2.182 ± 0.366
GlyGln: 1.309 ± 0.74
GlyArg: 6.545 ± 2.18
GlySer: 6.545 ± 1.961
GlyThr: 3.927 ± 1.892
GlyVal: 4.363 ± 1.762
GlyTrp: 0.436 ± 0.444
GlyTyr: 2.182 ± 0.577
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.436 ± 0.453
HisCys: 1.745 ± 0.622
HisAsp: 1.745 ± 0.741
HisGlu: 1.745 ± 0.671
HisPhe: 2.618 ± 0.701
HisGly: 0.873 ± 0.487
HisHis: 0.0 ± 0.0
HisIle: 1.745 ± 0.2
HisLys: 0.436 ± 0.3
HisLeu: 0.873 ± 0.569
HisMet: 0.873 ± 0.386
HisAsn: 1.309 ± 0.572
HisPro: 0.436 ± 0.345
HisGln: 2.182 ± 0.686
HisArg: 0.436 ± 0.444
HisSer: 1.745 ± 0.727
HisThr: 1.309 ± 0.69
HisVal: 0.873 ± 0.484
HisTrp: 0.873 ± 0.487
HisTyr: 0.436 ± 0.366
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.054 ± 0.919
IleCys: 1.309 ± 0.74
IleAsp: 3.054 ± 1.038
IleGlu: 5.672 ± 1.436
IlePhe: 1.745 ± 1.003
IleGly: 3.49 ± 1.59
IleHis: 0.873 ± 0.602
IleIle: 1.745 ± 0.538
IleLys: 1.309 ± 0.938
IleLeu: 3.054 ± 0.763
IleMet: 0.0 ± 0.0
IleAsn: 4.799 ± 2.142
IlePro: 2.618 ± 0.606
IleGln: 2.618 ± 0.937
IleArg: 1.745 ± 1.307
IleSer: 5.236 ± 1.213
IleThr: 3.49 ± 1.004
IleVal: 3.927 ± 1.286
IleTrp: 0.436 ± 0.453
IleTyr: 2.182 ± 0.949
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.054 ± 1.069
LysCys: 2.182 ± 0.855
LysAsp: 2.182 ± 0.901
LysGlu: 2.618 ± 1.09
LysPhe: 2.182 ± 1.034
LysGly: 0.436 ± 0.453
LysHis: 3.054 ± 1.177
LysIle: 2.618 ± 0.678
LysLys: 1.309 ± 0.462
LysLeu: 3.054 ± 0.931
LysMet: 0.436 ± 0.3
LysAsn: 2.618 ± 0.915
LysPro: 1.745 ± 0.727
LysGln: 1.309 ± 0.789
LysArg: 5.236 ± 1.45
LysSer: 3.49 ± 0.972
LysThr: 4.363 ± 0.748
LysVal: 2.618 ± 0.688
LysTrp: 0.436 ± 0.3
LysTyr: 1.745 ± 0.727
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.672 ± 1.498
LeuCys: 1.309 ± 0.511
LeuAsp: 3.927 ± 0.512
LeuGlu: 5.672 ± 0.834
LeuPhe: 3.927 ± 0.913
LeuGly: 3.49 ± 1.555
LeuHis: 2.182 ± 0.647
LeuIle: 6.108 ± 1.96
LeuLys: 5.672 ± 1.341
LeuLeu: 9.162 ± 2.5
LeuMet: 1.309 ± 0.619
LeuAsn: 2.182 ± 0.898
LeuPro: 1.745 ± 0.973
LeuGln: 3.49 ± 0.98
LeuArg: 4.799 ± 0.894
LeuSer: 7.853 ± 1.26
LeuThr: 4.363 ± 0.612
LeuVal: 3.49 ± 1.177
LeuTrp: 1.309 ± 0.364
LeuTyr: 3.49 ± 0.815
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.745 ± 0.519
MetCys: 0.873 ± 0.482
MetAsp: 0.0 ± 0.0
MetGlu: 1.745 ± 0.707
MetPhe: 0.873 ± 0.906
MetGly: 1.309 ± 0.668
MetHis: 0.873 ± 0.386
MetIle: 1.309 ± 0.703
MetLys: 0.873 ± 0.482
MetLeu: 2.182 ± 0.901
MetMet: 0.436 ± 0.345
MetAsn: 0.436 ± 0.345
MetPro: 0.873 ± 0.599
MetGln: 0.436 ± 0.345
MetArg: 0.436 ± 0.3
MetSer: 0.873 ± 0.691
MetThr: 1.309 ± 0.462
MetVal: 1.309 ± 0.572
MetTrp: 0.0 ± 0.0
MetTyr: 1.309 ± 0.668
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.054 ± 0.914
AsnCys: 0.0 ± 0.0
AsnAsp: 2.182 ± 0.84
AsnGlu: 3.054 ± 1.4
AsnPhe: 2.618 ± 0.701
AsnGly: 1.309 ± 0.364
AsnHis: 1.745 ± 0.816
AsnIle: 2.182 ± 0.751
AsnLys: 4.363 ± 1.5
AsnLeu: 4.363 ± 1.521
AsnMet: 0.873 ± 0.599
AsnAsn: 2.618 ± 1.196
AsnPro: 3.49 ± 0.67
AsnGln: 2.618 ± 1.336
AsnArg: 2.618 ± 0.642
AsnSer: 4.799 ± 1.329
AsnThr: 2.618 ± 1.294
AsnVal: 2.618 ± 0.451
AsnTrp: 1.309 ± 0.672
AsnTyr: 1.745 ± 0.568
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.49 ± 0.702
ProCys: 0.436 ± 0.3
ProAsp: 3.927 ± 1.077
ProGlu: 3.927 ± 0.917
ProPhe: 2.182 ± 0.738
ProGly: 1.745 ± 1.199
ProHis: 0.0 ± 0.0
ProIle: 3.054 ± 1.167
ProLys: 3.054 ± 0.693
ProLeu: 4.799 ± 1.4
ProMet: 0.436 ± 0.366
ProAsn: 2.182 ± 0.654
ProPro: 4.799 ± 1.149
ProGln: 2.618 ± 0.896
ProArg: 2.182 ± 1.349
ProSer: 6.981 ± 2.101
ProThr: 3.054 ± 2.019
ProVal: 3.054 ± 0.896
ProTrp: 0.873 ± 0.889
ProTyr: 1.745 ± 0.862
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.182 ± 0.585
GlnCys: 0.436 ± 0.528
GlnAsp: 0.873 ± 0.386
GlnGlu: 3.054 ± 1.11
GlnPhe: 1.309 ± 0.668
GlnGly: 3.054 ± 1.011
GlnHis: 1.309 ± 0.598
GlnIle: 2.618 ± 1.062
GlnLys: 0.873 ± 0.386
GlnLeu: 3.49 ± 0.751
GlnMet: 1.309 ± 0.789
GlnAsn: 3.49 ± 0.67
GlnPro: 4.363 ± 0.781
GlnGln: 3.054 ± 0.894
GlnArg: 1.745 ± 0.982
GlnSer: 0.873 ± 0.431
GlnThr: 2.618 ± 1.147
GlnVal: 3.927 ± 0.995
GlnTrp: 0.436 ± 0.3
GlnTyr: 0.873 ± 0.691
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.182 ± 1.113
ArgCys: 2.618 ± 1.773
ArgAsp: 0.873 ± 0.732
ArgGlu: 3.054 ± 0.722
ArgPhe: 3.927 ± 0.611
ArgGly: 4.799 ± 1.55
ArgHis: 2.182 ± 1.176
ArgIle: 1.745 ± 0.973
ArgLys: 4.799 ± 1.149
ArgLeu: 7.417 ± 1.445
ArgMet: 1.309 ± 0.88
ArgAsn: 2.182 ± 0.616
ArgPro: 3.49 ± 1.929
ArgGln: 1.745 ± 0.707
ArgArg: 6.545 ± 3.201
ArgSer: 5.236 ± 1.779
ArgThr: 2.182 ± 0.912
ArgVal: 1.745 ± 1.06
ArgTrp: 0.873 ± 0.543
ArgTyr: 2.182 ± 1.219
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.927 ± 0.725
SerCys: 0.873 ± 0.569
SerAsp: 6.108 ± 1.743
SerGlu: 5.236 ± 1.344
SerPhe: 3.054 ± 1.003
SerGly: 6.108 ± 1.314
SerHis: 1.745 ± 0.597
SerIle: 2.618 ± 0.829
SerLys: 0.873 ± 0.691
SerLeu: 4.799 ± 1.154
SerMet: 2.618 ± 0.948
SerAsn: 3.054 ± 1.038
SerPro: 5.236 ± 0.35
SerGln: 5.672 ± 0.761
SerArg: 6.108 ± 1.876
SerSer: 7.853 ± 3.16
SerThr: 8.29 ± 2.266
SerVal: 6.981 ± 1.446
SerTrp: 0.436 ± 0.444
SerTyr: 1.745 ± 0.909
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.363 ± 0.868
ThrCys: 2.618 ± 0.451
ThrAsp: 1.745 ± 0.862
ThrGlu: 5.236 ± 0.902
ThrPhe: 2.618 ± 0.704
ThrGly: 5.236 ± 1.178
ThrHis: 0.436 ± 0.453
ThrIle: 4.799 ± 2.575
ThrLys: 1.745 ± 0.519
ThrLeu: 6.545 ± 0.925
ThrMet: 0.0 ± 0.0
ThrAsn: 5.236 ± 0.932
ThrPro: 3.927 ± 1.893
ThrGln: 2.618 ± 0.699
ThrArg: 1.745 ± 0.602
ThrSer: 6.981 ± 2.073
ThrThr: 3.49 ± 1.938
ThrVal: 6.981 ± 1.75
ThrTrp: 0.436 ± 0.444
ThrTyr: 1.745 ± 0.973
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.927 ± 0.622
ValCys: 0.436 ± 0.528
ValAsp: 4.363 ± 0.898
ValGlu: 5.236 ± 0.855
ValPhe: 2.618 ± 0.98
ValGly: 3.49 ± 1.41
ValHis: 1.309 ± 0.364
ValIle: 3.49 ± 1.151
ValLys: 3.054 ± 0.853
ValLeu: 2.618 ± 1.216
ValMet: 1.745 ± 0.809
ValAsn: 2.182 ± 0.84
ValPro: 3.054 ± 1.076
ValGln: 2.182 ± 0.912
ValArg: 4.799 ± 1.154
ValSer: 6.981 ± 1.024
ValThr: 3.49 ± 1.55
ValVal: 1.745 ± 1.024
ValTrp: 0.873 ± 0.691
ValTyr: 2.182 ± 1.07
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.873 ± 0.484
TrpCys: 0.436 ± 0.444
TrpAsp: 0.873 ± 0.482
TrpGlu: 0.873 ± 0.889
TrpPhe: 1.309 ± 0.598
TrpGly: 1.309 ± 0.717
TrpHis: 0.0 ± 0.0
TrpIle: 0.436 ± 0.3
TrpLys: 2.618 ± 1.128
TrpLeu: 1.745 ± 0.864
TrpMet: 0.0 ± 0.0
TrpAsn: 0.436 ± 0.345
TrpPro: 0.0 ± 0.0
TrpGln: 0.436 ± 0.366
TrpArg: 0.873 ± 0.376
TrpSer: 0.436 ± 0.444
TrpThr: 1.309 ± 0.462
TrpVal: 1.309 ± 0.668
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.436 ± 0.3
TyrCys: 0.0 ± 0.0
TyrAsp: 2.618 ± 0.451
TyrGlu: 2.182 ± 0.949
TyrPhe: 2.618 ± 0.898
TyrGly: 3.054 ± 0.925
TyrHis: 0.436 ± 0.345
TyrIle: 0.873 ± 0.599
TyrLys: 2.182 ± 0.934
TyrLeu: 2.618 ± 1.023
TyrMet: 0.436 ± 0.558
TyrAsn: 0.873 ± 0.376
TyrPro: 2.182 ± 1.131
TyrGln: 1.745 ± 0.969
TyrArg: 3.054 ± 1.068
TyrSer: 1.309 ± 0.646
TyrThr: 2.618 ± 0.484
TyrVal: 1.309 ± 0.382
TyrTrp: 1.745 ± 0.509
TyrTyr: 3.49 ± 0.647
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2293 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
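To make the idea of a protein-level bootstrap concrete, here is a minimal Python sketch. It assumes that whole proteins are resampled with replacement 100 times, that the dipeptide frequency is recomputed for each resample, and that the reported error is the standard deviation of those estimates. The page does not spell out these details, and the function names and toy sequences below are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides in one sequence (one-letter codes)."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def per_mille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over a set of proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(pair_counts(seq))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the same count.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy sequences, not the six proteins of this proteome.
toy = ["MAAKLL", "MLLAA", "MKAAL", "MAKLA"]
err = bootstrap_error(toy, "AA")
print(round(per_mille(toy, "AA"), 3), "+/-", round(err, 3))
```

With this scheme a dipeptide that never occurs in any protein always gets an error of 0.0, because resampling proteins cannot create it. This matches the 0.0 ± 0.0 entries in the table.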

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski