Amino acid dipepetide frequency for Bos taurus papillomavirus 12

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.153AlaAla: 4.153 ± 1.44
0.831AlaCys: 0.831 ± 1.134
5.399AlaAsp: 5.399 ± 0.866
4.153AlaGlu: 4.153 ± 0.748
3.322AlaPhe: 3.322 ± 1.189
3.738AlaGly: 3.738 ± 1.086
0.831AlaHis: 0.831 ± 0.677
2.076AlaIle: 2.076 ± 0.735
3.738AlaLys: 3.738 ± 1.371
3.322AlaLeu: 3.322 ± 2.08
2.076AlaMet: 2.076 ± 0.75
1.246AlaAsn: 1.246 ± 0.673
4.153AlaPro: 4.153 ± 1.186
2.907AlaGln: 2.907 ± 0.88
3.322AlaArg: 3.322 ± 2.011
3.322AlaSer: 3.322 ± 1.049
4.983AlaThr: 4.983 ± 1.006
3.322AlaVal: 3.322 ± 1.448
0.0AlaTrp: 0.0 ± 0.0
1.661AlaTyr: 1.661 ± 0.561
0.0AlaXaa: 0.0 ± 0.0
Cys
0.831CysAla: 0.831 ± 0.849
0.831CysCys: 0.831 ± 0.638
1.246CysAsp: 1.246 ± 0.612
0.831CysGlu: 0.831 ± 0.371
0.415CysPhe: 0.415 ± 0.339
0.415CysGly: 0.415 ± 0.567
0.415CysHis: 0.415 ± 0.567
0.831CysIle: 0.831 ± 0.575
0.831CysLys: 0.831 ± 0.704
1.661CysLeu: 1.661 ± 0.759
0.415CysMet: 0.415 ± 0.36
0.831CysAsn: 0.831 ± 0.393
2.492CysPro: 2.492 ± 1.386
0.415CysGln: 0.415 ± 0.36
1.246CysArg: 1.246 ± 0.693
1.246CysSer: 1.246 ± 0.849
1.661CysThr: 1.661 ± 0.566
1.661CysVal: 1.661 ± 1.125
0.415CysTrp: 0.415 ± 0.333
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
5.399AspAla: 5.399 ± 1.239
0.831AspCys: 0.831 ± 0.581
5.814AspAsp: 5.814 ± 1.418
4.153AspGlu: 4.153 ± 1.129
4.568AspPhe: 4.568 ± 1.087
3.738AspGly: 3.738 ± 1.111
1.661AspHis: 1.661 ± 1.64
5.399AspIle: 5.399 ± 1.855
1.246AspLys: 1.246 ± 0.757
2.907AspLeu: 2.907 ± 0.452
0.0AspMet: 0.0 ± 0.0
2.907AspAsn: 2.907 ± 0.697
5.399AspPro: 5.399 ± 1.21
3.322AspGln: 3.322 ± 1.006
2.492AspArg: 2.492 ± 0.9
5.814AspSer: 5.814 ± 1.548
3.738AspThr: 3.738 ± 1.012
3.738AspVal: 3.738 ± 0.606
0.415AspTrp: 0.415 ± 0.339
1.246AspTyr: 1.246 ± 0.639
0.0AspXaa: 0.0 ± 0.0
Glu
2.492GluAla: 2.492 ± 0.705
1.661GluCys: 1.661 ± 1.125
4.568GluAsp: 4.568 ± 1.199
7.89GluGlu: 7.89 ± 1.673
2.492GluPhe: 2.492 ± 0.85
2.907GluGly: 2.907 ± 0.807
1.661GluHis: 1.661 ± 0.885
2.492GluIle: 2.492 ± 0.999
0.831GluLys: 0.831 ± 0.677
4.983GluLeu: 4.983 ± 2.328
2.492GluMet: 2.492 ± 0.576
3.738GluAsn: 3.738 ± 0.763
3.738GluPro: 3.738 ± 1.21
3.322GluGln: 3.322 ± 0.927
3.322GluArg: 3.322 ± 0.881
4.983GluSer: 4.983 ± 1.34
3.738GluThr: 3.738 ± 1.068
3.322GluVal: 3.322 ± 1.209
0.831GluTrp: 0.831 ± 0.371
0.415GluTyr: 0.415 ± 0.36
0.0GluXaa: 0.0 ± 0.0
Phe
2.076PheAla: 2.076 ± 0.876
1.246PheCys: 1.246 ± 1.134
1.661PheAsp: 1.661 ± 0.887
4.153PheGlu: 4.153 ± 0.977
1.246PhePhe: 1.246 ± 0.627
2.076PheGly: 2.076 ± 0.497
0.0PheHis: 0.0 ± 0.0
2.907PheIle: 2.907 ± 0.936
3.322PheLys: 3.322 ± 1.6
5.814PheLeu: 5.814 ± 1.664
0.831PheMet: 0.831 ± 0.413
2.076PheAsn: 2.076 ± 0.763
2.492PhePro: 2.492 ± 0.836
2.492PheGln: 2.492 ± 1.01
2.492PheArg: 2.492 ± 0.7
4.153PheSer: 4.153 ± 0.775
1.661PheThr: 1.661 ± 0.955
2.907PheVal: 2.907 ± 0.931
1.661PheTrp: 1.661 ± 0.742
0.831PheTyr: 0.831 ± 0.413
0.0PheXaa: 0.0 ± 0.0
Gly
2.076GlyAla: 2.076 ± 0.497
1.246GlyCys: 1.246 ± 0.342
2.492GlyAsp: 2.492 ± 1.066
4.153GlyGlu: 4.153 ± 0.792
2.492GlyPhe: 2.492 ± 0.953
5.399GlyGly: 5.399 ± 2.742
2.076GlyHis: 2.076 ± 0.712
4.568GlyIle: 4.568 ± 0.684
1.661GlyLys: 1.661 ± 0.541
5.399GlyLeu: 5.399 ± 2.226
0.0GlyMet: 0.0 ± 0.0
4.153GlyAsn: 4.153 ± 1.684
6.229GlyPro: 6.229 ± 2.462
1.246GlyGln: 1.246 ± 0.49
4.153GlyArg: 4.153 ± 1.628
6.645GlySer: 6.645 ± 0.934
5.814GlyThr: 5.814 ± 1.726
2.907GlyVal: 2.907 ± 1.477
0.0GlyTrp: 0.0 ± 0.0
1.661GlyTyr: 1.661 ± 0.742
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.0HisCys: 0.0 ± 0.0
1.246HisAsp: 1.246 ± 1.135
0.415HisGlu: 0.415 ± 0.36
1.246HisPhe: 1.246 ± 0.673
0.831HisGly: 0.831 ± 0.371
0.415HisHis: 0.415 ± 0.322
1.661HisIle: 1.661 ± 0.614
1.661HisLys: 1.661 ± 0.806
2.076HisLeu: 2.076 ± 0.839
0.0HisMet: 0.0 ± 0.0
0.415HisAsn: 0.415 ± 0.322
2.076HisPro: 2.076 ± 0.833
0.415HisGln: 0.415 ± 0.339
1.246HisArg: 1.246 ± 1.103
1.661HisSer: 1.661 ± 0.786
0.415HisThr: 0.415 ± 0.333
0.831HisVal: 0.831 ± 0.413
1.246HisTrp: 1.246 ± 0.6
1.246HisTyr: 1.246 ± 0.662
0.0HisXaa: 0.0 ± 0.0
Ile
3.738IleAla: 3.738 ± 1.459
0.0IleCys: 0.0 ± 0.0
1.661IleAsp: 1.661 ± 0.742
1.661IleGlu: 1.661 ± 0.686
1.661IlePhe: 1.661 ± 0.741
2.907IleGly: 2.907 ± 1.46
0.415IleHis: 0.415 ± 0.322
2.076IleIle: 2.076 ± 1.359
1.661IleLys: 1.661 ± 0.541
4.983IleLeu: 4.983 ± 2.029
1.661IleMet: 1.661 ± 1.615
1.246IleAsn: 1.246 ± 0.342
2.907IlePro: 2.907 ± 0.546
2.076IleGln: 2.076 ± 0.637
2.076IleArg: 2.076 ± 0.839
3.322IleSer: 3.322 ± 1.321
2.076IleThr: 2.076 ± 1.213
3.738IleVal: 3.738 ± 1.293
0.0IleTrp: 0.0 ± 0.0
3.738IleTyr: 3.738 ± 0.948
0.0IleXaa: 0.0 ± 0.0
Lys
2.907LysAla: 2.907 ± 0.706
0.831LysCys: 0.831 ± 0.638
2.076LysAsp: 2.076 ± 0.628
2.492LysGlu: 2.492 ± 1.794
2.492LysPhe: 2.492 ± 0.775
2.907LysGly: 2.907 ± 0.982
1.246LysHis: 1.246 ± 0.627
2.076LysIle: 2.076 ± 0.898
3.738LysLys: 3.738 ± 1.096
4.568LysLeu: 4.568 ± 1.792
1.661LysMet: 1.661 ± 0.564
2.907LysAsn: 2.907 ± 1.022
1.661LysPro: 1.661 ± 0.938
1.661LysGln: 1.661 ± 0.859
3.738LysArg: 3.738 ± 1.417
2.907LysSer: 2.907 ± 1.481
1.246LysThr: 1.246 ± 0.392
2.076LysVal: 2.076 ± 0.964
0.415LysTrp: 0.415 ± 0.322
2.907LysTyr: 2.907 ± 0.913
0.0LysXaa: 0.0 ± 0.0
Leu
4.568LeuAla: 4.568 ± 1.316
1.661LeuCys: 1.661 ± 0.513
8.721LeuAsp: 8.721 ± 1.234
5.399LeuGlu: 5.399 ± 1.045
4.568LeuPhe: 4.568 ± 1.539
5.399LeuGly: 5.399 ± 1.313
2.076LeuHis: 2.076 ± 1.174
2.492LeuIle: 2.492 ± 1.778
4.153LeuLys: 4.153 ± 1.455
10.797LeuLeu: 10.797 ± 4.342
2.907LeuMet: 2.907 ± 0.829
4.983LeuAsn: 4.983 ± 1.37
2.907LeuPro: 2.907 ± 0.687
7.475LeuGln: 7.475 ± 2.549
2.492LeuArg: 2.492 ± 0.911
4.153LeuSer: 4.153 ± 1.02
5.814LeuThr: 5.814 ± 1.799
6.229LeuVal: 6.229 ± 1.328
2.492LeuTrp: 2.492 ± 1.243
2.492LeuTyr: 2.492 ± 0.777
0.0LeuXaa: 0.0 ± 0.0
Met
0.831MetAla: 0.831 ± 0.677
0.831MetCys: 0.831 ± 0.704
0.831MetAsp: 0.831 ± 0.466
0.831MetGlu: 0.831 ± 0.688
2.076MetPhe: 2.076 ± 0.8
1.246MetGly: 1.246 ± 0.693
0.415MetHis: 0.415 ± 0.36
0.415MetIle: 0.415 ± 0.333
0.415MetLys: 0.415 ± 0.339
0.415MetLeu: 0.415 ± 0.36
0.0MetMet: 0.0 ± 0.0
0.831MetAsn: 0.831 ± 0.666
0.0MetPro: 0.0 ± 0.0
2.076MetGln: 2.076 ± 0.714
1.661MetArg: 1.661 ± 0.904
1.661MetSer: 1.661 ± 0.566
0.831MetThr: 0.831 ± 0.393
1.246MetVal: 1.246 ± 0.633
0.0MetTrp: 0.0 ± 0.0
0.831MetTyr: 0.831 ± 0.575
0.0MetXaa: 0.0 ± 0.0
Asn
2.076AsnAla: 2.076 ± 1.267
0.415AsnCys: 0.415 ± 0.339
2.492AsnAsp: 2.492 ± 0.535
2.907AsnGlu: 2.907 ± 1.573
1.246AsnPhe: 1.246 ± 1.0
3.322AsnGly: 3.322 ± 0.811
0.0AsnHis: 0.0 ± 0.0
2.076AsnIle: 2.076 ± 0.861
2.076AsnLys: 2.076 ± 0.524
3.322AsnLeu: 3.322 ± 1.326
0.415AsnMet: 0.415 ± 0.333
1.661AsnAsn: 1.661 ± 0.613
2.907AsnPro: 2.907 ± 0.833
3.322AsnGln: 3.322 ± 1.328
3.322AsnArg: 3.322 ± 1.095
3.738AsnSer: 3.738 ± 1.096
3.738AsnThr: 3.738 ± 0.598
4.153AsnVal: 4.153 ± 0.944
1.246AsnTrp: 1.246 ± 0.627
0.831AsnTyr: 0.831 ± 0.575
0.0AsnXaa: 0.0 ± 0.0
Pro
5.399ProAla: 5.399 ± 1.134
0.831ProCys: 0.831 ± 0.666
4.153ProAsp: 4.153 ± 1.24
4.983ProGlu: 4.983 ± 0.731
2.076ProPhe: 2.076 ± 0.835
6.229ProGly: 6.229 ± 2.855
0.0ProHis: 0.0 ± 0.0
1.661ProIle: 1.661 ± 0.541
2.907ProLys: 2.907 ± 1.188
6.229ProLeu: 6.229 ± 1.333
0.0ProMet: 0.0 ± 0.0
4.568ProAsn: 4.568 ± 1.367
8.306ProPro: 8.306 ± 1.295
0.831ProGln: 0.831 ± 0.466
2.907ProArg: 2.907 ± 1.474
4.153ProSer: 4.153 ± 0.801
3.322ProThr: 3.322 ± 1.033
4.153ProVal: 4.153 ± 1.313
0.415ProTrp: 0.415 ± 0.797
2.076ProTyr: 2.076 ± 1.262
0.0ProXaa: 0.0 ± 0.0
Gln
4.153GlnAla: 4.153 ± 1.408
0.831GlnCys: 0.831 ± 0.709
2.492GlnAsp: 2.492 ± 1.624
2.492GlnGlu: 2.492 ± 1.055
1.661GlnPhe: 1.661 ± 0.763
2.492GlnGly: 2.492 ± 0.785
0.0GlnHis: 0.0 ± 0.0
1.661GlnIle: 1.661 ± 0.885
1.246GlnLys: 1.246 ± 0.929
5.814GlnLeu: 5.814 ± 0.971
1.246GlnMet: 1.246 ± 0.619
1.661GlnAsn: 1.661 ± 0.5
3.322GlnPro: 3.322 ± 1.19
2.907GlnGln: 2.907 ± 1.202
4.983GlnArg: 4.983 ± 1.211
0.831GlnSer: 0.831 ± 0.393
4.568GlnThr: 4.568 ± 0.823
3.738GlnVal: 3.738 ± 0.949
0.831GlnTrp: 0.831 ± 0.393
0.831GlnTyr: 0.831 ± 0.466
0.0GlnXaa: 0.0 ± 0.0
Arg
2.492ArgAla: 2.492 ± 0.522
1.246ArgCys: 1.246 ± 0.627
0.831ArgAsp: 0.831 ± 0.371
2.907ArgGlu: 2.907 ± 1.449
1.661ArgPhe: 1.661 ± 0.755
6.229ArgGly: 6.229 ± 1.324
2.907ArgHis: 2.907 ± 1.221
1.246ArgIle: 1.246 ± 0.824
3.322ArgLys: 3.322 ± 0.421
7.06ArgLeu: 7.06 ± 1.357
0.415ArgMet: 0.415 ± 0.36
2.492ArgAsn: 2.492 ± 0.868
3.322ArgPro: 3.322 ± 1.12
3.322ArgGln: 3.322 ± 1.604
8.306ArgArg: 8.306 ± 3.736
6.645ArgSer: 6.645 ± 3.186
5.399ArgThr: 5.399 ± 1.558
2.907ArgVal: 2.907 ± 1.036
0.831ArgTrp: 0.831 ± 0.444
0.831ArgTyr: 0.831 ± 0.371
0.0ArgXaa: 0.0 ± 0.0
Ser
3.738SerAla: 3.738 ± 1.733
1.246SerCys: 1.246 ± 0.392
5.814SerAsp: 5.814 ± 0.575
4.983SerGlu: 4.983 ± 1.263
5.399SerPhe: 5.399 ± 1.132
7.06SerGly: 7.06 ± 1.585
0.831SerHis: 0.831 ± 0.371
0.831SerIle: 0.831 ± 0.601
3.322SerLys: 3.322 ± 1.297
6.645SerLeu: 6.645 ± 1.182
1.246SerMet: 1.246 ± 0.627
2.076SerAsn: 2.076 ± 0.981
2.076SerPro: 2.076 ± 0.749
2.076SerGln: 2.076 ± 0.813
7.06SerArg: 7.06 ± 2.538
4.153SerSer: 4.153 ± 1.528
5.399SerThr: 5.399 ± 1.872
4.153SerVal: 4.153 ± 0.854
1.246SerTrp: 1.246 ± 0.639
1.246SerTyr: 1.246 ± 0.693
0.0SerXaa: 0.0 ± 0.0
Thr
3.738ThrAla: 3.738 ± 0.597
1.246ThrCys: 1.246 ± 0.673
5.399ThrAsp: 5.399 ± 1.324
2.907ThrGlu: 2.907 ± 0.91
4.153ThrPhe: 4.153 ± 1.715
3.738ThrGly: 3.738 ± 1.094
0.831ThrHis: 0.831 ± 0.677
3.738ThrIle: 3.738 ± 1.468
2.907ThrLys: 2.907 ± 1.129
6.229ThrLeu: 6.229 ± 1.482
0.415ThrMet: 0.415 ± 0.333
2.907ThrAsn: 2.907 ± 0.871
4.983ThrPro: 4.983 ± 1.64
1.246ThrGln: 1.246 ± 0.375
4.568ThrArg: 4.568 ± 0.891
4.568ThrSer: 4.568 ± 1.814
4.153ThrThr: 4.153 ± 2.142
4.983ThrVal: 4.983 ± 1.349
0.415ThrTrp: 0.415 ± 0.339
2.076ThrTyr: 2.076 ± 0.982
0.0ThrXaa: 0.0 ± 0.0
Val
4.568ValAla: 4.568 ± 1.879
1.661ValCys: 1.661 ± 0.775
4.153ValAsp: 4.153 ± 0.869
2.492ValGlu: 2.492 ± 0.9
1.661ValPhe: 1.661 ± 0.541
1.246ValGly: 1.246 ± 0.619
2.492ValHis: 2.492 ± 1.11
3.738ValIle: 3.738 ± 1.305
4.153ValLys: 4.153 ± 0.856
5.399ValLeu: 5.399 ± 2.868
0.415ValMet: 0.415 ± 0.339
2.076ValAsn: 2.076 ± 0.838
4.568ValPro: 4.568 ± 1.3
5.399ValGln: 5.399 ± 1.018
3.738ValArg: 3.738 ± 1.428
4.983ValSer: 4.983 ± 1.091
3.738ValThr: 3.738 ± 0.858
2.076ValVal: 2.076 ± 0.847
0.831ValTrp: 0.831 ± 0.466
3.738ValTyr: 3.738 ± 1.197
0.0ValXaa: 0.0 ± 0.0
Trp
0.831TrpAla: 0.831 ± 0.371
0.831TrpCys: 0.831 ± 1.292
1.661TrpAsp: 1.661 ± 0.739
1.246TrpGlu: 1.246 ± 0.49
0.0TrpPhe: 0.0 ± 0.0
0.415TrpGly: 0.415 ± 0.339
0.415TrpHis: 0.415 ± 0.36
1.246TrpIle: 1.246 ± 0.627
2.076TrpLys: 2.076 ± 0.846
0.831TrpLeu: 0.831 ± 0.371
0.0TrpMet: 0.0 ± 0.0
0.831TrpAsn: 0.831 ± 0.666
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
0.831TrpSer: 0.831 ± 0.371
0.831TrpThr: 0.831 ± 0.72
1.661TrpVal: 1.661 ± 0.789
0.415TrpTrp: 0.415 ± 0.339
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.492TyrAla: 2.492 ± 1.155
0.415TyrCys: 0.415 ± 0.339
2.076TyrAsp: 2.076 ± 0.427
0.831TyrGlu: 0.831 ± 0.444
1.661TyrPhe: 1.661 ± 0.686
2.076TyrGly: 2.076 ± 0.674
0.415TyrHis: 0.415 ± 0.333
0.831TyrIle: 0.831 ± 0.371
1.246TyrLys: 1.246 ± 0.627
3.322TyrLeu: 3.322 ± 1.016
0.831TyrMet: 0.831 ± 0.551
1.661TyrAsn: 1.661 ± 0.986
1.661TyrPro: 1.661 ± 0.761
1.246TyrGln: 1.246 ± 0.693
1.246TyrArg: 1.246 ± 0.49
0.831TyrSer: 0.831 ± 0.72
2.076TyrThr: 2.076 ± 1.246
3.322TyrVal: 3.322 ± 1.048
0.415TyrTrp: 0.415 ± 0.333
2.907TyrTyr: 2.907 ± 1.382
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (2409 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski