Amino acid dipepetide frequency for Bos taurus papillomavirus 23

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.494AlaAla: 6.494 ± 1.927
1.299AlaCys: 1.299 ± 0.891
1.732AlaAsp: 1.732 ± 0.851
4.329AlaGlu: 4.329 ± 1.384
4.329AlaPhe: 4.329 ± 1.695
0.866AlaGly: 0.866 ± 0.467
0.433AlaHis: 0.433 ± 0.34
0.866AlaIle: 0.866 ± 0.452
3.896AlaLys: 3.896 ± 1.478
3.463AlaLeu: 3.463 ± 0.543
0.433AlaMet: 0.433 ± 0.4
2.597AlaAsn: 2.597 ± 0.564
5.195AlaPro: 5.195 ± 0.638
3.463AlaGln: 3.463 ± 1.228
3.896AlaArg: 3.896 ± 0.592
4.762AlaSer: 4.762 ± 1.145
2.165AlaThr: 2.165 ± 1.041
5.628AlaVal: 5.628 ± 1.607
0.0AlaTrp: 0.0 ± 0.0
2.165AlaTyr: 2.165 ± 0.675
0.0AlaXaa: 0.0 ± 0.0
Cys
1.299CysAla: 1.299 ± 0.874
0.866CysCys: 0.866 ± 0.704
1.732CysAsp: 1.732 ± 0.856
1.299CysGlu: 1.299 ± 0.874
1.299CysPhe: 1.299 ± 1.056
0.433CysGly: 0.433 ± 0.34
0.0CysHis: 0.0 ± 0.0
1.732CysIle: 1.732 ± 0.686
0.433CysLys: 0.433 ± 0.352
0.0CysLeu: 0.0 ± 0.0
0.866CysMet: 0.866 ± 0.452
0.433CysAsn: 0.433 ± 0.556
1.299CysPro: 1.299 ± 0.673
0.0CysGln: 0.0 ± 0.0
0.866CysArg: 0.866 ± 0.427
1.732CysSer: 1.732 ± 1.167
1.299CysThr: 1.299 ± 0.418
0.866CysVal: 0.866 ± 0.64
0.866CysTrp: 0.866 ± 0.462
0.433CysTyr: 0.433 ± 0.556
0.0CysXaa: 0.0 ± 0.0
Asp
4.329AspAla: 4.329 ± 1.281
0.866AspCys: 0.866 ± 0.396
5.195AspAsp: 5.195 ± 1.443
3.896AspGlu: 3.896 ± 1.102
3.03AspPhe: 3.03 ± 1.289
3.463AspGly: 3.463 ± 0.629
1.299AspHis: 1.299 ± 0.97
3.463AspIle: 3.463 ± 1.362
0.866AspLys: 0.866 ± 0.704
5.195AspLeu: 5.195 ± 1.534
0.433AspMet: 0.433 ± 0.352
2.597AspAsn: 2.597 ± 0.995
3.896AspPro: 3.896 ± 1.54
2.597AspGln: 2.597 ± 1.297
3.896AspArg: 3.896 ± 2.59
4.762AspSer: 4.762 ± 0.824
4.329AspThr: 4.329 ± 0.913
3.463AspVal: 3.463 ± 0.543
0.433AspTrp: 0.433 ± 0.352
2.165AspTyr: 2.165 ± 0.594
0.0AspXaa: 0.0 ± 0.0
Glu
3.463GluAla: 3.463 ± 0.769
1.299GluCys: 1.299 ± 1.056
6.926GluAsp: 6.926 ± 1.745
6.061GluGlu: 6.061 ± 2.555
3.463GluPhe: 3.463 ± 1.071
3.463GluGly: 3.463 ± 1.653
0.866GluHis: 0.866 ± 0.599
3.896GluIle: 3.896 ± 1.332
1.732GluLys: 1.732 ± 0.546
2.597GluLeu: 2.597 ± 1.417
1.732GluMet: 1.732 ± 0.542
4.329GluAsn: 4.329 ± 0.701
2.597GluPro: 2.597 ± 1.135
3.896GluGln: 3.896 ± 1.004
3.896GluArg: 3.896 ± 1.242
3.896GluSer: 3.896 ± 1.433
3.896GluThr: 3.896 ± 1.11
5.628GluVal: 5.628 ± 1.953
0.433GluTrp: 0.433 ± 0.34
0.433GluTyr: 0.433 ± 0.4
0.0GluXaa: 0.0 ± 0.0
Phe
1.732PheAla: 1.732 ± 1.018
0.433PheCys: 0.433 ± 0.556
2.165PheAsp: 2.165 ± 0.547
2.597PheGlu: 2.597 ± 0.863
1.732PhePhe: 1.732 ± 0.488
3.463PheGly: 3.463 ± 1.594
0.866PheHis: 0.866 ± 0.642
2.165PheIle: 2.165 ± 0.632
3.463PheLys: 3.463 ± 1.298
6.494PheLeu: 6.494 ± 1.245
0.866PheMet: 0.866 ± 0.367
2.165PheAsn: 2.165 ± 0.632
2.597PhePro: 2.597 ± 0.675
2.597PheGln: 2.597 ± 0.93
2.165PheArg: 2.165 ± 0.381
2.597PheSer: 2.597 ± 0.504
1.732PheThr: 1.732 ± 0.681
2.597PheVal: 2.597 ± 0.901
2.165PheTrp: 2.165 ± 1.02
0.866PheTyr: 0.866 ± 0.679
0.0PheXaa: 0.0 ± 0.0
Gly
3.896GlyAla: 3.896 ± 0.922
1.732GlyCys: 1.732 ± 0.542
2.597GlyAsp: 2.597 ± 1.443
5.195GlyGlu: 5.195 ± 1.674
1.299GlyPhe: 1.299 ± 0.338
8.658GlyGly: 8.658 ± 2.804
2.597GlyHis: 2.597 ± 0.932
5.195GlyIle: 5.195 ± 0.982
3.896GlyLys: 3.896 ± 0.855
3.463GlyLeu: 3.463 ± 0.685
0.866GlyMet: 0.866 ± 0.757
2.165GlyAsn: 2.165 ± 0.899
4.762GlyPro: 4.762 ± 1.278
4.329GlyGln: 4.329 ± 1.326
5.628GlyArg: 5.628 ± 2.089
6.926GlySer: 6.926 ± 0.899
4.329GlyThr: 4.329 ± 1.207
2.597GlyVal: 2.597 ± 0.805
0.0GlyTrp: 0.0 ± 0.0
1.732GlyTyr: 1.732 ± 0.542
0.0GlyXaa: 0.0 ± 0.0
His
0.433HisAla: 0.433 ± 0.352
0.433HisCys: 0.433 ± 0.352
1.732HisAsp: 1.732 ± 0.977
0.0HisGlu: 0.0 ± 0.0
0.433HisPhe: 0.433 ± 0.352
2.165HisGly: 2.165 ± 0.399
0.433HisHis: 0.433 ± 0.556
0.0HisIle: 0.0 ± 0.0
2.165HisLys: 2.165 ± 1.087
2.165HisLeu: 2.165 ± 0.959
0.0HisMet: 0.0 ± 0.0
0.433HisAsn: 0.433 ± 0.509
2.597HisPro: 2.597 ± 1.282
0.433HisGln: 0.433 ± 0.352
1.299HisArg: 1.299 ± 0.586
2.165HisSer: 2.165 ± 0.849
1.299HisThr: 1.299 ± 0.795
2.165HisVal: 2.165 ± 0.547
0.866HisTrp: 0.866 ± 0.427
0.866HisTyr: 0.866 ± 0.427
0.0HisXaa: 0.0 ± 0.0
Ile
3.03IleAla: 3.03 ± 0.787
1.299IleCys: 1.299 ± 0.649
3.463IleAsp: 3.463 ± 1.653
4.329IleGlu: 4.329 ± 1.148
3.03IlePhe: 3.03 ± 0.673
3.03IleGly: 3.03 ± 1.091
0.866IleHis: 0.866 ± 0.427
2.597IleIle: 2.597 ± 0.901
2.597IleLys: 2.597 ± 0.555
5.628IleLeu: 5.628 ± 1.36
1.732IleMet: 1.732 ± 0.963
1.299IleAsn: 1.299 ± 0.656
1.732IlePro: 1.732 ± 0.871
0.866IleGln: 0.866 ± 0.679
0.866IleArg: 0.866 ± 0.396
2.165IleSer: 2.165 ± 1.45
2.597IleThr: 2.597 ± 1.2
2.597IleVal: 2.597 ± 0.961
0.0IleTrp: 0.0 ± 0.0
2.165IleTyr: 2.165 ± 1.05
0.0IleXaa: 0.0 ± 0.0
Lys
3.463LysAla: 3.463 ± 0.978
0.866LysCys: 0.866 ± 0.523
2.165LysAsp: 2.165 ± 0.917
2.165LysGlu: 2.165 ± 1.115
2.597LysPhe: 2.597 ± 1.212
4.762LysGly: 4.762 ± 1.136
3.03LysHis: 3.03 ± 1.026
0.433LysIle: 0.433 ± 0.352
2.597LysLys: 2.597 ± 1.01
4.762LysLeu: 4.762 ± 0.392
1.299LysMet: 1.299 ± 0.795
1.299LysAsn: 1.299 ± 0.338
2.165LysPro: 2.165 ± 0.64
0.866LysGln: 0.866 ± 0.396
3.463LysArg: 3.463 ± 0.932
4.762LysSer: 4.762 ± 1.888
3.463LysThr: 3.463 ± 0.685
1.299LysVal: 1.299 ± 0.418
1.299LysTrp: 1.299 ± 0.545
2.165LysTyr: 2.165 ± 0.632
0.0LysXaa: 0.0 ± 0.0
Leu
4.762LeuAla: 4.762 ± 1.186
0.866LeuCys: 0.866 ± 0.661
5.628LeuAsp: 5.628 ± 1.987
6.061LeuGlu: 6.061 ± 1.712
3.896LeuPhe: 3.896 ± 0.656
5.195LeuGly: 5.195 ± 1.695
2.165LeuHis: 2.165 ± 0.873
3.463LeuIle: 3.463 ± 1.049
5.628LeuLys: 5.628 ± 1.337
8.225LeuLeu: 8.225 ± 2.311
2.165LeuMet: 2.165 ± 0.957
2.165LeuAsn: 2.165 ± 0.706
4.329LeuPro: 4.329 ± 1.692
6.494LeuGln: 6.494 ± 2.118
5.628LeuArg: 5.628 ± 1.291
5.628LeuSer: 5.628 ± 2.398
2.597LeuThr: 2.597 ± 0.439
4.762LeuVal: 4.762 ± 1.658
1.299LeuTrp: 1.299 ± 0.795
3.896LeuTyr: 3.896 ± 1.327
0.0LeuXaa: 0.0 ± 0.0
Met
0.433MetAla: 0.433 ± 0.34
0.866MetCys: 0.866 ± 0.396
0.866MetAsp: 0.866 ± 0.396
0.866MetGlu: 0.866 ± 0.801
0.866MetPhe: 0.866 ± 0.396
1.299MetGly: 1.299 ± 0.418
0.433MetHis: 0.433 ± 0.352
0.433MetIle: 0.433 ± 0.378
0.433MetLys: 0.433 ± 0.556
1.299MetLeu: 1.299 ± 0.704
0.433MetMet: 0.433 ± 0.509
0.866MetAsn: 0.866 ± 0.679
0.0MetPro: 0.0 ± 0.0
1.732MetGln: 1.732 ± 0.778
0.866MetArg: 0.866 ± 0.704
2.165MetSer: 2.165 ± 0.917
1.299MetThr: 1.299 ± 0.389
1.299MetVal: 1.299 ± 0.647
0.0MetTrp: 0.0 ± 0.0
0.433MetTyr: 0.433 ± 0.34
0.0MetXaa: 0.0 ± 0.0
Asn
3.03AsnAla: 3.03 ± 1.714
0.866AsnCys: 0.866 ± 0.523
1.299AsnAsp: 1.299 ± 0.389
4.329AsnGlu: 4.329 ± 1.822
2.165AsnPhe: 2.165 ± 0.709
2.597AsnGly: 2.597 ± 0.951
0.0AsnHis: 0.0 ± 0.0
1.732AsnIle: 1.732 ± 0.542
2.597AsnLys: 2.597 ± 0.434
3.463AsnLeu: 3.463 ± 1.86
0.866AsnMet: 0.866 ± 0.679
3.463AsnAsn: 3.463 ± 1.998
3.896AsnPro: 3.896 ± 1.079
1.732AsnGln: 1.732 ± 0.957
3.03AsnArg: 3.03 ± 1.063
3.896AsnSer: 3.896 ± 1.033
3.463AsnThr: 3.463 ± 1.05
2.165AsnVal: 2.165 ± 0.653
0.866AsnTrp: 0.866 ± 0.396
0.433AsnTyr: 0.433 ± 0.509
0.0AsnXaa: 0.0 ± 0.0
Pro
3.896ProAla: 3.896 ± 0.532
0.433ProCys: 0.433 ± 0.34
4.329ProAsp: 4.329 ± 2.061
5.628ProGlu: 5.628 ± 2.354
1.732ProPhe: 1.732 ± 0.686
5.628ProGly: 5.628 ± 1.719
1.299ProHis: 1.299 ± 0.442
2.597ProIle: 2.597 ± 1.458
3.896ProLys: 3.896 ± 1.224
7.359ProLeu: 7.359 ± 2.295
0.433ProMet: 0.433 ± 0.352
4.329ProAsn: 4.329 ± 1.428
8.658ProPro: 8.658 ± 2.017
1.299ProGln: 1.299 ± 0.913
0.866ProArg: 0.866 ± 0.64
5.628ProSer: 5.628 ± 1.478
4.762ProThr: 4.762 ± 1.705
2.165ProVal: 2.165 ± 0.547
0.0ProTrp: 0.0 ± 0.0
1.732ProTyr: 1.732 ± 1.176
0.0ProXaa: 0.0 ± 0.0
Gln
2.165GlnAla: 2.165 ± 0.807
0.433GlnCys: 0.433 ± 0.352
3.463GlnAsp: 3.463 ± 1.348
1.732GlnGlu: 1.732 ± 0.62
0.866GlnPhe: 0.866 ± 0.661
4.329GlnGly: 4.329 ± 1.491
0.866GlnHis: 0.866 ± 0.462
3.463GlnIle: 3.463 ± 1.228
0.433GlnLys: 0.433 ± 0.4
3.896GlnLeu: 3.896 ± 0.937
1.732GlnMet: 1.732 ± 0.78
3.03GlnAsn: 3.03 ± 0.904
2.165GlnPro: 2.165 ± 0.706
3.03GlnGln: 3.03 ± 0.529
3.463GlnArg: 3.463 ± 0.637
3.463GlnSer: 3.463 ± 1.049
2.597GlnThr: 2.597 ± 0.601
3.03GlnVal: 3.03 ± 0.529
0.866GlnTrp: 0.866 ± 0.523
0.866GlnTyr: 0.866 ± 0.396
0.0GlnXaa: 0.0 ± 0.0
Arg
3.463ArgAla: 3.463 ± 1.228
1.299ArgCys: 1.299 ± 0.668
2.165ArgAsp: 2.165 ± 1.175
3.896ArgGlu: 3.896 ± 1.023
2.165ArgPhe: 2.165 ± 0.86
5.628ArgGly: 5.628 ± 1.878
2.597ArgHis: 2.597 ± 0.707
2.597ArgIle: 2.597 ± 0.656
3.03ArgLys: 3.03 ± 0.788
7.359ArgLeu: 7.359 ± 1.453
0.433ArgMet: 0.433 ± 0.4
3.896ArgAsn: 3.896 ± 1.115
4.329ArgPro: 4.329 ± 1.865
1.299ArgGln: 1.299 ± 0.778
6.926ArgArg: 6.926 ± 2.433
5.195ArgSer: 5.195 ± 2.843
5.628ArgThr: 5.628 ± 1.871
2.597ArgVal: 2.597 ± 0.504
0.0ArgTrp: 0.0 ± 0.0
1.299ArgTyr: 1.299 ± 0.418
0.0ArgXaa: 0.0 ± 0.0
Ser
3.463SerAla: 3.463 ± 1.283
0.866SerCys: 0.866 ± 0.642
4.329SerAsp: 4.329 ± 1.089
3.896SerGlu: 3.896 ± 1.264
5.628SerPhe: 5.628 ± 1.059
7.359SerGly: 7.359 ± 1.379
1.299SerHis: 1.299 ± 0.704
2.597SerIle: 2.597 ± 1.305
4.329SerLys: 4.329 ± 1.151
8.225SerLeu: 8.225 ± 1.488
2.165SerMet: 2.165 ± 1.093
1.299SerAsn: 1.299 ± 1.056
5.628SerPro: 5.628 ± 1.631
3.896SerGln: 3.896 ± 1.314
6.926SerArg: 6.926 ± 2.599
7.359SerSer: 7.359 ± 2.536
4.762SerThr: 4.762 ± 2.164
2.597SerVal: 2.597 ± 0.504
1.732SerTrp: 1.732 ± 0.577
1.299SerTyr: 1.299 ± 0.778
0.0SerXaa: 0.0 ± 0.0
Thr
3.463ThrAla: 3.463 ± 1.195
2.165ThrCys: 2.165 ± 1.539
3.463ThrAsp: 3.463 ± 0.801
3.03ThrGlu: 3.03 ± 1.047
3.03ThrPhe: 3.03 ± 0.4
4.329ThrGly: 4.329 ± 1.325
0.866ThrHis: 0.866 ± 0.396
3.03ThrIle: 3.03 ± 0.846
1.732ThrLys: 1.732 ± 0.542
4.762ThrLeu: 4.762 ± 1.45
0.0ThrMet: 0.0 ± 0.0
2.165ThrAsn: 2.165 ± 0.899
6.926ThrPro: 6.926 ± 2.803
0.866ThrGln: 0.866 ± 0.679
6.061ThrArg: 6.061 ± 1.87
5.628ThrSer: 5.628 ± 1.983
1.732ThrThr: 1.732 ± 0.792
5.195ThrVal: 5.195 ± 1.441
1.299ThrTrp: 1.299 ± 0.778
2.165ThrTyr: 2.165 ± 0.917
0.0ThrXaa: 0.0 ± 0.0
Val
2.165ValAla: 2.165 ± 0.714
0.866ValCys: 0.866 ± 1.112
4.329ValAsp: 4.329 ± 0.599
1.732ValGlu: 1.732 ± 1.319
1.732ValPhe: 1.732 ± 0.583
2.597ValGly: 2.597 ± 0.434
1.299ValHis: 1.299 ± 0.732
2.597ValIle: 2.597 ± 1.007
3.03ValLys: 3.03 ± 0.765
2.597ValLeu: 2.597 ± 0.564
0.0ValMet: 0.0 ± 0.0
3.896ValAsn: 3.896 ± 0.68
3.03ValPro: 3.03 ± 0.854
4.329ValGln: 4.329 ± 1.222
4.762ValArg: 4.762 ± 2.402
5.195ValSer: 5.195 ± 1.28
6.494ValThr: 6.494 ± 2.091
1.299ValVal: 1.299 ± 0.857
0.433ValTrp: 0.433 ± 0.34
2.165ValTyr: 2.165 ± 0.414
0.0ValXaa: 0.0 ± 0.0
Trp
0.866TrpAla: 0.866 ± 0.452
0.0TrpCys: 0.0 ± 0.0
0.866TrpAsp: 0.866 ± 0.599
2.165TrpGlu: 2.165 ± 0.781
0.433TrpPhe: 0.433 ± 0.378
0.866TrpGly: 0.866 ± 0.462
0.433TrpHis: 0.433 ± 0.4
1.299TrpIle: 1.299 ± 0.668
1.299TrpLys: 1.299 ± 1.056
0.866TrpLeu: 0.866 ± 0.396
0.0TrpMet: 0.0 ± 0.0
0.866TrpAsn: 0.866 ± 0.679
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
0.866TrpSer: 0.866 ± 0.462
1.299TrpThr: 1.299 ± 0.778
0.866TrpVal: 0.866 ± 0.396
0.433TrpTrp: 0.433 ± 0.352
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.165TyrAla: 2.165 ± 0.793
0.0TyrCys: 0.0 ± 0.0
1.732TyrAsp: 1.732 ± 0.683
1.299TyrGlu: 1.299 ± 0.75
1.732TyrPhe: 1.732 ± 0.855
1.732TyrGly: 1.732 ± 0.686
0.433TyrHis: 0.433 ± 0.34
1.732TyrIle: 1.732 ± 0.963
0.866TyrLys: 0.866 ± 0.704
3.03TyrLeu: 3.03 ± 1.599
0.0TyrMet: 0.0 ± 0.0
2.597TyrAsn: 2.597 ± 0.439
0.866TyrPro: 0.866 ± 0.462
2.165TyrGln: 2.165 ± 0.801
1.299TyrArg: 1.299 ± 0.645
1.299TyrSer: 1.299 ± 0.938
2.165TyrThr: 2.165 ± 0.82
1.732TyrVal: 1.732 ± 0.555
0.433TyrTrp: 0.433 ± 0.4
2.597TyrTyr: 2.597 ± 1.466
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2311 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski