Amino acid dipeptide frequency for Bovine papillomavirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides were distributed uniformly at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red in the table, and underrepresented ones are marked in blue.
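As an illustration of the counting described above, here is a minimal Python sketch. It is not the code used to build this page: the function name `dipeptide_frequencies`, the toy sequences, and the choice to normalize by the total number of dipeptides are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# Three-letter codes for the 20 standard amino acids; anything else becomes Xaa.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}


def dipeptide_frequencies(sequences):
    """Return dipeptide frequencies in per mille over all given sequences.

    Dipeptides are overlapping adjacent pairs within one protein; pairs never
    span two proteins. Normalizing by the total pair count is an assumption.
    """
    counts = Counter()
    for seq in sequences:
        codes = [ONE_TO_THREE.get(aa, "Xaa") for aa in seq.upper()]
        counts.update(a + b for a, b in zip(codes, codes[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences, not the real proteome.
freqs = dipeptide_frequencies(["MAAGL", "AAXK"])

# Number of ordered pairs of standard residues and the uniform expectation.
n_standard = len(list(product(ONE_TO_THREE.values(), repeat=2)))  # 400
uniform_per_mille = 1000.0 / n_standard  # 2.5 per mille, i.e. 0.25 %
```

Comparing each entry of `freqs` with `uniform_per_mille` gives the kind of over/under-representation that the colors in the table are meant to convey.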

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
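The unit conversion can be sketched in a few lines of Python. The helper names are hypothetical; the AlaAla value is taken from the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to percent (1 % equals 10 per mille)."""
    return value_per_mille / 10.0


ala_ala = 3.004  # AlaAla entry from the table, in per mille
fraction = per_mille_to_fraction(ala_ala)
percent = per_mille_to_percent(ala_ala)

# The 0.25 % uniform expectation expressed in the table's unit.
uniform_per_mille = 0.25 * 10
```

Expressed this way, the uniform expectation of 0.25% corresponds to 2.5‰, which is the number to compare table entries against.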

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.004 ± 1.119
AlaCys: 1.717 ± 0.788
AlaAsp: 3.433 ± 1.095
AlaGlu: 4.721 ± 1.998
AlaPhe: 2.575 ± 0.636
AlaGly: 6.009 ± 1.529
AlaHis: 1.288 ± 0.707
AlaIle: 0.858 ± 0.461
AlaLys: 3.863 ± 1.757
AlaLeu: 5.15 ± 2.291
AlaMet: 0.429 ± 0.436
AlaAsn: 2.146 ± 0.524
AlaPro: 3.004 ± 0.564
AlaGln: 5.15 ± 1.407
AlaArg: 3.863 ± 2.232
AlaSer: 2.146 ± 1.682
AlaThr: 2.575 ± 1.045
AlaVal: 2.575 ± 1.325
AlaTrp: 2.575 ± 1.323
AlaTyr: 0.858 ± 0.662
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.858 ± 0.439
CysCys: 0.858 ± 0.989
CysAsp: 1.288 ± 1.247
CysGlu: 0.0 ± 0.0
CysPhe: 1.288 ± 0.846
CysGly: 1.717 ± 2.181
CysHis: 0.429 ± 0.331
CysIle: 0.429 ± 0.642
CysLys: 1.717 ± 0.641
CysLeu: 0.858 ± 0.648
CysMet: 0.429 ± 0.331
CysAsn: 1.288 ± 0.927
CysPro: 2.575 ± 0.551
CysGln: 1.288 ± 0.808
CysArg: 0.858 ± 0.989
CysSer: 1.288 ± 0.618
CysThr: 2.575 ± 0.866
CysVal: 2.146 ± 1.209
CysTrp: 0.429 ± 0.331
CysTyr: 1.717 ± 1.265
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.433 ± 0.73
AspCys: 0.429 ± 0.642
AspAsp: 3.433 ± 1.503
AspGlu: 5.15 ± 0.975
AspPhe: 1.717 ± 0.706
AspGly: 4.292 ± 1.62
AspHis: 1.717 ± 0.796
AspIle: 6.009 ± 1.004
AspLys: 1.717 ± 0.665
AspLeu: 5.579 ± 1.855
AspMet: 0.429 ± 0.331
AspAsn: 2.146 ± 1.001
AspPro: 6.867 ± 1.381
AspGln: 0.429 ± 0.331
AspArg: 2.575 ± 1.213
AspSer: 7.296 ± 0.573
AspThr: 3.004 ± 1.276
AspVal: 6.009 ± 3.277
AspTrp: 1.717 ± 0.624
AspTyr: 2.146 ± 1.066
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.575 ± 0.543
GluCys: 3.004 ± 1.539
GluAsp: 5.15 ± 1.215
GluGlu: 6.867 ± 3.584
GluPhe: 1.717 ± 0.706
GluGly: 3.004 ± 1.456
GluHis: 0.858 ± 0.656
GluIle: 2.146 ± 0.819
GluLys: 2.575 ± 1.094
GluLeu: 4.721 ± 1.132
GluMet: 1.288 ± 0.674
GluAsn: 2.575 ± 0.991
GluPro: 3.433 ± 1.27
GluGln: 3.433 ± 1.499
GluArg: 2.575 ± 1.029
GluSer: 6.009 ± 2.627
GluThr: 5.579 ± 1.436
GluVal: 3.433 ± 0.662
GluTrp: 0.429 ± 0.331
GluTyr: 2.146 ± 0.715
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.0 ± 0.0
PheCys: 1.288 ± 0.618
PheAsp: 3.004 ± 0.564
PheGlu: 1.717 ± 0.651
PhePhe: 1.717 ± 0.878
PheGly: 0.858 ± 0.478
PheHis: 0.858 ± 0.758
PheIle: 1.717 ± 0.632
PheLys: 2.575 ± 1.348
PheLeu: 3.004 ± 1.033
PheMet: 0.858 ± 0.662
PheAsn: 3.433 ± 0.822
PhePro: 1.288 ± 0.761
PheGln: 2.146 ± 0.819
PheArg: 1.717 ± 0.754
PheSer: 3.004 ± 0.922
PheThr: 1.717 ± 0.957
PheVal: 2.146 ± 1.163
PheTrp: 0.858 ± 0.439
PheTyr: 2.575 ± 1.349
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.863 ± 1.052
GlyCys: 2.146 ± 1.084
GlyAsp: 7.725 ± 1.781
GlyGlu: 2.146 ± 0.768
GlyPhe: 0.858 ± 0.434
GlyGly: 6.867 ± 2.398
GlyHis: 2.575 ± 0.797
GlyIle: 3.004 ± 0.647
GlyLys: 1.288 ± 0.761
GlyLeu: 4.721 ± 1.356
GlyMet: 0.0 ± 0.0
GlyAsn: 6.438 ± 2.141
GlyPro: 2.575 ± 1.554
GlyGln: 2.575 ± 0.696
GlyArg: 6.867 ± 2.421
GlySer: 4.292 ± 1.404
GlyThr: 3.004 ± 0.87
GlyVal: 3.863 ± 1.326
GlyTrp: 0.429 ± 0.642
GlyTyr: 1.288 ± 0.696
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.429 ± 0.331
HisCys: 1.717 ± 1.033
HisAsp: 0.429 ± 0.331
HisGlu: 0.429 ± 0.436
HisPhe: 1.717 ± 0.603
HisGly: 0.858 ± 0.461
HisHis: 0.429 ± 0.436
HisIle: 0.0 ± 0.0
HisLys: 0.429 ± 0.331
HisLeu: 2.146 ± 0.789
HisMet: 0.0 ± 0.0
HisAsn: 0.429 ± 0.642
HisPro: 1.717 ± 0.631
HisGln: 1.717 ± 0.894
HisArg: 0.0 ± 0.0
HisSer: 2.146 ± 0.845
HisThr: 0.429 ± 0.642
HisVal: 2.146 ± 0.852
HisTrp: 1.288 ± 0.707
HisTyr: 2.146 ± 1.08
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.288 ± 0.795
IleCys: 0.858 ± 0.648
IleAsp: 5.579 ± 1.708
IleGlu: 3.433 ± 1.212
IlePhe: 0.429 ± 0.642
IleGly: 3.433 ± 1.093
IleHis: 1.288 ± 0.353
IleIle: 1.717 ± 0.728
IleLys: 2.575 ± 1.495
IleLeu: 4.292 ± 1.261
IleMet: 1.288 ± 0.632
IleAsn: 1.717 ± 0.728
IlePro: 4.292 ± 0.622
IleGln: 0.858 ± 0.766
IleArg: 2.146 ± 0.807
IleSer: 3.433 ± 0.983
IleThr: 2.575 ± 0.991
IleVal: 2.146 ± 1.024
IleTrp: 0.429 ± 0.396
IleTyr: 2.146 ± 0.762
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.433 ± 1.945
LysCys: 1.288 ± 0.451
LysAsp: 2.146 ± 1.084
LysGlu: 1.717 ± 0.771
LysPhe: 3.433 ± 1.378
LysGly: 1.717 ± 1.295
LysHis: 2.146 ± 1.675
LysIle: 2.575 ± 0.679
LysLys: 2.146 ± 0.817
LysLeu: 4.292 ± 1.419
LysMet: 0.429 ± 0.534
LysAsn: 1.717 ± 0.771
LysPro: 1.717 ± 0.615
LysGln: 4.292 ± 2.586
LysArg: 5.579 ± 1.557
LysSer: 2.575 ± 1.246
LysThr: 2.575 ± 0.543
LysVal: 2.146 ± 0.67
LysTrp: 0.0 ± 0.0
LysTyr: 2.146 ± 0.501
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.721 ± 1.147
LeuCys: 1.288 ± 1.572
LeuAsp: 5.579 ± 1.158
LeuGlu: 6.009 ± 1.499
LeuPhe: 3.004 ± 1.185
LeuGly: 7.296 ± 2.327
LeuHis: 1.288 ± 0.688
LeuIle: 2.146 ± 1.656
LeuLys: 5.579 ± 1.136
LeuLeu: 7.725 ± 2.96
LeuMet: 1.717 ± 0.882
LeuAsn: 2.575 ± 1.298
LeuPro: 0.858 ± 0.425
LeuGln: 5.579 ± 1.305
LeuArg: 4.721 ± 1.147
LeuSer: 6.009 ± 1.764
LeuThr: 6.438 ± 0.88
LeuVal: 4.292 ± 2.126
LeuTrp: 1.717 ± 0.687
LeuTyr: 3.863 ± 0.696
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.717 ± 0.878
MetCys: 0.429 ± 0.681
MetAsp: 1.717 ± 0.788
MetGlu: 0.429 ± 0.436
MetPhe: 1.288 ± 0.683
MetGly: 0.858 ± 0.434
MetHis: 0.429 ± 0.436
MetIle: 0.429 ± 0.331
MetLys: 0.858 ± 0.662
MetLeu: 1.717 ± 1.033
MetMet: 0.858 ± 0.745
MetAsn: 0.0 ± 0.0
MetPro: 0.0 ± 0.0
MetGln: 0.858 ± 0.662
MetArg: 0.858 ± 0.662
MetSer: 0.858 ± 0.434
MetThr: 1.288 ± 0.683
MetVal: 1.717 ± 0.957
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.433 ± 0.949
AsnCys: 0.858 ± 0.766
AsnAsp: 2.146 ± 0.996
AsnGlu: 1.717 ± 0.979
AsnPhe: 1.288 ± 0.761
AsnGly: 2.575 ± 0.864
AsnHis: 0.0 ± 0.0
AsnIle: 1.288 ± 0.401
AsnLys: 5.15 ± 2.164
AsnLeu: 4.292 ± 1.879
AsnMet: 2.146 ± 1.049
AsnAsn: 1.717 ± 0.875
AsnPro: 1.717 ± 0.728
AsnGln: 3.433 ± 1.945
AsnArg: 2.146 ± 0.804
AsnSer: 2.575 ± 1.192
AsnThr: 4.292 ± 1.962
AsnVal: 2.575 ± 1.124
AsnTrp: 0.858 ± 0.439
AsnTyr: 0.858 ± 0.461
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.292 ± 1.359
ProCys: 0.858 ± 0.461
ProAsp: 5.15 ± 2.073
ProGlu: 5.579 ± 0.728
ProPhe: 0.429 ± 0.396
ProGly: 3.004 ± 2.198
ProHis: 1.288 ± 1.057
ProIle: 3.863 ± 1.051
ProLys: 2.575 ± 0.951
ProLeu: 4.292 ± 0.622
ProMet: 0.858 ± 0.662
ProAsn: 3.433 ± 1.587
ProPro: 4.292 ± 1.153
ProGln: 1.288 ± 0.453
ProArg: 3.433 ± 1.676
ProSer: 3.433 ± 2.323
ProThr: 3.863 ± 1.631
ProVal: 2.575 ± 0.488
ProTrp: 0.0 ± 0.0
ProTyr: 3.433 ± 1.024
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.292 ± 1.194
GlnCys: 1.288 ± 0.683
GlnAsp: 1.288 ± 0.761
GlnGlu: 4.721 ± 1.374
GlnPhe: 1.717 ± 0.603
GlnGly: 0.858 ± 0.439
GlnHis: 0.858 ± 0.872
GlnIle: 1.717 ± 0.665
GlnLys: 0.429 ± 0.331
GlnLeu: 6.009 ± 1.333
GlnMet: 0.858 ± 0.439
GlnAsn: 2.575 ± 0.666
GlnPro: 3.004 ± 1.119
GlnGln: 3.863 ± 1.353
GlnArg: 3.433 ± 1.808
GlnSer: 3.004 ± 0.591
GlnThr: 3.433 ± 0.88
GlnVal: 4.721 ± 0.796
GlnTrp: 1.717 ± 0.771
GlnTyr: 2.575 ± 1.385
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.433 ± 0.87
ArgCys: 1.717 ± 0.796
ArgAsp: 2.575 ± 1.017
ArgGlu: 3.004 ± 1.43
ArgPhe: 2.146 ± 0.477
ArgGly: 5.579 ± 2.441
ArgHis: 1.717 ± 0.817
ArgIle: 3.433 ± 1.092
ArgLys: 4.721 ± 1.333
ArgLeu: 6.438 ± 0.757
ArgMet: 0.858 ± 0.439
ArgAsn: 2.146 ± 1.162
ArgPro: 2.146 ± 1.979
ArgGln: 3.433 ± 2.442
ArgArg: 9.871 ± 2.334
ArgSer: 3.004 ± 1.24
ArgThr: 2.146 ± 0.397
ArgVal: 4.721 ± 1.191
ArgTrp: 0.858 ± 0.648
ArgTyr: 2.146 ± 0.595
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.004 ± 1.298
SerCys: 0.429 ± 0.642
SerAsp: 3.863 ± 1.624
SerGlu: 2.575 ± 0.86
SerPhe: 2.146 ± 0.971
SerGly: 5.579 ± 1.426
SerHis: 1.717 ± 0.743
SerIle: 6.009 ± 1.873
SerLys: 1.717 ± 0.537
SerLeu: 4.721 ± 0.728
SerMet: 0.858 ± 0.648
SerAsn: 4.292 ± 1.248
SerPro: 5.579 ± 1.245
SerGln: 3.863 ± 1.514
SerArg: 6.009 ± 0.815
SerSer: 3.863 ± 0.968
SerThr: 6.009 ± 2.896
SerVal: 3.863 ± 0.797
SerTrp: 0.0 ± 0.0
SerTyr: 1.288 ± 1.036
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.292 ± 0.788
ThrCys: 2.146 ± 0.903
ThrAsp: 3.433 ± 0.88
ThrGlu: 6.009 ± 1.942
ThrPhe: 3.004 ± 1.442
ThrGly: 3.004 ± 1.087
ThrHis: 0.0 ± 0.0
ThrIle: 3.433 ± 2.787
ThrLys: 0.429 ± 0.372
ThrLeu: 3.433 ± 0.692
ThrMet: 1.288 ± 0.994
ThrAsn: 1.288 ± 0.744
ThrPro: 5.579 ± 1.748
ThrGln: 3.004 ± 0.853
ThrArg: 3.863 ± 1.582
ThrSer: 5.15 ± 1.839
ThrThr: 4.292 ± 0.914
ThrVal: 7.296 ± 1.309
ThrTrp: 0.429 ± 0.436
ThrTyr: 2.575 ± 0.555
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.721 ± 1.556
ValCys: 0.858 ± 0.648
ValAsp: 3.863 ± 0.78
ValGlu: 4.721 ± 1.505
ValPhe: 3.004 ± 0.758
ValGly: 3.433 ± 1.762
ValHis: 1.717 ± 0.592
ValIle: 2.146 ± 0.457
ValLys: 3.433 ± 1.103
ValLeu: 4.292 ± 1.918
ValMet: 0.429 ± 0.396
ValAsn: 3.433 ± 1.302
ValPro: 4.721 ± 1.6
ValGln: 1.717 ± 0.592
ValArg: 3.004 ± 1.091
ValSer: 6.009 ± 1.663
ValThr: 6.009 ± 1.359
ValVal: 3.004 ± 1.146
ValTrp: 1.717 ± 1.26
ValTyr: 1.717 ± 1.133
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.717 ± 0.743
TrpCys: 0.0 ± 0.0
TrpAsp: 1.288 ± 0.451
TrpGlu: 0.858 ± 0.758
TrpPhe: 0.858 ± 0.461
TrpGly: 2.146 ± 0.976
TrpHis: 0.0 ± 0.0
TrpIle: 1.717 ± 0.771
TrpLys: 2.146 ± 1.421
TrpLeu: 1.288 ± 0.674
TrpMet: 0.429 ± 0.436
TrpAsn: 0.429 ± 0.372
TrpPro: 0.429 ± 0.642
TrpGln: 0.429 ± 0.436
TrpArg: 0.858 ± 0.656
TrpSer: 0.0 ± 0.0
TrpThr: 0.429 ± 0.436
TrpVal: 1.288 ± 0.683
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.429 ± 0.331
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.433 ± 1.691
TyrCys: 0.858 ± 0.67
TyrAsp: 3.004 ± 0.575
TyrGlu: 1.717 ± 0.973
TyrPhe: 2.146 ± 0.846
TyrGly: 3.433 ± 1.102
TyrHis: 0.0 ± 0.0
TyrIle: 1.288 ± 0.401
TyrLys: 2.146 ± 0.847
TyrLeu: 3.433 ± 1.071
TyrMet: 0.429 ± 0.416
TyrAsn: 1.288 ± 0.761
TyrPro: 2.146 ± 0.868
TyrGln: 3.004 ± 1.453
TyrArg: 1.717 ± 1.255
TyrSer: 1.288 ± 0.696
TyrThr: 1.717 ± 0.592
TyrVal: 1.288 ± 0.451
TyrTrp: 1.288 ± 0.683
TyrTyr: 3.004 ± 1.584
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
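For readers who want to work with the listed values programmatically, the following Python sketch parses entries of the form "ArgArg: 9.871 ± 2.334" (an optional leading copy of the value is tolerated) and labels each one relative to a baseline. The names `parse_entries` and `classify` are hypothetical, and using the uniform expectation of 2.5‰ as the baseline is an assumption; the exact threshold behind the red/blue marking is not stated on this page.

```python
import re

# Matches "ArgArg: 9.871 ± 2.334", with or without a leading copy of the value.
ENTRY = re.compile(
    r"^\s*(?:\d+(?:\.\d+)?)?([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*"
    r"(\d+(?:\.\d+)?)\s*±\s*(\d+(?:\.\d+)?)\s*$"
)


def parse_entries(lines):
    """Map (first, second) residue codes to (value, error), both in per mille."""
    table = {}
    for line in lines:
        match = ENTRY.match(line)
        if match:  # row labels such as "Arg" do not match and are skipped
            first, second, value, error = match.groups()
            table[(first, second)] = (float(value), float(error))
    return table


def classify(value, baseline=2.5):
    """Label a value relative to a baseline in per mille (assumed: 1/400)."""
    if value > baseline:
        return "over"
    if value < baseline:
        return "under"
    return "expected"


# Three lines copied from the Arg row of the table.
sample = ["Arg", "ArgArg: 9.871 ± 2.334", "ArgTrp: 0.858 ± 0.648"]
table = parse_entries(sample)
labels = {pair: classify(value) for pair, (value, _) in table.items()}
```

With this baseline, ArgArg (9.871‰) is labeled "over" and ArgTrp (0.858‰) "under". A stricter rule could also take the ± error into account before calling a dipeptide over- or underrepresented.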
Statistics based on 6 proteins (2331 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
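A protein-level bootstrap can be sketched as follows. This is only an illustration under stated assumptions, not the database's actual procedure: the function names, the toy sequences, the use of one-letter codes, and the choice of the population standard deviation as the error measure are all hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(sequences, pair):
    """Frequency of one dipeptide (two one-letter codes) in per mille."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        resampled = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_per_mille(resampled, pair))
    return statistics.pstdev(estimates)


# Hypothetical toy proteome, not the real sequences.
proteins = ["MAAGLRR", "AARRK", "MKRRAA"]
error = bootstrap_error(proteins, "RR")
```

Because whole proteins are resampled rather than individual residues, a proteome with only a few proteins (6 here) can produce errors that are large relative to the values themselves.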

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski