Amino acid dipepetide frequency for Bos taurus papillomavirus 16

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.674AlaAla: 2.674 ± 1.259
2.674AlaCys: 2.674 ± 1.699
3.438AlaAsp: 3.438 ± 1.384
3.438AlaGlu: 3.438 ± 1.129
1.91AlaPhe: 1.91 ± 0.751
3.82AlaGly: 3.82 ± 1.238
1.146AlaHis: 1.146 ± 0.526
3.056AlaIle: 3.056 ± 0.52
1.528AlaLys: 1.528 ± 0.688
2.674AlaLeu: 2.674 ± 1.274
1.146AlaMet: 1.146 ± 0.713
3.056AlaAsn: 3.056 ± 0.803
4.966AlaPro: 4.966 ± 1.115
1.91AlaGln: 1.91 ± 0.787
3.438AlaArg: 3.438 ± 1.588
4.202AlaSer: 4.202 ± 1.184
3.438AlaThr: 3.438 ± 0.612
2.292AlaVal: 2.292 ± 0.966
2.292AlaTrp: 2.292 ± 0.74
1.91AlaTyr: 1.91 ± 0.94
0.0AlaXaa: 0.0 ± 0.0
Cys
1.528CysAla: 1.528 ± 0.61
1.91CysCys: 1.91 ± 1.179
1.91CysAsp: 1.91 ± 0.86
1.146CysGlu: 1.146 ± 0.964
0.764CysPhe: 0.764 ± 0.521
1.528CysGly: 1.528 ± 1.604
0.382CysHis: 0.382 ± 0.453
2.674CysIle: 2.674 ± 0.826
1.528CysLys: 1.528 ± 0.709
1.528CysLeu: 1.528 ± 0.589
0.0CysMet: 0.0 ± 0.0
1.146CysAsn: 1.146 ± 0.681
1.91CysPro: 1.91 ± 0.755
1.91CysGln: 1.91 ± 1.112
0.764CysArg: 0.764 ± 0.557
0.382CysSer: 0.382 ± 0.344
0.764CysThr: 0.764 ± 0.531
1.146CysVal: 1.146 ± 0.473
0.764CysTrp: 0.764 ± 0.443
1.146CysTyr: 1.146 ± 0.526
0.0CysXaa: 0.0 ± 0.0
Asp
4.584AspAla: 4.584 ± 1.265
1.528AspCys: 1.528 ± 0.589
4.202AspAsp: 4.202 ± 0.728
3.438AspGlu: 3.438 ± 1.174
3.438AspPhe: 3.438 ± 1.09
3.82AspGly: 3.82 ± 1.299
1.528AspHis: 1.528 ± 0.884
4.584AspIle: 4.584 ± 1.097
3.056AspLys: 3.056 ± 1.446
3.82AspLeu: 3.82 ± 1.194
1.146AspMet: 1.146 ± 0.454
3.056AspAsn: 3.056 ± 0.759
5.348AspPro: 5.348 ± 1.578
1.91AspGln: 1.91 ± 0.491
2.674AspArg: 2.674 ± 0.689
3.438AspSer: 3.438 ± 1.125
6.875AspThr: 6.875 ± 2.542
3.82AspVal: 3.82 ± 1.713
0.764AspTrp: 0.764 ± 0.688
2.292AspTyr: 2.292 ± 0.921
0.0AspXaa: 0.0 ± 0.0
Glu
3.438GluAla: 3.438 ± 0.917
1.146GluCys: 1.146 ± 0.777
4.584GluAsp: 4.584 ± 0.985
8.785GluGlu: 8.785 ± 2.956
1.146GluPhe: 1.146 ± 0.859
4.584GluGly: 4.584 ± 1.164
1.528GluHis: 1.528 ± 0.409
3.056GluIle: 3.056 ± 0.873
3.438GluLys: 3.438 ± 1.871
4.584GluLeu: 4.584 ± 1.709
2.292GluMet: 2.292 ± 0.577
2.292GluAsn: 2.292 ± 0.648
4.584GluPro: 4.584 ± 1.208
2.674GluGln: 2.674 ± 0.983
1.528GluArg: 1.528 ± 1.007
5.73GluSer: 5.73 ± 1.567
6.494GluThr: 6.494 ± 1.869
2.292GluVal: 2.292 ± 0.736
0.764GluTrp: 0.764 ± 0.381
2.674GluTyr: 2.674 ± 0.689
0.0GluXaa: 0.0 ± 0.0
Phe
1.528PheAla: 1.528 ± 0.847
1.146PheCys: 1.146 ± 0.473
1.91PheAsp: 1.91 ± 1.037
2.292PheGlu: 2.292 ± 0.566
1.91PhePhe: 1.91 ± 0.706
1.528PheGly: 1.528 ± 0.949
1.528PheHis: 1.528 ± 0.985
1.528PheIle: 1.528 ± 0.418
2.292PheLys: 2.292 ± 1.23
3.82PheLeu: 3.82 ± 0.797
0.764PheMet: 0.764 ± 0.688
0.764PheAsn: 0.764 ± 0.47
0.764PhePro: 0.764 ± 0.391
1.528PheGln: 1.528 ± 0.971
1.528PheArg: 1.528 ± 0.61
3.438PheSer: 3.438 ± 0.918
2.292PheThr: 2.292 ± 0.766
1.528PheVal: 1.528 ± 0.841
1.528PheTrp: 1.528 ± 1.036
1.146PheTyr: 1.146 ± 0.677
0.0PheXaa: 0.0 ± 0.0
Gly
3.438GlyAla: 3.438 ± 1.983
1.91GlyCys: 1.91 ± 1.208
6.112GlyAsp: 6.112 ± 1.152
6.112GlyGlu: 6.112 ± 1.127
0.382GlyPhe: 0.382 ± 0.336
5.348GlyGly: 5.348 ± 2.822
4.202GlyHis: 4.202 ± 1.345
1.91GlyIle: 1.91 ± 0.982
3.056GlyLys: 3.056 ± 1.006
6.875GlyLeu: 6.875 ± 1.212
0.382GlyMet: 0.382 ± 0.373
2.292GlyAsn: 2.292 ± 0.588
4.584GlyPro: 4.584 ± 1.767
2.674GlyGln: 2.674 ± 0.951
3.82GlyArg: 3.82 ± 1.075
3.82GlySer: 3.82 ± 1.476
4.966GlyThr: 4.966 ± 0.976
3.438GlyVal: 3.438 ± 0.62
1.528GlyTrp: 1.528 ± 1.148
0.0GlyTyr: 0.0 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
1.528HisAla: 1.528 ± 0.681
0.764HisCys: 0.764 ± 0.925
1.91HisAsp: 1.91 ± 0.778
0.764HisGlu: 0.764 ± 0.688
1.146HisPhe: 1.146 ± 0.526
1.528HisGly: 1.528 ± 0.88
0.382HisHis: 0.382 ± 0.344
0.382HisIle: 0.382 ± 0.336
0.764HisLys: 0.764 ± 0.521
3.056HisLeu: 3.056 ± 1.058
0.0HisMet: 0.0 ± 0.0
2.292HisAsn: 2.292 ± 0.518
1.91HisPro: 1.91 ± 0.558
0.764HisGln: 0.764 ± 0.426
1.91HisArg: 1.91 ± 0.794
1.528HisSer: 1.528 ± 1.112
2.292HisThr: 2.292 ± 0.902
0.382HisVal: 0.382 ± 0.373
1.528HisTrp: 1.528 ± 0.492
2.292HisTyr: 2.292 ± 0.518
0.0HisXaa: 0.0 ± 0.0
Ile
1.528IleAla: 1.528 ± 0.649
0.764IleCys: 0.764 ± 0.426
3.82IleAsp: 3.82 ± 1.34
3.438IleGlu: 3.438 ± 1.052
0.764IlePhe: 0.764 ± 0.47
3.056IleGly: 3.056 ± 1.954
0.382IleHis: 0.382 ± 0.344
4.202IleIle: 4.202 ± 2.101
1.91IleLys: 1.91 ± 0.875
1.91IleLeu: 1.91 ± 1.061
1.91IleMet: 1.91 ± 1.438
0.764IleAsn: 0.764 ± 0.426
3.056IlePro: 3.056 ± 0.659
0.764IleGln: 0.764 ± 0.557
4.584IleArg: 4.584 ± 1.321
4.202IleSer: 4.202 ± 0.999
2.292IleThr: 2.292 ± 1.28
3.438IleVal: 3.438 ± 1.209
0.764IleTrp: 0.764 ± 0.6
2.292IleTyr: 2.292 ± 1.075
0.0IleXaa: 0.0 ± 0.0
Lys
3.056LysAla: 3.056 ± 0.547
1.528LysCys: 1.528 ± 0.697
3.056LysAsp: 3.056 ± 1.618
1.91LysGlu: 1.91 ± 0.643
1.91LysPhe: 1.91 ± 1.211
1.91LysGly: 1.91 ± 0.914
2.292LysHis: 2.292 ± 0.978
1.146LysIle: 1.146 ± 0.696
4.202LysLys: 4.202 ± 1.537
3.056LysLeu: 3.056 ± 1.21
0.382LysMet: 0.382 ± 0.344
3.82LysAsn: 3.82 ± 0.783
3.056LysPro: 3.056 ± 1.006
0.764LysGln: 0.764 ± 0.426
4.584LysArg: 4.584 ± 0.638
1.528LysSer: 1.528 ± 1.377
3.438LysThr: 3.438 ± 1.04
4.202LysVal: 4.202 ± 1.047
0.764LysTrp: 0.764 ± 0.531
1.528LysTyr: 1.528 ± 0.658
0.0LysXaa: 0.0 ± 0.0
Leu
3.056LeuAla: 3.056 ± 1.315
1.146LeuCys: 1.146 ± 0.681
5.73LeuAsp: 5.73 ± 0.819
4.966LeuGlu: 4.966 ± 2.674
3.438LeuPhe: 3.438 ± 1.369
3.82LeuGly: 3.82 ± 1.359
1.146LeuHis: 1.146 ± 0.664
2.674LeuIle: 2.674 ± 1.302
5.348LeuLys: 5.348 ± 1.962
9.549LeuLeu: 9.549 ± 3.25
0.764LeuMet: 0.764 ± 0.396
1.528LeuAsn: 1.528 ± 0.489
3.438LeuPro: 3.438 ± 1.858
6.875LeuGln: 6.875 ± 1.38
5.73LeuArg: 5.73 ± 0.543
6.494LeuSer: 6.494 ± 1.399
6.112LeuThr: 6.112 ± 1.574
2.292LeuVal: 2.292 ± 1.173
1.146LeuTrp: 1.146 ± 0.654
3.056LeuTyr: 3.056 ± 1.219
0.0LeuXaa: 0.0 ± 0.0
Met
2.292MetAla: 2.292 ± 0.728
0.382MetCys: 0.382 ± 0.344
1.528MetAsp: 1.528 ± 0.947
1.91MetGlu: 1.91 ± 0.634
2.292MetPhe: 2.292 ± 0.733
1.528MetGly: 1.528 ± 0.681
1.146MetHis: 1.146 ± 0.82
0.764MetIle: 0.764 ± 0.426
0.382MetLys: 0.382 ± 0.302
0.382MetLeu: 0.382 ± 0.453
0.382MetMet: 0.382 ± 0.302
0.0MetAsn: 0.0 ± 0.0
0.764MetPro: 0.764 ± 0.381
1.146MetGln: 1.146 ± 0.397
1.146MetArg: 1.146 ± 0.983
1.528MetSer: 1.528 ± 0.642
1.528MetThr: 1.528 ± 1.07
1.91MetVal: 1.91 ± 0.839
0.0MetTrp: 0.0 ± 0.0
1.146MetTyr: 1.146 ± 0.397
0.0MetXaa: 0.0 ± 0.0
Asn
1.528AsnAla: 1.528 ± 1.036
1.528AsnCys: 1.528 ± 0.847
0.764AsnAsp: 0.764 ± 0.426
1.528AsnGlu: 1.528 ± 0.848
1.91AsnPhe: 1.91 ± 0.97
3.438AsnGly: 3.438 ± 0.886
0.764AsnHis: 0.764 ± 0.907
1.91AsnIle: 1.91 ± 1.39
2.674AsnLys: 2.674 ± 0.794
2.292AsnLeu: 2.292 ± 1.136
0.764AsnMet: 0.764 ± 0.424
4.202AsnAsn: 4.202 ± 1.191
2.292AsnPro: 2.292 ± 1.327
2.674AsnGln: 2.674 ± 0.888
2.292AsnArg: 2.292 ± 1.018
3.82AsnSer: 3.82 ± 0.849
2.674AsnThr: 2.674 ± 1.037
1.528AsnVal: 1.528 ± 0.599
1.146AsnTrp: 1.146 ± 0.397
1.528AsnTyr: 1.528 ± 0.576
0.0AsnXaa: 0.0 ± 0.0
Pro
3.056ProAla: 3.056 ± 0.687
1.528ProCys: 1.528 ± 0.61
5.73ProAsp: 5.73 ± 1.636
4.202ProGlu: 4.202 ± 1.131
1.146ProPhe: 1.146 ± 0.643
3.438ProGly: 3.438 ± 1.032
0.764ProHis: 0.764 ± 0.458
1.91ProIle: 1.91 ± 0.827
3.438ProLys: 3.438 ± 1.287
4.202ProLeu: 4.202 ± 0.919
2.674ProMet: 2.674 ± 0.718
3.056ProAsn: 3.056 ± 0.897
5.73ProPro: 5.73 ± 1.802
3.056ProGln: 3.056 ± 0.75
5.73ProArg: 5.73 ± 1.304
5.73ProSer: 5.73 ± 2.064
3.82ProThr: 3.82 ± 1.804
4.202ProVal: 4.202 ± 1.487
0.764ProTrp: 0.764 ± 0.458
1.91ProTyr: 1.91 ± 1.218
0.0ProXaa: 0.0 ± 0.0
Gln
2.674GlnAla: 2.674 ± 0.86
0.764GlnCys: 0.764 ± 0.732
2.292GlnAsp: 2.292 ± 0.992
3.438GlnGlu: 3.438 ± 1.463
1.528GlnPhe: 1.528 ± 0.549
3.82GlnGly: 3.82 ± 0.902
2.292GlnHis: 2.292 ± 1.03
1.528GlnIle: 1.528 ± 0.549
0.382GlnLys: 0.382 ± 0.453
3.82GlnLeu: 3.82 ± 1.051
2.674GlnMet: 2.674 ± 1.407
0.382GlnAsn: 0.382 ± 0.344
2.292GlnPro: 2.292 ± 0.798
1.91GlnGln: 1.91 ± 0.831
2.674GlnArg: 2.674 ± 1.685
2.292GlnSer: 2.292 ± 0.971
2.674GlnThr: 2.674 ± 0.696
4.584GlnVal: 4.584 ± 0.74
1.146GlnTrp: 1.146 ± 0.513
1.528GlnTyr: 1.528 ± 0.594
0.0GlnXaa: 0.0 ± 0.0
Arg
3.82ArgAla: 3.82 ± 0.751
2.292ArgCys: 2.292 ± 0.982
0.764ArgAsp: 0.764 ± 0.745
1.91ArgGlu: 1.91 ± 1.025
1.146ArgPhe: 1.146 ± 0.623
6.875ArgGly: 6.875 ± 2.109
3.056ArgHis: 3.056 ± 1.016
1.146ArgIle: 1.146 ± 0.446
4.584ArgLys: 4.584 ± 1.486
7.257ArgLeu: 7.257 ± 0.916
2.292ArgMet: 2.292 ± 0.969
1.91ArgAsn: 1.91 ± 0.712
4.966ArgPro: 4.966 ± 2.478
3.438ArgGln: 3.438 ± 1.64
6.875ArgArg: 6.875 ± 1.542
1.528ArgSer: 1.528 ± 0.549
2.674ArgThr: 2.674 ± 1.788
1.528ArgVal: 1.528 ± 0.681
0.764ArgTrp: 0.764 ± 0.521
3.438ArgTyr: 3.438 ± 0.799
0.0ArgXaa: 0.0 ± 0.0
Ser
2.674SerAla: 2.674 ± 1.204
0.382SerCys: 0.382 ± 0.336
4.966SerAsp: 4.966 ± 1.808
5.348SerGlu: 5.348 ± 1.394
3.056SerPhe: 3.056 ± 1.942
5.348SerGly: 5.348 ± 1.09
1.146SerHis: 1.146 ± 0.613
3.438SerIle: 3.438 ± 1.198
3.056SerLys: 3.056 ± 1.399
6.494SerLeu: 6.494 ± 0.78
1.146SerMet: 1.146 ± 0.623
3.056SerAsn: 3.056 ± 1.098
4.966SerPro: 4.966 ± 1.428
3.056SerGln: 3.056 ± 0.776
4.202SerArg: 4.202 ± 1.374
6.494SerSer: 6.494 ± 2.041
6.112SerThr: 6.112 ± 1.975
4.966SerVal: 4.966 ± 2.087
0.382SerTrp: 0.382 ± 0.373
0.764SerTyr: 0.764 ± 0.458
0.0SerXaa: 0.0 ± 0.0
Thr
4.966ThrAla: 4.966 ± 1.831
1.528ThrCys: 1.528 ± 0.681
4.584ThrAsp: 4.584 ± 1.653
5.348ThrGlu: 5.348 ± 1.476
3.438ThrPhe: 3.438 ± 1.377
3.056ThrGly: 3.056 ± 0.999
1.146ThrHis: 1.146 ± 0.37
4.584ThrIle: 4.584 ± 1.165
0.382ThrLys: 0.382 ± 0.344
6.494ThrLeu: 6.494 ± 1.31
1.146ThrMet: 1.146 ± 0.475
1.528ThrAsn: 1.528 ± 0.594
5.73ThrPro: 5.73 ± 2.014
2.292ThrGln: 2.292 ± 0.733
3.438ThrArg: 3.438 ± 0.973
4.966ThrSer: 4.966 ± 1.425
3.82ThrThr: 3.82 ± 0.837
6.494ThrVal: 6.494 ± 1.676
1.146ThrTrp: 1.146 ± 0.764
1.91ThrTyr: 1.91 ± 0.857
0.0ThrXaa: 0.0 ± 0.0
Val
3.438ValAla: 3.438 ± 0.653
1.528ValCys: 1.528 ± 0.891
6.112ValAsp: 6.112 ± 1.524
3.82ValGlu: 3.82 ± 0.94
1.91ValPhe: 1.91 ± 0.484
3.82ValGly: 3.82 ± 0.749
1.146ValHis: 1.146 ± 0.37
1.146ValIle: 1.146 ± 0.826
3.438ValLys: 3.438 ± 1.034
2.674ValLeu: 2.674 ± 1.024
1.528ValMet: 1.528 ± 0.336
2.292ValAsn: 2.292 ± 0.917
3.056ValPro: 3.056 ± 1.125
4.202ValGln: 4.202 ± 1.516
2.674ValArg: 2.674 ± 1.471
5.73ValSer: 5.73 ± 1.341
3.056ValThr: 3.056 ± 1.062
2.674ValVal: 2.674 ± 0.968
0.764ValTrp: 0.764 ± 0.424
1.146ValTyr: 1.146 ± 1.477
0.0ValXaa: 0.0 ± 0.0
Trp
1.146TrpAla: 1.146 ± 0.513
0.382TrpCys: 0.382 ± 0.373
0.382TrpAsp: 0.382 ± 0.302
0.764TrpGlu: 0.764 ± 0.381
1.146TrpPhe: 1.146 ± 0.655
1.528TrpGly: 1.528 ± 0.847
0.764TrpHis: 0.764 ± 0.443
1.528TrpIle: 1.528 ± 0.688
0.764TrpLys: 0.764 ± 0.443
1.146TrpLeu: 1.146 ± 0.623
0.0TrpMet: 0.0 ± 0.0
1.528TrpAsn: 1.528 ± 0.658
0.382TrpPro: 0.382 ± 0.406
0.764TrpGln: 0.764 ± 0.443
1.146TrpArg: 1.146 ± 0.437
2.292TrpSer: 2.292 ± 1.044
1.146TrpThr: 1.146 ± 0.553
1.146TrpVal: 1.146 ± 0.397
0.0TrpTrp: 0.0 ± 0.0
0.382TrpTyr: 0.382 ± 0.373
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.056TyrAla: 3.056 ± 0.888
0.0TyrCys: 0.0 ± 0.0
1.528TyrAsp: 1.528 ± 0.549
3.056TyrGlu: 3.056 ± 0.907
0.382TyrPhe: 0.382 ± 0.336
3.056TyrGly: 3.056 ± 1.198
0.382TyrHis: 0.382 ± 0.406
2.674TyrIle: 2.674 ± 0.991
1.91TyrLys: 1.91 ± 0.491
2.674TyrLeu: 2.674 ± 1.213
0.382TyrMet: 0.382 ± 0.329
2.292TyrAsn: 2.292 ± 0.597
2.292TyrPro: 2.292 ± 0.948
0.764TyrGln: 0.764 ± 0.391
1.528TyrArg: 1.528 ± 0.899
1.91TyrSer: 1.91 ± 0.779
1.528TyrThr: 1.528 ± 0.916
2.292TyrVal: 2.292 ± 0.745
0.382TyrTrp: 0.382 ± 0.302
4.202TyrTyr: 4.202 ± 1.392
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2619 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski