Amino acid dipeptide frequency for Bos taurus papillomavirus 20

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
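As an illustration, counting dipeptides could look like the sketch below. It assumes one-letter amino acid codes, overlapping adjacent pairs counted within each protein, and normalisation by the total number of pairs; the function name and toy sequences are made up, and the exact counting convention used for the table is not stated on this page.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: fraction} over overlapping adjacent residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Pairs (0,1), (1,2), ... are taken inside one protein only,
        # so no pair spans the boundary between two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from this proteome).
toy = ["MKKLA", "AKKL"]
freqs = dipeptide_frequencies(toy)
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

Counting per protein rather than on a concatenated sequence avoids the spurious "AA" pair that would otherwise appear where the two toy sequences meet.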

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely to occur in proteins, each of the 400 would be present at a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
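The arithmetic behind the 0.25% figure and the per mille unit can be checked in a few lines of Python. This is only a sketch: the function names are hypothetical, and using exactly 1/400 as the cutoff between overrepresented and underrepresented is an assumption, since the page does not state the threshold used for the colouring.

```python
N_STANDARD = 20  # standard amino acids; 21 if Xaa is included

# Uniform expectation: 1/400 = 0.25% = 2.5 per mille.
expected_permille = 1000 / N_STANDARD ** 2


def permille_to_fraction(value_permille):
    """Convert a table value in per mille to a plain fraction."""
    return value_permille * 1e-3


def classify(value_permille, expected=expected_permille):
    """Label a value relative to the uniform expectation (assumed cutoff)."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "as expected"


# Two values taken from the table: AlaAla and AlaCys.
print(expected_permille)            # 2.5
print(classify(7.326))              # over
print(classify(0.407))              # under
print(permille_to_fraction(7.326))  # about 0.007326
```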

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.326 ± 1.912
AlaCys: 0.407 ± 0.467
AlaAsp: 4.07 ± 1.26
AlaGlu: 6.512 ± 1.3
AlaPhe: 3.256 ± 1.018
AlaGly: 3.256 ± 0.849
AlaHis: 0.407 ± 0.362
AlaIle: 1.221 ± 0.891
AlaLys: 4.477 ± 0.999
AlaLeu: 3.256 ± 1.208
AlaMet: 0.814 ± 0.371
AlaAsn: 2.442 ± 0.573
AlaPro: 3.256 ± 1.218
AlaGln: 4.477 ± 1.346
AlaArg: 3.256 ± 1.111
AlaSer: 3.256 ± 0.814
AlaThr: 3.663 ± 1.293
AlaVal: 2.849 ± 0.765
AlaTrp: 0.407 ± 0.362
AlaTyr: 1.628 ± 0.889
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.407 ± 0.289
CysCys: 1.628 ± 0.996
CysAsp: 0.814 ± 0.371
CysGlu: 2.849 ± 1.275
CysPhe: 1.221 ± 0.596
CysGly: 0.407 ± 0.54
CysHis: 0.0 ± 0.0
CysIle: 0.407 ± 0.289
CysLys: 2.035 ± 1.524
CysLeu: 1.628 ± 1.157
CysMet: 0.407 ± 0.371
CysAsn: 1.628 ± 0.658
CysPro: 1.628 ± 0.604
CysGln: 0.407 ± 0.289
CysArg: 0.814 ± 0.439
CysSer: 0.814 ± 0.498
CysThr: 1.628 ± 0.83
CysVal: 1.221 ± 0.85
CysTrp: 0.0 ± 0.0
CysTyr: 0.407 ± 0.54
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.884 ± 1.289
AspCys: 1.628 ± 0.889
AspAsp: 3.663 ± 0.555
AspGlu: 3.663 ± 0.929
AspPhe: 2.849 ± 1.113
AspGly: 2.849 ± 0.969
AspHis: 0.407 ± 0.289
AspIle: 4.477 ± 1.714
AspLys: 3.663 ± 1.473
AspLeu: 4.477 ± 0.991
AspMet: 0.814 ± 0.371
AspAsn: 3.256 ± 0.933
AspPro: 5.698 ± 2.036
AspGln: 2.442 ± 1.456
AspArg: 2.442 ± 1.207
AspSer: 3.663 ± 0.726
AspThr: 2.849 ± 1.414
AspVal: 2.849 ± 1.281
AspTrp: 0.407 ± 0.289
AspTyr: 1.628 ± 0.535
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.035 ± 0.787
GluCys: 1.221 ± 0.559
GluAsp: 5.291 ± 1.732
GluGlu: 6.105 ± 1.321
GluPhe: 2.035 ± 1.116
GluGly: 5.698 ± 1.329
GluHis: 1.628 ± 0.806
GluIle: 3.663 ± 1.562
GluLys: 1.221 ± 0.807
GluLeu: 5.698 ± 1.914
GluMet: 0.814 ± 0.439
GluAsn: 2.849 ± 0.805
GluPro: 2.849 ± 1.096
GluGln: 4.07 ± 1.123
GluArg: 2.849 ± 1.529
GluSer: 4.884 ± 1.697
GluThr: 3.663 ± 0.947
GluVal: 4.07 ± 1.091
GluTrp: 0.0 ± 0.0
GluTyr: 2.849 ± 0.607
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.442 ± 0.98
PheCys: 1.221 ± 1.019
PheAsp: 3.256 ± 1.185
PheGlu: 2.442 ± 0.819
PhePhe: 0.814 ± 0.371
PheGly: 4.07 ± 0.587
PheHis: 0.407 ± 0.346
PheIle: 2.035 ± 0.837
PheLys: 2.442 ± 0.835
PheLeu: 4.884 ± 1.596
PheMet: 1.221 ± 0.641
PheAsn: 2.035 ± 0.903
PhePro: 2.035 ± 1.099
PheGln: 2.442 ± 0.746
PheArg: 2.035 ± 0.47
PheSer: 2.849 ± 1.433
PheThr: 2.035 ± 1.026
PheVal: 2.442 ± 0.82
PheTrp: 2.035 ± 1.026
PheTyr: 1.221 ± 0.679
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.663 ± 0.802
GlyCys: 0.814 ± 0.399
GlyAsp: 2.849 ± 1.017
GlyGlu: 3.256 ± 0.698
GlyPhe: 3.256 ± 0.956
GlyGly: 4.477 ± 1.631
GlyHis: 1.628 ± 0.806
GlyIle: 4.477 ± 1.353
GlyLys: 3.663 ± 0.841
GlyLeu: 4.07 ± 2.331
GlyMet: 0.407 ± 0.543
GlyAsn: 1.628 ± 0.554
GlyPro: 5.291 ± 3.136
GlyGln: 3.663 ± 0.877
GlyArg: 3.663 ± 1.171
GlySer: 7.733 ± 1.788
GlyThr: 3.663 ± 1.068
GlyVal: 3.663 ± 0.89
GlyTrp: 0.407 ± 0.362
GlyTyr: 1.628 ± 0.838
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.0 ± 0.0
HisAsp: 0.407 ± 0.346
HisGlu: 1.221 ± 0.401
HisPhe: 0.814 ± 0.498
HisGly: 0.814 ± 0.419
HisHis: 0.0 ± 0.0
HisIle: 1.221 ± 0.712
HisLys: 0.814 ± 0.426
HisLeu: 2.849 ± 1.185
HisMet: 0.0 ± 0.0
HisAsn: 0.407 ± 0.346
HisPro: 3.256 ± 0.638
HisGln: 0.407 ± 0.289
HisArg: 0.814 ± 0.519
HisSer: 2.035 ± 0.434
HisThr: 0.814 ± 0.692
HisVal: 0.814 ± 0.371
HisTrp: 0.814 ± 0.371
HisTyr: 2.442 ± 1.29
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.07 ± 1.498
IleCys: 0.0 ± 0.0
IleAsp: 2.035 ± 0.775
IleGlu: 2.035 ± 0.548
IlePhe: 2.035 ± 0.659
IleGly: 3.256 ± 1.597
IleHis: 0.814 ± 0.399
IleIle: 2.442 ± 2.075
IleLys: 2.849 ± 0.557
IleLeu: 2.849 ± 0.968
IleMet: 1.628 ± 0.493
IleAsn: 1.221 ± 0.559
IlePro: 2.442 ± 1.619
IleGln: 1.628 ± 0.806
IleArg: 2.035 ± 0.408
IleSer: 2.849 ± 0.936
IleThr: 1.628 ± 0.72
IleVal: 3.663 ± 0.663
IleTrp: 0.407 ± 0.54
IleTyr: 2.442 ± 1.352
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.256 ± 0.734
LysCys: 0.814 ± 0.439
LysAsp: 3.663 ± 1.222
LysGlu: 3.256 ± 1.416
LysPhe: 3.663 ± 1.581
LysGly: 3.663 ± 1.068
LysHis: 2.035 ± 0.557
LysIle: 0.407 ± 0.289
LysLys: 4.07 ± 1.28
LysLeu: 6.512 ± 1.486
LysMet: 0.814 ± 0.569
LysAsn: 1.628 ± 0.799
LysPro: 2.035 ± 0.849
LysGln: 2.849 ± 0.891
LysArg: 4.477 ± 1.334
LysSer: 3.663 ± 1.323
LysThr: 3.663 ± 1.172
LysVal: 2.849 ± 0.738
LysTrp: 0.814 ± 0.399
LysTyr: 2.035 ± 0.659
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.07 ± 1.134
LeuCys: 3.663 ± 2.431
LeuAsp: 5.698 ± 2.173
LeuGlu: 7.733 ± 1.348
LeuPhe: 2.442 ± 0.698
LeuGly: 3.663 ± 0.736
LeuHis: 3.256 ± 0.83
LeuIle: 3.256 ± 0.927
LeuLys: 4.884 ± 0.993
LeuLeu: 8.954 ± 3.048
LeuMet: 2.035 ± 0.82
LeuAsn: 3.663 ± 1.233
LeuPro: 6.105 ± 1.22
LeuGln: 4.477 ± 1.042
LeuArg: 4.477 ± 2.371
LeuSer: 7.733 ± 1.261
LeuThr: 0.407 ± 0.346
LeuVal: 4.477 ± 1.597
LeuTrp: 0.814 ± 0.371
LeuTyr: 5.291 ± 0.877
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.0 ± 0.0
MetCys: 0.814 ± 0.724
MetAsp: 0.407 ± 0.289
MetGlu: 0.814 ± 0.637
MetPhe: 0.814 ± 0.371
MetGly: 0.407 ± 0.289
MetHis: 0.407 ± 0.289
MetIle: 0.407 ± 0.362
MetLys: 1.221 ± 0.677
MetLeu: 1.221 ± 0.609
MetMet: 0.814 ± 0.439
MetAsn: 0.0 ± 0.0
MetPro: 0.0 ± 0.0
MetGln: 1.221 ± 0.362
MetArg: 1.628 ± 0.8
MetSer: 1.628 ± 0.554
MetThr: 1.221 ± 0.401
MetVal: 2.035 ± 0.659
MetTrp: 1.221 ± 0.678
MetTyr: 1.221 ± 0.678
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.628 ± 0.82
AsnCys: 1.628 ± 1.156
AsnAsp: 0.407 ± 0.371
AsnGlu: 2.035 ± 0.538
AsnPhe: 1.628 ± 0.74
AsnGly: 2.035 ± 0.408
AsnHis: 1.221 ± 0.453
AsnIle: 0.814 ± 0.371
AsnLys: 2.035 ± 1.026
AsnLeu: 4.07 ± 2.146
AsnMet: 0.407 ± 0.289
AsnAsn: 2.035 ± 0.923
AsnPro: 3.256 ± 1.111
AsnGln: 2.442 ± 0.761
AsnArg: 2.442 ± 0.773
AsnSer: 2.442 ± 0.725
AsnThr: 2.035 ± 0.822
AsnVal: 3.256 ± 0.766
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.814 ± 0.519
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.884 ± 1.512
ProCys: 0.814 ± 0.724
ProAsp: 7.326 ± 2.979
ProGlu: 6.512 ± 1.9
ProPhe: 2.442 ± 1.117
ProGly: 3.256 ± 1.459
ProHis: 0.407 ± 0.346
ProIle: 2.035 ± 0.919
ProLys: 2.442 ± 0.878
ProLeu: 5.291 ± 1.317
ProMet: 0.0 ± 0.0
ProAsn: 1.628 ± 0.743
ProPro: 9.361 ± 1.623
ProGln: 1.628 ± 2.173
ProArg: 3.663 ± 1.873
ProSer: 4.884 ± 0.976
ProThr: 5.291 ± 1.419
ProVal: 4.884 ± 1.303
ProTrp: 0.0 ± 0.0
ProTyr: 2.035 ± 0.979
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.442 ± 1.053
GlnCys: 0.407 ± 0.289
GlnAsp: 5.698 ± 1.22
GlnGlu: 1.628 ± 1.644
GlnPhe: 1.221 ± 0.595
GlnGly: 4.07 ± 0.796
GlnHis: 0.407 ± 0.371
GlnIle: 3.663 ± 1.146
GlnLys: 2.442 ± 0.824
GlnLeu: 3.256 ± 1.1
GlnMet: 2.035 ± 1.368
GlnAsn: 2.035 ± 1.562
GlnPro: 4.477 ± 1.233
GlnGln: 3.663 ± 1.227
GlnArg: 2.442 ± 0.815
GlnSer: 0.407 ± 0.362
GlnThr: 2.849 ± 0.605
GlnVal: 2.849 ± 0.575
GlnTrp: 1.628 ± 0.878
GlnTyr: 2.442 ± 0.935
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.884 ± 0.982
ArgCys: 1.628 ± 1.0
ArgAsp: 1.628 ± 0.526
ArgGlu: 1.628 ± 0.67
ArgPhe: 2.442 ± 1.031
ArgGly: 7.326 ± 2.665
ArgHis: 1.221 ± 0.674
ArgIle: 1.628 ± 0.598
ArgLys: 4.07 ± 0.811
ArgLeu: 7.326 ± 2.022
ArgMet: 1.628 ± 1.199
ArgAsn: 1.221 ± 0.644
ArgPro: 3.663 ± 1.193
ArgGln: 4.07 ± 1.64
ArgArg: 6.105 ± 1.684
ArgSer: 6.919 ± 2.499
ArgThr: 4.884 ± 2.333
ArgVal: 2.035 ± 0.741
ArgTrp: 0.814 ± 0.531
ArgTyr: 0.0 ± 0.0
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.477 ± 0.872
SerCys: 0.814 ± 0.519
SerAsp: 3.663 ± 0.695
SerGlu: 2.849 ± 0.794
SerPhe: 3.256 ± 1.616
SerGly: 7.326 ± 1.56
SerHis: 1.221 ± 0.593
SerIle: 1.221 ± 0.373
SerLys: 3.256 ± 0.993
SerLeu: 8.14 ± 2.591
SerMet: 2.442 ± 0.938
SerAsn: 2.035 ± 0.557
SerPro: 3.663 ± 0.761
SerGln: 2.849 ± 0.925
SerArg: 8.954 ± 4.224
SerSer: 7.733 ± 1.796
SerThr: 5.698 ± 2.214
SerVal: 3.663 ± 1.053
SerTrp: 2.035 ± 0.897
SerTyr: 0.814 ± 0.519
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.849 ± 0.605
ThrCys: 0.407 ± 0.289
ThrAsp: 3.256 ± 1.597
ThrGlu: 1.628 ± 0.604
ThrPhe: 5.291 ± 1.072
ThrGly: 3.663 ± 0.74
ThrHis: 1.628 ± 0.687
ThrIle: 4.07 ± 1.323
ThrLys: 2.442 ± 1.257
ThrLeu: 3.256 ± 1.068
ThrMet: 0.0 ± 0.0
ThrAsn: 2.035 ± 0.706
ThrPro: 4.884 ± 1.397
ThrGln: 2.849 ± 0.586
ThrArg: 5.698 ± 1.636
ThrSer: 4.884 ± 2.237
ThrThr: 3.256 ± 1.072
ThrVal: 5.698 ± 1.333
ThrTrp: 0.814 ± 0.439
ThrTyr: 1.628 ± 0.287
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.849 ± 0.976
ValCys: 2.442 ± 0.725
ValAsp: 2.442 ± 0.789
ValGlu: 3.256 ± 0.969
ValPhe: 2.849 ± 0.467
ValGly: 1.628 ± 0.882
ValHis: 2.035 ± 1.471
ValIle: 2.849 ± 1.415
ValLys: 4.884 ± 1.232
ValLeu: 4.477 ± 1.152
ValMet: 0.814 ± 0.379
ValAsn: 1.221 ± 0.401
ValPro: 3.663 ± 1.14
ValGln: 2.849 ± 0.604
ValArg: 4.477 ± 1.848
ValSer: 5.698 ± 1.799
ValThr: 6.512 ± 1.345
ValVal: 3.663 ± 0.677
ValTrp: 0.814 ± 0.426
ValTyr: 0.814 ± 0.426
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.814 ± 0.371
TrpCys: 0.0 ± 0.0
TrpAsp: 0.814 ± 0.724
TrpGlu: 0.814 ± 0.531
TrpPhe: 0.407 ± 0.362
TrpGly: 0.407 ± 0.289
TrpHis: 0.407 ± 0.371
TrpIle: 0.814 ± 0.371
TrpLys: 2.035 ± 1.167
TrpLeu: 1.628 ± 0.743
TrpMet: 0.0 ± 0.0
TrpAsn: 1.221 ± 0.699
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 0.814 ± 1.079
TrpSer: 0.814 ± 0.426
TrpThr: 1.628 ± 1.199
TrpVal: 1.221 ± 0.453
TrpTrp: 0.407 ± 0.289
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.256 ± 0.999
TyrCys: 0.0 ± 0.0
TyrAsp: 1.221 ± 0.401
TyrGlu: 3.256 ± 1.067
TyrPhe: 1.628 ± 0.616
TyrGly: 1.628 ± 0.67
TyrHis: 0.407 ± 0.543
TyrIle: 1.221 ± 0.373
TyrLys: 1.221 ± 0.559
TyrLeu: 3.663 ± 0.77
TyrMet: 0.0 ± 0.0
TyrAsn: 2.442 ± 0.95
TyrPro: 1.221 ± 1.085
TyrGln: 1.628 ± 0.677
TyrArg: 2.035 ± 0.706
TyrSer: 1.221 ± 0.802
TyrThr: 2.849 ± 0.613
TyrVal: 2.035 ± 0.967
TyrTrp: 0.407 ± 0.371
TyrTyr: 3.256 ± 1.264
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2458 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
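A protein-level bootstrap of this kind could be sketched as follows. The page only states that bootstrapping was done 100 times at the protein level; resampling whole proteins with replacement and reporting the standard deviation of the replicate estimates is an assumption, and all function names and sequences here are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_fraction(proteins, pair):
    """Fraction of all adjacent residue pairs that equal `pair`."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the pair fraction over resampled protein sets."""
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement (assumed procedure).
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_fraction(sample, pair))
    return statistics.stdev(estimates)


# Toy input (made-up sequences, one-letter codes).
toy = ["MKKLA", "AKKL", "GGSGG"]
print(bootstrap_error(toy, "KK"))
print(bootstrap_error(toy, "WW"))  # absent pair: every replicate is 0
```

Under this procedure a dipeptide that occurs in no protein has zero error, which is consistent with the 0.0 ± 0.0 entries in the table.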

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski