Amino acid dipepetide frequency for Bos taurus papillomavirus 15

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.048AlaAla: 5.048 ± 1.682
0.459AlaCys: 0.459 ± 0.73
2.295AlaAsp: 2.295 ± 1.15
4.589AlaGlu: 4.589 ± 1.097
5.966AlaPhe: 5.966 ± 1.209
1.836AlaGly: 1.836 ± 0.531
0.0AlaHis: 0.0 ± 0.0
1.377AlaIle: 1.377 ± 0.886
3.212AlaLys: 3.212 ± 1.23
1.836AlaLeu: 1.836 ± 1.128
0.0AlaMet: 0.0 ± 0.0
2.754AlaAsn: 2.754 ± 1.038
3.212AlaPro: 3.212 ± 1.923
2.295AlaGln: 2.295 ± 0.882
5.048AlaArg: 5.048 ± 2.409
3.212AlaSer: 3.212 ± 0.569
3.212AlaThr: 3.212 ± 1.136
4.589AlaVal: 4.589 ± 1.362
0.459AlaTrp: 0.459 ± 0.445
2.295AlaTyr: 2.295 ± 1.026
0.0AlaXaa: 0.0 ± 0.0
Cys
0.918CysAla: 0.918 ± 0.768
0.918CysCys: 0.918 ± 0.659
1.377CysAsp: 1.377 ± 0.467
1.377CysGlu: 1.377 ± 1.435
2.295CysPhe: 2.295 ± 1.304
0.459CysGly: 0.459 ± 0.428
0.0CysHis: 0.0 ± 0.0
0.459CysIle: 0.459 ± 0.428
0.918CysLys: 0.918 ± 0.659
1.836CysLeu: 1.836 ± 1.235
1.836CysMet: 1.836 ± 0.624
1.377CysAsn: 1.377 ± 0.929
1.377CysPro: 1.377 ± 0.815
0.459CysGln: 0.459 ± 0.329
0.459CysArg: 0.459 ± 0.428
0.918CysSer: 0.918 ± 0.768
1.377CysThr: 1.377 ± 0.467
0.459CysVal: 0.459 ± 0.403
0.459CysTrp: 0.459 ± 0.428
0.459CysTyr: 0.459 ± 0.73
0.0CysXaa: 0.0 ± 0.0
Asp
3.671AspAla: 3.671 ± 0.65
2.295AspCys: 2.295 ± 0.711
5.048AspAsp: 5.048 ± 1.035
1.836AspGlu: 1.836 ± 0.615
3.212AspPhe: 3.212 ± 0.61
1.836AspGly: 1.836 ± 0.624
0.918AspHis: 0.918 ± 0.419
5.507AspIle: 5.507 ± 2.163
2.754AspLys: 2.754 ± 0.882
4.589AspLeu: 4.589 ± 1.376
1.377AspMet: 1.377 ± 0.934
3.212AspAsn: 3.212 ± 0.866
3.212AspPro: 3.212 ± 0.899
1.836AspGln: 1.836 ± 0.798
1.377AspArg: 1.377 ± 0.794
6.884AspSer: 6.884 ± 2.801
5.507AspThr: 5.507 ± 1.159
2.754AspVal: 2.754 ± 0.629
0.0AspTrp: 0.0 ± 0.0
1.836AspTyr: 1.836 ± 1.311
0.0AspXaa: 0.0 ± 0.0
Glu
3.212GluAla: 3.212 ± 1.334
0.918GluCys: 0.918 ± 0.659
6.884GluAsp: 6.884 ± 1.163
8.261GluGlu: 8.261 ± 2.486
2.295GluPhe: 2.295 ± 1.048
4.589GluGly: 4.589 ± 1.249
0.459GluHis: 0.459 ± 0.445
5.507GluIle: 5.507 ± 1.257
0.918GluLys: 0.918 ± 0.856
4.589GluLeu: 4.589 ± 1.55
1.836GluMet: 1.836 ± 0.531
4.13GluAsn: 4.13 ± 1.363
1.836GluPro: 1.836 ± 0.615
2.295GluGln: 2.295 ± 1.622
2.754GluArg: 2.754 ± 1.49
3.671GluSer: 3.671 ± 2.267
2.754GluThr: 2.754 ± 1.403
5.507GluVal: 5.507 ± 1.421
1.377GluTrp: 1.377 ± 0.652
0.918GluTyr: 0.918 ± 0.48
0.0GluXaa: 0.0 ± 0.0
Phe
3.212PheAla: 3.212 ± 1.485
0.918PheCys: 0.918 ± 0.768
0.918PheAsp: 0.918 ± 0.514
3.212PheGlu: 3.212 ± 0.779
1.377PhePhe: 1.377 ± 0.384
2.754PheGly: 2.754 ± 1.414
0.459PheHis: 0.459 ± 0.445
2.295PheIle: 2.295 ± 1.091
2.295PheLys: 2.295 ± 0.882
6.425PheLeu: 6.425 ± 1.645
0.918PheMet: 0.918 ± 0.749
2.754PheAsn: 2.754 ± 1.164
2.295PhePro: 2.295 ± 0.977
4.13PheGln: 4.13 ± 0.713
2.754PheArg: 2.754 ± 0.512
2.295PheSer: 2.295 ± 0.818
2.754PheThr: 2.754 ± 0.85
3.212PheVal: 3.212 ± 1.269
2.754PheTrp: 2.754 ± 1.524
2.295PheTyr: 2.295 ± 1.318
0.0PheXaa: 0.0 ± 0.0
Gly
4.13GlyAla: 4.13 ± 1.156
1.377GlyCys: 1.377 ± 0.45
2.754GlyAsp: 2.754 ± 0.579
7.343GlyGlu: 7.343 ± 0.992
3.212GlyPhe: 3.212 ± 1.512
8.261GlyGly: 8.261 ± 4.135
0.918GlyHis: 0.918 ± 0.856
3.671GlyIle: 3.671 ± 1.481
4.13GlyLys: 4.13 ± 1.157
4.13GlyLeu: 4.13 ± 0.987
0.459GlyMet: 0.459 ± 0.428
1.836GlyAsn: 1.836 ± 0.885
2.295GlyPro: 2.295 ± 0.899
6.425GlyGln: 6.425 ± 1.941
2.754GlyArg: 2.754 ± 0.935
5.048GlySer: 5.048 ± 1.619
3.671GlyThr: 3.671 ± 0.709
3.212GlyVal: 3.212 ± 1.241
0.459GlyTrp: 0.459 ± 1.18
1.836GlyTyr: 1.836 ± 0.923
0.0GlyXaa: 0.0 ± 0.0
His
0.459HisAla: 0.459 ± 0.445
0.0HisCys: 0.0 ± 0.0
0.918HisAsp: 0.918 ± 0.502
0.0HisGlu: 0.0 ± 0.0
0.918HisPhe: 0.918 ± 0.498
1.377HisGly: 1.377 ± 1.171
0.0HisHis: 0.0 ± 0.0
0.459HisIle: 0.459 ± 0.403
2.295HisLys: 2.295 ± 0.852
0.918HisLeu: 0.918 ± 0.856
0.459HisMet: 0.459 ± 0.329
0.918HisAsn: 0.918 ± 0.659
3.212HisPro: 3.212 ± 1.085
0.459HisGln: 0.459 ± 0.329
1.836HisArg: 1.836 ± 1.311
2.754HisSer: 2.754 ± 0.935
0.918HisThr: 0.918 ± 0.419
2.295HisVal: 2.295 ± 0.61
0.459HisTrp: 0.459 ± 0.403
0.918HisTyr: 0.918 ± 0.48
0.0HisXaa: 0.0 ± 0.0
Ile
2.295IleAla: 2.295 ± 0.989
0.918IleCys: 0.918 ± 0.442
4.589IleAsp: 4.589 ± 1.25
4.589IleGlu: 4.589 ± 0.852
2.295IlePhe: 2.295 ± 1.217
1.836IleGly: 1.836 ± 0.697
0.459IleHis: 0.459 ± 0.403
4.13IleIle: 4.13 ± 1.068
1.836IleLys: 1.836 ± 0.248
1.836IleLeu: 1.836 ± 1.324
0.0IleMet: 0.0 ± 0.0
2.295IleAsn: 2.295 ± 0.711
2.295IlePro: 2.295 ± 1.027
2.295IleGln: 2.295 ± 1.091
2.295IleArg: 2.295 ± 0.79
4.13IleSer: 4.13 ± 0.648
2.754IleThr: 2.754 ± 1.147
5.966IleVal: 5.966 ± 2.105
0.0IleTrp: 0.0 ± 0.0
1.377IleTyr: 1.377 ± 0.861
0.0IleXaa: 0.0 ± 0.0
Lys
1.836LysAla: 1.836 ± 0.798
1.377LysCys: 1.377 ± 0.718
1.836LysAsp: 1.836 ± 1.0
2.295LysGlu: 2.295 ± 0.866
3.671LysPhe: 3.671 ± 2.424
5.507LysGly: 5.507 ± 1.124
2.754LysHis: 2.754 ± 0.984
1.377LysIle: 1.377 ± 0.455
2.754LysLys: 2.754 ± 1.201
2.754LysLeu: 2.754 ± 0.882
0.918LysMet: 0.918 ± 0.856
2.295LysAsn: 2.295 ± 1.091
1.836LysPro: 1.836 ± 0.697
0.459LysGln: 0.459 ± 0.428
4.589LysArg: 4.589 ± 0.907
5.507LysSer: 5.507 ± 2.456
3.671LysThr: 3.671 ± 1.506
2.295LysVal: 2.295 ± 0.532
0.459LysTrp: 0.459 ± 0.403
2.295LysTyr: 2.295 ± 0.724
0.0LysXaa: 0.0 ± 0.0
Leu
3.212LeuAla: 3.212 ± 1.787
0.918LeuCys: 0.918 ± 0.767
5.507LeuAsp: 5.507 ± 1.305
7.343LeuGlu: 7.343 ± 0.715
3.671LeuPhe: 3.671 ± 1.657
5.507LeuGly: 5.507 ± 1.114
3.212LeuHis: 3.212 ± 1.218
2.295LeuIle: 2.295 ± 1.148
5.048LeuLys: 5.048 ± 0.836
10.096LeuLeu: 10.096 ± 5.927
2.295LeuMet: 2.295 ± 1.368
1.836LeuAsn: 1.836 ± 1.155
2.295LeuPro: 2.295 ± 0.532
5.966LeuGln: 5.966 ± 1.622
4.589LeuArg: 4.589 ± 0.812
6.425LeuSer: 6.425 ± 2.428
4.589LeuThr: 4.589 ± 1.841
3.671LeuVal: 3.671 ± 0.83
0.918LeuTrp: 0.918 ± 0.442
5.048LeuTyr: 5.048 ± 1.027
0.0LeuXaa: 0.0 ± 0.0
Met
0.459MetAla: 0.459 ± 0.428
0.918MetCys: 0.918 ± 0.856
1.377MetAsp: 1.377 ± 0.467
0.918MetGlu: 0.918 ± 0.806
0.918MetPhe: 0.918 ± 0.442
1.836MetGly: 1.836 ± 0.697
0.0MetHis: 0.0 ± 0.0
0.918MetIle: 0.918 ± 0.498
1.377MetLys: 1.377 ± 0.929
0.918MetLeu: 0.918 ± 0.498
0.0MetMet: 0.0 ± 0.0
0.918MetAsn: 0.918 ± 0.856
0.0MetPro: 0.0 ± 0.0
0.459MetGln: 0.459 ± 0.445
0.918MetArg: 0.918 ± 0.659
1.836MetSer: 1.836 ± 2.303
1.377MetThr: 1.377 ± 0.637
1.377MetVal: 1.377 ± 0.637
0.0MetTrp: 0.0 ± 0.0
0.918MetTyr: 0.918 ± 0.856
0.0MetXaa: 0.0 ± 0.0
Asn
2.754AsnAla: 2.754 ± 1.172
1.377AsnCys: 1.377 ± 0.718
1.377AsnAsp: 1.377 ± 0.384
4.589AsnGlu: 4.589 ± 1.616
2.754AsnPhe: 2.754 ± 1.201
3.212AsnGly: 3.212 ± 1.15
0.0AsnHis: 0.0 ± 0.0
2.295AsnIle: 2.295 ± 1.121
2.754AsnLys: 2.754 ± 0.933
2.754AsnLeu: 2.754 ± 0.876
0.459AsnMet: 0.459 ± 0.331
2.754AsnAsn: 2.754 ± 1.112
4.13AsnPro: 4.13 ± 1.306
1.836AsnGln: 1.836 ± 0.934
2.754AsnArg: 2.754 ± 1.246
4.589AsnSer: 4.589 ± 1.778
2.754AsnThr: 2.754 ± 0.909
3.212AsnVal: 3.212 ± 1.056
0.918AsnTrp: 0.918 ± 0.442
0.918AsnTyr: 0.918 ± 0.659
0.0AsnXaa: 0.0 ± 0.0
Pro
4.13ProAla: 4.13 ± 1.054
0.918ProCys: 0.918 ± 0.856
4.13ProAsp: 4.13 ± 2.125
2.295ProGlu: 2.295 ± 0.532
2.754ProPhe: 2.754 ± 1.064
3.212ProGly: 3.212 ± 1.941
1.377ProHis: 1.377 ± 0.794
2.295ProIle: 2.295 ± 1.215
4.589ProLys: 4.589 ± 2.272
5.507ProLeu: 5.507 ± 1.63
0.459ProMet: 0.459 ± 0.403
3.671ProAsn: 3.671 ± 1.395
6.884ProPro: 6.884 ± 1.247
0.918ProGln: 0.918 ± 0.767
1.836ProArg: 1.836 ± 0.624
4.13ProSer: 4.13 ± 1.582
5.507ProThr: 5.507 ± 1.5
3.671ProVal: 3.671 ± 1.4
0.0ProTrp: 0.0 ± 0.0
1.377ProTyr: 1.377 ± 1.004
0.0ProXaa: 0.0 ± 0.0
Gln
1.377GlnAla: 1.377 ± 0.806
0.918GlnCys: 0.918 ± 0.498
3.212GlnAsp: 3.212 ± 1.551
1.836GlnGlu: 1.836 ± 0.531
2.295GlnPhe: 2.295 ± 0.805
2.754GlnGly: 2.754 ± 0.69
1.836GlnHis: 1.836 ± 0.997
3.212GlnIle: 3.212 ± 1.136
1.377GlnLys: 1.377 ± 0.718
3.212GlnLeu: 3.212 ± 0.63
0.918GlnMet: 0.918 ± 0.442
1.836GlnAsn: 1.836 ± 0.923
4.13GlnPro: 4.13 ± 1.4
2.295GlnGln: 2.295 ± 0.68
3.212GlnArg: 3.212 ± 0.93
0.0GlnSer: 0.0 ± 0.0
2.295GlnThr: 2.295 ± 1.159
2.754GlnVal: 2.754 ± 0.458
1.836GlnTrp: 1.836 ± 0.615
1.836GlnTyr: 1.836 ± 1.212
0.0GlnXaa: 0.0 ± 0.0
Arg
5.048ArgAla: 5.048 ± 1.501
2.295ArgCys: 2.295 ± 0.882
2.754ArgAsp: 2.754 ± 1.507
2.295ArgGlu: 2.295 ± 1.239
2.295ArgPhe: 2.295 ± 0.852
4.13ArgGly: 4.13 ± 1.582
2.754ArgHis: 2.754 ± 1.191
1.836ArgIle: 1.836 ± 0.871
4.13ArgLys: 4.13 ± 1.363
7.343ArgLeu: 7.343 ± 0.937
0.918ArgMet: 0.918 ± 0.89
3.212ArgAsn: 3.212 ± 0.903
4.13ArgPro: 4.13 ± 1.245
1.377ArgGln: 1.377 ± 0.455
5.966ArgArg: 5.966 ± 1.734
5.507ArgSer: 5.507 ± 2.746
1.377ArgThr: 1.377 ± 0.777
5.048ArgVal: 5.048 ± 0.729
0.459ArgTrp: 0.459 ± 0.445
0.459ArgTyr: 0.459 ± 0.329
0.0ArgXaa: 0.0 ± 0.0
Ser
3.671SerAla: 3.671 ± 0.732
0.0SerCys: 0.0 ± 0.0
4.589SerAsp: 4.589 ± 0.751
4.589SerGlu: 4.589 ± 0.542
2.754SerPhe: 2.754 ± 0.847
7.343SerGly: 7.343 ± 1.54
2.295SerHis: 2.295 ± 0.876
2.754SerIle: 2.754 ± 1.098
2.754SerLys: 2.754 ± 0.656
9.179SerLeu: 9.179 ± 2.293
2.754SerMet: 2.754 ± 1.525
2.295SerAsn: 2.295 ± 0.925
5.507SerPro: 5.507 ± 1.739
3.671SerGln: 3.671 ± 1.061
9.637SerArg: 9.637 ± 4.124
5.966SerSer: 5.966 ± 1.809
3.671SerThr: 3.671 ± 1.949
3.671SerVal: 3.671 ± 1.565
1.377SerTrp: 1.377 ± 0.718
0.918SerTyr: 0.918 ± 0.498
0.0SerXaa: 0.0 ± 0.0
Thr
3.212ThrAla: 3.212 ± 1.639
1.377ThrCys: 1.377 ± 1.255
3.212ThrAsp: 3.212 ± 0.648
1.836ThrGlu: 1.836 ± 0.732
3.212ThrPhe: 3.212 ± 1.13
5.048ThrGly: 5.048 ± 0.435
1.377ThrHis: 1.377 ± 0.861
2.754ThrIle: 2.754 ± 0.984
0.918ThrLys: 0.918 ± 0.514
5.048ThrLeu: 5.048 ± 1.475
0.459ThrMet: 0.459 ± 0.329
3.212ThrAsn: 3.212 ± 1.197
4.589ThrPro: 4.589 ± 2.516
2.295ThrGln: 2.295 ± 1.121
4.589ThrArg: 4.589 ± 0.907
7.802ThrSer: 7.802 ± 3.361
2.295ThrThr: 2.295 ± 2.111
4.589ThrVal: 4.589 ± 1.161
1.377ThrTrp: 1.377 ± 0.886
0.459ThrTyr: 0.459 ± 0.329
0.0ThrXaa: 0.0 ± 0.0
Val
3.212ValAla: 3.212 ± 1.636
1.836ValCys: 1.836 ± 1.403
4.13ValAsp: 4.13 ± 1.255
2.295ValGlu: 2.295 ± 1.673
2.295ValPhe: 2.295 ± 0.795
3.671ValGly: 3.671 ± 0.772
1.377ValHis: 1.377 ± 0.45
3.212ValIle: 3.212 ± 2.152
4.13ValLys: 4.13 ± 0.648
5.966ValLeu: 5.966 ± 2.054
0.918ValMet: 0.918 ± 0.393
3.212ValAsn: 3.212 ± 0.469
4.589ValPro: 4.589 ± 1.01
2.754ValGln: 2.754 ± 1.065
5.048ValArg: 5.048 ± 1.936
5.966ValSer: 5.966 ± 1.664
5.966ValThr: 5.966 ± 1.754
1.377ValVal: 1.377 ± 1.464
0.459ValTrp: 0.459 ± 0.428
1.836ValTyr: 1.836 ± 0.248
0.0ValXaa: 0.0 ± 0.0
Trp
0.459TrpAla: 0.459 ± 0.329
0.0TrpCys: 0.0 ± 0.0
1.377TrpAsp: 1.377 ± 1.382
1.377TrpGlu: 1.377 ± 0.45
0.459TrpPhe: 0.459 ± 0.403
1.377TrpGly: 1.377 ± 0.467
0.459TrpHis: 0.459 ± 0.445
0.918TrpIle: 0.918 ± 0.442
1.377TrpLys: 1.377 ± 0.988
0.918TrpLeu: 0.918 ± 0.442
0.0TrpMet: 0.0 ± 0.0
1.836TrpAsn: 1.836 ± 0.885
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
1.836TrpSer: 1.836 ± 1.328
0.459TrpThr: 0.459 ± 0.445
1.836TrpVal: 1.836 ± 0.615
0.459TrpTrp: 0.459 ± 0.329
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.377TyrAla: 1.377 ± 0.806
0.0TyrCys: 0.0 ± 0.0
1.377TyrAsp: 1.377 ± 0.886
1.836TyrGlu: 1.836 ± 0.871
1.377TyrPhe: 1.377 ± 1.202
1.377TyrGly: 1.377 ± 0.652
0.918TyrHis: 0.918 ± 0.514
0.918TyrIle: 0.918 ± 0.442
0.918TyrLys: 0.918 ± 0.442
4.589TyrLeu: 4.589 ± 1.558
0.0TyrMet: 0.0 ± 0.0
1.836TyrAsn: 1.836 ± 0.697
1.836TyrPro: 1.836 ± 0.96
0.918TyrGln: 0.918 ± 0.659
0.918TyrArg: 0.918 ± 0.48
0.918TyrSer: 0.918 ± 0.514
2.754TyrThr: 2.754 ± 1.235
3.212TyrVal: 3.212 ± 0.75
0.918TyrTrp: 0.918 ± 0.514
1.377TyrTyr: 1.377 ± 0.455
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2180 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski