Amino acid dipeptide frequency for Castor canadensis papillomavirus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins (with all amino acids equally common), each dipeptide would be present at a frequency of 0.25%, i.e. 2.5‰. As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
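The calculation described above can be sketched as follows. This is a minimal illustration and not the code used by the database: the function names `dipeptide_per_mille` and `classify` are hypothetical, sequences are assumed to be in one-letter code, and frequencies are assumed to be normalized by the total number of overlapping dipeptides counted within each protein.

```python
from collections import Counter
from itertools import product

# 20 standard residues in one-letter code; anything else is pooled as "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"

# Expected frequency if all 400 standard dipeptides were equally likely.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides.

    Assumption: the denominator is the total number of overlapping
    residue pairs, and pairs never span two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_per_mille):
    """Label a frequency relative to the uniform expectation of 2.5 per mille."""
    if freq_per_mille > EXPECTED_PER_MILLE:
        return "over"
    if freq_per_mille < EXPECTED_PER_MILLE:
        return "under"
    return "expected"
```

With this convention, a table value such as LeuLeu (10.701‰) would be labelled "over" and AlaHis (1.07‰) "under".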

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.886 ± 1.721
AlaCys: 2.675 ± 1.604
AlaAsp: 4.815 ± 1.455
AlaGlu: 3.745 ± 1.365
AlaPhe: 3.745 ± 1.069
AlaGly: 4.28 ± 1.854
AlaHis: 1.07 ± 0.983
AlaIle: 1.605 ± 0.335
AlaLys: 4.815 ± 2.028
AlaLeu: 4.815 ± 1.744
AlaMet: 1.07 ± 0.451
AlaAsn: 2.14 ± 0.522
AlaPro: 2.14 ± 0.661
AlaGln: 3.745 ± 1.886
AlaArg: 3.21 ± 1.974
AlaSer: 3.21 ± 0.981
AlaThr: 2.675 ± 1.251
AlaVal: 4.28 ± 0.926
AlaTrp: 1.07 ± 0.451
AlaTyr: 2.675 ± 0.651
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 3.21 ± 2.097
CysCys: 0.535 ± 0.944
CysAsp: 0.535 ± 0.947
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 1.07 ± 1.488
CysHis: 0.535 ± 0.947
CysIle: 2.675 ± 1.281
CysLys: 3.21 ± 1.565
CysLeu: 0.535 ± 0.944
CysMet: 0.535 ± 0.408
CysAsn: 0.535 ± 0.408
CysPro: 2.675 ± 0.864
CysGln: 0.535 ± 0.944
CysArg: 1.07 ± 0.93
CysSer: 0.535 ± 0.408
CysThr: 1.07 ± 0.815
CysVal: 1.07 ± 0.983
CysTrp: 0.535 ± 0.408
CysTyr: 1.07 ± 1.488
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.35 ± 1.709
AspCys: 2.14 ± 1.111
AspAsp: 4.28 ± 1.523
AspGlu: 5.886 ± 3.865
AspPhe: 1.605 ± 0.744
AspGly: 3.21 ± 0.961
AspHis: 1.07 ± 1.03
AspIle: 4.815 ± 2.716
AspLys: 2.14 ± 0.996
AspLeu: 4.815 ± 2.975
AspMet: 1.07 ± 0.451
AspAsn: 4.28 ± 1.092
AspPro: 3.745 ± 1.961
AspGln: 2.675 ± 1.042
AspArg: 2.675 ± 0.875
AspSer: 3.745 ± 0.789
AspThr: 6.956 ± 1.334
AspVal: 5.886 ± 1.236
AspTrp: 0.535 ± 0.408
AspTyr: 2.14 ± 0.642
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.21 ± 0.876
GluCys: 0.535 ± 0.408
GluAsp: 4.28 ± 1.045
GluGlu: 2.14 ± 0.781
GluPhe: 2.14 ± 0.989
GluGly: 4.28 ± 1.235
GluHis: 3.21 ± 1.141
GluIle: 2.14 ± 0.661
GluLys: 2.14 ± 1.778
GluLeu: 8.026 ± 1.904
GluMet: 0.535 ± 0.408
GluAsn: 2.675 ± 1.205
GluPro: 2.14 ± 1.067
GluGln: 4.28 ± 1.651
GluArg: 1.07 ± 0.865
GluSer: 3.21 ± 0.845
GluThr: 1.07 ± 0.913
GluVal: 4.815 ± 2.645
GluTrp: 1.07 ± 0.815
GluTyr: 0.535 ± 0.432
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.675 ± 0.791
PheCys: 1.07 ± 0.983
PheAsp: 5.35 ± 0.774
PheGlu: 2.675 ± 1.086
PhePhe: 2.14 ± 0.522
PheGly: 3.745 ± 1.783
PheHis: 0.535 ± 0.408
PheIle: 2.675 ± 0.651
PheLys: 2.675 ± 1.281
PheLeu: 4.28 ± 1.521
PheMet: 0.535 ± 0.432
PheAsn: 2.675 ± 1.6
PhePro: 3.21 ± 0.845
PheGln: 1.605 ± 0.744
PheArg: 2.14 ± 0.903
PheSer: 2.14 ± 1.314
PheThr: 1.605 ± 1.223
PheVal: 3.21 ± 1.01
PheTrp: 1.605 ± 0.744
PheTyr: 3.745 ± 1.356
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.21 ± 0.678
GlyCys: 1.07 ± 0.451
GlyAsp: 5.886 ± 2.053
GlyGlu: 4.28 ± 0.655
GlyPhe: 2.675 ± 1.094
GlyGly: 6.421 ± 2.988
GlyHis: 2.675 ± 1.25
GlyIle: 2.675 ± 0.64
GlyLys: 2.675 ± 1.049
GlyLeu: 4.815 ± 1.678
GlyMet: 0.0 ± 0.0
GlyAsn: 3.21 ± 1.261
GlyPro: 5.35 ± 3.513
GlyGln: 3.745 ± 1.307
GlyArg: 4.815 ± 2.666
GlySer: 5.886 ± 1.886
GlyThr: 3.745 ± 1.038
GlyVal: 3.745 ± 2.067
GlyTrp: 0.535 ± 0.408
GlyTyr: 1.605 ± 0.859
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.605 ± 1.148
HisCys: 1.07 ± 0.933
HisAsp: 0.535 ± 0.408
HisGlu: 0.0 ± 0.0
HisPhe: 1.07 ± 0.983
HisGly: 1.605 ± 0.859
HisHis: 1.07 ± 0.815
HisIle: 1.605 ± 0.789
HisLys: 0.535 ± 0.944
HisLeu: 2.14 ± 1.327
HisMet: 0.535 ± 0.432
HisAsn: 0.535 ± 0.408
HisPro: 3.21 ± 1.42
HisGln: 1.07 ± 0.933
HisArg: 1.07 ± 0.815
HisSer: 1.07 ± 0.451
HisThr: 2.14 ± 0.522
HisVal: 0.0 ± 0.0
HisTrp: 0.535 ± 0.432
HisTyr: 1.07 ± 0.431
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.28 ± 0.894
IleCys: 0.535 ± 0.408
IleAsp: 1.07 ± 0.815
IleGlu: 5.35 ± 1.521
IlePhe: 3.21 ± 1.461
IleGly: 5.886 ± 2.137
IleHis: 0.0 ± 0.0
IleIle: 2.14 ± 1.825
IleLys: 1.605 ± 0.335
IleLeu: 2.675 ± 1.19
IleMet: 1.605 ± 0.991
IleAsn: 2.14 ± 0.642
IlePro: 3.745 ± 1.977
IleGln: 1.07 ± 0.431
IleArg: 1.605 ± 0.92
IleSer: 3.21 ± 0.82
IleThr: 3.745 ± 0.777
IleVal: 5.35 ± 1.665
IleTrp: 0.0 ± 0.0
IleTyr: 1.605 ± 0.704
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.28 ± 1.139
LysCys: 1.605 ± 0.89
LysAsp: 3.21 ± 1.141
LysGlu: 1.605 ± 1.297
LysPhe: 3.21 ± 1.234
LysGly: 0.0 ± 0.0
LysHis: 1.07 ± 0.815
LysIle: 4.28 ± 0.859
LysLys: 1.605 ± 1.223
LysLeu: 2.14 ± 1.844
LysMet: 1.07 ± 0.865
LysAsn: 1.07 ± 0.451
LysPro: 0.535 ± 0.408
LysGln: 0.535 ± 0.432
LysArg: 3.21 ± 0.876
LysSer: 3.745 ± 1.954
LysThr: 2.14 ± 1.024
LysVal: 3.21 ± 1.873
LysTrp: 0.0 ± 0.0
LysTyr: 2.14 ± 0.903
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.745 ± 0.789
LeuCys: 2.675 ± 2.135
LeuAsp: 4.28 ± 0.936
LeuGlu: 2.675 ± 0.761
LeuPhe: 5.886 ± 1.354
LeuGly: 8.026 ± 1.904
LeuHis: 2.14 ± 0.982
LeuIle: 3.21 ± 1.84
LeuLys: 2.14 ± 0.982
LeuLeu: 10.701 ± 4.502
LeuMet: 2.675 ± 1.016
LeuAsn: 2.675 ± 1.0
LeuPro: 4.28 ± 1.951
LeuGln: 9.096 ± 2.653
LeuArg: 6.956 ± 1.99
LeuSer: 7.491 ± 3.166
LeuThr: 5.886 ± 1.788
LeuVal: 3.745 ± 0.925
LeuTrp: 0.0 ± 0.0
LeuTyr: 2.675 ± 0.864
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.07 ± 0.451
MetCys: 0.0 ± 0.0
MetAsp: 1.07 ± 0.815
MetGlu: 0.535 ± 0.944
MetPhe: 1.605 ± 1.102
MetGly: 1.07 ± 0.533
MetHis: 0.535 ± 0.408
MetIle: 1.605 ± 0.987
MetLys: 0.0 ± 0.0
MetLeu: 2.14 ± 1.63
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 0.535 ± 0.456
MetGln: 0.535 ± 0.432
MetArg: 0.535 ± 0.432
MetSer: 3.745 ± 1.846
MetThr: 0.0 ± 0.0
MetVal: 2.14 ± 0.903
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.21 ± 1.408
AsnCys: 1.07 ± 0.933
AsnAsp: 2.14 ± 0.672
AsnGlu: 1.07 ± 0.815
AsnPhe: 1.605 ± 0.704
AsnGly: 2.675 ± 0.651
AsnHis: 0.0 ± 0.0
AsnIle: 3.21 ± 0.974
AsnLys: 1.605 ± 0.335
AsnLeu: 2.14 ± 0.661
AsnMet: 0.535 ± 0.456
AsnAsn: 1.07 ± 0.533
AsnPro: 3.21 ± 0.981
AsnGln: 1.605 ± 0.784
AsnArg: 2.675 ± 2.162
AsnSer: 3.745 ± 1.641
AsnThr: 4.28 ± 1.854
AsnVal: 1.07 ± 0.815
AsnTrp: 0.535 ± 0.408
AsnTyr: 1.07 ± 0.865
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.21 ± 1.461
ProCys: 1.07 ± 1.888
ProAsp: 4.815 ± 1.832
ProGlu: 3.745 ± 1.582
ProPhe: 3.21 ± 1.354
ProGly: 1.605 ± 0.704
ProHis: 1.07 ± 1.03
ProIle: 2.14 ± 0.522
ProLys: 3.21 ± 0.678
ProLeu: 5.35 ± 1.076
ProMet: 0.535 ± 0.456
ProAsn: 3.21 ± 0.67
ProPro: 5.35 ± 0.91
ProGln: 3.21 ± 0.876
ProArg: 3.745 ± 2.556
ProSer: 7.491 ± 3.004
ProThr: 2.14 ± 1.202
ProVal: 4.28 ± 2.491
ProTrp: 0.535 ± 0.456
ProTyr: 0.535 ± 0.944
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.605 ± 0.335
GlnCys: 0.0 ± 0.0
GlnAsp: 1.07 ± 0.431
GlnGlu: 3.745 ± 1.704
GlnPhe: 1.605 ± 0.335
GlnGly: 2.675 ± 1.086
GlnHis: 1.07 ± 0.815
GlnIle: 1.605 ± 0.335
GlnLys: 1.605 ± 0.335
GlnLeu: 6.421 ± 2.296
GlnMet: 1.605 ± 0.784
GlnAsn: 3.21 ± 1.569
GlnPro: 3.21 ± 0.678
GlnGln: 3.21 ± 0.678
GlnArg: 1.605 ± 0.784
GlnSer: 3.745 ± 2.067
GlnThr: 1.07 ± 0.533
GlnVal: 3.745 ± 2.933
GlnTrp: 0.535 ± 0.408
GlnTyr: 3.745 ± 1.307
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.35 ± 1.24
ArgCys: 1.605 ± 1.424
ArgAsp: 1.605 ± 1.172
ArgGlu: 1.605 ± 0.335
ArgPhe: 4.28 ± 1.562
ArgGly: 3.745 ± 1.643
ArgHis: 1.605 ± 0.784
ArgIle: 1.605 ± 0.789
ArgLys: 3.745 ± 0.929
ArgLeu: 9.096 ± 1.568
ArgMet: 0.0 ± 0.0
ArgAsn: 0.535 ± 0.408
ArgPro: 3.21 ± 2.199
ArgGln: 1.07 ± 0.865
ArgArg: 4.28 ± 1.941
ArgSer: 8.026 ± 2.943
ArgThr: 3.745 ± 0.929
ArgVal: 3.745 ± 1.894
ArgTrp: 0.535 ± 0.408
ArgTyr: 1.07 ± 0.451
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.675 ± 0.977
SerCys: 1.07 ± 1.488
SerAsp: 7.491 ± 1.702
SerGlu: 5.886 ± 0.551
SerPhe: 5.35 ± 2.031
SerGly: 6.421 ± 1.983
SerHis: 1.605 ± 0.704
SerIle: 5.35 ± 1.999
SerLys: 1.07 ± 0.865
SerLeu: 7.491 ± 1.452
SerMet: 1.605 ± 1.223
SerAsn: 1.605 ± 0.704
SerPro: 2.14 ± 0.903
SerGln: 2.14 ± 1.067
SerArg: 8.026 ± 1.919
SerSer: 5.886 ± 1.786
SerThr: 4.28 ± 1.626
SerVal: 5.35 ± 0.774
SerTrp: 0.535 ± 0.432
SerTyr: 1.605 ± 0.744
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.605 ± 0.704
ThrCys: 1.605 ± 0.704
ThrAsp: 7.491 ± 2.138
ThrGlu: 2.14 ± 0.661
ThrPhe: 2.14 ± 0.962
ThrGly: 5.35 ± 1.651
ThrHis: 0.0 ± 0.0
ThrIle: 2.14 ± 1.066
ThrLys: 2.14 ± 0.903
ThrLeu: 4.28 ± 1.842
ThrMet: 0.535 ± 0.408
ThrAsn: 3.21 ± 0.678
ThrPro: 6.956 ± 0.981
ThrGln: 2.14 ± 1.214
ThrArg: 3.745 ± 1.069
ThrSer: 2.675 ± 1.086
ThrThr: 3.745 ± 1.794
ThrVal: 4.28 ± 1.257
ThrTrp: 0.0 ± 0.0
ThrTyr: 1.07 ± 0.451
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.21 ± 1.294
ValCys: 1.605 ± 1.856
ValAsp: 4.815 ± 0.898
ValGlu: 3.21 ± 0.981
ValPhe: 3.745 ± 1.39
ValGly: 4.815 ± 1.639
ValHis: 2.675 ± 1.604
ValIle: 2.14 ± 0.642
ValLys: 1.605 ± 0.744
ValLeu: 4.28 ± 1.949
ValMet: 1.07 ± 0.815
ValAsn: 2.675 ± 1.753
ValPro: 3.21 ± 1.695
ValGln: 3.21 ± 0.961
ValArg: 4.815 ± 2.542
ValSer: 6.956 ± 1.966
ValThr: 5.35 ± 1.651
ValVal: 3.745 ± 2.784
ValTrp: 1.07 ± 0.865
ValTyr: 2.14 ± 0.962
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.07 ± 0.451
TrpCys: 0.0 ± 0.0
TrpAsp: 1.07 ± 0.451
TrpGlu: 0.535 ± 0.432
TrpPhe: 0.535 ± 0.408
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 1.605 ± 0.704
TrpLys: 0.535 ± 0.408
TrpLeu: 2.14 ± 0.903
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.535 ± 0.432
TrpArg: 1.07 ± 0.815
TrpSer: 0.535 ± 0.432
TrpThr: 0.0 ± 0.0
TrpVal: 0.535 ± 0.408
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.535 ± 0.408
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.21 ± 1.354
TyrCys: 0.0 ± 0.0
TyrAsp: 3.21 ± 1.528
TyrGlu: 2.675 ± 1.381
TyrPhe: 0.535 ± 0.432
TyrGly: 2.14 ± 0.962
TyrHis: 0.535 ± 0.432
TyrIle: 1.605 ± 0.784
TyrLys: 1.605 ± 0.744
TyrLeu: 3.21 ± 1.354
TyrMet: 1.07 ± 0.815
TyrAsn: 1.07 ± 0.533
TyrPro: 1.605 ± 0.845
TyrGln: 1.07 ± 0.933
TyrArg: 2.14 ± 0.996
TyrSer: 0.535 ± 0.944
TyrThr: 1.605 ± 0.789
TyrVal: 2.14 ± 0.642
TyrTrp: 1.07 ± 0.451
TyrTyr: 1.07 ± 0.913
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1870 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
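A protein-level bootstrap of this kind can be sketched as below. This is only an illustration under stated assumptions, not the database's implementation: it assumes "x100" means 100 resamples, that whole proteins are drawn with replacement, and that the reported error is the standard deviation of the resampled frequencies. The names `dipeptide_freq` and `bootstrap_error` are hypothetical.

```python
import random
import statistics


def dipeptide_freq(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide over a list of sequences."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Each resample draws len(proteins) whole proteins with replacement,
    so the unit of resampling is the protein, not the residue.
    Assumption: the error is the population standard deviation of the
    resampled frequencies.
    """
    rng = random.Random(seed)  # fixed seed for reproducibility
    values = []
    for _ in range(n_resamples):
        sample = [rng.choice(proteins) for _ in proteins]
        values.append(dipeptide_freq(sample, dipeptide))
    return statistics.pstdev(values)
```

With only 5 proteins, each resample can differ substantially from the original set, which is consistent with the relatively large errors in the table.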

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski