Amino acid dipeptide frequency for Equus caballus papillomavirus 4

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
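As a rough illustration of what such a calculation involves, the sketch below counts overlapping adjacent residue pairs and reports them in per mille. It is not the database's own pipeline: the function name `dipeptide_per_mille`, the toy sequences, the choice to count pairs only within each protein, and the mapping of any non-standard letter to X (Xaa) are all assumptions made for this example.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all adjacent residue pairs.

    Hypothetical helper: pairs are counted within each protein only,
    never across the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs at positions (0,1), (1,2), (2,3), ...
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not taken from the proteome described on this page.
toy = ["MAAGL", "AAK"]
freqs = dipeptide_per_mille(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

The one-letter pairs used here correspond to the three-letter names in the table, for example AA is AlaAla and AG is AlaGly.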

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they must be multiplied by 10^-3 to obtain fractions.
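The arithmetic behind the expected frequency and the per mille unit can be checked with a short sketch. The four example values are copied from the table on this page; the helper names and the simple "above or below the uniform expectation" rule are assumptions for illustration, since the page does not state the exact threshold used for the red and blue marking or whether the error estimate is taken into account.

```python
N_STANDARD = 20

# Uniform expectation in per mille: 1/400 = 0.25% = 2.5 per mille.
expected_400 = 1000.0 / N_STANDARD ** 2
# With Xaa as a 21st symbol: 1/441, about 2.268 per mille.
expected_441 = 1000.0 / (N_STANDARD + 1) ** 2


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=expected_400):
    """Hypothetical rule: compare a per mille value with the uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


# Values taken from the table (per mille).
table_values = {"AlaAla": 11.551, "GlyGly": 12.376, "CysCys": 0.0, "AlaIle": 2.475}
for name, value in table_values.items():
    print(f"{name}: fraction {per_mille_to_fraction(value):.6f}, {classify(value)}")
```

Under this simple rule AlaAla and GlyGly count as overrepresented, while CysCys and AlaIle fall below the 2.5‰ expectation.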

For more information, see the sequence space article on Wikipedia.

Second residue (columns): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Entries are grouped by first residue and listed as dipeptide: frequency ± error (‰).
Ala
AlaAla: 11.551 ± 2.437
AlaCys: 1.65 ± 0.731
AlaAsp: 3.3 ± 0.692
AlaGlu: 6.601 ± 1.441
AlaPhe: 2.888 ± 0.937
AlaGly: 7.013 ± 2.187
AlaHis: 0.825 ± 0.422
AlaIle: 2.475 ± 0.81
AlaLys: 4.125 ± 1.584
AlaLeu: 7.838 ± 2.413
AlaMet: 2.475 ± 0.945
AlaAsn: 2.475 ± 1.228
AlaPro: 5.776 ± 1.743
AlaGln: 3.3 ± 0.503
AlaArg: 5.363 ± 1.229
AlaSer: 4.538 ± 1.316
AlaThr: 3.713 ± 0.903
AlaVal: 6.601 ± 1.678
AlaTrp: 1.65 ± 1.081
AlaTyr: 3.3 ± 1.024
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.238 ± 0.512
CysCys: 0.0 ± 0.0
CysAsp: 1.238 ± 0.359
CysGlu: 0.825 ± 0.561
CysPhe: 0.413 ± 0.389
CysGly: 1.238 ± 0.762
CysHis: 0.825 ± 0.444
CysIle: 0.413 ± 0.327
CysLys: 1.65 ± 0.745
CysLeu: 2.475 ± 1.607
CysMet: 0.413 ± 0.327
CysAsn: 0.413 ± 0.34
CysPro: 1.238 ± 0.584
CysGln: 1.65 ± 0.667
CysArg: 2.888 ± 1.506
CysSer: 2.063 ± 0.736
CysThr: 1.65 ± 0.741
CysVal: 0.825 ± 0.397
CysTrp: 0.413 ± 0.34
CysTyr: 0.413 ± 0.489
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.95 ± 1.817
AspCys: 0.825 ± 0.626
AspAsp: 2.475 ± 1.12
AspGlu: 2.475 ± 0.955
AspPhe: 1.65 ± 0.455
AspGly: 5.776 ± 1.514
AspHis: 0.0 ± 0.0
AspIle: 2.475 ± 0.742
AspLys: 1.238 ± 0.292
AspLeu: 5.776 ± 1.253
AspMet: 1.65 ± 0.818
AspAsn: 2.063 ± 1.316
AspPro: 3.3 ± 0.503
AspGln: 1.238 ± 0.542
AspArg: 1.238 ± 0.66
AspSer: 3.713 ± 0.755
AspThr: 2.063 ± 0.827
AspVal: 2.888 ± 0.961
AspTrp: 0.825 ± 0.654
AspTyr: 0.825 ± 0.409
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.363 ± 1.205
GluCys: 1.65 ± 0.949
GluAsp: 4.538 ± 0.921
GluGlu: 3.713 ± 1.339
GluPhe: 2.888 ± 1.244
GluGly: 4.95 ± 2.463
GluHis: 0.413 ± 0.439
GluIle: 2.475 ± 0.584
GluLys: 2.475 ± 1.105
GluLeu: 4.125 ± 1.732
GluMet: 0.825 ± 0.397
GluAsn: 3.713 ± 0.491
GluPro: 3.713 ± 1.056
GluGln: 2.475 ± 0.752
GluArg: 1.65 ± 0.948
GluSer: 1.65 ± 1.168
GluThr: 5.363 ± 0.84
GluVal: 4.538 ± 2.317
GluTrp: 1.65 ± 0.562
GluTyr: 2.888 ± 1.338
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.3 ± 1.057
PheCys: 0.413 ± 0.489
PheAsp: 2.063 ± 1.262
PheGlu: 0.825 ± 0.679
PhePhe: 2.475 ± 0.641
PheGly: 4.95 ± 0.987
PheHis: 1.238 ± 0.738
PheIle: 1.65 ± 0.536
PheLys: 2.888 ± 1.593
PheLeu: 3.713 ± 0.64
PheMet: 0.0 ± 0.0
PheAsn: 1.65 ± 0.818
PhePro: 1.238 ± 0.71
PheGln: 2.475 ± 0.814
PheArg: 3.713 ± 1.258
PheSer: 4.125 ± 1.482
PheThr: 1.238 ± 0.539
PheVal: 0.825 ± 0.473
PheTrp: 3.3 ± 1.032
PheTyr: 1.65 ± 0.986
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.013 ± 1.008
GlyCys: 1.238 ± 0.677
GlyAsp: 4.95 ± 0.758
GlyGlu: 8.663 ± 2.777
GlyPhe: 1.65 ± 1.102
GlyGly: 12.376 ± 2.848
GlyHis: 3.3 ± 0.652
GlyIle: 1.238 ± 0.651
GlyLys: 1.238 ± 0.603
GlyLeu: 3.713 ± 1.143
GlyMet: 0.825 ± 0.473
GlyAsn: 2.888 ± 0.98
GlyPro: 5.363 ± 1.709
GlyGln: 1.238 ± 0.677
GlyArg: 6.188 ± 1.449
GlySer: 8.251 ± 3.013
GlyThr: 4.95 ± 0.488
GlyVal: 8.663 ± 1.012
GlyTrp: 0.825 ± 0.656
GlyTyr: 1.65 ± 0.708
GlyXaa: 0.0 ± 0.0
His
HisAla: 4.125 ± 0.819
HisCys: 0.0 ± 0.0
HisAsp: 0.413 ± 0.34
HisGlu: 0.825 ± 0.409
HisPhe: 0.825 ± 0.422
HisGly: 1.238 ± 0.584
HisHis: 0.825 ± 0.397
HisIle: 1.238 ± 0.702
HisLys: 0.413 ± 0.328
HisLeu: 2.475 ± 1.388
HisMet: 0.0 ± 0.0
HisAsn: 1.238 ± 0.292
HisPro: 2.475 ± 0.949
HisGln: 0.0 ± 0.0
HisArg: 0.825 ± 0.656
HisSer: 1.238 ± 0.835
HisThr: 0.413 ± 0.389
HisVal: 0.825 ± 0.397
HisTrp: 0.0 ± 0.0
HisTyr: 0.413 ± 0.328
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.238 ± 0.651
IleCys: 0.825 ± 0.878
IleAsp: 1.65 ± 0.455
IleGlu: 2.475 ± 0.835
IlePhe: 0.413 ± 0.34
IleGly: 2.888 ± 1.098
IleHis: 0.413 ± 0.34
IleIle: 0.825 ± 0.656
IleLys: 1.238 ± 1.019
IleLeu: 2.888 ± 0.795
IleMet: 0.413 ± 0.327
IleAsn: 0.413 ± 0.389
IlePro: 1.65 ± 0.628
IleGln: 0.413 ± 0.439
IleArg: 0.413 ± 0.439
IleSer: 2.475 ± 0.578
IleThr: 2.475 ± 0.752
IleVal: 1.238 ± 0.584
IleTrp: 0.413 ± 0.529
IleTyr: 1.238 ± 0.71
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.3 ± 1.107
LysCys: 1.238 ± 0.603
LysAsp: 0.825 ± 0.409
LysGlu: 0.825 ± 0.409
LysPhe: 2.063 ± 1.066
LysGly: 1.65 ± 0.687
LysHis: 1.238 ± 0.651
LysIle: 1.65 ± 0.99
LysLys: 1.238 ± 1.021
LysLeu: 3.713 ± 2.163
LysMet: 0.825 ± 0.409
LysAsn: 1.238 ± 0.677
LysPro: 1.238 ± 0.614
LysGln: 1.238 ± 0.705
LysArg: 4.95 ± 0.84
LysSer: 2.063 ± 1.636
LysThr: 3.3 ± 0.965
LysVal: 2.063 ± 0.947
LysTrp: 0.413 ± 0.389
LysTyr: 1.238 ± 0.659
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.013 ± 0.973
LeuCys: 2.475 ± 1.114
LeuAsp: 6.601 ± 1.305
LeuGlu: 4.95 ± 1.281
LeuPhe: 4.538 ± 0.565
LeuGly: 7.013 ± 2.483
LeuHis: 2.063 ± 0.815
LeuIle: 0.413 ± 0.439
LeuLys: 4.538 ± 1.556
LeuLeu: 9.076 ± 2.279
LeuMet: 0.825 ± 0.685
LeuAsn: 1.65 ± 0.461
LeuPro: 4.538 ± 1.251
LeuGln: 6.188 ± 2.117
LeuArg: 7.013 ± 3.791
LeuSer: 8.663 ± 1.428
LeuThr: 3.3 ± 0.702
LeuVal: 4.125 ± 1.19
LeuTrp: 1.238 ± 0.292
LeuTyr: 1.238 ± 0.664
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.475 ± 0.768
MetCys: 0.413 ± 0.328
MetAsp: 0.825 ± 0.654
MetGlu: 0.825 ± 0.473
MetPhe: 0.0 ± 0.0
MetGly: 0.825 ± 0.422
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 0.825 ± 0.575
MetLeu: 1.65 ± 0.498
MetMet: 0.0 ± 0.0
MetAsn: 0.413 ± 0.327
MetPro: 0.413 ± 0.34
MetGln: 0.413 ± 0.327
MetArg: 1.238 ± 0.477
MetSer: 2.063 ± 0.568
MetThr: 0.413 ± 0.328
MetVal: 1.65 ± 0.794
MetTrp: 0.413 ± 0.328
MetTyr: 0.825 ± 0.397
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.063 ± 0.756
AsnCys: 0.413 ± 0.327
AsnAsp: 0.825 ± 0.561
AsnGlu: 1.238 ± 0.738
AsnPhe: 0.825 ± 0.409
AsnGly: 2.888 ± 1.314
AsnHis: 0.413 ± 0.489
AsnIle: 0.413 ± 0.34
AsnLys: 1.65 ± 0.99
AsnLeu: 2.888 ± 1.033
AsnMet: 1.238 ± 0.431
AsnAsn: 0.825 ± 0.409
AsnPro: 3.713 ± 1.379
AsnGln: 1.65 ± 0.667
AsnArg: 3.713 ± 1.166
AsnSer: 2.475 ± 1.141
AsnThr: 2.063 ± 0.966
AsnVal: 0.825 ± 0.579
AsnTrp: 0.413 ± 0.327
AsnTyr: 1.65 ± 0.961
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 8.251 ± 2.205
ProCys: 1.65 ± 0.745
ProAsp: 2.063 ± 1.129
ProGlu: 5.776 ± 0.738
ProPhe: 2.888 ± 0.927
ProGly: 5.363 ± 2.378
ProHis: 0.413 ± 0.328
ProIle: 2.475 ± 0.584
ProLys: 2.888 ± 0.795
ProLeu: 4.95 ± 0.632
ProMet: 0.413 ± 0.399
ProAsn: 2.063 ± 1.066
ProPro: 10.726 ± 3.848
ProGln: 2.475 ± 1.089
ProArg: 5.363 ± 1.158
ProSer: 6.188 ± 2.372
ProThr: 2.063 ± 0.978
ProVal: 3.713 ± 1.048
ProTrp: 0.413 ± 0.328
ProTyr: 2.063 ± 0.849
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.125 ± 0.777
GlnCys: 0.413 ± 0.34
GlnAsp: 2.063 ± 0.827
GlnGlu: 2.063 ± 1.636
GlnPhe: 1.65 ± 0.761
GlnGly: 4.125 ± 1.722
GlnHis: 0.413 ± 0.327
GlnIle: 1.238 ± 0.677
GlnLys: 1.238 ± 0.431
GlnLeu: 4.125 ± 1.525
GlnMet: 0.413 ± 0.328
GlnAsn: 0.825 ± 0.679
GlnPro: 3.713 ± 1.107
GlnGln: 3.3 ± 1.0
GlnArg: 1.238 ± 0.431
GlnSer: 0.825 ± 0.628
GlnThr: 4.125 ± 1.022
GlnVal: 3.713 ± 1.487
GlnTrp: 0.825 ± 0.579
GlnTyr: 1.65 ± 0.577
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.188 ± 1.539
ArgCys: 3.3 ± 0.933
ArgAsp: 2.475 ± 0.969
ArgGlu: 2.475 ± 0.982
ArgPhe: 3.713 ± 0.958
ArgGly: 4.95 ± 0.823
ArgHis: 1.65 ± 0.99
ArgIle: 1.65 ± 0.503
ArgLys: 3.713 ± 0.981
ArgLeu: 7.013 ± 1.834
ArgMet: 1.238 ± 0.532
ArgAsn: 3.713 ± 1.469
ArgPro: 4.125 ± 0.621
ArgGln: 4.125 ± 1.237
ArgArg: 8.251 ± 1.805
ArgSer: 3.3 ± 1.35
ArgThr: 1.238 ± 0.76
ArgVal: 5.776 ± 1.438
ArgTrp: 0.825 ± 0.569
ArgTyr: 2.888 ± 0.705
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.426 ± 2.012
SerCys: 2.475 ± 1.07
SerAsp: 2.063 ± 0.888
SerGlu: 4.538 ± 2.416
SerPhe: 4.538 ± 1.409
SerGly: 3.713 ± 0.93
SerHis: 1.65 ± 0.72
SerIle: 2.063 ± 0.912
SerLys: 1.238 ± 1.019
SerLeu: 5.363 ± 1.291
SerMet: 1.65 ± 0.635
SerAsn: 2.475 ± 1.045
SerPro: 7.838 ± 1.848
SerGln: 2.475 ± 1.586
SerArg: 3.713 ± 1.07
SerSer: 8.663 ± 1.232
SerThr: 7.426 ± 1.421
SerVal: 6.188 ± 2.195
SerTrp: 0.413 ± 0.389
SerTyr: 0.825 ± 0.397
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.713 ± 0.842
ThrCys: 1.238 ± 0.807
ThrAsp: 2.063 ± 0.947
ThrGlu: 4.125 ± 0.925
ThrPhe: 3.3 ± 1.589
ThrGly: 4.538 ± 1.791
ThrHis: 0.825 ± 0.654
ThrIle: 1.238 ± 0.738
ThrLys: 2.063 ± 0.668
ThrLeu: 6.601 ± 0.721
ThrMet: 0.413 ± 0.293
ThrAsn: 0.825 ± 0.409
ThrPro: 4.95 ± 1.919
ThrGln: 2.063 ± 1.002
ThrArg: 4.538 ± 1.223
ThrSer: 6.188 ± 2.274
ThrThr: 2.063 ± 0.896
ThrVal: 4.95 ± 1.307
ThrTrp: 0.825 ± 0.656
ThrTyr: 1.238 ± 0.466
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.65 ± 0.667
ValCys: 1.65 ± 0.741
ValAsp: 4.95 ± 1.206
ValGlu: 5.776 ± 1.09
ValPhe: 3.3 ± 0.795
ValGly: 7.013 ± 3.228
ValHis: 1.65 ± 0.498
ValIle: 1.65 ± 0.487
ValLys: 0.0 ± 0.0
ValLeu: 3.713 ± 0.832
ValMet: 0.825 ± 0.397
ValAsn: 2.063 ± 0.926
ValPro: 3.713 ± 0.748
ValGln: 3.713 ± 0.874
ValArg: 4.95 ± 1.057
ValSer: 7.838 ± 1.208
ValThr: 5.776 ± 0.83
ValVal: 2.888 ± 1.488
ValTrp: 1.238 ± 0.631
ValTyr: 0.825 ± 0.437
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.825 ± 0.409
TrpCys: 0.413 ± 0.439
TrpAsp: 0.825 ± 0.579
TrpGlu: 0.413 ± 0.34
TrpPhe: 0.825 ± 0.337
TrpGly: 2.063 ± 0.693
TrpHis: 0.825 ± 0.437
TrpIle: 0.0 ± 0.0
TrpLys: 1.238 ± 0.553
TrpLeu: 2.888 ± 0.937
TrpMet: 0.0 ± 0.0
TrpAsn: 0.825 ± 0.575
TrpPro: 0.413 ± 0.529
TrpGln: 0.0 ± 0.0
TrpArg: 2.888 ± 1.535
TrpSer: 0.0 ± 0.0
TrpThr: 2.063 ± 0.804
TrpVal: 1.238 ± 0.548
TrpTrp: 0.413 ± 0.328
TrpTyr: 0.413 ± 0.328
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.65 ± 0.628
TyrCys: 0.0 ± 0.0
TyrAsp: 1.238 ± 0.92
TyrGlu: 1.65 ± 0.757
TyrPhe: 2.888 ± 0.999
TyrGly: 1.65 ± 0.536
TyrHis: 0.825 ± 0.506
TyrIle: 0.413 ± 0.34
TyrLys: 0.413 ± 0.34
TyrLeu: 2.888 ± 0.703
TyrMet: 0.825 ± 0.313
TyrAsn: 0.413 ± 0.389
TyrPro: 2.063 ± 0.971
TyrGln: 1.65 ± 0.757
TyrArg: 2.475 ± 0.957
TyrSer: 0.413 ± 0.34
TyrThr: 2.063 ± 0.662
TyrVal: 1.65 ± 1.358
TyrTrp: 2.063 ± 0.947
TyrTyr: 2.063 ± 0.934
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2425 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
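To make the idea of a protein-level bootstrap concrete, here is a minimal sketch: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread of those replicates is reported. The page does not say which spread statistic is used, so taking the sample standard deviation is an assumption, as are the function names, the fixed seed, and the toy sequences.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Per mille frequency of one dipeptide (one-letter codes) in a protein set."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Hypothetical protein-level bootstrap: resample whole proteins with replacement."""
    rng = random.Random(seed)
    replicates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(pair_frequency(sample, pair))
    # Assumption: the reported error is the standard deviation of the replicates.
    return statistics.stdev(replicates)


# Toy sequences, not the 7 proteins of this proteome.
toy = ["MAAGLAAK", "MKRRLS", "MAAPPGG", "MSSTLV"]
estimate = pair_frequency(toy, "AA")
error = bootstrap_error(toy, "AA")
print(f"AA: {estimate:.3f} ± {error:.3f}")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so with only a handful of proteins the resulting error can be large relative to the estimate.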

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski