Amino acid dipepetide frequency for Equus caballus papillomavirus 3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.147AlaAla: 8.147 ± 3.077
0.815AlaCys: 0.815 ± 0.659
2.444AlaAsp: 2.444 ± 0.449
4.481AlaGlu: 4.481 ± 1.255
2.037AlaPhe: 2.037 ± 0.865
5.295AlaGly: 5.295 ± 1.647
2.444AlaHis: 2.444 ± 0.956
2.851AlaIle: 2.851 ± 0.53
3.259AlaLys: 3.259 ± 1.044
5.703AlaLeu: 5.703 ± 1.327
0.407AlaMet: 0.407 ± 0.384
1.629AlaAsn: 1.629 ± 0.596
2.037AlaPro: 2.037 ± 0.66
3.259AlaGln: 3.259 ± 1.04
3.259AlaArg: 3.259 ± 0.984
2.037AlaSer: 2.037 ± 0.741
2.851AlaThr: 2.851 ± 0.645
4.073AlaVal: 4.073 ± 0.748
1.629AlaTrp: 1.629 ± 0.772
2.037AlaTyr: 2.037 ± 0.674
0.0AlaXaa: 0.0 ± 0.0
Cys
0.815CysAla: 0.815 ± 0.613
0.407CysCys: 0.407 ± 0.348
0.407CysAsp: 0.407 ± 0.348
0.407CysGlu: 0.407 ± 0.376
1.222CysPhe: 1.222 ± 0.771
2.037CysGly: 2.037 ± 1.279
0.815CysHis: 0.815 ± 0.654
0.0CysIle: 0.0 ± 0.0
2.037CysLys: 2.037 ± 1.123
2.037CysLeu: 2.037 ± 1.146
0.815CysMet: 0.815 ± 0.418
0.815CysAsn: 0.815 ± 0.654
3.259CysPro: 3.259 ± 1.107
1.629CysGln: 1.629 ± 0.846
2.037CysArg: 2.037 ± 1.146
2.444CysSer: 2.444 ± 1.332
0.815CysThr: 0.815 ± 0.695
0.815CysVal: 0.815 ± 0.817
0.815CysTrp: 0.815 ± 0.418
1.222CysTyr: 1.222 ± 0.628
0.0CysXaa: 0.0 ± 0.0
Asp
4.481AspAla: 4.481 ± 0.785
2.037AspCys: 2.037 ± 1.021
3.666AspAsp: 3.666 ± 1.332
5.295AspGlu: 5.295 ± 1.601
1.222AspPhe: 1.222 ± 0.492
4.888AspGly: 4.888 ± 1.18
0.407AspHis: 0.407 ± 0.384
2.444AspIle: 2.444 ± 1.378
0.815AspLys: 0.815 ± 0.418
4.481AspLeu: 4.481 ± 0.759
0.0AspMet: 0.0 ± 0.0
2.851AspAsn: 2.851 ± 0.53
3.666AspPro: 3.666 ± 1.419
0.815AspGln: 0.815 ± 0.418
1.629AspArg: 1.629 ± 0.657
6.517AspSer: 6.517 ± 1.335
5.703AspThr: 5.703 ± 1.374
3.666AspVal: 3.666 ± 0.991
0.815AspTrp: 0.815 ± 0.695
1.629AspTyr: 1.629 ± 0.596
0.0AspXaa: 0.0 ± 0.0
Glu
3.666GluAla: 3.666 ± 1.433
0.407GluCys: 0.407 ± 0.348
3.666GluAsp: 3.666 ± 1.559
4.073GluGlu: 4.073 ± 1.456
2.444GluPhe: 2.444 ± 0.524
5.295GluGly: 5.295 ± 1.955
0.815GluHis: 0.815 ± 0.613
2.444GluIle: 2.444 ± 0.713
2.037GluLys: 2.037 ± 1.102
6.11GluLeu: 6.11 ± 0.865
2.037GluMet: 2.037 ± 0.764
1.629GluAsn: 1.629 ± 0.657
4.073GluPro: 4.073 ± 1.054
2.037GluGln: 2.037 ± 0.894
4.481GluArg: 4.481 ± 0.996
2.851GluSer: 2.851 ± 0.628
4.073GluThr: 4.073 ± 1.029
3.259GluVal: 3.259 ± 1.108
0.0GluTrp: 0.0 ± 0.0
0.815GluTyr: 0.815 ± 0.768
0.0GluXaa: 0.0 ± 0.0
Phe
1.222PheAla: 1.222 ± 0.391
1.629PheCys: 1.629 ± 0.656
3.259PheAsp: 3.259 ± 0.926
0.815PheGlu: 0.815 ± 0.459
2.037PhePhe: 2.037 ± 1.017
1.222PheGly: 1.222 ± 0.505
0.407PheHis: 0.407 ± 0.336
1.222PheIle: 1.222 ± 0.561
2.037PheLys: 2.037 ± 0.704
4.073PheLeu: 4.073 ± 0.786
0.407PheMet: 0.407 ± 0.376
0.815PheAsn: 0.815 ± 0.459
1.222PhePro: 1.222 ± 0.643
2.444PheGln: 2.444 ± 0.929
2.444PheArg: 2.444 ± 0.745
2.851PheSer: 2.851 ± 1.069
1.629PheThr: 1.629 ± 0.919
1.222PheVal: 1.222 ± 0.374
1.629PheTrp: 1.629 ± 0.642
1.222PheTyr: 1.222 ± 0.551
0.0PheXaa: 0.0 ± 0.0
Gly
2.444GlyAla: 2.444 ± 0.353
2.037GlyCys: 2.037 ± 0.819
3.259GlyAsp: 3.259 ± 1.225
2.851GlyGlu: 2.851 ± 0.919
2.037GlyPhe: 2.037 ± 0.74
8.554GlyGly: 8.554 ± 3.554
2.851GlyHis: 2.851 ± 0.497
1.629GlyIle: 1.629 ± 0.574
2.851GlyLys: 2.851 ± 1.376
4.888GlyLeu: 4.888 ± 1.668
0.407GlyMet: 0.407 ± 0.348
2.444GlyAsn: 2.444 ± 1.213
7.332GlyPro: 7.332 ± 2.427
2.037GlyGln: 2.037 ± 0.976
5.703GlyArg: 5.703 ± 2.064
10.183GlySer: 10.183 ± 3.586
5.295GlyThr: 5.295 ± 1.146
6.517GlyVal: 6.517 ± 1.189
0.407GlyTrp: 0.407 ± 0.348
2.037GlyTyr: 2.037 ± 0.969
0.0GlyXaa: 0.0 ± 0.0
His
0.407HisAla: 0.407 ± 0.348
0.0HisCys: 0.0 ± 0.0
1.222HisAsp: 1.222 ± 0.446
0.815HisGlu: 0.815 ± 0.586
2.444HisPhe: 2.444 ± 0.353
0.815HisGly: 0.815 ± 0.613
0.407HisHis: 0.407 ± 0.527
0.815HisIle: 0.815 ± 0.5
1.222HisLys: 1.222 ± 1.043
2.851HisLeu: 2.851 ± 0.828
0.407HisMet: 0.407 ± 0.527
1.222HisAsn: 1.222 ± 0.666
1.222HisPro: 1.222 ± 0.551
0.815HisGln: 0.815 ± 0.489
0.815HisArg: 0.815 ± 0.586
0.407HisSer: 0.407 ± 0.348
1.222HisThr: 1.222 ± 0.629
2.444HisVal: 2.444 ± 0.765
0.815HisTrp: 0.815 ± 0.442
2.037HisTyr: 2.037 ± 0.765
0.0HisXaa: 0.0 ± 0.0
Ile
1.629IleAla: 1.629 ± 1.068
1.629IleCys: 1.629 ± 0.727
2.037IleAsp: 2.037 ± 0.717
3.259IleGlu: 3.259 ± 0.702
0.815IlePhe: 0.815 ± 0.768
1.629IleGly: 1.629 ± 1.008
0.407IleHis: 0.407 ± 0.348
0.407IleIle: 0.407 ± 0.353
1.629IleLys: 1.629 ± 0.774
2.444IleLeu: 2.444 ± 0.749
1.629IleMet: 1.629 ± 0.622
1.222IleAsn: 1.222 ± 0.777
3.259IlePro: 3.259 ± 0.84
1.222IleGln: 1.222 ± 0.632
0.407IleArg: 0.407 ± 0.527
2.037IleSer: 2.037 ± 0.894
1.222IleThr: 1.222 ± 0.391
2.851IleVal: 2.851 ± 1.367
0.407IleTrp: 0.407 ± 0.409
1.629IleTyr: 1.629 ± 0.657
0.0IleXaa: 0.0 ± 0.0
Lys
4.481LysAla: 4.481 ± 2.33
1.222LysCys: 1.222 ± 0.391
2.037LysAsp: 2.037 ± 1.048
1.629LysGlu: 1.629 ± 0.877
0.0LysPhe: 0.0 ± 0.0
2.444LysGly: 2.444 ± 1.02
0.815LysHis: 0.815 ± 0.489
0.407LysIle: 0.407 ± 0.503
1.222LysLys: 1.222 ± 0.721
4.073LysLeu: 4.073 ± 1.433
1.222LysMet: 1.222 ± 1.028
0.815LysAsn: 0.815 ± 0.418
2.851LysPro: 2.851 ± 0.843
2.851LysGln: 2.851 ± 1.155
3.666LysArg: 3.666 ± 1.148
5.703LysSer: 5.703 ± 1.467
3.666LysThr: 3.666 ± 0.749
4.073LysVal: 4.073 ± 1.137
0.0LysTrp: 0.0 ± 0.0
2.037LysTyr: 2.037 ± 0.669
0.0LysXaa: 0.0 ± 0.0
Leu
2.851LeuAla: 2.851 ± 0.645
2.037LeuCys: 2.037 ± 0.74
5.703LeuAsp: 5.703 ± 1.23
4.073LeuGlu: 4.073 ± 1.599
4.481LeuPhe: 4.481 ± 1.628
6.11LeuGly: 6.11 ± 1.542
1.629LeuHis: 1.629 ± 0.952
2.851LeuIle: 2.851 ± 1.252
6.11LeuLys: 6.11 ± 1.715
10.183LeuLeu: 10.183 ± 1.894
0.815LeuMet: 0.815 ± 0.668
1.629LeuAsn: 1.629 ± 0.633
6.11LeuPro: 6.11 ± 2.331
8.147LeuGln: 8.147 ± 1.052
5.295LeuArg: 5.295 ± 0.966
7.332LeuSer: 7.332 ± 1.452
4.888LeuThr: 4.888 ± 1.632
6.11LeuVal: 6.11 ± 1.418
0.407LeuTrp: 0.407 ± 0.348
2.851LeuTyr: 2.851 ± 1.984
0.0LeuXaa: 0.0 ± 0.0
Met
0.407MetAla: 0.407 ± 0.376
0.407MetCys: 0.407 ± 0.384
1.629MetAsp: 1.629 ± 0.688
0.815MetGlu: 0.815 ± 0.613
1.629MetPhe: 1.629 ± 0.838
0.0MetGly: 0.0 ± 0.0
0.815MetHis: 0.815 ± 0.442
0.815MetIle: 0.815 ± 0.459
0.815MetLys: 0.815 ± 0.695
1.629MetLeu: 1.629 ± 0.883
0.407MetMet: 0.407 ± 0.376
0.407MetAsn: 0.407 ± 0.384
0.0MetPro: 0.0 ± 0.0
1.222MetGln: 1.222 ± 0.662
0.815MetArg: 0.815 ± 0.489
2.444MetSer: 2.444 ± 0.66
1.222MetThr: 1.222 ± 0.391
2.037MetVal: 2.037 ± 1.738
0.407MetTrp: 0.407 ± 0.376
1.629MetTyr: 1.629 ± 0.919
0.0MetXaa: 0.0 ± 0.0
Asn
0.407AsnAla: 0.407 ± 0.348
0.815AsnCys: 0.815 ± 0.489
0.815AsnAsp: 0.815 ± 0.5
2.444AsnGlu: 2.444 ± 1.218
1.222AsnPhe: 1.222 ± 0.734
1.222AsnGly: 1.222 ± 0.724
1.222AsnHis: 1.222 ± 0.758
0.0AsnIle: 0.0 ± 0.0
2.851AsnLys: 2.851 ± 2.264
2.037AsnLeu: 2.037 ± 0.456
0.407AsnMet: 0.407 ± 0.336
0.815AsnAsn: 0.815 ± 0.418
1.222AsnPro: 1.222 ± 0.613
2.037AsnGln: 2.037 ± 0.729
2.851AsnArg: 2.851 ± 1.023
3.259AsnSer: 3.259 ± 0.931
2.444AsnThr: 2.444 ± 1.03
2.037AsnVal: 2.037 ± 0.824
0.815AsnTrp: 0.815 ± 0.442
0.815AsnTyr: 0.815 ± 0.695
0.0AsnXaa: 0.0 ± 0.0
Pro
6.11ProAla: 6.11 ± 1.601
2.444ProCys: 2.444 ± 0.789
5.703ProAsp: 5.703 ± 1.64
4.073ProGlu: 4.073 ± 1.285
0.815ProPhe: 0.815 ± 0.368
4.888ProGly: 4.888 ± 1.49
1.629ProHis: 1.629 ± 0.561
2.037ProIle: 2.037 ± 0.74
2.444ProLys: 2.444 ± 0.658
7.332ProLeu: 7.332 ± 1.175
1.222ProMet: 1.222 ± 0.804
1.222ProAsn: 1.222 ± 0.391
7.332ProPro: 7.332 ± 1.757
2.851ProGln: 2.851 ± 1.179
3.259ProArg: 3.259 ± 1.108
6.925ProSer: 6.925 ± 3.222
2.851ProThr: 2.851 ± 0.632
4.073ProVal: 4.073 ± 1.062
0.407ProTrp: 0.407 ± 0.376
3.259ProTyr: 3.259 ± 1.423
0.0ProXaa: 0.0 ± 0.0
Gln
4.073GlnAla: 4.073 ± 0.776
0.815GlnCys: 0.815 ± 0.695
0.815GlnAsp: 0.815 ± 0.418
2.037GlnGlu: 2.037 ± 0.669
0.0GlnPhe: 0.0 ± 0.0
3.259GlnGly: 3.259 ± 0.67
0.0GlnHis: 0.0 ± 0.0
2.037GlnIle: 2.037 ± 0.905
2.037GlnLys: 2.037 ± 0.559
3.666GlnLeu: 3.666 ± 1.366
3.666GlnMet: 3.666 ± 1.118
2.037GlnAsn: 2.037 ± 0.724
2.851GlnPro: 2.851 ± 0.813
3.259GlnGln: 3.259 ± 0.569
3.666GlnArg: 3.666 ± 1.546
2.037GlnSer: 2.037 ± 0.518
2.037GlnThr: 2.037 ± 0.514
5.295GlnVal: 5.295 ± 1.063
1.629GlnTrp: 1.629 ± 0.692
2.037GlnTyr: 2.037 ± 1.207
0.0GlnXaa: 0.0 ± 0.0
Arg
6.11ArgAla: 6.11 ± 2.103
0.815ArgCys: 0.815 ± 0.637
2.037ArgAsp: 2.037 ± 0.443
4.073ArgGlu: 4.073 ± 0.791
2.851ArgPhe: 2.851 ± 0.995
7.739ArgGly: 7.739 ± 2.936
1.629ArgHis: 1.629 ± 1.173
0.815ArgIle: 0.815 ± 0.756
3.666ArgLys: 3.666 ± 0.956
6.11ArgLeu: 6.11 ± 1.418
0.815ArgMet: 0.815 ± 0.527
2.444ArgAsn: 2.444 ± 0.897
4.073ArgPro: 4.073 ± 1.058
2.444ArgGln: 2.444 ± 0.491
8.147ArgArg: 8.147 ± 1.904
3.666ArgSer: 3.666 ± 1.114
3.259ArgThr: 3.259 ± 1.896
4.073ArgVal: 4.073 ± 1.276
0.815ArgTrp: 0.815 ± 0.5
1.222ArgTyr: 1.222 ± 0.391
0.0ArgXaa: 0.0 ± 0.0
Ser
3.666SerAla: 3.666 ± 0.869
1.629SerCys: 1.629 ± 0.772
5.703SerAsp: 5.703 ± 1.167
4.888SerGlu: 4.888 ± 1.039
2.037SerPhe: 2.037 ± 0.954
6.11SerGly: 6.11 ± 0.993
1.629SerHis: 1.629 ± 0.558
5.295SerIle: 5.295 ± 0.97
2.444SerLys: 2.444 ± 1.007
5.703SerLeu: 5.703 ± 1.112
0.407SerMet: 0.407 ± 0.353
2.037SerAsn: 2.037 ± 1.123
8.554SerPro: 8.554 ± 3.256
2.037SerGln: 2.037 ± 1.057
4.888SerArg: 4.888 ± 0.771
7.332SerSer: 7.332 ± 1.9
5.703SerThr: 5.703 ± 0.848
6.925SerVal: 6.925 ± 2.289
1.222SerTrp: 1.222 ± 0.666
2.851SerTyr: 2.851 ± 0.693
0.0SerXaa: 0.0 ± 0.0
Thr
2.444ThrAla: 2.444 ± 0.841
2.037ThrCys: 2.037 ± 1.31
3.666ThrAsp: 3.666 ± 1.726
2.444ThrGlu: 2.444 ± 0.449
1.629ThrPhe: 1.629 ± 0.617
5.295ThrGly: 5.295 ± 1.415
0.815ThrHis: 0.815 ± 0.442
2.444ThrIle: 2.444 ± 0.92
2.037ThrLys: 2.037 ± 0.669
3.259ThrLeu: 3.259 ± 1.255
2.444ThrMet: 2.444 ± 0.984
2.037ThrAsn: 2.037 ± 0.443
4.888ThrPro: 4.888 ± 1.252
2.037ThrGln: 2.037 ± 0.719
4.481ThrArg: 4.481 ± 0.955
4.481ThrSer: 4.481 ± 0.69
5.295ThrThr: 5.295 ± 1.061
3.666ThrVal: 3.666 ± 1.253
1.629ThrTrp: 1.629 ± 0.888
1.629ThrTyr: 1.629 ± 0.685
0.0ThrXaa: 0.0 ± 0.0
Val
2.037ValAla: 2.037 ± 0.636
1.222ValCys: 1.222 ± 1.116
5.703ValAsp: 5.703 ± 1.297
5.295ValGlu: 5.295 ± 1.437
2.444ValPhe: 2.444 ± 0.596
6.517ValGly: 6.517 ± 1.741
2.444ValHis: 2.444 ± 0.864
2.037ValIle: 2.037 ± 0.954
2.851ValLys: 2.851 ± 0.943
7.332ValLeu: 7.332 ± 1.904
0.815ValMet: 0.815 ± 0.751
2.851ValAsn: 2.851 ± 1.067
4.481ValPro: 4.481 ± 1.92
3.259ValGln: 3.259 ± 0.802
3.259ValArg: 3.259 ± 0.905
6.925ValSer: 6.925 ± 1.437
2.444ValThr: 2.444 ± 0.524
6.925ValVal: 6.925 ± 2.302
1.629ValTrp: 1.629 ± 0.878
2.851ValTyr: 2.851 ± 0.864
0.0ValXaa: 0.0 ± 0.0
Trp
1.629TrpAla: 1.629 ± 0.248
0.815TrpCys: 0.815 ± 0.571
1.629TrpAsp: 1.629 ± 0.663
0.815TrpGlu: 0.815 ± 0.571
1.222TrpPhe: 1.222 ± 0.743
0.815TrpGly: 0.815 ± 0.62
0.407TrpHis: 0.407 ± 0.384
0.407TrpIle: 0.407 ± 0.376
2.037TrpLys: 2.037 ± 1.021
2.444TrpLeu: 2.444 ± 1.174
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.407TrpPro: 0.407 ± 0.348
0.407TrpGln: 0.407 ± 0.348
2.037TrpArg: 2.037 ± 0.914
0.0TrpSer: 0.0 ± 0.0
0.407TrpThr: 0.407 ± 0.376
1.629TrpVal: 1.629 ± 0.596
0.0TrpTrp: 0.0 ± 0.0
0.815TrpTyr: 0.815 ± 0.442
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.666TyrAla: 3.666 ± 1.095
1.629TyrCys: 1.629 ± 0.971
1.629TyrAsp: 1.629 ± 0.663
1.629TyrGlu: 1.629 ± 0.673
1.222TyrPhe: 1.222 ± 0.732
1.629TyrGly: 1.629 ± 0.804
0.815TyrHis: 0.815 ± 0.489
0.815TyrIle: 0.815 ± 0.459
1.222TyrLys: 1.222 ± 1.043
3.259TyrLeu: 3.259 ± 0.777
0.815TyrMet: 0.815 ± 0.695
0.815TyrAsn: 0.815 ± 0.459
2.037TyrPro: 2.037 ± 0.908
2.444TyrGln: 2.444 ± 1.002
3.666TyrArg: 3.666 ± 1.438
1.629TyrSer: 1.629 ± 0.657
1.629TyrThr: 1.629 ± 0.699
1.629TyrVal: 1.629 ± 0.919
2.444TyrTrp: 2.444 ± 1.01
2.851TyrTyr: 2.851 ± 1.227
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2456 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski