Amino acid dipeptide frequency for Equus caballus papillomavirus 2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
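The counting described above can be sketched in a few lines of Python. This is a minimal illustration and not the database's own code: the function name `dipeptide_permille`, the use of overlapping pairs within each protein, and the mapping of any non-standard letter to `X` (Xaa) are assumptions made for the example, and the two input sequences are toy data.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs.

    Assumption: pairs are counted within each protein only (never across
    protein boundaries) and any non-standard residue is treated as X (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Uniform expectation for the 400 standard dipeptides: 1/400 = 0.25 % = 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / 400

# Toy input (not the real proteome): 4 + 3 = 7 pairs, each occurring once.
freqs = dipeptide_permille(["MAAGL", "GGAB"])
```

With real sequences, the resulting dictionary can be compared entry by entry against `UNIFORM_PERMILLE`.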

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
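As a small worked example of the unit conversion, the sketch below turns a per mille value into a plain fraction and compares it with the uniform expectation of 2.5‰. The helper names are hypothetical, and the simple greater-than/less-than comparison is an assumption; the exact rule used for the colouring on the page is not stated here.

```python
UNIFORM_PERMILLE = 1000.0 / 400  # 2.5 per mille, the uniform expectation for 400 dipeptides

def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def representation(value_permille, expected=UNIFORM_PERMILLE):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "as expected"

# Values taken from the table: GlyGly = 12.733 per mille, AlaHis = 0.91 per mille.
gly_gly_fraction = permille_to_fraction(12.733)
gly_gly_label = representation(12.733)
ala_his_label = representation(0.91)
```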

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second; each cell is frequency ± error in ‰.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 7.276 ± 1.267 | 2.274 ± 0.931 | 6.367 ± 0.698 | 5.002 ± 1.458 | 3.183 ± 1.545 | 6.367 ± 1.715 | 0.91 ± 0.377 | 0.91 ± 0.47 | 4.548 ± 0.909 | 6.821 ± 0.538 | 2.274 ± 0.468 | 0.91 ± 0.579 | 4.548 ± 1.726 | 3.183 ± 0.807 | 4.548 ± 0.66 | 4.548 ± 0.983 | 3.638 ± 0.792 | 0.91 ± 0.419 | 0.0 ± 0.0 | 1.819 ± 0.948 | 0.0 ± 0.0
Cys | 1.819 ± 1.288 | 0.91 ± 0.565 | 1.364 ± 0.613 | 2.274 ± 0.657 | 1.364 ± 0.435 | 3.183 ± 2.517 | 0.0 ± 0.0 | 1.364 ± 1.02 | 1.364 ± 0.688 | 1.364 ± 0.804 | 1.364 ± 0.555 | 0.91 ± 0.594 | 1.364 ± 0.642 | 0.91 ± 0.587 | 1.364 ± 1.526 | 0.91 ± 0.47 | 1.819 ± 1.523 | 0.455 ± 0.406 | 0.455 ± 0.353 | 0.455 ± 0.509 | 0.0 ± 0.0
Asp | 5.912 ± 1.074 | 2.274 ± 1.029 | 2.274 ± 0.874 | 3.183 ± 1.109 | 2.274 ± 0.86 | 5.002 ± 0.54 | 0.91 ± 0.489 | 3.183 ± 1.2 | 2.274 ± 1.764 | 5.002 ± 1.7 | 0.91 ± 0.377 | 2.274 ± 0.43 | 2.729 ± 1.338 | 2.729 ± 0.809 | 1.819 ± 0.63 | 2.274 ± 1.577 | 4.093 ± 0.983 | 4.548 ± 1.008 | 0.91 ± 0.694 | 1.364 ± 0.439 | 0.0 ± 0.0
Glu | 3.638 ± 1.02 | 0.0 ± 0.0 | 3.638 ± 1.537 | 3.638 ± 1.199 | 2.274 ± 0.693 | 5.002 ± 0.936 | 1.364 ± 0.608 | 2.274 ± 0.633 | 0.91 ± 0.578 | 4.548 ± 1.222 | 0.91 ± 0.489 | 2.274 ± 0.842 | 3.183 ± 1.022 | 1.819 ± 0.978 | 5.002 ± 1.454 | 2.729 ± 0.837 | 1.819 ± 0.578 | 5.457 ± 1.173 | 0.91 ± 0.377 | 0.91 ± 0.706 | 0.0 ± 0.0
Phe | 2.729 ± 0.766 | 1.364 ± 0.871 | 3.638 ± 0.719 | 3.183 ± 1.154 | 3.183 ± 1.026 | 3.183 ± 0.671 | 0.455 ± 0.34 | 1.819 ± 0.572 | 2.274 ± 0.711 | 3.638 ± 0.76 | 0.455 ± 0.347 | 1.819 ± 0.572 | 2.274 ± 0.969 | 0.91 ± 0.419 | 1.819 ± 0.88 | 5.002 ± 1.562 | 2.274 ± 0.89 | 1.819 ± 0.838 | 2.729 ± 0.707 | 1.364 ± 0.811 | 0.0 ± 0.0
Gly | 5.002 ± 1.131 | 1.819 ± 0.941 | 5.002 ± 0.839 | 5.912 ± 1.014 | 3.638 ± 0.857 | 12.733 ± 4.771 | 3.183 ± 0.988 | 2.274 ± 0.905 | 3.638 ± 1.478 | 4.093 ± 0.808 | 1.364 ± 0.816 | 2.274 ± 0.689 | 6.821 ± 1.143 | 5.002 ± 0.962 | 5.912 ± 1.446 | 5.912 ± 1.828 | 5.002 ± 1.735 | 11.369 ± 2.413 | 0.455 ± 0.406 | 1.819 ± 0.52 | 0.0 ± 0.0
His | 1.364 ± 1.217 | 0.91 ± 0.579 | 0.91 ± 0.47 | 0.0 ± 0.0 | 1.819 ± 0.569 | 1.364 ± 0.589 | 0.455 ± 0.406 | 0.455 ± 0.34 | 0.91 ± 0.694 | 3.183 ± 0.563 | 0.91 ± 0.579 | 0.91 ± 0.591 | 3.183 ± 1.176 | 0.91 ± 0.694 | 1.819 ± 0.881 | 2.274 ± 1.098 | 0.91 ± 0.706 | 1.364 ± 0.439 | 0.91 ± 0.47 | 0.455 ± 0.347 | 0.0 ± 0.0
Ile | 0.455 ± 0.347 | 0.455 ± 0.509 | 0.455 ± 0.353 | 3.183 ± 0.81 | 1.819 ± 0.761 | 4.093 ± 1.261 | 0.455 ± 0.347 | 0.455 ± 0.347 | 0.455 ± 0.353 | 2.274 ± 0.634 | 0.0 ± 0.0 | 1.364 ± 0.632 | 0.91 ± 0.413 | 0.0 ± 0.0 | 0.91 ± 0.681 | 2.729 ± 1.2 | 0.91 ± 0.594 | 2.274 ± 0.9 | 0.455 ± 0.509 | 0.455 ± 0.353 | 0.0 ± 0.0
Lys | 3.183 ± 1.418 | 0.91 ± 0.47 | 0.91 ± 0.413 | 1.819 ± 1.21 | 1.364 ± 0.439 | 1.364 ± 0.829 | 1.819 ± 1.054 | 0.455 ± 0.406 | 2.729 ± 1.162 | 1.819 ± 0.965 | 0.91 ± 0.413 | 1.819 ± 0.948 | 0.91 ± 0.578 | 1.364 ± 0.632 | 6.821 ± 1.717 | 4.093 ± 1.192 | 2.729 ± 0.927 | 2.274 ± 0.92 | 0.455 ± 0.34 | 3.183 ± 0.738 | 0.0 ± 0.0
Leu | 5.912 ± 1.003 | 2.729 ± 1.645 | 5.912 ± 1.108 | 4.093 ± 0.95 | 4.093 ± 1.262 | 6.821 ± 1.946 | 3.638 ± 0.656 | 0.455 ± 0.406 | 4.093 ± 1.469 | 10.459 ± 3.35 | 2.274 ± 0.667 | 0.91 ± 0.579 | 3.183 ± 0.761 | 3.638 ± 0.84 | 4.548 ± 1.139 | 8.186 ± 1.632 | 5.002 ± 0.883 | 4.548 ± 0.563 | 1.364 ± 0.613 | 2.729 ± 0.686 | 0.0 ± 0.0
Met | 1.364 ± 0.642 | 0.0 ± 0.0 | 0.91 ± 0.377 | 0.455 ± 0.406 | 0.455 ± 0.353 | 1.819 ± 1.013 | 0.0 ± 0.0 | 0.455 ± 0.347 | 0.455 ± 0.347 | 1.819 ± 0.637 | 0.0 ± 0.0 | 1.364 ± 0.356 | 1.364 ± 0.555 | 0.91 ± 0.419 | 2.274 ± 1.287 | 1.819 ± 0.488 | 2.729 ± 1.468 | 2.729 ± 0.87 | 0.455 ± 0.406 | 0.91 ± 0.47 | 0.0 ± 0.0
Asn | 1.819 ± 1.008 | 1.364 ± 0.356 | 0.91 ± 0.377 | 0.91 ± 0.419 | 1.364 ± 0.642 | 3.183 ± 0.561 | 0.455 ± 0.347 | 0.455 ± 0.353 | 2.274 ± 0.92 | 2.274 ± 0.731 | 1.819 ± 0.569 | 0.91 ± 0.377 | 5.002 ± 1.35 | 0.91 ± 0.377 | 1.819 ± 0.801 | 1.819 ± 0.27 | 1.819 ± 0.723 | 2.729 ± 0.837 | 0.455 ± 0.347 | 0.0 ± 0.0 | 0.0 ± 0.0
Pro | 6.367 ± 1.67 | 2.274 ± 1.125 | 5.912 ± 1.057 | 1.364 ± 1.058 | 2.729 ± 0.462 | 4.548 ± 1.836 | 0.455 ± 0.406 | 2.274 ± 0.989 | 1.364 ± 0.356 | 5.457 ± 0.807 | 0.455 ± 0.353 | 2.729 ± 1.13 | 5.457 ± 1.7 | 3.638 ± 1.135 | 5.457 ± 1.051 | 5.002 ± 1.148 | 3.638 ± 1.291 | 6.821 ± 1.568 | 0.455 ± 0.406 | 0.455 ± 0.353 | 0.0 ± 0.0
Gln | 2.274 ± 0.474 | 0.91 ± 0.489 | 1.819 ± 0.944 | 2.729 ± 1.101 | 1.819 ± 0.786 | 4.093 ± 0.897 | 1.819 ± 0.637 | 1.364 ± 0.642 | 1.364 ± 0.745 | 2.274 ± 0.722 | 1.364 ± 0.566 | 1.819 ± 0.572 | 4.093 ± 0.462 | 1.364 ± 0.664 | 2.729 ± 0.794 | 3.183 ± 1.424 | 1.819 ± 0.78 | 3.183 ± 2.025 | 1.364 ± 0.745 | 0.455 ± 0.353 | 0.0 ± 0.0
Arg | 7.276 ± 1.334 | 3.183 ± 1.609 | 3.638 ± 1.058 | 3.638 ± 0.551 | 2.729 ± 0.713 | 7.731 ± 1.059 | 1.364 ± 0.811 | 1.364 ± 0.664 | 4.548 ± 1.11 | 7.276 ± 1.511 | 1.819 ± 0.948 | 1.819 ± 0.863 | 5.912 ± 1.053 | 2.729 ± 0.707 | 5.002 ± 1.662 | 3.183 ± 0.924 | 0.455 ± 0.34 | 5.457 ± 1.662 | 0.91 ± 1.017 | 2.274 ± 0.596 | 0.0 ± 0.0
Ser | 5.002 ± 1.511 | 0.455 ± 0.509 | 4.093 ± 0.715 | 1.364 ± 0.356 | 3.183 ± 0.603 | 4.548 ± 1.32 | 3.638 ± 2.805 | 2.729 ± 0.773 | 1.819 ± 0.637 | 6.367 ± 0.843 | 0.91 ± 0.419 | 3.183 ± 0.78 | 3.638 ± 2.292 | 5.457 ± 1.293 | 6.821 ± 1.864 | 10.459 ± 1.837 | 3.638 ± 1.26 | 8.64 ± 1.96 | 0.0 ± 0.0 | 0.455 ± 0.34 | 0.0 ± 0.0
Thr | 2.729 ± 1.476 | 1.819 ± 0.723 | 1.364 ± 0.726 | 2.729 ± 0.478 | 3.183 ± 1.135 | 7.731 ± 1.958 | 0.455 ± 0.509 | 0.455 ± 0.34 | 1.819 ± 1.411 | 4.548 ± 1.457 | 1.364 ± 0.642 | 2.729 ± 0.925 | 4.548 ± 1.245 | 1.364 ± 0.58 | 4.548 ± 0.636 | 2.729 ± 0.643 | 3.638 ± 0.924 | 3.183 ± 0.885 | 0.0 ± 0.0 | 0.91 ± 0.706 | 0.0 ± 0.0
Val | 2.274 ± 1.09 | 1.364 ± 0.746 | 5.002 ± 1.329 | 5.002 ± 0.967 | 5.002 ± 1.358 | 9.095 ± 1.921 | 2.729 ± 1.214 | 0.0 ± 0.0 | 3.638 ± 1.002 | 6.367 ± 1.465 | 1.819 ± 0.539 | 0.91 ± 0.377 | 6.821 ± 1.726 | 1.819 ± 0.609 | 4.093 ± 1.37 | 8.64 ± 1.531 | 3.638 ± 1.493 | 8.186 ± 1.994 | 0.91 ± 0.377 | 2.274 ± 1.098 | 0.0 ± 0.0
Trp | 0.91 ± 0.377 | 0.0 ± 0.0 | 0.455 ± 0.347 | 0.455 ± 0.353 | 0.455 ± 0.406 | 0.0 ± 0.0 | 0.455 ± 0.353 | 0.91 ± 0.579 | 0.455 ± 0.347 | 2.729 ± 0.462 | 0.0 ± 0.0 | 0.455 ± 0.509 | 0.0 ± 0.0 | 1.364 ± 0.439 | 1.819 ± 1.014 | 0.455 ± 0.406 | 0.91 ± 0.706 | 0.91 ± 0.489 | 0.455 ± 0.509 | 1.364 ± 0.745 | 0.0 ± 0.0
Tyr | 3.183 ± 0.848 | 0.0 ± 0.0 | 1.364 ± 0.555 | 1.364 ± 0.402 | 0.0 ± 0.0 | 1.364 ± 0.439 | 0.455 ± 0.353 | 0.455 ± 0.353 | 0.0 ± 0.0 | 2.274 ± 0.881 | 0.91 ± 0.377 | 0.91 ± 0.413 | 0.91 ± 0.413 | 1.819 ± 0.971 | 3.183 ± 0.44 | 0.455 ± 0.347 | 1.364 ± 0.593 | 2.729 ± 1.045 | 0.91 ± 0.377 | 1.364 ± 0.829 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 6 proteins (2200 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
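A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting estimates is reported as the error. This is only an illustration under stated assumptions: the function names are hypothetical, the population standard deviation is assumed as the spread measure, and overlapping pairs within each protein are assumed as the counting convention; the database's actual procedure may differ in these details.

```python
import random
import statistics
from collections import Counter

def pair_permille(proteins, pair):
    """Frequency of one dipeptide in per mille over all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Each round draws len(proteins) proteins with replacement, so the
    resampling unit is the whole protein, not the individual residue.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)  # assumed spread measure

# Toy sequences, not the real proteome.
toy_proteins = ["MAAGL", "GGAAV", "KRAA"]
aa_error = bootstrap_error(toy_proteins, "AA")
```

A dipeptide that never occurs in any protein gets a frequency of 0 in every resample and therefore an error of 0, which matches the `0.0 ± 0.0` entries in the table.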

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski