Amino acid dipeptide frequency for Capreolus capreolus papillomavirus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
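As a rough illustration of what such a calculation involves, here is a minimal Python sketch. It is not the code used by the database: the function name `dipeptide_frequencies`, the one-letter residue codes, and the choice to count overlapping pairs within each protein (never across protein boundaries) are all assumptions made for this example.

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code; any other symbol is
# mapped to "X" (Xaa). This mapping is an assumption of the sketch.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins.

    Dipeptides are overlapping pairs of adjacent residues, counted
    within each protein and never across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    # Report all 441 ordered pairs, including those never observed.
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Toy input (hypothetical sequences, not taken from this proteome).
freqs = dipeptide_frequencies(["MAAC", "AAX"])
```

In the toy input there are five dipeptides in total (MA, AA, AC, AA, AX), so `freqs["AA"]` is 2/5 of the total, expressed in per mille.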

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
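The uniform expectation can be written down in a few lines of Python. This is only a sketch: the helper names `expected_per_mille` and `classify` are hypothetical, and the exact rule the page uses to choose red or blue is not stated, so the simple comparison against the uniform expectation below is an assumption.

```python
def expected_per_mille(n_residue_types=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / (n_residue_types ** 2)


def classify(observed_per_mille, n_residue_types=20):
    """Label a dipeptide relative to the uniform expectation.

    Assumption: anything above the expectation counts as "over" and
    anything below as "under"; the page may apply a different rule.
    """
    expected = expected_per_mille(n_residue_types)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "expected"


# Values taken from the table: AlaAla = 9.117, AlaHis = 0.829 (per mille).
ala_ala = classify(9.117)
ala_his = classify(0.829)
```

With 20 residue types the expectation is 1000/400 = 2.5‰, so AlaAla comes out as overrepresented and AlaHis as underrepresented. Passing `n_residue_types=21` gives the expectation for the 441-dipeptide case instead.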

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions (for example, 2.5‰ = 0.25%).
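For readers who want to work with the table values as plain fractions or percentages, the conversion is a single multiplication. The function names below are hypothetical and used only for illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10.0


# AlaAla from the table: 9.117 per mille.
ala_ala_fraction = per_mille_to_fraction(9.117)
ala_ala_percent = per_mille_to_percent(9.117)
```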

For more information, see the sequence space article on Wikipedia.

Dipeptides are grouped by first residue (row label) and listed by second residue in the order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa. Each entry gives the frequency ± error in ‰.
Ala
AlaAla: 9.117 ± 2.419
AlaCys: 2.072 ± 0.984
AlaAsp: 4.144 ± 1.146
AlaGlu: 2.487 ± 0.589
AlaPhe: 2.072 ± 0.794
AlaGly: 6.216 ± 1.184
AlaHis: 0.829 ± 0.468
AlaIle: 2.487 ± 0.565
AlaLys: 2.487 ± 0.76
AlaLeu: 5.802 ± 1.629
AlaMet: 0.0 ± 0.0
AlaAsn: 2.487 ± 0.885
AlaPro: 2.072 ± 0.743
AlaGln: 2.072 ± 0.808
AlaArg: 5.387 ± 1.441
AlaSer: 5.387 ± 1.277
AlaThr: 2.901 ± 1.219
AlaVal: 5.802 ± 1.427
AlaTrp: 0.829 ± 0.445
AlaTyr: 1.658 ± 0.828
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.829 ± 0.664
CysCys: 1.243 ± 1.304
CysAsp: 2.072 ± 1.004
CysGlu: 1.243 ± 1.304
CysPhe: 0.414 ± 0.527
CysGly: 1.243 ± 0.98
CysHis: 0.0 ± 0.0
CysIle: 0.829 ± 0.68
CysLys: 0.829 ± 0.638
CysLeu: 2.901 ± 1.264
CysMet: 0.414 ± 0.524
CysAsn: 0.414 ± 0.34
CysPro: 2.072 ± 0.401
CysGln: 0.414 ± 0.527
CysArg: 2.072 ± 1.103
CysSer: 3.73 ± 1.038
CysThr: 2.072 ± 0.821
CysVal: 1.243 ± 0.625
CysTrp: 0.414 ± 0.34
CysTyr: 0.829 ± 0.625
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.144 ± 0.468
AspCys: 2.487 ± 1.264
AspAsp: 2.072 ± 0.679
AspGlu: 4.973 ± 2.098
AspPhe: 2.487 ± 0.782
AspGly: 3.73 ± 1.136
AspHis: 0.829 ± 0.468
AspIle: 4.144 ± 1.84
AspLys: 2.901 ± 1.837
AspLeu: 2.901 ± 0.701
AspMet: 1.243 ± 0.392
AspAsn: 2.072 ± 0.794
AspPro: 1.658 ± 0.676
AspGln: 2.072 ± 0.683
AspArg: 2.487 ± 0.981
AspSer: 4.144 ± 1.355
AspThr: 4.973 ± 1.181
AspVal: 1.243 ± 0.706
AspTrp: 0.414 ± 0.34
AspTyr: 0.829 ± 0.403
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.072 ± 0.852
GluCys: 0.414 ± 0.34
GluAsp: 5.387 ± 1.725
GluGlu: 2.487 ± 0.665
GluPhe: 0.829 ± 0.445
GluGly: 4.559 ± 1.712
GluHis: 1.243 ± 0.368
GluIle: 1.243 ± 0.713
GluLys: 3.315 ± 1.279
GluLeu: 5.387 ± 0.959
GluMet: 1.658 ± 1.075
GluAsn: 2.072 ± 0.947
GluPro: 3.315 ± 1.014
GluGln: 3.73 ± 0.642
GluArg: 2.901 ± 1.029
GluSer: 4.144 ± 2.011
GluThr: 2.901 ± 0.852
GluVal: 4.144 ± 1.035
GluTrp: 0.414 ± 0.34
GluTyr: 2.487 ± 0.589
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.315 ± 0.598
PheCys: 0.414 ± 0.541
PheAsp: 1.658 ± 0.592
PheGlu: 2.901 ± 1.27
PhePhe: 2.487 ± 1.1
PheGly: 2.487 ± 0.489
PheHis: 1.658 ± 0.855
PheIle: 2.072 ± 0.69
PheLys: 1.658 ± 0.807
PheLeu: 3.73 ± 0.994
PheMet: 0.829 ± 0.694
PheAsn: 2.901 ± 0.837
PhePro: 0.829 ± 0.572
PheGln: 1.658 ± 0.828
PheArg: 3.73 ± 0.653
PheSer: 1.658 ± 0.671
PheThr: 1.658 ± 1.183
PheVal: 2.072 ± 0.954
PheTrp: 2.072 ± 0.69
PheTyr: 1.243 ± 0.692
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.144 ± 1.508
GlyCys: 2.072 ± 1.276
GlyAsp: 3.315 ± 0.92
GlyGlu: 2.901 ± 1.021
GlyPhe: 0.829 ± 0.572
GlyGly: 6.631 ± 2.379
GlyHis: 1.658 ± 0.676
GlyIle: 3.73 ± 1.24
GlyLys: 1.658 ± 0.745
GlyLeu: 5.802 ± 2.133
GlyMet: 2.901 ± 0.994
GlyAsn: 4.144 ± 1.289
GlyPro: 4.973 ± 1.233
GlyGln: 2.901 ± 1.36
GlyArg: 4.144 ± 1.104
GlySer: 9.117 ± 2.044
GlyThr: 6.216 ± 1.796
GlyVal: 5.802 ± 2.28
GlyTrp: 0.414 ± 0.36
GlyTyr: 1.243 ± 0.662
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.072 ± 0.58
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 1.243 ± 0.746
HisPhe: 0.829 ± 0.68
HisGly: 2.072 ± 1.348
HisHis: 1.243 ± 1.035
HisIle: 0.829 ± 0.433
HisLys: 0.414 ± 0.34
HisLeu: 3.73 ± 1.134
HisMet: 0.0 ± 0.0
HisAsn: 1.658 ± 1.05
HisPro: 1.658 ± 0.707
HisGln: 1.243 ± 0.661
HisArg: 1.243 ± 0.876
HisSer: 3.73 ± 1.083
HisThr: 0.414 ± 0.369
HisVal: 1.658 ± 0.258
HisTrp: 0.414 ± 0.367
HisTyr: 0.414 ± 0.34
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.315 ± 1.34
IleCys: 0.829 ± 0.403
IleAsp: 2.487 ± 1.242
IleGlu: 2.901 ± 0.901
IlePhe: 0.829 ± 0.445
IleGly: 2.901 ± 0.944
IleHis: 0.0 ± 0.0
IleIle: 1.243 ± 0.761
IleLys: 0.829 ± 0.4
IleLeu: 2.487 ± 0.938
IleMet: 0.829 ± 0.539
IleAsn: 2.487 ± 1.404
IlePro: 2.072 ± 1.161
IleGln: 2.072 ± 0.794
IleArg: 0.829 ± 0.57
IleSer: 2.487 ± 1.012
IleThr: 3.73 ± 0.562
IleVal: 1.243 ± 0.936
IleTrp: 0.829 ± 0.666
IleTyr: 0.414 ± 0.367
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.73 ± 1.0
LysCys: 1.243 ± 0.721
LysAsp: 2.072 ± 0.958
LysGlu: 2.901 ± 1.718
LysPhe: 1.658 ± 0.807
LysGly: 2.072 ± 0.401
LysHis: 1.243 ± 1.02
LysIle: 0.829 ± 0.414
LysLys: 2.072 ± 1.153
LysLeu: 6.216 ± 2.203
LysMet: 1.658 ± 0.787
LysAsn: 2.072 ± 0.691
LysPro: 3.315 ± 2.389
LysGln: 2.487 ± 0.864
LysArg: 5.387 ± 1.869
LysSer: 2.072 ± 0.852
LysThr: 2.901 ± 0.719
LysVal: 2.487 ± 1.48
LysTrp: 0.0 ± 0.0
LysTyr: 0.829 ± 0.68
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.973 ± 1.414
LeuCys: 3.73 ± 1.332
LeuAsp: 4.973 ± 1.581
LeuGlu: 5.387 ± 1.489
LeuPhe: 6.216 ± 1.358
LeuGly: 6.216 ± 1.557
LeuHis: 2.072 ± 0.923
LeuIle: 3.73 ± 0.81
LeuLys: 6.631 ± 1.98
LeuLeu: 9.946 ± 2.423
LeuMet: 1.243 ± 0.368
LeuAsn: 1.243 ± 0.65
LeuPro: 6.631 ± 1.709
LeuGln: 5.802 ± 1.532
LeuArg: 3.73 ± 1.41
LeuSer: 4.559 ± 1.67
LeuThr: 6.631 ± 1.817
LeuVal: 4.144 ± 0.687
LeuTrp: 1.658 ± 1.05
LeuTyr: 5.387 ± 1.172
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.243 ± 0.633
MetCys: 0.829 ± 0.635
MetAsp: 1.243 ± 0.692
MetGlu: 1.243 ± 0.936
MetPhe: 0.414 ± 0.367
MetGly: 0.414 ± 0.369
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 0.829 ± 0.414
MetLeu: 2.487 ± 0.837
MetMet: 0.414 ± 0.367
MetAsn: 1.243 ± 0.375
MetPro: 1.658 ± 0.784
MetGln: 0.829 ± 0.683
MetArg: 0.414 ± 0.36
MetSer: 2.072 ± 0.637
MetThr: 1.243 ± 0.713
MetVal: 1.658 ± 0.741
MetTrp: 0.0 ± 0.0
MetTyr: 0.414 ± 0.524
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.901 ± 0.834
AsnCys: 0.829 ± 0.68
AsnAsp: 0.0 ± 0.0
AsnGlu: 1.658 ± 0.768
AsnPhe: 0.829 ± 0.433
AsnGly: 3.315 ± 0.705
AsnHis: 1.243 ± 0.65
AsnIle: 2.072 ± 0.728
AsnLys: 2.072 ± 0.603
AsnLeu: 2.901 ± 0.527
AsnMet: 0.414 ± 0.34
AsnAsn: 0.829 ± 0.734
AsnPro: 2.487 ± 1.007
AsnGln: 2.487 ± 0.952
AsnArg: 2.072 ± 1.42
AsnSer: 3.315 ± 1.027
AsnThr: 1.658 ± 1.029
AsnVal: 3.73 ± 0.883
AsnTrp: 0.414 ± 0.369
AsnTyr: 0.414 ± 0.36
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.559 ± 1.213
ProCys: 0.829 ± 0.403
ProAsp: 4.559 ± 1.597
ProGlu: 2.487 ± 0.765
ProPhe: 2.901 ± 0.677
ProGly: 3.315 ± 0.972
ProHis: 0.829 ± 0.468
ProIle: 1.658 ± 0.754
ProLys: 2.901 ± 0.542
ProLeu: 7.874 ± 1.688
ProMet: 0.414 ± 0.367
ProAsn: 2.487 ± 1.205
ProPro: 7.46 ± 1.59
ProGln: 1.243 ± 1.159
ProArg: 3.315 ± 1.45
ProSer: 7.045 ± 1.517
ProThr: 3.73 ± 1.555
ProVal: 4.973 ± 1.847
ProTrp: 1.243 ± 0.876
ProTyr: 1.658 ± 0.794
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.901 ± 1.188
GlnCys: 1.658 ± 0.858
GlnAsp: 0.414 ± 0.36
GlnGlu: 2.487 ± 0.823
GlnPhe: 1.243 ± 1.101
GlnGly: 5.387 ± 1.374
GlnHis: 0.829 ± 0.683
GlnIle: 0.0 ± 0.0
GlnLys: 2.487 ± 1.12
GlnLeu: 4.144 ± 1.142
GlnMet: 0.829 ± 0.4
GlnAsn: 0.414 ± 0.369
GlnPro: 4.559 ± 1.629
GlnGln: 4.144 ± 1.25
GlnArg: 2.072 ± 0.749
GlnSer: 1.243 ± 0.662
GlnThr: 3.315 ± 1.34
GlnVal: 3.73 ± 0.783
GlnTrp: 0.829 ± 0.68
GlnTyr: 1.658 ± 0.835
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.315 ± 0.807
ArgCys: 2.487 ± 1.826
ArgAsp: 2.901 ± 1.264
ArgGlu: 0.0 ± 0.0
ArgPhe: 1.658 ± 0.624
ArgGly: 4.973 ± 1.339
ArgHis: 4.559 ± 1.008
ArgIle: 2.487 ± 0.914
ArgLys: 4.559 ± 0.799
ArgLeu: 4.973 ± 1.524
ArgMet: 0.414 ± 0.553
ArgAsn: 2.072 ± 0.603
ArgPro: 4.559 ± 1.244
ArgGln: 1.243 ± 0.411
ArgArg: 4.559 ± 1.081
ArgSer: 2.072 ± 0.514
ArgThr: 4.559 ± 0.835
ArgVal: 4.973 ± 1.262
ArgTrp: 0.829 ± 0.799
ArgTyr: 2.487 ± 1.244
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.631 ± 1.85
SerCys: 1.243 ± 0.539
SerAsp: 3.315 ± 1.936
SerGlu: 6.216 ± 0.885
SerPhe: 4.144 ± 1.585
SerGly: 4.973 ± 1.5
SerHis: 1.243 ± 0.375
SerIle: 2.072 ± 0.797
SerLys: 4.144 ± 1.027
SerLeu: 8.288 ± 1.967
SerMet: 1.658 ± 0.624
SerAsn: 4.559 ± 1.16
SerPro: 6.631 ± 2.768
SerGln: 3.73 ± 1.099
SerArg: 3.73 ± 0.7
SerSer: 7.874 ± 2.067
SerThr: 4.559 ± 1.364
SerVal: 4.144 ± 0.82
SerTrp: 0.829 ± 0.4
SerTyr: 1.243 ± 0.368
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.901 ± 0.949
ThrCys: 1.243 ± 0.706
ThrAsp: 4.973 ± 1.31
ThrGlu: 5.387 ± 1.789
ThrPhe: 2.901 ± 1.188
ThrGly: 4.559 ± 1.147
ThrHis: 2.072 ± 1.134
ThrIle: 0.829 ± 0.586
ThrLys: 2.072 ± 1.399
ThrLeu: 3.315 ± 2.082
ThrMet: 2.072 ± 1.015
ThrAsn: 1.243 ± 1.02
ThrPro: 4.144 ± 1.092
ThrGln: 1.658 ± 0.533
ThrArg: 4.973 ± 1.336
ThrSer: 7.46 ± 2.648
ThrThr: 4.973 ± 2.266
ThrVal: 7.045 ± 2.114
ThrTrp: 2.072 ± 0.972
ThrTyr: 1.658 ± 0.561
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.072 ± 0.736
ValCys: 0.829 ± 0.562
ValAsp: 2.072 ± 0.641
ValGlu: 2.487 ± 0.856
ValPhe: 4.973 ± 1.449
ValGly: 6.216 ± 1.143
ValHis: 2.487 ± 0.959
ValIle: 2.901 ± 1.322
ValLys: 2.901 ± 1.021
ValLeu: 6.631 ± 1.039
ValMet: 0.414 ± 0.369
ValAsn: 0.414 ± 0.369
ValPro: 4.559 ± 1.201
ValGln: 2.901 ± 1.159
ValArg: 4.144 ± 0.959
ValSer: 7.46 ± 1.188
ValThr: 6.631 ± 0.84
ValVal: 1.658 ± 0.784
ValTrp: 0.414 ± 0.367
ValTyr: 3.315 ± 1.018
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.829 ± 0.403
TrpCys: 0.414 ± 0.524
TrpAsp: 0.829 ± 0.445
TrpGlu: 1.658 ± 0.957
TrpPhe: 0.829 ± 0.586
TrpGly: 1.658 ± 0.583
TrpHis: 0.0 ± 0.0
TrpIle: 1.243 ± 0.597
TrpLys: 0.829 ± 0.68
TrpLeu: 1.658 ± 0.956
TrpMet: 0.414 ± 0.524
TrpAsn: 0.414 ± 0.367
TrpPro: 0.0 ± 0.0
TrpGln: 0.829 ± 0.445
TrpArg: 0.0 ± 0.0
TrpSer: 1.243 ± 0.721
TrpThr: 0.829 ± 0.445
TrpVal: 0.829 ± 0.63
TrpTrp: 0.414 ± 0.36
TrpTyr: 0.829 ± 0.4
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.243 ± 0.375
TyrCys: 0.414 ± 0.524
TyrAsp: 3.315 ± 0.604
TyrGlu: 1.658 ± 0.258
TyrPhe: 2.487 ± 0.454
TyrGly: 1.658 ± 0.592
TyrHis: 0.829 ± 0.433
TyrIle: 0.414 ± 0.367
TyrLys: 1.658 ± 0.762
TyrLeu: 3.73 ± 1.509
TyrMet: 0.414 ± 0.36
TyrAsn: 0.414 ± 0.34
TyrPro: 0.829 ± 0.403
TyrGln: 0.829 ± 0.433
TyrArg: 2.487 ± 0.871
TyrSer: 0.414 ± 0.36
TyrThr: 2.072 ± 0.813
TyrVal: 2.901 ± 1.192
TyrTrp: 1.243 ± 0.929
TyrTyr: 1.658 ± 0.686
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (2414 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
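To make the note concrete, the following Python sketch shows one common way to bootstrap at the protein level: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of those estimates is reported. The function names are hypothetical, and using the sample standard deviation as the reported error is an assumption; the page does not say which spread measure the database uses.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide (per mille) pooled over the given proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Estimate the error of a dipeptide frequency by protein-level bootstrap.

    Each resample draws len(proteins) whole proteins with replacement.
    The returned value is the sample standard deviation of the
    per-resample frequencies (an assumption of this sketch).
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


# Toy proteome in one-letter code (hypothetical, not this virus).
toy = ["MAAC", "AAGG", "CCAA"]
err_aa = bootstrap_error(toy, "AA")
err_absent = bootstrap_error(toy, "WW")
```

A dipeptide that never occurs has a frequency of zero in every resample, so its estimated error is also zero, which is consistent with the `0.0 ± 0.0` entries in the table.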

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski