Amino acid dipeptide frequency for Betapapillomavirus 4

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides occurred with equal probability in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
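As a minimal sketch of the calculation described above, the following Python snippet counts overlapping residue pairs and compares each frequency with the uniform expectation of 2.5‰. The function names, the toy sequences, and the choice to count pairs only within each sequence (not across protein boundaries) are assumptions made for illustration; the page does not state how the database handles these details.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation: 1 out of 400 dipeptides, expressed in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for overlapping pairs.

    Assumption: pairs are counted within each sequence only, and any
    residue outside the 20 standard ones is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in sequences:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille):
    """Label a frequency relative to the uniform 2.5 per mille expectation."""
    if freq_per_mille > EXPECTED_PER_MILLE:
        return "over"
    if freq_per_mille < EXPECTED_PER_MILLE:
        return "under"
    return "expected"


# Hypothetical toy input, not taken from the proteome.
toy = ["MALLK", "LLAB"]
freqs = dipeptide_per_mille(toy)
```

With real protein sequences in place of `toy`, `classify(freqs[pair])` would correspond to the red ("over") and blue ("under") marking described above, under the stated assumptions.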

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
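The unit conversion can be illustrated with a short Python sketch. The helper names are hypothetical, and the fold value assumes the uniform expectation over 400 dipeptides mentioned earlier on this page; the LeuLeu value is taken from the table.

```python
import math


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def fold_vs_uniform(value, n_dipeptides=400):
    """Ratio of an observed per mille value to the uniform expectation."""
    expected = 1000.0 / n_dipeptides  # 2.5 per mille for 400 dipeptides
    return value / expected


leu_leu = 11.751                             # LeuLeu value from the table
fraction = per_mille_to_fraction(leu_leu)    # about 0.011751
fold = fold_vs_uniform(leu_leu)              # roughly 4.7 times the expectation
```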

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
2.35AlaAla: 2.35 ± 0.637
0.392AlaCys: 0.392 ± 0.353
3.917AlaAsp: 3.917 ± 1.394
4.7AlaGlu: 4.7 ± 1.377
1.958AlaPhe: 1.958 ± 0.887
5.484AlaGly: 5.484 ± 3.963
0.392AlaHis: 0.392 ± 0.353
1.567AlaIle: 1.567 ± 0.718
3.525AlaLys: 3.525 ± 1.55
5.092AlaLeu: 5.092 ± 0.94
1.175AlaMet: 1.175 ± 0.392
1.958AlaAsn: 1.958 ± 0.689
3.525AlaPro: 3.525 ± 0.879
3.525AlaGln: 3.525 ± 1.004
4.309AlaArg: 4.309 ± 1.382
3.525AlaSer: 3.525 ± 1.386
4.7AlaThr: 4.7 ± 1.699
3.525AlaVal: 3.525 ± 1.049
0.783AlaTrp: 0.783 ± 0.53
3.134AlaTyr: 3.134 ± 0.761
0.0AlaXaa: 0.0 ± 0.0
Cys
1.567CysAla: 1.567 ± 0.942
1.175CysCys: 1.175 ± 0.924
1.175CysAsp: 1.175 ± 0.743
0.392CysGlu: 0.392 ± 0.353
1.567CysPhe: 1.567 ± 0.653
1.567CysGly: 1.567 ± 1.69
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
3.134CysLys: 3.134 ± 1.073
2.742CysLeu: 2.742 ± 1.448
0.392CysMet: 0.392 ± 0.353
0.783CysAsn: 0.783 ± 0.612
1.567CysPro: 1.567 ± 0.702
0.0CysGln: 0.0 ± 0.0
1.567CysArg: 1.567 ± 0.748
1.958CysSer: 1.958 ± 1.028
1.567CysThr: 1.567 ± 0.992
0.0CysVal: 0.0 ± 0.0
0.783CysTrp: 0.783 ± 0.389
0.783CysTyr: 0.783 ± 0.436
0.0CysXaa: 0.0 ± 0.0
Asp
3.134AspAla: 3.134 ± 0.58
1.567AspCys: 1.567 ± 1.154
3.525AspAsp: 3.525 ± 1.007
5.484AspGlu: 5.484 ± 0.693
1.567AspPhe: 1.567 ± 0.775
3.525AspGly: 3.525 ± 0.928
1.567AspHis: 1.567 ± 0.799
3.525AspIle: 3.525 ± 0.988
2.35AspLys: 2.35 ± 0.93
6.267AspLeu: 6.267 ± 3.005
1.567AspMet: 1.567 ± 0.897
3.917AspAsn: 3.917 ± 0.913
5.092AspPro: 5.092 ± 0.788
3.525AspGln: 3.525 ± 0.436
4.309AspArg: 4.309 ± 1.844
3.917AspSer: 3.917 ± 1.289
3.134AspThr: 3.134 ± 1.033
2.742AspVal: 2.742 ± 1.058
1.567AspTrp: 1.567 ± 0.605
2.35AspTyr: 2.35 ± 1.27
0.0AspXaa: 0.0 ± 0.0
Glu
6.659GluAla: 6.659 ± 1.512
0.783GluCys: 0.783 ± 0.706
7.442GluAsp: 7.442 ± 1.542
5.875GluGlu: 5.875 ± 2.717
1.175GluPhe: 1.175 ± 0.672
5.875GluGly: 5.875 ± 2.015
1.567GluHis: 1.567 ± 0.269
4.7GluIle: 4.7 ± 1.43
3.917GluLys: 3.917 ± 1.19
5.484GluLeu: 5.484 ± 1.495
0.392GluMet: 0.392 ± 0.353
2.35GluAsn: 2.35 ± 0.645
1.958GluPro: 1.958 ± 1.294
1.567GluGln: 1.567 ± 0.269
3.525GluArg: 3.525 ± 0.841
2.742GluSer: 2.742 ± 0.89
5.484GluThr: 5.484 ± 2.964
3.525GluVal: 3.525 ± 1.069
0.783GluTrp: 0.783 ± 0.415
2.742GluTyr: 2.742 ± 1.194
0.0GluXaa: 0.0 ± 0.0
Phe
1.567PheAla: 1.567 ± 1.006
1.175PheCys: 1.175 ± 0.666
2.742PheAsp: 2.742 ± 0.783
4.309PheGlu: 4.309 ± 1.517
1.567PhePhe: 1.567 ± 0.628
2.742PheGly: 2.742 ± 0.628
0.783PheHis: 0.783 ± 0.432
1.958PheIle: 1.958 ± 0.689
2.35PheLys: 2.35 ± 1.241
4.309PheLeu: 4.309 ± 1.115
0.0PheMet: 0.0 ± 0.0
1.958PheAsn: 1.958 ± 1.082
1.175PhePro: 1.175 ± 0.672
2.35PheGln: 2.35 ± 0.909
1.958PheArg: 1.958 ± 0.626
1.175PheSer: 1.175 ± 0.701
0.783PheThr: 0.783 ± 0.415
3.134PheVal: 3.134 ± 1.243
1.175PheTrp: 1.175 ± 0.672
1.567PheTyr: 1.567 ± 0.778
0.0PheXaa: 0.0 ± 0.0
Gly
7.051GlyAla: 7.051 ± 3.931
3.134GlyCys: 3.134 ± 1.501
3.134GlyAsp: 3.134 ± 0.482
6.267GlyGlu: 6.267 ± 2.233
1.567GlyPhe: 1.567 ± 0.778
4.7GlyGly: 4.7 ± 1.849
1.567GlyHis: 1.567 ± 0.897
3.525GlyIle: 3.525 ± 1.376
3.917GlyLys: 3.917 ± 0.986
2.35GlyLeu: 2.35 ± 0.511
0.0GlyMet: 0.0 ± 0.0
4.7GlyAsn: 4.7 ± 0.772
3.134GlyPro: 3.134 ± 0.759
1.567GlyGln: 1.567 ± 0.701
7.834GlyArg: 7.834 ± 3.68
5.875GlySer: 5.875 ± 1.647
3.525GlyThr: 3.525 ± 1.304
3.134GlyVal: 3.134 ± 0.937
0.0GlyTrp: 0.0 ± 0.0
1.958GlyTyr: 1.958 ± 1.127
0.0GlyXaa: 0.0 ± 0.0
His
0.392HisAla: 0.392 ± 0.316
1.175HisCys: 1.175 ± 0.91
0.392HisAsp: 0.392 ± 0.408
0.392HisGlu: 0.392 ± 0.353
1.175HisPhe: 1.175 ± 0.47
0.783HisGly: 0.783 ± 0.52
0.0HisHis: 0.0 ± 0.0
0.0HisIle: 0.0 ± 0.0
0.783HisLys: 0.783 ± 0.536
2.35HisLeu: 2.35 ± 0.646
0.392HisMet: 0.392 ± 0.316
1.175HisAsn: 1.175 ± 0.615
1.958HisPro: 1.958 ± 0.876
0.392HisGln: 0.392 ± 0.353
0.392HisArg: 0.392 ± 0.408
2.35HisSer: 2.35 ± 0.516
0.783HisThr: 0.783 ± 0.383
1.175HisVal: 1.175 ± 0.382
1.175HisTrp: 1.175 ± 0.392
1.567HisTyr: 1.567 ± 0.997
0.0HisXaa: 0.0 ± 0.0
Ile
2.742IleAla: 2.742 ± 1.192
0.392IleCys: 0.392 ± 0.316
3.525IleAsp: 3.525 ± 1.741
3.917IleGlu: 3.917 ± 1.079
1.175IlePhe: 1.175 ± 0.816
3.134IleGly: 3.134 ± 0.966
1.567IleHis: 1.567 ± 0.628
1.567IleIle: 1.567 ± 0.977
1.175IleLys: 1.175 ± 0.91
3.134IleLeu: 3.134 ± 0.857
1.567IleMet: 1.567 ± 0.817
1.175IleAsn: 1.175 ± 0.356
3.917IlePro: 3.917 ± 1.762
1.958IleGln: 1.958 ± 0.863
4.309IleArg: 4.309 ± 0.936
3.134IleSer: 3.134 ± 0.848
1.958IleThr: 1.958 ± 0.887
3.134IleVal: 3.134 ± 1.353
1.175IleTrp: 1.175 ± 0.555
2.742IleTyr: 2.742 ± 0.829
0.0IleXaa: 0.0 ± 0.0
Lys
5.092LysAla: 5.092 ± 0.927
0.392LysCys: 0.392 ± 0.368
2.742LysAsp: 2.742 ± 0.734
5.092LysGlu: 5.092 ± 1.431
3.134LysPhe: 3.134 ± 1.591
3.525LysGly: 3.525 ± 1.667
1.567LysHis: 1.567 ± 1.13
1.175LysIle: 1.175 ± 0.743
2.35LysLys: 2.35 ± 0.688
4.7LysLeu: 4.7 ± 1.259
0.392LysMet: 0.392 ± 0.316
2.742LysAsn: 2.742 ± 1.496
0.783LysPro: 0.783 ± 0.559
2.742LysGln: 2.742 ± 1.12
4.309LysArg: 4.309 ± 1.009
4.309LysSer: 4.309 ± 1.812
2.742LysThr: 2.742 ± 0.734
3.134LysVal: 3.134 ± 1.114
0.783LysTrp: 0.783 ± 0.52
2.742LysTyr: 2.742 ± 0.739
0.0LysXaa: 0.0 ± 0.0
Leu
3.917LeuAla: 3.917 ± 0.901
3.525LeuCys: 3.525 ± 1.98
6.267LeuAsp: 6.267 ± 0.743
6.267LeuGlu: 6.267 ± 2.047
5.092LeuPhe: 5.092 ± 1.364
4.7LeuGly: 4.7 ± 1.387
1.567LeuHis: 1.567 ± 0.696
3.525LeuIle: 3.525 ± 1.167
3.525LeuLys: 3.525 ± 0.656
11.751LeuLeu: 11.751 ± 4.051
1.175LeuMet: 1.175 ± 0.404
2.35LeuAsn: 2.35 ± 1.426
4.309LeuPro: 4.309 ± 1.581
6.659LeuGln: 6.659 ± 1.981
3.525LeuArg: 3.525 ± 1.305
5.484LeuSer: 5.484 ± 2.153
5.484LeuThr: 5.484 ± 0.898
5.484LeuVal: 5.484 ± 0.894
0.392LeuTrp: 0.392 ± 0.353
1.175LeuTyr: 1.175 ± 0.345
0.0LeuXaa: 0.0 ± 0.0
Met
1.567MetAla: 1.567 ± 0.644
0.392MetCys: 0.392 ± 0.353
0.0MetAsp: 0.0 ± 0.0
0.392MetGlu: 0.392 ± 0.368
1.567MetPhe: 1.567 ± 0.605
0.0MetGly: 0.0 ± 0.0
0.392MetHis: 0.392 ± 0.353
0.783MetIle: 0.783 ± 0.757
0.783MetLys: 0.783 ± 0.415
1.175MetLeu: 1.175 ± 0.742
0.0MetMet: 0.0 ± 0.0
1.175MetAsn: 1.175 ± 0.615
0.0MetPro: 0.0 ± 0.0
0.0MetGln: 0.0 ± 0.0
0.392MetArg: 0.392 ± 0.368
1.958MetSer: 1.958 ± 1.04
1.175MetThr: 1.175 ± 0.667
1.567MetVal: 1.567 ± 0.605
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.525AsnAla: 3.525 ± 0.813
1.175AsnCys: 1.175 ± 0.345
3.134AsnAsp: 3.134 ± 1.221
1.175AsnGlu: 1.175 ± 0.667
1.958AsnPhe: 1.958 ± 0.73
2.742AsnGly: 2.742 ± 1.296
0.0AsnHis: 0.0 ± 0.0
3.525AsnIle: 3.525 ± 0.643
2.742AsnLys: 2.742 ± 0.739
2.35AsnLeu: 2.35 ± 1.082
0.783AsnMet: 0.783 ± 0.59
1.567AsnAsn: 1.567 ± 0.698
3.917AsnPro: 3.917 ± 1.471
1.567AsnGln: 1.567 ± 0.856
1.175AsnArg: 1.175 ± 0.687
2.742AsnSer: 2.742 ± 0.549
3.917AsnThr: 3.917 ± 1.46
2.35AsnVal: 2.35 ± 0.934
0.392AsnTrp: 0.392 ± 0.368
0.783AsnTyr: 0.783 ± 0.389
0.0AsnXaa: 0.0 ± 0.0
Pro
3.917ProAla: 3.917 ± 1.312
1.567ProCys: 1.567 ± 0.899
4.7ProAsp: 4.7 ± 1.408
3.525ProGlu: 3.525 ± 1.104
1.175ProPhe: 1.175 ± 0.666
2.35ProGly: 2.35 ± 0.93
0.0ProHis: 0.0 ± 0.0
2.35ProIle: 2.35 ± 1.067
4.7ProLys: 4.7 ± 1.326
5.092ProLeu: 5.092 ± 1.35
0.0ProMet: 0.0 ± 0.0
1.958ProAsn: 1.958 ± 1.197
6.659ProPro: 6.659 ± 1.835
4.309ProGln: 4.309 ± 2.278
1.175ProArg: 1.175 ± 0.701
4.7ProSer: 4.7 ± 2.641
5.092ProThr: 5.092 ± 3.439
3.134ProVal: 3.134 ± 1.192
0.392ProTrp: 0.392 ± 0.316
1.958ProTyr: 1.958 ± 1.248
0.0ProXaa: 0.0 ± 0.0
Gln
1.567GlnAla: 1.567 ± 0.856
1.175GlnCys: 1.175 ± 0.672
2.35GlnAsp: 2.35 ± 1.499
2.742GlnGlu: 2.742 ± 1.144
2.742GlnPhe: 2.742 ± 0.937
1.567GlnGly: 1.567 ± 0.569
0.783GlnHis: 0.783 ± 0.415
3.525GlnIle: 3.525 ± 0.819
1.175GlnLys: 1.175 ± 0.392
6.267GlnLeu: 6.267 ± 1.486
1.567GlnMet: 1.567 ± 0.438
0.783GlnAsn: 0.783 ± 0.392
2.35GlnPro: 2.35 ± 0.879
2.742GlnGln: 2.742 ± 1.19
4.7GlnArg: 4.7 ± 1.411
1.567GlnSer: 1.567 ± 0.446
2.35GlnThr: 2.35 ± 0.932
4.309GlnVal: 4.309 ± 1.576
0.392GlnTrp: 0.392 ± 0.353
1.567GlnTyr: 1.567 ± 0.701
0.0GlnXaa: 0.0 ± 0.0
Arg
4.309ArgAla: 4.309 ± 1.346
0.392ArgCys: 0.392 ± 0.456
4.309ArgAsp: 4.309 ± 2.126
3.134ArgGlu: 3.134 ± 0.463
2.742ArgPhe: 2.742 ± 0.669
7.442ArgGly: 7.442 ± 3.262
1.958ArgHis: 1.958 ± 0.73
1.567ArgIle: 1.567 ± 1.465
4.309ArgLys: 4.309 ± 0.677
4.7ArgLeu: 4.7 ± 0.985
1.175ArgMet: 1.175 ± 0.684
1.567ArgAsn: 1.567 ± 0.642
1.958ArgPro: 1.958 ± 0.837
3.134ArgGln: 3.134 ± 1.242
6.267ArgArg: 6.267 ± 1.979
10.576ArgSer: 10.576 ± 6.673
3.917ArgThr: 3.917 ± 1.29
4.309ArgVal: 4.309 ± 1.825
0.0ArgTrp: 0.0 ± 0.0
2.35ArgTyr: 2.35 ± 0.942
0.0ArgXaa: 0.0 ± 0.0
Ser
1.958SerAla: 1.958 ± 0.697
0.783SerCys: 0.783 ± 0.787
5.875SerAsp: 5.875 ± 2.742
2.742SerGlu: 2.742 ± 1.277
3.525SerPhe: 3.525 ± 0.928
6.659SerGly: 6.659 ± 1.785
1.175SerHis: 1.175 ± 0.811
3.525SerIle: 3.525 ± 0.819
3.525SerLys: 3.525 ± 0.903
5.484SerLeu: 5.484 ± 1.269
1.175SerMet: 1.175 ± 1.059
3.917SerAsn: 3.917 ± 0.913
4.309SerPro: 4.309 ± 1.878
1.958SerGln: 1.958 ± 0.743
10.184SerArg: 10.184 ± 5.467
3.917SerSer: 3.917 ± 1.268
5.484SerThr: 5.484 ± 2.228
5.092SerVal: 5.092 ± 0.718
0.783SerTrp: 0.783 ± 0.415
0.392SerTyr: 0.392 ± 0.356
0.0SerXaa: 0.0 ± 0.0
Thr
2.742ThrAla: 2.742 ± 0.549
1.958ThrCys: 1.958 ± 0.919
3.134ThrAsp: 3.134 ± 1.049
5.092ThrGlu: 5.092 ± 0.878
2.742ThrPhe: 2.742 ± 0.568
4.309ThrGly: 4.309 ± 1.541
1.175ThrHis: 1.175 ± 0.666
5.092ThrIle: 5.092 ± 1.983
2.742ThrLys: 2.742 ± 0.912
3.917ThrLeu: 3.917 ± 1.01
0.392ThrMet: 0.392 ± 0.368
1.958ThrAsn: 1.958 ± 0.476
5.092ThrPro: 5.092 ± 1.996
2.35ThrGln: 2.35 ± 1.58
3.917ThrArg: 3.917 ± 1.108
3.917ThrSer: 3.917 ± 1.886
4.309ThrThr: 4.309 ± 1.179
5.092ThrVal: 5.092 ± 2.603
0.392ThrTrp: 0.392 ± 0.368
1.175ThrTyr: 1.175 ± 0.743
0.0ThrXaa: 0.0 ± 0.0
Val
3.134ValAla: 3.134 ± 0.874
1.175ValCys: 1.175 ± 0.97
4.309ValAsp: 4.309 ± 1.169
4.309ValGlu: 4.309 ± 1.045
1.175ValPhe: 1.175 ± 0.651
4.7ValGly: 4.7 ± 1.31
1.175ValHis: 1.175 ± 1.069
2.742ValIle: 2.742 ± 1.163
2.742ValLys: 2.742 ± 1.137
4.7ValLeu: 4.7 ± 0.785
0.392ValMet: 0.392 ± 0.368
3.134ValAsn: 3.134 ± 0.744
5.092ValPro: 5.092 ± 0.848
4.309ValGln: 4.309 ± 1.274
4.309ValArg: 4.309 ± 0.789
5.092ValSer: 5.092 ± 1.031
1.958ValThr: 1.958 ± 1.011
2.35ValVal: 2.35 ± 0.764
0.392ValTrp: 0.392 ± 0.316
1.958ValTyr: 1.958 ± 0.425
0.0ValXaa: 0.0 ± 0.0
Trp
0.783TrpAla: 0.783 ± 0.389
0.0TrpCys: 0.0 ± 0.0
0.392TrpAsp: 0.392 ± 0.316
0.783TrpGlu: 0.783 ± 0.52
0.392TrpPhe: 0.392 ± 0.353
0.392TrpGly: 0.392 ± 0.316
0.0TrpHis: 0.0 ± 0.0
0.783TrpIle: 0.783 ± 0.706
1.175TrpLys: 1.175 ± 0.924
1.175TrpLeu: 1.175 ± 0.615
0.392TrpMet: 0.392 ± 0.353
0.392TrpAsn: 0.392 ± 0.368
0.0TrpPro: 0.0 ± 0.0
0.783TrpGln: 0.783 ± 0.436
0.0TrpArg: 0.0 ± 0.0
1.958TrpSer: 1.958 ± 0.743
1.175TrpThr: 1.175 ± 0.803
1.175TrpVal: 1.175 ± 0.392
0.0TrpTrp: 0.0 ± 0.0
0.783TrpTyr: 0.783 ± 0.706
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.567TyrAla: 1.567 ± 0.416
0.392TyrCys: 0.392 ± 0.456
1.567TyrAsp: 1.567 ± 0.653
1.958TyrGlu: 1.958 ± 0.969
0.783TyrPhe: 0.783 ± 0.633
2.742TyrGly: 2.742 ± 0.521
1.567TyrHis: 1.567 ± 0.446
2.35TyrIle: 2.35 ± 0.499
3.525TyrLys: 3.525 ± 1.664
3.134TyrLeu: 3.134 ± 1.002
0.392TyrMet: 0.392 ± 0.316
1.567TyrAsn: 1.567 ± 0.893
1.958TyrPro: 1.958 ± 0.669
0.783TyrGln: 0.783 ± 0.415
1.958TyrArg: 1.958 ± 1.027
1.567TyrSer: 1.567 ± 0.569
1.958TyrThr: 1.958 ± 0.495
0.783TyrVal: 0.783 ± 0.446
1.175TyrTrp: 1.175 ± 0.628
2.742TyrTyr: 2.742 ± 1.277
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2554 amino acids)
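
Each table entry above carries a label of the form `AlaAla: 2.35 ± 0.637` (first residue, second residue, frequency in ‰, estimated error). A minimal Python sketch for reading such entries into structured values follows; the function name and the regular expression are illustrative assumptions, not part of Proteome-pI.

```python
import re

# Two three-letter residue codes, then "value ± error".
ENTRY_RE = re.compile(
    r"([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*([0-9.]+)\s*±\s*([0-9.]+)"
)


def parse_entry(line):
    """Return (first, second, frequency, error) or None if no entry is found.

    re.search is used so that any text preceding the label, such as a
    repeated cell value, is ignored.
    """
    match = ENTRY_RE.search(line)
    if match is None:
        return None
    first, second, freq, err = match.groups()
    return first, second, float(freq), float(err)


entry = parse_entry("2.35AlaAla: 2.35 ± 0.637")
```

Row labels such as `Ala` contain no entry and yield `None`, so they can be skipped when iterating over the lines.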

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
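
A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the error is taken as the standard deviation across replicates. The exact procedure used by the database is not described on this page, and the function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Assumption: each round draws len(proteins) whole proteins with
    replacement and pools their pair counts.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # Counter gives 0 if absent
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Hypothetical toy proteome, not taken from Betapapillomavirus 4.
toy_proteins = ["MALLK", "LLAG", "GGSLL"]
mean_ll, err_ll = bootstrap_error(toy_proteins, "LL")
```

A dipeptide that never occurs in any protein gives an error of exactly zero in this sketch, which is consistent with the `0.0 ± 0.0` entries in the table.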

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski