Amino acid dipepetide frequency for Lynx rufus papillomavirus type 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.421AlaAla: 6.421 ± 1.691
1.284AlaCys: 1.284 ± 0.867
5.993AlaAsp: 5.993 ± 1.192
3.425AlaGlu: 3.425 ± 1.44
3.853AlaPhe: 3.853 ± 1.019
1.712AlaGly: 1.712 ± 0.623
0.856AlaHis: 0.856 ± 0.695
0.856AlaIle: 0.856 ± 0.406
3.853AlaLys: 3.853 ± 1.909
5.565AlaLeu: 5.565 ± 1.265
1.284AlaMet: 1.284 ± 0.648
1.284AlaAsn: 1.284 ± 0.429
2.568AlaPro: 2.568 ± 0.919
2.568AlaGln: 2.568 ± 1.021
3.853AlaArg: 3.853 ± 0.97
3.425AlaSer: 3.425 ± 0.709
4.281AlaThr: 4.281 ± 1.147
4.281AlaVal: 4.281 ± 1.846
0.428AlaTrp: 0.428 ± 0.348
1.284AlaTyr: 1.284 ± 0.806
0.0AlaXaa: 0.0 ± 0.0
Cys
2.14CysAla: 2.14 ± 1.292
0.856CysCys: 0.856 ± 1.22
0.0CysAsp: 0.0 ± 0.0
0.856CysGlu: 0.856 ± 0.64
1.284CysPhe: 1.284 ± 0.404
2.14CysGly: 2.14 ± 1.741
0.428CysHis: 0.428 ± 0.559
1.284CysIle: 1.284 ± 0.703
2.14CysLys: 2.14 ± 0.909
2.14CysLeu: 2.14 ± 1.164
0.856CysMet: 0.856 ± 0.639
0.856CysAsn: 0.856 ± 0.375
1.712CysPro: 1.712 ± 0.603
1.284CysGln: 1.284 ± 0.806
1.712CysArg: 1.712 ± 1.251
1.284CysSer: 1.284 ± 0.961
0.428CysThr: 0.428 ± 0.61
1.284CysVal: 1.284 ± 1.358
0.428CysTrp: 0.428 ± 0.32
0.856CysTyr: 0.856 ± 0.904
0.0CysXaa: 0.0 ± 0.0
Asp
3.425AspAla: 3.425 ± 1.177
1.284AspCys: 1.284 ± 0.961
2.14AspAsp: 2.14 ± 0.613
4.281AspGlu: 4.281 ± 0.698
1.712AspPhe: 1.712 ± 0.627
3.425AspGly: 3.425 ± 1.394
0.856AspHis: 0.856 ± 0.406
5.137AspIle: 5.137 ± 1.486
4.281AspLys: 4.281 ± 0.518
8.134AspLeu: 8.134 ± 1.964
0.856AspMet: 0.856 ± 0.423
2.568AspAsn: 2.568 ± 0.808
5.137AspPro: 5.137 ± 1.732
2.14AspGln: 2.14 ± 0.822
4.281AspArg: 4.281 ± 1.719
3.853AspSer: 3.853 ± 1.214
2.568AspThr: 2.568 ± 1.332
3.853AspVal: 3.853 ± 1.344
1.284AspTrp: 1.284 ± 0.961
1.712AspTyr: 1.712 ± 0.667
0.0AspXaa: 0.0 ± 0.0
Glu
2.568GluAla: 2.568 ± 0.88
1.284GluCys: 1.284 ± 0.961
6.421GluAsp: 6.421 ± 0.858
5.137GluGlu: 5.137 ± 1.127
2.14GluPhe: 2.14 ± 0.822
2.568GluGly: 2.568 ± 1.219
0.856GluHis: 0.856 ± 0.63
2.997GluIle: 2.997 ± 0.979
2.568GluLys: 2.568 ± 0.898
3.425GluLeu: 3.425 ± 0.782
0.428GluMet: 0.428 ± 0.32
3.425GluAsn: 3.425 ± 1.146
3.853GluPro: 3.853 ± 0.683
5.565GluGln: 5.565 ± 0.996
3.853GluArg: 3.853 ± 1.902
2.997GluSer: 2.997 ± 0.618
4.281GluThr: 4.281 ± 1.521
4.709GluVal: 4.709 ± 1.278
0.856GluTrp: 0.856 ± 0.64
0.428GluTyr: 0.428 ± 0.348
0.0GluXaa: 0.0 ± 0.0
Phe
2.568PheAla: 2.568 ± 0.7
2.14PheCys: 2.14 ± 1.02
3.853PheAsp: 3.853 ± 1.08
2.568PheGlu: 2.568 ± 1.175
1.712PhePhe: 1.712 ± 0.97
2.14PheGly: 2.14 ± 0.664
0.428PheHis: 0.428 ± 0.32
0.428PheIle: 0.428 ± 0.348
1.712PheLys: 1.712 ± 0.92
5.137PheLeu: 5.137 ± 1.947
1.712PheMet: 1.712 ± 0.585
1.284PheAsn: 1.284 ± 0.694
2.14PhePro: 2.14 ± 1.05
2.14PheGln: 2.14 ± 0.564
2.997PheArg: 2.997 ± 0.606
2.568PheSer: 2.568 ± 0.782
1.284PheThr: 1.284 ± 0.806
0.428PheVal: 0.428 ± 0.348
1.712PheTrp: 1.712 ± 0.891
2.14PheTyr: 2.14 ± 0.775
0.0PheXaa: 0.0 ± 0.0
Gly
2.568GlyAla: 2.568 ± 0.975
0.856GlyCys: 0.856 ± 0.63
4.709GlyAsp: 4.709 ± 1.089
5.137GlyGlu: 5.137 ± 1.114
1.284GlyPhe: 1.284 ± 0.379
4.709GlyGly: 4.709 ± 1.523
2.14GlyHis: 2.14 ± 0.824
2.568GlyIle: 2.568 ± 0.695
2.14GlyLys: 2.14 ± 1.222
6.421GlyLeu: 6.421 ± 2.56
0.856GlyMet: 0.856 ± 0.582
1.284GlyAsn: 1.284 ± 0.379
2.997GlyPro: 2.997 ± 0.809
5.137GlyGln: 5.137 ± 0.698
3.425GlyArg: 3.425 ± 1.165
6.849GlySer: 6.849 ± 2.319
5.993GlyThr: 5.993 ± 1.872
3.425GlyVal: 3.425 ± 0.811
0.428GlyTrp: 0.428 ± 0.368
0.856GlyTyr: 0.856 ± 0.619
0.0GlyXaa: 0.0 ± 0.0
His
0.856HisAla: 0.856 ± 0.695
0.856HisCys: 0.856 ± 0.567
1.284HisAsp: 1.284 ± 0.654
1.712HisGlu: 1.712 ± 0.56
0.0HisPhe: 0.0 ± 0.0
1.284HisGly: 1.284 ± 0.973
0.0HisHis: 0.0 ± 0.0
0.856HisIle: 0.856 ± 0.684
0.856HisLys: 0.856 ± 0.639
2.14HisLeu: 2.14 ± 1.426
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
0.856HisPro: 0.856 ± 0.423
0.856HisGln: 0.856 ± 0.567
0.856HisArg: 0.856 ± 0.567
0.856HisSer: 0.856 ± 0.41
0.856HisThr: 0.856 ± 0.695
2.14HisVal: 2.14 ± 0.478
0.856HisTrp: 0.856 ± 0.463
0.856HisTyr: 0.856 ± 0.406
0.0HisXaa: 0.0 ± 0.0
Ile
2.14IleAla: 2.14 ± 0.705
0.428IleCys: 0.428 ± 0.32
1.712IleAsp: 1.712 ± 0.56
2.568IleGlu: 2.568 ± 1.017
0.856IlePhe: 0.856 ± 0.423
2.568IleGly: 2.568 ± 1.106
0.856IleHis: 0.856 ± 0.375
1.712IleIle: 1.712 ± 1.014
1.284IleLys: 1.284 ± 0.404
2.997IleLeu: 2.997 ± 1.226
0.0IleMet: 0.0 ± 0.0
1.712IleAsn: 1.712 ± 0.603
1.712IlePro: 1.712 ± 1.121
2.14IleGln: 2.14 ± 1.044
2.14IleArg: 2.14 ± 2.003
4.709IleSer: 4.709 ± 1.157
1.284IleThr: 1.284 ± 0.404
1.284IleVal: 1.284 ± 0.608
0.428IleTrp: 0.428 ± 0.342
1.284IleTyr: 1.284 ± 0.429
0.0IleXaa: 0.0 ± 0.0
Lys
3.853LysAla: 3.853 ± 0.867
1.712LysCys: 1.712 ± 0.627
0.856LysAsp: 0.856 ± 0.41
3.425LysGlu: 3.425 ± 0.959
2.568LysPhe: 2.568 ± 0.931
3.425LysGly: 3.425 ± 1.36
1.284LysHis: 1.284 ± 0.636
0.856LysIle: 0.856 ± 0.684
3.853LysLys: 3.853 ± 1.23
2.997LysLeu: 2.997 ± 1.432
0.428LysMet: 0.428 ± 0.596
2.14LysAsn: 2.14 ± 0.738
2.997LysPro: 2.997 ± 1.028
0.856LysGln: 0.856 ± 0.447
4.281LysArg: 4.281 ± 0.735
4.281LysSer: 4.281 ± 1.687
3.425LysThr: 3.425 ± 1.183
2.997LysVal: 2.997 ± 0.787
0.0LysTrp: 0.0 ± 0.0
1.284LysTyr: 1.284 ± 0.404
0.0LysXaa: 0.0 ± 0.0
Leu
4.281LeuAla: 4.281 ± 0.845
2.14LeuCys: 2.14 ± 1.758
4.709LeuAsp: 4.709 ± 0.545
3.853LeuGlu: 3.853 ± 1.451
6.421LeuPhe: 6.421 ± 0.92
7.705LeuGly: 7.705 ± 2.34
2.14LeuHis: 2.14 ± 0.441
1.284LeuIle: 1.284 ± 0.604
4.709LeuLys: 4.709 ± 1.331
10.702LeuLeu: 10.702 ± 2.221
2.14LeuMet: 2.14 ± 0.864
2.568LeuAsn: 2.568 ± 0.583
5.565LeuPro: 5.565 ± 1.669
7.277LeuGln: 7.277 ± 1.671
8.562LeuArg: 8.562 ± 1.558
8.134LeuSer: 8.134 ± 2.633
5.993LeuThr: 5.993 ± 2.109
5.565LeuVal: 5.565 ± 0.868
0.856LeuTrp: 0.856 ± 0.41
2.997LeuTyr: 2.997 ± 0.925
0.0LeuXaa: 0.0 ± 0.0
Met
2.14MetAla: 2.14 ± 0.979
0.428MetCys: 0.428 ± 0.32
2.568MetAsp: 2.568 ± 1.043
0.856MetGlu: 0.856 ± 0.406
0.856MetPhe: 0.856 ± 0.375
0.428MetGly: 0.428 ± 0.32
0.0MetHis: 0.0 ± 0.0
0.428MetIle: 0.428 ± 0.54
0.0MetLys: 0.0 ± 0.0
1.284MetLeu: 1.284 ± 0.378
0.0MetMet: 0.0 ± 0.0
0.428MetAsn: 0.428 ± 0.368
0.428MetPro: 0.428 ± 0.54
0.428MetGln: 0.428 ± 0.32
1.284MetArg: 1.284 ± 0.604
1.284MetSer: 1.284 ± 0.647
0.856MetThr: 0.856 ± 0.63
1.712MetVal: 1.712 ± 0.749
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.14AsnAla: 2.14 ± 0.83
0.856AsnCys: 0.856 ± 0.639
2.14AsnAsp: 2.14 ± 0.592
2.14AsnGlu: 2.14 ± 1.065
0.428AsnPhe: 0.428 ± 0.348
1.712AsnGly: 1.712 ± 0.603
0.856AsnHis: 0.856 ± 0.541
0.856AsnIle: 0.856 ± 0.463
1.712AsnLys: 1.712 ± 1.014
1.284AsnLeu: 1.284 ± 0.429
0.856AsnMet: 0.856 ± 0.423
1.284AsnAsn: 1.284 ± 0.648
3.425AsnPro: 3.425 ± 1.574
3.425AsnGln: 3.425 ± 0.906
2.14AsnArg: 2.14 ± 0.687
1.284AsnSer: 1.284 ± 0.404
2.997AsnThr: 2.997 ± 0.82
2.568AsnVal: 2.568 ± 1.265
0.0AsnTrp: 0.0 ± 0.0
1.712AsnTyr: 1.712 ± 0.603
0.0AsnXaa: 0.0 ± 0.0
Pro
4.281ProAla: 4.281 ± 2.196
0.856ProCys: 0.856 ± 0.639
4.281ProAsp: 4.281 ± 1.523
3.425ProGlu: 3.425 ± 0.831
1.712ProPhe: 1.712 ± 0.661
2.568ProGly: 2.568 ± 0.803
1.712ProHis: 1.712 ± 1.121
2.997ProIle: 2.997 ± 1.257
3.853ProLys: 3.853 ± 0.992
4.709ProLeu: 4.709 ± 1.027
0.856ProMet: 0.856 ± 0.569
2.568ProAsn: 2.568 ± 1.646
9.846ProPro: 9.846 ± 4.153
2.997ProGln: 2.997 ± 1.665
5.993ProArg: 5.993 ± 1.796
4.709ProSer: 4.709 ± 1.474
5.137ProThr: 5.137 ± 1.734
5.565ProVal: 5.565 ± 1.389
0.428ProTrp: 0.428 ± 0.368
2.14ProTyr: 2.14 ± 0.938
0.0ProXaa: 0.0 ± 0.0
Gln
2.14GlnAla: 2.14 ± 0.71
2.568GlnCys: 2.568 ± 0.781
1.284GlnAsp: 1.284 ± 0.59
2.997GlnGlu: 2.997 ± 0.568
1.712GlnPhe: 1.712 ± 0.713
5.565GlnGly: 5.565 ± 0.702
0.856GlnHis: 0.856 ± 0.567
1.284GlnIle: 1.284 ± 0.378
1.712GlnLys: 1.712 ± 0.894
5.993GlnLeu: 5.993 ± 1.339
1.712GlnMet: 1.712 ± 0.8
2.14GlnAsn: 2.14 ± 1.096
3.853GlnPro: 3.853 ± 0.908
3.425GlnGln: 3.425 ± 0.562
2.568GlnArg: 2.568 ± 0.873
3.425GlnSer: 3.425 ± 0.94
2.997GlnThr: 2.997 ± 1.019
3.425GlnVal: 3.425 ± 0.819
0.856GlnTrp: 0.856 ± 0.639
1.284GlnTyr: 1.284 ± 0.379
0.0GlnXaa: 0.0 ± 0.0
Arg
3.853ArgAla: 3.853 ± 0.617
1.712ArgCys: 1.712 ± 1.401
4.709ArgAsp: 4.709 ± 2.1
2.568ArgGlu: 2.568 ± 0.589
2.568ArgPhe: 2.568 ± 1.017
5.137ArgGly: 5.137 ± 1.301
1.712ArgHis: 1.712 ± 0.852
1.284ArgIle: 1.284 ± 0.686
3.425ArgLys: 3.425 ± 0.757
9.418ArgLeu: 9.418 ± 1.824
0.856ArgMet: 0.856 ± 0.41
1.712ArgAsn: 1.712 ± 0.969
3.853ArgPro: 3.853 ± 1.455
2.568ArgGln: 2.568 ± 1.228
5.993ArgArg: 5.993 ± 1.687
5.993ArgSer: 5.993 ± 0.766
4.281ArgThr: 4.281 ± 1.192
6.421ArgVal: 6.421 ± 1.081
1.284ArgTrp: 1.284 ± 0.404
3.425ArgTyr: 3.425 ± 0.72
0.0ArgXaa: 0.0 ± 0.0
Ser
5.137SerAla: 5.137 ± 1.201
0.428SerCys: 0.428 ± 0.61
6.421SerAsp: 6.421 ± 1.537
2.14SerGlu: 2.14 ± 0.549
2.997SerPhe: 2.997 ± 1.071
6.421SerGly: 6.421 ± 1.464
0.428SerHis: 0.428 ± 0.342
1.284SerIle: 1.284 ± 0.647
2.14SerLys: 2.14 ± 0.71
10.702SerLeu: 10.702 ± 1.707
0.428SerMet: 0.428 ± 0.32
2.14SerAsn: 2.14 ± 0.891
5.565SerPro: 5.565 ± 1.392
3.425SerGln: 3.425 ± 0.628
5.137SerArg: 5.137 ± 1.616
3.425SerSer: 3.425 ± 1.024
4.709SerThr: 4.709 ± 1.621
6.849SerVal: 6.849 ± 1.069
0.856SerTrp: 0.856 ± 0.375
1.712SerTyr: 1.712 ± 0.56
0.0SerXaa: 0.0 ± 0.0
Thr
2.997ThrAla: 2.997 ± 0.808
1.284ThrCys: 1.284 ± 0.554
4.281ThrAsp: 4.281 ± 1.17
3.425ThrGlu: 3.425 ± 1.252
2.568ThrPhe: 2.568 ± 0.655
3.425ThrGly: 3.425 ± 0.891
0.428ThrHis: 0.428 ± 0.368
2.14ThrIle: 2.14 ± 1.159
2.14ThrLys: 2.14 ± 0.909
3.853ThrLeu: 3.853 ± 0.69
1.284ThrMet: 1.284 ± 0.379
2.568ThrAsn: 2.568 ± 1.646
5.565ThrPro: 5.565 ± 1.372
1.712ThrGln: 1.712 ± 0.603
6.421ThrArg: 6.421 ± 1.802
5.993ThrSer: 5.993 ± 2.209
5.565ThrThr: 5.565 ± 1.098
4.281ThrVal: 4.281 ± 0.91
1.284ThrTrp: 1.284 ± 1.105
2.14ThrTyr: 2.14 ± 0.687
0.0ThrXaa: 0.0 ± 0.0
Val
2.568ValAla: 2.568 ± 0.763
1.284ValCys: 1.284 ± 0.706
3.425ValAsp: 3.425 ± 0.691
7.277ValGlu: 7.277 ± 2.066
3.425ValPhe: 3.425 ± 1.285
4.709ValGly: 4.709 ± 1.547
1.284ValHis: 1.284 ± 0.686
3.425ValIle: 3.425 ± 1.133
2.14ValLys: 2.14 ± 0.729
6.849ValLeu: 6.849 ± 1.199
0.428ValMet: 0.428 ± 0.289
3.425ValAsn: 3.425 ± 1.12
6.421ValPro: 6.421 ± 1.427
1.712ValGln: 1.712 ± 0.906
4.709ValArg: 4.709 ± 0.959
5.565ValSer: 5.565 ± 1.081
3.853ValThr: 3.853 ± 1.279
5.137ValVal: 5.137 ± 2.062
0.856ValTrp: 0.856 ± 0.695
0.856ValTyr: 0.856 ± 0.64
0.0ValXaa: 0.0 ± 0.0
Trp
1.712TrpAla: 1.712 ± 0.563
0.428TrpCys: 0.428 ± 0.348
0.856TrpAsp: 0.856 ± 0.406
0.856TrpGlu: 0.856 ± 0.695
0.0TrpPhe: 0.0 ± 0.0
0.856TrpGly: 0.856 ± 0.41
0.428TrpHis: 0.428 ± 0.348
1.284TrpIle: 1.284 ± 0.604
1.284TrpLys: 1.284 ± 0.703
1.712TrpLeu: 1.712 ± 0.891
0.0TrpMet: 0.0 ± 0.0
0.428TrpAsn: 0.428 ± 0.32
0.0TrpPro: 0.0 ± 0.0
0.428TrpGln: 0.428 ± 0.368
0.428TrpArg: 0.428 ± 0.368
0.428TrpSer: 0.428 ± 0.368
0.856TrpThr: 0.856 ± 0.736
1.284TrpVal: 1.284 ± 0.604
0.0TrpTrp: 0.0 ± 0.0
0.428TrpTyr: 0.428 ± 0.32
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.284TyrAla: 1.284 ± 0.648
1.284TyrCys: 1.284 ± 0.977
1.284TyrAsp: 1.284 ± 0.378
1.712TyrGlu: 1.712 ± 0.887
2.997TyrPhe: 2.997 ± 0.474
1.284TyrGly: 1.284 ± 0.379
0.428TyrHis: 0.428 ± 0.348
0.856TyrIle: 0.856 ± 0.695
2.14TyrLys: 2.14 ± 1.0
2.14TyrLeu: 2.14 ± 0.953
0.0TyrMet: 0.0 ± 0.321
0.0TyrAsn: 0.0 ± 0.0
2.14TyrPro: 2.14 ± 0.826
1.712TyrGln: 1.712 ± 0.957
2.14TyrArg: 2.14 ± 0.729
1.284TyrSer: 1.284 ± 0.654
1.712TyrThr: 1.712 ± 0.561
2.14TyrVal: 2.14 ± 0.423
0.856TyrTrp: 0.856 ± 0.375
1.284TyrTyr: 1.284 ± 0.71
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2337 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski