Amino acid dipepetide frequency for Chelonia mydas papillomavirus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.94AlaAla: 7.94 ± 2.833
1.764AlaCys: 1.764 ± 1.685
6.617AlaAsp: 6.617 ± 1.814
5.293AlaGlu: 5.293 ± 2.039
1.323AlaPhe: 1.323 ± 0.457
1.764AlaGly: 1.764 ± 0.567
0.882AlaHis: 0.882 ± 0.538
1.764AlaIle: 1.764 ± 0.865
2.647AlaLys: 2.647 ± 1.347
3.97AlaLeu: 3.97 ± 1.852
1.323AlaMet: 1.323 ± 0.623
0.441AlaAsn: 0.441 ± 0.356
3.529AlaPro: 3.529 ± 1.289
0.882AlaGln: 0.882 ± 0.671
5.734AlaArg: 5.734 ± 2.08
5.293AlaSer: 5.293 ± 1.239
5.293AlaThr: 5.293 ± 1.924
5.293AlaVal: 5.293 ± 1.712
0.441AlaTrp: 0.441 ± 0.359
1.764AlaTyr: 1.764 ± 0.546
0.0AlaXaa: 0.0 ± 0.0
Cys
1.764CysAla: 1.764 ± 0.84
0.882CysCys: 0.882 ± 0.571
0.441CysAsp: 0.441 ± 0.359
2.647CysGlu: 2.647 ± 1.813
0.882CysPhe: 0.882 ± 0.443
0.441CysGly: 0.441 ± 0.356
0.441CysHis: 0.441 ± 0.531
1.323CysIle: 1.323 ± 0.673
0.882CysLys: 0.882 ± 0.425
1.764CysLeu: 1.764 ± 0.865
1.323CysMet: 1.323 ± 0.598
0.0CysAsn: 0.0 ± 0.0
1.764CysPro: 1.764 ± 0.629
0.441CysGln: 0.441 ± 0.448
2.647CysArg: 2.647 ± 1.651
1.323CysSer: 1.323 ± 0.393
2.206CysThr: 2.206 ± 0.743
1.323CysVal: 1.323 ± 0.824
0.882CysTrp: 0.882 ± 0.429
0.441CysTyr: 0.441 ± 0.531
0.0CysXaa: 0.0 ± 0.0
Asp
4.411AspAla: 4.411 ± 0.644
1.323AspCys: 1.323 ± 0.344
4.411AspAsp: 4.411 ± 1.127
2.206AspGlu: 2.206 ± 1.031
2.647AspPhe: 2.647 ± 1.261
4.411AspGly: 4.411 ± 0.817
0.882AspHis: 0.882 ± 0.718
2.647AspIle: 2.647 ± 1.094
3.97AspLys: 3.97 ± 1.31
3.529AspLeu: 3.529 ± 1.28
0.882AspMet: 0.882 ± 0.834
2.647AspAsn: 2.647 ± 0.683
4.852AspPro: 4.852 ± 1.174
1.323AspGln: 1.323 ± 0.598
3.97AspArg: 3.97 ± 0.695
7.94AspSer: 7.94 ± 1.715
3.529AspThr: 3.529 ± 1.291
3.529AspVal: 3.529 ± 1.182
2.206AspTrp: 2.206 ± 0.786
3.529AspTyr: 3.529 ± 1.338
0.0AspXaa: 0.0 ± 0.0
Glu
2.647GluAla: 2.647 ± 0.914
0.441GluCys: 0.441 ± 0.359
3.529GluAsp: 3.529 ± 1.822
3.088GluGlu: 3.088 ± 0.933
2.206GluPhe: 2.206 ± 0.402
3.529GluGly: 3.529 ± 1.344
0.882GluHis: 0.882 ± 0.718
1.764GluIle: 1.764 ± 1.053
1.764GluLys: 1.764 ± 0.495
5.293GluLeu: 5.293 ± 0.98
1.764GluMet: 1.764 ± 0.546
1.764GluAsn: 1.764 ± 0.591
3.529GluPro: 3.529 ± 0.808
2.206GluGln: 2.206 ± 1.021
2.206GluArg: 2.206 ± 0.745
4.852GluSer: 4.852 ± 1.587
3.088GluThr: 3.088 ± 1.449
4.852GluVal: 4.852 ± 1.579
0.0GluTrp: 0.0 ± 0.0
2.206GluTyr: 2.206 ± 0.499
0.0GluXaa: 0.0 ± 0.0
Phe
2.206PheAla: 2.206 ± 1.046
0.0PheCys: 0.0 ± 0.0
3.97PheAsp: 3.97 ± 1.812
2.647PheGlu: 2.647 ± 1.23
2.647PhePhe: 2.647 ± 0.521
2.647PheGly: 2.647 ± 0.732
0.441PheHis: 0.441 ± 0.417
1.323PheIle: 1.323 ± 0.458
1.323PheLys: 1.323 ± 1.077
3.97PheLeu: 3.97 ± 0.934
0.882PheMet: 0.882 ± 0.429
0.882PheAsn: 0.882 ± 0.468
3.97PhePro: 3.97 ± 1.351
0.882PheGln: 0.882 ± 0.434
1.323PheArg: 1.323 ± 0.719
5.734PheSer: 5.734 ± 1.119
2.206PheThr: 2.206 ± 0.506
2.206PheVal: 2.206 ± 1.135
0.882PheTrp: 0.882 ± 0.429
1.323PheTyr: 1.323 ± 0.811
0.0PheXaa: 0.0 ± 0.0
Gly
2.647GlyAla: 2.647 ± 1.256
1.764GlyCys: 1.764 ± 0.462
4.411GlyAsp: 4.411 ± 0.814
2.647GlyGlu: 2.647 ± 0.862
2.647GlyPhe: 2.647 ± 0.652
5.734GlyGly: 5.734 ± 2.346
0.441GlyHis: 0.441 ± 0.448
3.088GlyIle: 3.088 ± 0.585
1.323GlyLys: 1.323 ± 1.077
3.088GlyLeu: 3.088 ± 0.912
1.323GlyMet: 1.323 ± 0.747
3.97GlyAsn: 3.97 ± 1.751
4.852GlyPro: 4.852 ± 1.127
2.206GlyGln: 2.206 ± 1.065
4.852GlyArg: 4.852 ± 0.995
5.293GlySer: 5.293 ± 1.058
7.499GlyThr: 7.499 ± 2.414
6.176GlyVal: 6.176 ± 1.309
0.882GlyTrp: 0.882 ± 0.718
0.0GlyTyr: 0.0 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
0.882HisAla: 0.882 ± 0.671
1.764HisCys: 1.764 ± 0.238
1.323HisAsp: 1.323 ± 0.624
0.882HisGlu: 0.882 ± 0.592
0.882HisPhe: 0.882 ± 0.718
1.323HisGly: 1.323 ± 0.344
0.882HisHis: 0.882 ± 0.429
0.882HisIle: 0.882 ± 0.468
0.441HisLys: 0.441 ± 0.531
1.764HisLeu: 1.764 ± 0.615
0.882HisMet: 0.882 ± 0.536
2.206HisAsn: 2.206 ± 1.07
1.764HisPro: 1.764 ± 0.95
0.0HisGln: 0.0 ± 0.0
0.0HisArg: 0.0 ± 0.0
1.764HisSer: 1.764 ± 0.77
1.764HisThr: 1.764 ± 0.615
0.441HisVal: 0.441 ± 0.417
0.882HisTrp: 0.882 ± 0.468
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
3.529IleAla: 3.529 ± 1.74
1.323IleCys: 1.323 ± 0.598
1.764IleAsp: 1.764 ± 0.99
5.734IleGlu: 5.734 ± 1.034
1.764IlePhe: 1.764 ± 0.51
3.088IleGly: 3.088 ± 0.824
0.0IleHis: 0.0 ± 0.0
0.882IleIle: 0.882 ± 0.712
2.206IleLys: 2.206 ± 0.502
3.97IleLeu: 3.97 ± 1.007
0.0IleMet: 0.0 ± 0.0
1.764IleAsn: 1.764 ± 0.629
2.206IlePro: 2.206 ± 0.707
3.529IleGln: 3.529 ± 0.633
2.206IleArg: 2.206 ± 0.894
4.852IleSer: 4.852 ± 1.521
1.323IleThr: 1.323 ± 0.702
3.529IleVal: 3.529 ± 1.281
0.441IleTrp: 0.441 ± 0.385
0.0IleTyr: 0.0 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
3.088LysAla: 3.088 ± 0.479
2.206LysCys: 2.206 ± 0.637
1.323LysAsp: 1.323 ± 0.393
2.647LysGlu: 2.647 ± 1.098
1.764LysPhe: 1.764 ± 0.724
2.206LysGly: 2.206 ± 0.806
1.323LysHis: 1.323 ± 0.393
2.647LysIle: 2.647 ± 1.411
5.293LysLys: 5.293 ± 1.137
3.088LysLeu: 3.088 ± 1.279
0.882LysMet: 0.882 ± 0.425
1.764LysAsn: 1.764 ± 0.615
1.323LysPro: 1.323 ± 0.393
2.206LysGln: 2.206 ± 0.657
2.206LysArg: 2.206 ± 0.663
4.852LysSer: 4.852 ± 1.169
1.764LysThr: 1.764 ± 0.851
1.764LysVal: 1.764 ± 0.691
1.764LysTrp: 1.764 ± 0.691
3.088LysTyr: 3.088 ± 0.937
0.0LysXaa: 0.0 ± 0.0
Leu
3.529LeuAla: 3.529 ± 0.763
3.529LeuCys: 3.529 ± 1.187
3.97LeuAsp: 3.97 ± 1.012
3.97LeuGlu: 3.97 ± 0.77
3.529LeuPhe: 3.529 ± 0.634
6.176LeuGly: 6.176 ± 0.583
3.529LeuHis: 3.529 ± 1.135
4.411LeuIle: 4.411 ± 0.989
4.411LeuLys: 4.411 ± 1.057
7.499LeuLeu: 7.499 ± 2.653
1.764LeuMet: 1.764 ± 0.958
1.764LeuAsn: 1.764 ± 0.859
4.852LeuPro: 4.852 ± 0.951
5.293LeuGln: 5.293 ± 1.146
4.411LeuArg: 4.411 ± 0.882
6.176LeuSer: 6.176 ± 1.132
4.852LeuThr: 4.852 ± 0.839
2.647LeuVal: 2.647 ± 0.563
1.323LeuTrp: 1.323 ± 0.344
3.088LeuTyr: 3.088 ± 0.848
0.0LeuXaa: 0.0 ± 0.0
Met
1.323MetAla: 1.323 ± 0.673
0.441MetCys: 0.441 ± 0.417
1.764MetAsp: 1.764 ± 0.69
0.882MetGlu: 0.882 ± 0.494
0.0MetPhe: 0.0 ± 0.0
0.882MetGly: 0.882 ± 0.834
0.441MetHis: 0.441 ± 0.48
0.882MetIle: 0.882 ± 0.712
0.882MetLys: 0.882 ± 0.494
3.088MetLeu: 3.088 ± 0.997
0.441MetMet: 0.441 ± 0.356
0.441MetAsn: 0.441 ± 0.417
1.764MetPro: 1.764 ± 1.041
1.323MetGln: 1.323 ± 0.719
0.0MetArg: 0.0 ± 0.0
1.323MetSer: 1.323 ± 0.673
1.323MetThr: 1.323 ± 0.609
2.206MetVal: 2.206 ± 1.07
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
4.852AsnAla: 4.852 ± 1.354
0.441AsnCys: 0.441 ± 0.359
0.882AsnAsp: 0.882 ± 0.834
1.323AsnGlu: 1.323 ± 0.719
0.882AsnPhe: 0.882 ± 0.834
2.647AsnGly: 2.647 ± 1.019
0.882AsnHis: 0.882 ± 0.834
3.088AsnIle: 3.088 ± 0.739
2.206AsnLys: 2.206 ± 1.07
0.882AsnLeu: 0.882 ± 0.468
1.323AsnMet: 1.323 ± 0.783
1.764AsnAsn: 1.764 ± 1.031
3.529AsnPro: 3.529 ± 0.857
1.764AsnGln: 1.764 ± 0.69
1.323AsnArg: 1.323 ± 0.393
2.206AsnSer: 2.206 ± 1.07
1.764AsnThr: 1.764 ± 0.629
2.206AsnVal: 2.206 ± 1.01
0.0AsnTrp: 0.0 ± 0.0
0.882AsnTyr: 0.882 ± 0.429
0.0AsnXaa: 0.0 ± 0.0
Pro
7.499ProAla: 7.499 ± 2.219
1.323ProCys: 1.323 ± 0.698
5.293ProAsp: 5.293 ± 0.91
1.764ProGlu: 1.764 ± 0.947
3.97ProPhe: 3.97 ± 0.914
6.176ProGly: 6.176 ± 2.467
1.323ProHis: 1.323 ± 0.555
2.647ProIle: 2.647 ± 0.888
3.529ProLys: 3.529 ± 1.033
6.617ProLeu: 6.617 ± 1.681
1.764ProMet: 1.764 ± 0.724
2.206ProAsn: 2.206 ± 1.07
12.351ProPro: 12.351 ± 4.283
1.764ProGln: 1.764 ± 1.048
5.293ProArg: 5.293 ± 2.327
5.734ProSer: 5.734 ± 1.342
5.293ProThr: 5.293 ± 2.235
4.411ProVal: 4.411 ± 1.005
0.0ProTrp: 0.0 ± 0.0
1.764ProTyr: 1.764 ± 1.182
0.0ProXaa: 0.0 ± 0.0
Gln
1.323GlnAla: 1.323 ± 1.077
0.441GlnCys: 0.441 ± 0.448
1.764GlnAsp: 1.764 ± 0.901
2.647GlnGlu: 2.647 ± 1.114
2.206GlnPhe: 2.206 ± 0.881
1.323GlnGly: 1.323 ± 0.393
0.441GlnHis: 0.441 ± 0.417
1.764GlnIle: 1.764 ± 0.59
1.323GlnLys: 1.323 ± 0.624
4.411GlnLeu: 4.411 ± 0.83
0.0GlnMet: 0.0 ± 0.0
1.323GlnAsn: 1.323 ± 0.698
2.206GlnPro: 2.206 ± 0.656
2.647GlnGln: 2.647 ± 1.084
4.411GlnArg: 4.411 ± 0.856
4.411GlnSer: 4.411 ± 1.541
2.647GlnThr: 2.647 ± 0.834
2.206GlnVal: 2.206 ± 1.257
0.882GlnTrp: 0.882 ± 0.425
2.206GlnTyr: 2.206 ± 1.089
0.0GlnXaa: 0.0 ± 0.0
Arg
1.764ArgAla: 1.764 ± 1.542
1.764ArgCys: 1.764 ± 0.754
3.088ArgAsp: 3.088 ± 0.829
2.206ArgGlu: 2.206 ± 0.499
3.088ArgPhe: 3.088 ± 0.689
3.97ArgGly: 3.97 ± 1.664
1.323ArgHis: 1.323 ± 0.769
3.088ArgIle: 3.088 ± 0.695
3.088ArgLys: 3.088 ± 0.884
7.94ArgLeu: 7.94 ± 0.959
0.0ArgMet: 0.0 ± 0.4
2.647ArgAsn: 2.647 ± 0.888
3.088ArgPro: 3.088 ± 0.511
3.529ArgGln: 3.529 ± 1.116
5.293ArgArg: 5.293 ± 1.431
3.97ArgSer: 3.97 ± 1.26
2.647ArgThr: 2.647 ± 1.004
4.852ArgVal: 4.852 ± 1.626
0.441ArgTrp: 0.441 ± 0.385
0.882ArgTyr: 0.882 ± 0.549
0.0ArgXaa: 0.0 ± 0.0
Ser
6.176SerAla: 6.176 ± 0.824
0.882SerCys: 0.882 ± 0.571
6.617SerAsp: 6.617 ± 3.034
2.206SerGlu: 2.206 ± 1.018
3.088SerPhe: 3.088 ± 1.07
8.381SerGly: 8.381 ± 1.259
3.529SerHis: 3.529 ± 1.119
3.97SerIle: 3.97 ± 0.691
2.206SerLys: 2.206 ± 1.257
7.499SerLeu: 7.499 ± 2.569
1.323SerMet: 1.323 ± 0.738
3.97SerAsn: 3.97 ± 0.812
7.94SerPro: 7.94 ± 3.235
2.647SerGln: 2.647 ± 1.007
4.852SerArg: 4.852 ± 0.754
7.058SerSer: 7.058 ± 1.114
10.587SerThr: 10.587 ± 1.418
5.293SerVal: 5.293 ± 1.222
0.441SerTrp: 0.441 ± 0.417
0.882SerTyr: 0.882 ± 0.616
0.0SerXaa: 0.0 ± 0.0
Thr
3.529ThrAla: 3.529 ± 1.168
1.323ThrCys: 1.323 ± 0.673
4.852ThrAsp: 4.852 ± 1.507
4.411ThrGlu: 4.411 ± 0.93
2.206ThrPhe: 2.206 ± 1.075
3.529ThrGly: 3.529 ± 1.466
0.441ThrHis: 0.441 ± 0.356
3.529ThrIle: 3.529 ± 0.634
3.97ThrLys: 3.97 ± 0.812
4.852ThrLeu: 4.852 ± 2.479
1.323ThrMet: 1.323 ± 0.811
1.323ThrAsn: 1.323 ± 0.719
8.381ThrPro: 8.381 ± 2.434
0.882ThrGln: 0.882 ± 0.718
3.97ThrArg: 3.97 ± 0.667
7.499ThrSer: 7.499 ± 1.31
4.411ThrThr: 4.411 ± 1.539
3.529ThrVal: 3.529 ± 1.114
1.323ThrTrp: 1.323 ± 0.457
3.088ThrTyr: 3.088 ± 2.078
0.0ThrXaa: 0.0 ± 0.0
Val
0.882ValAla: 0.882 ± 0.494
1.323ValCys: 1.323 ± 0.992
5.293ValAsp: 5.293 ± 1.356
3.529ValGlu: 3.529 ± 0.481
4.411ValPhe: 4.411 ± 1.209
4.852ValGly: 4.852 ± 1.402
1.323ValHis: 1.323 ± 0.738
1.323ValIle: 1.323 ± 0.811
1.764ValLys: 1.764 ± 0.831
3.529ValLeu: 3.529 ± 0.71
1.323ValMet: 1.323 ± 0.609
2.647ValAsn: 2.647 ± 0.892
6.617ValPro: 6.617 ± 1.186
5.293ValGln: 5.293 ± 0.725
1.764ValArg: 1.764 ± 0.695
5.293ValSer: 5.293 ± 1.515
3.97ValThr: 3.97 ± 0.889
3.97ValVal: 3.97 ± 2.006
1.323ValTrp: 1.323 ± 0.769
1.764ValTyr: 1.764 ± 0.706
0.0ValXaa: 0.0 ± 0.0
Trp
1.764TrpAla: 1.764 ± 0.629
0.0TrpCys: 0.0 ± 0.0
1.764TrpAsp: 1.764 ± 0.858
0.0TrpGlu: 0.0 ± 0.0
0.882TrpPhe: 0.882 ± 0.468
0.882TrpGly: 0.882 ± 0.443
0.882TrpHis: 0.882 ± 0.468
1.323TrpIle: 1.323 ± 0.393
1.323TrpLys: 1.323 ± 0.702
1.323TrpLeu: 1.323 ± 0.673
0.0TrpMet: 0.0 ± 0.0
0.441TrpAsn: 0.441 ± 0.359
0.0TrpPro: 0.0 ± 0.0
0.441TrpGln: 0.441 ± 0.417
0.882TrpArg: 0.882 ± 0.771
0.882TrpSer: 0.882 ± 0.494
1.323TrpThr: 1.323 ± 0.458
0.441TrpVal: 0.441 ± 0.359
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.206TyrAla: 2.206 ± 0.424
0.882TyrCys: 0.882 ± 0.614
1.764TyrAsp: 1.764 ± 1.436
0.882TyrGlu: 0.882 ± 0.429
0.441TyrPhe: 0.441 ± 0.417
0.882TyrGly: 0.882 ± 0.443
0.441TyrHis: 0.441 ± 0.417
1.764TyrIle: 1.764 ± 0.567
2.647TyrLys: 2.647 ± 0.941
2.647TyrLeu: 2.647 ± 0.529
0.441TyrMet: 0.441 ± 0.363
0.882TyrAsn: 0.882 ± 0.443
2.206TyrPro: 2.206 ± 0.944
1.323TyrGln: 1.323 ± 0.598
1.764TyrArg: 1.764 ± 0.69
3.088TyrSer: 3.088 ± 0.648
0.882TyrThr: 0.882 ± 0.425
1.323TyrVal: 1.323 ± 0.673
0.441TyrTrp: 0.441 ± 0.417
0.882TyrTyr: 0.882 ± 0.494
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2268 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski