Amino acid dipepetide frequency for Woodchuck hepatitis virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.389AlaAla: 5.389 ± 2.242
2.994AlaCys: 2.994 ± 1.296
2.395AlaAsp: 2.395 ± 1.513
1.198AlaGlu: 1.198 ± 1.16
1.796AlaPhe: 1.796 ± 1.135
2.994AlaGly: 2.994 ± 1.084
0.599AlaHis: 0.599 ± 0.378
2.395AlaIle: 2.395 ± 1.513
0.0AlaLys: 0.0 ± 0.0
7.186AlaLeu: 7.186 ± 1.731
0.0AlaMet: 0.0 ± 0.0
2.994AlaAsn: 2.994 ± 1.296
1.796AlaPro: 1.796 ± 1.632
2.994AlaGln: 2.994 ± 1.584
4.79AlaArg: 4.79 ± 0.988
6.587AlaSer: 6.587 ± 5.989
4.192AlaThr: 4.192 ± 1.842
2.395AlaVal: 2.395 ± 0.577
2.395AlaTrp: 2.395 ± 1.135
1.198AlaTyr: 1.198 ± 0.51
0.0AlaXaa: 0.0 ± 0.0
Cys
0.599CysAla: 0.599 ± 1.25
2.395CysCys: 2.395 ± 2.543
0.0CysAsp: 0.0 ± 0.0
1.198CysGlu: 1.198 ± 0.756
0.599CysPhe: 0.599 ± 1.25
1.796CysGly: 1.796 ± 1.135
0.0CysHis: 0.0 ± 0.0
0.599CysIle: 0.599 ± 0.378
0.599CysLys: 0.599 ± 0.641
5.389CysLeu: 5.389 ± 2.083
0.599CysMet: 0.599 ± 1.508
0.599CysAsn: 0.599 ± 0.641
1.796CysPro: 1.796 ± 1.438
1.198CysGln: 1.198 ± 1.16
1.198CysArg: 1.198 ± 1.272
2.994CysSer: 2.994 ± 1.318
2.994CysThr: 2.994 ± 1.584
1.796CysVal: 1.796 ± 0.834
1.796CysTrp: 1.796 ± 0.731
0.599CysTyr: 0.599 ± 0.378
0.0CysXaa: 0.0 ± 0.0
Asp
1.796AspAla: 1.796 ± 1.135
0.0AspCys: 0.0 ± 0.0
1.198AspAsp: 1.198 ± 0.756
1.198AspGlu: 1.198 ± 1.576
1.796AspPhe: 1.796 ± 0.834
0.599AspGly: 0.599 ± 0.641
0.0AspHis: 0.0 ± 0.0
1.198AspIle: 1.198 ± 1.012
1.796AspLys: 1.796 ± 0.63
6.587AspLeu: 6.587 ± 2.089
0.599AspMet: 0.599 ± 0.641
1.198AspAsn: 1.198 ± 0.756
3.593AspPro: 3.593 ± 2.88
1.796AspGln: 1.796 ± 1.579
0.599AspArg: 0.599 ± 0.378
1.796AspSer: 1.796 ± 1.052
1.796AspThr: 1.796 ± 1.798
1.796AspVal: 1.796 ± 1.19
2.395AspTrp: 2.395 ± 1.721
0.599AspTyr: 0.599 ± 0.378
0.0AspXaa: 0.0 ± 0.0
Glu
0.599GluAla: 0.599 ± 0.378
0.599GluCys: 0.599 ± 0.641
1.198GluAsp: 1.198 ± 0.756
3.593GluGlu: 3.593 ± 1.785
1.796GluPhe: 1.796 ± 1.632
0.599GluGly: 0.599 ± 1.25
3.593GluHis: 3.593 ± 1.668
0.599GluIle: 0.599 ± 0.641
1.796GluLys: 1.796 ± 1.135
5.389GluLeu: 5.389 ± 1.272
0.599GluMet: 0.599 ± 0.641
0.0GluAsn: 0.0 ± 0.0
1.198GluPro: 1.198 ± 0.756
0.599GluGln: 0.599 ± 0.894
1.796GluArg: 1.796 ± 0.63
1.198GluSer: 1.198 ± 1.16
0.599GluThr: 0.599 ± 0.641
0.599GluVal: 0.599 ± 0.378
0.599GluTrp: 0.599 ± 0.641
0.0GluTyr: 0.0 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
4.79PheAla: 4.79 ± 1.857
0.599PheCys: 0.599 ± 0.378
0.0PheAsp: 0.0 ± 0.0
0.599PheGlu: 0.599 ± 0.641
2.395PhePhe: 2.395 ± 0.577
2.395PheGly: 2.395 ± 2.514
1.796PheHis: 1.796 ± 1.798
1.796PheIle: 1.796 ± 1.095
1.198PheLys: 1.198 ± 0.756
5.389PheLeu: 5.389 ± 2.004
0.599PheMet: 0.599 ± 0.378
1.796PheAsn: 1.796 ± 0.63
6.587PhePro: 6.587 ± 1.031
1.198PheGln: 1.198 ± 1.282
2.994PheArg: 2.994 ± 1.235
4.79PheSer: 4.79 ± 1.688
1.796PheThr: 1.796 ± 0.63
2.994PheVal: 2.994 ± 2.304
0.599PheTrp: 0.599 ± 0.641
1.198PheTyr: 1.198 ± 0.756
0.0PheXaa: 0.0 ± 0.0
Gly
1.796GlyAla: 1.796 ± 0.834
0.599GlyCys: 0.599 ± 1.25
1.796GlyAsp: 1.796 ± 1.095
1.198GlyGlu: 1.198 ± 0.756
2.395GlyPhe: 2.395 ± 1.721
2.994GlyGly: 2.994 ± 1.379
1.796GlyHis: 1.796 ± 1.135
5.389GlyIle: 5.389 ± 2.069
1.796GlyLys: 1.796 ± 1.135
6.587GlyLeu: 6.587 ± 2.165
1.198GlyMet: 1.198 ± 1.576
2.395GlyAsn: 2.395 ± 1.721
4.192GlyPro: 4.192 ± 1.96
2.395GlyGln: 2.395 ± 0.577
2.994GlyArg: 2.994 ± 0.971
4.192GlySer: 4.192 ± 2.314
2.994GlyThr: 2.994 ± 1.235
3.593GlyVal: 3.593 ± 1.639
1.198GlyTrp: 1.198 ± 0.777
1.198GlyTyr: 1.198 ± 0.756
0.0GlyXaa: 0.0 ± 0.0
His
1.198HisAla: 1.198 ± 1.576
1.198HisCys: 1.198 ± 0.777
0.0HisAsp: 0.0 ± 0.0
0.0HisGlu: 0.0 ± 0.0
1.198HisPhe: 1.198 ± 0.756
0.0HisGly: 0.0 ± 0.0
1.198HisHis: 1.198 ± 0.777
1.796HisIle: 1.796 ± 1.135
2.395HisLys: 2.395 ± 1.331
9.581HisLeu: 9.581 ± 2.443
0.0HisMet: 0.0 ± 0.0
3.593HisAsn: 3.593 ± 1.588
1.796HisPro: 1.796 ± 0.63
0.0HisGln: 0.0 ± 0.0
1.198HisArg: 1.198 ± 0.756
0.0HisSer: 0.0 ± 0.0
2.994HisThr: 2.994 ± 2.395
1.198HisVal: 1.198 ± 0.756
1.198HisTrp: 1.198 ± 0.756
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
4.79IleAla: 4.79 ± 1.163
1.198IleCys: 1.198 ± 0.756
2.395IleAsp: 2.395 ± 1.226
0.0IleGlu: 0.0 ± 0.0
1.796IlePhe: 1.796 ± 1.579
0.599IleGly: 0.599 ± 0.378
1.198IleHis: 1.198 ± 0.756
1.198IleIle: 1.198 ± 1.012
4.192IleLys: 4.192 ± 1.512
4.79IleLeu: 4.79 ± 1.163
1.198IleMet: 1.198 ± 0.449
1.796IleAsn: 1.796 ± 0.63
4.79IlePro: 4.79 ± 2.675
1.796IleGln: 1.796 ± 1.135
2.994IleArg: 2.994 ± 2.935
2.994IleSer: 2.994 ± 1.082
2.395IleThr: 2.395 ± 1.036
1.796IleVal: 1.796 ± 0.731
1.796IleTrp: 1.796 ± 1.923
1.796IleTyr: 1.796 ± 1.095
0.0IleXaa: 0.0 ± 0.0
Lys
0.599LysAla: 0.599 ± 0.378
0.599LysCys: 0.599 ± 1.25
0.599LysAsp: 0.599 ± 1.25
0.599LysGlu: 0.599 ± 0.894
0.599LysPhe: 0.599 ± 0.378
1.796LysGly: 1.796 ± 0.63
1.198LysHis: 1.198 ± 0.51
1.796LysIle: 1.796 ± 1.095
1.198LysLys: 1.198 ± 0.756
5.389LysLeu: 5.389 ± 1.603
0.0LysMet: 0.0 ± 0.0
2.395LysAsn: 2.395 ± 0.905
3.593LysPro: 3.593 ± 2.19
1.198LysGln: 1.198 ± 0.51
1.796LysArg: 1.796 ± 1.135
3.593LysSer: 3.593 ± 2.269
4.192LysThr: 4.192 ± 1.949
1.198LysVal: 1.198 ± 1.012
1.198LysTrp: 1.198 ± 1.16
0.599LysTyr: 0.599 ± 0.378
0.0LysXaa: 0.0 ± 0.0
Leu
5.988LeuAla: 5.988 ± 2.471
4.192LeuCys: 4.192 ± 1.41
5.389LeuAsp: 5.389 ± 0.73
1.796LeuGlu: 1.796 ± 1.135
2.395LeuPhe: 2.395 ± 0.577
7.784LeuGly: 7.784 ± 2.556
3.593LeuHis: 3.593 ± 2.269
7.186LeuIle: 7.186 ± 3.513
1.198LeuLys: 1.198 ± 1.012
17.964LeuLeu: 17.964 ± 3.7
0.0LeuMet: 0.0 ± 0.0
5.988LeuAsn: 5.988 ± 1.86
10.18LeuPro: 10.18 ± 2.291
6.587LeuGln: 6.587 ± 2.317
5.988LeuArg: 5.988 ± 1.842
9.581LeuSer: 9.581 ± 1.222
7.186LeuThr: 7.186 ± 1.543
9.581LeuVal: 9.581 ± 3.038
5.988LeuTrp: 5.988 ± 0.809
3.593LeuTyr: 3.593 ± 1.639
0.0LeuXaa: 0.0 ± 0.0
Met
0.599MetAla: 0.599 ± 1.25
0.0MetCys: 0.0 ± 0.0
1.796MetAsp: 1.796 ± 1.389
0.0MetGlu: 0.0 ± 0.0
0.0MetPhe: 0.0 ± 0.0
2.395MetGly: 2.395 ± 0.905
1.796MetHis: 1.796 ± 0.834
0.599MetIle: 0.599 ± 0.641
0.599MetLys: 0.599 ± 0.641
0.0MetLeu: 0.0 ± 0.0
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
0.599MetPro: 0.599 ± 1.25
0.0MetGln: 0.0 ± 0.0
0.599MetArg: 0.599 ± 1.25
1.198MetSer: 1.198 ± 1.012
0.0MetThr: 0.0 ± 0.0
0.599MetVal: 0.599 ± 0.378
0.0MetTrp: 0.0 ± 0.0
1.796MetTyr: 1.796 ± 1.438
0.0MetXaa: 0.0 ± 0.0
Asn
2.395AsnAla: 2.395 ± 1.552
2.395AsnCys: 2.395 ± 1.135
0.599AsnAsp: 0.599 ± 0.894
1.198AsnGlu: 1.198 ± 0.756
3.593AsnPhe: 3.593 ± 1.408
1.796AsnGly: 1.796 ± 0.63
1.796AsnHis: 1.796 ± 1.135
2.395AsnIle: 2.395 ± 0.577
1.198AsnLys: 1.198 ± 0.756
3.593AsnLeu: 3.593 ± 1.259
0.0AsnMet: 0.0 ± 0.0
3.593AsnAsn: 3.593 ± 1.588
2.994AsnPro: 2.994 ± 1.082
3.593AsnGln: 3.593 ± 1.259
2.994AsnArg: 2.994 ± 1.76
5.988AsnSer: 5.988 ± 3.059
1.198AsnThr: 1.198 ± 0.51
0.599AsnVal: 0.599 ± 0.378
0.599AsnTrp: 0.599 ± 0.378
3.593AsnTyr: 3.593 ± 0.887
0.0AsnXaa: 0.0 ± 0.0
Pro
5.988ProAla: 5.988 ± 2.207
2.395ProCys: 2.395 ± 1.289
1.198ProAsp: 1.198 ± 1.012
3.593ProGlu: 3.593 ± 0.763
4.79ProPhe: 4.79 ± 1.88
3.593ProGly: 3.593 ± 2.19
2.994ProHis: 2.994 ± 0.646
2.994ProIle: 2.994 ± 0.89
2.994ProLys: 2.994 ± 1.082
8.982ProLeu: 8.982 ± 1.373
1.796ProMet: 1.796 ± 0.949
2.395ProAsn: 2.395 ± 1.036
9.581ProPro: 9.581 ± 4.215
1.796ProGln: 1.796 ± 1.095
5.988ProArg: 5.988 ± 2.738
5.988ProSer: 5.988 ± 2.2
8.383ProThr: 8.383 ± 3.814
5.988ProVal: 5.988 ± 2.132
1.198ProTrp: 1.198 ± 0.756
2.994ProTyr: 2.994 ± 1.871
0.0ProXaa: 0.0 ± 0.0
Gln
1.796GlnAla: 1.796 ± 1.632
1.198GlnCys: 1.198 ± 0.51
1.198GlnAsp: 1.198 ± 0.51
1.796GlnGlu: 1.796 ± 0.731
2.395GlnPhe: 2.395 ± 0.905
2.395GlnGly: 2.395 ± 1.721
2.395GlnHis: 2.395 ± 0.577
1.198GlnIle: 1.198 ± 0.51
0.599GlnLys: 0.599 ± 0.641
4.79GlnLeu: 4.79 ± 3.442
0.0GlnMet: 0.0 ± 0.0
3.593GlnAsn: 3.593 ± 1.531
2.395GlnPro: 2.395 ± 1.021
2.395GlnGln: 2.395 ± 0.905
0.599GlnArg: 0.599 ± 0.378
4.79GlnSer: 4.79 ± 2.363
5.988GlnThr: 5.988 ± 2.552
1.198GlnVal: 1.198 ± 1.012
1.796GlnTrp: 1.796 ± 0.63
0.599GlnTyr: 0.599 ± 0.378
0.0GlnXaa: 0.0 ± 0.0
Arg
1.198ArgAla: 1.198 ± 0.777
0.599ArgCys: 0.599 ± 0.378
4.79ArgAsp: 4.79 ± 3.304
1.198ArgGlu: 1.198 ± 0.777
2.994ArgPhe: 2.994 ± 1.082
4.192ArgGly: 4.192 ± 1.201
1.198ArgHis: 1.198 ± 1.16
2.395ArgIle: 2.395 ± 1.513
2.994ArgLys: 2.994 ± 1.235
6.587ArgLeu: 6.587 ± 4.684
0.0ArgMet: 0.0 ± 0.0
2.395ArgAsn: 2.395 ± 1.513
3.593ArgPro: 3.593 ± 2.326
4.79ArgGln: 4.79 ± 1.269
8.383ArgArg: 8.383 ± 8.29
3.593ArgSer: 3.593 ± 2.332
4.79ArgThr: 4.79 ± 2.433
1.796ArgVal: 1.796 ± 1.135
0.599ArgTrp: 0.599 ± 0.641
1.796ArgTyr: 1.796 ± 0.63
0.0ArgXaa: 0.0 ± 0.0
Ser
5.389SerAla: 5.389 ± 4.633
1.796SerCys: 1.796 ± 0.731
1.796SerAsp: 1.796 ± 2.382
2.395SerGlu: 2.395 ± 1.036
2.994SerPhe: 2.994 ± 0.646
4.192SerGly: 4.192 ± 2.496
1.198SerHis: 1.198 ± 0.756
2.395SerIle: 2.395 ± 2.254
2.994SerLys: 2.994 ± 1.296
7.186SerLeu: 7.186 ± 0.902
0.599SerMet: 0.599 ± 0.378
4.192SerAsn: 4.192 ± 1.98
11.976SerPro: 11.976 ± 2.377
4.192SerGln: 4.192 ± 1.41
6.587SerArg: 6.587 ± 3.424
13.174SerSer: 13.174 ± 3.719
5.389SerThr: 5.389 ± 0.446
5.389SerVal: 5.389 ± 1.889
3.593SerTrp: 3.593 ± 0.971
1.198SerTyr: 1.198 ± 0.777
0.0SerXaa: 0.0 ± 0.0
Thr
5.389ThrAla: 5.389 ± 2.021
3.593ThrCys: 3.593 ± 1.512
1.796ThrAsp: 1.796 ± 1.135
1.198ThrGlu: 1.198 ± 1.282
3.593ThrPhe: 3.593 ± 1.44
5.988ThrGly: 5.988 ± 1.614
1.796ThrHis: 1.796 ± 0.63
4.79ThrIle: 4.79 ± 1.547
2.994ThrLys: 2.994 ± 0.971
2.994ThrLeu: 2.994 ± 1.318
2.395ThrMet: 2.395 ± 1.226
1.198ThrAsn: 1.198 ± 0.51
5.988ThrPro: 5.988 ± 1.822
1.198ThrGln: 1.198 ± 0.51
2.994ThrArg: 2.994 ± 1.554
7.186ThrSer: 7.186 ± 1.83
8.383ThrThr: 8.383 ± 4.772
4.79ThrVal: 4.79 ± 2.821
2.395ThrTrp: 2.395 ± 0.577
2.395ThrTyr: 2.395 ± 1.021
0.0ThrXaa: 0.0 ± 0.0
Val
2.994ValAla: 2.994 ± 1.891
1.796ValCys: 1.796 ± 0.731
2.994ValAsp: 2.994 ± 0.89
0.0ValGlu: 0.0 ± 0.0
4.192ValPhe: 4.192 ± 1.201
1.796ValGly: 1.796 ± 0.63
1.198ValHis: 1.198 ± 0.756
1.198ValIle: 1.198 ± 0.777
0.599ValLys: 0.599 ± 0.378
5.988ValLeu: 5.988 ± 2.167
0.0ValMet: 0.0 ± 0.0
4.79ValAsn: 4.79 ± 1.804
5.389ValPro: 5.389 ± 1.694
3.593ValGln: 3.593 ± 1.27
3.593ValArg: 3.593 ± 1.668
5.389ValSer: 5.389 ± 0.446
2.994ValThr: 2.994 ± 0.971
5.389ValVal: 5.389 ± 1.403
0.599ValTrp: 0.599 ± 0.894
1.796ValTyr: 1.796 ± 1.095
0.0ValXaa: 0.0 ± 0.0
Trp
2.395TrpAla: 2.395 ± 1.021
0.0TrpCys: 0.0 ± 0.0
0.599TrpAsp: 0.599 ± 0.894
2.395TrpGlu: 2.395 ± 0.94
2.395TrpPhe: 2.395 ± 1.135
3.593TrpGly: 3.593 ± 1.27
0.599TrpHis: 0.599 ± 1.25
2.395TrpIle: 2.395 ± 1.036
0.599TrpLys: 0.599 ± 0.641
3.593TrpLeu: 3.593 ± 0.763
1.796TrpMet: 1.796 ± 1.438
1.198TrpAsn: 1.198 ± 0.51
2.395TrpPro: 2.395 ± 1.021
0.599TrpGln: 0.599 ± 0.641
0.599TrpArg: 0.599 ± 0.378
0.599TrpSer: 0.599 ± 0.378
2.994TrpThr: 2.994 ± 1.084
1.198TrpVal: 1.198 ± 0.51
2.395TrpTrp: 2.395 ± 1.721
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.599TyrAla: 0.599 ± 0.378
1.198TyrCys: 1.198 ± 0.51
0.599TyrAsp: 0.599 ± 0.378
1.796TyrGlu: 1.796 ± 0.834
1.796TyrPhe: 1.796 ± 0.63
1.198TyrGly: 1.198 ± 0.51
0.599TyrHis: 0.599 ± 0.378
1.198TyrIle: 1.198 ± 1.272
2.395TyrLys: 2.395 ± 0.577
4.192TyrLeu: 4.192 ± 1.253
0.599TyrMet: 0.599 ± 0.378
0.0TyrAsn: 0.0 ± 0.0
1.198TyrPro: 1.198 ± 0.51
1.198TyrGln: 1.198 ± 1.012
1.198TyrArg: 1.198 ± 0.777
2.994TyrSer: 2.994 ± 1.891
1.796TyrThr: 1.796 ± 1.095
2.395TyrVal: 2.395 ± 1.036
0.0TyrTrp: 0.0 ± 0.0
0.599TyrTyr: 0.599 ± 0.641
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1671 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski