Amino acid dipeptide frequency for Wenzhou narna-like virus 5

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
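The counting described above can be sketched in a few lines of Python. This is only an illustration, not the code used by Proteome-pI: the function names are hypothetical, and it assumes that dipeptides are counted as overlapping windows within each protein, that pairs never span two proteins, and that any residue outside the 20 standard one-letter codes is mapped to X (Xaa). The exact threshold used for the red and blue marking is not stated in the text, so the sketch simply compares each value with the uniform expectation.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # 20 standard one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over adjacent residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the standard alphabet becomes 'X' (Xaa).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Assumption: overlapping windows, counted separately for each protein.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def uniform_expectation_per_mille(alphabet_size=20):
    """Frequency of each dipeptide if all pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2


def representation(value, expected):
    """Label a value relative to the expectation (simple comparison, no margin)."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"


if __name__ == "__main__":
    freqs = dipeptide_per_mille(["AAC", "ACB"])  # toy sequences, not real data
    expected = uniform_expectation_per_mille()
    for pair, value in sorted(freqs.items()):
        print(pair, value, representation(value, expected))
```

With 20 amino acids the expectation is 1000 / 400 = 2.5‰, which is the 0.25% quoted above; passing 21 as the alphabet size gives the expectation for the 441-dipeptide case.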

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
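As a small worked example of the unit conversion, the following sketch turns per mille values into fractions and percentages. The helper names are hypothetical; the sample value 14.68 is the SerLeu entry from the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_per_mille / 10.0


if __name__ == "__main__":
    ser_leu = 14.68  # SerLeu value from the table, in per mille
    print(per_mille_to_fraction(ser_leu))
    print(per_mille_to_percent(ser_leu))
    # The uniform expectation of 2.5 per mille is the 0.25% quoted in the text.
    print(per_mille_to_percent(2.5))
```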

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.454 ± 1.108
AlaCys: 0.864 ± 0.5
AlaAsp: 0.864 ± 0.5
AlaGlu: 2.591 ± 1.499
AlaPhe: 1.727 ± 1.0
AlaGly: 2.591 ± 3.162
AlaHis: 0.0 ± 0.0
AlaIle: 1.727 ± 1.0
AlaLys: 4.318 ± 0.609
AlaLeu: 7.772 ± 2.945
AlaMet: 0.864 ± 1.054
AlaAsn: 1.727 ± 2.108
AlaPro: 0.864 ± 1.054
AlaGln: 0.0 ± 0.0
AlaArg: 1.727 ± 0.554
AlaSer: 6.908 ± 0.663
AlaThr: 6.045 ± 2.716
AlaVal: 2.591 ± 0.054
AlaTrp: 0.0 ± 0.0
AlaTyr: 1.727 ± 0.554
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.864 ± 0.5
CysCys: 0.864 ± 0.5
CysAsp: 0.0 ± 0.0
CysGlu: 0.864 ± 0.5
CysPhe: 0.864 ± 1.054
CysGly: 1.727 ± 1.0
CysHis: 0.864 ± 0.5
CysIle: 1.727 ± 1.0
CysLys: 0.864 ± 0.5
CysLeu: 4.318 ± 0.609
CysMet: 0.864 ± 0.5
CysAsn: 0.0 ± 0.0
CysPro: 2.591 ± 1.499
CysGln: 0.864 ± 0.5
CysArg: 3.454 ± 1.999
CysSer: 2.591 ± 1.499
CysThr: 1.727 ± 1.0
CysVal: 1.727 ± 0.554
CysTrp: 1.727 ± 0.554
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.454 ± 1.108
AspCys: 1.727 ± 1.0
AspAsp: 4.318 ± 2.499
AspGlu: 0.864 ± 0.5
AspPhe: 0.864 ± 0.5
AspGly: 2.591 ± 0.054
AspHis: 3.454 ± 1.999
AspIle: 2.591 ± 0.054
AspLys: 0.864 ± 0.5
AspLeu: 5.181 ± 0.109
AspMet: 0.864 ± 0.5
AspAsn: 1.727 ± 1.0
AspPro: 1.727 ± 0.554
AspGln: 0.864 ± 0.5
AspArg: 0.864 ± 0.5
AspSer: 3.454 ± 1.999
AspThr: 0.864 ± 0.5
AspVal: 2.591 ± 1.608
AspTrp: 0.0 ± 0.0
AspTyr: 2.591 ± 0.054
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.591 ± 0.054
GluCys: 1.727 ± 0.554
GluAsp: 0.864 ± 0.5
GluGlu: 1.727 ± 1.0
GluPhe: 6.045 ± 1.945
GluGly: 0.0 ± 0.0
GluHis: 0.0 ± 0.0
GluIle: 4.318 ± 0.945
GluLys: 5.181 ± 2.999
GluLeu: 4.318 ± 0.945
GluMet: 2.591 ± 0.054
GluAsn: 0.0 ± 0.0
GluPro: 0.864 ± 0.5
GluGln: 3.454 ± 1.108
GluArg: 3.454 ± 1.999
GluSer: 5.181 ± 1.445
GluThr: 0.864 ± 0.5
GluVal: 3.454 ± 0.445
GluTrp: 0.864 ± 0.5
GluTyr: 1.727 ± 0.554
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.864 ± 0.5
PheCys: 0.864 ± 0.5
PheAsp: 2.591 ± 1.499
PheGlu: 3.454 ± 0.445
PhePhe: 0.0 ± 0.0
PheGly: 6.908 ± 0.663
PheHis: 1.727 ± 0.554
PheIle: 1.727 ± 1.0
PheLys: 6.908 ± 2.445
PheLeu: 5.181 ± 1.445
PheMet: 0.0 ± 0.0
PheAsn: 0.0 ± 0.0
PhePro: 4.318 ± 0.609
PheGln: 1.727 ± 0.554
PheArg: 3.454 ± 0.445
PheSer: 6.045 ± 0.391
PheThr: 5.181 ± 3.216
PheVal: 6.045 ± 0.391
PheTrp: 0.864 ± 0.5
PheTyr: 1.727 ± 1.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.181 ± 1.663
GlyCys: 0.864 ± 0.5
GlyAsp: 1.727 ± 1.0
GlyGlu: 0.864 ± 0.5
GlyPhe: 4.318 ± 0.945
GlyGly: 2.591 ± 1.608
GlyHis: 0.864 ± 0.5
GlyIle: 0.864 ± 1.054
GlyLys: 5.181 ± 1.445
GlyLeu: 2.591 ± 1.608
GlyMet: 0.864 ± 1.054
GlyAsn: 4.318 ± 3.716
GlyPro: 1.727 ± 0.554
GlyGln: 3.454 ± 1.108
GlyArg: 0.0 ± 0.0
GlySer: 6.045 ± 0.391
GlyThr: 1.727 ± 2.108
GlyVal: 5.181 ± 3.216
GlyTrp: 0.864 ± 0.5
GlyTyr: 1.727 ± 1.0
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.864 ± 0.5
HisCys: 0.864 ± 0.5
HisAsp: 0.0 ± 0.0
HisGlu: 0.864 ± 0.5
HisPhe: 1.727 ± 1.0
HisGly: 0.864 ± 0.5
HisHis: 0.0 ± 0.0
HisIle: 1.727 ± 0.554
HisLys: 2.591 ± 0.054
HisLeu: 1.727 ± 1.0
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 2.591 ± 0.054
HisGln: 0.864 ± 0.5
HisArg: 0.864 ± 0.5
HisSer: 0.864 ± 0.5
HisThr: 1.727 ± 0.554
HisVal: 0.0 ± 0.0
HisTrp: 0.0 ± 0.0
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.727 ± 0.554
IleCys: 0.864 ± 0.5
IleAsp: 2.591 ± 1.499
IleGlu: 1.727 ± 1.0
IlePhe: 2.591 ± 1.499
IleGly: 0.864 ± 1.054
IleHis: 0.864 ± 0.5
IleIle: 1.727 ± 0.554
IleLys: 5.181 ± 1.445
IleLeu: 5.181 ± 0.109
IleMet: 0.0 ± 0.0
IleAsn: 6.045 ± 1.163
IlePro: 3.454 ± 1.108
IleGln: 0.864 ± 0.5
IleArg: 1.727 ± 0.554
IleSer: 6.908 ± 0.891
IleThr: 2.591 ± 0.054
IleVal: 0.864 ± 1.054
IleTrp: 0.0 ± 0.0
IleTyr: 0.864 ± 0.5
IleXaa: 0.0 ± 0.0
Lys
LysAla: 0.0 ± 0.0
LysCys: 2.591 ± 1.499
LysAsp: 1.727 ± 1.0
LysGlu: 6.045 ± 0.391
LysPhe: 5.181 ± 1.445
LysGly: 6.045 ± 1.945
LysHis: 1.727 ± 0.554
LysIle: 1.727 ± 1.0
LysLys: 3.454 ± 0.445
LysLeu: 5.181 ± 0.109
LysMet: 2.591 ± 0.054
LysAsn: 4.318 ± 0.945
LysPro: 1.727 ± 0.554
LysGln: 4.318 ± 0.945
LysArg: 6.908 ± 3.999
LysSer: 7.772 ± 0.163
LysThr: 4.318 ± 0.945
LysVal: 5.181 ± 1.445
LysTrp: 0.0 ± 0.0
LysTyr: 5.181 ± 2.999
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 2.591 ± 1.499
LeuCys: 1.727 ± 1.0
LeuAsp: 4.318 ± 2.499
LeuGlu: 4.318 ± 2.499
LeuPhe: 2.591 ± 0.054
LeuGly: 2.591 ± 0.054
LeuHis: 0.0 ± 0.0
LeuIle: 4.318 ± 2.162
LeuLys: 12.09 ± 3.89
LeuLeu: 4.318 ± 0.609
LeuMet: 3.454 ± 0.445
LeuAsn: 6.045 ± 3.499
LeuPro: 6.908 ± 0.891
LeuGln: 6.908 ± 2.217
LeuArg: 4.318 ± 2.499
LeuSer: 12.09 ± 0.782
LeuThr: 6.045 ± 4.27
LeuVal: 8.636 ± 1.217
LeuTrp: 3.454 ± 1.108
LeuTyr: 3.454 ± 0.445
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.864 ± 1.054
MetCys: 1.727 ± 1.0
MetAsp: 0.864 ± 1.054
MetGlu: 0.864 ± 0.5
MetPhe: 3.454 ± 1.108
MetGly: 0.864 ± 0.5
MetHis: 0.0 ± 0.0
MetIle: 3.454 ± 1.108
MetLys: 0.864 ± 1.054
MetLeu: 4.318 ± 0.945
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 0.864 ± 0.5
MetGln: 0.0 ± 0.0
MetArg: 0.864 ± 0.5
MetSer: 2.591 ± 1.499
MetThr: 0.0 ± 0.0
MetVal: 0.864 ± 0.5
MetTrp: 0.0 ± 0.0
MetTyr: 1.727 ± 0.554
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.727 ± 0.554
AsnCys: 1.727 ± 0.554
AsnAsp: 1.727 ± 1.0
AsnGlu: 5.181 ± 1.445
AsnPhe: 1.727 ± 0.554
AsnGly: 2.591 ± 1.608
AsnHis: 0.864 ± 0.5
AsnIle: 0.0 ± 0.0
AsnLys: 1.727 ± 1.0
AsnLeu: 4.318 ± 0.945
AsnMet: 0.0 ± 0.0
AsnAsn: 2.591 ± 0.054
AsnPro: 3.454 ± 2.662
AsnGln: 0.864 ± 0.5
AsnArg: 0.0 ± 0.0
AsnSer: 6.908 ± 0.663
AsnThr: 2.591 ± 1.608
AsnVal: 4.318 ± 2.162
AsnTrp: 0.864 ± 0.5
AsnTyr: 0.0 ± 0.0
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.864 ± 1.054
ProCys: 0.864 ± 0.5
ProAsp: 0.864 ± 0.5
ProGlu: 4.318 ± 0.609
ProPhe: 1.727 ± 0.554
ProGly: 0.864 ± 1.054
ProHis: 0.0 ± 0.0
ProIle: 2.591 ± 0.054
ProLys: 4.318 ± 2.162
ProLeu: 7.772 ± 1.391
ProMet: 1.727 ± 0.833
ProAsn: 2.591 ± 1.499
ProPro: 2.591 ± 0.054
ProGln: 2.591 ± 1.608
ProArg: 4.318 ± 0.609
ProSer: 5.181 ± 4.77
ProThr: 4.318 ± 0.945
ProVal: 0.864 ± 1.054
ProTrp: 1.727 ± 1.0
ProTyr: 0.864 ± 1.054
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.454 ± 1.108
GlnCys: 0.864 ± 0.5
GlnAsp: 2.591 ± 0.054
GlnGlu: 1.727 ± 1.0
GlnPhe: 0.864 ± 1.054
GlnGly: 2.591 ± 1.499
GlnHis: 0.0 ± 0.0
GlnIle: 2.591 ± 0.054
GlnLys: 3.454 ± 0.445
GlnLeu: 2.591 ± 0.054
GlnMet: 2.591 ± 0.054
GlnAsn: 1.727 ± 2.108
GlnPro: 1.727 ± 0.554
GlnGln: 3.454 ± 2.662
GlnArg: 2.591 ± 1.499
GlnSer: 3.454 ± 1.108
GlnThr: 3.454 ± 1.108
GlnVal: 0.0 ± 0.0
GlnTrp: 0.0 ± 0.0
GlnTyr: 2.591 ± 0.054
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.318 ± 2.499
ArgCys: 1.727 ± 1.0
ArgAsp: 0.864 ± 0.5
ArgGlu: 2.591 ± 0.054
ArgPhe: 6.045 ± 0.391
ArgGly: 0.0 ± 0.0
ArgHis: 0.0 ± 0.0
ArgIle: 4.318 ± 2.499
ArgLys: 3.454 ± 0.445
ArgLeu: 2.591 ± 1.499
ArgMet: 2.591 ± 0.054
ArgAsn: 1.727 ± 0.554
ArgPro: 1.727 ± 0.554
ArgGln: 1.727 ± 1.0
ArgArg: 4.318 ± 2.162
ArgSer: 6.045 ± 1.163
ArgThr: 3.454 ± 0.445
ArgVal: 0.864 ± 0.5
ArgTrp: 1.727 ± 1.0
ArgTyr: 0.0 ± 0.0
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.591 ± 0.054
SerCys: 1.727 ± 0.554
SerAsp: 4.318 ± 0.609
SerGlu: 4.318 ± 2.499
SerPhe: 10.363 ± 1.336
SerGly: 6.908 ± 0.891
SerHis: 2.591 ± 0.054
SerIle: 2.591 ± 0.054
SerLys: 8.636 ± 3.444
SerLeu: 14.68 ± 2.282
SerMet: 0.864 ± 0.5
SerAsn: 2.591 ± 0.054
SerPro: 8.636 ± 4.325
SerGln: 6.045 ± 1.945
SerArg: 4.318 ± 0.609
SerSer: 7.772 ± 1.391
SerThr: 6.908 ± 5.324
SerVal: 7.772 ± 1.717
SerTrp: 0.0 ± 0.0
SerTyr: 3.454 ± 1.108
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.181 ± 1.663
ThrCys: 0.864 ± 0.5
ThrAsp: 1.727 ± 0.554
ThrGlu: 3.454 ± 2.662
ThrPhe: 2.591 ± 3.162
ThrGly: 6.908 ± 5.324
ThrHis: 2.591 ± 0.054
ThrIle: 5.181 ± 1.445
ThrLys: 2.591 ± 1.499
ThrLeu: 6.045 ± 2.716
ThrMet: 2.591 ± 1.499
ThrAsn: 3.454 ± 2.662
ThrPro: 0.0 ± 0.0
ThrGln: 1.727 ± 0.554
ThrArg: 2.591 ± 0.054
ThrSer: 3.454 ± 1.108
ThrThr: 5.181 ± 4.77
ThrVal: 2.591 ± 1.608
ThrTrp: 0.0 ± 0.0
ThrTyr: 2.591 ± 3.162
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.045 ± 1.163
ValCys: 3.454 ± 1.108
ValAsp: 5.181 ± 0.109
ValGlu: 1.727 ± 0.554
ValPhe: 4.318 ± 2.499
ValGly: 2.591 ± 1.608
ValHis: 2.591 ± 1.499
ValIle: 2.591 ± 0.054
ValLys: 2.591 ± 0.054
ValLeu: 4.318 ± 2.162
ValMet: 0.0 ± 0.609
ValAsn: 2.591 ± 1.499
ValPro: 2.591 ± 1.608
ValGln: 2.591 ± 1.608
ValArg: 0.0 ± 0.0
ValSer: 8.636 ± 1.217
ValThr: 2.591 ± 3.162
ValVal: 0.0 ± 0.0
ValTrp: 0.0 ± 0.0
ValTyr: 1.727 ± 2.108
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.864 ± 0.5
TrpCys: 0.864 ± 0.5
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.864 ± 0.5
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.864 ± 0.5
TrpLeu: 3.454 ± 1.999
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 1.727 ± 1.0
TrpGln: 0.0 ± 0.0
TrpArg: 0.864 ± 1.054
TrpSer: 0.864 ± 0.5
TrpThr: 0.864 ± 1.054
TrpVal: 1.727 ± 0.554
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.864 ± 1.054
TyrCys: 1.727 ± 1.0
TyrAsp: 4.318 ± 2.162
TyrGlu: 1.727 ± 1.0
TyrPhe: 1.727 ± 0.554
TyrGly: 0.864 ± 1.054
TyrHis: 0.0 ± 0.0
TyrIle: 0.864 ± 1.054
TyrLys: 0.864 ± 0.5
TyrLeu: 4.318 ± 0.945
TyrMet: 0.864 ± 0.5
TyrAsn: 1.727 ± 0.554
TyrPro: 0.864 ± 0.5
TyrGln: 0.0 ± 0.0
TyrArg: 3.454 ± 1.108
TyrSer: 4.318 ± 0.609
TyrThr: 0.864 ± 0.5
TyrVal: 1.727 ± 0.554
TyrTrp: 0.864 ± 0.5
TyrTyr: 0.864 ± 0.5
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1159 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
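One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the 100 estimates is reported as the error. This is an assumption about the procedure, not a description of the Proteome-pI implementation; the function names, the fixed seed, and the use of the sample standard deviation are all illustrative choices.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over overlapping windows."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(rounds):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy_proteome = ["AAAA", "CCCC"]  # toy sequences, not real data
    print(bootstrap_error(toy_proteome, "AA"))
```

With only 2 proteins, as in this proteome, a resample can only be one of a handful of combinations, so error estimates obtained this way are coarse.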

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski