Amino acid dipepetide frequency for Wenzhou picorna-like virus 7

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.73AlaAla: 8.73 ± 0.536
1.984AlaCys: 1.984 ± 0.475
0.794AlaAsp: 0.794 ± 0.429
3.968AlaGlu: 3.968 ± 0.246
1.984AlaPhe: 1.984 ± 0.123
5.556AlaGly: 5.556 ± 0.613
0.0AlaHis: 0.0 ± 0.0
3.571AlaIle: 3.571 ± 0.46
4.762AlaLys: 4.762 ± 0.781
5.952AlaLeu: 5.952 ± 0.368
2.381AlaMet: 2.381 ± 0.314
3.571AlaAsn: 3.571 ± 0.138
3.175AlaPro: 3.175 ± 0.077
1.19AlaGln: 1.19 ± 0.046
3.175AlaArg: 3.175 ± 0.521
5.556AlaSer: 5.556 ± 0.583
2.778AlaThr: 2.778 ± 0.291
5.159AlaVal: 5.159 ± 0.797
1.19AlaTrp: 1.19 ± 0.552
2.778AlaTyr: 2.778 ± 1.487
0.0AlaXaa: 0.0 ± 0.0
Cys
1.587CysAla: 1.587 ± 0.26
0.397CysCys: 0.397 ± 0.383
0.794CysAsp: 0.794 ± 0.429
0.0CysGlu: 0.0 ± 0.0
1.19CysPhe: 1.19 ± 0.644
1.587CysGly: 1.587 ± 0.858
0.0CysHis: 0.0 ± 0.0
0.397CysIle: 0.397 ± 0.215
1.984CysLys: 1.984 ± 1.073
1.587CysLeu: 1.587 ± 0.26
0.794CysMet: 0.794 ± 0.169
0.397CysAsn: 0.397 ± 0.383
1.19CysPro: 1.19 ± 0.644
1.19CysGln: 1.19 ± 0.644
0.794CysArg: 0.794 ± 0.169
0.397CysSer: 0.397 ± 0.383
0.397CysThr: 0.397 ± 0.215
2.381CysVal: 2.381 ± 1.287
0.397CysTrp: 0.397 ± 0.215
2.381CysTyr: 2.381 ± 1.104
0.0CysXaa: 0.0 ± 0.0
Asp
3.175AspAla: 3.175 ± 0.521
0.397AspCys: 0.397 ± 0.383
5.556AspAsp: 5.556 ± 1.779
3.175AspGlu: 3.175 ± 0.521
3.175AspPhe: 3.175 ± 0.675
1.19AspGly: 1.19 ± 0.644
1.984AspHis: 1.984 ± 0.123
4.762AspIle: 4.762 ± 1.012
6.349AspLys: 6.349 ± 2.238
5.159AspLeu: 5.159 ± 2.192
1.984AspMet: 1.984 ± 1.073
2.778AspAsn: 2.778 ± 0.291
2.778AspPro: 2.778 ± 0.889
1.587AspGln: 1.587 ± 0.26
2.381AspArg: 2.381 ± 0.69
3.175AspSer: 3.175 ± 1.87
3.571AspThr: 3.571 ± 1.058
5.556AspVal: 5.556 ± 0.015
0.397AspTrp: 0.397 ± 0.215
3.175AspTyr: 3.175 ± 0.521
0.0AspXaa: 0.0 ± 0.0
Glu
2.778GluAla: 2.778 ± 0.291
1.587GluCys: 1.587 ± 0.858
4.365GluAsp: 4.365 ± 0.629
4.762GluGlu: 4.762 ± 1.012
5.556GluPhe: 5.556 ± 1.808
2.778GluGly: 2.778 ± 0.306
1.984GluHis: 1.984 ± 0.475
1.984GluIle: 1.984 ± 0.123
4.762GluLys: 4.762 ± 2.575
8.73GluLeu: 8.73 ± 1.134
0.397GluMet: 0.397 ± 0.383
2.381GluAsn: 2.381 ± 0.092
2.381GluPro: 2.381 ± 0.69
1.587GluGln: 1.587 ± 0.26
2.778GluArg: 2.778 ± 0.306
3.571GluSer: 3.571 ± 1.058
2.381GluThr: 2.381 ± 0.506
3.968GluVal: 3.968 ± 0.246
1.19GluTrp: 1.19 ± 0.046
3.175GluTyr: 3.175 ± 0.077
0.0GluXaa: 0.0 ± 0.0
Phe
3.571PheAla: 3.571 ± 1.333
0.397PheCys: 0.397 ± 0.215
4.762PheAsp: 4.762 ± 0.184
4.762PheGlu: 4.762 ± 1.379
3.571PhePhe: 3.571 ± 1.058
3.175PheGly: 3.175 ± 1.717
0.397PheHis: 0.397 ± 0.383
0.0PheIle: 0.0 ± 0.0
2.778PheLys: 2.778 ± 0.291
5.159PheLeu: 5.159 ± 0.996
1.19PheMet: 1.19 ± 0.644
2.381PheAsn: 2.381 ± 0.506
1.587PhePro: 1.587 ± 0.935
1.984PheGln: 1.984 ± 1.318
4.762PheArg: 4.762 ± 0.414
1.984PheSer: 1.984 ± 0.123
2.778PheThr: 2.778 ± 0.291
5.159PheVal: 5.159 ± 1.395
0.0PheTrp: 0.0 ± 0.0
2.381PheTyr: 2.381 ± 0.092
0.0PheXaa: 0.0 ± 0.0
Gly
4.762GlyAla: 4.762 ± 0.781
1.19GlyCys: 1.19 ± 0.644
4.365GlyAsp: 4.365 ± 1.165
5.159GlyGlu: 5.159 ± 1.993
1.19GlyPhe: 1.19 ± 0.046
2.778GlyGly: 2.778 ± 0.306
0.0GlyHis: 0.0 ± 0.0
5.159GlyIle: 5.159 ± 0.996
5.556GlyLys: 5.556 ± 0.613
4.365GlyLeu: 4.365 ± 1.165
1.19GlyMet: 1.19 ± 0.046
3.968GlyAsn: 3.968 ± 0.246
1.19GlyPro: 1.19 ± 0.046
1.587GlyGln: 1.587 ± 0.337
1.984GlyArg: 1.984 ± 1.318
5.159GlySer: 5.159 ± 0.996
3.968GlyThr: 3.968 ± 1.441
4.365GlyVal: 4.365 ± 0.031
1.587GlyTrp: 1.587 ± 0.337
0.794GlyTyr: 0.794 ± 0.169
0.0GlyXaa: 0.0 ± 0.0
His
0.397HisAla: 0.397 ± 0.383
0.794HisCys: 0.794 ± 0.169
0.397HisAsp: 0.397 ± 0.215
1.19HisGlu: 1.19 ± 0.644
2.778HisPhe: 2.778 ± 0.889
1.19HisGly: 1.19 ± 0.644
0.397HisHis: 0.397 ± 0.383
2.381HisIle: 2.381 ± 0.092
1.19HisLys: 1.19 ± 0.552
0.794HisLeu: 0.794 ± 0.429
0.397HisMet: 0.397 ± 0.215
0.794HisAsn: 0.794 ± 0.169
1.19HisPro: 1.19 ± 0.644
0.0HisGln: 0.0 ± 0.0
0.397HisArg: 0.397 ± 0.215
1.19HisSer: 1.19 ± 0.552
1.19HisThr: 1.19 ± 0.046
1.587HisVal: 1.587 ± 0.26
0.794HisTrp: 0.794 ± 0.169
1.19HisTyr: 1.19 ± 0.046
0.0HisXaa: 0.0 ± 0.0
Ile
4.762IleAla: 4.762 ± 1.61
1.19IleCys: 1.19 ± 0.644
4.365IleAsp: 4.365 ± 0.031
2.778IleGlu: 2.778 ± 0.291
3.571IlePhe: 3.571 ± 0.736
3.571IleGly: 3.571 ± 0.138
0.794IleHis: 0.794 ± 0.429
1.19IleIle: 1.19 ± 0.046
2.381IleLys: 2.381 ± 0.69
3.968IleLeu: 3.968 ± 0.95
1.19IleMet: 1.19 ± 0.644
3.175IleAsn: 3.175 ± 0.675
1.587IlePro: 1.587 ± 0.26
1.587IleGln: 1.587 ± 0.858
2.381IleArg: 2.381 ± 0.69
4.762IleSer: 4.762 ± 1.61
3.968IleThr: 3.968 ± 0.246
3.175IleVal: 3.175 ± 0.521
0.794IleTrp: 0.794 ± 0.429
0.397IleTyr: 0.397 ± 0.215
0.0IleXaa: 0.0 ± 0.0
Lys
5.556LysAla: 5.556 ± 1.211
0.397LysCys: 0.397 ± 0.215
4.365LysAsp: 4.365 ± 1.165
3.571LysGlu: 3.571 ± 1.931
4.365LysPhe: 4.365 ± 1.165
3.175LysGly: 3.175 ± 0.077
1.587LysHis: 1.587 ± 0.26
2.778LysIle: 2.778 ± 0.904
2.381LysLys: 2.381 ± 1.287
5.952LysLeu: 5.952 ± 1.425
1.19LysMet: 1.19 ± 0.644
2.381LysAsn: 2.381 ± 0.69
3.175LysPro: 3.175 ± 0.077
2.778LysGln: 2.778 ± 0.291
4.762LysArg: 4.762 ± 0.781
5.952LysSer: 5.952 ± 1.425
2.778LysThr: 2.778 ± 0.291
4.365LysVal: 4.365 ± 0.567
0.0LysTrp: 0.0 ± 0.0
2.381LysTyr: 2.381 ± 0.69
0.0LysXaa: 0.0 ± 0.0
Leu
4.762LeuAla: 4.762 ± 0.184
1.587LeuCys: 1.587 ± 0.858
5.556LeuAsp: 5.556 ± 1.211
5.556LeuGlu: 5.556 ± 0.613
3.175LeuPhe: 3.175 ± 1.119
5.159LeuGly: 5.159 ± 0.398
2.778LeuHis: 2.778 ± 0.306
4.762LeuIle: 4.762 ± 1.379
7.143LeuLys: 7.143 ± 0.92
5.556LeuLeu: 5.556 ± 1.211
0.794LeuMet: 0.794 ± 0.148
3.175LeuAsn: 3.175 ± 1.273
5.952LeuPro: 5.952 ± 0.23
1.984LeuGln: 1.984 ± 0.123
5.952LeuArg: 5.952 ± 1.425
4.365LeuSer: 4.365 ± 0.629
6.746LeuThr: 6.746 ± 0.659
5.952LeuVal: 5.952 ± 2.621
0.794LeuTrp: 0.794 ± 0.169
2.381LeuTyr: 2.381 ± 0.506
0.0LeuXaa: 0.0 ± 0.0
Met
1.984MetAla: 1.984 ± 0.475
0.794MetCys: 0.794 ± 0.429
1.984MetAsp: 1.984 ± 0.721
1.587MetGlu: 1.587 ± 0.858
0.0MetPhe: 0.0 ± 0.0
0.397MetGly: 0.397 ± 0.383
0.794MetHis: 0.794 ± 0.169
1.19MetIle: 1.19 ± 0.046
1.587MetLys: 1.587 ± 0.858
1.984MetLeu: 1.984 ± 0.123
0.397MetMet: 0.397 ± 0.215
0.794MetAsn: 0.794 ± 0.169
0.794MetPro: 0.794 ± 0.429
0.397MetGln: 0.397 ± 0.215
1.19MetArg: 1.19 ± 0.046
1.984MetSer: 1.984 ± 1.073
0.794MetThr: 0.794 ± 0.169
1.587MetVal: 1.587 ± 0.337
1.587MetTrp: 1.587 ± 0.858
0.397MetTyr: 0.397 ± 0.383
0.0MetXaa: 0.0 ± 0.0
Asn
4.762AsnAla: 4.762 ± 1.61
0.397AsnCys: 0.397 ± 0.215
2.778AsnAsp: 2.778 ± 0.291
1.984AsnGlu: 1.984 ± 1.916
3.175AsnPhe: 3.175 ± 1.273
4.762AsnGly: 4.762 ± 1.012
1.19AsnHis: 1.19 ± 0.046
2.381AsnIle: 2.381 ± 1.287
2.778AsnLys: 2.778 ± 0.306
5.952AsnLeu: 5.952 ± 0.23
0.397AsnMet: 0.397 ± 0.215
4.762AsnAsn: 4.762 ± 1.61
3.175AsnPro: 3.175 ± 0.077
0.0AsnGln: 0.0 ± 0.0
1.19AsnArg: 1.19 ± 0.046
3.571AsnSer: 3.571 ± 0.46
0.794AsnThr: 0.794 ± 0.169
3.968AsnVal: 3.968 ± 1.441
0.794AsnTrp: 0.794 ± 0.169
1.984AsnTyr: 1.984 ± 1.318
0.0AsnXaa: 0.0 ± 0.0
Pro
2.778ProAla: 2.778 ± 0.291
0.794ProCys: 0.794 ± 0.169
3.175ProAsp: 3.175 ± 0.077
3.968ProGlu: 3.968 ± 0.352
3.571ProPhe: 3.571 ± 1.058
1.587ProGly: 1.587 ± 0.935
0.794ProHis: 0.794 ± 0.766
1.984ProIle: 1.984 ± 0.475
1.587ProLys: 1.587 ± 0.858
3.571ProLeu: 3.571 ± 0.736
0.794ProMet: 0.794 ± 0.429
1.984ProAsn: 1.984 ± 0.721
1.984ProPro: 1.984 ± 0.123
2.778ProGln: 2.778 ± 0.306
1.19ProArg: 1.19 ± 0.552
1.19ProSer: 1.19 ± 0.552
3.571ProThr: 3.571 ± 1.058
5.159ProVal: 5.159 ± 1.395
1.19ProTrp: 1.19 ± 0.046
0.794ProTyr: 0.794 ± 0.169
0.0ProXaa: 0.0 ± 0.0
Gln
1.984GlnAla: 1.984 ± 0.123
0.397GlnCys: 0.397 ± 0.383
1.984GlnAsp: 1.984 ± 0.475
1.984GlnGlu: 1.984 ± 0.475
0.794GlnPhe: 0.794 ± 0.169
2.381GlnGly: 2.381 ± 0.092
0.397GlnHis: 0.397 ± 0.383
1.19GlnIle: 1.19 ± 0.046
2.381GlnLys: 2.381 ± 0.69
3.175GlnLeu: 3.175 ± 0.675
0.794GlnMet: 0.794 ± 0.169
1.19GlnAsn: 1.19 ± 0.046
1.19GlnPro: 1.19 ± 0.552
0.397GlnGln: 0.397 ± 0.215
1.587GlnArg: 1.587 ± 0.26
3.175GlnSer: 3.175 ± 1.273
1.19GlnThr: 1.19 ± 0.046
3.175GlnVal: 3.175 ± 0.077
0.397GlnTrp: 0.397 ± 0.215
1.984GlnTyr: 1.984 ± 0.123
0.0GlnXaa: 0.0 ± 0.0
Arg
2.381ArgAla: 2.381 ± 0.092
0.794ArgCys: 0.794 ± 0.429
3.175ArgAsp: 3.175 ± 1.119
3.571ArgGlu: 3.571 ± 0.736
1.587ArgPhe: 1.587 ± 0.337
3.175ArgGly: 3.175 ± 0.675
0.0ArgHis: 0.0 ± 0.0
2.381ArgIle: 2.381 ± 0.092
5.159ArgLys: 5.159 ± 2.192
3.175ArgLeu: 3.175 ± 0.521
1.19ArgMet: 1.19 ± 0.046
1.984ArgAsn: 1.984 ± 0.475
3.175ArgPro: 3.175 ± 0.675
0.397ArgGln: 0.397 ± 0.215
2.778ArgArg: 2.778 ± 1.502
1.984ArgSer: 1.984 ± 0.475
1.984ArgThr: 1.984 ± 0.475
5.556ArgVal: 5.556 ± 1.181
1.19ArgTrp: 1.19 ± 0.046
1.19ArgTyr: 1.19 ± 1.15
0.0ArgXaa: 0.0 ± 0.0
Ser
3.968SerAla: 3.968 ± 0.843
0.397SerCys: 0.397 ± 0.215
3.968SerAsp: 3.968 ± 0.843
3.968SerGlu: 3.968 ± 0.95
3.175SerPhe: 3.175 ± 0.675
5.952SerGly: 5.952 ± 2.162
1.587SerHis: 1.587 ± 0.858
5.556SerIle: 5.556 ± 0.583
4.365SerLys: 4.365 ± 0.031
6.349SerLeu: 6.349 ± 0.444
1.984SerMet: 1.984 ± 0.721
4.365SerAsn: 4.365 ± 1.227
2.778SerPro: 2.778 ± 1.487
3.571SerGln: 3.571 ± 1.058
2.381SerArg: 2.381 ± 0.506
5.159SerSer: 5.159 ± 3.189
3.175SerThr: 3.175 ± 0.675
5.556SerVal: 5.556 ± 1.779
0.397SerTrp: 0.397 ± 0.215
1.587SerTyr: 1.587 ± 0.337
0.0SerXaa: 0.0 ± 0.0
Thr
1.984ThrAla: 1.984 ± 0.123
1.19ThrCys: 1.19 ± 0.644
3.175ThrAsp: 3.175 ± 0.077
0.794ThrGlu: 0.794 ± 0.169
3.968ThrPhe: 3.968 ± 0.95
4.365ThrGly: 4.365 ± 0.031
1.19ThrHis: 1.19 ± 0.644
2.381ThrIle: 2.381 ± 0.506
0.794ThrLys: 0.794 ± 0.429
5.159ThrLeu: 5.159 ± 0.797
2.778ThrMet: 2.778 ± 0.291
2.381ThrAsn: 2.381 ± 0.506
1.984ThrPro: 1.984 ± 0.123
2.778ThrGln: 2.778 ± 1.487
1.587ThrArg: 1.587 ± 0.337
7.54ThrSer: 7.54 ± 3.097
4.365ThrThr: 4.365 ± 0.629
3.571ThrVal: 3.571 ± 0.138
1.984ThrTrp: 1.984 ± 0.721
2.381ThrTyr: 2.381 ± 0.69
0.0ThrXaa: 0.0 ± 0.0
Val
4.762ValAla: 4.762 ± 0.414
1.984ValCys: 1.984 ± 1.073
3.571ValAsp: 3.571 ± 0.46
6.349ValGlu: 6.349 ± 2.238
3.968ValPhe: 3.968 ± 0.246
1.984ValGly: 1.984 ± 0.123
1.984ValHis: 1.984 ± 0.475
5.159ValIle: 5.159 ± 0.2
3.571ValLys: 3.571 ± 0.736
5.159ValLeu: 5.159 ± 0.797
1.19ValMet: 1.19 ± 0.644
5.159ValAsn: 5.159 ± 0.797
5.159ValPro: 5.159 ± 2.591
3.968ValGln: 3.968 ± 0.246
2.381ValArg: 2.381 ± 0.69
5.952ValSer: 5.952 ± 1.564
5.556ValThr: 5.556 ± 0.015
6.349ValVal: 6.349 ± 1.64
1.19ValTrp: 1.19 ± 0.046
3.175ValTyr: 3.175 ± 1.273
0.0ValXaa: 0.0 ± 0.0
Trp
1.19TrpAla: 1.19 ± 1.15
1.19TrpCys: 1.19 ± 0.644
1.19TrpAsp: 1.19 ± 0.046
1.984TrpGlu: 1.984 ± 0.123
1.19TrpPhe: 1.19 ± 0.046
1.19TrpGly: 1.19 ± 0.552
0.397TrpHis: 0.397 ± 0.215
0.794TrpIle: 0.794 ± 0.169
0.794TrpLys: 0.794 ± 0.169
0.794TrpLeu: 0.794 ± 0.429
0.397TrpMet: 0.397 ± 0.215
0.397TrpAsn: 0.397 ± 0.383
0.0TrpPro: 0.0 ± 0.0
0.794TrpGln: 0.794 ± 0.429
2.381TrpArg: 2.381 ± 0.092
0.0TrpSer: 0.0 ± 0.0
1.587TrpThr: 1.587 ± 0.26
0.0TrpVal: 0.0 ± 0.0
0.397TrpTrp: 0.397 ± 0.215
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.381TyrAla: 2.381 ± 0.092
1.984TyrCys: 1.984 ± 1.916
1.984TyrAsp: 1.984 ± 0.721
1.984TyrGlu: 1.984 ± 0.721
0.794TyrPhe: 0.794 ± 0.169
4.365TyrGly: 4.365 ± 1.165
1.984TyrHis: 1.984 ± 1.318
1.587TyrIle: 1.587 ± 0.26
1.587TyrLys: 1.587 ± 0.26
1.984TyrLeu: 1.984 ± 0.721
0.397TyrMet: 0.397 ± 0.383
2.778TyrAsn: 2.778 ± 1.487
0.0TyrPro: 0.0 ± 0.0
1.19TyrGln: 1.19 ± 0.046
0.794TyrArg: 0.794 ± 0.429
3.571TyrSer: 3.571 ± 0.46
2.778TyrThr: 2.778 ± 0.291
1.984TyrVal: 1.984 ± 0.475
0.397TyrTrp: 0.397 ± 0.383
1.19TyrTyr: 1.19 ± 0.552
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2521 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski