Amino acid dipeptide frequency for Shahe picorna-like virus 11

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred at random and with equal probability in proteins, each of the 400 would be expected at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
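A minimal Python sketch of how such a table could be computed is shown here. It is not the Proteome-pI implementation: the function name `dipeptide_per_mille`, the use of one-letter codes (with `X` standing for Xaa), and the choice to count overlapping pairs that never span two proteins are all assumptions made for illustration.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code; "X" stands for Xaa.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = AMINO_ACIDS + "X"

# Expected frequency if all 400 standard dipeptides were equally likely.
UNIFORM_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Assumption: pairs are overlapping windows of length 2 and are
    counted within each protein only, never across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        for a, b in zip(seq, seq[1:]):
            # Anything outside the 20 standard residues is treated as Xaa.
            a = a if a in AMINO_ACIDS else "X"
            b = b if b in AMINO_ACIDS else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found (all sequences shorter than 2)")
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(ALPHABET, repeat=2)
    }


if __name__ == "__main__":
    # Toy sequences, not taken from the proteome described here.
    freqs = dipeptide_per_mille(["MKVLAAGG", "AAK"])
    for pair in ("AA", "AK", "WW"):
        label = "over" if freqs[pair] > UNIFORM_PER_MILLE else "under"
        print(f"{pair}: {freqs[pair]:.3f} per mille ({label}represented)")
```

Multiplying a returned value by 10^-3 gives the fraction of all dipeptides, and comparing it with `UNIFORM_PER_MILLE` mirrors the red/blue marking described above under the uniform-expectation assumption.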

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
4.381AlaAla: 4.381 ± 2.605
1.593AlaCys: 1.593 ± 1.272
3.186AlaAsp: 3.186 ± 1.028
1.593AlaGlu: 1.593 ± 0.157
2.788AlaPhe: 2.788 ± 0.619
5.575AlaGly: 5.575 ± 1.951
0.0AlaHis: 0.0 ± 0.0
2.389AlaIle: 2.389 ± 0.122
5.974AlaLys: 5.974 ± 1.838
6.372AlaLeu: 6.372 ± 2.77
1.195AlaMet: 1.195 ± 0.775
3.186AlaAsn: 3.186 ± 1.115
3.186AlaPro: 3.186 ± 1.115
0.398AlaGln: 0.398 ± 0.218
1.593AlaArg: 1.593 ± 0.871
5.575AlaSer: 5.575 ± 2.666
3.982AlaThr: 3.982 ± 0.035
4.779AlaVal: 4.779 ± 1.673
0.0AlaTrp: 0.0 ± 0.0
1.195AlaTyr: 1.195 ± 0.653
0.0AlaXaa: 0.0 ± 0.0
Cys
0.796CysAla: 0.796 ± 0.993
0.0CysCys: 0.0 ± 0.0
0.796CysAsp: 0.796 ± 0.279
0.398CysGlu: 0.398 ± 0.218
2.389CysPhe: 2.389 ± 0.122
1.195CysGly: 1.195 ± 0.653
0.0CysHis: 0.0 ± 0.0
1.195CysIle: 1.195 ± 0.653
0.0CysLys: 0.0 ± 0.0
1.593CysLeu: 1.593 ± 0.871
0.0CysMet: 0.0 ± 0.0
1.195CysAsn: 1.195 ± 0.775
1.593CysPro: 1.593 ± 0.157
0.398CysGln: 0.398 ± 0.218
0.398CysArg: 0.398 ± 0.218
1.195CysSer: 1.195 ± 0.653
0.796CysThr: 0.796 ± 0.435
0.796CysVal: 0.796 ± 0.279
0.398CysTrp: 0.398 ± 0.218
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.593AspAla: 1.593 ± 0.558
1.593AspCys: 1.593 ± 0.157
4.779AspAsp: 4.779 ± 1.184
3.584AspGlu: 3.584 ± 1.245
5.974AspPhe: 5.974 ± 1.123
2.389AspGly: 2.389 ± 0.592
1.195AspHis: 1.195 ± 0.061
4.381AspIle: 4.381 ± 0.462
4.779AspLys: 4.779 ± 2.613
3.584AspLeu: 3.584 ± 0.531
1.593AspMet: 1.593 ± 0.871
4.381AspAsn: 4.381 ± 0.967
1.593AspPro: 1.593 ± 0.157
0.398AspGln: 0.398 ± 0.497
1.593AspArg: 1.593 ± 0.157
2.788AspSer: 2.788 ± 1.524
2.389AspThr: 2.389 ± 0.836
3.186AspVal: 3.186 ± 1.742
0.398AspTrp: 0.398 ± 0.218
3.186AspTyr: 3.186 ± 0.401
0.0AspXaa: 0.0 ± 0.0
Glu
2.788GluAla: 2.788 ± 0.81
0.398GluCys: 0.398 ± 0.218
3.982GluAsp: 3.982 ± 0.749
4.381GluGlu: 4.381 ± 2.395
2.788GluPhe: 2.788 ± 1.524
3.584GluGly: 3.584 ± 1.96
0.796GluHis: 0.796 ± 0.279
5.177GluIle: 5.177 ± 0.688
2.389GluLys: 2.389 ± 0.592
3.982GluLeu: 3.982 ± 0.035
1.593GluMet: 1.593 ± 0.157
2.389GluAsn: 2.389 ± 0.592
4.381GluPro: 4.381 ± 0.967
0.398GluGln: 0.398 ± 0.218
1.593GluArg: 1.593 ± 0.157
2.788GluSer: 2.788 ± 1.524
2.788GluThr: 2.788 ± 0.81
3.982GluVal: 3.982 ± 0.749
0.796GluTrp: 0.796 ± 0.435
1.991GluTyr: 1.991 ± 1.089
0.0GluXaa: 0.0 ± 0.0
Phe
3.186PheAla: 3.186 ± 0.313
1.195PheCys: 1.195 ± 0.653
3.186PheAsp: 3.186 ± 0.313
3.186PheGlu: 3.186 ± 0.401
1.991PhePhe: 1.991 ± 1.089
5.177PheGly: 5.177 ± 0.026
1.593PheHis: 1.593 ± 0.157
3.584PheIle: 3.584 ± 0.183
2.389PheLys: 2.389 ± 0.592
5.177PheLeu: 5.177 ± 0.688
0.0PheMet: 0.0 ± 0.0
3.186PheAsn: 3.186 ± 0.401
1.593PhePro: 1.593 ± 0.558
2.389PheGln: 2.389 ± 0.592
3.584PheArg: 3.584 ± 0.183
7.567PheSer: 7.567 ± 3.005
3.186PheThr: 3.186 ± 0.401
3.584PheVal: 3.584 ± 0.531
0.398PheTrp: 0.398 ± 0.218
0.796PheTyr: 0.796 ± 0.279
0.0PheXaa: 0.0 ± 0.0
Gly
5.575GlyAla: 5.575 ± 0.191
0.398GlyCys: 0.398 ± 0.497
2.788GlyAsp: 2.788 ± 0.096
2.389GlyGlu: 2.389 ± 0.592
2.788GlyPhe: 2.788 ± 0.619
2.788GlyGly: 2.788 ± 0.619
1.991GlyHis: 1.991 ± 0.374
6.372GlyIle: 6.372 ± 0.802
3.186GlyLys: 3.186 ± 1.742
4.381GlyLeu: 4.381 ± 0.252
0.796GlyMet: 0.796 ± 0.435
3.186GlyAsn: 3.186 ± 0.401
1.593GlyPro: 1.593 ± 0.558
1.593GlyGln: 1.593 ± 0.558
1.991GlyArg: 1.991 ± 1.054
5.575GlySer: 5.575 ± 0.906
5.575GlyThr: 5.575 ± 1.237
6.77GlyVal: 6.77 ± 0.845
0.796GlyTrp: 0.796 ± 0.279
3.186GlyTyr: 3.186 ± 0.401
0.0GlyXaa: 0.0 ± 0.0
His
0.398HisAla: 0.398 ± 0.497
0.398HisCys: 0.398 ± 0.218
0.398HisAsp: 0.398 ± 0.218
0.796HisGlu: 0.796 ± 0.435
1.991HisPhe: 1.991 ± 0.34
0.796HisGly: 0.796 ± 0.435
0.0HisHis: 0.0 ± 0.0
1.991HisIle: 1.991 ± 0.34
0.398HisLys: 0.398 ± 0.218
2.389HisLeu: 2.389 ± 0.122
0.398HisMet: 0.398 ± 0.218
0.796HisAsn: 0.796 ± 0.435
0.796HisPro: 0.796 ± 0.435
0.0HisGln: 0.0 ± 0.0
1.195HisArg: 1.195 ± 0.775
1.991HisSer: 1.991 ± 0.34
0.796HisThr: 0.796 ± 0.435
3.186HisVal: 3.186 ± 0.401
0.0HisTrp: 0.0 ± 0.0
0.398HisTyr: 0.398 ± 0.497
0.0HisXaa: 0.0 ± 0.0
Ile
4.779IleAla: 4.779 ± 1.184
0.796IleCys: 0.796 ± 0.279
3.584IleAsp: 3.584 ± 0.531
4.779IleGlu: 4.779 ± 0.47
3.186IlePhe: 3.186 ± 1.742
3.982IleGly: 3.982 ± 0.035
1.195IleHis: 1.195 ± 0.061
3.584IleIle: 3.584 ± 1.245
2.788IleLys: 2.788 ± 0.81
3.982IleLeu: 3.982 ± 0.035
0.796IleMet: 0.796 ± 0.225
3.982IleAsn: 3.982 ± 0.035
3.584IlePro: 3.584 ± 0.897
2.788IleGln: 2.788 ± 0.619
3.186IleArg: 3.186 ± 0.313
7.168IleSer: 7.168 ± 3.223
4.779IleThr: 4.779 ± 1.184
6.77IleVal: 6.77 ± 0.845
0.796IleTrp: 0.796 ± 0.279
1.195IleTyr: 1.195 ± 0.775
0.0IleXaa: 0.0 ± 0.0
Lys
4.779LysAla: 4.779 ± 1.184
0.398LysCys: 0.398 ± 0.218
2.788LysAsp: 2.788 ± 1.524
3.186LysGlu: 3.186 ± 1.028
2.788LysPhe: 2.788 ± 0.096
2.389LysGly: 2.389 ± 1.306
0.398LysHis: 0.398 ± 0.218
5.177LysIle: 5.177 ± 1.402
1.593LysLys: 1.593 ± 0.871
5.575LysLeu: 5.575 ± 2.334
1.991LysMet: 1.991 ± 0.374
3.584LysAsn: 3.584 ± 0.531
1.593LysPro: 1.593 ± 0.558
1.991LysGln: 1.991 ± 1.089
3.186LysArg: 3.186 ± 1.028
2.788LysSer: 2.788 ± 0.81
4.381LysThr: 4.381 ± 0.252
3.186LysVal: 3.186 ± 0.313
0.0LysTrp: 0.0 ± 0.0
1.991LysTyr: 1.991 ± 1.089
0.0LysXaa: 0.0 ± 0.0
Leu
5.974LeuAla: 5.974 ± 0.305
1.991LeuCys: 1.991 ± 0.374
5.177LeuAsp: 5.177 ± 2.116
3.982LeuGlu: 3.982 ± 1.463
3.982LeuPhe: 3.982 ± 0.68
4.779LeuGly: 4.779 ± 2.387
1.195LeuHis: 1.195 ± 0.061
4.779LeuIle: 4.779 ± 0.244
6.77LeuLys: 6.77 ± 2.273
4.381LeuLeu: 4.381 ± 2.395
2.788LeuMet: 2.788 ± 0.719
7.567LeuAsn: 7.567 ± 0.566
4.381LeuPro: 4.381 ± 0.462
1.991LeuGln: 1.991 ± 1.768
3.584LeuArg: 3.584 ± 0.183
7.965LeuSer: 7.965 ± 3.502
5.177LeuThr: 5.177 ± 1.455
6.77LeuVal: 6.77 ± 1.559
0.398LeuTrp: 0.398 ± 0.218
1.593LeuTyr: 1.593 ± 0.157
0.0LeuXaa: 0.0 ± 0.0
Met
0.796MetAla: 0.796 ± 0.279
0.0MetCys: 0.0 ± 0.0
0.796MetAsp: 0.796 ± 0.279
2.389MetGlu: 2.389 ± 0.592
1.593MetPhe: 1.593 ± 0.157
1.195MetGly: 1.195 ± 0.653
1.991MetHis: 1.991 ± 0.374
0.398MetIle: 0.398 ± 0.218
0.796MetLys: 0.796 ± 0.435
3.584MetLeu: 3.584 ± 0.897
1.195MetMet: 1.195 ± 0.061
1.593MetAsn: 1.593 ± 0.157
2.389MetPro: 2.389 ± 0.592
0.0MetGln: 0.0 ± 0.0
1.593MetArg: 1.593 ± 0.157
0.398MetSer: 0.398 ± 0.218
2.389MetThr: 2.389 ± 0.836
1.991MetVal: 1.991 ± 1.768
0.0MetTrp: 0.0 ± 0.0
0.796MetTyr: 0.796 ± 0.279
0.0MetXaa: 0.0 ± 0.0
Asn
3.186AsnAla: 3.186 ± 1.829
0.398AsnCys: 0.398 ± 0.218
1.195AsnAsp: 1.195 ± 0.653
4.779AsnGlu: 4.779 ± 2.613
5.575AsnPhe: 5.575 ± 1.951
3.982AsnGly: 3.982 ± 0.749
0.398AsnHis: 0.398 ± 0.497
4.381AsnIle: 4.381 ± 0.462
2.788AsnLys: 2.788 ± 0.096
4.779AsnLeu: 4.779 ± 0.244
2.389AsnMet: 2.389 ± 0.836
3.186AsnAsn: 3.186 ± 0.401
3.186AsnPro: 3.186 ± 0.401
2.788AsnGln: 2.788 ± 1.333
1.593AsnArg: 1.593 ± 0.871
4.779AsnSer: 4.779 ± 0.244
2.788AsnThr: 2.788 ± 3.476
3.584AsnVal: 3.584 ± 0.183
0.796AsnTrp: 0.796 ± 0.279
2.788AsnTyr: 2.788 ± 0.81
0.0AsnXaa: 0.0 ± 0.0
Pro
2.788ProAla: 2.788 ± 1.333
0.398ProCys: 0.398 ± 0.218
1.593ProAsp: 1.593 ± 0.157
3.584ProGlu: 3.584 ± 1.96
3.584ProPhe: 3.584 ± 1.612
1.593ProGly: 1.593 ± 0.157
1.593ProHis: 1.593 ± 0.157
2.389ProIle: 2.389 ± 0.836
1.991ProLys: 1.991 ± 1.089
4.779ProLeu: 4.779 ± 1.673
0.796ProMet: 0.796 ± 0.279
1.195ProAsn: 1.195 ± 0.775
1.195ProPro: 1.195 ± 0.061
3.982ProGln: 3.982 ± 0.035
3.584ProArg: 3.584 ± 0.183
2.788ProSer: 2.788 ± 0.096
2.389ProThr: 2.389 ± 0.836
3.186ProVal: 3.186 ± 1.115
0.398ProTrp: 0.398 ± 0.218
2.788ProTyr: 2.788 ± 1.333
0.0ProXaa: 0.0 ± 0.0
Gln
0.398GlnAla: 0.398 ± 0.218
0.398GlnCys: 0.398 ± 0.218
1.991GlnAsp: 1.991 ± 0.34
0.796GlnGlu: 0.796 ± 0.435
0.398GlnPhe: 0.398 ± 0.218
1.991GlnGly: 1.991 ± 0.34
0.796GlnHis: 0.796 ± 0.435
1.991GlnIle: 1.991 ± 0.374
0.398GlnLys: 0.398 ± 0.218
2.788GlnLeu: 2.788 ± 0.619
0.796GlnMet: 0.796 ± 0.279
1.195GlnAsn: 1.195 ± 0.775
1.195GlnPro: 1.195 ± 0.061
0.796GlnGln: 0.796 ± 0.279
1.195GlnArg: 1.195 ± 0.061
3.982GlnSer: 3.982 ± 1.394
2.389GlnThr: 2.389 ± 0.122
2.389GlnVal: 2.389 ± 0.836
0.0GlnTrp: 0.0 ± 0.0
1.593GlnTyr: 1.593 ± 0.157
0.0GlnXaa: 0.0 ± 0.0
Arg
2.389ArgAla: 2.389 ± 0.122
0.398ArgCys: 0.398 ± 0.218
2.389ArgAsp: 2.389 ± 1.306
2.788ArgGlu: 2.788 ± 0.096
2.389ArgPhe: 2.389 ± 1.306
3.584ArgGly: 3.584 ± 0.531
1.195ArgHis: 1.195 ± 0.775
2.389ArgIle: 2.389 ± 1.306
2.788ArgLys: 2.788 ± 0.81
3.584ArgLeu: 3.584 ± 0.897
0.796ArgMet: 0.796 ± 0.279
1.991ArgAsn: 1.991 ± 0.374
1.991ArgPro: 1.991 ± 0.374
1.195ArgGln: 1.195 ± 0.061
1.593ArgArg: 1.593 ± 0.871
3.584ArgSer: 3.584 ± 0.897
1.991ArgThr: 1.991 ± 1.054
3.982ArgVal: 3.982 ± 0.035
0.0ArgTrp: 0.0 ± 0.0
1.991ArgTyr: 1.991 ± 0.374
0.0ArgXaa: 0.0 ± 0.0
Ser
6.372SerAla: 6.372 ± 0.087
0.796SerCys: 0.796 ± 0.435
4.779SerAsp: 4.779 ± 0.47
3.186SerGlu: 3.186 ± 1.028
3.584SerPhe: 3.584 ± 1.612
5.974SerGly: 5.974 ± 1.734
0.796SerHis: 0.796 ± 0.279
5.974SerIle: 5.974 ± 1.734
4.381SerLys: 4.381 ± 1.89
9.16SerLeu: 9.16 ± 0.723
2.389SerMet: 2.389 ± 0.836
5.177SerAsn: 5.177 ± 2.169
1.593SerPro: 1.593 ± 0.157
1.195SerGln: 1.195 ± 0.061
3.186SerArg: 3.186 ± 1.115
11.151SerSer: 11.151 ± 4.617
6.77SerThr: 6.77 ± 4.155
5.575SerVal: 5.575 ± 1.62
1.593SerTrp: 1.593 ± 0.157
1.593SerTyr: 1.593 ± 0.871
0.0SerXaa: 0.0 ± 0.0
Thr
2.788ThrAla: 2.788 ± 0.619
0.398ThrCys: 0.398 ± 0.218
4.381ThrAsp: 4.381 ± 0.462
1.991ThrGlu: 1.991 ± 0.374
4.381ThrPhe: 4.381 ± 0.252
5.177ThrGly: 5.177 ± 1.455
1.593ThrHis: 1.593 ± 1.272
3.982ThrIle: 3.982 ± 0.68
3.982ThrLys: 3.982 ± 0.749
7.965ThrLeu: 7.965 ± 4.93
2.389ThrMet: 2.389 ± 0.836
3.584ThrAsn: 3.584 ± 1.612
2.389ThrPro: 2.389 ± 0.836
1.593ThrGln: 1.593 ± 0.157
1.195ThrArg: 1.195 ± 0.653
3.186ThrSer: 3.186 ± 1.028
3.584ThrThr: 3.584 ± 0.531
4.779ThrVal: 4.779 ± 1.673
0.796ThrTrp: 0.796 ± 0.993
2.788ThrTyr: 2.788 ± 0.096
0.0ThrXaa: 0.0 ± 0.0
Val
5.177ValAla: 5.177 ± 0.741
1.991ValCys: 1.991 ± 0.374
3.584ValAsp: 3.584 ± 0.897
3.186ValGlu: 3.186 ± 0.313
2.788ValPhe: 2.788 ± 0.619
5.974ValGly: 5.974 ± 0.305
1.991ValHis: 1.991 ± 1.089
4.779ValIle: 4.779 ± 1.184
3.584ValLys: 3.584 ± 1.245
4.779ValLeu: 4.779 ± 1.184
2.389ValMet: 2.389 ± 0.592
5.575ValAsn: 5.575 ± 0.523
7.168ValPro: 7.168 ± 3.223
2.389ValGln: 2.389 ± 0.122
3.584ValArg: 3.584 ± 1.245
5.575ValSer: 5.575 ± 1.951
3.584ValThr: 3.584 ± 0.183
4.779ValVal: 4.779 ± 0.958
0.0ValTrp: 0.0 ± 0.0
3.186ValTyr: 3.186 ± 0.313
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.796TrpAsp: 0.796 ± 0.435
0.796TrpGlu: 0.796 ± 0.279
0.0TrpPhe: 0.0 ± 0.0
0.796TrpGly: 0.796 ± 0.279
0.0TrpHis: 0.0 ± 0.0
0.796TrpIle: 0.796 ± 0.435
0.398TrpLys: 0.398 ± 0.218
0.398TrpLeu: 0.398 ± 0.497
0.0TrpMet: 0.0 ± 0.0
0.398TrpAsn: 0.398 ± 0.497
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.398TrpArg: 0.398 ± 0.497
0.796TrpSer: 0.796 ± 0.279
0.398TrpThr: 0.398 ± 0.218
0.796TrpVal: 0.796 ± 0.279
0.0TrpTrp: 0.0 ± 0.0
1.593TrpTyr: 1.593 ± 0.871
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.796TyrAla: 0.796 ± 0.279
1.593TyrCys: 1.593 ± 0.157
3.186TyrAsp: 3.186 ± 1.742
1.195TyrGlu: 1.195 ± 0.653
1.991TyrPhe: 1.991 ± 0.374
0.796TyrGly: 0.796 ± 0.435
0.398TyrHis: 0.398 ± 0.218
2.389TyrIle: 2.389 ± 0.592
2.389TyrLys: 2.389 ± 0.592
1.991TyrLeu: 1.991 ± 1.054
1.195TyrMet: 1.195 ± 0.061
1.991TyrAsn: 1.991 ± 0.34
1.195TyrPro: 1.195 ± 0.061
1.195TyrGln: 1.195 ± 0.061
3.186TyrArg: 3.186 ± 1.028
3.584TyrSer: 3.584 ± 0.531
2.788TyrThr: 2.788 ± 0.619
1.991TyrVal: 1.991 ± 1.054
0.796TyrTrp: 0.796 ± 0.279
1.593TyrTyr: 1.593 ± 0.871
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2512 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
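Protein-level bootstrapping can be sketched as follows. This is an illustration under stated assumptions, not the procedure used by Proteome-pI: the helper names `pair_frequency` and `bootstrap_error` are hypothetical, and taking the standard deviation across replicates as the reported ± value is an assumption, since the page does not say which statistic is used.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, dipeptide):
    """Frequency of one dipeptide in per mille over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Spread of the frequency when whole proteins are resampled.

    Each replicate draws len(proteins) proteins with replacement, so
    the unit of resampling is the protein, not the single residue.
    Assumption: the error is the standard deviation across replicates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, dipeptide))
    return statistics.pstdev(estimates)


if __name__ == "__main__":
    # Toy sequences, not taken from the proteome described here.
    toy = ["MKVLAAGGAA", "AAKSSLSS"]
    print(f"AA: {pair_frequency(toy, 'AA'):.3f} "
          f"± {bootstrap_error(toy, 'AA'):.3f} per mille")
```

With only 2 proteins, each replicate can contain just a few distinct combinations of proteins, so error estimates obtained this way are coarse for small proteomes.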

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski