Amino acid dipeptide frequency for Shahe picorna-like virus 6

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
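As a rough illustration of what a dipeptide frequency is, the sketch below counts overlapping residue pairs in a set of protein sequences and reports them in per mille. It assumes one-letter amino acid codes, overlapping windows, and normalization by the total number of pairs; the exact counting procedure used by the database is not described on this page, and the function name and toy sequences are hypothetical.

```python
from collections import Counter

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Hypothetical toy sequences (one-letter codes), not taken from this proteome.
toy_proteins = ["MAAS", "MSA"]
freqs = dipeptide_permille(toy_proteins)
```

By construction, the per mille values of all observed dipeptides sum to 1000.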

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
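The uniform baseline can be written down directly. The following sketch computes the expected per mille frequency for a 20- or 21-letter alphabet and labels a value from the table as over- or underrepresented. Which baseline (400 or 441 dipeptides) the page uses for its coloring is not stated, so the default of 20 is an assumption, and the function names are hypothetical.

```python
def expected_permille(alphabet_size=20):
    """Expected frequency (per mille) of each dipeptide if all were equally likely."""
    return 1000.0 / alphabet_size ** 2

def classify(observed_permille, alphabet_size=20):
    """Label an observed frequency relative to the uniform expectation."""
    expected = expected_permille(alphabet_size)
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"

# Values taken from the table on this page (per mille).
ala_ala = classify(10.503)  # AlaAla
cys_his = classify(0.362)   # CysHis
```

With 20 amino acids the expectation is 2.5‰; with Xaa included it drops to roughly 2.27‰.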

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 10.503 ± 0.044
AlaCys: 1.811 ± 0.358
AlaAsp: 3.984 ± 1.549
AlaGlu: 1.449 ± 0.159
AlaPhe: 4.346 ± 0.157
AlaGly: 3.984 ± 0.99
AlaHis: 4.346 ± 0.478
AlaIle: 3.622 ± 0.081
AlaLys: 2.173 ± 0.556
AlaLeu: 5.795 ± 0.632
AlaMet: 1.087 ± 0.288
AlaAsn: 3.26 ± 0.752
AlaPro: 2.898 ± 1.586
AlaGln: 3.622 ± 1.351
AlaArg: 2.535 ± 0.515
AlaSer: 9.779 ± 4.162
AlaThr: 6.157 ± 0.434
AlaVal: 6.882 ± 1.233
AlaTrp: 1.087 ± 0.596
AlaTyr: 2.535 ± 1.15
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.449 ± 0.794
CysCys: 0.0 ± 0.0
CysAsp: 1.811 ± 0.358
CysGlu: 0.724 ± 0.238
CysPhe: 1.087 ± 0.039
CysGly: 1.449 ± 0.794
CysHis: 0.362 ± 0.199
CysIle: 0.362 ± 0.199
CysLys: 0.724 ± 0.238
CysLeu: 1.811 ± 0.993
CysMet: 0.724 ± 0.238
CysAsn: 0.724 ± 0.238
CysPro: 1.087 ± 0.596
CysGln: 0.0 ± 0.0
CysArg: 1.087 ± 0.596
CysSer: 1.087 ± 0.039
CysThr: 1.449 ± 0.794
CysVal: 2.173 ± 0.078
CysTrp: 0.0 ± 0.0
CysTyr: 0.724 ± 0.238
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.622 ± 0.716
AspCys: 0.362 ± 0.199
AspAsp: 5.433 ± 2.343
AspGlu: 2.535 ± 0.12
AspPhe: 2.173 ± 0.556
AspGly: 3.622 ± 0.716
AspHis: 1.811 ± 0.277
AspIle: 2.535 ± 0.515
AspLys: 3.984 ± 2.184
AspLeu: 6.519 ± 2.304
AspMet: 0.362 ± 0.199
AspAsn: 5.071 ± 0.395
AspPro: 2.173 ± 1.348
AspGln: 0.362 ± 0.436
AspArg: 3.984 ± 0.914
AspSer: 3.622 ± 0.716
AspThr: 4.346 ± 1.113
AspVal: 5.433 ± 0.196
AspTrp: 0.362 ± 0.436
AspTyr: 2.535 ± 0.12
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.26 ± 0.517
GluCys: 1.087 ± 0.596
GluAsp: 2.173 ± 0.556
GluGlu: 1.449 ± 0.794
GluPhe: 1.449 ± 0.476
GluGly: 0.362 ± 0.199
GluHis: 1.087 ± 0.596
GluIle: 1.449 ± 0.159
GluLys: 1.811 ± 0.277
GluLeu: 3.622 ± 1.189
GluMet: 0.724 ± 0.873
GluAsn: 2.898 ± 0.319
GluPro: 0.362 ± 0.436
GluGln: 0.362 ± 0.199
GluArg: 2.898 ± 1.588
GluSer: 3.622 ± 0.554
GluThr: 0.362 ± 0.436
GluVal: 3.26 ± 0.752
GluTrp: 0.0 ± 0.0
GluTyr: 1.087 ± 0.039
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.173 ± 0.713
PheCys: 1.087 ± 0.596
PheAsp: 2.535 ± 1.39
PheGlu: 2.898 ± 0.951
PhePhe: 0.362 ± 0.436
PheGly: 3.984 ± 1.549
PheHis: 1.087 ± 0.039
PheIle: 2.173 ± 0.078
PheLys: 1.087 ± 0.039
PheLeu: 2.535 ± 0.12
PheMet: 1.449 ± 0.794
PheAsn: 3.622 ± 1.189
PhePro: 2.898 ± 0.319
PheGln: 1.087 ± 0.674
PheArg: 1.811 ± 0.277
PheSer: 4.708 ± 3.133
PheThr: 2.535 ± 0.515
PheVal: 5.795 ± 1.907
PheTrp: 0.724 ± 0.238
PheTyr: 0.362 ± 0.436
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.346 ± 2.061
GlyCys: 2.173 ± 0.078
GlyAsp: 6.519 ± 3.574
GlyGlu: 1.449 ± 0.476
GlyPhe: 4.346 ± 0.478
GlyGly: 5.433 ± 1.074
GlyHis: 0.724 ± 0.397
GlyIle: 6.157 ± 2.741
GlyLys: 0.724 ± 0.397
GlyLeu: 5.433 ± 1.709
GlyMet: 0.724 ± 0.238
GlyAsn: 2.898 ± 0.316
GlyPro: 2.898 ± 0.316
GlyGln: 0.362 ± 0.199
GlyArg: 2.898 ± 0.319
GlySer: 5.433 ± 0.439
GlyThr: 3.622 ± 0.554
GlyVal: 9.779 ± 0.282
GlyTrp: 0.724 ± 0.238
GlyTyr: 2.173 ± 0.078
GlyXaa: 0.0 ± 0.0
His
HisAla: 3.26 ± 0.517
HisCys: 0.724 ± 0.397
HisAsp: 1.811 ± 0.993
HisGlu: 0.724 ± 0.238
HisPhe: 1.811 ± 0.358
HisGly: 1.449 ± 0.794
HisHis: 1.087 ± 0.596
HisIle: 1.449 ± 0.476
HisLys: 1.087 ± 0.596
HisLeu: 2.173 ± 0.078
HisMet: 1.449 ± 0.476
HisAsn: 0.362 ± 0.199
HisPro: 0.724 ± 0.397
HisGln: 0.362 ± 0.199
HisArg: 1.449 ± 0.476
HisSer: 2.173 ± 0.078
HisThr: 1.449 ± 0.794
HisVal: 2.173 ± 0.713
HisTrp: 0.362 ± 0.199
HisTyr: 1.449 ± 0.159
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.795 ± 0.637
IleCys: 0.362 ± 0.199
IleAsp: 1.449 ± 0.476
IleGlu: 2.535 ± 0.755
IlePhe: 2.535 ± 0.12
IleGly: 3.26 ± 0.752
IleHis: 1.449 ± 0.794
IleIle: 1.449 ± 0.794
IleLys: 1.449 ± 0.159
IleLeu: 4.346 ± 0.478
IleMet: 1.449 ± 0.14
IleAsn: 4.708 ± 1.228
IlePro: 1.811 ± 1.547
IleGln: 0.0 ± 0.0
IleArg: 2.535 ± 0.12
IleSer: 3.622 ± 1.189
IleThr: 4.708 ± 0.042
IleVal: 3.622 ± 0.716
IleTrp: 0.724 ± 0.238
IleTyr: 2.535 ± 0.755
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.26 ± 1.152
LysCys: 1.087 ± 0.039
LysAsp: 1.449 ± 0.476
LysGlu: 1.449 ± 0.159
LysPhe: 2.535 ± 0.12
LysGly: 3.622 ± 0.081
LysHis: 1.087 ± 0.596
LysIle: 2.535 ± 0.12
LysLys: 0.724 ± 0.397
LysLeu: 2.898 ± 0.951
LysMet: 1.087 ± 0.596
LysAsn: 1.811 ± 0.277
LysPro: 0.724 ± 0.238
LysGln: 0.362 ± 0.199
LysArg: 2.535 ± 1.39
LysSer: 3.622 ± 1.986
LysThr: 4.346 ± 1.748
LysVal: 2.535 ± 0.755
LysTrp: 0.0 ± 0.0
LysTyr: 2.898 ± 0.954
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.968 ± 1.346
LeuCys: 1.811 ± 0.358
LeuAsp: 4.708 ± 1.228
LeuGlu: 3.26 ± 0.517
LeuPhe: 2.535 ± 1.15
LeuGly: 6.882 ± 0.598
LeuHis: 2.535 ± 0.515
LeuIle: 2.898 ± 0.316
LeuLys: 3.984 ± 0.914
LeuLeu: 5.795 ± 1.907
LeuMet: 1.811 ± 0.358
LeuAsn: 2.173 ± 0.556
LeuPro: 5.071 ± 1.664
LeuGln: 3.984 ± 0.279
LeuArg: 3.984 ± 0.355
LeuSer: 5.433 ± 0.439
LeuThr: 3.26 ± 1.152
LeuVal: 4.346 ± 1.113
LeuTrp: 1.449 ± 0.794
LeuTyr: 3.984 ± 0.914
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.535 ± 0.515
MetCys: 0.724 ± 0.238
MetAsp: 1.811 ± 0.277
MetGlu: 0.724 ± 0.238
MetPhe: 0.724 ± 0.238
MetGly: 2.173 ± 0.078
MetHis: 0.0 ± 0.0
MetIle: 1.811 ± 0.358
MetLys: 1.449 ± 0.476
MetLeu: 1.811 ± 0.277
MetMet: 0.724 ± 0.397
MetAsn: 2.173 ± 0.078
MetPro: 0.724 ± 0.873
MetGln: 0.362 ± 0.199
MetArg: 2.173 ± 0.713
MetSer: 3.622 ± 1.189
MetThr: 0.362 ± 0.199
MetVal: 1.449 ± 0.476
MetTrp: 0.362 ± 0.199
MetTyr: 0.724 ± 0.397
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.708 ± 1.228
AsnCys: 0.724 ± 0.397
AsnAsp: 2.898 ± 0.954
AsnGlu: 1.087 ± 0.674
AsnPhe: 2.535 ± 0.515
AsnGly: 3.26 ± 0.517
AsnHis: 1.811 ± 0.358
AsnIle: 2.898 ± 0.316
AsnLys: 1.449 ± 0.794
AsnLeu: 3.26 ± 0.118
AsnMet: 1.449 ± 0.159
AsnAsn: 3.984 ± 0.99
AsnPro: 6.157 ± 1.704
AsnGln: 1.811 ± 0.277
AsnArg: 1.087 ± 0.039
AsnSer: 4.708 ± 0.593
AsnThr: 5.071 ± 1.029
AsnVal: 4.346 ± 0.157
AsnTrp: 1.087 ± 0.039
AsnTyr: 2.173 ± 0.078
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.346 ± 2.696
ProCys: 0.362 ± 0.199
ProAsp: 1.811 ± 0.277
ProGlu: 1.087 ± 0.674
ProPhe: 4.346 ± 0.157
ProGly: 1.811 ± 0.277
ProHis: 1.811 ± 0.912
ProIle: 2.898 ± 1.586
ProLys: 2.173 ± 0.556
ProLeu: 3.984 ± 0.99
ProMet: 1.087 ± 0.039
ProAsn: 1.811 ± 0.277
ProPro: 2.898 ± 0.316
ProGln: 1.449 ± 1.11
ProArg: 2.535 ± 0.12
ProSer: 2.898 ± 0.951
ProThr: 3.984 ± 2.26
ProVal: 2.898 ± 0.951
ProTrp: 0.724 ± 0.873
ProTyr: 1.811 ± 0.358
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.811 ± 0.993
GlnCys: 0.724 ± 0.397
GlnAsp: 1.087 ± 0.039
GlnGlu: 0.724 ± 0.397
GlnPhe: 1.087 ± 0.674
GlnGly: 1.087 ± 0.039
GlnHis: 0.362 ± 0.199
GlnIle: 2.173 ± 0.713
GlnLys: 1.449 ± 0.159
GlnLeu: 0.362 ± 0.199
GlnMet: 0.724 ± 0.873
GlnAsn: 0.724 ± 0.238
GlnPro: 1.087 ± 0.039
GlnGln: 0.362 ± 0.199
GlnArg: 0.724 ± 0.238
GlnSer: 3.622 ± 0.716
GlnThr: 1.449 ± 0.476
GlnVal: 1.811 ± 0.358
GlnTrp: 0.724 ± 0.397
GlnTyr: 1.811 ± 1.547
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.535 ± 0.12
ArgCys: 1.087 ± 0.039
ArgAsp: 2.898 ± 0.319
ArgGlu: 1.087 ± 0.596
ArgPhe: 0.0 ± 0.0
ArgGly: 2.535 ± 0.515
ArgHis: 1.087 ± 0.039
ArgIle: 2.535 ± 0.12
ArgLys: 2.535 ± 1.39
ArgLeu: 2.898 ± 1.586
ArgMet: 1.811 ± 0.358
ArgAsn: 3.984 ± 0.914
ArgPro: 3.26 ± 1.387
ArgGln: 2.535 ± 1.39
ArgArg: 2.535 ± 1.39
ArgSer: 3.622 ± 1.351
ArgThr: 4.708 ± 0.042
ArgVal: 5.071 ± 2.145
ArgTrp: 1.087 ± 0.674
ArgTyr: 1.449 ± 0.159
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.795 ± 0.003
SerCys: 1.087 ± 0.674
SerAsp: 3.984 ± 1.549
SerGlu: 1.449 ± 0.476
SerPhe: 3.622 ± 0.554
SerGly: 8.33 ± 0.512
SerHis: 1.087 ± 0.596
SerIle: 6.882 ± 0.037
SerLys: 3.984 ± 0.99
SerLeu: 9.779 ± 2.257
SerMet: 2.535 ± 1.784
SerAsn: 3.984 ± 0.99
SerPro: 3.26 ± 0.752
SerGln: 1.087 ± 1.309
SerArg: 4.708 ± 1.311
SerSer: 5.071 ± 1.664
SerThr: 3.984 ± 2.26
SerVal: 8.33 ± 0.512
SerTrp: 1.449 ± 0.794
SerTyr: 5.433 ± 0.831
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.622 ± 1.824
ThrCys: 2.173 ± 0.556
ThrAsp: 3.26 ± 0.517
ThrGlu: 1.449 ± 0.159
ThrPhe: 3.26 ± 0.752
ThrGly: 4.708 ± 1.228
ThrHis: 2.898 ± 0.319
ThrIle: 3.622 ± 0.081
ThrLys: 1.087 ± 0.674
ThrLeu: 4.708 ± 1.946
ThrMet: 1.449 ± 0.476
ThrAsn: 2.898 ± 0.319
ThrPro: 3.622 ± 0.554
ThrGln: 2.173 ± 1.348
ThrArg: 2.898 ± 1.588
ThrSer: 5.071 ± 0.24
ThrThr: 3.984 ± 2.26
ThrVal: 6.519 ± 1.035
ThrTrp: 0.724 ± 0.397
ThrTyr: 2.535 ± 1.784
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.606 ± 0.995
ValCys: 1.087 ± 0.596
ValAsp: 6.519 ± 1.505
ValGlu: 4.346 ± 1.113
ValPhe: 2.535 ± 1.39
ValGly: 6.882 ± 3.773
ValHis: 2.173 ± 0.556
ValIle: 1.449 ± 0.159
ValLys: 4.346 ± 1.113
ValLeu: 4.708 ± 0.677
ValMet: 3.984 ± 0.99
ValAsn: 5.433 ± 1.709
ValPro: 4.346 ± 2.061
ValGln: 1.811 ± 0.277
ValArg: 4.346 ± 0.478
ValSer: 7.968 ± 1.346
ValThr: 3.622 ± 0.081
ValVal: 6.157 ± 0.201
ValTrp: 2.173 ± 0.078
ValTyr: 5.433 ± 0.831
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.449 ± 0.476
TrpCys: 0.362 ± 0.199
TrpAsp: 1.449 ± 0.476
TrpGlu: 0.362 ± 0.436
TrpPhe: 1.087 ± 0.596
TrpGly: 0.724 ± 0.397
TrpHis: 0.0 ± 0.0
TrpIle: 0.724 ± 0.238
TrpLys: 1.811 ± 0.358
TrpLeu: 1.449 ± 0.159
TrpMet: 0.362 ± 0.199
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.362 ± 0.199
TrpArg: 0.724 ± 0.873
TrpSer: 1.449 ± 0.476
TrpThr: 0.362 ± 0.199
TrpVal: 1.087 ± 0.596
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.362 ± 0.199
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.173 ± 0.078
TyrCys: 0.0 ± 0.0
TyrAsp: 3.26 ± 0.752
TyrGlu: 2.535 ± 0.12
TyrPhe: 2.535 ± 0.515
TyrGly: 3.26 ± 1.152
TyrHis: 0.724 ± 0.238
TyrIle: 0.724 ± 0.238
TyrLys: 2.535 ± 1.39
TyrLeu: 3.984 ± 0.914
TyrMet: 1.087 ± 0.674
TyrAsn: 3.984 ± 1.625
TyrPro: 0.724 ± 0.397
TyrGln: 1.449 ± 0.159
TyrArg: 1.811 ± 0.277
TyrSer: 4.708 ± 1.863
TyrThr: 2.898 ± 0.319
TyrVal: 3.26 ± 0.752
TyrTrp: 0.362 ± 0.436
TyrTyr: 0.724 ± 0.873
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2762 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
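One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. Using the standard deviation as the error measure, the fixed seed, and all function names are assumptions made for illustration; the page does not specify these details.

```python
import random
import statistics
from collections import Counter

def pair_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide among all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)

# Hypothetical toy sequences (one-letter codes), not taken from this proteome.
error = bootstrap_error(["MAAS", "MSAA"], "AA")
```

With only two proteins, as in this proteome, each resample can contain only one of three distinct combinations, so the resulting error estimates are necessarily coarse.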

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski