Amino acid dipeptide frequency for Sanxia picorna-like virus 3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
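As an illustration of how such frequencies can be computed, here is a minimal Python sketch. It is not the Proteome-pI implementation: the function name `dipeptide_per_mille`, the use of one-letter codes, the mapping of every non-standard character to `X`, and the choice to count overlapping pairs within each protein only are all assumptions made for this example, and the input sequences are toy data.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids (one-letter codes); anything else becomes "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs."""
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X (assumption of this sketch).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Count overlapping pairs; a pair never spans two proteins.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Hypothetical toy sequences, not the Sanxia picorna-like virus 3 proteome.
freqs = dipeptide_per_mille(["MAAG", "AAB"])
print(len(freqs))    # number of dipeptide classes (21 * 21)
print(freqs["AA"])   # AlaAla frequency in per mille
```

Under the uniform assumption, every one of the 400 standard dipeptides would have a value of 1000 / 400 = 2.5‰ in such a dictionary.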

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
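The unit conversion and the over/under-representation check can be sketched in a few lines of Python. The helper names are hypothetical, and comparing against the uniform expectation of 2.5‰ is an assumption about how the page decides on the red and blue marking; the two input values are taken from the table (LeuSer and CysCys).

```python
# Uniform expectation over the 400 standard dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=EXPECTED_PER_MILLE):
    """Label a per mille value relative to the uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"


leu_ser = 10.196  # LeuSer, from the table
cys_cys = 0.0     # CysCys, from the table

print(per_mille_to_fraction(leu_ser))  # fraction of all dipeptides
print(classify(leu_ser), classify(cys_cys))
```

With these inputs, LeuSer falls in the "over" class (roughly four times the uniform expectation) and CysCys in the "under" class.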

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.532 ± 1.881
AlaCys: 1.511 ± 0.829
AlaAsp: 2.266 ± 0.619
AlaGlu: 2.266 ± 0.619
AlaPhe: 3.776 ± 0.201
AlaGly: 4.909 ± 1.05
AlaHis: 2.266 ± 1.243
AlaIle: 2.644 ± 0.421
AlaLys: 4.154 ± 1.032
AlaLeu: 4.909 ± 0.822
AlaMet: 0.755 ± 0.209
AlaAsn: 5.665 ± 2.507
AlaPro: 5.665 ± 1.259
AlaGln: 1.888 ± 0.836
AlaArg: 2.644 ± 1.669
AlaSer: 5.665 ± 0.011
AlaThr: 5.665 ± 1.883
AlaVal: 4.154 ± 0.216
AlaTrp: 1.511 ± 0.419
AlaTyr: 1.511 ± 0.419
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.378 ± 0.207
CysCys: 0.0 ± 0.0
CysAsp: 0.378 ± 0.207
CysGlu: 1.133 ± 0.622
CysPhe: 1.133 ± 0.002
CysGly: 0.755 ± 0.414
CysHis: 1.511 ± 0.829
CysIle: 1.133 ± 0.002
CysLys: 1.511 ± 0.829
CysLeu: 0.755 ± 0.414
CysMet: 0.755 ± 0.414
CysAsn: 1.511 ± 0.829
CysPro: 0.378 ± 0.207
CysGln: 0.755 ± 0.209
CysArg: 1.133 ± 0.622
CysSer: 1.133 ± 0.002
CysThr: 1.133 ± 0.002
CysVal: 1.888 ± 1.036
CysTrp: 0.0 ± 0.0
CysTyr: 0.755 ± 0.414
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.021 ± 0.41
AspCys: 1.511 ± 0.829
AspAsp: 3.021 ± 0.838
AspGlu: 2.266 ± 0.619
AspPhe: 6.042 ± 0.82
AspGly: 3.399 ± 1.878
AspHis: 1.133 ± 0.002
AspIle: 4.532 ± 1.881
AspLys: 3.776 ± 1.448
AspLeu: 5.665 ± 2.484
AspMet: 1.133 ± 0.622
AspAsn: 1.511 ± 0.205
AspPro: 3.399 ± 0.631
AspGln: 1.133 ± 0.002
AspArg: 1.888 ± 0.212
AspSer: 3.776 ± 0.423
AspThr: 4.154 ± 1.464
AspVal: 4.532 ± 2.505
AspTrp: 1.133 ± 0.622
AspTyr: 4.154 ± 1.656
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.266 ± 0.619
GluCys: 1.133 ± 0.622
GluAsp: 1.511 ± 0.829
GluGlu: 2.644 ± 0.203
GluPhe: 3.021 ± 0.41
GluGly: 2.266 ± 1.243
GluHis: 2.266 ± 0.619
GluIle: 5.287 ± 0.218
GluLys: 3.021 ± 0.214
GluLeu: 3.021 ± 0.41
GluMet: 2.644 ± 0.421
GluAsn: 1.511 ± 0.205
GluPro: 2.644 ± 0.421
GluGln: 1.888 ± 0.212
GluArg: 3.021 ± 1.658
GluSer: 4.154 ± 0.408
GluThr: 1.511 ± 0.419
GluVal: 3.776 ± 0.824
GluTrp: 1.511 ± 0.829
GluTyr: 1.511 ± 0.419
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.909 ± 1.673
PheCys: 1.511 ± 0.829
PheAsp: 6.042 ± 3.316
PheGlu: 3.776 ± 1.448
PhePhe: 0.755 ± 0.414
PheGly: 2.266 ± 1.243
PheHis: 1.133 ± 0.622
PheIle: 2.266 ± 1.243
PheLys: 3.021 ± 0.838
PheLeu: 6.042 ± 0.196
PheMet: 1.511 ± 0.419
PheAsn: 3.021 ± 1.462
PhePro: 3.399 ± 0.007
PheGln: 1.133 ± 0.002
PheArg: 1.133 ± 0.002
PheSer: 4.909 ± 0.426
PheThr: 1.511 ± 1.043
PheVal: 1.888 ± 0.212
PheTrp: 0.378 ± 0.207
PheTyr: 2.644 ± 1.451
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.021 ± 0.838
GlyCys: 1.888 ± 1.036
GlyAsp: 4.154 ± 2.712
GlyGlu: 3.399 ± 0.631
GlyPhe: 4.532 ± 0.009
GlyGly: 5.287 ± 0.406
GlyHis: 1.133 ± 0.622
GlyIle: 4.154 ± 1.656
GlyLys: 3.399 ± 0.617
GlyLeu: 4.909 ± 0.426
GlyMet: 3.399 ± 0.617
GlyAsn: 1.133 ± 0.626
GlyPro: 1.511 ± 0.205
GlyGln: 1.511 ± 0.205
GlyArg: 0.755 ± 0.414
GlySer: 3.021 ± 0.214
GlyThr: 2.644 ± 0.203
GlyVal: 4.532 ± 0.615
GlyTrp: 1.888 ± 0.836
GlyTyr: 2.644 ± 2.293
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.133 ± 0.622
HisCys: 0.378 ± 0.207
HisAsp: 1.888 ± 0.412
HisGlu: 1.133 ± 0.002
HisPhe: 0.755 ± 0.414
HisGly: 3.399 ± 1.865
HisHis: 1.511 ± 0.419
HisIle: 3.021 ± 0.41
HisLys: 1.888 ± 0.212
HisLeu: 2.266 ± 1.252
HisMet: 1.511 ± 0.419
HisAsn: 0.755 ± 0.209
HisPro: 0.378 ± 0.207
HisGln: 0.755 ± 0.209
HisArg: 0.0 ± 0.0
HisSer: 2.266 ± 0.628
HisThr: 3.021 ± 0.41
HisVal: 1.511 ± 0.205
HisTrp: 0.0 ± 0.0
HisTyr: 1.511 ± 0.205
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.532 ± 0.615
IleCys: 0.755 ± 0.414
IleAsp: 4.532 ± 0.009
IleGlu: 1.888 ± 0.412
IlePhe: 2.266 ± 0.619
IleGly: 3.776 ± 1.448
IleHis: 1.511 ± 0.419
IleIle: 5.287 ± 1.029
IleLys: 4.154 ± 1.464
IleLeu: 4.909 ± 0.822
IleMet: 2.644 ± 1.045
IleAsn: 3.399 ± 0.631
IlePro: 4.532 ± 0.009
IleGln: 2.266 ± 0.619
IleArg: 3.021 ± 0.214
IleSer: 4.909 ± 0.426
IleThr: 5.665 ± 0.011
IleVal: 3.399 ± 1.878
IleTrp: 0.378 ± 0.207
IleTyr: 2.266 ± 0.619
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.909 ± 0.822
LysCys: 0.0 ± 0.0
LysAsp: 3.776 ± 1.448
LysGlu: 2.644 ± 0.203
LysPhe: 4.909 ± 0.426
LysGly: 2.266 ± 0.004
LysHis: 0.755 ± 0.414
LysIle: 3.021 ± 0.41
LysLys: 3.776 ± 1.448
LysLeu: 3.021 ± 1.034
LysMet: 1.133 ± 0.002
LysAsn: 3.399 ± 1.865
LysPro: 3.399 ± 0.631
LysGln: 3.399 ± 0.617
LysArg: 3.399 ± 1.241
LysSer: 4.154 ± 1.656
LysThr: 2.644 ± 0.827
LysVal: 3.021 ± 1.034
LysTrp: 0.755 ± 0.414
LysTyr: 3.776 ± 0.201
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.798 ± 0.637
LeuCys: 1.511 ± 0.205
LeuAsp: 4.532 ± 1.257
LeuGlu: 5.287 ± 0.218
LeuPhe: 2.644 ± 0.827
LeuGly: 6.42 ± 1.469
LeuHis: 3.021 ± 1.462
LeuIle: 3.776 ± 1.448
LeuLys: 3.399 ± 1.241
LeuLeu: 6.42 ± 0.221
LeuMet: 1.511 ± 0.829
LeuAsn: 3.021 ± 0.214
LeuPro: 2.644 ± 1.045
LeuGln: 1.888 ± 0.212
LeuArg: 2.644 ± 0.203
LeuSer: 10.196 ± 0.644
LeuThr: 6.42 ± 0.221
LeuVal: 3.021 ± 1.658
LeuTrp: 1.133 ± 0.626
LeuTyr: 2.266 ± 0.004
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.021 ± 0.214
MetCys: 0.755 ± 0.414
MetAsp: 1.888 ± 0.412
MetGlu: 1.888 ± 0.412
MetPhe: 1.511 ± 0.419
MetGly: 1.133 ± 0.002
MetHis: 0.378 ± 0.417
MetIle: 3.776 ± 0.423
MetLys: 1.511 ± 0.205
MetLeu: 1.511 ± 0.419
MetMet: 0.378 ± 0.207
MetAsn: 1.511 ± 0.205
MetPro: 0.755 ± 0.209
MetGln: 1.888 ± 0.412
MetArg: 1.133 ± 0.622
MetSer: 2.644 ± 0.827
MetThr: 0.755 ± 0.833
MetVal: 1.511 ± 0.205
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.287 ± 0.406
AsnCys: 0.755 ± 0.414
AsnAsp: 0.378 ± 0.207
AsnGlu: 1.511 ± 0.205
AsnPhe: 1.511 ± 0.205
AsnGly: 5.287 ± 1.466
AsnHis: 0.755 ± 0.209
AsnIle: 3.399 ± 2.502
AsnLys: 1.511 ± 0.829
AsnLeu: 1.888 ± 0.212
AsnMet: 0.755 ± 0.414
AsnAsn: 1.511 ± 1.043
AsnPro: 3.776 ± 0.201
AsnGln: 1.133 ± 0.002
AsnArg: 1.511 ± 0.205
AsnSer: 2.266 ± 0.619
AsnThr: 2.644 ± 0.421
AsnVal: 3.776 ± 1.671
AsnTrp: 0.378 ± 0.207
AsnTyr: 1.511 ± 0.419
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.266 ± 0.628
ProCys: 1.511 ± 0.205
ProAsp: 2.266 ± 0.004
ProGlu: 3.021 ± 0.41
ProPhe: 3.399 ± 0.631
ProGly: 1.888 ± 0.212
ProHis: 2.266 ± 0.004
ProIle: 3.776 ± 0.423
ProLys: 2.266 ± 1.243
ProLeu: 3.776 ± 0.423
ProMet: 1.511 ± 0.205
ProAsn: 1.511 ± 0.419
ProPro: 2.644 ± 1.451
ProGln: 4.154 ± 1.464
ProArg: 2.644 ± 1.669
ProSer: 1.888 ± 0.212
ProThr: 3.399 ± 1.255
ProVal: 4.154 ± 2.088
ProTrp: 1.133 ± 0.626
ProTyr: 1.511 ± 0.205
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.888 ± 0.412
GlnCys: 0.378 ± 0.207
GlnAsp: 2.266 ± 1.252
GlnGlu: 1.888 ± 0.212
GlnPhe: 0.755 ± 0.414
GlnGly: 1.133 ± 0.626
GlnHis: 0.755 ± 0.209
GlnIle: 1.511 ± 1.043
GlnLys: 3.021 ± 1.034
GlnLeu: 4.532 ± 1.239
GlnMet: 1.133 ± 0.622
GlnAsn: 0.0 ± 0.0
GlnPro: 3.021 ± 0.838
GlnGln: 0.378 ± 0.207
GlnArg: 1.888 ± 0.212
GlnSer: 2.266 ± 1.252
GlnThr: 2.266 ± 0.628
GlnVal: 1.511 ± 0.829
GlnTrp: 0.378 ± 0.207
GlnTyr: 1.133 ± 0.002
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.266 ± 0.004
ArgCys: 0.378 ± 0.207
ArgAsp: 3.021 ± 0.41
ArgGlu: 3.021 ± 0.41
ArgPhe: 1.888 ± 0.412
ArgGly: 2.266 ± 1.252
ArgHis: 0.755 ± 0.414
ArgIle: 2.266 ± 0.619
ArgLys: 3.021 ± 1.658
ArgLeu: 3.021 ± 0.41
ArgMet: 0.378 ± 0.417
ArgAsn: 1.133 ± 0.622
ArgPro: 3.399 ± 0.631
ArgGln: 1.133 ± 0.626
ArgArg: 1.888 ± 0.412
ArgSer: 3.776 ± 1.671
ArgThr: 3.021 ± 0.41
ArgVal: 3.776 ± 0.201
ArgTrp: 0.755 ± 0.209
ArgTyr: 1.888 ± 0.836
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.776 ± 2.295
SerCys: 1.133 ± 0.622
SerAsp: 5.287 ± 0.218
SerGlu: 5.665 ± 0.613
SerPhe: 2.266 ± 1.243
SerGly: 5.665 ± 0.635
SerHis: 1.888 ± 0.412
SerIle: 7.931 ± 1.232
SerLys: 3.776 ± 0.824
SerLeu: 6.798 ± 2.509
SerMet: 1.133 ± 0.002
SerAsn: 2.644 ± 0.421
SerPro: 3.399 ± 0.631
SerGln: 2.644 ± 0.203
SerArg: 2.644 ± 1.045
SerSer: 3.776 ± 1.448
SerThr: 5.665 ± 1.259
SerVal: 5.287 ± 1.029
SerTrp: 0.0 ± 0.0
SerTyr: 3.399 ± 0.617
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.154 ± 0.84
ThrCys: 0.755 ± 0.414
ThrAsp: 4.909 ± 1.673
ThrGlu: 1.511 ± 0.419
ThrPhe: 4.909 ± 0.426
ThrGly: 3.776 ± 0.423
ThrHis: 3.021 ± 0.41
ThrIle: 3.776 ± 1.671
ThrLys: 4.154 ± 0.408
ThrLeu: 4.909 ± 2.921
ThrMet: 1.511 ± 0.349
ThrAsn: 3.021 ± 0.838
ThrPro: 3.776 ± 1.671
ThrGln: 1.888 ± 0.212
ThrArg: 3.399 ± 1.255
ThrSer: 3.399 ± 1.878
ThrThr: 8.308 ± 0.432
ThrVal: 4.532 ± 0.009
ThrTrp: 1.511 ± 0.205
ThrTyr: 3.776 ± 1.448
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.154 ± 1.464
ValCys: 1.888 ± 0.836
ValAsp: 4.909 ± 0.822
ValGlu: 3.399 ± 1.241
ValPhe: 4.532 ± 0.615
ValGly: 1.511 ± 0.205
ValHis: 1.133 ± 0.626
ValIle: 2.266 ± 0.004
ValLys: 3.776 ± 1.448
ValLeu: 4.909 ± 1.05
ValMet: 2.644 ± 0.439
ValAsn: 3.399 ± 0.007
ValPro: 1.888 ± 0.836
ValGln: 0.755 ± 0.414
ValArg: 3.399 ± 0.007
ValSer: 6.42 ± 0.221
ValThr: 4.532 ± 0.009
ValVal: 2.644 ± 0.421
ValTrp: 0.755 ± 0.209
ValTyr: 1.888 ± 0.836
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.378 ± 0.417
TrpCys: 0.0 ± 0.0
TrpAsp: 1.888 ± 0.212
TrpGlu: 1.133 ± 0.622
TrpPhe: 0.755 ± 0.209
TrpGly: 0.378 ± 0.207
TrpHis: 0.755 ± 0.209
TrpIle: 0.378 ± 0.207
TrpLys: 0.378 ± 0.207
TrpLeu: 0.755 ± 0.414
TrpMet: 0.0 ± 0.0
TrpAsn: 0.378 ± 0.207
TrpPro: 0.0 ± 0.0
TrpGln: 0.378 ± 0.417
TrpArg: 2.644 ± 0.827
TrpSer: 0.378 ± 0.207
TrpThr: 1.888 ± 1.46
TrpVal: 0.378 ± 0.417
TrpTrp: 0.378 ± 0.417
TrpTyr: 1.133 ± 0.622
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 5.665 ± 0.635
TyrCys: 0.378 ± 0.207
TyrAsp: 2.266 ± 0.628
TyrGlu: 1.133 ± 0.002
TyrPhe: 1.888 ± 0.412
TyrGly: 1.133 ± 0.002
TyrHis: 1.133 ± 0.002
TyrIle: 1.888 ± 1.036
TyrLys: 2.644 ± 0.827
TyrLeu: 4.532 ± 0.633
TyrMet: 1.133 ± 0.002
TyrAsn: 1.511 ± 0.205
TyrPro: 0.378 ± 0.207
TyrGln: 1.133 ± 0.622
TyrArg: 2.266 ± 0.619
TyrSer: 3.776 ± 2.072
TyrThr: 4.532 ± 2.505
TyrVal: 1.511 ± 0.205
TyrTrp: 0.378 ± 0.207
TyrTyr: 1.888 ± 0.412
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2649 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
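One way to read "bootstrapping at the protein level" is sketched below in Python: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. This is an assumption about the procedure, not the documented Proteome-pI method; the function names, the use of the population standard deviation, the fixed seed, and the toy sequences are all hypothetical choices for this example.

```python
import random
import statistics
from collections import Counter


def dipeptide_freq(proteins, dipeptide):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs within each protein.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, dipeptide))
    return statistics.pstdev(estimates)


# Hypothetical toy proteome with two very different proteins.
toy = ["AAAA", "CCCC"]
print(dipeptide_freq(toy, "AA"), "±", bootstrap_error(toy, "AA"))
```

With only two proteins, as in this proteome, each resample can take just a few distinct compositions, so error estimates obtained this way are coarse.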

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski