Amino acid dipepetide frequency for Wenling picorna-like virus 3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.78AlaAla: 6.78 ± 1.51
1.017AlaCys: 1.017 ± 0.564
3.051AlaAsp: 3.051 ± 0.004
2.712AlaGlu: 2.712 ± 0.748
4.068AlaPhe: 4.068 ± 1.121
6.102AlaGly: 6.102 ± 0.571
1.017AlaHis: 1.017 ± 0.564
4.068AlaIle: 4.068 ± 1.121
3.729AlaLys: 3.729 ± 1.873
4.746AlaLeu: 4.746 ± 1.508
3.051AlaMet: 3.051 ± 1.13
3.729AlaAsn: 3.729 ± 1.309
4.407AlaPro: 4.407 ± 1.319
3.051AlaGln: 3.051 ± 1.13
3.729AlaArg: 3.729 ± 0.943
8.475AlaSer: 8.475 ± 2.055
3.729AlaThr: 3.729 ± 1.873
4.407AlaVal: 4.407 ± 0.37
0.0AlaTrp: 0.0 ± 0.0
2.034AlaTyr: 2.034 ± 0.566
0.0AlaXaa: 0.0 ± 0.0
Cys
1.017CysAla: 1.017 ± 0.564
0.339CysCys: 0.339 ± 0.188
0.0CysAsp: 0.0 ± 0.0
0.678CysGlu: 0.678 ± 0.187
2.034CysPhe: 2.034 ± 1.129
0.678CysGly: 0.678 ± 0.376
0.678CysHis: 0.678 ± 0.376
0.339CysIle: 0.339 ± 0.188
1.017CysLys: 1.017 ± 0.564
0.678CysLeu: 0.678 ± 0.187
0.339CysMet: 0.339 ± 0.188
0.339CysAsn: 0.339 ± 0.188
1.356CysPro: 1.356 ± 0.189
0.339CysGln: 0.339 ± 0.375
0.678CysArg: 0.678 ± 0.376
1.017CysSer: 1.017 ± 0.564
1.017CysThr: 1.017 ± 0.001
2.034CysVal: 2.034 ± 0.566
0.339CysTrp: 0.339 ± 0.375
0.678CysTyr: 0.678 ± 0.376
0.0CysXaa: 0.0 ± 0.0
Asp
4.068AspAla: 4.068 ± 1.121
0.678AspCys: 0.678 ± 0.376
5.085AspAsp: 5.085 ± 1.12
4.407AspGlu: 4.407 ± 0.37
3.729AspPhe: 3.729 ± 0.38
2.712AspGly: 2.712 ± 1.311
1.695AspHis: 1.695 ± 0.749
3.729AspIle: 3.729 ± 0.943
1.695AspLys: 1.695 ± 0.378
3.39AspLeu: 3.39 ± 0.371
1.017AspMet: 1.017 ± 0.562
2.373AspAsn: 2.373 ± 0.936
2.373AspPro: 2.373 ± 0.191
1.695AspGln: 1.695 ± 0.941
1.017AspArg: 1.017 ± 0.001
4.407AspSer: 4.407 ± 0.756
3.051AspThr: 3.051 ± 0.559
4.068AspVal: 4.068 ± 0.568
1.017AspTrp: 1.017 ± 0.001
2.712AspTyr: 2.712 ± 0.379
0.0AspXaa: 0.0 ± 0.0
Glu
2.712GluAla: 2.712 ± 1.505
1.017GluCys: 1.017 ± 0.564
1.356GluAsp: 1.356 ± 0.374
1.356GluGlu: 1.356 ± 0.189
1.695GluPhe: 1.695 ± 0.186
3.729GluGly: 3.729 ± 1.309
1.017GluHis: 1.017 ± 0.562
3.051GluIle: 3.051 ± 1.13
1.695GluLys: 1.695 ± 0.186
4.068GluLeu: 4.068 ± 0.005
0.678GluMet: 0.678 ± 0.187
2.373GluAsn: 2.373 ± 0.936
2.034GluPro: 2.034 ± 0.566
1.695GluGln: 1.695 ± 0.186
1.356GluArg: 1.356 ± 0.374
5.424GluSer: 5.424 ± 0.194
4.068GluThr: 4.068 ± 0.568
2.034GluVal: 2.034 ± 1.124
0.0GluTrp: 0.0 ± 0.0
4.746GluTyr: 4.746 ± 0.745
0.0GluXaa: 0.0 ± 0.0
Phe
4.068PheAla: 4.068 ± 0.558
0.678PheCys: 0.678 ± 0.376
3.051PheAsp: 3.051 ± 0.567
2.712PheGlu: 2.712 ± 0.748
2.373PhePhe: 2.373 ± 0.191
1.695PheGly: 1.695 ± 0.749
1.017PheHis: 1.017 ± 0.562
2.373PheIle: 2.373 ± 0.373
2.034PheLys: 2.034 ± 0.566
3.729PheLeu: 3.729 ± 1.506
1.695PheMet: 1.695 ± 0.186
2.712PheAsn: 2.712 ± 0.184
2.034PhePro: 2.034 ± 0.002
2.034PheGln: 2.034 ± 0.561
1.695PheArg: 1.695 ± 0.941
5.763PheSer: 5.763 ± 0.382
3.729PheThr: 3.729 ± 0.746
1.695PheVal: 1.695 ± 0.941
0.0PheTrp: 0.0 ± 0.0
1.017PheTyr: 1.017 ± 0.562
0.0PheXaa: 0.0 ± 0.0
Gly
2.373GlyAla: 2.373 ± 0.373
0.678GlyCys: 0.678 ± 0.187
4.407GlyAsp: 4.407 ± 0.193
2.373GlyGlu: 2.373 ± 1.499
3.39GlyPhe: 3.39 ± 0.192
2.712GlyGly: 2.712 ± 1.311
1.695GlyHis: 1.695 ± 0.378
3.051GlyIle: 3.051 ± 1.123
1.356GlyLys: 1.356 ± 0.753
3.729GlyLeu: 3.729 ± 1.506
1.356GlyMet: 1.356 ± 0.189
1.695GlyAsn: 1.695 ± 0.941
1.356GlyPro: 1.356 ± 0.374
1.017GlyGln: 1.017 ± 1.125
3.051GlyArg: 3.051 ± 0.004
3.051GlySer: 3.051 ± 1.123
3.729GlyThr: 3.729 ± 1.309
4.746GlyVal: 4.746 ± 0.745
0.678GlyTrp: 0.678 ± 0.376
2.034GlyTyr: 2.034 ± 1.124
0.0GlyXaa: 0.0 ± 0.0
His
0.678HisAla: 0.678 ± 0.187
0.0HisCys: 0.0 ± 0.0
0.339HisAsp: 0.339 ± 0.188
1.356HisGlu: 1.356 ± 0.937
1.695HisPhe: 1.695 ± 0.378
1.695HisGly: 1.695 ± 0.186
0.678HisHis: 0.678 ± 0.376
1.017HisIle: 1.017 ± 0.001
0.678HisLys: 0.678 ± 0.376
2.034HisLeu: 2.034 ± 0.566
1.356HisMet: 1.356 ± 0.189
1.695HisAsn: 1.695 ± 0.378
1.356HisPro: 1.356 ± 0.189
3.39HisGln: 3.39 ± 0.192
0.678HisArg: 0.678 ± 0.376
4.068HisSer: 4.068 ± 0.568
1.017HisThr: 1.017 ± 0.001
2.712HisVal: 2.712 ± 0.942
0.339HisTrp: 0.339 ± 0.375
0.339HisTyr: 0.339 ± 0.188
0.0HisXaa: 0.0 ± 0.0
Ile
7.119IleAla: 7.119 ± 3.37
0.339IleCys: 0.339 ± 0.375
2.712IleAsp: 2.712 ± 0.748
3.39IleGlu: 3.39 ± 0.755
1.695IlePhe: 1.695 ± 0.378
1.356IleGly: 1.356 ± 0.189
1.695IleHis: 1.695 ± 0.941
2.034IleIle: 2.034 ± 0.561
1.695IleLys: 1.695 ± 0.749
7.119IleLeu: 7.119 ± 1.698
0.678IleMet: 0.678 ± 0.187
3.729IleAsn: 3.729 ± 1.309
2.373IlePro: 2.373 ± 0.191
2.373IleGln: 2.373 ± 0.191
3.729IleArg: 3.729 ± 0.183
4.746IleSer: 4.746 ± 1.871
5.085IleThr: 5.085 ± 0.557
4.746IleVal: 4.746 ± 0.944
0.339IleTrp: 0.339 ± 0.188
1.695IleTyr: 1.695 ± 0.378
0.0IleXaa: 0.0 ± 0.0
Lys
3.39LysAla: 3.39 ± 1.881
0.678LysCys: 0.678 ± 0.376
3.39LysAsp: 3.39 ± 1.318
1.695LysGlu: 1.695 ± 0.378
1.356LysPhe: 1.356 ± 0.374
1.356LysGly: 1.356 ± 0.374
0.678LysHis: 0.678 ± 0.376
1.017LysIle: 1.017 ± 0.564
2.373LysLys: 2.373 ± 0.754
4.746LysLeu: 4.746 ± 0.745
0.678LysMet: 0.678 ± 0.376
1.356LysAsn: 1.356 ± 0.189
3.051LysPro: 3.051 ± 0.004
1.017LysGln: 1.017 ± 0.001
1.356LysArg: 1.356 ± 0.189
4.068LysSer: 4.068 ± 1.131
2.373LysThr: 2.373 ± 0.373
3.729LysVal: 3.729 ± 0.38
0.339LysTrp: 0.339 ± 0.375
1.356LysTyr: 1.356 ± 0.189
0.0LysXaa: 0.0 ± 0.0
Leu
5.424LeuAla: 5.424 ± 1.321
2.034LeuCys: 2.034 ± 0.566
3.39LeuAsp: 3.39 ± 0.192
4.407LeuGlu: 4.407 ± 0.193
2.373LeuPhe: 2.373 ± 0.191
3.729LeuGly: 3.729 ± 0.38
3.051LeuHis: 3.051 ± 0.567
5.763LeuIle: 5.763 ± 0.181
4.407LeuLys: 4.407 ± 1.883
9.831LeuLeu: 9.831 ± 0.951
1.017LeuMet: 1.017 ± 0.001
4.746LeuAsn: 4.746 ± 0.745
6.78LeuPro: 6.78 ± 0.384
3.051LeuGln: 3.051 ± 1.13
3.39LeuArg: 3.39 ± 1.318
11.864LeuSer: 11.864 ± 1.863
7.119LeuThr: 7.119 ± 0.572
5.424LeuVal: 5.424 ± 1.495
0.339LeuTrp: 0.339 ± 0.188
3.051LeuTyr: 3.051 ± 0.004
0.0LeuXaa: 0.0 ± 0.0
Met
1.017MetAla: 1.017 ± 0.001
0.0MetCys: 0.0 ± 0.0
1.356MetAsp: 1.356 ± 0.374
0.339MetGlu: 0.339 ± 0.188
1.017MetPhe: 1.017 ± 0.564
1.356MetGly: 1.356 ± 0.374
0.0MetHis: 0.0 ± 0.0
0.678MetIle: 0.678 ± 0.376
1.017MetLys: 1.017 ± 0.001
2.373MetLeu: 2.373 ± 0.191
0.0MetMet: 0.0 ± 0.0
1.356MetAsn: 1.356 ± 0.753
1.017MetPro: 1.017 ± 0.001
1.356MetGln: 1.356 ± 0.753
0.678MetArg: 0.678 ± 0.187
2.712MetSer: 2.712 ± 0.748
1.017MetThr: 1.017 ± 0.564
1.017MetVal: 1.017 ± 0.001
0.678MetTrp: 0.678 ± 0.187
0.339MetTyr: 0.339 ± 0.375
0.0MetXaa: 0.0 ± 0.0
Asn
4.407AsnAla: 4.407 ± 0.756
0.678AsnCys: 0.678 ± 0.187
1.356AsnAsp: 1.356 ± 0.374
3.051AsnGlu: 3.051 ± 1.686
3.39AsnPhe: 3.39 ± 0.192
2.034AsnGly: 2.034 ± 0.561
1.356AsnHis: 1.356 ± 0.189
1.695AsnIle: 1.695 ± 1.312
1.356AsnLys: 1.356 ± 0.937
3.729AsnLeu: 3.729 ± 0.183
2.712AsnMet: 2.712 ± 0.184
2.373AsnAsn: 2.373 ± 1.499
4.068AsnPro: 4.068 ± 0.005
1.695AsnGln: 1.695 ± 0.941
2.373AsnArg: 2.373 ± 0.191
4.407AsnSer: 4.407 ± 1.319
1.017AsnThr: 1.017 ± 0.001
3.051AsnVal: 3.051 ± 0.004
0.339AsnTrp: 0.339 ± 0.375
0.678AsnTyr: 0.678 ± 0.376
0.0AsnXaa: 0.0 ± 0.0
Pro
4.746ProAla: 4.746 ± 0.381
2.034ProCys: 2.034 ± 0.566
3.39ProAsp: 3.39 ± 0.192
2.373ProGlu: 2.373 ± 0.754
3.051ProPhe: 3.051 ± 1.686
2.034ProGly: 2.034 ± 0.566
1.017ProHis: 1.017 ± 0.001
4.407ProIle: 4.407 ± 0.193
2.373ProLys: 2.373 ± 1.317
5.085ProLeu: 5.085 ± 0.569
0.0ProMet: 0.0 ± 0.0
2.034ProAsn: 2.034 ± 1.129
3.39ProPro: 3.39 ± 0.755
1.695ProGln: 1.695 ± 0.378
1.356ProArg: 1.356 ± 0.374
8.475ProSer: 8.475 ± 1.324
4.407ProThr: 4.407 ± 2.06
3.39ProVal: 3.39 ± 0.192
1.017ProTrp: 1.017 ± 0.001
1.695ProTyr: 1.695 ± 0.749
0.0ProXaa: 0.0 ± 0.0
Gln
1.017GlnAla: 1.017 ± 0.001
1.356GlnCys: 1.356 ± 0.753
4.407GlnAsp: 4.407 ± 1.496
2.373GlnGlu: 2.373 ± 0.754
1.017GlnPhe: 1.017 ± 0.564
4.068GlnGly: 4.068 ± 0.558
2.373GlnHis: 2.373 ± 1.317
3.729GlnIle: 3.729 ± 0.746
1.695GlnLys: 1.695 ± 0.941
3.39GlnLeu: 3.39 ± 0.755
0.678GlnMet: 0.678 ± 0.187
0.339GlnAsn: 0.339 ± 0.188
2.712GlnPro: 2.712 ± 1.311
3.051GlnGln: 3.051 ± 0.559
1.017GlnArg: 1.017 ± 0.564
2.712GlnSer: 2.712 ± 0.379
2.373GlnThr: 2.373 ± 0.191
2.034GlnVal: 2.034 ± 0.002
0.339GlnTrp: 0.339 ± 0.188
1.017GlnTyr: 1.017 ± 0.001
0.0GlnXaa: 0.0 ± 0.0
Arg
3.39ArgAla: 3.39 ± 0.371
0.0ArgCys: 0.0 ± 0.0
2.373ArgAsp: 2.373 ± 0.191
2.373ArgGlu: 2.373 ± 1.317
2.034ArgPhe: 2.034 ± 1.124
2.712ArgGly: 2.712 ± 0.184
0.678ArgHis: 0.678 ± 0.376
2.373ArgIle: 2.373 ± 0.373
2.373ArgLys: 2.373 ± 1.317
5.424ArgLeu: 5.424 ± 2.058
0.678ArgMet: 0.678 ± 0.376
1.356ArgAsn: 1.356 ± 0.189
2.034ArgPro: 2.034 ± 0.566
1.695ArgGln: 1.695 ± 0.378
3.39ArgArg: 3.39 ± 0.192
3.729ArgSer: 3.729 ± 0.943
4.068ArgThr: 4.068 ± 1.131
5.085ArgVal: 5.085 ± 0.569
0.339ArgTrp: 0.339 ± 0.188
1.017ArgTyr: 1.017 ± 0.001
0.0ArgXaa: 0.0 ± 0.0
Ser
6.441SerAla: 6.441 ± 1.494
1.017SerCys: 1.017 ± 0.001
5.085SerAsp: 5.085 ± 0.557
3.729SerGlu: 3.729 ± 1.309
5.763SerPhe: 5.763 ± 0.946
4.407SerGly: 4.407 ± 0.37
2.712SerHis: 2.712 ± 0.942
5.424SerIle: 5.424 ± 1.321
3.39SerLys: 3.39 ± 0.371
9.831SerLeu: 9.831 ± 0.387
1.017SerMet: 1.017 ± 0.226
5.424SerAsn: 5.424 ± 0.932
6.78SerPro: 6.78 ± 0.384
4.746SerGln: 4.746 ± 0.381
9.831SerArg: 9.831 ± 0.176
12.203SerSer: 12.203 ± 1.111
7.458SerThr: 7.458 ± 0.366
7.119SerVal: 7.119 ± 0.009
1.695SerTrp: 1.695 ± 0.186
2.712SerTyr: 2.712 ± 0.748
0.0SerXaa: 0.0 ± 0.0
Thr
6.102ThrAla: 6.102 ± 0.571
1.695ThrCys: 1.695 ± 0.378
4.068ThrAsp: 4.068 ± 0.568
2.373ThrGlu: 2.373 ± 0.754
2.034ThrPhe: 2.034 ± 1.687
1.356ThrGly: 1.356 ± 0.189
2.034ThrHis: 2.034 ± 0.561
6.78ThrIle: 6.78 ± 1.869
2.034ThrLys: 2.034 ± 0.566
6.102ThrLeu: 6.102 ± 1.134
1.356ThrMet: 1.356 ± 0.374
2.373ThrAsn: 2.373 ± 0.191
3.39ThrPro: 3.39 ± 0.192
3.051ThrGln: 3.051 ± 0.004
2.712ThrArg: 2.712 ± 0.379
6.102ThrSer: 6.102 ± 2.245
7.458ThrThr: 7.458 ± 0.929
5.424ThrVal: 5.424 ± 2.621
0.339ThrTrp: 0.339 ± 0.188
1.356ThrTyr: 1.356 ± 0.374
0.0ThrXaa: 0.0 ± 0.0
Val
6.441ValAla: 6.441 ± 0.931
1.017ValCys: 1.017 ± 0.564
4.068ValAsp: 4.068 ± 0.568
2.373ValGlu: 2.373 ± 1.317
1.356ValPhe: 1.356 ± 0.189
3.051ValGly: 3.051 ± 1.123
2.712ValHis: 2.712 ± 1.311
4.746ValIle: 4.746 ± 0.182
2.712ValLys: 2.712 ± 0.379
7.458ValLeu: 7.458 ± 0.929
0.339ValMet: 0.339 ± 0.188
2.373ValAsn: 2.373 ± 0.754
5.424ValPro: 5.424 ± 0.369
2.373ValGln: 2.373 ± 2.062
3.051ValArg: 3.051 ± 0.004
9.492ValSer: 9.492 ± 0.364
3.39ValThr: 3.39 ± 0.371
4.407ValVal: 4.407 ± 0.193
0.0ValTrp: 0.0 ± 0.0
3.39ValTyr: 3.39 ± 1.318
0.0ValXaa: 0.0 ± 0.0
Trp
0.339TrpAla: 0.339 ± 0.188
0.0TrpCys: 0.0 ± 0.0
0.339TrpAsp: 0.339 ± 0.188
0.678TrpGlu: 0.678 ± 0.75
0.339TrpPhe: 0.339 ± 0.188
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.339TrpIle: 0.339 ± 0.188
0.0TrpLys: 0.0 ± 0.0
1.356TrpLeu: 1.356 ± 0.374
0.0TrpMet: 0.0 ± 0.147
0.0TrpAsn: 0.0 ± 0.0
1.017TrpPro: 1.017 ± 0.001
1.017TrpGln: 1.017 ± 0.001
0.678TrpArg: 0.678 ± 0.187
0.678TrpSer: 0.678 ± 0.376
0.339TrpThr: 0.339 ± 0.188
0.678TrpVal: 0.678 ± 0.75
0.339TrpTrp: 0.339 ± 0.375
0.339TrpTyr: 0.339 ± 0.188
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.373TyrAla: 2.373 ± 0.936
0.339TyrCys: 0.339 ± 0.188
1.695TyrAsp: 1.695 ± 1.312
1.356TyrGlu: 1.356 ± 0.189
1.695TyrPhe: 1.695 ± 0.941
1.695TyrGly: 1.695 ± 0.378
1.017TyrHis: 1.017 ± 0.564
2.034TyrIle: 2.034 ± 1.124
2.373TyrLys: 2.373 ± 0.191
2.373TyrLeu: 2.373 ± 0.373
0.339TyrMet: 0.339 ± 0.188
3.39TyrAsn: 3.39 ± 0.371
1.017TyrPro: 1.017 ± 0.564
1.356TyrGln: 1.356 ± 0.189
1.356TyrArg: 1.356 ± 0.374
3.39TyrSer: 3.39 ± 0.192
1.695TyrThr: 1.695 ± 0.378
2.373TyrVal: 2.373 ± 0.373
0.339TyrTrp: 0.339 ± 0.188
0.339TyrTyr: 0.339 ± 0.188
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2951 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski