Amino acid dipeptide frequency for Sanxia narna-like virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
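As a minimal sketch of how such a frequency might be computed (not necessarily the procedure used by Proteome-pI), the Python below counts overlapping residue pairs within each sequence and reports them in per mille. The function name `dipeptide_frequencies` and the toy sequences are assumptions made for illustration. Pairs are not counted across protein boundaries here, which may differ from the database's convention.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs.

    Hypothetical helper: sequences are strings of one-letter residue codes.
    """
    counts = Counter()
    for seq in sequences:
        # Overlapping windows of length 2: positions (0,1), (1,2), ...
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (made up, not the viral proteome): "MAAL" gives MA, AA, AL
# and "AAG" gives AA, AG, so AA accounts for 2 of the 5 pairs.
freqs = dipeptide_frequencies(["MAAL", "AAG"])
```

Because the windows overlap, a sequence of length n contributes n - 1 dipeptides, and the frequencies of all observed pairs sum to 1000‰.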

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility the dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
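The uniform expectation mentioned above can be checked with a short Python sketch. The names `expected_per_mille` and `classify` are hypothetical, and the source does not state whether the colouring threshold is based on 400 or 441 dipeptides, so the use of the 400-dipeptide value below is an assumption.

```python
STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes


def expected_per_mille(alphabet_size=len(STANDARD)):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(freq_per_mille, expected):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"    # corresponds to red in the table
    if freq_per_mille < expected:
        return "under"   # corresponds to blue in the table
    return "expected"


exp20 = expected_per_mille()     # 400 possible dipeptides
exp21 = expected_per_mille(21)   # 441 possible dipeptides when Xaa is included

# Two values taken from the table: GlyLeu (10.979) and AlaCys (0.915).
gly_leu = classify(10.979, exp20)
ala_cys = classify(0.915, exp20)
```

With 20 residues the expectation is 1000/400 = 2.5‰, and with Xaa included it drops to 1000/441, roughly 2.27‰.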

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions (for example, 5.489‰ = 0.005489).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.489 ± 3.785
AlaCys: 0.915 ± 0.479
AlaAsp: 6.404 ± 1.641
AlaGlu: 0.915 ± 0.479
AlaPhe: 1.83 ± 0.707
AlaGly: 2.745 ± 1.893
AlaHis: 0.915 ± 0.479
AlaIle: 2.745 ± 1.893
AlaLys: 2.745 ± 0.228
AlaLeu: 7.319 ± 1.162
AlaMet: 2.745 ± 1.437
AlaAsn: 3.66 ± 3.079
AlaPro: 4.575 ± 0.935
AlaGln: 2.745 ± 0.228
AlaArg: 4.575 ± 0.935
AlaSer: 9.149 ± 5.199
AlaThr: 5.489 ± 0.455
AlaVal: 4.575 ± 4.265
AlaTrp: 1.83 ± 0.707
AlaTyr: 0.0 ± 0.0
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.915 ± 1.186
CysCys: 0.0 ± 0.0
CysAsp: 0.915 ± 1.186
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 0.915 ± 1.186
CysHis: 0.0 ± 0.0
CysIle: 2.745 ± 3.558
CysLys: 0.0 ± 0.0
CysLeu: 0.915 ± 0.479
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.915 ± 0.479
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 2.745 ± 0.228
CysThr: 0.0 ± 0.0
CysVal: 0.915 ± 0.479
CysTrp: 0.0 ± 0.0
CysTyr: 0.915 ± 0.479
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.745 ± 0.228
AspCys: 0.915 ± 1.186
AspAsp: 6.404 ± 1.641
AspGlu: 0.915 ± 0.479
AspPhe: 1.83 ± 0.958
AspGly: 3.66 ± 1.414
AspHis: 0.0 ± 0.0
AspIle: 2.745 ± 1.437
AspLys: 2.745 ± 0.228
AspLeu: 5.489 ± 1.21
AspMet: 1.83 ± 0.958
AspAsn: 3.66 ± 0.251
AspPro: 3.66 ± 0.251
AspGln: 0.915 ± 0.479
AspArg: 3.66 ± 3.079
AspSer: 2.745 ± 1.893
AspThr: 3.66 ± 1.414
AspVal: 1.83 ± 0.707
AspTrp: 0.0 ± 0.0
AspTyr: 2.745 ± 1.437
AspXaa: 0.0 ± 0.0
Glu
GluAla: 0.0 ± 0.0
GluCys: 0.0 ± 0.0
GluAsp: 1.83 ± 0.958
GluGlu: 0.0 ± 0.0
GluPhe: 5.489 ± 2.875
GluGly: 0.915 ± 0.479
GluHis: 0.915 ± 0.479
GluIle: 0.915 ± 0.479
GluLys: 3.66 ± 1.916
GluLeu: 1.83 ± 0.958
GluMet: 1.83 ± 0.958
GluAsn: 0.0 ± 0.0
GluPro: 2.745 ± 1.437
GluGln: 1.83 ± 0.958
GluArg: 1.83 ± 0.958
GluSer: 2.745 ± 0.228
GluThr: 3.66 ± 1.916
GluVal: 2.745 ± 0.228
GluTrp: 2.745 ± 0.228
GluTyr: 3.66 ± 1.916
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.915 ± 0.479
PheCys: 0.0 ± 0.0
PheAsp: 0.915 ± 1.186
PheGlu: 0.0 ± 0.0
PhePhe: 1.83 ± 0.707
PheGly: 5.489 ± 1.21
PheHis: 0.0 ± 0.0
PheIle: 0.915 ± 0.479
PheLys: 0.915 ± 0.479
PheLeu: 7.319 ± 3.833
PheMet: 2.745 ± 0.228
PheAsn: 0.915 ± 0.479
PhePro: 6.404 ± 1.689
PheGln: 0.0 ± 0.0
PheArg: 3.66 ± 0.251
PheSer: 4.575 ± 2.395
PheThr: 1.83 ± 0.958
PheVal: 3.66 ± 0.251
PheTrp: 0.915 ± 0.479
PheTyr: 2.745 ± 0.228
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 9.149 ± 3.534
GlyCys: 0.0 ± 0.0
GlyAsp: 8.234 ± 0.982
GlyGlu: 0.915 ± 0.479
GlyPhe: 2.745 ± 0.228
GlyGly: 6.404 ± 0.024
GlyHis: 2.745 ± 0.228
GlyIle: 2.745 ± 1.437
GlyLys: 2.745 ± 0.228
GlyLeu: 10.979 ± 0.911
GlyMet: 3.66 ± 1.414
GlyAsn: 1.83 ± 2.372
GlyPro: 3.66 ± 1.916
GlyGln: 1.83 ± 0.958
GlyArg: 2.745 ± 0.228
GlySer: 1.83 ± 0.958
GlyThr: 3.66 ± 1.414
GlyVal: 1.83 ± 0.958
GlyTrp: 0.915 ± 0.479
GlyTyr: 6.404 ± 4.971
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 0.915 ± 0.479
HisPhe: 0.0 ± 0.0
HisGly: 0.915 ± 0.479
HisHis: 0.0 ± 0.0
HisIle: 2.745 ± 1.437
HisLys: 2.745 ± 0.228
HisLeu: 1.83 ± 0.707
HisMet: 0.0 ± 0.0
HisAsn: 0.915 ± 0.479
HisPro: 0.915 ± 0.479
HisGln: 0.0 ± 0.0
HisArg: 0.915 ± 0.479
HisSer: 0.915 ± 0.479
HisThr: 0.0 ± 0.0
HisVal: 0.0 ± 0.0
HisTrp: 0.0 ± 0.0
HisTyr: 0.915 ± 0.479
HisXaa: 0.0 ± 0.0
Ile
IleAla: 0.0 ± 0.0
IleCys: 0.0 ± 0.0
IleAsp: 1.83 ± 0.958
IleGlu: 1.83 ± 0.958
IlePhe: 4.575 ± 0.73
IleGly: 5.489 ± 0.455
IleHis: 0.915 ± 0.479
IleIle: 0.0 ± 0.0
IleLys: 2.745 ± 0.228
IleLeu: 4.575 ± 0.935
IleMet: 1.83 ± 0.958
IleAsn: 4.575 ± 2.6
IlePro: 5.489 ± 1.21
IleGln: 0.915 ± 0.479
IleArg: 2.745 ± 0.228
IleSer: 0.0 ± 0.0
IleThr: 4.575 ± 4.265
IleVal: 1.83 ± 0.707
IleTrp: 0.915 ± 0.479
IleTyr: 3.66 ± 0.251
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.66 ± 0.251
LysCys: 0.915 ± 0.479
LysAsp: 4.575 ± 0.935
LysGlu: 0.915 ± 0.479
LysPhe: 0.915 ± 0.479
LysGly: 2.745 ± 1.437
LysHis: 1.83 ± 0.958
LysIle: 1.83 ± 0.958
LysLys: 1.83 ± 0.958
LysLeu: 4.575 ± 0.73
LysMet: 0.915 ± 0.479
LysAsn: 3.66 ± 1.916
LysPro: 4.575 ± 4.265
LysGln: 1.83 ± 0.958
LysArg: 4.575 ± 0.73
LysSer: 6.404 ± 1.641
LysThr: 4.575 ± 0.73
LysVal: 5.489 ± 0.455
LysTrp: 1.83 ± 0.707
LysTyr: 1.83 ± 0.707
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.66 ± 1.414
LeuCys: 3.66 ± 0.251
LeuAsp: 5.489 ± 0.455
LeuGlu: 5.489 ± 1.21
LeuPhe: 2.745 ± 1.437
LeuGly: 7.319 ± 1.162
LeuHis: 0.915 ± 0.479
LeuIle: 5.489 ± 2.12
LeuLys: 6.404 ± 1.689
LeuLeu: 7.319 ± 0.503
LeuMet: 2.745 ± 1.893
LeuAsn: 2.745 ± 1.437
LeuPro: 5.489 ± 1.21
LeuGln: 5.489 ± 2.875
LeuArg: 4.575 ± 0.73
LeuSer: 6.404 ± 1.689
LeuThr: 3.66 ± 0.251
LeuVal: 7.319 ± 2.168
LeuTrp: 0.915 ± 0.479
LeuTyr: 0.915 ± 0.479
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.83 ± 0.958
MetCys: 0.0 ± 0.0
MetAsp: 0.915 ± 0.479
MetGlu: 1.83 ± 0.958
MetPhe: 1.83 ± 0.958
MetGly: 1.83 ± 0.707
MetHis: 0.915 ± 0.479
MetIle: 0.0 ± 0.0
MetLys: 1.83 ± 0.707
MetLeu: 0.915 ± 0.479
MetMet: 1.83 ± 0.333
MetAsn: 0.915 ± 0.479
MetPro: 1.83 ± 0.958
MetGln: 1.83 ± 2.372
MetArg: 2.745 ± 1.437
MetSer: 2.745 ± 1.437
MetThr: 2.745 ± 0.228
MetVal: 1.83 ± 0.958
MetTrp: 0.915 ± 0.479
MetTyr: 2.745 ± 0.228
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 6.404 ± 4.971
AsnCys: 0.0 ± 0.0
AsnAsp: 0.915 ± 1.186
AsnGlu: 0.915 ± 0.479
AsnPhe: 0.0 ± 0.0
AsnGly: 0.915 ± 0.479
AsnHis: 0.915 ± 0.479
AsnIle: 2.745 ± 0.228
AsnLys: 0.915 ± 1.186
AsnLeu: 1.83 ± 2.372
AsnMet: 0.915 ± 0.778
AsnAsn: 0.915 ± 1.186
AsnPro: 1.83 ± 0.958
AsnGln: 3.66 ± 1.414
AsnArg: 4.575 ± 0.73
AsnSer: 3.66 ± 1.414
AsnThr: 0.915 ± 1.186
AsnVal: 3.66 ± 1.414
AsnTrp: 2.745 ± 1.437
AsnTyr: 0.915 ± 0.479
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.66 ± 3.079
ProCys: 0.915 ± 1.186
ProAsp: 1.83 ± 0.958
ProGlu: 2.745 ± 1.437
ProPhe: 0.915 ± 0.479
ProGly: 7.319 ± 2.827
ProHis: 0.0 ± 0.0
ProIle: 5.489 ± 2.875
ProLys: 1.83 ± 0.958
ProLeu: 7.319 ± 0.503
ProMet: 0.915 ± 0.479
ProAsn: 3.66 ± 3.079
ProPro: 2.745 ± 1.437
ProGln: 3.66 ± 1.916
ProArg: 4.575 ± 0.73
ProSer: 4.575 ± 0.73
ProThr: 4.575 ± 0.935
ProVal: 2.745 ± 1.437
ProTrp: 2.745 ± 1.437
ProTyr: 0.915 ± 0.479
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.745 ± 0.228
GlnCys: 0.0 ± 0.0
GlnAsp: 0.915 ± 1.186
GlnGlu: 1.83 ± 0.958
GlnPhe: 1.83 ± 0.958
GlnGly: 1.83 ± 0.958
GlnHis: 0.0 ± 0.0
GlnIle: 3.66 ± 0.251
GlnLys: 2.745 ± 0.228
GlnLeu: 3.66 ± 1.916
GlnMet: 1.83 ± 0.958
GlnAsn: 0.915 ± 0.479
GlnPro: 0.915 ± 1.186
GlnGln: 0.915 ± 0.479
GlnArg: 4.575 ± 0.935
GlnSer: 1.83 ± 0.958
GlnThr: 1.83 ± 2.372
GlnVal: 1.83 ± 0.958
GlnTrp: 0.915 ± 0.479
GlnTyr: 0.915 ± 0.479
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.319 ± 1.162
ArgCys: 0.915 ± 1.186
ArgAsp: 1.83 ± 0.958
ArgGlu: 4.575 ± 2.395
ArgPhe: 6.404 ± 0.024
ArgGly: 3.66 ± 0.251
ArgHis: 0.915 ± 1.186
ArgIle: 3.66 ± 1.916
ArgLys: 6.404 ± 1.689
ArgLeu: 5.489 ± 2.875
ArgMet: 2.745 ± 0.228
ArgAsn: 2.745 ± 0.228
ArgPro: 1.83 ± 0.707
ArgGln: 2.745 ± 1.893
ArgArg: 3.66 ± 0.251
ArgSer: 9.149 ± 1.461
ArgThr: 1.83 ± 0.958
ArgVal: 0.915 ± 0.479
ArgTrp: 2.745 ± 1.437
ArgTyr: 1.83 ± 0.707
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 8.234 ± 0.683
SerCys: 0.915 ± 1.186
SerAsp: 1.83 ± 0.707
SerGlu: 3.66 ± 0.251
SerPhe: 5.489 ± 1.21
SerGly: 5.489 ± 2.12
SerHis: 0.915 ± 0.479
SerIle: 3.66 ± 1.414
SerLys: 7.319 ± 0.503
SerLeu: 5.489 ± 1.21
SerMet: 0.915 ± 0.479
SerAsn: 2.745 ± 1.893
SerPro: 5.489 ± 0.455
SerGln: 0.915 ± 1.186
SerArg: 9.149 ± 3.126
SerSer: 2.745 ± 1.437
SerThr: 4.575 ± 2.395
SerVal: 8.234 ± 2.348
SerTrp: 0.0 ± 0.0
SerTyr: 0.915 ± 0.479
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.489 ± 5.451
ThrCys: 0.0 ± 0.0
ThrAsp: 3.66 ± 0.251
ThrGlu: 4.575 ± 0.73
ThrPhe: 2.745 ± 1.437
ThrGly: 9.149 ± 1.461
ThrHis: 0.0 ± 0.0
ThrIle: 2.745 ± 3.558
ThrLys: 3.66 ± 1.414
ThrLeu: 0.915 ± 0.479
ThrMet: 0.0 ± 0.0
ThrAsn: 0.915 ± 1.186
ThrPro: 4.575 ± 0.73
ThrGln: 1.83 ± 0.958
ThrArg: 3.66 ± 1.916
ThrSer: 4.575 ± 0.935
ThrThr: 0.915 ± 1.186
ThrVal: 3.66 ± 3.079
ThrTrp: 0.0 ± 0.0
ThrTyr: 1.83 ± 0.958
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.489 ± 0.455
ValCys: 0.915 ± 1.186
ValAsp: 0.915 ± 0.479
ValGlu: 2.745 ± 1.437
ValPhe: 0.915 ± 0.479
ValGly: 4.575 ± 2.6
ValHis: 0.0 ± 0.0
ValIle: 1.83 ± 0.707
ValLys: 2.745 ± 1.893
ValLeu: 7.319 ± 1.162
ValMet: 2.745 ± 1.437
ValAsn: 0.915 ± 0.479
ValPro: 4.575 ± 0.935
ValGln: 3.66 ± 0.251
ValArg: 6.404 ± 0.024
ValSer: 3.66 ± 0.251
ValThr: 3.66 ± 0.251
ValVal: 2.745 ± 0.228
ValTrp: 1.83 ± 0.958
ValTyr: 0.915 ± 1.186
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 0.915 ± 0.479
TrpGlu: 0.915 ± 0.479
TrpPhe: 1.83 ± 0.958
TrpGly: 0.915 ± 0.479
TrpHis: 1.83 ± 0.958
TrpIle: 0.915 ± 0.479
TrpLys: 4.575 ± 0.73
TrpLeu: 0.915 ± 0.479
TrpMet: 0.915 ± 0.479
TrpAsn: 2.745 ± 1.437
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 1.83 ± 0.958
TrpSer: 1.83 ± 2.372
TrpThr: 0.0 ± 0.0
TrpVal: 1.83 ± 0.958
TrpTrp: 1.83 ± 2.372
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.66 ± 0.251
TyrCys: 1.83 ± 0.707
TyrAsp: 1.83 ± 0.707
TyrGlu: 3.66 ± 1.916
TyrPhe: 1.83 ± 0.707
TyrGly: 2.745 ± 0.228
TyrHis: 0.0 ± 0.0
TyrIle: 0.915 ± 1.186
TyrLys: 0.915 ± 0.479
TyrLeu: 2.745 ± 1.437
TyrMet: 0.0 ± 0.0
TyrAsn: 1.83 ± 2.372
TyrPro: 0.915 ± 0.479
TyrGln: 0.915 ± 0.479
TyrArg: 0.915 ± 0.479
TyrSer: 6.404 ± 1.689
TyrThr: 2.745 ± 1.893
TyrVal: 0.915 ± 0.479
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.83 ± 0.707
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1094 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
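One plausible reading of this note is that whole proteins are resampled with replacement 100 times and the spread of the recomputed frequency is reported. The Python sketch below follows that reading. The function names, the use of the population standard deviation, and the toy sequences are all assumptions, and the exact procedure used by Proteome-pI may differ.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides in one sequence (one-letter codes)."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_sd(sequences, pair, n_rounds=100, seed=0):
    """Spread of a dipeptide's per-mille frequency over protein-level resamples.

    Hypothetical helper: each round draws len(sequences) proteins with
    replacement and recomputes the frequency of `pair` on that sample.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # Counter returns 0 if absent
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)


# Toy input (made up): the spread is zero when every protein is identical.
sd_mixed = bootstrap_sd(["MAAL", "AAG"], "AA")
sd_same = bootstrap_sd(["MAAL", "MAAL"], "AA")
```

With only two proteins in the proteome, each resample can contain just three distinct combinations, so the resulting error estimates are necessarily coarse.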

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski