Amino acid dipeptide frequency for Sclerotium hydrophilum virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
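The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `dipeptide_permille`, the use of overlapping windows, the pooling of all proteins before normalisation, and the mapping of any non-standard letter to `X` are choices made for the example, not a description of how Proteome-pI computes its table. The sequences are toy strings, not the real proteome.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues (one-letter codes)


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins.

    Assumptions of this sketch: overlapping windows of length 2, pairs never
    span two proteins, and any non-standard letter is counted as 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            pair = "".join(c if c in AMINO_ACIDS else "X" for c in seq[i:i + 2])
            counts[pair] += 1
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Size of the dipeptide space without and with Xaa.
n_standard = len(list(product(AMINO_ACIDS, repeat=2)))        # 400
n_with_xaa = len(list(product(AMINO_ACIDS + "X", repeat=2)))  # 441

# Frequency of each dipeptide if all 400 were equally likely, in per mille.
uniform_permille = 1000.0 / n_standard                        # 2.5

# Toy input (hypothetical sequences): 'B' is not a standard residue.
freqs = dipeptide_permille(["MAAGK", "AAB"])
```

With the toy input, the six dipeptides are MA, AA, AG, GK, AA and AX, so `freqs["AA"]` is one third of the total expressed in per mille.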

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
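As a small worked example, the unit conversion and the comparison against the random expectation might look as follows. The helper names `permille_to_fraction` and `classify` are hypothetical, and using the uniform expectation of 2.5‰ as the threshold for "over" and "under" is an assumption drawn from the description on this page, not a confirmed rule of the database. The three input values are taken from the table (AlaAla, GluAla, CysAla).

```python
UNIFORM_PERMILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %


def permille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=UNIFORM_PERMILLE):
    """Label a per-mille frequency relative to the expected value.

    Assumption: the threshold is the uniform expectation over 400 dipeptides.
    """
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"


ala_ala = permille_to_fraction(6.146)  # AlaAla from the table, as a fraction
glu_ala = classify(12.291)             # GluAla, the largest value in the table
cys_ala = classify(0.878)              # CysAla
```

Under this assumption GluAla would be labelled as overrepresented and CysAla as underrepresented.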

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second; each cell is the frequency in ‰ ± the estimated error.

| 1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 6.146 ± 2.12 | 1.756 ± 1.285 | 4.39 ± 0.966 | 7.902 ± 1.153 | 3.512 ± 1.53 | 8.78 ± 3.084 | 2.634 ± 1.128 | 2.634 ± 0.623 | 5.268 ± 1.247 | 7.024 ± 2.825 | 2.634 ± 2.243 | 1.756 ± 1.285 | 8.78 ± 5.088 | 4.39 ± 0.966 | 9.658 ± 2.885 | 4.39 ± 0.966 | 4.39 ± 1.972 | 1.756 ± 1.028 | 1.756 ± 1.285 | 3.512 ± 2.056 | 0.0 ± 0.0 |
| Cys | 0.878 ± 0.514 | 0.0 ± 0.0 | 1.756 ± 1.028 | 0.878 ± 0.514 | 2.634 ± 1.316 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.878 ± 0.514 | 2.634 ± 1.542 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.878 ± 0.514 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.878 ± 0.514 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Asp | 4.39 ± 2.57 | 0.0 ± 0.0 | 7.024 ± 2.094 | 2.634 ± 0.623 | 2.634 ± 1.128 | 7.024 ± 0.778 | 1.756 ± 1.028 | 1.756 ± 1.496 | 0.878 ± 0.748 | 6.146 ± 2.709 | 1.756 ± 2.445 | 4.39 ± 1.551 | 2.634 ± 1.542 | 0.0 ± 0.0 | 5.268 ± 1.374 | 1.756 ± 0.458 | 0.878 ± 0.748 | 6.146 ± 1.945 | 0.878 ± 0.748 | 1.756 ± 0.458 | 0.0 ± 0.0 |
| Glu | 12.291 ± 2.525 | 0.878 ± 0.514 | 0.878 ± 0.514 | 7.024 ± 5.141 | 0.0 ± 0.0 | 9.658 ± 4.121 | 0.878 ± 1.449 | 0.878 ± 0.748 | 3.512 ± 2.056 | 2.634 ± 0.623 | 3.512 ± 1.423 | 0.878 ± 0.514 | 2.634 ± 2.691 | 4.39 ± 0.86 | 7.902 ± 2.436 | 0.878 ± 0.514 | 2.634 ± 1.542 | 5.268 ± 1.374 | 1.756 ± 0.458 | 3.512 ± 1.047 | 0.0 ± 0.0 |
| Phe | 3.512 ± 1.047 | 1.756 ± 1.028 | 3.512 ± 1.047 | 5.268 ± 1.247 | 1.756 ± 1.285 | 7.024 ± 0.435 | 1.756 ± 0.458 | 0.878 ± 0.514 | 1.756 ± 0.458 | 3.512 ± 1.53 | 0.0 ± 0.0 | 0.878 ± 0.748 | 0.878 ± 0.514 | 0.878 ± 0.748 | 2.634 ± 1.542 | 0.878 ± 0.514 | 1.756 ± 0.458 | 1.756 ± 1.028 | 0.0 ± 0.0 | 0.878 ± 0.514 | 0.0 ± 0.0 |
| Gly | 8.78 ± 5.199 | 0.0 ± 0.0 | 4.39 ± 2.303 | 1.756 ± 1.285 | 4.39 ± 2.599 | 9.658 ± 3.43 | 2.634 ± 0.623 | 6.146 ± 2.709 | 6.146 ± 1.377 | 3.512 ± 1.047 | 0.0 ± 0.0 | 7.024 ± 0.435 | 2.634 ± 1.128 | 1.756 ± 0.458 | 5.268 ± 2.141 | 8.78 ± 2.289 | 2.634 ± 1.131 | 7.902 ± 1.254 | 0.878 ± 1.449 | 2.634 ± 1.542 | 0.0 ± 0.0 |
| His | 1.756 ± 1.53 | 1.756 ± 1.028 | 1.756 ± 1.496 | 0.878 ± 0.514 | 0.0 ± 0.0 | 0.878 ± 0.514 | 1.756 ± 1.028 | 1.756 ± 0.458 | 0.878 ± 0.514 | 0.878 ± 0.514 | 1.756 ± 0.469 | 2.634 ± 0.623 | 0.878 ± 0.748 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.878 ± 0.514 | 0.0 ± 0.0 | 0.878 ± 0.514 | 0.0 ± 0.0 | 0.878 ± 0.514 | 0.0 ± 0.0 |
| Ile | 2.634 ± 1.128 | 0.878 ± 0.514 | 2.634 ± 1.542 | 0.878 ± 0.514 | 1.756 ± 1.028 | 1.756 ± 1.028 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 7.024 ± 1.727 | 0.0 ± 0.0 | 3.512 ± 0.916 | 3.512 ± 1.859 | 3.512 ± 1.859 | 3.512 ± 0.863 | 1.756 ± 1.285 | 2.634 ± 0.623 | 3.512 ± 1.53 | 0.0 ± 0.0 | 0.878 ± 0.514 | 0.0 ± 0.0 |
| Lys | 1.756 ± 1.028 | 0.0 ± 0.0 | 2.634 ± 1.924 | 4.39 ± 0.966 | 4.39 ± 2.57 | 4.39 ± 0.966 | 0.878 ± 0.514 | 4.39 ± 0.966 | 1.756 ± 0.458 | 4.39 ± 1.527 | 0.878 ± 0.514 | 0.0 ± 0.0 | 0.878 ± 0.514 | 0.878 ± 0.514 | 3.512 ± 0.916 | 5.268 ± 1.247 | 0.878 ± 0.514 | 3.512 ± 1.047 | 0.878 ± 0.748 | 1.756 ± 1.028 | 0.0 ± 0.0 |
| Leu | 5.268 ± 1.374 | 0.878 ± 0.514 | 4.39 ± 1.865 | 4.39 ± 0.86 | 1.756 ± 1.028 | 6.146 ± 1.645 | 0.0 ± 0.0 | 6.146 ± 1.522 | 1.756 ± 1.028 | 1.756 ± 1.285 | 3.512 ± 2.056 | 2.634 ± 1.542 | 7.902 ± 1.153 | 1.756 ± 1.028 | 4.39 ± 2.55 | 6.146 ± 2.12 | 6.146 ± 1.522 | 7.024 ± 1.727 | 0.878 ± 0.748 | 2.634 ± 1.131 | 0.0 ± 0.0 |
| Met | 6.146 ± 1.99 | 0.0 ± 0.0 | 2.634 ± 1.542 | 2.634 ± 1.128 | 0.0 ± 0.0 | 0.878 ± 1.449 | 1.756 ± 1.028 | 0.878 ± 0.514 | 0.0 ± 0.0 | 2.634 ± 1.316 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.878 ± 0.748 | 1.756 ± 1.285 | 2.634 ± 0.623 | 2.634 ± 1.131 | 1.756 ± 1.028 | 3.512 ± 0.863 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Asn | 0.878 ± 0.748 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.756 ± 1.028 | 0.878 ± 0.514 | 2.634 ± 1.131 | 0.0 ± 0.0 | 1.756 ± 0.458 | 7.024 ± 1.559 | 3.512 ± 1.53 | 1.756 ± 1.028 | 0.0 ± 0.0 | 4.39 ± 0.936 | 0.0 ± 0.0 | 1.756 ± 1.496 | 0.878 ± 0.514 | 1.756 ± 1.028 | 0.878 ± 0.748 | 0.878 ± 0.514 | 0.878 ± 0.514 | 0.0 ± 0.0 |
| Pro | 7.902 ± 4.457 | 0.878 ± 0.514 | 2.634 ± 0.623 | 4.39 ± 2.303 | 4.39 ± 0.966 | 4.39 ± 2.599 | 1.756 ± 1.028 | 2.634 ± 1.542 | 1.756 ± 1.028 | 5.268 ± 2.262 | 1.756 ± 0.458 | 1.756 ± 1.028 | 9.658 ± 5.746 | 4.39 ± 5.735 | 2.634 ± 0.623 | 2.634 ± 1.924 | 4.39 ± 2.599 | 3.512 ± 2.991 | 0.0 ± 0.0 | 2.634 ± 0.623 | 0.0 ± 0.0 |
| Gln | 6.146 ± 0.409 | 0.0 ± 0.0 | 1.756 ± 1.53 | 2.634 ± 1.131 | 1.756 ± 0.458 | 0.878 ± 0.748 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.756 ± 1.028 | 2.634 ± 1.131 | 1.756 ± 1.028 | 0.0 ± 0.0 | 1.756 ± 1.53 | 0.878 ± 0.748 | 4.39 ± 1.527 | 1.756 ± 0.458 | 0.878 ± 0.748 | 1.756 ± 1.53 | 0.878 ± 1.449 | 0.878 ± 0.748 | 0.0 ± 0.0 |
| Arg | 7.024 ± 0.778 | 0.0 ± 0.0 | 3.512 ± 0.916 | 7.024 ± 2.094 | 0.878 ± 0.748 | 5.268 ± 0.508 | 1.756 ± 0.458 | 1.756 ± 1.496 | 7.024 ± 0.778 | 6.146 ± 0.409 | 2.634 ± 2.885 | 0.878 ± 1.449 | 7.024 ± 2.674 | 1.756 ± 1.496 | 7.902 ± 3.393 | 2.634 ± 1.131 | 2.634 ± 0.623 | 7.902 ± 2.451 | 1.756 ± 0.458 | 1.756 ± 1.028 | 0.0 ± 0.0 |
| Ser | 4.39 ± 0.86 | 0.878 ± 0.514 | 0.878 ± 0.748 | 3.512 ± 0.916 | 1.756 ± 0.458 | 4.39 ± 1.551 | 1.756 ± 0.458 | 1.756 ± 0.458 | 2.634 ± 0.623 | 6.146 ± 1.645 | 1.756 ± 1.028 | 1.756 ± 1.028 | 7.024 ± 2.025 | 2.634 ± 1.131 | 3.512 ± 2.559 | 3.512 ± 1.047 | 3.512 ± 1.859 | 0.0 ± 0.0 | 0.878 ± 0.514 | 0.878 ± 0.514 | 0.0 ± 0.0 |
| Thr | 2.634 ± 2.243 | 0.878 ± 0.514 | 4.39 ± 0.936 | 6.146 ± 1.522 | 2.634 ± 1.542 | 5.268 ± 2.257 | 0.878 ± 0.748 | 1.756 ± 1.028 | 1.756 ± 1.496 | 2.634 ± 0.623 | 2.634 ± 1.128 | 0.0 ± 0.0 | 2.634 ± 0.623 | 0.878 ± 0.514 | 2.634 ± 0.623 | 4.39 ± 0.966 | 2.634 ± 1.128 | 0.0 ± 0.0 | 2.634 ± 2.691 | 1.756 ± 0.458 | 0.0 ± 0.0 |
| Val | 5.268 ± 1.458 | 0.0 ± 0.0 | 5.268 ± 2.257 | 5.268 ± 3.856 | 1.756 ± 1.028 | 7.024 ± 1.727 | 0.0 ± 0.0 | 3.512 ± 1.423 | 2.634 ± 1.542 | 2.634 ± 0.623 | 0.878 ± 0.919 | 2.634 ± 1.542 | 3.512 ± 1.047 | 1.756 ± 1.028 | 6.146 ± 0.409 | 1.756 ± 1.028 | 4.39 ± 0.936 | 6.146 ± 1.945 | 1.756 ± 1.53 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Trp | 3.512 ± 2.559 | 0.0 ± 0.0 | 0.878 ± 0.748 | 0.878 ± 0.748 | 2.634 ± 0.623 | 0.878 ± 0.748 | 0.0 ± 0.0 | 1.756 ± 2.899 | 0.0 ± 0.0 | 1.756 ± 1.028 | 0.0 ± 0.0 | 0.878 ± 0.748 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.756 ± 1.285 | 0.878 ± 1.449 | 0.0 ± 0.0 | 1.756 ± 1.285 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Tyr | 1.756 ± 0.458 | 0.878 ± 0.514 | 4.39 ± 2.57 | 1.756 ± 1.028 | 1.756 ± 1.028 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.634 ± 0.623 | 2.634 ± 1.542 | 0.0 ± 0.0 | 0.878 ± 0.514 | 0.878 ± 0.748 | 1.756 ± 0.458 | 1.756 ± 1.028 | 4.39 ± 0.966 | 0.0 ± 0.0 | 1.756 ± 1.285 | 0.878 ± 1.449 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 3 proteins (1140 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
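A protein-level bootstrap of this kind could be sketched as below. The details are assumptions made for illustration: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread is reported as the population standard deviation of the resampled estimates. The page does not state which spread measure or random seed Proteome-pI uses, and the function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Per-mille frequency of one dipeptide, pooled over the given proteins."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Assumption: the error is the standard deviation across resamples.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)


# Toy sequences (hypothetical), one dipeptide of interest.
toy_proteins = ["AAAA", "CCCC", "ACAC"]
error = bootstrap_error(toy_proteins, "AA")
```

With only a handful of proteins, as in this proteome (3 proteins), each resample can differ strongly from the original set, which is consistent with the relatively large errors next to many values in the table.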

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski