Amino acid dipepetide frequency for Hubei picorna-like virus 20

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.407AlaAla: 7.407 ± 2.445
0.412AlaCys: 0.412 ± 0.42
4.115AlaAsp: 4.115 ± 0.364
0.0AlaGlu: 0.0 ± 0.0
5.761AlaPhe: 5.761 ± 1.791
8.23AlaGly: 8.23 ± 4.563
0.823AlaHis: 0.823 ± 0.438
6.173AlaIle: 6.173 ± 0.093
2.058AlaLys: 2.058 ± 0.182
7.819AlaLeu: 7.819 ± 0.33
1.646AlaMet: 1.646 ± 0.401
3.292AlaAsn: 3.292 ± 1.442
3.704AlaPro: 3.704 ± 0.056
1.646AlaGln: 1.646 ± 0.238
4.115AlaArg: 4.115 ± 1.553
4.527AlaSer: 4.527 ± 0.784
6.173AlaThr: 6.173 ± 3.103
10.7AlaVal: 10.7 ± 1.97
1.235AlaTrp: 1.235 ± 0.019
2.469AlaTyr: 2.469 ± 1.241
0.0AlaXaa: 0.0 ± 0.0
Cys
0.412CysAla: 0.412 ± 0.219
0.412CysCys: 0.412 ± 0.219
0.412CysAsp: 0.412 ± 0.42
0.412CysGlu: 0.412 ± 0.219
0.0CysPhe: 0.0 ± 0.0
1.235CysGly: 1.235 ± 0.658
0.0CysHis: 0.0 ± 0.0
1.235CysIle: 1.235 ± 0.621
0.823CysLys: 0.823 ± 0.438
0.823CysLeu: 0.823 ± 0.201
0.412CysMet: 0.412 ± 0.219
0.412CysAsn: 0.412 ± 0.219
1.235CysPro: 1.235 ± 0.019
0.0CysGln: 0.0 ± 0.0
0.412CysArg: 0.412 ± 0.219
0.0CysSer: 0.0 ± 0.0
1.646CysThr: 1.646 ± 0.401
1.646CysVal: 1.646 ± 0.401
0.0CysTrp: 0.0 ± 0.0
0.823CysTyr: 0.823 ± 0.201
0.0CysXaa: 0.0 ± 0.0
Asp
3.704AspAla: 3.704 ± 0.695
0.412AspCys: 0.412 ± 0.219
6.996AspAsp: 6.996 ± 2.448
2.469AspGlu: 2.469 ± 0.676
3.292AspPhe: 3.292 ± 0.164
2.469AspGly: 2.469 ± 0.676
0.823AspHis: 0.823 ± 0.438
2.881AspIle: 2.881 ± 1.022
4.527AspLys: 4.527 ± 2.411
6.584AspLeu: 6.584 ± 0.951
0.412AspMet: 0.412 ± 0.42
3.704AspAsn: 3.704 ± 0.056
2.469AspPro: 2.469 ± 0.602
2.058AspGln: 2.058 ± 0.457
4.115AspArg: 4.115 ± 0.914
2.881AspSer: 2.881 ± 1.022
0.823AspThr: 0.823 ± 0.201
4.938AspVal: 4.938 ± 0.074
0.823AspTrp: 0.823 ± 0.438
2.881AspTyr: 2.881 ± 1.535
0.0AspXaa: 0.0 ± 0.0
Glu
4.115GluAla: 4.115 ± 0.914
0.412GluCys: 0.412 ± 0.219
2.881GluAsp: 2.881 ± 0.895
2.469GluGlu: 2.469 ± 0.676
4.115GluPhe: 4.115 ± 0.275
0.412GluGly: 0.412 ± 0.42
0.0GluHis: 0.0 ± 0.0
2.058GluIle: 2.058 ± 0.182
1.646GluLys: 1.646 ± 0.238
4.938GluLeu: 4.938 ± 1.352
2.469GluMet: 2.469 ± 0.676
2.058GluAsn: 2.058 ± 1.096
2.881GluPro: 2.881 ± 0.256
2.058GluGln: 2.058 ± 1.096
3.704GluArg: 3.704 ± 1.973
4.115GluSer: 4.115 ± 0.914
2.058GluThr: 2.058 ± 1.096
1.235GluVal: 1.235 ± 0.019
0.823GluTrp: 0.823 ± 0.438
2.058GluTyr: 2.058 ± 0.182
0.0GluXaa: 0.0 ± 0.0
Phe
3.292PheAla: 3.292 ± 0.476
0.412PheCys: 0.412 ± 0.219
3.704PheAsp: 3.704 ± 0.695
2.469PheGlu: 2.469 ± 0.602
2.881PhePhe: 2.881 ± 0.895
4.527PheGly: 4.527 ± 0.494
1.646PheHis: 1.646 ± 0.401
2.058PheIle: 2.058 ± 0.182
3.292PheLys: 3.292 ± 0.164
4.527PheLeu: 4.527 ± 1.772
1.235PheMet: 1.235 ± 0.019
2.469PheAsn: 2.469 ± 0.602
2.058PhePro: 2.058 ± 0.182
1.646PheGln: 1.646 ± 0.401
4.527PheArg: 4.527 ± 0.494
4.938PheSer: 4.938 ± 0.713
2.881PheThr: 2.881 ± 0.895
4.527PheVal: 4.527 ± 1.133
0.823PheTrp: 0.823 ± 0.201
0.823PheTyr: 0.823 ± 0.201
0.0PheXaa: 0.0 ± 0.0
Gly
5.35GlyAla: 5.35 ± 2.902
0.0GlyCys: 0.0 ± 0.0
3.292GlyAsp: 3.292 ± 1.115
3.292GlyGlu: 3.292 ± 0.164
2.881GlyPhe: 2.881 ± 0.895
4.527GlyGly: 4.527 ± 1.423
1.235GlyHis: 1.235 ± 0.658
6.996GlyIle: 6.996 ± 1.386
4.527GlyLys: 4.527 ± 1.133
4.527GlyLeu: 4.527 ± 1.423
0.823GlyMet: 0.823 ± 0.201
2.058GlyAsn: 2.058 ± 0.821
2.058GlyPro: 2.058 ± 1.46
2.469GlyGln: 2.469 ± 0.037
4.527GlyArg: 4.527 ± 0.145
4.527GlySer: 4.527 ± 1.133
5.35GlyThr: 5.35 ± 2.902
4.527GlyVal: 4.527 ± 0.145
0.412GlyTrp: 0.412 ± 0.219
2.881GlyTyr: 2.881 ± 1.661
0.0GlyXaa: 0.0 ± 0.0
His
1.646HisAla: 1.646 ± 0.401
0.823HisCys: 0.823 ± 0.438
1.235HisAsp: 1.235 ± 0.658
1.235HisGlu: 1.235 ± 0.658
0.412HisPhe: 0.412 ± 0.219
1.235HisGly: 1.235 ± 0.658
0.0HisHis: 0.0 ± 0.0
1.235HisIle: 1.235 ± 0.019
0.823HisLys: 0.823 ± 0.438
3.292HisLeu: 3.292 ± 0.164
0.0HisMet: 0.0 ± 0.0
1.235HisAsn: 1.235 ± 0.621
0.823HisPro: 0.823 ± 0.201
0.412HisGln: 0.412 ± 0.219
0.412HisArg: 0.412 ± 0.219
2.058HisSer: 2.058 ± 0.182
0.412HisThr: 0.412 ± 0.219
1.646HisVal: 1.646 ± 0.877
0.0HisTrp: 0.0 ± 0.0
0.823HisTyr: 0.823 ± 0.201
0.0HisXaa: 0.0 ± 0.0
Ile
4.115IleAla: 4.115 ± 1.642
2.469IleCys: 2.469 ± 1.88
1.235IleAsp: 1.235 ± 0.019
3.704IleGlu: 3.704 ± 1.334
2.058IlePhe: 2.058 ± 0.457
4.115IleGly: 4.115 ± 1.003
0.823IleHis: 0.823 ± 0.438
1.646IleIle: 1.646 ± 0.238
2.881IleLys: 2.881 ± 0.256
4.938IleLeu: 4.938 ± 1.352
1.235IleMet: 1.235 ± 0.658
2.881IleAsn: 2.881 ± 0.256
2.058IlePro: 2.058 ± 0.821
2.881IleGln: 2.881 ± 1.022
2.469IleArg: 2.469 ± 0.037
5.761IleSer: 5.761 ± 2.044
3.704IleThr: 3.704 ± 0.695
2.058IleVal: 2.058 ± 1.46
0.823IleTrp: 0.823 ± 0.438
2.469IleTyr: 2.469 ± 0.602
0.0IleXaa: 0.0 ± 0.0
Lys
4.115LysAla: 4.115 ± 0.364
0.823LysCys: 0.823 ± 0.438
2.058LysAsp: 2.058 ± 0.182
2.881LysGlu: 2.881 ± 1.535
3.292LysPhe: 3.292 ± 1.115
3.704LysGly: 3.704 ± 0.695
0.412LysHis: 0.412 ± 0.219
2.469LysIle: 2.469 ± 0.676
4.527LysLys: 4.527 ± 2.411
5.35LysLeu: 5.35 ± 2.211
2.469LysMet: 2.469 ± 0.037
0.823LysAsn: 0.823 ± 0.438
1.646LysPro: 1.646 ± 0.238
1.646LysGln: 1.646 ± 0.401
3.704LysArg: 3.704 ± 1.973
2.469LysSer: 2.469 ± 0.676
3.292LysThr: 3.292 ± 1.115
3.704LysVal: 3.704 ± 0.695
0.0LysTrp: 0.0 ± 0.0
0.412LysTyr: 0.412 ± 0.219
0.0LysXaa: 0.0 ± 0.0
Leu
9.877LeuAla: 9.877 ± 0.491
0.412LeuCys: 0.412 ± 0.219
4.938LeuAsp: 4.938 ± 1.352
4.938LeuGlu: 4.938 ± 1.991
3.704LeuPhe: 3.704 ± 1.334
3.704LeuGly: 3.704 ± 0.583
2.881LeuHis: 2.881 ± 0.383
4.115LeuIle: 4.115 ± 0.275
4.527LeuLys: 4.527 ± 1.772
6.584LeuLeu: 6.584 ± 2.229
3.704LeuMet: 3.704 ± 0.583
2.469LeuAsn: 2.469 ± 0.037
5.35LeuPro: 5.35 ± 0.346
5.761LeuGln: 5.761 ± 0.513
3.292LeuArg: 3.292 ± 0.476
6.584LeuSer: 6.584 ± 0.312
6.996LeuThr: 6.996 ± 0.531
4.115LeuVal: 4.115 ± 1.553
2.469LeuTrp: 2.469 ± 0.676
3.292LeuTyr: 3.292 ± 0.476
0.0LeuXaa: 0.0 ± 0.0
Met
0.823MetAla: 0.823 ± 0.438
0.412MetCys: 0.412 ± 0.219
3.292MetAsp: 3.292 ± 1.754
0.412MetGlu: 0.412 ± 0.219
1.235MetPhe: 1.235 ± 0.621
1.646MetGly: 1.646 ± 0.401
0.412MetHis: 0.412 ± 0.42
1.235MetIle: 1.235 ± 0.658
1.646MetLys: 1.646 ± 0.401
1.235MetLeu: 1.235 ± 0.658
0.412MetMet: 0.412 ± 0.219
0.412MetAsn: 0.412 ± 0.42
0.823MetPro: 0.823 ± 0.438
0.823MetGln: 0.823 ± 0.438
2.469MetArg: 2.469 ± 0.037
1.235MetSer: 1.235 ± 0.621
1.646MetThr: 1.646 ± 0.238
1.646MetVal: 1.646 ± 1.04
0.412MetTrp: 0.412 ± 0.219
1.646MetTyr: 1.646 ± 1.04
0.0MetXaa: 0.0 ± 0.0
Asn
1.646AsnAla: 1.646 ± 0.238
0.823AsnCys: 0.823 ± 0.201
2.469AsnAsp: 2.469 ± 0.037
0.412AsnGlu: 0.412 ± 0.219
3.704AsnPhe: 3.704 ± 1.334
2.058AsnGly: 2.058 ± 0.821
0.412AsnHis: 0.412 ± 0.219
1.646AsnIle: 1.646 ± 0.401
1.235AsnLys: 1.235 ± 0.621
3.292AsnLeu: 3.292 ± 0.164
1.235AsnMet: 1.235 ± 0.275
0.823AsnAsn: 0.823 ± 0.201
2.881AsnPro: 2.881 ± 1.661
1.235AsnGln: 1.235 ± 0.658
2.469AsnArg: 2.469 ± 0.037
2.881AsnSer: 2.881 ± 0.256
5.35AsnThr: 5.35 ± 2.263
3.292AsnVal: 3.292 ± 0.476
0.823AsnTrp: 0.823 ± 0.201
0.412AsnTyr: 0.412 ± 0.219
0.0AsnXaa: 0.0 ± 0.0
Pro
4.115ProAla: 4.115 ± 2.282
1.646ProCys: 1.646 ± 0.401
1.646ProAsp: 1.646 ± 0.238
2.058ProGlu: 2.058 ± 0.182
3.292ProPhe: 3.292 ± 0.164
2.058ProGly: 2.058 ± 0.457
0.412ProHis: 0.412 ± 0.219
2.881ProIle: 2.881 ± 0.256
1.235ProLys: 1.235 ± 0.658
6.173ProLeu: 6.173 ± 1.371
0.412ProMet: 0.412 ± 0.219
0.412ProAsn: 0.412 ± 0.42
2.469ProPro: 2.469 ± 1.88
2.058ProGln: 2.058 ± 0.821
0.823ProArg: 0.823 ± 0.201
4.938ProSer: 4.938 ± 1.843
2.058ProThr: 2.058 ± 0.821
4.115ProVal: 4.115 ± 2.282
0.823ProTrp: 0.823 ± 0.201
3.704ProTyr: 3.704 ± 1.862
0.0ProXaa: 0.0 ± 0.0
Gln
3.704GlnAla: 3.704 ± 0.056
0.412GlnCys: 0.412 ± 0.219
2.058GlnAsp: 2.058 ± 0.182
2.058GlnGlu: 2.058 ± 0.457
1.235GlnPhe: 1.235 ± 0.621
2.469GlnGly: 2.469 ± 0.602
1.646GlnHis: 1.646 ± 0.877
2.881GlnIle: 2.881 ± 0.383
0.823GlnLys: 0.823 ± 0.438
4.115GlnLeu: 4.115 ± 0.364
0.0GlnMet: 0.0 ± 0.0
0.823GlnAsn: 0.823 ± 0.438
2.058GlnPro: 2.058 ± 0.182
1.235GlnGln: 1.235 ± 0.658
1.235GlnArg: 1.235 ± 0.658
2.058GlnSer: 2.058 ± 0.457
1.235GlnThr: 1.235 ± 0.019
0.823GlnVal: 0.823 ± 0.201
1.235GlnTrp: 1.235 ± 0.019
1.235GlnTyr: 1.235 ± 0.621
0.0GlnXaa: 0.0 ± 0.0
Arg
3.292ArgAla: 3.292 ± 0.164
0.412ArgCys: 0.412 ± 0.219
3.292ArgAsp: 3.292 ± 1.754
5.35ArgGlu: 5.35 ± 1.572
1.235ArgPhe: 1.235 ± 0.019
4.115ArgGly: 4.115 ± 0.364
0.823ArgHis: 0.823 ± 0.438
2.058ArgIle: 2.058 ± 0.457
3.704ArgLys: 3.704 ± 1.334
6.584ArgLeu: 6.584 ± 0.951
2.058ArgMet: 2.058 ± 0.457
3.292ArgAsn: 3.292 ± 1.754
1.646ArgPro: 1.646 ± 0.238
1.235ArgGln: 1.235 ± 0.019
4.115ArgArg: 4.115 ± 1.553
2.469ArgSer: 2.469 ± 0.676
2.881ArgThr: 2.881 ± 0.383
3.704ArgVal: 3.704 ± 0.695
0.823ArgTrp: 0.823 ± 0.201
2.469ArgTyr: 2.469 ± 0.037
0.0ArgXaa: 0.0 ± 0.0
Ser
6.584SerAla: 6.584 ± 0.327
0.412SerCys: 0.412 ± 0.42
4.115SerAsp: 4.115 ± 1.003
2.881SerGlu: 2.881 ± 0.256
4.527SerPhe: 4.527 ± 0.145
4.115SerGly: 4.115 ± 1.003
2.469SerHis: 2.469 ± 0.037
4.938SerIle: 4.938 ± 0.565
4.527SerLys: 4.527 ± 1.772
4.938SerLeu: 4.938 ± 0.074
1.646SerMet: 1.646 ± 0.401
2.058SerAsn: 2.058 ± 0.457
4.527SerPro: 4.527 ± 0.145
0.823SerGln: 0.823 ± 0.84
3.704SerArg: 3.704 ± 0.583
8.23SerSer: 8.23 ± 2.646
6.173SerThr: 6.173 ± 4.381
6.996SerVal: 6.996 ± 0.747
0.823SerTrp: 0.823 ± 0.84
2.058SerTyr: 2.058 ± 1.096
0.0SerXaa: 0.0 ± 0.0
Thr
5.761ThrAla: 5.761 ± 1.405
0.0ThrCys: 0.0 ± 0.0
3.704ThrAsp: 3.704 ± 0.583
2.058ThrGlu: 2.058 ± 0.457
2.881ThrPhe: 2.881 ± 1.022
4.938ThrGly: 4.938 ± 1.204
1.646ThrHis: 1.646 ± 0.877
1.646ThrIle: 1.646 ± 1.68
3.292ThrLys: 3.292 ± 1.115
4.115ThrLeu: 4.115 ± 0.275
2.058ThrMet: 2.058 ± 0.182
4.115ThrAsn: 4.115 ± 2.282
3.704ThrPro: 3.704 ± 2.501
1.235ThrGln: 1.235 ± 0.658
3.292ThrArg: 3.292 ± 1.115
7.819ThrSer: 7.819 ± 4.143
3.292ThrThr: 3.292 ± 1.442
4.115ThrVal: 4.115 ± 0.364
0.412ThrTrp: 0.412 ± 0.219
2.469ThrTyr: 2.469 ± 0.676
0.0ThrXaa: 0.0 ± 0.0
Val
8.23ValAla: 8.23 ± 0.729
0.823ValCys: 0.823 ± 0.438
4.115ValAsp: 4.115 ± 0.275
5.35ValGlu: 5.35 ± 0.932
3.704ValPhe: 3.704 ± 0.695
4.527ValGly: 4.527 ± 0.494
2.881ValHis: 2.881 ± 0.383
4.115ValIle: 4.115 ± 0.364
2.469ValLys: 2.469 ± 0.037
5.35ValLeu: 5.35 ± 0.293
1.235ValMet: 1.235 ± 0.019
4.115ValAsn: 4.115 ± 0.364
4.527ValPro: 4.527 ± 1.423
2.058ValGln: 2.058 ± 0.821
3.292ValArg: 3.292 ± 0.476
4.115ValSer: 4.115 ± 1.642
4.115ValThr: 4.115 ± 1.003
6.173ValVal: 6.173 ± 1.186
0.0ValTrp: 0.0 ± 0.0
3.292ValTyr: 3.292 ± 0.164
0.0ValXaa: 0.0 ± 0.0
Trp
1.235TrpAla: 1.235 ± 0.621
0.412TrpCys: 0.412 ± 0.219
0.412TrpAsp: 0.412 ± 0.219
0.823TrpGlu: 0.823 ± 0.438
1.235TrpPhe: 1.235 ± 0.621
0.823TrpGly: 0.823 ± 0.201
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
1.235TrpLeu: 1.235 ± 0.019
0.0TrpMet: 0.0 ± 0.0
0.412TrpAsn: 0.412 ± 0.219
0.412TrpPro: 0.412 ± 0.219
0.823TrpGln: 0.823 ± 0.438
1.646TrpArg: 1.646 ± 0.238
0.823TrpSer: 0.823 ± 0.438
1.235TrpThr: 1.235 ± 0.019
0.412TrpVal: 0.412 ± 0.42
0.0TrpTrp: 0.0 ± 0.0
1.646TrpTyr: 1.646 ± 0.877
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.881TyrAla: 2.881 ± 1.661
0.412TyrCys: 0.412 ± 0.219
3.292TyrAsp: 3.292 ± 0.164
1.235TyrGlu: 1.235 ± 0.658
3.292TyrPhe: 3.292 ± 0.803
5.35TyrGly: 5.35 ± 0.985
0.412TyrHis: 0.412 ± 0.42
2.058TyrIle: 2.058 ± 0.182
1.235TyrLys: 1.235 ± 0.658
3.292TyrLeu: 3.292 ± 0.164
0.0TyrMet: 0.0 ± 0.179
1.235TyrAsn: 1.235 ± 0.621
0.0TyrPro: 0.0 ± 0.0
1.235TyrGln: 1.235 ± 0.019
1.235TyrArg: 1.235 ± 0.019
4.115TyrSer: 4.115 ± 1.003
1.235TyrThr: 1.235 ± 0.658
4.115TyrVal: 4.115 ± 0.275
0.823TyrTrp: 0.823 ± 0.438
1.235TyrTyr: 1.235 ± 0.658
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2431 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski