Amino acid dipepetide frequency for Hubei tombus-like virus 13

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.756AlaAla: 2.756 ± 0.514
4.41AlaCys: 4.41 ± 2.335
1.654AlaAsp: 1.654 ± 0.875
5.513AlaGlu: 5.513 ± 2.015
3.308AlaPhe: 3.308 ± 0.764
3.859AlaGly: 3.859 ± 2.043
1.654AlaHis: 1.654 ± 0.111
4.41AlaIle: 4.41 ± 1.348
6.615AlaLys: 6.615 ± 0.445
10.474AlaLeu: 10.474 ± 2.585
1.103AlaMet: 1.103 ± 1.39
0.551AlaAsn: 0.551 ± 0.292
2.756AlaPro: 2.756 ± 0.472
2.756AlaGln: 2.756 ± 0.472
3.859AlaArg: 3.859 ± 0.917
3.859AlaSer: 3.859 ± 0.069
2.756AlaThr: 2.756 ± 0.514
4.961AlaVal: 4.961 ± 0.653
2.756AlaTrp: 2.756 ± 1.501
1.103AlaTyr: 1.103 ± 0.403
0.0AlaXaa: 0.0 ± 0.0
Cys
2.205CysAla: 2.205 ± 0.806
1.103CysCys: 1.103 ± 0.584
1.654CysAsp: 1.654 ± 0.875
3.859CysGlu: 3.859 ± 0.069
1.103CysPhe: 1.103 ± 0.403
3.859CysGly: 3.859 ± 0.069
1.103CysHis: 1.103 ± 0.584
1.654CysIle: 1.654 ± 0.875
1.654CysLys: 1.654 ± 0.875
3.308CysLeu: 3.308 ± 1.751
0.551CysMet: 0.551 ± 0.695
1.103CysAsn: 1.103 ± 0.403
3.859CysPro: 3.859 ± 0.917
1.103CysGln: 1.103 ± 0.403
2.205CysArg: 2.205 ± 0.806
1.654CysSer: 1.654 ± 0.875
1.654CysThr: 1.654 ± 0.875
2.205CysVal: 2.205 ± 1.167
0.0CysTrp: 0.0 ± 0.0
1.103CysTyr: 1.103 ± 0.403
0.0CysXaa: 0.0 ± 0.0
Asp
3.308AspAla: 3.308 ± 1.209
3.308AspCys: 3.308 ± 0.222
2.756AspAsp: 2.756 ± 0.472
3.859AspGlu: 3.859 ± 0.069
2.205AspPhe: 2.205 ± 0.181
4.961AspGly: 4.961 ± 0.653
0.551AspHis: 0.551 ± 0.695
3.308AspIle: 3.308 ± 0.222
1.103AspLys: 1.103 ± 0.584
0.551AspLeu: 0.551 ± 0.292
0.0AspMet: 0.0 ± 0.0
1.103AspAsn: 1.103 ± 0.403
1.654AspPro: 1.654 ± 0.111
1.103AspGln: 1.103 ± 0.584
4.41AspArg: 4.41 ± 1.348
2.756AspSer: 2.756 ± 1.501
2.756AspThr: 2.756 ± 1.459
2.205AspVal: 2.205 ± 0.181
1.103AspTrp: 1.103 ± 0.584
2.756AspTyr: 2.756 ± 1.459
0.0AspXaa: 0.0 ± 0.0
Glu
7.718GluAla: 7.718 ± 2.112
1.103GluCys: 1.103 ± 0.403
5.513GluAsp: 5.513 ± 0.042
11.577GluGlu: 11.577 ± 0.779
4.961GluPhe: 4.961 ± 1.32
4.41GluGly: 4.41 ± 0.361
1.103GluHis: 1.103 ± 0.584
3.308GluIle: 3.308 ± 0.764
5.513GluLys: 5.513 ± 1.029
4.41GluLeu: 4.41 ± 2.599
2.205GluMet: 2.205 ± 1.793
3.308GluAsn: 3.308 ± 2.196
2.205GluPro: 2.205 ± 0.806
4.961GluGln: 4.961 ± 1.64
9.372GluArg: 9.372 ± 3.974
3.859GluSer: 3.859 ± 1.056
3.308GluThr: 3.308 ± 0.764
7.718GluVal: 7.718 ± 0.848
1.654GluTrp: 1.654 ± 0.111
2.756GluTyr: 2.756 ± 1.459
0.0GluXaa: 0.0 ± 0.0
Phe
2.205PheAla: 2.205 ± 0.181
1.103PheCys: 1.103 ± 0.403
2.205PheAsp: 2.205 ± 0.181
2.756PheGlu: 2.756 ± 0.514
0.0PhePhe: 0.0 ± 0.0
4.961PheGly: 4.961 ± 0.653
1.103PheHis: 1.103 ± 1.39
1.103PheIle: 1.103 ± 0.584
1.654PheLys: 1.654 ± 0.111
3.308PheLeu: 3.308 ± 0.222
1.103PheMet: 1.103 ± 0.403
1.654PheAsn: 1.654 ± 0.111
0.551PhePro: 0.551 ± 0.292
0.551PheGln: 0.551 ± 0.292
1.103PheArg: 1.103 ± 0.584
1.103PheSer: 1.103 ± 0.584
1.103PheThr: 1.103 ± 0.584
2.205PheVal: 2.205 ± 0.181
1.103PheTrp: 1.103 ± 1.39
1.103PheTyr: 1.103 ± 0.403
0.0PheXaa: 0.0 ± 0.0
Gly
3.308GlyAla: 3.308 ± 0.222
3.859GlyCys: 3.859 ± 0.917
3.859GlyAsp: 3.859 ± 1.904
7.166GlyGlu: 7.166 ± 2.807
2.205GlyPhe: 2.205 ± 0.181
4.961GlyGly: 4.961 ± 1.64
1.103GlyHis: 1.103 ± 0.584
2.756GlyIle: 2.756 ± 0.514
4.961GlyLys: 4.961 ± 1.32
2.756GlyLeu: 2.756 ± 0.514
1.654GlyMet: 1.654 ± 0.111
4.41GlyAsn: 4.41 ± 0.361
3.308GlyPro: 3.308 ± 1.751
2.205GlyGln: 2.205 ± 0.181
4.41GlyArg: 4.41 ± 1.348
3.859GlySer: 3.859 ± 0.069
3.859GlyThr: 3.859 ± 0.917
9.923GlyVal: 9.923 ± 1.306
2.756GlyTrp: 2.756 ± 1.459
1.654GlyTyr: 1.654 ± 0.111
0.0GlyXaa: 0.0 ± 0.0
His
2.205HisAla: 2.205 ± 0.181
0.551HisCys: 0.551 ± 0.292
1.103HisAsp: 1.103 ± 0.584
1.654HisGlu: 1.654 ± 0.111
0.0HisPhe: 0.0 ± 0.0
1.654HisGly: 1.654 ± 0.875
0.0HisHis: 0.0 ± 0.0
0.551HisIle: 0.551 ± 0.695
0.551HisLys: 0.551 ± 0.695
0.551HisLeu: 0.551 ± 0.695
0.0HisMet: 0.0 ± 0.0
1.654HisAsn: 1.654 ± 1.098
0.551HisPro: 0.551 ± 0.695
1.654HisGln: 1.654 ± 0.875
2.205HisArg: 2.205 ± 1.167
1.654HisSer: 1.654 ± 0.111
1.103HisThr: 1.103 ± 1.39
2.205HisVal: 2.205 ± 0.181
1.103HisTrp: 1.103 ± 0.584
1.103HisTyr: 1.103 ± 0.584
0.0HisXaa: 0.0 ± 0.0
Ile
3.859IleAla: 3.859 ± 0.069
1.654IleCys: 1.654 ± 0.111
3.308IleAsp: 3.308 ± 0.764
3.859IleGlu: 3.859 ± 0.917
0.551IlePhe: 0.551 ± 0.695
3.859IleGly: 3.859 ± 0.069
0.0IleHis: 0.0 ± 0.0
2.205IleIle: 2.205 ± 1.793
2.756IleLys: 2.756 ± 0.514
3.308IleLeu: 3.308 ± 1.751
1.103IleMet: 1.103 ± 0.403
2.205IleAsn: 2.205 ± 0.181
2.205IlePro: 2.205 ± 0.181
1.103IleGln: 1.103 ± 0.403
2.205IleArg: 2.205 ± 0.181
1.103IleSer: 1.103 ± 0.403
1.654IleThr: 1.654 ± 0.111
2.756IleVal: 2.756 ± 0.472
0.551IleTrp: 0.551 ± 0.292
0.0IleTyr: 0.0 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
6.615LysAla: 6.615 ± 0.542
1.103LysCys: 1.103 ± 0.403
2.205LysAsp: 2.205 ± 1.793
6.615LysGlu: 6.615 ± 1.432
1.103LysPhe: 1.103 ± 0.403
7.166LysGly: 7.166 ± 1.14
0.551LysHis: 0.551 ± 0.292
2.756LysIle: 2.756 ± 0.514
5.513LysLys: 5.513 ± 0.042
5.513LysLeu: 5.513 ± 1.029
0.0LysMet: 0.0 ± 0.398
1.103LysAsn: 1.103 ± 0.403
4.41LysPro: 4.41 ± 0.361
0.551LysGln: 0.551 ± 0.292
6.615LysArg: 6.615 ± 2.418
1.103LysSer: 1.103 ± 1.39
2.756LysThr: 2.756 ± 1.501
4.41LysVal: 4.41 ± 0.361
1.103LysTrp: 1.103 ± 0.584
1.654LysTyr: 1.654 ± 1.098
0.0LysXaa: 0.0 ± 0.0
Leu
6.615LeuAla: 6.615 ± 3.405
1.103LeuCys: 1.103 ± 0.584
2.756LeuAsp: 2.756 ± 0.514
8.269LeuGlu: 8.269 ± 0.431
2.205LeuPhe: 2.205 ± 1.167
4.961LeuGly: 4.961 ± 0.653
3.308LeuHis: 3.308 ± 1.209
2.205LeuIle: 2.205 ± 0.181
4.961LeuLys: 4.961 ± 3.294
8.269LeuLeu: 8.269 ± 2.53
1.103LeuMet: 1.103 ± 1.39
4.961LeuAsn: 4.961 ± 0.334
2.756LeuPro: 2.756 ± 0.472
4.41LeuGln: 4.41 ± 0.626
6.615LeuArg: 6.615 ± 1.529
1.654LeuSer: 1.654 ± 1.098
0.551LeuThr: 0.551 ± 0.292
3.859LeuVal: 3.859 ± 1.056
2.205LeuTrp: 2.205 ± 0.806
3.308LeuTyr: 3.308 ± 0.764
0.0LeuXaa: 0.0 ± 0.0
Met
0.0MetAla: 0.0 ± 0.0
1.103MetCys: 1.103 ± 0.403
1.654MetAsp: 1.654 ± 0.111
2.756MetGlu: 2.756 ± 0.514
0.551MetPhe: 0.551 ± 0.292
1.103MetGly: 1.103 ± 0.403
0.551MetHis: 0.551 ± 0.292
0.0MetIle: 0.0 ± 0.0
2.205MetLys: 2.205 ± 0.181
0.551MetLeu: 0.551 ± 0.695
0.551MetMet: 0.551 ± 0.292
0.0MetAsn: 0.0 ± 0.0
1.103MetPro: 1.103 ± 1.39
1.103MetGln: 1.103 ± 1.39
0.551MetArg: 0.551 ± 0.695
0.551MetSer: 0.551 ± 0.695
0.0MetThr: 0.0 ± 0.0
1.654MetVal: 1.654 ± 0.111
0.551MetTrp: 0.551 ± 0.292
1.103MetTyr: 1.103 ± 0.403
0.0MetXaa: 0.0 ± 0.0
Asn
2.756AsnAla: 2.756 ± 1.459
1.654AsnCys: 1.654 ± 0.111
2.205AsnAsp: 2.205 ± 1.167
1.654AsnGlu: 1.654 ± 1.098
1.103AsnPhe: 1.103 ± 0.403
2.756AsnGly: 2.756 ± 0.514
0.551AsnHis: 0.551 ± 0.292
2.205AsnIle: 2.205 ± 0.181
3.308AsnLys: 3.308 ± 0.222
3.308AsnLeu: 3.308 ± 1.209
0.551AsnMet: 0.551 ± 0.244
3.308AsnAsn: 3.308 ± 0.764
3.859AsnPro: 3.859 ± 0.069
0.551AsnGln: 0.551 ± 0.695
1.103AsnArg: 1.103 ± 0.403
0.551AsnSer: 0.551 ± 0.292
2.756AsnThr: 2.756 ± 2.488
1.103AsnVal: 1.103 ± 1.39
1.654AsnTrp: 1.654 ± 0.875
1.103AsnTyr: 1.103 ± 0.584
0.0AsnXaa: 0.0 ± 0.0
Pro
2.205ProAla: 2.205 ± 1.167
0.551ProCys: 0.551 ± 0.292
1.103ProAsp: 1.103 ± 0.403
3.859ProGlu: 3.859 ± 0.069
0.551ProPhe: 0.551 ± 0.292
2.756ProGly: 2.756 ± 1.459
1.103ProHis: 1.103 ± 0.403
1.103ProIle: 1.103 ± 0.403
1.654ProLys: 1.654 ± 0.875
5.513ProLeu: 5.513 ± 0.945
1.103ProMet: 1.103 ± 0.584
3.308ProAsn: 3.308 ± 0.222
2.756ProPro: 2.756 ± 1.459
1.654ProGln: 1.654 ± 1.098
2.205ProArg: 2.205 ± 1.167
3.308ProSer: 3.308 ± 0.222
3.859ProThr: 3.859 ± 0.917
7.166ProVal: 7.166 ± 3.113
0.551ProTrp: 0.551 ± 0.292
0.551ProTyr: 0.551 ± 0.695
0.0ProXaa: 0.0 ± 0.0
Gln
1.654GlnAla: 1.654 ± 0.875
2.205GlnCys: 2.205 ± 1.167
1.654GlnAsp: 1.654 ± 0.875
4.41GlnGlu: 4.41 ± 2.335
1.654GlnPhe: 1.654 ± 0.875
1.654GlnGly: 1.654 ± 0.111
1.654GlnHis: 1.654 ± 1.098
1.103GlnIle: 1.103 ± 0.403
1.103GlnLys: 1.103 ± 0.403
3.308GlnLeu: 3.308 ± 1.209
0.0GlnMet: 0.0 ± 0.0
0.551GlnAsn: 0.551 ± 0.292
0.551GlnPro: 0.551 ± 0.695
1.103GlnGln: 1.103 ± 0.403
2.205GlnArg: 2.205 ± 0.181
3.308GlnSer: 3.308 ± 0.764
1.654GlnThr: 1.654 ± 1.098
2.205GlnVal: 2.205 ± 1.167
1.103GlnTrp: 1.103 ± 0.584
1.103GlnTyr: 1.103 ± 1.39
0.0GlnXaa: 0.0 ± 0.0
Arg
6.064ArgAla: 6.064 ± 0.25
3.859ArgCys: 3.859 ± 1.056
3.308ArgAsp: 3.308 ± 1.751
9.923ArgGlu: 9.923 ± 3.28
4.41ArgPhe: 4.41 ± 0.626
1.654ArgGly: 1.654 ± 0.111
2.205ArgHis: 2.205 ± 0.181
3.308ArgIle: 3.308 ± 0.222
5.513ArgLys: 5.513 ± 1.029
3.859ArgLeu: 3.859 ± 1.904
1.654ArgMet: 1.654 ± 0.111
3.308ArgAsn: 3.308 ± 0.764
3.308ArgPro: 3.308 ± 0.222
3.308ArgGln: 3.308 ± 0.764
9.372ArgArg: 9.372 ± 1.014
1.654ArgSer: 1.654 ± 1.098
2.205ArgThr: 2.205 ± 0.181
8.82ArgVal: 8.82 ± 1.709
1.103ArgTrp: 1.103 ± 0.584
2.756ArgTyr: 2.756 ± 1.501
0.0ArgXaa: 0.0 ± 0.0
Ser
5.513SerAla: 5.513 ± 1.029
2.205SerCys: 2.205 ± 0.181
2.205SerAsp: 2.205 ± 0.181
1.654SerGlu: 1.654 ± 0.111
1.654SerPhe: 1.654 ± 0.111
4.961SerGly: 4.961 ± 0.334
1.103SerHis: 1.103 ± 0.584
0.551SerIle: 0.551 ± 0.292
3.308SerLys: 3.308 ± 0.222
2.205SerLeu: 2.205 ± 0.806
1.103SerMet: 1.103 ± 0.403
0.551SerAsn: 0.551 ± 0.292
1.654SerPro: 1.654 ± 0.111
1.103SerGln: 1.103 ± 0.403
6.615SerArg: 6.615 ± 1.432
0.551SerSer: 0.551 ± 0.292
3.308SerThr: 3.308 ± 1.751
4.41SerVal: 4.41 ± 0.361
1.654SerTrp: 1.654 ± 0.875
1.103SerTyr: 1.103 ± 0.584
0.0SerXaa: 0.0 ± 0.0
Thr
2.205ThrAla: 2.205 ± 0.181
0.551ThrCys: 0.551 ± 0.292
1.103ThrAsp: 1.103 ± 0.403
3.308ThrGlu: 3.308 ± 1.209
1.654ThrPhe: 1.654 ± 1.098
2.756ThrGly: 2.756 ± 0.514
1.654ThrHis: 1.654 ± 0.111
2.205ThrIle: 2.205 ± 0.806
3.308ThrLys: 3.308 ± 0.222
1.654ThrLeu: 1.654 ± 0.875
0.551ThrMet: 0.551 ± 0.292
1.103ThrAsn: 1.103 ± 0.584
4.41ThrPro: 4.41 ± 0.361
1.654ThrGln: 1.654 ± 0.875
4.961ThrArg: 4.961 ± 0.334
2.756ThrSer: 2.756 ± 1.459
0.551ThrThr: 0.551 ± 0.292
6.615ThrVal: 6.615 ± 3.405
0.551ThrTrp: 0.551 ± 0.292
2.205ThrTyr: 2.205 ± 0.181
0.0ThrXaa: 0.0 ± 0.0
Val
6.615ValAla: 6.615 ± 2.515
2.205ValCys: 2.205 ± 0.806
2.756ValAsp: 2.756 ± 0.472
4.961ValGlu: 4.961 ± 0.334
1.103ValPhe: 1.103 ± 0.403
7.718ValGly: 7.718 ± 0.848
2.756ValHis: 2.756 ± 1.459
4.41ValIle: 4.41 ± 0.626
3.859ValLys: 3.859 ± 1.904
6.615ValLeu: 6.615 ± 1.432
1.654ValMet: 1.654 ± 0.111
2.205ValAsn: 2.205 ± 0.181
3.859ValPro: 3.859 ± 0.069
2.756ValGln: 2.756 ± 1.459
7.718ValArg: 7.718 ± 0.139
7.718ValSer: 7.718 ± 1.125
6.064ValThr: 6.064 ± 1.724
4.961ValVal: 4.961 ± 0.653
1.103ValTrp: 1.103 ± 0.584
2.205ValTyr: 2.205 ± 1.167
0.0ValXaa: 0.0 ± 0.0
Trp
2.205TrpAla: 2.205 ± 1.167
2.205TrpCys: 2.205 ± 1.167
1.103TrpAsp: 1.103 ± 0.403
1.103TrpGlu: 1.103 ± 0.584
1.654TrpPhe: 1.654 ± 0.875
1.103TrpGly: 1.103 ± 0.584
0.0TrpHis: 0.0 ± 0.0
1.103TrpIle: 1.103 ± 0.584
1.103TrpLys: 1.103 ± 1.39
3.859TrpLeu: 3.859 ± 0.069
1.103TrpMet: 1.103 ± 0.584
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.551TrpGln: 0.551 ± 0.292
2.205TrpArg: 2.205 ± 0.806
1.654TrpSer: 1.654 ± 0.111
1.103TrpThr: 1.103 ± 0.584
0.551TrpVal: 0.551 ± 0.292
0.0TrpTrp: 0.0 ± 0.0
0.551TrpTyr: 0.551 ± 0.292
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.103TyrAla: 1.103 ± 0.584
1.103TyrCys: 1.103 ± 0.403
1.654TyrAsp: 1.654 ± 0.875
2.205TyrGlu: 2.205 ± 0.181
0.0TyrPhe: 0.0 ± 0.0
3.859TyrGly: 3.859 ± 0.069
0.0TyrHis: 0.0 ± 0.0
0.0TyrIle: 0.0 ± 0.0
2.756TyrLys: 2.756 ± 1.501
2.756TyrLeu: 2.756 ± 1.501
0.0TyrMet: 0.0 ± 0.0
1.654TyrAsn: 1.654 ± 1.098
1.103TyrPro: 1.103 ± 0.584
0.0TyrGln: 0.0 ± 0.0
1.654TyrArg: 1.654 ± 0.111
2.756TyrSer: 2.756 ± 0.472
2.756TyrThr: 2.756 ± 1.459
3.308TyrVal: 3.308 ± 0.764
0.551TyrTrp: 0.551 ± 0.292
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1815 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski