Amino acid dipepetide frequency for Human astrovirus-1 (HAstV-1)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.387AlaAla: 5.387 ± 0.381
2.218AlaCys: 2.218 ± 0.101
2.852AlaAsp: 2.852 ± 0.315
1.267AlaGlu: 1.267 ± 0.484
4.753AlaPhe: 4.753 ± 0.873
2.852AlaGly: 2.852 ± 1.095
2.535AlaHis: 2.535 ± 0.774
3.169AlaIle: 3.169 ± 0.393
2.535AlaLys: 2.535 ± 0.382
5.387AlaLeu: 5.387 ± 0.493
1.584AlaMet: 1.584 ± 0.196
2.852AlaAsn: 2.852 ± 0.131
7.921AlaPro: 7.921 ± 0.521
1.584AlaGln: 1.584 ± 0.291
4.753AlaArg: 4.753 ± 1.651
2.852AlaSer: 2.852 ± 2.193
7.288AlaThr: 7.288 ± 0.604
5.703AlaVal: 5.703 ± 1.13
0.634AlaTrp: 0.634 ± 0.242
3.485AlaTyr: 3.485 ± 0.664
0.0AlaXaa: 0.0 ± 0.0
Cys
0.634CysAla: 0.634 ± 0.413
0.0CysCys: 0.0 ± 0.0
0.951CysAsp: 0.951 ± 0.306
0.317CysGlu: 0.317 ± 0.288
0.951CysPhe: 0.951 ± 0.142
0.634CysGly: 0.634 ± 0.413
0.0CysHis: 0.0 ± 0.0
0.317CysIle: 0.317 ± 0.288
0.634CysLys: 0.634 ± 0.202
1.267CysLeu: 1.267 ± 0.482
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
1.267CysPro: 1.267 ± 0.403
0.317CysGln: 0.317 ± 0.288
0.951CysArg: 0.951 ± 0.306
0.0CysSer: 0.0 ± 0.0
0.634CysThr: 0.634 ± 0.576
1.584CysVal: 1.584 ± 0.476
0.0CysTrp: 0.0 ± 0.0
1.901CysTyr: 1.901 ± 0.188
0.0CysXaa: 0.0 ± 0.0
Asp
3.802AspAla: 3.802 ± 0.791
0.0AspCys: 0.0 ± 0.0
4.436AspAsp: 4.436 ± 1.381
4.436AspGlu: 4.436 ± 0.523
3.169AspPhe: 3.169 ± 0.581
3.802AspGly: 3.802 ± 0.376
0.634AspHis: 0.634 ± 0.242
2.852AspIle: 2.852 ± 0.315
4.436AspLys: 4.436 ± 0.251
6.02AspLeu: 6.02 ± 1.038
0.634AspMet: 0.634 ± 0.202
1.901AspAsn: 1.901 ± 0.573
0.951AspPro: 0.951 ± 0.306
1.584AspGln: 1.584 ± 0.279
2.852AspArg: 2.852 ± 0.315
2.535AspSer: 2.535 ± 0.534
3.485AspThr: 3.485 ± 0.99
2.852AspVal: 2.852 ± 0.425
2.218AspTrp: 2.218 ± 0.343
2.852AspTyr: 2.852 ± 0.857
0.0AspXaa: 0.0 ± 0.0
Glu
4.119GluAla: 4.119 ± 0.671
0.0GluCys: 0.0 ± 0.0
6.02GluAsp: 6.02 ± 0.278
5.07GluGlu: 5.07 ± 0.654
2.852GluPhe: 2.852 ± 0.733
2.852GluGly: 2.852 ± 0.131
1.584GluHis: 1.584 ± 0.476
4.119GluIle: 4.119 ± 0.091
4.119GluLys: 4.119 ± 1.388
2.852GluLeu: 2.852 ± 0.425
2.535GluMet: 2.535 ± 0.313
2.852GluAsn: 2.852 ± 0.488
3.802GluPro: 3.802 ± 1.225
2.535GluGln: 2.535 ± 0.382
3.485GluArg: 3.485 ± 0.664
2.535GluSer: 2.535 ± 0.388
2.535GluThr: 2.535 ± 0.277
2.218GluVal: 2.218 ± 0.343
1.267GluTrp: 1.267 ± 0.08
2.218GluTyr: 2.218 ± 0.343
0.0GluXaa: 0.0 ± 0.0
Phe
2.218PheAla: 2.218 ± 0.343
0.317PheCys: 0.317 ± 0.206
5.387PheAsp: 5.387 ± 1.262
2.218PheGlu: 2.218 ± 0.343
1.901PhePhe: 1.901 ± 0.612
1.901PheGly: 1.901 ± 0.573
1.267PheHis: 1.267 ± 0.403
2.852PheIle: 2.852 ± 0.919
1.901PheLys: 1.901 ± 0.726
5.387PheLeu: 5.387 ± 0.926
1.584PheMet: 1.584 ± 0.476
1.267PheAsn: 1.267 ± 0.08
1.267PhePro: 1.267 ± 0.08
1.901PheGln: 1.901 ± 0.283
0.951PheArg: 0.951 ± 0.306
1.267PheSer: 1.267 ± 0.482
2.852PheThr: 2.852 ± 1.057
4.436PheVal: 4.436 ± 0.744
0.317PheTrp: 0.317 ± 0.206
1.584PheTyr: 1.584 ± 0.291
0.0PheXaa: 0.0 ± 0.0
Gly
5.07GlyAla: 5.07 ± 0.776
0.0GlyCys: 0.0 ± 0.0
2.535GlyAsp: 2.535 ± 0.159
2.852GlyGlu: 2.852 ± 0.676
3.169GlyPhe: 3.169 ± 0.207
4.753GlyGly: 4.753 ± 0.489
2.218GlyHis: 2.218 ± 0.859
2.218GlyIle: 2.218 ± 0.372
4.753GlyLys: 4.753 ± 1.043
6.02GlyLeu: 6.02 ± 0.278
2.218GlyMet: 2.218 ± 0.166
2.852GlyAsn: 2.852 ± 0.488
3.485GlyPro: 3.485 ± 0.552
0.634GlyGln: 0.634 ± 0.413
3.802GlyArg: 3.802 ± 1.219
4.436GlySer: 4.436 ± 1.785
5.07GlyThr: 5.07 ± 0.502
2.852GlyVal: 2.852 ± 0.425
2.535GlyTrp: 2.535 ± 0.78
2.535GlyTyr: 2.535 ± 0.277
0.0GlyXaa: 0.0 ± 0.0
His
1.584HisAla: 1.584 ± 0.291
0.317HisCys: 0.317 ± 0.206
0.0HisAsp: 0.0 ± 0.0
1.584HisGlu: 1.584 ± 0.476
0.634HisPhe: 0.634 ± 0.242
2.535HisGly: 2.535 ± 0.159
0.0HisHis: 0.0 ± 0.0
0.634HisIle: 0.634 ± 0.413
0.634HisLys: 0.634 ± 0.202
2.218HisLeu: 2.218 ± 0.101
0.634HisMet: 0.634 ± 0.337
0.0HisAsn: 0.0 ± 0.0
1.584HisPro: 1.584 ± 0.476
1.267HisGln: 1.267 ± 0.08
0.951HisArg: 0.951 ± 0.306
0.951HisSer: 0.951 ± 0.306
1.267HisThr: 1.267 ± 0.08
1.267HisVal: 1.267 ± 0.482
0.634HisTrp: 0.634 ± 0.202
1.267HisTyr: 1.267 ± 0.406
0.0HisXaa: 0.0 ± 0.0
Ile
3.802IleAla: 3.802 ± 0.567
1.267IleCys: 1.267 ± 0.08
2.852IleAsp: 2.852 ± 0.131
3.485IleGlu: 3.485 ± 0.507
3.169IlePhe: 3.169 ± 0.951
2.852IleGly: 2.852 ± 0.857
1.901IleHis: 1.901 ± 0.188
3.802IleIle: 3.802 ± 0.781
4.753IleLys: 4.753 ± 0.673
4.436IleLeu: 4.436 ± 0.982
1.267IleMet: 1.267 ± 0.406
2.218IleAsn: 2.218 ± 0.343
3.485IlePro: 3.485 ± 0.324
0.951IleGln: 0.951 ± 0.142
3.169IleArg: 3.169 ± 1.083
0.951IleSer: 0.951 ± 0.142
2.852IleThr: 2.852 ± 0.488
3.802IleVal: 3.802 ± 0.376
0.951IleTrp: 0.951 ± 0.306
0.951IleTyr: 0.951 ± 0.306
0.0IleXaa: 0.0 ± 0.0
Lys
5.703LysAla: 5.703 ± 1.13
0.317LysCys: 0.317 ± 0.206
2.218LysAsp: 2.218 ± 0.101
3.802LysGlu: 3.802 ± 0.793
1.584LysPhe: 1.584 ± 0.476
4.436LysGly: 4.436 ± 0.744
1.901LysHis: 1.901 ± 0.299
3.169LysIle: 3.169 ± 0.207
3.802LysLys: 3.802 ± 1.014
7.605LysLeu: 7.605 ± 0.752
0.317LysMet: 0.317 ± 0.206
3.169LysAsn: 3.169 ± 0.393
6.971LysPro: 6.971 ± 0.573
4.436LysGln: 4.436 ± 0.601
0.634LysArg: 0.634 ± 0.576
4.119LysSer: 4.119 ± 0.372
3.485LysThr: 3.485 ± 0.763
3.169LysVal: 3.169 ± 0.207
0.634LysTrp: 0.634 ± 0.202
1.901LysTyr: 1.901 ± 0.188
0.0LysXaa: 0.0 ± 0.0
Leu
7.605LeuAla: 7.605 ± 1.286
2.218LeuCys: 2.218 ± 0.343
5.703LeuAsp: 5.703 ± 0.694
5.07LeuGlu: 5.07 ± 1.693
1.901LeuPhe: 1.901 ± 0.299
4.119LeuGly: 4.119 ± 1.432
1.267LeuHis: 1.267 ± 0.482
4.753LeuIle: 4.753 ± 0.589
5.387LeuLys: 5.387 ± 0.493
7.288LeuLeu: 7.288 ± 1.411
2.535LeuMet: 2.535 ± 0.774
3.485LeuAsn: 3.485 ± 0.507
4.436LeuPro: 4.436 ± 1.123
4.119LeuGln: 4.119 ± 0.513
4.436LeuArg: 4.436 ± 0.982
6.02LeuSer: 6.02 ± 0.278
7.288LeuThr: 7.288 ± 0.587
6.02LeuVal: 6.02 ± 2.8
1.267LeuTrp: 1.267 ± 0.482
0.0LeuTyr: 0.0 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
4.119MetAla: 4.119 ± 0.851
0.317MetCys: 0.317 ± 0.288
0.951MetAsp: 0.951 ± 0.49
1.584MetGlu: 1.584 ± 0.476
0.634MetPhe: 0.634 ± 0.202
0.634MetGly: 0.634 ± 0.202
0.0MetHis: 0.0 ± 0.0
1.267MetIle: 1.267 ± 0.403
1.584MetLys: 1.584 ± 0.476
0.317MetLeu: 0.317 ± 0.288
0.0MetMet: 0.0 ± 0.0
1.584MetAsn: 1.584 ± 0.291
1.267MetPro: 1.267 ± 0.406
0.634MetGln: 0.634 ± 0.242
0.951MetArg: 0.951 ± 0.142
1.901MetSer: 1.901 ± 0.283
1.901MetThr: 1.901 ± 0.283
3.169MetVal: 3.169 ± 0.581
1.267MetTrp: 1.267 ± 0.08
0.951MetTyr: 0.951 ± 0.306
0.0MetXaa: 0.0 ± 0.0
Asn
2.852AsnAla: 2.852 ± 0.676
0.0AsnCys: 0.0 ± 0.0
0.951AsnAsp: 0.951 ± 0.142
2.218AsnGlu: 2.218 ± 0.101
2.852AsnPhe: 2.852 ± 0.919
2.218AsnGly: 2.218 ± 0.101
0.0AsnHis: 0.0 ± 0.0
1.584AsnIle: 1.584 ± 0.196
3.485AsnLys: 3.485 ± 1.27
4.119AsnLeu: 4.119 ± 0.091
0.317AsnMet: 0.317 ± 0.206
3.485AsnAsn: 3.485 ± 0.862
2.535AsnPro: 2.535 ± 1.146
0.951AsnGln: 0.951 ± 0.863
0.951AsnArg: 0.951 ± 0.346
2.218AsnSer: 2.218 ± 0.101
5.387AsnThr: 5.387 ± 0.768
3.169AsnVal: 3.169 ± 0.671
0.317AsnTrp: 0.317 ± 0.288
2.852AsnTyr: 2.852 ± 0.488
0.0AsnXaa: 0.0 ± 0.0
Pro
4.753ProAla: 4.753 ± 0.597
0.317ProCys: 0.317 ± 0.288
3.169ProAsp: 3.169 ± 0.645
6.02ProGlu: 6.02 ± 1.125
1.901ProPhe: 1.901 ± 0.188
4.436ProGly: 4.436 ± 0.635
0.634ProHis: 0.634 ± 0.202
4.753ProIle: 4.753 ± 1.098
3.169ProLys: 3.169 ± 0.581
1.901ProLeu: 1.901 ± 0.573
0.634ProMet: 0.634 ± 0.242
0.634ProAsn: 0.634 ± 0.576
1.584ProPro: 1.584 ± 0.717
3.485ProGln: 3.485 ± 1.054
3.169ProArg: 3.169 ± 0.207
4.119ProSer: 4.119 ± 1.071
5.387ProThr: 5.387 ± 0.11
2.535ProVal: 2.535 ± 0.388
0.0ProTrp: 0.0 ± 0.0
1.901ProTyr: 1.901 ± 0.612
0.0ProXaa: 0.0 ± 0.0
Gln
2.218GlnAla: 2.218 ± 0.343
0.317GlnCys: 0.317 ± 0.206
0.951GlnAsp: 0.951 ± 0.142
1.267GlnGlu: 1.267 ± 0.403
2.535GlnPhe: 2.535 ± 1.146
2.218GlnGly: 2.218 ± 1.254
0.951GlnHis: 0.951 ± 0.306
1.584GlnIle: 1.584 ± 0.476
4.119GlnLys: 4.119 ± 1.4
2.852GlnLeu: 2.852 ± 0.733
1.901GlnMet: 1.901 ± 0.283
1.267GlnAsn: 1.267 ± 0.482
2.852GlnPro: 2.852 ± 0.425
2.535GlnGln: 2.535 ± 0.806
2.852GlnArg: 2.852 ± 0.355
1.901GlnSer: 1.901 ± 0.283
2.852GlnThr: 2.852 ± 0.131
3.802GlnVal: 3.802 ± 1.219
0.0GlnTrp: 0.0 ± 0.0
0.317GlnTyr: 0.317 ± 0.288
0.0GlnXaa: 0.0 ± 0.0
Arg
3.485ArgAla: 3.485 ± 2.008
0.317ArgCys: 0.317 ± 0.206
3.802ArgAsp: 3.802 ± 0.675
3.169ArgGlu: 3.169 ± 1.083
1.584ArgPhe: 1.584 ± 0.196
3.802ArgGly: 3.802 ± 0.386
0.634ArgHis: 0.634 ± 0.413
3.169ArgIle: 3.169 ± 0.207
2.218ArgLys: 2.218 ± 0.536
4.753ArgLeu: 4.753 ± 1.427
1.267ArgMet: 1.267 ± 0.403
2.535ArgAsn: 2.535 ± 0.388
0.951ArgPro: 0.951 ± 0.306
2.218ArgGln: 2.218 ± 0.101
2.218ArgArg: 2.218 ± 0.101
3.485ArgSer: 3.485 ± 1.665
5.387ArgThr: 5.387 ± 0.768
4.753ArgVal: 4.753 ± 0.673
0.317ArgTrp: 0.317 ± 0.206
1.267ArgTyr: 1.267 ± 0.482
0.0ArgXaa: 0.0 ± 0.0
Ser
4.119SerAla: 4.119 ± 0.808
0.634SerCys: 0.634 ± 0.202
4.119SerAsp: 4.119 ± 1.077
2.218SerGlu: 2.218 ± 0.101
1.901SerPhe: 1.901 ± 0.299
6.337SerGly: 6.337 ± 0.77
1.267SerHis: 1.267 ± 0.406
1.901SerIle: 1.901 ± 1.334
3.485SerLys: 3.485 ± 0.394
3.802SerLeu: 3.802 ± 0.239
0.634SerMet: 0.634 ± 0.576
1.901SerAsn: 1.901 ± 0.976
2.218SerPro: 2.218 ± 0.516
5.07SerGln: 5.07 ± 0.935
2.535SerArg: 2.535 ± 1.906
2.218SerSer: 2.218 ± 0.536
6.02SerThr: 6.02 ± 1.244
2.218SerVal: 2.218 ± 0.516
0.317SerTrp: 0.317 ± 0.288
2.218SerTyr: 2.218 ± 0.536
0.0SerXaa: 0.0 ± 0.0
Thr
6.02ThrAla: 6.02 ± 0.244
1.267ThrCys: 1.267 ± 0.403
3.169ThrAsp: 3.169 ± 0.582
5.07ThrGlu: 5.07 ± 0.387
2.852ThrPhe: 2.852 ± 0.425
5.703ThrGly: 5.703 ± 1.477
0.317ThrHis: 0.317 ± 0.27
3.802ThrIle: 3.802 ± 0.781
3.485ThrLys: 3.485 ± 1.201
7.921ThrLeu: 7.921 ± 0.936
3.802ThrMet: 3.802 ± 0.239
3.169ThrAsn: 3.169 ± 1.38
3.169ThrPro: 3.169 ± 0.582
3.169ThrGln: 3.169 ± 0.671
4.119ThrArg: 4.119 ± 0.703
5.387ThrSer: 5.387 ± 2.579
7.921ThrThr: 7.921 ± 1.576
6.654ThrVal: 6.654 ± 0.734
0.951ThrTrp: 0.951 ± 0.346
2.852ThrTyr: 2.852 ± 0.488
0.0ThrXaa: 0.0 ± 0.0
Val
1.267ValAla: 1.267 ± 0.403
2.218ValCys: 2.218 ± 0.663
1.584ValAsp: 1.584 ± 0.69
3.169ValGlu: 3.169 ± 0.233
1.901ValPhe: 1.901 ± 0.605
4.119ValGly: 4.119 ± 0.091
1.901ValHis: 1.901 ± 0.188
4.753ValIle: 4.753 ± 1.427
6.654ValLys: 6.654 ± 1.171
6.654ValLeu: 6.654 ± 0.303
1.901ValMet: 1.901 ± 0.605
5.703ValAsn: 5.703 ± 1.361
2.535ValPro: 2.535 ± 0.534
0.951ValGln: 0.951 ± 0.142
6.337ValArg: 6.337 ± 1.327
3.802ValSer: 3.802 ± 1.219
6.654ValThr: 6.654 ± 1.886
7.605ValVal: 7.605 ± 1.989
0.634ValTrp: 0.634 ± 0.576
1.901ValTyr: 1.901 ± 0.726
0.0ValXaa: 0.0 ± 0.0
Trp
0.634TrpAla: 0.634 ± 0.413
0.317TrpCys: 0.317 ± 0.288
0.634TrpAsp: 0.634 ± 0.202
0.634TrpGlu: 0.634 ± 0.202
0.317TrpPhe: 0.317 ± 0.288
0.951TrpGly: 0.951 ± 0.306
0.317TrpHis: 0.317 ± 0.206
1.267TrpIle: 1.267 ± 0.403
0.951TrpLys: 0.951 ± 0.346
1.584TrpLeu: 1.584 ± 0.291
0.0TrpMet: 0.0 ± 0.0
1.267TrpAsn: 1.267 ± 0.08
0.0TrpPro: 0.0 ± 0.0
0.317TrpGln: 0.317 ± 0.288
0.317TrpArg: 0.317 ± 0.206
0.951TrpSer: 0.951 ± 0.49
0.317TrpThr: 0.317 ± 0.206
2.218TrpVal: 2.218 ± 0.101
0.634TrpTrp: 0.634 ± 0.576
1.267TrpTyr: 1.267 ± 0.484
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.218TyrAla: 2.218 ± 0.101
0.0TyrCys: 0.0 ± 0.0
3.169TyrAsp: 3.169 ± 0.951
4.119TyrGlu: 4.119 ± 0.524
2.218TyrPhe: 2.218 ± 0.343
3.169TyrGly: 3.169 ± 0.951
0.634TyrHis: 0.634 ± 0.242
0.951TyrIle: 0.951 ± 0.306
1.267TyrLys: 1.267 ± 0.08
2.535TyrLeu: 2.535 ± 1.205
0.951TyrMet: 0.951 ± 0.142
0.317TyrAsn: 0.317 ± 0.288
2.218TyrPro: 2.218 ± 0.343
0.634TyrGln: 0.634 ± 0.242
1.901TyrArg: 1.901 ± 0.484
3.485TyrSer: 3.485 ± 0.763
2.218TyrThr: 2.218 ± 0.101
2.218TyrVal: 2.218 ± 0.343
0.0TyrTrp: 0.0 ± 0.0
3.169TyrTyr: 3.169 ± 1.008
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3157 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski