Amino acid dipepetide frequency for Yata virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.265AlaAla: 1.265 ± 0.74
1.054AlaCys: 1.054 ± 0.414
2.741AlaAsp: 2.741 ± 0.794
1.687AlaGlu: 1.687 ± 0.754
1.265AlaPhe: 1.265 ± 0.381
2.531AlaGly: 2.531 ± 0.922
1.687AlaHis: 1.687 ± 0.875
1.687AlaIle: 1.687 ± 0.64
1.898AlaLys: 1.898 ± 0.696
4.429AlaLeu: 4.429 ± 0.819
0.633AlaMet: 0.633 ± 0.308
1.898AlaAsn: 1.898 ± 0.809
1.687AlaPro: 1.687 ± 0.747
1.265AlaGln: 1.265 ± 0.617
1.476AlaArg: 1.476 ± 0.522
2.109AlaSer: 2.109 ± 0.696
2.109AlaThr: 2.109 ± 0.816
1.687AlaVal: 1.687 ± 0.647
0.211AlaTrp: 0.211 ± 0.416
2.32AlaTyr: 2.32 ± 0.682
0.0AlaXaa: 0.0 ± 0.0
Cys
0.633CysAla: 0.633 ± 0.308
0.211CysCys: 0.211 ± 0.35
0.844CysAsp: 0.844 ± 0.238
0.0CysGlu: 0.0 ± 0.0
1.054CysPhe: 1.054 ± 0.549
0.633CysGly: 0.633 ± 0.276
0.211CysHis: 0.211 ± 0.35
2.32CysIle: 2.32 ± 0.57
1.898CysLys: 1.898 ± 0.461
2.531CysLeu: 2.531 ± 0.584
0.633CysMet: 0.633 ± 0.57
1.898CysAsn: 1.898 ± 0.435
0.844CysPro: 0.844 ± 0.41
0.633CysGln: 0.633 ± 0.508
1.054CysArg: 1.054 ± 0.743
1.687CysSer: 1.687 ± 0.558
1.265CysThr: 1.265 ± 0.407
0.633CysVal: 0.633 ± 0.242
0.422CysTrp: 0.422 ± 0.273
0.422CysTyr: 0.422 ± 0.487
0.0CysXaa: 0.0 ± 0.0
Asp
2.531AspAla: 2.531 ± 0.694
1.687AspCys: 1.687 ± 0.525
3.374AspAsp: 3.374 ± 0.732
3.796AspGlu: 3.796 ± 1.093
3.163AspPhe: 3.163 ± 0.978
3.163AspGly: 3.163 ± 0.964
1.265AspHis: 1.265 ± 0.782
3.796AspIle: 3.796 ± 0.861
3.585AspLys: 3.585 ± 0.631
5.272AspLeu: 5.272 ± 1.575
1.265AspMet: 1.265 ± 0.533
3.585AspAsn: 3.585 ± 0.773
2.952AspPro: 2.952 ± 0.782
3.374AspGln: 3.374 ± 0.848
2.741AspArg: 2.741 ± 0.397
3.585AspSer: 3.585 ± 0.972
2.531AspThr: 2.531 ± 0.939
3.374AspVal: 3.374 ± 0.471
1.687AspTrp: 1.687 ± 0.592
3.163AspTyr: 3.163 ± 0.674
0.0AspXaa: 0.0 ± 0.0
Glu
1.898GluAla: 1.898 ± 0.384
1.265GluCys: 1.265 ± 0.638
4.85GluAsp: 4.85 ± 0.976
5.061GluGlu: 5.061 ± 1.474
3.163GluPhe: 3.163 ± 1.026
3.374GluGly: 3.374 ± 1.213
1.265GluHis: 1.265 ± 0.611
4.639GluIle: 4.639 ± 1.08
4.218GluLys: 4.218 ± 0.699
8.013GluLeu: 8.013 ± 0.863
1.687GluMet: 1.687 ± 0.991
2.531GluAsn: 2.531 ± 0.422
1.898GluPro: 1.898 ± 0.798
0.633GluGln: 0.633 ± 0.444
3.163GluArg: 3.163 ± 0.517
4.639GluSer: 4.639 ± 0.571
3.796GluThr: 3.796 ± 0.771
2.952GluVal: 2.952 ± 0.512
0.844GluTrp: 0.844 ± 0.665
1.687GluTyr: 1.687 ± 0.51
0.0GluXaa: 0.0 ± 0.0
Phe
1.054PheAla: 1.054 ± 0.455
1.054PheCys: 1.054 ± 0.455
1.687PheAsp: 1.687 ± 0.543
2.952PheGlu: 2.952 ± 0.933
1.898PhePhe: 1.898 ± 0.494
2.32PheGly: 2.32 ± 0.636
1.687PheHis: 1.687 ± 0.666
1.476PheIle: 1.476 ± 0.465
4.639PheLys: 4.639 ± 0.846
3.163PheLeu: 3.163 ± 0.59
0.844PheMet: 0.844 ± 0.383
2.741PheAsn: 2.741 ± 0.884
1.265PhePro: 1.265 ± 0.406
1.265PheGln: 1.265 ± 0.581
1.898PheArg: 1.898 ± 0.548
4.007PheSer: 4.007 ± 0.789
1.898PheThr: 1.898 ± 0.552
2.109PheVal: 2.109 ± 0.763
0.422PheTrp: 0.422 ± 0.305
0.633PheTyr: 0.633 ± 0.617
0.0PheXaa: 0.0 ± 0.0
Gly
0.844GlyAla: 0.844 ± 0.457
0.422GlyCys: 0.422 ± 0.273
2.109GlyAsp: 2.109 ± 0.407
2.531GlyGlu: 2.531 ± 0.789
1.898GlyPhe: 1.898 ± 0.423
3.585GlyGly: 3.585 ± 0.776
1.265GlyHis: 1.265 ± 0.491
6.116GlyIle: 6.116 ± 1.247
5.272GlyLys: 5.272 ± 0.864
8.013GlyLeu: 8.013 ± 1.327
1.054GlyMet: 1.054 ± 0.504
2.32GlyAsn: 2.32 ± 1.211
2.531GlyPro: 2.531 ± 1.064
1.476GlyGln: 1.476 ± 0.566
2.109GlyArg: 2.109 ± 0.593
4.639GlySer: 4.639 ± 0.957
3.585GlyThr: 3.585 ± 0.807
4.85GlyVal: 4.85 ± 1.405
1.476GlyTrp: 1.476 ± 0.526
2.741GlyTyr: 2.741 ± 0.9
0.0GlyXaa: 0.0 ± 0.0
His
1.054HisAla: 1.054 ± 0.683
0.633HisCys: 0.633 ± 0.441
1.054HisAsp: 1.054 ± 0.379
1.476HisGlu: 1.476 ± 0.452
0.422HisPhe: 0.422 ± 0.217
1.265HisGly: 1.265 ± 0.406
0.633HisHis: 0.633 ± 0.708
2.531HisIle: 2.531 ± 0.81
1.687HisLys: 1.687 ± 0.808
1.687HisLeu: 1.687 ± 0.62
0.633HisMet: 0.633 ± 0.644
0.844HisAsn: 0.844 ± 0.398
1.054HisPro: 1.054 ± 0.303
0.844HisGln: 0.844 ± 0.682
0.633HisArg: 0.633 ± 0.276
1.054HisSer: 1.054 ± 0.395
1.476HisThr: 1.476 ± 0.666
1.898HisVal: 1.898 ± 1.07
0.422HisTrp: 0.422 ± 0.217
1.476HisTyr: 1.476 ± 0.651
0.0HisXaa: 0.0 ± 0.0
Ile
2.741IleAla: 2.741 ± 0.38
2.32IleCys: 2.32 ± 0.779
5.061IleAsp: 5.061 ± 0.738
5.694IleGlu: 5.694 ± 1.05
2.32IlePhe: 2.32 ± 0.673
6.116IleGly: 6.116 ± 1.266
1.476IleHis: 1.476 ± 0.931
7.381IleIle: 7.381 ± 1.103
7.381IleLys: 7.381 ± 1.21
6.748IleLeu: 6.748 ± 1.039
0.844IleMet: 0.844 ± 0.704
4.007IleAsn: 4.007 ± 0.836
5.272IlePro: 5.272 ± 1.296
2.531IleGln: 2.531 ± 0.578
4.429IleArg: 4.429 ± 0.871
5.061IleSer: 5.061 ± 1.339
2.952IleThr: 2.952 ± 0.62
3.796IleVal: 3.796 ± 0.978
1.476IleTrp: 1.476 ± 0.536
4.218IleTyr: 4.218 ± 0.687
0.0IleXaa: 0.0 ± 0.0
Lys
2.952LysAla: 2.952 ± 0.881
1.054LysCys: 1.054 ± 0.399
3.374LysAsp: 3.374 ± 0.84
4.429LysGlu: 4.429 ± 0.805
2.109LysPhe: 2.109 ± 0.498
6.116LysGly: 6.116 ± 1.539
1.265LysHis: 1.265 ± 0.536
5.483LysIle: 5.483 ± 0.606
5.483LysLys: 5.483 ± 1.087
9.701LysLeu: 9.701 ± 1.584
1.265LysMet: 1.265 ± 0.629
4.007LysAsn: 4.007 ± 1.03
1.898LysPro: 1.898 ± 0.749
1.054LysGln: 1.054 ± 0.812
2.741LysArg: 2.741 ± 0.645
5.694LysSer: 5.694 ± 0.94
3.585LysThr: 3.585 ± 0.888
4.85LysVal: 4.85 ± 1.147
1.476LysTrp: 1.476 ± 0.742
2.952LysTyr: 2.952 ± 0.583
0.0LysXaa: 0.0 ± 0.0
Leu
3.796LeuAla: 3.796 ± 0.639
2.109LeuCys: 2.109 ± 0.79
8.857LeuAsp: 8.857 ± 1.883
6.748LeuGlu: 6.748 ± 0.992
4.007LeuPhe: 4.007 ± 0.51
4.85LeuGly: 4.85 ± 0.836
1.054LeuHis: 1.054 ± 0.682
9.701LeuIle: 9.701 ± 1.7
6.748LeuLys: 6.748 ± 1.493
8.435LeuLeu: 8.435 ± 1.718
2.741LeuMet: 2.741 ± 0.513
6.116LeuAsn: 6.116 ± 0.941
2.741LeuPro: 2.741 ± 1.118
3.163LeuGln: 3.163 ± 0.785
6.116LeuArg: 6.116 ± 0.579
7.803LeuSer: 7.803 ± 1.03
6.748LeuThr: 6.748 ± 1.084
3.796LeuVal: 3.796 ± 0.695
1.476LeuTrp: 1.476 ± 0.713
2.952LeuTyr: 2.952 ± 1.206
0.0LeuXaa: 0.0 ± 0.0
Met
1.265MetAla: 1.265 ± 0.477
0.633MetCys: 0.633 ± 0.431
1.687MetAsp: 1.687 ± 0.743
2.32MetGlu: 2.32 ± 0.581
1.265MetPhe: 1.265 ± 0.51
1.476MetGly: 1.476 ± 0.585
0.0MetHis: 0.0 ± 0.0
2.741MetIle: 2.741 ± 0.833
2.109MetLys: 2.109 ± 1.231
1.476MetLeu: 1.476 ± 0.519
1.054MetMet: 1.054 ± 0.516
1.054MetAsn: 1.054 ± 0.423
0.633MetPro: 0.633 ± 0.57
0.633MetGln: 0.633 ± 0.268
0.844MetArg: 0.844 ± 0.427
2.741MetSer: 2.741 ± 1.224
1.265MetThr: 1.265 ± 0.725
1.476MetVal: 1.476 ± 0.669
0.0MetTrp: 0.0 ± 0.0
1.687MetTyr: 1.687 ± 0.844
0.0MetXaa: 0.0 ± 0.0
Asn
1.476AsnAla: 1.476 ± 0.81
0.422AsnCys: 0.422 ± 0.217
3.585AsnAsp: 3.585 ± 1.476
2.952AsnGlu: 2.952 ± 0.782
1.687AsnPhe: 1.687 ± 0.476
3.374AsnGly: 3.374 ± 0.728
1.687AsnHis: 1.687 ± 0.532
4.639AsnIle: 4.639 ± 1.184
3.585AsnLys: 3.585 ± 0.77
6.748AsnLeu: 6.748 ± 1.333
0.844AsnMet: 0.844 ± 0.316
3.163AsnAsn: 3.163 ± 0.687
2.32AsnPro: 2.32 ± 0.635
2.109AsnGln: 2.109 ± 0.547
2.109AsnArg: 2.109 ± 0.787
5.061AsnSer: 5.061 ± 1.055
4.007AsnThr: 4.007 ± 0.978
1.265AsnVal: 1.265 ± 0.432
0.633AsnTrp: 0.633 ± 0.372
2.109AsnTyr: 2.109 ± 0.67
0.0AsnXaa: 0.0 ± 0.0
Pro
1.476ProAla: 1.476 ± 0.704
0.211ProCys: 0.211 ± 0.136
2.109ProAsp: 2.109 ± 0.549
1.898ProGlu: 1.898 ± 1.113
1.898ProPhe: 1.898 ± 0.828
2.109ProGly: 2.109 ± 1.075
2.109ProHis: 2.109 ± 0.999
3.796ProIle: 3.796 ± 0.911
2.109ProLys: 2.109 ± 0.552
3.796ProLeu: 3.796 ± 1.03
2.32ProMet: 2.32 ± 0.564
2.109ProAsn: 2.109 ± 0.92
2.531ProPro: 2.531 ± 1.446
0.844ProGln: 0.844 ± 0.497
1.898ProArg: 1.898 ± 0.62
2.531ProSer: 2.531 ± 0.764
2.741ProThr: 2.741 ± 0.895
1.265ProVal: 1.265 ± 0.412
0.422ProTrp: 0.422 ± 0.335
3.163ProTyr: 3.163 ± 1.17
0.0ProXaa: 0.0 ± 0.0
Gln
1.687GlnAla: 1.687 ± 0.524
0.211GlnCys: 0.211 ± 0.136
0.844GlnAsp: 0.844 ± 0.395
2.531GlnGlu: 2.531 ± 0.634
1.476GlnPhe: 1.476 ± 0.597
1.054GlnGly: 1.054 ± 0.583
0.422GlnHis: 0.422 ± 0.273
2.109GlnIle: 2.109 ± 0.652
2.32GlnLys: 2.32 ± 0.895
1.898GlnLeu: 1.898 ± 0.572
1.054GlnMet: 1.054 ± 0.613
1.265GlnAsn: 1.265 ± 0.457
1.054GlnPro: 1.054 ± 0.322
0.844GlnGln: 0.844 ± 0.424
1.476GlnArg: 1.476 ± 0.564
2.32GlnSer: 2.32 ± 0.749
2.531GlnThr: 2.531 ± 0.542
2.109GlnVal: 2.109 ± 0.361
0.211GlnTrp: 0.211 ± 0.244
1.265GlnTyr: 1.265 ± 0.593
0.0GlnXaa: 0.0 ± 0.0
Arg
2.741ArgAla: 2.741 ± 1.181
1.476ArgCys: 1.476 ± 0.594
1.898ArgAsp: 1.898 ± 0.511
3.585ArgGlu: 3.585 ± 1.017
2.741ArgPhe: 2.741 ± 0.731
2.531ArgGly: 2.531 ± 0.932
0.844ArgHis: 0.844 ± 0.434
3.374ArgIle: 3.374 ± 1.031
2.952ArgLys: 2.952 ± 0.811
4.007ArgLeu: 4.007 ± 0.932
1.265ArgMet: 1.265 ± 0.377
2.952ArgAsn: 2.952 ± 0.854
1.898ArgPro: 1.898 ± 0.527
0.633ArgGln: 0.633 ± 0.348
1.687ArgArg: 1.687 ± 0.53
4.639ArgSer: 4.639 ± 0.968
3.163ArgThr: 3.163 ± 0.815
3.796ArgVal: 3.796 ± 0.945
0.844ArgTrp: 0.844 ± 0.398
0.844ArgTyr: 0.844 ± 0.789
0.0ArgXaa: 0.0 ± 0.0
Ser
2.109SerAla: 2.109 ± 0.66
1.687SerCys: 1.687 ± 0.517
4.429SerAsp: 4.429 ± 1.102
4.218SerGlu: 4.218 ± 1.162
3.163SerPhe: 3.163 ± 1.319
4.218SerGly: 4.218 ± 0.792
1.898SerHis: 1.898 ± 0.426
5.483SerIle: 5.483 ± 1.42
4.007SerLys: 4.007 ± 0.8
8.435SerLeu: 8.435 ± 1.324
2.32SerMet: 2.32 ± 0.572
2.32SerAsn: 2.32 ± 0.595
2.32SerPro: 2.32 ± 0.727
2.32SerGln: 2.32 ± 0.713
5.061SerArg: 5.061 ± 0.574
5.061SerSer: 5.061 ± 2.485
3.796SerThr: 3.796 ± 0.807
4.429SerVal: 4.429 ± 1.083
3.163SerTrp: 3.163 ± 0.864
4.639SerTyr: 4.639 ± 1.61
0.0SerXaa: 0.0 ± 0.0
Thr
1.898ThrAla: 1.898 ± 0.556
0.844ThrCys: 0.844 ± 0.344
3.374ThrAsp: 3.374 ± 0.915
2.952ThrGlu: 2.952 ± 0.719
1.265ThrPhe: 1.265 ± 0.463
2.531ThrGly: 2.531 ± 0.629
2.109ThrHis: 2.109 ± 0.55
6.116ThrIle: 6.116 ± 1.098
3.796ThrLys: 3.796 ± 0.835
4.639ThrLeu: 4.639 ± 1.385
1.687ThrMet: 1.687 ± 0.655
3.163ThrAsn: 3.163 ± 0.706
2.952ThrPro: 2.952 ± 1.795
2.109ThrGln: 2.109 ± 0.361
3.374ThrArg: 3.374 ± 0.791
4.218ThrSer: 4.218 ± 1.401
4.218ThrThr: 4.218 ± 1.359
3.163ThrVal: 3.163 ± 0.66
1.687ThrTrp: 1.687 ± 0.691
3.163ThrTyr: 3.163 ± 1.122
0.0ThrXaa: 0.0 ± 0.0
Val
2.531ValAla: 2.531 ± 0.955
1.054ValCys: 1.054 ± 0.421
4.429ValAsp: 4.429 ± 1.288
3.163ValGlu: 3.163 ± 1.228
1.054ValPhe: 1.054 ± 0.626
3.163ValGly: 3.163 ± 0.95
0.844ValHis: 0.844 ± 0.523
3.796ValIle: 3.796 ± 0.932
2.531ValLys: 2.531 ± 0.76
5.694ValLeu: 5.694 ± 0.78
2.109ValMet: 2.109 ± 0.555
3.374ValAsn: 3.374 ± 1.207
3.374ValPro: 3.374 ± 0.651
1.265ValGln: 1.265 ± 0.773
2.32ValArg: 2.32 ± 0.661
2.109ValSer: 2.109 ± 0.725
4.218ValThr: 4.218 ± 0.475
3.163ValVal: 3.163 ± 0.741
0.211ValTrp: 0.211 ± 0.136
3.163ValTyr: 3.163 ± 1.055
0.0ValXaa: 0.0 ± 0.0
Trp
0.422TrpAla: 0.422 ± 0.273
0.211TrpCys: 0.211 ± 0.235
0.844TrpAsp: 0.844 ± 0.357
1.476TrpGlu: 1.476 ± 0.605
1.476TrpPhe: 1.476 ± 0.435
1.898TrpGly: 1.898 ± 1.187
0.211TrpHis: 0.211 ± 0.136
2.32TrpIle: 2.32 ± 0.465
1.265TrpLys: 1.265 ± 0.288
0.633TrpLeu: 0.633 ± 0.348
0.633TrpMet: 0.633 ± 0.349
1.265TrpAsn: 1.265 ± 0.401
0.633TrpPro: 0.633 ± 0.276
0.211TrpGln: 0.211 ± 0.136
1.476TrpArg: 1.476 ± 0.604
1.476TrpSer: 1.476 ± 0.484
0.633TrpThr: 0.633 ± 0.391
0.633TrpVal: 0.633 ± 0.391
0.422TrpTrp: 0.422 ± 0.416
0.422TrpTyr: 0.422 ± 0.375
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.265TyrAla: 1.265 ± 0.47
1.265TyrCys: 1.265 ± 0.531
2.741TyrAsp: 2.741 ± 1.205
2.109TyrGlu: 2.109 ± 0.72
1.687TyrPhe: 1.687 ± 0.669
2.531TyrGly: 2.531 ± 0.809
0.844TyrHis: 0.844 ± 0.377
2.531TyrIle: 2.531 ± 0.994
4.007TyrLys: 4.007 ± 0.697
4.639TyrLeu: 4.639 ± 1.174
1.265TyrMet: 1.265 ± 0.743
3.163TyrAsn: 3.163 ± 0.588
1.476TyrPro: 1.476 ± 0.741
1.476TyrGln: 1.476 ± 0.626
1.265TyrArg: 1.265 ± 0.455
4.639TyrSer: 4.639 ± 2.017
2.741TyrThr: 2.741 ± 1.417
2.32TyrVal: 2.32 ± 0.588
1.054TyrTrp: 1.054 ± 0.828
1.687TyrTyr: 1.687 ± 0.488
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 11 proteins (4743 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski