Amino acid dipepetide frequency for Lye Green virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.883AlaAla: 2.883 ± 1.958
1.442AlaCys: 1.442 ± 0.654
2.643AlaAsp: 2.643 ± 0.662
3.123AlaGlu: 3.123 ± 1.33
2.162AlaPhe: 2.162 ± 0.738
3.123AlaGly: 3.123 ± 0.652
0.721AlaHis: 0.721 ± 0.274
2.643AlaIle: 2.643 ± 1.297
1.922AlaLys: 1.922 ± 1.099
7.448AlaLeu: 7.448 ± 2.176
1.682AlaMet: 1.682 ± 0.798
1.201AlaAsn: 1.201 ± 1.031
1.922AlaPro: 1.922 ± 0.538
3.364AlaGln: 3.364 ± 0.981
2.883AlaArg: 2.883 ± 0.587
4.805AlaSer: 4.805 ± 1.332
2.883AlaThr: 2.883 ± 0.852
1.682AlaVal: 1.682 ± 0.602
0.481AlaTrp: 0.481 ± 0.261
1.201AlaTyr: 1.201 ± 0.594
0.0AlaXaa: 0.0 ± 0.0
Cys
0.481CysAla: 0.481 ± 0.299
0.0CysCys: 0.0 ± 0.0
1.201CysAsp: 1.201 ± 0.512
1.442CysGlu: 1.442 ± 0.673
0.24CysPhe: 0.24 ± 0.15
0.721CysGly: 0.721 ± 0.327
0.24CysHis: 0.24 ± 0.15
1.201CysIle: 1.201 ± 0.512
0.481CysLys: 0.481 ± 0.284
1.922CysLeu: 1.922 ± 0.242
0.481CysMet: 0.481 ± 0.278
0.961CysAsn: 0.961 ± 0.523
0.961CysPro: 0.961 ± 0.422
0.481CysGln: 0.481 ± 0.454
0.481CysArg: 0.481 ± 0.299
1.201CysSer: 1.201 ± 0.538
0.24CysThr: 0.24 ± 0.315
0.961CysVal: 0.961 ± 0.705
0.0CysTrp: 0.0 ± 0.0
0.481CysTyr: 0.481 ± 0.454
0.0CysXaa: 0.0 ± 0.0
Asp
1.442AspAla: 1.442 ± 1.236
0.961AspCys: 0.961 ± 0.897
2.162AspAsp: 2.162 ± 0.977
1.201AspGlu: 1.201 ± 0.595
1.201AspPhe: 1.201 ± 0.317
2.162AspGly: 2.162 ± 1.109
1.682AspHis: 1.682 ± 0.726
5.286AspIle: 5.286 ± 1.112
1.442AspLys: 1.442 ± 0.464
4.565AspLeu: 4.565 ± 0.991
1.201AspMet: 1.201 ± 0.452
1.922AspAsn: 1.922 ± 0.526
5.286AspPro: 5.286 ± 0.96
1.922AspGln: 1.922 ± 0.923
1.922AspArg: 1.922 ± 0.251
2.883AspSer: 2.883 ± 0.886
1.682AspThr: 1.682 ± 0.602
0.961AspVal: 0.961 ± 0.353
0.721AspTrp: 0.721 ± 0.274
1.682AspTyr: 1.682 ± 0.863
0.0AspXaa: 0.0 ± 0.0
Glu
2.403GluAla: 2.403 ± 1.064
1.201GluCys: 1.201 ± 0.317
2.162GluAsp: 2.162 ± 0.763
2.883GluGlu: 2.883 ± 0.793
2.643GluPhe: 2.643 ± 1.046
4.085GluGly: 4.085 ± 0.868
1.682GluHis: 1.682 ± 1.156
1.682GluIle: 1.682 ± 0.716
1.682GluLys: 1.682 ± 0.351
6.247GluLeu: 6.247 ± 2.531
1.682GluMet: 1.682 ± 0.183
2.643GluAsn: 2.643 ± 0.696
1.201GluPro: 1.201 ± 0.891
1.922GluGln: 1.922 ± 0.242
1.682GluArg: 1.682 ± 0.496
2.162GluSer: 2.162 ± 0.454
3.123GluThr: 3.123 ± 0.513
1.682GluVal: 1.682 ± 0.805
0.961GluTrp: 0.961 ± 0.357
2.883GluTyr: 2.883 ± 0.856
0.0GluXaa: 0.0 ± 0.0
Phe
1.201PheAla: 1.201 ± 0.594
1.201PheCys: 1.201 ± 0.342
2.643PheAsp: 2.643 ± 0.687
1.682PheGlu: 1.682 ± 0.381
1.442PhePhe: 1.442 ± 0.516
1.922PheGly: 1.922 ± 0.609
1.922PheHis: 1.922 ± 0.538
1.682PheIle: 1.682 ± 0.401
2.162PheLys: 2.162 ± 0.826
3.844PheLeu: 3.844 ± 1.074
0.24PheMet: 0.24 ± 0.15
0.721PheAsn: 0.721 ± 0.449
1.682PhePro: 1.682 ± 0.401
1.201PheGln: 1.201 ± 0.342
1.922PheArg: 1.922 ± 0.408
4.085PheSer: 4.085 ± 1.151
2.883PheThr: 2.883 ± 0.957
1.922PheVal: 1.922 ± 0.408
0.481PheTrp: 0.481 ± 0.278
0.961PheTyr: 0.961 ± 0.343
0.0PheXaa: 0.0 ± 0.0
Gly
3.604GlyAla: 3.604 ± 1.16
1.442GlyCys: 1.442 ± 0.589
2.643GlyAsp: 2.643 ± 0.604
1.442GlyGlu: 1.442 ± 0.589
1.922GlyPhe: 1.922 ± 0.627
2.883GlyGly: 2.883 ± 0.818
1.442GlyHis: 1.442 ± 0.22
2.883GlyIle: 2.883 ± 1.135
1.201GlyLys: 1.201 ± 0.342
5.286GlyLeu: 5.286 ± 1.066
0.24GlyMet: 0.24 ± 0.353
1.201GlyAsn: 1.201 ± 0.482
0.961GlyPro: 0.961 ± 0.698
0.961GlyGln: 0.961 ± 0.34
3.123GlyArg: 3.123 ± 0.64
3.604GlySer: 3.604 ± 1.815
3.364GlyThr: 3.364 ± 0.959
2.883GlyVal: 2.883 ± 0.637
1.442GlyTrp: 1.442 ± 0.599
2.883GlyTyr: 2.883 ± 0.698
0.481GlyXaa: 0.481 ± 0.299
His
1.922HisAla: 1.922 ± 1.099
0.481HisCys: 0.481 ± 0.299
1.201HisAsp: 1.201 ± 0.888
0.481HisGlu: 0.481 ± 0.261
0.721HisPhe: 0.721 ± 0.695
1.201HisGly: 1.201 ± 0.829
0.24HisHis: 0.24 ± 0.353
0.961HisIle: 0.961 ± 0.618
0.961HisLys: 0.961 ± 0.301
5.046HisLeu: 5.046 ± 1.613
0.481HisMet: 0.481 ± 0.299
2.162HisAsn: 2.162 ± 0.414
1.442HisPro: 1.442 ± 0.464
1.201HisGln: 1.201 ± 0.386
1.922HisArg: 1.922 ± 0.625
1.201HisSer: 1.201 ± 0.572
2.162HisThr: 2.162 ± 0.941
1.442HisVal: 1.442 ± 0.443
0.481HisTrp: 0.481 ± 0.278
0.481HisTyr: 0.481 ± 0.284
0.0HisXaa: 0.0 ± 0.0
Ile
2.643IleAla: 2.643 ± 0.686
1.682IleCys: 1.682 ± 0.426
2.162IleAsp: 2.162 ± 1.152
4.325IleGlu: 4.325 ± 1.168
2.403IlePhe: 2.403 ± 0.892
2.162IleGly: 2.162 ± 0.682
1.442IleHis: 1.442 ± 0.569
2.883IleIle: 2.883 ± 0.974
2.162IleLys: 2.162 ± 1.063
7.929IleLeu: 7.929 ± 1.141
2.162IleMet: 2.162 ± 0.798
3.123IleAsn: 3.123 ± 1.319
4.325IlePro: 4.325 ± 1.359
3.123IleGln: 3.123 ± 0.644
5.286IleArg: 5.286 ± 1.112
4.565IleSer: 4.565 ± 0.82
2.403IleThr: 2.403 ± 0.856
2.403IleVal: 2.403 ± 0.355
1.682IleTrp: 1.682 ± 0.346
2.162IleTyr: 2.162 ± 0.472
0.0IleXaa: 0.0 ± 0.0
Lys
1.922LysAla: 1.922 ± 0.625
0.721LysCys: 0.721 ± 0.359
1.682LysAsp: 1.682 ± 0.768
1.442LysGlu: 1.442 ± 0.409
2.643LysPhe: 2.643 ± 0.602
2.162LysGly: 2.162 ± 0.62
0.961LysHis: 0.961 ± 0.343
4.565LysIle: 4.565 ± 0.962
2.403LysLys: 2.403 ± 1.515
3.604LysLeu: 3.604 ± 0.703
0.961LysMet: 0.961 ± 0.387
3.604LysAsn: 3.604 ± 2.016
1.442LysPro: 1.442 ± 1.135
2.162LysGln: 2.162 ± 0.67
2.883LysArg: 2.883 ± 1.575
1.922LysSer: 1.922 ± 0.549
3.604LysThr: 3.604 ± 1.351
1.442LysVal: 1.442 ± 0.719
0.721LysTrp: 0.721 ± 0.274
2.162LysTyr: 2.162 ± 0.592
0.0LysXaa: 0.0 ± 0.0
Leu
5.046LeuAla: 5.046 ± 0.951
0.721LeuCys: 0.721 ± 0.359
4.085LeuAsp: 4.085 ± 0.311
5.286LeuGlu: 5.286 ± 1.274
5.046LeuPhe: 5.046 ± 1.335
4.325LeuGly: 4.325 ± 0.656
4.805LeuHis: 4.805 ± 0.521
5.286LeuIle: 5.286 ± 0.906
5.526LeuLys: 5.526 ± 1.007
11.293LeuLeu: 11.293 ± 1.212
3.123LeuMet: 3.123 ± 0.256
5.046LeuAsn: 5.046 ± 1.131
6.487LeuPro: 6.487 ± 1.232
4.805LeuGln: 4.805 ± 1.01
7.448LeuArg: 7.448 ± 1.639
8.89LeuSer: 8.89 ± 1.361
6.968LeuThr: 6.968 ± 2.298
5.766LeuVal: 5.766 ± 1.946
1.682LeuTrp: 1.682 ± 0.717
3.364LeuTyr: 3.364 ± 0.916
0.481LeuXaa: 0.481 ± 0.299
Met
1.922MetAla: 1.922 ± 0.436
0.24MetCys: 0.24 ± 0.15
1.682MetAsp: 1.682 ± 0.424
0.961MetGlu: 0.961 ± 0.262
1.201MetPhe: 1.201 ± 0.452
2.403MetGly: 2.403 ± 1.089
0.0MetHis: 0.0 ± 0.0
1.442MetIle: 1.442 ± 0.28
1.442MetLys: 1.442 ± 0.743
2.162MetLeu: 2.162 ± 0.472
0.721MetMet: 0.721 ± 0.399
0.961MetAsn: 0.961 ± 0.765
0.721MetPro: 0.721 ± 0.399
0.721MetGln: 0.721 ± 0.327
1.201MetArg: 1.201 ± 0.469
2.162MetSer: 2.162 ± 0.797
1.442MetThr: 1.442 ± 0.22
1.442MetVal: 1.442 ± 0.754
0.481MetTrp: 0.481 ± 0.491
0.721MetTyr: 0.721 ± 0.364
0.0MetXaa: 0.0 ± 0.0
Asn
1.442AsnAla: 1.442 ± 1.236
0.0AsnCys: 0.0 ± 0.0
1.442AsnAsp: 1.442 ± 0.923
1.682AsnGlu: 1.682 ± 0.641
3.123AsnPhe: 3.123 ± 1.39
0.961AsnGly: 0.961 ± 0.621
1.682AsnHis: 1.682 ± 0.726
3.364AsnIle: 3.364 ± 0.631
2.643AsnLys: 2.643 ± 0.18
4.325AsnLeu: 4.325 ± 0.654
1.442AsnMet: 1.442 ± 0.629
3.123AsnAsn: 3.123 ± 1.146
3.604AsnPro: 3.604 ± 1.198
1.682AsnGln: 1.682 ± 0.957
1.442AsnArg: 1.442 ± 0.674
3.123AsnSer: 3.123 ± 0.941
2.162AsnThr: 2.162 ± 0.538
2.403AsnVal: 2.403 ± 0.808
0.721AsnTrp: 0.721 ± 0.79
3.604AsnTyr: 3.604 ± 0.693
0.0AsnXaa: 0.0 ± 0.0
Pro
3.604ProAla: 3.604 ± 0.613
0.481ProCys: 0.481 ± 0.284
2.643ProAsp: 2.643 ± 1.039
3.364ProGlu: 3.364 ± 0.63
0.481ProPhe: 0.481 ± 0.278
2.883ProGly: 2.883 ± 0.829
0.721ProHis: 0.721 ± 0.272
2.883ProIle: 2.883 ± 0.67
1.682ProLys: 1.682 ± 0.426
6.247ProLeu: 6.247 ± 0.477
1.682ProMet: 1.682 ± 1.479
2.403ProAsn: 2.403 ± 0.619
3.123ProPro: 3.123 ± 0.734
2.162ProGln: 2.162 ± 0.646
1.442ProArg: 1.442 ± 0.792
7.208ProSer: 7.208 ± 1.077
4.085ProThr: 4.085 ± 0.814
3.364ProVal: 3.364 ± 0.84
0.24ProTrp: 0.24 ± 0.328
1.442ProTyr: 1.442 ± 0.654
0.0ProXaa: 0.0 ± 0.0
Gln
2.883GlnAla: 2.883 ± 0.794
0.481GlnCys: 0.481 ± 0.454
2.403GlnAsp: 2.403 ± 0.506
2.643GlnGlu: 2.643 ± 0.416
2.162GlnPhe: 2.162 ± 0.556
1.922GlnGly: 1.922 ± 0.242
0.961GlnHis: 0.961 ± 0.462
1.922GlnIle: 1.922 ± 0.517
2.403GlnLys: 2.403 ± 1.01
3.364GlnLeu: 3.364 ± 1.023
1.922GlnMet: 1.922 ± 0.28
1.201GlnAsn: 1.201 ± 0.317
0.721GlnPro: 0.721 ± 0.364
0.481GlnGln: 0.481 ± 0.284
3.123GlnArg: 3.123 ± 0.67
3.844GlnSer: 3.844 ± 0.879
2.643GlnThr: 2.643 ± 0.788
1.442GlnVal: 1.442 ± 0.379
0.481GlnTrp: 0.481 ± 0.299
1.201GlnTyr: 1.201 ± 0.538
0.0GlnXaa: 0.0 ± 0.0
Arg
3.364ArgAla: 3.364 ± 1.022
0.24ArgCys: 0.24 ± 0.15
1.922ArgAsp: 1.922 ± 0.375
3.604ArgGlu: 3.604 ± 1.616
1.442ArgPhe: 1.442 ± 0.492
2.403ArgGly: 2.403 ± 1.471
1.442ArgHis: 1.442 ± 0.56
4.805ArgIle: 4.805 ± 1.162
2.162ArgLys: 2.162 ± 0.787
5.526ArgLeu: 5.526 ± 1.269
1.682ArgMet: 1.682 ± 0.72
2.883ArgAsn: 2.883 ± 0.883
4.085ArgPro: 4.085 ± 0.972
2.403ArgGln: 2.403 ± 0.905
2.643ArgArg: 2.643 ± 0.686
3.844ArgSer: 3.844 ± 0.545
2.403ArgThr: 2.403 ± 0.433
2.403ArgVal: 2.403 ± 0.563
0.481ArgTrp: 0.481 ± 0.491
2.643ArgTyr: 2.643 ± 0.648
0.0ArgXaa: 0.0 ± 0.0
Ser
3.123SerAla: 3.123 ± 0.953
0.961SerCys: 0.961 ± 1.031
3.844SerAsp: 3.844 ± 1.574
2.883SerGlu: 2.883 ± 1.265
1.682SerPhe: 1.682 ± 0.504
3.364SerGly: 3.364 ± 1.605
2.883SerHis: 2.883 ± 0.385
5.286SerIle: 5.286 ± 0.893
3.364SerLys: 3.364 ± 0.221
11.293SerLeu: 11.293 ± 1.527
0.961SerMet: 0.961 ± 0.634
2.162SerAsn: 2.162 ± 0.414
3.604SerPro: 3.604 ± 0.931
2.883SerGln: 2.883 ± 1.043
5.766SerArg: 5.766 ± 1.313
5.526SerSer: 5.526 ± 1.641
4.325SerThr: 4.325 ± 1.82
4.805SerVal: 4.805 ± 0.876
1.442SerTrp: 1.442 ± 0.443
2.643SerTyr: 2.643 ± 0.549
0.0SerXaa: 0.0 ± 0.0
Thr
4.325ThrAla: 4.325 ± 1.048
0.24ThrCys: 0.24 ± 0.15
1.201ThrAsp: 1.201 ± 0.469
3.123ThrGlu: 3.123 ± 1.078
2.162ThrPhe: 2.162 ± 0.735
1.922ThrGly: 1.922 ± 0.932
1.682ThrHis: 1.682 ± 0.812
4.325ThrIle: 4.325 ± 1.101
2.643ThrLys: 2.643 ± 1.032
5.046ThrLeu: 5.046 ± 1.311
1.682ThrMet: 1.682 ± 0.424
3.844ThrAsn: 3.844 ± 1.412
4.325ThrPro: 4.325 ± 1.342
2.162ThrGln: 2.162 ± 1.064
3.364ThrArg: 3.364 ± 0.221
5.526ThrSer: 5.526 ± 0.694
3.604ThrThr: 3.604 ± 1.862
1.922ThrVal: 1.922 ± 0.402
0.961ThrTrp: 0.961 ± 0.705
0.961ThrTyr: 0.961 ± 0.353
0.24ThrXaa: 0.24 ± 0.15
Val
3.123ValAla: 3.123 ± 1.866
0.721ValCys: 0.721 ± 0.406
2.403ValAsp: 2.403 ± 1.144
2.403ValGlu: 2.403 ± 0.619
1.201ValPhe: 1.201 ± 0.737
2.883ValGly: 2.883 ± 0.915
0.721ValHis: 0.721 ± 0.272
4.565ValIle: 4.565 ± 1.685
2.643ValLys: 2.643 ± 0.989
4.325ValLeu: 4.325 ± 0.802
0.961ValMet: 0.961 ± 0.598
2.883ValAsn: 2.883 ± 0.926
2.643ValPro: 2.643 ± 0.386
0.24ValGln: 0.24 ± 0.353
2.162ValArg: 2.162 ± 0.595
3.604ValSer: 3.604 ± 1.025
2.162ValThr: 2.162 ± 0.65
3.844ValVal: 3.844 ± 1.335
0.961ValTrp: 0.961 ± 0.523
2.162ValTyr: 2.162 ± 1.023
0.0ValXaa: 0.0 ± 0.0
Trp
0.481TrpAla: 0.481 ± 0.6
0.24TrpCys: 0.24 ± 0.328
0.721TrpAsp: 0.721 ± 0.763
1.201TrpGlu: 1.201 ± 0.46
0.721TrpPhe: 0.721 ± 0.406
0.961TrpGly: 0.961 ± 0.348
0.24TrpHis: 0.24 ± 0.315
0.961TrpIle: 0.961 ± 0.353
1.682TrpLys: 1.682 ± 0.401
1.201TrpLeu: 1.201 ± 0.512
0.24TrpMet: 0.24 ± 0.353
1.682TrpAsn: 1.682 ± 0.505
0.721TrpPro: 0.721 ± 0.364
0.24TrpGln: 0.24 ± 0.15
0.961TrpArg: 0.961 ± 0.343
0.721TrpSer: 0.721 ± 0.406
0.961TrpThr: 0.961 ± 0.343
0.481TrpVal: 0.481 ± 0.278
0.24TrpTrp: 0.24 ± 0.15
0.481TrpTyr: 0.481 ± 0.706
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.403TyrAla: 2.403 ± 1.151
0.721TyrCys: 0.721 ± 0.327
1.442TyrAsp: 1.442 ± 0.797
1.442TyrGlu: 1.442 ± 0.516
0.961TyrPhe: 0.961 ± 0.353
1.442TyrGly: 1.442 ± 0.379
0.721TyrHis: 0.721 ± 0.272
2.403TyrIle: 2.403 ± 0.563
2.403TyrLys: 2.403 ± 0.263
4.805TyrLeu: 4.805 ± 0.294
0.24TyrMet: 0.24 ± 0.15
0.24TyrAsn: 0.24 ± 0.15
2.643TyrPro: 2.643 ± 0.525
4.085TyrGln: 4.085 ± 1.023
0.721TyrArg: 0.721 ± 0.327
1.682TyrSer: 1.682 ± 0.768
2.162TyrThr: 2.162 ± 0.477
3.123TyrVal: 3.123 ± 1.638
0.481TyrTrp: 0.481 ± 0.261
1.922TyrTyr: 1.922 ± 0.873
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.24XaaPhe: 0.24 ± 0.15
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.24XaaPro: 0.24 ± 0.15
0.24XaaGln: 0.24 ± 0.15
0.0XaaArg: 0.0 ± 0.0
0.24XaaSer: 0.24 ± 0.15
0.0XaaThr: 0.0 ± 0.0
0.24XaaVal: 0.24 ± 0.15
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
91.543XaaXaa: 91.543 ± 56.982
Statistics based on 5 proteins (4163 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski