Amino acid dipeptide frequency for Hibiscus golden mosaic virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
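As a rough illustration of the idea, the following minimal Python sketch counts overlapping adjacent residue pairs and reports them in per mille. The function name `dipeptide_per_mille`, the toy sequences, and the choice not to count pairs across protein boundaries are assumptions made for this example; the page does not state how Proteome-pI handles these details. One-letter codes are used here for brevity, whereas the table uses three-letter codes.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Hypothetical helper for illustration only.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs; no pair is formed across two
        # different proteins (an assumption of this sketch).
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not taken from the Hibiscus golden mosaic virus proteome.
toy = ["MKKLA", "AKKS"]
freqs = dipeptide_per_mille(toy)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

Because the frequencies are normalised by the total number of dipeptides, they always sum to 1000‰.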

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
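The comparison against the uniform expectation can be sketched as follows. This is only an illustration: the names `EXPECTED_PER_MILLE` and `classify` are hypothetical, and it is an assumption that the colouring on the page uses exactly the uniform 1/400 expectation as its threshold. The four example values are copied from the table on this page.

```python
N_STANDARD = 20
# 1 / 400 of all dipeptides, expressed in per mille: 2.5 (i.e. 0.25 %).
EXPECTED_PER_MILLE = 1000.0 / (N_STANDARD ** 2)


def classify(observed_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide as over- or underrepresented (hypothetical helper)."""
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


# Values in per mille taken from the table for this proteome.
examples = {"SerSer": 13.926, "ArgArg": 9.284, "AlaMet": 0.0, "AlaAla": 1.989}
labels = {name: classify(value) for name, value in examples.items()}
for name, label in labels.items():
    print(name, examples[name], label)
```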

All values are presented in per mille (‰); multiply them by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Rows give the first residue and columns the second residue of each dipeptide; each cell is frequency ± error (‰).

First | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 1.989 ± 1.016 | 1.326 ± 0.846 | 0.663 ± 0.546 | 1.989 ± 0.748 | 0.663 ± 0.567 | 1.326 ± 1.136 | 1.326 ± 0.771 | 1.989 ± 1.097 | 4.642 ± 2.526 | 3.979 ± 1.413 | 0.0 ± 0.0 | 2.653 ± 1.634 | 3.979 ± 0.907 | 3.979 ± 1.824 | 3.979 ± 1.498 | 7.294 ± 0.893 | 3.979 ± 2.049 | 3.316 ± 0.817 | 0.0 ± 0.0 | 1.326 ± 0.807 | 0.0 ± 0.0
Cys | 0.0 ± 0.0 | 0.663 ± 0.662 | 0.663 ± 0.567 | 0.663 ± 0.568 | 0.663 ± 0.662 | 0.663 ± 0.728 | 0.0 ± 0.0 | 1.989 ± 1.301 | 1.989 ± 0.628 | 0.663 ± 0.667 | 0.663 ± 0.567 | 1.989 ± 0.666 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.326 ± 0.707 | 1.989 ± 1.441 | 0.663 ± 0.568 | 1.989 ± 0.897 | 1.326 ± 1.091 | 0.0 ± 0.0 | 0.0 ± 0.0
Asp | 1.326 ± 0.829 | 0.663 ± 0.546 | 2.653 ± 1.47 | 1.989 ± 0.843 | 2.653 ± 0.923 | 1.989 ± 1.135 | 1.989 ± 0.838 | 3.316 ± 1.562 | 1.326 ± 0.818 | 6.631 ± 1.431 | 0.0 ± 0.0 | 2.653 ± 0.617 | 1.989 ± 1.161 | 0.663 ± 0.662 | 3.979 ± 1.479 | 7.294 ± 1.354 | 0.663 ± 0.567 | 5.968 ± 1.79 | 0.663 ± 0.554 | 1.326 ± 0.707 | 0.0 ± 0.0
Glu | 1.989 ± 0.68 | 0.663 ± 0.567 | 0.663 ± 0.667 | 3.316 ± 1.082 | 1.326 ± 0.707 | 4.642 ± 2.046 | 0.0 ± 0.0 | 2.653 ± 1.612 | 1.326 ± 0.707 | 4.642 ± 0.922 | 0.663 ± 0.554 | 3.316 ± 1.455 | 3.316 ± 0.816 | 3.316 ± 1.245 | 2.653 ± 1.001 | 5.968 ± 2.052 | 0.663 ± 0.554 | 2.653 ± 1.551 | 2.653 ± 1.025 | 2.653 ± 1.007 | 0.0 ± 0.0
Phe | 1.989 ± 0.748 | 0.663 ± 0.568 | 2.653 ± 1.013 | 1.326 ± 0.79 | 1.989 ± 1.016 | 2.653 ± 1.013 | 1.989 ± 1.663 | 1.326 ± 1.109 | 3.979 ± 2.052 | 1.989 ± 1.663 | 0.0 ± 0.0 | 3.979 ± 0.69 | 1.326 ± 1.134 | 2.653 ± 1.004 | 1.326 ± 0.708 | 5.305 ± 1.501 | 2.653 ± 0.686 | 1.326 ± 1.091 | 1.989 ± 1.243 | 3.316 ± 1.495 | 0.0 ± 0.0
Gly | 5.305 ± 2.002 | 1.326 ± 0.846 | 0.663 ± 0.554 | 5.305 ± 1.424 | 0.663 ± 0.728 | 3.316 ± 1.467 | 1.326 ± 0.814 | 1.326 ± 0.597 | 5.968 ± 2.548 | 2.653 ± 0.686 | 0.663 ± 0.474 | 2.653 ± 0.763 | 3.316 ± 1.222 | 1.989 ± 1.025 | 1.989 ± 0.666 | 5.305 ± 1.733 | 5.968 ± 1.685 | 3.979 ± 1.674 | 0.0 ± 0.0 | 0.663 ± 0.567 | 0.0 ± 0.0
His | 1.326 ± 0.682 | 1.326 ± 0.759 | 3.316 ± 1.45 | 1.989 ± 0.666 | 1.326 ± 0.756 | 1.989 ± 0.895 | 1.326 ± 1.078 | 1.989 ± 1.101 | 1.989 ± 1.11 | 2.653 ± 1.19 | 0.0 ± 0.0 | 2.653 ± 1.721 | 1.326 ± 0.759 | 2.653 ± 1.082 | 2.653 ± 1.295 | 2.653 ± 1.275 | 2.653 ± 1.654 | 3.316 ± 0.525 | 0.663 ± 0.554 | 1.326 ± 0.608 | 0.0 ± 0.0
Ile | 0.0 ± 0.0 | 1.326 ± 0.818 | 4.642 ± 1.839 | 3.979 ± 1.642 | 1.989 ± 1.188 | 0.663 ± 0.554 | 2.653 ± 0.686 | 2.653 ± 1.001 | 4.642 ± 0.616 | 2.653 ± 1.829 | 0.663 ± 0.728 | 3.316 ± 1.37 | 1.989 ± 1.037 | 0.663 ± 0.667 | 5.968 ± 1.804 | 3.979 ± 1.343 | 2.653 ± 1.145 | 4.642 ± 1.601 | 1.989 ± 1.08 | 1.989 ± 1.243 | 0.0 ± 0.0
Lys | 3.316 ± 1.19 | 0.663 ± 0.554 | 2.653 ± 1.001 | 1.989 ± 1.663 | 3.979 ± 1.312 | 3.316 ± 0.957 | 1.989 ± 0.705 | 4.642 ± 1.37 | 1.326 ± 0.814 | 5.968 ± 1.733 | 1.989 ± 1.082 | 3.316 ± 0.98 | 3.316 ± 1.211 | 0.663 ± 0.567 | 6.631 ± 2.321 | 3.979 ± 1.205 | 3.316 ± 0.777 | 4.642 ± 3.206 | 0.0 ± 0.0 | 1.989 ± 0.776 | 0.0 ± 0.0
Leu | 3.316 ± 1.375 | 0.663 ± 0.554 | 4.642 ± 1.404 | 3.316 ± 1.652 | 0.663 ± 0.546 | 4.642 ± 0.822 | 3.979 ± 1.32 | 3.979 ± 0.911 | 6.631 ± 1.632 | 4.642 ± 1.33 | 1.326 ± 0.682 | 3.979 ± 1.948 | 2.653 ± 2.065 | 4.642 ± 1.334 | 4.642 ± 1.102 | 7.958 ± 2.752 | 3.316 ± 1.139 | 2.653 ± 0.678 | 0.0 ± 0.0 | 3.979 ± 0.894 | 0.0 ± 0.0
Met | 2.653 ± 1.364 | 1.326 ± 0.851 | 2.653 ± 1.145 | 0.663 ± 0.662 | 1.989 ± 1.287 | 1.989 ± 1.025 | 0.663 ± 0.568 | 0.0 ± 0.0 | 0.663 ± 0.567 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.326 ± 0.597 | 1.989 ± 1.097 | 0.663 ± 0.728 | 1.326 ± 1.091 | 0.663 ± 0.567 | 0.663 ± 0.546 | 0.663 ± 0.554 | 1.989 ± 0.784 | 0.0 ± 0.0
Asn | 3.979 ± 1.251 | 1.989 ± 0.895 | 1.989 ± 0.628 | 2.653 ± 1.095 | 2.653 ± 1.19 | 3.316 ± 0.617 | 3.316 ± 2.268 | 1.989 ± 0.843 | 3.316 ± 1.211 | 3.979 ± 0.984 | 1.989 ± 1.095 | 2.653 ± 0.83 | 4.642 ± 0.732 | 0.663 ± 0.546 | 2.653 ± 1.09 | 5.968 ± 1.245 | 0.663 ± 0.728 | 1.989 ± 0.748 | 0.0 ± 0.0 | 2.653 ± 1.019 | 0.0 ± 0.0
Pro | 0.663 ± 0.567 | 0.663 ± 0.568 | 2.653 ± 0.617 | 3.979 ± 2.208 | 0.663 ± 0.567 | 2.653 ± 0.939 | 3.979 ± 1.519 | 2.653 ± 2.268 | 3.979 ± 1.15 | 1.989 ± 1.228 | 1.326 ± 1.136 | 3.316 ± 1.559 | 0.663 ± 0.567 | 1.326 ± 1.455 | 3.979 ± 1.792 | 7.294 ± 1.376 | 1.989 ± 1.302 | 2.653 ± 1.025 | 1.989 ± 0.68 | 1.326 ± 0.829 | 0.0 ± 0.0
Gln | 1.989 ± 0.919 | 0.0 ± 0.0 | 1.326 ± 1.092 | 1.989 ± 1.003 | 3.979 ± 1.041 | 1.326 ± 0.814 | 0.0 ± 0.0 | 3.979 ± 1.539 | 0.663 ± 0.662 | 3.316 ± 1.475 | 0.663 ± 0.522 | 0.663 ± 0.728 | 2.653 ± 1.686 | 1.326 ± 0.608 | 3.979 ± 0.907 | 2.653 ± 0.744 | 1.989 ± 1.663 | 3.316 ± 1.12 | 0.0 ± 0.0 | 0.663 ± 0.568 | 0.0 ± 0.0
Arg | 4.642 ± 1.383 | 1.326 ± 0.708 | 6.631 ± 1.375 | 2.653 ± 1.007 | 7.294 ± 1.997 | 5.305 ± 1.985 | 2.653 ± 1.157 | 3.979 ± 1.177 | 3.979 ± 0.594 | 5.305 ± 1.848 | 1.326 ± 0.756 | 0.663 ± 0.546 | 3.316 ± 1.098 | 1.326 ± 0.756 | 9.284 ± 4.158 | 5.968 ± 1.256 | 3.979 ± 0.781 | 5.305 ± 1.12 | 0.663 ± 0.567 | 1.326 ± 0.75 | 0.0 ± 0.0
Ser | 4.642 ± 1.979 | 1.989 ± 1.161 | 3.316 ± 0.52 | 0.663 ± 0.567 | 3.316 ± 0.924 | 3.979 ± 1.704 | 3.979 ± 1.663 | 6.631 ± 2.007 | 3.316 ± 1.059 | 5.968 ± 1.548 | 1.989 ± 1.364 | 6.631 ± 1.367 | 5.305 ± 1.778 | 1.989 ± 1.037 | 7.958 ± 2.322 | 13.926 ± 3.859 | 6.631 ± 2.33 | 6.631 ± 1.813 | 1.989 ± 1.037 | 6.631 ± 2.426 | 0.0 ± 0.0
Thr | 3.979 ± 1.312 | 0.0 ± 0.0 | 2.653 ± 1.126 | 3.979 ± 0.987 | 1.989 ± 1.129 | 3.316 ± 1.076 | 3.979 ± 1.574 | 1.326 ± 0.756 | 1.989 ± 1.048 | 3.979 ± 1.104 | 0.663 ± 0.554 | 3.979 ± 1.279 | 1.326 ± 1.136 | 0.663 ± 0.662 | 1.326 ± 0.807 | 3.979 ± 2.124 | 2.653 ± 1.28 | 5.305 ± 1.551 | 0.663 ± 0.667 | 3.316 ± 1.579 | 0.0 ± 0.0
Val | 1.989 ± 0.671 | 0.663 ± 0.567 | 3.979 ± 1.168 | 3.979 ± 0.978 | 2.653 ± 0.747 | 3.979 ± 2.018 | 3.316 ± 1.054 | 3.316 ± 1.414 | 3.979 ± 1.48 | 5.968 ± 2.693 | 3.979 ± 1.819 | 3.316 ± 1.098 | 5.968 ± 1.345 | 3.316 ± 0.873 | 3.316 ± 1.308 | 3.979 ± 0.896 | 2.653 ± 1.087 | 1.326 ± 0.682 | 0.0 ± 0.0 | 4.642 ± 1.878 | 0.0 ± 0.0
Trp | 1.989 ± 0.666 | 0.0 ± 0.0 | 0.663 ± 0.728 | 1.326 ± 0.771 | 0.0 ± 0.0 | 0.663 ± 0.554 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.989 ± 0.68 | 0.663 ± 0.568 | 1.326 ± 0.682 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.663 ± 0.554 | 1.989 ± 0.896 | 0.663 ± 0.662 | 1.989 ± 0.748 | 1.989 ± 1.003 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Tyr | 3.316 ± 1.098 | 0.663 ± 0.546 | 0.663 ± 0.568 | 1.326 ± 1.136 | 3.979 ± 0.69 | 2.653 ± 1.095 | 0.663 ± 0.667 | 3.316 ± 1.595 | 1.989 ± 1.135 | 3.979 ± 2.771 | 1.989 ± 1.288 | 1.326 ± 0.597 | 1.326 ± 0.707 | 1.989 ± 1.016 | 6.631 ± 2.121 | 0.0 ± 0.0 | 1.326 ± 1.334 | 2.653 ± 1.135 | 0.663 ± 0.546 | 1.326 ± 0.708 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 7 proteins (1509 amino acids)

Note: The error has been estimated by bootstrapping (100 rounds) at the protein level.
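Bootstrapping at the protein level means that whole proteins, not individual residues, are resampled with replacement. A minimal sketch of the idea follows. The function names, the toy sequences, the fixed random seed, and the use of the sample standard deviation as the reported error are all assumptions made for this example; only the 100 rounds and the protein-level resampling come from the note above.

```python
import random
from collections import Counter
from statistics import stdev


def per_mille(proteins, pair):
    """Frequency of one dipeptide in per mille (hypothetical helper)."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the dipeptide frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(per_mille(sample, pair))
    return stdev(estimates)


# Toy input, not the actual proteome.
toy = ["MKKLA", "AKKS", "KKKK"]
err = bootstrap_error(toy, "KK")
print(f"KK: {per_mille(toy, 'KK'):.3f} ± {err:.3f}")
```

A dipeptide that never occurs in any protein has a frequency of 0 in every resample, which is consistent with the 0.0 ± 0.0 entries in the table.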

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski