Amino acid dipeptide frequency for Sida golden yellow spot virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (1/400). This is not the case in nature, so for better visibility the dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
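The calculation described above can be sketched in a few lines of Python. This is only an illustration, not the code used to build this page: the function name `dipeptide_permille` and the toy sequences are made up, one-letter codes are used instead of the three-letter codes of the table, and dipeptides are counted within each protein only (how the database treats protein boundaries is not stated here).

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else is mapped to X (Xaa).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: dipeptides are counted within each protein only, so pairs
    spanning two different proteins are never counted.
    """
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X.
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Slide a window of width 2 along the sequence.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}

# Uniform expectation for 400 dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400

toy = ["MAAK", "MKAA"]  # made-up sequences, not taken from this proteome
freqs = dipeptide_permille(toy)
for dp, f in sorted(freqs.items()):
    label = "over" if f > EXPECTED_PERMILLE else "under"
    print(f"{dp}: {f:.1f} per mille ({label}represented)")
```

With such short toy input every observed dipeptide is far above 2.5‰; the comparison only becomes informative on a full proteome.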

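All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions. For example, 1.862‰ corresponds to 0.001862 (0.1862%), and the random expectation of 0.25% corresponds to 2.5‰.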

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.862 ± 0.994
AlaCys: 0.931 ± 0.772
AlaAsp: 1.862 ± 1.292
AlaGlu: 3.724 ± 1.969
AlaPhe: 0.931 ± 0.914
AlaGly: 0.0 ± 0.0
AlaHis: 0.931 ± 0.772
AlaIle: 0.0 ± 0.0
AlaLys: 5.587 ± 2.223
AlaLeu: 2.793 ± 1.223
AlaMet: 0.931 ± 0.772
AlaAsn: 3.724 ± 1.077
AlaPro: 1.862 ± 0.918
AlaGln: 3.724 ± 1.597
AlaArg: 7.449 ± 2.795
AlaSer: 7.449 ± 0.885
AlaThr: 0.931 ± 0.914
AlaVal: 3.724 ± 1.969
AlaTrp: 1.862 ± 0.801
AlaTyr: 0.0 ± 0.0
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.931 ± 1.027
CysCys: 0.0 ± 0.0
CysAsp: 1.862 ± 1.545
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 0.931 ± 1.027
CysHis: 0.931 ± 0.772
CysIle: 0.0 ± 0.0
CysLys: 0.931 ± 0.692
CysLeu: 0.931 ± 1.29
CysMet: 0.931 ± 0.926
CysAsn: 1.862 ± 0.801
CysPro: 0.0 ± 0.0
CysGln: 0.931 ± 0.929
CysArg: 1.862 ± 0.994
CysSer: 2.793 ± 1.899
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.931 ± 0.772
CysXaa: 0.0 ± 0.0
Asp
AspAla: 0.931 ± 0.692
AspCys: 0.0 ± 0.0
AspAsp: 1.862 ± 0.994
AspGlu: 0.931 ± 0.692
AspPhe: 2.793 ± 1.282
AspGly: 3.724 ± 1.582
AspHis: 0.931 ± 0.772
AspIle: 5.587 ± 2.742
AspLys: 1.862 ± 0.918
AspLeu: 4.655 ± 2.022
AspMet: 0.931 ± 0.862
AspAsn: 2.793 ± 3.082
AspPro: 0.0 ± 0.0
AspGln: 1.862 ± 1.095
AspArg: 2.793 ± 1.413
AspSer: 7.449 ± 0.861
AspThr: 2.793 ± 1.413
AspVal: 5.587 ± 3.636
AspTrp: 1.862 ± 0.898
AspTyr: 1.862 ± 0.801
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.862 ± 1.384
GluCys: 0.0 ± 0.0
GluAsp: 0.0 ± 0.0
GluGlu: 8.38 ± 2.976
GluPhe: 1.862 ± 0.918
GluGly: 3.724 ± 1.888
GluHis: 0.931 ± 0.914
GluIle: 1.862 ± 1.424
GluLys: 2.793 ± 1.307
GluLeu: 4.655 ± 1.636
GluMet: 0.931 ± 0.692
GluAsn: 3.724 ± 1.56
GluPro: 2.793 ± 1.307
GluGln: 2.793 ± 1.223
GluArg: 1.862 ± 1.384
GluSer: 2.793 ± 1.37
GluThr: 3.724 ± 2.075
GluVal: 2.793 ± 2.787
GluTrp: 2.793 ± 1.37
GluTyr: 1.862 ± 1.384
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.931 ± 0.914
PheCys: 1.862 ± 1.479
PheAsp: 0.931 ± 0.692
PheGlu: 0.931 ± 0.692
PhePhe: 1.862 ± 0.801
PheGly: 1.862 ± 0.801
PheHis: 2.793 ± 1.223
PheIle: 2.793 ± 1.307
PheLys: 3.724 ± 2.566
PheLeu: 3.724 ± 1.93
PheMet: 0.931 ± 0.772
PheAsn: 2.793 ± 1.344
PhePro: 1.862 ± 1.292
PheGln: 5.587 ± 0.982
PheArg: 4.655 ± 3.076
PheSer: 0.0 ± 0.0
PheThr: 2.793 ± 1.597
PheVal: 1.862 ± 0.801
PheTrp: 0.931 ± 1.027
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.793 ± 2.076
GlyCys: 0.931 ± 1.027
GlyAsp: 3.724 ± 1.077
GlyGlu: 6.518 ± 2.242
GlyPhe: 0.931 ± 1.027
GlyGly: 3.724 ± 1.601
GlyHis: 0.931 ± 0.692
GlyIle: 0.931 ± 0.692
GlyLys: 4.655 ± 1.612
GlyLeu: 4.655 ± 2.37
GlyMet: 0.0 ± 0.0
GlyAsn: 2.793 ± 1.598
GlyPro: 0.931 ± 0.772
GlyGln: 4.655 ± 2.104
GlyArg: 1.862 ± 1.098
GlySer: 7.449 ± 3.285
GlyThr: 3.724 ± 2.578
GlyVal: 3.724 ± 1.539
GlyTrp: 0.0 ± 0.0
GlyTyr: 1.862 ± 1.545
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.862 ± 1.095
HisCys: 1.862 ± 1.098
HisAsp: 0.931 ± 0.914
HisGlu: 0.931 ± 0.692
HisPhe: 0.931 ± 0.692
HisGly: 0.0 ± 0.0
HisHis: 2.793 ± 1.208
HisIle: 0.0 ± 0.0
HisLys: 0.931 ± 0.914
HisLeu: 3.724 ± 1.37
HisMet: 0.931 ± 0.772
HisAsn: 3.724 ± 1.93
HisPro: 1.862 ± 0.918
HisGln: 0.931 ± 1.027
HisArg: 1.862 ± 2.054
HisSer: 0.931 ± 0.772
HisThr: 0.0 ± 0.0
HisVal: 4.655 ± 2.396
HisTrp: 0.931 ± 0.692
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.862 ± 1.292
IleCys: 0.0 ± 0.0
IleAsp: 5.587 ± 2.41
IleGlu: 3.724 ± 1.387
IlePhe: 1.862 ± 1.384
IleGly: 0.931 ± 0.692
IleHis: 0.0 ± 0.0
IleIle: 1.862 ± 1.384
IleLys: 6.518 ± 1.516
IleLeu: 5.587 ± 2.826
IleMet: 0.0 ± 0.0
IleAsn: 4.655 ± 1.35
IlePro: 2.793 ± 1.258
IleGln: 6.518 ± 1.342
IleArg: 3.724 ± 2.566
IleSer: 4.655 ± 3.2
IleThr: 3.724 ± 1.835
IleVal: 1.862 ± 0.898
IleTrp: 0.931 ± 0.914
IleTyr: 3.724 ± 2.829
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.724 ± 1.154
LysCys: 0.931 ± 1.027
LysAsp: 5.587 ± 1.681
LysGlu: 5.587 ± 3.186
LysPhe: 4.655 ± 2.57
LysGly: 2.793 ± 1.692
LysHis: 1.862 ± 0.918
LysIle: 2.793 ± 1.223
LysLys: 2.793 ± 0.818
LysLeu: 1.862 ± 0.918
LysMet: 0.931 ± 0.772
LysAsn: 3.724 ± 2.132
LysPro: 2.793 ± 1.486
LysGln: 0.931 ± 0.772
LysArg: 4.655 ± 2.63
LysSer: 3.724 ± 1.27
LysThr: 3.724 ± 1.93
LysVal: 1.862 ± 0.801
LysTrp: 0.0 ± 0.0
LysTyr: 1.862 ± 0.918
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 0.931 ± 0.772
LeuCys: 0.931 ± 0.692
LeuAsp: 4.655 ± 1.604
LeuGlu: 3.724 ± 1.65
LeuPhe: 3.724 ± 2.162
LeuGly: 6.518 ± 2.079
LeuHis: 3.724 ± 1.93
LeuIle: 4.655 ± 1.97
LeuLys: 3.724 ± 1.93
LeuLeu: 1.862 ± 1.095
LeuMet: 0.931 ± 0.914
LeuAsn: 6.518 ± 1.139
LeuPro: 3.724 ± 2.25
LeuGln: 2.793 ± 1.307
LeuArg: 3.724 ± 1.031
LeuSer: 6.518 ± 1.342
LeuThr: 6.518 ± 2.613
LeuVal: 4.655 ± 1.975
LeuTrp: 0.0 ± 0.0
LeuTyr: 2.793 ± 1.696
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.862 ± 1.545
MetCys: 0.0 ± 0.0
MetAsp: 3.724 ± 1.031
MetGlu: 0.0 ± 0.0
MetPhe: 0.0 ± 0.0
MetGly: 1.862 ± 1.351
MetHis: 0.0 ± 0.0
MetIle: 1.862 ± 1.545
MetLys: 0.931 ± 0.772
MetLeu: 1.862 ± 1.098
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 1.862 ± 0.801
MetGln: 1.862 ± 0.801
MetArg: 1.862 ± 1.545
MetSer: 0.0 ± 0.0
MetThr: 0.0 ± 0.0
MetVal: 0.0 ± 0.0
MetTrp: 0.931 ± 0.692
MetTyr: 0.931 ± 0.914
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 9.311 ± 3.032
AsnCys: 3.724 ± 1.601
AsnAsp: 0.0 ± 0.0
AsnGlu: 1.862 ± 1.384
AsnPhe: 1.862 ± 1.047
AsnGly: 5.587 ± 1.524
AsnHis: 0.931 ± 0.914
AsnIle: 3.724 ± 1.898
AsnLys: 2.793 ± 1.258
AsnLeu: 4.655 ± 2.313
AsnMet: 0.931 ± 0.772
AsnAsn: 4.655 ± 2.282
AsnPro: 2.793 ± 1.208
AsnGln: 1.862 ± 1.351
AsnArg: 0.931 ± 1.027
AsnSer: 3.724 ± 2.6
AsnThr: 0.931 ± 0.772
AsnVal: 2.793 ± 0.818
AsnTrp: 0.0 ± 0.0
AsnTyr: 1.862 ± 1.384
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.862 ± 1.095
ProCys: 0.931 ± 1.027
ProAsp: 0.931 ± 0.929
ProGlu: 3.724 ± 1.93
ProPhe: 1.862 ± 0.801
ProGly: 2.793 ± 1.182
ProHis: 0.931 ± 0.692
ProIle: 2.793 ± 1.441
ProLys: 1.862 ± 0.898
ProLeu: 3.724 ± 1.095
ProMet: 0.931 ± 0.641
ProAsn: 1.862 ± 0.994
ProPro: 2.793 ± 1.208
ProGln: 5.587 ± 2.179
ProArg: 4.655 ± 2.367
ProSer: 8.38 ± 2.343
ProThr: 5.587 ± 1.769
ProVal: 4.655 ± 0.979
ProTrp: 0.931 ± 0.692
ProTyr: 2.793 ± 1.597
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.724 ± 1.65
GlnCys: 0.931 ± 0.692
GlnAsp: 2.793 ± 1.981
GlnGlu: 1.862 ± 1.292
GlnPhe: 1.862 ± 0.898
GlnGly: 3.724 ± 1.343
GlnHis: 0.931 ± 0.692
GlnIle: 7.449 ± 1.836
GlnLys: 1.862 ± 0.801
GlnLeu: 3.724 ± 2.578
GlnMet: 1.862 ± 1.545
GlnAsn: 0.931 ± 0.692
GlnPro: 6.518 ± 1.823
GlnGln: 2.793 ± 0.841
GlnArg: 2.793 ± 1.413
GlnSer: 3.724 ± 1.93
GlnThr: 6.518 ± 2.26
GlnVal: 1.862 ± 1.047
GlnTrp: 0.0 ± 0.0
GlnTyr: 2.793 ± 1.182
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.518 ± 3.049
ArgCys: 0.0 ± 0.0
ArgAsp: 1.862 ± 0.801
ArgGlu: 2.793 ± 1.486
ArgPhe: 5.587 ± 2.221
ArgGly: 4.655 ± 1.928
ArgHis: 0.0 ± 0.0
ArgIle: 4.655 ± 2.418
ArgLys: 4.655 ± 1.928
ArgLeu: 3.724 ± 1.909
ArgMet: 0.931 ± 0.772
ArgAsn: 0.0 ± 0.0
ArgPro: 5.587 ± 1.23
ArgGln: 1.862 ± 1.095
ArgArg: 8.38 ± 3.608
ArgSer: 7.449 ± 2.776
ArgThr: 4.655 ± 1.239
ArgVal: 0.931 ± 0.692
ArgTrp: 1.862 ± 1.095
ArgTyr: 0.0 ± 0.0
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 1.862 ± 0.898
SerCys: 1.862 ± 1.098
SerAsp: 3.724 ± 2.132
SerGlu: 0.931 ± 0.692
SerPhe: 4.655 ± 1.928
SerGly: 6.518 ± 1.823
SerHis: 0.931 ± 0.692
SerIle: 9.311 ± 1.451
SerLys: 2.793 ± 2.526
SerLeu: 6.518 ± 1.877
SerMet: 1.862 ± 1.666
SerAsn: 7.449 ± 2.143
SerPro: 5.587 ± 1.911
SerGln: 5.587 ± 1.355
SerArg: 9.311 ± 4.568
SerSer: 11.173 ± 3.983
SerThr: 6.518 ± 4.167
SerVal: 2.793 ± 1.413
SerTrp: 0.931 ± 0.772
SerTyr: 2.793 ± 1.307
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.724 ± 1.031
ThrCys: 0.931 ± 1.29
ThrAsp: 3.724 ± 1.794
ThrGlu: 2.793 ± 1.457
ThrPhe: 0.931 ± 1.29
ThrGly: 4.655 ± 1.226
ThrHis: 3.724 ± 2.959
ThrIle: 2.793 ± 0.818
ThrLys: 2.793 ± 1.223
ThrLeu: 1.862 ± 1.384
ThrMet: 2.793 ± 1.413
ThrAsn: 2.793 ± 0.818
ThrPro: 10.242 ± 2.544
ThrGln: 2.793 ± 1.182
ThrArg: 0.931 ± 0.692
ThrSer: 4.655 ± 3.806
ThrThr: 2.793 ± 2.743
ThrVal: 0.931 ± 1.027
ThrTrp: 0.931 ± 1.29
ThrTyr: 3.724 ± 1.095
ThrXaa: 0.0 ± 0.0
Val
ValAla: 0.0 ± 0.0
ValCys: 0.0 ± 0.0
ValAsp: 6.518 ± 3.499
ValGlu: 2.793 ± 1.223
ValPhe: 1.862 ± 1.098
ValGly: 0.0 ± 0.0
ValHis: 3.724 ± 2.047
ValIle: 3.724 ± 2.667
ValLys: 2.793 ± 1.413
ValLeu: 7.449 ± 1.912
ValMet: 0.931 ± 0.692
ValAsn: 0.0 ± 0.0
ValPro: 3.724 ± 1.343
ValGln: 1.862 ± 0.918
ValArg: 0.931 ± 0.929
ValSer: 5.587 ± 1.172
ValThr: 0.931 ± 0.772
ValVal: 0.0 ± 0.0
ValTrp: 2.793 ± 1.597
ValTyr: 2.793 ± 0.818
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.862 ± 1.384
TrpCys: 0.0 ± 0.0
TrpAsp: 0.931 ± 1.027
TrpGlu: 0.931 ± 0.914
TrpPhe: 0.931 ± 1.29
TrpGly: 0.931 ± 0.692
TrpHis: 0.0 ± 0.0
TrpIle: 1.862 ± 1.859
TrpLys: 0.931 ± 0.692
TrpLeu: 0.0 ± 0.0
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.931 ± 0.692
TrpArg: 0.931 ± 1.027
TrpSer: 2.793 ± 2.317
TrpThr: 3.724 ± 1.095
TrpVal: 0.931 ± 0.772
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.862 ± 0.801
TyrCys: 0.0 ± 0.0
TyrAsp: 0.0 ± 0.0
TyrGlu: 0.0 ± 0.0
TyrPhe: 3.724 ± 1.835
TyrGly: 1.862 ± 0.801
TyrHis: 2.793 ± 1.808
TyrIle: 1.862 ± 1.552
TyrLys: 1.862 ± 0.801
TyrLeu: 4.655 ± 1.803
TyrMet: 0.931 ± 0.82
TyrAsn: 0.931 ± 0.692
TyrPro: 1.862 ± 0.801
TyrGln: 2.793 ± 1.413
TyrArg: 0.931 ± 0.929
TyrSer: 2.793 ± 1.413
TyrThr: 0.931 ± 0.914
TyrVal: 2.793 ± 1.208
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1075 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
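As a rough illustration of what bootstrapping at the protein level can look like, the sketch below resamples whole proteins with replacement 100 times and reports the spread of one dipeptide frequency across the replicates. The details are assumptions rather than the documented procedure of the database: the function names, the use of the sample standard deviation as the error, the fixed seed, and the toy sequences are all hypothetical.

```python
import random
import statistics
from collections import Counter

def dipeptide_counts(seq):
    """Count overlapping dipeptides in one sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Estimate the spread (in per mille) of one dipeptide frequency.

    Assumptions: whole proteins are resampled with replacement, and the
    error is the standard deviation of the frequency across replicates.
    """
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(p) for p in proteins]
    values = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = [rng.choice(per_protein) for _ in per_protein]
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        values.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(values)

toy = ["MAAKAA", "MKRRS", "MSSAA"]  # made-up sequences, not from this proteome
err = bootstrap_error(toy, "AA")
print(f"AA: +/- {err:.3f} per mille")
```

Under this scheme a dipeptide that never occurs in any protein has a frequency of zero in every replicate, so its estimated error is also zero, which matches the 0.0 ± 0.0 entries in the table.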

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski