Amino acid dipeptide frequency for Squash leaf curl Yunnan virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
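As a rough illustration of what a dipeptide frequency is, the sketch below counts overlapping residue pairs in a set of protein sequences and reports each pair in per mille. It assumes one-letter amino acid codes and that pairs never span two proteins; the function name and the toy sequences are hypothetical and are not taken from Proteome-pI.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs within one protein; a pair never spans two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, one-letter codes).
freqs = dipeptide_frequencies(["MSSA", "SSK"])
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

Whether the database normalises by the number of pairs exactly as done here is an assumption; the page does not state it.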

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
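The arithmetic behind the expected frequency can be checked with a short sketch. It assumes that "expected" means the uniform value of 1/400 and that a cell is over- or under-represented when it lies above or below that value; the exact threshold used for the colouring is not stated on the page, and the `classify` helper is hypothetical.

```python
N_STANDARD = 20  # standard amino acids
N_WITH_XAA = 21  # standard amino acids plus Xaa

expected_fraction = 1 / N_STANDARD ** 2             # 1/400
expected_percent = 100 * expected_fraction          # 0.25 %
expected_per_mille = 1000 * expected_fraction       # 2.5 per mille
expected_per_mille_xaa = 1000 / N_WITH_XAA ** 2     # about 2.27 per mille


def classify(observed_per_mille, expected=expected_per_mille):
    """Label a table value relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


# Two values taken from the table: SerSer and AlaPhe.
print(classify(13.749))
print(classify(0.0))
```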

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
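For readers who prefer fractions or percentages, the unit conversion is a one-liner. The helper names below are hypothetical; the example value is the SerSer entry from the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_per_mille / 10


ser_ser = 13.749  # SerSer frequency in per mille, from the table
print(per_mille_to_fraction(ser_ser))
print(per_mille_to_percent(ser_ser))
```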

For more information, see the sequence space article on Wikipedia.

Dipeptide frequency table (values in ‰, given as value ± error), grouped by first amino acid; the second amino acid runs in the order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 5.5 ± 2.27
AlaCys: 1.833 ± 0.698
AlaAsp: 1.833 ± 1.114
AlaGlu: 1.833 ± 1.282
AlaPhe: 0.0 ± 0.0
AlaGly: 1.833 ± 0.698
AlaHis: 1.833 ± 1.868
AlaIle: 4.583 ± 1.648
AlaLys: 5.5 ± 1.187
AlaLeu: 6.416 ± 1.769
AlaMet: 0.0 ± 0.0
AlaAsn: 0.917 ± 0.641
AlaPro: 0.917 ± 0.968
AlaGln: 1.833 ± 1.109
AlaArg: 3.666 ± 1.955
AlaSer: 2.75 ± 1.382
AlaThr: 4.583 ± 1.914
AlaVal: 1.833 ± 1.114
AlaTrp: 0.917 ± 0.641
AlaTyr: 0.917 ± 0.713
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.0 ± 0.0
CysCys: 1.833 ± 2.16
CysAsp: 0.0 ± 0.0
CysGlu: 0.917 ± 0.713
CysPhe: 0.917 ± 0.98
CysGly: 1.833 ± 0.967
CysHis: 0.917 ± 0.934
CysIle: 1.833 ± 1.471
CysLys: 1.833 ± 1.141
CysLeu: 0.917 ± 0.713
CysMet: 0.917 ± 1.08
CysAsn: 4.583 ± 1.579
CysPro: 3.666 ± 2.218
CysGln: 0.917 ± 0.641
CysArg: 0.0 ± 0.0
CysSer: 0.917 ± 0.934
CysThr: 0.917 ± 0.713
CysVal: 0.917 ± 0.713
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0

Asp
AspAla: 2.75 ± 1.135
AspCys: 0.0 ± 0.0
AspAsp: 1.833 ± 0.988
AspGlu: 0.917 ± 0.713
AspPhe: 0.917 ± 0.713
AspGly: 3.666 ± 1.877
AspHis: 0.917 ± 0.98
AspIle: 4.583 ± 2.452
AspLys: 2.75 ± 0.892
AspLeu: 5.5 ± 2.599
AspMet: 0.917 ± 1.08
AspAsn: 2.75 ± 0.996
AspPro: 1.833 ± 1.109
AspGln: 0.917 ± 0.713
AspArg: 2.75 ± 1.256
AspSer: 4.583 ± 1.294
AspThr: 2.75 ± 1.215
AspVal: 5.5 ± 1.52
AspTrp: 1.833 ± 1.282
AspTyr: 1.833 ± 1.282
AspXaa: 0.0 ± 0.0

Glu
GluAla: 4.583 ± 1.613
GluCys: 0.0 ± 0.0
GluAsp: 1.833 ± 0.941
GluGlu: 8.249 ± 2.879
GluPhe: 1.833 ± 1.109
GluGly: 3.666 ± 1.054
GluHis: 0.0 ± 0.0
GluIle: 0.917 ± 1.08
GluLys: 4.583 ± 2.521
GluLeu: 4.583 ± 1.633
GluMet: 0.0 ± 0.0
GluAsn: 3.666 ± 2.107
GluPro: 3.666 ± 2.034
GluGln: 1.833 ± 0.698
GluArg: 0.0 ± 0.0
GluSer: 4.583 ± 1.838
GluThr: 0.917 ± 0.968
GluVal: 0.917 ± 0.968
GluTrp: 1.833 ± 0.967
GluTyr: 1.833 ± 1.282
GluXaa: 0.0 ± 0.0

Phe
PheAla: 0.0 ± 0.0
PheCys: 0.917 ± 0.713
PheAsp: 3.666 ± 1.237
PheGlu: 0.0 ± 0.0
PhePhe: 0.0 ± 0.0
PheGly: 0.0 ± 0.0
PheHis: 0.0 ± 0.0
PheIle: 1.833 ± 0.941
PheLys: 3.666 ± 1.881
PheLeu: 3.666 ± 1.881
PheMet: 0.917 ± 0.641
PheAsn: 2.75 ± 0.892
PhePro: 1.833 ± 1.109
PheGln: 7.333 ± 2.32
PheArg: 3.666 ± 1.539
PheSer: 1.833 ± 0.967
PheThr: 2.75 ± 1.79
PheVal: 2.75 ± 0.996
PheTrp: 1.833 ± 1.141
PheTyr: 3.666 ± 2.107
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 2.75 ± 1.924
GlyCys: 2.75 ± 1.99
GlyAsp: 2.75 ± 1.348
GlyGlu: 1.833 ± 0.941
GlyPhe: 2.75 ± 2.233
GlyGly: 2.75 ± 1.135
GlyHis: 1.833 ± 0.698
GlyIle: 2.75 ± 0.892
GlyLys: 6.416 ± 2.037
GlyLeu: 1.833 ± 1.057
GlyMet: 0.0 ± 0.0
GlyAsn: 0.0 ± 0.0
GlyPro: 3.666 ± 1.706
GlyGln: 3.666 ± 1.395
GlyArg: 0.917 ± 0.641
GlySer: 1.833 ± 1.282
GlyThr: 2.75 ± 1.274
GlyVal: 3.666 ± 2.574
GlyTrp: 0.0 ± 0.0
GlyTyr: 0.0 ± 0.0
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.833 ± 0.698
HisCys: 1.833 ± 1.327
HisAsp: 1.833 ± 1.959
HisGlu: 0.0 ± 0.0
HisPhe: 2.75 ± 1.454
HisGly: 0.917 ± 1.08
HisHis: 0.0 ± 0.0
HisIle: 0.917 ± 0.641
HisLys: 1.833 ± 1.476
HisLeu: 1.833 ± 0.941
HisMet: 0.917 ± 0.713
HisAsn: 3.666 ± 1.933
HisPro: 2.75 ± 1.278
HisGln: 1.833 ± 0.698
HisArg: 4.583 ± 2.738
HisSer: 0.917 ± 0.934
HisThr: 2.75 ± 1.631
HisVal: 0.917 ± 0.934
HisTrp: 0.0 ± 0.0
HisTyr: 0.917 ± 0.641
HisXaa: 0.0 ± 0.0

Ile
IleAla: 0.917 ± 0.934
IleCys: 0.917 ± 0.713
IleAsp: 3.666 ± 1.79
IleGlu: 4.583 ± 1.838
IlePhe: 5.5 ± 3.067
IleGly: 2.75 ± 1.693
IleHis: 1.833 ± 0.941
IleIle: 3.666 ± 2.941
IleLys: 4.583 ± 1.648
IleLeu: 1.833 ± 1.167
IleMet: 1.833 ± 0.939
IleAsn: 1.833 ± 1.959
IlePro: 3.666 ± 1.419
IleGln: 4.583 ± 2.43
IleArg: 6.416 ± 2.329
IleSer: 5.5 ± 1.942
IleThr: 2.75 ± 2.301
IleVal: 1.833 ± 0.698
IleTrp: 3.666 ± 2.282
IleTyr: 1.833 ± 1.139
IleXaa: 0.0 ± 0.0

Lys
LysAla: 3.666 ± 1.211
LysCys: 3.666 ± 1.79
LysAsp: 1.833 ± 1.282
LysGlu: 3.666 ± 1.706
LysPhe: 1.833 ± 1.425
LysGly: 3.666 ± 1.395
LysHis: 1.833 ± 1.282
LysIle: 2.75 ± 2.004
LysLys: 0.917 ± 0.934
LysLeu: 2.75 ± 1.278
LysMet: 0.0 ± 0.0
LysAsn: 8.249 ± 1.981
LysPro: 1.833 ± 0.698
LysGln: 0.0 ± 0.0
LysArg: 4.583 ± 1.799
LysSer: 3.666 ± 1.201
LysThr: 1.833 ± 1.109
LysVal: 7.333 ± 3.211
LysTrp: 0.0 ± 0.0
LysTyr: 4.583 ± 1.874
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 2.75 ± 1.215
LeuCys: 2.75 ± 1.355
LeuAsp: 4.583 ± 1.63
LeuGlu: 7.333 ± 1.964
LeuPhe: 2.75 ± 0.892
LeuGly: 3.666 ± 2.049
LeuHis: 0.917 ± 0.98
LeuIle: 3.666 ± 1.081
LeuLys: 2.75 ± 0.851
LeuLeu: 7.333 ± 1.864
LeuMet: 0.917 ± 0.968
LeuAsn: 4.583 ± 1.828
LeuPro: 0.917 ± 0.641
LeuGln: 4.583 ± 2.096
LeuArg: 3.666 ± 1.211
LeuSer: 4.583 ± 0.856
LeuThr: 5.5 ± 2.054
LeuVal: 1.833 ± 1.959
LeuTrp: 0.0 ± 0.0
LeuTyr: 3.666 ± 1.312
LeuXaa: 0.0 ± 0.0

Met
MetAla: 0.917 ± 0.713
MetCys: 0.0 ± 0.0
MetAsp: 2.75 ± 1.474
MetGlu: 0.917 ± 1.08
MetPhe: 0.917 ± 0.934
MetGly: 2.75 ± 0.851
MetHis: 0.917 ± 0.641
MetIle: 0.0 ± 0.0
MetLys: 0.0 ± 0.0
MetLeu: 1.833 ± 1.109
MetMet: 0.917 ± 0.968
MetAsn: 0.0 ± 0.0
MetPro: 0.917 ± 0.641
MetGln: 1.833 ± 1.936
MetArg: 1.833 ± 0.967
MetSer: 2.75 ± 1.898
MetThr: 0.0 ± 0.0
MetVal: 0.0 ± 0.0
MetTrp: 1.833 ± 1.109
MetTyr: 0.917 ± 0.713
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.666 ± 1.074
AsnCys: 0.0 ± 0.0
AsnAsp: 3.666 ± 1.395
AsnGlu: 1.833 ± 1.114
AsnPhe: 1.833 ± 0.698
AsnGly: 0.917 ± 0.934
AsnHis: 4.583 ± 2.306
AsnIle: 3.666 ± 1.706
AsnLys: 0.917 ± 0.98
AsnLeu: 3.666 ± 1.305
AsnMet: 0.917 ± 0.672
AsnAsn: 7.333 ± 3.18
AsnPro: 4.583 ± 1.112
AsnGln: 3.666 ± 1.877
AsnArg: 1.833 ± 1.057
AsnSer: 7.333 ± 1.503
AsnThr: 4.583 ± 1.145
AsnVal: 6.416 ± 2.339
AsnTrp: 0.917 ± 0.641
AsnTyr: 2.75 ± 0.934
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.75 ± 1.521
ProCys: 1.833 ± 1.114
ProAsp: 2.75 ± 1.382
ProGlu: 1.833 ± 1.327
ProPhe: 2.75 ± 1.278
ProGly: 1.833 ± 1.282
ProHis: 2.75 ± 1.454
ProIle: 6.416 ± 2.774
ProLys: 5.5 ± 2.27
ProLeu: 2.75 ± 1.325
ProMet: 0.917 ± 0.968
ProAsn: 3.666 ± 2.565
ProPro: 0.917 ± 0.641
ProGln: 1.833 ± 1.468
ProArg: 3.666 ± 1.418
ProSer: 4.583 ± 1.183
ProThr: 3.666 ± 1.237
ProVal: 2.75 ± 1.256
ProTrp: 0.0 ± 0.0
ProTyr: 1.833 ± 1.425
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 2.75 ± 1.693
GlnCys: 0.917 ± 0.98
GlnAsp: 1.833 ± 0.967
GlnGlu: 2.75 ± 0.934
GlnPhe: 4.583 ± 2.365
GlnGly: 1.833 ± 1.282
GlnHis: 0.917 ± 0.934
GlnIle: 4.583 ± 0.856
GlnLys: 0.917 ± 1.08
GlnLeu: 1.833 ± 1.936
GlnMet: 1.833 ± 0.967
GlnAsn: 2.75 ± 1.215
GlnPro: 2.75 ± 2.233
GlnGln: 6.416 ± 2.357
GlnArg: 2.75 ± 0.851
GlnSer: 2.75 ± 0.892
GlnThr: 3.666 ± 1.444
GlnVal: 5.5 ± 1.072
GlnTrp: 0.0 ± 0.0
GlnTyr: 2.75 ± 1.135
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 2.75 ± 2.076
ArgCys: 1.833 ± 1.327
ArgAsp: 3.666 ± 1.237
ArgGlu: 3.666 ± 1.943
ArgPhe: 3.666 ± 2.244
ArgGly: 2.75 ± 0.996
ArgHis: 3.666 ± 1.211
ArgIle: 2.75 ± 1.79
ArgLys: 2.75 ± 1.631
ArgLeu: 2.75 ± 1.382
ArgMet: 1.833 ± 0.957
ArgAsn: 3.666 ± 1.881
ArgPro: 5.5 ± 1.683
ArgGln: 1.833 ± 1.471
ArgArg: 8.249 ± 4.112
ArgSer: 6.416 ± 1.744
ArgThr: 4.583 ± 1.315
ArgVal: 4.583 ± 2.038
ArgTrp: 0.0 ± 0.0
ArgTyr: 0.917 ± 1.08
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 2.75 ± 1.924
SerCys: 0.917 ± 0.641
SerAsp: 4.583 ± 1.648
SerGlu: 4.583 ± 1.633
SerPhe: 3.666 ± 1.11
SerGly: 0.917 ± 0.98
SerHis: 1.833 ± 1.476
SerIle: 2.75 ± 1.79
SerLys: 3.666 ± 1.776
SerLeu: 4.583 ± 1.126
SerMet: 5.5 ± 2.904
SerAsn: 3.666 ± 1.684
SerPro: 8.249 ± 1.758
SerGln: 0.917 ± 0.641
SerArg: 6.416 ± 2.649
SerSer: 13.749 ± 4.383
SerThr: 6.416 ± 1.887
SerVal: 1.833 ± 1.167
SerTrp: 0.0 ± 0.0
SerTyr: 3.666 ± 1.201
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 2.75 ± 0.892
ThrCys: 0.917 ± 1.08
ThrAsp: 0.0 ± 0.0
ThrGlu: 2.75 ± 0.892
ThrPhe: 0.917 ± 0.641
ThrGly: 5.5 ± 1.399
ThrHis: 5.5 ± 1.836
ThrIle: 3.666 ± 1.305
ThrLys: 1.833 ± 0.698
ThrLeu: 4.583 ± 1.338
ThrMet: 0.917 ± 0.641
ThrAsn: 3.666 ± 1.437
ThrPro: 4.583 ± 1.218
ThrGln: 3.666 ± 1.615
ThrArg: 2.75 ± 1.474
ThrSer: 6.416 ± 3.123
ThrThr: 1.833 ± 0.967
ThrVal: 6.416 ± 2.929
ThrTrp: 0.0 ± 0.0
ThrTyr: 1.833 ± 1.109
ThrXaa: 0.0 ± 0.0

Val
ValAla: 0.917 ± 0.934
ValCys: 0.917 ± 0.98
ValAsp: 2.75 ± 0.996
ValGlu: 0.917 ± 1.08
ValPhe: 1.833 ± 1.141
ValGly: 1.833 ± 1.167
ValHis: 0.917 ± 1.08
ValIle: 7.333 ± 1.734
ValLys: 6.416 ± 1.529
ValLeu: 3.666 ± 1.11
ValMet: 0.0 ± 0.993
ValAsn: 3.666 ± 2.217
ValPro: 2.75 ± 1.256
ValGln: 3.666 ± 1.8
ValArg: 4.583 ± 2.785
ValSer: 2.75 ± 0.851
ValThr: 6.416 ± 2.532
ValVal: 1.833 ± 1.141
ValTrp: 0.0 ± 0.0
ValTyr: 4.583 ± 1.914
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 2.75 ± 1.924
TrpCys: 0.0 ± 0.0
TrpAsp: 0.917 ± 1.08
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.0 ± 0.0
TrpGly: 0.917 ± 0.641
TrpHis: 0.0 ± 0.0
TrpIle: 0.917 ± 0.713
TrpLys: 0.917 ± 0.713
TrpLeu: 0.0 ± 0.0
TrpMet: 0.917 ± 0.713
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.917 ± 0.641
TrpArg: 1.833 ± 1.139
TrpSer: 0.0 ± 0.0
TrpThr: 1.833 ± 1.959
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.917 ± 0.641
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.75 ± 1.527
TyrCys: 0.0 ± 0.0
TyrAsp: 2.75 ± 2.138
TyrGlu: 0.917 ± 0.713
TyrPhe: 3.666 ± 1.721
TyrGly: 0.917 ± 0.641
TyrHis: 1.833 ± 1.282
TyrIle: 4.583 ± 2.365
TyrLys: 0.917 ± 0.641
TyrLeu: 6.416 ± 1.825
TyrMet: 0.917 ± 0.713
TyrAsn: 3.666 ± 1.395
TyrPro: 0.0 ± 0.0
TyrGln: 1.833 ± 0.941
TyrArg: 3.666 ± 2.851
TyrSer: 2.75 ± 2.093
TyrThr: 0.0 ± 0.0
TyrVal: 1.833 ± 1.114
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 6 proteins (1092 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
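A minimal sketch of protein-level bootstrapping is given below. It assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each of the 100 replicates, and that the reported error is the standard deviation of those replicate values; the page does not spell out these details, so the function names, the error definition and the toy sequences are all assumptions.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (per mille) over overlapping pairs."""
    counts = Counter(seq[i:i + 2] for seq in proteins for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(estimates)


# Toy proteome (hypothetical sequences, one-letter codes).
toy = ["MSSA", "SSK", "MKKA"]
print(bootstrap_error(toy, "SS"))
print(bootstrap_error(toy, "WW"))
```

Under this scheme a dipeptide that is absent from every protein is absent from every resample, so its error is exactly zero.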

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski