Amino acid dipeptide frequency for Chenopodium leaf curl virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
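The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own code: the function names `dipeptide_frequencies` and `classify` are hypothetical, sequences are assumed to use one-letter codes (the table uses three-letter codes), any non-standard letter is mapped to `X` (Xaa), and the over/under call is assumed to be a plain comparison against the uniform expectation of 1/400.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes
ALPHABET = STANDARD + "X"          # X stands for Xaa (non-standard or unknown)


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Pairs are overlapping windows of length 2 and never span two proteins.
    """
    counts = Counter()
    for seq in proteins:
        residues = [r if r in STANDARD else "X" for r in seq.upper()]
        counts.update(a + b for a, b in zip(residues, residues[1:]))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total) if total else 0.0
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_permille, n_pairs=400):
    """Compare a frequency with the uniform expectation (2.5 per mille for 400 pairs)."""
    expected = 1000.0 / n_pairs
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"
```

For example, `classify(2.636)` returns `"over"` and `classify(0.879)` returns `"under"`, because the uniform expectation is 2.5‰.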

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
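As a small worked example of this unit conversion, the following sketch turns a per mille value from the table into a plain fraction or a percentage. The helper names are hypothetical and serve only as illustration.

```python
def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


# AlaAla is listed as 2.636 per mille, i.e. a fraction of about 0.002636.
ala_ala_fraction = permille_to_fraction(2.636)
# The uniform expectation of 2.5 per mille corresponds to 0.25 percent.
uniform_percent = permille_to_percent(2.5)
```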

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.636 ± 1.942
AlaCys: 0.879 ± 0.718
AlaAsp: 1.757 ± 1.232
AlaGlu: 1.757 ± 1.162
AlaPhe: 1.757 ± 1.142
AlaGly: 3.515 ± 1.396
AlaHis: 0.879 ± 0.838
AlaIle: 0.879 ± 0.718
AlaLys: 2.636 ± 0.817
AlaLeu: 5.272 ± 1.169
AlaMet: 0.0 ± 0.0
AlaAsn: 2.636 ± 1.481
AlaPro: 1.757 ± 0.698
AlaGln: 3.515 ± 1.993
AlaArg: 6.151 ± 4.313
AlaSer: 6.151 ± 1.921
AlaThr: 4.394 ± 1.835
AlaVal: 2.636 ± 1.848
AlaTrp: 0.879 ± 0.616
AlaTyr: 0.879 ± 0.952
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 1.757 ± 0.928
CysGlu: 1.757 ± 0.698
CysPhe: 0.879 ± 0.939
CysGly: 0.879 ± 0.838
CysHis: 0.0 ± 0.0
CysIle: 1.757 ± 1.142
CysLys: 1.757 ± 0.698
CysLeu: 2.636 ± 1.723
CysMet: 0.0 ± 0.0
CysAsn: 0.879 ± 0.616
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.879 ± 0.951
CysSer: 3.515 ± 2.735
CysThr: 0.879 ± 0.718
CysVal: 0.879 ± 0.718
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 0.879 ± 0.616
AspCys: 0.879 ± 0.838
AspAsp: 4.394 ± 2.129
AspGlu: 1.757 ± 0.698
AspPhe: 4.394 ± 1.761
AspGly: 0.879 ± 0.616
AspHis: 0.0 ± 0.0
AspIle: 1.757 ± 1.232
AspLys: 0.879 ± 0.616
AspLeu: 4.394 ± 0.862
AspMet: 2.636 ± 1.203
AspAsn: 1.757 ± 1.142
AspPro: 0.879 ± 0.616
AspGln: 2.636 ± 1.154
AspArg: 5.272 ± 2.208
AspSer: 5.272 ± 1.49
AspThr: 1.757 ± 0.928
AspVal: 5.272 ± 1.768
AspTrp: 1.757 ± 0.928
AspTyr: 1.757 ± 1.232
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.394 ± 1.742
GluCys: 0.0 ± 0.0
GluAsp: 0.0 ± 0.0
GluGlu: 2.636 ± 1.481
GluPhe: 0.879 ± 0.616
GluGly: 5.272 ± 1.968
GluHis: 0.879 ± 0.616
GluIle: 0.879 ± 0.952
GluLys: 0.0 ± 0.0
GluLeu: 1.757 ± 1.081
GluMet: 0.0 ± 0.0
GluAsn: 5.272 ± 2.181
GluPro: 1.757 ± 1.008
GluGln: 1.757 ± 1.435
GluArg: 4.394 ± 2.031
GluSer: 2.636 ± 1.108
GluThr: 0.0 ± 0.0
GluVal: 1.757 ± 0.928
GluTrp: 1.757 ± 0.988
GluTyr: 1.757 ± 0.928
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.757 ± 1.142
PheCys: 0.879 ± 0.718
PheAsp: 1.757 ± 0.698
PheGlu: 1.757 ± 1.143
PhePhe: 1.757 ± 0.928
PheGly: 1.757 ± 0.698
PheHis: 1.757 ± 1.232
PheIle: 2.636 ± 1.481
PheLys: 0.879 ± 0.952
PheLeu: 3.515 ± 1.321
PheMet: 0.0 ± 0.0
PheAsn: 4.394 ± 1.363
PhePro: 2.636 ± 1.256
PheGln: 1.757 ± 1.162
PheArg: 4.394 ± 2.674
PheSer: 3.515 ± 2.181
PheThr: 3.515 ± 1.244
PheVal: 0.879 ± 0.951
PheTrp: 2.636 ± 1.51
PheTyr: 2.636 ± 1.51
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.515 ± 1.748
GlyCys: 2.636 ± 1.685
GlyAsp: 4.394 ± 2.377
GlyGlu: 2.636 ± 1.154
GlyPhe: 3.515 ± 1.343
GlyGly: 4.394 ± 1.555
GlyHis: 1.757 ± 0.894
GlyIle: 3.515 ± 1.059
GlyLys: 7.03 ± 3.042
GlyLeu: 3.515 ± 2.212
GlyMet: 0.0 ± 0.0
GlyAsn: 1.757 ± 0.698
GlyPro: 4.394 ± 1.986
GlyGln: 3.515 ± 1.361
GlyArg: 0.879 ± 0.616
GlySer: 3.515 ± 1.001
GlyThr: 5.272 ± 1.957
GlyVal: 6.151 ± 2.174
GlyTrp: 0.0 ± 0.0
GlyTyr: 0.0 ± 0.0
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.757 ± 0.698
HisCys: 1.757 ± 0.894
HisAsp: 0.879 ± 0.718
HisGlu: 0.0 ± 0.0
HisPhe: 0.0 ± 0.0
HisGly: 1.757 ± 1.301
HisHis: 0.879 ± 0.838
HisIle: 3.515 ± 2.031
HisLys: 2.636 ± 1.154
HisLeu: 1.757 ± 1.232
HisMet: 0.0 ± 0.0
HisAsn: 3.515 ± 1.993
HisPro: 4.394 ± 1.852
HisGln: 0.879 ± 0.718
HisArg: 2.636 ± 1.51
HisSer: 1.757 ± 1.142
HisThr: 3.515 ± 2.288
HisVal: 1.757 ± 0.988
HisTrp: 0.879 ± 0.616
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.757 ± 1.675
IleCys: 0.879 ± 0.718
IleAsp: 2.636 ± 1.154
IleGlu: 2.636 ± 1.481
IlePhe: 0.879 ± 0.616
IleGly: 0.879 ± 0.838
IleHis: 0.879 ± 0.951
IleIle: 3.515 ± 1.128
IleLys: 7.03 ± 1.76
IleLeu: 3.515 ± 2.883
IleMet: 0.879 ± 0.718
IleAsn: 0.879 ± 0.952
IlePro: 2.636 ± 1.256
IleGln: 2.636 ± 1.481
IleArg: 6.151 ± 1.466
IleSer: 2.636 ± 0.934
IleThr: 7.03 ± 1.687
IleVal: 4.394 ± 1.446
IleTrp: 1.757 ± 1.008
IleTyr: 4.394 ± 1.712
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.272 ± 1.53
LysCys: 0.879 ± 0.939
LysAsp: 4.394 ± 3.081
LysGlu: 2.636 ± 1.848
LysPhe: 0.879 ± 0.952
LysGly: 0.879 ± 0.616
LysHis: 0.879 ± 0.616
LysIle: 5.272 ± 1.539
LysLys: 0.879 ± 0.838
LysLeu: 2.636 ± 0.898
LysMet: 0.879 ± 0.786
LysAsn: 4.394 ± 1.742
LysPro: 3.515 ± 1.057
LysGln: 0.0 ± 0.0
LysArg: 3.515 ± 2.148
LysSer: 2.636 ± 0.817
LysThr: 3.515 ± 1.78
LysVal: 6.151 ± 3.341
LysTrp: 0.0 ± 0.0
LysTyr: 2.636 ± 1.275
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 0.879 ± 0.718
LeuCys: 0.879 ± 0.616
LeuAsp: 7.03 ± 2.476
LeuGlu: 0.0 ± 0.0
LeuPhe: 5.272 ± 1.815
LeuGly: 8.787 ± 2.281
LeuHis: 2.636 ± 1.297
LeuIle: 2.636 ± 1.297
LeuLys: 6.151 ± 1.309
LeuLeu: 10.545 ± 1.632
LeuMet: 1.757 ± 1.841
LeuAsn: 2.636 ± 1.354
LeuPro: 6.151 ± 3.655
LeuGln: 1.757 ± 0.928
LeuArg: 3.515 ± 1.393
LeuSer: 4.394 ± 1.51
LeuThr: 7.909 ± 2.563
LeuVal: 1.757 ± 1.008
LeuTrp: 0.0 ± 0.0
LeuTyr: 3.515 ± 0.857
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.879 ± 0.718
MetCys: 0.0 ± 0.0
MetAsp: 2.636 ± 1.468
MetGlu: 0.0 ± 0.0
MetPhe: 2.636 ± 1.654
MetGly: 3.515 ± 2.212
MetHis: 1.757 ± 1.143
MetIle: 0.879 ± 0.718
MetLys: 0.879 ± 0.939
MetLeu: 0.879 ± 0.939
MetMet: 0.0 ± 0.0
MetAsn: 0.879 ± 0.718
MetPro: 1.757 ± 0.698
MetGln: 1.757 ± 0.89
MetArg: 0.879 ± 0.838
MetSer: 0.0 ± 0.0
MetThr: 0.0 ± 0.0
MetVal: 0.0 ± 0.0
MetTrp: 0.879 ± 0.616
MetTyr: 2.636 ± 1.14
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 6.151 ± 2.735
AsnCys: 0.879 ± 0.616
AsnAsp: 1.757 ± 1.143
AsnGlu: 2.636 ± 1.275
AsnPhe: 0.0 ± 0.0
AsnGly: 2.636 ± 0.934
AsnHis: 4.394 ± 2.743
AsnIle: 4.394 ± 1.495
AsnLys: 1.757 ± 0.698
AsnLeu: 3.515 ± 1.607
AsnMet: 2.636 ± 1.399
AsnAsn: 1.757 ± 1.008
AsnPro: 5.272 ± 1.775
AsnGln: 0.879 ± 0.951
AsnArg: 3.515 ± 1.842
AsnSer: 2.636 ± 0.898
AsnThr: 2.636 ± 1.723
AsnVal: 6.151 ± 2.64
AsnTrp: 0.879 ± 0.616
AsnTyr: 1.757 ± 1.232
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.636 ± 1.354
ProCys: 1.757 ± 1.142
ProAsp: 3.515 ± 2.086
ProGlu: 2.636 ± 1.209
ProPhe: 0.879 ± 0.616
ProGly: 0.879 ± 0.616
ProHis: 2.636 ± 1.287
ProIle: 0.879 ± 0.616
ProLys: 2.636 ± 1.104
ProLeu: 6.151 ± 2.533
ProMet: 1.757 ± 1.435
ProAsn: 1.757 ± 1.232
ProPro: 2.636 ± 1.287
ProGln: 3.515 ± 1.788
ProArg: 7.03 ± 2.426
ProSer: 6.151 ± 2.446
ProThr: 4.394 ± 1.662
ProVal: 4.394 ± 0.876
ProTrp: 1.757 ± 0.698
ProTyr: 1.757 ± 1.143
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.515 ± 1.22
GlnCys: 0.879 ± 0.616
GlnAsp: 3.515 ± 1.809
GlnGlu: 3.515 ± 1.034
GlnPhe: 0.0 ± 0.0
GlnGly: 4.394 ± 2.567
GlnHis: 0.0 ± 0.0
GlnIle: 1.757 ± 1.081
GlnLys: 0.879 ± 0.616
GlnLeu: 3.515 ± 2.093
GlnMet: 0.879 ± 0.939
GlnAsn: 1.757 ± 1.902
GlnPro: 0.879 ± 0.838
GlnGln: 0.879 ± 0.838
GlnArg: 1.757 ± 1.142
GlnSer: 4.394 ± 0.992
GlnThr: 0.0 ± 0.0
GlnVal: 2.636 ± 1.468
GlnTrp: 0.0 ± 0.0
GlnTyr: 2.636 ± 1.104
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.636 ± 1.942
ArgCys: 0.879 ± 0.951
ArgAsp: 1.757 ± 1.435
ArgGlu: 3.515 ± 1.034
ArgPhe: 5.272 ± 1.676
ArgGly: 3.515 ± 1.976
ArgHis: 2.636 ± 1.14
ArgIle: 7.03 ± 1.753
ArgLys: 2.636 ± 1.335
ArgLeu: 5.272 ± 1.917
ArgMet: 2.636 ± 1.359
ArgAsn: 3.515 ± 1.034
ArgPro: 4.394 ± 1.742
ArgGln: 1.757 ± 1.162
ArgArg: 7.03 ± 2.278
ArgSer: 3.515 ± 1.18
ArgThr: 3.515 ± 1.136
ArgVal: 7.03 ± 2.475
ArgTrp: 1.757 ± 0.928
ArgTyr: 1.757 ± 1.143
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 1.757 ± 1.232
SerCys: 1.757 ± 1.24
SerAsp: 2.636 ± 1.361
SerGlu: 0.879 ± 0.939
SerPhe: 4.394 ± 1.062
SerGly: 7.909 ± 1.822
SerHis: 2.636 ± 1.361
SerIle: 7.03 ± 1.686
SerLys: 2.636 ± 0.898
SerLeu: 7.03 ± 2.012
SerMet: 0.879 ± 0.882
SerAsn: 7.909 ± 1.735
SerPro: 6.151 ± 3.539
SerGln: 1.757 ± 0.89
SerArg: 5.272 ± 2.423
SerSer: 7.909 ± 3.603
SerThr: 3.515 ± 2.284
SerVal: 5.272 ± 1.826
SerTrp: 0.0 ± 0.0
SerTyr: 2.636 ± 1.481
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.515 ± 1.752
ThrCys: 0.879 ± 0.939
ThrAsp: 1.757 ± 1.142
ThrGlu: 1.757 ± 1.435
ThrPhe: 4.394 ± 1.551
ThrGly: 4.394 ± 1.134
ThrHis: 5.272 ± 1.826
ThrIle: 0.879 ± 0.616
ThrLys: 1.757 ± 0.89
ThrLeu: 5.272 ± 2.108
ThrMet: 0.879 ± 0.616
ThrAsn: 4.394 ± 0.964
ThrPro: 4.394 ± 3.072
ThrGln: 1.757 ± 1.24
ThrArg: 3.515 ± 1.784
ThrSer: 11.424 ± 4.466
ThrThr: 4.394 ± 2.599
ThrVal: 1.757 ± 1.142
ThrTrp: 0.0 ± 0.0
ThrTyr: 2.636 ± 1.481
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.515 ± 1.748
ValCys: 0.879 ± 0.951
ValAsp: 0.879 ± 0.616
ValGlu: 3.515 ± 1.321
ValPhe: 2.636 ± 0.934
ValGly: 2.636 ± 1.361
ValHis: 1.757 ± 1.675
ValIle: 5.272 ± 1.642
ValLys: 5.272 ± 2.095
ValLeu: 3.515 ± 1.917
ValMet: 1.757 ± 1.435
ValAsn: 4.394 ± 1.389
ValPro: 5.272 ± 1.73
ValGln: 4.394 ± 0.992
ValArg: 1.757 ± 1.143
ValSer: 5.272 ± 1.796
ValThr: 3.515 ± 1.525
ValVal: 1.757 ± 0.698
ValTrp: 1.757 ± 1.142
ValTyr: 5.272 ± 1.78
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.757 ± 1.232
TrpCys: 0.0 ± 0.0
TrpAsp: 0.879 ± 0.838
TrpGlu: 0.879 ± 0.952
TrpPhe: 0.879 ± 0.951
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 1.757 ± 0.698
TrpLeu: 0.879 ± 0.718
TrpMet: 0.879 ± 0.718
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.879 ± 0.616
TrpArg: 1.757 ± 0.988
TrpSer: 0.879 ± 0.616
TrpThr: 2.636 ± 1.07
TrpVal: 2.636 ± 0.925
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.757 ± 1.435
TyrCys: 0.879 ± 0.616
TyrAsp: 0.879 ± 0.718
TyrGlu: 0.879 ± 0.718
TyrPhe: 3.515 ± 0.857
TyrGly: 3.515 ± 1.371
TyrHis: 2.636 ± 1.481
TyrIle: 3.515 ± 1.08
TyrLys: 1.757 ± 1.232
TyrLeu: 3.515 ± 2.163
TyrMet: 3.515 ± 1.234
TyrAsn: 1.757 ± 0.698
TyrPro: 0.879 ± 0.616
TyrGln: 1.757 ± 0.698
TyrArg: 1.757 ± 1.435
TyrSer: 1.757 ± 0.698
TyrThr: 2.636 ± 1.427
TyrVal: 1.757 ± 1.162
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1139 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
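A protein-level bootstrap of this kind could look roughly like the sketch below. It is an illustration under stated assumptions rather than the database's actual procedure: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the standard deviation of those estimates is taken as the error. The function names, the one-letter sequence input, and the choice of standard deviation are all assumptions.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins, dipeptide):
    """Frequency of one dipeptide (in per mille) over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs within each protein; pairs never span two proteins.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, dipeptide))
    return statistics.stdev(estimates)
```

With only 6 proteins in this proteome, each resample can differ considerably from the original set, which is consistent with the relatively large errors in the table.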

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski