Amino acid dipeptide frequency for Cotton leaf curl Burewala virus - [India:Vehari:2006]

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
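
The arithmetic behind the expected frequency can be sketched in a few lines of Python. This is only an illustration: the function names are hypothetical, and it assumes that the red/blue marking compares each value with the uniform expectation of 1/400, which the page implies but does not spell out.

```python
from fractions import Fraction

STANDARD = 20   # the 20 standard amino acids
WITH_XAA = 21   # standard amino acids plus Xaa


def uniform_expectation(alphabet_size: int = STANDARD) -> Fraction:
    """Frequency of each dipeptide if all ordered pairs were equally likely."""
    return Fraction(1, alphabet_size ** 2)


def classify(freq_per_mille: float, alphabet_size: int = STANDARD) -> str:
    """Label a per-mille frequency relative to the uniform expectation.

    Assumption: the threshold is exactly the uniform expectation
    (2.5 per mille for 20 amino acids).
    """
    expected = float(uniform_expectation(alphabet_size) * 1000)
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"
```

For example, `classify(6.66)` labels the AlaAla value from the table as overrepresented, while `classify(0.951)` labels AlaCys as underrepresented.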

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
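
As a rough illustration of how such per-mille values could be obtained, the sketch below counts overlapping dipeptides within each protein and normalises by the total number of dipeptides. The function names, the use of one-letter sequence codes, and the choice of denominator are assumptions made for this example; the database may normalise differently.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Dipeptides are counted as overlapping pairs inside each protein;
    pairs spanning two different proteins are not counted (assumption).
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000 * n / total for pair, n in counts.items()}


def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value / 1000.0
```

With the toy input `["AAAC", "CA"]` there are four dipeptides (AA, AA, AC, CA), so AA comes out at 500‰, that is, a fraction of 0.5.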

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
6.66AlaAla: 6.66 ± 2.299
0.951AlaCys: 0.951 ± 0.875
0.951AlaAsp: 0.951 ± 0.875
0.951AlaGlu: 0.951 ± 1.103
0.951AlaPhe: 0.951 ± 0.936
3.806AlaGly: 3.806 ± 1.286
3.806AlaHis: 3.806 ± 2.24
1.903AlaIle: 1.903 ± 1.342
2.854AlaLys: 2.854 ± 1.195
7.612AlaLeu: 7.612 ± 1.632
0.0AlaMet: 0.0 ± 0.0
3.806AlaAsn: 3.806 ± 1.009
3.806AlaPro: 3.806 ± 1.763
4.757AlaGln: 4.757 ± 1.44
4.757AlaArg: 4.757 ± 2.443
3.806AlaSer: 3.806 ± 2.373
4.757AlaThr: 4.757 ± 2.195
2.854AlaVal: 2.854 ± 1.647
1.903AlaTrp: 1.903 ± 0.804
2.854AlaTyr: 2.854 ± 1.426
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
1.903CysCys: 1.903 ± 2.427
0.0CysAsp: 0.0 ± 0.0
1.903CysGlu: 1.903 ± 1.267
0.951CysPhe: 0.951 ± 0.936
0.951CysGly: 0.951 ± 0.671
0.0CysHis: 0.0 ± 0.0
0.951CysIle: 0.951 ± 1.103
0.951CysLys: 0.951 ± 0.875
0.0CysLeu: 0.0 ± 0.0
0.951CysMet: 0.951 ± 1.214
0.951CysAsn: 0.951 ± 0.671
1.903CysPro: 1.903 ± 2.427
0.0CysGln: 0.0 ± 0.0
0.951CysArg: 0.951 ± 0.671
2.854CysSer: 2.854 ± 1.435
1.903CysThr: 1.903 ± 0.804
1.903CysVal: 1.903 ± 1.75
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.854AspAla: 2.854 ± 2.013
0.0AspCys: 0.0 ± 0.0
0.951AspAsp: 0.951 ± 0.671
0.951AspGlu: 0.951 ± 0.875
1.903AspPhe: 1.903 ± 0.804
2.854AspGly: 2.854 ± 2.013
0.951AspHis: 0.951 ± 0.936
2.854AspIle: 2.854 ± 1.981
1.903AspLys: 1.903 ± 0.804
3.806AspLeu: 3.806 ± 2.477
0.0AspMet: 0.0 ± 0.0
2.854AspAsn: 2.854 ± 0.864
1.903AspPro: 1.903 ± 1.238
3.806AspGln: 3.806 ± 1.918
2.854AspArg: 2.854 ± 1.541
5.709AspSer: 5.709 ± 1.778
2.854AspThr: 2.854 ± 2.358
5.709AspVal: 5.709 ± 2.095
0.951AspTrp: 0.951 ± 0.671
0.0AspTyr: 0.0 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
3.806GluAla: 3.806 ± 1.22
0.0GluCys: 0.0 ± 0.0
2.854GluAsp: 2.854 ± 2.072
5.709GluGlu: 5.709 ± 3.15
3.806GluPhe: 3.806 ± 2.087
5.709GluGly: 5.709 ± 2.23
0.0GluHis: 0.0 ± 0.0
0.0GluIle: 0.0 ± 0.0
0.951GluLys: 0.951 ± 1.214
3.806GluLeu: 3.806 ± 1.934
0.0GluMet: 0.0 ± 0.0
2.854GluAsn: 2.854 ± 2.625
3.806GluPro: 3.806 ± 1.249
2.854GluGln: 2.854 ± 2.322
0.0GluArg: 0.0 ± 0.0
4.757GluSer: 4.757 ± 2.871
0.951GluThr: 0.951 ± 0.671
2.854GluVal: 2.854 ± 1.093
0.951GluTrp: 0.951 ± 0.671
0.0GluTyr: 0.0 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
0.0PheAla: 0.0 ± 0.0
0.951PheCys: 0.951 ± 0.875
3.806PheAsp: 3.806 ± 1.608
2.854PheGlu: 2.854 ± 1.048
1.903PhePhe: 1.903 ± 0.804
1.903PheGly: 1.903 ± 1.172
2.854PheHis: 2.854 ± 1.579
3.806PheIle: 3.806 ± 1.286
3.806PheLys: 3.806 ± 1.282
4.757PheLeu: 4.757 ± 1.736
0.951PheMet: 0.951 ± 0.671
2.854PheAsn: 2.854 ± 2.063
1.903PhePro: 1.903 ± 1.584
1.903PheGln: 1.903 ± 1.342
4.757PheArg: 4.757 ± 2.639
0.951PheSer: 0.951 ± 0.671
0.951PheThr: 0.951 ± 0.936
2.854PheVal: 2.854 ± 1.426
0.0PheTrp: 0.0 ± 0.0
0.951PheTyr: 0.951 ± 0.875
0.0PheXaa: 0.0 ± 0.0
Gly
3.806GlyAla: 3.806 ± 1.282
0.951GlyCys: 0.951 ± 0.875
2.854GlyAsp: 2.854 ± 1.413
4.757GlyGlu: 4.757 ± 0.945
0.951GlyPhe: 0.951 ± 1.214
3.806GlyGly: 3.806 ± 1.763
0.951GlyHis: 0.951 ± 0.671
2.854GlyIle: 2.854 ± 1.819
7.612GlyLys: 7.612 ± 3.216
0.951GlyLeu: 0.951 ± 0.875
0.0GlyMet: 0.0 ± 0.0
0.951GlyAsn: 0.951 ± 0.936
3.806GlyPro: 3.806 ± 1.763
2.854GlyGln: 2.854 ± 0.864
1.903GlyArg: 1.903 ± 1.002
2.854GlySer: 2.854 ± 2.013
2.854GlyThr: 2.854 ± 1.426
0.0GlyVal: 0.0 ± 0.0
0.0GlyTrp: 0.0 ± 0.0
0.951GlyTyr: 0.951 ± 1.214
0.0GlyXaa: 0.0 ± 0.0
His
1.903HisAla: 1.903 ± 1.267
0.951HisCys: 0.951 ± 1.214
0.951HisAsp: 0.951 ± 0.875
1.903HisGlu: 1.903 ± 1.238
4.757HisPhe: 4.757 ± 1.966
0.951HisGly: 0.951 ± 1.214
0.0HisHis: 0.0 ± 0.0
1.903HisIle: 1.903 ± 1.172
0.951HisLys: 0.951 ± 0.936
2.854HisLeu: 2.854 ± 1.413
0.0HisMet: 0.0 ± 0.0
2.854HisAsn: 2.854 ± 1.426
1.903HisPro: 1.903 ± 1.076
1.903HisGln: 1.903 ± 1.584
1.903HisArg: 1.903 ± 1.75
2.854HisSer: 2.854 ± 2.101
0.951HisThr: 0.951 ± 0.875
2.854HisVal: 2.854 ± 1.819
0.0HisTrp: 0.0 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
0.951IleAla: 0.951 ± 0.936
0.951IleCys: 0.951 ± 1.214
4.757IleAsp: 4.757 ± 2.526
0.951IleGlu: 0.951 ± 0.671
1.903IlePhe: 1.903 ± 1.342
0.951IleGly: 0.951 ± 0.875
1.903IleHis: 1.903 ± 1.227
2.854IleIle: 2.854 ± 1.981
5.709IleLys: 5.709 ± 1.228
1.903IleLeu: 1.903 ± 1.505
0.0IleMet: 0.0 ± 0.0
2.854IleAsn: 2.854 ± 1.061
1.903IlePro: 1.903 ± 1.076
5.709IleGln: 5.709 ± 1.228
5.709IleArg: 5.709 ± 2.484
6.66IleSer: 6.66 ± 3.72
3.806IleThr: 3.806 ± 1.282
2.854IleVal: 2.854 ± 1.093
2.854IleTrp: 2.854 ± 2.063
1.903IleTyr: 1.903 ± 1.279
0.0IleXaa: 0.0 ± 0.0
Lys
5.709LysAla: 5.709 ± 1.173
0.951LysCys: 0.951 ± 0.936
1.903LysAsp: 1.903 ± 1.342
3.806LysGlu: 3.806 ± 1.763
3.806LysPhe: 3.806 ± 0.841
0.951LysGly: 0.951 ± 0.671
1.903LysHis: 1.903 ± 1.076
2.854LysIle: 2.854 ± 1.981
1.903LysLys: 1.903 ± 0.804
0.951LysLeu: 0.951 ± 0.671
0.951LysMet: 0.951 ± 1.103
6.66LysAsn: 6.66 ± 1.917
4.757LysPro: 4.757 ± 1.947
0.0LysGln: 0.0 ± 0.0
3.806LysArg: 3.806 ± 1.366
4.757LysSer: 4.757 ± 2.12
3.806LysThr: 3.806 ± 1.008
4.757LysVal: 4.757 ± 2.297
0.951LysTrp: 0.951 ± 0.875
5.709LysTyr: 5.709 ± 0.674
0.0LysXaa: 0.0 ± 0.0
Leu
1.903LeuAla: 1.903 ± 1.238
2.854LeuCys: 2.854 ± 2.013
1.903LeuAsp: 1.903 ± 1.342
3.806LeuGlu: 3.806 ± 1.934
0.0LeuPhe: 0.0 ± 0.0
4.757LeuGly: 4.757 ± 1.943
1.903LeuHis: 1.903 ± 1.342
5.709LeuIle: 5.709 ± 3.727
6.66LeuLys: 6.66 ± 1.56
1.903LeuLeu: 1.903 ± 1.584
1.903LeuMet: 1.903 ± 1.279
6.66LeuAsn: 6.66 ± 0.957
0.951LeuPro: 0.951 ± 1.103
2.854LeuGln: 2.854 ± 1.415
6.66LeuArg: 6.66 ± 1.327
3.806LeuSer: 3.806 ± 2.151
3.806LeuThr: 3.806 ± 1.81
4.757LeuVal: 4.757 ± 2.226
0.951LeuTrp: 0.951 ± 0.936
2.854LeuTyr: 2.854 ± 1.093
0.0LeuXaa: 0.0 ± 0.0
Met
1.903MetAla: 1.903 ± 0.804
1.903MetCys: 1.903 ± 1.172
2.854MetAsp: 2.854 ± 2.063
0.951MetGlu: 0.951 ± 1.103
1.903MetPhe: 1.903 ± 1.75
2.854MetGly: 2.854 ± 1.435
0.0MetHis: 0.0 ± 0.0
0.951MetIle: 0.951 ± 0.936
0.0MetLys: 0.0 ± 0.0
1.903MetLeu: 1.903 ± 1.267
0.0MetMet: 0.0 ± 0.0
0.951MetAsn: 0.951 ± 0.875
0.951MetPro: 0.951 ± 1.103
0.0MetGln: 0.0 ± 0.0
0.0MetArg: 0.0 ± 0.0
1.903MetSer: 1.903 ± 1.172
0.951MetThr: 0.951 ± 0.936
0.0MetVal: 0.0 ± 0.0
1.903MetTrp: 1.903 ± 1.238
1.903MetTyr: 1.903 ± 1.75
0.0MetXaa: 0.0 ± 0.0
Asn
3.806AsnAla: 3.806 ± 1.763
0.951AsnCys: 0.951 ± 1.103
1.903AsnAsp: 1.903 ± 1.342
3.806AsnGlu: 3.806 ± 1.22
0.951AsnPhe: 0.951 ± 0.875
0.951AsnGly: 0.951 ± 0.936
3.806AsnHis: 3.806 ± 2.012
4.757AsnIle: 4.757 ± 1.629
0.951AsnLys: 0.951 ± 0.671
3.806AsnLeu: 3.806 ± 2.004
2.854AsnMet: 2.854 ± 1.721
3.806AsnAsn: 3.806 ± 1.869
2.854AsnPro: 2.854 ± 1.093
2.854AsnGln: 2.854 ± 0.864
3.806AsnArg: 3.806 ± 1.707
3.806AsnSer: 3.806 ± 1.81
2.854AsnThr: 2.854 ± 2.013
5.709AsnVal: 5.709 ± 1.431
0.951AsnTrp: 0.951 ± 0.671
3.806AsnTyr: 3.806 ± 1.22
0.0AsnXaa: 0.0 ± 0.0
Pro
2.854ProAla: 2.854 ± 1.541
1.903ProCys: 1.903 ± 1.267
2.854ProAsp: 2.854 ± 2.322
0.951ProGlu: 0.951 ± 0.671
2.854ProPhe: 2.854 ± 1.061
1.903ProGly: 1.903 ± 1.076
3.806ProHis: 3.806 ± 2.087
3.806ProIle: 3.806 ± 1.494
4.757ProLys: 4.757 ± 2.421
3.806ProLeu: 3.806 ± 1.625
0.951ProMet: 0.951 ± 0.875
3.806ProAsn: 3.806 ± 1.282
4.757ProPro: 4.757 ± 1.44
3.806ProGln: 3.806 ± 1.81
5.709ProArg: 5.709 ± 1.802
4.757ProSer: 4.757 ± 0.948
4.757ProThr: 4.757 ± 1.92
2.854ProVal: 2.854 ± 1.541
0.0ProTrp: 0.0 ± 0.0
1.903ProTyr: 1.903 ± 0.804
0.0ProXaa: 0.0 ± 0.0
Gln
6.66GlnAla: 6.66 ± 1.308
0.951GlnCys: 0.951 ± 0.671
2.854GlnAsp: 2.854 ± 2.322
3.806GlnGlu: 3.806 ± 1.009
2.854GlnPhe: 2.854 ± 1.426
1.903GlnGly: 1.903 ± 1.342
3.806GlnHis: 3.806 ± 2.028
2.854GlnIle: 2.854 ± 2.013
1.903GlnLys: 1.903 ± 2.427
2.854GlnLeu: 2.854 ± 2.597
0.951GlnMet: 0.951 ± 1.103
0.0GlnAsn: 0.0 ± 0.0
3.806GlnPro: 3.806 ± 2.365
3.806GlnGln: 3.806 ± 1.009
1.903GlnArg: 1.903 ± 1.076
6.66GlnSer: 6.66 ± 1.308
0.951GlnThr: 0.951 ± 0.671
2.854GlnVal: 2.854 ± 1.093
0.951GlnTrp: 0.951 ± 1.103
0.951GlnTyr: 0.951 ± 0.875
0.0GlnXaa: 0.0 ± 0.0
Arg
3.806ArgAla: 3.806 ± 1.494
1.903ArgCys: 1.903 ± 2.427
3.806ArgAsp: 3.806 ± 1.481
1.903ArgGlu: 1.903 ± 1.076
4.757ArgPhe: 4.757 ± 1.277
1.903ArgGly: 1.903 ± 1.75
1.903ArgHis: 1.903 ± 1.267
5.709ArgIle: 5.709 ± 1.944
3.806ArgLys: 3.806 ± 1.828
1.903ArgLeu: 1.903 ± 1.172
1.903ArgMet: 1.903 ± 1.75
1.903ArgAsn: 1.903 ± 1.584
6.66ArgPro: 6.66 ± 1.81
1.903ArgGln: 1.903 ± 1.584
2.854ArgArg: 2.854 ± 2.625
6.66ArgSer: 6.66 ± 2.69
4.757ArgThr: 4.757 ± 2.194
5.709ArgVal: 5.709 ± 2.113
0.0ArgTrp: 0.0 ± 0.0
1.903ArgTyr: 1.903 ± 1.267
0.0ArgXaa: 0.0 ± 0.0
Ser
5.709SerAla: 5.709 ± 3.227
0.0SerCys: 0.0 ± 0.0
1.903SerAsp: 1.903 ± 0.804
2.854SerGlu: 2.854 ± 1.195
2.854SerPhe: 2.854 ± 0.864
0.951SerGly: 0.951 ± 1.103
0.0SerHis: 0.0 ± 0.0
3.806SerIle: 3.806 ± 1.687
4.757SerLys: 4.757 ± 2.195
6.66SerLeu: 6.66 ± 4.171
4.757SerMet: 4.757 ± 2.858
5.709SerAsn: 5.709 ± 2.065
8.563SerPro: 8.563 ± 2.04
4.757SerGln: 4.757 ± 2.077
9.515SerArg: 9.515 ± 2.417
12.369SerSer: 12.369 ± 6.727
5.709SerThr: 5.709 ± 2.854
2.854SerVal: 2.854 ± 2.625
0.0SerTrp: 0.0 ± 0.0
2.854SerTyr: 2.854 ± 1.061
0.0SerXaa: 0.0 ± 0.0
Thr
3.806ThrAla: 3.806 ± 1.286
0.0ThrCys: 0.0 ± 0.0
0.951ThrAsp: 0.951 ± 1.103
0.0ThrGlu: 0.0 ± 0.0
1.903ThrPhe: 1.903 ± 1.076
5.709ThrGly: 5.709 ± 1.878
2.854ThrHis: 2.854 ± 1.75
1.903ThrIle: 1.903 ± 1.076
4.757ThrLys: 4.757 ± 1.253
5.709ThrLeu: 5.709 ± 1.655
0.951ThrMet: 0.951 ± 0.671
3.806ThrAsn: 3.806 ± 1.707
3.806ThrPro: 3.806 ± 1.008
2.854ThrGln: 2.854 ± 1.061
2.854ThrArg: 2.854 ± 0.864
3.806ThrSer: 3.806 ± 1.394
1.903ThrThr: 1.903 ± 1.227
2.854ThrVal: 2.854 ± 1.808
0.951ThrTrp: 0.951 ± 1.103
1.903ThrTyr: 1.903 ± 1.238
0.0ThrXaa: 0.0 ± 0.0
Val
0.0ValAla: 0.0 ± 0.0
0.951ValCys: 0.951 ± 0.671
4.757ValAsp: 4.757 ± 1.905
1.903ValGlu: 1.903 ± 2.427
2.854ValPhe: 2.854 ± 2.063
0.951ValGly: 0.951 ± 0.875
1.903ValHis: 1.903 ± 1.267
3.806ValIle: 3.806 ± 2.087
5.709ValLys: 5.709 ± 1.877
7.612ValLeu: 7.612 ± 2.392
1.903ValMet: 1.903 ± 1.75
1.903ValAsn: 1.903 ± 1.279
2.854ValPro: 2.854 ± 1.093
4.757ValGln: 4.757 ± 2.12
3.806ValArg: 3.806 ± 2.783
4.757ValSer: 4.757 ± 1.349
3.806ValThr: 3.806 ± 3.5
3.806ValVal: 3.806 ± 1.869
0.951ValTrp: 0.951 ± 0.671
3.806ValTyr: 3.806 ± 2.373
0.0ValXaa: 0.0 ± 0.0
Trp
3.806TrpAla: 3.806 ± 1.763
0.0TrpCys: 0.0 ± 0.0
0.951TrpAsp: 0.951 ± 1.214
0.951TrpGlu: 0.951 ± 0.936
0.0TrpPhe: 0.0 ± 0.0
0.951TrpGly: 0.951 ± 0.671
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
0.951TrpLeu: 0.951 ± 1.103
1.903TrpMet: 1.903 ± 1.279
0.951TrpAsn: 0.951 ± 0.936
0.0TrpPro: 0.0 ± 0.0
0.951TrpGln: 0.951 ± 0.671
0.0TrpArg: 0.0 ± 0.0
0.0TrpSer: 0.0 ± 0.0
0.0TrpThr: 0.0 ± 0.0
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
2.854TrpTyr: 2.854 ± 0.864
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.806TyrAla: 3.806 ± 2.373
0.0TyrCys: 0.0 ± 0.0
1.903TyrAsp: 1.903 ± 1.267
0.951TyrGlu: 0.951 ± 0.875
2.854TyrPhe: 2.854 ± 1.093
0.951TyrGly: 0.951 ± 0.671
0.0TyrHis: 0.0 ± 0.0
3.806TyrIle: 3.806 ± 1.99
0.951TyrLys: 0.951 ± 1.103
2.854TyrLeu: 2.854 ± 1.415
1.903TyrMet: 1.903 ± 1.214
3.806TyrAsn: 3.806 ± 1.22
1.903TyrPro: 1.903 ± 1.076
0.951TyrGln: 0.951 ± 0.875
1.903TyrArg: 1.903 ± 1.75
2.854TyrSer: 2.854 ± 1.579
0.951TyrThr: 0.951 ± 1.103
4.757TyrVal: 4.757 ± 2.69
0.0TyrTrp: 0.0 ± 0.0
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1052 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
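
A protein-level bootstrap of this kind might look like the following sketch. The page does not describe the exact procedure, so the details here are assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the sample standard deviation of the 100 estimates is taken as the error. All function names are hypothetical.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Per-mille frequency of one dipeptide across a list of sequences."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)
```

Under this scheme a dipeptide that never occurs has a frequency of 0 in every resample, which is consistent with the "0.0 ± 0.0" entries in the table.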

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski