Amino acid dipeptide frequency for Cotton leaf curl Gezira virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
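As an illustration of the calculation described above, the following is a minimal sketch in Python, not the code used by Proteome-pI. It assumes one-letter amino acid codes, maps every non-standard symbol to X (Xaa), and counts dipeptides with a sliding window inside each protein; whether pairs spanning protein boundaries are counted is an assumption of this sketch. The function name `dipeptide_frequencies` and the toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

# 20 standard amino acids (one-letter codes); anything else is mapped to "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"

# Uniform expectation over the standard dipeptides: 1/400 = 0.25 % = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / len(STANDARD) ** 2


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs.

    Dipeptides are counted with a sliding window of width 2 inside each
    protein; pairs spanning two different proteins are not counted
    (an assumption of this sketch).
    """
    counts = Counter()
    for seq in proteins:
        # Normalise the sequence: upper case, unknown symbols become "X".
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides: every sequence is shorter than 2 residues")
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(ALPHABET, repeat=2)
    }


if __name__ == "__main__":
    toy = ["MSSPSRR", "MAALK"]  # made-up sequences, not the viral proteome
    freqs = dipeptide_frequencies(toy)
    # Show the three most frequent dipeptides and how they compare to 2.5 per mille.
    for pair, value in sorted(freqs.items(), key=lambda kv: -kv[1])[:3]:
        label = "over" if value > EXPECTED_PER_MILLE else "under"
        print(f"{pair}: {value:.1f} per mille ({label}-represented)")
```

With only a handful of residues, as in the toy input, almost every observed dipeptide lies far above 2.5‰, so the comparison with the uniform expectation becomes informative only for larger proteomes.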

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
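The unit conversion can be checked with a few values copied from the table. This is a small sketch; the helper names `per_mille_to_fraction` and `classify` are hypothetical, and the comparison uses the uniform expectation of 1/400 for the standard dipeptides.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (i.e. multiply by 10^-3)."""
    return value_per_mille / 1000.0


# Uniform expectation over the 400 standard dipeptides: 1/400 = 0.25 % = 2.5 per mille.
EXPECTED_FRACTION = 1 / 400


def classify(value_per_mille):
    """Label a table value relative to the uniform expectation."""
    fraction = per_mille_to_fraction(value_per_mille)
    if fraction > EXPECTED_FRACTION:
        return "over-represented"
    if fraction < EXPECTED_FRACTION:
        return "under-represented"
    return "as expected"


# A few values copied from the table (in per mille).
table_values = {"AlaAla": 5.43, "SerSer": 12.67, "AlaCys": 0.905, "CysCys": 0.0}

for name, value in table_values.items():
    fraction = per_mille_to_fraction(value)
    print(f"{name}: {value} per mille = {fraction * 100:.4f} % -> {classify(value)}")
```

For example, 5.43‰ for AlaAla corresponds to 0.543%, roughly twice the 0.25% expected under the uniform assumption.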

For more information, see the sequence space article on Wikipedia.

Residue order (rows: first residue; entries: second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Each entry is given as dipeptide: frequency ± error, in ‰.
Ala
AlaAla: 5.43 ± 2.68
AlaCys: 0.905 ± 0.866
AlaAsp: 1.81 ± 1.062
AlaGlu: 1.81 ± 1.236
AlaPhe: 0.905 ± 0.8
AlaGly: 0.905 ± 0.618
AlaHis: 2.715 ± 1.434
AlaIle: 0.905 ± 0.8
AlaLys: 4.525 ± 1.061
AlaLeu: 5.43 ± 2.032
AlaMet: 0.0 ± 0.0
AlaAsn: 1.81 ± 0.956
AlaPro: 4.525 ± 1.432
AlaGln: 1.81 ± 1.06
AlaArg: 4.525 ± 1.434
AlaSer: 6.335 ± 0.729
AlaThr: 4.525 ± 2.079
AlaVal: 2.715 ± 2.222
AlaTrp: 1.81 ± 0.713
AlaTyr: 0.905 ± 0.618
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.905 ± 0.8
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.905 ± 0.866
CysPhe: 1.81 ± 1.149
CysGly: 1.81 ± 0.899
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.905 ± 0.866
CysLeu: 0.0 ± 0.0
CysMet: 0.905 ± 1.017
CysAsn: 0.905 ± 0.618
CysPro: 2.715 ± 3.051
CysGln: 0.905 ± 0.618
CysArg: 1.81 ± 0.899
CysSer: 0.905 ± 0.618
CysThr: 3.62 ± 1.753
CysVal: 0.905 ± 0.866
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.715 ± 1.319
AspCys: 0.0 ± 0.0
AspAsp: 2.715 ± 0.82
AspGlu: 1.81 ± 0.713
AspPhe: 2.715 ± 0.831
AspGly: 2.715 ± 1.854
AspHis: 0.0 ± 0.0
AspIle: 1.81 ± 1.062
AspLys: 2.715 ± 1.319
AspLeu: 5.43 ± 2.183
AspMet: 0.905 ± 0.632
AspAsn: 3.62 ± 1.934
AspPro: 2.715 ± 1.983
AspGln: 0.905 ± 0.618
AspArg: 2.715 ± 1.765
AspSer: 5.43 ± 2.176
AspThr: 0.905 ± 0.618
AspVal: 5.43 ± 1.936
AspTrp: 1.81 ± 0.899
AspTyr: 0.905 ± 0.866
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.335 ± 1.769
GluCys: 0.0 ± 0.0
GluAsp: 3.62 ± 1.913
GluGlu: 4.525 ± 2.331
GluPhe: 2.715 ± 1.434
GluGly: 3.62 ± 0.991
GluHis: 0.0 ± 0.0
GluIle: 0.905 ± 0.847
GluLys: 0.0 ± 0.0
GluLeu: 7.24 ± 2.184
GluMet: 0.905 ± 0.847
GluAsn: 5.43 ± 2.186
GluPro: 2.715 ± 0.831
GluGln: 2.715 ± 1.461
GluArg: 0.905 ± 0.618
GluSer: 5.43 ± 3.002
GluThr: 1.81 ± 0.956
GluVal: 0.905 ± 0.967
GluTrp: 2.715 ± 1.319
GluTyr: 0.905 ± 0.618
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.0 ± 0.0
PheCys: 0.0 ± 0.0
PheAsp: 3.62 ± 1.426
PheGlu: 0.905 ± 0.618
PhePhe: 1.81 ± 1.731
PheGly: 0.905 ± 0.866
PheHis: 3.62 ± 1.448
PheIle: 0.905 ± 0.618
PheLys: 3.62 ± 1.059
PheLeu: 5.43 ± 1.389
PheMet: 1.81 ± 1.002
PheAsn: 1.81 ± 1.013
PhePro: 1.81 ± 1.06
PheGln: 4.525 ± 1.434
PheArg: 3.62 ± 2.076
PheSer: 2.715 ± 1.023
PheThr: 3.62 ± 1.68
PheVal: 0.905 ± 0.618
PheTrp: 0.0 ± 0.0
PheTyr: 0.905 ± 0.866
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.81 ± 1.236
GlyCys: 1.81 ± 1.062
GlyAsp: 2.715 ± 1.585
GlyGlu: 3.62 ± 1.362
GlyPhe: 1.81 ± 1.257
GlyGly: 3.62 ± 1.523
GlyHis: 0.905 ± 0.618
GlyIle: 3.62 ± 0.858
GlyLys: 4.525 ± 1.643
GlyLeu: 4.525 ± 1.858
GlyMet: 0.905 ± 0.866
GlyAsn: 1.81 ± 1.013
GlyPro: 3.62 ± 1.426
GlyGln: 4.525 ± 1.432
GlyArg: 1.81 ± 0.899
GlySer: 1.81 ± 0.956
GlyThr: 2.715 ± 0.82
GlyVal: 4.525 ± 3.526
GlyTrp: 0.0 ± 0.0
GlyTyr: 0.905 ± 1.017
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.905 ± 0.866
HisCys: 1.81 ± 1.257
HisAsp: 0.0 ± 0.0
HisGlu: 1.81 ± 0.899
HisPhe: 2.715 ± 1.406
HisGly: 1.81 ± 1.339
HisHis: 1.81 ± 1.257
HisIle: 0.905 ± 0.967
HisLys: 2.715 ± 1.49
HisLeu: 2.715 ± 1.854
HisMet: 0.0 ± 0.0
HisAsn: 2.715 ± 1.434
HisPro: 0.905 ± 0.618
HisGln: 1.81 ± 1.013
HisArg: 3.62 ± 1.455
HisSer: 0.905 ± 1.017
HisThr: 2.715 ± 2.597
HisVal: 1.81 ± 1.002
HisTrp: 0.905 ± 0.967
HisTyr: 0.905 ± 0.618
HisXaa: 0.0 ± 0.0
Ile
IleAla: 0.0 ± 0.0
IleCys: 0.905 ± 0.618
IleAsp: 3.62 ± 1.854
IleGlu: 0.905 ± 0.618
IlePhe: 1.81 ± 1.236
IleGly: 3.62 ± 1.699
IleHis: 0.905 ± 0.847
IleIle: 2.715 ± 1.684
IleLys: 7.24 ± 1.892
IleLeu: 1.81 ± 1.106
IleMet: 1.81 ± 1.09
IleAsn: 2.715 ± 1.655
IlePro: 1.81 ± 0.899
IleGln: 4.525 ± 1.398
IleArg: 7.24 ± 2.409
IleSer: 5.43 ± 1.454
IleThr: 1.81 ± 1.002
IleVal: 2.715 ± 1.461
IleTrp: 0.905 ± 0.847
IleTyr: 3.62 ± 1.636
IleXaa: 0.0 ± 0.0
Lys
LysAla: 0.905 ± 0.618
LysCys: 2.715 ± 1.434
LysAsp: 2.715 ± 1.854
LysGlu: 5.43 ± 1.934
LysPhe: 1.81 ± 0.713
LysGly: 5.43 ± 1.473
LysHis: 1.81 ± 0.713
LysIle: 5.43 ± 1.368
LysLys: 2.715 ± 0.82
LysLeu: 0.905 ± 0.847
LysMet: 0.0 ± 0.0
LysAsn: 3.62 ± 1.523
LysPro: 1.81 ± 0.713
LysGln: 2.715 ± 1.243
LysArg: 4.525 ± 2.781
LysSer: 3.62 ± 1.523
LysThr: 1.81 ± 1.002
LysVal: 4.525 ± 2.214
LysTrp: 0.0 ± 0.0
LysTyr: 4.525 ± 1.025
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 1.81 ± 1.06
LeuCys: 2.715 ± 1.406
LeuAsp: 3.62 ± 1.797
LeuGlu: 4.525 ± 2.444
LeuPhe: 1.81 ± 1.002
LeuGly: 4.525 ± 1.083
LeuHis: 1.81 ± 1.236
LeuIle: 3.62 ± 2.076
LeuLys: 4.525 ± 1.434
LeuLeu: 5.43 ± 2.023
LeuMet: 0.905 ± 0.83
LeuAsn: 5.43 ± 0.952
LeuPro: 0.0 ± 0.0
LeuGln: 7.24 ± 2.474
LeuArg: 2.715 ± 1.337
LeuSer: 4.525 ± 1.383
LeuThr: 6.335 ± 1.563
LeuVal: 3.62 ± 2.293
LeuTrp: 0.0 ± 0.0
LeuTyr: 5.43 ± 3.138
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.715 ± 1.023
MetCys: 0.905 ± 0.967
MetAsp: 2.715 ± 1.684
MetGlu: 0.905 ± 0.967
MetPhe: 2.715 ± 1.765
MetGly: 0.905 ± 0.618
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 1.81 ± 1.013
MetLeu: 0.905 ± 1.017
MetMet: 0.0 ± 0.0
MetAsn: 0.905 ± 0.847
MetPro: 0.0 ± 0.0
MetGln: 0.905 ± 0.847
MetArg: 1.81 ± 1.062
MetSer: 0.905 ± 0.866
MetThr: 0.905 ± 0.967
MetVal: 0.905 ± 0.847
MetTrp: 0.905 ± 1.017
MetTyr: 1.81 ± 1.731
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.525 ± 1.434
AsnCys: 0.0 ± 0.0
AsnAsp: 3.62 ± 0.959
AsnGlu: 0.905 ± 0.866
AsnPhe: 1.81 ± 1.013
AsnGly: 3.62 ± 1.912
AsnHis: 4.525 ± 2.424
AsnIle: 3.62 ± 1.52
AsnLys: 1.81 ± 0.713
AsnLeu: 3.62 ± 1.362
AsnMet: 0.905 ± 1.682
AsnAsn: 3.62 ± 1.696
AsnPro: 4.525 ± 0.961
AsnGln: 4.525 ± 1.936
AsnArg: 2.715 ± 1.023
AsnSer: 4.525 ± 1.474
AsnThr: 0.905 ± 0.618
AsnVal: 5.43 ± 1.747
AsnTrp: 0.0 ± 0.0
AsnTyr: 4.525 ± 1.025
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.525 ± 1.474
ProCys: 1.81 ± 1.062
ProAsp: 0.905 ± 0.866
ProGlu: 1.81 ± 1.06
ProPhe: 0.905 ± 0.618
ProGly: 2.715 ± 0.831
ProHis: 4.525 ± 2.444
ProIle: 1.81 ± 0.899
ProLys: 4.525 ± 1.383
ProLeu: 2.715 ± 1.337
ProMet: 1.81 ± 1.3
ProAsn: 3.62 ± 1.523
ProPro: 3.62 ± 1.362
ProGln: 2.715 ± 2.097
ProArg: 2.715 ± 1.941
ProSer: 7.24 ± 2.537
ProThr: 4.525 ± 1.724
ProVal: 2.715 ± 0.82
ProTrp: 0.0 ± 0.0
ProTyr: 1.81 ± 0.713
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.81 ± 1.106
GlnCys: 0.0 ± 0.0
GlnAsp: 0.905 ± 0.618
GlnGlu: 7.24 ± 1.736
GlnPhe: 1.81 ± 1.236
GlnGly: 1.81 ± 0.899
GlnHis: 1.81 ± 1.258
GlnIle: 9.05 ± 2.424
GlnLys: 1.81 ± 1.236
GlnLeu: 1.81 ± 1.257
GlnMet: 0.905 ± 0.8
GlnAsn: 4.525 ± 1.709
GlnPro: 1.81 ± 0.899
GlnGln: 2.715 ± 1.207
GlnArg: 3.62 ± 1.858
GlnSer: 4.525 ± 0.882
GlnThr: 6.335 ± 3.162
GlnVal: 3.62 ± 1.427
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.62 ± 1.683
ArgCys: 3.62 ± 1.99
ArgAsp: 5.43 ± 2.949
ArgGlu: 6.335 ± 2.526
ArgPhe: 4.525 ± 2.136
ArgGly: 1.81 ± 0.899
ArgHis: 0.905 ± 1.017
ArgIle: 4.525 ± 1.435
ArgLys: 3.62 ± 2.027
ArgLeu: 5.43 ± 1.817
ArgMet: 2.715 ± 1.985
ArgAsn: 0.0 ± 0.0
ArgPro: 7.24 ± 1.695
ArgGln: 1.81 ± 1.422
ArgArg: 8.145 ± 4.304
ArgSer: 4.525 ± 1.432
ArgThr: 4.525 ± 0.931
ArgVal: 2.715 ± 1.337
ArgTrp: 0.0 ± 0.0
ArgTyr: 0.905 ± 1.017
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.43 ± 1.247
SerCys: 0.0 ± 0.0
SerAsp: 4.525 ± 1.453
SerGlu: 2.715 ± 2.222
SerPhe: 1.81 ± 0.899
SerGly: 2.715 ± 1.016
SerHis: 0.0 ± 0.0
SerIle: 3.62 ± 0.947
SerLys: 3.62 ± 1.636
SerLeu: 1.81 ± 1.236
SerMet: 0.0 ± 0.0
SerAsn: 9.955 ± 2.777
SerPro: 10.86 ± 1.829
SerGln: 2.715 ± 1.668
SerArg: 7.24 ± 3.383
SerSer: 12.67 ± 5.367
SerThr: 6.335 ± 3.025
SerVal: 1.81 ± 1.106
SerTrp: 0.905 ± 0.618
SerTyr: 3.62 ± 0.959
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.62 ± 1.059
ThrCys: 1.81 ± 1.257
ThrAsp: 0.0 ± 0.0
ThrGlu: 2.715 ± 1.279
ThrPhe: 2.715 ± 1.287
ThrGly: 6.335 ± 3.008
ThrHis: 6.335 ± 2.332
ThrIle: 5.43 ± 2.381
ThrLys: 2.715 ± 1.016
ThrLeu: 4.525 ± 1.052
ThrMet: 0.905 ± 0.847
ThrAsn: 3.62 ± 1.325
ThrPro: 3.62 ± 1.584
ThrGln: 3.62 ± 1.432
ThrArg: 0.905 ± 0.967
ThrSer: 2.715 ± 2.209
ThrThr: 0.905 ± 0.847
ThrVal: 4.525 ± 1.289
ThrTrp: 0.905 ± 0.967
ThrTyr: 0.905 ± 0.8
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.81 ± 1.112
ValCys: 0.0 ± 0.0
ValAsp: 0.905 ± 0.618
ValGlu: 2.715 ± 1.337
ValPhe: 3.62 ± 0.858
ValGly: 0.0 ± 0.0
ValHis: 0.905 ± 1.017
ValIle: 4.525 ± 2.017
ValLys: 2.715 ± 0.831
ValLeu: 5.43 ± 2.367
ValMet: 2.715 ± 1.718
ValAsn: 0.905 ± 0.847
ValPro: 2.715 ± 0.82
ValGln: 4.525 ± 1.709
ValArg: 6.335 ± 3.565
ValSer: 4.525 ± 1.168
ValThr: 2.715 ± 2.597
ValVal: 2.715 ± 1.983
ValTrp: 2.715 ± 0.832
ValTyr: 2.715 ± 1.765
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.715 ± 1.854
TrpCys: 0.0 ± 0.0
TrpAsp: 0.905 ± 1.017
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.0 ± 0.0
TrpGly: 0.905 ± 0.618
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.0 ± 0.0
TrpLeu: 0.0 ± 0.0
TrpMet: 0.905 ± 0.866
TrpAsn: 0.905 ± 0.967
TrpPro: 0.0 ± 0.0
TrpGln: 0.905 ± 0.618
TrpArg: 0.905 ± 0.8
TrpSer: 2.715 ± 1.441
TrpThr: 1.81 ± 1.694
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.905 ± 0.618
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.715 ± 1.016
TyrCys: 0.0 ± 0.0
TyrAsp: 3.62 ± 2.032
TyrGlu: 1.81 ± 1.731
TyrPhe: 2.715 ± 0.832
TyrGly: 1.81 ± 0.713
TyrHis: 0.905 ± 1.017
TyrIle: 2.715 ± 1.016
TyrLys: 0.905 ± 0.618
TyrLeu: 5.43 ± 1.963
TyrMet: 2.715 ± 1.27
TyrAsn: 2.715 ± 0.832
TyrPro: 0.0 ± 0.0
TyrGln: 0.0 ± 0.0
TyrArg: 4.525 ± 3.309
TyrSer: 0.905 ± 0.618
TyrThr: 0.0 ± 0.0
TyrVal: 2.715 ± 1.207
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1106 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
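The page does not spell out the exact bootstrap procedure, so the following is only one plausible reading, sketched in Python: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the standard deviation across replicates is taken as the error. The function names, the fixed random seed, the use of the sample standard deviation, and the toy sequences are all assumptions of this sketch.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide over a list of protein sequences."""
    counts = Counter()
    for seq in proteins:
        # Sliding window of width 2 inside each protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Standard deviation of the frequency across protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the number of proteins fixed.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, pair))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["MSSPSRR", "MAALK", "MKVLSSPSS"]  # made-up sequences, not the viral proteome
    value = dipeptide_per_mille(toy, "SS")
    error = bootstrap_error(toy, "SS")
    print(f"SerSer: {value:.3f} ± {error:.3f} per mille")
```

Resampling at the protein level rather than the residue level keeps each protein intact, so the estimated error reflects how much the statistic depends on individual proteins. With only 6 proteins in this proteome, that dependence can be large, which is consistent with the wide errors in the table.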

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski