Amino acid dipeptide frequency for Jatropha leaf curl virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
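The uniform expectation and the over/under-representation labels can be illustrated with a minimal Python sketch. The function name `classify` and the strict greater-than/less-than comparison are assumptions made for illustration; the page does not state the exact rule behind the red and blue marking. The three example values are copied from the table.

```python
# Uniform expectation: each of the 400 standard dipeptides is equally likely.
N_STANDARD = 20
EXPECTED_PERMILLE = 1000.0 / (N_STANDARD ** 2)  # 2.5 per mille, i.e. 0.25%


def classify(freq_permille, expected=EXPECTED_PERMILLE):
    """Label a frequency relative to the uniform expectation (hypothetical rule)."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


# Mean values (per mille) taken from the table.
examples = {"SerSer": 13.072, "AlaAsp": 0.0, "AlaGlu": 2.801}
labels = {name: classify(value) for name, value in examples.items()}
```

With 441 possible dipeptides (Xaa included) the same calculation would use `21 ** 2` in the denominator, giving roughly 2.27‰.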

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
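As a rough illustration of how such per mille values could be obtained, the sketch below counts overlapping residue pairs and normalises by the total number of pairs. It uses one-letter amino acid codes rather than the three-letter codes of the table. The function `dipeptide_permille`, the toy sequences, and the choice to count pairs only within a protein (never across protein boundaries) are assumptions; the page does not describe its counting procedure.

```python
from collections import Counter


def dipeptide_permille(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs inside one protein; pairs never span two proteins
        # (an assumption of this sketch).
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not real viral sequences.
toy = ["MAAS", "SSA"]
freqs = dipeptide_permille(toy)

# Converting per mille back to a fraction: multiply by 10^-3.
fraction_aa = freqs["AA"] * 1e-3
```

Here the toy input yields five pairs (MA, AA, AS, SS, SA), so each has a frequency of 200‰, which corresponds to a fraction of 0.2.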

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.47 ± 2.588
AlaCys: 0.934 ± 0.813
AlaAsp: 0.0 ± 0.0
AlaGlu: 2.801 ± 1.476
AlaPhe: 0.934 ± 0.689
AlaGly: 0.0 ± 0.0
AlaHis: 0.0 ± 0.0
AlaIle: 1.867 ± 0.767
AlaLys: 4.669 ± 1.323
AlaLeu: 6.536 ± 1.982
AlaMet: 0.0 ± 0.0
AlaAsn: 3.735 ± 1.975
AlaPro: 4.669 ± 1.509
AlaGln: 1.867 ± 1.379
AlaArg: 6.536 ± 2.952
AlaSer: 6.536 ± 2.617
AlaThr: 3.735 ± 1.77
AlaVal: 3.735 ± 1.153
AlaTrp: 0.934 ± 0.813
AlaTyr: 0.0 ± 0.0
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.934 ± 1.116
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.934 ± 0.813
CysPhe: 0.934 ± 1.021
CysGly: 0.934 ± 0.689
CysHis: 0.0 ± 0.0
CysIle: 0.934 ± 0.813
CysLys: 0.934 ± 0.813
CysLeu: 0.0 ± 0.0
CysMet: 0.934 ± 1.044
CysAsn: 0.934 ± 0.689
CysPro: 2.801 ± 1.492
CysGln: 0.934 ± 1.044
CysArg: 1.867 ± 2.088
CysSer: 2.801 ± 2.119
CysThr: 0.934 ± 0.813
CysVal: 0.934 ± 0.813
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.867 ± 1.112
AspCys: 0.0 ± 0.0
AspAsp: 0.934 ± 0.689
AspGlu: 1.867 ± 0.767
AspPhe: 0.934 ± 0.689
AspGly: 3.735 ± 2.018
AspHis: 0.934 ± 1.021
AspIle: 2.801 ± 1.759
AspLys: 1.867 ± 1.112
AspLeu: 6.536 ± 1.693
AspMet: 1.867 ± 1.626
AspAsn: 0.934 ± 0.813
AspPro: 1.867 ± 1.088
AspGln: 0.934 ± 1.021
AspArg: 6.536 ± 1.739
AspSer: 7.47 ± 1.77
AspThr: 1.867 ± 1.455
AspVal: 7.47 ± 1.541
AspTrp: 2.801 ± 1.476
AspTyr: 2.801 ± 1.492
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.801 ± 1.211
GluCys: 0.0 ± 0.0
GluAsp: 3.735 ± 1.619
GluGlu: 2.801 ± 1.345
GluPhe: 1.867 ± 1.088
GluGly: 2.801 ± 0.912
GluHis: 0.0 ± 0.0
GluIle: 3.735 ± 1.528
GluLys: 3.735 ± 1.152
GluLeu: 3.735 ± 1.457
GluMet: 0.934 ± 0.998
GluAsn: 3.735 ± 2.186
GluPro: 1.867 ± 1.189
GluGln: 2.801 ± 1.473
GluArg: 0.934 ± 0.689
GluSer: 1.867 ± 2.088
GluThr: 2.801 ± 2.017
GluVal: 1.867 ± 1.395
GluTrp: 0.0 ± 0.0
GluTyr: 1.867 ± 1.088
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.0 ± 0.0
PheCys: 0.934 ± 0.813
PheAsp: 3.735 ± 1.816
PheGlu: 0.934 ± 0.813
PhePhe: 0.934 ± 0.689
PheGly: 1.867 ± 1.626
PheHis: 0.934 ± 0.689
PheIle: 3.735 ± 1.386
PheLys: 5.602 ± 1.845
PheLeu: 3.735 ± 1.936
PheMet: 0.934 ± 0.689
PheAsn: 4.669 ± 4.022
PhePro: 0.934 ± 1.044
PheGln: 2.801 ± 1.476
PheArg: 4.669 ± 1.676
PheSer: 1.867 ± 1.112
PheThr: 2.801 ± 2.119
PheVal: 0.0 ± 0.0
PheTrp: 0.934 ± 0.813
PheTyr: 1.867 ± 1.626
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.867 ± 1.379
GlyCys: 0.934 ± 0.813
GlyAsp: 3.735 ± 2.224
GlyGlu: 0.934 ± 1.021
GlyPhe: 3.735 ± 1.013
GlyGly: 6.536 ± 1.681
GlyHis: 1.867 ± 1.112
GlyIle: 2.801 ± 0.912
GlyLys: 5.602 ± 1.628
GlyLeu: 0.0 ± 0.0
GlyMet: 1.867 ± 0.995
GlyAsn: 0.934 ± 0.813
GlyPro: 4.669 ± 1.953
GlyGln: 1.867 ± 0.767
GlyArg: 1.867 ± 1.088
GlySer: 5.602 ± 2.231
GlyThr: 0.934 ± 1.116
GlyVal: 4.669 ± 2.542
GlyTrp: 0.0 ± 0.0
GlyTyr: 0.934 ± 1.044
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.801 ± 1.422
HisCys: 2.801 ± 2.274
HisAsp: 0.934 ± 0.813
HisGlu: 1.867 ± 1.112
HisPhe: 1.867 ± 1.379
HisGly: 2.801 ± 2.274
HisHis: 2.801 ± 2.397
HisIle: 0.934 ± 0.689
HisLys: 1.867 ± 1.189
HisLeu: 1.867 ± 0.968
HisMet: 0.934 ± 1.021
HisAsn: 2.801 ± 1.335
HisPro: 1.867 ± 1.379
HisGln: 1.867 ± 1.247
HisArg: 1.867 ± 1.247
HisSer: 1.867 ± 1.379
HisThr: 4.669 ± 2.485
HisVal: 0.934 ± 1.021
HisTrp: 0.0 ± 0.0
HisTyr: 0.934 ± 0.689
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.867 ± 1.245
IleCys: 0.0 ± 0.0
IleAsp: 2.801 ± 2.068
IleGlu: 0.934 ± 0.689
IlePhe: 5.602 ± 2.244
IleGly: 0.934 ± 0.689
IleHis: 3.735 ± 1.936
IleIle: 1.867 ± 1.189
IleLys: 3.735 ± 2.474
IleLeu: 0.934 ± 0.998
IleMet: 0.934 ± 1.021
IleAsn: 0.934 ± 1.021
IlePro: 1.867 ± 1.088
IleGln: 6.536 ± 2.099
IleArg: 4.669 ± 1.116
IleSer: 11.204 ± 4.821
IleThr: 2.801 ± 1.762
IleVal: 2.801 ± 1.492
IleTrp: 2.801 ± 0.872
IleTyr: 1.867 ± 1.626
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.735 ± 1.541
LysCys: 1.867 ± 0.968
LysAsp: 0.934 ± 0.689
LysGlu: 5.602 ± 2.105
LysPhe: 0.934 ± 0.813
LysGly: 1.867 ± 1.379
LysHis: 1.867 ± 1.379
LysIle: 1.867 ± 1.626
LysLys: 0.934 ± 0.813
LysLeu: 1.867 ± 0.767
LysMet: 0.934 ± 0.689
LysAsn: 5.602 ± 1.628
LysPro: 2.801 ± 0.912
LysGln: 0.0 ± 0.0
LysArg: 4.669 ± 2.976
LysSer: 4.669 ± 1.655
LysThr: 3.735 ± 1.153
LysVal: 1.867 ± 1.189
LysTrp: 0.934 ± 0.813
LysTyr: 2.801 ± 0.912
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 2.801 ± 1.314
LeuCys: 3.735 ± 1.816
LeuAsp: 6.536 ± 2.547
LeuGlu: 4.669 ± 2.093
LeuPhe: 1.867 ± 0.767
LeuGly: 3.735 ± 1.38
LeuHis: 3.735 ± 1.153
LeuIle: 4.669 ± 1.334
LeuLys: 3.735 ± 1.816
LeuLeu: 5.602 ± 1.352
LeuMet: 2.801 ± 2.137
LeuAsn: 1.867 ± 0.767
LeuPro: 2.801 ± 2.374
LeuGln: 5.602 ± 3.514
LeuArg: 9.337 ± 3.763
LeuSer: 7.47 ± 2.929
LeuThr: 2.801 ± 1.314
LeuVal: 1.867 ± 1.189
LeuTrp: 0.0 ± 0.0
LeuTyr: 3.735 ± 2.206
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.867 ± 1.626
MetCys: 0.0 ± 0.0
MetAsp: 2.801 ± 1.762
MetGlu: 0.934 ± 0.998
MetPhe: 2.801 ± 2.266
MetGly: 1.867 ± 0.767
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 0.0 ± 0.0
MetLeu: 2.801 ± 2.173
MetMet: 0.934 ± 0.998
MetAsn: 0.0 ± 0.0
MetPro: 2.801 ± 1.624
MetGln: 1.867 ± 1.362
MetArg: 0.0 ± 0.0
MetSer: 1.867 ± 1.466
MetThr: 0.0 ± 0.0
MetVal: 0.934 ± 0.813
MetTrp: 1.867 ± 1.088
MetTyr: 0.934 ± 0.813
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.735 ± 1.816
AsnCys: 0.934 ± 1.116
AsnAsp: 2.801 ± 1.211
AsnGlu: 2.801 ± 1.537
AsnPhe: 0.934 ± 0.813
AsnGly: 1.867 ± 1.189
AsnHis: 4.669 ± 2.485
AsnIle: 4.669 ± 2.465
AsnLys: 0.0 ± 0.0
AsnLeu: 4.669 ± 2.792
AsnMet: 2.801 ± 1.713
AsnAsn: 2.801 ± 1.537
AsnPro: 2.801 ± 0.872
AsnGln: 0.0 ± 0.0
AsnArg: 1.867 ± 1.245
AsnSer: 2.801 ± 1.161
AsnThr: 2.801 ± 1.314
AsnVal: 2.801 ± 1.335
AsnTrp: 0.934 ± 0.689
AsnTyr: 1.867 ± 0.767
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.801 ± 1.211
ProCys: 0.934 ± 0.813
ProAsp: 3.735 ± 1.847
ProGlu: 4.669 ± 3.041
ProPhe: 3.735 ± 1.029
ProGly: 2.801 ± 2.119
ProHis: 3.735 ± 2.758
ProIle: 2.801 ± 1.273
ProLys: 2.801 ± 1.314
ProLeu: 8.403 ± 1.697
ProMet: 0.934 ± 1.144
ProAsn: 3.735 ± 1.396
ProPro: 4.669 ± 2.248
ProGln: 0.0 ± 0.0
ProArg: 5.602 ± 1.415
ProSer: 3.735 ± 1.342
ProThr: 0.934 ± 0.689
ProVal: 1.867 ± 0.767
ProTrp: 0.934 ± 0.813
ProTyr: 0.934 ± 0.689
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.735 ± 1.704
GlnCys: 0.934 ± 1.044
GlnAsp: 1.867 ± 1.362
GlnGlu: 1.867 ± 0.767
GlnPhe: 2.801 ± 1.335
GlnGly: 0.934 ± 0.689
GlnHis: 0.0 ± 0.0
GlnIle: 3.735 ± 1.528
GlnLys: 0.934 ± 1.044
GlnLeu: 1.867 ± 1.395
GlnMet: 0.0 ± 0.0
GlnAsn: 0.0 ± 0.0
GlnPro: 0.934 ± 1.116
GlnGln: 1.867 ± 1.112
GlnArg: 2.801 ± 1.624
GlnSer: 6.536 ± 4.098
GlnThr: 2.801 ± 1.249
GlnVal: 5.602 ± 1.912
GlnTrp: 0.934 ± 1.116
GlnTyr: 0.934 ± 0.689
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.867 ± 1.626
ArgCys: 1.867 ± 2.088
ArgAsp: 6.536 ± 1.572
ArgGlu: 2.801 ± 1.368
ArgPhe: 2.801 ± 0.872
ArgGly: 5.602 ± 2.0
ArgHis: 4.669 ± 1.798
ArgIle: 5.602 ± 2.074
ArgLys: 1.867 ± 1.626
ArgLeu: 4.669 ± 2.531
ArgMet: 0.934 ± 1.116
ArgAsn: 2.801 ± 2.068
ArgPro: 6.536 ± 2.081
ArgGln: 3.735 ± 1.528
ArgArg: 9.337 ± 4.615
ArgSer: 4.669 ± 1.476
ArgThr: 1.867 ± 1.362
ArgVal: 9.337 ± 2.454
ArgTrp: 0.0 ± 0.0
ArgTyr: 2.801 ± 1.537
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.735 ± 1.816
SerCys: 1.867 ± 1.112
SerAsp: 4.669 ± 1.655
SerGlu: 1.867 ± 1.158
SerPhe: 5.602 ± 1.945
SerGly: 1.867 ± 1.112
SerHis: 2.801 ± 1.829
SerIle: 5.602 ± 3.077
SerLys: 2.801 ± 0.912
SerLeu: 5.602 ± 3.144
SerMet: 1.867 ± 1.344
SerAsn: 5.602 ± 1.535
SerPro: 8.403 ± 3.047
SerGln: 2.801 ± 2.119
SerArg: 8.403 ± 3.617
SerSer: 13.072 ± 4.138
SerThr: 3.735 ± 2.289
SerVal: 7.47 ± 2.285
SerTrp: 0.0 ± 0.0
SerTyr: 2.801 ± 1.492
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.669 ± 1.352
ThrCys: 0.0 ± 0.0
ThrAsp: 1.867 ± 2.232
ThrGlu: 2.801 ± 1.414
ThrPhe: 0.0 ± 0.0
ThrGly: 4.669 ± 2.464
ThrHis: 4.669 ± 2.971
ThrIle: 2.801 ± 1.0
ThrLys: 1.867 ± 0.767
ThrLeu: 2.801 ± 1.402
ThrMet: 1.867 ± 0.968
ThrAsn: 1.867 ± 1.134
ThrPro: 2.801 ± 1.314
ThrGln: 1.867 ± 1.466
ThrArg: 2.801 ± 1.762
ThrSer: 1.867 ± 1.455
ThrThr: 0.934 ± 1.116
ThrVal: 4.669 ± 2.317
ThrTrp: 0.934 ± 0.813
ThrTyr: 0.934 ± 0.689
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.801 ± 2.017
ValCys: 0.0 ± 0.0
ValAsp: 4.669 ± 2.294
ValGlu: 0.934 ± 1.021
ValPhe: 3.735 ± 1.737
ValGly: 2.801 ± 1.674
ValHis: 0.0 ± 0.0
ValIle: 5.602 ± 3.712
ValLys: 5.602 ± 2.274
ValLeu: 10.271 ± 2.815
ValMet: 0.0 ± 0.0
ValAsn: 2.801 ± 1.161
ValPro: 3.735 ± 1.016
ValGln: 1.867 ± 0.767
ValArg: 2.801 ± 1.467
ValSer: 3.735 ± 1.816
ValThr: 3.735 ± 3.251
ValVal: 1.867 ± 0.767
ValTrp: 1.867 ± 0.968
ValTyr: 4.669 ± 2.976
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.801 ± 2.068
TrpCys: 0.0 ± 0.0
TrpAsp: 1.867 ± 1.455
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.0 ± 0.0
TrpGly: 0.934 ± 0.689
TrpHis: 0.934 ± 0.813
TrpIle: 0.934 ± 0.813
TrpLys: 0.0 ± 0.0
TrpLeu: 0.934 ± 0.813
TrpMet: 0.934 ± 0.813
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.934 ± 0.689
TrpArg: 1.867 ± 1.247
TrpSer: 0.0 ± 0.0
TrpThr: 2.801 ± 2.062
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.934 ± 0.689
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.801 ± 1.674
TyrCys: 0.0 ± 0.0
TyrAsp: 1.867 ± 1.626
TyrGlu: 1.867 ± 1.134
TyrPhe: 1.867 ± 1.189
TyrGly: 2.801 ± 2.017
TyrHis: 0.934 ± 0.689
TyrIle: 1.867 ± 1.379
TyrLys: 0.934 ± 0.689
TyrLeu: 6.536 ± 2.348
TyrMet: 0.934 ± 0.813
TyrAsn: 2.801 ± 0.872
TyrPro: 1.867 ± 1.379
TyrGln: 0.934 ± 0.813
TyrArg: 1.867 ± 1.626
TyrSer: 0.934 ± 0.689
TyrThr: 0.0 ± 0.0
TyrVal: 2.801 ± 1.422
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1072 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
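A protein-level bootstrap of this kind might look like the following sketch: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the 100 estimates is reported. The helper names, the fixed seed, the toy sequences, and the use of the population standard deviation are all assumptions; the page states only that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_sd(proteins, pair, n_boot=100, seed=0):
    """Spread of the frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed only for reproducibility
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the same count.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_permille(sample, pair))
    # Population SD is a choice made for this sketch.
    return statistics.pstdev(estimates)


# Toy input, not real viral sequences.
toy = ["MAAS", "SSAAK", "MKV", "AAAA"]
sd_aa = bootstrap_sd(toy, "AA")
```

With only a handful of proteins, as in a small viral proteome, individual resamples can differ a lot from one another, which is consistent with the relatively large errors seen in the table.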

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski