Amino acid dipeptide frequency for Tomato leaf curl Mali virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
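As a rough illustration of what a dipeptide frequency is, the sketch below counts overlapping adjacent pairs within each protein and reports them in per mille. The function name, the one-letter toy sequences, and the choice never to count a pair across two proteins are assumptions for illustration; the page does not describe the exact counting procedure used by Proteome-pI.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs inside one protein;
        # pairs are never formed across two different proteins (assumption).
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Made-up toy sequences in one-letter code, not taken from this proteome.
toy_proteins = ["MKKA", "AKK"]
freqs = dipeptide_frequencies(toy_proteins)
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

The toy input yields five dipeptides in total, two of which are KK, so KK makes up two fifths of all pairs.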

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all amino acids were equally frequent and paired at random, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
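The uniform expectation and the over/under-representation rule can be sketched in a few lines. The function names are hypothetical, and comparing against the 20-letter expectation of 2.5‰ is an assumption based on the 0.25% figure in the text; the page does not state the exact threshold used for colouring.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, alphabet_size=20):
    """Label a value relative to the uniform expectation (assumed rule)."""
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"   # shown in red on the original page
    if observed_per_mille < expected:
        return "under"  # shown in blue on the original page
    return "as expected"


print(expected_per_mille(20))            # 400 dipeptides
print(round(expected_per_mille(21), 3))  # 441 dipeptides, with Xaa
print(classify(12.891))                  # SerSer value from the table
print(classify(0.921))                   # AlaCys value from the table
```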

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
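For readers who prefer fractions or percentages, the unit conversion is a single multiplication. The helper names below are hypothetical; the input value is the SerSer entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent."""
    return value / 10.0


ser_ser = 12.891  # SerSer frequency from the table, in per mille
print(per_mille_to_fraction(ser_ser))
print(per_mille_to_percent(ser_ser))
```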

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰, value ± error), grouped by first residue in the order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 4.604 ± 2.45
AlaCys: 0.921 ± 0.724
AlaAsp: 2.762 ± 0.992
AlaGlu: 0.921 ± 0.61
AlaPhe: 0.0 ± 0.0
AlaGly: 1.842 ± 1.22
AlaHis: 1.842 ± 1.02
AlaIle: 2.762 ± 1.927
AlaLys: 4.604 ± 1.04
AlaLeu: 6.446 ± 1.833
AlaMet: 0.921 ± 0.613
AlaAsn: 1.842 ± 1.22
AlaPro: 0.921 ± 0.877
AlaGln: 4.604 ± 1.384
AlaArg: 2.762 ± 1.83
AlaSer: 4.604 ± 2.277
AlaThr: 5.525 ± 1.7
AlaVal: 3.683 ± 1.201
AlaTrp: 1.842 ± 1.22
AlaTyr: 0.0 ± 0.0
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.0 ± 0.0
CysCys: 1.842 ± 2.003
CysAsp: 0.0 ± 0.0
CysGlu: 0.921 ± 0.724
CysPhe: 0.0 ± 0.0
CysGly: 1.842 ± 0.96
CysHis: 0.0 ± 0.0
CysIle: 0.921 ± 0.724
CysLys: 0.921 ± 0.724
CysLeu: 0.921 ± 0.877
CysMet: 0.921 ± 1.002
CysAsn: 0.921 ± 0.61
CysPro: 1.842 ± 2.003
CysGln: 0.921 ± 0.61
CysArg: 0.921 ± 0.61
CysSer: 3.683 ± 2.778
CysThr: 2.762 ± 0.89
CysVal: 0.921 ± 0.724
CysTrp: 0.0 ± 0.0
CysTyr: 0.921 ± 0.9
CysXaa: 0.0 ± 0.0

Asp
AspAla: 1.842 ± 0.661
AspCys: 0.0 ± 0.0
AspAsp: 2.762 ± 0.89
AspGlu: 2.762 ± 1.045
AspPhe: 0.921 ± 0.724
AspGly: 2.762 ± 1.83
AspHis: 0.0 ± 0.0
AspIle: 2.762 ± 1.631
AspLys: 0.921 ± 0.61
AspLeu: 9.208 ± 2.536
AspMet: 0.921 ± 0.9
AspAsn: 3.683 ± 0.93
AspPro: 1.842 ± 1.02
AspGln: 1.842 ± 0.938
AspArg: 2.762 ± 1.245
AspSer: 4.604 ± 1.04
AspThr: 0.921 ± 0.61
AspVal: 5.525 ± 0.997
AspTrp: 1.842 ± 0.96
AspTyr: 0.921 ± 0.61
AspXaa: 0.0 ± 0.0

Glu
GluAla: 5.525 ± 1.526
GluCys: 0.0 ± 0.0
GluAsp: 0.921 ± 0.61
GluGlu: 5.525 ± 1.526
GluPhe: 2.762 ± 1.349
GluGly: 3.683 ± 1.062
GluHis: 1.842 ± 0.938
GluIle: 0.921 ± 0.9
GluLys: 1.842 ± 1.22
GluLeu: 7.366 ± 1.935
GluMet: 0.0 ± 0.0
GluAsn: 5.525 ± 2.586
GluPro: 4.604 ± 0.966
GluGln: 1.842 ± 1.449
GluArg: 0.921 ± 0.975
GluSer: 0.921 ± 1.002
GluThr: 2.762 ± 1.256
GluVal: 0.921 ± 0.61
GluTrp: 2.762 ± 1.279
GluTyr: 0.0 ± 0.0
GluXaa: 0.0 ± 0.0

Phe
PheAla: 0.0 ± 0.0
PheCys: 0.921 ± 0.724
PheAsp: 2.762 ± 1.245
PheGlu: 0.921 ± 0.61
PhePhe: 0.921 ± 0.61
PheGly: 0.921 ± 0.724
PheHis: 2.762 ± 0.898
PheIle: 2.762 ± 0.798
PheLys: 2.762 ± 1.734
PheLeu: 4.604 ± 1.695
PheMet: 0.0 ± 0.0
PheAsn: 3.683 ± 2.624
PhePro: 0.921 ± 1.002
PheGln: 4.604 ± 1.695
PheArg: 0.921 ± 1.002
PheSer: 2.762 ± 1.83
PheThr: 1.842 ± 0.96
PheVal: 0.921 ± 0.61
PheTrp: 0.0 ± 0.0
PheTyr: 3.683 ± 2.897
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 3.683 ± 1.759
GlyCys: 2.762 ± 0.89
GlyAsp: 2.762 ± 1.279
GlyGlu: 3.683 ± 1.44
GlyPhe: 1.842 ± 1.343
GlyGly: 2.762 ± 1.045
GlyHis: 1.842 ± 0.938
GlyIle: 2.762 ± 0.798
GlyLys: 5.525 ± 1.983
GlyLeu: 1.842 ± 1.131
GlyMet: 0.0 ± 0.0
GlyAsn: 0.921 ± 0.877
GlyPro: 3.683 ± 1.927
GlyGln: 1.842 ± 1.118
GlyArg: 1.842 ± 1.22
GlySer: 2.762 ± 1.349
GlyThr: 1.842 ± 1.131
GlyVal: 1.842 ± 1.013
GlyTrp: 0.0 ± 0.0
GlyTyr: 0.921 ± 1.002
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.842 ± 1.449
HisCys: 2.762 ± 2.123
HisAsp: 1.842 ± 1.278
HisGlu: 1.842 ± 0.938
HisPhe: 3.683 ± 1.395
HisGly: 1.842 ± 1.343
HisHis: 1.842 ± 0.96
HisIle: 3.683 ± 2.076
HisLys: 2.762 ± 1.552
HisLeu: 1.842 ± 1.22
HisMet: 0.921 ± 0.812
HisAsn: 2.762 ± 1.301
HisPro: 0.921 ± 0.61
HisGln: 1.842 ± 1.131
HisArg: 1.842 ± 1.131
HisSer: 0.921 ± 1.002
HisThr: 2.762 ± 2.173
HisVal: 3.683 ± 0.962
HisTrp: 0.0 ± 0.0
HisTyr: 0.921 ± 0.61
HisXaa: 0.0 ± 0.0

Ile
IleAla: 0.0 ± 0.0
IleCys: 0.921 ± 0.61
IleAsp: 3.683 ± 1.803
IleGlu: 0.921 ± 1.002
IlePhe: 4.604 ± 1.431
IleGly: 0.0 ± 0.0
IleHis: 1.842 ± 1.316
IleIle: 5.525 ± 3.547
IleLys: 10.129 ± 1.856
IleLeu: 2.762 ± 1.285
IleMet: 0.0 ± 0.0
IleAsn: 2.762 ± 1.427
IlePro: 0.921 ± 0.61
IleGln: 2.762 ± 0.89
IleArg: 8.287 ± 2.354
IleSer: 7.366 ± 3.702
IleThr: 1.842 ± 1.013
IleVal: 1.842 ± 0.661
IleTrp: 2.762 ± 1.774
IleTyr: 1.842 ± 1.013
IleXaa: 0.0 ± 0.0

Lys
LysAla: 2.762 ± 1.349
LysCys: 0.921 ± 0.9
LysAsp: 1.842 ± 1.22
LysGlu: 5.525 ± 1.983
LysPhe: 3.683 ± 1.201
LysGly: 1.842 ± 0.96
LysHis: 2.762 ± 0.798
LysIle: 4.604 ± 1.816
LysLys: 4.604 ± 1.703
LysLeu: 2.762 ± 0.992
LysMet: 0.921 ± 0.724
LysAsn: 3.683 ± 1.579
LysPro: 2.762 ± 0.89
LysGln: 2.762 ± 1.479
LysArg: 3.683 ± 2.897
LysSer: 6.446 ± 1.854
LysThr: 4.604 ± 1.003
LysVal: 2.762 ± 0.89
LysTrp: 0.0 ± 0.0
LysTyr: 4.604 ± 0.972
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 1.842 ± 1.02
LeuCys: 1.842 ± 1.22
LeuAsp: 4.604 ± 1.695
LeuGlu: 5.525 ± 1.821
LeuPhe: 0.921 ± 0.61
LeuGly: 4.604 ± 1.003
LeuHis: 3.683 ± 1.92
LeuIle: 2.762 ± 1.927
LeuLys: 5.525 ± 1.431
LeuLeu: 3.683 ± 1.771
LeuMet: 1.842 ± 0.899
LeuAsn: 8.287 ± 2.36
LeuPro: 1.842 ± 0.96
LeuGln: 5.525 ± 2.91
LeuArg: 6.446 ± 3.56
LeuSer: 4.604 ± 1.825
LeuThr: 6.446 ± 1.816
LeuVal: 2.762 ± 1.245
LeuTrp: 0.921 ± 0.724
LeuTyr: 2.762 ± 1.774
LeuXaa: 0.0 ± 0.0

Met
MetAla: 0.921 ± 0.724
MetCys: 0.0 ± 0.0
MetAsp: 3.683 ± 1.618
MetGlu: 0.0 ± 0.0
MetPhe: 1.842 ± 1.449
MetGly: 1.842 ± 0.981
MetHis: 0.0 ± 0.0
MetIle: 0.921 ± 0.9
MetLys: 0.0 ± 0.0
MetLeu: 1.842 ± 1.109
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 0.921 ± 0.61
MetGln: 0.0 ± 0.0
MetArg: 0.921 ± 0.975
MetSer: 0.0 ± 0.0
MetThr: 0.0 ± 0.0
MetVal: 0.0 ± 0.0
MetTrp: 0.921 ± 1.002
MetTyr: 3.683 ± 2.315
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 4.604 ± 2.153
AsnCys: 2.762 ± 1.836
AsnAsp: 2.762 ± 0.992
AsnGlu: 1.842 ± 1.109
AsnPhe: 1.842 ± 1.013
AsnGly: 0.921 ± 0.877
AsnHis: 7.366 ± 3.72
AsnIle: 6.446 ± 3.372
AsnLys: 1.842 ± 0.96
AsnLeu: 4.604 ± 2.185
AsnMet: 1.842 ± 1.278
AsnAsn: 3.683 ± 1.567
AsnPro: 3.683 ± 1.074
AsnGln: 2.762 ± 1.279
AsnArg: 1.842 ± 0.938
AsnSer: 2.762 ± 1.875
AsnThr: 2.762 ± 1.256
AsnVal: 4.604 ± 0.972
AsnTrp: 0.0 ± 0.0
AsnTyr: 2.762 ± 1.349
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 3.683 ± 1.771
ProCys: 1.842 ± 1.109
ProAsp: 2.762 ± 0.89
ProGlu: 1.842 ± 1.02
ProPhe: 1.842 ± 0.938
ProGly: 2.762 ± 0.992
ProHis: 2.762 ± 1.349
ProIle: 4.604 ± 2.971
ProLys: 4.604 ± 1.48
ProLeu: 2.762 ± 1.211
ProMet: 1.842 ± 1.449
ProAsn: 2.762 ± 1.2
ProPro: 1.842 ± 0.981
ProGln: 5.525 ± 1.831
ProArg: 4.604 ± 1.084
ProSer: 4.604 ± 1.942
ProThr: 5.525 ± 1.613
ProVal: 1.842 ± 0.661
ProTrp: 0.921 ± 0.61
ProTyr: 1.842 ± 1.449
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 5.525 ± 1.776
GlnCys: 0.921 ± 0.61
GlnAsp: 0.921 ± 0.975
GlnGlu: 4.604 ± 1.703
GlnPhe: 0.921 ± 0.61
GlnGly: 1.842 ± 1.22
GlnHis: 0.921 ± 0.877
GlnIle: 5.525 ± 2.698
GlnLys: 0.921 ± 1.002
GlnLeu: 2.762 ± 0.992
GlnMet: 0.0 ± 0.0
GlnAsn: 1.842 ± 1.02
GlnPro: 6.446 ± 3.541
GlnGln: 1.842 ± 1.949
GlnArg: 2.762 ± 1.045
GlnSer: 4.604 ± 1.73
GlnThr: 3.683 ± 1.803
GlnVal: 5.525 ± 1.883
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.921 ± 0.61
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 4.604 ± 0.972
ArgCys: 1.842 ± 1.109
ArgAsp: 5.525 ± 1.7
ArgGlu: 1.842 ± 0.981
ArgPhe: 4.604 ± 1.157
ArgGly: 4.604 ± 1.919
ArgHis: 2.762 ± 1.479
ArgIle: 1.842 ± 0.661
ArgLys: 3.683 ± 2.147
ArgLeu: 1.842 ± 1.109
ArgMet: 0.921 ± 0.877
ArgAsn: 0.921 ± 0.9
ArgPro: 8.287 ± 2.04
ArgGln: 1.842 ± 1.109
ArgArg: 5.525 ± 3.073
ArgSer: 3.683 ± 1.579
ArgThr: 3.683 ± 1.074
ArgVal: 4.604 ± 2.952
ArgTrp: 0.0 ± 0.0
ArgTyr: 1.842 ± 1.109
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 2.762 ± 1.83
SerCys: 0.0 ± 0.0
SerAsp: 3.683 ± 0.93
SerGlu: 2.762 ± 2.087
SerPhe: 1.842 ± 0.96
SerGly: 1.842 ± 1.013
SerHis: 1.842 ± 1.131
SerIle: 5.525 ± 2.279
SerLys: 3.683 ± 1.771
SerLeu: 1.842 ± 1.22
SerMet: 0.921 ± 1.036
SerAsn: 7.366 ± 2.336
SerPro: 10.129 ± 2.464
SerGln: 3.683 ± 1.92
SerArg: 4.604 ± 1.179
SerSer: 12.891 ± 3.484
SerThr: 5.525 ± 2.016
SerVal: 1.842 ± 2.003
SerTrp: 0.0 ± 0.0
SerTyr: 3.683 ± 1.023
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 5.525 ± 1.664
ThrCys: 0.921 ± 0.877
ThrAsp: 0.921 ± 0.61
ThrGlu: 4.604 ± 1.201
ThrPhe: 0.921 ± 0.61
ThrGly: 5.525 ± 2.122
ThrHis: 4.604 ± 3.079
ThrIle: 0.921 ± 0.975
ThrLys: 1.842 ± 1.22
ThrLeu: 5.525 ± 1.109
ThrMet: 0.921 ± 1.002
ThrAsn: 5.525 ± 1.737
ThrPro: 2.762 ± 0.89
ThrGln: 2.762 ± 1.2
ThrArg: 2.762 ± 0.898
ThrSer: 3.683 ± 1.209
ThrThr: 2.762 ± 1.955
ThrVal: 3.683 ± 1.927
ThrTrp: 0.921 ± 0.877
ThrTyr: 2.762 ± 0.898
ThrXaa: 0.0 ± 0.0

Val
ValAla: 0.921 ± 0.61
ValCys: 0.0 ± 0.0
ValAsp: 1.842 ± 0.96
ValGlu: 0.921 ± 1.002
ValPhe: 2.762 ± 1.774
ValGly: 0.921 ± 0.975
ValHis: 0.921 ± 1.002
ValIle: 1.842 ± 0.938
ValLys: 5.525 ± 2.491
ValLeu: 6.446 ± 1.485
ValMet: 0.921 ± 0.724
ValAsn: 1.842 ± 1.244
ValPro: 4.604 ± 1.924
ValGln: 3.683 ± 1.349
ValArg: 4.604 ± 2.865
ValSer: 3.683 ± 0.996
ValThr: 2.762 ± 1.245
ValVal: 2.762 ± 1.583
ValTrp: 0.921 ± 0.61
ValTyr: 4.604 ± 1.228
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.842 ± 1.22
TrpCys: 0.0 ± 0.0
TrpAsp: 0.921 ± 1.002
TrpGlu: 0.921 ± 0.9
TrpPhe: 0.0 ± 0.0
TrpGly: 0.921 ± 0.61
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.0 ± 0.0
TrpLeu: 0.0 ± 0.0
TrpMet: 0.921 ± 0.724
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.921 ± 0.61
TrpArg: 1.842 ± 0.96
TrpSer: 0.921 ± 0.975
TrpThr: 1.842 ± 1.013
TrpVal: 0.921 ± 0.61
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.842 ± 0.981
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 0.921 ± 0.724
TyrCys: 0.0 ± 0.0
TyrAsp: 1.842 ± 1.109
TyrGlu: 2.762 ± 1.583
TyrPhe: 2.762 ± 0.798
TyrGly: 1.842 ± 0.661
TyrHis: 0.921 ± 0.61
TyrIle: 3.683 ± 1.062
TyrLys: 1.842 ± 0.661
TyrLeu: 7.366 ± 1.841
TyrMet: 1.842 ± 1.004
TyrAsn: 3.683 ± 0.996
TyrPro: 1.842 ± 0.981
TyrGln: 0.921 ± 0.724
TyrArg: 4.604 ± 2.998
TyrSer: 1.842 ± 0.661
TyrThr: 0.0 ± 0.0
TyrVal: 1.842 ± 1.02
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.921 ± 0.975
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 6 proteins (1087 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level
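Bootstrapping at the protein level means resampling whole proteins with replacement and recomputing the statistic each time. The sketch below is one plausible reading of that note, assuming 100 rounds and the sample standard deviation as the reported error; the function names, the toy sequences, and the choice of standard deviation are assumptions, since the page does not give the exact procedure.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins):
    """Dipeptide frequencies in per mille for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of one dipeptide frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample).get(pair, 0.0))
    # Assumption: the error is the sample standard deviation of the estimates.
    return statistics.stdev(estimates)


# Made-up toy sequences in one-letter code, not taken from this proteome.
toy_proteins = ["MKKA", "AKK", "MSSP", "KSS", "MAKSP", "SSKK"]
print(bootstrap_error(toy_proteins, "KK"))
```

With only six proteins, as in this proteome, each resample can differ a lot from the original set, which is consistent with the fairly large errors relative to the values in the table.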

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski