Amino acid dipeptide frequency for Tomato leaf curl Anjouan virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
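
As a rough illustration of what a dipeptide frequency is, the following minimal Python sketch counts overlapping residue pairs in a set of protein sequences and reports them in per mille. The function name `dipeptide_permille`, the one-letter input format, and the choice to count pairs only within each protein are assumptions made for this example; the page does not describe how Proteome-pI computes its values.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else is treated as Xaa ('X').
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to 'X' (assumption for this sketch).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Slide a window of width 2 along the sequence; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}

# Two toy sequences (hypothetical, not taken from this proteome).
freqs = dipeptide_permille(["MKKLA", "AKKB"])
```

In the toy input, `B` is not a standard residue, so the second sequence contributes the pairs AK, KK and KX.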

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with all amino acids equally frequent, each dipeptide would be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
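
The expected value under this uniform assumption, and the over/under comparison, can be sketched in a few lines of Python. The function names and the use of the uniform expectation as the cut-off are assumptions for illustration; the page does not say exactly which threshold drives the red and blue marking.

```python
def expected_permille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2

def classify(observed_permille, alphabet_size=20):
    """Compare an observed frequency (per mille) with the uniform expectation."""
    expected = expected_permille(alphabet_size)
    if observed_permille > expected:
        return "over"      # would be marked red under this assumption
    if observed_permille < expected:
        return "under"     # would be marked blue under this assumption
    return "expected"

uniform_20 = expected_permille(20)   # 400 possible dipeptides
uniform_21 = expected_permille(21)   # 441 possible dipeptides when Xaa is included
```

With 20 amino acids the expectation is 2.5‰, so a value such as SerSer (9.025‰) falls on the overrepresented side and a value of 0.903‰ on the underrepresented side.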

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions (for example, 3.61‰ = 0.00361 = 0.361%).

For more information, see the sequence space article on Wikipedia.

Second residue (columns): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
3.61AlaAla: 3.61 ± 1.196
0.903AlaCys: 0.903 ± 0.817
1.805AlaAsp: 1.805 ± 1.253
1.805AlaGlu: 1.805 ± 1.513
0.0AlaPhe: 0.0 ± 0.0
0.903AlaGly: 0.903 ± 0.627
1.805AlaHis: 1.805 ± 1.076
0.903AlaIle: 0.903 ± 1.154
3.61AlaLys: 3.61 ± 1.147
4.513AlaLeu: 4.513 ± 1.887
0.0AlaMet: 0.0 ± 0.0
0.903AlaAsn: 0.903 ± 0.627
2.708AlaPro: 2.708 ± 1.477
5.415AlaGln: 5.415 ± 1.864
4.513AlaArg: 4.513 ± 2.313
3.61AlaSer: 3.61 ± 1.528
5.415AlaThr: 5.415 ± 2.173
2.708AlaVal: 2.708 ± 1.484
2.708AlaTrp: 2.708 ± 1.165
0.903AlaTyr: 0.903 ± 0.627
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
1.805CysCys: 1.805 ± 2.307
0.0CysAsp: 0.0 ± 0.0
0.903CysGlu: 0.903 ± 0.817
0.903CysPhe: 0.903 ± 1.103
1.805CysGly: 1.805 ± 1.079
0.0CysHis: 0.0 ± 0.0
0.903CysIle: 0.903 ± 0.817
0.903CysLys: 0.903 ± 0.817
0.903CysLeu: 0.903 ± 1.296
0.903CysMet: 0.903 ± 1.154
0.903CysAsn: 0.903 ± 0.627
1.805CysPro: 1.805 ± 2.307
0.0CysGln: 0.0 ± 0.0
0.903CysArg: 0.903 ± 0.627
3.61CysSer: 3.61 ± 3.065
2.708CysThr: 2.708 ± 0.96
2.708CysVal: 2.708 ± 1.116
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.61AspAla: 3.61 ± 2.507
0.0AspCys: 0.0 ± 0.0
1.805AspAsp: 1.805 ± 0.787
1.805AspGlu: 1.805 ± 1.147
1.805AspPhe: 1.805 ± 1.364
0.903AspGly: 0.903 ± 0.627
0.0AspHis: 0.0 ± 0.0
3.61AspIle: 3.61 ± 1.127
1.805AspLys: 1.805 ± 1.253
7.22AspLeu: 7.22 ± 2.793
0.0AspMet: 0.0 ± 0.0
1.805AspAsn: 1.805 ± 1.147
2.708AspPro: 2.708 ± 1.33
0.903AspGln: 0.903 ± 0.627
2.708AspArg: 2.708 ± 1.693
6.318AspSer: 6.318 ± 1.404
2.708AspThr: 2.708 ± 2.041
5.415AspVal: 5.415 ± 1.997
1.805AspTrp: 1.805 ± 1.079
2.708AspTyr: 2.708 ± 1.301
0.0AspXaa: 0.0 ± 0.0
Glu
3.61GluAla: 3.61 ± 1.191
0.0GluCys: 0.0 ± 0.0
1.805GluAsp: 1.805 ± 1.253
5.415GluGlu: 5.415 ± 2.884
2.708GluPhe: 2.708 ± 1.509
5.415GluGly: 5.415 ± 1.533
0.0GluHis: 0.0 ± 0.0
3.61GluIle: 3.61 ± 3.224
1.805GluLys: 1.805 ± 1.253
5.415GluLeu: 5.415 ± 1.712
0.0GluMet: 0.0 ± 0.0
4.513GluAsn: 4.513 ± 2.109
2.708GluPro: 2.708 ± 1.116
1.805GluGln: 1.805 ± 1.634
0.0GluArg: 0.0 ± 0.0
1.805GluSer: 1.805 ± 1.513
0.903GluThr: 0.903 ± 1.154
0.903GluVal: 0.903 ± 1.296
0.903GluTrp: 0.903 ± 1.056
1.805GluTyr: 1.805 ± 1.253
0.0GluXaa: 0.0 ± 0.0
Phe
0.0PheAla: 0.0 ± 0.0
0.903PheCys: 0.903 ± 0.817
3.61PheAsp: 3.61 ± 1.575
0.903PheGlu: 0.903 ± 0.627
3.61PhePhe: 3.61 ± 1.575
0.903PheGly: 0.903 ± 0.817
2.708PheHis: 2.708 ± 1.116
2.708PheIle: 2.708 ± 1.468
4.513PheLys: 4.513 ± 3.259
9.025PheLeu: 9.025 ± 1.761
1.805PheMet: 1.805 ± 1.253
3.61PheAsn: 3.61 ± 3.176
0.903PhePro: 0.903 ± 1.154
3.61PheGln: 3.61 ± 1.147
2.708PheArg: 2.708 ± 1.854
3.61PheSer: 3.61 ± 1.634
1.805PheThr: 1.805 ± 1.147
0.903PheVal: 0.903 ± 0.627
0.0PheTrp: 0.0 ± 0.0
1.805PheTyr: 1.805 ± 1.147
0.0PheXaa: 0.0 ± 0.0
Gly
1.805GlyAla: 1.805 ± 1.253
2.708GlyCys: 2.708 ± 1.622
3.61GlyAsp: 3.61 ± 1.859
2.708GlyGlu: 2.708 ± 1.468
1.805GlyPhe: 1.805 ± 1.513
3.61GlyGly: 3.61 ± 1.316
2.708GlyHis: 2.708 ± 1.301
5.415GlyIle: 5.415 ± 1.904
4.513GlyLys: 4.513 ± 1.887
1.805GlyLeu: 1.805 ± 1.079
0.903GlyMet: 0.903 ± 0.817
2.708GlyAsn: 2.708 ± 2.787
3.61GlyPro: 3.61 ± 1.575
3.61GlyGln: 3.61 ± 1.316
1.805GlyArg: 1.805 ± 1.259
1.805GlySer: 1.805 ± 1.634
0.0GlyThr: 0.0 ± 0.0
2.708GlyVal: 2.708 ± 2.151
0.0GlyTrp: 0.0 ± 0.0
0.0GlyTyr: 0.0 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
1.805HisAla: 1.805 ± 1.634
2.708HisCys: 2.708 ± 2.341
0.903HisAsp: 0.903 ± 1.154
2.708HisGlu: 2.708 ± 1.369
2.708HisPhe: 2.708 ± 1.468
2.708HisGly: 2.708 ± 2.04
1.805HisHis: 1.805 ± 1.464
1.805HisIle: 1.805 ± 1.662
1.805HisLys: 1.805 ± 1.478
2.708HisLeu: 2.708 ± 1.88
0.0HisMet: 0.0 ± 0.0
2.708HisAsn: 2.708 ± 1.468
2.708HisPro: 2.708 ± 1.524
1.805HisGln: 1.805 ± 0.787
2.708HisArg: 2.708 ± 2.047
3.61HisSer: 3.61 ± 1.41
2.708HisThr: 2.708 ± 2.452
3.61HisVal: 3.61 ± 1.127
0.0HisTrp: 0.0 ± 0.0
0.903HisTyr: 0.903 ± 0.627
0.0HisXaa: 0.0 ± 0.0
Ile
0.903IleAla: 0.903 ± 1.056
0.903IleCys: 0.903 ± 1.154
1.805IleAsp: 1.805 ± 1.079
1.805IleGlu: 1.805 ± 1.076
2.708IlePhe: 2.708 ± 1.413
0.903IleGly: 0.903 ± 1.056
0.903IleHis: 0.903 ± 1.103
2.708IleIle: 2.708 ± 2.151
9.025IleLys: 9.025 ± 1.761
3.61IleLeu: 3.61 ± 1.354
0.903IleMet: 0.903 ± 1.431
4.513IleAsn: 4.513 ± 1.862
1.805IlePro: 1.805 ± 1.079
5.415IleGln: 5.415 ± 2.325
6.318IleArg: 6.318 ± 2.289
3.61IleSer: 3.61 ± 2.84
3.61IleThr: 3.61 ± 1.196
2.708IleVal: 2.708 ± 1.165
2.708IleTrp: 2.708 ± 1.005
4.513IleTyr: 4.513 ± 2.795
0.0IleXaa: 0.0 ± 0.0
Lys
4.513LysAla: 4.513 ± 2.117
0.903LysCys: 0.903 ± 1.103
1.805LysAsp: 1.805 ± 1.259
8.123LysGlu: 8.123 ± 2.232
3.61LysPhe: 3.61 ± 1.196
3.61LysGly: 3.61 ± 1.316
2.708LysHis: 2.708 ± 0.96
5.415LysIle: 5.415 ± 1.997
1.805LysLys: 1.805 ± 1.147
0.0LysLeu: 0.0 ± 0.0
0.903LysMet: 0.903 ± 0.627
3.61LysAsn: 3.61 ± 1.697
3.61LysPro: 3.61 ± 1.528
1.805LysGln: 1.805 ± 1.364
4.513LysArg: 4.513 ± 2.109
6.318LysSer: 6.318 ± 2.021
1.805LysThr: 1.805 ± 1.079
5.415LysVal: 5.415 ± 1.957
0.0LysTrp: 0.0 ± 0.0
2.708LysTyr: 2.708 ± 1.116
0.0LysXaa: 0.0 ± 0.0
Leu
2.708LeuAla: 2.708 ± 1.33
1.805LeuCys: 1.805 ± 1.253
3.61LeuAsp: 3.61 ± 1.546
4.513LeuGlu: 4.513 ± 2.313
0.903LeuPhe: 0.903 ± 0.627
5.415LeuGly: 5.415 ± 1.87
3.61LeuHis: 3.61 ± 1.902
5.415LeuIle: 5.415 ± 2.339
5.415LeuLys: 5.415 ± 2.083
3.61LeuLeu: 3.61 ± 1.886
0.903LeuMet: 0.903 ± 1.296
5.415LeuAsn: 5.415 ± 1.498
1.805LeuPro: 1.805 ± 1.079
5.415LeuGln: 5.415 ± 2.099
7.22LeuArg: 7.22 ± 5.412
1.805LeuSer: 1.805 ± 1.253
2.708LeuThr: 2.708 ± 1.524
4.513LeuVal: 4.513 ± 1.429
0.0LeuTrp: 0.0 ± 0.0
4.513LeuTyr: 4.513 ± 1.984
0.0LeuXaa: 0.0 ± 0.0
Met
0.903MetAla: 0.903 ± 0.817
0.0MetCys: 0.0 ± 0.0
4.513MetAsp: 4.513 ± 1.478
0.903MetGlu: 0.903 ± 1.296
2.708MetPhe: 2.708 ± 1.688
1.805MetGly: 1.805 ± 1.259
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
0.903MetLys: 0.903 ± 0.817
0.903MetLeu: 0.903 ± 1.154
0.0MetMet: 0.0 ± 0.0
0.903MetAsn: 0.903 ± 1.103
2.708MetPro: 2.708 ± 1.509
0.903MetGln: 0.903 ± 1.056
0.903MetArg: 0.903 ± 0.817
2.708MetSer: 2.708 ± 1.217
0.903MetThr: 0.903 ± 0.627
0.0MetVal: 0.0 ± 0.0
1.805MetTrp: 1.805 ± 1.076
1.805MetTyr: 1.805 ± 1.634
0.0MetXaa: 0.0 ± 0.0
Asn
6.318AsnAla: 6.318 ± 2.685
1.805AsnCys: 1.805 ± 2.112
2.708AsnAsp: 2.708 ± 1.116
0.903AsnGlu: 0.903 ± 0.817
3.61AsnPhe: 3.61 ± 1.354
0.0AsnGly: 0.0 ± 0.0
7.22AsnHis: 7.22 ± 3.041
3.61AsnIle: 3.61 ± 1.196
0.0AsnLys: 0.0 ± 0.0
2.708AsnLeu: 2.708 ± 1.369
2.708AsnMet: 2.708 ± 1.608
0.0AsnAsn: 0.0 ± 0.0
4.513AsnPro: 4.513 ± 1.095
3.61AsnGln: 3.61 ± 1.196
0.0AsnArg: 0.0 ± 0.0
3.61AsnSer: 3.61 ± 2.712
3.61AsnThr: 3.61 ± 1.41
2.708AsnVal: 2.708 ± 1.33
0.0AsnTrp: 0.0 ± 0.0
1.805AsnTyr: 1.805 ± 1.253
0.0AsnXaa: 0.0 ± 0.0
Pro
0.0ProAla: 0.0 ± 0.0
1.805ProCys: 1.805 ± 1.364
3.61ProAsp: 3.61 ± 1.18
1.805ProGlu: 1.805 ± 1.259
1.805ProPhe: 1.805 ± 1.172
3.61ProGly: 3.61 ± 1.647
4.513ProHis: 4.513 ± 2.313
5.415ProIle: 5.415 ± 4.233
5.415ProLys: 5.415 ± 1.635
6.318ProLeu: 6.318 ± 2.204
3.61ProMet: 3.61 ± 1.595
2.708ProAsn: 2.708 ± 1.33
1.805ProPro: 1.805 ± 1.079
3.61ProGln: 3.61 ± 2.165
1.805ProArg: 1.805 ± 1.076
4.513ProSer: 4.513 ± 3.146
3.61ProThr: 3.61 ± 1.697
2.708ProVal: 2.708 ± 1.93
0.903ProTrp: 0.903 ± 0.627
1.805ProTyr: 1.805 ± 0.787
0.0ProXaa: 0.0 ± 0.0
Gln
2.708GlnAla: 2.708 ± 2.391
1.805GlnCys: 1.805 ± 1.253
3.61GlnAsp: 3.61 ± 1.455
0.903GlnGlu: 0.903 ± 0.817
0.903GlnPhe: 0.903 ± 0.627
2.708GlnGly: 2.708 ± 1.413
2.708GlnHis: 2.708 ± 1.524
5.415GlnIle: 5.415 ± 1.96
0.903GlnLys: 0.903 ± 1.154
2.708GlnLeu: 2.708 ± 2.041
0.903GlnMet: 0.903 ± 1.056
2.708GlnAsn: 2.708 ± 1.484
3.61GlnPro: 3.61 ± 3.026
0.0GlnGln: 0.0 ± 0.0
2.708GlnArg: 2.708 ± 1.165
3.61GlnSer: 3.61 ± 1.059
4.513GlnThr: 4.513 ± 2.154
6.318GlnVal: 6.318 ± 2.077
0.0GlnTrp: 0.0 ± 0.0
0.903GlnTyr: 0.903 ± 0.627
0.0GlnXaa: 0.0 ± 0.0
Arg
4.513ArgAla: 4.513 ± 1.852
1.805ArgCys: 1.805 ± 1.364
5.415ArgAsp: 5.415 ± 2.906
1.805ArgGlu: 1.805 ± 1.259
6.318ArgPhe: 6.318 ± 1.679
2.708ArgGly: 2.708 ± 0.96
1.805ArgHis: 1.805 ± 1.513
3.61ArgIle: 3.61 ± 2.293
4.513ArgLys: 4.513 ± 1.949
2.708ArgLeu: 2.708 ± 1.484
1.805ArgMet: 1.805 ± 1.364
0.0ArgAsn: 0.0 ± 0.0
7.22ArgPro: 7.22 ± 2.26
1.805ArgGln: 1.805 ± 1.076
6.318ArgArg: 6.318 ± 3.994
3.61ArgSer: 3.61 ± 1.938
2.708ArgThr: 2.708 ± 1.564
2.708ArgVal: 2.708 ± 1.005
0.0ArgTrp: 0.0 ± 0.0
1.805ArgTyr: 1.805 ± 1.478
0.0ArgXaa: 0.0 ± 0.0
Ser
5.415SerAla: 5.415 ± 1.578
0.903SerCys: 0.903 ± 0.627
2.708SerAsp: 2.708 ± 0.96
1.805SerGlu: 1.805 ± 1.253
3.61SerPhe: 3.61 ± 1.902
2.708SerGly: 2.708 ± 1.693
1.805SerHis: 1.805 ± 1.147
6.318SerIle: 6.318 ± 2.272
5.415SerLys: 5.415 ± 1.635
1.805SerLeu: 1.805 ± 1.137
0.903SerMet: 0.903 ± 1.296
7.22SerAsn: 7.22 ± 1.808
4.513SerPro: 4.513 ± 2.02
3.61SerGln: 3.61 ± 3.065
8.123SerArg: 8.123 ± 3.08
9.025SerSer: 9.025 ± 4.296
6.318SerThr: 6.318 ± 2.893
2.708SerVal: 2.708 ± 1.484
0.0SerTrp: 0.0 ± 0.0
3.61SerTyr: 3.61 ± 1.147
0.0SerXaa: 0.0 ± 0.0
Thr
2.708ThrAla: 2.708 ± 1.693
0.0ThrCys: 0.0 ± 0.0
0.903ThrAsp: 0.903 ± 0.627
1.805ThrGlu: 1.805 ± 1.172
1.805ThrPhe: 1.805 ± 1.259
3.61ThrGly: 3.61 ± 1.958
3.61ThrHis: 3.61 ± 1.42
0.903ThrIle: 0.903 ± 0.627
3.61ThrLys: 3.61 ± 1.902
4.513ThrLeu: 4.513 ± 1.061
2.708ThrMet: 2.708 ± 1.468
4.513ThrAsn: 4.513 ± 1.579
4.513ThrPro: 4.513 ± 1.872
0.903ThrGln: 0.903 ± 1.103
2.708ThrArg: 2.708 ± 1.116
5.415ThrSer: 5.415 ± 0.859
0.0ThrThr: 0.0 ± 0.0
3.61ThrVal: 3.61 ± 2.293
0.903ThrTrp: 0.903 ± 1.296
5.415ThrTyr: 5.415 ± 2.609
0.0ThrXaa: 0.0 ± 0.0
Val
0.0ValAla: 0.0 ± 0.0
0.903ValCys: 0.903 ± 0.627
2.708ValAsp: 2.708 ± 1.468
0.903ValGlu: 0.903 ± 1.154
3.61ValPhe: 3.61 ± 1.958
1.805ValGly: 1.805 ± 1.634
2.708ValHis: 2.708 ± 2.013
2.708ValIle: 2.708 ± 1.301
5.415ValLys: 5.415 ± 2.154
4.513ValLeu: 4.513 ± 3.259
2.708ValMet: 2.708 ± 1.477
0.903ValAsn: 0.903 ± 1.103
4.513ValPro: 4.513 ± 1.55
3.61ValGln: 3.61 ± 2.252
2.708ValArg: 2.708 ± 2.452
7.22ValSer: 7.22 ± 2.329
3.61ValThr: 3.61 ± 1.436
2.708ValVal: 2.708 ± 1.693
0.903ValTrp: 0.903 ± 1.103
4.513ValTyr: 4.513 ± 1.541
0.0ValXaa: 0.0 ± 0.0
Trp
1.805TrpAla: 1.805 ± 1.253
0.0TrpCys: 0.0 ± 0.0
0.903TrpAsp: 0.903 ± 1.154
0.903TrpGlu: 0.903 ± 1.103
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
0.0TrpLeu: 0.0 ± 0.0
0.903TrpMet: 0.903 ± 0.817
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.903TrpGln: 0.903 ± 0.627
1.805TrpArg: 1.805 ± 1.079
0.903TrpSer: 0.903 ± 1.056
1.805TrpThr: 1.805 ± 1.172
0.903TrpVal: 0.903 ± 0.627
0.0TrpTrp: 0.0 ± 0.0
1.805TrpTyr: 1.805 ± 1.259
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.805TyrAla: 1.805 ± 0.787
0.0TyrCys: 0.0 ± 0.0
1.805TyrAsp: 1.805 ± 1.172
2.708TyrGlu: 2.708 ± 1.93
4.513TyrPhe: 4.513 ± 1.478
2.708TyrGly: 2.708 ± 1.165
1.805TyrHis: 1.805 ± 1.137
1.805TyrIle: 1.805 ± 1.253
1.805TyrLys: 1.805 ± 0.787
6.318TyrLeu: 6.318 ± 2.184
1.805TyrMet: 1.805 ± 1.089
1.805TyrAsn: 1.805 ± 0.787
3.61TyrPro: 3.61 ± 1.938
0.903TyrGln: 0.903 ± 1.154
2.708TyrArg: 2.708 ± 2.452
1.805TyrSer: 1.805 ± 1.259
2.708TyrThr: 2.708 ± 1.33
2.708TyrVal: 2.708 ± 1.301
0.0TyrTrp: 0.0 ± 0.0
0.903TyrTyr: 0.903 ± 1.056
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1109 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
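
To make "bootstrapping at the protein level" concrete, here is a minimal Python sketch: whole proteins are resampled with replacement, the frequency of one dipeptide is recomputed for each resample, and the spread of those estimates is reported. The function names, the fixed seed, and the use of the standard deviation as the error measure are assumptions for this example; the page only states that 100 bootstrap rounds were used.

```python
import random
import statistics

def dipeptide_permille_single(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide over overlapping pairs within proteins."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, dipeptide, rounds=100, seed=0):
    """Spread of the frequency when whole proteins are resampled with replacement."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(dipeptide_permille_single(sample, dipeptide))
    # Standard deviation of the bootstrap estimates (assumed error measure).
    return statistics.pstdev(estimates)

# Toy sequences (hypothetical, not taken from this proteome).
err_constant = bootstrap_error(["AAAA", "AAAA"], "AA")
err_absent = bootstrap_error(["AAAA", "CCCC"], "GG")
err_mixed = bootstrap_error(["AAAA", "CCCC"], "AA")
```

A dipeptide that never occurs gets an error of zero under this scheme, which is consistent with the "0.0 ± 0.0" entries in the table.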

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski