Amino acid dipeptide frequency for Catharanthus yellow mosaic virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
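The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used by the database: the function names, the one-letter sequence input, the choice to count pairs only within each protein, and the simple above/below comparison against the uniform expectation are all assumptions made for the example.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"
# Uniform expectation for 400 equally likely dipeptides: 0.25% = 2.5 per mille.
UNIFORM_PER_MILLE = 1000 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all given sequences."""
    counts = Counter()
    for seq in proteins:
        # Assumption: any residue outside the 20 standard ones becomes X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Count overlapping adjacent pairs, never spanning two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000 * n / total for pair, n in counts.items()}


def classify(freq_per_mille):
    """Label a frequency relative to the uniform expectation (hypothetical rule)."""
    if freq_per_mille > UNIFORM_PER_MILLE:
        return "over"
    if freq_per_mille < UNIFORM_PER_MILLE:
        return "under"
    return "expected"


# Toy input, not real viral proteins.
freqs = dipeptide_per_mille(["MSSRA", "MSSL"])
```

With this rule, a table value such as 8.318‰ would be classified as overrepresented and 0.924‰ as underrepresented; the page itself does not state the exact threshold behind its colouring.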

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
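As a small worked example of the unit conversion, the following sketch turns a per mille value into a fraction and into a percentage. The helper names are hypothetical and exist only for this illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10


# 8.318 is a per mille value of the kind listed in the table.
fraction = per_mille_to_fraction(8.318)
percent = per_mille_to_percent(8.318)
```

Here 8.318‰ corresponds to a fraction of about 0.008318, or roughly 0.83%.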

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
8.318AlaAla: 8.318 ± 1.886
0.924AlaCys: 0.924 ± 0.733
2.773AlaAsp: 2.773 ± 0.975
2.773AlaGlu: 2.773 ± 1.174
1.848AlaPhe: 1.848 ± 1.498
1.848AlaGly: 1.848 ± 0.729
1.848AlaHis: 1.848 ± 1.353
1.848AlaIle: 1.848 ± 1.065
2.773AlaLys: 2.773 ± 1.315
7.394AlaLeu: 7.394 ± 2.753
0.0AlaMet: 0.0 ± 0.0
1.848AlaAsn: 1.848 ± 0.729
0.924AlaPro: 0.924 ± 0.733
2.773AlaGln: 2.773 ± 1.098
5.545AlaArg: 5.545 ± 2.157
3.697AlaSer: 3.697 ± 2.931
5.545AlaThr: 5.545 ± 1.634
0.924AlaVal: 0.924 ± 0.901
2.773AlaTrp: 2.773 ± 1.142
0.0AlaTyr: 0.0 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
1.848CysCys: 1.848 ± 1.802
0.0CysAsp: 0.0 ± 0.0
1.848CysGlu: 1.848 ± 1.058
0.0CysPhe: 0.0 ± 0.0
1.848CysGly: 1.848 ± 1.065
0.924CysHis: 0.924 ± 1.146
1.848CysIle: 1.848 ± 1.17
0.924CysLys: 0.924 ± 0.733
0.0CysLeu: 0.0 ± 0.0
0.924CysMet: 0.924 ± 1.037
1.848CysAsn: 1.848 ± 1.247
3.697CysPro: 3.697 ± 2.035
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
2.773CysSer: 2.773 ± 2.341
0.924CysThr: 0.924 ± 0.733
1.848CysVal: 1.848 ± 1.466
0.0CysTrp: 0.0 ± 0.0
0.924CysTyr: 0.924 ± 0.848
0.0CysXaa: 0.0 ± 0.0
Asp
0.924AspAla: 0.924 ± 0.623
0.0AspCys: 0.0 ± 0.0
0.924AspAsp: 0.924 ± 0.623
1.848AspGlu: 1.848 ± 0.729
1.848AspPhe: 1.848 ± 0.729
2.773AspGly: 2.773 ± 1.87
0.0AspHis: 0.0 ± 0.0
4.621AspIle: 4.621 ± 2.183
1.848AspLys: 1.848 ± 0.729
4.621AspLeu: 4.621 ± 1.858
0.0AspMet: 0.0 ± 0.0
2.773AspAsn: 2.773 ± 2.146
2.773AspPro: 2.773 ± 1.219
0.924AspGln: 0.924 ± 0.623
2.773AspArg: 2.773 ± 1.323
5.545AspSer: 5.545 ± 1.74
1.848AspThr: 1.848 ± 1.802
8.318AspVal: 8.318 ± 2.148
2.773AspTrp: 2.773 ± 1.315
0.924AspTyr: 0.924 ± 0.623
0.0AspXaa: 0.0 ± 0.0
Glu
3.697GluAla: 3.697 ± 1.248
0.0GluCys: 0.0 ± 0.0
1.848GluAsp: 1.848 ± 1.455
4.621GluGlu: 4.621 ± 1.827
2.773GluPhe: 2.773 ± 1.427
5.545GluGly: 5.545 ± 1.699
0.924GluHis: 0.924 ± 0.848
0.924GluIle: 0.924 ± 0.848
3.697GluLys: 3.697 ± 2.493
3.697GluLeu: 3.697 ± 1.443
0.0GluMet: 0.0 ± 0.0
5.545GluAsn: 5.545 ± 2.236
2.773GluPro: 2.773 ± 1.243
3.697GluGln: 3.697 ± 1.73
0.0GluArg: 0.0 ± 0.0
1.848GluSer: 1.848 ± 1.042
2.773GluThr: 2.773 ± 1.524
3.697GluVal: 3.697 ± 1.108
1.848GluTrp: 1.848 ± 1.065
0.924GluTyr: 0.924 ± 0.623
0.0GluXaa: 0.0 ± 0.0
Phe
0.0PheAla: 0.0 ± 0.0
0.924PheCys: 0.924 ± 0.733
3.697PheAsp: 3.697 ± 1.459
0.924PheGlu: 0.924 ± 0.733
0.924PhePhe: 0.924 ± 0.733
0.924PheGly: 0.924 ± 0.733
1.848PheHis: 1.848 ± 1.018
3.697PheIle: 3.697 ± 1.689
3.697PheLys: 3.697 ± 1.624
4.621PheLeu: 4.621 ± 0.866
0.924PheMet: 0.924 ± 0.623
2.773PheAsn: 2.773 ± 1.753
0.924PhePro: 0.924 ± 0.901
3.697PheGln: 3.697 ± 2.493
3.697PheArg: 3.697 ± 1.164
0.924PheSer: 0.924 ± 0.623
2.773PheThr: 2.773 ± 2.123
0.0PheVal: 0.0 ± 0.0
0.0PheTrp: 0.0 ± 0.0
0.924PheTyr: 0.924 ± 0.733
0.0PheXaa: 0.0 ± 0.0
Gly
1.848GlyAla: 1.848 ± 1.247
1.848GlyCys: 1.848 ± 1.121
3.697GlyAsp: 3.697 ± 2.129
3.697GlyGlu: 3.697 ± 0.975
1.848GlyPhe: 1.848 ± 1.353
2.773GlyGly: 2.773 ± 1.142
1.848GlyHis: 1.848 ± 0.812
3.697GlyIle: 3.697 ± 0.997
7.394GlyLys: 7.394 ± 2.601
3.697GlyLeu: 3.697 ± 1.744
1.848GlyMet: 1.848 ± 1.802
0.0GlyAsn: 0.0 ± 0.0
6.47GlyPro: 6.47 ± 2.167
0.924GlyGln: 0.924 ± 0.623
0.924GlyArg: 0.924 ± 0.623
2.773GlySer: 2.773 ± 1.87
1.848GlyThr: 1.848 ± 1.196
2.773GlyVal: 2.773 ± 1.853
0.0GlyTrp: 0.0 ± 0.0
0.924GlyTyr: 0.924 ± 0.901
0.0GlyXaa: 0.0 ± 0.0
His
3.697HisAla: 3.697 ± 1.392
1.848HisCys: 1.848 ± 1.353
2.773HisAsp: 2.773 ± 1.753
1.848HisGlu: 1.848 ± 1.353
3.697HisPhe: 3.697 ± 1.326
1.848HisGly: 1.848 ± 1.353
2.773HisHis: 2.773 ± 2.529
2.773HisIle: 2.773 ± 0.826
0.924HisLys: 0.924 ± 0.901
0.924HisLeu: 0.924 ± 0.623
0.0HisMet: 0.0 ± 0.0
3.697HisAsn: 3.697 ± 1.328
2.773HisPro: 2.773 ± 1.174
3.697HisGln: 3.697 ± 2.129
3.697HisArg: 3.697 ± 2.242
1.848HisSer: 1.848 ± 1.498
2.773HisThr: 2.773 ± 1.59
0.924HisVal: 0.924 ± 0.848
0.0HisTrp: 0.0 ± 0.0
1.848HisTyr: 1.848 ± 1.065
0.0HisXaa: 0.0 ± 0.0
Ile
0.0IleAla: 0.0 ± 0.0
0.924IleCys: 0.924 ± 0.901
2.773IleAsp: 2.773 ± 1.87
2.773IleGlu: 2.773 ± 1.174
3.697IlePhe: 3.697 ± 2.493
1.848IleGly: 1.848 ± 1.466
2.773IleHis: 2.773 ± 1.271
3.697IleIle: 3.697 ± 1.296
4.621IleLys: 4.621 ± 1.656
0.924IleLeu: 0.924 ± 0.86
0.924IleMet: 0.924 ± 0.848
3.697IleAsn: 3.697 ± 2.9
1.848IlePro: 1.848 ± 0.969
2.773IleGln: 2.773 ± 0.796
7.394IleArg: 7.394 ± 1.301
6.47IleSer: 6.47 ± 1.321
2.773IleThr: 2.773 ± 1.853
1.848IleVal: 1.848 ± 0.729
1.848IleTrp: 1.848 ± 1.042
1.848IleTyr: 1.848 ± 1.042
0.0IleXaa: 0.0 ± 0.0
Lys
5.545LysAla: 5.545 ± 0.78
2.773LysCys: 2.773 ± 1.174
1.848LysAsp: 1.848 ± 1.247
3.697LysGlu: 3.697 ± 1.689
1.848LysPhe: 1.848 ± 1.042
3.697LysGly: 3.697 ± 2.035
1.848LysHis: 1.848 ± 1.247
2.773LysIle: 2.773 ± 1.59
0.924LysLys: 0.924 ± 0.733
1.848LysLeu: 1.848 ± 1.247
0.0LysMet: 0.0 ± 0.0
5.545LysAsn: 5.545 ± 2.284
2.773LysPro: 2.773 ± 1.323
0.0LysGln: 0.0 ± 0.0
4.621LysArg: 4.621 ± 2.414
6.47LysSer: 6.47 ± 1.734
2.773LysThr: 2.773 ± 0.796
5.545LysVal: 5.545 ± 1.847
0.924LysTrp: 0.924 ± 0.733
2.773LysTyr: 2.773 ± 0.975
0.0LysXaa: 0.0 ± 0.0
Leu
1.848LeuAla: 1.848 ± 1.018
2.773LeuCys: 2.773 ± 1.384
3.697LeuAsp: 3.697 ± 1.695
5.545LeuGlu: 5.545 ± 1.69
0.924LeuPhe: 0.924 ± 0.623
6.47LeuGly: 6.47 ± 3.056
1.848LeuHis: 1.848 ± 0.812
2.773LeuIle: 2.773 ± 1.818
6.47LeuLys: 6.47 ± 2.482
3.697LeuLeu: 3.697 ± 2.276
4.621LeuMet: 4.621 ± 2.737
4.621LeuAsn: 4.621 ± 1.203
1.848LeuPro: 1.848 ± 1.247
2.773LeuGln: 2.773 ± 1.524
6.47LeuArg: 6.47 ± 3.095
1.848LeuSer: 1.848 ± 0.969
2.773LeuThr: 2.773 ± 1.007
3.697LeuVal: 3.697 ± 2.117
0.0LeuTrp: 0.0 ± 0.0
4.621LeuTyr: 4.621 ± 1.746
0.0LeuXaa: 0.0 ± 0.0
Met
0.924MetAla: 0.924 ± 0.733
0.924MetCys: 0.924 ± 0.733
1.848MetAsp: 1.848 ± 1.042
0.924MetGlu: 0.924 ± 0.86
1.848MetPhe: 1.848 ± 1.466
2.773MetGly: 2.773 ± 1.128
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
0.924MetLys: 0.924 ± 0.848
3.697MetLeu: 3.697 ± 1.058
1.848MetMet: 1.848 ± 1.047
1.848MetAsn: 1.848 ± 1.042
1.848MetPro: 1.848 ± 0.969
0.924MetGln: 0.924 ± 1.146
0.0MetArg: 0.0 ± 0.0
2.773MetSer: 2.773 ± 1.833
0.0MetThr: 0.0 ± 0.0
0.0MetVal: 0.0 ± 0.0
1.848MetTrp: 1.848 ± 1.018
1.848MetTyr: 1.848 ± 1.466
0.0MetXaa: 0.0 ± 0.0
Asn
2.773AsnAla: 2.773 ± 1.142
2.773AsnCys: 2.773 ± 2.529
0.924AsnAsp: 0.924 ± 0.623
3.697AsnGlu: 3.697 ± 0.898
1.848AsnPhe: 1.848 ± 0.729
0.924AsnGly: 0.924 ± 0.848
6.47AsnHis: 6.47 ± 2.848
4.621AsnIle: 4.621 ± 1.431
0.0AsnLys: 0.0 ± 0.0
2.773AsnLeu: 2.773 ± 1.174
2.773AsnMet: 2.773 ± 1.487
2.773AsnAsn: 2.773 ± 0.796
4.621AsnPro: 4.621 ± 1.077
1.848AsnGln: 1.848 ± 0.729
3.697AsnArg: 3.697 ± 1.715
3.697AsnSer: 3.697 ± 1.263
0.924AsnThr: 0.924 ± 0.623
3.697AsnVal: 3.697 ± 0.978
0.0AsnTrp: 0.0 ± 0.0
2.773AsnTyr: 2.773 ± 0.975
0.0AsnXaa: 0.0 ± 0.0
Pro
2.773ProAla: 2.773 ± 1.659
1.848ProCys: 1.848 ± 1.058
2.773ProAsp: 2.773 ± 1.216
2.773ProGlu: 2.773 ± 1.219
0.924ProPhe: 0.924 ± 0.623
0.924ProGly: 0.924 ± 0.623
3.697ProHis: 3.697 ± 1.385
4.621ProIle: 4.621 ± 1.019
3.697ProLys: 3.697 ± 2.493
6.47ProLeu: 6.47 ± 1.88
0.924ProMet: 0.924 ± 0.733
1.848ProAsn: 1.848 ± 1.247
1.848ProPro: 1.848 ± 1.018
4.621ProGln: 4.621 ± 2.506
5.545ProArg: 5.545 ± 0.919
6.47ProSer: 6.47 ± 2.315
2.773ProThr: 2.773 ± 1.384
4.621ProVal: 4.621 ± 1.056
0.0ProTrp: 0.0 ± 0.0
1.848ProTyr: 1.848 ± 0.729
0.0ProXaa: 0.0 ± 0.0
Gln
3.697GlnAla: 3.697 ± 1.485
0.0GlnCys: 0.0 ± 0.0
4.621GlnAsp: 4.621 ± 1.598
2.773GlnGlu: 2.773 ± 0.826
4.621GlnPhe: 4.621 ± 2.268
2.773GlnGly: 2.773 ± 1.315
3.697GlnHis: 3.697 ± 3.253
1.848GlnIle: 1.848 ± 1.247
1.848GlnLys: 1.848 ± 1.802
2.773GlnLeu: 2.773 ± 1.774
0.924GlnMet: 0.924 ± 0.623
0.924GlnAsn: 0.924 ± 0.848
2.773GlnPro: 2.773 ± 2.341
3.697GlnGln: 3.697 ± 1.108
1.848GlnArg: 1.848 ± 0.969
3.697GlnSer: 3.697 ± 0.941
2.773GlnThr: 2.773 ± 1.058
4.621GlnVal: 4.621 ± 0.866
0.0GlnTrp: 0.0 ± 0.0
0.924GlnTyr: 0.924 ± 0.623
0.0GlnXaa: 0.0 ± 0.0
Arg
4.621ArgAla: 4.621 ± 2.189
1.848ArgCys: 1.848 ± 1.802
3.697ArgAsp: 3.697 ± 1.392
2.773ArgGlu: 2.773 ± 1.321
0.924ArgPhe: 0.924 ± 0.733
3.697ArgGly: 3.697 ± 1.143
4.621ArgHis: 4.621 ± 1.009
2.773ArgIle: 2.773 ± 0.796
3.697ArgLys: 3.697 ± 1.692
3.697ArgLeu: 3.697 ± 1.859
2.773ArgMet: 2.773 ± 1.653
1.848ArgAsn: 1.848 ± 1.018
5.545ArgPro: 5.545 ± 1.827
1.848ArgGln: 1.848 ± 1.113
6.47ArgArg: 6.47 ± 3.053
3.697ArgSer: 3.697 ± 1.263
4.621ArgThr: 4.621 ± 1.448
7.394ArgVal: 7.394 ± 2.529
0.0ArgTrp: 0.0 ± 0.0
2.773ArgTyr: 2.773 ± 1.253
0.0ArgXaa: 0.0 ± 0.0
Ser
3.697SerAla: 3.697 ± 2.493
1.848SerCys: 1.848 ± 1.247
3.697SerAsp: 3.697 ± 0.941
2.773SerGlu: 2.773 ± 1.834
3.697SerPhe: 3.697 ± 1.108
0.924SerGly: 0.924 ± 0.623
0.0SerHis: 0.0 ± 0.0
2.773SerIle: 2.773 ± 1.539
5.545SerLys: 5.545 ± 2.68
5.545SerLeu: 5.545 ± 1.683
1.848SerMet: 1.848 ± 1.72
3.697SerAsn: 3.697 ± 1.459
8.318SerPro: 8.318 ± 1.928
2.773SerGln: 2.773 ± 1.315
9.242SerArg: 9.242 ± 2.205
13.863SerSer: 13.863 ± 5.95
4.621SerThr: 4.621 ± 2.014
3.697SerVal: 3.697 ± 2.227
0.0SerTrp: 0.0 ± 0.0
2.773SerTyr: 2.773 ± 1.315
0.0SerXaa: 0.0 ± 0.0
Thr
2.773ThrAla: 2.773 ± 0.796
0.0ThrCys: 0.0 ± 0.0
0.0ThrAsp: 0.0 ± 0.0
1.848ThrGlu: 1.848 ± 1.042
0.924ThrPhe: 0.924 ± 0.623
5.545ThrGly: 5.545 ± 1.763
4.621ThrHis: 4.621 ± 1.056
1.848ThrIle: 1.848 ± 0.812
2.773ThrLys: 2.773 ± 1.323
0.924ThrLeu: 0.924 ± 0.733
0.924ThrMet: 0.924 ± 0.623
0.924ThrAsn: 0.924 ± 0.733
5.545ThrPro: 5.545 ± 1.658
5.545ThrGln: 5.545 ± 2.274
1.848ThrArg: 1.848 ± 1.042
5.545ThrSer: 5.545 ± 2.612
0.0ThrThr: 0.0 ± 0.0
4.621ThrVal: 4.621 ± 2.112
0.0ThrTrp: 0.0 ± 0.0
1.848ThrTyr: 1.848 ± 1.018
0.0ThrXaa: 0.0 ± 0.0
Val
1.848ValAla: 1.848 ± 1.018
0.0ValCys: 0.0 ± 0.0
2.773ValAsp: 2.773 ± 0.826
1.848ValGlu: 1.848 ± 1.018
1.848ValPhe: 1.848 ± 1.042
1.848ValGly: 1.848 ± 1.058
2.773ValHis: 2.773 ± 1.243
5.545ValIle: 5.545 ± 1.669
5.545ValLys: 5.545 ± 1.955
5.545ValLeu: 5.545 ± 2.89
1.848ValMet: 1.848 ± 1.466
3.697ValAsn: 3.697 ± 1.822
2.773ValPro: 2.773 ± 0.826
6.47ValGln: 6.47 ± 2.251
2.773ValArg: 2.773 ± 2.199
5.545ValSer: 5.545 ± 1.822
3.697ValThr: 3.697 ± 2.01
1.848ValVal: 1.848 ± 0.729
0.0ValTrp: 0.0 ± 0.0
6.47ValTyr: 6.47 ± 1.394
0.0ValXaa: 0.0 ± 0.0
Trp
4.621TrpAla: 4.621 ± 1.366
0.0TrpCys: 0.0 ± 0.0
0.924TrpAsp: 0.924 ± 0.901
0.924TrpGlu: 0.924 ± 0.848
0.0TrpPhe: 0.0 ± 0.0
0.924TrpGly: 0.924 ± 0.623
0.924TrpHis: 0.924 ± 0.733
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
0.0TrpLeu: 0.0 ± 0.0
0.924TrpMet: 0.924 ± 0.733
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.924TrpGln: 0.924 ± 0.623
0.924TrpArg: 0.924 ± 1.146
0.0TrpSer: 0.0 ± 0.0
0.0TrpThr: 0.0 ± 0.0
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.924TrpTyr: 0.924 ± 0.623
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.773TyrAla: 2.773 ± 2.199
0.0TyrCys: 0.0 ± 0.0
2.773TyrAsp: 2.773 ± 1.253
0.924TyrGlu: 0.924 ± 0.733
1.848TyrPhe: 1.848 ± 1.042
0.924TyrGly: 0.924 ± 0.623
0.924TyrHis: 0.924 ± 0.623
2.773TyrIle: 2.773 ± 1.174
0.924TyrLys: 0.924 ± 0.623
6.47TyrLeu: 6.47 ± 1.88
1.848TyrMet: 1.848 ± 1.07
3.697TyrAsn: 3.697 ± 1.248
0.924TyrPro: 0.924 ± 0.623
0.924TyrGln: 0.924 ± 0.733
1.848TyrArg: 1.848 ± 1.466
1.848TyrSer: 1.848 ± 1.018
1.848TyrThr: 1.848 ± 1.065
4.621TyrVal: 4.621 ± 1.096
0.0TyrTrp: 0.0 ± 0.0
0.924TyrTyr: 0.924 ± 1.146
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1083 amino acids)
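Because the data set is small, each per mille value corresponds to only a handful of observed dipeptides. The sketch below converts table values back into approximate counts. It assumes a denominator of 1082 pairs (one less than the 1083 amino acids reported above); this denominator is an inference that happens to reproduce whole-number counts for the tabulated values and is not stated by the source. The names are hypothetical.

```python
TOTAL_AMINO_ACIDS = 1083               # from the statistics line above
ASSUMED_PAIRS = TOTAL_AMINO_ACIDS - 1  # assumption, not stated by the source


def approx_count(per_mille, n_pairs=ASSUMED_PAIRS):
    """Approximate number of occurrences behind a per mille frequency."""
    return per_mille * 1e-3 * n_pairs


# A few values taken from the table: AlaCys, AlaAla, SerSer.
for value in (0.924, 8.318, 13.863):
    print(value, round(approx_count(value), 2))
```

Under this assumption the smallest non-zero value in the table, 0.924‰, corresponds to a single occurrence, which is why many entries carry an error of the same order as the value itself.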

Note: The error has been estimated by bootstrapping (×100) at the protein level.
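A protein-level bootstrap of this kind can be sketched as follows. The source only states that bootstrapping was done 100 times at the protein level; resampling whole proteins with replacement, using the sample standard deviation as the reported error, the fixed seed, and all function names are assumptions made for this illustration.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency when whole proteins are resampled."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy sequences, not the real proteome.
toy = ["MSSRA", "MSSL", "MKVLA"]
error_ss = bootstrap_error(toy, "SS")
```

With only 6 proteins in the real data set, each resample can differ considerably from the original, which is consistent with the relatively large errors shown in the table.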

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski