Amino acid dipeptide frequency for Tomato ringspot virus (isolate raspberry) (ToRSV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
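
To illustrate the calculation, here is a minimal Python sketch that counts overlapping pairs of adjacent residues in a set of protein sequences and reports them in per mille. The function name `dipeptide_frequencies` and the toy sequences are assumptions made for this example. The exact counting and normalization conventions used by Proteome-pI are not stated on this page, so this is not necessarily how the table values were produced.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies (per mille) for a list of one-letter sequences."""
    counts = Counter()
    for seq in proteins:
        # Slide a window of length 2 along each protein; pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Normalize by the total number of counted pairs and scale to per mille.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not the ToRSV proteome).
freqs = dipeptide_frequencies(["MAAL", "AAK"])
print(freqs["AA"])
```

The toy input contains five pairs (MA, AA, AL, AA, AK), so AA accounts for two out of five.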

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely in proteins, each would occur with a frequency of 1/400 = 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
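
The comparison against the uniform expectation can be sketched in Python as follows. The function names and the simple greater-than/less-than rule are assumptions for illustration. The page does not state whether the colouring uses the 400- or 441-pair baseline, or whether it takes the error estimates into account.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, expected):
    """Label a dipeptide relative to the uniform expectation."""
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


baseline = expected_per_mille()        # 20 standard amino acids -> 400 pairs
baseline_xaa = expected_per_mille(21)  # with Xaa -> 441 pairs

# Two values taken from the table: AlaAla and CysCys.
print(classify(15.202, baseline))
print(classify(0.671, baseline))
```

With the 20-letter alphabet the baseline is 2.5‰, so AlaAla (15.202‰) is overrepresented and CysCys (0.671‰) is underrepresented.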

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
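
As a worked illustration of the unit conversion, the short Python sketch below turns a per mille value into a fraction and into a percentage. The helper names are hypothetical and chosen only for this example.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value_per_mille / 10.0


# AlaAla from the table is listed as 15.202 per mille.
ala_ala_fraction = per_mille_to_fraction(15.202)
ala_ala_percent = per_mille_to_percent(15.202)
print(ala_ala_fraction, ala_ala_percent)
```

For AlaAla this gives a fraction of about 0.0152, or about 1.52%.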

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 15.202 ± 5.363
AlaCys: 2.236 ± 0.439
AlaAsp: 2.683 ± 0.412
AlaGlu: 5.366 ± 0.797
AlaPhe: 3.353 ± 1.541
AlaGly: 6.036 ± 1.199
AlaHis: 0.671 ± 0.173
AlaIle: 4.024 ± 0.839
AlaLys: 5.366 ± 0.515
AlaLeu: 6.483 ± 1.26
AlaMet: 2.906 ± 0.586
AlaAsn: 3.577 ± 0.998
AlaPro: 6.93 ± 0.898
AlaGln: 4.248 ± 0.953
AlaArg: 8.048 ± 1.183
AlaSer: 7.601 ± 3.232
AlaThr: 3.13 ± 0.445
AlaVal: 6.93 ± 0.847
AlaTrp: 0.0 ± 0.0
AlaTyr: 3.801 ± 0.398
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.236 ± 0.256
CysCys: 0.671 ± 0.906
CysAsp: 0.671 ± 0.173
CysGlu: 1.118 ± 0.225
CysPhe: 1.341 ± 0.798
CysGly: 2.012 ± 0.519
CysHis: 0.0 ± 0.0
CysIle: 0.894 ± 0.636
CysLys: 1.118 ± 0.551
CysLeu: 2.012 ± 1.512
CysMet: 0.894 ± 0.428
CysAsn: 0.0 ± 0.0
CysPro: 0.894 ± 0.833
CysGln: 0.894 ± 0.176
CysArg: 0.894 ± 0.318
CysSer: 2.236 ± 0.365
CysThr: 1.118 ± 1.533
CysVal: 1.118 ± 0.722
CysTrp: 0.224 ± 0.458
CysTyr: 0.447 ± 0.088
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.906 ± 1.163
AspCys: 0.894 ± 0.451
AspAsp: 2.683 ± 0.527
AspGlu: 3.353 ± 0.543
AspPhe: 1.565 ± 0.488
AspGly: 2.683 ± 0.692
AspHis: 0.671 ± 0.246
AspIle: 1.789 ± 0.563
AspLys: 1.341 ± 0.346
AspLeu: 3.801 ± 0.74
AspMet: 0.447 ± 0.088
AspAsn: 2.012 ± 0.519
AspPro: 2.236 ± 0.761
AspGln: 1.118 ± 0.613
AspArg: 3.577 ± 0.496
AspSer: 2.459 ± 0.33
AspThr: 2.459 ± 1.098
AspVal: 4.024 ± 1.293
AspTrp: 1.565 ± 0.348
AspTyr: 1.341 ± 0.626
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.93 ± 0.983
GluCys: 1.341 ± 0.492
GluAsp: 3.353 ± 1.71
GluGlu: 3.353 ± 1.23
GluPhe: 1.341 ± 0.264
GluGly: 6.036 ± 1.199
GluHis: 1.565 ± 0.488
GluIle: 2.012 ± 0.53
GluLys: 3.353 ± 1.494
GluLeu: 4.471 ± 1.078
GluMet: 1.118 ± 0.613
GluAsn: 0.671 ± 0.246
GluPro: 1.565 ± 0.398
GluGln: 2.683 ± 0.516
GluArg: 2.459 ± 1.66
GluSer: 2.459 ± 0.327
GluThr: 0.671 ± 0.173
GluVal: 4.695 ± 0.79
GluTrp: 2.012 ± 0.738
GluTyr: 1.341 ± 0.346
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.577 ± 0.484
PheCys: 0.447 ± 0.088
PheAsp: 2.236 ± 0.659
PheGlu: 3.13 ± 1.868
PhePhe: 2.683 ± 0.747
PheGly: 3.353 ± 0.621
PheHis: 0.447 ± 0.314
PheIle: 1.565 ± 0.294
PheLys: 3.13 ± 1.093
PheLeu: 4.918 ± 0.258
PheMet: 1.118 ± 0.225
PheAsn: 2.459 ± 0.453
PhePro: 2.012 ± 0.372
PheGln: 1.118 ± 0.319
PheArg: 2.236 ± 0.585
PheSer: 3.13 ± 1.969
PheThr: 1.789 ± 0.856
PheVal: 4.248 ± 1.136
PheTrp: 0.671 ± 0.516
PheTyr: 2.236 ± 0.918
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.048 ± 2.355
GlyCys: 1.789 ± 0.391
GlyAsp: 3.577 ± 1.274
GlyGlu: 2.459 ± 0.562
GlyPhe: 2.459 ± 1.1
GlyGly: 8.719 ± 6.82
GlyHis: 1.565 ± 0.984
GlyIle: 2.683 ± 0.431
GlyLys: 3.577 ± 1.032
GlyLeu: 4.471 ± 0.824
GlyMet: 1.789 ± 0.534
GlyAsn: 3.801 ± 0.706
GlyPro: 5.142 ± 1.424
GlyGln: 2.012 ± 0.312
GlyArg: 6.036 ± 0.92
GlySer: 3.801 ± 0.706
GlyThr: 4.248 ± 0.497
GlyVal: 4.024 ± 0.428
GlyTrp: 0.671 ± 0.516
GlyTyr: 1.565 ± 0.425
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.012 ± 0.789
HisCys: 0.224 ± 0.157
HisAsp: 0.894 ± 0.176
HisGlu: 0.224 ± 0.187
HisPhe: 0.894 ± 0.318
HisGly: 1.565 ± 0.984
HisHis: 0.224 ± 0.157
HisIle: 1.789 ± 0.408
HisLys: 1.341 ± 0.626
HisLeu: 1.565 ± 0.488
HisMet: 0.224 ± 0.157
HisAsn: 0.0 ± 0.0
HisPro: 0.671 ± 0.173
HisGln: 0.894 ± 0.176
HisArg: 1.341 ± 0.626
HisSer: 1.789 ± 0.902
HisThr: 0.671 ± 0.56
HisVal: 1.118 ± 0.471
HisTrp: 0.224 ± 0.157
HisTyr: 0.671 ± 0.173
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.13 ± 1.006
IleCys: 1.118 ± 0.749
IleAsp: 2.012 ± 0.48
IleGlu: 1.789 ± 0.637
IlePhe: 1.565 ± 0.398
IleGly: 2.683 ± 0.715
IleHis: 0.447 ± 0.088
IleIle: 1.789 ± 0.637
IleLys: 2.683 ± 0.955
IleLeu: 5.142 ± 1.229
IleMet: 0.447 ± 0.314
IleAsn: 1.565 ± 1.099
IlePro: 2.459 ± 0.562
IleGln: 0.894 ± 0.318
IleArg: 2.012 ± 0.519
IleSer: 4.471 ± 0.8
IleThr: 2.459 ± 0.59
IleVal: 3.13 ± 0.701
IleTrp: 0.0 ± 0.0
IleTyr: 1.341 ± 0.346
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.26 ± 0.785
LysCys: 0.447 ± 0.314
LysAsp: 2.459 ± 0.805
LysGlu: 2.683 ± 0.498
LysPhe: 1.341 ± 0.346
LysGly: 5.142 ± 0.572
LysHis: 0.894 ± 0.176
LysIle: 3.13 ± 0.976
LysLys: 2.459 ± 0.563
LysLeu: 3.801 ± 0.905
LysMet: 0.447 ± 0.314
LysAsn: 1.118 ± 0.471
LysPro: 2.459 ± 1.305
LysGln: 1.789 ± 0.351
LysArg: 2.906 ± 0.707
LysSer: 3.801 ± 0.589
LysThr: 2.236 ± 0.278
LysVal: 3.13 ± 0.59
LysTrp: 0.671 ± 0.471
LysTyr: 2.236 ± 0.466
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.837 ± 1.591
LeuCys: 1.789 ± 0.783
LeuAsp: 4.471 ± 0.824
LeuGlu: 6.483 ± 1.547
LeuPhe: 4.918 ± 1.243
LeuGly: 2.683 ± 0.412
LeuHis: 1.565 ± 0.398
LeuIle: 4.024 ± 1.293
LeuLys: 4.918 ± 0.811
LeuLeu: 10.955 ± 0.924
LeuMet: 1.789 ± 1.134
LeuAsn: 2.906 ± 0.648
LeuPro: 7.378 ± 3.013
LeuGln: 3.353 ± 0.601
LeuArg: 7.825 ± 1.192
LeuSer: 8.495 ± 2.366
LeuThr: 4.695 ± 0.628
LeuVal: 6.036 ± 1.556
LeuTrp: 0.671 ± 0.246
LeuTyr: 2.459 ± 0.562
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.236 ± 1.243
MetCys: 0.671 ± 0.51
MetAsp: 1.565 ± 0.398
MetGlu: 1.341 ± 0.346
MetPhe: 0.671 ± 0.471
MetGly: 1.341 ± 0.492
MetHis: 0.671 ± 0.471
MetIle: 0.224 ± 0.458
MetLys: 0.447 ± 0.088
MetLeu: 1.789 ± 0.514
MetMet: 0.671 ± 0.173
MetAsn: 0.894 ± 0.318
MetPro: 1.565 ± 0.348
MetGln: 1.118 ± 0.471
MetArg: 1.118 ± 0.737
MetSer: 2.012 ± 0.372
MetThr: 1.118 ± 0.471
MetVal: 1.565 ± 0.294
MetTrp: 0.224 ± 0.187
MetTyr: 0.447 ± 0.314
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.459 ± 0.562
AsnCys: 1.118 ± 0.331
AsnAsp: 1.118 ± 0.225
AsnGlu: 1.341 ± 0.626
AsnPhe: 2.906 ± 0.305
AsnGly: 2.236 ± 0.943
AsnHis: 0.894 ± 0.318
AsnIle: 1.565 ± 0.294
AsnLys: 0.894 ± 0.318
AsnLeu: 2.236 ± 0.719
AsnMet: 1.565 ± 0.294
AsnAsn: 0.894 ± 0.318
AsnPro: 1.341 ± 0.264
AsnGln: 0.671 ± 0.173
AsnArg: 1.789 ± 0.563
AsnSer: 2.236 ± 0.256
AsnThr: 1.118 ± 0.613
AsnVal: 2.683 ± 1.082
AsnTrp: 0.671 ± 0.56
AsnTyr: 1.118 ± 0.785
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.801 ± 0.648
ProCys: 1.118 ± 0.401
ProAsp: 2.459 ± 0.562
ProGlu: 1.789 ± 0.563
ProPhe: 3.13 ± 0.439
ProGly: 3.801 ± 0.709
ProHis: 0.894 ± 0.428
ProIle: 0.894 ± 1.054
ProLys: 2.236 ± 0.439
ProLeu: 7.601 ± 0.922
ProMet: 1.565 ± 0.425
ProAsn: 0.671 ± 0.173
ProPro: 9.613 ± 3.335
ProGln: 2.459 ± 0.453
ProArg: 2.683 ± 0.412
ProSer: 6.93 ± 0.812
ProThr: 3.13 ± 1.039
ProVal: 4.695 ± 1.054
ProTrp: 0.671 ± 0.173
ProTyr: 0.894 ± 0.428
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.577 ± 0.877
GlnCys: 0.0 ± 0.0
GlnAsp: 0.224 ± 0.157
GlnGlu: 3.801 ± 1.299
GlnPhe: 2.012 ± 0.312
GlnGly: 3.801 ± 1.119
GlnHis: 1.789 ± 0.391
GlnIle: 1.565 ± 0.488
GlnLys: 2.459 ± 0.805
GlnLeu: 2.236 ± 0.466
GlnMet: 0.671 ± 0.173
GlnAsn: 1.118 ± 0.319
GlnPro: 2.012 ± 0.372
GlnGln: 3.13 ± 0.795
GlnArg: 4.918 ± 1.082
GlnSer: 2.683 ± 0.746
GlnThr: 0.894 ± 0.176
GlnVal: 1.565 ± 0.534
GlnTrp: 1.118 ± 0.225
GlnTyr: 1.565 ± 0.294
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.378 ± 2.498
ArgCys: 1.789 ± 0.351
ArgAsp: 2.906 ± 0.329
ArgGlu: 3.353 ± 0.731
ArgPhe: 2.683 ± 0.764
ArgGly: 3.801 ± 0.61
ArgHis: 2.012 ± 1.12
ArgIle: 2.683 ± 0.449
ArgLys: 3.353 ± 0.911
ArgLeu: 8.495 ± 0.912
ArgMet: 0.894 ± 1.202
ArgAsn: 1.789 ± 0.408
ArgPro: 3.13 ± 0.735
ArgGln: 2.906 ± 0.771
ArgArg: 2.906 ± 1.864
ArgSer: 4.471 ± 1.326
ArgThr: 3.13 ± 0.576
ArgVal: 5.366 ± 0.514
ArgTrp: 0.671 ± 0.173
ArgTyr: 3.353 ± 0.93
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.26 ± 1.286
SerCys: 1.341 ± 1.242
SerAsp: 2.906 ± 0.536
SerGlu: 3.801 ± 0.928
SerPhe: 6.707 ± 1.468
SerGly: 4.918 ± 1.284
SerHis: 2.459 ± 0.901
SerIle: 4.248 ± 1.087
SerLys: 3.801 ± 0.695
SerLeu: 10.06 ± 1.046
SerMet: 2.236 ± 0.537
SerAsn: 2.236 ± 0.663
SerPro: 3.577 ± 0.57
SerGln: 2.459 ± 1.259
SerArg: 4.695 ± 2.484
SerSer: 7.825 ± 2.143
SerThr: 4.695 ± 1.528
SerVal: 4.024 ± 0.959
SerTrp: 1.789 ± 0.949
SerTyr: 1.118 ± 0.471
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.577 ± 1.047
ThrCys: 1.341 ± 0.492
ThrAsp: 1.118 ± 0.471
ThrGlu: 2.906 ± 0.881
ThrPhe: 2.012 ± 1.148
ThrGly: 2.683 ± 1.022
ThrHis: 0.447 ± 0.088
ThrIle: 2.236 ± 0.659
ThrLys: 1.789 ± 0.805
ThrLeu: 5.366 ± 1.419
ThrMet: 1.118 ± 0.319
ThrAsn: 0.894 ± 0.428
ThrPro: 1.341 ± 0.492
ThrGln: 2.906 ± 0.615
ThrArg: 2.459 ± 1.101
ThrSer: 4.471 ± 0.512
ThrThr: 3.13 ± 0.411
ThrVal: 3.13 ± 0.589
ThrTrp: 0.224 ± 0.187
ThrTyr: 0.671 ± 0.56
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.813 ± 1.903
ValCys: 1.341 ± 0.76
ValAsp: 3.577 ± 1.003
ValGlu: 4.024 ± 0.406
ValPhe: 2.459 ± 0.604
ValGly: 6.26 ± 1.594
ValHis: 1.118 ± 0.471
ValIle: 2.906 ± 0.329
ValLys: 2.906 ± 0.309
ValLeu: 7.378 ± 1.328
ValMet: 1.118 ± 0.225
ValAsn: 2.906 ± 0.56
ValPro: 4.471 ± 0.6
ValGln: 4.024 ± 0.823
ValArg: 4.695 ± 0.269
ValSer: 5.366 ± 1.054
ValThr: 1.565 ± 0.819
ValVal: 5.589 ± 0.898
ValTrp: 0.0 ± 0.0
ValTyr: 2.459 ± 0.562
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.894 ± 0.392
TrpCys: 0.671 ± 0.582
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.894 ± 0.474
TrpPhe: 0.224 ± 0.187
TrpGly: 0.447 ± 0.088
TrpHis: 0.0 ± 0.0
TrpIle: 0.224 ± 0.157
TrpLys: 0.671 ± 0.173
TrpLeu: 1.118 ± 0.225
TrpMet: 0.224 ± 0.157
TrpAsn: 0.224 ± 0.157
TrpPro: 0.671 ± 0.39
TrpGln: 1.118 ± 0.818
TrpArg: 2.012 ± 0.48
TrpSer: 1.789 ± 0.282
TrpThr: 0.447 ± 0.088
TrpVal: 0.894 ± 0.318
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.459 ± 0.808
TyrCys: 0.447 ± 0.526
TyrAsp: 2.012 ± 0.789
TyrGlu: 0.224 ± 0.157
TyrPhe: 2.236 ± 0.45
TyrGly: 2.236 ± 1.079
TyrHis: 0.0 ± 0.0
TyrIle: 0.894 ± 0.176
TyrLys: 1.565 ± 0.488
TyrLeu: 3.353 ± 0.896
TyrMet: 0.224 ± 0.157
TyrAsn: 1.118 ± 0.785
TyrPro: 1.565 ± 0.398
TyrGln: 1.341 ± 0.264
TyrArg: 2.459 ± 0.453
TyrSer: 3.353 ± 0.543
TyrThr: 1.118 ± 0.319
TyrVal: 2.012 ± 0.519
TyrTrp: 0.447 ± 0.088
TyrTyr: 0.894 ± 0.176
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (4474 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
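
A protein-level bootstrap of this kind can be sketched in Python as shown below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resampled values is reported. The function names, the fixed seed, the toy sequences, and the use of the sample standard deviation as the error are all assumptions for this example. The page does not say which spread measure Proteome-pI reports.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (per mille) over overlapping pairs in all proteins."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Standard deviation of the frequency across protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size fixed.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy proteome (hypothetical sequences, not the ToRSV proteins).
toy = ["AAAA", "LLLL", "ALAL", "AALL"]
print(pair_per_mille(toy, "AA"), bootstrap_error(toy, "AA"))
```

With only 4 proteins, as in this proteome, individual resamples can differ a lot, which is one reason some errors in the table are large relative to the values.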

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski