Amino acid dipeptide frequency for Rice grassy stunt tenuivirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each with equal probability, every amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
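The counting described above can be sketched as follows. This is a minimal illustration, not the database's own code: it assumes overlapping pairs are counted within each protein (never across two proteins), uses one-letter residue codes rather than the three-letter codes of the table, and compares against the uniform expectation of 2.5‰. The names `dipeptide_per_mille`, `classify`, and the toy sequences are hypothetical.

```python
from collections import Counter

# Uniform expectation: 1000 per mille shared equally by 400 dipeptides.
EXPECTED_UNIFORM = 1000.0 / 400  # 2.5 per mille


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2 along the sequence; pairs never
        # span the boundary between two proteins (an assumption).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille, expected=EXPECTED_UNIFORM):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"


# Toy input (not real viral proteins): 8 dipeptides in total, "SL" twice.
toy = ["MKLSLL", "MSLK"]
freqs = dipeptide_per_mille(toy)
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3), classify(value))
```

With real data, `classify(8.427)` would label the AspLeu value from the table as overrepresented and `classify(0.159)` would label AlaTrp as underrepresented.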

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
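As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and into percent. The helper names are hypothetical; the AspLeu value is taken from the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to percent (1% equals 10 per mille)."""
    return value_per_mille / 10.0


asp_leu = 8.427  # AspLeu frequency from the table, in per mille
print(per_mille_to_fraction(asp_leu))  # fraction of all dipeptides
print(per_mille_to_percent(asp_leu))   # the same value in percent
```

The same conversion shows that the uniform expectation of 0.25% corresponds to 2.5‰, which is the scale used in the table.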

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± error), grouped by first residue. Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 1.749 ± 0.569
AlaCys: 0.636 ± 0.333
AlaAsp: 1.908 ± 1.136
AlaGlu: 2.385 ± 0.474
AlaPhe: 1.59 ± 0.424
AlaGly: 1.431 ± 0.782
AlaHis: 0.795 ± 0.222
AlaIle: 2.703 ± 0.764
AlaLys: 2.067 ± 0.606
AlaLeu: 3.18 ± 0.85
AlaMet: 1.749 ± 0.634
AlaAsn: 1.749 ± 0.465
AlaPro: 1.113 ± 0.357
AlaGln: 0.795 ± 0.312
AlaArg: 1.908 ± 0.544
AlaSer: 3.975 ± 0.658
AlaThr: 1.113 ± 0.661
AlaVal: 1.749 ± 0.493
AlaTrp: 0.159 ± 0.096
AlaTyr: 2.544 ± 0.522
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.795 ± 0.463
CysCys: 0.477 ± 0.407
CysAsp: 1.272 ± 0.578
CysGlu: 1.431 ± 0.576
CysPhe: 0.795 ± 0.241
CysGly: 0.954 ± 0.578
CysHis: 0.159 ± 0.096
CysIle: 1.59 ± 0.492
CysLys: 1.59 ± 0.436
CysLeu: 1.908 ± 0.598
CysMet: 0.636 ± 0.433
CysAsn: 1.749 ± 1.233
CysPro: 0.477 ± 0.268
CysGln: 0.954 ± 0.33
CysArg: 1.113 ± 0.497
CysSer: 1.908 ± 0.769
CysThr: 0.795 ± 0.235
CysVal: 0.795 ± 0.276
CysTrp: 0.159 ± 0.251
CysTyr: 0.477 ± 0.281
CysXaa: 0.0 ± 0.0

Asp
AspAla: 2.544 ± 0.757
AspCys: 1.431 ± 0.687
AspAsp: 6.201 ± 0.76
AspGlu: 5.088 ± 0.76
AspPhe: 3.498 ± 0.799
AspGly: 3.657 ± 1.105
AspHis: 1.113 ± 0.498
AspIle: 3.339 ± 0.818
AspLys: 3.657 ± 1.355
AspLeu: 8.427 ± 1.234
AspMet: 2.703 ± 0.957
AspAsn: 3.657 ± 1.196
AspPro: 3.18 ± 1.09
AspGln: 3.021 ± 0.469
AspArg: 2.703 ± 0.54
AspSer: 4.77 ± 0.884
AspThr: 2.067 ± 0.467
AspVal: 4.77 ± 0.5
AspTrp: 0.795 ± 0.328
AspTyr: 3.18 ± 0.608
AspXaa: 0.0 ± 0.0

Glu
GluAla: 2.544 ± 0.617
GluCys: 0.954 ± 0.324
GluAsp: 4.77 ± 1.117
GluGlu: 3.975 ± 1.051
GluPhe: 2.385 ± 0.882
GluGly: 2.544 ± 0.791
GluHis: 1.908 ± 0.709
GluIle: 3.498 ± 1.222
GluLys: 4.77 ± 1.209
GluLeu: 7.632 ± 1.116
GluMet: 1.749 ± 0.712
GluAsn: 4.293 ± 1.474
GluPro: 0.954 ± 0.234
GluGln: 1.749 ± 0.411
GluArg: 2.703 ± 0.455
GluSer: 5.565 ± 1.905
GluThr: 4.611 ± 0.734
GluVal: 3.975 ± 0.737
GluTrp: 0.477 ± 0.219
GluTyr: 3.339 ± 0.731
GluXaa: 0.0 ± 0.0

Phe
PheAla: 1.431 ± 0.418
PheCys: 0.636 ± 0.309
PheAsp: 0.954 ± 0.371
PheGlu: 1.749 ± 0.403
PhePhe: 1.431 ± 0.454
PheGly: 2.226 ± 0.763
PheHis: 0.954 ± 0.334
PheIle: 3.021 ± 0.865
PheLys: 2.385 ± 0.4
PheLeu: 4.77 ± 0.603
PheMet: 1.272 ± 0.633
PheAsn: 2.226 ± 0.455
PhePro: 1.59 ± 0.531
PheGln: 0.954 ± 0.401
PheArg: 2.862 ± 0.788
PheSer: 5.406 ± 0.847
PheThr: 3.18 ± 0.669
PheVal: 3.021 ± 0.645
PheTrp: 0.477 ± 0.195
PheTyr: 1.113 ± 0.37
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 0.795 ± 0.282
GlyCys: 0.954 ± 0.623
GlyAsp: 3.816 ± 0.605
GlyGlu: 1.908 ± 0.431
GlyPhe: 3.021 ± 0.579
GlyGly: 2.544 ± 0.565
GlyHis: 0.795 ± 0.457
GlyIle: 4.134 ± 1.064
GlyLys: 2.544 ± 0.785
GlyLeu: 4.77 ± 0.833
GlyMet: 0.954 ± 0.223
GlyAsn: 2.544 ± 0.837
GlyPro: 0.954 ± 0.261
GlyGln: 1.908 ± 0.579
GlyArg: 2.385 ± 0.645
GlySer: 3.498 ± 0.824
GlyThr: 2.544 ± 0.478
GlyVal: 4.293 ± 0.704
GlyTrp: 0.477 ± 0.222
GlyTyr: 3.021 ± 0.525
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.318 ± 0.191
HisCys: 0.636 ± 0.298
HisAsp: 2.385 ± 0.465
HisGlu: 1.59 ± 0.469
HisPhe: 0.795 ± 0.341
HisGly: 1.113 ± 0.312
HisHis: 0.477 ± 0.195
HisIle: 0.795 ± 0.457
HisLys: 1.749 ± 0.423
HisLeu: 1.59 ± 0.387
HisMet: 1.113 ± 0.374
HisAsn: 0.477 ± 0.287
HisPro: 0.954 ± 0.31
HisGln: 0.636 ± 0.263
HisArg: 1.113 ± 0.34
HisSer: 1.113 ± 0.555
HisThr: 1.431 ± 0.629
HisVal: 1.272 ± 0.307
HisTrp: 0.477 ± 0.399
HisTyr: 2.226 ± 0.612
HisXaa: 0.0 ± 0.0

Ile
IleAla: 2.544 ± 0.898
IleCys: 1.908 ± 0.6
IleAsp: 4.293 ± 0.999
IleGlu: 4.134 ± 0.709
IlePhe: 1.908 ± 0.356
IleGly: 2.862 ± 0.639
IleHis: 2.067 ± 0.649
IleIle: 5.883 ± 1.391
IleLys: 6.36 ± 1.276
IleLeu: 5.088 ± 0.827
IleMet: 1.113 ± 0.615
IleAsn: 3.498 ± 1.151
IlePro: 3.498 ± 1.014
IleGln: 2.703 ± 0.684
IleArg: 4.611 ± 0.788
IleSer: 6.36 ± 0.529
IleThr: 3.975 ± 1.529
IleVal: 4.134 ± 0.83
IleTrp: 0.477 ± 0.195
IleTyr: 3.657 ± 1.156
IleXaa: 0.0 ± 0.0

Lys
LysAla: 2.862 ± 0.955
LysCys: 2.226 ± 0.938
LysAsp: 5.247 ± 1.161
LysGlu: 4.134 ± 1.135
LysPhe: 2.544 ± 0.825
LysGly: 2.067 ± 0.747
LysHis: 1.908 ± 0.467
LysIle: 5.883 ± 0.999
LysLys: 5.883 ± 1.714
LysLeu: 7.95 ± 0.668
LysMet: 3.498 ± 0.903
LysAsn: 4.77 ± 1.023
LysPro: 2.067 ± 0.816
LysGln: 1.749 ± 0.415
LysArg: 2.544 ± 0.735
LysSer: 4.77 ± 0.634
LysThr: 4.929 ± 0.824
LysVal: 4.611 ± 1.135
LysTrp: 0.477 ± 0.287
LysTyr: 3.816 ± 0.98
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 3.657 ± 0.894
LeuCys: 1.749 ± 0.262
LeuAsp: 6.36 ± 1.838
LeuGlu: 7.473 ± 0.938
LeuPhe: 4.452 ± 0.835
LeuGly: 6.042 ± 1.203
LeuHis: 2.544 ± 0.638
LeuIle: 6.042 ± 1.398
LeuLys: 8.586 ± 0.715
LeuLeu: 7.95 ± 1.399
LeuMet: 3.021 ± 0.773
LeuAsn: 4.77 ± 0.925
LeuPro: 3.339 ± 0.607
LeuGln: 2.385 ± 0.646
LeuArg: 4.929 ± 1.001
LeuSer: 9.858 ± 1.265
LeuThr: 5.247 ± 0.826
LeuVal: 6.042 ± 0.873
LeuTrp: 1.59 ± 0.408
LeuTyr: 2.862 ± 0.637
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.749 ± 0.363
MetCys: 0.318 ± 0.191
MetAsp: 2.226 ± 0.372
MetGlu: 2.226 ± 1.339
MetPhe: 1.113 ± 0.413
MetGly: 2.067 ± 0.492
MetHis: 1.113 ± 0.546
MetIle: 2.544 ± 0.636
MetLys: 1.59 ± 0.391
MetLeu: 3.021 ± 0.794
MetMet: 0.636 ± 0.283
MetAsn: 2.226 ± 0.747
MetPro: 0.318 ± 0.307
MetGln: 1.113 ± 0.314
MetArg: 1.431 ± 0.289
MetSer: 2.862 ± 1.023
MetThr: 2.226 ± 0.484
MetVal: 1.59 ± 0.727
MetTrp: 0.795 ± 0.502
MetTyr: 1.59 ± 0.301
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.703 ± 1.15
AsnCys: 1.59 ± 0.341
AsnAsp: 3.657 ± 0.784
AsnGlu: 2.862 ± 0.677
AsnPhe: 2.226 ± 0.492
AsnGly: 2.703 ± 0.829
AsnHis: 0.954 ± 0.67
AsnIle: 3.975 ± 0.616
AsnLys: 4.77 ± 0.835
AsnLeu: 5.883 ± 1.176
AsnMet: 1.113 ± 0.415
AsnAsn: 2.703 ± 1.034
AsnPro: 2.226 ± 0.41
AsnGln: 1.59 ± 0.414
AsnArg: 2.385 ± 0.404
AsnSer: 3.975 ± 0.723
AsnThr: 3.498 ± 0.999
AsnVal: 3.816 ± 0.674
AsnTrp: 1.113 ± 0.455
AsnTyr: 1.431 ± 0.405
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 1.431 ± 0.538
ProCys: 0.159 ± 0.206
ProAsp: 2.544 ± 0.704
ProGlu: 2.226 ± 0.51
ProPhe: 1.431 ± 0.486
ProGly: 1.113 ± 0.376
ProHis: 0.477 ± 0.225
ProIle: 2.544 ± 1.356
ProLys: 2.226 ± 0.634
ProLeu: 3.339 ± 1.556
ProMet: 0.795 ± 0.345
ProAsn: 2.226 ± 0.577
ProPro: 0.795 ± 0.274
ProGln: 1.59 ± 0.61
ProArg: 1.59 ± 0.547
ProSer: 1.272 ± 0.765
ProThr: 2.226 ± 0.815
ProVal: 1.59 ± 0.62
ProTrp: 0.477 ± 0.219
ProTyr: 0.795 ± 0.341
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 0.954 ± 0.378
GlnCys: 0.477 ± 0.215
GlnAsp: 0.954 ± 0.307
GlnGlu: 2.703 ± 0.517
GlnPhe: 2.544 ± 0.749
GlnGly: 1.113 ± 0.452
GlnHis: 0.795 ± 0.423
GlnIle: 2.385 ± 0.399
GlnLys: 2.703 ± 0.669
GlnLeu: 2.862 ± 0.468
GlnMet: 1.272 ± 0.499
GlnAsn: 1.908 ± 0.776
GlnPro: 1.113 ± 0.355
GlnGln: 1.113 ± 0.391
GlnArg: 1.113 ± 0.339
GlnSer: 2.385 ± 1.077
GlnThr: 1.749 ± 0.329
GlnVal: 1.908 ± 0.623
GlnTrp: 0.159 ± 0.096
GlnTyr: 1.59 ± 0.268
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 1.749 ± 0.547
ArgCys: 0.636 ± 0.352
ArgAsp: 4.134 ± 0.527
ArgGlu: 4.134 ± 0.863
ArgPhe: 1.59 ± 0.296
ArgGly: 3.021 ± 0.485
ArgHis: 0.795 ± 0.472
ArgIle: 3.657 ± 0.47
ArgLys: 3.339 ± 0.429
ArgLeu: 4.929 ± 0.888
ArgMet: 2.067 ± 0.403
ArgAsn: 2.385 ± 0.875
ArgPro: 1.272 ± 0.322
ArgGln: 1.59 ± 0.62
ArgArg: 3.339 ± 0.971
ArgSer: 3.816 ± 1.085
ArgThr: 1.431 ± 0.369
ArgVal: 3.657 ± 0.668
ArgTrp: 0.795 ± 0.214
ArgTyr: 1.59 ± 0.855
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 2.385 ± 0.577
SerCys: 0.795 ± 0.47
SerAsp: 6.36 ± 1.142
SerGlu: 5.247 ± 1.398
SerPhe: 3.18 ± 0.699
SerGly: 4.134 ± 0.634
SerHis: 0.795 ± 0.289
SerIle: 5.883 ± 0.77
SerLys: 7.314 ± 0.551
SerLeu: 8.745 ± 1.833
SerMet: 3.816 ± 0.988
SerAsn: 4.611 ± 1.21
SerPro: 1.431 ± 0.449
SerGln: 2.067 ± 0.649
SerArg: 4.77 ± 0.985
SerSer: 7.95 ± 0.945
SerThr: 5.088 ± 0.683
SerVal: 5.883 ± 0.63
SerTrp: 0.477 ± 0.365
SerTyr: 2.862 ± 0.713
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 1.113 ± 0.673
ThrCys: 1.113 ± 0.6
ThrAsp: 3.816 ± 0.677
ThrGlu: 3.657 ± 0.939
ThrPhe: 2.385 ± 0.512
ThrGly: 2.862 ± 0.782
ThrHis: 1.272 ± 0.327
ThrIle: 4.929 ± 0.933
ThrLys: 4.134 ± 0.704
ThrLeu: 4.929 ± 0.813
ThrMet: 2.544 ± 0.541
ThrAsn: 3.021 ± 0.844
ThrPro: 2.067 ± 0.414
ThrGln: 1.431 ± 0.448
ThrArg: 2.544 ± 0.608
ThrSer: 4.452 ± 1.268
ThrThr: 3.339 ± 1.184
ThrVal: 3.339 ± 0.831
ThrTrp: 0.477 ± 0.248
ThrTyr: 1.59 ± 0.492
ThrXaa: 0.0 ± 0.0

Val
ValAla: 2.544 ± 0.683
ValCys: 2.226 ± 0.671
ValAsp: 4.611 ± 1.089
ValGlu: 3.498 ± 0.963
ValPhe: 2.544 ± 0.57
ValGly: 2.862 ± 0.514
ValHis: 1.908 ± 0.444
ValIle: 3.975 ± 1.047
ValLys: 4.293 ± 1.028
ValLeu: 6.201 ± 0.843
ValMet: 0.795 ± 0.296
ValAsn: 3.975 ± 0.827
ValPro: 2.862 ± 0.837
ValGln: 2.703 ± 0.392
ValArg: 3.021 ± 0.53
ValSer: 6.042 ± 0.571
ValThr: 1.908 ± 0.719
ValVal: 3.498 ± 0.976
ValTrp: 0.636 ± 0.254
ValTyr: 2.067 ± 0.699
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.477 ± 0.28
TrpCys: 0.159 ± 0.096
TrpAsp: 0.636 ± 0.323
TrpGlu: 0.318 ± 0.163
TrpPhe: 1.113 ± 0.319
TrpGly: 0.795 ± 0.247
TrpHis: 0.159 ± 0.096
TrpIle: 1.272 ± 0.582
TrpLys: 0.636 ± 0.319
TrpLeu: 1.113 ± 0.331
TrpMet: 0.318 ± 0.191
TrpAsn: 0.318 ± 0.163
TrpPro: 0.477 ± 0.215
TrpGln: 0.159 ± 0.17
TrpArg: 0.477 ± 0.287
TrpSer: 0.636 ± 0.298
TrpThr: 0.636 ± 0.278
TrpVal: 0.159 ± 0.096
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.795 ± 0.306
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 0.954 ± 0.285
TyrCys: 0.954 ± 0.323
TyrAsp: 3.657 ± 0.633
TyrGlu: 3.657 ± 0.602
TyrPhe: 1.113 ± 0.508
TyrGly: 1.749 ± 0.325
TyrHis: 1.272 ± 0.395
TyrIle: 3.021 ± 0.628
TyrLys: 3.339 ± 0.542
TyrLeu: 4.77 ± 0.709
TyrMet: 1.431 ± 0.684
TyrAsn: 2.067 ± 0.694
TyrPro: 0.159 ± 0.096
TyrGln: 1.59 ± 0.499
TyrArg: 2.385 ± 0.616
TyrSer: 3.18 ± 0.627
TyrThr: 3.021 ± 1.06
TyrVal: 2.067 ± 0.501
TyrTrp: 0.159 ± 0.096
TyrTyr: 2.067 ± 0.515
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 12 proteins (6290 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
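A protein-level bootstrap of this kind can be sketched as below. This is an illustration under stated assumptions, not the database's actual procedure: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the sample standard deviation of the replicates is taken as the error. The function names, the use of the standard deviation, and the toy sequences are all hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs within each protein (an assumption).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Spread of the frequency when whole proteins are resampled."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy input (not real viral proteins).
toy = ["MKLSLL", "MSLK", "MLLSKK", "MSSLL"]
value = pair_per_mille(toy, "SL")
error = bootstrap_error(toy, "SL")
print(f"SL: {value:.3f} ± {error:.3f}")
```

A dipeptide that never occurs yields 0.0 in every replicate, so its error is also 0.0, which matches the `0.0 ± 0.0` entries for all Xaa dipeptides in the table.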

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski