Amino acid dipepetide frequency for Rice stripe necrosis virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.902AlaAla: 7.902 ± 1.831
0.718AlaCys: 0.718 ± 0.437
4.31AlaAsp: 4.31 ± 1.144
5.268AlaGlu: 5.268 ± 0.979
2.395AlaPhe: 2.395 ± 1.011
5.508AlaGly: 5.508 ± 1.864
0.958AlaHis: 0.958 ± 0.582
5.747AlaIle: 5.747 ± 1.589
4.31AlaLys: 4.31 ± 1.186
8.381AlaLeu: 8.381 ± 1.04
4.071AlaMet: 4.071 ± 1.023
2.634AlaAsn: 2.634 ± 0.6
5.029AlaPro: 5.029 ± 1.566
2.155AlaGln: 2.155 ± 0.677
5.987AlaArg: 5.987 ± 0.73
5.747AlaSer: 5.747 ± 1.667
2.634AlaThr: 2.634 ± 1.137
5.987AlaVal: 5.987 ± 1.541
0.239AlaTrp: 0.239 ± 0.146
2.634AlaTyr: 2.634 ± 1.137
0.239AlaXaa: 0.239 ± 0.275
Cys
1.197CysAla: 1.197 ± 0.275
0.239CysCys: 0.239 ± 0.502
1.197CysAsp: 1.197 ± 0.427
0.479CysGlu: 0.479 ± 0.69
0.239CysPhe: 0.239 ± 0.146
1.676CysGly: 1.676 ± 0.734
0.479CysHis: 0.479 ± 0.337
1.197CysIle: 1.197 ± 1.119
0.479CysLys: 0.479 ± 0.632
1.676CysLeu: 1.676 ± 0.687
0.718CysMet: 0.718 ± 0.386
2.395CysAsn: 2.395 ± 2.2
0.0CysPro: 0.0 ± 0.0
0.479CysGln: 0.479 ± 0.291
0.479CysArg: 0.479 ± 0.37
0.958CysSer: 0.958 ± 0.483
2.155CysThr: 2.155 ± 1.133
1.437CysVal: 1.437 ± 1.13
0.479CysTrp: 0.479 ± 0.337
0.239CysTyr: 0.239 ± 0.146
0.0CysXaa: 0.0 ± 0.0
Asp
4.071AspAla: 4.071 ± 0.785
0.958AspCys: 0.958 ± 0.483
2.634AspAsp: 2.634 ± 0.679
3.352AspGlu: 3.352 ± 0.66
1.676AspPhe: 1.676 ± 0.533
3.352AspGly: 3.352 ± 1.023
0.718AspHis: 0.718 ± 0.377
2.395AspIle: 2.395 ± 0.457
3.113AspLys: 3.113 ± 0.798
5.747AspLeu: 5.747 ± 1.064
1.916AspMet: 1.916 ± 0.813
2.634AspAsn: 2.634 ± 1.247
2.874AspPro: 2.874 ± 0.58
0.479AspGln: 0.479 ± 0.291
3.113AspArg: 3.113 ± 1.143
3.352AspSer: 3.352 ± 1.096
2.155AspThr: 2.155 ± 0.949
5.268AspVal: 5.268 ± 1.44
2.874AspTrp: 2.874 ± 0.726
3.113AspTyr: 3.113 ± 1.097
0.0AspXaa: 0.0 ± 0.0
Glu
3.831GluAla: 3.831 ± 0.887
1.437GluCys: 1.437 ± 0.414
2.155GluAsp: 2.155 ± 0.443
3.831GluGlu: 3.831 ± 0.55
4.55GluPhe: 4.55 ± 1.677
2.395GluGly: 2.395 ± 0.679
1.197GluHis: 1.197 ± 0.501
5.029GluIle: 5.029 ± 0.662
3.592GluLys: 3.592 ± 0.858
5.029GluLeu: 5.029 ± 0.662
1.676GluMet: 1.676 ± 0.817
2.395GluAsn: 2.395 ± 1.116
4.071GluPro: 4.071 ± 0.548
2.395GluGln: 2.395 ± 0.637
5.987GluArg: 5.987 ± 2.366
3.831GluSer: 3.831 ± 0.756
2.874GluThr: 2.874 ± 0.793
4.55GluVal: 4.55 ± 1.276
0.958GluTrp: 0.958 ± 0.357
2.395GluTyr: 2.395 ± 0.674
0.0GluXaa: 0.0 ± 0.0
Phe
3.113PheAla: 3.113 ± 1.301
1.197PheCys: 1.197 ± 0.558
2.874PheAsp: 2.874 ± 0.424
3.592PheGlu: 3.592 ± 0.884
0.239PhePhe: 0.239 ± 0.146
2.395PheGly: 2.395 ± 0.674
0.239PheHis: 0.239 ± 0.146
1.676PheIle: 1.676 ± 1.111
2.634PheLys: 2.634 ± 0.413
3.831PheLeu: 3.831 ± 1.263
0.958PheMet: 0.958 ± 0.272
1.676PheAsn: 1.676 ± 0.469
1.676PhePro: 1.676 ± 0.621
1.916PheGln: 1.916 ± 0.536
2.155PheArg: 2.155 ± 1.055
3.831PheSer: 3.831 ± 0.756
3.113PheThr: 3.113 ± 1.196
3.352PheVal: 3.352 ± 0.539
0.718PheTrp: 0.718 ± 1.071
0.479PheTyr: 0.479 ± 0.382
0.0PheXaa: 0.0 ± 0.0
Gly
6.226GlyAla: 6.226 ± 0.869
1.676GlyCys: 1.676 ± 0.914
4.789GlyAsp: 4.789 ± 0.378
3.352GlyGlu: 3.352 ± 0.738
2.395GlyPhe: 2.395 ± 0.461
4.071GlyGly: 4.071 ± 1.667
0.958GlyHis: 0.958 ± 0.582
2.634GlyIle: 2.634 ± 0.575
3.352GlyLys: 3.352 ± 0.76
4.31GlyLeu: 4.31 ± 1.264
1.916GlyMet: 1.916 ± 1.025
3.352GlyAsn: 3.352 ± 1.291
3.352GlyPro: 3.352 ± 1.128
3.113GlyGln: 3.113 ± 0.477
2.155GlyArg: 2.155 ± 1.327
6.466GlySer: 6.466 ± 2.918
2.634GlyThr: 2.634 ± 0.577
4.55GlyVal: 4.55 ± 1.371
0.718GlyTrp: 0.718 ± 0.297
3.831GlyTyr: 3.831 ± 1.002
0.0GlyXaa: 0.0 ± 0.0
His
1.437HisAla: 1.437 ± 0.258
0.239HisCys: 0.239 ± 0.146
1.437HisAsp: 1.437 ± 0.447
0.239HisGlu: 0.239 ± 0.357
1.197HisPhe: 1.197 ± 0.728
1.197HisGly: 1.197 ± 0.427
0.718HisHis: 0.718 ± 0.455
1.437HisIle: 1.437 ± 0.447
2.155HisLys: 2.155 ± 0.754
0.479HisLeu: 0.479 ± 0.291
0.718HisMet: 0.718 ± 0.437
0.479HisAsn: 0.479 ± 0.543
1.197HisPro: 1.197 ± 0.406
0.479HisGln: 0.479 ± 0.543
0.958HisArg: 0.958 ± 0.483
0.239HisSer: 0.239 ± 0.502
0.718HisThr: 0.718 ± 0.455
2.634HisVal: 2.634 ± 1.197
0.0HisTrp: 0.0 ± 0.0
0.958HisTyr: 0.958 ± 0.447
0.0HisXaa: 0.0 ± 0.0
Ile
5.508IleAla: 5.508 ± 1.749
0.479IleCys: 0.479 ± 1.004
2.155IleAsp: 2.155 ± 0.775
3.352IleGlu: 3.352 ± 0.674
2.155IlePhe: 2.155 ± 0.349
3.113IleGly: 3.113 ± 1.768
0.718IleHis: 0.718 ± 0.386
2.634IleIle: 2.634 ± 1.103
3.352IleLys: 3.352 ± 0.953
3.592IleLeu: 3.592 ± 1.085
0.718IleMet: 0.718 ± 0.377
1.916IleAsn: 1.916 ± 0.439
3.113IlePro: 3.113 ± 1.097
2.395IleGln: 2.395 ± 0.421
2.155IleArg: 2.155 ± 0.568
5.508IleSer: 5.508 ± 0.817
2.395IleThr: 2.395 ± 0.419
2.874IleVal: 2.874 ± 1.264
0.239IleTrp: 0.239 ± 0.146
1.197IleTyr: 1.197 ± 0.427
0.0IleXaa: 0.0 ± 0.0
Lys
3.831LysAla: 3.831 ± 0.797
0.0LysCys: 0.0 ± 0.0
3.352LysAsp: 3.352 ± 0.818
4.071LysGlu: 4.071 ± 1.095
2.155LysPhe: 2.155 ± 1.047
3.592LysGly: 3.592 ± 1.216
0.958LysHis: 0.958 ± 0.272
3.831LysIle: 3.831 ± 0.789
2.155LysLys: 2.155 ± 0.561
3.592LysLeu: 3.592 ± 1.086
1.197LysMet: 1.197 ± 0.412
2.155LysAsn: 2.155 ± 0.883
1.916LysPro: 1.916 ± 1.165
2.155LysGln: 2.155 ± 1.327
2.634LysArg: 2.634 ± 1.602
2.155LysSer: 2.155 ± 1.607
4.071LysThr: 4.071 ± 0.628
2.155LysVal: 2.155 ± 1.038
0.958LysTrp: 0.958 ± 0.403
1.197LysTyr: 1.197 ± 0.754
0.0LysXaa: 0.0 ± 0.0
Leu
7.663LeuAla: 7.663 ± 1.356
1.437LeuCys: 1.437 ± 1.046
4.071LeuAsp: 4.071 ± 0.838
6.466LeuGlu: 6.466 ± 0.998
2.395LeuPhe: 2.395 ± 0.674
3.592LeuGly: 3.592 ± 1.645
1.676LeuHis: 1.676 ± 0.462
2.395LeuIle: 2.395 ± 1.609
4.789LeuLys: 4.789 ± 0.639
5.747LeuLeu: 5.747 ± 1.276
2.874LeuMet: 2.874 ± 0.574
3.592LeuAsn: 3.592 ± 1.784
4.31LeuPro: 4.31 ± 1.001
3.113LeuGln: 3.113 ± 0.944
8.621LeuArg: 8.621 ± 1.125
6.944LeuSer: 6.944 ± 2.474
3.352LeuThr: 3.352 ± 0.769
6.705LeuVal: 6.705 ± 1.395
0.958LeuTrp: 0.958 ± 0.608
1.197LeuTyr: 1.197 ± 0.728
0.0LeuXaa: 0.0 ± 0.0
Met
2.155MetAla: 2.155 ± 1.097
0.479MetCys: 0.479 ± 0.37
1.197MetAsp: 1.197 ± 0.393
0.958MetGlu: 0.958 ± 0.582
0.958MetPhe: 0.958 ± 0.447
1.437MetGly: 1.437 ± 0.356
0.718MetHis: 0.718 ± 0.361
1.197MetIle: 1.197 ± 0.558
0.239MetLys: 0.239 ± 0.146
3.592MetLeu: 3.592 ± 0.636
0.479MetMet: 0.479 ± 0.337
1.197MetAsn: 1.197 ± 0.571
1.437MetPro: 1.437 ± 0.356
1.197MetGln: 1.197 ± 0.548
1.437MetArg: 1.437 ± 0.595
3.352MetSer: 3.352 ± 1.185
1.437MetThr: 1.437 ± 0.722
2.155MetVal: 2.155 ± 0.739
0.0MetTrp: 0.0 ± 0.0
0.958MetTyr: 0.958 ± 0.272
0.0MetXaa: 0.0 ± 0.0
Asn
3.592AsnAla: 3.592 ± 1.329
1.197AsnCys: 1.197 ± 1.009
2.395AsnAsp: 2.395 ± 0.511
2.155AsnGlu: 2.155 ± 0.542
2.874AsnPhe: 2.874 ± 0.672
3.352AsnGly: 3.352 ± 1.36
0.718AsnHis: 0.718 ± 0.377
2.634AsnIle: 2.634 ± 0.563
2.155AsnLys: 2.155 ± 0.609
4.789AsnLeu: 4.789 ± 0.802
0.718AsnMet: 0.718 ± 0.45
2.395AsnAsn: 2.395 ± 0.783
2.155AsnPro: 2.155 ± 0.916
0.718AsnGln: 0.718 ± 0.437
2.395AsnArg: 2.395 ± 0.814
4.789AsnSer: 4.789 ± 1.908
1.437AsnThr: 1.437 ± 0.447
3.352AsnVal: 3.352 ± 0.927
0.479AsnTrp: 0.479 ± 0.291
1.197AsnTyr: 1.197 ± 0.548
0.479AsnXaa: 0.479 ± 0.382
Pro
3.352ProAla: 3.352 ± 1.523
0.479ProCys: 0.479 ± 0.594
2.395ProAsp: 2.395 ± 0.831
3.352ProGlu: 3.352 ± 1.066
2.395ProPhe: 2.395 ± 0.401
4.55ProGly: 4.55 ± 0.997
0.958ProHis: 0.958 ± 0.602
4.31ProIle: 4.31 ± 1.674
0.958ProLys: 0.958 ± 0.447
3.831ProLeu: 3.831 ± 1.325
0.239ProMet: 0.239 ± 0.146
2.395ProAsn: 2.395 ± 0.748
2.874ProPro: 2.874 ± 1.08
1.916ProGln: 1.916 ± 1.043
3.831ProArg: 3.831 ± 1.505
3.831ProSer: 3.831 ± 1.383
1.197ProThr: 1.197 ± 0.562
4.071ProVal: 4.071 ± 2.035
0.718ProTrp: 0.718 ± 0.574
2.155ProTyr: 2.155 ± 0.873
0.0ProXaa: 0.0 ± 0.0
Gln
2.874GlnAla: 2.874 ± 0.58
0.479GlnCys: 0.479 ± 0.382
1.916GlnAsp: 1.916 ± 0.74
2.634GlnGlu: 2.634 ± 0.725
1.676GlnPhe: 1.676 ± 0.533
2.634GlnGly: 2.634 ± 0.513
0.718GlnHis: 0.718 ± 0.386
0.718GlnIle: 0.718 ± 0.297
1.197GlnLys: 1.197 ± 0.728
4.071GlnLeu: 4.071 ± 0.844
1.197GlnMet: 1.197 ± 0.611
1.437GlnAsn: 1.437 ± 0.414
1.916GlnPro: 1.916 ± 0.649
1.676GlnGln: 1.676 ± 0.559
1.916GlnArg: 1.916 ± 0.654
3.352GlnSer: 3.352 ± 0.586
1.676GlnThr: 1.676 ± 0.559
2.634GlnVal: 2.634 ± 0.531
0.239GlnTrp: 0.239 ± 0.146
1.676GlnTyr: 1.676 ± 0.685
0.0GlnXaa: 0.0 ± 0.0
Arg
4.789ArgAla: 4.789 ± 1.345
0.958ArgCys: 0.958 ± 0.549
3.592ArgAsp: 3.592 ± 0.474
4.55ArgGlu: 4.55 ± 1.21
3.831ArgPhe: 3.831 ± 1.225
3.831ArgGly: 3.831 ± 1.025
1.197ArgHis: 1.197 ± 0.826
1.676ArgIle: 1.676 ± 0.459
3.352ArgLys: 3.352 ± 0.923
4.789ArgLeu: 4.789 ± 1.679
2.155ArgMet: 2.155 ± 0.533
1.916ArgAsn: 1.916 ± 0.92
3.113ArgPro: 3.113 ± 1.151
2.634ArgGln: 2.634 ± 1.257
5.987ArgArg: 5.987 ± 3.968
4.31ArgSer: 4.31 ± 0.754
3.592ArgThr: 3.592 ± 0.908
5.747ArgVal: 5.747 ± 1.716
1.197ArgTrp: 1.197 ± 0.914
1.916ArgTyr: 1.916 ± 0.663
0.0ArgXaa: 0.0 ± 0.0
Ser
5.268SerAla: 5.268 ± 1.059
1.676SerCys: 1.676 ± 1.165
5.987SerAsp: 5.987 ± 1.455
5.987SerGlu: 5.987 ± 0.474
3.592SerPhe: 3.592 ± 1.304
7.184SerGly: 7.184 ± 1.106
1.676SerHis: 1.676 ± 0.689
2.874SerIle: 2.874 ± 0.706
2.634SerLys: 2.634 ± 1.131
5.268SerLeu: 5.268 ± 0.953
1.916SerMet: 1.916 ± 0.474
3.352SerAsn: 3.352 ± 0.518
2.395SerPro: 2.395 ± 0.847
3.352SerGln: 3.352 ± 0.866
4.31SerArg: 4.31 ± 1.47
6.705SerSer: 6.705 ± 2.011
3.352SerThr: 3.352 ± 0.703
5.987SerVal: 5.987 ± 1.695
0.239SerTrp: 0.239 ± 0.146
3.352SerTyr: 3.352 ± 1.034
0.0SerXaa: 0.0 ± 0.0
Thr
3.592ThrAla: 3.592 ± 1.413
0.718ThrCys: 0.718 ± 0.361
3.352ThrAsp: 3.352 ± 1.391
4.071ThrGlu: 4.071 ± 1.024
2.155ThrPhe: 2.155 ± 0.791
1.916ThrGly: 1.916 ± 0.663
1.676ThrHis: 1.676 ± 0.985
2.155ThrIle: 2.155 ± 0.542
3.352ThrLys: 3.352 ± 0.75
4.31ThrLeu: 4.31 ± 1.105
1.197ThrMet: 1.197 ± 0.534
1.197ThrAsn: 1.197 ± 0.741
2.874ThrPro: 2.874 ± 0.811
1.916ThrGln: 1.916 ± 0.539
2.155ThrArg: 2.155 ± 0.615
2.874ThrSer: 2.874 ± 0.799
1.916ThrThr: 1.916 ± 0.949
2.634ThrVal: 2.634 ± 1.137
1.676ThrTrp: 1.676 ± 0.316
1.676ThrTyr: 1.676 ± 0.462
0.0ThrXaa: 0.0 ± 0.0
Val
10.057ValAla: 10.057 ± 1.005
2.874ValCys: 2.874 ± 1.71
4.31ValAsp: 4.31 ± 0.765
2.395ValGlu: 2.395 ± 0.988
2.155ValPhe: 2.155 ± 0.707
4.789ValGly: 4.789 ± 0.823
1.676ValHis: 1.676 ± 0.316
3.113ValIle: 3.113 ± 1.2
3.113ValLys: 3.113 ± 1.386
5.987ValLeu: 5.987 ± 1.217
1.197ValMet: 1.197 ± 0.275
4.31ValAsn: 4.31 ± 1.277
3.831ValPro: 3.831 ± 0.964
2.155ValGln: 2.155 ± 0.397
5.747ValArg: 5.747 ± 1.167
5.987ValSer: 5.987 ± 0.73
3.352ValThr: 3.352 ± 0.727
5.747ValVal: 5.747 ± 2.689
0.0ValTrp: 0.0 ± 0.0
2.634ValTyr: 2.634 ± 0.844
0.0ValXaa: 0.0 ± 0.0
Trp
0.718TrpAla: 0.718 ± 0.361
0.479TrpCys: 0.479 ± 0.594
0.239TrpAsp: 0.239 ± 0.146
0.958TrpGlu: 0.958 ± 0.272
0.718TrpPhe: 0.718 ± 0.679
0.718TrpGly: 0.718 ± 0.297
0.239TrpHis: 0.239 ± 0.146
0.479TrpIle: 0.479 ± 0.382
0.0TrpLys: 0.0 ± 0.0
0.958TrpLeu: 0.958 ± 0.462
0.239TrpMet: 0.239 ± 0.146
2.395TrpAsn: 2.395 ± 1.336
0.0TrpPro: 0.0 ± 0.0
0.239TrpGln: 0.239 ± 0.146
0.479TrpArg: 0.479 ± 0.291
0.958TrpSer: 0.958 ± 0.833
0.718TrpThr: 0.718 ± 0.455
1.437TrpVal: 1.437 ± 0.911
0.0TrpTrp: 0.0 ± 0.0
0.718TrpTyr: 0.718 ± 0.297
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.676TyrAla: 1.676 ± 0.829
0.479TyrCys: 0.479 ± 0.37
1.676TyrAsp: 1.676 ± 0.533
3.113TyrGlu: 3.113 ± 0.559
1.916TyrPhe: 1.916 ± 0.419
3.831TyrGly: 3.831 ± 1.401
0.718TyrHis: 0.718 ± 0.297
1.437TyrIle: 1.437 ± 0.664
1.197TyrLys: 1.197 ± 0.728
1.916TyrLeu: 1.916 ± 0.46
0.239TyrMet: 0.239 ± 0.146
2.155TyrAsn: 2.155 ± 0.911
1.676TyrPro: 1.676 ± 0.48
2.155TyrGln: 2.155 ± 0.443
2.634TyrArg: 2.634 ± 0.936
1.916TyrSer: 1.916 ± 0.545
2.634TyrThr: 2.634 ± 0.725
2.155TyrVal: 2.155 ± 0.575
0.0TyrTrp: 0.0 ± 0.0
1.916TyrTyr: 1.916 ± 0.518
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.718XaaGly: 0.718 ± 0.586
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (4177 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski