Amino acid dipeptide frequency for Rice stripe mosaic virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 400 equally likely, each would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
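The expected value can be checked with a few lines of Python. This is only a minimal sketch: the function names are hypothetical, and the simple "above or below the uniform expectation" rule is an assumption, since the page does not state the exact criterion used for the red and blue marking.

```python
def expected_per_mille(n_residue_types: int = 20) -> float:
    """Expected frequency (in per mille) of each dipeptide if all ordered
    pairs of residue types were equally likely."""
    return 1000.0 / (n_residue_types ** 2)


def representation(observed_per_mille: float, n_residue_types: int = 20) -> str:
    """Label a dipeptide relative to the uniform expectation.
    Assumed rule for illustration; the database may use a different one."""
    expected = expected_per_mille(n_residue_types)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


print(expected_per_mille())    # 400 possible dipeptides
print(expected_per_mille(21))  # 441 possible dipeptides when Xaa is included
print(representation(9.011))   # SerLeu value from the table
print(representation(0.257))   # AlaCys value from the table
```

With 20 residue types the expectation is 2.5‰, so SerLeu (9.011‰) falls above it and AlaCys (0.257‰) below it.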

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
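As an illustration of how such per mille values can be obtained, the following sketch counts overlapping adjacent residue pairs in a set of sequences. The function names and the toy sequences are hypothetical, one-letter residue codes are assumed, and the normalisation (dividing by the total number of within-protein pairs) is an assumption; the page does not state the exact procedure used by the database.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Count overlapping adjacent residue pairs and return frequencies in per mille."""
    counts = Counter()
    for seq in sequences:
        # Pairs are taken within each protein only, never across protein boundaries.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


toy = ["MKKLA", "MKA"]  # hypothetical sequences, not taken from the proteome
freqs = dipeptide_per_mille(toy)
print(freqs["MK"])                   # MK occurs in 2 of the 6 pairs
print(per_mille_to_fraction(4.377))  # AlaAla value from the table as a fraction
```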

For more information, see the sequence space article on Wikipedia.

Dipeptide: frequency (‰) ± error. Rows are grouped by the first residue; within each group the second residue runs in the order Ala, Cys, Asp, Glu, Phe, Gly, His, Ile, Lys, Leu, Met, Asn, Pro, Gln, Arg, Ser, Thr, Val, Trp, Tyr, Xaa.

Ala
AlaAla: 4.377 ± 1.798
AlaCys: 0.257 ± 0.363
AlaAsp: 3.09 ± 0.613
AlaGlu: 4.119 ± 1.513
AlaPhe: 1.545 ± 0.638
AlaGly: 3.09 ± 1.97
AlaHis: 0.515 ± 0.298
AlaIle: 4.119 ± 0.831
AlaLys: 4.377 ± 0.968
AlaLeu: 6.437 ± 2.046
AlaMet: 2.575 ± 1.128
AlaAsn: 2.832 ± 0.955
AlaPro: 2.575 ± 1.351
AlaGln: 3.09 ± 0.62
AlaArg: 2.06 ± 0.476
AlaSer: 4.377 ± 1.098
AlaThr: 4.119 ± 1.288
AlaVal: 4.634 ± 1.12
AlaTrp: 0.772 ± 0.528
AlaTyr: 3.605 ± 1.28
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 1.545 ± 0.609
CysCys: 0.257 ± 0.149
CysAsp: 1.545 ± 0.529
CysGlu: 0.772 ± 0.265
CysPhe: 0.772 ± 0.447
CysGly: 0.515 ± 0.237
CysHis: 0.257 ± 0.294
CysIle: 0.515 ± 0.237
CysLys: 0.772 ± 0.694
CysLeu: 1.545 ± 0.592
CysMet: 0.515 ± 0.298
CysAsn: 0.515 ± 0.588
CysPro: 1.802 ± 0.56
CysGln: 0.0 ± 0.0
CysArg: 1.03 ± 0.677
CysSer: 1.03 ± 1.041
CysThr: 0.515 ± 0.237
CysVal: 0.257 ± 0.149
CysTrp: 0.772 ± 0.265
CysTyr: 0.772 ± 0.326
CysXaa: 0.0 ± 0.0

Asp
AspAla: 2.832 ± 1.979
AspCys: 1.287 ± 0.481
AspAsp: 2.575 ± 1.16
AspGlu: 3.862 ± 0.824
AspPhe: 1.03 ± 0.445
AspGly: 2.317 ± 0.805
AspHis: 0.515 ± 0.298
AspIle: 3.605 ± 0.665
AspLys: 4.892 ± 0.901
AspLeu: 5.407 ± 0.78
AspMet: 2.317 ± 0.807
AspAsn: 2.06 ± 1.248
AspPro: 2.832 ± 0.617
AspGln: 1.03 ± 0.412
AspArg: 4.377 ± 0.669
AspSer: 5.922 ± 0.693
AspThr: 3.862 ± 1.437
AspVal: 2.832 ± 1.103
AspTrp: 0.257 ± 0.149
AspTyr: 2.832 ± 0.726
AspXaa: 0.0 ± 0.0

Glu
GluAla: 5.149 ± 1.705
GluCys: 2.06 ± 1.46
GluAsp: 3.862 ± 0.817
GluGlu: 7.209 ± 1.354
GluPhe: 1.287 ± 0.546
GluGly: 3.862 ± 0.618
GluHis: 0.515 ± 0.555
GluIle: 4.892 ± 1.421
GluLys: 4.377 ± 1.004
GluLeu: 5.407 ± 1.653
GluMet: 2.317 ± 0.847
GluAsn: 1.545 ± 0.454
GluPro: 1.802 ± 0.49
GluGln: 1.03 ± 0.351
GluArg: 3.09 ± 0.944
GluSer: 5.922 ± 2.991
GluThr: 6.179 ± 0.837
GluVal: 4.377 ± 0.743
GluTrp: 0.772 ± 0.362
GluTyr: 2.317 ± 0.793
GluXaa: 0.0 ± 0.0

Phe
PheAla: 3.09 ± 0.609
PheCys: 0.515 ± 0.298
PheAsp: 1.802 ± 0.393
PheGlu: 2.06 ± 0.777
PhePhe: 1.802 ± 0.507
PheGly: 2.575 ± 0.573
PheHis: 0.0 ± 0.0
PheIle: 1.545 ± 1.117
PheLys: 2.575 ± 0.903
PheLeu: 3.862 ± 1.527
PheMet: 0.772 ± 0.315
PheAsn: 1.545 ± 0.591
PhePro: 1.545 ± 0.459
PheGln: 2.575 ± 0.919
PheArg: 2.832 ± 0.475
PheSer: 2.317 ± 1.043
PheThr: 1.545 ± 0.79
PheVal: 1.287 ± 0.556
PheTrp: 0.0 ± 0.0
PheTyr: 1.802 ± 0.793
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 3.09 ± 0.918
GlyCys: 0.515 ± 0.298
GlyAsp: 3.605 ± 0.847
GlyGlu: 2.832 ± 0.791
GlyPhe: 1.802 ± 0.683
GlyGly: 3.09 ± 1.846
GlyHis: 1.545 ± 0.529
GlyIle: 3.09 ± 1.662
GlyLys: 4.377 ± 1.382
GlyLeu: 4.634 ± 1.204
GlyMet: 1.545 ± 0.77
GlyAsn: 1.802 ± 0.64
GlyPro: 1.545 ± 0.638
GlyGln: 0.515 ± 0.452
GlyArg: 2.06 ± 0.717
GlySer: 4.377 ± 1.498
GlyThr: 2.832 ± 0.577
GlyVal: 3.347 ± 0.757
GlyTrp: 1.287 ± 0.746
GlyTyr: 3.347 ± 1.33
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.772 ± 0.385
HisCys: 0.0 ± 0.0
HisAsp: 1.287 ± 0.541
HisGlu: 0.257 ± 0.294
HisPhe: 0.772 ± 0.474
HisGly: 0.772 ± 0.265
HisHis: 0.515 ± 0.237
HisIle: 1.287 ± 0.481
HisLys: 0.0 ± 0.0
HisLeu: 2.575 ± 0.895
HisMet: 0.772 ± 0.315
HisAsn: 0.772 ± 0.447
HisPro: 1.03 ± 0.359
HisGln: 0.772 ± 0.362
HisArg: 1.545 ± 0.895
HisSer: 0.515 ± 0.298
HisThr: 0.515 ± 0.237
HisVal: 1.545 ± 0.564
HisTrp: 0.515 ± 0.237
HisTyr: 0.515 ± 0.298
HisXaa: 0.0 ± 0.0

Ile
IleAla: 3.09 ± 1.303
IleCys: 1.03 ± 0.42
IleAsp: 3.09 ± 0.902
IleGlu: 4.119 ± 1.225
IlePhe: 4.119 ± 0.838
IleGly: 3.09 ± 0.612
IleHis: 0.515 ± 0.298
IleIle: 5.407 ± 1.389
IleLys: 3.862 ± 1.082
IleLeu: 4.892 ± 0.549
IleMet: 2.317 ± 0.76
IleAsn: 2.317 ± 1.094
IlePro: 3.605 ± 0.72
IleGln: 1.545 ± 0.616
IleArg: 2.575 ± 1.119
IleSer: 8.239 ± 1.438
IleThr: 3.605 ± 1.166
IleVal: 4.119 ± 0.793
IleTrp: 0.257 ± 0.503
IleTyr: 2.832 ± 0.988
IleXaa: 0.0 ± 0.0

Lys
LysAla: 7.724 ± 3.901
LysCys: 0.515 ± 0.452
LysAsp: 3.605 ± 1.967
LysGlu: 4.377 ± 1.367
LysPhe: 1.545 ± 1.01
LysGly: 4.892 ± 0.829
LysHis: 0.772 ± 0.447
LysIle: 3.605 ± 0.927
LysLys: 6.179 ± 3.107
LysLeu: 6.694 ± 2.206
LysMet: 2.317 ± 0.633
LysAsn: 2.575 ± 0.449
LysPro: 2.06 ± 0.708
LysGln: 0.772 ± 0.419
LysArg: 3.09 ± 0.976
LysSer: 6.179 ± 1.276
LysThr: 4.377 ± 1.342
LysVal: 5.149 ± 1.881
LysTrp: 1.03 ± 0.597
LysTyr: 2.832 ± 1.667
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 6.952 ± 1.092
LeuCys: 1.545 ± 0.529
LeuAsp: 5.922 ± 1.428
LeuGlu: 5.664 ± 1.768
LeuPhe: 4.119 ± 0.568
LeuGly: 4.377 ± 2.247
LeuHis: 1.802 ± 0.868
LeuIle: 6.437 ± 1.469
LeuLys: 4.892 ± 0.941
LeuLeu: 6.179 ± 1.271
LeuMet: 4.892 ± 0.772
LeuAsn: 3.862 ± 0.701
LeuPro: 7.209 ± 1.076
LeuGln: 3.347 ± 0.447
LeuArg: 6.437 ± 1.673
LeuSer: 5.922 ± 0.79
LeuThr: 5.664 ± 1.354
LeuVal: 6.179 ± 0.899
LeuTrp: 1.287 ± 0.746
LeuTyr: 4.377 ± 0.869
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.06 ± 0.646
MetCys: 0.772 ± 0.344
MetAsp: 1.802 ± 0.572
MetGlu: 1.545 ± 0.956
MetPhe: 0.772 ± 0.265
MetGly: 2.06 ± 0.89
MetHis: 0.515 ± 0.295
MetIle: 2.832 ± 1.095
MetLys: 2.06 ± 1.168
MetLeu: 2.317 ± 0.753
MetMet: 0.515 ± 0.48
MetAsn: 1.03 ± 0.576
MetPro: 1.03 ± 0.359
MetGln: 0.0 ± 0.0
MetArg: 2.317 ± 0.638
MetSer: 3.605 ± 1.275
MetThr: 4.119 ± 0.836
MetVal: 1.545 ± 0.364
MetTrp: 0.257 ± 0.294
MetTyr: 0.772 ± 0.385
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.347 ± 0.825
AsnCys: 1.03 ± 1.014
AsnAsp: 1.545 ± 1.073
AsnGlu: 2.06 ± 0.98
AsnPhe: 1.287 ± 0.603
AsnGly: 1.287 ± 0.373
AsnHis: 0.257 ± 0.149
AsnIle: 2.317 ± 0.856
AsnLys: 2.832 ± 1.048
AsnLeu: 4.119 ± 1.24
AsnMet: 0.515 ± 0.298
AsnAsn: 1.287 ± 0.481
AsnPro: 2.575 ± 0.924
AsnGln: 2.317 ± 0.573
AsnArg: 1.802 ± 0.4
AsnSer: 2.06 ± 0.734
AsnThr: 2.06 ± 0.557
AsnVal: 1.802 ± 0.393
AsnTrp: 0.772 ± 0.559
AsnTyr: 1.03 ± 0.256
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 1.545 ± 0.558
ProCys: 0.772 ± 0.512
ProAsp: 2.06 ± 0.933
ProGlu: 4.892 ± 1.336
ProPhe: 1.03 ± 0.351
ProGly: 1.287 ± 0.564
ProHis: 2.06 ± 0.717
ProIle: 3.347 ± 0.84
ProLys: 3.605 ± 1.201
ProLeu: 4.634 ± 0.729
ProMet: 1.545 ± 0.623
ProAsn: 2.317 ± 0.762
ProPro: 1.545 ± 0.616
ProGln: 0.772 ± 0.73
ProArg: 2.575 ± 0.944
ProSer: 3.862 ± 0.633
ProThr: 2.832 ± 0.862
ProVal: 2.575 ± 1.379
ProTrp: 1.03 ± 1.113
ProTyr: 2.06 ± 0.629
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 1.287 ± 0.556
GlnCys: 0.515 ± 0.459
GlnAsp: 1.287 ± 0.54
GlnGlu: 2.06 ± 0.476
GlnPhe: 0.772 ± 0.528
GlnGly: 0.772 ± 0.362
GlnHis: 0.772 ± 0.447
GlnIle: 2.832 ± 0.79
GlnLys: 2.06 ± 0.359
GlnLeu: 3.605 ± 0.512
GlnMet: 0.772 ± 0.265
GlnAsn: 0.257 ± 0.477
GlnPro: 1.545 ± 1.111
GlnGln: 0.515 ± 0.33
GlnArg: 0.772 ± 0.447
GlnSer: 2.832 ± 0.839
GlnThr: 1.545 ± 0.499
GlnVal: 1.287 ± 0.602
GlnTrp: 0.772 ± 0.474
GlnTyr: 1.287 ± 0.527
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 2.317 ± 0.754
ArgCys: 1.802 ± 0.561
ArgAsp: 3.862 ± 0.939
ArgGlu: 5.664 ± 1.945
ArgPhe: 1.802 ± 0.573
ArgGly: 3.09 ± 1.196
ArgHis: 1.03 ± 0.42
ArgIle: 3.862 ± 0.584
ArgLys: 3.347 ± 0.597
ArgLeu: 4.892 ± 1.31
ArgMet: 1.802 ± 1.044
ArgAsn: 1.545 ± 0.249
ArgPro: 1.287 ± 0.481
ArgGln: 1.545 ± 0.616
ArgArg: 3.605 ± 1.098
ArgSer: 4.119 ± 1.117
ArgThr: 3.09 ± 0.741
ArgVal: 2.832 ± 1.256
ArgTrp: 1.287 ± 0.746
ArgTyr: 1.545 ± 0.454
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 3.347 ± 1.082
SerCys: 1.03 ± 0.473
SerAsp: 7.467 ± 1.905
SerGlu: 6.179 ± 1.573
SerPhe: 3.347 ± 1.34
SerGly: 5.922 ± 1.158
SerHis: 1.03 ± 0.908
SerIle: 4.892 ± 0.644
SerLys: 4.892 ± 1.339
SerLeu: 9.011 ± 0.991
SerMet: 2.06 ± 0.687
SerAsn: 3.09 ± 1.151
SerPro: 3.605 ± 0.899
SerGln: 2.832 ± 0.836
SerArg: 3.862 ± 0.65
SerSer: 7.724 ± 2.977
SerThr: 5.407 ± 1.628
SerVal: 5.664 ± 1.177
SerTrp: 1.545 ± 0.652
SerTyr: 3.347 ± 0.783
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 4.634 ± 1.65
ThrCys: 0.257 ± 0.294
ThrAsp: 3.347 ± 0.995
ThrGlu: 4.634 ± 1.376
ThrPhe: 3.347 ± 1.215
ThrGly: 3.09 ± 1.087
ThrHis: 1.287 ± 0.746
ThrIle: 2.832 ± 0.98
ThrLys: 7.467 ± 3.603
ThrLeu: 6.694 ± 1.752
ThrMet: 1.545 ± 0.864
ThrAsn: 2.832 ± 0.807
ThrPro: 2.575 ± 0.372
ThrGln: 1.545 ± 0.454
ThrArg: 3.347 ± 0.529
ThrSer: 5.664 ± 1.667
ThrThr: 3.09 ± 1.537
ThrVal: 3.09 ± 0.684
ThrTrp: 1.287 ± 0.976
ThrTyr: 1.802 ± 0.4
ThrXaa: 0.0 ± 0.0

Val
ValAla: 2.832 ± 1.343
ValCys: 0.0 ± 0.0
ValAsp: 2.06 ± 0.528
ValGlu: 2.832 ± 0.534
ValPhe: 3.09 ± 0.861
ValGly: 1.545 ± 0.396
ValHis: 1.03 ± 0.359
ValIle: 4.119 ± 1.657
ValLys: 4.377 ± 2.185
ValLeu: 5.149 ± 0.844
ValMet: 1.802 ± 0.876
ValAsn: 2.317 ± 0.782
ValPro: 3.605 ± 0.465
ValGln: 1.802 ± 0.653
ValArg: 4.377 ± 0.935
ValSer: 6.952 ± 0.648
ValThr: 4.634 ± 0.716
ValVal: 3.862 ± 0.909
ValTrp: 0.772 ± 0.474
ValTyr: 1.802 ± 0.56
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.515 ± 0.237
TrpCys: 0.0 ± 0.0
TrpAsp: 1.287 ± 0.401
TrpGlu: 1.03 ± 0.412
TrpPhe: 1.287 ± 0.909
TrpGly: 0.515 ± 0.452
TrpHis: 0.0 ± 0.0
TrpIle: 0.772 ± 0.326
TrpLys: 1.03 ± 0.445
TrpLeu: 0.772 ± 0.265
TrpMet: 0.772 ± 0.447
TrpAsn: 0.772 ± 0.265
TrpPro: 0.257 ± 0.477
TrpGln: 0.772 ± 0.447
TrpArg: 1.03 ± 0.473
TrpSer: 1.287 ± 0.478
TrpThr: 1.03 ± 0.485
TrpVal: 1.545 ± 0.862
TrpTrp: 0.257 ± 0.294
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.06 ± 0.309
TyrCys: 1.545 ± 1.025
TyrAsp: 1.545 ± 0.538
TyrGlu: 1.802 ± 0.838
TyrPhe: 0.515 ± 0.324
TyrGly: 3.09 ± 0.958
TyrHis: 1.802 ± 0.773
TyrIle: 2.317 ± 0.615
TyrLys: 2.317 ± 1.016
TyrLeu: 8.239 ± 1.469
TyrMet: 0.0 ± 0.0
TyrAsn: 1.03 ± 0.627
TyrPro: 2.317 ± 0.805
TyrGln: 0.772 ± 0.265
TyrArg: 1.545 ± 0.657
TyrSer: 3.347 ± 1.121
TyrThr: 3.347 ± 0.995
TyrVal: 1.287 ± 0.445
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.287 ± 0.545
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 7 proteins (3885 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
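A protein-level bootstrap of this kind could look roughly like the sketch below. It rests on several assumptions that the note does not confirm: whole proteins are resampled with replacement, the frequency is recomputed for each of the 100 replicates, and the reported error is the standard deviation across replicates. The function names, one-letter residue codes, and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(sequences, pair):
    """Frequency of one dipeptide (in per mille) over a list of protein sequences."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Standard deviation of the dipeptide frequency over bootstrap replicates.
    Each replicate resamples whole proteins with replacement (assumed procedure)."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    replicates = []
    for _ in range(n_boot):
        sample = rng.choices(sequences, k=len(sequences))
        replicates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(replicates)


toy = ["MKKLA", "MKAAL", "AAKKM"]  # hypothetical sequences, not from the proteome
print(pair_per_mille(toy, "KK"), "±", bootstrap_error(toy, "KK"))
```

Resampling at the protein level rather than the residue level keeps each protein intact, so the error reflects how much the estimate depends on which proteins happen to be in the proteome.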

You can download the dipeptide statistics above (among other statistics for this proteome) from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski