Amino acid dipeptide frequency for Salvia divinorum RNA virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
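The counting described above can be sketched in a few lines of Python. This is only an illustration, not the code used by Proteome-pI: the function name `dipeptide_per_mille`, the toy sequences, and the choice to count overlapping pairs within each protein separately are all assumptions of this sketch.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code, in the same order as the
# table columns (Ala, Cys, Asp, ..., Tyr); 'X' stands for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Residues outside the 20 standard ones are mapped to 'X' (Xaa).
    Pairs are counted within each protein only (an assumption of this sketch).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Uniform expectation: 1/400 of all dipeptides, i.e. 0.25% or 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400

toy = ["MKKLA", "AKK"]  # made-up sequences, not taken from this proteome
freqs = dipeptide_per_mille(toy)
overrepresented = sorted(d for d, f in freqs.items() if f > EXPECTED_PER_MILLE)
```

With real proteome sequences in place of `toy`, the dipeptides in `overrepresented` would correspond to the cells marked in red, under the uniform 2.5‰ expectation stated above.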

All values are given in per mille (‰); multiply by 10⁻³ to obtain fractions.
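As a small worked example of the unit conversion, the snippet below turns the GluLys value from the table (9.059‰) into a fraction and compares it with the uniform expectation of 1/400. The helper name `per_mille_to_fraction` is hypothetical.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


glu_lys = per_mille_to_fraction(9.059)  # GluLys value taken from the table
expected = 1 / 400                      # uniform expectation, 0.0025
ratio = glu_lys / expected              # roughly 3.6 times the expectation
```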

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
2.787AlaAla: 2.787 ± 2.439
0.697AlaCys: 0.697 ± 0.356
2.439AlaAsp: 2.439 ± 1.245
2.439AlaGlu: 2.439 ± 1.948
4.181AlaPhe: 4.181 ± 0.983
4.181AlaGly: 4.181 ± 0.954
1.045AlaHis: 1.045 ± 0.534
4.878AlaIle: 4.878 ± 0.989
5.226AlaLys: 5.226 ± 1.748
2.439AlaLeu: 2.439 ± 0.723
0.697AlaMet: 0.697 ± 0.356
3.484AlaAsn: 3.484 ± 0.853
1.394AlaPro: 1.394 ± 0.587
0.697AlaGln: 0.697 ± 0.356
2.787AlaArg: 2.787 ± 0.744
1.394AlaSer: 1.394 ± 0.422
1.045AlaThr: 1.045 ± 0.756
3.136AlaVal: 3.136 ± 1.2
0.0AlaTrp: 0.0 ± 0.0
2.439AlaTyr: 2.439 ± 1.325
0.0AlaXaa: 0.0 ± 0.0
Cys
1.394CysAla: 1.394 ± 1.346
0.348CysCys: 0.348 ± 0.178
2.787CysAsp: 2.787 ± 0.843
1.045CysGlu: 1.045 ± 0.534
2.787CysPhe: 2.787 ± 0.952
1.394CysGly: 1.394 ± 0.758
0.697CysHis: 0.697 ± 0.356
0.348CysIle: 0.348 ± 0.178
2.091CysLys: 2.091 ± 0.796
2.091CysLeu: 2.091 ± 1.067
1.394CysMet: 1.394 ± 0.44
0.697CysAsn: 0.697 ± 0.356
1.742CysPro: 1.742 ± 0.793
1.045CysGln: 1.045 ± 0.534
1.394CysArg: 1.394 ± 0.712
1.742CysSer: 1.742 ± 0.465
2.439CysThr: 2.439 ± 1.245
2.091CysVal: 2.091 ± 0.724
0.348CysTrp: 0.348 ± 0.178
0.697CysTyr: 0.697 ± 0.54
0.0CysXaa: 0.0 ± 0.0
Asp
2.787AspAla: 2.787 ± 0.269
2.787AspCys: 2.787 ± 0.843
4.53AspAsp: 4.53 ± 1.252
5.923AspGlu: 5.923 ± 2.006
4.878AspPhe: 4.878 ± 1.969
4.878AspGly: 4.878 ± 0.75
1.045AspHis: 1.045 ± 0.534
3.833AspIle: 3.833 ± 0.729
3.136AspLys: 3.136 ± 1.854
4.878AspLeu: 4.878 ± 0.426
1.742AspMet: 1.742 ± 0.427
1.045AspAsn: 1.045 ± 0.416
3.136AspPro: 3.136 ± 0.773
1.742AspGln: 1.742 ± 0.889
3.833AspArg: 3.833 ± 1.906
4.181AspSer: 4.181 ± 1.623
1.045AspThr: 1.045 ± 0.45
6.272AspVal: 6.272 ± 1.08
1.045AspTrp: 1.045 ± 0.534
1.394AspTyr: 1.394 ± 0.712
0.0AspXaa: 0.0 ± 0.0
Glu
3.484GluAla: 3.484 ± 0.562
2.787GluCys: 2.787 ± 1.104
4.181GluAsp: 4.181 ± 1.288
6.62GluGlu: 6.62 ± 1.635
3.833GluPhe: 3.833 ± 1.795
2.091GluGly: 2.091 ± 0.628
1.742GluHis: 1.742 ± 1.282
4.53GluIle: 4.53 ± 1.126
9.059GluLys: 9.059 ± 1.63
3.833GluLeu: 3.833 ± 0.704
1.394GluMet: 1.394 ± 1.579
5.226GluAsn: 5.226 ± 1.342
2.091GluPro: 2.091 ± 0.286
1.045GluGln: 1.045 ± 0.871
5.226GluArg: 5.226 ± 1.22
8.711GluSer: 8.711 ± 0.843
1.742GluThr: 1.742 ± 0.865
6.272GluVal: 6.272 ± 0.921
1.045GluTrp: 1.045 ± 0.534
1.394GluTyr: 1.394 ± 0.642
0.0GluXaa: 0.0 ± 0.0
Phe
4.53PheAla: 4.53 ± 1.389
3.136PheCys: 3.136 ± 1.115
7.317PheAsp: 7.317 ± 2.667
6.272PheGlu: 6.272 ± 2.668
6.969PhePhe: 6.969 ± 1.805
3.136PheGly: 3.136 ± 0.77
0.348PheHis: 0.348 ± 0.178
1.394PheIle: 1.394 ± 0.712
6.62PheLys: 6.62 ± 1.181
8.362PheLeu: 8.362 ± 2.343
0.697PheMet: 0.697 ± 0.356
3.136PheAsn: 3.136 ± 0.87
2.091PhePro: 2.091 ± 0.651
1.742PheGln: 1.742 ± 0.465
2.439PheArg: 2.439 ± 1.245
5.923PheSer: 5.923 ± 1.302
2.091PheThr: 2.091 ± 0.286
2.439PheVal: 2.439 ± 0.879
1.045PheTrp: 1.045 ± 0.416
0.697PheTyr: 0.697 ± 1.137
0.0PheXaa: 0.0 ± 0.0
Gly
1.394GlyAla: 1.394 ± 0.712
1.045GlyCys: 1.045 ± 0.534
3.833GlyAsp: 3.833 ± 1.451
3.833GlyGlu: 3.833 ± 1.139
1.742GlyPhe: 1.742 ± 1.442
3.136GlyGly: 3.136 ± 1.115
1.742GlyHis: 1.742 ± 0.526
4.181GlyIle: 4.181 ± 1.127
5.575GlyLys: 5.575 ± 0.537
4.878GlyLeu: 4.878 ± 0.64
0.348GlyMet: 0.348 ± 0.178
2.787GlyAsn: 2.787 ± 0.843
1.742GlyPro: 1.742 ± 0.793
2.091GlyGln: 2.091 ± 0.796
1.394GlyArg: 1.394 ± 0.642
4.53GlySer: 4.53 ± 3.575
2.787GlyThr: 2.787 ± 1.462
2.091GlyVal: 2.091 ± 0.912
1.045GlyTrp: 1.045 ± 0.534
2.091GlyTyr: 2.091 ± 0.796
0.0GlyXaa: 0.0 ± 0.0
His
0.697HisAla: 0.697 ± 0.356
0.697HisCys: 0.697 ± 0.356
1.742HisAsp: 1.742 ± 0.889
1.394HisGlu: 1.394 ± 0.892
1.742HisPhe: 1.742 ± 0.889
0.0HisGly: 0.0 ± 0.0
0.697HisHis: 0.697 ± 0.356
1.045HisIle: 1.045 ± 0.629
1.045HisLys: 1.045 ± 0.756
4.181HisLeu: 4.181 ± 1.502
0.348HisMet: 0.348 ± 0.568
1.742HisAsn: 1.742 ± 0.889
0.697HisPro: 0.697 ± 0.356
1.394HisGln: 1.394 ± 0.712
0.697HisArg: 0.697 ± 0.54
3.136HisSer: 3.136 ± 1.601
0.348HisThr: 0.348 ± 0.178
1.394HisVal: 1.394 ± 0.44
0.348HisTrp: 0.348 ± 0.178
0.348HisTyr: 0.348 ± 0.178
0.0HisXaa: 0.0 ± 0.0
Ile
2.091IleAla: 2.091 ± 2.299
2.439IleCys: 2.439 ± 0.694
7.317IleAsp: 7.317 ± 1.937
6.62IleGlu: 6.62 ± 1.17
2.439IlePhe: 2.439 ± 0.694
2.439IleGly: 2.439 ± 1.258
1.742IleHis: 1.742 ± 0.889
4.53IleIle: 4.53 ± 0.81
5.923IleLys: 5.923 ± 0.748
4.878IleLeu: 4.878 ± 1.389
1.742IleMet: 1.742 ± 0.326
2.787IleAsn: 2.787 ± 1.424
0.697IlePro: 0.697 ± 0.465
0.348IleGln: 0.348 ± 0.178
4.878IleArg: 4.878 ± 2.454
5.226IleSer: 5.226 ± 3.74
1.394IleThr: 1.394 ± 1.224
3.484IleVal: 3.484 ± 0.562
0.348IleTrp: 0.348 ± 0.178
1.394IleTyr: 1.394 ± 0.712
0.0IleXaa: 0.0 ± 0.0
Lys
3.833LysAla: 3.833 ± 1.17
2.439LysCys: 2.439 ± 0.917
3.484LysAsp: 3.484 ± 1.701
5.923LysGlu: 5.923 ± 1.695
4.53LysPhe: 4.53 ± 0.465
2.787LysGly: 2.787 ± 0.586
1.394LysHis: 1.394 ± 0.422
6.969LysIle: 6.969 ± 3.222
6.969LysLys: 6.969 ± 0.968
8.362LysLeu: 8.362 ± 0.7
3.136LysMet: 3.136 ± 0.758
4.878LysAsn: 4.878 ± 0.887
0.697LysPro: 0.697 ± 0.54
1.045LysGln: 1.045 ± 0.534
8.014LysArg: 8.014 ± 2.465
5.923LysSer: 5.923 ± 1.193
3.136LysThr: 3.136 ± 1.601
4.878LysVal: 4.878 ± 2.49
1.742LysTrp: 1.742 ± 1.58
0.348LysTyr: 0.348 ± 0.178
0.0LysXaa: 0.0 ± 0.0
Leu
2.439LeuAla: 2.439 ± 0.917
1.394LeuCys: 1.394 ± 0.44
4.181LeuAsp: 4.181 ± 1.802
4.878LeuGlu: 4.878 ± 1.833
7.666LeuPhe: 7.666 ± 3.248
7.317LeuGly: 7.317 ± 1.234
2.091LeuHis: 2.091 ± 0.286
5.575LeuIle: 5.575 ± 1.295
7.666LeuLys: 7.666 ± 1.83
4.878LeuLeu: 4.878 ± 1.16
2.091LeuMet: 2.091 ± 0.651
7.317LeuAsn: 7.317 ± 0.675
3.484LeuPro: 3.484 ± 0.93
1.394LeuGln: 1.394 ± 0.422
2.439LeuArg: 2.439 ± 0.796
8.711LeuSer: 8.711 ± 1.409
5.923LeuThr: 5.923 ± 2.413
4.181LeuVal: 4.181 ± 0.554
0.348LeuTrp: 0.348 ± 0.178
3.136LeuTyr: 3.136 ± 0.403
0.0LeuXaa: 0.0 ± 0.0
Met
2.091MetAla: 2.091 ± 0.286
1.045MetCys: 1.045 ± 0.534
0.697MetAsp: 0.697 ± 0.465
2.091MetGlu: 2.091 ± 0.628
2.091MetPhe: 2.091 ± 1.028
1.045MetGly: 1.045 ± 0.416
0.348MetHis: 0.348 ± 0.178
1.045MetIle: 1.045 ± 0.534
1.742MetLys: 1.742 ± 1.485
1.045MetLeu: 1.045 ± 1.199
0.348MetMet: 0.348 ± 0.178
3.136MetAsn: 3.136 ± 1.001
0.348MetPro: 0.348 ± 0.178
0.697MetGln: 0.697 ± 0.54
1.045MetArg: 1.045 ± 0.534
2.787MetSer: 2.787 ± 0.9
1.045MetThr: 1.045 ± 0.534
0.697MetVal: 0.697 ± 0.356
0.0MetTrp: 0.0 ± 0.0
0.348MetTyr: 0.348 ± 0.178
0.0MetXaa: 0.0 ± 0.0
Asn
2.787AsnAla: 2.787 ± 3.166
1.742AsnCys: 1.742 ± 0.889
3.484AsnAsp: 3.484 ± 1.091
4.181AsnGlu: 4.181 ± 1.762
5.575AsnPhe: 5.575 ± 1.601
3.484AsnGly: 3.484 ± 0.562
1.394AsnHis: 1.394 ± 0.422
2.787AsnIle: 2.787 ± 1.324
2.091AsnLys: 2.091 ± 0.651
5.923AsnLeu: 5.923 ± 1.367
2.091AsnMet: 2.091 ± 1.067
1.742AsnAsn: 1.742 ± 1.184
2.091AsnPro: 2.091 ± 0.628
1.045AsnGln: 1.045 ± 0.756
3.484AsnArg: 3.484 ± 0.862
4.53AsnSer: 4.53 ± 1.253
1.394AsnThr: 1.394 ± 0.44
2.439AsnVal: 2.439 ± 0.213
1.045AsnTrp: 1.045 ± 0.416
3.136AsnTyr: 3.136 ± 1.115
0.0AsnXaa: 0.0 ± 0.0
Pro
1.394ProAla: 1.394 ± 0.44
0.697ProCys: 0.697 ± 0.54
1.045ProAsp: 1.045 ± 0.629
2.091ProGlu: 2.091 ± 0.651
3.833ProPhe: 3.833 ± 1.451
1.045ProGly: 1.045 ± 0.45
0.697ProHis: 0.697 ± 0.356
2.439ProIle: 2.439 ± 0.854
2.439ProLys: 2.439 ± 0.771
1.742ProLeu: 1.742 ± 0.889
0.697ProMet: 0.697 ± 0.356
2.439ProAsn: 2.439 ± 1.271
0.0ProPro: 0.0 ± 0.0
0.0ProGln: 0.0 ± 0.0
2.091ProArg: 2.091 ± 0.874
1.742ProSer: 1.742 ± 0.465
2.091ProThr: 2.091 ± 0.912
1.742ProVal: 1.742 ± 1.736
0.348ProTrp: 0.348 ± 0.178
0.348ProTyr: 0.348 ± 0.666
0.0ProXaa: 0.0 ± 0.0
Gln
2.439GlnAla: 2.439 ± 0.213
0.0GlnCys: 0.0 ± 0.0
1.394GlnAsp: 1.394 ± 0.712
0.348GlnGlu: 0.348 ± 0.178
1.045GlnPhe: 1.045 ± 0.534
1.742GlnGly: 1.742 ± 0.889
0.697GlnHis: 0.697 ± 0.356
2.439GlnIle: 2.439 ± 1.678
1.394GlnLys: 1.394 ± 0.642
3.484GlnLeu: 3.484 ± 1.165
0.0GlnMet: 0.0 ± 0.0
1.394GlnAsn: 1.394 ± 0.712
0.697GlnPro: 0.697 ± 0.356
0.348GlnGln: 0.348 ± 0.178
1.045GlnArg: 1.045 ± 1.399
2.439GlnSer: 2.439 ± 0.771
1.045GlnThr: 1.045 ± 0.534
1.045GlnVal: 1.045 ± 0.534
0.348GlnTrp: 0.348 ± 0.178
1.742GlnTyr: 1.742 ± 0.526
0.0GlnXaa: 0.0 ± 0.0
Arg
3.136ArgAla: 3.136 ± 2.662
1.742ArgCys: 1.742 ± 1.579
2.091ArgAsp: 2.091 ± 0.651
3.484ArgGlu: 3.484 ± 0.93
4.181ArgPhe: 4.181 ± 1.32
2.439ArgGly: 2.439 ± 0.696
1.045ArgHis: 1.045 ± 0.416
3.484ArgIle: 3.484 ± 1.268
3.136ArgLys: 3.136 ± 1.422
6.969ArgLeu: 6.969 ± 1.337
1.394ArgMet: 1.394 ± 0.44
2.439ArgAsn: 2.439 ± 1.953
1.045ArgPro: 1.045 ± 0.756
1.045ArgGln: 1.045 ± 0.629
2.787ArgArg: 2.787 ± 1.724
6.62ArgSer: 6.62 ± 4.535
2.091ArgThr: 2.091 ± 1.067
3.484ArgVal: 3.484 ± 0.936
0.697ArgTrp: 0.697 ± 0.54
1.742ArgTyr: 1.742 ± 0.889
0.0ArgXaa: 0.0 ± 0.0
Ser
3.136SerAla: 3.136 ± 0.77
2.439SerCys: 2.439 ± 0.771
4.53SerAsp: 4.53 ± 0.906
9.059SerGlu: 9.059 ± 2.784
3.136SerPhe: 3.136 ± 1.601
5.226SerGly: 5.226 ± 0.926
2.439SerHis: 2.439 ± 1.245
4.878SerIle: 4.878 ± 1.436
6.272SerLys: 6.272 ± 1.72
8.362SerLeu: 8.362 ± 2.907
2.091SerMet: 2.091 ± 1.602
4.53SerAsn: 4.53 ± 1.126
1.394SerPro: 1.394 ± 0.422
5.226SerGln: 5.226 ± 0.984
4.53SerArg: 4.53 ± 3.727
6.969SerSer: 6.969 ± 0.361
2.787SerThr: 2.787 ± 0.88
3.833SerVal: 3.833 ± 1.018
0.348SerTrp: 0.348 ± 0.178
2.787SerTyr: 2.787 ± 0.269
0.0SerXaa: 0.0 ± 0.0
Thr
1.045ThrAla: 1.045 ± 0.416
0.697ThrCys: 0.697 ± 0.356
2.439ThrAsp: 2.439 ± 0.213
2.091ThrGlu: 2.091 ± 2.363
5.923ThrPhe: 5.923 ± 2.493
2.439ThrGly: 2.439 ± 1.342
1.394ThrHis: 1.394 ± 0.712
3.484ThrIle: 3.484 ± 1.221
1.742ThrLys: 1.742 ± 1.064
2.439ThrLeu: 2.439 ± 1.245
0.348ThrMet: 0.348 ± 0.178
1.394ThrAsn: 1.394 ± 0.44
0.697ThrPro: 0.697 ± 0.356
0.697ThrGln: 0.697 ± 0.465
2.091ThrArg: 2.091 ± 0.286
3.833ThrSer: 3.833 ± 1.157
0.697ThrThr: 0.697 ± 0.356
3.484ThrVal: 3.484 ± 0.551
0.0ThrTrp: 0.0 ± 0.0
0.0ThrTyr: 0.0 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
2.439ValAla: 2.439 ± 0.213
1.742ValCys: 1.742 ± 0.889
5.923ValAsp: 5.923 ± 1.0
5.923ValGlu: 5.923 ± 0.661
2.787ValPhe: 2.787 ± 0.843
1.394ValGly: 1.394 ± 0.587
2.091ValHis: 2.091 ± 0.563
3.833ValIle: 3.833 ± 0.729
5.226ValLys: 5.226 ± 1.304
3.484ValLeu: 3.484 ± 1.779
1.742ValMet: 1.742 ± 0.427
2.439ValAsn: 2.439 ± 0.796
2.787ValPro: 2.787 ± 1.533
2.439ValGln: 2.439 ± 0.694
1.742ValArg: 1.742 ± 1.034
3.833ValSer: 3.833 ± 1.27
2.439ValThr: 2.439 ± 0.771
4.53ValVal: 4.53 ± 0.635
0.348ValTrp: 0.348 ± 0.178
2.091ValTyr: 2.091 ± 0.651
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.348TrpCys: 0.348 ± 0.178
0.0TrpAsp: 0.0 ± 0.0
0.697TrpGlu: 0.697 ± 0.356
0.0TrpPhe: 0.0 ± 0.0
0.348TrpGly: 0.348 ± 0.666
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.697TrpLys: 0.697 ± 0.356
1.742TrpLeu: 1.742 ± 0.701
0.697TrpMet: 0.697 ± 0.356
0.697TrpAsn: 0.697 ± 0.465
0.348TrpPro: 0.348 ± 0.568
0.0TrpGln: 0.0 ± 0.0
1.742TrpArg: 1.742 ± 0.526
1.045TrpSer: 1.045 ± 0.416
1.045TrpThr: 1.045 ± 0.45
1.045TrpVal: 1.045 ± 0.416
0.0TrpTrp: 0.0 ± 0.0
0.348TrpTyr: 0.348 ± 0.178
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.484TyrAla: 3.484 ± 1.762
0.348TyrCys: 0.348 ± 0.178
0.697TyrAsp: 0.697 ± 0.356
1.394TyrGlu: 1.394 ± 0.712
1.045TyrPhe: 1.045 ± 0.534
1.742TyrGly: 1.742 ± 0.465
1.045TyrHis: 1.045 ± 0.534
1.394TyrIle: 1.394 ± 0.712
2.439TyrLys: 2.439 ± 0.723
3.136TyrLeu: 3.136 ± 0.922
0.348TyrMet: 0.348 ± 0.178
3.136TyrAsn: 3.136 ± 0.954
1.742TyrPro: 1.742 ± 0.889
1.045TyrGln: 1.045 ± 0.534
1.394TyrArg: 1.394 ± 0.712
0.697TyrSer: 0.697 ± 0.356
0.348TyrThr: 0.348 ± 0.178
0.697TyrVal: 0.697 ± 0.356
0.348TyrTrp: 0.348 ± 0.568
0.348TyrTyr: 0.348 ± 0.178
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (2871 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
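One plausible reading of "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the 100 estimates is reported as the error. This is an assumption about the procedure, not the database's actual code; the function names, the fixed seed, and the use of the population standard deviation are choices made for this sketch.

```python
import random
import statistics


def dipeptide_freq(proteins, dipeptide):
    """Per mille frequency of one dipeptide (one-letter codes), overlapping pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, rounds=100, seed=0):
    """Spread of the frequency over `rounds` protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(rounds):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, dipeptide))
    return statistics.pstdev(estimates)


toy = ["MKKLA", "AKK", "GKKKG", "ALLA"]  # made-up sequences
error_kk = bootstrap_error(toy, "KK")
```

With only 4 proteins, as in this proteome, each resample can differ considerably from the original set, which may explain why some errors in the table are as large as the values themselves.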

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski