Amino acid dipeptide frequency for Sweet potato pakakuy virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 standard dipeptides would be present with a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
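The counting described above can be sketched in Python as follows. This is a minimal illustration, not the database's own code: the function name `dipeptide_permille`, the toy sequences, the mapping of every non-standard letter to `X` (Xaa), and the normalization by the total number of dipeptides counted are all assumptions made for the example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping pairs.

    Assumption: frequencies are normalized by the total number of
    dipeptides counted; the database may normalize differently.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2, never spanning two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Hypothetical toy sequences, not taken from this proteome.
toy = ["MAAK", "MKAU"]
freqs = dipeptide_permille(toy)

# Uniform expectation over the 400 standard dipeptides: 2.5 per mille.
uniform_expected = 1000.0 / 400
```

With the toy input, six dipeptides are counted (MA, AA, AK, MK, KA, AX), so each one has a frequency of 1000/6 ‰, far above the uniform expectation simply because the sample is tiny.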

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
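As a small worked example of the unit conversion, the sketch below turns a few per mille values from the table into fractions and percentages and compares them with the uniform expectation of 2.5‰. The helper name `describe` is hypothetical, and using 1/400 as the threshold is an assumption based on the text above; the page does not state exactly how the red and blue marking is computed.

```python
# Uniform expectation over 400 standard dipeptides: 2.5 per mille = 0.25 %.
UNIFORM_PERMILLE = 1000.0 / 400

def describe(permille):
    """Convert a per mille value and compare it with the uniform expectation."""
    fraction = permille * 1e-3   # per mille -> fraction
    percent = fraction * 100     # fraction -> percent
    if permille > UNIFORM_PERMILLE:
        label = "over"
    elif permille < UNIFORM_PERMILLE:
        label = "under"
    else:
        label = "as expected"
    return fraction, percent, label

# Values copied from the table: AlaAla, CysAla, GluGlu.
for name, value in [("AlaAla", 6.48), ("CysAla", 0.0), ("GluGlu", 12.151)]:
    fraction, percent, label = describe(value)
    print(f"{name}: {value} per mille = {fraction:.6f} = {percent:.4f} % ({label})")
```

For instance, the AlaAla value of 6.48‰ corresponds to a fraction of 0.00648, or 0.648%.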

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
6.48AlaAla: 6.48 ± 2.441
0.81AlaCys: 0.81 ± 0.263
2.835AlaAsp: 2.835 ± 1.015
5.265AlaGlu: 5.265 ± 1.497
3.24AlaPhe: 3.24 ± 1.213
1.62AlaGly: 1.62 ± 0.983
1.215AlaHis: 1.215 ± 0.672
3.645AlaIle: 3.645 ± 1.206
1.62AlaLys: 1.62 ± 0.518
6.885AlaLeu: 6.885 ± 1.903
0.81AlaMet: 0.81 ± 0.574
0.405AlaAsn: 0.405 ± 0.698
3.24AlaPro: 3.24 ± 0.637
6.48AlaGln: 6.48 ± 3.188
6.48AlaArg: 6.48 ± 2.488
2.835AlaSer: 2.835 ± 0.881
1.215AlaThr: 1.215 ± 0.522
3.645AlaVal: 3.645 ± 0.637
0.405AlaTrp: 0.405 ± 0.265
2.43AlaTyr: 2.43 ± 0.843
0.405AlaXaa: 0.405 ± 0.265
Cys
0.0CysAla: 0.0 ± 0.0
0.81CysCys: 0.81 ± 0.639
0.0CysAsp: 0.0 ± 0.0
0.81CysGlu: 0.81 ± 0.263
0.405CysPhe: 0.405 ± 0.265
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
1.62CysIle: 1.62 ± 0.824
2.43CysLys: 2.43 ± 0.843
1.215CysLeu: 1.215 ± 0.796
1.62CysMet: 1.62 ± 0.525
1.62CysAsn: 1.62 ± 0.653
1.215CysPro: 1.215 ± 0.422
1.215CysGln: 1.215 ± 0.422
0.405CysArg: 0.405 ± 0.265
0.81CysSer: 0.81 ± 0.263
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.405CysTyr: 0.405 ± 0.68
0.0CysXaa: 0.0 ± 0.0
Asp
2.025AspAla: 2.025 ± 0.651
1.215AspCys: 1.215 ± 0.522
2.43AspAsp: 2.43 ± 0.752
2.43AspGlu: 2.43 ± 1.593
2.025AspPhe: 2.025 ± 0.763
1.215AspGly: 1.215 ± 0.522
0.81AspHis: 0.81 ± 0.263
2.43AspIle: 2.43 ± 1.161
2.835AspLys: 2.835 ± 0.656
7.695AspLeu: 7.695 ± 2.189
0.405AspMet: 0.405 ± 0.265
3.24AspAsn: 3.24 ± 0.646
2.835AspPro: 2.835 ± 1.282
3.645AspGln: 3.645 ± 0.935
3.24AspArg: 3.24 ± 0.791
2.835AspSer: 2.835 ± 1.174
1.62AspThr: 1.62 ± 0.544
2.43AspVal: 2.43 ± 1.137
1.215AspTrp: 1.215 ± 0.528
2.43AspTyr: 2.43 ± 0.843
0.0AspXaa: 0.0 ± 0.0
Glu
6.885GluAla: 6.885 ± 2.291
0.0GluCys: 0.0 ± 0.0
5.67GluAsp: 5.67 ± 1.786
12.151GluGlu: 12.151 ± 2.284
3.645GluPhe: 3.645 ± 1.008
7.695GluGly: 7.695 ± 1.374
3.24GluHis: 3.24 ± 1.213
5.67GluIle: 5.67 ± 0.386
5.67GluLys: 5.67 ± 1.7
3.645GluLeu: 3.645 ± 1.114
2.025GluMet: 2.025 ± 0.651
3.24GluAsn: 3.24 ± 1.101
3.24GluPro: 3.24 ± 1.267
3.24GluGln: 3.24 ± 2.078
6.48GluArg: 6.48 ± 1.861
2.43GluSer: 2.43 ± 0.538
4.05GluThr: 4.05 ± 1.169
5.67GluVal: 5.67 ± 1.492
1.215GluTrp: 1.215 ± 0.528
1.62GluTyr: 1.62 ± 0.811
0.405GluXaa: 0.405 ± 0.265
Phe
1.62PheAla: 1.62 ± 0.443
0.81PheCys: 0.81 ± 0.531
0.405PheAsp: 0.405 ± 0.319
2.025PheGlu: 2.025 ± 1.09
1.62PhePhe: 1.62 ± 0.527
1.62PheGly: 1.62 ± 0.653
0.405PheHis: 0.405 ± 0.319
2.835PheIle: 2.835 ± 0.972
1.62PheLys: 1.62 ± 1.12
2.43PheLeu: 2.43 ± 0.743
1.215PheMet: 1.215 ± 0.422
2.835PheAsn: 2.835 ± 0.636
1.62PhePro: 1.62 ± 0.527
1.215PheGln: 1.215 ± 0.522
2.025PheArg: 2.025 ± 0.651
2.025PheSer: 2.025 ± 1.136
0.81PheThr: 0.81 ± 0.639
0.405PheVal: 0.405 ± 0.319
0.405PheTrp: 0.405 ± 0.319
0.405PheTyr: 0.405 ± 0.319
0.0PheXaa: 0.0 ± 0.0
Gly
4.455GlyAla: 4.455 ± 0.538
1.215GlyCys: 1.215 ± 0.958
3.645GlyAsp: 3.645 ± 1.226
3.645GlyGlu: 3.645 ± 1.155
2.835GlyPhe: 2.835 ± 1.126
3.24GlyGly: 3.24 ± 1.183
0.405GlyHis: 0.405 ± 0.265
3.24GlyIle: 3.24 ± 1.271
4.455GlyLys: 4.455 ± 2.33
6.075GlyLeu: 6.075 ± 1.873
0.81GlyMet: 0.81 ± 0.515
1.215GlyAsn: 1.215 ± 0.522
2.835GlyPro: 2.835 ± 1.116
2.43GlyGln: 2.43 ± 0.752
4.455GlyArg: 4.455 ± 1.483
4.455GlySer: 4.455 ± 1.193
4.455GlyThr: 4.455 ± 1.117
3.645GlyVal: 3.645 ± 2.058
0.405GlyTrp: 0.405 ± 0.319
1.62GlyTyr: 1.62 ± 0.443
0.0GlyXaa: 0.0 ± 0.0
His
0.81HisAla: 0.81 ± 0.531
0.81HisCys: 0.81 ± 0.531
0.0HisAsp: 0.0 ± 0.0
2.835HisGlu: 2.835 ± 0.891
0.0HisPhe: 0.0 ± 0.0
0.405HisGly: 0.405 ± 0.265
0.81HisHis: 0.81 ± 0.531
1.62HisIle: 1.62 ± 0.527
0.405HisLys: 0.405 ± 0.265
1.62HisLeu: 1.62 ± 0.657
1.215HisMet: 1.215 ± 0.422
1.215HisAsn: 1.215 ± 1.113
1.215HisPro: 1.215 ± 0.622
1.62HisGln: 1.62 ± 0.653
2.025HisArg: 2.025 ± 0.835
0.0HisSer: 0.0 ± 0.0
0.405HisThr: 0.405 ± 0.68
0.81HisVal: 0.81 ± 0.263
0.405HisTrp: 0.405 ± 0.319
0.405HisTyr: 0.405 ± 0.265
0.0HisXaa: 0.0 ± 0.0
Ile
3.24IleAla: 3.24 ± 1.256
0.81IleCys: 0.81 ± 0.531
1.62IleAsp: 1.62 ± 0.824
4.455IleGlu: 4.455 ± 0.856
1.62IlePhe: 1.62 ± 0.824
3.645IleGly: 3.645 ± 1.959
0.81IleHis: 0.81 ± 0.531
3.24IleIle: 3.24 ± 1.213
4.05IleLys: 4.05 ± 1.549
4.86IleLeu: 4.86 ± 1.977
1.215IleMet: 1.215 ± 0.422
3.24IleAsn: 3.24 ± 1.226
3.645IlePro: 3.645 ± 1.555
4.86IleGln: 4.86 ± 1.034
6.48IleArg: 6.48 ± 2.44
3.645IleSer: 3.645 ± 0.794
4.455IleThr: 4.455 ± 0.616
4.05IleVal: 4.05 ± 1.415
0.0IleTrp: 0.0 ± 0.0
1.215IleTyr: 1.215 ± 0.528
0.0IleXaa: 0.0 ± 0.0
Lys
3.645LysAla: 3.645 ± 1.064
0.405LysCys: 0.405 ± 0.265
3.24LysAsp: 3.24 ± 1.054
7.29LysGlu: 7.29 ± 2.662
1.215LysPhe: 1.215 ± 0.522
3.645LysGly: 3.645 ± 1.183
1.215LysHis: 1.215 ± 0.522
4.05LysIle: 4.05 ± 2.272
3.645LysLys: 3.645 ± 1.126
5.67LysLeu: 5.67 ± 2.879
2.43LysMet: 2.43 ± 0.79
2.835LysAsn: 2.835 ± 1.067
2.43LysPro: 2.43 ± 0.718
2.43LysGln: 2.43 ± 1.136
4.455LysArg: 4.455 ± 1.036
4.05LysSer: 4.05 ± 1.859
2.43LysThr: 2.43 ± 0.74
2.835LysVal: 2.835 ± 1.04
0.405LysTrp: 0.405 ± 0.319
0.81LysTyr: 0.81 ± 0.531
0.81LysXaa: 0.81 ± 0.263
Leu
3.24LeuAla: 3.24 ± 0.933
0.405LeuCys: 0.405 ± 0.265
4.455LeuAsp: 4.455 ± 0.976
9.316LeuGlu: 9.316 ± 3.383
1.215LeuPhe: 1.215 ± 0.69
6.48LeuGly: 6.48 ± 1.256
2.025LeuHis: 2.025 ± 0.8
5.67LeuIle: 5.67 ± 1.925
9.316LeuLys: 9.316 ± 2.103
9.721LeuLeu: 9.721 ± 1.362
2.025LeuMet: 2.025 ± 0.917
4.455LeuAsn: 4.455 ± 1.236
4.455LeuPro: 4.455 ± 0.538
4.86LeuGln: 4.86 ± 2.616
3.645LeuArg: 3.645 ± 0.501
5.67LeuSer: 5.67 ± 1.473
5.67LeuThr: 5.67 ± 1.866
3.645LeuVal: 3.645 ± 1.183
1.215LeuTrp: 1.215 ± 0.622
3.24LeuTyr: 3.24 ± 1.065
0.0LeuXaa: 0.0 ± 0.0
Met
1.215MetAla: 1.215 ± 0.622
0.405MetCys: 0.405 ± 0.265
0.81MetAsp: 0.81 ± 0.263
4.05MetGlu: 4.05 ± 0.807
0.81MetPhe: 0.81 ± 0.263
1.62MetGly: 1.62 ± 1.062
0.0MetHis: 0.0 ± 0.0
2.43MetIle: 2.43 ± 0.653
1.62MetLys: 1.62 ± 1.277
0.405MetLeu: 0.405 ± 0.319
0.405MetMet: 0.405 ± 0.265
2.43MetAsn: 2.43 ± 0.53
1.215MetPro: 1.215 ± 0.422
0.81MetGln: 0.81 ± 0.606
2.43MetArg: 2.43 ± 0.843
0.81MetSer: 0.81 ± 0.622
2.025MetThr: 2.025 ± 0.457
3.24MetVal: 3.24 ± 1.307
0.405MetTrp: 0.405 ± 0.319
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.24AsnAla: 3.24 ± 1.675
0.405AsnCys: 0.405 ± 0.265
4.455AsnAsp: 4.455 ± 1.583
1.62AsnGlu: 1.62 ± 0.707
1.215AsnPhe: 1.215 ± 0.528
0.81AsnGly: 0.81 ± 0.263
0.405AsnHis: 0.405 ± 0.265
3.24AsnIle: 3.24 ± 1.683
2.025AsnLys: 2.025 ± 0.763
4.455AsnLeu: 4.455 ± 1.165
0.81AsnMet: 0.81 ± 0.263
2.835AsnAsn: 2.835 ± 1.857
2.43AsnPro: 2.43 ± 0.405
2.43AsnGln: 2.43 ± 1.028
2.835AsnArg: 2.835 ± 1.731
3.24AsnSer: 3.24 ± 1.603
3.645AsnThr: 3.645 ± 0.738
2.43AsnVal: 2.43 ± 1.161
1.215AsnTrp: 1.215 ± 0.672
2.835AsnTyr: 2.835 ± 0.619
0.0AsnXaa: 0.0 ± 0.0
Pro
4.455ProAla: 4.455 ± 1.46
0.0ProCys: 0.0 ± 0.0
2.835ProAsp: 2.835 ± 1.473
4.455ProGlu: 4.455 ± 1.117
1.62ProPhe: 1.62 ± 0.527
4.86ProGly: 4.86 ± 1.17
0.405ProHis: 0.405 ± 0.265
2.025ProIle: 2.025 ± 0.763
2.835ProLys: 2.835 ± 0.641
4.455ProLeu: 4.455 ± 2.327
0.405ProMet: 0.405 ± 0.319
2.43ProAsn: 2.43 ± 1.161
4.455ProPro: 4.455 ± 1.06
2.025ProGln: 2.025 ± 0.504
4.86ProArg: 4.86 ± 1.893
4.05ProSer: 4.05 ± 0.749
2.835ProThr: 2.835 ± 0.641
0.81ProVal: 0.81 ± 0.531
0.81ProTrp: 0.81 ± 0.639
1.62ProTyr: 1.62 ± 0.544
0.0ProXaa: 0.0 ± 0.0
Gln
4.455GlnAla: 4.455 ± 2.102
0.405GlnCys: 0.405 ± 0.265
3.645GlnAsp: 3.645 ± 1.064
5.265GlnGlu: 5.265 ± 0.973
1.215GlnPhe: 1.215 ± 0.85
4.455GlnGly: 4.455 ± 0.538
0.81GlnHis: 0.81 ± 0.606
3.645GlnIle: 3.645 ± 1.126
1.62GlnLys: 1.62 ± 1.062
4.455GlnLeu: 4.455 ± 1.337
0.405GlnMet: 0.405 ± 0.265
3.24GlnAsn: 3.24 ± 1.283
3.24GlnPro: 3.24 ± 0.591
5.67GlnGln: 5.67 ± 1.185
4.455GlnArg: 4.455 ± 2.102
2.43GlnSer: 2.43 ± 0.743
2.43GlnThr: 2.43 ± 0.755
3.645GlnVal: 3.645 ± 1.546
0.81GlnTrp: 0.81 ± 0.531
0.81GlnTyr: 0.81 ± 0.639
0.0GlnXaa: 0.0 ± 0.0
Arg
4.455ArgAla: 4.455 ± 1.798
1.215ArgCys: 1.215 ± 0.672
2.835ArgAsp: 2.835 ± 1.955
5.265ArgGlu: 5.265 ± 1.417
2.025ArgPhe: 2.025 ± 0.457
6.885ArgGly: 6.885 ± 3.094
0.405ArgHis: 0.405 ± 0.265
5.265ArgIle: 5.265 ± 1.503
4.05ArgLys: 4.05 ± 0.817
4.86ArgLeu: 4.86 ± 1.286
3.24ArgMet: 3.24 ± 0.521
3.24ArgAsn: 3.24 ± 0.705
4.455ArgPro: 4.455 ± 1.035
4.455ArgGln: 4.455 ± 2.272
11.341ArgArg: 11.341 ± 2.526
4.05ArgSer: 4.05 ± 1.971
4.86ArgThr: 4.86 ± 0.853
4.455ArgVal: 4.455 ± 0.945
2.025ArgTrp: 2.025 ± 0.609
2.025ArgTyr: 2.025 ± 0.922
0.0ArgXaa: 0.0 ± 0.0
Ser
2.43SerAla: 2.43 ± 0.79
0.81SerCys: 0.81 ± 0.263
3.24SerAsp: 3.24 ± 1.267
4.455SerGlu: 4.455 ± 1.196
1.62SerPhe: 1.62 ± 0.527
3.24SerGly: 3.24 ± 1.414
2.025SerHis: 2.025 ± 0.956
1.62SerIle: 1.62 ± 1.07
4.86SerLys: 4.86 ± 2.637
7.695SerLeu: 7.695 ± 1.218
2.43SerMet: 2.43 ± 0.52
2.025SerAsn: 2.025 ± 0.504
3.24SerPro: 3.24 ± 1.257
1.62SerGln: 1.62 ± 0.518
5.67SerArg: 5.67 ± 2.234
3.24SerSer: 3.24 ± 2.505
4.455SerThr: 4.455 ± 1.413
2.025SerVal: 2.025 ± 0.609
1.215SerTrp: 1.215 ± 0.422
0.81SerTyr: 0.81 ± 0.606
0.405SerXaa: 0.405 ± 0.265
Thr
4.455ThrAla: 4.455 ± 0.542
0.81ThrCys: 0.81 ± 0.531
3.645ThrAsp: 3.645 ± 1.155
4.86ThrGlu: 4.86 ± 1.568
0.81ThrPhe: 0.81 ± 0.263
4.05ThrGly: 4.05 ± 2.051
1.62ThrHis: 1.62 ± 1.062
2.835ThrIle: 2.835 ± 1.451
2.835ThrLys: 2.835 ± 0.972
4.455ThrLeu: 4.455 ± 1.887
1.62ThrMet: 1.62 ± 0.74
0.81ThrAsn: 0.81 ± 0.531
4.455ThrPro: 4.455 ± 1.173
3.645ThrGln: 3.645 ± 0.794
2.835ThrArg: 2.835 ± 1.42
2.025ThrSer: 2.025 ± 0.651
2.43ThrThr: 2.43 ± 0.673
2.835ThrVal: 2.835 ± 0.899
0.0ThrTrp: 0.0 ± 0.0
1.62ThrTyr: 1.62 ± 0.443
0.0ThrXaa: 0.0 ± 0.0
Val
2.025ValAla: 2.025 ± 0.651
2.025ValCys: 2.025 ± 0.763
1.62ValAsp: 1.62 ± 0.527
5.265ValGlu: 5.265 ± 2.031
1.215ValPhe: 1.215 ± 0.522
2.43ValGly: 2.43 ± 1.028
1.62ValHis: 1.62 ± 0.653
1.62ValIle: 1.62 ± 0.653
1.62ValLys: 1.62 ± 0.518
6.075ValLeu: 6.075 ± 2.48
1.62ValMet: 1.62 ± 1.21
2.025ValAsn: 2.025 ± 0.504
1.62ValPro: 1.62 ± 0.653
2.43ValGln: 2.43 ± 0.752
3.24ValArg: 3.24 ± 0.843
5.67ValSer: 5.67 ± 0.758
2.43ValThr: 2.43 ± 0.843
0.81ValVal: 0.81 ± 1.129
0.0ValTrp: 0.0 ± 0.0
2.835ValTyr: 2.835 ± 0.99
0.0ValXaa: 0.0 ± 0.0
Trp
0.405TrpAla: 0.405 ± 0.265
0.405TrpCys: 0.405 ± 0.265
0.405TrpAsp: 0.405 ± 0.265
0.81TrpGlu: 0.81 ± 0.622
0.0TrpPhe: 0.0 ± 0.0
0.405TrpGly: 0.405 ± 0.319
0.0TrpHis: 0.0 ± 0.0
1.62TrpIle: 1.62 ± 0.443
1.215TrpLys: 1.215 ± 0.958
0.81TrpLeu: 0.81 ± 0.709
0.0TrpMet: 0.0 ± 0.0
1.62TrpAsn: 1.62 ± 1.062
0.0TrpPro: 0.0 ± 0.0
0.405TrpGln: 0.405 ± 0.265
0.81TrpArg: 0.81 ± 0.531
1.215TrpSer: 1.215 ± 0.528
1.215TrpThr: 1.215 ± 0.958
0.81TrpVal: 0.81 ± 0.263
0.0TrpTrp: 0.0 ± 0.0
0.405TrpTyr: 0.405 ± 0.68
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.43TyrAla: 2.43 ± 0.79
0.81TyrCys: 0.81 ± 0.639
1.215TyrAsp: 1.215 ± 0.554
1.62TyrGlu: 1.62 ± 0.443
0.0TyrPhe: 0.0 ± 0.0
1.215TyrGly: 1.215 ± 0.522
0.405TyrHis: 0.405 ± 0.698
2.835TyrIle: 2.835 ± 0.528
1.215TyrLys: 1.215 ± 0.422
3.24TyrLeu: 3.24 ± 0.591
2.025TyrMet: 2.025 ± 0.904
1.62TyrAsn: 1.62 ± 0.628
0.0TyrPro: 0.0 ± 0.0
1.62TyrGln: 1.62 ± 0.802
2.835TyrArg: 2.835 ± 0.955
2.835TyrSer: 2.835 ± 0.891
1.215TyrThr: 1.215 ± 0.672
0.0TyrVal: 0.0 ± 0.0
0.405TyrTrp: 0.405 ± 0.265
1.215TyrTyr: 1.215 ± 0.422
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.405XaaLeu: 0.405 ± 0.265
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.405XaaPro: 0.405 ± 0.265
0.0XaaGln: 0.0 ± 0.0
0.405XaaArg: 0.405 ± 0.265
0.81XaaSer: 0.81 ± 0.263
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.405XaaXaa: 0.405 ± 0.265
Statistics based on 5 proteins (2470 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
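Bootstrapping at the protein level can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting estimates serves as the error. This is only an illustration under stated assumptions. The function names, the toy sequences, the fixed seed, and the use of the sample standard deviation as the error measure are not taken from the source, which only states that 100 bootstrap rounds were used.

```python
import random
import statistics
from collections import Counter

def pair_permille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over a list of proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Spread of the frequency over protein-level bootstrap resamples.

    Assumption: the error is the sample standard deviation of the
    bootstrap estimates; the database may use a different measure.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)

# Hypothetical toy sequences, not taken from this proteome.
toy = ["MAAKAA", "MKLLA", "MAAL"]
err = bootstrap_error(toy, "AA")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins. With only 5 proteins in this proteome, such errors are necessarily rough.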

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski