Amino acid dipeptide frequency for Sesame yellow mosaic virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 standard amino acids equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
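The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the database's own code: the function names are hypothetical, dipeptides are counted within each protein separately (how the database treats protein boundaries is not stated here), and any residue outside the 20 standard one-letter codes is mapped to X (Xaa).

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 standard amino acids


def dipeptide_permille(proteins):
    """Return the frequency of all 441 dipeptides in per mille.

    Assumption: pairs are counted inside each protein only, and every
    non-standard or unknown residue is replaced by 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


def uniform_expectation_permille(n_letters=20):
    """Frequency each dipeptide would have if all pairs were equally likely."""
    return 1000.0 / (n_letters ** 2)


if __name__ == "__main__":
    freqs = dipeptide_permille(["ACAC", "AB"])  # toy input, 'B' is treated as Xaa
    expected = uniform_expectation_permille()   # 2.5 per mille for 20 letters
    over = sorted(dp for dp, f in freqs.items() if f > expected)
    print(expected, over)
```

Comparing each value with the uniform expectation gives the same over/under split that the red and blue marks in the table express, although the exact threshold used for the colouring is not stated on this page.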

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
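As a small worked example of the unit conversion, the sketch below turns a per mille value into a plain fraction and into a percentage. The helper names are hypothetical; the input value is the ArgArg entry from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value_permille / 10.0


if __name__ == "__main__":
    arg_arg = 12.422  # ArgArg frequency in per mille, taken from the table
    print(permille_to_fraction(arg_arg))  # roughly 0.0124 as a fraction
    print(permille_to_percent(arg_arg))   # roughly 1.24 percent
```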

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
0.887AlaAla: 0.887 ± 0.658
0.887AlaCys: 0.887 ± 0.658
0.887AlaAsp: 0.887 ± 0.941
3.549AlaGlu: 3.549 ± 2.074
1.775AlaPhe: 1.775 ± 1.411
1.775AlaGly: 1.775 ± 0.74
0.887AlaHis: 0.887 ± 0.941
0.887AlaIle: 0.887 ± 0.754
1.775AlaLys: 1.775 ± 1.317
7.098AlaLeu: 7.098 ± 2.477
0.887AlaMet: 0.887 ± 1.017
4.437AlaAsn: 4.437 ± 1.352
6.211AlaPro: 6.211 ± 2.437
1.775AlaGln: 1.775 ± 0.999
4.437AlaArg: 4.437 ± 1.21
0.887AlaSer: 0.887 ± 0.941
1.775AlaThr: 1.775 ± 0.74
1.775AlaVal: 1.775 ± 1.176
0.0AlaTrp: 0.0 ± 0.0
1.775AlaTyr: 1.775 ± 1.176
0.0AlaXaa: 0.0 ± 0.0
Cys
1.775CysAla: 1.775 ± 0.999
0.887CysCys: 0.887 ± 1.03
0.0CysAsp: 0.0 ± 0.0
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
1.775CysIle: 1.775 ± 0.74
0.887CysLys: 0.887 ± 0.941
0.887CysLeu: 0.887 ± 1.149
0.887CysMet: 0.887 ± 0.754
0.887CysAsn: 0.887 ± 0.658
1.775CysPro: 1.775 ± 0.963
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
2.662CysSer: 2.662 ± 1.24
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
1.775CysTrp: 1.775 ± 1.19
0.887CysTyr: 0.887 ± 0.941
0.0CysXaa: 0.0 ± 0.0
Asp
2.662AspAla: 2.662 ± 0.891
0.0AspCys: 0.0 ± 0.0
5.324AspAsp: 5.324 ± 2.03
2.662AspGlu: 2.662 ± 2.26
5.324AspPhe: 5.324 ± 0.781
4.437AspGly: 4.437 ± 1.745
0.0AspHis: 0.0 ± 0.0
4.437AspIle: 4.437 ± 1.652
0.887AspLys: 0.887 ± 0.658
4.437AspLeu: 4.437 ± 1.25
1.775AspMet: 1.775 ± 1.508
1.775AspAsn: 1.775 ± 1.418
0.0AspPro: 0.0 ± 0.0
1.775AspGln: 1.775 ± 1.014
3.549AspArg: 3.549 ± 2.047
1.775AspSer: 1.775 ± 0.963
1.775AspThr: 1.775 ± 1.508
1.775AspVal: 1.775 ± 1.317
5.324AspTrp: 5.324 ± 1.798
0.887AspTyr: 0.887 ± 1.03
0.0AspXaa: 0.0 ± 0.0
Glu
7.098GluAla: 7.098 ± 2.477
2.662GluCys: 2.662 ± 1.24
3.549GluAsp: 3.549 ± 1.261
6.211GluGlu: 6.211 ± 2.373
7.098GluPhe: 7.098 ± 3.056
4.437GluGly: 4.437 ± 2.393
0.0GluHis: 0.0 ± 0.0
3.549GluIle: 3.549 ± 1.762
3.549GluLys: 3.549 ± 1.035
3.549GluLeu: 3.549 ± 1.669
0.0GluMet: 0.0 ± 0.0
2.662GluAsn: 2.662 ± 1.288
4.437GluPro: 4.437 ± 0.979
0.0GluGln: 0.0 ± 0.0
0.887GluArg: 0.887 ± 0.754
0.887GluSer: 0.887 ± 0.658
2.662GluThr: 2.662 ± 1.376
4.437GluVal: 4.437 ± 3.089
0.887GluTrp: 0.887 ± 0.658
0.887GluTyr: 0.887 ± 1.03
0.0GluXaa: 0.0 ± 0.0
Phe
0.887PheAla: 0.887 ± 1.03
0.0PheCys: 0.0 ± 0.0
6.211PheAsp: 6.211 ± 1.804
2.662PheGlu: 2.662 ± 1.24
2.662PhePhe: 2.662 ± 1.826
0.887PheGly: 0.887 ± 1.017
1.775PheHis: 1.775 ± 1.014
4.437PheIle: 4.437 ± 2.385
0.0PheLys: 0.0 ± 0.0
3.549PheLeu: 3.549 ± 1.316
0.0PheMet: 0.0 ± 0.0
1.775PheAsn: 1.775 ± 2.033
2.662PhePro: 2.662 ± 1.686
3.549PheGln: 3.549 ± 1.168
1.775PheArg: 1.775 ± 1.485
4.437PheSer: 4.437 ± 1.044
3.549PheThr: 3.549 ± 1.168
4.437PheVal: 4.437 ± 1.406
0.887PheTrp: 0.887 ± 0.754
1.775PheTyr: 1.775 ± 2.06
0.0PheXaa: 0.0 ± 0.0
Gly
4.437GlyAla: 4.437 ± 1.352
0.0GlyCys: 0.0 ± 0.0
4.437GlyAsp: 4.437 ± 1.695
4.437GlyGlu: 4.437 ± 1.845
2.662GlyPhe: 2.662 ± 1.596
4.437GlyGly: 4.437 ± 1.352
1.775GlyHis: 1.775 ± 1.014
4.437GlyIle: 4.437 ± 1.404
5.324GlyLys: 5.324 ± 1.373
2.662GlyLeu: 2.662 ± 0.899
0.0GlyMet: 0.0 ± 0.0
2.662GlyAsn: 2.662 ± 1.416
2.662GlyPro: 2.662 ± 1.975
2.662GlyGln: 2.662 ± 0.978
1.775GlyArg: 1.775 ± 0.74
2.662GlySer: 2.662 ± 1.289
2.662GlyThr: 2.662 ± 1.289
7.098GlyVal: 7.098 ± 1.727
0.0GlyTrp: 0.0 ± 0.0
0.887GlyTyr: 0.887 ± 0.754
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
1.775HisCys: 1.775 ± 0.999
0.0HisAsp: 0.0 ± 0.0
0.887HisGlu: 0.887 ± 0.658
1.775HisPhe: 1.775 ± 0.74
0.887HisGly: 0.887 ± 0.941
0.887HisHis: 0.887 ± 0.658
0.0HisIle: 0.0 ± 0.0
1.775HisLys: 1.775 ± 1.19
4.437HisLeu: 4.437 ± 1.792
2.662HisMet: 2.662 ± 1.283
4.437HisAsn: 4.437 ± 2.35
1.775HisPro: 1.775 ± 0.999
0.887HisGln: 0.887 ± 0.658
2.662HisArg: 2.662 ± 2.086
0.0HisSer: 0.0 ± 0.0
0.0HisThr: 0.0 ± 0.0
2.662HisVal: 2.662 ± 1.376
1.775HisTrp: 1.775 ± 1.317
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
0.0IleAla: 0.0 ± 0.0
0.887IleCys: 0.887 ± 0.658
1.775IleAsp: 1.775 ± 1.317
4.437IleGlu: 4.437 ± 2.393
3.549IlePhe: 3.549 ± 1.763
4.437IleGly: 4.437 ± 2.077
1.775IleHis: 1.775 ± 1.259
1.775IleIle: 1.775 ± 1.317
4.437IleLys: 4.437 ± 1.341
3.549IleLeu: 3.549 ± 1.165
0.0IleMet: 0.0 ± 0.0
1.775IleAsn: 1.775 ± 1.508
2.662IlePro: 2.662 ± 1.288
7.986IleGln: 7.986 ± 1.99
2.662IleArg: 2.662 ± 1.199
10.648IleSer: 10.648 ± 6.325
5.324IleThr: 5.324 ± 1.929
3.549IleVal: 3.549 ± 1.049
0.0IleTrp: 0.0 ± 0.0
2.662IleTyr: 2.662 ± 0.978
0.0IleXaa: 0.0 ± 0.0
Lys
2.662LysAla: 2.662 ± 1.416
2.662LysCys: 2.662 ± 1.122
3.549LysAsp: 3.549 ± 1.341
7.098LysGlu: 7.098 ± 3.85
1.775LysPhe: 1.775 ± 0.74
3.549LysGly: 3.549 ± 1.058
4.437LysHis: 4.437 ± 2.35
0.887LysIle: 0.887 ± 0.941
2.662LysLys: 2.662 ± 0.891
3.549LysLeu: 3.549 ± 1.155
2.662LysMet: 2.662 ± 1.051
1.775LysAsn: 1.775 ± 0.74
4.437LysPro: 4.437 ± 1.745
2.662LysGln: 2.662 ± 1.921
4.437LysArg: 4.437 ± 2.335
7.986LysSer: 7.986 ± 2.563
2.662LysThr: 2.662 ± 1.181
2.662LysVal: 2.662 ± 0.899
0.0LysTrp: 0.0 ± 0.0
3.549LysTyr: 3.549 ± 1.168
0.0LysXaa: 0.0 ± 0.0
Leu
0.887LeuAla: 0.887 ± 0.658
0.0LeuCys: 0.0 ± 0.0
5.324LeuAsp: 5.324 ± 1.848
3.549LeuGlu: 3.549 ± 1.805
4.437LeuPhe: 4.437 ± 2.454
0.887LeuGly: 0.887 ± 0.754
3.549LeuHis: 3.549 ± 1.957
5.324LeuIle: 5.324 ± 1.549
7.986LeuLys: 7.986 ± 2.386
2.662LeuLeu: 2.662 ± 0.891
3.549LeuMet: 3.549 ± 1.469
6.211LeuAsn: 6.211 ± 2.066
5.324LeuPro: 5.324 ± 3.741
1.775LeuGln: 1.775 ± 1.203
5.324LeuArg: 5.324 ± 1.957
4.437LeuSer: 4.437 ± 2.166
1.775LeuThr: 1.775 ± 1.014
2.662LeuVal: 2.662 ± 1.376
0.0LeuTrp: 0.0 ± 0.0
6.211LeuTyr: 6.211 ± 1.786
0.0LeuXaa: 0.0 ± 0.0
Met
1.775MetAla: 1.775 ± 1.176
0.0MetCys: 0.0 ± 0.0
0.0MetAsp: 0.0 ± 0.0
2.662MetGlu: 2.662 ± 2.271
0.0MetPhe: 0.0 ± 0.0
1.775MetGly: 1.775 ± 1.483
1.775MetHis: 1.775 ± 1.196
0.887MetIle: 0.887 ± 1.017
4.437MetLys: 4.437 ± 1.133
2.662MetLeu: 2.662 ± 1.596
0.887MetMet: 0.887 ± 1.149
0.887MetAsn: 0.887 ± 1.017
2.662MetPro: 2.662 ± 1.342
0.0MetGln: 0.0 ± 0.0
1.775MetArg: 1.775 ± 1.508
1.775MetSer: 1.775 ± 1.196
2.662MetThr: 2.662 ± 1.714
0.0MetVal: 0.0 ± 0.0
0.0MetTrp: 0.0 ± 0.0
1.775MetTyr: 1.775 ± 1.508
0.0MetXaa: 0.0 ± 0.0
Asn
1.775AsnAla: 1.775 ± 0.74
1.775AsnCys: 1.775 ± 1.508
5.324AsnAsp: 5.324 ± 1.798
1.775AsnGlu: 1.775 ± 0.999
2.662AsnPhe: 2.662 ± 1.376
2.662AsnGly: 2.662 ± 1.342
0.0AsnHis: 0.0 ± 0.0
4.437AsnIle: 4.437 ± 1.625
1.775AsnLys: 1.775 ± 0.74
7.098AsnLeu: 7.098 ± 2.057
0.0AsnMet: 0.0 ± 0.0
2.662AsnAsn: 2.662 ± 2.081
7.098AsnPro: 7.098 ± 1.053
1.775AsnGln: 1.775 ± 1.317
1.775AsnArg: 1.775 ± 1.619
4.437AsnSer: 4.437 ± 1.409
0.887AsnThr: 0.887 ± 0.658
3.549AsnVal: 3.549 ± 1.853
0.0AsnTrp: 0.0 ± 0.0
1.775AsnTyr: 1.775 ± 0.74
0.0AsnXaa: 0.0 ± 0.0
Pro
2.662ProAla: 2.662 ± 2.258
1.775ProCys: 1.775 ± 1.411
0.887ProAsp: 0.887 ± 0.658
3.549ProGlu: 3.549 ± 1.397
0.0ProPhe: 0.0 ± 0.0
4.437ProGly: 4.437 ± 1.352
1.775ProHis: 1.775 ± 1.317
5.324ProIle: 5.324 ± 2.554
6.211ProLys: 6.211 ± 3.818
3.549ProLeu: 3.549 ± 1.992
1.775ProMet: 1.775 ± 1.196
3.549ProAsn: 3.549 ± 1.422
2.662ProPro: 2.662 ± 0.899
4.437ProGln: 4.437 ± 3.649
5.324ProArg: 5.324 ± 1.857
6.211ProSer: 6.211 ± 1.962
1.775ProThr: 1.775 ± 1.19
5.324ProVal: 5.324 ± 2.883
0.0ProTrp: 0.0 ± 0.0
1.775ProTyr: 1.775 ± 1.508
0.0ProXaa: 0.0 ± 0.0
Gln
5.324GlnAla: 5.324 ± 1.13
0.0GlnCys: 0.0 ± 0.0
0.0GlnAsp: 0.0 ± 0.0
2.662GlnGlu: 2.662 ± 1.883
1.775GlnPhe: 1.775 ± 1.317
1.775GlnGly: 1.775 ± 0.999
0.887GlnHis: 0.887 ± 0.658
3.549GlnIle: 3.549 ± 2.099
0.887GlnLys: 0.887 ± 0.658
1.775GlnLeu: 1.775 ± 1.014
0.887GlnMet: 0.887 ± 0.941
1.775GlnAsn: 1.775 ± 1.317
1.775GlnPro: 1.775 ± 1.881
3.549GlnGln: 3.549 ± 1.999
0.887GlnArg: 0.887 ± 1.017
2.662GlnSer: 2.662 ± 1.407
2.662GlnThr: 2.662 ± 1.181
2.662GlnVal: 2.662 ± 0.899
0.887GlnTrp: 0.887 ± 1.017
2.662GlnTyr: 2.662 ± 1.375
0.0GlnXaa: 0.0 ± 0.0
Arg
1.775ArgAla: 1.775 ± 0.963
0.0ArgCys: 0.0 ± 0.0
2.662ArgAsp: 2.662 ± 1.342
6.211ArgGlu: 6.211 ± 1.857
5.324ArgPhe: 5.324 ± 3.283
2.662ArgGly: 2.662 ± 0.899
2.662ArgHis: 2.662 ± 1.376
3.549ArgIle: 3.549 ± 1.397
1.775ArgLys: 1.775 ± 1.176
2.662ArgLeu: 2.662 ± 1.376
0.887ArgMet: 0.887 ± 0.727
4.437ArgAsn: 4.437 ± 2.056
1.775ArgPro: 1.775 ± 0.963
0.0ArgGln: 0.0 ± 0.0
12.422ArgArg: 12.422 ± 5.189
8.873ArgSer: 8.873 ± 2.51
4.437ArgThr: 4.437 ± 1.404
2.662ArgVal: 2.662 ± 0.891
0.887ArgTrp: 0.887 ± 1.017
0.0ArgTyr: 0.0 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
1.775SerAla: 1.775 ± 0.963
0.0SerCys: 0.0 ± 0.0
4.437SerAsp: 4.437 ± 0.979
2.662SerGlu: 2.662 ± 2.078
0.887SerPhe: 0.887 ± 0.658
5.324SerGly: 5.324 ± 3.748
1.775SerHis: 1.775 ± 1.259
6.211SerIle: 6.211 ± 3.593
7.098SerLys: 7.098 ± 3.662
7.098SerLeu: 7.098 ± 1.707
1.775SerMet: 1.775 ± 1.485
2.662SerAsn: 2.662 ± 1.051
4.437SerPro: 4.437 ± 1.256
0.887SerGln: 0.887 ± 0.941
6.211SerArg: 6.211 ± 2.25
9.76SerSer: 9.76 ± 4.055
9.76SerThr: 9.76 ± 3.461
2.662SerVal: 2.662 ± 1.596
1.775SerTrp: 1.775 ± 0.74
4.437SerTyr: 4.437 ± 1.489
0.0SerXaa: 0.0 ± 0.0
Thr
3.549ThrAla: 3.549 ± 0.939
0.887ThrCys: 0.887 ± 1.149
2.662ThrAsp: 2.662 ± 0.978
0.887ThrGlu: 0.887 ± 1.149
2.662ThrPhe: 2.662 ± 0.978
7.098ThrGly: 7.098 ± 2.72
2.662ThrHis: 2.662 ± 1.181
4.437ThrIle: 4.437 ± 1.865
0.887ThrLys: 0.887 ± 0.754
2.662ThrLeu: 2.662 ± 1.407
4.437ThrMet: 4.437 ± 2.853
1.775ThrAsn: 1.775 ± 0.74
5.324ThrPro: 5.324 ± 1.861
0.0ThrGln: 0.0 ± 0.0
0.887ThrArg: 0.887 ± 0.658
3.549ThrSer: 3.549 ± 2.251
0.887ThrThr: 0.887 ± 1.017
1.775ThrVal: 1.775 ± 1.071
1.775ThrTrp: 1.775 ± 1.19
4.437ThrTyr: 4.437 ± 1.534
0.0ThrXaa: 0.0 ± 0.0
Val
2.662ValAla: 2.662 ± 0.899
0.887ValCys: 0.887 ± 0.754
2.662ValAsp: 2.662 ± 1.288
0.887ValGlu: 0.887 ± 0.658
1.775ValPhe: 1.775 ± 1.014
1.775ValGly: 1.775 ± 1.418
1.775ValHis: 1.775 ± 1.071
5.324ValIle: 5.324 ± 1.032
4.437ValLys: 4.437 ± 2.056
4.437ValLeu: 4.437 ± 1.406
2.662ValMet: 2.662 ± 1.288
3.549ValAsn: 3.549 ± 2.393
3.549ValPro: 3.549 ± 2.508
2.662ValGln: 2.662 ± 0.891
6.211ValArg: 6.211 ± 3.446
1.775ValSer: 1.775 ± 1.014
1.775ValThr: 1.775 ± 1.071
3.549ValVal: 3.549 ± 2.236
0.887ValTrp: 0.887 ± 0.754
1.775ValTyr: 1.775 ± 1.176
0.0ValXaa: 0.0 ± 0.0
Trp
0.887TrpAla: 0.887 ± 0.658
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
1.775TrpGlu: 1.775 ± 1.19
0.0TrpPhe: 0.0 ± 0.0
0.887TrpGly: 0.887 ± 0.658
0.0TrpHis: 0.0 ± 0.0
0.887TrpIle: 0.887 ± 0.754
1.775TrpLys: 1.775 ± 0.74
0.887TrpLeu: 0.887 ± 0.658
1.775TrpMet: 1.775 ± 1.19
0.887TrpAsn: 0.887 ± 0.941
0.0TrpPro: 0.0 ± 0.0
1.775TrpGln: 1.775 ± 1.014
0.0TrpArg: 0.0 ± 0.0
0.887TrpSer: 0.887 ± 0.754
2.662TrpThr: 2.662 ± 0.899
0.887TrpVal: 0.887 ± 0.754
0.0TrpTrp: 0.0 ± 0.0
0.887TrpTyr: 0.887 ± 1.017
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.887TyrAla: 0.887 ± 0.754
0.0TyrCys: 0.0 ± 0.0
0.887TyrAsp: 0.887 ± 0.754
0.887TyrGlu: 0.887 ± 0.754
1.775TyrPhe: 1.775 ± 1.19
3.549TyrGly: 3.549 ± 1.165
0.887TyrHis: 0.887 ± 0.941
1.775TyrIle: 1.775 ± 1.317
7.098TyrLys: 7.098 ± 2.458
2.662TyrLeu: 2.662 ± 1.921
0.0TyrMet: 0.0 ± 0.952
2.662TyrAsn: 2.662 ± 1.975
1.775TyrPro: 1.775 ± 0.963
0.887TyrGln: 0.887 ± 0.658
2.662TyrArg: 2.662 ± 1.882
5.324TyrSer: 5.324 ± 1.994
3.549TyrThr: 3.549 ± 1.168
0.887TyrVal: 0.887 ± 1.03
0.0TyrTrp: 0.0 ± 0.0
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1128 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
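A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration, not the procedure actually used by the database: it assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each replicate, and that the reported error is the standard deviation across replicates. The function names and the fixed random seed are hypothetical.

```python
import random
import statistics


def permille(proteins, dipeptide):
    """Frequency of one dipeptide (two-letter string) in per mille."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Standard deviation of the frequency over bootstrap replicates.

    Each replicate draws len(proteins) whole proteins with replacement,
    so the resampling unit is the protein, not the residue.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(replicates):
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(permille(sample, dipeptide))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy_proteome = ["ACAC", "GGGG", "ACGG"]  # toy sequences, not real data
    print(permille(toy_proteome, "AC"), bootstrap_error(toy_proteome, "AC"))
```

Under this scheme a dipeptide that never occurs in any protein has frequency 0 in every replicate, so its estimated error is also 0, which matches the many "0.0 ± 0.0" entries in the table.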

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski