Amino acid dipeptide frequency for Wheat dwarf virus (isolate Sweden) (WDV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins uniformly at random, each of the 400 dipeptides would be present with a frequency of 0.25%. As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red in the table, and underrepresented ones are marked in blue.
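The calculation described above can be sketched in Python as follows. This is a minimal illustration, not the database's actual pipeline: it assumes overlapping dipeptides are counted within each protein only, that any non-standard letter is mapped to Xaa, and that the red/blue marking compares each value with the uniform expectation. The function names `dipeptide_permille` and `classify` and the toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes
UNIFORM_PERMILLE = 1000 / len(STANDARD) ** 2  # 1/400 = 0.25% = 2.5 per mille


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all given proteins.

    Assumption: overlapping pairs are counted inside each protein only;
    any non-standard letter is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {a + b: (1000 * counts[a + b] / total if total else 0.0)
            for a, b in product(alphabet, repeat=2)}


def classify(freq_permille):
    """Label a frequency relative to the uniform expectation (assumed threshold)."""
    if freq_permille > UNIFORM_PERMILLE:
        return "over"   # would be marked red
    if freq_permille < UNIFORM_PERMILLE:
        return "under"  # would be marked blue
    return "expected"


# Toy input, not real WDV sequences
freqs = dipeptide_permille(["MSSAL", "ASSV"])
print(len(freqs), round(freqs["SS"], 3), classify(freqs["SS"]))
# prints: 441 285.714 over
```

The returned dictionary always has 441 entries (21 × 21), so dipeptides that never occur appear with a frequency of 0.0, as in the table.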

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
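As a small worked example of this unit conversion, the helpers below turn a per mille value into a plain fraction or a percentage. The function names are hypothetical and only illustrate the arithmetic.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value (as listed in the table) to a plain fraction."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (1 per mille = 0.1%)."""
    return value_permille / 10


# SerSer is listed as 13.485 per mille in the table
print(permille_to_fraction(13.485))
print(permille_to_percent(13.485))
```

On this scale, the uniform expectation of 0.25% corresponds to 2.5‰.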

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.149 ± 1.681
AlaCys: 1.037 ± 0.824
AlaAsp: 4.149 ± 1.245
AlaGlu: 3.112 ± 0.427
AlaPhe: 1.037 ± 0.824
AlaGly: 1.037 ± 0.953
AlaHis: 0.0 ± 0.0
AlaIle: 2.075 ± 1.541
AlaLys: 4.149 ± 1.276
AlaLeu: 6.224 ± 0.855
AlaMet: 1.037 ± 0.824
AlaAsn: 1.037 ± 0.824
AlaPro: 4.149 ± 0.9
AlaGln: 3.112 ± 0.427
AlaArg: 4.149 ± 1.276
AlaSer: 8.299 ± 1.478
AlaThr: 3.112 ± 0.427
AlaVal: 6.224 ± 0.812
AlaTrp: 3.112 ± 2.641
AlaTyr: 3.112 ± 2.157
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.037 ± 0.824
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 2.075 ± 0.795
CysGly: 1.037 ± 0.782
CysHis: 2.075 ± 0.84
CysIle: 1.037 ± 0.782
CysLys: 1.037 ± 1.205
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 4.149 ± 1.681
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 0.0 ± 0.0
CysThr: 3.112 ± 1.618
CysVal: 1.037 ± 0.824
CysTrp: 0.0 ± 0.0
CysTyr: 2.075 ± 1.648
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.224 ± 2.6
AspCys: 2.075 ± 0.84
AspAsp: 2.075 ± 0.84
AspGlu: 1.037 ± 0.782
AspPhe: 4.149 ± 1.993
AspGly: 4.149 ± 1.008
AspHis: 0.0 ± 0.0
AspIle: 3.112 ± 1.418
AspLys: 0.0 ± 0.0
AspLeu: 3.112 ± 1.468
AspMet: 3.112 ± 1.314
AspAsn: 1.037 ± 0.824
AspPro: 2.075 ± 0.84
AspGln: 3.112 ± 1.468
AspArg: 3.112 ± 0.427
AspSer: 6.224 ± 0.855
AspThr: 2.075 ± 0.84
AspVal: 5.187 ± 1.46
AspTrp: 1.037 ± 0.782
AspTyr: 6.224 ± 2.089
AspXaa: 0.0 ± 0.0
Glu
GluAla: 0.0 ± 0.0
GluCys: 2.075 ± 0.84
GluAsp: 4.149 ± 1.993
GluGlu: 4.149 ± 0.9
GluPhe: 4.149 ± 1.681
GluGly: 2.075 ± 0.795
GluHis: 2.075 ± 0.84
GluIle: 1.037 ± 1.205
GluLys: 0.0 ± 0.0
GluLeu: 2.075 ± 0.84
GluMet: 0.0 ± 0.0
GluAsn: 0.0 ± 0.0
GluPro: 5.187 ± 1.048
GluGln: 3.112 ± 1.045
GluArg: 1.037 ± 0.953
GluSer: 9.336 ± 2.678
GluThr: 3.112 ± 1.314
GluVal: 2.075 ± 0.84
GluTrp: 6.224 ± 1.863
GluTyr: 2.075 ± 0.84
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.075 ± 0.795
PheCys: 0.0 ± 0.0
PheAsp: 1.037 ± 0.824
PheGlu: 4.149 ± 1.681
PhePhe: 1.037 ± 0.824
PheGly: 1.037 ± 1.205
PheHis: 2.075 ± 0.84
PheIle: 2.075 ± 0.795
PheLys: 3.112 ± 1.418
PheLeu: 5.187 ± 1.464
PheMet: 0.0 ± 0.0
PheAsn: 1.037 ± 0.824
PhePro: 6.224 ± 2.521
PheGln: 0.0 ± 0.0
PheArg: 3.112 ± 0.427
PheSer: 2.075 ± 0.84
PheThr: 4.149 ± 2.052
PheVal: 6.224 ± 1.933
PheTrp: 0.0 ± 0.0
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.112 ± 1.418
GlyCys: 0.0 ± 0.0
GlyAsp: 4.149 ± 2.879
GlyGlu: 3.112 ± 1.345
GlyPhe: 0.0 ± 0.0
GlyGly: 3.112 ± 0.427
GlyHis: 0.0 ± 0.0
GlyIle: 3.112 ± 0.427
GlyLys: 7.261 ± 3.583
GlyLeu: 4.149 ± 2.485
GlyMet: 0.0 ± 0.0
GlyAsn: 2.075 ± 1.541
GlyPro: 2.075 ± 0.795
GlyGln: 2.075 ± 1.907
GlyArg: 5.187 ± 0.849
GlySer: 4.149 ± 0.9
GlyThr: 4.149 ± 0.739
GlyVal: 2.075 ± 1.541
GlyTrp: 0.0 ± 0.0
GlyTyr: 1.037 ± 1.205
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.037 ± 0.824
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 4.149 ± 1.681
HisPhe: 0.0 ± 0.0
HisGly: 0.0 ± 0.0
HisHis: 0.0 ± 0.0
HisIle: 0.0 ± 0.0
HisLys: 1.037 ± 0.782
HisLeu: 6.224 ± 2.521
HisMet: 0.0 ± 0.0
HisAsn: 1.037 ± 0.782
HisPro: 2.075 ± 0.84
HisGln: 0.0 ± 0.0
HisArg: 2.075 ± 1.021
HisSer: 0.0 ± 0.0
HisThr: 1.037 ± 0.824
HisVal: 2.075 ± 0.84
HisTrp: 1.037 ± 0.824
HisTyr: 1.037 ± 0.782
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.149 ± 2.093
IleCys: 3.112 ± 1.314
IleAsp: 1.037 ± 0.782
IleGlu: 4.149 ± 1.681
IlePhe: 3.112 ± 0.427
IleGly: 4.149 ± 2.879
IleHis: 0.0 ± 0.0
IleIle: 4.149 ± 1.993
IleLys: 1.037 ± 0.824
IleLeu: 3.112 ± 1.045
IleMet: 0.0 ± 0.0
IleAsn: 1.037 ± 0.782
IlePro: 3.112 ± 1.712
IleGln: 4.149 ± 1.008
IleArg: 4.149 ± 1.276
IleSer: 2.075 ± 1.021
IleThr: 5.187 ± 1.048
IleVal: 2.075 ± 1.648
IleTrp: 0.0 ± 0.0
IleTyr: 1.037 ± 0.782
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.112 ± 2.157
LysCys: 1.037 ± 0.782
LysAsp: 9.336 ± 1.618
LysGlu: 2.075 ± 0.84
LysPhe: 3.112 ± 1.418
LysGly: 4.149 ± 3.296
LysHis: 0.0 ± 0.0
LysIle: 1.037 ± 0.824
LysLys: 2.075 ± 1.541
LysLeu: 2.075 ± 0.84
LysMet: 1.037 ± 0.824
LysAsn: 3.112 ± 0.427
LysPro: 1.037 ± 0.824
LysGln: 5.187 ± 1.374
LysArg: 4.149 ± 2.179
LysSer: 1.037 ± 0.782
LysThr: 2.075 ± 0.795
LysVal: 3.112 ± 1.418
LysTrp: 1.037 ± 0.824
LysTyr: 6.224 ± 2.629
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 1.037 ± 1.205
LeuCys: 2.075 ± 1.021
LeuAsp: 2.075 ± 0.84
LeuGlu: 4.149 ± 2.52
LeuPhe: 5.187 ± 1.464
LeuGly: 1.037 ± 0.782
LeuHis: 4.149 ± 1.681
LeuIle: 6.224 ± 2.38
LeuLys: 4.149 ± 1.276
LeuLeu: 4.149 ± 0.9
LeuMet: 1.037 ± 0.93
LeuAsn: 6.224 ± 0.812
LeuPro: 4.149 ± 2.485
LeuGln: 0.0 ± 0.0
LeuArg: 7.261 ± 1.852
LeuSer: 2.075 ± 0.84
LeuThr: 2.075 ± 0.84
LeuVal: 7.261 ± 1.246
LeuTrp: 0.0 ± 0.0
LeuTyr: 7.261 ± 1.311
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.075 ± 0.84
MetCys: 0.0 ± 0.0
MetAsp: 3.112 ± 1.618
MetGlu: 2.075 ± 1.541
MetPhe: 0.0 ± 0.0
MetGly: 0.0 ± 0.0
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 2.075 ± 0.84
MetLeu: 0.0 ± 0.0
MetMet: 0.0 ± 0.0
MetAsn: 1.037 ± 0.824
MetPro: 2.075 ± 1.648
MetGln: 0.0 ± 0.0
MetArg: 0.0 ± 0.0
MetSer: 1.037 ± 0.824
MetThr: 1.037 ± 0.782
MetVal: 3.112 ± 0.427
MetTrp: 0.0 ± 0.0
MetTyr: 2.075 ± 1.565
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.112 ± 1.314
AsnCys: 0.0 ± 0.0
AsnAsp: 0.0 ± 0.0
AsnGlu: 2.075 ± 0.84
AsnPhe: 0.0 ± 0.0
AsnGly: 1.037 ± 1.205
AsnHis: 0.0 ± 0.0
AsnIle: 6.224 ± 1.502
AsnLys: 6.224 ± 0.855
AsnLeu: 4.149 ± 1.008
AsnMet: 0.0 ± 0.0
AsnAsn: 1.037 ± 0.824
AsnPro: 4.149 ± 1.993
AsnGln: 3.112 ± 0.427
AsnArg: 3.112 ± 0.427
AsnSer: 2.075 ± 0.795
AsnThr: 6.224 ± 1.052
AsnVal: 0.0 ± 0.0
AsnTrp: 3.112 ± 1.418
AsnTyr: 2.075 ± 1.565
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.112 ± 1.468
ProCys: 1.037 ± 0.782
ProAsp: 4.149 ± 0.739
ProGlu: 6.224 ± 2.521
ProPhe: 6.224 ± 1.368
ProGly: 5.187 ± 1.464
ProHis: 2.075 ± 0.84
ProIle: 1.037 ± 1.205
ProLys: 2.075 ± 0.795
ProLeu: 3.112 ± 1.045
ProMet: 0.0 ± 0.0
ProAsn: 5.187 ± 2.063
ProPro: 2.075 ± 1.541
ProGln: 2.075 ± 0.84
ProArg: 4.149 ± 0.9
ProSer: 3.112 ± 1.468
ProThr: 3.112 ± 1.314
ProVal: 4.149 ± 1.008
ProTrp: 1.037 ± 0.824
ProTyr: 0.0 ± 0.0
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 6.224 ± 1.458
GlnCys: 0.0 ± 0.0
GlnAsp: 0.0 ± 0.0
GlnGlu: 1.037 ± 0.953
GlnPhe: 0.0 ± 0.0
GlnGly: 4.149 ± 1.973
GlnHis: 0.0 ± 0.0
GlnIle: 2.075 ± 0.84
GlnLys: 0.0 ± 0.0
GlnLeu: 0.0 ± 0.0
GlnMet: 1.037 ± 0.803
GlnAsn: 2.075 ± 0.84
GlnPro: 1.037 ± 0.953
GlnGln: 1.037 ± 1.205
GlnArg: 4.149 ± 1.861
GlnSer: 2.075 ± 0.84
GlnThr: 2.075 ± 1.541
GlnVal: 2.075 ± 0.84
GlnTrp: 0.0 ± 0.0
GlnTyr: 3.112 ± 1.045
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.112 ± 0.427
ArgCys: 0.0 ± 0.0
ArgAsp: 6.224 ± 1.757
ArgGlu: 4.149 ± 1.681
ArgPhe: 5.187 ± 1.048
ArgGly: 4.149 ± 1.008
ArgHis: 4.149 ± 0.739
ArgIle: 0.0 ± 0.0
ArgLys: 7.261 ± 2.268
ArgLeu: 5.187 ± 1.263
ArgMet: 3.112 ± 0.896
ArgAsn: 0.0 ± 0.0
ArgPro: 0.0 ± 0.0
ArgGln: 0.0 ± 0.0
ArgArg: 2.075 ± 1.541
ArgSer: 2.075 ± 0.84
ArgThr: 6.224 ± 1.933
ArgVal: 4.149 ± 1.008
ArgTrp: 1.037 ± 0.824
ArgTyr: 2.075 ± 1.648
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.187 ± 1.357
SerCys: 0.0 ± 0.0
SerAsp: 7.261 ± 0.73
SerGlu: 1.037 ± 0.782
SerPhe: 1.037 ± 0.782
SerGly: 2.075 ± 0.795
SerHis: 2.075 ± 1.021
SerIle: 8.299 ± 2.442
SerLys: 3.112 ± 0.427
SerLeu: 7.261 ± 2.863
SerMet: 2.075 ± 0.84
SerAsn: 1.037 ± 0.782
SerPro: 6.224 ± 2.521
SerGln: 1.037 ± 0.782
SerArg: 4.149 ± 1.245
SerSer: 13.485 ± 2.224
SerThr: 7.261 ± 2.363
SerVal: 1.037 ± 0.953
SerTrp: 3.112 ± 1.314
SerTyr: 1.037 ± 0.953
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.224 ± 1.368
ThrCys: 2.075 ± 1.648
ThrAsp: 2.075 ± 1.648
ThrGlu: 4.149 ± 0.9
ThrPhe: 5.187 ± 2.106
ThrGly: 3.112 ± 1.418
ThrHis: 2.075 ± 0.795
ThrIle: 2.075 ± 0.84
ThrLys: 2.075 ± 0.84
ThrLeu: 5.187 ± 1.464
ThrMet: 1.037 ± 0.824
ThrAsn: 5.187 ± 1.791
ThrPro: 10.373 ± 1.879
ThrGln: 0.0 ± 0.0
ThrArg: 2.075 ± 0.795
ThrSer: 5.187 ± 2.396
ThrThr: 2.075 ± 0.795
ThrVal: 6.224 ± 2.836
ThrTrp: 1.037 ± 0.824
ThrTyr: 5.187 ± 2.063
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.187 ± 1.464
ValCys: 4.149 ± 0.9
ValAsp: 6.224 ± 1.502
ValGlu: 0.0 ± 0.0
ValPhe: 2.075 ± 1.541
ValGly: 5.187 ± 2.538
ValHis: 1.037 ± 0.782
ValIle: 1.037 ± 0.782
ValLys: 2.075 ± 1.648
ValLeu: 3.112 ± 0.427
ValMet: 0.0 ± 0.0
ValAsn: 6.224 ± 2.224
ValPro: 1.037 ± 0.824
ValGln: 3.112 ± 0.427
ValArg: 5.187 ± 1.048
ValSer: 3.112 ± 1.618
ValThr: 6.224 ± 3.283
ValVal: 6.224 ± 3.78
ValTrp: 0.0 ± 0.0
ValTyr: 4.149 ± 1.008
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.037 ± 0.782
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.0 ± 0.0
TrpGly: 2.075 ± 0.84
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 4.149 ± 0.739
TrpLeu: 4.149 ± 1.861
TrpMet: 2.075 ± 1.021
TrpAsn: 1.037 ± 0.824
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 1.037 ± 1.205
TrpSer: 3.112 ± 0.427
TrpThr: 2.075 ± 1.648
TrpVal: 1.037 ± 0.824
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.112 ± 1.045
TyrCys: 0.0 ± 0.0
TyrAsp: 2.075 ± 0.795
TyrGlu: 3.112 ± 1.314
TyrPhe: 1.037 ± 0.824
TyrGly: 2.075 ± 1.257
TyrHis: 1.037 ± 0.824
TyrIle: 6.224 ± 2.521
TyrLys: 3.112 ± 2.472
TyrLeu: 3.112 ± 0.427
TyrMet: 3.112 ± 1.345
TyrAsn: 4.149 ± 1.591
TyrPro: 2.075 ± 0.84
TyrGln: 2.075 ± 2.411
TyrArg: 0.0 ± 0.0
TyrSer: 7.261 ± 2.863
TyrThr: 6.224 ± 0.812
TyrVal: 0.0 ± 0.0
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.037 ± 0.782
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (965 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
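The exact resampling procedure is not described here, so the following is only a sketch of what a protein-level bootstrap could look like: whole proteins are drawn with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. The function names, the use of the population standard deviation, and the toy sequences are all assumptions made for illustration.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide across a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)


# Toy sequences, not real WDV proteins
toy = ["MSSALSS", "ASSVKD", "MKDLLA", "TPTPSS"]
print(pair_permille(toy, "SS"), bootstrap_error(toy, "SS"))
```

Under this scheme a dipeptide that never occurs in any protein has a frequency of 0 in every resample, so its estimated error is also 0, which matches the 0.0 ± 0.0 entries in the table.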

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski