Amino acid dipeptide frequency for Wheat dwarf India virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
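The counting itself can be sketched in a few lines of Python. This is a minimal illustration, not the code used to build the database: the function names and the toy sequences are hypothetical, and it assumes that pairs are counted in an overlapping fashion within each protein, that any residue outside the 20 standard one-letter codes is mapped to X (Xaa), and that frequencies are normalised by the total number of pairs.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes
ALPHABET = STANDARD + "X"          # X stands for Xaa (non-standard or unknown)


def dipeptide_counts(proteins):
    """Count overlapping pairs of adjacent residues within each protein."""
    counts = Counter()
    for seq in proteins:
        # Assumption: everything outside the 20 standard residues becomes X.
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Pairs are taken inside one sequence only, never across two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    return counts


def dipeptide_per_mille(proteins):
    """Return all 441 dipeptide frequencies in per mille."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found (sequences shorter than 2 residues)")
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(ALPHABET, repeat=2)}


# Toy input (hypothetical sequences, not the Wheat dwarf India virus proteome).
toy = ["MAAS", "ASU"]  # U is non-standard here and is therefore counted as X
freqs = dipeptide_per_mille(toy)
print(freqs["AS"], freqs["SX"])
```

The keys use one-letter codes, so "AS" corresponds to AlaSer in the table.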

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
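The expected value and the over/under comparison can be written out as a short Python sketch. The function names are hypothetical, and the page does not say whether the colouring uses the 400- or the 441-dipeptide baseline or a tolerance around it, so the simple comparison against the uniform expectation below is an assumption.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(value_per_mille, alphabet_size=20):
    """Label a dipeptide frequency relative to the uniform expectation."""
    expected = expected_per_mille(alphabet_size)
    if value_per_mille > expected:
        return "over"   # would be shown in red
    if value_per_mille < expected:
        return "under"  # would be shown in blue
    return "as expected"


print(expected_per_mille(20))  # baseline for the 20 standard amino acids
print(expected_per_mille(21))  # baseline when Xaa is counted as a 21st residue
print(classify(8.016))         # AlaAla value from the table
print(classify(1.002))         # AlaCys value from the table
```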

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions; for example, 8.016‰ corresponds to 0.008016, i.e. about 0.8%.

For more information, see the sequence space article on Wikipedia.

Rows: first amino acid; columns (second amino acid): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.016 ± 3.385
AlaCys: 1.002 ± 0.843
AlaAsp: 1.002 ± 0.811
AlaGlu: 4.008 ± 1.487
AlaPhe: 2.004 ± 0.744
AlaGly: 2.004 ± 0.744
AlaHis: 0.0 ± 0.0
AlaIle: 1.002 ± 1.352
AlaLys: 4.008 ± 0.924
AlaLeu: 4.008 ± 1.189
AlaMet: 0.0 ± 0.0
AlaAsn: 3.006 ± 1.216
AlaPro: 2.004 ± 1.505
AlaGln: 2.004 ± 0.837
AlaArg: 4.008 ± 2.519
AlaSer: 10.02 ± 1.951
AlaThr: 3.006 ± 0.523
AlaVal: 5.01 ± 2.536
AlaTrp: 0.0 ± 0.0
AlaTyr: 3.006 ± 2.59
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 2.004 ± 0.744
CysGly: 0.0 ± 0.0
CysHis: 1.002 ± 0.843
CysIle: 1.002 ± 0.843
CysLys: 2.004 ± 1.686
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 2.004 ± 0.744
CysPro: 3.006 ± 1.216
CysGln: 0.0 ± 0.0
CysArg: 2.004 ± 0.744
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 1.002 ± 0.717
CysTyr: 4.008 ± 1.487
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.004 ± 1.686
AspCys: 0.0 ± 0.0
AspAsp: 3.006 ± 1.216
AspGlu: 3.006 ± 1.518
AspPhe: 5.01 ± 1.45
AspGly: 6.012 ± 0.741
AspHis: 0.0 ± 0.0
AspIle: 6.012 ± 1.324
AspLys: 0.0 ± 0.0
AspLeu: 4.008 ± 1.487
AspMet: 0.0 ± 0.0
AspAsn: 0.0 ± 0.0
AspPro: 5.01 ± 2.392
AspGln: 0.0 ± 0.0
AspArg: 0.0 ± 0.0
AspSer: 3.006 ± 0.523
AspThr: 1.002 ± 0.811
AspVal: 2.004 ± 1.505
AspTrp: 5.01 ± 1.884
AspTyr: 4.008 ± 1.487
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.006 ± 1.216
GluCys: 0.0 ± 0.0
GluAsp: 2.004 ± 1.336
GluGlu: 1.002 ± 1.352
GluPhe: 1.002 ± 1.352
GluGly: 2.004 ± 1.686
GluHis: 0.0 ± 0.0
GluIle: 2.004 ± 1.336
GluLys: 3.006 ± 1.216
GluLeu: 3.006 ± 1.381
GluMet: 0.0 ± 0.0
GluAsn: 6.012 ± 2.761
GluPro: 1.002 ± 0.717
GluGln: 5.01 ± 0.971
GluArg: 1.002 ± 0.843
GluSer: 1.002 ± 0.843
GluThr: 4.008 ± 1.189
GluVal: 1.002 ± 0.843
GluTrp: 2.004 ± 1.686
GluTyr: 5.01 ± 1.884
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.002 ± 0.843
PheCys: 1.002 ± 0.717
PheAsp: 4.008 ± 1.487
PheGlu: 3.006 ± 1.216
PhePhe: 3.006 ± 1.216
PheGly: 4.008 ± 1.74
PheHis: 1.002 ± 0.843
PheIle: 2.004 ± 0.744
PheLys: 3.006 ± 1.518
PheLeu: 2.004 ± 0.744
PheMet: 0.0 ± 0.0
PheAsn: 0.0 ± 0.0
PhePro: 6.012 ± 2.231
PheGln: 2.004 ± 0.744
PheArg: 2.004 ± 0.744
PheSer: 3.006 ± 1.186
PheThr: 3.006 ± 1.311
PheVal: 5.01 ± 4.2
PheTrp: 2.004 ± 0.744
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.008 ± 0.82
GlyCys: 1.002 ± 0.843
GlyAsp: 1.002 ± 0.843
GlyGlu: 3.006 ± 2.03
GlyPhe: 1.002 ± 1.352
GlyGly: 5.01 ± 1.722
GlyHis: 2.004 ± 0.744
GlyIle: 2.004 ± 0.837
GlyLys: 2.004 ± 1.435
GlyLeu: 2.004 ± 1.505
GlyMet: 2.004 ± 0.744
GlyAsn: 4.008 ± 2.72
GlyPro: 6.012 ± 1.93
GlyGln: 5.01 ± 2.143
GlyArg: 6.012 ± 1.045
GlySer: 6.012 ± 4.06
GlyThr: 6.012 ± 1.045
GlyVal: 2.004 ± 1.686
GlyTrp: 0.0 ± 0.0
GlyTyr: 1.002 ± 0.843
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.004 ± 0.744
HisCys: 2.004 ± 0.744
HisAsp: 0.0 ± 0.0
HisGlu: 0.0 ± 0.0
HisPhe: 4.008 ± 1.487
HisGly: 3.006 ± 1.381
HisHis: 2.004 ± 0.744
HisIle: 2.004 ± 0.744
HisLys: 1.002 ± 0.843
HisLeu: 4.008 ± 1.077
HisMet: 0.0 ± 0.0
HisAsn: 1.002 ± 0.717
HisPro: 4.008 ± 1.487
HisGln: 2.004 ± 0.744
HisArg: 1.002 ± 0.843
HisSer: 2.004 ± 0.744
HisThr: 1.002 ± 0.843
HisVal: 0.0 ± 0.0
HisTrp: 0.0 ± 0.0
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.006 ± 2.733
IleCys: 0.0 ± 0.0
IleAsp: 3.006 ± 1.216
IleGlu: 2.004 ± 0.744
IlePhe: 1.002 ± 0.843
IleGly: 4.008 ± 3.009
IleHis: 1.002 ± 0.811
IleIle: 6.012 ± 2.366
IleLys: 2.004 ± 0.837
IleLeu: 4.008 ± 1.332
IleMet: 3.006 ± 1.381
IleAsn: 2.004 ± 0.95
IlePro: 5.01 ± 1.781
IleGln: 5.01 ± 0.971
IleArg: 2.004 ± 1.505
IleSer: 4.008 ± 1.487
IleThr: 2.004 ± 0.744
IleVal: 2.004 ± 1.435
IleTrp: 1.002 ± 0.843
IleTyr: 4.008 ± 0.82
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.006 ± 0.523
LysCys: 0.0 ± 0.0
LysAsp: 3.006 ± 0.523
LysGlu: 0.0 ± 0.0
LysPhe: 3.006 ± 1.311
LysGly: 8.016 ± 1.557
LysHis: 0.0 ± 0.0
LysIle: 0.0 ± 0.0
LysLys: 5.01 ± 2.302
LysLeu: 7.014 ± 1.174
LysMet: 0.0 ± 0.0
LysAsn: 3.006 ± 0.523
LysPro: 1.002 ± 0.843
LysGln: 3.006 ± 1.216
LysArg: 7.014 ± 2.778
LysSer: 5.01 ± 1.135
LysThr: 3.006 ± 1.518
LysVal: 2.004 ± 0.744
LysTrp: 0.0 ± 0.0
LysTyr: 2.004 ± 0.837
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 2.004 ± 1.505
LeuCys: 2.004 ± 0.744
LeuAsp: 0.0 ± 0.0
LeuGlu: 1.002 ± 0.843
LeuPhe: 4.008 ± 1.487
LeuGly: 3.006 ± 1.302
LeuHis: 6.012 ± 1.589
LeuIle: 4.008 ± 0.82
LeuLys: 3.006 ± 1.216
LeuLeu: 4.008 ± 2.432
LeuMet: 1.002 ± 0.843
LeuAsn: 4.008 ± 1.189
LeuPro: 1.002 ± 0.843
LeuGln: 5.01 ± 1.834
LeuArg: 3.006 ± 2.03
LeuSer: 4.008 ± 0.924
LeuThr: 4.008 ± 1.077
LeuVal: 3.006 ± 1.665
LeuTrp: 0.0 ± 0.0
LeuTyr: 7.014 ± 0.653
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.002 ± 0.843
MetCys: 0.0 ± 0.0
MetAsp: 0.0 ± 0.0
MetGlu: 1.002 ± 0.717
MetPhe: 1.002 ± 0.843
MetGly: 1.002 ± 1.352
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 2.004 ± 0.744
MetLeu: 2.004 ± 0.744
MetMet: 1.002 ± 0.705
MetAsn: 0.0 ± 0.0
MetPro: 0.0 ± 0.0
MetGln: 0.0 ± 0.0
MetArg: 0.0 ± 0.0
MetSer: 3.006 ± 0.523
MetThr: 2.004 ± 0.837
MetVal: 3.006 ± 0.523
MetTrp: 0.0 ± 0.0
MetTyr: 1.002 ± 0.843
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.004 ± 0.744
AsnCys: 2.004 ± 0.744
AsnAsp: 2.004 ± 0.744
AsnGlu: 2.004 ± 1.505
AsnPhe: 5.01 ± 0.971
AsnGly: 2.004 ± 1.686
AsnHis: 0.0 ± 0.0
AsnIle: 1.002 ± 0.717
AsnLys: 6.012 ± 1.045
AsnLeu: 4.008 ± 1.077
AsnMet: 0.0 ± 0.0
AsnAsn: 5.01 ± 1.884
AsnPro: 7.014 ± 2.593
AsnGln: 1.002 ± 0.843
AsnArg: 0.0 ± 0.0
AsnSer: 11.022 ± 3.096
AsnThr: 4.008 ± 0.924
AsnVal: 2.004 ± 0.837
AsnTrp: 0.0 ± 0.0
AsnTyr: 1.002 ± 0.717
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.004 ± 2.703
ProCys: 2.004 ± 0.837
ProAsp: 4.008 ± 2.432
ProGlu: 7.014 ± 2.593
ProPhe: 1.002 ± 0.717
ProGly: 2.004 ± 1.505
ProHis: 10.02 ± 3.718
ProIle: 4.008 ± 1.487
ProLys: 4.008 ± 0.82
ProLeu: 1.002 ± 1.352
ProMet: 0.0 ± 0.0
ProAsn: 0.0 ± 0.0
ProPro: 4.008 ± 3.009
ProGln: 1.002 ± 1.352
ProArg: 4.008 ± 1.487
ProSer: 6.012 ± 3.144
ProThr: 8.016 ± 1.641
ProVal: 1.002 ± 0.843
ProTrp: 2.004 ± 1.438
ProTyr: 7.014 ± 1.971
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.004 ± 0.744
GlnCys: 1.002 ± 0.843
GlnAsp: 4.008 ± 1.487
GlnGlu: 4.008 ± 2.139
GlnPhe: 0.0 ± 0.0
GlnGly: 3.006 ± 2.432
GlnHis: 2.004 ± 0.744
GlnIle: 3.006 ± 0.523
GlnLys: 0.0 ± 0.0
GlnLeu: 1.002 ± 0.717
GlnMet: 1.002 ± 0.737
GlnAsn: 2.004 ± 1.336
GlnPro: 4.008 ± 0.924
GlnGln: 4.008 ± 1.487
GlnArg: 3.006 ± 1.186
GlnSer: 4.008 ± 0.924
GlnThr: 4.008 ± 1.487
GlnVal: 3.006 ± 0.523
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.004 ± 1.438
ArgCys: 0.0 ± 0.0
ArgAsp: 6.012 ± 1.694
ArgGlu: 1.002 ± 0.843
ArgPhe: 1.002 ± 0.843
ArgGly: 5.01 ± 1.312
ArgHis: 4.008 ± 0.82
ArgIle: 2.004 ± 1.622
ArgLys: 4.008 ± 1.189
ArgLeu: 4.008 ± 3.009
ArgMet: 2.004 ± 1.686
ArgAsn: 4.008 ± 1.487
ArgPro: 2.004 ± 1.505
ArgGln: 1.002 ± 1.352
ArgArg: 5.01 ± 1.312
ArgSer: 7.014 ± 0.895
ArgThr: 3.006 ± 1.518
ArgVal: 0.0 ± 0.0
ArgTrp: 2.004 ± 1.505
ArgTyr: 2.004 ± 0.744
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 10.02 ± 1.5
SerCys: 2.004 ± 0.744
SerAsp: 7.014 ± 2.604
SerGlu: 2.004 ± 0.744
SerPhe: 0.0 ± 0.0
SerGly: 1.002 ± 0.843
SerHis: 0.0 ± 0.0
SerIle: 6.012 ± 2.279
SerLys: 6.012 ± 1.045
SerLeu: 4.008 ± 2.309
SerMet: 0.0 ± 0.0
SerAsn: 10.02 ± 1.941
SerPro: 7.014 ± 1.952
SerGln: 3.006 ± 0.523
SerArg: 4.008 ± 1.077
SerSer: 7.014 ± 1.487
SerThr: 12.024 ± 3.376
SerVal: 6.012 ± 2.761
SerTrp: 1.002 ± 0.843
SerTyr: 3.006 ± 2.03
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 0.0 ± 0.0
ThrCys: 0.0 ± 0.0
ThrAsp: 1.002 ± 0.843
ThrGlu: 3.006 ± 1.302
ThrPhe: 7.014 ± 1.648
ThrGly: 6.012 ± 2.821
ThrHis: 0.0 ± 0.0
ThrIle: 4.008 ± 1.077
ThrLys: 3.006 ± 0.523
ThrLeu: 4.008 ± 1.189
ThrMet: 2.004 ± 0.837
ThrAsn: 6.012 ± 1.045
ThrPro: 6.012 ± 1.694
ThrGln: 0.0 ± 0.0
ThrArg: 5.01 ± 0.834
ThrSer: 5.01 ± 2.064
ThrThr: 4.008 ± 1.9
ThrVal: 2.004 ± 1.435
ThrTrp: 5.01 ± 0.971
ThrTyr: 3.006 ± 1.518
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.01 ± 1.446
ValCys: 0.0 ± 0.0
ValAsp: 2.004 ± 0.837
ValGlu: 4.008 ± 1.189
ValPhe: 2.004 ± 2.703
ValGly: 1.002 ± 0.717
ValHis: 2.004 ± 0.744
ValIle: 3.006 ± 2.733
ValLys: 2.004 ± 1.686
ValLeu: 0.0 ± 0.0
ValMet: 1.002 ± 0.665
ValAsn: 2.004 ± 1.435
ValPro: 3.006 ± 0.523
ValGln: 5.01 ± 2.143
ValArg: 5.01 ± 1.312
ValSer: 1.002 ± 0.843
ValThr: 0.0 ± 0.0
ValVal: 2.004 ± 1.686
ValTrp: 1.002 ± 0.843
ValTyr: 1.002 ± 0.717
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 4.008 ± 1.487
TrpCys: 0.0 ± 0.0
TrpAsp: 2.004 ± 0.744
TrpGlu: 0.0 ± 0.0
TrpPhe: 1.002 ± 1.352
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 3.006 ± 1.186
TrpLys: 2.004 ± 0.837
TrpLeu: 3.006 ± 0.523
TrpMet: 1.002 ± 0.843
TrpAsn: 0.0 ± 0.0
TrpPro: 2.004 ± 1.686
TrpGln: 1.002 ± 0.717
TrpArg: 0.0 ± 0.0
TrpSer: 3.006 ± 0.523
TrpThr: 1.002 ± 0.843
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.006 ± 0.523
TyrCys: 4.008 ± 1.487
TyrAsp: 4.008 ± 0.82
TyrGlu: 2.004 ± 0.744
TyrPhe: 3.006 ± 1.518
TyrGly: 3.006 ± 1.216
TyrHis: 0.0 ± 0.0
TyrIle: 4.008 ± 1.332
TyrLys: 1.002 ± 0.843
TyrLeu: 4.008 ± 0.924
TyrMet: 4.008 ± 1.21
TyrAsn: 3.006 ± 1.216
TyrPro: 1.002 ± 1.352
TyrGln: 0.0 ± 0.0
TyrArg: 3.006 ± 1.186
TyrSer: 6.012 ± 1.045
TyrThr: 1.002 ± 0.843
TyrVal: 1.002 ± 0.843
TyrTrp: 1.002 ± 0.717
TyrTyr: 1.002 ± 0.843
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (999 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level
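One way such a protein-level bootstrap could look is sketched below in Python. The page gives no further detail on the procedure, so everything here is an assumption made for illustration: whole proteins are resampled with replacement, the frequency is recomputed for each of 100 resamples, and the error is taken as the standard deviation of those estimates. The function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes, e.g. 'AS') in per mille."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so that the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(estimates)


# Toy proteome (hypothetical sequences, not the viral proteins).
toy = ["MAASKL", "MKLAAS", "MSTNP", "MAS"]
value = pair_per_mille(toy, "AS")
error = bootstrap_error(toy, "AS")
print(f"AS: {value:.3f} ± {error:.3f}")
```

With only four proteins, as in this proteome, each resample can differ strongly from the original set, which is consistent with the fairly large errors relative to the values in the table.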

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski