Amino acid dipepetide frequency for Red clover powdery mildew-associated totivirus 3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.743AlaAla: 6.743 ± 1.407
4.046AlaCys: 4.046 ± 0.234
4.72AlaAsp: 4.72 ± 0.235
4.046AlaGlu: 4.046 ± 0.234
4.72AlaPhe: 4.72 ± 0.782
4.72AlaGly: 4.72 ± 0.782
1.349AlaHis: 1.349 ± 0.078
4.046AlaIle: 4.046 ± 0.782
5.394AlaLys: 5.394 ± 0.313
8.766AlaLeu: 8.766 ± 1.016
5.394AlaMet: 5.394 ± 2.737
3.372AlaAsn: 3.372 ± 0.313
4.72AlaPro: 4.72 ± 2.815
4.046AlaGln: 4.046 ± 0.782
4.72AlaArg: 4.72 ± 0.235
6.743AlaSer: 6.743 ± 2.424
3.372AlaThr: 3.372 ± 1.329
5.394AlaVal: 5.394 ± 1.72
0.674AlaTrp: 0.674 ± 0.547
3.372AlaTyr: 3.372 ± 0.704
0.0AlaXaa: 0.0 ± 0.0
Cys
0.674CysAla: 0.674 ± 0.469
0.674CysCys: 0.674 ± 0.469
1.349CysAsp: 1.349 ± 1.095
2.697CysGlu: 2.697 ± 0.86
0.0CysPhe: 0.0 ± 0.0
1.349CysGly: 1.349 ± 0.078
0.0CysHis: 0.0 ± 0.0
1.349CysIle: 1.349 ± 0.938
0.0CysLys: 0.0 ± 0.0
1.349CysLeu: 1.349 ± 0.078
0.0CysMet: 0.0 ± 0.0
0.674CysAsn: 0.674 ± 0.547
0.0CysPro: 0.0 ± 0.0
0.674CysGln: 0.674 ± 0.469
0.0CysArg: 0.0 ± 0.0
0.0CysSer: 0.0 ± 0.0
0.674CysThr: 0.674 ± 0.469
0.674CysVal: 0.674 ± 0.469
0.0CysTrp: 0.0 ± 0.0
1.349CysTyr: 1.349 ± 1.095
0.0CysXaa: 0.0 ± 0.0
Asp
4.72AspAla: 4.72 ± 1.251
1.349AspCys: 1.349 ± 0.078
2.697AspAsp: 2.697 ± 0.156
2.697AspGlu: 2.697 ± 0.156
4.046AspPhe: 4.046 ± 0.234
5.394AspGly: 5.394 ± 0.313
0.674AspHis: 0.674 ± 0.469
2.023AspIle: 2.023 ± 1.642
0.674AspLys: 0.674 ± 0.547
6.069AspLeu: 6.069 ± 1.876
2.697AspMet: 2.697 ± 1.173
0.674AspAsn: 0.674 ± 0.547
0.0AspPro: 0.0 ± 0.0
0.674AspGln: 0.674 ± 0.469
6.069AspArg: 6.069 ± 0.157
3.372AspSer: 3.372 ± 0.313
4.046AspThr: 4.046 ± 1.251
5.394AspVal: 5.394 ± 1.329
2.023AspTrp: 2.023 ± 0.625
4.046AspTyr: 4.046 ± 0.234
0.0AspXaa: 0.0 ± 0.0
Glu
5.394GluAla: 5.394 ± 0.704
0.0GluCys: 0.0 ± 0.0
4.046GluAsp: 4.046 ± 0.782
0.0GluGlu: 0.0 ± 0.0
2.023GluPhe: 2.023 ± 0.391
2.023GluGly: 2.023 ± 0.625
0.674GluHis: 0.674 ± 0.547
3.372GluIle: 3.372 ± 1.72
2.697GluLys: 2.697 ± 0.86
6.743GluLeu: 6.743 ± 1.407
2.023GluMet: 2.023 ± 0.391
2.697GluAsn: 2.697 ± 0.156
3.372GluPro: 3.372 ± 0.704
3.372GluGln: 3.372 ± 0.313
2.697GluArg: 2.697 ± 1.173
6.743GluSer: 6.743 ± 0.626
2.023GluThr: 2.023 ± 0.625
5.394GluVal: 5.394 ± 2.737
1.349GluTrp: 1.349 ± 0.938
1.349GluTyr: 1.349 ± 1.095
0.0GluXaa: 0.0 ± 0.0
Phe
4.72PheAla: 4.72 ± 1.251
0.0PheCys: 0.0 ± 0.0
2.023PheAsp: 2.023 ± 0.625
4.046PheGlu: 4.046 ± 0.234
0.674PhePhe: 0.674 ± 0.547
2.023PheGly: 2.023 ± 0.391
0.674PheHis: 0.674 ± 0.547
2.023PheIle: 2.023 ± 0.625
3.372PheLys: 3.372 ± 0.704
2.023PheLeu: 2.023 ± 0.391
0.674PheMet: 0.674 ± 0.469
3.372PheAsn: 3.372 ± 0.313
1.349PhePro: 1.349 ± 0.078
0.674PheGln: 0.674 ± 0.547
2.697PheArg: 2.697 ± 0.156
2.023PheSer: 2.023 ± 1.642
1.349PheThr: 1.349 ± 0.078
2.023PheVal: 2.023 ± 0.625
1.349PheTrp: 1.349 ± 0.078
0.674PheTyr: 0.674 ± 0.547
0.0PheXaa: 0.0 ± 0.0
Gly
6.743GlyAla: 6.743 ± 0.391
0.0GlyCys: 0.0 ± 0.0
4.72GlyAsp: 4.72 ± 0.782
1.349GlyGlu: 1.349 ± 0.078
0.674GlyPhe: 0.674 ± 0.547
4.046GlyGly: 4.046 ± 0.234
0.674GlyHis: 0.674 ± 0.547
5.394GlyIle: 5.394 ± 2.737
3.372GlyLys: 3.372 ± 1.329
3.372GlyLeu: 3.372 ± 0.313
2.697GlyMet: 2.697 ± 0.927
0.674GlyAsn: 0.674 ± 0.547
2.023GlyPro: 2.023 ± 0.391
2.697GlyGln: 2.697 ± 0.156
4.046GlyArg: 4.046 ± 1.251
4.046GlySer: 4.046 ± 1.251
4.046GlyThr: 4.046 ± 1.799
4.72GlyVal: 4.72 ± 0.235
1.349GlyTrp: 1.349 ± 0.938
2.023GlyTyr: 2.023 ± 0.625
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.0HisCys: 0.0 ± 0.0
2.023HisAsp: 2.023 ± 0.625
2.697HisGlu: 2.697 ± 2.189
1.349HisPhe: 1.349 ± 0.938
0.674HisGly: 0.674 ± 0.547
0.674HisHis: 0.674 ± 0.469
1.349HisIle: 1.349 ± 0.078
1.349HisLys: 1.349 ± 0.938
2.023HisLeu: 2.023 ± 1.408
0.674HisMet: 0.674 ± 0.547
0.674HisAsn: 0.674 ± 0.469
1.349HisPro: 1.349 ± 0.938
0.674HisGln: 0.674 ± 0.469
0.674HisArg: 0.674 ± 0.469
3.372HisSer: 3.372 ± 1.329
0.674HisThr: 0.674 ± 0.469
2.023HisVal: 2.023 ± 0.625
0.674HisTrp: 0.674 ± 0.547
1.349HisTyr: 1.349 ± 0.078
0.0HisXaa: 0.0 ± 0.0
Ile
4.046IleAla: 4.046 ± 1.799
0.674IleCys: 0.674 ± 0.547
4.046IleAsp: 4.046 ± 0.782
2.697IleGlu: 2.697 ± 0.86
3.372IlePhe: 3.372 ± 0.313
2.023IleGly: 2.023 ± 0.625
1.349IleHis: 1.349 ± 0.938
2.697IleIle: 2.697 ± 1.173
2.697IleLys: 2.697 ± 0.86
3.372IleLeu: 3.372 ± 1.72
1.349IleMet: 1.349 ± 1.095
1.349IleAsn: 1.349 ± 0.078
2.023IlePro: 2.023 ± 1.642
0.674IleGln: 0.674 ± 0.547
4.046IleArg: 4.046 ± 0.782
3.372IleSer: 3.372 ± 0.704
5.394IleThr: 5.394 ± 0.704
2.023IleVal: 2.023 ± 0.625
0.674IleTrp: 0.674 ± 0.547
1.349IleTyr: 1.349 ± 0.938
0.0IleXaa: 0.0 ± 0.0
Lys
4.72LysAla: 4.72 ± 0.235
0.674LysCys: 0.674 ± 0.469
2.023LysAsp: 2.023 ± 0.391
1.349LysGlu: 1.349 ± 0.938
1.349LysPhe: 1.349 ± 0.078
4.046LysGly: 4.046 ± 1.251
0.674LysHis: 0.674 ± 0.547
4.046LysIle: 4.046 ± 0.234
2.023LysLys: 2.023 ± 0.391
3.372LysLeu: 3.372 ± 2.346
0.674LysMet: 0.674 ± 0.469
2.697LysAsn: 2.697 ± 0.156
2.023LysPro: 2.023 ± 0.391
1.349LysGln: 1.349 ± 0.938
2.023LysArg: 2.023 ± 0.625
4.046LysSer: 4.046 ± 1.799
0.674LysThr: 0.674 ± 0.469
8.092LysVal: 8.092 ± 0.469
0.0LysTrp: 0.0 ± 0.0
3.372LysTyr: 3.372 ± 0.313
0.0LysXaa: 0.0 ± 0.0
Leu
9.44LeuAla: 9.44 ± 0.547
0.674LeuCys: 0.674 ± 0.469
3.372LeuAsp: 3.372 ± 0.704
4.72LeuGlu: 4.72 ± 0.782
6.069LeuPhe: 6.069 ± 0.86
3.372LeuGly: 3.372 ± 0.704
2.023LeuHis: 2.023 ± 1.408
1.349LeuIle: 1.349 ± 0.078
2.697LeuLys: 2.697 ± 1.173
8.092LeuLeu: 8.092 ± 2.581
0.674LeuMet: 0.674 ± 0.547
8.766LeuAsn: 8.766 ± 1.016
4.72LeuPro: 4.72 ± 2.268
2.023LeuGln: 2.023 ± 1.408
4.72LeuArg: 4.72 ± 1.251
6.069LeuSer: 6.069 ± 2.19
8.766LeuThr: 8.766 ± 0.0
6.743LeuVal: 6.743 ± 1.407
0.0LeuTrp: 0.0 ± 0.0
2.697LeuTyr: 2.697 ± 0.156
0.0LeuXaa: 0.0 ± 0.0
Met
2.697MetAla: 2.697 ± 2.189
1.349MetCys: 1.349 ± 0.938
0.674MetAsp: 0.674 ± 0.547
2.697MetGlu: 2.697 ± 0.86
0.674MetPhe: 0.674 ± 0.469
0.674MetGly: 0.674 ± 0.547
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
2.023MetLys: 2.023 ± 0.391
2.023MetLeu: 2.023 ± 0.391
1.349MetMet: 1.349 ± 0.078
2.023MetAsn: 2.023 ± 0.391
2.023MetPro: 2.023 ± 0.391
0.674MetGln: 0.674 ± 0.469
1.349MetArg: 1.349 ± 0.078
5.394MetSer: 5.394 ± 0.313
2.023MetThr: 2.023 ± 0.625
0.674MetVal: 0.674 ± 0.547
0.674MetTrp: 0.674 ± 0.469
4.046MetTyr: 4.046 ± 0.782
0.0MetXaa: 0.0 ± 0.0
Asn
4.72AsnAla: 4.72 ± 1.798
2.023AsnCys: 2.023 ± 0.391
1.349AsnAsp: 1.349 ± 1.095
1.349AsnGlu: 1.349 ± 0.078
2.697AsnPhe: 2.697 ± 1.173
2.697AsnGly: 2.697 ± 0.156
0.674AsnHis: 0.674 ± 0.547
2.023AsnIle: 2.023 ± 1.408
2.023AsnLys: 2.023 ± 0.625
2.697AsnLeu: 2.697 ± 0.86
2.697AsnMet: 2.697 ± 0.156
2.023AsnAsn: 2.023 ± 0.391
4.72AsnPro: 4.72 ± 0.782
0.674AsnGln: 0.674 ± 0.547
4.72AsnArg: 4.72 ± 1.251
1.349AsnSer: 1.349 ± 0.938
1.349AsnThr: 1.349 ± 0.078
4.046AsnVal: 4.046 ± 0.234
0.674AsnTrp: 0.674 ± 0.469
1.349AsnTyr: 1.349 ± 0.078
0.0AsnXaa: 0.0 ± 0.0
Pro
2.697ProAla: 2.697 ± 0.156
0.0ProCys: 0.0 ± 0.0
0.674ProAsp: 0.674 ± 0.469
3.372ProGlu: 3.372 ± 0.704
0.674ProPhe: 0.674 ± 0.547
4.046ProGly: 4.046 ± 0.782
0.674ProHis: 0.674 ± 0.469
3.372ProIle: 3.372 ± 0.704
4.046ProLys: 4.046 ± 0.782
2.697ProLeu: 2.697 ± 0.156
1.349ProMet: 1.349 ± 0.938
1.349ProAsn: 1.349 ± 0.078
0.0ProPro: 0.0 ± 0.0
0.674ProGln: 0.674 ± 0.547
0.674ProArg: 0.674 ± 0.547
4.72ProSer: 4.72 ± 1.251
2.697ProThr: 2.697 ± 2.189
2.697ProVal: 2.697 ± 0.86
0.0ProTrp: 0.0 ± 0.0
2.023ProTyr: 2.023 ± 1.642
0.0ProXaa: 0.0 ± 0.0
Gln
4.72GlnAla: 4.72 ± 1.798
0.674GlnCys: 0.674 ± 0.469
3.372GlnAsp: 3.372 ± 0.313
2.023GlnGlu: 2.023 ± 0.391
0.674GlnPhe: 0.674 ± 0.469
0.674GlnGly: 0.674 ± 0.469
3.372GlnHis: 3.372 ± 0.313
1.349GlnIle: 1.349 ± 0.078
0.674GlnLys: 0.674 ± 0.469
3.372GlnLeu: 3.372 ± 0.704
0.0GlnMet: 0.0 ± 0.0
0.674GlnAsn: 0.674 ± 0.469
0.674GlnPro: 0.674 ± 0.469
1.349GlnGln: 1.349 ± 0.078
2.697GlnArg: 2.697 ± 1.877
0.674GlnSer: 0.674 ± 0.469
1.349GlnThr: 1.349 ± 0.078
1.349GlnVal: 1.349 ± 1.095
0.0GlnTrp: 0.0 ± 0.0
2.023GlnTyr: 2.023 ± 0.391
0.0GlnXaa: 0.0 ± 0.0
Arg
4.72ArgAla: 4.72 ± 1.798
0.674ArgCys: 0.674 ± 0.547
5.394ArgAsp: 5.394 ± 2.346
5.394ArgGlu: 5.394 ± 0.313
1.349ArgPhe: 1.349 ± 1.095
5.394ArgGly: 5.394 ± 0.704
2.023ArgHis: 2.023 ± 0.625
2.697ArgIle: 2.697 ± 0.156
2.023ArgLys: 2.023 ± 1.408
6.069ArgLeu: 6.069 ± 1.173
0.674ArgMet: 0.674 ± 0.469
2.023ArgAsn: 2.023 ± 0.625
2.697ArgPro: 2.697 ± 0.156
2.697ArgGln: 2.697 ± 1.173
4.046ArgArg: 4.046 ± 0.234
4.72ArgSer: 4.72 ± 1.251
3.372ArgThr: 3.372 ± 0.704
6.743ArgVal: 6.743 ± 2.659
1.349ArgTrp: 1.349 ± 0.938
0.674ArgTyr: 0.674 ± 0.547
0.0ArgXaa: 0.0 ± 0.0
Ser
6.743SerAla: 6.743 ± 0.391
0.0SerCys: 0.0 ± 0.0
5.394SerAsp: 5.394 ± 1.329
2.023SerGlu: 2.023 ± 0.391
1.349SerPhe: 1.349 ± 1.095
5.394SerGly: 5.394 ± 0.313
2.023SerHis: 2.023 ± 0.625
6.069SerIle: 6.069 ± 0.157
4.72SerLys: 4.72 ± 1.251
3.372SerLeu: 3.372 ± 0.313
3.372SerMet: 3.372 ± 0.313
2.697SerAsn: 2.697 ± 0.156
0.0SerPro: 0.0 ± 0.0
5.394SerGln: 5.394 ± 0.313
4.72SerArg: 4.72 ± 0.235
6.069SerSer: 6.069 ± 2.893
1.349SerThr: 1.349 ± 0.938
10.115SerVal: 10.115 ± 5.005
1.349SerTrp: 1.349 ± 0.078
3.372SerTyr: 3.372 ± 0.704
0.0SerXaa: 0.0 ± 0.0
Thr
6.069ThrAla: 6.069 ± 0.86
0.0ThrCys: 0.0 ± 0.0
2.023ThrAsp: 2.023 ± 0.625
3.372ThrGlu: 3.372 ± 1.329
0.0ThrPhe: 0.0 ± 0.0
1.349ThrGly: 1.349 ± 0.078
2.697ThrHis: 2.697 ± 0.86
4.046ThrIle: 4.046 ± 0.234
2.697ThrLys: 2.697 ± 0.156
3.372ThrLeu: 3.372 ± 0.704
3.372ThrMet: 3.372 ± 1.329
2.023ThrAsn: 2.023 ± 0.625
2.023ThrPro: 2.023 ± 0.391
2.023ThrGln: 2.023 ± 1.408
5.394ThrArg: 5.394 ± 1.329
4.046ThrSer: 4.046 ± 0.234
4.046ThrThr: 4.046 ± 0.782
2.697ThrVal: 2.697 ± 0.156
0.674ThrTrp: 0.674 ± 0.469
0.674ThrTyr: 0.674 ± 0.469
0.0ThrXaa: 0.0 ± 0.0
Val
5.394ValAla: 5.394 ± 0.704
0.0ValCys: 0.0 ± 0.0
6.069ValAsp: 6.069 ± 1.173
8.092ValGlu: 8.092 ± 2.502
4.046ValPhe: 4.046 ± 0.782
4.72ValGly: 4.72 ± 2.268
3.372ValHis: 3.372 ± 1.329
0.674ValIle: 0.674 ± 0.547
4.046ValLys: 4.046 ± 0.782
11.463ValLeu: 11.463 ± 2.894
2.697ValMet: 2.697 ± 1.967
5.394ValAsn: 5.394 ± 1.72
4.046ValPro: 4.046 ± 0.234
1.349ValGln: 1.349 ± 0.078
5.394ValArg: 5.394 ± 1.329
4.046ValSer: 4.046 ± 0.234
1.349ValThr: 1.349 ± 0.938
3.372ValVal: 3.372 ± 0.704
0.674ValTrp: 0.674 ± 0.547
2.023ValTyr: 2.023 ± 0.391
0.0ValXaa: 0.0 ± 0.0
Trp
0.674TrpAla: 0.674 ± 0.469
0.674TrpCys: 0.674 ± 0.547
0.674TrpAsp: 0.674 ± 0.469
1.349TrpGlu: 1.349 ± 0.938
0.674TrpPhe: 0.674 ± 0.547
1.349TrpGly: 1.349 ± 0.938
0.0TrpHis: 0.0 ± 0.0
0.674TrpIle: 0.674 ± 0.547
1.349TrpLys: 1.349 ± 0.078
0.0TrpLeu: 0.0 ± 0.0
0.0TrpMet: 0.0 ± 0.0
1.349TrpAsn: 1.349 ± 0.078
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
1.349TrpArg: 1.349 ± 0.078
0.674TrpSer: 0.674 ± 0.469
2.697TrpThr: 2.697 ± 0.156
0.0TrpVal: 0.0 ± 0.0
0.674TrpTrp: 0.674 ± 0.547
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.72TyrAla: 4.72 ± 1.251
0.0TyrCys: 0.0 ± 0.0
2.697TyrAsp: 2.697 ± 2.189
2.023TyrGlu: 2.023 ± 0.391
1.349TyrPhe: 1.349 ± 0.078
2.697TyrGly: 2.697 ± 0.86
0.674TyrHis: 0.674 ± 0.469
0.674TyrIle: 0.674 ± 0.469
1.349TyrLys: 1.349 ± 0.078
6.743TyrLeu: 6.743 ± 1.407
0.0TyrMet: 0.0 ± 0.0
1.349TyrAsn: 1.349 ± 1.095
0.674TyrPro: 0.674 ± 0.547
0.0TyrGln: 0.0 ± 0.0
2.697TyrArg: 2.697 ± 1.173
4.046TyrSer: 4.046 ± 0.234
1.349TyrThr: 1.349 ± 0.078
4.72TyrVal: 4.72 ± 0.782
0.0TyrTrp: 0.0 ± 0.0
0.674TyrTyr: 0.674 ± 0.469
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1484 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski