Amino acid dipeptide frequency for Sophora japonica powdery mildew-associated partitivirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
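As a rough illustration of how such frequencies can be computed, the Python sketch below counts overlapping residue pairs within each protein and reports them in per mille. The function name `dipeptide_per_mille`, the one-letter sequence encoding, and the toy sequences are assumptions made for this example; it is not necessarily how Proteome-pI builds its table.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumption: sequences use one-letter codes, and pairs are counted as
    overlapping windows inside each protein (never across two proteins).
    """
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input, not taken from the proteome described here.
freqs = dipeptide_per_mille(["MKKLA", "AKK"])

# Uniform expectation for 400 dipeptides: 1/400 = 0.25% = 2.5 per mille.
expected = 1000.0 / 400
overrepresented = sorted(p for p, f in freqs.items() if f > expected)
print(overrepresented)
```

With only a handful of residues every observed pair exceeds the uniform expectation, so the comparison only becomes informative on full proteomes.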

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
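As a worked conversion, using the AspAsp value from the table and the uniform expectation of 1/400 per dipeptide:

```latex
\[
15.315\,\text{‰} = 15.315 \times 10^{-3} = 0.015315 \approx 1.53\%
\]
\[
\frac{1}{400} = 0.0025 = 0.25\% = 2.5\,\text{‰}
\]
```

In the units of the table, the uniform expectation therefore corresponds to 2.5‰.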

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.802 ± 1.237
AlaCys: 0.0 ± 0.0
AlaAsp: 3.604 ± 1.298
AlaGlu: 5.405 ± 2.172
AlaPhe: 3.604 ± 0.121
AlaGly: 3.604 ± 1.298
AlaHis: 1.802 ± 0.06
AlaIle: 1.802 ± 0.06
AlaLys: 2.703 ± 0.498
AlaLeu: 3.604 ± 0.121
AlaMet: 0.0 ± 0.0
AlaAsn: 3.604 ± 1.298
AlaPro: 2.703 ± 0.679
AlaGln: 0.0 ± 0.0
AlaArg: 3.604 ± 1.298
AlaSer: 4.505 ± 0.437
AlaThr: 3.604 ± 1.298
AlaVal: 1.802 ± 0.06
AlaTrp: 0.0 ± 0.0
AlaTyr: 2.703 ± 0.498
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.901 ± 0.558
CysAsp: 0.0 ± 0.0
CysGlu: 0.901 ± 0.558
CysPhe: 1.802 ± 0.06
CysGly: 0.901 ± 0.619
CysHis: 0.0 ± 0.0
CysIle: 0.901 ± 0.558
CysLys: 0.0 ± 0.0
CysLeu: 2.703 ± 0.498
CysMet: 0.0 ± 0.0
CysAsn: 0.901 ± 0.619
CysPro: 1.802 ± 0.06
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 2.703 ± 1.674
CysThr: 0.0 ± 0.0
CysVal: 0.901 ± 0.558
CysTrp: 0.0 ± 0.0
CysTyr: 0.901 ± 0.558
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.703 ± 0.679
AspCys: 0.0 ± 0.0
AspAsp: 15.315 ± 3.455
AspGlu: 4.505 ± 0.739
AspPhe: 6.306 ± 1.554
AspGly: 2.703 ± 1.674
AspHis: 0.0 ± 0.0
AspIle: 2.703 ± 0.679
AspLys: 1.802 ± 1.116
AspLeu: 6.306 ± 0.377
AspMet: 0.901 ± 0.558
AspAsn: 0.901 ± 0.558
AspPro: 2.703 ± 0.679
AspGln: 0.0 ± 0.0
AspArg: 1.802 ± 0.06
AspSer: 8.108 ± 2.037
AspThr: 1.802 ± 0.06
AspVal: 4.505 ± 0.437
AspTrp: 0.901 ± 0.558
AspTyr: 1.802 ± 0.06
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.405 ± 0.181
GluCys: 1.802 ± 1.116
GluAsp: 0.901 ± 0.558
GluGlu: 0.901 ± 0.619
GluPhe: 0.901 ± 0.558
GluGly: 1.802 ± 1.116
GluHis: 0.901 ± 0.619
GluIle: 3.604 ± 0.121
GluLys: 2.703 ± 0.498
GluLeu: 1.802 ± 1.116
GluMet: 0.901 ± 0.443
GluAsn: 0.901 ± 0.558
GluPro: 2.703 ± 0.498
GluGln: 1.802 ± 1.237
GluArg: 3.604 ± 1.056
GluSer: 8.108 ± 2.037
GluThr: 2.703 ± 0.498
GluVal: 1.802 ± 1.116
GluTrp: 0.0 ± 0.0
GluTyr: 0.901 ± 0.558
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.901 ± 0.558
PheCys: 0.0 ± 0.0
PheAsp: 4.505 ± 0.437
PheGlu: 2.703 ± 1.674
PhePhe: 3.604 ± 1.298
PheGly: 4.505 ± 0.739
PheHis: 3.604 ± 2.233
PheIle: 3.604 ± 0.121
PheLys: 2.703 ± 1.674
PheLeu: 5.405 ± 2.172
PheMet: 1.802 ± 1.237
PheAsn: 3.604 ± 0.121
PhePro: 4.505 ± 0.739
PheGln: 2.703 ± 0.498
PheArg: 4.505 ± 0.437
PheSer: 4.505 ± 0.437
PheThr: 2.703 ± 1.856
PheVal: 5.405 ± 1.358
PheTrp: 0.901 ± 0.558
PheTyr: 0.901 ± 0.558
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.604 ± 2.474
GlyCys: 0.901 ± 0.558
GlyAsp: 2.703 ± 1.674
GlyGlu: 3.604 ± 0.121
GlyPhe: 8.108 ± 0.86
GlyGly: 1.802 ± 0.06
GlyHis: 0.0 ± 0.0
GlyIle: 7.207 ± 2.112
GlyLys: 2.703 ± 0.679
GlyLeu: 2.703 ± 1.856
GlyMet: 0.901 ± 0.619
GlyAsn: 0.901 ± 0.619
GlyPro: 1.802 ± 0.06
GlyGln: 1.802 ± 1.116
GlyArg: 5.405 ± 2.535
GlySer: 6.306 ± 0.8
GlyThr: 0.901 ± 0.558
GlyVal: 0.901 ± 0.558
GlyTrp: 1.802 ± 0.06
GlyTyr: 0.901 ± 0.558
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 0.0 ± 0.0
HisPhe: 1.802 ± 0.06
HisGly: 0.0 ± 0.0
HisHis: 0.0 ± 0.0
HisIle: 0.0 ± 0.0
HisLys: 0.0 ± 0.0
HisLeu: 1.802 ± 1.116
HisMet: 0.901 ± 0.619
HisAsn: 2.703 ± 1.674
HisPro: 1.802 ± 0.06
HisGln: 0.901 ± 0.558
HisArg: 0.901 ± 0.558
HisSer: 1.802 ± 1.237
HisThr: 1.802 ± 0.06
HisVal: 0.901 ± 0.619
HisTrp: 0.0 ± 0.0
HisTyr: 0.901 ± 0.558
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.306 ± 1.554
IleCys: 1.802 ± 0.06
IleAsp: 0.0 ± 0.0
IleGlu: 3.604 ± 0.121
IlePhe: 3.604 ± 0.121
IleGly: 3.604 ± 0.121
IleHis: 1.802 ± 1.116
IleIle: 3.604 ± 0.121
IleLys: 3.604 ± 1.056
IleLeu: 4.505 ± 3.093
IleMet: 0.0 ± 0.0
IleAsn: 4.505 ± 1.614
IlePro: 4.505 ± 0.739
IleGln: 4.505 ± 0.437
IleArg: 4.505 ± 0.437
IleSer: 1.802 ± 1.237
IleThr: 4.505 ± 3.093
IleVal: 0.0 ± 0.0
IleTrp: 1.802 ± 1.116
IleTyr: 1.802 ± 1.116
IleXaa: 0.0 ± 0.0
Lys
LysAla: 0.0 ± 0.0
LysCys: 0.901 ± 0.558
LysAsp: 3.604 ± 1.056
LysGlu: 3.604 ± 0.121
LysPhe: 1.802 ± 0.06
LysGly: 3.604 ± 0.121
LysHis: 0.901 ± 0.558
LysIle: 5.405 ± 2.172
LysLys: 6.306 ± 0.8
LysLeu: 2.703 ± 0.498
LysMet: 0.0 ± 0.464
LysAsn: 0.0 ± 0.0
LysPro: 2.703 ± 0.679
LysGln: 3.604 ± 1.298
LysArg: 3.604 ± 2.233
LysSer: 8.108 ± 0.86
LysThr: 3.604 ± 0.121
LysVal: 2.703 ± 0.498
LysTrp: 0.0 ± 0.0
LysTyr: 1.802 ± 1.116
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.207 ± 1.418
LeuCys: 0.901 ± 0.619
LeuAsp: 2.703 ± 0.498
LeuGlu: 1.802 ± 0.06
LeuPhe: 4.505 ± 0.437
LeuGly: 3.604 ± 0.121
LeuHis: 2.703 ± 0.498
LeuIle: 5.405 ± 0.181
LeuLys: 3.604 ± 1.298
LeuLeu: 4.505 ± 2.791
LeuMet: 0.901 ± 0.558
LeuAsn: 3.604 ± 1.056
LeuPro: 0.901 ± 0.558
LeuGln: 6.306 ± 1.977
LeuArg: 9.91 ± 1.433
LeuSer: 14.414 ± 1.66
LeuThr: 7.207 ± 2.112
LeuVal: 1.802 ± 0.06
LeuTrp: 2.703 ± 1.674
LeuTyr: 3.604 ± 1.056
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.802 ± 1.116
MetCys: 0.901 ± 0.558
MetAsp: 0.0 ± 0.0
MetGlu: 0.0 ± 0.0
MetPhe: 1.802 ± 0.06
MetGly: 2.703 ± 0.679
MetHis: 0.0 ± 0.0
MetIle: 2.703 ± 1.856
MetLys: 0.901 ± 0.558
MetLeu: 1.802 ± 1.116
MetMet: 0.0 ± 0.0
MetAsn: 0.901 ± 0.619
MetPro: 0.901 ± 0.619
MetGln: 0.0 ± 0.0
MetArg: 0.0 ± 0.0
MetSer: 1.802 ± 1.237
MetThr: 0.0 ± 0.0
MetVal: 0.901 ± 0.619
MetTrp: 0.901 ± 0.558
MetTyr: 2.703 ± 0.679
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.802 ± 0.06
AsnCys: 0.0 ± 0.0
AsnAsp: 4.505 ± 0.739
AsnGlu: 1.802 ± 1.116
AsnPhe: 5.405 ± 0.181
AsnGly: 1.802 ± 0.06
AsnHis: 0.0 ± 0.0
AsnIle: 2.703 ± 0.679
AsnLys: 0.901 ± 0.558
AsnLeu: 9.009 ± 0.875
AsnMet: 1.802 ± 0.06
AsnAsn: 1.802 ± 0.06
AsnPro: 1.802 ± 1.116
AsnGln: 2.703 ± 0.498
AsnArg: 2.703 ± 1.674
AsnSer: 2.703 ± 0.679
AsnThr: 1.802 ± 1.116
AsnVal: 0.0 ± 0.0
AsnTrp: 1.802 ± 1.116
AsnTyr: 0.901 ± 0.558
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.604 ± 2.474
ProCys: 0.0 ± 0.0
ProAsp: 5.405 ± 0.181
ProGlu: 1.802 ± 0.06
ProPhe: 1.802 ± 0.06
ProGly: 1.802 ± 0.06
ProHis: 1.802 ± 0.06
ProIle: 3.604 ± 1.056
ProLys: 2.703 ± 1.674
ProLeu: 4.505 ± 0.739
ProMet: 2.703 ± 0.679
ProAsn: 0.901 ± 0.619
ProPro: 4.505 ± 1.916
ProGln: 2.703 ± 0.679
ProArg: 2.703 ± 1.674
ProSer: 3.604 ± 0.121
ProThr: 5.405 ± 0.181
ProVal: 4.505 ± 3.093
ProTrp: 1.802 ± 0.06
ProTyr: 2.703 ± 0.498
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.901 ± 0.619
GlnCys: 0.901 ± 0.619
GlnAsp: 0.901 ± 0.558
GlnGlu: 0.901 ± 0.619
GlnPhe: 1.802 ± 0.06
GlnGly: 2.703 ± 0.498
GlnHis: 0.0 ± 0.0
GlnIle: 2.703 ± 0.679
GlnLys: 7.207 ± 0.935
GlnLeu: 4.505 ± 1.614
GlnMet: 1.802 ± 1.237
GlnAsn: 1.802 ± 0.06
GlnPro: 0.901 ± 0.619
GlnGln: 0.901 ± 0.558
GlnArg: 2.703 ± 0.679
GlnSer: 4.505 ± 0.437
GlnThr: 2.703 ± 0.498
GlnVal: 0.901 ± 0.558
GlnTrp: 1.802 ± 0.06
GlnTyr: 2.703 ± 0.498
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.505 ± 0.437
ArgCys: 0.0 ± 0.0
ArgAsp: 4.505 ± 0.739
ArgGlu: 1.802 ± 1.237
ArgPhe: 5.405 ± 2.172
ArgGly: 2.703 ± 0.679
ArgHis: 0.0 ± 0.0
ArgIle: 3.604 ± 1.056
ArgLys: 2.703 ± 1.856
ArgLeu: 5.405 ± 2.172
ArgMet: 2.703 ± 0.498
ArgAsn: 2.703 ± 0.498
ArgPro: 3.604 ± 1.056
ArgGln: 4.505 ± 0.437
ArgArg: 1.802 ± 1.237
ArgSer: 9.009 ± 3.228
ArgThr: 3.604 ± 1.298
ArgVal: 1.802 ± 0.06
ArgTrp: 0.901 ± 0.619
ArgTyr: 0.0 ± 0.0
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.703 ± 0.498
SerCys: 0.901 ± 0.558
SerAsp: 8.108 ± 1.493
SerGlu: 5.405 ± 0.995
SerPhe: 1.802 ± 1.116
SerGly: 7.207 ± 0.242
SerHis: 0.0 ± 0.0
SerIle: 4.505 ± 0.739
SerLys: 8.108 ± 0.316
SerLeu: 11.712 ± 2.158
SerMet: 1.802 ± 0.06
SerAsn: 1.802 ± 0.06
SerPro: 6.306 ± 1.977
SerGln: 1.802 ± 0.06
SerArg: 7.207 ± 0.935
SerSer: 9.009 ± 5.009
SerThr: 9.91 ± 3.274
SerVal: 6.306 ± 0.8
SerTrp: 2.703 ± 1.856
SerTyr: 2.703 ± 0.498
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.802 ± 0.06
ThrCys: 2.703 ± 0.498
ThrAsp: 6.306 ± 1.977
ThrGlu: 0.901 ± 0.558
ThrPhe: 4.505 ± 0.437
ThrGly: 4.505 ± 1.916
ThrHis: 1.802 ± 1.237
ThrIle: 3.604 ± 1.298
ThrLys: 2.703 ± 0.679
ThrLeu: 4.505 ± 1.614
ThrMet: 0.0 ± 0.0
ThrAsn: 5.405 ± 0.181
ThrPro: 3.604 ± 0.121
ThrGln: 2.703 ± 0.498
ThrArg: 2.703 ± 0.679
ThrSer: 0.901 ± 0.619
ThrThr: 3.604 ± 0.121
ThrVal: 5.405 ± 2.535
ThrTrp: 0.0 ± 0.0
ThrTyr: 2.703 ± 0.679
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.604 ± 0.121
ValCys: 0.901 ± 0.558
ValAsp: 2.703 ± 0.498
ValGlu: 1.802 ± 1.116
ValPhe: 0.901 ± 0.558
ValGly: 0.901 ± 0.619
ValHis: 0.0 ± 0.0
ValIle: 1.802 ± 0.06
ValLys: 3.604 ± 0.121
ValLeu: 8.108 ± 4.39
ValMet: 0.0 ± 0.0
ValAsn: 1.802 ± 0.06
ValPro: 5.405 ± 1.358
ValGln: 2.703 ± 0.498
ValArg: 0.901 ± 0.558
ValSer: 3.604 ± 0.121
ValThr: 2.703 ± 1.856
ValVal: 0.0 ± 0.0
ValTrp: 0.0 ± 0.0
ValTyr: 0.901 ± 0.619
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.901 ± 0.619
TrpCys: 0.901 ± 0.619
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.901 ± 0.558
TrpGly: 0.901 ± 0.558
TrpHis: 0.0 ± 0.0
TrpIle: 0.901 ± 0.619
TrpLys: 0.0 ± 0.0
TrpLeu: 0.901 ± 0.558
TrpMet: 1.802 ± 0.06
TrpAsn: 2.703 ± 1.674
TrpPro: 0.901 ± 0.558
TrpGln: 0.901 ± 0.558
TrpArg: 0.901 ± 0.558
TrpSer: 2.703 ± 1.674
TrpThr: 0.901 ± 0.619
TrpVal: 0.901 ± 0.619
TrpTrp: 0.0 ± 0.0
TrpTyr: 2.703 ± 0.679
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.901 ± 0.558
TyrCys: 0.901 ± 0.558
TyrAsp: 0.0 ± 0.0
TyrGlu: 2.703 ± 0.498
TyrPhe: 2.703 ± 0.679
TyrGly: 3.604 ± 0.121
TyrHis: 0.901 ± 0.619
TyrIle: 0.0 ± 0.0
TyrLys: 0.901 ± 0.558
TyrLeu: 1.802 ± 1.116
TyrMet: 0.901 ± 0.558
TyrAsn: 4.505 ± 2.791
TyrPro: 4.505 ± 0.437
TyrGln: 2.703 ± 0.498
TyrArg: 1.802 ± 1.237
TyrSer: 1.802 ± 0.06
TyrThr: 0.901 ± 0.558
TyrVal: 0.901 ± 0.558
TyrTrp: 1.802 ± 0.06
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1111 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
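A protein-level bootstrap of this kind can be sketched in Python as follows. This is only an illustration under stated assumptions: the error is taken to be the standard deviation across replicates, whole proteins are resampled with replacement, and the names `dipeptide_per_mille` and `bootstrap_error` are hypothetical. The exact procedure used by Proteome-pI may differ.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(sequences):
    """Dipeptide frequencies in per mille, counted within each protein."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Spread of one dipeptide frequency over protein-level resamples.

    Assumption: the reported error is the (population) standard deviation
    of the frequency across n_boot bootstrap replicates.
    """
    rng = random.Random(seed)
    replicates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        replicates.append(dipeptide_per_mille(sample).get(pair, 0.0))
    return statistics.pstdev(replicates)


# Hypothetical toy proteome of two short one-letter sequences.
toy = ["MKKLA", "AKKKD"]
print(bootstrap_error(toy, "KK"))
```

With only two proteins, each replicate can take just a few distinct compositions, so the estimated errors should be read as rough indications.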

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski