Amino acid dipeptide frequency for Eragrostis minor streak virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs in the proteins.
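As a minimal sketch of how such frequencies could be computed, the Python snippet below counts overlapping dipeptides within each protein and reports them in per mille. The function names, the one-letter input format, and the choice to map every non-standard letter to X (Xaa) are assumptions made for illustration; the exact counting and normalisation used to produce the table are not described on this page.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_counts(proteins):
    """Count overlapping dipeptides inside each protein sequence.

    Assumption: any letter outside the 20 standard amino acids is
    treated as X (Xaa). Pairs are never formed across two proteins.
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    return counts


def dipeptide_per_mille(proteins):
    """Return dipeptide frequencies in per mille of all counted dipeptides."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}
```

For example, `dipeptide_counts(["AAC"])` counts one `AA` and one `AC`, so each contributes 500‰ in `dipeptide_per_mille(["AAC"])`.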

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
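The expected value follows directly from the number of possible dipeptides: 1/400 = 0.25% = 2.5‰ for 20 amino acids, and 1/441 ≈ 2.27‰ when Xaa is included. The sketch below reproduces that arithmetic and labels a table value as over- or underrepresented. The function names are hypothetical, and using the uniform expectation as the colouring threshold is an assumption; the page does not state the exact rule behind the red and blue marking.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / (alphabet_size ** 2)


def classify(value_per_mille, alphabet_size=20):
    """Label a frequency relative to the uniform expectation.

    Assumption: the threshold is the uniform expectation itself
    (2.5 per mille for 20 amino acids).
    """
    expected = expected_per_mille(alphabet_size)
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "expected"
```

With this rule, SerSer at 15.416‰ would be classified as "over" and AlaAsp at 1.028‰ as "under".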

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
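The unit conversion can be written out as a small helper. This is only an illustrative sketch with hypothetical function names: a per mille value divided by 1000 gives a fraction, and divided by 10 gives a percentage.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille / 1000.0


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to a percentage."""
    return value_per_mille / 10.0
```

For instance, the SerSer entry of 15.416‰ corresponds to a fraction of about 0.0154, or roughly 1.54%.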

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
4.111AlaAla: 4.111 ± 1.006
0.0AlaCys: 0.0 ± 0.0
1.028AlaAsp: 1.028 ± 0.835
4.111AlaGlu: 4.111 ± 2.451
3.083AlaPhe: 3.083 ± 4.277
5.139AlaGly: 5.139 ± 2.202
0.0AlaHis: 0.0 ± 0.0
3.083AlaIle: 3.083 ± 1.285
0.0AlaLys: 0.0 ± 0.0
9.25AlaLeu: 9.25 ± 3.261
0.0AlaMet: 0.0 ± 0.718
2.055AlaAsn: 2.055 ± 0.805
6.166AlaPro: 6.166 ± 3.435
6.166AlaGln: 6.166 ± 2.847
5.139AlaArg: 5.139 ± 0.901
5.139AlaSer: 5.139 ± 2.278
2.055AlaThr: 2.055 ± 0.857
3.083AlaVal: 3.083 ± 2.859
0.0AlaTrp: 0.0 ± 0.0
0.0AlaTyr: 0.0 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
2.055CysAsp: 2.055 ± 0.857
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
1.028CysGly: 1.028 ± 0.807
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.0CysLys: 0.0 ± 0.0
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
1.028CysAsn: 1.028 ± 0.953
4.111CysPro: 4.111 ± 1.537
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
0.0CysSer: 0.0 ± 0.0
1.028CysThr: 1.028 ± 0.953
1.028CysVal: 1.028 ± 0.953
1.028CysTrp: 1.028 ± 1.426
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
4.111AspAla: 4.111 ± 1.288
0.0AspCys: 0.0 ± 0.0
4.111AspAsp: 4.111 ± 0.846
3.083AspGlu: 3.083 ± 1.569
2.055AspPhe: 2.055 ± 0.805
8.222AspGly: 8.222 ± 2.011
0.0AspHis: 0.0 ± 0.0
4.111AspIle: 4.111 ± 1.15
0.0AspLys: 0.0 ± 0.0
5.139AspLeu: 5.139 ± 2.278
0.0AspMet: 0.0 ± 0.0
0.0AspAsn: 0.0 ± 0.0
3.083AspPro: 3.083 ± 1.569
10.277AspGln: 10.277 ± 3.257
0.0AspArg: 0.0 ± 0.0
0.0AspSer: 0.0 ± 0.0
2.055AspThr: 2.055 ± 0.857
2.055AspVal: 2.055 ± 1.613
3.083AspTrp: 3.083 ± 1.44
2.055AspTyr: 2.055 ± 0.805
0.0AspXaa: 0.0 ± 0.0
Glu
4.111GluAla: 4.111 ± 1.625
1.028GluCys: 1.028 ± 0.953
4.111GluAsp: 4.111 ± 0.846
3.083GluGlu: 3.083 ± 1.285
2.055GluPhe: 2.055 ± 0.857
3.083GluGly: 3.083 ± 1.44
1.028GluHis: 1.028 ± 0.835
2.055GluIle: 2.055 ± 0.857
3.083GluLys: 3.083 ± 0.537
5.139GluLeu: 5.139 ± 1.655
1.028GluMet: 1.028 ± 0.807
0.0GluAsn: 0.0 ± 0.0
3.083GluPro: 3.083 ± 1.285
0.0GluGln: 0.0 ± 0.0
3.083GluArg: 3.083 ± 1.585
3.083GluSer: 3.083 ± 0.537
1.028GluThr: 1.028 ± 0.953
0.0GluVal: 0.0 ± 0.0
3.083GluTrp: 3.083 ± 0.537
11.305GluTyr: 11.305 ± 3.676
0.0GluXaa: 0.0 ± 0.0
Phe
0.0PheAla: 0.0 ± 0.0
0.0PheCys: 0.0 ± 0.0
5.139PheAsp: 5.139 ± 1.655
3.083PheGlu: 3.083 ± 0.537
3.083PhePhe: 3.083 ± 0.537
3.083PheGly: 3.083 ± 1.285
2.055PheHis: 2.055 ± 0.857
0.0PheIle: 0.0 ± 0.0
2.055PheLys: 2.055 ± 0.805
4.111PheLeu: 4.111 ± 1.713
2.055PheMet: 2.055 ± 1.455
1.028PheAsn: 1.028 ± 0.953
9.25PhePro: 9.25 ± 2.689
2.055PheGln: 2.055 ± 0.857
3.083PheArg: 3.083 ± 0.537
2.055PheSer: 2.055 ± 1.657
3.083PheThr: 3.083 ± 1.44
2.055PheVal: 2.055 ± 1.657
1.028PheTrp: 1.028 ± 0.953
1.028PheTyr: 1.028 ± 0.807
0.0PheXaa: 0.0 ± 0.0
Gly
4.111GlyAla: 4.111 ± 1.507
0.0GlyCys: 0.0 ± 0.0
2.055GlyAsp: 2.055 ± 1.906
3.083GlyGlu: 3.083 ± 1.44
2.055GlyPhe: 2.055 ± 0.857
2.055GlyGly: 2.055 ± 1.613
1.028GlyHis: 1.028 ± 0.835
1.028GlyIle: 1.028 ± 0.807
6.166GlyLys: 6.166 ± 1.412
5.139GlyLeu: 5.139 ± 2.671
0.0GlyMet: 0.0 ± 0.0
1.028GlyAsn: 1.028 ± 0.953
2.055GlyPro: 2.055 ± 1.434
1.028GlyGln: 1.028 ± 0.807
6.166GlyArg: 6.166 ± 1.383
9.25GlySer: 9.25 ± 2.379
7.194GlyThr: 7.194 ± 1.729
1.028GlyVal: 1.028 ± 0.953
0.0GlyTrp: 0.0 ± 0.0
3.083GlyTyr: 3.083 ± 0.537
0.0GlyXaa: 0.0 ± 0.0
His
4.111HisAla: 4.111 ± 1.537
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
0.0HisGlu: 0.0 ± 0.0
0.0HisPhe: 0.0 ± 0.0
3.083HisGly: 3.083 ± 1.44
0.0HisHis: 0.0 ± 0.0
0.0HisIle: 0.0 ± 0.0
2.055HisLys: 2.055 ± 0.857
2.055HisLeu: 2.055 ± 0.857
0.0HisMet: 0.0 ± 0.0
1.028HisAsn: 1.028 ± 0.807
3.083HisPro: 3.083 ± 1.487
2.055HisGln: 2.055 ± 1.078
1.028HisArg: 1.028 ± 0.953
0.0HisSer: 0.0 ± 0.0
0.0HisThr: 0.0 ± 0.0
5.139HisVal: 5.139 ± 1.066
0.0HisTrp: 0.0 ± 0.0
2.055HisTyr: 2.055 ± 1.613
0.0HisXaa: 0.0 ± 0.0
Ile
1.028IleAla: 1.028 ± 1.426
1.028IleCys: 1.028 ± 0.807
2.055IleAsp: 2.055 ± 0.805
2.055IleGlu: 2.055 ± 0.857
4.111IlePhe: 4.111 ± 1.288
2.055IleGly: 2.055 ± 1.657
3.083IleHis: 3.083 ± 1.487
5.139IleIle: 5.139 ± 2.227
2.055IleLys: 2.055 ± 0.805
1.028IleLeu: 1.028 ± 0.807
0.0IleMet: 0.0 ± 0.0
2.055IleAsn: 2.055 ± 0.857
2.055IlePro: 2.055 ± 1.434
4.111IleGln: 4.111 ± 1.713
2.055IleArg: 2.055 ± 1.657
5.139IleSer: 5.139 ± 2.241
5.139IleThr: 5.139 ± 1.066
2.055IleVal: 2.055 ± 0.805
0.0IleTrp: 0.0 ± 0.0
1.028IleTyr: 1.028 ± 0.807
0.0IleXaa: 0.0 ± 0.0
Lys
2.055LysAla: 2.055 ± 1.613
0.0LysCys: 0.0 ± 0.0
5.139LysAsp: 5.139 ± 1.066
2.055LysGlu: 2.055 ± 0.857
4.111LysPhe: 4.111 ± 0.846
2.055LysGly: 2.055 ± 0.805
1.028LysHis: 1.028 ± 0.953
2.055LysIle: 2.055 ± 0.805
13.361LysLys: 13.361 ± 3.631
4.111LysLeu: 4.111 ± 1.288
1.028LysMet: 1.028 ± 0.953
1.028LysAsn: 1.028 ± 0.953
5.139LysPro: 5.139 ± 2.227
7.194LysGln: 7.194 ± 1.858
3.083LysArg: 3.083 ± 2.859
1.028LysSer: 1.028 ± 0.953
4.111LysThr: 4.111 ± 0.846
2.055LysVal: 2.055 ± 0.857
2.055LysTrp: 2.055 ± 1.906
4.111LysTyr: 4.111 ± 2.171
0.0LysXaa: 0.0 ± 0.0
Leu
1.028LeuAla: 1.028 ± 1.426
2.055LeuCys: 2.055 ± 0.857
1.028LeuAsp: 1.028 ± 0.807
7.194LeuGlu: 7.194 ± 2.769
7.194LeuPhe: 7.194 ± 2.3
10.277LeuGly: 10.277 ± 3.118
8.222LeuHis: 8.222 ± 2.717
6.166LeuIle: 6.166 ± 2.42
0.0LeuLys: 0.0 ± 0.0
11.305LeuLeu: 11.305 ± 6.38
2.055LeuMet: 2.055 ± 0.857
1.028LeuAsn: 1.028 ± 0.953
5.139LeuPro: 5.139 ± 1.479
4.111LeuGln: 4.111 ± 1.15
2.055LeuArg: 2.055 ± 1.657
7.194LeuSer: 7.194 ± 2.102
4.111LeuThr: 4.111 ± 0.846
5.139LeuVal: 5.139 ± 1.084
1.028LeuTrp: 1.028 ± 0.953
3.083LeuTyr: 3.083 ± 2.859
0.0LeuXaa: 0.0 ± 0.0
Met
0.0MetAla: 0.0 ± 0.0
0.0MetCys: 0.0 ± 0.0
0.0MetAsp: 0.0 ± 0.0
1.028MetGlu: 1.028 ± 0.835
0.0MetPhe: 0.0 ± 0.0
0.0MetGly: 0.0 ± 0.0
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
1.028MetLys: 1.028 ± 0.807
1.028MetLeu: 1.028 ± 0.953
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
1.028MetPro: 1.028 ± 0.953
4.111MetGln: 4.111 ± 1.15
2.055MetArg: 2.055 ± 0.857
4.111MetSer: 4.111 ± 1.006
0.0MetThr: 0.0 ± 0.0
5.139MetVal: 5.139 ± 2.202
0.0MetTrp: 0.0 ± 0.0
1.028MetTyr: 1.028 ± 0.807
0.0MetXaa: 0.0 ± 0.0
Asn
0.0AsnAla: 0.0 ± 0.0
1.028AsnCys: 1.028 ± 0.953
1.028AsnAsp: 1.028 ± 0.807
0.0AsnGlu: 0.0 ± 0.0
1.028AsnPhe: 1.028 ± 0.953
2.055AsnGly: 2.055 ± 0.857
0.0AsnHis: 0.0 ± 0.0
2.055AsnIle: 2.055 ± 0.857
3.083AsnLys: 3.083 ± 0.537
0.0AsnLeu: 0.0 ± 0.0
0.0AsnMet: 0.0 ± 0.0
0.0AsnAsn: 0.0 ± 0.0
1.028AsnPro: 1.028 ± 0.807
0.0AsnGln: 0.0 ± 0.0
0.0AsnArg: 0.0 ± 0.0
5.139AsnSer: 5.139 ± 1.563
4.111AsnThr: 4.111 ± 3.813
4.111AsnVal: 4.111 ± 2.468
0.0AsnTrp: 0.0 ± 0.0
1.028AsnTyr: 1.028 ± 0.807
0.0AsnXaa: 0.0 ± 0.0
Pro
9.25ProAla: 9.25 ± 3.261
1.028ProCys: 1.028 ± 1.426
1.028ProAsp: 1.028 ± 0.953
4.111ProGlu: 4.111 ± 1.713
6.166ProPhe: 6.166 ± 1.5
4.111ProGly: 4.111 ± 1.288
2.055ProHis: 2.055 ± 0.857
3.083ProIle: 3.083 ± 2.941
7.194ProLys: 7.194 ± 3.053
1.028ProLeu: 1.028 ± 1.426
0.0ProMet: 0.0 ± 0.0
2.055ProAsn: 2.055 ± 0.857
2.055ProPro: 2.055 ± 0.857
3.083ProGln: 3.083 ± 1.299
6.166ProArg: 6.166 ± 1.037
3.083ProSer: 3.083 ± 2.744
3.083ProThr: 3.083 ± 1.44
8.222ProVal: 8.222 ± 2.292
1.028ProTrp: 1.028 ± 0.953
2.055ProTyr: 2.055 ± 1.078
0.0ProXaa: 0.0 ± 0.0
Gln
4.111GlnAla: 4.111 ± 1.537
0.0GlnCys: 0.0 ± 0.0
6.166GlnAsp: 6.166 ± 2.57
8.222GlnGlu: 8.222 ± 2.8
4.111GlnPhe: 4.111 ± 1.713
2.055GlnGly: 2.055 ± 1.906
1.028GlnHis: 1.028 ± 0.807
1.028GlnIle: 1.028 ± 0.953
0.0GlnLys: 0.0 ± 0.0
10.277GlnLeu: 10.277 ± 2.147
3.083GlnMet: 3.083 ± 1.743
4.111GlnAsn: 4.111 ± 1.713
4.111GlnPro: 4.111 ± 1.625
1.028GlnGln: 1.028 ± 0.835
4.111GlnArg: 4.111 ± 0.846
1.028GlnSer: 1.028 ± 0.953
4.111GlnThr: 4.111 ± 1.288
1.028GlnVal: 1.028 ± 0.835
1.028GlnTrp: 1.028 ± 0.807
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
5.139ArgAla: 5.139 ± 2.408
2.055ArgCys: 2.055 ± 0.857
6.166ArgAsp: 6.166 ± 2.034
2.055ArgGlu: 2.055 ± 1.657
3.083ArgPhe: 3.083 ± 0.537
0.0ArgGly: 0.0 ± 0.0
1.028ArgHis: 1.028 ± 0.807
2.055ArgIle: 2.055 ± 0.857
6.166ArgLys: 6.166 ± 3.139
4.111ArgLeu: 4.111 ± 1.15
2.055ArgMet: 2.055 ± 0.857
0.0ArgAsn: 0.0 ± 0.0
4.111ArgPro: 4.111 ± 1.865
4.111ArgGln: 4.111 ± 1.006
6.166ArgArg: 6.166 ± 1.514
7.194ArgSer: 7.194 ± 3.124
3.083ArgThr: 3.083 ± 1.363
2.055ArgVal: 2.055 ± 1.906
0.0ArgTrp: 0.0 ± 0.0
0.0ArgTyr: 0.0 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
6.166SerAla: 6.166 ± 2.276
1.028SerCys: 1.028 ± 1.426
4.111SerAsp: 4.111 ± 1.288
3.083SerGlu: 3.083 ± 0.537
2.055SerPhe: 2.055 ± 1.657
2.055SerGly: 2.055 ± 1.657
1.028SerHis: 1.028 ± 0.807
3.083SerIle: 3.083 ± 1.44
7.194SerLys: 7.194 ± 1.858
6.166SerLeu: 6.166 ± 2.23
4.111SerMet: 4.111 ± 1.15
0.0SerAsn: 0.0 ± 0.0
7.194SerPro: 7.194 ± 2.102
4.111SerGln: 4.111 ± 1.625
4.111SerArg: 4.111 ± 1.006
15.416SerSer: 15.416 ± 4.683
4.111SerThr: 4.111 ± 1.15
4.111SerVal: 4.111 ± 1.288
0.0SerTrp: 0.0 ± 0.0
1.028SerTyr: 1.028 ± 0.835
0.0SerXaa: 0.0 ± 0.0
Thr
4.111ThrAla: 4.111 ± 1.288
1.028ThrCys: 1.028 ± 0.953
2.055ThrAsp: 2.055 ± 1.906
3.083ThrGlu: 3.083 ± 0.537
2.055ThrPhe: 2.055 ± 0.857
2.055ThrGly: 2.055 ± 0.805
1.028ThrHis: 1.028 ± 0.807
2.055ThrIle: 2.055 ± 1.906
6.166ThrLys: 6.166 ± 1.5
7.194ThrLeu: 7.194 ± 1.438
1.028ThrMet: 1.028 ± 0.953
2.055ThrAsn: 2.055 ± 0.805
1.028ThrPro: 1.028 ± 0.835
3.083ThrGln: 3.083 ± 0.537
7.194ThrArg: 7.194 ± 1.165
6.166ThrSer: 6.166 ± 2.973
6.166ThrThr: 6.166 ± 4.336
0.0ThrVal: 0.0 ± 0.0
2.055ThrTrp: 2.055 ± 0.805
3.083ThrTyr: 3.083 ± 0.537
0.0ThrXaa: 0.0 ± 0.0
Val
3.083ValAla: 3.083 ± 1.856
0.0ValCys: 0.0 ± 0.0
4.111ValAsp: 4.111 ± 1.15
3.083ValGlu: 3.083 ± 0.537
0.0ValPhe: 0.0 ± 0.0
3.083ValGly: 3.083 ± 1.363
2.055ValHis: 2.055 ± 0.805
2.055ValIle: 2.055 ± 0.805
4.111ValLys: 4.111 ± 3.813
5.139ValLeu: 5.139 ± 1.066
1.028ValMet: 1.028 ± 0.73
3.083ValAsn: 3.083 ± 1.569
2.055ValPro: 2.055 ± 0.805
2.055ValGln: 2.055 ± 0.857
5.139ValArg: 5.139 ± 2.408
2.055ValSer: 2.055 ± 1.906
4.111ValThr: 4.111 ± 1.15
2.055ValVal: 2.055 ± 1.906
0.0ValTrp: 0.0 ± 0.0
2.055ValTyr: 2.055 ± 1.906
0.0ValXaa: 0.0 ± 0.0
Trp
4.111TrpAla: 4.111 ± 1.537
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
2.055TrpPhe: 2.055 ± 0.857
1.028TrpGly: 1.028 ± 0.953
1.028TrpHis: 1.028 ± 0.953
0.0TrpIle: 0.0 ± 0.0
1.028TrpLys: 1.028 ± 0.807
3.083TrpLeu: 3.083 ± 1.569
0.0TrpMet: 0.0 ± 0.0
1.028TrpAsn: 1.028 ± 0.953
1.028TrpPro: 1.028 ± 0.953
1.028TrpGln: 1.028 ± 0.953
0.0TrpArg: 0.0 ± 0.0
0.0TrpSer: 0.0 ± 0.0
0.0TrpThr: 0.0 ± 0.0
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.0TyrAla: 0.0 ± 0.0
1.028TyrCys: 1.028 ± 0.807
2.055TyrAsp: 2.055 ± 0.805
2.055TyrGlu: 2.055 ± 0.805
1.028TyrPhe: 1.028 ± 0.953
1.028TyrGly: 1.028 ± 0.807
0.0TyrHis: 0.0 ± 0.0
8.222TyrIle: 8.222 ± 3.652
2.055TyrLys: 2.055 ± 0.805
4.111TyrLeu: 4.111 ± 0.846
2.055TyrMet: 2.055 ± 0.805
3.083TyrAsn: 3.083 ± 1.299
3.083TyrPro: 3.083 ± 0.537
1.028TyrGln: 1.028 ± 0.953
0.0TyrArg: 0.0 ± 0.0
3.083TyrSer: 3.083 ± 1.487
4.111TyrThr: 4.111 ± 1.15
0.0TyrVal: 0.0 ± 0.0
0.0TyrTrp: 0.0 ± 0.0
1.028TyrTyr: 1.028 ± 0.807
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (974 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
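One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread of those values is reported as the error. This is an assumption about the procedure, not a description of the database's actual implementation; the function names, the fixed seed, and the use of the population standard deviation are all illustrative choices.

```python
import random
import statistics
from collections import Counter


def per_mille(proteins):
    """Dipeptide frequencies (per mille) over a list of protein sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def bootstrap_error(proteins, dipeptide, rounds=100, seed=0):
    """Estimate the spread of one dipeptide frequency.

    Assumption: each round draws len(proteins) proteins with
    replacement, and the error is the standard deviation of the
    frequency across rounds.
    """
    rng = random.Random(seed)  # fixed seed only for reproducibility
    samples = []
    for _ in range(rounds):
        resampled = rng.choices(proteins, k=len(proteins))
        samples.append(per_mille(resampled).get(dipeptide, 0.0))
    return statistics.pstdev(samples)
```

With only 4 proteins, as in this proteome, each replicate is drawn from a very small pool, which may explain why some errors in the table are large relative to the values themselves.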

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in UniProt.
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021. doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski