Amino acid dipeptide frequency for Raphanus sativus cryptic virus 3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
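As a rough illustration of what such a calculation involves, the sketch below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. The function name `dipeptide_permille` and the choice not to count pairs across protein boundaries are assumptions made for this example; the page does not state how the database itself handles these details.

```python
from collections import Counter


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: pairs are overlapping and are counted within each
    protein only, never across the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # zip(seq, seq[1:]) yields every pair of neighbouring residues
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}
```

For example, calling `dipeptide_permille(["AAC"])` counts the two pairs AA and AC once each. One-letter codes are used here for brevity, whereas the table uses three-letter codes.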

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
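The expected value and the colouring rule can be sketched as follows. This is only an illustration: the names `expected_permille` and `colour` are hypothetical, and the page does not say whether the colouring uses the 400 or the 441 baseline, or whether the error estimate is taken into account.

```python
def expected_permille(n_residues=20):
    """Uniform expectation for one dipeptide, in per mille.

    20 residues give 400 dipeptides, 21 (with Xaa) give 441.
    """
    return 1000.0 / (n_residues ** 2)


def colour(value_permille, expected):
    """Assumed rule: red above the expectation, blue below it."""
    if value_permille > expected:
        return "red"
    if value_permille < expected:
        return "blue"
    return "none"
```

With the 400-dipeptide baseline the expectation is 2.5‰, so under this assumed rule a value such as 8.197‰ would be coloured red and 1.171‰ blue.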

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
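The unit conversion is simple arithmetic; a minimal sketch with hypothetical helper names is given below for readers who post-process the table.

```python
def permille_to_fraction(value_permille):
    """Convert per mille to a plain fraction (multiply by 10**-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert per mille to percent (1 per mille = 0.1 percent)."""
    return value_permille * 0.1
```

For instance, a table entry of 8.197‰ corresponds to a fraction of about 0.008197, or about 0.82%.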

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
8.197AlaAla: 8.197 ± 0.103
2.342AlaCys: 2.342 ± 1.888
3.513AlaAsp: 3.513 ± 0.524
4.684AlaGlu: 4.684 ± 0.42
7.026AlaPhe: 7.026 ± 0.631
5.855AlaGly: 5.855 ± 0.313
2.342AlaHis: 2.342 ± 0.21
8.197AlaIle: 8.197 ± 0.103
2.342AlaLys: 2.342 ± 1.468
4.684AlaLeu: 4.684 ± 2.098
1.171AlaMet: 1.171 ± 0.944
4.684AlaAsn: 4.684 ± 2.098
2.342AlaPro: 2.342 ± 0.21
2.342AlaGln: 2.342 ± 0.21
3.513AlaArg: 3.513 ± 0.524
3.513AlaSer: 3.513 ± 1.154
4.684AlaThr: 4.684 ± 2.098
4.684AlaVal: 4.684 ± 1.257
0.0AlaTrp: 0.0 ± 0.0
2.342AlaTyr: 2.342 ± 1.468
0.0AlaXaa: 0.0 ± 0.0
Cys
1.171CysAla: 1.171 ± 0.944
1.171CysCys: 1.171 ± 0.944
0.0CysAsp: 0.0 ± 0.0
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
1.171CysGly: 1.171 ± 0.734
1.171CysHis: 1.171 ± 0.734
1.171CysIle: 1.171 ± 0.734
1.171CysLys: 1.171 ± 0.944
2.342CysLeu: 2.342 ± 0.21
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
0.0CysSer: 0.0 ± 0.0
1.171CysThr: 1.171 ± 0.944
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
1.171CysTyr: 1.171 ± 0.944
0.0CysXaa: 0.0 ± 0.0
Asp
4.684AspAla: 4.684 ± 0.42
2.342AspCys: 2.342 ± 1.468
3.513AspAsp: 3.513 ± 0.524
2.342AspGlu: 2.342 ± 1.468
1.171AspPhe: 1.171 ± 0.944
2.342AspGly: 2.342 ± 1.468
1.171AspHis: 1.171 ± 0.944
4.684AspIle: 4.684 ± 0.42
1.171AspLys: 1.171 ± 0.944
8.197AspLeu: 8.197 ± 3.459
0.0AspMet: 0.0 ± 0.0
1.171AspAsn: 1.171 ± 0.944
3.513AspPro: 3.513 ± 0.524
1.171AspGln: 1.171 ± 0.944
3.513AspArg: 3.513 ± 1.154
1.171AspSer: 1.171 ± 0.734
0.0AspThr: 0.0 ± 0.0
3.513AspVal: 3.513 ± 1.154
4.684AspTrp: 4.684 ± 0.42
2.342AspTyr: 2.342 ± 0.21
0.0AspXaa: 0.0 ± 0.0
Glu
2.342GluAla: 2.342 ± 0.21
0.0GluCys: 0.0 ± 0.0
5.855GluAsp: 5.855 ± 1.365
3.513GluGlu: 3.513 ± 2.201
2.342GluPhe: 2.342 ± 1.468
3.513GluGly: 3.513 ± 0.524
1.171GluHis: 1.171 ± 0.734
3.513GluIle: 3.513 ± 0.524
3.513GluLys: 3.513 ± 2.201
4.684GluLeu: 4.684 ± 0.42
1.171GluMet: 1.171 ± 0.734
2.342GluAsn: 2.342 ± 0.21
1.171GluPro: 1.171 ± 0.944
1.171GluGln: 1.171 ± 0.734
0.0GluArg: 0.0 ± 0.0
1.171GluSer: 1.171 ± 0.734
5.855GluThr: 5.855 ± 1.991
2.342GluVal: 2.342 ± 0.21
0.0GluTrp: 0.0 ± 0.0
7.026GluTyr: 7.026 ± 0.631
0.0GluXaa: 0.0 ± 0.0
Phe
2.342PheAla: 2.342 ± 1.468
0.0PheCys: 0.0 ± 0.0
2.342PheAsp: 2.342 ± 0.21
3.513PheGlu: 3.513 ± 0.524
0.0PhePhe: 0.0 ± 0.0
2.342PheGly: 2.342 ± 0.21
1.171PheHis: 1.171 ± 0.734
4.684PheIle: 4.684 ± 2.935
1.171PheLys: 1.171 ± 0.734
3.513PheLeu: 3.513 ± 1.154
1.171PheMet: 1.171 ± 0.734
5.855PheAsn: 5.855 ± 3.042
3.513PhePro: 3.513 ± 0.524
1.171PheGln: 1.171 ± 0.944
3.513PheArg: 3.513 ± 1.154
2.342PheSer: 2.342 ± 1.888
1.171PheThr: 1.171 ± 0.734
1.171PheVal: 1.171 ± 0.944
0.0PheTrp: 0.0 ± 0.0
1.171PheTyr: 1.171 ± 0.944
0.0PheXaa: 0.0 ± 0.0
Gly
3.513GlyAla: 3.513 ± 2.832
0.0GlyCys: 0.0 ± 0.0
2.342GlyAsp: 2.342 ± 1.468
2.342GlyGlu: 2.342 ± 1.468
1.171GlyPhe: 1.171 ± 0.734
2.342GlyGly: 2.342 ± 1.468
0.0GlyHis: 0.0 ± 0.0
4.684GlyIle: 4.684 ± 1.257
1.171GlyLys: 1.171 ± 0.734
7.026GlyLeu: 7.026 ± 1.047
1.171GlyMet: 1.171 ± 0.944
3.513GlyAsn: 3.513 ± 1.154
2.342GlyPro: 2.342 ± 0.21
0.0GlyGln: 0.0 ± 0.0
7.026GlyArg: 7.026 ± 2.725
4.684GlySer: 4.684 ± 0.42
2.342GlyThr: 2.342 ± 0.21
1.171GlyVal: 1.171 ± 0.944
3.513GlyTrp: 3.513 ± 0.524
4.684GlyTyr: 4.684 ± 2.935
0.0GlyXaa: 0.0 ± 0.0
His
4.684HisAla: 4.684 ± 0.42
0.0HisCys: 0.0 ± 0.0
1.171HisAsp: 1.171 ± 0.944
1.171HisGlu: 1.171 ± 0.944
1.171HisPhe: 1.171 ± 0.944
0.0HisGly: 0.0 ± 0.0
0.0HisHis: 0.0 ± 0.0
4.684HisIle: 4.684 ± 2.098
2.342HisLys: 2.342 ± 1.468
2.342HisLeu: 2.342 ± 0.21
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
0.0HisPro: 0.0 ± 0.0
2.342HisGln: 2.342 ± 0.21
1.171HisArg: 1.171 ± 0.944
0.0HisSer: 0.0 ± 0.0
3.513HisThr: 3.513 ± 0.524
1.171HisVal: 1.171 ± 0.734
0.0HisTrp: 0.0 ± 0.0
2.342HisTyr: 2.342 ± 0.21
0.0HisXaa: 0.0 ± 0.0
Ile
7.026IleAla: 7.026 ± 2.725
4.684IleCys: 4.684 ± 2.098
1.171IleAsp: 1.171 ± 0.944
4.684IleGlu: 4.684 ± 2.935
1.171IlePhe: 1.171 ± 0.734
3.513IleGly: 3.513 ± 0.524
3.513IleHis: 3.513 ± 0.524
1.171IleIle: 1.171 ± 0.734
3.513IleLys: 3.513 ± 2.201
4.684IleLeu: 4.684 ± 2.935
0.0IleMet: 0.0 ± 0.0
2.342IleAsn: 2.342 ± 1.888
5.855IlePro: 5.855 ± 0.313
1.171IleGln: 1.171 ± 0.734
4.684IleArg: 4.684 ± 2.098
8.197IleSer: 8.197 ± 0.103
2.342IleThr: 2.342 ± 0.21
5.855IleVal: 5.855 ± 1.365
0.0IleTrp: 0.0 ± 0.0
5.855IleTyr: 5.855 ± 0.313
0.0IleXaa: 0.0 ± 0.0
Lys
2.342LysAla: 2.342 ± 0.21
0.0LysCys: 0.0 ± 0.0
1.171LysAsp: 1.171 ± 0.944
0.0LysGlu: 0.0 ± 0.0
1.171LysPhe: 1.171 ± 0.944
3.513LysGly: 3.513 ± 2.201
1.171LysHis: 1.171 ± 0.734
3.513LysIle: 3.513 ± 0.524
3.513LysLys: 3.513 ± 1.154
1.171LysLeu: 1.171 ± 0.944
1.171LysMet: 1.171 ± 0.734
1.171LysAsn: 1.171 ± 0.944
0.0LysPro: 0.0 ± 0.0
2.342LysGln: 2.342 ± 1.468
4.684LysArg: 4.684 ± 1.257
4.684LysSer: 4.684 ± 1.257
4.684LysThr: 4.684 ± 2.935
2.342LysVal: 2.342 ± 1.468
0.0LysTrp: 0.0 ± 0.0
3.513LysTyr: 3.513 ± 1.154
0.0LysXaa: 0.0 ± 0.0
Leu
4.684LeuAla: 4.684 ± 0.42
0.0LeuCys: 0.0 ± 0.0
5.855LeuAsp: 5.855 ± 1.991
3.513LeuGlu: 3.513 ± 0.524
4.684LeuPhe: 4.684 ± 2.098
5.855LeuGly: 5.855 ± 0.313
1.171LeuHis: 1.171 ± 0.944
8.197LeuIle: 8.197 ± 1.575
4.684LeuLys: 4.684 ± 0.42
5.855LeuLeu: 5.855 ± 1.991
0.0LeuMet: 0.0 ± 0.0
4.684LeuAsn: 4.684 ± 1.257
5.855LeuPro: 5.855 ± 1.365
3.513LeuGln: 3.513 ± 2.201
3.513LeuArg: 3.513 ± 2.201
4.684LeuSer: 4.684 ± 0.42
3.513LeuThr: 3.513 ± 2.201
7.026LeuVal: 7.026 ± 2.725
1.171LeuTrp: 1.171 ± 0.734
4.684LeuTyr: 4.684 ± 2.098
0.0LeuXaa: 0.0 ± 0.0
Met
1.171MetAla: 1.171 ± 0.944
0.0MetCys: 0.0 ± 0.0
1.171MetAsp: 1.171 ± 0.734
2.342MetGlu: 2.342 ± 0.21
0.0MetPhe: 0.0 ± 0.0
0.0MetGly: 0.0 ± 0.0
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
1.171MetLys: 1.171 ± 0.944
0.0MetLeu: 0.0 ± 0.0
1.171MetMet: 1.171 ± 0.558
0.0MetAsn: 0.0 ± 0.0
0.0MetPro: 0.0 ± 0.0
1.171MetGln: 1.171 ± 0.944
2.342MetArg: 2.342 ± 0.21
1.171MetSer: 1.171 ± 0.944
0.0MetThr: 0.0 ± 0.0
2.342MetVal: 2.342 ± 0.21
0.0MetTrp: 0.0 ± 0.0
2.342MetTyr: 2.342 ± 1.888
0.0MetXaa: 0.0 ± 0.0
Asn
1.171AsnAla: 1.171 ± 0.944
0.0AsnCys: 0.0 ± 0.0
1.171AsnAsp: 1.171 ± 0.734
3.513AsnGlu: 3.513 ± 1.154
2.342AsnPhe: 2.342 ± 1.888
1.171AsnGly: 1.171 ± 0.944
2.342AsnHis: 2.342 ± 0.21
2.342AsnIle: 2.342 ± 0.21
0.0AsnLys: 0.0 ± 0.0
0.0AsnLeu: 0.0 ± 0.0
1.171AsnMet: 1.171 ± 1.48
4.684AsnAsn: 4.684 ± 2.098
7.026AsnPro: 7.026 ± 2.309
2.342AsnGln: 2.342 ± 0.21
4.684AsnArg: 4.684 ± 1.257
1.171AsnSer: 1.171 ± 0.944
1.171AsnThr: 1.171 ± 0.734
4.684AsnVal: 4.684 ± 1.257
0.0AsnTrp: 0.0 ± 0.0
2.342AsnTyr: 2.342 ± 1.888
0.0AsnXaa: 0.0 ± 0.0
Pro
4.684ProAla: 4.684 ± 3.776
0.0ProCys: 0.0 ± 0.0
5.855ProAsp: 5.855 ± 1.991
7.026ProGlu: 7.026 ± 0.631
4.684ProPhe: 4.684 ± 0.42
3.513ProGly: 3.513 ± 1.154
2.342ProHis: 2.342 ± 1.888
2.342ProIle: 2.342 ± 0.21
2.342ProLys: 2.342 ± 0.21
3.513ProLeu: 3.513 ± 1.154
0.0ProMet: 0.0 ± 0.0
2.342ProAsn: 2.342 ± 0.21
4.684ProPro: 4.684 ± 3.776
3.513ProGln: 3.513 ± 1.154
1.171ProArg: 1.171 ± 0.944
8.197ProSer: 8.197 ± 1.575
2.342ProThr: 2.342 ± 0.21
3.513ProVal: 3.513 ± 0.524
0.0ProTrp: 0.0 ± 0.0
1.171ProTyr: 1.171 ± 0.944
0.0ProXaa: 0.0 ± 0.0
Gln
2.342GlnAla: 2.342 ± 1.468
0.0GlnCys: 0.0 ± 0.0
0.0GlnAsp: 0.0 ± 0.0
1.171GlnGlu: 1.171 ± 0.734
2.342GlnPhe: 2.342 ± 0.21
2.342GlnGly: 2.342 ± 1.468
1.171GlnHis: 1.171 ± 0.734
1.171GlnIle: 1.171 ± 0.944
0.0GlnLys: 0.0 ± 0.0
7.026GlnLeu: 7.026 ± 1.047
1.171GlnMet: 1.171 ± 0.944
1.171GlnAsn: 1.171 ± 0.734
3.513GlnPro: 3.513 ± 2.832
0.0GlnGln: 0.0 ± 0.0
2.342GlnArg: 2.342 ± 0.21
1.171GlnSer: 1.171 ± 0.734
1.171GlnThr: 1.171 ± 0.944
3.513GlnVal: 3.513 ± 0.524
0.0GlnTrp: 0.0 ± 0.0
3.513GlnTyr: 3.513 ± 2.201
0.0GlnXaa: 0.0 ± 0.0
Arg
9.368ArgAla: 9.368 ± 0.837
1.171ArgCys: 1.171 ± 0.734
8.197ArgAsp: 8.197 ± 1.575
2.342ArgGlu: 2.342 ± 1.468
1.171ArgPhe: 1.171 ± 0.734
1.171ArgGly: 1.171 ± 0.734
2.342ArgHis: 2.342 ± 0.21
5.855ArgIle: 5.855 ± 1.991
3.513ArgLys: 3.513 ± 2.832
7.026ArgLeu: 7.026 ± 2.725
0.0ArgMet: 0.0 ± 0.0
1.171ArgAsn: 1.171 ± 0.734
3.513ArgPro: 3.513 ± 2.832
3.513ArgGln: 3.513 ± 2.201
3.513ArgArg: 3.513 ± 0.524
7.026ArgSer: 7.026 ± 0.631
3.513ArgThr: 3.513 ± 1.154
1.171ArgVal: 1.171 ± 0.734
0.0ArgTrp: 0.0 ± 0.0
2.342ArgTyr: 2.342 ± 0.21
0.0ArgXaa: 0.0 ± 0.0
Ser
7.026SerAla: 7.026 ± 1.047
0.0SerCys: 0.0 ± 0.0
3.513SerAsp: 3.513 ± 2.201
2.342SerGlu: 2.342 ± 0.21
4.684SerPhe: 4.684 ± 0.42
7.026SerGly: 7.026 ± 2.309
1.171SerHis: 1.171 ± 0.734
3.513SerIle: 3.513 ± 2.201
2.342SerLys: 2.342 ± 0.21
9.368SerLeu: 9.368 ± 0.841
0.0SerMet: 0.0 ± 0.0
2.342SerAsn: 2.342 ± 1.888
1.171SerPro: 1.171 ± 0.944
0.0SerGln: 0.0 ± 0.0
7.026SerArg: 7.026 ± 1.047
5.855SerSer: 5.855 ± 0.313
3.513SerThr: 3.513 ± 1.154
3.513SerVal: 3.513 ± 0.524
1.171SerTrp: 1.171 ± 0.944
7.026SerTyr: 7.026 ± 1.047
0.0SerXaa: 0.0 ± 0.0
Thr
4.684ThrAla: 4.684 ± 2.098
0.0ThrCys: 0.0 ± 0.0
3.513ThrAsp: 3.513 ± 1.154
4.684ThrGlu: 4.684 ± 1.257
1.171ThrPhe: 1.171 ± 0.734
1.171ThrGly: 1.171 ± 0.734
1.171ThrHis: 1.171 ± 0.944
2.342ThrIle: 2.342 ± 1.468
3.513ThrLys: 3.513 ± 2.201
2.342ThrLeu: 2.342 ± 1.468
2.342ThrMet: 2.342 ± 1.888
2.342ThrAsn: 2.342 ± 1.468
4.684ThrPro: 4.684 ± 0.42
2.342ThrGln: 2.342 ± 1.468
1.171ThrArg: 1.171 ± 0.734
7.026ThrSer: 7.026 ± 0.631
1.171ThrThr: 1.171 ± 0.944
1.171ThrVal: 1.171 ± 0.734
1.171ThrTrp: 1.171 ± 0.944
2.342ThrTyr: 2.342 ± 0.21
0.0ThrXaa: 0.0 ± 0.0
Val
5.855ValAla: 5.855 ± 0.313
0.0ValCys: 0.0 ± 0.0
0.0ValAsp: 0.0 ± 0.0
0.0ValGlu: 0.0 ± 0.0
1.171ValPhe: 1.171 ± 0.734
3.513ValGly: 3.513 ± 0.524
2.342ValHis: 2.342 ± 1.888
3.513ValIle: 3.513 ± 0.524
1.171ValLys: 1.171 ± 0.734
3.513ValLeu: 3.513 ± 0.524
2.342ValMet: 2.342 ± 0.21
3.513ValAsn: 3.513 ± 1.154
8.197ValPro: 8.197 ± 0.103
4.684ValGln: 4.684 ± 0.42
4.684ValArg: 4.684 ± 1.257
2.342ValSer: 2.342 ± 1.468
3.513ValThr: 3.513 ± 1.154
0.0ValVal: 0.0 ± 0.0
1.171ValTrp: 1.171 ± 0.734
2.342ValTyr: 2.342 ± 1.468
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
1.171TrpPhe: 1.171 ± 0.944
1.171TrpGly: 1.171 ± 0.734
1.171TrpHis: 1.171 ± 0.944
1.171TrpIle: 1.171 ± 0.734
1.171TrpLys: 1.171 ± 0.734
1.171TrpLeu: 1.171 ± 0.944
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
1.171TrpPro: 1.171 ± 0.944
1.171TrpGln: 1.171 ± 0.944
0.0TrpArg: 0.0 ± 0.0
2.342TrpSer: 2.342 ± 1.468
0.0TrpThr: 0.0 ± 0.0
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
1.171TrpTyr: 1.171 ± 0.734
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.171TyrAla: 1.171 ± 0.944
0.0TyrCys: 0.0 ± 0.0
2.342TyrAsp: 2.342 ± 1.888
3.513TyrGlu: 3.513 ± 1.154
3.513TyrPhe: 3.513 ± 2.201
3.513TyrGly: 3.513 ± 0.524
1.171TyrHis: 1.171 ± 0.944
4.684TyrIle: 4.684 ± 2.098
2.342TyrLys: 2.342 ± 1.468
4.684TyrLeu: 4.684 ± 0.42
1.171TyrMet: 1.171 ± 0.944
1.171TyrAsn: 1.171 ± 0.734
4.684TyrPro: 4.684 ± 0.42
1.171TyrGln: 1.171 ± 0.734
9.368TyrArg: 9.368 ± 0.841
5.855TyrSer: 5.855 ± 1.991
4.684TyrThr: 4.684 ± 2.935
4.684TyrVal: 4.684 ± 2.098
0.0TyrTrp: 0.0 ± 0.0
7.026TyrTyr: 7.026 ± 2.309
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (855 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
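A protein-level bootstrap of this kind can be sketched as below. This is an assumption-laden illustration, not the database's code: whole proteins are resampled with replacement, the frequency of one dipeptide is recomputed for each replicate, and the spread is summarised as a population standard deviation. The function names, the fixed seed and the choice of standard deviation as the error measure are all hypothetical.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Frequency of one dipeptide (per mille) over a list of sequences."""
    hits = total = 0
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            total += 1
            if first + second == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Spread of the dipeptide frequency across bootstrap replicates.

    Each replicate draws len(proteins) whole proteins with replacement,
    so the resampling unit is the protein, not the single residue.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(rounds):
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)
```

With only 2 proteins, as here, each replicate can contain only a handful of distinct combinations, so error estimates obtained this way should be read with caution.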

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski