Amino acid dipepetide frequency for Shahe endorna-like virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.694AlaAla: 3.694 ± 0.0
1.944AlaCys: 1.944 ± 0.0
3.111AlaAsp: 3.111 ± 0.0
3.305AlaGlu: 3.305 ± 0.0
1.556AlaPhe: 1.556 ± 0.0
4.278AlaGly: 4.278 ± 0.0
1.167AlaHis: 1.167 ± 0.0
4.278AlaIle: 4.278 ± 0.0
3.889AlaLys: 3.889 ± 0.0
4.083AlaLeu: 4.083 ± 0.0
2.333AlaMet: 2.333 ± 0.0
2.528AlaAsn: 2.528 ± 0.0
1.75AlaPro: 1.75 ± 0.0
0.972AlaGln: 0.972 ± 0.0
2.139AlaArg: 2.139 ± 0.0
4.278AlaSer: 4.278 ± 0.0
3.889AlaThr: 3.889 ± 0.0
6.222AlaVal: 6.222 ± 0.0
0.583AlaTrp: 0.583 ± 0.0
1.944AlaTyr: 1.944 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
0.972CysAla: 0.972 ± 0.0
0.583CysCys: 0.583 ± 0.0
1.361CysAsp: 1.361 ± 0.0
1.556CysGlu: 1.556 ± 0.0
0.778CysPhe: 0.778 ± 0.0
1.944CysGly: 1.944 ± 0.0
0.583CysHis: 0.583 ± 0.0
1.361CysIle: 1.361 ± 0.0
2.139CysLys: 2.139 ± 0.0
1.361CysLeu: 1.361 ± 0.0
0.583CysMet: 0.583 ± 0.0
0.972CysAsn: 0.972 ± 0.0
1.167CysPro: 1.167 ± 0.0
0.778CysGln: 0.778 ± 0.0
1.167CysArg: 1.167 ± 0.0
1.361CysSer: 1.361 ± 0.0
0.778CysThr: 0.778 ± 0.0
2.139CysVal: 2.139 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.583CysTyr: 0.583 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.305AspAla: 3.305 ± 0.0
0.778AspCys: 0.778 ± 0.0
4.278AspAsp: 4.278 ± 0.0
2.917AspGlu: 2.917 ± 0.0
1.944AspPhe: 1.944 ± 0.0
3.694AspGly: 3.694 ± 0.0
0.972AspHis: 0.972 ± 0.0
3.111AspIle: 3.111 ± 0.0
1.944AspLys: 1.944 ± 0.0
8.166AspLeu: 8.166 ± 0.0
0.778AspMet: 0.778 ± 0.0
3.5AspAsn: 3.5 ± 0.0
3.305AspPro: 3.305 ± 0.0
2.333AspGln: 2.333 ± 0.0
2.917AspArg: 2.917 ± 0.0
3.111AspSer: 3.111 ± 0.0
4.278AspThr: 4.278 ± 0.0
4.083AspVal: 4.083 ± 0.0
0.972AspTrp: 0.972 ± 0.0
2.139AspTyr: 2.139 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
2.528GluAla: 2.528 ± 0.0
1.75GluCys: 1.75 ± 0.0
2.333GluAsp: 2.333 ± 0.0
2.917GluGlu: 2.917 ± 0.0
1.75GluPhe: 1.75 ± 0.0
2.528GluGly: 2.528 ± 0.0
1.556GluHis: 1.556 ± 0.0
2.722GluIle: 2.722 ± 0.0
2.139GluLys: 2.139 ± 0.0
9.139GluLeu: 9.139 ± 0.0
1.75GluMet: 1.75 ± 0.0
1.944GluAsn: 1.944 ± 0.0
3.305GluPro: 3.305 ± 0.0
0.778GluGln: 0.778 ± 0.0
3.305GluArg: 3.305 ± 0.0
5.833GluSer: 5.833 ± 0.0
2.917GluThr: 2.917 ± 0.0
4.861GluVal: 4.861 ± 0.0
0.778GluTrp: 0.778 ± 0.0
2.333GluTyr: 2.333 ± 0.0
0.0GluXaa: 0.0 ± 0.0
Phe
1.361PheAla: 1.361 ± 0.0
0.583PheCys: 0.583 ± 0.0
2.333PheAsp: 2.333 ± 0.0
1.167PheGlu: 1.167 ± 0.0
0.583PhePhe: 0.583 ± 0.0
2.333PheGly: 2.333 ± 0.0
0.972PheHis: 0.972 ± 0.0
2.139PheIle: 2.139 ± 0.0
0.972PheLys: 0.972 ± 0.0
0.972PheLeu: 0.972 ± 0.0
0.778PheMet: 0.778 ± 0.0
1.75PheAsn: 1.75 ± 0.0
0.778PhePro: 0.778 ± 0.0
0.389PheGln: 0.389 ± 0.0
0.972PheArg: 0.972 ± 0.0
1.944PheSer: 1.944 ± 0.0
2.139PheThr: 2.139 ± 0.0
0.972PheVal: 0.972 ± 0.0
0.583PheTrp: 0.583 ± 0.0
0.778PheTyr: 0.778 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
2.333GlyAla: 2.333 ± 0.0
1.361GlyCys: 1.361 ± 0.0
2.722GlyAsp: 2.722 ± 0.0
2.528GlyGlu: 2.528 ± 0.0
1.361GlyPhe: 1.361 ± 0.0
3.694GlyGly: 3.694 ± 0.0
2.917GlyHis: 2.917 ± 0.0
3.305GlyIle: 3.305 ± 0.0
4.472GlyLys: 4.472 ± 0.0
4.861GlyLeu: 4.861 ± 0.0
1.75GlyMet: 1.75 ± 0.0
2.333GlyAsn: 2.333 ± 0.0
2.917GlyPro: 2.917 ± 0.0
2.917GlyGln: 2.917 ± 0.0
3.111GlyArg: 3.111 ± 0.0
4.861GlySer: 4.861 ± 0.0
3.889GlyThr: 3.889 ± 0.0
4.472GlyVal: 4.472 ± 0.0
1.167GlyTrp: 1.167 ± 0.0
1.75GlyTyr: 1.75 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
1.75HisAla: 1.75 ± 0.0
0.778HisCys: 0.778 ± 0.0
1.944HisAsp: 1.944 ± 0.0
1.167HisGlu: 1.167 ± 0.0
0.389HisPhe: 0.389 ± 0.0
1.75HisGly: 1.75 ± 0.0
3.305HisHis: 3.305 ± 0.0
2.528HisIle: 2.528 ± 0.0
2.528HisLys: 2.528 ± 0.0
3.111HisLeu: 3.111 ± 0.0
0.778HisMet: 0.778 ± 0.0
2.528HisAsn: 2.528 ± 0.0
0.972HisPro: 0.972 ± 0.0
0.778HisGln: 0.778 ± 0.0
0.972HisArg: 0.972 ± 0.0
1.944HisSer: 1.944 ± 0.0
2.528HisThr: 2.528 ± 0.0
2.333HisVal: 2.333 ± 0.0
0.778HisTrp: 0.778 ± 0.0
2.139HisTyr: 2.139 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
3.305IleAla: 3.305 ± 0.0
1.361IleCys: 1.361 ± 0.0
4.667IleAsp: 4.667 ± 0.0
3.694IleGlu: 3.694 ± 0.0
1.361IlePhe: 1.361 ± 0.0
4.278IleGly: 4.278 ± 0.0
1.75IleHis: 1.75 ± 0.0
3.111IleIle: 3.111 ± 0.0
3.889IleLys: 3.889 ± 0.0
4.667IleLeu: 4.667 ± 0.0
1.75IleMet: 1.75 ± 0.0
3.5IleAsn: 3.5 ± 0.0
2.528IlePro: 2.528 ± 0.0
2.139IleGln: 2.139 ± 0.0
3.111IleArg: 3.111 ± 0.0
5.444IleSer: 5.444 ± 0.0
6.222IleThr: 6.222 ± 0.0
4.472IleVal: 4.472 ± 0.0
0.194IleTrp: 0.194 ± 0.0
0.778IleTyr: 0.778 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
2.722LysAla: 2.722 ± 0.0
0.778LysCys: 0.778 ± 0.0
2.722LysAsp: 2.722 ± 0.0
4.472LysGlu: 4.472 ± 0.0
2.139LysPhe: 2.139 ± 0.0
2.139LysGly: 2.139 ± 0.0
1.944LysHis: 1.944 ± 0.0
3.694LysIle: 3.694 ± 0.0
2.333LysLys: 2.333 ± 0.0
8.361LysLeu: 8.361 ± 0.0
0.972LysMet: 0.972 ± 0.0
1.556LysAsn: 1.556 ± 0.0
4.667LysPro: 4.667 ± 0.0
1.944LysGln: 1.944 ± 0.0
2.722LysArg: 2.722 ± 0.0
3.111LysSer: 3.111 ± 0.0
4.861LysThr: 4.861 ± 0.0
4.667LysVal: 4.667 ± 0.0
1.361LysTrp: 1.361 ± 0.0
1.361LysTyr: 1.361 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
6.028LeuAla: 6.028 ± 0.0
1.75LeuCys: 1.75 ± 0.0
4.861LeuAsp: 4.861 ± 0.0
6.222LeuGlu: 6.222 ± 0.0
2.528LeuPhe: 2.528 ± 0.0
5.25LeuGly: 5.25 ± 0.0
2.528LeuHis: 2.528 ± 0.0
4.278LeuIle: 4.278 ± 0.0
5.833LeuLys: 5.833 ± 0.0
9.722LeuLeu: 9.722 ± 0.0
3.5LeuMet: 3.5 ± 0.0
5.444LeuAsn: 5.444 ± 0.0
2.722LeuPro: 2.722 ± 0.0
3.111LeuGln: 3.111 ± 0.0
5.055LeuArg: 5.055 ± 0.0
7.0LeuSer: 7.0 ± 0.0
7.0LeuThr: 7.0 ± 0.0
6.222LeuVal: 6.222 ± 0.0
0.778LeuTrp: 0.778 ± 0.0
4.278LeuTyr: 4.278 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
2.528MetAla: 2.528 ± 0.0
0.778MetCys: 0.778 ± 0.0
1.556MetAsp: 1.556 ± 0.0
1.75MetGlu: 1.75 ± 0.0
0.778MetPhe: 0.778 ± 0.0
2.139MetGly: 2.139 ± 0.0
1.556MetHis: 1.556 ± 0.0
1.167MetIle: 1.167 ± 0.0
2.139MetLys: 2.139 ± 0.0
2.333MetLeu: 2.333 ± 0.0
0.972MetMet: 0.972 ± 0.0
0.583MetAsn: 0.583 ± 0.0
1.167MetPro: 1.167 ± 0.0
0.583MetGln: 0.583 ± 0.0
1.556MetArg: 1.556 ± 0.0
2.139MetSer: 2.139 ± 0.0
1.944MetThr: 1.944 ± 0.0
1.556MetVal: 1.556 ± 0.0
0.389MetTrp: 0.389 ± 0.0
1.361MetTyr: 1.361 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
1.944AsnAla: 1.944 ± 0.0
0.972AsnCys: 0.972 ± 0.0
1.944AsnAsp: 1.944 ± 0.0
1.944AsnGlu: 1.944 ± 0.0
1.167AsnPhe: 1.167 ± 0.0
1.556AsnGly: 1.556 ± 0.0
1.556AsnHis: 1.556 ± 0.0
2.528AsnIle: 2.528 ± 0.0
3.111AsnLys: 3.111 ± 0.0
4.861AsnLeu: 4.861 ± 0.0
1.944AsnMet: 1.944 ± 0.0
2.528AsnAsn: 2.528 ± 0.0
3.111AsnPro: 3.111 ± 0.0
3.5AsnGln: 3.5 ± 0.0
2.528AsnArg: 2.528 ± 0.0
4.278AsnSer: 4.278 ± 0.0
2.917AsnThr: 2.917 ± 0.0
3.305AsnVal: 3.305 ± 0.0
1.361AsnTrp: 1.361 ± 0.0
1.944AsnTyr: 1.944 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
2.917ProAla: 2.917 ± 0.0
1.556ProCys: 1.556 ± 0.0
4.667ProAsp: 4.667 ± 0.0
2.917ProGlu: 2.917 ± 0.0
0.778ProPhe: 0.778 ± 0.0
3.305ProGly: 3.305 ± 0.0
1.556ProHis: 1.556 ± 0.0
5.25ProIle: 5.25 ± 0.0
2.917ProLys: 2.917 ± 0.0
3.694ProLeu: 3.694 ± 0.0
0.778ProMet: 0.778 ± 0.0
2.333ProAsn: 2.333 ± 0.0
3.111ProPro: 3.111 ± 0.0
1.361ProGln: 1.361 ± 0.0
2.333ProArg: 2.333 ± 0.0
4.278ProSer: 4.278 ± 0.0
2.528ProThr: 2.528 ± 0.0
3.889ProVal: 3.889 ± 0.0
0.389ProTrp: 0.389 ± 0.0
1.167ProTyr: 1.167 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
2.333GlnAla: 2.333 ± 0.0
0.583GlnCys: 0.583 ± 0.0
1.556GlnAsp: 1.556 ± 0.0
2.528GlnGlu: 2.528 ± 0.0
0.194GlnPhe: 0.194 ± 0.0
1.361GlnGly: 1.361 ± 0.0
0.778GlnHis: 0.778 ± 0.0
1.944GlnIle: 1.944 ± 0.0
1.167GlnLys: 1.167 ± 0.0
3.5GlnLeu: 3.5 ± 0.0
0.389GlnMet: 0.389 ± 0.0
1.556GlnAsn: 1.556 ± 0.0
1.556GlnPro: 1.556 ± 0.0
1.944GlnGln: 1.944 ± 0.0
0.389GlnArg: 0.389 ± 0.0
3.111GlnSer: 3.111 ± 0.0
2.139GlnThr: 2.139 ± 0.0
3.111GlnVal: 3.111 ± 0.0
0.778GlnTrp: 0.778 ± 0.0
1.944GlnTyr: 1.944 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
2.722ArgAla: 2.722 ± 0.0
1.556ArgCys: 1.556 ± 0.0
2.333ArgAsp: 2.333 ± 0.0
2.917ArgGlu: 2.917 ± 0.0
1.361ArgPhe: 1.361 ± 0.0
1.75ArgGly: 1.75 ± 0.0
1.944ArgHis: 1.944 ± 0.0
2.333ArgIle: 2.333 ± 0.0
2.917ArgLys: 2.917 ± 0.0
3.889ArgLeu: 3.889 ± 0.0
2.333ArgMet: 2.333 ± 0.0
2.333ArgAsn: 2.333 ± 0.0
2.917ArgPro: 2.917 ± 0.0
1.361ArgGln: 1.361 ± 0.0
0.583ArgArg: 0.583 ± 0.0
2.528ArgSer: 2.528 ± 0.0
2.917ArgThr: 2.917 ± 0.0
2.722ArgVal: 2.722 ± 0.0
0.972ArgTrp: 0.972 ± 0.0
2.722ArgTyr: 2.722 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
3.889SerAla: 3.889 ± 0.0
1.361SerCys: 1.361 ± 0.0
4.667SerAsp: 4.667 ± 0.0
3.5SerGlu: 3.5 ± 0.0
0.972SerPhe: 0.972 ± 0.0
4.861SerGly: 4.861 ± 0.0
1.944SerHis: 1.944 ± 0.0
6.222SerIle: 6.222 ± 0.0
4.472SerLys: 4.472 ± 0.0
6.222SerLeu: 6.222 ± 0.0
1.75SerMet: 1.75 ± 0.0
2.722SerAsn: 2.722 ± 0.0
4.472SerPro: 4.472 ± 0.0
1.944SerGln: 1.944 ± 0.0
4.083SerArg: 4.083 ± 0.0
2.917SerSer: 2.917 ± 0.0
5.639SerThr: 5.639 ± 0.0
3.305SerVal: 3.305 ± 0.0
1.75SerTrp: 1.75 ± 0.0
3.5SerTyr: 3.5 ± 0.0
0.0SerXaa: 0.0 ± 0.0
Thr
5.833ThrAla: 5.833 ± 0.0
1.167ThrCys: 1.167 ± 0.0
4.472ThrAsp: 4.472 ± 0.0
3.889ThrGlu: 3.889 ± 0.0
1.167ThrPhe: 1.167 ± 0.0
4.083ThrGly: 4.083 ± 0.0
2.722ThrHis: 2.722 ± 0.0
3.305ThrIle: 3.305 ± 0.0
4.472ThrLys: 4.472 ± 0.0
4.278ThrLeu: 4.278 ± 0.0
1.944ThrMet: 1.944 ± 0.0
3.5ThrAsn: 3.5 ± 0.0
5.833ThrPro: 5.833 ± 0.0
2.139ThrGln: 2.139 ± 0.0
3.305ThrArg: 3.305 ± 0.0
5.25ThrSer: 5.25 ± 0.0
6.805ThrThr: 6.805 ± 0.0
3.889ThrVal: 3.889 ± 0.0
1.167ThrTrp: 1.167 ± 0.0
2.139ThrTyr: 2.139 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
5.25ValAla: 5.25 ± 0.0
1.556ValCys: 1.556 ± 0.0
3.694ValAsp: 3.694 ± 0.0
4.861ValGlu: 4.861 ± 0.0
1.361ValPhe: 1.361 ± 0.0
4.667ValGly: 4.667 ± 0.0
2.528ValHis: 2.528 ± 0.0
6.222ValIle: 6.222 ± 0.0
4.861ValLys: 4.861 ± 0.0
4.667ValLeu: 4.667 ± 0.0
2.333ValMet: 2.333 ± 0.0
4.083ValAsn: 4.083 ± 0.0
3.305ValPro: 3.305 ± 0.0
2.722ValGln: 2.722 ± 0.0
2.722ValArg: 2.722 ± 0.0
3.889ValSer: 3.889 ± 0.0
3.694ValThr: 3.694 ± 0.0
4.278ValVal: 4.278 ± 0.0
0.972ValTrp: 0.972 ± 0.0
2.528ValTyr: 2.528 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
0.194TrpAla: 0.194 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.972TrpAsp: 0.972 ± 0.0
1.167TrpGlu: 1.167 ± 0.0
0.778TrpPhe: 0.778 ± 0.0
1.556TrpGly: 1.556 ± 0.0
0.583TrpHis: 0.583 ± 0.0
0.972TrpIle: 0.972 ± 0.0
0.778TrpLys: 0.778 ± 0.0
1.944TrpLeu: 1.944 ± 0.0
0.583TrpMet: 0.583 ± 0.0
0.778TrpAsn: 0.778 ± 0.0
0.778TrpPro: 0.778 ± 0.0
0.778TrpGln: 0.778 ± 0.0
0.389TrpArg: 0.389 ± 0.0
0.194TrpSer: 0.194 ± 0.0
0.778TrpThr: 0.778 ± 0.0
1.361TrpVal: 1.361 ± 0.0
0.583TrpTrp: 0.583 ± 0.0
0.389TrpTyr: 0.389 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.333TyrAla: 2.333 ± 0.0
0.972TyrCys: 0.972 ± 0.0
2.333TyrAsp: 2.333 ± 0.0
1.75TyrGlu: 1.75 ± 0.0
1.556TyrPhe: 1.556 ± 0.0
1.556TyrGly: 1.556 ± 0.0
2.139TyrHis: 2.139 ± 0.0
1.75TyrIle: 1.75 ± 0.0
1.556TyrLys: 1.556 ± 0.0
4.278TyrLeu: 4.278 ± 0.0
0.778TyrMet: 0.778 ± 0.0
2.528TyrAsn: 2.528 ± 0.0
1.361TyrPro: 1.361 ± 0.0
0.389TyrGln: 0.389 ± 0.0
1.75TyrArg: 1.75 ± 0.0
2.722TyrSer: 2.722 ± 0.0
3.5TyrThr: 3.5 ± 0.0
2.333TyrVal: 2.333 ± 0.0
0.194TyrTrp: 0.194 ± 0.0
0.389TyrTyr: 0.389 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1 proteins (5144 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski