Amino acid dipeptide frequency for Beihai narna-like virus 17

Apart from single amino acid frequencies, one can also calculate so-called amino acid dipeptide frequencies.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred randomly in proteins, each of the 400 would therefore be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red and those that are underrepresented are marked in blue in the table.
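The comparison against the uniform expectation can be sketched as follows. This is only an illustration: the page does not state its exact colouring rule, so the plain greater-than/less-than comparison and the names `EXPECTED_PER_MILLE` and `classify` are assumptions.

```python
# Uniform expectation: each of the 400 standard dipeptides is equally likely.
EXPECTED_PER_MILLE = 1000 / 400  # 2.5 per mille, i.e. 0.25%


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if freq_per_mille > expected:
        return "over"   # would be marked red
    if freq_per_mille < expected:
        return "under"  # would be marked blue
    return "expected"


# Three values taken from the table below (in per mille).
examples = {"LeuLeu": 15.286, "AlaCys": 0.805, "AlaGlu": 2.414}
labels = {name: classify(value) for name, value in examples.items()}
print(labels)
```

Under this assumed rule LeuLeu counts as overrepresented, while AlaCys and AlaGlu fall below the 2.5‰ expectation.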

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
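A minimal sketch of how such per mille values can be obtained from a sequence, assuming overlapping dipeptides are counted and divided by the number of pairs. The function name `dipeptide_per_mille`, the one-letter residue codes, and the toy sequence are illustrative assumptions, not the database's actual pipeline. The tabulated values are consistent with this scheme: a protein of 1244 residues has 1243 overlapping pairs, and a single occurrence gives 1000/1243 ≈ 0.805‰.

```python
from collections import Counter


def dipeptide_per_mille(sequence):
    """Return overlapping dipeptide frequencies in per mille for one sequence."""
    # A sequence of length N contains N - 1 overlapping pairs.
    pairs = [sequence[i:i + 2] for i in range(len(sequence) - 1)]
    counts = Counter(pairs)
    total = len(pairs)
    return {pair: 1000 * n / total for pair, n in counts.items()}


freqs = dipeptide_per_mille("MALLKALL")  # made-up toy sequence
fraction_ll = freqs["LL"] * 1e-3         # per mille -> plain fraction
print(freqs, fraction_ll)
```

Multiplying a per mille value by 10⁻³, as in `fraction_ll`, recovers the plain fraction of all dipeptides.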

For more information, see the sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide, columns the second. Values are in ‰; the estimated error is ± 0.0 for every dipeptide.

       Ala    Cys    Asp    Glu    Phe    Gly    His    Ile    Lys    Leu    Met    Asn    Pro    Gln    Arg    Ser    Thr    Val    Trp    Tyr    Xaa
Ala  1.609  0.805  4.023  2.414  0.805  2.414    0.0  0.805  0.805  6.436  1.609  2.414  1.609  2.414  2.414    0.0  0.805  3.218  0.805  3.218    0.0
Cys    0.0    0.0  0.805  0.805  0.805  0.805  0.805  2.414    0.0  4.023    0.0    0.0    0.0  3.218  2.414  1.609  0.805    0.0    0.0    0.0    0.0
Asp  0.805  0.805  4.827  2.414   8.85  4.827  1.609  3.218  4.827  4.023  0.805  4.023  7.241  0.805  2.414  3.218  0.805  4.023  0.805  1.609    0.0
Glu  3.218    0.0  4.023  5.632  7.241  1.609    0.0  3.218  7.241  8.045  0.805    0.0  4.827  0.805  4.023  4.023  4.023  4.023    0.0  0.805    0.0
Phe  1.609  0.805  1.609  1.609  4.023  5.632  1.609  2.414  1.609  4.023  0.805  4.827  9.654  3.218  4.827  4.827  3.218  0.805  1.609  4.827    0.0
Gly    0.0  0.805  3.218  3.218  3.218  2.414  1.609  4.023  4.827  6.436  2.414  1.609  3.218  2.414  2.414  5.632  4.827  3.218    0.0  3.218    0.0
His  1.609    0.0  1.609  4.023  0.805  3.218  0.805  0.805    0.0  0.805    0.0  0.805  2.414    0.0    0.0  0.805  2.414    0.0    0.0  0.805    0.0
Ile  1.609  1.609  4.023  5.632  1.609  3.218  1.609  3.218  5.632  2.414    0.0  0.805  4.023  1.609  4.827  5.632  0.805  4.023    0.0  1.609    0.0
Lys  5.632  1.609  4.023  2.414  4.023  2.414  2.414  3.218  0.805  4.023  1.609  3.218  0.805  3.218  4.827  1.609  3.218  6.436    0.0  1.609    0.0
Leu  5.632  1.609  5.632  6.436  5.632  8.045  3.218  2.414  8.045 15.286  0.805  4.023  7.241  4.827  9.654  7.241  7.241  2.414  1.609  2.414    0.0
Met  1.609  0.805  2.414  2.414    0.0  1.609    0.0  3.218    0.0  0.805    0.0    0.0  0.805    0.0    0.0  1.609  0.805  0.805  1.609    0.0    0.0
Asn  1.609  0.805  1.609  1.609  1.609  3.218  0.805  2.414  2.414  4.827  1.609  0.805  3.218  2.414  1.609  4.023  0.805  2.414    0.0  0.805    0.0
Pro  3.218  1.609  2.414   8.85  0.805  1.609  1.609  5.632  2.414  6.436  1.609  3.218  4.023  3.218  2.414  1.609  4.023  4.023  0.805  1.609    0.0
Gln  1.609    0.0  1.609  3.218  3.218  0.805  0.805  2.414  2.414  6.436  0.805  2.414  1.609  0.805  6.436  1.609  1.609  1.609  0.805  1.609    0.0
Arg  1.609    0.0  7.241  2.414  6.436  4.827  1.609  6.436  7.241  5.632  1.609  2.414  2.414  2.414  4.827  5.632  3.218  8.045  0.805  0.805    0.0
Ser  0.805  3.218  4.023  3.218  6.436  4.023    0.0  3.218  2.414 11.263  1.609  3.218  4.023  4.827  8.045  1.609  1.609  1.609  1.609  1.609    0.0
Thr  2.414    0.0  3.218  0.805  2.414  4.023    0.0  3.218  3.218  8.045  0.805  1.609  2.414  1.609  3.218  7.241  2.414  3.218    0.0  0.805    0.0
Val  1.609  3.218  2.414  4.023  3.218  1.609  1.609  1.609  2.414  4.827  0.805  1.609  0.805  1.609  4.827  8.045  4.827  4.827  1.609  1.609    0.0
Trp  1.609    0.0  0.805  0.805    0.0  0.805    0.0  0.805    0.0  1.609    0.0  0.805    0.0  0.805  2.414  0.805  0.805  0.805    0.0    0.0    0.0
Tyr  0.805  0.805  2.414  1.609  2.414  1.609  0.805    0.0  0.805  4.023    0.0  0.805  0.805  1.609  4.023  1.609  2.414  1.609  0.805    0.0    0.0
Xaa    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0

Statistics based on 1 protein (1244 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level
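A rough sketch of what bootstrapping at the protein level could look like. The helper names, the use of the population standard deviation as the error measure, and the toy sequences are all assumptions; the database's actual procedure is not described here. The sketch resamples whole proteins with replacement 100 times and recomputes the frequency of one dipeptide each time.

```python
import random
from collections import Counter
from statistics import pstdev


def pair_counts(seq):
    """Count overlapping dipeptides in one sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread (population std dev) of a per mille frequency over resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = [rng.choice(proteins) for _ in proteins]
        counts = Counter()
        for seq in sample:
            counts.update(pair_counts(seq))
        total = sum(counts.values())
        estimates.append(1000 * counts[pair] / total if total else 0.0)
    return pstdev(estimates)


# With a single protein every resample is identical, so the spread is zero.
error = bootstrap_error(["MALLKALL"], "LL")  # made-up toy sequence
print(error)
```

Under these assumptions a proteome with only one protein always yields an error of 0.0, because every resample contains the same sequence.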

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski