Amino acid dipeptide frequency for Planarian secretory cell nidovirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.
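As an illustration of the idea, the sketch below counts overlapping residue pairs and reports them in per mille. It is only a minimal example, not the pipeline used by Proteome-pI: the function name `dipeptide_per_mille`, the use of one-letter residue codes, and the toy sequence are all assumptions made for this example.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs.

    Hypothetical helper: pairs are counted inside each protein only,
    so a pair never spans the end of one protein and the start of the next.
    """
    counts = Counter()
    for seq in sequences:
        # Residues i and i+1 form one dipeptide; a protein of length n has n-1 pairs.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (not real data): the pairs are MK, KK, KL, LL, LA.
freqs = dipeptide_per_mille(["MKKLLA"])
```

Each of the five pairs in the toy sequence occurs once, so each gets one fifth of the total.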

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
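The uniform baseline can be worked out in a few lines. The sketch below assumes that "expected" means the flat 1/400 baseline described above; the page does not state the exact threshold behind its colouring, so the `classify` helper and its labels are hypothetical. The three observed values are taken from the table on this page.

```python
def uniform_expectation_per_mille(alphabet_size=20):
    """Expected per-mille frequency of one dipeptide if all pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, expected_per_mille):
    """Hypothetical labelling rule: compare an observed value with the flat baseline."""
    if observed_per_mille > expected_per_mille:
        return "over"
    if observed_per_mille < expected_per_mille:
        return "under"
    return "as expected"


expected = uniform_expectation_per_mille()        # 20 * 20 = 400 pairs
expected_with_xaa = uniform_expectation_per_mille(21)  # 21 * 21 = 441 pairs

observed = {"IleIle": 9.591, "AlaAla": 1.697, "TrpTrp": 0.0}
labels = {name: classify(value, expected) for name, value in observed.items()}
```

With the 20-letter alphabet the baseline is 2.5‰, so IleIle lies well above it while AlaAla and TrpTrp lie below it.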

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
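As a small worked example of the unit conversion, the sketch below turns one table value into a fraction and then into an approximate count. It assumes that frequencies are taken over the overlapping pairs of a single 13556-residue protein (the size quoted in the statistics line of this page), which gives 13555 pairs; that denominator is an inference, not something the page states.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


ile_ile_fraction = per_mille_to_fraction(9.591)  # IleIle value from the table

# Assumption: one protein of 13556 residues contains 13556 - 1 overlapping pairs.
pair_windows = 13556 - 1
approx_ile_ile_count = round(ile_ile_fraction * pair_windows)
```

Under that assumption the smallest non-zero table value, 0.074‰, corresponds to a single occurrence, since 1000 / 13555 is about 0.074.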

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰. Rows give the first residue, columns the second residue. The estimated error is ± 0.0 for every cell.

       Ala    Cys    Asp    Glu    Phe    Gly    His    Ile    Lys    Leu    Met    Asn    Pro    Gln    Arg    Ser    Thr    Val    Trp    Tyr    Xaa
Ala  1.697  0.738  1.771  1.549  1.254  1.254  1.328  5.902  2.877  4.648  0.959  1.992  1.328  1.771  1.697  1.475  3.025  2.066  0.443  2.803    0.0
Cys  0.812  0.516  0.738  1.033  0.812   1.18  0.443  2.508  1.402  1.475   0.59  1.549  0.443  1.033  0.664  1.623  1.844  1.328  0.074  1.475    0.0
Asp  3.025  0.959  4.648  2.435  2.951  2.582   1.18  7.746  5.533  4.795  1.402  5.607  1.771  1.697  1.918  3.172  5.312  4.722  0.295  3.689    0.0
Glu  2.139  0.885   2.73  1.992  1.992  1.254  0.812   3.32   2.73  3.984  1.033  2.066  0.885  1.771  2.066  2.139  3.689  2.066  0.074  3.246    0.0
Phe  2.139  0.885  3.394  1.918  1.623  2.287  0.443  5.385  3.467  3.836   0.59  3.467  0.516  0.738  1.475  2.656  3.615  3.541  0.148  1.844    0.0
Gly  1.033  0.959  2.139  0.959  1.918  1.623  1.107  2.951  2.582  3.098  0.664  2.508   0.59  1.328  1.328  1.992  1.992  1.475  0.148  2.877    0.0
His  1.475  0.516  1.918   1.18  0.885  0.738  0.221   2.73  1.918  1.254  0.369  1.623  0.738  0.812   0.59  1.254  1.623  1.328  0.369  0.812    0.0
Ile  4.722  2.508  7.599  4.648  4.353  3.984  2.287  9.591  7.968  8.558  2.435  7.599  4.058  4.426  3.172  7.304   7.23  5.017  0.664  6.861    0.0
Lys  2.287  2.139  4.869  2.582   3.91   1.18  1.402  6.492  3.984  7.672  1.328  4.869  2.951    4.5  2.361  4.426  5.754  2.951  0.812  5.312    0.0
Leu  3.025  1.844  4.648  2.951  3.615  2.582   2.73  7.599  5.607  7.968  2.361  6.418  2.435  3.394  3.615  7.451  7.377  2.877   0.59  5.754    0.0
Met  1.033  0.516  1.107  0.959  1.475   1.18   0.59  2.066  1.918  3.098  0.516  1.402  0.738  0.812  0.885  1.328  1.697  0.516    0.0  0.959    0.0
Asn  3.984  1.918  5.976  3.098  2.951  2.139  1.549  8.779  6.049  5.164  1.549  5.828  1.254  1.697  2.656  3.615  6.787  5.238  0.295    4.5    0.0
Pro  1.402   0.59  1.697  1.328  0.959   0.59  0.516   3.32  2.066   2.73  0.738  2.951  0.885  1.697  1.254  1.549  2.803  1.771  0.148  1.328    0.0
Gln   0.59   1.18  2.803  1.918  1.697  1.107  0.959  3.467  3.172  3.246  0.885   3.32  1.771  2.582  1.254   2.73  2.361  1.771  0.148  3.615    0.0
Arg  1.623  0.516   2.73  2.287  1.254  1.254  0.664  3.025  1.623  3.762  1.107  3.394  1.107  1.402  0.885  1.623   2.73  2.139  0.221  2.361    0.0
Ser  1.697  0.295  4.058  2.582  2.582  2.066  1.254  7.156  3.394  4.722  1.402  4.353  1.844  2.435  2.213  3.541  5.902  3.762  0.148   3.91    0.0
Thr  3.098  1.697  4.279  2.435  3.984  2.508  2.213  8.705  7.451  6.271  1.697  5.533  3.615  3.762  2.435    4.5  9.443  4.943  0.369  4.795    0.0
Val  2.139   1.18  3.467   2.73  2.803  1.328  0.959  6.123  3.467  3.615   1.18  5.312  1.844  2.139  2.139  3.098  4.279  2.951  0.443  2.951    0.0
Trp  0.369  0.221  0.443    0.0  0.443  0.074  0.221  0.664  0.221  0.738  0.148  0.516  0.221  0.369  0.074  0.443  0.443  0.074    0.0  0.221    0.0
Tyr  2.287  1.475  4.426  2.287  2.508  2.361  1.475  7.451  4.353  4.205  1.771  4.722  1.328  2.287  3.025  3.394  5.828  3.615  0.516  3.541    0.0
Xaa    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0

Statistics based on 1 protein (13556 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level
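The sketch below shows one way a protein-level bootstrap could work, and why it reports ± 0.0 for a proteome with a single protein. It is an assumption-laden illustration, not the Proteome-pI implementation: the function names, the use of the population standard deviation as the error, the fixed seed, and the toy sequences are all choices made for this example.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Per-mille frequency of one dipeptide over all overlapping pairs."""
    windows = 0
    hits = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            windows += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / windows if windows else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Spread of the estimate when whole proteins are resampled with replacement."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as the proteome holds, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(estimates)


# Toy data, one-letter codes (assumption): a single protein vs. three proteins.
single_protein_error = bootstrap_error(["MKKLLAKK"], "KK")
multi_protein_error = bootstrap_error(["KKKK", "AAAA", "KKAA"], "KK")
```

With only one protein, every resample is that same protein, so all 100 estimates are identical and the spread is zero. That is consistent with the ± 0.0 reported for every dipeptide here.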

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski