Amino acid dipeptide frequency for Beihai sipunculid worm virus 6

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
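
As an illustration, a minimal sketch of how such frequencies could be computed is given below. It assumes that dipeptides are counted as overlapping pairs of adjacent residues and normalised by the total number of pairs. The smallest non-zero value in the table, 0.362‰, is consistent with one occurrence among the 2766 overlapping pairs of a 2767-residue protein (1000/2766 ≈ 0.3615), which suggests, but does not prove, that the page counts pairs this way. The function name `dipeptide_per_mille` and the toy sequence are hypothetical, and one-letter codes are used instead of the three-letter codes shown in the table.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Assumption: pairs are pooled over all proteins and normalised by the
    total number of pairs; this is a sketch, not the database's own code.
    """
    counts = Counter()
    for seq in sequences:
        # A sliding window of width 2: a protein of length n yields n - 1 pairs.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy sequence (hypothetical, not taken from the proteome).
freqs = dipeptide_per_mille(["MKVLAAGLL"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

Pooling the counts over all proteins before normalising means that longer proteins contribute more pairs; averaging per-protein frequencies would be a different, equally plausible convention.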

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
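
The comparison against the uniform expectation can be sketched as follows. The 1/400 baseline (2.5‰) comes from the text above; treating it as a strict threshold for the red and blue marking is an assumption, and the helper name `classify` is hypothetical. The four example values are copied from the table.

```python
# Uniform expectation for 400 dipeptides, expressed in per mille (0.25 % = 2.5 per mille).
EXPECTED_PER_MILLE = 1000.0 / 400


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide frequency relative to the uniform expectation.

    Assumption: a plain greater-than / less-than comparison; the page may
    apply a different rule when colouring cells.
    """
    if freq_per_mille > expected:
        return "over"   # would be marked red
    if freq_per_mille < expected:
        return "under"  # would be marked blue
    return "expected"


# Values taken from the table (per mille).
examples = {"LeuLeu": 11.208, "AlaAla": 4.7, "CysCys": 0.362, "TrpAla": 0.0}
labels = {name: classify(value) for name, value in examples.items()}
print(labels)
```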

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second. Every value in the table has an estimated error of ± 0.0.

       Ala    Cys    Asp    Glu    Phe    Gly    His    Ile    Lys    Leu    Met    Asn    Pro    Gln    Arg    Ser    Thr    Val    Trp    Tyr    Xaa
Ala    4.7  1.446  3.615  5.785  3.977  5.785  3.254  3.615  3.977    9.4  1.446  2.169    4.7  2.531  4.338  6.508  6.146  7.592  1.085  1.446    0.0
Cys  1.446  0.362  0.723  0.362  0.723  0.723  0.723    0.0  0.723  1.085    0.0    0.0  0.362  1.085    0.0  1.446  1.808  0.723  0.362  1.446    0.0
Asp  5.061  1.446  3.254  3.615  2.892  3.977  0.723  1.446  1.085  5.785  1.085  1.808  2.892  3.254  2.531  2.531  2.169  2.531  1.085  2.169    0.0
Glu  3.977    0.0  2.531  1.085  3.615  2.169  1.085    4.7  3.254    4.7  0.723  2.892  3.615  1.085  3.254  3.615  3.615  2.169  1.085  2.169    0.0
Phe  4.338    0.0  3.977  1.808  0.723  4.338  1.446  1.446  0.723  6.146  1.085  0.362  2.892  0.362  3.254  3.254  3.615  4.338  0.723  1.808    0.0
Gly  6.146  0.723  2.169  1.446  1.808  2.892  1.446  5.423  3.254  6.508  0.362  1.446  2.892  2.169  4.338  7.592  3.615  6.508  1.085  2.892    0.0
His  2.892    0.0  1.085  1.085  1.085  1.085  0.723  0.723  0.723  2.531    0.0  0.362  2.169  1.446  1.446  1.446  0.723  2.531  0.362  1.446    0.0
Ile  5.785    0.0  2.531  3.615  2.531  2.892  1.808  1.085  3.254  3.254  0.362  1.808  1.808  1.808  2.892  2.531  3.977  3.615  0.723  1.085    0.0
Lys  3.615  1.085  2.531  2.892  1.085  2.169  2.169  2.892  2.169  3.977  0.723  1.085  1.446  1.085  3.977  1.446  3.254  3.615  1.446  2.169    0.0
Leu 10.123  1.446  5.423  7.231    4.7  5.785  1.808  4.338  4.338 11.208  3.254  5.423  3.977  2.892  2.892  7.592  7.954  7.231  0.723  3.977    0.0
Met  2.169    0.0  1.085  0.723  1.085  2.169    0.0  1.446  1.808  1.085  1.085  0.723  1.085    0.0  1.446  1.808  1.085  1.446  0.362  0.362    0.0
Asn  2.531  0.723  2.169  0.723  1.808  2.531  0.723  0.723  1.446  2.892  1.085  0.723  2.169  2.169  2.531  1.085  2.892  2.892  0.362  1.446    0.0
Pro  2.892    0.0  3.615  2.169  3.977  5.061  2.531  1.446  2.169  3.254  0.362  1.446  2.531  1.808  4.338  6.508  2.892  2.892  0.723  1.085    0.0
Gln  2.892  0.362  1.085  1.085  0.723  2.169  0.723  2.892  0.723  1.808  0.723  0.362  1.085  0.723  2.531  1.085  3.615  5.061  0.362  1.085    0.0
Arg  3.615  0.723  2.531  2.892  2.531  3.977  0.723  3.254  3.977  9.761  1.085  1.446  3.977  1.446  3.977  6.146  2.531  3.615  0.723  1.808    0.0
Ser  5.061  1.808  2.892  3.254  3.977  3.977  1.446  2.169  2.892  8.677  2.169  4.338  3.977  1.808    4.7  3.977    4.7  8.315  1.808  3.254    0.0
Thr    4.7  1.085  2.892  2.531    4.7  5.785  1.085  3.615  2.169  6.146  1.446  2.169  2.892  1.446  3.615  5.785  2.892  4.338  1.446  2.531    0.0
Val  7.231  0.723  5.061  6.146  2.892  4.338  0.362  3.977  2.892  7.592  2.531  3.254  3.977  3.254  5.061  6.508  3.977  6.146  0.723  2.169    0.0
Trp    0.0    0.0  0.723  1.085  0.362  0.362  0.362  1.085  0.723  1.446  1.085  0.362  1.446    0.0  1.446  1.808  0.723  0.723  0.362  1.808    0.0
Tyr  4.338  2.169  1.808  1.808  1.446  2.531  0.723  1.085  2.531  5.061    0.0  1.446  1.808  0.723  2.169  2.531  1.085  2.531  0.362  2.531    0.0
Xaa    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0    0.0

Statistics based on 1 protein (2767 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
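
A protein-level bootstrap of this kind could look like the sketch below. It assumes that each of the 100 rounds resamples whole proteins with replacement, recomputes the pooled dipeptide frequency, and that the reported error is the standard deviation across rounds; none of these details are confirmed by the page. The function names and toy sequences are hypothetical. Under these assumptions a proteome with a single protein always resamples to itself, which would explain why every error in the table is ± 0.0.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide over overlapping pairs, pooled over proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency across protein-level bootstrap samples.

    Assumption: the error is a standard deviation; the page does not say
    which spread measure it reports.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size fixed.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(estimates)


# One protein: every bootstrap sample is identical, so the spread is zero.
print(bootstrap_error(["MKVLAAGLL"], "AA"))
# Several toy proteins: samples differ, so the spread is non-zero.
print(bootstrap_error(["MKVLAAGLL", "AAAA", "GGGG"], "AA"))
```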

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski