Amino acid dipeptide frequency for Chicken stool-associated circular virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
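As an illustration of what such a calculation can look like, here is a minimal Python sketch. It is not the Proteome-pI implementation: the function name `dipeptide_per_mille`, the use of one-letter codes (with `X` standing in for Xaa), and the choice to count overlapping pairs within each protein separately are all assumptions made for this example.

```python
from collections import Counter
from itertools import product

# The 20 standard residues as one-letter codes; anything else is pooled into "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X so the alphabet has 21 symbols.
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs within one protein; pairs never span two proteins
        # (an assumption of this sketch).
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    # Report all 441 ordered pairs, including those that never occur.
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Toy sequences, not taken from this proteome.
freqs = dipeptide_per_mille(["MKTAYIAK", "MKB"])
print(round(freqs["MK"], 3))
```

The result always has 441 entries that sum to 1000 ‰ when at least one pair was counted, which makes it easy to lay out as a 21 x 21 table like the one on this page.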

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred in proteins with equal probability, each one would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
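The arithmetic behind the 0.25% baseline, and one way the red/blue marking could be derived from it, can be sketched in Python as follows. The exact colouring rule used on this page is not stated, so the `label` function below simply assumes a direct comparison with the uniform baseline; its name and return strings are made up for this example.

```python
N_STANDARD = 20   # standard amino acids
N_WITH_XAA = 21   # standard amino acids plus Xaa

n_pairs = N_STANDARD ** 2       # 400 ordered dipeptides
n_pairs_xaa = N_WITH_XAA ** 2   # 441 when Xaa is included

# Uniform expectation: 1/400 = 0.25% = 2.5 per mille.
expected_per_mille = 1000.0 / n_pairs


def label(freq_per_mille, baseline=expected_per_mille):
    """Classify a frequency against the uniform baseline (assumed colouring rule)."""
    if freq_per_mille > baseline:
        return "over (red)"
    if freq_per_mille < baseline:
        return "under (blue)"
    return "as expected"


# Four values taken from the table, in per mille.
for name, value in [("SerThr", 11.931), ("AlaAla", 4.338),
                    ("AlaLys", 1.085), ("AlaGlu", 0.0)]:
    print(name, value, label(value))
```

Under this assumed rule, any table value above 2.5 ‰ counts as overrepresented and any value below it as underrepresented.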

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
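A small worked example of the unit conversion, written in Python; the helper names are hypothetical and only illustrate the scaling.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (1 per mille = 0.1%)."""
    return value * 0.1


ala_ala = 4.338  # AlaAla value from the table, in per mille
print(per_mille_to_fraction(ala_ala))  # fraction of all dipeptides
print(per_mille_to_percent(ala_ala))   # same value in percent
```

Expressed in percent, the table values can be compared directly with the 0.25% uniform expectation mentioned above.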

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.338 ± 2.941
AlaCys: 3.254 ± 1.547
AlaAsp: 3.254 ± 1.427
AlaGlu: 0.0 ± 0.0
AlaPhe: 3.254 ± 3.222
AlaGly: 4.338 ± 0.7
AlaHis: 2.169 ± 1.028
AlaIle: 4.338 ± 1.612
AlaLys: 1.085 ± 0.836
AlaLeu: 3.254 ± 1.427
AlaMet: 2.169 ± 1.694
AlaAsn: 1.085 ± 1.074
AlaPro: 0.0 ± 0.0
AlaGln: 5.423 ± 2.891
AlaArg: 4.338 ± 2.941
AlaSer: 4.338 ± 4.296
AlaThr: 1.085 ± 1.074
AlaVal: 1.085 ± 0.847
AlaTrp: 1.085 ± 1.074
AlaTyr: 3.254 ± 1.866
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 1.085 ± 0.847
CysGlu: 2.169 ± 0.806
CysPhe: 0.0 ± 0.0
CysGly: 3.254 ± 1.427
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 2.169 ± 1.694
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 1.085 ± 0.847
CysGln: 1.085 ± 0.847
CysArg: 2.169 ± 1.694
CysSer: 2.169 ± 0.806
CysThr: 1.085 ± 1.074
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 1.085 ± 0.847
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.085 ± 1.074
AspCys: 1.085 ± 0.847
AspAsp: 3.254 ± 1.427
AspGlu: 8.677 ± 6.692
AspPhe: 3.254 ± 2.509
AspGly: 7.592 ± 3.312
AspHis: 0.0 ± 0.0
AspIle: 4.338 ± 1.612
AspLys: 3.254 ± 0.271
AspLeu: 5.423 ± 3.062
AspMet: 3.254 ± 1.427
AspAsn: 5.423 ± 2.136
AspPro: 1.085 ± 0.836
AspGln: 1.085 ± 1.074
AspArg: 3.254 ± 0.271
AspSer: 4.338 ± 2.887
AspThr: 4.338 ± 1.612
AspVal: 5.423 ± 2.157
AspTrp: 0.0 ± 0.0
AspTyr: 1.085 ± 0.847
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.085 ± 0.847
GluCys: 1.085 ± 0.847
GluAsp: 7.592 ± 2.927
GluGlu: 1.085 ± 0.836
GluPhe: 2.169 ± 1.694
GluGly: 5.423 ± 2.136
GluHis: 1.085 ± 0.836
GluIle: 8.677 ± 5.389
GluLys: 6.508 ± 1.158
GluLeu: 2.169 ± 1.028
GluMet: 4.338 ± 2.096
GluAsn: 2.169 ± 0.806
GluPro: 1.085 ± 0.847
GluGln: 2.169 ± 0.806
GluArg: 2.169 ± 0.806
GluSer: 2.169 ± 1.673
GluThr: 4.338 ± 1.612
GluVal: 2.169 ± 1.028
GluTrp: 2.169 ± 0.806
GluTyr: 4.338 ± 2.17
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.085 ± 0.847
PheCys: 0.0 ± 0.0
PheAsp: 1.085 ± 1.074
PheGlu: 0.0 ± 0.0
PhePhe: 0.0 ± 0.0
PheGly: 5.423 ± 1.628
PheHis: 1.085 ± 0.847
PheIle: 3.254 ± 0.271
PheLys: 5.423 ± 0.542
PheLeu: 3.254 ± 1.427
PheMet: 0.0 ± 0.0
PheAsn: 0.0 ± 0.0
PhePro: 3.254 ± 1.457
PheGln: 0.0 ± 0.0
PheArg: 3.254 ± 1.924
PheSer: 4.338 ± 2.17
PheThr: 2.169 ± 1.028
PheVal: 1.085 ± 1.074
PheTrp: 0.0 ± 0.0
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.254 ± 0.271
GlyCys: 2.169 ± 1.694
GlyAsp: 3.254 ± 1.427
GlyGlu: 6.508 ± 2.815
GlyPhe: 0.0 ± 0.0
GlyGly: 2.169 ± 0.806
GlyHis: 2.169 ± 0.806
GlyIle: 7.592 ± 0.847
GlyLys: 1.085 ± 0.847
GlyLeu: 5.423 ± 0.542
GlyMet: 2.169 ± 0.969
GlyAsn: 3.254 ± 1.866
GlyPro: 2.169 ± 0.806
GlyGln: 1.085 ± 1.074
GlyArg: 1.085 ± 0.836
GlySer: 5.423 ± 1.249
GlyThr: 8.677 ± 2.686
GlyVal: 3.254 ± 1.924
GlyTrp: 0.0 ± 0.0
GlyTyr: 6.508 ± 2.853
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.169 ± 0.806
HisCys: 0.0 ± 0.0
HisAsp: 1.085 ± 0.836
HisGlu: 2.169 ± 1.694
HisPhe: 1.085 ± 0.847
HisGly: 0.0 ± 0.0
HisHis: 1.085 ± 0.847
HisIle: 0.0 ± 0.0
HisLys: 2.169 ± 1.694
HisLeu: 1.085 ± 0.847
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 1.085 ± 0.847
HisGln: 0.0 ± 0.0
HisArg: 0.0 ± 0.0
HisSer: 0.0 ± 0.0
HisThr: 1.085 ± 0.847
HisVal: 1.085 ± 0.836
HisTrp: 2.169 ± 0.969
HisTyr: 2.169 ± 0.806
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.085 ± 1.074
IleCys: 0.0 ± 0.0
IleAsp: 4.338 ± 0.7
IleGlu: 4.338 ± 1.612
IlePhe: 2.169 ± 1.673
IleGly: 2.169 ± 0.806
IleHis: 1.085 ± 0.836
IleIle: 1.085 ± 0.836
IleLys: 7.592 ± 4.567
IleLeu: 5.423 ± 2.157
IleMet: 2.169 ± 0.969
IleAsn: 4.338 ± 1.612
IlePro: 2.169 ± 0.969
IleGln: 4.338 ± 0.7
IleArg: 2.169 ± 0.969
IleSer: 6.508 ± 3.491
IleThr: 5.423 ± 4.182
IleVal: 3.254 ± 2.542
IleTrp: 3.254 ± 2.542
IleTyr: 2.169 ± 0.969
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.338 ± 2.204
LysCys: 1.085 ± 0.847
LysAsp: 5.423 ± 2.136
LysGlu: 6.508 ± 2.853
LysPhe: 2.169 ± 0.806
LysGly: 3.254 ± 1.547
LysHis: 2.169 ± 0.806
LysIle: 6.508 ± 0.543
LysLys: 3.254 ± 2.542
LysLeu: 2.169 ± 0.806
LysMet: 2.169 ± 1.673
LysAsn: 0.0 ± 0.0
LysPro: 6.508 ± 1.486
LysGln: 0.0 ± 0.0
LysArg: 4.338 ± 3.389
LysSer: 2.169 ± 1.694
LysThr: 6.508 ± 2.815
LysVal: 3.254 ± 0.271
LysTrp: 4.338 ± 1.612
LysTyr: 2.169 ± 0.806
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.423 ± 1.249
LeuCys: 0.0 ± 0.0
LeuAsp: 5.423 ± 1.519
LeuGlu: 7.592 ± 1.346
LeuPhe: 2.169 ± 0.969
LeuGly: 3.254 ± 1.427
LeuHis: 0.0 ± 0.0
LeuIle: 0.0 ± 0.0
LeuLys: 8.677 ± 5.522
LeuLeu: 5.423 ± 2.136
LeuMet: 2.169 ± 0.806
LeuAsn: 4.338 ± 0.803
LeuPro: 1.085 ± 0.836
LeuGln: 2.169 ± 0.806
LeuArg: 1.085 ± 1.074
LeuSer: 2.169 ± 0.969
LeuThr: 3.254 ± 0.271
LeuVal: 4.338 ± 0.7
LeuTrp: 0.0 ± 0.0
LeuTyr: 2.169 ± 1.673
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.169 ± 0.969
MetCys: 0.0 ± 0.0
MetAsp: 1.085 ± 0.847
MetGlu: 5.423 ± 1.628
MetPhe: 3.254 ± 1.427
MetGly: 1.085 ± 0.847
MetHis: 0.0 ± 0.0
MetIle: 2.169 ± 1.673
MetLys: 2.169 ± 1.694
MetLeu: 1.085 ± 0.836
MetMet: 0.0 ± 0.0
MetAsn: 1.085 ± 0.836
MetPro: 1.085 ± 1.074
MetGln: 1.085 ± 0.847
MetArg: 1.085 ± 1.074
MetSer: 2.169 ± 1.673
MetThr: 1.085 ± 0.847
MetVal: 0.0 ± 0.0
MetTrp: 2.169 ± 1.028
MetTyr: 1.085 ± 1.074
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.254 ± 3.222
AsnCys: 0.0 ± 0.0
AsnAsp: 3.254 ± 1.427
AsnGlu: 1.085 ± 0.836
AsnPhe: 1.085 ± 0.847
AsnGly: 7.592 ± 0.655
AsnHis: 2.169 ± 1.694
AsnIle: 3.254 ± 1.407
AsnLys: 4.338 ± 2.171
AsnLeu: 2.169 ± 1.673
AsnMet: 0.0 ± 0.0
AsnAsn: 3.254 ± 1.407
AsnPro: 2.169 ± 0.969
AsnGln: 1.085 ± 0.847
AsnArg: 2.169 ± 1.673
AsnSer: 3.254 ± 3.222
AsnThr: 1.085 ± 0.836
AsnVal: 4.338 ± 0.7
AsnTrp: 1.085 ± 0.836
AsnTyr: 2.169 ± 1.028
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.169 ± 1.028
ProCys: 0.0 ± 0.0
ProAsp: 6.508 ± 2.815
ProGlu: 3.254 ± 0.271
ProPhe: 1.085 ± 1.074
ProGly: 1.085 ± 0.836
ProHis: 0.0 ± 0.0
ProIle: 2.169 ± 1.673
ProLys: 2.169 ± 0.806
ProLeu: 2.169 ± 1.673
ProMet: 0.0 ± 0.0
ProAsn: 2.169 ± 1.028
ProPro: 2.169 ± 1.673
ProGln: 1.085 ± 0.847
ProArg: 3.254 ± 2.509
ProSer: 1.085 ± 1.074
ProThr: 2.169 ± 2.148
ProVal: 5.423 ± 1.143
ProTrp: 1.085 ± 0.836
ProTyr: 2.169 ± 1.673
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.085 ± 0.847
GlnCys: 2.169 ± 1.694
GlnAsp: 0.0 ± 0.0
GlnGlu: 1.085 ± 0.847
GlnPhe: 1.085 ± 0.847
GlnGly: 2.169 ± 0.969
GlnHis: 0.0 ± 0.0
GlnIle: 3.254 ± 0.271
GlnLys: 4.338 ± 2.204
GlnLeu: 1.085 ± 1.074
GlnMet: 0.0 ± 0.0
GlnAsn: 1.085 ± 0.847
GlnPro: 1.085 ± 0.836
GlnGln: 2.169 ± 1.028
GlnArg: 2.169 ± 0.969
GlnSer: 2.169 ± 2.148
GlnThr: 6.508 ± 3.733
GlnVal: 0.0 ± 0.0
GlnTrp: 1.085 ± 1.074
GlnTyr: 1.085 ± 1.074
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.423 ± 2.891
ArgCys: 1.085 ± 0.847
ArgAsp: 3.254 ± 0.271
ArgGlu: 2.169 ± 1.673
ArgPhe: 2.169 ± 1.694
ArgGly: 4.338 ± 1.343
ArgHis: 2.169 ± 0.806
ArgIle: 1.085 ± 0.836
ArgLys: 1.085 ± 1.074
ArgLeu: 1.085 ± 1.074
ArgMet: 1.085 ± 0.847
ArgAsn: 2.169 ± 1.028
ArgPro: 4.338 ± 2.171
ArgGln: 4.338 ± 2.941
ArgArg: 5.423 ± 3.989
ArgSer: 0.0 ± 0.0
ArgThr: 2.169 ± 0.969
ArgVal: 7.592 ± 1.947
ArgTrp: 1.085 ± 1.074
ArgTyr: 1.085 ± 1.074
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.423 ± 2.773
SerCys: 0.0 ± 0.0
SerAsp: 3.254 ± 1.866
SerGlu: 0.0 ± 0.0
SerPhe: 0.0 ± 0.0
SerGly: 6.508 ± 1.486
SerHis: 0.0 ± 0.0
SerIle: 3.254 ± 2.542
SerLys: 2.169 ± 1.673
SerLeu: 6.508 ± 2.906
SerMet: 2.169 ± 1.614
SerAsn: 2.169 ± 0.806
SerPro: 2.169 ± 0.969
SerGln: 2.169 ± 0.969
SerArg: 5.423 ± 2.417
SerSer: 5.423 ± 3.936
SerThr: 11.931 ± 7.791
SerVal: 4.338 ± 2.887
SerTrp: 1.085 ± 0.836
SerTyr: 2.169 ± 2.148
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.254 ± 1.866
ThrCys: 3.254 ± 1.407
ThrAsp: 5.423 ± 0.542
ThrGlu: 4.338 ± 0.803
ThrPhe: 3.254 ± 1.866
ThrGly: 3.254 ± 3.222
ThrHis: 2.169 ± 1.694
ThrIle: 5.423 ± 2.773
ThrLys: 3.254 ± 1.427
ThrLeu: 5.423 ± 1.519
ThrMet: 3.254 ± 1.866
ThrAsn: 8.677 ± 3.875
ThrPro: 0.0 ± 0.0
ThrGln: 0.0 ± 0.0
ThrArg: 4.338 ± 1.343
ThrSer: 6.508 ± 2.104
ThrThr: 3.254 ± 1.457
ThrVal: 7.592 ± 2.102
ThrTrp: 2.169 ± 1.694
ThrTyr: 4.338 ± 2.055
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.085 ± 1.074
ValCys: 1.085 ± 0.847
ValAsp: 2.169 ± 1.673
ValGlu: 4.338 ± 0.7
ValPhe: 2.169 ± 0.969
ValGly: 1.085 ± 1.074
ValHis: 1.085 ± 0.847
ValIle: 5.423 ± 1.519
ValLys: 4.338 ± 2.273
ValLeu: 4.338 ± 0.803
ValMet: 2.169 ± 1.694
ValAsn: 0.0 ± 0.0
ValPro: 3.254 ± 0.271
ValGln: 1.085 ± 1.074
ValArg: 2.169 ± 1.694
ValSer: 6.508 ± 2.104
ValThr: 9.761 ± 3.969
ValVal: 4.338 ± 0.803
ValTrp: 2.169 ± 0.806
ValTyr: 2.169 ± 1.673
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.085 ± 0.847
TrpCys: 0.0 ± 0.0
TrpAsp: 2.169 ± 1.028
TrpGlu: 0.0 ± 0.0
TrpPhe: 1.085 ± 0.847
TrpGly: 2.169 ± 1.028
TrpHis: 0.0 ± 0.0
TrpIle: 1.085 ± 0.836
TrpLys: 2.169 ± 0.806
TrpLeu: 2.169 ± 0.806
TrpMet: 1.085 ± 0.847
TrpAsn: 4.338 ± 0.7
TrpPro: 2.169 ± 0.806
TrpGln: 1.085 ± 1.074
TrpArg: 1.085 ± 0.836
TrpSer: 1.085 ± 1.074
TrpThr: 1.085 ± 0.847
TrpVal: 0.0 ± 0.0
TrpTrp: 2.169 ± 0.806
TrpTyr: 1.085 ± 0.836
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 4.338 ± 2.055
TyrCys: 1.085 ± 0.836
TyrAsp: 4.338 ± 0.803
TyrGlu: 3.254 ± 1.427
TyrPhe: 3.254 ± 1.924
TyrGly: 1.085 ± 0.836
TyrHis: 0.0 ± 0.0
TyrIle: 2.169 ± 1.028
TyrLys: 1.085 ± 0.836
TyrLeu: 1.085 ± 0.847
TyrMet: 0.0 ± 0.0
TyrAsn: 3.254 ± 2.509
TyrPro: 3.254 ± 1.457
TyrGln: 2.169 ± 1.673
TyrArg: 2.169 ± 0.969
TyrSer: 5.423 ± 1.143
TyrThr: 2.169 ± 0.969
TyrVal: 2.169 ± 1.028
TyrTrp: 0.0 ± 0.0
TyrTyr: 2.169 ± 0.806
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (923 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level
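The page does not describe the bootstrap procedure in detail. The Python sketch below shows one plausible reading, under stated assumptions: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each of 100 resamples, and the standard deviation of those estimates is reported as the error. The function names, the fixed seed, and the toy sequences are all made up for this example.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides (one-letter codes) within one protein."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Std. dev. of a dipeptide's per-mille frequency over protein-level resamples."""
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size fixed.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the reported error is the spread of the resampled estimates.
    return statistics.pstdev(estimates)


# Made-up sequences, not taken from this proteome.
toy = ["MKTAYIAKQR", "MSTSTVAAK", "MKKLLPTAA"]
err = bootstrap_error(toy, "AK")
print(round(err, 3))
```

With only three proteins, as in this proteome, each resample can differ substantially from the original set, which is consistent with the large errors relative to the values seen in the table.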

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski