Amino acid dipeptide frequency for Lake Sarah-associated circular virus-35

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
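The comparison described above can be sketched in a few lines of Python. This is only an illustration, not the database's own code: the function names are hypothetical, and it assumes that dipeptides are counted as overlapping pairs within each protein, that any letter outside the 20 standard one-letter codes is mapped to X (Xaa), and that "expected" means the uniform value of 2.5‰.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def expected_permille(n_letters=20):
    """Uniform expectation per dipeptide, in per mille (2.5 for 20 letters)."""
    return 1000.0 / (n_letters ** 2)


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all given proteins.

    Assumption: overlapping pairs are counted within each protein and
    never across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        # Assumption: anything that is not a standard residue becomes X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_permille, n_letters=20):
    """Label a frequency relative to the uniform expectation."""
    expected = expected_permille(n_letters)
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"
```

Under these assumptions, a table value such as 8.711‰ would be classified as "over" and 0.0‰ as "under".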

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
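As a small worked example of the unit conversion, the helpers below turn a per mille value from the table into a plain fraction or a percentage. The function names are hypothetical and are used here only for illustration.

```python
def permille_to_fraction(value_permille):
    """Convert per mille to a fraction by multiplying by 10^-3."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert per mille to percent (1% equals 10 per mille)."""
    return value_permille / 10.0
```

For instance, the uniform expectation of 2.5‰ corresponds to 0.25%, and a table value of 8.711‰ corresponds to a fraction of about 0.008711.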

For more information, see the sequence space article on Wikipedia.

Columns (second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.742 ± 0.846
AlaCys: 1.742 ± 0.846
AlaAsp: 1.742 ± 0.846
AlaGlu: 1.742 ± 1.528
AlaPhe: 1.742 ± 0.846
AlaGly: 8.711 ± 0.519
AlaHis: 3.484 ± 0.682
AlaIle: 3.484 ± 0.682
AlaLys: 3.484 ± 1.692
AlaLeu: 8.711 ± 1.855
AlaMet: 1.742 ± 0.846
AlaAsn: 3.484 ± 0.682
AlaPro: 3.484 ± 0.682
AlaGln: 1.742 ± 0.846
AlaArg: 5.226 ± 0.163
AlaSer: 3.484 ± 1.692
AlaThr: 5.226 ± 2.538
AlaVal: 6.969 ± 3.383
AlaTrp: 0.0 ± 0.0
AlaTyr: 1.742 ± 0.846
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 3.484 ± 1.692
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 3.484 ± 1.692
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 1.742 ± 0.846
CysLys: 0.0 ± 0.0
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 1.742 ± 0.846
CysThr: 1.742 ± 0.846
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 1.742 ± 1.528
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.484 ± 1.692
AspCys: 1.742 ± 0.846
AspAsp: 3.484 ± 0.682
AspGlu: 3.484 ± 0.682
AspPhe: 5.226 ± 2.211
AspGly: 8.711 ± 1.855
AspHis: 0.0 ± 0.0
AspIle: 3.484 ± 0.682
AspLys: 0.0 ± 0.0
AspLeu: 6.969 ± 1.365
AspMet: 0.0 ± 0.0
AspAsn: 5.226 ± 2.211
AspPro: 3.484 ± 0.682
AspGln: 0.0 ± 0.0
AspArg: 5.226 ± 0.163
AspSer: 0.0 ± 0.0
AspThr: 3.484 ± 0.682
AspVal: 6.969 ± 1.365
AspTrp: 0.0 ± 0.0
AspTyr: 0.0 ± 0.0
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.226 ± 2.538
GluCys: 0.0 ± 0.0
GluAsp: 5.226 ± 2.538
GluGlu: 0.0 ± 0.0
GluPhe: 3.484 ± 1.692
GluGly: 3.484 ± 0.682
GluHis: 0.0 ± 0.0
GluIle: 1.742 ± 1.528
GluLys: 1.742 ± 0.846
GluLeu: 6.969 ± 1.009
GluMet: 5.226 ± 2.714
GluAsn: 1.742 ± 0.846
GluPro: 3.484 ± 0.682
GluGln: 1.742 ± 1.528
GluArg: 0.0 ± 0.0
GluSer: 0.0 ± 0.0
GluThr: 1.742 ± 0.846
GluVal: 5.226 ± 2.538
GluTrp: 0.0 ± 0.0
GluTyr: 1.742 ± 1.528
GluXaa: 0.0 ± 0.0
Phe
PheAla: 5.226 ± 2.538
PheCys: 1.742 ± 0.846
PheAsp: 3.484 ± 1.692
PheGlu: 0.0 ± 0.0
PhePhe: 3.484 ± 3.057
PheGly: 3.484 ± 0.682
PheHis: 1.742 ± 0.846
PheIle: 3.484 ± 0.682
PheLys: 1.742 ± 0.846
PheLeu: 1.742 ± 0.846
PheMet: 0.0 ± 0.0
PheAsn: 3.484 ± 0.682
PhePro: 5.226 ± 2.211
PheGln: 0.0 ± 0.0
PheArg: 5.226 ± 2.538
PheSer: 1.742 ± 1.528
PheThr: 5.226 ± 0.163
PheVal: 5.226 ± 0.163
PheTrp: 0.0 ± 0.0
PheTyr: 5.226 ± 4.585
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.969 ± 3.383
GlyCys: 0.0 ± 0.0
GlyAsp: 3.484 ± 0.682
GlyGlu: 3.484 ± 1.692
GlyPhe: 5.226 ± 2.538
GlyGly: 8.711 ± 4.229
GlyHis: 0.0 ± 0.0
GlyIle: 3.484 ± 3.057
GlyLys: 5.226 ± 2.538
GlyLeu: 5.226 ± 0.163
GlyMet: 1.742 ± 1.528
GlyAsn: 1.742 ± 1.528
GlyPro: 6.969 ± 1.009
GlyGln: 3.484 ± 1.692
GlyArg: 0.0 ± 0.0
GlySer: 5.226 ± 0.163
GlyThr: 8.711 ± 0.519
GlyVal: 3.484 ± 1.692
GlyTrp: 1.742 ± 0.846
GlyTyr: 1.742 ± 0.846
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.0 ± 0.0
HisAsp: 1.742 ± 0.846
HisGlu: 1.742 ± 0.846
HisPhe: 1.742 ± 0.846
HisGly: 1.742 ± 1.528
HisHis: 0.0 ± 0.0
HisIle: 3.484 ± 1.692
HisLys: 3.484 ± 1.692
HisLeu: 1.742 ± 0.846
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 0.0 ± 0.0
HisGln: 1.742 ± 1.528
HisArg: 0.0 ± 0.0
HisSer: 0.0 ± 0.0
HisThr: 0.0 ± 0.0
HisVal: 0.0 ± 0.0
HisTrp: 0.0 ± 0.0
HisTyr: 1.742 ± 1.528
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.226 ± 2.211
IleCys: 0.0 ± 0.0
IleAsp: 0.0 ± 0.0
IleGlu: 3.484 ± 0.682
IlePhe: 3.484 ± 1.692
IleGly: 0.0 ± 0.0
IleHis: 1.742 ± 0.846
IleIle: 1.742 ± 0.846
IleLys: 0.0 ± 0.0
IleLeu: 8.711 ± 2.893
IleMet: 0.0 ± 0.0
IleAsn: 5.226 ± 0.163
IlePro: 6.969 ± 1.365
IleGln: 1.742 ± 1.528
IleArg: 0.0 ± 0.0
IleSer: 5.226 ± 0.163
IleThr: 3.484 ± 0.682
IleVal: 0.0 ± 0.0
IleTrp: 0.0 ± 0.0
IleTyr: 1.742 ± 0.846
IleXaa: 0.0 ± 0.0
Lys
LysAla: 0.0 ± 0.0
LysCys: 0.0 ± 0.0
LysAsp: 0.0 ± 0.0
LysGlu: 6.969 ± 3.383
LysPhe: 1.742 ± 0.846
LysGly: 5.226 ± 2.538
LysHis: 0.0 ± 0.0
LysIle: 3.484 ± 0.682
LysLys: 1.742 ± 0.846
LysLeu: 3.484 ± 0.682
LysMet: 1.742 ± 0.846
LysAsn: 0.0 ± 0.0
LysPro: 1.742 ± 1.528
LysGln: 1.742 ± 0.846
LysArg: 3.484 ± 1.692
LysSer: 0.0 ± 0.0
LysThr: 3.484 ± 0.682
LysVal: 3.484 ± 1.692
LysTrp: 0.0 ± 0.0
LysTyr: 0.0 ± 0.0
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.226 ± 0.163
LeuCys: 5.226 ± 0.163
LeuAsp: 8.711 ± 0.519
LeuGlu: 5.226 ± 0.163
LeuPhe: 1.742 ± 1.528
LeuGly: 3.484 ± 1.692
LeuHis: 3.484 ± 0.682
LeuIle: 0.0 ± 0.0
LeuLys: 1.742 ± 0.846
LeuLeu: 10.453 ± 0.327
LeuMet: 0.0 ± 0.0
LeuAsn: 3.484 ± 0.682
LeuPro: 5.226 ± 2.211
LeuGln: 3.484 ± 0.682
LeuArg: 10.453 ± 2.701
LeuSer: 6.969 ± 1.365
LeuThr: 5.226 ± 2.538
LeuVal: 5.226 ± 2.211
LeuTrp: 0.0 ± 0.0
LeuTyr: 3.484 ± 0.682
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.0 ± 0.0
MetCys: 0.0 ± 0.0
MetAsp: 3.484 ± 0.682
MetGlu: 0.0 ± 0.0
MetPhe: 0.0 ± 0.0
MetGly: 0.0 ± 0.0
MetHis: 0.0 ± 0.0
MetIle: 1.742 ± 1.528
MetLys: 3.484 ± 0.682
MetLeu: 1.742 ± 1.528
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 0.0 ± 0.0
MetGln: 0.0 ± 0.0
MetArg: 1.742 ± 1.528
MetSer: 0.0 ± 0.0
MetThr: 1.742 ± 0.846
MetVal: 0.0 ± 0.0
MetTrp: 0.0 ± 0.0
MetTyr: 5.226 ± 2.211
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.484 ± 0.682
AsnCys: 0.0 ± 0.0
AsnAsp: 1.742 ± 0.846
AsnGlu: 3.484 ± 0.682
AsnPhe: 3.484 ± 3.057
AsnGly: 1.742 ± 0.846
AsnHis: 0.0 ± 0.0
AsnIle: 0.0 ± 0.0
AsnLys: 1.742 ± 0.846
AsnLeu: 3.484 ± 0.682
AsnMet: 1.742 ± 1.528
AsnAsn: 3.484 ± 1.692
AsnPro: 1.742 ± 1.528
AsnGln: 1.742 ± 0.846
AsnArg: 0.0 ± 0.0
AsnSer: 6.969 ± 1.009
AsnThr: 0.0 ± 0.0
AsnVal: 3.484 ± 3.057
AsnTrp: 1.742 ± 0.846
AsnTyr: 1.742 ± 0.846
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 8.711 ± 1.855
ProCys: 0.0 ± 0.0
ProAsp: 1.742 ± 1.528
ProGlu: 8.711 ± 0.519
ProPhe: 1.742 ± 1.528
ProGly: 5.226 ± 0.163
ProHis: 5.226 ± 0.163
ProIle: 1.742 ± 0.846
ProLys: 0.0 ± 0.0
ProLeu: 3.484 ± 3.057
ProMet: 0.0 ± 0.0
ProAsn: 1.742 ± 0.846
ProPro: 5.226 ± 2.211
ProGln: 6.969 ± 1.365
ProArg: 5.226 ± 0.163
ProSer: 1.742 ± 0.846
ProThr: 3.484 ± 1.692
ProVal: 6.969 ± 3.739
ProTrp: 0.0 ± 0.0
ProTyr: 0.0 ± 0.0
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.742 ± 1.528
GlnCys: 0.0 ± 0.0
GlnAsp: 5.226 ± 2.211
GlnGlu: 0.0 ± 0.0
GlnPhe: 0.0 ± 0.0
GlnGly: 3.484 ± 0.682
GlnHis: 1.742 ± 0.846
GlnIle: 0.0 ± 0.0
GlnLys: 1.742 ± 1.528
GlnLeu: 6.969 ± 1.009
GlnMet: 0.0 ± 0.0
GlnAsn: 1.742 ± 1.528
GlnPro: 3.484 ± 0.682
GlnGln: 0.0 ± 0.0
GlnArg: 3.484 ± 0.682
GlnSer: 1.742 ± 1.528
GlnThr: 0.0 ± 0.0
GlnVal: 1.742 ± 0.846
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.226 ± 2.538
ArgCys: 1.742 ± 0.846
ArgAsp: 5.226 ± 2.211
ArgGlu: 0.0 ± 0.0
ArgPhe: 6.969 ± 3.739
ArgGly: 0.0 ± 0.0
ArgHis: 0.0 ± 0.0
ArgIle: 3.484 ± 1.692
ArgLys: 1.742 ± 0.846
ArgLeu: 3.484 ± 1.692
ArgMet: 0.0 ± 1.003
ArgAsn: 3.484 ± 0.682
ArgPro: 3.484 ± 1.692
ArgGln: 0.0 ± 0.0
ArgArg: 5.226 ± 2.538
ArgSer: 8.711 ± 0.519
ArgThr: 3.484 ± 1.692
ArgVal: 5.226 ± 2.211
ArgTrp: 1.742 ± 0.846
ArgTyr: 1.742 ± 0.846
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 1.742 ± 0.846
SerCys: 0.0 ± 0.0
SerAsp: 6.969 ± 1.365
SerGlu: 1.742 ± 0.846
SerPhe: 5.226 ± 2.211
SerGly: 5.226 ± 2.538
SerHis: 0.0 ± 0.0
SerIle: 3.484 ± 3.057
SerLys: 1.742 ± 1.528
SerLeu: 5.226 ± 0.163
SerMet: 1.742 ± 1.528
SerAsn: 0.0 ± 0.0
SerPro: 1.742 ± 0.846
SerGln: 3.484 ± 0.682
SerArg: 0.0 ± 0.0
SerSer: 6.969 ± 3.739
SerThr: 5.226 ± 0.163
SerVal: 3.484 ± 1.692
SerTrp: 3.484 ± 0.682
SerTyr: 1.742 ± 1.528
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.226 ± 0.163
ThrCys: 0.0 ± 0.0
ThrAsp: 1.742 ± 1.528
ThrGlu: 1.742 ± 0.846
ThrPhe: 3.484 ± 1.692
ThrGly: 5.226 ± 0.163
ThrHis: 0.0 ± 0.0
ThrIle: 5.226 ± 0.163
ThrLys: 3.484 ± 0.682
ThrLeu: 5.226 ± 2.538
ThrMet: 3.484 ± 0.682
ThrAsn: 1.742 ± 0.846
ThrPro: 6.969 ± 3.383
ThrGln: 1.742 ± 1.528
ThrArg: 6.969 ± 3.383
ThrSer: 3.484 ± 0.682
ThrThr: 6.969 ± 3.383
ThrVal: 3.484 ± 1.692
ThrTrp: 0.0 ± 0.0
ThrTyr: 1.742 ± 1.528
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.484 ± 1.692
ValCys: 0.0 ± 0.0
ValAsp: 0.0 ± 0.0
ValGlu: 5.226 ± 2.538
ValPhe: 1.742 ± 0.846
ValGly: 6.969 ± 1.009
ValHis: 1.742 ± 0.846
ValIle: 1.742 ± 0.846
ValLys: 5.226 ± 2.538
ValLeu: 5.226 ± 2.211
ValMet: 0.0 ± 0.0
ValAsn: 1.742 ± 0.846
ValPro: 5.226 ± 0.163
ValGln: 3.484 ± 3.057
ValArg: 5.226 ± 2.211
ValSer: 5.226 ± 4.585
ValThr: 1.742 ± 0.846
ValVal: 8.711 ± 0.519
ValTrp: 3.484 ± 1.692
ValTyr: 6.969 ± 1.365
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.0 ± 0.0
TrpPhe: 3.484 ± 1.692
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 3.484 ± 1.692
TrpLys: 0.0 ± 0.0
TrpLeu: 0.0 ± 0.0
TrpMet: 0.0 ± 0.0
TrpAsn: 1.742 ± 0.846
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 1.742 ± 1.528
TrpSer: 0.0 ± 0.0
TrpThr: 1.742 ± 0.846
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.484 ± 3.057
TyrCys: 1.742 ± 0.846
TyrAsp: 6.969 ± 1.365
TyrGlu: 1.742 ± 1.528
TyrPhe: 0.0 ± 0.0
TyrGly: 5.226 ± 2.538
TyrHis: 0.0 ± 0.0
TyrIle: 1.742 ± 1.528
TyrLys: 0.0 ± 0.0
TyrLeu: 0.0 ± 0.0
TyrMet: 0.0 ± 0.0
TyrAsn: 1.742 ± 1.528
TyrPro: 3.484 ± 0.682
TyrGln: 0.0 ± 0.0
TyrArg: 3.484 ± 3.057
TyrSer: 0.0 ± 0.0
TyrThr: 5.226 ± 2.211
TyrVal: 3.484 ± 0.682
TyrTrp: 0.0 ± 0.0
TyrTyr: 3.484 ± 3.057
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (575 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
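A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration under stated assumptions, not the database's actual procedure: the function names are hypothetical, each resample is assumed to draw as many proteins (with replacement) as the proteome contains, and the reported error is assumed to be the standard deviation of the resampled frequencies.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins, dipeptide):
    """Frequency of one dipeptide (per mille) over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Spread of the frequency when whole proteins are resampled.

    Assumptions: same number of proteins per resample as in the input,
    and the error is the population standard deviation of the estimates.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, dipeptide))
    return statistics.pstdev(estimates)
```

In this sketch, a dipeptide that is absent from every protein has a frequency of 0 in every resample, so its estimated error is 0 as well.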

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in UniProt
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski