Amino acid dipeptide frequency for Lake Sarah-associated circular virus-7

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 standard amino acids equally likely, each dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
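
As a rough illustration of how such frequencies can be computed and compared with the uniform expectation, here is a minimal Python sketch. It is not the Proteome-pI implementation: the function name `dipeptide_frequencies`, the toy sequences, and the choice to count overlapping pairs within each protein only are assumptions made for the example.

```python
from collections import Counter
from itertools import product

# The 20 standard residues as one-letter codes (the table uses three-letter codes).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"  # 'X' plays the role of Xaa


def dipeptide_frequencies(proteins):
    """Return {dipeptide: fraction} for all 441 ordered residue pairs.

    Assumption: overlapping pairs are counted inside each protein only,
    never across protein boundaries; non-standard letters become 'X'.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


# Uniform expectation over the 20 standard amino acids: 1/400 = 0.25%.
expected = 1 / len(STANDARD) ** 2

# Hypothetical toy sequences, not the proteins of this virus.
toy = ["MKKAAL", "AAGKX"]
freqs = dipeptide_frequencies(toy)
overrepresented = sorted(dp for dp, f in freqs.items() if f > expected)
print(overrepresented)
```

With so few residues every observed pair exceeds the 0.25% expectation, which is also why small proteomes such as this one show many zeros next to a handful of high values.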

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
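
To make the unit conversion concrete, the following small sketch converts a per mille value to a fraction and compares it with the uniform expectation of 1/400. The helper name `per_mille_to_fraction` is hypothetical; the AlaAla value is taken from the table below.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


# Uniform expectation for 400 dipeptides, expressed on the table's scale.
expected_per_mille = 1000 / 400  # 2.5 per mille, i.e. 0.25%

ala_ala = 10.279  # AlaAla entry of the table, in per mille
fraction = per_mille_to_fraction(ala_ala)  # roughly 0.0103
fold = ala_ala / expected_per_mille  # how many times the uniform expectation

print(f"fraction={fraction:.6f}, fold over expectation={fold:.2f}")
```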

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 10.279 ± 2.082
AlaCys: 0.0 ± 0.0
AlaAsp: 10.279 ± 3.709
AlaGlu: 0.0 ± 0.0
AlaPhe: 5.874 ± 1.465
AlaGly: 5.874 ± 0.465
AlaHis: 1.468 ± 1.081
AlaIle: 4.405 ± 1.314
AlaLys: 8.811 ± 3.163
AlaLeu: 7.342 ± 1.546
AlaMet: 0.0 ± 0.0
AlaAsn: 1.468 ± 0.849
AlaPro: 4.405 ± 1.314
AlaGln: 1.468 ± 0.849
AlaArg: 7.342 ± 3.476
AlaSer: 5.874 ± 1.465
AlaThr: 2.937 ± 1.698
AlaVal: 2.937 ± 0.232
AlaTrp: 2.937 ± 0.232
AlaTyr: 8.811 ± 3.163
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 1.468 ± 0.849
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 2.937 ± 0.232
CysGly: 2.937 ± 0.232
CysHis: 1.468 ± 1.081
CysIle: 0.0 ± 0.0
CysLys: 0.0 ± 0.0
CysLeu: 2.937 ± 0.232
CysMet: 1.468 ± 0.849
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 2.937 ± 2.162
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.405 ± 1.314
AspCys: 0.0 ± 0.0
AspAsp: 5.874 ± 0.465
AspGlu: 2.937 ± 2.162
AspPhe: 1.468 ± 1.081
AspGly: 4.405 ± 1.314
AspHis: 2.937 ± 2.162
AspIle: 5.874 ± 0.465
AspLys: 0.0 ± 0.0
AspLeu: 2.937 ± 2.162
AspMet: 2.937 ± 2.162
AspAsn: 0.0 ± 0.0
AspPro: 2.937 ± 0.232
AspGln: 2.937 ± 1.698
AspArg: 0.0 ± 0.0
AspSer: 1.468 ± 0.849
AspThr: 2.937 ± 0.232
AspVal: 4.405 ± 0.616
AspTrp: 0.0 ± 0.0
AspTyr: 1.468 ± 1.081
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.405 ± 0.616
GluCys: 1.468 ± 1.081
GluAsp: 0.0 ± 0.0
GluGlu: 2.937 ± 0.232
GluPhe: 2.937 ± 0.232
GluGly: 1.468 ± 1.081
GluHis: 0.0 ± 0.0
GluIle: 7.342 ± 3.476
GluLys: 4.405 ± 1.314
GluLeu: 1.468 ± 0.849
GluMet: 0.0 ± 0.0
GluAsn: 0.0 ± 0.0
GluPro: 1.468 ± 0.849
GluGln: 0.0 ± 0.0
GluArg: 2.937 ± 0.232
GluSer: 2.937 ± 0.232
GluThr: 1.468 ± 1.081
GluVal: 5.874 ± 1.465
GluTrp: 0.0 ± 0.0
GluTyr: 1.468 ± 1.081
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.468 ± 1.081
PheCys: 1.468 ± 1.081
PheAsp: 1.468 ± 1.081
PheGlu: 0.0 ± 0.0
PhePhe: 1.468 ± 1.081
PheGly: 7.342 ± 1.546
PheHis: 0.0 ± 0.0
PheIle: 4.405 ± 3.244
PheLys: 5.874 ± 1.465
PheLeu: 0.0 ± 0.0
PheMet: 0.0 ± 0.0
PheAsn: 0.0 ± 0.0
PhePro: 1.468 ± 0.849
PheGln: 0.0 ± 0.0
PheArg: 4.405 ± 0.616
PheSer: 2.937 ± 0.232
PheThr: 4.405 ± 1.314
PheVal: 5.874 ± 1.465
PheTrp: 0.0 ± 0.0
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.874 ± 1.465
GlyCys: 1.468 ± 0.849
GlyAsp: 2.937 ± 2.162
GlyGlu: 2.937 ± 0.232
GlyPhe: 0.0 ± 0.0
GlyGly: 7.342 ± 2.314
GlyHis: 0.0 ± 0.0
GlyIle: 0.0 ± 0.0
GlyLys: 10.279 ± 0.152
GlyLeu: 7.342 ± 0.384
GlyMet: 2.937 ± 1.386
GlyAsn: 4.405 ± 2.547
GlyPro: 0.0 ± 0.0
GlyGln: 2.937 ± 1.698
GlyArg: 4.405 ± 1.314
GlySer: 4.405 ± 0.616
GlyThr: 5.874 ± 0.465
GlyVal: 4.405 ± 0.616
GlyTrp: 0.0 ± 0.0
GlyTyr: 4.405 ± 3.244
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 1.468 ± 1.081
HisPhe: 1.468 ± 0.849
HisGly: 1.468 ± 1.081
HisHis: 1.468 ± 0.849
HisIle: 1.468 ± 1.081
HisLys: 1.468 ± 1.081
HisLeu: 2.937 ± 0.232
HisMet: 1.468 ± 0.849
HisAsn: 0.0 ± 0.0
HisPro: 0.0 ± 0.0
HisGln: 0.0 ± 0.0
HisArg: 0.0 ± 0.0
HisSer: 0.0 ± 0.0
HisThr: 1.468 ± 0.849
HisVal: 2.937 ± 0.232
HisTrp: 0.0 ± 0.0
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.937 ± 2.162
IleCys: 1.468 ± 0.849
IleAsp: 4.405 ± 1.314
IleGlu: 4.405 ± 1.314
IlePhe: 2.937 ± 2.162
IleGly: 2.937 ± 1.698
IleHis: 1.468 ± 1.081
IleIle: 2.937 ± 0.232
IleLys: 7.342 ± 0.384
IleLeu: 2.937 ± 0.232
IleMet: 0.0 ± 0.768
IleAsn: 5.874 ± 0.465
IlePro: 1.468 ± 0.849
IleGln: 4.405 ± 1.314
IleArg: 5.874 ± 4.325
IleSer: 2.937 ± 2.162
IleThr: 0.0 ± 0.0
IleVal: 4.405 ± 0.616
IleTrp: 1.468 ± 1.081
IleTyr: 4.405 ± 2.547
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.874 ± 0.465
LysCys: 0.0 ± 0.0
LysAsp: 1.468 ± 1.081
LysGlu: 5.874 ± 1.465
LysPhe: 0.0 ± 0.0
LysGly: 5.874 ± 1.465
LysHis: 1.468 ± 0.849
LysIle: 7.342 ± 1.546
LysLys: 8.811 ± 5.093
LysLeu: 5.874 ± 1.465
LysMet: 0.0 ± 0.0
LysAsn: 4.405 ± 1.314
LysPro: 1.468 ± 0.849
LysGln: 1.468 ± 0.849
LysArg: 1.468 ± 0.849
LysSer: 4.405 ± 0.616
LysThr: 4.405 ± 1.314
LysVal: 5.874 ± 3.395
LysTrp: 5.874 ± 0.465
LysTyr: 0.0 ± 0.0
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.874 ± 1.465
LeuCys: 0.0 ± 0.0
LeuAsp: 1.468 ± 1.081
LeuGlu: 2.937 ± 2.162
LeuPhe: 4.405 ± 0.616
LeuGly: 4.405 ± 2.547
LeuHis: 1.468 ± 0.849
LeuIle: 1.468 ± 0.849
LeuLys: 1.468 ± 0.849
LeuLeu: 1.468 ± 1.081
LeuMet: 1.468 ± 0.849
LeuAsn: 5.874 ± 1.465
LeuPro: 4.405 ± 1.314
LeuGln: 1.468 ± 1.081
LeuArg: 4.405 ± 1.314
LeuSer: 8.811 ± 0.697
LeuThr: 7.342 ± 0.384
LeuVal: 2.937 ± 1.698
LeuTrp: 2.937 ± 0.232
LeuTyr: 1.468 ± 1.081
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.468 ± 1.081
MetCys: 0.0 ± 0.0
MetAsp: 1.468 ± 1.081
MetGlu: 2.937 ± 0.232
MetPhe: 2.937 ± 0.232
MetGly: 1.468 ± 0.849
MetHis: 1.468 ± 0.849
MetIle: 0.0 ± 0.0
MetLys: 0.0 ± 0.0
MetLeu: 0.0 ± 0.0
MetMet: 0.0 ± 0.0
MetAsn: 2.937 ± 1.698
MetPro: 1.468 ± 1.081
MetGln: 0.0 ± 0.0
MetArg: 0.0 ± 0.0
MetSer: 0.0 ± 0.0
MetThr: 0.0 ± 0.0
MetVal: 1.468 ± 1.081
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 8.811 ± 1.233
AsnCys: 0.0 ± 0.0
AsnAsp: 2.937 ± 1.698
AsnGlu: 0.0 ± 0.0
AsnPhe: 0.0 ± 0.0
AsnGly: 7.342 ± 2.314
AsnHis: 0.0 ± 0.0
AsnIle: 4.405 ± 0.616
AsnLys: 4.405 ± 2.547
AsnLeu: 7.342 ± 4.244
AsnMet: 0.0 ± 0.0
AsnAsn: 2.937 ± 0.232
AsnPro: 1.468 ± 1.081
AsnGln: 1.468 ± 1.081
AsnArg: 0.0 ± 0.0
AsnSer: 1.468 ± 0.849
AsnThr: 5.874 ± 3.395
AsnVal: 4.405 ± 0.616
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.0 ± 0.0
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.937 ± 0.232
ProCys: 2.937 ± 0.232
ProAsp: 2.937 ± 2.162
ProGlu: 0.0 ± 0.0
ProPhe: 1.468 ± 1.081
ProGly: 1.468 ± 0.849
ProHis: 0.0 ± 0.0
ProIle: 1.468 ± 0.849
ProLys: 4.405 ± 0.616
ProLeu: 2.937 ± 2.162
ProMet: 0.0 ± 0.0
ProAsn: 1.468 ± 0.849
ProPro: 2.937 ± 2.162
ProGln: 1.468 ± 0.849
ProArg: 1.468 ± 0.849
ProSer: 4.405 ± 0.616
ProThr: 4.405 ± 1.314
ProVal: 1.468 ± 1.081
ProTrp: 0.0 ± 0.0
ProTyr: 2.937 ± 1.698
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 7.342 ± 0.384
GlnCys: 0.0 ± 0.0
GlnAsp: 1.468 ± 1.081
GlnGlu: 2.937 ± 1.698
GlnPhe: 0.0 ± 0.0
GlnGly: 2.937 ± 0.232
GlnHis: 0.0 ± 0.0
GlnIle: 0.0 ± 0.0
GlnLys: 1.468 ± 1.081
GlnLeu: 1.468 ± 0.849
GlnMet: 0.0 ± 0.0
GlnAsn: 4.405 ± 0.616
GlnPro: 0.0 ± 0.0
GlnGln: 1.468 ± 0.849
GlnArg: 1.468 ± 0.849
GlnSer: 1.468 ± 0.849
GlnThr: 5.874 ± 1.465
GlnVal: 1.468 ± 0.849
GlnTrp: 0.0 ± 0.0
GlnTyr: 2.937 ± 0.232
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.405 ± 0.616
ArgCys: 2.937 ± 0.232
ArgAsp: 0.0 ± 0.0
ArgGlu: 1.468 ± 1.081
ArgPhe: 5.874 ± 2.395
ArgGly: 5.874 ± 1.465
ArgHis: 1.468 ± 1.081
ArgIle: 4.405 ± 1.314
ArgLys: 1.468 ± 0.849
ArgLeu: 2.937 ± 0.232
ArgMet: 1.468 ± 1.081
ArgAsn: 0.0 ± 0.0
ArgPro: 5.874 ± 2.395
ArgGln: 1.468 ± 1.081
ArgArg: 5.874 ± 0.465
ArgSer: 2.937 ± 2.162
ArgThr: 4.405 ± 1.314
ArgVal: 1.468 ± 1.081
ArgTrp: 1.468 ± 1.081
ArgTyr: 0.0 ± 0.0
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.342 ± 0.384
SerCys: 1.468 ± 1.081
SerAsp: 1.468 ± 0.849
SerGlu: 0.0 ± 0.0
SerPhe: 1.468 ± 0.849
SerGly: 1.468 ± 1.081
SerHis: 0.0 ± 0.0
SerIle: 5.874 ± 0.465
SerLys: 4.405 ± 1.314
SerLeu: 5.874 ± 2.395
SerMet: 1.468 ± 1.081
SerAsn: 4.405 ± 2.547
SerPro: 5.874 ± 3.395
SerGln: 5.874 ± 1.465
SerArg: 5.874 ± 2.395
SerSer: 0.0 ± 0.0
SerThr: 1.468 ± 0.849
SerVal: 2.937 ± 1.698
SerTrp: 0.0 ± 0.0
SerTyr: 4.405 ± 0.616
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.342 ± 1.546
ThrCys: 0.0 ± 0.0
ThrAsp: 1.468 ± 1.081
ThrGlu: 4.405 ± 0.616
ThrPhe: 5.874 ± 0.465
ThrGly: 2.937 ± 0.232
ThrHis: 2.937 ± 1.698
ThrIle: 5.874 ± 0.465
ThrLys: 2.937 ± 0.232
ThrLeu: 1.468 ± 0.849
ThrMet: 0.0 ± 0.0
ThrAsn: 0.0 ± 0.0
ThrPro: 2.937 ± 1.698
ThrGln: 2.937 ± 1.698
ThrArg: 5.874 ± 4.325
ThrSer: 10.279 ± 4.012
ThrThr: 8.811 ± 3.163
ThrVal: 2.937 ± 0.232
ThrTrp: 2.937 ± 0.232
ThrTyr: 0.0 ± 0.0
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.874 ± 1.465
ValCys: 0.0 ± 0.0
ValAsp: 7.342 ± 2.314
ValGlu: 5.874 ± 0.465
ValPhe: 1.468 ± 1.081
ValGly: 4.405 ± 1.314
ValHis: 0.0 ± 0.0
ValIle: 2.937 ± 1.698
ValLys: 1.468 ± 1.081
ValLeu: 0.0 ± 0.0
ValMet: 0.0 ± 0.0
ValAsn: 8.811 ± 5.093
ValPro: 1.468 ± 1.081
ValGln: 1.468 ± 0.849
ValArg: 4.405 ± 0.616
ValSer: 2.937 ± 1.698
ValThr: 2.937 ± 0.232
ValVal: 1.468 ± 0.849
ValTrp: 0.0 ± 0.0
ValTyr: 8.811 ± 3.163
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 4.405 ± 1.314
TrpCys: 0.0 ± 0.0
TrpAsp: 2.937 ± 0.232
TrpGlu: 1.468 ± 1.081
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 1.468 ± 1.081
TrpLys: 0.0 ± 0.0
TrpLeu: 1.468 ± 0.849
TrpMet: 0.0 ± 0.0
TrpAsn: 2.937 ± 0.232
TrpPro: 0.0 ± 0.0
TrpGln: 1.468 ± 0.849
TrpArg: 0.0 ± 0.0
TrpSer: 0.0 ± 0.0
TrpThr: 1.468 ± 1.081
TrpVal: 1.468 ± 0.849
TrpTrp: 1.468 ± 1.081
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.468 ± 0.849
TyrCys: 1.468 ± 1.081
TyrAsp: 0.0 ± 0.0
TyrGlu: 0.0 ± 0.0
TyrPhe: 0.0 ± 0.0
TyrGly: 0.0 ± 0.0
TyrHis: 0.0 ± 0.0
TyrIle: 4.405 ± 1.314
TyrLys: 2.937 ± 1.698
TyrLeu: 7.342 ± 2.314
TyrMet: 2.937 ± 0.232
TyrAsn: 2.937 ± 1.698
TyrPro: 1.468 ± 1.081
TyrGln: 4.405 ± 1.314
TyrArg: 0.0 ± 0.0
TyrSer: 4.405 ± 3.244
TyrThr: 5.874 ± 1.465
TyrVal: 2.937 ± 1.698
TyrTrp: 1.468 ± 0.849
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (682 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
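
The exact resampling procedure is not described here. One plausible reading, sketched below purely as an assumption, is that whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the standard deviation of those estimates is reported as the error. The function names, the fixed seed, the toy sequences, and the use of the population standard deviation are all illustrative choices.

```python
import random
import statistics


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    count = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            count += seq[i:i + 2] == dipeptide
    return 1000 * count / total if total else 0.0


def bootstrap_error(proteins, dipeptide, rounds=100, seed=0):
    """Spread of the frequency over protein-level bootstrap resamples.

    Assumption: each round draws len(proteins) whole proteins with
    replacement; the error is the standard deviation over all rounds.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.pstdev(estimates)


# Hypothetical toy proteome of two short proteins.
toy = ["MAAKKAAL", "GKKGTSAA"]
value = dipeptide_per_mille(toy, "AA")
error = bootstrap_error(toy, "AA")
print(f"AA: {value:.3f} ± {error:.3f}")
```

Under this scheme a dipeptide that occurs in none of the proteins always receives an error of 0, and with only two proteins each resample can take just a few distinct values, so the estimates are coarse.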

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in UniProt.
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski