Amino acid dipeptide frequency for Lake Sarah-associated circular virus-2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
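As an illustration, dipeptide frequencies can be counted roughly as follows. This is a minimal sketch, not the code used to build the table: the function name `dipeptide_frequencies` is hypothetical, sequences are assumed to be given in one-letter codes, pairs are assumed to overlap, and pairs are assumed never to span two proteins.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return dipeptide frequencies in per mille for a list of protein sequences.

    Assumptions (not taken from the source): overlapping pairs are counted,
    and no pair is formed across the boundary between two proteins.
    """
    counts = Counter()
    for seq in sequences:
        # Slide a window of width 2 along each protein separately.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Scale to per mille so the values are comparable to the table.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy example with two short, made-up sequences.
freqs = dipeptide_frequencies(["MKKLA", "AKK"])
```

In the toy example there are six dipeptides in total, two of which are `KK`, so `freqs["KK"]` is one third of 1000.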

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
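The expected value and the over/under classification can be sketched as below. The function names are hypothetical, and the exact threshold used for the colouring in the table is an assumption here (a plain comparison against the uniform expectation).

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, alphabet_size=20):
    """Label a value as 'over' (red), 'under' (blue) or 'expected'.

    Assumption: the comparison is made against the uniform expectation only.
    """
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "expected"


# Values taken from the table: AlaAla (7.508) and AlaGlu (1.502).
labels = [classify(7.508), classify(1.502)]
```

With 20 amino acids the expectation is 2.5‰; with Xaa included (21 symbols) it drops to about 2.27‰.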

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
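A short sketch of the unit conversion, using hypothetical helper names:

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


# AlaAla is listed as 7.508 per mille, i.e. a fraction of about 0.007508.
ala_ala_fraction = per_mille_to_fraction(7.508)
```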

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.508 ± 4.54
AlaCys: 3.003 ± 1.816
AlaAsp: 3.003 ± 1.816
AlaGlu: 1.502 ± 1.145
AlaPhe: 1.502 ± 1.145
AlaGly: 3.003 ± 1.816
AlaHis: 1.502 ± 0.908
AlaIle: 7.508 ± 1.62
AlaLys: 1.502 ± 1.145
AlaLeu: 4.505 ± 1.383
AlaMet: 3.003 ± 0.872
AlaAsn: 3.003 ± 1.816
AlaPro: 1.502 ± 0.908
AlaGln: 1.502 ± 1.145
AlaArg: 4.505 ± 1.383
AlaSer: 4.505 ± 0.671
AlaThr: 3.003 ± 0.237
AlaVal: 6.006 ± 3.632
AlaTrp: 0.0 ± 0.0
AlaTyr: 4.505 ± 2.724
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 3.003 ± 0.237
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 3.003 ± 0.237
CysGly: 0.0 ± 0.0
CysHis: 1.502 ± 0.908
CysIle: 4.505 ± 3.436
CysLys: 1.502 ± 0.908
CysLeu: 1.502 ± 0.908
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 0.0 ± 0.0
CysThr: 1.502 ± 1.145
CysVal: 1.502 ± 0.908
CysTrp: 0.0 ± 0.0
CysTyr: 1.502 ± 1.145
CysXaa: 0.0 ± 0.0
Asp
AspAla: 0.0 ± 0.0
AspCys: 3.003 ± 1.816
AspAsp: 3.003 ± 2.291
AspGlu: 4.505 ± 0.671
AspPhe: 4.505 ± 3.436
AspGly: 6.006 ± 0.475
AspHis: 0.0 ± 0.0
AspIle: 1.502 ± 0.908
AspLys: 4.505 ± 0.671
AspLeu: 4.505 ± 1.383
AspMet: 1.502 ± 0.908
AspAsn: 0.0 ± 0.0
AspPro: 1.502 ± 0.908
AspGln: 3.003 ± 2.291
AspArg: 0.0 ± 0.0
AspSer: 3.003 ± 1.816
AspThr: 4.505 ± 0.671
AspVal: 6.006 ± 1.579
AspTrp: 4.505 ± 3.436
AspTyr: 3.003 ± 2.291
AspXaa: 0.0 ± 0.0
Glu
GluAla: 0.0 ± 0.0
GluCys: 0.0 ± 0.0
GluAsp: 1.502 ± 1.145
GluGlu: 1.502 ± 1.145
GluPhe: 1.502 ± 1.145
GluGly: 1.502 ± 0.908
GluHis: 0.0 ± 0.0
GluIle: 1.502 ± 1.145
GluLys: 3.003 ± 1.816
GluLeu: 9.009 ± 0.712
GluMet: 0.0 ± 0.0
GluAsn: 0.0 ± 0.0
GluPro: 1.502 ± 1.145
GluGln: 0.0 ± 0.0
GluArg: 3.003 ± 2.291
GluSer: 3.003 ± 2.291
GluThr: 3.003 ± 0.237
GluVal: 3.003 ± 0.237
GluTrp: 1.502 ± 1.145
GluTyr: 4.505 ± 3.436
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.0 ± 0.0
PheCys: 0.0 ± 0.0
PheAsp: 3.003 ± 2.291
PheGlu: 1.502 ± 1.145
PhePhe: 4.505 ± 1.383
PheGly: 1.502 ± 0.908
PheHis: 3.003 ± 0.237
PheIle: 1.502 ± 1.145
PheLys: 6.006 ± 2.528
PheLeu: 4.505 ± 0.671
PheMet: 4.505 ± 0.671
PheAsn: 3.003 ± 2.291
PhePro: 1.502 ± 0.908
PheGln: 1.502 ± 1.145
PheArg: 3.003 ± 0.237
PheSer: 0.0 ± 0.0
PheThr: 3.003 ± 2.291
PheVal: 6.006 ± 1.579
PheTrp: 0.0 ± 0.0
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 10.511 ± 2.249
GlyCys: 0.0 ± 0.0
GlyAsp: 0.0 ± 0.0
GlyGlu: 3.003 ± 0.237
GlyPhe: 1.502 ± 0.908
GlyGly: 4.505 ± 2.724
GlyHis: 3.003 ± 0.237
GlyIle: 1.502 ± 0.908
GlyLys: 12.012 ± 3.157
GlyLeu: 1.502 ± 0.908
GlyMet: 0.0 ± 0.0
GlyAsn: 3.003 ± 0.237
GlyPro: 3.003 ± 0.237
GlyGln: 4.505 ± 2.724
GlyArg: 4.505 ± 2.724
GlySer: 4.505 ± 0.671
GlyThr: 3.003 ± 0.237
GlyVal: 0.0 ± 0.0
GlyTrp: 0.0 ± 0.0
GlyTyr: 3.003 ± 0.237
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 3.003 ± 0.237
HisPhe: 1.502 ± 0.908
HisGly: 1.502 ± 1.145
HisHis: 1.502 ± 1.145
HisIle: 3.003 ± 1.816
HisLys: 0.0 ± 0.0
HisLeu: 0.0 ± 0.0
HisMet: 1.502 ± 0.908
HisAsn: 0.0 ± 0.0
HisPro: 1.502 ± 1.145
HisGln: 1.502 ± 1.145
HisArg: 1.502 ± 0.908
HisSer: 3.003 ± 1.816
HisThr: 0.0 ± 0.0
HisVal: 9.009 ± 0.712
HisTrp: 0.0 ± 0.0
HisTyr: 3.003 ± 1.816
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.003 ± 1.816
IleCys: 0.0 ± 0.0
IleAsp: 7.508 ± 0.433
IleGlu: 1.502 ± 1.145
IlePhe: 4.505 ± 0.671
IleGly: 3.003 ± 0.237
IleHis: 0.0 ± 0.0
IleIle: 1.502 ± 1.145
IleLys: 3.003 ± 0.237
IleLeu: 4.505 ± 3.436
IleMet: 0.0 ± 0.0
IleAsn: 3.003 ± 0.237
IlePro: 1.502 ± 1.145
IleGln: 1.502 ± 0.908
IleArg: 3.003 ± 0.237
IleSer: 3.003 ± 0.237
IleThr: 6.006 ± 0.475
IleVal: 4.505 ± 0.671
IleTrp: 0.0 ± 0.0
IleTyr: 1.502 ± 0.908
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.502 ± 0.908
LysCys: 1.502 ± 1.145
LysAsp: 4.505 ± 0.671
LysGlu: 0.0 ± 0.0
LysPhe: 3.003 ± 1.816
LysGly: 3.003 ± 1.816
LysHis: 3.003 ± 1.816
LysIle: 1.502 ± 0.908
LysLys: 6.006 ± 2.528
LysLeu: 6.006 ± 0.475
LysMet: 1.502 ± 0.908
LysAsn: 1.502 ± 0.908
LysPro: 4.505 ± 3.436
LysGln: 4.505 ± 3.436
LysArg: 9.009 ± 3.395
LysSer: 3.003 ± 0.237
LysThr: 3.003 ± 2.291
LysVal: 1.502 ± 1.145
LysTrp: 3.003 ± 0.237
LysTyr: 3.003 ± 0.237
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.006 ± 3.632
LeuCys: 0.0 ± 0.0
LeuAsp: 4.505 ± 1.383
LeuGlu: 4.505 ± 3.436
LeuPhe: 3.003 ± 0.237
LeuGly: 4.505 ± 1.383
LeuHis: 1.502 ± 1.145
LeuIle: 4.505 ± 1.383
LeuLys: 0.0 ± 0.0
LeuLeu: 4.505 ± 1.383
LeuMet: 3.003 ± 1.816
LeuAsn: 6.006 ± 2.528
LeuPro: 4.505 ± 2.724
LeuGln: 3.003 ± 0.237
LeuArg: 1.502 ± 1.145
LeuSer: 4.505 ± 1.383
LeuThr: 4.505 ± 0.671
LeuVal: 1.502 ± 1.145
LeuTrp: 3.003 ± 2.291
LeuTyr: 3.003 ± 2.291
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.003 ± 0.237
MetCys: 1.502 ± 0.908
MetAsp: 0.0 ± 0.0
MetGlu: 0.0 ± 0.0
MetPhe: 0.0 ± 0.0
MetGly: 0.0 ± 0.0
MetHis: 1.502 ± 0.908
MetIle: 1.502 ± 0.908
MetLys: 3.003 ± 0.237
MetLeu: 3.003 ± 0.237
MetMet: 0.0 ± 0.0
MetAsn: 1.502 ± 0.908
MetPro: 1.502 ± 0.908
MetGln: 3.003 ± 0.237
MetArg: 1.502 ± 0.908
MetSer: 0.0 ± 0.0
MetThr: 0.0 ± 0.0
MetVal: 1.502 ± 0.908
MetTrp: 0.0 ± 0.0
MetTyr: 1.502 ± 0.908
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.003 ± 0.237
AsnCys: 1.502 ± 1.145
AsnAsp: 1.502 ± 0.908
AsnGlu: 1.502 ± 1.145
AsnPhe: 1.502 ± 1.145
AsnGly: 3.003 ± 0.237
AsnHis: 1.502 ± 0.908
AsnIle: 7.508 ± 0.433
AsnLys: 1.502 ± 1.145
AsnLeu: 0.0 ± 0.0
AsnMet: 0.0 ± 0.0
AsnAsn: 0.0 ± 0.0
AsnPro: 1.502 ± 0.908
AsnGln: 1.502 ± 1.145
AsnArg: 4.505 ± 2.724
AsnSer: 1.502 ± 0.908
AsnThr: 0.0 ± 0.0
AsnVal: 3.003 ± 1.816
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.0 ± 0.0
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.003 ± 2.291
ProCys: 1.502 ± 1.145
ProAsp: 0.0 ± 0.0
ProGlu: 1.502 ± 0.908
ProPhe: 0.0 ± 0.0
ProGly: 3.003 ± 1.816
ProHis: 3.003 ± 2.291
ProIle: 1.502 ± 0.908
ProLys: 3.003 ± 1.816
ProLeu: 1.502 ± 1.145
ProMet: 0.0 ± 0.0
ProAsn: 1.502 ± 1.145
ProPro: 1.502 ± 1.145
ProGln: 4.505 ± 3.436
ProArg: 6.006 ± 2.528
ProSer: 7.508 ± 4.54
ProThr: 7.508 ± 0.433
ProVal: 3.003 ± 0.237
ProTrp: 0.0 ± 0.0
ProTyr: 0.0 ± 0.0
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.502 ± 0.908
GlnCys: 0.0 ± 0.0
GlnAsp: 6.006 ± 0.475
GlnGlu: 3.003 ± 2.291
GlnPhe: 1.502 ± 1.145
GlnGly: 0.0 ± 0.0
GlnHis: 0.0 ± 0.0
GlnIle: 1.502 ± 1.145
GlnLys: 3.003 ± 0.237
GlnLeu: 1.502 ± 1.145
GlnMet: 1.502 ± 0.908
GlnAsn: 3.003 ± 1.816
GlnPro: 1.502 ± 0.908
GlnGln: 0.0 ± 0.0
GlnArg: 1.502 ± 1.145
GlnSer: 1.502 ± 1.145
GlnThr: 3.003 ± 0.237
GlnVal: 1.502 ± 0.908
GlnTrp: 3.003 ± 2.291
GlnTyr: 1.502 ± 1.145
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.505 ± 0.671
ArgCys: 1.502 ± 1.145
ArgAsp: 7.508 ± 3.674
ArgGlu: 0.0 ± 0.0
ArgPhe: 1.502 ± 0.908
ArgGly: 7.508 ± 4.54
ArgHis: 1.502 ± 1.145
ArgIle: 1.502 ± 0.908
ArgLys: 4.505 ± 3.436
ArgLeu: 3.003 ± 0.237
ArgMet: 1.502 ± 0.908
ArgAsn: 0.0 ± 0.0
ArgPro: 3.003 ± 2.291
ArgGln: 0.0 ± 0.0
ArgArg: 1.502 ± 0.908
ArgSer: 13.514 ± 6.119
ArgThr: 6.006 ± 0.475
ArgVal: 3.003 ± 0.237
ArgTrp: 0.0 ± 0.0
ArgTyr: 3.003 ± 1.816
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 0.0 ± 0.0
SerCys: 0.0 ± 0.0
SerAsp: 3.003 ± 0.237
SerGlu: 4.505 ± 3.436
SerPhe: 3.003 ± 0.237
SerGly: 9.009 ± 3.395
SerHis: 3.003 ± 1.816
SerIle: 1.502 ± 0.908
SerLys: 4.505 ± 2.724
SerLeu: 4.505 ± 1.383
SerMet: 1.502 ± 0.908
SerAsn: 1.502 ± 1.145
SerPro: 3.003 ± 0.237
SerGln: 3.003 ± 1.816
SerArg: 7.508 ± 2.487
SerSer: 6.006 ± 3.632
SerThr: 7.508 ± 2.487
SerVal: 6.006 ± 3.632
SerTrp: 1.502 ± 0.908
SerTyr: 1.502 ± 0.908
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 12.012 ± 0.95
ThrCys: 1.502 ± 0.908
ThrAsp: 7.508 ± 2.487
ThrGlu: 1.502 ± 1.145
ThrPhe: 7.508 ± 1.62
ThrGly: 4.505 ± 0.671
ThrHis: 3.003 ± 1.816
ThrIle: 1.502 ± 0.908
ThrLys: 1.502 ± 0.908
ThrLeu: 4.505 ± 0.671
ThrMet: 0.0 ± 0.0
ThrAsn: 0.0 ± 0.0
ThrPro: 4.505 ± 1.383
ThrGln: 0.0 ± 0.0
ThrArg: 4.505 ± 0.671
ThrSer: 3.003 ± 0.237
ThrThr: 7.508 ± 0.433
ThrVal: 3.003 ± 0.237
ThrTrp: 0.0 ± 0.0
ThrTyr: 4.505 ± 3.436
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.505 ± 2.724
ValCys: 4.505 ± 3.436
ValAsp: 4.505 ± 1.383
ValGlu: 3.003 ± 0.237
ValPhe: 1.502 ± 1.145
ValGly: 4.505 ± 2.724
ValHis: 1.502 ± 1.145
ValIle: 4.505 ± 0.671
ValLys: 4.505 ± 1.383
ValLeu: 1.502 ± 0.908
ValMet: 1.502 ± 0.908
ValAsn: 3.003 ± 1.816
ValPro: 6.006 ± 0.475
ValGln: 1.502 ± 0.908
ValArg: 1.502 ± 1.145
ValSer: 7.508 ± 4.54
ValThr: 4.505 ± 2.724
ValVal: 7.508 ± 2.487
ValTrp: 1.502 ± 0.908
ValTyr: 4.505 ± 0.671
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.502 ± 1.145
TrpCys: 0.0 ± 0.0
TrpAsp: 1.502 ± 0.908
TrpGlu: 1.502 ± 0.908
TrpPhe: 0.0 ± 0.0
TrpGly: 1.502 ± 1.145
TrpHis: 0.0 ± 0.0
TrpIle: 3.003 ± 2.291
TrpLys: 0.0 ± 0.0
TrpLeu: 4.505 ± 1.383
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 0.0 ± 0.0
TrpSer: 1.502 ± 1.145
TrpThr: 1.502 ± 1.145
TrpVal: 1.502 ± 1.145
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.502 ± 1.145
TyrCys: 0.0 ± 0.0
TyrAsp: 0.0 ± 0.0
TyrGlu: 1.502 ± 0.908
TyrPhe: 3.003 ± 2.291
TyrGly: 1.502 ± 1.145
TyrHis: 1.502 ± 0.908
TyrIle: 0.0 ± 0.0
TyrLys: 1.502 ± 1.145
TyrLeu: 3.003 ± 2.291
TyrMet: 1.502 ± 1.82
TyrAsn: 4.505 ± 2.724
TyrPro: 6.006 ± 0.475
TyrGln: 1.502 ± 0.908
TyrArg: 6.006 ± 0.475
TyrSer: 1.502 ± 0.908
TyrThr: 3.003 ± 1.816
TyrVal: 4.505 ± 1.383
TyrTrp: 0.0 ± 0.0
TyrTyr: 3.003 ± 0.237
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (667 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
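One way such a protein-level bootstrap could look is sketched below. This is an assumption-laden illustration, not the database's actual procedure: the function name `bootstrap_error` is hypothetical, whole proteins are resampled with replacement, and the error is taken to be the sample standard deviation over the replicates.

```python
import random
from collections import Counter
from statistics import stdev


def bootstrap_error(sequences, pair, n_replicates=100, seed=0):
    """Estimate the spread of one dipeptide frequency (in per mille).

    Assumptions: proteins (not residues) are resampled with replacement,
    and the reported error is the standard deviation across replicates.
    """
    rng = random.Random(seed)  # fixed seed for reproducibility
    estimates = []
    for _ in range(n_replicates):
        # Draw as many proteins as the proteome contains, with replacement.
        sample = [rng.choice(sequences) for _ in sequences]
        counts = Counter()
        for seq in sample:
            counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
        total = sum(counts.values())
        estimates.append(1000.0 * counts[pair] / total if total else 0.0)
    return stdev(estimates)


# Toy proteome of two made-up proteins with very different composition.
err = bootstrap_error(["AAAA", "CCCC"], "AA")
```

With only two proteins, as in this proteome, each replicate can take very few distinct values, which is why the errors in the table cluster around a small set of numbers.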

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski