Amino acid dipeptide frequency for Lake Sarah-associated circular virus-44

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
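The counting described above can be sketched in a few lines of Python. This is only an illustration, not the database's own code: the function names `dipeptide_permille` and `classify`, the toy sequences, and the choice to count overlapping pairs within each protein (never across protein boundaries) are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# One-letter to three-letter codes, in the order used by the table.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe", "G": "Gly",
    "H": "His", "I": "Ile", "K": "Lys", "L": "Leu", "M": "Met", "N": "Asn",
    "P": "Pro", "Q": "Gln", "R": "Arg", "S": "Ser", "T": "Thr", "V": "Val",
    "W": "Trp", "Y": "Tyr", "X": "Xaa",
}

# Expectation if all 400 standard dipeptides were equally likely: 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides."""
    counts = Counter()
    for seq in proteins:
        # Assumption: any letter outside the 20 standard ones is treated as Xaa.
        residues = [ONE_TO_THREE.get(aa, "Xaa") for aa in seq.upper()]
        # Overlapping pairs of neighbours; pairs never span two proteins.
        counts.update(a + b for a, b in zip(residues, residues[1:]))
    total = sum(counts.values())
    names = list(ONE_TO_THREE.values())
    if total == 0:
        return {a + b: 0.0 for a, b in product(names, names)}
    return {a + b: 1000.0 * counts[a + b] / total for a, b in product(names, names)}


def classify(freq_permille, expected=UNIFORM_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"   # would be marked in red
    if freq_permille < expected:
        return "under"  # would be marked in blue
    return "as expected"


# Hypothetical toy sequences, not the viral proteins behind the table.
freqs = dipeptide_permille(["AAAC", "GA"])
print(freqs["AlaAla"], classify(freqs["AlaAla"]))
```

For the toy input there are four dipeptides in total (AlaAla twice, AlaCys and GlyAla once each), so AlaAla comes out at 500‰ and is labelled as overrepresented.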

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
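As a small worked example of the unit conversion, the sketch below turns a per mille value into a plain fraction and compares it with the uniform expectation of 1/400. The helper names are made up for this illustration; the input is the ArgSer value listed in the table.

```python
# Expected fraction if all 400 standard dipeptides were equally likely.
UNIFORM_FRACTION = 1 / 400


def permille_to_fraction(value_permille):
    """Convert a per mille value to a fraction by multiplying by 10^-3."""
    return value_permille * 1e-3


def fold_vs_uniform(value_permille):
    """How many times more (or less) frequent than the uniform expectation."""
    return permille_to_fraction(value_permille) / UNIFORM_FRACTION


argser = 14.833  # ArgSer frequency from the table, in per mille
print(round(permille_to_fraction(argser), 6))
print(round(fold_vs_uniform(argser), 2))
```

ArgSer at 14.833‰ corresponds to a fraction of about 0.0148, which is a little under six times the uniform expectation.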

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.18 ± 0.741
AlaCys: 0.0 ± 0.0
AlaAsp: 1.236 ± 0.637
AlaGlu: 3.708 ± 0.052
AlaPhe: 0.0 ± 0.0
AlaGly: 4.944 ± 0.585
AlaHis: 1.236 ± 1.326
AlaIle: 3.708 ± 0.052
AlaLys: 1.236 ± 0.637
AlaLeu: 3.708 ± 0.052
AlaMet: 1.236 ± 1.326
AlaAsn: 0.0 ± 0.0
AlaPro: 4.944 ± 2.547
AlaGln: 2.472 ± 1.274
AlaArg: 3.708 ± 1.91
AlaSer: 6.18 ± 1.221
AlaThr: 6.18 ± 1.221
AlaVal: 8.653 ± 1.43
AlaTrp: 2.472 ± 0.689
AlaTyr: 4.944 ± 1.378
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 2.472 ± 1.274
CysHis: 0.0 ± 0.0
CysIle: 1.236 ± 1.326
CysLys: 1.236 ± 0.637
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 1.236 ± 1.326
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 2.472 ± 1.274
CysSer: 2.472 ± 0.689
CysThr: 0.0 ± 0.0
CysVal: 1.236 ± 0.637
CysTrp: 0.0 ± 0.0
CysTyr: 1.236 ± 1.326
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.708 ± 2.015
AspCys: 0.0 ± 0.0
AspAsp: 2.472 ± 0.689
AspGlu: 6.18 ± 2.704
AspPhe: 0.0 ± 0.0
AspGly: 2.472 ± 0.689
AspHis: 3.708 ± 0.052
AspIle: 0.0 ± 0.0
AspLys: 3.708 ± 1.91
AspLeu: 3.708 ± 1.91
AspMet: 1.236 ± 0.637
AspAsn: 2.472 ± 1.274
AspPro: 2.472 ± 2.652
AspGln: 0.0 ± 0.0
AspArg: 4.944 ± 0.585
AspSer: 3.708 ± 1.91
AspThr: 1.236 ± 0.637
AspVal: 3.708 ± 0.052
AspTrp: 3.708 ± 2.015
AspTyr: 0.0 ± 0.0
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.708 ± 2.015
GluCys: 0.0 ± 0.0
GluAsp: 0.0 ± 0.0
GluGlu: 0.0 ± 0.0
GluPhe: 3.708 ± 0.052
GluGly: 1.236 ± 1.326
GluHis: 1.236 ± 0.637
GluIle: 2.472 ± 1.274
GluLys: 1.236 ± 1.326
GluLeu: 2.472 ± 2.652
GluMet: 1.236 ± 0.637
GluAsn: 3.708 ± 0.052
GluPro: 2.472 ± 0.689
GluGln: 2.472 ± 0.689
GluArg: 1.236 ± 1.326
GluSer: 1.236 ± 1.326
GluThr: 2.472 ± 0.689
GluVal: 3.708 ± 0.052
GluTrp: 0.0 ± 0.0
GluTyr: 1.236 ± 1.326
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.0 ± 0.0
PheCys: 0.0 ± 0.0
PheAsp: 3.708 ± 2.015
PheGlu: 1.236 ± 1.326
PhePhe: 0.0 ± 0.0
PheGly: 1.236 ± 0.637
PheHis: 0.0 ± 0.0
PheIle: 2.472 ± 1.274
PheLys: 3.708 ± 0.052
PheLeu: 3.708 ± 2.015
PheMet: 0.0 ± 0.0
PheAsn: 2.472 ± 1.274
PhePro: 1.236 ± 0.637
PheGln: 0.0 ± 0.0
PheArg: 1.236 ± 1.326
PheSer: 2.472 ± 1.274
PheThr: 1.236 ± 0.637
PheVal: 1.236 ± 1.326
PheTrp: 1.236 ± 1.326
PheTyr: 1.236 ± 0.637
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.708 ± 0.052
GlyCys: 0.0 ± 0.0
GlyAsp: 2.472 ± 0.689
GlyGlu: 1.236 ± 0.637
GlyPhe: 1.236 ± 1.326
GlyGly: 3.708 ± 1.91
GlyHis: 1.236 ± 1.326
GlyIle: 1.236 ± 1.326
GlyLys: 3.708 ± 0.052
GlyLeu: 1.236 ± 0.637
GlyMet: 1.236 ± 0.637
GlyAsn: 8.653 ± 4.458
GlyPro: 2.472 ± 0.689
GlyGln: 1.236 ± 1.326
GlyArg: 6.18 ± 1.221
GlySer: 3.708 ± 2.015
GlyThr: 9.889 ± 3.132
GlyVal: 4.944 ± 2.547
GlyTrp: 2.472 ± 1.274
GlyTyr: 8.653 ± 0.533
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 1.236 ± 0.637
HisAsp: 2.472 ± 1.274
HisGlu: 1.236 ± 0.637
HisPhe: 0.0 ± 0.0
HisGly: 0.0 ± 0.0
HisHis: 2.472 ± 1.274
HisIle: 2.472 ± 0.689
HisLys: 2.472 ± 0.689
HisLeu: 0.0 ± 0.0
HisMet: 0.0 ± 0.0
HisAsn: 1.236 ± 0.637
HisPro: 1.236 ± 1.326
HisGln: 0.0 ± 0.0
HisArg: 0.0 ± 0.0
HisSer: 2.472 ± 0.689
HisThr: 3.708 ± 0.052
HisVal: 0.0 ± 0.0
HisTrp: 1.236 ± 1.326
HisTyr: 1.236 ± 1.326
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.708 ± 0.052
IleCys: 1.236 ± 0.637
IleAsp: 2.472 ± 0.689
IleGlu: 4.944 ± 1.378
IlePhe: 1.236 ± 0.637
IleGly: 7.417 ± 3.821
IleHis: 1.236 ± 0.637
IleIle: 6.18 ± 0.741
IleLys: 2.472 ± 1.274
IleLeu: 6.18 ± 0.741
IleMet: 1.236 ± 0.637
IleAsn: 2.472 ± 0.689
IlePro: 3.708 ± 2.015
IleGln: 2.472 ± 1.274
IleArg: 7.417 ± 2.067
IleSer: 3.708 ± 0.052
IleThr: 1.236 ± 1.326
IleVal: 3.708 ± 2.015
IleTrp: 1.236 ± 1.326
IleTyr: 2.472 ± 1.274
IleXaa: 0.0 ± 0.0
Lys
LysAla: 8.653 ± 2.495
LysCys: 1.236 ± 0.637
LysAsp: 2.472 ± 2.652
LysGlu: 0.0 ± 0.0
LysPhe: 2.472 ± 0.689
LysGly: 12.361 ± 4.406
LysHis: 0.0 ± 0.0
LysIle: 4.944 ± 1.378
LysLys: 7.417 ± 1.858
LysLeu: 4.944 ± 1.378
LysMet: 1.236 ± 0.637
LysAsn: 3.708 ± 0.052
LysPro: 2.472 ± 0.689
LysGln: 2.472 ± 0.689
LysArg: 4.944 ± 1.378
LysSer: 9.889 ± 1.169
LysThr: 2.472 ± 1.274
LysVal: 1.236 ± 0.637
LysTrp: 0.0 ± 0.0
LysTyr: 0.0 ± 0.0
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 2.472 ± 0.689
LeuCys: 0.0 ± 0.0
LeuAsp: 3.708 ± 0.052
LeuGlu: 4.944 ± 0.585
LeuPhe: 0.0 ± 0.0
LeuGly: 2.472 ± 0.689
LeuHis: 4.944 ± 3.341
LeuIle: 2.472 ± 0.689
LeuLys: 7.417 ± 2.067
LeuLeu: 2.472 ± 0.689
LeuMet: 0.0 ± 0.824
LeuAsn: 1.236 ± 0.637
LeuPro: 1.236 ± 1.326
LeuGln: 0.0 ± 0.0
LeuArg: 1.236 ± 1.326
LeuSer: 4.944 ± 2.547
LeuThr: 4.944 ± 0.585
LeuVal: 6.18 ± 1.221
LeuTrp: 2.472 ± 1.274
LeuTyr: 3.708 ± 0.052
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.236 ± 0.637
MetCys: 0.0 ± 0.0
MetAsp: 2.472 ± 1.274
MetGlu: 3.708 ± 3.977
MetPhe: 1.236 ± 0.637
MetGly: 1.236 ± 0.637
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 0.0 ± 0.0
MetLeu: 0.0 ± 0.0
MetMet: 0.0 ± 0.0
MetAsn: 1.236 ± 0.637
MetPro: 2.472 ± 1.274
MetGln: 0.0 ± 0.0
MetArg: 1.236 ± 0.637
MetSer: 2.472 ± 1.274
MetThr: 2.472 ± 1.274
MetVal: 1.236 ± 1.326
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.472 ± 1.274
AsnCys: 0.0 ± 0.0
AsnAsp: 3.708 ± 1.91
AsnGlu: 0.0 ± 0.0
AsnPhe: 1.236 ± 0.637
AsnGly: 4.944 ± 0.585
AsnHis: 0.0 ± 0.0
AsnIle: 9.889 ± 0.793
AsnLys: 3.708 ± 1.91
AsnLeu: 9.889 ± 3.132
AsnMet: 2.472 ± 1.077
AsnAsn: 1.236 ± 0.637
AsnPro: 0.0 ± 0.0
AsnGln: 0.0 ± 0.0
AsnArg: 1.236 ± 0.637
AsnSer: 2.472 ± 0.689
AsnThr: 2.472 ± 2.652
AsnVal: 6.18 ± 0.741
AsnTrp: 0.0 ± 0.0
AsnTyr: 2.472 ± 0.689
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.708 ± 0.052
ProCys: 0.0 ± 0.0
ProAsp: 2.472 ± 0.689
ProGlu: 1.236 ± 1.326
ProPhe: 3.708 ± 0.052
ProGly: 2.472 ± 1.274
ProHis: 2.472 ± 0.689
ProIle: 2.472 ± 1.274
ProLys: 4.944 ± 0.585
ProLeu: 1.236 ± 0.637
ProMet: 1.236 ± 0.637
ProAsn: 3.708 ± 2.015
ProPro: 2.472 ± 1.274
ProGln: 1.236 ± 1.326
ProArg: 3.708 ± 1.91
ProSer: 1.236 ± 1.326
ProThr: 8.653 ± 1.43
ProVal: 3.708 ± 2.015
ProTrp: 0.0 ± 0.0
ProTyr: 1.236 ± 0.637
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.236 ± 0.637
GlnCys: 0.0 ± 0.0
GlnAsp: 0.0 ± 0.0
GlnGlu: 1.236 ± 1.326
GlnPhe: 1.236 ± 1.326
GlnGly: 1.236 ± 1.326
GlnHis: 0.0 ± 0.0
GlnIle: 1.236 ± 0.637
GlnLys: 0.0 ± 0.0
GlnLeu: 1.236 ± 1.326
GlnMet: 1.236 ± 1.326
GlnAsn: 0.0 ± 0.0
GlnPro: 1.236 ± 0.637
GlnGln: 0.0 ± 0.0
GlnArg: 2.472 ± 0.689
GlnSer: 1.236 ± 0.637
GlnThr: 2.472 ± 1.274
GlnVal: 0.0 ± 0.0
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.944 ± 0.585
ArgCys: 3.708 ± 2.015
ArgAsp: 2.472 ± 1.274
ArgGlu: 0.0 ± 0.0
ArgPhe: 2.472 ± 1.274
ArgGly: 4.944 ± 1.378
ArgHis: 0.0 ± 0.0
ArgIle: 2.472 ± 0.689
ArgLys: 7.417 ± 1.858
ArgLeu: 2.472 ± 0.689
ArgMet: 2.472 ± 0.689
ArgAsn: 2.472 ± 1.274
ArgPro: 1.236 ± 0.637
ArgGln: 0.0 ± 0.0
ArgArg: 7.417 ± 2.067
ArgSer: 14.833 ± 1.754
ArgThr: 2.472 ± 1.274
ArgVal: 1.236 ± 0.637
ArgTrp: 1.236 ± 1.326
ArgTyr: 3.708 ± 2.015
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.708 ± 3.977
SerCys: 2.472 ± 0.689
SerAsp: 1.236 ± 1.326
SerGlu: 1.236 ± 0.637
SerPhe: 2.472 ± 0.689
SerGly: 3.708 ± 0.052
SerHis: 3.708 ± 1.91
SerIle: 6.18 ± 1.221
SerLys: 7.417 ± 0.104
SerLeu: 7.417 ± 0.104
SerMet: 2.472 ± 1.274
SerAsn: 6.18 ± 0.741
SerPro: 7.417 ± 1.858
SerGln: 0.0 ± 0.0
SerArg: 7.417 ± 0.104
SerSer: 12.361 ± 0.48
SerThr: 6.18 ± 3.184
SerVal: 6.18 ± 1.221
SerTrp: 1.236 ± 0.637
SerTyr: 2.472 ± 1.274
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.18 ± 3.184
ThrCys: 1.236 ± 0.637
ThrAsp: 9.889 ± 5.095
ThrGlu: 0.0 ± 0.0
ThrPhe: 2.472 ± 1.274
ThrGly: 4.944 ± 1.378
ThrHis: 0.0 ± 0.0
ThrIle: 3.708 ± 0.052
ThrLys: 6.18 ± 0.741
ThrLeu: 0.0 ± 0.0
ThrMet: 0.0 ± 0.0
ThrAsn: 3.708 ± 1.91
ThrPro: 4.944 ± 0.585
ThrGln: 1.236 ± 1.326
ThrArg: 3.708 ± 0.052
ThrSer: 9.889 ± 2.756
ThrThr: 4.944 ± 2.547
ThrVal: 6.18 ± 3.184
ThrTrp: 0.0 ± 0.0
ThrTyr: 4.944 ± 2.547
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.18 ± 3.184
ValCys: 1.236 ± 1.326
ValAsp: 2.472 ± 0.689
ValGlu: 4.944 ± 1.378
ValPhe: 2.472 ± 2.652
ValGly: 2.472 ± 1.274
ValHis: 1.236 ± 0.637
ValIle: 6.18 ± 1.221
ValLys: 6.18 ± 0.741
ValLeu: 3.708 ± 2.015
ValMet: 1.236 ± 0.637
ValAsn: 4.944 ± 0.585
ValPro: 1.236 ± 1.326
ValGln: 0.0 ± 0.0
ValArg: 3.708 ± 0.052
ValSer: 4.944 ± 2.547
ValThr: 4.944 ± 2.547
ValVal: 3.708 ± 2.015
ValTrp: 3.708 ± 0.052
ValTyr: 3.708 ± 2.015
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.472 ± 1.274
TrpCys: 1.236 ± 0.637
TrpAsp: 3.708 ± 3.977
TrpGlu: 0.0 ± 0.0
TrpPhe: 1.236 ± 1.326
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 1.236 ± 1.326
TrpLys: 1.236 ± 1.326
TrpLeu: 0.0 ± 0.0
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 2.472 ± 1.274
TrpGln: 1.236 ± 1.326
TrpArg: 1.236 ± 0.637
TrpSer: 0.0 ± 0.0
TrpThr: 2.472 ± 1.274
TrpVal: 1.236 ± 0.637
TrpTrp: 1.236 ± 0.637
TrpTyr: 1.236 ± 1.326
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.472 ± 0.689
TyrCys: 0.0 ± 0.0
TyrAsp: 0.0 ± 0.0
TyrGlu: 0.0 ± 0.0
TyrPhe: 1.236 ± 0.637
TyrGly: 4.944 ± 1.378
TyrHis: 0.0 ± 0.0
TyrIle: 6.18 ± 0.741
TyrLys: 1.236 ± 0.637
TyrLeu: 3.708 ± 0.052
TyrMet: 1.236 ± 0.637
TyrAsn: 3.708 ± 2.015
TyrPro: 6.18 ± 2.704
TyrGln: 1.236 ± 0.637
TyrArg: 2.472 ± 1.274
TyrSer: 1.236 ± 0.637
TyrThr: 3.708 ± 0.052
TyrVal: 4.944 ± 1.378
TyrTrp: 0.0 ± 0.0
TyrTyr: 3.708 ± 2.015
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (810 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level
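One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the 100 estimates is reported. This is an assumption about the procedure, not the database's actual code. In particular, the function name `bootstrap_error`, the use of the population standard deviation as the error, the fixed seed, and the toy sequences are all illustrative choices.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides (one-letter codes) in a single protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Return (mean, spread) of a dipeptide frequency in per mille.

    Each round draws len(proteins) proteins with replacement, so the
    resampling unit is the whole protein, not the individual residue.
    """
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the spread is taken as the standard deviation of the estimates.
    return statistics.mean(estimates), statistics.pstdev(estimates)


# Hypothetical toy proteome of two short proteins.
mean_rs, err_rs = bootstrap_error(["MKRSRSA", "MAAAG"], "RS")
print(round(mean_rs, 3), round(err_rs, 3))
```

With only two proteins, every resample is one of just three compositions (the first protein twice, the second twice, or one of each), which may be why the same few error values recur throughout the table.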

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski