Amino acid dipeptide frequency for Lake Sarah-associated circular virus-17

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
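The calculation described above can be sketched in Python. This is only an illustration under stated assumptions: it uses one-letter amino acid codes instead of the three-letter codes shown in the table, counts overlapping adjacent pairs within each protein, pools the counts over all proteins, and maps every non-standard letter to X (Xaa). The function names `dipeptide_permille` and `classify` are hypothetical and not part of Proteome-pI.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (assumption: the table's
# three-letter codes map to these in the usual way).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Expected frequency if all 400 standard dipeptides were equally likely.
UNIFORM_PERMILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %


def dipeptide_permille(proteins):
    """Return pooled dipeptide frequencies in per mille.

    Overlapping adjacent pairs are counted within each protein; pairs never
    span two proteins. Non-standard letters are replaced by X (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille):
    """Label a frequency relative to the uniform expectation of 2.5 per mille."""
    if freq_permille > UNIFORM_PERMILLE:
        return "over"
    if freq_permille < UNIFORM_PERMILLE:
        return "under"
    return "expected"


if __name__ == "__main__":
    freqs = dipeptide_permille(["AAAC", "AC"])
    for pair, value in sorted(freqs.items()):
        print(pair, round(value, 3), classify(value))
```

With this labelling, a table value such as 4.942 would count as overrepresented and 1.647 as underrepresented; whether the page applies exactly this threshold for its red and blue marking is an assumption.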

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
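As a small worked example of the unit conversion, the following sketch turns a per mille value into a plain fraction and into a percentage. The helper names are hypothetical and chosen for illustration only.

```python
def permille_to_fraction(value):
    """Convert a per mille value to a fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


if __name__ == "__main__":
    # A table value of 13.18 per mille corresponds to a fraction of about 0.01318.
    print(permille_to_fraction(13.18))
    # The uniform expectation of 2.5 per mille equals 0.25 %.
    print(permille_to_percent(2.5))
```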

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.0 ± 0.0
AlaCys: 0.0 ± 0.0
AlaAsp: 4.942 ± 1.776
AlaGlu: 1.647 ± 1.39
AlaPhe: 1.647 ± 1.004
AlaGly: 3.295 ± 0.386
AlaHis: 1.647 ± 1.004
AlaIle: 1.647 ± 1.004
AlaLys: 1.647 ± 1.004
AlaLeu: 1.647 ± 1.004
AlaMet: 0.0 ± 0.0
AlaAsn: 4.942 ± 1.776
AlaPro: 1.647 ± 1.004
AlaGln: 1.647 ± 1.004
AlaArg: 9.885 ± 1.234
AlaSer: 0.0 ± 0.0
AlaThr: 13.18 ± 3.939
AlaVal: 3.295 ± 2.78
AlaTrp: 3.295 ± 0.386
AlaTyr: 0.0 ± 0.0
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 1.647 ± 1.004
CysGlu: 0.0 ± 0.0
CysPhe: 1.647 ± 1.39
CysGly: 0.0 ± 0.0
CysHis: 1.647 ± 1.004
CysIle: 0.0 ± 0.0
CysLys: 0.0 ± 0.0
CysLeu: 3.295 ± 0.386
CysMet: 1.647 ± 1.004
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 1.647 ± 1.39
CysArg: 0.0 ± 0.0
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 3.295 ± 2.007
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.295 ± 2.007
AspCys: 0.0 ± 0.0
AspAsp: 4.942 ± 3.011
AspGlu: 1.647 ± 1.004
AspPhe: 1.647 ± 1.39
AspGly: 1.647 ± 1.39
AspHis: 3.295 ± 0.386
AspIle: 4.942 ± 1.776
AspLys: 3.295 ± 0.386
AspLeu: 3.295 ± 2.007
AspMet: 0.0 ± 0.0
AspAsn: 1.647 ± 1.004
AspPro: 4.942 ± 3.011
AspGln: 3.295 ± 2.007
AspArg: 4.942 ± 1.776
AspSer: 6.59 ± 3.166
AspThr: 3.295 ± 2.007
AspVal: 3.295 ± 2.78
AspTrp: 1.647 ± 1.004
AspTyr: 1.647 ± 1.004
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.942 ± 3.011
GluCys: 0.0 ± 0.0
GluAsp: 3.295 ± 2.007
GluGlu: 9.885 ± 6.021
GluPhe: 3.295 ± 2.007
GluGly: 0.0 ± 0.0
GluHis: 0.0 ± 0.0
GluIle: 4.942 ± 0.617
GluLys: 0.0 ± 0.0
GluLeu: 3.295 ± 0.386
GluMet: 1.647 ± 1.004
GluAsn: 4.942 ± 1.776
GluPro: 1.647 ± 1.004
GluGln: 3.295 ± 2.007
GluArg: 1.647 ± 1.004
GluSer: 0.0 ± 0.0
GluThr: 1.647 ± 1.39
GluVal: 1.647 ± 1.004
GluTrp: 0.0 ± 0.0
GluTyr: 4.942 ± 3.011
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.295 ± 0.386
PheCys: 1.647 ± 1.39
PheAsp: 4.942 ± 0.617
PheGlu: 3.295 ± 2.78
PhePhe: 1.647 ± 1.39
PheGly: 6.59 ± 1.621
PheHis: 0.0 ± 0.0
PheIle: 6.59 ± 1.621
PheLys: 3.295 ± 0.386
PheLeu: 1.647 ± 1.004
PheMet: 0.0 ± 0.0
PheAsn: 0.0 ± 0.0
PhePro: 1.647 ± 1.39
PheGln: 4.942 ± 0.617
PheArg: 0.0 ± 0.0
PheSer: 3.295 ± 2.78
PheThr: 1.647 ± 1.39
PheVal: 0.0 ± 0.0
PheTrp: 1.647 ± 1.004
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.942 ± 1.776
GlyCys: 0.0 ± 0.0
GlyAsp: 4.942 ± 0.617
GlyGlu: 0.0 ± 0.0
GlyPhe: 6.59 ± 0.773
GlyGly: 4.942 ± 1.776
GlyHis: 1.647 ± 1.004
GlyIle: 1.647 ± 1.004
GlyLys: 3.295 ± 2.007
GlyLeu: 0.0 ± 0.0
GlyMet: 0.0 ± 0.0
GlyAsn: 4.942 ± 0.617
GlyPro: 1.647 ± 1.004
GlyGln: 3.295 ± 0.386
GlyArg: 1.647 ± 1.39
GlySer: 8.237 ± 2.163
GlyThr: 6.59 ± 1.621
GlyVal: 3.295 ± 0.386
GlyTrp: 0.0 ± 0.0
GlyTyr: 3.295 ± 2.007
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.647 ± 1.004
HisCys: 0.0 ± 0.0
HisAsp: 3.295 ± 0.386
HisGlu: 1.647 ± 1.004
HisPhe: 0.0 ± 0.0
HisGly: 1.647 ± 1.004
HisHis: 3.295 ± 2.007
HisIle: 6.59 ± 4.014
HisLys: 0.0 ± 0.0
HisLeu: 0.0 ± 0.0
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 1.647 ± 1.004
HisGln: 0.0 ± 0.0
HisArg: 0.0 ± 0.0
HisSer: 1.647 ± 1.39
HisThr: 0.0 ± 0.0
HisVal: 1.647 ± 1.004
HisTrp: 1.647 ± 1.004
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.647 ± 1.39
IleCys: 0.0 ± 0.0
IleAsp: 1.647 ± 1.004
IleGlu: 1.647 ± 1.004
IlePhe: 1.647 ± 1.004
IleGly: 3.295 ± 2.78
IleHis: 1.647 ± 1.004
IleIle: 3.295 ± 2.78
IleLys: 1.647 ± 1.004
IleLeu: 1.647 ± 1.39
IleMet: 1.647 ± 1.39
IleAsn: 6.59 ± 1.621
IlePro: 6.59 ± 0.773
IleGln: 4.942 ± 0.617
IleArg: 4.942 ± 0.617
IleSer: 1.647 ± 1.39
IleThr: 4.942 ± 1.776
IleVal: 3.295 ± 2.007
IleTrp: 0.0 ± 0.0
IleTyr: 0.0 ± 0.0
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.647 ± 1.39
LysCys: 0.0 ± 0.0
LysAsp: 1.647 ± 1.004
LysGlu: 1.647 ± 1.004
LysPhe: 0.0 ± 0.0
LysGly: 1.647 ± 1.004
LysHis: 0.0 ± 0.0
LysIle: 0.0 ± 0.0
LysLys: 1.647 ± 1.004
LysLeu: 4.942 ± 0.617
LysMet: 0.0 ± 0.0
LysAsn: 1.647 ± 1.004
LysPro: 4.942 ± 1.776
LysGln: 1.647 ± 1.39
LysArg: 3.295 ± 2.78
LysSer: 0.0 ± 0.0
LysThr: 3.295 ± 2.007
LysVal: 1.647 ± 1.39
LysTrp: 1.647 ± 1.004
LysTyr: 1.647 ± 1.004
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.942 ± 4.17
LeuCys: 0.0 ± 0.0
LeuAsp: 8.237 ± 5.018
LeuGlu: 0.0 ± 0.0
LeuPhe: 4.942 ± 0.617
LeuGly: 3.295 ± 0.386
LeuHis: 1.647 ± 1.39
LeuIle: 1.647 ± 1.39
LeuLys: 3.295 ± 0.386
LeuLeu: 4.942 ± 1.776
LeuMet: 0.0 ± 0.0
LeuAsn: 1.647 ± 1.39
LeuPro: 4.942 ± 3.011
LeuGln: 3.295 ± 0.386
LeuArg: 4.942 ± 3.011
LeuSer: 4.942 ± 1.776
LeuThr: 6.59 ± 1.621
LeuVal: 3.295 ± 0.386
LeuTrp: 0.0 ± 0.0
LeuTyr: 0.0 ± 0.0
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.647 ± 1.004
MetCys: 0.0 ± 0.0
MetAsp: 0.0 ± 0.0
MetGlu: 3.295 ± 2.007
MetPhe: 0.0 ± 0.0
MetGly: 0.0 ± 0.0
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 3.295 ± 0.386
MetLeu: 4.942 ± 1.776
MetMet: 0.0 ± 0.0
MetAsn: 1.647 ± 1.39
MetPro: 1.647 ± 1.39
MetGln: 0.0 ± 0.0
MetArg: 1.647 ± 1.39
MetSer: 0.0 ± 0.0
MetThr: 1.647 ± 1.39
MetVal: 1.647 ± 1.004
MetTrp: 0.0 ± 0.0
MetTyr: 1.647 ± 1.39
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.647 ± 1.004
AsnCys: 0.0 ± 0.0
AsnAsp: 3.295 ± 2.78
AsnGlu: 0.0 ± 0.0
AsnPhe: 0.0 ± 0.0
AsnGly: 4.942 ± 0.617
AsnHis: 0.0 ± 0.0
AsnIle: 1.647 ± 1.39
AsnLys: 1.647 ± 1.004
AsnLeu: 6.59 ± 5.56
AsnMet: 3.295 ± 2.78
AsnAsn: 3.295 ± 0.386
AsnPro: 8.237 ± 0.231
AsnGln: 0.0 ± 0.0
AsnArg: 4.942 ± 4.17
AsnSer: 8.237 ± 2.163
AsnThr: 3.295 ± 2.007
AsnVal: 8.237 ± 2.163
AsnTrp: 0.0 ± 0.0
AsnTyr: 3.295 ± 2.007
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.0 ± 0.0
ProCys: 1.647 ± 1.004
ProAsp: 0.0 ± 0.0
ProGlu: 4.942 ± 3.011
ProPhe: 8.237 ± 2.163
ProGly: 3.295 ± 0.386
ProHis: 1.647 ± 1.004
ProIle: 3.295 ± 2.007
ProLys: 1.647 ± 1.004
ProLeu: 8.237 ± 0.231
ProMet: 1.647 ± 1.39
ProAsn: 3.295 ± 2.78
ProPro: 9.885 ± 3.628
ProGln: 1.647 ± 1.004
ProArg: 4.942 ± 1.776
ProSer: 8.237 ± 2.163
ProThr: 9.885 ± 6.021
ProVal: 3.295 ± 0.386
ProTrp: 1.647 ± 1.004
ProTyr: 0.0 ± 0.0
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.295 ± 0.386
GlnCys: 0.0 ± 0.0
GlnAsp: 0.0 ± 0.0
GlnGlu: 1.647 ± 1.004
GlnPhe: 3.295 ± 2.007
GlnGly: 8.237 ± 2.624
GlnHis: 3.295 ± 2.007
GlnIle: 4.942 ± 0.617
GlnLys: 0.0 ± 0.0
GlnLeu: 1.647 ± 1.004
GlnMet: 0.0 ± 0.0
GlnAsn: 1.647 ± 1.004
GlnPro: 1.647 ± 1.39
GlnGln: 1.647 ± 1.004
GlnArg: 3.295 ± 0.386
GlnSer: 1.647 ± 1.004
GlnThr: 0.0 ± 0.0
GlnVal: 8.237 ± 2.163
GlnTrp: 0.0 ± 0.0
GlnTyr: 3.295 ± 0.386
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 0.0 ± 0.0
ArgCys: 3.295 ± 2.007
ArgAsp: 6.59 ± 0.773
ArgGlu: 4.942 ± 0.617
ArgPhe: 3.295 ± 2.78
ArgGly: 3.295 ± 2.007
ArgHis: 1.647 ± 1.004
ArgIle: 3.295 ± 2.78
ArgLys: 4.942 ± 4.17
ArgLeu: 6.59 ± 1.621
ArgMet: 1.647 ± 1.39
ArgAsn: 0.0 ± 0.0
ArgPro: 3.295 ± 2.007
ArgGln: 4.942 ± 0.617
ArgArg: 4.942 ± 0.617
ArgSer: 1.647 ± 1.004
ArgThr: 4.942 ± 4.17
ArgVal: 1.647 ± 1.39
ArgTrp: 0.0 ± 0.0
ArgTyr: 6.59 ± 0.773
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 8.237 ± 4.556
SerCys: 3.295 ± 0.386
SerAsp: 3.295 ± 2.007
SerGlu: 3.295 ± 2.007
SerPhe: 1.647 ± 1.39
SerGly: 1.647 ± 1.39
SerHis: 0.0 ± 0.0
SerIle: 3.295 ± 2.78
SerLys: 1.647 ± 1.39
SerLeu: 1.647 ± 1.39
SerMet: 0.0 ± 0.0
SerAsn: 4.942 ± 4.17
SerPro: 4.942 ± 0.617
SerGln: 3.295 ± 0.386
SerArg: 0.0 ± 0.0
SerSer: 6.59 ± 5.56
SerThr: 3.295 ± 2.78
SerVal: 6.59 ± 0.773
SerTrp: 3.295 ± 0.386
SerTyr: 0.0 ± 0.0
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.942 ± 3.011
ThrCys: 3.295 ± 0.386
ThrAsp: 3.295 ± 2.78
ThrGlu: 3.295 ± 0.386
ThrPhe: 3.295 ± 0.386
ThrGly: 9.885 ± 1.234
ThrHis: 1.647 ± 1.004
ThrIle: 0.0 ± 0.0
ThrLys: 0.0 ± 0.0
ThrLeu: 1.647 ± 1.004
ThrMet: 8.237 ± 0.231
ThrAsn: 6.59 ± 1.621
ThrPro: 8.237 ± 4.556
ThrGln: 4.942 ± 0.617
ThrArg: 1.647 ± 1.004
ThrSer: 1.647 ± 1.004
ThrThr: 6.59 ± 3.166
ThrVal: 8.237 ± 0.231
ThrTrp: 1.647 ± 1.004
ThrTyr: 0.0 ± 0.0
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.942 ± 3.011
ValCys: 0.0 ± 0.0
ValAsp: 1.647 ± 1.39
ValGlu: 3.295 ± 2.007
ValPhe: 3.295 ± 0.386
ValGly: 1.647 ± 1.39
ValHis: 0.0 ± 0.0
ValIle: 3.295 ± 2.78
ValLys: 1.647 ± 1.004
ValLeu: 3.295 ± 2.007
ValMet: 0.0 ± 0.0
ValAsn: 13.18 ± 6.333
ValPro: 4.942 ± 0.617
ValGln: 1.647 ± 1.004
ValArg: 8.237 ± 4.556
ValSer: 3.295 ± 2.78
ValThr: 6.59 ± 4.014
ValVal: 4.942 ± 0.617
ValTrp: 0.0 ± 0.0
ValTyr: 3.295 ± 0.386
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.647 ± 1.39
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 3.295 ± 2.007
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 1.647 ± 1.004
TrpIle: 1.647 ± 1.004
TrpLys: 0.0 ± 0.0
TrpLeu: 0.0 ± 0.0
TrpMet: 0.0 ± 0.0
TrpAsn: 1.647 ± 1.39
TrpPro: 1.647 ± 1.004
TrpGln: 0.0 ± 0.0
TrpArg: 1.647 ± 1.004
TrpSer: 0.0 ± 0.0
TrpThr: 0.0 ± 0.0
TrpVal: 1.647 ± 1.004
TrpTrp: 1.647 ± 1.004
TrpTyr: 1.647 ± 1.004
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.295 ± 2.78
TyrCys: 1.647 ± 1.004
TyrAsp: 1.647 ± 1.39
TyrGlu: 1.647 ± 1.004
TyrPhe: 0.0 ± 0.0
TyrGly: 1.647 ± 1.004
TyrHis: 0.0 ± 0.0
TyrIle: 0.0 ± 0.0
TyrLys: 0.0 ± 0.0
TyrLeu: 3.295 ± 2.007
TyrMet: 1.647 ± 0.764
TyrAsn: 0.0 ± 0.0
TyrPro: 3.295 ± 2.007
TyrGln: 1.647 ± 1.004
TyrArg: 4.942 ± 3.011
TyrSer: 3.295 ± 0.386
TyrThr: 1.647 ± 1.004
TyrVal: 1.647 ± 1.39
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.647 ± 1.004
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (608 amino acids)

Note: The error has been estimated by bootstrapping (100 rounds) at the protein level.
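Protein-level bootstrapping can be sketched as follows. The exact procedure used for this table is not described here, so this is an illustration under assumptions: whole proteins are resampled with replacement, the pooled frequency of one dipeptide is recomputed for each resample, and the standard deviation of those estimates is reported as the error. The function names and the fixed seed are hypothetical; sequences use one-letter codes.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Pooled frequency (per mille) of one dipeptide over a list of proteins."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Each round draws len(proteins) proteins with replacement and recomputes
    the frequency; the population standard deviation of the resulting
    estimates is returned (assumed error definition).
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)


if __name__ == "__main__":
    proteome = ["ACAC", "GGGG"]  # toy two-protein proteome
    print(bootstrap_error(proteome, "AC"))
    print(bootstrap_error(proteome, "WW"))  # pair never occurs
```

Under this scheme a dipeptide that never occurs in any protein has a frequency of 0 in every resample, so its estimated error is also 0. With only two proteins, the number of distinct resamples is very small, which is why the errors in the table take only a few distinct values per frequency.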

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski