Amino acid dipepetide frequency for Sewage-associated circular DNA virus-28

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.942AlaAla: 4.942 ± 1.723
0.0AlaCys: 0.0 ± 0.0
4.942AlaAsp: 4.942 ± 0.614
3.295AlaGlu: 3.295 ± 1.967
0.0AlaPhe: 0.0 ± 0.0
3.295AlaGly: 3.295 ± 0.37
1.647AlaHis: 1.647 ± 0.984
3.295AlaIle: 3.295 ± 1.967
8.237AlaLys: 8.237 ± 4.918
6.59AlaLeu: 6.59 ± 1.598
1.647AlaMet: 1.647 ± 0.9
4.942AlaAsn: 4.942 ± 1.723
6.59AlaPro: 6.59 ± 0.739
3.295AlaGln: 3.295 ± 1.967
8.237AlaArg: 8.237 ± 4.429
1.647AlaSer: 1.647 ± 1.353
1.647AlaThr: 1.647 ± 0.984
1.647AlaVal: 1.647 ± 0.984
1.647AlaTrp: 1.647 ± 0.984
3.295AlaTyr: 3.295 ± 0.37
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
1.647CysAsp: 1.647 ± 1.353
0.0CysGlu: 0.0 ± 0.0
3.295CysPhe: 3.295 ± 0.37
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
1.647CysIle: 1.647 ± 1.353
1.647CysLys: 1.647 ± 0.984
0.0CysLeu: 0.0 ± 0.0
1.647CysMet: 1.647 ± 0.984
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
0.0CysSer: 0.0 ± 0.0
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
1.647CysTyr: 1.647 ± 0.984
0.0CysXaa: 0.0 ± 0.0
Asp
6.59AspAla: 6.59 ± 0.739
0.0AspCys: 0.0 ± 0.0
4.942AspAsp: 4.942 ± 0.614
3.295AspGlu: 3.295 ± 1.967
4.942AspPhe: 4.942 ± 2.951
4.942AspGly: 4.942 ± 2.951
0.0AspHis: 0.0 ± 0.0
1.647AspIle: 1.647 ± 0.984
0.0AspLys: 0.0 ± 0.0
8.237AspLeu: 8.237 ± 2.581
1.647AspMet: 1.647 ± 0.984
3.295AspAsn: 3.295 ± 0.37
1.647AspPro: 1.647 ± 0.984
1.647AspGln: 1.647 ± 1.353
1.647AspArg: 1.647 ± 0.984
8.237AspSer: 8.237 ± 2.092
4.942AspThr: 4.942 ± 4.059
0.0AspVal: 0.0 ± 0.0
1.647AspTrp: 1.647 ± 0.984
0.0AspTyr: 0.0 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
3.295GluAla: 3.295 ± 1.967
1.647GluCys: 1.647 ± 0.984
4.942GluAsp: 4.942 ± 2.951
6.59GluGlu: 6.59 ± 3.934
4.942GluPhe: 4.942 ± 0.614
0.0GluGly: 0.0 ± 0.0
0.0GluHis: 0.0 ± 0.0
0.0GluIle: 0.0 ± 0.0
3.295GluLys: 3.295 ± 0.37
0.0GluLeu: 0.0 ± 0.0
1.647GluMet: 1.647 ± 1.046
3.295GluAsn: 3.295 ± 0.37
1.647GluPro: 1.647 ± 0.984
1.647GluGln: 1.647 ± 0.984
1.647GluArg: 1.647 ± 0.984
4.942GluSer: 4.942 ± 0.614
0.0GluThr: 0.0 ± 0.0
3.295GluVal: 3.295 ± 0.37
1.647GluTrp: 1.647 ± 1.353
1.647GluTyr: 1.647 ± 1.353
0.0GluXaa: 0.0 ± 0.0
Phe
1.647PheAla: 1.647 ± 0.984
0.0PheCys: 0.0 ± 0.0
3.295PheAsp: 3.295 ± 0.37
1.647PheGlu: 1.647 ± 0.984
0.0PhePhe: 0.0 ± 0.0
3.295PheGly: 3.295 ± 2.706
0.0PheHis: 0.0 ± 0.0
3.295PheIle: 3.295 ± 0.37
4.942PheLys: 4.942 ± 1.723
6.59PheLeu: 6.59 ± 3.934
0.0PheMet: 0.0 ± 0.0
0.0PheAsn: 0.0 ± 0.0
0.0PhePro: 0.0 ± 0.0
0.0PheGln: 0.0 ± 0.0
1.647PheArg: 1.647 ± 0.984
1.647PheSer: 1.647 ± 0.984
4.942PheThr: 4.942 ± 0.614
1.647PheVal: 1.647 ± 0.984
0.0PheTrp: 0.0 ± 0.0
1.647PheTyr: 1.647 ± 0.984
0.0PheXaa: 0.0 ± 0.0
Gly
3.295GlyAla: 3.295 ± 2.706
1.647GlyCys: 1.647 ± 0.984
1.647GlyAsp: 1.647 ± 1.353
6.59GlyGlu: 6.59 ± 0.739
0.0GlyPhe: 0.0 ± 0.0
1.647GlyGly: 1.647 ± 1.353
0.0GlyHis: 0.0 ± 0.0
1.647GlyIle: 1.647 ± 1.353
6.59GlyLys: 6.59 ± 1.598
3.295GlyLeu: 3.295 ± 0.37
1.647GlyMet: 1.647 ± 1.353
1.647GlyAsn: 1.647 ± 0.984
0.0GlyPro: 0.0 ± 0.0
3.295GlyGln: 3.295 ± 1.967
6.59GlyArg: 6.59 ± 3.076
3.295GlySer: 3.295 ± 0.37
1.647GlyThr: 1.647 ± 1.353
6.59GlyVal: 6.59 ± 0.739
1.647GlyTrp: 1.647 ± 0.984
3.295GlyTyr: 3.295 ± 0.37
0.0GlyXaa: 0.0 ± 0.0
His
1.647HisAla: 1.647 ± 0.984
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
0.0HisGlu: 0.0 ± 0.0
0.0HisPhe: 0.0 ± 0.0
3.295HisGly: 3.295 ± 1.967
0.0HisHis: 0.0 ± 0.0
0.0HisIle: 0.0 ± 0.0
3.295HisLys: 3.295 ± 1.967
0.0HisLeu: 0.0 ± 0.0
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
0.0HisPro: 0.0 ± 0.0
1.647HisGln: 1.647 ± 0.984
0.0HisArg: 0.0 ± 0.0
1.647HisSer: 1.647 ± 1.353
0.0HisThr: 0.0 ± 0.0
1.647HisVal: 1.647 ± 0.984
0.0HisTrp: 0.0 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
1.647IleAla: 1.647 ± 0.984
0.0IleCys: 0.0 ± 0.0
3.295IleAsp: 3.295 ± 1.967
4.942IleGlu: 4.942 ± 0.614
1.647IlePhe: 1.647 ± 0.984
3.295IleGly: 3.295 ± 0.37
1.647IleHis: 1.647 ± 1.353
3.295IleIle: 3.295 ± 0.37
4.942IleLys: 4.942 ± 4.059
1.647IleLeu: 1.647 ± 1.353
1.647IleMet: 1.647 ± 0.984
1.647IleAsn: 1.647 ± 1.353
4.942IlePro: 4.942 ± 4.059
3.295IleGln: 3.295 ± 0.37
1.647IleArg: 1.647 ± 1.353
0.0IleSer: 0.0 ± 0.0
4.942IleThr: 4.942 ± 0.614
1.647IleVal: 1.647 ± 0.984
1.647IleTrp: 1.647 ± 1.353
0.0IleTyr: 0.0 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
4.942LysAla: 4.942 ± 2.951
1.647LysCys: 1.647 ± 1.353
1.647LysAsp: 1.647 ± 0.984
3.295LysGlu: 3.295 ± 0.37
0.0LysPhe: 0.0 ± 0.0
3.295LysGly: 3.295 ± 0.37
1.647LysHis: 1.647 ± 0.984
3.295LysIle: 3.295 ± 0.37
6.59LysLys: 6.59 ± 1.598
8.237LysLeu: 8.237 ± 4.918
1.647LysMet: 1.647 ± 1.353
1.647LysAsn: 1.647 ± 0.984
4.942LysPro: 4.942 ± 0.614
0.0LysGln: 0.0 ± 0.0
4.942LysArg: 4.942 ± 1.723
4.942LysSer: 4.942 ± 2.951
3.295LysThr: 3.295 ± 1.967
6.59LysVal: 6.59 ± 3.076
0.0LysTrp: 0.0 ± 0.0
1.647LysTyr: 1.647 ± 1.353
0.0LysXaa: 0.0 ± 0.0
Leu
3.295LeuAla: 3.295 ± 1.967
0.0LeuCys: 0.0 ± 0.0
6.59LeuAsp: 6.59 ± 0.739
1.647LeuGlu: 1.647 ± 0.984
1.647LeuPhe: 1.647 ± 0.984
1.647LeuGly: 1.647 ± 1.353
0.0LeuHis: 0.0 ± 0.0
6.59LeuIle: 6.59 ± 0.739
3.295LeuLys: 3.295 ± 0.37
3.295LeuLeu: 3.295 ± 0.37
1.647LeuMet: 1.647 ± 0.984
4.942LeuAsn: 4.942 ± 1.723
6.59LeuPro: 6.59 ± 3.934
4.942LeuGln: 4.942 ± 0.614
9.885LeuArg: 9.885 ± 3.565
4.942LeuSer: 4.942 ± 2.951
13.18LeuThr: 13.18 ± 3.815
9.885LeuVal: 9.885 ± 3.565
4.942LeuTrp: 4.942 ± 2.951
1.647LeuTyr: 1.647 ± 0.984
0.0LeuXaa: 0.0 ± 0.0
Met
1.647MetAla: 1.647 ± 0.984
0.0MetCys: 0.0 ± 0.0
0.0MetAsp: 0.0 ± 0.0
3.295MetGlu: 3.295 ± 1.967
0.0MetPhe: 0.0 ± 0.0
0.0MetGly: 0.0 ± 0.0
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
1.647MetLys: 1.647 ± 1.353
1.647MetLeu: 1.647 ± 0.984
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
3.295MetPro: 3.295 ± 0.37
3.295MetGln: 3.295 ± 2.706
0.0MetArg: 0.0 ± 0.0
1.647MetSer: 1.647 ± 1.353
1.647MetThr: 1.647 ± 1.353
0.0MetVal: 0.0 ± 0.0
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
6.59AsnAla: 6.59 ± 1.598
0.0AsnCys: 0.0 ± 0.0
3.295AsnAsp: 3.295 ± 0.37
0.0AsnGlu: 0.0 ± 0.0
1.647AsnPhe: 1.647 ± 1.353
1.647AsnGly: 1.647 ± 1.353
0.0AsnHis: 0.0 ± 0.0
3.295AsnIle: 3.295 ± 2.706
0.0AsnLys: 0.0 ± 0.0
3.295AsnLeu: 3.295 ± 0.37
0.0AsnMet: 0.0 ± 0.0
1.647AsnAsn: 1.647 ± 1.353
3.295AsnPro: 3.295 ± 0.37
3.295AsnGln: 3.295 ± 2.706
0.0AsnArg: 0.0 ± 0.0
4.942AsnSer: 4.942 ± 4.059
1.647AsnThr: 1.647 ± 1.353
1.647AsnVal: 1.647 ± 0.984
0.0AsnTrp: 0.0 ± 0.0
3.295AsnTyr: 3.295 ± 1.967
0.0AsnXaa: 0.0 ± 0.0
Pro
3.295ProAla: 3.295 ± 0.37
0.0ProCys: 0.0 ± 0.0
3.295ProAsp: 3.295 ± 0.37
1.647ProGlu: 1.647 ± 1.353
1.647ProPhe: 1.647 ± 0.984
3.295ProGly: 3.295 ± 1.967
1.647ProHis: 1.647 ± 0.984
3.295ProIle: 3.295 ± 0.37
1.647ProLys: 1.647 ± 0.984
9.885ProLeu: 9.885 ± 1.228
0.0ProMet: 0.0 ± 0.0
1.647ProAsn: 1.647 ± 0.984
9.885ProPro: 9.885 ± 5.901
3.295ProGln: 3.295 ± 0.37
4.942ProArg: 4.942 ± 2.951
1.647ProSer: 1.647 ± 0.984
4.942ProThr: 4.942 ± 4.059
3.295ProVal: 3.295 ± 0.37
1.647ProTrp: 1.647 ± 0.984
3.295ProTyr: 3.295 ± 0.37
0.0ProXaa: 0.0 ± 0.0
Gln
6.59GlnAla: 6.59 ± 1.598
0.0GlnCys: 0.0 ± 0.0
1.647GlnAsp: 1.647 ± 0.984
0.0GlnGlu: 0.0 ± 0.0
3.295GlnPhe: 3.295 ± 1.967
3.295GlnGly: 3.295 ± 2.706
0.0GlnHis: 0.0 ± 0.0
0.0GlnIle: 0.0 ± 0.0
1.647GlnLys: 1.647 ± 0.984
1.647GlnLeu: 1.647 ± 0.984
0.0GlnMet: 0.0 ± 0.0
1.647GlnAsn: 1.647 ± 0.984
3.295GlnPro: 3.295 ± 1.967
1.647GlnGln: 1.647 ± 0.984
8.237GlnArg: 8.237 ± 4.429
3.295GlnSer: 3.295 ± 1.967
0.0GlnThr: 0.0 ± 0.0
3.295GlnVal: 3.295 ± 0.37
0.0GlnTrp: 0.0 ± 0.0
4.942GlnTyr: 4.942 ± 1.723
0.0GlnXaa: 0.0 ± 0.0
Arg
1.647ArgAla: 1.647 ± 0.984
0.0ArgCys: 0.0 ± 0.0
4.942ArgAsp: 4.942 ± 2.951
1.647ArgGlu: 1.647 ± 0.984
6.59ArgPhe: 6.59 ± 0.739
3.295ArgGly: 3.295 ± 2.706
0.0ArgHis: 0.0 ± 0.0
3.295ArgIle: 3.295 ± 0.37
6.59ArgLys: 6.59 ± 3.076
6.59ArgLeu: 6.59 ± 1.598
1.647ArgMet: 1.647 ± 1.353
3.295ArgAsn: 3.295 ± 2.706
1.647ArgPro: 1.647 ± 0.984
6.59ArgGln: 6.59 ± 1.598
0.0ArgArg: 0.0 ± 0.0
4.942ArgSer: 4.942 ± 4.059
3.295ArgThr: 3.295 ± 0.37
6.59ArgVal: 6.59 ± 0.739
0.0ArgTrp: 0.0 ± 0.0
3.295ArgTyr: 3.295 ± 0.37
0.0ArgXaa: 0.0 ± 0.0
Ser
6.59SerAla: 6.59 ± 3.076
1.647SerCys: 1.647 ± 1.353
0.0SerAsp: 0.0 ± 0.0
0.0SerGlu: 0.0 ± 0.0
0.0SerPhe: 0.0 ± 0.0
4.942SerGly: 4.942 ± 2.951
1.647SerHis: 1.647 ± 0.984
6.59SerIle: 6.59 ± 3.076
1.647SerLys: 1.647 ± 0.984
8.237SerLeu: 8.237 ± 0.244
1.647SerMet: 1.647 ± 1.353
4.942SerAsn: 4.942 ± 1.723
3.295SerPro: 3.295 ± 0.37
1.647SerGln: 1.647 ± 0.984
3.295SerArg: 3.295 ± 0.37
1.647SerSer: 1.647 ± 0.984
1.647SerThr: 1.647 ± 1.353
8.237SerVal: 8.237 ± 0.244
3.295SerTrp: 3.295 ± 0.37
4.942SerTyr: 4.942 ± 1.723
0.0SerXaa: 0.0 ± 0.0
Thr
6.59ThrAla: 6.59 ± 5.412
1.647ThrCys: 1.647 ± 1.353
3.295ThrAsp: 3.295 ± 0.37
4.942ThrGlu: 4.942 ± 0.614
3.295ThrPhe: 3.295 ± 0.37
6.59ThrGly: 6.59 ± 3.076
1.647ThrHis: 1.647 ± 0.984
3.295ThrIle: 3.295 ± 0.37
1.647ThrLys: 1.647 ± 0.984
1.647ThrLeu: 1.647 ± 1.353
0.0ThrMet: 0.0 ± 0.0
1.647ThrAsn: 1.647 ± 1.353
3.295ThrPro: 3.295 ± 1.967
3.295ThrGln: 3.295 ± 0.37
1.647ThrArg: 1.647 ± 0.984
4.942ThrSer: 4.942 ± 4.059
9.885ThrThr: 9.885 ± 8.118
4.942ThrVal: 4.942 ± 1.723
3.295ThrTrp: 3.295 ± 0.37
1.647ThrTyr: 1.647 ± 0.984
0.0ThrXaa: 0.0 ± 0.0
Val
1.647ValAla: 1.647 ± 0.984
1.647ValCys: 1.647 ± 0.984
4.942ValAsp: 4.942 ± 0.614
1.647ValGlu: 1.647 ± 1.353
1.647ValPhe: 1.647 ± 0.984
6.59ValGly: 6.59 ± 0.739
3.295ValHis: 3.295 ± 1.967
1.647ValIle: 1.647 ± 1.353
4.942ValLys: 4.942 ± 2.951
11.532ValLeu: 11.532 ± 2.211
0.0ValMet: 0.0 ± 0.0
0.0ValAsn: 0.0 ± 0.0
6.59ValPro: 6.59 ± 0.739
0.0ValGln: 0.0 ± 0.0
6.59ValArg: 6.59 ± 3.076
6.59ValSer: 6.59 ± 0.739
4.942ValThr: 4.942 ± 1.723
9.885ValVal: 9.885 ± 3.445
0.0ValTrp: 0.0 ± 0.0
1.647ValTyr: 1.647 ± 0.984
0.0ValXaa: 0.0 ± 0.0
Trp
4.942TrpAla: 4.942 ± 2.951
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
3.295TrpGlu: 3.295 ± 0.37
1.647TrpPhe: 1.647 ± 1.353
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
1.647TrpLeu: 1.647 ± 0.984
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
1.647TrpPro: 1.647 ± 0.984
0.0TrpGln: 0.0 ± 0.0
3.295TrpArg: 3.295 ± 0.37
3.295TrpSer: 3.295 ± 0.37
1.647TrpThr: 1.647 ± 0.984
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.0TyrAla: 0.0 ± 0.0
1.647TyrCys: 1.647 ± 0.984
4.942TyrAsp: 4.942 ± 0.614
0.0TyrGlu: 0.0 ± 0.0
0.0TyrPhe: 0.0 ± 0.0
1.647TyrGly: 1.647 ± 1.353
0.0TyrHis: 0.0 ± 0.0
1.647TyrIle: 1.647 ± 0.984
1.647TyrLys: 1.647 ± 0.984
6.59TyrLeu: 6.59 ± 3.076
0.0TyrMet: 0.0 ± 0.0
3.295TyrAsn: 3.295 ± 2.706
1.647TyrPro: 1.647 ± 1.353
1.647TyrGln: 1.647 ± 0.984
1.647TyrArg: 1.647 ± 0.984
1.647TyrSer: 1.647 ± 0.984
4.942TyrThr: 4.942 ± 0.614
4.942TyrVal: 4.942 ± 0.614
0.0TyrTrp: 0.0 ± 0.0
1.647TyrTyr: 1.647 ± 1.353
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (608 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski