Amino acid dipeptide frequency for Sewage-associated circular DNA virus-27

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
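As an illustration of the idea, here is a minimal Python sketch that counts overlapping pairs of adjacent residues and reports them in per mille. The function name `dipeptide_per_mille` and the toy sequences are hypothetical, one-letter codes are used for brevity (the table uses three-letter codes), and the choice not to count pairs across protein boundaries is an assumption; the page does not state how the database handles this.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences."""
    counts = Counter()
    for seq in sequences:
        # Slide a window of length 2 over each sequence; windows overlap,
        # and no pair is formed across two different proteins (assumption).
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not the proteome shown on this page.
freqs = dipeptide_per_mille(["MKVAA", "AAG"])
```

Here the toy input yields six dipeptides in total, two of which are `AA`, so `freqs["AA"]` is one third of 1000.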

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
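The uniform expectation can be checked with a few lines of Python. This is only a sketch: the function names are hypothetical, values are expressed in per mille (‰) to match the table, and the simple comparison against the uniform value is an assumption, since the page does not state the exact rule used for the red and blue marking.

```python
def uniform_expectation_per_mille(alphabet_size=20):
    """Expected frequency of each dipeptide if all pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, expected_per_mille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected_per_mille:
        return "over"
    if observed_per_mille < expected_per_mille:
        return "under"
    return "as expected"


expected_20 = uniform_expectation_per_mille(20)  # 400 dipeptides
expected_21 = uniform_expectation_per_mille(21)  # 441 dipeptides, with Xaa
label_asp_val = classify(11.268, expected_20)    # AspVal value from the table
label_ala_phe = classify(1.408, expected_20)     # AlaPhe value from the table
```

With 20 amino acids the expectation is 2.5‰ (0.25%); with Xaa included it drops to about 2.27‰.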

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
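For example, the conversion from per mille to a plain fraction or a percentage can be written as follows. The helper names are hypothetical; the value 11.268 is the AspVal entry from the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value_per_mille / 10.0


fraction = per_mille_to_fraction(11.268)  # about 0.011268
percent = per_mille_to_percent(11.268)    # about 1.1268 %
```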

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.634 ± 3.391
AlaCys: 0.0 ± 0.0
AlaAsp: 5.634 ± 1.143
AlaGlu: 4.225 ± 0.276
AlaPhe: 1.408 ± 0.848
AlaGly: 7.042 ± 1.972
AlaHis: 0.0 ± 0.0
AlaIle: 5.634 ± 1.124
AlaLys: 1.408 ± 0.848
AlaLeu: 4.225 ± 1.991
AlaMet: 1.408 ± 0.848
AlaAsn: 0.0 ± 0.0
AlaPro: 2.817 ± 1.695
AlaGln: 1.408 ± 1.419
AlaArg: 0.0 ± 0.0
AlaSer: 9.859 ± 3.667
AlaThr: 7.042 ± 0.295
AlaVal: 2.817 ± 1.695
AlaTrp: 1.408 ± 1.419
AlaTyr: 2.817 ± 0.572
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.817 ± 2.838
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 1.408 ± 1.419
CysHis: 0.0 ± 0.0
CysIle: 1.408 ± 0.848
CysLys: 2.817 ± 0.572
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 4.225 ± 1.991
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 1.408 ± 0.848
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.817 ± 0.572
AspCys: 0.0 ± 0.0
AspAsp: 0.0 ± 0.0
AspGlu: 4.225 ± 1.991
AspPhe: 1.408 ± 0.848
AspGly: 1.408 ± 0.848
AspHis: 0.0 ± 0.0
AspIle: 2.817 ± 0.572
AspLys: 0.0 ± 0.0
AspLeu: 4.225 ± 1.991
AspMet: 1.408 ± 0.643
AspAsn: 2.817 ± 1.695
AspPro: 1.408 ± 1.419
AspGln: 1.408 ± 1.419
AspArg: 4.225 ± 0.276
AspSer: 2.817 ± 0.572
AspThr: 0.0 ± 0.0
AspVal: 11.268 ± 0.019
AspTrp: 4.225 ± 0.276
AspTyr: 2.817 ± 0.572
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.225 ± 4.258
GluCys: 2.817 ± 0.572
GluAsp: 1.408 ± 0.848
GluGlu: 2.817 ± 0.572
GluPhe: 5.634 ± 3.391
GluGly: 1.408 ± 1.419
GluHis: 0.0 ± 0.0
GluIle: 1.408 ± 1.419
GluLys: 1.408 ± 0.848
GluLeu: 5.634 ± 1.143
GluMet: 0.0 ± 0.0
GluAsn: 2.817 ± 1.695
GluPro: 1.408 ± 0.848
GluGln: 1.408 ± 1.419
GluArg: 2.817 ± 2.838
GluSer: 4.225 ± 4.258
GluThr: 1.408 ± 1.419
GluVal: 2.817 ± 1.695
GluTrp: 1.408 ± 1.419
GluTyr: 1.408 ± 1.419
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.408 ± 0.848
PheCys: 0.0 ± 0.0
PheAsp: 4.225 ± 0.276
PheGlu: 2.817 ± 0.572
PhePhe: 4.225 ± 2.543
PheGly: 2.817 ± 0.572
PheHis: 1.408 ± 1.419
PheIle: 2.817 ± 1.695
PheLys: 5.634 ± 1.124
PheLeu: 0.0 ± 0.0
PheMet: 0.0 ± 0.0
PheAsn: 2.817 ± 1.695
PhePro: 2.817 ± 1.695
PheGln: 0.0 ± 0.0
PheArg: 1.408 ± 1.419
PheSer: 0.0 ± 0.0
PheThr: 4.225 ± 2.543
PheVal: 4.225 ± 2.543
PheTrp: 2.817 ± 2.838
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.408 ± 0.848
GlyCys: 0.0 ± 0.0
GlyAsp: 2.817 ± 2.838
GlyGlu: 2.817 ± 2.838
GlyPhe: 4.225 ± 2.543
GlyGly: 5.634 ± 3.391
GlyHis: 1.408 ± 0.848
GlyIle: 4.225 ± 1.991
GlyLys: 5.634 ± 1.124
GlyLeu: 4.225 ± 0.276
GlyMet: 2.817 ± 0.572
GlyAsn: 4.225 ± 2.543
GlyPro: 4.225 ± 0.276
GlyGln: 4.225 ± 1.991
GlyArg: 4.225 ± 0.276
GlySer: 5.634 ± 1.143
GlyThr: 7.042 ± 1.972
GlyVal: 5.634 ± 1.124
GlyTrp: 0.0 ± 0.0
GlyTyr: 5.634 ± 1.143
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.0 ± 0.0
HisAsp: 1.408 ± 0.848
HisGlu: 2.817 ± 0.572
HisPhe: 0.0 ± 0.0
HisGly: 1.408 ± 1.419
HisHis: 0.0 ± 0.0
HisIle: 0.0 ± 0.0
HisLys: 1.408 ± 0.848
HisLeu: 4.225 ± 0.276
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 0.0 ± 0.0
HisGln: 0.0 ± 0.0
HisArg: 0.0 ± 0.0
HisSer: 1.408 ± 1.419
HisThr: 0.0 ± 0.0
HisVal: 1.408 ± 1.419
HisTrp: 1.408 ± 1.419
HisTyr: 1.408 ± 0.848
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.408 ± 0.848
IleCys: 0.0 ± 0.0
IleAsp: 2.817 ± 0.572
IleGlu: 2.817 ± 0.572
IlePhe: 0.0 ± 0.0
IleGly: 1.408 ± 1.419
IleHis: 2.817 ± 0.572
IleIle: 1.408 ± 0.848
IleLys: 9.859 ± 1.4
IleLeu: 2.817 ± 1.695
IleMet: 0.0 ± 0.0
IleAsn: 1.408 ± 0.848
IlePro: 4.225 ± 1.991
IleGln: 0.0 ± 0.0
IleArg: 5.634 ± 1.124
IleSer: 4.225 ± 0.276
IleThr: 4.225 ± 1.991
IleVal: 5.634 ± 1.143
IleTrp: 1.408 ± 1.419
IleTyr: 2.817 ± 0.572
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.634 ± 3.391
LysCys: 1.408 ± 1.419
LysAsp: 4.225 ± 1.991
LysGlu: 2.817 ± 0.572
LysPhe: 2.817 ± 0.572
LysGly: 7.042 ± 0.295
LysHis: 1.408 ± 0.848
LysIle: 4.225 ± 0.276
LysLys: 5.634 ± 3.391
LysLeu: 2.817 ± 1.695
LysMet: 1.408 ± 0.848
LysAsn: 1.408 ± 0.848
LysPro: 2.817 ± 0.572
LysGln: 2.817 ± 1.695
LysArg: 8.451 ± 2.819
LysSer: 2.817 ± 0.572
LysThr: 0.0 ± 0.0
LysVal: 4.225 ± 2.543
LysTrp: 1.408 ± 0.848
LysTyr: 1.408 ± 0.848
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.225 ± 2.543
LeuCys: 1.408 ± 1.419
LeuAsp: 7.042 ± 2.562
LeuGlu: 5.634 ± 1.143
LeuPhe: 4.225 ± 0.276
LeuGly: 4.225 ± 0.276
LeuHis: 1.408 ± 1.419
LeuIle: 0.0 ± 0.0
LeuLys: 5.634 ± 1.124
LeuLeu: 2.817 ± 2.838
LeuMet: 1.408 ± 1.419
LeuAsn: 1.408 ± 0.848
LeuPro: 7.042 ± 0.295
LeuGln: 1.408 ± 0.848
LeuArg: 7.042 ± 4.829
LeuSer: 5.634 ± 1.124
LeuThr: 2.817 ± 2.838
LeuVal: 1.408 ± 0.848
LeuTrp: 0.0 ± 0.0
LeuTyr: 1.408 ± 0.848
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.408 ± 0.848
MetCys: 0.0 ± 0.0
MetAsp: 0.0 ± 0.0
MetGlu: 2.817 ± 2.838
MetPhe: 0.0 ± 0.0
MetGly: 0.0 ± 0.0
MetHis: 0.0 ± 0.0
MetIle: 1.408 ± 0.848
MetLys: 0.0 ± 0.0
MetLeu: 1.408 ± 0.848
MetMet: 0.0 ± 0.0
MetAsn: 1.408 ± 0.848
MetPro: 0.0 ± 0.0
MetGln: 1.408 ± 0.848
MetArg: 1.408 ± 0.848
MetSer: 1.408 ± 1.419
MetThr: 1.408 ± 0.848
MetVal: 1.408 ± 1.419
MetTrp: 0.0 ± 0.0
MetTyr: 1.408 ± 0.848
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 0.0 ± 0.0
AsnCys: 2.817 ± 0.572
AsnAsp: 1.408 ± 0.848
AsnGlu: 1.408 ± 0.848
AsnPhe: 0.0 ± 0.0
AsnGly: 1.408 ± 0.848
AsnHis: 0.0 ± 0.0
AsnIle: 1.408 ± 1.419
AsnLys: 4.225 ± 2.543
AsnLeu: 1.408 ± 1.419
AsnMet: 0.0 ± 0.0
AsnAsn: 2.817 ± 1.695
AsnPro: 7.042 ± 4.239
AsnGln: 2.817 ± 1.695
AsnArg: 1.408 ± 1.419
AsnSer: 7.042 ± 1.972
AsnThr: 5.634 ± 3.391
AsnVal: 4.225 ± 0.276
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.0 ± 0.0
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 11.268 ± 4.515
ProCys: 0.0 ± 0.0
ProAsp: 1.408 ± 1.419
ProGlu: 0.0 ± 0.0
ProPhe: 5.634 ± 1.143
ProGly: 2.817 ± 0.572
ProHis: 0.0 ± 0.0
ProIle: 1.408 ± 0.848
ProLys: 1.408 ± 0.848
ProLeu: 5.634 ± 1.143
ProMet: 1.408 ± 1.419
ProAsn: 5.634 ± 1.143
ProPro: 1.408 ± 1.419
ProGln: 2.817 ± 1.695
ProArg: 0.0 ± 0.0
ProSer: 1.408 ± 0.848
ProThr: 2.817 ± 0.572
ProVal: 5.634 ± 1.124
ProTrp: 0.0 ± 0.0
ProTyr: 2.817 ± 0.572
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.0 ± 0.0
GlnCys: 1.408 ± 0.848
GlnAsp: 1.408 ± 0.848
GlnGlu: 0.0 ± 0.0
GlnPhe: 2.817 ± 0.572
GlnGly: 4.225 ± 0.276
GlnHis: 0.0 ± 0.0
GlnIle: 4.225 ± 1.991
GlnLys: 1.408 ± 0.848
GlnLeu: 4.225 ± 0.276
GlnMet: 0.0 ± 0.0
GlnAsn: 0.0 ± 0.0
GlnPro: 1.408 ± 1.419
GlnGln: 4.225 ± 2.543
GlnArg: 2.817 ± 0.572
GlnSer: 4.225 ± 2.543
GlnThr: 1.408 ± 0.848
GlnVal: 0.0 ± 0.0
GlnTrp: 1.408 ± 1.419
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.634 ± 3.41
ArgCys: 0.0 ± 0.0
ArgAsp: 2.817 ± 0.572
ArgGlu: 0.0 ± 0.0
ArgPhe: 0.0 ± 0.0
ArgGly: 11.268 ± 2.286
ArgHis: 1.408 ± 0.848
ArgIle: 4.225 ± 1.991
ArgLys: 4.225 ± 2.543
ArgLeu: 4.225 ± 0.276
ArgMet: 1.408 ± 0.494
ArgAsn: 2.817 ± 0.572
ArgPro: 1.408 ± 1.419
ArgGln: 2.817 ± 0.572
ArgArg: 11.268 ± 2.286
ArgSer: 4.225 ± 2.543
ArgThr: 4.225 ± 1.991
ArgVal: 2.817 ± 0.572
ArgTrp: 1.408 ± 1.419
ArgTyr: 1.408 ± 1.419
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.042 ± 1.972
SerCys: 0.0 ± 0.0
SerAsp: 1.408 ± 0.848
SerGlu: 1.408 ± 1.419
SerPhe: 4.225 ± 0.276
SerGly: 11.268 ± 0.019
SerHis: 1.408 ± 1.419
SerIle: 5.634 ± 1.124
SerLys: 5.634 ± 3.41
SerLeu: 1.408 ± 1.419
SerMet: 2.817 ± 1.695
SerAsn: 4.225 ± 2.543
SerPro: 1.408 ± 0.848
SerGln: 1.408 ± 0.848
SerArg: 5.634 ± 1.143
SerSer: 2.817 ± 0.572
SerThr: 7.042 ± 1.972
SerVal: 5.634 ± 1.143
SerTrp: 2.817 ± 0.572
SerTyr: 4.225 ± 0.276
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.817 ± 0.572
ThrCys: 1.408 ± 0.848
ThrAsp: 2.817 ± 1.695
ThrGlu: 0.0 ± 0.0
ThrPhe: 1.408 ± 0.848
ThrGly: 7.042 ± 1.972
ThrHis: 1.408 ± 1.419
ThrIle: 5.634 ± 1.124
ThrLys: 2.817 ± 0.572
ThrLeu: 5.634 ± 1.143
ThrMet: 0.0 ± 0.0
ThrAsn: 8.451 ± 0.552
ThrPro: 5.634 ± 1.124
ThrGln: 1.408 ± 1.419
ThrArg: 4.225 ± 0.276
ThrSer: 0.0 ± 0.0
ThrThr: 8.451 ± 1.715
ThrVal: 0.0 ± 0.0
ThrTrp: 1.408 ± 0.848
ThrTyr: 5.634 ± 1.124
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.225 ± 0.276
ValCys: 0.0 ± 0.0
ValAsp: 4.225 ± 0.276
ValGlu: 2.817 ± 0.572
ValPhe: 2.817 ± 1.695
ValGly: 2.817 ± 1.695
ValHis: 4.225 ± 0.276
ValIle: 1.408 ± 0.848
ValLys: 2.817 ± 1.695
ValLeu: 5.634 ± 1.124
ValMet: 1.408 ± 0.848
ValAsn: 1.408 ± 1.419
ValPro: 4.225 ± 0.276
ValGln: 1.408 ± 0.848
ValArg: 2.817 ± 2.838
ValSer: 8.451 ± 0.552
ValThr: 5.634 ± 3.391
ValVal: 7.042 ± 1.972
ValTrp: 1.408 ± 1.419
ValTyr: 5.634 ± 1.143
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.408 ± 0.848
TrpCys: 1.408 ± 1.419
TrpAsp: 1.408 ± 1.419
TrpGlu: 2.817 ± 2.838
TrpPhe: 1.408 ± 1.419
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 4.225 ± 1.991
TrpLys: 2.817 ± 0.572
TrpLeu: 1.408 ± 1.419
TrpMet: 0.0 ± 0.0
TrpAsn: 1.408 ± 0.848
TrpPro: 0.0 ± 0.0
TrpGln: 2.817 ± 0.572
TrpArg: 0.0 ± 0.0
TrpSer: 1.408 ± 0.848
TrpThr: 1.408 ± 1.419
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.408 ± 1.419
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.408 ± 0.848
TyrCys: 1.408 ± 1.419
TyrAsp: 2.817 ± 1.695
TyrGlu: 4.225 ± 2.543
TyrPhe: 1.408 ± 1.419
TyrGly: 1.408 ± 0.848
TyrHis: 0.0 ± 0.0
TyrIle: 1.408 ± 1.419
TyrLys: 0.0 ± 0.0
TyrLeu: 4.225 ± 1.991
TyrMet: 0.0 ± 0.0
TyrAsn: 0.0 ± 0.0
TyrPro: 4.225 ± 1.991
TyrGln: 1.408 ± 0.848
TyrArg: 5.634 ± 1.143
TyrSer: 5.634 ± 1.143
TyrThr: 1.408 ± 0.848
TyrVal: 4.225 ± 0.276
TyrTrp: 1.408 ± 1.419
TyrTyr: 2.817 ± 1.695
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (711 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
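One way to read "bootstrapping at the protein level" is sketched below in Python: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. This is an assumption about the procedure, not the database's actual code; the function name `bootstrap_error`, the fixed seed, and the use of the population standard deviation are all illustrative choices.

```python
import random
from collections import Counter
from statistics import pstdev


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Estimate the error (in per mille) of one dipeptide frequency.

    Whole proteins are resampled with replacement n_boot times; the
    standard deviation of the resulting frequencies is returned.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = [rng.choice(proteins) for _ in proteins]
        counts = Counter()
        for seq in sample:
            counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
        total = sum(counts.values())
        estimates.append(1000.0 * counts[pair] / total if total else 0.0)
    return pstdev(estimates)


# Toy proteins, not the two proteins behind the table on this page.
err_varied = bootstrap_error(["AAAA", "GGGG"], "AA")
err_identical = bootstrap_error(["AAGA", "AAGA"], "AA")
```

Under this reading, a dipeptide that never occurs has the same frequency (zero) in every resample, which is consistent with the "0.0 ± 0.0" entries in the table.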

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski