Amino acid dipepetide frequency for Pacific flying fox faeces associated circular DNA virus-7

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.578AlaAla: 3.578 ± 0.177
3.578AlaCys: 3.578 ± 0.177
3.578AlaAsp: 3.578 ± 0.177
1.789AlaGlu: 1.789 ± 1.214
0.0AlaPhe: 0.0 ± 0.0
7.156AlaGly: 7.156 ± 0.354
5.367AlaHis: 5.367 ± 1.037
7.156AlaIle: 7.156 ± 2.25
1.789AlaLys: 1.789 ± 1.214
0.0AlaLeu: 0.0 ± 0.0
5.367AlaMet: 5.367 ± 1.568
1.789AlaAsn: 1.789 ± 1.391
1.789AlaPro: 1.789 ± 1.214
5.367AlaGln: 5.367 ± 1.568
0.0AlaArg: 0.0 ± 0.0
3.578AlaSer: 3.578 ± 0.177
5.367AlaThr: 5.367 ± 4.172
3.578AlaVal: 3.578 ± 0.177
0.0AlaTrp: 0.0 ± 0.0
3.578AlaTyr: 3.578 ± 2.427
0.0AlaXaa: 0.0 ± 0.0
Cys
1.789CysAla: 1.789 ± 1.214
0.0CysCys: 0.0 ± 0.0
1.789CysAsp: 1.789 ± 1.214
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
1.789CysGly: 1.789 ± 1.391
1.789CysHis: 1.789 ± 1.214
1.789CysIle: 1.789 ± 1.214
0.0CysLys: 0.0 ± 0.0
1.789CysLeu: 1.789 ± 1.214
0.0CysMet: 0.0 ± 0.0
1.789CysAsn: 1.789 ± 1.214
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
1.789CysSer: 1.789 ± 1.214
1.789CysThr: 1.789 ± 1.391
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
1.789CysTyr: 1.789 ± 1.391
0.0CysXaa: 0.0 ± 0.0
Asp
5.367AspAla: 5.367 ± 1.037
0.0AspCys: 0.0 ± 0.0
7.156AspAsp: 7.156 ± 4.854
1.789AspGlu: 1.789 ± 1.214
1.789AspPhe: 1.789 ± 1.391
5.367AspGly: 5.367 ± 3.641
1.789AspHis: 1.789 ± 1.214
1.789AspIle: 1.789 ± 1.214
7.156AspLys: 7.156 ± 4.854
0.0AspLeu: 0.0 ± 0.0
0.0AspMet: 0.0 ± 0.0
1.789AspAsn: 1.789 ± 1.391
3.578AspPro: 3.578 ± 0.177
1.789AspGln: 1.789 ± 1.391
0.0AspArg: 0.0 ± 0.0
1.789AspSer: 1.789 ± 1.391
3.578AspThr: 3.578 ± 0.177
1.789AspVal: 1.789 ± 1.214
0.0AspTrp: 0.0 ± 0.0
1.789AspTyr: 1.789 ± 1.214
0.0AspXaa: 0.0 ± 0.0
Glu
0.0GluAla: 0.0 ± 0.0
0.0GluCys: 0.0 ± 0.0
1.789GluAsp: 1.789 ± 1.214
1.789GluGlu: 1.789 ± 1.214
3.578GluPhe: 3.578 ± 2.427
1.789GluGly: 1.789 ± 1.391
0.0GluHis: 0.0 ± 0.0
0.0GluIle: 0.0 ± 0.0
3.578GluLys: 3.578 ± 2.427
5.367GluLeu: 5.367 ± 3.641
1.789GluMet: 1.789 ± 1.214
0.0GluAsn: 0.0 ± 0.0
0.0GluPro: 0.0 ± 0.0
0.0GluGln: 0.0 ± 0.0
0.0GluArg: 0.0 ± 0.0
0.0GluSer: 0.0 ± 0.0
3.578GluThr: 3.578 ± 0.177
3.578GluVal: 3.578 ± 2.427
0.0GluTrp: 0.0 ± 0.0
1.789GluTyr: 1.789 ± 1.214
0.0GluXaa: 0.0 ± 0.0
Phe
3.578PheAla: 3.578 ± 2.781
0.0PheCys: 0.0 ± 0.0
1.789PheAsp: 1.789 ± 1.214
1.789PheGlu: 1.789 ± 1.214
0.0PhePhe: 0.0 ± 0.0
5.367PheGly: 5.367 ± 1.037
1.789PheHis: 1.789 ± 1.214
5.367PheIle: 5.367 ± 4.172
7.156PheLys: 7.156 ± 0.354
1.789PheLeu: 1.789 ± 1.214
0.0PheMet: 0.0 ± 0.0
3.578PheAsn: 3.578 ± 0.177
0.0PhePro: 0.0 ± 0.0
1.789PheGln: 1.789 ± 1.391
3.578PheArg: 3.578 ± 2.781
5.367PheSer: 5.367 ± 1.568
3.578PheThr: 3.578 ± 0.177
0.0PheVal: 0.0 ± 0.0
1.789PheTrp: 1.789 ± 1.214
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
8.945GlyAla: 8.945 ± 0.86
0.0GlyCys: 0.0 ± 0.0
5.367GlyAsp: 5.367 ± 1.037
3.578GlyGlu: 3.578 ± 2.427
1.789GlyPhe: 1.789 ± 1.214
7.156GlyGly: 7.156 ± 4.854
1.789GlyHis: 1.789 ± 1.214
3.578GlyIle: 3.578 ± 0.177
5.367GlyLys: 5.367 ± 1.037
3.578GlyLeu: 3.578 ± 0.177
0.0GlyMet: 0.0 ± 0.0
5.367GlyAsn: 5.367 ± 1.037
3.578GlyPro: 3.578 ± 2.427
3.578GlyGln: 3.578 ± 0.177
3.578GlyArg: 3.578 ± 0.177
3.578GlySer: 3.578 ± 0.177
10.733GlyThr: 10.733 ± 0.531
3.578GlyVal: 3.578 ± 0.177
1.789GlyTrp: 1.789 ± 1.214
5.367GlyTyr: 5.367 ± 1.037
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
1.789HisCys: 1.789 ± 1.214
1.789HisAsp: 1.789 ± 1.214
0.0HisGlu: 0.0 ± 0.0
7.156HisPhe: 7.156 ± 2.25
0.0HisGly: 0.0 ± 0.0
1.789HisHis: 1.789 ± 1.214
3.578HisIle: 3.578 ± 2.427
3.578HisLys: 3.578 ± 2.427
0.0HisLeu: 0.0 ± 0.0
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
0.0HisPro: 0.0 ± 0.0
0.0HisGln: 0.0 ± 0.0
0.0HisArg: 0.0 ± 0.0
0.0HisSer: 0.0 ± 0.0
1.789HisThr: 1.789 ± 1.214
0.0HisVal: 0.0 ± 0.0
1.789HisTrp: 1.789 ± 1.214
3.578HisTyr: 3.578 ± 2.427
0.0HisXaa: 0.0 ± 0.0
Ile
7.156IleAla: 7.156 ± 2.958
1.789IleCys: 1.789 ± 1.214
1.789IleAsp: 1.789 ± 1.214
1.789IleGlu: 1.789 ± 1.214
7.156IlePhe: 7.156 ± 0.354
3.578IleGly: 3.578 ± 0.177
0.0IleHis: 0.0 ± 0.0
5.367IleIle: 5.367 ± 4.172
3.578IleLys: 3.578 ± 2.427
5.367IleLeu: 5.367 ± 1.568
1.789IleMet: 1.789 ± 1.391
7.156IleAsn: 7.156 ± 2.958
1.789IlePro: 1.789 ± 1.391
1.789IleGln: 1.789 ± 1.391
5.367IleArg: 5.367 ± 1.037
0.0IleSer: 0.0 ± 0.0
1.789IleThr: 1.789 ± 1.214
5.367IleVal: 5.367 ± 3.641
1.789IleTrp: 1.789 ± 1.214
1.789IleTyr: 1.789 ± 1.214
0.0IleXaa: 0.0 ± 0.0
Lys
3.578LysAla: 3.578 ± 0.177
5.367LysCys: 5.367 ± 1.568
0.0LysAsp: 0.0 ± 0.0
0.0LysGlu: 0.0 ± 0.0
7.156LysPhe: 7.156 ± 2.25
8.945LysGly: 8.945 ± 3.464
0.0LysHis: 0.0 ± 0.0
0.0LysIle: 0.0 ± 0.0
3.578LysLys: 3.578 ± 2.427
5.367LysLeu: 5.367 ± 1.568
1.789LysMet: 1.789 ± 0.859
1.789LysAsn: 1.789 ± 1.391
3.578LysPro: 3.578 ± 0.177
1.789LysGln: 1.789 ± 1.214
3.578LysArg: 3.578 ± 2.427
3.578LysSer: 3.578 ± 2.427
5.367LysThr: 5.367 ± 3.641
1.789LysVal: 1.789 ± 1.391
0.0LysTrp: 0.0 ± 0.0
7.156LysTyr: 7.156 ± 0.354
0.0LysXaa: 0.0 ± 0.0
Leu
3.578LeuAla: 3.578 ± 0.177
1.789LeuCys: 1.789 ± 1.214
5.367LeuAsp: 5.367 ± 3.641
3.578LeuGlu: 3.578 ± 2.427
0.0LeuPhe: 0.0 ± 0.0
1.789LeuGly: 1.789 ± 1.391
0.0LeuHis: 0.0 ± 0.0
0.0LeuIle: 0.0 ± 0.0
1.789LeuLys: 1.789 ± 1.214
0.0LeuLeu: 0.0 ± 0.0
1.789LeuMet: 1.789 ± 1.391
3.578LeuAsn: 3.578 ± 0.177
5.367LeuPro: 5.367 ± 1.037
3.578LeuGln: 3.578 ± 0.177
1.789LeuArg: 1.789 ± 1.214
1.789LeuSer: 1.789 ± 1.391
7.156LeuThr: 7.156 ± 2.25
3.578LeuVal: 3.578 ± 2.781
0.0LeuTrp: 0.0 ± 0.0
7.156LeuTyr: 7.156 ± 4.854
0.0LeuXaa: 0.0 ± 0.0
Met
3.578MetAla: 3.578 ± 2.781
0.0MetCys: 0.0 ± 0.0
0.0MetAsp: 0.0 ± 0.0
0.0MetGlu: 0.0 ± 0.0
1.789MetPhe: 1.789 ± 1.391
3.578MetGly: 3.578 ± 0.177
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
3.578MetLys: 3.578 ± 2.781
0.0MetLeu: 0.0 ± 0.0
0.0MetMet: 0.0 ± 0.0
1.789MetAsn: 1.789 ± 1.391
1.789MetPro: 1.789 ± 1.214
0.0MetGln: 0.0 ± 0.0
0.0MetArg: 0.0 ± 0.0
5.367MetSer: 5.367 ± 1.037
0.0MetThr: 0.0 ± 0.0
0.0MetVal: 0.0 ± 0.0
0.0MetTrp: 0.0 ± 0.0
3.578MetTyr: 3.578 ± 0.177
0.0MetXaa: 0.0 ± 0.0
Asn
3.578AsnAla: 3.578 ± 2.781
0.0AsnCys: 0.0 ± 0.0
1.789AsnAsp: 1.789 ± 1.391
1.789AsnGlu: 1.789 ± 1.214
0.0AsnPhe: 0.0 ± 0.0
5.367AsnGly: 5.367 ± 1.037
1.789AsnHis: 1.789 ± 1.214
3.578AsnIle: 3.578 ± 0.177
5.367AsnLys: 5.367 ± 4.172
3.578AsnLeu: 3.578 ± 2.427
0.0AsnMet: 0.0 ± 0.924
7.156AsnAsn: 7.156 ± 2.958
7.156AsnPro: 7.156 ± 5.562
3.578AsnGln: 3.578 ± 2.781
0.0AsnArg: 0.0 ± 0.0
3.578AsnSer: 3.578 ± 2.781
1.789AsnThr: 1.789 ± 1.391
1.789AsnVal: 1.789 ± 1.391
0.0AsnTrp: 0.0 ± 0.0
5.367AsnTyr: 5.367 ± 1.568
0.0AsnXaa: 0.0 ± 0.0
Pro
0.0ProAla: 0.0 ± 0.0
0.0ProCys: 0.0 ± 0.0
0.0ProAsp: 0.0 ± 0.0
0.0ProGlu: 0.0 ± 0.0
3.578ProPhe: 3.578 ± 2.781
3.578ProGly: 3.578 ± 2.427
0.0ProHis: 0.0 ± 0.0
7.156ProIle: 7.156 ± 2.958
3.578ProLys: 3.578 ± 0.177
3.578ProLeu: 3.578 ± 0.177
0.0ProMet: 0.0 ± 0.0
1.789ProAsn: 1.789 ± 1.391
5.367ProPro: 5.367 ± 1.037
1.789ProGln: 1.789 ± 1.391
1.789ProArg: 1.789 ± 1.214
8.945ProSer: 8.945 ± 1.745
1.789ProThr: 1.789 ± 1.391
5.367ProVal: 5.367 ± 4.172
1.789ProTrp: 1.789 ± 1.214
1.789ProTyr: 1.789 ± 1.214
0.0ProXaa: 0.0 ± 0.0
Gln
1.789GlnAla: 1.789 ± 1.214
0.0GlnCys: 0.0 ± 0.0
0.0GlnAsp: 0.0 ± 0.0
1.789GlnGlu: 1.789 ± 1.391
0.0GlnPhe: 0.0 ± 0.0
5.367GlnGly: 5.367 ± 3.641
0.0GlnHis: 0.0 ± 0.0
1.789GlnIle: 1.789 ± 1.391
0.0GlnLys: 0.0 ± 0.0
3.578GlnLeu: 3.578 ± 0.177
0.0GlnMet: 0.0 ± 0.0
3.578GlnAsn: 3.578 ± 2.781
0.0GlnPro: 0.0 ± 0.0
1.789GlnGln: 1.789 ± 1.214
3.578GlnArg: 3.578 ± 0.177
5.367GlnSer: 5.367 ± 4.172
5.367GlnThr: 5.367 ± 4.172
5.367GlnVal: 5.367 ± 4.172
0.0GlnTrp: 0.0 ± 0.0
1.789GlnTyr: 1.789 ± 1.391
0.0GlnXaa: 0.0 ± 0.0
Arg
1.789ArgAla: 1.789 ± 1.214
1.789ArgCys: 1.789 ± 1.214
3.578ArgAsp: 3.578 ± 0.177
1.789ArgGlu: 1.789 ± 1.214
1.789ArgPhe: 1.789 ± 1.391
5.367ArgGly: 5.367 ± 3.641
0.0ArgHis: 0.0 ± 0.0
1.789ArgIle: 1.789 ± 1.214
1.789ArgLys: 1.789 ± 1.391
5.367ArgLeu: 5.367 ± 1.037
0.0ArgMet: 0.0 ± 0.0
0.0ArgAsn: 0.0 ± 0.0
0.0ArgPro: 0.0 ± 0.0
3.578ArgGln: 3.578 ± 2.781
1.789ArgArg: 1.789 ± 1.214
1.789ArgSer: 1.789 ± 1.214
3.578ArgThr: 3.578 ± 2.427
1.789ArgVal: 1.789 ± 1.391
1.789ArgTrp: 1.789 ± 1.214
1.789ArgTyr: 1.789 ± 1.391
0.0ArgXaa: 0.0 ± 0.0
Ser
0.0SerAla: 0.0 ± 0.0
0.0SerCys: 0.0 ± 0.0
5.367SerAsp: 5.367 ± 1.568
1.789SerGlu: 1.789 ± 1.214
7.156SerPhe: 7.156 ± 5.562
5.367SerGly: 5.367 ± 1.568
0.0SerHis: 0.0 ± 0.0
7.156SerIle: 7.156 ± 0.354
1.789SerLys: 1.789 ± 1.214
0.0SerLeu: 0.0 ± 0.0
0.0SerMet: 0.0 ± 0.0
3.578SerAsn: 3.578 ± 0.177
1.789SerPro: 1.789 ± 1.391
5.367SerGln: 5.367 ± 4.172
5.367SerArg: 5.367 ± 3.641
3.578SerSer: 3.578 ± 2.781
5.367SerThr: 5.367 ± 1.037
7.156SerVal: 7.156 ± 2.958
3.578SerTrp: 3.578 ± 2.781
3.578SerTyr: 3.578 ± 2.781
0.0SerXaa: 0.0 ± 0.0
Thr
7.156ThrAla: 7.156 ± 2.25
0.0ThrCys: 0.0 ± 0.0
5.367ThrAsp: 5.367 ± 1.037
1.789ThrGlu: 1.789 ± 1.391
1.789ThrPhe: 1.789 ± 1.391
3.578ThrGly: 3.578 ± 0.177
3.578ThrHis: 3.578 ± 2.427
3.578ThrIle: 3.578 ± 0.177
1.789ThrLys: 1.789 ± 1.391
7.156ThrLeu: 7.156 ± 4.854
1.789ThrMet: 1.789 ± 1.214
3.578ThrAsn: 3.578 ± 2.781
8.945ThrPro: 8.945 ± 4.349
1.789ThrGln: 1.789 ± 1.391
3.578ThrArg: 3.578 ± 0.177
3.578ThrSer: 3.578 ± 2.781
7.156ThrThr: 7.156 ± 5.562
5.367ThrVal: 5.367 ± 1.037
1.789ThrTrp: 1.789 ± 1.214
1.789ThrTyr: 1.789 ± 1.391
0.0ThrXaa: 0.0 ± 0.0
Val
7.156ValAla: 7.156 ± 2.25
0.0ValCys: 0.0 ± 0.0
0.0ValAsp: 0.0 ± 0.0
1.789ValGlu: 1.789 ± 1.214
0.0ValPhe: 0.0 ± 0.0
1.789ValGly: 1.789 ± 1.391
5.367ValHis: 5.367 ± 3.641
3.578ValIle: 3.578 ± 0.177
5.367ValLys: 5.367 ± 1.037
3.578ValLeu: 3.578 ± 0.177
0.0ValMet: 0.0 ± 0.0
5.367ValAsn: 5.367 ± 4.172
5.367ValPro: 5.367 ± 4.172
0.0ValGln: 0.0 ± 0.0
1.789ValArg: 1.789 ± 1.391
5.367ValSer: 5.367 ± 4.172
3.578ValThr: 3.578 ± 2.781
0.0ValVal: 0.0 ± 0.0
0.0ValTrp: 0.0 ± 0.0
5.367ValTyr: 5.367 ± 4.172
0.0ValXaa: 0.0 ± 0.0
Trp
1.789TrpAla: 1.789 ± 1.214
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
5.367TrpGly: 5.367 ± 1.037
1.789TrpHis: 1.789 ± 1.214
3.578TrpIle: 3.578 ± 2.427
0.0TrpLys: 0.0 ± 0.0
0.0TrpLeu: 0.0 ± 0.0
1.789TrpMet: 1.789 ± 1.214
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
3.578TrpSer: 3.578 ± 0.177
0.0TrpThr: 0.0 ± 0.0
0.0TrpVal: 0.0 ± 0.0
1.789TrpTrp: 1.789 ± 1.214
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.0TyrAla: 0.0 ± 0.0
1.789TyrCys: 1.789 ± 1.214
1.789TyrAsp: 1.789 ± 1.214
1.789TyrGlu: 1.789 ± 1.214
3.578TyrPhe: 3.578 ± 0.177
0.0TyrGly: 0.0 ± 0.0
0.0TyrHis: 0.0 ± 0.0
5.367TyrIle: 5.367 ± 1.037
3.578TyrLys: 3.578 ± 2.427
3.578TyrLeu: 3.578 ± 0.177
7.156TyrMet: 7.156 ± 5.562
5.367TyrAsn: 5.367 ± 1.568
1.789TyrPro: 1.789 ± 1.214
3.578TyrGln: 3.578 ± 2.427
5.367TyrArg: 5.367 ± 1.037
5.367TyrSer: 5.367 ± 1.568
1.789TyrThr: 1.789 ± 1.214
5.367TyrVal: 5.367 ± 4.172
1.789TyrTrp: 1.789 ± 1.214
5.367TyrTyr: 5.367 ± 1.568
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (560 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski