Amino acid dipeptide frequency for Odonata-associated circular virus-7

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
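
The calculation described above can be sketched as follows. This is a minimal illustration, not the database's own code: the function names, the toy sequences, and the choice to count pairs only within each protein (never across protein boundaries) are assumptions, and the threshold used for the red/blue marking is assumed to be the uniform expectation of 2.5‰.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code (same order as the table).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation: 1 of 400 equally likely dipeptides = 2.5 per mille.
EXPECTED = 1000.0 / len(STANDARD) ** 2


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all adjacent residue pairs.

    Assumption: pairs are counted within each protein only, never across
    the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2: "MKKL" -> MK, KK, KL
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freqs):
    """Label every standard dipeptide as 'over', 'under' or 'expected'."""
    labels = {}
    for a, b in product(STANDARD, repeat=2):
        value = freqs.get(a + b, 0.0)
        if value > EXPECTED:
            labels[a + b] = "over"
        elif value < EXPECTED:
            labels[a + b] = "under"
        else:
            labels[a + b] = "expected"
    return labels


# Hypothetical toy sequences, not the real proteome.
toy = ["MKKLA", "AAKK"]
freqs = dipeptide_per_mille(toy)
labels = classify(freqs)
```

In this toy input, KK occurs in 2 of 7 counted pairs, so it is labelled "over", while a dipeptide that never occurs (for example WW) is labelled "under".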

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
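
As a small worked example of the unit conversion, the sketch below turns the AlaAla value from the table into a fraction and a percentage. The helper names are hypothetical, and the denominator of 877 pairs (878 amino acids minus 1) is only inferred from the table values; the source does not state it.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala = 9.122  # AlaAla frequency from the table, in per mille
fraction = per_mille_to_fraction(ala_ala)
percent = per_mille_to_percent(ala_ala)

# Uniform expectation of 1/400, written in per mille and converted to percent.
uniform_percent = per_mille_to_percent(2.5)

# Assumption: 877 counted pairs, inferred from the table and not stated by the source.
approx_count = fraction * 877
```

Under that assumed denominator, 9.122‰ corresponds to roughly 8 observed AlaAla pairs, and the smallest non-zero value in the table, 1.14‰, to a single pair.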

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰) ± error, grouped by first residue. Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.122 ± 1.088
AlaCys: 1.14 ± 2.078
AlaAsp: 3.421 ± 1.925
AlaGlu: 3.421 ± 1.587
AlaPhe: 1.14 ± 1.023
AlaGly: 4.561 ± 2.732
AlaHis: 0.0 ± 0.0
AlaIle: 2.281 ± 1.366
AlaLys: 4.561 ± 1.894
AlaLeu: 3.421 ± 0.867
AlaMet: 2.281 ± 1.366
AlaAsn: 3.421 ± 2.049
AlaPro: 4.561 ± 1.416
AlaGln: 3.421 ± 0.867
AlaArg: 4.561 ± 1.416
AlaSer: 2.281 ± 0.658
AlaThr: 0.0 ± 0.0
AlaVal: 4.561 ± 1.416
AlaTrp: 1.14 ± 0.683
AlaTyr: 1.14 ± 1.023
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.14 ± 0.683
CysCys: 2.281 ± 1.366
CysAsp: 1.14 ± 2.078
CysGlu: 3.421 ± 4.041
CysPhe: 1.14 ± 1.023
CysGly: 1.14 ± 1.023
CysHis: 0.0 ± 0.0
CysIle: 3.421 ± 0.867
CysLys: 0.0 ± 0.0
CysLeu: 1.14 ± 1.023
CysMet: 0.0 ± 0.0
CysAsn: 2.281 ± 0.658
CysPro: 1.14 ± 0.683
CysGln: 1.14 ± 1.023
CysArg: 2.281 ± 0.658
CysSer: 0.0 ± 0.0
CysThr: 1.14 ± 0.683
CysVal: 0.0 ± 0.0
CysTrp: 1.14 ± 2.078
CysTyr: 1.14 ± 0.683
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.14 ± 0.683
AspCys: 0.0 ± 0.0
AspAsp: 4.561 ± 4.182
AspGlu: 3.421 ± 4.041
AspPhe: 3.421 ± 2.553
AspGly: 6.842 ± 3.935
AspHis: 1.14 ± 1.023
AspIle: 4.561 ± 6.077
AspLys: 1.14 ± 2.078
AspLeu: 3.421 ± 1.587
AspMet: 1.14 ± 1.023
AspAsn: 1.14 ± 0.683
AspPro: 3.421 ± 1.925
AspGln: 3.421 ± 0.867
AspArg: 3.421 ± 1.579
AspSer: 3.421 ± 3.906
AspThr: 2.281 ± 0.658
AspVal: 1.14 ± 0.683
AspTrp: 0.0 ± 0.0
AspTyr: 0.0 ± 0.0
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.421 ± 2.049
GluCys: 0.0 ± 0.0
GluAsp: 3.421 ± 3.906
GluGlu: 6.842 ± 3.935
GluPhe: 4.561 ± 4.182
GluGly: 4.561 ± 1.894
GluHis: 1.14 ± 1.023
GluIle: 5.701 ± 3.329
GluLys: 0.0 ± 0.0
GluLeu: 6.842 ± 0.721
GluMet: 2.281 ± 0.658
GluAsn: 1.14 ± 0.683
GluPro: 5.701 ± 1.38
GluGln: 5.701 ± 1.27
GluArg: 3.421 ± 3.069
GluSer: 4.561 ± 2.578
GluThr: 4.561 ± 1.265
GluVal: 1.14 ± 1.023
GluTrp: 2.281 ± 2.091
GluTyr: 3.421 ± 1.587
GluXaa: 0.0 ± 0.0
Phe
PheAla: 6.842 ± 1.735
PheCys: 2.281 ± 0.658
PheAsp: 3.421 ± 3.069
PheGlu: 0.0 ± 0.0
PhePhe: 0.0 ± 0.0
PheGly: 0.0 ± 0.0
PheHis: 1.14 ± 1.023
PheIle: 3.421 ± 2.553
PheLys: 4.561 ± 1.416
PheLeu: 3.421 ± 1.579
PheMet: 0.0 ± 0.0
PheAsn: 5.701 ± 2.598
PhePro: 4.561 ± 1.416
PheGln: 3.421 ± 4.041
PheArg: 3.421 ± 0.867
PheSer: 2.281 ± 0.658
PheThr: 4.561 ± 1.316
PheVal: 0.0 ± 0.0
PheTrp: 2.281 ± 1.366
PheTyr: 2.281 ± 1.366
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.281 ± 4.156
GlyCys: 2.281 ± 0.658
GlyAsp: 3.421 ± 2.553
GlyGlu: 4.561 ± 2.578
GlyPhe: 1.14 ± 1.023
GlyGly: 4.561 ± 1.416
GlyHis: 0.0 ± 0.0
GlyIle: 0.0 ± 0.0
GlyLys: 5.701 ± 2.047
GlyLeu: 4.561 ± 2.732
GlyMet: 1.14 ± 2.078
GlyAsn: 0.0 ± 0.0
GlyPro: 1.14 ± 1.023
GlyGln: 1.14 ± 0.683
GlyArg: 2.281 ± 1.366
GlySer: 3.421 ± 2.049
GlyThr: 9.122 ± 2.209
GlyVal: 5.701 ± 2.047
GlyTrp: 2.281 ± 2.046
GlyTyr: 2.281 ± 1.883
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.281 ± 2.046
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 1.14 ± 1.023
HisPhe: 1.14 ± 1.023
HisGly: 1.14 ± 1.023
HisHis: 0.0 ± 0.0
HisIle: 3.421 ± 3.069
HisLys: 1.14 ± 1.023
HisLeu: 1.14 ± 0.683
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 3.421 ± 1.579
HisGln: 1.14 ± 1.023
HisArg: 0.0 ± 0.0
HisSer: 0.0 ± 0.0
HisThr: 0.0 ± 0.0
HisVal: 1.14 ± 1.023
HisTrp: 0.0 ± 0.0
HisTyr: 1.14 ± 1.023
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.14 ± 0.683
IleCys: 0.0 ± 0.0
IleAsp: 3.421 ± 4.041
IleGlu: 3.421 ± 1.587
IlePhe: 2.281 ± 0.658
IleGly: 3.421 ± 0.867
IleHis: 2.281 ± 2.046
IleIle: 2.281 ± 2.046
IleLys: 2.281 ± 2.046
IleLeu: 3.421 ± 0.867
IleMet: 3.421 ± 1.925
IleAsn: 2.281 ± 2.091
IlePro: 1.14 ± 1.023
IleGln: 1.14 ± 0.683
IleArg: 3.421 ± 3.906
IleSer: 6.842 ± 0.721
IleThr: 4.561 ± 1.265
IleVal: 6.842 ± 3.139
IleTrp: 0.0 ± 0.0
IleTyr: 3.421 ± 1.579
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.421 ± 1.925
LysCys: 1.14 ± 1.023
LysAsp: 2.281 ± 0.658
LysGlu: 3.421 ± 1.587
LysPhe: 2.281 ± 1.366
LysGly: 0.0 ± 0.0
LysHis: 0.0 ± 0.0
LysIle: 4.561 ± 1.416
LysLys: 9.122 ± 2.832
LysLeu: 6.842 ± 2.704
LysMet: 0.0 ± 0.0
LysAsn: 3.421 ± 1.925
LysPro: 1.14 ± 1.023
LysGln: 2.281 ± 1.366
LysArg: 9.122 ± 2.789
LysSer: 5.701 ± 1.27
LysThr: 5.701 ± 3.59
LysVal: 3.421 ± 0.867
LysTrp: 0.0 ± 0.0
LysTyr: 3.421 ± 2.049
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.561 ± 1.265
LeuCys: 2.281 ± 0.658
LeuAsp: 3.421 ± 1.587
LeuGlu: 10.262 ± 1.708
LeuPhe: 3.421 ± 3.069
LeuGly: 2.281 ± 1.883
LeuHis: 1.14 ± 1.023
LeuIle: 2.281 ± 2.091
LeuLys: 7.982 ± 4.781
LeuLeu: 1.14 ± 0.683
LeuMet: 1.14 ± 2.078
LeuAsn: 2.281 ± 1.366
LeuPro: 1.14 ± 0.683
LeuGln: 9.122 ± 2.832
LeuArg: 4.561 ± 2.732
LeuSer: 5.701 ± 1.38
LeuThr: 3.421 ± 1.579
LeuVal: 6.842 ± 2.704
LeuTrp: 1.14 ± 1.023
LeuTyr: 3.421 ± 2.049
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.281 ± 1.366
MetCys: 0.0 ± 0.0
MetAsp: 1.14 ± 2.078
MetGlu: 1.14 ± 1.023
MetPhe: 2.281 ± 1.883
MetGly: 1.14 ± 2.078
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 4.561 ± 2.19
MetLeu: 0.0 ± 0.0
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 1.14 ± 1.023
MetGln: 1.14 ± 1.023
MetArg: 1.14 ± 0.683
MetSer: 0.0 ± 0.0
MetThr: 1.14 ± 1.023
MetVal: 2.281 ± 1.366
MetTrp: 1.14 ± 0.683
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.561 ± 1.416
AsnCys: 1.14 ± 0.683
AsnAsp: 1.14 ± 1.023
AsnGlu: 2.281 ± 0.658
AsnPhe: 3.421 ± 2.049
AsnGly: 2.281 ± 0.658
AsnHis: 2.281 ± 0.658
AsnIle: 2.281 ± 0.658
AsnLys: 1.14 ± 2.078
AsnLeu: 6.842 ± 3.128
AsnMet: 3.421 ± 1.328
AsnAsn: 2.281 ± 1.366
AsnPro: 2.281 ± 0.658
AsnGln: 5.701 ± 3.415
AsnArg: 3.421 ± 1.579
AsnSer: 0.0 ± 0.0
AsnThr: 1.14 ± 0.683
AsnVal: 3.421 ± 0.867
AsnTrp: 0.0 ± 0.0
AsnTyr: 3.421 ± 0.867
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.281 ± 1.366
ProCys: 2.281 ± 2.091
ProAsp: 1.14 ± 2.078
ProGlu: 1.14 ± 1.023
ProPhe: 5.701 ± 1.38
ProGly: 4.561 ± 2.732
ProHis: 0.0 ± 0.0
ProIle: 2.281 ± 1.366
ProLys: 3.421 ± 3.069
ProLeu: 3.421 ± 0.867
ProMet: 0.0 ± 0.0
ProAsn: 4.561 ± 1.416
ProPro: 0.0 ± 0.0
ProGln: 1.14 ± 0.683
ProArg: 5.701 ± 3.59
ProSer: 3.421 ± 0.867
ProThr: 2.281 ± 2.046
ProVal: 2.281 ± 1.366
ProTrp: 0.0 ± 0.0
ProTyr: 0.0 ± 0.0
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.421 ± 0.867
GlnCys: 0.0 ± 0.0
GlnAsp: 3.421 ± 1.925
GlnGlu: 5.701 ± 3.59
GlnPhe: 1.14 ± 0.683
GlnGly: 2.281 ± 1.366
GlnHis: 1.14 ± 1.023
GlnIle: 4.561 ± 1.416
GlnLys: 1.14 ± 0.683
GlnLeu: 3.421 ± 1.925
GlnMet: 0.0 ± 0.0
GlnAsn: 3.421 ± 2.049
GlnPro: 5.701 ± 2.612
GlnGln: 0.0 ± 0.0
GlnArg: 3.421 ± 1.579
GlnSer: 3.421 ± 0.867
GlnThr: 1.14 ± 0.683
GlnVal: 1.14 ± 0.683
GlnTrp: 0.0 ± 0.0
GlnTyr: 3.421 ± 0.867
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.421 ± 0.867
ArgCys: 1.14 ± 0.683
ArgAsp: 2.281 ± 2.091
ArgGlu: 2.281 ± 1.883
ArgPhe: 4.561 ± 1.316
ArgGly: 2.281 ± 0.658
ArgHis: 2.281 ± 0.658
ArgIle: 1.14 ± 0.683
ArgLys: 6.842 ± 1.599
ArgLeu: 7.982 ± 1.233
ArgMet: 0.0 ± 0.0
ArgAsn: 3.421 ± 1.579
ArgPro: 2.281 ± 0.658
ArgGln: 1.14 ± 1.023
ArgArg: 3.421 ± 0.867
ArgSer: 4.561 ± 1.316
ArgThr: 6.842 ± 3.174
ArgVal: 2.281 ± 0.658
ArgTrp: 0.0 ± 0.0
ArgTyr: 4.561 ± 1.416
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.281 ± 1.366
SerCys: 2.281 ± 1.883
SerAsp: 1.14 ± 2.078
SerGlu: 2.281 ± 1.883
SerPhe: 5.701 ± 2.047
SerGly: 3.421 ± 0.867
SerHis: 2.281 ± 2.046
SerIle: 4.561 ± 2.578
SerLys: 4.561 ± 1.316
SerLeu: 5.701 ± 1.38
SerMet: 2.281 ± 1.366
SerAsn: 3.421 ± 2.049
SerPro: 2.281 ± 0.658
SerGln: 3.421 ± 0.867
SerArg: 4.561 ± 2.19
SerSer: 9.122 ± 4.301
SerThr: 1.14 ± 0.683
SerVal: 4.561 ± 1.416
SerTrp: 1.14 ± 1.023
SerTyr: 1.14 ± 0.683
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.14 ± 1.023
ThrCys: 2.281 ± 0.658
ThrAsp: 4.561 ± 1.316
ThrGlu: 9.122 ± 2.529
ThrPhe: 4.561 ± 1.316
ThrGly: 1.14 ± 0.683
ThrHis: 1.14 ± 1.023
ThrIle: 4.561 ± 1.894
ThrLys: 2.281 ± 1.366
ThrLeu: 2.281 ± 0.658
ThrMet: 0.0 ± 0.0
ThrAsn: 5.701 ± 1.38
ThrPro: 1.14 ± 1.023
ThrGln: 2.281 ± 1.366
ThrArg: 1.14 ± 2.078
ThrSer: 1.14 ± 0.683
ThrThr: 2.281 ± 1.366
ThrVal: 3.421 ± 0.867
ThrTrp: 0.0 ± 0.0
ThrTyr: 4.561 ± 2.578
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.421 ± 0.867
ValCys: 0.0 ± 0.0
ValAsp: 3.421 ± 0.867
ValGlu: 3.421 ± 2.049
ValPhe: 3.421 ± 1.579
ValGly: 6.842 ± 1.735
ValHis: 1.14 ± 1.023
ValIle: 2.281 ± 1.883
ValLys: 4.561 ± 1.265
ValLeu: 6.842 ± 1.735
ValMet: 1.14 ± 0.644
ValAsn: 3.421 ± 0.867
ValPro: 2.281 ± 0.658
ValGln: 0.0 ± 0.0
ValArg: 3.421 ± 0.867
ValSer: 6.842 ± 4.098
ValThr: 2.281 ± 1.366
ValVal: 3.421 ± 0.867
ValTrp: 0.0 ± 0.0
ValTyr: 1.14 ± 0.683
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.14 ± 1.023
TrpCys: 1.14 ± 2.078
TrpAsp: 1.14 ± 0.683
TrpGlu: 1.14 ± 1.023
TrpPhe: 0.0 ± 0.0
TrpGly: 2.281 ± 2.046
TrpHis: 1.14 ± 1.023
TrpIle: 1.14 ± 0.683
TrpLys: 1.14 ± 0.683
TrpLeu: 1.14 ± 2.078
TrpMet: 0.0 ± 0.0
TrpAsn: 1.14 ± 0.683
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 0.0 ± 0.0
TrpSer: 0.0 ± 0.0
TrpThr: 0.0 ± 0.0
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.14 ± 0.683
TyrCys: 3.421 ± 1.579
TyrAsp: 1.14 ± 1.023
TyrGlu: 2.281 ± 0.658
TyrPhe: 2.281 ± 1.366
TyrGly: 2.281 ± 1.366
TyrHis: 0.0 ± 0.0
TyrIle: 2.281 ± 4.156
TyrLys: 1.14 ± 1.023
TyrLeu: 4.561 ± 2.732
TyrMet: 1.14 ± 0.683
TyrAsn: 4.561 ± 1.316
TyrPro: 1.14 ± 1.023
TyrGln: 1.14 ± 0.683
TyrArg: 0.0 ± 0.0
TyrSer: 4.561 ± 1.416
TyrThr: 1.14 ± 0.683
TyrVal: 5.701 ± 1.38
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.14 ± 0.683
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (878 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
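
A protein-level bootstrap of this kind can be sketched as below. This is an illustration under stated assumptions rather than the database's actual procedure: the function names and toy sequences are hypothetical, the reported error is assumed to be the standard deviation across replicates, and each replicate is assumed to draw as many proteins (with replacement) as the proteome contains.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, pair):
    """Frequency of one dipeptide (per mille) over all adjacent residue pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples.

    Assumption: the error is a (population) standard deviation; the source
    does not say which statistic is reported.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    values = []
    for _ in range(replicates):
        # Draw whole proteins with replacement, as many as the proteome has.
        sample = rng.choices(proteins, k=len(proteins))
        values.append(dipeptide_per_mille(sample, pair))
    return statistics.pstdev(values)


# Hypothetical toy sequences, not the real proteome.
toy = ["MKKLA", "AAKK", "MLLA"]
error_kk = bootstrap_error(toy, "KK")
error_ww = bootstrap_error(toy, "WW")
```

A dipeptide that occurs in no protein has a frequency of 0 in every resample, so its error is exactly 0, which matches the "0.0 ± 0.0" entries in the table.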

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in UniProt.
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski