Amino acid dipeptide frequency for Circovirus-like genome DCCV-2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
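The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used to build the database: the function names, the toy sequences, and the choice to count overlapping pairs within each protein only (never across protein boundaries) are all assumptions.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes
EXPECTED_PER_MILLE = 1000.0 / 400     # uniform expectation: 0.25% = 2.5 per mille


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for all 400 standard pairs.

    Assumption: overlapping windows of length 2 are counted inside each
    sequence, and a pair never spans two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(AMINO_ACIDS, repeat=2)}


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# Hypothetical toy proteins, used only to exercise the functions.
toy = ["MKTAYIAK", "MATT"]
freqs = dipeptide_per_mille(toy)
print(len(freqs), freqs["TT"], classify(freqs["TT"]), classify(freqs["GG"]))
```

With so few residues, most of the 400 dipeptides never occur at all, which is also why a small proteome such as this one shows many zero entries.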

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
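As a short worked example of the unit conversion, the snippet below takes the AlaThr value from the table (13.294‰) and expresses it as a fraction, as a percentage, and relative to the uniform expectation of 2.5‰. The helper name is hypothetical.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


ala_thr = 13.294                       # AlaThr frequency from the table, in per mille
fraction = per_mille_to_fraction(ala_thr)
percent = fraction * 100               # the same value expressed in percent
uniform_per_mille = 1000 / 400         # 1/400 of all dipeptides = 2.5 per mille
fold = ala_thr / uniform_per_mille     # enrichment over the uniform expectation

print(f"{fraction:.6f} {percent:.4f}% {fold:.2f}x")
```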

For more information see sequence space article on Wikipedia.

| 1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 4.431 ± 1.766 | 0.0 ± 0.0 | 1.477 ± 1.122 | 1.477 ± 1.122 | 1.477 ± 1.122 | 4.431 ± 1.766 | 0.0 ± 0.0 | 5.908 ± 2.667 | 4.431 ± 0.953 | 4.431 ± 0.953 | 2.954 ± 1.879 | 4.431 ± 0.953 | 1.477 ± 0.955 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.954 ± 0.87 | 13.294 ± 1.607 | 4.431 ± 1.766 | 5.908 ± 2.667 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Cys | 2.954 ± 1.642 | 1.477 ± 1.795 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.477 ± 0.955 | 2.954 ± 1.642 | 4.431 ± 2.864 | 1.477 ± 0.955 | 1.477 ± 0.955 | 1.477 ± 1.795 | 1.477 ± 1.122 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.477 ± 1.122 | 0.0 ± 0.0 | 1.477 ± 1.795 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Asp | 1.477 ± 1.122 | 1.477 ± 0.955 | 4.431 ± 1.766 | 0.0 ± 0.0 | 2.954 ± 0.87 | 4.431 ± 2.864 | 1.477 ± 0.955 | 1.477 ± 0.955 | 1.477 ± 0.955 | 2.954 ± 0.87 | 2.954 ± 1.909 | 2.954 ± 0.87 | 4.431 ± 1.441 | 1.477 ± 0.955 | 2.954 ± 1.909 | 4.431 ± 1.766 | 4.431 ± 1.766 | 0.0 ± 0.0 | 1.477 ± 0.955 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Glu | 4.431 ± 1.766 | 0.0 ± 0.0 | 2.954 ± 1.909 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.477 ± 1.122 | 0.0 ± 0.0 | 1.477 ± 0.955 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.954 ± 1.642 | 4.431 ± 2.864 | 1.477 ± 1.795 | 5.908 ± 2.667 | 1.477 ± 1.795 | 1.477 ± 1.795 | 1.477 ± 0.955 | 1.477 ± 0.955 | 0.0 ± 0.0 |
| Phe | 4.431 ± 3.366 | 1.477 ± 1.122 | 5.908 ± 2.285 | 1.477 ± 1.122 | 2.954 ± 0.87 | 0.0 ± 0.0 | 1.477 ± 0.955 | 2.954 ± 1.909 | 1.477 ± 0.955 | 1.477 ± 1.795 | 1.477 ± 1.122 | 2.954 ± 1.688 | 5.908 ± 2.829 | 2.954 ± 0.87 | 2.954 ± 0.87 | 1.477 ± 0.955 | 5.908 ± 2.829 | 2.954 ± 1.909 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Gly | 2.954 ± 1.688 | 2.954 ± 1.909 | 7.386 ± 3.191 | 0.0 ± 0.0 | 5.908 ± 1.28 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.954 ± 3.59 | 5.908 ± 0.89 | 2.954 ± 0.87 | 1.477 ± 0.955 | 1.477 ± 1.122 | 1.477 ± 1.122 | 2.954 ± 2.244 | 2.954 ± 3.59 | 5.908 ± 2.74 | 1.477 ± 0.955 | 5.908 ± 2.285 | 0.0 ± 0.0 | 2.954 ± 1.688 | 0.0 ± 0.0 |
| His | 1.477 ± 0.955 | 1.477 ± 1.795 | 0.0 ± 0.0 | 1.477 ± 0.955 | 4.431 ± 1.441 | 1.477 ± 1.122 | 0.0 ± 0.0 | 1.477 ± 0.955 | 1.477 ± 0.955 | 2.954 ± 1.909 | 2.954 ± 1.909 | 0.0 ± 0.0 | 1.477 ± 0.955 | 1.477 ± 0.955 | 1.477 ± 0.955 | 1.477 ± 0.955 | 1.477 ± 0.955 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.477 ± 1.122 | 0.0 ± 0.0 |
| Ile | 1.477 ± 1.795 | 2.954 ± 1.909 | 0.0 ± 0.0 | 2.954 ± 1.909 | 2.954 ± 0.87 | 7.386 ± 2.181 | 2.954 ± 1.909 | 1.477 ± 0.955 | 2.954 ± 1.642 | 2.954 ± 0.87 | 0.0 ± 0.0 | 1.477 ± 0.955 | 2.954 ± 0.87 | 1.477 ± 1.795 | 4.431 ± 1.999 | 5.908 ± 5.036 | 1.477 ± 1.795 | 4.431 ± 1.999 | 1.477 ± 0.955 | 4.431 ± 0.953 | 0.0 ± 0.0 |
| Lys | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.954 ± 1.909 | 1.477 ± 0.955 | 0.0 ± 0.0 | 0.0 ± 0.0 | 5.908 ± 2.285 | 7.386 ± 0.329 | 5.908 ± 1.74 | 1.477 ± 1.122 | 1.477 ± 1.122 | 7.386 ± 2.44 | 2.954 ± 1.642 | 0.0 ± 0.0 | 1.477 ± 0.955 | 1.477 ± 0.955 | 7.386 ± 0.329 | 1.477 ± 1.122 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Leu | 2.954 ± 1.688 | 2.954 ± 1.642 | 4.431 ± 1.441 | 5.908 ± 3.285 | 4.431 ± 1.766 | 2.954 ± 1.642 | 0.0 ± 0.0 | 1.477 ± 1.795 | 4.431 ± 0.953 | 4.431 ± 0.953 | 1.477 ± 0.876 | 2.954 ± 1.688 | 4.431 ± 1.999 | 1.477 ± 1.122 | 8.863 ± 2.494 | 1.477 ± 1.795 | 8.863 ± 5.032 | 5.908 ± 0.89 | 1.477 ± 1.122 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Met | 0.0 ± 0.0 | 1.477 ± 0.955 | 4.431 ± 2.864 | 4.431 ± 1.999 | 1.477 ± 0.955 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.477 ± 1.122 | 4.431 ± 3.299 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.477 ± 1.122 | 1.477 ± 1.122 | 1.477 ± 1.122 | 2.954 ± 0.87 | 1.477 ± 0.955 | 0.0 ± 0.0 | 1.477 ± 1.122 | 0.0 ± 0.0 |
| Asn | 7.386 ± 2.548 | 0.0 ± 0.0 | 2.954 ± 2.244 | 0.0 ± 0.0 | 4.431 ± 1.999 | 4.431 ± 3.366 | 1.477 ± 1.795 | 2.954 ± 2.244 | 1.477 ± 1.122 | 1.477 ± 0.955 | 0.0 ± 0.0 | 5.908 ± 5.06 | 5.908 ± 0.89 | 2.954 ± 2.244 | 5.908 ± 3.107 | 1.477 ± 0.955 | 2.954 ± 3.59 | 4.431 ± 1.441 | 0.0 ± 0.0 | 2.954 ± 0.87 | 0.0 ± 0.0 |
| Pro | 4.431 ± 2.864 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.954 ± 1.909 | 2.954 ± 0.87 | 5.908 ± 5.036 | 0.0 ± 0.0 | 7.386 ± 4.533 | 1.477 ± 0.955 | 4.431 ± 0.953 | 0.0 ± 0.0 | 1.477 ± 0.955 | 5.908 ± 0.89 | 1.477 ± 0.955 | 5.908 ± 2.285 | 2.954 ± 0.87 | 5.908 ± 1.28 | 4.431 ± 3.366 | 1.477 ± 0.955 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Gln | 4.431 ± 1.766 | 0.0 ± 0.0 | 1.477 ± 0.955 | 0.0 ± 0.0 | 4.431 ± 1.441 | 4.431 ± 1.441 | 1.477 ± 0.955 | 2.954 ± 0.87 | 0.0 ± 0.0 | 2.954 ± 0.87 | 0.0 ± 0.0 | 1.477 ± 1.122 | 0.0 ± 0.0 | 2.954 ± 0.87 | 0.0 ± 0.0 | 1.477 ± 0.955 | 10.34 ± 6.225 | 1.477 ± 1.122 | 0.0 ± 0.0 | 2.954 ± 1.909 | 0.0 ± 0.0 |
| Arg | 4.431 ± 2.864 | 0.0 ± 0.0 | 4.431 ± 1.441 | 2.954 ± 1.909 | 0.0 ± 0.0 | 5.908 ± 5.036 | 1.477 ± 0.955 | 2.954 ± 1.909 | 1.477 ± 0.955 | 7.386 ± 4.923 | 2.954 ± 1.688 | 1.477 ± 1.795 | 0.0 ± 0.0 | 0.0 ± 0.0 | 10.34 ± 4.049 | 2.954 ± 0.87 | 2.954 ± 0.87 | 1.477 ± 1.795 | 0.0 ± 0.0 | 2.954 ± 0.87 | 0.0 ± 0.0 |
| Ser | 2.954 ± 3.59 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 5.908 ± 4.489 | 4.431 ± 0.953 | 1.477 ± 0.955 | 1.477 ± 0.955 | 0.0 ± 0.0 | 11.817 ± 1.78 | 1.477 ± 0.955 | 8.863 ± 2.61 | 4.431 ± 0.953 | 2.954 ± 0.87 | 2.954 ± 1.642 | 4.431 ± 2.234 | 10.34 ± 3.689 | 2.954 ± 1.688 | 0.0 ± 0.0 | 1.477 ± 1.122 | 0.0 ± 0.0 |
| Thr | 8.863 ± 1.907 | 2.954 ± 1.688 | 1.477 ± 1.122 | 1.477 ± 1.795 | 1.477 ± 1.122 | 5.908 ± 0.89 | 1.477 ± 0.955 | 4.431 ± 3.306 | 4.431 ± 1.766 | 5.908 ± 1.28 | 0.0 ± 0.0 | 8.863 ± 5.032 | 8.863 ± 0.633 | 2.954 ± 2.244 | 2.954 ± 1.688 | 14.771 ± 7.51 | 5.908 ± 1.74 | 5.908 ± 2.829 | 1.477 ± 0.955 | 10.34 ± 2.509 | 0.0 ± 0.0 |
| Val | 2.954 ± 0.87 | 0.0 ± 0.0 | 1.477 ± 1.122 | 2.954 ± 0.87 | 1.477 ± 0.955 | 1.477 ± 0.955 | 1.477 ± 1.122 | 4.431 ± 1.441 | 2.954 ± 2.244 | 2.954 ± 1.909 | 4.431 ± 0.953 | 5.908 ± 2.829 | 2.954 ± 1.688 | 7.386 ± 2.181 | 0.0 ± 0.0 | 2.954 ± 1.909 | 7.386 ± 2.57 | 4.431 ± 3.366 | 1.477 ± 1.795 | 1.477 ± 1.122 | 0.0 ± 0.0 |
| Trp | 1.477 ± 0.955 | 1.477 ± 1.795 | 1.477 ± 0.955 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 4.431 ± 1.999 | 1.477 ± 0.955 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.477 ± 0.955 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.477 ± 1.122 | 2.954 ± 1.909 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Tyr | 1.477 ± 1.122 | 1.477 ± 0.955 | 0.0 ± 0.0 | 1.477 ± 1.795 | 1.477 ± 1.122 | 1.477 ± 0.955 | 1.477 ± 0.955 | 2.954 ± 1.909 | 0.0 ± 0.0 | 2.954 ± 1.688 | 1.477 ± 0.955 | 1.477 ± 1.122 | 2.954 ± 1.642 | 1.477 ± 1.122 | 0.0 ± 0.0 | 2.954 ± 0.87 | 4.431 ± 3.366 | 4.431 ± 3.366 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 3 proteins (678 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
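One way to read "bootstrapping at the protein level" is that whole proteins, rather than individual residues, are resampled with replacement and the frequency is recomputed for each replicate. The sketch below follows that reading. It is an assumption about the procedure, not the database's actual code; the function names, the use of the sample standard deviation as the error, and the toy sequences are all hypothetical.

```python
import random
import statistics


def dipeptide_frequency(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide over overlapping pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Draw as many proteins as in the original set, with replacement.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_frequency(resample, dipeptide))
    return statistics.stdev(estimates)


# Hypothetical toy proteome of three short proteins.
toy_proteome = ["MKTAYIAKTT", "MATTSLTS", "MSTYTA"]
print(bootstrap_error(toy_proteome, "TT"))
```

With only three proteins, as in this proteome, each replicate can differ substantially from the others, which would explain why many of the errors in the table are comparable in size to the values themselves.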

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski