Amino acid dipeptide frequency for Circovirus-like genome DHCV-3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 standard dipeptides were equally likely to occur in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
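The calculation described above can be sketched in Python. This is a minimal illustration, not the database's own code: the function names `dipeptide_per_mille` and `representation` are hypothetical, the input sequences are toy examples, and counting overlapping pairs within each protein (never across protein boundaries) is an assumption that may differ from how the table was produced. Using the uniform expectation as the cut-off between over- and underrepresented values is likewise an assumption based on the paragraph above.

```python
from collections import Counter
from itertools import product

# Three-letter codes in the order used by the table; Xaa stands for any
# non-standard or unknown residue.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}
CODES = list(ONE_TO_THREE.values()) + ["Xaa"]


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Assumption: dipeptides are overlapping adjacent pairs counted
    within each protein, never across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        residues = [ONE_TO_THREE.get(aa, "Xaa") for aa in seq.upper()]
        counts.update(a + b for a, b in zip(residues, residues[1:]))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(CODES, repeat=2)
    }


# Uniform expectation over the 400 standard pairs: 0.25 % = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def representation(freq_per_mille):
    """Label a value relative to the uniform expectation (assumed cut-off)."""
    if freq_per_mille > EXPECTED_PER_MILLE:
        return "over"
    if freq_per_mille < EXPECTED_PER_MILLE:
        return "under"
    return "expected"


# Toy sequences, not the DHCV-3 proteins.
freqs = dipeptide_per_mille(["MKTAYIAK", "MKXLL"])
```

With these toy sequences, `freqs["MetLys"]` is 2 of 11 counted pairs, and `representation(freqs["MetLys"])` returns `"over"`.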

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
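As a worked conversion, using one value from the table (1.511‰) and the 0.25% uniform expectation stated above:

```latex
\begin{align*}
1.511\,\text{\textperthousand} &= 1.511 \times 10^{-3} = 0.001511 = 0.1511\,\% \\
0.25\,\% &= 0.0025 = 2.5\,\text{\textperthousand}
\end{align*}
```

On the table's scale, the uniform expectation therefore corresponds to 2.5‰.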

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.0 ± 0.0
AlaCys: 0.0 ± 0.0
AlaAsp: 1.511 ± 1.072
AlaGlu: 6.042 ± 1.148
AlaPhe: 0.0 ± 0.0
AlaGly: 1.511 ± 1.072
AlaHis: 3.021 ± 2.144
AlaIle: 3.021 ± 2.16
AlaLys: 7.553 ± 2.283
AlaLeu: 6.042 ± 1.493
AlaMet: 0.0 ± 0.0
AlaAsn: 1.511 ± 1.08
AlaPro: 4.532 ± 4.018
AlaGln: 1.511 ± 1.072
AlaArg: 1.511 ± 1.08
AlaSer: 7.553 ± 3.636
AlaThr: 4.532 ± 1.571
AlaVal: 3.021 ± 0.818
AlaTrp: 0.0 ± 0.0
AlaTyr: 3.021 ± 2.16
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 1.511 ± 1.072
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 1.511 ± 1.08
CysLys: 1.511 ± 1.072
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 1.511 ± 2.093
CysPro: 1.511 ± 1.072
CysGln: 3.021 ± 2.144
CysArg: 0.0 ± 0.0
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 1.511 ± 1.08
CysTyr: 1.511 ± 1.072
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.532 ± 1.571
AspCys: 0.0 ± 0.0
AspAsp: 4.532 ± 1.571
AspGlu: 1.511 ± 1.072
AspPhe: 3.021 ± 0.818
AspGly: 1.511 ± 1.072
AspHis: 0.0 ± 0.0
AspIle: 1.511 ± 1.08
AspLys: 1.511 ± 1.08
AspLeu: 6.042 ± 1.493
AspMet: 0.0 ± 0.0
AspAsn: 1.511 ± 1.08
AspPro: 1.511 ± 1.072
AspGln: 0.0 ± 0.0
AspArg: 1.511 ± 1.072
AspSer: 1.511 ± 1.08
AspThr: 4.532 ± 1.284
AspVal: 6.042 ± 2.563
AspTrp: 1.511 ± 1.072
AspTyr: 3.021 ± 0.818
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.511 ± 1.08
GluCys: 1.511 ± 1.072
GluAsp: 1.511 ± 1.072
GluGlu: 4.532 ± 3.216
GluPhe: 4.532 ± 1.571
GluGly: 3.021 ± 0.818
GluHis: 1.511 ± 1.072
GluIle: 4.532 ± 3.216
GluLys: 1.511 ± 1.072
GluLeu: 7.553 ± 2.264
GluMet: 1.511 ± 1.072
GluAsn: 3.021 ± 2.144
GluPro: 0.0 ± 0.0
GluGln: 1.511 ± 1.072
GluArg: 1.511 ± 1.072
GluSer: 1.511 ± 1.08
GluThr: 3.021 ± 0.818
GluVal: 6.042 ± 1.148
GluTrp: 1.511 ± 1.072
GluTyr: 0.0 ± 0.0
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.0 ± 0.0
PheCys: 0.0 ± 0.0
PheAsp: 0.0 ± 0.0
PheGlu: 3.021 ± 0.818
PhePhe: 0.0 ± 0.0
PheGly: 1.511 ± 1.08
PheHis: 1.511 ± 2.093
PheIle: 1.511 ± 1.08
PheLys: 0.0 ± 0.0
PheLeu: 6.042 ± 5.846
PheMet: 0.0 ± 0.0
PheAsn: 0.0 ± 0.0
PhePro: 3.021 ± 1.843
PheGln: 0.0 ± 0.0
PheArg: 0.0 ± 0.0
PheSer: 6.042 ± 3.375
PheThr: 3.021 ± 0.818
PheVal: 0.0 ± 0.0
PheTrp: 1.511 ± 1.072
PheTyr: 1.511 ± 1.072
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.532 ± 1.587
GlyCys: 0.0 ± 0.0
GlyAsp: 0.0 ± 0.0
GlyGlu: 1.511 ± 1.072
GlyPhe: 1.511 ± 2.093
GlyGly: 4.532 ± 1.284
GlyHis: 0.0 ± 0.0
GlyIle: 3.021 ± 1.843
GlyLys: 7.553 ± 3.601
GlyLeu: 3.021 ± 0.818
GlyMet: 3.021 ± 0.818
GlyAsn: 1.511 ± 1.08
GlyPro: 1.511 ± 1.072
GlyGln: 1.511 ± 1.08
GlyArg: 3.021 ± 0.818
GlySer: 4.532 ± 1.587
GlyThr: 4.532 ± 3.216
GlyVal: 6.042 ± 3.326
GlyTrp: 0.0 ± 0.0
GlyTyr: 3.021 ± 2.144
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.511 ± 2.093
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 1.511 ± 1.072
HisPhe: 1.511 ± 2.093
HisGly: 1.511 ± 1.072
HisHis: 1.511 ± 2.093
HisIle: 1.511 ± 1.072
HisLys: 1.511 ± 1.08
HisLeu: 6.042 ± 5.846
HisMet: 1.511 ± 1.072
HisAsn: 1.511 ± 2.093
HisPro: 0.0 ± 0.0
HisGln: 1.511 ± 1.08
HisArg: 1.511 ± 2.093
HisSer: 1.511 ± 2.093
HisThr: 3.021 ± 2.068
HisVal: 1.511 ± 1.072
HisTrp: 0.0 ± 0.0
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.511 ± 1.08
IleCys: 1.511 ± 1.08
IleAsp: 6.042 ± 1.635
IleGlu: 4.532 ± 1.571
IlePhe: 0.0 ± 0.0
IleGly: 6.042 ± 1.148
IleHis: 3.021 ± 4.186
IleIle: 4.532 ± 2.551
IleLys: 1.511 ± 1.08
IleLeu: 3.021 ± 2.144
IleMet: 1.511 ± 1.072
IleAsn: 1.511 ± 1.08
IlePro: 4.532 ± 6.279
IleGln: 0.0 ± 0.0
IleArg: 7.553 ± 2.283
IleSer: 3.021 ± 2.068
IleThr: 0.0 ± 0.0
IleVal: 1.511 ± 1.08
IleTrp: 1.511 ± 1.072
IleTyr: 3.021 ± 2.144
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.532 ± 1.571
LysCys: 1.511 ± 1.08
LysAsp: 3.021 ± 0.818
LysGlu: 6.042 ± 2.563
LysPhe: 0.0 ± 0.0
LysGly: 6.042 ± 2.563
LysHis: 3.021 ± 2.144
LysIle: 0.0 ± 0.0
LysLys: 6.042 ± 1.635
LysLeu: 1.511 ± 1.072
LysMet: 4.532 ± 1.587
LysAsn: 6.042 ± 3.375
LysPro: 1.511 ± 1.072
LysGln: 1.511 ± 1.08
LysArg: 6.042 ± 2.589
LysSer: 7.553 ± 2.264
LysThr: 9.063 ± 3.175
LysVal: 6.042 ± 1.635
LysTrp: 1.511 ± 1.072
LysTyr: 1.511 ± 1.072
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.532 ± 1.571
LeuCys: 0.0 ± 0.0
LeuAsp: 3.021 ± 0.818
LeuGlu: 3.021 ± 2.144
LeuPhe: 4.532 ± 6.279
LeuGly: 3.021 ± 2.068
LeuHis: 1.511 ± 2.093
LeuIle: 4.532 ± 2.171
LeuLys: 10.574 ± 3.04
LeuLeu: 6.042 ± 1.148
LeuMet: 1.511 ± 1.08
LeuAsn: 1.511 ± 1.072
LeuPro: 6.042 ± 1.148
LeuGln: 4.532 ± 1.284
LeuArg: 0.0 ± 0.0
LeuSer: 6.042 ± 3.375
LeuThr: 7.553 ± 3.636
LeuVal: 4.532 ± 4.018
LeuTrp: 1.511 ± 1.072
LeuTyr: 6.042 ± 3.375
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.021 ± 2.068
MetCys: 1.511 ± 1.072
MetAsp: 1.511 ± 1.08
MetGlu: 0.0 ± 0.0
MetPhe: 1.511 ± 1.08
MetGly: 1.511 ± 1.072
MetHis: 1.511 ± 2.093
MetIle: 0.0 ± 0.0
MetLys: 3.021 ± 2.144
MetLeu: 6.042 ± 1.148
MetMet: 0.0 ± 0.0
MetAsn: 1.511 ± 1.072
MetPro: 0.0 ± 0.0
MetGln: 0.0 ± 0.0
MetArg: 3.021 ± 0.818
MetSer: 0.0 ± 0.0
MetThr: 0.0 ± 0.0
MetVal: 1.511 ± 1.08
MetTrp: 0.0 ± 0.0
MetTyr: 3.021 ± 0.818
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 7.553 ± 0.504
AsnCys: 0.0 ± 0.0
AsnAsp: 1.511 ± 1.072
AsnGlu: 3.021 ± 0.818
AsnPhe: 1.511 ± 2.093
AsnGly: 0.0 ± 0.0
AsnHis: 1.511 ± 2.093
AsnIle: 4.532 ± 1.284
AsnLys: 0.0 ± 0.0
AsnLeu: 1.511 ± 2.093
AsnMet: 3.021 ± 1.899
AsnAsn: 10.574 ± 2.263
AsnPro: 0.0 ± 0.0
AsnGln: 3.021 ± 0.818
AsnArg: 4.532 ± 3.239
AsnSer: 3.021 ± 1.843
AsnThr: 4.532 ± 2.551
AsnVal: 6.042 ± 1.635
AsnTrp: 1.511 ± 1.072
AsnTyr: 9.063 ± 2.568
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.511 ± 1.08
ProCys: 1.511 ± 2.093
ProAsp: 1.511 ± 2.093
ProGlu: 1.511 ± 1.072
ProPhe: 0.0 ± 0.0
ProGly: 1.511 ± 2.093
ProHis: 0.0 ± 0.0
ProIle: 1.511 ± 2.093
ProLys: 4.532 ± 2.171
ProLeu: 4.532 ± 6.279
ProMet: 0.0 ± 0.0
ProAsn: 3.021 ± 2.144
ProPro: 0.0 ± 0.0
ProGln: 6.042 ± 2.563
ProArg: 1.511 ± 1.072
ProSer: 1.511 ± 1.072
ProThr: 0.0 ± 0.0
ProVal: 1.511 ± 1.08
ProTrp: 0.0 ± 0.0
ProTyr: 1.511 ± 2.093
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.511 ± 1.072
GlnCys: 1.511 ± 1.072
GlnAsp: 3.021 ± 0.818
GlnGlu: 3.021 ± 0.818
GlnPhe: 1.511 ± 1.08
GlnGly: 1.511 ± 1.072
GlnHis: 1.511 ± 2.093
GlnIle: 1.511 ± 2.093
GlnLys: 0.0 ± 0.0
GlnLeu: 0.0 ± 0.0
GlnMet: 0.0 ± 0.0
GlnAsn: 4.532 ± 2.551
GlnPro: 3.021 ± 2.144
GlnGln: 0.0 ± 0.0
GlnArg: 0.0 ± 0.0
GlnSer: 4.532 ± 1.284
GlnThr: 1.511 ± 1.072
GlnVal: 4.532 ± 3.239
GlnTrp: 1.511 ± 1.072
GlnTyr: 3.021 ± 0.818
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.532 ± 1.587
ArgCys: 0.0 ± 0.0
ArgAsp: 4.532 ± 1.571
ArgGlu: 4.532 ± 1.571
ArgPhe: 1.511 ± 1.072
ArgGly: 1.511 ± 1.08
ArgHis: 1.511 ± 1.072
ArgIle: 6.042 ± 1.635
ArgLys: 4.532 ± 1.571
ArgLeu: 6.042 ± 4.319
ArgMet: 0.0 ± 0.0
ArgAsn: 4.532 ± 1.571
ArgPro: 0.0 ± 0.0
ArgGln: 1.511 ± 1.08
ArgArg: 6.042 ± 1.635
ArgSer: 4.532 ± 1.571
ArgThr: 0.0 ± 0.0
ArgVal: 3.021 ± 0.818
ArgTrp: 0.0 ± 0.0
ArgTyr: 4.532 ± 1.284
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.021 ± 0.818
SerCys: 0.0 ± 0.0
SerAsp: 1.511 ± 1.08
SerGlu: 1.511 ± 1.072
SerPhe: 3.021 ± 1.843
SerGly: 7.553 ± 2.264
SerHis: 3.021 ± 2.068
SerIle: 3.021 ± 2.068
SerLys: 4.532 ± 1.284
SerLeu: 4.532 ± 1.571
SerMet: 6.042 ± 3.687
SerAsn: 6.042 ± 1.635
SerPro: 0.0 ± 0.0
SerGln: 3.021 ± 4.186
SerArg: 1.511 ± 1.072
SerSer: 6.042 ± 3.326
SerThr: 7.553 ± 5.399
SerVal: 6.042 ± 2.589
SerTrp: 1.511 ± 1.08
SerTyr: 3.021 ± 1.843
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.532 ± 3.239
ThrCys: 0.0 ± 0.0
ThrAsp: 3.021 ± 2.068
ThrGlu: 1.511 ± 1.08
ThrPhe: 1.511 ± 1.08
ThrGly: 6.042 ± 2.589
ThrHis: 1.511 ± 1.08
ThrIle: 3.021 ± 1.843
ThrLys: 4.532 ± 1.571
ThrLeu: 6.042 ± 3.326
ThrMet: 0.0 ± 0.877
ThrAsn: 1.511 ± 1.08
ThrPro: 1.511 ± 1.072
ThrGln: 3.021 ± 2.16
ThrArg: 6.042 ± 2.563
ThrSer: 6.042 ± 2.563
ThrThr: 6.042 ± 4.319
ThrVal: 6.042 ± 1.493
ThrTrp: 0.0 ± 0.0
ThrTyr: 3.021 ± 0.818
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.553 ± 0.504
ValCys: 1.511 ± 1.072
ValAsp: 3.021 ± 2.16
ValGlu: 0.0 ± 0.0
ValPhe: 0.0 ± 0.0
ValGly: 3.021 ± 2.16
ValHis: 1.511 ± 1.08
ValIle: 6.042 ± 2.563
ValLys: 6.042 ± 2.589
ValLeu: 1.511 ± 1.072
ValMet: 3.021 ± 0.818
ValAsn: 9.063 ± 3.514
ValPro: 4.532 ± 4.018
ValGln: 3.021 ± 0.818
ValArg: 7.553 ± 2.283
ValSer: 4.532 ± 2.551
ValThr: 3.021 ± 1.843
ValVal: 4.532 ± 1.587
ValTrp: 0.0 ± 0.0
ValTyr: 4.532 ± 1.571
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 1.511 ± 1.072
TrpAsp: 1.511 ± 1.072
TrpGlu: 1.511 ± 1.072
TrpPhe: 0.0 ± 0.0
TrpGly: 3.021 ± 2.144
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 3.021 ± 2.144
TrpLeu: 3.021 ± 0.818
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 0.0 ± 0.0
TrpSer: 0.0 ± 0.0
TrpThr: 0.0 ± 0.0
TrpVal: 0.0 ± 0.0
TrpTrp: 1.511 ± 1.072
TrpTyr: 1.511 ± 1.08
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.0 ± 0.0
TyrCys: 1.511 ± 1.072
TyrAsp: 4.532 ± 3.216
TyrGlu: 3.021 ± 2.144
TyrPhe: 1.511 ± 1.072
TyrGly: 0.0 ± 0.0
TyrHis: 1.511 ± 2.093
TyrIle: 4.532 ± 3.239
TyrLys: 7.553 ± 2.283
TyrLeu: 1.511 ± 1.072
TyrMet: 1.511 ± 1.553
TyrAsn: 6.042 ± 4.136
TyrPro: 0.0 ± 0.0
TyrGln: 3.021 ± 1.843
TyrArg: 6.042 ± 2.563
TyrSer: 3.021 ± 0.818
TyrThr: 4.532 ± 1.587
TyrVal: 6.042 ± 3.687
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (663 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
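A protein-level bootstrap of this kind can be sketched as follows. This is an illustration under stated assumptions, not the procedure actually used for the table: it assumes each of the 100 rounds draws as many whole proteins as the proteome contains, with replacement, and that the reported error is the standard deviation across rounds. The function names `pair_per_mille` and `bootstrap_error` and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide, e.g. "LK", in a protein set."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over resampled protein sets.

    Assumption: each round draws len(proteins) whole proteins with
    replacement, and the spread is summarised as a standard deviation.
    """
    rng = random.Random(seed)
    replicates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(pair_per_mille(sample, pair))
    return statistics.stdev(replicates)


# Toy proteome of three made-up sequences (not the DHCV-3 proteins).
toy = ["MKLKAALK", "MSTNNYVR", "MLKPQ"]
err = bootstrap_error(toy, "LK")
```

Resampling whole proteins rather than individual residues keeps each protein's internal dipeptide composition intact. With only a few proteins in the proteome, the resulting errors can be large relative to the frequencies themselves.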

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski