Amino acid dipeptide frequency for Circovirus-like genome DCCV-13

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
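
As an illustration of the calculation described above, here is a minimal Python sketch. It assumes that dipeptides are counted as overlapping adjacent pairs, that pairs never span two proteins, and that counts are pooled over all proteins; the function name `dipeptide_frequencies` and the toy sequences are hypothetical and are not taken from Proteome-pI.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_frequencies(sequences):
    """Return {dipeptide: fraction} over all overlapping adjacent residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard letters is mapped to X (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs at positions (0,1), (1,2), ...; no pair spans two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    return {pair: n / total for pair, n in counts.items()} if total else {}

# Toy input, not the DCCV-13 proteome.
freqs = dipeptide_frequencies(["MKKLA", "AKK"])
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

With the toy input, six pairs are counted in total and KK occurs twice, so it accounts for one third of all dipeptides.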

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins randomly and with equal probability, each of the 400 dipeptides would be present at a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
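
The expected value and the over/under marking can be sketched as follows. This is only an illustration: it assumes a uniform baseline of 1/400 (or 1/441 with Xaa) and a simple greater-than or less-than comparison, which may differ from the exact rule used to colour the table. The function names are hypothetical.

```python
def expected_uniform_permille(alphabet_size=20):
    """Expected per mille frequency if all ordered pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2

def classify(observed_permille, alphabet_size=20):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    expected = expected_uniform_permille(alphabet_size)
    if observed_permille > expected:
        return "over"    # would be marked red
    if observed_permille < expected:
        return "under"   # would be marked blue
    return "expected"

print(expected_uniform_permille(20))  # 400 dipeptides: 0.25%, i.e. 2.5 per mille
print(expected_uniform_permille(21))  # 441 dipeptides, slightly lower baseline
print(classify(12.146))               # ThrGly value from the table
print(classify(1.35))                 # AlaAla value from the table
```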

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
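
A short sketch of the unit conversion, using the ThrGly value from the table as input; the helper names are hypothetical.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0

thr_gly = 12.146  # ThrGly value taken from the table, in per mille
print(f"{permille_to_fraction(thr_gly):.6f}")  # fraction of all dipeptides
print(f"{permille_to_percent(thr_gly):.4f}")   # the same value in percent
```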

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
1.35AlaAla: 1.35 ± 1.073
1.35AlaCys: 1.35 ± 1.17
4.049AlaAsp: 4.049 ± 3.22
4.049AlaGlu: 4.049 ± 1.84
0.0AlaPhe: 0.0 ± 0.0
6.748AlaGly: 6.748 ± 2.818
2.699AlaHis: 2.699 ± 1.05
1.35AlaIle: 1.35 ± 1.073
1.35AlaLys: 1.35 ± 1.08
5.398AlaLeu: 5.398 ± 1.287
0.0AlaMet: 0.0 ± 0.0
4.049AlaAsn: 4.049 ± 2.129
0.0AlaPro: 0.0 ± 0.0
0.0AlaGln: 0.0 ± 0.0
1.35AlaArg: 1.35 ± 1.073
2.699AlaSer: 2.699 ± 2.147
4.049AlaThr: 4.049 ± 1.773
6.748AlaVal: 6.748 ± 2.818
1.35AlaTrp: 1.35 ± 1.073
1.35AlaTyr: 1.35 ± 1.073
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
1.35CysAsp: 1.35 ± 1.17
1.35CysGlu: 1.35 ± 1.073
0.0CysPhe: 0.0 ± 0.0
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
1.35CysLys: 1.35 ± 1.08
4.049CysLeu: 4.049 ± 1.947
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
1.35CysPro: 1.35 ± 1.17
0.0CysGln: 0.0 ± 0.0
1.35CysArg: 1.35 ± 1.17
1.35CysSer: 1.35 ± 1.17
1.35CysThr: 1.35 ± 1.08
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.699AspAla: 2.699 ± 2.16
0.0AspCys: 0.0 ± 0.0
1.35AspAsp: 1.35 ± 1.073
1.35AspGlu: 1.35 ± 1.073
0.0AspPhe: 0.0 ± 0.0
5.398AspGly: 5.398 ± 0.932
0.0AspHis: 0.0 ± 0.0
5.398AspIle: 5.398 ± 2.121
1.35AspLys: 1.35 ± 1.08
5.398AspLeu: 5.398 ± 3.035
2.699AspMet: 2.699 ± 2.16
0.0AspAsn: 0.0 ± 0.0
1.35AspPro: 1.35 ± 1.073
4.049AspGln: 4.049 ± 0.148
4.049AspArg: 4.049 ± 0.148
2.699AspSer: 2.699 ± 1.217
4.049AspThr: 4.049 ± 1.947
1.35AspVal: 1.35 ± 1.08
4.049AspTrp: 4.049 ± 3.22
6.748AspTyr: 6.748 ± 2.805
0.0AspXaa: 0.0 ± 0.0
Glu
4.049GluAla: 4.049 ± 1.773
0.0GluCys: 0.0 ± 0.0
2.699GluAsp: 2.699 ± 2.147
6.748GluGlu: 6.748 ± 4.168
1.35GluPhe: 1.35 ± 1.073
0.0GluGly: 0.0 ± 0.0
0.0GluHis: 0.0 ± 0.0
6.748GluIle: 6.748 ± 2.805
0.0GluLys: 0.0 ± 0.0
4.049GluLeu: 4.049 ± 3.509
2.699GluMet: 2.699 ± 1.05
2.699GluAsn: 2.699 ± 1.05
2.699GluPro: 2.699 ± 1.05
2.699GluGln: 2.699 ± 1.217
6.748GluArg: 6.748 ± 5.367
5.398GluSer: 5.398 ± 1.287
2.699GluThr: 2.699 ± 2.16
1.35GluVal: 1.35 ± 1.17
1.35GluTrp: 1.35 ± 1.17
1.35GluTyr: 1.35 ± 1.17
0.0GluXaa: 0.0 ± 0.0
Phe
2.699PheAla: 2.699 ± 2.16
1.35PheCys: 1.35 ± 1.17
2.699PheAsp: 2.699 ± 2.147
4.049PheGlu: 4.049 ± 1.773
0.0PhePhe: 0.0 ± 0.0
2.699PheGly: 2.699 ± 2.339
0.0PheHis: 0.0 ± 0.0
0.0PheIle: 0.0 ± 0.0
4.049PheLys: 4.049 ± 1.852
2.699PheLeu: 2.699 ± 2.16
1.35PheMet: 1.35 ± 1.17
2.699PheAsn: 2.699 ± 1.217
1.35PhePro: 1.35 ± 1.17
2.699PheGln: 2.699 ± 2.16
1.35PheArg: 1.35 ± 1.073
0.0PheSer: 0.0 ± 0.0
1.35PheThr: 1.35 ± 1.08
4.049PheVal: 4.049 ± 0.148
1.35PheTrp: 1.35 ± 1.073
2.699PheTyr: 2.699 ± 1.06
0.0PheXaa: 0.0 ± 0.0
Gly
2.699GlyAla: 2.699 ± 2.147
1.35GlyCys: 1.35 ± 1.17
4.049GlyAsp: 4.049 ± 1.852
1.35GlyGlu: 1.35 ± 1.073
5.398GlyPhe: 5.398 ± 2.84
2.699GlyGly: 2.699 ± 1.06
0.0GlyHis: 0.0 ± 0.0
1.35GlyIle: 1.35 ± 1.073
4.049GlyLys: 4.049 ± 0.148
8.097GlyLeu: 8.097 ± 3.963
1.35GlyMet: 1.35 ± 1.17
4.049GlyAsn: 4.049 ± 1.982
2.699GlyPro: 2.699 ± 1.05
1.35GlyGln: 1.35 ± 1.073
2.699GlyArg: 2.699 ± 1.217
6.748GlySer: 6.748 ± 3.264
8.097GlyThr: 8.097 ± 1.697
2.699GlyVal: 2.699 ± 1.06
1.35GlyTrp: 1.35 ± 1.073
1.35GlyTyr: 1.35 ± 1.073
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
1.35HisCys: 1.35 ± 1.073
0.0HisAsp: 0.0 ± 0.0
1.35HisGlu: 1.35 ± 1.073
0.0HisPhe: 0.0 ± 0.0
0.0HisGly: 0.0 ± 0.0
0.0HisHis: 0.0 ± 0.0
2.699HisIle: 2.699 ± 2.147
0.0HisLys: 0.0 ± 0.0
0.0HisLeu: 0.0 ± 0.0
0.0HisMet: 0.0 ± 0.0
1.35HisAsn: 1.35 ± 1.08
2.699HisPro: 2.699 ± 1.05
0.0HisGln: 0.0 ± 0.0
1.35HisArg: 1.35 ± 1.073
4.049HisSer: 4.049 ± 1.982
1.35HisThr: 1.35 ± 1.17
0.0HisVal: 0.0 ± 0.0
1.35HisTrp: 1.35 ± 1.073
2.699HisTyr: 2.699 ± 1.06
0.0HisXaa: 0.0 ± 0.0
Ile
1.35IleAla: 1.35 ± 1.17
0.0IleCys: 0.0 ± 0.0
4.049IleAsp: 4.049 ± 1.852
1.35IleGlu: 1.35 ± 1.073
2.699IlePhe: 2.699 ± 1.217
4.049IleGly: 4.049 ± 0.148
0.0IleHis: 0.0 ± 0.0
1.35IleIle: 1.35 ± 1.073
4.049IleLys: 4.049 ± 1.852
6.748IleLeu: 6.748 ± 0.962
0.0IleMet: 0.0 ± 0.0
2.699IleAsn: 2.699 ± 2.16
6.748IlePro: 6.748 ± 2.204
2.699IleGln: 2.699 ± 2.147
8.097IleArg: 8.097 ± 0.297
1.35IleSer: 1.35 ± 1.073
4.049IleThr: 4.049 ± 0.148
4.049IleVal: 4.049 ± 3.22
1.35IleTrp: 1.35 ± 1.073
2.699IleTyr: 2.699 ± 1.05
0.0IleXaa: 0.0 ± 0.0
Lys
4.049LysAla: 4.049 ± 1.84
0.0LysCys: 0.0 ± 0.0
5.398LysAsp: 5.398 ± 2.121
1.35LysGlu: 1.35 ± 1.073
1.35LysPhe: 1.35 ± 1.073
1.35LysGly: 1.35 ± 1.073
2.699LysHis: 2.699 ± 1.06
5.398LysIle: 5.398 ± 2.434
5.398LysLys: 5.398 ± 2.95
2.699LysLeu: 2.699 ± 2.16
2.699LysMet: 2.699 ± 2.16
2.699LysAsn: 2.699 ± 1.06
6.748LysPro: 6.748 ± 4.168
4.049LysGln: 4.049 ± 0.148
4.049LysArg: 4.049 ± 3.24
5.398LysSer: 5.398 ± 0.932
2.699LysThr: 2.699 ± 1.217
1.35LysVal: 1.35 ± 1.08
1.35LysTrp: 1.35 ± 1.17
1.35LysTyr: 1.35 ± 1.08
0.0LysXaa: 0.0 ± 0.0
Leu
2.699LeuAla: 2.699 ± 2.147
0.0LeuCys: 0.0 ± 0.0
4.049LeuAsp: 4.049 ± 0.148
6.748LeuGlu: 6.748 ± 0.962
0.0LeuPhe: 0.0 ± 0.0
6.748LeuGly: 6.748 ± 5.4
1.35LeuHis: 1.35 ± 1.073
4.049LeuIle: 4.049 ± 2.129
6.748LeuLys: 6.748 ± 3.264
8.097LeuLeu: 8.097 ± 3.623
1.35LeuMet: 1.35 ± 1.17
4.049LeuAsn: 4.049 ± 2.129
6.748LeuPro: 6.748 ± 2.454
6.748LeuGln: 6.748 ± 3.264
5.398LeuArg: 5.398 ± 2.737
8.097LeuSer: 8.097 ± 2.274
4.049LeuThr: 4.049 ± 1.947
4.049LeuVal: 4.049 ± 1.84
0.0LeuTrp: 0.0 ± 0.0
4.049LeuTyr: 4.049 ± 1.982
0.0LeuXaa: 0.0 ± 0.0
Met
1.35MetAla: 1.35 ± 1.08
0.0MetCys: 0.0 ± 0.0
2.699MetAsp: 2.699 ± 2.339
1.35MetGlu: 1.35 ± 1.17
1.35MetPhe: 1.35 ± 1.17
2.699MetGly: 2.699 ± 2.147
0.0MetHis: 0.0 ± 0.0
2.699MetIle: 2.699 ± 2.16
0.0MetLys: 0.0 ± 0.0
2.699MetLeu: 2.699 ± 1.05
0.0MetMet: 0.0 ± 0.921
2.699MetAsn: 2.699 ± 1.217
2.699MetPro: 2.699 ± 2.16
0.0MetGln: 0.0 ± 0.0
0.0MetArg: 0.0 ± 0.0
1.35MetSer: 1.35 ± 1.17
1.35MetThr: 1.35 ± 1.17
0.0MetVal: 0.0 ± 0.0
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
1.35AsnAla: 1.35 ± 1.08
1.35AsnCys: 1.35 ± 1.08
2.699AsnAsp: 2.699 ± 1.217
2.699AsnGlu: 2.699 ± 1.217
5.398AsnPhe: 5.398 ± 2.434
0.0AsnGly: 0.0 ± 0.0
0.0AsnHis: 0.0 ± 0.0
4.049AsnIle: 4.049 ± 0.148
1.35AsnLys: 1.35 ± 1.08
4.049AsnLeu: 4.049 ± 1.852
0.0AsnMet: 0.0 ± 0.0
1.35AsnAsn: 1.35 ± 1.08
4.049AsnPro: 4.049 ± 1.982
2.699AsnGln: 2.699 ± 2.339
2.699AsnArg: 2.699 ± 1.217
2.699AsnSer: 2.699 ± 2.16
2.699AsnThr: 2.699 ± 1.217
5.398AsnVal: 5.398 ± 0.932
0.0AsnTrp: 0.0 ± 0.0
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
2.699ProAla: 2.699 ± 2.339
0.0ProCys: 0.0 ± 0.0
4.049ProAsp: 4.049 ± 0.148
0.0ProGlu: 0.0 ± 0.0
2.699ProPhe: 2.699 ± 1.06
1.35ProGly: 1.35 ± 1.17
4.049ProHis: 4.049 ± 1.84
4.049ProIle: 4.049 ± 1.982
4.049ProLys: 4.049 ± 3.22
4.049ProLeu: 4.049 ± 2.129
1.35ProMet: 1.35 ± 1.17
1.35ProAsn: 1.35 ± 1.17
1.35ProPro: 1.35 ± 1.17
2.699ProGln: 2.699 ± 1.217
4.049ProArg: 4.049 ± 1.947
5.398ProSer: 5.398 ± 3.035
8.097ProThr: 8.097 ± 3.546
4.049ProVal: 4.049 ± 0.148
0.0ProTrp: 0.0 ± 0.0
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
2.699GlnAla: 2.699 ± 2.147
0.0GlnCys: 0.0 ± 0.0
1.35GlnAsp: 1.35 ± 1.17
0.0GlnGlu: 0.0 ± 0.0
1.35GlnPhe: 1.35 ± 1.08
5.398GlnGly: 5.398 ± 2.434
1.35GlnHis: 1.35 ± 1.073
1.35GlnIle: 1.35 ± 1.17
2.699GlnLys: 2.699 ± 1.05
2.699GlnLeu: 2.699 ± 1.05
2.699GlnMet: 2.699 ± 1.013
0.0GlnAsn: 0.0 ± 0.0
1.35GlnPro: 1.35 ± 1.17
2.699GlnGln: 2.699 ± 1.217
2.699GlnArg: 2.699 ± 2.339
1.35GlnSer: 1.35 ± 1.08
0.0GlnThr: 0.0 ± 0.0
8.097GlnVal: 8.097 ± 2.094
0.0GlnTrp: 0.0 ± 0.0
6.748GlnTyr: 6.748 ± 3.106
0.0GlnXaa: 0.0 ± 0.0
Arg
4.049ArgAla: 4.049 ± 3.22
1.35ArgCys: 1.35 ± 1.08
0.0ArgAsp: 0.0 ± 0.0
4.049ArgGlu: 4.049 ± 3.509
2.699ArgPhe: 2.699 ± 1.06
4.049ArgGly: 4.049 ± 1.84
1.35ArgHis: 1.35 ± 1.17
4.049ArgIle: 4.049 ± 1.84
4.049ArgLys: 4.049 ± 1.982
5.398ArgLeu: 5.398 ± 2.121
1.35ArgMet: 1.35 ± 1.17
1.35ArgAsn: 1.35 ± 1.073
1.35ArgPro: 1.35 ± 1.073
1.35ArgGln: 1.35 ± 1.08
6.748ArgArg: 6.748 ± 3.76
10.796ArgSer: 10.796 ± 2.937
4.049ArgThr: 4.049 ± 1.773
2.699ArgVal: 2.699 ± 1.05
2.699ArgTrp: 2.699 ± 1.05
2.699ArgTyr: 2.699 ± 2.147
0.0ArgXaa: 0.0 ± 0.0
Ser
4.049SerAla: 4.049 ± 1.84
2.699SerCys: 2.699 ± 2.339
5.398SerAsp: 5.398 ± 1.287
2.699SerGlu: 2.699 ± 1.217
2.699SerPhe: 2.699 ± 1.05
2.699SerGly: 2.699 ± 1.06
2.699SerHis: 2.699 ± 1.05
8.097SerIle: 8.097 ± 0.297
8.097SerLys: 8.097 ± 1.679
4.049SerLeu: 4.049 ± 3.509
0.0SerMet: 0.0 ± 0.0
6.748SerAsn: 6.748 ± 2.204
2.699SerPro: 2.699 ± 2.339
1.35SerGln: 1.35 ± 1.08
5.398SerArg: 5.398 ± 1.287
4.049SerSer: 4.049 ± 1.852
4.049SerThr: 4.049 ± 1.852
4.049SerVal: 4.049 ± 2.129
0.0SerTrp: 0.0 ± 0.0
1.35SerTyr: 1.35 ± 1.073
0.0SerXaa: 0.0 ± 0.0
Thr
6.748ThrAla: 6.748 ± 1.041
1.35ThrCys: 1.35 ± 1.073
0.0ThrAsp: 0.0 ± 0.0
4.049ThrGlu: 4.049 ± 1.947
1.35ThrPhe: 1.35 ± 1.17
12.146ThrGly: 12.146 ± 0.445
0.0ThrHis: 0.0 ± 0.0
4.049ThrIle: 4.049 ± 3.22
4.049ThrLys: 4.049 ± 1.982
5.398ThrLeu: 5.398 ± 1.128
1.35ThrMet: 1.35 ± 1.17
0.0ThrAsn: 0.0 ± 0.0
4.049ThrPro: 4.049 ± 1.773
2.699ThrGln: 2.699 ± 2.339
2.699ThrArg: 2.699 ± 2.147
4.049ThrSer: 4.049 ± 1.852
5.398ThrThr: 5.398 ± 1.128
1.35ThrVal: 1.35 ± 1.073
0.0ThrTrp: 0.0 ± 0.0
4.049ThrTyr: 4.049 ± 0.148
0.0ThrXaa: 0.0 ± 0.0
Val
4.049ValAla: 4.049 ± 1.84
0.0ValCys: 0.0 ± 0.0
4.049ValAsp: 4.049 ± 1.852
4.049ValGlu: 4.049 ± 0.148
6.748ValPhe: 6.748 ± 1.041
2.699ValGly: 2.699 ± 1.217
2.699ValHis: 2.699 ± 1.06
0.0ValIle: 0.0 ± 0.0
8.097ValLys: 8.097 ± 1.82
6.748ValLeu: 6.748 ± 0.962
1.35ValMet: 1.35 ± 1.08
4.049ValAsn: 4.049 ± 1.982
0.0ValPro: 0.0 ± 0.0
2.699ValGln: 2.699 ± 2.339
2.699ValArg: 2.699 ± 2.147
1.35ValSer: 1.35 ± 1.073
1.35ValThr: 1.35 ± 1.073
2.699ValVal: 2.699 ± 1.06
5.398ValTrp: 5.398 ± 0.932
0.0ValTyr: 0.0 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
2.699TrpAsp: 2.699 ± 1.06
5.398TrpGlu: 5.398 ± 2.737
1.35TrpPhe: 1.35 ± 1.073
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
2.699TrpIle: 2.699 ± 1.06
1.35TrpLys: 1.35 ± 1.073
0.0TrpLeu: 0.0 ± 0.0
2.699TrpMet: 2.699 ± 1.068
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
1.35TrpGln: 1.35 ± 1.073
0.0TrpArg: 0.0 ± 0.0
0.0TrpSer: 0.0 ± 0.0
4.049TrpThr: 4.049 ± 1.773
1.35TrpVal: 1.35 ± 1.17
0.0TrpTrp: 0.0 ± 0.0
1.35TrpTyr: 1.35 ± 1.073
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.35TyrAla: 1.35 ± 1.17
1.35TyrCys: 1.35 ± 1.17
1.35TyrAsp: 1.35 ± 1.08
1.35TyrGlu: 1.35 ± 1.17
2.699TyrPhe: 2.699 ± 2.16
2.699TyrGly: 2.699 ± 1.05
1.35TyrHis: 1.35 ± 1.08
0.0TyrIle: 0.0 ± 0.0
0.0TyrLys: 0.0 ± 0.0
2.699TyrLeu: 2.699 ± 1.05
0.0TyrMet: 0.0 ± 0.0
2.699TyrAsn: 2.699 ± 2.16
5.398TyrPro: 5.398 ± 4.294
2.699TyrGln: 2.699 ± 1.06
2.699TyrArg: 2.699 ± 1.06
4.049TyrSer: 4.049 ± 1.773
0.0TyrThr: 0.0 ± 0.0
5.398TyrVal: 5.398 ± 2.121
2.699TyrTrp: 2.699 ± 1.06
1.35TyrTyr: 1.35 ± 1.08
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (742 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
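
A protein-level bootstrap of this kind can be sketched as below. This is an assumption-laden illustration rather than the Proteome-pI implementation: it resamples whole proteins with replacement 100 times, recomputes the per mille frequency of one dipeptide each time, and reports the standard deviation of those estimates as the error. The function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
from statistics import pstdev

def pair_frequency(proteins, pair):
    """Per mille frequency of one dipeptide over overlapping pairs in all proteins."""
    count = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                count += 1
    return 1000.0 * count / total if total else 0.0

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the frequency when whole proteins are resampled with replacement."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample at the protein level: draw as many proteins as in the original set.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    # Assumption: the reported error is the standard deviation of the estimates.
    return pstdev(estimates)

# Toy proteome, not DCCV-13.
toy = ["MKKLA", "AKKAK", "GGGG"]
print(pair_frequency(toy, "KK"), bootstrap_error(toy, "KK"))
```

With only a few proteins, as here, each resample can differ strongly from the original set, so errors that are large relative to the values are to be expected.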

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski