Amino acid dipeptide frequency for Water beetle associated circular virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
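
As a rough illustration of the idea, the following sketch counts adjacent residue pairs and reports them in per mille. It is only an assumption about how such a count can be done, not the database's actual pipeline: the function name `dipeptide_frequencies` is hypothetical, the sequences are toy examples in one-letter code (the table on this page uses three-letter codes), and pairs are counted with an overlapping window that never spans two proteins.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Hypothetical helper: counts overlapping adjacent pairs within each
    protein and normalizes by the total number of pairs counted.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2 along the sequence.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy sequences, not the real viral proteins.
freqs = dipeptide_frequencies(["MKKA", "AKK"])
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

The order of the pair matters here, so "KA" and "AK" are counted separately, matching the 20 × 20 layout of the table.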

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
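
The arithmetic behind the 0.25% expectation and the per mille unit can be sketched as follows. The helper names (`expected_permille`, `permille_to_fraction`, `classify`) are hypothetical, and the page does not state the exact rule used for the red and blue marking, so the plain comparison against the uniform expectation below is an assumption.

```python
def expected_permille(alphabet_size=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def permille_to_fraction(value_permille):
    """Convert a table value in per mille to a plain fraction."""
    return value_permille * 1e-3


def classify(value_permille, alphabet_size=20):
    """Assumed rule: compare a value with the uniform expectation."""
    expected = expected_permille(alphabet_size)
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "expected"


print(expected_permille())                 # 2.5 per mille, i.e. 0.25%
print(round(expected_permille(21), 3))     # expectation with Xaa included
print(round(permille_to_fraction(2.946), 6))
print(classify(2.946), classify(1.473))
```

Under this assumed rule a table value of 2.946‰ lies above the 2.5‰ expectation, while 1.473‰ lies below it.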

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.946 ± 0.083
AlaCys: 0.0 ± 0.0
AlaAsp: 2.946 ± 0.083
AlaGlu: 2.946 ± 0.083
AlaPhe: 2.946 ± 2.085
AlaGly: 4.418 ± 0.959
AlaHis: 2.946 ± 2.085
AlaIle: 7.364 ± 0.876
AlaLys: 2.946 ± 2.085
AlaLeu: 4.418 ± 3.128
AlaMet: 4.418 ± 1.209
AlaAsn: 1.473 ± 1.126
AlaPro: 1.473 ± 1.126
AlaGln: 4.418 ± 0.959
AlaArg: 2.946 ± 0.083
AlaSer: 1.473 ± 1.126
AlaThr: 8.837 ± 1.919
AlaVal: 4.418 ± 0.959
AlaTrp: 1.473 ± 1.043
AlaTyr: 2.946 ± 0.083
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.473 ± 1.126
CysCys: 0.0 ± 0.0
CysAsp: 4.418 ± 1.209
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 2.946 ± 0.083
CysHis: 0.0 ± 0.0
CysIle: 1.473 ± 1.126
CysLys: 1.473 ± 1.126
CysLeu: 1.473 ± 1.126
CysMet: 0.0 ± 0.0
CysAsn: 1.473 ± 1.126
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 0.0 ± 0.0
CysThr: 1.473 ± 1.043
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.473 ± 1.043
AspCys: 0.0 ± 0.0
AspAsp: 1.473 ± 1.126
AspGlu: 5.891 ± 2.334
AspPhe: 0.0 ± 0.0
AspGly: 5.891 ± 0.166
AspHis: 1.473 ± 1.126
AspIle: 7.364 ± 1.292
AspLys: 2.946 ± 0.083
AspLeu: 2.946 ± 2.251
AspMet: 0.0 ± 0.0
AspAsn: 4.418 ± 0.959
AspPro: 2.946 ± 0.083
AspGln: 4.418 ± 0.959
AspArg: 1.473 ± 1.043
AspSer: 4.418 ± 1.209
AspThr: 2.946 ± 0.083
AspVal: 1.473 ± 1.043
AspTrp: 2.946 ± 2.251
AspTyr: 1.473 ± 1.043
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.418 ± 0.959
GluCys: 0.0 ± 0.0
GluAsp: 4.418 ± 1.209
GluGlu: 8.837 ± 0.249
GluPhe: 0.0 ± 0.0
GluGly: 2.946 ± 2.251
GluHis: 1.473 ± 1.043
GluIle: 7.364 ± 3.46
GluLys: 2.946 ± 2.251
GluLeu: 2.946 ± 2.085
GluMet: 1.473 ± 1.126
GluAsn: 1.473 ± 1.126
GluPro: 4.418 ± 1.209
GluGln: 2.946 ± 0.083
GluArg: 7.364 ± 3.46
GluSer: 2.946 ± 2.085
GluThr: 4.418 ± 0.959
GluVal: 4.418 ± 0.959
GluTrp: 1.473 ± 1.126
GluTyr: 1.473 ± 1.126
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.418 ± 0.959
PheCys: 0.0 ± 0.0
PheAsp: 2.946 ± 2.251
PheGlu: 1.473 ± 1.043
PhePhe: 1.473 ± 1.126
PheGly: 2.946 ± 2.085
PheHis: 2.946 ± 0.083
PheIle: 4.418 ± 3.128
PheLys: 4.418 ± 0.959
PheLeu: 1.473 ± 1.043
PheMet: 1.473 ± 0.753
PheAsn: 1.473 ± 1.126
PhePro: 1.473 ± 1.043
PheGln: 1.473 ± 1.043
PheArg: 0.0 ± 0.0
PheSer: 2.946 ± 2.085
PheThr: 2.946 ± 2.251
PheVal: 1.473 ± 1.126
PheTrp: 0.0 ± 0.0
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.891 ± 4.17
GlyCys: 1.473 ± 1.126
GlyAsp: 0.0 ± 0.0
GlyGlu: 5.891 ± 2.002
GlyPhe: 1.473 ± 1.043
GlyGly: 4.418 ± 0.959
GlyHis: 0.0 ± 0.0
GlyIle: 4.418 ± 0.959
GlyLys: 4.418 ± 3.377
GlyLeu: 7.364 ± 1.292
GlyMet: 1.473 ± 1.043
GlyAsn: 5.891 ± 2.002
GlyPro: 1.473 ± 1.043
GlyGln: 4.418 ± 0.959
GlyArg: 5.891 ± 2.334
GlySer: 4.418 ± 3.128
GlyThr: 7.364 ± 3.044
GlyVal: 2.946 ± 2.085
GlyTrp: 2.946 ± 0.083
GlyTyr: 1.473 ± 1.043
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.473 ± 1.043
HisCys: 1.473 ± 1.043
HisAsp: 0.0 ± 0.0
HisGlu: 1.473 ± 1.126
HisPhe: 1.473 ± 1.126
HisGly: 0.0 ± 0.0
HisHis: 4.418 ± 1.209
HisIle: 1.473 ± 1.043
HisLys: 0.0 ± 0.0
HisLeu: 4.418 ± 1.209
HisMet: 4.418 ± 1.209
HisAsn: 1.473 ± 1.126
HisPro: 2.946 ± 0.083
HisGln: 0.0 ± 0.0
HisArg: 0.0 ± 0.0
HisSer: 1.473 ± 1.126
HisThr: 0.0 ± 0.0
HisVal: 0.0 ± 0.0
HisTrp: 7.364 ± 1.292
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.946 ± 0.083
IleCys: 2.946 ± 2.251
IleAsp: 2.946 ± 2.251
IleGlu: 7.364 ± 1.292
IlePhe: 2.946 ± 2.251
IleGly: 5.891 ± 4.17
IleHis: 2.946 ± 0.083
IleIle: 1.473 ± 1.043
IleLys: 5.891 ± 2.334
IleLeu: 7.364 ± 3.46
IleMet: 0.0 ± 0.722
IleAsn: 4.418 ± 1.209
IlePro: 2.946 ± 0.083
IleGln: 2.946 ± 0.083
IleArg: 4.418 ± 1.209
IleSer: 0.0 ± 0.0
IleThr: 4.418 ± 0.959
IleVal: 2.946 ± 0.083
IleTrp: 0.0 ± 0.0
IleTyr: 1.473 ± 1.043
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.418 ± 1.209
LysCys: 1.473 ± 1.126
LysAsp: 1.473 ± 1.043
LysGlu: 2.946 ± 2.251
LysPhe: 2.946 ± 2.085
LysGly: 5.891 ± 2.002
LysHis: 7.364 ± 3.46
LysIle: 2.946 ± 2.251
LysLys: 2.946 ± 2.251
LysLeu: 4.418 ± 0.959
LysMet: 1.473 ± 1.126
LysAsn: 1.473 ± 1.043
LysPro: 0.0 ± 0.0
LysGln: 1.473 ± 1.126
LysArg: 8.837 ± 0.249
LysSer: 5.891 ± 2.334
LysThr: 2.946 ± 2.251
LysVal: 2.946 ± 2.085
LysTrp: 0.0 ± 0.0
LysTyr: 1.473 ± 1.126
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.891 ± 2.002
LeuCys: 1.473 ± 1.126
LeuAsp: 1.473 ± 1.126
LeuGlu: 4.418 ± 3.377
LeuPhe: 5.891 ± 2.002
LeuGly: 5.891 ± 2.002
LeuHis: 2.946 ± 0.083
LeuIle: 4.418 ± 1.209
LeuLys: 5.891 ± 2.334
LeuLeu: 5.891 ± 4.502
LeuMet: 0.0 ± 0.0
LeuAsn: 1.473 ± 1.043
LeuPro: 4.418 ± 3.128
LeuGln: 1.473 ± 1.043
LeuArg: 4.418 ± 0.959
LeuSer: 11.782 ± 1.836
LeuThr: 0.0 ± 0.0
LeuVal: 1.473 ± 1.126
LeuTrp: 0.0 ± 0.0
LeuTyr: 4.418 ± 1.209
LeuXaa: 0.0 ± 0.0
Met
MetAla: 4.418 ± 1.209
MetCys: 0.0 ± 0.0
MetAsp: 4.418 ± 3.377
MetGlu: 2.946 ± 0.083
MetPhe: 1.473 ± 1.043
MetGly: 1.473 ± 1.043
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 1.473 ± 1.126
MetLeu: 1.473 ± 1.043
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 1.473 ± 1.126
MetGln: 0.0 ± 0.0
MetArg: 1.473 ± 1.043
MetSer: 0.0 ± 0.0
MetThr: 0.0 ± 0.0
MetVal: 0.0 ± 0.0
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.418 ± 3.128
AsnCys: 1.473 ± 1.043
AsnAsp: 2.946 ± 2.251
AsnGlu: 4.418 ± 1.209
AsnPhe: 2.946 ± 2.085
AsnGly: 4.418 ± 3.128
AsnHis: 1.473 ± 1.043
AsnIle: 2.946 ± 0.083
AsnLys: 0.0 ± 0.0
AsnLeu: 2.946 ± 0.083
AsnMet: 0.0 ± 0.0
AsnAsn: 7.364 ± 0.876
AsnPro: 2.946 ± 2.085
AsnGln: 1.473 ± 1.126
AsnArg: 1.473 ± 1.126
AsnSer: 2.946 ± 2.251
AsnThr: 1.473 ± 1.126
AsnVal: 2.946 ± 2.251
AsnTrp: 0.0 ± 0.0
AsnTyr: 1.473 ± 1.043
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.946 ± 2.085
ProCys: 0.0 ± 0.0
ProAsp: 2.946 ± 2.251
ProGlu: 2.946 ± 0.083
ProPhe: 2.946 ± 0.083
ProGly: 1.473 ± 1.043
ProHis: 0.0 ± 0.0
ProIle: 0.0 ± 0.0
ProLys: 7.364 ± 1.292
ProLeu: 4.418 ± 0.959
ProMet: 0.0 ± 0.0
ProAsn: 2.946 ± 0.083
ProPro: 1.473 ± 1.043
ProGln: 4.418 ± 3.128
ProArg: 2.946 ± 2.251
ProSer: 0.0 ± 0.0
ProThr: 5.891 ± 2.002
ProVal: 2.946 ± 2.085
ProTrp: 1.473 ± 1.043
ProTyr: 0.0 ± 0.0
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 7.364 ± 0.876
GlnCys: 0.0 ± 0.0
GlnAsp: 1.473 ± 1.043
GlnGlu: 2.946 ± 2.085
GlnPhe: 1.473 ± 1.126
GlnGly: 4.418 ± 3.377
GlnHis: 0.0 ± 0.0
GlnIle: 2.946 ± 2.085
GlnLys: 0.0 ± 0.0
GlnLeu: 2.946 ± 2.085
GlnMet: 1.473 ± 1.043
GlnAsn: 1.473 ± 1.043
GlnPro: 0.0 ± 0.0
GlnGln: 0.0 ± 0.0
GlnArg: 1.473 ± 1.126
GlnSer: 4.418 ± 0.959
GlnThr: 0.0 ± 0.0
GlnVal: 4.418 ± 1.209
GlnTrp: 1.473 ± 1.043
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.946 ± 0.083
ArgCys: 1.473 ± 1.126
ArgAsp: 1.473 ± 1.043
ArgGlu: 0.0 ± 0.0
ArgPhe: 0.0 ± 0.0
ArgGly: 4.418 ± 1.209
ArgHis: 2.946 ± 2.251
ArgIle: 5.891 ± 4.502
ArgLys: 8.837 ± 4.087
ArgLeu: 4.418 ± 3.377
ArgMet: 1.473 ± 1.043
ArgAsn: 1.473 ± 1.043
ArgPro: 1.473 ± 1.126
ArgGln: 1.473 ± 1.043
ArgArg: 4.418 ± 1.209
ArgSer: 4.418 ± 3.377
ArgThr: 4.418 ± 0.959
ArgVal: 4.418 ± 3.128
ArgTrp: 1.473 ± 1.043
ArgTyr: 0.0 ± 0.0
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.418 ± 3.128
SerCys: 0.0 ± 0.0
SerAsp: 1.473 ± 1.043
SerGlu: 10.309 ± 1.375
SerPhe: 1.473 ± 1.126
SerGly: 5.891 ± 2.002
SerHis: 0.0 ± 0.0
SerIle: 2.946 ± 0.083
SerLys: 0.0 ± 0.0
SerLeu: 2.946 ± 0.083
SerMet: 0.0 ± 0.0
SerAsn: 5.891 ± 0.166
SerPro: 2.946 ± 0.083
SerGln: 0.0 ± 0.0
SerArg: 4.418 ± 0.959
SerSer: 1.473 ± 1.043
SerThr: 8.837 ± 1.919
SerVal: 1.473 ± 1.126
SerTrp: 4.418 ± 3.377
SerTyr: 0.0 ± 0.0
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.946 ± 2.251
ThrCys: 2.946 ± 0.083
ThrAsp: 13.255 ± 7.215
ThrGlu: 1.473 ± 1.126
ThrPhe: 4.418 ± 0.959
ThrGly: 4.418 ± 0.959
ThrHis: 0.0 ± 0.0
ThrIle: 7.364 ± 1.292
ThrLys: 1.473 ± 1.043
ThrLeu: 7.364 ± 3.044
ThrMet: 0.0 ± 0.0
ThrAsn: 4.418 ± 0.959
ThrPro: 7.364 ± 3.044
ThrGln: 1.473 ± 1.126
ThrArg: 2.946 ± 2.085
ThrSer: 1.473 ± 1.043
ThrThr: 4.418 ± 1.209
ThrVal: 2.946 ± 2.251
ThrTrp: 0.0 ± 0.0
ThrTyr: 2.946 ± 2.251
ThrXaa: 0.0 ± 0.0
Val
ValAla: 0.0 ± 0.0
ValCys: 0.0 ± 0.0
ValAsp: 2.946 ± 0.083
ValGlu: 0.0 ± 0.0
ValPhe: 2.946 ± 2.251
ValGly: 4.418 ± 3.128
ValHis: 2.946 ± 2.251
ValIle: 2.946 ± 2.251
ValLys: 2.946 ± 2.251
ValLeu: 1.473 ± 1.043
ValMet: 1.473 ± 1.126
ValAsn: 1.473 ± 1.043
ValPro: 4.418 ± 0.959
ValGln: 1.473 ± 1.043
ValArg: 2.946 ± 2.085
ValSer: 1.473 ± 1.126
ValThr: 7.364 ± 0.876
ValVal: 0.0 ± 0.0
ValTrp: 0.0 ± 0.0
ValTyr: 4.418 ± 3.128
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 2.946 ± 0.083
TrpGlu: 1.473 ± 1.126
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 2.946 ± 2.251
TrpLeu: 1.473 ± 1.126
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 2.946 ± 2.251
TrpArg: 0.0 ± 0.0
TrpSer: 4.418 ± 3.128
TrpThr: 5.891 ± 2.002
TrpVal: 2.946 ± 2.251
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.946 ± 2.251
TyrCys: 1.473 ± 1.126
TyrAsp: 0.0 ± 0.0
TyrGlu: 0.0 ± 0.0
TyrPhe: 2.946 ± 2.085
TyrGly: 1.473 ± 1.126
TyrHis: 0.0 ± 0.0
TyrIle: 0.0 ± 0.0
TyrLys: 4.418 ± 0.959
TyrLeu: 1.473 ± 1.043
TyrMet: 0.0 ± 0.0
TyrAsn: 0.0 ± 0.0
TyrPro: 2.946 ± 0.083
TyrGln: 1.473 ± 1.043
TyrArg: 0.0 ± 0.0
TyrSer: 2.946 ± 0.083
TyrThr: 0.0 ± 0.0
TyrVal: 1.473 ± 1.043
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.473 ± 1.043
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (680 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
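
A minimal sketch of what protein-level bootstrapping could look like is given below. It rests on several assumptions that the note does not confirm: whole proteins are resampled with replacement, the dipeptide frequencies are recomputed for each of the 100 replicates, and the reported error is taken to be the standard deviation across replicates. The function names and the toy sequences (one-letter code) are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins):
    """Dipeptide frequencies in per mille (overlapping pairs per protein)."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Assumed error estimate: standard deviation over bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    values = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = [rng.choice(proteins) for _ in proteins]
        values.append(dipeptide_permille(sample).get(pair, 0.0))
    return statistics.pstdev(values)


# Toy sequences, not the real viral proteins.
toy = ["MKKA", "AKK"]
print(round(bootstrap_error(toy, "KK"), 3))
```

With only two proteins, as on this page, each replicate can contain just three distinct combinations of proteins, so an estimate of this kind is necessarily coarse.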

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski