Amino acid dipeptide frequency for Bromus-associated circular DNA virus 4

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with every amino acid equally frequent, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
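The comparison against the uniform expectation can be sketched in a few lines of Python. This is a minimal illustration, not the database's actual pipeline: the function names `dipeptide_permille` and `classify`, the toy sequences, and the choice to count overlapping pairs within each protein separately are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes


def dipeptide_permille(sequences):
    """Return the frequency of every observed dipeptide in per mille."""
    counts = Counter()
    for seq in sequences:
        # Overlapping windows of length 2; a pair never spans two proteins
        # (an assumption of this sketch).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freqs, expected=1000.0 / 400):
    """Label each of the 400 standard dipeptides against the uniform expectation (2.5 per mille)."""
    labels = {}
    for first, second in product(STANDARD, repeat=2):
        value = freqs.get(first + second, 0.0)
        if value > expected:
            labels[first + second] = "over"    # would be marked red
        elif value < expected:
            labels[first + second] = "under"   # would be marked blue
        else:
            labels[first + second] = "expected"
    return labels


# Hypothetical toy sequences, not taken from the proteome.
toy = ["MAAAK", "MKAA"]
freqs = dipeptide_permille(toy)
labels = classify(freqs)
```

With the toy input, `AA` makes up 3 of the 7 dipeptides and is labelled "over", while any dipeptide that never occurs, such as `WW`, is labelled "under".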

All values are given in per mille (‰); to obtain fractions, multiply them by 10⁻³.
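Because the text quotes the random expectation in percent while the table uses per mille, a small conversion sketch may help. The helper names below are illustrative assumptions; the 4.044 value is the AlaAla entry from the table.

```python
def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


def percent_to_permille(value):
    """Convert a percent value to per mille (multiply by 10)."""
    return value * 10.0


ala_ala = 4.044                         # AlaAla frequency from the table, in per mille
print(permille_to_fraction(ala_ala))    # fraction of all dipeptides
print(permille_to_percent(ala_ala))     # the same value in percent
print(percent_to_permille(0.25))        # uniform expectation of 0.25% expressed in per mille
```

On this scale the uniform expectation of 0.25% corresponds to 2.5‰, so the AlaAla value of 4.044‰ lies above it.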

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.044 ± 1.068
AlaCys: 0.0 ± 0.0
AlaAsp: 9.1 ± 2.375
AlaGlu: 2.022 ± 0.534
AlaPhe: 4.044 ± 1.482
AlaGly: 6.067 ± 1.53
AlaHis: 0.0 ± 0.0
AlaIle: 3.033 ± 1.975
AlaLys: 6.067 ± 0.406
AlaLeu: 6.067 ± 2.396
AlaMet: 1.011 ± 1.153
AlaAsn: 1.011 ± 0.658
AlaPro: 1.011 ± 0.658
AlaGln: 1.011 ± 0.658
AlaArg: 3.033 ± 1.337
AlaSer: 4.044 ± 1.068
AlaThr: 8.089 ± 1.099
AlaVal: 9.1 ± 1.197
AlaTrp: 3.033 ± 1.582
AlaTyr: 2.022 ± 0.534
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 1.011 ± 0.794
CysAsp: 0.0 ± 0.0
CysGlu: 1.011 ± 0.794
CysPhe: 0.0 ± 0.0
CysGly: 5.056 ± 3.999
CysHis: 1.011 ± 1.049
CysIle: 2.022 ± 1.587
CysLys: 3.033 ± 0.551
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 1.011 ± 1.049
CysGln: 0.0 ± 0.0
CysArg: 3.033 ± 2.381
CysSer: 0.0 ± 0.0
CysThr: 2.022 ± 0.534
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 1.011 ± 0.794
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.033 ± 2.381
AspCys: 1.011 ± 0.794
AspAsp: 4.044 ± 1.941
AspGlu: 5.056 ± 2.72
AspPhe: 1.011 ± 0.794
AspGly: 5.056 ± 1.081
AspHis: 0.0 ± 0.0
AspIle: 2.022 ± 1.587
AspLys: 0.0 ± 0.0
AspLeu: 8.089 ± 1.85
AspMet: 0.0 ± 0.0
AspAsn: 2.022 ± 1.587
AspPro: 5.056 ± 1.413
AspGln: 2.022 ± 0.534
AspArg: 2.022 ± 2.097
AspSer: 6.067 ± 2.427
AspThr: 4.044 ± 1.482
AspVal: 6.067 ± 1.703
AspTrp: 1.011 ± 0.658
AspTyr: 1.011 ± 0.794
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.044 ± 1.588
GluCys: 0.0 ± 0.0
GluAsp: 3.033 ± 2.381
GluGlu: 3.033 ± 3.146
GluPhe: 2.022 ± 1.082
GluGly: 3.033 ± 1.978
GluHis: 4.044 ± 2.164
GluIle: 2.022 ± 1.587
GluLys: 2.022 ± 0.534
GluLeu: 7.078 ± 3.815
GluMet: 0.0 ± 0.0
GluAsn: 1.011 ± 1.049
GluPro: 3.033 ± 1.582
GluGln: 3.033 ± 1.946
GluArg: 3.033 ± 0.898
GluSer: 1.011 ± 0.658
GluThr: 2.022 ± 0.534
GluVal: 0.0 ± 0.0
GluTrp: 1.011 ± 1.049
GluTyr: 0.0 ± 0.0
GluXaa: 0.0 ± 0.0
Phe
PheAla: 6.067 ± 2.364
PheCys: 2.022 ± 1.005
PheAsp: 4.044 ± 1.068
PheGlu: 1.011 ± 0.794
PhePhe: 1.011 ± 0.658
PheGly: 1.011 ± 0.794
PheHis: 2.022 ± 0.534
PheIle: 1.011 ± 0.794
PheLys: 2.022 ± 0.534
PheLeu: 2.022 ± 1.082
PheMet: 2.022 ± 0.534
PheAsn: 0.0 ± 0.0
PhePro: 1.011 ± 0.794
PheGln: 1.011 ± 0.658
PheArg: 1.011 ± 0.658
PheSer: 6.067 ± 2.752
PheThr: 6.067 ± 1.602
PheVal: 1.011 ± 0.658
PheTrp: 1.011 ± 1.049
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.033 ± 1.975
GlyCys: 1.011 ± 0.794
GlyAsp: 4.044 ± 0.55
GlyGlu: 4.044 ± 4.195
GlyPhe: 5.056 ± 1.081
GlyGly: 9.1 ± 3.646
GlyHis: 2.022 ± 2.097
GlyIle: 2.022 ± 1.082
GlyLys: 8.089 ± 3.882
GlyLeu: 6.067 ± 2.674
GlyMet: 2.022 ± 1.316
GlyAsn: 3.033 ± 0.551
GlyPro: 8.089 ± 2.529
GlyGln: 0.0 ± 0.0
GlyArg: 6.067 ± 2.427
GlySer: 4.044 ± 2.633
GlyThr: 5.056 ± 2.11
GlyVal: 4.044 ± 0.55
GlyTrp: 0.0 ± 0.0
GlyTyr: 2.022 ± 1.005
GlyXaa: 0.0 ± 0.0
His
HisAla: 5.056 ± 1.413
HisCys: 2.022 ± 1.082
HisAsp: 3.033 ± 1.582
HisGlu: 4.044 ± 4.195
HisPhe: 1.011 ± 0.794
HisGly: 2.022 ± 1.316
HisHis: 4.044 ± 2.633
HisIle: 3.033 ± 1.182
HisLys: 1.011 ± 0.794
HisLeu: 1.011 ± 0.658
HisMet: 1.011 ± 0.794
HisAsn: 1.011 ± 1.049
HisPro: 1.011 ± 0.794
HisGln: 1.011 ± 0.794
HisArg: 0.0 ± 0.0
HisSer: 3.033 ± 1.978
HisThr: 0.0 ± 0.0
HisVal: 2.022 ± 2.097
HisTrp: 1.011 ± 1.049
HisTyr: 2.022 ± 1.005
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.056 ± 1.323
IleCys: 0.0 ± 0.0
IleAsp: 4.044 ± 1.941
IleGlu: 3.033 ± 1.182
IlePhe: 1.011 ± 0.794
IleGly: 5.056 ± 3.291
IleHis: 4.044 ± 0.925
IleIle: 1.011 ± 0.794
IleLys: 1.011 ± 0.658
IleLeu: 5.056 ± 1.323
IleMet: 0.0 ± 0.0
IleAsn: 3.033 ± 2.381
IlePro: 2.022 ± 0.534
IleGln: 1.011 ± 0.658
IleArg: 5.056 ± 2.634
IleSer: 2.022 ± 0.534
IleThr: 1.011 ± 0.658
IleVal: 3.033 ± 1.182
IleTrp: 0.0 ± 0.0
IleTyr: 0.0 ± 0.0
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.022 ± 1.316
LysCys: 1.011 ± 0.794
LysAsp: 1.011 ± 0.658
LysGlu: 0.0 ± 0.0
LysPhe: 4.044 ± 1.068
LysGly: 5.056 ± 1.633
LysHis: 1.011 ± 0.658
LysIle: 3.033 ± 0.898
LysLys: 3.033 ± 0.898
LysLeu: 3.033 ± 1.182
LysMet: 0.0 ± 0.0
LysAsn: 2.022 ± 1.587
LysPro: 2.022 ± 1.316
LysGln: 1.011 ± 0.794
LysArg: 9.1 ± 3.586
LysSer: 2.022 ± 1.587
LysThr: 4.044 ± 1.068
LysVal: 5.056 ± 1.081
LysTrp: 1.011 ± 0.794
LysTyr: 2.022 ± 0.534
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.044 ± 2.01
LeuCys: 0.0 ± 0.0
LeuAsp: 5.056 ± 1.653
LeuGlu: 1.011 ± 0.794
LeuPhe: 3.033 ± 1.182
LeuGly: 8.089 ± 1.099
LeuHis: 3.033 ± 1.182
LeuIle: 6.067 ± 0.406
LeuLys: 1.011 ± 0.658
LeuLeu: 5.056 ± 0.277
LeuMet: 1.011 ± 0.639
LeuAsn: 4.044 ± 1.068
LeuPro: 5.056 ± 1.323
LeuGln: 1.011 ± 1.049
LeuArg: 4.044 ± 2.256
LeuSer: 5.056 ± 2.634
LeuThr: 2.022 ± 0.534
LeuVal: 3.033 ± 0.551
LeuTrp: 2.022 ± 1.587
LeuTyr: 6.067 ± 0.406
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.022 ± 1.005
MetCys: 0.0 ± 0.0
MetAsp: 0.0 ± 0.0
MetGlu: 1.011 ± 0.794
MetPhe: 1.011 ± 0.658
MetGly: 0.0 ± 0.0
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 0.0 ± 0.0
MetLeu: 0.0 ± 0.0
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 0.0 ± 0.0
MetGln: 0.0 ± 0.0
MetArg: 3.033 ± 1.975
MetSer: 4.044 ± 1.482
MetThr: 0.0 ± 0.0
MetVal: 2.022 ± 1.587
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.044 ± 1.941
AsnCys: 3.033 ± 1.182
AsnAsp: 0.0 ± 0.0
AsnGlu: 2.022 ± 1.082
AsnPhe: 4.044 ± 1.482
AsnGly: 1.011 ± 1.049
AsnHis: 1.011 ± 0.794
AsnIle: 2.022 ± 1.587
AsnLys: 2.022 ± 0.534
AsnLeu: 4.044 ± 0.55
AsnMet: 1.011 ± 0.658
AsnAsn: 3.033 ± 1.975
AsnPro: 2.022 ± 1.082
AsnGln: 4.044 ± 1.482
AsnArg: 2.022 ± 1.005
AsnSer: 1.011 ± 0.658
AsnThr: 2.022 ± 0.534
AsnVal: 4.044 ± 1.482
AsnTrp: 0.0 ± 0.0
AsnTyr: 2.022 ± 1.587
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.011 ± 1.049
ProCys: 0.0 ± 0.0
ProAsp: 2.022 ± 1.005
ProGlu: 5.056 ± 2.99
ProPhe: 2.022 ± 0.534
ProGly: 6.067 ± 3.015
ProHis: 4.044 ± 2.975
ProIle: 1.011 ± 1.049
ProLys: 4.044 ± 1.941
ProLeu: 4.044 ± 0.925
ProMet: 1.011 ± 0.794
ProAsn: 2.022 ± 0.534
ProPro: 1.011 ± 1.049
ProGln: 1.011 ± 0.658
ProArg: 4.044 ± 2.01
ProSer: 5.056 ± 1.081
ProThr: 5.056 ± 1.323
ProVal: 3.033 ± 0.898
ProTrp: 1.011 ± 1.049
ProTyr: 3.033 ± 1.337
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.022 ± 0.534
GlnCys: 1.011 ± 0.794
GlnAsp: 3.033 ± 0.551
GlnGlu: 1.011 ± 0.658
GlnPhe: 1.011 ± 0.794
GlnGly: 2.022 ± 1.316
GlnHis: 1.011 ± 0.794
GlnIle: 5.056 ± 2.11
GlnLys: 1.011 ± 0.658
GlnLeu: 2.022 ± 0.534
GlnMet: 2.022 ± 0.821
GlnAsn: 1.011 ± 0.658
GlnPro: 4.044 ± 2.01
GlnGln: 4.044 ± 1.852
GlnArg: 3.033 ± 0.551
GlnSer: 2.022 ± 0.534
GlnThr: 2.022 ± 1.005
GlnVal: 2.022 ± 0.534
GlnTrp: 1.011 ± 0.794
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 8.089 ± 1.099
ArgCys: 2.022 ± 2.097
ArgAsp: 4.044 ± 1.588
ArgGlu: 2.022 ± 0.534
ArgPhe: 4.044 ± 1.941
ArgGly: 6.067 ± 1.703
ArgHis: 4.044 ± 4.195
ArgIle: 3.033 ± 0.898
ArgLys: 5.056 ± 1.323
ArgLeu: 5.056 ± 1.323
ArgMet: 0.0 ± 0.0
ArgAsn: 5.056 ± 1.081
ArgPro: 4.044 ± 1.588
ArgGln: 3.033 ± 0.898
ArgArg: 8.089 ± 1.099
ArgSer: 4.044 ± 1.588
ArgThr: 3.033 ± 0.551
ArgVal: 1.011 ± 0.658
ArgTrp: 2.022 ± 0.534
ArgTyr: 3.033 ± 0.898
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.078 ± 1.058
SerCys: 0.0 ± 0.0
SerAsp: 3.033 ± 1.582
SerGlu: 2.022 ± 2.097
SerPhe: 3.033 ± 1.975
SerGly: 7.078 ± 3.163
SerHis: 1.011 ± 0.658
SerIle: 2.022 ± 0.534
SerLys: 3.033 ± 1.975
SerLeu: 2.022 ± 1.005
SerMet: 0.0 ± 0.0
SerAsn: 5.056 ± 1.081
SerPro: 3.033 ± 1.978
SerGln: 4.044 ± 1.941
SerArg: 7.078 ± 1.058
SerSer: 9.1 ± 3.646
SerThr: 4.044 ± 1.482
SerVal: 5.056 ± 1.586
SerTrp: 1.011 ± 0.794
SerTyr: 1.011 ± 0.658
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.056 ± 1.081
ThrCys: 1.011 ± 0.794
ThrAsp: 2.022 ± 1.316
ThrGlu: 1.011 ± 0.794
ThrPhe: 0.0 ± 0.0
ThrGly: 4.044 ± 1.068
ThrHis: 1.011 ± 0.794
ThrIle: 4.044 ± 1.482
ThrLys: 4.044 ± 1.482
ThrLeu: 2.022 ± 0.534
ThrMet: 0.0 ± 0.0
ThrAsn: 4.044 ± 1.482
ThrPro: 6.067 ± 1.797
ThrGln: 7.078 ± 1.058
ThrArg: 5.056 ± 2.11
ThrSer: 2.022 ± 1.005
ThrThr: 6.067 ± 1.602
ThrVal: 6.067 ± 0.406
ThrTrp: 1.011 ± 0.794
ThrTyr: 3.033 ± 1.182
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.044 ± 1.068
ValCys: 2.022 ± 2.097
ValAsp: 4.044 ± 0.55
ValGlu: 3.033 ± 0.551
ValPhe: 2.022 ± 1.082
ValGly: 3.033 ± 1.582
ValHis: 2.022 ± 1.005
ValIle: 1.011 ± 0.658
ValLys: 4.044 ± 1.482
ValLeu: 4.044 ± 0.925
ValMet: 1.011 ± 0.658
ValAsn: 5.056 ± 1.323
ValPro: 2.022 ± 1.005
ValGln: 4.044 ± 1.852
ValArg: 4.044 ± 2.256
ValSer: 4.044 ± 1.482
ValThr: 5.056 ± 1.323
ValVal: 4.044 ± 1.482
ValTrp: 1.011 ± 0.794
ValTyr: 2.022 ± 0.534
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.022 ± 1.082
TrpCys: 2.022 ± 1.082
TrpAsp: 0.0 ± 0.0
TrpGlu: 1.011 ± 0.658
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 2.022 ± 1.082
TrpLys: 1.011 ± 0.794
TrpLeu: 1.011 ± 0.658
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 2.022 ± 1.587
TrpGln: 1.011 ± 1.049
TrpArg: 2.022 ± 1.082
TrpSer: 1.011 ± 0.794
TrpThr: 1.011 ± 0.794
TrpVal: 1.011 ± 0.794
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.011 ± 1.049
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.033 ± 0.898
TyrCys: 1.011 ± 0.794
TyrAsp: 3.033 ± 0.898
TyrGlu: 2.022 ± 1.082
TyrPhe: 2.022 ± 0.534
TyrGly: 0.0 ± 0.0
TyrHis: 3.033 ± 0.551
TyrIle: 1.011 ± 0.658
TyrLys: 0.0 ± 0.0
TyrLeu: 2.022 ± 1.316
TyrMet: 0.0 ± 0.0
TyrAsn: 2.022 ± 1.587
TyrPro: 2.022 ± 1.082
TyrGln: 1.011 ± 0.794
TyrArg: 2.022 ± 0.534
TyrSer: 4.044 ± 0.55
TyrThr: 2.022 ± 1.316
TyrVal: 0.0 ± 0.0
TyrTrp: 1.011 ± 1.049
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (990 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
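A protein-level bootstrap of this kind might look like the sketch below. The exact procedure used by the database is not described here, so several details are assumptions: proteins are resampled with replacement, the frequency is recomputed for each of the 100 replicates, and the error is taken as the standard deviation across replicates. The function names, the seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(sequences, dipeptide):
    """Frequency of one dipeptide (one-letter codes) in per mille across the given proteins."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(sequences, dipeptide, replicates=100, seed=0):
    """Spread of the frequency over bootstrap samples drawn at the protein level."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    values = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = rng.choices(sequences, k=len(sequences))
        values.append(dipeptide_permille(sample, dipeptide))
    # Assumption: the reported error is the standard deviation of the replicates.
    return statistics.pstdev(values)


# Hypothetical toy proteome of three short proteins.
toy = ["MAAAK", "MKAA", "MWWK"]
error_aa = bootstrap_error(toy, "AA")
error_yy = bootstrap_error(toy, "YY")  # dipeptide absent from every protein
```

Under these assumptions, a dipeptide that occurs in no protein always yields 0.0 ± 0.0, which matches the pattern of the zero entries in the table. With only three proteins, each resample can differ strongly from the original set, so relatively large errors are to be expected.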

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski