Amino acid dipepetide frequency for Chicken anemia virus (isolate Japan 82-2) (CAV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.822AlaAla: 3.822 ± 2.443
0.0AlaCys: 0.0 ± 0.0
6.369AlaAsp: 6.369 ± 1.492
1.274AlaGlu: 1.274 ± 0.726
3.822AlaPhe: 3.822 ± 2.177
7.643AlaGly: 7.643 ± 0.78
0.0AlaHis: 0.0 ± 0.0
1.274AlaIle: 1.274 ± 1.465
3.822AlaLys: 3.822 ± 2.443
6.369AlaLeu: 6.369 ± 0.844
1.274AlaMet: 1.274 ± 0.726
2.548AlaAsn: 2.548 ± 1.763
3.822AlaPro: 3.822 ± 1.362
5.096AlaGln: 5.096 ± 1.472
6.369AlaArg: 6.369 ± 3.585
5.096AlaSer: 5.096 ± 1.472
8.917AlaThr: 8.917 ± 1.945
3.822AlaVal: 3.822 ± 2.443
2.548AlaTrp: 2.548 ± 1.451
1.274AlaTyr: 1.274 ± 0.726
0.0AlaXaa: 0.0 ± 0.0
Cys
2.548CysAla: 2.548 ± 2.001
1.274CysCys: 1.274 ± 0.726
1.274CysAsp: 1.274 ± 1.92
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
2.548CysGly: 2.548 ± 2.001
0.0CysHis: 0.0 ± 0.0
1.274CysIle: 1.274 ± 1.92
0.0CysLys: 0.0 ± 0.0
0.0CysLeu: 0.0 ± 0.0
1.274CysMet: 1.274 ± 0.726
1.274CysAsn: 1.274 ± 1.465
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
2.548CysArg: 2.548 ± 1.763
1.274CysSer: 1.274 ± 1.465
0.0CysThr: 0.0 ± 0.0
1.274CysVal: 1.274 ± 0.726
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
0.0AspAla: 0.0 ± 0.0
0.0AspCys: 0.0 ± 0.0
1.274AspAsp: 1.274 ± 0.726
5.096AspGlu: 5.096 ± 3.889
2.548AspPhe: 2.548 ± 2.93
2.548AspGly: 2.548 ± 1.049
2.548AspHis: 2.548 ± 1.451
2.548AspIle: 2.548 ± 2.93
0.0AspLys: 0.0 ± 0.0
1.274AspLeu: 1.274 ± 1.92
1.274AspMet: 1.274 ± 0.726
2.548AspAsn: 2.548 ± 1.763
5.096AspPro: 5.096 ± 0.87
1.274AspGln: 1.274 ± 1.92
2.548AspArg: 2.548 ± 2.93
5.096AspSer: 5.096 ± 1.472
3.822AspThr: 3.822 ± 1.893
2.548AspVal: 2.548 ± 1.451
1.274AspTrp: 1.274 ± 1.465
1.274AspTyr: 1.274 ± 1.465
0.0AspXaa: 0.0 ± 0.0
Glu
3.822GluAla: 3.822 ± 2.443
2.548GluCys: 2.548 ± 2.93
5.096GluAsp: 5.096 ± 4.186
2.548GluGlu: 2.548 ± 2.93
1.274GluPhe: 1.274 ± 0.726
3.822GluGly: 3.822 ± 1.052
0.0GluHis: 0.0 ± 0.0
1.274GluIle: 1.274 ± 1.92
0.0GluLys: 0.0 ± 0.0
6.369GluLeu: 6.369 ± 1.492
0.0GluMet: 0.0 ± 0.0
1.274GluAsn: 1.274 ± 0.726
1.274GluPro: 1.274 ± 0.726
1.274GluGln: 1.274 ± 0.726
2.548GluArg: 2.548 ± 1.049
3.822GluSer: 3.822 ± 3.638
3.822GluThr: 3.822 ± 1.893
0.0GluVal: 0.0 ± 0.0
0.0GluTrp: 0.0 ± 0.0
2.548GluTyr: 2.548 ± 1.763
0.0GluXaa: 0.0 ± 0.0
Phe
1.274PheAla: 1.274 ± 0.726
0.0PheCys: 0.0 ± 0.0
2.548PheAsp: 2.548 ± 2.93
0.0PheGlu: 0.0 ± 0.0
2.548PhePhe: 2.548 ± 1.451
2.548PheGly: 2.548 ± 1.451
1.274PheHis: 1.274 ± 0.726
0.0PheIle: 0.0 ± 0.0
1.274PheLys: 1.274 ± 1.92
2.548PheLeu: 2.548 ± 1.451
0.0PheMet: 0.0 ± 0.0
2.548PheAsn: 2.548 ± 1.451
1.274PhePro: 1.274 ± 0.726
3.822PheGln: 3.822 ± 1.052
7.643PheArg: 7.643 ± 1.312
1.274PheSer: 1.274 ± 0.726
5.096PheThr: 5.096 ± 3.889
0.0PheVal: 0.0 ± 0.0
0.0PheTrp: 0.0 ± 0.0
1.274PheTyr: 1.274 ± 0.726
0.0PheXaa: 0.0 ± 0.0
Gly
6.369GlyAla: 6.369 ± 2.069
1.274GlyCys: 1.274 ± 1.92
2.548GlyAsp: 2.548 ± 1.049
2.548GlyGlu: 2.548 ± 1.451
1.274GlyPhe: 1.274 ± 1.92
11.465GlyGly: 11.465 ± 4.007
0.0GlyHis: 0.0 ± 0.0
5.096GlyIle: 5.096 ± 3.159
2.548GlyLys: 2.548 ± 1.049
2.548GlyLeu: 2.548 ± 1.049
0.0GlyMet: 0.0 ± 0.0
3.822GlyAsn: 3.822 ± 2.443
3.822GlyPro: 3.822 ± 1.362
8.917GlyGln: 8.917 ± 4.496
5.096GlyArg: 5.096 ± 1.472
3.822GlySer: 3.822 ± 1.052
8.917GlyThr: 8.917 ± 3.415
2.548GlyVal: 2.548 ± 1.049
1.274GlyTrp: 1.274 ± 0.726
1.274GlyTyr: 1.274 ± 0.726
0.0GlyXaa: 0.0 ± 0.0
His
1.274HisAla: 1.274 ± 1.465
1.274HisCys: 1.274 ± 1.92
0.0HisAsp: 0.0 ± 0.0
0.0HisGlu: 0.0 ± 0.0
0.0HisPhe: 0.0 ± 0.0
1.274HisGly: 1.274 ± 1.465
1.274HisHis: 1.274 ± 0.726
1.274HisIle: 1.274 ± 0.726
1.274HisLys: 1.274 ± 0.726
2.548HisLeu: 2.548 ± 1.451
1.274HisMet: 1.274 ± 0.726
1.274HisAsn: 1.274 ± 0.726
1.274HisPro: 1.274 ± 0.726
1.274HisGln: 1.274 ± 0.726
1.274HisArg: 1.274 ± 0.726
1.274HisSer: 1.274 ± 1.465
1.274HisThr: 1.274 ± 0.726
0.0HisVal: 0.0 ± 0.0
1.274HisTrp: 1.274 ± 1.465
1.274HisTyr: 1.274 ± 1.465
0.0HisXaa: 0.0 ± 0.0
Ile
6.369IleAla: 6.369 ± 0.844
1.274IleCys: 1.274 ± 1.465
0.0IleAsp: 0.0 ± 0.0
1.274IleGlu: 1.274 ± 0.726
1.274IlePhe: 1.274 ± 0.726
2.548IleGly: 2.548 ± 2.001
0.0IleHis: 0.0 ± 0.0
1.274IleIle: 1.274 ± 0.726
0.0IleLys: 0.0 ± 0.0
2.548IleLeu: 2.548 ± 1.049
0.0IleMet: 0.0 ± 0.0
2.548IleAsn: 2.548 ± 2.93
0.0IlePro: 0.0 ± 0.0
0.0IleGln: 0.0 ± 0.0
3.822IleArg: 3.822 ± 3.614
2.548IleSer: 2.548 ± 1.049
5.096IleThr: 5.096 ± 3.526
2.548IleVal: 2.548 ± 2.93
0.0IleTrp: 0.0 ± 0.0
0.0IleTyr: 0.0 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
2.548LysAla: 2.548 ± 1.451
0.0LysCys: 0.0 ± 0.0
1.274LysAsp: 1.274 ± 1.465
3.822LysGlu: 3.822 ± 1.893
3.822LysPhe: 3.822 ± 1.052
2.548LysGly: 2.548 ± 1.451
1.274LysHis: 1.274 ± 1.465
2.548LysIle: 2.548 ± 1.049
2.548LysLys: 2.548 ± 2.001
2.548LysLeu: 2.548 ± 1.049
1.274LysMet: 1.274 ± 0.726
3.822LysAsn: 3.822 ± 1.893
1.274LysPro: 1.274 ± 1.92
1.274LysGln: 1.274 ± 0.726
5.096LysArg: 5.096 ± 2.213
3.822LysSer: 3.822 ± 2.177
1.274LysThr: 1.274 ± 1.465
1.274LysVal: 1.274 ± 1.465
0.0LysTrp: 0.0 ± 0.0
1.274LysTyr: 1.274 ± 0.726
0.0LysXaa: 0.0 ± 0.0
Leu
3.822LeuAla: 3.822 ± 1.052
1.274LeuCys: 1.274 ± 1.92
3.822LeuAsp: 3.822 ± 1.052
3.822LeuGlu: 3.822 ± 2.936
1.274LeuPhe: 1.274 ± 0.726
6.369LeuGly: 6.369 ± 2.069
0.0LeuHis: 0.0 ± 0.0
2.548LeuIle: 2.548 ± 1.451
5.096LeuLys: 5.096 ± 2.261
5.096LeuLeu: 5.096 ± 1.472
1.274LeuMet: 1.274 ± 1.348
0.0LeuAsn: 0.0 ± 0.0
2.548LeuPro: 2.548 ± 1.451
3.822LeuGln: 3.822 ± 1.362
7.643LeuArg: 7.643 ± 4.087
3.822LeuSer: 3.822 ± 1.362
6.369LeuThr: 6.369 ± 2.773
1.274LeuVal: 1.274 ± 0.726
0.0LeuTrp: 0.0 ± 0.0
1.274LeuTyr: 1.274 ± 0.726
0.0LeuXaa: 0.0 ± 0.0
Met
0.0MetAla: 0.0 ± 0.0
0.0MetCys: 0.0 ± 0.0
0.0MetAsp: 0.0 ± 0.0
1.274MetGlu: 1.274 ± 0.726
1.274MetPhe: 1.274 ± 0.726
2.548MetGly: 2.548 ± 1.451
1.274MetHis: 1.274 ± 1.465
0.0MetIle: 0.0 ± 0.0
1.274MetLys: 1.274 ± 0.726
0.0MetLeu: 0.0 ± 0.0
2.548MetMet: 2.548 ± 1.451
3.822MetAsn: 3.822 ± 1.893
1.274MetPro: 1.274 ± 0.726
1.274MetGln: 1.274 ± 0.726
1.274MetArg: 1.274 ± 0.726
1.274MetSer: 1.274 ± 0.726
3.822MetThr: 3.822 ± 2.177
0.0MetVal: 0.0 ± 0.0
0.0MetTrp: 0.0 ± 0.0
2.548MetTyr: 2.548 ± 1.451
0.0MetXaa: 0.0 ± 0.0
Asn
2.548AsnAla: 2.548 ± 3.84
2.548AsnCys: 2.548 ± 1.049
0.0AsnAsp: 0.0 ± 0.0
2.548AsnGlu: 2.548 ± 1.049
2.548AsnPhe: 2.548 ± 1.049
2.548AsnGly: 2.548 ± 2.93
5.096AsnHis: 5.096 ± 1.472
0.0AsnIle: 0.0 ± 0.0
2.548AsnLys: 2.548 ± 1.049
2.548AsnLeu: 2.548 ± 1.451
0.0AsnMet: 0.0 ± 0.0
1.274AsnAsn: 1.274 ± 0.726
3.822AsnPro: 3.822 ± 1.052
0.0AsnGln: 0.0 ± 0.0
3.822AsnArg: 3.822 ± 1.052
5.096AsnSer: 5.096 ± 2.261
0.0AsnThr: 0.0 ± 0.0
2.548AsnVal: 2.548 ± 1.763
2.548AsnTrp: 2.548 ± 1.451
1.274AsnTyr: 1.274 ± 1.465
0.0AsnXaa: 0.0 ± 0.0
Pro
5.096ProAla: 5.096 ± 2.098
0.0ProCys: 0.0 ± 0.0
5.096ProAsp: 5.096 ± 0.87
0.0ProGlu: 0.0 ± 0.0
1.274ProPhe: 1.274 ± 1.465
5.096ProGly: 5.096 ± 0.87
1.274ProHis: 1.274 ± 1.92
2.548ProIle: 2.548 ± 1.451
5.096ProLys: 5.096 ± 2.261
5.096ProLeu: 5.096 ± 0.87
2.548ProMet: 2.548 ± 1.269
3.822ProAsn: 3.822 ± 1.052
7.643ProPro: 7.643 ± 5.289
1.274ProGln: 1.274 ± 0.726
5.096ProArg: 5.096 ± 0.87
11.465ProSer: 11.465 ± 6.341
5.096ProThr: 5.096 ± 3.159
3.822ProVal: 3.822 ± 1.052
1.274ProTrp: 1.274 ± 0.726
1.274ProTyr: 1.274 ± 0.726
0.0ProXaa: 0.0 ± 0.0
Gln
3.822GlnAla: 3.822 ± 2.443
0.0GlnCys: 0.0 ± 0.0
1.274GlnAsp: 1.274 ± 0.726
2.548GlnGlu: 2.548 ± 2.001
1.274GlnPhe: 1.274 ± 1.465
7.643GlnGly: 7.643 ± 2.105
0.0GlnHis: 0.0 ± 0.0
0.0GlnIle: 0.0 ± 0.0
0.0GlnLys: 0.0 ± 0.0
2.548GlnLeu: 2.548 ± 1.451
1.274GlnMet: 1.274 ± 0.726
1.274GlnAsn: 1.274 ± 0.726
8.917GlnPro: 8.917 ± 2.19
2.548GlnGln: 2.548 ± 1.451
6.369GlnArg: 6.369 ± 2.069
2.548GlnSer: 2.548 ± 1.451
0.0GlnThr: 0.0 ± 0.0
2.548GlnVal: 2.548 ± 1.049
1.274GlnTrp: 1.274 ± 0.726
2.548GlnTyr: 2.548 ± 1.451
0.0GlnXaa: 0.0 ± 0.0
Arg
5.096ArgAla: 5.096 ± 0.87
2.548ArgCys: 2.548 ± 1.763
2.548ArgAsp: 2.548 ± 1.049
5.096ArgGlu: 5.096 ± 4.186
5.096ArgPhe: 5.096 ± 2.902
3.822ArgGly: 3.822 ± 2.177
2.548ArgHis: 2.548 ± 1.451
2.548ArgIle: 2.548 ± 2.001
6.369ArgLys: 6.369 ± 3.463
3.822ArgLeu: 3.822 ± 1.893
1.274ArgMet: 1.274 ± 0.726
0.0ArgAsn: 0.0 ± 0.0
11.465ArgPro: 11.465 ± 3.659
2.548ArgGln: 2.548 ± 1.451
16.561ArgArg: 16.561 ± 6.185
8.917ArgSer: 8.917 ± 3.37
3.822ArgThr: 3.822 ± 3.638
5.096ArgVal: 5.096 ± 0.87
7.643ArgTrp: 7.643 ± 2.105
3.822ArgTyr: 3.822 ± 1.052
0.0ArgXaa: 0.0 ± 0.0
Ser
5.096SerAla: 5.096 ± 0.87
2.548SerCys: 2.548 ± 1.763
3.822SerAsp: 3.822 ± 1.052
8.917SerGlu: 8.917 ± 4.471
5.096SerPhe: 5.096 ± 2.902
2.548SerGly: 2.548 ± 2.93
1.274SerHis: 1.274 ± 1.465
1.274SerIle: 1.274 ± 1.465
3.822SerLys: 3.822 ± 1.893
6.369SerLeu: 6.369 ± 3.346
1.274SerMet: 1.274 ± 0.726
1.274SerAsn: 1.274 ± 1.465
2.548SerPro: 2.548 ± 1.049
3.822SerGln: 3.822 ± 1.052
6.369SerArg: 6.369 ± 3.346
3.822SerSer: 3.822 ± 1.362
11.465SerThr: 11.465 ± 1.554
5.096SerVal: 5.096 ± 2.902
2.548SerTrp: 2.548 ± 1.451
1.274SerTyr: 1.274 ± 0.726
0.0SerXaa: 0.0 ± 0.0
Thr
8.917ThrAla: 8.917 ± 2.556
0.0ThrCys: 0.0 ± 0.0
3.822ThrAsp: 3.822 ± 1.893
2.548ThrGlu: 2.548 ± 1.451
0.0ThrPhe: 0.0 ± 0.0
3.822ThrGly: 3.822 ± 1.893
0.0ThrHis: 0.0 ± 0.0
5.096ThrIle: 5.096 ± 0.87
1.274ThrLys: 1.274 ± 0.726
7.643ThrLeu: 7.643 ± 3.786
2.548ThrMet: 2.548 ± 1.451
3.822ThrAsn: 3.822 ± 1.052
8.917ThrPro: 8.917 ± 5.343
5.096ThrGln: 5.096 ± 1.472
3.822ThrArg: 3.822 ± 1.052
3.822ThrSer: 3.822 ± 1.362
7.643ThrThr: 7.643 ± 4.986
2.548ThrVal: 2.548 ± 2.001
0.0ThrTrp: 0.0 ± 0.0
5.096ThrTyr: 5.096 ± 2.902
0.0ThrXaa: 0.0 ± 0.0
Val
3.822ValAla: 3.822 ± 2.177
0.0ValCys: 0.0 ± 0.0
1.274ValAsp: 1.274 ± 1.465
0.0ValGlu: 0.0 ± 0.0
1.274ValPhe: 1.274 ± 1.92
1.274ValGly: 1.274 ± 1.465
0.0ValHis: 0.0 ± 0.0
2.548ValIle: 2.548 ± 1.049
1.274ValLys: 1.274 ± 0.726
0.0ValLeu: 0.0 ± 0.0
2.548ValMet: 2.548 ± 0.877
2.548ValAsn: 2.548 ± 1.451
5.096ValPro: 5.096 ± 2.261
3.822ValGln: 3.822 ± 2.443
6.369ValArg: 6.369 ± 1.972
3.822ValSer: 3.822 ± 1.893
0.0ValThr: 0.0 ± 0.0
1.274ValVal: 1.274 ± 0.726
2.548ValTrp: 2.548 ± 1.049
1.274ValTyr: 1.274 ± 1.465
0.0ValXaa: 0.0 ± 0.0
Trp
3.822TrpAla: 3.822 ± 2.177
0.0TrpCys: 0.0 ± 0.0
2.548TrpAsp: 2.548 ± 1.451
0.0TrpGlu: 0.0 ± 0.0
1.274TrpPhe: 1.274 ± 1.465
1.274TrpGly: 1.274 ± 0.726
2.548TrpHis: 2.548 ± 1.451
0.0TrpIle: 0.0 ± 0.0
1.274TrpLys: 1.274 ± 1.465
1.274TrpLeu: 1.274 ± 1.465
0.0TrpMet: 0.0 ± 0.0
2.548TrpAsn: 2.548 ± 1.049
1.274TrpPro: 1.274 ± 0.726
1.274TrpGln: 1.274 ± 0.726
2.548TrpArg: 2.548 ± 1.451
2.548TrpSer: 2.548 ± 1.451
0.0TrpThr: 0.0 ± 0.0
0.0TrpVal: 0.0 ± 0.0
2.548TrpTrp: 2.548 ± 1.451
1.274TrpTyr: 1.274 ± 1.465
0.0TrpXaa: 0.0 ± 0.0
Tyr
5.096TyrAla: 5.096 ± 2.902
0.0TyrCys: 0.0 ± 0.0
0.0TyrAsp: 0.0 ± 0.0
0.0TyrGlu: 0.0 ± 0.0
0.0TyrPhe: 0.0 ± 0.0
1.274TyrGly: 1.274 ± 0.726
1.274TyrHis: 1.274 ± 1.465
0.0TyrIle: 0.0 ± 0.0
3.822TyrLys: 3.822 ± 1.052
1.274TyrLeu: 1.274 ± 0.726
2.548TyrMet: 2.548 ± 1.451
1.274TyrAsn: 1.274 ± 1.465
2.548TyrPro: 2.548 ± 1.451
0.0TyrGln: 0.0 ± 0.0
3.822TyrArg: 3.822 ± 1.362
5.096TyrSer: 5.096 ± 2.098
0.0TyrThr: 0.0 ± 0.0
2.548TyrVal: 2.548 ± 1.451
1.274TyrTrp: 1.274 ± 0.726
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (786 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski