Amino acid dipeptide frequency for Bacilladnaviridae sp.

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
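As a rough illustration of what a dipeptide frequency is, the following minimal sketch counts overlapping pairs of adjacent residues and reports them in per mille. The function name `dipeptide_permille`, the one-letter residue codes, and the choice to count pairs only within each sequence are assumptions made for this example; the page does not state how Proteome-pI performs the count.

```python
from collections import Counter


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} pooled over all sequences.

    Assumption: pairs are counted with a sliding window of length 2
    inside each sequence, and never across sequence boundaries.
    """
    counts = Counter()
    for seq in sequences:
        # "AAGA" yields the overlapping pairs "AA", "AG", "GA"
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not real proteome data
freqs = dipeptide_permille(["AAGA", "GAA"])
```

With the toy input, five dipeptides are counted in total, so the frequencies sum to 1000 per mille.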

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
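The uniform expectation can be checked with a few lines of arithmetic. This is a sketch only: `expected_permille` and `classify` are hypothetical helper names, and comparing each value directly against the uniform expectation is an assumption about how the red and blue marking is decided.

```python
def expected_permille(n_residue_types=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / n_residue_types ** 2


def classify(observed_permille, expected):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "expected"


expected = expected_permille()  # 20 amino acids -> 400 dipeptides
labels = {
    "AlaAla": classify(10.062, expected),  # value taken from the table
    "AlaMet": classify(0.774, expected),   # value taken from the table
}
```

Passing `n_residue_types=21` gives the expectation for the 441-dipeptide case that includes Xaa.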

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
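For readers who prefer fractions or percentages, the unit conversion can be sketched as follows. The helper names are made up for this example, and the value used is the AlaAla entry from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0


ala_ala = 10.062                          # AlaAla frequency in per mille
fraction = permille_to_fraction(ala_ala)  # roughly 0.010062
percent = permille_to_percent(ala_ala)    # roughly 1.0062 percent
```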

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 10.062 ± 4.978
AlaCys: 0.0 ± 0.0
AlaAsp: 3.096 ± 1.173
AlaGlu: 5.418 ± 1.578
AlaPhe: 3.096 ± 0.838
AlaGly: 10.062 ± 4.121
AlaHis: 2.322 ± 1.025
AlaIle: 6.192 ± 1.857
AlaLys: 1.548 ± 1.177
AlaLeu: 3.87 ± 0.664
AlaMet: 0.774 ± 0.588
AlaAsn: 2.322 ± 1.247
AlaPro: 1.548 ± 1.19
AlaGln: 3.87 ± 2.302
AlaArg: 2.322 ± 0.708
AlaSer: 3.87 ± 1.239
AlaThr: 2.322 ± 1.496
AlaVal: 6.966 ± 2.914
AlaTrp: 0.774 ± 0.771
AlaTyr: 3.096 ± 0.675
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.774 ± 0.588
CysGlu: 0.0 ± 0.0
CysPhe: 0.774 ± 0.771
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.0 ± 0.0
CysLeu: 0.774 ± 0.595
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 1.548 ± 0.586
CysGln: 0.0 ± 0.0
CysArg: 1.548 ± 1.19
CysSer: 1.548 ± 1.177
CysThr: 0.774 ± 0.588
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.74 ± 1.329
AspCys: 0.0 ± 0.0
AspAsp: 8.514 ± 3.164
AspGlu: 3.096 ± 1.756
AspPhe: 0.774 ± 0.771
AspGly: 7.74 ± 0.931
AspHis: 3.87 ± 1.37
AspIle: 2.322 ± 1.013
AspLys: 3.096 ± 2.354
AspLeu: 6.192 ± 1.19
AspMet: 3.096 ± 1.173
AspAsn: 2.322 ± 0.484
AspPro: 5.418 ± 2.893
AspGln: 2.322 ± 0.484
AspArg: 1.548 ± 0.855
AspSer: 2.322 ± 1.518
AspThr: 3.87 ± 1.463
AspVal: 2.322 ± 1.604
AspTrp: 3.096 ± 1.431
AspTyr: 3.096 ± 0.448
AspXaa: 0.0 ± 0.0
Glu
GluAla: 0.0 ± 0.0
GluCys: 0.0 ± 0.0
GluAsp: 1.548 ± 1.177
GluGlu: 4.644 ± 2.846
GluPhe: 4.644 ± 1.751
GluGly: 0.774 ± 0.588
GluHis: 0.0 ± 0.0
GluIle: 2.322 ± 1.024
GluLys: 2.322 ± 0.817
GluLeu: 3.87 ± 2.149
GluMet: 0.774 ± 0.768
GluAsn: 1.548 ± 1.19
GluPro: 2.322 ± 1.249
GluGln: 2.322 ± 0.817
GluArg: 0.774 ± 0.588
GluSer: 5.418 ± 0.526
GluThr: 6.966 ± 1.815
GluVal: 6.192 ± 4.17
GluTrp: 0.0 ± 0.0
GluTyr: 2.322 ± 0.484
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.548 ± 0.586
PheCys: 0.0 ± 0.0
PheAsp: 3.096 ± 1.431
PheGlu: 2.322 ± 0.708
PhePhe: 2.322 ± 1.013
PheGly: 0.774 ± 0.595
PheHis: 0.774 ± 0.588
PheIle: 1.548 ± 1.177
PheLys: 0.0 ± 0.0
PheLeu: 1.548 ± 0.849
PheMet: 1.548 ± 1.19
PheAsn: 3.096 ± 2.354
PhePro: 3.096 ± 1.55
PheGln: 0.0 ± 0.0
PheArg: 1.548 ± 1.536
PheSer: 2.322 ± 0.708
PheThr: 6.966 ± 1.501
PheVal: 3.87 ± 1.039
PheTrp: 1.548 ± 1.177
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.74 ± 2.014
GlyCys: 0.0 ± 0.0
GlyAsp: 0.774 ± 0.588
GlyGlu: 1.548 ± 1.536
GlyPhe: 3.096 ± 1.55
GlyGly: 6.192 ± 0.825
GlyHis: 3.096 ± 1.491
GlyIle: 2.322 ± 1.024
GlyLys: 4.644 ± 0.968
GlyLeu: 5.418 ± 2.893
GlyMet: 1.548 ± 0.828
GlyAsn: 2.322 ± 1.024
GlyPro: 3.87 ± 0.461
GlyGln: 3.87 ± 1.645
GlyArg: 3.096 ± 0.675
GlySer: 4.644 ± 0.968
GlyThr: 9.288 ± 1.878
GlyVal: 2.322 ± 1.496
GlyTrp: 1.548 ± 0.586
GlyTyr: 1.548 ± 0.849
GlyXaa: 0.0 ± 0.0
His
HisAla: 5.418 ± 2.296
HisCys: 1.548 ± 0.586
HisAsp: 3.87 ± 1.548
HisGlu: 1.548 ± 0.586
HisPhe: 1.548 ± 0.996
HisGly: 0.774 ± 0.588
HisHis: 3.096 ± 0.675
HisIle: 1.548 ± 0.586
HisLys: 2.322 ± 2.305
HisLeu: 0.774 ± 0.588
HisMet: 0.774 ± 0.768
HisAsn: 1.548 ± 1.19
HisPro: 1.548 ± 1.177
HisGln: 0.0 ± 0.0
HisArg: 3.096 ± 1.431
HisSer: 3.096 ± 1.216
HisThr: 0.774 ± 0.595
HisVal: 1.548 ± 0.689
HisTrp: 0.774 ± 0.588
HisTyr: 2.322 ± 1.518
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.644 ± 1.573
IleCys: 0.774 ± 0.588
IleAsp: 5.418 ± 1.814
IleGlu: 2.322 ± 0.708
IlePhe: 0.774 ± 0.588
IleGly: 2.322 ± 1.237
IleHis: 1.548 ± 1.177
IleIle: 2.322 ± 0.484
IleLys: 3.87 ± 2.149
IleLeu: 3.096 ± 0.987
IleMet: 3.096 ± 1.428
IleAsn: 1.548 ± 1.19
IlePro: 1.548 ± 0.586
IleGln: 1.548 ± 1.19
IleArg: 2.322 ± 0.708
IleSer: 3.096 ± 0.675
IleThr: 3.096 ± 1.491
IleVal: 1.548 ± 0.586
IleTrp: 1.548 ± 0.855
IleTyr: 2.322 ± 0.817
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.096 ± 1.491
LysCys: 0.0 ± 0.0
LysAsp: 3.87 ± 1.047
LysGlu: 2.322 ± 0.708
LysPhe: 2.322 ± 1.237
LysGly: 0.774 ± 0.588
LysHis: 2.322 ± 2.305
LysIle: 2.322 ± 0.484
LysLys: 8.514 ± 3.723
LysLeu: 5.418 ± 1.26
LysMet: 0.774 ± 0.588
LysAsn: 1.548 ± 0.84
LysPro: 5.418 ± 1.543
LysGln: 3.096 ± 1.55
LysArg: 6.966 ± 2.135
LysSer: 5.418 ± 2.029
LysThr: 4.644 ± 2.066
LysVal: 1.548 ± 0.855
LysTrp: 1.548 ± 0.84
LysTyr: 0.774 ± 0.588
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 1.548 ± 1.177
LeuCys: 0.774 ± 0.595
LeuAsp: 9.288 ± 2.999
LeuGlu: 3.096 ± 1.378
LeuPhe: 2.322 ± 1.496
LeuGly: 8.514 ± 1.819
LeuHis: 4.644 ± 2.015
LeuIle: 4.644 ± 2.846
LeuLys: 3.096 ± 0.675
LeuLeu: 6.966 ± 1.501
LeuMet: 1.548 ± 0.996
LeuAsn: 6.192 ± 1.19
LeuPro: 0.774 ± 0.771
LeuGln: 3.096 ± 0.987
LeuArg: 1.548 ± 0.84
LeuSer: 3.87 ± 1.448
LeuThr: 3.096 ± 0.838
LeuVal: 4.644 ± 2.222
LeuTrp: 1.548 ± 1.177
LeuTyr: 0.774 ± 0.771
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.548 ± 0.586
MetCys: 0.0 ± 0.0
MetAsp: 2.322 ± 0.939
MetGlu: 0.0 ± 0.0
MetPhe: 0.774 ± 0.768
MetGly: 0.774 ± 0.595
MetHis: 0.0 ± 0.0
MetIle: 2.322 ± 1.496
MetLys: 1.548 ± 0.84
MetLeu: 0.0 ± 0.0
MetMet: 0.774 ± 0.595
MetAsn: 0.774 ± 0.768
MetPro: 0.0 ± 0.0
MetGln: 0.774 ± 0.595
MetArg: 2.322 ± 0.708
MetSer: 3.096 ± 1.756
MetThr: 3.096 ± 1.216
MetVal: 2.322 ± 1.024
MetTrp: 0.774 ± 0.588
MetTyr: 0.774 ± 0.771
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.418 ± 1.615
AsnCys: 0.0 ± 0.0
AsnAsp: 1.548 ± 0.996
AsnGlu: 3.096 ± 0.99
AsnPhe: 0.774 ± 0.588
AsnGly: 3.096 ± 0.99
AsnHis: 2.322 ± 1.249
AsnIle: 3.87 ± 1.047
AsnLys: 3.87 ± 2.302
AsnLeu: 3.096 ± 0.448
AsnMet: 1.548 ± 0.958
AsnAsn: 4.644 ± 2.049
AsnPro: 0.774 ± 0.588
AsnGln: 1.548 ± 0.84
AsnArg: 1.548 ± 0.689
AsnSer: 2.322 ± 1.785
AsnThr: 3.87 ± 2.302
AsnVal: 3.096 ± 1.523
AsnTrp: 1.548 ± 0.586
AsnTyr: 1.548 ± 0.689
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.192 ± 0.403
ProCys: 0.0 ± 0.0
ProAsp: 9.288 ± 1.704
ProGlu: 1.548 ± 1.536
ProPhe: 3.87 ± 1.548
ProGly: 2.322 ± 1.013
ProHis: 2.322 ± 1.013
ProIle: 2.322 ± 1.508
ProLys: 6.192 ± 2.573
ProLeu: 3.87 ± 1.453
ProMet: 0.774 ± 0.588
ProAsn: 1.548 ± 0.84
ProPro: 0.774 ± 0.588
ProGln: 0.774 ± 0.588
ProArg: 3.096 ± 1.173
ProSer: 1.548 ± 1.177
ProThr: 3.096 ± 1.498
ProVal: 0.0 ± 0.0
ProTrp: 0.0 ± 0.0
ProTyr: 0.0 ± 0.0
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.0 ± 0.0
GlnCys: 0.0 ± 0.0
GlnAsp: 0.774 ± 0.768
GlnGlu: 3.096 ± 1.55
GlnPhe: 1.548 ± 0.586
GlnGly: 2.322 ± 0.484
GlnHis: 1.548 ± 0.586
GlnIle: 1.548 ± 0.586
GlnLys: 0.0 ± 0.0
GlnLeu: 2.322 ± 1.508
GlnMet: 1.548 ± 0.996
GlnAsn: 2.322 ± 1.237
GlnPro: 1.548 ± 1.19
GlnGln: 2.322 ± 1.249
GlnArg: 2.322 ± 1.013
GlnSer: 3.096 ± 0.675
GlnThr: 2.322 ± 1.013
GlnVal: 4.644 ± 2.051
GlnTrp: 0.774 ± 0.768
GlnTyr: 0.774 ± 0.588
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.322 ± 0.708
ArgCys: 0.0 ± 0.0
ArgAsp: 1.548 ± 0.996
ArgGlu: 1.548 ± 0.996
ArgPhe: 1.548 ± 0.586
ArgGly: 3.87 ± 2.178
ArgHis: 1.548 ± 1.177
ArgIle: 0.0 ± 0.0
ArgLys: 5.418 ± 1.339
ArgLeu: 4.644 ± 1.939
ArgMet: 0.774 ± 0.622
ArgAsn: 4.644 ± 1.751
ArgPro: 1.548 ± 1.177
ArgGln: 2.322 ± 1.013
ArgArg: 5.418 ± 2.296
ArgSer: 6.192 ± 1.279
ArgThr: 3.096 ± 0.99
ArgVal: 3.096 ± 1.523
ArgTrp: 1.548 ± 0.84
ArgTyr: 2.322 ± 0.708
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.644 ± 1.177
SerCys: 0.0 ± 0.0
SerAsp: 3.87 ± 1.645
SerGlu: 1.548 ± 0.84
SerPhe: 1.548 ± 0.586
SerGly: 3.096 ± 1.336
SerHis: 1.548 ± 0.84
SerIle: 1.548 ± 0.855
SerLys: 10.062 ± 2.135
SerLeu: 6.192 ± 0.403
SerMet: 0.774 ± 0.595
SerAsn: 3.87 ± 0.461
SerPro: 6.192 ± 2.39
SerGln: 1.548 ± 0.855
SerArg: 3.87 ± 1.031
SerSer: 2.322 ± 1.025
SerThr: 4.644 ± 0.311
SerVal: 4.644 ± 1.075
SerTrp: 3.096 ± 0.675
SerTyr: 0.774 ± 0.595
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.966 ± 3.006
ThrCys: 1.548 ± 0.849
ThrAsp: 5.418 ± 2.122
ThrGlu: 3.096 ± 0.448
ThrPhe: 3.87 ± 1.453
ThrGly: 6.192 ± 1.838
ThrHis: 0.774 ± 0.768
ThrIle: 4.644 ± 1.179
ThrLys: 1.548 ± 1.19
ThrLeu: 1.548 ± 0.84
ThrMet: 1.548 ± 0.849
ThrAsn: 5.418 ± 2.173
ThrPro: 4.644 ± 1.939
ThrGln: 2.322 ± 1.765
ThrArg: 3.87 ± 1.039
ThrSer: 7.74 ± 3.255
ThrThr: 4.644 ± 0.311
ThrVal: 4.644 ± 2.106
ThrTrp: 2.322 ± 1.025
ThrTyr: 2.322 ± 1.024
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.87 ± 1.886
ValCys: 1.548 ± 1.177
ValAsp: 4.644 ± 2.399
ValGlu: 5.418 ± 0.934
ValPhe: 0.0 ± 0.0
ValGly: 5.418 ± 1.493
ValHis: 3.87 ± 2.302
ValIle: 5.418 ± 2.077
ValLys: 0.774 ± 0.595
ValLeu: 5.418 ± 1.349
ValMet: 0.774 ± 0.595
ValAsn: 3.096 ± 1.55
ValPro: 5.418 ± 1.892
ValGln: 2.322 ± 0.484
ValArg: 1.548 ± 0.849
ValSer: 3.096 ± 1.018
ValThr: 3.87 ± 0.664
ValVal: 3.87 ± 2.178
ValTrp: 0.0 ± 0.0
ValTyr: 3.096 ± 1.523
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 1.548 ± 0.855
TrpGlu: 0.774 ± 0.771
TrpPhe: 0.774 ± 0.588
TrpGly: 1.548 ± 1.177
TrpHis: 0.774 ± 0.771
TrpIle: 0.774 ± 0.771
TrpLys: 1.548 ± 1.536
TrpLeu: 2.322 ± 0.484
TrpMet: 0.0 ± 0.0
TrpAsn: 0.774 ± 0.588
TrpPro: 1.548 ± 1.177
TrpGln: 0.0 ± 0.0
TrpArg: 3.096 ± 0.675
TrpSer: 0.0 ± 0.0
TrpThr: 3.096 ± 0.838
TrpVal: 2.322 ± 1.025
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.548 ± 0.586
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.774 ± 0.595
TyrCys: 1.548 ± 0.586
TyrAsp: 1.548 ± 0.586
TyrGlu: 2.322 ± 1.013
TyrPhe: 1.548 ± 0.855
TyrGly: 2.322 ± 1.335
TyrHis: 1.548 ± 0.855
TyrIle: 0.774 ± 0.588
TyrLys: 2.322 ± 1.249
TyrLeu: 4.644 ± 1.634
TyrMet: 0.774 ± 0.595
TyrAsn: 0.774 ± 0.595
TyrPro: 0.0 ± 0.0
TyrGln: 0.774 ± 0.768
TyrArg: 1.548 ± 0.855
TyrSer: 0.774 ± 0.588
TyrThr: 1.548 ± 1.19
TyrVal: 3.87 ± 1.164
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.548 ± 0.855
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1293 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
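One way such a protein-level bootstrap could look is sketched below. Whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. The function names, the fixed seed, the use of the sample standard deviation, and the one-letter residue codes are all assumptions for illustration; the page does not describe the exact procedure.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(sequences):
    """Pooled dipeptide frequencies in per mille (pairs counted within sequences)."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of one dipeptide's frequency over bootstrap resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample).get(pair, 0.0))
    return statistics.stdev(estimates)


# Toy sequences, not the real proteome
toy_proteins = ["AAGA", "GAAK", "KKAG", "AGAA"]
error_aa = bootstrap_error(toy_proteins, "AA")
```

With only a handful of proteins, as in this proteome, each resample can differ strongly from the original set, which is consistent with the fairly large errors relative to the values in the table.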

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski