Amino acid dipeptide frequency for Bombyx mori densovirus Zhenjiang

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
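The counting and the comparison against the uniform expectation can be sketched as follows. This is a minimal illustration, not the Proteome-pI implementation: the function names `dipeptide_frequencies` and `classify` are hypothetical, and it assumes that pairs are counted with a sliding window inside each protein and never across protein boundaries. Pairs containing a non-standard letter count toward the total but are not reported.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes
# Uniform expectation: 1000 / 400 = 2.5 per mille (0.25%)
EXPECTED_PER_MILLE = 1000 / len(AMINO_ACIDS) ** 2


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted within each protein only.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {a + b: 1000 * counts[a + b] / total
            for a, b in product(AMINO_ACIDS, repeat=2)}


def classify(freq_per_mille):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > EXPECTED_PER_MILLE:
        return "over"    # would be marked red
    if freq_per_mille < EXPECTED_PER_MILLE:
        return "under"   # would be marked blue
    return "expected"


if __name__ == "__main__":
    toy = ["MKKL", "AKK"]  # toy sequences, not taken from this proteome
    freqs = dipeptide_frequencies(toy)
    print(freqs["KK"], classify(freqs["KK"]))
```

Note that the uniform expectation ignores the unequal abundance of the individual amino acids, so a dipeptide of two common residues can exceed 2.5‰ without any pairing preference.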

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
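As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and a percentage and compares it with the uniform expectation of 2.5‰. The helper names are hypothetical; the AsnAsn value of 8.729‰ is taken from the table. It corresponds to a fraction of about 0.0087 (about 0.87%), roughly 3.5 times the uniform expectation.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value / 1000


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage."""
    return value / 10


if __name__ == "__main__":
    asn_asn = 8.729   # AsnAsn frequency from the table, in per mille
    uniform = 2.5     # 1000 / 400, the uniform expectation in per mille
    print(per_mille_to_fraction(asn_asn))   # fraction of all dipeptides
    print(per_mille_to_percent(asn_asn))    # the same value in percent
    print(asn_asn / uniform)                # fold change versus uniform
```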

For more information, see the sequence space article on Wikipedia.

Rows give the first residue and columns the second residue of each dipeptide; every cell is frequency ± error in ‰.

First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 2.037 ± 1.043 | 0.582 ± 0.449 | 2.619 ± 1.372 | 2.328 ± 0.594 | 2.328 ± 0.553 | 3.2 ± 1.031 | 0.582 ± 0.449 | 2.619 ± 1.058 | 3.491 ± 0.942 | 3.782 ± 0.59 | 0.0 ± 0.0 | 2.328 ± 1.189 | 2.328 ± 0.698 | 1.455 ± 0.342 | 2.328 ± 1.022 | 3.782 ± 1.53 | 2.91 ± 0.75 | 2.037 ± 0.526 | 0.582 ± 0.309 | 2.328 ± 0.551 | 0.0 ± 0.0
Cys | 0.582 ± 0.454 | 0.0 ± 0.0 | 0.873 ± 0.414 | 1.164 ± 0.829 | 0.582 ± 0.485 | 0.291 ± 0.309 | 0.0 ± 0.0 | 1.746 ± 0.547 | 1.455 ± 1.086 | 1.455 ± 0.454 | 0.582 ± 0.449 | 1.746 ± 0.451 | 0.291 ± 0.227 | 0.582 ± 0.309 | 1.164 ± 0.712 | 1.164 ± 0.504 | 0.873 ± 0.401 | 1.164 ± 0.696 | 0.0 ± 0.0 | 1.164 ± 0.885 | 0.0 ± 0.0
Asp | 1.746 ± 0.612 | 1.164 ± 0.638 | 3.491 ± 1.402 | 2.91 ± 1.096 | 2.328 ± 0.917 | 3.2 ± 0.845 | 0.291 ± 0.225 | 7.565 ± 1.85 | 2.619 ± 1.707 | 4.073 ± 1.402 | 0.873 ± 0.401 | 6.401 ± 1.672 | 2.91 ± 0.835 | 1.746 ± 1.024 | 2.328 ± 0.407 | 4.073 ± 1.837 | 2.619 ± 1.078 | 5.819 ± 1.403 | 1.455 ± 0.392 | 3.782 ± 1.618 | 0.0 ± 0.0
Glu | 3.491 ± 0.641 | 0.291 ± 0.225 | 4.946 ± 0.524 | 4.946 ± 3.281 | 2.619 ± 0.791 | 1.746 ± 0.438 | 0.582 ± 0.541 | 3.2 ± 0.694 | 2.328 ± 0.7 | 5.237 ± 1.061 | 1.164 ± 0.699 | 3.2 ± 1.068 | 2.328 ± 0.88 | 1.746 ± 0.736 | 2.328 ± 0.573 | 2.91 ± 1.262 | 2.91 ± 0.92 | 4.655 ± 1.299 | 0.873 ± 0.531 | 3.491 ± 1.106 | 0.0 ± 0.0
Phe | 0.873 ± 0.68 | 2.619 ± 0.708 | 3.782 ± 0.831 | 2.328 ± 0.714 | 0.291 ± 0.227 | 2.037 ± 0.625 | 2.037 ± 0.631 | 4.364 ± 1.099 | 5.237 ± 1.539 | 4.655 ± 1.071 | 0.873 ± 0.497 | 6.692 ± 0.62 | 0.582 ± 0.436 | 1.455 ± 0.83 | 2.328 ± 0.845 | 2.91 ± 0.359 | 2.91 ± 1.334 | 2.037 ± 0.987 | 0.0 ± 0.0 | 2.619 ± 1.223 | 0.0 ± 0.0
Gly | 1.746 ± 0.778 | 0.291 ± 0.422 | 2.619 ± 0.448 | 3.782 ± 1.187 | 2.328 ± 0.628 | 3.782 ± 3.386 | 0.291 ± 0.422 | 4.073 ± 0.957 | 3.491 ± 1.256 | 3.782 ± 1.03 | 0.582 ± 0.383 | 3.782 ± 1.407 | 1.746 ± 0.891 | 1.746 ± 0.612 | 1.455 ± 0.672 | 6.692 ± 1.859 | 3.491 ± 0.902 | 2.619 ± 0.363 | 0.582 ± 0.576 | 2.91 ± 0.762 | 0.0 ± 0.0
His | 0.873 ± 0.476 | 0.582 ± 0.394 | 1.164 ± 0.472 | 0.291 ± 0.227 | 0.291 ± 0.225 | 0.582 ± 0.309 | 0.291 ± 0.225 | 1.164 ± 0.662 | 1.164 ± 0.717 | 0.873 ± 0.562 | 0.291 ± 0.225 | 2.037 ± 0.616 | 0.291 ± 0.225 | 0.582 ± 0.236 | 0.0 ± 0.0 | 1.455 ± 0.916 | 1.164 ± 0.511 | 0.582 ± 0.449 | 0.0 ± 0.0 | 1.164 ± 0.577 | 0.0 ± 0.0
Ile | 3.2 ± 1.315 | 2.037 ± 0.761 | 4.655 ± 1.369 | 5.528 ± 1.047 | 3.2 ± 1.63 | 3.2 ± 0.622 | 2.328 ± 1.484 | 7.565 ± 1.722 | 5.528 ± 0.739 | 4.655 ± 0.988 | 0.291 ± 0.227 | 7.274 ± 2.748 | 6.11 ± 1.249 | 2.619 ± 0.864 | 6.401 ± 0.863 | 6.401 ± 2.638 | 5.819 ± 1.1 | 4.073 ± 0.794 | 0.873 ± 0.401 | 4.073 ± 0.791 | 0.0 ± 0.0
Lys | 2.91 ± 1.086 | 1.455 ± 1.023 | 5.528 ± 2.637 | 1.455 ± 0.969 | 3.782 ± 1.095 | 4.073 ± 2.321 | 1.746 ± 0.45 | 4.946 ± 0.861 | 2.91 ± 1.683 | 5.528 ± 2.406 | 0.873 ± 0.456 | 3.491 ± 0.772 | 2.91 ± 1.05 | 2.328 ± 0.682 | 3.2 ± 1.133 | 5.528 ± 1.088 | 3.782 ± 0.858 | 2.328 ± 0.714 | 1.746 ± 0.777 | 4.073 ± 1.557 | 0.0 ± 0.0
Leu | 5.819 ± 1.664 | 0.873 ± 0.472 | 6.401 ± 1.62 | 6.401 ± 1.203 | 4.655 ± 1.082 | 3.2 ± 1.091 | 2.037 ± 0.761 | 5.819 ± 1.063 | 4.073 ± 1.025 | 5.528 ± 0.835 | 1.746 ± 0.809 | 6.401 ± 1.558 | 2.619 ± 0.888 | 2.328 ± 0.993 | 3.782 ± 1.256 | 7.565 ± 1.67 | 6.692 ± 0.661 | 2.91 ± 0.677 | 0.873 ± 0.414 | 4.073 ± 0.632 | 0.0 ± 0.0
Met | 0.582 ± 0.64 | 0.0 ± 0.0 | 1.746 ± 0.221 | 0.582 ± 0.76 | 0.582 ± 0.236 | 0.291 ± 0.309 | 0.291 ± 0.225 | 1.164 ± 0.577 | 1.746 ± 0.862 | 1.746 ± 0.631 | 0.291 ± 0.422 | 1.164 ± 0.346 | 1.455 ± 0.619 | 0.291 ± 0.225 | 0.582 ± 0.536 | 1.164 ± 0.362 | 0.873 ± 0.446 | 0.0 ± 0.0 | 0.582 ± 0.576 | 0.582 ± 0.449 | 0.0 ± 0.0
Asn | 2.037 ± 0.55 | 2.037 ± 0.556 | 4.073 ± 1.809 | 3.491 ± 0.836 | 7.274 ± 1.287 | 3.2 ± 1.621 | 0.291 ± 0.309 | 8.147 ± 1.594 | 6.11 ± 2.252 | 5.528 ± 1.406 | 1.164 ± 0.346 | 8.729 ± 1.534 | 3.782 ± 1.134 | 3.491 ± 1.759 | 4.364 ± 1.375 | 4.946 ± 0.799 | 4.655 ± 1.417 | 5.528 ± 0.868 | 0.582 ± 0.48 | 2.328 ± 0.492 | 0.0 ± 0.0
Pro | 2.328 ± 1.484 | 0.291 ± 0.309 | 2.037 ± 0.782 | 2.619 ± 0.545 | 2.328 ± 0.671 | 2.619 ± 1.025 | 0.873 ± 0.251 | 5.237 ± 1.714 | 2.328 ± 1.732 | 2.91 ± 0.732 | 0.873 ± 0.599 | 2.328 ± 0.708 | 0.873 ± 0.476 | 1.455 ± 0.72 | 1.455 ± 0.853 | 3.782 ± 1.664 | 2.328 ± 0.685 | 1.746 ± 0.481 | 0.291 ± 0.227 | 2.328 ± 0.7 | 0.0 ± 0.0
Gln | 0.582 ± 0.454 | 0.291 ± 0.227 | 1.455 ± 0.376 | 3.2 ± 1.182 | 1.164 ± 0.66 | 0.873 ± 0.576 | 0.582 ± 0.33 | 2.91 ± 0.93 | 2.037 ± 0.628 | 4.946 ± 1.917 | 0.582 ± 0.401 | 2.037 ± 0.765 | 0.873 ± 0.576 | 0.873 ± 0.497 | 2.037 ± 0.621 | 0.873 ± 0.446 | 2.619 ± 0.903 | 1.746 ± 0.606 | 0.582 ± 0.454 | 2.037 ± 0.56 | 0.0 ± 0.0
Arg | 2.91 ± 1.197 | 0.582 ± 0.454 | 4.073 ± 0.923 | 3.2 ± 0.842 | 2.328 ± 1.33 | 3.2 ± 0.579 | 0.291 ± 0.225 | 3.491 ± 0.873 | 2.91 ± 0.796 | 4.946 ± 1.829 | 0.582 ± 0.485 | 4.073 ± 0.914 | 0.873 ± 0.536 | 2.037 ± 1.033 | 0.873 ± 0.405 | 3.2 ± 1.417 | 2.037 ± 1.124 | 1.746 ± 0.571 | 0.0 ± 0.0 | 3.2 ± 1.167 | 0.0 ± 0.0
Ser | 5.237 ± 2.259 | 1.164 ± 0.6 | 3.782 ± 1.679 | 1.746 ± 1.425 | 5.819 ± 1.368 | 4.946 ± 1.314 | 0.582 ± 0.33 | 5.237 ± 1.147 | 6.983 ± 1.854 | 8.147 ± 1.675 | 1.455 ± 0.593 | 6.401 ± 2.244 | 2.328 ± 0.671 | 0.582 ± 0.454 | 3.491 ± 1.351 | 5.819 ± 1.419 | 5.819 ± 1.026 | 2.619 ± 0.782 | 0.873 ± 0.701 | 2.328 ± 0.764 | 0.0 ± 0.0
Thr | 1.746 ± 0.511 | 0.873 ± 0.569 | 2.619 ± 1.045 | 3.2 ± 0.66 | 3.2 ± 0.872 | 5.819 ± 2.149 | 0.582 ± 0.394 | 6.11 ± 2.127 | 2.91 ± 1.93 | 4.946 ± 1.807 | 1.746 ± 0.515 | 4.364 ± 1.179 | 4.655 ± 0.88 | 2.619 ± 0.864 | 3.491 ± 2.068 | 4.655 ± 1.054 | 4.946 ± 1.156 | 3.491 ± 1.837 | 0.291 ± 0.309 | 2.328 ± 0.895 | 0.0 ± 0.0
Val | 1.746 ± 0.614 | 0.873 ± 0.414 | 1.746 ± 0.547 | 2.619 ± 1.14 | 3.491 ± 0.717 | 2.037 ± 0.852 | 0.873 ± 0.405 | 6.11 ± 1.307 | 2.328 ± 0.87 | 4.946 ± 0.867 | 0.582 ± 0.536 | 4.073 ± 1.19 | 2.037 ± 0.79 | 3.491 ± 1.429 | 1.455 ± 0.733 | 2.619 ± 0.702 | 3.2 ± 0.872 | 2.037 ± 0.56 | 0.291 ± 0.225 | 2.91 ± 0.637 | 0.0 ± 0.0
Trp | 0.0 ± 0.0 | 0.291 ± 0.498 | 0.873 ± 0.414 | 0.582 ± 0.618 | 0.291 ± 0.225 | 1.455 ± 0.709 | 0.0 ± 0.0 | 0.873 ± 0.599 | 0.873 ± 0.512 | 0.873 ± 0.251 | 0.291 ± 0.227 | 1.164 ± 0.959 | 0.291 ± 0.498 | 0.0 ± 0.0 | 0.291 ± 0.227 | 1.746 ± 0.515 | 0.291 ± 0.227 | 0.582 ± 0.236 | 0.0 ± 0.0 | 0.291 ± 0.498 | 0.0 ± 0.0
Tyr | 3.2 ± 0.904 | 0.582 ± 0.449 | 2.91 ± 1.309 | 2.91 ± 0.866 | 1.746 ± 0.527 | 2.037 ± 0.628 | 0.291 ± 0.309 | 3.491 ± 1.515 | 4.073 ± 1.215 | 5.237 ± 1.542 | 0.582 ± 0.485 | 4.073 ± 1.863 | 1.746 ± 0.551 | 0.873 ± 1.266 | 3.2 ± 1.623 | 4.073 ± 1.224 | 4.655 ± 1.098 | 1.746 ± 0.708 | 0.582 ± 0.485 | 5.819 ± 2.111 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 6 proteins (3438 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
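Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread of those estimates is reported as the error. This is only an illustration under stated assumptions. The page does not say which spread statistic Proteome-pI uses, so the sample standard deviation is an assumption here, and the function names and the fixed seed are hypothetical.

```python
import random
import statistics


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide in per mille, counted within each protein."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Spread of the frequency over resampled protein sets.

    Assumption: the error is the sample standard deviation of the
    bootstrap replicates; the source does not specify the estimator.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["MKKLAA", "AKKNN", "GGSKK"]  # toy sequences, not this proteome
    print(dipeptide_per_mille(toy, "KK"), bootstrap_error(toy, "KK"))
```

A dipeptide that never occurs has the same frequency in every resampled set, which is consistent with the 0.0 ± 0.0 entries in the table. With only 6 proteins, the resampled sets differ strongly from one another, so relatively large errors are to be expected.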

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski