Amino acid dipeptide frequency for Primate bocaparvovirus 1 (strain Human bocavirus 1 type 1) (HBoV1) (Human bocavirus type 1)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be expected at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility the dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
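The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used to build the database: the function name `dipeptide_per_mille`, the toy sequences, and the choice to count overlapping pairs only within each protein (never across protein boundaries) are assumptions made for the example.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Assumption: overlapping pairs are counted within each protein only.
    """
    counts = Counter()
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            # Anything outside the 20 standard residues is mapped to X (Xaa).
            first = first if first in AMINO_ACIDS else "X"
            second = second if second in AMINO_ACIDS else "X"
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        raise ValueError("need at least one sequence with two or more residues")
    alphabet = AMINO_ACIDS + "X"
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(alphabet, repeat=2)}


toy = ["MAAGL", "GGLLK"]  # hypothetical sequences, not from this proteome
freqs = dipeptide_per_mille(toy)

# Expectation if all 400 standard dipeptides were equally likely: 2.5 per mille.
uniform_expectation = 1000.0 / 400
overrepresented = sorted(d for d, f in freqs.items() if f > uniform_expectation)
```

A table value is converted back to a fraction by multiplying by 10^-3, for example 2.5‰ corresponds to 0.0025.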

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
4.339AlaAla: 4.339 ± 0.676
0.0AlaCys: 0.0 ± 0.0
6.075AlaAsp: 6.075 ± 0.758
2.17AlaGlu: 2.17 ± 0.945
1.953AlaPhe: 1.953 ± 0.933
1.953AlaGly: 1.953 ± 0.84
0.651AlaHis: 0.651 ± 0.311
1.302AlaIle: 1.302 ± 0.236
3.038AlaLys: 3.038 ± 0.424
4.99AlaLeu: 4.99 ± 0.784
1.302AlaMet: 1.302 ± 0.56
1.953AlaAsn: 1.953 ± 0.357
4.773AlaPro: 4.773 ± 0.402
3.471AlaGln: 3.471 ± 0.504
0.868AlaArg: 0.868 ± 0.333
7.594AlaSer: 7.594 ± 1.097
3.905AlaThr: 3.905 ± 0.653
2.821AlaVal: 2.821 ± 0.849
0.651AlaTrp: 0.651 ± 0.311
3.255AlaTyr: 3.255 ± 0.714
0.0AlaXaa: 0.0 ± 0.0
Cys
1.302CysAla: 1.302 ± 0.622
0.651CysCys: 0.651 ± 0.254
1.302CysAsp: 1.302 ± 0.213
1.519CysGlu: 1.519 ± 0.557
2.17CysPhe: 2.17 ± 0.618
1.302CysGly: 1.302 ± 0.421
1.302CysHis: 1.302 ± 0.213
1.302CysIle: 1.302 ± 0.452
2.604CysLys: 2.604 ± 0.52
1.302CysLeu: 1.302 ± 0.236
0.651CysMet: 0.651 ± 0.28
0.217CysAsn: 0.217 ± 0.252
2.17CysPro: 2.17 ± 0.618
0.0CysGln: 0.0 ± 0.0
1.302CysArg: 1.302 ± 0.508
0.651CysSer: 0.651 ± 0.254
3.905CysThr: 3.905 ± 0.195
1.953CysVal: 1.953 ± 0.402
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
0.868AspAla: 0.868 ± 0.329
3.255AspCys: 3.255 ± 0.482
1.302AspAsp: 1.302 ± 0.398
5.424AspGlu: 5.424 ± 1.326
1.085AspPhe: 1.085 ± 0.329
3.038AspGly: 3.038 ± 0.698
1.519AspHis: 1.519 ± 0.654
4.99AspIle: 4.99 ± 0.782
2.821AspLys: 2.821 ± 0.43
5.858AspLeu: 5.858 ± 0.495
0.0AspMet: 0.0 ± 0.0
4.773AspAsn: 4.773 ± 1.262
3.471AspPro: 3.471 ± 0.518
3.038AspGln: 3.038 ± 0.726
3.255AspArg: 3.255 ± 0.583
2.821AspSer: 2.821 ± 0.429
3.471AspThr: 3.471 ± 0.979
2.17AspVal: 2.17 ± 0.257
1.519AspTrp: 1.519 ± 0.313
0.868AspTyr: 0.868 ± 0.333
0.0AspXaa: 0.0 ± 0.0
Glu
2.604GluAla: 2.604 ± 0.772
1.302GluCys: 1.302 ± 0.508
6.509GluAsp: 6.509 ± 1.694
3.255GluGlu: 3.255 ± 0.906
0.651GluPhe: 0.651 ± 0.28
3.471GluGly: 3.471 ± 0.219
2.604GluHis: 2.604 ± 0.372
2.821GluIle: 2.821 ± 0.445
4.339GluLys: 4.339 ± 0.861
3.905GluLeu: 3.905 ± 0.459
0.434GluMet: 0.434 ± 0.503
3.038GluAsn: 3.038 ± 0.629
3.688GluPro: 3.688 ± 0.721
2.17GluGln: 2.17 ± 0.61
6.292GluArg: 6.292 ± 0.872
2.604GluSer: 2.604 ± 0.606
3.471GluThr: 3.471 ± 0.807
1.953GluVal: 1.953 ± 0.257
1.302GluTrp: 1.302 ± 0.213
3.688GluTyr: 3.688 ± 0.503
0.0GluXaa: 0.0 ± 0.0
Phe
0.434PheAla: 0.434 ± 0.217
0.651PheCys: 0.651 ± 0.28
1.953PheAsp: 1.953 ± 0.84
1.519PheGlu: 1.519 ± 0.507
2.387PhePhe: 2.387 ± 0.335
2.604PheGly: 2.604 ± 0.465
1.302PheHis: 1.302 ± 0.452
3.038PheIle: 3.038 ± 0.397
2.17PheLys: 2.17 ± 0.686
0.651PheLeu: 0.651 ± 0.28
1.302PheMet: 1.302 ± 0.236
6.292PheAsn: 6.292 ± 0.536
3.255PhePro: 3.255 ± 0.399
0.868PheGln: 0.868 ± 0.329
0.868PheArg: 0.868 ± 0.329
2.17PheSer: 2.17 ± 0.366
3.038PheThr: 3.038 ± 0.433
0.651PheVal: 0.651 ± 0.254
0.217PheTrp: 0.217 ± 0.252
1.953PheTyr: 1.953 ± 0.369
0.0PheXaa: 0.0 ± 0.0
Gly
3.688GlyAla: 3.688 ± 0.663
1.302GlyCys: 1.302 ± 0.236
3.905GlyAsp: 3.905 ± 0.659
8.896GlyGlu: 8.896 ± 1.396
3.688GlyPhe: 3.688 ± 0.345
11.499GlyGly: 11.499 ± 0.928
1.302GlyHis: 1.302 ± 0.236
1.519GlyIle: 1.519 ± 0.313
3.038GlyLys: 3.038 ± 0.467
3.255GlyLeu: 3.255 ± 0.637
1.302GlyMet: 1.302 ± 0.236
5.207GlyAsn: 5.207 ± 1.127
3.905GlyPro: 3.905 ± 0.272
1.302GlyGln: 1.302 ± 0.236
1.736GlyArg: 1.736 ± 0.218
4.773GlySer: 4.773 ± 1.734
5.424GlyThr: 5.424 ± 0.666
2.604GlyVal: 2.604 ± 0.291
1.302GlyTrp: 1.302 ± 0.543
3.688GlyTyr: 3.688 ± 0.519
0.0GlyXaa: 0.0 ± 0.0
His
4.99HisAla: 4.99 ± 0.977
1.519HisCys: 1.519 ± 0.507
0.868HisAsp: 0.868 ± 0.255
1.085HisGlu: 1.085 ± 0.329
2.604HisPhe: 2.604 ± 0.499
0.651HisGly: 0.651 ± 0.311
0.868HisHis: 0.868 ± 0.333
2.604HisIle: 2.604 ± 0.291
0.651HisLys: 0.651 ± 0.311
3.471HisLeu: 3.471 ± 0.6
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
2.821HisPro: 2.821 ± 0.225
1.519HisGln: 1.519 ± 0.261
1.302HisArg: 1.302 ± 0.763
2.17HisSer: 2.17 ± 0.673
2.604HisThr: 2.604 ± 0.426
1.302HisVal: 1.302 ± 0.236
0.217HisTrp: 0.217 ± 0.252
0.651HisTyr: 0.651 ± 0.311
0.0HisXaa: 0.0 ± 0.0
Ile
2.17IleAla: 2.17 ± 0.428
0.0IleCys: 0.0 ± 0.0
1.085IleAsp: 1.085 ± 0.3
1.953IleGlu: 1.953 ± 0.451
2.17IlePhe: 2.17 ± 0.366
2.17IleGly: 2.17 ± 0.484
0.0IleHis: 0.0 ± 0.0
4.122IleIle: 4.122 ± 1.079
1.953IleLys: 1.953 ± 0.916
3.255IleLeu: 3.255 ± 0.817
0.651IleMet: 0.651 ± 0.28
2.604IleAsn: 2.604 ± 0.291
3.255IlePro: 3.255 ± 0.51
5.207IleGln: 5.207 ± 0.582
2.821IleArg: 2.821 ± 0.507
3.688IleSer: 3.688 ± 0.62
4.122IleThr: 4.122 ± 0.421
3.471IleVal: 3.471 ± 0.62
1.302IleTrp: 1.302 ± 0.56
1.953IleTyr: 1.953 ± 0.451
0.0IleXaa: 0.0 ± 0.0
Lys
4.122LysAla: 4.122 ± 0.501
1.302LysCys: 1.302 ± 0.508
0.651LysAsp: 0.651 ± 0.491
3.688LysGlu: 3.688 ± 0.992
0.868LysPhe: 0.868 ± 0.305
2.604LysGly: 2.604 ± 0.572
1.519LysHis: 1.519 ± 0.261
2.387LysIle: 2.387 ± 0.496
4.99LysLys: 4.99 ± 0.61
1.953LysLeu: 1.953 ± 0.451
1.302LysMet: 1.302 ± 0.429
4.122LysAsn: 4.122 ± 0.895
3.038LysPro: 3.038 ± 0.424
4.122LysGln: 4.122 ± 0.922
6.075LysArg: 6.075 ± 1.246
2.604LysSer: 2.604 ± 0.485
4.339LysThr: 4.339 ± 0.733
1.302LysVal: 1.302 ± 0.56
0.651LysTrp: 0.651 ± 0.254
0.868LysTyr: 0.868 ± 0.329
0.0LysXaa: 0.0 ± 0.0
Leu
6.292LeuAla: 6.292 ± 1.059
1.953LeuCys: 1.953 ± 0.664
3.688LeuAsp: 3.688 ± 0.622
2.604LeuGlu: 2.604 ± 0.291
1.519LeuPhe: 1.519 ± 0.261
6.292LeuGly: 6.292 ± 0.61
4.773LeuHis: 4.773 ± 1.342
3.471LeuIle: 3.471 ± 1.121
1.519LeuLys: 1.519 ± 0.414
12.15LeuLeu: 12.15 ± 1.974
3.688LeuMet: 3.688 ± 0.6
3.905LeuAsn: 3.905 ± 1.119
5.641LeuPro: 5.641 ± 1.178
4.339LeuGln: 4.339 ± 0.282
4.339LeuArg: 4.339 ± 0.645
3.038LeuSer: 3.038 ± 0.663
4.122LeuThr: 4.122 ± 0.545
2.604LeuVal: 2.604 ± 0.63
0.0LeuTrp: 0.0 ± 0.0
2.821LeuTyr: 2.821 ± 0.796
0.0LeuXaa: 0.0 ± 0.0
Met
3.905MetAla: 3.905 ± 0.656
0.0MetCys: 0.0 ± 0.0
1.302MetAsp: 1.302 ± 0.56
0.217MetGlu: 0.217 ± 0.252
2.17MetPhe: 2.17 ± 0.366
0.0MetGly: 0.0 ± 0.0
1.302MetHis: 1.302 ± 0.508
0.651MetIle: 0.651 ± 0.311
1.302MetLys: 1.302 ± 0.763
2.17MetLeu: 2.17 ± 0.397
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
2.17MetPro: 2.17 ± 0.928
1.302MetGln: 1.302 ± 0.213
0.0MetArg: 0.0 ± 0.0
2.17MetSer: 2.17 ± 0.366
0.651MetThr: 0.651 ± 0.28
0.651MetVal: 0.651 ± 0.28
0.0MetTrp: 0.0 ± 0.0
1.085MetTyr: 1.085 ± 0.329
0.0MetXaa: 0.0 ± 0.0
Asn
3.471AsnAla: 3.471 ± 0.998
1.302AsnCys: 1.302 ± 0.452
1.302AsnAsp: 1.302 ± 0.236
3.471AsnGlu: 3.471 ± 0.439
2.604AsnPhe: 2.604 ± 0.472
5.207AsnGly: 5.207 ± 1.042
1.302AsnHis: 1.302 ± 0.508
1.953AsnIle: 1.953 ± 0.84
4.556AsnLys: 4.556 ± 0.824
4.556AsnLeu: 4.556 ± 0.583
2.604AsnMet: 2.604 ± 0.983
2.387AsnAsn: 2.387 ± 0.896
4.773AsnPro: 4.773 ± 0.872
3.688AsnGln: 3.688 ± 0.949
0.868AsnArg: 0.868 ± 0.383
5.641AsnSer: 5.641 ± 0.913
3.688AsnThr: 3.688 ± 0.529
0.868AsnVal: 0.868 ± 0.333
1.302AsnTrp: 1.302 ± 0.213
2.17AsnTyr: 2.17 ± 0.458
0.0AsnXaa: 0.0 ± 0.0
Pro
4.122ProAla: 4.122 ± 0.879
0.0ProCys: 0.0 ± 0.0
4.122ProAsp: 4.122 ± 0.679
6.292ProGlu: 6.292 ± 0.495
2.821ProPhe: 2.821 ± 0.732
4.122ProGly: 4.122 ± 0.679
1.736ProHis: 1.736 ± 0.554
3.471ProIle: 3.471 ± 0.821
3.255ProLys: 3.255 ± 0.436
3.471ProLeu: 3.471 ± 1.049
0.0ProMet: 0.0 ± 0.0
4.99ProAsn: 4.99 ± 0.547
5.424ProPro: 5.424 ± 0.821
4.773ProGln: 4.773 ± 0.339
2.387ProArg: 2.387 ± 0.335
2.604ProSer: 2.604 ± 0.488
6.075ProThr: 6.075 ± 1.643
3.471ProVal: 3.471 ± 0.409
2.387ProTrp: 2.387 ± 0.379
2.387ProTyr: 2.387 ± 0.439
0.0ProXaa: 0.0 ± 0.0
Gln
0.868GlnAla: 0.868 ± 0.333
0.651GlnCys: 0.651 ± 0.28
2.604GlnAsp: 2.604 ± 0.472
1.953GlnGlu: 1.953 ± 0.732
2.17GlnPhe: 2.17 ± 0.257
1.302GlnGly: 1.302 ± 0.508
0.217GlnHis: 0.217 ± 0.252
2.17GlnIle: 2.17 ± 0.366
2.387GlnLys: 2.387 ± 0.563
3.688GlnLeu: 3.688 ± 0.622
1.519GlnMet: 1.519 ± 0.31
2.604GlnAsn: 2.604 ± 0.713
6.075GlnPro: 6.075 ± 0.715
1.085GlnGln: 1.085 ± 0.514
4.556GlnArg: 4.556 ± 0.473
2.604GlnSer: 2.604 ± 0.426
4.339GlnThr: 4.339 ± 0.513
4.339GlnVal: 4.339 ± 0.778
2.821GlnTrp: 2.821 ± 0.362
2.604GlnTyr: 2.604 ± 0.643
0.0GlnXaa: 0.0 ± 0.0
Arg
4.122ArgAla: 4.122 ± 0.537
2.821ArgCys: 2.821 ± 0.742
3.688ArgAsp: 3.688 ± 0.62
3.471ArgGlu: 3.471 ± 0.775
1.953ArgPhe: 1.953 ± 0.389
3.255ArgGly: 3.255 ± 0.892
2.387ArgHis: 2.387 ± 0.672
1.519ArgIle: 1.519 ± 0.261
1.953ArgLys: 1.953 ± 0.715
4.99ArgLeu: 4.99 ± 1.026
0.0ArgMet: 0.0 ± 0.0
2.387ArgAsn: 2.387 ± 1.022
2.821ArgPro: 2.821 ± 0.328
3.905ArgGln: 3.905 ± 0.526
3.038ArgArg: 3.038 ± 0.324
0.868ArgSer: 0.868 ± 1.006
0.868ArgThr: 0.868 ± 0.329
1.953ArgVal: 1.953 ± 0.451
0.0ArgTrp: 0.0 ± 0.0
0.868ArgTyr: 0.868 ± 0.388
0.0ArgXaa: 0.0 ± 0.0
Ser
1.953SerAla: 1.953 ± 0.287
2.17SerCys: 2.17 ± 0.366
6.292SerAsp: 6.292 ± 0.894
2.387SerGlu: 2.387 ± 0.444
0.868SerPhe: 0.868 ± 0.255
4.556SerGly: 4.556 ± 1.254
3.471SerHis: 3.471 ± 0.335
1.736SerIle: 1.736 ± 0.635
2.821SerLys: 2.821 ± 0.488
4.122SerLeu: 4.122 ± 0.414
2.604SerMet: 2.604 ± 0.372
3.471SerAsn: 3.471 ± 0.853
3.471SerPro: 3.471 ± 0.775
4.122SerGln: 4.122 ± 0.774
1.302SerArg: 1.302 ± 0.626
6.075SerSer: 6.075 ± 1.251
6.943SerThr: 6.943 ± 0.562
2.821SerVal: 2.821 ± 0.843
0.651SerTrp: 0.651 ± 0.28
2.387SerTyr: 2.387 ± 0.494
0.0SerXaa: 0.0 ± 0.0
Thr
5.641ThrAla: 5.641 ± 0.914
0.0ThrCys: 0.0 ± 0.0
3.038ThrAsp: 3.038 ± 0.523
4.99ThrGlu: 4.99 ± 0.179
2.17ThrPhe: 2.17 ± 0.267
8.896ThrGly: 8.896 ± 1.765
1.519ThrHis: 1.519 ± 0.261
2.821ThrIle: 2.821 ± 0.715
1.736ThrLys: 1.736 ± 0.728
5.424ThrLeu: 5.424 ± 1.691
0.651ThrMet: 0.651 ± 0.311
4.99ThrAsn: 4.99 ± 1.094
4.556ThrPro: 4.556 ± 0.493
1.519ThrGln: 1.519 ± 0.427
3.255ThrArg: 3.255 ± 0.801
6.075ThrSer: 6.075 ± 0.403
5.424ThrThr: 5.424 ± 0.787
2.17ThrVal: 2.17 ± 0.257
1.519ThrTrp: 1.519 ± 0.296
5.207ThrTyr: 5.207 ± 0.582
0.0ThrXaa: 0.0 ± 0.0
Val
0.868ValAla: 0.868 ± 0.255
2.604ValCys: 2.604 ± 0.465
2.821ValAsp: 2.821 ± 0.328
1.302ValGlu: 1.302 ± 0.236
1.519ValPhe: 1.519 ± 0.488
5.207ValGly: 5.207 ± 0.705
1.302ValHis: 1.302 ± 0.236
2.604ValIle: 2.604 ± 0.905
1.953ValLys: 1.953 ± 0.451
5.641ValLeu: 5.641 ± 0.756
0.868ValMet: 0.868 ± 0.329
1.519ValAsn: 1.519 ± 0.313
1.302ValPro: 1.302 ± 0.56
1.953ValGln: 1.953 ± 0.257
0.0ValArg: 0.0 ± 0.0
3.038ValSer: 3.038 ± 0.44
2.604ValThr: 2.604 ± 0.291
1.953ValVal: 1.953 ± 0.402
1.953ValTrp: 1.953 ± 0.451
1.519ValTyr: 1.519 ± 0.488
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
1.519TrpCys: 1.519 ± 0.261
1.736TrpAsp: 1.736 ± 0.658
2.387TrpGlu: 2.387 ± 0.274
0.651TrpPhe: 0.651 ± 0.311
1.519TrpGly: 1.519 ± 0.507
1.302TrpHis: 1.302 ± 0.626
0.0TrpIle: 0.0 ± 0.0
1.302TrpLys: 1.302 ± 0.213
0.651TrpLeu: 0.651 ± 0.311
1.302TrpMet: 1.302 ± 0.56
0.651TrpAsn: 0.651 ± 0.254
0.0TrpPro: 0.0 ± 0.0
0.868TrpGln: 0.868 ± 0.329
0.651TrpArg: 0.651 ± 0.28
0.217TrpSer: 0.217 ± 0.175
0.651TrpThr: 0.651 ± 0.28
2.17TrpVal: 2.17 ± 0.536
0.651TrpTrp: 0.651 ± 0.254
0.651TrpTyr: 0.651 ± 0.28
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.302TyrAla: 1.302 ± 0.236
2.604TyrCys: 2.604 ± 0.488
1.302TyrAsp: 1.302 ± 0.236
1.519TyrGlu: 1.519 ± 0.427
1.302TyrPhe: 1.302 ± 0.573
3.688TyrGly: 3.688 ± 0.622
1.519TyrHis: 1.519 ± 0.249
3.471TyrIle: 3.471 ± 0.979
4.122TyrLys: 4.122 ± 1.211
3.471TyrLeu: 3.471 ± 0.424
0.651TyrMet: 0.651 ± 0.311
2.604TyrAsn: 2.604 ± 0.713
1.302TyrPro: 1.302 ± 0.213
1.302TyrGln: 1.302 ± 0.56
1.953TyrArg: 1.953 ± 0.713
2.821TyrSer: 2.821 ± 0.477
1.953TyrThr: 1.953 ± 0.287
1.302TyrVal: 1.302 ± 0.213
0.434TyrTrp: 0.434 ± 0.503
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (4610 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
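One plausible reading of bootstrapping at the protein level is to resample whole proteins with replacement, recompute the dipeptide frequency for each resampled set, and report the spread across replicates. The sketch below follows that reading; the exact procedure used by the database is not described on this page, and the function names, the fixed seed, and the toy sequences are assumptions made for illustration.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Each replicate draws len(proteins) proteins with replacement.
    """
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(s) for s in proteins]
    values = []
    for _ in range(n_replicates):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        values.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(values), statistics.stdev(values)


toy = ["MAAGL", "GGLLK", "GLGLA"]  # hypothetical sequences
mean_gl, sd_gl = bootstrap_error(toy, "GL")
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, so the error reflects variation between proteins in the proteome.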

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski