Amino acid dipeptide frequency for Burkholderia virus BcepC6B

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
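As a rough illustration of what such a calculation involves, the following minimal Python sketch counts overlapping pairs of adjacent residues and reports them in per mille. The function name `dipeptide_per_mille`, the use of one-letter codes, and the choice to count pairs within each protein only are assumptions made for this example; the source does not state how protein boundaries are handled.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted within each protein only,
    so no pair spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # zip(seq, seq[1:]) yields every pair of adjacent residues
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input in one-letter codes, not taken from the BcepC6B proteome
freqs = dipeptide_per_mille(["AAGA", "GA"])
print(freqs)
```

The table below uses three-letter codes (for example AlaGly rather than AG); mapping between the two notations is left out of the sketch.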

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely in proteins, each would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
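The comparison against the uniform expectation can be sketched as follows. This is only an illustration under the assumption that the baseline is exactly 1/400 for every dipeptide, as the text describes; the helper `classify` is hypothetical, the bootstrap errors are ignored, and the four sample values are copied from the table below.

```python
N_STANDARD = 20
# 1000 / 400 = 2.5 per mille, i.e. 0.25 %
expected_per_mille = 1000.0 / N_STANDARD**2


def classify(observed_per_mille, expected=expected_per_mille):
    """Label a dipeptide relative to the uniform expectation."""
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


# A few values taken from the table (per mille)
sample = {"AlaAla": 22.47, "CysCys": 0.152, "ProVal": 3.72, "AsnAsn": 1.215}
for pair, value in sample.items():
    print(pair, value, classify(value))
```

With these values AlaAla and ProVal fall above 2.5‰, while CysCys and AsnAsn fall below it.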

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
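As a small worked example of the unit conversion, the sketch below turns the AlaAla value from the table into a fraction and a percentage. The helper name `per_mille_to_fraction` is an assumption made for illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


ala_ala = 22.47  # AlaAla value from the table, in per mille
fraction = per_mille_to_fraction(ala_ala)
percent = fraction * 100.0
print(f"{ala_ala} per mille = {fraction:.5f} = {percent:.3f} %")
```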

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
22.47AlaAla: 22.47 ± 2.395
1.215AlaCys: 1.215 ± 0.331
8.275AlaAsp: 8.275 ± 0.793
6.377AlaGlu: 6.377 ± 0.868
2.885AlaPhe: 2.885 ± 0.436
13.285AlaGly: 13.285 ± 1.28
2.505AlaHis: 2.505 ± 0.475
5.769AlaIle: 5.769 ± 0.673
5.542AlaLys: 5.542 ± 0.839
10.476AlaLeu: 10.476 ± 0.915
2.885AlaMet: 2.885 ± 0.444
3.947AlaAsn: 3.947 ± 0.498
6.301AlaPro: 6.301 ± 0.621
6.073AlaGln: 6.073 ± 1.064
10.172AlaArg: 10.172 ± 1.019
6.68AlaSer: 6.68 ± 1.087
7.364AlaThr: 7.364 ± 0.848
7.819AlaVal: 7.819 ± 0.681
1.746AlaTrp: 1.746 ± 0.389
3.037AlaTyr: 3.037 ± 0.426
0.0AlaXaa: 0.0 ± 0.0
Cys
0.607CysAla: 0.607 ± 0.238
0.152CysCys: 0.152 ± 0.119
0.228CysAsp: 0.228 ± 0.164
0.835CysGlu: 0.835 ± 0.212
0.304CysPhe: 0.304 ± 0.11
0.835CysGly: 0.835 ± 0.237
0.152CysHis: 0.152 ± 0.106
0.38CysIle: 0.38 ± 0.36
0.38CysLys: 0.38 ± 0.167
0.455CysLeu: 0.455 ± 0.162
0.304CysMet: 0.304 ± 0.174
0.38CysAsn: 0.38 ± 0.171
0.455CysPro: 0.455 ± 0.219
0.455CysGln: 0.455 ± 0.197
0.683CysArg: 0.683 ± 0.209
0.531CysSer: 0.531 ± 0.283
0.455CysThr: 0.455 ± 0.199
0.835CysVal: 0.835 ± 0.226
0.304CysTrp: 0.304 ± 0.152
0.304CysTyr: 0.304 ± 0.178
0.0CysXaa: 0.0 ± 0.0
Asp
7.515AspAla: 7.515 ± 0.934
0.38AspCys: 0.38 ± 0.145
5.162AspAsp: 5.162 ± 0.622
4.023AspGlu: 4.023 ± 0.719
1.974AspPhe: 1.974 ± 0.427
6.149AspGly: 6.149 ± 0.648
1.215AspHis: 1.215 ± 0.276
1.518AspIle: 1.518 ± 0.376
2.126AspLys: 2.126 ± 0.429
5.39AspLeu: 5.39 ± 0.68
1.898AspMet: 1.898 ± 0.41
2.277AspAsn: 2.277 ± 0.498
3.568AspPro: 3.568 ± 0.424
2.277AspGln: 2.277 ± 0.485
3.72AspArg: 3.72 ± 0.568
3.492AspSer: 3.492 ± 0.571
3.34AspThr: 3.34 ± 0.462
4.631AspVal: 4.631 ± 0.559
1.518AspTrp: 1.518 ± 0.361
1.746AspTyr: 1.746 ± 0.362
0.0AspXaa: 0.0 ± 0.0
Glu
6.453GluAla: 6.453 ± 0.722
0.683GluCys: 0.683 ± 0.263
2.657GluAsp: 2.657 ± 0.382
1.822GluGlu: 1.822 ± 0.424
1.366GluPhe: 1.366 ± 0.334
3.264GluGly: 3.264 ± 0.431
1.518GluHis: 1.518 ± 0.41
2.505GluIle: 2.505 ± 0.534
2.277GluLys: 2.277 ± 0.563
6.073GluLeu: 6.073 ± 1.077
0.759GluMet: 0.759 ± 0.197
2.05GluAsn: 2.05 ± 0.29
2.126GluPro: 2.126 ± 0.443
2.126GluGln: 2.126 ± 0.429
4.403GluArg: 4.403 ± 0.666
1.822GluSer: 1.822 ± 0.482
3.492GluThr: 3.492 ± 0.454
3.568GluVal: 3.568 ± 0.496
1.139GluTrp: 1.139 ± 0.318
1.215GluTyr: 1.215 ± 0.267
0.0GluXaa: 0.0 ± 0.0
Phe
2.581PheAla: 2.581 ± 0.379
0.607PheCys: 0.607 ± 0.229
2.353PheAsp: 2.353 ± 0.349
1.974PheGlu: 1.974 ± 0.47
0.759PhePhe: 0.759 ± 0.255
3.34PheGly: 3.34 ± 0.445
0.304PheHis: 0.304 ± 0.114
1.291PheIle: 1.291 ± 0.245
1.139PheLys: 1.139 ± 0.271
1.442PheLeu: 1.442 ± 0.328
0.228PheMet: 0.228 ± 0.122
0.531PheAsn: 0.531 ± 0.169
1.518PhePro: 1.518 ± 0.373
1.063PheGln: 1.063 ± 0.239
2.05PheArg: 2.05 ± 0.387
2.201PheSer: 2.201 ± 0.45
1.291PheThr: 1.291 ± 0.392
2.581PheVal: 2.581 ± 0.462
0.531PheTrp: 0.531 ± 0.273
1.063PheTyr: 1.063 ± 0.288
0.0PheXaa: 0.0 ± 0.0
Gly
11.387GlyAla: 11.387 ± 1.075
0.987GlyCys: 0.987 ± 0.317
5.01GlyAsp: 5.01 ± 0.75
4.327GlyGlu: 4.327 ± 0.648
3.112GlyPhe: 3.112 ± 0.464
8.123GlyGly: 8.123 ± 0.896
1.291GlyHis: 1.291 ± 0.359
4.175GlyIle: 4.175 ± 0.49
3.796GlyLys: 3.796 ± 0.617
5.466GlyLeu: 5.466 ± 0.904
2.429GlyMet: 2.429 ± 0.45
2.961GlyAsn: 2.961 ± 0.709
2.353GlyPro: 2.353 ± 0.372
3.112GlyGln: 3.112 ± 0.467
5.39GlyArg: 5.39 ± 0.741
5.086GlySer: 5.086 ± 0.743
6.301GlyThr: 6.301 ± 1.167
6.377GlyVal: 6.377 ± 0.81
1.139GlyTrp: 1.139 ± 0.278
1.974GlyTyr: 1.974 ± 0.356
0.0GlyXaa: 0.0 ± 0.0
His
2.126HisAla: 2.126 ± 0.33
0.228HisCys: 0.228 ± 0.112
1.063HisAsp: 1.063 ± 0.274
1.291HisGlu: 1.291 ± 0.313
0.455HisPhe: 0.455 ± 0.21
1.366HisGly: 1.366 ± 0.299
0.531HisHis: 0.531 ± 0.235
0.987HisIle: 0.987 ± 0.296
0.759HisLys: 0.759 ± 0.252
1.215HisLeu: 1.215 ± 0.269
0.38HisMet: 0.38 ± 0.188
0.38HisAsn: 0.38 ± 0.138
0.759HisPro: 0.759 ± 0.238
0.607HisGln: 0.607 ± 0.205
1.215HisArg: 1.215 ± 0.224
0.455HisSer: 0.455 ± 0.189
0.759HisThr: 0.759 ± 0.222
1.291HisVal: 1.291 ± 0.293
0.683HisTrp: 0.683 ± 0.267
0.304HisTyr: 0.304 ± 0.156
0.0HisXaa: 0.0 ± 0.0
Ile
6.377IleAla: 6.377 ± 0.544
0.304IleCys: 0.304 ± 0.135
4.175IleAsp: 4.175 ± 0.697
3.492IleGlu: 3.492 ± 0.566
0.987IlePhe: 0.987 ± 0.327
4.707IleGly: 4.707 ± 0.513
0.304IleHis: 0.304 ± 0.119
1.67IleIle: 1.67 ± 0.316
1.518IleLys: 1.518 ± 0.38
2.353IleLeu: 2.353 ± 0.463
0.835IleMet: 0.835 ± 0.228
1.746IleAsn: 1.746 ± 0.377
1.898IlePro: 1.898 ± 0.415
0.987IleGln: 0.987 ± 0.232
2.809IleArg: 2.809 ± 0.465
2.581IleSer: 2.581 ± 0.503
1.67IleThr: 1.67 ± 0.347
2.961IleVal: 2.961 ± 0.386
0.304IleTrp: 0.304 ± 0.124
1.518IleTyr: 1.518 ± 0.385
0.0IleXaa: 0.0 ± 0.0
Lys
4.858LysAla: 4.858 ± 0.566
0.152LysCys: 0.152 ± 0.103
2.126LysAsp: 2.126 ± 0.356
1.746LysGlu: 1.746 ± 0.406
1.442LysPhe: 1.442 ± 0.277
2.201LysGly: 2.201 ± 0.41
0.607LysHis: 0.607 ± 0.2
1.67LysIle: 1.67 ± 0.253
1.822LysLys: 1.822 ± 0.407
4.023LysLeu: 4.023 ± 0.509
0.759LysMet: 0.759 ± 0.23
0.759LysAsn: 0.759 ± 0.249
2.657LysPro: 2.657 ± 0.603
2.126LysGln: 2.126 ± 0.567
3.112LysArg: 3.112 ± 0.493
2.201LysSer: 2.201 ± 0.437
2.733LysThr: 2.733 ± 0.464
1.594LysVal: 1.594 ± 0.373
0.759LysTrp: 0.759 ± 0.288
1.063LysTyr: 1.063 ± 0.288
0.0LysXaa: 0.0 ± 0.0
Leu
11.691LeuAla: 11.691 ± 1.108
1.063LeuCys: 1.063 ± 0.378
5.39LeuAsp: 5.39 ± 0.677
3.492LeuGlu: 3.492 ± 0.555
2.581LeuPhe: 2.581 ± 0.501
5.314LeuGly: 5.314 ± 0.442
1.215LeuHis: 1.215 ± 0.357
3.872LeuIle: 3.872 ± 0.427
2.353LeuLys: 2.353 ± 0.49
7.288LeuLeu: 7.288 ± 0.827
1.746LeuMet: 1.746 ± 0.319
3.34LeuAsn: 3.34 ± 0.548
4.479LeuPro: 4.479 ± 0.76
3.947LeuGln: 3.947 ± 0.782
6.832LeuArg: 6.832 ± 0.791
4.555LeuSer: 4.555 ± 0.678
5.845LeuThr: 5.845 ± 0.687
5.39LeuVal: 5.39 ± 0.679
1.366LeuTrp: 1.366 ± 0.279
1.594LeuTyr: 1.594 ± 0.314
0.0LeuXaa: 0.0 ± 0.0
Met
2.126MetAla: 2.126 ± 0.399
0.228MetCys: 0.228 ± 0.113
1.139MetAsp: 1.139 ± 0.244
1.063MetGlu: 1.063 ± 0.302
1.063MetPhe: 1.063 ± 0.342
1.366MetGly: 1.366 ± 0.326
0.076MetHis: 0.076 ± 0.067
1.063MetIle: 1.063 ± 0.279
1.063MetLys: 1.063 ± 0.245
2.277MetLeu: 2.277 ± 0.426
0.531MetMet: 0.531 ± 0.173
0.759MetAsn: 0.759 ± 0.258
2.733MetPro: 2.733 ± 0.409
1.746MetGln: 1.746 ± 0.363
1.594MetArg: 1.594 ± 0.411
1.291MetSer: 1.291 ± 0.307
1.518MetThr: 1.518 ± 0.298
0.683MetVal: 0.683 ± 0.198
0.228MetTrp: 0.228 ± 0.13
0.683MetTyr: 0.683 ± 0.176
0.0MetXaa: 0.0 ± 0.0
Asn
4.783AsnAla: 4.783 ± 0.677
0.304AsnCys: 0.304 ± 0.17
2.353AsnAsp: 2.353 ± 0.481
1.67AsnGlu: 1.67 ± 0.286
0.683AsnPhe: 0.683 ± 0.155
3.796AsnGly: 3.796 ± 0.595
0.228AsnHis: 0.228 ± 0.114
1.594AsnIle: 1.594 ± 0.414
1.063AsnLys: 1.063 ± 0.245
3.568AsnLeu: 3.568 ± 0.674
1.063AsnMet: 1.063 ± 0.3
1.215AsnAsn: 1.215 ± 0.297
2.277AsnPro: 2.277 ± 0.454
1.594AsnGln: 1.594 ± 0.379
1.291AsnArg: 1.291 ± 0.418
1.898AsnSer: 1.898 ± 0.42
2.126AsnThr: 2.126 ± 0.475
1.974AsnVal: 1.974 ± 0.38
0.38AsnTrp: 0.38 ± 0.191
0.987AsnTyr: 0.987 ± 0.298
0.0AsnXaa: 0.0 ± 0.0
Pro
7.667ProAla: 7.667 ± 0.809
0.304ProCys: 0.304 ± 0.152
4.403ProAsp: 4.403 ± 0.615
2.885ProGlu: 2.885 ± 0.524
1.366ProPhe: 1.366 ± 0.268
4.175ProGly: 4.175 ± 0.536
1.063ProHis: 1.063 ± 0.316
1.442ProIle: 1.442 ± 0.475
1.746ProLys: 1.746 ± 0.347
5.162ProLeu: 5.162 ± 0.602
1.139ProMet: 1.139 ± 0.305
1.67ProAsn: 1.67 ± 0.33
2.657ProPro: 2.657 ± 0.607
1.518ProGln: 1.518 ± 0.313
3.568ProArg: 3.568 ± 0.613
2.581ProSer: 2.581 ± 0.504
3.644ProThr: 3.644 ± 0.574
3.72ProVal: 3.72 ± 0.493
0.835ProTrp: 0.835 ± 0.275
0.304ProTyr: 0.304 ± 0.135
0.0ProXaa: 0.0 ± 0.0
Gln
5.162GlnAla: 5.162 ± 1.076
0.304GlnCys: 0.304 ± 0.141
2.126GlnAsp: 2.126 ± 0.414
1.518GlnGlu: 1.518 ± 0.275
1.139GlnPhe: 1.139 ± 0.242
2.05GlnGly: 2.05 ± 0.455
0.759GlnHis: 0.759 ± 0.287
2.201GlnIle: 2.201 ± 0.492
1.442GlnLys: 1.442 ± 0.313
4.631GlnLeu: 4.631 ± 0.613
1.746GlnMet: 1.746 ± 0.364
1.215GlnAsn: 1.215 ± 0.26
2.05GlnPro: 2.05 ± 0.518
2.885GlnGln: 2.885 ± 0.796
3.037GlnArg: 3.037 ± 0.51
2.277GlnSer: 2.277 ± 0.42
3.112GlnThr: 3.112 ± 0.414
2.353GlnVal: 2.353 ± 0.504
0.835GlnTrp: 0.835 ± 0.264
1.063GlnTyr: 1.063 ± 0.277
0.0GlnXaa: 0.0 ± 0.0
Arg
10.172ArgAla: 10.172 ± 0.985
0.38ArgCys: 0.38 ± 0.168
4.023ArgAsp: 4.023 ± 0.714
3.796ArgGlu: 3.796 ± 0.49
1.518ArgPhe: 1.518 ± 0.324
5.542ArgGly: 5.542 ± 0.593
0.911ArgHis: 0.911 ± 0.268
3.188ArgIle: 3.188 ± 0.389
2.809ArgLys: 2.809 ± 0.637
4.934ArgLeu: 4.934 ± 0.777
1.442ArgMet: 1.442 ± 0.325
1.898ArgAsn: 1.898 ± 0.385
2.885ArgPro: 2.885 ± 0.576
3.188ArgGln: 3.188 ± 0.53
5.238ArgArg: 5.238 ± 0.901
2.885ArgSer: 2.885 ± 0.614
4.479ArgThr: 4.479 ± 0.497
5.542ArgVal: 5.542 ± 0.518
0.911ArgTrp: 0.911 ± 0.215
1.746ArgTyr: 1.746 ± 0.381
0.0ArgXaa: 0.0 ± 0.0
Ser
8.275SerAla: 8.275 ± 0.957
0.304SerCys: 0.304 ± 0.189
3.34SerAsp: 3.34 ± 0.593
1.746SerGlu: 1.746 ± 0.397
1.822SerPhe: 1.822 ± 0.342
5.162SerGly: 5.162 ± 0.737
0.759SerHis: 0.759 ± 0.255
2.201SerIle: 2.201 ± 0.398
1.746SerLys: 1.746 ± 0.304
4.175SerLeu: 4.175 ± 0.652
1.442SerMet: 1.442 ± 0.342
1.822SerAsn: 1.822 ± 0.606
2.581SerPro: 2.581 ± 0.423
2.05SerGln: 2.05 ± 0.414
2.201SerArg: 2.201 ± 0.391
4.479SerSer: 4.479 ± 1.007
4.631SerThr: 4.631 ± 0.709
3.872SerVal: 3.872 ± 0.753
0.911SerTrp: 0.911 ± 0.222
1.291SerTyr: 1.291 ± 0.334
0.0SerXaa: 0.0 ± 0.0
Thr
7.895ThrAla: 7.895 ± 1.023
0.38ThrCys: 0.38 ± 0.156
3.34ThrAsp: 3.34 ± 0.546
2.429ThrGlu: 2.429 ± 0.377
1.898ThrPhe: 1.898 ± 0.384
5.769ThrGly: 5.769 ± 0.746
1.139ThrHis: 1.139 ± 0.256
3.037ThrIle: 3.037 ± 0.35
2.353ThrLys: 2.353 ± 0.477
5.845ThrLeu: 5.845 ± 0.793
1.594ThrMet: 1.594 ± 0.312
3.112ThrAsn: 3.112 ± 0.568
4.099ThrPro: 4.099 ± 0.639
2.429ThrGln: 2.429 ± 0.393
3.112ThrArg: 3.112 ± 0.43
3.796ThrSer: 3.796 ± 0.671
4.783ThrThr: 4.783 ± 1.125
4.175ThrVal: 4.175 ± 0.634
1.442ThrTrp: 1.442 ± 0.339
1.518ThrTyr: 1.518 ± 0.321
0.0ThrXaa: 0.0 ± 0.0
Val
7.743ValAla: 7.743 ± 0.619
0.683ValCys: 0.683 ± 0.217
4.707ValAsp: 4.707 ± 0.556
4.934ValGlu: 4.934 ± 0.573
2.201ValPhe: 2.201 ± 0.506
4.707ValGly: 4.707 ± 0.576
1.291ValHis: 1.291 ± 0.338
2.581ValIle: 2.581 ± 0.48
2.657ValLys: 2.657 ± 0.47
4.783ValLeu: 4.783 ± 0.481
1.215ValMet: 1.215 ± 0.261
2.885ValAsn: 2.885 ± 0.346
3.796ValPro: 3.796 ± 0.62
1.898ValGln: 1.898 ± 0.342
4.175ValArg: 4.175 ± 0.568
4.631ValSer: 4.631 ± 0.713
4.858ValThr: 4.858 ± 0.874
3.644ValVal: 3.644 ± 0.469
1.366ValTrp: 1.366 ± 0.302
1.518ValTyr: 1.518 ± 0.281
0.0ValXaa: 0.0 ± 0.0
Trp
1.518TrpAla: 1.518 ± 0.411
0.304TrpCys: 0.304 ± 0.133
0.835TrpAsp: 0.835 ± 0.245
0.683TrpGlu: 0.683 ± 0.171
0.835TrpPhe: 0.835 ± 0.393
0.759TrpGly: 0.759 ± 0.241
0.607TrpHis: 0.607 ± 0.192
1.366TrpIle: 1.366 ± 0.342
0.759TrpLys: 0.759 ± 0.229
1.822TrpLeu: 1.822 ± 0.388
0.228TrpMet: 0.228 ± 0.163
0.835TrpAsn: 0.835 ± 0.26
1.291TrpPro: 1.291 ± 0.322
0.607TrpGln: 0.607 ± 0.194
1.518TrpArg: 1.518 ± 0.402
0.607TrpSer: 0.607 ± 0.196
0.835TrpThr: 0.835 ± 0.296
1.139TrpVal: 1.139 ± 0.269
0.455TrpTrp: 0.455 ± 0.199
0.304TrpTyr: 0.304 ± 0.142
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.492TyrAla: 3.492 ± 0.517
0.076TyrCys: 0.076 ± 0.063
1.215TyrAsp: 1.215 ± 0.272
1.215TyrGlu: 1.215 ± 0.257
0.455TyrPhe: 0.455 ± 0.18
2.885TyrGly: 2.885 ± 0.451
0.38TyrHis: 0.38 ± 0.208
0.683TyrIle: 0.683 ± 0.194
1.063TyrLys: 1.063 ± 0.316
1.974TyrLeu: 1.974 ± 0.341
0.531TyrMet: 0.531 ± 0.236
1.063TyrAsn: 1.063 ± 0.199
1.366TyrPro: 1.366 ± 0.264
1.215TyrGln: 1.215 ± 0.293
1.366TyrArg: 1.366 ± 0.325
0.759TyrSer: 0.759 ± 0.192
0.911TyrThr: 0.911 ± 0.328
2.126TyrVal: 2.126 ± 0.332
0.455TyrTrp: 0.455 ± 0.18
0.38TyrTyr: 0.38 ± 0.145
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 46 proteins (13174 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
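A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is reported. The function names, the fixed seed, and the use of the sample standard deviation as the error measure are assumptions; the source only states that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (per mille), pairs counted within proteins."""
    total = sum(max(len(seq) - 1, 0) for seq in proteins)
    hits = sum(1 for seq in proteins for a, b in zip(seq, seq[1:]) if a + b == pair)
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Spread of the frequency over n_boot protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size
        resampled = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(resampled, pair))
    # Assumption: the reported error is the sample standard deviation
    return statistics.stdev(estimates)


# Toy proteins in one-letter codes, not taken from the BcepC6B proteome
toy = ["AAGA", "GAAA", "GGGG", "AGAG"]
print(pair_per_mille(toy, "AA"), bootstrap_error(toy, "AA"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so the error reflects variation between proteins.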

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski