Amino acid dipepetide frequency for Brucella phage BiPBO1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
13.508AlaAla: 13.508 ± 1.247
0.99AlaCys: 0.99 ± 0.306
5.941AlaAsp: 5.941 ± 0.689
7.284AlaGlu: 7.284 ± 0.566
3.182AlaPhe: 3.182 ± 0.462
8.274AlaGly: 8.274 ± 0.947
2.122AlaHis: 2.122 ± 0.35
7.284AlaIle: 7.284 ± 0.617
5.728AlaLys: 5.728 ± 0.68
9.972AlaLeu: 9.972 ± 0.732
3.819AlaMet: 3.819 ± 0.575
4.102AlaAsn: 4.102 ± 0.646
3.253AlaPro: 3.253 ± 0.515
3.748AlaGln: 3.748 ± 0.673
6.223AlaArg: 6.223 ± 0.737
5.941AlaSer: 5.941 ± 0.585
6.294AlaThr: 6.294 ± 0.79
7.638AlaVal: 7.638 ± 0.683
1.697AlaTrp: 1.697 ± 0.419
2.546AlaTyr: 2.546 ± 0.39
0.0AlaXaa: 0.0 ± 0.0
Cys
1.061CysAla: 1.061 ± 0.264
0.354CysCys: 0.354 ± 0.171
0.354CysAsp: 0.354 ± 0.174
0.636CysGlu: 0.636 ± 0.231
0.212CysPhe: 0.212 ± 0.117
0.566CysGly: 0.566 ± 0.185
0.354CysHis: 0.354 ± 0.172
0.424CysIle: 0.424 ± 0.168
0.354CysLys: 0.354 ± 0.165
0.707CysLeu: 0.707 ± 0.243
0.212CysMet: 0.212 ± 0.101
0.566CysAsn: 0.566 ± 0.203
0.566CysPro: 0.566 ± 0.212
0.354CysGln: 0.354 ± 0.15
0.424CysArg: 0.424 ± 0.16
0.283CysSer: 0.283 ± 0.14
0.283CysThr: 0.283 ± 0.128
1.132CysVal: 1.132 ± 0.293
0.212CysTrp: 0.212 ± 0.129
0.283CysTyr: 0.283 ± 0.135
0.0CysXaa: 0.0 ± 0.0
Asp
6.365AspAla: 6.365 ± 0.735
0.566AspCys: 0.566 ± 0.229
3.465AspAsp: 3.465 ± 0.523
4.809AspGlu: 4.809 ± 0.544
2.546AspPhe: 2.546 ± 0.432
5.941AspGly: 5.941 ± 0.634
1.273AspHis: 1.273 ± 0.318
4.031AspIle: 4.031 ± 0.59
2.9AspLys: 2.9 ± 0.426
4.314AspLeu: 4.314 ± 0.527
1.556AspMet: 1.556 ± 0.361
2.051AspAsn: 2.051 ± 0.348
2.334AspPro: 2.334 ± 0.442
2.475AspGln: 2.475 ± 0.472
4.385AspArg: 4.385 ± 0.66
1.627AspSer: 1.627 ± 0.3
2.9AspThr: 2.9 ± 0.439
4.738AspVal: 4.738 ± 0.536
0.707AspTrp: 0.707 ± 0.217
2.192AspTyr: 2.192 ± 0.358
0.0AspXaa: 0.0 ± 0.0
Glu
8.769GluAla: 8.769 ± 0.917
0.849GluCys: 0.849 ± 0.26
2.758GluAsp: 2.758 ± 0.448
3.253GluGlu: 3.253 ± 0.524
2.546GluPhe: 2.546 ± 0.48
5.092GluGly: 5.092 ± 0.63
0.99GluHis: 0.99 ± 0.284
4.102GluIle: 4.102 ± 0.617
3.96GluLys: 3.96 ± 0.578
5.021GluLeu: 5.021 ± 0.606
2.97GluMet: 2.97 ± 0.479
2.405GluAsn: 2.405 ± 0.383
2.475GluPro: 2.475 ± 0.439
2.405GluGln: 2.405 ± 0.535
4.102GluArg: 4.102 ± 0.466
1.98GluSer: 1.98 ± 0.399
4.88GluThr: 4.88 ± 0.488
3.395GluVal: 3.395 ± 0.477
1.132GluTrp: 1.132 ± 0.236
1.768GluTyr: 1.768 ± 0.36
0.0GluXaa: 0.0 ± 0.0
Phe
3.182PheAla: 3.182 ± 0.513
0.495PheCys: 0.495 ± 0.18
3.041PheAsp: 3.041 ± 0.428
1.697PheGlu: 1.697 ± 0.361
1.344PhePhe: 1.344 ± 0.297
2.829PheGly: 2.829 ± 0.446
0.566PheHis: 0.566 ± 0.21
2.617PheIle: 2.617 ± 0.382
1.909PheLys: 1.909 ± 0.43
2.546PheLeu: 2.546 ± 0.561
1.061PheMet: 1.061 ± 0.268
1.627PheAsn: 1.627 ± 0.326
1.697PhePro: 1.697 ± 0.318
0.919PheGln: 0.919 ± 0.253
2.546PheArg: 2.546 ± 0.377
1.909PheSer: 1.909 ± 0.382
1.909PheThr: 1.909 ± 0.311
2.546PheVal: 2.546 ± 0.492
0.424PheTrp: 0.424 ± 0.214
0.778PheTyr: 0.778 ± 0.242
0.0PheXaa: 0.0 ± 0.0
Gly
5.941GlyAla: 5.941 ± 0.743
0.919GlyCys: 0.919 ± 0.228
4.597GlyAsp: 4.597 ± 0.54
5.446GlyGlu: 5.446 ± 0.648
3.041GlyPhe: 3.041 ± 0.46
6.082GlyGly: 6.082 ± 0.903
1.485GlyHis: 1.485 ± 0.344
4.314GlyIle: 4.314 ± 0.613
4.102GlyLys: 4.102 ± 0.664
6.86GlyLeu: 6.86 ± 0.739
2.829GlyMet: 2.829 ± 0.437
2.97GlyAsn: 2.97 ± 0.382
2.829GlyPro: 2.829 ± 0.39
2.758GlyGln: 2.758 ± 0.392
4.243GlyArg: 4.243 ± 0.631
4.031GlySer: 4.031 ± 0.503
4.031GlyThr: 4.031 ± 0.695
5.658GlyVal: 5.658 ± 0.756
1.627GlyTrp: 1.627 ± 0.282
2.405GlyTyr: 2.405 ± 0.437
0.0GlyXaa: 0.0 ± 0.0
His
1.909HisAla: 1.909 ± 0.352
0.212HisCys: 0.212 ± 0.124
0.849HisAsp: 0.849 ± 0.282
1.414HisGlu: 1.414 ± 0.288
0.707HisPhe: 0.707 ± 0.211
1.627HisGly: 1.627 ± 0.306
0.495HisHis: 0.495 ± 0.178
0.99HisIle: 0.99 ± 0.233
1.202HisLys: 1.202 ± 0.26
1.768HisLeu: 1.768 ± 0.329
0.495HisMet: 0.495 ± 0.173
0.636HisAsn: 0.636 ± 0.234
0.849HisPro: 0.849 ± 0.25
0.636HisGln: 0.636 ± 0.175
1.414HisArg: 1.414 ± 0.285
1.132HisSer: 1.132 ± 0.236
0.778HisThr: 0.778 ± 0.27
1.132HisVal: 1.132 ± 0.225
0.141HisTrp: 0.141 ± 0.095
0.283HisTyr: 0.283 ± 0.153
0.0HisXaa: 0.0 ± 0.0
Ile
5.658IleAla: 5.658 ± 0.596
0.707IleCys: 0.707 ± 0.199
4.809IleAsp: 4.809 ± 0.655
4.314IleGlu: 4.314 ± 0.605
2.122IlePhe: 2.122 ± 0.363
5.021IleGly: 5.021 ± 0.65
1.273IleHis: 1.273 ± 0.337
3.182IleIle: 3.182 ± 0.43
3.536IleLys: 3.536 ± 0.457
4.95IleLeu: 4.95 ± 0.621
0.707IleMet: 0.707 ± 0.219
2.9IleAsn: 2.9 ± 0.49
2.758IlePro: 2.758 ± 0.299
2.405IleGln: 2.405 ± 0.364
3.748IleArg: 3.748 ± 0.494
3.607IleSer: 3.607 ± 0.534
3.253IleThr: 3.253 ± 0.48
3.536IleVal: 3.536 ± 0.585
0.354IleTrp: 0.354 ± 0.162
1.202IleTyr: 1.202 ± 0.272
0.0IleXaa: 0.0 ± 0.0
Lys
7.709LysAla: 7.709 ± 0.901
0.424LysCys: 0.424 ± 0.176
3.182LysAsp: 3.182 ± 0.548
3.182LysGlu: 3.182 ± 0.477
1.768LysPhe: 1.768 ± 0.308
3.678LysGly: 3.678 ± 0.425
1.697LysHis: 1.697 ± 0.39
3.182LysIle: 3.182 ± 0.453
2.617LysLys: 2.617 ± 0.468
3.96LysLeu: 3.96 ± 0.562
1.768LysMet: 1.768 ± 0.372
1.768LysAsn: 1.768 ± 0.308
2.617LysPro: 2.617 ± 0.439
2.263LysGln: 2.263 ± 0.469
3.819LysArg: 3.819 ± 0.581
3.182LysSer: 3.182 ± 0.516
3.96LysThr: 3.96 ± 0.667
2.9LysVal: 2.9 ± 0.4
0.707LysTrp: 0.707 ± 0.2
0.707LysTyr: 0.707 ± 0.239
0.0LysXaa: 0.0 ± 0.0
Leu
9.264LeuAla: 9.264 ± 0.807
0.707LeuCys: 0.707 ± 0.224
4.597LeuAsp: 4.597 ± 0.565
5.304LeuGlu: 5.304 ± 0.586
3.112LeuPhe: 3.112 ± 0.507
5.516LeuGly: 5.516 ± 0.665
1.697LeuHis: 1.697 ± 0.281
4.173LeuIle: 4.173 ± 0.626
4.95LeuLys: 4.95 ± 0.569
5.446LeuLeu: 5.446 ± 0.647
1.556LeuMet: 1.556 ± 0.27
3.112LeuAsn: 3.112 ± 0.538
3.748LeuPro: 3.748 ± 0.472
3.536LeuGln: 3.536 ± 0.579
4.738LeuArg: 4.738 ± 0.65
5.375LeuSer: 5.375 ± 0.582
4.455LeuThr: 4.455 ± 0.562
4.95LeuVal: 4.95 ± 0.54
1.627LeuTrp: 1.627 ± 0.362
1.768LeuTyr: 1.768 ± 0.298
0.0LeuXaa: 0.0 ± 0.0
Met
3.96MetAla: 3.96 ± 0.489
0.212MetCys: 0.212 ± 0.121
1.273MetAsp: 1.273 ± 0.328
1.556MetGlu: 1.556 ± 0.354
0.424MetPhe: 0.424 ± 0.154
1.344MetGly: 1.344 ± 0.344
0.636MetHis: 0.636 ± 0.2
1.627MetIle: 1.627 ± 0.372
1.909MetLys: 1.909 ± 0.435
2.687MetLeu: 2.687 ± 0.384
0.707MetMet: 0.707 ± 0.235
1.697MetAsn: 1.697 ± 0.349
1.414MetPro: 1.414 ± 0.341
1.273MetGln: 1.273 ± 0.31
2.051MetArg: 2.051 ± 0.358
2.051MetSer: 2.051 ± 0.414
1.98MetThr: 1.98 ± 0.357
1.485MetVal: 1.485 ± 0.368
0.283MetTrp: 0.283 ± 0.16
0.566MetTyr: 0.566 ± 0.209
0.0MetXaa: 0.0 ± 0.0
Asn
3.96AsnAla: 3.96 ± 0.819
0.212AsnCys: 0.212 ± 0.143
1.839AsnAsp: 1.839 ± 0.351
2.334AsnGlu: 2.334 ± 0.463
0.919AsnPhe: 0.919 ± 0.207
2.97AsnGly: 2.97 ± 0.381
0.354AsnHis: 0.354 ± 0.155
2.475AsnIle: 2.475 ± 0.437
1.839AsnLys: 1.839 ± 0.563
2.617AsnLeu: 2.617 ± 0.424
0.919AsnMet: 0.919 ± 0.251
1.414AsnAsn: 1.414 ± 0.449
2.192AsnPro: 2.192 ± 0.3
0.99AsnGln: 0.99 ± 0.222
3.112AsnArg: 3.112 ± 0.425
2.334AsnSer: 2.334 ± 0.333
2.405AsnThr: 2.405 ± 0.33
2.263AsnVal: 2.263 ± 0.484
0.919AsnTrp: 0.919 ± 0.272
1.061AsnTyr: 1.061 ± 0.231
0.0AsnXaa: 0.0 ± 0.0
Pro
3.96ProAla: 3.96 ± 0.506
0.071ProCys: 0.071 ± 0.08
3.748ProAsp: 3.748 ± 0.567
3.112ProGlu: 3.112 ± 0.456
2.263ProPhe: 2.263 ± 0.428
2.122ProGly: 2.122 ± 0.338
0.636ProHis: 0.636 ± 0.21
1.839ProIle: 1.839 ± 0.373
2.9ProLys: 2.9 ± 0.422
2.829ProLeu: 2.829 ± 0.442
1.061ProMet: 1.061 ± 0.257
1.627ProAsn: 1.627 ± 0.361
1.061ProPro: 1.061 ± 0.242
1.344ProGln: 1.344 ± 0.35
2.617ProArg: 2.617 ± 0.486
3.96ProSer: 3.96 ± 0.492
2.192ProThr: 2.192 ± 0.45
2.97ProVal: 2.97 ± 0.566
0.566ProTrp: 0.566 ± 0.208
1.061ProTyr: 1.061 ± 0.316
0.0ProXaa: 0.0 ± 0.0
Gln
4.385GlnAla: 4.385 ± 0.642
0.071GlnCys: 0.071 ± 0.075
1.909GlnAsp: 1.909 ± 0.426
1.556GlnGlu: 1.556 ± 0.256
1.273GlnPhe: 1.273 ± 0.305
2.617GlnGly: 2.617 ± 0.361
0.495GlnHis: 0.495 ± 0.158
2.829GlnIle: 2.829 ± 0.634
1.768GlnLys: 1.768 ± 0.335
2.192GlnLeu: 2.192 ± 0.417
0.919GlnMet: 0.919 ± 0.26
1.627GlnAsn: 1.627 ± 0.331
2.263GlnPro: 2.263 ± 0.447
1.132GlnGln: 1.132 ± 0.238
2.546GlnArg: 2.546 ± 0.471
2.334GlnSer: 2.334 ± 0.445
2.334GlnThr: 2.334 ± 0.363
2.122GlnVal: 2.122 ± 0.31
0.778GlnTrp: 0.778 ± 0.268
1.273GlnTyr: 1.273 ± 0.379
0.0GlnXaa: 0.0 ± 0.0
Arg
6.294ArgAla: 6.294 ± 0.662
0.849ArgCys: 0.849 ± 0.225
4.314ArgAsp: 4.314 ± 0.635
4.314ArgGlu: 4.314 ± 0.646
2.405ArgPhe: 2.405 ± 0.409
5.092ArgGly: 5.092 ± 0.609
1.132ArgHis: 1.132 ± 0.263
4.243ArgIle: 4.243 ± 0.547
3.465ArgLys: 3.465 ± 0.569
5.658ArgLeu: 5.658 ± 0.703
1.909ArgMet: 1.909 ± 0.422
1.485ArgAsn: 1.485 ± 0.362
2.546ArgPro: 2.546 ± 0.435
2.334ArgGln: 2.334 ± 0.322
4.809ArgArg: 4.809 ± 0.716
3.89ArgSer: 3.89 ± 0.519
3.041ArgThr: 3.041 ± 0.453
5.233ArgVal: 5.233 ± 0.644
0.636ArgTrp: 0.636 ± 0.234
1.556ArgTyr: 1.556 ± 0.305
0.0ArgXaa: 0.0 ± 0.0
Ser
5.304SerAla: 5.304 ± 0.717
0.495SerCys: 0.495 ± 0.169
4.738SerAsp: 4.738 ± 0.632
4.173SerGlu: 4.173 ± 0.51
2.829SerPhe: 2.829 ± 0.407
5.021SerGly: 5.021 ± 0.63
0.778SerHis: 0.778 ± 0.244
3.607SerIle: 3.607 ± 0.511
3.465SerLys: 3.465 ± 0.493
4.95SerLeu: 4.95 ± 0.715
2.405SerMet: 2.405 ± 0.417
1.202SerAsn: 1.202 ± 0.303
2.405SerPro: 2.405 ± 0.441
1.697SerGln: 1.697 ± 0.318
3.678SerArg: 3.678 ± 0.546
3.465SerSer: 3.465 ± 0.561
2.617SerThr: 2.617 ± 0.388
2.97SerVal: 2.97 ± 0.448
1.061SerTrp: 1.061 ± 0.309
1.627SerTyr: 1.627 ± 0.296
0.0SerXaa: 0.0 ± 0.0
Thr
7.284ThrAla: 7.284 ± 0.681
0.141ThrCys: 0.141 ± 0.102
3.324ThrAsp: 3.324 ± 0.46
3.748ThrGlu: 3.748 ± 0.517
1.98ThrPhe: 1.98 ± 0.321
4.95ThrGly: 4.95 ± 0.604
0.778ThrHis: 0.778 ± 0.246
3.96ThrIle: 3.96 ± 0.479
3.465ThrLys: 3.465 ± 0.511
5.163ThrLeu: 5.163 ± 0.525
1.061ThrMet: 1.061 ± 0.25
1.768ThrAsn: 1.768 ± 0.445
2.334ThrPro: 2.334 ± 0.489
1.98ThrGln: 1.98 ± 0.436
2.97ThrArg: 2.97 ± 0.507
3.395ThrSer: 3.395 ± 0.581
3.041ThrThr: 3.041 ± 0.554
3.253ThrVal: 3.253 ± 0.482
0.919ThrTrp: 0.919 ± 0.305
0.849ThrTyr: 0.849 ± 0.215
0.0ThrXaa: 0.0 ± 0.0
Val
6.86ValAla: 6.86 ± 0.697
0.707ValCys: 0.707 ± 0.255
4.314ValAsp: 4.314 ± 0.649
3.96ValGlu: 3.96 ± 0.545
1.697ValPhe: 1.697 ± 0.39
3.89ValGly: 3.89 ± 0.485
1.273ValHis: 1.273 ± 0.332
3.607ValIle: 3.607 ± 0.553
3.607ValLys: 3.607 ± 0.472
4.385ValLeu: 4.385 ± 0.487
1.839ValMet: 1.839 ± 0.442
2.9ValAsn: 2.9 ± 0.493
2.9ValPro: 2.9 ± 0.457
2.334ValGln: 2.334 ± 0.359
4.385ValArg: 4.385 ± 0.544
5.163ValSer: 5.163 ± 0.539
3.96ValThr: 3.96 ± 0.482
3.96ValVal: 3.96 ± 0.514
0.919ValTrp: 0.919 ± 0.264
1.839ValTyr: 1.839 ± 0.486
0.0ValXaa: 0.0 ± 0.0
Trp
1.202TrpAla: 1.202 ± 0.297
0.212TrpCys: 0.212 ± 0.11
0.707TrpAsp: 0.707 ± 0.243
0.919TrpGlu: 0.919 ± 0.23
0.636TrpPhe: 0.636 ± 0.181
1.485TrpGly: 1.485 ± 0.331
0.212TrpHis: 0.212 ± 0.119
0.778TrpIle: 0.778 ± 0.229
0.707TrpLys: 0.707 ± 0.229
0.919TrpLeu: 0.919 ± 0.219
0.778TrpMet: 0.778 ± 0.271
0.566TrpAsn: 0.566 ± 0.178
0.636TrpPro: 0.636 ± 0.204
0.919TrpGln: 0.919 ± 0.253
1.202TrpArg: 1.202 ± 0.297
1.132TrpSer: 1.132 ± 0.262
0.99TrpThr: 0.99 ± 0.254
1.273TrpVal: 1.273 ± 0.348
0.495TrpTrp: 0.495 ± 0.173
0.141TrpTyr: 0.141 ± 0.097
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.97TyrAla: 2.97 ± 0.47
0.141TyrCys: 0.141 ± 0.106
1.414TyrAsp: 1.414 ± 0.356
1.839TyrGlu: 1.839 ± 0.415
0.636TyrPhe: 0.636 ± 0.251
2.051TyrGly: 2.051 ± 0.39
0.424TyrHis: 0.424 ± 0.191
0.778TyrIle: 0.778 ± 0.225
0.707TyrLys: 0.707 ± 0.22
2.687TyrLeu: 2.687 ± 0.375
0.495TyrMet: 0.495 ± 0.192
0.707TyrAsn: 0.707 ± 0.233
1.132TyrPro: 1.132 ± 0.237
0.919TyrGln: 0.919 ± 0.23
2.192TyrArg: 2.192 ± 0.477
1.697TyrSer: 1.697 ± 0.323
1.061TyrThr: 1.061 ± 0.298
1.344TyrVal: 1.344 ± 0.351
0.707TyrTrp: 0.707 ± 0.261
0.849TyrTyr: 0.849 ± 0.237
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 86 proteins (14141 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski