Amino acid dipepetide frequency for Burkholderia virus Bcep22

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
21.623AlaAla: 21.623 ± 1.286
0.947AlaCys: 0.947 ± 0.394
6.826AlaAsp: 6.826 ± 0.562
7.324AlaGlu: 7.324 ± 0.586
3.836AlaPhe: 3.836 ± 0.43
10.363AlaGly: 10.363 ± 0.833
1.993AlaHis: 1.993 ± 0.37
5.73AlaIle: 5.73 ± 0.552
5.829AlaLys: 5.829 ± 0.676
9.865AlaLeu: 9.865 ± 0.596
3.388AlaMet: 3.388 ± 0.382
4.384AlaAsn: 4.384 ± 0.429
7.025AlaPro: 7.025 ± 1.017
6.178AlaGln: 6.178 ± 0.787
9.466AlaArg: 9.466 ± 0.799
7.125AlaSer: 7.125 ± 0.67
6.776AlaThr: 6.776 ± 0.556
6.826AlaVal: 6.826 ± 0.782
2.142AlaTrp: 2.142 ± 0.315
2.69AlaTyr: 2.69 ± 0.502
0.0AlaXaa: 0.0 ± 0.0
Cys
1.146CysAla: 1.146 ± 0.39
0.149CysCys: 0.149 ± 0.102
0.448CysAsp: 0.448 ± 0.254
0.448CysGlu: 0.448 ± 0.185
0.249CysPhe: 0.249 ± 0.135
0.947CysGly: 0.947 ± 0.319
0.0CysHis: 0.0 ± 0.0
0.548CysIle: 0.548 ± 0.207
0.349CysLys: 0.349 ± 0.132
0.698CysLeu: 0.698 ± 0.253
0.299CysMet: 0.299 ± 0.17
0.349CysAsn: 0.349 ± 0.145
0.448CysPro: 0.448 ± 0.177
0.249CysGln: 0.249 ± 0.112
0.847CysArg: 0.847 ± 0.258
0.399CysSer: 0.399 ± 0.139
0.349CysThr: 0.349 ± 0.148
0.448CysVal: 0.448 ± 0.181
0.199CysTrp: 0.199 ± 0.114
0.349CysTyr: 0.349 ± 0.136
0.0CysXaa: 0.0 ± 0.0
Asp
9.466AspAla: 9.466 ± 0.798
0.747AspCys: 0.747 ± 0.263
5.132AspAsp: 5.132 ± 0.648
4.484AspGlu: 4.484 ± 0.539
2.342AspPhe: 2.342 ± 0.399
6.078AspGly: 6.078 ± 0.615
1.594AspHis: 1.594 ± 0.449
3.139AspIle: 3.139 ± 0.368
2.541AspLys: 2.541 ± 0.383
3.587AspLeu: 3.587 ± 0.351
1.594AspMet: 1.594 ± 0.268
2.043AspAsn: 2.043 ± 0.276
3.338AspPro: 3.338 ± 0.434
2.292AspGln: 2.292 ± 0.357
4.783AspArg: 4.783 ± 0.706
2.392AspSer: 2.392 ± 0.367
2.79AspThr: 2.79 ± 0.37
3.687AspVal: 3.687 ± 0.455
1.146AspTrp: 1.146 ± 0.283
1.644AspTyr: 1.644 ± 0.288
0.0AspXaa: 0.0 ± 0.0
Glu
7.723GluAla: 7.723 ± 0.816
0.448GluCys: 0.448 ± 0.193
2.242GluAsp: 2.242 ± 0.282
3.438GluGlu: 3.438 ± 0.464
2.142GluPhe: 2.142 ± 0.335
3.936GluGly: 3.936 ± 0.603
1.843GluHis: 1.843 ± 0.345
3.338GluIle: 3.338 ± 0.356
3.089GluLys: 3.089 ± 0.365
6.278GluLeu: 6.278 ± 0.525
1.246GluMet: 1.246 ± 0.256
1.644GluAsn: 1.644 ± 0.304
3.288GluPro: 3.288 ± 0.434
3.637GluGln: 3.637 ± 0.766
5.829GluArg: 5.829 ± 0.626
2.441GluSer: 2.441 ± 0.478
2.74GluThr: 2.74 ± 0.457
3.288GluVal: 3.288 ± 0.404
1.096GluTrp: 1.096 ± 0.225
1.744GluTyr: 1.744 ± 0.274
0.0GluXaa: 0.0 ± 0.0
Phe
2.94PheAla: 2.94 ± 0.461
0.149PheCys: 0.149 ± 0.093
2.69PheAsp: 2.69 ± 0.376
2.491PheGlu: 2.491 ± 0.552
1.146PhePhe: 1.146 ± 0.238
2.441PheGly: 2.441 ± 0.301
0.598PheHis: 0.598 ± 0.222
1.545PheIle: 1.545 ± 0.351
1.096PheLys: 1.096 ± 0.25
1.794PheLeu: 1.794 ± 0.272
0.897PheMet: 0.897 ± 0.171
1.196PheAsn: 1.196 ± 0.23
1.445PhePro: 1.445 ± 0.243
0.797PheGln: 0.797 ± 0.17
1.993PheArg: 1.993 ± 0.465
1.644PheSer: 1.644 ± 0.238
1.794PheThr: 1.794 ± 0.328
2.242PheVal: 2.242 ± 0.34
0.598PheTrp: 0.598 ± 0.154
0.548PheTyr: 0.548 ± 0.155
0.0PheXaa: 0.0 ± 0.0
Gly
9.516GlyAla: 9.516 ± 0.999
0.598GlyCys: 0.598 ± 0.225
6.078GlyAsp: 6.078 ± 0.523
4.783GlyGlu: 4.783 ± 0.514
2.093GlyPhe: 2.093 ± 0.317
6.726GlyGly: 6.726 ± 0.834
1.395GlyHis: 1.395 ± 0.31
4.185GlyIle: 4.185 ± 0.587
4.534GlyLys: 4.534 ± 0.637
5.281GlyLeu: 5.281 ± 0.556
2.292GlyMet: 2.292 ± 0.653
3.388GlyAsn: 3.388 ± 0.58
2.192GlyPro: 2.192 ± 0.246
2.74GlyGln: 2.74 ± 0.331
5.53GlyArg: 5.53 ± 0.48
4.384GlySer: 4.384 ± 0.451
6.577GlyThr: 6.577 ± 0.728
5.431GlyVal: 5.431 ± 0.408
1.046GlyTrp: 1.046 ± 0.185
2.142GlyTyr: 2.142 ± 0.284
0.0GlyXaa: 0.0 ± 0.0
His
1.993HisAla: 1.993 ± 0.352
0.249HisCys: 0.249 ± 0.122
1.046HisAsp: 1.046 ± 0.208
1.545HisGlu: 1.545 ± 0.343
0.847HisPhe: 0.847 ± 0.259
1.843HisGly: 1.843 ± 0.387
0.399HisHis: 0.399 ± 0.199
0.797HisIle: 0.797 ± 0.177
0.548HisLys: 0.548 ± 0.191
1.146HisLeu: 1.146 ± 0.247
0.399HisMet: 0.399 ± 0.162
0.648HisAsn: 0.648 ± 0.23
0.747HisPro: 0.747 ± 0.213
0.548HisGln: 0.548 ± 0.164
1.744HisArg: 1.744 ± 0.315
1.046HisSer: 1.046 ± 0.225
0.996HisThr: 0.996 ± 0.287
1.196HisVal: 1.196 ± 0.236
0.349HisTrp: 0.349 ± 0.118
0.448HisTyr: 0.448 ± 0.198
0.0HisXaa: 0.0 ± 0.0
Ile
5.929IleAla: 5.929 ± 0.642
0.498IleCys: 0.498 ± 0.213
4.135IleAsp: 4.135 ± 0.475
4.484IleGlu: 4.484 ± 0.566
1.196IlePhe: 1.196 ± 0.259
4.783IleGly: 4.783 ± 0.535
0.548IleHis: 0.548 ± 0.178
1.993IleIle: 1.993 ± 0.283
1.893IleLys: 1.893 ± 0.315
2.292IleLeu: 2.292 ± 0.37
0.747IleMet: 0.747 ± 0.154
1.644IleAsn: 1.644 ± 0.28
1.893IlePro: 1.893 ± 0.244
1.246IleGln: 1.246 ± 0.345
2.89IleArg: 2.89 ± 0.354
2.242IleSer: 2.242 ± 0.393
2.989IleThr: 2.989 ± 0.404
3.587IleVal: 3.587 ± 0.493
0.548IleTrp: 0.548 ± 0.171
1.096IleTyr: 1.096 ± 0.225
0.0IleXaa: 0.0 ± 0.0
Lys
6.128LysAla: 6.128 ± 0.696
0.199LysCys: 0.199 ± 0.116
2.69LysAsp: 2.69 ± 0.389
2.242LysGlu: 2.242 ± 0.313
1.196LysPhe: 1.196 ± 0.197
2.989LysGly: 2.989 ± 0.479
1.046LysHis: 1.046 ± 0.241
2.093LysIle: 2.093 ± 0.342
2.79LysLys: 2.79 ± 0.475
3.388LysLeu: 3.388 ± 0.386
1.196LysMet: 1.196 ± 0.244
1.395LysAsn: 1.395 ± 0.208
3.488LysPro: 3.488 ± 0.426
1.843LysGln: 1.843 ± 0.33
3.388LysArg: 3.388 ± 0.355
2.342LysSer: 2.342 ± 0.345
2.242LysThr: 2.242 ± 0.349
3.338LysVal: 3.338 ± 0.39
0.598LysTrp: 0.598 ± 0.177
0.548LysTyr: 0.548 ± 0.138
0.0LysXaa: 0.0 ± 0.0
Leu
9.068LeuAla: 9.068 ± 0.569
0.598LeuCys: 0.598 ± 0.246
5.381LeuAsp: 5.381 ± 0.449
4.534LeuGlu: 4.534 ± 0.436
1.794LeuPhe: 1.794 ± 0.285
5.58LeuGly: 5.58 ± 0.629
1.843LeuHis: 1.843 ± 0.289
2.79LeuIle: 2.79 ± 0.371
3.836LeuLys: 3.836 ± 0.43
4.584LeuLeu: 4.584 ± 0.521
1.893LeuMet: 1.893 ± 0.292
2.989LeuAsn: 2.989 ± 0.339
3.737LeuPro: 3.737 ± 0.561
2.342LeuGln: 2.342 ± 0.292
5.53LeuArg: 5.53 ± 0.587
4.434LeuSer: 4.434 ± 0.65
5.53LeuThr: 5.53 ± 0.467
4.434LeuVal: 4.434 ± 0.659
0.797LeuTrp: 0.797 ± 0.196
1.694LeuTyr: 1.694 ± 0.233
0.0LeuXaa: 0.0 ± 0.0
Met
2.541MetAla: 2.541 ± 0.453
0.299MetCys: 0.299 ± 0.143
1.345MetAsp: 1.345 ± 0.259
1.146MetGlu: 1.146 ± 0.274
0.548MetPhe: 0.548 ± 0.154
1.943MetGly: 1.943 ± 0.357
0.349MetHis: 0.349 ± 0.112
0.947MetIle: 0.947 ± 0.193
1.395MetLys: 1.395 ± 0.342
2.142MetLeu: 2.142 ± 0.354
0.399MetMet: 0.399 ± 0.145
0.947MetAsn: 0.947 ± 0.235
1.644MetPro: 1.644 ± 0.281
1.096MetGln: 1.096 ± 0.215
2.74MetArg: 2.74 ± 0.406
1.594MetSer: 1.594 ± 0.258
2.192MetThr: 2.192 ± 0.26
1.395MetVal: 1.395 ± 0.321
0.648MetTrp: 0.648 ± 0.209
0.498MetTyr: 0.498 ± 0.182
0.0MetXaa: 0.0 ± 0.0
Asn
3.886AsnAla: 3.886 ± 0.396
0.1AsnCys: 0.1 ± 0.077
1.943AsnAsp: 1.943 ± 0.417
1.843AsnGlu: 1.843 ± 0.252
0.996AsnPhe: 0.996 ± 0.225
3.537AsnGly: 3.537 ± 0.52
0.648AsnHis: 0.648 ± 0.167
1.345AsnIle: 1.345 ± 0.251
1.196AsnLys: 1.196 ± 0.235
2.84AsnLeu: 2.84 ± 0.376
0.947AsnMet: 0.947 ± 0.184
1.146AsnAsn: 1.146 ± 0.285
2.641AsnPro: 2.641 ± 0.449
1.046AsnGln: 1.046 ± 0.274
2.441AsnArg: 2.441 ± 0.419
1.843AsnSer: 1.843 ± 0.384
1.594AsnThr: 1.594 ± 0.376
2.79AsnVal: 2.79 ± 0.357
0.548AsnTrp: 0.548 ± 0.199
0.947AsnTyr: 0.947 ± 0.232
0.0AsnXaa: 0.0 ± 0.0
Pro
9.217ProAla: 9.217 ± 0.986
0.648ProCys: 0.648 ± 0.265
3.787ProAsp: 3.787 ± 0.404
4.335ProGlu: 4.335 ± 0.616
1.545ProPhe: 1.545 ± 0.26
3.787ProGly: 3.787 ± 0.481
0.698ProHis: 0.698 ± 0.228
2.142ProIle: 2.142 ± 0.293
1.993ProLys: 1.993 ± 0.278
3.438ProLeu: 3.438 ± 0.436
0.996ProMet: 0.996 ± 0.273
1.495ProAsn: 1.495 ± 0.303
3.089ProPro: 3.089 ± 0.722
2.093ProGln: 2.093 ± 0.356
2.84ProArg: 2.84 ± 0.498
3.438ProSer: 3.438 ± 0.437
3.039ProThr: 3.039 ± 0.335
2.94ProVal: 2.94 ± 0.383
0.548ProTrp: 0.548 ± 0.169
1.146ProTyr: 1.146 ± 0.322
0.0ProXaa: 0.0 ± 0.0
Gln
4.833GlnAla: 4.833 ± 0.661
0.249GlnCys: 0.249 ± 0.11
1.943GlnAsp: 1.943 ± 0.393
2.342GlnGlu: 2.342 ± 0.295
0.996GlnPhe: 0.996 ± 0.218
2.441GlnGly: 2.441 ± 0.472
0.797GlnHis: 0.797 ± 0.204
1.843GlnIle: 1.843 ± 0.461
1.545GlnLys: 1.545 ± 0.335
3.139GlnLeu: 3.139 ± 0.403
2.093GlnMet: 2.093 ± 0.445
1.545GlnAsn: 1.545 ± 0.413
2.591GlnPro: 2.591 ± 0.421
2.989GlnGln: 2.989 ± 0.593
3.388GlnArg: 3.388 ± 0.484
2.541GlnSer: 2.541 ± 0.619
1.196GlnThr: 1.196 ± 0.266
1.993GlnVal: 1.993 ± 0.298
0.399GlnTrp: 0.399 ± 0.118
1.345GlnTyr: 1.345 ± 0.189
0.0GlnXaa: 0.0 ± 0.0
Arg
9.765ArgAla: 9.765 ± 0.689
0.747ArgCys: 0.747 ± 0.243
4.783ArgAsp: 4.783 ± 0.406
5.082ArgGlu: 5.082 ± 0.663
3.039ArgPhe: 3.039 ± 0.429
4.833ArgGly: 4.833 ± 0.479
1.196ArgHis: 1.196 ± 0.324
3.886ArgIle: 3.886 ± 0.417
2.69ArgLys: 2.69 ± 0.34
6.427ArgLeu: 6.427 ± 0.513
2.441ArgMet: 2.441 ± 0.406
2.392ArgAsn: 2.392 ± 0.28
2.74ArgPro: 2.74 ± 0.466
3.239ArgGln: 3.239 ± 0.492
5.979ArgArg: 5.979 ± 0.882
3.189ArgSer: 3.189 ± 0.462
3.737ArgThr: 3.737 ± 0.406
5.53ArgVal: 5.53 ± 0.555
0.947ArgTrp: 0.947 ± 0.231
2.342ArgTyr: 2.342 ± 0.362
0.0ArgXaa: 0.0 ± 0.0
Ser
6.328SerAla: 6.328 ± 0.778
0.249SerCys: 0.249 ± 0.151
3.637SerAsp: 3.637 ± 0.326
2.392SerGlu: 2.392 ± 0.347
1.594SerPhe: 1.594 ± 0.222
5.879SerGly: 5.879 ± 0.556
0.897SerHis: 0.897 ± 0.187
2.74SerIle: 2.74 ± 0.45
2.69SerLys: 2.69 ± 0.351
3.886SerLeu: 3.886 ± 0.477
1.345SerMet: 1.345 ± 0.267
1.744SerAsn: 1.744 ± 0.322
2.491SerPro: 2.491 ± 0.451
1.843SerGln: 1.843 ± 0.299
4.085SerArg: 4.085 ± 0.353
2.292SerSer: 2.292 ± 0.397
2.989SerThr: 2.989 ± 0.4
3.388SerVal: 3.388 ± 0.607
0.598SerTrp: 0.598 ± 0.15
1.246SerTyr: 1.246 ± 0.212
0.0SerXaa: 0.0 ± 0.0
Thr
7.025ThrAla: 7.025 ± 0.532
0.698ThrCys: 0.698 ± 0.279
3.388ThrAsp: 3.388 ± 0.396
2.74ThrGlu: 2.74 ± 0.37
1.993ThrPhe: 1.993 ± 0.226
5.231ThrGly: 5.231 ± 0.831
1.196ThrHis: 1.196 ± 0.282
3.039ThrIle: 3.039 ± 0.444
2.242ThrLys: 2.242 ± 0.253
4.235ThrLeu: 4.235 ± 0.49
1.495ThrMet: 1.495 ± 0.222
1.943ThrAsn: 1.943 ± 0.401
4.634ThrPro: 4.634 ± 0.551
2.441ThrGln: 2.441 ± 0.387
2.89ThrArg: 2.89 ± 0.391
2.69ThrSer: 2.69 ± 0.352
3.687ThrThr: 3.687 ± 0.58
4.135ThrVal: 4.135 ± 0.478
0.698ThrTrp: 0.698 ± 0.203
1.246ThrTyr: 1.246 ± 0.236
0.0ThrXaa: 0.0 ± 0.0
Val
7.224ValAla: 7.224 ± 0.829
0.698ValCys: 0.698 ± 0.266
4.384ValAsp: 4.384 ± 0.44
4.185ValGlu: 4.185 ± 0.435
1.096ValPhe: 1.096 ± 0.256
4.733ValGly: 4.733 ± 0.548
0.698ValHis: 0.698 ± 0.196
2.84ValIle: 2.84 ± 0.421
3.438ValLys: 3.438 ± 0.522
4.484ValLeu: 4.484 ± 0.46
1.146ValMet: 1.146 ± 0.272
2.142ValAsn: 2.142 ± 0.472
4.135ValPro: 4.135 ± 0.455
1.993ValGln: 1.993 ± 0.297
5.182ValArg: 5.182 ± 0.708
4.235ValSer: 4.235 ± 0.442
3.986ValThr: 3.986 ± 0.423
4.335ValVal: 4.335 ± 0.389
0.797ValTrp: 0.797 ± 0.245
1.644ValTyr: 1.644 ± 0.329
0.0ValXaa: 0.0 ± 0.0
Trp
1.545TrpAla: 1.545 ± 0.307
0.249TrpCys: 0.249 ± 0.135
0.747TrpAsp: 0.747 ± 0.232
0.548TrpGlu: 0.548 ± 0.132
0.598TrpPhe: 0.598 ± 0.15
0.747TrpGly: 0.747 ± 0.169
0.399TrpHis: 0.399 ± 0.161
0.747TrpIle: 0.747 ± 0.174
0.399TrpLys: 0.399 ± 0.104
1.644TrpLeu: 1.644 ± 0.341
0.548TrpMet: 0.548 ± 0.164
0.349TrpAsn: 0.349 ± 0.128
0.299TrpPro: 0.299 ± 0.125
0.797TrpGln: 0.797 ± 0.231
1.644TrpArg: 1.644 ± 0.248
0.698TrpSer: 0.698 ± 0.188
0.847TrpThr: 0.847 ± 0.186
0.797TrpVal: 0.797 ± 0.23
0.1TrpTrp: 0.1 ± 0.054
0.199TrpTyr: 0.199 ± 0.09
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.441TyrAla: 2.441 ± 0.329
0.399TyrCys: 0.399 ± 0.205
2.192TyrAsp: 2.192 ± 0.313
1.046TyrGlu: 1.046 ± 0.217
0.797TyrPhe: 0.797 ± 0.191
2.093TyrGly: 2.093 ± 0.393
0.399TyrHis: 0.399 ± 0.174
0.947TyrIle: 0.947 ± 0.25
1.046TyrLys: 1.046 ± 0.237
2.242TyrLeu: 2.242 ± 0.314
0.399TyrMet: 0.399 ± 0.108
0.897TyrAsn: 0.897 ± 0.169
1.345TyrPro: 1.345 ± 0.21
0.847TyrGln: 0.847 ± 0.234
1.794TyrArg: 1.794 ± 0.298
1.345TyrSer: 1.345 ± 0.208
1.594TyrThr: 1.594 ± 0.277
1.594TyrVal: 1.594 ± 0.261
0.1TyrTrp: 0.1 ± 0.105
0.598TyrTyr: 0.598 ± 0.175
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 77 proteins (20072 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski