Amino acid dipeptide frequency for Bordetella phage vB_BbrS_PHB09

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
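As an illustration of what a dipeptide frequency is, the following minimal sketch counts overlapping residue pairs in a set of protein sequences and reports them in per mille. The function name `dipeptide_frequencies`, the toy sequences, the use of one-letter codes, and the choice to normalize by the total number of dipeptides are all assumptions made for this example; the exact procedure used by the database is not described on this page.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies (per mille) for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Slide a window of length 2 along each protein; pairs never
        # span the boundary between two proteins in this sketch.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalize by the total number of observed dipeptides.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Hypothetical toy "proteome" with two short sequences.
toy_proteome = ["MAAG", "MAL"]
freqs = dipeptide_frequencies(toy_proteome)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

In the toy input there are five dipeptides in total (MA, AA, AG, MA, AL), so each frequency is a multiple of 200‰.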

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with every combination equally likely, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
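The arithmetic behind the uniform expectation, and the resulting over/under classification, can be sketched as follows. The function name `classify` and the returned labels are hypothetical, and the sketch assumes that the colour coding compares each value directly against the uniform expectation of 1/400; the three example values are taken from the table on this page.

```python
N_STANDARD = 20

# 20 x 20 = 400 equally likely dipeptides -> 1/400 = 0.25 % = 2.5 per mille.
expected_permille = 1000.0 / (N_STANDARD ** 2)

def classify(freq_permille, expected=expected_permille):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if freq_permille > expected:
        return "over"    # would be marked in red
    if freq_permille < expected:
        return "under"   # would be marked in blue
    return "expected"

# Values (per mille) copied from the table.
examples = {"AlaAla": 23.064, "CysCys": 0.152, "GluGln": 2.504}
for name, value in examples.items():
    print(name, value, classify(value))
```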

All values are presented in per mille (‰); to obtain fractions, multiply them by 10⁻³.
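For readers who prefer fractions or percentages, the unit conversion is a single multiplication. The helper names below are hypothetical; the example value is the AlaAla entry from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0

ala_ala = 23.064  # per mille, from the table
print(permille_to_fraction(ala_ala))
print(permille_to_percent(ala_ala))
```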

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 23.064 ± 2.96
AlaCys: 1.138 ± 0.318
AlaAsp: 8.497 ± 1.001
AlaGlu: 9.256 ± 0.894
AlaPhe: 4.249 ± 0.566
AlaGly: 13.277 ± 1.095
AlaHis: 2.2 ± 0.481
AlaIle: 6.6 ± 0.669
AlaLys: 5.007 ± 0.545
AlaLeu: 11.608 ± 0.98
AlaMet: 4.324 ± 0.598
AlaAsn: 3.566 ± 0.586
AlaPro: 5.614 ± 0.874
AlaGln: 6.221 ± 1.109
AlaArg: 8.725 ± 1.277
AlaSer: 7.131 ± 0.928
AlaThr: 6.525 ± 0.919
AlaVal: 8.269 ± 0.943
AlaTrp: 2.883 ± 0.38
AlaTyr: 3.414 ± 0.56
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.986 ± 0.381
CysCys: 0.152 ± 0.127
CysAsp: 0.531 ± 0.24
CysGlu: 0.152 ± 0.12
CysPhe: 0.076 ± 0.063
CysGly: 0.607 ± 0.226
CysHis: 0.531 ± 0.258
CysIle: 0.379 ± 0.188
CysLys: 0.228 ± 0.144
CysLeu: 0.455 ± 0.191
CysMet: 0.303 ± 0.138
CysAsn: 0.228 ± 0.119
CysPro: 0.531 ± 0.192
CysGln: 0.683 ± 0.267
CysArg: 0.759 ± 0.251
CysSer: 0.531 ± 0.236
CysThr: 0.152 ± 0.104
CysVal: 1.062 ± 0.318
CysTrp: 0.228 ± 0.143
CysTyr: 0.303 ± 0.131
CysXaa: 0.0 ± 0.0
Asp
AspAla: 9.635 ± 1.089
AspCys: 0.228 ± 0.155
AspAsp: 3.49 ± 0.756
AspGlu: 4.476 ± 0.543
AspPhe: 1.821 ± 0.374
AspGly: 6.904 ± 0.822
AspHis: 1.214 ± 0.281
AspIle: 2.048 ± 0.375
AspLys: 1.441 ± 0.369
AspLeu: 4.78 ± 0.646
AspMet: 1.745 ± 0.321
AspAsn: 1.214 ± 0.31
AspPro: 3.111 ± 0.522
AspGln: 3.111 ± 0.425
AspArg: 2.959 ± 0.473
AspSer: 2.428 ± 0.367
AspThr: 2.352 ± 0.46
AspVal: 3.49 ± 0.585
AspTrp: 1.745 ± 0.334
AspTyr: 0.91 ± 0.301
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.89 ± 1.045
GluCys: 0.759 ± 0.318
GluAsp: 2.883 ± 0.505
GluGlu: 3.035 ± 0.447
GluPhe: 1.897 ± 0.399
GluGly: 3.793 ± 0.427
GluHis: 1.214 ± 0.317
GluIle: 3.414 ± 0.449
GluLys: 2.2 ± 0.447
GluLeu: 5.159 ± 0.656
GluMet: 1.138 ± 0.233
GluAsn: 1.973 ± 0.389
GluPro: 2.579 ± 0.506
GluGln: 2.504 ± 0.421
GluArg: 5.159 ± 0.831
GluSer: 2.352 ± 0.434
GluThr: 2.731 ± 0.418
GluVal: 4.476 ± 0.525
GluTrp: 1.214 ± 0.396
GluTyr: 1.821 ± 0.316
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.428 ± 0.446
PheCys: 0.152 ± 0.115
PheAsp: 1.897 ± 0.359
PheGlu: 1.593 ± 0.338
PhePhe: 0.986 ± 0.211
PheGly: 3.186 ± 0.454
PheHis: 0.986 ± 0.315
PheIle: 1.138 ± 0.287
PheLys: 1.366 ± 0.393
PheLeu: 1.821 ± 0.317
PheMet: 1.138 ± 0.272
PheAsn: 1.062 ± 0.256
PhePro: 1.441 ± 0.395
PheGln: 1.29 ± 0.332
PheArg: 1.821 ± 0.375
PheSer: 1.973 ± 0.46
PheThr: 1.366 ± 0.371
PheVal: 2.352 ± 0.37
PheTrp: 0.303 ± 0.155
PheTyr: 0.607 ± 0.198
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 12.139 ± 1.042
GlyCys: 0.986 ± 0.311
GlyAsp: 5.159 ± 0.649
GlyGlu: 5.842 ± 0.694
GlyPhe: 2.731 ± 0.405
GlyGly: 8.421 ± 0.781
GlyHis: 0.91 ± 0.258
GlyIle: 3.945 ± 0.651
GlyLys: 4.78 ± 0.596
GlyLeu: 6.6 ± 0.525
GlyMet: 2.276 ± 0.415
GlyAsn: 2.807 ± 0.445
GlyPro: 3.186 ± 0.529
GlyGln: 3.793 ± 0.583
GlyArg: 8.194 ± 0.875
GlySer: 3.642 ± 0.686
GlyThr: 4.249 ± 0.56
GlyVal: 5.69 ± 0.714
GlyTrp: 2.048 ± 0.381
GlyTyr: 2.352 ± 0.352
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.973 ± 0.44
HisCys: 0.455 ± 0.203
HisAsp: 1.593 ± 0.424
HisGlu: 0.759 ± 0.233
HisPhe: 0.683 ± 0.261
HisGly: 1.821 ± 0.373
HisHis: 0.455 ± 0.219
HisIle: 0.986 ± 0.284
HisLys: 0.379 ± 0.198
HisLeu: 1.441 ± 0.318
HisMet: 0.455 ± 0.167
HisAsn: 0.379 ± 0.135
HisPro: 1.062 ± 0.318
HisGln: 0.759 ± 0.306
HisArg: 1.517 ± 0.358
HisSer: 0.835 ± 0.379
HisThr: 0.607 ± 0.206
HisVal: 0.683 ± 0.231
HisTrp: 0.531 ± 0.213
HisTyr: 0.683 ± 0.228
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.766 ± 0.778
IleCys: 0.379 ± 0.172
IleAsp: 3.566 ± 0.632
IleGlu: 4.249 ± 0.617
IlePhe: 1.062 ± 0.306
IleGly: 4.097 ± 0.376
IleHis: 0.759 ± 0.247
IleIle: 1.897 ± 0.321
IleLys: 1.745 ± 0.318
IleLeu: 1.897 ± 0.395
IleMet: 0.683 ± 0.198
IleAsn: 1.441 ± 0.273
IlePro: 2.276 ± 0.408
IleGln: 2.959 ± 0.61
IleArg: 4.021 ± 0.609
IleSer: 2.352 ± 0.423
IleThr: 2.352 ± 0.415
IleVal: 3.49 ± 0.573
IleTrp: 0.531 ± 0.17
IleTyr: 1.29 ± 0.374
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.918 ± 0.802
LysCys: 0.455 ± 0.169
LysAsp: 1.745 ± 0.367
LysGlu: 2.2 ± 0.527
LysPhe: 0.759 ± 0.211
LysGly: 3.869 ± 0.557
LysHis: 0.683 ± 0.295
LysIle: 1.366 ± 0.427
LysLys: 1.29 ± 0.39
LysLeu: 3.793 ± 0.397
LysMet: 0.759 ± 0.257
LysAsn: 0.759 ± 0.252
LysPro: 2.579 ± 0.497
LysGln: 1.366 ± 0.376
LysArg: 3.338 ± 0.614
LysSer: 1.745 ± 0.264
LysThr: 1.897 ± 0.346
LysVal: 2.428 ± 0.523
LysTrp: 0.986 ± 0.319
LysTyr: 0.91 ± 0.233
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 11.456 ± 1.157
LeuCys: 0.683 ± 0.239
LeuAsp: 4.855 ± 0.544
LeuGlu: 4.628 ± 0.609
LeuPhe: 2.048 ± 0.412
LeuGly: 6.145 ± 0.733
LeuHis: 2.124 ± 0.441
LeuIle: 4.324 ± 0.774
LeuLys: 2.959 ± 0.5
LeuLeu: 6.145 ± 0.901
LeuMet: 1.29 ± 0.388
LeuAsn: 1.897 ± 0.408
LeuPro: 3.111 ± 0.51
LeuGln: 2.579 ± 0.441
LeuArg: 7.131 ± 0.747
LeuSer: 4.097 ± 0.597
LeuThr: 4.931 ± 0.555
LeuVal: 5.007 ± 0.737
LeuTrp: 1.29 ± 0.36
LeuTyr: 2.276 ± 0.405
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.262 ± 0.481
MetCys: 0.228 ± 0.133
MetAsp: 0.986 ± 0.317
MetGlu: 0.759 ± 0.204
MetPhe: 0.835 ± 0.251
MetGly: 2.048 ± 0.409
MetHis: 0.228 ± 0.134
MetIle: 0.91 ± 0.24
MetLys: 1.29 ± 0.335
MetLeu: 2.048 ± 0.353
MetMet: 0.759 ± 0.319
MetAsn: 0.607 ± 0.203
MetPro: 1.29 ± 0.271
MetGln: 1.593 ± 0.424
MetArg: 1.669 ± 0.434
MetSer: 2.048 ± 0.414
MetThr: 1.897 ± 0.279
MetVal: 1.669 ± 0.31
MetTrp: 0.228 ± 0.145
MetTyr: 0.531 ± 0.214
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.717 ± 0.574
AsnCys: 0.303 ± 0.145
AsnAsp: 1.593 ± 0.328
AsnGlu: 0.91 ± 0.208
AsnPhe: 0.759 ± 0.235
AsnGly: 2.807 ± 0.436
AsnHis: 0.379 ± 0.179
AsnIle: 0.986 ± 0.242
AsnLys: 0.607 ± 0.216
AsnLeu: 1.669 ± 0.313
AsnMet: 0.228 ± 0.128
AsnAsn: 0.607 ± 0.169
AsnPro: 2.276 ± 0.478
AsnGln: 1.366 ± 0.305
AsnArg: 1.973 ± 0.384
AsnSer: 1.441 ± 0.337
AsnThr: 1.441 ± 0.393
AsnVal: 1.897 ± 0.467
AsnTrp: 0.759 ± 0.242
AsnTyr: 0.228 ± 0.118
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 7.435 ± 1.102
ProCys: 0.379 ± 0.161
ProAsp: 3.793 ± 0.624
ProGlu: 2.883 ± 0.481
ProPhe: 0.91 ± 0.18
ProGly: 4.931 ± 0.701
ProHis: 0.91 ± 0.24
ProIle: 2.124 ± 0.445
ProLys: 1.593 ± 0.464
ProLeu: 2.883 ± 0.442
ProMet: 1.29 ± 0.332
ProAsn: 0.683 ± 0.194
ProPro: 3.414 ± 0.698
ProGln: 1.29 ± 0.352
ProArg: 2.959 ± 0.52
ProSer: 3.338 ± 0.59
ProThr: 1.897 ± 0.407
ProVal: 3.49 ± 0.634
ProTrp: 0.986 ± 0.227
ProTyr: 1.214 ± 0.304
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 7.587 ± 1.451
GlnCys: 0.531 ± 0.225
GlnAsp: 1.897 ± 0.32
GlnGlu: 1.973 ± 0.344
GlnPhe: 1.062 ± 0.236
GlnGly: 3.945 ± 0.475
GlnHis: 0.986 ± 0.327
GlnIle: 1.897 ± 0.367
GlnLys: 1.669 ± 0.461
GlnLeu: 3.945 ± 0.661
GlnMet: 0.91 ± 0.287
GlnAsn: 0.607 ± 0.177
GlnPro: 1.517 ± 0.378
GlnGln: 2.276 ± 0.463
GlnArg: 3.49 ± 0.716
GlnSer: 1.821 ± 0.321
GlnThr: 1.593 ± 0.39
GlnVal: 3.414 ± 0.638
GlnTrp: 0.986 ± 0.273
GlnTyr: 1.366 ± 0.294
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 10.849 ± 0.952
ArgCys: 0.531 ± 0.246
ArgAsp: 4.249 ± 0.448
ArgGlu: 3.717 ± 0.596
ArgPhe: 2.352 ± 0.466
ArgGly: 5.007 ± 0.709
ArgHis: 1.138 ± 0.276
ArgIle: 3.945 ± 0.766
ArgLys: 3.869 ± 0.645
ArgLeu: 7.056 ± 0.778
ArgMet: 2.2 ± 0.442
ArgAsn: 2.048 ± 0.34
ArgPro: 3.642 ± 0.674
ArgGln: 3.338 ± 0.494
ArgArg: 5.462 ± 0.734
ArgSer: 3.566 ± 0.475
ArgThr: 3.338 ± 0.539
ArgVal: 5.235 ± 0.588
ArgTrp: 1.441 ± 0.337
ArgTyr: 2.2 ± 0.398
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.904 ± 0.64
SerCys: 0.379 ± 0.179
SerAsp: 2.352 ± 0.476
SerGlu: 2.731 ± 0.47
SerPhe: 1.366 ± 0.314
SerGly: 5.842 ± 0.819
SerHis: 0.91 ± 0.281
SerIle: 2.352 ± 0.463
SerLys: 2.048 ± 0.288
SerLeu: 4.097 ± 0.586
SerMet: 1.29 ± 0.279
SerAsn: 1.593 ± 0.413
SerPro: 1.897 ± 0.49
SerGln: 2.048 ± 0.393
SerArg: 2.276 ± 0.335
SerSer: 3.262 ± 0.67
SerThr: 2.959 ± 0.481
SerVal: 4.097 ± 0.543
SerTrp: 0.759 ± 0.32
SerTyr: 1.062 ± 0.392
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.587 ± 0.702
ThrCys: 0.228 ± 0.133
ThrAsp: 3.262 ± 0.484
ThrGlu: 3.035 ± 0.514
ThrPhe: 1.973 ± 0.416
ThrGly: 4.021 ± 0.562
ThrHis: 0.91 ± 0.358
ThrIle: 2.883 ± 0.568
ThrLys: 2.124 ± 0.444
ThrLeu: 3.642 ± 0.478
ThrMet: 1.366 ± 0.281
ThrAsn: 1.29 ± 0.437
ThrPro: 2.504 ± 0.5
ThrGln: 1.593 ± 0.34
ThrArg: 3.642 ± 0.574
ThrSer: 2.124 ± 0.647
ThrThr: 2.655 ± 0.466
ThrVal: 3.717 ± 0.579
ThrTrp: 0.455 ± 0.166
ThrTyr: 1.138 ± 0.235
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.725 ± 0.75
ValCys: 0.303 ± 0.149
ValAsp: 4.628 ± 0.625
ValGlu: 4.021 ± 0.631
ValPhe: 1.821 ± 0.278
ValGly: 5.462 ± 0.62
ValHis: 1.062 ± 0.25
ValIle: 3.338 ± 0.513
ValLys: 2.883 ± 0.586
ValLeu: 5.159 ± 0.68
ValMet: 1.669 ± 0.302
ValAsn: 2.048 ± 0.422
ValPro: 3.793 ± 0.54
ValGln: 2.655 ± 0.593
ValArg: 6.449 ± 0.718
ValSer: 3.338 ± 0.488
ValThr: 4.324 ± 0.925
ValVal: 4.4 ± 0.696
ValTrp: 0.986 ± 0.299
ValTyr: 1.593 ± 0.388
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.669 ± 0.315
TrpCys: 0.152 ± 0.104
TrpAsp: 0.835 ± 0.262
TrpGlu: 0.835 ± 0.223
TrpPhe: 0.455 ± 0.202
TrpGly: 1.441 ± 0.316
TrpHis: 0.076 ± 0.091
TrpIle: 1.214 ± 0.298
TrpLys: 0.531 ± 0.167
TrpLeu: 2.2 ± 0.592
TrpMet: 0.607 ± 0.246
TrpAsn: 0.379 ± 0.197
TrpPro: 1.062 ± 0.338
TrpGln: 0.683 ± 0.219
TrpArg: 1.897 ± 0.423
TrpSer: 0.986 ± 0.238
TrpThr: 1.29 ± 0.254
TrpVal: 2.048 ± 0.402
TrpTrp: 0.455 ± 0.186
TrpTyr: 0.303 ± 0.145
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.579 ± 0.399
TyrCys: 0.455 ± 0.184
TyrAsp: 1.29 ± 0.309
TyrGlu: 1.138 ± 0.261
TyrPhe: 1.062 ± 0.346
TyrGly: 2.2 ± 0.46
TyrHis: 0.379 ± 0.168
TyrIle: 1.062 ± 0.241
TyrLys: 0.835 ± 0.256
TyrLeu: 2.579 ± 0.471
TyrMet: 0.379 ± 0.182
TyrAsn: 0.759 ± 0.236
TyrPro: 1.517 ± 0.336
TyrGln: 1.366 ± 0.375
TyrArg: 1.745 ± 0.307
TyrSer: 1.214 ± 0.295
TyrThr: 1.593 ± 0.332
TyrVal: 1.745 ± 0.419
TyrTrp: 0.303 ± 0.148
TyrTyr: 0.683 ± 0.235
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 59 proteins (13182 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
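A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. This is only a sketch under stated assumptions; the function names, the toy sequences, the fixed seed, and the use of the sample standard deviation as the error are choices made for this example and are not confirmed by the page.

```python
import random
import statistics
from collections import Counter

def dipeptide_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the same count.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, pair))
    # Assumption: the reported error is the sample standard deviation.
    return statistics.stdev(estimates)

# Hypothetical toy proteome (one-letter codes).
toy_proteome = ["MAAG", "MAL", "AAAA", "GLK"]
print(dipeptide_permille(toy_proteome, "AA"))
print(bootstrap_error(toy_proteome, "AA"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so dipeptides that cluster in a few proteins produce a correspondingly larger error.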

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski