Amino acid dipepetide frequency for Bordetella phage vB_BbrP_BB8

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.162AlaAla: 11.162 ± 1.09
1.093AlaCys: 1.093 ± 0.422
7.416AlaAsp: 7.416 ± 0.925
6.167AlaGlu: 6.167 ± 0.736
2.81AlaPhe: 2.81 ± 0.535
9.367AlaGly: 9.367 ± 0.979
2.264AlaHis: 2.264 ± 0.573
4.84AlaIle: 4.84 ± 0.652
5.074AlaLys: 5.074 ± 0.704
8.977AlaLeu: 8.977 ± 0.83
2.108AlaMet: 2.108 ± 0.435
3.435AlaAsn: 3.435 ± 0.6
3.669AlaPro: 3.669 ± 0.629
5.23AlaGln: 5.23 ± 0.63
5.386AlaArg: 5.386 ± 0.659
5.542AlaSer: 5.542 ± 0.879
6.557AlaThr: 6.557 ± 0.834
6.245AlaVal: 6.245 ± 0.718
1.015AlaTrp: 1.015 ± 0.355
2.654AlaTyr: 2.654 ± 0.43
0.0AlaXaa: 0.0 ± 0.0
Cys
1.093CysAla: 1.093 ± 0.342
0.156CysCys: 0.156 ± 0.161
0.234CysAsp: 0.234 ± 0.135
0.703CysGlu: 0.703 ± 0.31
0.234CysPhe: 0.234 ± 0.184
1.015CysGly: 1.015 ± 0.296
0.156CysHis: 0.156 ± 0.116
0.546CysIle: 0.546 ± 0.276
0.546CysLys: 0.546 ± 0.254
1.015CysLeu: 1.015 ± 0.338
0.234CysMet: 0.234 ± 0.155
0.078CysAsn: 0.078 ± 0.085
0.703CysPro: 0.703 ± 0.244
0.234CysGln: 0.234 ± 0.197
0.781CysArg: 0.781 ± 0.301
0.624CysSer: 0.624 ± 0.22
0.156CysThr: 0.156 ± 0.107
0.546CysVal: 0.546 ± 0.277
0.234CysTrp: 0.234 ± 0.148
0.234CysTyr: 0.234 ± 0.183
0.0CysXaa: 0.0 ± 0.0
Asp
5.542AspAla: 5.542 ± 0.571
0.546AspCys: 0.546 ± 0.186
3.356AspAsp: 3.356 ± 0.487
5.23AspGlu: 5.23 ± 0.803
2.966AspPhe: 2.966 ± 0.708
4.84AspGly: 4.84 ± 0.784
0.703AspHis: 0.703 ± 0.242
2.966AspIle: 2.966 ± 0.395
3.122AspLys: 3.122 ± 0.532
4.996AspLeu: 4.996 ± 0.58
1.327AspMet: 1.327 ± 0.289
2.42AspAsn: 2.42 ± 0.507
2.966AspPro: 2.966 ± 0.514
1.795AspGln: 1.795 ± 0.483
3.435AspArg: 3.435 ± 0.433
2.42AspSer: 2.42 ± 0.562
3.903AspThr: 3.903 ± 0.657
4.059AspVal: 4.059 ± 0.597
0.859AspTrp: 0.859 ± 0.207
2.03AspTyr: 2.03 ± 0.399
0.0AspXaa: 0.0 ± 0.0
Glu
7.494GluAla: 7.494 ± 0.833
0.703GluCys: 0.703 ± 0.244
4.137GluAsp: 4.137 ± 0.618
6.245GluGlu: 6.245 ± 0.94
2.888GluPhe: 2.888 ± 0.526
5.386GluGly: 5.386 ± 0.826
0.624GluHis: 0.624 ± 0.215
2.03GluIle: 2.03 ± 0.337
3.747GluLys: 3.747 ± 0.702
5.542GluLeu: 5.542 ± 0.637
1.405GluMet: 1.405 ± 0.353
2.576GluAsn: 2.576 ± 0.526
2.03GluPro: 2.03 ± 0.588
2.654GluGln: 2.654 ± 0.464
3.747GluArg: 3.747 ± 0.482
4.059GluSer: 4.059 ± 0.518
2.966GluThr: 2.966 ± 0.522
5.152GluVal: 5.152 ± 0.8
1.171GluTrp: 1.171 ± 0.318
2.654GluTyr: 2.654 ± 0.6
0.0GluXaa: 0.0 ± 0.0
Phe
2.732PheAla: 2.732 ± 0.419
0.39PheCys: 0.39 ± 0.189
2.342PheAsp: 2.342 ± 0.426
2.03PheGlu: 2.03 ± 0.315
1.171PhePhe: 1.171 ± 0.289
3.356PheGly: 3.356 ± 0.599
0.703PheHis: 0.703 ± 0.234
1.327PheIle: 1.327 ± 0.306
1.639PheLys: 1.639 ± 0.34
3.044PheLeu: 3.044 ± 0.719
0.937PheMet: 0.937 ± 0.244
2.342PheAsn: 2.342 ± 0.5
1.873PhePro: 1.873 ± 0.382
1.951PheGln: 1.951 ± 0.399
2.186PheArg: 2.186 ± 0.41
2.732PheSer: 2.732 ± 0.584
2.186PheThr: 2.186 ± 0.404
2.732PheVal: 2.732 ± 0.605
0.468PheTrp: 0.468 ± 0.169
0.624PheTyr: 0.624 ± 0.274
0.0PheXaa: 0.0 ± 0.0
Gly
7.181GlyAla: 7.181 ± 0.957
1.015GlyCys: 1.015 ± 0.379
5.074GlyAsp: 5.074 ± 0.677
5.308GlyGlu: 5.308 ± 0.856
3.122GlyPhe: 3.122 ± 0.435
6.245GlyGly: 6.245 ± 0.901
1.717GlyHis: 1.717 ± 0.329
3.903GlyIle: 3.903 ± 0.593
5.698GlyLys: 5.698 ± 0.75
7.65GlyLeu: 7.65 ± 0.925
2.342GlyMet: 2.342 ± 0.603
2.654GlyAsn: 2.654 ± 0.494
2.108GlyPro: 2.108 ± 0.466
2.732GlyGln: 2.732 ± 0.609
4.137GlyArg: 4.137 ± 0.629
5.464GlySer: 5.464 ± 0.983
4.84GlyThr: 4.84 ± 0.647
5.152GlyVal: 5.152 ± 0.602
1.327GlyTrp: 1.327 ± 0.349
3.591GlyTyr: 3.591 ± 0.489
0.0GlyXaa: 0.0 ± 0.0
His
1.561HisAla: 1.561 ± 0.393
0.234HisCys: 0.234 ± 0.129
0.703HisAsp: 0.703 ± 0.238
0.624HisGlu: 0.624 ± 0.226
0.703HisPhe: 0.703 ± 0.248
1.561HisGly: 1.561 ± 0.389
0.156HisHis: 0.156 ± 0.101
1.327HisIle: 1.327 ± 0.321
0.468HisLys: 0.468 ± 0.222
2.108HisLeu: 2.108 ± 0.514
0.546HisMet: 0.546 ± 0.199
0.781HisAsn: 0.781 ± 0.236
0.624HisPro: 0.624 ± 0.219
0.39HisGln: 0.39 ± 0.205
1.171HisArg: 1.171 ± 0.365
0.468HisSer: 0.468 ± 0.211
1.327HisThr: 1.327 ± 0.577
1.717HisVal: 1.717 ± 0.484
0.234HisTrp: 0.234 ± 0.132
0.781HisTyr: 0.781 ± 0.235
0.0HisXaa: 0.0 ± 0.0
Ile
4.293IleAla: 4.293 ± 0.497
0.546IleCys: 0.546 ± 0.262
3.591IleAsp: 3.591 ± 0.482
3.044IleGlu: 3.044 ± 0.492
1.249IlePhe: 1.249 ± 0.366
3.669IleGly: 3.669 ± 0.58
0.859IleHis: 0.859 ± 0.289
2.576IleIle: 2.576 ± 0.525
2.108IleLys: 2.108 ± 0.485
3.669IleLeu: 3.669 ± 0.618
0.859IleMet: 0.859 ± 0.227
2.108IleAsn: 2.108 ± 0.411
2.186IlePro: 2.186 ± 0.389
2.342IleGln: 2.342 ± 0.409
2.81IleArg: 2.81 ± 0.425
2.42IleSer: 2.42 ± 0.373
3.122IleThr: 3.122 ± 0.502
2.81IleVal: 2.81 ± 0.433
0.546IleTrp: 0.546 ± 0.203
1.639IleTyr: 1.639 ± 0.297
0.0IleXaa: 0.0 ± 0.0
Lys
6.167LysAla: 6.167 ± 0.77
0.39LysCys: 0.39 ± 0.214
3.278LysAsp: 3.278 ± 0.62
4.215LysGlu: 4.215 ± 0.671
1.951LysPhe: 1.951 ± 0.484
4.059LysGly: 4.059 ± 0.501
1.249LysHis: 1.249 ± 0.42
2.186LysIle: 2.186 ± 0.655
3.669LysLys: 3.669 ± 0.687
5.308LysLeu: 5.308 ± 0.761
1.093LysMet: 1.093 ± 0.259
1.171LysAsn: 1.171 ± 0.253
2.498LysPro: 2.498 ± 0.497
1.873LysGln: 1.873 ± 0.417
3.591LysArg: 3.591 ± 0.526
2.264LysSer: 2.264 ± 0.38
3.669LysThr: 3.669 ± 0.533
4.527LysVal: 4.527 ± 0.581
1.171LysTrp: 1.171 ± 0.297
2.264LysTyr: 2.264 ± 0.444
0.0LysXaa: 0.0 ± 0.0
Leu
8.899LeuAla: 8.899 ± 0.895
0.468LeuCys: 0.468 ± 0.31
5.23LeuAsp: 5.23 ± 0.741
5.542LeuGlu: 5.542 ± 0.7
2.888LeuPhe: 2.888 ± 0.583
6.791LeuGly: 6.791 ± 0.733
1.483LeuHis: 1.483 ± 0.369
3.356LeuIle: 3.356 ± 0.466
5.464LeuLys: 5.464 ± 0.747
7.025LeuLeu: 7.025 ± 0.707
1.873LeuMet: 1.873 ± 0.419
3.591LeuAsn: 3.591 ± 0.493
5.074LeuPro: 5.074 ± 0.829
3.591LeuGln: 3.591 ± 0.571
4.762LeuArg: 4.762 ± 0.53
5.152LeuSer: 5.152 ± 0.81
5.464LeuThr: 5.464 ± 0.647
4.996LeuVal: 4.996 ± 0.568
0.624LeuTrp: 0.624 ± 0.197
2.732LeuTyr: 2.732 ± 0.565
0.0LeuXaa: 0.0 ± 0.0
Met
2.966MetAla: 2.966 ± 0.483
0.468MetCys: 0.468 ± 0.239
0.859MetAsp: 0.859 ± 0.21
1.171MetGlu: 1.171 ± 0.363
0.781MetPhe: 0.781 ± 0.26
2.186MetGly: 2.186 ± 0.48
0.468MetHis: 0.468 ± 0.184
1.015MetIle: 1.015 ± 0.268
1.483MetLys: 1.483 ± 0.383
1.873MetLeu: 1.873 ± 0.385
0.468MetMet: 0.468 ± 0.186
1.171MetAsn: 1.171 ± 0.342
0.937MetPro: 0.937 ± 0.207
0.624MetGln: 0.624 ± 0.183
1.405MetArg: 1.405 ± 0.364
1.639MetSer: 1.639 ± 0.369
1.405MetThr: 1.405 ± 0.377
1.639MetVal: 1.639 ± 0.313
0.234MetTrp: 0.234 ± 0.15
0.39MetTyr: 0.39 ± 0.163
0.0MetXaa: 0.0 ± 0.0
Asn
4.371AsnAla: 4.371 ± 0.662
0.078AsnCys: 0.078 ± 0.073
1.561AsnAsp: 1.561 ± 0.367
2.03AsnGlu: 2.03 ± 0.406
2.42AsnPhe: 2.42 ± 0.576
2.888AsnGly: 2.888 ± 0.614
0.703AsnHis: 0.703 ± 0.226
1.717AsnIle: 1.717 ± 0.298
1.405AsnLys: 1.405 ± 0.343
3.513AsnLeu: 3.513 ± 0.573
1.327AsnMet: 1.327 ± 0.322
1.405AsnAsn: 1.405 ± 0.378
2.498AsnPro: 2.498 ± 0.372
1.561AsnGln: 1.561 ± 0.382
2.576AsnArg: 2.576 ± 0.433
2.03AsnSer: 2.03 ± 0.475
2.888AsnThr: 2.888 ± 0.57
1.951AsnVal: 1.951 ± 0.442
0.546AsnTrp: 0.546 ± 0.23
0.937AsnTyr: 0.937 ± 0.241
0.0AsnXaa: 0.0 ± 0.0
Pro
3.747ProAla: 3.747 ± 0.58
0.39ProCys: 0.39 ± 0.166
2.81ProAsp: 2.81 ± 0.557
4.215ProGlu: 4.215 ± 0.574
1.639ProPhe: 1.639 ± 0.397
2.966ProGly: 2.966 ± 0.483
0.703ProHis: 0.703 ± 0.271
1.249ProIle: 1.249 ± 0.284
3.044ProLys: 3.044 ± 0.476
3.669ProLeu: 3.669 ± 0.429
0.781ProMet: 0.781 ± 0.259
2.264ProAsn: 2.264 ± 0.4
3.2ProPro: 3.2 ± 1.005
1.249ProGln: 1.249 ± 0.363
2.42ProArg: 2.42 ± 0.366
2.264ProSer: 2.264 ± 0.382
2.81ProThr: 2.81 ± 0.467
3.903ProVal: 3.903 ± 0.502
0.859ProTrp: 0.859 ± 0.272
1.405ProTyr: 1.405 ± 0.379
0.0ProXaa: 0.0 ± 0.0
Gln
4.371GlnAla: 4.371 ± 0.731
0.312GlnCys: 0.312 ± 0.166
1.561GlnAsp: 1.561 ± 0.298
3.513GlnGlu: 3.513 ± 0.565
2.498GlnPhe: 2.498 ± 0.331
3.044GlnGly: 3.044 ± 0.455
0.781GlnHis: 0.781 ± 0.254
2.576GlnIle: 2.576 ± 0.454
2.654GlnLys: 2.654 ± 0.367
3.278GlnLeu: 3.278 ± 0.425
0.859GlnMet: 0.859 ± 0.226
1.405GlnAsn: 1.405 ± 0.442
2.03GlnPro: 2.03 ± 0.397
2.03GlnGln: 2.03 ± 0.42
2.966GlnArg: 2.966 ± 0.539
1.795GlnSer: 1.795 ± 0.362
0.937GlnThr: 0.937 ± 0.243
2.42GlnVal: 2.42 ± 0.499
0.624GlnTrp: 0.624 ± 0.249
1.327GlnTyr: 1.327 ± 0.338
0.0GlnXaa: 0.0 ± 0.0
Arg
5.854ArgAla: 5.854 ± 0.599
0.624ArgCys: 0.624 ± 0.229
3.825ArgAsp: 3.825 ± 0.496
3.513ArgGlu: 3.513 ± 0.606
1.717ArgPhe: 1.717 ± 0.405
4.762ArgGly: 4.762 ± 0.628
1.015ArgHis: 1.015 ± 0.272
3.903ArgIle: 3.903 ± 0.618
3.903ArgLys: 3.903 ± 0.526
4.918ArgLeu: 4.918 ± 0.463
1.171ArgMet: 1.171 ± 0.288
2.03ArgAsn: 2.03 ± 0.379
2.264ArgPro: 2.264 ± 0.362
1.873ArgGln: 1.873 ± 0.394
3.669ArgArg: 3.669 ± 0.571
3.435ArgSer: 3.435 ± 0.491
2.654ArgThr: 2.654 ± 0.447
4.137ArgVal: 4.137 ± 0.498
1.249ArgTrp: 1.249 ± 0.253
2.42ArgTyr: 2.42 ± 0.461
0.0ArgXaa: 0.0 ± 0.0
Ser
5.464SerAla: 5.464 ± 0.776
0.546SerCys: 0.546 ± 0.288
3.435SerAsp: 3.435 ± 0.663
3.122SerGlu: 3.122 ± 0.465
1.717SerPhe: 1.717 ± 0.39
5.464SerGly: 5.464 ± 0.696
1.171SerHis: 1.171 ± 0.279
1.873SerIle: 1.873 ± 0.386
2.81SerLys: 2.81 ± 0.488
4.527SerLeu: 4.527 ± 0.627
1.249SerMet: 1.249 ± 0.296
2.342SerAsn: 2.342 ± 0.468
2.654SerPro: 2.654 ± 0.44
2.576SerGln: 2.576 ± 0.457
3.903SerArg: 3.903 ± 0.513
3.435SerSer: 3.435 ± 0.642
2.966SerThr: 2.966 ± 0.849
3.747SerVal: 3.747 ± 0.682
0.624SerTrp: 0.624 ± 0.205
1.795SerTyr: 1.795 ± 0.46
0.0SerXaa: 0.0 ± 0.0
Thr
6.323ThrAla: 6.323 ± 0.64
0.39ThrCys: 0.39 ± 0.179
3.435ThrAsp: 3.435 ± 0.527
2.81ThrGlu: 2.81 ± 0.438
1.483ThrPhe: 1.483 ± 0.354
5.074ThrGly: 5.074 ± 0.715
0.937ThrHis: 0.937 ± 0.322
2.966ThrIle: 2.966 ± 0.486
3.747ThrLys: 3.747 ± 0.582
4.918ThrLeu: 4.918 ± 0.602
1.327ThrMet: 1.327 ± 0.338
2.186ThrAsn: 2.186 ± 0.524
3.044ThrPro: 3.044 ± 0.514
2.732ThrGln: 2.732 ± 0.391
2.888ThrArg: 2.888 ± 0.424
3.044ThrSer: 3.044 ± 0.506
3.825ThrThr: 3.825 ± 0.599
5.854ThrVal: 5.854 ± 0.83
0.312ThrTrp: 0.312 ± 0.193
1.951ThrTyr: 1.951 ± 0.527
0.0ThrXaa: 0.0 ± 0.0
Val
7.416ValAla: 7.416 ± 0.847
0.468ValCys: 0.468 ± 0.219
3.591ValAsp: 3.591 ± 0.65
4.762ValGlu: 4.762 ± 0.542
2.342ValPhe: 2.342 ± 0.396
4.527ValGly: 4.527 ± 0.712
1.171ValHis: 1.171 ± 0.318
3.981ValIle: 3.981 ± 0.73
3.903ValLys: 3.903 ± 0.49
5.074ValLeu: 5.074 ± 0.757
1.639ValMet: 1.639 ± 0.549
2.888ValAsn: 2.888 ± 0.627
3.669ValPro: 3.669 ± 0.501
3.669ValGln: 3.669 ± 0.544
4.293ValArg: 4.293 ± 0.501
4.293ValSer: 4.293 ± 0.755
4.137ValThr: 4.137 ± 0.722
6.479ValVal: 6.479 ± 0.828
0.937ValTrp: 0.937 ± 0.297
2.108ValTyr: 2.108 ± 0.377
0.0ValXaa: 0.0 ± 0.0
Trp
1.249TrpAla: 1.249 ± 0.293
0.234TrpCys: 0.234 ± 0.157
0.703TrpAsp: 0.703 ± 0.23
0.703TrpGlu: 0.703 ± 0.213
0.703TrpPhe: 0.703 ± 0.269
0.781TrpGly: 0.781 ± 0.239
0.312TrpHis: 0.312 ± 0.185
0.703TrpIle: 0.703 ± 0.198
0.781TrpLys: 0.781 ± 0.23
1.405TrpLeu: 1.405 ± 0.346
0.624TrpMet: 0.624 ± 0.164
0.703TrpAsn: 0.703 ± 0.238
0.234TrpPro: 0.234 ± 0.146
0.234TrpGln: 0.234 ± 0.112
1.093TrpArg: 1.093 ± 0.245
0.937TrpSer: 0.937 ± 0.303
0.781TrpThr: 0.781 ± 0.296
1.015TrpVal: 1.015 ± 0.251
0.156TrpTrp: 0.156 ± 0.098
0.078TrpTyr: 0.078 ± 0.068
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.122TyrAla: 3.122 ± 0.593
0.468TyrCys: 0.468 ± 0.189
2.342TyrAsp: 2.342 ± 0.441
2.264TyrGlu: 2.264 ± 0.395
1.483TyrPhe: 1.483 ± 0.435
3.278TyrGly: 3.278 ± 0.504
0.312TyrHis: 0.312 ± 0.147
1.405TyrIle: 1.405 ± 0.436
1.327TyrLys: 1.327 ± 0.332
2.498TyrLeu: 2.498 ± 0.303
0.937TyrMet: 0.937 ± 0.269
0.937TyrAsn: 0.937 ± 0.324
1.327TyrPro: 1.327 ± 0.277
1.639TyrGln: 1.639 ± 0.419
1.717TyrArg: 1.717 ± 0.286
1.405TyrSer: 1.405 ± 0.328
2.654TyrThr: 2.654 ± 0.538
2.264TyrVal: 2.264 ± 0.453
0.234TyrTrp: 0.234 ± 0.174
1.093TyrTyr: 1.093 ± 0.246
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 57 proteins (12812 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski