Amino acid dipepetide frequency for Bdellovibrio phage phi1422

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.675AlaAla: 6.675 ± 0.676
0.695AlaCys: 0.695 ± 0.234
5.214AlaAsp: 5.214 ± 0.532
5.979AlaGlu: 5.979 ± 0.63
2.92AlaPhe: 2.92 ± 0.443
5.284AlaGly: 5.284 ± 0.616
1.251AlaHis: 1.251 ± 0.348
5.006AlaIle: 5.006 ± 0.605
6.883AlaLys: 6.883 ± 0.733
6.883AlaLeu: 6.883 ± 0.639
1.53AlaMet: 1.53 ± 0.325
3.615AlaAsn: 3.615 ± 0.477
2.572AlaPro: 2.572 ± 0.628
2.433AlaGln: 2.433 ± 0.386
3.476AlaArg: 3.476 ± 0.493
3.685AlaSer: 3.685 ± 0.532
4.102AlaThr: 4.102 ± 0.681
4.936AlaVal: 4.936 ± 0.614
0.834AlaTrp: 0.834 ± 0.239
2.572AlaTyr: 2.572 ± 0.535
0.0AlaXaa: 0.0 ± 0.0
Cys
0.973CysAla: 0.973 ± 0.199
0.209CysCys: 0.209 ± 0.112
0.834CysAsp: 0.834 ± 0.23
0.904CysGlu: 0.904 ± 0.287
0.556CysPhe: 0.556 ± 0.219
0.834CysGly: 0.834 ± 0.255
0.417CysHis: 0.417 ± 0.182
0.834CysIle: 0.834 ± 0.256
0.973CysLys: 0.973 ± 0.257
0.834CysLeu: 0.834 ± 0.198
0.139CysMet: 0.139 ± 0.094
0.487CysAsn: 0.487 ± 0.158
0.695CysPro: 0.695 ± 0.194
0.348CysGln: 0.348 ± 0.139
0.487CysArg: 0.487 ± 0.172
0.556CysSer: 0.556 ± 0.163
0.487CysThr: 0.487 ± 0.174
0.973CysVal: 0.973 ± 0.256
0.209CysTrp: 0.209 ± 0.117
0.487CysTyr: 0.487 ± 0.198
0.0CysXaa: 0.0 ± 0.0
Asp
5.214AspAla: 5.214 ± 0.5
0.904AspCys: 0.904 ± 0.251
3.893AspAsp: 3.893 ± 0.63
4.519AspGlu: 4.519 ± 0.643
2.92AspPhe: 2.92 ± 0.47
4.311AspGly: 4.311 ± 0.519
0.765AspHis: 0.765 ± 0.239
3.615AspIle: 3.615 ± 0.482
3.198AspLys: 3.198 ± 0.521
5.493AspLeu: 5.493 ± 0.49
2.016AspMet: 2.016 ± 0.365
3.129AspAsn: 3.129 ± 0.453
1.877AspPro: 1.877 ± 0.362
1.669AspGln: 1.669 ± 0.31
1.877AspArg: 1.877 ± 0.34
3.129AspSer: 3.129 ± 0.469
3.407AspThr: 3.407 ± 0.526
3.615AspVal: 3.615 ± 0.569
0.834AspTrp: 0.834 ± 0.252
2.642AspTyr: 2.642 ± 0.41
0.0AspXaa: 0.0 ± 0.0
Glu
4.241GluAla: 4.241 ± 0.536
0.834GluCys: 0.834 ± 0.268
3.963GluAsp: 3.963 ± 0.542
5.562GluGlu: 5.562 ± 1.018
3.824GluPhe: 3.824 ± 0.423
3.754GluGly: 3.754 ± 0.434
2.016GluHis: 2.016 ± 0.389
5.771GluIle: 5.771 ± 0.687
7.509GluLys: 7.509 ± 0.816
4.936GluLeu: 4.936 ± 0.589
2.364GluMet: 2.364 ± 0.491
4.728GluAsn: 4.728 ± 0.611
1.599GluPro: 1.599 ± 0.389
2.364GluGln: 2.364 ± 0.412
2.433GluArg: 2.433 ± 0.494
3.963GluSer: 3.963 ± 0.516
3.268GluThr: 3.268 ± 0.446
4.797GluVal: 4.797 ± 0.657
0.973GluTrp: 0.973 ± 0.272
2.851GluTyr: 2.851 ± 0.514
0.0GluXaa: 0.0 ± 0.0
Phe
3.476PheAla: 3.476 ± 0.433
0.556PheCys: 0.556 ± 0.18
2.572PheAsp: 2.572 ± 0.533
2.781PheGlu: 2.781 ± 0.456
2.155PhePhe: 2.155 ± 0.292
3.546PheGly: 3.546 ± 0.566
0.556PheHis: 0.556 ± 0.172
3.198PheIle: 3.198 ± 0.52
4.241PheLys: 4.241 ± 0.491
3.268PheLeu: 3.268 ± 0.496
0.765PheMet: 0.765 ± 0.203
2.642PheAsn: 2.642 ± 0.412
1.599PhePro: 1.599 ± 0.342
1.877PheGln: 1.877 ± 0.332
2.225PheArg: 2.225 ± 0.377
2.642PheSer: 2.642 ± 0.36
2.642PheThr: 2.642 ± 0.367
2.155PheVal: 2.155 ± 0.331
0.348PheTrp: 0.348 ± 0.177
1.391PheTyr: 1.391 ± 0.295
0.0PheXaa: 0.0 ± 0.0
Gly
5.075GlyAla: 5.075 ± 0.517
0.973GlyCys: 0.973 ± 0.274
3.268GlyAsp: 3.268 ± 0.481
3.198GlyGlu: 3.198 ± 0.455
3.963GlyPhe: 3.963 ± 0.394
4.936GlyGly: 4.936 ± 0.619
0.487GlyHis: 0.487 ± 0.173
5.354GlyIle: 5.354 ± 0.505
5.562GlyLys: 5.562 ± 0.701
4.589GlyLeu: 4.589 ± 0.517
1.877GlyMet: 1.877 ± 0.386
3.546GlyAsn: 3.546 ± 0.495
1.53GlyPro: 1.53 ± 0.348
2.781GlyGln: 2.781 ± 0.508
2.712GlyArg: 2.712 ± 0.369
5.145GlySer: 5.145 ± 0.732
4.38GlyThr: 4.38 ± 0.783
5.493GlyVal: 5.493 ± 0.508
0.626GlyTrp: 0.626 ± 0.197
1.808GlyTyr: 1.808 ± 0.333
0.0GlyXaa: 0.0 ± 0.0
His
0.973HisAla: 0.973 ± 0.262
0.765HisCys: 0.765 ± 0.245
0.487HisAsp: 0.487 ± 0.175
0.834HisGlu: 0.834 ± 0.213
0.904HisPhe: 0.904 ± 0.246
1.391HisGly: 1.391 ± 0.3
0.278HisHis: 0.278 ± 0.125
0.904HisIle: 0.904 ± 0.22
1.599HisLys: 1.599 ± 0.275
0.973HisLeu: 0.973 ± 0.278
0.348HisMet: 0.348 ± 0.125
0.278HisAsn: 0.278 ± 0.12
0.695HisPro: 0.695 ± 0.228
0.348HisGln: 0.348 ± 0.135
0.556HisArg: 0.556 ± 0.197
0.904HisSer: 0.904 ± 0.275
0.904HisThr: 0.904 ± 0.227
0.765HisVal: 0.765 ± 0.247
0.209HisTrp: 0.209 ± 0.105
0.695HisTyr: 0.695 ± 0.231
0.0HisXaa: 0.0 ± 0.0
Ile
4.45IleAla: 4.45 ± 0.602
0.695IleCys: 0.695 ± 0.19
4.658IleAsp: 4.658 ± 0.54
4.936IleGlu: 4.936 ± 0.652
2.155IlePhe: 2.155 ± 0.536
4.311IleGly: 4.311 ± 0.473
0.904IleHis: 0.904 ± 0.306
3.754IleIle: 3.754 ± 0.508
6.049IleLys: 6.049 ± 0.539
5.006IleLeu: 5.006 ± 0.572
1.321IleMet: 1.321 ± 0.387
2.503IleAsn: 2.503 ± 0.436
2.92IlePro: 2.92 ± 0.405
2.92IleGln: 2.92 ± 0.393
3.476IleArg: 3.476 ± 0.513
4.102IleSer: 4.102 ± 0.631
3.824IleThr: 3.824 ± 0.476
4.797IleVal: 4.797 ± 0.534
0.626IleTrp: 0.626 ± 0.197
1.808IleTyr: 1.808 ± 0.329
0.0IleXaa: 0.0 ± 0.0
Lys
5.91LysAla: 5.91 ± 0.742
0.695LysCys: 0.695 ± 0.224
4.867LysAsp: 4.867 ± 0.659
7.578LysGlu: 7.578 ± 0.964
3.963LysPhe: 3.963 ± 0.512
5.145LysGly: 5.145 ± 0.641
1.112LysHis: 1.112 ± 0.268
5.006LysIle: 5.006 ± 0.609
9.108LysLys: 9.108 ± 1.029
6.257LysLeu: 6.257 ± 0.604
2.92LysMet: 2.92 ± 0.539
4.519LysAsn: 4.519 ± 0.538
2.99LysPro: 2.99 ± 0.502
2.712LysGln: 2.712 ± 0.486
3.824LysArg: 3.824 ± 0.532
5.214LysSer: 5.214 ± 0.652
4.936LysThr: 4.936 ± 0.567
5.284LysVal: 5.284 ± 0.563
1.321LysTrp: 1.321 ± 0.206
1.947LysTyr: 1.947 ± 0.312
0.0LysXaa: 0.0 ± 0.0
Leu
6.953LeuAla: 6.953 ± 0.722
1.251LeuCys: 1.251 ± 0.294
3.754LeuAsp: 3.754 ± 0.54
5.632LeuGlu: 5.632 ± 0.614
2.99LeuPhe: 2.99 ± 0.445
3.963LeuGly: 3.963 ± 0.569
1.321LeuHis: 1.321 ± 0.25
4.241LeuIle: 4.241 ± 0.465
5.84LeuLys: 5.84 ± 0.642
5.979LeuLeu: 5.979 ± 0.873
1.808LeuMet: 1.808 ± 0.358
4.172LeuAsn: 4.172 ± 0.445
3.963LeuPro: 3.963 ± 0.414
2.225LeuGln: 2.225 ± 0.341
4.102LeuArg: 4.102 ± 0.564
4.936LeuSer: 4.936 ± 0.611
5.075LeuThr: 5.075 ± 0.514
4.728LeuVal: 4.728 ± 0.629
0.695LeuTrp: 0.695 ± 0.202
2.503LeuTyr: 2.503 ± 0.376
0.0LeuXaa: 0.0 ± 0.0
Met
1.391MetAla: 1.391 ± 0.296
0.07MetCys: 0.07 ± 0.069
1.46MetAsp: 1.46 ± 0.275
1.53MetGlu: 1.53 ± 0.337
1.043MetPhe: 1.043 ± 0.266
2.016MetGly: 2.016 ± 0.424
0.626MetHis: 0.626 ± 0.188
2.016MetIle: 2.016 ± 0.357
3.268MetLys: 3.268 ± 0.501
1.738MetLeu: 1.738 ± 0.331
0.556MetMet: 0.556 ± 0.181
1.391MetAsn: 1.391 ± 0.267
1.321MetPro: 1.321 ± 0.243
1.251MetGln: 1.251 ± 0.331
1.46MetArg: 1.46 ± 0.322
2.155MetSer: 2.155 ± 0.397
1.46MetThr: 1.46 ± 0.292
1.46MetVal: 1.46 ± 0.315
0.417MetTrp: 0.417 ± 0.163
0.417MetTyr: 0.417 ± 0.161
0.0MetXaa: 0.0 ± 0.0
Asn
4.172AsnAla: 4.172 ± 0.667
0.834AsnCys: 0.834 ± 0.239
1.947AsnAsp: 1.947 ± 0.32
3.198AsnGlu: 3.198 ± 0.6
2.294AsnPhe: 2.294 ± 0.385
4.172AsnGly: 4.172 ± 0.605
0.487AsnHis: 0.487 ± 0.208
3.615AsnIle: 3.615 ± 0.499
3.198AsnLys: 3.198 ± 0.554
4.589AsnLeu: 4.589 ± 0.59
1.738AsnMet: 1.738 ± 0.32
2.503AsnAsn: 2.503 ± 0.358
3.059AsnPro: 3.059 ± 0.54
1.599AsnGln: 1.599 ± 0.288
2.364AsnArg: 2.364 ± 0.418
3.963AsnSer: 3.963 ± 0.462
2.433AsnThr: 2.433 ± 0.403
3.268AsnVal: 3.268 ± 0.595
0.765AsnTrp: 0.765 ± 0.245
1.321AsnTyr: 1.321 ± 0.327
0.0AsnXaa: 0.0 ± 0.0
Pro
3.129ProAla: 3.129 ± 0.503
0.417ProCys: 0.417 ± 0.184
2.92ProAsp: 2.92 ± 0.634
3.129ProGlu: 3.129 ± 0.507
1.669ProPhe: 1.669 ± 0.292
2.086ProGly: 2.086 ± 0.38
0.556ProHis: 0.556 ± 0.164
2.572ProIle: 2.572 ± 0.44
2.364ProLys: 2.364 ± 0.493
2.225ProLeu: 2.225 ± 0.426
1.251ProMet: 1.251 ± 0.22
2.155ProAsn: 2.155 ± 0.394
1.391ProPro: 1.391 ± 0.281
1.043ProGln: 1.043 ± 0.294
1.251ProArg: 1.251 ± 0.256
2.155ProSer: 2.155 ± 0.412
2.294ProThr: 2.294 ± 0.458
3.198ProVal: 3.198 ± 0.51
0.417ProTrp: 0.417 ± 0.162
0.695ProTyr: 0.695 ± 0.264
0.0ProXaa: 0.0 ± 0.0
Gln
3.407GlnAla: 3.407 ± 0.58
0.139GlnCys: 0.139 ± 0.094
1.877GlnAsp: 1.877 ± 0.352
2.364GlnGlu: 2.364 ± 0.335
1.043GlnPhe: 1.043 ± 0.29
1.947GlnGly: 1.947 ± 0.333
0.278GlnHis: 0.278 ± 0.158
2.503GlnIle: 2.503 ± 0.323
3.546GlnLys: 3.546 ± 0.512
2.92GlnLeu: 2.92 ± 0.508
1.182GlnMet: 1.182 ± 0.249
2.155GlnAsn: 2.155 ± 0.418
0.834GlnPro: 0.834 ± 0.231
1.46GlnGln: 1.46 ± 0.249
1.391GlnArg: 1.391 ± 0.307
2.225GlnSer: 2.225 ± 0.472
1.391GlnThr: 1.391 ± 0.342
2.572GlnVal: 2.572 ± 0.335
0.626GlnTrp: 0.626 ± 0.267
1.53GlnTyr: 1.53 ± 0.338
0.0GlnXaa: 0.0 ± 0.0
Arg
3.476ArgAla: 3.476 ± 0.472
0.487ArgCys: 0.487 ± 0.176
2.503ArgAsp: 2.503 ± 0.506
3.963ArgGlu: 3.963 ± 0.55
2.086ArgPhe: 2.086 ± 0.545
2.642ArgGly: 2.642 ± 0.601
0.556ArgHis: 0.556 ± 0.198
2.572ArgIle: 2.572 ± 0.374
3.546ArgLys: 3.546 ± 0.591
3.198ArgLeu: 3.198 ± 0.441
1.53ArgMet: 1.53 ± 0.335
1.808ArgAsn: 1.808 ± 0.319
1.321ArgPro: 1.321 ± 0.272
1.599ArgGln: 1.599 ± 0.322
2.225ArgArg: 2.225 ± 0.554
2.433ArgSer: 2.433 ± 0.366
1.738ArgThr: 1.738 ± 0.352
3.546ArgVal: 3.546 ± 0.423
0.417ArgTrp: 0.417 ± 0.163
2.294ArgTyr: 2.294 ± 0.441
0.0ArgXaa: 0.0 ± 0.0
Ser
4.658SerAla: 4.658 ± 0.584
0.417SerCys: 0.417 ± 0.182
3.476SerAsp: 3.476 ± 0.409
4.241SerGlu: 4.241 ± 0.611
3.059SerPhe: 3.059 ± 0.43
5.214SerGly: 5.214 ± 0.558
1.112SerHis: 1.112 ± 0.283
3.824SerIle: 3.824 ± 0.509
4.658SerLys: 4.658 ± 0.676
4.589SerLeu: 4.589 ± 0.468
1.391SerMet: 1.391 ± 0.296
2.712SerAsn: 2.712 ± 0.428
1.599SerPro: 1.599 ± 0.392
2.294SerGln: 2.294 ± 0.404
2.851SerArg: 2.851 ± 0.362
3.754SerSer: 3.754 ± 0.509
3.198SerThr: 3.198 ± 0.381
5.91SerVal: 5.91 ± 0.694
0.765SerTrp: 0.765 ± 0.204
2.433SerTyr: 2.433 ± 0.398
0.0SerXaa: 0.0 ± 0.0
Thr
4.38ThrAla: 4.38 ± 0.671
0.556ThrCys: 0.556 ± 0.227
3.407ThrAsp: 3.407 ± 0.519
3.198ThrGlu: 3.198 ± 0.521
2.712ThrPhe: 2.712 ± 0.414
4.38ThrGly: 4.38 ± 0.716
0.556ThrHis: 0.556 ± 0.199
4.45ThrIle: 4.45 ± 0.473
3.963ThrLys: 3.963 ± 0.464
4.658ThrLeu: 4.658 ± 0.595
1.738ThrMet: 1.738 ± 0.297
2.086ThrAsn: 2.086 ± 0.378
2.712ThrPro: 2.712 ± 0.474
2.294ThrGln: 2.294 ± 0.356
2.155ThrArg: 2.155 ± 0.394
3.963ThrSer: 3.963 ± 0.558
3.059ThrThr: 3.059 ± 0.487
4.033ThrVal: 4.033 ± 0.561
0.765ThrTrp: 0.765 ± 0.217
1.53ThrTyr: 1.53 ± 0.337
0.0ThrXaa: 0.0 ± 0.0
Val
4.936ValAla: 4.936 ± 0.688
1.043ValCys: 1.043 ± 0.279
5.562ValAsp: 5.562 ± 0.482
5.006ValGlu: 5.006 ± 0.754
2.781ValPhe: 2.781 ± 0.405
3.893ValGly: 3.893 ± 0.428
0.834ValHis: 0.834 ± 0.235
4.311ValIle: 4.311 ± 0.561
5.214ValLys: 5.214 ± 0.482
4.797ValLeu: 4.797 ± 0.495
1.46ValMet: 1.46 ± 0.364
4.172ValAsn: 4.172 ± 0.438
2.503ValPro: 2.503 ± 0.42
2.712ValGln: 2.712 ± 0.46
2.433ValArg: 2.433 ± 0.402
4.589ValSer: 4.589 ± 0.532
4.936ValThr: 4.936 ± 0.564
4.797ValVal: 4.797 ± 0.741
0.834ValTrp: 0.834 ± 0.184
2.364ValTyr: 2.364 ± 0.357
0.0ValXaa: 0.0 ± 0.0
Trp
0.834TrpAla: 0.834 ± 0.235
0.139TrpCys: 0.139 ± 0.095
1.043TrpAsp: 1.043 ± 0.281
1.321TrpGlu: 1.321 ± 0.288
0.278TrpPhe: 0.278 ± 0.138
0.348TrpGly: 0.348 ± 0.157
0.209TrpHis: 0.209 ± 0.107
0.348TrpIle: 0.348 ± 0.153
1.46TrpLys: 1.46 ± 0.381
0.973TrpLeu: 0.973 ± 0.232
0.278TrpMet: 0.278 ± 0.18
0.904TrpAsn: 0.904 ± 0.296
0.348TrpPro: 0.348 ± 0.157
0.417TrpGln: 0.417 ± 0.163
0.556TrpArg: 0.556 ± 0.204
0.556TrpSer: 0.556 ± 0.221
0.973TrpThr: 0.973 ± 0.217
0.834TrpVal: 0.834 ± 0.223
0.07TrpTrp: 0.07 ± 0.068
0.07TrpTyr: 0.07 ± 0.073
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.225TyrAla: 2.225 ± 0.445
0.556TyrCys: 0.556 ± 0.2
1.321TyrAsp: 1.321 ± 0.275
1.877TyrGlu: 1.877 ± 0.386
1.321TyrPhe: 1.321 ± 0.259
3.129TyrGly: 3.129 ± 0.391
0.487TyrHis: 0.487 ± 0.187
1.321TyrIle: 1.321 ± 0.278
3.129TyrLys: 3.129 ± 0.501
2.225TyrLeu: 2.225 ± 0.37
0.765TyrMet: 0.765 ± 0.206
2.086TyrAsn: 2.086 ± 0.438
1.391TyrPro: 1.391 ± 0.386
1.043TyrGln: 1.043 ± 0.295
2.086TyrArg: 2.086 ± 0.403
2.086TyrSer: 2.086 ± 0.457
2.086TyrThr: 2.086 ± 0.374
2.016TyrVal: 2.016 ± 0.344
0.209TyrTrp: 0.209 ± 0.097
0.973TyrTyr: 0.973 ± 0.224
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 76 proteins (14384 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski