Amino acid dipeptide frequency for Bdellovibrio phage phi1402

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
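The counting described above can be sketched in Python as follows. This is only an illustration, not the Proteome-pI implementation: the function name `dipeptide_permille`, the one-letter input alphabet, the toy sequences, and the choice to normalise by the total number of counted pairs are all assumptions of this sketch.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; any other symbol is
# mapped to "X" (Xaa), which gives the 21st class mentioned in the text.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

# Uniform expectation over 400 dipeptides: 0.25 %, i.e. 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins.

    Dipeptides are overlapping adjacent pairs inside one protein; a pair
    never spans two proteins. Normalising by the total pair count is an
    assumption of this sketch.
    """
    counts = Counter()
    for seq in proteins:
        cleaned = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Made-up sequences for illustration, not taken from the phage proteome.
toy_proteins = ["MAAKL", "MAKA"]
freqs = dipeptide_permille(toy_proteins)
for pair, value in sorted(freqs.items()):
    label = "over" if value > EXPECTED_PERMILLE else "under"
    print(f"{pair}: {value:.1f} per mille ({label}-represented)")
```

With so few residues every observed pair is far above 2.5‰; the comparison only becomes meaningful on a full proteome.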

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
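As a small worked example of the unit conversion, the snippet below turns one table value (AlaAla, 8.67‰) into a fraction and a percentage. The helper name `permille_to_fraction` is hypothetical.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


ala_ala_permille = 8.67  # AlaAla value taken from the table
fraction = permille_to_fraction(ala_ala_permille)
percent = fraction * 100

print(f"AlaAla: {fraction:.5f} as a fraction, {percent:.3f} %")
```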

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.67 ± 1.587
AlaCys: 0.4 ± 0.246
AlaAsp: 4.802 ± 0.88
AlaGlu: 4.669 ± 0.897
AlaPhe: 5.469 ± 1.202
AlaGly: 7.203 ± 1.317
AlaHis: 1.2 ± 0.451
AlaIle: 5.469 ± 0.878
AlaLys: 6.936 ± 1.243
AlaLeu: 8.003 ± 0.827
AlaMet: 3.335 ± 0.69
AlaAsn: 5.202 ± 0.836
AlaPro: 4.135 ± 0.893
AlaGln: 4.535 ± 0.882
AlaArg: 3.068 ± 0.699
AlaSer: 5.869 ± 0.75
AlaThr: 4.669 ± 0.848
AlaVal: 4.935 ± 0.873
AlaTrp: 1.334 ± 0.345
AlaTyr: 2.268 ± 0.521
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.8 ± 0.295
CysCys: 0.0 ± 0.0
CysAsp: 0.534 ± 0.283
CysGlu: 0.667 ± 0.328
CysPhe: 0.534 ± 0.27
CysGly: 1.334 ± 0.45
CysHis: 0.534 ± 0.253
CysIle: 0.267 ± 0.195
CysLys: 0.667 ± 0.276
CysLeu: 0.934 ± 0.316
CysMet: 0.133 ± 0.117
CysAsn: 0.534 ± 0.287
CysPro: 0.8 ± 0.392
CysGln: 0.534 ± 0.247
CysArg: 0.0 ± 0.0
CysSer: 0.4 ± 0.213
CysThr: 0.534 ± 0.236
CysVal: 1.067 ± 0.392
CysTrp: 0.0 ± 0.0
CysTyr: 0.133 ± 0.147
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.335 ± 0.787
AspCys: 0.4 ± 0.179
AspAsp: 3.201 ± 0.655
AspGlu: 3.868 ± 0.574
AspPhe: 3.335 ± 0.798
AspGly: 5.202 ± 0.825
AspHis: 0.534 ± 0.228
AspIle: 4.002 ± 0.792
AspLys: 3.201 ± 0.636
AspLeu: 6.136 ± 1.055
AspMet: 0.534 ± 0.251
AspAsn: 2.268 ± 0.548
AspPro: 3.068 ± 0.618
AspGln: 2.134 ± 0.519
AspArg: 2.268 ± 0.597
AspSer: 3.601 ± 0.724
AspThr: 2.935 ± 0.535
AspVal: 3.735 ± 0.74
AspTrp: 0.934 ± 0.353
AspTyr: 2.668 ± 0.623
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.536 ± 1.19
GluCys: 1.2 ± 0.487
GluAsp: 2.935 ± 0.605
GluGlu: 2.668 ± 0.541
GluPhe: 2.668 ± 0.667
GluGly: 4.669 ± 0.826
GluHis: 1.067 ± 0.396
GluIle: 2.668 ± 0.589
GluLys: 6.536 ± 1.143
GluLeu: 4.535 ± 0.769
GluMet: 1.601 ± 0.348
GluAsn: 2.801 ± 0.688
GluPro: 2.534 ± 0.469
GluGln: 2.668 ± 0.772
GluArg: 2.801 ± 0.579
GluSer: 2.935 ± 0.568
GluThr: 4.402 ± 0.834
GluVal: 4.669 ± 0.638
GluTrp: 0.8 ± 0.295
GluTyr: 2.134 ± 0.582
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.802 ± 0.802
PheCys: 0.8 ± 0.325
PheAsp: 3.335 ± 0.691
PheGlu: 3.601 ± 0.595
PhePhe: 2.134 ± 0.589
PheGly: 3.735 ± 0.701
PheHis: 0.667 ± 0.247
PheIle: 2.534 ± 0.604
PheLys: 3.468 ± 0.817
PheLeu: 3.468 ± 0.831
PheMet: 0.8 ± 0.276
PheAsn: 1.734 ± 0.398
PhePro: 1.601 ± 0.341
PheGln: 1.467 ± 0.493
PheArg: 1.2 ± 0.426
PheSer: 4.135 ± 0.74
PheThr: 1.867 ± 0.45
PheVal: 2.801 ± 0.578
PheTrp: 0.4 ± 0.248
PheTyr: 1.601 ± 0.548
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.336 ± 1.247
GlyCys: 1.067 ± 0.426
GlyAsp: 3.601 ± 0.684
GlyGlu: 5.069 ± 1.123
GlyPhe: 2.935 ± 0.662
GlyGly: 6.803 ± 2.304
GlyHis: 1.334 ± 0.341
GlyIle: 3.735 ± 0.698
GlyLys: 5.069 ± 0.875
GlyLeu: 8.403 ± 1.351
GlyMet: 1.467 ± 0.551
GlyAsn: 3.201 ± 0.68
GlyPro: 2.268 ± 0.552
GlyGln: 3.335 ± 0.604
GlyArg: 3.068 ± 0.597
GlySer: 5.335 ± 1.024
GlyThr: 4.935 ± 0.778
GlyVal: 6.002 ± 0.862
GlyTrp: 0.667 ± 0.261
GlyTyr: 3.201 ± 0.676
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.934 ± 0.385
HisCys: 0.267 ± 0.18
HisAsp: 0.534 ± 0.234
HisGlu: 1.601 ± 0.353
HisPhe: 0.667 ± 0.268
HisGly: 1.734 ± 0.384
HisHis: 0.0 ± 0.0
HisIle: 0.267 ± 0.189
HisLys: 1.067 ± 0.334
HisLeu: 1.467 ± 0.516
HisMet: 0.267 ± 0.166
HisAsn: 0.267 ± 0.195
HisPro: 0.667 ± 0.311
HisGln: 0.534 ± 0.342
HisArg: 0.267 ± 0.159
HisSer: 1.067 ± 0.422
HisThr: 0.0 ± 0.0
HisVal: 0.8 ± 0.3
HisTrp: 0.0 ± 0.0
HisTyr: 1.067 ± 0.376
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.135 ± 0.691
IleCys: 0.267 ± 0.205
IleAsp: 3.201 ± 0.714
IleGlu: 3.868 ± 0.767
IlePhe: 3.468 ± 0.726
IleGly: 3.468 ± 0.74
IleHis: 0.8 ± 0.383
IleIle: 2.935 ± 0.644
IleLys: 4.402 ± 0.888
IleLeu: 5.602 ± 0.863
IleMet: 1.734 ± 0.434
IleAsn: 2.801 ± 0.473
IlePro: 2.668 ± 0.524
IleGln: 1.867 ± 0.473
IleArg: 1.734 ± 0.497
IleSer: 3.868 ± 0.891
IleThr: 4.535 ± 0.827
IleVal: 3.601 ± 0.616
IleTrp: 0.934 ± 0.479
IleTyr: 1.734 ± 0.491
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.936 ± 1.102
LysCys: 0.8 ± 0.287
LysAsp: 5.069 ± 1.126
LysGlu: 3.335 ± 0.864
LysPhe: 3.335 ± 0.688
LysGly: 5.869 ± 0.873
LysHis: 0.534 ± 0.259
LysIle: 5.202 ± 0.993
LysLys: 7.47 ± 1.407
LysLeu: 5.069 ± 0.749
LysMet: 2.668 ± 0.616
LysAsn: 4.402 ± 0.637
LysPro: 3.335 ± 0.764
LysGln: 2.134 ± 0.522
LysArg: 2.801 ± 0.623
LysSer: 4.135 ± 0.862
LysThr: 5.469 ± 0.895
LysVal: 3.601 ± 0.768
LysTrp: 0.667 ± 0.34
LysTyr: 1.601 ± 0.505
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.336 ± 1.028
LeuCys: 0.534 ± 0.318
LeuAsp: 4.669 ± 0.598
LeuGlu: 6.269 ± 0.885
LeuPhe: 2.801 ± 0.676
LeuGly: 5.202 ± 0.737
LeuHis: 1.334 ± 0.469
LeuIle: 5.736 ± 0.667
LeuLys: 6.536 ± 1.016
LeuLeu: 6.936 ± 0.808
LeuMet: 1.067 ± 0.376
LeuAsn: 3.201 ± 0.651
LeuPro: 5.069 ± 0.837
LeuGln: 3.068 ± 0.551
LeuArg: 3.601 ± 0.707
LeuSer: 4.402 ± 0.805
LeuThr: 6.002 ± 0.785
LeuVal: 4.535 ± 0.824
LeuTrp: 0.534 ± 0.229
LeuTyr: 2.001 ± 0.489
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.001 ± 0.526
MetCys: 0.267 ± 0.188
MetAsp: 1.867 ± 0.481
MetGlu: 1.067 ± 0.456
MetPhe: 1.334 ± 0.392
MetGly: 1.601 ± 0.518
MetHis: 0.4 ± 0.212
MetIle: 1.734 ± 0.564
MetLys: 2.001 ± 0.567
MetLeu: 1.334 ± 0.389
MetMet: 0.133 ± 0.15
MetAsn: 1.601 ± 0.489
MetPro: 1.334 ± 0.397
MetGln: 1.467 ± 0.418
MetArg: 2.001 ± 0.556
MetSer: 2.001 ± 0.459
MetThr: 2.134 ± 0.421
MetVal: 0.667 ± 0.287
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.002 ± 0.589
AsnCys: 0.8 ± 0.351
AsnAsp: 1.2 ± 0.365
AsnGlu: 2.534 ± 0.489
AsnPhe: 2.134 ± 0.574
AsnGly: 4.935 ± 0.89
AsnHis: 0.267 ± 0.19
AsnIle: 2.001 ± 0.364
AsnLys: 3.201 ± 0.645
AsnLeu: 3.068 ± 0.515
AsnMet: 1.734 ± 0.479
AsnAsn: 1.734 ± 0.418
AsnPro: 1.867 ± 0.483
AsnGln: 2.935 ± 0.901
AsnArg: 1.601 ± 0.38
AsnSer: 2.668 ± 0.958
AsnThr: 1.867 ± 0.503
AsnVal: 2.134 ± 0.562
AsnTrp: 1.067 ± 0.361
AsnTyr: 1.734 ± 0.529
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.469 ± 1.212
ProCys: 0.133 ± 0.153
ProAsp: 3.601 ± 0.768
ProGlu: 3.601 ± 0.785
ProPhe: 1.334 ± 0.501
ProGly: 3.735 ± 0.747
ProHis: 0.4 ± 0.248
ProIle: 2.401 ± 0.643
ProLys: 4.135 ± 0.89
ProLeu: 2.935 ± 0.569
ProMet: 0.534 ± 0.284
ProAsn: 1.601 ± 0.487
ProPro: 3.468 ± 1.085
ProGln: 2.401 ± 0.547
ProArg: 1.067 ± 0.461
ProSer: 1.601 ± 0.444
ProThr: 2.401 ± 0.437
ProVal: 2.668 ± 0.467
ProTrp: 0.667 ± 0.268
ProTyr: 1.601 ± 0.489
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.402 ± 0.935
GlnCys: 0.534 ± 0.269
GlnAsp: 2.801 ± 0.653
GlnGlu: 1.867 ± 0.582
GlnPhe: 1.734 ± 0.442
GlnGly: 2.401 ± 0.586
GlnHis: 0.133 ± 0.129
GlnIle: 2.401 ± 0.534
GlnLys: 3.068 ± 0.588
GlnLeu: 2.534 ± 0.488
GlnMet: 1.067 ± 0.384
GlnAsn: 2.668 ± 0.596
GlnPro: 1.467 ± 0.529
GlnGln: 1.334 ± 0.454
GlnArg: 1.867 ± 0.536
GlnSer: 2.134 ± 0.489
GlnThr: 2.268 ± 0.674
GlnVal: 3.468 ± 0.579
GlnTrp: 0.667 ± 0.252
GlnTyr: 1.334 ± 0.502
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.135 ± 0.925
ArgCys: 0.667 ± 0.253
ArgAsp: 2.268 ± 0.586
ArgGlu: 2.401 ± 0.518
ArgPhe: 1.467 ± 0.437
ArgGly: 3.068 ± 0.62
ArgHis: 0.934 ± 0.442
ArgIle: 2.935 ± 0.621
ArgLys: 1.734 ± 0.434
ArgLeu: 3.335 ± 0.646
ArgMet: 1.067 ± 0.392
ArgAsn: 1.734 ± 0.443
ArgPro: 1.334 ± 0.543
ArgGln: 1.2 ± 0.432
ArgArg: 1.467 ± 0.373
ArgSer: 2.534 ± 0.511
ArgThr: 1.867 ± 0.514
ArgVal: 3.068 ± 0.652
ArgTrp: 0.667 ± 0.272
ArgTyr: 0.934 ± 0.345
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.403 ± 0.864
SerCys: 0.4 ± 0.228
SerAsp: 4.002 ± 0.777
SerGlu: 4.002 ± 0.921
SerPhe: 1.2 ± 0.347
SerGly: 4.535 ± 0.7
SerHis: 0.934 ± 0.432
SerIle: 2.935 ± 0.679
SerLys: 4.135 ± 0.926
SerLeu: 4.402 ± 0.516
SerMet: 1.467 ± 0.393
SerAsn: 2.268 ± 0.508
SerPro: 2.935 ± 0.643
SerGln: 2.401 ± 0.609
SerArg: 3.201 ± 0.636
SerSer: 4.268 ± 0.769
SerThr: 4.135 ± 0.823
SerVal: 5.202 ± 0.986
SerTrp: 0.4 ± 0.28
SerTyr: 2.801 ± 0.589
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.736 ± 1.034
ThrCys: 0.4 ± 0.251
ThrAsp: 3.735 ± 0.739
ThrGlu: 4.535 ± 0.91
ThrPhe: 4.135 ± 0.719
ThrGly: 4.402 ± 1.066
ThrHis: 0.8 ± 0.312
ThrIle: 2.801 ± 0.675
ThrLys: 3.735 ± 0.636
ThrLeu: 4.002 ± 0.585
ThrMet: 1.734 ± 0.53
ThrAsn: 2.268 ± 0.416
ThrPro: 3.468 ± 0.785
ThrGln: 1.734 ± 0.39
ThrArg: 2.268 ± 0.526
ThrSer: 4.002 ± 0.79
ThrThr: 4.669 ± 0.735
ThrVal: 4.135 ± 0.978
ThrTrp: 1.2 ± 0.544
ThrTyr: 2.668 ± 0.667
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.602 ± 0.785
ValCys: 0.934 ± 0.414
ValAsp: 2.801 ± 0.585
ValGlu: 4.268 ± 0.653
ValPhe: 2.935 ± 0.537
ValGly: 5.469 ± 0.985
ValHis: 1.067 ± 0.418
ValIle: 4.135 ± 0.761
ValLys: 4.402 ± 0.706
ValLeu: 5.335 ± 0.887
ValMet: 2.401 ± 0.447
ValAsn: 2.001 ± 0.675
ValPro: 2.401 ± 0.541
ValGln: 2.935 ± 0.487
ValArg: 3.068 ± 0.562
ValSer: 4.535 ± 0.862
ValThr: 4.935 ± 0.835
ValVal: 4.268 ± 0.707
ValTrp: 0.133 ± 0.117
ValTyr: 1.067 ± 0.389
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.8 ± 0.33
TrpCys: 0.133 ± 0.135
TrpAsp: 0.667 ± 0.291
TrpGlu: 0.667 ± 0.279
TrpPhe: 0.667 ± 0.293
TrpGly: 0.667 ± 0.331
TrpHis: 0.133 ± 0.125
TrpIle: 1.067 ± 0.367
TrpLys: 0.4 ± 0.206
TrpLeu: 1.334 ± 0.481
TrpMet: 0.267 ± 0.162
TrpAsn: 0.133 ± 0.117
TrpPro: 0.534 ± 0.292
TrpGln: 0.133 ± 0.125
TrpArg: 0.4 ± 0.22
TrpSer: 0.4 ± 0.277
TrpThr: 1.067 ± 0.507
TrpVal: 1.334 ± 0.58
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.667 ± 0.258
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.801 ± 0.65
TyrCys: 0.4 ± 0.236
TyrAsp: 2.534 ± 0.58
TyrGlu: 2.268 ± 0.577
TyrPhe: 1.867 ± 0.467
TyrGly: 2.801 ± 0.496
TyrHis: 0.4 ± 0.268
TyrIle: 2.401 ± 0.728
TyrLys: 2.268 ± 0.405
TyrLeu: 2.001 ± 0.499
TyrMet: 0.8 ± 0.312
TyrAsn: 1.067 ± 0.387
TyrPro: 0.8 ± 0.415
TyrGln: 1.334 ± 0.384
TyrArg: 1.2 ± 0.404
TyrSer: 2.134 ± 0.578
TyrThr: 1.734 ± 0.533
TyrVal: 2.001 ± 0.492
TyrTrp: 0.4 ± 0.268
TyrTyr: 0.934 ± 0.421
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 42 proteins (7498 amino acids)
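
The table entries can also be read programmatically. The sketch below parses cells of the form "AlaAla: 8.67 ± 1.587" and labels each one as over- or underrepresented. The names `parse_cells` and `classify` are hypothetical, and using 2.5‰ (the uniform expectation over 400 dipeptides) as the cut-off is an assumption; the page does not state the exact threshold behind its red and blue marking.

```python
import re

# Matches a cell such as "AlaAla: 8.67 ± 1.587". Using search() means any
# text in front of the dipeptide name on the same line is simply ignored.
CELL = re.compile(
    r"([A-Z][a-z]{2})([A-Z][a-z]{2}): (\d+(?:\.\d+)?) ± (\d+(?:\.\d+)?)"
)

# Assumed cut-off: uniform expectation over 400 dipeptides, in per mille.
EXPECTED_PERMILLE = 2.5


def parse_cells(lines):
    """Return {(first, second): (value, error)} for every matching line."""
    cells = {}
    for line in lines:
        match = CELL.search(line)
        if match:
            first, second, value, error = match.groups()
            cells[(first, second)] = (float(value), float(error))
    return cells


def classify(value, expected=EXPECTED_PERMILLE):
    """Label a per mille value relative to the assumed expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


# Three cells copied from the Ala row of the table, plus a row label.
sample = [
    "Ala",
    "AlaAla: 8.67 ± 1.587",
    "AlaCys: 0.4 ± 0.246",
    "AlaLeu: 8.003 ± 0.827",
]
cells = parse_cells(sample)
for (first, second), (value, error) in cells.items():
    print(f"{first}{second}: {value} ± {error} -> {classify(value)}")
```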

Note: The error has been estimated by bootstrapping (×100) at the protein level.
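
A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the 100 estimates is reported. This is a minimal illustration rather than the Proteome-pI code; the function names, the toy sequences, the fixed seed, and the use of the sample standard deviation as the error are all assumptions.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins, pair):
    """Per mille frequency of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Return (mean, standard deviation) over protein-level resamples.

    Each resample draws len(proteins) whole proteins with replacement,
    so residues from the same protein always stay together.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(resample, pair))
    return statistics.mean(estimates), statistics.stdev(estimates)


# Made-up one-letter sequences, not taken from the phage proteome.
toy_proteins = ["MAAKL", "MAKA", "AAAA", "MKLV"]
mean, error = bootstrap_error(toy_proteins, "AA")
print(f"AA: {mean:.1f} ± {error:.1f} per mille")
```

Resampling whole proteins instead of individual dipeptides keeps within-protein composition intact, which is why proteomes with few proteins tend to show larger errors.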

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski