Amino acid dipepetide frequency for Brevibacillus phage Powder

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.246AlaAla: 2.246 ± 0.353
0.811AlaCys: 0.811 ± 0.216
3.681AlaAsp: 3.681 ± 0.468
5.178AlaGlu: 5.178 ± 0.63
2.87AlaPhe: 2.87 ± 0.378
3.244AlaGly: 3.244 ± 0.391
0.749AlaHis: 0.749 ± 0.261
4.617AlaIle: 4.617 ± 0.455
5.74AlaLys: 5.74 ± 0.473
4.804AlaLeu: 4.804 ± 0.599
1.996AlaMet: 1.996 ± 0.369
2.932AlaAsn: 2.932 ± 0.439
1.747AlaPro: 1.747 ± 0.499
2.558AlaGln: 2.558 ± 0.379
2.87AlaArg: 2.87 ± 0.47
2.683AlaSer: 2.683 ± 0.441
2.87AlaThr: 2.87 ± 0.463
3.868AlaVal: 3.868 ± 0.493
1.248AlaTrp: 1.248 ± 0.304
2.308AlaTyr: 2.308 ± 0.311
0.0AlaXaa: 0.0 ± 0.0
Cys
0.25CysAla: 0.25 ± 0.138
0.25CysCys: 0.25 ± 0.155
0.624CysAsp: 0.624 ± 0.201
0.998CysGlu: 0.998 ± 0.276
0.437CysPhe: 0.437 ± 0.146
0.499CysGly: 0.499 ± 0.227
0.187CysHis: 0.187 ± 0.116
0.811CysIle: 0.811 ± 0.203
0.437CysLys: 0.437 ± 0.157
0.312CysLeu: 0.312 ± 0.121
0.0CysMet: 0.0 ± 0.0
0.624CysAsn: 0.624 ± 0.219
0.25CysPro: 0.25 ± 0.122
0.187CysGln: 0.187 ± 0.116
0.624CysArg: 0.624 ± 0.177
0.437CysSer: 0.437 ± 0.182
0.561CysThr: 0.561 ± 0.198
0.561CysVal: 0.561 ± 0.162
0.187CysTrp: 0.187 ± 0.142
0.499CysTyr: 0.499 ± 0.169
0.0CysXaa: 0.0 ± 0.0
Asp
3.307AspAla: 3.307 ± 0.49
0.437AspCys: 0.437 ± 0.186
3.556AspAsp: 3.556 ± 0.5
5.365AspGlu: 5.365 ± 0.436
2.745AspPhe: 2.745 ± 0.464
3.93AspGly: 3.93 ± 0.441
0.998AspHis: 0.998 ± 0.221
4.804AspIle: 4.804 ± 0.549
5.49AspLys: 5.49 ± 0.631
4.554AspLeu: 4.554 ± 0.503
1.747AspMet: 1.747 ± 0.276
2.932AspAsn: 2.932 ± 0.39
2.495AspPro: 2.495 ± 0.436
1.684AspGln: 1.684 ± 0.332
2.184AspArg: 2.184 ± 0.303
4.305AspSer: 4.305 ± 0.507
2.683AspThr: 2.683 ± 0.419
3.618AspVal: 3.618 ± 0.55
0.811AspTrp: 0.811 ± 0.194
2.995AspTyr: 2.995 ± 0.384
0.0AspXaa: 0.0 ± 0.0
Glu
4.305GluAla: 4.305 ± 0.399
0.936GluCys: 0.936 ± 0.276
3.806GluAsp: 3.806 ± 0.475
7.174GluGlu: 7.174 ± 0.806
3.806GluPhe: 3.806 ± 0.476
4.492GluGly: 4.492 ± 0.442
1.684GluHis: 1.684 ± 0.325
6.613GluIle: 6.613 ± 0.614
7.798GluLys: 7.798 ± 0.659
6.738GluLeu: 6.738 ± 0.739
2.433GluMet: 2.433 ± 0.299
2.495GluAsn: 2.495 ± 0.383
1.622GluPro: 1.622 ± 0.329
4.617GluGln: 4.617 ± 0.67
5.241GluArg: 5.241 ± 0.605
4.118GluSer: 4.118 ± 0.492
4.679GluThr: 4.679 ± 0.556
5.927GluVal: 5.927 ± 0.569
1.248GluTrp: 1.248 ± 0.301
2.371GluTyr: 2.371 ± 0.429
0.0GluXaa: 0.0 ± 0.0
Phe
2.308PheAla: 2.308 ± 0.339
0.25PheCys: 0.25 ± 0.131
3.431PheAsp: 3.431 ± 0.399
3.494PheGlu: 3.494 ± 0.476
1.622PhePhe: 1.622 ± 0.333
2.62PheGly: 2.62 ± 0.337
0.561PheHis: 0.561 ± 0.185
3.182PheIle: 3.182 ± 0.473
3.307PheLys: 3.307 ± 0.503
4.18PheLeu: 4.18 ± 0.529
0.811PheMet: 0.811 ± 0.21
1.497PheAsn: 1.497 ± 0.404
0.873PhePro: 0.873 ± 0.239
1.747PheGln: 1.747 ± 0.284
1.622PheArg: 1.622 ± 0.29
2.495PheSer: 2.495 ± 0.374
2.121PheThr: 2.121 ± 0.362
2.87PheVal: 2.87 ± 0.433
0.624PheTrp: 0.624 ± 0.174
1.435PheTyr: 1.435 ± 0.37
0.0PheXaa: 0.0 ± 0.0
Gly
3.993GlyAla: 3.993 ± 0.608
0.312GlyCys: 0.312 ± 0.15
3.93GlyAsp: 3.93 ± 0.575
4.118GlyGlu: 4.118 ± 0.401
2.433GlyPhe: 2.433 ± 0.441
3.244GlyGly: 3.244 ± 0.552
0.749GlyHis: 0.749 ± 0.218
4.242GlyIle: 4.242 ± 0.478
5.802GlyLys: 5.802 ± 0.539
4.305GlyLeu: 4.305 ± 0.542
1.809GlyMet: 1.809 ± 0.289
3.743GlyAsn: 3.743 ± 0.512
0.873GlyPro: 0.873 ± 0.205
1.809GlyGln: 1.809 ± 0.367
3.369GlyArg: 3.369 ± 0.475
3.494GlySer: 3.494 ± 0.416
3.618GlyThr: 3.618 ± 0.47
5.116GlyVal: 5.116 ± 0.647
0.873GlyTrp: 0.873 ± 0.253
2.807GlyTyr: 2.807 ± 0.393
0.0GlyXaa: 0.0 ± 0.0
His
0.811HisAla: 0.811 ± 0.189
0.062HisCys: 0.062 ± 0.053
0.998HisAsp: 0.998 ± 0.241
1.185HisGlu: 1.185 ± 0.262
0.749HisPhe: 0.749 ± 0.206
0.811HisGly: 0.811 ± 0.221
0.312HisHis: 0.312 ± 0.162
1.248HisIle: 1.248 ± 0.293
0.936HisLys: 0.936 ± 0.273
0.998HisLeu: 0.998 ± 0.264
0.062HisMet: 0.062 ± 0.05
0.998HisAsn: 0.998 ± 0.227
0.811HisPro: 0.811 ± 0.196
0.499HisGln: 0.499 ± 0.194
0.561HisArg: 0.561 ± 0.2
0.686HisSer: 0.686 ± 0.179
1.185HisThr: 1.185 ± 0.231
1.248HisVal: 1.248 ± 0.279
0.312HisTrp: 0.312 ± 0.123
1.185HisTyr: 1.185 ± 0.297
0.0HisXaa: 0.0 ± 0.0
Ile
5.49IleAla: 5.49 ± 0.666
0.561IleCys: 0.561 ± 0.18
4.367IleAsp: 4.367 ± 0.464
6.613IleGlu: 6.613 ± 0.619
2.184IlePhe: 2.184 ± 0.368
4.055IleGly: 4.055 ± 0.486
1.497IleHis: 1.497 ± 0.275
3.868IleIle: 3.868 ± 0.493
6.426IleLys: 6.426 ± 0.456
5.552IleLeu: 5.552 ± 0.568
1.373IleMet: 1.373 ± 0.309
4.118IleAsn: 4.118 ± 0.544
2.807IlePro: 2.807 ± 0.458
2.62IleGln: 2.62 ± 0.361
3.681IleArg: 3.681 ± 0.461
5.303IleSer: 5.303 ± 0.476
4.367IleThr: 4.367 ± 0.428
5.241IleVal: 5.241 ± 0.524
0.499IleTrp: 0.499 ± 0.16
3.431IleTyr: 3.431 ± 0.624
0.0IleXaa: 0.0 ± 0.0
Lys
5.303LysAla: 5.303 ± 0.58
0.873LysCys: 0.873 ± 0.225
4.055LysAsp: 4.055 ± 0.501
7.674LysGlu: 7.674 ± 0.701
2.558LysPhe: 2.558 ± 0.406
5.74LysGly: 5.74 ± 0.581
1.56LysHis: 1.56 ± 0.339
6.052LysIle: 6.052 ± 0.525
7.736LysLys: 7.736 ± 0.709
6.675LysLeu: 6.675 ± 0.544
3.119LysMet: 3.119 ± 0.465
4.741LysAsn: 4.741 ± 0.66
2.62LysPro: 2.62 ± 0.293
4.242LysGln: 4.242 ± 0.554
4.617LysArg: 4.617 ± 0.536
4.242LysSer: 4.242 ± 0.628
4.741LysThr: 4.741 ± 0.595
5.178LysVal: 5.178 ± 0.566
0.936LysTrp: 0.936 ± 0.224
2.995LysTyr: 2.995 ± 0.4
0.0LysXaa: 0.0 ± 0.0
Leu
6.488LeuAla: 6.488 ± 0.704
0.873LeuCys: 0.873 ± 0.231
5.864LeuAsp: 5.864 ± 0.657
6.925LeuGlu: 6.925 ± 0.69
3.868LeuPhe: 3.868 ± 0.47
4.367LeuGly: 4.367 ± 0.432
0.936LeuHis: 0.936 ± 0.263
5.927LeuIle: 5.927 ± 0.657
6.052LeuLys: 6.052 ± 0.622
6.738LeuLeu: 6.738 ± 0.582
2.121LeuMet: 2.121 ± 0.356
4.617LeuAsn: 4.617 ± 0.454
2.308LeuPro: 2.308 ± 0.309
3.244LeuGln: 3.244 ± 0.405
3.431LeuArg: 3.431 ± 0.469
5.864LeuSer: 5.864 ± 0.596
4.492LeuThr: 4.492 ± 0.515
4.305LeuVal: 4.305 ± 0.416
0.811LeuTrp: 0.811 ± 0.202
2.807LeuTyr: 2.807 ± 0.401
0.0LeuXaa: 0.0 ± 0.0
Met
1.872MetAla: 1.872 ± 0.341
0.187MetCys: 0.187 ± 0.103
1.31MetAsp: 1.31 ± 0.234
1.872MetGlu: 1.872 ± 0.379
1.185MetPhe: 1.185 ± 0.332
1.435MetGly: 1.435 ± 0.263
0.187MetHis: 0.187 ± 0.094
1.497MetIle: 1.497 ± 0.356
2.807MetLys: 2.807 ± 0.437
1.56MetLeu: 1.56 ± 0.297
1.185MetMet: 1.185 ± 0.251
1.497MetAsn: 1.497 ± 0.261
1.061MetPro: 1.061 ± 0.348
1.185MetGln: 1.185 ± 0.252
1.622MetArg: 1.622 ± 0.291
2.184MetSer: 2.184 ± 0.304
1.747MetThr: 1.747 ± 0.336
1.497MetVal: 1.497 ± 0.28
0.062MetTrp: 0.062 ± 0.055
0.998MetTyr: 0.998 ± 0.26
0.0MetXaa: 0.0 ± 0.0
Asn
2.246AsnAla: 2.246 ± 0.579
0.374AsnCys: 0.374 ± 0.129
2.246AsnAsp: 2.246 ± 0.399
4.055AsnGlu: 4.055 ± 0.487
1.809AsnPhe: 1.809 ± 0.29
3.743AsnGly: 3.743 ± 0.547
0.998AsnHis: 0.998 ± 0.246
3.618AsnIle: 3.618 ± 0.551
4.991AsnLys: 4.991 ± 0.564
4.429AsnLeu: 4.429 ± 0.493
1.123AsnMet: 1.123 ± 0.269
2.62AsnAsn: 2.62 ± 0.544
2.371AsnPro: 2.371 ± 0.573
1.747AsnGln: 1.747 ± 0.321
2.683AsnArg: 2.683 ± 0.448
2.807AsnSer: 2.807 ± 0.458
2.87AsnThr: 2.87 ± 0.467
2.558AsnVal: 2.558 ± 0.338
0.686AsnTrp: 0.686 ± 0.185
2.308AsnTyr: 2.308 ± 0.502
0.0AsnXaa: 0.0 ± 0.0
Pro
1.56ProAla: 1.56 ± 0.323
0.187ProCys: 0.187 ± 0.112
2.683ProAsp: 2.683 ± 0.387
2.059ProGlu: 2.059 ± 0.4
1.747ProPhe: 1.747 ± 0.332
1.248ProGly: 1.248 ± 0.311
0.437ProHis: 0.437 ± 0.163
2.495ProIle: 2.495 ± 0.287
1.996ProLys: 1.996 ± 0.437
2.059ProLeu: 2.059 ± 0.304
0.374ProMet: 0.374 ± 0.166
2.495ProAsn: 2.495 ± 0.632
0.811ProPro: 0.811 ± 0.249
0.998ProGln: 0.998 ± 0.236
1.435ProArg: 1.435 ± 0.322
2.371ProSer: 2.371 ± 0.369
1.996ProThr: 1.996 ± 0.364
1.56ProVal: 1.56 ± 0.294
0.437ProTrp: 0.437 ± 0.166
1.123ProTyr: 1.123 ± 0.257
0.0ProXaa: 0.0 ± 0.0
Gln
2.87GlnAla: 2.87 ± 0.472
0.187GlnCys: 0.187 ± 0.107
1.872GlnAsp: 1.872 ± 0.391
2.932GlnGlu: 2.932 ± 0.402
1.123GlnPhe: 1.123 ± 0.273
2.184GlnGly: 2.184 ± 0.38
0.749GlnHis: 0.749 ± 0.236
3.806GlnIle: 3.806 ± 0.518
2.932GlnLys: 2.932 ± 0.436
4.929GlnLeu: 4.929 ± 0.487
0.936GlnMet: 0.936 ± 0.239
2.308GlnAsn: 2.308 ± 0.407
1.061GlnPro: 1.061 ± 0.21
1.622GlnGln: 1.622 ± 0.429
1.747GlnArg: 1.747 ± 0.319
1.872GlnSer: 1.872 ± 0.372
2.495GlnThr: 2.495 ± 0.392
2.371GlnVal: 2.371 ± 0.332
0.374GlnTrp: 0.374 ± 0.173
1.373GlnTyr: 1.373 ± 0.314
0.0GlnXaa: 0.0 ± 0.0
Arg
2.246ArgAla: 2.246 ± 0.42
0.686ArgCys: 0.686 ± 0.211
1.872ArgAsp: 1.872 ± 0.33
4.492ArgGlu: 4.492 ± 0.585
2.121ArgPhe: 2.121 ± 0.288
3.119ArgGly: 3.119 ± 0.389
0.936ArgHis: 0.936 ± 0.288
3.93ArgIle: 3.93 ± 0.428
4.991ArgLys: 4.991 ± 0.652
3.868ArgLeu: 3.868 ± 0.586
1.934ArgMet: 1.934 ± 0.342
1.622ArgAsn: 1.622 ± 0.275
1.123ArgPro: 1.123 ± 0.261
1.872ArgGln: 1.872 ± 0.374
1.809ArgArg: 1.809 ± 0.37
2.308ArgSer: 2.308 ± 0.401
2.995ArgThr: 2.995 ± 0.396
3.057ArgVal: 3.057 ± 0.508
0.561ArgTrp: 0.561 ± 0.178
2.121ArgTyr: 2.121 ± 0.349
0.0ArgXaa: 0.0 ± 0.0
Ser
2.495SerAla: 2.495 ± 0.432
0.187SerCys: 0.187 ± 0.1
4.429SerAsp: 4.429 ± 0.531
4.305SerGlu: 4.305 ± 0.505
3.806SerPhe: 3.806 ± 0.643
3.494SerGly: 3.494 ± 0.332
0.873SerHis: 0.873 ± 0.232
4.679SerIle: 4.679 ± 0.486
5.116SerLys: 5.116 ± 0.611
5.615SerLeu: 5.615 ± 0.492
1.56SerMet: 1.56 ± 0.333
2.433SerAsn: 2.433 ± 0.363
1.872SerPro: 1.872 ± 0.393
2.184SerGln: 2.184 ± 0.299
2.059SerArg: 2.059 ± 0.289
4.118SerSer: 4.118 ± 0.456
3.93SerThr: 3.93 ± 0.461
3.556SerVal: 3.556 ± 0.395
0.749SerTrp: 0.749 ± 0.194
2.308SerTyr: 2.308 ± 0.392
0.0SerXaa: 0.0 ± 0.0
Thr
3.743ThrAla: 3.743 ± 0.579
0.437ThrCys: 0.437 ± 0.152
3.494ThrAsp: 3.494 ± 0.444
4.18ThrGlu: 4.18 ± 0.456
2.059ThrPhe: 2.059 ± 0.324
4.305ThrGly: 4.305 ± 0.395
0.998ThrHis: 0.998 ± 0.172
3.93ThrIle: 3.93 ± 0.505
4.679ThrLys: 4.679 ± 0.593
5.677ThrLeu: 5.677 ± 0.582
1.31ThrMet: 1.31 ± 0.288
2.87ThrAsn: 2.87 ± 0.441
2.121ThrPro: 2.121 ± 0.308
2.059ThrGln: 2.059 ± 0.375
1.872ThrArg: 1.872 ± 0.339
3.431ThrSer: 3.431 ± 0.489
2.995ThrThr: 2.995 ± 0.323
3.868ThrVal: 3.868 ± 0.54
0.811ThrTrp: 0.811 ± 0.206
2.87ThrTyr: 2.87 ± 0.364
0.0ThrXaa: 0.0 ± 0.0
Val
4.741ValAla: 4.741 ± 0.602
0.437ValCys: 0.437 ± 0.147
4.554ValAsp: 4.554 ± 0.49
4.429ValGlu: 4.429 ± 0.576
1.747ValPhe: 1.747 ± 0.394
4.242ValGly: 4.242 ± 0.515
0.561ValHis: 0.561 ± 0.171
5.178ValIle: 5.178 ± 0.593
4.242ValLys: 4.242 ± 0.532
4.18ValLeu: 4.18 ± 0.505
1.747ValMet: 1.747 ± 0.302
3.369ValAsn: 3.369 ± 0.395
1.809ValPro: 1.809 ± 0.273
2.807ValGln: 2.807 ± 0.409
3.369ValArg: 3.369 ± 0.418
4.118ValSer: 4.118 ± 0.42
4.242ValThr: 4.242 ± 0.579
4.429ValVal: 4.429 ± 0.557
0.624ValTrp: 0.624 ± 0.203
3.556ValTyr: 3.556 ± 0.575
0.0ValXaa: 0.0 ± 0.0
Trp
0.374TrpAla: 0.374 ± 0.169
0.125TrpCys: 0.125 ± 0.081
0.624TrpAsp: 0.624 ± 0.191
1.185TrpGlu: 1.185 ± 0.225
0.437TrpPhe: 0.437 ± 0.156
0.873TrpGly: 0.873 ± 0.224
0.187TrpHis: 0.187 ± 0.112
0.624TrpIle: 0.624 ± 0.196
1.622TrpLys: 1.622 ± 0.303
1.435TrpLeu: 1.435 ± 0.277
0.25TrpMet: 0.25 ± 0.111
0.25TrpAsn: 0.25 ± 0.102
0.25TrpPro: 0.25 ± 0.133
0.437TrpGln: 0.437 ± 0.147
0.686TrpArg: 0.686 ± 0.222
0.561TrpSer: 0.561 ± 0.189
1.123TrpThr: 1.123 ± 0.249
0.936TrpVal: 0.936 ± 0.243
0.312TrpTrp: 0.312 ± 0.185
0.25TrpTyr: 0.25 ± 0.11
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.371TyrAla: 2.371 ± 0.321
0.561TyrCys: 0.561 ± 0.232
3.244TyrAsp: 3.244 ± 0.417
3.868TyrGlu: 3.868 ± 0.591
1.747TyrPhe: 1.747 ± 0.351
2.995TyrGly: 2.995 ± 0.496
0.499TyrHis: 0.499 ± 0.163
2.932TyrIle: 2.932 ± 0.412
2.558TyrLys: 2.558 ± 0.395
3.494TyrLeu: 3.494 ± 0.436
0.998TyrMet: 0.998 ± 0.237
2.121TyrAsn: 2.121 ± 0.384
1.185TyrPro: 1.185 ± 0.297
1.622TyrGln: 1.622 ± 0.255
2.184TyrArg: 2.184 ± 0.396
2.371TyrSer: 2.371 ± 0.43
1.996TyrThr: 1.996 ± 0.376
2.62TyrVal: 2.62 ± 0.401
0.374TyrTrp: 0.374 ± 0.154
1.56TyrTyr: 1.56 ± 0.325
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 103 proteins (16030 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski