Amino acid dipeptide frequency for Brochothrix phage BL3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones are marked in blue in the table.
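The counting behind such a table can be sketched in a few lines of Python. This is a minimal illustration, not the Proteome-pI implementation: the function name `dipeptide_per_mille`, the toy sequences, and the choice to normalise by the total number of overlapping pairs are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs.

    Assumption: frequencies are normalised by the total number of pairs
    counted; the database may use a different denominator.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


# Hypothetical toy sequences, not taken from the BL3 proteome.
toy_proteins = ["MKKLAE", "MAKKE"]
freqs = dipeptide_per_mille(toy_proteins)

# Uniform expectation: 20 x 20 = 400 dipeptides, so 1000 / 400 = 2.5 per mille.
n_possible = len(list(product(AMINO_ACIDS, repeat=2)))
expected = 1000.0 / n_possible

for dp, f in sorted(freqs.items()):
    label = "over" if f > expected else "under"
    print(f"{dp}: {f:.1f} per mille ({label}-represented vs {expected} per mille)")
```

With only nine pairs in the toy input, every observed dipeptide is far above 2.5‰; the comparison becomes meaningful only for a full proteome.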

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
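As a small worked example of the unit conversion, the sketch below turns the AlaAla value from the table into a fraction and a percentage. The helper name `per_mille_to_fraction` is hypothetical, and the final step assumes, purely for illustration, that the denominator is the 13264 amino acids quoted in the statistics line; the page itself does not state the denominator.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


ala_ala = 4.599  # AlaAla value from the table, in per mille

fraction = per_mille_to_fraction(ala_ala)  # about 0.004599
percent = fraction * 100                   # about 0.46 %

# Assumption for illustration only: normalisation by 13264 amino acids.
approx_count = fraction * 13264

print(f"fraction = {fraction:.6f}, percent = {percent:.4f} %")
print(f"approximate AlaAla occurrences = {approx_count:.1f}")
```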

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.599 ± 0.722
AlaCys: 0.603 ± 0.29
AlaAsp: 4.373 ± 0.708
AlaGlu: 4.675 ± 0.777
AlaPhe: 2.187 ± 0.375
AlaGly: 4.298 ± 0.705
AlaHis: 0.829 ± 0.218
AlaIle: 5.052 ± 0.509
AlaLys: 6.56 ± 0.909
AlaLeu: 5.202 ± 0.712
AlaMet: 2.262 ± 0.477
AlaAsn: 3.996 ± 0.806
AlaPro: 1.508 ± 0.362
AlaGln: 3.317 ± 0.579
AlaArg: 1.81 ± 0.304
AlaSer: 3.619 ± 0.77
AlaThr: 4.222 ± 0.545
AlaVal: 4.976 ± 0.71
AlaTrp: 1.131 ± 0.338
AlaTyr: 2.79 ± 0.627
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.452 ± 0.204
CysCys: 0.0 ± 0.0
CysAsp: 0.679 ± 0.268
CysGlu: 0.377 ± 0.17
CysPhe: 0.226 ± 0.123
CysGly: 0.829 ± 0.292
CysHis: 0.0 ± 0.0
CysIle: 0.528 ± 0.257
CysLys: 0.829 ± 0.281
CysLeu: 0.829 ± 0.336
CysMet: 0.151 ± 0.126
CysAsn: 0.302 ± 0.149
CysPro: 0.151 ± 0.102
CysGln: 0.226 ± 0.148
CysArg: 0.452 ± 0.176
CysSer: 0.528 ± 0.259
CysThr: 0.302 ± 0.154
CysVal: 0.075 ± 0.08
CysTrp: 0.0 ± 0.0
CysTyr: 0.226 ± 0.127
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.222 ± 0.601
AspCys: 0.226 ± 0.146
AspAsp: 3.77 ± 0.506
AspGlu: 4.976 ± 0.658
AspPhe: 3.393 ± 0.517
AspGly: 5.052 ± 0.759
AspHis: 0.754 ± 0.237
AspIle: 4.222 ± 0.585
AspLys: 7.163 ± 0.727
AspLeu: 4.524 ± 0.576
AspMet: 1.206 ± 0.287
AspAsn: 4.147 ± 0.487
AspPro: 1.81 ± 0.339
AspGln: 0.679 ± 0.233
AspArg: 1.734 ± 0.442
AspSer: 3.694 ± 0.401
AspThr: 4.071 ± 0.432
AspVal: 3.77 ± 0.423
AspTrp: 0.905 ± 0.265
AspTyr: 3.016 ± 0.415
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.901 ± 0.806
GluCys: 0.528 ± 0.249
GluAsp: 3.77 ± 0.634
GluGlu: 4.373 ± 0.787
GluPhe: 2.941 ± 0.495
GluGly: 5.052 ± 0.818
GluHis: 0.829 ± 0.295
GluIle: 5.655 ± 0.528
GluLys: 6.635 ± 0.744
GluLeu: 8.369 ± 0.968
GluMet: 2.79 ± 0.621
GluAsn: 2.941 ± 0.433
GluPro: 1.357 ± 0.37
GluGln: 3.694 ± 0.821
GluArg: 3.167 ± 0.641
GluSer: 5.353 ± 0.678
GluThr: 3.77 ± 0.551
GluVal: 4.675 ± 0.628
GluTrp: 0.829 ± 0.186
GluTyr: 2.337 ± 0.502
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.564 ± 0.501
PheCys: 0.226 ± 0.141
PheAsp: 2.865 ± 0.538
PheGlu: 2.941 ± 0.495
PhePhe: 1.508 ± 0.392
PheGly: 3.016 ± 0.505
PheHis: 0.452 ± 0.215
PheIle: 2.865 ± 0.454
PheLys: 3.845 ± 0.501
PheLeu: 2.111 ± 0.353
PheMet: 1.357 ± 0.272
PheAsn: 2.111 ± 0.554
PhePro: 0.98 ± 0.226
PheGln: 0.679 ± 0.257
PheArg: 1.206 ± 0.313
PheSer: 2.639 ± 0.411
PheThr: 2.413 ± 0.427
PheVal: 2.036 ± 0.354
PheTrp: 0.226 ± 0.136
PheTyr: 1.583 ± 0.393
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.921 ± 0.35
GlyCys: 0.302 ± 0.175
GlyAsp: 3.544 ± 0.704
GlyGlu: 5.806 ± 0.682
GlyPhe: 2.337 ± 0.413
GlyGly: 4.222 ± 0.607
GlyHis: 0.528 ± 0.199
GlyIle: 4.675 ± 0.766
GlyLys: 5.73 ± 0.926
GlyLeu: 5.353 ± 0.699
GlyMet: 1.583 ± 0.505
GlyAsn: 4.599 ± 0.771
GlyPro: 1.206 ± 0.258
GlyGln: 2.337 ± 0.446
GlyArg: 2.413 ± 0.483
GlySer: 3.694 ± 0.485
GlyThr: 3.619 ± 0.617
GlyVal: 5.353 ± 0.737
GlyTrp: 0.603 ± 0.177
GlyTyr: 2.639 ± 0.401
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.754 ± 0.224
HisCys: 0.302 ± 0.146
HisAsp: 0.603 ± 0.247
HisGlu: 0.452 ± 0.182
HisPhe: 0.452 ± 0.171
HisGly: 0.829 ± 0.235
HisHis: 0.452 ± 0.272
HisIle: 0.905 ± 0.333
HisLys: 1.206 ± 0.311
HisLeu: 1.131 ± 0.387
HisMet: 0.528 ± 0.156
HisAsn: 1.206 ± 0.348
HisPro: 0.452 ± 0.18
HisGln: 0.603 ± 0.196
HisArg: 0.679 ± 0.273
HisSer: 0.905 ± 0.284
HisThr: 0.603 ± 0.268
HisVal: 0.829 ± 0.272
HisTrp: 0.151 ± 0.096
HisTyr: 0.754 ± 0.28
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.183 ± 0.789
IleCys: 0.679 ± 0.255
IleAsp: 5.73 ± 0.703
IleGlu: 6.107 ± 0.706
IlePhe: 1.96 ± 0.441
IleGly: 4.599 ± 0.696
IleHis: 0.98 ± 0.247
IleIle: 4.524 ± 0.739
IleLys: 7.615 ± 0.683
IleLeu: 4.448 ± 0.727
IleMet: 1.433 ± 0.361
IleAsn: 5.127 ± 0.776
IlePro: 1.583 ± 0.369
IleGln: 2.79 ± 0.476
IleArg: 1.659 ± 0.362
IleSer: 3.845 ± 0.688
IleThr: 3.996 ± 0.654
IleVal: 4.298 ± 0.664
IleTrp: 0.679 ± 0.192
IleTyr: 2.337 ± 0.453
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.238 ± 0.804
LysCys: 0.603 ± 0.212
LysAsp: 4.976 ± 0.643
LysGlu: 8.143 ± 0.835
LysPhe: 3.468 ± 0.437
LysGly: 6.032 ± 0.629
LysHis: 1.885 ± 0.366
LysIle: 5.202 ± 0.64
LysLys: 8.068 ± 1.11
LysLeu: 7.464 ± 0.758
LysMet: 2.941 ± 0.413
LysAsn: 5.353 ± 0.578
LysPro: 2.337 ± 0.591
LysGln: 3.845 ± 0.638
LysArg: 4.147 ± 0.803
LysSer: 6.183 ± 0.54
LysThr: 6.258 ± 0.853
LysVal: 7.389 ± 0.691
LysTrp: 1.282 ± 0.423
LysTyr: 3.317 ± 0.437
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.504 ± 0.775
LeuCys: 0.603 ± 0.206
LeuAsp: 5.655 ± 0.72
LeuGlu: 6.032 ± 0.729
LeuPhe: 3.091 ± 0.532
LeuGly: 4.147 ± 0.549
LeuHis: 1.206 ± 0.292
LeuIle: 4.976 ± 0.602
LeuLys: 8.294 ± 0.786
LeuLeu: 7.238 ± 0.905
LeuMet: 2.262 ± 0.531
LeuAsn: 4.298 ± 0.604
LeuPro: 2.488 ± 0.361
LeuGln: 3.393 ± 0.493
LeuArg: 2.865 ± 0.459
LeuSer: 6.333 ± 0.655
LeuThr: 4.373 ± 0.494
LeuVal: 4.675 ± 0.684
LeuTrp: 0.528 ± 0.206
LeuTyr: 2.036 ± 0.379
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.433 ± 0.323
MetCys: 0.0 ± 0.0
MetAsp: 1.433 ± 0.447
MetGlu: 1.433 ± 0.323
MetPhe: 0.679 ± 0.217
MetGly: 1.433 ± 0.287
MetHis: 0.528 ± 0.22
MetIle: 2.488 ± 0.396
MetLys: 2.187 ± 0.484
MetLeu: 1.508 ± 0.37
MetMet: 0.528 ± 0.208
MetAsn: 1.508 ± 0.375
MetPro: 1.056 ± 0.365
MetGln: 1.508 ± 0.369
MetArg: 1.734 ± 0.38
MetSer: 1.734 ± 0.379
MetThr: 1.96 ± 0.432
MetVal: 1.659 ± 0.321
MetTrp: 0.151 ± 0.106
MetTyr: 0.905 ± 0.305
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.694 ± 0.554
AsnCys: 0.377 ± 0.232
AsnAsp: 3.544 ± 0.454
AsnGlu: 3.468 ± 0.501
AsnPhe: 1.81 ± 0.341
AsnGly: 3.77 ± 0.507
AsnHis: 0.452 ± 0.193
AsnIle: 5.052 ± 0.636
AsnLys: 6.409 ± 0.844
AsnLeu: 4.448 ± 0.69
AsnMet: 1.583 ± 0.248
AsnAsn: 2.639 ± 0.482
AsnPro: 2.036 ± 0.445
AsnGln: 1.81 ± 0.27
AsnArg: 2.187 ± 0.445
AsnSer: 3.996 ± 0.752
AsnThr: 3.544 ± 0.538
AsnVal: 3.77 ± 0.509
AsnTrp: 0.754 ± 0.261
AsnTyr: 2.262 ± 0.406
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.111 ± 0.487
ProCys: 0.377 ± 0.179
ProAsp: 1.433 ± 0.372
ProGlu: 2.111 ± 0.339
ProPhe: 1.508 ± 0.346
ProGly: 1.508 ± 0.271
ProHis: 0.603 ± 0.261
ProIle: 1.508 ± 0.351
ProLys: 2.337 ± 0.36
ProLeu: 2.111 ± 0.444
ProMet: 0.452 ± 0.131
ProAsn: 1.508 ± 0.394
ProPro: 0.98 ± 0.398
ProGln: 0.829 ± 0.297
ProArg: 1.056 ± 0.343
ProSer: 1.734 ± 0.395
ProThr: 1.131 ± 0.391
ProVal: 1.885 ± 0.311
ProTrp: 0.452 ± 0.179
ProTyr: 0.98 ± 0.255
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.639 ± 0.393
GlnCys: 0.0 ± 0.0
GlnAsp: 1.81 ± 0.356
GlnGlu: 2.564 ± 0.468
GlnPhe: 1.206 ± 0.275
GlnGly: 2.413 ± 0.43
GlnHis: 0.528 ± 0.171
GlnIle: 2.941 ± 0.456
GlnLys: 3.242 ± 0.54
GlnLeu: 3.393 ± 0.429
GlnMet: 0.829 ± 0.311
GlnAsn: 2.262 ± 0.378
GlnPro: 0.829 ± 0.289
GlnGln: 2.111 ± 0.479
GlnArg: 1.508 ± 0.341
GlnSer: 3.619 ± 0.56
GlnThr: 2.187 ± 0.392
GlnVal: 2.262 ± 0.318
GlnTrp: 0.151 ± 0.092
GlnTyr: 1.131 ± 0.317
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.639 ± 0.501
ArgCys: 0.226 ± 0.136
ArgAsp: 1.583 ± 0.327
ArgGlu: 2.488 ± 0.653
ArgPhe: 1.056 ± 0.278
ArgGly: 2.488 ± 0.557
ArgHis: 0.377 ± 0.158
ArgIle: 2.564 ± 0.463
ArgLys: 3.016 ± 0.687
ArgLeu: 4.222 ± 0.637
ArgMet: 1.282 ± 0.296
ArgAsn: 2.036 ± 0.44
ArgPro: 0.679 ± 0.316
ArgGln: 1.659 ± 0.373
ArgArg: 1.433 ± 0.374
ArgSer: 1.508 ± 0.337
ArgThr: 2.337 ± 0.422
ArgVal: 2.639 ± 0.48
ArgTrp: 0.226 ± 0.132
ArgTyr: 2.111 ± 0.52
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.599 ± 0.7
SerCys: 0.302 ± 0.136
SerAsp: 4.524 ± 0.55
SerGlu: 4.675 ± 0.518
SerPhe: 2.337 ± 0.378
SerGly: 5.353 ± 0.753
SerHis: 0.829 ± 0.249
SerIle: 6.258 ± 0.508
SerLys: 5.353 ± 0.573
SerLeu: 3.544 ± 0.491
SerMet: 1.885 ± 0.377
SerAsn: 3.921 ± 0.679
SerPro: 1.131 ± 0.268
SerGln: 2.413 ± 0.478
SerArg: 2.337 ± 0.356
SerSer: 4.524 ± 0.798
SerThr: 3.317 ± 0.626
SerVal: 3.996 ± 0.608
SerTrp: 1.056 ± 0.323
SerTyr: 3.091 ± 0.464
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.222 ± 0.758
ThrCys: 0.452 ± 0.215
ThrAsp: 4.825 ± 0.513
ThrGlu: 4.222 ± 0.622
ThrPhe: 2.488 ± 0.458
ThrGly: 3.921 ± 0.61
ThrHis: 0.679 ± 0.241
ThrIle: 4.147 ± 0.662
ThrLys: 5.127 ± 0.546
ThrLeu: 5.429 ± 0.57
ThrMet: 1.056 ± 0.278
ThrAsn: 2.865 ± 0.423
ThrPro: 1.885 ± 0.323
ThrGln: 2.036 ± 0.544
ThrArg: 1.282 ± 0.419
ThrSer: 4.071 ± 0.619
ThrThr: 3.996 ± 0.665
ThrVal: 3.921 ± 0.549
ThrTrp: 0.754 ± 0.249
ThrTyr: 2.262 ± 0.391
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.091 ± 0.635
ValCys: 0.679 ± 0.303
ValAsp: 5.052 ± 0.688
ValGlu: 5.353 ± 0.54
ValPhe: 2.488 ± 0.455
ValGly: 3.242 ± 0.579
ValHis: 1.131 ± 0.317
ValIle: 3.996 ± 0.501
ValLys: 7.766 ± 0.82
ValLeu: 4.675 ± 0.704
ValMet: 0.679 ± 0.209
ValAsn: 3.694 ± 0.53
ValPro: 2.413 ± 0.523
ValGln: 1.583 ± 0.383
ValArg: 2.262 ± 0.419
ValSer: 4.976 ± 0.689
ValThr: 4.599 ± 0.556
ValVal: 4.901 ± 0.767
ValTrp: 0.829 ± 0.239
ValTyr: 2.865 ± 0.591
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.452 ± 0.218
TrpCys: 0.075 ± 0.084
TrpAsp: 0.905 ± 0.42
TrpGlu: 1.056 ± 0.234
TrpPhe: 0.754 ± 0.255
TrpGly: 0.528 ± 0.253
TrpHis: 0.226 ± 0.179
TrpIle: 0.679 ± 0.21
TrpLys: 0.377 ± 0.164
TrpLeu: 1.056 ± 0.289
TrpMet: 0.226 ± 0.131
TrpAsn: 0.905 ± 0.367
TrpPro: 0.075 ± 0.08
TrpGln: 0.452 ± 0.156
TrpArg: 0.98 ± 0.285
TrpSer: 0.679 ± 0.182
TrpThr: 0.528 ± 0.245
TrpVal: 0.905 ± 0.28
TrpTrp: 0.075 ± 0.085
TrpTyr: 0.226 ± 0.167
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.79 ± 0.434
TyrCys: 0.603 ± 0.303
TyrAsp: 2.262 ± 0.398
TyrGlu: 2.714 ± 0.488
TyrPhe: 1.734 ± 0.359
TyrGly: 1.734 ± 0.421
TyrHis: 0.528 ± 0.207
TyrIle: 2.337 ± 0.495
TyrLys: 4.373 ± 0.66
TyrLeu: 2.941 ± 0.615
TyrMet: 0.754 ± 0.299
TyrAsn: 2.187 ± 0.405
TyrPro: 1.659 ± 0.3
TyrGln: 1.659 ± 0.336
TyrArg: 1.734 ± 0.375
TyrSer: 1.81 ± 0.448
TyrThr: 2.413 ± 0.456
TyrVal: 2.262 ± 0.491
TyrTrp: 0.377 ± 0.193
TyrTyr: 1.282 ± 0.364
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 67 proteins (13264 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
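A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration under stated assumptions rather than the database's actual procedure: the function names, the toy sequences, the fixed seed, and the use of the sample standard deviation as the error are all choices made for the example; only the idea of resampling whole proteins 100 times comes from the note above.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide, in per mille of all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_sd(proteins, dipeptide, n_replicates=100, seed=0):
    """Spread of the frequency across protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


# Hypothetical toy sequences, not taken from the BL3 proteome.
toy_proteins = ["MKKLAE", "MAKKE", "MLLKVA", "MKVAEE"]

point = dipeptide_per_mille(toy_proteins, "KK")
sd = bootstrap_sd(toy_proteins, "KK")
print(f"KK: {point:.1f} ± {sd:.1f} per mille")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the error reflects how much a frequency depends on which proteins happen to be in the proteome.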

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski