Amino acid dipepetide frequency for Mannheimia phage PHL101

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.699AlaAla: 3.699 ± 0.724
0.973AlaCys: 0.973 ± 0.259
5.354AlaAsp: 5.354 ± 0.681
7.69AlaGlu: 7.69 ± 1.096
2.823AlaPhe: 2.823 ± 0.683
5.451AlaGly: 5.451 ± 0.711
0.876AlaHis: 0.876 ± 0.302
5.256AlaIle: 5.256 ± 0.73
6.522AlaLys: 6.522 ± 0.701
7.009AlaLeu: 7.009 ± 0.841
2.628AlaMet: 2.628 ± 0.546
3.894AlaAsn: 3.894 ± 0.49
1.947AlaPro: 1.947 ± 0.359
2.044AlaGln: 2.044 ± 0.462
3.991AlaArg: 3.991 ± 0.63
3.407AlaSer: 3.407 ± 0.68
5.354AlaThr: 5.354 ± 0.732
4.964AlaVal: 4.964 ± 0.689
0.779AlaTrp: 0.779 ± 0.217
3.894AlaTyr: 3.894 ± 0.586
0.0AlaXaa: 0.0 ± 0.0
Cys
0.487CysAla: 0.487 ± 0.198
0.097CysCys: 0.097 ± 0.087
0.487CysAsp: 0.487 ± 0.225
0.779CysGlu: 0.779 ± 0.276
0.292CysPhe: 0.292 ± 0.159
0.779CysGly: 0.779 ± 0.256
0.097CysHis: 0.097 ± 0.087
0.779CysIle: 0.779 ± 0.291
0.681CysLys: 0.681 ± 0.213
1.265CysLeu: 1.265 ± 0.303
0.097CysMet: 0.097 ± 0.087
0.876CysAsn: 0.876 ± 0.326
0.292CysPro: 0.292 ± 0.183
0.681CysGln: 0.681 ± 0.312
0.584CysArg: 0.584 ± 0.23
0.487CysSer: 0.487 ± 0.221
0.876CysThr: 0.876 ± 0.308
0.876CysVal: 0.876 ± 0.251
0.097CysTrp: 0.097 ± 0.1
0.389CysTyr: 0.389 ± 0.192
0.0CysXaa: 0.0 ± 0.0
Asp
2.434AspAla: 2.434 ± 0.396
0.097AspCys: 0.097 ± 0.084
3.602AspAsp: 3.602 ± 0.678
4.38AspGlu: 4.38 ± 0.62
2.336AspPhe: 2.336 ± 0.544
4.478AspGly: 4.478 ± 0.706
0.389AspHis: 0.389 ± 0.213
3.212AspIle: 3.212 ± 0.659
3.602AspLys: 3.602 ± 0.607
5.743AspLeu: 5.743 ± 0.601
1.071AspMet: 1.071 ± 0.276
2.628AspAsn: 2.628 ± 0.514
1.655AspPro: 1.655 ± 0.337
0.876AspGln: 0.876 ± 0.283
2.142AspArg: 2.142 ± 0.427
2.823AspSer: 2.823 ± 0.437
3.115AspThr: 3.115 ± 0.594
3.407AspVal: 3.407 ± 0.671
0.779AspTrp: 0.779 ± 0.226
2.239AspTyr: 2.239 ± 0.362
0.0AspXaa: 0.0 ± 0.0
Glu
5.159GluAla: 5.159 ± 0.839
0.779GluCys: 0.779 ± 0.311
2.823GluAsp: 2.823 ± 0.486
4.575GluGlu: 4.575 ± 0.766
3.212GluPhe: 3.212 ± 0.664
1.85GluGly: 1.85 ± 0.456
2.434GluHis: 2.434 ± 0.488
5.646GluIle: 5.646 ± 0.852
6.814GluLys: 6.814 ± 0.935
7.982GluLeu: 7.982 ± 0.811
1.752GluMet: 1.752 ± 0.375
4.088GluAsn: 4.088 ± 0.53
2.92GluPro: 2.92 ± 0.494
6.133GluGln: 6.133 ± 0.707
3.699GluArg: 3.699 ± 0.538
4.283GluSer: 4.283 ± 0.62
3.115GluThr: 3.115 ± 0.483
3.991GluVal: 3.991 ± 0.562
1.071GluTrp: 1.071 ± 0.294
1.947GluTyr: 1.947 ± 0.423
0.0GluXaa: 0.0 ± 0.0
Phe
3.407PheAla: 3.407 ± 0.726
0.681PheCys: 0.681 ± 0.261
3.212PheAsp: 3.212 ± 0.618
3.018PheGlu: 3.018 ± 0.514
1.363PhePhe: 1.363 ± 0.464
2.336PheGly: 2.336 ± 0.542
0.681PheHis: 0.681 ± 0.204
3.018PheIle: 3.018 ± 0.825
3.115PheLys: 3.115 ± 0.508
2.628PheLeu: 2.628 ± 0.506
1.071PheMet: 1.071 ± 0.367
2.531PheAsn: 2.531 ± 0.615
0.681PhePro: 0.681 ± 0.342
1.265PheGln: 1.265 ± 0.356
1.947PheArg: 1.947 ± 0.331
2.531PheSer: 2.531 ± 0.376
2.628PheThr: 2.628 ± 0.476
3.31PheVal: 3.31 ± 0.545
0.389PheTrp: 0.389 ± 0.174
0.973PheTyr: 0.973 ± 0.413
0.0PheXaa: 0.0 ± 0.0
Gly
5.159GlyAla: 5.159 ± 0.823
0.584GlyCys: 0.584 ± 0.291
3.212GlyAsp: 3.212 ± 0.611
5.062GlyGlu: 5.062 ± 0.564
3.018GlyPhe: 3.018 ± 0.537
3.699GlyGly: 3.699 ± 0.819
0.973GlyHis: 0.973 ± 0.265
3.699GlyIle: 3.699 ± 0.612
6.23GlyLys: 6.23 ± 0.872
4.672GlyLeu: 4.672 ± 0.909
1.168GlyMet: 1.168 ± 0.268
3.212GlyAsn: 3.212 ± 0.534
0.584GlyPro: 0.584 ± 0.252
1.655GlyGln: 1.655 ± 0.381
2.531GlyArg: 2.531 ± 0.516
3.699GlySer: 3.699 ± 0.609
3.504GlyThr: 3.504 ± 0.592
4.77GlyVal: 4.77 ± 0.606
0.876GlyTrp: 0.876 ± 0.31
1.947GlyTyr: 1.947 ± 0.375
0.0GlyXaa: 0.0 ± 0.0
His
1.265HisAla: 1.265 ± 0.372
0.097HisCys: 0.097 ± 0.093
0.876HisAsp: 0.876 ± 0.255
1.071HisGlu: 1.071 ± 0.262
0.779HisPhe: 0.779 ± 0.232
0.681HisGly: 0.681 ± 0.219
0.389HisHis: 0.389 ± 0.209
1.947HisIle: 1.947 ± 0.347
1.363HisLys: 1.363 ± 0.335
1.85HisLeu: 1.85 ± 0.369
0.0HisMet: 0.0 ± 0.0
0.779HisAsn: 0.779 ± 0.276
0.487HisPro: 0.487 ± 0.262
1.071HisGln: 1.071 ± 0.259
1.071HisArg: 1.071 ± 0.37
1.46HisSer: 1.46 ± 0.323
0.389HisThr: 0.389 ± 0.177
0.389HisVal: 0.389 ± 0.19
0.292HisTrp: 0.292 ± 0.176
0.779HisTyr: 0.779 ± 0.227
0.0HisXaa: 0.0 ± 0.0
Ile
5.354IleAla: 5.354 ± 0.646
0.681IleCys: 0.681 ± 0.232
3.504IleAsp: 3.504 ± 0.556
5.062IleGlu: 5.062 ± 0.545
2.434IlePhe: 2.434 ± 0.477
3.796IleGly: 3.796 ± 0.606
0.779IleHis: 0.779 ± 0.238
4.088IleIle: 4.088 ± 0.677
5.159IleLys: 5.159 ± 0.71
4.964IleLeu: 4.964 ± 0.74
0.584IleMet: 0.584 ± 0.233
4.38IleAsn: 4.38 ± 0.898
2.142IlePro: 2.142 ± 0.471
2.434IleGln: 2.434 ± 0.506
3.212IleArg: 3.212 ± 0.519
5.354IleSer: 5.354 ± 0.682
5.159IleThr: 5.159 ± 0.541
3.699IleVal: 3.699 ± 0.61
0.584IleTrp: 0.584 ± 0.186
1.752IleTyr: 1.752 ± 0.525
0.0IleXaa: 0.0 ± 0.0
Lys
7.203LysAla: 7.203 ± 0.907
0.487LysCys: 0.487 ± 0.217
2.726LysAsp: 2.726 ± 0.562
4.478LysGlu: 4.478 ± 0.532
2.92LysPhe: 2.92 ± 0.609
5.841LysGly: 5.841 ± 0.797
1.168LysHis: 1.168 ± 0.327
4.867LysIle: 4.867 ± 0.704
5.451LysLys: 5.451 ± 0.834
7.69LysLeu: 7.69 ± 0.867
2.531LysMet: 2.531 ± 0.484
4.38LysAsn: 4.38 ± 0.601
3.115LysPro: 3.115 ± 0.659
5.743LysGln: 5.743 ± 0.782
4.088LysArg: 4.088 ± 0.66
4.283LysSer: 4.283 ± 0.608
3.407LysThr: 3.407 ± 0.532
3.894LysVal: 3.894 ± 0.623
2.044LysTrp: 2.044 ± 0.574
2.726LysTyr: 2.726 ± 0.494
0.0LysXaa: 0.0 ± 0.0
Leu
8.371LeuAla: 8.371 ± 0.86
1.071LeuCys: 1.071 ± 0.28
4.186LeuAsp: 4.186 ± 0.692
7.495LeuGlu: 7.495 ± 0.795
3.991LeuPhe: 3.991 ± 0.613
5.938LeuGly: 5.938 ± 0.827
1.265LeuHis: 1.265 ± 0.322
5.062LeuIle: 5.062 ± 0.762
8.079LeuLys: 8.079 ± 0.826
8.177LeuLeu: 8.177 ± 0.913
1.168LeuMet: 1.168 ± 0.364
5.451LeuAsn: 5.451 ± 0.715
2.628LeuPro: 2.628 ± 0.623
5.159LeuGln: 5.159 ± 0.64
4.38LeuArg: 4.38 ± 0.607
5.549LeuSer: 5.549 ± 0.86
6.133LeuThr: 6.133 ± 0.913
5.159LeuVal: 5.159 ± 0.694
0.876LeuTrp: 0.876 ± 0.256
2.628LeuTyr: 2.628 ± 0.503
0.0LeuXaa: 0.0 ± 0.0
Met
1.752MetAla: 1.752 ± 0.306
0.292MetCys: 0.292 ± 0.172
0.876MetAsp: 0.876 ± 0.228
1.947MetGlu: 1.947 ± 0.419
0.779MetPhe: 0.779 ± 0.234
0.779MetGly: 0.779 ± 0.36
0.584MetHis: 0.584 ± 0.233
1.557MetIle: 1.557 ± 0.365
1.265MetLys: 1.265 ± 0.308
1.071MetLeu: 1.071 ± 0.252
0.584MetMet: 0.584 ± 0.267
1.168MetAsn: 1.168 ± 0.333
1.071MetPro: 1.071 ± 0.363
1.363MetGln: 1.363 ± 0.442
1.168MetArg: 1.168 ± 0.324
2.142MetSer: 2.142 ± 0.547
0.779MetThr: 0.779 ± 0.221
0.876MetVal: 0.876 ± 0.276
0.195MetTrp: 0.195 ± 0.125
0.389MetTyr: 0.389 ± 0.157
0.0MetXaa: 0.0 ± 0.0
Asn
3.699AsnAla: 3.699 ± 0.658
0.876AsnCys: 0.876 ± 0.306
1.655AsnAsp: 1.655 ± 0.406
4.38AsnGlu: 4.38 ± 0.585
1.752AsnPhe: 1.752 ± 0.396
4.77AsnGly: 4.77 ± 0.58
1.168AsnHis: 1.168 ± 0.374
4.283AsnIle: 4.283 ± 0.642
3.504AsnLys: 3.504 ± 0.621
5.062AsnLeu: 5.062 ± 0.655
0.973AsnMet: 0.973 ± 0.308
3.115AsnAsn: 3.115 ± 1.069
2.726AsnPro: 2.726 ± 0.556
3.31AsnGln: 3.31 ± 0.596
2.726AsnArg: 2.726 ± 0.501
2.044AsnSer: 2.044 ± 0.397
2.336AsnThr: 2.336 ± 0.506
2.628AsnVal: 2.628 ± 0.583
0.779AsnTrp: 0.779 ± 0.235
2.239AsnTyr: 2.239 ± 0.499
0.0AsnXaa: 0.0 ± 0.0
Pro
2.044ProAla: 2.044 ± 0.408
0.292ProCys: 0.292 ± 0.171
2.142ProAsp: 2.142 ± 0.444
1.947ProGlu: 1.947 ± 0.425
1.265ProPhe: 1.265 ± 0.338
0.195ProGly: 0.195 ± 0.237
0.681ProHis: 0.681 ± 0.266
2.239ProIle: 2.239 ± 0.506
2.531ProLys: 2.531 ± 0.469
3.018ProLeu: 3.018 ± 0.544
0.973ProMet: 0.973 ± 0.278
2.142ProAsn: 2.142 ± 0.522
1.363ProPro: 1.363 ± 0.406
2.142ProGln: 2.142 ± 0.529
1.557ProArg: 1.557 ± 0.476
1.85ProSer: 1.85 ± 0.309
2.142ProThr: 2.142 ± 0.464
2.531ProVal: 2.531 ± 0.549
0.389ProTrp: 0.389 ± 0.197
0.876ProTyr: 0.876 ± 0.309
0.0ProXaa: 0.0 ± 0.0
Gln
5.549GlnAla: 5.549 ± 0.636
0.389GlnCys: 0.389 ± 0.207
2.044GlnAsp: 2.044 ± 0.39
2.823GlnGlu: 2.823 ± 0.463
2.239GlnPhe: 2.239 ± 0.388
2.336GlnGly: 2.336 ± 0.507
0.292GlnHis: 0.292 ± 0.177
3.602GlnIle: 3.602 ± 0.435
3.894GlnLys: 3.894 ± 0.53
5.062GlnLeu: 5.062 ± 0.667
1.168GlnMet: 1.168 ± 0.35
3.115GlnAsn: 3.115 ± 0.446
1.752GlnPro: 1.752 ± 0.348
2.92GlnGln: 2.92 ± 0.623
3.115GlnArg: 3.115 ± 0.482
2.92GlnSer: 2.92 ± 0.426
3.212GlnThr: 3.212 ± 0.408
2.142GlnVal: 2.142 ± 0.481
0.779GlnTrp: 0.779 ± 0.312
1.265GlnTyr: 1.265 ± 0.401
0.0GlnXaa: 0.0 ± 0.0
Arg
4.283ArgAla: 4.283 ± 0.604
0.779ArgCys: 0.779 ± 0.259
2.239ArgAsp: 2.239 ± 0.464
4.478ArgGlu: 4.478 ± 0.651
1.947ArgPhe: 1.947 ± 0.436
2.239ArgGly: 2.239 ± 0.734
0.584ArgHis: 0.584 ± 0.199
3.796ArgIle: 3.796 ± 0.465
4.186ArgLys: 4.186 ± 0.63
6.911ArgLeu: 6.911 ± 0.808
0.779ArgMet: 0.779 ± 0.265
2.044ArgAsn: 2.044 ± 0.345
1.557ArgPro: 1.557 ± 0.386
2.434ArgGln: 2.434 ± 0.469
2.628ArgArg: 2.628 ± 0.577
1.85ArgSer: 1.85 ± 0.412
2.336ArgThr: 2.336 ± 0.424
3.894ArgVal: 3.894 ± 0.683
0.681ArgTrp: 0.681 ± 0.263
1.265ArgTyr: 1.265 ± 0.37
0.0ArgXaa: 0.0 ± 0.0
Ser
4.964SerAla: 4.964 ± 0.745
0.389SerCys: 0.389 ± 0.229
2.628SerAsp: 2.628 ± 0.474
3.991SerGlu: 3.991 ± 0.49
2.142SerPhe: 2.142 ± 0.423
4.867SerGly: 4.867 ± 0.736
1.168SerHis: 1.168 ± 0.273
3.31SerIle: 3.31 ± 0.554
4.38SerLys: 4.38 ± 0.577
4.867SerLeu: 4.867 ± 0.709
0.973SerMet: 0.973 ± 0.29
2.823SerAsn: 2.823 ± 0.56
2.531SerPro: 2.531 ± 0.432
3.31SerGln: 3.31 ± 0.417
3.31SerArg: 3.31 ± 0.482
3.504SerSer: 3.504 ± 0.55
2.336SerThr: 2.336 ± 0.501
4.478SerVal: 4.478 ± 0.486
0.292SerTrp: 0.292 ± 0.126
1.752SerTyr: 1.752 ± 0.358
0.0SerXaa: 0.0 ± 0.0
Thr
4.964ThrAla: 4.964 ± 1.007
0.487ThrCys: 0.487 ± 0.213
2.823ThrAsp: 2.823 ± 0.65
4.283ThrGlu: 4.283 ± 0.808
2.239ThrPhe: 2.239 ± 0.448
4.478ThrGly: 4.478 ± 0.57
1.557ThrHis: 1.557 ± 0.419
3.212ThrIle: 3.212 ± 0.571
3.602ThrLys: 3.602 ± 0.749
5.062ThrLeu: 5.062 ± 0.706
1.265ThrMet: 1.265 ± 0.36
2.434ThrAsn: 2.434 ± 0.433
2.726ThrPro: 2.726 ± 0.476
2.044ThrGln: 2.044 ± 0.335
2.142ThrArg: 2.142 ± 0.434
3.115ThrSer: 3.115 ± 0.522
3.31ThrThr: 3.31 ± 0.622
4.478ThrVal: 4.478 ± 0.663
0.487ThrTrp: 0.487 ± 0.224
1.85ThrTyr: 1.85 ± 0.423
0.0ThrXaa: 0.0 ± 0.0
Val
5.159ValAla: 5.159 ± 0.739
0.681ValCys: 0.681 ± 0.253
3.796ValAsp: 3.796 ± 0.547
3.991ValGlu: 3.991 ± 0.636
2.628ValPhe: 2.628 ± 0.478
3.504ValGly: 3.504 ± 0.467
0.195ValHis: 0.195 ± 0.124
3.796ValIle: 3.796 ± 0.739
5.938ValLys: 5.938 ± 1.0
5.256ValLeu: 5.256 ± 0.783
1.265ValMet: 1.265 ± 0.381
3.31ValAsn: 3.31 ± 0.545
1.168ValPro: 1.168 ± 0.356
3.504ValGln: 3.504 ± 0.566
3.407ValArg: 3.407 ± 0.562
4.186ValSer: 4.186 ± 0.707
3.991ValThr: 3.991 ± 0.672
4.186ValVal: 4.186 ± 0.621
0.584ValTrp: 0.584 ± 0.242
1.85ValTyr: 1.85 ± 0.486
0.0ValXaa: 0.0 ± 0.0
Trp
1.557TrpAla: 1.557 ± 0.42
0.195TrpCys: 0.195 ± 0.136
0.876TrpAsp: 0.876 ± 0.249
0.584TrpGlu: 0.584 ± 0.207
0.681TrpPhe: 0.681 ± 0.273
0.487TrpGly: 0.487 ± 0.242
0.681TrpHis: 0.681 ± 0.221
0.876TrpIle: 0.876 ± 0.314
0.389TrpLys: 0.389 ± 0.137
1.655TrpLeu: 1.655 ± 0.396
0.097TrpMet: 0.097 ± 0.081
0.292TrpAsn: 0.292 ± 0.153
0.0TrpPro: 0.0 ± 0.0
0.487TrpGln: 0.487 ± 0.185
0.876TrpArg: 0.876 ± 0.283
1.071TrpSer: 1.071 ± 0.282
0.584TrpThr: 0.584 ± 0.26
0.876TrpVal: 0.876 ± 0.31
0.195TrpTrp: 0.195 ± 0.121
0.292TrpTyr: 0.292 ± 0.173
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.044TyrAla: 2.044 ± 0.267
1.071TyrCys: 1.071 ± 0.291
1.947TyrAsp: 1.947 ± 0.546
2.239TyrGlu: 2.239 ± 0.417
1.947TyrPhe: 1.947 ± 0.427
1.752TyrGly: 1.752 ± 0.338
1.071TyrHis: 1.071 ± 0.331
0.876TyrIle: 0.876 ± 0.256
2.336TyrLys: 2.336 ± 0.461
2.823TyrLeu: 2.823 ± 0.545
0.389TyrMet: 0.389 ± 0.209
1.363TyrAsn: 1.363 ± 0.327
0.973TyrPro: 0.973 ± 0.31
2.142TyrGln: 2.142 ± 0.467
2.336TyrArg: 2.336 ± 0.369
1.46TyrSer: 1.46 ± 0.378
1.85TyrThr: 1.85 ± 0.423
1.85TyrVal: 1.85 ± 0.37
0.584TyrTrp: 0.584 ± 0.201
0.876TyrTyr: 0.876 ± 0.251
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 49 proteins (10274 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski