Amino acid dipepetide frequency for Dickeya phage Sucellus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.525AlaAla: 6.525 ± 1.204
1.047AlaCys: 1.047 ± 0.341
3.947AlaAsp: 3.947 ± 0.552
4.108AlaGlu: 4.108 ± 0.699
2.336AlaPhe: 2.336 ± 0.479
6.283AlaGly: 6.283 ± 0.823
0.967AlaHis: 0.967 ± 0.26
4.833AlaIle: 4.833 ± 0.565
4.028AlaLys: 4.028 ± 0.598
5.639AlaLeu: 5.639 ± 0.79
2.094AlaMet: 2.094 ± 0.491
3.705AlaAsn: 3.705 ± 0.476
1.692AlaPro: 1.692 ± 0.498
2.658AlaGln: 2.658 ± 0.456
2.256AlaArg: 2.256 ± 0.488
4.914AlaSer: 4.914 ± 0.535
4.269AlaThr: 4.269 ± 0.798
4.189AlaVal: 4.189 ± 0.624
0.967AlaTrp: 0.967 ± 0.285
1.933AlaTyr: 1.933 ± 0.398
0.0AlaXaa: 0.0 ± 0.0
Cys
0.886CysAla: 0.886 ± 0.245
0.242CysCys: 0.242 ± 0.152
0.967CysAsp: 0.967 ± 0.282
1.128CysGlu: 1.128 ± 0.332
0.644CysPhe: 0.644 ± 0.258
1.128CysGly: 1.128 ± 0.339
0.403CysHis: 0.403 ± 0.178
1.531CysIle: 1.531 ± 0.333
1.128CysLys: 1.128 ± 0.321
0.967CysLeu: 0.967 ± 0.337
0.161CysMet: 0.161 ± 0.116
0.806CysAsn: 0.806 ± 0.327
0.725CysPro: 0.725 ± 0.211
0.483CysGln: 0.483 ± 0.186
0.967CysArg: 0.967 ± 0.324
0.967CysSer: 0.967 ± 0.305
0.483CysThr: 0.483 ± 0.193
0.725CysVal: 0.725 ± 0.246
0.081CysTrp: 0.081 ± 0.094
0.322CysTyr: 0.322 ± 0.172
0.0CysXaa: 0.0 ± 0.0
Asp
4.189AspAla: 4.189 ± 0.567
0.564AspCys: 0.564 ± 0.194
4.028AspAsp: 4.028 ± 0.655
4.833AspGlu: 4.833 ± 0.693
2.417AspPhe: 2.417 ± 0.447
5.8AspGly: 5.8 ± 0.609
0.483AspHis: 0.483 ± 0.242
4.43AspIle: 4.43 ± 0.502
4.269AspLys: 4.269 ± 0.57
3.625AspLeu: 3.625 ± 0.583
1.692AspMet: 1.692 ± 0.378
3.786AspAsn: 3.786 ± 0.595
1.128AspPro: 1.128 ± 0.26
1.208AspGln: 1.208 ± 0.544
2.256AspArg: 2.256 ± 0.285
4.269AspSer: 4.269 ± 0.596
2.175AspThr: 2.175 ± 0.352
4.592AspVal: 4.592 ± 0.611
0.644AspTrp: 0.644 ± 0.255
2.9AspTyr: 2.9 ± 0.442
0.0AspXaa: 0.0 ± 0.0
Glu
3.544GluAla: 3.544 ± 0.532
1.128GluCys: 1.128 ± 0.408
2.014GluAsp: 2.014 ± 0.359
3.222GluGlu: 3.222 ± 0.612
2.739GluPhe: 2.739 ± 0.441
3.142GluGly: 3.142 ± 0.518
1.208GluHis: 1.208 ± 0.417
6.444GluIle: 6.444 ± 0.667
3.705GluLys: 3.705 ± 0.523
5.8GluLeu: 5.8 ± 0.603
2.739GluMet: 2.739 ± 0.443
2.175GluAsn: 2.175 ± 0.404
1.853GluPro: 1.853 ± 0.44
2.175GluGln: 2.175 ± 0.423
3.705GluArg: 3.705 ± 0.476
4.189GluSer: 4.189 ± 0.846
4.43GluThr: 4.43 ± 0.525
3.464GluVal: 3.464 ± 0.514
0.886GluTrp: 0.886 ± 0.285
3.867GluTyr: 3.867 ± 0.478
0.0GluXaa: 0.0 ± 0.0
Phe
2.739PheAla: 2.739 ± 0.538
0.967PheCys: 0.967 ± 0.345
2.981PheAsp: 2.981 ± 0.595
2.497PheGlu: 2.497 ± 0.469
1.611PhePhe: 1.611 ± 0.449
2.739PheGly: 2.739 ± 0.405
0.644PheHis: 0.644 ± 0.21
2.658PheIle: 2.658 ± 0.549
2.497PheLys: 2.497 ± 0.409
1.772PheLeu: 1.772 ± 0.426
1.289PheMet: 1.289 ± 0.292
3.303PheAsn: 3.303 ± 0.745
1.289PhePro: 1.289 ± 0.293
1.611PheGln: 1.611 ± 0.506
1.692PheArg: 1.692 ± 0.338
4.189PheSer: 4.189 ± 0.533
1.611PheThr: 1.611 ± 0.289
2.9PheVal: 2.9 ± 0.458
0.483PheTrp: 0.483 ± 0.212
1.611PheTyr: 1.611 ± 0.368
0.0PheXaa: 0.0 ± 0.0
Gly
4.994GlyAla: 4.994 ± 0.754
0.725GlyCys: 0.725 ± 0.265
4.914GlyAsp: 4.914 ± 0.454
3.705GlyGlu: 3.705 ± 0.554
4.43GlyPhe: 4.43 ± 0.931
7.25GlyGly: 7.25 ± 1.523
0.725GlyHis: 0.725 ± 0.314
3.947GlyIle: 3.947 ± 0.559
5.558GlyLys: 5.558 ± 0.473
6.444GlyLeu: 6.444 ± 0.741
2.981GlyMet: 2.981 ± 0.458
4.592GlyAsn: 4.592 ± 0.766
1.692GlyPro: 1.692 ± 0.338
2.175GlyGln: 2.175 ± 0.512
3.061GlyArg: 3.061 ± 0.448
4.753GlySer: 4.753 ± 0.579
3.786GlyThr: 3.786 ± 0.776
6.364GlyVal: 6.364 ± 0.739
1.208GlyTrp: 1.208 ± 0.336
2.981GlyTyr: 2.981 ± 0.495
0.0GlyXaa: 0.0 ± 0.0
His
0.564HisAla: 0.564 ± 0.242
0.242HisCys: 0.242 ± 0.125
0.483HisAsp: 0.483 ± 0.182
0.725HisGlu: 0.725 ± 0.247
0.081HisPhe: 0.081 ± 0.073
1.531HisGly: 1.531 ± 0.364
0.806HisHis: 0.806 ± 0.316
0.967HisIle: 0.967 ± 0.295
1.047HisLys: 1.047 ± 0.401
0.886HisLeu: 0.886 ± 0.263
0.403HisMet: 0.403 ± 0.168
0.886HisAsn: 0.886 ± 0.262
0.806HisPro: 0.806 ± 0.244
0.564HisGln: 0.564 ± 0.257
1.289HisArg: 1.289 ± 0.35
1.45HisSer: 1.45 ± 0.372
0.564HisThr: 0.564 ± 0.186
0.725HisVal: 0.725 ± 0.314
0.322HisTrp: 0.322 ± 0.144
0.483HisTyr: 0.483 ± 0.17
0.0HisXaa: 0.0 ± 0.0
Ile
5.155IleAla: 5.155 ± 0.553
1.611IleCys: 1.611 ± 0.469
6.525IleAsp: 6.525 ± 0.85
5.075IleGlu: 5.075 ± 0.738
2.578IlePhe: 2.578 ± 0.602
3.947IleGly: 3.947 ± 0.648
0.967IleHis: 0.967 ± 0.315
5.155IleIle: 5.155 ± 0.828
6.122IleLys: 6.122 ± 0.789
3.061IleLeu: 3.061 ± 0.675
1.289IleMet: 1.289 ± 0.353
6.203IleAsn: 6.203 ± 0.946
2.658IlePro: 2.658 ± 0.539
1.933IleGln: 1.933 ± 0.363
3.222IleArg: 3.222 ± 0.51
6.444IleSer: 6.444 ± 0.827
5.075IleThr: 5.075 ± 0.772
5.317IleVal: 5.317 ± 0.679
1.128IleTrp: 1.128 ± 0.33
3.061IleTyr: 3.061 ± 0.511
0.0IleXaa: 0.0 ± 0.0
Lys
3.625LysAla: 3.625 ± 0.677
0.967LysCys: 0.967 ± 0.287
3.625LysAsp: 3.625 ± 0.592
4.189LysGlu: 4.189 ± 0.493
3.061LysPhe: 3.061 ± 0.61
3.544LysGly: 3.544 ± 0.577
0.644LysHis: 0.644 ± 0.28
6.847LysIle: 6.847 ± 0.769
6.444LysLys: 6.444 ± 0.955
5.88LysLeu: 5.88 ± 0.665
2.175LysMet: 2.175 ± 0.39
4.108LysAsn: 4.108 ± 0.727
2.094LysPro: 2.094 ± 0.409
2.094LysGln: 2.094 ± 0.559
2.417LysArg: 2.417 ± 0.463
5.075LysSer: 5.075 ± 0.72
3.061LysThr: 3.061 ± 0.524
4.35LysVal: 4.35 ± 0.613
0.564LysTrp: 0.564 ± 0.186
2.739LysTyr: 2.739 ± 0.482
0.0LysXaa: 0.0 ± 0.0
Leu
4.43LeuAla: 4.43 ± 0.689
1.128LeuCys: 1.128 ± 0.395
4.753LeuAsp: 4.753 ± 0.681
4.189LeuGlu: 4.189 ± 0.586
2.739LeuPhe: 2.739 ± 0.487
4.43LeuGly: 4.43 ± 0.545
0.886LeuHis: 0.886 ± 0.301
6.122LeuIle: 6.122 ± 0.648
4.269LeuLys: 4.269 ± 0.727
3.867LeuLeu: 3.867 ± 0.59
1.853LeuMet: 1.853 ± 0.386
4.511LeuAsn: 4.511 ± 0.76
2.9LeuPro: 2.9 ± 0.309
2.094LeuGln: 2.094 ± 0.331
3.061LeuArg: 3.061 ± 0.516
6.605LeuSer: 6.605 ± 0.707
3.786LeuThr: 3.786 ± 0.472
3.786LeuVal: 3.786 ± 0.465
0.644LeuTrp: 0.644 ± 0.21
2.256LeuTyr: 2.256 ± 0.462
0.0LeuXaa: 0.0 ± 0.0
Met
3.383MetAla: 3.383 ± 0.552
0.403MetCys: 0.403 ± 0.163
1.289MetAsp: 1.289 ± 0.331
1.933MetGlu: 1.933 ± 0.504
1.369MetPhe: 1.369 ± 0.41
1.772MetGly: 1.772 ± 0.404
0.081MetHis: 0.081 ± 0.077
2.256MetIle: 2.256 ± 0.366
2.417MetLys: 2.417 ± 0.513
1.611MetLeu: 1.611 ± 0.407
1.128MetMet: 1.128 ± 0.32
1.369MetAsn: 1.369 ± 0.343
1.128MetPro: 1.128 ± 0.344
1.45MetGln: 1.45 ± 0.35
1.45MetArg: 1.45 ± 0.414
2.739MetSer: 2.739 ± 0.349
1.369MetThr: 1.369 ± 0.342
1.933MetVal: 1.933 ± 0.335
0.161MetTrp: 0.161 ± 0.131
0.725MetTyr: 0.725 ± 0.224
0.0MetXaa: 0.0 ± 0.0
Asn
3.867AsnAla: 3.867 ± 0.688
0.886AsnCys: 0.886 ± 0.273
3.544AsnAsp: 3.544 ± 0.498
3.625AsnGlu: 3.625 ± 0.574
2.256AsnPhe: 2.256 ± 0.44
6.122AsnGly: 6.122 ± 0.893
1.289AsnHis: 1.289 ± 0.338
4.35AsnIle: 4.35 ± 0.811
4.833AsnLys: 4.833 ± 0.699
3.786AsnLeu: 3.786 ± 0.515
1.772AsnMet: 1.772 ± 0.307
5.236AsnAsn: 5.236 ± 1.334
1.692AsnPro: 1.692 ± 0.402
2.9AsnGln: 2.9 ± 1.07
2.9AsnArg: 2.9 ± 0.389
5.478AsnSer: 5.478 ± 0.718
2.981AsnThr: 2.981 ± 0.616
2.819AsnVal: 2.819 ± 0.534
0.483AsnTrp: 0.483 ± 0.215
2.256AsnTyr: 2.256 ± 0.369
0.0AsnXaa: 0.0 ± 0.0
Pro
2.497ProAla: 2.497 ± 0.442
0.242ProCys: 0.242 ± 0.134
2.417ProAsp: 2.417 ± 0.447
2.739ProGlu: 2.739 ± 0.829
1.369ProPhe: 1.369 ± 0.44
2.336ProGly: 2.336 ± 0.324
0.564ProHis: 0.564 ± 0.202
1.853ProIle: 1.853 ± 0.378
1.289ProLys: 1.289 ± 0.319
2.658ProLeu: 2.658 ± 0.384
0.725ProMet: 0.725 ± 0.251
1.208ProAsn: 1.208 ± 0.307
1.369ProPro: 1.369 ± 0.359
0.806ProGln: 0.806 ± 0.32
0.806ProArg: 0.806 ± 0.276
3.061ProSer: 3.061 ± 0.505
2.175ProThr: 2.175 ± 0.392
2.578ProVal: 2.578 ± 0.423
0.322ProTrp: 0.322 ± 0.164
0.806ProTyr: 0.806 ± 0.203
0.0ProXaa: 0.0 ± 0.0
Gln
2.819GlnAla: 2.819 ± 0.428
0.564GlnCys: 0.564 ± 0.199
1.692GlnAsp: 1.692 ± 0.444
1.611GlnGlu: 1.611 ± 0.331
1.369GlnPhe: 1.369 ± 0.355
1.933GlnGly: 1.933 ± 0.534
0.644GlnHis: 0.644 ± 0.214
2.094GlnIle: 2.094 ± 0.456
1.369GlnLys: 1.369 ± 0.291
2.497GlnLeu: 2.497 ± 0.461
1.369GlnMet: 1.369 ± 0.338
3.383GlnAsn: 3.383 ± 1.612
0.886GlnPro: 0.886 ± 0.258
3.222GlnGln: 3.222 ± 1.841
2.175GlnArg: 2.175 ± 0.381
1.531GlnSer: 1.531 ± 0.35
1.772GlnThr: 1.772 ± 0.342
1.45GlnVal: 1.45 ± 0.428
0.242GlnTrp: 0.242 ± 0.136
0.725GlnTyr: 0.725 ± 0.245
0.0GlnXaa: 0.0 ± 0.0
Arg
2.658ArgAla: 2.658 ± 0.484
0.886ArgCys: 0.886 ± 0.26
2.417ArgAsp: 2.417 ± 0.37
2.336ArgGlu: 2.336 ± 0.453
2.256ArgPhe: 2.256 ± 0.419
3.544ArgGly: 3.544 ± 0.551
0.967ArgHis: 0.967 ± 0.273
3.222ArgIle: 3.222 ± 0.433
2.175ArgLys: 2.175 ± 0.409
3.142ArgLeu: 3.142 ± 0.446
1.772ArgMet: 1.772 ± 0.343
2.497ArgAsn: 2.497 ± 0.423
1.772ArgPro: 1.772 ± 0.421
1.208ArgGln: 1.208 ± 0.383
3.303ArgArg: 3.303 ± 0.539
3.383ArgSer: 3.383 ± 0.703
2.175ArgThr: 2.175 ± 0.409
2.658ArgVal: 2.658 ± 0.511
0.886ArgTrp: 0.886 ± 0.298
1.772ArgTyr: 1.772 ± 0.308
0.0ArgXaa: 0.0 ± 0.0
Ser
4.753SerAla: 4.753 ± 0.511
1.047SerCys: 1.047 ± 0.322
4.269SerAsp: 4.269 ± 0.628
6.203SerGlu: 6.203 ± 0.644
3.625SerPhe: 3.625 ± 0.584
7.975SerGly: 7.975 ± 0.727
0.967SerHis: 0.967 ± 0.228
6.283SerIle: 6.283 ± 0.827
5.478SerLys: 5.478 ± 0.771
4.994SerLeu: 4.994 ± 0.52
2.336SerMet: 2.336 ± 0.37
4.108SerAsn: 4.108 ± 0.551
2.417SerPro: 2.417 ± 0.528
1.692SerGln: 1.692 ± 0.391
3.303SerArg: 3.303 ± 0.485
7.33SerSer: 7.33 ± 0.902
4.592SerThr: 4.592 ± 0.72
5.639SerVal: 5.639 ± 1.116
0.403SerTrp: 0.403 ± 0.219
3.061SerTyr: 3.061 ± 0.596
0.0SerXaa: 0.0 ± 0.0
Thr
4.511ThrAla: 4.511 ± 0.74
0.725ThrCys: 0.725 ± 0.255
3.061ThrAsp: 3.061 ± 0.386
3.222ThrGlu: 3.222 ± 0.504
1.853ThrPhe: 1.853 ± 0.345
5.558ThrGly: 5.558 ± 0.994
0.483ThrHis: 0.483 ± 0.215
4.35ThrIle: 4.35 ± 0.808
3.303ThrLys: 3.303 ± 0.557
3.303ThrLeu: 3.303 ± 0.53
1.128ThrMet: 1.128 ± 0.301
3.303ThrAsn: 3.303 ± 0.626
2.336ThrPro: 2.336 ± 0.434
1.289ThrGln: 1.289 ± 0.302
1.692ThrArg: 1.692 ± 0.589
3.544ThrSer: 3.544 ± 0.737
3.625ThrThr: 3.625 ± 0.457
3.464ThrVal: 3.464 ± 0.441
0.967ThrTrp: 0.967 ± 0.273
1.853ThrTyr: 1.853 ± 0.327
0.0ThrXaa: 0.0 ± 0.0
Val
4.753ValAla: 4.753 ± 0.697
0.564ValCys: 0.564 ± 0.205
3.705ValAsp: 3.705 ± 0.52
4.269ValGlu: 4.269 ± 0.735
2.497ValPhe: 2.497 ± 0.551
3.947ValGly: 3.947 ± 0.649
0.806ValHis: 0.806 ± 0.268
5.961ValIle: 5.961 ± 0.792
4.592ValLys: 4.592 ± 0.632
3.544ValLeu: 3.544 ± 0.502
1.369ValMet: 1.369 ± 0.419
4.833ValAsn: 4.833 ± 0.579
2.256ValPro: 2.256 ± 0.506
1.369ValGln: 1.369 ± 0.413
2.981ValArg: 2.981 ± 0.712
5.719ValSer: 5.719 ± 0.69
3.544ValThr: 3.544 ± 0.48
4.672ValVal: 4.672 ± 0.657
0.564ValTrp: 0.564 ± 0.184
2.497ValTyr: 2.497 ± 0.566
0.0ValXaa: 0.0 ± 0.0
Trp
0.242TrpAla: 0.242 ± 0.149
0.322TrpCys: 0.322 ± 0.153
0.403TrpAsp: 0.403 ± 0.201
0.322TrpGlu: 0.322 ± 0.16
0.483TrpPhe: 0.483 ± 0.209
0.725TrpGly: 0.725 ± 0.248
0.242TrpHis: 0.242 ± 0.133
0.644TrpIle: 0.644 ± 0.183
0.644TrpLys: 0.644 ± 0.212
1.45TrpLeu: 1.45 ± 0.328
0.564TrpMet: 0.564 ± 0.266
0.564TrpAsn: 0.564 ± 0.246
0.322TrpPro: 0.322 ± 0.166
0.644TrpGln: 0.644 ± 0.285
1.128TrpArg: 1.128 ± 0.277
0.967TrpSer: 0.967 ± 0.263
0.564TrpThr: 0.564 ± 0.216
0.644TrpVal: 0.644 ± 0.238
0.242TrpTrp: 0.242 ± 0.123
0.483TrpTyr: 0.483 ± 0.136
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.256TyrAla: 2.256 ± 0.391
0.483TyrCys: 0.483 ± 0.223
2.256TyrAsp: 2.256 ± 0.424
2.497TyrGlu: 2.497 ± 0.442
1.369TyrPhe: 1.369 ± 0.3
2.578TyrGly: 2.578 ± 0.379
1.047TyrHis: 1.047 ± 0.291
2.578TyrIle: 2.578 ± 0.544
2.336TyrLys: 2.336 ± 0.44
3.383TyrLeu: 3.383 ± 0.608
1.047TyrMet: 1.047 ± 0.262
2.497TyrAsn: 2.497 ± 0.522
0.806TyrPro: 0.806 ± 0.321
1.772TyrGln: 1.772 ± 0.467
1.369TyrArg: 1.369 ± 0.308
3.947TyrSer: 3.947 ± 0.693
1.531TyrThr: 1.531 ± 0.314
2.175TyrVal: 2.175 ± 0.443
0.403TyrTrp: 0.403 ± 0.151
0.967TyrTyr: 0.967 ± 0.304
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 55 proteins (12415 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski