Amino acid dipepetide frequency for Dickeya phage Dagda_B1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.262AlaAla: 10.262 ± 1.646
0.862AlaCys: 0.862 ± 0.275
5.484AlaAsp: 5.484 ± 0.895
5.719AlaGlu: 5.719 ± 0.84
3.134AlaPhe: 3.134 ± 0.491
6.816AlaGly: 6.816 ± 0.869
1.488AlaHis: 1.488 ± 0.307
4.779AlaIle: 4.779 ± 0.641
6.11AlaLys: 6.11 ± 0.623
8.304AlaLeu: 8.304 ± 0.827
2.977AlaMet: 2.977 ± 0.48
4.544AlaAsn: 4.544 ± 0.545
3.055AlaPro: 3.055 ± 0.464
5.249AlaGln: 5.249 ± 0.777
5.014AlaArg: 5.014 ± 0.517
5.875AlaSer: 5.875 ± 0.742
4.309AlaThr: 4.309 ± 0.797
5.64AlaVal: 5.64 ± 0.708
1.018AlaTrp: 1.018 ± 0.302
2.82AlaTyr: 2.82 ± 0.598
0.0AlaXaa: 0.0 ± 0.0
Cys
1.175CysAla: 1.175 ± 0.28
0.078CysCys: 0.078 ± 0.082
0.94CysAsp: 0.94 ± 0.368
0.94CysGlu: 0.94 ± 0.241
0.392CysPhe: 0.392 ± 0.194
0.627CysGly: 0.627 ± 0.283
0.548CysHis: 0.548 ± 0.229
0.627CysIle: 0.627 ± 0.256
0.235CysLys: 0.235 ± 0.146
0.548CysLeu: 0.548 ± 0.197
0.078CysMet: 0.078 ± 0.082
0.392CysAsn: 0.392 ± 0.174
0.47CysPro: 0.47 ± 0.213
0.392CysGln: 0.392 ± 0.239
0.392CysArg: 0.392 ± 0.175
0.47CysSer: 0.47 ± 0.272
0.392CysThr: 0.392 ± 0.184
0.627CysVal: 0.627 ± 0.216
0.313CysTrp: 0.313 ± 0.193
0.313CysTyr: 0.313 ± 0.173
0.0CysXaa: 0.0 ± 0.0
Asp
6.737AspAla: 6.737 ± 0.769
0.548AspCys: 0.548 ± 0.296
3.212AspAsp: 3.212 ± 0.534
3.604AspGlu: 3.604 ± 0.537
2.82AspPhe: 2.82 ± 0.403
5.562AspGly: 5.562 ± 0.542
0.94AspHis: 0.94 ± 0.279
3.525AspIle: 3.525 ± 0.463
3.447AspLys: 3.447 ± 0.385
4.23AspLeu: 4.23 ± 0.533
1.488AspMet: 1.488 ± 0.28
2.037AspAsn: 2.037 ± 0.394
2.664AspPro: 2.664 ± 0.384
2.272AspGln: 2.272 ± 0.681
2.82AspArg: 2.82 ± 0.463
3.369AspSer: 3.369 ± 0.463
3.525AspThr: 3.525 ± 0.41
4.857AspVal: 4.857 ± 0.774
0.548AspTrp: 0.548 ± 0.184
2.115AspTyr: 2.115 ± 0.417
0.0AspXaa: 0.0 ± 0.0
Glu
8.382GluAla: 8.382 ± 0.977
0.705GluCys: 0.705 ± 0.239
3.604GluAsp: 3.604 ± 0.557
5.014GluGlu: 5.014 ± 0.875
3.682GluPhe: 3.682 ± 0.518
4.622GluGly: 4.622 ± 0.738
1.253GluHis: 1.253 ± 0.257
3.447GluIle: 3.447 ± 0.52
2.585GluLys: 2.585 ± 0.565
4.935GluLeu: 4.935 ± 0.572
2.272GluMet: 2.272 ± 0.381
3.134GluAsn: 3.134 ± 0.5
1.802GluPro: 1.802 ± 0.447
3.604GluGln: 3.604 ± 0.568
3.995GluArg: 3.995 ± 0.493
3.839GluSer: 3.839 ± 0.718
3.134GluThr: 3.134 ± 0.357
3.76GluVal: 3.76 ± 0.536
1.018GluTrp: 1.018 ± 0.267
2.507GluTyr: 2.507 ± 0.442
0.0GluXaa: 0.0 ± 0.0
Phe
2.742PheAla: 2.742 ± 0.463
0.235PheCys: 0.235 ± 0.113
2.742PheAsp: 2.742 ± 0.437
2.115PheGlu: 2.115 ± 0.461
1.567PhePhe: 1.567 ± 0.431
3.369PheGly: 3.369 ± 0.55
0.705PheHis: 0.705 ± 0.177
2.664PheIle: 2.664 ± 0.547
2.664PheLys: 2.664 ± 0.441
3.369PheLeu: 3.369 ± 0.449
1.253PheMet: 1.253 ± 0.311
2.272PheAsn: 2.272 ± 0.46
1.88PhePro: 1.88 ± 0.494
1.332PheGln: 1.332 ± 0.339
1.018PheArg: 1.018 ± 0.296
2.82PheSer: 2.82 ± 0.562
2.977PheThr: 2.977 ± 0.465
2.272PheVal: 2.272 ± 0.421
0.548PheTrp: 0.548 ± 0.212
0.705PheTyr: 0.705 ± 0.279
0.0PheXaa: 0.0 ± 0.0
Gly
6.267GlyAla: 6.267 ± 0.778
1.018GlyCys: 1.018 ± 0.352
4.935GlyAsp: 4.935 ± 0.629
5.014GlyGlu: 5.014 ± 0.8
3.134GlyPhe: 3.134 ± 0.557
4.935GlyGly: 4.935 ± 0.583
1.097GlyHis: 1.097 ± 0.289
3.917GlyIle: 3.917 ± 0.436
5.484GlyLys: 5.484 ± 0.915
6.345GlyLeu: 6.345 ± 0.861
1.723GlyMet: 1.723 ± 0.368
3.682GlyAsn: 3.682 ± 0.635
1.645GlyPro: 1.645 ± 0.389
2.82GlyGln: 2.82 ± 0.527
3.839GlyArg: 3.839 ± 0.691
5.875GlySer: 5.875 ± 0.696
4.23GlyThr: 4.23 ± 0.616
4.465GlyVal: 4.465 ± 0.559
1.175GlyTrp: 1.175 ± 0.434
2.507GlyTyr: 2.507 ± 0.437
0.0GlyXaa: 0.0 ± 0.0
His
0.705HisAla: 0.705 ± 0.211
0.627HisCys: 0.627 ± 0.201
0.783HisAsp: 0.783 ± 0.272
1.958HisGlu: 1.958 ± 0.433
0.783HisPhe: 0.783 ± 0.282
1.253HisGly: 1.253 ± 0.284
0.548HisHis: 0.548 ± 0.24
0.94HisIle: 0.94 ± 0.329
0.94HisLys: 0.94 ± 0.241
1.958HisLeu: 1.958 ± 0.392
0.47HisMet: 0.47 ± 0.189
1.097HisAsn: 1.097 ± 0.34
0.235HisPro: 0.235 ± 0.125
0.627HisGln: 0.627 ± 0.17
0.47HisArg: 0.47 ± 0.259
1.802HisSer: 1.802 ± 0.468
0.94HisThr: 0.94 ± 0.35
1.332HisVal: 1.332 ± 0.324
0.47HisTrp: 0.47 ± 0.199
0.705HisTyr: 0.705 ± 0.23
0.0HisXaa: 0.0 ± 0.0
Ile
4.857IleAla: 4.857 ± 0.54
0.392IleCys: 0.392 ± 0.176
3.604IleAsp: 3.604 ± 0.503
2.899IleGlu: 2.899 ± 0.582
1.567IlePhe: 1.567 ± 0.385
3.369IleGly: 3.369 ± 0.489
1.253IleHis: 1.253 ± 0.291
3.212IleIle: 3.212 ± 0.541
4.544IleLys: 4.544 ± 0.68
3.995IleLeu: 3.995 ± 0.561
1.567IleMet: 1.567 ± 0.38
2.272IleAsn: 2.272 ± 0.564
2.35IlePro: 2.35 ± 0.436
2.193IleGln: 2.193 ± 0.334
3.604IleArg: 3.604 ± 0.517
2.664IleSer: 2.664 ± 0.368
2.899IleThr: 2.899 ± 0.473
2.742IleVal: 2.742 ± 0.391
0.392IleTrp: 0.392 ± 0.156
1.958IleTyr: 1.958 ± 0.392
0.0IleXaa: 0.0 ± 0.0
Lys
7.991LysAla: 7.991 ± 0.942
0.47LysCys: 0.47 ± 0.2
4.074LysAsp: 4.074 ± 0.577
5.092LysGlu: 5.092 ± 0.618
2.272LysPhe: 2.272 ± 0.424
4.544LysGly: 4.544 ± 0.665
0.94LysHis: 0.94 ± 0.257
2.742LysIle: 2.742 ± 0.439
3.604LysLys: 3.604 ± 0.728
5.17LysLeu: 5.17 ± 0.555
2.193LysMet: 2.193 ± 0.365
2.507LysAsn: 2.507 ± 0.49
2.429LysPro: 2.429 ± 0.496
2.742LysGln: 2.742 ± 0.511
3.447LysArg: 3.447 ± 0.524
4.465LysSer: 4.465 ± 0.621
4.544LysThr: 4.544 ± 0.566
3.917LysVal: 3.917 ± 0.503
0.47LysTrp: 0.47 ± 0.149
1.958LysTyr: 1.958 ± 0.265
0.0LysXaa: 0.0 ± 0.0
Leu
6.58LeuAla: 6.58 ± 0.831
0.705LeuCys: 0.705 ± 0.304
5.327LeuAsp: 5.327 ± 0.55
6.11LeuGlu: 6.11 ± 0.773
2.429LeuPhe: 2.429 ± 0.456
5.562LeuGly: 5.562 ± 0.741
1.488LeuHis: 1.488 ± 0.315
3.76LeuIle: 3.76 ± 0.548
8.226LeuLys: 8.226 ± 0.631
4.857LeuLeu: 4.857 ± 0.632
2.742LeuMet: 2.742 ± 0.423
4.309LeuAsn: 4.309 ± 0.56
2.82LeuPro: 2.82 ± 0.513
3.212LeuGln: 3.212 ± 0.676
4.544LeuArg: 4.544 ± 0.551
5.797LeuSer: 5.797 ± 0.794
3.76LeuThr: 3.76 ± 0.541
4.309LeuVal: 4.309 ± 0.55
0.94LeuTrp: 0.94 ± 0.318
2.193LeuTyr: 2.193 ± 0.426
0.0LeuXaa: 0.0 ± 0.0
Met
2.899MetAla: 2.899 ± 0.474
0.235MetCys: 0.235 ± 0.167
1.88MetAsp: 1.88 ± 0.351
1.488MetGlu: 1.488 ± 0.345
1.41MetPhe: 1.41 ± 0.251
2.35MetGly: 2.35 ± 0.533
0.627MetHis: 0.627 ± 0.201
1.41MetIle: 1.41 ± 0.284
1.88MetLys: 1.88 ± 0.34
2.899MetLeu: 2.899 ± 0.496
0.783MetMet: 0.783 ± 0.225
1.723MetAsn: 1.723 ± 0.351
1.41MetPro: 1.41 ± 0.275
0.862MetGln: 0.862 ± 0.172
1.332MetArg: 1.332 ± 0.29
2.742MetSer: 2.742 ± 0.518
1.332MetThr: 1.332 ± 0.297
2.507MetVal: 2.507 ± 0.312
0.157MetTrp: 0.157 ± 0.11
0.47MetTyr: 0.47 ± 0.177
0.0MetXaa: 0.0 ± 0.0
Asn
3.369AsnAla: 3.369 ± 0.445
0.47AsnCys: 0.47 ± 0.161
2.272AsnAsp: 2.272 ± 0.382
2.977AsnGlu: 2.977 ± 0.457
2.272AsnPhe: 2.272 ± 0.411
4.074AsnGly: 4.074 ± 0.666
0.783AsnHis: 0.783 ± 0.238
2.193AsnIle: 2.193 ± 0.296
2.742AsnLys: 2.742 ± 0.458
3.134AsnLeu: 3.134 ± 0.455
1.88AsnMet: 1.88 ± 0.388
2.272AsnAsn: 2.272 ± 0.43
3.134AsnPro: 3.134 ± 0.517
1.802AsnGln: 1.802 ± 0.433
2.193AsnArg: 2.193 ± 0.511
2.585AsnSer: 2.585 ± 0.529
2.193AsnThr: 2.193 ± 0.405
3.604AsnVal: 3.604 ± 0.511
1.018AsnTrp: 1.018 ± 0.321
1.958AsnTyr: 1.958 ± 0.402
0.0AsnXaa: 0.0 ± 0.0
Pro
3.134ProAla: 3.134 ± 0.53
0.47ProCys: 0.47 ± 0.208
2.664ProAsp: 2.664 ± 0.344
4.7ProGlu: 4.7 ± 0.713
1.41ProPhe: 1.41 ± 0.355
1.567ProGly: 1.567 ± 0.287
0.783ProHis: 0.783 ± 0.226
1.88ProIle: 1.88 ± 0.305
2.585ProLys: 2.585 ± 0.466
2.429ProLeu: 2.429 ± 0.49
1.332ProMet: 1.332 ± 0.336
2.429ProAsn: 2.429 ± 0.426
0.783ProPro: 0.783 ± 0.22
1.645ProGln: 1.645 ± 0.384
1.41ProArg: 1.41 ± 0.255
2.193ProSer: 2.193 ± 0.406
1.958ProThr: 1.958 ± 0.281
3.29ProVal: 3.29 ± 0.456
0.47ProTrp: 0.47 ± 0.141
1.253ProTyr: 1.253 ± 0.292
0.0ProXaa: 0.0 ± 0.0
Gln
4.152GlnAla: 4.152 ± 0.616
0.313GlnCys: 0.313 ± 0.153
2.115GlnAsp: 2.115 ± 0.433
2.742GlnGlu: 2.742 ± 0.583
1.88GlnPhe: 1.88 ± 0.332
2.82GlnGly: 2.82 ± 0.365
0.548GlnHis: 0.548 ± 0.226
2.35GlnIle: 2.35 ± 0.574
2.429GlnLys: 2.429 ± 0.531
4.309GlnLeu: 4.309 ± 0.593
1.332GlnMet: 1.332 ± 0.332
2.115GlnAsn: 2.115 ± 0.331
1.567GlnPro: 1.567 ± 0.3
1.567GlnGln: 1.567 ± 0.279
2.272GlnArg: 2.272 ± 0.361
2.507GlnSer: 2.507 ± 0.408
2.193GlnThr: 2.193 ± 0.457
3.29GlnVal: 3.29 ± 0.577
0.94GlnTrp: 0.94 ± 0.278
0.627GlnTyr: 0.627 ± 0.223
0.0GlnXaa: 0.0 ± 0.0
Arg
4.152ArgAla: 4.152 ± 0.652
0.627ArgCys: 0.627 ± 0.249
3.29ArgAsp: 3.29 ± 0.442
3.525ArgGlu: 3.525 ± 0.459
2.585ArgPhe: 2.585 ± 0.514
3.525ArgGly: 3.525 ± 0.462
1.097ArgHis: 1.097 ± 0.34
2.899ArgIle: 2.899 ± 0.327
3.682ArgLys: 3.682 ± 0.552
4.23ArgLeu: 4.23 ± 0.563
1.488ArgMet: 1.488 ± 0.418
2.037ArgAsn: 2.037 ± 0.315
2.272ArgPro: 2.272 ± 0.505
2.35ArgGln: 2.35 ± 0.364
2.664ArgArg: 2.664 ± 0.485
2.272ArgSer: 2.272 ± 0.386
2.585ArgThr: 2.585 ± 0.469
3.525ArgVal: 3.525 ± 0.562
0.548ArgTrp: 0.548 ± 0.197
1.802ArgTyr: 1.802 ± 0.37
0.0ArgXaa: 0.0 ± 0.0
Ser
6.424SerAla: 6.424 ± 1.243
0.783SerCys: 0.783 ± 0.259
3.682SerAsp: 3.682 ± 0.618
4.387SerGlu: 4.387 ± 0.567
3.134SerPhe: 3.134 ± 0.541
6.345SerGly: 6.345 ± 0.667
1.332SerHis: 1.332 ± 0.313
2.35SerIle: 2.35 ± 0.459
3.525SerLys: 3.525 ± 0.433
5.092SerLeu: 5.092 ± 0.527
1.723SerMet: 1.723 ± 0.35
2.35SerAsn: 2.35 ± 0.322
2.82SerPro: 2.82 ± 0.606
2.82SerGln: 2.82 ± 0.503
2.664SerArg: 2.664 ± 0.336
4.544SerSer: 4.544 ± 0.702
2.664SerThr: 2.664 ± 0.509
3.76SerVal: 3.76 ± 0.541
1.567SerTrp: 1.567 ± 0.386
2.35SerTyr: 2.35 ± 0.406
0.0SerXaa: 0.0 ± 0.0
Thr
4.387ThrAla: 4.387 ± 0.581
0.392ThrCys: 0.392 ± 0.19
3.447ThrAsp: 3.447 ± 0.485
3.055ThrGlu: 3.055 ± 0.429
1.802ThrPhe: 1.802 ± 0.38
5.327ThrGly: 5.327 ± 0.55
0.862ThrHis: 0.862 ± 0.295
4.152ThrIle: 4.152 ± 0.507
3.369ThrLys: 3.369 ± 0.489
4.779ThrLeu: 4.779 ± 0.788
1.802ThrMet: 1.802 ± 0.308
2.272ThrAsn: 2.272 ± 0.362
2.585ThrPro: 2.585 ± 0.441
2.037ThrGln: 2.037 ± 0.393
1.723ThrArg: 1.723 ± 0.384
2.585ThrSer: 2.585 ± 0.452
3.29ThrThr: 3.29 ± 0.552
4.622ThrVal: 4.622 ± 0.538
0.862ThrTrp: 0.862 ± 0.249
1.332ThrTyr: 1.332 ± 0.257
0.0ThrXaa: 0.0 ± 0.0
Val
5.327ValAla: 5.327 ± 0.644
0.548ValCys: 0.548 ± 0.185
3.134ValAsp: 3.134 ± 0.351
3.29ValGlu: 3.29 ± 0.449
1.567ValPhe: 1.567 ± 0.411
4.935ValGly: 4.935 ± 0.602
1.018ValHis: 1.018 ± 0.304
3.447ValIle: 3.447 ± 0.609
4.779ValLys: 4.779 ± 0.628
5.249ValLeu: 5.249 ± 0.634
1.645ValMet: 1.645 ± 0.426
3.369ValAsn: 3.369 ± 0.449
2.977ValPro: 2.977 ± 0.459
2.664ValGln: 2.664 ± 0.504
4.544ValArg: 4.544 ± 0.525
4.935ValSer: 4.935 ± 0.604
5.249ValThr: 5.249 ± 0.746
5.014ValVal: 5.014 ± 0.772
1.253ValTrp: 1.253 ± 0.278
1.88ValTyr: 1.88 ± 0.366
0.0ValXaa: 0.0 ± 0.0
Trp
0.94TrpAla: 0.94 ± 0.193
0.313TrpCys: 0.313 ± 0.157
1.253TrpAsp: 1.253 ± 0.291
0.548TrpGlu: 0.548 ± 0.259
0.548TrpPhe: 0.548 ± 0.215
0.392TrpGly: 0.392 ± 0.141
0.392TrpHis: 0.392 ± 0.151
0.862TrpIle: 0.862 ± 0.266
1.018TrpLys: 1.018 ± 0.269
1.097TrpLeu: 1.097 ± 0.209
0.705TrpMet: 0.705 ± 0.19
0.548TrpAsn: 0.548 ± 0.177
0.313TrpPro: 0.313 ± 0.152
0.47TrpGln: 0.47 ± 0.196
1.175TrpArg: 1.175 ± 0.291
1.175TrpSer: 1.175 ± 0.325
0.783TrpThr: 0.783 ± 0.232
1.175TrpVal: 1.175 ± 0.366
0.313TrpTrp: 0.313 ± 0.132
0.313TrpTyr: 0.313 ± 0.224
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.977TyrAla: 2.977 ± 0.456
0.157TyrCys: 0.157 ± 0.157
1.88TyrAsp: 1.88 ± 0.396
2.115TyrGlu: 2.115 ± 0.378
0.862TyrPhe: 0.862 ± 0.193
2.193TyrGly: 2.193 ± 0.364
0.94TyrHis: 0.94 ± 0.275
1.175TyrIle: 1.175 ± 0.381
1.723TyrLys: 1.723 ± 0.342
2.742TyrLeu: 2.742 ± 0.483
0.862TyrMet: 0.862 ± 0.283
1.567TyrAsn: 1.567 ± 0.28
1.175TyrPro: 1.175 ± 0.276
1.41TyrGln: 1.41 ± 0.343
2.037TyrArg: 2.037 ± 0.439
1.802TyrSer: 1.802 ± 0.288
1.723TyrThr: 1.723 ± 0.407
2.115TyrVal: 2.115 ± 0.42
0.313TyrTrp: 0.313 ± 0.187
0.548TyrTyr: 0.548 ± 0.305
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 53 proteins (12766 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski