Amino acid dipepetide frequency for Escherichia phage Bp4

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.675AlaAla: 8.675 ± 1.131
0.6AlaCys: 0.6 ± 0.22
5.03AlaAsp: 5.03 ± 0.64
5.353AlaGlu: 5.353 ± 0.473
2.307AlaPhe: 2.307 ± 0.351
6.368AlaGly: 6.368 ± 0.715
1.154AlaHis: 1.154 ± 0.204
4.799AlaIle: 4.799 ± 0.516
5.399AlaLys: 5.399 ± 0.581
7.383AlaLeu: 7.383 ± 1.012
2.907AlaMet: 2.907 ± 0.467
5.353AlaAsn: 5.353 ± 0.417
2.399AlaPro: 2.399 ± 0.438
4.568AlaGln: 4.568 ± 0.761
3.322AlaArg: 3.322 ± 0.361
4.845AlaSer: 4.845 ± 0.416
5.906AlaThr: 5.906 ± 0.631
5.676AlaVal: 5.676 ± 0.549
0.923AlaTrp: 0.923 ± 0.2
3.415AlaTyr: 3.415 ± 0.37
0.0AlaXaa: 0.0 ± 0.0
Cys
0.554CysAla: 0.554 ± 0.171
0.092CysCys: 0.092 ± 0.109
0.415CysAsp: 0.415 ± 0.162
0.6CysGlu: 0.6 ± 0.175
0.369CysPhe: 0.369 ± 0.126
0.508CysGly: 0.508 ± 0.156
0.185CysHis: 0.185 ± 0.101
0.692CysIle: 0.692 ± 0.252
0.738CysLys: 0.738 ± 0.216
0.692CysLeu: 0.692 ± 0.246
0.323CysMet: 0.323 ± 0.141
0.554CysAsn: 0.554 ± 0.21
0.323CysPro: 0.323 ± 0.166
0.138CysGln: 0.138 ± 0.089
0.323CysArg: 0.323 ± 0.138
0.554CysSer: 0.554 ± 0.182
0.554CysThr: 0.554 ± 0.201
0.831CysVal: 0.831 ± 0.265
0.231CysTrp: 0.231 ± 0.101
0.277CysTyr: 0.277 ± 0.132
0.0CysXaa: 0.0 ± 0.0
Asp
4.66AspAla: 4.66 ± 0.585
0.877AspCys: 0.877 ± 0.253
3.415AspAsp: 3.415 ± 0.472
3.691AspGlu: 3.691 ± 0.499
2.076AspPhe: 2.076 ± 0.298
2.907AspGly: 2.907 ± 0.285
0.831AspHis: 0.831 ± 0.188
4.614AspIle: 4.614 ± 0.529
2.999AspLys: 2.999 ± 0.374
4.845AspLeu: 4.845 ± 0.496
1.615AspMet: 1.615 ± 0.263
2.353AspAsn: 2.353 ± 0.303
2.815AspPro: 2.815 ± 0.268
1.938AspGln: 1.938 ± 0.31
2.538AspArg: 2.538 ± 0.341
3.968AspSer: 3.968 ± 0.35
3.784AspThr: 3.784 ± 0.421
3.738AspVal: 3.738 ± 0.398
0.784AspTrp: 0.784 ± 0.181
2.123AspTyr: 2.123 ± 0.441
0.0AspXaa: 0.0 ± 0.0
Glu
6.598GluAla: 6.598 ± 0.902
0.461GluCys: 0.461 ± 0.17
3.738GluAsp: 3.738 ± 0.539
5.214GluGlu: 5.214 ± 0.732
2.399GluPhe: 2.399 ± 0.325
3.968GluGly: 3.968 ± 0.429
0.877GluHis: 0.877 ± 0.191
3.507GluIle: 3.507 ± 0.409
3.83GluLys: 3.83 ± 0.49
5.906GluLeu: 5.906 ± 0.405
1.938GluMet: 1.938 ± 0.243
3.23GluAsn: 3.23 ± 0.389
3.322GluPro: 3.322 ± 0.433
2.861GluGln: 2.861 ± 0.472
1.846GluArg: 1.846 ± 0.283
3.138GluSer: 3.138 ± 0.399
3.045GluThr: 3.045 ± 0.416
4.476GluVal: 4.476 ± 0.449
0.738GluTrp: 0.738 ± 0.15
2.307GluTyr: 2.307 ± 0.373
0.0GluXaa: 0.0 ± 0.0
Phe
2.722PheAla: 2.722 ± 0.37
0.323PheCys: 0.323 ± 0.134
2.399PheAsp: 2.399 ± 0.314
1.892PheGlu: 1.892 ± 0.258
0.923PhePhe: 0.923 ± 0.199
2.676PheGly: 2.676 ± 0.298
0.508PheHis: 0.508 ± 0.129
2.492PheIle: 2.492 ± 0.363
2.03PheLys: 2.03 ± 0.31
2.446PheLeu: 2.446 ± 0.369
1.477PheMet: 1.477 ± 0.235
2.538PheAsn: 2.538 ± 0.424
1.107PhePro: 1.107 ± 0.254
1.569PheGln: 1.569 ± 0.263
1.846PheArg: 1.846 ± 0.28
2.399PheSer: 2.399 ± 0.321
2.907PheThr: 2.907 ± 0.467
2.076PheVal: 2.076 ± 0.327
0.369PheTrp: 0.369 ± 0.149
1.43PheTyr: 1.43 ± 0.237
0.0PheXaa: 0.0 ± 0.0
Gly
5.399GlyAla: 5.399 ± 0.652
1.107GlyCys: 1.107 ± 0.316
2.953GlyAsp: 2.953 ± 0.332
3.738GlyGlu: 3.738 ± 0.369
2.815GlyPhe: 2.815 ± 0.408
3.968GlyGly: 3.968 ± 0.564
1.107GlyHis: 1.107 ± 0.208
3.922GlyIle: 3.922 ± 0.361
5.999GlyLys: 5.999 ± 0.518
4.937GlyLeu: 4.937 ± 0.513
2.399GlyMet: 2.399 ± 0.318
4.614GlyAsn: 4.614 ± 0.437
0.969GlyPro: 0.969 ± 0.208
2.215GlyGln: 2.215 ± 0.31
2.538GlyArg: 2.538 ± 0.417
5.491GlySer: 5.491 ± 0.581
4.753GlyThr: 4.753 ± 0.522
4.753GlyVal: 4.753 ± 0.463
1.015GlyTrp: 1.015 ± 0.239
3.092GlyTyr: 3.092 ± 0.368
0.0GlyXaa: 0.0 ± 0.0
His
1.2HisAla: 1.2 ± 0.184
0.185HisCys: 0.185 ± 0.107
1.061HisAsp: 1.061 ± 0.263
0.969HisGlu: 0.969 ± 0.201
0.461HisPhe: 0.461 ± 0.179
0.877HisGly: 0.877 ± 0.253
0.461HisHis: 0.461 ± 0.149
1.154HisIle: 1.154 ± 0.187
1.523HisLys: 1.523 ± 0.296
2.215HisLeu: 2.215 ± 0.296
0.369HisMet: 0.369 ± 0.111
0.923HisAsn: 0.923 ± 0.188
0.738HisPro: 0.738 ± 0.241
0.323HisGln: 0.323 ± 0.112
0.692HisArg: 0.692 ± 0.201
1.477HisSer: 1.477 ± 0.262
0.923HisThr: 0.923 ± 0.231
0.784HisVal: 0.784 ± 0.193
0.369HisTrp: 0.369 ± 0.118
1.061HisTyr: 1.061 ± 0.159
0.0HisXaa: 0.0 ± 0.0
Ile
5.076IleAla: 5.076 ± 0.497
0.461IleCys: 0.461 ± 0.173
3.461IleAsp: 3.461 ± 0.498
3.599IleGlu: 3.599 ± 0.318
1.477IlePhe: 1.477 ± 0.294
3.645IleGly: 3.645 ± 0.397
1.292IleHis: 1.292 ± 0.28
2.999IleIle: 2.999 ± 0.489
3.922IleLys: 3.922 ± 0.385
4.384IleLeu: 4.384 ± 0.502
1.384IleMet: 1.384 ± 0.195
3.645IleAsn: 3.645 ± 0.36
3.092IlePro: 3.092 ± 0.418
2.538IleGln: 2.538 ± 0.329
3.092IleArg: 3.092 ± 0.342
3.184IleSer: 3.184 ± 0.387
4.337IleThr: 4.337 ± 0.487
3.322IleVal: 3.322 ± 0.43
0.461IleTrp: 0.461 ± 0.161
2.353IleTyr: 2.353 ± 0.312
0.0IleXaa: 0.0 ± 0.0
Lys
6.46LysAla: 6.46 ± 0.605
0.415LysCys: 0.415 ± 0.195
3.184LysAsp: 3.184 ± 0.338
4.66LysGlu: 4.66 ± 0.492
2.492LysPhe: 2.492 ± 0.333
3.738LysGly: 3.738 ± 0.383
1.753LysHis: 1.753 ± 0.299
2.63LysIle: 2.63 ± 0.394
3.23LysLys: 3.23 ± 0.397
6.275LysLeu: 6.275 ± 0.474
1.892LysMet: 1.892 ± 0.356
3.23LysAsn: 3.23 ± 0.522
2.953LysPro: 2.953 ± 0.484
3.092LysGln: 3.092 ± 0.377
2.861LysArg: 2.861 ± 0.344
3.83LysSer: 3.83 ± 0.454
4.384LysThr: 4.384 ± 0.445
4.153LysVal: 4.153 ± 0.351
0.692LysTrp: 0.692 ± 0.218
1.753LysTyr: 1.753 ± 0.316
0.0LysXaa: 0.0 ± 0.0
Leu
7.475LeuAla: 7.475 ± 0.559
0.6LeuCys: 0.6 ± 0.205
4.614LeuAsp: 4.614 ± 0.504
4.476LeuGlu: 4.476 ± 0.425
3.045LeuPhe: 3.045 ± 0.349
6.506LeuGly: 6.506 ± 0.463
1.477LeuHis: 1.477 ± 0.264
4.568LeuIle: 4.568 ± 0.469
5.214LeuLys: 5.214 ± 0.508
5.952LeuLeu: 5.952 ± 0.51
2.907LeuMet: 2.907 ± 0.4
5.076LeuAsn: 5.076 ± 0.514
4.384LeuPro: 4.384 ± 0.427
3.092LeuGln: 3.092 ± 0.488
3.738LeuArg: 3.738 ± 0.437
5.491LeuSer: 5.491 ± 0.54
5.353LeuThr: 5.353 ± 0.426
6.322LeuVal: 6.322 ± 0.477
0.831LeuTrp: 0.831 ± 0.174
2.538LeuTyr: 2.538 ± 0.341
0.0LeuXaa: 0.0 ± 0.0
Met
2.722MetAla: 2.722 ± 0.507
0.138MetCys: 0.138 ± 0.091
1.246MetAsp: 1.246 ± 0.226
2.076MetGlu: 2.076 ± 0.333
0.646MetPhe: 0.646 ± 0.19
1.753MetGly: 1.753 ± 0.245
0.369MetHis: 0.369 ± 0.115
1.615MetIle: 1.615 ± 0.263
2.538MetLys: 2.538 ± 0.389
2.492MetLeu: 2.492 ± 0.295
0.831MetMet: 0.831 ± 0.206
2.123MetAsn: 2.123 ± 0.31
1.107MetPro: 1.107 ± 0.238
1.615MetGln: 1.615 ± 0.301
1.2MetArg: 1.2 ± 0.211
2.63MetSer: 2.63 ± 0.291
1.753MetThr: 1.753 ± 0.305
1.753MetVal: 1.753 ± 0.299
0.277MetTrp: 0.277 ± 0.091
0.646MetTyr: 0.646 ± 0.142
0.0MetXaa: 0.0 ± 0.0
Asn
4.337AsnAla: 4.337 ± 0.601
0.231AsnCys: 0.231 ± 0.114
2.722AsnAsp: 2.722 ± 0.325
3.322AsnGlu: 3.322 ± 0.465
2.076AsnPhe: 2.076 ± 0.324
4.384AsnGly: 4.384 ± 0.586
1.292AsnHis: 1.292 ± 0.257
3.23AsnIle: 3.23 ± 0.37
3.922AsnLys: 3.922 ± 0.429
4.66AsnLeu: 4.66 ± 0.586
1.384AsnMet: 1.384 ± 0.224
3.045AsnAsn: 3.045 ± 0.312
3.092AsnPro: 3.092 ± 0.449
3.276AsnGln: 3.276 ± 0.398
3.138AsnArg: 3.138 ± 0.337
3.138AsnSer: 3.138 ± 0.449
3.553AsnThr: 3.553 ± 0.55
3.368AsnVal: 3.368 ± 0.41
0.969AsnTrp: 0.969 ± 0.245
2.03AsnTyr: 2.03 ± 0.428
0.0AsnXaa: 0.0 ± 0.0
Pro
3.461ProAla: 3.461 ± 0.478
0.277ProCys: 0.277 ± 0.121
2.261ProAsp: 2.261 ± 0.441
4.061ProGlu: 4.061 ± 0.413
2.169ProPhe: 2.169 ± 0.375
2.492ProGly: 2.492 ± 0.307
0.369ProHis: 0.369 ± 0.158
2.169ProIle: 2.169 ± 0.312
1.938ProLys: 1.938 ± 0.266
3.045ProLeu: 3.045 ± 0.368
1.292ProMet: 1.292 ± 0.216
1.984ProAsn: 1.984 ± 0.254
1.061ProPro: 1.061 ± 0.281
1.338ProGln: 1.338 ± 0.22
1.338ProArg: 1.338 ± 0.299
2.399ProSer: 2.399 ± 0.31
3.23ProThr: 3.23 ± 0.37
4.014ProVal: 4.014 ± 0.406
0.646ProTrp: 0.646 ± 0.183
1.384ProTyr: 1.384 ± 0.22
0.0ProXaa: 0.0 ± 0.0
Gln
5.122GlnAla: 5.122 ± 0.727
0.231GlnCys: 0.231 ± 0.103
1.984GlnAsp: 1.984 ± 0.299
2.769GlnGlu: 2.769 ± 0.355
1.661GlnPhe: 1.661 ± 0.247
2.815GlnGly: 2.815 ± 0.435
0.461GlnHis: 0.461 ± 0.128
2.399GlnIle: 2.399 ± 0.329
2.815GlnLys: 2.815 ± 0.453
3.922GlnLeu: 3.922 ± 0.483
1.154GlnMet: 1.154 ± 0.223
1.984GlnAsn: 1.984 ± 0.328
1.061GlnPro: 1.061 ± 0.229
1.938GlnGln: 1.938 ± 0.36
1.8GlnArg: 1.8 ± 0.259
2.492GlnSer: 2.492 ± 0.261
2.676GlnThr: 2.676 ± 0.351
3.599GlnVal: 3.599 ± 0.397
0.415GlnTrp: 0.415 ± 0.129
1.753GlnTyr: 1.753 ± 0.226
0.0GlnXaa: 0.0 ± 0.0
Arg
3.368ArgAla: 3.368 ± 0.524
0.369ArgCys: 0.369 ± 0.166
2.446ArgAsp: 2.446 ± 0.282
2.953ArgGlu: 2.953 ± 0.452
1.477ArgPhe: 1.477 ± 0.272
2.63ArgGly: 2.63 ± 0.268
0.831ArgHis: 0.831 ± 0.2
2.907ArgIle: 2.907 ± 0.421
3.23ArgLys: 3.23 ± 0.414
4.199ArgLeu: 4.199 ± 0.359
1.154ArgMet: 1.154 ± 0.246
3.184ArgAsn: 3.184 ± 0.421
1.477ArgPro: 1.477 ± 0.263
2.03ArgGln: 2.03 ± 0.305
1.984ArgArg: 1.984 ± 0.31
2.399ArgSer: 2.399 ± 0.375
2.492ArgThr: 2.492 ± 0.371
2.353ArgVal: 2.353 ± 0.322
0.508ArgTrp: 0.508 ± 0.141
1.246ArgTyr: 1.246 ± 0.228
0.0ArgXaa: 0.0 ± 0.0
Ser
4.245SerAla: 4.245 ± 0.475
0.784SerCys: 0.784 ± 0.237
3.645SerAsp: 3.645 ± 0.425
3.876SerGlu: 3.876 ± 0.395
2.353SerPhe: 2.353 ± 0.359
4.983SerGly: 4.983 ± 0.423
0.738SerHis: 0.738 ± 0.169
3.784SerIle: 3.784 ± 0.436
3.507SerLys: 3.507 ± 0.438
6.552SerLeu: 6.552 ± 0.593
1.523SerMet: 1.523 ± 0.334
2.953SerAsn: 2.953 ± 0.346
2.63SerPro: 2.63 ± 0.427
2.492SerGln: 2.492 ± 0.341
2.815SerArg: 2.815 ± 0.301
3.461SerSer: 3.461 ± 0.442
4.384SerThr: 4.384 ± 0.587
4.337SerVal: 4.337 ± 0.452
0.692SerTrp: 0.692 ± 0.188
2.03SerTyr: 2.03 ± 0.323
0.0SerXaa: 0.0 ± 0.0
Thr
4.66ThrAla: 4.66 ± 0.56
0.461ThrCys: 0.461 ± 0.157
3.876ThrAsp: 3.876 ± 0.324
3.322ThrGlu: 3.322 ± 0.444
3.138ThrPhe: 3.138 ± 0.334
5.214ThrGly: 5.214 ± 0.499
1.154ThrHis: 1.154 ± 0.268
4.107ThrIle: 4.107 ± 0.476
3.645ThrLys: 3.645 ± 0.404
5.722ThrLeu: 5.722 ± 0.441
1.154ThrMet: 1.154 ± 0.262
4.107ThrAsn: 4.107 ± 0.542
3.415ThrPro: 3.415 ± 0.452
2.353ThrGln: 2.353 ± 0.297
2.815ThrArg: 2.815 ± 0.385
4.153ThrSer: 4.153 ± 0.459
3.23ThrThr: 3.23 ± 0.57
5.03ThrVal: 5.03 ± 0.555
0.692ThrTrp: 0.692 ± 0.174
2.123ThrTyr: 2.123 ± 0.397
0.0ThrXaa: 0.0 ± 0.0
Val
6.183ValAla: 6.183 ± 0.654
0.646ValCys: 0.646 ± 0.217
4.614ValAsp: 4.614 ± 0.443
4.245ValGlu: 4.245 ± 0.415
2.307ValPhe: 2.307 ± 0.356
5.168ValGly: 5.168 ± 0.626
1.707ValHis: 1.707 ± 0.228
3.276ValIle: 3.276 ± 0.444
3.83ValLys: 3.83 ± 0.41
4.245ValLeu: 4.245 ± 0.39
2.492ValMet: 2.492 ± 0.286
3.922ValAsn: 3.922 ± 0.408
3.23ValPro: 3.23 ± 0.517
3.276ValGln: 3.276 ± 0.403
3.322ValArg: 3.322 ± 0.366
4.014ValSer: 4.014 ± 0.514
4.845ValThr: 4.845 ± 0.619
5.353ValVal: 5.353 ± 0.585
0.738ValTrp: 0.738 ± 0.168
2.676ValTyr: 2.676 ± 0.391
0.0ValXaa: 0.0 ± 0.0
Trp
0.784TrpAla: 0.784 ± 0.207
0.185TrpCys: 0.185 ± 0.09
0.877TrpAsp: 0.877 ± 0.243
0.554TrpGlu: 0.554 ± 0.179
0.369TrpPhe: 0.369 ± 0.142
0.554TrpGly: 0.554 ± 0.149
0.323TrpHis: 0.323 ± 0.135
0.554TrpIle: 0.554 ± 0.155
0.784TrpLys: 0.784 ± 0.183
1.292TrpLeu: 1.292 ± 0.227
0.369TrpMet: 0.369 ± 0.168
0.554TrpAsn: 0.554 ± 0.165
0.369TrpPro: 0.369 ± 0.119
0.6TrpGln: 0.6 ± 0.14
0.554TrpArg: 0.554 ± 0.147
0.646TrpSer: 0.646 ± 0.193
0.461TrpThr: 0.461 ± 0.121
1.523TrpVal: 1.523 ± 0.306
0.092TrpTrp: 0.092 ± 0.07
0.461TrpTyr: 0.461 ± 0.157
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.676TyrAla: 2.676 ± 0.287
0.508TyrCys: 0.508 ± 0.208
2.63TyrAsp: 2.63 ± 0.231
2.03TyrGlu: 2.03 ± 0.284
1.569TyrPhe: 1.569 ± 0.347
2.63TyrGly: 2.63 ± 0.321
0.877TyrHis: 0.877 ± 0.234
2.307TyrIle: 2.307 ± 0.376
2.584TyrLys: 2.584 ± 0.396
2.492TyrLeu: 2.492 ± 0.299
0.923TyrMet: 0.923 ± 0.17
1.8TyrAsn: 1.8 ± 0.256
1.384TyrPro: 1.384 ± 0.249
1.8TyrGln: 1.8 ± 0.299
1.523TyrArg: 1.523 ± 0.283
2.076TyrSer: 2.076 ± 0.393
1.8TyrThr: 1.8 ± 0.292
2.63TyrVal: 2.63 ± 0.361
0.461TyrTrp: 0.461 ± 0.12
1.2TyrTyr: 1.2 ± 0.292
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 94 proteins (21673 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski