Amino acid dipeptide frequency for Enterobacter phage Ec_L1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random with equal probability, each of the 400 would be present with a frequency of 0.25%. This is not the case in nature, so for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
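The calculation described above can be sketched in Python. This is only an illustration, not the database's own code: the function names (`dipeptide_permille`, `classify`) are hypothetical, and it assumes that dipeptides are counted as overlapping pairs within each protein (never across protein boundaries) and that "expected" means the uniform 1/400 value, i.e. 2.5‰.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids (one-letter codes)
EXPECTED_UNIFORM = 1000.0 / 400     # 0.25% expressed in per mille = 2.5

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumption: overlapping residue pairs are counted inside each protein only.
    """
    counts = Counter()
    for seq in proteins:
        # Any non-standard or unknown residue is mapped to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}

def classify(freq_permille):
    """Label a frequency relative to the uniform expectation (assumed 2.5 per mille)."""
    if freq_permille > EXPECTED_UNIFORM:
        return "over"    # would be marked red
    if freq_permille < EXPECTED_UNIFORM:
        return "under"   # would be marked blue
    return "expected"
```

Under these assumptions, the AlaAla value from the table (9.46‰) is classified as "over" and the GlyPro value (0.064‰) as "under".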

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions; for example, 9.46‰ corresponds to 0.00946 (0.946%).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.46 ± 1.378
AlaCys: 1.662 ± 0.35
AlaAsp: 5.114 ± 0.651
AlaGlu: 5.689 ± 0.526
AlaPhe: 3.004 ± 0.49
AlaGly: 6.456 ± 0.618
AlaHis: 0.959 ± 0.264
AlaIle: 4.986 ± 0.518
AlaLys: 6.968 ± 1.12
AlaLeu: 6.328 ± 0.879
AlaMet: 2.621 ± 0.566
AlaAsn: 3.835 ± 0.773
AlaPro: 2.237 ± 0.4
AlaGln: 3.388 ± 0.591
AlaArg: 4.411 ± 0.543
AlaSer: 5.178 ± 0.69
AlaThr: 4.347 ± 0.52
AlaVal: 6.328 ± 0.71
AlaTrp: 1.278 ± 0.226
AlaTyr: 2.365 ± 0.342
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.278 ± 0.366
CysCys: 0.256 ± 0.164
CysAsp: 1.406 ± 0.289
CysGlu: 1.087 ± 0.306
CysPhe: 0.32 ± 0.142
CysGly: 1.726 ± 0.43
CysHis: 0.447 ± 0.198
CysIle: 0.959 ± 0.256
CysLys: 0.703 ± 0.191
CysLeu: 0.384 ± 0.151
CysMet: 0.447 ± 0.17
CysAsn: 0.511 ± 0.205
CysPro: 0.639 ± 0.218
CysGln: 0.447 ± 0.16
CysArg: 0.511 ± 0.193
CysSer: 0.703 ± 0.198
CysThr: 0.511 ± 0.184
CysVal: 0.959 ± 0.251
CysTrp: 0.384 ± 0.157
CysTyr: 0.32 ± 0.153
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.538 ± 0.662
AspCys: 0.639 ± 0.186
AspAsp: 3.835 ± 0.538
AspGlu: 4.091 ± 0.44
AspPhe: 2.813 ± 0.558
AspGly: 6.392 ± 0.692
AspHis: 1.087 ± 0.23
AspIle: 3.771 ± 0.496
AspLys: 5.369 ± 0.64
AspLeu: 3.771 ± 0.491
AspMet: 1.726 ± 0.329
AspAsn: 3.58 ± 0.486
AspPro: 1.854 ± 0.369
AspGln: 2.109 ± 0.411
AspArg: 2.237 ± 0.419
AspSer: 3.707 ± 0.474
AspThr: 2.365 ± 0.424
AspVal: 4.219 ± 0.522
AspTrp: 1.023 ± 0.258
AspTyr: 2.557 ± 0.399
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.05 ± 0.645
GluCys: 1.47 ± 0.292
GluAsp: 2.621 ± 0.42
GluGlu: 4.347 ± 0.601
GluPhe: 3.132 ± 0.439
GluGly: 4.219 ± 0.597
GluHis: 1.278 ± 0.306
GluIle: 4.858 ± 0.581
GluLys: 4.347 ± 0.474
GluLeu: 5.306 ± 0.611
GluMet: 2.365 ± 0.33
GluAsn: 3.452 ± 0.433
GluPro: 1.918 ± 0.357
GluGln: 3.068 ± 0.642
GluArg: 3.26 ± 0.428
GluSer: 3.771 ± 0.463
GluThr: 4.411 ± 0.788
GluVal: 4.475 ± 0.543
GluTrp: 0.767 ± 0.185
GluTyr: 3.068 ± 0.425
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.644 ± 0.45
PheCys: 0.32 ± 0.152
PheAsp: 3.196 ± 0.304
PheGlu: 2.557 ± 0.413
PhePhe: 1.215 ± 0.25
PheGly: 3.835 ± 0.592
PheHis: 0.703 ± 0.219
PheIle: 2.109 ± 0.383
PheLys: 2.813 ± 0.411
PheLeu: 2.046 ± 0.405
PheMet: 0.959 ± 0.269
PheAsn: 2.557 ± 0.472
PhePro: 1.342 ± 0.286
PheGln: 1.151 ± 0.281
PheArg: 2.237 ± 0.35
PheSer: 1.47 ± 0.291
PheThr: 2.109 ± 0.35
PheVal: 3.196 ± 0.475
PheTrp: 0.703 ± 0.193
PheTyr: 1.598 ± 0.326
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.306 ± 0.717
GlyCys: 1.342 ± 0.301
GlyAsp: 3.771 ± 0.518
GlyGlu: 4.347 ± 0.536
GlyPhe: 3.899 ± 0.458
GlyGly: 6.904 ± 0.917
GlyHis: 1.726 ± 0.429
GlyIle: 3.963 ± 0.525
GlyLys: 6.648 ± 0.578
GlyLeu: 5.178 ± 0.62
GlyMet: 3.132 ± 0.471
GlyAsn: 3.644 ± 0.496
GlyPro: 0.064 ± 0.071
GlyGln: 2.365 ± 0.385
GlyArg: 3.004 ± 0.366
GlySer: 5.306 ± 0.533
GlyThr: 3.899 ± 0.571
GlyVal: 6.648 ± 0.751
GlyTrp: 1.342 ± 0.285
GlyTyr: 2.813 ± 0.44
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.342 ± 0.399
HisCys: 0.575 ± 0.25
HisAsp: 0.895 ± 0.24
HisGlu: 1.534 ± 0.298
HisPhe: 0.831 ± 0.273
HisGly: 1.342 ± 0.349
HisHis: 0.703 ± 0.262
HisIle: 0.767 ± 0.224
HisLys: 1.278 ± 0.273
HisLeu: 1.023 ± 0.324
HisMet: 0.384 ± 0.173
HisAsn: 0.639 ± 0.21
HisPro: 1.023 ± 0.299
HisGln: 0.511 ± 0.163
HisArg: 1.215 ± 0.288
HisSer: 1.151 ± 0.357
HisThr: 1.215 ± 0.26
HisVal: 1.406 ± 0.373
HisTrp: 0.384 ± 0.133
HisTyr: 0.959 ± 0.236
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.433 ± 0.584
IleCys: 0.895 ± 0.255
IleAsp: 3.771 ± 0.387
IleGlu: 4.283 ± 0.448
IlePhe: 1.918 ± 0.391
IleGly: 4.027 ± 0.554
IleHis: 0.895 ± 0.272
IleIle: 4.027 ± 0.591
IleLys: 4.666 ± 0.589
IleLeu: 3.771 ± 0.624
IleMet: 2.046 ± 0.374
IleAsn: 2.877 ± 0.405
IlePro: 3.132 ± 0.427
IleGln: 1.982 ± 0.341
IleArg: 3.707 ± 0.48
IleSer: 3.58 ± 0.458
IleThr: 4.475 ± 0.506
IleVal: 4.475 ± 0.482
IleTrp: 0.959 ± 0.237
IleTyr: 1.342 ± 0.292
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.712 ± 0.741
LysCys: 1.087 ± 0.279
LysAsp: 4.538 ± 0.468
LysGlu: 4.922 ± 0.455
LysPhe: 2.749 ± 0.332
LysGly: 4.219 ± 0.505
LysHis: 1.534 ± 0.346
LysIle: 4.666 ± 0.555
LysLys: 4.794 ± 0.491
LysLeu: 5.178 ± 0.66
LysMet: 3.516 ± 0.576
LysAsn: 3.196 ± 0.463
LysPro: 2.877 ± 0.375
LysGln: 2.301 ± 0.383
LysArg: 4.027 ± 0.625
LysSer: 4.538 ± 0.516
LysThr: 4.666 ± 0.606
LysVal: 4.219 ± 0.515
LysTrp: 0.959 ± 0.252
LysTyr: 1.726 ± 0.362
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.392 ± 0.869
LeuCys: 1.151 ± 0.291
LeuAsp: 5.178 ± 0.53
LeuGlu: 3.707 ± 0.545
LeuPhe: 1.79 ± 0.413
LeuGly: 4.219 ± 0.599
LeuHis: 2.046 ± 0.462
LeuIle: 3.58 ± 0.472
LeuLys: 4.538 ± 0.462
LeuLeu: 3.899 ± 0.57
LeuMet: 1.406 ± 0.313
LeuAsn: 3.26 ± 0.484
LeuPro: 2.94 ± 0.378
LeuGln: 1.854 ± 0.345
LeuArg: 2.94 ± 0.309
LeuSer: 5.178 ± 0.441
LeuThr: 3.899 ± 0.518
LeuVal: 4.794 ± 0.526
LeuTrp: 0.831 ± 0.276
LeuTyr: 2.301 ± 0.366
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.877 ± 0.394
MetCys: 0.192 ± 0.097
MetAsp: 1.726 ± 0.279
MetGlu: 1.278 ± 0.334
MetPhe: 0.831 ± 0.234
MetGly: 1.406 ± 0.38
MetHis: 0.639 ± 0.228
MetIle: 2.046 ± 0.376
MetLys: 3.452 ± 0.475
MetLeu: 1.982 ± 0.371
MetMet: 0.767 ± 0.223
MetAsn: 1.726 ± 0.256
MetPro: 0.767 ± 0.195
MetGln: 1.023 ± 0.225
MetArg: 1.726 ± 0.398
MetSer: 2.365 ± 0.319
MetThr: 1.79 ± 0.344
MetVal: 2.173 ± 0.39
MetTrp: 0.192 ± 0.105
MetTyr: 1.215 ± 0.267
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.306 ± 1.009
AsnCys: 0.384 ± 0.141
AsnAsp: 2.94 ± 0.442
AsnGlu: 3.516 ± 0.385
AsnPhe: 1.47 ± 0.257
AsnGly: 5.306 ± 0.529
AsnHis: 0.959 ± 0.254
AsnIle: 2.685 ± 0.349
AsnLys: 1.982 ± 0.425
AsnLeu: 2.813 ± 0.511
AsnMet: 1.406 ± 0.286
AsnAsn: 2.173 ± 0.413
AsnPro: 1.662 ± 0.316
AsnGln: 1.982 ± 0.538
AsnArg: 2.173 ± 0.32
AsnSer: 3.707 ± 0.49
AsnThr: 2.173 ± 0.469
AsnVal: 2.493 ± 0.596
AsnTrp: 0.703 ± 0.198
AsnTyr: 1.598 ± 0.311
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.918 ± 0.343
ProCys: 0.32 ± 0.137
ProAsp: 2.046 ± 0.453
ProGlu: 3.324 ± 0.482
ProPhe: 1.406 ± 0.304
ProGly: 2.237 ± 0.383
ProHis: 0.767 ± 0.221
ProIle: 2.046 ± 0.393
ProLys: 1.342 ± 0.339
ProLeu: 1.79 ± 0.293
ProMet: 1.215 ± 0.251
ProAsn: 1.534 ± 0.297
ProPro: 0.831 ± 0.224
ProGln: 1.534 ± 0.349
ProArg: 1.662 ± 0.326
ProSer: 1.662 ± 0.303
ProThr: 1.47 ± 0.285
ProVal: 3.388 ± 0.383
ProTrp: 0.575 ± 0.205
ProTyr: 1.151 ± 0.233
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.452 ± 0.523
GlnCys: 0.384 ± 0.158
GlnAsp: 1.854 ± 0.306
GlnGlu: 2.237 ± 0.38
GlnPhe: 1.342 ± 0.263
GlnGly: 1.406 ± 0.281
GlnHis: 0.575 ± 0.194
GlnIle: 2.493 ± 0.518
GlnLys: 2.109 ± 0.314
GlnLeu: 3.004 ± 0.405
GlnMet: 0.703 ± 0.237
GlnAsn: 0.895 ± 0.196
GlnPro: 1.278 ± 0.293
GlnGln: 2.557 ± 0.593
GlnArg: 1.854 ± 0.3
GlnSer: 2.557 ± 0.543
GlnThr: 2.109 ± 0.398
GlnVal: 2.685 ± 0.509
GlnTrp: 0.192 ± 0.104
GlnTyr: 1.662 ± 0.368
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.963 ± 0.599
ArgCys: 0.575 ± 0.269
ArgAsp: 2.685 ± 0.362
ArgGlu: 3.644 ± 0.424
ArgPhe: 2.813 ± 0.464
ArgGly: 3.004 ± 0.404
ArgHis: 0.703 ± 0.236
ArgIle: 3.644 ± 0.441
ArgLys: 4.347 ± 0.506
ArgLeu: 3.132 ± 0.464
ArgMet: 1.278 ± 0.335
ArgAsn: 1.662 ± 0.313
ArgPro: 1.534 ± 0.399
ArgGln: 1.342 ± 0.301
ArgArg: 2.877 ± 0.375
ArgSer: 3.004 ± 0.454
ArgThr: 2.173 ± 0.326
ArgVal: 3.899 ± 0.753
ArgTrp: 0.767 ± 0.22
ArgTyr: 2.109 ± 0.313
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.753 ± 0.901
SerCys: 0.32 ± 0.134
SerAsp: 4.411 ± 0.48
SerGlu: 5.114 ± 0.854
SerPhe: 2.429 ± 0.294
SerGly: 6.584 ± 0.658
SerHis: 0.959 ± 0.244
SerIle: 3.771 ± 0.402
SerLys: 5.05 ± 0.596
SerLeu: 4.922 ± 0.49
SerMet: 1.342 ± 0.29
SerAsn: 2.301 ± 0.354
SerPro: 1.534 ± 0.317
SerGln: 2.046 ± 0.374
SerArg: 2.493 ± 0.44
SerSer: 3.516 ± 0.76
SerThr: 3.068 ± 0.379
SerVal: 5.561 ± 0.512
SerTrp: 0.767 ± 0.19
SerTyr: 1.79 ± 0.326
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.602 ± 0.773
ThrCys: 0.575 ± 0.222
ThrAsp: 3.196 ± 0.375
ThrGlu: 2.94 ± 0.419
ThrPhe: 2.429 ± 0.475
ThrGly: 4.922 ± 0.459
ThrHis: 1.023 ± 0.277
ThrIle: 3.963 ± 0.453
ThrLys: 3.196 ± 0.55
ThrLeu: 3.644 ± 0.416
ThrMet: 1.918 ± 0.395
ThrAsn: 2.94 ± 0.713
ThrPro: 2.429 ± 0.33
ThrGln: 1.47 ± 0.366
ThrArg: 2.046 ± 0.316
ThrSer: 3.452 ± 0.648
ThrThr: 2.749 ± 0.427
ThrVal: 4.475 ± 0.714
ThrTrp: 0.639 ± 0.225
ThrTyr: 2.109 ± 0.32
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.945 ± 0.643
ValCys: 1.023 ± 0.281
ValAsp: 5.05 ± 0.683
ValGlu: 5.306 ± 0.628
ValPhe: 3.004 ± 0.406
ValGly: 3.516 ± 0.574
ValHis: 1.278 ± 0.25
ValIle: 5.497 ± 0.747
ValLys: 6.648 ± 0.615
ValLeu: 3.835 ± 0.451
ValMet: 1.47 ± 0.324
ValAsn: 3.899 ± 0.623
ValPro: 2.557 ± 0.396
ValGln: 2.046 ± 0.5
ValArg: 3.835 ± 0.47
ValSer: 5.561 ± 0.671
ValThr: 4.411 ± 0.498
ValVal: 5.178 ± 0.662
ValTrp: 1.278 ± 0.273
ValTyr: 2.237 ± 0.367
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.023 ± 0.238
TrpCys: 0.128 ± 0.092
TrpAsp: 0.767 ± 0.217
TrpGlu: 1.023 ± 0.207
TrpPhe: 0.831 ± 0.238
TrpGly: 1.215 ± 0.279
TrpHis: 0.447 ± 0.159
TrpIle: 0.831 ± 0.252
TrpLys: 0.639 ± 0.176
TrpLeu: 1.47 ± 0.297
TrpMet: 0.256 ± 0.125
TrpAsn: 0.959 ± 0.222
TrpPro: 0.32 ± 0.154
TrpGln: 0.384 ± 0.166
TrpArg: 0.959 ± 0.265
TrpSer: 1.023 ± 0.269
TrpThr: 0.639 ± 0.173
TrpVal: 1.023 ± 0.255
TrpTrp: 0.384 ± 0.15
TrpTyr: 0.447 ± 0.155
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.621 ± 0.465
TyrCys: 0.703 ± 0.231
TyrAsp: 2.685 ± 0.419
TyrGlu: 2.237 ± 0.318
TyrPhe: 1.534 ± 0.298
TyrGly: 2.365 ± 0.393
TyrHis: 0.447 ± 0.17
TyrIle: 1.598 ± 0.28
TyrLys: 1.534 ± 0.328
TyrLeu: 2.429 ± 0.384
TyrMet: 1.151 ± 0.249
TyrAsn: 1.79 ± 0.377
TyrPro: 1.278 ± 0.278
TyrGln: 1.79 ± 0.295
TyrArg: 1.982 ± 0.32
TyrSer: 2.365 ± 0.365
TyrThr: 2.237 ± 0.294
TyrVal: 1.982 ± 0.372
TyrTrp: 0.639 ± 0.186
TyrTyr: 1.215 ± 0.243
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 85 proteins (15645 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
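A protein-level bootstrap of this kind might look like the following Python sketch. It is an illustration under stated assumptions rather than the database's actual procedure: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each of 100 resamples, and the error is taken as the sample standard deviation of those estimates. The function names and the choice of standard deviation as the error measure are assumptions.

```python
import random
import statistics
from collections import Counter

def dipeptide_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Estimate the error (in per mille) of one dipeptide frequency.

    Assumptions: proteins are resampled with replacement as whole units,
    and the error is the sample standard deviation over n_rounds resamples.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [dipeptide_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as the proteome contains, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)
```

Resampling whole proteins rather than individual dipeptides keeps the residues of each protein together, so the estimate reflects variation between proteins.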

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski