Amino acid dipeptide frequency for Enterobacteria phage N4 (Bacteriophage N4)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
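The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's actual pipeline: the function names `dipeptide_frequencies` and `classify` are hypothetical, the toy sequences are invented, and the choices to skip pairs containing non-standard residues and to normalize by the total number of counted dipeptides are assumptions.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_frequencies(proteins):
    """Return the per-mille frequency of each of the 400 standard dipeptides.

    Assumption: pairs containing a non-standard residue are skipped, and
    frequencies are normalized by the total number of counted pairs.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs; a pair never spans two proteins.
        for a, b in zip(seq, seq[1:]):
            if a in AMINO_ACIDS and b in AMINO_ACIDS:
                counts[a + b] += 1
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(AMINO_ACIDS, repeat=2)
    }


def classify(freq_permille, expected_permille=1000.0 / 400):
    """Compare a frequency with the uniform expectation (2.5 per mille)."""
    if freq_permille > expected_permille:
        return "over"   # would be marked red in the table
    if freq_permille < expected_permille:
        return "under"  # would be marked blue in the table
    return "expected"


if __name__ == "__main__":
    toy_proteins = ["AAAA", "ACA"]  # invented sequences, for illustration only
    freqs = dipeptide_frequencies(toy_proteins)
    print(freqs["AA"], classify(freqs["AA"]))
```

Applied to values from the table, `classify(7.581)` (AlaAla) reports overrepresentation and `classify(0.59)` (AlaCys) underrepresentation relative to the uniform 2.5‰ baseline.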

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
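As a quick illustration of the unit conversion, the snippet below turns a per-mille value into a fraction and a percentage and compares it with the uniform expectation of 1/400. The helper names are hypothetical; the AlaAla value is taken from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per-mille value to a percentage (divide by 10)."""
    return value_permille / 10.0


if __name__ == "__main__":
    ala_ala = 7.581                 # AlaAla frequency from the table, in per mille
    uniform = 1000.0 / 400          # uniform expectation: 2.5 per mille (0.25%)
    print(permille_to_fraction(ala_ala))   # fraction of all dipeptides
    print(permille_to_percent(uniform))    # the 0.25% quoted in the text
    print(ala_ala / uniform)               # fold difference versus uniform
```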

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
7.581AlaAla: 7.581 ± 1.087
0.59AlaCys: 0.59 ± 0.168
4.403AlaAsp: 4.403 ± 0.52
5.765AlaGlu: 5.765 ± 0.609
2.633AlaPhe: 2.633 ± 0.355
6.174AlaGly: 6.174 ± 0.697
0.999AlaHis: 0.999 ± 0.238
5.13AlaIle: 5.13 ± 0.525
5.311AlaLys: 5.311 ± 0.644
8.444AlaLeu: 8.444 ± 0.731
2.588AlaMet: 2.588 ± 0.325
4.222AlaAsn: 4.222 ± 0.394
2.951AlaPro: 2.951 ± 0.49
3.859AlaGln: 3.859 ± 0.661
3.496AlaArg: 3.496 ± 0.507
4.131AlaSer: 4.131 ± 0.384
6.038AlaThr: 6.038 ± 0.567
5.584AlaVal: 5.584 ± 0.475
0.999AlaTrp: 0.999 ± 0.22
3.859AlaTyr: 3.859 ± 0.375
0.0AlaXaa: 0.0 ± 0.0
Cys
0.545CysAla: 0.545 ± 0.154
0.182CysCys: 0.182 ± 0.098
0.454CysAsp: 0.454 ± 0.155
0.409CysGlu: 0.409 ± 0.124
0.272CysPhe: 0.272 ± 0.111
0.817CysGly: 0.817 ± 0.275
0.318CysHis: 0.318 ± 0.121
0.726CysIle: 0.726 ± 0.196
0.999CysLys: 0.999 ± 0.252
0.863CysLeu: 0.863 ± 0.239
0.182CysMet: 0.182 ± 0.09
0.545CysAsn: 0.545 ± 0.21
0.136CysPro: 0.136 ± 0.086
0.318CysGln: 0.318 ± 0.164
0.409CysArg: 0.409 ± 0.181
0.681CysSer: 0.681 ± 0.191
0.59CysThr: 0.59 ± 0.187
0.454CysVal: 0.454 ± 0.151
0.136CysTrp: 0.136 ± 0.072
0.182CysTyr: 0.182 ± 0.09
0.0CysXaa: 0.0 ± 0.0
Asp
5.311AspAla: 5.311 ± 0.406
0.726AspCys: 0.726 ± 0.278
3.314AspAsp: 3.314 ± 0.42
3.223AspGlu: 3.223 ± 0.47
1.997AspPhe: 1.997 ± 0.293
3.768AspGly: 3.768 ± 0.382
0.772AspHis: 0.772 ± 0.187
4.494AspIle: 4.494 ± 0.459
4.222AspLys: 4.222 ± 0.427
4.403AspLeu: 4.403 ± 0.469
1.362AspMet: 1.362 ± 0.208
2.406AspAsn: 2.406 ± 0.387
3.223AspPro: 3.223 ± 0.408
1.997AspGln: 1.997 ± 0.301
2.315AspArg: 2.315 ± 0.268
3.768AspSer: 3.768 ± 0.394
3.132AspThr: 3.132 ± 0.365
4.177AspVal: 4.177 ± 0.482
1.044AspTrp: 1.044 ± 0.237
2.542AspTyr: 2.542 ± 0.361
0.0AspXaa: 0.0 ± 0.0
Glu
5.72GluAla: 5.72 ± 0.599
0.499GluCys: 0.499 ± 0.157
3.904GluAsp: 3.904 ± 0.423
5.13GluGlu: 5.13 ± 0.678
2.588GluPhe: 2.588 ± 0.3
3.904GluGly: 3.904 ± 0.369
1.589GluHis: 1.589 ± 0.233
3.768GluIle: 3.768 ± 0.361
3.904GluLys: 3.904 ± 0.573
5.584GluLeu: 5.584 ± 0.55
2.043GluMet: 2.043 ± 0.221
2.769GluAsn: 2.769 ± 0.331
2.633GluPro: 2.633 ± 0.418
3.632GluGln: 3.632 ± 0.456
2.315GluArg: 2.315 ± 0.251
3.632GluSer: 3.632 ± 0.384
3.359GluThr: 3.359 ± 0.357
4.585GluVal: 4.585 ± 0.529
0.953GluTrp: 0.953 ± 0.204
2.815GluTyr: 2.815 ± 0.401
0.0GluXaa: 0.0 ± 0.0
Phe
2.361PheAla: 2.361 ± 0.306
0.409PheCys: 0.409 ± 0.148
2.134PheAsp: 2.134 ± 0.284
1.589PheGlu: 1.589 ± 0.25
1.226PhePhe: 1.226 ± 0.26
2.406PheGly: 2.406 ± 0.286
0.681PheHis: 0.681 ± 0.184
2.315PheIle: 2.315 ± 0.334
2.27PheLys: 2.27 ± 0.317
2.678PheLeu: 2.678 ± 0.266
1.634PheMet: 1.634 ± 0.225
2.361PheAsn: 2.361 ± 0.287
1.09PhePro: 1.09 ± 0.255
1.407PheGln: 1.407 ± 0.302
1.634PheArg: 1.634 ± 0.252
2.088PheSer: 2.088 ± 0.29
2.996PheThr: 2.996 ± 0.458
2.27PheVal: 2.27 ± 0.344
0.318PheTrp: 0.318 ± 0.128
1.589PheTyr: 1.589 ± 0.218
0.0PheXaa: 0.0 ± 0.0
Gly
5.357GlyAla: 5.357 ± 0.637
0.59GlyCys: 0.59 ± 0.165
2.996GlyAsp: 2.996 ± 0.434
3.632GlyGlu: 3.632 ± 0.295
2.905GlyPhe: 2.905 ± 0.364
3.178GlyGly: 3.178 ± 0.505
1.226GlyHis: 1.226 ± 0.263
4.494GlyIle: 4.494 ± 0.406
5.266GlyLys: 5.266 ± 0.505
5.039GlyLeu: 5.039 ± 0.567
2.224GlyMet: 2.224 ± 0.285
4.358GlyAsn: 4.358 ± 0.434
0.681GlyPro: 0.681 ± 0.121
2.361GlyGln: 2.361 ± 0.395
2.588GlyArg: 2.588 ± 0.427
4.54GlySer: 4.54 ± 0.419
4.721GlyThr: 4.721 ± 0.386
4.222GlyVal: 4.222 ± 0.52
0.863GlyTrp: 0.863 ± 0.195
2.769GlyTyr: 2.769 ± 0.34
0.0GlyXaa: 0.0 ± 0.0
His
1.317HisAla: 1.317 ± 0.311
0.318HisCys: 0.318 ± 0.117
0.908HisAsp: 0.908 ± 0.279
1.317HisGlu: 1.317 ± 0.268
0.817HisPhe: 0.817 ± 0.198
1.135HisGly: 1.135 ± 0.253
0.499HisHis: 0.499 ± 0.157
1.18HisIle: 1.18 ± 0.213
1.09HisLys: 1.09 ± 0.189
1.997HisLeu: 1.997 ± 0.279
0.681HisMet: 0.681 ± 0.188
0.545HisAsn: 0.545 ± 0.196
0.953HisPro: 0.953 ± 0.204
0.726HisGln: 0.726 ± 0.228
1.09HisArg: 1.09 ± 0.217
1.044HisSer: 1.044 ± 0.186
0.772HisThr: 0.772 ± 0.171
1.18HisVal: 1.18 ± 0.274
0.363HisTrp: 0.363 ± 0.129
0.863HisTyr: 0.863 ± 0.229
0.0HisXaa: 0.0 ± 0.0
Ile
4.767IleAla: 4.767 ± 0.663
0.545IleCys: 0.545 ± 0.153
4.177IleAsp: 4.177 ± 0.45
4.086IleGlu: 4.086 ± 0.31
1.725IlePhe: 1.725 ± 0.362
3.178IleGly: 3.178 ± 0.334
1.271IleHis: 1.271 ± 0.242
3.632IleIle: 3.632 ± 0.555
4.767IleLys: 4.767 ± 0.527
3.95IleLeu: 3.95 ± 0.402
1.453IleMet: 1.453 ± 0.263
3.132IleAsn: 3.132 ± 0.312
3.496IlePro: 3.496 ± 0.442
2.315IleGln: 2.315 ± 0.288
3.132IleArg: 3.132 ± 0.384
3.178IleSer: 3.178 ± 0.407
4.585IleThr: 4.585 ± 0.483
3.723IleVal: 3.723 ± 0.459
0.454IleTrp: 0.454 ± 0.152
2.27IleTyr: 2.27 ± 0.336
0.0IleXaa: 0.0 ± 0.0
Lys
7.036LysAla: 7.036 ± 0.562
0.59LysCys: 0.59 ± 0.198
4.676LysAsp: 4.676 ± 0.59
4.449LysGlu: 4.449 ± 0.519
2.224LysPhe: 2.224 ± 0.327
3.723LysGly: 3.723 ± 0.416
1.407LysHis: 1.407 ± 0.252
2.905LysIle: 2.905 ± 0.343
4.403LysLys: 4.403 ± 0.642
6.356LysLeu: 6.356 ± 0.542
2.179LysMet: 2.179 ± 0.35
3.042LysAsn: 3.042 ± 0.385
2.588LysPro: 2.588 ± 0.424
3.45LysGln: 3.45 ± 0.402
2.815LysArg: 2.815 ± 0.348
3.314LysSer: 3.314 ± 0.387
4.403LysThr: 4.403 ± 0.427
4.676LysVal: 4.676 ± 0.361
0.953LysTrp: 0.953 ± 0.263
1.816LysTyr: 1.816 ± 0.307
0.0LysXaa: 0.0 ± 0.0
Leu
7.4LeuAla: 7.4 ± 0.741
0.953LeuCys: 0.953 ± 0.276
5.72LeuAsp: 5.72 ± 0.667
4.676LeuGlu: 4.676 ± 0.444
3.269LeuPhe: 3.269 ± 0.379
5.992LeuGly: 5.992 ± 0.661
1.453LeuHis: 1.453 ± 0.36
3.95LeuIle: 3.95 ± 0.453
4.994LeuLys: 4.994 ± 0.4
6.492LeuLeu: 6.492 ± 0.619
2.27LeuMet: 2.27 ± 0.466
5.221LeuAsn: 5.221 ± 0.556
3.541LeuPro: 3.541 ± 0.325
3.314LeuGln: 3.314 ± 0.334
4.086LeuArg: 4.086 ± 0.518
4.812LeuSer: 4.812 ± 0.448
6.083LeuThr: 6.083 ± 0.527
5.538LeuVal: 5.538 ± 0.481
0.681LeuTrp: 0.681 ± 0.182
1.952LeuTyr: 1.952 ± 0.287
0.0LeuXaa: 0.0 ± 0.0
Met
2.769MetAla: 2.769 ± 0.335
0.091MetCys: 0.091 ± 0.064
1.271MetAsp: 1.271 ± 0.297
2.497MetGlu: 2.497 ± 0.313
0.772MetPhe: 0.772 ± 0.203
1.498MetGly: 1.498 ± 0.268
0.409MetHis: 0.409 ± 0.134
1.407MetIle: 1.407 ± 0.247
2.134MetLys: 2.134 ± 0.283
2.043MetLeu: 2.043 ± 0.323
0.681MetMet: 0.681 ± 0.148
1.634MetAsn: 1.634 ± 0.27
1.543MetPro: 1.543 ± 0.302
1.68MetGln: 1.68 ± 0.26
1.226MetArg: 1.226 ± 0.23
2.678MetSer: 2.678 ± 0.305
2.179MetThr: 2.179 ± 0.305
1.952MetVal: 1.952 ± 0.318
0.182MetTrp: 0.182 ± 0.079
1.135MetTyr: 1.135 ± 0.178
0.0MetXaa: 0.0 ± 0.0
Asn
4.222AsnAla: 4.222 ± 0.531
0.454AsnCys: 0.454 ± 0.134
2.815AsnAsp: 2.815 ± 0.302
3.269AsnGlu: 3.269 ± 0.422
1.589AsnPhe: 1.589 ± 0.275
3.632AsnGly: 3.632 ± 0.524
1.362AsnHis: 1.362 ± 0.24
3.087AsnIle: 3.087 ± 0.349
3.677AsnLys: 3.677 ± 0.41
5.266AsnLeu: 5.266 ± 0.504
1.77AsnMet: 1.77 ± 0.227
3.269AsnAsn: 3.269 ± 0.451
3.405AsnPro: 3.405 ± 0.612
2.769AsnGln: 2.769 ± 0.354
2.724AsnArg: 2.724 ± 0.292
2.451AsnSer: 2.451 ± 0.316
3.723AsnThr: 3.723 ± 0.484
3.132AsnVal: 3.132 ± 0.413
0.817AsnTrp: 0.817 ± 0.234
2.451AsnTyr: 2.451 ± 0.384
0.0AsnXaa: 0.0 ± 0.0
Pro
3.269ProAla: 3.269 ± 0.37
0.272ProCys: 0.272 ± 0.09
2.769ProAsp: 2.769 ± 0.34
3.723ProGlu: 3.723 ± 0.495
1.634ProPhe: 1.634 ± 0.326
2.406ProGly: 2.406 ± 0.428
0.409ProHis: 0.409 ± 0.135
2.315ProIle: 2.315 ± 0.242
2.633ProLys: 2.633 ± 0.369
2.951ProLeu: 2.951 ± 0.312
0.953ProMet: 0.953 ± 0.155
2.588ProAsn: 2.588 ± 0.351
1.044ProPro: 1.044 ± 0.217
1.407ProGln: 1.407 ± 0.199
1.226ProArg: 1.226 ± 0.267
2.406ProSer: 2.406 ± 0.382
3.405ProThr: 3.405 ± 0.378
3.95ProVal: 3.95 ± 0.498
0.59ProTrp: 0.59 ± 0.211
1.09ProTyr: 1.09 ± 0.194
0.0ProXaa: 0.0 ± 0.0
Gln
4.449GlnAla: 4.449 ± 0.716
0.454GlnCys: 0.454 ± 0.132
1.861GlnAsp: 1.861 ± 0.282
3.768GlnGlu: 3.768 ± 0.442
1.407GlnPhe: 1.407 ± 0.206
2.315GlnGly: 2.315 ± 0.393
0.726GlnHis: 0.726 ± 0.148
2.86GlnIle: 2.86 ± 0.344
2.905GlnLys: 2.905 ± 0.429
3.995GlnLeu: 3.995 ± 0.46
1.362GlnMet: 1.362 ± 0.227
2.724GlnAsn: 2.724 ± 0.482
1.589GlnPro: 1.589 ± 0.292
1.634GlnGln: 1.634 ± 0.274
1.453GlnArg: 1.453 ± 0.31
2.451GlnSer: 2.451 ± 0.459
2.179GlnThr: 2.179 ± 0.268
3.496GlnVal: 3.496 ± 0.322
0.499GlnTrp: 0.499 ± 0.133
1.589GlnTyr: 1.589 ± 0.254
0.0GlnXaa: 0.0 ± 0.0
Arg
3.042ArgAla: 3.042 ± 0.392
0.454ArgCys: 0.454 ± 0.186
2.497ArgAsp: 2.497 ± 0.279
2.724ArgGlu: 2.724 ± 0.372
1.543ArgPhe: 1.543 ± 0.253
2.27ArgGly: 2.27 ± 0.34
0.636ArgHis: 0.636 ± 0.152
3.223ArgIle: 3.223 ± 0.351
3.132ArgLys: 3.132 ± 0.315
4.267ArgLeu: 4.267 ± 0.474
1.543ArgMet: 1.543 ± 0.251
3.269ArgAsn: 3.269 ± 0.53
1.18ArgPro: 1.18 ± 0.291
2.134ArgGln: 2.134 ± 0.31
2.406ArgArg: 2.406 ± 0.428
2.179ArgSer: 2.179 ± 0.371
2.361ArgThr: 2.361 ± 0.357
2.134ArgVal: 2.134 ± 0.364
0.545ArgTrp: 0.545 ± 0.15
1.589ArgTyr: 1.589 ± 0.249
0.0ArgXaa: 0.0 ± 0.0
Ser
4.04SerAla: 4.04 ± 0.419
0.499SerCys: 0.499 ± 0.156
2.497SerAsp: 2.497 ± 0.371
3.904SerGlu: 3.904 ± 0.366
2.043SerPhe: 2.043 ± 0.284
4.449SerGly: 4.449 ± 0.681
0.772SerHis: 0.772 ± 0.172
3.723SerIle: 3.723 ± 0.392
4.267SerLys: 4.267 ± 0.535
5.039SerLeu: 5.039 ± 0.513
1.68SerMet: 1.68 ± 0.338
2.678SerAsn: 2.678 ± 0.399
2.27SerPro: 2.27 ± 0.442
2.088SerGln: 2.088 ± 0.348
2.542SerArg: 2.542 ± 0.407
3.995SerSer: 3.995 ± 0.569
4.403SerThr: 4.403 ± 0.474
3.859SerVal: 3.859 ± 0.437
0.817SerTrp: 0.817 ± 0.229
2.179SerTyr: 2.179 ± 0.35
0.0SerXaa: 0.0 ± 0.0
Thr
5.402ThrAla: 5.402 ± 0.559
0.272ThrCys: 0.272 ± 0.126
4.131ThrAsp: 4.131 ± 0.315
3.995ThrGlu: 3.995 ± 0.506
2.86ThrPhe: 2.86 ± 0.52
5.039ThrGly: 5.039 ± 0.394
1.317ThrHis: 1.317 ± 0.261
3.541ThrIle: 3.541 ± 0.383
3.995ThrLys: 3.995 ± 0.439
5.448ThrLeu: 5.448 ± 0.422
1.589ThrMet: 1.589 ± 0.233
3.677ThrAsn: 3.677 ± 0.379
3.405ThrPro: 3.405 ± 0.452
3.132ThrGln: 3.132 ± 0.312
2.406ThrArg: 2.406 ± 0.334
3.677ThrSer: 3.677 ± 0.387
3.45ThrThr: 3.45 ± 0.511
5.538ThrVal: 5.538 ± 0.608
1.09ThrTrp: 1.09 ± 0.347
1.77ThrTyr: 1.77 ± 0.312
0.0ThrXaa: 0.0 ± 0.0
Val
6.446ValAla: 6.446 ± 0.697
0.636ValCys: 0.636 ± 0.178
3.768ValAsp: 3.768 ± 0.361
4.54ValGlu: 4.54 ± 0.567
1.997ValPhe: 1.997 ± 0.363
4.585ValGly: 4.585 ± 0.552
1.634ValHis: 1.634 ± 0.25
3.859ValIle: 3.859 ± 0.57
4.131ValLys: 4.131 ± 0.343
3.813ValLeu: 3.813 ± 0.375
2.451ValMet: 2.451 ± 0.254
4.403ValAsn: 4.403 ± 0.37
3.087ValPro: 3.087 ± 0.441
2.724ValGln: 2.724 ± 0.324
3.087ValArg: 3.087 ± 0.392
4.177ValSer: 4.177 ± 0.477
5.13ValThr: 5.13 ± 0.619
4.267ValVal: 4.267 ± 0.434
0.953ValTrp: 0.953 ± 0.186
2.497ValTyr: 2.497 ± 0.335
0.0ValXaa: 0.0 ± 0.0
Trp
0.953TrpAla: 0.953 ± 0.211
0.091TrpCys: 0.091 ± 0.061
1.135TrpAsp: 1.135 ± 0.272
0.726TrpGlu: 0.726 ± 0.229
0.772TrpPhe: 0.772 ± 0.178
0.817TrpGly: 0.817 ± 0.224
0.363TrpHis: 0.363 ± 0.125
0.953TrpIle: 0.953 ± 0.206
0.726TrpLys: 0.726 ± 0.211
1.09TrpLeu: 1.09 ± 0.278
0.227TrpMet: 0.227 ± 0.151
0.772TrpAsn: 0.772 ± 0.199
0.227TrpPro: 0.227 ± 0.089
0.772TrpGln: 0.772 ± 0.171
0.772TrpArg: 0.772 ± 0.227
0.545TrpSer: 0.545 ± 0.153
0.454TrpThr: 0.454 ± 0.12
1.044TrpVal: 1.044 ± 0.209
0.318TrpTrp: 0.318 ± 0.195
0.409TrpTyr: 0.409 ± 0.149
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.769TyrAla: 2.769 ± 0.349
0.59TyrCys: 0.59 ± 0.212
2.815TyrAsp: 2.815 ± 0.332
1.907TyrGlu: 1.907 ± 0.286
1.18TyrPhe: 1.18 ± 0.26
2.724TyrGly: 2.724 ± 0.55
0.953TyrHis: 0.953 ± 0.223
2.27TyrIle: 2.27 ± 0.384
2.179TyrLys: 2.179 ± 0.382
2.678TyrLeu: 2.678 ± 0.414
0.863TyrMet: 0.863 ± 0.209
2.27TyrAsn: 2.27 ± 0.254
1.997TyrPro: 1.997 ± 0.34
1.997TyrGln: 1.997 ± 0.381
1.407TyrArg: 1.407 ± 0.341
1.997TyrSer: 1.997 ± 0.337
1.77TyrThr: 1.77 ± 0.237
2.406TyrVal: 2.406 ± 0.32
0.59TyrTrp: 0.59 ± 0.151
1.271TyrTyr: 1.271 ± 0.249
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 72 proteins (22029 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
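A protein-level bootstrap of this kind can be sketched as follows. This is an illustration under stated assumptions rather than the database's actual procedure: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the standard deviation across replicates is taken as the error (the source does not say which statistic is reported). The names `pair_frequency` and `bootstrap_error` and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Per-mille frequency of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs within each protein.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Assumption: the error is the sample standard deviation of the
    frequency across the bootstrap replicates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins (not residues) with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy_proteins = ["AAAC", "ACAA", "CCCA", "AACC"]  # invented sequences
    print(pair_frequency(toy_proteins, "AA"))
    print(bootstrap_error(toy_proteins, "AA"))
```

Resampling at the protein level keeps each sequence intact, so the spread reflects variation between proteins rather than between individual residues.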

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski