Amino acid dipepetide frequency for Escherichia phage vB_EcoS_ESCO41

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.254AlaAla: 7.254 ± 0.975
0.972AlaCys: 0.972 ± 0.315
4.34AlaAsp: 4.34 ± 0.447
4.793AlaGlu: 4.793 ± 0.517
2.72AlaPhe: 2.72 ± 0.46
6.931AlaGly: 6.931 ± 0.736
0.712AlaHis: 0.712 ± 0.219
5.441AlaIle: 5.441 ± 0.615
5.506AlaLys: 5.506 ± 0.855
6.542AlaLeu: 6.542 ± 0.64
2.073AlaMet: 2.073 ± 0.371
3.368AlaAsn: 3.368 ± 0.505
1.619AlaPro: 1.619 ± 0.263
2.72AlaGln: 2.72 ± 0.483
4.534AlaArg: 4.534 ± 0.605
4.858AlaSer: 4.858 ± 0.547
4.664AlaThr: 4.664 ± 0.928
5.441AlaVal: 5.441 ± 0.594
1.231AlaTrp: 1.231 ± 0.238
2.591AlaTyr: 2.591 ± 0.439
0.0AlaXaa: 0.0 ± 0.0
Cys
1.231CysAla: 1.231 ± 0.296
0.13CysCys: 0.13 ± 0.101
0.842CysAsp: 0.842 ± 0.234
1.231CysGlu: 1.231 ± 0.409
0.842CysPhe: 0.842 ± 0.201
1.36CysGly: 1.36 ± 0.394
0.259CysHis: 0.259 ± 0.121
1.036CysIle: 1.036 ± 0.271
1.101CysLys: 1.101 ± 0.359
0.972CysLeu: 0.972 ± 0.25
0.518CysMet: 0.518 ± 0.209
0.518CysAsn: 0.518 ± 0.177
0.518CysPro: 0.518 ± 0.216
0.194CysGln: 0.194 ± 0.109
0.907CysArg: 0.907 ± 0.275
1.295CysSer: 1.295 ± 0.313
0.648CysThr: 0.648 ± 0.216
1.166CysVal: 1.166 ± 0.285
0.389CysTrp: 0.389 ± 0.138
0.389CysTyr: 0.389 ± 0.178
0.0CysXaa: 0.0 ± 0.0
Asp
4.34AspAla: 4.34 ± 0.486
0.842AspCys: 0.842 ± 0.231
4.34AspAsp: 4.34 ± 0.608
5.311AspGlu: 5.311 ± 0.711
2.073AspPhe: 2.073 ± 0.337
6.931AspGly: 6.931 ± 0.655
1.555AspHis: 1.555 ± 0.295
3.821AspIle: 3.821 ± 0.414
4.145AspLys: 4.145 ± 0.605
3.886AspLeu: 3.886 ± 0.547
1.619AspMet: 1.619 ± 0.343
3.174AspAsn: 3.174 ± 0.419
1.814AspPro: 1.814 ± 0.295
1.555AspGln: 1.555 ± 0.377
2.202AspArg: 2.202 ± 0.374
3.433AspSer: 3.433 ± 0.52
2.591AspThr: 2.591 ± 0.4
3.433AspVal: 3.433 ± 0.515
0.777AspTrp: 0.777 ± 0.204
3.303AspTyr: 3.303 ± 0.378
0.0AspXaa: 0.0 ± 0.0
Glu
4.858GluAla: 4.858 ± 0.495
1.231GluCys: 1.231 ± 0.36
2.915GluAsp: 2.915 ± 0.526
3.174GluGlu: 3.174 ± 0.496
3.044GluPhe: 3.044 ± 0.379
3.498GluGly: 3.498 ± 0.488
1.166GluHis: 1.166 ± 0.267
4.987GluIle: 4.987 ± 0.478
4.793GluLys: 4.793 ± 0.64
4.599GluLeu: 4.599 ± 0.405
2.656GluMet: 2.656 ± 0.466
3.368GluAsn: 3.368 ± 0.495
2.267GluPro: 2.267 ± 0.374
2.397GluGln: 2.397 ± 0.41
2.785GluArg: 2.785 ± 0.449
4.664GluSer: 4.664 ± 0.476
3.044GluThr: 3.044 ± 0.421
4.728GluVal: 4.728 ± 0.479
1.101GluTrp: 1.101 ± 0.321
2.591GluTyr: 2.591 ± 0.387
0.0GluXaa: 0.0 ± 0.0
Phe
2.979PheAla: 2.979 ± 0.546
0.972PheCys: 0.972 ± 0.255
3.692PheAsp: 3.692 ± 0.462
2.526PheGlu: 2.526 ± 0.469
0.777PhePhe: 0.777 ± 0.233
2.915PheGly: 2.915 ± 0.487
0.518PheHis: 0.518 ± 0.175
2.656PheIle: 2.656 ± 0.363
2.85PheLys: 2.85 ± 0.384
2.397PheLeu: 2.397 ± 0.373
0.907PheMet: 0.907 ± 0.316
2.785PheAsn: 2.785 ± 0.378
1.555PhePro: 1.555 ± 0.335
1.49PheGln: 1.49 ± 0.284
1.684PheArg: 1.684 ± 0.326
2.979PheSer: 2.979 ± 0.419
2.202PheThr: 2.202 ± 0.405
2.915PheVal: 2.915 ± 0.483
0.712PheTrp: 0.712 ± 0.188
1.49PheTyr: 1.49 ± 0.352
0.0PheXaa: 0.0 ± 0.0
Gly
4.793GlyAla: 4.793 ± 0.592
1.36GlyCys: 1.36 ± 0.329
4.923GlyAsp: 4.923 ± 0.524
4.987GlyGlu: 4.987 ± 0.551
3.757GlyPhe: 3.757 ± 0.515
7.125GlyGly: 7.125 ± 1.011
1.555GlyHis: 1.555 ± 0.45
5.182GlyIle: 5.182 ± 0.635
7.384GlyLys: 7.384 ± 0.7
5.311GlyLeu: 5.311 ± 0.556
2.72GlyMet: 2.72 ± 0.453
3.951GlyAsn: 3.951 ± 0.591
1.036GlyPro: 1.036 ± 0.294
1.425GlyGln: 1.425 ± 0.332
3.821GlyArg: 3.821 ± 0.414
5.829GlySer: 5.829 ± 0.775
3.044GlyThr: 3.044 ± 0.413
6.088GlyVal: 6.088 ± 0.553
1.295GlyTrp: 1.295 ± 0.297
3.627GlyTyr: 3.627 ± 0.483
0.0GlyXaa: 0.0 ± 0.0
His
0.972HisAla: 0.972 ± 0.293
0.324HisCys: 0.324 ± 0.133
1.231HisAsp: 1.231 ± 0.23
1.425HisGlu: 1.425 ± 0.322
0.972HisPhe: 0.972 ± 0.226
1.619HisGly: 1.619 ± 0.307
0.453HisHis: 0.453 ± 0.176
1.36HisIle: 1.36 ± 0.291
1.295HisLys: 1.295 ± 0.353
1.166HisLeu: 1.166 ± 0.354
0.389HisMet: 0.389 ± 0.171
0.518HisAsn: 0.518 ± 0.146
0.972HisPro: 0.972 ± 0.242
0.453HisGln: 0.453 ± 0.185
0.907HisArg: 0.907 ± 0.298
0.907HisSer: 0.907 ± 0.325
1.101HisThr: 1.101 ± 0.256
1.101HisVal: 1.101 ± 0.278
0.065HisTrp: 0.065 ± 0.062
0.907HisTyr: 0.907 ± 0.27
0.0HisXaa: 0.0 ± 0.0
Ile
6.088IleAla: 6.088 ± 0.753
0.583IleCys: 0.583 ± 0.214
5.052IleAsp: 5.052 ± 0.586
4.987IleGlu: 4.987 ± 0.571
2.073IlePhe: 2.073 ± 0.329
4.275IleGly: 4.275 ± 0.438
1.231IleHis: 1.231 ± 0.34
4.275IleIle: 4.275 ± 0.566
5.376IleLys: 5.376 ± 0.736
3.109IleLeu: 3.109 ± 0.419
1.814IleMet: 1.814 ± 0.406
4.016IleAsn: 4.016 ± 0.609
2.85IlePro: 2.85 ± 0.455
2.202IleGln: 2.202 ± 0.41
3.368IleArg: 3.368 ± 0.498
4.858IleSer: 4.858 ± 0.593
4.275IleThr: 4.275 ± 0.461
3.562IleVal: 3.562 ± 0.518
1.101IleTrp: 1.101 ± 0.356
2.202IleTyr: 2.202 ± 0.373
0.0IleXaa: 0.0 ± 0.0
Lys
6.088LysAla: 6.088 ± 0.824
1.101LysCys: 1.101 ± 0.255
4.145LysAsp: 4.145 ± 0.489
5.117LysGlu: 5.117 ± 0.757
2.332LysPhe: 2.332 ± 0.38
4.21LysGly: 4.21 ± 0.619
1.101LysHis: 1.101 ± 0.278
4.145LysIle: 4.145 ± 0.418
5.441LysLys: 5.441 ± 0.713
4.858LysLeu: 4.858 ± 0.487
3.627LysMet: 3.627 ± 0.608
2.979LysAsn: 2.979 ± 0.484
2.461LysPro: 2.461 ± 0.43
2.85LysGln: 2.85 ± 0.614
2.979LysArg: 2.979 ± 0.51
4.923LysSer: 4.923 ± 0.643
4.145LysThr: 4.145 ± 0.476
5.635LysVal: 5.635 ± 0.632
0.907LysTrp: 0.907 ± 0.267
2.85LysTyr: 2.85 ± 0.39
0.0LysXaa: 0.0 ± 0.0
Leu
4.987LeuAla: 4.987 ± 0.632
0.907LeuCys: 0.907 ± 0.234
3.951LeuAsp: 3.951 ± 0.394
3.562LeuGlu: 3.562 ± 0.422
2.137LeuPhe: 2.137 ± 0.423
4.599LeuGly: 4.599 ± 0.56
1.231LeuHis: 1.231 ± 0.316
4.534LeuIle: 4.534 ± 0.581
4.534LeuLys: 4.534 ± 0.493
4.404LeuLeu: 4.404 ± 0.652
1.36LeuMet: 1.36 ± 0.272
3.498LeuAsn: 3.498 ± 0.517
3.433LeuPro: 3.433 ± 0.47
1.943LeuGln: 1.943 ± 0.355
3.627LeuArg: 3.627 ± 0.472
4.923LeuSer: 4.923 ± 0.657
4.404LeuThr: 4.404 ± 0.453
3.951LeuVal: 3.951 ± 0.465
0.712LeuTrp: 0.712 ± 0.213
2.008LeuTyr: 2.008 ± 0.318
0.0LeuXaa: 0.0 ± 0.0
Met
3.109MetAla: 3.109 ± 0.434
0.194MetCys: 0.194 ± 0.105
0.842MetAsp: 0.842 ± 0.237
1.684MetGlu: 1.684 ± 0.302
1.295MetPhe: 1.295 ± 0.313
1.231MetGly: 1.231 ± 0.325
0.777MetHis: 0.777 ± 0.233
2.656MetIle: 2.656 ± 0.424
2.267MetLys: 2.267 ± 0.34
1.555MetLeu: 1.555 ± 0.288
1.295MetMet: 1.295 ± 0.315
2.073MetAsn: 2.073 ± 0.362
1.036MetPro: 1.036 ± 0.256
1.101MetGln: 1.101 ± 0.23
1.49MetArg: 1.49 ± 0.314
2.526MetSer: 2.526 ± 0.399
1.878MetThr: 1.878 ± 0.468
2.137MetVal: 2.137 ± 0.379
0.453MetTrp: 0.453 ± 0.154
0.842MetTyr: 0.842 ± 0.21
0.0MetXaa: 0.0 ± 0.0
Asn
4.404AsnAla: 4.404 ± 0.621
0.777AsnCys: 0.777 ± 0.201
3.044AsnAsp: 3.044 ± 0.334
3.044AsnGlu: 3.044 ± 0.464
2.461AsnPhe: 2.461 ± 0.386
6.153AsnGly: 6.153 ± 0.564
1.166AsnHis: 1.166 ± 0.28
2.137AsnIle: 2.137 ± 0.312
2.979AsnLys: 2.979 ± 0.367
3.821AsnLeu: 3.821 ± 0.447
1.231AsnMet: 1.231 ± 0.288
3.303AsnAsn: 3.303 ± 0.555
1.814AsnPro: 1.814 ± 0.319
1.943AsnGln: 1.943 ± 0.376
1.878AsnArg: 1.878 ± 0.385
3.433AsnSer: 3.433 ± 0.518
1.878AsnThr: 1.878 ± 0.307
3.303AsnVal: 3.303 ± 0.338
0.777AsnTrp: 0.777 ± 0.184
1.49AsnTyr: 1.49 ± 0.297
0.0AsnXaa: 0.0 ± 0.0
Pro
2.979ProAla: 2.979 ± 0.447
0.777ProCys: 0.777 ± 0.257
2.397ProAsp: 2.397 ± 0.397
2.85ProGlu: 2.85 ± 0.537
1.878ProPhe: 1.878 ± 0.34
2.526ProGly: 2.526 ± 0.497
0.648ProHis: 0.648 ± 0.152
2.332ProIle: 2.332 ± 0.436
1.425ProLys: 1.425 ± 0.258
1.49ProLeu: 1.49 ± 0.282
0.972ProMet: 0.972 ± 0.281
1.555ProAsn: 1.555 ± 0.284
1.101ProPro: 1.101 ± 0.29
1.166ProGln: 1.166 ± 0.282
1.49ProArg: 1.49 ± 0.319
1.814ProSer: 1.814 ± 0.324
1.684ProThr: 1.684 ± 0.34
2.785ProVal: 2.785 ± 0.365
0.583ProTrp: 0.583 ± 0.235
1.295ProTyr: 1.295 ± 0.354
0.0ProXaa: 0.0 ± 0.0
Gln
3.886GlnAla: 3.886 ± 0.467
0.712GlnCys: 0.712 ± 0.241
1.101GlnAsp: 1.101 ± 0.237
1.555GlnGlu: 1.555 ± 0.453
2.073GlnPhe: 2.073 ± 0.38
1.749GlnGly: 1.749 ± 0.369
0.453GlnHis: 0.453 ± 0.206
3.239GlnIle: 3.239 ± 0.439
2.073GlnLys: 2.073 ± 0.395
2.332GlnLeu: 2.332 ± 0.444
1.295GlnMet: 1.295 ± 0.345
1.555GlnAsn: 1.555 ± 0.307
0.907GlnPro: 0.907 ± 0.214
1.814GlnGln: 1.814 ± 0.623
1.555GlnArg: 1.555 ± 0.305
2.332GlnSer: 2.332 ± 0.353
1.036GlnThr: 1.036 ± 0.256
2.85GlnVal: 2.85 ± 0.43
0.453GlnTrp: 0.453 ± 0.226
1.49GlnTyr: 1.49 ± 0.4
0.0GlnXaa: 0.0 ± 0.0
Arg
3.498ArgAla: 3.498 ± 0.423
0.712ArgCys: 0.712 ± 0.268
2.397ArgAsp: 2.397 ± 0.392
3.433ArgGlu: 3.433 ± 0.377
2.397ArgPhe: 2.397 ± 0.438
2.915ArgGly: 2.915 ± 0.376
0.712ArgHis: 0.712 ± 0.265
2.915ArgIle: 2.915 ± 0.435
4.145ArgLys: 4.145 ± 0.45
3.368ArgLeu: 3.368 ± 0.506
1.684ArgMet: 1.684 ± 0.35
1.814ArgAsn: 1.814 ± 0.414
1.749ArgPro: 1.749 ± 0.336
2.332ArgGln: 2.332 ± 0.364
2.785ArgArg: 2.785 ± 0.355
2.72ArgSer: 2.72 ± 0.397
1.555ArgThr: 1.555 ± 0.327
3.368ArgVal: 3.368 ± 0.504
0.518ArgTrp: 0.518 ± 0.181
2.137ArgTyr: 2.137 ± 0.294
0.0ArgXaa: 0.0 ± 0.0
Ser
4.664SerAla: 4.664 ± 0.58
0.712SerCys: 0.712 ± 0.223
4.793SerAsp: 4.793 ± 0.519
4.534SerGlu: 4.534 ± 0.522
2.979SerPhe: 2.979 ± 0.381
6.477SerGly: 6.477 ± 0.641
1.231SerHis: 1.231 ± 0.292
4.599SerIle: 4.599 ± 0.571
4.599SerLys: 4.599 ± 0.623
4.858SerLeu: 4.858 ± 0.563
1.295SerMet: 1.295 ± 0.316
3.886SerAsn: 3.886 ± 0.527
2.008SerPro: 2.008 ± 0.397
2.979SerGln: 2.979 ± 0.427
2.656SerArg: 2.656 ± 0.423
3.562SerSer: 3.562 ± 0.639
3.692SerThr: 3.692 ± 0.554
4.664SerVal: 4.664 ± 0.529
0.907SerTrp: 0.907 ± 0.234
2.461SerTyr: 2.461 ± 0.355
0.0SerXaa: 0.0 ± 0.0
Thr
3.821ThrAla: 3.821 ± 0.64
0.583ThrCys: 0.583 ± 0.18
2.526ThrAsp: 2.526 ± 0.389
3.044ThrGlu: 3.044 ± 0.482
2.267ThrPhe: 2.267 ± 0.426
5.117ThrGly: 5.117 ± 0.639
1.101ThrHis: 1.101 ± 0.267
3.562ThrIle: 3.562 ± 0.482
2.72ThrLys: 2.72 ± 0.461
3.627ThrLeu: 3.627 ± 0.469
1.555ThrMet: 1.555 ± 0.348
2.397ThrAsn: 2.397 ± 0.395
2.202ThrPro: 2.202 ± 0.373
2.526ThrGln: 2.526 ± 0.349
1.749ThrArg: 1.749 ± 0.29
3.627ThrSer: 3.627 ± 0.469
2.72ThrThr: 2.72 ± 0.536
4.081ThrVal: 4.081 ± 0.489
0.907ThrTrp: 0.907 ± 0.215
1.555ThrTyr: 1.555 ± 0.325
0.0ThrXaa: 0.0 ± 0.0
Val
4.923ValAla: 4.923 ± 0.529
1.231ValCys: 1.231 ± 0.363
4.664ValAsp: 4.664 ± 0.54
3.821ValGlu: 3.821 ± 0.558
2.785ValPhe: 2.785 ± 0.417
5.506ValGly: 5.506 ± 0.59
1.036ValHis: 1.036 ± 0.281
4.923ValIle: 4.923 ± 0.409
5.829ValLys: 5.829 ± 0.62
3.757ValLeu: 3.757 ± 0.414
1.943ValMet: 1.943 ± 0.334
3.498ValAsn: 3.498 ± 0.505
2.591ValPro: 2.591 ± 0.474
2.008ValGln: 2.008 ± 0.462
3.239ValArg: 3.239 ± 0.565
4.793ValSer: 4.793 ± 0.627
4.858ValThr: 4.858 ± 0.565
5.57ValVal: 5.57 ± 0.733
0.907ValTrp: 0.907 ± 0.234
2.85ValTyr: 2.85 ± 0.404
0.0ValXaa: 0.0 ± 0.0
Trp
1.166TrpAla: 1.166 ± 0.273
0.453TrpCys: 0.453 ± 0.156
0.518TrpAsp: 0.518 ± 0.178
0.907TrpGlu: 0.907 ± 0.256
0.777TrpPhe: 0.777 ± 0.266
0.712TrpGly: 0.712 ± 0.189
0.259TrpHis: 0.259 ± 0.128
0.842TrpIle: 0.842 ± 0.224
1.425TrpLys: 1.425 ± 0.338
1.166TrpLeu: 1.166 ± 0.273
0.453TrpMet: 0.453 ± 0.164
0.583TrpAsn: 0.583 ± 0.207
0.389TrpPro: 0.389 ± 0.186
0.259TrpGln: 0.259 ± 0.126
0.777TrpArg: 0.777 ± 0.205
1.619TrpSer: 1.619 ± 0.337
0.324TrpThr: 0.324 ± 0.156
1.036TrpVal: 1.036 ± 0.273
0.259TrpTrp: 0.259 ± 0.133
0.583TrpTyr: 0.583 ± 0.219
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.267TyrAla: 2.267 ± 0.34
0.972TyrCys: 0.972 ± 0.316
3.433TyrAsp: 3.433 ± 0.469
1.878TyrGlu: 1.878 ± 0.35
1.36TyrPhe: 1.36 ± 0.262
3.368TyrGly: 3.368 ± 0.363
0.972TyrHis: 0.972 ± 0.263
2.656TyrIle: 2.656 ± 0.407
2.008TyrLys: 2.008 ± 0.392
1.49TyrLeu: 1.49 ± 0.3
0.972TyrMet: 0.972 ± 0.331
2.202TyrAsn: 2.202 ± 0.405
1.425TyrPro: 1.425 ± 0.317
1.101TyrGln: 1.101 ± 0.246
2.785TyrArg: 2.785 ± 0.507
2.461TyrSer: 2.461 ± 0.414
1.943TyrThr: 1.943 ± 0.282
2.915TyrVal: 2.915 ± 0.427
0.453TyrTrp: 0.453 ± 0.184
0.907TyrTyr: 0.907 ± 0.227
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 79 proteins (15440 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski