Amino acid dipepetide frequency for Escherichia phage Snoke

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.964AlaAla: 10.964 ± 1.273
1.061AlaCys: 1.061 ± 0.261
5.871AlaAsp: 5.871 ± 0.665
7.64AlaGlu: 7.64 ± 0.701
3.749AlaPhe: 3.749 ± 0.575
6.225AlaGly: 6.225 ± 0.7
1.556AlaHis: 1.556 ± 0.285
4.669AlaIle: 4.669 ± 0.677
5.517AlaLys: 5.517 ± 0.641
8.488AlaLeu: 8.488 ± 0.707
2.193AlaMet: 2.193 ± 0.423
4.103AlaAsn: 4.103 ± 0.418
3.466AlaPro: 3.466 ± 0.339
3.254AlaGln: 3.254 ± 0.531
4.173AlaArg: 4.173 ± 0.517
5.8AlaSer: 5.8 ± 0.804
6.083AlaThr: 6.083 ± 0.737
6.013AlaVal: 6.013 ± 0.464
1.627AlaTrp: 1.627 ± 0.282
3.395AlaTyr: 3.395 ± 0.41
0.0AlaXaa: 0.0 ± 0.0
Cys
1.061CysAla: 1.061 ± 0.288
0.141CysCys: 0.141 ± 0.095
0.849CysAsp: 0.849 ± 0.207
0.99CysGlu: 0.99 ± 0.315
0.212CysPhe: 0.212 ± 0.125
0.99CysGly: 0.99 ± 0.24
0.141CysHis: 0.141 ± 0.109
0.424CysIle: 0.424 ± 0.164
0.778CysLys: 0.778 ± 0.234
0.566CysLeu: 0.566 ± 0.21
0.141CysMet: 0.141 ± 0.114
0.424CysAsn: 0.424 ± 0.159
0.354CysPro: 0.354 ± 0.151
0.212CysGln: 0.212 ± 0.131
0.92CysArg: 0.92 ± 0.23
0.707CysSer: 0.707 ± 0.267
0.849CysThr: 0.849 ± 0.272
0.778CysVal: 0.778 ± 0.242
0.212CysTrp: 0.212 ± 0.113
0.354CysTyr: 0.354 ± 0.138
0.0CysXaa: 0.0 ± 0.0
Asp
5.73AspAla: 5.73 ± 0.719
0.637AspCys: 0.637 ± 0.196
4.669AspAsp: 4.669 ± 0.477
4.527AspGlu: 4.527 ± 0.613
3.183AspPhe: 3.183 ± 0.446
5.588AspGly: 5.588 ± 0.694
0.92AspHis: 0.92 ± 0.217
4.032AspIle: 4.032 ± 0.451
3.183AspLys: 3.183 ± 0.484
4.598AspLeu: 4.598 ± 0.474
1.556AspMet: 1.556 ± 0.3
2.9AspAsn: 2.9 ± 0.464
1.485AspPro: 1.485 ± 0.288
0.92AspGln: 0.92 ± 0.261
2.193AspArg: 2.193 ± 0.37
3.183AspSer: 3.183 ± 0.536
4.103AspThr: 4.103 ± 0.604
4.881AspVal: 4.881 ± 0.655
0.92AspTrp: 0.92 ± 0.225
1.698AspTyr: 1.698 ± 0.35
0.0AspXaa: 0.0 ± 0.0
Glu
6.296GluAla: 6.296 ± 0.793
0.495GluCys: 0.495 ± 0.199
3.678GluAsp: 3.678 ± 0.531
4.952GluGlu: 4.952 ± 0.886
2.759GluPhe: 2.759 ± 0.487
4.315GluGly: 4.315 ± 0.569
1.203GluHis: 1.203 ± 0.287
3.82GluIle: 3.82 ± 0.442
4.386GluLys: 4.386 ± 0.657
5.871GluLeu: 5.871 ± 0.548
2.829GluMet: 2.829 ± 0.455
2.547GluAsn: 2.547 ± 0.374
1.981GluPro: 1.981 ± 0.424
3.82GluGln: 3.82 ± 0.758
3.466GluArg: 3.466 ± 0.575
2.476GluSer: 2.476 ± 0.449
3.395GluThr: 3.395 ± 0.484
5.022GluVal: 5.022 ± 0.433
0.92GluTrp: 0.92 ± 0.281
2.405GluTyr: 2.405 ± 0.41
0.0GluXaa: 0.0 ± 0.0
Phe
2.617PheAla: 2.617 ± 0.37
0.495PheCys: 0.495 ± 0.172
3.537PheAsp: 3.537 ± 0.542
2.759PheGlu: 2.759 ± 0.423
1.132PhePhe: 1.132 ± 0.278
2.9PheGly: 2.9 ± 0.355
0.566PheHis: 0.566 ± 0.172
2.405PheIle: 2.405 ± 0.456
2.334PheLys: 2.334 ± 0.419
1.839PheLeu: 1.839 ± 0.347
0.637PheMet: 0.637 ± 0.168
1.981PheAsn: 1.981 ± 0.269
1.627PhePro: 1.627 ± 0.386
1.344PheGln: 1.344 ± 0.266
1.698PheArg: 1.698 ± 0.265
2.829PheSer: 2.829 ± 0.512
2.9PheThr: 2.9 ± 0.435
3.395PheVal: 3.395 ± 0.426
0.495PheTrp: 0.495 ± 0.198
1.627PheTyr: 1.627 ± 0.323
0.0PheXaa: 0.0 ± 0.0
Gly
7.498GlyAla: 7.498 ± 0.703
1.132GlyCys: 1.132 ± 0.232
4.244GlyAsp: 4.244 ± 0.444
4.81GlyGlu: 4.81 ± 0.662
3.395GlyPhe: 3.395 ± 0.455
5.588GlyGly: 5.588 ± 0.615
1.627GlyHis: 1.627 ± 0.373
2.617GlyIle: 2.617 ± 0.433
5.022GlyLys: 5.022 ± 0.592
5.234GlyLeu: 5.234 ± 0.57
2.334GlyMet: 2.334 ± 0.468
4.244GlyAsn: 4.244 ± 0.55
1.556GlyPro: 1.556 ± 0.313
2.334GlyGln: 2.334 ± 0.343
3.537GlyArg: 3.537 ± 0.452
5.093GlySer: 5.093 ± 0.683
3.891GlyThr: 3.891 ± 0.458
5.588GlyVal: 5.588 ± 0.696
1.344GlyTrp: 1.344 ± 0.282
2.264GlyTyr: 2.264 ± 0.427
0.0GlyXaa: 0.0 ± 0.0
His
0.99HisAla: 0.99 ± 0.241
0.283HisCys: 0.283 ± 0.124
0.707HisAsp: 0.707 ± 0.205
0.92HisGlu: 0.92 ± 0.283
0.566HisPhe: 0.566 ± 0.205
1.061HisGly: 1.061 ± 0.293
0.354HisHis: 0.354 ± 0.174
1.273HisIle: 1.273 ± 0.285
1.839HisLys: 1.839 ± 0.363
1.132HisLeu: 1.132 ± 0.301
0.424HisMet: 0.424 ± 0.187
1.061HisAsn: 1.061 ± 0.242
0.707HisPro: 0.707 ± 0.202
1.061HisGln: 1.061 ± 0.222
0.92HisArg: 0.92 ± 0.252
0.778HisSer: 0.778 ± 0.221
0.707HisThr: 0.707 ± 0.31
1.273HisVal: 1.273 ± 0.386
0.071HisTrp: 0.071 ± 0.069
0.566HisTyr: 0.566 ± 0.222
0.0HisXaa: 0.0 ± 0.0
Ile
4.81IleAla: 4.81 ± 0.656
1.132IleCys: 1.132 ± 0.223
3.537IleAsp: 3.537 ± 0.508
3.254IleGlu: 3.254 ± 0.389
1.556IlePhe: 1.556 ± 0.3
2.688IleGly: 2.688 ± 0.411
0.495IleHis: 0.495 ± 0.159
2.547IleIle: 2.547 ± 0.331
3.325IleLys: 3.325 ± 0.425
2.9IleLeu: 2.9 ± 0.485
0.849IleMet: 0.849 ± 0.253
3.183IleAsn: 3.183 ± 0.361
2.688IlePro: 2.688 ± 0.348
1.768IleGln: 1.768 ± 0.334
2.476IleArg: 2.476 ± 0.341
3.395IleSer: 3.395 ± 0.443
4.952IleThr: 4.952 ± 0.72
3.466IleVal: 3.466 ± 0.505
1.132IleTrp: 1.132 ± 0.269
1.839IleTyr: 1.839 ± 0.399
0.0IleXaa: 0.0 ± 0.0
Lys
6.013LysAla: 6.013 ± 0.864
0.354LysCys: 0.354 ± 0.152
3.042LysAsp: 3.042 ± 0.455
3.891LysGlu: 3.891 ± 0.721
2.193LysPhe: 2.193 ± 0.217
3.82LysGly: 3.82 ± 0.428
1.556LysHis: 1.556 ± 0.324
2.971LysIle: 2.971 ± 0.528
3.042LysLys: 3.042 ± 0.512
4.952LysLeu: 4.952 ± 0.558
3.466LysMet: 3.466 ± 0.597
2.476LysAsn: 2.476 ± 0.393
2.476LysPro: 2.476 ± 0.49
1.91LysGln: 1.91 ± 0.361
4.032LysArg: 4.032 ± 0.493
3.325LysSer: 3.325 ± 0.572
3.537LysThr: 3.537 ± 0.453
3.537LysVal: 3.537 ± 0.575
0.566LysTrp: 0.566 ± 0.17
2.617LysTyr: 2.617 ± 0.381
0.0LysXaa: 0.0 ± 0.0
Leu
7.215LeuAla: 7.215 ± 0.672
0.566LeuCys: 0.566 ± 0.205
4.386LeuAsp: 4.386 ± 0.532
5.022LeuGlu: 5.022 ± 0.667
2.688LeuPhe: 2.688 ± 0.626
5.234LeuGly: 5.234 ± 0.521
1.344LeuHis: 1.344 ± 0.377
4.173LeuIle: 4.173 ± 0.371
4.173LeuLys: 4.173 ± 0.638
5.871LeuLeu: 5.871 ± 0.663
1.627LeuMet: 1.627 ± 0.297
3.891LeuAsn: 3.891 ± 0.459
3.891LeuPro: 3.891 ± 0.519
2.547LeuGln: 2.547 ± 0.422
5.164LeuArg: 5.164 ± 0.616
4.315LeuSer: 4.315 ± 0.617
5.517LeuThr: 5.517 ± 0.734
5.376LeuVal: 5.376 ± 0.762
0.849LeuTrp: 0.849 ± 0.242
1.91LeuTyr: 1.91 ± 0.316
0.0LeuXaa: 0.0 ± 0.0
Met
2.759MetAla: 2.759 ± 0.342
0.212MetCys: 0.212 ± 0.094
0.566MetAsp: 0.566 ± 0.224
1.485MetGlu: 1.485 ± 0.33
0.778MetPhe: 0.778 ± 0.224
1.627MetGly: 1.627 ± 0.344
0.283MetHis: 0.283 ± 0.145
1.273MetIle: 1.273 ± 0.295
1.839MetLys: 1.839 ± 0.425
1.91MetLeu: 1.91 ± 0.404
0.707MetMet: 0.707 ± 0.242
0.99MetAsn: 0.99 ± 0.282
1.132MetPro: 1.132 ± 0.223
1.061MetGln: 1.061 ± 0.249
1.344MetArg: 1.344 ± 0.26
2.264MetSer: 2.264 ± 0.327
2.264MetThr: 2.264 ± 0.475
2.334MetVal: 2.334 ± 0.366
0.212MetTrp: 0.212 ± 0.111
1.132MetTyr: 1.132 ± 0.306
0.0MetXaa: 0.0 ± 0.0
Asn
4.386AsnAla: 4.386 ± 0.478
0.637AsnCys: 0.637 ± 0.252
2.617AsnAsp: 2.617 ± 0.365
2.971AsnGlu: 2.971 ± 0.356
1.556AsnPhe: 1.556 ± 0.276
4.81AsnGly: 4.81 ± 0.545
0.566AsnHis: 0.566 ± 0.193
2.547AsnIle: 2.547 ± 0.377
2.405AsnLys: 2.405 ± 0.345
3.537AsnLeu: 3.537 ± 0.435
0.637AsnMet: 0.637 ± 0.255
2.476AsnAsn: 2.476 ± 0.484
1.91AsnPro: 1.91 ± 0.373
1.627AsnGln: 1.627 ± 0.406
2.334AsnArg: 2.334 ± 0.514
2.476AsnSer: 2.476 ± 0.348
2.971AsnThr: 2.971 ± 0.442
4.244AsnVal: 4.244 ± 0.543
0.849AsnTrp: 0.849 ± 0.205
1.273AsnTyr: 1.273 ± 0.261
0.0AsnXaa: 0.0 ± 0.0
Pro
3.042ProAla: 3.042 ± 0.451
0.354ProCys: 0.354 ± 0.15
2.829ProAsp: 2.829 ± 0.471
3.325ProGlu: 3.325 ± 0.51
1.273ProPhe: 1.273 ± 0.361
2.971ProGly: 2.971 ± 0.416
0.707ProHis: 0.707 ± 0.2
1.698ProIle: 1.698 ± 0.344
1.556ProLys: 1.556 ± 0.324
2.971ProLeu: 2.971 ± 0.473
0.707ProMet: 0.707 ± 0.234
1.556ProAsn: 1.556 ± 0.29
1.273ProPro: 1.273 ± 0.243
1.273ProGln: 1.273 ± 0.268
1.627ProArg: 1.627 ± 0.325
2.476ProSer: 2.476 ± 0.303
2.829ProThr: 2.829 ± 0.474
4.173ProVal: 4.173 ± 0.45
0.354ProTrp: 0.354 ± 0.163
1.344ProTyr: 1.344 ± 0.327
0.0ProXaa: 0.0 ± 0.0
Gln
3.183GlnAla: 3.183 ± 0.481
0.424GlnCys: 0.424 ± 0.186
2.051GlnAsp: 2.051 ± 0.372
2.193GlnGlu: 2.193 ± 0.495
1.768GlnPhe: 1.768 ± 0.447
1.768GlnGly: 1.768 ± 0.295
0.99GlnHis: 0.99 ± 0.282
1.839GlnIle: 1.839 ± 0.366
2.051GlnLys: 2.051 ± 0.443
3.608GlnLeu: 3.608 ± 0.459
0.92GlnMet: 0.92 ± 0.233
1.839GlnAsn: 1.839 ± 0.418
1.415GlnPro: 1.415 ± 0.26
1.981GlnGln: 1.981 ± 0.538
2.334GlnArg: 2.334 ± 0.407
1.839GlnSer: 1.839 ± 0.403
1.768GlnThr: 1.768 ± 0.282
2.334GlnVal: 2.334 ± 0.426
0.849GlnTrp: 0.849 ± 0.24
1.768GlnTyr: 1.768 ± 0.309
0.0GlnXaa: 0.0 ± 0.0
Arg
4.173ArgAla: 4.173 ± 0.473
0.495ArgCys: 0.495 ± 0.18
2.617ArgAsp: 2.617 ± 0.322
3.466ArgGlu: 3.466 ± 0.51
1.91ArgPhe: 1.91 ± 0.298
3.112ArgGly: 3.112 ± 0.462
1.061ArgHis: 1.061 ± 0.269
2.547ArgIle: 2.547 ± 0.475
3.466ArgLys: 3.466 ± 0.541
4.244ArgLeu: 4.244 ± 0.38
1.556ArgMet: 1.556 ± 0.255
3.395ArgAsn: 3.395 ± 0.444
1.91ArgPro: 1.91 ± 0.348
3.112ArgGln: 3.112 ± 0.431
4.315ArgArg: 4.315 ± 0.622
2.405ArgSer: 2.405 ± 0.39
2.547ArgThr: 2.547 ± 0.43
4.386ArgVal: 4.386 ± 0.527
0.495ArgTrp: 0.495 ± 0.19
1.91ArgTyr: 1.91 ± 0.356
0.0ArgXaa: 0.0 ± 0.0
Ser
5.8SerAla: 5.8 ± 0.952
0.424SerCys: 0.424 ± 0.184
3.678SerAsp: 3.678 ± 0.628
3.112SerGlu: 3.112 ± 0.552
2.971SerPhe: 2.971 ± 0.429
5.871SerGly: 5.871 ± 0.685
0.778SerHis: 0.778 ± 0.249
3.466SerIle: 3.466 ± 0.615
3.678SerLys: 3.678 ± 0.575
3.749SerLeu: 3.749 ± 0.44
1.485SerMet: 1.485 ± 0.401
2.759SerAsn: 2.759 ± 0.449
2.405SerPro: 2.405 ± 0.465
2.122SerGln: 2.122 ± 0.405
2.405SerArg: 2.405 ± 0.458
2.9SerSer: 2.9 ± 0.586
3.608SerThr: 3.608 ± 0.587
4.669SerVal: 4.669 ± 0.486
0.637SerTrp: 0.637 ± 0.191
2.122SerTyr: 2.122 ± 0.367
0.0SerXaa: 0.0 ± 0.0
Thr
6.437ThrAla: 6.437 ± 0.712
0.92ThrCys: 0.92 ± 0.264
4.173ThrAsp: 4.173 ± 0.562
3.325ThrGlu: 3.325 ± 0.453
2.829ThrPhe: 2.829 ± 0.482
6.437ThrGly: 6.437 ± 0.85
0.92ThrHis: 0.92 ± 0.224
2.759ThrIle: 2.759 ± 0.378
3.749ThrLys: 3.749 ± 0.404
4.739ThrLeu: 4.739 ± 0.428
1.627ThrMet: 1.627 ± 0.376
2.193ThrAsn: 2.193 ± 0.488
3.678ThrPro: 3.678 ± 0.517
2.051ThrGln: 2.051 ± 0.484
3.042ThrArg: 3.042 ± 0.352
3.678ThrSer: 3.678 ± 0.481
4.456ThrThr: 4.456 ± 0.593
4.739ThrVal: 4.739 ± 0.653
0.495ThrTrp: 0.495 ± 0.177
2.193ThrTyr: 2.193 ± 0.392
0.0ThrXaa: 0.0 ± 0.0
Val
7.498ValAla: 7.498 ± 0.72
0.92ValCys: 0.92 ± 0.287
4.527ValAsp: 4.527 ± 0.464
5.234ValGlu: 5.234 ± 0.623
2.334ValPhe: 2.334 ± 0.367
4.527ValGly: 4.527 ± 0.533
0.778ValHis: 0.778 ± 0.203
4.386ValIle: 4.386 ± 0.683
4.315ValLys: 4.315 ± 0.457
5.588ValLeu: 5.588 ± 0.593
1.273ValMet: 1.273 ± 0.258
3.042ValAsn: 3.042 ± 0.439
2.829ValPro: 2.829 ± 0.442
2.688ValGln: 2.688 ± 0.358
3.608ValArg: 3.608 ± 0.606
5.588ValSer: 5.588 ± 0.716
5.588ValThr: 5.588 ± 0.567
6.225ValVal: 6.225 ± 0.759
0.99ValTrp: 0.99 ± 0.323
3.395ValTyr: 3.395 ± 0.584
0.0ValXaa: 0.0 ± 0.0
Trp
1.273TrpAla: 1.273 ± 0.365
0.141TrpCys: 0.141 ± 0.091
0.637TrpAsp: 0.637 ± 0.181
0.424TrpGlu: 0.424 ± 0.145
0.99TrpPhe: 0.99 ± 0.296
1.132TrpGly: 1.132 ± 0.321
0.354TrpHis: 0.354 ± 0.174
0.354TrpIle: 0.354 ± 0.153
0.707TrpLys: 0.707 ± 0.18
1.415TrpLeu: 1.415 ± 0.263
0.495TrpMet: 0.495 ± 0.174
0.637TrpAsn: 0.637 ± 0.208
0.283TrpPro: 0.283 ± 0.155
0.495TrpGln: 0.495 ± 0.19
0.92TrpArg: 0.92 ± 0.227
0.707TrpSer: 0.707 ± 0.268
0.637TrpThr: 0.637 ± 0.178
1.273TrpVal: 1.273 ± 0.262
0.141TrpTrp: 0.141 ± 0.091
0.495TrpTyr: 0.495 ± 0.172
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.103TyrAla: 4.103 ± 0.474
0.283TyrCys: 0.283 ± 0.155
2.688TyrAsp: 2.688 ± 0.454
2.334TyrGlu: 2.334 ± 0.328
1.273TyrPhe: 1.273 ± 0.324
2.688TyrGly: 2.688 ± 0.385
0.495TyrHis: 0.495 ± 0.188
2.193TyrIle: 2.193 ± 0.308
2.405TyrLys: 2.405 ± 0.451
2.547TyrLeu: 2.547 ± 0.369
0.637TyrMet: 0.637 ± 0.174
1.132TyrAsn: 1.132 ± 0.266
1.273TyrPro: 1.273 ± 0.384
1.344TyrGln: 1.344 ± 0.287
2.617TyrArg: 2.617 ± 0.396
2.405TyrSer: 2.405 ± 0.445
1.839TyrThr: 1.839 ± 0.327
1.698TyrVal: 1.698 ± 0.345
0.354TyrTrp: 0.354 ± 0.15
1.203TyrTyr: 1.203 ± 0.224
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 81 proteins (14138 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski