Amino acid dipepetide frequency for Escherichia phage slur05

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.936AlaAla: 12.936 ± 1.58
0.818AlaCys: 0.818 ± 0.245
6.171AlaAsp: 6.171 ± 0.738
7.137AlaGlu: 7.137 ± 0.792
3.494AlaPhe: 3.494 ± 0.493
8.921AlaGly: 8.921 ± 0.894
1.859AlaHis: 1.859 ± 0.332
5.948AlaIle: 5.948 ± 0.63
7.509AlaLys: 7.509 ± 1.078
7.806AlaLeu: 7.806 ± 0.769
2.676AlaMet: 2.676 ± 0.483
3.271AlaAsn: 3.271 ± 0.703
2.974AlaPro: 2.974 ± 0.429
4.832AlaGln: 4.832 ± 0.882
6.394AlaArg: 6.394 ± 0.615
4.832AlaSer: 4.832 ± 0.531
4.386AlaThr: 4.386 ± 0.715
7.657AlaVal: 7.657 ± 0.68
1.859AlaTrp: 1.859 ± 0.344
3.048AlaTyr: 3.048 ± 0.424
0.0AlaXaa: 0.0 ± 0.0
Cys
1.115CysAla: 1.115 ± 0.309
0.223CysCys: 0.223 ± 0.112
0.892CysAsp: 0.892 ± 0.273
0.743CysGlu: 0.743 ± 0.223
0.595CysPhe: 0.595 ± 0.204
0.743CysGly: 0.743 ± 0.213
0.223CysHis: 0.223 ± 0.14
0.372CysIle: 0.372 ± 0.149
0.669CysLys: 0.669 ± 0.218
1.041CysLeu: 1.041 ± 0.291
0.372CysMet: 0.372 ± 0.14
0.446CysAsn: 0.446 ± 0.186
0.223CysPro: 0.223 ± 0.103
0.297CysGln: 0.297 ± 0.147
0.669CysArg: 0.669 ± 0.211
0.52CysSer: 0.52 ± 0.179
0.595CysThr: 0.595 ± 0.181
0.52CysVal: 0.52 ± 0.188
0.149CysTrp: 0.149 ± 0.106
0.818CysTyr: 0.818 ± 0.254
0.0CysXaa: 0.0 ± 0.0
Asp
6.542AspAla: 6.542 ± 0.684
1.041AspCys: 1.041 ± 0.265
4.832AspAsp: 4.832 ± 1.419
5.576AspGlu: 5.576 ± 1.189
2.825AspPhe: 2.825 ± 0.509
6.171AspGly: 6.171 ± 0.746
1.338AspHis: 1.338 ± 0.304
3.197AspIle: 3.197 ± 0.508
3.122AspLys: 3.122 ± 0.617
5.13AspLeu: 5.13 ± 0.621
1.933AspMet: 1.933 ± 0.326
2.751AspAsn: 2.751 ± 0.468
3.494AspPro: 3.494 ± 0.481
1.784AspGln: 1.784 ± 0.378
2.528AspArg: 2.528 ± 0.324
2.602AspSer: 2.602 ± 0.467
4.089AspThr: 4.089 ± 0.511
4.089AspVal: 4.089 ± 0.392
1.115AspTrp: 1.115 ± 0.31
1.71AspTyr: 1.71 ± 0.352
0.0AspXaa: 0.0 ± 0.0
Glu
6.319GluAla: 6.319 ± 0.899
0.52GluCys: 0.52 ± 0.204
4.832GluAsp: 4.832 ± 1.02
4.015GluGlu: 4.015 ± 0.597
2.379GluPhe: 2.379 ± 0.421
3.048GluGly: 3.048 ± 0.558
1.115GluHis: 1.115 ± 0.301
4.907GluIle: 4.907 ± 0.442
3.271GluLys: 3.271 ± 0.651
5.13GluLeu: 5.13 ± 0.675
1.636GluMet: 1.636 ± 0.337
2.899GluAsn: 2.899 ± 0.402
1.561GluPro: 1.561 ± 0.411
2.305GluGln: 2.305 ± 0.424
4.163GluArg: 4.163 ± 0.588
2.676GluSer: 2.676 ± 0.439
2.305GluThr: 2.305 ± 0.429
4.832GluVal: 4.832 ± 0.599
0.966GluTrp: 0.966 ± 0.271
2.899GluTyr: 2.899 ± 0.462
0.0GluXaa: 0.0 ± 0.0
Phe
3.569PheAla: 3.569 ± 0.448
0.446PheCys: 0.446 ± 0.15
3.122PheAsp: 3.122 ± 0.48
2.156PheGlu: 2.156 ± 0.35
1.115PhePhe: 1.115 ± 0.273
3.345PheGly: 3.345 ± 0.525
0.669PheHis: 0.669 ± 0.263
1.933PheIle: 1.933 ± 0.375
1.561PheLys: 1.561 ± 0.368
1.933PheLeu: 1.933 ± 0.371
0.372PheMet: 0.372 ± 0.167
1.413PheAsn: 1.413 ± 0.376
1.413PhePro: 1.413 ± 0.378
1.338PheGln: 1.338 ± 0.233
2.379PheArg: 2.379 ± 0.466
1.933PheSer: 1.933 ± 0.403
2.379PheThr: 2.379 ± 0.499
2.528PheVal: 2.528 ± 0.596
0.52PheTrp: 0.52 ± 0.201
1.041PheTyr: 1.041 ± 0.238
0.0PheXaa: 0.0 ± 0.0
Gly
5.873GlyAla: 5.873 ± 0.656
1.115GlyCys: 1.115 ± 0.275
5.13GlyAsp: 5.13 ± 0.71
4.684GlyGlu: 4.684 ± 0.533
3.494GlyPhe: 3.494 ± 0.491
6.84GlyGly: 6.84 ± 0.828
1.413GlyHis: 1.413 ± 0.464
4.163GlyIle: 4.163 ± 0.511
6.468GlyLys: 6.468 ± 1.013
6.765GlyLeu: 6.765 ± 0.697
2.305GlyMet: 2.305 ± 0.517
4.238GlyAsn: 4.238 ± 0.862
1.859GlyPro: 1.859 ± 0.349
2.156GlyGln: 2.156 ± 0.415
4.609GlyArg: 4.609 ± 0.6
3.866GlySer: 3.866 ± 0.559
3.866GlyThr: 3.866 ± 0.635
6.096GlyVal: 6.096 ± 0.663
1.115GlyTrp: 1.115 ± 0.303
2.007GlyTyr: 2.007 ± 0.426
0.0GlyXaa: 0.0 ± 0.0
His
1.041HisAla: 1.041 ± 0.28
0.372HisCys: 0.372 ± 0.157
1.487HisAsp: 1.487 ± 0.279
0.743HisGlu: 0.743 ± 0.194
0.297HisPhe: 0.297 ± 0.132
1.19HisGly: 1.19 ± 0.331
0.595HisHis: 0.595 ± 0.29
1.115HisIle: 1.115 ± 0.276
1.19HisLys: 1.19 ± 0.31
1.115HisLeu: 1.115 ± 0.299
0.372HisMet: 0.372 ± 0.154
0.595HisAsn: 0.595 ± 0.184
1.19HisPro: 1.19 ± 0.298
0.892HisGln: 0.892 ± 0.328
0.818HisArg: 0.818 ± 0.219
0.669HisSer: 0.669 ± 0.224
1.115HisThr: 1.115 ± 0.359
0.966HisVal: 0.966 ± 0.24
0.372HisTrp: 0.372 ± 0.166
1.041HisTyr: 1.041 ± 0.285
0.0HisXaa: 0.0 ± 0.0
Ile
5.65IleAla: 5.65 ± 0.58
0.669IleCys: 0.669 ± 0.208
4.461IleAsp: 4.461 ± 0.455
3.345IleGlu: 3.345 ± 0.422
1.264IlePhe: 1.264 ± 0.28
3.494IleGly: 3.494 ± 0.507
0.892IleHis: 0.892 ± 0.204
2.602IleIle: 2.602 ± 0.423
3.94IleLys: 3.94 ± 0.596
2.751IleLeu: 2.751 ± 0.378
1.859IleMet: 1.859 ± 0.327
2.899IleAsn: 2.899 ± 0.418
2.156IlePro: 2.156 ± 0.508
2.007IleGln: 2.007 ± 0.447
3.345IleArg: 3.345 ± 0.529
2.379IleSer: 2.379 ± 0.455
4.386IleThr: 4.386 ± 0.872
3.94IleVal: 3.94 ± 0.452
0.743IleTrp: 0.743 ± 0.257
1.487IleTyr: 1.487 ± 0.361
0.0IleXaa: 0.0 ± 0.0
Lys
7.657LysAla: 7.657 ± 1.109
0.818LysCys: 0.818 ± 0.238
2.676LysAsp: 2.676 ± 0.394
3.494LysGlu: 3.494 ± 0.58
1.784LysPhe: 1.784 ± 0.307
4.312LysGly: 4.312 ± 0.536
0.966LysHis: 0.966 ± 0.247
2.305LysIle: 2.305 ± 0.495
3.122LysLys: 3.122 ± 0.685
4.684LysLeu: 4.684 ± 0.62
2.082LysMet: 2.082 ± 0.346
2.156LysAsn: 2.156 ± 0.406
3.122LysPro: 3.122 ± 0.553
3.122LysGln: 3.122 ± 0.565
3.717LysArg: 3.717 ± 0.598
3.792LysSer: 3.792 ± 0.729
4.312LysThr: 4.312 ± 0.573
4.163LysVal: 4.163 ± 0.512
1.19LysTrp: 1.19 ± 0.273
1.487LysTyr: 1.487 ± 0.339
0.0LysXaa: 0.0 ± 0.0
Leu
6.914LeuAla: 6.914 ± 0.788
0.892LeuCys: 0.892 ± 0.314
4.684LeuAsp: 4.684 ± 0.565
3.494LeuGlu: 3.494 ± 0.494
2.751LeuPhe: 2.751 ± 0.373
5.427LeuGly: 5.427 ± 0.532
1.041LeuHis: 1.041 ± 0.294
4.015LeuIle: 4.015 ± 0.528
4.535LeuLys: 4.535 ± 0.51
4.907LeuLeu: 4.907 ± 0.607
2.156LeuMet: 2.156 ± 0.423
3.42LeuAsn: 3.42 ± 0.519
3.345LeuPro: 3.345 ± 0.553
2.825LeuGln: 2.825 ± 0.436
4.981LeuArg: 4.981 ± 0.71
4.386LeuSer: 4.386 ± 0.492
5.724LeuThr: 5.724 ± 0.646
5.501LeuVal: 5.501 ± 0.798
0.595LeuTrp: 0.595 ± 0.19
2.23LeuTyr: 2.23 ± 0.374
0.0LeuXaa: 0.0 ± 0.0
Met
2.676MetAla: 2.676 ± 0.579
0.372MetCys: 0.372 ± 0.165
1.115MetAsp: 1.115 ± 0.366
1.487MetGlu: 1.487 ± 0.328
0.892MetPhe: 0.892 ± 0.262
1.636MetGly: 1.636 ± 0.408
0.52MetHis: 0.52 ± 0.165
2.156MetIle: 2.156 ± 0.514
1.636MetLys: 1.636 ± 0.422
2.082MetLeu: 2.082 ± 0.417
0.595MetMet: 0.595 ± 0.237
1.338MetAsn: 1.338 ± 0.346
0.818MetPro: 0.818 ± 0.205
1.338MetGln: 1.338 ± 0.293
1.338MetArg: 1.338 ± 0.341
1.784MetSer: 1.784 ± 0.371
2.305MetThr: 2.305 ± 0.452
1.636MetVal: 1.636 ± 0.357
0.297MetTrp: 0.297 ± 0.134
1.041MetTyr: 1.041 ± 0.291
0.0MetXaa: 0.0 ± 0.0
Asn
5.278AsnAla: 5.278 ± 0.679
0.149AsnCys: 0.149 ± 0.113
2.23AsnAsp: 2.23 ± 0.417
2.899AsnGlu: 2.899 ± 0.598
1.264AsnPhe: 1.264 ± 0.394
4.684AsnGly: 4.684 ± 0.532
1.19AsnHis: 1.19 ± 0.284
1.859AsnIle: 1.859 ± 0.333
2.082AsnLys: 2.082 ± 0.38
3.048AsnLeu: 3.048 ± 0.425
1.338AsnMet: 1.338 ± 0.275
1.784AsnAsn: 1.784 ± 0.47
1.636AsnPro: 1.636 ± 0.389
2.007AsnGln: 2.007 ± 0.391
1.859AsnArg: 1.859 ± 0.295
1.413AsnSer: 1.413 ± 0.377
2.23AsnThr: 2.23 ± 0.383
3.42AsnVal: 3.42 ± 0.598
0.818AsnTrp: 0.818 ± 0.25
1.561AsnTyr: 1.561 ± 0.34
0.0AsnXaa: 0.0 ± 0.0
Pro
4.089ProAla: 4.089 ± 0.573
0.223ProCys: 0.223 ± 0.127
3.569ProAsp: 3.569 ± 0.489
2.156ProGlu: 2.156 ± 0.521
1.19ProPhe: 1.19 ± 0.296
3.643ProGly: 3.643 ± 0.62
0.818ProHis: 0.818 ± 0.261
2.305ProIle: 2.305 ± 0.333
2.156ProLys: 2.156 ± 0.47
3.048ProLeu: 3.048 ± 0.552
0.149ProMet: 0.149 ± 0.107
1.264ProAsn: 1.264 ± 0.293
1.784ProPro: 1.784 ± 0.421
1.19ProGln: 1.19 ± 0.281
2.751ProArg: 2.751 ± 0.549
1.784ProSer: 1.784 ± 0.317
2.528ProThr: 2.528 ± 0.562
3.494ProVal: 3.494 ± 0.512
0.446ProTrp: 0.446 ± 0.204
1.115ProTyr: 1.115 ± 0.327
0.0ProXaa: 0.0 ± 0.0
Gln
4.089GlnAla: 4.089 ± 0.621
0.297GlnCys: 0.297 ± 0.177
1.784GlnAsp: 1.784 ± 0.316
1.413GlnGlu: 1.413 ± 0.38
1.338GlnPhe: 1.338 ± 0.359
1.933GlnGly: 1.933 ± 0.343
0.743GlnHis: 0.743 ± 0.234
3.197GlnIle: 3.197 ± 0.509
1.636GlnLys: 1.636 ± 0.4
3.94GlnLeu: 3.94 ± 0.569
1.264GlnMet: 1.264 ± 0.28
1.19GlnAsn: 1.19 ± 0.296
1.859GlnPro: 1.859 ± 0.345
1.487GlnGln: 1.487 ± 0.397
2.751GlnArg: 2.751 ± 0.526
2.156GlnSer: 2.156 ± 0.44
2.528GlnThr: 2.528 ± 0.473
3.048GlnVal: 3.048 ± 0.444
0.743GlnTrp: 0.743 ± 0.217
0.966GlnTyr: 0.966 ± 0.223
0.0GlnXaa: 0.0 ± 0.0
Arg
5.65ArgAla: 5.65 ± 0.661
0.595ArgCys: 0.595 ± 0.185
3.643ArgAsp: 3.643 ± 0.679
3.643ArgGlu: 3.643 ± 0.565
2.602ArgPhe: 2.602 ± 0.443
4.461ArgGly: 4.461 ± 0.554
1.264ArgHis: 1.264 ± 0.267
3.42ArgIle: 3.42 ± 0.698
4.461ArgLys: 4.461 ± 0.699
4.758ArgLeu: 4.758 ± 0.444
1.636ArgMet: 1.636 ± 0.383
2.825ArgAsn: 2.825 ± 0.517
2.676ArgPro: 2.676 ± 0.476
2.453ArgGln: 2.453 ± 0.38
4.609ArgArg: 4.609 ± 0.742
2.453ArgSer: 2.453 ± 0.381
2.899ArgThr: 2.899 ± 0.446
4.907ArgVal: 4.907 ± 0.684
0.966ArgTrp: 0.966 ± 0.286
1.859ArgTyr: 1.859 ± 0.358
0.0ArgXaa: 0.0 ± 0.0
Ser
4.163SerAla: 4.163 ± 0.543
0.446SerCys: 0.446 ± 0.183
3.048SerAsp: 3.048 ± 0.545
3.122SerGlu: 3.122 ± 0.535
1.859SerPhe: 1.859 ± 0.385
5.427SerGly: 5.427 ± 0.789
0.669SerHis: 0.669 ± 0.19
2.305SerIle: 2.305 ± 0.418
3.122SerLys: 3.122 ± 0.533
3.494SerLeu: 3.494 ± 0.494
1.041SerMet: 1.041 ± 0.291
2.156SerAsn: 2.156 ± 0.351
1.561SerPro: 1.561 ± 0.247
2.082SerGln: 2.082 ± 0.36
2.379SerArg: 2.379 ± 0.415
2.453SerSer: 2.453 ± 0.433
3.42SerThr: 3.42 ± 0.476
3.792SerVal: 3.792 ± 0.505
0.892SerTrp: 0.892 ± 0.34
1.264SerTyr: 1.264 ± 0.328
0.0SerXaa: 0.0 ± 0.0
Thr
6.691ThrAla: 6.691 ± 0.742
0.52ThrCys: 0.52 ± 0.187
3.94ThrAsp: 3.94 ± 0.615
3.42ThrGlu: 3.42 ± 0.597
2.676ThrPhe: 2.676 ± 0.449
4.535ThrGly: 4.535 ± 0.586
0.52ThrHis: 0.52 ± 0.153
2.602ThrIle: 2.602 ± 0.412
3.42ThrLys: 3.42 ± 0.541
5.13ThrLeu: 5.13 ± 0.606
1.784ThrMet: 1.784 ± 0.364
2.899ThrAsn: 2.899 ± 0.473
3.197ThrPro: 3.197 ± 0.514
2.23ThrGln: 2.23 ± 0.535
3.569ThrArg: 3.569 ± 0.45
3.122ThrSer: 3.122 ± 0.664
2.676ThrThr: 2.676 ± 0.472
4.015ThrVal: 4.015 ± 0.546
1.115ThrTrp: 1.115 ± 0.242
1.561ThrTyr: 1.561 ± 0.4
0.0ThrXaa: 0.0 ± 0.0
Val
7.806ValAla: 7.806 ± 0.627
0.595ValCys: 0.595 ± 0.22
5.353ValAsp: 5.353 ± 0.637
5.576ValGlu: 5.576 ± 0.585
2.082ValPhe: 2.082 ± 0.384
4.684ValGly: 4.684 ± 0.659
0.669ValHis: 0.669 ± 0.169
4.461ValIle: 4.461 ± 0.764
4.758ValLys: 4.758 ± 0.706
4.238ValLeu: 4.238 ± 0.457
1.933ValMet: 1.933 ± 0.39
3.494ValAsn: 3.494 ± 0.556
2.751ValPro: 2.751 ± 0.434
2.676ValGln: 2.676 ± 0.392
5.724ValArg: 5.724 ± 0.599
3.345ValSer: 3.345 ± 0.539
4.386ValThr: 4.386 ± 0.525
6.096ValVal: 6.096 ± 0.76
0.669ValTrp: 0.669 ± 0.214
1.859ValTyr: 1.859 ± 0.463
0.0ValXaa: 0.0 ± 0.0
Trp
2.305TrpAla: 2.305 ± 0.575
0.446TrpCys: 0.446 ± 0.147
0.892TrpAsp: 0.892 ± 0.181
1.041TrpGlu: 1.041 ± 0.287
0.446TrpPhe: 0.446 ± 0.214
0.743TrpGly: 0.743 ± 0.254
0.372TrpHis: 0.372 ± 0.137
0.446TrpIle: 0.446 ± 0.187
0.669TrpLys: 0.669 ± 0.186
0.892TrpLeu: 0.892 ± 0.291
0.669TrpMet: 0.669 ± 0.221
0.743TrpAsn: 0.743 ± 0.266
0.818TrpPro: 0.818 ± 0.262
0.372TrpGln: 0.372 ± 0.158
1.115TrpArg: 1.115 ± 0.262
0.669TrpSer: 0.669 ± 0.187
0.743TrpThr: 0.743 ± 0.202
1.264TrpVal: 1.264 ± 0.264
0.52TrpTrp: 0.52 ± 0.164
0.446TrpTyr: 0.446 ± 0.16
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.163TyrAla: 4.163 ± 0.383
0.669TyrCys: 0.669 ± 0.236
2.082TyrAsp: 2.082 ± 0.392
1.636TyrGlu: 1.636 ± 0.417
0.818TyrPhe: 0.818 ± 0.202
2.825TyrGly: 2.825 ± 0.492
0.297TyrHis: 0.297 ± 0.133
0.966TyrIle: 0.966 ± 0.269
1.636TyrLys: 1.636 ± 0.339
1.933TyrLeu: 1.933 ± 0.426
0.818TyrMet: 0.818 ± 0.274
0.966TyrAsn: 0.966 ± 0.239
1.264TyrPro: 1.264 ± 0.376
0.966TyrGln: 0.966 ± 0.324
2.082TyrArg: 2.082 ± 0.458
1.859TyrSer: 1.859 ± 0.39
2.751TyrThr: 2.751 ± 0.419
1.19TyrVal: 1.19 ± 0.281
0.52TyrTrp: 0.52 ± 0.218
0.966TyrTyr: 0.966 ± 0.231
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 58 proteins (13452 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski