Amino acid dipepetide frequency for Salmonella phage Lumpael

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
14.868AlaAla: 14.868 ± 2.112
0.545AlaCys: 0.545 ± 0.225
6.539AlaAsp: 6.539 ± 0.823
8.641AlaGlu: 8.641 ± 1.249
3.503AlaPhe: 3.503 ± 0.507
9.03AlaGly: 9.03 ± 1.35
1.479AlaHis: 1.479 ± 0.297
5.916AlaIle: 5.916 ± 0.581
5.683AlaLys: 5.683 ± 0.795
9.964AlaLeu: 9.964 ± 1.165
2.958AlaMet: 2.958 ± 0.448
4.437AlaAsn: 4.437 ± 0.782
3.503AlaPro: 3.503 ± 0.524
5.916AlaGln: 5.916 ± 1.228
7.395AlaArg: 7.395 ± 1.031
5.449AlaSer: 5.449 ± 0.645
4.982AlaThr: 4.982 ± 0.857
5.916AlaVal: 5.916 ± 0.733
1.09AlaTrp: 1.09 ± 0.304
3.581AlaTyr: 3.581 ± 0.434
0.0AlaXaa: 0.0 ± 0.0
Cys
1.012CysAla: 1.012 ± 0.355
0.156CysCys: 0.156 ± 0.099
0.389CysAsp: 0.389 ± 0.17
0.467CysGlu: 0.467 ± 0.2
0.311CysPhe: 0.311 ± 0.153
0.934CysGly: 0.934 ± 0.311
0.311CysHis: 0.311 ± 0.163
0.078CysIle: 0.078 ± 0.079
0.545CysLys: 0.545 ± 0.201
0.701CysLeu: 0.701 ± 0.28
0.311CysMet: 0.311 ± 0.167
0.311CysAsn: 0.311 ± 0.14
0.389CysPro: 0.389 ± 0.169
0.234CysGln: 0.234 ± 0.136
1.479CysArg: 1.479 ± 0.444
0.156CysSer: 0.156 ± 0.088
0.545CysThr: 0.545 ± 0.226
0.856CysVal: 0.856 ± 0.269
0.078CysTrp: 0.078 ± 0.061
0.234CysTyr: 0.234 ± 0.126
0.0CysXaa: 0.0 ± 0.0
Asp
7.473AspAla: 7.473 ± 0.78
0.545AspCys: 0.545 ± 0.232
3.503AspAsp: 3.503 ± 0.512
4.437AspGlu: 4.437 ± 0.714
1.557AspPhe: 1.557 ± 0.307
5.683AspGly: 5.683 ± 0.642
1.713AspHis: 1.713 ± 0.423
2.335AspIle: 2.335 ± 0.49
2.802AspLys: 2.802 ± 0.548
5.216AspLeu: 5.216 ± 0.703
1.09AspMet: 1.09 ± 0.281
3.27AspAsn: 3.27 ± 0.614
3.97AspPro: 3.97 ± 0.528
3.347AspGln: 3.347 ± 0.631
3.503AspArg: 3.503 ± 0.55
2.413AspSer: 2.413 ± 0.476
3.347AspThr: 3.347 ± 0.56
3.814AspVal: 3.814 ± 0.544
1.946AspTrp: 1.946 ± 0.381
1.79AspTyr: 1.79 ± 0.406
0.0AspXaa: 0.0 ± 0.0
Glu
8.563GluAla: 8.563 ± 1.067
0.701GluCys: 0.701 ± 0.196
4.593GluAsp: 4.593 ± 0.804
4.048GluGlu: 4.048 ± 0.702
1.868GluPhe: 1.868 ± 0.416
3.659GluGly: 3.659 ± 0.568
0.934GluHis: 0.934 ± 0.253
3.737GluIle: 3.737 ± 0.653
3.892GluLys: 3.892 ± 0.574
5.527GluLeu: 5.527 ± 0.711
2.024GluMet: 2.024 ± 0.459
2.413GluAsn: 2.413 ± 0.533
2.88GluPro: 2.88 ± 0.51
3.892GluGln: 3.892 ± 0.758
4.671GluArg: 4.671 ± 0.716
3.27GluSer: 3.27 ± 0.712
2.647GluThr: 2.647 ± 0.496
3.503GluVal: 3.503 ± 0.602
1.479GluTrp: 1.479 ± 0.319
1.79GluTyr: 1.79 ± 0.336
0.0GluXaa: 0.0 ± 0.0
Phe
2.18PheAla: 2.18 ± 0.423
0.311PheCys: 0.311 ± 0.143
2.335PheAsp: 2.335 ± 0.51
2.024PheGlu: 2.024 ± 0.458
0.623PhePhe: 0.623 ± 0.16
2.958PheGly: 2.958 ± 0.417
0.467PheHis: 0.467 ± 0.189
1.479PheIle: 1.479 ± 0.314
1.946PheLys: 1.946 ± 0.353
2.18PheLeu: 2.18 ± 0.428
1.012PheMet: 1.012 ± 0.306
1.557PheAsn: 1.557 ± 0.275
1.012PhePro: 1.012 ± 0.261
0.934PheGln: 0.934 ± 0.289
2.102PheArg: 2.102 ± 0.418
1.946PheSer: 1.946 ± 0.409
1.713PheThr: 1.713 ± 0.561
2.102PheVal: 2.102 ± 0.443
1.479PheTrp: 1.479 ± 0.306
0.701PheTyr: 0.701 ± 0.278
0.0PheXaa: 0.0 ± 0.0
Gly
6.928GlyAla: 6.928 ± 1.196
0.389GlyCys: 0.389 ± 0.154
3.503GlyAsp: 3.503 ± 0.486
5.449GlyGlu: 5.449 ± 0.782
3.036GlyPhe: 3.036 ± 0.399
8.485GlyGly: 8.485 ± 0.986
1.012GlyHis: 1.012 ± 0.264
4.671GlyIle: 4.671 ± 0.766
4.671GlyLys: 4.671 ± 0.659
4.671GlyLeu: 4.671 ± 0.549
1.635GlyMet: 1.635 ± 0.381
3.192GlyAsn: 3.192 ± 0.509
3.347GlyPro: 3.347 ± 0.525
3.737GlyGln: 3.737 ± 0.715
5.216GlyArg: 5.216 ± 0.441
5.838GlySer: 5.838 ± 0.706
4.982GlyThr: 4.982 ± 0.869
4.904GlyVal: 4.904 ± 0.635
0.701GlyTrp: 0.701 ± 0.191
3.114GlyTyr: 3.114 ± 0.451
0.0GlyXaa: 0.0 ± 0.0
His
1.246HisAla: 1.246 ± 0.295
0.311HisCys: 0.311 ± 0.122
1.09HisAsp: 1.09 ± 0.298
1.635HisGlu: 1.635 ± 0.417
0.856HisPhe: 0.856 ± 0.267
0.623HisGly: 0.623 ± 0.228
0.623HisHis: 0.623 ± 0.265
0.934HisIle: 0.934 ± 0.285
0.856HisLys: 0.856 ± 0.241
1.479HisLeu: 1.479 ± 0.363
0.545HisMet: 0.545 ± 0.213
0.545HisAsn: 0.545 ± 0.196
1.323HisPro: 1.323 ± 0.425
0.701HisGln: 0.701 ± 0.229
0.701HisArg: 0.701 ± 0.243
1.09HisSer: 1.09 ± 0.224
1.012HisThr: 1.012 ± 0.274
0.856HisVal: 0.856 ± 0.219
0.778HisTrp: 0.778 ± 0.234
0.467HisTyr: 0.467 ± 0.193
0.0HisXaa: 0.0 ± 0.0
Ile
5.916IleAla: 5.916 ± 0.637
0.623IleCys: 0.623 ± 0.194
3.737IleAsp: 3.737 ± 0.425
3.27IleGlu: 3.27 ± 0.583
1.168IlePhe: 1.168 ± 0.304
3.114IleGly: 3.114 ± 0.485
0.856IleHis: 0.856 ± 0.305
2.569IleIle: 2.569 ± 0.633
1.946IleLys: 1.946 ± 0.377
3.425IleLeu: 3.425 ± 0.708
2.18IleMet: 2.18 ± 0.396
1.323IleAsn: 1.323 ± 0.315
3.036IlePro: 3.036 ± 0.563
1.79IleGln: 1.79 ± 0.438
3.425IleArg: 3.425 ± 0.476
2.258IleSer: 2.258 ± 0.331
3.27IleThr: 3.27 ± 0.844
3.192IleVal: 3.192 ± 0.555
0.701IleTrp: 0.701 ± 0.308
1.868IleTyr: 1.868 ± 0.328
0.0IleXaa: 0.0 ± 0.0
Lys
7.317LysAla: 7.317 ± 0.962
0.389LysCys: 0.389 ± 0.184
3.814LysAsp: 3.814 ± 0.664
2.88LysGlu: 2.88 ± 0.602
1.168LysPhe: 1.168 ± 0.266
3.114LysGly: 3.114 ± 0.439
1.323LysHis: 1.323 ± 0.284
2.18LysIle: 2.18 ± 0.44
4.204LysLys: 4.204 ± 0.717
3.737LysLeu: 3.737 ± 0.48
1.868LysMet: 1.868 ± 0.438
2.88LysAsn: 2.88 ± 0.435
2.102LysPro: 2.102 ± 0.344
2.725LysGln: 2.725 ± 0.388
3.503LysArg: 3.503 ± 0.57
1.635LysSer: 1.635 ± 0.313
2.725LysThr: 2.725 ± 0.537
3.814LysVal: 3.814 ± 0.559
0.856LysTrp: 0.856 ± 0.243
1.868LysTyr: 1.868 ± 0.471
0.0LysXaa: 0.0 ± 0.0
Leu
7.395LeuAla: 7.395 ± 0.857
0.934LeuCys: 0.934 ± 0.315
5.138LeuAsp: 5.138 ± 0.59
5.216LeuGlu: 5.216 ± 0.757
1.868LeuPhe: 1.868 ± 0.353
5.838LeuGly: 5.838 ± 0.695
1.246LeuHis: 1.246 ± 0.27
3.347LeuIle: 3.347 ± 0.551
3.503LeuLys: 3.503 ± 0.626
5.761LeuLeu: 5.761 ± 0.661
2.102LeuMet: 2.102 ± 0.464
3.581LeuAsn: 3.581 ± 0.571
4.048LeuPro: 4.048 ± 0.461
2.88LeuGln: 2.88 ± 0.683
4.982LeuArg: 4.982 ± 0.71
4.204LeuSer: 4.204 ± 0.631
5.293LeuThr: 5.293 ± 0.678
5.371LeuVal: 5.371 ± 0.608
1.168LeuTrp: 1.168 ± 0.282
2.024LeuTyr: 2.024 ± 0.43
0.0LeuXaa: 0.0 ± 0.0
Met
4.204MetAla: 4.204 ± 0.588
0.311MetCys: 0.311 ± 0.153
1.557MetAsp: 1.557 ± 0.524
1.246MetGlu: 1.246 ± 0.337
1.323MetPhe: 1.323 ± 0.362
1.323MetGly: 1.323 ± 0.324
0.311MetHis: 0.311 ± 0.231
2.413MetIle: 2.413 ± 0.379
1.246MetLys: 1.246 ± 0.323
1.557MetLeu: 1.557 ± 0.321
0.701MetMet: 0.701 ± 0.285
0.701MetAsn: 0.701 ± 0.21
1.713MetPro: 1.713 ± 0.308
1.635MetGln: 1.635 ± 0.418
1.479MetArg: 1.479 ± 0.351
1.401MetSer: 1.401 ± 0.324
1.401MetThr: 1.401 ± 0.315
1.401MetVal: 1.401 ± 0.401
0.311MetTrp: 0.311 ± 0.155
0.856MetTyr: 0.856 ± 0.196
0.0MetXaa: 0.0 ± 0.0
Asn
4.826AsnAla: 4.826 ± 0.806
0.311AsnCys: 0.311 ± 0.251
2.18AsnAsp: 2.18 ± 0.435
1.868AsnGlu: 1.868 ± 0.386
1.401AsnPhe: 1.401 ± 0.41
4.281AsnGly: 4.281 ± 0.49
1.012AsnHis: 1.012 ± 0.285
1.946AsnIle: 1.946 ± 0.419
2.102AsnLys: 2.102 ± 0.339
3.581AsnLeu: 3.581 ± 0.493
1.246AsnMet: 1.246 ± 0.308
1.946AsnAsn: 1.946 ± 0.475
2.725AsnPro: 2.725 ± 0.505
1.868AsnGln: 1.868 ± 0.391
2.258AsnArg: 2.258 ± 0.556
2.18AsnSer: 2.18 ± 0.562
2.18AsnThr: 2.18 ± 0.6
3.114AsnVal: 3.114 ± 0.733
0.701AsnTrp: 0.701 ± 0.234
1.012AsnTyr: 1.012 ± 0.295
0.0AsnXaa: 0.0 ± 0.0
Pro
5.605ProAla: 5.605 ± 0.59
0.389ProCys: 0.389 ± 0.224
4.204ProAsp: 4.204 ± 0.547
4.515ProGlu: 4.515 ± 0.578
1.635ProPhe: 1.635 ± 0.294
4.359ProGly: 4.359 ± 0.731
0.623ProHis: 0.623 ± 0.273
2.258ProIle: 2.258 ± 0.452
1.868ProLys: 1.868 ± 0.362
2.725ProLeu: 2.725 ± 0.48
0.778ProMet: 0.778 ± 0.263
1.79ProAsn: 1.79 ± 0.372
1.946ProPro: 1.946 ± 0.381
1.557ProGln: 1.557 ± 0.388
2.18ProArg: 2.18 ± 0.419
2.024ProSer: 2.024 ± 0.352
3.581ProThr: 3.581 ± 0.399
3.036ProVal: 3.036 ± 0.58
0.778ProTrp: 0.778 ± 0.208
1.868ProTyr: 1.868 ± 0.428
0.0ProXaa: 0.0 ± 0.0
Gln
6.305GlnAla: 6.305 ± 0.963
0.311GlnCys: 0.311 ± 0.198
2.88GlnAsp: 2.88 ± 0.453
2.413GlnGlu: 2.413 ± 0.508
1.012GlnPhe: 1.012 ± 0.278
2.491GlnGly: 2.491 ± 0.574
0.623GlnHis: 0.623 ± 0.253
2.024GlnIle: 2.024 ± 0.427
2.18GlnLys: 2.18 ± 0.46
3.814GlnLeu: 3.814 ± 0.701
1.946GlnMet: 1.946 ± 0.424
2.335GlnAsn: 2.335 ± 0.612
2.413GlnPro: 2.413 ± 0.428
4.281GlnGln: 4.281 ± 0.819
3.036GlnArg: 3.036 ± 0.56
1.635GlnSer: 1.635 ± 0.368
3.27GlnThr: 3.27 ± 0.587
2.491GlnVal: 2.491 ± 0.482
0.623GlnTrp: 0.623 ± 0.186
0.856GlnTyr: 0.856 ± 0.26
0.0GlnXaa: 0.0 ± 0.0
Arg
4.982ArgAla: 4.982 ± 0.573
0.545ArgCys: 0.545 ± 0.271
4.437ArgAsp: 4.437 ± 0.455
4.515ArgGlu: 4.515 ± 0.789
2.802ArgPhe: 2.802 ± 0.535
4.281ArgGly: 4.281 ± 0.575
1.479ArgHis: 1.479 ± 0.4
3.814ArgIle: 3.814 ± 0.549
4.281ArgLys: 4.281 ± 0.601
4.826ArgLeu: 4.826 ± 0.76
1.323ArgMet: 1.323 ± 0.309
2.335ArgAsn: 2.335 ± 0.453
2.102ArgPro: 2.102 ± 0.464
3.27ArgGln: 3.27 ± 0.612
2.958ArgArg: 2.958 ± 0.539
3.036ArgSer: 3.036 ± 0.505
2.491ArgThr: 2.491 ± 0.519
3.659ArgVal: 3.659 ± 0.468
0.623ArgTrp: 0.623 ± 0.185
2.258ArgTyr: 2.258 ± 0.341
0.0ArgXaa: 0.0 ± 0.0
Ser
5.371SerAla: 5.371 ± 0.549
0.467SerCys: 0.467 ± 0.141
3.347SerAsp: 3.347 ± 0.45
2.958SerGlu: 2.958 ± 0.579
1.713SerPhe: 1.713 ± 0.428
4.437SerGly: 4.437 ± 0.666
0.701SerHis: 0.701 ± 0.274
2.024SerIle: 2.024 ± 0.404
3.114SerLys: 3.114 ± 0.406
3.503SerLeu: 3.503 ± 0.443
1.09SerMet: 1.09 ± 0.304
1.79SerAsn: 1.79 ± 0.352
2.18SerPro: 2.18 ± 0.377
1.946SerGln: 1.946 ± 0.457
2.958SerArg: 2.958 ± 0.508
3.036SerSer: 3.036 ± 0.99
3.425SerThr: 3.425 ± 0.559
3.814SerVal: 3.814 ± 0.488
0.778SerTrp: 0.778 ± 0.239
1.168SerTyr: 1.168 ± 0.353
0.0SerXaa: 0.0 ± 0.0
Thr
5.683ThrAla: 5.683 ± 0.821
0.623ThrCys: 0.623 ± 0.257
3.814ThrAsp: 3.814 ± 0.467
3.814ThrGlu: 3.814 ± 0.656
1.401ThrPhe: 1.401 ± 0.323
6.072ThrGly: 6.072 ± 0.657
1.012ThrHis: 1.012 ± 0.278
2.802ThrIle: 2.802 ± 0.45
2.88ThrLys: 2.88 ± 0.386
4.749ThrLeu: 4.749 ± 0.616
1.557ThrMet: 1.557 ± 0.298
2.413ThrAsn: 2.413 ± 0.426
4.437ThrPro: 4.437 ± 0.53
2.258ThrGln: 2.258 ± 0.378
2.258ThrArg: 2.258 ± 0.503
2.802ThrSer: 2.802 ± 0.592
3.892ThrThr: 3.892 ± 0.623
3.503ThrVal: 3.503 ± 0.717
0.701ThrTrp: 0.701 ± 0.287
1.635ThrTyr: 1.635 ± 0.307
0.0ThrXaa: 0.0 ± 0.0
Val
6.928ValAla: 6.928 ± 0.715
0.701ValCys: 0.701 ± 0.259
3.892ValAsp: 3.892 ± 0.56
3.347ValGlu: 3.347 ± 0.467
2.102ValPhe: 2.102 ± 0.367
5.138ValGly: 5.138 ± 0.79
0.856ValHis: 0.856 ± 0.216
3.581ValIle: 3.581 ± 0.511
4.826ValLys: 4.826 ± 0.62
3.97ValLeu: 3.97 ± 0.513
1.323ValMet: 1.323 ± 0.272
3.036ValAsn: 3.036 ± 0.484
3.036ValPro: 3.036 ± 0.536
2.569ValGln: 2.569 ± 0.45
2.958ValArg: 2.958 ± 0.517
2.725ValSer: 2.725 ± 0.379
3.97ValThr: 3.97 ± 0.501
4.204ValVal: 4.204 ± 0.689
0.778ValTrp: 0.778 ± 0.249
2.102ValTyr: 2.102 ± 0.528
0.0ValXaa: 0.0 ± 0.0
Trp
1.09TrpAla: 1.09 ± 0.256
0.311TrpCys: 0.311 ± 0.136
1.09TrpAsp: 1.09 ± 0.293
1.401TrpGlu: 1.401 ± 0.3
0.623TrpPhe: 0.623 ± 0.283
1.401TrpGly: 1.401 ± 0.334
0.467TrpHis: 0.467 ± 0.188
0.623TrpIle: 0.623 ± 0.18
0.467TrpLys: 0.467 ± 0.211
1.401TrpLeu: 1.401 ± 0.289
0.623TrpMet: 0.623 ± 0.218
1.09TrpAsn: 1.09 ± 0.263
0.467TrpPro: 0.467 ± 0.194
0.545TrpGln: 0.545 ± 0.216
1.323TrpArg: 1.323 ± 0.301
0.545TrpSer: 0.545 ± 0.178
1.401TrpThr: 1.401 ± 0.406
0.545TrpVal: 0.545 ± 0.228
0.078TrpTrp: 0.078 ± 0.061
0.311TrpTyr: 0.311 ± 0.183
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.503TyrAla: 3.503 ± 0.525
0.623TyrCys: 0.623 ± 0.234
1.713TyrAsp: 1.713 ± 0.386
1.946TyrGlu: 1.946 ± 0.385
1.09TyrPhe: 1.09 ± 0.372
2.258TyrGly: 2.258 ± 0.489
0.623TyrHis: 0.623 ± 0.211
0.934TyrIle: 0.934 ± 0.383
1.168TyrLys: 1.168 ± 0.317
3.114TyrLeu: 3.114 ± 0.439
0.701TyrMet: 0.701 ± 0.219
1.79TyrAsn: 1.79 ± 0.453
1.09TyrPro: 1.09 ± 0.397
0.856TyrGln: 0.856 ± 0.234
1.635TyrArg: 1.635 ± 0.285
2.18TyrSer: 2.18 ± 0.372
2.18TyrThr: 2.18 ± 0.449
1.946TyrVal: 1.946 ± 0.38
0.156TyrTrp: 0.156 ± 0.101
1.401TyrTyr: 1.401 ± 0.325
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 58 proteins (12847 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski