Amino acid dipeptide frequency for Salmonella phage SS3e

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
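As an illustration of the calculation described above, here is a minimal Python sketch that counts overlapping dipeptides in a set of protein sequences and compares each frequency with the uniform expectation of 1/400. The function names (`dipeptide_permille`, `classify`), the one-letter alphabet, and the choice to normalise by the total number of dipeptides are assumptions made for this example; the database may use a different procedure.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code (assumed alphabet).
# Any other symbol is mapped to "X", standing in for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs."""
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


def classify(freq_permille, expected=1000.0 / 400):
    """Label a frequency relative to the uniform expectation (2.5 per mille)."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"
```

With the values from the table, `classify(11.379)` (AlaAla) yields `"over"` and `classify(0.176)` (CysCys) yields `"under"`.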

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
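The unit conversion can be sketched in a few lines of Python. The helper names below are hypothetical and serve only to make the arithmetic explicit.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0
```

For example, the AlaAla entry of 11.379‰ corresponds to a fraction of about 0.011379, and the uniform expectation of 2.5‰ corresponds to 0.25%.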

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 11.379 ± 1.419
AlaCys: 1.411 ± 0.315
AlaAsp: 6.351 ± 0.697
AlaGlu: 5.733 ± 0.908
AlaPhe: 3.793 ± 0.509
AlaGly: 8.115 ± 0.77
AlaHis: 1.764 ± 0.455
AlaIle: 4.234 ± 0.787
AlaLys: 5.469 ± 0.797
AlaLeu: 8.468 ± 0.949
AlaMet: 2.47 ± 0.38
AlaAsn: 3.264 ± 0.499
AlaPro: 3.44 ± 0.497
AlaGln: 3.352 ± 0.652
AlaArg: 4.587 ± 0.556
AlaSer: 5.998 ± 0.908
AlaThr: 6.527 ± 0.85
AlaVal: 6.704 ± 0.773
AlaTrp: 1.411 ± 0.285
AlaTyr: 3.616 ± 0.548
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.882 ± 0.283
CysCys: 0.176 ± 0.119
CysAsp: 0.882 ± 0.266
CysGlu: 1.411 ± 0.437
CysPhe: 0.265 ± 0.133
CysGly: 0.882 ± 0.316
CysHis: 0.088 ± 0.08
CysIle: 0.176 ± 0.108
CysLys: 0.617 ± 0.197
CysLeu: 0.794 ± 0.304
CysMet: 0.088 ± 0.088
CysAsn: 0.617 ± 0.291
CysPro: 0.265 ± 0.16
CysGln: 0.265 ± 0.154
CysArg: 0.706 ± 0.229
CysSer: 0.176 ± 0.119
CysThr: 0.617 ± 0.191
CysVal: 0.794 ± 0.291
CysTrp: 0.176 ± 0.108
CysTyr: 0.353 ± 0.212
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.233 ± 0.702
AspCys: 0.794 ± 0.261
AspAsp: 3.793 ± 0.488
AspGlu: 3.616 ± 0.544
AspPhe: 2.911 ± 0.439
AspGly: 6.263 ± 0.915
AspHis: 0.706 ± 0.262
AspIle: 3.793 ± 0.516
AspLys: 3.352 ± 0.413
AspLeu: 5.381 ± 0.526
AspMet: 1.764 ± 0.313
AspAsn: 2.911 ± 0.523
AspPro: 1.852 ± 0.416
AspGln: 0.617 ± 0.266
AspArg: 3.087 ± 0.458
AspSer: 3.705 ± 0.575
AspThr: 4.41 ± 0.522
AspVal: 3.352 ± 0.515
AspTrp: 0.706 ± 0.214
AspTyr: 2.029 ± 0.466
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.704 ± 0.908
GluCys: 0.353 ± 0.151
GluAsp: 4.234 ± 0.64
GluGlu: 5.469 ± 1.152
GluPhe: 3.528 ± 0.773
GluGly: 4.851 ± 0.852
GluHis: 0.97 ± 0.28
GluIle: 3.44 ± 0.439
GluLys: 4.322 ± 0.701
GluLeu: 5.998 ± 0.763
GluMet: 2.734 ± 0.428
GluAsn: 2.382 ± 0.475
GluPro: 1.852 ± 0.602
GluGln: 3.528 ± 0.693
GluArg: 3.881 ± 0.75
GluSer: 3.705 ± 0.639
GluThr: 3.528 ± 0.55
GluVal: 5.292 ± 0.542
GluTrp: 1.058 ± 0.354
GluTyr: 1.588 ± 0.41
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.175 ± 0.521
PheCys: 0.529 ± 0.195
PheAsp: 3.087 ± 0.511
PheGlu: 2.999 ± 0.544
PhePhe: 0.529 ± 0.173
PheGly: 2.646 ± 0.448
PheHis: 0.353 ± 0.175
PheIle: 2.47 ± 0.448
PheLys: 1.411 ± 0.354
PheLeu: 2.293 ± 0.434
PheMet: 0.441 ± 0.193
PheAsn: 1.5 ± 0.425
PhePro: 1.676 ± 0.494
PheGln: 1.411 ± 0.348
PheArg: 2.293 ± 0.392
PheSer: 2.205 ± 0.474
PheThr: 2.823 ± 0.411
PheVal: 2.823 ± 0.561
PheTrp: 0.882 ± 0.349
PheTyr: 0.97 ± 0.309
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.85 ± 0.845
GlyCys: 0.706 ± 0.222
GlyAsp: 4.234 ± 0.69
GlyGlu: 6.174 ± 0.914
GlyPhe: 2.999 ± 0.582
GlyGly: 6.616 ± 0.887
GlyHis: 1.058 ± 0.357
GlyIle: 3.616 ± 0.494
GlyLys: 5.204 ± 0.595
GlyLeu: 5.028 ± 0.579
GlyMet: 2.205 ± 0.67
GlyAsn: 3.705 ± 0.516
GlyPro: 1.676 ± 0.407
GlyGln: 3.087 ± 0.455
GlyArg: 4.94 ± 0.666
GlySer: 5.204 ± 0.952
GlyThr: 3.528 ± 0.568
GlyVal: 5.469 ± 0.772
GlyTrp: 1.235 ± 0.355
GlyTyr: 2.911 ± 0.524
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.97 ± 0.361
HisCys: 0.353 ± 0.173
HisAsp: 0.882 ± 0.255
HisGlu: 0.97 ± 0.283
HisPhe: 0.706 ± 0.296
HisGly: 0.353 ± 0.178
HisHis: 0.529 ± 0.246
HisIle: 0.882 ± 0.299
HisLys: 1.235 ± 0.287
HisLeu: 1.058 ± 0.263
HisMet: 0.617 ± 0.266
HisAsn: 0.617 ± 0.254
HisPro: 1.147 ± 0.346
HisGln: 0.794 ± 0.24
HisArg: 0.97 ± 0.306
HisSer: 0.882 ± 0.222
HisThr: 0.794 ± 0.376
HisVal: 0.706 ± 0.238
HisTrp: 0.0 ± 0.0
HisTyr: 0.97 ± 0.307
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.41 ± 0.65
IleCys: 0.617 ± 0.313
IleAsp: 3.969 ± 0.613
IleGlu: 3.175 ± 0.489
IlePhe: 1.411 ± 0.316
IleGly: 3.352 ± 0.442
IleHis: 0.794 ± 0.217
IleIle: 2.029 ± 0.563
IleLys: 3.087 ± 0.623
IleLeu: 3.352 ± 0.529
IleMet: 1.058 ± 0.334
IleAsn: 2.47 ± 0.455
IlePro: 2.823 ± 0.487
IleGln: 1.764 ± 0.449
IleArg: 2.558 ± 0.325
IleSer: 2.646 ± 0.511
IleThr: 3.969 ± 0.508
IleVal: 3.528 ± 0.498
IleTrp: 0.97 ± 0.284
IleTyr: 1.411 ± 0.378
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.822 ± 0.831
LysCys: 0.706 ± 0.328
LysAsp: 4.146 ± 0.656
LysGlu: 4.322 ± 0.704
LysPhe: 1.941 ± 0.345
LysGly: 3.616 ± 0.53
LysHis: 1.058 ± 0.301
LysIle: 1.5 ± 0.293
LysLys: 3.175 ± 0.574
LysLeu: 4.41 ± 0.675
LysMet: 2.646 ± 0.538
LysAsn: 2.47 ± 0.426
LysPro: 2.47 ± 0.614
LysGln: 2.646 ± 0.471
LysArg: 3.793 ± 0.714
LysSer: 2.911 ± 0.591
LysThr: 3.705 ± 0.498
LysVal: 3.705 ± 0.556
LysTrp: 0.794 ± 0.232
LysTyr: 2.646 ± 0.564
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.616 ± 0.737
LeuCys: 0.441 ± 0.218
LeuAsp: 4.234 ± 0.561
LeuGlu: 5.381 ± 0.825
LeuPhe: 1.588 ± 0.408
LeuGly: 4.41 ± 0.534
LeuHis: 1.235 ± 0.391
LeuIle: 4.499 ± 0.518
LeuLys: 5.292 ± 0.686
LeuLeu: 6.263 ± 0.87
LeuMet: 2.205 ± 0.42
LeuAsn: 4.763 ± 0.676
LeuPro: 3.264 ± 0.558
LeuGln: 3.087 ± 0.677
LeuArg: 5.557 ± 0.713
LeuSer: 4.499 ± 0.545
LeuThr: 5.381 ± 0.537
LeuVal: 5.381 ± 0.612
LeuTrp: 1.058 ± 0.31
LeuTyr: 2.382 ± 0.404
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.47 ± 0.38
MetCys: 0.441 ± 0.226
MetAsp: 1.411 ± 0.377
MetGlu: 1.5 ± 0.329
MetPhe: 0.882 ± 0.308
MetGly: 2.205 ± 0.537
MetHis: 0.265 ± 0.126
MetIle: 1.147 ± 0.285
MetLys: 1.235 ± 0.345
MetLeu: 2.117 ± 0.432
MetMet: 0.617 ± 0.233
MetAsn: 0.97 ± 0.275
MetPro: 1.5 ± 0.434
MetGln: 0.706 ± 0.258
MetArg: 1.147 ± 0.277
MetSer: 2.205 ± 0.358
MetThr: 1.941 ± 0.343
MetVal: 1.764 ± 0.355
MetTrp: 0.706 ± 0.232
MetTyr: 0.794 ± 0.297
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.146 ± 0.614
AsnCys: 0.441 ± 0.198
AsnAsp: 2.734 ± 0.332
AsnGlu: 2.293 ± 0.432
AsnPhe: 1.411 ± 0.316
AsnGly: 3.793 ± 0.605
AsnHis: 0.265 ± 0.15
AsnIle: 2.823 ± 0.436
AsnLys: 1.764 ± 0.461
AsnLeu: 4.058 ± 0.547
AsnMet: 0.441 ± 0.24
AsnAsn: 2.47 ± 0.528
AsnPro: 1.764 ± 0.404
AsnGln: 1.235 ± 0.321
AsnArg: 2.911 ± 0.529
AsnSer: 2.382 ± 0.389
AsnThr: 1.852 ± 0.402
AsnVal: 3.705 ± 0.453
AsnTrp: 0.97 ± 0.283
AsnTyr: 1.764 ± 0.403
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.823 ± 0.562
ProCys: 0.265 ± 0.125
ProAsp: 2.911 ± 0.537
ProGlu: 3.528 ± 0.459
ProPhe: 1.588 ± 0.277
ProGly: 3.528 ± 0.55
ProHis: 0.441 ± 0.163
ProIle: 1.411 ± 0.424
ProLys: 2.646 ± 0.554
ProLeu: 3.352 ± 0.496
ProMet: 0.706 ± 0.234
ProAsn: 1.5 ± 0.432
ProPro: 1.147 ± 0.341
ProGln: 1.411 ± 0.321
ProArg: 1.764 ± 0.348
ProSer: 2.382 ± 0.382
ProThr: 1.5 ± 0.353
ProVal: 3.793 ± 0.608
ProTrp: 0.353 ± 0.201
ProTyr: 1.235 ± 0.342
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.587 ± 0.609
GlnCys: 0.441 ± 0.238
GlnAsp: 1.764 ± 0.359
GlnGlu: 2.205 ± 0.667
GlnPhe: 1.147 ± 0.25
GlnGly: 2.205 ± 0.49
GlnHis: 0.617 ± 0.291
GlnIle: 1.764 ± 0.43
GlnLys: 2.029 ± 0.451
GlnLeu: 3.352 ± 0.554
GlnMet: 1.147 ± 0.321
GlnAsn: 2.029 ± 0.404
GlnPro: 1.941 ± 0.343
GlnGln: 2.382 ± 0.753
GlnArg: 1.852 ± 0.363
GlnSer: 1.5 ± 0.255
GlnThr: 1.588 ± 0.348
GlnVal: 2.911 ± 0.643
GlnTrp: 0.617 ± 0.21
GlnTyr: 1.411 ± 0.309
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.94 ± 0.496
ArgCys: 0.529 ± 0.211
ArgAsp: 3.264 ± 0.47
ArgGlu: 4.058 ± 0.538
ArgPhe: 2.029 ± 0.449
ArgGly: 4.146 ± 0.546
ArgHis: 0.97 ± 0.322
ArgIle: 3.528 ± 0.55
ArgLys: 4.058 ± 0.683
ArgLeu: 3.881 ± 0.518
ArgMet: 2.293 ± 0.473
ArgAsn: 2.646 ± 0.542
ArgPro: 2.205 ± 0.419
ArgGln: 3.175 ± 0.506
ArgArg: 5.116 ± 0.805
ArgSer: 2.205 ± 0.391
ArgThr: 2.646 ± 0.576
ArgVal: 4.41 ± 0.535
ArgTrp: 1.058 ± 0.288
ArgTyr: 1.235 ± 0.41
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.086 ± 1.061
SerCys: 0.265 ± 0.132
SerAsp: 3.175 ± 0.586
SerGlu: 3.44 ± 0.588
SerPhe: 2.646 ± 0.435
SerGly: 7.409 ± 1.052
SerHis: 0.794 ± 0.201
SerIle: 2.911 ± 0.544
SerLys: 2.558 ± 0.568
SerLeu: 4.851 ± 0.682
SerMet: 1.147 ± 0.306
SerAsn: 2.382 ± 0.406
SerPro: 1.588 ± 0.346
SerGln: 2.205 ± 0.42
SerArg: 3.264 ± 0.519
SerSer: 2.646 ± 0.48
SerThr: 3.969 ± 0.669
SerVal: 5.116 ± 0.726
SerTrp: 0.794 ± 0.218
SerTyr: 1.852 ± 0.45
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.263 ± 0.783
ThrCys: 0.353 ± 0.167
ThrAsp: 4.322 ± 0.54
ThrGlu: 3.44 ± 0.498
ThrPhe: 3.087 ± 0.486
ThrGly: 5.292 ± 0.71
ThrHis: 1.058 ± 0.316
ThrIle: 2.823 ± 0.449
ThrLys: 3.087 ± 0.588
ThrLeu: 4.94 ± 0.659
ThrMet: 0.882 ± 0.264
ThrAsn: 1.5 ± 0.382
ThrPro: 3.705 ± 0.415
ThrGln: 1.5 ± 0.287
ThrArg: 2.558 ± 0.393
ThrSer: 5.028 ± 0.701
ThrThr: 3.793 ± 0.515
ThrVal: 5.116 ± 0.746
ThrTrp: 0.617 ± 0.189
ThrTyr: 2.205 ± 0.487
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.498 ± 0.713
ValCys: 0.794 ± 0.253
ValAsp: 4.058 ± 0.448
ValGlu: 6.527 ± 0.702
ValPhe: 2.029 ± 0.447
ValGly: 3.969 ± 0.708
ValHis: 1.147 ± 0.309
ValIle: 4.146 ± 0.673
ValLys: 5.116 ± 0.608
ValLeu: 4.41 ± 0.564
ValMet: 1.235 ± 0.344
ValAsn: 3.087 ± 0.615
ValPro: 2.117 ± 0.56
ValGln: 2.205 ± 0.433
ValArg: 3.705 ± 0.515
ValSer: 6.174 ± 0.977
ValThr: 6.086 ± 0.654
ValVal: 5.733 ± 0.752
ValTrp: 0.794 ± 0.291
ValTyr: 2.47 ± 0.41
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.323 ± 0.429
TrpCys: 0.265 ± 0.139
TrpAsp: 1.058 ± 0.301
TrpGlu: 0.353 ± 0.148
TrpPhe: 0.794 ± 0.348
TrpGly: 1.058 ± 0.31
TrpHis: 0.265 ± 0.152
TrpIle: 0.617 ± 0.231
TrpLys: 0.529 ± 0.274
TrpLeu: 1.588 ± 0.34
TrpMet: 0.441 ± 0.216
TrpAsn: 0.882 ± 0.322
TrpPro: 0.441 ± 0.217
TrpGln: 0.529 ± 0.2
TrpArg: 1.323 ± 0.438
TrpSer: 0.529 ± 0.228
TrpThr: 0.706 ± 0.258
TrpVal: 1.235 ± 0.326
TrpTrp: 0.265 ± 0.158
TrpTyr: 0.265 ± 0.158
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.911 ± 0.508
TyrCys: 0.353 ± 0.171
TyrAsp: 1.764 ± 0.491
TyrGlu: 2.823 ± 0.546
TyrPhe: 1.235 ± 0.356
TyrGly: 2.734 ± 0.473
TyrHis: 1.058 ± 0.272
TyrIle: 1.588 ± 0.367
TyrLys: 2.293 ± 0.464
TyrLeu: 1.941 ± 0.344
TyrMet: 0.794 ± 0.245
TyrAsn: 1.058 ± 0.268
TyrPro: 1.411 ± 0.423
TyrGln: 1.588 ± 0.371
TyrArg: 2.47 ± 0.57
TyrSer: 2.117 ± 0.414
TyrThr: 2.205 ± 0.372
TyrVal: 1.764 ± 0.432
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.235 ± 0.234
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 58 proteins (11338 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
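One way such a protein-level bootstrap could look is sketched below in Python: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resampled values is reported. The function names, the fixed seed, and the use of the sample standard deviation as the error are assumptions made for this illustration; the source states only that 100 bootstrap rounds at the protein level were used.

```python
import random
from statistics import mean, stdev


def count_dipeptide(seq, dipeptide):
    """Count overlapping occurrences of a two-letter dipeptide in one sequence."""
    return sum(1 for i in range(len(seq) - 1) if seq[i:i + 2] == dipeptide)


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Each resample draws len(proteins) whole proteins with replacement,
    so the resampling unit is the protein, not the individual residue.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        pairs = sum(max(len(s) - 1, 0) for s in sample)
        hits = sum(count_dipeptide(s, dipeptide) for s in sample)
        estimates.append(1000.0 * hits / pairs if pairs else 0.0)
    return mean(estimates), stdev(estimates)
```

Resampling whole proteins rather than single dipeptides keeps the correlations within each protein intact, which is a common reason for bootstrapping at this level.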

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski