Amino acid dipeptide frequency for Salmonella phage phSE-2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins (with all 20 amino acids equally likely), each of the 400 standard dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
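The counting described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the Proteome-pI implementation: the function names (`dipeptide_counts`, `per_mille`, `classify`) are hypothetical, normalising by the total number of dipeptides is an assumption, and the red/blue marking is assumed to compare each value against the uniform expectation of 2.5‰.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)
UNIFORM_PER_MILLE = 1000.0 / 400   # 2.5 per mille, i.e. 0.25 %


def dipeptide_counts(proteins):
    """Count overlapping dipeptides inside each protein, never across proteins."""
    counts = Counter()
    for seq in proteins:
        # Assumption: every non-standard or unknown residue is mapped to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    return counts


def per_mille(counts):
    """Convert raw counts to per mille of all counted dipeptides (assumed normalisation)."""
    total = sum(counts.values())
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq, expected=UNIFORM_PER_MILLE):
    """Label a per mille frequency relative to the uniform expectation."""
    if freq > expected:
        return "over"
    if freq < expected:
        return "under"
    return "expected"


if __name__ == "__main__":
    toy = ["MKAAL", "AAK"]  # toy sequences, not taken from the phage proteome
    freqs = per_mille(dipeptide_counts(toy))
    for dp, f in sorted(freqs.items()):
        print(dp, round(f, 3), classify(f))
```

With the values from the table, `classify(8.031)` (AlaAla) returns `"over"` and `classify(0.54)` (AlaCys) returns `"under"`.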

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
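As a small worked example of the unit conversion, the sketch below converts a per mille value to a fraction and to a percentage, and compares it with the uniform expectation of 0.25%. The helper names are hypothetical and chosen only for illustration.

```python
def per_mille_to_fraction(value):
    """Per mille to plain fraction: multiply by 10^-3."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Per mille to percent: divide by 10."""
    return value / 10.0


def fold_over_uniform(value, n_dipeptides=400):
    """Ratio of an observed per mille value to the uniform expectation (1000/400 = 2.5)."""
    return value / (1000.0 / n_dipeptides)


if __name__ == "__main__":
    ala_ala = 8.031  # AlaAla value from the table, in per mille
    print(per_mille_to_fraction(ala_ala))  # fraction of all dipeptides
    print(per_mille_to_percent(ala_ala))   # same value in percent
    print(fold_over_uniform(ala_ala))      # how many times the uniform expectation
```

For AlaAla this gives a fraction of about 0.008, that is about 0.8%, or roughly 3.2 times the uniform expectation.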

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.031 ± 1.005
AlaCys: 0.54 ± 0.189
AlaAsp: 4.589 ± 0.602
AlaGlu: 5.872 ± 0.631
AlaPhe: 3.644 ± 0.421
AlaGly: 6.074 ± 0.862
AlaHis: 0.877 ± 0.263
AlaIle: 5.467 ± 0.663
AlaLys: 7.424 ± 1.229
AlaLeu: 7.424 ± 0.774
AlaMet: 2.767 ± 0.43
AlaAsn: 3.914 ± 0.605
AlaPro: 2.025 ± 0.31
AlaGln: 4.252 ± 0.522
AlaArg: 4.454 ± 0.702
AlaSer: 4.049 ± 0.543
AlaThr: 3.577 ± 0.541
AlaVal: 5.872 ± 0.784
AlaTrp: 1.012 ± 0.225
AlaTyr: 3.105 ± 0.483
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.877 ± 0.253
CysCys: 0.135 ± 0.093
CysAsp: 0.877 ± 0.21
CysGlu: 1.147 ± 0.309
CysPhe: 0.337 ± 0.17
CysGly: 1.147 ± 0.336
CysHis: 0.337 ± 0.173
CysIle: 0.607 ± 0.233
CysLys: 0.742 ± 0.232
CysLeu: 0.877 ± 0.284
CysMet: 0.607 ± 0.2
CysAsn: 0.54 ± 0.206
CysPro: 0.337 ± 0.146
CysGln: 0.337 ± 0.165
CysArg: 0.81 ± 0.254
CysSer: 0.607 ± 0.202
CysThr: 0.742 ± 0.224
CysVal: 1.282 ± 0.299
CysTrp: 0.54 ± 0.166
CysTyr: 0.405 ± 0.156
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.522 ± 0.592
AspCys: 0.607 ± 0.197
AspAsp: 3.509 ± 0.541
AspGlu: 4.859 ± 0.411
AspPhe: 2.835 ± 0.434
AspGly: 5.872 ± 0.765
AspHis: 0.877 ± 0.234
AspIle: 3.24 ± 0.408
AspLys: 4.994 ± 0.559
AspLeu: 4.657 ± 0.432
AspMet: 1.417 ± 0.323
AspAsn: 3.577 ± 0.443
AspPro: 2.295 ± 0.388
AspGln: 1.687 ± 0.391
AspArg: 2.767 ± 0.431
AspSer: 3.307 ± 0.448
AspThr: 3.779 ± 0.428
AspVal: 3.712 ± 0.499
AspTrp: 0.945 ± 0.201
AspTyr: 3.037 ± 0.412
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.467 ± 0.534
GluCys: 1.215 ± 0.327
GluAsp: 3.442 ± 0.527
GluGlu: 4.859 ± 0.73
GluPhe: 3.779 ± 0.513
GluGly: 3.442 ± 0.446
GluHis: 1.687 ± 0.32
GluIle: 4.724 ± 0.613
GluLys: 5.062 ± 0.766
GluLeu: 5.399 ± 0.609
GluMet: 2.97 ± 0.407
GluAsn: 3.577 ± 0.453
GluPro: 2.092 ± 0.448
GluGln: 3.375 ± 0.567
GluArg: 3.375 ± 0.663
GluSer: 4.522 ± 0.422
GluThr: 4.387 ± 0.512
GluVal: 4.522 ± 0.664
GluTrp: 0.742 ± 0.215
GluTyr: 3.712 ± 0.461
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.902 ± 0.557
PheCys: 0.472 ± 0.166
PheAsp: 3.644 ± 0.449
PheGlu: 3.037 ± 0.547
PhePhe: 1.215 ± 0.292
PheGly: 3.307 ± 0.572
PheHis: 0.54 ± 0.199
PheIle: 2.565 ± 0.432
PheLys: 3.24 ± 0.476
PheLeu: 2.227 ± 0.385
PheMet: 1.215 ± 0.296
PheAsn: 2.497 ± 0.323
PhePro: 1.822 ± 0.365
PheGln: 1.485 ± 0.323
PheArg: 1.552 ± 0.304
PheSer: 2.092 ± 0.45
PheThr: 2.835 ± 0.538
PheVal: 2.565 ± 0.357
PheTrp: 0.607 ± 0.208
PheTyr: 1.08 ± 0.315
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.399 ± 0.81
GlyCys: 1.147 ± 0.255
GlyAsp: 4.184 ± 0.596
GlyGlu: 5.264 ± 0.576
GlyPhe: 3.037 ± 0.51
GlyGly: 6.547 ± 0.822
GlyHis: 1.417 ± 0.304
GlyIle: 4.522 ± 0.467
GlyLys: 5.399 ± 0.622
GlyLeu: 4.387 ± 0.516
GlyMet: 2.632 ± 0.37
GlyAsn: 3.712 ± 0.515
GlyPro: 0.0 ± 0.0
GlyGln: 1.822 ± 0.456
GlyArg: 2.767 ± 0.455
GlySer: 4.994 ± 0.647
GlyThr: 3.644 ± 0.572
GlyVal: 5.129 ± 0.571
GlyTrp: 1.687 ± 0.295
GlyTyr: 3.577 ± 0.479
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.215 ± 0.333
HisCys: 0.202 ± 0.114
HisAsp: 1.485 ± 0.351
HisGlu: 1.012 ± 0.285
HisPhe: 1.08 ± 0.257
HisGly: 1.552 ± 0.306
HisHis: 0.54 ± 0.326
HisIle: 1.012 ± 0.227
HisLys: 1.35 ± 0.335
HisLeu: 1.35 ± 0.379
HisMet: 0.27 ± 0.122
HisAsn: 0.405 ± 0.161
HisPro: 0.472 ± 0.235
HisGln: 0.81 ± 0.273
HisArg: 1.08 ± 0.275
HisSer: 1.08 ± 0.341
HisThr: 0.877 ± 0.227
HisVal: 1.417 ± 0.269
HisTrp: 0.135 ± 0.102
HisTyr: 0.742 ± 0.251
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.682 ± 0.76
IleCys: 0.81 ± 0.236
IleAsp: 4.792 ± 0.536
IleGlu: 4.927 ± 0.493
IlePhe: 2.565 ± 0.377
IleGly: 3.442 ± 0.445
IleHis: 1.485 ± 0.288
IleIle: 3.779 ± 0.431
IleLys: 5.669 ± 0.628
IleLeu: 3.307 ± 0.517
IleMet: 2.227 ± 0.475
IleAsn: 3.375 ± 0.476
IlePro: 2.092 ± 0.375
IleGln: 2.025 ± 0.399
IleArg: 2.835 ± 0.445
IleSer: 4.049 ± 0.495
IleThr: 4.454 ± 0.557
IleVal: 4.657 ± 0.449
IleTrp: 0.877 ± 0.235
IleTyr: 1.755 ± 0.328
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.559 ± 0.826
LysCys: 0.675 ± 0.202
LysAsp: 4.859 ± 0.516
LysGlu: 6.614 ± 0.957
LysPhe: 1.89 ± 0.401
LysGly: 3.847 ± 0.606
LysHis: 1.485 ± 0.319
LysIle: 4.184 ± 0.502
LysLys: 4.724 ± 0.779
LysLeu: 5.399 ± 0.712
LysMet: 3.172 ± 0.436
LysAsn: 3.644 ± 0.524
LysPro: 2.97 ± 0.416
LysGln: 2.16 ± 0.309
LysArg: 4.117 ± 0.59
LysSer: 3.442 ± 0.551
LysThr: 4.724 ± 0.566
LysVal: 4.522 ± 0.424
LysTrp: 1.35 ± 0.309
LysTyr: 2.43 ± 0.453
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.289 ± 0.991
LeuCys: 1.08 ± 0.286
LeuAsp: 3.644 ± 0.488
LeuGlu: 3.847 ± 0.715
LeuPhe: 2.362 ± 0.381
LeuGly: 3.172 ± 0.539
LeuHis: 1.35 ± 0.332
LeuIle: 4.454 ± 0.453
LeuLys: 4.994 ± 0.641
LeuLeu: 4.252 ± 0.437
LeuMet: 2.295 ± 0.461
LeuAsn: 4.049 ± 0.491
LeuPro: 3.24 ± 0.493
LeuGln: 1.822 ± 0.347
LeuArg: 3.375 ± 0.503
LeuSer: 4.454 ± 0.552
LeuThr: 5.129 ± 0.674
LeuVal: 4.184 ± 0.452
LeuTrp: 1.147 ± 0.272
LeuTyr: 2.295 ± 0.38
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.7 ± 0.478
MetCys: 0.54 ± 0.191
MetAsp: 1.552 ± 0.339
MetGlu: 1.282 ± 0.328
MetPhe: 1.755 ± 0.385
MetGly: 1.485 ± 0.33
MetHis: 0.742 ± 0.215
MetIle: 2.565 ± 0.376
MetLys: 2.43 ± 0.367
MetLeu: 2.43 ± 0.459
MetMet: 1.215 ± 0.372
MetAsn: 1.62 ± 0.321
MetPro: 0.742 ± 0.198
MetGln: 1.755 ± 0.358
MetArg: 2.025 ± 0.365
MetSer: 1.755 ± 0.394
MetThr: 1.822 ± 0.326
MetVal: 1.62 ± 0.273
MetTrp: 0.337 ± 0.15
MetTyr: 0.742 ± 0.211
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.252 ± 0.574
AsnCys: 0.607 ± 0.188
AsnAsp: 3.307 ± 0.488
AsnGlu: 3.577 ± 0.448
AsnPhe: 1.89 ± 0.318
AsnGly: 4.792 ± 0.612
AsnHis: 0.675 ± 0.195
AsnIle: 3.172 ± 0.425
AsnLys: 3.712 ± 0.502
AsnLeu: 2.835 ± 0.472
AsnMet: 1.012 ± 0.282
AsnAsn: 2.43 ± 0.424
AsnPro: 1.755 ± 0.344
AsnGln: 2.16 ± 0.356
AsnArg: 2.295 ± 0.358
AsnSer: 3.172 ± 0.518
AsnThr: 1.822 ± 0.301
AsnVal: 4.184 ± 0.605
AsnTrp: 0.472 ± 0.143
AsnTyr: 1.08 ± 0.252
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.497 ± 0.398
ProCys: 0.405 ± 0.22
ProAsp: 2.362 ± 0.421
ProGlu: 3.577 ± 0.612
ProPhe: 1.417 ± 0.292
ProGly: 2.565 ± 0.405
ProHis: 0.607 ± 0.205
ProIle: 2.025 ± 0.481
ProLys: 1.485 ± 0.309
ProLeu: 1.755 ± 0.314
ProMet: 0.877 ± 0.304
ProAsn: 1.687 ± 0.315
ProPro: 1.282 ± 0.31
ProGln: 1.08 ± 0.257
ProArg: 1.147 ± 0.374
ProSer: 1.147 ± 0.229
ProThr: 1.282 ± 0.279
ProVal: 2.835 ± 0.391
ProTrp: 0.27 ± 0.123
ProTyr: 1.147 ± 0.223
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.442 ± 0.439
GlnCys: 0.472 ± 0.196
GlnAsp: 2.362 ± 0.422
GlnGlu: 2.835 ± 0.392
GlnPhe: 1.417 ± 0.3
GlnGly: 1.687 ± 0.326
GlnHis: 0.472 ± 0.154
GlnIle: 2.97 ± 0.461
GlnLys: 2.632 ± 0.533
GlnLeu: 3.375 ± 0.643
GlnMet: 1.012 ± 0.251
GlnAsn: 1.62 ± 0.39
GlnPro: 1.282 ± 0.264
GlnGln: 2.7 ± 0.747
GlnArg: 1.957 ± 0.416
GlnSer: 2.565 ± 0.431
GlnThr: 1.485 ± 0.311
GlnVal: 2.362 ± 0.351
GlnTrp: 0.54 ± 0.171
GlnTyr: 1.35 ± 0.336
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.982 ± 0.532
ArgCys: 1.012 ± 0.308
ArgAsp: 2.835 ± 0.388
ArgGlu: 3.577 ± 0.542
ArgPhe: 2.362 ± 0.407
ArgGly: 2.97 ± 0.467
ArgHis: 0.742 ± 0.228
ArgIle: 3.509 ± 0.472
ArgLys: 3.712 ± 0.567
ArgLeu: 3.105 ± 0.491
ArgMet: 1.485 ± 0.27
ArgAsn: 1.552 ± 0.34
ArgPro: 1.417 ± 0.384
ArgGln: 2.16 ± 0.436
ArgArg: 3.577 ± 0.543
ArgSer: 2.16 ± 0.384
ArgThr: 1.687 ± 0.379
ArgVal: 4.657 ± 0.606
ArgTrp: 0.607 ± 0.208
ArgTyr: 2.16 ± 0.366
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.589 ± 0.58
SerCys: 0.607 ± 0.195
SerAsp: 4.522 ± 0.575
SerGlu: 4.049 ± 0.498
SerPhe: 2.025 ± 0.333
SerGly: 5.129 ± 0.543
SerHis: 0.945 ± 0.265
SerIle: 4.049 ± 0.513
SerLys: 3.375 ± 0.509
SerLeu: 3.914 ± 0.474
SerMet: 1.485 ± 0.3
SerAsn: 2.092 ± 0.364
SerPro: 1.552 ± 0.371
SerGln: 2.497 ± 0.58
SerArg: 2.632 ± 0.421
SerSer: 2.16 ± 0.419
SerThr: 2.295 ± 0.458
SerVal: 4.184 ± 0.517
SerTrp: 1.012 ± 0.274
SerTyr: 2.025 ± 0.346
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.522 ± 0.651
ThrCys: 0.472 ± 0.187
ThrAsp: 3.105 ± 0.411
ThrGlu: 2.565 ± 0.44
ThrPhe: 2.97 ± 0.388
ThrGly: 5.467 ± 0.541
ThrHis: 1.08 ± 0.256
ThrIle: 3.712 ± 0.413
ThrLys: 3.577 ± 0.732
ThrLeu: 3.644 ± 0.52
ThrMet: 1.215 ± 0.259
ThrAsn: 2.295 ± 0.349
ThrPro: 2.565 ± 0.381
ThrGln: 2.362 ± 0.454
ThrArg: 1.687 ± 0.287
ThrSer: 2.767 ± 0.476
ThrThr: 2.362 ± 0.325
ThrVal: 4.117 ± 0.576
ThrTrp: 1.08 ± 0.231
ThrTyr: 1.957 ± 0.402
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.062 ± 0.639
ValCys: 1.147 ± 0.285
ValAsp: 4.252 ± 0.524
ValGlu: 5.939 ± 0.568
ValPhe: 2.227 ± 0.399
ValGly: 4.184 ± 0.458
ValHis: 1.012 ± 0.249
ValIle: 5.602 ± 0.586
ValLys: 5.872 ± 0.617
ValLeu: 4.589 ± 0.516
ValMet: 2.16 ± 0.373
ValAsn: 3.779 ± 0.544
ValPro: 2.092 ± 0.419
ValGln: 1.957 ± 0.359
ValArg: 3.577 ± 0.493
ValSer: 3.712 ± 0.558
ValThr: 3.982 ± 0.413
ValVal: 3.577 ± 0.571
ValTrp: 1.282 ± 0.241
ValTyr: 2.497 ± 0.39
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.945 ± 0.309
TrpCys: 0.405 ± 0.182
TrpAsp: 0.877 ± 0.236
TrpGlu: 0.877 ± 0.237
TrpPhe: 0.742 ± 0.22
TrpGly: 1.552 ± 0.365
TrpHis: 0.337 ± 0.141
TrpIle: 1.215 ± 0.275
TrpLys: 1.012 ± 0.247
TrpLeu: 1.35 ± 0.301
TrpMet: 0.337 ± 0.133
TrpAsn: 0.81 ± 0.203
TrpPro: 0.27 ± 0.126
TrpGln: 0.405 ± 0.138
TrpArg: 1.147 ± 0.345
TrpSer: 0.742 ± 0.282
TrpThr: 0.81 ± 0.248
TrpVal: 0.742 ± 0.163
TrpTrp: 0.202 ± 0.109
TrpTyr: 0.405 ± 0.158
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.037 ± 0.567
TyrCys: 0.742 ± 0.242
TyrAsp: 2.295 ± 0.382
TyrGlu: 2.632 ± 0.464
TyrPhe: 1.35 ± 0.32
TyrGly: 3.037 ± 0.396
TyrHis: 0.877 ± 0.206
TyrIle: 2.43 ± 0.412
TyrLys: 2.362 ± 0.411
TyrLeu: 2.16 ± 0.338
TyrMet: 0.675 ± 0.216
TyrAsn: 2.025 ± 0.388
TyrPro: 1.147 ± 0.29
TyrGln: 1.687 ± 0.323
TyrArg: 2.092 ± 0.483
TyrSer: 2.43 ± 0.453
TyrThr: 1.822 ± 0.293
TyrVal: 2.362 ± 0.353
TyrTrp: 0.337 ± 0.128
TyrTyr: 1.215 ± 0.272
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 83 proteins (14818 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
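A protein-level bootstrap of this kind could look roughly like the sketch below. It is an assumption-laden illustration rather than the Proteome-pI code: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the standard deviation of those estimates is taken as the error. The function names, the fixed seed, and the use of the sample standard deviation are all choices made for this example.

```python
import random
from statistics import stdev


def dipeptide_per_mille(proteins, dipeptide):
    """Per mille frequency of one dipeptide among all dipeptides in the given proteins."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency over resamples of whole proteins."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample at the protein level: draw len(proteins) proteins with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return stdev(estimates)


if __name__ == "__main__":
    toy = ["MKAAL", "AAK", "MGGSAA", "KLV"]  # toy sequences, not the phage proteome
    print(dipeptide_per_mille(toy, "AA"), "+/-", bootstrap_error(toy, "AA"))
```

Resampling whole proteins rather than individual residues keeps each dipeptide within its original sequence context. A dipeptide that never occurs, such as any pair involving Xaa in this proteome, gets an error of 0.0 under this scheme, which matches the `0.0 ± 0.0` entries in the table.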

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in UniProt.
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski