Amino acid dipepetide frequency for Salmonella phage P22 (Bacteriophage P22)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.021AlaAla: 10.021 ± 1.167
1.147AlaCys: 1.147 ± 0.338
5.89AlaAsp: 5.89 ± 0.738
6.961AlaGlu: 6.961 ± 0.728
3.748AlaPhe: 3.748 ± 0.581
6.961AlaGly: 6.961 ± 0.785
0.918AlaHis: 0.918 ± 0.288
6.349AlaIle: 6.349 ± 0.655
5.508AlaLys: 5.508 ± 0.656
6.426AlaLeu: 6.426 ± 0.918
3.825AlaMet: 3.825 ± 0.608
5.89AlaAsn: 5.89 ± 0.883
2.371AlaPro: 2.371 ± 0.337
3.366AlaGln: 3.366 ± 0.545
5.355AlaArg: 5.355 ± 0.732
4.437AlaSer: 4.437 ± 0.563
5.584AlaThr: 5.584 ± 0.72
5.202AlaVal: 5.202 ± 0.587
1.606AlaTrp: 1.606 ± 0.337
3.06AlaTyr: 3.06 ± 0.455
0.0AlaXaa: 0.0 ± 0.0
Cys
0.918CysAla: 0.918 ± 0.312
0.306CysCys: 0.306 ± 0.159
0.306CysAsp: 0.306 ± 0.164
0.688CysGlu: 0.688 ± 0.234
0.459CysPhe: 0.459 ± 0.194
1.224CysGly: 1.224 ± 0.369
0.535CysHis: 0.535 ± 0.243
0.688CysIle: 0.688 ± 0.233
1.071CysLys: 1.071 ± 0.324
0.612CysLeu: 0.612 ± 0.262
0.306CysMet: 0.306 ± 0.15
0.612CysAsn: 0.612 ± 0.242
0.382CysPro: 0.382 ± 0.17
0.459CysGln: 0.459 ± 0.143
1.071CysArg: 1.071 ± 0.313
0.994CysSer: 0.994 ± 0.292
0.306CysThr: 0.306 ± 0.164
1.3CysVal: 1.3 ± 0.314
0.153CysTrp: 0.153 ± 0.12
0.612CysTyr: 0.612 ± 0.188
0.0CysXaa: 0.0 ± 0.0
Asp
7.879AspAla: 7.879 ± 0.664
0.765AspCys: 0.765 ± 0.285
4.131AspAsp: 4.131 ± 0.441
4.054AspGlu: 4.054 ± 0.615
1.912AspPhe: 1.912 ± 0.493
4.054AspGly: 4.054 ± 0.713
1.071AspHis: 1.071 ± 0.357
4.131AspIle: 4.131 ± 0.446
3.366AspLys: 3.366 ± 0.529
4.896AspLeu: 4.896 ± 0.533
1.683AspMet: 1.683 ± 0.295
2.065AspAsn: 2.065 ± 0.373
2.065AspPro: 2.065 ± 0.397
1.071AspGln: 1.071 ± 0.209
2.218AspArg: 2.218 ± 0.367
3.06AspSer: 3.06 ± 0.463
2.065AspThr: 2.065 ± 0.359
4.437AspVal: 4.437 ± 0.627
0.841AspTrp: 0.841 ± 0.29
3.366AspTyr: 3.366 ± 0.518
0.0AspXaa: 0.0 ± 0.0
Glu
5.814GluAla: 5.814 ± 0.676
1.377GluCys: 1.377 ± 0.36
3.366GluAsp: 3.366 ± 0.49
4.972GluGlu: 4.972 ± 0.728
1.989GluPhe: 1.989 ± 0.446
3.748GluGly: 3.748 ± 0.498
1.224GluHis: 1.224 ± 0.271
3.672GluIle: 3.672 ± 0.461
4.513GluLys: 4.513 ± 0.614
6.655GluLeu: 6.655 ± 0.755
1.836GluMet: 1.836 ± 0.391
3.289GluAsn: 3.289 ± 0.459
2.218GluPro: 2.218 ± 0.463
3.825GluGln: 3.825 ± 0.537
4.437GluArg: 4.437 ± 0.737
3.748GluSer: 3.748 ± 0.496
2.601GluThr: 2.601 ± 0.378
3.136GluVal: 3.136 ± 0.451
1.606GluTrp: 1.606 ± 0.348
1.606GluTyr: 1.606 ± 0.382
0.0GluXaa: 0.0 ± 0.0
Phe
3.825PheAla: 3.825 ± 0.603
0.765PheCys: 0.765 ± 0.22
2.218PheAsp: 2.218 ± 0.443
1.377PheGlu: 1.377 ± 0.317
1.071PhePhe: 1.071 ± 0.443
2.371PheGly: 2.371 ± 0.356
0.459PheHis: 0.459 ± 0.15
3.672PheIle: 3.672 ± 0.645
1.836PheLys: 1.836 ± 0.356
2.142PheLeu: 2.142 ± 0.444
1.071PheMet: 1.071 ± 0.259
2.371PheAsn: 2.371 ± 0.423
1.224PhePro: 1.224 ± 0.26
1.147PheGln: 1.147 ± 0.387
1.683PheArg: 1.683 ± 0.417
2.83PheSer: 2.83 ± 0.486
2.065PheThr: 2.065 ± 0.393
1.377PheVal: 1.377 ± 0.265
0.535PheTrp: 0.535 ± 0.206
1.224PheTyr: 1.224 ± 0.329
0.0PheXaa: 0.0 ± 0.0
Gly
5.661GlyAla: 5.661 ± 0.854
0.459GlyCys: 0.459 ± 0.19
3.442GlyAsp: 3.442 ± 0.458
3.978GlyGlu: 3.978 ± 0.475
2.83GlyPhe: 2.83 ± 0.48
4.896GlyGly: 4.896 ± 0.889
0.994GlyHis: 0.994 ± 0.26
4.284GlyIle: 4.284 ± 0.564
4.972GlyLys: 4.972 ± 0.566
5.202GlyLeu: 5.202 ± 0.662
2.754GlyMet: 2.754 ± 0.386
3.289GlyAsn: 3.289 ± 0.445
0.994GlyPro: 0.994 ± 0.227
3.825GlyGln: 3.825 ± 0.693
4.896GlyArg: 4.896 ± 0.675
3.595GlySer: 3.595 ± 0.74
3.672GlyThr: 3.672 ± 0.547
5.202GlyVal: 5.202 ± 0.494
1.453GlyTrp: 1.453 ± 0.268
2.295GlyTyr: 2.295 ± 0.4
0.0GlyXaa: 0.0 ± 0.0
His
1.147HisAla: 1.147 ± 0.279
0.382HisCys: 0.382 ± 0.171
1.3HisAsp: 1.3 ± 0.342
1.683HisGlu: 1.683 ± 0.349
0.994HisPhe: 0.994 ± 0.34
0.994HisGly: 0.994 ± 0.309
0.153HisHis: 0.153 ± 0.1
0.612HisIle: 0.612 ± 0.188
1.377HisLys: 1.377 ± 0.37
1.377HisLeu: 1.377 ± 0.376
0.459HisMet: 0.459 ± 0.164
0.459HisAsn: 0.459 ± 0.193
0.765HisPro: 0.765 ± 0.226
0.841HisGln: 0.841 ± 0.281
0.918HisArg: 0.918 ± 0.304
0.765HisSer: 0.765 ± 0.227
0.535HisThr: 0.535 ± 0.185
0.535HisVal: 0.535 ± 0.209
0.306HisTrp: 0.306 ± 0.137
0.841HisTyr: 0.841 ± 0.257
0.0HisXaa: 0.0 ± 0.0
Ile
6.579IleAla: 6.579 ± 0.621
0.535IleCys: 0.535 ± 0.222
3.825IleAsp: 3.825 ± 0.448
3.901IleGlu: 3.901 ± 0.517
2.065IlePhe: 2.065 ± 0.479
5.89IleGly: 5.89 ± 0.822
1.224IleHis: 1.224 ± 0.296
4.972IleIle: 4.972 ± 0.937
3.289IleLys: 3.289 ± 0.431
3.595IleLeu: 3.595 ± 0.697
0.994IleMet: 0.994 ± 0.29
3.366IleAsn: 3.366 ± 0.556
3.442IlePro: 3.442 ± 0.507
1.836IleGln: 1.836 ± 0.322
3.748IleArg: 3.748 ± 0.441
4.666IleSer: 4.666 ± 0.789
5.278IleThr: 5.278 ± 0.606
3.519IleVal: 3.519 ± 0.631
0.535IleTrp: 0.535 ± 0.167
2.371IleTyr: 2.371 ± 0.497
0.0IleXaa: 0.0 ± 0.0
Lys
5.278LysAla: 5.278 ± 0.712
0.994LysCys: 0.994 ± 0.304
3.519LysAsp: 3.519 ± 0.54
4.896LysGlu: 4.896 ± 0.7
1.989LysPhe: 1.989 ± 0.41
3.825LysGly: 3.825 ± 0.484
0.994LysHis: 0.994 ± 0.249
3.213LysIle: 3.213 ± 0.524
4.972LysLys: 4.972 ± 0.681
6.043LysLeu: 6.043 ± 0.834
1.453LysMet: 1.453 ± 0.293
2.371LysAsn: 2.371 ± 0.397
3.978LysPro: 3.978 ± 0.57
3.213LysGln: 3.213 ± 0.63
4.437LysArg: 4.437 ± 0.653
3.595LysSer: 3.595 ± 0.588
3.595LysThr: 3.595 ± 0.542
2.524LysVal: 2.524 ± 0.452
0.306LysTrp: 0.306 ± 0.159
2.295LysTyr: 2.295 ± 0.395
0.0LysXaa: 0.0 ± 0.0
Leu
7.191LeuAla: 7.191 ± 0.797
0.535LeuCys: 0.535 ± 0.231
3.519LeuAsp: 3.519 ± 0.494
5.125LeuGlu: 5.125 ± 0.725
2.83LeuPhe: 2.83 ± 0.567
4.59LeuGly: 4.59 ± 0.595
1.147LeuHis: 1.147 ± 0.242
6.196LeuIle: 6.196 ± 0.802
6.502LeuLys: 6.502 ± 0.668
7.038LeuLeu: 7.038 ± 0.685
2.448LeuMet: 2.448 ± 0.332
4.131LeuAsn: 4.131 ± 0.528
3.213LeuPro: 3.213 ± 0.439
2.907LeuGln: 2.907 ± 0.444
4.513LeuArg: 4.513 ± 0.563
5.431LeuSer: 5.431 ± 0.478
4.666LeuThr: 4.666 ± 0.544
4.207LeuVal: 4.207 ± 0.51
0.918LeuTrp: 0.918 ± 0.285
2.218LeuTyr: 2.218 ± 0.387
0.0LeuXaa: 0.0 ± 0.0
Met
3.06MetAla: 3.06 ± 0.476
0.382MetCys: 0.382 ± 0.18
1.224MetAsp: 1.224 ± 0.33
1.53MetGlu: 1.53 ± 0.348
0.535MetPhe: 0.535 ± 0.266
1.989MetGly: 1.989 ± 0.323
0.153MetHis: 0.153 ± 0.101
1.606MetIle: 1.606 ± 0.341
2.295MetLys: 2.295 ± 0.501
2.142MetLeu: 2.142 ± 0.414
1.147MetMet: 1.147 ± 0.25
1.3MetAsn: 1.3 ± 0.319
1.377MetPro: 1.377 ± 0.33
1.683MetGln: 1.683 ± 0.396
2.218MetArg: 2.218 ± 0.39
2.448MetSer: 2.448 ± 0.438
2.065MetThr: 2.065 ± 0.361
1.759MetVal: 1.759 ± 0.307
0.153MetTrp: 0.153 ± 0.1
1.071MetTyr: 1.071 ± 0.256
0.0MetXaa: 0.0 ± 0.0
Asn
5.202AsnAla: 5.202 ± 0.774
0.306AsnCys: 0.306 ± 0.177
2.218AsnAsp: 2.218 ± 0.375
2.677AsnGlu: 2.677 ± 0.424
0.918AsnPhe: 0.918 ± 0.229
4.284AsnGly: 4.284 ± 0.534
1.071AsnHis: 1.071 ± 0.317
3.06AsnIle: 3.06 ± 0.524
2.601AsnLys: 2.601 ± 0.417
3.136AsnLeu: 3.136 ± 0.495
1.377AsnMet: 1.377 ± 0.311
2.371AsnAsn: 2.371 ± 0.504
2.524AsnPro: 2.524 ± 0.394
2.907AsnGln: 2.907 ± 0.515
2.142AsnArg: 2.142 ± 0.398
3.136AsnSer: 3.136 ± 0.4
2.524AsnThr: 2.524 ± 0.404
2.601AsnVal: 2.601 ± 0.692
0.535AsnTrp: 0.535 ± 0.222
1.836AsnTyr: 1.836 ± 0.397
0.0AsnXaa: 0.0 ± 0.0
Pro
2.754ProAla: 2.754 ± 0.412
0.306ProCys: 0.306 ± 0.165
3.213ProAsp: 3.213 ± 0.436
5.049ProGlu: 5.049 ± 0.619
1.147ProPhe: 1.147 ± 0.27
1.912ProGly: 1.912 ± 0.374
0.765ProHis: 0.765 ± 0.248
2.295ProIle: 2.295 ± 0.477
2.295ProLys: 2.295 ± 0.445
3.289ProLeu: 3.289 ± 0.631
1.071ProMet: 1.071 ± 0.283
1.53ProAsn: 1.53 ± 0.372
1.606ProPro: 1.606 ± 0.367
1.606ProGln: 1.606 ± 0.29
1.836ProArg: 1.836 ± 0.385
2.295ProSer: 2.295 ± 0.404
2.218ProThr: 2.218 ± 0.361
3.213ProVal: 3.213 ± 0.4
0.306ProTrp: 0.306 ± 0.182
0.994ProTyr: 0.994 ± 0.279
0.0ProXaa: 0.0 ± 0.0
Gln
3.366GlnAla: 3.366 ± 0.555
0.612GlnCys: 0.612 ± 0.204
2.83GlnAsp: 2.83 ± 0.525
2.448GlnGlu: 2.448 ± 0.435
1.147GlnPhe: 1.147 ± 0.252
2.83GlnGly: 2.83 ± 0.564
0.994GlnHis: 0.994 ± 0.3
2.677GlnIle: 2.677 ± 0.495
1.53GlnLys: 1.53 ± 0.343
3.901GlnLeu: 3.901 ± 0.548
1.377GlnMet: 1.377 ± 0.264
2.065GlnAsn: 2.065 ± 0.626
1.836GlnPro: 1.836 ± 0.305
3.289GlnGln: 3.289 ± 0.767
2.983GlnArg: 2.983 ± 0.455
3.366GlnSer: 3.366 ± 0.525
1.453GlnThr: 1.453 ± 0.313
2.295GlnVal: 2.295 ± 0.476
1.147GlnTrp: 1.147 ± 0.309
1.836GlnTyr: 1.836 ± 0.442
0.0GlnXaa: 0.0 ± 0.0
Arg
4.437ArgAla: 4.437 ± 0.516
0.841ArgCys: 0.841 ± 0.329
4.131ArgAsp: 4.131 ± 0.596
3.978ArgGlu: 3.978 ± 0.55
2.371ArgPhe: 2.371 ± 0.453
3.366ArgGly: 3.366 ± 0.499
1.3ArgHis: 1.3 ± 0.32
3.978ArgIle: 3.978 ± 0.535
4.284ArgLys: 4.284 ± 0.707
4.896ArgLeu: 4.896 ± 0.492
2.295ArgMet: 2.295 ± 0.397
2.524ArgAsn: 2.524 ± 0.515
1.836ArgPro: 1.836 ± 0.395
2.677ArgGln: 2.677 ± 0.464
3.595ArgArg: 3.595 ± 0.728
3.519ArgSer: 3.519 ± 0.46
1.836ArgThr: 1.836 ± 0.337
3.06ArgVal: 3.06 ± 0.389
0.918ArgTrp: 0.918 ± 0.296
2.142ArgTyr: 2.142 ± 0.451
0.0ArgXaa: 0.0 ± 0.0
Ser
5.508SerAla: 5.508 ± 0.771
0.918SerCys: 0.918 ± 0.261
3.442SerAsp: 3.442 ± 0.5
3.519SerGlu: 3.519 ± 0.52
2.677SerPhe: 2.677 ± 0.468
5.278SerGly: 5.278 ± 0.821
1.147SerHis: 1.147 ± 0.322
3.825SerIle: 3.825 ± 0.518
3.136SerLys: 3.136 ± 0.393
5.355SerLeu: 5.355 ± 0.617
1.836SerMet: 1.836 ± 0.293
2.601SerAsn: 2.601 ± 0.325
3.289SerPro: 3.289 ± 0.541
2.754SerGln: 2.754 ± 0.455
3.06SerArg: 3.06 ± 0.44
3.136SerSer: 3.136 ± 0.61
2.601SerThr: 2.601 ± 0.567
3.672SerVal: 3.672 ± 0.438
0.841SerTrp: 0.841 ± 0.239
2.218SerTyr: 2.218 ± 0.464
0.0SerXaa: 0.0 ± 0.0
Thr
6.196ThrAla: 6.196 ± 0.682
0.535ThrCys: 0.535 ± 0.219
3.06ThrAsp: 3.06 ± 0.513
2.83ThrGlu: 2.83 ± 0.477
1.759ThrPhe: 1.759 ± 0.381
4.131ThrGly: 4.131 ± 0.556
0.688ThrHis: 0.688 ± 0.166
3.213ThrIle: 3.213 ± 0.521
3.748ThrLys: 3.748 ± 0.6
3.978ThrLeu: 3.978 ± 0.67
1.147ThrMet: 1.147 ± 0.292
1.989ThrAsn: 1.989 ± 0.363
2.601ThrPro: 2.601 ± 0.43
2.065ThrGln: 2.065 ± 0.372
2.601ThrArg: 2.601 ± 0.473
2.83ThrSer: 2.83 ± 0.461
3.06ThrThr: 3.06 ± 0.598
2.907ThrVal: 2.907 ± 0.445
0.612ThrTrp: 0.612 ± 0.226
1.377ThrTyr: 1.377 ± 0.303
0.0ThrXaa: 0.0 ± 0.0
Val
5.814ValAla: 5.814 ± 0.519
0.765ValCys: 0.765 ± 0.248
3.366ValAsp: 3.366 ± 0.496
3.901ValGlu: 3.901 ± 0.398
2.601ValPhe: 2.601 ± 0.461
3.442ValGly: 3.442 ± 0.506
0.459ValHis: 0.459 ± 0.17
3.748ValIle: 3.748 ± 0.559
3.978ValLys: 3.978 ± 0.485
4.819ValLeu: 4.819 ± 0.618
1.3ValMet: 1.3 ± 0.386
2.983ValAsn: 2.983 ± 0.457
1.759ValPro: 1.759 ± 0.362
2.065ValGln: 2.065 ± 0.294
3.06ValArg: 3.06 ± 0.338
4.207ValSer: 4.207 ± 0.549
3.06ValThr: 3.06 ± 0.44
4.437ValVal: 4.437 ± 0.503
0.841ValTrp: 0.841 ± 0.238
1.683ValTyr: 1.683 ± 0.399
0.0ValXaa: 0.0 ± 0.0
Trp
0.918TrpAla: 0.918 ± 0.249
0.459TrpCys: 0.459 ± 0.211
0.994TrpAsp: 0.994 ± 0.27
0.688TrpGlu: 0.688 ± 0.253
0.612TrpPhe: 0.612 ± 0.211
0.994TrpGly: 0.994 ± 0.246
0.459TrpHis: 0.459 ± 0.246
0.841TrpIle: 0.841 ± 0.228
1.071TrpLys: 1.071 ± 0.313
1.377TrpLeu: 1.377 ± 0.441
0.918TrpMet: 0.918 ± 0.202
0.382TrpAsn: 0.382 ± 0.152
0.459TrpPro: 0.459 ± 0.148
0.765TrpGln: 0.765 ± 0.191
0.535TrpArg: 0.535 ± 0.237
0.765TrpSer: 0.765 ± 0.245
0.535TrpThr: 0.535 ± 0.173
0.994TrpVal: 0.994 ± 0.2
0.229TrpTrp: 0.229 ± 0.126
0.382TrpTyr: 0.382 ± 0.139
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.06TyrAla: 3.06 ± 0.56
0.612TyrCys: 0.612 ± 0.204
3.136TyrAsp: 3.136 ± 0.606
1.377TyrGlu: 1.377 ± 0.42
1.606TyrPhe: 1.606 ± 0.307
1.912TyrGly: 1.912 ± 0.442
0.688TyrHis: 0.688 ± 0.231
2.601TyrIle: 2.601 ± 0.502
1.147TyrLys: 1.147 ± 0.281
2.448TyrLeu: 2.448 ± 0.402
0.765TyrMet: 0.765 ± 0.203
1.836TyrAsn: 1.836 ± 0.42
1.759TyrPro: 1.759 ± 0.4
1.759TyrGln: 1.759 ± 0.37
2.677TyrArg: 2.677 ± 0.489
1.989TyrSer: 1.989 ± 0.359
1.606TyrThr: 1.606 ± 0.253
1.912TyrVal: 1.912 ± 0.396
0.535TyrTrp: 0.535 ± 0.193
1.224TyrTyr: 1.224 ± 0.353
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 73 proteins (13073 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski