Amino acid dipeptide frequency for Salmonella phage vB_SenS_AG11

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides occurred with equal probability in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
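As a rough illustration of how such frequencies could be computed, the sketch below counts overlapping residue pairs within each protein, converts the counts to per mille, and compares each value with the 2.5‰ expected if all 400 standard dipeptides were equally likely. The function names, the one-letter input format, and the choice to count overlapping pairs are assumptions made for this example; the page does not describe the actual Proteome-pI procedure.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"           # X stands for Xaa (non-standard or unknown)
EXPECTED_PERMILLE = 1000.0 / 400    # 2.5 per mille if all 400 dipeptides were equally likely


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 residue pairs.

    Assumption: overlapping pairs are counted within each protein,
    never across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def label(freq):
    """Classify a per mille frequency against the uniform expectation."""
    if freq > EXPECTED_PERMILLE:
        return "over"
    if freq < EXPECTED_PERMILLE:
        return "under"
    return "expected"


# Toy input (made-up sequences, not taken from the phage proteome).
freqs = dipeptide_permille(["AAAC", "ACA"])
print(freqs["AA"], label(freqs["AA"]))  # 400.0 over
```

With real data, the input would be the list of protein sequences of the proteome, and the resulting dictionary corresponds to one frequency value per dipeptide.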

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions (for example, 10.92‰ corresponds to 0.01092, or 1.092%).

For more information, see the sequence space article on Wikipedia.

Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Entries are grouped by the first residue and listed as dipeptide: frequency ± error (values in ‰).

Ala
AlaAla: 10.92 ± 1.926
AlaCys: 1.0 ± 0.281
AlaAsp: 6.229 ± 0.746
AlaGlu: 6.383 ± 0.829
AlaPhe: 4.306 ± 0.575
AlaGly: 7.382 ± 0.709
AlaHis: 2.076 ± 0.383
AlaIle: 3.999 ± 0.882
AlaLys: 5.691 ± 0.91
AlaLeu: 7.536 ± 0.779
AlaMet: 2.23 ± 0.52
AlaAsn: 3.691 ± 0.515
AlaPro: 3.307 ± 0.401
AlaGln: 4.153 ± 1.228
AlaArg: 4.768 ± 0.715
AlaSer: 5.998 ± 0.746
AlaThr: 5.921 ± 0.738
AlaVal: 7.536 ± 0.935
AlaTrp: 1.23 ± 0.307
AlaTyr: 3.307 ± 0.517
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.769 ± 0.224
CysCys: 0.154 ± 0.132
CysAsp: 0.846 ± 0.231
CysGlu: 1.0 ± 0.398
CysPhe: 0.308 ± 0.165
CysGly: 0.538 ± 0.215
CysHis: 0.154 ± 0.098
CysIle: 0.308 ± 0.132
CysLys: 0.923 ± 0.307
CysLeu: 0.846 ± 0.314
CysMet: 0.231 ± 0.126
CysAsn: 0.615 ± 0.239
CysPro: 0.384 ± 0.163
CysGln: 0.231 ± 0.151
CysArg: 0.692 ± 0.223
CysSer: 0.308 ± 0.15
CysThr: 0.615 ± 0.194
CysVal: 0.692 ± 0.206
CysTrp: 0.308 ± 0.148
CysTyr: 0.231 ± 0.141
CysXaa: 0.0 ± 0.0

Asp
AspAla: 6.921 ± 0.777
AspCys: 0.846 ± 0.302
AspAsp: 3.614 ± 0.626
AspGlu: 3.691 ± 0.589
AspPhe: 2.922 ± 0.367
AspGly: 6.613 ± 0.774
AspHis: 0.692 ± 0.239
AspIle: 3.384 ± 0.399
AspLys: 3.614 ± 0.46
AspLeu: 4.614 ± 0.641
AspMet: 1.615 ± 0.289
AspAsn: 2.845 ± 0.581
AspPro: 1.922 ± 0.468
AspGln: 0.615 ± 0.252
AspArg: 2.691 ± 0.417
AspSer: 3.845 ± 0.536
AspThr: 4.383 ± 0.472
AspVal: 3.768 ± 0.495
AspTrp: 0.769 ± 0.223
AspTyr: 1.999 ± 0.475
AspXaa: 0.0 ± 0.0

Glu
GluAla: 6.229 ± 0.854
GluCys: 0.461 ± 0.201
GluAsp: 3.691 ± 0.585
GluGlu: 4.922 ± 0.915
GluPhe: 2.768 ± 0.611
GluGly: 4.768 ± 0.932
GluHis: 1.0 ± 0.251
GluIle: 3.768 ± 0.597
GluLys: 4.229 ± 0.657
GluLeu: 6.383 ± 0.818
GluMet: 2.384 ± 0.427
GluAsn: 2.538 ± 0.418
GluPro: 1.846 ± 0.528
GluGln: 2.922 ± 0.709
GluArg: 3.768 ± 0.59
GluSer: 3.537 ± 0.448
GluThr: 4.076 ± 0.533
GluVal: 4.153 ± 0.548
GluTrp: 1.077 ± 0.295
GluTyr: 1.769 ± 0.441
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.691 ± 0.547
PheCys: 0.538 ± 0.193
PheAsp: 3.23 ± 0.587
PheGlu: 2.461 ± 0.445
PhePhe: 0.615 ± 0.209
PheGly: 3.153 ± 0.345
PheHis: 0.615 ± 0.228
PheIle: 2.384 ± 0.496
PheLys: 1.769 ± 0.492
PheLeu: 2.153 ± 0.487
PheMet: 0.384 ± 0.184
PheAsn: 1.307 ± 0.412
PhePro: 1.538 ± 0.442
PheGln: 1.461 ± 0.318
PheArg: 2.384 ± 0.363
PheSer: 2.461 ± 0.595
PheThr: 3.076 ± 0.762
PheVal: 2.615 ± 0.441
PheTrp: 0.769 ± 0.263
PheTyr: 1.23 ± 0.352
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 7.613 ± 0.844
GlyCys: 1.077 ± 0.35
GlyAsp: 3.307 ± 0.688
GlyGlu: 6.152 ± 0.965
GlyPhe: 2.845 ± 0.561
GlyGly: 6.69 ± 0.79
GlyHis: 1.461 ± 0.396
GlyIle: 3.46 ± 0.46
GlyLys: 4.768 ± 0.674
GlyLeu: 5.229 ± 0.548
GlyMet: 2.538 ± 0.655
GlyAsn: 3.845 ± 0.629
GlyPro: 1.692 ± 0.339
GlyGln: 3.076 ± 0.466
GlyArg: 4.46 ± 0.602
GlySer: 5.383 ± 0.836
GlyThr: 3.691 ± 0.572
GlyVal: 5.537 ± 0.757
GlyTrp: 1.307 ± 0.324
GlyTyr: 2.999 ± 0.609
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.307 ± 0.369
HisCys: 0.461 ± 0.192
HisAsp: 0.846 ± 0.223
HisGlu: 0.923 ± 0.269
HisPhe: 0.769 ± 0.254
HisGly: 0.923 ± 0.342
HisHis: 0.615 ± 0.239
HisIle: 0.769 ± 0.246
HisLys: 1.153 ± 0.33
HisLeu: 1.153 ± 0.336
HisMet: 0.538 ± 0.206
HisAsn: 0.461 ± 0.184
HisPro: 1.307 ± 0.322
HisGln: 1.077 ± 0.323
HisArg: 0.846 ± 0.24
HisSer: 0.846 ± 0.264
HisThr: 1.0 ± 0.347
HisVal: 0.769 ± 0.234
HisTrp: 0.154 ± 0.119
HisTyr: 0.846 ± 0.28
HisXaa: 0.0 ± 0.0

Ile
IleAla: 4.922 ± 1.009
IleCys: 0.769 ± 0.254
IleAsp: 3.922 ± 0.524
IleGlu: 2.23 ± 0.419
IlePhe: 1.153 ± 0.357
IleGly: 3.384 ± 0.456
IleHis: 0.769 ± 0.252
IleIle: 1.999 ± 0.485
IleLys: 3.153 ± 0.518
IleLeu: 3.076 ± 0.472
IleMet: 0.692 ± 0.227
IleAsn: 2.076 ± 0.446
IlePro: 2.615 ± 0.426
IleGln: 1.999 ± 0.526
IleArg: 2.461 ± 0.379
IleSer: 2.845 ± 0.558
IleThr: 4.46 ± 0.687
IleVal: 3.614 ± 0.499
IleTrp: 0.846 ± 0.251
IleTyr: 1.538 ± 0.404
IleXaa: 0.0 ± 0.0

Lys
LysAla: 5.383 ± 0.768
LysCys: 0.538 ± 0.214
LysAsp: 3.691 ± 0.548
LysGlu: 3.768 ± 0.592
LysPhe: 2.307 ± 0.381
LysGly: 3.614 ± 0.439
LysHis: 1.307 ± 0.313
LysIle: 2.076 ± 0.445
LysLys: 2.845 ± 0.471
LysLeu: 5.921 ± 0.743
LysMet: 2.461 ± 0.488
LysAsn: 2.23 ± 0.433
LysPro: 2.691 ± 0.632
LysGln: 2.153 ± 0.448
LysArg: 4.076 ± 0.573
LysSer: 3.153 ± 0.659
LysThr: 3.845 ± 0.467
LysVal: 3.46 ± 0.605
LysTrp: 0.846 ± 0.282
LysTyr: 2.768 ± 0.422
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 6.844 ± 0.768
LeuCys: 0.538 ± 0.213
LeuAsp: 4.076 ± 0.425
LeuGlu: 4.537 ± 0.67
LeuPhe: 1.999 ± 0.367
LeuGly: 4.691 ± 0.492
LeuHis: 1.23 ± 0.321
LeuIle: 5.075 ± 0.6
LeuLys: 4.998 ± 0.594
LeuLeu: 5.767 ± 0.783
LeuMet: 2.538 ± 0.421
LeuAsn: 4.153 ± 0.533
LeuPro: 3.691 ± 0.604
LeuGln: 2.999 ± 0.53
LeuArg: 4.845 ± 0.731
LeuSer: 4.537 ± 0.645
LeuThr: 5.537 ± 0.539
LeuVal: 5.998 ± 0.558
LeuTrp: 1.077 ± 0.332
LeuTyr: 2.691 ± 0.42
LeuXaa: 0.0 ± 0.0

Met
MetAla: 3.076 ± 0.332
MetCys: 0.384 ± 0.15
MetAsp: 1.307 ± 0.386
MetGlu: 1.307 ± 0.293
MetPhe: 0.923 ± 0.281
MetGly: 1.922 ± 0.392
MetHis: 0.231 ± 0.142
MetIle: 1.0 ± 0.288
MetLys: 1.769 ± 0.399
MetLeu: 2.384 ± 0.418
MetMet: 0.692 ± 0.261
MetAsn: 0.923 ± 0.233
MetPro: 1.538 ± 0.362
MetGln: 1.0 ± 0.253
MetArg: 1.461 ± 0.288
MetSer: 2.153 ± 0.342
MetThr: 2.076 ± 0.368
MetVal: 1.999 ± 0.421
MetTrp: 0.461 ± 0.229
MetTyr: 0.615 ± 0.245
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 4.229 ± 0.774
AsnCys: 0.384 ± 0.154
AsnAsp: 3.076 ± 0.399
AsnGlu: 2.461 ± 0.424
AsnPhe: 1.23 ± 0.28
AsnGly: 4.46 ± 0.688
AsnHis: 0.769 ± 0.265
AsnIle: 2.922 ± 0.492
AsnLys: 1.846 ± 0.398
AsnLeu: 3.691 ± 0.424
AsnMet: 0.692 ± 0.298
AsnAsn: 2.076 ± 0.456
AsnPro: 1.615 ± 0.35
AsnGln: 1.0 ± 0.256
AsnArg: 2.461 ± 0.392
AsnSer: 1.922 ± 0.365
AsnThr: 2.307 ± 0.412
AsnVal: 3.384 ± 0.457
AsnTrp: 0.692 ± 0.221
AsnTyr: 1.538 ± 0.36
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 3.153 ± 0.52
ProCys: 0.384 ± 0.179
ProAsp: 3.23 ± 0.583
ProGlu: 3.23 ± 0.503
ProPhe: 1.692 ± 0.372
ProGly: 3.307 ± 0.552
ProHis: 0.384 ± 0.176
ProIle: 1.307 ± 0.344
ProLys: 2.615 ± 0.438
ProLeu: 3.153 ± 0.425
ProMet: 0.923 ± 0.313
ProAsn: 1.692 ± 0.555
ProPro: 1.538 ± 0.346
ProGln: 1.23 ± 0.316
ProArg: 2.076 ± 0.35
ProSer: 2.076 ± 0.463
ProThr: 1.615 ± 0.322
ProVal: 4.076 ± 0.45
ProTrp: 0.384 ± 0.199
ProTyr: 1.384 ± 0.389
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 4.229 ± 0.7
GlnCys: 0.308 ± 0.145
GlnAsp: 1.461 ± 0.31
GlnGlu: 1.999 ± 0.461
GlnPhe: 1.23 ± 0.394
GlnGly: 2.307 ± 0.439
GlnHis: 0.769 ± 0.289
GlnIle: 1.692 ± 0.454
GlnLys: 2.461 ± 0.42
GlnLeu: 3.153 ± 0.528
GlnMet: 1.23 ± 0.28
GlnAsn: 1.999 ± 0.381
GlnPro: 2.076 ± 0.329
GlnGln: 2.615 ± 0.839
GlnArg: 1.922 ± 0.372
GlnSer: 1.922 ± 0.461
GlnThr: 1.846 ± 0.44
GlnVal: 2.768 ± 0.456
GlnTrp: 0.538 ± 0.157
GlnTyr: 1.307 ± 0.265
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 4.845 ± 0.462
ArgCys: 0.461 ± 0.152
ArgAsp: 3.922 ± 0.531
ArgGlu: 4.537 ± 0.832
ArgPhe: 1.922 ± 0.357
ArgGly: 3.845 ± 0.512
ArgHis: 0.846 ± 0.256
ArgIle: 3.076 ± 0.529
ArgLys: 3.153 ± 0.62
ArgLeu: 3.768 ± 0.505
ArgMet: 2.384 ± 0.487
ArgAsn: 3.076 ± 0.441
ArgPro: 2.076 ± 0.429
ArgGln: 2.615 ± 0.449
ArgArg: 4.614 ± 0.67
ArgSer: 2.076 ± 0.392
ArgThr: 2.845 ± 0.468
ArgVal: 4.46 ± 0.571
ArgTrp: 0.923 ± 0.249
ArgTyr: 1.538 ± 0.387
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 6.383 ± 1.06
SerCys: 0.308 ± 0.175
SerAsp: 3.153 ± 0.484
SerGlu: 3.153 ± 0.6
SerPhe: 2.461 ± 0.49
SerGly: 6.46 ± 0.803
SerHis: 0.769 ± 0.222
SerIle: 2.23 ± 0.406
SerLys: 3.307 ± 0.482
SerLeu: 5.152 ± 0.751
SerMet: 1.23 ± 0.283
SerAsn: 2.307 ± 0.415
SerPro: 1.846 ± 0.398
SerGln: 1.999 ± 0.356
SerArg: 2.768 ± 0.389
SerSer: 2.845 ± 0.465
SerThr: 4.076 ± 0.748
SerVal: 5.537 ± 0.962
SerTrp: 0.615 ± 0.198
SerTyr: 1.615 ± 0.361
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 6.921 ± 0.789
ThrCys: 0.538 ± 0.203
ThrAsp: 4.537 ± 0.509
ThrGlu: 3.845 ± 0.579
ThrPhe: 2.999 ± 0.535
ThrGly: 5.691 ± 1.046
ThrHis: 1.077 ± 0.302
ThrIle: 2.691 ± 0.5
ThrLys: 3.153 ± 0.476
ThrLeu: 5.229 ± 0.798
ThrMet: 1.077 ± 0.34
ThrAsn: 1.922 ± 0.412
ThrPro: 3.691 ± 0.574
ThrGln: 1.769 ± 0.343
ThrArg: 2.999 ± 0.405
ThrSer: 4.614 ± 0.509
ThrThr: 3.614 ± 0.553
ThrVal: 4.614 ± 0.762
ThrTrp: 1.23 ± 0.305
ThrTyr: 2.384 ± 0.444
ThrXaa: 0.0 ± 0.0

Val
ValAla: 6.613 ± 0.637
ValCys: 0.384 ± 0.154
ValAsp: 3.999 ± 0.438
ValGlu: 6.69 ± 0.691
ValPhe: 2.153 ± 0.464
ValGly: 3.922 ± 0.679
ValHis: 1.0 ± 0.262
ValIle: 4.306 ± 0.667
ValLys: 4.768 ± 0.727
ValLeu: 4.614 ± 0.564
ValMet: 1.769 ± 0.432
ValAsn: 3.153 ± 0.503
ValPro: 2.23 ± 0.697
ValGln: 2.691 ± 0.389
ValArg: 3.922 ± 0.524
ValSer: 5.306 ± 0.742
ValThr: 6.46 ± 0.988
ValVal: 5.537 ± 0.74
ValTrp: 0.769 ± 0.239
ValTyr: 2.538 ± 0.38
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.384 ± 0.374
TrpCys: 0.077 ± 0.077
TrpAsp: 1.077 ± 0.28
TrpGlu: 0.692 ± 0.21
TrpPhe: 0.692 ± 0.307
TrpGly: 0.923 ± 0.218
TrpHis: 0.308 ± 0.195
TrpIle: 0.538 ± 0.211
TrpLys: 0.692 ± 0.285
TrpLeu: 1.769 ± 0.405
TrpMet: 0.461 ± 0.175
TrpAsn: 0.615 ± 0.26
TrpPro: 0.461 ± 0.193
TrpGln: 0.846 ± 0.263
TrpArg: 1.23 ± 0.352
TrpSer: 0.384 ± 0.162
TrpThr: 0.769 ± 0.232
TrpVal: 0.923 ± 0.223
TrpTrp: 0.461 ± 0.176
TrpTyr: 0.231 ± 0.135
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 3.23 ± 0.453
TyrCys: 0.538 ± 0.204
TyrAsp: 2.153 ± 0.525
TyrGlu: 2.384 ± 0.426
TyrPhe: 1.538 ± 0.444
TyrGly: 2.538 ± 0.504
TyrHis: 0.692 ± 0.207
TyrIle: 1.538 ± 0.351
TyrLys: 2.23 ± 0.482
TyrLeu: 2.23 ± 0.387
TyrMet: 1.077 ± 0.23
TyrAsn: 1.23 ± 0.274
TyrPro: 1.384 ± 0.326
TyrGln: 1.384 ± 0.339
TyrArg: 2.538 ± 0.514
TyrSer: 1.999 ± 0.388
TyrThr: 2.384 ± 0.457
TyrVal: 1.384 ± 0.376
TyrTrp: 0.077 ± 0.071
TyrTyr: 1.384 ± 0.371
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 66 proteins (13005 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
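Bootstrapping at the protein level can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread of the 100 estimates is taken as the error. Using the sample standard deviation as the reported error, as well as the function names and the fixed random seed, are assumptions of this example; the note above states only that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics


def dipeptide_permille(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide, counting overlapping pairs within each protein."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement (same number of proteins as the input).
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, dipeptide))
    return statistics.stdev(estimates)


# Toy input (made-up one-letter sequences, not the phage proteome).
toy = ["AAC", "CCA", "ACA", "AAAA"]
print(dipeptide_permille(toy, "AA"), "+/-", bootstrap_error(toy, "AA"))
```

Resampling proteins rather than individual residue pairs keeps the pairs from one protein together, so the estimate reflects variation between proteins.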

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski