Amino acid dipepetide frequency for Salmonella phage LPST10

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.569AlaAla: 9.569 ± 2.782
0.624AlaCys: 0.624 ± 0.22
4.23AlaAsp: 4.23 ± 0.427
7.489AlaGlu: 7.489 ± 1.043
2.08AlaPhe: 2.08 ± 0.346
7.489AlaGly: 7.489 ± 0.71
1.525AlaHis: 1.525 ± 0.327
5.478AlaIle: 5.478 ± 0.628
7.211AlaLys: 7.211 ± 0.89
6.587AlaLeu: 6.587 ± 0.739
2.843AlaMet: 2.843 ± 0.547
3.259AlaAsn: 3.259 ± 0.46
1.872AlaPro: 1.872 ± 0.327
3.536AlaGln: 3.536 ± 1.051
4.992AlaArg: 4.992 ± 0.637
5.27AlaSer: 5.27 ± 0.714
5.062AlaThr: 5.062 ± 0.981
6.24AlaVal: 6.24 ± 0.604
1.04AlaTrp: 1.04 ± 0.257
3.051AlaTyr: 3.051 ± 0.498
0.0AlaXaa: 0.0 ± 0.0
Cys
1.248CysAla: 1.248 ± 0.258
0.347CysCys: 0.347 ± 0.164
0.763CysAsp: 0.763 ± 0.222
1.248CysGlu: 1.248 ± 0.289
0.555CysPhe: 0.555 ± 0.191
1.595CysGly: 1.595 ± 0.408
0.416CysHis: 0.416 ± 0.146
1.04CysIle: 1.04 ± 0.249
1.525CysLys: 1.525 ± 0.313
0.901CysLeu: 0.901 ± 0.234
0.139CysMet: 0.139 ± 0.105
0.901CysAsn: 0.901 ± 0.295
0.485CysPro: 0.485 ± 0.181
0.693CysGln: 0.693 ± 0.217
1.179CysArg: 1.179 ± 0.391
1.109CysSer: 1.109 ± 0.289
0.624CysThr: 0.624 ± 0.233
0.416CysVal: 0.416 ± 0.158
0.347CysTrp: 0.347 ± 0.162
0.416CysTyr: 0.416 ± 0.16
0.0CysXaa: 0.0 ± 0.0
Asp
6.934AspAla: 6.934 ± 0.591
1.04AspCys: 1.04 ± 0.293
4.576AspAsp: 4.576 ± 0.864
4.646AspGlu: 4.646 ± 0.635
2.704AspPhe: 2.704 ± 0.518
6.379AspGly: 6.379 ± 0.902
1.179AspHis: 1.179 ± 0.247
4.715AspIle: 4.715 ± 0.479
4.022AspLys: 4.022 ± 0.643
2.496AspLeu: 2.496 ± 0.391
2.08AspMet: 2.08 ± 0.398
2.149AspAsn: 2.149 ± 0.389
1.595AspPro: 1.595 ± 0.326
1.109AspGln: 1.109 ± 0.324
2.358AspArg: 2.358 ± 0.498
3.606AspSer: 3.606 ± 0.506
2.774AspThr: 2.774 ± 0.495
4.368AspVal: 4.368 ± 0.53
0.693AspTrp: 0.693 ± 0.22
2.774AspTyr: 2.774 ± 0.404
0.0AspXaa: 0.0 ± 0.0
Glu
5.2GluAla: 5.2 ± 0.531
1.248GluCys: 1.248 ± 0.285
3.675GluAsp: 3.675 ± 0.449
4.646GluGlu: 4.646 ± 0.541
2.704GluPhe: 2.704 ± 0.458
3.051GluGly: 3.051 ± 0.402
0.693GluHis: 0.693 ± 0.215
5.478GluIle: 5.478 ± 0.485
3.883GluLys: 3.883 ± 0.596
5.478GluLeu: 5.478 ± 0.608
2.358GluMet: 2.358 ± 0.399
2.912GluAsn: 2.912 ± 0.393
2.08GluPro: 2.08 ± 0.363
4.368GluGln: 4.368 ± 0.663
3.398GluArg: 3.398 ± 0.522
4.299GluSer: 4.299 ± 0.565
2.496GluThr: 2.496 ± 0.411
3.883GluVal: 3.883 ± 0.572
1.317GluTrp: 1.317 ± 0.299
2.427GluTyr: 2.427 ± 0.462
0.0GluXaa: 0.0 ± 0.0
Phe
2.011PheAla: 2.011 ± 0.299
1.179PheCys: 1.179 ± 0.283
3.398PheAsp: 3.398 ± 0.492
1.941PheGlu: 1.941 ± 0.334
0.832PhePhe: 0.832 ± 0.198
2.427PheGly: 2.427 ± 0.262
0.693PheHis: 0.693 ± 0.201
2.358PheIle: 2.358 ± 0.46
1.595PheLys: 1.595 ± 0.244
1.733PheLeu: 1.733 ± 0.383
1.525PheMet: 1.525 ± 0.309
2.635PheAsn: 2.635 ± 0.425
1.04PhePro: 1.04 ± 0.291
1.317PheGln: 1.317 ± 0.276
1.733PheArg: 1.733 ± 0.325
2.427PheSer: 2.427 ± 0.433
2.08PheThr: 2.08 ± 0.409
1.664PheVal: 1.664 ± 0.306
0.901PheTrp: 0.901 ± 0.238
1.387PheTyr: 1.387 ± 0.336
0.0PheXaa: 0.0 ± 0.0
Gly
5.755GlyAla: 5.755 ± 0.64
1.525GlyCys: 1.525 ± 0.35
4.715GlyAsp: 4.715 ± 0.721
5.062GlyGlu: 5.062 ± 0.461
2.427GlyPhe: 2.427 ± 0.457
4.992GlyGly: 4.992 ± 0.712
1.387GlyHis: 1.387 ± 0.343
4.646GlyIle: 4.646 ± 0.476
5.686GlyLys: 5.686 ± 0.57
4.784GlyLeu: 4.784 ± 0.641
2.427GlyMet: 2.427 ± 0.419
3.467GlyAsn: 3.467 ± 0.333
0.763GlyPro: 0.763 ± 0.216
2.08GlyGln: 2.08 ± 0.347
3.952GlyArg: 3.952 ± 0.417
5.2GlySer: 5.2 ± 0.789
3.606GlyThr: 3.606 ± 0.491
5.408GlyVal: 5.408 ± 0.689
0.971GlyTrp: 0.971 ± 0.267
3.744GlyTyr: 3.744 ± 0.606
0.0GlyXaa: 0.0 ± 0.0
His
1.248HisAla: 1.248 ± 0.355
0.416HisCys: 0.416 ± 0.173
1.387HisAsp: 1.387 ± 0.309
0.832HisGlu: 0.832 ± 0.197
0.763HisPhe: 0.763 ± 0.244
2.219HisGly: 2.219 ± 0.399
0.901HisHis: 0.901 ± 0.254
1.04HisIle: 1.04 ± 0.291
1.248HisLys: 1.248 ± 0.303
1.387HisLeu: 1.387 ± 0.268
0.485HisMet: 0.485 ± 0.212
1.179HisAsn: 1.179 ± 0.269
0.693HisPro: 0.693 ± 0.217
0.555HisGln: 0.555 ± 0.187
1.179HisArg: 1.179 ± 0.317
0.693HisSer: 0.693 ± 0.26
0.763HisThr: 0.763 ± 0.247
1.04HisVal: 1.04 ± 0.32
0.069HisTrp: 0.069 ± 0.074
0.693HisTyr: 0.693 ± 0.221
0.0HisXaa: 0.0 ± 0.0
Ile
6.518IleAla: 6.518 ± 0.748
1.109IleCys: 1.109 ± 0.317
5.27IleAsp: 5.27 ± 0.612
4.299IleGlu: 4.299 ± 0.509
2.358IlePhe: 2.358 ± 0.343
5.062IleGly: 5.062 ± 0.614
1.248IleHis: 1.248 ± 0.242
5.547IleIle: 5.547 ± 0.701
4.368IleLys: 4.368 ± 0.561
3.952IleLeu: 3.952 ± 0.533
1.803IleMet: 1.803 ± 0.374
3.536IleAsn: 3.536 ± 0.519
2.566IlePro: 2.566 ± 0.403
2.219IleGln: 2.219 ± 0.388
3.952IleArg: 3.952 ± 0.429
4.646IleSer: 4.646 ± 0.644
3.952IleThr: 3.952 ± 0.584
4.299IleVal: 4.299 ± 0.478
0.555IleTrp: 0.555 ± 0.182
1.803IleTyr: 1.803 ± 0.398
0.0IleXaa: 0.0 ± 0.0
Lys
6.587LysAla: 6.587 ± 0.695
1.317LysCys: 1.317 ± 0.381
3.467LysAsp: 3.467 ± 0.566
2.982LysGlu: 2.982 ± 0.438
2.496LysPhe: 2.496 ± 0.457
3.328LysGly: 3.328 ± 0.603
1.109LysHis: 1.109 ± 0.325
4.784LysIle: 4.784 ± 0.561
4.23LysLys: 4.23 ± 0.665
4.923LysLeu: 4.923 ± 0.597
2.496LysMet: 2.496 ± 0.39
2.774LysAsn: 2.774 ± 0.441
2.149LysPro: 2.149 ± 0.366
2.496LysGln: 2.496 ± 0.366
3.744LysArg: 3.744 ± 0.48
4.646LysSer: 4.646 ± 0.516
3.814LysThr: 3.814 ± 0.515
4.16LysVal: 4.16 ± 0.555
1.248LysTrp: 1.248 ± 0.247
2.496LysTyr: 2.496 ± 0.39
0.0LysXaa: 0.0 ± 0.0
Leu
5.824LeuAla: 5.824 ± 0.908
0.832LeuCys: 0.832 ± 0.211
3.19LeuAsp: 3.19 ± 0.423
4.16LeuGlu: 4.16 ± 0.534
2.288LeuPhe: 2.288 ± 0.375
4.23LeuGly: 4.23 ± 0.546
0.971LeuHis: 0.971 ± 0.261
4.16LeuIle: 4.16 ± 0.608
4.646LeuLys: 4.646 ± 0.656
5.131LeuLeu: 5.131 ± 0.654
1.595LeuMet: 1.595 ± 0.305
2.843LeuAsn: 2.843 ± 0.532
3.259LeuPro: 3.259 ± 0.634
2.704LeuGln: 2.704 ± 0.504
4.784LeuArg: 4.784 ± 0.539
4.992LeuSer: 4.992 ± 0.613
4.923LeuThr: 4.923 ± 0.471
4.715LeuVal: 4.715 ± 0.585
1.525LeuTrp: 1.525 ± 0.291
2.358LeuTyr: 2.358 ± 0.371
0.0LeuXaa: 0.0 ± 0.0
Met
3.536MetAla: 3.536 ± 0.514
0.485MetCys: 0.485 ± 0.183
1.664MetAsp: 1.664 ± 0.332
1.456MetGlu: 1.456 ± 0.302
1.179MetPhe: 1.179 ± 0.346
1.387MetGly: 1.387 ± 0.296
0.693MetHis: 0.693 ± 0.159
2.08MetIle: 2.08 ± 0.428
2.635MetLys: 2.635 ± 0.406
1.872MetLeu: 1.872 ± 0.322
1.248MetMet: 1.248 ± 0.228
1.803MetAsn: 1.803 ± 0.304
1.664MetPro: 1.664 ± 0.374
1.109MetGln: 1.109 ± 0.296
1.456MetArg: 1.456 ± 0.24
2.704MetSer: 2.704 ± 0.473
1.456MetThr: 1.456 ± 0.266
1.456MetVal: 1.456 ± 0.34
0.416MetTrp: 0.416 ± 0.142
0.901MetTyr: 0.901 ± 0.22
0.0MetXaa: 0.0 ± 0.0
Asn
3.952AsnAla: 3.952 ± 0.496
0.347AsnCys: 0.347 ± 0.14
3.19AsnAsp: 3.19 ± 0.509
2.566AsnGlu: 2.566 ± 0.394
1.04AsnPhe: 1.04 ± 0.298
4.923AsnGly: 4.923 ± 0.457
1.456AsnHis: 1.456 ± 0.348
2.982AsnIle: 2.982 ± 0.415
2.704AsnLys: 2.704 ± 0.421
3.606AsnLeu: 3.606 ± 0.485
0.971AsnMet: 0.971 ± 0.25
2.358AsnAsn: 2.358 ± 0.406
1.387AsnPro: 1.387 ± 0.308
1.456AsnGln: 1.456 ± 0.264
2.912AsnArg: 2.912 ± 0.399
3.398AsnSer: 3.398 ± 0.522
2.427AsnThr: 2.427 ± 0.523
2.566AsnVal: 2.566 ± 0.592
0.416AsnTrp: 0.416 ± 0.171
1.664AsnTyr: 1.664 ± 0.382
0.0AsnXaa: 0.0 ± 0.0
Pro
2.774ProAla: 2.774 ± 0.515
0.555ProCys: 0.555 ± 0.174
2.496ProAsp: 2.496 ± 0.457
2.704ProGlu: 2.704 ± 0.461
1.109ProPhe: 1.109 ± 0.245
1.803ProGly: 1.803 ± 0.443
0.485ProHis: 0.485 ± 0.145
1.803ProIle: 1.803 ± 0.332
1.387ProLys: 1.387 ± 0.32
1.733ProLeu: 1.733 ± 0.401
1.04ProMet: 1.04 ± 0.259
1.248ProAsn: 1.248 ± 0.339
0.763ProPro: 0.763 ± 0.241
1.525ProGln: 1.525 ± 0.343
1.248ProArg: 1.248 ± 0.288
2.08ProSer: 2.08 ± 0.411
1.941ProThr: 1.941 ± 0.335
2.912ProVal: 2.912 ± 0.407
0.416ProTrp: 0.416 ± 0.198
1.317ProTyr: 1.317 ± 0.303
0.0ProXaa: 0.0 ± 0.0
Gln
2.704GlnAla: 2.704 ± 0.894
0.485GlnCys: 0.485 ± 0.177
1.525GlnAsp: 1.525 ± 0.305
2.566GlnGlu: 2.566 ± 0.339
2.011GlnPhe: 2.011 ± 0.314
1.803GlnGly: 1.803 ± 0.416
1.317GlnHis: 1.317 ± 0.295
2.358GlnIle: 2.358 ± 0.435
1.872GlnLys: 1.872 ± 0.328
3.814GlnLeu: 3.814 ± 0.605
1.733GlnMet: 1.733 ± 0.323
1.317GlnAsn: 1.317 ± 0.263
1.387GlnPro: 1.387 ± 0.354
3.12GlnGln: 3.12 ± 0.796
2.219GlnArg: 2.219 ± 0.389
2.427GlnSer: 2.427 ± 0.479
2.219GlnThr: 2.219 ± 0.456
2.358GlnVal: 2.358 ± 0.462
0.555GlnTrp: 0.555 ± 0.198
1.387GlnTyr: 1.387 ± 0.272
0.0GlnXaa: 0.0 ± 0.0
Arg
4.715ArgAla: 4.715 ± 0.681
1.387ArgCys: 1.387 ± 0.402
2.635ArgAsp: 2.635 ± 0.392
3.467ArgGlu: 3.467 ± 0.511
1.803ArgPhe: 1.803 ± 0.354
3.12ArgGly: 3.12 ± 0.478
0.763ArgHis: 0.763 ± 0.22
3.883ArgIle: 3.883 ± 0.599
4.438ArgLys: 4.438 ± 0.642
4.576ArgLeu: 4.576 ± 0.555
1.387ArgMet: 1.387 ± 0.298
3.675ArgAsn: 3.675 ± 0.575
1.179ArgPro: 1.179 ± 0.26
1.664ArgGln: 1.664 ± 0.409
2.635ArgArg: 2.635 ± 0.521
3.19ArgSer: 3.19 ± 0.463
1.733ArgThr: 1.733 ± 0.265
3.19ArgVal: 3.19 ± 0.456
0.832ArgTrp: 0.832 ± 0.236
2.774ArgTyr: 2.774 ± 0.374
0.0ArgXaa: 0.0 ± 0.0
Ser
5.963SerAla: 5.963 ± 0.7
1.317SerCys: 1.317 ± 0.267
4.438SerAsp: 4.438 ± 0.574
4.299SerGlu: 4.299 ± 0.492
2.566SerPhe: 2.566 ± 0.423
5.894SerGly: 5.894 ± 0.652
0.832SerHis: 0.832 ± 0.207
4.091SerIle: 4.091 ± 0.593
3.814SerLys: 3.814 ± 0.547
5.616SerLeu: 5.616 ± 0.774
2.011SerMet: 2.011 ± 0.324
2.358SerAsn: 2.358 ± 0.4
2.219SerPro: 2.219 ± 0.446
2.774SerGln: 2.774 ± 0.46
3.259SerArg: 3.259 ± 0.661
3.12SerSer: 3.12 ± 0.563
4.022SerThr: 4.022 ± 0.712
3.675SerVal: 3.675 ± 0.549
1.109SerTrp: 1.109 ± 0.262
2.219SerTyr: 2.219 ± 0.344
0.0SerXaa: 0.0 ± 0.0
Thr
5.616ThrAla: 5.616 ± 1.035
0.485ThrCys: 0.485 ± 0.154
3.12ThrAsp: 3.12 ± 0.473
4.299ThrGlu: 4.299 ± 0.507
1.664ThrPhe: 1.664 ± 0.375
5.062ThrGly: 5.062 ± 0.509
0.693ThrHis: 0.693 ± 0.222
3.259ThrIle: 3.259 ± 0.464
3.19ThrLys: 3.19 ± 0.421
3.19ThrLeu: 3.19 ± 0.614
1.456ThrMet: 1.456 ± 0.256
2.358ThrAsn: 2.358 ± 0.505
2.912ThrPro: 2.912 ± 0.439
1.941ThrGln: 1.941 ± 0.334
2.358ThrArg: 2.358 ± 0.39
3.467ThrSer: 3.467 ± 0.566
3.467ThrThr: 3.467 ± 0.679
3.051ThrVal: 3.051 ± 0.47
0.693ThrTrp: 0.693 ± 0.26
2.011ThrTyr: 2.011 ± 0.404
0.0ThrXaa: 0.0 ± 0.0
Val
5.616ValAla: 5.616 ± 0.661
0.485ValCys: 0.485 ± 0.188
4.299ValAsp: 4.299 ± 0.616
4.368ValGlu: 4.368 ± 0.44
2.358ValPhe: 2.358 ± 0.514
4.022ValGly: 4.022 ± 0.526
1.179ValHis: 1.179 ± 0.326
5.963ValIle: 5.963 ± 0.711
4.16ValLys: 4.16 ± 0.481
3.606ValLeu: 3.606 ± 0.412
2.358ValMet: 2.358 ± 0.335
3.467ValAsn: 3.467 ± 0.461
1.387ValPro: 1.387 ± 0.342
2.496ValGln: 2.496 ± 0.35
2.774ValArg: 2.774 ± 0.366
4.23ValSer: 4.23 ± 0.436
3.883ValThr: 3.883 ± 0.767
4.299ValVal: 4.299 ± 0.717
0.971ValTrp: 0.971 ± 0.298
2.08ValTyr: 2.08 ± 0.328
0.0ValXaa: 0.0 ± 0.0
Trp
0.901TrpAla: 0.901 ± 0.266
0.208TrpCys: 0.208 ± 0.121
0.832TrpAsp: 0.832 ± 0.233
0.485TrpGlu: 0.485 ± 0.23
0.763TrpPhe: 0.763 ± 0.248
0.624TrpGly: 0.624 ± 0.191
0.277TrpHis: 0.277 ± 0.127
0.971TrpIle: 0.971 ± 0.296
1.109TrpLys: 1.109 ± 0.296
1.525TrpLeu: 1.525 ± 0.412
0.347TrpMet: 0.347 ± 0.154
0.485TrpAsn: 0.485 ± 0.185
0.693TrpPro: 0.693 ± 0.192
0.693TrpGln: 0.693 ± 0.189
1.387TrpArg: 1.387 ± 0.331
0.693TrpSer: 0.693 ± 0.2
0.971TrpThr: 0.971 ± 0.222
1.317TrpVal: 1.317 ± 0.267
0.139TrpTrp: 0.139 ± 0.097
0.416TrpTyr: 0.416 ± 0.159
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.774TyrAla: 2.774 ± 0.355
0.555TyrCys: 0.555 ± 0.184
3.328TyrAsp: 3.328 ± 0.397
2.08TyrGlu: 2.08 ± 0.294
1.179TyrPhe: 1.179 ± 0.274
3.19TyrGly: 3.19 ± 0.506
0.971TyrHis: 0.971 ± 0.242
2.427TyrIle: 2.427 ± 0.442
1.525TyrLys: 1.525 ± 0.325
2.219TyrLeu: 2.219 ± 0.436
0.832TyrMet: 0.832 ± 0.205
1.595TyrAsn: 1.595 ± 0.361
1.248TyrPro: 1.248 ± 0.291
1.317TyrGln: 1.317 ± 0.342
1.525TyrArg: 1.525 ± 0.318
3.398TyrSer: 3.398 ± 0.485
2.08TyrThr: 2.08 ± 0.415
3.051TyrVal: 3.051 ± 0.483
0.624TyrTrp: 0.624 ± 0.17
1.387TyrTyr: 1.387 ± 0.271
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 87 proteins (14423 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski