Amino acid dipepetide frequency for Salmonella phage BP63

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.125AlaAla: 8.125 ± 1.034
0.8AlaCys: 0.8 ± 0.213
5.416AlaAsp: 5.416 ± 0.483
6.463AlaGlu: 6.463 ± 0.951
1.846AlaPhe: 1.846 ± 0.353
5.109AlaGly: 5.109 ± 0.646
1.785AlaHis: 1.785 ± 0.446
5.847AlaIle: 5.847 ± 0.618
6.032AlaLys: 6.032 ± 0.617
6.586AlaLeu: 6.586 ± 0.854
2.093AlaMet: 2.093 ± 0.32
4.308AlaAsn: 4.308 ± 0.501
2.585AlaPro: 2.585 ± 0.472
2.77AlaGln: 2.77 ± 0.511
3.631AlaArg: 3.631 ± 0.546
4.986AlaSer: 4.986 ± 0.639
5.786AlaThr: 5.786 ± 0.912
4.862AlaVal: 4.862 ± 0.508
1.477AlaTrp: 1.477 ± 0.343
2.524AlaTyr: 2.524 ± 0.419
0.0AlaXaa: 0.0 ± 0.0
Cys
0.8CysAla: 0.8 ± 0.254
0.123CysCys: 0.123 ± 0.076
1.046CysAsp: 1.046 ± 0.303
0.492CysGlu: 0.492 ± 0.191
0.492CysPhe: 0.492 ± 0.223
0.923CysGly: 0.923 ± 0.23
0.308CysHis: 0.308 ± 0.147
0.677CysIle: 0.677 ± 0.201
1.354CysLys: 1.354 ± 0.273
1.046CysLeu: 1.046 ± 0.302
0.369CysMet: 0.369 ± 0.14
1.169CysAsn: 1.169 ± 0.206
0.369CysPro: 0.369 ± 0.143
0.246CysGln: 0.246 ± 0.132
0.615CysArg: 0.615 ± 0.189
0.369CysSer: 0.369 ± 0.168
0.492CysThr: 0.492 ± 0.185
0.677CysVal: 0.677 ± 0.234
0.0CysTrp: 0.0 ± 0.0
0.615CysTyr: 0.615 ± 0.152
0.0CysXaa: 0.0 ± 0.0
Asp
4.185AspAla: 4.185 ± 0.591
1.293AspCys: 1.293 ± 0.243
3.139AspAsp: 3.139 ± 0.423
5.416AspGlu: 5.416 ± 0.518
2.524AspPhe: 2.524 ± 0.352
5.663AspGly: 5.663 ± 0.701
0.615AspHis: 0.615 ± 0.224
4.678AspIle: 4.678 ± 0.467
3.385AspLys: 3.385 ± 0.433
4.801AspLeu: 4.801 ± 0.493
1.908AspMet: 1.908 ± 0.292
3.508AspAsn: 3.508 ± 0.567
2.831AspPro: 2.831 ± 0.38
2.093AspGln: 2.093 ± 0.343
2.462AspArg: 2.462 ± 0.452
3.755AspSer: 3.755 ± 0.573
2.893AspThr: 2.893 ± 0.434
5.293AspVal: 5.293 ± 0.459
1.539AspTrp: 1.539 ± 0.357
2.277AspTyr: 2.277 ± 0.406
0.0AspXaa: 0.0 ± 0.0
Glu
5.109GluAla: 5.109 ± 0.914
0.431GluCys: 0.431 ± 0.154
3.139GluAsp: 3.139 ± 0.448
5.232GluGlu: 5.232 ± 0.883
2.831GluPhe: 2.831 ± 0.483
4.801GluGly: 4.801 ± 0.482
1.231GluHis: 1.231 ± 0.311
2.708GluIle: 2.708 ± 0.418
3.631GluLys: 3.631 ± 0.602
7.14GluLeu: 7.14 ± 0.707
2.277GluMet: 2.277 ± 0.321
2.339GluAsn: 2.339 ± 0.415
1.723GluPro: 1.723 ± 0.321
2.77GluGln: 2.77 ± 0.413
3.324GluArg: 3.324 ± 0.421
4.185GluSer: 4.185 ± 0.557
2.462GluThr: 2.462 ± 0.33
4.432GluVal: 4.432 ± 0.46
0.923GluTrp: 0.923 ± 0.224
2.77GluTyr: 2.77 ± 0.436
0.0GluXaa: 0.0 ± 0.0
Phe
2.154PheAla: 2.154 ± 0.332
0.369PheCys: 0.369 ± 0.145
2.4PheAsp: 2.4 ± 0.423
1.97PheGlu: 1.97 ± 0.348
1.108PhePhe: 1.108 ± 0.296
2.339PheGly: 2.339 ± 0.412
0.554PheHis: 0.554 ± 0.179
2.216PheIle: 2.216 ± 0.426
1.908PheLys: 1.908 ± 0.338
2.647PheLeu: 2.647 ± 0.424
1.046PheMet: 1.046 ± 0.243
3.016PheAsn: 3.016 ± 0.379
1.046PhePro: 1.046 ± 0.239
1.046PheGln: 1.046 ± 0.299
1.846PheArg: 1.846 ± 0.417
2.154PheSer: 2.154 ± 0.348
2.893PheThr: 2.893 ± 0.538
2.216PheVal: 2.216 ± 0.422
0.369PheTrp: 0.369 ± 0.151
0.923PheTyr: 0.923 ± 0.255
0.0PheXaa: 0.0 ± 0.0
Gly
5.539GlyAla: 5.539 ± 0.743
0.923GlyCys: 0.923 ± 0.247
3.631GlyAsp: 3.631 ± 0.489
3.385GlyGlu: 3.385 ± 0.512
2.647GlyPhe: 2.647 ± 0.473
5.663GlyGly: 5.663 ± 0.894
1.785GlyHis: 1.785 ± 0.343
4.616GlyIle: 4.616 ± 0.58
5.416GlyLys: 5.416 ± 0.578
5.355GlyLeu: 5.355 ± 0.566
1.354GlyMet: 1.354 ± 0.28
3.693GlyAsn: 3.693 ± 0.494
1.908GlyPro: 1.908 ± 0.333
2.216GlyGln: 2.216 ± 0.321
2.831GlyArg: 2.831 ± 0.542
4.062GlySer: 4.062 ± 0.41
4.739GlyThr: 4.739 ± 0.686
5.97GlyVal: 5.97 ± 0.695
1.354GlyTrp: 1.354 ± 0.399
2.708GlyTyr: 2.708 ± 0.42
0.0GlyXaa: 0.0 ± 0.0
His
0.739HisAla: 0.739 ± 0.212
0.185HisCys: 0.185 ± 0.101
0.923HisAsp: 0.923 ± 0.251
1.046HisGlu: 1.046 ± 0.243
0.431HisPhe: 0.431 ± 0.16
1.108HisGly: 1.108 ± 0.247
0.369HisHis: 0.369 ± 0.172
1.354HisIle: 1.354 ± 0.349
1.046HisLys: 1.046 ± 0.23
1.293HisLeu: 1.293 ± 0.271
0.615HisMet: 0.615 ± 0.188
1.108HisAsn: 1.108 ± 0.223
0.923HisPro: 0.923 ± 0.285
0.923HisGln: 0.923 ± 0.205
0.431HisArg: 0.431 ± 0.19
1.108HisSer: 1.108 ± 0.263
0.739HisThr: 0.739 ± 0.257
1.108HisVal: 1.108 ± 0.226
0.246HisTrp: 0.246 ± 0.112
1.108HisTyr: 1.108 ± 0.244
0.0HisXaa: 0.0 ± 0.0
Ile
5.293IleAla: 5.293 ± 0.523
0.615IleCys: 0.615 ± 0.262
3.755IleAsp: 3.755 ± 0.522
3.385IleGlu: 3.385 ± 0.374
2.093IlePhe: 2.093 ± 0.321
3.631IleGly: 3.631 ± 0.543
1.108IleHis: 1.108 ± 0.251
3.201IleIle: 3.201 ± 0.415
4.493IleLys: 4.493 ± 0.504
4.062IleLeu: 4.062 ± 0.586
0.739IleMet: 0.739 ± 0.194
3.939IleAsn: 3.939 ± 0.647
2.954IlePro: 2.954 ± 0.499
2.154IleGln: 2.154 ± 0.409
2.647IleArg: 2.647 ± 0.355
2.708IleSer: 2.708 ± 0.431
4.308IleThr: 4.308 ± 0.552
4.432IleVal: 4.432 ± 0.492
0.862IleTrp: 0.862 ± 0.295
2.77IleTyr: 2.77 ± 0.397
0.0IleXaa: 0.0 ± 0.0
Lys
5.97LysAla: 5.97 ± 0.95
0.615LysCys: 0.615 ± 0.181
3.447LysAsp: 3.447 ± 0.524
4.555LysGlu: 4.555 ± 0.656
2.462LysPhe: 2.462 ± 0.514
3.201LysGly: 3.201 ± 0.426
1.231LysHis: 1.231 ± 0.246
2.893LysIle: 2.893 ± 0.401
3.016LysLys: 3.016 ± 0.528
5.293LysLeu: 5.293 ± 0.628
2.154LysMet: 2.154 ± 0.43
2.277LysAsn: 2.277 ± 0.369
3.631LysPro: 3.631 ± 0.443
3.077LysGln: 3.077 ± 0.421
2.462LysArg: 2.462 ± 0.472
3.385LysSer: 3.385 ± 0.501
3.324LysThr: 3.324 ± 0.501
4.801LysVal: 4.801 ± 0.545
1.046LysTrp: 1.046 ± 0.219
2.216LysTyr: 2.216 ± 0.443
0.0LysXaa: 0.0 ± 0.0
Leu
6.77LeuAla: 6.77 ± 0.609
1.416LeuCys: 1.416 ± 0.353
5.355LeuAsp: 5.355 ± 0.503
5.232LeuGlu: 5.232 ± 0.634
2.524LeuPhe: 2.524 ± 0.386
4.555LeuGly: 4.555 ± 0.547
1.108LeuHis: 1.108 ± 0.288
4.062LeuIle: 4.062 ± 0.563
5.109LeuLys: 5.109 ± 0.637
5.17LeuLeu: 5.17 ± 0.616
2.4LeuMet: 2.4 ± 0.392
3.324LeuAsn: 3.324 ± 0.439
4.432LeuPro: 4.432 ± 0.564
3.201LeuGln: 3.201 ± 0.528
4.555LeuArg: 4.555 ± 0.436
4.001LeuSer: 4.001 ± 0.52
6.34LeuThr: 6.34 ± 0.686
5.232LeuVal: 5.232 ± 0.576
1.293LeuTrp: 1.293 ± 0.336
2.524LeuTyr: 2.524 ± 0.336
0.0LeuXaa: 0.0 ± 0.0
Met
2.462MetAla: 2.462 ± 0.432
0.492MetCys: 0.492 ± 0.159
2.462MetAsp: 2.462 ± 0.323
1.416MetGlu: 1.416 ± 0.354
1.046MetPhe: 1.046 ± 0.251
1.169MetGly: 1.169 ± 0.239
0.431MetHis: 0.431 ± 0.136
1.293MetIle: 1.293 ± 0.261
1.416MetLys: 1.416 ± 0.26
2.093MetLeu: 2.093 ± 0.28
0.985MetMet: 0.985 ± 0.214
1.169MetAsn: 1.169 ± 0.218
1.293MetPro: 1.293 ± 0.281
1.169MetGln: 1.169 ± 0.305
1.846MetArg: 1.846 ± 0.352
2.462MetSer: 2.462 ± 0.337
2.216MetThr: 2.216 ± 0.367
1.6MetVal: 1.6 ± 0.283
0.123MetTrp: 0.123 ± 0.095
0.862MetTyr: 0.862 ± 0.242
0.0MetXaa: 0.0 ± 0.0
Asn
4.678AsnAla: 4.678 ± 0.536
0.677AsnCys: 0.677 ± 0.228
3.385AsnAsp: 3.385 ± 0.5
2.339AsnGlu: 2.339 ± 0.325
1.908AsnPhe: 1.908 ± 0.25
5.047AsnGly: 5.047 ± 0.493
0.431AsnHis: 0.431 ± 0.139
3.816AsnIle: 3.816 ± 0.521
3.508AsnLys: 3.508 ± 0.455
3.447AsnLeu: 3.447 ± 0.52
1.231AsnMet: 1.231 ± 0.34
2.154AsnAsn: 2.154 ± 0.369
2.954AsnPro: 2.954 ± 0.335
1.785AsnGln: 1.785 ± 0.324
2.462AsnArg: 2.462 ± 0.421
2.462AsnSer: 2.462 ± 0.466
3.447AsnThr: 3.447 ± 0.358
3.631AsnVal: 3.631 ± 0.45
1.662AsnTrp: 1.662 ± 0.276
1.354AsnTyr: 1.354 ± 0.29
0.0AsnXaa: 0.0 ± 0.0
Pro
3.508ProAla: 3.508 ± 0.377
0.492ProCys: 0.492 ± 0.164
3.693ProAsp: 3.693 ± 0.566
2.954ProGlu: 2.954 ± 0.396
1.6ProPhe: 1.6 ± 0.416
2.77ProGly: 2.77 ± 0.4
0.431ProHis: 0.431 ± 0.151
1.723ProIle: 1.723 ± 0.279
1.97ProLys: 1.97 ± 0.346
3.077ProLeu: 3.077 ± 0.415
1.231ProMet: 1.231 ± 0.31
2.216ProAsn: 2.216 ± 0.342
2.216ProPro: 2.216 ± 0.315
1.416ProGln: 1.416 ± 0.257
2.154ProArg: 2.154 ± 0.42
2.277ProSer: 2.277 ± 0.517
3.139ProThr: 3.139 ± 0.351
3.447ProVal: 3.447 ± 0.465
0.615ProTrp: 0.615 ± 0.161
1.354ProTyr: 1.354 ± 0.32
0.0ProXaa: 0.0 ± 0.0
Gln
4.062GlnAla: 4.062 ± 0.645
0.246GlnCys: 0.246 ± 0.11
1.846GlnAsp: 1.846 ± 0.276
2.339GlnGlu: 2.339 ± 0.416
1.723GlnPhe: 1.723 ± 0.343
1.97GlnGly: 1.97 ± 0.387
0.677GlnHis: 0.677 ± 0.182
2.77GlnIle: 2.77 ± 0.374
1.846GlnLys: 1.846 ± 0.324
4.37GlnLeu: 4.37 ± 0.445
1.6GlnMet: 1.6 ± 0.319
1.539GlnAsn: 1.539 ± 0.328
1.354GlnPro: 1.354 ± 0.349
1.846GlnGln: 1.846 ± 0.389
1.662GlnArg: 1.662 ± 0.295
1.723GlnSer: 1.723 ± 0.255
1.416GlnThr: 1.416 ± 0.31
2.277GlnVal: 2.277 ± 0.398
0.739GlnTrp: 0.739 ± 0.169
1.477GlnTyr: 1.477 ± 0.276
0.0GlnXaa: 0.0 ± 0.0
Arg
3.631ArgAla: 3.631 ± 0.57
0.615ArgCys: 0.615 ± 0.177
2.4ArgAsp: 2.4 ± 0.426
2.77ArgGlu: 2.77 ± 0.387
1.354ArgPhe: 1.354 ± 0.283
3.201ArgGly: 3.201 ± 0.487
0.985ArgHis: 0.985 ± 0.27
3.447ArgIle: 3.447 ± 0.513
3.262ArgLys: 3.262 ± 0.545
3.508ArgLeu: 3.508 ± 0.429
1.293ArgMet: 1.293 ± 0.257
2.4ArgAsn: 2.4 ± 0.301
1.908ArgPro: 1.908 ± 0.319
2.524ArgGln: 2.524 ± 0.324
2.585ArgArg: 2.585 ± 0.4
2.216ArgSer: 2.216 ± 0.329
3.077ArgThr: 3.077 ± 0.369
3.016ArgVal: 3.016 ± 0.427
0.8ArgTrp: 0.8 ± 0.198
1.416ArgTyr: 1.416 ± 0.291
0.0ArgXaa: 0.0 ± 0.0
Ser
4.555SerAla: 4.555 ± 0.623
0.615SerCys: 0.615 ± 0.236
3.939SerAsp: 3.939 ± 0.59
3.385SerGlu: 3.385 ± 0.375
2.093SerPhe: 2.093 ± 0.27
5.109SerGly: 5.109 ± 0.505
0.554SerHis: 0.554 ± 0.193
3.447SerIle: 3.447 ± 0.533
3.324SerLys: 3.324 ± 0.479
4.124SerLeu: 4.124 ± 0.48
1.477SerMet: 1.477 ± 0.318
3.324SerAsn: 3.324 ± 0.419
2.339SerPro: 2.339 ± 0.635
1.416SerGln: 1.416 ± 0.281
2.154SerArg: 2.154 ± 0.347
3.755SerSer: 3.755 ± 0.536
4.124SerThr: 4.124 ± 0.529
4.801SerVal: 4.801 ± 0.758
0.492SerTrp: 0.492 ± 0.198
2.524SerTyr: 2.524 ± 0.438
0.0SerXaa: 0.0 ± 0.0
Thr
6.032ThrAla: 6.032 ± 0.792
0.554ThrCys: 0.554 ± 0.206
4.062ThrAsp: 4.062 ± 0.725
3.324ThrGlu: 3.324 ± 0.498
1.97ThrPhe: 1.97 ± 0.313
5.047ThrGly: 5.047 ± 0.624
0.923ThrHis: 0.923 ± 0.224
3.139ThrIle: 3.139 ± 0.473
3.57ThrLys: 3.57 ± 0.428
5.293ThrLeu: 5.293 ± 0.776
1.662ThrMet: 1.662 ± 0.401
3.139ThrAsn: 3.139 ± 0.495
2.831ThrPro: 2.831 ± 0.428
1.846ThrGln: 1.846 ± 0.347
2.585ThrArg: 2.585 ± 0.474
4.001ThrSer: 4.001 ± 0.562
3.447ThrThr: 3.447 ± 0.559
7.201ThrVal: 7.201 ± 1.271
1.108ThrTrp: 1.108 ± 0.27
2.462ThrTyr: 2.462 ± 0.433
0.0ThrXaa: 0.0 ± 0.0
Val
5.724ValAla: 5.724 ± 0.8
0.739ValCys: 0.739 ± 0.264
5.724ValAsp: 5.724 ± 0.706
4.924ValGlu: 4.924 ± 0.616
1.723ValPhe: 1.723 ± 0.287
4.986ValGly: 4.986 ± 0.515
0.862ValHis: 0.862 ± 0.253
3.878ValIle: 3.878 ± 0.598
4.616ValLys: 4.616 ± 0.522
5.724ValLeu: 5.724 ± 0.579
2.339ValMet: 2.339 ± 0.344
4.555ValAsn: 4.555 ± 0.504
2.77ValPro: 2.77 ± 0.424
2.954ValGln: 2.954 ± 0.426
2.954ValArg: 2.954 ± 0.422
4.37ValSer: 4.37 ± 0.602
6.647ValThr: 6.647 ± 1.32
5.909ValVal: 5.909 ± 0.741
1.108ValTrp: 1.108 ± 0.228
2.647ValTyr: 2.647 ± 0.467
0.0ValXaa: 0.0 ± 0.0
Trp
1.108TrpAla: 1.108 ± 0.245
0.308TrpCys: 0.308 ± 0.131
1.354TrpAsp: 1.354 ± 0.254
0.615TrpGlu: 0.615 ± 0.245
0.8TrpPhe: 0.8 ± 0.195
1.169TrpGly: 1.169 ± 0.282
0.369TrpHis: 0.369 ± 0.158
0.985TrpIle: 0.985 ± 0.176
0.492TrpLys: 0.492 ± 0.176
1.293TrpLeu: 1.293 ± 0.317
0.369TrpMet: 0.369 ± 0.134
0.554TrpAsn: 0.554 ± 0.16
0.615TrpPro: 0.615 ± 0.209
0.369TrpGln: 0.369 ± 0.13
0.923TrpArg: 0.923 ± 0.227
1.539TrpSer: 1.539 ± 0.393
0.739TrpThr: 0.739 ± 0.213
1.416TrpVal: 1.416 ± 0.233
0.369TrpTrp: 0.369 ± 0.166
1.231TrpTyr: 1.231 ± 0.3
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.585TyrAla: 2.585 ± 0.391
0.677TyrCys: 0.677 ± 0.206
3.139TyrAsp: 3.139 ± 0.418
2.216TyrGlu: 2.216 ± 0.44
1.046TyrPhe: 1.046 ± 0.268
2.339TyrGly: 2.339 ± 0.366
1.046TyrHis: 1.046 ± 0.285
2.4TyrIle: 2.4 ± 0.302
1.723TyrLys: 1.723 ± 0.309
2.277TyrLeu: 2.277 ± 0.466
0.8TyrMet: 0.8 ± 0.2
2.647TyrAsn: 2.647 ± 0.449
1.6TyrPro: 1.6 ± 0.346
1.662TyrGln: 1.662 ± 0.295
2.277TyrArg: 2.277 ± 0.379
2.031TyrSer: 2.031 ± 0.338
1.97TyrThr: 1.97 ± 0.342
2.831TyrVal: 2.831 ± 0.447
0.492TyrTrp: 0.492 ± 0.172
1.6TyrTyr: 1.6 ± 0.354
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 76 proteins (16248 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski