Amino acid dipeptide frequency for Escherichia phage Skarpretter

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
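As a minimal sketch of how such frequencies could be computed: the function below counts overlapping adjacent residue pairs and reports them in per mille. The function name, the one-letter sequence encoding, and the choice to count pairs only within each protein are assumptions made for illustration; the exact normalization used for this page is not stated here.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Count overlapping adjacent pairs inside each protein only, so that
        # no dipeptide spans the boundary between two different proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input for illustration only, not data from this proteome.
freqs = dipeptide_per_mille(["MAAK", "AAG"])
```

In the toy input there are five dipeptides in total (MA, AA, AK, AA, AG), so AA accounts for two of the five.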

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred in proteins with equal probability, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility the dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
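The comparison against the uniform expectation can be sketched as follows. The helper name `classify` and the plain greater-than or less-than rule are assumptions for illustration; whether the original colouring also takes the error margins into account is not stated. The three observed values are taken from the table on this page.

```python
N_STANDARD = 20
# 1 / 400 of all dipeptides, expressed in per mille (0.25% = 2.5 per mille).
EXPECTED_PER_MILLE = 1000.0 / (N_STANDARD ** 2)


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"   # would be marked red
    if freq_per_mille < expected:
        return "under"  # would be marked blue
    return "expected"


# Values in per mille, copied from the table.
observed = {"AlaAla": 16.315, "CysCys": 0.076, "AspSer": 3.187}
labels = {pair: classify(value) for pair, value in observed.items()}
```

Under this simple rule, AlaAla and AspSer are overrepresented and CysCys is underrepresented.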

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
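A small sketch of the unit conversion, with hypothetical helper names, using the AlaAla value from the table as the example:

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1 per mille = 0.1%)."""
    return value * 0.1


ala_ala = 16.315  # per mille, from the table
fraction = per_mille_to_fraction(ala_ala)  # about 0.016315
percent = per_mille_to_percent(ala_ala)    # about 1.6315
```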

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 16.315 ± 2.113
AlaCys: 1.138 ± 0.304
AlaAsp: 7.209 ± 0.894
AlaGlu: 8.499 ± 1.086
AlaPhe: 3.415 ± 0.533
AlaGly: 8.954 ± 0.686
AlaHis: 2.201 ± 0.575
AlaIle: 5.615 ± 0.524
AlaLys: 6.526 ± 0.868
AlaLeu: 8.803 ± 0.962
AlaMet: 3.187 ± 0.399
AlaAsn: 5.767 ± 0.762
AlaPro: 3.642 ± 0.575
AlaGln: 6.526 ± 1.092
AlaArg: 6.678 ± 0.953
AlaSer: 4.857 ± 0.538
AlaThr: 5.919 ± 0.865
AlaVal: 6.905 ± 0.617
AlaTrp: 0.986 ± 0.216
AlaTyr: 2.808 ± 0.432
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.607 ± 0.258
CysCys: 0.076 ± 0.082
CysAsp: 0.304 ± 0.165
CysGlu: 0.683 ± 0.237
CysPhe: 0.379 ± 0.156
CysGly: 1.821 ± 0.455
CysHis: 0.379 ± 0.243
CysIle: 0.304 ± 0.162
CysLys: 0.531 ± 0.284
CysLeu: 0.759 ± 0.247
CysMet: 0.228 ± 0.114
CysAsn: 0.759 ± 0.226
CysPro: 0.304 ± 0.17
CysGln: 0.455 ± 0.19
CysArg: 0.835 ± 0.272
CysSer: 0.683 ± 0.252
CysThr: 0.531 ± 0.28
CysVal: 1.138 ± 0.258
CysTrp: 0.304 ± 0.188
CysTyr: 0.304 ± 0.168
CysXaa: 0.0 ± 0.0
Asp
AspAla: 8.195 ± 0.845
AspCys: 0.986 ± 0.353
AspAsp: 3.794 ± 0.734
AspGlu: 3.794 ± 0.376
AspPhe: 2.049 ± 0.388
AspGly: 5.388 ± 0.661
AspHis: 1.29 ± 0.324
AspIle: 2.049 ± 0.376
AspLys: 3.263 ± 0.492
AspLeu: 5.767 ± 0.692
AspMet: 1.669 ± 0.346
AspAsn: 2.125 ± 0.469
AspPro: 4.174 ± 0.614
AspGln: 2.959 ± 0.547
AspArg: 3.491 ± 0.573
AspSer: 3.187 ± 0.541
AspThr: 2.656 ± 0.491
AspVal: 3.794 ± 0.592
AspTrp: 1.062 ± 0.207
AspTyr: 2.884 ± 0.412
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.691 ± 0.69
GluCys: 0.531 ± 0.301
GluAsp: 4.174 ± 0.509
GluGlu: 2.808 ± 0.525
GluPhe: 1.518 ± 0.31
GluGly: 4.325 ± 0.463
GluHis: 1.062 ± 0.329
GluIle: 4.098 ± 0.712
GluLys: 3.946 ± 0.737
GluLeu: 4.25 ± 0.497
GluMet: 1.442 ± 0.321
GluAsn: 2.808 ± 0.475
GluPro: 2.277 ± 0.378
GluGln: 2.352 ± 0.473
GluArg: 4.325 ± 0.682
GluSer: 3.263 ± 0.534
GluThr: 2.656 ± 0.558
GluVal: 4.022 ± 0.528
GluTrp: 1.442 ± 0.307
GluTyr: 2.656 ± 0.391
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.111 ± 0.478
PheCys: 0.759 ± 0.273
PheAsp: 3.111 ± 0.449
PheGlu: 1.745 ± 0.309
PhePhe: 0.986 ± 0.23
PheGly: 2.58 ± 0.417
PheHis: 0.683 ± 0.237
PheIle: 1.366 ± 0.379
PheLys: 2.201 ± 0.339
PheLeu: 1.442 ± 0.416
PheMet: 1.214 ± 0.442
PheAsn: 2.125 ± 0.406
PhePro: 0.986 ± 0.275
PheGln: 0.759 ± 0.297
PheArg: 1.821 ± 0.37
PheSer: 1.29 ± 0.286
PheThr: 2.277 ± 0.473
PheVal: 1.442 ± 0.34
PheTrp: 0.607 ± 0.194
PheTyr: 0.683 ± 0.201
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.651 ± 1.088
GlyCys: 0.835 ± 0.321
GlyAsp: 3.87 ± 0.73
GlyGlu: 4.477 ± 0.499
GlyPhe: 3.491 ± 0.505
GlyGly: 7.892 ± 1.215
GlyHis: 0.911 ± 0.257
GlyIle: 4.477 ± 0.601
GlyLys: 5.084 ± 0.642
GlyLeu: 5.388 ± 0.685
GlyMet: 1.29 ± 0.345
GlyAsn: 3.035 ± 0.628
GlyPro: 2.959 ± 0.493
GlyGln: 2.959 ± 0.779
GlyArg: 4.25 ± 0.583
GlySer: 4.932 ± 0.527
GlyThr: 4.781 ± 0.786
GlyVal: 5.008 ± 0.609
GlyTrp: 1.594 ± 0.363
GlyTyr: 3.339 ± 0.501
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.594 ± 0.391
HisCys: 0.379 ± 0.176
HisAsp: 1.214 ± 0.245
HisGlu: 0.759 ± 0.25
HisPhe: 0.379 ± 0.174
HisGly: 1.062 ± 0.232
HisHis: 0.607 ± 0.342
HisIle: 1.062 ± 0.32
HisLys: 0.911 ± 0.313
HisLeu: 1.062 ± 0.235
HisMet: 0.379 ± 0.187
HisAsn: 0.683 ± 0.297
HisPro: 1.366 ± 0.357
HisGln: 0.607 ± 0.183
HisArg: 1.29 ± 0.298
HisSer: 0.759 ± 0.221
HisThr: 0.531 ± 0.204
HisVal: 1.29 ± 0.384
HisTrp: 0.228 ± 0.129
HisTyr: 0.759 ± 0.306
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.147 ± 0.665
IleCys: 0.455 ± 0.21
IleAsp: 4.477 ± 0.52
IleGlu: 4.022 ± 0.564
IlePhe: 0.986 ± 0.222
IleGly: 2.959 ± 0.569
IleHis: 0.835 ± 0.335
IleIle: 2.352 ± 0.648
IleLys: 2.884 ± 0.49
IleLeu: 2.884 ± 0.576
IleMet: 1.594 ± 0.286
IleAsn: 1.669 ± 0.391
IlePro: 2.808 ± 0.496
IleGln: 1.745 ± 0.411
IleArg: 2.732 ± 0.385
IleSer: 2.656 ± 0.4
IleThr: 4.325 ± 0.631
IleVal: 3.339 ± 0.586
IleTrp: 0.911 ± 0.259
IleTyr: 1.518 ± 0.407
IleXaa: 0.0 ± 0.0
Lys
LysAla: 8.803 ± 1.007
LysCys: 0.228 ± 0.148
LysAsp: 4.098 ± 0.605
LysGlu: 3.491 ± 0.55
LysPhe: 1.669 ± 0.299
LysGly: 3.794 ± 0.487
LysHis: 0.759 ± 0.25
LysIle: 2.504 ± 0.38
LysLys: 3.946 ± 0.599
LysLeu: 2.959 ± 0.461
LysMet: 1.518 ± 0.416
LysAsn: 2.277 ± 0.406
LysPro: 2.656 ± 0.561
LysGln: 2.58 ± 0.435
LysArg: 4.325 ± 0.721
LysSer: 2.959 ± 0.393
LysThr: 3.87 ± 0.738
LysVal: 3.263 ± 0.377
LysTrp: 0.835 ± 0.261
LysTyr: 1.518 ± 0.324
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.486 ± 1.148
LeuCys: 0.455 ± 0.182
LeuAsp: 4.401 ± 0.523
LeuGlu: 4.174 ± 0.621
LeuPhe: 2.504 ± 0.461
LeuGly: 4.857 ± 1.131
LeuHis: 0.911 ± 0.288
LeuIle: 3.491 ± 0.517
LeuLys: 3.718 ± 0.617
LeuLeu: 4.325 ± 0.545
LeuMet: 2.201 ± 0.386
LeuAsn: 3.946 ± 0.537
LeuPro: 2.884 ± 0.461
LeuGln: 3.415 ± 0.495
LeuArg: 4.325 ± 0.568
LeuSer: 3.415 ± 0.444
LeuThr: 5.54 ± 0.599
LeuVal: 4.022 ± 0.715
LeuTrp: 0.835 ± 0.228
LeuTyr: 1.821 ± 0.392
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.491 ± 0.393
MetCys: 0.076 ± 0.085
MetAsp: 1.214 ± 0.369
MetGlu: 0.911 ± 0.279
MetPhe: 0.835 ± 0.231
MetGly: 1.745 ± 0.365
MetHis: 0.228 ± 0.165
MetIle: 1.973 ± 0.453
MetLys: 2.201 ± 0.512
MetLeu: 2.277 ± 0.353
MetMet: 0.607 ± 0.262
MetAsn: 0.835 ± 0.272
MetPro: 1.745 ± 0.301
MetGln: 1.973 ± 0.323
MetArg: 1.669 ± 0.363
MetSer: 1.745 ± 0.417
MetThr: 0.759 ± 0.203
MetVal: 1.669 ± 0.302
MetTrp: 0.455 ± 0.17
MetTyr: 0.835 ± 0.201
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.932 ± 0.67
AsnCys: 0.759 ± 0.231
AsnAsp: 3.111 ± 0.438
AsnGlu: 2.656 ± 0.436
AsnPhe: 0.683 ± 0.325
AsnGly: 4.629 ± 0.524
AsnHis: 0.835 ± 0.236
AsnIle: 2.277 ± 0.443
AsnLys: 2.884 ± 0.437
AsnLeu: 2.58 ± 0.518
AsnMet: 0.986 ± 0.261
AsnAsn: 1.745 ± 0.395
AsnPro: 3.035 ± 0.434
AsnGln: 3.035 ± 0.771
AsnArg: 2.352 ± 0.451
AsnSer: 1.973 ± 0.311
AsnThr: 2.352 ± 0.467
AsnVal: 3.415 ± 0.763
AsnTrp: 0.455 ± 0.178
AsnTyr: 0.835 ± 0.242
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.298 ± 1.162
ProCys: 0.531 ± 0.228
ProAsp: 4.022 ± 0.63
ProGlu: 3.415 ± 0.486
ProPhe: 1.29 ± 0.252
ProGly: 3.946 ± 0.666
ProHis: 0.531 ± 0.17
ProIle: 1.897 ± 0.42
ProLys: 1.897 ± 0.433
ProLeu: 3.415 ± 0.522
ProMet: 0.835 ± 0.308
ProAsn: 1.442 ± 0.39
ProPro: 1.518 ± 0.341
ProGln: 2.884 ± 0.475
ProArg: 1.745 ± 0.322
ProSer: 1.745 ± 0.519
ProThr: 2.959 ± 0.475
ProVal: 2.428 ± 0.36
ProTrp: 0.531 ± 0.182
ProTyr: 2.049 ± 0.426
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.995 ± 0.843
GlnCys: 0.228 ± 0.132
GlnAsp: 2.656 ± 0.47
GlnGlu: 2.732 ± 0.534
GlnPhe: 1.518 ± 0.38
GlnGly: 3.187 ± 0.73
GlnHis: 0.911 ± 0.262
GlnIle: 2.808 ± 0.478
GlnLys: 2.201 ± 0.497
GlnLeu: 3.87 ± 0.508
GlnMet: 1.594 ± 0.453
GlnAsn: 2.125 ± 0.519
GlnPro: 1.669 ± 0.473
GlnGln: 4.629 ± 1.323
GlnArg: 3.415 ± 0.536
GlnSer: 1.594 ± 0.37
GlnThr: 2.732 ± 0.466
GlnVal: 3.035 ± 0.47
GlnTrp: 0.683 ± 0.224
GlnTyr: 1.594 ± 0.406
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.312 ± 0.832
ArgCys: 0.607 ± 0.35
ArgAsp: 3.263 ± 0.454
ArgGlu: 3.339 ± 0.464
ArgPhe: 2.201 ± 0.419
ArgGly: 4.022 ± 0.558
ArgHis: 1.214 ± 0.356
ArgIle: 3.642 ± 0.693
ArgLys: 3.794 ± 0.584
ArgLeu: 5.16 ± 0.871
ArgMet: 1.745 ± 0.383
ArgAsn: 3.035 ± 0.436
ArgPro: 2.428 ± 0.439
ArgGln: 2.732 ± 0.394
ArgArg: 3.339 ± 0.611
ArgSer: 1.973 ± 0.415
ArgThr: 2.428 ± 0.402
ArgVal: 4.098 ± 0.526
ArgTrp: 0.911 ± 0.272
ArgTyr: 2.884 ± 0.486
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.098 ± 0.555
SerCys: 0.379 ± 0.147
SerAsp: 2.656 ± 0.475
SerGlu: 2.58 ± 0.5
SerPhe: 1.366 ± 0.324
SerGly: 4.629 ± 0.527
SerHis: 0.759 ± 0.325
SerIle: 1.669 ± 0.38
SerLys: 3.187 ± 0.478
SerLeu: 3.035 ± 0.456
SerMet: 1.442 ± 0.351
SerAsn: 3.263 ± 0.721
SerPro: 2.352 ± 0.48
SerGln: 1.897 ± 0.398
SerArg: 2.201 ± 0.408
SerSer: 2.352 ± 0.351
SerThr: 2.732 ± 0.383
SerVal: 3.567 ± 0.614
SerTrp: 0.835 ± 0.297
SerTyr: 1.366 ± 0.349
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.678 ± 1.005
ThrCys: 0.759 ± 0.267
ThrAsp: 3.567 ± 0.49
ThrGlu: 3.187 ± 0.516
ThrPhe: 1.973 ± 0.359
ThrGly: 5.16 ± 0.658
ThrHis: 0.455 ± 0.341
ThrIle: 2.808 ± 0.535
ThrLys: 2.732 ± 0.481
ThrLeu: 3.794 ± 0.561
ThrMet: 1.669 ± 0.278
ThrAsn: 2.656 ± 0.367
ThrPro: 4.857 ± 0.641
ThrGln: 2.58 ± 0.417
ThrArg: 3.035 ± 0.472
ThrSer: 2.201 ± 0.547
ThrThr: 4.098 ± 0.583
ThrVal: 4.25 ± 0.612
ThrTrp: 0.759 ± 0.243
ThrTyr: 1.366 ± 0.431
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.905 ± 0.872
ValCys: 1.29 ± 0.327
ValAsp: 4.174 ± 0.505
ValGlu: 3.415 ± 0.613
ValPhe: 2.656 ± 0.49
ValGly: 4.174 ± 0.566
ValHis: 0.986 ± 0.305
ValIle: 4.857 ± 0.724
ValLys: 3.794 ± 0.511
ValLeu: 3.642 ± 0.445
ValMet: 2.277 ± 0.407
ValAsn: 2.959 ± 0.418
ValPro: 2.201 ± 0.483
ValGln: 1.821 ± 0.439
ValArg: 3.263 ± 0.501
ValSer: 2.504 ± 0.437
ValThr: 4.25 ± 0.699
ValVal: 3.339 ± 0.483
ValTrp: 1.062 ± 0.262
ValTyr: 2.428 ± 0.666
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.835 ± 0.261
TrpCys: 0.304 ± 0.139
TrpAsp: 0.759 ± 0.247
TrpGlu: 1.138 ± 0.266
TrpPhe: 0.379 ± 0.168
TrpGly: 1.669 ± 0.384
TrpHis: 0.683 ± 0.247
TrpIle: 0.531 ± 0.198
TrpLys: 0.835 ± 0.261
TrpLeu: 2.352 ± 0.453
TrpMet: 0.304 ± 0.113
TrpAsn: 0.455 ± 0.17
TrpPro: 0.379 ± 0.149
TrpGln: 1.062 ± 0.31
TrpArg: 1.138 ± 0.2
TrpSer: 0.531 ± 0.181
TrpThr: 0.759 ± 0.208
TrpVal: 0.759 ± 0.236
TrpTrp: 0.228 ± 0.152
TrpTyr: 0.455 ± 0.18
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.884 ± 0.5
TyrCys: 0.607 ± 0.197
TyrAsp: 2.504 ± 0.491
TyrGlu: 1.821 ± 0.474
TyrPhe: 0.911 ± 0.324
TyrGly: 2.428 ± 0.498
TyrHis: 0.683 ± 0.215
TyrIle: 1.518 ± 0.292
TyrLys: 1.518 ± 0.414
TyrLeu: 2.959 ± 0.43
TyrMet: 0.986 ± 0.274
TyrAsn: 1.821 ± 0.403
TyrPro: 1.138 ± 0.342
TyrGln: 2.201 ± 0.393
TyrArg: 1.897 ± 0.406
TyrSer: 1.745 ± 0.37
TyrThr: 2.428 ± 0.473
TyrVal: 1.214 ± 0.3
TyrTrp: 0.835 ± 0.21
TyrTyr: 0.986 ± 0.298
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 63 proteins (13179 amino acids)

Note: The error was estimated by bootstrapping (100 resamplings) at the protein level.
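A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the estimates is reported. This is only an illustration under stated assumptions. The function names, the use of the sample standard deviation as the error, the fixed seed, and the one-letter sequence encoding are all hypothetical; the source only states that bootstrapping with 100 resamplings was done at the protein level.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (in per mille) over a list of sequences."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the dipeptide frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    # Assumption: the reported error is the sample standard deviation.
    return statistics.stdev(estimates)


# Toy sequences for illustration only, not data from this proteome.
toy = ["MAAKAA", "GAAL", "MKKLV", "AAGG"]
error = bootstrap_error(toy, "AA")
```

Resampling whole proteins rather than individual dipeptides keeps the within-protein composition intact, so the error reflects variation between proteins.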

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski