Amino acid dipepetide frequency for Escherichia phage 13a

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.588AlaAla: 8.588 ± 1.218
0.875AlaCys: 0.875 ± 0.284
5.408AlaAsp: 5.408 ± 0.645
5.328AlaGlu: 5.328 ± 0.578
2.386AlaPhe: 2.386 ± 0.431
7.237AlaGly: 7.237 ± 0.93
1.59AlaHis: 1.59 ± 0.309
5.01AlaIle: 5.01 ± 0.69
6.839AlaLys: 6.839 ± 0.649
8.27AlaLeu: 8.27 ± 0.897
3.658AlaMet: 3.658 ± 0.579
4.294AlaAsn: 4.294 ± 0.524
2.227AlaPro: 2.227 ± 0.36
3.181AlaGln: 3.181 ± 0.678
5.328AlaArg: 5.328 ± 0.589
4.851AlaSer: 4.851 ± 0.653
3.34AlaThr: 3.34 ± 0.526
5.169AlaVal: 5.169 ± 0.708
1.431AlaTrp: 1.431 ± 0.335
2.386AlaTyr: 2.386 ± 0.442
0.0AlaXaa: 0.0 ± 0.0
Cys
1.113CysAla: 1.113 ± 0.308
0.318CysCys: 0.318 ± 0.178
1.113CysAsp: 1.113 ± 0.381
0.239CysGlu: 0.239 ± 0.132
0.636CysPhe: 0.636 ± 0.242
0.716CysGly: 0.716 ± 0.269
0.795CysHis: 0.795 ± 0.336
0.795CysIle: 0.795 ± 0.312
0.875CysLys: 0.875 ± 0.33
0.795CysLeu: 0.795 ± 0.256
0.159CysMet: 0.159 ± 0.101
0.318CysAsn: 0.318 ± 0.18
0.398CysPro: 0.398 ± 0.161
0.477CysGln: 0.477 ± 0.174
0.477CysArg: 0.477 ± 0.209
1.034CysSer: 1.034 ± 0.344
0.477CysThr: 0.477 ± 0.18
1.272CysVal: 1.272 ± 0.304
0.398CysTrp: 0.398 ± 0.174
0.318CysTyr: 0.318 ± 0.172
0.0CysXaa: 0.0 ± 0.0
Asp
4.294AspAla: 4.294 ± 0.561
0.954AspCys: 0.954 ± 0.282
4.294AspAsp: 4.294 ± 0.634
4.533AspGlu: 4.533 ± 0.482
2.942AspPhe: 2.942 ± 0.374
5.249AspGly: 5.249 ± 0.586
0.795AspHis: 0.795 ± 0.229
3.101AspIle: 3.101 ± 0.47
4.374AspLys: 4.374 ± 0.649
4.135AspLeu: 4.135 ± 0.739
2.386AspMet: 2.386 ± 0.552
3.022AspAsn: 3.022 ± 0.389
2.704AspPro: 2.704 ± 0.562
1.988AspGln: 1.988 ± 0.474
2.783AspArg: 2.783 ± 0.55
4.056AspSer: 4.056 ± 0.63
4.135AspThr: 4.135 ± 0.488
4.851AspVal: 4.851 ± 0.495
1.034AspTrp: 1.034 ± 0.339
1.59AspTyr: 1.59 ± 0.332
0.0AspXaa: 0.0 ± 0.0
Glu
6.998GluAla: 6.998 ± 0.801
0.477GluCys: 0.477 ± 0.185
3.976GluAsp: 3.976 ± 0.602
6.6GluGlu: 6.6 ± 1.041
1.75GluPhe: 1.75 ± 0.4
5.567GluGly: 5.567 ± 0.65
1.909GluHis: 1.909 ± 0.415
3.419GluIle: 3.419 ± 0.512
3.419GluLys: 3.419 ± 0.568
5.885GluLeu: 5.885 ± 0.634
2.386GluMet: 2.386 ± 0.478
2.704GluAsn: 2.704 ± 0.38
1.829GluPro: 1.829 ± 0.414
3.738GluGln: 3.738 ± 0.719
3.499GluArg: 3.499 ± 0.428
4.851GluSer: 4.851 ± 0.592
3.579GluThr: 3.579 ± 0.526
3.897GluVal: 3.897 ± 0.566
1.352GluTrp: 1.352 ± 0.396
3.419GluTyr: 3.419 ± 0.549
0.0GluXaa: 0.0 ± 0.0
Phe
1.988PheAla: 1.988 ± 0.352
0.318PheCys: 0.318 ± 0.177
3.181PheAsp: 3.181 ± 0.485
2.227PheGlu: 2.227 ± 0.435
0.875PhePhe: 0.875 ± 0.228
3.658PheGly: 3.658 ± 0.398
0.875PheHis: 0.875 ± 0.322
1.431PheIle: 1.431 ± 0.365
2.386PheLys: 2.386 ± 0.379
3.101PheLeu: 3.101 ± 0.436
1.511PheMet: 1.511 ± 0.251
1.511PheAsn: 1.511 ± 0.377
1.352PhePro: 1.352 ± 0.367
1.034PheGln: 1.034 ± 0.277
2.147PheArg: 2.147 ± 0.416
1.909PheSer: 1.909 ± 0.417
2.386PheThr: 2.386 ± 0.359
1.67PheVal: 1.67 ± 0.326
0.239PheTrp: 0.239 ± 0.123
0.875PheTyr: 0.875 ± 0.257
0.0PheXaa: 0.0 ± 0.0
Gly
6.203GlyAla: 6.203 ± 0.878
1.511GlyCys: 1.511 ± 0.455
4.215GlyAsp: 4.215 ± 0.475
4.851GlyGlu: 4.851 ± 0.674
3.26GlyPhe: 3.26 ± 0.594
4.93GlyGly: 4.93 ± 0.724
1.431GlyHis: 1.431 ± 0.373
4.771GlyIle: 4.771 ± 0.537
6.203GlyLys: 6.203 ± 0.849
5.885GlyLeu: 5.885 ± 0.731
1.988GlyMet: 1.988 ± 0.396
2.704GlyAsn: 2.704 ± 0.552
0.954GlyPro: 0.954 ± 0.311
3.26GlyGln: 3.26 ± 0.485
4.533GlyArg: 4.533 ± 0.506
4.612GlySer: 4.612 ± 0.599
3.897GlyThr: 3.897 ± 0.439
4.533GlyVal: 4.533 ± 0.5
1.511GlyTrp: 1.511 ± 0.416
3.579GlyTyr: 3.579 ± 0.594
0.0GlyXaa: 0.0 ± 0.0
His
1.272HisAla: 1.272 ± 0.355
0.398HisCys: 0.398 ± 0.153
1.352HisAsp: 1.352 ± 0.292
1.829HisGlu: 1.829 ± 0.43
0.795HisPhe: 0.795 ± 0.2
1.511HisGly: 1.511 ± 0.266
0.716HisHis: 0.716 ± 0.207
1.67HisIle: 1.67 ± 0.402
1.67HisLys: 1.67 ± 0.377
2.863HisLeu: 2.863 ± 0.613
0.398HisMet: 0.398 ± 0.14
0.557HisAsn: 0.557 ± 0.179
0.159HisPro: 0.159 ± 0.105
0.159HisGln: 0.159 ± 0.116
0.954HisArg: 0.954 ± 0.291
1.193HisSer: 1.193 ± 0.257
0.636HisThr: 0.636 ± 0.221
1.272HisVal: 1.272 ± 0.342
0.318HisTrp: 0.318 ± 0.165
1.113HisTyr: 1.113 ± 0.271
0.0HisXaa: 0.0 ± 0.0
Ile
3.817IleAla: 3.817 ± 0.486
0.716IleCys: 0.716 ± 0.235
3.499IleAsp: 3.499 ± 0.472
3.579IleGlu: 3.579 ± 0.589
0.875IlePhe: 0.875 ± 0.206
3.897IleGly: 3.897 ± 0.366
1.67IleHis: 1.67 ± 0.517
3.499IleIle: 3.499 ± 0.597
3.579IleLys: 3.579 ± 0.478
4.215IleLeu: 4.215 ± 0.54
1.272IleMet: 1.272 ± 0.313
2.863IleAsn: 2.863 ± 0.666
2.704IlePro: 2.704 ± 0.356
1.59IleGln: 1.59 ± 0.389
3.976IleArg: 3.976 ± 0.58
2.704IleSer: 2.704 ± 0.362
2.704IleThr: 2.704 ± 0.352
3.34IleVal: 3.34 ± 0.405
0.636IleTrp: 0.636 ± 0.212
1.909IleTyr: 1.909 ± 0.282
0.0IleXaa: 0.0 ± 0.0
Lys
7.157LysAla: 7.157 ± 0.855
0.795LysCys: 0.795 ± 0.303
3.579LysAsp: 3.579 ± 0.433
4.533LysGlu: 4.533 ± 0.711
2.306LysPhe: 2.306 ± 0.33
4.612LysGly: 4.612 ± 0.565
1.67LysHis: 1.67 ± 0.394
2.465LysIle: 2.465 ± 0.492
4.612LysLys: 4.612 ± 0.726
5.567LysLeu: 5.567 ± 0.824
2.386LysMet: 2.386 ± 0.41
2.545LysAsn: 2.545 ± 0.317
2.624LysPro: 2.624 ± 0.563
2.624LysGln: 2.624 ± 0.542
3.738LysArg: 3.738 ± 0.573
3.897LysSer: 3.897 ± 0.577
3.579LysThr: 3.579 ± 0.484
4.135LysVal: 4.135 ± 0.442
0.875LysTrp: 0.875 ± 0.252
2.624LysTyr: 2.624 ± 0.347
0.0LysXaa: 0.0 ± 0.0
Leu
8.907LeuAla: 8.907 ± 1.03
0.875LeuCys: 0.875 ± 0.365
4.056LeuAsp: 4.056 ± 0.509
6.521LeuGlu: 6.521 ± 0.735
2.386LeuPhe: 2.386 ± 0.42
4.294LeuGly: 4.294 ± 0.612
1.511LeuHis: 1.511 ± 0.27
3.897LeuIle: 3.897 ± 0.553
5.249LeuLys: 5.249 ± 0.519
5.646LeuLeu: 5.646 ± 0.799
2.783LeuMet: 2.783 ± 0.437
3.579LeuAsn: 3.579 ± 0.413
3.26LeuPro: 3.26 ± 0.442
4.135LeuGln: 4.135 ± 0.637
6.521LeuArg: 6.521 ± 0.685
5.567LeuSer: 5.567 ± 0.713
4.692LeuThr: 4.692 ± 0.595
4.215LeuVal: 4.215 ± 0.504
1.193LeuTrp: 1.193 ± 0.345
2.545LeuTyr: 2.545 ± 0.531
0.0LeuXaa: 0.0 ± 0.0
Met
3.419MetAla: 3.419 ± 0.503
0.318MetCys: 0.318 ± 0.161
2.147MetAsp: 2.147 ± 0.422
2.147MetGlu: 2.147 ± 0.481
1.034MetPhe: 1.034 ± 0.354
2.465MetGly: 2.465 ± 0.419
0.318MetHis: 0.318 ± 0.154
1.59MetIle: 1.59 ± 0.295
0.875MetLys: 0.875 ± 0.207
3.101MetLeu: 3.101 ± 0.496
0.875MetMet: 0.875 ± 0.265
1.193MetAsn: 1.193 ± 0.316
1.431MetPro: 1.431 ± 0.329
1.113MetGln: 1.113 ± 0.299
1.75MetArg: 1.75 ± 0.305
1.829MetSer: 1.829 ± 0.414
1.988MetThr: 1.988 ± 0.423
2.386MetVal: 2.386 ± 0.432
0.08MetTrp: 0.08 ± 0.089
1.193MetTyr: 1.193 ± 0.336
0.0MetXaa: 0.0 ± 0.0
Asn
3.101AsnAla: 3.101 ± 0.312
0.716AsnCys: 0.716 ± 0.186
2.863AsnAsp: 2.863 ± 0.469
2.306AsnGlu: 2.306 ± 0.423
1.75AsnPhe: 1.75 ± 0.349
5.408AsnGly: 5.408 ± 0.656
0.557AsnHis: 0.557 ± 0.214
2.704AsnIle: 2.704 ± 0.531
2.227AsnLys: 2.227 ± 0.389
3.34AsnLeu: 3.34 ± 0.612
1.272AsnMet: 1.272 ± 0.272
1.511AsnAsn: 1.511 ± 0.425
2.624AsnPro: 2.624 ± 0.371
1.75AsnGln: 1.75 ± 0.287
2.227AsnArg: 2.227 ± 0.481
2.704AsnSer: 2.704 ± 0.743
2.624AsnThr: 2.624 ± 0.705
2.704AsnVal: 2.704 ± 0.456
0.477AsnTrp: 0.477 ± 0.171
1.75AsnTyr: 1.75 ± 0.381
0.0AsnXaa: 0.0 ± 0.0
Pro
2.783ProAla: 2.783 ± 0.361
0.557ProCys: 0.557 ± 0.232
2.783ProAsp: 2.783 ± 0.37
3.738ProGlu: 3.738 ± 0.682
1.431ProPhe: 1.431 ± 0.287
0.716ProGly: 0.716 ± 0.295
0.636ProHis: 0.636 ± 0.188
1.352ProIle: 1.352 ± 0.313
2.386ProLys: 2.386 ± 0.438
2.465ProLeu: 2.465 ± 0.388
1.193ProMet: 1.193 ± 0.296
2.386ProAsn: 2.386 ± 0.533
0.636ProPro: 0.636 ± 0.209
0.954ProGln: 0.954 ± 0.281
1.59ProArg: 1.59 ± 0.373
2.465ProSer: 2.465 ± 0.455
1.829ProThr: 1.829 ± 0.38
1.67ProVal: 1.67 ± 0.327
0.795ProTrp: 0.795 ± 0.172
1.193ProTyr: 1.193 ± 0.294
0.0ProXaa: 0.0 ± 0.0
Gln
3.658GlnAla: 3.658 ± 0.826
0.239GlnCys: 0.239 ± 0.141
1.829GlnAsp: 1.829 ± 0.384
2.783GlnGlu: 2.783 ± 0.563
2.227GlnPhe: 2.227 ± 0.325
1.909GlnGly: 1.909 ± 0.306
0.159GlnHis: 0.159 ± 0.141
1.67GlnIle: 1.67 ± 0.366
2.227GlnLys: 2.227 ± 0.439
3.976GlnLeu: 3.976 ± 0.555
1.352GlnMet: 1.352 ± 0.297
1.352GlnAsn: 1.352 ± 0.362
1.113GlnPro: 1.113 ± 0.374
2.068GlnGln: 2.068 ± 0.282
2.068GlnArg: 2.068 ± 0.56
1.988GlnSer: 1.988 ± 0.455
1.75GlnThr: 1.75 ± 0.417
2.704GlnVal: 2.704 ± 0.522
1.113GlnTrp: 1.113 ± 0.299
1.113GlnTyr: 1.113 ± 0.34
0.0GlnXaa: 0.0 ± 0.0
Arg
4.93ArgAla: 4.93 ± 0.605
0.875ArgCys: 0.875 ± 0.248
3.897ArgAsp: 3.897 ± 0.705
5.169ArgGlu: 5.169 ± 0.669
2.227ArgPhe: 2.227 ± 0.36
3.897ArgGly: 3.897 ± 0.521
0.875ArgHis: 0.875 ± 0.277
2.863ArgIle: 2.863 ± 0.374
3.897ArgLys: 3.897 ± 0.539
5.01ArgLeu: 5.01 ± 0.63
1.511ArgMet: 1.511 ± 0.334
2.783ArgAsn: 2.783 ± 0.584
1.988ArgPro: 1.988 ± 0.282
1.67ArgGln: 1.67 ± 0.477
2.465ArgArg: 2.465 ± 0.468
3.579ArgSer: 3.579 ± 0.532
2.783ArgThr: 2.783 ± 0.445
3.658ArgVal: 3.658 ± 0.54
0.954ArgTrp: 0.954 ± 0.302
1.829ArgTyr: 1.829 ± 0.424
0.0ArgXaa: 0.0 ± 0.0
Ser
5.646SerAla: 5.646 ± 0.809
1.034SerCys: 1.034 ± 0.322
5.249SerAsp: 5.249 ± 0.708
3.34SerGlu: 3.34 ± 0.472
2.386SerPhe: 2.386 ± 0.466
5.726SerGly: 5.726 ± 0.957
1.75SerHis: 1.75 ± 0.381
3.181SerIle: 3.181 ± 0.559
3.817SerLys: 3.817 ± 0.634
3.976SerLeu: 3.976 ± 0.439
1.431SerMet: 1.431 ± 0.425
2.624SerAsn: 2.624 ± 0.417
1.67SerPro: 1.67 ± 0.351
1.829SerGln: 1.829 ± 0.351
3.26SerArg: 3.26 ± 0.576
3.738SerSer: 3.738 ± 0.641
3.181SerThr: 3.181 ± 0.461
4.374SerVal: 4.374 ± 0.484
0.875SerTrp: 0.875 ± 0.216
1.909SerTyr: 1.909 ± 0.377
0.0SerXaa: 0.0 ± 0.0
Thr
4.215ThrAla: 4.215 ± 0.507
0.716ThrCys: 0.716 ± 0.314
3.579ThrAsp: 3.579 ± 0.452
3.26ThrGlu: 3.26 ± 0.54
1.829ThrPhe: 1.829 ± 0.34
5.885ThrGly: 5.885 ± 0.651
1.113ThrHis: 1.113 ± 0.246
3.34ThrIle: 3.34 ± 0.554
4.93ThrLys: 4.93 ± 0.555
4.453ThrLeu: 4.453 ± 0.566
1.193ThrMet: 1.193 ± 0.333
1.909ThrAsn: 1.909 ± 0.497
2.465ThrPro: 2.465 ± 0.452
2.147ThrGln: 2.147 ± 0.465
2.545ThrArg: 2.545 ± 0.437
3.419ThrSer: 3.419 ± 0.459
3.181ThrThr: 3.181 ± 0.545
3.419ThrVal: 3.419 ± 0.549
0.477ThrTrp: 0.477 ± 0.176
1.431ThrTyr: 1.431 ± 0.337
0.0ThrXaa: 0.0 ± 0.0
Val
5.408ValAla: 5.408 ± 0.548
0.398ValCys: 0.398 ± 0.187
3.499ValAsp: 3.499 ± 0.445
4.692ValGlu: 4.692 ± 0.726
2.068ValPhe: 2.068 ± 0.486
3.817ValGly: 3.817 ± 0.509
1.431ValHis: 1.431 ± 0.349
3.419ValIle: 3.419 ± 0.562
3.738ValLys: 3.738 ± 0.674
4.692ValLeu: 4.692 ± 0.591
1.988ValMet: 1.988 ± 0.389
3.181ValAsn: 3.181 ± 0.645
2.386ValPro: 2.386 ± 0.436
1.988ValGln: 1.988 ± 0.402
4.056ValArg: 4.056 ± 0.547
3.658ValSer: 3.658 ± 0.527
5.328ValThr: 5.328 ± 0.535
4.851ValVal: 4.851 ± 0.712
1.193ValTrp: 1.193 ± 0.288
2.147ValTyr: 2.147 ± 0.488
0.0ValXaa: 0.0 ± 0.0
Trp
0.875TrpAla: 0.875 ± 0.278
0.318TrpCys: 0.318 ± 0.151
0.398TrpAsp: 0.398 ± 0.149
1.193TrpGlu: 1.193 ± 0.303
0.477TrpPhe: 0.477 ± 0.197
0.875TrpGly: 0.875 ± 0.279
0.398TrpHis: 0.398 ± 0.214
0.954TrpIle: 0.954 ± 0.307
1.59TrpLys: 1.59 ± 0.373
1.67TrpLeu: 1.67 ± 0.367
0.318TrpMet: 0.318 ± 0.137
1.511TrpAsn: 1.511 ± 0.376
0.159TrpPro: 0.159 ± 0.098
0.398TrpGln: 0.398 ± 0.143
0.795TrpArg: 0.795 ± 0.261
1.034TrpSer: 1.034 ± 0.401
1.352TrpThr: 1.352 ± 0.277
1.193TrpVal: 1.193 ± 0.449
0.398TrpTrp: 0.398 ± 0.155
0.159TrpTyr: 0.159 ± 0.1
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.022TyrAla: 3.022 ± 0.519
0.318TyrCys: 0.318 ± 0.179
2.386TyrAsp: 2.386 ± 0.425
2.386TyrGlu: 2.386 ± 0.405
1.034TyrPhe: 1.034 ± 0.288
2.465TyrGly: 2.465 ± 0.403
0.636TyrHis: 0.636 ± 0.247
1.909TyrIle: 1.909 ± 0.414
1.431TyrLys: 1.431 ± 0.304
2.783TyrLeu: 2.783 ± 0.466
1.113TyrMet: 1.113 ± 0.312
2.068TyrAsn: 2.068 ± 0.367
0.875TyrPro: 0.875 ± 0.271
1.193TyrGln: 1.193 ± 0.398
2.147TyrArg: 2.147 ± 0.435
1.909TyrSer: 1.909 ± 0.441
2.068TyrThr: 2.068 ± 0.412
2.624TyrVal: 2.624 ± 0.533
0.716TyrTrp: 0.716 ± 0.246
0.716TyrTyr: 0.716 ± 0.327
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 55 proteins (12576 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski