Amino acid dipepetide frequency for Escherichia phage EcoDS1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.316AlaAla: 9.316 ± 0.999
0.665AlaCys: 0.665 ± 0.218
4.658AlaAsp: 4.658 ± 0.509
5.989AlaGlu: 5.989 ± 0.907
3.327AlaPhe: 3.327 ± 0.451
7.237AlaGly: 7.237 ± 1.087
0.832AlaHis: 0.832 ± 0.256
5.49AlaIle: 5.49 ± 0.736
5.906AlaLys: 5.906 ± 0.628
6.987AlaLeu: 6.987 ± 0.922
2.911AlaMet: 2.911 ± 0.494
3.161AlaAsn: 3.161 ± 0.462
2.662AlaPro: 2.662 ± 0.564
2.995AlaGln: 2.995 ± 0.469
3.826AlaArg: 3.826 ± 0.54
5.074AlaSer: 5.074 ± 0.48
4.492AlaThr: 4.492 ± 0.661
5.656AlaVal: 5.656 ± 0.881
1.497AlaTrp: 1.497 ± 0.385
2.828AlaTyr: 2.828 ± 0.582
0.0AlaXaa: 0.0 ± 0.0
Cys
0.582CysAla: 0.582 ± 0.207
0.0CysCys: 0.0 ± 0.0
0.749CysAsp: 0.749 ± 0.331
0.832CysGlu: 0.832 ± 0.292
0.582CysPhe: 0.582 ± 0.229
0.998CysGly: 0.998 ± 0.279
0.582CysHis: 0.582 ± 0.239
0.333CysIle: 0.333 ± 0.203
0.749CysLys: 0.749 ± 0.295
1.248CysLeu: 1.248 ± 0.334
0.333CysMet: 0.333 ± 0.201
0.25CysAsn: 0.25 ± 0.137
0.416CysPro: 0.416 ± 0.182
0.25CysGln: 0.25 ± 0.162
0.832CysArg: 0.832 ± 0.335
0.665CysSer: 0.665 ± 0.278
0.25CysThr: 0.25 ± 0.134
0.416CysVal: 0.416 ± 0.189
0.25CysTrp: 0.25 ± 0.129
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
6.405AspAla: 6.405 ± 0.709
0.998AspCys: 0.998 ± 0.404
4.242AspAsp: 4.242 ± 0.722
4.076AspGlu: 4.076 ± 0.548
2.329AspPhe: 2.329 ± 0.507
6.571AspGly: 6.571 ± 0.726
1.414AspHis: 1.414 ± 0.285
3.244AspIle: 3.244 ± 0.509
3.66AspLys: 3.66 ± 0.625
5.074AspLeu: 5.074 ± 0.551
1.83AspMet: 1.83 ± 0.339
2.662AspAsn: 2.662 ± 0.579
2.828AspPro: 2.828 ± 0.462
1.58AspGln: 1.58 ± 0.43
2.412AspArg: 2.412 ± 0.447
3.41AspSer: 3.41 ± 0.456
4.076AspThr: 4.076 ± 0.46
4.325AspVal: 4.325 ± 0.583
0.915AspTrp: 0.915 ± 0.295
2.246AspTyr: 2.246 ± 0.342
0.0AspXaa: 0.0 ± 0.0
Glu
6.488GluAla: 6.488 ± 0.994
0.665GluCys: 0.665 ± 0.262
4.824GluAsp: 4.824 ± 0.573
4.908GluGlu: 4.908 ± 0.795
2.745GluPhe: 2.745 ± 0.548
4.908GluGly: 4.908 ± 0.69
0.915GluHis: 0.915 ± 0.28
2.828GluIle: 2.828 ± 0.453
2.911GluLys: 2.911 ± 0.466
5.49GluLeu: 5.49 ± 0.748
2.495GluMet: 2.495 ± 0.461
1.996GluAsn: 1.996 ± 0.535
2.08GluPro: 2.08 ± 0.4
2.911GluGln: 2.911 ± 0.518
4.492GluArg: 4.492 ± 0.666
3.909GluSer: 3.909 ± 0.664
3.41GluThr: 3.41 ± 0.524
4.325GluVal: 4.325 ± 0.584
1.414GluTrp: 1.414 ± 0.315
2.995GluTyr: 2.995 ± 0.526
0.0GluXaa: 0.0 ± 0.0
Phe
2.495PheAla: 2.495 ± 0.338
0.416PheCys: 0.416 ± 0.173
2.246PheAsp: 2.246 ± 0.416
1.996PheGlu: 1.996 ± 0.343
1.081PhePhe: 1.081 ± 0.347
2.662PheGly: 2.662 ± 0.523
0.832PheHis: 0.832 ± 0.184
1.913PheIle: 1.913 ± 0.47
2.828PheLys: 2.828 ± 0.516
2.995PheLeu: 2.995 ± 0.386
1.165PheMet: 1.165 ± 0.399
2.412PheAsn: 2.412 ± 0.381
1.664PhePro: 1.664 ± 0.426
0.749PheGln: 0.749 ± 0.245
1.58PheArg: 1.58 ± 0.319
2.828PheSer: 2.828 ± 0.366
2.08PheThr: 2.08 ± 0.304
2.495PheVal: 2.495 ± 0.432
0.333PheTrp: 0.333 ± 0.126
1.081PheTyr: 1.081 ± 0.257
0.0PheXaa: 0.0 ± 0.0
Gly
5.906GlyAla: 5.906 ± 0.983
0.333GlyCys: 0.333 ± 0.18
4.908GlyAsp: 4.908 ± 0.929
5.24GlyGlu: 5.24 ± 0.646
2.163GlyPhe: 2.163 ± 0.343
5.24GlyGly: 5.24 ± 0.704
0.832GlyHis: 0.832 ± 0.237
4.492GlyIle: 4.492 ± 0.616
6.239GlyLys: 6.239 ± 0.917
6.072GlyLeu: 6.072 ± 0.705
2.412GlyMet: 2.412 ± 0.47
2.995GlyAsn: 2.995 ± 0.726
1.248GlyPro: 1.248 ± 0.295
2.745GlyGln: 2.745 ± 0.402
5.656GlyArg: 5.656 ± 0.624
5.989GlySer: 5.989 ± 0.83
4.741GlyThr: 4.741 ± 0.637
5.49GlyVal: 5.49 ± 0.74
1.248GlyTrp: 1.248 ± 0.278
3.743GlyTyr: 3.743 ± 0.517
0.0GlyXaa: 0.0 ± 0.0
His
0.915HisAla: 0.915 ± 0.388
0.333HisCys: 0.333 ± 0.197
1.331HisAsp: 1.331 ± 0.376
1.248HisGlu: 1.248 ± 0.408
0.499HisPhe: 0.499 ± 0.228
1.081HisGly: 1.081 ± 0.315
0.333HisHis: 0.333 ± 0.17
1.081HisIle: 1.081 ± 0.263
1.248HisLys: 1.248 ± 0.298
1.414HisLeu: 1.414 ± 0.315
0.582HisMet: 0.582 ± 0.223
0.665HisAsn: 0.665 ± 0.219
0.333HisPro: 0.333 ± 0.136
0.499HisGln: 0.499 ± 0.174
1.165HisArg: 1.165 ± 0.311
0.998HisSer: 0.998 ± 0.212
1.248HisThr: 1.248 ± 0.289
1.414HisVal: 1.414 ± 0.312
0.333HisTrp: 0.333 ± 0.171
0.416HisTyr: 0.416 ± 0.169
0.0HisXaa: 0.0 ± 0.0
Ile
3.909IleAla: 3.909 ± 0.718
0.665IleCys: 0.665 ± 0.29
3.327IleAsp: 3.327 ± 0.384
3.161IleGlu: 3.161 ± 0.397
1.414IlePhe: 1.414 ± 0.364
4.076IleGly: 4.076 ± 0.517
1.331IleHis: 1.331 ± 0.395
2.163IleIle: 2.163 ± 0.37
3.909IleLys: 3.909 ± 0.517
3.66IleLeu: 3.66 ± 0.511
0.915IleMet: 0.915 ± 0.241
2.911IleAsn: 2.911 ± 0.684
1.83IlePro: 1.83 ± 0.514
1.996IleGln: 1.996 ± 0.492
3.244IleArg: 3.244 ± 0.564
2.579IleSer: 2.579 ± 0.517
3.41IleThr: 3.41 ± 0.589
4.409IleVal: 4.409 ± 0.438
0.665IleTrp: 0.665 ± 0.202
1.664IleTyr: 1.664 ± 0.265
0.0IleXaa: 0.0 ± 0.0
Lys
6.987LysAla: 6.987 ± 0.721
0.665LysCys: 0.665 ± 0.294
4.159LysAsp: 4.159 ± 0.509
3.826LysGlu: 3.826 ± 0.537
2.495LysPhe: 2.495 ± 0.545
4.159LysGly: 4.159 ± 0.613
1.497LysHis: 1.497 ± 0.428
2.495LysIle: 2.495 ± 0.327
4.076LysLys: 4.076 ± 0.881
5.49LysLeu: 5.49 ± 0.516
1.913LysMet: 1.913 ± 0.345
2.412LysAsn: 2.412 ± 0.472
2.579LysPro: 2.579 ± 0.537
2.163LysGln: 2.163 ± 0.434
4.159LysArg: 4.159 ± 0.628
4.159LysSer: 4.159 ± 0.618
4.325LysThr: 4.325 ± 0.568
5.739LysVal: 5.739 ± 0.845
1.248LysTrp: 1.248 ± 0.337
2.163LysTyr: 2.163 ± 0.393
0.0LysXaa: 0.0 ± 0.0
Leu
6.987LeuAla: 6.987 ± 0.785
0.499LeuCys: 0.499 ± 0.225
4.159LeuAsp: 4.159 ± 0.398
6.488LeuGlu: 6.488 ± 0.657
2.163LeuPhe: 2.163 ± 0.365
5.157LeuGly: 5.157 ± 0.781
0.749LeuHis: 0.749 ± 0.21
3.826LeuIle: 3.826 ± 0.623
6.571LeuLys: 6.571 ± 0.836
4.908LeuLeu: 4.908 ± 0.611
2.579LeuMet: 2.579 ± 0.519
3.993LeuAsn: 3.993 ± 0.649
3.41LeuPro: 3.41 ± 0.385
4.159LeuGln: 4.159 ± 0.678
4.991LeuArg: 4.991 ± 0.451
4.908LeuSer: 4.908 ± 0.623
5.24LeuThr: 5.24 ± 0.708
4.824LeuVal: 4.824 ± 0.591
0.998LeuTrp: 0.998 ± 0.257
2.329LeuTyr: 2.329 ± 0.505
0.0LeuXaa: 0.0 ± 0.0
Met
3.244MetAla: 3.244 ± 0.477
0.416MetCys: 0.416 ± 0.171
1.664MetAsp: 1.664 ± 0.329
1.83MetGlu: 1.83 ± 0.359
1.497MetPhe: 1.497 ± 0.412
1.913MetGly: 1.913 ± 0.442
0.25MetHis: 0.25 ± 0.135
1.331MetIle: 1.331 ± 0.302
1.081MetLys: 1.081 ± 0.256
2.828MetLeu: 2.828 ± 0.422
0.665MetMet: 0.665 ± 0.208
1.248MetAsn: 1.248 ± 0.282
0.998MetPro: 0.998 ± 0.291
0.832MetGln: 0.832 ± 0.34
1.081MetArg: 1.081 ± 0.24
2.495MetSer: 2.495 ± 0.391
1.747MetThr: 1.747 ± 0.344
2.495MetVal: 2.495 ± 0.458
0.333MetTrp: 0.333 ± 0.175
1.165MetTyr: 1.165 ± 0.287
0.0MetXaa: 0.0 ± 0.0
Asn
4.658AsnAla: 4.658 ± 0.605
0.749AsnCys: 0.749 ± 0.251
2.246AsnAsp: 2.246 ± 0.538
2.495AsnGlu: 2.495 ± 0.474
1.414AsnPhe: 1.414 ± 0.267
4.575AsnGly: 4.575 ± 0.685
0.582AsnHis: 0.582 ± 0.192
3.161AsnIle: 3.161 ± 0.74
2.329AsnLys: 2.329 ± 0.409
3.327AsnLeu: 3.327 ± 0.568
0.915AsnMet: 0.915 ± 0.29
2.412AsnAsn: 2.412 ± 0.595
2.995AsnPro: 2.995 ± 0.6
1.497AsnGln: 1.497 ± 0.349
2.246AsnArg: 2.246 ± 0.481
2.995AsnSer: 2.995 ± 1.062
2.246AsnThr: 2.246 ± 0.423
2.412AsnVal: 2.412 ± 0.42
0.083AsnTrp: 0.083 ± 0.079
1.747AsnTyr: 1.747 ± 0.387
0.0AsnXaa: 0.0 ± 0.0
Pro
2.745ProAla: 2.745 ± 0.518
0.416ProCys: 0.416 ± 0.218
2.495ProAsp: 2.495 ± 0.343
2.995ProGlu: 2.995 ± 0.608
1.081ProPhe: 1.081 ± 0.227
1.664ProGly: 1.664 ± 0.291
0.665ProHis: 0.665 ± 0.191
1.83ProIle: 1.83 ± 0.355
3.078ProLys: 3.078 ± 0.557
2.163ProLeu: 2.163 ± 0.421
0.998ProMet: 0.998 ± 0.238
2.412ProAsn: 2.412 ± 0.435
0.915ProPro: 0.915 ± 0.241
1.58ProGln: 1.58 ± 0.283
1.664ProArg: 1.664 ± 0.419
2.828ProSer: 2.828 ± 0.401
3.078ProThr: 3.078 ± 0.471
3.078ProVal: 3.078 ± 0.358
0.749ProTrp: 0.749 ± 0.222
0.915ProTyr: 0.915 ± 0.207
0.0ProXaa: 0.0 ± 0.0
Gln
3.66GlnAla: 3.66 ± 0.513
0.166GlnCys: 0.166 ± 0.1
3.078GlnAsp: 3.078 ± 0.72
2.08GlnGlu: 2.08 ± 0.385
1.664GlnPhe: 1.664 ± 0.275
2.579GlnGly: 2.579 ± 0.588
0.665GlnHis: 0.665 ± 0.245
1.248GlnIle: 1.248 ± 0.271
1.747GlnLys: 1.747 ± 0.343
3.909GlnLeu: 3.909 ± 0.639
1.331GlnMet: 1.331 ± 0.363
1.331GlnAsn: 1.331 ± 0.416
1.248GlnPro: 1.248 ± 0.344
2.08GlnGln: 2.08 ± 0.573
2.246GlnArg: 2.246 ± 0.687
2.911GlnSer: 2.911 ± 0.466
2.08GlnThr: 2.08 ± 0.447
2.246GlnVal: 2.246 ± 0.289
0.915GlnTrp: 0.915 ± 0.299
1.165GlnTyr: 1.165 ± 0.359
0.0GlnXaa: 0.0 ± 0.0
Arg
4.076ArgAla: 4.076 ± 0.702
0.499ArgCys: 0.499 ± 0.157
4.409ArgAsp: 4.409 ± 0.399
4.242ArgGlu: 4.242 ± 0.57
2.579ArgPhe: 2.579 ± 0.398
4.076ArgGly: 4.076 ± 0.448
0.832ArgHis: 0.832 ± 0.272
3.66ArgIle: 3.66 ± 0.649
3.161ArgLys: 3.161 ± 0.59
5.739ArgLeu: 5.739 ± 0.653
1.081ArgMet: 1.081 ± 0.282
2.495ArgAsn: 2.495 ± 0.454
1.664ArgPro: 1.664 ± 0.359
2.329ArgGln: 2.329 ± 0.399
2.246ArgArg: 2.246 ± 0.409
3.41ArgSer: 3.41 ± 0.638
2.495ArgThr: 2.495 ± 0.354
2.662ArgVal: 2.662 ± 0.491
1.165ArgTrp: 1.165 ± 0.268
1.497ArgTyr: 1.497 ± 0.317
0.0ArgXaa: 0.0 ± 0.0
Ser
4.076SerAla: 4.076 ± 0.647
0.915SerCys: 0.915 ± 0.304
5.49SerAsp: 5.49 ± 0.462
2.911SerGlu: 2.911 ± 0.502
2.412SerPhe: 2.412 ± 0.373
5.656SerGly: 5.656 ± 0.775
1.913SerHis: 1.913 ± 0.358
3.327SerIle: 3.327 ± 0.637
4.159SerLys: 4.159 ± 0.486
3.909SerLeu: 3.909 ± 0.551
1.58SerMet: 1.58 ± 0.32
2.995SerAsn: 2.995 ± 0.618
2.911SerPro: 2.911 ± 0.463
2.163SerGln: 2.163 ± 0.425
2.828SerArg: 2.828 ± 0.477
4.242SerSer: 4.242 ± 0.52
3.327SerThr: 3.327 ± 0.662
4.741SerVal: 4.741 ± 0.729
0.665SerTrp: 0.665 ± 0.211
2.662SerTyr: 2.662 ± 0.558
0.0SerXaa: 0.0 ± 0.0
Thr
3.743ThrAla: 3.743 ± 0.661
0.665ThrCys: 0.665 ± 0.252
3.577ThrAsp: 3.577 ± 0.568
4.908ThrGlu: 4.908 ± 0.723
2.579ThrPhe: 2.579 ± 0.411
5.906ThrGly: 5.906 ± 0.661
0.582ThrHis: 0.582 ± 0.21
3.66ThrIle: 3.66 ± 0.584
3.494ThrLys: 3.494 ± 0.497
4.409ThrLeu: 4.409 ± 0.374
1.747ThrMet: 1.747 ± 0.366
2.329ThrAsn: 2.329 ± 0.677
3.244ThrPro: 3.244 ± 0.412
3.078ThrGln: 3.078 ± 0.64
2.579ThrArg: 2.579 ± 0.437
2.412ThrSer: 2.412 ± 0.438
4.076ThrThr: 4.076 ± 0.76
4.658ThrVal: 4.658 ± 0.527
0.416ThrTrp: 0.416 ± 0.123
1.248ThrTyr: 1.248 ± 0.303
0.0ThrXaa: 0.0 ± 0.0
Val
5.324ValAla: 5.324 ± 0.536
0.582ValCys: 0.582 ± 0.212
3.577ValAsp: 3.577 ± 0.703
4.824ValGlu: 4.824 ± 0.847
2.246ValPhe: 2.246 ± 0.477
5.573ValGly: 5.573 ± 0.628
1.165ValHis: 1.165 ± 0.479
3.078ValIle: 3.078 ± 0.551
5.656ValLys: 5.656 ± 0.597
5.074ValLeu: 5.074 ± 0.682
2.163ValMet: 2.163 ± 0.474
3.41ValAsn: 3.41 ± 0.53
2.662ValPro: 2.662 ± 0.482
2.828ValGln: 2.828 ± 0.488
4.242ValArg: 4.242 ± 0.558
4.492ValSer: 4.492 ± 0.471
4.575ValThr: 4.575 ± 0.671
5.407ValVal: 5.407 ± 0.88
0.749ValTrp: 0.749 ± 0.295
2.579ValTyr: 2.579 ± 0.424
0.0ValXaa: 0.0 ± 0.0
Trp
0.499TrpAla: 0.499 ± 0.134
0.333TrpCys: 0.333 ± 0.168
0.749TrpAsp: 0.749 ± 0.232
1.081TrpGlu: 1.081 ± 0.243
0.665TrpPhe: 0.665 ± 0.202
0.915TrpGly: 0.915 ± 0.26
0.416TrpHis: 0.416 ± 0.17
0.416TrpIle: 0.416 ± 0.192
1.83TrpLys: 1.83 ± 0.429
1.996TrpLeu: 1.996 ± 0.403
0.166TrpMet: 0.166 ± 0.106
0.832TrpAsn: 0.832 ± 0.242
0.333TrpPro: 0.333 ± 0.148
0.499TrpGln: 0.499 ± 0.214
0.665TrpArg: 0.665 ± 0.206
0.915TrpSer: 0.915 ± 0.416
0.416TrpThr: 0.416 ± 0.18
1.165TrpVal: 1.165 ± 0.292
0.166TrpTrp: 0.166 ± 0.119
0.582TrpTyr: 0.582 ± 0.217
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.494TyrAla: 3.494 ± 0.564
0.333TyrCys: 0.333 ± 0.154
2.412TyrAsp: 2.412 ± 0.405
1.58TyrGlu: 1.58 ± 0.429
0.915TyrPhe: 0.915 ± 0.243
2.745TyrGly: 2.745 ± 0.483
0.749TyrHis: 0.749 ± 0.33
1.664TyrIle: 1.664 ± 0.501
2.08TyrLys: 2.08 ± 0.42
2.246TyrLeu: 2.246 ± 0.328
1.248TyrMet: 1.248 ± 0.302
2.246TyrAsn: 2.246 ± 0.525
1.331TyrPro: 1.331 ± 0.361
1.414TyrGln: 1.414 ± 0.475
2.163TyrArg: 2.163 ± 0.425
1.664TyrSer: 1.664 ± 0.311
1.996TyrThr: 1.996 ± 0.306
2.246TyrVal: 2.246 ± 0.394
0.499TyrTrp: 0.499 ± 0.216
1.331TyrTyr: 1.331 ± 0.349
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 53 proteins (12023 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski