Amino acid dipeptide frequency for Escherichia phage DE3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
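The counting described above can be sketched in Python. This is only an illustrative sketch under stated assumptions, not the database's own pipeline: the function names are hypothetical, dipeptides are taken as overlapping adjacent residue pairs within each protein, any non-standard letter is folded into Xaa, and counts are normalized by the total number of dipeptides. The normalization used for the table may differ.

```python
from collections import Counter
from itertools import product

# One-letter to three-letter codes, in the order used by the table header.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe", "G": "Gly",
    "H": "His", "I": "Ile", "K": "Lys", "L": "Leu", "M": "Met", "N": "Asn",
    "P": "Pro", "Q": "Gln", "R": "Arg", "S": "Ser", "T": "Thr", "V": "Val",
    "W": "Trp", "Y": "Tyr", "X": "Xaa",
}
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_counts(proteins):
    """Count overlapping adjacent residue pairs within each protein."""
    counts = Counter()
    for seq in proteins:
        # Assumption: every letter outside the 20 standard ones becomes X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    return counts


def dipeptide_per_mille(proteins):
    """Return {"AlaCys": value, ...} for all 441 pairs, in per mille."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    return {
        ONE_TO_THREE[a] + ONE_TO_THREE[b]:
            (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(STANDARD + "X", repeat=2)
    }


# Uniform expectation over the 400 standard dipeptides: 0.25% = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400

toy = ["MKAAL", "AAGZ"]  # hypothetical sequences, not taken from phage DE3
freqs = dipeptide_per_mille(toy)
# Dipeptides above the uniform expectation (shown in red on the original page).
over = sorted(k for k, v in freqs.items() if v > UNIFORM_PER_MILLE)
```

Comparing each value against `UNIFORM_PER_MILLE` mirrors the red/blue marking described in the text; a comparison against expectations derived from single amino acid frequencies would be a different, stricter test.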

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
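As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and a percentage, and compares it with the uniform expectation of 1/400. The helper names are hypothetical and chosen only for illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1 per mille = 0.1%)."""
    return value * 0.1


def fold_over_uniform(value, n_dipeptides=400):
    """Ratio of a per mille value to the uniform expectation 1000 / n_dipeptides."""
    return value / (1000.0 / n_dipeptides)


ala_ala = 13.803  # AlaAla value from the table, in per mille
fraction = per_mille_to_fraction(ala_ala)   # about 0.013803
percent = per_mille_to_percent(ala_ala)     # about 1.3803 %
fold = fold_over_uniform(ala_ala)           # about 5.52 times the uniform 2.5 per mille
```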

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 13.803 ± 2.26
AlaCys: 0.898 ± 0.302
AlaAsp: 5.88 ± 0.634
AlaGlu: 7.024 ± 0.931
AlaPhe: 3.757 ± 0.532
AlaGly: 8.576 ± 0.919
AlaHis: 1.388 ± 0.295
AlaIle: 5.39 ± 0.592
AlaLys: 5.472 ± 0.775
AlaLeu: 8.494 ± 0.914
AlaMet: 3.185 ± 0.515
AlaAsn: 2.777 ± 0.534
AlaPro: 2.205 ± 0.402
AlaGln: 4.492 ± 0.897
AlaArg: 6.861 ± 1.01
AlaSer: 7.841 ± 1.236
AlaThr: 5.064 ± 0.997
AlaVal: 7.432 ± 0.912
AlaTrp: 1.878 ± 0.482
AlaTyr: 3.349 ± 0.51
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.143 ± 0.345
CysCys: 0.245 ± 0.196
CysAsp: 0.408 ± 0.175
CysGlu: 0.408 ± 0.204
CysPhe: 0.327 ± 0.172
CysGly: 0.735 ± 0.286
CysHis: 0.408 ± 0.221
CysIle: 0.735 ± 0.232
CysLys: 0.653 ± 0.239
CysLeu: 0.572 ± 0.196
CysMet: 0.163 ± 0.123
CysAsn: 0.408 ± 0.189
CysPro: 0.408 ± 0.158
CysGln: 0.245 ± 0.156
CysArg: 1.062 ± 0.409
CysSer: 1.225 ± 0.347
CysThr: 0.98 ± 0.313
CysVal: 0.653 ± 0.202
CysTrp: 0.245 ± 0.142
CysTyr: 0.49 ± 0.199
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.635 ± 0.928
AspCys: 0.735 ± 0.252
AspAsp: 3.757 ± 0.567
AspGlu: 4.084 ± 0.731
AspPhe: 1.797 ± 0.282
AspGly: 5.472 ± 0.759
AspHis: 0.735 ± 0.212
AspIle: 3.594 ± 0.416
AspLys: 2.614 ± 0.454
AspLeu: 3.43 ± 0.795
AspMet: 1.96 ± 0.427
AspAsn: 2.287 ± 0.413
AspPro: 2.369 ± 0.697
AspGln: 1.552 ± 0.433
AspArg: 2.287 ± 0.492
AspSer: 3.594 ± 0.57
AspThr: 3.594 ± 0.617
AspVal: 3.839 ± 0.591
AspTrp: 1.552 ± 0.438
AspTyr: 1.96 ± 0.474
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.024 ± 1.006
GluCys: 0.653 ± 0.273
GluAsp: 3.512 ± 0.497
GluGlu: 4.165 ± 0.572
GluPhe: 2.205 ± 0.498
GluGly: 3.267 ± 0.58
GluHis: 1.307 ± 0.397
GluIle: 3.839 ± 0.525
GluLys: 3.104 ± 0.414
GluLeu: 6.044 ± 0.731
GluMet: 1.633 ± 0.38
GluAsn: 3.594 ± 0.577
GluPro: 1.633 ± 0.391
GluGln: 5.064 ± 0.719
GluArg: 4.084 ± 0.559
GluSer: 4.329 ± 0.543
GluThr: 4.002 ± 0.702
GluVal: 3.757 ± 0.612
GluTrp: 0.817 ± 0.241
GluTyr: 1.878 ± 0.313
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.532 ± 0.536
PheCys: 0.817 ± 0.242
PheAsp: 2.777 ± 0.487
PheGlu: 1.715 ± 0.405
PhePhe: 1.143 ± 0.368
PheGly: 2.287 ± 0.301
PheHis: 1.143 ± 0.31
PheIle: 1.388 ± 0.375
PheLys: 1.47 ± 0.32
PheLeu: 2.45 ± 0.452
PheMet: 0.735 ± 0.264
PheAsn: 0.898 ± 0.228
PhePro: 1.47 ± 0.338
PheGln: 0.898 ± 0.245
PheArg: 3.185 ± 0.414
PheSer: 2.695 ± 0.765
PheThr: 2.695 ± 0.439
PheVal: 2.205 ± 0.37
PheTrp: 0.49 ± 0.161
PheTyr: 0.817 ± 0.21
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.799 ± 0.875
GlyCys: 0.817 ± 0.212
GlyAsp: 3.675 ± 0.495
GlyGlu: 4.492 ± 0.609
GlyPhe: 2.205 ± 0.402
GlyGly: 5.309 ± 0.91
GlyHis: 0.653 ± 0.211
GlyIle: 3.92 ± 0.57
GlyLys: 4.165 ± 0.632
GlyLeu: 5.227 ± 0.615
GlyMet: 2.614 ± 0.469
GlyAsn: 3.349 ± 0.575
GlyPro: 1.143 ± 0.191
GlyGln: 3.349 ± 0.455
GlyArg: 4.655 ± 0.593
GlySer: 4.084 ± 0.589
GlyThr: 4.492 ± 0.736
GlyVal: 5.472 ± 0.62
GlyTrp: 1.633 ± 0.317
GlyTyr: 2.695 ± 0.546
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.98 ± 0.317
HisCys: 0.245 ± 0.118
HisAsp: 0.98 ± 0.203
HisGlu: 1.307 ± 0.356
HisPhe: 1.225 ± 0.312
HisGly: 1.307 ± 0.376
HisHis: 0.408 ± 0.175
HisIle: 0.898 ± 0.246
HisLys: 1.388 ± 0.319
HisLeu: 1.388 ± 0.371
HisMet: 0.327 ± 0.176
HisAsn: 0.817 ± 0.249
HisPro: 0.898 ± 0.278
HisGln: 0.653 ± 0.245
HisArg: 0.98 ± 0.237
HisSer: 0.98 ± 0.287
HisThr: 0.735 ± 0.227
HisVal: 0.898 ± 0.241
HisTrp: 0.327 ± 0.15
HisTyr: 0.98 ± 0.241
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.982 ± 0.597
IleCys: 0.653 ± 0.238
IleAsp: 3.349 ± 0.542
IleGlu: 3.92 ± 0.618
IlePhe: 1.062 ± 0.46
IleGly: 2.94 ± 0.72
IleHis: 0.98 ± 0.342
IleIle: 1.878 ± 0.436
IleLys: 2.94 ± 0.529
IleLeu: 2.695 ± 0.435
IleMet: 0.817 ± 0.262
IleAsn: 2.94 ± 0.478
IlePro: 2.287 ± 0.419
IleGln: 1.96 ± 0.286
IleArg: 3.022 ± 0.412
IleSer: 3.594 ± 0.475
IleThr: 4.329 ± 0.705
IleVal: 2.94 ± 0.444
IleTrp: 0.817 ± 0.268
IleTyr: 1.225 ± 0.334
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.88 ± 0.889
LysCys: 0.408 ± 0.259
LysAsp: 2.695 ± 0.537
LysGlu: 3.839 ± 0.488
LysPhe: 1.715 ± 0.354
LysGly: 3.43 ± 0.514
LysHis: 1.47 ± 0.302
LysIle: 1.715 ± 0.293
LysLys: 3.594 ± 0.617
LysLeu: 3.594 ± 0.713
LysMet: 1.388 ± 0.376
LysAsn: 2.042 ± 0.439
LysPro: 2.614 ± 0.502
LysGln: 2.369 ± 0.369
LysArg: 3.92 ± 0.509
LysSer: 3.675 ± 0.437
LysThr: 3.839 ± 0.62
LysVal: 3.104 ± 0.491
LysTrp: 1.143 ± 0.296
LysTyr: 1.552 ± 0.383
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 10.046 ± 1.293
LeuCys: 0.898 ± 0.273
LeuAsp: 3.675 ± 0.499
LeuGlu: 4.492 ± 0.621
LeuPhe: 1.96 ± 0.391
LeuGly: 4.247 ± 0.697
LeuHis: 1.225 ± 0.348
LeuIle: 3.675 ± 0.612
LeuLys: 4.737 ± 0.71
LeuLeu: 5.635 ± 0.73
LeuMet: 1.878 ± 0.349
LeuAsn: 2.94 ± 0.533
LeuPro: 4.165 ± 0.591
LeuGln: 2.777 ± 0.456
LeuArg: 4.819 ± 0.555
LeuSer: 6.37 ± 0.879
LeuThr: 5.88 ± 0.896
LeuVal: 4.247 ± 0.635
LeuTrp: 1.225 ± 0.298
LeuTyr: 1.143 ± 0.331
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.512 ± 0.641
MetCys: 0.245 ± 0.129
MetAsp: 1.388 ± 0.379
MetGlu: 0.98 ± 0.323
MetPhe: 1.388 ± 0.376
MetGly: 1.307 ± 0.275
MetHis: 0.572 ± 0.284
MetIle: 1.062 ± 0.327
MetLys: 1.715 ± 0.448
MetLeu: 2.532 ± 0.421
MetMet: 0.653 ± 0.226
MetAsn: 1.307 ± 0.314
MetPro: 1.47 ± 0.386
MetGln: 1.552 ± 0.429
MetArg: 1.797 ± 0.384
MetSer: 1.633 ± 0.341
MetThr: 3.022 ± 0.579
MetVal: 2.042 ± 0.332
MetTrp: 0.245 ± 0.137
MetTyr: 0.572 ± 0.225
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.757 ± 0.672
AsnCys: 0.735 ± 0.304
AsnAsp: 2.042 ± 0.337
AsnGlu: 2.94 ± 0.518
AsnPhe: 1.062 ± 0.457
AsnGly: 4.002 ± 0.517
AsnHis: 1.062 ± 0.308
AsnIle: 2.123 ± 0.411
AsnLys: 2.042 ± 0.478
AsnLeu: 2.532 ± 0.398
AsnMet: 1.062 ± 0.333
AsnAsn: 1.552 ± 0.362
AsnPro: 1.878 ± 0.353
AsnGln: 1.388 ± 0.396
AsnArg: 2.205 ± 0.481
AsnSer: 2.287 ± 0.42
AsnThr: 2.94 ± 0.558
AsnVal: 2.532 ± 0.365
AsnTrp: 0.572 ± 0.217
AsnTyr: 0.98 ± 0.313
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.002 ± 0.58
ProCys: 0.163 ± 0.102
ProAsp: 3.185 ± 0.609
ProGlu: 2.777 ± 0.587
ProPhe: 1.552 ± 0.319
ProGly: 2.94 ± 0.45
ProHis: 0.572 ± 0.21
ProIle: 1.225 ± 0.315
ProLys: 2.042 ± 0.382
ProLeu: 2.859 ± 0.426
ProMet: 1.143 ± 0.289
ProAsn: 1.47 ± 0.397
ProPro: 1.388 ± 0.429
ProGln: 1.388 ± 0.309
ProArg: 1.96 ± 0.469
ProSer: 2.205 ± 0.467
ProThr: 1.878 ± 0.439
ProVal: 3.512 ± 0.43
ProTrp: 0.735 ± 0.263
ProTyr: 0.817 ± 0.248
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.329 ± 0.856
GlnCys: 0.49 ± 0.182
GlnAsp: 1.797 ± 0.388
GlnGlu: 2.94 ± 0.563
GlnPhe: 1.552 ± 0.328
GlnGly: 1.797 ± 0.359
GlnHis: 0.572 ± 0.208
GlnIle: 2.614 ± 0.523
GlnLys: 1.878 ± 0.414
GlnLeu: 4.084 ± 0.477
GlnMet: 1.715 ± 0.434
GlnAsn: 2.287 ± 0.444
GlnPro: 1.715 ± 0.428
GlnGln: 2.94 ± 0.704
GlnArg: 2.695 ± 0.461
GlnSer: 2.94 ± 0.53
GlnThr: 2.777 ± 0.617
GlnVal: 3.512 ± 0.521
GlnTrp: 0.572 ± 0.238
GlnTyr: 1.552 ± 0.341
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.717 ± 0.653
ArgCys: 0.898 ± 0.313
ArgAsp: 3.594 ± 0.812
ArgGlu: 5.145 ± 0.72
ArgPhe: 1.878 ± 0.401
ArgGly: 4.084 ± 0.541
ArgHis: 1.552 ± 0.348
ArgIle: 4.084 ± 0.607
ArgLys: 3.512 ± 0.584
ArgLeu: 5.145 ± 0.723
ArgMet: 1.96 ± 0.413
ArgAsn: 2.532 ± 0.496
ArgPro: 1.552 ± 0.361
ArgGln: 3.594 ± 0.505
ArgArg: 4.9 ± 0.815
ArgSer: 3.022 ± 0.407
ArgThr: 2.532 ± 0.527
ArgVal: 3.594 ± 0.642
ArgTrp: 0.898 ± 0.307
ArgTyr: 2.287 ± 0.397
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 8.249 ± 1.766
SerCys: 0.408 ± 0.177
SerAsp: 3.757 ± 0.531
SerGlu: 4.819 ± 0.646
SerPhe: 2.042 ± 0.398
SerGly: 7.024 ± 0.873
SerHis: 1.062 ± 0.27
SerIle: 2.287 ± 0.348
SerLys: 3.022 ± 0.454
SerLeu: 5.064 ± 0.857
SerMet: 2.369 ± 0.456
SerAsn: 1.878 ± 0.307
SerPro: 2.695 ± 0.578
SerGln: 3.349 ± 0.357
SerArg: 4.655 ± 0.735
SerSer: 3.839 ± 0.722
SerThr: 3.757 ± 0.622
SerVal: 5.717 ± 0.912
SerTrp: 0.653 ± 0.324
SerTyr: 1.633 ± 0.344
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.351 ± 0.872
ThrCys: 0.653 ± 0.211
ThrAsp: 3.594 ± 0.437
ThrGlu: 4.084 ± 0.8
ThrPhe: 2.777 ± 0.649
ThrGly: 4.329 ± 0.648
ThrHis: 0.98 ± 0.287
ThrIle: 3.349 ± 0.537
ThrLys: 2.532 ± 0.481
ThrLeu: 5.799 ± 0.67
ThrMet: 1.307 ± 0.406
ThrAsn: 1.715 ± 0.475
ThrPro: 3.675 ± 0.718
ThrGln: 2.695 ± 0.454
ThrArg: 3.267 ± 0.465
ThrSer: 3.92 ± 0.674
ThrThr: 3.267 ± 0.618
ThrVal: 5.064 ± 0.904
ThrTrp: 1.225 ± 0.288
ThrTyr: 1.633 ± 0.365
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.779 ± 0.68
ValCys: 0.735 ± 0.295
ValAsp: 3.675 ± 0.527
ValGlu: 4.247 ± 0.603
ValPhe: 2.042 ± 0.376
ValGly: 3.839 ± 0.644
ValHis: 0.653 ± 0.219
ValIle: 3.43 ± 0.641
ValLys: 4.574 ± 0.567
ValLeu: 5.064 ± 0.782
ValMet: 2.614 ± 0.51
ValAsn: 3.349 ± 0.511
ValPro: 2.205 ± 0.487
ValGln: 2.614 ± 0.808
ValArg: 3.185 ± 0.551
ValSer: 5.88 ± 0.876
ValThr: 5.227 ± 0.753
ValVal: 4.982 ± 0.555
ValTrp: 0.817 ± 0.235
ValTyr: 2.123 ± 0.545
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.633 ± 0.34
TrpCys: 0.408 ± 0.169
TrpAsp: 0.98 ± 0.329
TrpGlu: 0.653 ± 0.227
TrpPhe: 0.653 ± 0.237
TrpGly: 0.735 ± 0.256
TrpHis: 0.49 ± 0.178
TrpIle: 0.572 ± 0.348
TrpLys: 1.143 ± 0.276
TrpLeu: 1.225 ± 0.326
TrpMet: 0.735 ± 0.222
TrpAsn: 0.817 ± 0.242
TrpPro: 0.653 ± 0.202
TrpGln: 0.572 ± 0.218
TrpArg: 1.062 ± 0.348
TrpSer: 1.225 ± 0.316
TrpThr: 0.98 ± 0.318
TrpVal: 1.225 ± 0.28
TrpTrp: 0.327 ± 0.201
TrpTyr: 0.49 ± 0.181
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.614 ± 0.483
TyrCys: 0.408 ± 0.174
TyrAsp: 2.123 ± 0.347
TyrGlu: 1.878 ± 0.387
TyrPhe: 1.47 ± 0.403
TyrGly: 2.123 ± 0.423
TyrHis: 0.49 ± 0.224
TyrIle: 1.552 ± 0.361
TyrLys: 1.388 ± 0.42
TyrLeu: 2.205 ± 0.544
TyrMet: 0.653 ± 0.266
TyrAsn: 0.735 ± 0.208
TyrPro: 1.388 ± 0.313
TyrGln: 1.307 ± 0.277
TyrArg: 1.878 ± 0.363
TyrSer: 2.94 ± 0.574
TyrThr: 1.307 ± 0.323
TyrVal: 1.307 ± 0.411
TyrTrp: 0.408 ± 0.158
TyrTyr: 1.307 ± 0.384
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 56 proteins (12245 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
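A protein-level bootstrap of this kind might look like the sketch below. It is an illustration under assumptions rather than the database's actual procedure: whole proteins are resampled with replacement, the frequency of one dipeptide is recomputed for each of 100 resamples, and the standard deviation of those estimates is reported as the error. The function names, the fixed seed, and the choice of standard deviation as the error measure are all hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes, e.g. "AA") in per mille."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(resample, pair))
    return statistics.stdev(estimates)


toy = ["AAGA", "GGGG", "AGAG"]  # hypothetical sequences, not from phage DE3
value = pair_per_mille(toy, "AA")
error = bootstrap_error(toy, "AA")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the spread reflects how much the estimate depends on which proteins happen to be in the proteome.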

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski