Amino acid dipeptide frequency for Escherichia phage JLK-2012

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
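The counting described above can be sketched in Python as follows. This is a minimal illustration, not the database's actual pipeline: the function names, the use of overlapping two-residue windows that do not cross protein boundaries, the mapping of every non-standard letter to `X` (Xaa), and the normalization by the total number of dipeptide windows are all assumptions.

```python
from collections import Counter

# The 20 standard amino acids (one-letter codes); anything else is treated as Xaa ("X").
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation for 400 dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / (len(STANDARD) ** 2)


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumption: overlapping windows of two residues, counted within each
    protein separately and normalized by the total number of windows.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the uniform expectation (red/blue in the table)."""
    if freq_per_mille > expected:
        return "over"   # would be marked red
    if freq_per_mille < expected:
        return "under"  # would be marked blue
    return "expected"


if __name__ == "__main__":
    # Toy sequences, not taken from the phage proteome.
    freqs = dipeptide_per_mille(["AAAC", "AC"])
    for dp, f in sorted(freqs.items()):
        print(dp, round(f, 3), classify(f))
```

With table values as input, `classify(11.068)` labels AlaAla as overrepresented and `classify(0.795)` labels AlaCys as underrepresented.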

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
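As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and a percentage. The helper names are hypothetical; the numeric inputs are the AlaAla entry from the table and the uniform expectation of 2.5‰.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


if __name__ == "__main__":
    ala_ala = 11.068  # AlaAla frequency from the table, in per mille
    print(f"{per_mille_to_fraction(ala_ala):.6f}")  # fraction of all dipeptides
    print(f"{per_mille_to_percent(ala_ala):.4f}%")  # the same value in percent
    print(f"{per_mille_to_percent(2.5):.2f}%")      # uniform expectation in percent
```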

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 11.068 ± 1.666
AlaCys: 0.795 ± 0.227
AlaAsp: 4.647 ± 0.538
AlaGlu: 7.521 ± 0.89
AlaPhe: 2.996 ± 0.432
AlaGly: 7.949 ± 1.067
AlaHis: 2.018 ± 0.311
AlaIle: 5.503 ± 0.518
AlaLys: 4.464 ± 0.493
AlaLeu: 8.927 ± 0.959
AlaMet: 2.69 ± 0.345
AlaAsn: 3.057 ± 0.447
AlaPro: 2.568 ± 0.368
AlaGln: 4.341 ± 0.598
AlaArg: 6.115 ± 0.748
AlaSer: 5.87 ± 0.797
AlaThr: 4.341 ± 0.6
AlaVal: 6.176 ± 0.919
AlaTrp: 2.079 ± 0.353
AlaTyr: 2.324 ± 0.387
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.611 ± 0.17
CysCys: 0.306 ± 0.155
CysAsp: 0.673 ± 0.235
CysGlu: 0.673 ± 0.246
CysPhe: 0.367 ± 0.175
CysGly: 1.284 ± 0.345
CysHis: 0.611 ± 0.212
CysIle: 0.856 ± 0.22
CysLys: 0.611 ± 0.226
CysLeu: 1.04 ± 0.24
CysMet: 0.489 ± 0.195
CysAsn: 0.428 ± 0.162
CysPro: 0.489 ± 0.219
CysGln: 0.367 ± 0.15
CysArg: 1.284 ± 0.321
CysSer: 0.978 ± 0.263
CysThr: 1.101 ± 0.25
CysVal: 0.856 ± 0.231
CysTrp: 0.245 ± 0.108
CysTyr: 0.55 ± 0.183
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.259 ± 0.6
AspCys: 0.611 ± 0.176
AspAsp: 3.73 ± 0.531
AspGlu: 3.852 ± 0.554
AspPhe: 1.223 ± 0.24
AspGly: 5.626 ± 0.636
AspHis: 0.673 ± 0.198
AspIle: 4.036 ± 0.465
AspLys: 2.568 ± 0.432
AspLeu: 3.975 ± 0.59
AspMet: 2.079 ± 0.378
AspAsn: 2.507 ± 0.457
AspPro: 2.385 ± 0.472
AspGln: 1.345 ± 0.268
AspArg: 3.057 ± 0.44
AspSer: 2.813 ± 0.396
AspThr: 2.507 ± 0.476
AspVal: 3.852 ± 0.577
AspTrp: 1.101 ± 0.285
AspTyr: 1.468 ± 0.334
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.787 ± 0.625
GluCys: 1.101 ± 0.295
GluAsp: 2.996 ± 0.398
GluGlu: 3.424 ± 0.609
GluPhe: 2.568 ± 0.337
GluGly: 3.547 ± 0.412
GluHis: 0.917 ± 0.264
GluIle: 4.647 ± 0.484
GluLys: 4.219 ± 0.592
GluLeu: 6.054 ± 0.7
GluMet: 1.773 ± 0.3
GluAsn: 3.119 ± 0.434
GluPro: 2.14 ± 0.356
GluGln: 4.403 ± 0.5
GluArg: 4.831 ± 0.632
GluSer: 4.28 ± 0.487
GluThr: 3.73 ± 0.631
GluVal: 3.363 ± 0.507
GluTrp: 1.345 ± 0.248
GluTyr: 2.201 ± 0.345
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.385 ± 0.416
PheCys: 0.55 ± 0.197
PheAsp: 1.834 ± 0.342
PheGlu: 1.712 ± 0.316
PhePhe: 0.611 ± 0.22
PheGly: 3.119 ± 0.397
PheHis: 0.611 ± 0.181
PheIle: 1.651 ± 0.337
PheLys: 1.345 ± 0.282
PheLeu: 1.712 ± 0.319
PheMet: 0.734 ± 0.232
PheAsn: 1.406 ± 0.258
PhePro: 1.101 ± 0.294
PheGln: 0.795 ± 0.213
PheArg: 2.262 ± 0.329
PheSer: 3.547 ± 0.528
PheThr: 2.324 ± 0.429
PheVal: 2.079 ± 0.331
PheTrp: 0.55 ± 0.2
PheTyr: 0.611 ± 0.189
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.564 ± 0.703
GlyCys: 1.223 ± 0.319
GlyAsp: 3.852 ± 0.529
GlyGlu: 5.626 ± 0.814
GlyPhe: 2.507 ± 0.349
GlyGly: 5.198 ± 0.657
GlyHis: 1.284 ± 0.287
GlyIle: 3.852 ± 0.384
GlyLys: 5.014 ± 0.64
GlyLeu: 6.543 ± 0.652
GlyMet: 2.813 ± 0.362
GlyAsn: 3.485 ± 0.463
GlyPro: 2.69 ± 1.24
GlyGln: 3.241 ± 0.479
GlyArg: 3.913 ± 0.54
GlySer: 4.464 ± 0.547
GlyThr: 3.547 ± 0.379
GlyVal: 5.442 ± 0.528
GlyTrp: 1.406 ± 0.249
GlyTyr: 1.712 ± 0.306
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.284 ± 0.299
HisCys: 0.306 ± 0.135
HisAsp: 1.162 ± 0.284
HisGlu: 1.101 ± 0.291
HisPhe: 0.611 ± 0.183
HisGly: 1.59 ± 0.285
HisHis: 0.917 ± 0.232
HisIle: 1.406 ± 0.285
HisLys: 1.284 ± 0.272
HisLeu: 1.834 ± 0.323
HisMet: 0.183 ± 0.117
HisAsn: 0.428 ± 0.16
HisPro: 1.162 ± 0.253
HisGln: 0.795 ± 0.217
HisArg: 1.284 ± 0.263
HisSer: 0.795 ± 0.275
HisThr: 0.978 ± 0.243
HisVal: 0.795 ± 0.22
HisTrp: 0.367 ± 0.149
HisTyr: 1.04 ± 0.26
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.097 ± 0.426
IleCys: 0.978 ± 0.247
IleAsp: 3.485 ± 0.455
IleGlu: 3.669 ± 0.611
IlePhe: 0.856 ± 0.233
IleGly: 2.69 ± 0.367
IleHis: 1.101 ± 0.242
IleIle: 2.69 ± 0.347
IleLys: 4.097 ± 0.413
IleLeu: 3.913 ± 0.543
IleMet: 0.795 ± 0.224
IleAsn: 2.813 ± 0.43
IlePro: 3.057 ± 0.613
IleGln: 2.385 ± 0.41
IleArg: 4.28 ± 0.567
IleSer: 4.464 ± 0.639
IleThr: 4.341 ± 0.449
IleVal: 2.629 ± 0.466
IleTrp: 0.734 ± 0.264
IleTyr: 1.773 ± 0.372
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.564 ± 0.603
LysCys: 0.55 ± 0.166
LysAsp: 2.568 ± 0.365
LysGlu: 3.608 ± 0.467
LysPhe: 1.284 ± 0.287
LysGly: 4.036 ± 0.499
LysHis: 0.795 ± 0.189
LysIle: 2.996 ± 0.359
LysLys: 4.464 ± 0.616
LysLeu: 4.036 ± 0.55
LysMet: 1.773 ± 0.328
LysAsn: 3.057 ± 0.427
LysPro: 2.813 ± 0.437
LysGln: 3.18 ± 0.54
LysArg: 2.996 ± 0.506
LysSer: 3.485 ± 0.435
LysThr: 3.791 ± 0.454
LysVal: 2.568 ± 0.486
LysTrp: 1.101 ± 0.284
LysTyr: 1.529 ± 0.362
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.6 ± 0.844
LeuCys: 1.284 ± 0.325
LeuAsp: 3.975 ± 0.545
LeuGlu: 4.769 ± 0.59
LeuPhe: 2.201 ± 0.333
LeuGly: 4.464 ± 0.687
LeuHis: 1.101 ± 0.254
LeuIle: 4.036 ± 0.572
LeuLys: 5.32 ± 0.499
LeuLeu: 5.687 ± 0.61
LeuMet: 2.018 ± 0.409
LeuAsn: 3.73 ± 0.531
LeuPro: 3.608 ± 0.448
LeuGln: 3.302 ± 0.459
LeuArg: 5.748 ± 0.602
LeuSer: 5.87 ± 0.708
LeuThr: 5.687 ± 0.577
LeuVal: 4.647 ± 0.522
LeuTrp: 1.834 ± 0.358
LeuTyr: 1.834 ± 0.378
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.935 ± 0.396
MetCys: 0.245 ± 0.13
MetAsp: 1.101 ± 0.258
MetGlu: 0.978 ± 0.227
MetPhe: 0.795 ± 0.216
MetGly: 1.162 ± 0.276
MetHis: 0.183 ± 0.094
MetIle: 1.101 ± 0.254
MetLys: 1.59 ± 0.318
MetLeu: 3.119 ± 0.455
MetMet: 0.489 ± 0.173
MetAsn: 1.345 ± 0.326
MetPro: 1.345 ± 0.254
MetGln: 1.529 ± 0.277
MetArg: 1.834 ± 0.359
MetSer: 2.018 ± 0.343
MetThr: 2.629 ± 0.313
MetVal: 1.712 ± 0.351
MetTrp: 0.428 ± 0.154
MetTyr: 0.306 ± 0.116
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.708 ± 0.736
AsnCys: 0.428 ± 0.193
AsnAsp: 2.507 ± 0.39
AsnGlu: 2.874 ± 0.527
AsnPhe: 1.468 ± 0.408
AsnGly: 4.464 ± 0.559
AsnHis: 1.284 ± 0.27
AsnIle: 2.385 ± 0.379
AsnLys: 1.896 ± 0.314
AsnLeu: 2.874 ± 0.464
AsnMet: 0.917 ± 0.269
AsnAsn: 1.712 ± 0.325
AsnPro: 2.14 ± 0.291
AsnGln: 1.529 ± 0.278
AsnArg: 2.629 ± 0.402
AsnSer: 2.385 ± 0.449
AsnThr: 2.752 ± 0.376
AsnVal: 2.568 ± 0.405
AsnTrp: 0.367 ± 0.144
AsnTyr: 1.284 ± 0.331
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.852 ± 0.681
ProCys: 0.428 ± 0.17
ProAsp: 3.363 ± 0.571
ProGlu: 4.708 ± 0.528
ProPhe: 1.468 ± 0.284
ProGly: 3.18 ± 0.472
ProHis: 0.917 ± 0.238
ProIle: 1.284 ± 0.279
ProLys: 1.957 ± 0.517
ProLeu: 2.262 ± 0.39
ProMet: 0.795 ± 0.249
ProAsn: 1.59 ± 0.326
ProPro: 1.957 ± 0.446
ProGln: 1.651 ± 0.382
ProArg: 2.324 ± 0.551
ProSer: 2.935 ± 0.372
ProThr: 1.651 ± 0.32
ProVal: 4.097 ± 0.755
ProTrp: 0.734 ± 0.221
ProTyr: 1.101 ± 0.254
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.464 ± 0.693
GlnCys: 1.04 ± 0.254
GlnAsp: 1.651 ± 0.273
GlnGlu: 2.752 ± 0.544
GlnPhe: 2.018 ± 0.301
GlnGly: 3.18 ± 0.48
GlnHis: 0.917 ± 0.196
GlnIle: 2.752 ± 0.372
GlnLys: 2.262 ± 0.425
GlnLeu: 2.935 ± 0.4
GlnMet: 1.529 ± 0.42
GlnAsn: 1.834 ± 0.516
GlnPro: 1.712 ± 0.233
GlnGln: 2.385 ± 0.491
GlnArg: 2.996 ± 0.453
GlnSer: 2.874 ± 0.478
GlnThr: 2.324 ± 0.312
GlnVal: 2.385 ± 0.365
GlnTrp: 0.734 ± 0.216
GlnTyr: 1.651 ± 0.294
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.831 ± 0.716
ArgCys: 0.856 ± 0.286
ArgAsp: 4.403 ± 0.877
ArgGlu: 6.054 ± 0.578
ArgPhe: 1.651 ± 0.313
ArgGly: 3.485 ± 0.471
ArgHis: 1.957 ± 0.423
ArgIle: 3.485 ± 0.531
ArgLys: 3.791 ± 0.548
ArgLeu: 5.992 ± 0.73
ArgMet: 2.018 ± 0.337
ArgAsn: 3.485 ± 0.465
ArgPro: 2.568 ± 0.457
ArgGln: 3.241 ± 0.471
ArgArg: 5.687 ± 0.795
ArgSer: 2.324 ± 0.359
ArgThr: 3.18 ± 0.388
ArgVal: 3.975 ± 0.508
ArgTrp: 1.101 ± 0.28
ArgTyr: 1.834 ± 0.329
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.971 ± 0.994
SerCys: 0.734 ± 0.223
SerAsp: 3.424 ± 0.461
SerGlu: 4.769 ± 0.472
SerPhe: 1.957 ± 0.319
SerGly: 6.298 ± 0.665
SerHis: 0.673 ± 0.231
SerIle: 3.302 ± 0.448
SerLys: 2.568 ± 0.476
SerLeu: 5.259 ± 0.563
SerMet: 2.14 ± 0.452
SerAsn: 2.446 ± 0.337
SerPro: 2.568 ± 0.481
SerGln: 3.302 ± 0.465
SerArg: 3.73 ± 0.514
SerSer: 4.586 ± 0.496
SerThr: 2.874 ± 0.417
SerVal: 4.831 ± 0.595
SerTrp: 0.734 ± 0.206
SerTyr: 1.406 ± 0.247
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.564 ± 0.613
ThrCys: 0.55 ± 0.2
ThrAsp: 3.424 ± 0.439
ThrGlu: 3.913 ± 0.568
ThrPhe: 1.957 ± 0.337
ThrGly: 5.381 ± 0.787
ThrHis: 1.406 ± 0.355
ThrIle: 3.363 ± 0.423
ThrLys: 2.69 ± 0.425
ThrLeu: 5.381 ± 0.624
ThrMet: 0.489 ± 0.193
ThrAsn: 2.079 ± 0.358
ThrPro: 2.935 ± 0.463
ThrGln: 2.079 ± 0.353
ThrArg: 2.507 ± 0.34
ThrSer: 3.363 ± 0.479
ThrThr: 3.057 ± 0.417
ThrVal: 4.708 ± 0.653
ThrTrp: 0.673 ± 0.222
ThrTyr: 1.284 ± 0.277
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.748 ± 0.729
ValCys: 1.04 ± 0.278
ValAsp: 3.057 ± 0.412
ValGlu: 3.975 ± 0.43
ValPhe: 2.079 ± 0.366
ValGly: 3.913 ± 0.535
ValHis: 0.795 ± 0.203
ValIle: 3.363 ± 0.453
ValLys: 3.852 ± 0.544
ValLeu: 5.503 ± 0.709
ValMet: 1.834 ± 0.326
ValAsn: 3.424 ± 0.392
ValPro: 3.302 ± 0.54
ValGln: 1.834 ± 0.41
ValArg: 4.831 ± 0.555
ValSer: 4.525 ± 0.641
ValThr: 3.302 ± 0.465
ValVal: 4.953 ± 0.578
ValTrp: 1.04 ± 0.267
ValTyr: 2.201 ± 0.354
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.223 ± 0.24
TrpCys: 0.489 ± 0.161
TrpAsp: 0.856 ± 0.277
TrpGlu: 0.489 ± 0.169
TrpPhe: 0.917 ± 0.247
TrpGly: 0.734 ± 0.187
TrpHis: 0.489 ± 0.152
TrpIle: 0.734 ± 0.168
TrpLys: 1.04 ± 0.25
TrpLeu: 2.201 ± 0.457
TrpMet: 0.673 ± 0.139
TrpAsn: 0.306 ± 0.137
TrpPro: 0.428 ± 0.152
TrpGln: 1.162 ± 0.204
TrpArg: 1.406 ± 0.279
TrpSer: 1.101 ± 0.237
TrpThr: 0.795 ± 0.198
TrpVal: 1.529 ± 0.253
TrpTrp: 0.367 ± 0.201
TrpTyr: 0.428 ± 0.142
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.935 ± 0.349
TyrCys: 0.306 ± 0.144
TyrAsp: 1.957 ± 0.346
TyrGlu: 0.978 ± 0.245
TyrPhe: 1.406 ± 0.312
TyrGly: 2.079 ± 0.45
TyrHis: 0.734 ± 0.216
TyrIle: 1.406 ± 0.316
TyrLys: 1.162 ± 0.243
TyrLeu: 1.468 ± 0.286
TyrMet: 0.55 ± 0.206
TyrAsn: 0.917 ± 0.23
TyrPro: 1.284 ± 0.297
TyrGln: 1.529 ± 0.396
TyrArg: 2.201 ± 0.383
TyrSer: 1.896 ± 0.305
TyrThr: 1.896 ± 0.274
TyrVal: 1.406 ± 0.296
TyrTrp: 0.428 ± 0.146
TyrTyr: 0.795 ± 0.229
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 79 proteins (16355 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
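A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions, not the database's code: the function names and the fixed seed are hypothetical, whole proteins are resampled with replacement 100 times, and the reported error is taken to be the sample standard deviation across replicates, which the note above does not specify.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins):
    """Frequencies (per mille) of overlapping dipeptides within each protein."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {dp: 1000.0 * n / total for dp, n in counts.items()} if total else {}


def bootstrap_error(proteins, n_boot=100, seed=0):
    """Estimate the error of each dipeptide frequency by resampling proteins.

    Assumption: the error is the sample standard deviation of the
    frequency over n_boot replicates (n_boot must be at least 2).
    """
    rng = random.Random(seed)
    replicates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement (bootstrap at the protein level).
        resampled = rng.choices(proteins, k=len(proteins))
        replicates.append(dipeptide_per_mille(resampled))
    # A dipeptide absent from a replicate contributes a frequency of 0.
    keys = set().union(*replicates)
    return {k: statistics.stdev([r.get(k, 0.0) for r in replicates]) for k in keys}


if __name__ == "__main__":
    # Toy sequences, not taken from the phage proteome.
    toy = ["AAAA", "CCCC", "ACAC", "AACC"]
    errors = bootstrap_error(toy)
    for dp in sorted(errors):
        print(dp, round(errors[dp], 3))
```

Resampling whole proteins rather than individual dipeptides keeps the dipeptides of one protein together, so the spread reflects variation between proteins.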

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski