Amino acid dipepetide frequency for Salmonella phage ZCSE2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.036AlaAla: 7.036 ± 0.835
0.728AlaCys: 0.728 ± 0.231
5.762AlaAsp: 5.762 ± 0.694
6.308AlaGlu: 6.308 ± 1.106
2.062AlaPhe: 2.062 ± 0.361
5.217AlaGly: 5.217 ± 0.678
1.334AlaHis: 1.334 ± 0.271
5.095AlaIle: 5.095 ± 0.519
6.126AlaLys: 6.126 ± 0.742
6.066AlaLeu: 6.066 ± 0.638
2.002AlaMet: 2.002 ± 0.349
5.095AlaAsn: 5.095 ± 0.578
2.73AlaPro: 2.73 ± 0.415
2.79AlaGln: 2.79 ± 0.435
3.943AlaArg: 3.943 ± 0.5
5.095AlaSer: 5.095 ± 0.561
6.187AlaThr: 6.187 ± 1.17
5.823AlaVal: 5.823 ± 0.6
1.516AlaTrp: 1.516 ± 0.382
2.608AlaTyr: 2.608 ± 0.355
0.0AlaXaa: 0.0 ± 0.0
Cys
0.91CysAla: 0.91 ± 0.229
0.182CysCys: 0.182 ± 0.093
0.789CysAsp: 0.789 ± 0.235
1.031CysGlu: 1.031 ± 0.253
0.546CysPhe: 0.546 ± 0.203
0.789CysGly: 0.789 ± 0.236
0.425CysHis: 0.425 ± 0.195
0.789CysIle: 0.789 ± 0.177
1.395CysLys: 1.395 ± 0.271
1.092CysLeu: 1.092 ± 0.273
0.485CysMet: 0.485 ± 0.191
0.607CysAsn: 0.607 ± 0.174
0.303CysPro: 0.303 ± 0.125
0.303CysGln: 0.303 ± 0.127
0.667CysArg: 0.667 ± 0.208
0.243CysSer: 0.243 ± 0.134
0.728CysThr: 0.728 ± 0.253
1.031CysVal: 1.031 ± 0.276
0.061CysTrp: 0.061 ± 0.065
0.728CysTyr: 0.728 ± 0.189
0.0CysXaa: 0.0 ± 0.0
Asp
6.066AspAla: 6.066 ± 0.589
0.849AspCys: 0.849 ± 0.188
4.367AspAsp: 4.367 ± 0.458
4.671AspGlu: 4.671 ± 0.53
2.548AspPhe: 2.548 ± 0.417
4.974AspGly: 4.974 ± 0.494
0.789AspHis: 0.789 ± 0.254
4.853AspIle: 4.853 ± 0.564
4.003AspLys: 4.003 ± 0.438
5.277AspLeu: 5.277 ± 0.561
1.82AspMet: 1.82 ± 0.38
3.7AspAsn: 3.7 ± 0.604
2.669AspPro: 2.669 ± 0.327
2.184AspGln: 2.184 ± 0.316
2.123AspArg: 2.123 ± 0.417
3.094AspSer: 3.094 ± 0.508
2.912AspThr: 2.912 ± 0.499
4.185AspVal: 4.185 ± 0.423
1.516AspTrp: 1.516 ± 0.388
2.123AspTyr: 2.123 ± 0.415
0.0AspXaa: 0.0 ± 0.0
Glu
5.641GluAla: 5.641 ± 0.894
0.91GluCys: 0.91 ± 0.23
3.215GluAsp: 3.215 ± 0.416
4.307GluGlu: 4.307 ± 0.656
2.123GluPhe: 2.123 ± 0.519
4.307GluGly: 4.307 ± 0.504
1.152GluHis: 1.152 ± 0.262
2.73GluIle: 2.73 ± 0.353
2.912GluLys: 2.912 ± 0.675
7.643GluLeu: 7.643 ± 0.959
2.548GluMet: 2.548 ± 0.419
2.548GluAsn: 2.548 ± 0.393
2.305GluPro: 2.305 ± 0.319
3.094GluGln: 3.094 ± 0.521
3.397GluArg: 3.397 ± 0.55
3.518GluSer: 3.518 ± 0.564
2.608GluThr: 2.608 ± 0.366
4.003GluVal: 4.003 ± 0.408
0.91GluTrp: 0.91 ± 0.222
2.669GluTyr: 2.669 ± 0.437
0.0GluXaa: 0.0 ± 0.0
Phe
1.82PheAla: 1.82 ± 0.396
0.546PheCys: 0.546 ± 0.189
2.305PheAsp: 2.305 ± 0.377
2.305PheGlu: 2.305 ± 0.474
1.092PhePhe: 1.092 ± 0.268
2.73PheGly: 2.73 ± 0.365
0.667PheHis: 0.667 ± 0.174
2.244PheIle: 2.244 ± 0.356
2.73PheLys: 2.73 ± 0.421
1.88PheLeu: 1.88 ± 0.288
0.91PheMet: 0.91 ± 0.23
2.972PheAsn: 2.972 ± 0.405
1.152PhePro: 1.152 ± 0.233
1.092PheGln: 1.092 ± 0.272
1.698PheArg: 1.698 ± 0.335
2.669PheSer: 2.669 ± 0.388
3.033PheThr: 3.033 ± 0.429
2.305PheVal: 2.305 ± 0.386
0.485PheTrp: 0.485 ± 0.188
1.152PheTyr: 1.152 ± 0.27
0.0PheXaa: 0.0 ± 0.0
Gly
5.338GlyAla: 5.338 ± 0.599
0.849GlyCys: 0.849 ± 0.208
3.579GlyAsp: 3.579 ± 0.528
2.912GlyGlu: 2.912 ± 0.469
2.912GlyPhe: 2.912 ± 0.453
5.52GlyGly: 5.52 ± 0.776
1.516GlyHis: 1.516 ± 0.305
5.156GlyIle: 5.156 ± 0.48
5.338GlyLys: 5.338 ± 0.54
5.277GlyLeu: 5.277 ± 0.467
1.698GlyMet: 1.698 ± 0.392
3.457GlyAsn: 3.457 ± 0.394
2.002GlyPro: 2.002 ± 0.31
2.851GlyGln: 2.851 ± 0.375
2.972GlyArg: 2.972 ± 0.538
4.003GlySer: 4.003 ± 0.457
4.185GlyThr: 4.185 ± 0.582
5.399GlyVal: 5.399 ± 0.577
1.395GlyTrp: 1.395 ± 0.34
2.669GlyTyr: 2.669 ± 0.488
0.0GlyXaa: 0.0 ± 0.0
His
0.971HisAla: 0.971 ± 0.19
0.303HisCys: 0.303 ± 0.15
1.152HisAsp: 1.152 ± 0.314
0.971HisGlu: 0.971 ± 0.3
0.789HisPhe: 0.789 ± 0.199
0.91HisGly: 0.91 ± 0.218
0.546HisHis: 0.546 ± 0.171
1.456HisIle: 1.456 ± 0.328
0.789HisLys: 0.789 ± 0.262
1.577HisLeu: 1.577 ± 0.386
0.789HisMet: 0.789 ± 0.23
0.91HisAsn: 0.91 ± 0.269
1.092HisPro: 1.092 ± 0.23
0.728HisGln: 0.728 ± 0.219
0.485HisArg: 0.485 ± 0.208
1.152HisSer: 1.152 ± 0.28
0.607HisThr: 0.607 ± 0.194
0.971HisVal: 0.971 ± 0.226
0.364HisTrp: 0.364 ± 0.12
0.789HisTyr: 0.789 ± 0.195
0.0HisXaa: 0.0 ± 0.0
Ile
5.217IleAla: 5.217 ± 0.577
0.607IleCys: 0.607 ± 0.21
4.064IleAsp: 4.064 ± 0.404
3.094IleGlu: 3.094 ± 0.461
1.698IlePhe: 1.698 ± 0.327
3.761IleGly: 3.761 ± 0.454
0.91IleHis: 0.91 ± 0.276
2.548IleIle: 2.548 ± 0.467
4.125IleLys: 4.125 ± 0.438
4.185IleLeu: 4.185 ± 0.563
1.577IleMet: 1.577 ± 0.288
3.639IleAsn: 3.639 ± 0.478
2.972IlePro: 2.972 ± 0.457
2.669IleGln: 2.669 ± 0.412
2.487IleArg: 2.487 ± 0.31
2.366IleSer: 2.366 ± 0.35
5.095IleThr: 5.095 ± 0.707
3.579IleVal: 3.579 ± 0.558
0.789IleTrp: 0.789 ± 0.277
2.366IleTyr: 2.366 ± 0.49
0.0IleXaa: 0.0 ± 0.0
Lys
6.369LysAla: 6.369 ± 1.092
0.546LysCys: 0.546 ± 0.212
3.397LysAsp: 3.397 ± 0.443
5.156LysGlu: 5.156 ± 0.656
2.244LysPhe: 2.244 ± 0.451
3.457LysGly: 3.457 ± 0.41
0.971LysHis: 0.971 ± 0.261
2.73LysIle: 2.73 ± 0.42
3.094LysLys: 3.094 ± 0.574
6.066LysLeu: 6.066 ± 0.588
2.123LysMet: 2.123 ± 0.373
2.912LysAsn: 2.912 ± 0.402
3.882LysPro: 3.882 ± 0.521
2.548LysGln: 2.548 ± 0.356
2.548LysArg: 2.548 ± 0.456
3.639LysSer: 3.639 ± 0.494
3.761LysThr: 3.761 ± 0.544
4.792LysVal: 4.792 ± 0.505
0.667LysTrp: 0.667 ± 0.198
2.002LysTyr: 2.002 ± 0.456
0.0LysXaa: 0.0 ± 0.0
Leu
6.915LeuAla: 6.915 ± 0.713
1.334LeuCys: 1.334 ± 0.326
5.399LeuAsp: 5.399 ± 0.532
5.338LeuGlu: 5.338 ± 0.666
2.912LeuPhe: 2.912 ± 0.4
4.307LeuGly: 4.307 ± 0.547
1.334LeuHis: 1.334 ± 0.344
3.882LeuIle: 3.882 ± 0.482
5.58LeuLys: 5.58 ± 0.53
5.035LeuLeu: 5.035 ± 0.595
2.426LeuMet: 2.426 ± 0.417
3.579LeuAsn: 3.579 ± 0.528
4.731LeuPro: 4.731 ± 0.643
3.276LeuGln: 3.276 ± 0.395
4.307LeuArg: 4.307 ± 0.494
4.125LeuSer: 4.125 ± 0.373
6.612LeuThr: 6.612 ± 0.666
5.277LeuVal: 5.277 ± 0.568
1.092LeuTrp: 1.092 ± 0.317
2.73LeuTyr: 2.73 ± 0.42
0.0LeuXaa: 0.0 ± 0.0
Met
2.79MetAla: 2.79 ± 0.419
0.485MetCys: 0.485 ± 0.188
2.184MetAsp: 2.184 ± 0.304
1.82MetGlu: 1.82 ± 0.407
1.152MetPhe: 1.152 ± 0.257
1.334MetGly: 1.334 ± 0.251
0.364MetHis: 0.364 ± 0.134
1.213MetIle: 1.213 ± 0.309
2.123MetLys: 2.123 ± 0.353
2.366MetLeu: 2.366 ± 0.309
0.728MetMet: 0.728 ± 0.166
1.092MetAsn: 1.092 ± 0.243
1.334MetPro: 1.334 ± 0.267
0.849MetGln: 0.849 ± 0.223
1.82MetArg: 1.82 ± 0.33
1.82MetSer: 1.82 ± 0.3
2.123MetThr: 2.123 ± 0.352
1.638MetVal: 1.638 ± 0.27
0.182MetTrp: 0.182 ± 0.106
1.213MetTyr: 1.213 ± 0.291
0.0MetXaa: 0.0 ± 0.0
Asn
4.367AsnAla: 4.367 ± 0.448
0.789AsnCys: 0.789 ± 0.185
2.851AsnAsp: 2.851 ± 0.414
2.608AsnGlu: 2.608 ± 0.437
2.244AsnPhe: 2.244 ± 0.363
4.428AsnGly: 4.428 ± 0.549
0.607AsnHis: 0.607 ± 0.169
3.154AsnIle: 3.154 ± 0.402
2.972AsnLys: 2.972 ± 0.48
3.821AsnLeu: 3.821 ± 0.45
1.334AsnMet: 1.334 ± 0.263
2.487AsnAsn: 2.487 ± 0.47
2.73AsnPro: 2.73 ± 0.357
1.516AsnGln: 1.516 ± 0.331
2.79AsnArg: 2.79 ± 0.435
2.73AsnSer: 2.73 ± 0.443
3.033AsnThr: 3.033 ± 0.357
3.276AsnVal: 3.276 ± 0.37
1.638AsnTrp: 1.638 ± 0.295
1.213AsnTyr: 1.213 ± 0.318
0.0AsnXaa: 0.0 ± 0.0
Pro
3.882ProAla: 3.882 ± 0.58
0.546ProCys: 0.546 ± 0.212
3.518ProAsp: 3.518 ± 0.472
2.79ProGlu: 2.79 ± 0.466
1.698ProPhe: 1.698 ± 0.348
2.912ProGly: 2.912 ± 0.466
0.546ProHis: 0.546 ± 0.151
2.062ProIle: 2.062 ± 0.369
2.366ProLys: 2.366 ± 0.383
3.336ProLeu: 3.336 ± 0.5
1.274ProMet: 1.274 ± 0.309
1.88ProAsn: 1.88 ± 0.328
2.305ProPro: 2.305 ± 0.339
1.516ProGln: 1.516 ± 0.302
2.002ProArg: 2.002 ± 0.402
2.608ProSer: 2.608 ± 0.484
3.276ProThr: 3.276 ± 0.381
3.7ProVal: 3.7 ± 0.541
0.546ProTrp: 0.546 ± 0.149
1.334ProTyr: 1.334 ± 0.321
0.0ProXaa: 0.0 ± 0.0
Gln
3.033GlnAla: 3.033 ± 0.441
0.667GlnCys: 0.667 ± 0.185
1.698GlnAsp: 1.698 ± 0.311
2.305GlnGlu: 2.305 ± 0.337
1.941GlnPhe: 1.941 ± 0.326
2.184GlnGly: 2.184 ± 0.378
0.728GlnHis: 0.728 ± 0.228
2.972GlnIle: 2.972 ± 0.522
1.577GlnLys: 1.577 ± 0.283
4.913GlnLeu: 4.913 ± 0.457
1.577GlnMet: 1.577 ± 0.371
1.031GlnAsn: 1.031 ± 0.181
1.698GlnPro: 1.698 ± 0.361
1.941GlnGln: 1.941 ± 0.46
1.759GlnArg: 1.759 ± 0.315
1.638GlnSer: 1.638 ± 0.244
2.184GlnThr: 2.184 ± 0.398
2.184GlnVal: 2.184 ± 0.355
0.849GlnTrp: 0.849 ± 0.239
1.698GlnTyr: 1.698 ± 0.341
0.0GlnXaa: 0.0 ± 0.0
Arg
4.003ArgAla: 4.003 ± 0.625
0.607ArgCys: 0.607 ± 0.177
2.184ArgAsp: 2.184 ± 0.381
2.608ArgGlu: 2.608 ± 0.547
1.577ArgPhe: 1.577 ± 0.314
2.851ArgGly: 2.851 ± 0.435
1.152ArgHis: 1.152 ± 0.265
3.215ArgIle: 3.215 ± 0.453
3.639ArgLys: 3.639 ± 0.56
3.518ArgLeu: 3.518 ± 0.462
1.456ArgMet: 1.456 ± 0.315
1.88ArgAsn: 1.88 ± 0.348
2.002ArgPro: 2.002 ± 0.389
2.184ArgGln: 2.184 ± 0.314
3.457ArgArg: 3.457 ± 0.46
2.123ArgSer: 2.123 ± 0.39
2.548ArgThr: 2.548 ± 0.318
3.518ArgVal: 3.518 ± 0.438
0.91ArgTrp: 0.91 ± 0.247
1.334ArgTyr: 1.334 ± 0.252
0.0ArgXaa: 0.0 ± 0.0
Ser
5.035SerAla: 5.035 ± 0.673
0.667SerCys: 0.667 ± 0.218
3.639SerAsp: 3.639 ± 0.532
3.518SerGlu: 3.518 ± 0.377
2.366SerPhe: 2.366 ± 0.321
4.549SerGly: 4.549 ± 0.515
0.667SerHis: 0.667 ± 0.203
2.851SerIle: 2.851 ± 0.485
3.397SerLys: 3.397 ± 0.452
3.7SerLeu: 3.7 ± 0.625
1.274SerMet: 1.274 ± 0.282
2.669SerAsn: 2.669 ± 0.446
1.88SerPro: 1.88 ± 0.408
2.062SerGln: 2.062 ± 0.387
2.426SerArg: 2.426 ± 0.393
4.003SerSer: 4.003 ± 0.52
3.639SerThr: 3.639 ± 0.501
4.428SerVal: 4.428 ± 0.6
0.607SerTrp: 0.607 ± 0.212
1.88SerTyr: 1.88 ± 0.328
0.0SerXaa: 0.0 ± 0.0
Thr
6.126ThrAla: 6.126 ± 0.78
0.728ThrCys: 0.728 ± 0.216
4.428ThrAsp: 4.428 ± 0.631
3.397ThrGlu: 3.397 ± 0.397
1.941ThrPhe: 1.941 ± 0.321
5.641ThrGly: 5.641 ± 0.715
1.092ThrHis: 1.092 ± 0.255
4.003ThrIle: 4.003 ± 0.663
3.579ThrLys: 3.579 ± 0.541
4.913ThrLeu: 4.913 ± 0.679
1.395ThrMet: 1.395 ± 0.262
3.094ThrAsn: 3.094 ± 0.375
3.094ThrPro: 3.094 ± 0.497
2.426ThrGln: 2.426 ± 0.428
2.79ThrArg: 2.79 ± 0.54
3.882ThrSer: 3.882 ± 0.535
3.518ThrThr: 3.518 ± 0.681
6.126ThrVal: 6.126 ± 1.304
1.334ThrTrp: 1.334 ± 0.284
2.305ThrTyr: 2.305 ± 0.343
0.0ThrXaa: 0.0 ± 0.0
Val
4.974ValAla: 4.974 ± 0.537
0.91ValCys: 0.91 ± 0.228
5.641ValAsp: 5.641 ± 0.616
4.61ValGlu: 4.61 ± 0.606
1.88ValPhe: 1.88 ± 0.274
5.459ValGly: 5.459 ± 0.456
1.516ValHis: 1.516 ± 0.312
3.518ValIle: 3.518 ± 0.547
4.489ValLys: 4.489 ± 0.465
5.338ValLeu: 5.338 ± 0.574
2.002ValMet: 2.002 ± 0.362
5.095ValAsn: 5.095 ± 0.553
3.094ValPro: 3.094 ± 0.497
2.608ValGln: 2.608 ± 0.411
2.244ValArg: 2.244 ± 0.39
3.7ValSer: 3.7 ± 0.45
6.672ValThr: 6.672 ± 1.374
4.974ValVal: 4.974 ± 0.776
0.849ValTrp: 0.849 ± 0.234
2.062ValTyr: 2.062 ± 0.328
0.0ValXaa: 0.0 ± 0.0
Trp
1.092TrpAla: 1.092 ± 0.262
0.243TrpCys: 0.243 ± 0.133
1.274TrpAsp: 1.274 ± 0.253
0.546TrpGlu: 0.546 ± 0.189
0.971TrpPhe: 0.971 ± 0.237
1.152TrpGly: 1.152 ± 0.249
0.243TrpHis: 0.243 ± 0.142
1.031TrpIle: 1.031 ± 0.261
0.667TrpLys: 0.667 ± 0.183
1.516TrpLeu: 1.516 ± 0.279
0.303TrpMet: 0.303 ± 0.147
0.425TrpAsn: 0.425 ± 0.165
0.607TrpPro: 0.607 ± 0.175
0.364TrpGln: 0.364 ± 0.157
0.849TrpArg: 0.849 ± 0.232
1.274TrpSer: 1.274 ± 0.334
0.849TrpThr: 0.849 ± 0.261
1.456TrpVal: 1.456 ± 0.315
0.425TrpTrp: 0.425 ± 0.156
1.092TrpTyr: 1.092 ± 0.224
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.941TyrAla: 1.941 ± 0.366
0.607TyrCys: 0.607 ± 0.181
3.276TyrAsp: 3.276 ± 0.478
2.244TyrGlu: 2.244 ± 0.361
1.092TyrPhe: 1.092 ± 0.272
2.548TyrGly: 2.548 ± 0.408
0.789TyrHis: 0.789 ± 0.227
2.244TyrIle: 2.244 ± 0.34
2.002TyrLys: 2.002 ± 0.328
2.669TyrLeu: 2.669 ± 0.522
0.849TyrMet: 0.849 ± 0.238
1.516TyrAsn: 1.516 ± 0.34
1.516TyrPro: 1.516 ± 0.279
1.577TyrGln: 1.577 ± 0.284
2.062TyrArg: 2.062 ± 0.342
1.577TyrSer: 1.577 ± 0.302
2.244TyrThr: 2.244 ± 0.354
2.972TyrVal: 2.972 ± 0.43
0.243TyrTrp: 0.243 ± 0.123
1.395TyrTyr: 1.395 ± 0.305
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 78 proteins (16487 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski