Amino acid dipeptide frequency for Pseudomonas phage MR14

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
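
As a rough illustration of the calculation, the sketch below counts overlapping residue pairs within each protein and normalises by the total number of pairs. The function name `dipeptide_frequencies`, the use of one-letter codes, the mapping of every non-standard letter to Xaa, and the toy sequences are all assumptions made for this example; this is not the Proteome-pI implementation.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins.

    Pairs overlap and never span two proteins. Any letter outside the
    20 standard residues is mapped to 'X' (Xaa); this is an assumption.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not the MR14 proteome.
freqs = dipeptide_frequencies(["MAAL", "MAG"])
print(freqs)
```

Counting within each protein separately avoids creating artificial pairs across protein boundaries.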

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
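
The comparison against the uniform expectation can be sketched as follows. The three observed values are taken from the table on this page; the `classify` helper and its simple greater-than or less-than rule are assumptions, since the page does not state the exact threshold used for colouring.

```python
N_STANDARD = 20

# Uniform expectation in per mille: 1000 / 400 = 2.5 (i.e. 0.25 %).
expected = 1000.0 / N_STANDARD ** 2
# With Xaa as a 21st residue there are 441 pairs, roughly 2.27 per mille each.
expected_with_xaa = 1000.0 / (N_STANDARD + 1) ** 2


def classify(freq_per_mille, reference=expected):
    """Label a frequency relative to the reference (hypothetical rule)."""
    if freq_per_mille > reference:
        return "over"
    if freq_per_mille < reference:
        return "under"
    return "as expected"


# Values copied from the table (per mille).
observed = {"AlaAla": 17.272, "CysCys": 0.274, "LeuLeu": 7.128}
labels = {pair: classify(value) for pair, value in observed.items()}
print(expected, labels)
```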

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
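
A small sketch of the unit conversion, using the AlaAla value from the table; the helper names are made up for this example.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala = 17.272  # per mille, from the table
fraction = per_mille_to_fraction(ala_ala)  # about 0.017272
percent = per_mille_to_percent(ala_ala)    # about 1.7272
print(fraction, percent)
```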

For more information, see the sequence space article on Wikipedia.

Residues: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Dipeptide (first residue, second residue): frequency ± error (‰)

Ala
AlaAla: 17.272 ± 1.723
AlaCys: 1.028 ± 0.263
AlaAsp: 5.278 ± 0.601
AlaGlu: 6.169 ± 0.808
AlaPhe: 2.81 ± 0.337
AlaGly: 9.733 ± 0.82
AlaHis: 1.851 ± 0.462
AlaIle: 4.455 ± 0.531
AlaLys: 4.661 ± 0.594
AlaLeu: 11.035 ± 0.776
AlaMet: 3.153 ± 0.381
AlaAsn: 3.907 ± 0.505
AlaPro: 6.923 ± 1.461
AlaGln: 4.249 ± 0.454
AlaArg: 7.539 ± 0.63
AlaSer: 8.568 ± 0.766
AlaThr: 8.088 ± 1.108
AlaVal: 6.648 ± 0.659
AlaTrp: 2.056 ± 0.306
AlaTyr: 3.153 ± 0.45
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 1.097 ± 0.336
CysCys: 0.274 ± 0.151
CysAsp: 0.685 ± 0.185
CysGlu: 1.371 ± 0.334
CysPhe: 0.548 ± 0.225
CysGly: 1.165 ± 0.299
CysHis: 0.343 ± 0.145
CysIle: 0.48 ± 0.123
CysLys: 0.617 ± 0.174
CysLeu: 1.028 ± 0.226
CysMet: 0.137 ± 0.072
CysAsn: 0.137 ± 0.091
CysPro: 0.754 ± 0.255
CysGln: 0.343 ± 0.167
CysArg: 0.617 ± 0.228
CysSer: 1.097 ± 0.288
CysThr: 0.822 ± 0.277
CysVal: 0.48 ± 0.146
CysTrp: 0.617 ± 0.183
CysTyr: 0.274 ± 0.15
CysXaa: 0.0 ± 0.0

Asp
AspAla: 5.894 ± 0.617
AspCys: 0.617 ± 0.184
AspAsp: 3.427 ± 0.48
AspGlu: 3.084 ± 0.551
AspPhe: 2.125 ± 0.419
AspGly: 5.003 ± 0.631
AspHis: 0.822 ± 0.182
AspIle: 1.645 ± 0.356
AspLys: 2.056 ± 0.395
AspLeu: 4.249 ± 0.515
AspMet: 1.028 ± 0.24
AspAsn: 1.988 ± 0.408
AspPro: 2.879 ± 0.41
AspGln: 2.947 ± 0.547
AspArg: 3.496 ± 0.466
AspSer: 3.358 ± 0.596
AspThr: 3.29 ± 0.747
AspVal: 3.564 ± 0.436
AspTrp: 1.576 ± 0.315
AspTyr: 1.851 ± 0.33
AspXaa: 0.0 ± 0.0

Glu
GluAla: 6.511 ± 0.889
GluCys: 0.822 ± 0.248
GluAsp: 2.605 ± 0.447
GluGlu: 1.645 ± 0.431
GluPhe: 2.193 ± 0.442
GluGly: 3.084 ± 0.438
GluHis: 1.371 ± 0.311
GluIle: 2.262 ± 0.367
GluLys: 2.947 ± 0.469
GluLeu: 5.689 ± 0.531
GluMet: 1.371 ± 0.314
GluAsn: 2.056 ± 0.284
GluPro: 2.673 ± 0.487
GluGln: 2.605 ± 0.474
GluArg: 3.084 ± 0.582
GluSer: 3.221 ± 0.509
GluThr: 3.29 ± 0.415
GluVal: 4.044 ± 0.712
GluTrp: 0.96 ± 0.287
GluTyr: 1.576 ± 0.285
GluXaa: 0.0 ± 0.0

Phe
PheAla: 3.29 ± 0.539
PheCys: 0.343 ± 0.152
PheAsp: 2.193 ± 0.344
PheGlu: 2.193 ± 0.382
PhePhe: 0.48 ± 0.159
PheGly: 2.742 ± 0.338
PheHis: 0.411 ± 0.137
PheIle: 1.302 ± 0.329
PheLys: 1.576 ± 0.376
PheLeu: 2.399 ± 0.362
PheMet: 0.548 ± 0.193
PheAsn: 1.714 ± 0.355
PhePro: 1.371 ± 0.333
PheGln: 1.439 ± 0.242
PheArg: 1.508 ± 0.282
PheSer: 2.262 ± 0.35
PheThr: 2.33 ± 0.434
PheVal: 1.782 ± 0.323
PheTrp: 0.411 ± 0.236
PheTyr: 0.685 ± 0.223
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 8.362 ± 1.078
GlyCys: 1.028 ± 0.349
GlyAsp: 4.798 ± 0.5
GlyGlu: 3.975 ± 0.443
GlyPhe: 2.879 ± 0.332
GlyGly: 5.894 ± 0.671
GlyHis: 1.165 ± 0.272
GlyIle: 3.633 ± 0.471
GlyLys: 4.387 ± 0.69
GlyLeu: 6.717 ± 0.722
GlyMet: 1.988 ± 0.423
GlyAsn: 2.879 ± 0.422
GlyPro: 3.153 ± 0.545
GlyGln: 3.084 ± 0.417
GlyArg: 3.907 ± 0.623
GlySer: 5.141 ± 0.69
GlyThr: 6.1 ± 0.763
GlyVal: 7.128 ± 0.702
GlyTrp: 1.576 ± 0.355
GlyTyr: 3.496 ± 0.509
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.714 ± 0.396
HisCys: 0.274 ± 0.177
HisAsp: 1.028 ± 0.271
HisGlu: 1.302 ± 0.236
HisPhe: 0.206 ± 0.112
HisGly: 1.097 ± 0.367
HisHis: 0.891 ± 0.246
HisIle: 0.617 ± 0.254
HisLys: 0.48 ± 0.138
HisLeu: 1.714 ± 0.381
HisMet: 0.48 ± 0.181
HisAsn: 0.48 ± 0.184
HisPro: 1.302 ± 0.293
HisGln: 0.343 ± 0.206
HisArg: 1.508 ± 0.38
HisSer: 0.96 ± 0.228
HisThr: 0.96 ± 0.261
HisVal: 1.919 ± 0.381
HisTrp: 0.48 ± 0.221
HisTyr: 0.411 ± 0.196
HisXaa: 0.0 ± 0.0

Ile
IleAla: 5.003 ± 0.606
IleCys: 0.617 ± 0.189
IleAsp: 2.742 ± 0.349
IleGlu: 2.947 ± 0.454
IlePhe: 1.097 ± 0.218
IleGly: 3.633 ± 0.46
IleHis: 0.754 ± 0.267
IleIle: 1.919 ± 0.509
IleLys: 1.439 ± 0.266
IleLeu: 3.153 ± 0.663
IleMet: 0.822 ± 0.204
IleAsn: 1.919 ± 0.311
IlePro: 1.988 ± 0.337
IleGln: 1.714 ± 0.385
IleArg: 2.399 ± 0.399
IleSer: 3.084 ± 0.386
IleThr: 3.358 ± 0.473
IleVal: 2.056 ± 0.401
IleTrp: 0.411 ± 0.181
IleTyr: 1.097 ± 0.222
IleXaa: 0.0 ± 0.0

Lys
LysAla: 4.592 ± 0.571
LysCys: 0.617 ± 0.197
LysAsp: 2.056 ± 0.524
LysGlu: 2.33 ± 0.576
LysPhe: 1.371 ± 0.336
LysGly: 3.221 ± 0.481
LysHis: 0.891 ± 0.311
LysIle: 2.193 ± 0.366
LysLys: 2.673 ± 0.522
LysLeu: 3.564 ± 0.618
LysMet: 0.96 ± 0.24
LysAsn: 1.576 ± 0.288
LysPro: 1.988 ± 0.513
LysGln: 1.508 ± 0.312
LysArg: 2.399 ± 0.441
LysSer: 2.81 ± 0.521
LysThr: 2.399 ± 0.531
LysVal: 2.467 ± 0.368
LysTrp: 0.548 ± 0.156
LysTyr: 1.234 ± 0.305
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 8.842 ± 0.953
LeuCys: 1.371 ± 0.384
LeuAsp: 4.798 ± 0.472
LeuGlu: 4.661 ± 0.616
LeuPhe: 1.439 ± 0.355
LeuGly: 5.689 ± 0.564
LeuHis: 2.193 ± 0.373
LeuIle: 4.935 ± 0.495
LeuLys: 3.427 ± 0.741
LeuLeu: 7.128 ± 0.548
LeuMet: 2.193 ± 0.319
LeuAsn: 3.221 ± 0.473
LeuPro: 5.757 ± 0.841
LeuGln: 2.879 ± 0.494
LeuArg: 6.511 ± 0.638
LeuSer: 4.935 ± 0.742
LeuThr: 6.717 ± 0.852
LeuVal: 4.935 ± 0.459
LeuTrp: 1.165 ± 0.302
LeuTyr: 1.714 ± 0.344
LeuXaa: 0.0 ± 0.0

Met
MetAla: 3.153 ± 0.447
MetCys: 0.411 ± 0.168
MetAsp: 0.822 ± 0.242
MetGlu: 0.822 ± 0.211
MetPhe: 0.891 ± 0.293
MetGly: 1.165 ± 0.27
MetHis: 0.274 ± 0.159
MetIle: 1.165 ± 0.318
MetLys: 0.891 ± 0.279
MetLeu: 1.988 ± 0.326
MetMet: 0.411 ± 0.142
MetAsn: 0.617 ± 0.239
MetPro: 1.988 ± 0.266
MetGln: 1.028 ± 0.301
MetArg: 1.988 ± 0.333
MetSer: 2.262 ± 0.471
MetThr: 1.919 ± 0.336
MetVal: 0.617 ± 0.182
MetTrp: 0.274 ± 0.126
MetTyr: 0.48 ± 0.174
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 4.524 ± 0.572
AsnCys: 0.48 ± 0.182
AsnAsp: 1.988 ± 0.352
AsnGlu: 1.782 ± 0.355
AsnPhe: 0.822 ± 0.216
AsnGly: 2.879 ± 0.488
AsnHis: 0.411 ± 0.182
AsnIle: 1.234 ± 0.384
AsnLys: 1.028 ± 0.228
AsnLeu: 3.153 ± 0.349
AsnMet: 0.822 ± 0.261
AsnAsn: 1.576 ± 0.329
AsnPro: 2.262 ± 0.407
AsnGln: 1.165 ± 0.278
AsnArg: 2.262 ± 0.469
AsnSer: 2.605 ± 0.453
AsnThr: 2.81 ± 0.411
AsnVal: 2.536 ± 0.521
AsnTrp: 0.685 ± 0.163
AsnTyr: 0.891 ± 0.224
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 6.58 ± 1.177
ProCys: 0.411 ± 0.174
ProAsp: 3.221 ± 0.576
ProGlu: 3.427 ± 0.517
ProPhe: 1.439 ± 0.373
ProGly: 5.141 ± 0.58
ProHis: 1.028 ± 0.31
ProIle: 2.125 ± 0.354
ProLys: 1.371 ± 0.491
ProLeu: 4.249 ± 0.67
ProMet: 0.822 ± 0.188
ProAsn: 2.056 ± 0.5
ProPro: 2.125 ± 0.606
ProGln: 1.782 ± 0.364
ProArg: 3.016 ± 0.518
ProSer: 3.564 ± 0.484
ProThr: 3.084 ± 0.4
ProVal: 3.77 ± 0.526
ProTrp: 0.685 ± 0.231
ProTyr: 1.508 ± 0.275
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 5.072 ± 0.68
GlnCys: 0.411 ± 0.161
GlnAsp: 1.302 ± 0.31
GlnGlu: 1.508 ± 0.367
GlnPhe: 1.439 ± 0.284
GlnGly: 2.605 ± 0.428
GlnHis: 0.891 ± 0.272
GlnIle: 1.165 ± 0.257
GlnLys: 1.919 ± 0.354
GlnLeu: 3.907 ± 0.589
GlnMet: 1.234 ± 0.274
GlnAsn: 0.685 ± 0.191
GlnPro: 2.399 ± 0.437
GlnGln: 1.782 ± 0.608
GlnArg: 3.221 ± 0.759
GlnSer: 1.714 ± 0.371
GlnThr: 2.193 ± 0.437
GlnVal: 3.153 ± 0.433
GlnTrp: 0.822 ± 0.25
GlnTyr: 1.988 ± 0.402
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 7.608 ± 0.673
ArgCys: 0.343 ± 0.173
ArgAsp: 3.564 ± 0.458
ArgGlu: 3.496 ± 0.423
ArgPhe: 2.399 ± 0.317
ArgGly: 4.524 ± 0.577
ArgHis: 1.302 ± 0.294
ArgIle: 2.742 ± 0.404
ArgLys: 2.947 ± 0.425
ArgLeu: 4.524 ± 0.585
ArgMet: 1.576 ± 0.335
ArgAsn: 2.673 ± 0.403
ArgPro: 1.782 ± 0.281
ArgGln: 2.947 ± 0.68
ArgArg: 4.249 ± 0.597
ArgSer: 3.975 ± 0.494
ArgThr: 3.084 ± 0.436
ArgVal: 4.592 ± 0.55
ArgTrp: 1.645 ± 0.343
ArgTyr: 1.919 ± 0.405
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 8.156 ± 0.834
SerCys: 1.097 ± 0.363
SerAsp: 2.742 ± 0.315
SerGlu: 3.77 ± 0.509
SerPhe: 2.262 ± 0.335
SerGly: 6.854 ± 0.759
SerHis: 0.685 ± 0.213
SerIle: 2.81 ± 0.467
SerLys: 2.879 ± 0.547
SerLeu: 4.112 ± 0.549
SerMet: 1.576 ± 0.258
SerAsn: 2.399 ± 0.399
SerPro: 2.879 ± 0.449
SerGln: 2.193 ± 0.426
SerArg: 2.947 ± 0.395
SerSer: 3.633 ± 0.472
SerThr: 4.249 ± 0.593
SerVal: 5.552 ± 0.705
SerTrp: 1.097 ± 0.284
SerTyr: 2.193 ± 0.443
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 7.676 ± 0.856
ThrCys: 0.96 ± 0.253
ThrAsp: 3.701 ± 0.57
ThrGlu: 3.975 ± 0.475
ThrPhe: 2.056 ± 0.535
ThrGly: 8.088 ± 0.755
ThrHis: 1.234 ± 0.316
ThrIle: 2.605 ± 0.414
ThrLys: 1.302 ± 0.349
ThrLeu: 5.62 ± 0.753
ThrMet: 1.645 ± 0.306
ThrAsn: 1.851 ± 0.366
ThrPro: 4.592 ± 0.449
ThrGln: 2.673 ± 0.401
ThrArg: 3.153 ± 0.415
ThrSer: 3.838 ± 0.452
ThrThr: 5.072 ± 0.757
ThrVal: 5.278 ± 0.499
ThrTrp: 1.439 ± 0.287
ThrTyr: 2.193 ± 0.407
ThrXaa: 0.0 ± 0.0

Val
ValAla: 7.745 ± 0.837
ValCys: 1.097 ± 0.281
ValAsp: 4.661 ± 0.502
ValGlu: 3.633 ± 0.437
ValPhe: 2.399 ± 0.482
ValGly: 4.798 ± 0.554
ValHis: 0.754 ± 0.234
ValIle: 3.564 ± 0.499
ValLys: 2.879 ± 0.374
ValLeu: 6.237 ± 0.883
ValMet: 1.439 ± 0.359
ValAsn: 2.879 ± 0.529
ValPro: 2.536 ± 0.358
ValGln: 3.084 ± 0.532
ValArg: 3.907 ± 0.481
ValSer: 3.77 ± 0.521
ValThr: 6.169 ± 0.794
ValVal: 4.866 ± 0.595
ValTrp: 1.097 ± 0.298
ValTyr: 1.302 ± 0.279
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.919 ± 0.35
TrpCys: 0.343 ± 0.155
TrpAsp: 1.508 ± 0.253
TrpGlu: 0.48 ± 0.159
TrpPhe: 0.96 ± 0.187
TrpGly: 1.097 ± 0.293
TrpHis: 0.411 ± 0.168
TrpIle: 0.48 ± 0.144
TrpLys: 0.891 ± 0.245
TrpLeu: 1.782 ± 0.393
TrpMet: 0.411 ± 0.157
TrpAsn: 0.411 ± 0.157
TrpPro: 1.028 ± 0.307
TrpGln: 0.685 ± 0.169
TrpArg: 1.508 ± 0.407
TrpSer: 1.097 ± 0.281
TrpThr: 1.028 ± 0.251
TrpVal: 1.508 ± 0.33
TrpTrp: 0.617 ± 0.204
TrpTyr: 0.48 ± 0.159
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 3.564 ± 0.527
TyrCys: 0.343 ± 0.132
TyrAsp: 1.782 ± 0.323
TyrGlu: 1.508 ± 0.318
TyrPhe: 1.165 ± 0.271
TyrGly: 2.879 ± 0.392
TyrHis: 0.48 ± 0.244
TyrIle: 0.822 ± 0.246
TyrLys: 1.028 ± 0.185
TyrLeu: 2.262 ± 0.364
TyrMet: 0.48 ± 0.14
TyrAsn: 0.891 ± 0.241
TyrPro: 1.028 ± 0.239
TyrGln: 0.96 ± 0.26
TyrArg: 2.673 ± 0.413
TyrSer: 2.193 ± 0.458
TyrThr: 1.988 ± 0.406
TyrVal: 1.919 ± 0.436
TyrTrp: 0.48 ± 0.154
TyrTyr: 0.411 ± 0.15
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 63 proteins (14591 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
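
One way a protein-level bootstrap could look is sketched below: whole proteins are resampled with replacement, the frequency is recomputed for each of 100 replicates, and the spread of the replicates is reported. The page does not say which spread statistic is used, so taking the sample standard deviation is an assumption, as are the function names, the fixed seed, and the toy sequences.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Frequency (per mille) of one dipeptide, e.g. 'AA', over all proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, not individual residues.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)


# Toy input, not the MR14 proteome.
proteins = ["MAAL", "MAGAA", "MKLV", "MAAAG"]
err = bootstrap_error(proteins, "AA")
print(pair_frequency(proteins, "AA"), err)
```

Resampling at the protein level keeps each protein intact, so the estimate reflects variation between proteins rather than between individual positions.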

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski