Amino acid dipeptide frequency for Pseudomonas phage MR1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins uniformly at random, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
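The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the method used to build the table: the function name `dipeptide_permille`, the toy sequences, the use of overlapping windows that never span two proteins, and the mapping of every non-standard letter to `X` (Xaa) are all choices made here for the example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"           # plus Xaa for anything non-standard or unknown

# Uniform expectation for the 400 standard dipeptides: 0.25 % = 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs.

    Assumption: dipeptides are overlapping windows of length 2 counted
    inside each protein, never across the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map every letter outside the standard 20 to 'X' (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


# Hypothetical toy input, not taken from the phage proteome.
toy_proteins = ["MKKLAAG", "MAAXK"]
freqs = dipeptide_permille(toy_proteins)
for pair in ("AA", "KK", "AX", "WW"):
    print(pair, round(freqs[pair], 3), "per mille; uniform expectation:", UNIFORM_PERMILLE)
```

With a real proteome, `toy_proteins` would be replaced by the list of protein sequences, and each value could then be compared with `UNIFORM_PERMILLE` to decide whether a dipeptide is over- or underrepresented.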

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
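As a small worked example of the unit conversion, the sketch below turns two values from the table (AlaAla and CysCys) into fractions and percentages and compares them with the uniform expectation of 2.5‰. The helper name `describe` and the three labels are assumptions made for this illustration; the page itself only states that values above or below the expectation are coloured.

```python
# Uniform expectation for 400 equally likely dipeptides: 0.25 % = 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / 400


def describe(permille):
    """Convert a per-mille value and compare it with the uniform expectation."""
    fraction = permille * 1e-3   # per mille -> fraction
    percent = permille / 10.0    # per mille -> percent
    if permille > UNIFORM_PERMILLE:
        label = "overrepresented"
    elif permille < UNIFORM_PERMILLE:
        label = "underrepresented"
    else:
        label = "as expected"
    return fraction, percent, label


# Values copied from the table: AlaAla = 10.208, CysCys = 0.0 (both in per mille).
for name, value in [("AlaAla", 10.208), ("CysCys", 0.0)]:
    fraction, percent, label = describe(value)
    print(f"{name}: {fraction:.6f} ({percent:.4f} %), {label}")
```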

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 10.208 ± 1.287
AlaCys: 0.402 ± 0.182
AlaAsp: 6.189 ± 0.701
AlaGlu: 4.582 ± 0.566
AlaPhe: 3.054 ± 0.62
AlaGly: 7.958 ± 0.891
AlaHis: 1.688 ± 0.421
AlaIle: 5.144 ± 0.681
AlaLys: 5.948 ± 0.437
AlaLeu: 6.993 ± 0.834
AlaMet: 2.894 ± 0.5
AlaAsn: 4.421 ± 0.613
AlaPro: 3.858 ± 0.546
AlaGln: 5.225 ± 0.989
AlaArg: 5.787 ± 0.729
AlaSer: 7.315 ± 1.058
AlaThr: 5.466 ± 0.844
AlaVal: 6.591 ± 0.894
AlaTrp: 1.206 ± 0.343
AlaTyr: 2.492 ± 0.476
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.286 ± 0.274
CysCys: 0.0 ± 0.0
CysAsp: 0.482 ± 0.258
CysGlu: 0.482 ± 0.197
CysPhe: 0.723 ± 0.255
CysGly: 0.402 ± 0.163
CysHis: 0.322 ± 0.132
CysIle: 0.322 ± 0.151
CysLys: 0.804 ± 0.305
CysLeu: 0.804 ± 0.266
CysMet: 0.08 ± 0.079
CysAsn: 0.08 ± 0.08
CysPro: 0.482 ± 0.195
CysGln: 0.402 ± 0.147
CysArg: 0.965 ± 0.324
CysSer: 0.563 ± 0.211
CysThr: 0.482 ± 0.237
CysVal: 0.241 ± 0.125
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.948 ± 0.728
AspCys: 0.563 ± 0.2
AspAsp: 5.385 ± 0.764
AspGlu: 3.858 ± 0.71
AspPhe: 2.894 ± 0.45
AspGly: 6.27 ± 0.561
AspHis: 0.884 ± 0.253
AspIle: 3.296 ± 0.559
AspLys: 3.537 ± 0.428
AspLeu: 5.064 ± 0.658
AspMet: 2.17 ± 0.337
AspAsn: 2.411 ± 0.373
AspPro: 2.894 ± 0.69
AspGln: 2.411 ± 0.311
AspArg: 3.296 ± 0.494
AspSer: 3.537 ± 0.515
AspThr: 3.054 ± 0.474
AspVal: 4.099 ± 0.527
AspTrp: 0.965 ± 0.322
AspTyr: 2.009 ± 0.382
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.993 ± 0.838
GluCys: 0.723 ± 0.244
GluAsp: 3.939 ± 0.411
GluGlu: 3.135 ± 0.709
GluPhe: 2.492 ± 0.476
GluGly: 6.189 ± 0.891
GluHis: 1.929 ± 0.413
GluIle: 2.411 ± 0.363
GluLys: 2.974 ± 0.433
GluLeu: 4.421 ± 0.636
GluMet: 2.09 ± 0.381
GluAsn: 2.572 ± 0.417
GluPro: 1.849 ± 0.452
GluGln: 3.135 ± 0.622
GluArg: 4.18 ± 0.458
GluSer: 3.215 ± 0.564
GluThr: 3.778 ± 0.604
GluVal: 4.582 ± 0.753
GluTrp: 0.965 ± 0.307
GluTyr: 2.251 ± 0.366
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.572 ± 0.362
PheCys: 0.643 ± 0.227
PheAsp: 2.331 ± 0.433
PheGlu: 2.009 ± 0.343
PhePhe: 1.206 ± 0.404
PheGly: 2.974 ± 0.468
PheHis: 0.723 ± 0.233
PheIle: 1.206 ± 0.324
PheLys: 2.09 ± 0.418
PheLeu: 2.411 ± 0.481
PheMet: 1.447 ± 0.353
PheAsn: 2.492 ± 0.415
PhePro: 1.849 ± 0.396
PheGln: 1.045 ± 0.308
PheArg: 2.009 ± 0.392
PheSer: 2.17 ± 0.444
PheThr: 3.054 ± 0.413
PheVal: 2.492 ± 0.405
PheTrp: 0.402 ± 0.188
PheTyr: 0.482 ± 0.173
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.636 ± 0.977
GlyCys: 0.723 ± 0.217
GlyAsp: 4.582 ± 0.531
GlyGlu: 5.707 ± 0.758
GlyPhe: 3.135 ± 0.325
GlyGly: 6.752 ± 0.781
GlyHis: 1.125 ± 0.306
GlyIle: 4.823 ± 0.678
GlyLys: 3.939 ± 0.693
GlyLeu: 6.671 ± 0.966
GlyMet: 1.849 ± 0.405
GlyAsn: 3.135 ± 0.5
GlyPro: 2.331 ± 0.362
GlyGln: 3.537 ± 0.529
GlyArg: 3.697 ± 0.557
GlySer: 5.707 ± 0.997
GlyThr: 5.546 ± 0.75
GlyVal: 4.742 ± 0.666
GlyTrp: 1.447 ± 0.409
GlyTyr: 2.894 ± 0.528
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.045 ± 0.282
HisCys: 0.643 ± 0.265
HisAsp: 1.206 ± 0.283
HisGlu: 0.804 ± 0.269
HisPhe: 0.643 ± 0.194
HisGly: 1.929 ± 0.413
HisHis: 0.322 ± 0.172
HisIle: 1.045 ± 0.264
HisLys: 0.884 ± 0.251
HisLeu: 1.849 ± 0.372
HisMet: 1.045 ± 0.265
HisAsn: 0.804 ± 0.249
HisPro: 0.723 ± 0.214
HisGln: 0.643 ± 0.174
HisArg: 0.804 ± 0.261
HisSer: 1.045 ± 0.278
HisThr: 0.723 ± 0.225
HisVal: 1.366 ± 0.349
HisTrp: 0.402 ± 0.173
HisTyr: 0.482 ± 0.178
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.099 ± 0.405
IleCys: 0.482 ± 0.193
IleAsp: 3.617 ± 0.375
IleGlu: 3.456 ± 0.479
IlePhe: 1.045 ± 0.281
IleGly: 3.296 ± 0.493
IleHis: 1.206 ± 0.44
IleIle: 2.251 ± 0.332
IleLys: 2.974 ± 0.45
IleLeu: 4.662 ± 0.633
IleMet: 1.045 ± 0.268
IleAsn: 2.411 ± 0.467
IlePro: 1.608 ± 0.335
IleGln: 2.411 ± 0.406
IleArg: 2.894 ± 0.489
IleSer: 2.331 ± 0.331
IleThr: 2.572 ± 0.523
IleVal: 3.617 ± 0.609
IleTrp: 0.643 ± 0.21
IleTyr: 1.608 ± 0.322
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.189 ± 0.746
LysCys: 0.241 ± 0.137
LysAsp: 3.215 ± 0.628
LysGlu: 3.778 ± 0.514
LysPhe: 1.849 ± 0.345
LysGly: 4.742 ± 0.689
LysHis: 1.527 ± 0.452
LysIle: 2.733 ± 0.484
LysLys: 3.054 ± 0.614
LysLeu: 4.582 ± 0.537
LysMet: 1.447 ± 0.386
LysAsn: 1.688 ± 0.346
LysPro: 2.411 ± 0.438
LysGln: 2.894 ± 0.686
LysArg: 3.215 ± 0.583
LysSer: 2.411 ± 0.437
LysThr: 2.894 ± 0.568
LysVal: 4.662 ± 0.651
LysTrp: 0.804 ± 0.251
LysTyr: 1.447 ± 0.325
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.199 ± 0.869
LeuCys: 0.563 ± 0.217
LeuAsp: 4.903 ± 0.534
LeuGlu: 4.823 ± 0.631
LeuPhe: 2.09 ± 0.431
LeuGly: 4.742 ± 0.656
LeuHis: 1.688 ± 0.378
LeuIle: 4.18 ± 0.582
LeuLys: 5.787 ± 0.575
LeuLeu: 4.421 ± 0.609
LeuMet: 2.251 ± 0.337
LeuAsn: 4.26 ± 0.672
LeuPro: 2.894 ± 0.401
LeuGln: 4.421 ± 0.629
LeuArg: 5.707 ± 0.849
LeuSer: 5.627 ± 0.601
LeuThr: 4.742 ± 0.65
LeuVal: 4.984 ± 0.643
LeuTrp: 1.125 ± 0.295
LeuTyr: 2.17 ± 0.402
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.456 ± 0.353
MetCys: 0.241 ± 0.144
MetAsp: 2.572 ± 0.393
MetGlu: 1.768 ± 0.391
MetPhe: 0.884 ± 0.323
MetGly: 2.492 ± 0.404
MetHis: 0.563 ± 0.213
MetIle: 2.09 ± 0.283
MetLys: 1.608 ± 0.418
MetLeu: 2.653 ± 0.421
MetMet: 0.482 ± 0.191
MetAsn: 1.206 ± 0.377
MetPro: 1.366 ± 0.307
MetGln: 0.884 ± 0.245
MetArg: 0.884 ± 0.267
MetSer: 2.251 ± 0.384
MetThr: 2.009 ± 0.346
MetVal: 1.366 ± 0.328
MetTrp: 0.402 ± 0.174
MetTyr: 0.161 ± 0.117
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.778 ± 0.461
AsnCys: 0.402 ± 0.172
AsnAsp: 2.009 ± 0.324
AsnGlu: 2.974 ± 0.638
AsnPhe: 2.17 ± 0.433
AsnGly: 4.742 ± 0.783
AsnHis: 0.643 ± 0.2
AsnIle: 1.688 ± 0.383
AsnLys: 1.125 ± 0.371
AsnLeu: 3.215 ± 0.443
AsnMet: 0.965 ± 0.301
AsnAsn: 2.17 ± 0.541
AsnPro: 2.492 ± 0.382
AsnGln: 1.849 ± 0.396
AsnArg: 2.251 ± 0.461
AsnSer: 3.456 ± 0.767
AsnThr: 2.572 ± 0.446
AsnVal: 2.572 ± 0.829
AsnTrp: 0.884 ± 0.275
AsnTyr: 1.768 ± 0.359
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.894 ± 0.491
ProCys: 0.322 ± 0.126
ProAsp: 3.778 ± 0.692
ProGlu: 4.099 ± 0.727
ProPhe: 1.366 ± 0.221
ProGly: 2.09 ± 0.354
ProHis: 0.563 ± 0.154
ProIle: 1.286 ± 0.318
ProLys: 2.411 ± 0.487
ProLeu: 3.296 ± 0.71
ProMet: 0.884 ± 0.263
ProAsn: 2.411 ± 0.545
ProPro: 0.723 ± 0.222
ProGln: 1.125 ± 0.271
ProArg: 2.653 ± 0.561
ProSer: 2.009 ± 0.432
ProThr: 2.411 ± 0.431
ProVal: 3.054 ± 0.497
ProTrp: 0.402 ± 0.187
ProTyr: 1.447 ± 0.353
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.984 ± 0.708
GlnCys: 0.402 ± 0.146
GlnAsp: 2.411 ± 0.557
GlnGlu: 3.376 ± 0.757
GlnPhe: 1.929 ± 0.379
GlnGly: 3.054 ± 0.563
GlnHis: 0.723 ± 0.171
GlnIle: 1.688 ± 0.476
GlnLys: 2.17 ± 0.307
GlnLeu: 4.662 ± 0.571
GlnMet: 1.527 ± 0.316
GlnAsn: 1.447 ± 0.429
GlnPro: 1.125 ± 0.229
GlnGln: 1.527 ± 0.538
GlnArg: 3.858 ± 0.721
GlnSer: 2.974 ± 0.539
GlnThr: 2.17 ± 0.399
GlnVal: 2.492 ± 0.382
GlnTrp: 0.563 ± 0.204
GlnTyr: 1.366 ± 0.465
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.984 ± 0.483
ArgCys: 0.563 ± 0.236
ArgAsp: 3.617 ± 0.492
ArgGlu: 3.858 ± 0.601
ArgPhe: 2.17 ± 0.426
ArgGly: 4.18 ± 0.522
ArgHis: 1.045 ± 0.37
ArgIle: 2.894 ± 0.424
ArgLys: 3.939 ± 0.715
ArgLeu: 5.627 ± 0.582
ArgMet: 1.688 ± 0.339
ArgAsn: 2.492 ± 0.389
ArgPro: 1.849 ± 0.348
ArgGln: 2.653 ± 0.355
ArgArg: 3.135 ± 0.436
ArgSer: 4.662 ± 0.705
ArgThr: 3.537 ± 0.537
ArgVal: 3.054 ± 0.509
ArgTrp: 0.563 ± 0.175
ArgTyr: 2.251 ± 0.242
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.511 ± 1.055
SerCys: 0.563 ± 0.222
SerAsp: 4.421 ± 0.562
SerGlu: 3.617 ± 0.362
SerPhe: 2.009 ± 0.411
SerGly: 5.787 ± 0.815
SerHis: 1.125 ± 0.238
SerIle: 3.296 ± 0.463
SerLys: 3.296 ± 0.436
SerLeu: 4.662 ± 0.731
SerMet: 1.929 ± 0.348
SerAsn: 2.492 ± 0.573
SerPro: 2.653 ± 0.427
SerGln: 3.537 ± 0.572
SerArg: 3.296 ± 0.507
SerSer: 3.537 ± 0.517
SerThr: 3.215 ± 0.674
SerVal: 3.376 ± 0.523
SerTrp: 1.045 ± 0.243
SerTyr: 2.653 ± 0.509
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.823 ± 0.623
ThrCys: 0.241 ± 0.167
ThrAsp: 4.501 ± 0.623
ThrGlu: 4.019 ± 0.468
ThrPhe: 2.894 ± 0.483
ThrGly: 4.662 ± 0.665
ThrHis: 0.804 ± 0.3
ThrIle: 3.456 ± 0.573
ThrLys: 3.537 ± 0.592
ThrLeu: 4.582 ± 0.645
ThrMet: 1.366 ± 0.358
ThrAsn: 2.009 ± 0.391
ThrPro: 2.894 ± 0.404
ThrGln: 2.492 ± 0.406
ThrArg: 2.492 ± 0.551
ThrSer: 3.858 ± 0.711
ThrThr: 3.456 ± 0.751
ThrVal: 4.26 ± 0.453
ThrTrp: 1.045 ± 0.235
ThrTyr: 1.206 ± 0.273
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.913 ± 0.997
ValCys: 0.723 ± 0.246
ValAsp: 2.733 ± 0.425
ValGlu: 4.421 ± 0.631
ValPhe: 1.527 ± 0.311
ValGly: 4.501 ± 0.64
ValHis: 0.643 ± 0.205
ValIle: 2.894 ± 0.536
ValLys: 3.215 ± 0.509
ValLeu: 5.305 ± 0.633
ValMet: 2.331 ± 0.401
ValAsn: 2.974 ± 0.45
ValPro: 3.778 ± 0.678
ValGln: 2.411 ± 0.431
ValArg: 4.823 ± 0.603
ValSer: 4.099 ± 0.438
ValThr: 4.099 ± 0.542
ValVal: 4.823 ± 0.784
ValTrp: 1.125 ± 0.34
ValTyr: 2.331 ± 0.53
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.688 ± 0.431
TrpCys: 0.0 ± 0.0
TrpAsp: 0.884 ± 0.266
TrpGlu: 0.723 ± 0.211
TrpPhe: 0.482 ± 0.166
TrpGly: 0.804 ± 0.197
TrpHis: 0.482 ± 0.175
TrpIle: 0.322 ± 0.181
TrpLys: 0.723 ± 0.216
TrpLeu: 1.527 ± 0.284
TrpMet: 0.482 ± 0.137
TrpAsn: 0.723 ± 0.222
TrpPro: 0.161 ± 0.11
TrpGln: 0.643 ± 0.229
TrpArg: 0.884 ± 0.208
TrpSer: 0.884 ± 0.284
TrpThr: 0.884 ± 0.227
TrpVal: 1.608 ± 0.411
TrpTrp: 0.161 ± 0.11
TrpTyr: 0.563 ± 0.221
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.296 ± 0.619
TyrCys: 0.482 ± 0.222
TyrAsp: 1.768 ± 0.35
TyrGlu: 2.331 ± 0.439
TyrPhe: 1.045 ± 0.297
TyrGly: 2.411 ± 0.367
TyrHis: 0.402 ± 0.189
TyrIle: 1.206 ± 0.301
TyrLys: 1.768 ± 0.395
TyrLeu: 2.17 ± 0.388
TyrMet: 1.286 ± 0.236
TyrAsn: 1.527 ± 0.279
TyrPro: 1.286 ± 0.326
TyrGln: 1.206 ± 0.21
TyrArg: 1.929 ± 0.285
TyrSer: 1.366 ± 0.388
TyrThr: 1.929 ± 0.402
TyrVal: 1.688 ± 0.407
TyrTrp: 0.482 ± 0.206
TyrTyr: 0.723 ± 0.219
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 50 proteins (12442 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
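A protein-level bootstrap of this kind could look roughly like the sketch below. It is an illustration under assumptions, not the database's actual procedure: the note above only says that 100 bootstrap rounds were run at the protein level, so the choice of the sample standard deviation as the reported error, the fixed random seed, the within-protein window counting, and the names `pair_permille` and `bootstrap_error` are all hypothetical.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Frequency of one dipeptide (e.g. 'AA') in per mille.

    Assumption: overlapping windows counted within each protein only.
    """
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Resample whole proteins with replacement and return the spread.

    Each round draws len(proteins) proteins with replacement, recomputes
    the dipeptide frequency, and the sample standard deviation of the
    n_rounds estimates is returned (assumed error definition).
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome, not the phage data.
toy_proteins = ["MKKLAAG", "MAAXK", "MGLAAAK", "MKV"]
value = pair_permille(toy_proteins, "AA")
error = bootstrap_error(toy_proteins, "AA")
print(f"AA: {value:.3f} ± {error:.3f} per mille")
```

Resampling whole proteins rather than individual dipeptides keeps the dipeptides of one protein together, so the estimate reflects variation between proteins. A dipeptide that never occurs gets an error of exactly zero under this scheme, which matches the `0.0 ± 0.0` entries in the table.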

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski