Amino acid dipepetide frequency for Pseudomonas phage MP42

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
19.559AlaAla: 19.559 ± 3.331
1.166AlaCys: 1.166 ± 0.407
8.703AlaAsp: 8.703 ± 0.829
9.062AlaGlu: 9.062 ± 1.502
2.512AlaPhe: 2.512 ± 0.496
9.331AlaGly: 9.331 ± 1.57
1.615AlaHis: 1.615 ± 0.379
6.639AlaIle: 6.639 ± 0.765
4.396AlaLys: 4.396 ± 0.737
13.458AlaLeu: 13.458 ± 1.203
3.05AlaMet: 3.05 ± 0.505
3.05AlaAsn: 3.05 ± 0.706
6.011AlaPro: 6.011 ± 0.772
6.101AlaGln: 6.101 ± 1.024
10.318AlaArg: 10.318 ± 1.225
7.895AlaSer: 7.895 ± 1.039
6.46AlaThr: 6.46 ± 0.586
6.729AlaVal: 6.729 ± 0.738
2.422AlaTrp: 2.422 ± 0.364
2.422AlaTyr: 2.422 ± 0.402
0.0AlaXaa: 0.0 ± 0.0
Cys
0.807CysAla: 0.807 ± 0.309
0.0CysCys: 0.0 ± 0.0
0.807CysAsp: 0.807 ± 0.263
0.628CysGlu: 0.628 ± 0.303
0.449CysPhe: 0.449 ± 0.21
0.628CysGly: 0.628 ± 0.243
0.359CysHis: 0.359 ± 0.187
0.359CysIle: 0.359 ± 0.172
0.09CysLys: 0.09 ± 0.085
0.359CysLeu: 0.359 ± 0.174
0.179CysMet: 0.179 ± 0.124
0.538CysAsn: 0.538 ± 0.294
0.628CysPro: 0.628 ± 0.317
0.09CysGln: 0.09 ± 0.098
1.077CysArg: 1.077 ± 0.377
0.359CysSer: 0.359 ± 0.184
0.538CysThr: 0.538 ± 0.218
0.179CysVal: 0.179 ± 0.128
0.269CysTrp: 0.269 ± 0.164
0.269CysTyr: 0.269 ± 0.176
0.0CysXaa: 0.0 ± 0.0
Asp
7.895AspAla: 7.895 ± 0.948
0.269AspCys: 0.269 ± 0.175
3.05AspAsp: 3.05 ± 0.564
3.14AspGlu: 3.14 ± 0.623
1.705AspPhe: 1.705 ± 0.307
6.011AspGly: 6.011 ± 0.771
1.346AspHis: 1.346 ± 0.447
2.243AspIle: 2.243 ± 0.323
1.346AspLys: 1.346 ± 0.345
5.293AspLeu: 5.293 ± 0.783
1.525AspMet: 1.525 ± 0.509
1.435AspAsn: 1.435 ± 0.343
3.14AspPro: 3.14 ± 0.585
2.512AspGln: 2.512 ± 0.479
3.32AspArg: 3.32 ± 0.498
3.409AspSer: 3.409 ± 0.486
3.32AspThr: 3.32 ± 0.497
3.499AspVal: 3.499 ± 0.612
1.435AspTrp: 1.435 ± 0.294
1.525AspTyr: 1.525 ± 0.468
0.0AspXaa: 0.0 ± 0.0
Glu
7.267GluAla: 7.267 ± 0.845
0.449GluCys: 0.449 ± 0.285
2.692GluAsp: 2.692 ± 0.582
2.781GluGlu: 2.781 ± 0.542
1.525GluPhe: 1.525 ± 0.332
3.678GluGly: 3.678 ± 0.518
0.897GluHis: 0.897 ± 0.268
2.961GluIle: 2.961 ± 0.529
2.692GluLys: 2.692 ± 0.519
6.998GluLeu: 6.998 ± 0.728
1.974GluMet: 1.974 ± 0.424
2.064GluAsn: 2.064 ± 0.652
2.602GluPro: 2.602 ± 0.568
4.217GluGln: 4.217 ± 0.706
4.576GluArg: 4.576 ± 0.882
2.781GluSer: 2.781 ± 0.446
3.499GluThr: 3.499 ± 0.438
4.396GluVal: 4.396 ± 0.64
1.166GluTrp: 1.166 ± 0.282
1.794GluTyr: 1.794 ± 0.323
0.0GluXaa: 0.0 ± 0.0
Phe
3.23PheAla: 3.23 ± 0.568
0.359PheCys: 0.359 ± 0.169
2.781PheAsp: 2.781 ± 0.483
1.346PheGlu: 1.346 ± 0.376
0.628PhePhe: 0.628 ± 0.21
2.871PheGly: 2.871 ± 0.498
0.359PheHis: 0.359 ± 0.181
0.897PheIle: 0.897 ± 0.29
0.807PheLys: 0.807 ± 0.219
2.333PheLeu: 2.333 ± 0.407
0.718PheMet: 0.718 ± 0.212
0.987PheAsn: 0.987 ± 0.243
1.435PhePro: 1.435 ± 0.403
1.077PheGln: 1.077 ± 0.356
1.794PheArg: 1.794 ± 0.462
1.077PheSer: 1.077 ± 0.266
1.435PheThr: 1.435 ± 0.401
1.256PheVal: 1.256 ± 0.265
0.269PheTrp: 0.269 ± 0.165
0.897PheTyr: 0.897 ± 0.279
0.0PheXaa: 0.0 ± 0.0
Gly
8.075GlyAla: 8.075 ± 1.279
0.628GlyCys: 0.628 ± 0.241
3.589GlyAsp: 3.589 ± 0.762
4.576GlyGlu: 4.576 ± 0.52
2.602GlyPhe: 2.602 ± 0.459
6.37GlyGly: 6.37 ± 0.684
0.897GlyHis: 0.897 ± 0.261
3.499GlyIle: 3.499 ± 0.527
2.871GlyLys: 2.871 ± 0.489
7.805GlyLeu: 7.805 ± 1.05
1.435GlyMet: 1.435 ± 0.386
2.422GlyAsn: 2.422 ± 0.468
2.781GlyPro: 2.781 ± 0.418
5.473GlyGln: 5.473 ± 0.504
6.639GlyArg: 6.639 ± 0.711
4.486GlySer: 4.486 ± 0.753
3.589GlyThr: 3.589 ± 0.528
4.665GlyVal: 4.665 ± 0.578
1.256GlyTrp: 1.256 ± 0.259
2.153GlyTyr: 2.153 ± 0.645
0.0GlyXaa: 0.0 ± 0.0
His
2.064HisAla: 2.064 ± 0.355
0.179HisCys: 0.179 ± 0.14
0.897HisAsp: 0.897 ± 0.265
0.897HisGlu: 0.897 ± 0.321
0.449HisPhe: 0.449 ± 0.163
1.525HisGly: 1.525 ± 0.41
0.179HisHis: 0.179 ± 0.134
0.897HisIle: 0.897 ± 0.259
0.179HisLys: 0.179 ± 0.129
1.794HisLeu: 1.794 ± 0.436
0.987HisMet: 0.987 ± 0.314
0.987HisAsn: 0.987 ± 0.31
1.435HisPro: 1.435 ± 0.361
0.987HisGln: 0.987 ± 0.313
1.166HisArg: 1.166 ± 0.364
0.538HisSer: 0.538 ± 0.206
0.897HisThr: 0.897 ± 0.264
0.628HisVal: 0.628 ± 0.23
0.179HisTrp: 0.179 ± 0.128
0.807HisTyr: 0.807 ± 0.359
0.0HisXaa: 0.0 ± 0.0
Ile
4.665IleAla: 4.665 ± 0.883
0.628IleCys: 0.628 ± 0.231
3.23IleAsp: 3.23 ± 0.51
2.871IleGlu: 2.871 ± 0.493
0.987IlePhe: 0.987 ± 0.204
3.14IleGly: 3.14 ± 0.394
0.987IleHis: 0.987 ± 0.249
1.884IleIle: 1.884 ± 0.572
1.884IleLys: 1.884 ± 0.427
2.692IleLeu: 2.692 ± 0.392
0.718IleMet: 0.718 ± 0.273
1.256IleAsn: 1.256 ± 0.324
2.064IlePro: 2.064 ± 0.461
1.435IleGln: 1.435 ± 0.509
4.396IleArg: 4.396 ± 0.522
2.243IleSer: 2.243 ± 0.469
3.409IleThr: 3.409 ± 0.643
2.692IleVal: 2.692 ± 0.475
0.449IleTrp: 0.449 ± 0.17
0.987IleTyr: 0.987 ± 0.271
0.0IleXaa: 0.0 ± 0.0
Lys
5.204LysAla: 5.204 ± 0.719
0.09LysCys: 0.09 ± 0.102
0.987LysAsp: 0.987 ± 0.343
2.064LysGlu: 2.064 ± 0.552
0.538LysPhe: 0.538 ± 0.182
2.422LysGly: 2.422 ± 0.407
0.718LysHis: 0.718 ± 0.252
0.987LysIle: 0.987 ± 0.373
1.615LysLys: 1.615 ± 0.405
3.14LysLeu: 3.14 ± 0.568
0.449LysMet: 0.449 ± 0.2
0.897LysAsn: 0.897 ± 0.207
2.333LysPro: 2.333 ± 0.677
1.077LysGln: 1.077 ± 0.257
3.32LysArg: 3.32 ± 0.733
2.602LysSer: 2.602 ± 0.497
2.064LysThr: 2.064 ± 0.477
2.512LysVal: 2.512 ± 0.436
0.269LysTrp: 0.269 ± 0.115
0.628LysTyr: 0.628 ± 0.222
0.0LysXaa: 0.0 ± 0.0
Leu
15.611LeuAla: 15.611 ± 1.399
0.897LeuCys: 0.897 ± 0.23
6.191LeuAsp: 6.191 ± 0.632
7.177LeuGlu: 7.177 ± 0.764
2.422LeuPhe: 2.422 ± 0.551
7.267LeuGly: 7.267 ± 0.647
2.422LeuHis: 2.422 ± 0.531
3.32LeuIle: 3.32 ± 0.587
3.499LeuLys: 3.499 ± 0.733
9.42LeuLeu: 9.42 ± 1.04
1.705LeuMet: 1.705 ± 0.39
2.961LeuAsn: 2.961 ± 0.492
4.935LeuPro: 4.935 ± 0.838
3.589LeuGln: 3.589 ± 0.527
7.267LeuArg: 7.267 ± 0.74
4.576LeuSer: 4.576 ± 0.744
4.665LeuThr: 4.665 ± 0.659
7.177LeuVal: 7.177 ± 0.728
1.077LeuTrp: 1.077 ± 0.356
2.333LeuTyr: 2.333 ± 0.492
0.0LeuXaa: 0.0 ± 0.0
Met
3.948MetAla: 3.948 ± 0.623
0.09MetCys: 0.09 ± 0.094
2.333MetAsp: 2.333 ± 0.464
1.166MetGlu: 1.166 ± 0.327
0.628MetPhe: 0.628 ± 0.231
1.615MetGly: 1.615 ± 0.327
0.359MetHis: 0.359 ± 0.224
0.718MetIle: 0.718 ± 0.227
0.718MetLys: 0.718 ± 0.217
1.615MetLeu: 1.615 ± 0.335
0.449MetMet: 0.449 ± 0.218
0.538MetAsn: 0.538 ± 0.255
1.077MetPro: 1.077 ± 0.357
0.987MetGln: 0.987 ± 0.263
1.705MetArg: 1.705 ± 0.303
1.615MetSer: 1.615 ± 0.371
1.166MetThr: 1.166 ± 0.251
0.628MetVal: 0.628 ± 0.234
0.269MetTrp: 0.269 ± 0.201
0.179MetTyr: 0.179 ± 0.128
0.0MetXaa: 0.0 ± 0.0
Asn
3.32AsnAla: 3.32 ± 0.809
0.09AsnCys: 0.09 ± 0.078
1.435AsnAsp: 1.435 ± 0.386
1.525AsnGlu: 1.525 ± 0.401
0.718AsnPhe: 0.718 ± 0.324
2.871AsnGly: 2.871 ± 0.499
0.718AsnHis: 0.718 ± 0.245
0.628AsnIle: 0.628 ± 0.232
0.807AsnLys: 0.807 ± 0.297
3.32AsnLeu: 3.32 ± 0.584
0.628AsnMet: 0.628 ± 0.275
1.346AsnAsn: 1.346 ± 0.441
2.692AsnPro: 2.692 ± 0.6
0.807AsnGln: 0.807 ± 0.293
2.781AsnArg: 2.781 ± 0.397
1.974AsnSer: 1.974 ± 0.453
1.166AsnThr: 1.166 ± 0.265
1.435AsnVal: 1.435 ± 0.268
0.718AsnTrp: 0.718 ± 0.221
0.628AsnTyr: 0.628 ± 0.227
0.0AsnXaa: 0.0 ± 0.0
Pro
7.985ProAla: 7.985 ± 0.936
0.628ProCys: 0.628 ± 0.343
3.409ProAsp: 3.409 ± 0.655
3.05ProGlu: 3.05 ± 0.542
1.615ProPhe: 1.615 ± 0.406
3.948ProGly: 3.948 ± 0.658
0.987ProHis: 0.987 ± 0.524
1.884ProIle: 1.884 ± 0.348
1.794ProLys: 1.794 ± 0.469
4.665ProLeu: 4.665 ± 0.647
0.718ProMet: 0.718 ± 0.295
1.615ProAsn: 1.615 ± 0.418
2.243ProPro: 2.243 ± 0.658
2.422ProGln: 2.422 ± 0.495
2.961ProArg: 2.961 ± 0.629
3.589ProSer: 3.589 ± 0.434
2.243ProThr: 2.243 ± 0.55
2.871ProVal: 2.871 ± 0.557
0.538ProTrp: 0.538 ± 0.224
1.256ProTyr: 1.256 ± 0.445
0.0ProXaa: 0.0 ± 0.0
Gln
5.832GlnAla: 5.832 ± 1.152
0.179GlnCys: 0.179 ± 0.131
1.705GlnAsp: 1.705 ± 0.393
1.794GlnGlu: 1.794 ± 0.449
1.615GlnPhe: 1.615 ± 0.349
3.678GlnGly: 3.678 ± 0.583
0.987GlnHis: 0.987 ± 0.273
2.602GlnIle: 2.602 ± 0.394
1.077GlnLys: 1.077 ± 0.419
6.46GlnLeu: 6.46 ± 0.62
1.077GlnMet: 1.077 ± 0.419
1.077GlnAsn: 1.077 ± 0.263
2.602GlnPro: 2.602 ± 0.436
2.961GlnGln: 2.961 ± 0.852
3.589GlnArg: 3.589 ± 0.527
2.871GlnSer: 2.871 ± 0.462
2.064GlnThr: 2.064 ± 0.371
4.396GlnVal: 4.396 ± 0.607
0.807GlnTrp: 0.807 ± 0.267
0.628GlnTyr: 0.628 ± 0.242
0.0GlnXaa: 0.0 ± 0.0
Arg
8.972ArgAla: 8.972 ± 0.87
0.718ArgCys: 0.718 ± 0.271
4.127ArgAsp: 4.127 ± 0.641
5.473ArgGlu: 5.473 ± 0.725
2.153ArgPhe: 2.153 ± 0.427
4.306ArgGly: 4.306 ± 0.64
2.064ArgHis: 2.064 ± 0.458
3.589ArgIle: 3.589 ± 0.57
2.692ArgLys: 2.692 ± 0.443
8.164ArgLeu: 8.164 ± 0.818
1.615ArgMet: 1.615 ± 0.352
1.794ArgAsn: 1.794 ± 0.479
3.589ArgPro: 3.589 ± 0.742
4.845ArgGln: 4.845 ± 0.608
6.37ArgArg: 6.37 ± 0.874
4.217ArgSer: 4.217 ± 0.694
3.409ArgThr: 3.409 ± 0.608
3.948ArgVal: 3.948 ± 0.686
1.615ArgTrp: 1.615 ± 0.447
3.05ArgTyr: 3.05 ± 0.458
0.0ArgXaa: 0.0 ± 0.0
Ser
7.805SerAla: 7.805 ± 0.892
0.718SerCys: 0.718 ± 0.236
3.409SerAsp: 3.409 ± 0.589
3.14SerGlu: 3.14 ± 0.431
1.705SerPhe: 1.705 ± 0.363
4.396SerGly: 4.396 ± 0.627
0.987SerHis: 0.987 ± 0.333
2.781SerIle: 2.781 ± 0.555
2.064SerLys: 2.064 ± 0.381
5.563SerLeu: 5.563 ± 1.05
0.807SerMet: 0.807 ± 0.271
1.435SerAsn: 1.435 ± 0.349
3.32SerPro: 3.32 ± 0.626
2.243SerGln: 2.243 ± 0.366
4.037SerArg: 4.037 ± 0.699
4.576SerSer: 4.576 ± 0.856
3.499SerThr: 3.499 ± 0.723
3.948SerVal: 3.948 ± 0.564
1.525SerTrp: 1.525 ± 0.364
1.525SerTyr: 1.525 ± 0.417
0.0SerXaa: 0.0 ± 0.0
Thr
6.639ThrAla: 6.639 ± 0.884
0.449ThrCys: 0.449 ± 0.286
2.871ThrAsp: 2.871 ± 0.484
2.692ThrGlu: 2.692 ± 0.483
0.897ThrPhe: 0.897 ± 0.262
4.935ThrGly: 4.935 ± 0.848
0.538ThrHis: 0.538 ± 0.228
2.153ThrIle: 2.153 ± 0.406
1.615ThrLys: 1.615 ± 0.391
5.563ThrLeu: 5.563 ± 0.709
1.077ThrMet: 1.077 ± 0.365
1.525ThrAsn: 1.525 ± 0.406
2.064ThrPro: 2.064 ± 0.315
1.256ThrGln: 1.256 ± 0.279
3.32ThrArg: 3.32 ± 0.386
3.589ThrSer: 3.589 ± 0.665
3.858ThrThr: 3.858 ± 0.588
5.921ThrVal: 5.921 ± 0.867
0.897ThrTrp: 0.897 ± 0.264
1.525ThrTyr: 1.525 ± 0.365
0.0ThrXaa: 0.0 ± 0.0
Val
7.447ValAla: 7.447 ± 0.852
0.359ValCys: 0.359 ± 0.177
3.32ValAsp: 3.32 ± 0.413
5.024ValGlu: 5.024 ± 0.753
1.615ValPhe: 1.615 ± 0.38
3.948ValGly: 3.948 ± 0.584
0.718ValHis: 0.718 ± 0.241
2.512ValIle: 2.512 ± 0.439
2.243ValLys: 2.243 ± 0.485
6.011ValLeu: 6.011 ± 0.814
1.256ValMet: 1.256 ± 0.351
2.512ValAsn: 2.512 ± 0.441
2.961ValPro: 2.961 ± 0.503
3.589ValGln: 3.589 ± 0.605
4.845ValArg: 4.845 ± 0.599
3.948ValSer: 3.948 ± 0.607
3.948ValThr: 3.948 ± 0.625
3.948ValVal: 3.948 ± 0.687
0.987ValTrp: 0.987 ± 0.262
2.961ValTyr: 2.961 ± 0.556
0.0ValXaa: 0.0 ± 0.0
Trp
1.615TrpAla: 1.615 ± 0.395
0.359TrpCys: 0.359 ± 0.165
0.449TrpAsp: 0.449 ± 0.192
0.987TrpGlu: 0.987 ± 0.301
0.807TrpPhe: 0.807 ± 0.274
0.718TrpGly: 0.718 ± 0.267
0.09TrpHis: 0.09 ± 0.078
0.987TrpIle: 0.987 ± 0.29
0.807TrpLys: 0.807 ± 0.224
1.525TrpLeu: 1.525 ± 0.401
0.987TrpMet: 0.987 ± 0.356
0.359TrpAsn: 0.359 ± 0.163
0.807TrpPro: 0.807 ± 0.317
0.987TrpGln: 0.987 ± 0.227
1.435TrpArg: 1.435 ± 0.352
1.346TrpSer: 1.346 ± 0.341
0.718TrpThr: 0.718 ± 0.255
1.346TrpVal: 1.346 ± 0.371
0.538TrpTrp: 0.538 ± 0.216
0.269TrpTyr: 0.269 ± 0.158
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.961TyrAla: 2.961 ± 0.455
0.359TyrCys: 0.359 ± 0.156
1.435TyrAsp: 1.435 ± 0.301
1.615TyrGlu: 1.615 ± 0.407
0.987TyrPhe: 0.987 ± 0.257
2.153TyrGly: 2.153 ± 0.515
0.449TyrHis: 0.449 ± 0.281
1.077TyrIle: 1.077 ± 0.334
0.718TyrLys: 0.718 ± 0.275
2.333TyrLeu: 2.333 ± 0.433
0.359TyrMet: 0.359 ± 0.153
0.807TyrAsn: 0.807 ± 0.229
1.794TyrPro: 1.794 ± 0.561
1.346TyrGln: 1.346 ± 0.381
1.794TyrArg: 1.794 ± 0.386
1.794TyrSer: 1.794 ± 0.473
1.435TyrThr: 1.435 ± 0.342
1.974TyrVal: 1.974 ± 0.375
0.449TyrTrp: 0.449 ± 0.23
0.449TyrTyr: 0.449 ± 0.215
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 53 proteins (11147 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski