Amino acid dipepetide frequency for Pseudomonas phage B3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
17.402AlaAla: 17.402 ± 3.124
0.392AlaCys: 0.392 ± 0.157
7.447AlaAsp: 7.447 ± 0.602
9.328AlaGlu: 9.328 ± 1.178
2.665AlaPhe: 2.665 ± 0.452
11.837AlaGly: 11.837 ± 1.049
2.116AlaHis: 2.116 ± 0.427
5.644AlaIle: 5.644 ± 0.449
5.017AlaLys: 5.017 ± 0.881
10.19AlaLeu: 10.19 ± 0.946
3.292AlaMet: 3.292 ± 0.491
3.214AlaAsn: 3.214 ± 0.546
5.487AlaPro: 5.487 ± 0.678
6.585AlaGln: 6.585 ± 0.771
7.996AlaArg: 7.996 ± 0.662
5.801AlaSer: 5.801 ± 0.795
6.82AlaThr: 6.82 ± 0.66
6.036AlaVal: 6.036 ± 0.758
2.195AlaTrp: 2.195 ± 0.381
3.057AlaTyr: 3.057 ± 0.419
0.0AlaXaa: 0.0 ± 0.0
Cys
0.705CysAla: 0.705 ± 0.226
0.235CysCys: 0.235 ± 0.135
0.549CysAsp: 0.549 ± 0.22
0.392CysGlu: 0.392 ± 0.149
0.157CysPhe: 0.157 ± 0.105
0.705CysGly: 0.705 ± 0.265
0.314CysHis: 0.314 ± 0.198
0.392CysIle: 0.392 ± 0.188
0.314CysLys: 0.314 ± 0.14
0.705CysLeu: 0.705 ± 0.262
0.157CysMet: 0.157 ± 0.118
0.314CysAsn: 0.314 ± 0.15
0.549CysPro: 0.549 ± 0.284
0.392CysGln: 0.392 ± 0.167
0.627CysArg: 0.627 ± 0.223
0.549CysSer: 0.549 ± 0.308
0.314CysThr: 0.314 ± 0.142
0.705CysVal: 0.705 ± 0.258
0.078CysTrp: 0.078 ± 0.068
0.314CysTyr: 0.314 ± 0.145
0.0CysXaa: 0.0 ± 0.0
Asp
6.349AspAla: 6.349 ± 0.71
0.549AspCys: 0.549 ± 0.2
3.057AspAsp: 3.057 ± 0.575
4.155AspGlu: 4.155 ± 0.636
1.881AspPhe: 1.881 ± 0.413
5.566AspGly: 5.566 ± 0.752
1.176AspHis: 1.176 ± 0.281
2.352AspIle: 2.352 ± 0.418
1.333AspLys: 1.333 ± 0.327
6.271AspLeu: 6.271 ± 0.626
0.47AspMet: 0.47 ± 0.177
1.019AspAsn: 1.019 ± 0.222
2.9AspPro: 2.9 ± 0.41
3.606AspGln: 3.606 ± 0.501
5.017AspArg: 5.017 ± 0.819
2.822AspSer: 2.822 ± 0.371
2.665AspThr: 2.665 ± 0.413
4.311AspVal: 4.311 ± 0.57
1.019AspTrp: 1.019 ± 0.258
1.489AspTyr: 1.489 ± 0.333
0.0AspXaa: 0.0 ± 0.0
Glu
7.212GluAla: 7.212 ± 1.032
0.784GluCys: 0.784 ± 0.209
3.136GluAsp: 3.136 ± 0.551
3.292GluGlu: 3.292 ± 0.541
2.43GluPhe: 2.43 ± 0.53
3.527GluGly: 3.527 ± 0.461
1.411GluHis: 1.411 ± 0.298
2.979GluIle: 2.979 ± 0.472
1.881GluLys: 1.881 ± 0.408
8.623GluLeu: 8.623 ± 0.979
1.097GluMet: 1.097 ± 0.371
1.254GluAsn: 1.254 ± 0.332
2.352GluPro: 2.352 ± 0.532
4.39GluGln: 4.39 ± 0.721
6.271GluArg: 6.271 ± 0.725
3.292GluSer: 3.292 ± 0.472
2.508GluThr: 2.508 ± 0.407
4.233GluVal: 4.233 ± 0.656
1.333GluTrp: 1.333 ± 0.294
1.489GluTyr: 1.489 ± 0.335
0.0GluXaa: 0.0 ± 0.0
Phe
3.371PheAla: 3.371 ± 0.532
0.392PheCys: 0.392 ± 0.19
1.881PheAsp: 1.881 ± 0.399
1.725PheGlu: 1.725 ± 0.37
0.941PhePhe: 0.941 ± 0.225
3.136PheGly: 3.136 ± 0.472
0.47PheHis: 0.47 ± 0.176
0.862PheIle: 0.862 ± 0.237
0.784PheLys: 0.784 ± 0.239
2.352PheLeu: 2.352 ± 0.418
0.314PheMet: 0.314 ± 0.129
0.705PheAsn: 0.705 ± 0.238
1.176PhePro: 1.176 ± 0.327
1.254PheGln: 1.254 ± 0.296
2.038PheArg: 2.038 ± 0.444
1.881PheSer: 1.881 ± 0.341
1.411PheThr: 1.411 ± 0.327
1.489PheVal: 1.489 ± 0.359
0.862PheTrp: 0.862 ± 0.298
0.784PheTyr: 0.784 ± 0.248
0.0PheXaa: 0.0 ± 0.0
Gly
8.623GlyAla: 8.623 ± 1.147
1.176GlyCys: 1.176 ± 0.292
4.86GlyAsp: 4.86 ± 0.576
3.684GlyGlu: 3.684 ± 0.405
2.587GlyPhe: 2.587 ± 0.419
6.271GlyGly: 6.271 ± 0.725
1.097GlyHis: 1.097 ± 0.32
3.449GlyIle: 3.449 ± 0.453
3.136GlyLys: 3.136 ± 0.501
7.996GlyLeu: 7.996 ± 0.921
1.568GlyMet: 1.568 ± 0.321
2.508GlyAsn: 2.508 ± 0.498
2.116GlyPro: 2.116 ± 0.466
3.527GlyGln: 3.527 ± 0.492
7.682GlyArg: 7.682 ± 0.731
4.311GlySer: 4.311 ± 0.54
4.233GlyThr: 4.233 ± 0.754
4.233GlyVal: 4.233 ± 0.44
2.116GlyTrp: 2.116 ± 0.349
1.96GlyTyr: 1.96 ± 0.396
0.0GlyXaa: 0.0 ± 0.0
His
2.508HisAla: 2.508 ± 0.467
0.0HisCys: 0.0 ± 0.0
0.941HisAsp: 0.941 ± 0.25
0.941HisGlu: 0.941 ± 0.31
0.549HisPhe: 0.549 ± 0.17
1.803HisGly: 1.803 ± 0.406
0.392HisHis: 0.392 ± 0.188
1.176HisIle: 1.176 ± 0.39
0.235HisLys: 0.235 ± 0.135
1.881HisLeu: 1.881 ± 0.393
0.157HisMet: 0.157 ± 0.119
0.549HisAsn: 0.549 ± 0.187
1.411HisPro: 1.411 ± 0.364
0.941HisGln: 0.941 ± 0.332
1.725HisArg: 1.725 ± 0.299
1.411HisSer: 1.411 ± 0.328
0.627HisThr: 0.627 ± 0.226
1.411HisVal: 1.411 ± 0.34
0.235HisTrp: 0.235 ± 0.118
0.47HisTyr: 0.47 ± 0.235
0.0HisXaa: 0.0 ± 0.0
Ile
4.703IleAla: 4.703 ± 0.586
0.392IleCys: 0.392 ± 0.167
3.057IleAsp: 3.057 ± 0.578
4.155IleGlu: 4.155 ± 0.554
0.705IlePhe: 0.705 ± 0.238
2.9IleGly: 2.9 ± 0.549
0.941IleHis: 0.941 ± 0.247
1.489IleIle: 1.489 ± 0.29
0.862IleLys: 0.862 ± 0.213
3.371IleLeu: 3.371 ± 0.602
0.47IleMet: 0.47 ± 0.185
1.568IleAsn: 1.568 ± 0.37
2.116IlePro: 2.116 ± 0.416
2.195IleGln: 2.195 ± 0.434
3.684IleArg: 3.684 ± 0.608
2.744IleSer: 2.744 ± 0.493
3.527IleThr: 3.527 ± 0.545
2.665IleVal: 2.665 ± 0.45
0.705IleTrp: 0.705 ± 0.317
1.254IleTyr: 1.254 ± 0.322
0.0IleXaa: 0.0 ± 0.0
Lys
6.036LysAla: 6.036 ± 0.879
0.235LysCys: 0.235 ± 0.152
1.96LysAsp: 1.96 ± 0.387
1.646LysGlu: 1.646 ± 0.432
1.097LysPhe: 1.097 ± 0.276
2.195LysGly: 2.195 ± 0.469
0.784LysHis: 0.784 ± 0.303
0.862LysIle: 0.862 ± 0.283
1.489LysLys: 1.489 ± 0.312
3.763LysLeu: 3.763 ± 0.519
0.705LysMet: 0.705 ± 0.244
1.019LysAsn: 1.019 ± 0.248
1.568LysPro: 1.568 ± 0.434
1.411LysGln: 1.411 ± 0.438
2.979LysArg: 2.979 ± 0.55
1.568LysSer: 1.568 ± 0.363
2.038LysThr: 2.038 ± 0.5
1.96LysVal: 1.96 ± 0.418
0.47LysTrp: 0.47 ± 0.21
0.47LysTyr: 0.47 ± 0.163
0.0LysXaa: 0.0 ± 0.0
Leu
12.229LeuAla: 12.229 ± 0.933
1.333LeuCys: 1.333 ± 0.357
6.977LeuAsp: 6.977 ± 0.783
5.487LeuGlu: 5.487 ± 0.741
2.116LeuPhe: 2.116 ± 0.503
7.133LeuGly: 7.133 ± 0.754
2.744LeuHis: 2.744 ± 0.479
5.958LeuIle: 5.958 ± 0.721
3.606LeuLys: 3.606 ± 0.644
9.407LeuLeu: 9.407 ± 0.913
1.881LeuMet: 1.881 ± 0.349
3.606LeuAsn: 3.606 ± 0.521
5.566LeuPro: 5.566 ± 0.739
6.349LeuGln: 6.349 ± 0.676
8.074LeuArg: 8.074 ± 0.793
5.722LeuSer: 5.722 ± 0.567
4.782LeuThr: 4.782 ± 0.596
7.447LeuVal: 7.447 ± 0.593
0.705LeuTrp: 0.705 ± 0.287
1.96LeuTyr: 1.96 ± 0.394
0.0LeuXaa: 0.0 ± 0.0
Met
2.9MetAla: 2.9 ± 0.539
0.078MetCys: 0.078 ± 0.075
1.411MetAsp: 1.411 ± 0.36
0.784MetGlu: 0.784 ± 0.266
0.549MetPhe: 0.549 ± 0.207
1.568MetGly: 1.568 ± 0.365
0.078MetHis: 0.078 ± 0.085
0.705MetIle: 0.705 ± 0.217
0.627MetLys: 0.627 ± 0.208
1.254MetLeu: 1.254 ± 0.258
0.392MetMet: 0.392 ± 0.199
1.254MetAsn: 1.254 ± 0.412
1.333MetPro: 1.333 ± 0.269
1.097MetGln: 1.097 ± 0.28
0.627MetArg: 0.627 ± 0.243
1.489MetSer: 1.489 ± 0.262
1.489MetThr: 1.489 ± 0.374
0.862MetVal: 0.862 ± 0.262
0.235MetTrp: 0.235 ± 0.131
0.392MetTyr: 0.392 ± 0.195
0.0MetXaa: 0.0 ± 0.0
Asn
3.214AsnAla: 3.214 ± 0.42
0.235AsnCys: 0.235 ± 0.168
1.568AsnAsp: 1.568 ± 0.418
1.803AsnGlu: 1.803 ± 0.369
1.097AsnPhe: 1.097 ± 0.24
2.352AsnGly: 2.352 ± 0.489
0.627AsnHis: 0.627 ± 0.227
1.646AsnIle: 1.646 ± 0.436
0.941AsnLys: 0.941 ± 0.206
2.43AsnLeu: 2.43 ± 0.466
0.784AsnMet: 0.784 ± 0.194
0.941AsnAsn: 0.941 ± 0.263
1.881AsnPro: 1.881 ± 0.526
2.038AsnGln: 2.038 ± 0.457
1.96AsnArg: 1.96 ± 0.34
1.176AsnSer: 1.176 ± 0.342
1.646AsnThr: 1.646 ± 0.377
1.489AsnVal: 1.489 ± 0.291
0.549AsnTrp: 0.549 ± 0.202
0.627AsnTyr: 0.627 ± 0.211
0.0AsnXaa: 0.0 ± 0.0
Pro
7.133ProAla: 7.133 ± 0.784
0.47ProCys: 0.47 ± 0.179
2.43ProAsp: 2.43 ± 0.462
3.527ProGlu: 3.527 ± 0.538
1.881ProPhe: 1.881 ± 0.395
4.311ProGly: 4.311 ± 0.544
1.019ProHis: 1.019 ± 0.28
1.725ProIle: 1.725 ± 0.332
1.725ProLys: 1.725 ± 0.355
3.763ProLeu: 3.763 ± 0.622
0.705ProMet: 0.705 ± 0.232
1.725ProAsn: 1.725 ± 0.486
2.587ProPro: 2.587 ± 0.516
1.568ProGln: 1.568 ± 0.33
3.527ProArg: 3.527 ± 0.427
2.9ProSer: 2.9 ± 0.43
2.038ProThr: 2.038 ± 0.322
4.076ProVal: 4.076 ± 0.684
0.784ProTrp: 0.784 ± 0.253
1.646ProTyr: 1.646 ± 0.418
0.0ProXaa: 0.0 ± 0.0
Gln
7.29GlnAla: 7.29 ± 0.825
0.235GlnCys: 0.235 ± 0.142
2.587GlnAsp: 2.587 ± 0.606
3.841GlnGlu: 3.841 ± 0.686
1.411GlnPhe: 1.411 ± 0.323
2.587GlnGly: 2.587 ± 0.342
0.627GlnHis: 0.627 ± 0.2
2.508GlnIle: 2.508 ± 0.398
1.646GlnLys: 1.646 ± 0.387
8.231GlnLeu: 8.231 ± 0.804
1.176GlnMet: 1.176 ± 0.261
1.019GlnAsn: 1.019 ± 0.303
2.508GlnPro: 2.508 ± 0.428
2.9GlnGln: 2.9 ± 0.819
4.39GlnArg: 4.39 ± 0.662
2.195GlnSer: 2.195 ± 0.309
2.665GlnThr: 2.665 ± 0.375
3.449GlnVal: 3.449 ± 0.464
0.627GlnTrp: 0.627 ± 0.217
1.019GlnTyr: 1.019 ± 0.291
0.0GlnXaa: 0.0 ± 0.0
Arg
7.839ArgAla: 7.839 ± 0.866
0.627ArgCys: 0.627 ± 0.227
4.703ArgAsp: 4.703 ± 0.563
5.174ArgGlu: 5.174 ± 0.519
1.96ArgPhe: 1.96 ± 0.285
4.782ArgGly: 4.782 ± 0.702
1.568ArgHis: 1.568 ± 0.335
2.822ArgIle: 2.822 ± 0.722
3.371ArgLys: 3.371 ± 0.687
9.093ArgLeu: 9.093 ± 0.601
1.568ArgMet: 1.568 ± 0.351
2.038ArgAsn: 2.038 ± 0.394
3.841ArgPro: 3.841 ± 0.634
4.625ArgGln: 4.625 ± 0.668
6.898ArgArg: 6.898 ± 0.902
3.763ArgSer: 3.763 ± 0.57
3.214ArgThr: 3.214 ± 0.521
5.33ArgVal: 5.33 ± 0.64
1.489ArgTrp: 1.489 ± 0.421
2.665ArgTyr: 2.665 ± 0.454
0.0ArgXaa: 0.0 ± 0.0
Ser
7.76SerAla: 7.76 ± 0.794
0.47SerCys: 0.47 ± 0.182
3.214SerAsp: 3.214 ± 0.444
2.43SerGlu: 2.43 ± 0.43
1.646SerPhe: 1.646 ± 0.327
4.782SerGly: 4.782 ± 0.658
0.47SerHis: 0.47 ± 0.197
1.725SerIle: 1.725 ± 0.318
1.96SerLys: 1.96 ± 0.472
6.349SerLeu: 6.349 ± 0.691
0.784SerMet: 0.784 ± 0.268
1.489SerAsn: 1.489 ± 0.327
2.979SerPro: 2.979 ± 0.564
1.646SerGln: 1.646 ± 0.222
3.684SerArg: 3.684 ± 0.568
3.763SerSer: 3.763 ± 0.498
4.155SerThr: 4.155 ± 0.536
4.155SerVal: 4.155 ± 0.555
0.627SerTrp: 0.627 ± 0.253
1.568SerTyr: 1.568 ± 0.333
0.0SerXaa: 0.0 ± 0.0
Thr
6.506ThrAla: 6.506 ± 0.781
0.314ThrCys: 0.314 ± 0.155
2.352ThrAsp: 2.352 ± 0.455
3.919ThrGlu: 3.919 ± 0.573
1.333ThrPhe: 1.333 ± 0.287
4.547ThrGly: 4.547 ± 0.684
1.411ThrHis: 1.411 ± 0.355
1.725ThrIle: 1.725 ± 0.458
1.646ThrLys: 1.646 ± 0.379
6.428ThrLeu: 6.428 ± 0.803
0.862ThrMet: 0.862 ± 0.263
1.568ThrAsn: 1.568 ± 0.311
3.371ThrPro: 3.371 ± 0.444
2.508ThrGln: 2.508 ± 0.524
3.057ThrArg: 3.057 ± 0.361
4.076ThrSer: 4.076 ± 0.488
3.136ThrThr: 3.136 ± 0.433
3.449ThrVal: 3.449 ± 0.485
0.627ThrTrp: 0.627 ± 0.251
1.568ThrTyr: 1.568 ± 0.343
0.0ThrXaa: 0.0 ± 0.0
Val
6.193ValAla: 6.193 ± 0.742
0.314ValCys: 0.314 ± 0.164
3.449ValAsp: 3.449 ± 0.495
5.566ValGlu: 5.566 ± 0.59
1.725ValPhe: 1.725 ± 0.393
4.155ValGly: 4.155 ± 0.538
1.254ValHis: 1.254 ± 0.324
3.449ValIle: 3.449 ± 0.442
2.587ValLys: 2.587 ± 0.42
6.585ValLeu: 6.585 ± 0.647
1.176ValMet: 1.176 ± 0.269
1.96ValAsn: 1.96 ± 0.37
3.606ValPro: 3.606 ± 0.482
3.371ValGln: 3.371 ± 0.46
3.449ValArg: 3.449 ± 0.346
3.998ValSer: 3.998 ± 0.546
4.625ValThr: 4.625 ± 0.577
3.763ValVal: 3.763 ± 0.58
1.097ValTrp: 1.097 ± 0.292
1.097ValTyr: 1.097 ± 0.282
0.0ValXaa: 0.0 ± 0.0
Trp
1.96TrpAla: 1.96 ± 0.477
0.0TrpCys: 0.0 ± 0.0
0.784TrpAsp: 0.784 ± 0.249
0.392TrpGlu: 0.392 ± 0.187
0.549TrpPhe: 0.549 ± 0.2
0.784TrpGly: 0.784 ± 0.243
0.392TrpHis: 0.392 ± 0.174
1.176TrpIle: 1.176 ± 0.305
0.392TrpLys: 0.392 ± 0.192
2.352TrpLeu: 2.352 ± 0.502
0.627TrpMet: 0.627 ± 0.23
0.392TrpAsn: 0.392 ± 0.138
0.862TrpPro: 0.862 ± 0.413
1.176TrpGln: 1.176 ± 0.262
1.333TrpArg: 1.333 ± 0.321
1.019TrpSer: 1.019 ± 0.287
1.254TrpThr: 1.254 ± 0.377
0.705TrpVal: 0.705 ± 0.209
0.47TrpTrp: 0.47 ± 0.208
0.078TrpTyr: 0.078 ± 0.083
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.587TyrAla: 2.587 ± 0.51
0.235TyrCys: 0.235 ± 0.147
1.646TyrAsp: 1.646 ± 0.281
1.333TyrGlu: 1.333 ± 0.308
0.549TyrPhe: 0.549 ± 0.189
2.195TyrGly: 2.195 ± 0.387
0.47TyrHis: 0.47 ± 0.256
0.862TyrIle: 0.862 ± 0.246
0.705TyrLys: 0.705 ± 0.208
2.038TyrLeu: 2.038 ± 0.377
0.862TyrMet: 0.862 ± 0.258
0.862TyrAsn: 0.862 ± 0.291
1.333TyrPro: 1.333 ± 0.507
1.333TyrGln: 1.333 ± 0.323
2.352TyrArg: 2.352 ± 0.404
1.176TyrSer: 1.176 ± 0.379
1.254TyrThr: 1.254 ± 0.245
1.646TyrVal: 1.646 ± 0.347
0.47TyrTrp: 0.47 ± 0.157
0.314TyrTyr: 0.314 ± 0.162
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 59 proteins (12758 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski