Amino acid dipepetide frequency for Pseudomonas phage tf

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.255AlaAla: 10.255 ± 1.321
0.977AlaCys: 0.977 ± 0.273
3.628AlaAsp: 3.628 ± 0.546
6.139AlaGlu: 6.139 ± 0.861
2.86AlaPhe: 2.86 ± 0.436
7.325AlaGly: 7.325 ± 1.034
1.046AlaHis: 1.046 ± 0.247
4.465AlaIle: 4.465 ± 0.53
6.418AlaLys: 6.418 ± 0.643
9.069AlaLeu: 9.069 ± 0.694
2.721AlaMet: 2.721 ± 0.375
4.046AlaAsn: 4.046 ± 0.632
3.139AlaPro: 3.139 ± 0.488
3.698AlaGln: 3.698 ± 0.817
5.232AlaArg: 5.232 ± 0.846
4.395AlaSer: 4.395 ± 0.838
4.883AlaThr: 4.883 ± 0.579
6.349AlaVal: 6.349 ± 0.689
1.395AlaTrp: 1.395 ± 0.317
2.791AlaTyr: 2.791 ± 0.492
0.0AlaXaa: 0.0 ± 0.0
Cys
0.837CysAla: 0.837 ± 0.23
0.209CysCys: 0.209 ± 0.12
0.558CysAsp: 0.558 ± 0.195
1.046CysGlu: 1.046 ± 0.291
0.419CysPhe: 0.419 ± 0.168
0.698CysGly: 0.698 ± 0.279
0.558CysHis: 0.558 ± 0.185
0.279CysIle: 0.279 ± 0.139
0.907CysLys: 0.907 ± 0.29
0.349CysLeu: 0.349 ± 0.151
0.279CysMet: 0.279 ± 0.127
0.558CysAsn: 0.558 ± 0.198
0.837CysPro: 0.837 ± 0.243
0.419CysGln: 0.419 ± 0.173
0.698CysArg: 0.698 ± 0.228
0.488CysSer: 0.488 ± 0.176
0.419CysThr: 0.419 ± 0.218
1.186CysVal: 1.186 ± 0.309
0.349CysTrp: 0.349 ± 0.135
0.209CysTyr: 0.209 ± 0.109
0.0CysXaa: 0.0 ± 0.0
Asp
4.744AspAla: 4.744 ± 0.613
0.558AspCys: 0.558 ± 0.191
3.418AspAsp: 3.418 ± 0.469
3.07AspGlu: 3.07 ± 0.51
2.302AspPhe: 2.302 ± 0.426
5.163AspGly: 5.163 ± 0.702
0.698AspHis: 0.698 ± 0.261
3.488AspIle: 3.488 ± 0.52
2.232AspLys: 2.232 ± 0.51
4.814AspLeu: 4.814 ± 0.602
1.186AspMet: 1.186 ± 0.265
2.302AspAsn: 2.302 ± 0.347
3.558AspPro: 3.558 ± 0.449
2.442AspGln: 2.442 ± 0.452
3.139AspArg: 3.139 ± 0.439
3.767AspSer: 3.767 ± 0.516
3.209AspThr: 3.209 ± 0.411
4.325AspVal: 4.325 ± 0.589
1.395AspTrp: 1.395 ± 0.329
1.395AspTyr: 1.395 ± 0.282
0.0AspXaa: 0.0 ± 0.0
Glu
7.465GluAla: 7.465 ± 1.007
0.698GluCys: 0.698 ± 0.204
3.279GluAsp: 3.279 ± 0.535
4.465GluGlu: 4.465 ± 0.968
3.0GluPhe: 3.0 ± 0.44
5.86GluGly: 5.86 ± 0.593
1.605GluHis: 1.605 ± 0.382
2.791GluIle: 2.791 ± 0.506
3.0GluLys: 3.0 ± 0.518
5.721GluLeu: 5.721 ± 0.829
1.953GluMet: 1.953 ± 0.324
2.651GluAsn: 2.651 ± 0.36
1.814GluPro: 1.814 ± 0.369
2.86GluGln: 2.86 ± 0.564
4.046GluArg: 4.046 ± 0.595
3.139GluSer: 3.139 ± 0.595
2.372GluThr: 2.372 ± 0.36
4.395GluVal: 4.395 ± 0.566
0.977GluTrp: 0.977 ± 0.228
2.302GluTyr: 2.302 ± 0.467
0.0GluXaa: 0.0 ± 0.0
Phe
3.279PheAla: 3.279 ± 0.592
0.349PheCys: 0.349 ± 0.16
2.442PheAsp: 2.442 ± 0.444
2.581PheGlu: 2.581 ± 0.387
1.046PhePhe: 1.046 ± 0.31
3.349PheGly: 3.349 ± 0.449
0.837PheHis: 0.837 ± 0.227
1.535PheIle: 1.535 ± 0.304
2.163PheLys: 2.163 ± 0.472
2.93PheLeu: 2.93 ± 0.47
0.767PheMet: 0.767 ± 0.304
1.116PheAsn: 1.116 ± 0.242
1.465PhePro: 1.465 ± 0.306
1.326PheGln: 1.326 ± 0.336
1.535PheArg: 1.535 ± 0.293
2.093PheSer: 2.093 ± 0.367
1.953PheThr: 1.953 ± 0.311
3.488PheVal: 3.488 ± 0.414
0.349PheTrp: 0.349 ± 0.162
0.977PheTyr: 0.977 ± 0.295
0.0PheXaa: 0.0 ± 0.0
Gly
7.116GlyAla: 7.116 ± 1.039
1.186GlyCys: 1.186 ± 0.286
5.163GlyAsp: 5.163 ± 0.523
3.907GlyGlu: 3.907 ± 0.522
2.023GlyPhe: 2.023 ± 0.385
7.744GlyGly: 7.744 ± 1.13
2.093GlyHis: 2.093 ± 0.42
4.256GlyIle: 4.256 ± 0.475
4.883GlyLys: 4.883 ± 0.786
6.209GlyLeu: 6.209 ± 0.759
1.744GlyMet: 1.744 ± 0.342
3.907GlyAsn: 3.907 ± 0.526
1.953GlyPro: 1.953 ± 0.44
3.558GlyGln: 3.558 ± 0.61
4.116GlyArg: 4.116 ± 0.594
5.651GlySer: 5.651 ± 0.928
5.86GlyThr: 5.86 ± 0.966
6.418GlyVal: 6.418 ± 0.707
1.605GlyTrp: 1.605 ± 0.304
3.418GlyTyr: 3.418 ± 0.461
0.0GlyXaa: 0.0 ± 0.0
His
1.116HisAla: 1.116 ± 0.271
0.14HisCys: 0.14 ± 0.087
1.046HisAsp: 1.046 ± 0.264
1.465HisGlu: 1.465 ± 0.342
0.558HisPhe: 0.558 ± 0.234
1.256HisGly: 1.256 ± 0.318
0.488HisHis: 0.488 ± 0.296
1.326HisIle: 1.326 ± 0.371
1.256HisLys: 1.256 ± 0.295
2.023HisLeu: 2.023 ± 0.311
0.907HisMet: 0.907 ± 0.253
0.767HisAsn: 0.767 ± 0.318
1.116HisPro: 1.116 ± 0.329
0.767HisGln: 0.767 ± 0.249
1.326HisArg: 1.326 ± 0.323
0.628HisSer: 0.628 ± 0.175
1.046HisThr: 1.046 ± 0.351
1.116HisVal: 1.116 ± 0.311
0.837HisTrp: 0.837 ± 0.233
1.046HisTyr: 1.046 ± 0.273
0.0HisXaa: 0.0 ± 0.0
Ile
3.628IleAla: 3.628 ± 0.471
0.628IleCys: 0.628 ± 0.235
3.209IleAsp: 3.209 ± 0.46
3.279IleGlu: 3.279 ± 0.495
1.605IlePhe: 1.605 ± 0.333
4.325IleGly: 4.325 ± 0.492
0.977IleHis: 0.977 ± 0.312
2.302IleIle: 2.302 ± 0.351
3.837IleLys: 3.837 ± 0.5
3.209IleLeu: 3.209 ± 0.365
1.395IleMet: 1.395 ± 0.364
1.814IleAsn: 1.814 ± 0.417
2.093IlePro: 2.093 ± 0.519
2.093IleGln: 2.093 ± 0.41
3.209IleArg: 3.209 ± 0.427
2.302IleSer: 2.302 ± 0.392
2.721IleThr: 2.721 ± 0.323
2.93IleVal: 2.93 ± 0.442
0.907IleTrp: 0.907 ± 0.231
1.953IleTyr: 1.953 ± 0.374
0.0IleXaa: 0.0 ± 0.0
Lys
5.651LysAla: 5.651 ± 0.671
0.698LysCys: 0.698 ± 0.205
3.628LysAsp: 3.628 ± 0.353
3.628LysGlu: 3.628 ± 0.476
1.674LysPhe: 1.674 ± 0.314
4.465LysGly: 4.465 ± 0.596
1.326LysHis: 1.326 ± 0.325
2.651LysIle: 2.651 ± 0.388
3.07LysLys: 3.07 ± 0.685
4.535LysLeu: 4.535 ± 0.631
1.256LysMet: 1.256 ± 0.259
2.232LysAsn: 2.232 ± 0.453
2.791LysPro: 2.791 ± 0.537
2.163LysGln: 2.163 ± 0.375
2.721LysArg: 2.721 ± 0.421
2.791LysSer: 2.791 ± 0.456
3.209LysThr: 3.209 ± 0.433
5.163LysVal: 5.163 ± 0.669
1.326LysTrp: 1.326 ± 0.284
2.232LysTyr: 2.232 ± 0.36
0.0LysXaa: 0.0 ± 0.0
Leu
7.255LeuAla: 7.255 ± 0.829
0.558LeuCys: 0.558 ± 0.182
4.116LeuAsp: 4.116 ± 0.386
6.0LeuGlu: 6.0 ± 0.761
2.791LeuPhe: 2.791 ± 0.469
7.325LeuGly: 7.325 ± 1.137
1.744LeuHis: 1.744 ± 0.38
3.07LeuIle: 3.07 ± 0.492
5.79LeuLys: 5.79 ± 0.844
6.976LeuLeu: 6.976 ± 0.681
2.232LeuMet: 2.232 ± 0.407
2.721LeuAsn: 2.721 ± 0.461
3.628LeuPro: 3.628 ± 0.571
3.628LeuGln: 3.628 ± 0.508
5.511LeuArg: 5.511 ± 0.69
5.651LeuSer: 5.651 ± 0.614
4.535LeuThr: 4.535 ± 0.501
5.372LeuVal: 5.372 ± 0.619
1.186LeuTrp: 1.186 ± 0.342
2.232LeuTyr: 2.232 ± 0.54
0.0LeuXaa: 0.0 ± 0.0
Met
3.209MetAla: 3.209 ± 0.599
0.279MetCys: 0.279 ± 0.145
2.093MetAsp: 2.093 ± 0.391
1.326MetGlu: 1.326 ± 0.265
0.698MetPhe: 0.698 ± 0.209
2.302MetGly: 2.302 ± 0.458
0.698MetHis: 0.698 ± 0.217
1.395MetIle: 1.395 ± 0.326
1.744MetLys: 1.744 ± 0.352
2.232MetLeu: 2.232 ± 0.37
0.488MetMet: 0.488 ± 0.17
1.186MetAsn: 1.186 ± 0.233
1.674MetPro: 1.674 ± 0.297
0.977MetGln: 0.977 ± 0.23
1.326MetArg: 1.326 ± 0.31
1.535MetSer: 1.535 ± 0.309
2.372MetThr: 2.372 ± 0.387
1.256MetVal: 1.256 ± 0.349
0.14MetTrp: 0.14 ± 0.097
0.698MetTyr: 0.698 ± 0.209
0.0MetXaa: 0.0 ± 0.0
Asn
4.465AsnAla: 4.465 ± 0.632
0.488AsnCys: 0.488 ± 0.186
2.232AsnAsp: 2.232 ± 0.371
3.07AsnGlu: 3.07 ± 0.452
1.186AsnPhe: 1.186 ± 0.282
3.558AsnGly: 3.558 ± 0.615
1.186AsnHis: 1.186 ± 0.305
3.139AsnIle: 3.139 ± 0.5
1.953AsnLys: 1.953 ± 0.386
2.93AsnLeu: 2.93 ± 0.447
1.116AsnMet: 1.116 ± 0.273
2.163AsnAsn: 2.163 ± 0.365
2.302AsnPro: 2.302 ± 0.389
2.86AsnGln: 2.86 ± 0.42
2.023AsnArg: 2.023 ± 0.368
2.163AsnSer: 2.163 ± 0.399
2.512AsnThr: 2.512 ± 0.437
2.86AsnVal: 2.86 ± 0.428
0.558AsnTrp: 0.558 ± 0.198
1.256AsnTyr: 1.256 ± 0.259
0.0AsnXaa: 0.0 ± 0.0
Pro
3.279ProAla: 3.279 ± 0.394
0.488ProCys: 0.488 ± 0.201
3.209ProAsp: 3.209 ± 0.48
4.046ProGlu: 4.046 ± 0.568
2.302ProPhe: 2.302 ± 0.395
2.581ProGly: 2.581 ± 0.532
0.907ProHis: 0.907 ± 0.302
1.465ProIle: 1.465 ± 0.322
2.721ProLys: 2.721 ± 0.438
3.139ProLeu: 3.139 ± 0.417
1.256ProMet: 1.256 ± 0.29
1.884ProAsn: 1.884 ± 0.419
1.395ProPro: 1.395 ± 0.365
1.814ProGln: 1.814 ± 0.332
1.674ProArg: 1.674 ± 0.352
2.791ProSer: 2.791 ± 0.361
2.791ProThr: 2.791 ± 0.405
2.651ProVal: 2.651 ± 0.441
0.977ProTrp: 0.977 ± 0.214
1.605ProTyr: 1.605 ± 0.391
0.0ProXaa: 0.0 ± 0.0
Gln
3.907GlnAla: 3.907 ± 0.802
0.419GlnCys: 0.419 ± 0.147
2.442GlnAsp: 2.442 ± 0.459
3.209GlnGlu: 3.209 ± 0.525
2.023GlnPhe: 2.023 ± 0.361
3.628GlnGly: 3.628 ± 0.566
1.046GlnHis: 1.046 ± 0.331
2.372GlnIle: 2.372 ± 0.292
1.674GlnLys: 1.674 ± 0.337
3.977GlnLeu: 3.977 ± 0.439
1.116GlnMet: 1.116 ± 0.259
1.605GlnAsn: 1.605 ± 0.368
1.116GlnPro: 1.116 ± 0.271
2.372GlnGln: 2.372 ± 0.641
3.418GlnArg: 3.418 ± 0.504
2.93GlnSer: 2.93 ± 0.612
1.953GlnThr: 1.953 ± 0.465
2.023GlnVal: 2.023 ± 0.417
0.907GlnTrp: 0.907 ± 0.257
0.977GlnTyr: 0.977 ± 0.228
0.0GlnXaa: 0.0 ± 0.0
Arg
5.511ArgAla: 5.511 ± 0.541
0.419ArgCys: 0.419 ± 0.156
3.349ArgAsp: 3.349 ± 0.522
3.209ArgGlu: 3.209 ± 0.53
1.953ArgPhe: 1.953 ± 0.467
4.256ArgGly: 4.256 ± 0.472
0.837ArgHis: 0.837 ± 0.24
2.93ArgIle: 2.93 ± 0.516
2.442ArgLys: 2.442 ± 0.456
4.744ArgLeu: 4.744 ± 0.49
1.884ArgMet: 1.884 ± 0.401
3.139ArgAsn: 3.139 ± 0.462
2.442ArgPro: 2.442 ± 0.41
2.721ArgGln: 2.721 ± 0.418
3.837ArgArg: 3.837 ± 0.578
2.93ArgSer: 2.93 ± 0.602
3.698ArgThr: 3.698 ± 0.421
3.488ArgVal: 3.488 ± 0.483
0.977ArgTrp: 0.977 ± 0.224
1.744ArgTyr: 1.744 ± 0.286
0.0ArgXaa: 0.0 ± 0.0
Ser
4.744SerAla: 4.744 ± 0.627
0.837SerCys: 0.837 ± 0.267
3.07SerAsp: 3.07 ± 0.425
3.279SerGlu: 3.279 ± 0.561
1.953SerPhe: 1.953 ± 0.419
5.651SerGly: 5.651 ± 0.694
1.186SerHis: 1.186 ± 0.298
2.163SerIle: 2.163 ± 0.377
3.139SerLys: 3.139 ± 0.509
4.535SerLeu: 4.535 ± 0.684
2.302SerMet: 2.302 ± 0.374
2.791SerAsn: 2.791 ± 0.462
3.07SerPro: 3.07 ± 0.47
2.163SerGln: 2.163 ± 0.402
2.721SerArg: 2.721 ± 0.403
3.907SerSer: 3.907 ± 0.646
3.0SerThr: 3.0 ± 0.662
3.837SerVal: 3.837 ± 0.435
0.977SerTrp: 0.977 ± 0.295
2.232SerTyr: 2.232 ± 0.321
0.0SerXaa: 0.0 ± 0.0
Thr
5.093ThrAla: 5.093 ± 0.605
0.628ThrCys: 0.628 ± 0.164
3.07ThrAsp: 3.07 ± 0.423
3.488ThrGlu: 3.488 ± 0.491
2.651ThrPhe: 2.651 ± 0.367
4.604ThrGly: 4.604 ± 0.527
0.767ThrHis: 0.767 ± 0.233
3.418ThrIle: 3.418 ± 0.503
3.139ThrLys: 3.139 ± 0.482
4.953ThrLeu: 4.953 ± 0.658
1.395ThrMet: 1.395 ± 0.32
2.86ThrAsn: 2.86 ± 0.625
3.279ThrPro: 3.279 ± 0.62
2.302ThrGln: 2.302 ± 0.42
2.791ThrArg: 2.791 ± 0.383
2.93ThrSer: 2.93 ± 0.425
3.139ThrThr: 3.139 ± 0.443
4.465ThrVal: 4.465 ± 0.757
0.907ThrTrp: 0.907 ± 0.277
1.395ThrTyr: 1.395 ± 0.328
0.0ThrXaa: 0.0 ± 0.0
Val
5.721ValAla: 5.721 ± 0.606
0.837ValCys: 0.837 ± 0.304
3.558ValAsp: 3.558 ± 0.538
3.977ValGlu: 3.977 ± 0.517
2.442ValPhe: 2.442 ± 0.433
4.535ValGly: 4.535 ± 0.61
1.465ValHis: 1.465 ± 0.301
3.139ValIle: 3.139 ± 0.461
4.465ValLys: 4.465 ± 0.477
5.372ValLeu: 5.372 ± 0.658
2.232ValMet: 2.232 ± 0.351
4.046ValAsn: 4.046 ± 0.486
3.349ValPro: 3.349 ± 0.392
2.93ValGln: 2.93 ± 0.399
4.186ValArg: 4.186 ± 0.667
4.256ValSer: 4.256 ± 0.618
4.744ValThr: 4.744 ± 0.716
5.163ValVal: 5.163 ± 0.493
0.628ValTrp: 0.628 ± 0.235
3.139ValTyr: 3.139 ± 0.477
0.0ValXaa: 0.0 ± 0.0
Trp
1.674TrpAla: 1.674 ± 0.319
0.279TrpCys: 0.279 ± 0.182
1.186TrpAsp: 1.186 ± 0.253
1.326TrpGlu: 1.326 ± 0.282
0.767TrpPhe: 0.767 ± 0.176
1.326TrpGly: 1.326 ± 0.351
0.14TrpHis: 0.14 ± 0.098
0.558TrpIle: 0.558 ± 0.199
0.837TrpLys: 0.837 ± 0.269
1.814TrpLeu: 1.814 ± 0.439
0.698TrpMet: 0.698 ± 0.197
0.767TrpAsn: 0.767 ± 0.233
0.767TrpPro: 0.767 ± 0.25
0.419TrpGln: 0.419 ± 0.156
0.837TrpArg: 0.837 ± 0.196
1.046TrpSer: 1.046 ± 0.267
0.977TrpThr: 0.977 ± 0.266
1.395TrpVal: 1.395 ± 0.331
0.279TrpTrp: 0.279 ± 0.178
0.279TrpTyr: 0.279 ± 0.15
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.884TyrAla: 1.884 ± 0.325
0.628TyrCys: 0.628 ± 0.221
2.512TyrAsp: 2.512 ± 0.447
1.884TyrGlu: 1.884 ± 0.439
1.326TyrPhe: 1.326 ± 0.288
2.86TyrGly: 2.86 ± 0.362
0.628TyrHis: 0.628 ± 0.233
1.674TyrIle: 1.674 ± 0.301
1.535TyrLys: 1.535 ± 0.278
2.791TyrLeu: 2.791 ± 0.53
0.628TyrMet: 0.628 ± 0.244
1.465TyrAsn: 1.465 ± 0.322
1.186TyrPro: 1.186 ± 0.316
1.605TyrGln: 1.605 ± 0.361
2.232TyrArg: 2.232 ± 0.39
2.163TyrSer: 2.163 ± 0.36
1.884TyrThr: 1.884 ± 0.433
2.302TyrVal: 2.302 ± 0.419
0.698TyrTrp: 0.698 ± 0.189
1.186TyrTyr: 1.186 ± 0.29
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 72 proteins (14335 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski