Amino acid dipeptide frequency for Pseudomonas phage vB_PaeS_SCH_Ab26

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
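
The calculation described above can be sketched in Python as follows. This is a minimal illustration, not the database's actual code: the function names, the one-letter alphabet, the mapping of any non-standard letter to X (Xaa), and the normalisation by the total number of dipeptides counted are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes
ALPHABET = STANDARD + "X"          # X (Xaa) stands for any other residue


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs.

    Assumption: frequencies are normalised by the total number of
    dipeptides counted; the database may normalise differently.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard letters to X.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping adjacent pairs; a pair never spans two proteins.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        raise ValueError("need at least one protein with two or more residues")
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(ALPHABET, repeat=2)}


# Uniform expectation over the 400 standard dipeptides: 2.5 per mille (0.25%).
EXPECTED_PERMILLE = 1000.0 / 400


def representation(value, expected=EXPECTED_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


# Toy input (hypothetical sequences, not taken from this proteome).
freqs = dipeptide_permille(["MKAAL", "AAXC"])
print(round(freqs["AA"], 1), representation(freqs["AA"]))
```

With this labelling, a value such as AlaAla (14.052‰) counts as overrepresented and AlaCys (0.892‰) as underrepresented relative to the 2.5‰ uniform expectation.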

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
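
As a small worked example of the unit conversion, the sketch below turns a per mille value into a plain fraction and into a percentage. The helper names are hypothetical; the AlaAla value is taken from the table.

```python
def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert a per mille value to percent (1 per mille = 0.1%)."""
    return value * 0.1


ala_ala = 14.052  # AlaAla value from the table, in per mille
print(f"{permille_to_fraction(ala_ala):.6f}")  # prints 0.014052
print(f"{permille_to_percent(ala_ala):.4f}")   # prints 1.4052
```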

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 14.052 ± 2.175
AlaCys: 0.892 ± 0.266
AlaAsp: 4.981 ± 0.738
AlaGlu: 6.32 ± 0.602
AlaPhe: 3.792 ± 0.549
AlaGly: 8.253 ± 0.816
AlaHis: 1.784 ± 0.482
AlaIle: 5.948 ± 0.848
AlaLys: 6.84 ± 0.685
AlaLeu: 9.814 ± 1.103
AlaMet: 2.082 ± 0.452
AlaAsn: 6.171 ± 0.77
AlaPro: 4.907 ± 0.739
AlaGln: 5.13 ± 1.069
AlaArg: 5.502 ± 0.584
AlaSer: 6.766 ± 0.85
AlaThr: 6.84 ± 1.252
AlaVal: 7.584 ± 0.821
AlaTrp: 1.561 ± 0.423
AlaTyr: 3.346 ± 0.351
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.264 ± 0.291
CysCys: 0.297 ± 0.14
CysAsp: 0.967 ± 0.263
CysGlu: 0.743 ± 0.273
CysPhe: 0.52 ± 0.171
CysGly: 1.041 ± 0.358
CysHis: 0.446 ± 0.17
CysIle: 0.595 ± 0.252
CysLys: 0.446 ± 0.205
CysLeu: 0.595 ± 0.203
CysMet: 0.074 ± 0.083
CysAsn: 0.52 ± 0.187
CysPro: 0.446 ± 0.215
CysGln: 0.297 ± 0.142
CysArg: 0.892 ± 0.332
CysSer: 0.52 ± 0.198
CysThr: 0.297 ± 0.126
CysVal: 0.743 ± 0.193
CysTrp: 0.074 ± 0.078
CysTyr: 0.595 ± 0.195
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.948 ± 0.713
AspCys: 0.52 ± 0.184
AspAsp: 3.197 ± 0.437
AspGlu: 4.461 ± 0.523
AspPhe: 2.454 ± 0.443
AspGly: 4.461 ± 0.765
AspHis: 0.669 ± 0.262
AspIle: 1.859 ± 0.441
AspLys: 2.23 ± 0.415
AspLeu: 4.238 ± 0.523
AspMet: 1.19 ± 0.288
AspAsn: 2.23 ± 0.38
AspPro: 2.156 ± 0.401
AspGln: 1.115 ± 0.347
AspArg: 3.123 ± 0.52
AspSer: 2.379 ± 0.377
AspThr: 2.528 ± 0.478
AspVal: 3.123 ± 0.504
AspTrp: 0.372 ± 0.15
AspTyr: 1.636 ± 0.353
AspXaa: 0.074 ± 0.083
Glu
GluAla: 8.55 ± 1.0
GluCys: 0.892 ± 0.264
GluAsp: 2.974 ± 0.418
GluGlu: 4.238 ± 0.629
GluPhe: 2.454 ± 0.426
GluGly: 4.089 ± 0.458
GluHis: 0.818 ± 0.277
GluIle: 3.941 ± 0.561
GluLys: 2.825 ± 0.592
GluLeu: 6.468 ± 0.592
GluMet: 1.19 ± 0.316
GluAsn: 1.859 ± 0.367
GluPro: 1.859 ± 0.383
GluGln: 2.454 ± 0.587
GluArg: 3.569 ± 0.428
GluSer: 2.602 ± 0.452
GluThr: 2.825 ± 0.381
GluVal: 4.907 ± 0.545
GluTrp: 1.636 ± 0.362
GluTyr: 2.007 ± 0.347
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.048 ± 0.491
PheCys: 0.967 ± 0.359
PheAsp: 3.866 ± 0.588
PheGlu: 2.677 ± 0.485
PhePhe: 1.487 ± 0.343
PheGly: 3.717 ± 0.486
PheHis: 0.297 ± 0.136
PheIle: 2.454 ± 0.351
PheLys: 1.636 ± 0.294
PheLeu: 1.413 ± 0.304
PheMet: 0.967 ± 0.255
PheAsn: 2.454 ± 0.431
PhePro: 1.338 ± 0.312
PheGln: 1.413 ± 0.316
PheArg: 2.23 ± 0.336
PheSer: 1.933 ± 0.415
PheThr: 2.156 ± 0.419
PheVal: 3.048 ± 0.581
PheTrp: 0.669 ± 0.206
PheTyr: 1.784 ± 0.336
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.063 ± 0.909
GlyCys: 0.818 ± 0.296
GlyAsp: 4.164 ± 0.595
GlyGlu: 5.056 ± 0.642
GlyPhe: 3.494 ± 0.423
GlyGly: 6.914 ± 1.017
GlyHis: 1.264 ± 0.359
GlyIle: 3.569 ± 0.59
GlyLys: 4.461 ± 0.515
GlyLeu: 7.138 ± 0.794
GlyMet: 1.413 ± 0.343
GlyAsn: 2.9 ± 0.524
GlyPro: 2.974 ± 0.483
GlyGln: 3.197 ± 0.729
GlyArg: 3.941 ± 0.483
GlySer: 5.874 ± 0.879
GlyThr: 4.61 ± 0.812
GlyVal: 6.022 ± 0.613
GlyTrp: 1.487 ± 0.363
GlyTyr: 2.751 ± 0.46
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.338 ± 0.289
HisCys: 0.446 ± 0.175
HisAsp: 0.223 ± 0.157
HisGlu: 0.818 ± 0.243
HisPhe: 0.595 ± 0.252
HisGly: 0.743 ± 0.28
HisHis: 0.074 ± 0.075
HisIle: 0.669 ± 0.216
HisLys: 0.669 ± 0.228
HisLeu: 1.487 ± 0.455
HisMet: 0.297 ± 0.138
HisAsn: 0.669 ± 0.209
HisPro: 0.669 ± 0.27
HisGln: 0.223 ± 0.129
HisArg: 0.595 ± 0.295
HisSer: 0.892 ± 0.258
HisThr: 0.297 ± 0.147
HisVal: 1.784 ± 0.453
HisTrp: 0.074 ± 0.063
HisTyr: 0.52 ± 0.153
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.576 ± 0.664
IleCys: 0.669 ± 0.249
IleAsp: 3.717 ± 0.477
IleGlu: 4.089 ± 0.659
IlePhe: 1.19 ± 0.374
IleGly: 3.569 ± 0.422
IleHis: 0.818 ± 0.251
IleIle: 2.454 ± 0.474
IleLys: 2.9 ± 0.399
IleLeu: 2.528 ± 0.45
IleMet: 0.892 ± 0.281
IleAsn: 3.048 ± 0.477
IlePro: 2.528 ± 0.436
IleGln: 1.19 ± 0.344
IleArg: 2.156 ± 0.397
IleSer: 2.751 ± 0.42
IleThr: 3.123 ± 0.615
IleVal: 3.717 ± 0.523
IleTrp: 0.52 ± 0.192
IleTyr: 1.487 ± 0.368
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.543 ± 0.762
LysCys: 0.595 ± 0.183
LysAsp: 2.007 ± 0.525
LysGlu: 3.123 ± 0.482
LysPhe: 2.23 ± 0.4
LysGly: 3.346 ± 0.568
LysHis: 0.669 ± 0.252
LysIle: 2.677 ± 0.363
LysLys: 3.866 ± 0.819
LysLeu: 6.32 ± 0.731
LysMet: 1.338 ± 0.315
LysAsn: 2.156 ± 0.517
LysPro: 2.305 ± 0.485
LysGln: 1.264 ± 0.343
LysArg: 2.23 ± 0.372
LysSer: 3.271 ± 0.509
LysThr: 3.197 ± 0.521
LysVal: 3.792 ± 0.465
LysTrp: 0.818 ± 0.224
LysTyr: 1.338 ± 0.292
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.625 ± 1.019
LeuCys: 0.595 ± 0.188
LeuAsp: 3.866 ± 0.469
LeuGlu: 4.907 ± 0.604
LeuPhe: 3.42 ± 0.489
LeuGly: 5.13 ± 0.588
LeuHis: 0.818 ± 0.24
LeuIle: 4.015 ± 0.569
LeuLys: 3.941 ± 0.632
LeuLeu: 6.914 ± 0.866
LeuMet: 1.413 ± 0.292
LeuAsn: 3.941 ± 0.548
LeuPro: 4.684 ± 0.62
LeuGln: 3.494 ± 0.882
LeuArg: 5.056 ± 0.553
LeuSer: 6.394 ± 0.511
LeuThr: 4.61 ± 0.566
LeuVal: 4.758 ± 0.563
LeuTrp: 0.967 ± 0.293
LeuTyr: 2.9 ± 0.487
LeuXaa: 0.074 ± 0.084
Met
MetAla: 2.751 ± 0.434
MetCys: 0.149 ± 0.104
MetAsp: 0.669 ± 0.195
MetGlu: 0.149 ± 0.091
MetPhe: 0.743 ± 0.193
MetGly: 1.933 ± 0.409
MetHis: 0.372 ± 0.213
MetIle: 0.892 ± 0.238
MetLys: 0.967 ± 0.286
MetLeu: 2.082 ± 0.441
MetMet: 0.149 ± 0.098
MetAsn: 0.892 ± 0.29
MetPro: 2.156 ± 0.432
MetGln: 0.967 ± 0.256
MetArg: 1.264 ± 0.325
MetSer: 1.859 ± 0.362
MetThr: 1.487 ± 0.311
MetVal: 0.818 ± 0.186
MetTrp: 0.149 ± 0.093
MetTyr: 0.297 ± 0.147
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.948 ± 0.764
AsnCys: 0.595 ± 0.247
AsnAsp: 2.23 ± 0.317
AsnGlu: 3.42 ± 0.457
AsnPhe: 1.338 ± 0.266
AsnGly: 5.279 ± 0.752
AsnHis: 0.446 ± 0.163
AsnIle: 1.561 ± 0.364
AsnLys: 2.9 ± 0.505
AsnLeu: 3.271 ± 0.496
AsnMet: 0.595 ± 0.171
AsnAsn: 1.784 ± 0.538
AsnPro: 2.9 ± 0.372
AsnGln: 1.413 ± 0.292
AsnArg: 3.197 ± 0.479
AsnSer: 3.048 ± 0.476
AsnThr: 2.379 ± 0.291
AsnVal: 5.056 ± 0.622
AsnTrp: 0.595 ± 0.193
AsnTyr: 1.041 ± 0.36
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.576 ± 1.045
ProCys: 0.223 ± 0.121
ProAsp: 2.007 ± 0.345
ProGlu: 3.569 ± 0.594
ProPhe: 1.784 ± 0.335
ProGly: 3.866 ± 0.755
ProHis: 1.041 ± 0.29
ProIle: 2.454 ± 0.418
ProLys: 2.156 ± 0.45
ProLeu: 3.643 ± 0.578
ProMet: 1.041 ± 0.284
ProAsn: 4.015 ± 0.671
ProPro: 2.602 ± 0.501
ProGln: 2.602 ± 0.707
ProArg: 1.71 ± 0.393
ProSer: 2.751 ± 0.474
ProThr: 4.089 ± 0.652
ProVal: 2.974 ± 0.408
ProTrp: 0.743 ± 0.264
ProTyr: 1.487 ± 0.403
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.312 ± 0.621
GlnCys: 0.52 ± 0.181
GlnAsp: 1.264 ± 0.435
GlnGlu: 1.338 ± 0.516
GlnPhe: 1.561 ± 0.361
GlnGly: 2.9 ± 0.739
GlnHis: 0.446 ± 0.192
GlnIle: 2.007 ± 0.366
GlnLys: 1.859 ± 0.351
GlnLeu: 4.089 ± 0.798
GlnMet: 0.892 ± 0.243
GlnAsn: 2.23 ± 0.511
GlnPro: 2.379 ± 1.049
GlnGln: 3.42 ± 1.369
GlnArg: 3.271 ± 0.819
GlnSer: 2.23 ± 0.342
GlnThr: 2.677 ± 0.575
GlnVal: 2.082 ± 0.38
GlnTrp: 0.372 ± 0.186
GlnTyr: 1.19 ± 0.268
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.022 ± 0.727
ArgCys: 0.446 ± 0.26
ArgAsp: 2.825 ± 0.432
ArgGlu: 2.825 ± 0.455
ArgPhe: 2.528 ± 0.424
ArgGly: 3.197 ± 0.461
ArgHis: 0.743 ± 0.218
ArgIle: 3.792 ± 0.684
ArgLys: 3.048 ± 0.529
ArgLeu: 4.61 ± 0.555
ArgMet: 1.784 ± 0.294
ArgAsn: 2.528 ± 0.421
ArgPro: 2.379 ± 0.465
ArgGln: 2.454 ± 0.325
ArgArg: 3.643 ± 0.667
ArgSer: 2.9 ± 0.417
ArgThr: 2.974 ± 0.502
ArgVal: 3.197 ± 0.563
ArgTrp: 0.967 ± 0.231
ArgTyr: 1.933 ± 0.351
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.022 ± 0.825
SerCys: 0.372 ± 0.162
SerAsp: 3.569 ± 0.529
SerGlu: 3.569 ± 0.522
SerPhe: 2.454 ± 0.499
SerGly: 5.576 ± 0.677
SerHis: 0.595 ± 0.224
SerIle: 2.602 ± 0.428
SerLys: 3.941 ± 0.606
SerLeu: 3.866 ± 0.522
SerMet: 1.115 ± 0.274
SerAsn: 3.792 ± 0.528
SerPro: 2.9 ± 0.387
SerGln: 2.974 ± 0.752
SerArg: 2.677 ± 0.464
SerSer: 2.974 ± 0.545
SerThr: 3.123 ± 0.415
SerVal: 4.387 ± 0.453
SerTrp: 0.595 ± 0.169
SerTyr: 2.156 ± 0.396
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.914 ± 0.975
ThrCys: 0.372 ± 0.196
ThrAsp: 2.23 ± 0.353
ThrGlu: 3.717 ± 0.606
ThrPhe: 2.082 ± 0.379
ThrGly: 5.576 ± 0.697
ThrHis: 0.446 ± 0.157
ThrIle: 2.156 ± 0.462
ThrLys: 3.346 ± 0.427
ThrLeu: 4.387 ± 0.486
ThrMet: 1.338 ± 0.318
ThrAsn: 2.454 ± 0.402
ThrPro: 3.866 ± 0.583
ThrGln: 2.305 ± 0.387
ThrArg: 2.825 ± 0.519
ThrSer: 2.974 ± 0.616
ThrThr: 3.197 ± 0.608
ThrVal: 3.792 ± 0.637
ThrTrp: 0.892 ± 0.192
ThrTyr: 2.082 ± 0.34
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.401 ± 0.705
ValCys: 0.743 ± 0.266
ValAsp: 2.528 ± 0.466
ValGlu: 4.238 ± 0.789
ValPhe: 3.123 ± 0.602
ValGly: 6.022 ± 0.8
ValHis: 0.892 ± 0.222
ValIle: 3.048 ± 0.471
ValLys: 3.346 ± 0.393
ValLeu: 4.758 ± 0.508
ValMet: 2.007 ± 0.388
ValAsn: 2.751 ± 0.456
ValPro: 4.61 ± 0.613
ValGln: 2.751 ± 0.551
ValArg: 3.717 ± 0.605
ValSer: 4.015 ± 0.565
ValThr: 4.758 ± 0.704
ValVal: 5.204 ± 0.622
ValTrp: 1.264 ± 0.338
ValTyr: 2.305 ± 0.47
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.19 ± 0.322
TrpCys: 0.297 ± 0.142
TrpAsp: 0.372 ± 0.206
TrpGlu: 0.743 ± 0.193
TrpPhe: 0.818 ± 0.229
TrpGly: 0.892 ± 0.347
TrpHis: 0.149 ± 0.103
TrpIle: 0.669 ± 0.222
TrpLys: 0.52 ± 0.214
TrpLeu: 0.669 ± 0.197
TrpMet: 0.297 ± 0.175
TrpAsn: 0.446 ± 0.195
TrpPro: 1.338 ± 0.331
TrpGln: 0.818 ± 0.226
TrpArg: 1.413 ± 0.274
TrpSer: 0.743 ± 0.21
TrpThr: 0.52 ± 0.169
TrpVal: 1.19 ± 0.313
TrpTrp: 0.297 ± 0.124
TrpTyr: 0.892 ± 0.258
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.346 ± 0.646
TyrCys: 0.967 ± 0.363
TyrAsp: 2.305 ± 0.494
TyrGlu: 2.156 ± 0.37
TyrPhe: 1.487 ± 0.381
TyrGly: 2.602 ± 0.46
TyrHis: 0.297 ± 0.131
TyrIle: 1.561 ± 0.255
TyrLys: 1.115 ± 0.319
TyrLeu: 2.23 ± 0.352
TyrMet: 0.595 ± 0.314
TyrAsn: 2.156 ± 0.384
TyrPro: 1.338 ± 0.407
TyrGln: 1.561 ± 0.428
TyrArg: 1.784 ± 0.398
TyrSer: 2.454 ± 0.576
TyrThr: 1.264 ± 0.299
TyrVal: 2.305 ± 0.365
TyrTrp: 0.223 ± 0.124
TyrTyr: 0.892 ± 0.303
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.074 ± 0.083
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.074 ± 0.084
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 52 proteins (13451 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
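
Protein-level bootstrapping of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicate values is taken as the error. This is an illustration under stated assumptions rather than the database's implementation: the function names are hypothetical, sequences are assumed to be one-letter strings, and reporting the sample standard deviation of the replicates as the error is an assumption.

```python
import random
import statistics


def dipeptide_frequency(proteins, dipeptide):
    """Frequency of one dipeptide in per mille, over all adjacent pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for a, b in zip(seq, seq[1:]):
            total += 1
            if a + b == dipeptide:
                hits += 1
    # Guard against replicates that contain no dipeptides at all.
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Assumption: the error is the sample standard deviation across
    replicates; the source only states that bootstrapping was used.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_frequency(sample, dipeptide))
    return statistics.stdev(estimates)


# Toy input (hypothetical sequences, not taken from this proteome).
toy = ["MKAAL", "AAGC", "MLLK", "GAAV"]
print(dipeptide_frequency(toy, "AA"), bootstrap_error(toy, "AA"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins in the proteome.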

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski