Amino acid dipeptide frequency for Pseudomonas phage JBD24

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
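As a rough illustration of what such a calculation can look like, the sketch below counts overlapping adjacent residue pairs in a set of protein sequences and reports each pair in per mille. It is an assumption-laden example, not the method used to build this page: the function name `dipeptide_per_mille`, the use of one-letter codes (the table uses three-letter codes), and the choice to normalize by the total number of dipeptides are all hypothetical.

```python
from collections import Counter

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: pairs are overlapping and never span two proteins;
    the denominator is the total number of dipeptides counted.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2 along each protein separately.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (not real JBD24 sequences): pairs are MA, AA, AL, AA, AK.
freqs = dipeptide_per_mille(["MAAL", "AAK"])
```

With this toy input, "AA" accounts for two of the five counted pairs.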

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
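The arithmetic behind this baseline can be sketched as follows. The example assumes that the coloring compares each value with the uniform expectation of 1/400; the page does not state the exact threshold, so the `classify` helper and its labels are hypothetical. The two observed values are taken from the table.

```python
N_STANDARD = 20                      # standard amino acids
N_WITH_XAA = N_STANDARD + 1          # plus Xaa

n_dipeptides = N_STANDARD ** 2       # 400
n_dipeptides_xaa = N_WITH_XAA ** 2   # 441

# Uniform expectation: 1/400 = 0.25 % = 2.5 per mille.
expected_per_mille = 1000.0 / n_dipeptides

def classify(observed_per_mille, expected=expected_per_mille):
    """Label a value relative to the uniform baseline (assumed rule)."""
    if observed_per_mille > expected:
        return "over"    # would be marked in red
    if observed_per_mille < expected:
        return "under"   # would be marked in blue
    return "expected"

ala_ala = classify(18.492)   # AlaAla value from the table
cys_cys = classify(0.084)    # CysCys value from the table
```

Under this assumed rule, AlaAla counts as overrepresented and CysCys as underrepresented.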

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 18.492 ± 3.132
AlaCys: 1.52 ± 0.474
AlaAsp: 8.19 ± 0.74
AlaGlu: 8.444 ± 1.265
AlaPhe: 2.702 ± 0.435
AlaGly: 9.373 ± 1.324
AlaHis: 1.182 ± 0.324
AlaIle: 6.502 ± 0.756
AlaLys: 4.222 ± 0.677
AlaLeu: 13.341 ± 1.076
AlaMet: 3.715 ± 0.424
AlaAsn: 3.462 ± 0.595
AlaPro: 5.657 ± 0.751
AlaGln: 5.911 ± 0.974
AlaArg: 9.795 ± 1.02
AlaSer: 6.839 ± 0.929
AlaThr: 6.417 ± 0.56
AlaVal: 6.502 ± 0.787
AlaTrp: 2.364 ± 0.302
AlaTyr: 2.871 ± 0.434
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.844 ± 0.26
CysCys: 0.084 ± 0.086
CysAsp: 1.013 ± 0.36
CysGlu: 0.422 ± 0.281
CysPhe: 0.507 ± 0.185
CysGly: 0.676 ± 0.27
CysHis: 0.253 ± 0.143
CysIle: 0.338 ± 0.18
CysLys: 0.338 ± 0.213
CysLeu: 0.422 ± 0.221
CysMet: 0.253 ± 0.199
CysAsn: 0.253 ± 0.136
CysPro: 0.676 ± 0.25
CysGln: 0.0 ± 0.0
CysArg: 1.267 ± 0.376
CysSer: 0.507 ± 0.219
CysThr: 0.676 ± 0.242
CysVal: 0.338 ± 0.168
CysTrp: 0.338 ± 0.167
CysTyr: 0.253 ± 0.139
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.599 ± 0.895
AspCys: 0.253 ± 0.127
AspAsp: 3.124 ± 0.538
AspGlu: 3.462 ± 0.567
AspPhe: 1.773 ± 0.277
AspGly: 6.417 ± 0.875
AspHis: 1.351 ± 0.369
AspIle: 2.702 ± 0.347
AspLys: 1.52 ± 0.395
AspLeu: 6.248 ± 0.908
AspMet: 1.689 ± 0.423
AspAsn: 1.52 ± 0.34
AspPro: 3.293 ± 0.531
AspGln: 2.449 ± 0.398
AspArg: 3.293 ± 0.465
AspSer: 3.546 ± 0.532
AspThr: 3.462 ± 0.536
AspVal: 3.462 ± 0.603
AspTrp: 1.098 ± 0.269
AspTyr: 1.52 ± 0.406
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.768 ± 0.808
GluCys: 1.013 ± 0.442
GluAsp: 3.209 ± 0.595
GluGlu: 2.364 ± 0.45
GluPhe: 1.773 ± 0.385
GluGly: 3.546 ± 0.498
GluHis: 0.844 ± 0.253
GluIle: 3.209 ± 0.515
GluLys: 2.618 ± 0.45
GluLeu: 6.502 ± 0.72
GluMet: 1.52 ± 0.354
GluAsn: 1.942 ± 0.446
GluPro: 2.618 ± 0.553
GluGln: 4.053 ± 0.536
GluArg: 4.306 ± 0.725
GluSer: 2.702 ± 0.391
GluThr: 2.702 ± 0.472
GluVal: 4.813 ± 0.729
GluTrp: 1.267 ± 0.31
GluTyr: 1.351 ± 0.313
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.462 ± 0.509
PheCys: 0.591 ± 0.194
PheAsp: 2.195 ± 0.426
PheGlu: 1.435 ± 0.339
PhePhe: 0.676 ± 0.218
PheGly: 2.449 ± 0.425
PheHis: 0.507 ± 0.198
PheIle: 0.929 ± 0.272
PheLys: 0.76 ± 0.276
PheLeu: 2.364 ± 0.394
PheMet: 0.929 ± 0.274
PheAsn: 1.098 ± 0.315
PhePro: 1.435 ± 0.352
PheGln: 1.182 ± 0.328
PheArg: 2.027 ± 0.466
PheSer: 1.435 ± 0.318
PheThr: 1.182 ± 0.343
PheVal: 1.267 ± 0.234
PheTrp: 0.338 ± 0.166
PheTyr: 0.929 ± 0.27
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.008 ± 0.993
GlyCys: 0.507 ± 0.208
GlyAsp: 4.222 ± 0.827
GlyGlu: 4.391 ± 0.527
GlyPhe: 3.124 ± 0.425
GlyGly: 6.417 ± 0.753
GlyHis: 0.844 ± 0.245
GlyIle: 3.715 ± 0.498
GlyLys: 2.955 ± 0.472
GlyLeu: 7.853 ± 1.012
GlyMet: 1.182 ± 0.336
GlyAsn: 2.618 ± 0.424
GlyPro: 2.618 ± 0.385
GlyGln: 4.982 ± 0.627
GlyArg: 6.586 ± 0.691
GlySer: 5.826 ± 0.896
GlyThr: 3.378 ± 0.512
GlyVal: 4.644 ± 0.718
GlyTrp: 1.689 ± 0.342
GlyTyr: 2.618 ± 0.638
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.689 ± 0.348
HisCys: 0.084 ± 0.086
HisAsp: 0.844 ± 0.257
HisGlu: 0.844 ± 0.336
HisPhe: 0.338 ± 0.147
HisGly: 1.435 ± 0.421
HisHis: 0.253 ± 0.133
HisIle: 0.76 ± 0.204
HisLys: 0.169 ± 0.124
HisLeu: 1.689 ± 0.366
HisMet: 0.844 ± 0.302
HisAsn: 0.76 ± 0.248
HisPro: 1.351 ± 0.388
HisGln: 0.929 ± 0.262
HisArg: 1.182 ± 0.377
HisSer: 0.507 ± 0.192
HisThr: 0.591 ± 0.195
HisVal: 0.76 ± 0.242
HisTrp: 0.169 ± 0.1
HisTyr: 0.929 ± 0.329
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.982 ± 0.74
IleCys: 0.591 ± 0.226
IleAsp: 3.631 ± 0.451
IleGlu: 3.124 ± 0.466
IlePhe: 0.676 ± 0.22
IleGly: 3.124 ± 0.429
IleHis: 1.013 ± 0.195
IleIle: 1.689 ± 0.631
IleLys: 1.351 ± 0.281
IleLeu: 3.124 ± 0.474
IleMet: 0.507 ± 0.211
IleAsn: 0.929 ± 0.262
IlePro: 2.195 ± 0.441
IleGln: 1.604 ± 0.41
IleArg: 4.222 ± 0.533
IleSer: 2.111 ± 0.459
IleThr: 3.209 ± 0.586
IleVal: 2.871 ± 0.427
IleTrp: 0.676 ± 0.242
IleTyr: 0.929 ± 0.265
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.982 ± 0.734
LysCys: 0.084 ± 0.065
LysAsp: 0.76 ± 0.277
LysGlu: 2.111 ± 0.492
LysPhe: 0.422 ± 0.156
LysGly: 2.364 ± 0.343
LysHis: 0.76 ± 0.263
LysIle: 1.182 ± 0.318
LysLys: 1.52 ± 0.338
LysLeu: 3.209 ± 0.546
LysMet: 0.422 ± 0.185
LysAsn: 1.013 ± 0.249
LysPro: 2.28 ± 0.575
LysGln: 1.013 ± 0.277
LysArg: 3.293 ± 0.812
LysSer: 2.28 ± 0.456
LysThr: 1.942 ± 0.371
LysVal: 2.533 ± 0.461
LysTrp: 0.169 ± 0.117
LysTyr: 0.929 ± 0.248
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 14.523 ± 1.398
LeuCys: 0.844 ± 0.281
LeuAsp: 6.417 ± 0.701
LeuGlu: 6.755 ± 0.868
LeuPhe: 2.533 ± 0.578
LeuGly: 7.177 ± 0.689
LeuHis: 2.195 ± 0.487
LeuIle: 3.209 ± 0.544
LeuLys: 3.378 ± 0.585
LeuLeu: 8.95 ± 1.08
LeuMet: 1.773 ± 0.458
LeuAsn: 2.702 ± 0.493
LeuPro: 4.56 ± 0.691
LeuGln: 3.631 ± 0.51
LeuArg: 7.599 ± 0.67
LeuSer: 4.644 ± 0.764
LeuThr: 5.404 ± 0.776
LeuVal: 7.937 ± 0.71
LeuTrp: 1.435 ± 0.316
LeuTyr: 2.449 ± 0.359
LeuXaa: 0.0 ± 0.0
Met
MetAla: 4.053 ± 0.571
MetCys: 0.084 ± 0.082
MetAsp: 2.28 ± 0.43
MetGlu: 1.52 ± 0.429
MetPhe: 0.422 ± 0.183
MetGly: 1.773 ± 0.429
MetHis: 0.169 ± 0.115
MetIle: 0.253 ± 0.136
MetLys: 0.591 ± 0.215
MetLeu: 1.098 ± 0.309
MetMet: 0.591 ± 0.221
MetAsn: 0.422 ± 0.229
MetPro: 1.351 ± 0.412
MetGln: 1.098 ± 0.337
MetArg: 1.351 ± 0.307
MetSer: 1.942 ± 0.488
MetThr: 1.858 ± 0.356
MetVal: 0.929 ± 0.229
MetTrp: 0.338 ± 0.233
MetTyr: 0.253 ± 0.138
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.04 ± 0.626
AsnCys: 0.084 ± 0.088
AsnAsp: 1.267 ± 0.313
AsnGlu: 1.182 ± 0.336
AsnPhe: 0.676 ± 0.302
AsnGly: 2.871 ± 0.776
AsnHis: 0.591 ± 0.247
AsnIle: 0.676 ± 0.309
AsnLys: 0.844 ± 0.294
AsnLeu: 3.293 ± 0.555
AsnMet: 1.013 ± 0.354
AsnAsn: 1.267 ± 0.474
AsnPro: 2.533 ± 0.483
AsnGln: 1.351 ± 0.332
AsnArg: 3.293 ± 0.423
AsnSer: 1.604 ± 0.3
AsnThr: 1.182 ± 0.304
AsnVal: 1.182 ± 0.266
AsnTrp: 0.591 ± 0.192
AsnTyr: 0.929 ± 0.274
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 7.177 ± 0.905
ProCys: 0.422 ± 0.179
ProAsp: 3.969 ± 0.553
ProGlu: 2.786 ± 0.497
ProPhe: 1.604 ± 0.371
ProGly: 3.8 ± 0.487
ProHis: 0.929 ± 0.481
ProIle: 1.858 ± 0.388
ProLys: 1.604 ± 0.439
ProLeu: 4.644 ± 0.538
ProMet: 0.844 ± 0.243
ProAsn: 1.773 ± 0.452
ProPro: 2.28 ± 0.576
ProGln: 2.28 ± 0.527
ProArg: 3.209 ± 0.595
ProSer: 3.546 ± 0.461
ProThr: 2.871 ± 0.639
ProVal: 3.124 ± 0.613
ProTrp: 0.338 ± 0.152
ProTyr: 1.182 ± 0.357
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.32 ± 1.135
GlnCys: 0.338 ± 0.153
GlnAsp: 1.858 ± 0.394
GlnGlu: 1.858 ± 0.404
GlnPhe: 1.52 ± 0.33
GlnGly: 3.378 ± 0.536
GlnHis: 0.929 ± 0.306
GlnIle: 2.195 ± 0.328
GlnLys: 1.013 ± 0.361
GlnLeu: 6.248 ± 0.592
GlnMet: 0.929 ± 0.333
GlnAsn: 1.351 ± 0.336
GlnPro: 2.533 ± 0.441
GlnGln: 3.04 ± 0.704
GlnArg: 3.715 ± 0.564
GlnSer: 2.871 ± 0.495
GlnThr: 2.28 ± 0.451
GlnVal: 4.56 ± 0.517
GlnTrp: 1.098 ± 0.337
GlnTyr: 0.591 ± 0.223
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 8.528 ± 0.965
ArgCys: 0.844 ± 0.28
ArgAsp: 4.475 ± 0.64
ArgGlu: 5.488 ± 0.613
ArgPhe: 2.111 ± 0.411
ArgGly: 4.391 ± 0.583
ArgHis: 1.689 ± 0.353
ArgIle: 3.631 ± 0.515
ArgLys: 2.449 ± 0.426
ArgLeu: 7.768 ± 0.758
ArgMet: 1.013 ± 0.267
ArgAsn: 1.52 ± 0.409
ArgPro: 4.137 ± 0.81
ArgGln: 4.897 ± 0.7
ArgArg: 5.995 ± 0.798
ArgSer: 3.969 ± 0.717
ArgThr: 3.378 ± 0.503
ArgVal: 4.475 ± 0.673
ArgTrp: 1.435 ± 0.402
ArgTyr: 2.955 ± 0.474
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 8.106 ± 0.867
SerCys: 0.591 ± 0.21
SerAsp: 3.462 ± 0.595
SerGlu: 3.462 ± 0.523
SerPhe: 1.435 ± 0.316
SerGly: 4.813 ± 0.739
SerHis: 0.676 ± 0.207
SerIle: 2.449 ± 0.389
SerLys: 2.195 ± 0.365
SerLeu: 5.742 ± 1.053
SerMet: 1.013 ± 0.257
SerAsn: 1.858 ± 0.4
SerPro: 3.378 ± 0.577
SerGln: 2.027 ± 0.349
SerArg: 3.124 ± 0.653
SerSer: 3.8 ± 0.612
SerThr: 3.884 ± 0.694
SerVal: 3.462 ± 0.487
SerTrp: 1.182 ± 0.32
SerTyr: 1.773 ± 0.409
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.431 ± 1.148
ThrCys: 0.253 ± 0.271
ThrAsp: 3.462 ± 0.568
ThrGlu: 3.124 ± 0.416
ThrPhe: 1.267 ± 0.341
ThrGly: 5.066 ± 0.864
ThrHis: 0.76 ± 0.281
ThrIle: 2.027 ± 0.415
ThrLys: 1.942 ± 0.495
ThrLeu: 5.32 ± 0.716
ThrMet: 1.182 ± 0.381
ThrAsn: 1.773 ± 0.328
ThrPro: 1.773 ± 0.408
ThrGln: 1.182 ± 0.267
ThrArg: 3.124 ± 0.465
ThrSer: 3.378 ± 0.648
ThrThr: 3.884 ± 0.712
ThrVal: 5.573 ± 0.857
ThrTrp: 1.013 ± 0.312
ThrTyr: 1.604 ± 0.387
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.599 ± 0.627
ValCys: 0.507 ± 0.183
ValAsp: 3.293 ± 0.455
ValGlu: 4.982 ± 0.632
ValPhe: 1.942 ± 0.35
ValGly: 4.644 ± 0.65
ValHis: 0.591 ± 0.158
ValIle: 2.955 ± 0.496
ValLys: 2.364 ± 0.432
ValLeu: 5.995 ± 0.807
ValMet: 1.435 ± 0.379
ValAsn: 1.942 ± 0.336
ValPro: 3.124 ± 0.463
ValGln: 3.631 ± 0.503
ValArg: 4.644 ± 0.474
ValSer: 3.631 ± 0.541
ValThr: 4.391 ± 0.587
ValVal: 3.8 ± 0.542
ValTrp: 1.182 ± 0.265
ValTyr: 2.618 ± 0.548
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.689 ± 0.311
TrpCys: 0.422 ± 0.162
TrpAsp: 0.422 ± 0.189
TrpGlu: 0.929 ± 0.264
TrpPhe: 0.591 ± 0.231
TrpGly: 1.013 ± 0.297
TrpHis: 0.084 ± 0.088
TrpIle: 1.013 ± 0.221
TrpLys: 0.844 ± 0.214
TrpLeu: 1.858 ± 0.412
TrpMet: 0.844 ± 0.32
TrpAsn: 0.507 ± 0.177
TrpPro: 0.76 ± 0.284
TrpGln: 1.013 ± 0.261
TrpArg: 1.351 ± 0.366
TrpSer: 1.435 ± 0.309
TrpThr: 0.929 ± 0.314
TrpVal: 1.267 ± 0.332
TrpTrp: 0.507 ± 0.212
TrpTyr: 0.169 ± 0.11
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.209 ± 0.507
TyrCys: 0.338 ± 0.185
TyrAsp: 1.689 ± 0.332
TyrGlu: 1.435 ± 0.405
TyrPhe: 1.098 ± 0.271
TyrGly: 2.449 ± 0.467
TyrHis: 0.507 ± 0.274
TyrIle: 1.267 ± 0.316
TyrLys: 0.591 ± 0.209
TyrLeu: 2.449 ± 0.419
TyrMet: 0.591 ± 0.283
TyrAsn: 0.76 ± 0.231
TyrPro: 1.689 ± 0.474
TyrGln: 1.351 ± 0.393
TyrArg: 1.858 ± 0.389
TyrSer: 1.858 ± 0.474
TyrThr: 1.52 ± 0.372
TyrVal: 1.773 ± 0.272
TyrTrp: 0.422 ± 0.201
TyrTyr: 0.507 ± 0.208
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 58 proteins (11844 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
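One way such an error estimate could be obtained is sketched below: whole proteins are resampled with replacement 100 times, the frequency of one dipeptide is recomputed for each resample, and the spread of those estimates is reported. This is only an assumption about the procedure; the page does not say whether the reported error is a standard deviation, and the function names, the one-letter codes, and the toy sequences are hypothetical.

```python
import random
import statistics

def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (per mille) over a list of sequences."""
    count = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                count += 1
    return 1000.0 * count / total if total else 0.0

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Sample standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)

# Toy input, not real JBD24 proteins.
toy_proteins = ["MAAL", "AAK", "MKKL"]
error_aa = bootstrap_error(toy_proteins, "AA")
```

Resampling at the protein level, rather than at the level of individual residues, keeps each protein's internal composition intact, so the estimate reflects variation between proteins.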

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski