Amino acid dipeptide frequency for Xylella phage Paz

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
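As a rough illustration of what a dipeptide frequency is, the sketch below counts overlapping pairs of adjacent residues. It is not the Proteome-pI implementation: the function names are hypothetical, and it assumes that pairs are counted within each protein only (never across protein boundaries) and that any letter outside the 20 standard one-letter codes is treated as Xaa ("X").

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_counts(proteins):
    """Count overlapping adjacent residue pairs within each protein."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard letters becomes 'X' (Xaa).
        residues = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        # zip(residues, residues[1:]) yields every pair of neighbours,
        # so a protein of length n contributes n - 1 dipeptides.
        counts.update(a + b for a, b in zip(residues, residues[1:]))
    return counts


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies as fractions that sum to 1."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}


# Toy input, not taken from the Xylella phage Paz proteome.
toy_proteins = ["MAAG", "AAXK"]
toy_counts = dipeptide_counts(toy_proteins)
toy_freqs = dipeptide_frequencies(toy_proteins)
```

With these assumptions, a proteome of 51 proteins and 13480 amino acids would yield 13480 - 51 = 13429 dipeptides, so a single occurrence corresponds to about 0.074‰, which happens to match the smallest non-zero value in the table below.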

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
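The unit conversion and the comparison against the uniform expectation can be sketched as follows. This is only an illustration: the names are hypothetical, and using exactly 1/400 as the cut-off between over- and underrepresented dipeptides is an assumption about how the colouring is decided, not something the page states.

```python
# Uniform expectation: 1 of 400 standard dipeptides, expressed in per mille.
UNIFORM_EXPECTED_PERMILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def classify(value_permille, expected=UNIFORM_EXPECTED_PERMILLE):
    """Label a dipeptide relative to the assumed uniform baseline."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "as expected"


# Three values copied from the table for this proteome.
examples = {"AlaAla": 12.464, "CysCys": 0.0, "AspAsn": 2.448}
labels = {pair: classify(value) for pair, value in examples.items()}
```

Note that a uniform baseline ignores the unequal abundance of individual amino acids, so a dipeptide of two common residues such as AlaAla will tend to exceed it.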

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 12.464 ± 1.585
AlaCys: 1.039 ± 0.299
AlaAsp: 6.9 ± 0.542
AlaGlu: 6.232 ± 0.787
AlaPhe: 2.819 ± 0.406
AlaGly: 8.977 ± 1.277
AlaHis: 1.41 ± 0.272
AlaIle: 5.045 ± 0.573
AlaLys: 5.861 ± 0.652
AlaLeu: 7.864 ± 1.034
AlaMet: 3.932 ± 0.654
AlaAsn: 4.6 ± 0.609
AlaPro: 4.303 ± 0.435
AlaGln: 5.713 ± 0.826
AlaArg: 6.603 ± 0.654
AlaSer: 4.377 ± 0.483
AlaThr: 5.638 ± 0.809
AlaVal: 6.603 ± 0.84
AlaTrp: 2.077 ± 0.321
AlaTyr: 3.561 ± 0.535
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.816 ± 0.217
CysCys: 0.0 ± 0.0
CysAsp: 0.519 ± 0.165
CysGlu: 0.297 ± 0.131
CysPhe: 0.223 ± 0.158
CysGly: 0.668 ± 0.27
CysHis: 0.223 ± 0.135
CysIle: 0.89 ± 0.302
CysLys: 0.445 ± 0.184
CysLeu: 0.594 ± 0.217
CysMet: 0.297 ± 0.129
CysAsn: 0.519 ± 0.196
CysPro: 0.297 ± 0.128
CysGln: 0.223 ± 0.117
CysArg: 0.148 ± 0.099
CysSer: 0.0 ± 0.0
CysThr: 0.223 ± 0.131
CysVal: 0.668 ± 0.166
CysTrp: 0.074 ± 0.067
CysTyr: 0.148 ± 0.093
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.9 ± 0.957
AspCys: 0.445 ± 0.176
AspAsp: 3.339 ± 0.506
AspGlu: 3.932 ± 0.48
AspPhe: 2.374 ± 0.411
AspGly: 4.674 ± 0.583
AspHis: 1.113 ± 0.287
AspIle: 3.858 ± 0.308
AspLys: 3.413 ± 0.447
AspLeu: 4.303 ± 0.517
AspMet: 2.226 ± 0.471
AspAsn: 2.448 ± 0.366
AspPro: 3.932 ± 0.432
AspGln: 2.226 ± 0.47
AspArg: 4.006 ± 0.559
AspSer: 3.19 ± 0.392
AspThr: 2.597 ± 0.57
AspVal: 3.709 ± 0.6
AspTrp: 1.187 ± 0.336
AspTyr: 2.077 ± 0.43
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.454 ± 0.593
GluCys: 0.297 ± 0.141
GluAsp: 3.339 ± 0.455
GluGlu: 3.264 ± 0.665
GluPhe: 2.745 ± 0.383
GluGly: 3.858 ± 0.469
GluHis: 1.484 ± 0.34
GluIle: 1.261 ± 0.28
GluLys: 2.3 ± 0.39
GluLeu: 4.229 ± 0.701
GluMet: 2.003 ± 0.381
GluAsn: 1.929 ± 0.361
GluPro: 2.077 ± 0.472
GluGln: 4.526 ± 0.655
GluArg: 4.377 ± 0.551
GluSer: 2.597 ± 0.374
GluThr: 3.264 ± 0.454
GluVal: 4.229 ± 0.684
GluTrp: 1.484 ± 0.271
GluTyr: 2.448 ± 0.414
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.487 ± 0.384
PheCys: 0.148 ± 0.105
PheAsp: 3.413 ± 0.586
PheGlu: 2.374 ± 0.296
PhePhe: 0.816 ± 0.236
PheGly: 2.671 ± 0.49
PheHis: 0.816 ± 0.229
PheIle: 1.855 ± 0.418
PheLys: 1.855 ± 0.339
PheLeu: 2.374 ± 0.344
PheMet: 0.816 ± 0.243
PheAsn: 1.558 ± 0.336
PhePro: 1.335 ± 0.316
PheGln: 1.484 ± 0.233
PheArg: 1.855 ± 0.355
PheSer: 2.3 ± 0.295
PheThr: 2.077 ± 0.304
PheVal: 2.077 ± 0.434
PheTrp: 0.371 ± 0.179
PheTyr: 1.41 ± 0.389
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.567 ± 1.091
GlyCys: 0.519 ± 0.205
GlyAsp: 4.897 ± 0.489
GlyGlu: 4.748 ± 0.579
GlyPhe: 2.597 ± 0.384
GlyGly: 7.716 ± 1.035
GlyHis: 1.039 ± 0.293
GlyIle: 4.08 ± 0.557
GlyLys: 5.267 ± 0.544
GlyLeu: 5.267 ± 0.716
GlyMet: 3.264 ± 0.663
GlyAsn: 3.858 ± 0.395
GlyPro: 1.929 ± 0.293
GlyGln: 2.522 ± 0.44
GlyArg: 4.377 ± 0.603
GlySer: 3.709 ± 0.517
GlyThr: 5.342 ± 0.643
GlyVal: 5.49 ± 0.667
GlyTrp: 1.484 ± 0.38
GlyTyr: 2.968 ± 0.397
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.706 ± 0.352
HisCys: 0.223 ± 0.126
HisAsp: 0.964 ± 0.284
HisGlu: 1.187 ± 0.286
HisPhe: 0.816 ± 0.215
HisGly: 1.41 ± 0.304
HisHis: 0.371 ± 0.175
HisIle: 0.445 ± 0.198
HisLys: 1.113 ± 0.329
HisLeu: 1.484 ± 0.235
HisMet: 0.964 ± 0.234
HisAsn: 0.816 ± 0.229
HisPro: 0.89 ± 0.223
HisGln: 0.519 ± 0.177
HisArg: 0.964 ± 0.227
HisSer: 1.113 ± 0.247
HisThr: 1.558 ± 0.312
HisVal: 1.484 ± 0.297
HisTrp: 0.519 ± 0.158
HisTyr: 0.742 ± 0.224
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.158 ± 0.629
IleCys: 0.668 ± 0.19
IleAsp: 3.116 ± 0.452
IleGlu: 3.042 ± 0.632
IlePhe: 1.039 ± 0.232
IleGly: 3.709 ± 0.521
IleHis: 0.89 ± 0.186
IleIle: 2.448 ± 0.499
IleLys: 2.226 ± 0.545
IleLeu: 3.487 ± 0.528
IleMet: 1.187 ± 0.279
IleAsn: 2.448 ± 0.325
IlePro: 2.151 ± 0.515
IleGln: 1.632 ± 0.328
IleArg: 3.413 ± 0.451
IleSer: 2.597 ± 0.454
IleThr: 3.19 ± 0.558
IleVal: 2.893 ± 0.402
IleTrp: 0.594 ± 0.221
IleTyr: 1.039 ± 0.258
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.38 ± 0.729
LysCys: 0.297 ± 0.133
LysAsp: 2.968 ± 0.43
LysGlu: 3.116 ± 0.411
LysPhe: 2.522 ± 0.448
LysGly: 3.339 ± 0.489
LysHis: 1.261 ± 0.279
LysIle: 2.226 ± 0.383
LysLys: 2.448 ± 0.442
LysLeu: 5.342 ± 0.727
LysMet: 1.484 ± 0.336
LysAsn: 2.077 ± 0.414
LysPro: 2.522 ± 0.414
LysGln: 2.448 ± 0.397
LysArg: 2.819 ± 0.538
LysSer: 2.968 ± 0.496
LysThr: 1.855 ± 0.375
LysVal: 4.08 ± 0.617
LysTrp: 1.187 ± 0.279
LysTyr: 1.781 ± 0.396
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.161 ± 0.853
LeuCys: 0.668 ± 0.187
LeuAsp: 5.564 ± 0.601
LeuGlu: 3.19 ± 0.468
LeuPhe: 2.448 ± 0.252
LeuGly: 6.009 ± 0.578
LeuHis: 1.855 ± 0.365
LeuIle: 4.303 ± 0.601
LeuLys: 4.006 ± 0.407
LeuLeu: 6.9 ± 0.794
LeuMet: 2.374 ± 0.431
LeuAsn: 2.745 ± 0.448
LeuPro: 3.339 ± 0.49
LeuGln: 3.042 ± 0.518
LeuArg: 5.416 ± 0.483
LeuSer: 5.861 ± 0.757
LeuThr: 4.897 ± 0.587
LeuVal: 4.822 ± 0.721
LeuTrp: 1.261 ± 0.319
LeuTyr: 2.522 ± 0.429
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.635 ± 0.785
MetCys: 0.148 ± 0.105
MetAsp: 2.597 ± 0.459
MetGlu: 1.558 ± 0.287
MetPhe: 1.039 ± 0.328
MetGly: 2.226 ± 0.683
MetHis: 0.594 ± 0.186
MetIle: 1.187 ± 0.314
MetLys: 1.261 ± 0.216
MetLeu: 3.042 ± 0.559
MetMet: 0.816 ± 0.221
MetAsn: 1.706 ± 0.373
MetPro: 0.964 ± 0.285
MetGln: 1.855 ± 0.29
MetArg: 2.374 ± 0.377
MetSer: 2.3 ± 0.316
MetThr: 2.151 ± 0.319
MetVal: 1.41 ± 0.25
MetTrp: 0.742 ± 0.169
MetTyr: 0.742 ± 0.266
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.19 ± 0.519
AsnCys: 0.074 ± 0.081
AsnAsp: 2.003 ± 0.453
AsnGlu: 2.077 ± 0.476
AsnPhe: 1.41 ± 0.324
AsnGly: 3.487 ± 0.473
AsnHis: 0.668 ± 0.3
AsnIle: 2.151 ± 0.432
AsnLys: 1.929 ± 0.441
AsnLeu: 4.006 ± 0.529
AsnMet: 1.558 ± 0.232
AsnAsn: 1.558 ± 0.295
AsnPro: 2.003 ± 0.401
AsnGln: 1.706 ± 0.385
AsnArg: 2.893 ± 0.44
AsnSer: 2.522 ± 0.608
AsnThr: 2.893 ± 0.402
AsnVal: 3.487 ± 0.432
AsnTrp: 0.594 ± 0.302
AsnTyr: 1.632 ± 0.471
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.303 ± 0.529
ProCys: 0.371 ± 0.166
ProAsp: 3.487 ± 0.573
ProGlu: 2.745 ± 0.377
ProPhe: 1.261 ± 0.268
ProGly: 3.116 ± 0.531
ProHis: 0.519 ± 0.255
ProIle: 1.781 ± 0.288
ProLys: 2.522 ± 0.62
ProLeu: 2.893 ± 0.395
ProMet: 1.335 ± 0.327
ProAsn: 2.077 ± 0.425
ProPro: 1.261 ± 0.31
ProGln: 1.855 ± 0.36
ProArg: 1.187 ± 0.306
ProSer: 1.781 ± 0.343
ProThr: 2.3 ± 0.433
ProVal: 3.042 ± 0.417
ProTrp: 0.668 ± 0.177
ProTyr: 1.484 ± 0.39
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.713 ± 0.91
GlnCys: 0.371 ± 0.183
GlnAsp: 1.855 ± 0.282
GlnGlu: 2.671 ± 0.434
GlnPhe: 1.632 ± 0.261
GlnGly: 2.671 ± 0.367
GlnHis: 0.668 ± 0.203
GlnIle: 1.929 ± 0.372
GlnLys: 2.226 ± 0.483
GlnLeu: 3.561 ± 0.559
GlnMet: 2.077 ± 0.425
GlnAsn: 1.113 ± 0.215
GlnPro: 1.929 ± 0.469
GlnGln: 2.448 ± 0.717
GlnArg: 2.819 ± 0.551
GlnSer: 2.151 ± 0.399
GlnThr: 1.855 ± 0.32
GlnVal: 3.19 ± 0.504
GlnTrp: 0.445 ± 0.188
GlnTyr: 2.374 ± 0.345
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.345 ± 0.649
ArgCys: 0.371 ± 0.148
ArgAsp: 3.858 ± 0.507
ArgGlu: 4.451 ± 0.662
ArgPhe: 2.671 ± 0.387
ArgGly: 4.155 ± 0.556
ArgHis: 0.964 ± 0.247
ArgIle: 3.635 ± 0.517
ArgLys: 3.858 ± 0.546
ArgLeu: 4.526 ± 0.72
ArgMet: 2.003 ± 0.377
ArgAsn: 2.522 ± 0.462
ArgPro: 1.41 ± 0.248
ArgGln: 2.077 ± 0.34
ArgArg: 3.116 ± 0.558
ArgSer: 2.745 ± 0.492
ArgThr: 3.635 ± 0.503
ArgVal: 3.858 ± 0.564
ArgTrp: 1.039 ± 0.29
ArgTyr: 2.3 ± 0.545
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.713 ± 0.559
SerCys: 0.519 ± 0.175
SerAsp: 3.042 ± 0.433
SerGlu: 2.448 ± 0.381
SerPhe: 2.151 ± 0.375
SerGly: 4.971 ± 0.693
SerHis: 0.816 ± 0.245
SerIle: 2.374 ± 0.456
SerLys: 2.745 ± 0.407
SerLeu: 5.638 ± 0.653
SerMet: 1.781 ± 0.424
SerAsn: 2.077 ± 0.427
SerPro: 2.522 ± 0.374
SerGln: 2.3 ± 0.489
SerArg: 2.893 ± 0.44
SerSer: 3.487 ± 0.53
SerThr: 3.635 ± 0.434
SerVal: 3.339 ± 0.523
SerTrp: 1.113 ± 0.254
SerTyr: 1.41 ± 0.304
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.084 ± 0.685
ThrCys: 0.371 ± 0.149
ThrAsp: 3.413 ± 0.59
ThrGlu: 2.968 ± 0.56
ThrPhe: 2.151 ± 0.313
ThrGly: 5.564 ± 0.558
ThrHis: 0.964 ± 0.241
ThrIle: 3.042 ± 0.498
ThrLys: 3.339 ± 0.372
ThrLeu: 4.674 ± 0.537
ThrMet: 1.484 ± 0.486
ThrAsn: 2.374 ± 0.515
ThrPro: 2.448 ± 0.488
ThrGln: 1.929 ± 0.398
ThrArg: 3.042 ± 0.491
ThrSer: 3.264 ± 0.395
ThrThr: 3.784 ± 0.575
ThrVal: 4.377 ± 0.688
ThrTrp: 0.742 ± 0.167
ThrTyr: 2.448 ± 0.363
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.971 ± 0.599
ValCys: 0.297 ± 0.162
ValAsp: 3.487 ± 0.64
ValGlu: 5.342 ± 0.544
ValPhe: 1.781 ± 0.319
ValGly: 5.564 ± 0.619
ValHis: 2.3 ± 0.354
ValIle: 2.448 ± 0.389
ValLys: 3.709 ± 0.591
ValLeu: 4.822 ± 0.571
ValMet: 1.261 ± 0.337
ValAsn: 3.264 ± 0.541
ValPro: 2.374 ± 0.425
ValGln: 3.339 ± 0.388
ValArg: 5.267 ± 0.523
ValSer: 4.229 ± 0.76
ValThr: 4.451 ± 0.65
ValVal: 5.787 ± 0.856
ValTrp: 1.039 ± 0.303
ValTyr: 2.671 ± 0.458
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.706 ± 0.417
TrpCys: 0.223 ± 0.113
TrpAsp: 1.187 ± 0.338
TrpGlu: 0.668 ± 0.186
TrpPhe: 1.187 ± 0.246
TrpGly: 0.89 ± 0.208
TrpHis: 0.445 ± 0.142
TrpIle: 0.594 ± 0.174
TrpLys: 0.668 ± 0.229
TrpLeu: 1.558 ± 0.345
TrpMet: 0.594 ± 0.197
TrpAsn: 0.742 ± 0.169
TrpPro: 0.519 ± 0.208
TrpGln: 0.742 ± 0.221
TrpArg: 1.039 ± 0.276
TrpSer: 1.261 ± 0.326
TrpThr: 1.039 ± 0.258
TrpVal: 1.113 ± 0.265
TrpTrp: 0.148 ± 0.103
TrpTyr: 0.668 ± 0.245
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.264 ± 0.412
TyrCys: 0.223 ± 0.123
TyrAsp: 2.003 ± 0.43
TyrGlu: 1.855 ± 0.331
TyrPhe: 1.41 ± 0.376
TyrGly: 2.968 ± 0.538
TyrHis: 0.89 ± 0.251
TyrIle: 2.374 ± 0.379
TyrLys: 1.929 ± 0.297
TyrLeu: 2.819 ± 0.495
TyrMet: 0.816 ± 0.193
TyrAsn: 1.41 ± 0.284
TyrPro: 1.855 ± 0.307
TyrGln: 1.113 ± 0.245
TyrArg: 1.929 ± 0.333
TyrSer: 2.522 ± 0.411
TyrThr: 2.077 ± 0.458
TyrVal: 2.745 ± 0.468
TyrTrp: 0.223 ± 0.107
TyrTyr: 1.484 ± 0.302
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 51 proteins (13480 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
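Bootstrapping at the protein level means resampling whole proteins with replacement and recomputing the statistic for each resampled proteome. The sketch below shows one way this could be done; it is not the Proteome-pI code. The function names are hypothetical, and reporting the standard deviation across replicates as the error is an assumption, since the page does not say which spread measure is used.

```python
import random
import statistics
from collections import Counter


def pair_frequency_permille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes, e.g. 'AA') in per mille."""
    counts = Counter()
    for seq in proteins:
        # Count overlapping adjacent pairs within each protein.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Spread of the dipeptide frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency_permille(resample, pair))
    # Assumption: the reported error is the standard deviation of the replicates.
    return statistics.stdev(estimates)


# Toy input, not taken from the Xylella phage Paz proteome.
toy_proteins = ["MAAG", "AAKK", "GAAA"]
toy_error = bootstrap_error(toy_proteins, "AA")
```

Resampling proteins rather than individual dipeptides keeps residues from the same protein together, so the estimate reflects variation between proteins.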

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski