Amino acid dipeptide frequency for Pectobacterium phage PhiM1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
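The counting described above can be sketched in a few lines of Python. This is only an illustration, not the code used by the database: the function names, the one-letter alphabet, the mapping of every non-standard letter to "X" (for Xaa) and the normalisation by the total number of adjacent pairs are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes
ALPHABET = STANDARD + "X"          # "X" stands in for Xaa (assumption of this sketch)

# Uniform expectation over the 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs.

    Overlapping pairs are counted inside each protein only, never across
    protein boundaries. Frequencies are normalised by the total pair count.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard letters to "X".
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_per_mille, expected=UNIFORM_PER_MILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"
```

With this labelling, a value such as 12.555‰ (AlaAla in the table) would fall in the "over" group and 0.886‰ (AlaCys) in the "under" group.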

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
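As a small worked example of the unit conversion, the Python sketch below turns a per mille value into a fraction and a percentage and compares it with the uniform expectation of 2.5‰. The helper names are hypothetical and chosen only for this illustration.

```python
UNIFORM_PER_MILLE = 1000.0 / 400  # 1/400 expressed in per mille, i.e. 2.5


def per_mille_to_fraction(value):
    """Convert per mille to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert per mille to percent (divide by 10)."""
    return value / 10.0


def fold_vs_uniform(value):
    """Ratio of an observed per mille value to the uniform expectation."""
    return value / UNIFORM_PER_MILLE


ala_ala = 12.555  # AlaAla value from the table, in per mille
print(per_mille_to_fraction(ala_ala))  # fraction of all dipeptides
print(per_mille_to_percent(ala_ala))   # same value in percent
print(fold_vs_uniform(ala_ala))        # roughly five times the uniform value
```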

For more information, see the sequence space article on Wikipedia.

Second residue (column order): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 12.555 ± 1.217
AlaCys: 0.886 ± 0.298
AlaAsp: 5.318 ± 0.728
AlaGlu: 5.761 ± 0.951
AlaPhe: 2.733 ± 0.489
AlaGly: 7.164 ± 0.685
AlaHis: 1.773 ± 0.381
AlaIle: 2.954 ± 0.508
AlaLys: 4.284 ± 0.709
AlaLeu: 10.044 ± 0.899
AlaMet: 2.437 ± 0.384
AlaAsn: 3.914 ± 0.638
AlaPro: 3.545 ± 0.602
AlaGln: 6.056 ± 0.822
AlaArg: 4.357 ± 0.587
AlaSer: 7.09 ± 0.882
AlaThr: 5.613 ± 0.911
AlaVal: 7.681 ± 0.862
AlaTrp: 1.403 ± 0.31
AlaTyr: 3.102 ± 0.51
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.295 ± 0.18
CysCys: 0.222 ± 0.131
CysAsp: 0.739 ± 0.264
CysGlu: 0.369 ± 0.154
CysPhe: 0.369 ± 0.155
CysGly: 0.665 ± 0.261
CysHis: 0.369 ± 0.159
CysIle: 0.812 ± 0.272
CysLys: 0.443 ± 0.212
CysLeu: 0.812 ± 0.233
CysMet: 0.591 ± 0.186
CysAsn: 0.812 ± 0.297
CysPro: 0.591 ± 0.221
CysGln: 0.443 ± 0.134
CysArg: 0.517 ± 0.183
CysSer: 0.812 ± 0.291
CysThr: 1.256 ± 0.329
CysVal: 0.665 ± 0.222
CysTrp: 0.295 ± 0.147
CysTyr: 0.886 ± 0.282
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.238 ± 0.746
AspCys: 0.148 ± 0.116
AspAsp: 4.284 ± 0.587
AspGlu: 3.397 ± 0.491
AspPhe: 1.773 ± 0.277
AspGly: 4.21 ± 0.504
AspHis: 0.665 ± 0.239
AspIle: 3.914 ± 0.444
AspLys: 2.806 ± 0.697
AspLeu: 4.727 ± 0.7
AspMet: 2.142 ± 0.367
AspAsn: 3.102 ± 0.479
AspPro: 1.92 ± 0.333
AspGln: 1.256 ± 0.347
AspArg: 2.733 ± 0.525
AspSer: 4.653 ± 0.556
AspThr: 4.874 ± 0.443
AspVal: 4.948 ± 0.556
AspTrp: 1.256 ± 0.289
AspTyr: 1.92 ± 0.446
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.948 ± 0.785
GluCys: 0.591 ± 0.26
GluAsp: 3.323 ± 0.568
GluGlu: 2.954 ± 0.671
GluPhe: 2.733 ± 0.497
GluGly: 2.88 ± 0.283
GluHis: 1.108 ± 0.297
GluIle: 2.437 ± 0.352
GluLys: 2.585 ± 0.474
GluLeu: 5.096 ± 0.53
GluMet: 1.625 ± 0.34
GluAsn: 1.551 ± 0.307
GluPro: 0.96 ± 0.214
GluGln: 3.545 ± 0.579
GluArg: 2.511 ± 0.519
GluSer: 3.102 ± 0.339
GluThr: 2.733 ± 0.463
GluVal: 3.619 ± 0.49
GluTrp: 0.517 ± 0.203
GluTyr: 2.437 ± 0.484
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.733 ± 0.431
PheCys: 0.074 ± 0.081
PheAsp: 2.733 ± 0.472
PheGlu: 1.108 ± 0.258
PhePhe: 0.886 ± 0.29
PheGly: 2.659 ± 0.442
PheHis: 0.443 ± 0.172
PheIle: 1.625 ± 0.371
PheLys: 1.699 ± 0.367
PheLeu: 2.437 ± 0.424
PheMet: 0.517 ± 0.163
PheAsn: 1.477 ± 0.512
PhePro: 1.256 ± 0.295
PheGln: 1.329 ± 0.296
PheArg: 1.699 ± 0.366
PheSer: 1.699 ± 0.402
PheThr: 1.551 ± 0.421
PheVal: 2.216 ± 0.361
PheTrp: 0.295 ± 0.109
PheTyr: 0.739 ± 0.239
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.533 ± 1.009
GlyCys: 1.625 ± 0.546
GlyAsp: 4.579 ± 0.693
GlyGlu: 2.88 ± 0.456
GlyPhe: 2.659 ± 0.291
GlyGly: 5.244 ± 0.643
GlyHis: 0.812 ± 0.27
GlyIle: 4.801 ± 0.728
GlyLys: 3.988 ± 0.542
GlyLeu: 6.499 ± 0.67
GlyMet: 1.994 ± 0.382
GlyAsn: 3.693 ± 0.671
GlyPro: 1.108 ± 0.324
GlyGln: 2.29 ± 0.447
GlyArg: 4.357 ± 0.523
GlySer: 5.318 ± 0.683
GlyThr: 6.647 ± 0.653
GlyVal: 6.573 ± 0.616
GlyTrp: 0.812 ± 0.265
GlyTyr: 3.693 ± 0.52
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.92 ± 0.415
HisCys: 0.369 ± 0.146
HisAsp: 0.96 ± 0.299
HisGlu: 1.108 ± 0.361
HisPhe: 0.369 ± 0.158
HisGly: 1.625 ± 0.378
HisHis: 0.295 ± 0.124
HisIle: 1.182 ± 0.263
HisLys: 0.739 ± 0.318
HisLeu: 1.846 ± 0.418
HisMet: 0.295 ± 0.147
HisAsn: 0.665 ± 0.203
HisPro: 1.108 ± 0.293
HisGln: 0.739 ± 0.215
HisArg: 0.96 ± 0.266
HisSer: 1.034 ± 0.279
HisThr: 0.665 ± 0.223
HisVal: 1.551 ± 0.335
HisTrp: 0.443 ± 0.18
HisTyr: 0.591 ± 0.279
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.062 ± 0.485
IleCys: 0.591 ± 0.225
IleAsp: 3.767 ± 0.563
IleGlu: 2.585 ± 0.507
IlePhe: 0.812 ± 0.169
IleGly: 2.511 ± 0.356
IleHis: 0.665 ± 0.206
IleIle: 1.92 ± 0.454
IleLys: 2.733 ± 0.367
IleLeu: 3.767 ± 0.463
IleMet: 0.96 ± 0.275
IleAsn: 2.363 ± 0.573
IlePro: 2.216 ± 0.304
IleGln: 2.29 ± 0.361
IleArg: 1.699 ± 0.289
IleSer: 3.767 ± 0.46
IleThr: 4.357 ± 0.688
IleVal: 2.659 ± 0.421
IleTrp: 0.665 ± 0.262
IleTyr: 1.034 ± 0.286
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.096 ± 0.95
LysCys: 0.222 ± 0.123
LysAsp: 3.176 ± 0.398
LysGlu: 3.545 ± 0.621
LysPhe: 0.443 ± 0.185
LysGly: 2.88 ± 0.481
LysHis: 0.739 ± 0.244
LysIle: 1.329 ± 0.242
LysLys: 2.068 ± 0.613
LysLeu: 5.022 ± 0.615
LysMet: 1.108 ± 0.288
LysAsn: 1.551 ± 0.384
LysPro: 1.994 ± 0.308
LysGln: 2.216 ± 0.528
LysArg: 2.733 ± 0.404
LysSer: 2.142 ± 0.368
LysThr: 2.142 ± 0.481
LysVal: 3.176 ± 0.455
LysTrp: 0.665 ± 0.259
LysTyr: 2.216 ± 0.471
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.312 ± 0.812
LeuCys: 1.182 ± 0.355
LeuAsp: 4.948 ± 0.613
LeuGlu: 4.579 ± 0.587
LeuPhe: 2.437 ± 0.406
LeuGly: 7.09 ± 0.855
LeuHis: 1.773 ± 0.315
LeuIle: 3.102 ± 0.719
LeuLys: 3.545 ± 0.446
LeuLeu: 7.533 ± 0.868
LeuMet: 2.142 ± 0.335
LeuAsn: 4.579 ± 0.686
LeuPro: 4.948 ± 0.599
LeuGln: 3.767 ± 0.48
LeuArg: 5.465 ± 0.771
LeuSer: 6.13 ± 0.643
LeuThr: 5.318 ± 0.669
LeuVal: 7.681 ± 0.582
LeuTrp: 0.591 ± 0.241
LeuTyr: 3.102 ± 0.466
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.216 ± 0.418
MetCys: 0.295 ± 0.128
MetAsp: 1.034 ± 0.225
MetGlu: 0.886 ± 0.266
MetPhe: 1.034 ± 0.274
MetGly: 2.29 ± 0.449
MetHis: 0.739 ± 0.243
MetIle: 0.591 ± 0.221
MetLys: 0.517 ± 0.161
MetLeu: 2.142 ± 0.424
MetMet: 0.443 ± 0.131
MetAsn: 1.034 ± 0.227
MetPro: 1.403 ± 0.487
MetGln: 2.142 ± 0.52
MetArg: 1.994 ± 0.443
MetSer: 1.92 ± 0.479
MetThr: 1.329 ± 0.273
MetVal: 1.846 ± 0.373
MetTrp: 0.222 ± 0.116
MetTyr: 1.182 ± 0.257
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.062 ± 0.635
AsnCys: 0.812 ± 0.267
AsnAsp: 1.773 ± 0.295
AsnGlu: 1.92 ± 0.371
AsnPhe: 1.329 ± 0.369
AsnGly: 4.136 ± 0.522
AsnHis: 0.517 ± 0.214
AsnIle: 2.068 ± 0.542
AsnLys: 2.585 ± 0.373
AsnLeu: 4.653 ± 0.768
AsnMet: 1.403 ± 0.31
AsnAsn: 2.954 ± 0.541
AsnPro: 2.068 ± 0.414
AsnGln: 2.216 ± 0.403
AsnArg: 1.846 ± 0.294
AsnSer: 3.25 ± 0.654
AsnThr: 3.914 ± 0.72
AsnVal: 2.806 ± 0.492
AsnTrp: 0.591 ± 0.218
AsnTyr: 0.591 ± 0.181
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.84 ± 0.439
ProCys: 0.148 ± 0.104
ProAsp: 3.84 ± 0.512
ProGlu: 3.102 ± 0.466
ProPhe: 0.517 ± 0.171
ProGly: 2.585 ± 0.44
ProHis: 0.443 ± 0.222
ProIle: 1.625 ± 0.331
ProLys: 1.846 ± 0.37
ProLeu: 2.437 ± 0.4
ProMet: 0.812 ± 0.237
ProAsn: 1.329 ± 0.28
ProPro: 1.551 ± 0.351
ProGln: 1.329 ± 0.376
ProArg: 1.403 ± 0.267
ProSer: 2.216 ± 0.383
ProThr: 3.323 ± 0.713
ProVal: 3.693 ± 0.4
ProTrp: 0.886 ± 0.329
ProTyr: 1.551 ± 0.253
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 6.204 ± 0.69
GlnCys: 0.369 ± 0.181
GlnAsp: 2.733 ± 0.459
GlnGlu: 2.363 ± 0.53
GlnPhe: 1.699 ± 0.288
GlnGly: 3.693 ± 0.617
GlnHis: 1.403 ± 0.336
GlnIle: 1.477 ± 0.394
GlnLys: 1.773 ± 0.371
GlnLeu: 4.136 ± 0.589
GlnMet: 1.256 ± 0.327
GlnAsn: 2.363 ± 0.53
GlnPro: 1.477 ± 0.314
GlnGln: 3.323 ± 0.79
GlnArg: 2.954 ± 0.475
GlnSer: 3.028 ± 0.582
GlnThr: 2.142 ± 0.393
GlnVal: 3.397 ± 0.47
GlnTrp: 0.591 ± 0.16
GlnTyr: 2.733 ± 0.45
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.136 ± 0.448
ArgCys: 0.96 ± 0.268
ArgAsp: 3.619 ± 0.567
ArgGlu: 3.028 ± 0.469
ArgPhe: 1.329 ± 0.352
ArgGly: 3.988 ± 0.54
ArgHis: 1.108 ± 0.225
ArgIle: 3.323 ± 0.587
ArgLys: 2.363 ± 0.393
ArgLeu: 3.767 ± 0.484
ArgMet: 1.846 ± 0.368
ArgAsn: 2.954 ± 0.52
ArgPro: 1.403 ± 0.365
ArgGln: 2.216 ± 0.366
ArgArg: 4.727 ± 0.579
ArgSer: 3.397 ± 0.712
ArgThr: 3.545 ± 0.624
ArgVal: 3.693 ± 0.632
ArgTrp: 1.108 ± 0.242
ArgTyr: 2.068 ± 0.407
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.681 ± 0.838
SerCys: 0.812 ± 0.226
SerAsp: 3.988 ± 0.447
SerGlu: 2.29 ± 0.454
SerPhe: 2.068 ± 0.419
SerGly: 6.204 ± 0.754
SerHis: 1.108 ± 0.323
SerIle: 3.988 ± 0.656
SerLys: 2.806 ± 0.514
SerLeu: 5.391 ± 0.525
SerMet: 1.994 ± 0.349
SerAsn: 2.88 ± 0.667
SerPro: 1.699 ± 0.361
SerGln: 2.733 ± 0.42
SerArg: 3.176 ± 0.423
SerSer: 4.21 ± 0.588
SerThr: 4.505 ± 0.691
SerVal: 6.278 ± 0.719
SerTrp: 0.96 ± 0.303
SerTyr: 1.699 ± 0.346
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.386 ± 0.948
ThrCys: 0.812 ± 0.241
ThrAsp: 3.545 ± 0.689
ThrGlu: 3.397 ± 0.408
ThrPhe: 1.699 ± 0.392
ThrGly: 7.312 ± 0.909
ThrHis: 1.699 ± 0.383
ThrIle: 2.29 ± 0.457
ThrLys: 3.25 ± 0.344
ThrLeu: 5.17 ± 0.637
ThrMet: 1.034 ± 0.24
ThrAsn: 2.733 ± 0.498
ThrPro: 3.693 ± 0.554
ThrGln: 2.216 ± 0.478
ThrArg: 3.323 ± 0.457
ThrSer: 4.579 ± 0.671
ThrThr: 4.21 ± 1.144
ThrVal: 5.244 ± 0.871
ThrTrp: 0.517 ± 0.179
ThrTyr: 2.806 ± 0.496
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.835 ± 0.705
ValCys: 0.96 ± 0.325
ValAsp: 4.357 ± 0.59
ValGlu: 3.028 ± 0.457
ValPhe: 2.511 ± 0.362
ValGly: 6.056 ± 0.559
ValHis: 1.92 ± 0.409
ValIle: 3.323 ± 0.428
ValLys: 2.954 ± 0.625
ValLeu: 7.238 ± 0.726
ValMet: 1.477 ± 0.341
ValAsn: 3.102 ± 0.553
ValPro: 3.471 ± 0.479
ValGln: 6.204 ± 0.669
ValArg: 4.357 ± 0.618
ValSer: 4.727 ± 0.577
ValThr: 5.244 ± 0.714
ValVal: 4.948 ± 0.577
ValTrp: 0.739 ± 0.291
ValTyr: 3.619 ± 0.53
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.034 ± 0.281
TrpCys: 0.222 ± 0.116
TrpAsp: 0.739 ± 0.212
TrpGlu: 0.739 ± 0.251
TrpPhe: 0.591 ± 0.219
TrpGly: 1.329 ± 0.415
TrpHis: 0.148 ± 0.092
TrpIle: 0.295 ± 0.174
TrpLys: 0.222 ± 0.115
TrpLeu: 1.551 ± 0.39
TrpMet: 0.222 ± 0.123
TrpAsn: 0.665 ± 0.264
TrpPro: 0.369 ± 0.198
TrpGln: 0.886 ± 0.297
TrpArg: 0.591 ± 0.215
TrpSer: 0.739 ± 0.204
TrpThr: 0.591 ± 0.228
TrpVal: 1.108 ± 0.258
TrpTrp: 0.148 ± 0.102
TrpTyr: 1.256 ± 0.358
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.659 ± 0.467
TyrCys: 0.665 ± 0.237
TyrAsp: 2.511 ± 0.323
TyrGlu: 2.216 ± 0.421
TyrPhe: 1.182 ± 0.337
TyrGly: 2.585 ± 0.529
TyrHis: 0.739 ± 0.22
TyrIle: 2.363 ± 0.443
TyrLys: 1.256 ± 0.331
TyrLeu: 2.954 ± 0.524
TyrMet: 0.886 ± 0.3
TyrAsn: 1.773 ± 0.406
TyrPro: 1.699 ± 0.342
TyrGln: 1.92 ± 0.354
TyrArg: 3.176 ± 0.559
TyrSer: 2.659 ± 0.487
TyrThr: 2.954 ± 0.636
TyrVal: 2.29 ± 0.392
TyrTrp: 0.739 ± 0.321
TyrTyr: 1.256 ± 0.331
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 52 proteins (13541 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
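A protein-level bootstrap of this kind could look roughly like the Python sketch below. The exact procedure used by the database is not described here, so everything in it is an assumption made for illustration: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the standard deviation across replicates is reported as the error. The function names and the fixed seed are hypothetical.

```python
import random
import statistics


def count_pair(seq, dipeptide):
    """Count overlapping occurrences of a two-letter dipeptide in one sequence."""
    return sum(1 for i in range(len(seq) - 1) if seq[i:i + 2] == dipeptide)


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Each replicate draws len(proteins) whole proteins with replacement,
    so the resampling unit is the protein, not the individual residue.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_replicates):
        sample = rng.choices(proteins, k=len(proteins))
        total_pairs = sum(max(len(s) - 1, 0) for s in sample)
        hits = sum(count_pair(s, dipeptide) for s in sample)
        estimates.append(1000.0 * hits / total_pairs if total_pairs else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)
```

Resampling whole proteins rather than single residues keeps adjacent pairs intact, which is presumably why the note says the bootstrap is done at the protein level.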

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski