Amino acid dipepetide frequency for Pectobacterium phage CX5-1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.077AlaAla: 11.077 ± 1.023
0.796AlaCys: 0.796 ± 0.271
8.036AlaAsp: 8.036 ± 0.683
5.43AlaGlu: 5.43 ± 0.802
3.475AlaPhe: 3.475 ± 0.485
6.588AlaGly: 6.588 ± 0.782
2.462AlaHis: 2.462 ± 0.545
4.199AlaIle: 4.199 ± 0.547
3.403AlaLys: 3.403 ± 0.573
9.485AlaLeu: 9.485 ± 0.884
2.679AlaMet: 2.679 ± 0.448
3.692AlaAsn: 3.692 ± 0.511
3.33AlaPro: 3.33 ± 0.657
4.778AlaGln: 4.778 ± 0.678
4.923AlaArg: 4.923 ± 0.637
5.213AlaSer: 5.213 ± 0.654
5.43AlaThr: 5.43 ± 0.888
7.312AlaVal: 7.312 ± 0.766
0.941AlaTrp: 0.941 ± 0.213
3.403AlaTyr: 3.403 ± 0.432
0.0AlaXaa: 0.0 ± 0.0
Cys
0.579CysAla: 0.579 ± 0.271
0.145CysCys: 0.145 ± 0.106
1.014CysAsp: 1.014 ± 0.312
0.29CysGlu: 0.29 ± 0.136
0.434CysPhe: 0.434 ± 0.218
0.362CysGly: 0.362 ± 0.145
0.507CysHis: 0.507 ± 0.236
0.434CysIle: 0.434 ± 0.181
0.434CysLys: 0.434 ± 0.261
0.652CysLeu: 0.652 ± 0.217
0.434CysMet: 0.434 ± 0.184
0.434CysAsn: 0.434 ± 0.133
0.579CysPro: 0.579 ± 0.212
0.29CysGln: 0.29 ± 0.13
0.579CysArg: 0.579 ± 0.229
0.869CysSer: 0.869 ± 0.276
0.724CysThr: 0.724 ± 0.285
0.869CysVal: 0.869 ± 0.292
0.217CysTrp: 0.217 ± 0.129
0.796CysTyr: 0.796 ± 0.217
0.0CysXaa: 0.0 ± 0.0
Asp
6.733AspAla: 6.733 ± 0.62
0.869AspCys: 0.869 ± 0.246
4.416AspAsp: 4.416 ± 0.748
3.258AspGlu: 3.258 ± 0.501
1.955AspPhe: 1.955 ± 0.288
5.285AspGly: 5.285 ± 0.765
0.941AspHis: 0.941 ± 0.254
3.475AspIle: 3.475 ± 0.429
3.041AspLys: 3.041 ± 0.402
5.502AspLeu: 5.502 ± 0.515
2.317AspMet: 2.317 ± 0.315
3.113AspAsn: 3.113 ± 0.467
1.955AspPro: 1.955 ± 0.284
1.086AspGln: 1.086 ± 0.276
2.462AspArg: 2.462 ± 0.386
5.14AspSer: 5.14 ± 0.53
5.14AspThr: 5.14 ± 0.572
5.358AspVal: 5.358 ± 0.538
1.231AspTrp: 1.231 ± 0.267
2.172AspTyr: 2.172 ± 0.375
0.0AspXaa: 0.0 ± 0.0
Glu
4.778GluAla: 4.778 ± 0.653
0.145GluCys: 0.145 ± 0.092
2.679GluAsp: 2.679 ± 0.489
2.751GluGlu: 2.751 ± 0.502
2.534GluPhe: 2.534 ± 0.369
2.824GluGly: 2.824 ± 0.462
1.086GluHis: 1.086 ± 0.323
1.665GluIle: 1.665 ± 0.358
1.738GluLys: 1.738 ± 0.436
5.213GluLeu: 5.213 ± 0.657
2.027GluMet: 2.027 ± 0.418
1.665GluAsn: 1.665 ± 0.354
0.941GluPro: 0.941 ± 0.24
2.679GluGln: 2.679 ± 0.603
2.968GluArg: 2.968 ± 0.48
3.475GluSer: 3.475 ± 0.488
2.606GluThr: 2.606 ± 0.494
4.416GluVal: 4.416 ± 0.573
0.579GluTrp: 0.579 ± 0.286
2.896GluTyr: 2.896 ± 0.542
0.0GluXaa: 0.0 ± 0.0
Phe
2.751PheAla: 2.751 ± 0.448
0.072PheCys: 0.072 ± 0.075
2.968PheAsp: 2.968 ± 0.478
1.158PheGlu: 1.158 ± 0.316
1.448PhePhe: 1.448 ± 0.341
2.606PheGly: 2.606 ± 0.359
0.652PheHis: 0.652 ± 0.24
1.231PheIle: 1.231 ± 0.343
2.027PheLys: 2.027 ± 0.328
1.955PheLeu: 1.955 ± 0.347
0.724PheMet: 0.724 ± 0.185
1.738PheAsn: 1.738 ± 0.372
1.448PhePro: 1.448 ± 0.311
0.869PheGln: 0.869 ± 0.235
1.665PheArg: 1.665 ± 0.407
2.317PheSer: 2.317 ± 0.479
1.593PheThr: 1.593 ± 0.319
2.389PheVal: 2.389 ± 0.279
0.434PheTrp: 0.434 ± 0.18
0.724PheTyr: 0.724 ± 0.218
0.0PheXaa: 0.0 ± 0.0
Gly
7.095GlyAla: 7.095 ± 0.698
1.448GlyCys: 1.448 ± 0.437
4.416GlyAsp: 4.416 ± 0.737
2.968GlyGlu: 2.968 ± 0.387
2.462GlyPhe: 2.462 ± 0.297
5.792GlyGly: 5.792 ± 0.724
0.724GlyHis: 0.724 ± 0.237
4.706GlyIle: 4.706 ± 0.507
3.62GlyLys: 3.62 ± 0.669
5.575GlyLeu: 5.575 ± 0.548
2.462GlyMet: 2.462 ± 0.415
2.679GlyAsn: 2.679 ± 0.541
1.376GlyPro: 1.376 ± 0.41
2.534GlyGln: 2.534 ± 0.349
3.548GlyArg: 3.548 ± 0.492
5.068GlySer: 5.068 ± 0.582
8.109GlyThr: 8.109 ± 1.062
6.95GlyVal: 6.95 ± 0.867
1.086GlyTrp: 1.086 ± 0.325
3.33GlyTyr: 3.33 ± 0.627
0.0GlyXaa: 0.0 ± 0.0
His
1.448HisAla: 1.448 ± 0.322
0.29HisCys: 0.29 ± 0.127
1.376HisAsp: 1.376 ± 0.304
1.303HisGlu: 1.303 ± 0.418
0.29HisPhe: 0.29 ± 0.137
1.738HisGly: 1.738 ± 0.436
0.362HisHis: 0.362 ± 0.158
1.086HisIle: 1.086 ± 0.209
1.158HisLys: 1.158 ± 0.352
2.172HisLeu: 2.172 ± 0.427
0.579HisMet: 0.579 ± 0.197
1.086HisAsn: 1.086 ± 0.253
1.014HisPro: 1.014 ± 0.303
0.869HisGln: 0.869 ± 0.202
1.448HisArg: 1.448 ± 0.285
1.014HisSer: 1.014 ± 0.269
0.652HisThr: 0.652 ± 0.225
1.738HisVal: 1.738 ± 0.405
0.362HisTrp: 0.362 ± 0.162
0.941HisTyr: 0.941 ± 0.289
0.0HisXaa: 0.0 ± 0.0
Ile
4.199IleAla: 4.199 ± 0.557
0.652IleCys: 0.652 ± 0.175
3.258IleAsp: 3.258 ± 0.465
2.679IleGlu: 2.679 ± 0.61
0.652IlePhe: 0.652 ± 0.246
3.403IleGly: 3.403 ± 0.429
1.231IleHis: 1.231 ± 0.287
1.303IleIle: 1.303 ± 0.324
2.824IleLys: 2.824 ± 0.406
3.548IleLeu: 3.548 ± 0.626
1.231IleMet: 1.231 ± 0.242
2.824IleAsn: 2.824 ± 0.437
2.027IlePro: 2.027 ± 0.348
1.593IleGln: 1.593 ± 0.409
2.027IleArg: 2.027 ± 0.453
2.534IleSer: 2.534 ± 0.326
3.982IleThr: 3.982 ± 0.695
2.751IleVal: 2.751 ± 0.4
0.507IleTrp: 0.507 ± 0.257
0.796IleTyr: 0.796 ± 0.246
0.0IleXaa: 0.0 ± 0.0
Lys
4.996LysAla: 4.996 ± 0.731
0.362LysCys: 0.362 ± 0.192
2.679LysAsp: 2.679 ± 0.406
2.679LysGlu: 2.679 ± 0.427
0.652LysPhe: 0.652 ± 0.232
2.968LysGly: 2.968 ± 0.404
0.796LysHis: 0.796 ± 0.302
1.231LysIle: 1.231 ± 0.255
1.231LysLys: 1.231 ± 0.381
5.068LysLeu: 5.068 ± 0.552
1.448LysMet: 1.448 ± 0.344
1.158LysAsn: 1.158 ± 0.251
2.027LysPro: 2.027 ± 0.4
1.882LysGln: 1.882 ± 0.471
3.258LysArg: 3.258 ± 0.49
1.738LysSer: 1.738 ± 0.325
2.1LysThr: 2.1 ± 0.407
2.824LysVal: 2.824 ± 0.49
0.507LysTrp: 0.507 ± 0.206
2.027LysTyr: 2.027 ± 0.432
0.0LysXaa: 0.0 ± 0.0
Leu
7.24LeuAla: 7.24 ± 0.777
1.52LeuCys: 1.52 ± 0.314
5.14LeuAsp: 5.14 ± 0.549
4.561LeuGlu: 4.561 ± 0.547
2.389LeuPhe: 2.389 ± 0.482
6.516LeuGly: 6.516 ± 0.757
1.52LeuHis: 1.52 ± 0.333
3.692LeuIle: 3.692 ± 0.616
3.113LeuLys: 3.113 ± 0.417
6.444LeuLeu: 6.444 ± 0.622
2.534LeuMet: 2.534 ± 0.4
4.272LeuAsn: 4.272 ± 0.47
4.416LeuPro: 4.416 ± 0.521
3.041LeuGln: 3.041 ± 0.455
7.312LeuArg: 7.312 ± 0.724
7.747LeuSer: 7.747 ± 0.98
5.72LeuThr: 5.72 ± 0.655
7.24LeuVal: 7.24 ± 0.665
0.579LeuTrp: 0.579 ± 0.259
2.968LeuTyr: 2.968 ± 0.422
0.0LeuXaa: 0.0 ± 0.0
Met
2.824MetAla: 2.824 ± 0.539
0.434MetCys: 0.434 ± 0.207
1.376MetAsp: 1.376 ± 0.334
1.158MetGlu: 1.158 ± 0.243
1.376MetPhe: 1.376 ± 0.382
1.955MetGly: 1.955 ± 0.448
0.796MetHis: 0.796 ± 0.214
1.014MetIle: 1.014 ± 0.236
1.158MetLys: 1.158 ± 0.269
2.824MetLeu: 2.824 ± 0.46
0.796MetMet: 0.796 ± 0.242
0.434MetAsn: 0.434 ± 0.185
1.376MetPro: 1.376 ± 0.33
2.1MetGln: 2.1 ± 0.476
2.027MetArg: 2.027 ± 0.457
1.665MetSer: 1.665 ± 0.368
2.027MetThr: 2.027 ± 0.351
2.172MetVal: 2.172 ± 0.453
0.29MetTrp: 0.29 ± 0.134
1.231MetTyr: 1.231 ± 0.235
0.0MetXaa: 0.0 ± 0.0
Asn
3.837AsnAla: 3.837 ± 0.535
0.434AsnCys: 0.434 ± 0.208
1.81AsnAsp: 1.81 ± 0.37
2.1AsnGlu: 2.1 ± 0.346
1.738AsnPhe: 1.738 ± 0.279
4.127AsnGly: 4.127 ± 0.559
0.796AsnHis: 0.796 ± 0.209
2.172AsnIle: 2.172 ± 0.357
2.462AsnLys: 2.462 ± 0.392
3.62AsnLeu: 3.62 ± 0.484
1.448AsnMet: 1.448 ± 0.295
1.665AsnAsn: 1.665 ± 0.292
1.52AsnPro: 1.52 ± 0.292
1.955AsnGln: 1.955 ± 0.607
2.317AsnArg: 2.317 ± 0.376
2.679AsnSer: 2.679 ± 0.586
3.041AsnThr: 3.041 ± 0.428
2.751AsnVal: 2.751 ± 0.369
0.796AsnTrp: 0.796 ± 0.254
0.579AsnTyr: 0.579 ± 0.192
0.0AsnXaa: 0.0 ± 0.0
Pro
3.113ProAla: 3.113 ± 0.491
0.217ProCys: 0.217 ± 0.117
3.258ProAsp: 3.258 ± 0.497
3.258ProGlu: 3.258 ± 0.532
0.724ProPhe: 0.724 ± 0.199
2.244ProGly: 2.244 ± 0.262
0.507ProHis: 0.507 ± 0.143
1.738ProIle: 1.738 ± 0.435
1.231ProLys: 1.231 ± 0.323
2.824ProLeu: 2.824 ± 0.433
1.303ProMet: 1.303 ± 0.255
1.376ProAsn: 1.376 ± 0.395
2.027ProPro: 2.027 ± 0.513
1.738ProGln: 1.738 ± 0.362
2.027ProArg: 2.027 ± 0.359
2.389ProSer: 2.389 ± 0.388
3.765ProThr: 3.765 ± 0.591
3.692ProVal: 3.692 ± 0.553
0.579ProTrp: 0.579 ± 0.204
1.738ProTyr: 1.738 ± 0.259
0.0ProXaa: 0.0 ± 0.0
Gln
4.996GlnAla: 4.996 ± 0.742
0.072GlnCys: 0.072 ± 0.071
2.1GlnAsp: 2.1 ± 0.461
2.606GlnGlu: 2.606 ± 0.454
1.448GlnPhe: 1.448 ± 0.299
4.127GlnGly: 4.127 ± 0.56
1.448GlnHis: 1.448 ± 0.378
1.882GlnIle: 1.882 ± 0.419
1.231GlnLys: 1.231 ± 0.342
4.054GlnLeu: 4.054 ± 0.538
1.448GlnMet: 1.448 ± 0.368
2.027GlnAsn: 2.027 ± 0.495
1.303GlnPro: 1.303 ± 0.365
2.751GlnGln: 2.751 ± 0.615
2.751GlnArg: 2.751 ± 0.482
2.389GlnSer: 2.389 ± 0.495
1.955GlnThr: 1.955 ± 0.326
3.62GlnVal: 3.62 ± 0.596
0.579GlnTrp: 0.579 ± 0.205
2.317GlnTyr: 2.317 ± 0.403
0.0GlnXaa: 0.0 ± 0.0
Arg
5.358ArgAla: 5.358 ± 0.629
0.652ArgCys: 0.652 ± 0.218
4.344ArgAsp: 4.344 ± 0.496
2.679ArgGlu: 2.679 ± 0.463
1.52ArgPhe: 1.52 ± 0.376
4.272ArgGly: 4.272 ± 0.535
1.448ArgHis: 1.448 ± 0.321
3.475ArgIle: 3.475 ± 0.604
2.679ArgLys: 2.679 ± 0.447
4.344ArgLeu: 4.344 ± 0.609
1.52ArgMet: 1.52 ± 0.313
2.751ArgAsn: 2.751 ± 0.497
1.738ArgPro: 1.738 ± 0.254
3.258ArgGln: 3.258 ± 0.406
4.923ArgArg: 4.923 ± 0.574
3.548ArgSer: 3.548 ± 0.637
3.403ArgThr: 3.403 ± 0.514
4.054ArgVal: 4.054 ± 0.507
0.652ArgTrp: 0.652 ± 0.182
2.317ArgTyr: 2.317 ± 0.434
0.0ArgXaa: 0.0 ± 0.0
Ser
6.806SerAla: 6.806 ± 0.67
0.579SerCys: 0.579 ± 0.173
3.692SerAsp: 3.692 ± 0.489
2.317SerGlu: 2.317 ± 0.363
1.52SerPhe: 1.52 ± 0.317
5.72SerGly: 5.72 ± 0.817
1.086SerHis: 1.086 ± 0.215
2.824SerIle: 2.824 ± 0.48
3.33SerLys: 3.33 ± 0.421
5.937SerLeu: 5.937 ± 0.713
1.738SerMet: 1.738 ± 0.364
2.534SerAsn: 2.534 ± 0.482
2.244SerPro: 2.244 ± 0.366
2.896SerGln: 2.896 ± 0.37
2.534SerArg: 2.534 ± 0.326
2.824SerSer: 2.824 ± 0.467
6.226SerThr: 6.226 ± 0.826
5.14SerVal: 5.14 ± 0.5
1.086SerTrp: 1.086 ± 0.327
1.882SerTyr: 1.882 ± 0.319
0.0SerXaa: 0.0 ± 0.0
Thr
8.254ThrAla: 8.254 ± 0.745
0.579ThrCys: 0.579 ± 0.186
4.489ThrAsp: 4.489 ± 0.573
2.896ThrGlu: 2.896 ± 0.502
1.955ThrPhe: 1.955 ± 0.421
6.082ThrGly: 6.082 ± 0.746
2.027ThrHis: 2.027 ± 0.375
2.534ThrIle: 2.534 ± 0.405
2.244ThrLys: 2.244 ± 0.305
6.154ThrLeu: 6.154 ± 0.955
0.724ThrMet: 0.724 ± 0.187
2.534ThrAsn: 2.534 ± 0.491
3.548ThrPro: 3.548 ± 0.474
2.389ThrGln: 2.389 ± 0.363
3.548ThrArg: 3.548 ± 0.474
4.272ThrSer: 4.272 ± 0.64
4.996ThrThr: 4.996 ± 1.17
6.226ThrVal: 6.226 ± 0.861
1.014ThrTrp: 1.014 ± 0.279
2.606ThrTyr: 2.606 ± 0.567
0.0ThrXaa: 0.0 ± 0.0
Val
7.095ValAla: 7.095 ± 0.753
0.869ValCys: 0.869 ± 0.253
5.213ValAsp: 5.213 ± 0.655
2.896ValGlu: 2.896 ± 0.515
2.462ValPhe: 2.462 ± 0.417
5.213ValGly: 5.213 ± 0.445
1.593ValHis: 1.593 ± 0.328
2.896ValIle: 2.896 ± 0.492
3.403ValLys: 3.403 ± 0.681
7.892ValLeu: 7.892 ± 0.87
2.027ValMet: 2.027 ± 0.347
3.692ValAsn: 3.692 ± 0.502
4.706ValPro: 4.706 ± 0.664
5.575ValGln: 5.575 ± 0.779
5.068ValArg: 5.068 ± 0.679
4.489ValSer: 4.489 ± 0.798
4.996ValThr: 4.996 ± 0.586
6.009ValVal: 6.009 ± 0.665
0.796ValTrp: 0.796 ± 0.289
2.606ValTyr: 2.606 ± 0.406
0.0ValXaa: 0.0 ± 0.0
Trp
0.941TrpAla: 0.941 ± 0.235
0.145TrpCys: 0.145 ± 0.1
0.652TrpAsp: 0.652 ± 0.197
0.796TrpGlu: 0.796 ± 0.281
0.869TrpPhe: 0.869 ± 0.336
1.086TrpGly: 1.086 ± 0.244
0.072TrpHis: 0.072 ± 0.064
0.434TrpIle: 0.434 ± 0.194
0.29TrpLys: 0.29 ± 0.143
1.158TrpLeu: 1.158 ± 0.314
0.217TrpMet: 0.217 ± 0.121
0.579TrpAsn: 0.579 ± 0.202
0.29TrpPro: 0.29 ± 0.145
1.086TrpGln: 1.086 ± 0.246
0.652TrpArg: 0.652 ± 0.206
0.579TrpSer: 0.579 ± 0.193
0.434TrpThr: 0.434 ± 0.178
1.376TrpVal: 1.376 ± 0.326
0.217TrpTrp: 0.217 ± 0.139
1.014TrpTyr: 1.014 ± 0.285
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.534TyrAla: 2.534 ± 0.366
0.362TyrCys: 0.362 ± 0.162
2.534TyrAsp: 2.534 ± 0.47
1.955TyrGlu: 1.955 ± 0.348
0.869TyrPhe: 0.869 ± 0.278
3.041TyrGly: 3.041 ± 0.478
0.869TyrHis: 0.869 ± 0.282
2.027TyrIle: 2.027 ± 0.417
1.014TyrLys: 1.014 ± 0.308
3.548TyrLeu: 3.548 ± 0.6
1.303TyrMet: 1.303 ± 0.319
1.593TyrAsn: 1.593 ± 0.378
1.882TyrPro: 1.882 ± 0.317
1.738TyrGln: 1.738 ± 0.32
2.896TyrArg: 2.896 ± 0.468
2.968TyrSer: 2.968 ± 0.428
2.172TyrThr: 2.172 ± 0.446
2.606TyrVal: 2.606 ± 0.476
0.434TyrTrp: 0.434 ± 0.182
0.941TyrTyr: 0.941 ± 0.24
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 55 proteins (13813 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski