Amino acid dipeptide frequency for Pectobacterium phage CX5

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
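The counting described above can be sketched as follows. This is a minimal illustration, not the database's own code: the function name `dipeptide_per_mille`, the use of overlapping pairs within each protein, and the mapping of every non-standard letter to X (Xaa) are assumptions made for the example, and the input sequences are toy data.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # the 20 standard amino acids

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} (hypothetical helper)."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Assumption: overlapping pairs, never spanning two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input, not taken from the phage proteome.
freqs = dipeptide_per_mille(["MAAL", "MAB"])

# Uniform expectation over the 400 standard dipeptides: 2.5 per mille (0.25%).
uniform_expectation = 1000.0 / 400
```

Values above `uniform_expectation` would correspond to overrepresented dipeptides, values below it to underrepresented ones.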

All values are given in per mille (‰), so they must be multiplied by 10⁻³ to obtain fractions.
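As a small worked example of the unit conversion, the sketch below turns the AlaAla value from the table into a fraction and a percentage, and compares it with the uniform expectation of 1/400. The helper names are hypothetical and chosen only for illustration.

```python
UNIFORM_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%

def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def ratio_to_uniform(value_per_mille):
    """How many times more (or less) frequent than the uniform expectation."""
    return value_per_mille / UNIFORM_PER_MILLE

ala_ala = 11.077                           # AlaAla value from the table, in per mille
fraction = per_mille_to_fraction(ala_ala)  # about 0.011077
percent = fraction * 100                   # about 1.1077 %
ratio = ratio_to_uniform(ala_ala)          # about 4.43 times the uniform expectation
```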

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 11.077 ± 1.099
AlaCys: 0.796 ± 0.192
AlaAsp: 8.036 ± 0.703
AlaGlu: 5.43 ± 0.809
AlaPhe: 3.475 ± 0.465
AlaGly: 6.588 ± 0.791
AlaHis: 2.462 ± 0.465
AlaIle: 4.199 ± 0.56
AlaLys: 3.403 ± 0.493
AlaLeu: 9.485 ± 1.066
AlaMet: 2.679 ± 0.535
AlaAsn: 3.692 ± 0.532
AlaPro: 3.33 ± 0.711
AlaGln: 4.778 ± 0.597
AlaArg: 4.923 ± 0.66
AlaSer: 5.213 ± 0.714
AlaThr: 5.43 ± 0.862
AlaVal: 7.312 ± 0.762
AlaTrp: 0.941 ± 0.222
AlaTyr: 3.403 ± 0.421
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.579 ± 0.236
CysCys: 0.145 ± 0.107
CysAsp: 1.014 ± 0.305
CysGlu: 0.29 ± 0.111
CysPhe: 0.434 ± 0.225
CysGly: 0.362 ± 0.148
CysHis: 0.507 ± 0.246
CysIle: 0.434 ± 0.193
CysLys: 0.434 ± 0.189
CysLeu: 0.579 ± 0.224
CysMet: 0.434 ± 0.194
CysAsn: 0.434 ± 0.172
CysPro: 0.579 ± 0.186
CysGln: 0.29 ± 0.127
CysArg: 0.579 ± 0.253
CysSer: 0.869 ± 0.306
CysThr: 0.724 ± 0.285
CysVal: 0.869 ± 0.347
CysTrp: 0.217 ± 0.113
CysTyr: 0.796 ± 0.237
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.733 ± 0.642
AspCys: 0.869 ± 0.312
AspAsp: 4.416 ± 0.711
AspGlu: 3.258 ± 0.53
AspPhe: 1.955 ± 0.305
AspGly: 5.285 ± 0.787
AspHis: 0.941 ± 0.267
AspIle: 3.475 ± 0.422
AspLys: 3.041 ± 0.469
AspLeu: 5.502 ± 0.543
AspMet: 2.317 ± 0.317
AspAsn: 3.113 ± 0.498
AspPro: 1.955 ± 0.278
AspGln: 1.086 ± 0.296
AspArg: 2.462 ± 0.408
AspSer: 5.14 ± 0.565
AspThr: 5.14 ± 0.592
AspVal: 5.358 ± 0.599
AspTrp: 1.231 ± 0.299
AspTyr: 2.172 ± 0.365
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.778 ± 0.666
GluCys: 0.145 ± 0.1
GluAsp: 2.679 ± 0.443
GluGlu: 2.751 ± 0.5
GluPhe: 2.534 ± 0.401
GluGly: 2.824 ± 0.478
GluHis: 1.086 ± 0.304
GluIle: 1.665 ± 0.343
GluLys: 1.738 ± 0.402
GluLeu: 5.213 ± 0.636
GluMet: 2.1 ± 0.356
GluAsn: 1.665 ± 0.353
GluPro: 0.941 ± 0.259
GluGln: 2.679 ± 0.579
GluArg: 2.968 ± 0.485
GluSer: 3.475 ± 0.512
GluThr: 2.606 ± 0.479
GluVal: 4.416 ± 0.607
GluTrp: 0.579 ± 0.27
GluTyr: 2.896 ± 0.491
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.751 ± 0.408
PheCys: 0.072 ± 0.086
PheAsp: 2.968 ± 0.46
PheGlu: 1.158 ± 0.294
PhePhe: 1.448 ± 0.376
PheGly: 2.606 ± 0.375
PheHis: 0.652 ± 0.209
PheIle: 1.231 ± 0.307
PheLys: 2.027 ± 0.391
PheLeu: 1.955 ± 0.306
PheMet: 0.724 ± 0.172
PheAsn: 1.738 ± 0.35
PhePro: 1.448 ± 0.283
PheGln: 0.869 ± 0.239
PheArg: 1.665 ± 0.356
PheSer: 2.317 ± 0.49
PheThr: 1.593 ± 0.345
PheVal: 2.389 ± 0.342
PheTrp: 0.434 ± 0.173
PheTyr: 0.724 ± 0.193
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.095 ± 0.684
GlyCys: 1.448 ± 0.411
GlyAsp: 4.416 ± 0.691
GlyGlu: 2.968 ± 0.456
GlyPhe: 2.462 ± 0.265
GlyGly: 5.792 ± 0.805
GlyHis: 0.724 ± 0.203
GlyIle: 4.706 ± 0.46
GlyLys: 3.62 ± 0.646
GlyLeu: 5.575 ± 0.622
GlyMet: 2.462 ± 0.452
GlyAsn: 2.679 ± 0.552
GlyPro: 1.376 ± 0.353
GlyGln: 2.534 ± 0.464
GlyArg: 3.548 ± 0.477
GlySer: 5.068 ± 0.651
GlyThr: 8.109 ± 1.021
GlyVal: 6.95 ± 0.768
GlyTrp: 1.086 ± 0.289
GlyTyr: 3.33 ± 0.647
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.448 ± 0.257
HisCys: 0.29 ± 0.128
HisAsp: 1.376 ± 0.306
HisGlu: 1.303 ± 0.305
HisPhe: 0.29 ± 0.152
HisGly: 1.738 ± 0.417
HisHis: 0.362 ± 0.16
HisIle: 1.086 ± 0.194
HisLys: 1.158 ± 0.328
HisLeu: 2.172 ± 0.447
HisMet: 0.579 ± 0.206
HisAsn: 1.086 ± 0.277
HisPro: 1.014 ± 0.307
HisGln: 0.869 ± 0.238
HisArg: 1.448 ± 0.258
HisSer: 1.014 ± 0.258
HisThr: 0.652 ± 0.256
HisVal: 1.738 ± 0.367
HisTrp: 0.362 ± 0.169
HisTyr: 0.941 ± 0.315
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.199 ± 0.571
IleCys: 0.652 ± 0.21
IleAsp: 3.258 ± 0.618
IleGlu: 2.679 ± 0.518
IlePhe: 0.652 ± 0.207
IleGly: 3.403 ± 0.444
IleHis: 1.231 ± 0.31
IleIle: 1.303 ± 0.262
IleLys: 2.824 ± 0.392
IleLeu: 3.548 ± 0.592
IleMet: 1.231 ± 0.257
IleAsn: 2.824 ± 0.526
IlePro: 2.027 ± 0.371
IleGln: 1.593 ± 0.37
IleArg: 2.027 ± 0.453
IleSer: 2.534 ± 0.378
IleThr: 3.982 ± 0.726
IleVal: 2.751 ± 0.351
IleTrp: 0.507 ± 0.289
IleTyr: 0.796 ± 0.238
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.996 ± 0.771
LysCys: 0.362 ± 0.191
LysAsp: 2.679 ± 0.488
LysGlu: 2.679 ± 0.422
LysPhe: 0.652 ± 0.228
LysGly: 2.968 ± 0.423
LysHis: 0.796 ± 0.213
LysIle: 1.231 ± 0.277
LysLys: 1.231 ± 0.358
LysLeu: 5.068 ± 0.633
LysMet: 1.376 ± 0.299
LysAsn: 1.158 ± 0.243
LysPro: 2.027 ± 0.39
LysGln: 1.882 ± 0.434
LysArg: 3.258 ± 0.67
LysSer: 1.738 ± 0.287
LysThr: 2.1 ± 0.392
LysVal: 2.824 ± 0.429
LysTrp: 0.507 ± 0.205
LysTyr: 2.027 ± 0.429
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.24 ± 0.606
LeuCys: 1.52 ± 0.342
LeuAsp: 5.14 ± 0.606
LeuGlu: 4.561 ± 0.495
LeuPhe: 2.389 ± 0.399
LeuGly: 6.516 ± 0.764
LeuHis: 1.52 ± 0.309
LeuIle: 3.692 ± 0.571
LeuLys: 3.113 ± 0.361
LeuLeu: 6.444 ± 0.69
LeuMet: 2.534 ± 0.406
LeuAsn: 4.272 ± 0.502
LeuPro: 4.416 ± 0.499
LeuGln: 3.041 ± 0.511
LeuArg: 7.312 ± 0.887
LeuSer: 7.747 ± 0.935
LeuThr: 5.72 ± 0.62
LeuVal: 7.24 ± 0.72
LeuTrp: 0.579 ± 0.251
LeuTyr: 2.968 ± 0.509
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.824 ± 0.616
MetCys: 0.434 ± 0.195
MetAsp: 1.376 ± 0.311
MetGlu: 1.158 ± 0.26
MetPhe: 1.376 ± 0.371
MetGly: 1.955 ± 0.435
MetHis: 0.796 ± 0.206
MetIle: 1.014 ± 0.271
MetLys: 1.158 ± 0.253
MetLeu: 2.824 ± 0.393
MetMet: 0.796 ± 0.24
MetAsn: 0.434 ± 0.17
MetPro: 1.376 ± 0.31
MetGln: 2.1 ± 0.439
MetArg: 2.027 ± 0.429
MetSer: 1.665 ± 0.364
MetThr: 2.027 ± 0.35
MetVal: 2.172 ± 0.423
MetTrp: 0.29 ± 0.138
MetTyr: 1.231 ± 0.24
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.837 ± 0.528
AsnCys: 0.434 ± 0.237
AsnAsp: 1.81 ± 0.318
AsnGlu: 2.1 ± 0.32
AsnPhe: 1.738 ± 0.308
AsnGly: 4.127 ± 0.512
AsnHis: 0.796 ± 0.253
AsnIle: 2.172 ± 0.443
AsnLys: 2.462 ± 0.344
AsnLeu: 3.62 ± 0.494
AsnMet: 1.448 ± 0.265
AsnAsn: 1.665 ± 0.278
AsnPro: 1.52 ± 0.343
AsnGln: 1.955 ± 0.551
AsnArg: 2.317 ± 0.355
AsnSer: 2.679 ± 0.495
AsnThr: 3.041 ± 0.48
AsnVal: 2.751 ± 0.336
AsnTrp: 0.796 ± 0.234
AsnTyr: 0.579 ± 0.181
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.113 ± 0.548
ProCys: 0.217 ± 0.103
ProAsp: 3.258 ± 0.524
ProGlu: 3.258 ± 0.487
ProPhe: 0.724 ± 0.21
ProGly: 2.244 ± 0.271
ProHis: 0.507 ± 0.155
ProIle: 1.738 ± 0.463
ProLys: 1.231 ± 0.358
ProLeu: 2.824 ± 0.401
ProMet: 1.303 ± 0.304
ProAsn: 1.376 ± 0.395
ProPro: 2.027 ± 0.453
ProGln: 1.738 ± 0.401
ProArg: 2.027 ± 0.358
ProSer: 2.389 ± 0.378
ProThr: 3.765 ± 0.558
ProVal: 3.692 ± 0.683
ProTrp: 0.579 ± 0.211
ProTyr: 1.738 ± 0.254
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.996 ± 0.659
GlnCys: 0.072 ± 0.073
GlnAsp: 2.1 ± 0.405
GlnGlu: 2.606 ± 0.466
GlnPhe: 1.448 ± 0.3
GlnGly: 4.127 ± 0.561
GlnHis: 1.448 ± 0.325
GlnIle: 1.882 ± 0.378
GlnLys: 1.231 ± 0.38
GlnLeu: 4.054 ± 0.572
GlnMet: 1.448 ± 0.293
GlnAsn: 2.027 ± 0.431
GlnPro: 1.303 ± 0.355
GlnGln: 2.751 ± 0.514
GlnArg: 2.751 ± 0.489
GlnSer: 2.389 ± 0.472
GlnThr: 1.955 ± 0.28
GlnVal: 3.62 ± 0.513
GlnTrp: 0.579 ± 0.204
GlnTyr: 2.317 ± 0.457
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.358 ± 0.66
ArgCys: 0.652 ± 0.217
ArgAsp: 4.344 ± 0.475
ArgGlu: 2.679 ± 0.455
ArgPhe: 1.52 ± 0.307
ArgGly: 4.272 ± 0.49
ArgHis: 1.448 ± 0.328
ArgIle: 3.475 ± 0.491
ArgLys: 2.679 ± 0.352
ArgLeu: 4.344 ± 0.587
ArgMet: 1.52 ± 0.318
ArgAsn: 2.751 ± 0.506
ArgPro: 1.738 ± 0.356
ArgGln: 3.258 ± 0.434
ArgArg: 4.923 ± 0.566
ArgSer: 3.548 ± 0.666
ArgThr: 3.403 ± 0.536
ArgVal: 4.054 ± 0.495
ArgTrp: 0.652 ± 0.163
ArgTyr: 2.317 ± 0.387
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.806 ± 0.641
SerCys: 0.579 ± 0.205
SerAsp: 3.692 ± 0.441
SerGlu: 2.317 ± 0.338
SerPhe: 1.52 ± 0.339
SerGly: 5.72 ± 0.832
SerHis: 1.086 ± 0.248
SerIle: 2.824 ± 0.427
SerLys: 3.33 ± 0.433
SerLeu: 5.937 ± 0.608
SerMet: 1.738 ± 0.375
SerAsn: 2.534 ± 0.603
SerPro: 2.244 ± 0.419
SerGln: 2.896 ± 0.378
SerArg: 2.534 ± 0.385
SerSer: 2.824 ± 0.383
SerThr: 6.226 ± 0.924
SerVal: 5.14 ± 0.542
SerTrp: 1.086 ± 0.249
SerTyr: 1.882 ± 0.356
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 8.254 ± 0.895
ThrCys: 0.579 ± 0.223
ThrAsp: 4.489 ± 0.682
ThrGlu: 2.896 ± 0.523
ThrPhe: 1.955 ± 0.389
ThrGly: 6.082 ± 0.755
ThrHis: 2.027 ± 0.383
ThrIle: 2.534 ± 0.38
ThrLys: 2.244 ± 0.343
ThrLeu: 6.154 ± 0.875
ThrMet: 0.724 ± 0.187
ThrAsn: 2.534 ± 0.451
ThrPro: 3.548 ± 0.444
ThrGln: 2.389 ± 0.369
ThrArg: 3.548 ± 0.447
ThrSer: 4.272 ± 0.638
ThrThr: 4.996 ± 1.356
ThrVal: 6.226 ± 0.793
ThrTrp: 1.014 ± 0.335
ThrTyr: 2.606 ± 0.564
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.095 ± 0.803
ValCys: 0.796 ± 0.257
ValAsp: 5.213 ± 0.611
ValGlu: 2.896 ± 0.472
ValPhe: 2.462 ± 0.435
ValGly: 5.213 ± 0.488
ValHis: 1.593 ± 0.343
ValIle: 2.896 ± 0.517
ValLys: 3.403 ± 0.594
ValLeu: 7.892 ± 0.821
ValMet: 2.027 ± 0.318
ValAsn: 3.692 ± 0.55
ValPro: 4.706 ± 0.704
ValGln: 5.575 ± 0.663
ValArg: 5.068 ± 0.745
ValSer: 4.489 ± 0.681
ValThr: 4.996 ± 0.705
ValVal: 6.009 ± 0.722
ValTrp: 0.796 ± 0.309
ValTyr: 2.679 ± 0.461
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.941 ± 0.231
TrpCys: 0.145 ± 0.101
TrpAsp: 0.652 ± 0.225
TrpGlu: 0.796 ± 0.263
TrpPhe: 0.869 ± 0.303
TrpGly: 1.086 ± 0.244
TrpHis: 0.072 ± 0.073
TrpIle: 0.434 ± 0.197
TrpLys: 0.29 ± 0.159
TrpLeu: 1.158 ± 0.317
TrpMet: 0.217 ± 0.117
TrpAsn: 0.579 ± 0.204
TrpPro: 0.29 ± 0.152
TrpGln: 1.086 ± 0.27
TrpArg: 0.652 ± 0.178
TrpSer: 0.579 ± 0.193
TrpThr: 0.434 ± 0.195
TrpVal: 1.376 ± 0.296
TrpTrp: 0.217 ± 0.143
TrpTyr: 1.014 ± 0.289
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.534 ± 0.355
TyrCys: 0.362 ± 0.159
TyrAsp: 2.534 ± 0.404
TyrGlu: 1.955 ± 0.325
TyrPhe: 0.869 ± 0.312
TyrGly: 3.041 ± 0.583
TyrHis: 0.869 ± 0.284
TyrIle: 2.027 ± 0.364
TyrLys: 1.014 ± 0.311
TyrLeu: 3.62 ± 0.612
TyrMet: 1.303 ± 0.322
TyrAsn: 1.593 ± 0.362
TyrPro: 1.882 ± 0.347
TyrGln: 1.738 ± 0.343
TyrArg: 2.896 ± 0.56
TyrSer: 2.968 ± 0.419
TyrThr: 2.172 ± 0.448
TyrVal: 2.606 ± 0.457
TyrTrp: 0.434 ± 0.192
TyrTyr: 0.941 ± 0.202
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 55 proteins (13813 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
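One plausible reading of "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. The function names, the use of the standard deviation as the error measure, and the fixed seed are assumptions for illustration; the database's exact procedure may differ.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Hypothetical helper: each round resamples whole proteins with replacement.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # Counter returns 0 for missing pairs
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)

# Toy input, not taken from the phage proteome.
mean_aa, err_aa = bootstrap_error(["MAAL", "MAAK", "MKKL"], "AA")
```

Resampling proteins rather than individual residues keeps each protein's internal composition intact, so the error reflects variation between proteins.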

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski