Amino acid dipepetide frequency for Campylobacter phage CJIE4-5

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.117AlaAla: 1.117 ± 0.375
0.773AlaCys: 0.773 ± 0.351
1.976AlaAsp: 1.976 ± 0.356
3.179AlaGlu: 3.179 ± 0.777
4.726AlaPhe: 4.726 ± 0.784
2.664AlaGly: 2.664 ± 0.573
0.687AlaHis: 0.687 ± 0.249
3.609AlaIle: 3.609 ± 0.503
7.476AlaLys: 7.476 ± 0.869
8.764AlaLeu: 8.764 ± 0.997
1.289AlaMet: 1.289 ± 0.387
5.929AlaAsn: 5.929 ± 1.078
1.461AlaPro: 1.461 ± 0.407
1.976AlaGln: 1.976 ± 0.426
1.461AlaArg: 1.461 ± 0.461
4.124AlaSer: 4.124 ± 0.634
2.75AlaThr: 2.75 ± 0.826
1.375AlaVal: 1.375 ± 0.324
0.773AlaTrp: 0.773 ± 0.266
2.32AlaTyr: 2.32 ± 0.473
0.0AlaXaa: 0.0 ± 0.0
Cys
0.43CysAla: 0.43 ± 0.167
0.172CysCys: 0.172 ± 0.133
0.945CysAsp: 0.945 ± 0.373
1.031CysGlu: 1.031 ± 0.343
0.687CysPhe: 0.687 ± 0.286
0.601CysGly: 0.601 ± 0.301
0.086CysHis: 0.086 ± 0.094
0.773CysIle: 0.773 ± 0.306
1.461CysLys: 1.461 ± 0.432
1.031CysLeu: 1.031 ± 0.382
0.258CysMet: 0.258 ± 0.199
0.773CysAsn: 0.773 ± 0.281
0.344CysPro: 0.344 ± 0.178
0.086CysGln: 0.086 ± 0.077
0.0CysArg: 0.0 ± 0.0
0.859CysSer: 0.859 ± 0.337
0.258CysThr: 0.258 ± 0.169
0.516CysVal: 0.516 ± 0.299
0.0CysTrp: 0.0 ± 0.0
0.516CysTyr: 0.516 ± 0.242
0.0CysXaa: 0.0 ± 0.0
Asp
2.492AspAla: 2.492 ± 0.436
0.258AspCys: 0.258 ± 0.163
3.007AspAsp: 3.007 ± 0.583
5.843AspGlu: 5.843 ± 0.829
4.038AspPhe: 4.038 ± 0.542
2.578AspGly: 2.578 ± 0.42
0.43AspHis: 0.43 ± 0.179
3.695AspIle: 3.695 ± 0.514
6.616AspLys: 6.616 ± 0.751
5.757AspLeu: 5.757 ± 1.015
0.516AspMet: 0.516 ± 0.195
3.609AspAsn: 3.609 ± 0.628
0.859AspPro: 0.859 ± 0.276
0.601AspGln: 0.601 ± 0.203
0.945AspArg: 0.945 ± 0.296
1.461AspSer: 1.461 ± 0.336
2.664AspThr: 2.664 ± 0.447
2.234AspVal: 2.234 ± 0.439
0.344AspTrp: 0.344 ± 0.14
2.664AspTyr: 2.664 ± 0.509
0.0AspXaa: 0.0 ± 0.0
Glu
5.07GluAla: 5.07 ± 1.037
1.031GluCys: 1.031 ± 0.327
2.921GluAsp: 2.921 ± 0.579
7.561GluGlu: 7.561 ± 1.331
4.554GluPhe: 4.554 ± 0.619
1.804GluGly: 1.804 ± 0.374
0.773GluHis: 0.773 ± 0.292
8.421GluIle: 8.421 ± 0.877
11.514GluLys: 11.514 ± 0.956
10.655GluLeu: 10.655 ± 1.464
1.804GluMet: 1.804 ± 0.377
8.678GluAsn: 8.678 ± 0.769
1.031GluPro: 1.031 ± 0.346
4.124GluGln: 4.124 ± 0.489
2.578GluArg: 2.578 ± 0.617
4.554GluSer: 4.554 ± 0.747
2.234GluThr: 2.234 ± 0.455
4.038GluVal: 4.038 ± 0.566
0.516GluTrp: 0.516 ± 0.158
2.492GluTyr: 2.492 ± 0.516
0.086GluXaa: 0.086 ± 0.067
Phe
2.836PheAla: 2.836 ± 0.484
1.031PheCys: 1.031 ± 0.383
2.664PheAsp: 2.664 ± 0.477
4.296PheGlu: 4.296 ± 0.577
2.836PhePhe: 2.836 ± 0.745
2.32PheGly: 2.32 ± 0.496
0.601PheHis: 0.601 ± 0.283
4.726PheIle: 4.726 ± 0.839
7.647PheLys: 7.647 ± 0.907
6.444PheLeu: 6.444 ± 0.991
1.117PheMet: 1.117 ± 0.302
4.468PheAsn: 4.468 ± 0.715
0.516PhePro: 0.516 ± 0.256
1.203PheGln: 1.203 ± 0.252
1.289PheArg: 1.289 ± 0.347
3.695PheSer: 3.695 ± 0.693
2.75PheThr: 2.75 ± 0.521
2.32PheVal: 2.32 ± 0.691
0.172PheTrp: 0.172 ± 0.131
3.179PheTyr: 3.179 ± 0.824
0.0PheXaa: 0.0 ± 0.0
Gly
2.578GlyAla: 2.578 ± 0.785
0.601GlyCys: 0.601 ± 0.232
3.437GlyAsp: 3.437 ± 0.583
3.093GlyGlu: 3.093 ± 0.468
3.781GlyPhe: 3.781 ± 0.499
2.234GlyGly: 2.234 ± 0.633
0.172GlyHis: 0.172 ± 0.133
4.468GlyIle: 4.468 ± 0.618
4.296GlyLys: 4.296 ± 0.803
4.726GlyLeu: 4.726 ± 0.949
0.601GlyMet: 0.601 ± 0.18
3.093GlyAsn: 3.093 ± 0.503
0.0GlyPro: 0.0 ± 0.0
0.945GlyGln: 0.945 ± 0.281
0.086GlyArg: 0.086 ± 0.067
2.664GlySer: 2.664 ± 0.648
1.89GlyThr: 1.89 ± 0.639
3.093GlyVal: 3.093 ± 0.521
0.258GlyTrp: 0.258 ± 0.125
2.234GlyTyr: 2.234 ± 0.475
0.0GlyXaa: 0.0 ± 0.0
His
0.344HisAla: 0.344 ± 0.152
0.086HisCys: 0.086 ± 0.079
0.258HisAsp: 0.258 ± 0.167
0.859HisGlu: 0.859 ± 0.293
1.031HisPhe: 1.031 ± 0.365
0.43HisGly: 0.43 ± 0.247
0.43HisHis: 0.43 ± 0.26
0.601HisIle: 0.601 ± 0.194
1.117HisLys: 1.117 ± 0.314
0.859HisLeu: 0.859 ± 0.248
0.086HisMet: 0.086 ± 0.081
0.687HisAsn: 0.687 ± 0.263
0.258HisPro: 0.258 ± 0.187
0.687HisGln: 0.687 ± 0.352
0.344HisArg: 0.344 ± 0.208
0.43HisSer: 0.43 ± 0.183
0.945HisThr: 0.945 ± 0.284
0.172HisVal: 0.172 ± 0.108
0.086HisTrp: 0.086 ± 0.089
0.258HisTyr: 0.258 ± 0.15
0.0HisXaa: 0.0 ± 0.0
Ile
5.327IleAla: 5.327 ± 0.561
0.344IleCys: 0.344 ± 0.177
4.554IleAsp: 4.554 ± 0.742
7.218IleGlu: 7.218 ± 0.87
3.695IlePhe: 3.695 ± 0.595
4.21IleGly: 4.21 ± 0.755
0.945IleHis: 0.945 ± 0.322
6.273IleIle: 6.273 ± 1.144
11.342IleLys: 11.342 ± 0.931
7.561IleLeu: 7.561 ± 1.005
1.461IleMet: 1.461 ± 0.354
7.132IleAsn: 7.132 ± 0.744
1.976IlePro: 1.976 ± 0.436
2.406IleGln: 2.406 ± 0.436
2.148IleArg: 2.148 ± 0.531
4.898IleSer: 4.898 ± 0.68
4.296IleThr: 4.296 ± 0.573
2.75IleVal: 2.75 ± 0.454
0.859IleTrp: 0.859 ± 0.312
2.75IleTyr: 2.75 ± 0.493
0.0IleXaa: 0.0 ± 0.0
Lys
8.421LysAla: 8.421 ± 1.128
1.375LysCys: 1.375 ± 0.405
7.046LysAsp: 7.046 ± 0.867
11.6LysGlu: 11.6 ± 0.867
3.695LysPhe: 3.695 ± 0.653
5.843LysGly: 5.843 ± 0.897
1.117LysHis: 1.117 ± 0.368
11.514LysIle: 11.514 ± 1.561
11.772LysLys: 11.772 ± 1.221
9.795LysLeu: 9.795 ± 1.016
2.664LysMet: 2.664 ± 0.4
11.084LysAsn: 11.084 ± 0.872
3.179LysPro: 3.179 ± 0.683
5.156LysGln: 5.156 ± 0.716
3.867LysArg: 3.867 ± 0.731
5.413LysSer: 5.413 ± 0.752
6.616LysThr: 6.616 ± 0.653
4.726LysVal: 4.726 ± 0.687
0.601LysTrp: 0.601 ± 0.189
3.953LysTyr: 3.953 ± 0.587
0.0LysXaa: 0.0 ± 0.0
Leu
6.358LeuAla: 6.358 ± 0.929
1.117LeuCys: 1.117 ± 0.388
3.609LeuAsp: 3.609 ± 0.565
11.6LeuGlu: 11.6 ± 1.221
4.726LeuPhe: 4.726 ± 0.802
5.413LeuGly: 5.413 ± 1.032
0.687LeuHis: 0.687 ± 0.212
6.101LeuIle: 6.101 ± 0.604
14.865LeuLys: 14.865 ± 1.432
6.273LeuLeu: 6.273 ± 0.744
2.492LeuMet: 2.492 ± 0.467
8.678LeuAsn: 8.678 ± 0.799
1.375LeuPro: 1.375 ± 0.339
3.179LeuGln: 3.179 ± 0.647
4.64LeuArg: 4.64 ± 0.542
9.194LeuSer: 9.194 ± 0.927
4.124LeuThr: 4.124 ± 0.48
3.781LeuVal: 3.781 ± 0.6
0.945LeuTrp: 0.945 ± 0.304
2.921LeuTyr: 2.921 ± 0.522
0.086LeuXaa: 0.086 ± 0.067
Met
1.031MetAla: 1.031 ± 0.288
0.086MetCys: 0.086 ± 0.08
1.203MetAsp: 1.203 ± 0.267
0.945MetGlu: 0.945 ± 0.321
0.859MetPhe: 0.859 ± 0.271
0.687MetGly: 0.687 ± 0.215
0.086MetHis: 0.086 ± 0.087
2.234MetIle: 2.234 ± 0.502
2.148MetLys: 2.148 ± 0.453
2.32MetLeu: 2.32 ± 0.645
0.0MetMet: 0.0 ± 0.0
1.203MetAsn: 1.203 ± 0.318
0.516MetPro: 0.516 ± 0.199
2.062MetGln: 2.062 ± 0.408
1.117MetArg: 1.117 ± 0.319
2.75MetSer: 2.75 ± 0.616
0.687MetThr: 0.687 ± 0.211
0.601MetVal: 0.601 ± 0.245
0.172MetTrp: 0.172 ± 0.142
0.43MetTyr: 0.43 ± 0.165
0.0MetXaa: 0.0 ± 0.0
Asn
6.358AsnAla: 6.358 ± 0.59
0.859AsnCys: 0.859 ± 0.37
4.726AsnAsp: 4.726 ± 0.551
8.335AsnGlu: 8.335 ± 0.927
3.265AsnPhe: 3.265 ± 0.569
4.038AsnGly: 4.038 ± 0.753
1.117AsnHis: 1.117 ± 0.275
7.218AsnIle: 7.218 ± 0.882
7.476AsnLys: 7.476 ± 0.61
9.28AsnLeu: 9.28 ± 0.933
1.719AsnMet: 1.719 ± 0.417
5.07AsnAsn: 5.07 ± 0.77
1.804AsnPro: 1.804 ± 0.375
3.437AsnGln: 3.437 ± 0.907
1.289AsnArg: 1.289 ± 0.304
5.413AsnSer: 5.413 ± 0.81
2.492AsnThr: 2.492 ± 0.535
2.578AsnVal: 2.578 ± 0.453
0.43AsnTrp: 0.43 ± 0.217
2.32AsnTyr: 2.32 ± 0.416
0.086AsnXaa: 0.086 ± 0.067
Pro
0.859ProAla: 0.859 ± 0.304
0.258ProCys: 0.258 ± 0.206
0.601ProAsp: 0.601 ± 0.21
1.031ProGlu: 1.031 ± 0.261
1.547ProPhe: 1.547 ± 0.402
0.344ProGly: 0.344 ± 0.156
0.43ProHis: 0.43 ± 0.205
1.289ProIle: 1.289 ± 0.321
2.062ProLys: 2.062 ± 0.565
1.976ProLeu: 1.976 ± 0.617
0.344ProMet: 0.344 ± 0.159
1.117ProAsn: 1.117 ± 0.371
0.344ProPro: 0.344 ± 0.172
1.633ProGln: 1.633 ± 0.486
0.687ProArg: 0.687 ± 0.345
1.804ProSer: 1.804 ± 0.505
1.375ProThr: 1.375 ± 0.413
0.859ProVal: 0.859 ± 0.308
0.086ProTrp: 0.086 ± 0.08
1.547ProTyr: 1.547 ± 0.342
0.0ProXaa: 0.0 ± 0.0
Gln
2.75GlnAla: 2.75 ± 0.693
0.344GlnCys: 0.344 ± 0.185
1.804GlnAsp: 1.804 ± 0.348
3.437GlnGlu: 3.437 ± 0.63
0.859GlnPhe: 0.859 ± 0.236
1.976GlnGly: 1.976 ± 0.502
0.258GlnHis: 0.258 ± 0.146
3.609GlnIle: 3.609 ± 0.475
4.382GlnLys: 4.382 ± 0.438
2.836GlnLeu: 2.836 ± 0.698
1.289GlnMet: 1.289 ± 0.48
3.523GlnAsn: 3.523 ± 0.495
0.258GlnPro: 0.258 ± 0.166
0.773GlnGln: 0.773 ± 0.243
1.375GlnArg: 1.375 ± 0.412
2.921GlnSer: 2.921 ± 0.617
2.062GlnThr: 2.062 ± 0.405
1.375GlnVal: 1.375 ± 0.351
0.086GlnTrp: 0.086 ± 0.086
1.117GlnTyr: 1.117 ± 0.303
0.0GlnXaa: 0.0 ± 0.0
Arg
1.633ArgAla: 1.633 ± 0.38
0.258ArgCys: 0.258 ± 0.136
1.289ArgAsp: 1.289 ± 0.359
2.578ArgGlu: 2.578 ± 0.541
1.633ArgPhe: 1.633 ± 0.491
0.773ArgGly: 0.773 ± 0.247
0.258ArgHis: 0.258 ± 0.203
1.976ArgIle: 1.976 ± 0.503
3.523ArgLys: 3.523 ± 0.595
3.351ArgLeu: 3.351 ± 0.7
0.687ArgMet: 0.687 ± 0.348
1.547ArgAsn: 1.547 ± 0.326
0.687ArgPro: 0.687 ± 0.184
1.117ArgGln: 1.117 ± 0.23
0.687ArgArg: 0.687 ± 0.261
1.547ArgSer: 1.547 ± 0.401
0.859ArgThr: 0.859 ± 0.255
1.547ArgVal: 1.547 ± 0.368
0.344ArgTrp: 0.344 ± 0.215
0.687ArgTyr: 0.687 ± 0.257
0.0ArgXaa: 0.0 ± 0.0
Ser
3.867SerAla: 3.867 ± 0.553
0.43SerCys: 0.43 ± 0.231
3.609SerAsp: 3.609 ± 0.505
4.726SerGlu: 4.726 ± 0.722
5.843SerPhe: 5.843 ± 1.054
2.921SerGly: 2.921 ± 0.563
0.258SerHis: 0.258 ± 0.142
5.499SerIle: 5.499 ± 0.782
7.733SerLys: 7.733 ± 0.793
7.647SerLeu: 7.647 ± 0.783
1.976SerMet: 1.976 ± 0.392
3.781SerAsn: 3.781 ± 0.517
1.633SerPro: 1.633 ± 0.371
2.148SerGln: 2.148 ± 0.39
1.203SerArg: 1.203 ± 0.381
4.812SerSer: 4.812 ± 1.12
2.836SerThr: 2.836 ± 0.5
3.093SerVal: 3.093 ± 0.511
0.516SerTrp: 0.516 ± 0.176
1.633SerTyr: 1.633 ± 0.415
0.0SerXaa: 0.0 ± 0.0
Thr
2.406ThrAla: 2.406 ± 0.509
0.43ThrCys: 0.43 ± 0.2
1.804ThrAsp: 1.804 ± 0.348
2.664ThrGlu: 2.664 ± 0.384
2.062ThrPhe: 2.062 ± 0.42
2.062ThrGly: 2.062 ± 0.418
0.945ThrHis: 0.945 ± 0.225
3.437ThrIle: 3.437 ± 0.502
5.07ThrLys: 5.07 ± 0.596
4.382ThrLeu: 4.382 ± 0.629
1.203ThrMet: 1.203 ± 0.307
3.867ThrAsn: 3.867 ± 0.573
1.547ThrPro: 1.547 ± 0.426
2.578ThrGln: 2.578 ± 0.526
1.117ThrArg: 1.117 ± 0.319
3.953ThrSer: 3.953 ± 0.624
3.007ThrThr: 3.007 ± 0.478
0.258ThrVal: 0.258 ± 0.139
0.601ThrTrp: 0.601 ± 0.21
2.062ThrTyr: 2.062 ± 0.407
0.0ThrXaa: 0.0 ± 0.0
Val
2.148ValAla: 2.148 ± 0.399
0.859ValCys: 0.859 ± 0.347
1.89ValAsp: 1.89 ± 0.368
2.32ValGlu: 2.32 ± 0.382
3.351ValPhe: 3.351 ± 0.76
1.633ValGly: 1.633 ± 0.402
0.172ValHis: 0.172 ± 0.147
3.007ValIle: 3.007 ± 0.51
3.695ValLys: 3.695 ± 0.551
5.07ValLeu: 5.07 ± 0.793
0.945ValMet: 0.945 ± 0.231
2.148ValAsn: 2.148 ± 0.34
1.289ValPro: 1.289 ± 0.37
0.945ValGln: 0.945 ± 0.286
0.773ValArg: 0.773 ± 0.262
3.437ValSer: 3.437 ± 0.577
1.461ValThr: 1.461 ± 0.352
2.148ValVal: 2.148 ± 0.44
0.687ValTrp: 0.687 ± 0.228
1.289ValTyr: 1.289 ± 0.382
0.0ValXaa: 0.0 ± 0.0
Trp
0.344TrpAla: 0.344 ± 0.143
0.086TrpCys: 0.086 ± 0.08
0.516TrpAsp: 0.516 ± 0.179
0.516TrpGlu: 0.516 ± 0.162
0.43TrpPhe: 0.43 ± 0.264
0.344TrpGly: 0.344 ± 0.175
0.258TrpHis: 0.258 ± 0.129
0.773TrpIle: 0.773 ± 0.223
0.859TrpLys: 0.859 ± 0.284
0.945TrpLeu: 0.945 ± 0.35
0.0TrpMet: 0.0 ± 0.0
0.687TrpAsn: 0.687 ± 0.319
0.0TrpPro: 0.0 ± 0.0
0.258TrpGln: 0.258 ± 0.125
0.172TrpArg: 0.172 ± 0.124
0.43TrpSer: 0.43 ± 0.212
0.516TrpThr: 0.516 ± 0.159
0.344TrpVal: 0.344 ± 0.215
0.258TrpTrp: 0.258 ± 0.146
0.172TrpTyr: 0.172 ± 0.112
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.062TyrAla: 2.062 ± 0.494
0.43TyrCys: 0.43 ± 0.244
2.75TyrAsp: 2.75 ± 0.574
3.609TyrGlu: 3.609 ± 0.422
2.492TyrPhe: 2.492 ± 0.518
1.375TyrGly: 1.375 ± 0.386
0.258TyrHis: 0.258 ± 0.164
2.921TyrIle: 2.921 ± 0.523
4.382TyrLys: 4.382 ± 0.676
2.578TyrLeu: 2.578 ± 0.604
0.601TyrMet: 0.601 ± 0.227
2.148TyrAsn: 2.148 ± 0.424
1.117TyrPro: 1.117 ± 0.334
1.633TyrGln: 1.633 ± 0.418
1.203TyrArg: 1.203 ± 0.336
1.719TyrSer: 1.719 ± 0.432
1.633TyrThr: 1.633 ± 0.345
1.461TyrVal: 1.461 ± 0.281
0.172TyrTrp: 0.172 ± 0.138
1.375TyrTyr: 1.375 ± 0.487
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.086XaaAsn: 0.086 ± 0.067
0.0XaaPro: 0.0 ± 0.0
0.086XaaGln: 0.086 ± 0.067
0.0XaaArg: 0.0 ± 0.0
0.086XaaSer: 0.086 ± 0.067
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 59 proteins (11639 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski