Amino acid dipepetide frequency for Streptococcus phage phi-SsuHCJ31_comEC

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.934AlaAla: 3.934 ± 0.914
0.388AlaCys: 0.388 ± 0.139
3.214AlaAsp: 3.214 ± 0.445
3.768AlaGlu: 3.768 ± 0.442
2.826AlaPhe: 2.826 ± 0.334
4.931AlaGly: 4.931 ± 0.625
0.831AlaHis: 0.831 ± 0.216
5.153AlaIle: 5.153 ± 0.668
5.929AlaLys: 5.929 ± 0.563
5.485AlaLeu: 5.485 ± 0.59
1.551AlaMet: 1.551 ± 0.286
3.546AlaAsn: 3.546 ± 0.465
1.496AlaPro: 1.496 ± 0.274
2.549AlaGln: 2.549 ± 0.488
2.438AlaArg: 2.438 ± 0.365
4.488AlaSer: 4.488 ± 0.599
3.989AlaThr: 3.989 ± 0.485
4.1AlaVal: 4.1 ± 0.471
0.887AlaTrp: 0.887 ± 0.251
2.77AlaTyr: 2.77 ± 0.359
0.0AlaXaa: 0.0 ± 0.0
Cys
0.443CysAla: 0.443 ± 0.154
0.166CysCys: 0.166 ± 0.088
0.277CysAsp: 0.277 ± 0.124
0.499CysGlu: 0.499 ± 0.145
0.277CysPhe: 0.277 ± 0.109
0.831CysGly: 0.831 ± 0.206
0.222CysHis: 0.222 ± 0.114
0.499CysIle: 0.499 ± 0.171
0.554CysLys: 0.554 ± 0.249
0.997CysLeu: 0.997 ± 0.253
0.111CysMet: 0.111 ± 0.087
0.388CysAsn: 0.388 ± 0.153
0.443CysPro: 0.443 ± 0.181
0.831CysGln: 0.831 ± 0.184
0.609CysArg: 0.609 ± 0.229
0.443CysSer: 0.443 ± 0.185
0.277CysThr: 0.277 ± 0.123
0.776CysVal: 0.776 ± 0.214
0.0CysTrp: 0.0 ± 0.0
0.499CysTyr: 0.499 ± 0.206
0.0CysXaa: 0.0 ± 0.0
Asp
3.047AspAla: 3.047 ± 0.359
0.831AspCys: 0.831 ± 0.232
3.103AspAsp: 3.103 ± 0.518
4.876AspGlu: 4.876 ± 0.607
3.214AspPhe: 3.214 ± 0.393
4.433AspGly: 4.433 ± 0.502
0.776AspHis: 0.776 ± 0.218
4.433AspIle: 4.433 ± 0.549
3.879AspLys: 3.879 ± 0.355
4.931AspLeu: 4.931 ± 0.707
1.496AspMet: 1.496 ± 0.261
2.881AspAsn: 2.881 ± 0.505
1.773AspPro: 1.773 ± 0.365
1.995AspGln: 1.995 ± 0.266
2.438AspArg: 2.438 ± 0.418
3.269AspSer: 3.269 ± 0.457
2.715AspThr: 2.715 ± 0.456
3.712AspVal: 3.712 ± 0.499
0.997AspTrp: 0.997 ± 0.204
2.881AspTyr: 2.881 ± 0.605
0.0AspXaa: 0.0 ± 0.0
Glu
4.543GluAla: 4.543 ± 0.476
0.72GluCys: 0.72 ± 0.255
3.989GluAsp: 3.989 ± 0.608
5.984GluGlu: 5.984 ± 0.722
1.939GluPhe: 1.939 ± 0.361
4.211GluGly: 4.211 ± 0.371
1.053GluHis: 1.053 ± 0.243
4.543GluIle: 4.543 ± 0.45
6.095GluLys: 6.095 ± 0.707
8.09GluLeu: 8.09 ± 0.751
1.773GluMet: 1.773 ± 0.425
3.491GluAsn: 3.491 ± 0.561
1.496GluPro: 1.496 ± 0.321
3.879GluGln: 3.879 ± 0.376
3.269GluArg: 3.269 ± 0.492
4.045GluSer: 4.045 ± 0.501
4.377GluThr: 4.377 ± 0.494
4.599GluVal: 4.599 ± 0.487
0.776GluTrp: 0.776 ± 0.214
2.272GluTyr: 2.272 ± 0.523
0.0GluXaa: 0.0 ± 0.0
Phe
2.715PheAla: 2.715 ± 0.479
0.554PheCys: 0.554 ± 0.207
2.992PheAsp: 2.992 ± 0.437
3.158PheGlu: 3.158 ± 0.507
1.773PhePhe: 1.773 ± 0.371
2.715PheGly: 2.715 ± 0.413
0.72PheHis: 0.72 ± 0.179
2.77PheIle: 2.77 ± 0.524
2.937PheLys: 2.937 ± 0.519
3.158PheLeu: 3.158 ± 0.409
0.831PheMet: 0.831 ± 0.262
2.161PheAsn: 2.161 ± 0.281
0.665PhePro: 0.665 ± 0.199
1.385PheGln: 1.385 ± 0.253
1.939PheArg: 1.939 ± 0.29
3.214PheSer: 3.214 ± 0.447
2.105PheThr: 2.105 ± 0.365
2.493PheVal: 2.493 ± 0.369
0.72PheTrp: 0.72 ± 0.195
1.496PheTyr: 1.496 ± 0.276
0.0PheXaa: 0.0 ± 0.0
Gly
3.324GlyAla: 3.324 ± 0.425
0.499GlyCys: 0.499 ± 0.2
3.768GlyAsp: 3.768 ± 0.62
4.322GlyGlu: 4.322 ± 0.471
3.546GlyPhe: 3.546 ± 0.558
3.602GlyGly: 3.602 ± 0.548
1.828GlyHis: 1.828 ± 0.327
6.095GlyIle: 6.095 ± 0.692
4.488GlyLys: 4.488 ± 0.424
6.594GlyLeu: 6.594 ± 0.689
1.718GlyMet: 1.718 ± 0.324
2.992GlyAsn: 2.992 ± 0.478
0.776GlyPro: 0.776 ± 0.176
2.493GlyGln: 2.493 ± 0.419
3.435GlyArg: 3.435 ± 0.397
3.879GlySer: 3.879 ± 0.448
3.435GlyThr: 3.435 ± 0.476
3.768GlyVal: 3.768 ± 0.539
0.499GlyTrp: 0.499 ± 0.158
2.992GlyTyr: 2.992 ± 0.383
0.0GlyXaa: 0.0 ± 0.0
His
0.831HisAla: 0.831 ± 0.17
0.055HisCys: 0.055 ± 0.059
0.997HisAsp: 0.997 ± 0.239
0.997HisGlu: 0.997 ± 0.253
1.053HisPhe: 1.053 ± 0.253
1.496HisGly: 1.496 ± 0.242
0.609HisHis: 0.609 ± 0.197
1.33HisIle: 1.33 ± 0.23
1.053HisLys: 1.053 ± 0.264
1.939HisLeu: 1.939 ± 0.271
0.443HisMet: 0.443 ± 0.187
0.776HisAsn: 0.776 ± 0.207
1.108HisPro: 1.108 ± 0.226
1.053HisGln: 1.053 ± 0.26
0.776HisArg: 0.776 ± 0.208
1.274HisSer: 1.274 ± 0.256
1.053HisThr: 1.053 ± 0.279
1.053HisVal: 1.053 ± 0.21
0.388HisTrp: 0.388 ± 0.162
0.554HisTyr: 0.554 ± 0.187
0.0HisXaa: 0.0 ± 0.0
Ile
4.654IleAla: 4.654 ± 0.524
0.499IleCys: 0.499 ± 0.167
5.264IleAsp: 5.264 ± 0.463
4.322IleGlu: 4.322 ± 0.509
2.216IlePhe: 2.216 ± 0.344
4.433IleGly: 4.433 ± 0.523
1.053IleHis: 1.053 ± 0.262
4.433IleIle: 4.433 ± 0.632
4.71IleLys: 4.71 ± 0.48
6.039IleLeu: 6.039 ± 0.626
0.997IleMet: 0.997 ± 0.266
3.324IleAsn: 3.324 ± 0.462
3.047IlePro: 3.047 ± 0.416
2.383IleGln: 2.383 ± 0.314
2.826IleArg: 2.826 ± 0.32
4.987IleSer: 4.987 ± 0.616
4.654IleThr: 4.654 ± 0.659
4.71IleVal: 4.71 ± 0.579
0.942IleTrp: 0.942 ± 0.265
2.881IleTyr: 2.881 ± 0.397
0.0IleXaa: 0.0 ± 0.0
Lys
5.264LysAla: 5.264 ± 0.58
0.554LysCys: 0.554 ± 0.163
3.38LysAsp: 3.38 ± 0.433
4.931LysGlu: 4.931 ± 0.415
2.383LysPhe: 2.383 ± 0.39
4.654LysGly: 4.654 ± 0.689
1.773LysHis: 1.773 ± 0.355
5.153LysIle: 5.153 ± 0.472
4.82LysLys: 4.82 ± 0.646
6.372LysLeu: 6.372 ± 0.534
1.939LysMet: 1.939 ± 0.312
3.103LysAsn: 3.103 ± 0.397
2.327LysPro: 2.327 ± 0.271
3.435LysGln: 3.435 ± 0.482
3.602LysArg: 3.602 ± 0.466
4.543LysSer: 4.543 ± 0.515
4.488LysThr: 4.488 ± 0.378
4.82LysVal: 4.82 ± 0.557
1.108LysTrp: 1.108 ± 0.312
2.66LysTyr: 2.66 ± 0.459
0.0LysXaa: 0.0 ± 0.0
Leu
6.261LeuAla: 6.261 ± 0.64
0.499LeuCys: 0.499 ± 0.192
5.596LeuAsp: 5.596 ± 0.498
6.704LeuGlu: 6.704 ± 0.728
3.047LeuPhe: 3.047 ± 0.43
5.652LeuGly: 5.652 ± 0.573
1.884LeuHis: 1.884 ± 0.349
5.208LeuIle: 5.208 ± 0.513
6.704LeuLys: 6.704 ± 0.515
9.419LeuLeu: 9.419 ± 1.367
2.216LeuMet: 2.216 ± 0.334
4.599LeuAsn: 4.599 ± 0.618
3.546LeuPro: 3.546 ± 0.498
3.657LeuGln: 3.657 ± 0.429
3.657LeuArg: 3.657 ± 0.379
8.034LeuSer: 8.034 ± 0.683
6.649LeuThr: 6.649 ± 0.611
6.372LeuVal: 6.372 ± 0.705
0.72LeuTrp: 0.72 ± 0.174
3.602LeuTyr: 3.602 ± 0.455
0.0LeuXaa: 0.0 ± 0.0
Met
1.773MetAla: 1.773 ± 0.375
0.111MetCys: 0.111 ± 0.077
1.662MetAsp: 1.662 ± 0.378
1.441MetGlu: 1.441 ± 0.277
0.776MetPhe: 0.776 ± 0.222
1.662MetGly: 1.662 ± 0.395
0.111MetHis: 0.111 ± 0.079
1.33MetIle: 1.33 ± 0.279
1.828MetLys: 1.828 ± 0.287
1.274MetLeu: 1.274 ± 0.259
0.831MetMet: 0.831 ± 0.238
1.108MetAsn: 1.108 ± 0.225
0.443MetPro: 0.443 ± 0.169
0.887MetGln: 0.887 ± 0.218
1.219MetArg: 1.219 ± 0.264
1.995MetSer: 1.995 ± 0.378
2.105MetThr: 2.105 ± 0.354
1.385MetVal: 1.385 ± 0.277
0.166MetTrp: 0.166 ± 0.098
0.443MetTyr: 0.443 ± 0.14
0.0MetXaa: 0.0 ± 0.0
Asn
4.045AsnAla: 4.045 ± 0.646
0.443AsnCys: 0.443 ± 0.214
2.604AsnAsp: 2.604 ± 0.405
2.66AsnGlu: 2.66 ± 0.376
1.995AsnPhe: 1.995 ± 0.376
4.599AsnGly: 4.599 ± 0.433
1.219AsnHis: 1.219 ± 0.258
2.715AsnIle: 2.715 ± 0.356
3.214AsnLys: 3.214 ± 0.449
4.876AsnLeu: 4.876 ± 0.616
1.108AsnMet: 1.108 ± 0.262
2.327AsnAsn: 2.327 ± 0.364
2.383AsnPro: 2.383 ± 0.349
2.216AsnGln: 2.216 ± 0.283
2.105AsnArg: 2.105 ± 0.329
2.66AsnSer: 2.66 ± 0.416
2.272AsnThr: 2.272 ± 0.475
2.66AsnVal: 2.66 ± 0.448
0.665AsnTrp: 0.665 ± 0.2
0.942AsnTyr: 0.942 ± 0.243
0.0AsnXaa: 0.0 ± 0.0
Pro
1.551ProAla: 1.551 ± 0.241
0.554ProCys: 0.554 ± 0.181
2.105ProAsp: 2.105 ± 0.405
2.272ProGlu: 2.272 ± 0.486
1.607ProPhe: 1.607 ± 0.278
0.942ProGly: 0.942 ± 0.253
0.887ProHis: 0.887 ± 0.208
1.995ProIle: 1.995 ± 0.386
2.383ProLys: 2.383 ± 0.397
2.881ProLeu: 2.881 ± 0.374
0.554ProMet: 0.554 ± 0.141
1.385ProAsn: 1.385 ± 0.269
0.942ProPro: 0.942 ± 0.253
1.053ProGln: 1.053 ± 0.29
1.718ProArg: 1.718 ± 0.285
2.272ProSer: 2.272 ± 0.357
2.272ProThr: 2.272 ± 0.357
2.327ProVal: 2.327 ± 0.372
0.332ProTrp: 0.332 ± 0.115
1.607ProTyr: 1.607 ± 0.277
0.0ProXaa: 0.0 ± 0.0
Gln
3.491GlnAla: 3.491 ± 0.455
0.388GlnCys: 0.388 ± 0.162
2.05GlnAsp: 2.05 ± 0.321
3.712GlnGlu: 3.712 ± 0.428
1.551GlnPhe: 1.551 ± 0.229
2.216GlnGly: 2.216 ± 0.291
0.499GlnHis: 0.499 ± 0.133
2.881GlnIle: 2.881 ± 0.352
3.158GlnLys: 3.158 ± 0.473
4.377GlnLeu: 4.377 ± 0.508
1.053GlnMet: 1.053 ± 0.257
1.828GlnAsn: 1.828 ± 0.346
1.773GlnPro: 1.773 ± 0.321
1.828GlnGln: 1.828 ± 0.309
1.828GlnArg: 1.828 ± 0.396
2.272GlnSer: 2.272 ± 0.386
2.992GlnThr: 2.992 ± 0.595
3.768GlnVal: 3.768 ± 0.539
0.776GlnTrp: 0.776 ± 0.214
0.942GlnTyr: 0.942 ± 0.247
0.0GlnXaa: 0.0 ± 0.0
Arg
2.161ArgAla: 2.161 ± 0.363
0.499ArgCys: 0.499 ± 0.17
2.383ArgAsp: 2.383 ± 0.347
3.214ArgGlu: 3.214 ± 0.286
1.551ArgPhe: 1.551 ± 0.293
2.327ArgGly: 2.327 ± 0.334
0.499ArgHis: 0.499 ± 0.179
2.937ArgIle: 2.937 ± 0.423
3.324ArgLys: 3.324 ± 0.474
4.876ArgLeu: 4.876 ± 0.561
0.776ArgMet: 0.776 ± 0.225
2.604ArgAsn: 2.604 ± 0.325
1.274ArgPro: 1.274 ± 0.25
2.383ArgGln: 2.383 ± 0.342
1.718ArgArg: 1.718 ± 0.393
2.272ArgSer: 2.272 ± 0.362
3.435ArgThr: 3.435 ± 0.485
3.602ArgVal: 3.602 ± 0.402
0.997ArgTrp: 0.997 ± 0.249
1.718ArgTyr: 1.718 ± 0.367
0.0ArgXaa: 0.0 ± 0.0
Ser
4.377SerAla: 4.377 ± 0.48
0.609SerCys: 0.609 ± 0.206
4.156SerAsp: 4.156 ± 0.461
4.599SerGlu: 4.599 ± 0.478
2.937SerPhe: 2.937 ± 0.48
4.599SerGly: 4.599 ± 0.495
1.607SerHis: 1.607 ± 0.236
4.654SerIle: 4.654 ± 0.63
4.654SerLys: 4.654 ± 0.459
6.095SerLeu: 6.095 ± 0.709
1.164SerMet: 1.164 ± 0.199
2.77SerAsn: 2.77 ± 0.468
2.383SerPro: 2.383 ± 0.273
3.38SerGln: 3.38 ± 0.653
3.214SerArg: 3.214 ± 0.354
5.707SerSer: 5.707 ± 0.934
4.543SerThr: 4.543 ± 0.497
4.1SerVal: 4.1 ± 0.455
0.887SerTrp: 0.887 ± 0.186
2.937SerTyr: 2.937 ± 0.465
0.0SerXaa: 0.0 ± 0.0
Thr
4.211ThrAla: 4.211 ± 0.689
0.332ThrCys: 0.332 ± 0.13
2.77ThrAsp: 2.77 ± 0.348
4.71ThrGlu: 4.71 ± 0.473
2.438ThrPhe: 2.438 ± 0.428
4.1ThrGly: 4.1 ± 0.595
0.997ThrHis: 0.997 ± 0.255
4.931ThrIle: 4.931 ± 0.612
4.211ThrLys: 4.211 ± 0.421
5.596ThrLeu: 5.596 ± 0.531
1.385ThrMet: 1.385 ± 0.278
2.937ThrAsn: 2.937 ± 0.481
1.718ThrPro: 1.718 ± 0.378
2.493ThrGln: 2.493 ± 0.489
2.383ThrArg: 2.383 ± 0.387
5.208ThrSer: 5.208 ± 0.842
4.599ThrThr: 4.599 ± 0.662
5.375ThrVal: 5.375 ± 0.628
1.053ThrTrp: 1.053 ± 0.22
2.438ThrTyr: 2.438 ± 0.372
0.0ThrXaa: 0.0 ± 0.0
Val
4.266ValAla: 4.266 ± 0.445
0.554ValCys: 0.554 ± 0.171
3.712ValAsp: 3.712 ± 0.502
5.43ValGlu: 5.43 ± 0.655
2.992ValPhe: 2.992 ± 0.512
3.768ValGly: 3.768 ± 0.443
1.164ValHis: 1.164 ± 0.19
3.823ValIle: 3.823 ± 0.517
4.71ValLys: 4.71 ± 0.535
6.538ValLeu: 6.538 ± 0.525
1.496ValMet: 1.496 ± 0.264
2.604ValAsn: 2.604 ± 0.408
2.549ValPro: 2.549 ± 0.333
2.549ValGln: 2.549 ± 0.343
2.881ValArg: 2.881 ± 0.474
5.652ValSer: 5.652 ± 0.677
4.377ValThr: 4.377 ± 0.455
4.266ValVal: 4.266 ± 0.429
0.942ValTrp: 0.942 ± 0.22
2.327ValTyr: 2.327 ± 0.373
0.0ValXaa: 0.0 ± 0.0
Trp
0.997TrpAla: 0.997 ± 0.256
0.111TrpCys: 0.111 ± 0.079
0.609TrpAsp: 0.609 ± 0.163
1.164TrpGlu: 1.164 ± 0.272
0.776TrpPhe: 0.776 ± 0.219
0.72TrpGly: 0.72 ± 0.172
0.222TrpHis: 0.222 ± 0.111
0.887TrpIle: 0.887 ± 0.207
0.776TrpLys: 0.776 ± 0.224
1.164TrpLeu: 1.164 ± 0.235
0.332TrpMet: 0.332 ± 0.127
1.164TrpAsn: 1.164 ± 0.273
0.055TrpPro: 0.055 ± 0.05
0.665TrpGln: 0.665 ± 0.214
0.443TrpArg: 0.443 ± 0.166
0.942TrpSer: 0.942 ± 0.313
1.164TrpThr: 1.164 ± 0.274
0.887TrpVal: 0.887 ± 0.26
0.222TrpTrp: 0.222 ± 0.102
0.166TrpTyr: 0.166 ± 0.094
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.383TyrAla: 2.383 ± 0.298
0.776TyrCys: 0.776 ± 0.208
3.047TyrAsp: 3.047 ± 0.485
2.383TyrGlu: 2.383 ± 0.412
1.607TyrPhe: 1.607 ± 0.322
2.161TyrGly: 2.161 ± 0.404
0.887TyrHis: 0.887 ± 0.258
2.604TyrIle: 2.604 ± 0.432
1.939TyrLys: 1.939 ± 0.323
3.324TyrLeu: 3.324 ± 0.504
0.72TyrMet: 0.72 ± 0.203
1.718TyrAsn: 1.718 ± 0.36
1.441TyrPro: 1.441 ± 0.251
2.272TyrGln: 2.272 ± 0.323
2.05TyrArg: 2.05 ± 0.325
2.216TyrSer: 2.216 ± 0.481
2.383TyrThr: 2.383 ± 0.381
1.828TyrVal: 1.828 ± 0.354
0.388TyrTrp: 0.388 ± 0.144
1.385TyrTyr: 1.385 ± 0.337
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 64 proteins (18049 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski