Amino acid dipeptide frequency for Siphoviridae sp. ctdc_1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
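As an illustration of what a dipeptide frequency is, here is a minimal Python sketch. It is not the code used to produce this page. It assumes sequences in one-letter code, counts overlapping adjacent pairs within each protein, and normalizes by the total number of pairs; the function name `dipeptide_frequencies` and the toy sequences are hypothetical, and the exact normalization used by the database is not stated here.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: overlapping adjacent pairs are counted inside each protein,
    and pairs never span the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Scale to per mille (parts per thousand).
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Hypothetical toy input, not taken from this proteome.
toy_proteins = ["MAAG", "MAGA"]
freqs = dipeptide_frequencies(toy_proteins)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

The frequencies of all observed dipeptides sum to 1000 ‰ under this normalization.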

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
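The comparison against the uniform expectation can be sketched as follows. This is only an illustration: it assumes the threshold is exactly 1/400 (2.5 ‰), which follows from the text above but is not confirmed as the rule used for the coloring. The helper name `classify` is hypothetical; the three example values are taken from the table on this page.

```python
N_AMINO_ACIDS = 20
# Uniform expectation: 1 / (20 * 20) = 0.25 % = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / N_AMINO_ACIDS**2

def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if freq_per_mille > expected:
        return "over"      # would be shown in red
    if freq_per_mille < expected:
        return "under"     # would be shown in blue
    return "as expected"

# Values in per mille, copied from the table on this page.
examples = {"AlaAla": 10.015, "GluGlu": 2.489, "CysCys": 0.182}
for name, value in examples.items():
    print(name, value, classify(value))
```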

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Dipeptide frequency ± error (‰), grouped by first residue. Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 10.015 ± 1.263
AlaCys: 0.789 ± 0.21
AlaAsp: 6.434 ± 0.717
AlaGlu: 5.22 ± 0.657
AlaPhe: 3.46 ± 0.553
AlaGly: 7.951 ± 0.874
AlaHis: 1.821 ± 0.408
AlaIle: 4.067 ± 0.508
AlaLys: 3.642 ± 0.591
AlaLeu: 6.495 ± 0.635
AlaMet: 2.61 ± 0.438
AlaAsn: 3.096 ± 0.426
AlaPro: 3.581 ± 0.65
AlaGln: 4.552 ± 0.767
AlaArg: 4.552 ± 0.623
AlaSer: 5.645 ± 0.605
AlaThr: 4.917 ± 0.67
AlaVal: 6.555 ± 0.557
AlaTrp: 1.396 ± 0.313
AlaTyr: 2.246 ± 0.276
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 1.578 ± 0.342
CysCys: 0.182 ± 0.118
CysAsp: 0.789 ± 0.217
CysGlu: 1.578 ± 0.4
CysPhe: 0.486 ± 0.196
CysGly: 0.85 ± 0.243
CysHis: 0.425 ± 0.148
CysIle: 0.486 ± 0.137
CysLys: 0.546 ± 0.263
CysLeu: 0.85 ± 0.22
CysMet: 0.061 ± 0.059
CysAsn: 0.486 ± 0.157
CysPro: 0.91 ± 0.265
CysGln: 0.425 ± 0.151
CysArg: 0.91 ± 0.279
CysSer: 0.668 ± 0.267
CysThr: 0.971 ± 0.249
CysVal: 0.85 ± 0.225
CysTrp: 0.0 ± 0.0
CysTyr: 0.668 ± 0.174
CysXaa: 0.0 ± 0.0

Asp
AspAla: 5.463 ± 0.545
AspCys: 0.789 ± 0.266
AspAsp: 3.399 ± 0.567
AspGlu: 3.703 ± 0.389
AspPhe: 2.428 ± 0.39
AspGly: 5.524 ± 0.573
AspHis: 1.214 ± 0.272
AspIle: 4.127 ± 0.488
AspLys: 2.914 ± 0.323
AspLeu: 4.977 ± 0.527
AspMet: 1.821 ± 0.297
AspAsn: 2.914 ± 0.304
AspPro: 3.156 ± 0.559
AspGln: 2.124 ± 0.464
AspArg: 3.46 ± 0.493
AspSer: 3.217 ± 0.548
AspThr: 2.549 ± 0.458
AspVal: 4.795 ± 0.569
AspTrp: 1.032 ± 0.346
AspTyr: 2.185 ± 0.366
AspXaa: 0.0 ± 0.0

Glu
GluAla: 5.159 ± 0.627
GluCys: 0.971 ± 0.277
GluAsp: 2.61 ± 0.436
GluGlu: 2.489 ± 0.354
GluPhe: 1.882 ± 0.337
GluGly: 3.642 ± 0.62
GluHis: 0.971 ± 0.2
GluIle: 3.763 ± 0.48
GluLys: 3.096 ± 0.598
GluLeu: 5.099 ± 0.572
GluMet: 2.003 ± 0.365
GluAsn: 2.246 ± 0.331
GluPro: 1.214 ± 0.257
GluGln: 3.096 ± 0.522
GluArg: 3.703 ± 0.471
GluSer: 3.096 ± 0.482
GluThr: 3.156 ± 0.541
GluVal: 3.52 ± 0.33
GluTrp: 1.214 ± 0.263
GluTyr: 2.185 ± 0.338
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.124 ± 0.442
PheCys: 0.607 ± 0.186
PheAsp: 2.731 ± 0.461
PheGlu: 1.821 ± 0.31
PhePhe: 0.789 ± 0.235
PheGly: 2.914 ± 0.425
PheHis: 1.093 ± 0.348
PheIle: 2.246 ± 0.41
PheLys: 1.882 ± 0.29
PheLeu: 2.124 ± 0.454
PheMet: 1.093 ± 0.241
PheAsn: 2.185 ± 0.34
PhePro: 1.214 ± 0.27
PheGln: 0.728 ± 0.241
PheArg: 1.942 ± 0.348
PheSer: 2.671 ± 0.34
PheThr: 3.278 ± 0.589
PheVal: 2.307 ± 0.435
PheTrp: 0.486 ± 0.218
PheTyr: 1.457 ± 0.313
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 6.252 ± 0.829
GlyCys: 1.153 ± 0.28
GlyAsp: 4.674 ± 0.507
GlyGlu: 4.734 ± 0.482
GlyPhe: 3.217 ± 0.425
GlyGly: 6.555 ± 0.696
GlyHis: 1.214 ± 0.337
GlyIle: 3.156 ± 0.385
GlyLys: 3.763 ± 0.592
GlyLeu: 4.613 ± 0.493
GlyMet: 2.307 ± 0.412
GlyAsn: 3.46 ± 0.422
GlyPro: 1.214 ± 0.267
GlyGln: 3.399 ± 0.428
GlyArg: 3.945 ± 0.437
GlySer: 4.734 ± 0.742
GlyThr: 5.099 ± 0.602
GlyVal: 7.162 ± 0.63
GlyTrp: 1.396 ± 0.305
GlyTyr: 2.671 ± 0.396
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.275 ± 0.278
HisCys: 0.364 ± 0.152
HisAsp: 0.971 ± 0.255
HisGlu: 0.971 ± 0.257
HisPhe: 1.032 ± 0.342
HisGly: 1.396 ± 0.389
HisHis: 0.728 ± 0.246
HisIle: 1.032 ± 0.255
HisLys: 0.971 ± 0.289
HisLeu: 1.032 ± 0.283
HisMet: 0.486 ± 0.178
HisAsn: 0.728 ± 0.165
HisPro: 0.789 ± 0.236
HisGln: 0.607 ± 0.201
HisArg: 1.032 ± 0.323
HisSer: 1.093 ± 0.266
HisThr: 1.214 ± 0.28
HisVal: 2.064 ± 0.335
HisTrp: 0.85 ± 0.272
HisTyr: 0.668 ± 0.219
HisXaa: 0.0 ± 0.0

Ile
IleAla: 5.281 ± 0.586
IleCys: 0.85 ± 0.253
IleAsp: 3.824 ± 0.435
IleGlu: 2.914 ± 0.441
IlePhe: 1.396 ± 0.312
IleGly: 3.096 ± 0.319
IleHis: 0.85 ± 0.195
IleIle: 1.7 ± 0.322
IleLys: 3.278 ± 0.37
IleLeu: 2.671 ± 0.419
IleMet: 1.032 ± 0.284
IleAsn: 2.731 ± 0.337
IlePro: 2.853 ± 0.34
IleGln: 2.428 ± 0.416
IleArg: 3.703 ± 0.435
IleSer: 4.552 ± 0.491
IleThr: 4.31 ± 0.47
IleVal: 3.035 ± 0.445
IleTrp: 0.91 ± 0.188
IleTyr: 1.457 ± 0.313
IleXaa: 0.0 ± 0.0

Lys
LysAla: 5.645 ± 0.818
LysCys: 0.425 ± 0.172
LysAsp: 2.671 ± 0.354
LysGlu: 2.731 ± 0.422
LysPhe: 1.821 ± 0.361
LysGly: 3.156 ± 0.42
LysHis: 0.85 ± 0.223
LysIle: 2.731 ± 0.513
LysLys: 2.307 ± 0.459
LysLeu: 4.006 ± 0.498
LysMet: 1.335 ± 0.278
LysAsn: 1.821 ± 0.473
LysPro: 3.278 ± 0.499
LysGln: 1.942 ± 0.306
LysArg: 2.974 ± 0.417
LysSer: 3.824 ± 0.696
LysThr: 2.489 ± 0.379
LysVal: 3.035 ± 0.385
LysTrp: 0.486 ± 0.138
LysTyr: 2.307 ± 0.429
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 6.252 ± 0.815
LeuCys: 1.153 ± 0.268
LeuAsp: 4.37 ± 0.487
LeuGlu: 3.096 ± 0.479
LeuPhe: 2.124 ± 0.347
LeuGly: 4.31 ± 0.548
LeuHis: 1.76 ± 0.358
LeuIle: 3.763 ± 0.438
LeuLys: 4.188 ± 0.46
LeuLeu: 5.463 ± 0.526
LeuMet: 1.517 ± 0.288
LeuAsn: 4.006 ± 0.489
LeuPro: 3.46 ± 0.575
LeuGln: 3.156 ± 0.32
LeuArg: 4.552 ± 0.54
LeuSer: 6.434 ± 0.501
LeuThr: 5.159 ± 0.537
LeuVal: 4.431 ± 0.439
LeuTrp: 0.668 ± 0.211
LeuTyr: 3.035 ± 0.472
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.124 ± 0.311
MetCys: 0.182 ± 0.096
MetAsp: 1.093 ± 0.263
MetGlu: 1.275 ± 0.257
MetPhe: 0.971 ± 0.266
MetGly: 1.275 ± 0.274
MetHis: 0.182 ± 0.106
MetIle: 1.821 ± 0.349
MetLys: 1.882 ± 0.344
MetLeu: 2.246 ± 0.423
MetMet: 0.486 ± 0.174
MetAsn: 1.275 ± 0.263
MetPro: 1.093 ± 0.248
MetGln: 0.85 ± 0.206
MetArg: 1.578 ± 0.301
MetSer: 2.064 ± 0.321
MetThr: 1.942 ± 0.356
MetVal: 1.76 ± 0.33
MetTrp: 0.425 ± 0.146
MetTyr: 0.728 ± 0.212
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 4.188 ± 0.516
AsnCys: 0.668 ± 0.225
AsnAsp: 2.367 ± 0.358
AsnGlu: 2.367 ± 0.353
AsnPhe: 1.639 ± 0.326
AsnGly: 4.249 ± 0.523
AsnHis: 0.971 ± 0.249
AsnIle: 3.46 ± 0.438
AsnLys: 2.124 ± 0.33
AsnLeu: 3.278 ± 0.396
AsnMet: 0.971 ± 0.195
AsnAsn: 3.217 ± 0.556
AsnPro: 2.731 ± 0.457
AsnGln: 1.942 ± 0.463
AsnArg: 2.367 ± 0.491
AsnSer: 2.671 ± 0.351
AsnThr: 2.489 ± 0.466
AsnVal: 3.217 ± 0.413
AsnTrp: 0.607 ± 0.242
AsnTyr: 2.124 ± 0.362
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 5.402 ± 0.623
ProCys: 0.668 ± 0.246
ProAsp: 3.824 ± 0.509
ProGlu: 3.52 ± 0.517
ProPhe: 1.153 ± 0.238
ProGly: 3.824 ± 0.458
ProHis: 1.032 ± 0.249
ProIle: 1.214 ± 0.284
ProLys: 1.578 ± 0.366
ProLeu: 2.124 ± 0.362
ProMet: 1.032 ± 0.294
ProAsn: 1.821 ± 0.287
ProPro: 1.882 ± 0.363
ProGln: 1.275 ± 0.242
ProArg: 2.367 ± 0.342
ProSer: 2.853 ± 0.471
ProThr: 2.064 ± 0.405
ProVal: 4.856 ± 0.549
ProTrp: 0.425 ± 0.175
ProTyr: 1.639 ± 0.432
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 3.035 ± 0.524
GlnCys: 0.425 ± 0.179
GlnAsp: 2.064 ± 0.385
GlnGlu: 1.457 ± 0.361
GlnPhe: 1.942 ± 0.413
GlnGly: 2.489 ± 0.427
GlnHis: 0.91 ± 0.276
GlnIle: 3.763 ± 0.489
GlnLys: 2.064 ± 0.473
GlnLeu: 2.792 ± 0.482
GlnMet: 0.85 ± 0.242
GlnAsn: 2.671 ± 0.569
GlnPro: 2.489 ± 0.425
GlnGln: 3.52 ± 0.86
GlnArg: 2.428 ± 0.382
GlnSer: 3.338 ± 0.693
GlnThr: 2.185 ± 0.325
GlnVal: 3.035 ± 0.478
GlnTrp: 1.335 ± 0.33
GlnTyr: 1.457 ± 0.413
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 4.795 ± 0.474
ArgCys: 0.789 ± 0.249
ArgAsp: 4.188 ± 0.478
ArgGlu: 2.914 ± 0.493
ArgPhe: 2.064 ± 0.292
ArgGly: 2.974 ± 0.497
ArgHis: 0.91 ± 0.285
ArgIle: 3.642 ± 0.491
ArgLys: 3.035 ± 0.411
ArgLeu: 4.674 ± 0.517
ArgMet: 1.7 ± 0.347
ArgAsn: 3.035 ± 0.395
ArgPro: 1.942 ± 0.393
ArgGln: 2.307 ± 0.445
ArgArg: 3.581 ± 0.578
ArgSer: 3.399 ± 0.52
ArgThr: 3.642 ± 0.558
ArgVal: 4.006 ± 0.451
ArgTrp: 1.032 ± 0.249
ArgTyr: 2.246 ± 0.407
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 6.555 ± 0.692
SerCys: 0.728 ± 0.223
SerAsp: 4.492 ± 0.568
SerGlu: 3.52 ± 0.477
SerPhe: 2.428 ± 0.405
SerGly: 6.98 ± 0.733
SerHis: 0.789 ± 0.258
SerIle: 3.338 ± 0.476
SerLys: 2.61 ± 0.445
SerLeu: 5.22 ± 0.622
SerMet: 1.396 ± 0.252
SerAsn: 3.096 ± 0.448
SerPro: 3.156 ± 0.421
SerGln: 3.399 ± 0.6
SerArg: 3.338 ± 0.421
SerSer: 5.341 ± 0.721
SerThr: 4.31 ± 0.629
SerVal: 5.645 ± 0.682
SerTrp: 0.91 ± 0.262
SerTyr: 2.853 ± 0.419
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 5.341 ± 0.782
ThrCys: 0.971 ± 0.273
ThrAsp: 3.703 ± 0.414
ThrGlu: 3.217 ± 0.435
ThrPhe: 2.428 ± 0.424
ThrGly: 6.252 ± 0.757
ThrHis: 1.093 ± 0.253
ThrIle: 2.61 ± 0.339
ThrLys: 2.853 ± 0.509
ThrLeu: 5.402 ± 0.533
ThrMet: 0.85 ± 0.227
ThrAsn: 2.792 ± 0.446
ThrPro: 4.006 ± 0.858
ThrGln: 2.307 ± 0.418
ThrArg: 3.156 ± 0.413
ThrSer: 4.249 ± 0.764
ThrThr: 3.399 ± 0.545
ThrVal: 4.795 ± 0.715
ThrTrp: 0.971 ± 0.274
ThrTyr: 1.7 ± 0.422
ThrXaa: 0.0 ± 0.0

Val
ValAla: 5.584 ± 0.666
ValCys: 1.032 ± 0.213
ValAsp: 3.885 ± 0.499
ValGlu: 4.31 ± 0.496
ValPhe: 2.489 ± 0.401
ValGly: 3.945 ± 0.455
ValHis: 1.153 ± 0.285
ValIle: 3.945 ± 0.531
ValLys: 4.006 ± 0.614
ValLeu: 5.22 ± 0.541
ValMet: 2.61 ± 0.411
ValAsn: 3.885 ± 0.532
ValPro: 3.581 ± 0.511
ValGln: 3.338 ± 0.457
ValArg: 4.249 ± 0.523
ValSer: 6.313 ± 0.686
ValThr: 5.159 ± 0.531
ValVal: 3.763 ± 0.516
ValTrp: 1.457 ± 0.362
ValTyr: 1.942 ± 0.348
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.91 ± 0.239
TrpCys: 0.364 ± 0.159
TrpAsp: 1.214 ± 0.266
TrpGlu: 0.971 ± 0.237
TrpPhe: 0.85 ± 0.17
TrpGly: 0.971 ± 0.197
TrpHis: 0.364 ± 0.127
TrpIle: 0.486 ± 0.188
TrpLys: 0.85 ± 0.239
TrpLeu: 1.639 ± 0.331
TrpMet: 0.364 ± 0.135
TrpAsn: 0.668 ± 0.189
TrpPro: 0.789 ± 0.248
TrpGln: 1.153 ± 0.266
TrpArg: 1.275 ± 0.28
TrpSer: 0.85 ± 0.206
TrpThr: 0.607 ± 0.183
TrpVal: 1.032 ± 0.307
TrpTrp: 0.364 ± 0.148
TrpTyr: 0.91 ± 0.212
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.064 ± 0.349
TyrCys: 0.607 ± 0.269
TyrAsp: 2.489 ± 0.512
TyrGlu: 2.307 ± 0.511
TyrPhe: 1.093 ± 0.247
TyrGly: 2.367 ± 0.395
TyrHis: 0.971 ± 0.284
TyrIle: 1.517 ± 0.301
TyrLys: 2.246 ± 0.357
TyrLeu: 2.974 ± 0.396
TyrMet: 0.728 ± 0.189
TyrAsn: 1.517 ± 0.372
TyrPro: 1.214 ± 0.282
TyrGln: 1.7 ± 0.287
TyrArg: 1.76 ± 0.415
TyrSer: 3.096 ± 0.388
TyrThr: 3.156 ± 0.43
TyrVal: 2.003 ± 0.439
TyrTrp: 0.668 ± 0.234
TyrTyr: 1.275 ± 0.265
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 66 proteins (16476 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
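A protein-level bootstrap can be sketched as below. This is an assumption-laden illustration, not the database's implementation: it resamples whole proteins with replacement 100 times and reports the standard deviation of the resampled frequencies as the error. The note above states only that bootstrapping was done 100 times at the protein level, so the choice of standard deviation, the function names, and the toy sequences are all hypothetical.

```python
import random
import statistics

def pair_frequency(proteins, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    hits = 0
    total = 0
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            total += 1
            if first + second == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the frequency when whole proteins are resampled.

    Assumption: the error is the sample standard deviation over resamples.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)

# Hypothetical toy proteome, not taken from this page.
toy_proteins = ["MAAG", "MAGA", "MKKA", "MAAAK"]
print(pair_frequency(toy_proteins, "AA"), bootstrap_error(toy_proteins, "AA"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so dipeptides that cluster in a few proteins show a correspondingly larger error.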

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski