Amino acid dipepetide frequency for Shigella phage vB_SsoS_008

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.999AlaAla: 7.999 ± 1.412
1.155AlaCys: 1.155 ± 0.333
3.958AlaAsp: 3.958 ± 0.618
5.608AlaGlu: 5.608 ± 0.765
2.886AlaPhe: 2.886 ± 0.498
5.195AlaGly: 5.195 ± 0.696
1.484AlaHis: 1.484 ± 0.331
6.103AlaIle: 6.103 ± 0.829
6.02AlaLys: 6.02 ± 0.939
5.938AlaLeu: 5.938 ± 0.649
2.392AlaMet: 2.392 ± 0.444
3.051AlaAsn: 3.051 ± 0.44
1.732AlaPro: 1.732 ± 0.423
3.381AlaGln: 3.381 ± 0.758
5.031AlaArg: 5.031 ± 0.551
6.103AlaSer: 6.103 ± 0.907
4.453AlaThr: 4.453 ± 0.776
5.113AlaVal: 5.113 ± 0.723
1.237AlaTrp: 1.237 ± 0.322
2.556AlaTyr: 2.556 ± 0.454
0.0AlaXaa: 0.0 ± 0.0
Cys
0.825CysAla: 0.825 ± 0.246
0.577CysCys: 0.577 ± 0.245
0.907CysAsp: 0.907 ± 0.266
0.907CysGlu: 0.907 ± 0.228
0.495CysPhe: 0.495 ± 0.257
1.402CysGly: 1.402 ± 0.371
0.33CysHis: 0.33 ± 0.153
1.072CysIle: 1.072 ± 0.267
0.412CysLys: 0.412 ± 0.226
0.907CysLeu: 0.907 ± 0.305
0.66CysMet: 0.66 ± 0.211
0.99CysAsn: 0.99 ± 0.307
0.66CysPro: 0.66 ± 0.232
0.66CysGln: 0.66 ± 0.225
0.742CysArg: 0.742 ± 0.241
0.99CysSer: 0.99 ± 0.32
0.907CysThr: 0.907 ± 0.284
0.66CysVal: 0.66 ± 0.259
0.165CysTrp: 0.165 ± 0.115
0.577CysTyr: 0.577 ± 0.189
0.0CysXaa: 0.0 ± 0.0
Asp
5.525AspAla: 5.525 ± 0.855
0.495AspCys: 0.495 ± 0.188
4.123AspAsp: 4.123 ± 0.639
4.206AspGlu: 4.206 ± 0.6
2.392AspPhe: 2.392 ± 0.489
5.69AspGly: 5.69 ± 0.757
0.99AspHis: 0.99 ± 0.246
3.546AspIle: 3.546 ± 0.494
3.629AspLys: 3.629 ± 0.587
4.041AspLeu: 4.041 ± 0.689
1.732AspMet: 1.732 ± 0.416
3.381AspAsn: 3.381 ± 0.57
2.144AspPro: 2.144 ± 0.447
1.732AspGln: 1.732 ± 0.345
2.474AspArg: 2.474 ± 0.473
3.958AspSer: 3.958 ± 0.649
2.144AspThr: 2.144 ± 0.439
3.958AspVal: 3.958 ± 0.606
0.66AspTrp: 0.66 ± 0.184
1.319AspTyr: 1.319 ± 0.299
0.0AspXaa: 0.0 ± 0.0
Glu
4.618GluAla: 4.618 ± 0.688
0.907GluCys: 0.907 ± 0.271
3.051GluAsp: 3.051 ± 0.581
3.711GluGlu: 3.711 ± 0.458
3.134GluPhe: 3.134 ± 0.52
2.721GluGly: 2.721 ± 0.419
1.649GluHis: 1.649 ± 0.346
5.855GluIle: 5.855 ± 0.618
4.618GluLys: 4.618 ± 0.715
5.113GluLeu: 5.113 ± 0.571
2.392GluMet: 2.392 ± 0.47
2.969GluAsn: 2.969 ± 0.436
1.402GluPro: 1.402 ± 0.4
2.556GluGln: 2.556 ± 0.417
3.464GluArg: 3.464 ± 0.767
4.453GluSer: 4.453 ± 0.736
3.216GluThr: 3.216 ± 0.555
4.783GluVal: 4.783 ± 0.597
0.495GluTrp: 0.495 ± 0.177
1.897GluTyr: 1.897 ± 0.388
0.0GluXaa: 0.0 ± 0.0
Phe
2.309PheAla: 2.309 ± 0.464
0.742PheCys: 0.742 ± 0.259
2.969PheAsp: 2.969 ± 0.566
2.721PheGlu: 2.721 ± 0.435
1.649PhePhe: 1.649 ± 0.444
3.299PheGly: 3.299 ± 0.663
1.155PheHis: 1.155 ± 0.39
2.886PheIle: 2.886 ± 0.447
2.886PheLys: 2.886 ± 0.57
1.897PheLeu: 1.897 ± 0.373
1.814PheMet: 1.814 ± 0.362
2.144PheAsn: 2.144 ± 0.535
1.237PhePro: 1.237 ± 0.394
0.99PheGln: 0.99 ± 0.283
2.474PheArg: 2.474 ± 0.477
2.392PheSer: 2.392 ± 0.503
2.556PheThr: 2.556 ± 0.451
2.309PheVal: 2.309 ± 0.396
0.66PheTrp: 0.66 ± 0.259
1.319PheTyr: 1.319 ± 0.297
0.0PheXaa: 0.0 ± 0.0
Gly
4.866GlyAla: 4.866 ± 1.08
1.237GlyCys: 1.237 ± 0.31
4.288GlyAsp: 4.288 ± 0.654
3.794GlyGlu: 3.794 ± 0.612
2.309GlyPhe: 2.309 ± 0.419
5.608GlyGly: 5.608 ± 1.319
0.907GlyHis: 0.907 ± 0.318
3.464GlyIle: 3.464 ± 0.526
5.938GlyLys: 5.938 ± 0.771
4.536GlyLeu: 4.536 ± 0.548
2.804GlyMet: 2.804 ± 0.594
4.206GlyAsn: 4.206 ± 0.587
0.907GlyPro: 0.907 ± 0.332
2.309GlyGln: 2.309 ± 0.52
3.134GlyArg: 3.134 ± 0.561
4.948GlySer: 4.948 ± 0.618
3.546GlyThr: 3.546 ± 0.445
4.783GlyVal: 4.783 ± 0.626
1.072GlyTrp: 1.072 ± 0.285
2.227GlyTyr: 2.227 ± 0.447
0.0GlyXaa: 0.0 ± 0.0
His
1.814HisAla: 1.814 ± 0.483
0.412HisCys: 0.412 ± 0.156
1.319HisAsp: 1.319 ± 0.33
0.907HisGlu: 0.907 ± 0.28
0.99HisPhe: 0.99 ± 0.288
1.649HisGly: 1.649 ± 0.352
0.742HisHis: 0.742 ± 0.245
1.155HisIle: 1.155 ± 0.26
1.402HisLys: 1.402 ± 0.323
1.484HisLeu: 1.484 ± 0.382
0.495HisMet: 0.495 ± 0.192
0.99HisAsn: 0.99 ± 0.301
0.99HisPro: 0.99 ± 0.278
0.742HisGln: 0.742 ± 0.223
0.99HisArg: 0.99 ± 0.299
1.484HisSer: 1.484 ± 0.383
1.237HisThr: 1.237 ± 0.332
0.825HisVal: 0.825 ± 0.262
0.33HisTrp: 0.33 ± 0.163
0.495HisTyr: 0.495 ± 0.195
0.0HisXaa: 0.0 ± 0.0
Ile
6.185IleAla: 6.185 ± 0.637
0.577IleCys: 0.577 ± 0.235
4.701IleAsp: 4.701 ± 0.624
4.206IleGlu: 4.206 ± 0.515
2.392IlePhe: 2.392 ± 0.462
4.123IleGly: 4.123 ± 0.532
1.072IleHis: 1.072 ± 0.3
3.958IleIle: 3.958 ± 0.536
4.536IleLys: 4.536 ± 0.62
3.876IleLeu: 3.876 ± 0.561
2.392IleMet: 2.392 ± 0.439
3.794IleAsn: 3.794 ± 0.49
2.144IlePro: 2.144 ± 0.458
2.309IleGln: 2.309 ± 0.345
3.051IleArg: 3.051 ± 0.385
5.278IleSer: 5.278 ± 0.556
4.866IleThr: 4.866 ± 0.605
3.464IleVal: 3.464 ± 0.525
0.825IleTrp: 0.825 ± 0.251
2.227IleTyr: 2.227 ± 0.38
0.0IleXaa: 0.0 ± 0.0
Lys
6.515LysAla: 6.515 ± 0.818
0.907LysCys: 0.907 ± 0.28
3.464LysAsp: 3.464 ± 0.587
5.031LysGlu: 5.031 ± 0.601
2.309LysPhe: 2.309 ± 0.471
3.546LysGly: 3.546 ± 0.558
1.072LysHis: 1.072 ± 0.322
4.371LysIle: 4.371 ± 0.599
3.464LysLys: 3.464 ± 0.542
5.855LysLeu: 5.855 ± 0.657
2.804LysMet: 2.804 ± 0.557
2.886LysAsn: 2.886 ± 0.461
3.381LysPro: 3.381 ± 0.632
2.639LysGln: 2.639 ± 0.421
3.794LysArg: 3.794 ± 0.543
3.876LysSer: 3.876 ± 0.585
3.876LysThr: 3.876 ± 0.595
4.206LysVal: 4.206 ± 0.67
0.495LysTrp: 0.495 ± 0.205
2.062LysTyr: 2.062 ± 0.454
0.0LysXaa: 0.0 ± 0.0
Leu
5.031LeuAla: 5.031 ± 0.721
1.072LeuCys: 1.072 ± 0.27
3.546LeuAsp: 3.546 ± 0.506
3.794LeuGlu: 3.794 ± 0.504
2.804LeuPhe: 2.804 ± 0.53
3.711LeuGly: 3.711 ± 0.588
1.319LeuHis: 1.319 ± 0.315
5.113LeuIle: 5.113 ± 0.591
5.278LeuLys: 5.278 ± 0.647
5.278LeuLeu: 5.278 ± 0.745
2.144LeuMet: 2.144 ± 0.448
2.969LeuAsn: 2.969 ± 0.48
3.381LeuPro: 3.381 ± 0.511
1.897LeuGln: 1.897 ± 0.373
4.288LeuArg: 4.288 ± 0.839
6.103LeuSer: 6.103 ± 0.772
5.195LeuThr: 5.195 ± 0.627
4.041LeuVal: 4.041 ± 0.56
0.907LeuTrp: 0.907 ± 0.236
2.309LeuTyr: 2.309 ± 0.434
0.0LeuXaa: 0.0 ± 0.0
Met
2.144MetAla: 2.144 ± 0.441
0.412MetCys: 0.412 ± 0.17
1.732MetAsp: 1.732 ± 0.333
2.639MetGlu: 2.639 ± 0.469
1.319MetPhe: 1.319 ± 0.343
0.99MetGly: 0.99 ± 0.261
0.742MetHis: 0.742 ± 0.217
3.216MetIle: 3.216 ± 0.449
2.227MetLys: 2.227 ± 0.419
2.804MetLeu: 2.804 ± 0.417
2.144MetMet: 2.144 ± 0.491
1.319MetAsn: 1.319 ± 0.287
0.907MetPro: 0.907 ± 0.269
1.402MetGln: 1.402 ± 0.316
2.556MetArg: 2.556 ± 0.459
2.392MetSer: 2.392 ± 0.506
1.319MetThr: 1.319 ± 0.346
2.556MetVal: 2.556 ± 0.394
0.33MetTrp: 0.33 ± 0.172
0.577MetTyr: 0.577 ± 0.229
0.0MetXaa: 0.0 ± 0.0
Asn
3.876AsnAla: 3.876 ± 0.679
0.495AsnCys: 0.495 ± 0.206
2.639AsnAsp: 2.639 ± 0.46
2.721AsnGlu: 2.721 ± 0.414
2.721AsnPhe: 2.721 ± 0.443
4.536AsnGly: 4.536 ± 0.654
1.402AsnHis: 1.402 ± 0.3
3.299AsnIle: 3.299 ± 0.418
3.629AsnLys: 3.629 ± 0.635
2.639AsnLeu: 2.639 ± 0.41
1.484AsnMet: 1.484 ± 0.386
3.381AsnAsn: 3.381 ± 0.678
2.062AsnPro: 2.062 ± 0.425
3.051AsnGln: 3.051 ± 0.444
2.227AsnArg: 2.227 ± 0.392
2.969AsnSer: 2.969 ± 0.606
1.814AsnThr: 1.814 ± 0.437
3.381AsnVal: 3.381 ± 0.481
0.825AsnTrp: 0.825 ± 0.26
1.649AsnTyr: 1.649 ± 0.371
0.0AsnXaa: 0.0 ± 0.0
Pro
2.721ProAla: 2.721 ± 0.481
0.495ProCys: 0.495 ± 0.213
2.309ProAsp: 2.309 ± 0.453
3.051ProGlu: 3.051 ± 0.53
1.319ProPhe: 1.319 ± 0.335
2.556ProGly: 2.556 ± 0.503
0.907ProHis: 0.907 ± 0.263
2.144ProIle: 2.144 ± 0.417
1.567ProLys: 1.567 ± 0.363
1.979ProLeu: 1.979 ± 0.399
0.742ProMet: 0.742 ± 0.23
1.979ProAsn: 1.979 ± 0.36
1.155ProPro: 1.155 ± 0.365
1.237ProGln: 1.237 ± 0.359
1.732ProArg: 1.732 ± 0.351
3.216ProSer: 3.216 ± 0.539
1.319ProThr: 1.319 ± 0.297
2.804ProVal: 2.804 ± 0.53
0.247ProTrp: 0.247 ± 0.15
1.567ProTyr: 1.567 ± 0.326
0.0ProXaa: 0.0 ± 0.0
Gln
3.299GlnAla: 3.299 ± 0.617
0.412GlnCys: 0.412 ± 0.183
1.567GlnAsp: 1.567 ± 0.425
2.392GlnGlu: 2.392 ± 0.388
1.402GlnPhe: 1.402 ± 0.363
2.309GlnGly: 2.309 ± 0.517
0.66GlnHis: 0.66 ± 0.212
3.051GlnIle: 3.051 ± 0.557
2.062GlnLys: 2.062 ± 0.372
3.299GlnLeu: 3.299 ± 0.545
1.237GlnMet: 1.237 ± 0.285
1.897GlnAsn: 1.897 ± 0.332
1.319GlnPro: 1.319 ± 0.358
2.721GlnGln: 2.721 ± 0.8
2.639GlnArg: 2.639 ± 0.494
2.969GlnSer: 2.969 ± 0.633
1.732GlnThr: 1.732 ± 0.508
2.721GlnVal: 2.721 ± 0.439
0.66GlnTrp: 0.66 ± 0.236
1.649GlnTyr: 1.649 ± 0.362
0.0GlnXaa: 0.0 ± 0.0
Arg
4.123ArgAla: 4.123 ± 0.617
1.567ArgCys: 1.567 ± 0.472
2.639ArgAsp: 2.639 ± 0.518
4.206ArgGlu: 4.206 ± 0.612
3.216ArgPhe: 3.216 ± 0.445
3.464ArgGly: 3.464 ± 0.488
0.495ArgHis: 0.495 ± 0.206
2.392ArgIle: 2.392 ± 0.447
4.041ArgLys: 4.041 ± 0.501
4.288ArgLeu: 4.288 ± 0.73
1.814ArgMet: 1.814 ± 0.433
2.309ArgAsn: 2.309 ± 0.447
2.144ArgPro: 2.144 ± 0.532
2.144ArgGln: 2.144 ± 0.572
4.123ArgArg: 4.123 ± 0.624
3.876ArgSer: 3.876 ± 0.54
1.567ArgThr: 1.567 ± 0.449
4.783ArgVal: 4.783 ± 0.57
0.66ArgTrp: 0.66 ± 0.242
2.474ArgTyr: 2.474 ± 0.42
0.0ArgXaa: 0.0 ± 0.0
Ser
6.597SerAla: 6.597 ± 1.002
1.155SerCys: 1.155 ± 0.306
4.041SerAsp: 4.041 ± 0.597
4.371SerGlu: 4.371 ± 0.723
2.886SerPhe: 2.886 ± 0.464
6.185SerGly: 6.185 ± 0.866
1.402SerHis: 1.402 ± 0.335
4.123SerIle: 4.123 ± 0.669
2.886SerLys: 2.886 ± 0.453
6.927SerLeu: 6.927 ± 1.029
1.649SerMet: 1.649 ± 0.329
3.216SerAsn: 3.216 ± 0.607
2.144SerPro: 2.144 ± 0.548
3.299SerGln: 3.299 ± 0.567
4.371SerArg: 4.371 ± 0.591
5.278SerSer: 5.278 ± 0.734
4.041SerThr: 4.041 ± 0.515
4.948SerVal: 4.948 ± 0.57
1.072SerTrp: 1.072 ± 0.263
2.392SerTyr: 2.392 ± 0.508
0.0SerXaa: 0.0 ± 0.0
Thr
3.876ThrAla: 3.876 ± 0.73
1.155ThrCys: 1.155 ± 0.306
3.381ThrAsp: 3.381 ± 0.509
3.051ThrGlu: 3.051 ± 0.562
1.649ThrPhe: 1.649 ± 0.404
4.123ThrGly: 4.123 ± 0.507
1.237ThrHis: 1.237 ± 0.32
3.546ThrIle: 3.546 ± 0.533
2.639ThrLys: 2.639 ± 0.456
3.711ThrLeu: 3.711 ± 0.547
1.484ThrMet: 1.484 ± 0.315
2.474ThrAsn: 2.474 ± 0.457
3.216ThrPro: 3.216 ± 0.619
2.144ThrGln: 2.144 ± 0.421
2.392ThrArg: 2.392 ± 0.47
4.866ThrSer: 4.866 ± 0.684
2.392ThrThr: 2.392 ± 0.423
3.299ThrVal: 3.299 ± 0.612
0.907ThrTrp: 0.907 ± 0.297
1.237ThrTyr: 1.237 ± 0.297
0.0ThrXaa: 0.0 ± 0.0
Val
4.948ValAla: 4.948 ± 0.637
0.742ValCys: 0.742 ± 0.244
4.371ValAsp: 4.371 ± 0.599
3.134ValGlu: 3.134 ± 0.479
2.556ValPhe: 2.556 ± 0.521
2.639ValGly: 2.639 ± 0.531
1.567ValHis: 1.567 ± 0.347
4.288ValIle: 4.288 ± 0.648
6.597ValLys: 6.597 ± 0.892
2.639ValLeu: 2.639 ± 0.441
2.309ValMet: 2.309 ± 0.526
4.123ValAsn: 4.123 ± 0.597
2.804ValPro: 2.804 ± 0.483
2.639ValGln: 2.639 ± 0.419
3.711ValArg: 3.711 ± 0.545
4.948ValSer: 4.948 ± 0.705
3.629ValThr: 3.629 ± 0.705
4.206ValVal: 4.206 ± 0.803
1.237ValTrp: 1.237 ± 0.235
2.309ValTyr: 2.309 ± 0.51
0.0ValXaa: 0.0 ± 0.0
Trp
0.825TrpAla: 0.825 ± 0.287
0.165TrpCys: 0.165 ± 0.101
0.825TrpAsp: 0.825 ± 0.238
0.907TrpGlu: 0.907 ± 0.253
0.412TrpPhe: 0.412 ± 0.275
0.577TrpGly: 0.577 ± 0.19
0.577TrpHis: 0.577 ± 0.191
0.742TrpIle: 0.742 ± 0.201
0.907TrpLys: 0.907 ± 0.32
0.825TrpLeu: 0.825 ± 0.253
0.412TrpMet: 0.412 ± 0.188
0.577TrpAsn: 0.577 ± 0.209
0.33TrpPro: 0.33 ± 0.143
0.412TrpGln: 0.412 ± 0.166
1.072TrpArg: 1.072 ± 0.282
1.155TrpSer: 1.155 ± 0.338
0.907TrpThr: 0.907 ± 0.303
0.907TrpVal: 0.907 ± 0.245
0.165TrpTrp: 0.165 ± 0.113
0.33TrpTyr: 0.33 ± 0.136
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.299TyrAla: 3.299 ± 0.501
0.165TyrCys: 0.165 ± 0.111
2.392TyrAsp: 2.392 ± 0.481
1.649TyrGlu: 1.649 ± 0.388
1.567TyrPhe: 1.567 ± 0.413
2.556TyrGly: 2.556 ± 0.416
0.742TyrHis: 0.742 ± 0.242
1.649TyrIle: 1.649 ± 0.326
1.814TyrLys: 1.814 ± 0.312
2.144TyrLeu: 2.144 ± 0.374
0.742TyrMet: 0.742 ± 0.224
2.144TyrAsn: 2.144 ± 0.517
0.99TyrPro: 0.99 ± 0.269
1.732TyrGln: 1.732 ± 0.359
2.062TyrArg: 2.062 ± 0.426
1.732TyrSer: 1.732 ± 0.372
2.062TyrThr: 2.062 ± 0.48
1.649TyrVal: 1.649 ± 0.451
0.082TyrTrp: 0.082 ± 0.088
0.907TyrTyr: 0.907 ± 0.273
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 83 proteins (12127 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski