Amino acid dipepetide frequency for Salmonella phage vB_SemP_Emek

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.873AlaAla: 9.873 ± 1.35
1.079AlaCys: 1.079 ± 0.325
6.305AlaAsp: 6.305 ± 0.839
7.384AlaGlu: 7.384 ± 0.872
2.904AlaPhe: 2.904 ± 0.436
6.803AlaGly: 6.803 ± 0.766
0.996AlaHis: 0.996 ± 0.278
5.061AlaIle: 5.061 ± 0.717
5.144AlaLys: 5.144 ± 0.607
6.886AlaLeu: 6.886 ± 0.864
3.982AlaMet: 3.982 ± 0.494
5.31AlaAsn: 5.31 ± 0.844
2.406AlaPro: 2.406 ± 0.483
3.236AlaGln: 3.236 ± 0.649
6.554AlaArg: 6.554 ± 0.723
4.646AlaSer: 4.646 ± 0.731
6.057AlaThr: 6.057 ± 0.865
5.31AlaVal: 5.31 ± 0.538
1.659AlaTrp: 1.659 ± 0.392
2.655AlaTyr: 2.655 ± 0.492
0.0AlaXaa: 0.0 ± 0.0
Cys
0.581CysAla: 0.581 ± 0.233
0.249CysCys: 0.249 ± 0.152
0.747CysAsp: 0.747 ± 0.258
0.498CysGlu: 0.498 ± 0.203
0.581CysPhe: 0.581 ± 0.209
1.493CysGly: 1.493 ± 0.441
0.415CysHis: 0.415 ± 0.258
0.664CysIle: 0.664 ± 0.225
0.747CysLys: 0.747 ± 0.308
0.498CysLeu: 0.498 ± 0.233
0.415CysMet: 0.415 ± 0.193
0.498CysAsn: 0.498 ± 0.204
0.332CysPro: 0.332 ± 0.159
0.498CysGln: 0.498 ± 0.191
1.162CysArg: 1.162 ± 0.33
0.581CysSer: 0.581 ± 0.211
0.664CysThr: 0.664 ± 0.241
0.747CysVal: 0.747 ± 0.294
0.166CysTrp: 0.166 ± 0.12
0.581CysTyr: 0.581 ± 0.206
0.0CysXaa: 0.0 ± 0.0
Asp
7.799AspAla: 7.799 ± 0.803
0.415AspCys: 0.415 ± 0.209
4.397AspAsp: 4.397 ± 0.825
4.314AspGlu: 4.314 ± 0.59
1.327AspPhe: 1.327 ± 0.332
4.48AspGly: 4.48 ± 0.748
0.747AspHis: 0.747 ± 0.292
4.148AspIle: 4.148 ± 0.505
3.485AspLys: 3.485 ± 0.529
4.646AspLeu: 4.646 ± 0.565
1.327AspMet: 1.327 ± 0.337
2.24AspAsn: 2.24 ± 0.502
1.825AspPro: 1.825 ± 0.424
1.327AspGln: 1.327 ± 0.298
2.157AspArg: 2.157 ± 0.431
3.816AspSer: 3.816 ± 0.475
1.576AspThr: 1.576 ± 0.306
5.144AspVal: 5.144 ± 0.732
1.327AspTrp: 1.327 ± 0.434
2.821AspTyr: 2.821 ± 0.377
0.0AspXaa: 0.0 ± 0.0
Glu
5.31GluAla: 5.31 ± 0.626
1.327GluCys: 1.327 ± 0.329
2.987GluAsp: 2.987 ± 0.502
4.729GluGlu: 4.729 ± 0.81
1.991GluPhe: 1.991 ± 0.434
3.402GluGly: 3.402 ± 0.462
1.245GluHis: 1.245 ± 0.307
4.397GluIle: 4.397 ± 0.608
4.397GluLys: 4.397 ± 0.671
6.969GluLeu: 6.969 ± 0.822
2.323GluMet: 2.323 ± 0.543
2.572GluAsn: 2.572 ± 0.607
2.572GluPro: 2.572 ± 0.453
4.397GluGln: 4.397 ± 0.54
4.231GluArg: 4.231 ± 0.539
3.899GluSer: 3.899 ± 0.567
2.572GluThr: 2.572 ± 0.394
4.231GluVal: 4.231 ± 0.694
1.908GluTrp: 1.908 ± 0.427
2.323GluTyr: 2.323 ± 0.454
0.0GluXaa: 0.0 ± 0.0
Phe
2.821PheAla: 2.821 ± 0.451
0.498PheCys: 0.498 ± 0.25
2.323PheAsp: 2.323 ± 0.471
1.327PheGlu: 1.327 ± 0.321
1.245PhePhe: 1.245 ± 0.352
2.489PheGly: 2.489 ± 0.378
0.498PheHis: 0.498 ± 0.18
2.323PheIle: 2.323 ± 0.546
1.659PheLys: 1.659 ± 0.357
2.157PheLeu: 2.157 ± 0.458
0.996PheMet: 0.996 ± 0.257
1.991PheAsn: 1.991 ± 0.336
1.327PhePro: 1.327 ± 0.279
1.162PheGln: 1.162 ± 0.408
1.742PheArg: 1.742 ± 0.427
3.236PheSer: 3.236 ± 0.512
1.659PheThr: 1.659 ± 0.324
1.245PheVal: 1.245 ± 0.246
0.747PheTrp: 0.747 ± 0.192
0.747PheTyr: 0.747 ± 0.277
0.0PheXaa: 0.0 ± 0.0
Gly
5.559GlyAla: 5.559 ± 0.823
0.581GlyCys: 0.581 ± 0.222
3.816GlyAsp: 3.816 ± 0.687
4.314GlyGlu: 4.314 ± 0.626
3.153GlyPhe: 3.153 ± 0.528
5.061GlyGly: 5.061 ± 0.805
1.245GlyHis: 1.245 ± 0.326
5.31GlyIle: 5.31 ± 0.664
4.895GlyLys: 4.895 ± 0.61
5.061GlyLeu: 5.061 ± 0.894
2.406GlyMet: 2.406 ± 0.524
3.236GlyAsn: 3.236 ± 0.551
1.079GlyPro: 1.079 ± 0.328
3.982GlyGln: 3.982 ± 0.711
4.895GlyArg: 4.895 ± 0.604
3.236GlySer: 3.236 ± 0.553
4.729GlyThr: 4.729 ± 0.796
4.812GlyVal: 4.812 ± 0.65
1.742GlyTrp: 1.742 ± 0.353
2.572GlyTyr: 2.572 ± 0.587
0.0GlyXaa: 0.0 ± 0.0
His
0.83HisAla: 0.83 ± 0.267
0.498HisCys: 0.498 ± 0.192
1.245HisAsp: 1.245 ± 0.307
1.41HisGlu: 1.41 ± 0.317
0.83HisPhe: 0.83 ± 0.298
1.576HisGly: 1.576 ± 0.324
0.332HisHis: 0.332 ± 0.167
0.664HisIle: 0.664 ± 0.229
1.079HisLys: 1.079 ± 0.304
1.493HisLeu: 1.493 ± 0.432
0.581HisMet: 0.581 ± 0.167
0.332HisAsn: 0.332 ± 0.154
1.327HisPro: 1.327 ± 0.395
0.83HisGln: 0.83 ± 0.295
1.245HisArg: 1.245 ± 0.337
0.664HisSer: 0.664 ± 0.255
0.581HisThr: 0.581 ± 0.263
0.913HisVal: 0.913 ± 0.283
0.166HisTrp: 0.166 ± 0.119
0.83HisTyr: 0.83 ± 0.231
0.0HisXaa: 0.0 ± 0.0
Ile
6.223IleAla: 6.223 ± 0.918
0.498IleCys: 0.498 ± 0.214
4.148IleAsp: 4.148 ± 0.546
4.48IleGlu: 4.48 ± 0.534
1.742IlePhe: 1.742 ± 0.35
4.646IleGly: 4.646 ± 0.674
1.245IleHis: 1.245 ± 0.344
3.402IleIle: 3.402 ± 0.535
3.07IleLys: 3.07 ± 0.525
2.821IleLeu: 2.821 ± 0.431
1.079IleMet: 1.079 ± 0.27
2.987IleAsn: 2.987 ± 0.453
3.899IlePro: 3.899 ± 0.581
1.659IleGln: 1.659 ± 0.275
4.314IleArg: 4.314 ± 0.438
4.397IleSer: 4.397 ± 0.504
4.231IleThr: 4.231 ± 0.593
2.323IleVal: 2.323 ± 0.404
0.498IleTrp: 0.498 ± 0.192
1.576IleTyr: 1.576 ± 0.383
0.0IleXaa: 0.0 ± 0.0
Lys
5.559LysAla: 5.559 ± 0.841
0.83LysCys: 0.83 ± 0.237
3.236LysAsp: 3.236 ± 0.518
5.144LysGlu: 5.144 ± 0.808
1.659LysPhe: 1.659 ± 0.434
3.816LysGly: 3.816 ± 0.459
0.83LysHis: 0.83 ± 0.267
3.153LysIle: 3.153 ± 0.394
4.148LysLys: 4.148 ± 0.673
5.476LysLeu: 5.476 ± 0.785
1.576LysMet: 1.576 ± 0.323
2.489LysAsn: 2.489 ± 0.442
3.07LysPro: 3.07 ± 0.485
3.816LysGln: 3.816 ± 0.584
4.397LysArg: 4.397 ± 0.567
3.899LysSer: 3.899 ± 0.46
4.065LysThr: 4.065 ± 0.691
3.402LysVal: 3.402 ± 0.546
0.498LysTrp: 0.498 ± 0.239
2.157LysTyr: 2.157 ± 0.388
0.0LysXaa: 0.0 ± 0.0
Leu
8.214LeuAla: 8.214 ± 0.801
0.913LeuCys: 0.913 ± 0.279
3.568LeuAsp: 3.568 ± 0.772
5.144LeuGlu: 5.144 ± 0.554
2.24LeuPhe: 2.24 ± 0.534
4.729LeuGly: 4.729 ± 0.721
1.327LeuHis: 1.327 ± 0.375
4.314LeuIle: 4.314 ± 0.612
5.393LeuLys: 5.393 ± 0.689
5.559LeuLeu: 5.559 ± 0.634
1.908LeuMet: 1.908 ± 0.359
4.895LeuAsn: 4.895 ± 0.673
3.319LeuPro: 3.319 ± 0.491
2.572LeuGln: 2.572 ± 0.473
5.061LeuArg: 5.061 ± 0.595
6.057LeuSer: 6.057 ± 0.693
4.729LeuThr: 4.729 ± 0.724
3.982LeuVal: 3.982 ± 0.477
0.996LeuTrp: 0.996 ± 0.271
2.738LeuTyr: 2.738 ± 0.511
0.0LeuXaa: 0.0 ± 0.0
Met
3.982MetAla: 3.982 ± 0.606
0.083MetCys: 0.083 ± 0.09
1.245MetAsp: 1.245 ± 0.334
1.825MetGlu: 1.825 ± 0.383
0.581MetPhe: 0.581 ± 0.229
2.24MetGly: 2.24 ± 0.433
0.332MetHis: 0.332 ± 0.183
1.245MetIle: 1.245 ± 0.374
1.576MetLys: 1.576 ± 0.389
2.323MetLeu: 2.323 ± 0.429
1.327MetMet: 1.327 ± 0.428
1.327MetAsn: 1.327 ± 0.352
0.996MetPro: 0.996 ± 0.266
1.079MetGln: 1.079 ± 0.318
2.323MetArg: 2.323 ± 0.447
2.489MetSer: 2.489 ± 0.482
1.825MetThr: 1.825 ± 0.377
1.576MetVal: 1.576 ± 0.365
0.166MetTrp: 0.166 ± 0.1
0.664MetTyr: 0.664 ± 0.29
0.0MetXaa: 0.0 ± 0.0
Asn
5.725AsnAla: 5.725 ± 0.874
0.249AsnCys: 0.249 ± 0.155
3.07AsnAsp: 3.07 ± 0.448
3.568AsnGlu: 3.568 ± 0.585
0.581AsnPhe: 0.581 ± 0.214
3.816AsnGly: 3.816 ± 0.639
0.996AsnHis: 0.996 ± 0.298
2.572AsnIle: 2.572 ± 0.478
3.153AsnLys: 3.153 ± 0.488
2.572AsnLeu: 2.572 ± 0.447
1.742AsnMet: 1.742 ± 0.44
1.576AsnAsn: 1.576 ± 0.381
1.908AsnPro: 1.908 ± 0.428
2.904AsnGln: 2.904 ± 0.576
2.655AsnArg: 2.655 ± 0.459
2.489AsnSer: 2.489 ± 0.412
2.572AsnThr: 2.572 ± 0.503
2.904AsnVal: 2.904 ± 0.846
0.83AsnTrp: 0.83 ± 0.277
1.742AsnTyr: 1.742 ± 0.347
0.0AsnXaa: 0.0 ± 0.0
Pro
3.734ProAla: 3.734 ± 0.404
0.166ProCys: 0.166 ± 0.122
2.821ProAsp: 2.821 ± 0.491
4.231ProGlu: 4.231 ± 0.603
1.41ProPhe: 1.41 ± 0.356
1.825ProGly: 1.825 ± 0.363
0.747ProHis: 0.747 ± 0.277
1.659ProIle: 1.659 ± 0.322
2.987ProLys: 2.987 ± 0.531
2.987ProLeu: 2.987 ± 0.487
1.162ProMet: 1.162 ± 0.351
1.41ProAsn: 1.41 ± 0.351
1.576ProPro: 1.576 ± 0.349
1.742ProGln: 1.742 ± 0.301
1.493ProArg: 1.493 ± 0.309
2.821ProSer: 2.821 ± 0.457
1.908ProThr: 1.908 ± 0.36
3.236ProVal: 3.236 ± 0.579
0.332ProTrp: 0.332 ± 0.155
1.162ProTyr: 1.162 ± 0.312
0.0ProXaa: 0.0 ± 0.0
Gln
3.402GlnAla: 3.402 ± 0.724
0.415GlnCys: 0.415 ± 0.209
1.991GlnAsp: 1.991 ± 0.441
2.074GlnGlu: 2.074 ± 0.468
1.576GlnPhe: 1.576 ± 0.311
3.07GlnGly: 3.07 ± 0.448
0.581GlnHis: 0.581 ± 0.229
3.07GlnIle: 3.07 ± 0.519
2.572GlnLys: 2.572 ± 0.402
4.646GlnLeu: 4.646 ± 0.533
1.327GlnMet: 1.327 ± 0.321
2.572GlnAsn: 2.572 ± 0.609
2.074GlnPro: 2.074 ± 0.376
3.734GlnGln: 3.734 ± 0.791
2.655GlnArg: 2.655 ± 0.51
2.572GlnSer: 2.572 ± 0.517
1.991GlnThr: 1.991 ± 0.332
2.074GlnVal: 2.074 ± 0.499
1.327GlnTrp: 1.327 ± 0.351
1.493GlnTyr: 1.493 ± 0.431
0.0GlnXaa: 0.0 ± 0.0
Arg
4.646ArgAla: 4.646 ± 0.67
1.245ArgCys: 1.245 ± 0.374
3.651ArgAsp: 3.651 ± 0.631
4.563ArgGlu: 4.563 ± 0.546
2.323ArgPhe: 2.323 ± 0.482
3.568ArgGly: 3.568 ± 0.487
1.576ArgHis: 1.576 ± 0.318
3.982ArgIle: 3.982 ± 0.599
4.729ArgLys: 4.729 ± 0.708
5.476ArgLeu: 5.476 ± 0.612
2.157ArgMet: 2.157 ± 0.453
3.568ArgAsn: 3.568 ± 0.564
1.41ArgPro: 1.41 ± 0.293
2.323ArgGln: 2.323 ± 0.477
3.899ArgArg: 3.899 ± 0.731
3.734ArgSer: 3.734 ± 0.54
1.991ArgThr: 1.991 ± 0.397
2.738ArgVal: 2.738 ± 0.464
1.245ArgTrp: 1.245 ± 0.383
2.572ArgTyr: 2.572 ± 0.513
0.0ArgXaa: 0.0 ± 0.0
Ser
5.061SerAla: 5.061 ± 0.737
0.747SerCys: 0.747 ± 0.266
4.563SerAsp: 4.563 ± 0.674
3.651SerGlu: 3.651 ± 0.439
2.489SerPhe: 2.489 ± 0.46
5.891SerGly: 5.891 ± 0.653
1.079SerHis: 1.079 ± 0.341
3.568SerIle: 3.568 ± 0.613
3.402SerLys: 3.402 ± 0.483
5.144SerLeu: 5.144 ± 0.561
1.908SerMet: 1.908 ± 0.298
2.655SerAsn: 2.655 ± 0.405
3.236SerPro: 3.236 ± 0.604
3.319SerGln: 3.319 ± 0.504
3.485SerArg: 3.485 ± 0.583
3.402SerSer: 3.402 ± 0.599
2.074SerThr: 2.074 ± 0.48
3.153SerVal: 3.153 ± 0.449
0.83SerTrp: 0.83 ± 0.242
2.074SerTyr: 2.074 ± 0.434
0.0SerXaa: 0.0 ± 0.0
Thr
4.812ThrAla: 4.812 ± 0.596
0.664ThrCys: 0.664 ± 0.213
2.406ThrAsp: 2.406 ± 0.411
2.572ThrGlu: 2.572 ± 0.508
2.157ThrPhe: 2.157 ± 0.526
5.061ThrGly: 5.061 ± 0.732
1.079ThrHis: 1.079 ± 0.279
3.568ThrIle: 3.568 ± 0.496
3.982ThrLys: 3.982 ± 0.698
3.319ThrLeu: 3.319 ± 0.439
0.581ThrMet: 0.581 ± 0.225
1.825ThrAsn: 1.825 ± 0.45
3.07ThrPro: 3.07 ± 0.567
2.738ThrGln: 2.738 ± 0.536
2.655ThrArg: 2.655 ± 0.356
2.655ThrSer: 2.655 ± 0.486
2.157ThrThr: 2.157 ± 0.379
3.734ThrVal: 3.734 ± 0.594
0.664ThrTrp: 0.664 ± 0.241
1.825ThrTyr: 1.825 ± 0.405
0.0ThrXaa: 0.0 ± 0.0
Val
5.476ValAla: 5.476 ± 0.598
0.747ValCys: 0.747 ± 0.288
2.904ValAsp: 2.904 ± 0.488
4.065ValGlu: 4.065 ± 0.506
1.576ValPhe: 1.576 ± 0.402
3.982ValGly: 3.982 ± 0.572
0.581ValHis: 0.581 ± 0.24
3.485ValIle: 3.485 ± 0.64
3.899ValLys: 3.899 ± 0.509
5.227ValLeu: 5.227 ± 0.579
1.245ValMet: 1.245 ± 0.298
3.899ValAsn: 3.899 ± 0.613
1.576ValPro: 1.576 ± 0.404
1.659ValGln: 1.659 ± 0.318
2.323ValArg: 2.323 ± 0.545
4.065ValSer: 4.065 ± 0.61
3.982ValThr: 3.982 ± 0.493
3.651ValVal: 3.651 ± 0.491
1.245ValTrp: 1.245 ± 0.397
2.24ValTyr: 2.24 ± 0.55
0.0ValXaa: 0.0 ± 0.0
Trp
1.245TrpAla: 1.245 ± 0.284
0.581TrpCys: 0.581 ± 0.237
1.41TrpAsp: 1.41 ± 0.342
0.415TrpGlu: 0.415 ± 0.188
0.664TrpPhe: 0.664 ± 0.21
0.996TrpGly: 0.996 ± 0.246
0.664TrpHis: 0.664 ± 0.259
0.913TrpIle: 0.913 ± 0.23
1.245TrpLys: 1.245 ± 0.369
1.825TrpLeu: 1.825 ± 0.351
0.747TrpMet: 0.747 ± 0.273
0.415TrpAsn: 0.415 ± 0.194
0.913TrpPro: 0.913 ± 0.212
0.747TrpGln: 0.747 ± 0.245
1.245TrpArg: 1.245 ± 0.349
0.913TrpSer: 0.913 ± 0.277
0.581TrpThr: 0.581 ± 0.261
1.162TrpVal: 1.162 ± 0.28
0.249TrpTrp: 0.249 ± 0.147
0.249TrpTyr: 0.249 ± 0.154
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.485TyrAla: 3.485 ± 0.658
0.332TyrCys: 0.332 ± 0.187
2.489TyrAsp: 2.489 ± 0.521
2.074TyrGlu: 2.074 ± 0.529
1.162TyrPhe: 1.162 ± 0.311
2.738TyrGly: 2.738 ± 0.606
0.913TyrHis: 0.913 ± 0.301
1.742TyrIle: 1.742 ± 0.37
1.659TyrLys: 1.659 ± 0.357
2.572TyrLeu: 2.572 ± 0.369
0.332TyrMet: 0.332 ± 0.131
1.825TyrAsn: 1.825 ± 0.491
1.493TyrPro: 1.493 ± 0.468
1.493TyrGln: 1.493 ± 0.333
2.904TyrArg: 2.904 ± 0.538
2.157TyrSer: 2.157 ± 0.468
1.576TyrThr: 1.576 ± 0.311
1.493TyrVal: 1.493 ± 0.323
0.581TyrTrp: 0.581 ± 0.197
0.996TyrTyr: 0.996 ± 0.276
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 70 proteins (12054 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski