Amino acid dipepetide frequency for Salmonella phage Solent

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.444AlaAla: 10.444 ± 2.019
1.102AlaCys: 1.102 ± 0.276
5.454AlaAsp: 5.454 ± 0.801
5.744AlaGlu: 5.744 ± 0.693
3.191AlaPhe: 3.191 ± 0.491
5.57AlaGly: 5.57 ± 0.603
1.625AlaHis: 1.625 ± 0.284
4.758AlaIle: 4.758 ± 0.553
4.352AlaLys: 4.352 ± 0.589
6.441AlaLeu: 6.441 ± 0.771
2.669AlaMet: 2.669 ± 0.367
4.468AlaAsn: 4.468 ± 0.657
3.365AlaPro: 3.365 ± 0.847
3.423AlaGln: 3.423 ± 1.161
4.178AlaArg: 4.178 ± 0.593
4.758AlaSer: 4.758 ± 0.623
5.338AlaThr: 5.338 ± 0.843
5.164AlaVal: 5.164 ± 0.567
1.16AlaTrp: 1.16 ± 0.279
3.656AlaTyr: 3.656 ± 0.66
0.0AlaXaa: 0.0 ± 0.0
Cys
1.102CysAla: 1.102 ± 0.284
0.522CysCys: 0.522 ± 0.261
1.219CysAsp: 1.219 ± 0.283
0.812CysGlu: 0.812 ± 0.216
0.464CysPhe: 0.464 ± 0.191
0.87CysGly: 0.87 ± 0.243
0.116CysHis: 0.116 ± 0.075
0.812CysIle: 0.812 ± 0.248
0.464CysLys: 0.464 ± 0.194
0.928CysLeu: 0.928 ± 0.19
0.348CysMet: 0.348 ± 0.13
0.522CysAsn: 0.522 ± 0.149
0.638CysPro: 0.638 ± 0.23
0.696CysGln: 0.696 ± 0.17
0.638CysArg: 0.638 ± 0.244
0.464CysSer: 0.464 ± 0.144
0.522CysThr: 0.522 ± 0.151
0.87CysVal: 0.87 ± 0.243
0.29CysTrp: 0.29 ± 0.135
0.696CysTyr: 0.696 ± 0.221
0.0CysXaa: 0.0 ± 0.0
Asp
5.744AspAla: 5.744 ± 0.832
0.754AspCys: 0.754 ± 0.201
4.236AspAsp: 4.236 ± 0.476
3.83AspGlu: 3.83 ± 0.385
3.075AspPhe: 3.075 ± 0.427
4.584AspGly: 4.584 ± 0.646
1.16AspHis: 1.16 ± 0.281
4.004AspIle: 4.004 ± 0.482
2.959AspLys: 2.959 ± 0.41
3.83AspLeu: 3.83 ± 0.469
2.089AspMet: 2.089 ± 0.292
2.263AspAsn: 2.263 ± 0.366
2.727AspPro: 2.727 ± 0.471
1.683AspGln: 1.683 ± 0.376
3.714AspArg: 3.714 ± 0.543
2.495AspSer: 2.495 ± 0.326
3.075AspThr: 3.075 ± 0.343
4.178AspVal: 4.178 ± 0.458
1.219AspTrp: 1.219 ± 0.248
2.959AspTyr: 2.959 ± 0.44
0.0AspXaa: 0.0 ± 0.0
Glu
5.106GluAla: 5.106 ± 0.599
0.928GluCys: 0.928 ± 0.279
2.495GluAsp: 2.495 ± 0.471
2.843GluGlu: 2.843 ± 0.382
1.799GluPhe: 1.799 ± 0.299
3.307GluGly: 3.307 ± 0.391
0.754GluHis: 0.754 ± 0.246
3.017GluIle: 3.017 ± 0.356
4.468GluLys: 4.468 ± 0.573
6.905GluLeu: 6.905 ± 0.719
1.741GluMet: 1.741 ± 0.414
3.017GluAsn: 3.017 ± 0.396
2.321GluPro: 2.321 ± 0.43
4.352GluGln: 4.352 ± 0.681
4.236GluArg: 4.236 ± 0.522
2.553GluSer: 2.553 ± 0.428
3.075GluThr: 3.075 ± 0.513
2.495GluVal: 2.495 ± 0.382
1.451GluTrp: 1.451 ± 0.317
2.321GluTyr: 2.321 ± 0.337
0.0GluXaa: 0.0 ± 0.0
Phe
2.437PheAla: 2.437 ± 0.35
0.58PheCys: 0.58 ± 0.162
2.089PheAsp: 2.089 ± 0.333
2.205PheGlu: 2.205 ± 0.351
1.16PhePhe: 1.16 ± 0.26
2.727PheGly: 2.727 ± 0.442
0.464PheHis: 0.464 ± 0.177
1.915PheIle: 1.915 ± 0.322
2.031PheLys: 2.031 ± 0.37
2.031PheLeu: 2.031 ± 0.416
0.87PheMet: 0.87 ± 0.219
1.799PheAsn: 1.799 ± 0.375
2.089PhePro: 2.089 ± 0.35
1.451PheGln: 1.451 ± 0.259
1.509PheArg: 1.509 ± 0.274
2.611PheSer: 2.611 ± 0.413
3.075PheThr: 3.075 ± 0.399
3.017PheVal: 3.017 ± 0.447
0.58PheTrp: 0.58 ± 0.186
1.277PheTyr: 1.277 ± 0.362
0.0PheXaa: 0.0 ± 0.0
Gly
5.338GlyAla: 5.338 ± 0.582
0.754GlyCys: 0.754 ± 0.208
4.526GlyAsp: 4.526 ± 0.432
4.584GlyGlu: 4.584 ± 0.488
3.133GlyPhe: 3.133 ± 0.556
4.932GlyGly: 4.932 ± 0.748
1.219GlyHis: 1.219 ± 0.279
3.714GlyIle: 3.714 ± 0.547
4.41GlyLys: 4.41 ± 0.601
5.396GlyLeu: 5.396 ± 0.595
1.973GlyMet: 1.973 ± 0.385
3.54GlyAsn: 3.54 ± 0.545
1.567GlyPro: 1.567 ± 0.249
2.495GlyGln: 2.495 ± 0.365
3.307GlyArg: 3.307 ± 0.505
4.526GlySer: 4.526 ± 0.489
4.12GlyThr: 4.12 ± 0.559
6.151GlyVal: 6.151 ± 0.642
1.451GlyTrp: 1.451 ± 0.313
3.017GlyTyr: 3.017 ± 0.394
0.0GlyXaa: 0.0 ± 0.0
His
1.16HisAla: 1.16 ± 0.26
0.406HisCys: 0.406 ± 0.152
1.219HisAsp: 1.219 ± 0.273
1.277HisGlu: 1.277 ± 0.268
0.58HisPhe: 0.58 ± 0.165
1.277HisGly: 1.277 ± 0.332
0.696HisHis: 0.696 ± 0.486
1.16HisIle: 1.16 ± 0.267
0.754HisLys: 0.754 ± 0.185
1.044HisLeu: 1.044 ± 0.233
0.116HisMet: 0.116 ± 0.087
1.16HisAsn: 1.16 ± 0.263
0.638HisPro: 0.638 ± 0.213
0.87HisGln: 0.87 ± 0.237
0.928HisArg: 0.928 ± 0.239
1.335HisSer: 1.335 ± 0.338
1.567HisThr: 1.567 ± 0.351
1.219HisVal: 1.219 ± 0.295
0.174HisTrp: 0.174 ± 0.102
0.754HisTyr: 0.754 ± 0.209
0.0HisXaa: 0.0 ± 0.0
Ile
5.396IleAla: 5.396 ± 0.535
0.696IleCys: 0.696 ± 0.218
4.758IleAsp: 4.758 ± 0.535
3.423IleGlu: 3.423 ± 0.512
1.335IlePhe: 1.335 ± 0.286
3.249IleGly: 3.249 ± 0.364
1.567IleHis: 1.567 ± 0.352
2.901IleIle: 2.901 ± 0.345
2.553IleLys: 2.553 ± 0.375
3.249IleLeu: 3.249 ± 0.407
1.277IleMet: 1.277 ± 0.247
3.481IleAsn: 3.481 ± 0.438
2.089IlePro: 2.089 ± 0.356
2.031IleGln: 2.031 ± 0.393
2.785IleArg: 2.785 ± 0.384
4.004IleSer: 4.004 ± 0.576
4.758IleThr: 4.758 ± 0.538
4.236IleVal: 4.236 ± 0.571
0.58IleTrp: 0.58 ± 0.185
2.611IleTyr: 2.611 ± 0.421
0.0IleXaa: 0.0 ± 0.0
Lys
4.7LysAla: 4.7 ± 0.692
1.102LysCys: 1.102 ± 0.299
2.727LysAsp: 2.727 ± 0.497
3.191LysGlu: 3.191 ± 0.402
2.031LysPhe: 2.031 ± 0.351
3.481LysGly: 3.481 ± 0.381
0.986LysHis: 0.986 ± 0.227
3.191LysIle: 3.191 ± 0.38
3.249LysLys: 3.249 ± 0.487
5.338LysLeu: 5.338 ± 0.658
1.625LysMet: 1.625 ± 0.331
2.089LysAsn: 2.089 ± 0.352
2.901LysPro: 2.901 ± 0.514
3.656LysGln: 3.656 ± 0.558
3.423LysArg: 3.423 ± 0.47
2.785LysSer: 2.785 ± 0.44
3.656LysThr: 3.656 ± 0.575
3.714LysVal: 3.714 ± 0.525
0.812LysTrp: 0.812 ± 0.205
1.915LysTyr: 1.915 ± 0.355
0.0LysXaa: 0.0 ± 0.0
Leu
6.963LeuAla: 6.963 ± 0.91
0.638LeuCys: 0.638 ± 0.166
3.83LeuAsp: 3.83 ± 0.387
4.99LeuGlu: 4.99 ± 0.532
2.205LeuPhe: 2.205 ± 0.325
3.772LeuGly: 3.772 ± 0.336
1.451LeuHis: 1.451 ± 0.334
4.874LeuIle: 4.874 ± 0.492
5.338LeuLys: 5.338 ± 0.559
6.035LeuLeu: 6.035 ± 0.574
2.263LeuMet: 2.263 ± 0.385
4.642LeuAsn: 4.642 ± 0.478
4.236LeuPro: 4.236 ± 0.475
3.307LeuGln: 3.307 ± 0.568
4.41LeuArg: 4.41 ± 0.621
5.57LeuSer: 5.57 ± 0.556
5.28LeuThr: 5.28 ± 0.584
4.874LeuVal: 4.874 ± 0.529
1.335LeuTrp: 1.335 ± 0.283
2.669LeuTyr: 2.669 ± 0.431
0.0LeuXaa: 0.0 ± 0.0
Met
2.785MetAla: 2.785 ± 0.382
0.29MetCys: 0.29 ± 0.147
1.044MetAsp: 1.044 ± 0.226
1.102MetGlu: 1.102 ± 0.251
1.16MetPhe: 1.16 ± 0.275
1.799MetGly: 1.799 ± 0.362
0.58MetHis: 0.58 ± 0.181
1.393MetIle: 1.393 ± 0.23
2.785MetLys: 2.785 ± 0.42
1.741MetLeu: 1.741 ± 0.307
0.638MetMet: 0.638 ± 0.281
1.509MetAsn: 1.509 ± 0.296
0.812MetPro: 0.812 ± 0.264
1.044MetGln: 1.044 ± 0.253
1.044MetArg: 1.044 ± 0.253
1.857MetSer: 1.857 ± 0.283
2.089MetThr: 2.089 ± 0.436
1.509MetVal: 1.509 ± 0.305
0.232MetTrp: 0.232 ± 0.109
0.58MetTyr: 0.58 ± 0.191
0.0MetXaa: 0.0 ± 0.0
Asn
4.41AsnAla: 4.41 ± 0.458
0.696AsnCys: 0.696 ± 0.201
2.843AsnAsp: 2.843 ± 0.339
2.611AsnGlu: 2.611 ± 0.371
1.393AsnPhe: 1.393 ± 0.263
4.99AsnGly: 4.99 ± 0.654
1.044AsnHis: 1.044 ± 0.244
3.481AsnIle: 3.481 ± 0.47
2.553AsnLys: 2.553 ± 0.318
3.365AsnLeu: 3.365 ± 0.621
1.451AsnMet: 1.451 ± 0.282
3.133AsnAsn: 3.133 ± 0.498
2.669AsnPro: 2.669 ± 0.388
2.727AsnGln: 2.727 ± 0.654
2.147AsnArg: 2.147 ± 0.372
3.075AsnSer: 3.075 ± 0.376
3.481AsnThr: 3.481 ± 0.558
3.307AsnVal: 3.307 ± 0.484
0.696AsnTrp: 0.696 ± 0.201
1.915AsnTyr: 1.915 ± 0.41
0.0AsnXaa: 0.0 ± 0.0
Pro
4.004ProAla: 4.004 ± 0.719
0.464ProCys: 0.464 ± 0.148
3.481ProAsp: 3.481 ± 0.458
3.307ProGlu: 3.307 ± 0.541
1.567ProPhe: 1.567 ± 0.288
3.772ProGly: 3.772 ± 0.465
0.638ProHis: 0.638 ± 0.262
1.567ProIle: 1.567 ± 0.284
1.625ProLys: 1.625 ± 0.307
2.669ProLeu: 2.669 ± 0.423
0.696ProMet: 0.696 ± 0.251
1.973ProAsn: 1.973 ± 0.432
2.147ProPro: 2.147 ± 0.38
0.928ProGln: 0.928 ± 0.222
2.553ProArg: 2.553 ± 0.419
2.089ProSer: 2.089 ± 0.334
2.669ProThr: 2.669 ± 0.464
5.106ProVal: 5.106 ± 0.759
0.928ProTrp: 0.928 ± 0.25
1.915ProTyr: 1.915 ± 0.34
0.0ProXaa: 0.0 ± 0.0
Gln
5.106GlnAla: 5.106 ± 1.164
0.696GlnCys: 0.696 ± 0.2
1.799GlnAsp: 1.799 ± 0.301
2.321GlnGlu: 2.321 ± 0.337
2.089GlnPhe: 2.089 ± 0.355
2.669GlnGly: 2.669 ± 0.392
0.812GlnHis: 0.812 ± 0.231
2.611GlnIle: 2.611 ± 0.632
2.321GlnLys: 2.321 ± 0.564
4.294GlnLeu: 4.294 ± 0.783
1.451GlnMet: 1.451 ± 0.34
2.031GlnAsn: 2.031 ± 0.404
1.857GlnPro: 1.857 ± 0.378
2.727GlnGln: 2.727 ± 1.018
2.437GlnArg: 2.437 ± 0.377
2.263GlnSer: 2.263 ± 0.739
2.205GlnThr: 2.205 ± 0.339
2.669GlnVal: 2.669 ± 0.554
0.812GlnTrp: 0.812 ± 0.203
0.928GlnTyr: 0.928 ± 0.197
0.0GlnXaa: 0.0 ± 0.0
Arg
3.946ArgAla: 3.946 ± 0.381
0.58ArgCys: 0.58 ± 0.164
3.54ArgAsp: 3.54 ± 0.477
2.495ArgGlu: 2.495 ± 0.389
2.321ArgPhe: 2.321 ± 0.377
3.075ArgGly: 3.075 ± 0.402
0.754ArgHis: 0.754 ± 0.203
3.191ArgIle: 3.191 ± 0.515
3.133ArgLys: 3.133 ± 0.447
4.932ArgLeu: 4.932 ± 0.689
1.567ArgMet: 1.567 ± 0.339
2.843ArgAsn: 2.843 ± 0.319
2.437ArgPro: 2.437 ± 0.422
2.205ArgGln: 2.205 ± 0.373
2.495ArgArg: 2.495 ± 0.455
3.133ArgSer: 3.133 ± 0.415
2.321ArgThr: 2.321 ± 0.504
3.772ArgVal: 3.772 ± 0.552
0.812ArgTrp: 0.812 ± 0.217
2.089ArgTyr: 2.089 ± 0.367
0.0ArgXaa: 0.0 ± 0.0
Ser
5.28SerAla: 5.28 ± 0.645
0.348SerCys: 0.348 ± 0.152
3.481SerAsp: 3.481 ± 0.403
2.611SerGlu: 2.611 ± 0.32
1.741SerPhe: 1.741 ± 0.305
5.28SerGly: 5.28 ± 0.709
1.219SerHis: 1.219 ± 0.261
4.004SerIle: 4.004 ± 0.489
2.553SerLys: 2.553 ± 0.352
4.468SerLeu: 4.468 ± 0.518
0.986SerMet: 0.986 ± 0.231
3.017SerAsn: 3.017 ± 0.42
2.437SerPro: 2.437 ± 0.338
3.191SerGln: 3.191 ± 0.495
2.089SerArg: 2.089 ± 0.329
3.772SerSer: 3.772 ± 0.533
3.54SerThr: 3.54 ± 0.42
5.048SerVal: 5.048 ± 0.562
0.696SerTrp: 0.696 ± 0.239
1.799SerTyr: 1.799 ± 0.294
0.0SerXaa: 0.0 ± 0.0
Thr
4.178ThrAla: 4.178 ± 0.644
0.696ThrCys: 0.696 ± 0.183
4.178ThrAsp: 4.178 ± 0.534
3.83ThrGlu: 3.83 ± 0.397
1.973ThrPhe: 1.973 ± 0.335
6.383ThrGly: 6.383 ± 0.617
0.812ThrHis: 0.812 ± 0.244
3.075ThrIle: 3.075 ± 0.453
2.901ThrLys: 2.901 ± 0.393
5.919ThrLeu: 5.919 ± 0.667
1.16ThrMet: 1.16 ± 0.318
3.133ThrAsn: 3.133 ± 0.463
3.946ThrPro: 3.946 ± 0.455
2.263ThrGln: 2.263 ± 0.449
3.133ThrArg: 3.133 ± 0.404
3.191ThrSer: 3.191 ± 0.42
4.236ThrThr: 4.236 ± 0.505
5.222ThrVal: 5.222 ± 0.761
0.754ThrTrp: 0.754 ± 0.23
2.147ThrTyr: 2.147 ± 0.311
0.0ThrXaa: 0.0 ± 0.0
Val
5.164ValAla: 5.164 ± 0.552
0.638ValCys: 0.638 ± 0.183
4.584ValAsp: 4.584 ± 0.478
4.526ValGlu: 4.526 ± 0.588
2.785ValPhe: 2.785 ± 0.373
4.758ValGly: 4.758 ± 0.49
1.219ValHis: 1.219 ± 0.318
4.178ValIle: 4.178 ± 0.574
4.526ValLys: 4.526 ± 0.463
6.557ValLeu: 6.557 ± 0.677
1.857ValMet: 1.857 ± 0.291
4.294ValAsn: 4.294 ± 0.486
2.611ValPro: 2.611 ± 0.408
2.959ValGln: 2.959 ± 0.465
3.017ValArg: 3.017 ± 0.361
3.714ValSer: 3.714 ± 0.542
5.338ValThr: 5.338 ± 0.65
4.7ValVal: 4.7 ± 0.527
0.928ValTrp: 0.928 ± 0.184
2.785ValTyr: 2.785 ± 0.388
0.0ValXaa: 0.0 ± 0.0
Trp
0.754TrpAla: 0.754 ± 0.235
0.232TrpCys: 0.232 ± 0.125
1.044TrpAsp: 1.044 ± 0.253
1.277TrpGlu: 1.277 ± 0.295
0.754TrpPhe: 0.754 ± 0.276
1.102TrpGly: 1.102 ± 0.25
0.464TrpHis: 0.464 ± 0.149
0.754TrpIle: 0.754 ± 0.253
1.335TrpLys: 1.335 ± 0.338
1.625TrpLeu: 1.625 ± 0.272
0.29TrpMet: 0.29 ± 0.123
1.044TrpAsn: 1.044 ± 0.252
0.464TrpPro: 0.464 ± 0.174
0.406TrpGln: 0.406 ± 0.175
0.754TrpArg: 0.754 ± 0.271
1.219TrpSer: 1.219 ± 0.26
0.522TrpThr: 0.522 ± 0.185
1.044TrpVal: 1.044 ± 0.285
0.406TrpTrp: 0.406 ± 0.171
0.638TrpTyr: 0.638 ± 0.233
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.843TyrAla: 2.843 ± 0.789
0.87TyrCys: 0.87 ± 0.211
2.031TyrAsp: 2.031 ± 0.327
2.205TyrGlu: 2.205 ± 0.374
1.16TyrPhe: 1.16 ± 0.264
2.553TyrGly: 2.553 ± 0.418
0.754TyrHis: 0.754 ± 0.27
2.379TyrIle: 2.379 ± 0.435
2.263TyrLys: 2.263 ± 0.398
1.915TyrLeu: 1.915 ± 0.32
0.87TyrMet: 0.87 ± 0.247
2.031TyrAsn: 2.031 ± 0.361
2.147TyrPro: 2.147 ± 0.405
1.683TyrGln: 1.683 ± 0.404
2.843TyrArg: 2.843 ± 0.462
2.205TyrSer: 2.205 ± 0.391
2.263TyrThr: 2.263 ± 0.367
2.843TyrVal: 2.843 ± 0.514
0.812TyrTrp: 0.812 ± 0.179
1.277TyrTyr: 1.277 ± 0.272
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 92 proteins (17235 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski