Amino acid dipeptide frequency for Salmonella phage vB_SalP_PM43

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as a 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
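The arithmetic behind the expected frequency can be sketched in a few lines of Python. This is only an illustration: the function name `classify` is hypothetical, and comparing against the uniform 2.5‰ value is an assumption about how "more than expected" is decided, since the page does not state the exact threshold used for the colouring.

```python
# Uniform expectation for one dipeptide, expressed in per mille.
EXPECTED_20 = 1000 / (20 * 20)  # 2.5 per mille for the 20 standard amino acids
EXPECTED_21 = 1000 / (21 * 21)  # about 2.27 per mille if Xaa is counted as well


def classify(freq_per_mille, expected=EXPECTED_20):
    """Label a frequency relative to the uniform expectation (assumed threshold)."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"


# Three values copied from the table on this page (per mille).
examples = {"AlaAla": 8.11, "CysTrp": 0.081, "PheAla": 2.433}
labels = {dipeptide: classify(value) for dipeptide, value in examples.items()}
```

Under this assumption AlaAla would be shown as overrepresented, while CysTrp and PheAla would be shown as underrepresented.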

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
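As a rough illustration of how such per mille values can be obtained, the sketch below counts overlapping dipeptides within each protein and scales the counts to per mille. The function name `dipeptide_per_mille` and the toy sequences are hypothetical. Dividing by the total number of residues is an assumption: it is not stated on the page, but it appears consistent with the table (for example, 1000/12331 ≈ 0.081, the smallest non-zero value listed).

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return dipeptide frequencies in per mille for a list of protein sequences."""
    counts = Counter()
    n_residues = 0
    for seq in sequences:
        n_residues += len(seq)
        # Pair every residue with its successor; pairs never span two proteins.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    if n_residues == 0:
        return {}
    # Assumed normalisation: total residue count, scaled to per mille.
    return {dp: 1000 * n / n_residues for dp, n in counts.items()}


# Hypothetical toy proteome in one-letter code (not taken from this phage).
toy = ["MKAAL", "AAG"]
freqs = dipeptide_per_mille(toy)

# Converting a tabulated per mille value back to a plain fraction.
ala_ala_fraction = 8.11 * 1e-3
```

In the toy input the pair AA occurs twice among 8 residues, so `freqs["AA"]` is 250‰; real proteomes give much smaller values, as in the table.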

For more information, see the sequence space article on Wikipedia.

Residue order (first residue heads each block; second residue runs in the same order within it): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 8.11 ± 1.255
AlaCys: 1.298 ± 0.373
AlaAsp: 6.326 ± 0.868
AlaGlu: 6.813 ± 0.864
AlaPhe: 2.595 ± 0.4
AlaGly: 6.569 ± 1.005
AlaHis: 1.298 ± 0.287
AlaIle: 5.353 ± 0.648
AlaLys: 5.353 ± 0.591
AlaLeu: 5.515 ± 0.791
AlaMet: 3.812 ± 0.628
AlaAsn: 6.488 ± 0.875
AlaPro: 2.758 ± 0.446
AlaGln: 3.731 ± 0.729
AlaArg: 5.109 ± 0.732
AlaSer: 4.542 ± 0.738
AlaThr: 5.596 ± 0.785
AlaVal: 5.109 ± 0.591
AlaTrp: 1.865 ± 0.441
AlaTyr: 2.271 ± 0.472
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.892 ± 0.317
CysCys: 0.487 ± 0.189
CysAsp: 0.324 ± 0.154
CysGlu: 0.487 ± 0.181
CysPhe: 0.406 ± 0.201
CysGly: 1.46 ± 0.411
CysHis: 0.324 ± 0.159
CysIle: 0.487 ± 0.238
CysLys: 0.73 ± 0.299
CysLeu: 0.406 ± 0.177
CysMet: 0.162 ± 0.115
CysAsn: 0.73 ± 0.265
CysPro: 0.487 ± 0.213
CysGln: 0.73 ± 0.242
CysArg: 1.46 ± 0.368
CysSer: 0.649 ± 0.232
CysThr: 0.487 ± 0.225
CysVal: 0.973 ± 0.237
CysTrp: 0.081 ± 0.076
CysTyr: 0.324 ± 0.17
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.65 ± 0.836
AspCys: 0.73 ± 0.256
AspAsp: 5.191 ± 0.868
AspGlu: 3.569 ± 0.549
AspPhe: 2.352 ± 0.394
AspGly: 5.353 ± 0.841
AspHis: 0.73 ± 0.241
AspIle: 3.65 ± 0.43
AspLys: 3.893 ± 0.549
AspLeu: 4.785 ± 0.739
AspMet: 1.865 ± 0.461
AspAsn: 2.271 ± 0.387
AspPro: 2.028 ± 0.385
AspGln: 1.784 ± 0.326
AspArg: 1.865 ± 0.435
AspSer: 2.839 ± 0.534
AspThr: 2.19 ± 0.352
AspVal: 4.542 ± 0.598
AspTrp: 1.217 ± 0.327
AspTyr: 2.676 ± 0.493
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.839 ± 0.696
GluCys: 1.135 ± 0.281
GluAsp: 2.92 ± 0.506
GluGlu: 4.055 ± 0.77
GluPhe: 1.784 ± 0.419
GluGly: 3.731 ± 0.514
GluHis: 1.298 ± 0.336
GluIle: 3.812 ± 0.578
GluLys: 4.298 ± 0.638
GluLeu: 6.326 ± 0.895
GluMet: 2.271 ± 0.434
GluAsn: 2.676 ± 0.532
GluPro: 2.352 ± 0.413
GluGln: 3.731 ± 0.572
GluArg: 4.298 ± 0.662
GluSer: 4.217 ± 0.545
GluThr: 3.812 ± 0.467
GluVal: 3.812 ± 0.619
GluTrp: 1.865 ± 0.392
GluTyr: 1.703 ± 0.456
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.433 ± 0.367
PheCys: 0.406 ± 0.172
PheAsp: 1.865 ± 0.393
PheGlu: 2.109 ± 0.369
PhePhe: 1.217 ± 0.425
PheGly: 2.19 ± 0.297
PheHis: 0.568 ± 0.252
PheIle: 2.271 ± 0.544
PheLys: 2.19 ± 0.463
PheLeu: 2.271 ± 0.404
PheMet: 1.054 ± 0.216
PheAsn: 2.028 ± 0.414
PhePro: 1.135 ± 0.253
PheGln: 1.217 ± 0.439
PheArg: 1.379 ± 0.378
PheSer: 2.758 ± 0.475
PheThr: 2.19 ± 0.397
PheVal: 1.46 ± 0.314
PheTrp: 1.054 ± 0.24
PheTyr: 2.028 ± 0.368
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.083 ± 0.888
GlyCys: 0.811 ± 0.289
GlyAsp: 3.569 ± 0.536
GlyGlu: 4.785 ± 0.607
GlyPhe: 3.082 ± 0.362
GlyGly: 5.109 ± 0.736
GlyHis: 0.73 ± 0.268
GlyIle: 4.785 ± 0.666
GlyLys: 4.704 ± 0.599
GlyLeu: 5.353 ± 0.776
GlyMet: 3.082 ± 0.586
GlyAsn: 4.136 ± 0.521
GlyPro: 0.892 ± 0.272
GlyGln: 4.298 ± 0.838
GlyArg: 4.785 ± 0.703
GlySer: 4.217 ± 0.797
GlyThr: 3.082 ± 0.589
GlyVal: 5.272 ± 0.653
GlyTrp: 1.622 ± 0.453
GlyTyr: 1.946 ± 0.35
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.973 ± 0.222
HisCys: 0.243 ± 0.135
HisAsp: 1.217 ± 0.34
HisGlu: 1.865 ± 0.383
HisPhe: 0.487 ± 0.226
HisGly: 1.541 ± 0.345
HisHis: 0.487 ± 0.202
HisIle: 0.649 ± 0.218
HisLys: 0.892 ± 0.262
HisLeu: 2.271 ± 0.45
HisMet: 0.487 ± 0.209
HisAsn: 0.487 ± 0.204
HisPro: 0.892 ± 0.254
HisGln: 0.487 ± 0.19
HisArg: 1.054 ± 0.29
HisSer: 0.892 ± 0.265
HisThr: 0.811 ± 0.265
HisVal: 0.973 ± 0.29
HisTrp: 0.081 ± 0.077
HisTyr: 0.649 ± 0.205
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.839 ± 0.649
IleCys: 0.487 ± 0.191
IleAsp: 4.38 ± 0.554
IleGlu: 5.109 ± 0.678
IlePhe: 2.19 ± 0.361
IleGly: 4.704 ± 0.67
IleHis: 1.379 ± 0.283
IleIle: 3.731 ± 0.704
IleLys: 2.758 ± 0.548
IleLeu: 3.893 ± 0.693
IleMet: 0.892 ± 0.271
IleAsn: 3.001 ± 0.521
IlePro: 3.325 ± 0.561
IleGln: 2.028 ± 0.354
IleArg: 3.325 ± 0.352
IleSer: 3.812 ± 0.787
IleThr: 4.217 ± 0.708
IleVal: 2.92 ± 0.565
IleTrp: 0.324 ± 0.157
IleTyr: 1.865 ± 0.411
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.272 ± 0.797
LysCys: 0.568 ± 0.203
LysAsp: 2.676 ± 0.361
LysGlu: 5.028 ± 0.706
LysPhe: 1.784 ± 0.448
LysGly: 4.542 ± 0.656
LysHis: 0.892 ± 0.259
LysIle: 3.082 ± 0.44
LysLys: 3.893 ± 0.601
LysLeu: 5.191 ± 0.448
LysMet: 1.541 ± 0.46
LysAsn: 2.514 ± 0.445
LysPro: 3.569 ± 0.596
LysGln: 3.244 ± 0.543
LysArg: 4.947 ± 0.654
LysSer: 3.974 ± 0.586
LysThr: 3.569 ± 0.472
LysVal: 3.406 ± 0.741
LysTrp: 0.649 ± 0.219
LysTyr: 2.433 ± 0.421
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.624 ± 0.831
LeuCys: 0.649 ± 0.204
LeuAsp: 3.974 ± 0.56
LeuGlu: 5.191 ± 0.545
LeuPhe: 2.433 ± 0.545
LeuGly: 4.055 ± 0.715
LeuHis: 1.054 ± 0.27
LeuIle: 5.028 ± 0.901
LeuLys: 5.596 ± 0.585
LeuLeu: 5.677 ± 0.641
LeuMet: 2.433 ± 0.336
LeuAsn: 4.217 ± 0.656
LeuPro: 3.487 ± 0.544
LeuGln: 2.271 ± 0.62
LeuArg: 4.947 ± 0.707
LeuSer: 6.002 ± 0.625
LeuThr: 4.947 ± 0.647
LeuVal: 4.055 ± 0.56
LeuTrp: 1.135 ± 0.282
LeuTyr: 2.839 ± 0.45
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.001 ± 0.463
MetCys: 0.487 ± 0.258
MetAsp: 1.217 ± 0.307
MetGlu: 1.298 ± 0.302
MetPhe: 0.811 ± 0.27
MetGly: 1.703 ± 0.411
MetHis: 0.324 ± 0.164
MetIle: 1.622 ± 0.438
MetLys: 2.433 ± 0.431
MetLeu: 2.352 ± 0.452
MetMet: 0.73 ± 0.287
MetAsn: 0.892 ± 0.224
MetPro: 1.054 ± 0.241
MetGln: 2.109 ± 0.532
MetArg: 2.19 ± 0.51
MetSer: 2.433 ± 0.469
MetThr: 1.865 ± 0.481
MetVal: 1.703 ± 0.3
MetTrp: 0.243 ± 0.149
MetTyr: 1.054 ± 0.281
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.191 ± 0.761
AsnCys: 0.649 ± 0.261
AsnAsp: 3.001 ± 0.408
AsnGlu: 2.839 ± 0.444
AsnPhe: 0.811 ± 0.26
AsnGly: 4.298 ± 0.539
AsnHis: 1.135 ± 0.318
AsnIle: 3.163 ± 0.539
AsnLys: 3.244 ± 0.475
AsnLeu: 3.244 ± 0.523
AsnMet: 1.217 ± 0.303
AsnAsn: 2.352 ± 0.437
AsnPro: 2.433 ± 0.397
AsnGln: 2.839 ± 0.498
AsnArg: 2.433 ± 0.577
AsnSer: 2.028 ± 0.397
AsnThr: 3.163 ± 0.43
AsnVal: 2.676 ± 0.69
AsnTrp: 0.649 ± 0.248
AsnTyr: 2.028 ± 0.39
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.487 ± 0.464
ProCys: 0.162 ± 0.106
ProAsp: 3.406 ± 0.477
ProGlu: 4.461 ± 0.677
ProPhe: 1.541 ± 0.372
ProGly: 2.028 ± 0.422
ProHis: 0.973 ± 0.267
ProIle: 2.595 ± 0.448
ProLys: 3.001 ± 0.554
ProLeu: 2.514 ± 0.479
ProMet: 0.73 ± 0.257
ProAsn: 1.541 ± 0.338
ProPro: 1.865 ± 0.405
ProGln: 1.379 ± 0.318
ProArg: 1.784 ± 0.342
ProSer: 2.839 ± 0.448
ProThr: 1.784 ± 0.351
ProVal: 2.758 ± 0.501
ProTrp: 0.487 ± 0.177
ProTyr: 0.973 ± 0.318
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.298 ± 0.88
GlnCys: 0.324 ± 0.168
GlnAsp: 2.514 ± 0.489
GlnGlu: 1.46 ± 0.384
GlnPhe: 1.46 ± 0.301
GlnGly: 3.163 ± 0.665
GlnHis: 0.568 ± 0.183
GlnIle: 3.406 ± 0.45
GlnLys: 2.595 ± 0.515
GlnLeu: 4.704 ± 0.609
GlnMet: 1.541 ± 0.36
GlnAsn: 2.433 ± 0.751
GlnPro: 2.433 ± 0.543
GlnGln: 2.92 ± 0.814
GlnArg: 3.001 ± 0.481
GlnSer: 2.839 ± 0.575
GlnThr: 1.541 ± 0.405
GlnVal: 1.946 ± 0.477
GlnTrp: 1.054 ± 0.288
GlnTyr: 1.865 ± 0.408
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.353 ± 0.594
ArgCys: 0.73 ± 0.21
ArgAsp: 3.244 ± 0.43
ArgGlu: 4.623 ± 0.749
ArgPhe: 1.784 ± 0.357
ArgGly: 4.055 ± 0.697
ArgHis: 1.541 ± 0.358
ArgIle: 4.055 ± 0.545
ArgLys: 3.974 ± 0.683
ArgLeu: 5.353 ± 0.621
ArgMet: 2.595 ± 0.449
ArgAsn: 3.163 ± 0.447
ArgPro: 1.622 ± 0.37
ArgGln: 2.352 ± 0.498
ArgArg: 3.731 ± 0.664
ArgSer: 3.163 ± 0.481
ArgThr: 2.433 ± 0.406
ArgVal: 3.082 ± 0.477
ArgTrp: 0.811 ± 0.25
ArgTyr: 2.433 ± 0.424
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.704 ± 0.89
SerCys: 0.487 ± 0.172
SerAsp: 3.406 ± 0.473
SerGlu: 3.406 ± 0.524
SerPhe: 3.001 ± 0.497
SerGly: 5.434 ± 0.714
SerHis: 1.217 ± 0.358
SerIle: 3.325 ± 0.575
SerLys: 3.325 ± 0.493
SerLeu: 5.353 ± 0.686
SerMet: 1.622 ± 0.272
SerAsn: 2.676 ± 0.433
SerPro: 2.758 ± 0.473
SerGln: 3.001 ± 0.518
SerArg: 4.055 ± 0.496
SerSer: 3.001 ± 0.625
SerThr: 3.244 ± 0.588
SerVal: 3.082 ± 0.561
SerTrp: 1.135 ± 0.326
SerTyr: 1.784 ± 0.404
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.028 ± 0.583
ThrCys: 0.73 ± 0.2
ThrAsp: 3.163 ± 0.524
ThrGlu: 2.352 ± 0.406
ThrPhe: 1.865 ± 0.408
ThrGly: 5.191 ± 0.636
ThrHis: 0.649 ± 0.201
ThrIle: 3.406 ± 0.559
ThrLys: 3.65 ± 0.562
ThrLeu: 3.325 ± 0.597
ThrMet: 1.217 ± 0.281
ThrAsn: 2.595 ± 0.527
ThrPro: 3.244 ± 0.541
ThrGln: 2.92 ± 0.418
ThrArg: 2.271 ± 0.41
ThrSer: 2.92 ± 0.535
ThrThr: 3.001 ± 0.618
ThrVal: 3.163 ± 0.442
ThrTrp: 0.568 ± 0.254
ThrTyr: 1.946 ± 0.353
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.191 ± 0.557
ValCys: 0.649 ± 0.259
ValAsp: 3.082 ± 0.565
ValGlu: 3.974 ± 0.573
ValPhe: 2.028 ± 0.438
ValGly: 4.217 ± 0.611
ValHis: 0.649 ± 0.212
ValIle: 3.163 ± 0.481
ValLys: 3.65 ± 0.541
ValLeu: 4.461 ± 0.581
ValMet: 1.298 ± 0.324
ValAsn: 3.406 ± 0.52
ValPro: 1.784 ± 0.384
ValGln: 2.352 ± 0.358
ValArg: 3.406 ± 0.435
ValSer: 4.217 ± 0.711
ValThr: 3.325 ± 0.577
ValVal: 3.893 ± 0.438
ValTrp: 0.973 ± 0.285
ValTyr: 1.784 ± 0.453
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.973 ± 0.241
TrpCys: 0.406 ± 0.192
TrpAsp: 1.135 ± 0.307
TrpGlu: 0.811 ± 0.246
TrpPhe: 0.649 ± 0.222
TrpGly: 0.892 ± 0.195
TrpHis: 0.487 ± 0.178
TrpIle: 0.487 ± 0.193
TrpLys: 1.379 ± 0.405
TrpLeu: 2.19 ± 0.504
TrpMet: 0.406 ± 0.15
TrpAsn: 0.406 ± 0.159
TrpPro: 0.892 ± 0.244
TrpGln: 0.811 ± 0.212
TrpArg: 1.46 ± 0.315
TrpSer: 0.892 ± 0.296
TrpThr: 0.649 ± 0.201
TrpVal: 1.217 ± 0.289
TrpTrp: 0.162 ± 0.118
TrpTyr: 0.406 ± 0.177
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.569 ± 0.667
TyrCys: 0.649 ± 0.254
TyrAsp: 3.569 ± 0.584
TyrGlu: 1.703 ± 0.478
TyrPhe: 1.541 ± 0.328
TyrGly: 2.109 ± 0.502
TyrHis: 1.135 ± 0.289
TyrIle: 2.109 ± 0.542
TyrLys: 1.298 ± 0.319
TyrLeu: 2.433 ± 0.465
TyrMet: 0.487 ± 0.194
TyrAsn: 1.379 ± 0.364
TyrPro: 1.298 ± 0.444
TyrGln: 1.865 ± 0.361
TyrArg: 2.595 ± 0.466
TyrSer: 1.784 ± 0.401
TyrThr: 1.298 ± 0.279
TyrVal: 1.46 ± 0.353
TyrTrp: 0.73 ± 0.209
TyrTyr: 1.298 ± 0.435
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 68 proteins (12331 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
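One way to picture a protein-level bootstrap is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the 100 estimates is reported. This is not necessarily the database's own procedure. The function names, the toy sequences, the use of the standard deviation as the "±" value, and the normalisation by total residue count are all assumptions made for illustration.

```python
import random
import statistics


def dipeptide_freq(sequences, dipeptide):
    """Frequency of one dipeptide in per mille (assumed: divided by residue count)."""
    n_residues = sum(len(s) for s in sequences)
    hits = sum(1 for s in sequences for a, b in zip(s, s[1:]) if a + b == dipeptide)
    return 1000 * hits / n_residues if n_residues else 0.0


def bootstrap_error(sequences, dipeptide, n_boot=100, seed=0):
    """Standard deviation of the frequency over protein-level bootstrap resamples.

    Expects a non-empty list of sequences and n_boot >= 2.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_freq(sample, dipeptide))
    return statistics.stdev(estimates)


# Hypothetical toy proteome in one-letter code (not taken from this phage).
proteins = ["MKAAL", "AAG", "GGSTW", "MAAAK"]
error_aa = bootstrap_error(proteins, "AA")
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, so the error reflects variation between proteins.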

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski