Amino acid dipepetide frequency for Salmonella phage SETP13

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.45AlaAla: 11.45 ± 1.404
1.374AlaCys: 1.374 ± 0.299
6.641AlaAsp: 6.641 ± 0.684
6.565AlaGlu: 6.565 ± 0.782
4.122AlaPhe: 4.122 ± 0.508
7.939AlaGly: 7.939 ± 0.713
2.061AlaHis: 2.061 ± 0.395
3.817AlaIle: 3.817 ± 0.613
5.954AlaLys: 5.954 ± 0.901
7.939AlaLeu: 7.939 ± 0.783
2.748AlaMet: 2.748 ± 0.563
3.359AlaAsn: 3.359 ± 0.413
3.282AlaPro: 3.282 ± 0.502
3.511AlaGln: 3.511 ± 0.652
4.427AlaArg: 4.427 ± 0.554
5.878AlaSer: 5.878 ± 0.854
5.802AlaThr: 5.802 ± 0.722
7.328AlaVal: 7.328 ± 0.777
1.145AlaTrp: 1.145 ± 0.269
3.664AlaTyr: 3.664 ± 0.504
0.0AlaXaa: 0.0 ± 0.0
Cys
0.84CysAla: 0.84 ± 0.186
0.229CysCys: 0.229 ± 0.135
0.84CysAsp: 0.84 ± 0.228
0.84CysGlu: 0.84 ± 0.287
0.229CysPhe: 0.229 ± 0.125
0.611CysGly: 0.611 ± 0.217
0.229CysHis: 0.229 ± 0.134
0.305CysIle: 0.305 ± 0.145
0.84CysLys: 0.84 ± 0.218
0.534CysLeu: 0.534 ± 0.24
0.153CysMet: 0.153 ± 0.104
0.763CysAsn: 0.763 ± 0.255
0.229CysPro: 0.229 ± 0.11
0.305CysGln: 0.305 ± 0.161
0.763CysArg: 0.763 ± 0.227
0.382CysSer: 0.382 ± 0.195
0.458CysThr: 0.458 ± 0.15
0.84CysVal: 0.84 ± 0.244
0.229CysTrp: 0.229 ± 0.117
0.382CysTyr: 0.382 ± 0.179
0.0CysXaa: 0.0 ± 0.0
Asp
7.863AspAla: 7.863 ± 0.857
0.534AspCys: 0.534 ± 0.205
3.969AspAsp: 3.969 ± 0.489
4.046AspGlu: 4.046 ± 0.507
2.977AspPhe: 2.977 ± 0.386
5.496AspGly: 5.496 ± 0.699
0.687AspHis: 0.687 ± 0.205
3.588AspIle: 3.588 ± 0.43
3.511AspLys: 3.511 ± 0.394
4.962AspLeu: 4.962 ± 0.542
1.374AspMet: 1.374 ± 0.258
2.824AspAsn: 2.824 ± 0.42
1.756AspPro: 1.756 ± 0.328
0.763AspGln: 0.763 ± 0.24
2.901AspArg: 2.901 ± 0.38
3.664AspSer: 3.664 ± 0.513
3.893AspThr: 3.893 ± 0.476
4.122AspVal: 4.122 ± 0.524
0.992AspTrp: 0.992 ± 0.22
2.137AspTyr: 2.137 ± 0.404
0.0AspXaa: 0.0 ± 0.0
Glu
6.641GluAla: 6.641 ± 0.856
0.611GluCys: 0.611 ± 0.207
3.893GluAsp: 3.893 ± 0.49
5.038GluGlu: 5.038 ± 0.935
3.664GluPhe: 3.664 ± 0.691
4.504GluGly: 4.504 ± 0.897
0.84GluHis: 0.84 ± 0.213
3.817GluIle: 3.817 ± 0.463
4.046GluLys: 4.046 ± 0.58
5.954GluLeu: 5.954 ± 0.702
2.443GluMet: 2.443 ± 0.393
2.595GluAsn: 2.595 ± 0.37
1.756GluPro: 1.756 ± 0.589
3.435GluGln: 3.435 ± 0.656
3.511GluArg: 3.511 ± 0.646
4.275GluSer: 4.275 ± 0.531
3.282GluThr: 3.282 ± 0.528
4.809GluVal: 4.809 ± 0.51
0.992GluTrp: 0.992 ± 0.325
1.756GluTyr: 1.756 ± 0.384
0.0GluXaa: 0.0 ± 0.0
Phe
3.13PheAla: 3.13 ± 0.452
0.611PheCys: 0.611 ± 0.189
3.282PheAsp: 3.282 ± 0.496
2.824PheGlu: 2.824 ± 0.481
0.763PhePhe: 0.763 ± 0.226
2.901PheGly: 2.901 ± 0.398
0.611PheHis: 0.611 ± 0.221
2.443PheIle: 2.443 ± 0.405
1.985PheLys: 1.985 ± 0.488
2.214PheLeu: 2.214 ± 0.425
0.305PheMet: 0.305 ± 0.152
1.374PheAsn: 1.374 ± 0.328
1.527PhePro: 1.527 ± 0.419
1.374PheGln: 1.374 ± 0.292
2.29PheArg: 2.29 ± 0.29
2.29PheSer: 2.29 ± 0.496
3.282PheThr: 3.282 ± 0.531
3.053PheVal: 3.053 ± 0.431
0.763PheTrp: 0.763 ± 0.258
1.298PheTyr: 1.298 ± 0.304
0.0PheXaa: 0.0 ± 0.0
Gly
7.481GlyAla: 7.481 ± 0.72
0.84GlyCys: 0.84 ± 0.228
4.427GlyAsp: 4.427 ± 0.616
5.267GlyGlu: 5.267 ± 0.781
2.595GlyPhe: 2.595 ± 0.45
6.336GlyGly: 6.336 ± 0.731
1.45GlyHis: 1.45 ± 0.397
3.206GlyIle: 3.206 ± 0.477
5.573GlyLys: 5.573 ± 0.543
5.191GlyLeu: 5.191 ± 0.425
2.443GlyMet: 2.443 ± 0.619
3.893GlyAsn: 3.893 ± 0.478
1.679GlyPro: 1.679 ± 0.349
2.748GlyGln: 2.748 ± 0.413
4.351GlyArg: 4.351 ± 0.509
5.191GlySer: 5.191 ± 0.651
3.664GlyThr: 3.664 ± 0.612
5.954GlyVal: 5.954 ± 0.694
1.527GlyTrp: 1.527 ± 0.326
2.824GlyTyr: 2.824 ± 0.557
0.0GlyXaa: 0.0 ± 0.0
His
1.221HisAla: 1.221 ± 0.319
0.305HisCys: 0.305 ± 0.143
0.916HisAsp: 0.916 ± 0.2
0.916HisGlu: 0.916 ± 0.313
0.534HisPhe: 0.534 ± 0.205
1.145HisGly: 1.145 ± 0.345
0.611HisHis: 0.611 ± 0.226
1.069HisIle: 1.069 ± 0.234
1.069HisLys: 1.069 ± 0.281
1.145HisLeu: 1.145 ± 0.298
0.611HisMet: 0.611 ± 0.214
0.534HisAsn: 0.534 ± 0.172
1.145HisPro: 1.145 ± 0.284
0.916HisGln: 0.916 ± 0.257
0.84HisArg: 0.84 ± 0.233
0.687HisSer: 0.687 ± 0.178
0.763HisThr: 0.763 ± 0.325
1.221HisVal: 1.221 ± 0.319
0.153HisTrp: 0.153 ± 0.11
0.916HisTyr: 0.916 ± 0.323
0.0HisXaa: 0.0 ± 0.0
Ile
4.58IleAla: 4.58 ± 0.616
0.458IleCys: 0.458 ± 0.203
3.969IleAsp: 3.969 ± 0.558
3.053IleGlu: 3.053 ± 0.561
1.069IlePhe: 1.069 ± 0.256
3.282IleGly: 3.282 ± 0.438
0.763IleHis: 0.763 ± 0.221
2.29IleIle: 2.29 ± 0.384
3.053IleLys: 3.053 ± 0.449
2.748IleLeu: 2.748 ± 0.462
0.687IleMet: 0.687 ± 0.242
2.29IleAsn: 2.29 ± 0.418
2.824IlePro: 2.824 ± 0.431
1.603IleGln: 1.603 ± 0.411
2.672IleArg: 2.672 ± 0.285
3.053IleSer: 3.053 ± 0.558
4.504IleThr: 4.504 ± 0.515
3.893IleVal: 3.893 ± 0.521
0.687IleTrp: 0.687 ± 0.202
1.298IleTyr: 1.298 ± 0.36
0.0IleXaa: 0.0 ± 0.0
Lys
5.878LysAla: 5.878 ± 0.809
0.611LysCys: 0.611 ± 0.229
3.817LysAsp: 3.817 ± 0.504
4.427LysGlu: 4.427 ± 0.656
2.29LysPhe: 2.29 ± 0.341
3.893LysGly: 3.893 ± 0.412
1.145LysHis: 1.145 ± 0.283
2.061LysIle: 2.061 ± 0.506
3.282LysLys: 3.282 ± 0.547
5.191LysLeu: 5.191 ± 0.603
2.901LysMet: 2.901 ± 0.557
2.748LysAsn: 2.748 ± 0.421
2.519LysPro: 2.519 ± 0.52
2.29LysGln: 2.29 ± 0.381
3.817LysArg: 3.817 ± 0.607
2.824LysSer: 2.824 ± 0.496
3.435LysThr: 3.435 ± 0.368
3.893LysVal: 3.893 ± 0.746
0.992LysTrp: 0.992 ± 0.254
2.672LysTyr: 2.672 ± 0.434
0.0LysXaa: 0.0 ± 0.0
Leu
6.87LeuAla: 6.87 ± 0.568
0.534LeuCys: 0.534 ± 0.206
4.58LeuAsp: 4.58 ± 0.489
4.427LeuGlu: 4.427 ± 0.769
1.985LeuPhe: 1.985 ± 0.333
4.733LeuGly: 4.733 ± 0.543
1.145LeuHis: 1.145 ± 0.251
4.58LeuIle: 4.58 ± 0.571
5.496LeuLys: 5.496 ± 0.605
6.26LeuLeu: 6.26 ± 0.778
1.985LeuMet: 1.985 ± 0.35
4.198LeuAsn: 4.198 ± 0.587
3.282LeuPro: 3.282 ± 0.493
2.595LeuGln: 2.595 ± 0.4
5.344LeuArg: 5.344 ± 0.716
4.198LeuSer: 4.198 ± 0.504
5.038LeuThr: 5.038 ± 0.447
6.336LeuVal: 6.336 ± 0.587
0.916LeuTrp: 0.916 ± 0.3
2.595LeuTyr: 2.595 ± 0.373
0.0LeuXaa: 0.0 ± 0.0
Met
2.672MetAla: 2.672 ± 0.4
0.229MetCys: 0.229 ± 0.128
1.374MetAsp: 1.374 ± 0.372
1.298MetGlu: 1.298 ± 0.268
1.069MetPhe: 1.069 ± 0.264
1.679MetGly: 1.679 ± 0.356
0.305MetHis: 0.305 ± 0.151
1.145MetIle: 1.145 ± 0.337
1.603MetLys: 1.603 ± 0.334
2.214MetLeu: 2.214 ± 0.403
0.382MetMet: 0.382 ± 0.176
1.069MetAsn: 1.069 ± 0.239
1.298MetPro: 1.298 ± 0.364
0.916MetGln: 0.916 ± 0.268
1.45MetArg: 1.45 ± 0.267
2.061MetSer: 2.061 ± 0.371
1.908MetThr: 1.908 ± 0.387
1.832MetVal: 1.832 ± 0.36
0.458MetTrp: 0.458 ± 0.137
0.84MetTyr: 0.84 ± 0.272
0.0MetXaa: 0.0 ± 0.0
Asn
3.588AsnAla: 3.588 ± 0.454
0.382AsnCys: 0.382 ± 0.173
2.901AsnAsp: 2.901 ± 0.315
2.672AsnGlu: 2.672 ± 0.456
1.756AsnPhe: 1.756 ± 0.386
3.817AsnGly: 3.817 ± 0.533
0.763AsnHis: 0.763 ± 0.254
2.443AsnIle: 2.443 ± 0.427
2.214AsnLys: 2.214 ± 0.473
3.969AsnLeu: 3.969 ± 0.434
0.84AsnMet: 0.84 ± 0.309
2.137AsnAsn: 2.137 ± 0.455
1.756AsnPro: 1.756 ± 0.312
1.374AsnGln: 1.374 ± 0.284
2.137AsnArg: 2.137 ± 0.317
2.061AsnSer: 2.061 ± 0.358
2.061AsnThr: 2.061 ± 0.412
3.74AsnVal: 3.74 ± 0.417
0.687AsnTrp: 0.687 ± 0.188
1.45AsnTyr: 1.45 ± 0.301
0.0AsnXaa: 0.0 ± 0.0
Pro
2.824ProAla: 2.824 ± 0.515
0.458ProCys: 0.458 ± 0.17
3.13ProAsp: 3.13 ± 0.574
3.588ProGlu: 3.588 ± 0.505
1.603ProPhe: 1.603 ± 0.298
2.977ProGly: 2.977 ± 0.436
0.763ProHis: 0.763 ± 0.22
1.45ProIle: 1.45 ± 0.337
2.672ProLys: 2.672 ± 0.487
3.511ProLeu: 3.511 ± 0.485
0.763ProMet: 0.763 ± 0.247
1.298ProAsn: 1.298 ± 0.498
1.603ProPro: 1.603 ± 0.374
1.221ProGln: 1.221 ± 0.316
1.679ProArg: 1.679 ± 0.338
1.908ProSer: 1.908 ± 0.3
1.298ProThr: 1.298 ± 0.327
4.046ProVal: 4.046 ± 0.609
0.458ProTrp: 0.458 ± 0.224
1.298ProTyr: 1.298 ± 0.356
0.0ProXaa: 0.0 ± 0.0
Gln
4.046GlnAla: 4.046 ± 0.577
0.076GlnCys: 0.076 ± 0.07
1.298GlnAsp: 1.298 ± 0.299
2.366GlnGlu: 2.366 ± 0.458
1.221GlnPhe: 1.221 ± 0.356
2.366GlnGly: 2.366 ± 0.429
0.611GlnHis: 0.611 ± 0.199
1.679GlnIle: 1.679 ± 0.431
2.061GlnLys: 2.061 ± 0.314
3.13GlnLeu: 3.13 ± 0.532
1.221GlnMet: 1.221 ± 0.267
1.603GlnAsn: 1.603 ± 0.317
2.29GlnPro: 2.29 ± 0.397
2.214GlnGln: 2.214 ± 0.54
1.832GlnArg: 1.832 ± 0.343
1.908GlnSer: 1.908 ± 0.388
1.679GlnThr: 1.679 ± 0.307
2.595GlnVal: 2.595 ± 0.468
0.458GlnTrp: 0.458 ± 0.146
1.298GlnTyr: 1.298 ± 0.28
0.0GlnXaa: 0.0 ± 0.0
Arg
4.733ArgAla: 4.733 ± 0.458
0.305ArgCys: 0.305 ± 0.138
3.13ArgAsp: 3.13 ± 0.415
4.046ArgGlu: 4.046 ± 0.653
2.29ArgPhe: 2.29 ± 0.322
4.122ArgGly: 4.122 ± 0.519
1.298ArgHis: 1.298 ± 0.295
2.901ArgIle: 2.901 ± 0.436
3.588ArgLys: 3.588 ± 0.614
4.351ArgLeu: 4.351 ± 0.604
1.756ArgMet: 1.756 ± 0.345
2.977ArgAsn: 2.977 ± 0.499
1.908ArgPro: 1.908 ± 0.389
3.053ArgGln: 3.053 ± 0.482
4.427ArgArg: 4.427 ± 0.596
2.366ArgSer: 2.366 ± 0.346
2.672ArgThr: 2.672 ± 0.512
4.198ArgVal: 4.198 ± 0.564
0.916ArgTrp: 0.916 ± 0.205
1.45ArgTyr: 1.45 ± 0.386
0.0ArgXaa: 0.0 ± 0.0
Ser
6.641SerAla: 6.641 ± 0.987
0.458SerCys: 0.458 ± 0.179
3.282SerAsp: 3.282 ± 0.441
3.282SerGlu: 3.282 ± 0.531
2.29SerPhe: 2.29 ± 0.427
6.26SerGly: 6.26 ± 0.655
0.763SerHis: 0.763 ± 0.228
2.824SerIle: 2.824 ± 0.492
2.901SerLys: 2.901 ± 0.424
4.504SerLeu: 4.504 ± 0.439
1.374SerMet: 1.374 ± 0.323
1.985SerAsn: 1.985 ± 0.262
1.832SerPro: 1.832 ± 0.398
1.908SerGln: 1.908 ± 0.338
2.977SerArg: 2.977 ± 0.498
2.977SerSer: 2.977 ± 0.401
3.664SerThr: 3.664 ± 0.55
4.58SerVal: 4.58 ± 0.632
0.84SerTrp: 0.84 ± 0.232
2.214SerTyr: 2.214 ± 0.393
0.0SerXaa: 0.0 ± 0.0
Thr
6.565ThrAla: 6.565 ± 0.686
0.611ThrCys: 0.611 ± 0.189
3.969ThrAsp: 3.969 ± 0.461
3.74ThrGlu: 3.74 ± 0.454
2.824ThrPhe: 2.824 ± 0.466
6.107ThrGly: 6.107 ± 0.801
0.763ThrHis: 0.763 ± 0.232
2.672ThrIle: 2.672 ± 0.443
2.824ThrLys: 2.824 ± 0.437
4.962ThrLeu: 4.962 ± 0.533
0.916ThrMet: 0.916 ± 0.303
1.756ThrAsn: 1.756 ± 0.345
3.588ThrPro: 3.588 ± 0.475
1.298ThrGln: 1.298 ± 0.315
2.977ThrArg: 2.977 ± 0.463
4.198ThrSer: 4.198 ± 0.579
3.435ThrThr: 3.435 ± 0.51
4.656ThrVal: 4.656 ± 0.678
0.916ThrTrp: 0.916 ± 0.236
2.214ThrTyr: 2.214 ± 0.432
0.0ThrXaa: 0.0 ± 0.0
Val
7.405ValAla: 7.405 ± 0.744
0.534ValCys: 0.534 ± 0.177
3.969ValAsp: 3.969 ± 0.441
5.954ValGlu: 5.954 ± 0.689
3.053ValPhe: 3.053 ± 0.561
4.58ValGly: 4.58 ± 0.643
0.992ValHis: 0.992 ± 0.222
4.275ValIle: 4.275 ± 0.584
5.038ValLys: 5.038 ± 0.685
4.733ValLeu: 4.733 ± 0.567
1.374ValMet: 1.374 ± 0.348
3.435ValAsn: 3.435 ± 0.481
2.29ValPro: 2.29 ± 0.659
2.519ValGln: 2.519 ± 0.359
4.198ValArg: 4.198 ± 0.577
5.191ValSer: 5.191 ± 0.735
6.565ValThr: 6.565 ± 0.703
5.344ValVal: 5.344 ± 0.702
0.916ValTrp: 0.916 ± 0.249
2.748ValTyr: 2.748 ± 0.356
0.0ValXaa: 0.0 ± 0.0
Trp
1.374TrpAla: 1.374 ± 0.413
0.229TrpCys: 0.229 ± 0.12
0.763TrpAsp: 0.763 ± 0.187
0.611TrpGlu: 0.611 ± 0.18
0.687TrpPhe: 0.687 ± 0.267
1.069TrpGly: 1.069 ± 0.254
0.305TrpHis: 0.305 ± 0.178
0.84TrpIle: 0.84 ± 0.266
0.382TrpLys: 0.382 ± 0.183
1.374TrpLeu: 1.374 ± 0.364
0.458TrpMet: 0.458 ± 0.181
0.687TrpAsn: 0.687 ± 0.233
0.534TrpPro: 0.534 ± 0.232
0.611TrpGln: 0.611 ± 0.235
1.145TrpArg: 1.145 ± 0.292
0.534TrpSer: 0.534 ± 0.209
0.916TrpThr: 0.916 ± 0.209
1.069TrpVal: 1.069 ± 0.245
0.305TrpTrp: 0.305 ± 0.152
0.534TrpTyr: 0.534 ± 0.204
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.206TyrAla: 3.206 ± 0.496
0.611TyrCys: 0.611 ± 0.179
1.908TyrAsp: 1.908 ± 0.503
2.901TyrGlu: 2.901 ± 0.484
1.298TyrPhe: 1.298 ± 0.324
2.824TyrGly: 2.824 ± 0.511
0.611TyrHis: 0.611 ± 0.213
1.374TyrIle: 1.374 ± 0.304
2.595TyrLys: 2.595 ± 0.449
2.29TyrLeu: 2.29 ± 0.404
0.992TyrMet: 0.992 ± 0.233
1.221TyrAsn: 1.221 ± 0.249
1.45TyrPro: 1.45 ± 0.311
1.298TyrGln: 1.298 ± 0.325
2.748TyrArg: 2.748 ± 0.498
1.908TyrSer: 1.908 ± 0.386
2.672TyrThr: 2.672 ± 0.363
1.603TyrVal: 1.603 ± 0.315
0.076TyrTrp: 0.076 ± 0.077
1.603TyrTyr: 1.603 ± 0.306
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 69 proteins (13101 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski