Amino acid dipeptide frequency for Salmonella phage SPN3UB

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
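As a rough illustration of the idea, the sketch below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. The function name `dipeptide_frequencies`, the one-letter toy sequences, and the normalization by the total number of counted dipeptides are assumptions made for this example; the database may normalize differently.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Hypothetical helper: counts overlapping pairs inside each protein
    (pairs never span two proteins) and divides by the total pair count.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input in one-letter code (not taken from the SPN3UB proteome).
toy_proteins = ["MAAK", "AAL"]
toy_freqs = dipeptide_frequencies(toy_proteins)
```

Here "MAAK" contributes MA, AA, AK and "AAL" contributes AA, AL, so AA accounts for 2 of the 5 counted dipeptides.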

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility, dipeptides that occur more often than expected are marked in red and those that are underrepresented are marked in blue in the table.
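The uniform expectation can be worked out in a few lines. This is only a sketch: the page does not state the exact rule used for the red/blue marking, so the plain comparison against the uniform value in the hypothetical `classify` function is an assumption. The two example values are taken from the table.

```python
# Uniform expectation in per mille: 1000 / (number of possible dipeptides).
EXPECTED_400 = 1000.0 / 20 ** 2   # 20 standard amino acids -> 2.5 per mille
EXPECTED_441 = 1000.0 / 21 ** 2   # with Xaa as a 21st symbol -> about 2.268


def classify(value_permille, expected=EXPECTED_400):
    """Label a frequency relative to the uniform expectation.

    Assumed rule for illustration only; the database may apply
    a different threshold.
    """
    if value_permille > expected:
        return "overrepresented"
    if value_permille < expected:
        return "underrepresented"
    return "as expected"


ala_ala = classify(9.196)   # AlaAla value from the table
cys_cys = classify(0.211)   # CysCys value from the table
```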

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
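A minimal conversion sketch, using the AlaAla value from the table; the helper names are hypothetical.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0


ala_ala_fraction = permille_to_fraction(9.196)
ala_ala_percent = permille_to_percent(9.196)
```

So the AlaAla entry of 9.196‰ corresponds to a fraction of about 0.0092, or roughly 0.92%.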

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
9.196AlaAla: 9.196 ± 1.032
0.913AlaCys: 0.913 ± 0.267
5.546AlaAsp: 5.546 ± 0.556
5.827AlaGlu: 5.827 ± 0.601
2.738AlaPhe: 2.738 ± 0.427
6.248AlaGly: 6.248 ± 0.772
1.193AlaHis: 1.193 ± 0.248
6.037AlaIle: 6.037 ± 0.637
6.037AlaLys: 6.037 ± 0.732
7.862AlaLeu: 7.862 ± 0.898
2.738AlaMet: 2.738 ± 0.487
4.212AlaAsn: 4.212 ± 0.697
2.597AlaPro: 2.597 ± 0.532
4.282AlaGln: 4.282 ± 0.665
6.037AlaArg: 6.037 ± 0.689
5.616AlaSer: 5.616 ± 0.788
5.476AlaThr: 5.476 ± 0.753
7.09AlaVal: 7.09 ± 0.657
2.317AlaTrp: 2.317 ± 0.478
3.229AlaTyr: 3.229 ± 0.513
0.0AlaXaa: 0.0 ± 0.0
Cys
0.842CysAla: 0.842 ± 0.243
0.211CysCys: 0.211 ± 0.12
0.913CysAsp: 0.913 ± 0.227
0.702CysGlu: 0.702 ± 0.218
0.421CysPhe: 0.421 ± 0.175
0.842CysGly: 0.842 ± 0.28
0.281CysHis: 0.281 ± 0.183
0.421CysIle: 0.421 ± 0.146
0.842CysLys: 0.842 ± 0.248
0.351CysLeu: 0.351 ± 0.147
0.211CysMet: 0.211 ± 0.138
0.562CysAsn: 0.562 ± 0.203
0.702CysPro: 0.702 ± 0.201
0.632CysGln: 0.632 ± 0.194
1.404CysArg: 1.404 ± 0.426
0.913CysSer: 0.913 ± 0.24
0.702CysThr: 0.702 ± 0.213
0.351CysVal: 0.351 ± 0.153
0.281CysTrp: 0.281 ± 0.133
0.421CysTyr: 0.421 ± 0.18
0.0CysXaa: 0.0 ± 0.0
Asp
5.686AspAla: 5.686 ± 0.701
0.913AspCys: 0.913 ± 0.282
3.44AspAsp: 3.44 ± 0.515
4.844AspGlu: 4.844 ± 0.633
1.966AspPhe: 1.966 ± 0.317
5.054AspGly: 5.054 ± 0.64
0.772AspHis: 0.772 ± 0.23
3.159AspIle: 3.159 ± 0.462
3.37AspLys: 3.37 ± 0.533
3.159AspLeu: 3.159 ± 0.522
1.755AspMet: 1.755 ± 0.346
2.808AspAsn: 2.808 ± 0.475
2.387AspPro: 2.387 ± 0.353
1.685AspGln: 1.685 ± 0.376
3.089AspArg: 3.089 ± 0.501
3.229AspSer: 3.229 ± 0.425
2.948AspThr: 2.948 ± 0.52
3.229AspVal: 3.229 ± 0.481
0.983AspTrp: 0.983 ± 0.271
1.895AspTyr: 1.895 ± 0.53
0.0AspXaa: 0.0 ± 0.0
Glu
6.318GluAla: 6.318 ± 0.869
0.772GluCys: 0.772 ± 0.257
2.387GluAsp: 2.387 ± 0.408
3.721GluGlu: 3.721 ± 0.62
2.808GluPhe: 2.808 ± 0.415
4.282GluGly: 4.282 ± 0.485
0.842GluHis: 0.842 ± 0.254
4.774GluIle: 4.774 ± 0.805
4.774GluLys: 4.774 ± 0.819
6.318GluLeu: 6.318 ± 0.742
1.966GluMet: 1.966 ± 0.324
2.457GluAsn: 2.457 ± 0.576
2.317GluPro: 2.317 ± 0.451
3.861GluGln: 3.861 ± 0.523
4.142GluArg: 4.142 ± 0.683
4.212GluSer: 4.212 ± 0.511
3.58GluThr: 3.58 ± 0.641
3.721GluVal: 3.721 ± 0.593
1.123GluTrp: 1.123 ± 0.28
2.387GluTyr: 2.387 ± 0.52
0.0GluXaa: 0.0 ± 0.0
Phe
3.229PheAla: 3.229 ± 0.452
0.702PheCys: 0.702 ± 0.19
1.966PheAsp: 1.966 ± 0.322
2.457PheGlu: 2.457 ± 0.414
0.983PhePhe: 0.983 ± 0.216
2.246PheGly: 2.246 ± 0.457
0.562PheHis: 0.562 ± 0.212
1.755PheIle: 1.755 ± 0.351
1.404PheLys: 1.404 ± 0.284
1.966PheLeu: 1.966 ± 0.373
0.491PheMet: 0.491 ± 0.186
2.176PheAsn: 2.176 ± 0.426
1.123PhePro: 1.123 ± 0.241
0.772PheGln: 0.772 ± 0.218
2.036PheArg: 2.036 ± 0.55
2.387PheSer: 2.387 ± 0.403
2.808PheThr: 2.808 ± 0.442
1.474PheVal: 1.474 ± 0.292
0.421PheTrp: 0.421 ± 0.181
1.053PheTyr: 1.053 ± 0.249
0.0PheXaa: 0.0 ± 0.0
Gly
6.107GlyAla: 6.107 ± 0.568
0.983GlyCys: 0.983 ± 0.28
3.65GlyAsp: 3.65 ± 0.532
4.423GlyGlu: 4.423 ± 0.579
2.387GlyPhe: 2.387 ± 0.39
5.756GlyGly: 5.756 ± 0.943
0.842GlyHis: 0.842 ± 0.241
4.844GlyIle: 4.844 ± 0.508
5.335GlyLys: 5.335 ± 0.474
4.493GlyLeu: 4.493 ± 0.475
2.176GlyMet: 2.176 ± 0.343
3.721GlyAsn: 3.721 ± 0.475
1.544GlyPro: 1.544 ± 0.403
2.597GlyGln: 2.597 ± 0.411
4.633GlyArg: 4.633 ± 0.577
3.791GlySer: 3.791 ± 0.503
3.299GlyThr: 3.299 ± 0.51
4.703GlyVal: 4.703 ± 0.587
1.053GlyTrp: 1.053 ± 0.27
2.808GlyTyr: 2.808 ± 0.374
0.0GlyXaa: 0.0 ± 0.0
His
1.193HisAla: 1.193 ± 0.317
0.421HisCys: 0.421 ± 0.165
0.913HisAsp: 0.913 ± 0.197
0.702HisGlu: 0.702 ± 0.215
0.562HisPhe: 0.562 ± 0.222
1.755HisGly: 1.755 ± 0.375
0.562HisHis: 0.562 ± 0.251
1.053HisIle: 1.053 ± 0.225
0.421HisLys: 0.421 ± 0.177
1.474HisLeu: 1.474 ± 0.334
0.562HisMet: 0.562 ± 0.178
0.632HisAsn: 0.632 ± 0.193
1.264HisPro: 1.264 ± 0.306
0.562HisGln: 0.562 ± 0.181
0.772HisArg: 0.772 ± 0.215
0.983HisSer: 0.983 ± 0.322
0.772HisThr: 0.772 ± 0.315
0.702HisVal: 0.702 ± 0.212
0.281HisTrp: 0.281 ± 0.139
0.702HisTyr: 0.702 ± 0.204
0.0HisXaa: 0.0 ± 0.0
Ile
5.265IleAla: 5.265 ± 0.701
0.421IleCys: 0.421 ± 0.238
4.423IleAsp: 4.423 ± 0.557
2.878IleGlu: 2.878 ± 0.382
1.334IlePhe: 1.334 ± 0.256
2.948IleGly: 2.948 ± 0.389
1.193IleHis: 1.193 ± 0.266
4.072IleIle: 4.072 ± 0.535
3.44IleLys: 3.44 ± 0.501
4.774IleLeu: 4.774 ± 0.71
0.491IleMet: 0.491 ± 0.222
3.159IleAsn: 3.159 ± 0.427
3.159IlePro: 3.159 ± 0.407
2.387IleGln: 2.387 ± 0.474
3.58IleArg: 3.58 ± 0.537
5.054IleSer: 5.054 ± 0.572
3.791IleThr: 3.791 ± 0.798
3.44IleVal: 3.44 ± 0.504
1.193IleTrp: 1.193 ± 0.24
0.983IleTyr: 0.983 ± 0.314
0.0IleXaa: 0.0 ± 0.0
Lys
6.599LysAla: 6.599 ± 0.643
0.842LysCys: 0.842 ± 0.263
2.036LysAsp: 2.036 ± 0.455
4.703LysGlu: 4.703 ± 0.678
1.825LysPhe: 1.825 ± 0.348
3.229LysGly: 3.229 ± 0.506
0.983LysHis: 0.983 ± 0.267
2.948LysIle: 2.948 ± 0.422
3.65LysLys: 3.65 ± 0.505
5.125LysLeu: 5.125 ± 0.762
1.755LysMet: 1.755 ± 0.303
2.668LysAsn: 2.668 ± 0.467
2.527LysPro: 2.527 ± 0.467
2.457LysGln: 2.457 ± 0.389
3.931LysArg: 3.931 ± 0.485
3.861LysSer: 3.861 ± 0.554
4.212LysThr: 4.212 ± 0.555
3.65LysVal: 3.65 ± 0.597
1.053LysTrp: 1.053 ± 0.228
2.317LysTyr: 2.317 ± 0.389
0.0LysXaa: 0.0 ± 0.0
Leu
7.441LeuAla: 7.441 ± 0.65
0.983LeuCys: 0.983 ± 0.261
5.265LeuAsp: 5.265 ± 0.652
5.405LeuGlu: 5.405 ± 0.808
1.966LeuPhe: 1.966 ± 0.371
4.563LeuGly: 4.563 ± 0.67
1.404LeuHis: 1.404 ± 0.302
3.65LeuIle: 3.65 ± 0.404
3.861LeuLys: 3.861 ± 0.51
5.616LeuLeu: 5.616 ± 0.677
1.264LeuMet: 1.264 ± 0.274
4.282LeuAsn: 4.282 ± 0.642
4.072LeuPro: 4.072 ± 0.729
2.948LeuGln: 2.948 ± 0.437
5.195LeuArg: 5.195 ± 0.57
5.827LeuSer: 5.827 ± 0.723
4.914LeuThr: 4.914 ± 0.655
4.633LeuVal: 4.633 ± 0.54
1.053LeuTrp: 1.053 ± 0.239
2.317LeuTyr: 2.317 ± 0.434
0.0LeuXaa: 0.0 ± 0.0
Met
2.808MetAla: 2.808 ± 0.409
0.14MetCys: 0.14 ± 0.108
0.983MetAsp: 0.983 ± 0.243
1.123MetGlu: 1.123 ± 0.255
0.772MetPhe: 0.772 ± 0.224
1.334MetGly: 1.334 ± 0.31
0.14MetHis: 0.14 ± 0.106
1.123MetIle: 1.123 ± 0.313
2.036MetLys: 2.036 ± 0.378
1.544MetLeu: 1.544 ± 0.356
0.562MetMet: 0.562 ± 0.183
1.474MetAsn: 1.474 ± 0.321
1.123MetPro: 1.123 ± 0.29
1.123MetGln: 1.123 ± 0.288
1.474MetArg: 1.474 ± 0.304
2.948MetSer: 2.948 ± 0.494
2.246MetThr: 2.246 ± 0.433
1.264MetVal: 1.264 ± 0.301
0.351MetTrp: 0.351 ± 0.149
0.562MetTyr: 0.562 ± 0.208
0.0MetXaa: 0.0 ± 0.0
Asn
4.352AsnAla: 4.352 ± 0.701
0.281AsnCys: 0.281 ± 0.146
2.176AsnAsp: 2.176 ± 0.378
2.527AsnGlu: 2.527 ± 0.378
1.264AsnPhe: 1.264 ± 0.266
4.844AsnGly: 4.844 ± 0.619
0.983AsnHis: 0.983 ± 0.326
2.527AsnIle: 2.527 ± 0.476
2.387AsnLys: 2.387 ± 0.386
4.282AsnLeu: 4.282 ± 0.459
1.123AsnMet: 1.123 ± 0.271
2.387AsnAsn: 2.387 ± 0.495
2.948AsnPro: 2.948 ± 0.516
2.246AsnGln: 2.246 ± 0.457
1.966AsnArg: 1.966 ± 0.373
3.299AsnSer: 3.299 ± 0.45
2.317AsnThr: 2.317 ± 0.478
3.861AsnVal: 3.861 ± 0.559
0.351AsnTrp: 0.351 ± 0.135
1.895AsnTyr: 1.895 ± 0.411
0.0AsnXaa: 0.0 ± 0.0
Pro
4.142ProAla: 4.142 ± 0.475
0.211ProCys: 0.211 ± 0.131
3.019ProAsp: 3.019 ± 0.377
4.984ProGlu: 4.984 ± 0.985
1.053ProPhe: 1.053 ± 0.285
2.597ProGly: 2.597 ± 0.509
0.702ProHis: 0.702 ± 0.27
2.317ProIle: 2.317 ± 0.322
2.527ProLys: 2.527 ± 0.452
3.299ProLeu: 3.299 ± 0.507
0.702ProMet: 0.702 ± 0.249
1.685ProAsn: 1.685 ± 0.266
1.615ProPro: 1.615 ± 0.322
1.123ProGln: 1.123 ± 0.274
1.755ProArg: 1.755 ± 0.338
2.387ProSer: 2.387 ± 0.435
2.176ProThr: 2.176 ± 0.393
4.001ProVal: 4.001 ± 0.761
0.351ProTrp: 0.351 ± 0.184
0.913ProTyr: 0.913 ± 0.207
0.0ProXaa: 0.0 ± 0.0
Gln
4.703GlnAla: 4.703 ± 0.624
0.491GlnCys: 0.491 ± 0.218
1.544GlnAsp: 1.544 ± 0.358
2.738GlnGlu: 2.738 ± 0.36
0.983GlnPhe: 0.983 ± 0.292
1.755GlnGly: 1.755 ± 0.329
0.772GlnHis: 0.772 ± 0.252
2.808GlnIle: 2.808 ± 0.466
2.738GlnLys: 2.738 ± 0.413
3.159GlnLeu: 3.159 ± 0.465
1.544GlnMet: 1.544 ± 0.366
1.544GlnAsn: 1.544 ± 0.384
1.615GlnPro: 1.615 ± 0.437
2.106GlnGln: 2.106 ± 0.544
2.668GlnArg: 2.668 ± 0.484
2.597GlnSer: 2.597 ± 0.44
1.334GlnThr: 1.334 ± 0.34
3.159GlnVal: 3.159 ± 0.481
0.632GlnTrp: 0.632 ± 0.181
1.474GlnTyr: 1.474 ± 0.418
0.0GlnXaa: 0.0 ± 0.0
Arg
4.282ArgAla: 4.282 ± 0.449
1.264ArgCys: 1.264 ± 0.36
3.58ArgAsp: 3.58 ± 0.494
3.159ArgGlu: 3.159 ± 0.465
2.106ArgPhe: 2.106 ± 0.463
4.212ArgGly: 4.212 ± 0.606
1.474ArgHis: 1.474 ± 0.309
4.423ArgIle: 4.423 ± 0.68
4.212ArgLys: 4.212 ± 0.669
5.897ArgLeu: 5.897 ± 0.694
1.895ArgMet: 1.895 ± 0.383
2.176ArgAsn: 2.176 ± 0.365
2.106ArgPro: 2.106 ± 0.383
3.159ArgGln: 3.159 ± 0.578
4.212ArgArg: 4.212 ± 0.892
2.317ArgSer: 2.317 ± 0.396
2.948ArgThr: 2.948 ± 0.481
4.001ArgVal: 4.001 ± 0.538
1.123ArgTrp: 1.123 ± 0.321
2.246ArgTyr: 2.246 ± 0.413
0.0ArgXaa: 0.0 ± 0.0
Ser
6.529SerAla: 6.529 ± 0.585
0.491SerCys: 0.491 ± 0.191
4.142SerAsp: 4.142 ± 0.558
4.563SerGlu: 4.563 ± 0.551
2.668SerPhe: 2.668 ± 0.452
6.248SerGly: 6.248 ± 0.796
1.053SerHis: 1.053 ± 0.33
2.948SerIle: 2.948 ± 0.409
2.597SerLys: 2.597 ± 0.419
5.195SerLeu: 5.195 ± 0.815
2.106SerMet: 2.106 ± 0.313
3.089SerAsn: 3.089 ± 0.405
2.176SerPro: 2.176 ± 0.385
2.457SerGln: 2.457 ± 0.411
3.229SerArg: 3.229 ± 0.437
5.054SerSer: 5.054 ± 0.701
3.299SerThr: 3.299 ± 0.548
4.914SerVal: 4.914 ± 0.545
0.983SerTrp: 0.983 ± 0.237
2.246SerTyr: 2.246 ± 0.461
0.0SerXaa: 0.0 ± 0.0
Thr
7.09ThrAla: 7.09 ± 0.763
0.281ThrCys: 0.281 ± 0.129
4.142ThrAsp: 4.142 ± 0.574
4.774ThrGlu: 4.774 ± 0.683
1.966ThrPhe: 1.966 ± 0.411
4.633ThrGly: 4.633 ± 0.57
0.632ThrHis: 0.632 ± 0.214
2.808ThrIle: 2.808 ± 0.457
3.229ThrLys: 3.229 ± 0.479
4.282ThrLeu: 4.282 ± 0.608
1.334ThrMet: 1.334 ± 0.371
2.808ThrAsn: 2.808 ± 0.32
2.878ThrPro: 2.878 ± 0.531
1.825ThrGln: 1.825 ± 0.329
3.37ThrArg: 3.37 ± 0.528
3.37ThrSer: 3.37 ± 0.421
3.65ThrThr: 3.65 ± 0.644
3.931ThrVal: 3.931 ± 0.564
1.193ThrTrp: 1.193 ± 0.255
1.615ThrTyr: 1.615 ± 0.356
0.0ThrXaa: 0.0 ± 0.0
Val
5.546ValAla: 5.546 ± 0.718
0.562ValCys: 0.562 ± 0.228
3.58ValAsp: 3.58 ± 0.5
4.352ValGlu: 4.352 ± 0.529
2.387ValPhe: 2.387 ± 0.399
4.072ValGly: 4.072 ± 0.62
0.772ValHis: 0.772 ± 0.217
3.58ValIle: 3.58 ± 0.583
5.054ValLys: 5.054 ± 0.674
4.563ValLeu: 4.563 ± 0.64
1.544ValMet: 1.544 ± 0.391
3.931ValAsn: 3.931 ± 0.573
3.159ValPro: 3.159 ± 0.509
2.317ValGln: 2.317 ± 0.467
3.299ValArg: 3.299 ± 0.56
4.914ValSer: 4.914 ± 0.614
5.827ValThr: 5.827 ± 0.626
4.844ValVal: 4.844 ± 0.63
0.913ValTrp: 0.913 ± 0.301
1.966ValTyr: 1.966 ± 0.313
0.0ValXaa: 0.0 ± 0.0
Trp
1.193TrpAla: 1.193 ± 0.334
0.632TrpCys: 0.632 ± 0.201
0.983TrpAsp: 0.983 ± 0.228
1.123TrpGlu: 1.123 ± 0.271
0.772TrpPhe: 0.772 ± 0.223
0.562TrpGly: 0.562 ± 0.205
0.421TrpHis: 0.421 ± 0.174
1.123TrpIle: 1.123 ± 0.273
0.842TrpLys: 0.842 ± 0.329
1.123TrpLeu: 1.123 ± 0.349
0.07TrpMet: 0.07 ± 0.075
0.842TrpAsn: 0.842 ± 0.208
0.562TrpPro: 0.562 ± 0.223
0.632TrpGln: 0.632 ± 0.207
1.123TrpArg: 1.123 ± 0.277
0.913TrpSer: 0.913 ± 0.235
1.474TrpThr: 1.474 ± 0.432
1.544TrpVal: 1.544 ± 0.333
0.351TrpTrp: 0.351 ± 0.199
0.281TrpTyr: 0.281 ± 0.128
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.668TyrAla: 2.668 ± 0.329
0.562TyrCys: 0.562 ± 0.211
1.685TyrAsp: 1.685 ± 0.327
1.755TyrGlu: 1.755 ± 0.29
1.053TyrPhe: 1.053 ± 0.324
1.966TyrGly: 1.966 ± 0.36
0.702TyrHis: 0.702 ± 0.342
1.615TyrIle: 1.615 ± 0.279
1.615TyrLys: 1.615 ± 0.304
2.176TyrLeu: 2.176 ± 0.373
0.772TyrMet: 0.772 ± 0.245
1.404TyrAsn: 1.404 ± 0.364
1.685TyrPro: 1.685 ± 0.363
1.193TyrGln: 1.193 ± 0.236
2.878TyrArg: 2.878 ± 0.431
2.317TyrSer: 2.317 ± 0.414
2.036TyrThr: 2.036 ± 0.323
2.668TyrVal: 2.668 ± 0.476
0.562TyrTrp: 0.562 ± 0.235
0.491TyrTyr: 0.491 ± 0.17
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 71 proteins (14246 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
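A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the estimates is reported. That the reported error is the standard deviation across resamples is an assumption, as are the function names and the toy sequences; the page only states that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy sequences in one-letter code (not taken from the SPN3UB proteome).
toy_error = bootstrap_error(["MAAK", "AAL", "GGAA"], "AA")
identical_error = bootstrap_error(["AAA"] * 5, "AA")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.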

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski