Amino acid dipepetide frequency for Salmonella phage Seszw_1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.418AlaAla: 9.418 ± 2.027
0.452AlaCys: 0.452 ± 0.189
4.596AlaAsp: 4.596 ± 0.561
6.479AlaGlu: 6.479 ± 0.933
2.562AlaPhe: 2.562 ± 0.464
7.157AlaGly: 7.157 ± 0.798
0.678AlaHis: 0.678 ± 0.271
5.877AlaIle: 5.877 ± 0.711
6.253AlaLys: 6.253 ± 0.857
7.308AlaLeu: 7.308 ± 0.831
2.863AlaMet: 2.863 ± 0.621
4.144AlaAsn: 4.144 ± 0.655
1.658AlaPro: 1.658 ± 0.345
3.164AlaGln: 3.164 ± 0.929
5.274AlaArg: 5.274 ± 0.576
5.425AlaSer: 5.425 ± 0.822
5.048AlaThr: 5.048 ± 0.791
5.726AlaVal: 5.726 ± 0.802
1.205AlaTrp: 1.205 ± 0.274
2.562AlaTyr: 2.562 ± 0.427
0.0AlaXaa: 0.0 ± 0.0
Cys
1.13CysAla: 1.13 ± 0.224
0.301CysCys: 0.301 ± 0.134
0.904CysAsp: 0.904 ± 0.212
1.658CysGlu: 1.658 ± 0.38
0.527CysPhe: 0.527 ± 0.201
1.431CysGly: 1.431 ± 0.466
0.979CysHis: 0.979 ± 0.249
0.829CysIle: 0.829 ± 0.252
1.356CysLys: 1.356 ± 0.332
0.904CysLeu: 0.904 ± 0.281
0.301CysMet: 0.301 ± 0.154
0.603CysAsn: 0.603 ± 0.258
0.452CysPro: 0.452 ± 0.195
0.377CysGln: 0.377 ± 0.163
0.603CysArg: 0.603 ± 0.207
0.904CysSer: 0.904 ± 0.304
0.452CysThr: 0.452 ± 0.193
0.377CysVal: 0.377 ± 0.155
0.301CysTrp: 0.301 ± 0.153
0.603CysTyr: 0.603 ± 0.231
0.0CysXaa: 0.0 ± 0.0
Asp
5.801AspAla: 5.801 ± 0.844
1.055AspCys: 1.055 ± 0.274
4.973AspAsp: 4.973 ± 0.962
4.596AspGlu: 4.596 ± 0.623
3.466AspPhe: 3.466 ± 0.451
6.404AspGly: 6.404 ± 0.884
0.904AspHis: 0.904 ± 0.254
4.52AspIle: 4.52 ± 0.521
4.219AspLys: 4.219 ± 0.657
3.24AspLeu: 3.24 ± 0.461
1.959AspMet: 1.959 ± 0.447
2.185AspAsn: 2.185 ± 0.467
1.658AspPro: 1.658 ± 0.358
1.13AspGln: 1.13 ± 0.298
2.562AspArg: 2.562 ± 0.428
3.164AspSer: 3.164 ± 0.622
2.336AspThr: 2.336 ± 0.499
4.144AspVal: 4.144 ± 0.566
0.829AspTrp: 0.829 ± 0.277
2.411AspTyr: 2.411 ± 0.409
0.0AspXaa: 0.0 ± 0.0
Glu
5.199GluAla: 5.199 ± 0.6
1.431GluCys: 1.431 ± 0.355
3.692GluAsp: 3.692 ± 0.551
4.068GluGlu: 4.068 ± 0.608
1.884GluPhe: 1.884 ± 0.38
3.089GluGly: 3.089 ± 0.442
0.753GluHis: 0.753 ± 0.23
5.349GluIle: 5.349 ± 0.56
4.219GluLys: 4.219 ± 0.757
5.952GluLeu: 5.952 ± 0.547
2.863GluMet: 2.863 ± 0.51
2.26GluAsn: 2.26 ± 0.37
2.411GluPro: 2.411 ± 0.394
4.37GluGln: 4.37 ± 0.636
3.014GluArg: 3.014 ± 0.567
4.445GluSer: 4.445 ± 0.534
2.788GluThr: 2.788 ± 0.507
3.39GluVal: 3.39 ± 0.529
1.658GluTrp: 1.658 ± 0.34
2.863GluTyr: 2.863 ± 0.432
0.0GluXaa: 0.0 ± 0.0
Phe
1.733PheAla: 1.733 ± 0.427
0.979PheCys: 0.979 ± 0.288
3.089PheAsp: 3.089 ± 0.413
1.884PheGlu: 1.884 ± 0.363
0.753PhePhe: 0.753 ± 0.229
2.562PheGly: 2.562 ± 0.328
0.527PheHis: 0.527 ± 0.185
2.562PheIle: 2.562 ± 0.523
1.507PheLys: 1.507 ± 0.317
1.959PheLeu: 1.959 ± 0.443
1.431PheMet: 1.431 ± 0.325
2.562PheAsn: 2.562 ± 0.443
1.431PhePro: 1.431 ± 0.374
0.979PheGln: 0.979 ± 0.25
1.658PheArg: 1.658 ± 0.34
3.164PheSer: 3.164 ± 0.476
1.658PheThr: 1.658 ± 0.369
1.281PheVal: 1.281 ± 0.26
0.226PheTrp: 0.226 ± 0.128
1.13PheTyr: 1.13 ± 0.299
0.0PheXaa: 0.0 ± 0.0
Gly
5.575GlyAla: 5.575 ± 0.707
1.205GlyCys: 1.205 ± 0.335
4.671GlyAsp: 4.671 ± 0.715
4.671GlyGlu: 4.671 ± 0.508
2.336GlyPhe: 2.336 ± 0.412
4.746GlyGly: 4.746 ± 0.882
1.13GlyHis: 1.13 ± 0.244
4.52GlyIle: 4.52 ± 0.468
6.103GlyLys: 6.103 ± 0.719
4.822GlyLeu: 4.822 ± 0.604
2.411GlyMet: 2.411 ± 0.444
2.938GlyAsn: 2.938 ± 0.404
0.753GlyPro: 0.753 ± 0.192
2.411GlyGln: 2.411 ± 0.448
3.616GlyArg: 3.616 ± 0.57
4.973GlySer: 4.973 ± 0.829
3.993GlyThr: 3.993 ± 0.524
5.123GlyVal: 5.123 ± 0.696
1.055GlyTrp: 1.055 ± 0.235
3.918GlyTyr: 3.918 ± 0.617
0.0GlyXaa: 0.0 ± 0.0
His
0.904HisAla: 0.904 ± 0.332
0.527HisCys: 0.527 ± 0.205
0.979HisAsp: 0.979 ± 0.258
1.205HisGlu: 1.205 ± 0.305
0.603HisPhe: 0.603 ± 0.202
2.185HisGly: 2.185 ± 0.586
0.527HisHis: 0.527 ± 0.209
0.904HisIle: 0.904 ± 0.319
1.507HisLys: 1.507 ± 0.361
1.431HisLeu: 1.431 ± 0.339
0.301HisMet: 0.301 ± 0.152
0.829HisAsn: 0.829 ± 0.236
0.904HisPro: 0.904 ± 0.242
0.753HisGln: 0.753 ± 0.212
0.904HisArg: 0.904 ± 0.244
0.829HisSer: 0.829 ± 0.228
0.678HisThr: 0.678 ± 0.238
1.281HisVal: 1.281 ± 0.334
0.226HisTrp: 0.226 ± 0.134
0.904HisTyr: 0.904 ± 0.274
0.0HisXaa: 0.0 ± 0.0
Ile
5.952IleAla: 5.952 ± 0.682
1.055IleCys: 1.055 ± 0.265
5.048IleAsp: 5.048 ± 0.601
4.52IleGlu: 4.52 ± 0.482
1.733IlePhe: 1.733 ± 0.319
4.671IleGly: 4.671 ± 0.714
1.507IleHis: 1.507 ± 0.351
4.822IleIle: 4.822 ± 0.725
4.219IleLys: 4.219 ± 0.519
4.596IleLeu: 4.596 ± 0.516
2.185IleMet: 2.185 ± 0.552
3.164IleAsn: 3.164 ± 0.417
2.185IlePro: 2.185 ± 0.395
1.733IleGln: 1.733 ± 0.314
3.993IleArg: 3.993 ± 0.56
5.048IleSer: 5.048 ± 0.772
4.068IleThr: 4.068 ± 0.579
4.37IleVal: 4.37 ± 0.464
0.452IleTrp: 0.452 ± 0.175
1.582IleTyr: 1.582 ± 0.367
0.0IleXaa: 0.0 ± 0.0
Lys
6.555LysAla: 6.555 ± 0.765
1.205LysCys: 1.205 ± 0.334
3.842LysAsp: 3.842 ± 0.658
3.541LysGlu: 3.541 ± 0.502
2.562LysPhe: 2.562 ± 0.537
3.541LysGly: 3.541 ± 0.625
1.507LysHis: 1.507 ± 0.434
3.993LysIle: 3.993 ± 0.529
3.616LysLys: 3.616 ± 0.628
5.048LysLeu: 5.048 ± 0.561
2.863LysMet: 2.863 ± 0.535
3.692LysAsn: 3.692 ± 0.555
2.562LysPro: 2.562 ± 0.443
2.863LysGln: 2.863 ± 0.551
3.089LysArg: 3.089 ± 0.495
4.294LysSer: 4.294 ± 0.639
3.24LysThr: 3.24 ± 0.549
4.219LysVal: 4.219 ± 0.663
1.205LysTrp: 1.205 ± 0.321
2.562LysTyr: 2.562 ± 0.433
0.0LysXaa: 0.0 ± 0.0
Leu
6.027LeuAla: 6.027 ± 0.821
0.829LeuCys: 0.829 ± 0.291
3.164LeuAsp: 3.164 ± 0.429
4.746LeuGlu: 4.746 ± 0.552
2.11LeuPhe: 2.11 ± 0.417
4.596LeuGly: 4.596 ± 0.571
1.431LeuHis: 1.431 ± 0.353
4.596LeuIle: 4.596 ± 0.636
4.973LeuLys: 4.973 ± 0.724
4.973LeuLeu: 4.973 ± 0.658
1.507LeuMet: 1.507 ± 0.304
3.541LeuAsn: 3.541 ± 0.546
3.39LeuPro: 3.39 ± 0.691
2.712LeuGln: 2.712 ± 0.625
4.973LeuArg: 4.973 ± 0.573
4.746LeuSer: 4.746 ± 0.704
4.52LeuThr: 4.52 ± 0.548
5.274LeuVal: 5.274 ± 0.721
1.431LeuTrp: 1.431 ± 0.316
2.562LeuTyr: 2.562 ± 0.487
0.0LeuXaa: 0.0 ± 0.0
Met
3.39MetAla: 3.39 ± 0.572
0.527MetCys: 0.527 ± 0.201
1.658MetAsp: 1.658 ± 0.443
1.431MetGlu: 1.431 ± 0.296
1.055MetPhe: 1.055 ± 0.258
0.979MetGly: 0.979 ± 0.248
0.753MetHis: 0.753 ± 0.245
1.582MetIle: 1.582 ± 0.397
2.863MetLys: 2.863 ± 0.471
2.11MetLeu: 2.11 ± 0.39
1.13MetMet: 1.13 ± 0.31
2.034MetAsn: 2.034 ± 0.382
1.884MetPro: 1.884 ± 0.501
1.205MetGln: 1.205 ± 0.331
1.507MetArg: 1.507 ± 0.277
2.26MetSer: 2.26 ± 0.464
2.26MetThr: 2.26 ± 0.38
1.356MetVal: 1.356 ± 0.413
0.527MetTrp: 0.527 ± 0.181
0.979MetTyr: 0.979 ± 0.252
0.0MetXaa: 0.0 ± 0.0
Asn
4.596AsnAla: 4.596 ± 0.747
0.452AsnCys: 0.452 ± 0.157
3.39AsnAsp: 3.39 ± 0.505
2.637AsnGlu: 2.637 ± 0.4
0.829AsnPhe: 0.829 ± 0.349
5.048AsnGly: 5.048 ± 0.628
1.356AsnHis: 1.356 ± 0.339
3.089AsnIle: 3.089 ± 0.361
2.788AsnLys: 2.788 ± 0.544
3.767AsnLeu: 3.767 ± 0.45
0.753AsnMet: 0.753 ± 0.3
2.11AsnAsn: 2.11 ± 0.45
1.507AsnPro: 1.507 ± 0.364
1.808AsnGln: 1.808 ± 0.346
2.11AsnArg: 2.11 ± 0.345
2.637AsnSer: 2.637 ± 0.519
2.562AsnThr: 2.562 ± 0.498
3.089AsnVal: 3.089 ± 0.664
0.753AsnTrp: 0.753 ± 0.247
1.808AsnTyr: 1.808 ± 0.366
0.0AsnXaa: 0.0 ± 0.0
Pro
3.39ProAla: 3.39 ± 0.629
0.603ProCys: 0.603 ± 0.236
2.411ProAsp: 2.411 ± 0.417
3.164ProGlu: 3.164 ± 0.462
1.13ProPhe: 1.13 ± 0.307
2.11ProGly: 2.11 ± 0.5
0.452ProHis: 0.452 ± 0.172
1.884ProIle: 1.884 ± 0.279
1.884ProLys: 1.884 ± 0.442
2.336ProLeu: 2.336 ± 0.462
0.753ProMet: 0.753 ± 0.262
1.356ProAsn: 1.356 ± 0.322
1.13ProPro: 1.13 ± 0.319
1.507ProGln: 1.507 ± 0.394
1.356ProArg: 1.356 ± 0.43
1.733ProSer: 1.733 ± 0.35
1.884ProThr: 1.884 ± 0.354
3.164ProVal: 3.164 ± 0.498
0.527ProTrp: 0.527 ± 0.235
1.13ProTyr: 1.13 ± 0.282
0.0ProXaa: 0.0 ± 0.0
Gln
3.164GlnAla: 3.164 ± 0.915
0.452GlnCys: 0.452 ± 0.175
1.356GlnAsp: 1.356 ± 0.377
3.014GlnGlu: 3.014 ± 0.378
1.658GlnPhe: 1.658 ± 0.404
1.431GlnGly: 1.431 ± 0.34
0.979GlnHis: 0.979 ± 0.264
2.411GlnIle: 2.411 ± 0.469
2.712GlnLys: 2.712 ± 0.576
3.541GlnLeu: 3.541 ± 0.599
1.507GlnMet: 1.507 ± 0.359
1.205GlnAsn: 1.205 ± 0.272
1.884GlnPro: 1.884 ± 0.342
3.315GlnGln: 3.315 ± 1.355
1.884GlnArg: 1.884 ± 0.424
2.863GlnSer: 2.863 ± 0.491
2.26GlnThr: 2.26 ± 0.54
2.562GlnVal: 2.562 ± 0.455
0.603GlnTrp: 0.603 ± 0.21
1.356GlnTyr: 1.356 ± 0.267
0.0GlnXaa: 0.0 ± 0.0
Arg
4.445ArgAla: 4.445 ± 0.535
1.13ArgCys: 1.13 ± 0.39
2.863ArgAsp: 2.863 ± 0.376
3.466ArgGlu: 3.466 ± 0.58
1.507ArgPhe: 1.507 ± 0.315
2.712ArgGly: 2.712 ± 0.399
0.904ArgHis: 0.904 ± 0.259
3.089ArgIle: 3.089 ± 0.502
3.993ArgLys: 3.993 ± 0.755
3.993ArgLeu: 3.993 ± 0.558
1.733ArgMet: 1.733 ± 0.377
3.466ArgAsn: 3.466 ± 0.451
1.205ArgPro: 1.205 ± 0.317
2.034ArgGln: 2.034 ± 0.471
3.842ArgArg: 3.842 ± 0.697
3.466ArgSer: 3.466 ± 0.444
1.808ArgThr: 1.808 ± 0.447
3.466ArgVal: 3.466 ± 0.504
0.603ArgTrp: 0.603 ± 0.228
2.712ArgTyr: 2.712 ± 0.47
0.0ArgXaa: 0.0 ± 0.0
Ser
6.781SerAla: 6.781 ± 1.298
0.678SerCys: 0.678 ± 0.256
4.52SerAsp: 4.52 ± 0.539
4.068SerGlu: 4.068 ± 0.495
2.712SerPhe: 2.712 ± 0.489
6.253SerGly: 6.253 ± 0.765
0.753SerHis: 0.753 ± 0.21
3.918SerIle: 3.918 ± 0.532
3.767SerLys: 3.767 ± 0.475
4.897SerLeu: 4.897 ± 0.696
2.26SerMet: 2.26 ± 0.411
2.863SerAsn: 2.863 ± 0.415
2.562SerPro: 2.562 ± 0.4
2.788SerGln: 2.788 ± 0.526
2.712SerArg: 2.712 ± 0.462
4.294SerSer: 4.294 ± 0.837
3.616SerThr: 3.616 ± 0.609
3.767SerVal: 3.767 ± 0.529
0.904SerTrp: 0.904 ± 0.258
2.562SerTyr: 2.562 ± 0.382
0.0SerXaa: 0.0 ± 0.0
Thr
4.294ThrAla: 4.294 ± 0.62
0.753ThrCys: 0.753 ± 0.253
3.014ThrAsp: 3.014 ± 0.44
3.993ThrGlu: 3.993 ± 0.566
1.658ThrPhe: 1.658 ± 0.432
5.575ThrGly: 5.575 ± 0.612
0.753ThrHis: 0.753 ± 0.235
3.39ThrIle: 3.39 ± 0.51
3.315ThrLys: 3.315 ± 0.525
3.616ThrLeu: 3.616 ± 0.534
1.281ThrMet: 1.281 ± 0.267
2.411ThrAsn: 2.411 ± 0.597
3.164ThrPro: 3.164 ± 0.533
2.11ThrGln: 2.11 ± 0.344
3.164ThrArg: 3.164 ± 0.569
3.315ThrSer: 3.315 ± 0.509
3.842ThrThr: 3.842 ± 0.572
2.938ThrVal: 2.938 ± 0.533
0.603ThrTrp: 0.603 ± 0.26
1.356ThrTyr: 1.356 ± 0.332
0.0ThrXaa: 0.0 ± 0.0
Val
5.425ValAla: 5.425 ± 0.743
0.452ValCys: 0.452 ± 0.194
3.767ValAsp: 3.767 ± 0.599
3.692ValGlu: 3.692 ± 0.551
2.336ValPhe: 2.336 ± 0.511
3.466ValGly: 3.466 ± 0.572
1.281ValHis: 1.281 ± 0.274
5.952ValIle: 5.952 ± 0.6
3.918ValLys: 3.918 ± 0.525
3.767ValLeu: 3.767 ± 0.604
2.336ValMet: 2.336 ± 0.408
3.39ValAsn: 3.39 ± 0.534
1.431ValPro: 1.431 ± 0.335
2.486ValGln: 2.486 ± 0.358
2.938ValArg: 2.938 ± 0.459
5.349ValSer: 5.349 ± 0.678
4.144ValThr: 4.144 ± 0.799
4.219ValVal: 4.219 ± 0.692
0.979ValTrp: 0.979 ± 0.27
2.486ValTyr: 2.486 ± 0.399
0.0ValXaa: 0.0 ± 0.0
Trp
1.13TrpAla: 1.13 ± 0.314
0.377TrpCys: 0.377 ± 0.209
0.904TrpAsp: 0.904 ± 0.231
0.678TrpGlu: 0.678 ± 0.24
0.527TrpPhe: 0.527 ± 0.224
0.301TrpGly: 0.301 ± 0.139
0.301TrpHis: 0.301 ± 0.163
0.979TrpIle: 0.979 ± 0.299
1.055TrpLys: 1.055 ± 0.275
1.507TrpLeu: 1.507 ± 0.342
0.301TrpMet: 0.301 ± 0.199
0.753TrpAsn: 0.753 ± 0.244
0.527TrpPro: 0.527 ± 0.195
0.527TrpGln: 0.527 ± 0.196
1.431TrpArg: 1.431 ± 0.34
0.829TrpSer: 0.829 ± 0.217
0.979TrpThr: 0.979 ± 0.232
1.13TrpVal: 1.13 ± 0.316
0.151TrpTrp: 0.151 ± 0.093
0.603TrpTyr: 0.603 ± 0.197
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.938TyrAla: 2.938 ± 0.436
0.527TyrCys: 0.527 ± 0.166
2.712TyrAsp: 2.712 ± 0.482
2.185TyrGlu: 2.185 ± 0.39
1.13TyrPhe: 1.13 ± 0.334
2.788TyrGly: 2.788 ± 0.497
0.904TyrHis: 0.904 ± 0.313
2.637TyrIle: 2.637 ± 0.538
1.733TyrLys: 1.733 ± 0.424
2.336TyrLeu: 2.336 ± 0.449
0.678TyrMet: 0.678 ± 0.238
1.582TyrAsn: 1.582 ± 0.328
1.205TyrPro: 1.205 ± 0.303
1.884TyrGln: 1.884 ± 0.281
1.808TyrArg: 1.808 ± 0.365
2.938TyrSer: 2.938 ± 0.467
2.411TyrThr: 2.411 ± 0.442
2.938TyrVal: 2.938 ± 0.592
0.753TyrTrp: 0.753 ± 0.217
1.281TyrTyr: 1.281 ± 0.243
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 72 proteins (13274 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski