Amino acid dipeptide frequency for Salmonella phage SEN22

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
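The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the database's own pipeline: the function name `dipeptide_per_mille`, the use of one-letter codes (with `X` standing for Xaa), the choice to count overlapping pairs within each protein only, and the normalization by the total number of counted pairs are all assumptions made here.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids, one-letter codes
ALPHABET = STANDARD + "X"           # X plays the role of Xaa (assumption)


def dipeptide_per_mille(proteins):
    """Return the frequency of all 441 dipeptides in per mille (hypothetical helper)."""
    counts = Counter()
    for seq in proteins:
        # Map anything that is not a standard residue to X (Xaa).
        cleaned = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        # Overlapping adjacent pairs; pairs never span two proteins.
        for first, second in zip(cleaned, cleaned[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    freqs = {}
    for first, second in product(ALPHABET, repeat=2):
        pair = first + second
        freqs[pair] = 1000.0 * counts[pair] / total if total else 0.0
    return freqs


# Toy input, not real phage proteins.
toy = ["MAAK", "AAC"]
freqs = dipeptide_per_mille(toy)
uniform_expectation = 1000.0 / 400  # 2.5 per mille for 400 equally likely pairs
print(len(freqs), freqs["AA"], uniform_expectation)
```

With real data the input would be the protein sequences of the proteome; the toy sequences only show the mechanics.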

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
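As a small worked example of the unit conversion, the sketch below turns a few per mille values taken from the table into fractions and compares them with the uniform expectation of 2.5‰. The helper names and the simple greater-than or less-than rule are assumptions for illustration; the exact criterion used for the red and blue marking on the original page is not stated.

```python
EXPECTED_PER_MILLE = 1000.0 / 400  # uniform expectation for 400 dipeptides: 2.5 per mille


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=EXPECTED_PER_MILLE):
    """Label a per mille value relative to the uniform expectation (assumed rule)."""
    if value > expected:
        return "over"    # would correspond to red
    if value < expected:
        return "under"   # would correspond to blue
    return "as expected"


# Values copied from the table (per mille).
examples = {"AlaAla": 11.435, "AlaPro": 2.008, "CysTrp": 0.087}
for name, value in examples.items():
    print(name, per_mille_to_fraction(value), classify(value))
```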

For more information, see the sequence space article on Wikipedia.

Residue order (rows: first residue; columns: second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 11.435 ± 2.169
AlaCys: 1.484 ± 0.355
AlaAsp: 5.15 ± 0.666
AlaGlu: 7.332 ± 0.838
AlaPhe: 2.706 ± 0.512
AlaGly: 6.023 ± 1.23
AlaHis: 1.135 ± 0.249
AlaIle: 6.023 ± 1.086
AlaLys: 4.801 ± 0.592
AlaLeu: 6.983 ± 0.898
AlaMet: 3.579 ± 0.64
AlaAsn: 5.412 ± 1.024
AlaPro: 2.008 ± 0.422
AlaGln: 3.492 ± 0.779
AlaArg: 4.626 ± 0.593
AlaSer: 4.452 ± 0.492
AlaThr: 8.031 ± 1.667
AlaVal: 5.412 ± 0.762
AlaTrp: 1.309 ± 0.331
AlaTyr: 1.92 ± 0.437
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.698 ± 0.292
CysCys: 0.436 ± 0.18
CysAsp: 0.786 ± 0.287
CysGlu: 0.786 ± 0.28
CysPhe: 0.436 ± 0.187
CysGly: 1.047 ± 0.269
CysHis: 0.436 ± 0.216
CysIle: 0.436 ± 0.204
CysLys: 0.611 ± 0.207
CysLeu: 0.873 ± 0.32
CysMet: 0.175 ± 0.122
CysAsn: 0.524 ± 0.236
CysPro: 0.873 ± 0.262
CysGln: 0.524 ± 0.177
CysArg: 1.746 ± 0.414
CysSer: 0.96 ± 0.693
CysThr: 0.611 ± 0.216
CysVal: 0.698 ± 0.223
CysTrp: 0.087 ± 0.078
CysTyr: 0.96 ± 0.26
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.11 ± 1.076
AspCys: 1.047 ± 0.33
AspAsp: 4.626 ± 0.736
AspGlu: 4.103 ± 0.562
AspPhe: 1.746 ± 0.314
AspGly: 5.587 ± 0.548
AspHis: 0.786 ± 0.281
AspIle: 4.452 ± 0.605
AspLys: 3.317 ± 0.495
AspLeu: 5.325 ± 0.584
AspMet: 1.92 ± 0.451
AspAsn: 2.619 ± 0.488
AspPro: 1.484 ± 0.36
AspGln: 1.571 ± 0.323
AspArg: 3.142 ± 0.561
AspSer: 2.968 ± 0.555
AspThr: 1.397 ± 0.289
AspVal: 5.063 ± 0.633
AspTrp: 1.571 ± 0.455
AspTyr: 2.357 ± 0.487
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.285 ± 0.55
GluCys: 1.397 ± 0.312
GluAsp: 3.055 ± 0.49
GluGlu: 4.277 ± 0.853
GluPhe: 1.746 ± 0.446
GluGly: 4.103 ± 0.604
GluHis: 1.222 ± 0.31
GluIle: 5.15 ± 0.769
GluLys: 4.277 ± 0.57
GluLeu: 5.936 ± 0.821
GluMet: 2.357 ± 0.403
GluAsn: 2.793 ± 0.485
GluPro: 2.27 ± 0.511
GluGln: 3.317 ± 0.56
GluArg: 4.365 ± 0.685
GluSer: 4.277 ± 0.456
GluThr: 3.23 ± 0.52
GluVal: 4.452 ± 0.704
GluTrp: 1.92 ± 0.418
GluTyr: 2.182 ± 0.615
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.706 ± 0.494
PheCys: 0.262 ± 0.163
PheAsp: 2.182 ± 0.361
PheGlu: 2.27 ± 0.437
PhePhe: 1.309 ± 0.445
PheGly: 2.357 ± 0.428
PheHis: 0.698 ± 0.26
PheIle: 2.095 ± 0.456
PheLys: 2.182 ± 0.343
PheLeu: 1.484 ± 0.285
PheMet: 0.698 ± 0.218
PheAsn: 1.833 ± 0.434
PhePro: 1.309 ± 0.29
PheGln: 1.135 ± 0.42
PheArg: 1.571 ± 0.369
PheSer: 2.531 ± 0.444
PheThr: 1.746 ± 0.336
PheVal: 1.92 ± 0.604
PheTrp: 1.135 ± 0.252
PheTyr: 1.047 ± 0.337
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.237 ± 1.097
GlyCys: 0.698 ± 0.201
GlyAsp: 4.888 ± 0.869
GlyGlu: 4.277 ± 0.581
GlyPhe: 2.793 ± 0.438
GlyGly: 4.976 ± 1.201
GlyHis: 1.222 ± 0.402
GlyIle: 6.372 ± 0.796
GlyLys: 6.198 ± 0.848
GlyLeu: 5.412 ± 1.002
GlyMet: 1.484 ± 0.343
GlyAsn: 3.841 ± 0.675
GlyPro: 1.222 ± 0.39
GlyGln: 4.015 ± 0.838
GlyArg: 4.801 ± 0.724
GlySer: 3.579 ± 0.691
GlyThr: 2.968 ± 0.618
GlyVal: 5.063 ± 0.744
GlyTrp: 1.397 ± 0.27
GlyTyr: 2.444 ± 0.642
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.397 ± 0.371
HisCys: 0.349 ± 0.167
HisAsp: 0.96 ± 0.302
HisGlu: 1.397 ± 0.332
HisPhe: 0.611 ± 0.22
HisGly: 1.659 ± 0.336
HisHis: 0.349 ± 0.162
HisIle: 0.96 ± 0.255
HisLys: 0.611 ± 0.247
HisLeu: 1.222 ± 0.368
HisMet: 0.611 ± 0.184
HisAsn: 0.524 ± 0.171
HisPro: 0.786 ± 0.273
HisGln: 0.96 ± 0.279
HisArg: 1.047 ± 0.304
HisSer: 1.047 ± 0.257
HisThr: 0.96 ± 0.314
HisVal: 0.524 ± 0.16
HisTrp: 0.087 ± 0.086
HisTyr: 0.873 ± 0.257
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.11 ± 0.846
IleCys: 0.524 ± 0.21
IleAsp: 3.841 ± 0.635
IleGlu: 4.365 ± 0.603
IlePhe: 1.746 ± 0.316
IleGly: 5.15 ± 0.778
IleHis: 1.484 ± 0.372
IleIle: 3.404 ± 0.549
IleLys: 3.055 ± 0.47
IleLeu: 3.841 ± 0.615
IleMet: 1.047 ± 0.301
IleAsn: 3.492 ± 0.618
IlePro: 2.881 ± 0.513
IleGln: 2.444 ± 0.34
IleArg: 3.753 ± 0.488
IleSer: 4.015 ± 0.512
IleThr: 4.452 ± 0.613
IleVal: 3.404 ± 0.524
IleTrp: 0.873 ± 0.282
IleTyr: 1.92 ± 0.367
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.761 ± 0.878
LysCys: 0.96 ± 0.256
LysAsp: 3.317 ± 0.549
LysGlu: 4.365 ± 0.592
LysPhe: 2.095 ± 0.35
LysGly: 4.714 ± 0.425
LysHis: 1.135 ± 0.391
LysIle: 2.706 ± 0.404
LysLys: 3.753 ± 0.671
LysLeu: 4.801 ± 0.938
LysMet: 1.222 ± 0.313
LysAsn: 2.182 ± 0.553
LysPro: 2.793 ± 0.506
LysGln: 3.579 ± 0.654
LysArg: 3.317 ± 0.693
LysSer: 4.365 ± 0.532
LysThr: 3.666 ± 0.502
LysVal: 3.492 ± 0.578
LysTrp: 0.611 ± 0.243
LysTyr: 2.182 ± 0.486
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.809 ± 0.702
LeuCys: 1.222 ± 0.354
LeuAsp: 4.103 ± 0.66
LeuGlu: 6.285 ± 0.726
LeuPhe: 2.706 ± 0.591
LeuGly: 3.928 ± 0.844
LeuHis: 0.786 ± 0.275
LeuIle: 3.841 ± 0.714
LeuLys: 4.452 ± 0.743
LeuLeu: 4.714 ± 0.763
LeuMet: 2.008 ± 0.405
LeuAsn: 4.976 ± 0.682
LeuPro: 3.666 ± 0.472
LeuGln: 2.706 ± 0.555
LeuArg: 4.801 ± 0.719
LeuSer: 5.325 ± 0.605
LeuThr: 5.325 ± 0.619
LeuVal: 3.492 ± 0.577
LeuTrp: 1.047 ± 0.35
LeuTyr: 1.746 ± 0.332
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.968 ± 0.488
MetCys: 0.175 ± 0.133
MetAsp: 1.746 ± 0.436
MetGlu: 0.873 ± 0.256
MetPhe: 0.698 ± 0.311
MetGly: 1.659 ± 0.438
MetHis: 0.262 ± 0.155
MetIle: 1.309 ± 0.294
MetLys: 2.793 ± 0.469
MetLeu: 2.182 ± 0.415
MetMet: 1.047 ± 0.319
MetAsn: 1.309 ± 0.366
MetPro: 1.397 ± 0.322
MetGln: 1.222 ± 0.33
MetArg: 2.095 ± 0.377
MetSer: 1.746 ± 0.35
MetThr: 1.571 ± 0.394
MetVal: 1.135 ± 0.238
MetTrp: 0.087 ± 0.078
MetTyr: 1.135 ± 0.3
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.801 ± 0.962
AsnCys: 0.611 ± 0.32
AsnAsp: 3.404 ± 0.399
AsnGlu: 3.142 ± 0.502
AsnPhe: 0.786 ± 0.281
AsnGly: 5.237 ± 0.806
AsnHis: 1.047 ± 0.259
AsnIle: 2.531 ± 0.528
AsnLys: 2.881 ± 0.455
AsnLeu: 2.27 ± 0.482
AsnMet: 1.484 ± 0.296
AsnAsn: 1.833 ± 0.425
AsnPro: 2.619 ± 0.464
AsnGln: 2.706 ± 0.528
AsnArg: 2.531 ± 0.548
AsnSer: 3.23 ± 0.513
AsnThr: 3.753 ± 1.638
AsnVal: 2.881 ± 0.625
AsnTrp: 0.349 ± 0.156
AsnTyr: 0.786 ± 0.279
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.531 ± 0.444
ProCys: 0.087 ± 0.083
ProAsp: 3.841 ± 0.428
ProGlu: 4.888 ± 0.7
ProPhe: 1.659 ± 0.331
ProGly: 1.571 ± 0.341
ProHis: 1.047 ± 0.24
ProIle: 1.659 ± 0.292
ProLys: 2.095 ± 0.351
ProLeu: 1.833 ± 0.402
ProMet: 0.786 ± 0.264
ProAsn: 1.309 ± 0.362
ProPro: 1.571 ± 0.426
ProGln: 1.397 ± 0.312
ProArg: 1.222 ± 0.286
ProSer: 3.055 ± 0.414
ProThr: 1.92 ± 0.318
ProVal: 2.968 ± 0.476
ProTrp: 0.611 ± 0.19
ProTyr: 1.571 ± 0.382
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.928 ± 0.678
GlnCys: 0.262 ± 0.139
GlnAsp: 2.182 ± 0.418
GlnGlu: 2.706 ± 0.47
GlnPhe: 1.746 ± 0.306
GlnGly: 3.055 ± 0.554
GlnHis: 0.698 ± 0.223
GlnIle: 3.753 ± 0.473
GlnLys: 2.357 ± 0.41
GlnLeu: 3.753 ± 0.581
GlnMet: 1.659 ± 0.361
GlnAsn: 2.531 ± 0.732
GlnPro: 1.659 ± 0.358
GlnGln: 4.015 ± 0.79
GlnArg: 2.444 ± 0.436
GlnSer: 3.23 ± 0.5
GlnThr: 1.222 ± 0.28
GlnVal: 1.92 ± 0.452
GlnTrp: 0.873 ± 0.248
GlnTyr: 1.571 ± 0.389
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.277 ± 0.592
ArgCys: 1.135 ± 0.304
ArgAsp: 4.277 ± 0.708
ArgGlu: 4.976 ± 0.949
ArgPhe: 1.833 ± 0.391
ArgGly: 3.317 ± 0.333
ArgHis: 1.135 ± 0.292
ArgIle: 4.015 ± 0.736
ArgLys: 4.015 ± 0.856
ArgLeu: 5.063 ± 0.61
ArgMet: 2.008 ± 0.428
ArgAsn: 2.444 ± 0.438
ArgPro: 1.397 ± 0.325
ArgGln: 2.444 ± 0.481
ArgArg: 4.277 ± 0.702
ArgSer: 4.103 ± 0.676
ArgThr: 2.531 ± 0.443
ArgVal: 3.055 ± 0.617
ArgTrp: 0.786 ± 0.242
ArgTyr: 2.444 ± 0.458
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.856 ± 1.777
SerCys: 0.349 ± 0.155
SerAsp: 3.055 ± 0.562
SerGlu: 4.103 ± 0.556
SerPhe: 2.444 ± 0.446
SerGly: 5.499 ± 1.065
SerHis: 1.309 ± 0.373
SerIle: 3.142 ± 0.516
SerLys: 3.579 ± 0.447
SerLeu: 4.801 ± 0.736
SerMet: 1.571 ± 0.328
SerAsn: 2.095 ± 0.389
SerPro: 2.881 ± 0.537
SerGln: 2.968 ± 0.572
SerArg: 3.579 ± 0.458
SerSer: 3.404 ± 0.835
SerThr: 2.793 ± 0.468
SerVal: 4.452 ± 0.579
SerTrp: 0.698 ± 0.278
SerTyr: 2.27 ± 0.354
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.976 ± 0.704
ThrCys: 0.873 ± 0.265
ThrAsp: 3.317 ± 0.548
ThrGlu: 2.531 ± 0.55
ThrPhe: 1.746 ± 0.429
ThrGly: 6.547 ± 1.57
ThrHis: 0.611 ± 0.148
ThrIle: 3.23 ± 0.392
ThrLys: 3.404 ± 0.556
ThrLeu: 3.928 ± 0.614
ThrMet: 1.047 ± 0.281
ThrAsn: 4.015 ± 1.611
ThrPro: 3.142 ± 0.553
ThrGln: 2.008 ± 0.513
ThrArg: 2.968 ± 0.571
ThrSer: 2.793 ± 0.538
ThrThr: 2.619 ± 0.396
ThrVal: 3.055 ± 0.569
ThrTrp: 0.524 ± 0.183
ThrTyr: 1.659 ± 0.333
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.237 ± 0.606
ValCys: 0.436 ± 0.241
ValAsp: 3.142 ± 0.402
ValGlu: 3.404 ± 0.515
ValPhe: 2.095 ± 0.399
ValGly: 3.317 ± 0.525
ValHis: 0.349 ± 0.156
ValIle: 4.365 ± 0.595
ValLys: 3.579 ± 0.414
ValLeu: 5.15 ± 0.736
ValMet: 1.397 ± 0.415
ValAsn: 3.23 ± 0.557
ValPro: 1.92 ± 0.405
ValGln: 2.182 ± 0.489
ValArg: 3.666 ± 0.453
ValSer: 4.976 ± 0.725
ValThr: 4.19 ± 0.566
ValVal: 3.492 ± 0.518
ValTrp: 1.047 ± 0.34
ValTyr: 1.92 ± 0.558
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.135 ± 0.249
TrpCys: 0.436 ± 0.166
TrpAsp: 0.786 ± 0.196
TrpGlu: 0.786 ± 0.316
TrpPhe: 0.436 ± 0.197
TrpGly: 0.873 ± 0.259
TrpHis: 0.262 ± 0.197
TrpIle: 0.524 ± 0.168
TrpLys: 0.786 ± 0.306
TrpLeu: 2.095 ± 0.458
TrpMet: 0.524 ± 0.19
TrpAsn: 0.611 ± 0.195
TrpPro: 0.524 ± 0.166
TrpGln: 1.135 ± 0.275
TrpArg: 1.309 ± 0.283
TrpSer: 0.96 ± 0.551
TrpThr: 0.786 ± 0.312
TrpVal: 0.96 ± 0.29
TrpTrp: 0.175 ± 0.124
TrpTyr: 0.524 ± 0.208
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.055 ± 0.501
TyrCys: 0.96 ± 0.571
TyrAsp: 2.531 ± 0.616
TyrGlu: 1.746 ± 0.451
TyrPhe: 1.135 ± 0.273
TyrGly: 2.27 ± 0.459
TyrHis: 0.96 ± 0.288
TyrIle: 1.833 ± 0.33
TyrLys: 2.27 ± 0.377
TyrLeu: 2.27 ± 0.398
TyrMet: 0.698 ± 0.187
TyrAsn: 1.309 ± 0.342
TyrPro: 1.222 ± 0.39
TyrGln: 1.746 ± 0.386
TyrArg: 2.357 ± 0.473
TyrSer: 1.833 ± 0.34
TyrThr: 1.222 ± 0.275
TyrVal: 1.659 ± 0.308
TyrTrp: 0.436 ± 0.191
TyrTyr: 1.047 ± 0.308
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 55 proteins (11457 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
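One way to read the note above is sketched below: whole proteins are resampled with replacement 100 times, the frequency is recomputed on each resample, and the spread of those estimates is reported as the error. This is a minimal sketch under assumptions, not the database's actual code: the function names, the fixed seed, the one-letter codes, and the use of the sample standard deviation as the reported ± value are all choices made here for illustration.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Per mille frequency of one dipeptide among all overlapping pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    # Assumption: the reported error is the standard deviation of the estimates.
    return statistics.stdev(estimates)


# Toy sequences, not real phage proteins.
toy = ["MAAKL", "AACAA", "MKKLA", "GAAVA"]
point = pair_per_mille(toy, "AA")
err = bootstrap_error(toy, "AA")
print(f"AA: {point:.3f} ± {err:.3f}")
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, which matches the "at the protein level" wording of the note.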

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski