Amino acid dipeptide frequency for Staphylococcus phage StauST398-2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
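The counting and the comparison with the uniform expectation can be sketched as follows. This is a minimal illustration, not the database's own code: it assumes one-letter sequences, overlapping windows of length two that never span two proteins, and a uniform baseline of 1/400. The names `dipeptide_per_mille`, `classify`, and `UNIFORM_PER_MILLE` are hypothetical.

```python
from collections import Counter

# Uniform expectation for 400 equally likely dipeptides: 2.5 per mille (0.25%).
UNIFORM_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences."""
    counts = Counter()
    for seq in sequences:
        # Overlapping windows of length 2 within each protein (assumption).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > UNIFORM_PER_MILLE:
        return "over"   # would be marked red in the table
    if freq_per_mille < UNIFORM_PER_MILLE:
        return "under"  # would be marked blue in the table
    return "expected"


# Toy input, not data from this proteome.
freqs = dipeptide_per_mille(["ACAC"])
```

With the values from the table, `classify(6.124)` (AlaLys) gives `"over"` and `classify(0.216)` (AlaCys) gives `"under"`.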

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
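As a small worked example of the unit conversion, the helpers below turn a per mille value into a fraction or a percentage. The function names are hypothetical and only illustrate the arithmetic.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10.0


# AlaLys is listed as 6.124 per mille in the table.
ala_lys_fraction = per_mille_to_fraction(6.124)
ala_lys_percent = per_mille_to_percent(6.124)
```

For AlaLys this gives a fraction of about 0.006124, or about 0.61%.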

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
2.522AlaAla: 2.522 ± 0.866
0.216AlaCys: 0.216 ± 0.136
2.594AlaAsp: 2.594 ± 0.43
3.963AlaGlu: 3.963 ± 0.493
1.945AlaPhe: 1.945 ± 0.292
3.242AlaGly: 3.242 ± 0.6
1.081AlaHis: 1.081 ± 0.271
4.323AlaIle: 4.323 ± 0.74
6.124AlaLys: 6.124 ± 1.166
5.044AlaLeu: 5.044 ± 0.667
1.369AlaMet: 1.369 ± 0.251
4.035AlaAsn: 4.035 ± 0.868
1.729AlaPro: 1.729 ± 0.41
1.801AlaGln: 1.801 ± 0.463
2.594AlaArg: 2.594 ± 0.346
5.116AlaSer: 5.116 ± 0.638
3.242AlaThr: 3.242 ± 0.488
2.522AlaVal: 2.522 ± 0.471
1.369AlaTrp: 1.369 ± 0.392
2.666AlaTyr: 2.666 ± 0.507
0.0AlaXaa: 0.0 ± 0.0
Cys
0.216CysAla: 0.216 ± 0.106
0.0CysCys: 0.0 ± 0.0
0.216CysAsp: 0.216 ± 0.132
0.432CysGlu: 0.432 ± 0.213
0.36CysPhe: 0.36 ± 0.158
0.144CysGly: 0.144 ± 0.108
0.216CysHis: 0.216 ± 0.133
0.216CysIle: 0.216 ± 0.12
0.504CysLys: 0.504 ± 0.201
0.648CysLeu: 0.648 ± 0.26
0.072CysMet: 0.072 ± 0.078
0.288CysAsn: 0.288 ± 0.143
0.216CysPro: 0.216 ± 0.152
0.216CysGln: 0.216 ± 0.124
0.288CysArg: 0.288 ± 0.158
0.216CysSer: 0.216 ± 0.128
0.216CysThr: 0.216 ± 0.116
0.288CysVal: 0.288 ± 0.139
0.0CysTrp: 0.0 ± 0.0
0.216CysTyr: 0.216 ± 0.161
0.0CysXaa: 0.0 ± 0.0
Asp
2.522AspAla: 2.522 ± 0.582
0.216AspCys: 0.216 ± 0.134
3.458AspAsp: 3.458 ± 0.607
5.116AspGlu: 5.116 ± 0.841
3.603AspPhe: 3.603 ± 0.49
3.531AspGly: 3.531 ± 0.546
0.648AspHis: 0.648 ± 0.198
5.908AspIle: 5.908 ± 0.712
6.629AspLys: 6.629 ± 0.793
6.268AspLeu: 6.268 ± 0.539
1.873AspMet: 1.873 ± 0.369
2.882AspAsn: 2.882 ± 0.388
1.225AspPro: 1.225 ± 0.383
1.009AspGln: 1.009 ± 0.207
2.306AspArg: 2.306 ± 0.378
3.531AspSer: 3.531 ± 0.557
3.314AspThr: 3.314 ± 0.523
4.179AspVal: 4.179 ± 0.509
0.793AspTrp: 0.793 ± 0.199
3.819AspTyr: 3.819 ± 0.651
0.0AspXaa: 0.0 ± 0.0
Glu
4.323GluAla: 4.323 ± 0.458
0.576GluCys: 0.576 ± 0.201
5.116GluAsp: 5.116 ± 0.849
7.133GluGlu: 7.133 ± 1.199
3.026GluPhe: 3.026 ± 0.55
3.386GluGly: 3.386 ± 0.552
0.937GluHis: 0.937 ± 0.28
5.692GluIle: 5.692 ± 0.989
8.934GluLys: 8.934 ± 0.925
7.061GluLeu: 7.061 ± 0.83
2.738GluMet: 2.738 ± 0.44
5.116GluAsn: 5.116 ± 0.576
1.225GluPro: 1.225 ± 0.387
3.17GluGln: 3.17 ± 0.587
2.954GluArg: 2.954 ± 0.617
3.386GluSer: 3.386 ± 0.487
4.395GluThr: 4.395 ± 0.544
3.963GluVal: 3.963 ± 0.471
0.937GluTrp: 0.937 ± 0.195
2.738GluTyr: 2.738 ± 0.553
0.0GluXaa: 0.0 ± 0.0
Phe
1.873PheAla: 1.873 ± 0.436
0.36PheCys: 0.36 ± 0.15
3.531PheAsp: 3.531 ± 0.547
3.314PheGlu: 3.314 ± 0.51
0.865PhePhe: 0.865 ± 0.238
3.17PheGly: 3.17 ± 0.557
0.576PheHis: 0.576 ± 0.185
3.747PheIle: 3.747 ± 0.699
4.827PheLys: 4.827 ± 0.714
2.306PheLeu: 2.306 ± 0.405
1.009PheMet: 1.009 ± 0.233
3.386PheAsn: 3.386 ± 0.561
0.721PhePro: 0.721 ± 0.282
1.153PheGln: 1.153 ± 0.284
1.153PheArg: 1.153 ± 0.279
1.729PheSer: 1.729 ± 0.361
2.162PheThr: 2.162 ± 0.365
2.378PheVal: 2.378 ± 0.496
0.288PheTrp: 0.288 ± 0.142
2.162PheTyr: 2.162 ± 0.445
0.0PheXaa: 0.0 ± 0.0
Gly
4.323GlyAla: 4.323 ± 0.901
0.288GlyCys: 0.288 ± 0.144
4.395GlyAsp: 4.395 ± 0.487
3.314GlyGlu: 3.314 ± 0.412
2.594GlyPhe: 2.594 ± 0.355
5.044GlyGly: 5.044 ± 1.242
1.225GlyHis: 1.225 ± 0.279
3.242GlyIle: 3.242 ± 0.486
5.836GlyLys: 5.836 ± 0.593
5.98GlyLeu: 5.98 ± 0.962
1.153GlyMet: 1.153 ± 0.418
3.026GlyAsn: 3.026 ± 0.413
1.009GlyPro: 1.009 ± 0.245
1.225GlyGln: 1.225 ± 0.359
2.017GlyArg: 2.017 ± 0.503
3.891GlySer: 3.891 ± 0.609
3.531GlyThr: 3.531 ± 0.686
4.683GlyVal: 4.683 ± 0.685
1.153GlyTrp: 1.153 ± 0.407
2.666GlyTyr: 2.666 ± 0.562
0.0GlyXaa: 0.0 ± 0.0
His
1.009HisAla: 1.009 ± 0.243
0.072HisCys: 0.072 ± 0.078
0.793HisAsp: 0.793 ± 0.289
1.009HisGlu: 1.009 ± 0.256
0.937HisPhe: 0.937 ± 0.195
1.081HisGly: 1.081 ± 0.282
0.504HisHis: 0.504 ± 0.258
1.225HisIle: 1.225 ± 0.41
1.153HisLys: 1.153 ± 0.269
1.369HisLeu: 1.369 ± 0.301
0.504HisMet: 0.504 ± 0.216
1.081HisAsn: 1.081 ± 0.262
0.865HisPro: 0.865 ± 0.195
0.648HisGln: 0.648 ± 0.204
0.937HisArg: 0.937 ± 0.232
1.297HisSer: 1.297 ± 0.229
1.225HisThr: 1.225 ± 0.305
0.793HisVal: 0.793 ± 0.255
0.288HisTrp: 0.288 ± 0.145
1.081HisTyr: 1.081 ± 0.278
0.0HisXaa: 0.0 ± 0.0
Ile
3.963IleAla: 3.963 ± 0.539
0.288IleCys: 0.288 ± 0.177
5.044IleAsp: 5.044 ± 0.801
6.196IleGlu: 6.196 ± 0.661
2.594IlePhe: 2.594 ± 0.618
3.386IleGly: 3.386 ± 0.634
1.801IleHis: 1.801 ± 0.382
4.827IleIle: 4.827 ± 0.768
7.421IleLys: 7.421 ± 0.839
5.26IleLeu: 5.26 ± 0.758
1.369IleMet: 1.369 ± 0.309
4.323IleAsn: 4.323 ± 0.488
2.522IlePro: 2.522 ± 0.351
1.873IleGln: 1.873 ± 0.352
3.458IleArg: 3.458 ± 0.385
4.683IleSer: 4.683 ± 0.606
3.747IleThr: 3.747 ± 0.445
3.675IleVal: 3.675 ± 0.691
0.504IleTrp: 0.504 ± 0.205
2.954IleTyr: 2.954 ± 0.534
0.0IleXaa: 0.0 ± 0.0
Lys
7.421LysAla: 7.421 ± 1.241
0.216LysCys: 0.216 ± 0.152
5.404LysAsp: 5.404 ± 0.672
8.862LysGlu: 8.862 ± 1.094
2.306LysPhe: 2.306 ± 0.363
5.332LysGly: 5.332 ± 0.866
1.945LysHis: 1.945 ± 0.478
6.485LysIle: 6.485 ± 0.812
7.493LysLys: 7.493 ± 1.027
9.006LysLeu: 9.006 ± 0.938
2.45LysMet: 2.45 ± 0.452
5.692LysAsn: 5.692 ± 0.661
2.162LysPro: 2.162 ± 0.434
5.116LysGln: 5.116 ± 0.616
3.891LysArg: 3.891 ± 0.688
6.485LysSer: 6.485 ± 1.344
5.26LysThr: 5.26 ± 0.55
5.62LysVal: 5.62 ± 0.766
2.017LysTrp: 2.017 ± 0.426
4.899LysTyr: 4.899 ± 0.717
0.0LysXaa: 0.0 ± 0.0
Leu
4.467LeuAla: 4.467 ± 0.832
0.504LeuCys: 0.504 ± 0.209
5.26LeuAsp: 5.26 ± 1.003
6.341LeuGlu: 6.341 ± 0.693
3.603LeuPhe: 3.603 ± 0.518
4.251LeuGly: 4.251 ± 1.006
1.441LeuHis: 1.441 ± 0.419
5.404LeuIle: 5.404 ± 0.709
9.511LeuLys: 9.511 ± 1.186
6.845LeuLeu: 6.845 ± 0.923
2.017LeuMet: 2.017 ± 0.326
6.196LeuAsn: 6.196 ± 0.686
2.954LeuPro: 2.954 ± 0.529
2.882LeuGln: 2.882 ± 0.579
3.603LeuArg: 3.603 ± 0.497
5.476LeuSer: 5.476 ± 0.606
5.836LeuThr: 5.836 ± 0.621
3.603LeuVal: 3.603 ± 0.366
0.504LeuTrp: 0.504 ± 0.207
3.098LeuTyr: 3.098 ± 0.725
0.0LeuXaa: 0.0 ± 0.0
Met
0.937MetAla: 0.937 ± 0.23
0.144MetCys: 0.144 ± 0.108
1.225MetAsp: 1.225 ± 0.378
1.657MetGlu: 1.657 ± 0.353
0.793MetPhe: 0.793 ± 0.195
1.513MetGly: 1.513 ± 0.534
0.36MetHis: 0.36 ± 0.174
1.441MetIle: 1.441 ± 0.257
2.882MetLys: 2.882 ± 0.52
2.017MetLeu: 2.017 ± 0.542
0.432MetMet: 0.432 ± 0.161
1.585MetAsn: 1.585 ± 0.353
1.153MetPro: 1.153 ± 0.262
1.657MetGln: 1.657 ± 0.403
1.009MetArg: 1.009 ± 0.285
1.729MetSer: 1.729 ± 0.314
2.089MetThr: 2.089 ± 0.289
1.297MetVal: 1.297 ± 0.298
0.288MetTrp: 0.288 ± 0.116
0.937MetTyr: 0.937 ± 0.244
0.0MetXaa: 0.0 ± 0.0
Asn
4.107AsnAla: 4.107 ± 0.624
0.144AsnCys: 0.144 ± 0.106
4.035AsnAsp: 4.035 ± 0.465
4.827AsnGlu: 4.827 ± 0.627
2.017AsnPhe: 2.017 ± 0.269
4.467AsnGly: 4.467 ± 0.582
0.937AsnHis: 0.937 ± 0.312
3.891AsnIle: 3.891 ± 0.472
6.917AsnLys: 6.917 ± 0.803
5.332AsnLeu: 5.332 ± 0.609
0.865AsnMet: 0.865 ± 0.235
3.675AsnAsn: 3.675 ± 0.648
2.378AsnPro: 2.378 ± 0.315
2.45AsnGln: 2.45 ± 0.453
2.522AsnArg: 2.522 ± 0.433
4.179AsnSer: 4.179 ± 0.557
4.035AsnThr: 4.035 ± 0.422
3.17AsnVal: 3.17 ± 0.524
1.009AsnTrp: 1.009 ± 0.324
2.234AsnTyr: 2.234 ± 0.423
0.0AsnXaa: 0.0 ± 0.0
Pro
1.081ProAla: 1.081 ± 0.233
0.36ProCys: 0.36 ± 0.167
1.369ProAsp: 1.369 ± 0.325
2.234ProGlu: 2.234 ± 0.434
1.297ProPhe: 1.297 ± 0.326
1.873ProGly: 1.873 ± 0.436
0.432ProHis: 0.432 ± 0.165
1.873ProIle: 1.873 ± 0.347
2.378ProLys: 2.378 ± 0.501
2.234ProLeu: 2.234 ± 0.424
0.648ProMet: 0.648 ± 0.234
2.162ProAsn: 2.162 ± 0.414
0.721ProPro: 0.721 ± 0.272
1.153ProGln: 1.153 ± 0.284
1.225ProArg: 1.225 ± 0.266
2.234ProSer: 2.234 ± 0.359
1.657ProThr: 1.657 ± 0.398
1.513ProVal: 1.513 ± 0.412
0.288ProTrp: 0.288 ± 0.133
1.081ProTyr: 1.081 ± 0.297
0.0ProXaa: 0.0 ± 0.0
Gln
2.594GlnAla: 2.594 ± 0.422
0.144GlnCys: 0.144 ± 0.094
2.162GlnAsp: 2.162 ± 0.417
2.306GlnGlu: 2.306 ± 0.429
1.585GlnPhe: 1.585 ± 0.304
2.089GlnGly: 2.089 ± 0.432
0.721GlnHis: 0.721 ± 0.189
3.026GlnIle: 3.026 ± 0.529
3.098GlnLys: 3.098 ± 0.416
3.242GlnLeu: 3.242 ± 0.488
1.009GlnMet: 1.009 ± 0.306
2.017GlnAsn: 2.017 ± 0.366
1.369GlnPro: 1.369 ± 0.386
1.225GlnGln: 1.225 ± 0.357
1.873GlnArg: 1.873 ± 0.378
2.089GlnSer: 2.089 ± 0.332
1.297GlnThr: 1.297 ± 0.432
2.162GlnVal: 2.162 ± 0.344
0.432GlnTrp: 0.432 ± 0.196
1.657GlnTyr: 1.657 ± 0.31
0.0GlnXaa: 0.0 ± 0.0
Arg
2.522ArgAla: 2.522 ± 0.458
0.144ArgCys: 0.144 ± 0.112
2.522ArgAsp: 2.522 ± 0.319
2.45ArgGlu: 2.45 ± 0.428
2.306ArgPhe: 2.306 ± 0.42
2.378ArgGly: 2.378 ± 0.399
0.648ArgHis: 0.648 ± 0.211
3.314ArgIle: 3.314 ± 0.504
3.747ArgLys: 3.747 ± 0.481
4.035ArgLeu: 4.035 ± 0.737
0.937ArgMet: 0.937 ± 0.237
3.098ArgAsn: 3.098 ± 0.424
0.793ArgPro: 0.793 ± 0.326
1.513ArgGln: 1.513 ± 0.287
1.657ArgArg: 1.657 ± 0.382
2.162ArgSer: 2.162 ± 0.389
2.306ArgThr: 2.306 ± 0.468
1.945ArgVal: 1.945 ± 0.345
0.216ArgTrp: 0.216 ± 0.122
2.089ArgTyr: 2.089 ± 0.424
0.0ArgXaa: 0.0 ± 0.0
Ser
3.819SerAla: 3.819 ± 0.75
0.432SerCys: 0.432 ± 0.175
4.972SerAsp: 4.972 ± 0.562
4.251SerGlu: 4.251 ± 0.572
2.45SerPhe: 2.45 ± 0.511
4.611SerGly: 4.611 ± 0.929
0.648SerHis: 0.648 ± 0.19
3.963SerIle: 3.963 ± 0.527
6.485SerLys: 6.485 ± 1.336
4.179SerLeu: 4.179 ± 0.485
2.234SerMet: 2.234 ± 0.388
4.899SerAsn: 4.899 ± 0.573
2.017SerPro: 2.017 ± 0.444
2.45SerGln: 2.45 ± 0.374
2.594SerArg: 2.594 ± 0.502
4.179SerSer: 4.179 ± 0.739
3.458SerThr: 3.458 ± 0.59
3.963SerVal: 3.963 ± 0.59
0.937SerTrp: 0.937 ± 0.239
2.089SerTyr: 2.089 ± 0.347
0.0SerXaa: 0.0 ± 0.0
Thr
3.531ThrAla: 3.531 ± 0.511
0.288ThrCys: 0.288 ± 0.15
3.675ThrAsp: 3.675 ± 0.525
4.179ThrGlu: 4.179 ± 0.538
2.81ThrPhe: 2.81 ± 0.383
4.395ThrGly: 4.395 ± 0.54
1.513ThrHis: 1.513 ± 0.349
3.963ThrIle: 3.963 ± 0.62
5.548ThrLys: 5.548 ± 0.72
3.963ThrLeu: 3.963 ± 0.448
1.009ThrMet: 1.009 ± 0.289
2.738ThrAsn: 2.738 ± 0.396
2.162ThrPro: 2.162 ± 0.313
1.513ThrGln: 1.513 ± 0.25
1.801ThrArg: 1.801 ± 0.266
3.458ThrSer: 3.458 ± 0.552
2.81ThrThr: 2.81 ± 0.564
4.395ThrVal: 4.395 ± 0.53
0.721ThrTrp: 0.721 ± 0.27
2.017ThrTyr: 2.017 ± 0.465
0.0ThrXaa: 0.0 ± 0.0
Val
3.242ValAla: 3.242 ± 0.487
0.216ValCys: 0.216 ± 0.12
3.891ValAsp: 3.891 ± 0.67
5.404ValGlu: 5.404 ± 0.695
2.882ValPhe: 2.882 ± 0.539
3.747ValGly: 3.747 ± 0.544
1.153ValHis: 1.153 ± 0.324
3.891ValIle: 3.891 ± 0.542
4.611ValLys: 4.611 ± 0.638
4.107ValLeu: 4.107 ± 0.536
1.225ValMet: 1.225 ± 0.289
3.531ValAsn: 3.531 ± 0.472
1.225ValPro: 1.225 ± 0.357
2.162ValGln: 2.162 ± 0.34
2.594ValArg: 2.594 ± 0.4
4.395ValSer: 4.395 ± 0.564
2.882ValThr: 2.882 ± 0.576
2.954ValVal: 2.954 ± 0.495
0.648ValTrp: 0.648 ± 0.232
2.162ValTyr: 2.162 ± 0.488
0.0ValXaa: 0.0 ± 0.0
Trp
0.721TrpAla: 0.721 ± 0.209
0.0TrpCys: 0.0 ± 0.0
0.432TrpAsp: 0.432 ± 0.207
0.937TrpGlu: 0.937 ± 0.321
1.225TrpPhe: 1.225 ± 0.331
0.432TrpGly: 0.432 ± 0.19
0.072TrpHis: 0.072 ± 0.068
0.576TrpIle: 0.576 ± 0.236
0.865TrpLys: 0.865 ± 0.283
1.297TrpLeu: 1.297 ± 0.26
0.432TrpMet: 0.432 ± 0.204
0.937TrpAsn: 0.937 ± 0.263
0.288TrpPro: 0.288 ± 0.228
0.721TrpGln: 0.721 ± 0.207
0.504TrpArg: 0.504 ± 0.181
1.153TrpSer: 1.153 ± 0.435
0.576TrpThr: 0.576 ± 0.191
0.937TrpVal: 0.937 ± 0.244
0.144TrpTrp: 0.144 ± 0.123
0.721TrpTyr: 0.721 ± 0.262
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.306TyrAla: 2.306 ± 0.304
0.36TyrCys: 0.36 ± 0.132
2.738TyrAsp: 2.738 ± 0.481
2.882TyrGlu: 2.882 ± 0.568
1.513TyrPhe: 1.513 ± 0.378
2.522TyrGly: 2.522 ± 0.451
1.009TyrHis: 1.009 ± 0.307
2.882TyrIle: 2.882 ± 0.593
3.098TyrLys: 3.098 ± 0.511
3.675TyrLeu: 3.675 ± 0.564
1.729TyrMet: 1.729 ± 0.3
2.522TyrAsn: 2.522 ± 0.488
1.081TyrPro: 1.081 ± 0.256
2.162TyrGln: 2.162 ± 0.363
1.801TyrArg: 1.801 ± 0.475
3.242TyrSer: 3.242 ± 0.445
2.45TyrThr: 2.45 ± 0.464
2.882TyrVal: 2.882 ± 0.5
0.432TyrTrp: 0.432 ± 0.147
1.657TyrTyr: 1.657 ± 0.52
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 62 proteins (13880 amino acids)
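For readers who want to work with the listing above programmatically, the sketch below parses one cell line, such as `6.124AlaLys: 6.124 ± 1.166`, into a tuple. It assumes every cell follows that exact pattern (displayed value, two three-letter codes, a colon, then value ± error). The names `CELL` and `parse_cell` are hypothetical.

```python
import re

# Displayed value, first and second residue, then "value ± error".
CELL = re.compile(
    r"^(?P<shown>\d+(?:\.\d+)?)"
    r"(?P<first>[A-Z][a-z]{2})(?P<second>[A-Z][a-z]{2}): "
    r"(?P<value>\d+(?:\.\d+)?) ± (?P<error>\d+(?:\.\d+)?)$"
)


def parse_cell(line):
    """Return (first, second, value, error) or None if the line is not a cell."""
    m = CELL.match(line.strip())
    if m is None:
        return None  # row labels such as "Ala" and other text fall through here
    return (m["first"], m["second"], float(m["value"]), float(m["error"]))


cell = parse_cell("6.124AlaLys: 6.124 ± 1.166")
```

Row labels and prose lines return `None`, so the function can be mapped over the whole listing and the `None` results filtered out.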

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
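A protein-level bootstrap of this kind can be sketched as below. This is an illustration under stated assumptions, not the database's actual procedure: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the error is taken as the standard deviation across replicates (the page does not say which spread measure is used). The names `pair_per_mille` and `bootstrap_error` are hypothetical.

```python
import random
import statistics


def pair_per_mille(sequences, pair):
    """Frequency of one dipeptide in per mille over overlapping windows."""
    total = 0
    hits = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy proteome, not data from this phage.
toy = ["ACACK", "KKAC", "ACDE", "DEKA"]
error = bootstrap_error(toy, "AC")
```

A dipeptide that never occurs has a frequency of 0 in every replicate, so its error is also 0, which is consistent with the `0.0 ± 0.0` entries in the Xaa rows and columns.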

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski