Amino acid dipepetide frequency for Staphylococcus phage StauST398-1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.455AlaAla: 0.455 ± 0.197
0.379AlaCys: 0.379 ± 0.149
2.576AlaAsp: 2.576 ± 0.424
3.486AlaGlu: 3.486 ± 0.478
3.031AlaPhe: 3.031 ± 0.535
4.243AlaGly: 4.243 ± 0.814
1.061AlaHis: 1.061 ± 0.274
5.001AlaIle: 5.001 ± 0.642
6.517AlaLys: 6.517 ± 0.868
5.304AlaLeu: 5.304 ± 0.782
1.667AlaMet: 1.667 ± 0.493
3.94AlaAsn: 3.94 ± 0.394
1.894AlaPro: 1.894 ± 0.403
2.501AlaGln: 2.501 ± 0.491
2.576AlaArg: 2.576 ± 0.386
4.319AlaSer: 4.319 ± 0.788
3.41AlaThr: 3.41 ± 0.453
3.713AlaVal: 3.713 ± 0.646
0.834AlaTrp: 0.834 ± 0.289
2.349AlaTyr: 2.349 ± 0.437
0.0AlaXaa: 0.0 ± 0.0
Cys
0.152CysAla: 0.152 ± 0.107
0.076CysCys: 0.076 ± 0.069
0.076CysAsp: 0.076 ± 0.079
0.303CysGlu: 0.303 ± 0.187
0.303CysPhe: 0.303 ± 0.157
0.303CysGly: 0.303 ± 0.159
0.076CysHis: 0.076 ± 0.067
0.303CysIle: 0.303 ± 0.196
0.455CysLys: 0.455 ± 0.18
0.227CysLeu: 0.227 ± 0.124
0.076CysMet: 0.076 ± 0.078
0.227CysAsn: 0.227 ± 0.13
0.152CysPro: 0.152 ± 0.097
0.076CysGln: 0.076 ± 0.065
0.606CysArg: 0.606 ± 0.247
0.455CysSer: 0.455 ± 0.203
0.076CysThr: 0.076 ± 0.077
0.53CysVal: 0.53 ± 0.231
0.152CysTrp: 0.152 ± 0.1
0.227CysTyr: 0.227 ± 0.116
0.0CysXaa: 0.0 ± 0.0
Asp
4.925AspAla: 4.925 ± 0.6
0.152AspCys: 0.152 ± 0.102
4.546AspAsp: 4.546 ± 0.622
5.001AspGlu: 5.001 ± 0.64
3.637AspPhe: 3.637 ± 0.654
4.698AspGly: 4.698 ± 0.595
0.379AspHis: 0.379 ± 0.163
5.001AspIle: 5.001 ± 0.671
6.896AspLys: 6.896 ± 0.933
5.38AspLeu: 5.38 ± 0.66
1.288AspMet: 1.288 ± 0.307
3.41AspAsn: 3.41 ± 0.536
1.137AspPro: 1.137 ± 0.294
1.44AspGln: 1.44 ± 0.295
1.819AspArg: 1.819 ± 0.33
4.168AspSer: 4.168 ± 0.561
3.107AspThr: 3.107 ± 0.433
3.789AspVal: 3.789 ± 0.666
0.53AspTrp: 0.53 ± 0.211
2.728AspTyr: 2.728 ± 0.456
0.0AspXaa: 0.0 ± 0.0
Glu
4.092GluAla: 4.092 ± 0.568
0.455GluCys: 0.455 ± 0.178
4.622GluAsp: 4.622 ± 0.973
4.85GluGlu: 4.85 ± 0.948
2.273GluPhe: 2.273 ± 0.438
2.425GluGly: 2.425 ± 0.316
1.591GluHis: 1.591 ± 0.385
5.304GluIle: 5.304 ± 0.701
5.456GluLys: 5.456 ± 0.737
7.805GluLeu: 7.805 ± 0.888
1.97GluMet: 1.97 ± 0.406
4.774GluAsn: 4.774 ± 0.575
1.667GluPro: 1.667 ± 0.267
3.334GluGln: 3.334 ± 0.508
3.258GluArg: 3.258 ± 0.54
3.713GluSer: 3.713 ± 0.517
3.183GluThr: 3.183 ± 0.394
5.228GluVal: 5.228 ± 0.594
1.212GluTrp: 1.212 ± 0.289
3.789GluTyr: 3.789 ± 0.719
0.0GluXaa: 0.0 ± 0.0
Phe
1.743PheAla: 1.743 ± 0.35
0.455PheCys: 0.455 ± 0.18
3.637PheAsp: 3.637 ± 0.49
2.728PheGlu: 2.728 ± 0.492
0.985PhePhe: 0.985 ± 0.282
2.425PheGly: 2.425 ± 0.735
0.758PheHis: 0.758 ± 0.232
4.243PheIle: 4.243 ± 0.494
4.546PheLys: 4.546 ± 0.563
3.41PheLeu: 3.41 ± 0.515
0.909PheMet: 0.909 ± 0.219
2.728PheAsn: 2.728 ± 0.334
0.758PhePro: 0.758 ± 0.328
0.834PheGln: 0.834 ± 0.248
1.44PheArg: 1.44 ± 0.277
2.349PheSer: 2.349 ± 0.405
3.486PheThr: 3.486 ± 0.572
3.183PheVal: 3.183 ± 0.537
0.379PheTrp: 0.379 ± 0.222
1.894PheTyr: 1.894 ± 0.434
0.0PheXaa: 0.0 ± 0.0
Gly
4.622GlyAla: 4.622 ± 0.737
0.455GlyCys: 0.455 ± 0.174
3.789GlyAsp: 3.789 ± 0.699
2.652GlyGlu: 2.652 ± 0.466
3.107GlyPhe: 3.107 ± 0.484
2.652GlyGly: 2.652 ± 0.584
1.667GlyHis: 1.667 ± 0.359
4.471GlyIle: 4.471 ± 0.489
5.456GlyLys: 5.456 ± 0.542
3.865GlyLeu: 3.865 ± 0.73
1.44GlyMet: 1.44 ± 0.307
3.107GlyAsn: 3.107 ± 0.563
0.53GlyPro: 0.53 ± 0.223
2.955GlyGln: 2.955 ± 0.362
2.046GlyArg: 2.046 ± 0.38
3.183GlySer: 3.183 ± 0.48
4.319GlyThr: 4.319 ± 0.583
4.698GlyVal: 4.698 ± 0.921
0.758GlyTrp: 0.758 ± 0.236
2.349GlyTyr: 2.349 ± 0.39
0.0GlyXaa: 0.0 ± 0.0
His
1.44HisAla: 1.44 ± 0.332
0.152HisCys: 0.152 ± 0.114
0.53HisAsp: 0.53 ± 0.199
1.212HisGlu: 1.212 ± 0.35
1.061HisPhe: 1.061 ± 0.289
1.061HisGly: 1.061 ± 0.248
0.379HisHis: 0.379 ± 0.203
1.515HisIle: 1.515 ± 0.375
0.985HisLys: 0.985 ± 0.255
1.212HisLeu: 1.212 ± 0.253
0.379HisMet: 0.379 ± 0.204
1.515HisAsn: 1.515 ± 0.322
0.909HisPro: 0.909 ± 0.257
0.682HisGln: 0.682 ± 0.216
0.53HisArg: 0.53 ± 0.23
0.985HisSer: 0.985 ± 0.263
1.591HisThr: 1.591 ± 0.312
1.212HisVal: 1.212 ± 0.313
0.076HisTrp: 0.076 ± 0.077
1.061HisTyr: 1.061 ± 0.371
0.0HisXaa: 0.0 ± 0.0
Ile
4.471IleAla: 4.471 ± 0.627
0.303IleCys: 0.303 ± 0.159
6.214IleAsp: 6.214 ± 0.712
6.517IleGlu: 6.517 ± 0.799
3.334IlePhe: 3.334 ± 0.513
4.925IleGly: 4.925 ± 0.839
0.985IleHis: 0.985 ± 0.328
4.622IleIle: 4.622 ± 0.758
8.259IleLys: 8.259 ± 0.87
4.319IleLeu: 4.319 ± 0.605
1.894IleMet: 1.894 ± 0.389
4.925IleAsn: 4.925 ± 0.791
3.031IlePro: 3.031 ± 0.377
3.486IleGln: 3.486 ± 0.46
3.183IleArg: 3.183 ± 0.621
5.001IleSer: 5.001 ± 0.717
5.001IleThr: 5.001 ± 0.555
4.471IleVal: 4.471 ± 0.538
0.758IleTrp: 0.758 ± 0.299
2.879IleTyr: 2.879 ± 0.547
0.0IleXaa: 0.0 ± 0.0
Lys
5.38LysAla: 5.38 ± 0.538
0.379LysCys: 0.379 ± 0.183
6.062LysAsp: 6.062 ± 0.744
8.335LysGlu: 8.335 ± 1.016
3.107LysPhe: 3.107 ± 0.444
5.456LysGly: 5.456 ± 0.59
2.273LysHis: 2.273 ± 0.459
6.896LysIle: 6.896 ± 0.906
7.805LysLys: 7.805 ± 0.773
7.577LysLeu: 7.577 ± 0.957
2.349LysMet: 2.349 ± 0.355
5.153LysAsn: 5.153 ± 0.677
2.576LysPro: 2.576 ± 0.443
4.243LysGln: 4.243 ± 0.546
4.85LysArg: 4.85 ± 0.613
5.153LysSer: 5.153 ± 0.653
6.214LysThr: 6.214 ± 0.609
5.456LysVal: 5.456 ± 0.69
0.834LysTrp: 0.834 ± 0.253
4.319LysTyr: 4.319 ± 0.644
0.0LysXaa: 0.0 ± 0.0
Leu
4.471LeuAla: 4.471 ± 0.716
0.379LeuCys: 0.379 ± 0.17
5.153LeuAsp: 5.153 ± 0.477
6.062LeuGlu: 6.062 ± 1.01
3.789LeuPhe: 3.789 ± 0.513
3.486LeuGly: 3.486 ± 0.492
1.515LeuHis: 1.515 ± 0.386
5.986LeuIle: 5.986 ± 0.691
7.274LeuLys: 7.274 ± 0.832
5.153LeuLeu: 5.153 ± 0.84
2.122LeuMet: 2.122 ± 0.523
5.607LeuAsn: 5.607 ± 0.641
2.046LeuPro: 2.046 ± 0.387
3.107LeuGln: 3.107 ± 0.49
2.349LeuArg: 2.349 ± 0.392
4.85LeuSer: 4.85 ± 0.479
5.077LeuThr: 5.077 ± 0.728
3.865LeuVal: 3.865 ± 0.523
0.455LeuTrp: 0.455 ± 0.223
4.243LeuTyr: 4.243 ± 0.752
0.0LeuXaa: 0.0 ± 0.0
Met
1.44MetAla: 1.44 ± 0.485
0.076MetCys: 0.076 ± 0.077
1.212MetAsp: 1.212 ± 0.295
1.212MetGlu: 1.212 ± 0.314
1.137MetPhe: 1.137 ± 0.321
0.985MetGly: 0.985 ± 0.289
0.379MetHis: 0.379 ± 0.144
1.819MetIle: 1.819 ± 0.307
2.576MetLys: 2.576 ± 0.42
1.667MetLeu: 1.667 ± 0.29
0.53MetMet: 0.53 ± 0.245
1.819MetAsn: 1.819 ± 0.334
1.137MetPro: 1.137 ± 0.254
1.743MetGln: 1.743 ± 0.403
0.455MetArg: 0.455 ± 0.175
2.122MetSer: 2.122 ± 0.515
1.44MetThr: 1.44 ± 0.33
1.061MetVal: 1.061 ± 0.224
0.455MetTrp: 0.455 ± 0.182
1.212MetTyr: 1.212 ± 0.304
0.0MetXaa: 0.0 ± 0.0
Asn
4.016AsnAla: 4.016 ± 0.664
0.227AsnCys: 0.227 ± 0.133
4.471AsnAsp: 4.471 ± 0.642
5.228AsnGlu: 5.228 ± 0.806
2.804AsnPhe: 2.804 ± 0.532
4.925AsnGly: 4.925 ± 0.758
0.909AsnHis: 0.909 ± 0.29
5.001AsnIle: 5.001 ± 0.76
4.85AsnLys: 4.85 ± 0.698
4.622AsnLeu: 4.622 ± 0.65
1.288AsnMet: 1.288 ± 0.323
5.077AsnAsn: 5.077 ± 0.887
2.652AsnPro: 2.652 ± 0.394
2.728AsnGln: 2.728 ± 0.49
2.197AsnArg: 2.197 ± 0.371
4.168AsnSer: 4.168 ± 0.475
3.258AsnThr: 3.258 ± 0.411
3.107AsnVal: 3.107 ± 0.613
1.212AsnTrp: 1.212 ± 0.245
2.652AsnTyr: 2.652 ± 0.446
0.0AsnXaa: 0.0 ± 0.0
Pro
1.44ProAla: 1.44 ± 0.325
0.076ProCys: 0.076 ± 0.085
1.44ProAsp: 1.44 ± 0.315
1.667ProGlu: 1.667 ± 0.281
1.288ProPhe: 1.288 ± 0.307
1.819ProGly: 1.819 ± 0.453
0.379ProHis: 0.379 ± 0.161
1.894ProIle: 1.894 ± 0.467
2.804ProLys: 2.804 ± 0.527
1.364ProLeu: 1.364 ± 0.278
0.909ProMet: 0.909 ± 0.281
1.667ProAsn: 1.667 ± 0.447
0.53ProPro: 0.53 ± 0.166
0.909ProGln: 0.909 ± 0.264
0.985ProArg: 0.985 ± 0.239
2.046ProSer: 2.046 ± 0.371
2.046ProThr: 2.046 ± 0.353
1.819ProVal: 1.819 ± 0.315
0.076ProTrp: 0.076 ± 0.076
1.212ProTyr: 1.212 ± 0.272
0.0ProXaa: 0.0 ± 0.0
Gln
3.486GlnAla: 3.486 ± 0.431
0.227GlnCys: 0.227 ± 0.133
2.046GlnAsp: 2.046 ± 0.454
2.804GlnGlu: 2.804 ± 0.43
1.97GlnPhe: 1.97 ± 0.42
2.501GlnGly: 2.501 ± 0.428
0.834GlnHis: 0.834 ± 0.183
2.501GlnIle: 2.501 ± 0.373
3.865GlnLys: 3.865 ± 0.588
2.273GlnLeu: 2.273 ± 0.486
1.212GlnMet: 1.212 ± 0.349
2.955GlnAsn: 2.955 ± 0.434
1.212GlnPro: 1.212 ± 0.242
1.667GlnGln: 1.667 ± 0.41
2.349GlnArg: 2.349 ± 0.393
1.743GlnSer: 1.743 ± 0.361
1.44GlnThr: 1.44 ± 0.363
2.728GlnVal: 2.728 ± 0.537
0.076GlnTrp: 0.076 ± 0.073
1.743GlnTyr: 1.743 ± 0.43
0.0GlnXaa: 0.0 ± 0.0
Arg
1.364ArgAla: 1.364 ± 0.319
0.379ArgCys: 0.379 ± 0.166
3.183ArgAsp: 3.183 ± 0.55
3.183ArgGlu: 3.183 ± 0.436
1.743ArgPhe: 1.743 ± 0.402
1.819ArgGly: 1.819 ± 0.353
1.288ArgHis: 1.288 ± 0.323
2.879ArgIle: 2.879 ± 0.47
3.789ArgLys: 3.789 ± 0.546
3.183ArgLeu: 3.183 ± 0.577
1.061ArgMet: 1.061 ± 0.294
2.425ArgAsn: 2.425 ± 0.405
0.834ArgPro: 0.834 ± 0.255
2.197ArgGln: 2.197 ± 0.464
1.212ArgArg: 1.212 ± 0.263
1.819ArgSer: 1.819 ± 0.395
1.819ArgThr: 1.819 ± 0.404
2.349ArgVal: 2.349 ± 0.449
0.379ArgTrp: 0.379 ± 0.135
1.97ArgTyr: 1.97 ± 0.407
0.0ArgXaa: 0.0 ± 0.0
Ser
4.243SerAla: 4.243 ± 0.589
0.0SerCys: 0.0 ± 0.0
3.789SerAsp: 3.789 ± 0.549
3.789SerGlu: 3.789 ± 0.5
2.879SerPhe: 2.879 ± 0.478
4.471SerGly: 4.471 ± 0.609
0.985SerHis: 0.985 ± 0.245
5.835SerIle: 5.835 ± 0.736
6.365SerLys: 6.365 ± 0.627
4.016SerLeu: 4.016 ± 0.436
2.122SerMet: 2.122 ± 0.336
4.319SerAsn: 4.319 ± 0.632
1.288SerPro: 1.288 ± 0.383
2.046SerGln: 2.046 ± 0.535
1.591SerArg: 1.591 ± 0.268
3.713SerSer: 3.713 ± 0.63
3.41SerThr: 3.41 ± 0.452
3.486SerVal: 3.486 ± 0.562
0.606SerTrp: 0.606 ± 0.191
1.819SerTyr: 1.819 ± 0.292
0.0SerXaa: 0.0 ± 0.0
Thr
3.94ThrAla: 3.94 ± 0.599
0.152ThrCys: 0.152 ± 0.158
3.713ThrAsp: 3.713 ± 0.437
3.41ThrGlu: 3.41 ± 0.376
1.97ThrPhe: 1.97 ± 0.445
3.713ThrGly: 3.713 ± 0.494
1.667ThrHis: 1.667 ± 0.328
5.001ThrIle: 5.001 ± 0.687
4.698ThrLys: 4.698 ± 0.706
5.683ThrLeu: 5.683 ± 0.633
0.606ThrMet: 0.606 ± 0.186
4.774ThrAsn: 4.774 ± 0.633
1.591ThrPro: 1.591 ± 0.382
1.97ThrGln: 1.97 ± 0.428
2.349ThrArg: 2.349 ± 0.378
4.546ThrSer: 4.546 ± 0.674
3.94ThrThr: 3.94 ± 0.521
3.258ThrVal: 3.258 ± 0.494
0.834ThrTrp: 0.834 ± 0.3
2.197ThrTyr: 2.197 ± 0.424
0.0ThrXaa: 0.0 ± 0.0
Val
5.153ValAla: 5.153 ± 0.923
0.152ValCys: 0.152 ± 0.095
4.395ValAsp: 4.395 ± 0.76
4.243ValGlu: 4.243 ± 0.646
2.349ValPhe: 2.349 ± 0.43
3.41ValGly: 3.41 ± 0.669
0.682ValHis: 0.682 ± 0.256
5.456ValIle: 5.456 ± 0.644
6.896ValLys: 6.896 ± 0.596
5.001ValLeu: 5.001 ± 0.663
1.515ValMet: 1.515 ± 0.41
3.183ValAsn: 3.183 ± 0.432
1.364ValPro: 1.364 ± 0.285
1.137ValGln: 1.137 ± 0.313
2.349ValArg: 2.349 ± 0.354
3.107ValSer: 3.107 ± 0.634
3.94ValThr: 3.94 ± 0.628
3.258ValVal: 3.258 ± 0.404
0.53ValTrp: 0.53 ± 0.177
2.576ValTyr: 2.576 ± 0.482
0.0ValXaa: 0.0 ± 0.0
Trp
0.606TrpAla: 0.606 ± 0.196
0.076TrpCys: 0.076 ± 0.065
0.379TrpAsp: 0.379 ± 0.168
0.758TrpGlu: 0.758 ± 0.229
0.606TrpPhe: 0.606 ± 0.177
0.682TrpGly: 0.682 ± 0.21
0.152TrpHis: 0.152 ± 0.099
1.061TrpIle: 1.061 ± 0.303
0.834TrpLys: 0.834 ± 0.268
1.288TrpLeu: 1.288 ± 0.266
0.227TrpMet: 0.227 ± 0.135
0.834TrpAsn: 0.834 ± 0.265
0.0TrpPro: 0.0 ± 0.0
0.834TrpGln: 0.834 ± 0.239
0.379TrpArg: 0.379 ± 0.159
0.682TrpSer: 0.682 ± 0.251
0.834TrpThr: 0.834 ± 0.223
0.834TrpVal: 0.834 ± 0.256
0.227TrpTrp: 0.227 ± 0.119
0.303TrpTyr: 0.303 ± 0.18
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.819TyrAla: 1.819 ± 0.393
0.303TyrCys: 0.303 ± 0.155
2.197TyrAsp: 2.197 ± 0.456
3.561TyrGlu: 3.561 ± 0.499
1.515TyrPhe: 1.515 ± 0.417
2.122TyrGly: 2.122 ± 0.422
0.53TyrHis: 0.53 ± 0.224
3.789TyrIle: 3.789 ± 0.525
4.092TyrLys: 4.092 ± 0.6
3.713TyrLeu: 3.713 ± 0.567
0.834TyrMet: 0.834 ± 0.261
2.955TyrAsn: 2.955 ± 0.492
1.061TyrPro: 1.061 ± 0.322
1.743TyrGln: 1.743 ± 0.301
2.349TyrArg: 2.349 ± 0.458
2.652TyrSer: 2.652 ± 0.519
2.425TyrThr: 2.425 ± 0.484
2.652TyrVal: 2.652 ± 0.48
1.212TyrTrp: 1.212 ± 0.341
2.046TyrTyr: 2.046 ± 0.399
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 69 proteins (13198 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski