Amino acid dipepetide frequency for Staphylococcus phage SpT99F3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.004AlaAla: 3.004 ± 1.202
0.081AlaCys: 0.081 ± 0.08
3.248AlaAsp: 3.248 ± 0.531
3.004AlaGlu: 3.004 ± 0.427
1.867AlaPhe: 1.867 ± 0.316
3.248AlaGly: 3.248 ± 0.662
0.812AlaHis: 0.812 ± 0.229
4.466AlaIle: 4.466 ± 0.745
4.872AlaLys: 4.872 ± 0.817
5.521AlaLeu: 5.521 ± 0.729
1.543AlaMet: 1.543 ± 0.326
4.06AlaAsn: 4.06 ± 0.562
0.812AlaPro: 0.812 ± 0.248
1.867AlaGln: 1.867 ± 0.39
2.761AlaArg: 2.761 ± 0.404
3.735AlaSer: 3.735 ± 0.99
3.654AlaThr: 3.654 ± 0.575
3.816AlaVal: 3.816 ± 0.553
0.325AlaTrp: 0.325 ± 0.164
1.867AlaTyr: 1.867 ± 0.417
0.0AlaXaa: 0.0 ± 0.0
Cys
0.244CysAla: 0.244 ± 0.151
0.0CysCys: 0.0 ± 0.0
0.162CysAsp: 0.162 ± 0.102
0.406CysGlu: 0.406 ± 0.26
0.0CysPhe: 0.0 ± 0.0
0.487CysGly: 0.487 ± 0.24
0.325CysHis: 0.325 ± 0.165
0.325CysIle: 0.325 ± 0.194
0.325CysLys: 0.325 ± 0.163
0.568CysLeu: 0.568 ± 0.263
0.081CysMet: 0.081 ± 0.101
0.487CysAsn: 0.487 ± 0.267
0.081CysPro: 0.081 ± 0.083
0.0CysGln: 0.0 ± 0.0
0.162CysArg: 0.162 ± 0.133
0.65CysSer: 0.65 ± 0.262
0.162CysThr: 0.162 ± 0.093
0.487CysVal: 0.487 ± 0.18
0.0CysTrp: 0.0 ± 0.0
0.487CysTyr: 0.487 ± 0.227
0.0CysXaa: 0.0 ± 0.0
Asp
3.004AspAla: 3.004 ± 0.596
0.081AspCys: 0.081 ± 0.076
4.303AspAsp: 4.303 ± 0.677
6.09AspGlu: 6.09 ± 0.829
3.004AspPhe: 3.004 ± 0.566
4.141AspGly: 4.141 ± 0.61
0.65AspHis: 0.65 ± 0.227
4.791AspIle: 4.791 ± 0.785
5.765AspLys: 5.765 ± 0.651
4.872AspLeu: 4.872 ± 0.782
1.462AspMet: 1.462 ± 0.319
4.222AspAsn: 4.222 ± 0.571
1.705AspPro: 1.705 ± 0.386
1.056AspGln: 1.056 ± 0.298
2.517AspArg: 2.517 ± 0.527
2.679AspSer: 2.679 ± 0.547
3.573AspThr: 3.573 ± 0.55
5.684AspVal: 5.684 ± 0.626
1.218AspTrp: 1.218 ± 0.322
3.979AspTyr: 3.979 ± 0.548
0.0AspXaa: 0.0 ± 0.0
Glu
3.248GluAla: 3.248 ± 0.492
0.812GluCys: 0.812 ± 0.308
3.004GluAsp: 3.004 ± 0.411
5.927GluGlu: 5.927 ± 1.009
3.004GluPhe: 3.004 ± 0.471
2.355GluGly: 2.355 ± 0.421
1.299GluHis: 1.299 ± 0.38
5.034GluIle: 5.034 ± 0.909
5.602GluLys: 5.602 ± 0.999
7.47GluLeu: 7.47 ± 0.967
2.355GluMet: 2.355 ± 0.419
4.791GluAsn: 4.791 ± 0.88
1.624GluPro: 1.624 ± 0.395
3.248GluGln: 3.248 ± 0.675
3.573GluArg: 3.573 ± 0.63
4.385GluSer: 4.385 ± 0.748
3.735GluThr: 3.735 ± 0.538
5.927GluVal: 5.927 ± 1.018
1.462GluTrp: 1.462 ± 0.287
3.654GluTyr: 3.654 ± 0.77
0.0GluXaa: 0.0 ± 0.0
Phe
1.705PheAla: 1.705 ± 0.417
0.244PheCys: 0.244 ± 0.143
3.248PheAsp: 3.248 ± 0.502
3.248PheGlu: 3.248 ± 0.845
1.299PhePhe: 1.299 ± 0.334
3.248PheGly: 3.248 ± 0.433
0.406PheHis: 0.406 ± 0.162
3.004PheIle: 3.004 ± 0.518
4.222PheLys: 4.222 ± 0.518
2.436PheLeu: 2.436 ± 0.555
1.543PheMet: 1.543 ± 0.353
3.004PheAsn: 3.004 ± 0.575
0.65PhePro: 0.65 ± 0.202
0.893PheGln: 0.893 ± 0.252
1.705PheArg: 1.705 ± 0.369
2.517PheSer: 2.517 ± 0.558
2.842PheThr: 2.842 ± 0.451
2.355PheVal: 2.355 ± 0.403
0.325PheTrp: 0.325 ± 0.234
1.543PheTyr: 1.543 ± 0.384
0.0PheXaa: 0.0 ± 0.0
Gly
2.679GlyAla: 2.679 ± 0.895
0.081GlyCys: 0.081 ± 0.076
3.329GlyAsp: 3.329 ± 0.544
3.897GlyGlu: 3.897 ± 0.584
2.761GlyPhe: 2.761 ± 0.394
3.491GlyGly: 3.491 ± 0.953
0.974GlyHis: 0.974 ± 0.296
3.654GlyIle: 3.654 ± 0.764
4.547GlyLys: 4.547 ± 0.532
5.196GlyLeu: 5.196 ± 1.277
1.056GlyMet: 1.056 ± 0.3
3.248GlyAsn: 3.248 ± 0.458
0.893GlyPro: 0.893 ± 0.393
1.705GlyGln: 1.705 ± 0.343
2.923GlyArg: 2.923 ± 0.668
3.816GlySer: 3.816 ± 0.71
3.491GlyThr: 3.491 ± 0.525
3.329GlyVal: 3.329 ± 0.386
0.65GlyTrp: 0.65 ± 0.245
3.167GlyTyr: 3.167 ± 0.502
0.0GlyXaa: 0.0 ± 0.0
His
0.974HisAla: 0.974 ± 0.329
0.406HisCys: 0.406 ± 0.176
0.65HisAsp: 0.65 ± 0.208
0.893HisGlu: 0.893 ± 0.248
1.218HisPhe: 1.218 ± 0.34
0.893HisGly: 0.893 ± 0.228
0.406HisHis: 0.406 ± 0.173
1.624HisIle: 1.624 ± 0.404
1.218HisLys: 1.218 ± 0.264
1.462HisLeu: 1.462 ± 0.312
0.487HisMet: 0.487 ± 0.244
1.462HisAsn: 1.462 ± 0.403
0.406HisPro: 0.406 ± 0.213
0.65HisGln: 0.65 ± 0.248
0.731HisArg: 0.731 ± 0.257
0.568HisSer: 0.568 ± 0.221
0.812HisThr: 0.812 ± 0.255
0.812HisVal: 0.812 ± 0.326
0.081HisTrp: 0.081 ± 0.09
0.487HisTyr: 0.487 ± 0.226
0.0HisXaa: 0.0 ± 0.0
Ile
5.034IleAla: 5.034 ± 0.567
0.244IleCys: 0.244 ± 0.157
6.333IleAsp: 6.333 ± 0.766
4.709IleGlu: 4.709 ± 0.712
2.761IlePhe: 2.761 ± 0.505
4.466IleGly: 4.466 ± 1.072
1.38IleHis: 1.38 ± 0.295
4.628IleIle: 4.628 ± 0.711
7.145IleLys: 7.145 ± 0.654
4.466IleLeu: 4.466 ± 0.754
1.137IleMet: 1.137 ± 0.282
4.953IleAsn: 4.953 ± 0.589
2.03IlePro: 2.03 ± 0.37
2.273IleGln: 2.273 ± 0.493
3.248IleArg: 3.248 ± 0.686
5.196IleSer: 5.196 ± 0.714
4.222IleThr: 4.222 ± 0.622
4.222IleVal: 4.222 ± 0.641
0.731IleTrp: 0.731 ± 0.361
2.355IleTyr: 2.355 ± 0.654
0.0IleXaa: 0.0 ± 0.0
Lys
4.709LysAla: 4.709 ± 0.562
0.162LysCys: 0.162 ± 0.111
5.927LysAsp: 5.927 ± 0.653
7.226LysGlu: 7.226 ± 1.208
3.491LysPhe: 3.491 ± 0.599
5.359LysGly: 5.359 ± 0.725
1.705LysHis: 1.705 ± 0.414
5.359LysIle: 5.359 ± 0.67
6.252LysLys: 6.252 ± 1.121
7.714LysLeu: 7.714 ± 0.99
2.923LysMet: 2.923 ± 0.646
5.359LysAsn: 5.359 ± 0.656
2.517LysPro: 2.517 ± 0.504
4.547LysGln: 4.547 ± 0.855
4.872LysArg: 4.872 ± 0.544
5.359LysSer: 5.359 ± 0.754
5.359LysThr: 5.359 ± 0.51
5.521LysVal: 5.521 ± 0.616
0.731LysTrp: 0.731 ± 0.254
4.385LysTyr: 4.385 ± 0.654
0.0LysXaa: 0.0 ± 0.0
Leu
4.303LeuAla: 4.303 ± 0.854
0.568LeuCys: 0.568 ± 0.249
5.034LeuAsp: 5.034 ± 0.704
5.359LeuGlu: 5.359 ± 0.814
3.167LeuPhe: 3.167 ± 0.409
3.167LeuGly: 3.167 ± 0.881
1.38LeuHis: 1.38 ± 0.302
6.008LeuIle: 6.008 ± 0.945
9.094LeuLys: 9.094 ± 1.167
5.927LeuLeu: 5.927 ± 0.742
1.867LeuMet: 1.867 ± 0.419
5.602LeuAsn: 5.602 ± 0.63
2.679LeuPro: 2.679 ± 0.476
3.816LeuGln: 3.816 ± 0.505
3.654LeuArg: 3.654 ± 0.713
5.846LeuSer: 5.846 ± 0.71
4.466LeuThr: 4.466 ± 0.666
4.222LeuVal: 4.222 ± 0.583
0.731LeuTrp: 0.731 ± 0.281
2.679LeuTyr: 2.679 ± 0.438
0.0LeuXaa: 0.0 ± 0.0
Met
1.462MetAla: 1.462 ± 0.378
0.0MetCys: 0.0 ± 0.0
1.462MetAsp: 1.462 ± 0.293
1.786MetGlu: 1.786 ± 0.334
0.974MetPhe: 0.974 ± 0.288
0.812MetGly: 0.812 ± 0.384
0.487MetHis: 0.487 ± 0.178
1.462MetIle: 1.462 ± 0.395
2.03MetLys: 2.03 ± 0.362
2.355MetLeu: 2.355 ± 0.417
0.893MetMet: 0.893 ± 0.252
1.462MetAsn: 1.462 ± 0.365
1.462MetPro: 1.462 ± 0.29
1.299MetGln: 1.299 ± 0.422
1.867MetArg: 1.867 ± 0.518
2.03MetSer: 2.03 ± 0.372
2.111MetThr: 2.111 ± 0.42
0.974MetVal: 0.974 ± 0.318
0.325MetTrp: 0.325 ± 0.137
0.974MetTyr: 0.974 ± 0.299
0.0MetXaa: 0.0 ± 0.0
Asn
3.004AsnAla: 3.004 ± 0.499
0.325AsnCys: 0.325 ± 0.211
4.547AsnAsp: 4.547 ± 0.927
4.709AsnGlu: 4.709 ± 0.794
2.355AsnPhe: 2.355 ± 0.423
5.196AsnGly: 5.196 ± 0.754
1.299AsnHis: 1.299 ± 0.3
3.979AsnIle: 3.979 ± 0.515
6.496AsnLys: 6.496 ± 0.816
4.141AsnLeu: 4.141 ± 0.449
1.786AsnMet: 1.786 ± 0.473
5.196AsnAsn: 5.196 ± 0.802
2.436AsnPro: 2.436 ± 0.427
3.167AsnGln: 3.167 ± 0.569
3.004AsnArg: 3.004 ± 0.431
3.979AsnSer: 3.979 ± 0.548
3.654AsnThr: 3.654 ± 0.619
3.816AsnVal: 3.816 ± 0.496
1.056AsnTrp: 1.056 ± 0.298
2.761AsnTyr: 2.761 ± 0.509
0.0AsnXaa: 0.0 ± 0.0
Pro
1.218ProAla: 1.218 ± 0.397
0.0ProCys: 0.0 ± 0.0
1.218ProAsp: 1.218 ± 0.353
2.111ProGlu: 2.111 ± 0.656
1.299ProPhe: 1.299 ± 0.341
1.137ProGly: 1.137 ± 0.266
0.487ProHis: 0.487 ± 0.222
2.111ProIle: 2.111 ± 0.453
2.598ProLys: 2.598 ± 0.41
2.679ProLeu: 2.679 ± 0.49
0.65ProMet: 0.65 ± 0.169
0.893ProAsn: 0.893 ± 0.22
0.893ProPro: 0.893 ± 0.242
1.705ProGln: 1.705 ± 0.519
0.65ProArg: 0.65 ± 0.246
1.705ProSer: 1.705 ± 0.367
1.624ProThr: 1.624 ± 0.348
1.218ProVal: 1.218 ± 0.265
0.325ProTrp: 0.325 ± 0.198
1.056ProTyr: 1.056 ± 0.265
0.0ProXaa: 0.0 ± 0.0
Gln
4.06GlnAla: 4.06 ± 0.54
0.244GlnCys: 0.244 ± 0.166
2.03GlnAsp: 2.03 ± 0.446
3.004GlnGlu: 3.004 ± 0.588
1.299GlnPhe: 1.299 ± 0.262
1.218GlnGly: 1.218 ± 0.325
0.731GlnHis: 0.731 ± 0.251
2.192GlnIle: 2.192 ± 0.405
3.654GlnLys: 3.654 ± 0.581
2.761GlnLeu: 2.761 ± 0.448
0.974GlnMet: 0.974 ± 0.348
2.517GlnAsn: 2.517 ± 0.652
1.38GlnPro: 1.38 ± 0.428
2.273GlnGln: 2.273 ± 0.58
1.705GlnArg: 1.705 ± 0.407
3.004GlnSer: 3.004 ± 0.426
2.436GlnThr: 2.436 ± 0.403
1.705GlnVal: 1.705 ± 0.466
0.406GlnTrp: 0.406 ± 0.159
2.436GlnTyr: 2.436 ± 0.43
0.0GlnXaa: 0.0 ± 0.0
Arg
2.111ArgAla: 2.111 ± 0.413
0.325ArgCys: 0.325 ± 0.218
2.923ArgAsp: 2.923 ± 0.503
3.735ArgGlu: 3.735 ± 0.488
1.462ArgPhe: 1.462 ± 0.372
2.273ArgGly: 2.273 ± 0.51
0.487ArgHis: 0.487 ± 0.189
3.085ArgIle: 3.085 ± 0.521
3.979ArgLys: 3.979 ± 0.495
4.222ArgLeu: 4.222 ± 0.524
1.949ArgMet: 1.949 ± 0.315
3.248ArgAsn: 3.248 ± 0.513
1.218ArgPro: 1.218 ± 0.283
2.273ArgGln: 2.273 ± 0.405
2.03ArgArg: 2.03 ± 0.349
2.436ArgSer: 2.436 ± 0.437
3.329ArgThr: 3.329 ± 0.465
3.004ArgVal: 3.004 ± 0.494
0.0ArgTrp: 0.0 ± 0.0
2.598ArgTyr: 2.598 ± 0.412
0.0ArgXaa: 0.0 ± 0.0
Ser
3.167SerAla: 3.167 ± 0.729
0.325SerCys: 0.325 ± 0.163
4.547SerAsp: 4.547 ± 0.649
4.547SerGlu: 4.547 ± 0.904
3.329SerPhe: 3.329 ± 0.614
3.979SerGly: 3.979 ± 0.617
1.218SerHis: 1.218 ± 0.286
5.359SerIle: 5.359 ± 1.001
5.196SerLys: 5.196 ± 0.574
4.709SerLeu: 4.709 ± 0.792
1.624SerMet: 1.624 ± 0.362
5.034SerAsn: 5.034 ± 0.608
0.568SerPro: 0.568 ± 0.207
2.842SerGln: 2.842 ± 0.605
2.842SerArg: 2.842 ± 0.551
3.167SerSer: 3.167 ± 0.566
3.248SerThr: 3.248 ± 0.496
3.897SerVal: 3.897 ± 0.562
0.893SerTrp: 0.893 ± 0.302
2.436SerTyr: 2.436 ± 0.568
0.0SerXaa: 0.0 ± 0.0
Thr
3.654ThrAla: 3.654 ± 0.566
0.406ThrCys: 0.406 ± 0.199
4.141ThrAsp: 4.141 ± 0.328
3.816ThrGlu: 3.816 ± 0.666
2.842ThrPhe: 2.842 ± 0.468
3.167ThrGly: 3.167 ± 0.607
0.731ThrHis: 0.731 ± 0.213
4.953ThrIle: 4.953 ± 0.667
5.927ThrLys: 5.927 ± 0.72
4.872ThrLeu: 4.872 ± 0.614
1.218ThrMet: 1.218 ± 0.346
3.248ThrAsn: 3.248 ± 0.375
1.38ThrPro: 1.38 ± 0.306
2.355ThrGln: 2.355 ± 0.497
3.816ThrArg: 3.816 ± 0.527
4.303ThrSer: 4.303 ± 0.614
4.141ThrThr: 4.141 ± 0.592
3.816ThrVal: 3.816 ± 0.65
0.568ThrTrp: 0.568 ± 0.23
2.598ThrTyr: 2.598 ± 0.62
0.0ThrXaa: 0.0 ± 0.0
Val
3.654ValAla: 3.654 ± 0.529
0.487ValCys: 0.487 ± 0.235
4.466ValAsp: 4.466 ± 0.714
5.034ValGlu: 5.034 ± 0.804
2.03ValPhe: 2.03 ± 0.346
3.491ValGly: 3.491 ± 0.526
0.568ValHis: 0.568 ± 0.219
5.034ValIle: 5.034 ± 0.596
5.359ValLys: 5.359 ± 0.755
4.466ValLeu: 4.466 ± 0.583
1.38ValMet: 1.38 ± 0.304
4.547ValAsn: 4.547 ± 0.855
1.137ValPro: 1.137 ± 0.329
1.462ValGln: 1.462 ± 0.343
2.761ValArg: 2.761 ± 0.489
4.466ValSer: 4.466 ± 0.451
4.872ValThr: 4.872 ± 0.638
5.684ValVal: 5.684 ± 0.924
0.487ValTrp: 0.487 ± 0.164
2.761ValTyr: 2.761 ± 0.466
0.0ValXaa: 0.0 ± 0.0
Trp
0.812TrpAla: 0.812 ± 0.272
0.081TrpCys: 0.081 ± 0.081
0.731TrpAsp: 0.731 ± 0.214
0.65TrpGlu: 0.65 ± 0.25
0.487TrpPhe: 0.487 ± 0.205
0.325TrpGly: 0.325 ± 0.145
0.162TrpHis: 0.162 ± 0.105
1.056TrpIle: 1.056 ± 0.229
0.65TrpLys: 0.65 ± 0.259
0.812TrpLeu: 0.812 ± 0.223
0.406TrpMet: 0.406 ± 0.16
0.974TrpAsn: 0.974 ± 0.367
0.081TrpPro: 0.081 ± 0.076
0.65TrpGln: 0.65 ± 0.199
0.406TrpArg: 0.406 ± 0.165
0.65TrpSer: 0.65 ± 0.22
0.812TrpThr: 0.812 ± 0.266
0.974TrpVal: 0.974 ± 0.265
0.244TrpTrp: 0.244 ± 0.164
0.568TrpTyr: 0.568 ± 0.252
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.517TyrAla: 2.517 ± 0.458
0.568TyrCys: 0.568 ± 0.224
3.491TyrAsp: 3.491 ± 0.733
2.436TyrGlu: 2.436 ± 0.379
1.786TyrPhe: 1.786 ± 0.426
2.436TyrGly: 2.436 ± 0.478
0.731TyrHis: 0.731 ± 0.245
3.41TyrIle: 3.41 ± 0.673
4.547TyrLys: 4.547 ± 0.724
2.923TyrLeu: 2.923 ± 0.721
0.812TyrMet: 0.812 ± 0.253
2.598TyrAsn: 2.598 ± 0.52
1.624TyrPro: 1.624 ± 0.361
1.949TyrGln: 1.949 ± 0.345
1.462TyrArg: 1.462 ± 0.299
2.436TyrSer: 2.436 ± 0.431
3.41TyrThr: 3.41 ± 0.532
2.679TyrVal: 2.679 ± 0.516
0.974TyrTrp: 0.974 ± 0.243
1.786TyrTyr: 1.786 ± 0.503
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 68 proteins (12317 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski