Amino acid dipepetide frequency for Staphylococcus phage SAP3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.381AlaAla: 1.381 ± 0.376
0.325AlaCys: 0.325 ± 0.177
2.924AlaAsp: 2.924 ± 0.548
4.386AlaGlu: 4.386 ± 0.827
2.437AlaPhe: 2.437 ± 0.638
3.493AlaGly: 3.493 ± 0.809
0.894AlaHis: 0.894 ± 0.261
5.767AlaIle: 5.767 ± 0.68
7.067AlaLys: 7.067 ± 0.821
5.605AlaLeu: 5.605 ± 1.343
1.462AlaMet: 1.462 ± 0.436
3.899AlaAsn: 3.899 ± 0.55
1.462AlaPro: 1.462 ± 0.322
2.193AlaGln: 2.193 ± 0.429
2.599AlaArg: 2.599 ± 0.379
4.061AlaSer: 4.061 ± 0.532
3.33AlaThr: 3.33 ± 0.567
3.087AlaVal: 3.087 ± 0.71
0.569AlaTrp: 0.569 ± 0.179
2.274AlaTyr: 2.274 ± 0.456
0.0AlaXaa: 0.0 ± 0.0
Cys
0.244CysAla: 0.244 ± 0.138
0.0CysCys: 0.0 ± 0.0
0.406CysAsp: 0.406 ± 0.168
0.244CysGlu: 0.244 ± 0.147
0.162CysPhe: 0.162 ± 0.119
0.406CysGly: 0.406 ± 0.166
0.0CysHis: 0.0 ± 0.0
0.406CysIle: 0.406 ± 0.17
0.406CysLys: 0.406 ± 0.216
0.081CysLeu: 0.081 ± 0.094
0.244CysMet: 0.244 ± 0.144
0.487CysAsn: 0.487 ± 0.17
0.081CysPro: 0.081 ± 0.068
0.162CysGln: 0.162 ± 0.111
0.487CysArg: 0.487 ± 0.169
0.244CysSer: 0.244 ± 0.134
0.325CysThr: 0.325 ± 0.15
0.325CysVal: 0.325 ± 0.165
0.162CysTrp: 0.162 ± 0.118
0.162CysTyr: 0.162 ± 0.13
0.0CysXaa: 0.0 ± 0.0
Asp
4.63AspAla: 4.63 ± 0.62
0.487AspCys: 0.487 ± 0.194
5.199AspAsp: 5.199 ± 1.056
4.792AspGlu: 4.792 ± 0.774
3.087AspPhe: 3.087 ± 0.464
3.818AspGly: 3.818 ± 0.608
0.406AspHis: 0.406 ± 0.179
4.224AspIle: 4.224 ± 0.515
5.442AspLys: 5.442 ± 0.809
5.036AspLeu: 5.036 ± 0.535
1.218AspMet: 1.218 ± 0.326
4.224AspAsn: 4.224 ± 0.792
1.218AspPro: 1.218 ± 0.234
1.462AspGln: 1.462 ± 0.317
2.843AspArg: 2.843 ± 0.456
2.924AspSer: 2.924 ± 0.546
3.98AspThr: 3.98 ± 0.44
4.549AspVal: 4.549 ± 0.661
1.137AspTrp: 1.137 ± 0.305
3.899AspTyr: 3.899 ± 0.701
0.0AspXaa: 0.0 ± 0.0
Glu
2.924GluAla: 2.924 ± 0.481
0.244GluCys: 0.244 ± 0.136
4.955GluAsp: 4.955 ± 0.775
6.579GluGlu: 6.579 ± 1.414
4.468GluPhe: 4.468 ± 0.842
3.655GluGly: 3.655 ± 0.51
1.137GluHis: 1.137 ± 0.36
5.117GluIle: 5.117 ± 0.567
6.173GluLys: 6.173 ± 0.987
5.361GluLeu: 5.361 ± 0.892
2.924GluMet: 2.924 ± 0.609
4.061GluAsn: 4.061 ± 0.607
1.462GluPro: 1.462 ± 0.365
3.168GluGln: 3.168 ± 0.48
3.736GluArg: 3.736 ± 0.577
4.63GluSer: 4.63 ± 0.69
2.762GluThr: 2.762 ± 0.603
6.498GluVal: 6.498 ± 0.789
0.65GluTrp: 0.65 ± 0.227
4.061GluTyr: 4.061 ± 0.616
0.0GluXaa: 0.0 ± 0.0
Phe
2.518PheAla: 2.518 ± 0.506
0.162PheCys: 0.162 ± 0.111
3.33PheAsp: 3.33 ± 0.482
3.33PheGlu: 3.33 ± 0.606
1.706PhePhe: 1.706 ± 0.398
3.005PheGly: 3.005 ± 0.766
0.731PheHis: 0.731 ± 0.211
2.762PheIle: 2.762 ± 0.453
4.224PheLys: 4.224 ± 0.519
2.762PheLeu: 2.762 ± 0.454
1.3PheMet: 1.3 ± 0.343
3.574PheAsn: 3.574 ± 0.48
0.569PhePro: 0.569 ± 0.259
1.3PheGln: 1.3 ± 0.318
1.3PheArg: 1.3 ± 0.378
1.706PheSer: 1.706 ± 0.486
1.868PheThr: 1.868 ± 0.39
2.599PheVal: 2.599 ± 0.464
0.65PheTrp: 0.65 ± 0.318
2.031PheTyr: 2.031 ± 0.461
0.0PheXaa: 0.0 ± 0.0
Gly
4.143GlyAla: 4.143 ± 0.933
0.244GlyCys: 0.244 ± 0.126
3.899GlyAsp: 3.899 ± 0.629
3.249GlyGlu: 3.249 ± 0.672
3.005GlyPhe: 3.005 ± 0.594
3.33GlyGly: 3.33 ± 0.747
0.894GlyHis: 0.894 ± 0.348
5.361GlyIle: 5.361 ± 0.762
4.386GlyLys: 4.386 ± 0.736
4.061GlyLeu: 4.061 ± 0.644
2.112GlyMet: 2.112 ± 0.476
3.249GlyAsn: 3.249 ± 0.555
0.812GlyPro: 0.812 ± 0.286
2.924GlyGln: 2.924 ± 0.544
2.599GlyArg: 2.599 ± 0.471
2.762GlySer: 2.762 ± 0.553
3.736GlyThr: 3.736 ± 0.778
5.524GlyVal: 5.524 ± 0.719
0.812GlyTrp: 0.812 ± 0.282
3.899GlyTyr: 3.899 ± 0.626
0.0GlyXaa: 0.0 ± 0.0
His
0.65HisAla: 0.65 ± 0.196
0.0HisCys: 0.0 ± 0.0
1.056HisAsp: 1.056 ± 0.313
0.569HisGlu: 0.569 ± 0.17
0.487HisPhe: 0.487 ± 0.195
0.894HisGly: 0.894 ± 0.273
0.244HisHis: 0.244 ± 0.133
1.3HisIle: 1.3 ± 0.303
1.218HisLys: 1.218 ± 0.351
1.3HisLeu: 1.3 ± 0.386
0.325HisMet: 0.325 ± 0.152
1.137HisAsn: 1.137 ± 0.262
0.812HisPro: 0.812 ± 0.284
0.731HisGln: 0.731 ± 0.252
0.569HisArg: 0.569 ± 0.292
0.975HisSer: 0.975 ± 0.246
1.218HisThr: 1.218 ± 0.35
1.137HisVal: 1.137 ± 0.378
0.162HisTrp: 0.162 ± 0.101
0.975HisTyr: 0.975 ± 0.283
0.0HisXaa: 0.0 ± 0.0
Ile
4.711IleAla: 4.711 ± 0.567
0.325IleCys: 0.325 ± 0.141
6.255IleAsp: 6.255 ± 0.934
7.229IleGlu: 7.229 ± 0.761
2.031IlePhe: 2.031 ± 0.456
4.386IleGly: 4.386 ± 0.807
1.218IleHis: 1.218 ± 0.332
4.224IleIle: 4.224 ± 0.622
7.96IleLys: 7.96 ± 0.8
4.955IleLeu: 4.955 ± 0.633
1.381IleMet: 1.381 ± 0.41
6.011IleAsn: 6.011 ± 0.591
2.518IlePro: 2.518 ± 0.516
2.356IleGln: 2.356 ± 0.395
3.168IleArg: 3.168 ± 0.49
4.711IleSer: 4.711 ± 0.699
5.442IleThr: 5.442 ± 0.832
3.818IleVal: 3.818 ± 0.51
0.894IleTrp: 0.894 ± 0.372
2.924IleTyr: 2.924 ± 0.623
0.0IleXaa: 0.0 ± 0.0
Lys
5.686LysAla: 5.686 ± 0.71
0.325LysCys: 0.325 ± 0.147
5.524LysAsp: 5.524 ± 0.685
7.392LysGlu: 7.392 ± 1.099
2.518LysPhe: 2.518 ± 0.367
6.255LysGly: 6.255 ± 0.961
1.706LysHis: 1.706 ± 0.319
6.011LysIle: 6.011 ± 0.935
6.255LysLys: 6.255 ± 1.056
6.417LysLeu: 6.417 ± 0.801
2.681LysMet: 2.681 ± 0.46
5.199LysAsn: 5.199 ± 0.527
3.899LysPro: 3.899 ± 0.825
3.33LysGln: 3.33 ± 0.604
4.63LysArg: 4.63 ± 0.803
4.63LysSer: 4.63 ± 0.587
4.792LysThr: 4.792 ± 0.712
4.386LysVal: 4.386 ± 0.525
1.137LysTrp: 1.137 ± 0.42
4.224LysTyr: 4.224 ± 0.71
0.0LysXaa: 0.0 ± 0.0
Leu
4.224LeuAla: 4.224 ± 0.622
0.487LeuCys: 0.487 ± 0.219
4.224LeuAsp: 4.224 ± 0.597
5.93LeuGlu: 5.93 ± 0.851
3.412LeuPhe: 3.412 ± 0.614
3.98LeuGly: 3.98 ± 0.512
1.218LeuHis: 1.218 ± 0.276
4.955LeuIle: 4.955 ± 0.685
7.717LeuLys: 7.717 ± 0.634
5.361LeuLeu: 5.361 ± 0.819
2.193LeuMet: 2.193 ± 0.451
6.011LeuAsn: 6.011 ± 0.601
2.843LeuPro: 2.843 ± 0.4
3.412LeuGln: 3.412 ± 0.624
4.061LeuArg: 4.061 ± 0.604
4.711LeuSer: 4.711 ± 0.658
4.63LeuThr: 4.63 ± 0.497
3.98LeuVal: 3.98 ± 0.441
0.406LeuTrp: 0.406 ± 0.189
3.005LeuTyr: 3.005 ± 0.489
0.0LeuXaa: 0.0 ± 0.0
Met
2.274MetAla: 2.274 ± 0.497
0.0MetCys: 0.0 ± 0.0
1.218MetAsp: 1.218 ± 0.358
1.787MetGlu: 1.787 ± 0.411
1.137MetPhe: 1.137 ± 0.327
0.975MetGly: 0.975 ± 0.32
0.731MetHis: 0.731 ± 0.259
1.868MetIle: 1.868 ± 0.388
1.949MetLys: 1.949 ± 0.399
2.437MetLeu: 2.437 ± 0.469
0.487MetMet: 0.487 ± 0.201
1.137MetAsn: 1.137 ± 0.358
0.731MetPro: 0.731 ± 0.226
0.894MetGln: 0.894 ± 0.314
1.137MetArg: 1.137 ± 0.278
1.381MetSer: 1.381 ± 0.385
2.193MetThr: 2.193 ± 0.404
0.975MetVal: 0.975 ± 0.245
0.569MetTrp: 0.569 ± 0.223
1.218MetTyr: 1.218 ± 0.289
0.0MetXaa: 0.0 ± 0.0
Asn
4.955AsnAla: 4.955 ± 0.625
0.487AsnCys: 0.487 ± 0.256
4.874AsnAsp: 4.874 ± 0.771
4.224AsnGlu: 4.224 ± 0.598
2.437AsnPhe: 2.437 ± 0.403
4.874AsnGly: 4.874 ± 0.78
0.812AsnHis: 0.812 ± 0.22
4.955AsnIle: 4.955 ± 0.796
6.173AsnLys: 6.173 ± 0.543
4.143AsnLeu: 4.143 ± 0.539
1.137AsnMet: 1.137 ± 0.313
4.63AsnAsn: 4.63 ± 0.693
2.193AsnPro: 2.193 ± 0.379
3.168AsnGln: 3.168 ± 0.595
2.274AsnArg: 2.274 ± 0.356
2.681AsnSer: 2.681 ± 0.379
4.549AsnThr: 4.549 ± 0.838
3.98AsnVal: 3.98 ± 0.689
0.569AsnTrp: 0.569 ± 0.225
1.949AsnTyr: 1.949 ± 0.421
0.0AsnXaa: 0.0 ± 0.0
Pro
1.381ProAla: 1.381 ± 0.358
0.0ProCys: 0.0 ± 0.0
1.543ProAsp: 1.543 ± 0.46
1.868ProGlu: 1.868 ± 0.361
1.137ProPhe: 1.137 ± 0.384
1.787ProGly: 1.787 ± 0.591
0.65ProHis: 0.65 ± 0.216
3.33ProIle: 3.33 ± 0.474
2.274ProLys: 2.274 ± 0.458
2.681ProLeu: 2.681 ± 0.546
0.731ProMet: 0.731 ± 0.199
2.031ProAsn: 2.031 ± 0.369
0.65ProPro: 0.65 ± 0.413
0.894ProGln: 0.894 ± 0.257
0.569ProArg: 0.569 ± 0.273
2.274ProSer: 2.274 ± 0.369
2.193ProThr: 2.193 ± 0.453
1.949ProVal: 1.949 ± 0.341
0.244ProTrp: 0.244 ± 0.155
1.462ProTyr: 1.462 ± 0.367
0.0ProXaa: 0.0 ± 0.0
Gln
2.681GlnAla: 2.681 ± 0.545
0.244GlnCys: 0.244 ± 0.133
1.868GlnAsp: 1.868 ± 0.338
3.087GlnGlu: 3.087 ± 0.643
1.625GlnPhe: 1.625 ± 0.496
2.274GlnGly: 2.274 ± 0.556
0.569GlnHis: 0.569 ± 0.234
2.924GlnIle: 2.924 ± 0.454
3.493GlnLys: 3.493 ± 0.584
3.249GlnLeu: 3.249 ± 0.539
1.218GlnMet: 1.218 ± 0.357
1.787GlnAsn: 1.787 ± 0.406
1.543GlnPro: 1.543 ± 0.288
1.625GlnGln: 1.625 ± 0.507
1.462GlnArg: 1.462 ± 0.364
1.949GlnSer: 1.949 ± 0.442
1.706GlnThr: 1.706 ± 0.323
2.437GlnVal: 2.437 ± 0.435
0.569GlnTrp: 0.569 ± 0.265
1.706GlnTyr: 1.706 ± 0.315
0.0GlnXaa: 0.0 ± 0.0
Arg
2.274ArgAla: 2.274 ± 0.616
0.325ArgCys: 0.325 ± 0.156
2.762ArgAsp: 2.762 ± 0.517
4.061ArgGlu: 4.061 ± 0.68
1.787ArgPhe: 1.787 ± 0.315
2.193ArgGly: 2.193 ± 0.4
1.056ArgHis: 1.056 ± 0.287
3.249ArgIle: 3.249 ± 0.418
2.762ArgLys: 2.762 ± 0.514
4.468ArgLeu: 4.468 ± 0.682
0.975ArgMet: 0.975 ± 0.255
3.168ArgAsn: 3.168 ± 0.508
1.381ArgPro: 1.381 ± 0.314
1.3ArgGln: 1.3 ± 0.301
1.543ArgArg: 1.543 ± 0.326
1.949ArgSer: 1.949 ± 0.352
2.356ArgThr: 2.356 ± 0.485
1.625ArgVal: 1.625 ± 0.319
0.731ArgTrp: 0.731 ± 0.211
2.112ArgTyr: 2.112 ± 0.539
0.0ArgXaa: 0.0 ± 0.0
Ser
2.762SerAla: 2.762 ± 0.655
0.325SerCys: 0.325 ± 0.147
3.655SerAsp: 3.655 ± 0.599
4.468SerGlu: 4.468 ± 0.799
2.356SerPhe: 2.356 ± 0.377
4.63SerGly: 4.63 ± 0.655
1.218SerHis: 1.218 ± 0.364
4.63SerIle: 4.63 ± 0.911
4.792SerLys: 4.792 ± 0.621
4.143SerLeu: 4.143 ± 0.574
1.381SerMet: 1.381 ± 0.353
3.818SerAsn: 3.818 ± 0.735
1.543SerPro: 1.543 ± 0.35
2.762SerGln: 2.762 ± 0.515
2.356SerArg: 2.356 ± 0.456
3.818SerSer: 3.818 ± 0.575
2.437SerThr: 2.437 ± 0.433
3.98SerVal: 3.98 ± 0.732
0.894SerTrp: 0.894 ± 0.29
2.274SerTyr: 2.274 ± 0.517
0.0SerXaa: 0.0 ± 0.0
Thr
3.899ThrAla: 3.899 ± 0.629
0.081ThrCys: 0.081 ± 0.082
3.574ThrAsp: 3.574 ± 0.563
3.493ThrGlu: 3.493 ± 0.533
2.762ThrPhe: 2.762 ± 0.503
3.818ThrGly: 3.818 ± 0.719
0.812ThrHis: 0.812 ± 0.228
4.792ThrIle: 4.792 ± 0.798
4.874ThrLys: 4.874 ± 0.609
6.173ThrLeu: 6.173 ± 0.849
1.137ThrMet: 1.137 ± 0.362
3.412ThrAsn: 3.412 ± 0.593
2.031ThrPro: 2.031 ± 0.445
1.949ThrGln: 1.949 ± 0.441
2.193ThrArg: 2.193 ± 0.457
3.493ThrSer: 3.493 ± 0.672
4.143ThrThr: 4.143 ± 0.875
3.412ThrVal: 3.412 ± 0.606
0.569ThrTrp: 0.569 ± 0.261
2.843ThrTyr: 2.843 ± 0.458
0.0ThrXaa: 0.0 ± 0.0
Val
4.549ValAla: 4.549 ± 0.685
0.406ValCys: 0.406 ± 0.188
4.061ValAsp: 4.061 ± 0.741
4.143ValGlu: 4.143 ± 0.757
2.681ValPhe: 2.681 ± 0.434
3.574ValGly: 3.574 ± 0.516
0.325ValHis: 0.325 ± 0.18
5.767ValIle: 5.767 ± 0.796
5.361ValLys: 5.361 ± 0.696
4.549ValLeu: 4.549 ± 0.557
1.381ValMet: 1.381 ± 0.394
3.574ValAsn: 3.574 ± 0.497
2.356ValPro: 2.356 ± 0.434
1.787ValGln: 1.787 ± 0.348
2.762ValArg: 2.762 ± 0.43
4.874ValSer: 4.874 ± 0.778
4.549ValThr: 4.549 ± 0.716
4.224ValVal: 4.224 ± 0.574
0.487ValTrp: 0.487 ± 0.203
1.543ValTyr: 1.543 ± 0.355
0.0ValXaa: 0.0 ± 0.0
Trp
0.731TrpAla: 0.731 ± 0.279
0.081TrpCys: 0.081 ± 0.083
0.569TrpAsp: 0.569 ± 0.207
0.731TrpGlu: 0.731 ± 0.258
0.406TrpPhe: 0.406 ± 0.169
0.569TrpGly: 0.569 ± 0.3
0.569TrpHis: 0.569 ± 0.187
0.894TrpIle: 0.894 ± 0.276
0.569TrpLys: 0.569 ± 0.176
0.975TrpLeu: 0.975 ± 0.389
0.0TrpMet: 0.0 ± 0.0
0.65TrpAsn: 0.65 ± 0.3
0.162TrpPro: 0.162 ± 0.132
0.812TrpGln: 0.812 ± 0.334
0.244TrpArg: 0.244 ± 0.131
1.462TrpSer: 1.462 ± 0.399
0.487TrpThr: 0.487 ± 0.174
1.543TrpVal: 1.543 ± 0.341
0.162TrpTrp: 0.162 ± 0.095
0.569TrpTyr: 0.569 ± 0.268
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.599TyrAla: 2.599 ± 0.506
0.487TyrCys: 0.487 ± 0.225
2.681TyrAsp: 2.681 ± 0.485
3.005TyrGlu: 3.005 ± 0.564
1.787TyrPhe: 1.787 ± 0.368
3.087TyrGly: 3.087 ± 0.502
0.569TyrHis: 0.569 ± 0.242
3.98TyrIle: 3.98 ± 0.624
3.899TyrLys: 3.899 ± 0.646
3.005TyrLeu: 3.005 ± 0.594
0.731TyrMet: 0.731 ± 0.214
3.087TyrAsn: 3.087 ± 0.544
1.381TyrPro: 1.381 ± 0.381
1.787TyrGln: 1.787 ± 0.339
1.543TyrArg: 1.543 ± 0.429
3.005TyrSer: 3.005 ± 0.638
2.762TyrThr: 2.762 ± 0.38
2.924TyrVal: 2.924 ± 0.413
0.812TyrTrp: 0.812 ± 0.291
2.274TyrTyr: 2.274 ± 0.449
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 60 proteins (12312 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski