Amino acid dipeptide frequency for Staphylococcus phage phiSa2wa_st93

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
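The counting described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the database's own code: the function name `dipeptide_permille`, the use of one-letter codes (with `X` standing for Xaa), and the normalization by the total number of overlapping pairs are all assumptions, and the database may normalize differently.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes
ALPHABET = STANDARD + "X"          # X plays the role of Xaa (assumption)

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Hypothetical helper: pairs are counted with overlap inside each protein
    and never across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        # Map every symbol outside the 20 standard residues to X.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }

# Expected frequency if all dipeptides were equally likely.
UNIFORM_400 = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%
UNIFORM_441 = 1000.0 / 441  # about 2.27 per mille when Xaa is included

freqs = dipeptide_permille(["MKKLA", "AKKX"])
over = sorted(dp for dp, f in freqs.items() if f > UNIFORM_400)
```

In this toy input there are seven overlapping pairs, so every observed dipeptide lands far above the uniform expectation; with a real proteome the comparison against 2.5‰ separates over- from underrepresented dipeptides.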

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰ ± error), grouped by first residue. Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 2.733 ± 0.79
AlaCys: 0.205 ± 0.117
AlaAsp: 2.664 ± 0.479
AlaGlu: 4.031 ± 0.535
AlaPhe: 1.571 ± 0.277
AlaGly: 3.416 ± 0.617
AlaHis: 1.23 ± 0.277
AlaIle: 4.577 ± 0.763
AlaLys: 5.944 ± 1.246
AlaLeu: 4.987 ± 0.692
AlaMet: 1.64 ± 0.298
AlaAsn: 4.031 ± 0.818
AlaPro: 1.571 ± 0.334
AlaGln: 1.64 ± 0.354
AlaArg: 2.46 ± 0.321
AlaSer: 4.509 ± 0.716
AlaThr: 3.279 ± 0.462
AlaVal: 2.733 ± 0.448
AlaTrp: 1.093 ± 0.367
AlaTyr: 2.664 ± 0.461
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.205 ± 0.119
CysCys: 0.068 ± 0.088
CysAsp: 0.068 ± 0.069
CysGlu: 0.41 ± 0.185
CysPhe: 0.342 ± 0.139
CysGly: 0.273 ± 0.146
CysHis: 0.137 ± 0.109
CysIle: 0.82 ± 0.254
CysLys: 0.615 ± 0.219
CysLeu: 0.615 ± 0.216
CysMet: 0.205 ± 0.156
CysAsn: 0.137 ± 0.108
CysPro: 0.137 ± 0.101
CysGln: 0.205 ± 0.121
CysArg: 0.342 ± 0.19
CysSer: 0.205 ± 0.134
CysThr: 0.205 ± 0.11
CysVal: 0.068 ± 0.069
CysTrp: 0.0 ± 0.0
CysTyr: 0.41 ± 0.155
CysXaa: 0.0 ± 0.0

Asp
AspAla: 2.869 ± 0.574
AspCys: 0.273 ± 0.167
AspAsp: 3.689 ± 0.618
AspGlu: 4.509 ± 0.591
AspPhe: 3.484 ± 0.442
AspGly: 3.279 ± 0.59
AspHis: 0.683 ± 0.242
AspIle: 5.124 ± 0.481
AspLys: 6.9 ± 0.695
AspLeu: 5.329 ± 0.519
AspMet: 2.323 ± 0.351
AspAsn: 2.801 ± 0.386
AspPro: 1.093 ± 0.314
AspGln: 1.776 ± 0.366
AspArg: 2.118 ± 0.364
AspSer: 3.689 ± 0.54
AspThr: 3.689 ± 0.442
AspVal: 3.689 ± 0.406
AspTrp: 0.82 ± 0.196
AspTyr: 3.143 ± 0.523
AspXaa: 0.0 ± 0.0

Glu
GluAla: 4.714 ± 0.476
GluCys: 0.478 ± 0.184
GluAsp: 4.714 ± 0.772
GluGlu: 7.925 ± 1.245
GluPhe: 3.006 ± 0.567
GluGly: 3.279 ± 0.53
GluHis: 0.888 ± 0.263
GluIle: 5.261 ± 0.816
GluLys: 8.13 ± 0.819
GluLeu: 7.037 ± 0.693
GluMet: 2.391 ± 0.371
GluAsn: 5.124 ± 0.613
GluPro: 1.093 ± 0.268
GluGln: 3.143 ± 0.489
GluArg: 3.006 ± 0.553
GluSer: 3.621 ± 0.409
GluThr: 3.826 ± 0.584
GluVal: 4.304 ± 0.507
GluTrp: 1.025 ± 0.201
GluTyr: 2.938 ± 0.504
GluXaa: 0.0 ± 0.0

Phe
PheAla: 1.845 ± 0.341
PheCys: 0.547 ± 0.169
PheAsp: 3.143 ± 0.457
PheGlu: 2.801 ± 0.427
PhePhe: 1.025 ± 0.22
PheGly: 2.938 ± 0.57
PheHis: 0.615 ± 0.199
PheIle: 3.621 ± 0.627
PheLys: 4.372 ± 0.654
PheLeu: 2.255 ± 0.35
PheMet: 1.025 ± 0.233
PheAsn: 3.963 ± 0.551
PhePro: 0.956 ± 0.292
PheGln: 1.025 ± 0.218
PheArg: 1.161 ± 0.257
PheSer: 2.391 ± 0.497
PheThr: 1.845 ± 0.339
PheVal: 2.05 ± 0.422
PheTrp: 0.342 ± 0.132
PheTyr: 1.913 ± 0.368
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 3.894 ± 1.003
GlyCys: 0.273 ± 0.121
GlyAsp: 3.758 ± 0.49
GlyGlu: 3.484 ± 0.391
GlyPhe: 2.391 ± 0.364
GlyGly: 5.466 ± 1.133
GlyHis: 1.503 ± 0.344
GlyIle: 3.484 ± 0.488
GlyLys: 5.739 ± 0.574
GlyLeu: 5.671 ± 0.813
GlyMet: 1.23 ± 0.338
GlyAsn: 3.348 ± 0.539
GlyPro: 0.956 ± 0.225
GlyGln: 1.64 ± 0.413
GlyArg: 2.05 ± 0.413
GlySer: 3.689 ± 0.608
GlyThr: 3.553 ± 0.607
GlyVal: 4.509 ± 0.595
GlyTrp: 1.025 ± 0.254
GlyTyr: 2.46 ± 0.412
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.956 ± 0.244
HisCys: 0.068 ± 0.074
HisAsp: 0.683 ± 0.275
HisGlu: 1.298 ± 0.294
HisPhe: 0.683 ± 0.186
HisGly: 1.093 ± 0.23
HisHis: 0.41 ± 0.187
HisIle: 1.298 ± 0.378
HisLys: 1.64 ± 0.311
HisLeu: 1.708 ± 0.367
HisMet: 0.205 ± 0.115
HisAsn: 0.956 ± 0.223
HisPro: 0.752 ± 0.155
HisGln: 0.615 ± 0.182
HisArg: 0.752 ± 0.2
HisSer: 1.161 ± 0.236
HisThr: 1.161 ± 0.238
HisVal: 0.752 ± 0.281
HisTrp: 0.41 ± 0.16
HisTyr: 1.025 ± 0.228
HisXaa: 0.0 ± 0.0

Ile
IleAla: 3.963 ± 0.606
IleCys: 0.478 ± 0.227
IleAsp: 5.192 ± 0.652
IleGlu: 5.602 ± 0.589
IlePhe: 2.733 ± 0.596
IleGly: 3.553 ± 0.536
IleHis: 1.708 ± 0.325
IleIle: 4.304 ± 0.669
IleLys: 7.993 ± 0.761
IleLeu: 4.919 ± 0.636
IleMet: 1.571 ± 0.344
IleAsn: 5.056 ± 0.485
IlePro: 2.186 ± 0.257
IleGln: 1.845 ± 0.374
IleArg: 3.348 ± 0.467
IleSer: 4.577 ± 0.425
IleThr: 4.782 ± 0.558
IleVal: 3.826 ± 0.673
IleTrp: 0.41 ± 0.201
IleTyr: 2.596 ± 0.469
IleXaa: 0.0 ± 0.0

Lys
LysAla: 7.515 ± 1.149
LysCys: 0.205 ± 0.133
LysAsp: 5.534 ± 0.611
LysGlu: 8.745 ± 0.916
LysPhe: 2.255 ± 0.351
LysGly: 5.329 ± 0.747
LysHis: 1.708 ± 0.396
LysIle: 6.012 ± 0.684
LysLys: 7.72 ± 0.758
LysLeu: 8.54 ± 0.919
LysMet: 3.143 ± 0.483
LysAsn: 5.876 ± 0.684
LysPro: 2.391 ± 0.449
LysGln: 4.714 ± 0.576
LysArg: 4.031 ± 0.745
LysSer: 6.012 ± 1.174
LysThr: 5.261 ± 0.62
LysVal: 5.466 ± 0.724
LysTrp: 1.981 ± 0.436
LysTyr: 5.261 ± 0.71
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 4.646 ± 0.781
LeuCys: 0.547 ± 0.184
LeuAsp: 5.261 ± 0.85
LeuGlu: 6.285 ± 0.699
LeuPhe: 3.484 ± 0.452
LeuGly: 4.372 ± 0.79
LeuHis: 1.161 ± 0.319
LeuIle: 5.124 ± 0.607
LeuLys: 8.335 ± 1.185
LeuLeu: 7.105 ± 0.814
LeuMet: 1.913 ± 0.312
LeuAsn: 5.534 ± 0.701
LeuPro: 3.074 ± 0.484
LeuGln: 3.006 ± 0.513
LeuArg: 3.553 ± 0.455
LeuSer: 5.534 ± 0.67
LeuThr: 5.056 ± 0.598
LeuVal: 3.758 ± 0.396
LeuTrp: 0.478 ± 0.189
LeuTyr: 3.621 ± 0.829
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.161 ± 0.223
MetCys: 0.342 ± 0.157
MetAsp: 1.366 ± 0.35
MetGlu: 1.366 ± 0.301
MetPhe: 1.161 ± 0.267
MetGly: 1.435 ± 0.469
MetHis: 0.342 ± 0.16
MetIle: 1.708 ± 0.355
MetLys: 2.733 ± 0.458
MetLeu: 1.981 ± 0.452
MetMet: 0.547 ± 0.187
MetAsn: 2.186 ± 0.492
MetPro: 0.956 ± 0.263
MetGln: 1.776 ± 0.439
MetArg: 1.161 ± 0.323
MetSer: 2.255 ± 0.377
MetThr: 2.255 ± 0.325
MetVal: 1.025 ± 0.184
MetTrp: 0.342 ± 0.135
MetTyr: 0.888 ± 0.223
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.348 ± 0.534
AsnCys: 0.205 ± 0.12
AsnAsp: 3.826 ± 0.423
AsnGlu: 4.782 ± 0.787
AsnPhe: 2.118 ± 0.386
AsnGly: 4.441 ± 0.457
AsnHis: 1.093 ± 0.302
AsnIle: 4.714 ± 0.55
AsnLys: 6.49 ± 0.679
AsnLeu: 4.851 ± 0.498
AsnMet: 1.435 ± 0.307
AsnAsn: 4.236 ± 0.67
AsnPro: 2.391 ± 0.328
AsnGln: 2.938 ± 0.46
AsnArg: 2.664 ± 0.433
AsnSer: 4.509 ± 0.644
AsnThr: 3.894 ± 0.415
AsnVal: 3.279 ± 0.526
AsnTrp: 1.161 ± 0.319
AsnTyr: 3.006 ± 0.558
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 1.161 ± 0.259
ProCys: 0.273 ± 0.136
ProAsp: 1.298 ± 0.287
ProGlu: 2.05 ± 0.378
ProPhe: 1.435 ± 0.318
ProGly: 1.776 ± 0.347
ProHis: 0.273 ± 0.132
ProIle: 1.503 ± 0.3
ProLys: 2.323 ± 0.445
ProLeu: 2.46 ± 0.396
ProMet: 0.683 ± 0.238
ProAsn: 2.186 ± 0.374
ProPro: 0.82 ± 0.257
ProGln: 1.093 ± 0.274
ProArg: 1.23 ± 0.23
ProSer: 2.255 ± 0.295
ProThr: 1.571 ± 0.362
ProVal: 1.435 ± 0.405
ProTrp: 0.342 ± 0.144
ProTyr: 1.093 ± 0.304
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 2.596 ± 0.359
GlnCys: 0.137 ± 0.093
GlnAsp: 2.255 ± 0.375
GlnGlu: 2.664 ± 0.423
GlnPhe: 1.435 ± 0.246
GlnGly: 2.186 ± 0.425
GlnHis: 0.888 ± 0.263
GlnIle: 3.074 ± 0.553
GlnLys: 2.869 ± 0.349
GlnLeu: 3.689 ± 0.467
GlnMet: 1.161 ± 0.282
GlnAsn: 2.255 ± 0.519
GlnPro: 1.161 ± 0.198
GlnGln: 1.366 ± 0.358
GlnArg: 1.913 ± 0.393
GlnSer: 2.323 ± 0.294
GlnThr: 1.366 ± 0.37
GlnVal: 2.255 ± 0.354
GlnTrp: 0.41 ± 0.207
GlnTyr: 1.503 ± 0.342
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 2.391 ± 0.474
ArgCys: 0.205 ± 0.155
ArgAsp: 3.348 ± 0.306
ArgGlu: 2.118 ± 0.354
ArgPhe: 2.118 ± 0.365
ArgGly: 2.05 ± 0.342
ArgHis: 0.888 ± 0.239
ArgIle: 3.689 ± 0.525
ArgLys: 4.236 ± 0.476
ArgLeu: 3.484 ± 0.624
ArgMet: 0.82 ± 0.248
ArgAsn: 2.733 ± 0.443
ArgPro: 0.615 ± 0.229
ArgGln: 1.503 ± 0.305
ArgArg: 2.323 ± 0.361
ArgSer: 1.64 ± 0.3
ArgThr: 2.391 ± 0.38
ArgVal: 2.05 ± 0.339
ArgTrp: 0.342 ± 0.158
ArgTyr: 2.255 ± 0.4
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 3.758 ± 0.739
SerCys: 0.273 ± 0.14
SerAsp: 4.577 ± 0.539
SerGlu: 4.919 ± 0.576
SerPhe: 2.733 ± 0.448
SerGly: 4.372 ± 0.941
SerHis: 0.615 ± 0.156
SerIle: 4.099 ± 0.447
SerLys: 6.695 ± 1.022
SerLeu: 4.168 ± 0.439
SerMet: 1.776 ± 0.327
SerAsn: 4.714 ± 0.611
SerPro: 1.776 ± 0.383
SerGln: 2.869 ± 0.412
SerArg: 2.323 ± 0.408
SerSer: 3.894 ± 0.707
SerThr: 3.348 ± 0.476
SerVal: 3.826 ± 0.517
SerTrp: 0.82 ± 0.213
SerTyr: 2.46 ± 0.41
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 3.348 ± 0.57
ThrCys: 0.137 ± 0.086
ThrAsp: 3.279 ± 0.481
ThrGlu: 4.031 ± 0.553
ThrPhe: 2.801 ± 0.361
ThrGly: 3.963 ± 0.531
ThrHis: 1.571 ± 0.406
ThrIle: 4.646 ± 0.61
ThrLys: 5.192 ± 0.663
ThrLeu: 4.099 ± 0.508
ThrMet: 1.366 ± 0.332
ThrAsn: 3.143 ± 0.473
ThrPro: 2.255 ± 0.331
ThrGln: 1.913 ± 0.268
ThrArg: 1.776 ± 0.239
ThrSer: 3.348 ± 0.467
ThrThr: 3.211 ± 0.573
ThrVal: 4.509 ± 0.425
ThrTrp: 0.41 ± 0.197
ThrTyr: 2.255 ± 0.406
ThrXaa: 0.0 ± 0.0

Val
ValAla: 2.938 ± 0.414
ValCys: 0.342 ± 0.135
ValAsp: 3.963 ± 0.602
ValGlu: 4.987 ± 0.521
ValPhe: 2.255 ± 0.468
ValGly: 3.416 ± 0.445
ValHis: 0.888 ± 0.213
ValIle: 3.894 ± 0.453
ValLys: 4.851 ± 0.617
ValLeu: 4.304 ± 0.554
ValMet: 1.571 ± 0.257
ValAsn: 3.758 ± 0.495
ValPro: 1.571 ± 0.337
ValGln: 1.981 ± 0.348
ValArg: 2.596 ± 0.436
ValSer: 4.168 ± 0.46
ValThr: 3.143 ± 0.521
ValVal: 3.143 ± 0.438
ValTrp: 0.478 ± 0.201
ValTyr: 2.118 ± 0.446
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.547 ± 0.187
TrpCys: 0.0 ± 0.0
TrpAsp: 0.41 ± 0.202
TrpGlu: 0.888 ± 0.26
TrpPhe: 1.23 ± 0.332
TrpGly: 0.547 ± 0.236
TrpHis: 0.0 ± 0.0
TrpIle: 1.093 ± 0.286
TrpLys: 0.683 ± 0.259
TrpLeu: 1.161 ± 0.261
TrpMet: 0.41 ± 0.199
TrpAsn: 0.888 ± 0.221
TrpPro: 0.342 ± 0.212
TrpGln: 0.547 ± 0.157
TrpArg: 0.478 ± 0.185
TrpSer: 1.161 ± 0.394
TrpThr: 0.615 ± 0.178
TrpVal: 0.888 ± 0.2
TrpTrp: 0.137 ± 0.116
TrpTyr: 0.615 ± 0.179
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.391 ± 0.29
TyrCys: 0.41 ± 0.144
TyrAsp: 2.255 ± 0.518
TyrGlu: 3.006 ± 0.592
TyrPhe: 1.571 ± 0.335
TyrGly: 2.938 ± 0.582
TyrHis: 1.025 ± 0.333
TyrIle: 2.596 ± 0.488
TyrLys: 4.236 ± 0.482
TyrLeu: 3.484 ± 0.488
TyrMet: 1.503 ± 0.299
TyrAsn: 2.46 ± 0.549
TyrPro: 1.161 ± 0.227
TyrGln: 1.913 ± 0.35
TyrArg: 1.913 ± 0.445
TyrSer: 3.143 ± 0.367
TyrThr: 2.801 ± 0.533
TyrVal: 2.801 ± 0.46
TyrTrp: 0.615 ± 0.191
TyrTyr: 1.776 ± 0.462
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 67 proteins (14638 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
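A protein-level bootstrap of this kind could look roughly as follows. The sketch makes several assumptions not stated in the source: whole proteins are resampled with replacement, 100 replicates are drawn, the reported error is taken to be the standard deviation across replicates, and dipeptides are written with one-letter codes. The names `dipeptide_permille_single` and `bootstrap_error` are hypothetical.

```python
import random
import statistics

def dipeptide_permille_single(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide among all overlapping pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        n_pairs = max(len(seq) - 1, 0)
        total += n_pairs
        hits += sum(1 for i in range(n_pairs) if seq[i:i + 2] == dipeptide)
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Each replicate draws len(proteins) whole proteins with replacement,
    so the resampling unit is the protein, not the residue.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille_single(sample, dipeptide))
    return statistics.stdev(estimates)

toy_proteome = ["MKKLAEEK", "MAKKSTV", "MLLKEEL", "MGGKKA"]
err_kk = bootstrap_error(toy_proteome, "KK")
err_ww = bootstrap_error(toy_proteome, "WW")  # absent dipeptide
```

A dipeptide that never occurs has the same frequency in every resample, which gives an error of zero; this matches the 0.0 ± 0.0 entries in the table.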

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski