Amino acid dipepetide frequency for Staphylococcus phage vB_SsapH-Golestan101-M

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.487AlaAla: 0.487 ± 0.166
0.195AlaCys: 0.195 ± 0.066
1.897AlaAsp: 1.897 ± 0.225
2.798AlaGlu: 2.798 ± 0.326
1.727AlaPhe: 1.727 ± 0.212
2.141AlaGly: 2.141 ± 0.353
0.778AlaHis: 0.778 ± 0.152
3.041AlaIle: 3.041 ± 0.324
3.795AlaLys: 3.795 ± 0.276
3.357AlaLeu: 3.357 ± 0.403
0.924AlaMet: 0.924 ± 0.179
1.776AlaAsn: 1.776 ± 0.22
1.338AlaPro: 1.338 ± 0.211
1.533AlaGln: 1.533 ± 0.227
1.411AlaArg: 1.411 ± 0.195
2.749AlaSer: 2.749 ± 0.265
2.7AlaThr: 2.7 ± 0.313
2.238AlaVal: 2.238 ± 0.254
0.243AlaTrp: 0.243 ± 0.08
2.165AlaTyr: 2.165 ± 0.195
0.0AlaXaa: 0.0 ± 0.0
Cys
0.17CysAla: 0.17 ± 0.057
0.049CysCys: 0.049 ± 0.034
0.219CysAsp: 0.219 ± 0.076
0.292CysGlu: 0.292 ± 0.084
0.389CysPhe: 0.389 ± 0.101
0.632CysGly: 0.632 ± 0.162
0.097CysHis: 0.097 ± 0.042
0.268CysIle: 0.268 ± 0.077
0.73CysLys: 0.73 ± 0.198
0.657CysLeu: 0.657 ± 0.14
0.122CysMet: 0.122 ± 0.058
0.219CysAsn: 0.219 ± 0.091
0.462CysPro: 0.462 ± 0.121
0.268CysGln: 0.268 ± 0.091
0.389CysArg: 0.389 ± 0.111
0.365CysSer: 0.365 ± 0.102
0.316CysThr: 0.316 ± 0.09
0.316CysVal: 0.316 ± 0.094
0.073CysTrp: 0.073 ± 0.045
0.389CysTyr: 0.389 ± 0.094
0.0CysXaa: 0.0 ± 0.0
Asp
2.311AspAla: 2.311 ± 0.269
0.341AspCys: 0.341 ± 0.092
3.552AspAsp: 3.552 ± 0.35
4.914AspGlu: 4.914 ± 0.352
3.43AspPhe: 3.43 ± 0.3
3.065AspGly: 3.065 ± 0.25
0.608AspHis: 0.608 ± 0.121
6.836AspIle: 6.836 ± 0.481
6.325AspLys: 6.325 ± 0.46
5.474AspLeu: 5.474 ± 0.421
1.825AspMet: 1.825 ± 0.192
5.011AspAsn: 5.011 ± 0.347
1.654AspPro: 1.654 ± 0.263
1.241AspGln: 1.241 ± 0.183
2.506AspArg: 2.506 ± 0.278
4.233AspSer: 4.233 ± 0.348
4.379AspThr: 4.379 ± 0.334
3.965AspVal: 3.965 ± 0.34
0.487AspTrp: 0.487 ± 0.105
4.282AspTyr: 4.282 ± 0.298
0.0AspXaa: 0.0 ± 0.0
Glu
3.284GluAla: 3.284 ± 0.282
0.56GluCys: 0.56 ± 0.119
6.86GluAsp: 6.86 ± 0.559
10.071GluGlu: 10.071 ± 0.751
3.479GluPhe: 3.479 ± 0.336
5.011GluGly: 5.011 ± 0.376
1.606GluHis: 1.606 ± 0.204
5.936GluIle: 5.936 ± 0.352
6.884GluLys: 6.884 ± 0.474
7.347GluLeu: 7.347 ± 0.409
2.189GluMet: 2.189 ± 0.223
5.328GluAsn: 5.328 ± 0.396
2.214GluPro: 2.214 ± 0.303
4.403GluGln: 4.403 ± 0.453
3.284GluArg: 3.284 ± 0.299
5.182GluSer: 5.182 ± 0.318
4.184GluThr: 4.184 ± 0.296
5.328GluVal: 5.328 ± 0.422
0.705GluTrp: 0.705 ± 0.111
4.89GluTyr: 4.89 ± 0.414
0.0GluXaa: 0.0 ± 0.0
Phe
1.435PheAla: 1.435 ± 0.191
0.195PheCys: 0.195 ± 0.059
2.7PheAsp: 2.7 ± 0.218
3.065PheGlu: 3.065 ± 0.256
1.289PhePhe: 1.289 ± 0.185
1.897PheGly: 1.897 ± 0.28
0.511PheHis: 0.511 ± 0.111
3.527PheIle: 3.527 ± 0.303
4.233PheLys: 4.233 ± 0.331
3.211PheLeu: 3.211 ± 0.271
1.095PheMet: 1.095 ± 0.167
3.673PheAsn: 3.673 ± 0.263
0.827PhePro: 0.827 ± 0.12
1.192PheGln: 1.192 ± 0.158
1.435PheArg: 1.435 ± 0.187
2.141PheSer: 2.141 ± 0.265
2.944PheThr: 2.944 ± 0.329
2.579PheVal: 2.579 ± 0.271
0.341PheTrp: 0.341 ± 0.078
2.043PheTyr: 2.043 ± 0.236
0.0PheXaa: 0.0 ± 0.0
Gly
2.335GlyAla: 2.335 ± 0.529
0.292GlyCys: 0.292 ± 0.104
3.6GlyAsp: 3.6 ± 0.396
4.33GlyGlu: 4.33 ± 0.375
2.7GlyPhe: 2.7 ± 0.258
3.892GlyGly: 3.892 ± 0.886
1.07GlyHis: 1.07 ± 0.163
4.282GlyIle: 4.282 ± 0.414
5.084GlyLys: 5.084 ± 0.684
4.354GlyLeu: 4.354 ± 0.343
1.533GlyMet: 1.533 ± 0.279
4.476GlyAsn: 4.476 ± 0.343
0.0GlyPro: 0.0 ± 0.0
1.97GlyGln: 1.97 ± 0.274
2.043GlyArg: 2.043 ± 0.253
3.552GlySer: 3.552 ± 0.363
4.257GlyThr: 4.257 ± 0.384
3.6GlyVal: 3.6 ± 0.338
0.584GlyTrp: 0.584 ± 0.106
3.333GlyTyr: 3.333 ± 0.306
0.0GlyXaa: 0.0 ± 0.0
His
0.584HisAla: 0.584 ± 0.116
0.219HisCys: 0.219 ± 0.073
0.876HisAsp: 0.876 ± 0.141
1.07HisGlu: 1.07 ± 0.209
0.608HisPhe: 0.608 ± 0.102
0.778HisGly: 0.778 ± 0.132
0.219HisHis: 0.219 ± 0.062
1.338HisIle: 1.338 ± 0.19
1.557HisLys: 1.557 ± 0.216
1.508HisLeu: 1.508 ± 0.203
0.341HisMet: 0.341 ± 0.081
1.07HisAsn: 1.07 ± 0.18
0.389HisPro: 0.389 ± 0.093
0.608HisGln: 0.608 ± 0.125
0.608HisArg: 0.608 ± 0.13
0.803HisSer: 0.803 ± 0.118
0.851HisThr: 0.851 ± 0.139
0.876HisVal: 0.876 ± 0.17
0.219HisTrp: 0.219 ± 0.082
0.876HisTyr: 0.876 ± 0.123
0.0HisXaa: 0.0 ± 0.0
Ile
2.968IleAla: 2.968 ± 0.28
0.414IleCys: 0.414 ± 0.102
6.009IleAsp: 6.009 ± 0.366
7.03IleGlu: 7.03 ± 0.519
2.36IlePhe: 2.36 ± 0.201
4.16IleGly: 4.16 ± 0.376
1.095IleHis: 1.095 ± 0.188
4.914IleIle: 4.914 ± 0.392
7.298IleLys: 7.298 ± 0.488
5.328IleLeu: 5.328 ± 0.477
1.8IleMet: 1.8 ± 0.243
5.522IleAsn: 5.522 ± 0.339
2.433IlePro: 2.433 ± 0.255
2.773IleGln: 2.773 ± 0.228
3.089IleArg: 3.089 ± 0.329
3.892IleSer: 3.892 ± 0.354
4.695IleThr: 4.695 ± 0.38
4.5IleVal: 4.5 ± 0.364
0.632IleTrp: 0.632 ± 0.145
3.357IleTyr: 3.357 ± 0.306
0.0IleXaa: 0.0 ± 0.0
Lys
3.6LysAla: 3.6 ± 0.327
0.584LysCys: 0.584 ± 0.162
7.298LysAsp: 7.298 ± 0.417
11.069LysGlu: 11.069 ± 0.757
2.846LysPhe: 2.846 ± 0.262
6.179LysGly: 6.179 ± 0.508
1.703LysHis: 1.703 ± 0.257
5.644LysIle: 5.644 ± 0.416
7.979LysLys: 7.979 ± 0.624
6.86LysLeu: 6.86 ± 0.433
2.311LysMet: 2.311 ± 0.271
5.084LysAsn: 5.084 ± 0.345
2.652LysPro: 2.652 ± 0.244
4.622LysGln: 4.622 ± 0.366
3.625LysArg: 3.625 ± 0.317
5.011LysSer: 5.011 ± 0.436
4.938LysThr: 4.938 ± 0.34
6.106LysVal: 6.106 ± 0.388
0.705LysTrp: 0.705 ± 0.152
4.963LysTyr: 4.963 ± 0.348
0.0LysXaa: 0.0 ± 0.0
Leu
2.798LeuAla: 2.798 ± 0.261
0.511LeuCys: 0.511 ± 0.116
6.203LeuAsp: 6.203 ± 0.478
7.809LeuGlu: 7.809 ± 0.527
2.798LeuPhe: 2.798 ± 0.298
4.817LeuGly: 4.817 ± 0.454
0.924LeuHis: 0.924 ± 0.138
5.668LeuIle: 5.668 ± 0.469
7.152LeuLys: 7.152 ± 0.446
5.717LeuLeu: 5.717 ± 0.48
1.97LeuMet: 1.97 ± 0.192
6.179LeuAsn: 6.179 ± 0.37
2.189LeuPro: 2.189 ± 0.215
2.749LeuGln: 2.749 ± 0.295
3.552LeuArg: 3.552 ± 0.313
5.425LeuSer: 5.425 ± 0.327
5.036LeuThr: 5.036 ± 0.39
3.868LeuVal: 3.868 ± 0.307
0.632LeuTrp: 0.632 ± 0.118
3.698LeuTyr: 3.698 ± 0.31
0.0LeuXaa: 0.0 ± 0.0
Met
1.265MetAla: 1.265 ± 0.172
0.195MetCys: 0.195 ± 0.067
1.387MetAsp: 1.387 ± 0.196
1.703MetGlu: 1.703 ± 0.183
1.143MetPhe: 1.143 ± 0.167
1.143MetGly: 1.143 ± 0.309
0.195MetHis: 0.195 ± 0.078
1.825MetIle: 1.825 ± 0.225
2.53MetLys: 2.53 ± 0.288
1.897MetLeu: 1.897 ± 0.272
0.73MetMet: 0.73 ± 0.279
1.654MetAsn: 1.654 ± 0.189
0.487MetPro: 0.487 ± 0.109
0.632MetGln: 0.632 ± 0.115
1.289MetArg: 1.289 ± 0.191
1.776MetSer: 1.776 ± 0.258
1.46MetThr: 1.46 ± 0.171
1.387MetVal: 1.387 ± 0.215
0.122MetTrp: 0.122 ± 0.053
0.9MetTyr: 0.9 ± 0.148
0.0MetXaa: 0.0 ± 0.0
Asn
2.408AsnAla: 2.408 ± 0.276
0.487AsnCys: 0.487 ± 0.12
3.917AsnAsp: 3.917 ± 0.251
5.157AsnGlu: 5.157 ± 0.386
2.725AsnPhe: 2.725 ± 0.296
3.941AsnGly: 3.941 ± 0.287
1.095AsnHis: 1.095 ± 0.224
5.084AsnIle: 5.084 ± 0.35
7.712AsnLys: 7.712 ± 0.462
5.644AsnLeu: 5.644 ± 0.401
1.63AsnMet: 1.63 ± 0.211
5.692AsnAsn: 5.692 ± 0.52
2.165AsnPro: 2.165 ± 0.3
2.092AsnGln: 2.092 ± 0.314
2.846AsnArg: 2.846 ± 0.237
4.646AsnSer: 4.646 ± 0.344
4.646AsnThr: 4.646 ± 0.307
3.819AsnVal: 3.819 ± 0.284
0.632AsnTrp: 0.632 ± 0.125
3.26AsnTyr: 3.26 ± 0.303
0.0AsnXaa: 0.0 ± 0.0
Pro
0.803ProAla: 0.803 ± 0.151
0.195ProCys: 0.195 ± 0.065
1.338ProAsp: 1.338 ± 0.216
2.53ProGlu: 2.53 ± 0.253
1.338ProPhe: 1.338 ± 0.186
0.754ProGly: 0.754 ± 0.113
0.389ProHis: 0.389 ± 0.1
1.873ProIle: 1.873 ± 0.254
2.676ProLys: 2.676 ± 0.239
2.092ProLeu: 2.092 ± 0.222
0.535ProMet: 0.535 ± 0.107
2.019ProAsn: 2.019 ± 0.235
0.365ProPro: 0.365 ± 0.103
1.533ProGln: 1.533 ± 0.29
0.9ProArg: 0.9 ± 0.145
2.019ProSer: 2.019 ± 0.229
1.946ProThr: 1.946 ± 0.279
1.411ProVal: 1.411 ± 0.23
0.17ProTrp: 0.17 ± 0.058
1.654ProTyr: 1.654 ± 0.213
0.0ProXaa: 0.0 ± 0.0
Gln
2.262GlnAla: 2.262 ± 0.293
0.097GlnCys: 0.097 ± 0.044
2.214GlnAsp: 2.214 ± 0.191
3.746GlnGlu: 3.746 ± 0.373
1.581GlnPhe: 1.581 ± 0.165
2.822GlnGly: 2.822 ± 0.367
0.414GlnHis: 0.414 ± 0.08
2.506GlnIle: 2.506 ± 0.256
2.822GlnLys: 2.822 ± 0.29
2.554GlnLeu: 2.554 ± 0.28
0.73GlnMet: 0.73 ± 0.139
2.043GlnAsn: 2.043 ± 0.221
1.533GlnPro: 1.533 ± 0.384
2.335GlnGln: 2.335 ± 0.605
1.581GlnArg: 1.581 ± 0.219
2.481GlnSer: 2.481 ± 0.323
1.8GlnThr: 1.8 ± 0.227
2.554GlnVal: 2.554 ± 0.317
0.316GlnTrp: 0.316 ± 0.088
1.8GlnTyr: 1.8 ± 0.228
0.0GlnXaa: 0.0 ± 0.0
Arg
1.849ArgAla: 1.849 ± 0.249
0.292ArgCys: 0.292 ± 0.113
2.627ArgAsp: 2.627 ± 0.311
3.357ArgGlu: 3.357 ± 0.277
1.581ArgPhe: 1.581 ± 0.19
1.849ArgGly: 1.849 ± 0.204
0.316ArgHis: 0.316 ± 0.092
2.798ArgIle: 2.798 ± 0.239
3.941ArgLys: 3.941 ± 0.409
3.381ArgLeu: 3.381 ± 0.289
1.095ArgMet: 1.095 ± 0.199
2.433ArgAsn: 2.433 ± 0.228
0.9ArgPro: 0.9 ± 0.157
1.508ArgGln: 1.508 ± 0.173
1.654ArgArg: 1.654 ± 0.215
1.727ArgSer: 1.727 ± 0.18
2.506ArgThr: 2.506 ± 0.343
2.603ArgVal: 2.603 ± 0.284
0.292ArgTrp: 0.292 ± 0.084
2.384ArgTyr: 2.384 ± 0.235
0.0ArgXaa: 0.0 ± 0.0
Ser
1.922SerAla: 1.922 ± 0.278
0.195SerCys: 0.195 ± 0.078
3.965SerAsp: 3.965 ± 0.313
4.865SerGlu: 4.865 ± 0.33
2.36SerPhe: 2.36 ± 0.227
4.014SerGly: 4.014 ± 0.395
0.973SerHis: 0.973 ± 0.181
5.23SerIle: 5.23 ± 0.387
5.984SerLys: 5.984 ± 0.442
4.963SerLeu: 4.963 ± 0.342
1.241SerMet: 1.241 ± 0.226
4.476SerAsn: 4.476 ± 0.305
1.752SerPro: 1.752 ± 0.197
2.141SerGln: 2.141 ± 0.305
1.776SerArg: 1.776 ± 0.209
4.379SerSer: 4.379 ± 0.415
3.357SerThr: 3.357 ± 0.274
3.333SerVal: 3.333 ± 0.275
0.608SerTrp: 0.608 ± 0.123
3.43SerTyr: 3.43 ± 0.306
0.0SerXaa: 0.0 ± 0.0
Thr
2.335ThrAla: 2.335 ± 0.236
0.365ThrCys: 0.365 ± 0.106
3.406ThrAsp: 3.406 ± 0.271
4.671ThrGlu: 4.671 ± 0.377
2.822ThrPhe: 2.822 ± 0.287
4.136ThrGly: 4.136 ± 0.376
1.192ThrHis: 1.192 ± 0.151
5.084ThrIle: 5.084 ± 0.395
5.498ThrLys: 5.498 ± 0.422
5.182ThrLeu: 5.182 ± 0.31
1.095ThrMet: 1.095 ± 0.208
3.99ThrAsn: 3.99 ± 0.319
2.043ThrPro: 2.043 ± 0.233
2.141ThrGln: 2.141 ± 0.253
2.919ThrArg: 2.919 ± 0.294
3.211ThrSer: 3.211 ± 0.294
3.722ThrThr: 3.722 ± 0.393
4.136ThrVal: 4.136 ± 0.407
0.487ThrTrp: 0.487 ± 0.115
2.749ThrTyr: 2.749 ± 0.344
0.0ThrXaa: 0.0 ± 0.0
Val
2.092ValAla: 2.092 ± 0.194
0.632ValCys: 0.632 ± 0.132
4.379ValAsp: 4.379 ± 0.323
5.595ValGlu: 5.595 ± 0.426
2.457ValPhe: 2.457 ± 0.271
2.749ValGly: 2.749 ± 0.262
1.143ValHis: 1.143 ± 0.162
3.527ValIle: 3.527 ± 0.263
5.79ValLys: 5.79 ± 0.392
5.182ValLeu: 5.182 ± 0.398
1.168ValMet: 1.168 ± 0.16
4.354ValAsn: 4.354 ± 0.329
1.97ValPro: 1.97 ± 0.3
1.922ValGln: 1.922 ± 0.209
2.019ValArg: 2.019 ± 0.251
3.965ValSer: 3.965 ± 0.307
3.673ValThr: 3.673 ± 0.374
3.065ValVal: 3.065 ± 0.317
0.365ValTrp: 0.365 ± 0.099
3.454ValTyr: 3.454 ± 0.402
0.0ValXaa: 0.0 ± 0.0
Trp
0.414TrpAla: 0.414 ± 0.114
0.097TrpCys: 0.097 ± 0.046
0.365TrpAsp: 0.365 ± 0.1
0.924TrpGlu: 0.924 ± 0.147
0.316TrpPhe: 0.316 ± 0.1
0.438TrpGly: 0.438 ± 0.103
0.146TrpHis: 0.146 ± 0.064
0.511TrpIle: 0.511 ± 0.102
0.876TrpLys: 0.876 ± 0.131
0.681TrpLeu: 0.681 ± 0.139
0.122TrpMet: 0.122 ± 0.05
0.487TrpAsn: 0.487 ± 0.109
0.024TrpPro: 0.024 ± 0.024
0.462TrpGln: 0.462 ± 0.092
0.243TrpArg: 0.243 ± 0.075
0.438TrpSer: 0.438 ± 0.121
0.389TrpThr: 0.389 ± 0.09
0.487TrpVal: 0.487 ± 0.096
0.146TrpTrp: 0.146 ± 0.071
0.608TrpTyr: 0.608 ± 0.128
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.97TyrAla: 1.97 ± 0.229
0.511TyrCys: 0.511 ± 0.11
3.625TyrAsp: 3.625 ± 0.282
3.746TyrGlu: 3.746 ± 0.354
2.408TyrPhe: 2.408 ± 0.271
2.871TyrGly: 2.871 ± 0.306
0.924TyrHis: 0.924 ± 0.152
4.379TyrIle: 4.379 ± 0.352
4.914TyrLys: 4.914 ± 0.422
4.306TyrLeu: 4.306 ± 0.33
1.241TyrMet: 1.241 ± 0.203
4.111TyrAsn: 4.111 ± 0.346
1.143TyrPro: 1.143 ± 0.206
2.068TyrGln: 2.068 ± 0.215
1.922TyrArg: 1.922 ± 0.214
2.968TyrSer: 2.968 ± 0.283
3.381TyrThr: 3.381 ± 0.332
3.333TyrVal: 3.333 ± 0.326
0.414TyrTrp: 0.414 ± 0.117
3.138TyrTyr: 3.138 ± 0.313
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 205 proteins (41108 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski