Amino acid dipepetide frequency for Pseudomonas phage SM1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.466AlaAla: 10.466 ± 1.329
1.018AlaCys: 1.018 ± 0.213
5.057AlaAsp: 5.057 ± 0.494
7.024AlaGlu: 7.024 ± 0.615
3.126AlaPhe: 3.126 ± 0.417
6.743AlaGly: 6.743 ± 0.587
1.861AlaHis: 1.861 ± 0.277
4.671AlaIle: 4.671 ± 0.359
5.444AlaLys: 5.444 ± 0.688
8.394AlaLeu: 8.394 ± 0.701
2.739AlaMet: 2.739 ± 0.347
3.723AlaAsn: 3.723 ± 0.502
3.688AlaPro: 3.688 ± 0.391
3.863AlaGln: 3.863 ± 0.686
7.305AlaArg: 7.305 ± 0.559
5.865AlaSer: 5.865 ± 0.592
4.741AlaThr: 4.741 ± 0.511
6.392AlaVal: 6.392 ± 0.486
1.159AlaTrp: 1.159 ± 0.191
3.301AlaTyr: 3.301 ± 0.297
0.0AlaXaa: 0.0 ± 0.0
Cys
0.878CysAla: 0.878 ± 0.168
0.14CysCys: 0.14 ± 0.075
0.562CysAsp: 0.562 ± 0.156
0.667CysGlu: 0.667 ± 0.185
0.421CysPhe: 0.421 ± 0.125
0.843CysGly: 0.843 ± 0.198
0.316CysHis: 0.316 ± 0.11
0.527CysIle: 0.527 ± 0.141
0.597CysLys: 0.597 ± 0.14
1.264CysLeu: 1.264 ± 0.261
0.211CysMet: 0.211 ± 0.073
0.281CysAsn: 0.281 ± 0.1
0.597CysPro: 0.597 ± 0.175
0.211CysGln: 0.211 ± 0.086
1.018CysArg: 1.018 ± 0.238
0.632CysSer: 0.632 ± 0.147
0.702CysThr: 0.702 ± 0.177
1.264CysVal: 1.264 ± 0.219
0.176CysTrp: 0.176 ± 0.077
0.281CysTyr: 0.281 ± 0.093
0.0CysXaa: 0.0 ± 0.0
Asp
5.935AspAla: 5.935 ± 0.496
0.773AspCys: 0.773 ± 0.198
3.442AspAsp: 3.442 ± 0.408
4.39AspGlu: 4.39 ± 0.416
2.599AspPhe: 2.599 ± 0.404
4.741AspGly: 4.741 ± 0.389
0.808AspHis: 0.808 ± 0.194
2.985AspIle: 2.985 ± 0.325
1.967AspLys: 1.967 ± 0.22
5.303AspLeu: 5.303 ± 0.447
1.51AspMet: 1.51 ± 0.207
1.58AspAsn: 1.58 ± 0.302
3.055AspPro: 3.055 ± 0.317
2.037AspGln: 2.037 ± 0.276
3.758AspArg: 3.758 ± 0.368
3.688AspSer: 3.688 ± 0.369
3.126AspThr: 3.126 ± 0.281
3.477AspVal: 3.477 ± 0.347
1.054AspTrp: 1.054 ± 0.177
1.686AspTyr: 1.686 ± 0.224
0.0AspXaa: 0.0 ± 0.0
Glu
6.497GluAla: 6.497 ± 0.628
0.667GluCys: 0.667 ± 0.153
3.407GluAsp: 3.407 ± 0.432
5.76GluGlu: 5.76 ± 0.934
3.301GluPhe: 3.301 ± 0.377
4.249GluGly: 4.249 ± 0.395
1.791GluHis: 1.791 ± 0.291
3.371GluIle: 3.371 ± 0.316
3.196GluLys: 3.196 ± 0.347
6.989GluLeu: 6.989 ± 0.484
2.002GluMet: 2.002 ± 0.265
2.564GluAsn: 2.564 ± 0.267
3.512GluPro: 3.512 ± 0.426
3.652GluGln: 3.652 ± 0.468
4.32GluArg: 4.32 ± 0.42
3.547GluSer: 3.547 ± 0.374
3.301GluThr: 3.301 ± 0.469
5.689GluVal: 5.689 ± 0.471
0.948GluTrp: 0.948 ± 0.198
2.142GluTyr: 2.142 ± 0.246
0.0GluXaa: 0.0 ± 0.0
Phe
2.388PheAla: 2.388 ± 0.294
0.492PheCys: 0.492 ± 0.155
2.985PheAsp: 2.985 ± 0.297
2.95PheGlu: 2.95 ± 0.321
1.37PhePhe: 1.37 ± 0.177
2.845PheGly: 2.845 ± 0.375
0.492PheHis: 0.492 ± 0.126
1.545PheIle: 1.545 ± 0.212
1.194PheLys: 1.194 ± 0.223
2.985PheLeu: 2.985 ± 0.347
0.738PheMet: 0.738 ± 0.176
1.37PheAsn: 1.37 ± 0.185
1.58PhePro: 1.58 ± 0.213
1.335PheGln: 1.335 ± 0.256
2.88PheArg: 2.88 ± 0.371
2.353PheSer: 2.353 ± 0.321
2.177PheThr: 2.177 ± 0.345
2.634PheVal: 2.634 ± 0.311
0.773PheTrp: 0.773 ± 0.201
1.335PheTyr: 1.335 ± 0.197
0.0PheXaa: 0.0 ± 0.0
Gly
5.689GlyAla: 5.689 ± 0.487
0.808GlyCys: 0.808 ± 0.154
4.811GlyAsp: 4.811 ± 0.388
4.566GlyGlu: 4.566 ± 0.429
3.336GlyPhe: 3.336 ± 0.345
5.584GlyGly: 5.584 ± 0.568
1.475GlyHis: 1.475 ± 0.317
3.828GlyIle: 3.828 ± 0.467
3.793GlyLys: 3.793 ± 0.397
6.603GlyLeu: 6.603 ± 0.518
2.529GlyMet: 2.529 ± 0.314
2.81GlyAsn: 2.81 ± 0.444
2.458GlyPro: 2.458 ± 0.335
2.739GlyGln: 2.739 ± 0.346
4.601GlyArg: 4.601 ± 0.396
4.917GlySer: 4.917 ± 0.524
3.442GlyThr: 3.442 ± 0.369
5.689GlyVal: 5.689 ± 0.482
1.475GlyTrp: 1.475 ± 0.321
2.95GlyTyr: 2.95 ± 0.344
0.0GlyXaa: 0.0 ± 0.0
His
2.037HisAla: 2.037 ± 0.298
0.421HisCys: 0.421 ± 0.143
1.018HisAsp: 1.018 ± 0.248
1.159HisGlu: 1.159 ± 0.279
0.738HisPhe: 0.738 ± 0.184
1.299HisGly: 1.299 ± 0.22
0.386HisHis: 0.386 ± 0.115
0.843HisIle: 0.843 ± 0.152
0.913HisLys: 0.913 ± 0.174
1.44HisLeu: 1.44 ± 0.248
0.386HisMet: 0.386 ± 0.126
0.527HisAsn: 0.527 ± 0.167
1.229HisPro: 1.229 ± 0.209
0.632HisGln: 0.632 ± 0.175
1.194HisArg: 1.194 ± 0.2
1.264HisSer: 1.264 ± 0.294
1.089HisThr: 1.089 ± 0.206
1.335HisVal: 1.335 ± 0.284
0.035HisTrp: 0.035 ± 0.035
0.738HisTyr: 0.738 ± 0.15
0.0HisXaa: 0.0 ± 0.0
Ile
5.092IleAla: 5.092 ± 0.633
0.457IleCys: 0.457 ± 0.132
3.547IleAsp: 3.547 ± 0.323
3.512IleGlu: 3.512 ± 0.356
1.229IlePhe: 1.229 ± 0.213
3.371IleGly: 3.371 ± 0.4
0.913IleHis: 0.913 ± 0.147
1.721IleIle: 1.721 ± 0.271
1.58IleLys: 1.58 ± 0.266
3.688IleLeu: 3.688 ± 0.423
0.632IleMet: 0.632 ± 0.123
1.932IleAsn: 1.932 ± 0.272
2.142IlePro: 2.142 ± 0.267
2.002IleGln: 2.002 ± 0.246
3.723IleArg: 3.723 ± 0.441
2.985IleSer: 2.985 ± 0.344
3.02IleThr: 3.02 ± 0.369
3.793IleVal: 3.793 ± 0.377
0.702IleTrp: 0.702 ± 0.164
1.159IleTyr: 1.159 ± 0.218
0.0IleXaa: 0.0 ± 0.0
Lys
5.092LysAla: 5.092 ± 0.632
0.421LysCys: 0.421 ± 0.173
2.458LysAsp: 2.458 ± 0.35
2.88LysGlu: 2.88 ± 0.353
1.44LysPhe: 1.44 ± 0.233
3.126LysGly: 3.126 ± 0.37
0.878LysHis: 0.878 ± 0.183
1.932LysIle: 1.932 ± 0.312
2.177LysLys: 2.177 ± 0.33
4.355LysLeu: 4.355 ± 0.345
1.405LysMet: 1.405 ± 0.227
1.651LysAsn: 1.651 ± 0.227
2.915LysPro: 2.915 ± 0.444
1.335LysGln: 1.335 ± 0.23
3.02LysArg: 3.02 ± 0.44
2.704LysSer: 2.704 ± 0.29
2.318LysThr: 2.318 ± 0.275
2.739LysVal: 2.739 ± 0.302
0.492LysTrp: 0.492 ± 0.117
1.054LysTyr: 1.054 ± 0.205
0.0LysXaa: 0.0 ± 0.0
Leu
8.499LeuAla: 8.499 ± 0.578
1.124LeuCys: 1.124 ± 0.175
5.408LeuAsp: 5.408 ± 0.427
5.725LeuGlu: 5.725 ± 0.528
3.055LeuPhe: 3.055 ± 0.365
7.656LeuGly: 7.656 ± 0.52
2.107LeuHis: 2.107 ± 0.343
3.617LeuIle: 3.617 ± 0.341
4.004LeuLys: 4.004 ± 0.4
6.883LeuLeu: 6.883 ± 0.539
1.756LeuMet: 1.756 ± 0.261
3.371LeuAsn: 3.371 ± 0.35
4.636LeuPro: 4.636 ± 0.452
2.88LeuGln: 2.88 ± 0.416
6.216LeuArg: 6.216 ± 0.437
6.462LeuSer: 6.462 ± 0.459
5.303LeuThr: 5.303 ± 0.447
6.638LeuVal: 6.638 ± 0.483
0.913LeuTrp: 0.913 ± 0.171
3.055LeuTyr: 3.055 ± 0.344
0.0LeuXaa: 0.0 ± 0.0
Met
3.231MetAla: 3.231 ± 0.327
0.246MetCys: 0.246 ± 0.087
1.264MetAsp: 1.264 ± 0.253
1.44MetGlu: 1.44 ± 0.28
0.667MetPhe: 0.667 ± 0.158
1.967MetGly: 1.967 ± 0.372
0.386MetHis: 0.386 ± 0.119
1.018MetIle: 1.018 ± 0.184
0.983MetLys: 0.983 ± 0.148
1.791MetLeu: 1.791 ± 0.255
0.386MetMet: 0.386 ± 0.104
0.913MetAsn: 0.913 ± 0.175
1.264MetPro: 1.264 ± 0.214
1.054MetGln: 1.054 ± 0.21
1.861MetArg: 1.861 ± 0.242
1.932MetSer: 1.932 ± 0.231
1.58MetThr: 1.58 ± 0.257
1.44MetVal: 1.44 ± 0.251
0.176MetTrp: 0.176 ± 0.095
0.562MetTyr: 0.562 ± 0.117
0.0MetXaa: 0.0 ± 0.0
Asn
3.371AsnAla: 3.371 ± 0.446
0.386AsnCys: 0.386 ± 0.14
1.405AsnAsp: 1.405 ± 0.226
2.774AsnGlu: 2.774 ± 0.235
1.37AsnPhe: 1.37 ± 0.22
3.407AsnGly: 3.407 ± 0.446
0.492AsnHis: 0.492 ± 0.13
1.51AsnIle: 1.51 ± 0.227
1.58AsnLys: 1.58 ± 0.208
4.144AsnLeu: 4.144 ± 0.503
0.948AsnMet: 0.948 ± 0.198
1.686AsnAsn: 1.686 ± 0.354
2.072AsnPro: 2.072 ± 0.291
1.405AsnGln: 1.405 ± 0.238
2.283AsnArg: 2.283 ± 0.308
2.318AsnSer: 2.318 ± 0.297
1.826AsnThr: 1.826 ± 0.297
1.896AsnVal: 1.896 ± 0.276
0.386AsnTrp: 0.386 ± 0.1
1.054AsnTyr: 1.054 ± 0.168
0.0AsnXaa: 0.0 ± 0.0
Pro
4.53ProAla: 4.53 ± 0.518
0.527ProCys: 0.527 ± 0.161
2.739ProAsp: 2.739 ± 0.357
3.512ProGlu: 3.512 ± 0.475
1.089ProPhe: 1.089 ± 0.18
3.407ProGly: 3.407 ± 0.331
0.527ProHis: 0.527 ± 0.147
2.002ProIle: 2.002 ± 0.234
2.072ProLys: 2.072 ± 0.327
3.371ProLeu: 3.371 ± 0.337
0.983ProMet: 0.983 ± 0.205
1.721ProAsn: 1.721 ± 0.249
2.037ProPro: 2.037 ± 0.307
1.44ProGln: 1.44 ± 0.278
2.88ProArg: 2.88 ± 0.4
3.231ProSer: 3.231 ± 0.368
2.81ProThr: 2.81 ± 0.329
3.758ProVal: 3.758 ± 0.34
0.808ProTrp: 0.808 ± 0.156
2.002ProTyr: 2.002 ± 0.329
0.0ProXaa: 0.0 ± 0.0
Gln
4.882GlnAla: 4.882 ± 0.79
0.351GlnCys: 0.351 ± 0.121
1.686GlnAsp: 1.686 ± 0.247
2.599GlnGlu: 2.599 ± 0.364
1.37GlnPhe: 1.37 ± 0.188
3.02GlnGly: 3.02 ± 0.381
0.702GlnHis: 0.702 ± 0.183
1.932GlnIle: 1.932 ± 0.275
1.545GlnLys: 1.545 ± 0.291
2.985GlnLeu: 2.985 ± 0.294
0.562GlnMet: 0.562 ± 0.128
1.37GlnAsn: 1.37 ± 0.219
1.826GlnPro: 1.826 ± 0.241
1.721GlnGln: 1.721 ± 0.417
3.126GlnArg: 3.126 ± 0.396
1.616GlnSer: 1.616 ± 0.234
1.932GlnThr: 1.932 ± 0.277
2.529GlnVal: 2.529 ± 0.313
0.457GlnTrp: 0.457 ± 0.11
1.264GlnTyr: 1.264 ± 0.175
0.0GlnXaa: 0.0 ± 0.0
Arg
6.041ArgAla: 6.041 ± 0.563
0.667ArgCys: 0.667 ± 0.184
4.074ArgAsp: 4.074 ± 0.378
4.952ArgGlu: 4.952 ± 0.465
2.599ArgPhe: 2.599 ± 0.304
4.249ArgGly: 4.249 ± 0.375
1.264ArgHis: 1.264 ± 0.197
4.249ArgIle: 4.249 ± 0.465
3.266ArgLys: 3.266 ± 0.406
7.551ArgLeu: 7.551 ± 0.536
2.177ArgMet: 2.177 ± 0.305
2.704ArgAsn: 2.704 ± 0.352
2.458ArgPro: 2.458 ± 0.328
3.055ArgGln: 3.055 ± 0.353
7.235ArgArg: 7.235 ± 0.813
5.092ArgSer: 5.092 ± 0.708
3.512ArgThr: 3.512 ± 0.453
5.163ArgVal: 5.163 ± 0.453
0.562ArgTrp: 0.562 ± 0.162
1.932ArgTyr: 1.932 ± 0.261
0.0ArgXaa: 0.0 ± 0.0
Ser
6.427SerAla: 6.427 ± 0.619
0.632SerCys: 0.632 ± 0.164
4.004SerAsp: 4.004 ± 0.315
4.671SerGlu: 4.671 ± 0.465
2.564SerPhe: 2.564 ± 0.292
5.479SerGly: 5.479 ± 0.587
0.843SerHis: 0.843 ± 0.201
3.793SerIle: 3.793 ± 0.359
2.985SerLys: 2.985 ± 0.349
5.233SerLeu: 5.233 ± 0.525
1.299SerMet: 1.299 ± 0.202
2.774SerAsn: 2.774 ± 0.38
2.072SerPro: 2.072 ± 0.223
1.756SerGln: 1.756 ± 0.248
4.495SerArg: 4.495 ± 0.583
4.53SerSer: 4.53 ± 0.476
2.985SerThr: 2.985 ± 0.279
4.847SerVal: 4.847 ± 0.391
0.983SerTrp: 0.983 ± 0.184
1.932SerTyr: 1.932 ± 0.327
0.0SerXaa: 0.0 ± 0.0
Thr
5.198ThrAla: 5.198 ± 0.489
0.632ThrCys: 0.632 ± 0.138
3.301ThrAsp: 3.301 ± 0.4
3.863ThrGlu: 3.863 ± 0.366
1.791ThrPhe: 1.791 ± 0.264
4.179ThrGly: 4.179 ± 0.373
0.913ThrHis: 0.913 ± 0.214
2.704ThrIle: 2.704 ± 0.374
2.283ThrLys: 2.283 ± 0.347
5.479ThrLeu: 5.479 ± 0.464
1.054ThrMet: 1.054 ± 0.207
1.545ThrAsn: 1.545 ± 0.233
2.599ThrPro: 2.599 ± 0.244
1.932ThrGln: 1.932 ± 0.227
3.512ThrArg: 3.512 ± 0.313
3.091ThrSer: 3.091 ± 0.309
3.863ThrThr: 3.863 ± 0.395
3.512ThrVal: 3.512 ± 0.416
0.773ThrTrp: 0.773 ± 0.16
1.616ThrTyr: 1.616 ± 0.287
0.0ThrXaa: 0.0 ± 0.0
Val
6.251ValAla: 6.251 ± 0.484
1.264ValCys: 1.264 ± 0.236
4.179ValAsp: 4.179 ± 0.434
5.338ValGlu: 5.338 ± 0.465
2.739ValPhe: 2.739 ± 0.239
4.566ValGly: 4.566 ± 0.398
1.299ValHis: 1.299 ± 0.243
3.301ValIle: 3.301 ± 0.389
3.02ValLys: 3.02 ± 0.333
6.146ValLeu: 6.146 ± 0.455
1.37ValMet: 1.37 ± 0.19
2.248ValAsn: 2.248 ± 0.265
3.723ValPro: 3.723 ± 0.424
2.915ValGln: 2.915 ± 0.357
5.83ValArg: 5.83 ± 0.388
5.408ValSer: 5.408 ± 0.524
3.688ValThr: 3.688 ± 0.387
6.497ValVal: 6.497 ± 0.51
1.054ValTrp: 1.054 ± 0.203
2.388ValTyr: 2.388 ± 0.4
0.0ValXaa: 0.0 ± 0.0
Trp
1.299TrpAla: 1.299 ± 0.22
0.07TrpCys: 0.07 ± 0.046
0.562TrpAsp: 0.562 ± 0.128
0.843TrpGlu: 0.843 ± 0.14
0.421TrpPhe: 0.421 ± 0.108
1.054TrpGly: 1.054 ± 0.217
0.246TrpHis: 0.246 ± 0.094
0.386TrpIle: 0.386 ± 0.118
0.667TrpLys: 0.667 ± 0.158
1.44TrpLeu: 1.44 ± 0.211
0.667TrpMet: 0.667 ± 0.142
0.457TrpAsn: 0.457 ± 0.097
0.562TrpPro: 0.562 ± 0.14
0.492TrpGln: 0.492 ± 0.126
0.983TrpArg: 0.983 ± 0.162
0.878TrpSer: 0.878 ± 0.229
0.632TrpThr: 0.632 ± 0.129
1.335TrpVal: 1.335 ± 0.245
0.14TrpTrp: 0.14 ± 0.083
0.386TrpTyr: 0.386 ± 0.112
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.774TyrAla: 2.774 ± 0.283
0.492TyrCys: 0.492 ± 0.126
2.002TyrAsp: 2.002 ± 0.344
2.283TyrGlu: 2.283 ± 0.33
1.194TyrPhe: 1.194 ± 0.198
2.318TyrGly: 2.318 ± 0.307
0.983TyrHis: 0.983 ± 0.184
1.264TyrIle: 1.264 ± 0.215
1.124TyrLys: 1.124 ± 0.214
3.266TyrLeu: 3.266 ± 0.335
0.773TyrMet: 0.773 ± 0.184
1.089TyrAsn: 1.089 ± 0.185
1.054TyrPro: 1.054 ± 0.204
1.018TyrGln: 1.018 ± 0.197
2.564TyrArg: 2.564 ± 0.298
1.791TyrSer: 1.791 ± 0.247
1.791TyrThr: 1.791 ± 0.283
2.599TyrVal: 2.599 ± 0.331
0.457TyrTrp: 0.457 ± 0.153
0.983TyrTyr: 0.983 ± 0.163
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 129 proteins (28475 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski