Amino acid dipeptide frequency for Streptomyces phage Coruscant

Besides single amino acid frequencies, one can also calculate amino acid dipeptide frequencies, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as a 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred randomly and uniformly in proteins, each of the 400 would appear with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility the dipeptides that occur more often than expected are marked in red in the table, and the underrepresented ones are marked in blue.
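As an illustration of the comparison described above, the following Python sketch counts overlapping dipeptides within each protein and labels a frequency as over- or underrepresented relative to the uniform expectation of 2.5‰. It is only a sketch under stated assumptions: the function names are hypothetical, and the normalisation (dividing by the total number of dipeptides counted) is an assumption, not necessarily the procedure used to build the table.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids (one-letter codes)
EXPECTED_PER_MILLE = 1000.0 / 400   # uniform expectation: 0.25% = 2.5 per mille


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: pairs are counted within each protein (never across protein
    boundaries) and normalised by the total number of pairs counted.
    """
    counts = Counter()
    for seq in proteins:
        # Map every non-standard or unknown letter to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"    # would be marked red in the table
    if freq_per_mille < expected:
        return "under"   # would be marked blue in the table
    return "expected"


if __name__ == "__main__":
    # Values taken from the table: AlaAla = 7.47, AlaCys = 0.661 (per mille).
    print(classify(7.47), classify(0.661))
```

With the table values, AlaAla (7.47‰) falls above the 2.5‰ expectation and AlaCys (0.661‰) falls below it.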

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
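The unit conversion can be written out explicitly. The helper names below are hypothetical and only illustrate the arithmetic:

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


if __name__ == "__main__":
    # AlaAla is listed as 7.47 per mille in the table.
    print(per_mille_to_fraction(7.47))
    print(per_mille_to_percent(7.47))
```

For example, the AlaAla value of 7.47‰ corresponds to a fraction of about 0.00747, or about 0.747%.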

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.47 ± 0.891
AlaCys: 0.661 ± 0.158
AlaAsp: 4.626 ± 0.4
AlaGlu: 4.798 ± 0.449
AlaPhe: 3.448 ± 0.296
AlaGly: 5.947 ± 0.579
AlaHis: 1.408 ± 0.21
AlaIle: 4.913 ± 0.46
AlaLys: 5.286 ± 0.482
AlaLeu: 5.372 ± 0.502
AlaMet: 2.873 ± 0.339
AlaAsn: 3.792 ± 0.525
AlaPro: 2.011 ± 0.256
AlaGln: 3.16 ± 0.419
AlaArg: 4.166 ± 0.431
AlaSer: 4.913 ± 0.411
AlaThr: 5.114 ± 0.452
AlaVal: 5.258 ± 0.436
AlaTrp: 1.666 ± 0.193
AlaTyr: 2.816 ± 0.277
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.46 ± 0.102
CysCys: 0.201 ± 0.085
CysAsp: 0.603 ± 0.138
CysGlu: 0.776 ± 0.166
CysPhe: 0.316 ± 0.101
CysGly: 1.322 ± 0.25
CysHis: 0.402 ± 0.113
CysIle: 0.546 ± 0.133
CysLys: 0.718 ± 0.162
CysLeu: 0.69 ± 0.15
CysMet: 0.172 ± 0.074
CysAsn: 0.575 ± 0.161
CysPro: 0.575 ± 0.115
CysGln: 0.345 ± 0.109
CysArg: 0.747 ± 0.154
CysSer: 0.776 ± 0.162
CysThr: 0.46 ± 0.117
CysVal: 0.833 ± 0.164
CysTrp: 0.115 ± 0.051
CysTyr: 0.718 ± 0.194
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.712 ± 0.364
AspCys: 0.661 ± 0.12
AspAsp: 3.62 ± 0.352
AspGlu: 4.424 ± 0.413
AspPhe: 3.419 ± 0.364
AspGly: 5.2 ± 0.531
AspHis: 1.264 ± 0.193
AspIle: 3.792 ± 0.357
AspLys: 3.706 ± 0.339
AspLeu: 5.258 ± 0.43
AspMet: 1.81 ± 0.217
AspAsn: 3.103 ± 0.279
AspPro: 2.442 ± 0.294
AspGln: 1.666 ± 0.203
AspArg: 3.218 ± 0.264
AspSer: 3.563 ± 0.292
AspThr: 3.505 ± 0.344
AspVal: 3.85 ± 0.377
AspTrp: 1.465 ± 0.178
AspTyr: 2.557 ± 0.326
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.062 ± 0.479
GluCys: 0.517 ± 0.138
GluAsp: 4.367 ± 0.369
GluGlu: 5.976 ± 0.566
GluPhe: 3.132 ± 0.255
GluGly: 3.85 ± 0.304
GluHis: 1.465 ± 0.229
GluIle: 4.424 ± 0.327
GluLys: 4.424 ± 0.435
GluLeu: 4.855 ± 0.417
GluMet: 2.873 ± 0.34
GluAsn: 3.275 ± 0.303
GluPro: 1.839 ± 0.244
GluGln: 2.586 ± 0.39
GluArg: 3.735 ± 0.359
GluSer: 3.677 ± 0.333
GluThr: 3.792 ± 0.435
GluVal: 5.171 ± 0.519
GluTrp: 1.523 ± 0.215
GluTyr: 2.413 ± 0.285
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.017 ± 0.276
PheCys: 0.661 ± 0.147
PheAsp: 3.936 ± 0.354
PheGlu: 3.62 ± 0.29
PhePhe: 1.781 ± 0.24
PheGly: 3.074 ± 0.249
PheHis: 0.661 ± 0.139
PheIle: 2.097 ± 0.227
PheLys: 2.729 ± 0.287
PheLeu: 2.212 ± 0.296
PheMet: 1.063 ± 0.2
PheAsn: 2.155 ± 0.237
PhePro: 1.178 ± 0.203
PheGln: 1.178 ± 0.242
PheArg: 2.183 ± 0.241
PheSer: 2.988 ± 0.269
PheThr: 2.988 ± 0.312
PheVal: 2.844 ± 0.26
PheTrp: 0.747 ± 0.16
PheTyr: 1.293 ± 0.183
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.171 ± 0.448
GlyCys: 0.833 ± 0.182
GlyAsp: 3.735 ± 0.328
GlyGlu: 4.309 ± 0.339
GlyPhe: 3.333 ± 0.272
GlyGly: 4.511 ± 0.486
GlyHis: 1.666 ± 0.234
GlyIle: 4.396 ± 0.346
GlyLys: 4.511 ± 0.332
GlyLeu: 4.913 ± 0.429
GlyMet: 2.586 ± 0.327
GlyAsn: 3.333 ± 0.287
GlyPro: 2.643 ± 0.316
GlyGln: 2.643 ± 0.28
GlyArg: 4.626 ± 0.298
GlySer: 3.85 ± 0.356
GlyThr: 5.315 ± 0.508
GlyVal: 5.229 ± 0.34
GlyTrp: 1.609 ± 0.178
GlyTyr: 3.074 ± 0.3
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.178 ± 0.225
HisCys: 0.402 ± 0.115
HisAsp: 1.178 ± 0.204
HisGlu: 0.919 ± 0.174
HisPhe: 0.891 ± 0.173
HisGly: 1.609 ± 0.206
HisHis: 0.46 ± 0.096
HisIle: 1.149 ± 0.2
HisLys: 0.977 ± 0.176
HisLeu: 1.666 ± 0.254
HisMet: 0.431 ± 0.111
HisAsn: 0.747 ± 0.139
HisPro: 0.919 ± 0.186
HisGln: 0.603 ± 0.17
HisArg: 1.264 ± 0.187
HisSer: 1.12 ± 0.167
HisThr: 0.833 ± 0.151
HisVal: 1.523 ± 0.2
HisTrp: 0.316 ± 0.097
HisTyr: 0.948 ± 0.187
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.281 ± 0.375
IleCys: 0.632 ± 0.156
IleAsp: 4.424 ± 0.386
IleGlu: 4.884 ± 0.408
IlePhe: 1.839 ± 0.242
IleGly: 3.821 ± 0.311
IleHis: 0.948 ± 0.171
IleIle: 2.93 ± 0.278
IleLys: 4.08 ± 0.343
IleLeu: 3.792 ± 0.399
IleMet: 1.724 ± 0.222
IleAsn: 2.758 ± 0.271
IlePro: 1.896 ± 0.239
IleGln: 1.781 ± 0.228
IleArg: 2.988 ± 0.307
IleSer: 3.218 ± 0.314
IleThr: 3.476 ± 0.328
IleVal: 4.166 ± 0.293
IleTrp: 1.235 ± 0.223
IleTyr: 1.982 ± 0.219
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.401 ± 0.482
LysCys: 0.977 ± 0.193
LysAsp: 3.591 ± 0.378
LysGlu: 3.936 ± 0.386
LysPhe: 2.499 ± 0.313
LysGly: 3.706 ± 0.322
LysHis: 1.092 ± 0.163
LysIle: 3.879 ± 0.271
LysLys: 4.683 ± 0.428
LysLeu: 3.85 ± 0.374
LysMet: 1.839 ± 0.23
LysAsn: 3.85 ± 0.283
LysPro: 2.499 ± 0.296
LysGln: 2.183 ± 0.314
LysArg: 3.563 ± 0.347
LysSer: 3.735 ± 0.374
LysThr: 3.476 ± 0.348
LysVal: 4.568 ± 0.438
LysTrp: 1.063 ± 0.183
LysTyr: 3.045 ± 0.315
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.861 ± 0.447
LeuCys: 0.919 ± 0.151
LeuAsp: 4.051 ± 0.415
LeuGlu: 5.803 ± 0.47
LeuPhe: 2.844 ± 0.291
LeuGly: 5.143 ± 0.406
LeuHis: 1.063 ± 0.184
LeuIle: 4.08 ± 0.316
LeuLys: 4.338 ± 0.373
LeuLeu: 4.309 ± 0.387
LeuMet: 1.781 ± 0.226
LeuAsn: 3.246 ± 0.296
LeuPro: 2.787 ± 0.299
LeuGln: 2.04 ± 0.29
LeuArg: 4.108 ± 0.342
LeuSer: 5.114 ± 0.403
LeuThr: 4.453 ± 0.33
LeuVal: 4.654 ± 0.427
LeuTrp: 1.293 ± 0.204
LeuTyr: 2.298 ± 0.259
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.672 ± 0.275
MetCys: 0.431 ± 0.132
MetAsp: 1.436 ± 0.22
MetGlu: 1.839 ± 0.234
MetPhe: 1.034 ± 0.175
MetGly: 2.241 ± 0.369
MetHis: 0.603 ± 0.134
MetIle: 1.781 ± 0.222
MetLys: 1.954 ± 0.261
MetLeu: 1.839 ± 0.239
MetMet: 0.69 ± 0.158
MetAsn: 1.839 ± 0.26
MetPro: 1.264 ± 0.205
MetGln: 1.006 ± 0.168
MetArg: 1.925 ± 0.236
MetSer: 2.27 ± 0.212
MetThr: 1.81 ± 0.214
MetVal: 1.896 ± 0.23
MetTrp: 0.402 ± 0.14
MetTyr: 0.891 ± 0.162
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.879 ± 0.501
AsnCys: 0.373 ± 0.104
AsnAsp: 3.218 ± 0.386
AsnGlu: 3.62 ± 0.322
AsnPhe: 1.982 ± 0.206
AsnGly: 3.448 ± 0.374
AsnHis: 0.891 ± 0.166
AsnIle: 2.586 ± 0.278
AsnLys: 2.672 ± 0.307
AsnLeu: 3.16 ± 0.305
AsnMet: 1.322 ± 0.193
AsnAsn: 1.551 ± 0.196
AsnPro: 2.557 ± 0.248
AsnGln: 1.695 ± 0.237
AsnArg: 2.729 ± 0.247
AsnSer: 2.729 ± 0.289
AsnThr: 2.787 ± 0.362
AsnVal: 3.39 ± 0.297
AsnTrp: 0.718 ± 0.131
AsnTyr: 1.896 ± 0.221
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.045 ± 0.29
ProCys: 0.402 ± 0.11
ProAsp: 2.557 ± 0.358
ProGlu: 2.816 ± 0.23
ProPhe: 1.408 ± 0.207
ProGly: 2.959 ± 0.281
ProHis: 0.69 ± 0.139
ProIle: 1.523 ± 0.198
ProLys: 2.385 ± 0.321
ProLeu: 2.442 ± 0.291
ProMet: 0.833 ± 0.186
ProAsn: 1.753 ± 0.267
ProPro: 1.35 ± 0.273
ProGln: 1.063 ± 0.175
ProArg: 1.867 ± 0.218
ProSer: 2.413 ± 0.414
ProThr: 2.298 ± 0.27
ProVal: 3.045 ± 0.239
ProTrp: 0.259 ± 0.094
ProTyr: 1.494 ± 0.252
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.246 ± 0.331
GlnCys: 0.086 ± 0.048
GlnAsp: 1.609 ± 0.253
GlnGlu: 2.385 ± 0.327
GlnPhe: 1.408 ± 0.193
GlnGly: 2.126 ± 0.248
GlnHis: 0.632 ± 0.132
GlnIle: 2.011 ± 0.271
GlnLys: 2.586 ± 0.299
GlnLeu: 2.729 ± 0.282
GlnMet: 1.092 ± 0.262
GlnAsn: 1.322 ± 0.256
GlnPro: 0.919 ± 0.196
GlnGln: 1.006 ± 0.276
GlnArg: 2.385 ± 0.382
GlnSer: 2.212 ± 0.256
GlnThr: 2.04 ± 0.276
GlnVal: 2.241 ± 0.304
GlnTrp: 0.488 ± 0.16
GlnTyr: 1.006 ± 0.154
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.683 ± 0.436
ArgCys: 0.46 ± 0.116
ArgAsp: 2.93 ± 0.311
ArgGlu: 3.505 ± 0.345
ArgPhe: 2.643 ± 0.311
ArgGly: 4.08 ± 0.333
ArgHis: 1.149 ± 0.202
ArgIle: 3.505 ± 0.312
ArgLys: 3.907 ± 0.378
ArgLeu: 4.511 ± 0.299
ArgMet: 1.954 ± 0.252
ArgAsn: 2.557 ± 0.238
ArgPro: 1.753 ± 0.219
ArgGln: 2.011 ± 0.248
ArgArg: 3.132 ± 0.371
ArgSer: 3.132 ± 0.315
ArgThr: 2.816 ± 0.315
ArgVal: 3.563 ± 0.307
ArgTrp: 1.092 ± 0.172
ArgTyr: 2.385 ± 0.28
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.08 ± 0.495
SerCys: 0.69 ± 0.168
SerAsp: 4.424 ± 0.341
SerGlu: 3.563 ± 0.33
SerPhe: 3.045 ± 0.279
SerGly: 5.487 ± 0.424
SerHis: 0.919 ± 0.183
SerIle: 3.706 ± 0.389
SerLys: 3.821 ± 0.36
SerLeu: 5.401 ± 0.418
SerMet: 1.81 ± 0.221
SerAsn: 2.758 ± 0.314
SerPro: 2.298 ± 0.327
SerGln: 2.327 ± 0.248
SerArg: 2.902 ± 0.328
SerSer: 3.361 ± 0.453
SerThr: 3.62 ± 0.4
SerVal: 4.511 ± 0.457
SerTrp: 1.034 ± 0.187
SerTyr: 1.867 ± 0.228
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.798 ± 0.499
ThrCys: 0.804 ± 0.172
ThrAsp: 3.706 ± 0.294
ThrGlu: 3.993 ± 0.315
ThrPhe: 2.557 ± 0.299
ThrGly: 4.712 ± 0.435
ThrHis: 0.833 ± 0.183
ThrIle: 3.246 ± 0.414
ThrLys: 2.902 ± 0.357
ThrLeu: 4.769 ± 0.334
ThrMet: 1.551 ± 0.22
ThrAsn: 2.471 ± 0.289
ThrPro: 2.356 ± 0.35
ThrGln: 2.27 ± 0.266
ThrArg: 3.017 ± 0.319
ThrSer: 4.137 ± 0.434
ThrThr: 3.965 ± 0.625
ThrVal: 4.97 ± 0.485
ThrTrp: 1.178 ± 0.175
ThrTyr: 2.126 ± 0.254
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.487 ± 0.368
ValCys: 0.862 ± 0.145
ValAsp: 5.401 ± 0.454
ValGlu: 5.028 ± 0.424
ValPhe: 2.729 ± 0.252
ValGly: 4.712 ± 0.328
ValHis: 1.638 ± 0.233
ValIle: 3.534 ± 0.326
ValLys: 4.396 ± 0.383
ValLeu: 4.396 ± 0.347
ValMet: 1.982 ± 0.225
ValAsn: 2.902 ± 0.342
ValPro: 3.333 ± 0.364
ValGln: 2.27 ± 0.245
ValArg: 3.936 ± 0.372
ValSer: 4.798 ± 0.47
ValThr: 4.108 ± 0.463
ValVal: 5.315 ± 0.487
ValTrp: 1.35 ± 0.199
ValTyr: 2.614 ± 0.301
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.207 ± 0.165
TrpCys: 0.172 ± 0.069
TrpAsp: 1.207 ± 0.178
TrpGlu: 1.436 ± 0.2
TrpPhe: 0.948 ± 0.153
TrpGly: 1.465 ± 0.21
TrpHis: 0.69 ± 0.141
TrpIle: 0.804 ± 0.167
TrpLys: 1.322 ± 0.206
TrpLeu: 1.35 ± 0.244
TrpMet: 0.661 ± 0.141
TrpAsn: 1.436 ± 0.228
TrpPro: 0.575 ± 0.128
TrpGln: 0.747 ± 0.139
TrpArg: 0.862 ± 0.163
TrpSer: 1.12 ± 0.179
TrpThr: 1.006 ± 0.162
TrpVal: 0.718 ± 0.138
TrpTrp: 0.345 ± 0.107
TrpTyr: 0.488 ± 0.138
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.988 ± 0.25
TyrCys: 0.603 ± 0.135
TyrAsp: 2.614 ± 0.257
TyrGlu: 2.327 ± 0.292
TyrPhe: 1.149 ± 0.212
TyrGly: 2.873 ± 0.302
TyrHis: 0.661 ± 0.149
TyrIle: 1.954 ± 0.26
TyrLys: 2.04 ± 0.231
TyrLeu: 2.701 ± 0.274
TyrMet: 0.747 ± 0.155
TyrAsn: 1.781 ± 0.259
TyrPro: 1.408 ± 0.217
TyrGln: 0.948 ± 0.155
TyrArg: 2.413 ± 0.262
TyrSer: 2.499 ± 0.271
TyrThr: 2.499 ± 0.281
TyrVal: 3.045 ± 0.293
TyrTrp: 0.661 ± 0.177
TyrTyr: 1.293 ± 0.213
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 225 proteins (34808 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
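One way to read "bootstrapping at the protein level" is to resample whole proteins with replacement, recompute the dipeptide frequency for each resample, and report the spread of those estimates. The Python sketch below follows that reading. The function names, the fixed seed, and the use of the sample standard deviation as the error are assumptions, not a description of the database's actual code.

```python
import random
from statistics import stdev


def pair_frequency(proteins, pair):
    """Frequency (per mille) of one dipeptide among all overlapping pairs."""
    count = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                count += 1
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Each resample draws len(proteins) whole proteins with replacement, so
    residues of one protein always stay together (protein-level bootstrap).
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return stdev(estimates)


if __name__ == "__main__":
    toy_proteome = ["MAAKL", "MAAGAA", "MKLV", "MAAAV"]  # made-up sequences
    print(pair_frequency(toy_proteome, "AA"), bootstrap_error(toy_proteome, "AA"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins in the proteome.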

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski