Amino acid dipepetide frequency for Streptomyces phage Wofford

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.136AlaAla: 7.136 ± 0.982
0.708AlaCys: 0.708 ± 0.163
4.571AlaAsp: 4.571 ± 0.367
6.075AlaGlu: 6.075 ± 0.541
3.037AlaPhe: 3.037 ± 0.305
6.488AlaGly: 6.488 ± 0.564
1.327AlaHis: 1.327 ± 0.202
4.571AlaIle: 4.571 ± 0.45
5.249AlaLys: 5.249 ± 0.419
6.576AlaLeu: 6.576 ± 0.542
2.831AlaMet: 2.831 ± 0.296
3.893AlaAsn: 3.893 ± 0.494
3.037AlaPro: 3.037 ± 0.403
2.801AlaGln: 2.801 ± 0.301
4.512AlaArg: 4.512 ± 0.441
4.689AlaSer: 4.689 ± 0.499
4.689AlaThr: 4.689 ± 0.635
5.19AlaVal: 5.19 ± 0.499
1.415AlaTrp: 1.415 ± 0.185
3.332AlaTyr: 3.332 ± 0.306
0.0AlaXaa: 0.0 ± 0.0
Cys
0.678CysAla: 0.678 ± 0.149
0.147CysCys: 0.147 ± 0.079
0.708CysAsp: 0.708 ± 0.167
0.619CysGlu: 0.619 ± 0.16
0.265CysPhe: 0.265 ± 0.102
1.091CysGly: 1.091 ± 0.215
0.265CysHis: 0.265 ± 0.093
0.59CysIle: 0.59 ± 0.144
0.796CysLys: 0.796 ± 0.177
0.649CysLeu: 0.649 ± 0.135
0.413CysMet: 0.413 ± 0.114
0.59CysAsn: 0.59 ± 0.146
0.472CysPro: 0.472 ± 0.13
0.383CysGln: 0.383 ± 0.124
0.737CysArg: 0.737 ± 0.157
0.796CysSer: 0.796 ± 0.199
0.472CysThr: 0.472 ± 0.143
0.767CysVal: 0.767 ± 0.167
0.177CysTrp: 0.177 ± 0.077
0.324CysTyr: 0.324 ± 0.106
0.0CysXaa: 0.0 ± 0.0
Asp
5.573AspAla: 5.573 ± 0.491
0.767AspCys: 0.767 ± 0.164
3.775AspAsp: 3.775 ± 0.43
4.836AspGlu: 4.836 ± 0.437
3.214AspPhe: 3.214 ± 0.315
5.986AspGly: 5.986 ± 0.536
0.973AspHis: 0.973 ± 0.149
3.332AspIle: 3.332 ± 0.384
3.893AspLys: 3.893 ± 0.423
4.954AspLeu: 4.954 ± 0.426
1.917AspMet: 1.917 ± 0.253
3.185AspAsn: 3.185 ± 0.3
2.625AspPro: 2.625 ± 0.248
1.917AspGln: 1.917 ± 0.273
3.126AspArg: 3.126 ± 0.328
3.981AspSer: 3.981 ± 0.358
3.096AspThr: 3.096 ± 0.37
4.482AspVal: 4.482 ± 0.454
1.533AspTrp: 1.533 ± 0.194
2.683AspTyr: 2.683 ± 0.281
0.0AspXaa: 0.0 ± 0.0
Glu
6.37GluAla: 6.37 ± 0.47
0.914GluCys: 0.914 ± 0.187
4.246GluAsp: 4.246 ± 0.404
4.984GluGlu: 4.984 ± 0.521
3.126GluPhe: 3.126 ± 0.299
4.099GluGly: 4.099 ± 0.358
1.327GluHis: 1.327 ± 0.257
4.807GluIle: 4.807 ± 0.358
4.394GluLys: 4.394 ± 0.493
5.514GluLeu: 5.514 ± 0.422
2.3GluMet: 2.3 ± 0.235
2.949GluAsn: 2.949 ± 0.286
1.681GluPro: 1.681 ± 0.211
2.625GluGln: 2.625 ± 0.341
4.482GluArg: 4.482 ± 0.421
3.332GluSer: 3.332 ± 0.363
3.421GluThr: 3.421 ± 0.353
4.984GluVal: 4.984 ± 0.449
1.209GluTrp: 1.209 ± 0.228
2.507GluTyr: 2.507 ± 0.283
0.0GluXaa: 0.0 ± 0.0
Phe
2.33PheAla: 2.33 ± 0.32
0.442PheCys: 0.442 ± 0.12
3.509PheAsp: 3.509 ± 0.286
3.421PheGlu: 3.421 ± 0.305
1.445PhePhe: 1.445 ± 0.247
2.919PheGly: 2.919 ± 0.311
0.767PheHis: 0.767 ± 0.136
2.094PheIle: 2.094 ± 0.262
1.917PheLys: 1.917 ± 0.239
2.035PheLeu: 2.035 ± 0.283
0.944PheMet: 0.944 ± 0.164
2.123PheAsn: 2.123 ± 0.264
1.18PhePro: 1.18 ± 0.171
1.091PheGln: 1.091 ± 0.218
2.153PheArg: 2.153 ± 0.26
2.742PheSer: 2.742 ± 0.3
2.507PheThr: 2.507 ± 0.268
2.33PheVal: 2.33 ± 0.365
0.354PheTrp: 0.354 ± 0.099
1.563PheTyr: 1.563 ± 0.221
0.0PheXaa: 0.0 ± 0.0
Gly
4.836GlyAla: 4.836 ± 0.407
0.649GlyCys: 0.649 ± 0.137
4.571GlyAsp: 4.571 ± 0.48
4.807GlyGlu: 4.807 ± 0.398
3.008GlyPhe: 3.008 ± 0.305
4.807GlyGly: 4.807 ± 0.523
1.563GlyHis: 1.563 ± 0.25
4.63GlyIle: 4.63 ± 0.48
5.78GlyLys: 5.78 ± 0.491
5.573GlyLeu: 5.573 ± 0.577
2.448GlyMet: 2.448 ± 0.22
3.804GlyAsn: 3.804 ± 0.436
2.742GlyPro: 2.742 ± 0.622
2.182GlyGln: 2.182 ± 0.266
3.775GlyArg: 3.775 ± 0.303
4.069GlySer: 4.069 ± 0.397
5.632GlyThr: 5.632 ± 0.739
6.075GlyVal: 6.075 ± 0.43
1.563GlyTrp: 1.563 ± 0.211
2.801GlyTyr: 2.801 ± 0.295
0.0GlyXaa: 0.0 ± 0.0
His
1.15HisAla: 1.15 ± 0.186
0.147HisCys: 0.147 ± 0.071
1.18HisAsp: 1.18 ± 0.192
1.209HisGlu: 1.209 ± 0.18
0.531HisPhe: 0.531 ± 0.122
1.415HisGly: 1.415 ± 0.226
0.472HisHis: 0.472 ± 0.117
0.796HisIle: 0.796 ± 0.118
1.062HisLys: 1.062 ± 0.179
1.386HisLeu: 1.386 ± 0.208
0.354HisMet: 0.354 ± 0.092
0.885HisAsn: 0.885 ± 0.168
0.767HisPro: 0.767 ± 0.151
0.413HisGln: 0.413 ± 0.108
1.268HisArg: 1.268 ± 0.215
0.914HisSer: 0.914 ± 0.138
1.003HisThr: 1.003 ± 0.216
1.298HisVal: 1.298 ± 0.164
0.295HisTrp: 0.295 ± 0.094
0.885HisTyr: 0.885 ± 0.166
0.0HisXaa: 0.0 ± 0.0
Ile
4.689IleAla: 4.689 ± 0.43
0.708IleCys: 0.708 ± 0.157
4.423IleAsp: 4.423 ± 0.368
4.63IleGlu: 4.63 ± 0.379
1.474IlePhe: 1.474 ± 0.205
4.128IleGly: 4.128 ± 0.36
0.855IleHis: 0.855 ± 0.17
3.126IleIle: 3.126 ± 0.303
4.335IleLys: 4.335 ± 0.361
3.037IleLeu: 3.037 ± 0.28
1.622IleMet: 1.622 ± 0.242
2.683IleAsn: 2.683 ± 0.295
1.946IlePro: 1.946 ± 0.201
1.976IleGln: 1.976 ± 0.285
2.978IleArg: 2.978 ± 0.318
3.273IleSer: 3.273 ± 0.332
4.01IleThr: 4.01 ± 0.423
3.952IleVal: 3.952 ± 0.371
0.885IleTrp: 0.885 ± 0.17
1.828IleTyr: 1.828 ± 0.303
0.0IleXaa: 0.0 ± 0.0
Lys
5.662LysAla: 5.662 ± 0.443
0.796LysCys: 0.796 ± 0.181
3.922LysAsp: 3.922 ± 0.415
3.922LysGlu: 3.922 ± 0.342
2.241LysPhe: 2.241 ± 0.276
3.893LysGly: 3.893 ± 0.46
1.091LysHis: 1.091 ± 0.191
4.748LysIle: 4.748 ± 0.325
4.984LysLys: 4.984 ± 0.441
3.775LysLeu: 3.775 ± 0.34
2.153LysMet: 2.153 ± 0.253
3.686LysAsn: 3.686 ± 0.37
2.536LysPro: 2.536 ± 0.274
2.566LysGln: 2.566 ± 0.328
3.716LysArg: 3.716 ± 0.401
3.627LysSer: 3.627 ± 0.341
3.922LysThr: 3.922 ± 0.329
3.804LysVal: 3.804 ± 0.351
1.15LysTrp: 1.15 ± 0.195
2.683LysTyr: 2.683 ± 0.317
0.0LysXaa: 0.0 ± 0.0
Leu
6.134LeuAla: 6.134 ± 0.409
0.678LeuCys: 0.678 ± 0.144
5.043LeuAsp: 5.043 ± 0.339
5.632LeuGlu: 5.632 ± 0.494
2.566LeuPhe: 2.566 ± 0.296
5.161LeuGly: 5.161 ± 0.425
1.327LeuHis: 1.327 ± 0.19
4.217LeuIle: 4.217 ± 0.327
4.158LeuLys: 4.158 ± 0.388
4.364LeuLeu: 4.364 ± 0.428
1.946LeuMet: 1.946 ± 0.288
3.303LeuAsn: 3.303 ± 0.3
3.008LeuPro: 3.008 ± 0.349
1.622LeuGln: 1.622 ± 0.28
3.804LeuArg: 3.804 ± 0.347
5.043LeuSer: 5.043 ± 0.362
5.072LeuThr: 5.072 ± 0.433
4.305LeuVal: 4.305 ± 0.418
1.209LeuTrp: 1.209 ± 0.196
2.389LeuTyr: 2.389 ± 0.265
0.0LeuXaa: 0.0 ± 0.0
Met
2.949MetAla: 2.949 ± 0.283
0.383MetCys: 0.383 ± 0.123
1.769MetAsp: 1.769 ± 0.251
1.592MetGlu: 1.592 ± 0.24
0.826MetPhe: 0.826 ± 0.175
2.064MetGly: 2.064 ± 0.306
0.442MetHis: 0.442 ± 0.112
2.123MetIle: 2.123 ± 0.265
1.917MetLys: 1.917 ± 0.251
1.917MetLeu: 1.917 ± 0.235
0.944MetMet: 0.944 ± 0.152
1.504MetAsn: 1.504 ± 0.197
1.121MetPro: 1.121 ± 0.194
1.18MetGln: 1.18 ± 0.322
1.799MetArg: 1.799 ± 0.242
2.241MetSer: 2.241 ± 0.257
2.005MetThr: 2.005 ± 0.245
1.828MetVal: 1.828 ± 0.247
0.265MetTrp: 0.265 ± 0.087
0.914MetTyr: 0.914 ± 0.181
0.0MetXaa: 0.0 ± 0.0
Asn
3.686AsnAla: 3.686 ± 0.428
0.501AsnCys: 0.501 ± 0.131
3.48AsnAsp: 3.48 ± 0.355
3.657AsnGlu: 3.657 ± 0.397
1.769AsnPhe: 1.769 ± 0.21
4.158AsnGly: 4.158 ± 0.43
0.708AsnHis: 0.708 ± 0.15
2.094AsnIle: 2.094 ± 0.244
3.273AsnLys: 3.273 ± 0.354
3.362AsnLeu: 3.362 ± 0.308
1.592AsnMet: 1.592 ± 0.244
1.976AsnAsn: 1.976 ± 0.29
2.389AsnPro: 2.389 ± 0.314
1.386AsnGln: 1.386 ± 0.194
2.241AsnArg: 2.241 ± 0.231
2.831AsnSer: 2.831 ± 0.329
2.33AsnThr: 2.33 ± 0.287
3.273AsnVal: 3.273 ± 0.29
0.973AsnTrp: 0.973 ± 0.143
1.828AsnTyr: 1.828 ± 0.262
0.0AsnXaa: 0.0 ± 0.0
Pro
2.89ProAla: 2.89 ± 0.317
0.413ProCys: 0.413 ± 0.115
2.831ProAsp: 2.831 ± 0.341
2.33ProGlu: 2.33 ± 0.292
1.622ProPhe: 1.622 ± 0.273
3.303ProGly: 3.303 ± 0.438
0.708ProHis: 0.708 ± 0.139
1.769ProIle: 1.769 ± 0.196
2.035ProLys: 2.035 ± 0.257
2.33ProLeu: 2.33 ± 0.296
0.619ProMet: 0.619 ± 0.132
1.651ProAsn: 1.651 ± 0.257
1.239ProPro: 1.239 ± 0.264
1.386ProGln: 1.386 ± 0.25
1.976ProArg: 1.976 ± 0.218
2.182ProSer: 2.182 ± 0.364
2.3ProThr: 2.3 ± 0.43
3.421ProVal: 3.421 ± 0.367
0.442ProTrp: 0.442 ± 0.11
1.062ProTyr: 1.062 ± 0.179
0.0ProXaa: 0.0 ± 0.0
Gln
3.303GlnAla: 3.303 ± 0.476
0.324GlnCys: 0.324 ± 0.111
1.356GlnAsp: 1.356 ± 0.269
2.064GlnGlu: 2.064 ± 0.221
1.533GlnPhe: 1.533 ± 0.234
2.035GlnGly: 2.035 ± 0.422
0.383GlnHis: 0.383 ± 0.092
1.622GlnIle: 1.622 ± 0.241
2.3GlnLys: 2.3 ± 0.294
2.713GlnLeu: 2.713 ± 0.278
1.268GlnMet: 1.268 ± 0.238
1.474GlnAsn: 1.474 ± 0.217
0.826GlnPro: 0.826 ± 0.157
1.062GlnGln: 1.062 ± 0.256
2.005GlnArg: 2.005 ± 0.36
1.917GlnSer: 1.917 ± 0.242
1.858GlnThr: 1.858 ± 0.237
2.566GlnVal: 2.566 ± 0.267
0.708GlnTrp: 0.708 ± 0.149
1.474GlnTyr: 1.474 ± 0.226
0.0GlnXaa: 0.0 ± 0.0
Arg
4.866ArgAla: 4.866 ± 0.517
0.678ArgCys: 0.678 ± 0.155
3.509ArgAsp: 3.509 ± 0.408
3.952ArgGlu: 3.952 ± 0.37
2.212ArgPhe: 2.212 ± 0.32
3.657ArgGly: 3.657 ± 0.306
0.767ArgHis: 0.767 ± 0.185
2.978ArgIle: 2.978 ± 0.267
3.834ArgLys: 3.834 ± 0.423
4.099ArgLeu: 4.099 ± 0.33
1.887ArgMet: 1.887 ± 0.257
2.595ArgAsn: 2.595 ± 0.266
1.828ArgPro: 1.828 ± 0.246
2.064ArgGln: 2.064 ± 0.243
3.509ArgArg: 3.509 ± 0.5
2.654ArgSer: 2.654 ± 0.307
2.89ArgThr: 2.89 ± 0.335
4.04ArgVal: 4.04 ± 0.407
1.239ArgTrp: 1.239 ± 0.216
2.212ArgTyr: 2.212 ± 0.251
0.0ArgXaa: 0.0 ± 0.0
Ser
4.895SerAla: 4.895 ± 0.714
0.649SerCys: 0.649 ± 0.154
4.069SerAsp: 4.069 ± 0.45
3.509SerGlu: 3.509 ± 0.311
2.33SerPhe: 2.33 ± 0.272
6.193SerGly: 6.193 ± 0.588
1.327SerHis: 1.327 ± 0.18
3.185SerIle: 3.185 ± 0.327
3.863SerLys: 3.863 ± 0.385
4.836SerLeu: 4.836 ± 0.409
1.828SerMet: 1.828 ± 0.214
2.33SerAsn: 2.33 ± 0.395
1.828SerPro: 1.828 ± 0.258
1.799SerGln: 1.799 ± 0.196
3.48SerArg: 3.48 ± 0.301
3.421SerSer: 3.421 ± 0.443
3.421SerThr: 3.421 ± 0.446
4.571SerVal: 4.571 ± 0.358
1.18SerTrp: 1.18 ± 0.184
1.74SerTyr: 1.74 ± 0.277
0.0SerXaa: 0.0 ± 0.0
Thr
4.807ThrAla: 4.807 ± 0.628
0.619ThrCys: 0.619 ± 0.149
3.45ThrAsp: 3.45 ± 0.329
3.627ThrGlu: 3.627 ± 0.333
2.389ThrPhe: 2.389 ± 0.258
5.013ThrGly: 5.013 ± 0.669
0.914ThrHis: 0.914 ± 0.168
3.391ThrIle: 3.391 ± 0.34
3.155ThrLys: 3.155 ± 0.282
3.922ThrLeu: 3.922 ± 0.338
1.445ThrMet: 1.445 ± 0.231
2.978ThrAsn: 2.978 ± 0.446
2.86ThrPro: 2.86 ± 0.388
2.241ThrGln: 2.241 ± 0.317
3.008ThrArg: 3.008 ± 0.314
3.421ThrSer: 3.421 ± 0.324
3.657ThrThr: 3.657 ± 0.621
4.689ThrVal: 4.689 ± 0.582
1.563ThrTrp: 1.563 ± 0.211
2.241ThrTyr: 2.241 ± 0.29
0.0ThrXaa: 0.0 ± 0.0
Val
5.75ValAla: 5.75 ± 0.398
0.855ValCys: 0.855 ± 0.165
5.131ValAsp: 5.131 ± 0.492
4.453ValGlu: 4.453 ± 0.402
2.507ValPhe: 2.507 ± 0.277
4.453ValGly: 4.453 ± 0.392
1.209ValHis: 1.209 ± 0.199
3.775ValIle: 3.775 ± 0.29
4.394ValLys: 4.394 ± 0.33
5.249ValLeu: 5.249 ± 0.509
1.533ValMet: 1.533 ± 0.178
3.037ValAsn: 3.037 ± 0.327
2.595ValPro: 2.595 ± 0.339
2.153ValGln: 2.153 ± 0.264
4.187ValArg: 4.187 ± 0.343
5.279ValSer: 5.279 ± 0.517
3.834ValThr: 3.834 ± 0.458
5.396ValVal: 5.396 ± 0.36
1.533ValTrp: 1.533 ± 0.244
3.096ValTyr: 3.096 ± 0.319
0.0ValXaa: 0.0 ± 0.0
Trp
1.356TrpAla: 1.356 ± 0.23
0.265TrpCys: 0.265 ± 0.106
1.445TrpAsp: 1.445 ± 0.215
1.445TrpGlu: 1.445 ± 0.242
0.708TrpPhe: 0.708 ± 0.143
1.445TrpGly: 1.445 ± 0.252
0.383TrpHis: 0.383 ± 0.108
0.914TrpIle: 0.914 ± 0.153
1.356TrpLys: 1.356 ± 0.247
1.474TrpLeu: 1.474 ± 0.262
0.737TrpMet: 0.737 ± 0.145
1.032TrpAsn: 1.032 ± 0.148
0.413TrpPro: 0.413 ± 0.117
0.531TrpGln: 0.531 ± 0.141
0.885TrpArg: 0.885 ± 0.181
1.209TrpSer: 1.209 ± 0.171
1.032TrpThr: 1.032 ± 0.199
0.914TrpVal: 0.914 ± 0.175
0.501TrpTrp: 0.501 ± 0.123
0.767TrpTyr: 0.767 ± 0.159
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.067TyrAla: 3.067 ± 0.344
0.295TyrCys: 0.295 ± 0.09
2.86TyrAsp: 2.86 ± 0.319
2.507TyrGlu: 2.507 ± 0.287
1.062TyrPhe: 1.062 ± 0.207
3.008TyrGly: 3.008 ± 0.253
0.619TyrHis: 0.619 ± 0.134
1.828TyrIle: 1.828 ± 0.28
2.182TyrLys: 2.182 ± 0.285
3.214TyrLeu: 3.214 ± 0.279
0.973TyrMet: 0.973 ± 0.162
1.828TyrAsn: 1.828 ± 0.22
1.415TyrPro: 1.415 ± 0.244
1.415TyrGln: 1.415 ± 0.201
1.828TyrArg: 1.828 ± 0.229
2.831TyrSer: 2.831 ± 0.307
2.241TyrThr: 2.241 ± 0.261
2.536TyrVal: 2.536 ± 0.298
0.619TyrTrp: 0.619 ± 0.141
1.327TyrTyr: 1.327 ± 0.249
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 212 proteins (33912 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski