Amino acid dipepetide frequency for Brevibacillus phage Jenst

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.794AlaAla: 2.794 ± 0.353
0.672AlaCys: 0.672 ± 0.144
4.084AlaAsp: 4.084 ± 0.383
5.562AlaGlu: 5.562 ± 0.49
2.096AlaPhe: 2.096 ± 0.231
3.815AlaGly: 3.815 ± 0.288
1.129AlaHis: 1.129 ± 0.164
4.407AlaIle: 4.407 ± 0.33
5.24AlaLys: 5.24 ± 0.427
4.863AlaLeu: 4.863 ± 0.401
1.988AlaMet: 1.988 ± 0.243
3.278AlaAsn: 3.278 ± 0.331
2.633AlaPro: 2.633 ± 0.266
2.176AlaGln: 2.176 ± 0.302
2.768AlaArg: 2.768 ± 0.275
3.574AlaSer: 3.574 ± 0.327
4.111AlaThr: 4.111 ± 0.387
4.38AlaVal: 4.38 ± 0.346
0.752AlaTrp: 0.752 ± 0.149
2.741AlaTyr: 2.741 ± 0.338
0.0AlaXaa: 0.0 ± 0.0
Cys
0.322CysAla: 0.322 ± 0.1
0.081CysCys: 0.081 ± 0.061
0.725CysAsp: 0.725 ± 0.139
0.725CysGlu: 0.725 ± 0.197
0.296CysPhe: 0.296 ± 0.08
1.129CysGly: 1.129 ± 0.226
0.43CysHis: 0.43 ± 0.118
0.725CysIle: 0.725 ± 0.181
0.806CysLys: 0.806 ± 0.149
0.618CysLeu: 0.618 ± 0.143
0.269CysMet: 0.269 ± 0.085
0.511CysAsn: 0.511 ± 0.102
0.591CysPro: 0.591 ± 0.164
0.242CysGln: 0.242 ± 0.077
0.484CysArg: 0.484 ± 0.109
1.021CysSer: 1.021 ± 0.21
0.403CysThr: 0.403 ± 0.117
0.564CysVal: 0.564 ± 0.148
0.027CysTrp: 0.027 ± 0.029
0.457CysTyr: 0.457 ± 0.122
0.0CysXaa: 0.0 ± 0.0
Asp
3.789AspAla: 3.789 ± 0.288
0.591AspCys: 0.591 ± 0.133
3.977AspAsp: 3.977 ± 0.347
4.756AspGlu: 4.756 ± 0.396
3.063AspPhe: 3.063 ± 0.273
4.836AspGly: 4.836 ± 0.418
0.779AspHis: 0.779 ± 0.15
5.374AspIle: 5.374 ± 0.454
4.245AspLys: 4.245 ± 0.363
4.998AspLeu: 4.998 ± 0.399
1.747AspMet: 1.747 ± 0.236
2.741AspAsn: 2.741 ± 0.296
2.203AspPro: 2.203 ± 0.288
0.94AspGln: 0.94 ± 0.163
2.875AspArg: 2.875 ± 0.326
4.165AspSer: 4.165 ± 0.377
3.412AspThr: 3.412 ± 0.394
5.454AspVal: 5.454 ± 0.319
0.833AspTrp: 0.833 ± 0.174
2.848AspTyr: 2.848 ± 0.286
0.0AspXaa: 0.0 ± 0.0
Glu
4.675GluAla: 4.675 ± 0.391
0.806GluCys: 0.806 ± 0.25
4.568GluAsp: 4.568 ± 0.346
6.771GluGlu: 6.771 ± 0.621
3.063GluPhe: 3.063 ± 0.33
4.756GluGly: 4.756 ± 0.492
1.37GluHis: 1.37 ± 0.183
5.965GluIle: 5.965 ± 0.389
4.971GluLys: 4.971 ± 0.354
7.631GluLeu: 7.631 ± 0.525
2.526GluMet: 2.526 ± 0.339
3.654GluAsn: 3.654 ± 0.264
1.8GluPro: 1.8 ± 0.243
3.036GluGln: 3.036 ± 0.302
3.735GluArg: 3.735 ± 0.359
3.117GluSer: 3.117 ± 0.284
4.245GluThr: 4.245 ± 0.588
5.508GluVal: 5.508 ± 0.398
1.021GluTrp: 1.021 ± 0.186
2.768GluTyr: 2.768 ± 0.277
0.0GluXaa: 0.0 ± 0.0
Phe
2.15PheAla: 2.15 ± 0.208
0.349PheCys: 0.349 ± 0.104
3.762PheAsp: 3.762 ± 0.316
2.929PheGlu: 2.929 ± 0.299
1.666PhePhe: 1.666 ± 0.236
3.009PheGly: 3.009 ± 0.286
0.672PheHis: 0.672 ± 0.152
2.848PheIle: 2.848 ± 0.319
2.633PheLys: 2.633 ± 0.243
2.929PheLeu: 2.929 ± 0.357
1.29PheMet: 1.29 ± 0.213
2.526PheAsn: 2.526 ± 0.207
1.451PhePro: 1.451 ± 0.205
1.29PheGln: 1.29 ± 0.192
1.666PheArg: 1.666 ± 0.183
2.445PheSer: 2.445 ± 0.275
2.365PheThr: 2.365 ± 0.279
2.176PheVal: 2.176 ± 0.267
0.376PheTrp: 0.376 ± 0.1
2.015PheTyr: 2.015 ± 0.249
0.0PheXaa: 0.0 ± 0.0
Gly
3.923GlyAla: 3.923 ± 0.332
0.752GlyCys: 0.752 ± 0.169
4.03GlyAsp: 4.03 ± 0.377
4.38GlyGlu: 4.38 ± 0.354
3.386GlyPhe: 3.386 ± 0.28
4.944GlyGly: 4.944 ± 0.388
1.558GlyHis: 1.558 ± 0.193
4.219GlyIle: 4.219 ± 0.336
4.783GlyLys: 4.783 ± 0.374
5.616GlyLeu: 5.616 ± 0.424
1.747GlyMet: 1.747 ± 0.213
3.601GlyAsn: 3.601 ± 0.339
0.242GlyPro: 0.242 ± 0.094
2.365GlyGln: 2.365 ± 0.266
2.902GlyArg: 2.902 ± 0.249
3.977GlySer: 3.977 ± 0.306
4.487GlyThr: 4.487 ± 0.41
4.917GlyVal: 4.917 ± 0.397
1.129GlyTrp: 1.129 ± 0.169
3.439GlyTyr: 3.439 ± 0.369
0.0GlyXaa: 0.0 ± 0.0
His
0.94HisAla: 0.94 ± 0.166
0.403HisCys: 0.403 ± 0.086
0.887HisAsp: 0.887 ± 0.156
0.914HisGlu: 0.914 ± 0.178
1.182HisPhe: 1.182 ± 0.189
1.048HisGly: 1.048 ± 0.168
0.376HisHis: 0.376 ± 0.102
1.129HisIle: 1.129 ± 0.151
1.505HisLys: 1.505 ± 0.159
1.478HisLeu: 1.478 ± 0.166
0.752HisMet: 0.752 ± 0.149
0.994HisAsn: 0.994 ± 0.158
0.967HisPro: 0.967 ± 0.165
0.537HisGln: 0.537 ± 0.117
0.86HisArg: 0.86 ± 0.147
1.451HisSer: 1.451 ± 0.201
1.075HisThr: 1.075 ± 0.167
1.505HisVal: 1.505 ± 0.198
0.376HisTrp: 0.376 ± 0.102
0.618HisTyr: 0.618 ± 0.148
0.0HisXaa: 0.0 ± 0.0
Ile
4.514IleAla: 4.514 ± 0.318
0.833IleCys: 0.833 ± 0.154
4.568IleAsp: 4.568 ± 0.389
6.476IleGlu: 6.476 ± 0.458
1.827IlePhe: 1.827 ± 0.244
4.245IleGly: 4.245 ± 0.338
1.558IleHis: 1.558 ± 0.229
3.52IleIle: 3.52 ± 0.333
4.729IleLys: 4.729 ± 0.385
4.433IleLeu: 4.433 ± 0.344
1.585IleMet: 1.585 ± 0.198
3.117IleAsn: 3.117 ± 0.282
2.579IlePro: 2.579 ± 0.285
2.553IleGln: 2.553 ± 0.279
3.117IleArg: 3.117 ± 0.282
3.815IleSer: 3.815 ± 0.324
4.38IleThr: 4.38 ± 0.429
4.836IleVal: 4.836 ± 0.382
0.618IleTrp: 0.618 ± 0.124
2.579IleTyr: 2.579 ± 0.281
0.0IleXaa: 0.0 ± 0.0
Lys
4.729LysAla: 4.729 ± 0.332
0.806LysCys: 0.806 ± 0.18
4.783LysAsp: 4.783 ± 0.368
5.643LysGlu: 5.643 ± 0.509
2.875LysPhe: 2.875 ± 0.324
5.32LysGly: 5.32 ± 0.413
1.343LysHis: 1.343 ± 0.183
4.836LysIle: 4.836 ± 0.301
5.911LysLys: 5.911 ± 0.533
5.858LysLeu: 5.858 ± 0.443
2.284LysMet: 2.284 ± 0.235
3.815LysAsn: 3.815 ± 0.351
2.821LysPro: 2.821 ± 0.317
2.526LysGln: 2.526 ± 0.211
3.224LysArg: 3.224 ± 0.324
3.305LysSer: 3.305 ± 0.327
3.52LysThr: 3.52 ± 0.297
5.401LysVal: 5.401 ± 0.39
0.752LysTrp: 0.752 ± 0.134
3.036LysTyr: 3.036 ± 0.318
0.0LysXaa: 0.0 ± 0.0
Leu
5.616LeuAla: 5.616 ± 0.437
1.021LeuCys: 1.021 ± 0.169
5.616LeuAsp: 5.616 ± 0.412
5.696LeuGlu: 5.696 ± 0.416
2.929LeuPhe: 2.929 ± 0.276
5.051LeuGly: 5.051 ± 0.362
1.182LeuHis: 1.182 ± 0.19
4.407LeuIle: 4.407 ± 0.404
6.395LeuLys: 6.395 ± 0.441
5.804LeuLeu: 5.804 ± 0.492
2.257LeuMet: 2.257 ± 0.246
4.38LeuAsn: 4.38 ± 0.354
3.224LeuPro: 3.224 ± 0.37
2.902LeuGln: 2.902 ± 0.308
4.057LeuArg: 4.057 ± 0.282
5.884LeuSer: 5.884 ± 0.451
4.863LeuThr: 4.863 ± 0.348
5.347LeuVal: 5.347 ± 0.348
0.591LeuTrp: 0.591 ± 0.126
3.197LeuTyr: 3.197 ± 0.335
0.0LeuXaa: 0.0 ± 0.0
Met
2.15MetAla: 2.15 ± 0.261
0.081MetCys: 0.081 ± 0.047
1.612MetAsp: 1.612 ± 0.228
1.747MetGlu: 1.747 ± 0.251
0.86MetPhe: 0.86 ± 0.145
1.827MetGly: 1.827 ± 0.244
0.645MetHis: 0.645 ± 0.158
1.397MetIle: 1.397 ± 0.173
2.633MetLys: 2.633 ± 0.284
2.015MetLeu: 2.015 ± 0.226
0.484MetMet: 0.484 ± 0.117
1.343MetAsn: 1.343 ± 0.235
0.967MetPro: 0.967 ± 0.159
1.048MetGln: 1.048 ± 0.169
1.343MetArg: 1.343 ± 0.192
2.391MetSer: 2.391 ± 0.252
1.666MetThr: 1.666 ± 0.201
2.015MetVal: 2.015 ± 0.225
0.349MetTrp: 0.349 ± 0.086
1.209MetTyr: 1.209 ± 0.171
0.0MetXaa: 0.0 ± 0.0
Asn
3.439AsnAla: 3.439 ± 0.344
0.564AsnCys: 0.564 ± 0.15
2.499AsnAsp: 2.499 ± 0.259
3.627AsnGlu: 3.627 ± 0.252
2.499AsnPhe: 2.499 ± 0.259
4.004AsnGly: 4.004 ± 0.34
1.209AsnHis: 1.209 ± 0.165
3.386AsnIle: 3.386 ± 0.314
3.386AsnLys: 3.386 ± 0.294
4.057AsnLeu: 4.057 ± 0.372
1.236AsnMet: 1.236 ± 0.193
2.687AsnAsn: 2.687 ± 0.269
2.445AsnPro: 2.445 ± 0.265
1.317AsnGln: 1.317 ± 0.177
2.445AsnArg: 2.445 ± 0.252
3.036AsnSer: 3.036 ± 0.362
3.251AsnThr: 3.251 ± 0.357
3.036AsnVal: 3.036 ± 0.293
0.645AsnTrp: 0.645 ± 0.129
2.391AsnTyr: 2.391 ± 0.255
0.0AsnXaa: 0.0 ± 0.0
Pro
2.284ProAla: 2.284 ± 0.259
0.349ProCys: 0.349 ± 0.089
2.445ProAsp: 2.445 ± 0.23
2.983ProGlu: 2.983 ± 0.27
1.478ProPhe: 1.478 ± 0.259
0.591ProGly: 0.591 ± 0.148
0.806ProHis: 0.806 ± 0.144
1.693ProIle: 1.693 ± 0.181
2.176ProLys: 2.176 ± 0.289
2.902ProLeu: 2.902 ± 0.296
1.317ProMet: 1.317 ± 0.156
1.827ProAsn: 1.827 ± 0.258
1.612ProPro: 1.612 ± 0.609
1.666ProGln: 1.666 ± 0.274
1.155ProArg: 1.155 ± 0.146
3.036ProSer: 3.036 ± 0.45
2.579ProThr: 2.579 ± 0.283
2.338ProVal: 2.338 ± 0.231
0.188ProTrp: 0.188 ± 0.073
1.478ProTyr: 1.478 ± 0.179
0.0ProXaa: 0.0 ± 0.0
Gln
2.956GlnAla: 2.956 ± 0.392
0.43GlnCys: 0.43 ± 0.117
1.827GlnAsp: 1.827 ± 0.196
2.821GlnGlu: 2.821 ± 0.308
1.155GlnPhe: 1.155 ± 0.194
2.176GlnGly: 2.176 ± 0.296
0.322GlnHis: 0.322 ± 0.094
1.988GlnIle: 1.988 ± 0.194
2.365GlnLys: 2.365 ± 0.341
3.09GlnLeu: 3.09 ± 0.294
1.075GlnMet: 1.075 ± 0.22
1.935GlnAsn: 1.935 ± 0.228
0.994GlnPro: 0.994 ± 0.182
1.236GlnGln: 1.236 ± 0.201
1.639GlnArg: 1.639 ± 0.241
1.532GlnSer: 1.532 ± 0.243
1.666GlnThr: 1.666 ± 0.203
2.821GlnVal: 2.821 ± 0.272
0.457GlnTrp: 0.457 ± 0.118
1.747GlnTyr: 1.747 ± 0.203
0.0GlnXaa: 0.0 ± 0.0
Arg
2.956ArgAla: 2.956 ± 0.298
0.322ArgCys: 0.322 ± 0.103
2.821ArgAsp: 2.821 ± 0.313
3.493ArgGlu: 3.493 ± 0.366
1.881ArgPhe: 1.881 ± 0.257
2.714ArgGly: 2.714 ± 0.308
0.887ArgHis: 0.887 ± 0.159
3.493ArgIle: 3.493 ± 0.301
3.332ArgLys: 3.332 ± 0.331
4.622ArgLeu: 4.622 ± 0.417
1.37ArgMet: 1.37 ± 0.194
2.606ArgAsn: 2.606 ± 0.243
1.585ArgPro: 1.585 ± 0.22
1.988ArgGln: 1.988 ± 0.246
2.687ArgArg: 2.687 ± 0.268
2.311ArgSer: 2.311 ± 0.289
2.284ArgThr: 2.284 ± 0.254
3.197ArgVal: 3.197 ± 0.276
0.511ArgTrp: 0.511 ± 0.141
2.445ArgTyr: 2.445 ± 0.226
0.0ArgXaa: 0.0 ± 0.0
Ser
3.386SerAla: 3.386 ± 0.309
0.537SerCys: 0.537 ± 0.13
3.923SerAsp: 3.923 ± 0.341
3.842SerGlu: 3.842 ± 0.373
2.633SerPhe: 2.633 ± 0.289
3.869SerGly: 3.869 ± 0.333
1.155SerHis: 1.155 ± 0.178
4.326SerIle: 4.326 ± 0.389
4.836SerLys: 4.836 ± 0.409
4.568SerLeu: 4.568 ± 0.382
1.317SerMet: 1.317 ± 0.168
3.144SerAsn: 3.144 ± 0.37
2.633SerPro: 2.633 ± 0.309
2.203SerGln: 2.203 ± 0.278
3.386SerArg: 3.386 ± 0.286
4.568SerSer: 4.568 ± 0.427
3.171SerThr: 3.171 ± 0.251
4.756SerVal: 4.756 ± 0.44
0.806SerTrp: 0.806 ± 0.17
2.391SerTyr: 2.391 ± 0.278
0.0SerXaa: 0.0 ± 0.0
Thr
4.272ThrAla: 4.272 ± 0.335
0.618ThrCys: 0.618 ± 0.134
3.278ThrAsp: 3.278 ± 0.302
4.272ThrGlu: 4.272 ± 0.415
2.553ThrPhe: 2.553 ± 0.246
3.95ThrGly: 3.95 ± 0.34
0.806ThrHis: 0.806 ± 0.146
4.487ThrIle: 4.487 ± 0.375
4.004ThrLys: 4.004 ± 0.288
5.266ThrLeu: 5.266 ± 0.391
1.263ThrMet: 1.263 ± 0.212
2.472ThrAsn: 2.472 ± 0.299
2.526ThrPro: 2.526 ± 0.299
1.881ThrGln: 1.881 ± 0.243
2.257ThrArg: 2.257 ± 0.218
3.063ThrSer: 3.063 ± 0.233
3.305ThrThr: 3.305 ± 0.239
4.89ThrVal: 4.89 ± 0.36
0.591ThrTrp: 0.591 ± 0.11
2.526ThrTyr: 2.526 ± 0.272
0.0ThrXaa: 0.0 ± 0.0
Val
4.541ValAla: 4.541 ± 0.299
0.591ValCys: 0.591 ± 0.13
4.836ValAsp: 4.836 ± 0.299
5.75ValGlu: 5.75 ± 0.454
2.929ValPhe: 2.929 ± 0.288
5.213ValGly: 5.213 ± 0.411
1.343ValHis: 1.343 ± 0.178
4.299ValIle: 4.299 ± 0.33
5.374ValLys: 5.374 ± 0.361
5.159ValLeu: 5.159 ± 0.355
1.827ValMet: 1.827 ± 0.235
3.493ValAsn: 3.493 ± 0.399
1.961ValPro: 1.961 ± 0.251
2.096ValGln: 2.096 ± 0.251
3.977ValArg: 3.977 ± 0.464
5.401ValSer: 5.401 ± 0.391
4.944ValThr: 4.944 ± 0.344
5.401ValVal: 5.401 ± 0.415
0.591ValTrp: 0.591 ± 0.106
2.929ValTyr: 2.929 ± 0.333
0.0ValXaa: 0.0 ± 0.0
Trp
0.86TrpAla: 0.86 ± 0.121
0.107TrpCys: 0.107 ± 0.047
0.699TrpAsp: 0.699 ± 0.165
1.102TrpGlu: 1.102 ± 0.214
0.403TrpPhe: 0.403 ± 0.112
0.833TrpGly: 0.833 ± 0.137
0.322TrpHis: 0.322 ± 0.099
0.457TrpIle: 0.457 ± 0.106
0.591TrpLys: 0.591 ± 0.134
1.048TrpLeu: 1.048 ± 0.174
0.376TrpMet: 0.376 ± 0.099
0.86TrpAsn: 0.86 ± 0.111
0.027TrpPro: 0.027 ± 0.027
0.296TrpGln: 0.296 ± 0.081
0.591TrpArg: 0.591 ± 0.123
0.672TrpSer: 0.672 ± 0.121
0.699TrpThr: 0.699 ± 0.14
0.591TrpVal: 0.591 ± 0.128
0.107TrpTrp: 0.107 ± 0.05
0.457TrpTyr: 0.457 ± 0.114
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.794TyrAla: 2.794 ± 0.222
0.511TyrCys: 0.511 ± 0.144
2.687TyrAsp: 2.687 ± 0.277
2.499TyrGlu: 2.499 ± 0.245
1.773TyrPhe: 1.773 ± 0.217
3.063TyrGly: 3.063 ± 0.332
1.102TyrHis: 1.102 ± 0.179
2.929TyrIle: 2.929 ± 0.291
2.875TyrLys: 2.875 ± 0.298
3.466TyrLeu: 3.466 ± 0.3
1.048TyrMet: 1.048 ± 0.166
2.176TyrAsn: 2.176 ± 0.282
1.639TyrPro: 1.639 ± 0.254
1.935TyrGln: 1.935 ± 0.295
2.338TyrArg: 2.338 ± 0.238
2.687TyrSer: 2.687 ± 0.292
1.854TyrThr: 1.854 ± 0.231
3.52TyrVal: 3.52 ± 0.374
0.376TyrTrp: 0.376 ± 0.111
1.532TyrTyr: 1.532 ± 0.184
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 178 proteins (37218 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski