Amino acid dipepetide frequency for Streptomyces phage Henoccus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
15.16AlaAla: 15.16 ± 1.307
0.94AlaCys: 0.94 ± 0.271
7.639AlaAsp: 7.639 ± 0.721
8.461AlaGlu: 8.461 ± 0.932
3.232AlaPhe: 3.232 ± 0.528
9.989AlaGly: 9.989 ± 0.841
1.88AlaHis: 1.88 ± 0.382
4.289AlaIle: 4.289 ± 0.639
8.931AlaLys: 8.931 ± 2.183
9.166AlaLeu: 9.166 ± 0.719
3.408AlaMet: 3.408 ± 0.428
2.82AlaAsn: 2.82 ± 0.503
4.818AlaPro: 4.818 ± 0.665
4.289AlaGln: 4.289 ± 0.886
7.932AlaArg: 7.932 ± 0.538
4.759AlaSer: 4.759 ± 0.517
6.287AlaThr: 6.287 ± 0.744
7.756AlaVal: 7.756 ± 0.713
2.35AlaTrp: 2.35 ± 0.41
3.349AlaTyr: 3.349 ± 0.544
0.0AlaXaa: 0.0 ± 0.0
Cys
0.94CysAla: 0.94 ± 0.299
0.0CysCys: 0.0 ± 0.0
0.705CysAsp: 0.705 ± 0.282
0.881CysGlu: 0.881 ± 0.313
0.118CysPhe: 0.118 ± 0.098
1.234CysGly: 1.234 ± 0.355
0.294CysHis: 0.294 ± 0.116
0.646CysIle: 0.646 ± 0.23
0.411CysLys: 0.411 ± 0.172
0.529CysLeu: 0.529 ± 0.18
0.235CysMet: 0.235 ± 0.101
0.235CysAsn: 0.235 ± 0.117
0.881CysPro: 0.881 ± 0.319
0.47CysGln: 0.47 ± 0.181
0.646CysArg: 0.646 ± 0.257
0.823CysSer: 0.823 ± 0.234
0.646CysThr: 0.646 ± 0.202
0.764CysVal: 0.764 ± 0.236
0.118CysTrp: 0.118 ± 0.083
0.235CysTyr: 0.235 ± 0.118
0.0CysXaa: 0.0 ± 0.0
Asp
7.462AspAla: 7.462 ± 0.801
0.646AspCys: 0.646 ± 0.198
4.113AspAsp: 4.113 ± 0.587
4.818AspGlu: 4.818 ± 0.702
2.527AspPhe: 2.527 ± 0.435
6.111AspGly: 6.111 ± 0.697
0.999AspHis: 0.999 ± 0.249
2.174AspIle: 2.174 ± 0.278
2.644AspLys: 2.644 ± 0.462
4.994AspLeu: 4.994 ± 0.718
1.645AspMet: 1.645 ± 0.287
1.586AspAsn: 1.586 ± 0.337
3.408AspPro: 3.408 ± 0.55
1.763AspGln: 1.763 ± 0.308
4.113AspArg: 4.113 ± 0.437
2.115AspSer: 2.115 ± 0.362
2.82AspThr: 2.82 ± 0.421
4.231AspVal: 4.231 ± 0.501
1.469AspTrp: 1.469 ± 0.277
1.469AspTyr: 1.469 ± 0.279
0.0AspXaa: 0.0 ± 0.0
Glu
7.168GluAla: 7.168 ± 0.826
1.116GluCys: 1.116 ± 0.347
4.113GluAsp: 4.113 ± 0.675
3.408GluGlu: 3.408 ± 0.515
2.174GluPhe: 2.174 ± 0.337
4.759GluGly: 4.759 ± 0.635
1.645GluHis: 1.645 ± 0.364
2.762GluIle: 2.762 ± 0.403
3.467GluLys: 3.467 ± 0.722
5.112GluLeu: 5.112 ± 0.662
1.41GluMet: 1.41 ± 0.237
1.586GluAsn: 1.586 ± 0.346
3.408GluPro: 3.408 ± 0.567
2.997GluGln: 2.997 ± 0.504
5.171GluArg: 5.171 ± 0.676
2.879GluSer: 2.879 ± 0.393
3.819GluThr: 3.819 ± 0.529
3.819GluVal: 3.819 ± 0.435
1.586GluTrp: 1.586 ± 0.349
0.823GluTyr: 0.823 ± 0.262
0.0GluXaa: 0.0 ± 0.0
Phe
2.292PheAla: 2.292 ± 0.373
0.294PheCys: 0.294 ± 0.149
2.057PheAsp: 2.057 ± 0.255
1.645PheGlu: 1.645 ± 0.402
0.94PhePhe: 0.94 ± 0.202
2.703PheGly: 2.703 ± 0.45
0.764PheHis: 0.764 ± 0.222
0.823PheIle: 0.823 ± 0.237
1.175PheLys: 1.175 ± 0.293
2.409PheLeu: 2.409 ± 0.418
0.588PheMet: 0.588 ± 0.226
1.116PheAsn: 1.116 ± 0.285
1.528PhePro: 1.528 ± 0.295
1.351PheGln: 1.351 ± 0.279
2.35PheArg: 2.35 ± 0.476
1.645PheSer: 1.645 ± 0.242
2.057PheThr: 2.057 ± 0.374
2.762PheVal: 2.762 ± 0.413
0.705PheTrp: 0.705 ± 0.196
0.646PheTyr: 0.646 ± 0.185
0.0PheXaa: 0.0 ± 0.0
Gly
9.636GlyAla: 9.636 ± 0.713
0.94GlyCys: 0.94 ± 0.25
5.464GlyAsp: 5.464 ± 0.645
5.523GlyGlu: 5.523 ± 0.565
3.055GlyPhe: 3.055 ± 0.269
9.225GlyGly: 9.225 ± 0.953
1.939GlyHis: 1.939 ± 0.338
2.879GlyIle: 2.879 ± 0.638
5.464GlyLys: 5.464 ± 0.819
5.935GlyLeu: 5.935 ± 0.465
2.585GlyMet: 2.585 ± 0.315
3.114GlyAsn: 3.114 ± 0.529
3.761GlyPro: 3.761 ± 0.445
3.467GlyGln: 3.467 ± 0.483
4.701GlyArg: 4.701 ± 0.418
5.288GlySer: 5.288 ± 0.577
5.582GlyThr: 5.582 ± 0.683
7.639GlyVal: 7.639 ± 0.712
2.762GlyTrp: 2.762 ± 0.437
3.173GlyTyr: 3.173 ± 0.442
0.0GlyXaa: 0.0 ± 0.0
His
2.233HisAla: 2.233 ± 0.342
0.353HisCys: 0.353 ± 0.154
0.823HisAsp: 0.823 ± 0.245
1.293HisGlu: 1.293 ± 0.266
0.764HisPhe: 0.764 ± 0.225
2.35HisGly: 2.35 ± 0.404
0.411HisHis: 0.411 ± 0.149
0.47HisIle: 0.47 ± 0.152
0.646HisLys: 0.646 ± 0.223
0.94HisLeu: 0.94 ± 0.227
0.588HisMet: 0.588 ± 0.157
0.646HisAsn: 0.646 ± 0.166
1.704HisPro: 1.704 ± 0.32
0.529HisGln: 0.529 ± 0.172
1.116HisArg: 1.116 ± 0.291
0.764HisSer: 0.764 ± 0.205
0.999HisThr: 0.999 ± 0.244
0.94HisVal: 0.94 ± 0.233
0.353HisTrp: 0.353 ± 0.14
0.705HisTyr: 0.705 ± 0.195
0.0HisXaa: 0.0 ± 0.0
Ile
4.466IleAla: 4.466 ± 0.613
0.705IleCys: 0.705 ± 0.232
2.409IleAsp: 2.409 ± 0.489
2.409IleGlu: 2.409 ± 0.317
1.528IlePhe: 1.528 ± 0.371
3.349IleGly: 3.349 ± 0.513
0.94IleHis: 0.94 ± 0.231
1.351IleIle: 1.351 ± 0.315
2.585IleLys: 2.585 ± 0.466
2.703IleLeu: 2.703 ± 0.449
1.058IleMet: 1.058 ± 0.247
1.645IleAsn: 1.645 ± 0.324
2.292IlePro: 2.292 ± 0.419
1.293IleGln: 1.293 ± 0.264
2.82IleArg: 2.82 ± 0.422
1.763IleSer: 1.763 ± 0.402
2.468IleThr: 2.468 ± 0.359
2.997IleVal: 2.997 ± 0.404
0.764IleTrp: 0.764 ± 0.183
0.881IleTyr: 0.881 ± 0.217
0.0IleXaa: 0.0 ± 0.0
Lys
7.991LysAla: 7.991 ± 2.17
0.705LysCys: 0.705 ± 0.266
2.997LysAsp: 2.997 ± 0.547
3.055LysGlu: 3.055 ± 0.562
1.351LysPhe: 1.351 ± 0.274
4.231LysGly: 4.231 ± 0.547
1.175LysHis: 1.175 ± 0.414
2.762LysIle: 2.762 ± 0.437
3.996LysLys: 3.996 ± 1.56
4.994LysLeu: 4.994 ± 0.525
1.939LysMet: 1.939 ± 0.376
1.234LysAsn: 1.234 ± 0.223
3.232LysPro: 3.232 ± 0.588
1.704LysGln: 1.704 ± 0.369
2.997LysArg: 2.997 ± 0.331
2.292LysSer: 2.292 ± 0.475
3.878LysThr: 3.878 ± 0.74
3.643LysVal: 3.643 ± 0.483
0.764LysTrp: 0.764 ± 0.243
1.351LysTyr: 1.351 ± 0.342
0.0LysXaa: 0.0 ± 0.0
Leu
9.578LeuAla: 9.578 ± 0.942
0.646LeuCys: 0.646 ± 0.212
4.524LeuAsp: 4.524 ± 0.695
5.112LeuGlu: 5.112 ± 0.635
1.234LeuPhe: 1.234 ± 0.274
6.64LeuGly: 6.64 ± 0.787
1.645LeuHis: 1.645 ± 0.318
2.938LeuIle: 2.938 ± 0.462
4.172LeuLys: 4.172 ± 0.466
4.524LeuLeu: 4.524 ± 0.619
1.586LeuMet: 1.586 ± 0.293
2.292LeuAsn: 2.292 ± 0.396
4.172LeuPro: 4.172 ± 0.496
1.645LeuGln: 1.645 ± 0.289
5.7LeuArg: 5.7 ± 0.611
3.937LeuSer: 3.937 ± 0.519
4.759LeuThr: 4.759 ± 0.579
4.289LeuVal: 4.289 ± 0.537
1.704LeuTrp: 1.704 ± 0.448
1.586LeuTyr: 1.586 ± 0.328
0.0LeuXaa: 0.0 ± 0.0
Met
2.35MetAla: 2.35 ± 0.391
0.176MetCys: 0.176 ± 0.108
1.763MetAsp: 1.763 ± 0.254
1.175MetGlu: 1.175 ± 0.256
0.705MetPhe: 0.705 ± 0.197
1.939MetGly: 1.939 ± 0.322
0.353MetHis: 0.353 ± 0.139
1.116MetIle: 1.116 ± 0.224
1.821MetLys: 1.821 ± 0.308
1.88MetLeu: 1.88 ± 0.342
0.294MetMet: 0.294 ± 0.11
0.529MetAsn: 0.529 ± 0.146
1.88MetPro: 1.88 ± 0.407
0.47MetGln: 0.47 ± 0.168
1.41MetArg: 1.41 ± 0.273
1.998MetSer: 1.998 ± 0.299
2.174MetThr: 2.174 ± 0.368
1.293MetVal: 1.293 ± 0.294
0.235MetTrp: 0.235 ± 0.124
0.823MetTyr: 0.823 ± 0.211
0.0MetXaa: 0.0 ± 0.0
Asn
4.113AsnAla: 4.113 ± 0.578
0.353AsnCys: 0.353 ± 0.169
2.057AsnAsp: 2.057 ± 0.393
1.821AsnGlu: 1.821 ± 0.383
1.351AsnPhe: 1.351 ± 0.385
3.173AsnGly: 3.173 ± 0.486
0.47AsnHis: 0.47 ± 0.144
0.94AsnIle: 0.94 ± 0.22
0.999AsnLys: 0.999 ± 0.238
2.468AsnLeu: 2.468 ± 0.432
0.529AsnMet: 0.529 ± 0.177
0.94AsnAsn: 0.94 ± 0.24
2.527AsnPro: 2.527 ± 0.407
0.823AsnGln: 0.823 ± 0.225
1.469AsnArg: 1.469 ± 0.261
1.351AsnSer: 1.351 ± 0.248
1.704AsnThr: 1.704 ± 0.373
1.645AsnVal: 1.645 ± 0.369
0.411AsnTrp: 0.411 ± 0.136
0.47AsnTyr: 0.47 ± 0.18
0.0AsnXaa: 0.0 ± 0.0
Pro
5.758ProAla: 5.758 ± 0.747
0.705ProCys: 0.705 ± 0.233
3.584ProAsp: 3.584 ± 0.48
4.466ProGlu: 4.466 ± 0.512
1.998ProPhe: 1.998 ± 0.264
7.11ProGly: 7.11 ± 0.706
0.823ProHis: 0.823 ± 0.242
1.645ProIle: 1.645 ± 0.305
2.997ProLys: 2.997 ± 0.491
3.467ProLeu: 3.467 ± 0.523
0.999ProMet: 0.999 ± 0.23
1.528ProAsn: 1.528 ± 0.258
3.29ProPro: 3.29 ± 0.524
1.469ProGln: 1.469 ± 0.257
2.997ProArg: 2.997 ± 0.539
2.703ProSer: 2.703 ± 0.536
2.585ProThr: 2.585 ± 0.377
4.524ProVal: 4.524 ± 0.675
1.351ProTrp: 1.351 ± 0.323
1.704ProTyr: 1.704 ± 0.306
0.0ProXaa: 0.0 ± 0.0
Gln
4.289GlnAla: 4.289 ± 0.642
0.294GlnCys: 0.294 ± 0.169
1.763GlnAsp: 1.763 ± 0.307
1.41GlnGlu: 1.41 ± 0.293
0.94GlnPhe: 0.94 ± 0.275
2.997GlnGly: 2.997 ± 0.578
0.999GlnHis: 0.999 ± 0.265
1.704GlnIle: 1.704 ± 0.366
2.35GlnLys: 2.35 ± 0.583
2.292GlnLeu: 2.292 ± 0.408
1.175GlnMet: 1.175 ± 0.275
1.116GlnAsn: 1.116 ± 0.286
1.528GlnPro: 1.528 ± 0.4
1.998GlnGln: 1.998 ± 0.417
2.409GlnArg: 2.409 ± 0.411
1.645GlnSer: 1.645 ± 0.311
2.057GlnThr: 2.057 ± 0.358
1.998GlnVal: 1.998 ± 0.314
0.588GlnTrp: 0.588 ± 0.229
0.588GlnTyr: 0.588 ± 0.17
0.0GlnXaa: 0.0 ± 0.0
Arg
7.403ArgAla: 7.403 ± 0.608
0.881ArgCys: 0.881 ± 0.253
3.584ArgAsp: 3.584 ± 0.499
5.347ArgGlu: 5.347 ± 0.709
1.763ArgPhe: 1.763 ± 0.403
4.524ArgGly: 4.524 ± 0.579
0.823ArgHis: 0.823 ± 0.196
3.114ArgIle: 3.114 ± 0.481
3.761ArgLys: 3.761 ± 0.582
3.937ArgLeu: 3.937 ± 0.546
1.586ArgMet: 1.586 ± 0.331
1.528ArgAsn: 1.528 ± 0.234
3.761ArgPro: 3.761 ± 0.575
2.997ArgGln: 2.997 ± 0.32
5.288ArgArg: 5.288 ± 0.718
3.173ArgSer: 3.173 ± 0.46
3.525ArgThr: 3.525 ± 0.474
4.348ArgVal: 4.348 ± 0.428
1.645ArgTrp: 1.645 ± 0.309
2.174ArgTyr: 2.174 ± 0.408
0.0ArgXaa: 0.0 ± 0.0
Ser
5.229SerAla: 5.229 ± 0.728
0.235SerCys: 0.235 ± 0.116
2.879SerAsp: 2.879 ± 0.485
2.703SerGlu: 2.703 ± 0.401
1.645SerPhe: 1.645 ± 0.314
6.581SerGly: 6.581 ± 0.699
0.529SerHis: 0.529 ± 0.151
2.409SerIle: 2.409 ± 0.395
2.585SerLys: 2.585 ± 0.456
3.29SerLeu: 3.29 ± 0.435
1.058SerMet: 1.058 ± 0.214
1.88SerAsn: 1.88 ± 0.433
2.938SerPro: 2.938 ± 0.493
1.88SerGln: 1.88 ± 0.358
2.292SerArg: 2.292 ± 0.345
1.821SerSer: 1.821 ± 0.343
2.585SerThr: 2.585 ± 0.367
3.702SerVal: 3.702 ± 0.443
1.469SerTrp: 1.469 ± 0.328
1.469SerTyr: 1.469 ± 0.37
0.0SerXaa: 0.0 ± 0.0
Thr
7.462ThrAla: 7.462 ± 0.673
0.764ThrCys: 0.764 ± 0.274
2.997ThrAsp: 2.997 ± 0.465
3.173ThrGlu: 3.173 ± 0.432
1.469ThrPhe: 1.469 ± 0.246
5.817ThrGly: 5.817 ± 0.519
0.94ThrHis: 0.94 ± 0.174
2.762ThrIle: 2.762 ± 0.442
2.527ThrLys: 2.527 ± 0.451
4.936ThrLeu: 4.936 ± 0.601
0.823ThrMet: 0.823 ± 0.188
1.528ThrAsn: 1.528 ± 0.33
4.407ThrPro: 4.407 ± 0.466
1.351ThrGln: 1.351 ± 0.249
3.232ThrArg: 3.232 ± 0.358
3.467ThrSer: 3.467 ± 0.48
3.525ThrThr: 3.525 ± 0.416
4.466ThrVal: 4.466 ± 0.533
1.586ThrTrp: 1.586 ± 0.39
2.292ThrTyr: 2.292 ± 0.457
0.0ThrXaa: 0.0 ± 0.0
Val
8.109ValAla: 8.109 ± 0.577
0.47ValCys: 0.47 ± 0.198
3.819ValAsp: 3.819 ± 0.56
3.878ValGlu: 3.878 ± 0.525
1.645ValPhe: 1.645 ± 0.331
5.347ValGly: 5.347 ± 0.73
1.175ValHis: 1.175 ± 0.276
3.643ValIle: 3.643 ± 0.533
3.349ValLys: 3.349 ± 0.464
5.347ValLeu: 5.347 ± 0.637
1.939ValMet: 1.939 ± 0.287
2.409ValAsn: 2.409 ± 0.412
4.054ValPro: 4.054 ± 0.548
1.939ValGln: 1.939 ± 0.35
4.759ValArg: 4.759 ± 0.476
3.114ValSer: 3.114 ± 0.414
4.289ValThr: 4.289 ± 0.692
5.406ValVal: 5.406 ± 0.711
1.998ValTrp: 1.998 ± 0.416
2.644ValTyr: 2.644 ± 0.32
0.0ValXaa: 0.0 ± 0.0
Trp
2.997TrpAla: 2.997 ± 0.482
0.235TrpCys: 0.235 ± 0.099
2.115TrpAsp: 2.115 ± 0.445
0.94TrpGlu: 0.94 ± 0.193
0.705TrpPhe: 0.705 ± 0.255
1.351TrpGly: 1.351 ± 0.239
0.529TrpHis: 0.529 ± 0.198
1.41TrpIle: 1.41 ± 0.277
1.175TrpLys: 1.175 ± 0.234
1.939TrpLeu: 1.939 ± 0.41
0.353TrpMet: 0.353 ± 0.172
0.94TrpAsn: 0.94 ± 0.306
0.823TrpPro: 0.823 ± 0.259
0.764TrpGln: 0.764 ± 0.187
1.351TrpArg: 1.351 ± 0.253
1.645TrpSer: 1.645 ± 0.396
1.41TrpThr: 1.41 ± 0.3
1.41TrpVal: 1.41 ± 0.334
0.588TrpTrp: 0.588 ± 0.21
0.411TrpTyr: 0.411 ± 0.15
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.82TyrAla: 2.82 ± 0.311
0.353TyrCys: 0.353 ± 0.184
1.704TyrAsp: 1.704 ± 0.284
1.704TyrGlu: 1.704 ± 0.371
0.705TyrPhe: 0.705 ± 0.213
2.762TyrGly: 2.762 ± 0.418
0.294TyrHis: 0.294 ± 0.124
0.588TyrIle: 0.588 ± 0.17
1.234TyrLys: 1.234 ± 0.307
1.821TyrLeu: 1.821 ± 0.343
0.529TyrMet: 0.529 ± 0.162
0.94TyrAsn: 0.94 ± 0.208
1.293TyrPro: 1.293 ± 0.403
0.94TyrGln: 0.94 ± 0.224
2.409TyrArg: 2.409 ± 0.382
1.998TyrSer: 1.998 ± 0.331
2.35TyrThr: 2.35 ± 0.519
1.704TyrVal: 1.704 ± 0.276
0.646TyrTrp: 0.646 ± 0.183
0.47TyrTyr: 0.47 ± 0.227
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 82 proteins (17020 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski