Amino acid dipepetide frequency for Shewanella sp. phage 3/49

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.106AlaAla: 10.106 ± 1.138
0.849AlaCys: 0.849 ± 0.235
6.403AlaAsp: 6.403 ± 0.692
5.786AlaGlu: 5.786 ± 0.947
3.317AlaPhe: 3.317 ± 0.486
6.943AlaGly: 6.943 ± 0.932
1.389AlaHis: 1.389 ± 0.321
5.554AlaIle: 5.554 ± 0.718
6.634AlaLys: 6.634 ± 0.837
6.326AlaLeu: 6.326 ± 0.931
3.086AlaMet: 3.086 ± 0.567
5.4AlaAsn: 5.4 ± 0.655
2.854AlaPro: 2.854 ± 0.474
3.24AlaGln: 3.24 ± 0.361
3.394AlaArg: 3.394 ± 0.547
5.631AlaSer: 5.631 ± 0.815
4.551AlaThr: 4.551 ± 0.577
4.937AlaVal: 4.937 ± 0.644
1.003AlaTrp: 1.003 ± 0.272
2.623AlaTyr: 2.623 ± 0.399
0.0AlaXaa: 0.0 ± 0.0
Cys
0.926CysAla: 0.926 ± 0.268
0.231CysCys: 0.231 ± 0.193
0.849CysAsp: 0.849 ± 0.255
1.157CysGlu: 1.157 ± 0.393
0.617CysPhe: 0.617 ± 0.245
1.389CysGly: 1.389 ± 0.23
0.386CysHis: 0.386 ± 0.2
1.08CysIle: 1.08 ± 0.379
1.003CysLys: 1.003 ± 0.293
1.389CysLeu: 1.389 ± 0.37
0.309CysMet: 0.309 ± 0.154
0.463CysAsn: 0.463 ± 0.2
0.617CysPro: 0.617 ± 0.227
0.463CysGln: 0.463 ± 0.173
0.926CysArg: 0.926 ± 0.232
0.926CysSer: 0.926 ± 0.247
1.234CysThr: 1.234 ± 0.284
0.309CysVal: 0.309 ± 0.159
0.309CysTrp: 0.309 ± 0.155
0.386CysTyr: 0.386 ± 0.172
0.0CysXaa: 0.0 ± 0.0
Asp
7.097AspAla: 7.097 ± 0.922
0.926AspCys: 0.926 ± 0.314
4.243AspAsp: 4.243 ± 0.603
4.166AspGlu: 4.166 ± 0.54
2.623AspPhe: 2.623 ± 0.408
5.786AspGly: 5.786 ± 0.793
0.617AspHis: 0.617 ± 0.216
4.474AspIle: 4.474 ± 0.571
5.091AspLys: 5.091 ± 0.628
5.323AspLeu: 5.323 ± 0.678
1.697AspMet: 1.697 ± 0.355
3.626AspAsn: 3.626 ± 0.534
1.08AspPro: 1.08 ± 0.265
1.234AspGln: 1.234 ± 0.296
2.006AspArg: 2.006 ± 0.47
4.474AspSer: 4.474 ± 0.556
3.163AspThr: 3.163 ± 0.405
3.471AspVal: 3.471 ± 0.509
1.389AspTrp: 1.389 ± 0.291
2.16AspTyr: 2.16 ± 0.399
0.0AspXaa: 0.0 ± 0.0
Glu
4.937GluAla: 4.937 ± 0.657
1.234GluCys: 1.234 ± 0.353
2.7GluAsp: 2.7 ± 0.491
2.623GluGlu: 2.623 ± 0.357
4.011GluPhe: 4.011 ± 0.64
2.237GluGly: 2.237 ± 0.537
1.157GluHis: 1.157 ± 0.308
4.474GluIle: 4.474 ± 0.498
3.24GluLys: 3.24 ± 0.622
6.866GluLeu: 6.866 ± 0.838
1.774GluMet: 1.774 ± 0.384
3.24GluAsn: 3.24 ± 0.479
1.697GluPro: 1.697 ± 0.347
2.7GluGln: 2.7 ± 0.487
3.009GluArg: 3.009 ± 0.525
4.551GluSer: 4.551 ± 0.637
2.469GluThr: 2.469 ± 0.517
4.474GluVal: 4.474 ± 0.58
0.926GluTrp: 0.926 ± 0.261
2.777GluTyr: 2.777 ± 0.391
0.0GluXaa: 0.0 ± 0.0
Phe
3.086PheAla: 3.086 ± 0.585
0.849PheCys: 0.849 ± 0.264
4.011PheAsp: 4.011 ± 0.796
3.086PheGlu: 3.086 ± 0.442
0.849PhePhe: 0.849 ± 0.243
3.009PheGly: 3.009 ± 0.524
0.463PheHis: 0.463 ± 0.178
3.163PheIle: 3.163 ± 0.469
2.469PheLys: 2.469 ± 0.472
1.466PheLeu: 1.466 ± 0.404
0.849PheMet: 0.849 ± 0.285
2.546PheAsn: 2.546 ± 0.425
0.694PhePro: 0.694 ± 0.232
0.771PheGln: 0.771 ± 0.237
1.543PheArg: 1.543 ± 0.291
3.163PheSer: 3.163 ± 0.484
3.471PheThr: 3.471 ± 0.528
2.469PheVal: 2.469 ± 0.533
0.54PheTrp: 0.54 ± 0.24
1.234PheTyr: 1.234 ± 0.305
0.0PheXaa: 0.0 ± 0.0
Gly
5.477GlyAla: 5.477 ± 0.675
0.771GlyCys: 0.771 ± 0.258
3.703GlyAsp: 3.703 ± 0.62
4.706GlyGlu: 4.706 ± 0.595
3.009GlyPhe: 3.009 ± 0.567
6.094GlyGly: 6.094 ± 1.102
0.771GlyHis: 0.771 ± 0.256
2.931GlyIle: 2.931 ± 0.483
4.706GlyLys: 4.706 ± 0.657
5.4GlyLeu: 5.4 ± 0.767
1.389GlyMet: 1.389 ± 0.381
4.32GlyAsn: 4.32 ± 0.681
0.849GlyPro: 0.849 ± 0.277
3.163GlyGln: 3.163 ± 0.591
2.546GlyArg: 2.546 ± 0.306
3.934GlySer: 3.934 ± 0.669
3.934GlyThr: 3.934 ± 0.836
5.863GlyVal: 5.863 ± 0.716
1.157GlyTrp: 1.157 ± 0.263
2.854GlyTyr: 2.854 ± 0.411
0.0GlyXaa: 0.0 ± 0.0
His
1.08HisAla: 1.08 ± 0.331
0.617HisCys: 0.617 ± 0.186
1.62HisAsp: 1.62 ± 0.45
0.771HisGlu: 0.771 ± 0.312
0.617HisPhe: 0.617 ± 0.197
0.926HisGly: 0.926 ± 0.227
0.463HisHis: 0.463 ± 0.186
0.926HisIle: 0.926 ± 0.273
0.849HisLys: 0.849 ± 0.3
0.926HisLeu: 0.926 ± 0.26
0.309HisMet: 0.309 ± 0.133
1.157HisAsn: 1.157 ± 0.321
0.617HisPro: 0.617 ± 0.253
0.54HisGln: 0.54 ± 0.225
0.926HisArg: 0.926 ± 0.281
1.003HisSer: 1.003 ± 0.248
0.771HisThr: 0.771 ± 0.272
1.08HisVal: 1.08 ± 0.29
0.154HisTrp: 0.154 ± 0.116
1.003HisTyr: 1.003 ± 0.283
0.0HisXaa: 0.0 ± 0.0
Ile
6.017IleAla: 6.017 ± 0.703
1.08IleCys: 1.08 ± 0.376
5.014IleAsp: 5.014 ± 0.633
4.783IleGlu: 4.783 ± 0.696
1.003IlePhe: 1.003 ± 0.258
3.78IleGly: 3.78 ± 0.535
1.08IleHis: 1.08 ± 0.341
3.549IleIle: 3.549 ± 0.482
5.246IleLys: 5.246 ± 0.873
3.394IleLeu: 3.394 ± 0.483
2.391IleMet: 2.391 ± 0.377
4.937IleAsn: 4.937 ± 0.582
2.083IlePro: 2.083 ± 0.404
1.543IleGln: 1.543 ± 0.382
3.086IleArg: 3.086 ± 0.428
4.706IleSer: 4.706 ± 0.471
4.243IleThr: 4.243 ± 0.55
4.011IleVal: 4.011 ± 0.565
0.463IleTrp: 0.463 ± 0.184
2.623IleTyr: 2.623 ± 0.503
0.0IleXaa: 0.0 ± 0.0
Lys
5.863LysAla: 5.863 ± 1.065
0.771LysCys: 0.771 ± 0.266
3.394LysAsp: 3.394 ± 0.774
4.089LysGlu: 4.089 ± 0.78
2.7LysPhe: 2.7 ± 0.436
2.931LysGly: 2.931 ± 0.439
1.543LysHis: 1.543 ± 0.375
3.857LysIle: 3.857 ± 0.555
4.166LysLys: 4.166 ± 0.663
5.246LysLeu: 5.246 ± 0.639
2.931LysMet: 2.931 ± 0.47
3.549LysAsn: 3.549 ± 0.44
3.317LysPro: 3.317 ± 0.448
3.703LysGln: 3.703 ± 0.477
3.394LysArg: 3.394 ± 0.648
4.551LysSer: 4.551 ± 0.59
4.243LysThr: 4.243 ± 0.691
4.397LysVal: 4.397 ± 0.618
0.926LysTrp: 0.926 ± 0.231
2.16LysTyr: 2.16 ± 0.359
0.0LysXaa: 0.0 ± 0.0
Leu
7.637LeuAla: 7.637 ± 0.918
1.234LeuCys: 1.234 ± 0.313
4.86LeuAsp: 4.86 ± 0.489
4.243LeuGlu: 4.243 ± 0.499
2.546LeuPhe: 2.546 ± 0.529
4.474LeuGly: 4.474 ± 0.756
1.003LeuHis: 1.003 ± 0.375
5.323LeuIle: 5.323 ± 0.538
6.094LeuLys: 6.094 ± 0.757
4.706LeuLeu: 4.706 ± 0.517
1.697LeuMet: 1.697 ± 0.376
3.703LeuAsn: 3.703 ± 0.643
2.854LeuPro: 2.854 ± 0.548
2.083LeuGln: 2.083 ± 0.43
3.394LeuArg: 3.394 ± 0.516
6.403LeuSer: 6.403 ± 0.708
5.631LeuThr: 5.631 ± 0.658
4.551LeuVal: 4.551 ± 0.58
0.386LeuTrp: 0.386 ± 0.174
2.237LeuTyr: 2.237 ± 0.397
0.0LeuXaa: 0.0 ± 0.0
Met
2.777MetAla: 2.777 ± 0.471
0.463MetCys: 0.463 ± 0.201
1.157MetAsp: 1.157 ± 0.274
0.926MetGlu: 0.926 ± 0.233
0.771MetPhe: 0.771 ± 0.258
0.849MetGly: 0.849 ± 0.255
0.694MetHis: 0.694 ± 0.204
1.851MetIle: 1.851 ± 0.453
2.546MetLys: 2.546 ± 0.556
2.469MetLeu: 2.469 ± 0.463
0.463MetMet: 0.463 ± 0.181
1.543MetAsn: 1.543 ± 0.29
1.08MetPro: 1.08 ± 0.225
1.311MetGln: 1.311 ± 0.276
1.157MetArg: 1.157 ± 0.273
2.7MetSer: 2.7 ± 0.461
1.851MetThr: 1.851 ± 0.392
2.006MetVal: 2.006 ± 0.422
0.617MetTrp: 0.617 ± 0.249
0.771MetTyr: 0.771 ± 0.24
0.0MetXaa: 0.0 ± 0.0
Asn
4.474AsnAla: 4.474 ± 0.502
0.463AsnCys: 0.463 ± 0.217
3.086AsnAsp: 3.086 ± 0.493
3.626AsnGlu: 3.626 ± 0.455
1.774AsnPhe: 1.774 ± 0.349
5.014AsnGly: 5.014 ± 0.594
0.849AsnHis: 0.849 ± 0.225
3.394AsnIle: 3.394 ± 0.64
4.706AsnLys: 4.706 ± 0.513
2.777AsnLeu: 2.777 ± 0.505
1.697AsnMet: 1.697 ± 0.279
3.163AsnAsn: 3.163 ± 0.55
2.623AsnPro: 2.623 ± 0.564
2.777AsnGln: 2.777 ± 0.383
1.697AsnArg: 1.697 ± 0.346
3.703AsnSer: 3.703 ± 0.507
3.394AsnThr: 3.394 ± 0.393
3.549AsnVal: 3.549 ± 0.407
0.694AsnTrp: 0.694 ± 0.225
2.006AsnTyr: 2.006 ± 0.436
0.0AsnXaa: 0.0 ± 0.0
Pro
2.931ProAla: 2.931 ± 0.537
0.771ProCys: 0.771 ± 0.29
1.697ProAsp: 1.697 ± 0.373
2.7ProGlu: 2.7 ± 0.453
1.543ProPhe: 1.543 ± 0.343
0.849ProGly: 0.849 ± 0.336
0.617ProHis: 0.617 ± 0.194
1.774ProIle: 1.774 ± 0.342
1.851ProLys: 1.851 ± 0.403
3.009ProLeu: 3.009 ± 0.567
0.926ProMet: 0.926 ± 0.297
1.234ProAsn: 1.234 ± 0.357
0.926ProPro: 0.926 ± 0.305
0.617ProGln: 0.617 ± 0.217
1.311ProArg: 1.311 ± 0.357
2.7ProSer: 2.7 ± 0.476
2.16ProThr: 2.16 ± 0.471
2.546ProVal: 2.546 ± 0.419
0.386ProTrp: 0.386 ± 0.194
0.771ProTyr: 0.771 ± 0.298
0.0ProXaa: 0.0 ± 0.0
Gln
4.089GlnAla: 4.089 ± 0.5
0.154GlnCys: 0.154 ± 0.116
2.391GlnAsp: 2.391 ± 0.352
1.929GlnGlu: 1.929 ± 0.369
1.389GlnPhe: 1.389 ± 0.324
2.777GlnGly: 2.777 ± 0.499
0.849GlnHis: 0.849 ± 0.274
1.851GlnIle: 1.851 ± 0.384
1.311GlnLys: 1.311 ± 0.341
3.24GlnLeu: 3.24 ± 0.467
0.926GlnMet: 0.926 ± 0.325
1.62GlnAsn: 1.62 ± 0.351
1.389GlnPro: 1.389 ± 0.378
1.697GlnGln: 1.697 ± 0.37
1.851GlnArg: 1.851 ± 0.514
3.394GlnSer: 3.394 ± 0.449
1.003GlnThr: 1.003 ± 0.31
2.623GlnVal: 2.623 ± 0.485
0.54GlnTrp: 0.54 ± 0.201
1.62GlnTyr: 1.62 ± 0.325
0.0GlnXaa: 0.0 ± 0.0
Arg
2.777ArgAla: 2.777 ± 0.39
0.849ArgCys: 0.849 ± 0.237
2.546ArgAsp: 2.546 ± 0.456
2.931ArgGlu: 2.931 ± 0.56
2.16ArgPhe: 2.16 ± 0.358
2.237ArgGly: 2.237 ± 0.34
0.617ArgHis: 0.617 ± 0.154
3.317ArgIle: 3.317 ± 0.521
3.163ArgLys: 3.163 ± 0.468
4.32ArgLeu: 4.32 ± 0.561
1.543ArgMet: 1.543 ± 0.362
2.314ArgAsn: 2.314 ± 0.426
0.926ArgPro: 0.926 ± 0.243
1.311ArgGln: 1.311 ± 0.29
1.774ArgArg: 1.774 ± 0.532
2.391ArgSer: 2.391 ± 0.464
2.314ArgThr: 2.314 ± 0.353
3.009ArgVal: 3.009 ± 0.395
0.463ArgTrp: 0.463 ± 0.213
0.926ArgTyr: 0.926 ± 0.281
0.0ArgXaa: 0.0 ± 0.0
Ser
6.943SerAla: 6.943 ± 0.892
1.311SerCys: 1.311 ± 0.376
4.706SerAsp: 4.706 ± 0.47
4.32SerGlu: 4.32 ± 0.61
2.7SerPhe: 2.7 ± 0.442
5.863SerGly: 5.863 ± 0.731
1.466SerHis: 1.466 ± 0.334
5.94SerIle: 5.94 ± 0.662
4.551SerLys: 4.551 ± 0.441
5.554SerLeu: 5.554 ± 0.692
1.543SerMet: 1.543 ± 0.267
4.32SerAsn: 4.32 ± 0.519
2.16SerPro: 2.16 ± 0.474
3.009SerGln: 3.009 ± 0.462
2.391SerArg: 2.391 ± 0.497
5.014SerSer: 5.014 ± 1.039
3.549SerThr: 3.549 ± 0.588
4.706SerVal: 4.706 ± 0.679
0.617SerTrp: 0.617 ± 0.218
2.006SerTyr: 2.006 ± 0.435
0.0SerXaa: 0.0 ± 0.0
Thr
5.323ThrAla: 5.323 ± 0.83
0.694ThrCys: 0.694 ± 0.238
4.32ThrAsp: 4.32 ± 0.434
3.009ThrGlu: 3.009 ± 0.444
2.006ThrPhe: 2.006 ± 0.375
4.551ThrGly: 4.551 ± 0.792
0.849ThrHis: 0.849 ± 0.267
4.243ThrIle: 4.243 ± 0.548
4.166ThrLys: 4.166 ± 0.603
4.474ThrLeu: 4.474 ± 0.631
1.234ThrMet: 1.234 ± 0.28
2.7ThrAsn: 2.7 ± 0.483
2.7ThrPro: 2.7 ± 0.538
2.083ThrGln: 2.083 ± 0.416
2.391ThrArg: 2.391 ± 0.348
4.166ThrSer: 4.166 ± 0.822
3.78ThrThr: 3.78 ± 0.646
4.551ThrVal: 4.551 ± 0.836
0.771ThrTrp: 0.771 ± 0.234
1.774ThrTyr: 1.774 ± 0.363
0.0ThrXaa: 0.0 ± 0.0
Val
4.86ValAla: 4.86 ± 0.698
0.771ValCys: 0.771 ± 0.284
5.246ValAsp: 5.246 ± 0.504
3.857ValGlu: 3.857 ± 0.501
3.086ValPhe: 3.086 ± 0.569
4.243ValGly: 4.243 ± 0.675
0.463ValHis: 0.463 ± 0.197
4.397ValIle: 4.397 ± 0.598
3.549ValLys: 3.549 ± 0.516
4.629ValLeu: 4.629 ± 0.588
2.006ValMet: 2.006 ± 0.418
3.626ValAsn: 3.626 ± 0.537
1.311ValPro: 1.311 ± 0.318
1.851ValGln: 1.851 ± 0.4
2.854ValArg: 2.854 ± 0.583
6.48ValSer: 6.48 ± 0.871
5.014ValThr: 5.014 ± 0.933
4.706ValVal: 4.706 ± 0.582
0.771ValTrp: 0.771 ± 0.251
1.774ValTyr: 1.774 ± 0.35
0.0ValXaa: 0.0 ± 0.0
Trp
0.849TrpAla: 0.849 ± 0.263
0.231TrpCys: 0.231 ± 0.125
0.617TrpAsp: 0.617 ± 0.202
0.694TrpGlu: 0.694 ± 0.209
0.926TrpPhe: 0.926 ± 0.271
0.463TrpGly: 0.463 ± 0.192
0.309TrpHis: 0.309 ± 0.193
0.926TrpIle: 0.926 ± 0.265
0.849TrpLys: 0.849 ± 0.277
0.926TrpLeu: 0.926 ± 0.254
0.617TrpMet: 0.617 ± 0.192
0.463TrpAsn: 0.463 ± 0.195
0.463TrpPro: 0.463 ± 0.187
0.771TrpGln: 0.771 ± 0.235
0.849TrpArg: 0.849 ± 0.252
0.849TrpSer: 0.849 ± 0.315
0.54TrpThr: 0.54 ± 0.174
0.849TrpVal: 0.849 ± 0.268
0.077TrpTrp: 0.077 ± 0.074
0.309TrpTyr: 0.309 ± 0.155
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.777TyrAla: 2.777 ± 0.42
0.694TyrCys: 0.694 ± 0.244
2.083TyrAsp: 2.083 ± 0.35
1.697TyrGlu: 1.697 ± 0.386
2.083TyrPhe: 2.083 ± 0.427
3.163TyrGly: 3.163 ± 0.643
0.771TyrHis: 0.771 ± 0.239
2.006TyrIle: 2.006 ± 0.429
1.543TyrLys: 1.543 ± 0.302
2.623TyrLeu: 2.623 ± 0.425
0.54TyrMet: 0.54 ± 0.174
1.543TyrAsn: 1.543 ± 0.329
1.08TyrPro: 1.08 ± 0.315
1.774TyrGln: 1.774 ± 0.378
1.466TyrArg: 1.466 ± 0.384
1.929TyrSer: 1.929 ± 0.386
2.469TyrThr: 2.469 ± 0.407
1.389TyrVal: 1.389 ± 0.359
0.386TyrTrp: 0.386 ± 0.186
1.929TyrTyr: 1.929 ± 0.509
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 70 proteins (12964 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski