Amino acid dipeptide frequency for Salmonella phage celemicas

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides were equally likely in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
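The counting described above can be sketched as follows. This is a minimal illustration, not the database's own code: the function names, the sliding-window counting within each protein, and the mapping of every non-standard residue to X (Xaa) are assumptions, as is the use of the uniform 1/400 baseline for the over/under classification. The input sequences are toy examples.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins.

    Pairs are counted with a sliding window of width 2 inside each
    protein, so no pair spans two different proteins (an assumption).
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found in the input")
    alphabet = STANDARD + "X"
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(alphabet, repeat=2)}


# Uniform baseline: 1/400 of all dipeptides = 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / len(STANDARD) ** 2


def classify(freq_permille, expected=EXPECTED_PERMILLE):
    """Label a frequency as over- or under-represented vs. the baseline."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"


# Toy input (hypothetical sequences, not taken from the phage proteome).
toy = ["MAAAK", "MKAA"]
freqs = dipeptide_permille(toy)
print(round(freqs["AA"], 1), classify(freqs["AA"]))
```

With this baseline, a table value such as AlaAla (11.211‰) would be classified as over-represented and TrpCys (0.076‰) as under-represented.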

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
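As a small worked example of the unit conversion, the sketch below converts two values from the table (AlaAla and TrpCys) to fractions and percentages. The helper names are hypothetical and chosen only for this illustration.

```python
def permille_to_fraction(value_permille):
    """Per mille -> plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Per mille -> percent (divide by 10)."""
    return value_permille / 10.0


def percent_to_permille(value_percent):
    """Percent -> per mille (multiply by 10)."""
    return value_percent * 10.0


# Values copied from the table: AlaAla and TrpCys frequencies in per mille.
for name, value in [("AlaAla", 11.211), ("TrpCys", 0.076)]:
    fraction = permille_to_fraction(value)
    percent = permille_to_percent(value)
    print(f"{name}: {value} per mille = {fraction:.6f} = {percent:.4f}%")

# The uniform expectation of 0.25% corresponds to 2.5 per mille.
print(percent_to_permille(0.25))
```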

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 11.211 ± 1.553
AlaCys: 1.136 ± 0.289
AlaAsp: 5.984 ± 0.664
AlaGlu: 6.59 ± 0.862
AlaPhe: 3.636 ± 0.536
AlaGly: 7.348 ± 0.778
AlaHis: 1.818 ± 0.389
AlaIle: 3.485 ± 0.656
AlaLys: 5.681 ± 0.856
AlaLeu: 7.424 ± 0.749
AlaMet: 2.424 ± 0.469
AlaAsn: 3.333 ± 0.481
AlaPro: 3.182 ± 0.561
AlaGln: 3.409 ± 0.694
AlaArg: 4.394 ± 0.53
AlaSer: 5.909 ± 0.874
AlaThr: 5.909 ± 0.71
AlaVal: 8.03 ± 0.804
AlaTrp: 0.985 ± 0.233
AlaTyr: 3.485 ± 0.458
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.682 ± 0.187
CysCys: 0.152 ± 0.127
CysAsp: 0.909 ± 0.265
CysGlu: 1.136 ± 0.317
CysPhe: 0.455 ± 0.172
CysGly: 0.833 ± 0.249
CysHis: 0.227 ± 0.135
CysIle: 0.303 ± 0.14
CysLys: 1.061 ± 0.267
CysLeu: 0.833 ± 0.266
CysMet: 0.455 ± 0.187
CysAsn: 0.682 ± 0.255
CysPro: 0.303 ± 0.158
CysGln: 0.227 ± 0.16
CysArg: 0.53 ± 0.193
CysSer: 0.303 ± 0.13
CysThr: 0.379 ± 0.145
CysVal: 0.682 ± 0.213
CysTrp: 0.379 ± 0.149
CysTyr: 0.303 ± 0.165
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.666 ± 0.691
AspCys: 0.682 ± 0.229
AspAsp: 3.788 ± 0.535
AspGlu: 3.939 ± 0.512
AspPhe: 3.182 ± 0.46
AspGly: 5.606 ± 0.705
AspHis: 0.758 ± 0.234
AspIle: 3.788 ± 0.472
AspLys: 3.409 ± 0.477
AspLeu: 5.0 ± 0.533
AspMet: 1.439 ± 0.252
AspAsn: 3.03 ± 0.486
AspPro: 2.121 ± 0.422
AspGln: 0.606 ± 0.197
AspArg: 2.651 ± 0.359
AspSer: 3.56 ± 0.44
AspThr: 4.015 ± 0.432
AspVal: 3.788 ± 0.472
AspTrp: 0.909 ± 0.228
AspTyr: 1.97 ± 0.495
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.196 ± 0.881
GluCys: 0.303 ± 0.145
GluAsp: 3.939 ± 0.468
GluGlu: 4.848 ± 0.916
GluPhe: 3.257 ± 0.689
GluGly: 4.697 ± 0.691
GluHis: 1.136 ± 0.277
GluIle: 3.485 ± 0.413
GluLys: 4.015 ± 0.616
GluLeu: 6.06 ± 0.786
GluMet: 2.954 ± 0.468
GluAsn: 2.5 ± 0.458
GluPro: 1.894 ± 0.53
GluGln: 3.485 ± 0.572
GluArg: 3.788 ± 0.693
GluSer: 3.712 ± 0.548
GluThr: 3.939 ± 0.548
GluVal: 4.772 ± 0.532
GluTrp: 0.985 ± 0.293
GluTyr: 1.97 ± 0.446
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.424 ± 0.425
PheCys: 0.53 ± 0.185
PheAsp: 3.03 ± 0.501
PheGlu: 3.03 ± 0.475
PhePhe: 0.682 ± 0.2
PheGly: 2.879 ± 0.389
PheHis: 0.682 ± 0.198
PheIle: 2.5 ± 0.486
PheLys: 1.97 ± 0.384
PheLeu: 2.197 ± 0.391
PheMet: 0.606 ± 0.226
PheAsn: 1.364 ± 0.327
PhePro: 1.439 ± 0.383
PheGln: 1.515 ± 0.324
PheArg: 2.045 ± 0.296
PheSer: 2.576 ± 0.591
PheThr: 3.409 ± 0.519
PheVal: 2.576 ± 0.456
PheTrp: 0.909 ± 0.282
PheTyr: 0.985 ± 0.259
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.969 ± 0.681
GlyCys: 0.909 ± 0.293
GlyAsp: 4.091 ± 0.647
GlyGlu: 5.984 ± 0.834
GlyPhe: 2.803 ± 0.526
GlyGly: 6.136 ± 0.754
GlyHis: 1.515 ± 0.419
GlyIle: 3.712 ± 0.466
GlyLys: 5.227 ± 0.544
GlyLeu: 5.606 ± 0.587
GlyMet: 2.121 ± 0.517
GlyAsn: 3.712 ± 0.622
GlyPro: 1.818 ± 0.362
GlyGln: 3.182 ± 0.491
GlyArg: 4.166 ± 0.477
GlySer: 5.0 ± 0.681
GlyThr: 3.56 ± 0.473
GlyVal: 5.833 ± 0.659
GlyTrp: 1.212 ± 0.294
GlyTyr: 2.954 ± 0.438
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.136 ± 0.316
HisCys: 0.303 ± 0.156
HisAsp: 1.136 ± 0.241
HisGlu: 0.985 ± 0.273
HisPhe: 0.682 ± 0.204
HisGly: 1.212 ± 0.354
HisHis: 0.758 ± 0.287
HisIle: 1.061 ± 0.304
HisLys: 1.212 ± 0.281
HisLeu: 0.909 ± 0.233
HisMet: 0.455 ± 0.171
HisAsn: 0.455 ± 0.184
HisPro: 0.909 ± 0.334
HisGln: 0.985 ± 0.232
HisArg: 0.833 ± 0.227
HisSer: 1.061 ± 0.262
HisThr: 0.758 ± 0.302
HisVal: 0.758 ± 0.225
HisTrp: 0.152 ± 0.118
HisTyr: 0.833 ± 0.268
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.394 ± 0.797
IleCys: 0.758 ± 0.225
IleAsp: 3.863 ± 0.529
IleGlu: 2.651 ± 0.515
IlePhe: 1.288 ± 0.291
IleGly: 3.636 ± 0.406
IleHis: 0.53 ± 0.2
IleIle: 2.576 ± 0.516
IleLys: 2.954 ± 0.49
IleLeu: 2.803 ± 0.415
IleMet: 1.061 ± 0.346
IleAsn: 2.197 ± 0.427
IlePro: 2.879 ± 0.432
IleGln: 1.97 ± 0.427
IleArg: 2.879 ± 0.416
IleSer: 3.182 ± 0.455
IleThr: 4.848 ± 0.651
IleVal: 3.03 ± 0.395
IleTrp: 0.758 ± 0.232
IleTyr: 1.439 ± 0.324
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.303 ± 0.813
LysCys: 0.909 ± 0.32
LysAsp: 3.712 ± 0.516
LysGlu: 4.621 ± 0.755
LysPhe: 2.121 ± 0.302
LysGly: 3.788 ± 0.384
LysHis: 0.909 ± 0.231
LysIle: 1.818 ± 0.337
LysLys: 3.409 ± 0.598
LysLeu: 5.681 ± 0.667
LysMet: 2.803 ± 0.564
LysAsn: 2.424 ± 0.419
LysPro: 2.5 ± 0.562
LysGln: 2.197 ± 0.428
LysArg: 3.788 ± 0.682
LysSer: 3.106 ± 0.596
LysThr: 3.712 ± 0.542
LysVal: 3.788 ± 0.615
LysTrp: 0.606 ± 0.172
LysTyr: 2.651 ± 0.429
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.742 ± 0.588
LeuCys: 0.985 ± 0.262
LeuAsp: 4.015 ± 0.634
LeuGlu: 5.075 ± 0.784
LeuPhe: 1.97 ± 0.432
LeuGly: 4.772 ± 0.528
LeuHis: 1.288 ± 0.36
LeuIle: 4.545 ± 0.53
LeuLys: 5.227 ± 0.685
LeuLeu: 6.363 ± 0.692
LeuMet: 1.818 ± 0.347
LeuAsn: 4.318 ± 0.649
LeuPro: 3.788 ± 0.548
LeuGln: 2.651 ± 0.522
LeuArg: 5.227 ± 0.686
LeuSer: 4.621 ± 0.597
LeuThr: 5.227 ± 0.455
LeuVal: 5.909 ± 0.54
LeuTrp: 1.136 ± 0.334
LeuTyr: 2.273 ± 0.382
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.197 ± 0.335
MetCys: 0.227 ± 0.113
MetAsp: 1.136 ± 0.302
MetGlu: 1.364 ± 0.283
MetPhe: 1.061 ± 0.322
MetGly: 1.667 ± 0.347
MetHis: 0.303 ± 0.135
MetIle: 1.061 ± 0.233
MetLys: 1.515 ± 0.334
MetLeu: 1.97 ± 0.436
MetMet: 0.379 ± 0.177
MetAsn: 1.212 ± 0.308
MetPro: 1.364 ± 0.333
MetGln: 0.909 ± 0.241
MetArg: 1.667 ± 0.294
MetSer: 2.121 ± 0.384
MetThr: 2.348 ± 0.4
MetVal: 1.818 ± 0.308
MetTrp: 0.53 ± 0.145
MetTyr: 0.758 ± 0.251
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.788 ± 0.518
AsnCys: 0.53 ± 0.184
AsnAsp: 2.803 ± 0.421
AsnGlu: 2.197 ± 0.359
AsnPhe: 1.818 ± 0.343
AsnGly: 4.469 ± 0.613
AsnHis: 0.682 ± 0.215
AsnIle: 2.879 ± 0.428
AsnLys: 2.197 ± 0.462
AsnLeu: 3.788 ± 0.389
AsnMet: 0.379 ± 0.182
AsnAsn: 2.121 ± 0.482
AsnPro: 1.742 ± 0.343
AsnGln: 1.439 ± 0.346
AsnArg: 2.5 ± 0.342
AsnSer: 2.045 ± 0.311
AsnThr: 2.121 ± 0.431
AsnVal: 3.712 ± 0.489
AsnTrp: 0.909 ± 0.249
AsnTyr: 1.364 ± 0.307
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.879 ± 0.495
ProCys: 0.227 ± 0.133
ProAsp: 2.727 ± 0.492
ProGlu: 3.863 ± 0.387
ProPhe: 1.667 ± 0.328
ProGly: 3.03 ± 0.477
ProHis: 0.455 ± 0.178
ProIle: 1.591 ± 0.42
ProLys: 2.727 ± 0.441
ProLeu: 3.409 ± 0.467
ProMet: 0.833 ± 0.238
ProAsn: 1.439 ± 0.431
ProPro: 1.364 ± 0.351
ProGln: 1.212 ± 0.267
ProArg: 1.742 ± 0.345
ProSer: 2.273 ± 0.362
ProThr: 1.591 ± 0.296
ProVal: 3.863 ± 0.515
ProTrp: 0.455 ± 0.226
ProTyr: 1.515 ± 0.396
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.242 ± 0.643
GlnCys: 0.379 ± 0.149
GlnAsp: 1.364 ± 0.265
GlnGlu: 2.121 ± 0.539
GlnPhe: 1.591 ± 0.375
GlnGly: 2.197 ± 0.409
GlnHis: 0.682 ± 0.23
GlnIle: 1.742 ± 0.359
GlnLys: 1.742 ± 0.352
GlnLeu: 3.03 ± 0.525
GlnMet: 1.288 ± 0.34
GlnAsn: 1.591 ± 0.329
GlnPro: 1.97 ± 0.279
GlnGln: 2.273 ± 0.618
GlnArg: 1.742 ± 0.348
GlnSer: 2.273 ± 0.398
GlnThr: 1.667 ± 0.337
GlnVal: 2.803 ± 0.485
GlnTrp: 0.606 ± 0.179
GlnTyr: 1.364 ± 0.272
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.469 ± 0.391
ArgCys: 0.303 ± 0.141
ArgAsp: 3.485 ± 0.419
ArgGlu: 4.166 ± 0.499
ArgPhe: 1.742 ± 0.432
ArgGly: 3.56 ± 0.494
ArgHis: 0.985 ± 0.269
ArgIle: 3.182 ± 0.463
ArgLys: 3.409 ± 0.654
ArgLeu: 3.712 ± 0.468
ArgMet: 1.894 ± 0.407
ArgAsn: 2.879 ± 0.42
ArgPro: 1.818 ± 0.401
ArgGln: 2.727 ± 0.443
ArgArg: 4.394 ± 0.689
ArgSer: 2.273 ± 0.434
ArgThr: 2.879 ± 0.497
ArgVal: 4.318 ± 0.553
ArgTrp: 0.985 ± 0.267
ArgTyr: 1.591 ± 0.38
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.59 ± 1.104
SerCys: 0.303 ± 0.119
SerAsp: 3.257 ± 0.446
SerGlu: 3.333 ± 0.464
SerPhe: 2.348 ± 0.394
SerGly: 7.045 ± 0.771
SerHis: 0.833 ± 0.197
SerIle: 2.197 ± 0.423
SerLys: 2.727 ± 0.516
SerLeu: 5.075 ± 0.458
SerMet: 1.364 ± 0.308
SerAsn: 2.727 ± 0.303
SerPro: 2.121 ± 0.489
SerGln: 2.197 ± 0.374
SerArg: 2.803 ± 0.416
SerSer: 2.651 ± 0.454
SerThr: 3.939 ± 0.56
SerVal: 5.53 ± 0.567
SerTrp: 0.833 ± 0.209
SerTyr: 1.97 ± 0.345
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.818 ± 0.64
ThrCys: 0.682 ± 0.204
ThrAsp: 4.545 ± 0.634
ThrGlu: 3.182 ± 0.512
ThrPhe: 2.879 ± 0.452
ThrGly: 6.06 ± 0.816
ThrHis: 0.758 ± 0.236
ThrIle: 3.106 ± 0.447
ThrLys: 2.803 ± 0.466
ThrLeu: 5.454 ± 0.628
ThrMet: 1.061 ± 0.279
ThrAsn: 2.045 ± 0.41
ThrPro: 3.788 ± 0.486
ThrGln: 1.591 ± 0.306
ThrArg: 2.651 ± 0.429
ThrSer: 4.318 ± 0.626
ThrThr: 3.863 ± 0.474
ThrVal: 4.469 ± 0.644
ThrTrp: 0.985 ± 0.266
ThrTyr: 2.424 ± 0.467
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.348 ± 0.831
ValCys: 1.061 ± 0.247
ValAsp: 4.091 ± 0.46
ValGlu: 6.515 ± 0.638
ValPhe: 2.197 ± 0.406
ValGly: 3.863 ± 0.55
ValHis: 1.212 ± 0.243
ValIle: 4.469 ± 0.633
ValLys: 5.454 ± 0.691
ValLeu: 4.545 ± 0.53
ValMet: 0.909 ± 0.267
ValAsn: 3.485 ± 0.586
ValPro: 2.273 ± 0.641
ValGln: 2.045 ± 0.365
ValArg: 3.257 ± 0.457
ValSer: 6.212 ± 0.762
ValThr: 5.984 ± 0.648
ValVal: 5.681 ± 0.786
ValTrp: 1.061 ± 0.25
ValTyr: 2.651 ± 0.431
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.364 ± 0.396
TrpCys: 0.076 ± 0.074
TrpAsp: 0.909 ± 0.237
TrpGlu: 0.682 ± 0.217
TrpPhe: 0.833 ± 0.303
TrpGly: 0.909 ± 0.288
TrpHis: 0.379 ± 0.243
TrpIle: 0.682 ± 0.259
TrpLys: 0.455 ± 0.202
TrpLeu: 1.667 ± 0.409
TrpMet: 0.53 ± 0.244
TrpAsn: 0.53 ± 0.216
TrpPro: 0.379 ± 0.21
TrpGln: 0.682 ± 0.215
TrpArg: 1.439 ± 0.42
TrpSer: 0.53 ± 0.174
TrpThr: 0.833 ± 0.208
TrpVal: 1.212 ± 0.256
TrpTrp: 0.227 ± 0.126
TrpTyr: 0.379 ± 0.182
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.03 ± 0.53
TyrCys: 0.379 ± 0.168
TyrAsp: 2.121 ± 0.493
TyrGlu: 2.651 ± 0.508
TyrPhe: 1.136 ± 0.335
TyrGly: 2.803 ± 0.44
TyrHis: 0.682 ± 0.193
TyrIle: 1.515 ± 0.329
TyrLys: 2.651 ± 0.389
TyrLeu: 2.424 ± 0.412
TyrMet: 0.758 ± 0.204
TyrAsn: 1.439 ± 0.34
TyrPro: 1.212 ± 0.314
TyrGln: 1.364 ± 0.333
TyrArg: 2.273 ± 0.467
TyrSer: 2.121 ± 0.385
TyrThr: 2.5 ± 0.398
TyrVal: 1.742 ± 0.36
TyrTrp: 0.076 ± 0.08
TyrTyr: 1.439 ± 0.324
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 67 proteins (13202 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
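One way such a protein-level bootstrap could look is sketched below. The source states only that bootstrapping was repeated 100 times at the protein level, so several details here are assumptions: whole proteins are resampled with replacement, the reported error is taken to be the standard deviation of the resampled frequencies, and all function and parameter names are hypothetical.

```python
import random
import statistics


def pair_permille(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide across a list of sequences."""
    count = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                count += 1
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Each resample draws len(proteins) whole proteins with replacement,
    so the resampling unit is the protein, not the individual residue.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, dipeptide))
    return statistics.stdev(estimates)


# Toy input (hypothetical sequences, not taken from the phage proteome).
toy = ["MAAAK", "MKAA", "MGGGG"]
print(pair_permille(toy, "AA"), bootstrap_error(toy, "AA"))
```

Resampling whole proteins rather than individual dipeptides keeps the pairs within each protein together, so the spread reflects variation between proteins.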

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski