Amino acid dipepetide frequency for Enterococcus phage LY0322

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.324AlaAla: 0.324 ± 0.153
0.243AlaCys: 0.243 ± 0.152
2.594AlaAsp: 2.594 ± 0.446
3.485AlaGlu: 3.485 ± 0.47
2.675AlaPhe: 2.675 ± 0.574
2.756AlaGly: 2.756 ± 0.459
0.811AlaHis: 0.811 ± 0.256
5.431AlaIle: 5.431 ± 1.015
5.836AlaLys: 5.836 ± 0.725
5.026AlaLeu: 5.026 ± 0.777
2.756AlaMet: 2.756 ± 0.454
4.215AlaAsn: 4.215 ± 0.544
1.783AlaPro: 1.783 ± 0.292
1.945AlaGln: 1.945 ± 0.358
1.864AlaArg: 1.864 ± 0.397
3.08AlaSer: 3.08 ± 0.495
4.458AlaThr: 4.458 ± 0.628
4.296AlaVal: 4.296 ± 0.624
0.567AlaTrp: 0.567 ± 0.187
2.594AlaTyr: 2.594 ± 0.428
0.0AlaXaa: 0.0 ± 0.0
Cys
0.486CysAla: 0.486 ± 0.236
0.0CysCys: 0.0 ± 0.0
0.243CysAsp: 0.243 ± 0.165
0.811CysGlu: 0.811 ± 0.294
0.081CysPhe: 0.081 ± 0.084
0.162CysGly: 0.162 ± 0.09
0.162CysHis: 0.162 ± 0.114
0.405CysIle: 0.405 ± 0.202
0.973CysLys: 0.973 ± 0.289
0.648CysLeu: 0.648 ± 0.262
0.081CysMet: 0.081 ± 0.073
0.648CysAsn: 0.648 ± 0.264
0.0CysPro: 0.0 ± 0.0
0.162CysGln: 0.162 ± 0.116
0.243CysArg: 0.243 ± 0.154
0.405CysSer: 0.405 ± 0.176
0.405CysThr: 0.405 ± 0.206
0.73CysVal: 0.73 ± 0.245
0.081CysTrp: 0.081 ± 0.07
0.162CysTyr: 0.162 ± 0.122
0.0CysXaa: 0.0 ± 0.0
Asp
3.323AspAla: 3.323 ± 0.668
0.324AspCys: 0.324 ± 0.171
2.107AspAsp: 2.107 ± 0.437
4.539AspGlu: 4.539 ± 0.691
3.404AspPhe: 3.404 ± 0.439
4.782AspGly: 4.782 ± 0.556
0.567AspHis: 0.567 ± 0.206
4.782AspIle: 4.782 ± 0.758
5.755AspLys: 5.755 ± 0.859
4.701AspLeu: 4.701 ± 0.516
1.702AspMet: 1.702 ± 0.441
4.944AspAsn: 4.944 ± 0.724
1.783AspPro: 1.783 ± 0.449
1.459AspGln: 1.459 ± 0.294
1.783AspArg: 1.783 ± 0.36
2.999AspSer: 2.999 ± 0.426
3.729AspThr: 3.729 ± 0.725
3.485AspVal: 3.485 ± 0.444
0.73AspTrp: 0.73 ± 0.219
3.161AspTyr: 3.161 ± 0.584
0.0AspXaa: 0.0 ± 0.0
Glu
4.863GluAla: 4.863 ± 0.57
1.054GluCys: 1.054 ± 0.401
4.296GluAsp: 4.296 ± 0.961
7.133GluGlu: 7.133 ± 1.539
3.161GluPhe: 3.161 ± 0.611
5.188GluGly: 5.188 ± 0.755
1.297GluHis: 1.297 ± 0.307
3.81GluIle: 3.81 ± 0.557
6.241GluLys: 6.241 ± 0.561
8.592GluLeu: 8.592 ± 1.004
2.513GluMet: 2.513 ± 0.539
3.485GluAsn: 3.485 ± 0.45
2.351GluPro: 2.351 ± 0.54
3.08GluGln: 3.08 ± 0.535
3.323GluArg: 3.323 ± 0.528
3.404GluSer: 3.404 ± 0.499
4.458GluThr: 4.458 ± 0.639
5.593GluVal: 5.593 ± 0.77
1.783GluTrp: 1.783 ± 0.342
3.485GluTyr: 3.485 ± 0.685
0.0GluXaa: 0.0 ± 0.0
Phe
1.945PheAla: 1.945 ± 0.326
0.324PheCys: 0.324 ± 0.149
2.675PheAsp: 2.675 ± 0.544
3.404PheGlu: 3.404 ± 0.556
1.216PhePhe: 1.216 ± 0.403
2.999PheGly: 2.999 ± 0.617
0.243PheHis: 0.243 ± 0.166
4.458PheIle: 4.458 ± 0.718
4.377PheLys: 4.377 ± 0.567
2.026PheLeu: 2.026 ± 0.278
1.135PheMet: 1.135 ± 0.3
3.648PheAsn: 3.648 ± 0.505
0.73PhePro: 0.73 ± 0.221
2.107PheGln: 2.107 ± 0.423
1.702PheArg: 1.702 ± 0.399
2.189PheSer: 2.189 ± 0.455
3.485PheThr: 3.485 ± 0.706
2.432PheVal: 2.432 ± 0.428
0.405PheTrp: 0.405 ± 0.158
0.973PheTyr: 0.973 ± 0.218
0.0PheXaa: 0.0 ± 0.0
Gly
3.972GlyAla: 3.972 ± 1.175
0.243GlyCys: 0.243 ± 0.129
3.404GlyAsp: 3.404 ± 0.515
3.972GlyGlu: 3.972 ± 0.666
3.729GlyPhe: 3.729 ± 0.484
4.863GlyGly: 4.863 ± 1.282
0.486GlyHis: 0.486 ± 0.195
5.107GlyIle: 5.107 ± 0.931
5.512GlyLys: 5.512 ± 0.886
6.809GlyLeu: 6.809 ± 1.062
1.945GlyMet: 1.945 ± 0.391
3.891GlyAsn: 3.891 ± 0.541
0.567GlyPro: 0.567 ± 0.288
1.945GlyGln: 1.945 ± 0.375
2.999GlyArg: 2.999 ± 0.466
3.81GlySer: 3.81 ± 0.544
4.215GlyThr: 4.215 ± 0.837
4.539GlyVal: 4.539 ± 0.713
0.973GlyTrp: 0.973 ± 0.226
3.161GlyTyr: 3.161 ± 0.542
0.0GlyXaa: 0.0 ± 0.0
His
0.648HisAla: 0.648 ± 0.222
0.162HisCys: 0.162 ± 0.131
0.648HisAsp: 0.648 ± 0.294
1.054HisGlu: 1.054 ± 0.313
0.892HisPhe: 0.892 ± 0.316
1.297HisGly: 1.297 ± 0.406
0.324HisHis: 0.324 ± 0.173
1.054HisIle: 1.054 ± 0.247
1.621HisLys: 1.621 ± 0.35
1.216HisLeu: 1.216 ± 0.361
0.243HisMet: 0.243 ± 0.161
0.73HisAsn: 0.73 ± 0.237
0.162HisPro: 0.162 ± 0.123
0.486HisGln: 0.486 ± 0.152
1.216HisArg: 1.216 ± 0.403
0.648HisSer: 0.648 ± 0.246
0.973HisThr: 0.973 ± 0.358
1.054HisVal: 1.054 ± 0.284
0.081HisTrp: 0.081 ± 0.089
0.811HisTyr: 0.811 ± 0.278
0.0HisXaa: 0.0 ± 0.0
Ile
4.053IleAla: 4.053 ± 0.526
0.243IleCys: 0.243 ± 0.159
5.755IleAsp: 5.755 ± 0.463
7.295IleGlu: 7.295 ± 0.954
2.189IlePhe: 2.189 ± 0.605
3.972IleGly: 3.972 ± 0.529
0.892IleHis: 0.892 ± 0.273
4.458IleIle: 4.458 ± 0.473
6.566IleLys: 6.566 ± 0.694
4.944IleLeu: 4.944 ± 0.667
1.459IleMet: 1.459 ± 0.452
4.701IleAsn: 4.701 ± 0.74
3.08IlePro: 3.08 ± 0.356
2.837IleGln: 2.837 ± 0.435
2.107IleArg: 2.107 ± 0.471
4.62IleSer: 4.62 ± 0.609
3.891IleThr: 3.891 ± 0.546
3.891IleVal: 3.891 ± 0.455
0.892IleTrp: 0.892 ± 0.245
1.378IleTyr: 1.378 ± 0.37
0.0IleXaa: 0.0 ± 0.0
Lys
6.728LysAla: 6.728 ± 0.772
0.405LysCys: 0.405 ± 0.183
5.836LysAsp: 5.836 ± 0.545
8.268LysGlu: 8.268 ± 0.894
3.485LysPhe: 3.485 ± 0.446
4.782LysGly: 4.782 ± 0.908
1.621LysHis: 1.621 ± 0.326
4.296LysIle: 4.296 ± 0.696
5.188LysLys: 5.188 ± 0.732
6.971LysLeu: 6.971 ± 0.819
3.323LysMet: 3.323 ± 0.456
5.836LysAsn: 5.836 ± 0.779
3.567LysPro: 3.567 ± 0.579
3.567LysGln: 3.567 ± 0.539
4.215LysArg: 4.215 ± 0.581
4.944LysSer: 4.944 ± 0.942
4.944LysThr: 4.944 ± 0.626
5.998LysVal: 5.998 ± 0.55
1.297LysTrp: 1.297 ± 0.295
3.81LysTyr: 3.81 ± 0.512
0.0LysXaa: 0.0 ± 0.0
Leu
3.81LeuAla: 3.81 ± 0.606
0.567LeuCys: 0.567 ± 0.217
5.998LeuAsp: 5.998 ± 0.584
7.538LeuGlu: 7.538 ± 0.912
3.323LeuPhe: 3.323 ± 0.564
5.512LeuGly: 5.512 ± 0.994
1.054LeuHis: 1.054 ± 0.309
4.539LeuIle: 4.539 ± 0.677
7.538LeuLys: 7.538 ± 0.701
6.241LeuLeu: 6.241 ± 0.989
1.54LeuMet: 1.54 ± 0.358
5.998LeuAsn: 5.998 ± 0.682
2.918LeuPro: 2.918 ± 0.448
3.972LeuGln: 3.972 ± 0.41
2.594LeuArg: 2.594 ± 0.464
4.377LeuSer: 4.377 ± 0.59
4.539LeuThr: 4.539 ± 0.584
5.998LeuVal: 5.998 ± 0.845
0.973LeuTrp: 0.973 ± 0.263
2.432LeuTyr: 2.432 ± 0.479
0.0LeuXaa: 0.0 ± 0.0
Met
1.378MetAla: 1.378 ± 0.309
0.243MetCys: 0.243 ± 0.131
1.702MetAsp: 1.702 ± 0.415
1.621MetGlu: 1.621 ± 0.394
1.702MetPhe: 1.702 ± 0.462
2.27MetGly: 2.27 ± 0.547
0.324MetHis: 0.324 ± 0.148
2.351MetIle: 2.351 ± 0.483
2.675MetLys: 2.675 ± 0.423
1.54MetLeu: 1.54 ± 0.32
0.811MetMet: 0.811 ± 0.309
1.702MetAsn: 1.702 ± 0.44
0.648MetPro: 0.648 ± 0.25
1.216MetGln: 1.216 ± 0.275
1.783MetArg: 1.783 ± 0.387
1.783MetSer: 1.783 ± 0.42
1.216MetThr: 1.216 ± 0.3
1.945MetVal: 1.945 ± 0.424
0.648MetTrp: 0.648 ± 0.268
1.297MetTyr: 1.297 ± 0.346
0.0MetXaa: 0.0 ± 0.0
Asn
4.296AsnAla: 4.296 ± 0.82
0.405AsnCys: 0.405 ± 0.175
2.513AsnAsp: 2.513 ± 0.516
5.431AsnGlu: 5.431 ± 0.598
1.945AsnPhe: 1.945 ± 0.412
6.16AsnGly: 6.16 ± 0.577
1.621AsnHis: 1.621 ± 0.521
4.62AsnIle: 4.62 ± 0.662
7.052AsnLys: 7.052 ± 0.92
4.782AsnLeu: 4.782 ± 0.53
2.513AsnMet: 2.513 ± 0.443
3.81AsnAsn: 3.81 ± 0.626
1.702AsnPro: 1.702 ± 0.327
1.945AsnGln: 1.945 ± 0.363
1.297AsnArg: 1.297 ± 0.317
2.999AsnSer: 2.999 ± 0.561
4.701AsnThr: 4.701 ± 0.566
3.567AsnVal: 3.567 ± 0.529
0.811AsnTrp: 0.811 ± 0.27
2.513AsnTyr: 2.513 ± 0.606
0.0AsnXaa: 0.0 ± 0.0
Pro
1.459ProAla: 1.459 ± 0.347
0.081ProCys: 0.081 ± 0.094
2.026ProAsp: 2.026 ± 0.435
3.242ProGlu: 3.242 ± 0.61
1.216ProPhe: 1.216 ± 0.318
0.162ProGly: 0.162 ± 0.116
0.243ProHis: 0.243 ± 0.132
1.54ProIle: 1.54 ± 0.287
2.513ProLys: 2.513 ± 0.618
2.837ProLeu: 2.837 ± 0.557
0.892ProMet: 0.892 ± 0.246
2.27ProAsn: 2.27 ± 0.484
0.324ProPro: 0.324 ± 0.149
1.378ProGln: 1.378 ± 0.272
0.648ProArg: 0.648 ± 0.193
1.702ProSer: 1.702 ± 0.339
2.27ProThr: 2.27 ± 0.421
1.945ProVal: 1.945 ± 0.33
0.162ProTrp: 0.162 ± 0.106
1.702ProTyr: 1.702 ± 0.496
0.0ProXaa: 0.0 ± 0.0
Gln
2.27GlnAla: 2.27 ± 0.332
0.324GlnCys: 0.324 ± 0.18
2.026GlnAsp: 2.026 ± 0.357
2.189GlnGlu: 2.189 ± 0.341
1.864GlnPhe: 1.864 ± 0.495
1.945GlnGly: 1.945 ± 0.351
0.567GlnHis: 0.567 ± 0.197
2.756GlnIle: 2.756 ± 0.403
2.675GlnLys: 2.675 ± 0.456
3.648GlnLeu: 3.648 ± 0.523
1.459GlnMet: 1.459 ± 0.341
1.459GlnAsn: 1.459 ± 0.337
1.297GlnPro: 1.297 ± 0.417
2.189GlnGln: 2.189 ± 0.421
1.783GlnArg: 1.783 ± 0.385
1.783GlnSer: 1.783 ± 0.435
1.945GlnThr: 1.945 ± 0.295
2.999GlnVal: 2.999 ± 0.51
0.567GlnTrp: 0.567 ± 0.387
2.27GlnTyr: 2.27 ± 0.467
0.0GlnXaa: 0.0 ± 0.0
Arg
1.864ArgAla: 1.864 ± 0.383
0.567ArgCys: 0.567 ± 0.267
2.918ArgAsp: 2.918 ± 0.546
2.107ArgGlu: 2.107 ± 0.411
1.702ArgPhe: 1.702 ± 0.249
2.351ArgGly: 2.351 ± 0.466
0.892ArgHis: 0.892 ± 0.27
3.323ArgIle: 3.323 ± 0.536
2.675ArgLys: 2.675 ± 0.501
3.08ArgLeu: 3.08 ± 0.58
1.135ArgMet: 1.135 ± 0.265
2.351ArgAsn: 2.351 ± 0.407
1.135ArgPro: 1.135 ± 0.325
1.54ArgGln: 1.54 ± 0.331
1.459ArgArg: 1.459 ± 0.287
1.702ArgSer: 1.702 ± 0.417
1.297ArgThr: 1.297 ± 0.279
2.837ArgVal: 2.837 ± 0.522
0.486ArgTrp: 0.486 ± 0.216
1.459ArgTyr: 1.459 ± 0.346
0.0ArgXaa: 0.0 ± 0.0
Ser
3.648SerAla: 3.648 ± 0.727
0.0SerCys: 0.0 ± 0.0
3.729SerAsp: 3.729 ± 0.537
3.161SerGlu: 3.161 ± 0.551
2.432SerPhe: 2.432 ± 0.364
4.62SerGly: 4.62 ± 0.678
1.378SerHis: 1.378 ± 0.344
3.891SerIle: 3.891 ± 0.535
4.458SerLys: 4.458 ± 0.595
3.404SerLeu: 3.404 ± 0.603
1.297SerMet: 1.297 ± 0.278
3.729SerAsn: 3.729 ± 0.686
1.135SerPro: 1.135 ± 0.256
2.351SerGln: 2.351 ± 0.497
1.459SerArg: 1.459 ± 0.424
3.323SerSer: 3.323 ± 0.671
4.134SerThr: 4.134 ± 0.794
2.675SerVal: 2.675 ± 0.358
0.811SerTrp: 0.811 ± 0.259
2.756SerTyr: 2.756 ± 0.611
0.0SerXaa: 0.0 ± 0.0
Thr
4.053ThrAla: 4.053 ± 0.559
0.243ThrCys: 0.243 ± 0.14
3.242ThrAsp: 3.242 ± 0.443
4.377ThrGlu: 4.377 ± 0.531
2.107ThrPhe: 2.107 ± 0.493
5.026ThrGly: 5.026 ± 0.706
1.378ThrHis: 1.378 ± 0.287
4.134ThrIle: 4.134 ± 0.556
6.566ThrLys: 6.566 ± 0.577
5.026ThrLeu: 5.026 ± 0.765
1.378ThrMet: 1.378 ± 0.373
3.08ThrAsn: 3.08 ± 0.577
2.351ThrPro: 2.351 ± 0.458
2.026ThrGln: 2.026 ± 0.476
2.107ThrArg: 2.107 ± 0.368
2.594ThrSer: 2.594 ± 0.436
4.053ThrThr: 4.053 ± 0.656
4.539ThrVal: 4.539 ± 0.591
0.567ThrTrp: 0.567 ± 0.171
2.513ThrTyr: 2.513 ± 0.599
0.0ThrXaa: 0.0 ± 0.0
Val
5.026ValAla: 5.026 ± 0.652
0.486ValCys: 0.486 ± 0.21
4.944ValAsp: 4.944 ± 0.656
4.62ValGlu: 4.62 ± 0.632
3.08ValPhe: 3.08 ± 0.459
4.458ValGly: 4.458 ± 0.771
0.811ValHis: 0.811 ± 0.329
3.972ValIle: 3.972 ± 0.504
5.836ValLys: 5.836 ± 0.801
5.269ValLeu: 5.269 ± 0.784
1.297ValMet: 1.297 ± 0.277
4.134ValAsn: 4.134 ± 0.597
1.864ValPro: 1.864 ± 0.34
2.026ValGln: 2.026 ± 0.458
2.27ValArg: 2.27 ± 0.42
5.107ValSer: 5.107 ± 0.526
3.485ValThr: 3.485 ± 0.62
5.026ValVal: 5.026 ± 0.573
0.73ValTrp: 0.73 ± 0.27
2.513ValTyr: 2.513 ± 0.487
0.0ValXaa: 0.0 ± 0.0
Trp
0.567TrpAla: 0.567 ± 0.224
0.243TrpCys: 0.243 ± 0.158
0.648TrpAsp: 0.648 ± 0.295
1.459TrpGlu: 1.459 ± 0.321
0.73TrpPhe: 0.73 ± 0.184
1.216TrpGly: 1.216 ± 0.275
0.081TrpHis: 0.081 ± 0.076
0.73TrpIle: 0.73 ± 0.257
0.973TrpLys: 0.973 ± 0.22
1.216TrpLeu: 1.216 ± 0.333
0.162TrpMet: 0.162 ± 0.116
0.811TrpAsn: 0.811 ± 0.285
0.0TrpPro: 0.0 ± 0.0
0.405TrpGln: 0.405 ± 0.161
0.73TrpArg: 0.73 ± 0.264
0.567TrpSer: 0.567 ± 0.172
0.567TrpThr: 0.567 ± 0.213
1.378TrpVal: 1.378 ± 0.221
0.324TrpTrp: 0.324 ± 0.134
0.324TrpTyr: 0.324 ± 0.173
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.026TyrAla: 2.026 ± 0.418
0.73TyrCys: 0.73 ± 0.233
2.756TyrAsp: 2.756 ± 0.641
3.404TyrGlu: 3.404 ± 0.593
1.54TyrPhe: 1.54 ± 0.333
1.621TyrGly: 1.621 ± 0.366
0.567TyrHis: 0.567 ± 0.201
3.567TyrIle: 3.567 ± 0.695
4.053TyrLys: 4.053 ± 0.626
3.485TyrLeu: 3.485 ± 0.674
1.135TyrMet: 1.135 ± 0.322
3.242TyrAsn: 3.242 ± 0.531
1.216TyrPro: 1.216 ± 0.332
1.378TyrGln: 1.378 ± 0.347
1.297TyrArg: 1.297 ± 0.362
2.351TyrSer: 2.351 ± 0.594
2.594TyrThr: 2.594 ± 0.6
2.107TyrVal: 2.107 ± 0.557
0.243TyrTrp: 0.243 ± 0.15
1.297TyrTyr: 1.297 ± 0.356
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 64 proteins (12338 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski