Amino acid dipeptide frequency for Enterococcus phage phiSHEF4

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
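As an illustration of how such frequencies and the red/blue marking could be derived, here is a minimal Python sketch. It assumes that overlapping pairs are counted within each protein, that frequencies are normalised by the total number of counted dipeptides, and that the uniform 1/400 expectation (2.5‰) serves as the threshold. The function names and toy sequences are hypothetical, and the database's own pipeline may differ in any of these points.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # the 20 standard amino acids
EXPECTED_PERMILLE = 1000.0 / 400         # uniform expectation: 2.5 per mille


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Assumption: any non-standard or unknown residue is mapped to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs inside one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def representation(freq_permille):
    """Classify a per mille frequency against the uniform expectation."""
    if freq_permille > EXPECTED_PERMILLE:
        return "over"
    if freq_permille < EXPECTED_PERMILLE:
        return "under"
    return "as expected"


toy = ["MKKLE", "AKKE"]  # hypothetical sequences, not phiSHEF4 proteins
freqs = dipeptide_permille(toy)
print(round(freqs["KK"], 1), representation(freqs["KK"]))
```

Under these assumptions, a table value such as LysGlu (8.751‰) would be classified as "over" and AlaAla (0.405‰) as "under".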

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions. On this scale, the expected uniform frequency of 0.25% corresponds to 2.5‰.
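As a small worked example of this unit conversion, the Python snippet below turns a per mille value into a plain fraction and into a percentage, using the LysGlu entry from the table (8.751 ± 1.215). The helper names are hypothetical and only serve the illustration.

```python
def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10.0


# LysGlu entry from the table: frequency and its error, both in per mille.
lysglu, lysglu_err = 8.751, 1.215

print(permille_to_fraction(lysglu), permille_to_fraction(lysglu_err))
print(permille_to_percent(lysglu), permille_to_percent(lysglu_err))
```

The error is in the same unit as the value, so it is converted with the same factor.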

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.405 ± 0.178
AlaCys: 0.243 ± 0.125
AlaAsp: 3.241 ± 0.545
AlaGlu: 3.079 ± 0.493
AlaPhe: 2.593 ± 0.578
AlaGly: 2.836 ± 0.474
AlaHis: 1.296 ± 0.309
AlaIle: 4.862 ± 0.801
AlaLys: 5.996 ± 0.863
AlaLeu: 5.429 ± 0.703
AlaMet: 2.674 ± 0.469
AlaAsn: 3.646 ± 0.538
AlaPro: 1.864 ± 0.337
AlaGln: 1.54 ± 0.33
AlaArg: 1.621 ± 0.313
AlaSer: 3.322 ± 0.415
AlaThr: 4.376 ± 0.575
AlaVal: 4.376 ± 0.628
AlaTrp: 0.405 ± 0.179
AlaTyr: 2.107 ± 0.357
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.243 ± 0.12
CysCys: 0.0 ± 0.0
CysAsp: 0.405 ± 0.199
CysGlu: 0.648 ± 0.237
CysPhe: 0.081 ± 0.087
CysGly: 0.405 ± 0.214
CysHis: 0.243 ± 0.131
CysIle: 0.567 ± 0.224
CysLys: 0.729 ± 0.269
CysLeu: 0.486 ± 0.179
CysMet: 0.162 ± 0.123
CysAsn: 0.567 ± 0.2
CysPro: 0.081 ± 0.087
CysGln: 0.081 ± 0.075
CysArg: 0.243 ± 0.135
CysSer: 0.324 ± 0.152
CysThr: 0.486 ± 0.184
CysVal: 0.162 ± 0.1
CysTrp: 0.162 ± 0.109
CysTyr: 0.162 ± 0.131
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.998 ± 0.449
AspCys: 0.486 ± 0.195
AspAsp: 2.188 ± 0.518
AspGlu: 4.376 ± 0.627
AspPhe: 2.836 ± 0.476
AspGly: 4.862 ± 0.535
AspHis: 0.567 ± 0.231
AspIle: 4.781 ± 0.848
AspLys: 5.51 ± 0.674
AspLeu: 5.348 ± 0.584
AspMet: 2.026 ± 0.384
AspAsn: 5.024 ± 0.669
AspPro: 2.188 ± 0.457
AspGln: 0.891 ± 0.209
AspArg: 1.864 ± 0.355
AspSer: 2.674 ± 0.394
AspThr: 4.295 ± 0.767
AspVal: 4.133 ± 0.63
AspTrp: 0.729 ± 0.257
AspTyr: 3.16 ± 0.507
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.943 ± 0.603
GluCys: 0.486 ± 0.212
GluAsp: 4.133 ± 0.757
GluGlu: 6.563 ± 1.626
GluPhe: 3.403 ± 0.651
GluGly: 4.781 ± 0.544
GluHis: 1.621 ± 0.346
GluIle: 3.403 ± 0.57
GluLys: 6.888 ± 0.824
GluLeu: 8.022 ± 1.0
GluMet: 2.674 ± 0.437
GluAsn: 4.214 ± 0.673
GluPro: 2.269 ± 0.644
GluGln: 3.322 ± 0.644
GluArg: 3.079 ± 0.604
GluSer: 3.971 ± 0.53
GluThr: 4.133 ± 0.431
GluVal: 7.617 ± 0.882
GluTrp: 1.702 ± 0.354
GluTyr: 3.565 ± 0.673
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.54 ± 0.274
PheCys: 0.324 ± 0.197
PheAsp: 2.917 ± 0.496
PheGlu: 3.079 ± 0.462
PhePhe: 1.053 ± 0.426
PheGly: 2.836 ± 0.665
PheHis: 0.162 ± 0.098
PheIle: 4.214 ± 0.59
PheLys: 4.457 ± 0.549
PheLeu: 2.107 ± 0.305
PheMet: 1.053 ± 0.264
PheAsn: 3.16 ± 0.528
PhePro: 0.81 ± 0.241
PheGln: 1.702 ± 0.342
PheArg: 1.459 ± 0.358
PheSer: 2.35 ± 0.457
PheThr: 3.808 ± 0.649
PheVal: 2.026 ± 0.382
PheTrp: 0.486 ± 0.172
PheTyr: 1.215 ± 0.269
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.295 ± 1.113
GlyCys: 0.243 ± 0.148
GlyAsp: 4.295 ± 0.626
GlyGlu: 3.16 ± 0.52
GlyPhe: 3.403 ± 0.396
GlyGly: 4.943 ± 1.36
GlyHis: 1.053 ± 0.291
GlyIle: 5.834 ± 0.936
GlyLys: 6.807 ± 0.703
GlyLeu: 6.563 ± 1.321
GlyMet: 1.378 ± 0.302
GlyAsn: 4.214 ± 0.636
GlyPro: 0.972 ± 0.323
GlyGln: 1.702 ± 0.361
GlyArg: 2.593 ± 0.445
GlySer: 4.052 ± 0.751
GlyThr: 4.133 ± 0.712
GlyVal: 3.971 ± 0.751
GlyTrp: 1.053 ± 0.227
GlyTyr: 2.512 ± 0.452
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.648 ± 0.204
HisCys: 0.162 ± 0.11
HisAsp: 0.81 ± 0.279
HisGlu: 1.134 ± 0.299
HisPhe: 0.81 ± 0.272
HisGly: 1.215 ± 0.331
HisHis: 0.243 ± 0.133
HisIle: 0.81 ± 0.196
HisLys: 1.378 ± 0.406
HisLeu: 1.296 ± 0.364
HisMet: 0.324 ± 0.163
HisAsn: 0.891 ± 0.24
HisPro: 0.486 ± 0.186
HisGln: 0.729 ± 0.177
HisArg: 0.972 ± 0.301
HisSer: 0.405 ± 0.2
HisThr: 1.053 ± 0.323
HisVal: 0.972 ± 0.271
HisTrp: 0.162 ± 0.127
HisTyr: 1.134 ± 0.316
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.971 ± 0.505
IleCys: 0.567 ± 0.222
IleAsp: 5.591 ± 0.612
IleGlu: 6.888 ± 1.039
IlePhe: 2.188 ± 0.515
IleGly: 4.214 ± 0.608
IleHis: 0.648 ± 0.232
IleIle: 3.971 ± 0.462
IleLys: 6.239 ± 0.821
IleLeu: 5.024 ± 0.699
IleMet: 1.945 ± 0.42
IleAsn: 4.133 ± 0.849
IlePro: 2.512 ± 0.38
IleGln: 3.322 ± 0.565
IleArg: 1.783 ± 0.402
IleSer: 4.376 ± 0.572
IleThr: 4.862 ± 0.536
IleVal: 4.376 ± 0.428
IleTrp: 0.648 ± 0.202
IleTyr: 1.702 ± 0.409
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.996 ± 0.782
LysCys: 0.891 ± 0.373
LysAsp: 5.591 ± 0.607
LysGlu: 8.751 ± 1.215
LysPhe: 2.836 ± 0.457
LysGly: 5.429 ± 1.035
LysHis: 1.053 ± 0.261
LysIle: 4.538 ± 0.642
LysLys: 5.915 ± 0.864
LysLeu: 7.374 ± 0.723
LysMet: 3.808 ± 0.457
LysAsn: 5.348 ± 0.546
LysPro: 3.808 ± 0.574
LysGln: 3.403 ± 0.524
LysArg: 4.457 ± 0.613
LysSer: 4.295 ± 0.886
LysThr: 5.429 ± 0.638
LysVal: 5.348 ± 0.506
LysTrp: 1.296 ± 0.351
LysTyr: 3.16 ± 0.413
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.052 ± 0.491
LeuCys: 0.324 ± 0.178
LeuAsp: 6.239 ± 0.63
LeuGlu: 8.265 ± 0.847
LeuPhe: 2.674 ± 0.486
LeuGly: 5.834 ± 0.993
LeuHis: 1.053 ± 0.256
LeuIle: 5.429 ± 0.75
LeuLys: 7.617 ± 0.828
LeuLeu: 6.807 ± 0.842
LeuMet: 1.864 ± 0.397
LeuAsn: 5.591 ± 0.79
LeuPro: 2.755 ± 0.488
LeuGln: 3.727 ± 0.571
LeuArg: 2.674 ± 0.566
LeuSer: 4.7 ± 0.575
LeuThr: 4.052 ± 0.568
LeuVal: 5.672 ± 0.798
LeuTrp: 1.053 ± 0.277
LeuTyr: 2.431 ± 0.449
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.459 ± 0.42
MetCys: 0.162 ± 0.095
MetAsp: 1.54 ± 0.41
MetGlu: 2.431 ± 0.563
MetPhe: 1.053 ± 0.382
MetGly: 2.107 ± 0.41
MetHis: 0.162 ± 0.091
MetIle: 2.431 ± 0.454
MetLys: 2.593 ± 0.471
MetLeu: 2.188 ± 0.436
MetMet: 0.567 ± 0.206
MetAsn: 2.431 ± 0.448
MetPro: 0.729 ± 0.227
MetGln: 1.378 ± 0.309
MetArg: 1.621 ± 0.42
MetSer: 1.702 ± 0.407
MetThr: 1.864 ± 0.395
MetVal: 1.702 ± 0.438
MetTrp: 0.648 ± 0.282
MetTyr: 1.215 ± 0.324
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.457 ± 0.769
AsnCys: 0.162 ± 0.13
AsnAsp: 3.079 ± 0.522
AsnGlu: 5.915 ± 0.695
AsnPhe: 2.269 ± 0.461
AsnGly: 6.401 ± 0.756
AsnHis: 1.053 ± 0.248
AsnIle: 4.295 ± 0.691
AsnLys: 6.239 ± 0.614
AsnLeu: 5.429 ± 0.499
AsnMet: 2.026 ± 0.416
AsnAsn: 3.808 ± 0.608
AsnPro: 1.864 ± 0.405
AsnGln: 1.621 ± 0.314
AsnArg: 1.783 ± 0.315
AsnSer: 2.998 ± 0.542
AsnThr: 3.971 ± 0.557
AsnVal: 3.403 ± 0.497
AsnTrp: 0.729 ± 0.214
AsnTyr: 2.998 ± 0.583
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.54 ± 0.366
ProCys: 0.162 ± 0.132
ProAsp: 2.269 ± 0.409
ProGlu: 3.079 ± 0.533
ProPhe: 1.54 ± 0.374
ProGly: 0.0 ± 0.0
ProHis: 0.162 ± 0.115
ProIle: 1.864 ± 0.291
ProLys: 2.269 ± 0.594
ProLeu: 2.512 ± 0.541
ProMet: 0.972 ± 0.231
ProAsn: 2.188 ± 0.583
ProPro: 0.243 ± 0.139
ProGln: 1.296 ± 0.377
ProArg: 0.729 ± 0.253
ProSer: 1.702 ± 0.324
ProThr: 2.026 ± 0.384
ProVal: 2.431 ± 0.338
ProTrp: 0.162 ± 0.101
ProTyr: 1.783 ± 0.489
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.864 ± 0.345
GlnCys: 0.405 ± 0.199
GlnAsp: 1.945 ± 0.289
GlnGlu: 2.35 ± 0.469
GlnPhe: 1.54 ± 0.347
GlnGly: 1.945 ± 0.402
GlnHis: 0.567 ± 0.206
GlnIle: 2.755 ± 0.49
GlnLys: 2.188 ± 0.432
GlnLeu: 3.484 ± 0.547
GlnMet: 0.729 ± 0.182
GlnAsn: 1.459 ± 0.225
GlnPro: 0.81 ± 0.263
GlnGln: 2.35 ± 0.432
GlnArg: 2.35 ± 0.553
GlnSer: 2.512 ± 0.495
GlnThr: 1.702 ± 0.306
GlnVal: 2.998 ± 0.446
GlnTrp: 0.324 ± 0.138
GlnTyr: 1.864 ± 0.373
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.945 ± 0.402
ArgCys: 0.324 ± 0.167
ArgAsp: 2.026 ± 0.421
ArgGlu: 2.269 ± 0.465
ArgPhe: 1.945 ± 0.257
ArgGly: 1.864 ± 0.494
ArgHis: 0.891 ± 0.266
ArgIle: 2.674 ± 0.402
ArgLys: 3.241 ± 0.55
ArgLeu: 2.35 ± 0.433
ArgMet: 1.134 ± 0.388
ArgAsn: 2.593 ± 0.435
ArgPro: 1.459 ± 0.343
ArgGln: 1.378 ± 0.399
ArgArg: 1.459 ± 0.318
ArgSer: 1.702 ± 0.363
ArgThr: 2.107 ± 0.398
ArgVal: 2.107 ± 0.407
ArgTrp: 0.405 ± 0.222
ArgTyr: 1.459 ± 0.383
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.403 ± 0.768
SerCys: 0.162 ± 0.112
SerAsp: 2.674 ± 0.428
SerGlu: 3.322 ± 0.503
SerPhe: 2.188 ± 0.316
SerGly: 5.753 ± 0.91
SerHis: 1.702 ± 0.319
SerIle: 3.565 ± 0.581
SerLys: 4.619 ± 0.707
SerLeu: 4.052 ± 0.586
SerMet: 1.702 ± 0.331
SerAsn: 3.403 ± 0.535
SerPro: 0.891 ± 0.228
SerGln: 2.269 ± 0.495
SerArg: 1.053 ± 0.267
SerSer: 3.079 ± 0.915
SerThr: 3.565 ± 0.727
SerVal: 2.836 ± 0.523
SerTrp: 0.891 ± 0.195
SerTyr: 2.674 ± 0.609
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.971 ± 0.571
ThrCys: 0.081 ± 0.071
ThrAsp: 3.16 ± 0.434
ThrGlu: 4.943 ± 0.611
ThrPhe: 2.674 ± 0.516
ThrGly: 4.214 ± 0.612
ThrHis: 1.783 ± 0.451
ThrIle: 5.186 ± 0.577
ThrLys: 6.239 ± 0.718
ThrLeu: 5.753 ± 0.948
ThrMet: 1.621 ± 0.397
ThrAsn: 3.322 ± 0.56
ThrPro: 2.188 ± 0.329
ThrGln: 2.431 ± 0.42
ThrArg: 1.945 ± 0.386
ThrSer: 2.431 ± 0.533
ThrThr: 4.781 ± 0.79
ThrVal: 3.889 ± 0.497
ThrTrp: 0.567 ± 0.205
ThrTyr: 2.35 ± 0.525
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.51 ± 0.602
ValCys: 0.405 ± 0.18
ValAsp: 4.862 ± 0.529
ValGlu: 5.429 ± 0.81
ValPhe: 3.16 ± 0.359
ValGly: 4.457 ± 0.757
ValHis: 0.648 ± 0.257
ValIle: 4.133 ± 0.547
ValLys: 5.348 ± 0.858
ValLeu: 4.133 ± 0.573
ValMet: 2.107 ± 0.286
ValAsn: 4.295 ± 0.555
ValPro: 1.945 ± 0.363
ValGln: 1.864 ± 0.337
ValArg: 2.107 ± 0.384
ValSer: 4.781 ± 0.583
ValThr: 3.403 ± 0.581
ValVal: 4.619 ± 0.563
ValTrp: 1.053 ± 0.359
ValTyr: 2.593 ± 0.533
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.486 ± 0.193
TrpCys: 0.243 ± 0.157
TrpAsp: 0.891 ± 0.322
TrpGlu: 1.702 ± 0.306
TrpPhe: 0.729 ± 0.213
TrpGly: 1.134 ± 0.301
TrpHis: 0.162 ± 0.108
TrpIle: 0.648 ± 0.259
TrpLys: 0.891 ± 0.25
TrpLeu: 1.053 ± 0.333
TrpMet: 0.162 ± 0.12
TrpAsn: 0.648 ± 0.239
TrpPro: 0.0 ± 0.0
TrpGln: 0.324 ± 0.143
TrpArg: 0.648 ± 0.246
TrpSer: 0.405 ± 0.153
TrpThr: 0.729 ± 0.196
TrpVal: 1.54 ± 0.326
TrpTrp: 0.243 ± 0.126
TrpTyr: 0.243 ± 0.159
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.188 ± 0.473
TyrCys: 0.405 ± 0.176
TyrAsp: 3.079 ± 0.599
TyrGlu: 3.403 ± 0.574
TyrPhe: 1.783 ± 0.336
TyrGly: 2.026 ± 0.375
TyrHis: 0.567 ± 0.193
TyrIle: 2.998 ± 0.614
TyrLys: 3.565 ± 0.492
TyrLeu: 3.403 ± 0.617
TyrMet: 0.972 ± 0.259
TyrAsn: 3.565 ± 0.627
TyrPro: 1.053 ± 0.251
TyrGln: 0.81 ± 0.22
TyrArg: 0.891 ± 0.3
TyrSer: 2.026 ± 0.487
TyrThr: 2.755 ± 0.699
TyrVal: 2.674 ± 0.507
TyrTrp: 0.162 ± 0.098
TyrTyr: 1.783 ± 0.458
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 63 proteins (12342 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
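To make the idea of protein-level bootstrapping concrete, the following Python sketch resamples whole proteins with replacement 100 times and reports the mean and standard deviation of one dipeptide's frequency across replicates. It is only a sketch under stated assumptions: that the reported ± value is the standard deviation over replicates, that overlapping pairs are counted within each protein, and that frequencies are normalised by the number of counted dipeptides. The function names, the seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_permille(proteins, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Each replicate draws len(proteins) whole proteins with replacement,
    so resampling happens at the protein level, not at the residue level.
    """
    rng = random.Random(seed)  # fixed seed for reproducible replicates
    per_protein = [dipeptide_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)  # Counter gives 0 if absent
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


toy = ["MKKLE", "AKKE", "GGGG"]  # hypothetical sequences, not phiSHEF4 proteins
mean_kk, sd_kk = bootstrap_permille(toy, "KK")
print(round(mean_kk, 1), round(sd_kk, 1))
```

Resampling whole proteins keeps the pairs of each protein together, so the spread reflects variation between proteins.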

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski