Amino acid dipepetide frequency for Enterococcus phage EF-P10

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.93AlaAla: 5.93 ± 0.823
0.734AlaCys: 0.734 ± 0.231
4.035AlaAsp: 4.035 ± 0.552
5.869AlaGlu: 5.869 ± 0.608
3.301AlaPhe: 3.301 ± 0.49
4.646AlaGly: 4.646 ± 0.561
0.856AlaHis: 0.856 ± 0.241
5.563AlaIle: 5.563 ± 0.522
5.013AlaLys: 5.013 ± 0.653
5.93AlaLeu: 5.93 ± 0.615
2.507AlaMet: 2.507 ± 0.427
3.485AlaAsn: 3.485 ± 0.451
1.895AlaPro: 1.895 ± 0.302
2.201AlaGln: 2.201 ± 0.37
2.996AlaArg: 2.996 ± 0.367
4.341AlaSer: 4.341 ± 0.487
3.729AlaThr: 3.729 ± 0.456
4.463AlaVal: 4.463 ± 0.466
0.672AlaTrp: 0.672 ± 0.173
3.301AlaTyr: 3.301 ± 0.459
0.0AlaXaa: 0.0 ± 0.0
Cys
0.367CysAla: 0.367 ± 0.161
0.367CysCys: 0.367 ± 0.19
0.489CysAsp: 0.489 ± 0.158
0.55CysGlu: 0.55 ± 0.176
0.489CysPhe: 0.489 ± 0.19
0.672CysGly: 0.672 ± 0.244
0.061CysHis: 0.061 ± 0.059
0.306CysIle: 0.306 ± 0.129
0.489CysLys: 0.489 ± 0.231
0.917CysLeu: 0.917 ± 0.23
0.489CysMet: 0.489 ± 0.156
0.183CysAsn: 0.183 ± 0.099
0.0CysPro: 0.0 ± 0.0
0.428CysGln: 0.428 ± 0.156
0.245CysArg: 0.245 ± 0.111
0.672CysSer: 0.672 ± 0.22
0.367CysThr: 0.367 ± 0.16
0.183CysVal: 0.183 ± 0.105
0.061CysTrp: 0.061 ± 0.075
0.122CysTyr: 0.122 ± 0.082
0.0CysXaa: 0.0 ± 0.0
Asp
3.546AspAla: 3.546 ± 0.505
0.734AspCys: 0.734 ± 0.238
2.873AspAsp: 2.873 ± 0.641
4.707AspGlu: 4.707 ± 0.494
3.362AspPhe: 3.362 ± 0.463
3.913AspGly: 3.913 ± 0.434
0.306AspHis: 0.306 ± 0.152
4.035AspIle: 4.035 ± 0.475
4.341AspLys: 4.341 ± 0.521
5.502AspLeu: 5.502 ± 0.559
2.14AspMet: 2.14 ± 0.427
3.057AspAsn: 3.057 ± 0.41
1.284AspPro: 1.284 ± 0.233
1.162AspGln: 1.162 ± 0.243
2.629AspArg: 2.629 ± 0.456
3.424AspSer: 3.424 ± 0.359
3.913AspThr: 3.913 ± 0.637
3.852AspVal: 3.852 ± 0.488
0.978AspTrp: 0.978 ± 0.28
3.118AspTyr: 3.118 ± 0.354
0.0AspXaa: 0.0 ± 0.0
Glu
8.498GluAla: 8.498 ± 0.837
0.245GluCys: 0.245 ± 0.111
5.625GluAsp: 5.625 ± 0.614
10.821GluGlu: 10.821 ± 1.149
3.24GluPhe: 3.24 ± 0.494
5.625GluGly: 5.625 ± 0.626
0.672GluHis: 0.672 ± 0.227
4.585GluIle: 4.585 ± 0.598
4.891GluLys: 4.891 ± 0.582
8.07GluLeu: 8.07 ± 0.768
2.629GluMet: 2.629 ± 0.362
3.913GluAsn: 3.913 ± 0.428
2.079GluPro: 2.079 ± 0.385
2.935GluGln: 2.935 ± 0.366
4.585GluArg: 4.585 ± 0.502
5.013GluSer: 5.013 ± 0.61
4.035GluThr: 4.035 ± 0.439
6.419GluVal: 6.419 ± 0.693
2.14GluTrp: 2.14 ± 0.332
3.301GluTyr: 3.301 ± 0.438
0.0GluXaa: 0.0 ± 0.0
Phe
2.384PheAla: 2.384 ± 0.333
0.306PheCys: 0.306 ± 0.126
2.445PheAsp: 2.445 ± 0.468
3.485PheGlu: 3.485 ± 0.402
1.528PhePhe: 1.528 ± 0.323
3.179PheGly: 3.179 ± 0.454
0.734PheHis: 0.734 ± 0.219
2.873PheIle: 2.873 ± 0.416
3.179PheLys: 3.179 ± 0.434
4.096PheLeu: 4.096 ± 0.523
1.039PheMet: 1.039 ± 0.214
2.751PheAsn: 2.751 ± 0.427
1.467PhePro: 1.467 ± 0.285
1.223PheGln: 1.223 ± 0.268
1.345PheArg: 1.345 ± 0.28
2.568PheSer: 2.568 ± 0.419
2.629PheThr: 2.629 ± 0.343
2.017PheVal: 2.017 ± 0.409
0.611PheTrp: 0.611 ± 0.194
2.079PheTyr: 2.079 ± 0.331
0.0PheXaa: 0.0 ± 0.0
Gly
3.546GlyAla: 3.546 ± 0.533
0.428GlyCys: 0.428 ± 0.207
3.424GlyAsp: 3.424 ± 0.42
3.974GlyGlu: 3.974 ± 0.511
3.79GlyPhe: 3.79 ± 0.556
3.607GlyGly: 3.607 ± 0.755
1.651GlyHis: 1.651 ± 0.291
4.891GlyIle: 4.891 ± 0.433
5.747GlyLys: 5.747 ± 0.672
4.769GlyLeu: 4.769 ± 0.499
1.59GlyMet: 1.59 ± 0.302
3.118GlyAsn: 3.118 ± 0.459
0.061GlyPro: 0.061 ± 0.062
1.834GlyGln: 1.834 ± 0.398
3.24GlyArg: 3.24 ± 0.525
3.424GlySer: 3.424 ± 0.413
4.463GlyThr: 4.463 ± 0.506
4.096GlyVal: 4.096 ± 0.604
1.162GlyTrp: 1.162 ± 0.276
2.751GlyTyr: 2.751 ± 0.433
0.0GlyXaa: 0.0 ± 0.0
His
0.978HisAla: 0.978 ± 0.254
0.122HisCys: 0.122 ± 0.086
0.55HisAsp: 0.55 ± 0.168
1.284HisGlu: 1.284 ± 0.256
0.917HisPhe: 0.917 ± 0.251
0.978HisGly: 0.978 ± 0.255
0.306HisHis: 0.306 ± 0.139
1.162HisIle: 1.162 ± 0.247
1.223HisLys: 1.223 ± 0.318
1.406HisLeu: 1.406 ± 0.256
0.367HisMet: 0.367 ± 0.127
0.978HisAsn: 0.978 ± 0.198
0.795HisPro: 0.795 ± 0.225
0.55HisGln: 0.55 ± 0.211
0.856HisArg: 0.856 ± 0.211
0.795HisSer: 0.795 ± 0.229
0.489HisThr: 0.489 ± 0.181
0.978HisVal: 0.978 ± 0.188
0.122HisTrp: 0.122 ± 0.081
0.795HisTyr: 0.795 ± 0.219
0.0HisXaa: 0.0 ± 0.0
Ile
4.952IleAla: 4.952 ± 0.519
0.428IleCys: 0.428 ± 0.14
3.913IleAsp: 3.913 ± 0.467
5.93IleGlu: 5.93 ± 0.666
2.079IlePhe: 2.079 ± 0.334
3.301IleGly: 3.301 ± 0.422
1.406IleHis: 1.406 ± 0.284
3.668IleIle: 3.668 ± 0.479
5.074IleLys: 5.074 ± 0.547
4.035IleLeu: 4.035 ± 0.636
1.712IleMet: 1.712 ± 0.29
4.402IleAsn: 4.402 ± 0.565
2.079IlePro: 2.079 ± 0.359
2.445IleGln: 2.445 ± 0.392
3.057IleArg: 3.057 ± 0.461
3.24IleSer: 3.24 ± 0.467
3.729IleThr: 3.729 ± 0.558
3.913IleVal: 3.913 ± 0.522
0.489IleTrp: 0.489 ± 0.19
2.568IleTyr: 2.568 ± 0.437
0.0IleXaa: 0.0 ± 0.0
Lys
7.642LysAla: 7.642 ± 0.693
0.245LysCys: 0.245 ± 0.153
5.808LysAsp: 5.808 ± 0.547
7.031LysGlu: 7.031 ± 0.663
2.751LysPhe: 2.751 ± 0.384
4.524LysGly: 4.524 ± 0.508
1.406LysHis: 1.406 ± 0.314
3.301LysIle: 3.301 ± 0.428
5.747LysLys: 5.747 ± 0.743
6.786LysLeu: 6.786 ± 0.593
2.323LysMet: 2.323 ± 0.492
2.751LysAsn: 2.751 ± 0.374
3.118LysPro: 3.118 ± 0.391
2.751LysGln: 2.751 ± 0.468
3.424LysArg: 3.424 ± 0.61
4.218LysSer: 4.218 ± 0.477
4.096LysThr: 4.096 ± 0.518
6.052LysVal: 6.052 ± 0.598
0.917LysTrp: 0.917 ± 0.218
2.568LysTyr: 2.568 ± 0.379
0.0LysXaa: 0.0 ± 0.0
Leu
5.441LeuAla: 5.441 ± 0.549
0.611LeuCys: 0.611 ± 0.189
4.707LeuAsp: 4.707 ± 0.505
9.354LeuGlu: 9.354 ± 0.774
2.935LeuPhe: 2.935 ± 0.403
5.502LeuGly: 5.502 ± 0.641
1.1LeuHis: 1.1 ± 0.22
5.93LeuIle: 5.93 ± 0.745
6.847LeuLys: 6.847 ± 0.701
7.092LeuLeu: 7.092 ± 0.873
2.507LeuMet: 2.507 ± 0.4
4.402LeuAsn: 4.402 ± 0.475
2.812LeuPro: 2.812 ± 0.372
3.607LeuGln: 3.607 ± 0.465
3.424LeuArg: 3.424 ± 0.437
5.625LeuSer: 5.625 ± 0.482
6.603LeuThr: 6.603 ± 0.569
5.625LeuVal: 5.625 ± 0.704
0.672LeuTrp: 0.672 ± 0.167
2.69LeuTyr: 2.69 ± 0.427
0.0LeuXaa: 0.0 ± 0.0
Met
2.262MetAla: 2.262 ± 0.394
0.122MetCys: 0.122 ± 0.094
1.773MetAsp: 1.773 ± 0.353
3.179MetGlu: 3.179 ± 0.388
0.917MetPhe: 0.917 ± 0.223
1.039MetGly: 1.039 ± 0.262
0.245MetHis: 0.245 ± 0.091
1.039MetIle: 1.039 ± 0.248
2.507MetLys: 2.507 ± 0.356
2.751MetLeu: 2.751 ± 0.442
0.489MetMet: 0.489 ± 0.156
1.956MetAsn: 1.956 ± 0.332
0.978MetPro: 0.978 ± 0.249
0.672MetGln: 0.672 ± 0.206
1.528MetArg: 1.528 ± 0.269
1.956MetSer: 1.956 ± 0.312
1.345MetThr: 1.345 ± 0.282
1.834MetVal: 1.834 ± 0.266
0.122MetTrp: 0.122 ± 0.099
0.978MetTyr: 0.978 ± 0.235
0.0MetXaa: 0.0 ± 0.0
Asn
2.812AsnAla: 2.812 ± 0.547
0.306AsnCys: 0.306 ± 0.124
2.69AsnAsp: 2.69 ± 0.389
3.913AsnGlu: 3.913 ± 0.409
2.14AsnPhe: 2.14 ± 0.328
3.852AsnGly: 3.852 ± 0.48
0.978AsnHis: 0.978 ± 0.231
3.546AsnIle: 3.546 ± 0.409
4.707AsnLys: 4.707 ± 0.636
4.341AsnLeu: 4.341 ± 0.47
1.162AsnMet: 1.162 ± 0.317
2.323AsnAsn: 2.323 ± 0.521
2.629AsnPro: 2.629 ± 0.521
1.895AsnGln: 1.895 ± 0.315
2.201AsnArg: 2.201 ± 0.324
3.301AsnSer: 3.301 ± 0.427
2.751AsnThr: 2.751 ± 0.397
3.057AsnVal: 3.057 ± 0.456
0.489AsnTrp: 0.489 ± 0.195
2.017AsnTyr: 2.017 ± 0.316
0.0AsnXaa: 0.0 ± 0.0
Pro
1.712ProAla: 1.712 ± 0.412
0.183ProCys: 0.183 ± 0.095
1.834ProAsp: 1.834 ± 0.297
3.179ProGlu: 3.179 ± 0.478
1.162ProPhe: 1.162 ± 0.332
0.183ProGly: 0.183 ± 0.106
0.489ProHis: 0.489 ± 0.181
2.201ProIle: 2.201 ± 0.351
2.873ProLys: 2.873 ± 0.418
2.201ProLeu: 2.201 ± 0.37
0.672ProMet: 0.672 ± 0.192
1.712ProAsn: 1.712 ± 0.28
0.856ProPro: 0.856 ± 0.225
0.795ProGln: 0.795 ± 0.211
1.467ProArg: 1.467 ± 0.338
2.812ProSer: 2.812 ± 0.407
1.528ProThr: 1.528 ± 0.281
1.956ProVal: 1.956 ± 0.319
0.489ProTrp: 0.489 ± 0.164
1.712ProTyr: 1.712 ± 0.33
0.0ProXaa: 0.0 ± 0.0
Gln
2.69GlnAla: 2.69 ± 0.357
0.183GlnCys: 0.183 ± 0.1
1.59GlnAsp: 1.59 ± 0.274
2.935GlnGlu: 2.935 ± 0.339
1.1GlnPhe: 1.1 ± 0.275
2.323GlnGly: 2.323 ± 0.436
0.55GlnHis: 0.55 ± 0.155
1.345GlnIle: 1.345 ± 0.388
2.079GlnLys: 2.079 ± 0.394
3.485GlnLeu: 3.485 ± 0.396
1.039GlnMet: 1.039 ± 0.267
1.039GlnAsn: 1.039 ± 0.237
1.162GlnPro: 1.162 ± 0.235
1.651GlnGln: 1.651 ± 0.457
1.895GlnArg: 1.895 ± 0.301
2.079GlnSer: 2.079 ± 0.452
2.751GlnThr: 2.751 ± 0.37
2.568GlnVal: 2.568 ± 0.406
0.367GlnTrp: 0.367 ± 0.129
1.467GlnTyr: 1.467 ± 0.326
0.0GlnXaa: 0.0 ± 0.0
Arg
2.69ArgAla: 2.69 ± 0.449
0.367ArgCys: 0.367 ± 0.146
2.262ArgAsp: 2.262 ± 0.303
3.424ArgGlu: 3.424 ± 0.384
1.956ArgPhe: 1.956 ± 0.359
2.751ArgGly: 2.751 ± 0.45
0.856ArgHis: 0.856 ± 0.219
3.485ArgIle: 3.485 ± 0.527
3.546ArgLys: 3.546 ± 0.457
4.707ArgLeu: 4.707 ± 0.519
1.1ArgMet: 1.1 ± 0.316
2.079ArgAsn: 2.079 ± 0.393
1.59ArgPro: 1.59 ± 0.328
1.956ArgGln: 1.956 ± 0.409
2.384ArgArg: 2.384 ± 0.408
2.262ArgSer: 2.262 ± 0.438
1.528ArgThr: 1.528 ± 0.28
2.629ArgVal: 2.629 ± 0.431
0.55ArgTrp: 0.55 ± 0.165
2.445ArgTyr: 2.445 ± 0.396
0.0ArgXaa: 0.0 ± 0.0
Ser
2.935SerAla: 2.935 ± 0.347
0.672SerCys: 0.672 ± 0.171
3.607SerAsp: 3.607 ± 0.493
4.341SerGlu: 4.341 ± 0.496
3.24SerPhe: 3.24 ± 0.459
4.157SerGly: 4.157 ± 0.453
0.856SerHis: 0.856 ± 0.227
4.341SerIle: 4.341 ± 0.41
5.319SerLys: 5.319 ± 0.482
4.28SerLeu: 4.28 ± 0.49
1.712SerMet: 1.712 ± 0.264
3.24SerAsn: 3.24 ± 0.43
1.834SerPro: 1.834 ± 0.294
1.712SerGln: 1.712 ± 0.284
2.568SerArg: 2.568 ± 0.433
3.24SerSer: 3.24 ± 0.448
3.485SerThr: 3.485 ± 0.473
4.341SerVal: 4.341 ± 0.597
0.978SerTrp: 0.978 ± 0.206
2.751SerTyr: 2.751 ± 0.408
0.0SerXaa: 0.0 ± 0.0
Thr
4.524ThrAla: 4.524 ± 0.664
0.428ThrCys: 0.428 ± 0.151
3.301ThrAsp: 3.301 ± 0.443
3.729ThrGlu: 3.729 ± 0.424
2.812ThrPhe: 2.812 ± 0.407
3.79ThrGly: 3.79 ± 0.507
1.1ThrHis: 1.1 ± 0.212
4.096ThrIle: 4.096 ± 0.432
3.607ThrLys: 3.607 ± 0.34
6.603ThrLeu: 6.603 ± 0.617
1.1ThrMet: 1.1 ± 0.253
2.873ThrAsn: 2.873 ± 0.462
2.445ThrPro: 2.445 ± 0.437
1.712ThrGln: 1.712 ± 0.368
1.651ThrArg: 1.651 ± 0.274
3.852ThrSer: 3.852 ± 0.561
2.507ThrThr: 2.507 ± 0.498
4.035ThrVal: 4.035 ± 0.495
0.734ThrTrp: 0.734 ± 0.195
2.751ThrTyr: 2.751 ± 0.36
0.0ThrXaa: 0.0 ± 0.0
Val
5.074ValAla: 5.074 ± 0.535
0.489ValCys: 0.489 ± 0.192
4.218ValAsp: 4.218 ± 0.487
5.625ValGlu: 5.625 ± 0.682
2.69ValPhe: 2.69 ± 0.399
4.035ValGly: 4.035 ± 0.61
0.856ValHis: 0.856 ± 0.201
3.668ValIle: 3.668 ± 0.524
5.869ValLys: 5.869 ± 0.646
5.319ValLeu: 5.319 ± 0.593
1.528ValMet: 1.528 ± 0.264
3.668ValAsn: 3.668 ± 0.406
1.528ValPro: 1.528 ± 0.268
2.323ValGln: 2.323 ± 0.376
2.751ValArg: 2.751 ± 0.415
3.24ValSer: 3.24 ± 0.444
4.402ValThr: 4.402 ± 0.514
4.157ValVal: 4.157 ± 0.579
1.162ValTrp: 1.162 ± 0.292
2.568ValTyr: 2.568 ± 0.375
0.0ValXaa: 0.0 ± 0.0
Trp
0.611TrpAla: 0.611 ± 0.176
0.183TrpCys: 0.183 ± 0.1
0.734TrpAsp: 0.734 ± 0.213
1.528TrpGlu: 1.528 ± 0.278
0.489TrpPhe: 0.489 ± 0.178
0.917TrpGly: 0.917 ± 0.265
0.245TrpHis: 0.245 ± 0.12
0.978TrpIle: 0.978 ± 0.269
1.1TrpLys: 1.1 ± 0.208
1.162TrpLeu: 1.162 ± 0.321
0.061TrpMet: 0.061 ± 0.052
1.039TrpAsn: 1.039 ± 0.208
0.245TrpPro: 0.245 ± 0.109
0.672TrpGln: 0.672 ± 0.197
0.183TrpArg: 0.183 ± 0.109
1.1TrpSer: 1.1 ± 0.247
0.734TrpThr: 0.734 ± 0.236
0.55TrpVal: 0.55 ± 0.171
0.245TrpTrp: 0.245 ± 0.106
0.734TrpTyr: 0.734 ± 0.202
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.873TyrAla: 2.873 ± 0.463
0.306TyrCys: 0.306 ± 0.123
2.996TyrAsp: 2.996 ± 0.411
4.463TyrGlu: 4.463 ± 0.62
1.039TyrPhe: 1.039 ± 0.241
2.568TyrGly: 2.568 ± 0.39
0.978TyrHis: 0.978 ± 0.247
1.406TyrIle: 1.406 ± 0.289
3.301TyrLys: 3.301 ± 0.419
4.035TyrLeu: 4.035 ± 0.452
1.467TyrMet: 1.467 ± 0.32
2.14TyrAsn: 2.14 ± 0.374
1.162TyrPro: 1.162 ± 0.22
1.773TyrGln: 1.773 ± 0.304
2.14TyrArg: 2.14 ± 0.354
2.445TyrSer: 2.445 ± 0.354
2.568TyrThr: 2.568 ± 0.357
2.445TyrVal: 2.445 ± 0.29
0.55TyrTrp: 0.55 ± 0.214
2.079TyrTyr: 2.079 ± 0.348
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 127 proteins (16358 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski