Amino acid dipeptide frequency for Enterococcus phage SAP6

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
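A minimal sketch of how such frequencies could be computed is given below. It is not the Proteome-pI implementation: it assumes that dipeptides are counted as overlapping adjacent pairs within each protein, that counts are normalised by the total number of dipeptides, and that sequences use one-letter codes (the table uses three-letter codes). The names `dipeptide_permille` and `representation` and the toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 standard amino acids
ALPHABET = STANDARD + "X"          # X stands for Xaa (non-standard or unknown)


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs.

    Assumption: overlapping pairs are counted within each protein only,
    so no pair spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map every letter outside the 20 standard ones to X (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    pairs = [a + b for a, b in product(ALPHABET, repeat=2)]
    if total == 0:
        return {pair: 0.0 for pair in pairs}
    return {pair: 1000.0 * counts[pair] / total for pair in pairs}


def representation(value_permille, expected_permille=1000.0 / 400):
    """Compare a frequency with the uniform expectation of 2.5 per mille."""
    if value_permille > expected_permille:
        return "over"
    if value_permille < expected_permille:
        return "under"
    return "as expected"


# Toy input (made up for illustration, not taken from the phage proteome).
toy = ["MKAAE", "MEEKL"]
freqs = dipeptide_permille(toy)
print(freqs["EE"], representation(freqs["EE"]))
```

The dictionary always contains all 441 keys, so dipeptides that never occur (such as the Xaa pairs in the table) show up with a frequency of 0.0.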

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
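As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and compares it with the uniform expectation of 1/400. The helper names are hypothetical; the GluGlu value is taken from the table.

```python
UNIFORM_EXPECTATION = 1.0 / 400  # 0.25% for each of the 400 standard dipeptides


def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def fold_vs_uniform(value_permille):
    """Ratio of the observed frequency to the uniform expectation."""
    return permille_to_fraction(value_permille) / UNIFORM_EXPECTATION


glu_glu = 10.643  # GluGlu value from the table, in per mille
print(f"fraction: {permille_to_fraction(glu_glu):.6f}")
print(f"fold over uniform expectation: {fold_vs_uniform(glu_glu):.2f}")
```

For GluGlu this gives about 1.06% of all dipeptides, roughly four times the 0.25% expected under a uniform distribution.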

For more information, see the sequence space article on Wikipedia.

Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.014 ± 1.558
AlaCys: 0.323 ± 0.174
AlaAsp: 5.079 ± 0.636
AlaGlu: 5.805 ± 0.575
AlaPhe: 2.419 ± 0.427
AlaGly: 4.354 ± 0.865
AlaHis: 0.887 ± 0.327
AlaIle: 5.805 ± 0.903
AlaLys: 4.676 ± 0.754
AlaLeu: 5.079 ± 0.561
AlaMet: 2.58 ± 0.516
AlaAsn: 3.789 ± 0.539
AlaPro: 2.177 ± 0.387
AlaGln: 2.58 ± 0.419
AlaArg: 4.273 ± 0.578
AlaSer: 5.402 ± 0.611
AlaThr: 5.079 ± 0.667
AlaVal: 5.241 ± 0.713
AlaTrp: 0.484 ± 0.18
AlaTyr: 2.903 ± 0.512
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.242 ± 0.163
CysCys: 0.0 ± 0.0
CysAsp: 0.323 ± 0.157
CysGlu: 0.887 ± 0.243
CysPhe: 0.564 ± 0.182
CysGly: 0.564 ± 0.279
CysHis: 0.081 ± 0.088
CysIle: 0.484 ± 0.241
CysLys: 0.403 ± 0.265
CysLeu: 0.806 ± 0.29
CysMet: 0.403 ± 0.174
CysAsn: 0.081 ± 0.086
CysPro: 0.161 ± 0.114
CysGln: 0.242 ± 0.126
CysArg: 0.161 ± 0.109
CysSer: 0.323 ± 0.146
CysThr: 0.161 ± 0.106
CysVal: 0.242 ± 0.158
CysTrp: 0.0 ± 0.0
CysTyr: 0.242 ± 0.154
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.354 ± 0.643
AspCys: 0.726 ± 0.275
AspAsp: 2.741 ± 0.351
AspGlu: 5.402 ± 0.597
AspPhe: 3.789 ± 0.471
AspGly: 4.031 ± 0.658
AspHis: 0.484 ± 0.172
AspIle: 4.031 ± 0.622
AspLys: 3.628 ± 0.537
AspLeu: 6.128 ± 0.625
AspMet: 1.693 ± 0.471
AspAsn: 3.386 ± 0.485
AspPro: 1.29 ± 0.31
AspGln: 1.371 ± 0.335
AspArg: 2.177 ± 0.389
AspSer: 3.628 ± 0.546
AspThr: 4.999 ± 0.89
AspVal: 4.515 ± 0.52
AspTrp: 0.645 ± 0.219
AspTyr: 3.951 ± 0.514
AspXaa: 0.0 ± 0.0
Glu
GluAla: 8.304 ± 0.911
GluCys: 0.564 ± 0.245
GluAsp: 5.321 ± 0.689
GluGlu: 10.643 ± 1.334
GluPhe: 3.789 ± 0.645
GluGly: 6.369 ± 0.769
GluHis: 0.564 ± 0.17
GluIle: 3.951 ± 0.565
GluLys: 3.709 ± 0.522
GluLeu: 7.579 ± 0.933
GluMet: 2.177 ± 0.377
GluAsn: 3.789 ± 0.498
GluPro: 1.935 ± 0.411
GluGln: 3.386 ± 0.613
GluArg: 4.596 ± 0.675
GluSer: 5.402 ± 0.68
GluThr: 4.193 ± 0.536
GluVal: 6.45 ± 0.71
GluTrp: 1.774 ± 0.396
GluTyr: 2.661 ± 0.426
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.983 ± 0.485
PheCys: 0.403 ± 0.183
PheAsp: 2.096 ± 0.381
PheGlu: 3.467 ± 0.541
PhePhe: 1.371 ± 0.301
PheGly: 2.903 ± 0.443
PheHis: 0.484 ± 0.171
PheIle: 3.306 ± 0.59
PheLys: 2.58 ± 0.457
PheLeu: 2.983 ± 0.465
PheMet: 1.613 ± 0.451
PheAsn: 2.419 ± 0.437
PhePro: 1.613 ± 0.511
PheGln: 1.048 ± 0.345
PheArg: 1.29 ± 0.312
PheSer: 2.499 ± 0.365
PheThr: 2.661 ± 0.409
PheVal: 2.661 ± 0.479
PheTrp: 0.806 ± 0.34
PheTyr: 1.532 ± 0.394
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.193 ± 0.637
GlyCys: 0.081 ± 0.086
GlyAsp: 3.951 ± 0.657
GlyGlu: 3.467 ± 0.469
GlyPhe: 3.467 ± 0.473
GlyGly: 4.193 ± 0.998
GlyHis: 1.532 ± 0.309
GlyIle: 4.838 ± 0.817
GlyLys: 5.805 ± 0.841
GlyLeu: 4.434 ± 0.547
GlyMet: 1.371 ± 0.32
GlyAsn: 3.225 ± 0.452
GlyPro: 0.0 ± 0.0
GlyGln: 2.338 ± 0.624
GlyArg: 2.661 ± 0.513
GlySer: 4.112 ± 0.599
GlyThr: 5.402 ± 0.942
GlyVal: 4.434 ± 0.883
GlyTrp: 0.806 ± 0.286
GlyTyr: 2.822 ± 0.473
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.968 ± 0.392
HisCys: 0.242 ± 0.136
HisAsp: 0.645 ± 0.239
HisGlu: 0.726 ± 0.169
HisPhe: 0.726 ± 0.243
HisGly: 0.645 ± 0.188
HisHis: 0.242 ± 0.122
HisIle: 1.209 ± 0.331
HisLys: 1.129 ± 0.284
HisLeu: 0.968 ± 0.304
HisMet: 0.161 ± 0.106
HisAsn: 1.371 ± 0.209
HisPro: 0.645 ± 0.191
HisGln: 0.564 ± 0.271
HisArg: 0.887 ± 0.214
HisSer: 0.806 ± 0.288
HisThr: 0.564 ± 0.173
HisVal: 0.968 ± 0.246
HisTrp: 0.081 ± 0.075
HisTyr: 0.403 ± 0.179
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.757 ± 0.491
IleCys: 0.403 ± 0.161
IleAsp: 4.596 ± 0.584
IleGlu: 4.676 ± 0.664
IlePhe: 1.693 ± 0.319
IleGly: 3.386 ± 0.408
IleHis: 1.29 ± 0.292
IleIle: 2.903 ± 0.48
IleLys: 5.241 ± 0.586
IleLeu: 3.789 ± 0.604
IleMet: 1.613 ± 0.334
IleAsn: 4.757 ± 0.65
IlePro: 2.177 ± 0.365
IleGln: 3.064 ± 0.666
IleArg: 2.741 ± 0.562
IleSer: 4.273 ± 0.811
IleThr: 4.354 ± 0.789
IleVal: 3.144 ± 0.543
IleTrp: 0.484 ± 0.231
IleTyr: 2.016 ± 0.427
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.45 ± 0.745
LysCys: 0.161 ± 0.12
LysAsp: 5.966 ± 0.612
LysGlu: 5.966 ± 0.763
LysPhe: 3.064 ± 0.42
LysGly: 4.434 ± 0.691
LysHis: 0.887 ± 0.344
LysIle: 2.983 ± 0.364
LysLys: 5.241 ± 0.889
LysLeu: 6.773 ± 0.754
LysMet: 1.532 ± 0.345
LysAsn: 3.064 ± 0.474
LysPro: 3.064 ± 0.553
LysGln: 3.144 ± 0.626
LysArg: 3.225 ± 0.622
LysSer: 4.515 ± 0.635
LysThr: 4.031 ± 0.538
LysVal: 5.563 ± 0.566
LysTrp: 1.048 ± 0.212
LysTyr: 2.419 ± 0.541
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.724 ± 0.663
LeuCys: 0.484 ± 0.156
LeuAsp: 5.321 ± 0.489
LeuGlu: 9.594 ± 1.344
LeuPhe: 1.935 ± 0.343
LeuGly: 4.676 ± 0.637
LeuHis: 1.29 ± 0.309
LeuIle: 4.273 ± 0.671
LeuLys: 5.886 ± 0.737
LeuLeu: 5.402 ± 1.096
LeuMet: 1.693 ± 0.34
LeuAsn: 3.548 ± 0.465
LeuPro: 3.306 ± 0.559
LeuGln: 4.031 ± 0.47
LeuArg: 3.306 ± 0.483
LeuSer: 5.16 ± 0.493
LeuThr: 7.095 ± 0.686
LeuVal: 5.483 ± 0.589
LeuTrp: 0.806 ± 0.274
LeuTyr: 2.016 ± 0.404
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.58 ± 0.61
MetCys: 0.081 ± 0.083
MetAsp: 1.693 ± 0.355
MetGlu: 2.258 ± 0.389
MetPhe: 0.887 ± 0.28
MetGly: 0.968 ± 0.279
MetHis: 0.242 ± 0.127
MetIle: 1.209 ± 0.296
MetLys: 1.854 ± 0.325
MetLeu: 2.58 ± 0.379
MetMet: 0.323 ± 0.159
MetAsn: 1.29 ± 0.238
MetPro: 0.887 ± 0.214
MetGln: 0.726 ± 0.165
MetArg: 1.29 ± 0.34
MetSer: 1.371 ± 0.306
MetThr: 1.532 ± 0.368
MetVal: 1.29 ± 0.211
MetTrp: 0.323 ± 0.209
MetTyr: 0.645 ± 0.157
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.064 ± 0.548
AsnCys: 0.242 ± 0.16
AsnAsp: 2.903 ± 0.373
AsnGlu: 3.628 ± 0.455
AsnPhe: 2.177 ± 0.337
AsnGly: 4.031 ± 0.532
AsnHis: 0.887 ± 0.226
AsnIle: 3.144 ± 0.434
AsnLys: 4.999 ± 0.545
AsnLeu: 3.87 ± 0.353
AsnMet: 1.048 ± 0.222
AsnAsn: 1.209 ± 0.327
AsnPro: 3.064 ± 0.626
AsnGln: 1.854 ± 0.403
AsnArg: 2.419 ± 0.355
AsnSer: 3.386 ± 0.404
AsnThr: 2.016 ± 0.491
AsnVal: 3.87 ± 0.539
AsnTrp: 0.968 ± 0.263
AsnTyr: 1.854 ± 0.505
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.58 ± 0.589
ProCys: 0.161 ± 0.114
ProAsp: 1.613 ± 0.316
ProGlu: 3.386 ± 0.647
ProPhe: 1.693 ± 0.369
ProGly: 0.161 ± 0.105
ProHis: 0.403 ± 0.208
ProIle: 2.499 ± 0.573
ProLys: 3.144 ± 0.525
ProLeu: 2.096 ± 0.385
ProMet: 0.726 ± 0.206
ProAsn: 2.177 ± 0.402
ProPro: 0.968 ± 0.293
ProGln: 0.726 ± 0.225
ProArg: 0.968 ± 0.298
ProSer: 2.822 ± 0.457
ProThr: 2.096 ± 0.423
ProVal: 2.258 ± 0.369
ProTrp: 0.403 ± 0.145
ProTyr: 1.693 ± 0.359
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.064 ± 0.56
GlnCys: 0.081 ± 0.07
GlnAsp: 1.854 ± 0.373
GlnGlu: 3.467 ± 0.603
GlnPhe: 0.806 ± 0.194
GlnGly: 2.822 ± 0.517
GlnHis: 0.726 ± 0.242
GlnIle: 2.096 ± 0.587
GlnLys: 2.338 ± 0.38
GlnLeu: 3.628 ± 0.525
GlnMet: 0.887 ± 0.368
GlnAsn: 1.29 ± 0.367
GlnPro: 1.048 ± 0.22
GlnGln: 2.096 ± 0.584
GlnArg: 1.854 ± 0.272
GlnSer: 2.338 ± 0.437
GlnThr: 2.822 ± 0.496
GlnVal: 2.903 ± 0.289
GlnTrp: 0.564 ± 0.167
GlnTyr: 1.451 ± 0.464
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.661 ± 0.499
ArgCys: 0.403 ± 0.164
ArgAsp: 2.499 ± 0.286
ArgGlu: 3.789 ± 0.392
ArgPhe: 2.419 ± 0.379
ArgGly: 2.338 ± 0.379
ArgHis: 0.968 ± 0.335
ArgIle: 2.822 ± 0.45
ArgLys: 3.064 ± 0.5
ArgLeu: 4.193 ± 0.547
ArgMet: 0.564 ± 0.21
ArgAsn: 1.774 ± 0.337
ArgPro: 2.016 ± 0.433
ArgGln: 1.935 ± 0.352
ArgArg: 2.016 ± 0.394
ArgSer: 1.935 ± 0.384
ArgThr: 2.338 ± 0.393
ArgVal: 2.338 ± 0.407
ArgTrp: 0.645 ± 0.195
ArgTyr: 2.016 ± 0.367
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.548 ± 0.544
SerCys: 0.564 ± 0.237
SerAsp: 4.031 ± 0.449
SerGlu: 4.112 ± 0.48
SerPhe: 2.741 ± 0.444
SerGly: 4.757 ± 0.553
SerHis: 0.726 ± 0.223
SerIle: 4.434 ± 0.549
SerLys: 5.805 ± 0.386
SerLeu: 5.483 ± 0.696
SerMet: 1.532 ± 0.287
SerAsn: 3.306 ± 0.578
SerPro: 2.096 ± 0.348
SerGln: 1.693 ± 0.338
SerArg: 2.177 ± 0.493
SerSer: 2.983 ± 0.406
SerThr: 4.193 ± 0.543
SerVal: 4.273 ± 0.504
SerTrp: 0.806 ± 0.214
SerTyr: 2.177 ± 0.488
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.241 ± 1.023
ThrCys: 0.403 ± 0.179
ThrAsp: 4.354 ± 0.521
ThrGlu: 4.838 ± 0.553
ThrPhe: 2.822 ± 0.446
ThrGly: 3.628 ± 0.821
ThrHis: 0.645 ± 0.165
ThrIle: 4.838 ± 0.651
ThrLys: 4.354 ± 0.43
ThrLeu: 6.208 ± 0.664
ThrMet: 1.371 ± 0.297
ThrAsn: 3.386 ± 0.596
ThrPro: 2.741 ± 0.588
ThrGln: 2.338 ± 0.465
ThrArg: 1.451 ± 0.306
ThrSer: 3.709 ± 0.698
ThrThr: 3.144 ± 0.6
ThrVal: 5.563 ± 0.751
ThrTrp: 0.564 ± 0.254
ThrTyr: 2.499 ± 0.459
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.241 ± 0.696
ValCys: 0.645 ± 0.242
ValAsp: 4.838 ± 0.532
ValGlu: 5.886 ± 0.656
ValPhe: 2.983 ± 0.578
ValGly: 4.918 ± 0.705
ValHis: 0.806 ± 0.222
ValIle: 4.354 ± 0.712
ValLys: 5.644 ± 0.677
ValLeu: 4.999 ± 0.624
ValMet: 1.209 ± 0.288
ValAsn: 3.386 ± 0.501
ValPro: 2.258 ± 0.349
ValGln: 2.903 ± 0.417
ValArg: 2.903 ± 0.514
ValSer: 3.951 ± 0.526
ValThr: 4.354 ± 0.623
ValVal: 3.789 ± 0.604
ValTrp: 1.048 ± 0.295
ValTyr: 2.499 ± 0.418
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.887 ± 0.229
TrpCys: 0.081 ± 0.092
TrpAsp: 0.726 ± 0.258
TrpGlu: 0.968 ± 0.242
TrpPhe: 0.564 ± 0.215
TrpGly: 1.209 ± 0.343
TrpHis: 0.081 ± 0.066
TrpIle: 0.887 ± 0.244
TrpLys: 1.048 ± 0.243
TrpLeu: 0.484 ± 0.19
TrpMet: 0.161 ± 0.091
TrpAsn: 1.129 ± 0.313
TrpPro: 0.0 ± 0.0
TrpGln: 0.645 ± 0.236
TrpArg: 0.484 ± 0.195
TrpSer: 0.726 ± 0.214
TrpThr: 0.887 ± 0.256
TrpVal: 0.726 ± 0.226
TrpTrp: 0.323 ± 0.199
TrpTyr: 0.806 ± 0.243
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.096 ± 0.406
TyrCys: 0.323 ± 0.152
TyrAsp: 2.419 ± 0.347
TyrGlu: 4.193 ± 0.725
TyrPhe: 0.806 ± 0.24
TyrGly: 2.741 ± 0.438
TyrHis: 0.726 ± 0.233
TyrIle: 1.29 ± 0.349
TyrLys: 2.983 ± 0.468
TyrLeu: 3.628 ± 0.515
TyrMet: 1.29 ± 0.316
TyrAsn: 2.258 ± 0.487
TyrPro: 1.129 ± 0.314
TyrGln: 1.451 ± 0.321
TyrArg: 1.854 ± 0.407
TyrSer: 2.096 ± 0.383
TyrThr: 2.177 ± 0.318
TyrVal: 2.903 ± 0.324
TyrTrp: 0.242 ± 0.124
TyrTyr: 2.177 ± 0.438
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 44 proteins (12404 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
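One way such a protein-level bootstrap could look is sketched below. This is an illustration, not the Proteome-pI code: it assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each of the 100 resamples, and that the reported error is the standard deviation across resamples. The function names, the seed and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency of one dipeptide (per mille) over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy proteome (made up for illustration).
toy = ["MKAAE", "MEEKL", "MAAAK", "MLEEE"]
print(pair_permille(toy, "EE"), "±", bootstrap_error(toy, "EE"))
```

Resampling whole proteins rather than individual dipeptides keeps the pairs within a protein together, so the error reflects how much the estimate depends on which proteins are in the set.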

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in UniProt.
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski