Amino acid dipeptide frequency for Staphylococcus phage TEM123

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
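As an illustration of what such a calculation might look like, here is a minimal Python sketch. It assumes overlapping windows of two residues, one-letter codes as input, pairs that never span two proteins, and counts pooled over all proteins. The page does not state these details, so they are assumptions, and the function names are hypothetical.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_counts(sequence):
    """Count overlapping dipeptides in a single protein sequence."""
    # Assumption: any non-standard or unknown residue is mapped to X (Xaa).
    seq = "".join(aa if aa in STANDARD else "X" for aa in sequence.upper())
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def dipeptide_frequencies(sequences):
    """Pool counts over all proteins and return frequencies in per mille."""
    total = Counter()
    for seq in sequences:
        total.update(dipeptide_counts(seq))
    n_pairs = sum(total.values())
    if n_pairs == 0:
        return {}
    return {pair: 1000.0 * count / n_pairs for pair, count in total.items()}
```

For example, `dipeptide_frequencies(["AAK", "KA"])` sees three dipeptides in total (AA, AK, KA), so each gets one third of 1000‰.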

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each of the 400 dipeptides would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility the dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
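The uniform expectation of 1/400 and the over/under labelling can be sketched as follows. This is only an illustration: the page does not say what threshold its colouring uses, nor whether the error estimate is taken into account, so the plain comparison against 2.5‰ is an assumption, and `classify` is a hypothetical helper.

```python
N_STANDARD = 20

# 1 / (20 * 20) = 0.0025, i.e. 0.25% or 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / (N_STANDARD ** 2)


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide frequency relative to the uniform expectation.

    Assumption: a simple comparison with no tolerance band.
    """
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"
```

With the values from the table, LysLys (8.322‰) would be labelled "over" and CysCys (0.085‰) "under" by this rule.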

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
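The unit conversion is plain arithmetic; a small sketch with hypothetical helper names:

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0
```

For instance, the LysLys value of 8.322‰ corresponds to a fraction of about 0.008322, i.e. roughly 0.83%.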

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
1.529AlaAla: 1.529 ± 0.581
0.679AlaCys: 0.679 ± 0.204
2.717AlaAsp: 2.717 ± 0.496
3.991AlaGlu: 3.991 ± 0.884
2.802AlaPhe: 2.802 ± 0.636
3.821AlaGly: 3.821 ± 0.704
0.594AlaHis: 0.594 ± 0.202
4.925AlaIle: 4.925 ± 1.402
4.925AlaLys: 4.925 ± 0.705
4.161AlaLeu: 4.161 ± 0.786
2.123AlaMet: 2.123 ± 0.577
4.416AlaAsn: 4.416 ± 0.587
1.698AlaPro: 1.698 ± 0.437
2.293AlaGln: 2.293 ± 0.531
3.142AlaArg: 3.142 ± 0.544
3.482AlaSer: 3.482 ± 0.703
4.076AlaThr: 4.076 ± 0.618
3.397AlaVal: 3.397 ± 0.589
0.849AlaTrp: 0.849 ± 0.382
2.972AlaTyr: 2.972 ± 0.51
0.0AlaXaa: 0.0 ± 0.0
Cys
0.34CysAla: 0.34 ± 0.21
0.085CysCys: 0.085 ± 0.082
0.255CysAsp: 0.255 ± 0.154
0.255CysGlu: 0.255 ± 0.129
0.17CysPhe: 0.17 ± 0.109
0.17CysGly: 0.17 ± 0.134
0.0CysHis: 0.0 ± 0.0
0.679CysIle: 0.679 ± 0.261
0.594CysLys: 0.594 ± 0.254
0.425CysLeu: 0.425 ± 0.206
0.0CysMet: 0.0 ± 0.0
0.51CysAsn: 0.51 ± 0.218
0.255CysPro: 0.255 ± 0.128
0.255CysGln: 0.255 ± 0.136
0.17CysArg: 0.17 ± 0.12
0.425CysSer: 0.425 ± 0.238
0.34CysThr: 0.34 ± 0.142
0.0CysVal: 0.0 ± 0.0
0.085CysTrp: 0.085 ± 0.08
0.425CysTyr: 0.425 ± 0.18
0.0CysXaa: 0.0 ± 0.0
Asp
2.632AspAla: 2.632 ± 0.447
0.34AspCys: 0.34 ± 0.169
4.501AspAsp: 4.501 ± 0.885
5.52AspGlu: 5.52 ± 1.037
3.312AspPhe: 3.312 ± 0.53
3.991AspGly: 3.991 ± 0.557
0.679AspHis: 0.679 ± 0.251
4.076AspIle: 4.076 ± 0.623
5.944AspLys: 5.944 ± 0.77
4.755AspLeu: 4.755 ± 0.731
1.613AspMet: 1.613 ± 0.36
5.35AspAsn: 5.35 ± 0.827
1.189AspPro: 1.189 ± 0.32
0.594AspGln: 0.594 ± 0.223
2.293AspArg: 2.293 ± 0.433
4.076AspSer: 4.076 ± 0.708
3.821AspThr: 3.821 ± 0.718
3.906AspVal: 3.906 ± 0.564
0.679AspTrp: 0.679 ± 0.166
2.632AspTyr: 2.632 ± 0.487
0.0AspXaa: 0.0 ± 0.0
Glu
5.265GluAla: 5.265 ± 0.996
0.255GluCys: 0.255 ± 0.145
2.632GluAsp: 2.632 ± 0.618
6.114GluGlu: 6.114 ± 1.121
3.651GluPhe: 3.651 ± 0.608
3.057GluGly: 3.057 ± 0.506
1.104GluHis: 1.104 ± 0.324
6.454GluIle: 6.454 ± 1.072
6.029GluLys: 6.029 ± 1.152
6.539GluLeu: 6.539 ± 1.162
2.038GluMet: 2.038 ± 0.456
5.265GluAsn: 5.265 ± 0.872
2.038GluPro: 2.038 ± 0.436
3.482GluGln: 3.482 ± 0.608
3.142GluArg: 3.142 ± 0.602
4.586GluSer: 4.586 ± 0.807
3.227GluThr: 3.227 ± 0.557
5.52GluVal: 5.52 ± 0.839
0.51GluTrp: 0.51 ± 0.231
4.925GluTyr: 4.925 ± 0.714
0.0GluXaa: 0.0 ± 0.0
Phe
2.208PheAla: 2.208 ± 0.354
0.34PheCys: 0.34 ± 0.147
2.717PheAsp: 2.717 ± 0.427
4.671PheGlu: 4.671 ± 0.714
1.359PhePhe: 1.359 ± 0.293
2.293PheGly: 2.293 ± 0.537
0.764PheHis: 0.764 ± 0.399
2.632PheIle: 2.632 ± 0.421
4.331PheLys: 4.331 ± 0.592
2.293PheLeu: 2.293 ± 0.437
1.104PheMet: 1.104 ± 0.404
3.906PheAsn: 3.906 ± 0.57
0.934PhePro: 0.934 ± 0.322
1.698PheGln: 1.698 ± 0.572
1.104PheArg: 1.104 ± 0.413
2.293PheSer: 2.293 ± 0.466
2.548PheThr: 2.548 ± 0.437
2.802PheVal: 2.802 ± 0.827
0.17PheTrp: 0.17 ± 0.096
1.613PheTyr: 1.613 ± 0.325
0.0PheXaa: 0.0 ± 0.0
Gly
3.227GlyAla: 3.227 ± 0.644
0.34GlyCys: 0.34 ± 0.147
3.482GlyAsp: 3.482 ± 0.517
3.567GlyGlu: 3.567 ± 0.6
1.868GlyPhe: 1.868 ± 0.549
3.397GlyGly: 3.397 ± 0.714
1.274GlyHis: 1.274 ± 0.466
4.076GlyIle: 4.076 ± 0.667
4.84GlyLys: 4.84 ± 0.562
5.01GlyLeu: 5.01 ± 0.903
1.444GlyMet: 1.444 ± 0.395
3.482GlyAsn: 3.482 ± 0.727
0.679GlyPro: 0.679 ± 0.336
1.953GlyGln: 1.953 ± 0.398
2.972GlyArg: 2.972 ± 0.5
2.717GlySer: 2.717 ± 0.458
3.821GlyThr: 3.821 ± 0.566
4.246GlyVal: 4.246 ± 0.82
1.104GlyTrp: 1.104 ± 0.331
2.632GlyTyr: 2.632 ± 0.51
0.0GlyXaa: 0.0 ± 0.0
His
1.529HisAla: 1.529 ± 0.325
0.085HisCys: 0.085 ± 0.078
1.104HisAsp: 1.104 ± 0.245
1.104HisGlu: 1.104 ± 0.327
0.594HisPhe: 0.594 ± 0.221
1.359HisGly: 1.359 ± 0.294
0.255HisHis: 0.255 ± 0.134
0.764HisIle: 0.764 ± 0.244
0.764HisLys: 0.764 ± 0.244
1.104HisLeu: 1.104 ± 0.283
0.425HisMet: 0.425 ± 0.225
1.104HisAsn: 1.104 ± 0.307
0.51HisPro: 0.51 ± 0.17
0.425HisGln: 0.425 ± 0.194
0.34HisArg: 0.34 ± 0.15
0.849HisSer: 0.849 ± 0.286
0.679HisThr: 0.679 ± 0.222
1.019HisVal: 1.019 ± 0.301
0.0HisTrp: 0.0 ± 0.0
0.51HisTyr: 0.51 ± 0.272
0.0HisXaa: 0.0 ± 0.0
Ile
5.605IleAla: 5.605 ± 1.13
0.0IleCys: 0.0 ± 0.0
6.114IleAsp: 6.114 ± 0.845
5.944IleGlu: 5.944 ± 0.787
2.293IlePhe: 2.293 ± 0.539
3.651IleGly: 3.651 ± 0.681
1.444IleHis: 1.444 ± 0.326
4.671IleIle: 4.671 ± 0.842
7.473IleLys: 7.473 ± 0.895
4.416IleLeu: 4.416 ± 0.599
1.613IleMet: 1.613 ± 0.424
5.52IleAsn: 5.52 ± 1.046
1.953IlePro: 1.953 ± 0.318
2.463IleGln: 2.463 ± 0.505
2.717IleArg: 2.717 ± 0.558
3.991IleSer: 3.991 ± 0.574
5.605IleThr: 5.605 ± 0.917
5.605IleVal: 5.605 ± 1.278
1.444IleTrp: 1.444 ± 0.857
2.632IleTyr: 2.632 ± 0.435
0.0IleXaa: 0.0 ± 0.0
Lys
5.265LysAla: 5.265 ± 0.665
0.085LysCys: 0.085 ± 0.079
6.454LysAsp: 6.454 ± 0.977
6.963LysGlu: 6.963 ± 1.172
2.887LysPhe: 2.887 ± 0.502
4.84LysGly: 4.84 ± 0.723
1.189LysHis: 1.189 ± 0.323
5.944LysIle: 5.944 ± 1.034
8.322LysLys: 8.322 ± 1.183
7.048LysLeu: 7.048 ± 0.964
2.123LysMet: 2.123 ± 0.446
6.539LysAsn: 6.539 ± 0.892
3.227LysPro: 3.227 ± 0.558
5.18LysGln: 5.18 ± 0.864
3.991LysArg: 3.991 ± 0.761
5.435LysSer: 5.435 ± 0.846
5.01LysThr: 5.01 ± 0.676
5.69LysVal: 5.69 ± 0.617
0.679LysTrp: 0.679 ± 0.271
4.331LysTyr: 4.331 ± 0.839
0.0LysXaa: 0.0 ± 0.0
Leu
2.802LeuAla: 2.802 ± 0.645
0.255LeuCys: 0.255 ± 0.132
4.246LeuAsp: 4.246 ± 0.719
5.944LeuGlu: 5.944 ± 0.773
3.227LeuPhe: 3.227 ± 0.538
3.821LeuGly: 3.821 ± 0.515
0.934LeuHis: 0.934 ± 0.271
4.246LeuIle: 4.246 ± 0.58
7.303LeuLys: 7.303 ± 0.674
4.925LeuLeu: 4.925 ± 0.64
1.783LeuMet: 1.783 ± 0.343
6.029LeuAsn: 6.029 ± 0.516
2.632LeuPro: 2.632 ± 0.385
3.821LeuGln: 3.821 ± 0.652
3.567LeuArg: 3.567 ± 0.61
3.567LeuSer: 3.567 ± 0.482
5.265LeuThr: 5.265 ± 0.712
4.331LeuVal: 4.331 ± 0.785
0.679LeuTrp: 0.679 ± 0.297
2.887LeuTyr: 2.887 ± 0.565
0.0LeuXaa: 0.0 ± 0.0
Met
1.359MetAla: 1.359 ± 0.368
0.085MetCys: 0.085 ± 0.078
1.104MetAsp: 1.104 ± 0.247
2.038MetGlu: 2.038 ± 0.425
0.679MetPhe: 0.679 ± 0.212
1.104MetGly: 1.104 ± 0.311
0.255MetHis: 0.255 ± 0.191
1.613MetIle: 1.613 ± 0.321
1.698MetLys: 1.698 ± 0.325
2.717MetLeu: 2.717 ± 0.36
0.594MetMet: 0.594 ± 0.18
1.444MetAsn: 1.444 ± 0.342
0.679MetPro: 0.679 ± 0.292
1.444MetGln: 1.444 ± 0.41
0.934MetArg: 0.934 ± 0.26
1.274MetSer: 1.274 ± 0.336
2.717MetThr: 2.717 ± 0.593
0.934MetVal: 0.934 ± 0.263
0.34MetTrp: 0.34 ± 0.177
1.189MetTyr: 1.189 ± 0.312
0.0MetXaa: 0.0 ± 0.0
Asn
6.284AsnAla: 6.284 ± 1.039
0.51AsnCys: 0.51 ± 0.238
5.435AsnAsp: 5.435 ± 0.887
5.095AsnGlu: 5.095 ± 0.897
3.651AsnPhe: 3.651 ± 0.606
5.69AsnGly: 5.69 ± 0.647
0.934AsnHis: 0.934 ± 0.274
4.246AsnIle: 4.246 ± 0.496
7.218AsnLys: 7.218 ± 0.764
5.01AsnLeu: 5.01 ± 0.713
1.444AsnMet: 1.444 ± 0.485
5.18AsnAsn: 5.18 ± 1.054
3.142AsnPro: 3.142 ± 0.51
2.972AsnGln: 2.972 ± 0.597
2.632AsnArg: 2.632 ± 0.414
3.227AsnSer: 3.227 ± 0.419
3.567AsnThr: 3.567 ± 0.496
5.435AsnVal: 5.435 ± 0.749
0.764AsnTrp: 0.764 ± 0.202
3.227AsnTyr: 3.227 ± 0.516
0.0AsnXaa: 0.0 ± 0.0
Pro
1.444ProAla: 1.444 ± 0.321
0.255ProCys: 0.255 ± 0.137
1.274ProAsp: 1.274 ± 0.269
1.783ProGlu: 1.783 ± 0.321
1.444ProPhe: 1.444 ± 0.346
1.698ProGly: 1.698 ± 0.468
0.17ProHis: 0.17 ± 0.119
2.632ProIle: 2.632 ± 0.456
3.057ProLys: 3.057 ± 0.62
1.274ProLeu: 1.274 ± 0.366
0.934ProMet: 0.934 ± 0.275
2.293ProAsn: 2.293 ± 0.435
0.764ProPro: 0.764 ± 0.291
1.189ProGln: 1.189 ± 0.31
1.274ProArg: 1.274 ± 0.351
1.274ProSer: 1.274 ± 0.347
1.613ProThr: 1.613 ± 0.398
2.293ProVal: 2.293 ± 0.419
0.085ProTrp: 0.085 ± 0.086
1.359ProTyr: 1.359 ± 0.333
0.0ProXaa: 0.0 ± 0.0
Gln
2.378GlnAla: 2.378 ± 0.406
0.51GlnCys: 0.51 ± 0.205
2.887GlnAsp: 2.887 ± 0.52
2.887GlnGlu: 2.887 ± 0.545
1.953GlnPhe: 1.953 ± 0.391
2.378GlnGly: 2.378 ± 0.383
0.34GlnHis: 0.34 ± 0.156
3.142GlnIle: 3.142 ± 0.707
3.057GlnLys: 3.057 ± 0.541
3.057GlnLeu: 3.057 ± 0.611
1.019GlnMet: 1.019 ± 0.276
3.312GlnAsn: 3.312 ± 0.624
1.274GlnPro: 1.274 ± 0.298
2.632GlnGln: 2.632 ± 0.745
1.444GlnArg: 1.444 ± 0.389
2.378GlnSer: 2.378 ± 0.482
2.123GlnThr: 2.123 ± 0.43
1.698GlnVal: 1.698 ± 0.341
0.425GlnTrp: 0.425 ± 0.204
1.189GlnTyr: 1.189 ± 0.374
0.0GlnXaa: 0.0 ± 0.0
Arg
1.698ArgAla: 1.698 ± 0.355
0.34ArgCys: 0.34 ± 0.169
1.868ArgAsp: 1.868 ± 0.45
2.548ArgGlu: 2.548 ± 0.349
1.698ArgPhe: 1.698 ± 0.423
2.123ArgGly: 2.123 ± 0.485
0.764ArgHis: 0.764 ± 0.24
3.821ArgIle: 3.821 ± 0.656
4.586ArgLys: 4.586 ± 0.736
4.161ArgLeu: 4.161 ± 0.628
0.764ArgMet: 0.764 ± 0.351
3.142ArgAsn: 3.142 ± 0.617
0.764ArgPro: 0.764 ± 0.206
1.444ArgGln: 1.444 ± 0.412
2.038ArgArg: 2.038 ± 0.454
2.293ArgSer: 2.293 ± 0.551
1.953ArgThr: 1.953 ± 0.459
2.548ArgVal: 2.548 ± 0.506
0.425ArgTrp: 0.425 ± 0.22
2.123ArgTyr: 2.123 ± 0.477
0.0ArgXaa: 0.0 ± 0.0
Ser
4.161SerAla: 4.161 ± 0.753
0.34SerCys: 0.34 ± 0.159
4.416SerAsp: 4.416 ± 0.735
3.567SerGlu: 3.567 ± 0.615
2.378SerPhe: 2.378 ± 0.435
3.312SerGly: 3.312 ± 0.545
1.359SerHis: 1.359 ± 0.369
5.01SerIle: 5.01 ± 0.618
5.18SerLys: 5.18 ± 0.737
3.227SerLeu: 3.227 ± 0.364
1.444SerMet: 1.444 ± 0.353
4.246SerAsn: 4.246 ± 0.549
1.444SerPro: 1.444 ± 0.406
1.953SerGln: 1.953 ± 0.468
2.293SerArg: 2.293 ± 0.411
3.567SerSer: 3.567 ± 0.615
3.736SerThr: 3.736 ± 0.525
2.632SerVal: 2.632 ± 0.508
0.34SerTrp: 0.34 ± 0.24
2.123SerTyr: 2.123 ± 0.417
0.0SerXaa: 0.0 ± 0.0
Thr
4.161ThrAla: 4.161 ± 0.664
0.17ThrCys: 0.17 ± 0.115
3.227ThrAsp: 3.227 ± 0.589
4.925ThrGlu: 4.925 ± 0.766
2.123ThrPhe: 2.123 ± 0.445
3.821ThrGly: 3.821 ± 0.617
0.849ThrHis: 0.849 ± 0.229
6.029ThrIle: 6.029 ± 1.347
5.18ThrLys: 5.18 ± 0.666
4.755ThrLeu: 4.755 ± 0.596
0.679ThrMet: 0.679 ± 0.255
4.84ThrAsn: 4.84 ± 0.691
1.953ThrPro: 1.953 ± 0.444
2.972ThrGln: 2.972 ± 0.451
2.463ThrArg: 2.463 ± 0.351
3.651ThrSer: 3.651 ± 0.624
5.01ThrThr: 5.01 ± 0.957
4.586ThrVal: 4.586 ± 1.043
0.594ThrTrp: 0.594 ± 0.265
1.613ThrTyr: 1.613 ± 0.325
0.0ThrXaa: 0.0 ± 0.0
Val
3.651ValAla: 3.651 ± 1.025
0.425ValCys: 0.425 ± 0.189
4.416ValAsp: 4.416 ± 0.741
5.265ValGlu: 5.265 ± 0.704
3.227ValPhe: 3.227 ± 0.589
2.378ValGly: 2.378 ± 0.589
0.764ValHis: 0.764 ± 0.248
6.284ValIle: 6.284 ± 0.949
5.605ValLys: 5.605 ± 0.664
4.501ValLeu: 4.501 ± 0.775
1.783ValMet: 1.783 ± 0.387
4.671ValAsn: 4.671 ± 0.605
1.953ValPro: 1.953 ± 0.462
1.698ValGln: 1.698 ± 0.356
2.208ValArg: 2.208 ± 0.503
4.161ValSer: 4.161 ± 0.832
3.821ValThr: 3.821 ± 0.502
4.331ValVal: 4.331 ± 0.975
1.019ValTrp: 1.019 ± 0.44
2.208ValTyr: 2.208 ± 0.478
0.0ValXaa: 0.0 ± 0.0
Trp
1.019TrpAla: 1.019 ± 0.352
0.085TrpCys: 0.085 ± 0.08
0.255TrpAsp: 0.255 ± 0.122
0.764TrpGlu: 0.764 ± 0.229
0.34TrpPhe: 0.34 ± 0.233
0.51TrpGly: 0.51 ± 0.333
0.17TrpHis: 0.17 ± 0.107
0.594TrpIle: 0.594 ± 0.254
0.849TrpLys: 0.849 ± 0.233
0.594TrpLeu: 0.594 ± 0.226
0.085TrpMet: 0.085 ± 0.078
1.783TrpAsn: 1.783 ± 1.371
0.17TrpPro: 0.17 ± 0.118
0.34TrpGln: 0.34 ± 0.178
0.085TrpArg: 0.085 ± 0.08
0.849TrpSer: 0.849 ± 0.386
1.104TrpThr: 1.104 ± 0.398
0.764TrpVal: 0.764 ± 0.27
0.0TrpTrp: 0.0 ± 0.0
0.51TrpTyr: 0.51 ± 0.24
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.038TyrAla: 2.038 ± 0.468
0.255TyrCys: 0.255 ± 0.155
2.802TyrAsp: 2.802 ± 0.628
2.887TyrGlu: 2.887 ± 0.567
2.208TyrPhe: 2.208 ± 0.437
2.717TyrGly: 2.717 ± 0.681
0.679TyrHis: 0.679 ± 0.276
3.312TyrIle: 3.312 ± 0.537
4.246TyrLys: 4.246 ± 0.554
2.293TyrLeu: 2.293 ± 0.37
1.104TyrMet: 1.104 ± 0.336
2.632TyrAsn: 2.632 ± 0.389
0.849TyrPro: 0.849 ± 0.345
1.274TyrGln: 1.274 ± 0.277
2.293TyrArg: 2.293 ± 0.553
2.632TyrSer: 2.632 ± 0.449
3.567TyrThr: 3.567 ± 0.515
2.632TyrVal: 2.632 ± 0.535
0.679TyrTrp: 0.679 ± 0.257
1.359TyrTyr: 1.359 ± 0.281
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 43 proteins (11777 amino acids)
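In this text form, each table entry above reads `<value><First><Second>: <value> ± <error>`, with row labels on separate lines. A minimal sketch for turning such lines back into structured records could look like this. The regular expression is inferred from the entries shown here, not from any documented format, and `parse_cell` is a hypothetical helper.

```python
import re

# Pattern inferred from entries such as "1.529AlaAla: 1.529 ± 0.581".
CELL = re.compile(
    r"^(?P<shown>\d+(?:\.\d+)?)"
    r"(?P<first>[A-Z][a-z]{2})(?P<second>[A-Z][a-z]{2}): "
    r"(?P<mean>\d+(?:\.\d+)?) ± (?P<error>\d+(?:\.\d+)?)$"
)


def parse_cell(line):
    """Return (first, second, mean, error) in per mille, or None for non-cell lines."""
    match = CELL.match(line.strip())
    if match is None:
        return None  # e.g. a row label such as "Ala"
    return (
        match["first"],
        match["second"],
        float(match["mean"]),
        float(match["error"]),
    )
```

Row labels and header lines simply yield `None`, so the function can be mapped over every line of the table and the results filtered.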

Note: The error has been estimated by bootstrapping (×100) at the protein level.
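A protein-level bootstrap of this kind can be sketched as follows. The sketch assumes that whole proteins are resampled with replacement, that frequencies are recomputed from pooled overlapping pairs in each round, and that the reported error is the sample standard deviation over the rounds. The page only states "bootstrapping (×100) at the protein level", so all of these details and the function names are assumptions.

```python
import random
import statistics
from collections import Counter


def pooled_frequencies(proteins):
    """Dipeptide frequencies in per mille, pooled over the given proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    n_pairs = sum(counts.values())
    if n_pairs == 0:
        return {}
    return {pair: 1000.0 * c / n_pairs for pair, c in counts.items()}


def bootstrap_errors(proteins, n_rounds=100, seed=0):
    """Estimate per-dipeptide errors by resampling whole proteins.

    Requires n_rounds >= 2 because the sample standard deviation is used.
    """
    rng = random.Random(seed)
    rounds = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        resample = rng.choices(proteins, k=len(proteins))
        rounds.append(pooled_frequencies(resample))
    pairs = set().union(*rounds)
    # A dipeptide absent from a resample counts as frequency 0 in that round.
    return {
        pair: statistics.stdev(r.get(pair, 0.0) for r in rounds)
        for pair in pairs
    }
```

Resampling proteins rather than individual residues keeps each sequence intact, so the spread reflects variation between proteins rather than between positions.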

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski