Amino acid dipepetide frequency for Synechococcus phage S-CBP1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.486AlaAla: 12.486 ± 1.351
0.374AlaCys: 0.374 ± 0.149
6.056AlaAsp: 6.056 ± 0.609
5.832AlaGlu: 5.832 ± 0.826
3.14AlaPhe: 3.14 ± 0.568
6.953AlaGly: 6.953 ± 0.926
0.748AlaHis: 0.748 ± 0.28
4.86AlaIle: 4.86 ± 0.71
4.86AlaLys: 4.86 ± 0.735
8.299AlaLeu: 8.299 ± 0.74
3.29AlaMet: 3.29 ± 0.518
4.785AlaAsn: 4.785 ± 1.49
4.187AlaPro: 4.187 ± 0.642
4.411AlaGln: 4.411 ± 0.482
5.832AlaArg: 5.832 ± 0.963
6.579AlaSer: 6.579 ± 1.241
5.757AlaThr: 5.757 ± 1.063
5.907AlaVal: 5.907 ± 0.822
1.121AlaTrp: 1.121 ± 0.316
2.766AlaTyr: 2.766 ± 0.426
0.0AlaXaa: 0.0 ± 0.0
Cys
0.523CysAla: 0.523 ± 0.224
0.075CysCys: 0.075 ± 0.078
0.449CysAsp: 0.449 ± 0.167
0.449CysGlu: 0.449 ± 0.22
0.15CysPhe: 0.15 ± 0.098
0.449CysGly: 0.449 ± 0.208
0.15CysHis: 0.15 ± 0.107
0.449CysIle: 0.449 ± 0.167
0.374CysLys: 0.374 ± 0.245
0.449CysLeu: 0.449 ± 0.213
0.075CysMet: 0.075 ± 0.067
0.523CysAsn: 0.523 ± 0.234
0.15CysPro: 0.15 ± 0.105
0.374CysGln: 0.374 ± 0.203
0.299CysArg: 0.299 ± 0.166
0.897CysSer: 0.897 ± 0.274
0.449CysThr: 0.449 ± 0.193
0.598CysVal: 0.598 ± 0.196
0.0CysTrp: 0.0 ± 0.0
0.449CysTyr: 0.449 ± 0.192
0.0CysXaa: 0.0 ± 0.0
Asp
7.103AspAla: 7.103 ± 0.76
0.449AspCys: 0.449 ± 0.2
4.187AspAsp: 4.187 ± 0.754
4.037AspGlu: 4.037 ± 0.537
2.019AspPhe: 2.019 ± 0.394
5.907AspGly: 5.907 ± 0.874
0.972AspHis: 0.972 ± 0.283
3.364AspIle: 3.364 ± 0.594
2.318AspLys: 2.318 ± 0.404
5.757AspLeu: 5.757 ± 0.637
1.346AspMet: 1.346 ± 0.324
3.065AspAsn: 3.065 ± 0.543
3.589AspPro: 3.589 ± 0.408
2.841AspGln: 2.841 ± 0.648
2.467AspArg: 2.467 ± 0.471
3.14AspSer: 3.14 ± 0.551
4.112AspThr: 4.112 ± 0.436
3.14AspVal: 3.14 ± 0.447
0.598AspTrp: 0.598 ± 0.237
2.991AspTyr: 2.991 ± 0.484
0.0AspXaa: 0.0 ± 0.0
Glu
7.327GluAla: 7.327 ± 0.732
0.523GluCys: 0.523 ± 0.187
3.664GluAsp: 3.664 ± 0.547
4.71GluGlu: 4.71 ± 0.983
2.243GluPhe: 2.243 ± 0.424
2.991GluGly: 2.991 ± 0.449
0.897GluHis: 0.897 ± 0.237
3.29GluIle: 3.29 ± 0.545
2.692GluLys: 2.692 ± 0.43
4.561GluLeu: 4.561 ± 0.656
1.421GluMet: 1.421 ± 0.329
1.944GluAsn: 1.944 ± 0.326
1.645GluPro: 1.645 ± 0.389
2.467GluGln: 2.467 ± 0.512
4.785GluArg: 4.785 ± 0.676
3.589GluSer: 3.589 ± 0.39
3.888GluThr: 3.888 ± 0.621
4.037GluVal: 4.037 ± 0.716
1.421GluTrp: 1.421 ± 0.453
2.019GluTyr: 2.019 ± 0.395
0.0GluXaa: 0.0 ± 0.0
Phe
2.093PheAla: 2.093 ± 0.38
0.299PheCys: 0.299 ± 0.173
2.168PheAsp: 2.168 ± 0.556
0.972PheGlu: 0.972 ± 0.278
1.794PhePhe: 1.794 ± 0.325
2.766PheGly: 2.766 ± 0.551
0.523PheHis: 0.523 ± 0.178
2.093PheIle: 2.093 ± 0.36
2.168PheLys: 2.168 ± 0.53
3.065PheLeu: 3.065 ± 0.536
1.047PheMet: 1.047 ± 0.266
2.243PheAsn: 2.243 ± 0.556
1.346PhePro: 1.346 ± 0.229
1.944PheGln: 1.944 ± 0.385
2.243PheArg: 2.243 ± 0.381
2.542PheSer: 2.542 ± 0.419
2.243PheThr: 2.243 ± 0.374
1.794PheVal: 1.794 ± 0.331
0.374PheTrp: 0.374 ± 0.214
1.421PheTyr: 1.421 ± 0.323
0.0PheXaa: 0.0 ± 0.0
Gly
5.757GlyAla: 5.757 ± 0.711
0.449GlyCys: 0.449 ± 0.166
4.262GlyAsp: 4.262 ± 0.617
3.888GlyGlu: 3.888 ± 0.64
2.318GlyPhe: 2.318 ± 0.383
5.009GlyGly: 5.009 ± 1.997
1.196GlyHis: 1.196 ± 0.335
3.963GlyIle: 3.963 ± 0.523
4.71GlyLys: 4.71 ± 0.484
6.43GlyLeu: 6.43 ± 0.721
1.645GlyMet: 1.645 ± 0.424
3.514GlyAsn: 3.514 ± 0.617
1.57GlyPro: 1.57 ± 0.324
3.065GlyGln: 3.065 ± 0.382
4.785GlyArg: 4.785 ± 0.819
6.505GlySer: 6.505 ± 0.964
4.636GlyThr: 4.636 ± 1.036
5.682GlyVal: 5.682 ± 0.632
1.421GlyTrp: 1.421 ± 0.322
3.215GlyTyr: 3.215 ± 0.477
0.0GlyXaa: 0.0 ± 0.0
His
0.822HisAla: 0.822 ± 0.257
0.299HisCys: 0.299 ± 0.178
1.121HisAsp: 1.121 ± 0.292
1.047HisGlu: 1.047 ± 0.298
0.598HisPhe: 0.598 ± 0.231
1.271HisGly: 1.271 ± 0.369
0.299HisHis: 0.299 ± 0.152
0.374HisIle: 0.374 ± 0.149
0.523HisLys: 0.523 ± 0.197
1.645HisLeu: 1.645 ± 0.504
0.299HisMet: 0.299 ± 0.14
0.598HisAsn: 0.598 ± 0.207
0.673HisPro: 0.673 ± 0.192
0.897HisGln: 0.897 ± 0.265
0.598HisArg: 0.598 ± 0.153
0.449HisSer: 0.449 ± 0.182
0.972HisThr: 0.972 ± 0.342
0.598HisVal: 0.598 ± 0.199
0.299HisTrp: 0.299 ± 0.162
0.523HisTyr: 0.523 ± 0.19
0.0HisXaa: 0.0 ± 0.0
Ile
5.159IleAla: 5.159 ± 0.598
0.449IleCys: 0.449 ± 0.223
3.963IleAsp: 3.963 ± 0.593
3.215IleGlu: 3.215 ± 0.529
1.421IlePhe: 1.421 ± 0.329
3.664IleGly: 3.664 ± 0.575
0.598IleHis: 0.598 ± 0.296
2.318IleIle: 2.318 ± 0.607
2.916IleLys: 2.916 ± 0.603
3.065IleLeu: 3.065 ± 0.477
0.972IleMet: 0.972 ± 0.283
2.318IleAsn: 2.318 ± 0.433
2.318IlePro: 2.318 ± 0.398
2.617IleGln: 2.617 ± 0.37
2.841IleArg: 2.841 ± 0.459
3.738IleSer: 3.738 ± 0.65
3.439IleThr: 3.439 ± 0.519
3.29IleVal: 3.29 ± 0.657
0.598IleTrp: 0.598 ± 0.222
1.944IleTyr: 1.944 ± 0.333
0.0IleXaa: 0.0 ± 0.0
Lys
5.234LysAla: 5.234 ± 0.768
0.15LysCys: 0.15 ± 0.111
3.29LysAsp: 3.29 ± 0.425
3.589LysGlu: 3.589 ± 0.573
1.944LysPhe: 1.944 ± 0.463
3.215LysGly: 3.215 ± 0.444
0.673LysHis: 0.673 ± 0.229
2.916LysIle: 2.916 ± 0.321
2.766LysLys: 2.766 ± 0.627
5.607LysLeu: 5.607 ± 0.83
1.271LysMet: 1.271 ± 0.37
2.318LysAsn: 2.318 ± 0.454
2.318LysPro: 2.318 ± 0.46
2.766LysGln: 2.766 ± 0.574
2.393LysArg: 2.393 ± 0.411
2.318LysSer: 2.318 ± 0.453
2.841LysThr: 2.841 ± 0.434
3.888LysVal: 3.888 ± 0.546
0.673LysTrp: 0.673 ± 0.221
2.168LysTyr: 2.168 ± 0.385
0.0LysXaa: 0.0 ± 0.0
Leu
6.579LeuAla: 6.579 ± 0.662
0.224LeuCys: 0.224 ± 0.125
5.682LeuAsp: 5.682 ± 0.673
5.159LeuGlu: 5.159 ± 0.891
3.14LeuPhe: 3.14 ± 0.55
6.131LeuGly: 6.131 ± 0.602
1.421LeuHis: 1.421 ± 0.292
4.486LeuIle: 4.486 ± 0.657
4.262LeuLys: 4.262 ± 0.551
7.028LeuLeu: 7.028 ± 0.786
1.645LeuMet: 1.645 ± 0.328
4.86LeuAsn: 4.86 ± 0.53
3.364LeuPro: 3.364 ± 0.502
4.411LeuGln: 4.411 ± 0.699
5.458LeuArg: 5.458 ± 0.598
5.084LeuSer: 5.084 ± 0.64
4.262LeuThr: 4.262 ± 0.661
4.486LeuVal: 4.486 ± 0.604
0.598LeuTrp: 0.598 ± 0.194
2.467LeuTyr: 2.467 ± 0.495
0.0LeuXaa: 0.0 ± 0.0
Met
2.393MetAla: 2.393 ± 0.401
0.299MetCys: 0.299 ± 0.145
1.495MetAsp: 1.495 ± 0.338
2.393MetGlu: 2.393 ± 0.413
0.598MetPhe: 0.598 ± 0.214
1.121MetGly: 1.121 ± 0.274
0.374MetHis: 0.374 ± 0.218
0.972MetIle: 0.972 ± 0.302
0.972MetLys: 0.972 ± 0.183
1.421MetLeu: 1.421 ± 0.411
0.673MetMet: 0.673 ± 0.253
1.121MetAsn: 1.121 ± 0.284
1.271MetPro: 1.271 ± 0.398
1.421MetGln: 1.421 ± 0.353
1.421MetArg: 1.421 ± 0.387
1.944MetSer: 1.944 ± 0.433
2.019MetThr: 2.019 ± 0.34
0.523MetVal: 0.523 ± 0.17
0.523MetTrp: 0.523 ± 0.186
0.897MetTyr: 0.897 ± 0.271
0.0MetXaa: 0.0 ± 0.0
Asn
4.112AsnAla: 4.112 ± 0.928
0.299AsnCys: 0.299 ± 0.138
2.467AsnAsp: 2.467 ± 0.456
2.542AsnGlu: 2.542 ± 0.466
2.093AsnPhe: 2.093 ± 0.401
4.561AsnGly: 4.561 ± 0.629
0.822AsnHis: 0.822 ± 0.161
2.168AsnIle: 2.168 ± 0.388
2.467AsnLys: 2.467 ± 0.437
3.664AsnLeu: 3.664 ± 0.403
1.121AsnMet: 1.121 ± 0.275
2.617AsnAsn: 2.617 ± 0.48
3.514AsnPro: 3.514 ± 0.545
1.869AsnGln: 1.869 ± 0.29
3.813AsnArg: 3.813 ± 0.525
2.916AsnSer: 2.916 ± 0.692
2.991AsnThr: 2.991 ± 1.205
2.916AsnVal: 2.916 ± 0.476
0.822AsnTrp: 0.822 ± 0.195
2.093AsnTyr: 2.093 ± 0.367
0.0AsnXaa: 0.0 ± 0.0
Pro
3.738ProAla: 3.738 ± 0.463
0.299ProCys: 0.299 ± 0.152
2.692ProAsp: 2.692 ± 0.398
2.916ProGlu: 2.916 ± 0.475
1.645ProPhe: 1.645 ± 0.351
3.589ProGly: 3.589 ± 0.396
0.748ProHis: 0.748 ± 0.282
2.243ProIle: 2.243 ± 0.475
1.72ProLys: 1.72 ± 0.363
2.318ProLeu: 2.318 ± 0.416
1.196ProMet: 1.196 ± 0.277
1.944ProAsn: 1.944 ± 0.315
1.421ProPro: 1.421 ± 0.295
1.869ProGln: 1.869 ± 0.333
1.72ProArg: 1.72 ± 0.476
3.589ProSer: 3.589 ± 0.484
2.393ProThr: 2.393 ± 0.46
3.439ProVal: 3.439 ± 0.591
1.346ProTrp: 1.346 ± 0.321
1.421ProTyr: 1.421 ± 0.3
0.0ProXaa: 0.0 ± 0.0
Gln
4.486GlnAla: 4.486 ± 0.68
0.224GlnCys: 0.224 ± 0.121
2.916GlnAsp: 2.916 ± 0.466
3.065GlnGlu: 3.065 ± 0.832
2.243GlnPhe: 2.243 ± 0.32
3.065GlnGly: 3.065 ± 0.416
0.822GlnHis: 0.822 ± 0.291
2.542GlnIle: 2.542 ± 0.424
2.617GlnLys: 2.617 ± 0.448
3.364GlnLeu: 3.364 ± 0.461
0.822GlnMet: 0.822 ± 0.235
2.243GlnAsn: 2.243 ± 0.404
1.121GlnPro: 1.121 ± 0.364
3.065GlnGln: 3.065 ± 0.51
1.794GlnArg: 1.794 ± 0.472
2.991GlnSer: 2.991 ± 0.529
3.065GlnThr: 3.065 ± 0.498
3.14GlnVal: 3.14 ± 0.525
0.374GlnTrp: 0.374 ± 0.179
1.794GlnTyr: 1.794 ± 0.332
0.0GlnXaa: 0.0 ± 0.0
Arg
4.336ArgAla: 4.336 ± 0.633
0.299ArgCys: 0.299 ± 0.145
3.29ArgAsp: 3.29 ± 0.625
3.439ArgGlu: 3.439 ± 0.479
1.421ArgPhe: 1.421 ± 0.259
3.589ArgGly: 3.589 ± 0.829
0.748ArgHis: 0.748 ± 0.271
3.664ArgIle: 3.664 ± 0.598
3.664ArgLys: 3.664 ± 0.53
5.682ArgLeu: 5.682 ± 0.509
1.121ArgMet: 1.121 ± 0.218
2.542ArgAsn: 2.542 ± 0.348
2.168ArgPro: 2.168 ± 0.454
2.467ArgGln: 2.467 ± 0.554
2.841ArgArg: 2.841 ± 0.564
4.411ArgSer: 4.411 ± 0.642
3.664ArgThr: 3.664 ± 0.734
3.065ArgVal: 3.065 ± 0.459
0.598ArgTrp: 0.598 ± 0.215
2.093ArgTyr: 2.093 ± 0.436
0.0ArgXaa: 0.0 ± 0.0
Ser
6.505SerAla: 6.505 ± 1.25
0.748SerCys: 0.748 ± 0.274
4.037SerAsp: 4.037 ± 0.587
3.14SerGlu: 3.14 ± 0.467
2.617SerPhe: 2.617 ± 0.436
7.252SerGly: 7.252 ± 1.018
0.598SerHis: 0.598 ± 0.188
3.065SerIle: 3.065 ± 0.501
3.14SerLys: 3.14 ± 0.568
4.785SerLeu: 4.785 ± 0.606
1.271SerMet: 1.271 ± 0.297
3.813SerAsn: 3.813 ± 0.73
2.991SerPro: 2.991 ± 0.406
2.542SerGln: 2.542 ± 0.378
3.738SerArg: 3.738 ± 0.598
4.411SerSer: 4.411 ± 1.053
5.757SerThr: 5.757 ± 1.349
4.411SerVal: 4.411 ± 0.652
0.972SerTrp: 0.972 ± 0.264
2.093SerTyr: 2.093 ± 0.346
0.0SerXaa: 0.0 ± 0.0
Thr
8.224ThrAla: 8.224 ± 2.507
0.598ThrCys: 0.598 ± 0.279
3.664ThrAsp: 3.664 ± 0.51
3.14ThrGlu: 3.14 ± 0.42
2.019ThrPhe: 2.019 ± 0.306
4.561ThrGly: 4.561 ± 1.019
0.523ThrHis: 0.523 ± 0.137
3.29ThrIle: 3.29 ± 0.566
3.813ThrLys: 3.813 ± 0.461
4.486ThrLeu: 4.486 ± 0.505
1.271ThrMet: 1.271 ± 0.251
4.112ThrAsn: 4.112 ± 0.685
4.112ThrPro: 4.112 ± 0.436
2.542ThrGln: 2.542 ± 0.414
2.318ThrArg: 2.318 ± 0.396
4.486ThrSer: 4.486 ± 0.893
6.28ThrThr: 6.28 ± 1.321
4.037ThrVal: 4.037 ± 0.69
0.897ThrTrp: 0.897 ± 0.275
2.617ThrTyr: 2.617 ± 0.391
0.0ThrXaa: 0.0 ± 0.0
Val
7.252ValAla: 7.252 ± 0.96
0.748ValCys: 0.748 ± 0.283
4.71ValAsp: 4.71 ± 0.619
3.664ValGlu: 3.664 ± 0.556
2.393ValPhe: 2.393 ± 0.471
4.636ValGly: 4.636 ± 0.74
0.972ValHis: 0.972 ± 0.202
2.393ValIle: 2.393 ± 0.462
3.29ValLys: 3.29 ± 0.558
4.486ValLeu: 4.486 ± 0.536
1.645ValMet: 1.645 ± 0.335
2.692ValAsn: 2.692 ± 0.427
2.692ValPro: 2.692 ± 0.315
1.794ValGln: 1.794 ± 0.372
2.916ValArg: 2.916 ± 0.489
5.308ValSer: 5.308 ± 0.776
4.262ValThr: 4.262 ± 0.961
4.561ValVal: 4.561 ± 0.582
0.673ValTrp: 0.673 ± 0.253
1.72ValTyr: 1.72 ± 0.324
0.0ValXaa: 0.0 ± 0.0
Trp
1.271TrpAla: 1.271 ± 0.215
0.224TrpCys: 0.224 ± 0.135
1.047TrpAsp: 1.047 ± 0.274
1.196TrpGlu: 1.196 ± 0.307
0.449TrpPhe: 0.449 ± 0.162
0.449TrpGly: 0.449 ± 0.212
0.224TrpHis: 0.224 ± 0.119
0.673TrpIle: 0.673 ± 0.216
0.523TrpLys: 0.523 ± 0.18
1.794TrpLeu: 1.794 ± 0.276
0.224TrpMet: 0.224 ± 0.144
0.598TrpAsn: 0.598 ± 0.216
0.374TrpPro: 0.374 ± 0.162
0.748TrpGln: 0.748 ± 0.228
0.748TrpArg: 0.748 ± 0.225
0.598TrpSer: 0.598 ± 0.175
1.196TrpThr: 1.196 ± 0.287
0.897TrpVal: 0.897 ± 0.252
0.075TrpTrp: 0.075 ± 0.077
0.374TrpTyr: 0.374 ± 0.17
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.841TyrAla: 2.841 ± 0.556
0.299TyrCys: 0.299 ± 0.209
2.318TyrAsp: 2.318 ± 0.464
1.72TyrGlu: 1.72 ± 0.429
0.897TyrPhe: 0.897 ± 0.255
2.841TyrGly: 2.841 ± 0.521
0.598TyrHis: 0.598 ± 0.175
1.72TyrIle: 1.72 ± 0.378
2.542TyrLys: 2.542 ± 0.461
2.841TyrLeu: 2.841 ± 0.433
1.421TyrMet: 1.421 ± 0.293
2.019TyrAsn: 2.019 ± 0.294
1.421TyrPro: 1.421 ± 0.348
1.645TyrGln: 1.645 ± 0.36
2.093TyrArg: 2.093 ± 0.436
2.393TyrSer: 2.393 ± 0.379
2.766TyrThr: 2.766 ± 0.473
2.393TyrVal: 2.393 ± 0.462
0.299TyrTrp: 0.299 ± 0.13
0.673TyrTyr: 0.673 ± 0.193
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 50 proteins (13376 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski