Amino acid dipeptide frequency for Synechococcus phage S-LBS1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
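As a rough illustration of what such a calculation involves, here is a minimal Python sketch. It is not the Proteome-pI implementation: the function name `dipeptide_permille`, the use of one-letter residue codes, the choice to count overlapping pairs within each protein only, and the normalization by the total number of pairs are all assumptions made for this example.

```python
from collections import Counter


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping adjacent pairs are counted within each protein,
    and pairs are never formed across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        # zip(seq, seq[1:]) yields every pair of neighbouring residues.
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the phage proteome).
freqs = dipeptide_permille(["MAAG", "AAK"])
```

In this toy input there are five pairs in total (MA, AA, AG, AA, AK), so `freqs["AA"]` is two fifths of 1000.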

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins with equal probability, each of the 400 would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
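The expected value and the red/blue marking can be sketched as follows. This is only an illustration: the page does not state the exact rule used for colouring (for instance, whether the error estimate is taken into account), so the plain comparison against 2.5‰ and the names `EXPECTED_PERMILLE` and `classify` are assumptions.

```python
N_STANDARD = 20  # standard amino acids; Xaa is left out of the expectation

# 1 / (20 * 20) = 0.0025, i.e. 0.25% or 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / N_STANDARD**2


def classify(observed_permille, expected=EXPECTED_PERMILLE):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if observed_permille > expected:
        return "over"   # would be marked in red
    if observed_permille < expected:
        return "under"  # would be marked in blue
    return "as expected"


# Three values copied from the table below (per mille).
examples = {"AlaAla": 19.527, "CysCys": 0.27, "AspVal": 2.969}
labels = {name: classify(value) for name, value in examples.items()}
```

Under this assumed rule, AlaAla and AspVal count as overrepresented and CysCys as underrepresented.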

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions; for example, 19.527‰ corresponds to 0.019527, or about 1.95%.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 19.527 ± 1.537
AlaCys: 0.81 ± 0.263
AlaAsp: 6.299 ± 0.892
AlaGlu: 7.199 ± 0.864
AlaPhe: 3.869 ± 0.555
AlaGly: 10.168 ± 1.118
AlaHis: 1.26 ± 0.396
AlaIle: 6.479 ± 0.851
AlaLys: 3.779 ± 0.636
AlaLeu: 8.729 ± 1.298
AlaMet: 3.329 ± 0.771
AlaAsn: 3.329 ± 0.625
AlaPro: 5.309 ± 0.639
AlaGln: 4.409 ± 0.697
AlaArg: 6.929 ± 0.853
AlaSer: 7.559 ± 0.866
AlaThr: 7.469 ± 0.876
AlaVal: 8.279 ± 0.889
AlaTrp: 1.62 ± 0.354
AlaTyr: 2.88 ± 0.522
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.81 ± 0.218
CysCys: 0.27 ± 0.228
CysAsp: 0.72 ± 0.272
CysGlu: 0.36 ± 0.202
CysPhe: 0.27 ± 0.15
CysGly: 1.08 ± 0.318
CysHis: 0.36 ± 0.191
CysIle: 0.63 ± 0.251
CysLys: 0.45 ± 0.176
CysLeu: 0.72 ± 0.238
CysMet: 0.09 ± 0.079
CysAsn: 0.45 ± 0.189
CysPro: 1.35 ± 0.336
CysGln: 0.63 ± 0.213
CysArg: 1.44 ± 0.259
CysSer: 1.08 ± 0.357
CysThr: 0.9 ± 0.274
CysVal: 0.54 ± 0.17
CysTrp: 0.09 ± 0.083
CysTyr: 0.18 ± 0.119
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.749 ± 0.726
AspCys: 0.9 ± 0.258
AspAsp: 2.61 ± 0.471
AspGlu: 3.329 ± 0.715
AspPhe: 1.71 ± 0.438
AspGly: 6.029 ± 0.556
AspHis: 1.08 ± 0.381
AspIle: 2.25 ± 0.503
AspLys: 1.17 ± 0.265
AspLeu: 6.569 ± 0.733
AspMet: 0.99 ± 0.301
AspAsn: 1.26 ± 0.455
AspPro: 2.7 ± 0.487
AspGln: 2.52 ± 0.62
AspArg: 2.34 ± 0.368
AspSer: 2.34 ± 0.403
AspThr: 3.059 ± 0.515
AspVal: 2.969 ± 0.422
AspTrp: 0.99 ± 0.271
AspTyr: 0.99 ± 0.306
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.489 ± 0.706
GluCys: 0.99 ± 0.427
GluAsp: 2.61 ± 0.54
GluGlu: 2.79 ± 0.588
GluPhe: 1.62 ± 0.345
GluGly: 2.88 ± 0.521
GluHis: 1.44 ± 0.381
GluIle: 3.419 ± 0.585
GluLys: 1.71 ± 0.455
GluLeu: 7.379 ± 1.02
GluMet: 1.26 ± 0.363
GluAsn: 1.08 ± 0.344
GluPro: 2.7 ± 0.565
GluGln: 4.049 ± 0.627
GluArg: 4.229 ± 0.665
GluSer: 3.239 ± 0.541
GluThr: 2.25 ± 0.443
GluVal: 4.409 ± 0.707
GluTrp: 1.08 ± 0.277
GluTyr: 0.81 ± 0.229
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.149 ± 0.642
PheCys: 0.63 ± 0.292
PheAsp: 2.07 ± 0.453
PheGlu: 2.25 ± 0.483
PhePhe: 0.9 ± 0.23
PheGly: 1.53 ± 0.365
PheHis: 0.18 ± 0.118
PheIle: 1.71 ± 0.329
PheLys: 1.26 ± 0.278
PheLeu: 2.16 ± 0.382
PheMet: 0.45 ± 0.195
PheAsn: 0.81 ± 0.236
PhePro: 1.71 ± 0.377
PheGln: 0.9 ± 0.228
PheArg: 1.44 ± 0.332
PheSer: 3.959 ± 0.663
PheThr: 2.7 ± 0.559
PheVal: 1.98 ± 0.401
PheTrp: 0.72 ± 0.236
PheTyr: 0.18 ± 0.121
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.459 ± 1.327
GlyCys: 1.08 ± 0.276
GlyAsp: 4.409 ± 0.767
GlyGlu: 3.779 ± 0.536
GlyPhe: 2.969 ± 0.486
GlyGly: 7.109 ± 1.182
GlyHis: 0.99 ± 0.301
GlyIle: 4.049 ± 0.698
GlyLys: 3.059 ± 0.677
GlyLeu: 8.279 ± 0.982
GlyMet: 1.71 ± 0.432
GlyAsn: 3.059 ± 0.404
GlyPro: 2.7 ± 0.568
GlyGln: 2.7 ± 0.412
GlyArg: 5.219 ± 0.518
GlySer: 5.399 ± 0.929
GlyThr: 7.109 ± 1.321
GlyVal: 6.209 ± 0.775
GlyTrp: 1.8 ± 0.404
GlyTyr: 2.43 ± 0.445
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.26 ± 0.307
HisCys: 0.45 ± 0.231
HisAsp: 0.9 ± 0.283
HisGlu: 0.72 ± 0.277
HisPhe: 0.45 ± 0.205
HisGly: 0.81 ± 0.303
HisHis: 0.36 ± 0.171
HisIle: 0.81 ± 0.186
HisLys: 0.27 ± 0.175
HisLeu: 1.62 ± 0.429
HisMet: 0.45 ± 0.179
HisAsn: 0.45 ± 0.181
HisPro: 0.9 ± 0.278
HisGln: 1.35 ± 0.397
HisArg: 1.26 ± 0.237
HisSer: 1.53 ± 0.417
HisThr: 0.81 ± 0.248
HisVal: 1.17 ± 0.324
HisTrp: 0.09 ± 0.075
HisTyr: 0.99 ± 0.29
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.039 ± 0.711
IleCys: 0.54 ± 0.157
IleAsp: 3.779 ± 0.606
IleGlu: 4.319 ± 0.703
IlePhe: 0.99 ± 0.321
IleGly: 4.499 ± 0.708
IleHis: 0.27 ± 0.156
IleIle: 2.52 ± 0.585
IleLys: 1.53 ± 0.355
IleLeu: 3.329 ± 0.462
IleMet: 0.81 ± 0.29
IleAsn: 2.52 ± 0.506
IlePro: 3.329 ± 0.548
IleGln: 1.98 ± 0.34
IleArg: 3.239 ± 0.508
IleSer: 3.419 ± 0.55
IleThr: 4.229 ± 0.628
IleVal: 3.239 ± 0.58
IleTrp: 0.81 ± 0.261
IleTyr: 1.53 ± 0.357
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.499 ± 0.767
LysCys: 0.09 ± 0.092
LysAsp: 1.35 ± 0.453
LysGlu: 1.62 ± 0.421
LysPhe: 0.54 ± 0.22
LysGly: 2.25 ± 0.501
LysHis: 0.36 ± 0.185
LysIle: 1.53 ± 0.271
LysLys: 0.81 ± 0.255
LysLeu: 2.88 ± 0.595
LysMet: 0.81 ± 0.228
LysAsn: 1.26 ± 0.299
LysPro: 2.7 ± 0.556
LysGln: 1.35 ± 0.38
LysArg: 2.79 ± 0.526
LysSer: 1.89 ± 0.359
LysThr: 1.89 ± 0.412
LysVal: 1.8 ± 0.385
LysTrp: 0.63 ± 0.276
LysTyr: 0.36 ± 0.178
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 10.798 ± 1.175
LeuCys: 0.63 ± 0.253
LeuAsp: 5.039 ± 0.616
LeuGlu: 6.119 ± 0.876
LeuPhe: 2.7 ± 0.455
LeuGly: 7.649 ± 0.746
LeuHis: 1.8 ± 0.405
LeuIle: 5.849 ± 0.807
LeuLys: 2.34 ± 0.432
LeuLeu: 6.929 ± 0.84
LeuMet: 1.62 ± 0.367
LeuAsn: 2.61 ± 0.612
LeuPro: 4.859 ± 0.591
LeuGln: 4.499 ± 0.632
LeuArg: 5.039 ± 0.511
LeuSer: 5.399 ± 0.782
LeuThr: 6.029 ± 0.586
LeuVal: 6.209 ± 0.77
LeuTrp: 0.81 ± 0.244
LeuTyr: 1.35 ± 0.341
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.969 ± 0.579
MetCys: 0.0 ± 0.0
MetAsp: 0.99 ± 0.305
MetGlu: 1.53 ± 0.425
MetPhe: 0.09 ± 0.102
MetGly: 1.98 ± 0.358
MetHis: 0.36 ± 0.189
MetIle: 1.35 ± 0.299
MetLys: 0.9 ± 0.279
MetLeu: 2.34 ± 0.397
MetMet: 0.09 ± 0.102
MetAsn: 0.45 ± 0.185
MetPro: 1.8 ± 0.441
MetGln: 1.26 ± 0.342
MetArg: 1.17 ± 0.248
MetSer: 1.08 ± 0.237
MetThr: 1.26 ± 0.265
MetVal: 1.08 ± 0.354
MetTrp: 0.27 ± 0.145
MetTyr: 0.27 ± 0.192
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.059 ± 0.639
AsnCys: 0.36 ± 0.154
AsnAsp: 1.71 ± 0.376
AsnGlu: 1.71 ± 0.301
AsnPhe: 0.45 ± 0.202
AsnGly: 3.329 ± 0.617
AsnHis: 0.63 ± 0.196
AsnIle: 1.53 ± 0.408
AsnLys: 0.99 ± 0.301
AsnLeu: 1.35 ± 0.356
AsnMet: 0.45 ± 0.248
AsnAsn: 1.26 ± 0.358
AsnPro: 1.53 ± 0.378
AsnGln: 2.07 ± 0.391
AsnArg: 2.43 ± 0.514
AsnSer: 1.08 ± 0.303
AsnThr: 3.149 ± 0.655
AsnVal: 1.62 ± 0.363
AsnTrp: 0.81 ± 0.332
AsnTyr: 1.08 ± 0.253
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.209 ± 0.781
ProCys: 0.72 ± 0.246
ProAsp: 3.599 ± 0.677
ProGlu: 3.419 ± 0.583
ProPhe: 1.53 ± 0.389
ProGly: 4.499 ± 0.561
ProHis: 0.72 ± 0.215
ProIle: 2.7 ± 0.493
ProLys: 1.8 ± 0.326
ProLeu: 3.239 ± 0.402
ProMet: 0.63 ± 0.214
ProAsn: 0.9 ± 0.304
ProPro: 2.43 ± 0.472
ProGln: 2.34 ± 0.427
ProArg: 2.79 ± 0.565
ProSer: 2.969 ± 0.595
ProThr: 4.589 ± 0.608
ProVal: 3.959 ± 0.573
ProTrp: 0.72 ± 0.252
ProTyr: 1.26 ± 0.307
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 7.019 ± 0.94
GlnCys: 0.54 ± 0.218
GlnAsp: 1.71 ± 0.394
GlnGlu: 2.07 ± 0.493
GlnPhe: 1.44 ± 0.439
GlnGly: 2.61 ± 0.476
GlnHis: 1.89 ± 0.505
GlnIle: 2.43 ± 0.358
GlnLys: 0.63 ± 0.19
GlnLeu: 4.589 ± 0.603
GlnMet: 1.62 ± 0.331
GlnAsn: 0.54 ± 0.176
GlnPro: 2.61 ± 0.681
GlnGln: 3.239 ± 0.455
GlnArg: 2.88 ± 0.585
GlnSer: 2.88 ± 0.513
GlnThr: 3.059 ± 0.386
GlnVal: 2.969 ± 0.477
GlnTrp: 0.99 ± 0.275
GlnTyr: 0.63 ± 0.231
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.119 ± 0.717
ArgCys: 1.35 ± 0.319
ArgAsp: 2.79 ± 0.42
ArgGlu: 3.869 ± 0.78
ArgPhe: 2.7 ± 0.69
ArgGly: 3.599 ± 0.782
ArgHis: 1.53 ± 0.337
ArgIle: 2.16 ± 0.465
ArgLys: 2.88 ± 0.592
ArgLeu: 7.019 ± 0.982
ArgMet: 1.62 ± 0.344
ArgAsn: 1.98 ± 0.393
ArgPro: 2.52 ± 0.508
ArgGln: 3.869 ± 0.605
ArgArg: 5.849 ± 0.964
ArgSer: 4.319 ± 0.729
ArgThr: 2.7 ± 0.42
ArgVal: 4.499 ± 0.603
ArgTrp: 1.8 ± 0.444
ArgTyr: 1.62 ± 0.355
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.649 ± 0.857
SerCys: 0.45 ± 0.243
SerAsp: 3.329 ± 0.553
SerGlu: 2.61 ± 0.453
SerPhe: 2.88 ± 0.638
SerGly: 6.749 ± 0.874
SerHis: 0.9 ± 0.257
SerIle: 2.969 ± 0.599
SerLys: 2.79 ± 0.488
SerLeu: 5.219 ± 0.611
SerMet: 1.71 ± 0.343
SerAsn: 1.98 ± 0.405
SerPro: 2.07 ± 0.443
SerGln: 1.98 ± 0.348
SerArg: 4.589 ± 0.613
SerSer: 4.769 ± 0.816
SerThr: 4.589 ± 0.924
SerVal: 5.579 ± 0.707
SerTrp: 1.98 ± 0.349
SerTyr: 1.98 ± 0.438
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 9.178 ± 0.933
ThrCys: 0.9 ± 0.302
ThrAsp: 2.969 ± 0.659
ThrGlu: 2.88 ± 0.511
ThrPhe: 2.61 ± 0.635
ThrGly: 6.749 ± 1.133
ThrHis: 0.9 ± 0.28
ThrIle: 4.319 ± 0.59
ThrLys: 1.44 ± 0.422
ThrLeu: 6.119 ± 0.52
ThrMet: 1.62 ± 0.392
ThrAsn: 1.89 ± 0.399
ThrPro: 4.769 ± 0.617
ThrGln: 2.52 ± 0.507
ThrArg: 3.419 ± 0.553
ThrSer: 4.769 ± 0.836
ThrThr: 6.119 ± 0.945
ThrVal: 4.319 ± 0.743
ThrTrp: 0.81 ± 0.263
ThrTyr: 1.89 ± 0.625
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.019 ± 0.763
ValCys: 0.81 ± 0.278
ValAsp: 3.779 ± 0.464
ValGlu: 3.509 ± 0.492
ValPhe: 1.8 ± 0.326
ValGly: 5.399 ± 0.722
ValHis: 1.08 ± 0.315
ValIle: 3.239 ± 0.558
ValLys: 2.34 ± 0.378
ValLeu: 6.929 ± 0.824
ValMet: 1.26 ± 0.349
ValAsn: 2.7 ± 0.506
ValPro: 3.869 ± 0.574
ValGln: 2.61 ± 0.401
ValArg: 4.049 ± 0.478
ValSer: 5.849 ± 0.727
ValThr: 5.849 ± 0.874
ValVal: 3.959 ± 0.52
ValTrp: 0.45 ± 0.166
ValTyr: 1.44 ± 0.418
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.8 ± 0.321
TrpCys: 0.63 ± 0.213
TrpAsp: 0.81 ± 0.253
TrpGlu: 0.54 ± 0.225
TrpPhe: 0.72 ± 0.248
TrpGly: 1.26 ± 0.289
TrpHis: 0.18 ± 0.117
TrpIle: 1.08 ± 0.292
TrpLys: 0.36 ± 0.205
TrpLeu: 0.81 ± 0.241
TrpMet: 0.27 ± 0.185
TrpAsn: 0.99 ± 0.246
TrpPro: 0.72 ± 0.217
TrpGln: 0.63 ± 0.219
TrpArg: 1.8 ± 0.335
TrpSer: 1.26 ± 0.32
TrpThr: 1.17 ± 0.351
TrpVal: 1.53 ± 0.362
TrpTrp: 0.27 ± 0.212
TrpTyr: 0.45 ± 0.19
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.52 ± 0.399
TyrCys: 0.27 ± 0.205
TyrAsp: 1.08 ± 0.257
TyrGlu: 0.81 ± 0.272
TyrPhe: 0.45 ± 0.233
TyrGly: 2.34 ± 0.447
TyrHis: 0.36 ± 0.148
TyrIle: 0.9 ± 0.241
TyrLys: 0.9 ± 0.315
TyrLeu: 2.07 ± 0.331
TyrMet: 0.36 ± 0.176
TyrAsn: 1.08 ± 0.352
TyrPro: 0.45 ± 0.205
TyrGln: 1.44 ± 0.38
TyrArg: 1.89 ± 0.397
TyrSer: 1.98 ± 0.546
TyrThr: 1.26 ± 0.371
TyrVal: 1.62 ± 0.443
TyrTrp: 0.54 ± 0.186
TyrTyr: 0.45 ± 0.224
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 52 proteins (11114 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
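A protein-level bootstrap of this kind might look roughly like the sketch below. It rests on several assumptions that the page does not confirm: that each of the 100 rounds resamples whole proteins with replacement, that the reported error is the standard deviation over the rounds, and that pairs are counted within proteins only. The function names `pair_permille` and `bootstrap_error` and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide among all adjacent pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    replicates = []
    for _ in range(n_rounds):
        # Draw whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(pair_permille(sample, pair))
    return statistics.stdev(replicates)


# Toy proteome (hypothetical sequences, one-letter codes).
toy = ["MAAGK", "MKAAL", "MGGSAA", "MLLK"]
err = bootstrap_error(toy, "AA")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so neighbouring residues stay together in every replicate.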

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski