Amino acid dipeptide frequency for Flavobacterium phage vB_FspP_elemoD_13-5B

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred at random with equal probability, each of the 400 would have a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and underrepresented ones are marked in blue.
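As a rough illustration of the counting described above, here is a minimal Python sketch. It assumes one-letter sequences as input, counts overlapping pairs within each protein only, and normalises by the total number of pairs counted. The exact normalisation used by the database is not stated on this page, and the function names and toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids as one-letter codes.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation over the 400 standard dipeptides: 1/400 = 0.25 % = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Overlapping pairs are counted inside each protein, so no pair spans
    two proteins. Any non-standard letter is mapped to 'X' (Xaa).
    Assumption: frequencies are normalised by the total number of pairs.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


def label(freq_per_mille):
    """Classify a frequency relative to the uniform expectation of 2.5 per mille."""
    if freq_per_mille > EXPECTED_PER_MILLE:
        return "over"
    if freq_per_mille < EXPECTED_PER_MILLE:
        return "under"
    return "expected"


# Toy sequences for illustration only, not data from this proteome.
freqs = dipeptide_per_mille(["MKKLE", "AKKE"])
print(len(freqs), round(freqs["KK"], 3), label(freqs["KK"]))
```

With this convention, a table entry such as LysGlu (8.779‰) would be labelled "over" and CysCys (0.052‰) "under".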

All values are given in per mille (‰), so they must be multiplied by 10⁻³ to obtain fractions.
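The unit conversion can be sketched in a few lines of Python. The helper name is hypothetical; the example value is the LysGlu entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


lys_glu = 8.779  # LysGlu entry from the table, in per mille
fraction = per_mille_to_fraction(lys_glu)
percent = fraction * 100
print(f"{lys_glu} per mille = {fraction:.6f} = {percent:.4f} %")
```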

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
2.234AlaAla: 2.234 ± 0.602
0.571AlaCys: 0.571 ± 0.264
2.39AlaAsp: 2.39 ± 0.232
4.312AlaGlu: 4.312 ± 0.675
2.857AlaPhe: 2.857 ± 0.445
2.753AlaGly: 2.753 ± 0.606
0.779AlaHis: 0.779 ± 0.182
5.143AlaIle: 5.143 ± 0.497
4.883AlaLys: 4.883 ± 0.885
4.312AlaLeu: 4.312 ± 0.623
1.143AlaMet: 1.143 ± 0.252
3.688AlaAsn: 3.688 ± 0.631
1.195AlaPro: 1.195 ± 0.274
1.818AlaGln: 1.818 ± 0.403
1.922AlaArg: 1.922 ± 0.319
4.364AlaSer: 4.364 ± 0.735
3.169AlaThr: 3.169 ± 0.569
2.857AlaVal: 2.857 ± 0.538
0.104AlaTrp: 0.104 ± 0.086
1.974AlaTyr: 1.974 ± 0.288
0.0AlaXaa: 0.0 ± 0.0
Cys
0.312CysAla: 0.312 ± 0.191
0.052CysCys: 0.052 ± 0.055
0.623CysAsp: 0.623 ± 0.206
0.727CysGlu: 0.727 ± 0.224
0.416CysPhe: 0.416 ± 0.146
0.468CysGly: 0.468 ± 0.162
0.26CysHis: 0.26 ± 0.137
0.571CysIle: 0.571 ± 0.197
0.468CysLys: 0.468 ± 0.193
0.779CysLeu: 0.779 ± 0.232
0.156CysMet: 0.156 ± 0.081
0.208CysAsn: 0.208 ± 0.113
0.156CysPro: 0.156 ± 0.1
0.156CysGln: 0.156 ± 0.096
0.364CysArg: 0.364 ± 0.157
0.26CysSer: 0.26 ± 0.126
0.468CysThr: 0.468 ± 0.175
0.364CysVal: 0.364 ± 0.138
0.052CysTrp: 0.052 ± 0.062
0.312CysTyr: 0.312 ± 0.141
0.0CysXaa: 0.0 ± 0.0
Asp
4.104AspAla: 4.104 ± 0.671
0.519AspCys: 0.519 ± 0.184
4.416AspAsp: 4.416 ± 0.606
4.208AspGlu: 4.208 ± 0.649
4.883AspPhe: 4.883 ± 0.542
4.571AspGly: 4.571 ± 0.522
0.416AspHis: 0.416 ± 0.17
5.039AspIle: 5.039 ± 0.512
6.909AspLys: 6.909 ± 0.787
5.87AspLeu: 5.87 ± 0.521
1.455AspMet: 1.455 ± 0.329
5.039AspAsn: 5.039 ± 0.485
1.247AspPro: 1.247 ± 0.241
1.506AspGln: 1.506 ± 0.271
2.286AspArg: 2.286 ± 0.349
4.675AspSer: 4.675 ± 0.488
3.844AspThr: 3.844 ± 0.523
3.74AspVal: 3.74 ± 0.357
0.883AspTrp: 0.883 ± 0.27
3.013AspTyr: 3.013 ± 0.612
0.0AspXaa: 0.0 ± 0.0
Glu
3.844GluAla: 3.844 ± 0.601
0.26GluCys: 0.26 ± 0.125
5.662GluAsp: 5.662 ± 0.461
6.286GluGlu: 6.286 ± 0.844
4.416GluPhe: 4.416 ± 0.581
4.416GluGly: 4.416 ± 0.485
0.831GluHis: 0.831 ± 0.252
6.078GluIle: 6.078 ± 0.493
5.558GluLys: 5.558 ± 0.9
6.701GluLeu: 6.701 ± 0.722
1.61GluMet: 1.61 ± 0.265
4.364GluAsn: 4.364 ± 0.722
1.299GluPro: 1.299 ± 0.318
2.442GluGln: 2.442 ± 0.434
2.545GluArg: 2.545 ± 0.333
5.455GluSer: 5.455 ± 0.557
3.273GluThr: 3.273 ± 0.418
5.195GluVal: 5.195 ± 0.485
0.883GluTrp: 0.883 ± 0.198
3.584GluTyr: 3.584 ± 0.423
0.0GluXaa: 0.0 ± 0.0
Phe
1.87PheAla: 1.87 ± 0.3
0.468PheCys: 0.468 ± 0.185
4.052PheAsp: 4.052 ± 0.49
4.104PheGlu: 4.104 ± 0.442
1.455PhePhe: 1.455 ± 0.339
2.753PheGly: 2.753 ± 0.401
0.883PheHis: 0.883 ± 0.214
3.377PheIle: 3.377 ± 0.429
4.208PheLys: 4.208 ± 0.564
3.325PheLeu: 3.325 ± 0.604
1.247PheMet: 1.247 ± 0.28
4.208PheAsn: 4.208 ± 0.369
0.831PhePro: 0.831 ± 0.147
1.143PheGln: 1.143 ± 0.29
1.091PheArg: 1.091 ± 0.298
3.481PheSer: 3.481 ± 0.38
2.494PheThr: 2.494 ± 0.403
2.39PheVal: 2.39 ± 0.535
0.416PheTrp: 0.416 ± 0.166
1.974PheTyr: 1.974 ± 0.474
0.0PheXaa: 0.0 ± 0.0
Gly
2.545GlyAla: 2.545 ± 0.47
0.312GlyCys: 0.312 ± 0.132
3.584GlyAsp: 3.584 ± 0.412
3.532GlyGlu: 3.532 ± 0.382
2.857GlyPhe: 2.857 ± 0.413
3.948GlyGly: 3.948 ± 0.57
0.623GlyHis: 0.623 ± 0.235
5.091GlyIle: 5.091 ± 0.634
4.831GlyLys: 4.831 ± 0.601
5.091GlyLeu: 5.091 ± 0.615
1.403GlyMet: 1.403 ± 0.339
3.429GlyAsn: 3.429 ± 0.597
0.987GlyPro: 0.987 ± 0.513
1.61GlyGln: 1.61 ± 0.253
2.39GlyArg: 2.39 ± 0.334
4.0GlySer: 4.0 ± 0.663
4.104GlyThr: 4.104 ± 0.673
4.519GlyVal: 4.519 ± 0.63
0.519GlyTrp: 0.519 ± 0.151
3.325GlyTyr: 3.325 ± 0.357
0.0GlyXaa: 0.0 ± 0.0
His
0.519HisAla: 0.519 ± 0.17
0.312HisCys: 0.312 ± 0.128
0.779HisAsp: 0.779 ± 0.252
0.416HisGlu: 0.416 ± 0.139
0.571HisPhe: 0.571 ± 0.197
0.468HisGly: 0.468 ± 0.135
0.416HisHis: 0.416 ± 0.177
1.091HisIle: 1.091 ± 0.257
1.403HisLys: 1.403 ± 0.364
0.675HisLeu: 0.675 ± 0.194
0.052HisMet: 0.052 ± 0.052
0.779HisAsn: 0.779 ± 0.223
0.571HisPro: 0.571 ± 0.187
0.623HisGln: 0.623 ± 0.162
0.416HisArg: 0.416 ± 0.174
0.831HisSer: 0.831 ± 0.195
0.727HisThr: 0.727 ± 0.198
0.26HisVal: 0.26 ± 0.146
0.156HisTrp: 0.156 ± 0.103
0.623HisTyr: 0.623 ± 0.214
0.0HisXaa: 0.0 ± 0.0
Ile
4.312IleAla: 4.312 ± 0.603
0.312IleCys: 0.312 ± 0.129
6.961IleAsp: 6.961 ± 0.581
5.558IleGlu: 5.558 ± 0.559
2.701IlePhe: 2.701 ± 0.356
3.636IleGly: 3.636 ± 0.376
1.247IleHis: 1.247 ± 0.251
5.403IleIle: 5.403 ± 0.537
7.844IleLys: 7.844 ± 0.797
4.935IleLeu: 4.935 ± 0.53
1.299IleMet: 1.299 ± 0.358
6.442IleAsn: 6.442 ± 0.582
1.818IlePro: 1.818 ± 0.353
2.701IleGln: 2.701 ± 0.391
2.338IleArg: 2.338 ± 0.386
6.13IleSer: 6.13 ± 0.565
4.883IleThr: 4.883 ± 0.605
5.247IleVal: 5.247 ± 0.708
0.468IleTrp: 0.468 ± 0.156
2.909IleTyr: 2.909 ± 0.574
0.0IleXaa: 0.0 ± 0.0
Lys
5.506LysAla: 5.506 ± 0.867
0.571LysCys: 0.571 ± 0.214
6.701LysAsp: 6.701 ± 0.664
8.779LysGlu: 8.779 ± 1.249
3.117LysPhe: 3.117 ± 0.424
4.883LysGly: 4.883 ± 0.512
0.935LysHis: 0.935 ± 0.215
7.013LysIle: 7.013 ± 0.887
8.312LysLys: 8.312 ± 1.16
7.429LysLeu: 7.429 ± 0.964
2.805LysMet: 2.805 ± 0.577
6.13LysAsn: 6.13 ± 0.612
3.169LysPro: 3.169 ± 0.475
2.649LysGln: 2.649 ± 0.6
3.377LysArg: 3.377 ± 0.579
5.558LysSer: 5.558 ± 0.587
4.779LysThr: 4.779 ± 0.591
6.13LysVal: 6.13 ± 0.696
0.623LysTrp: 0.623 ± 0.181
4.675LysTyr: 4.675 ± 0.471
0.0LysXaa: 0.0 ± 0.0
Leu
4.104LeuAla: 4.104 ± 0.491
0.623LeuCys: 0.623 ± 0.208
5.558LeuAsp: 5.558 ± 0.598
5.922LeuGlu: 5.922 ± 0.552
3.377LeuPhe: 3.377 ± 0.4
4.052LeuGly: 4.052 ± 0.564
0.883LeuHis: 0.883 ± 0.204
5.506LeuIle: 5.506 ± 0.595
7.325LeuLys: 7.325 ± 0.798
6.338LeuLeu: 6.338 ± 0.767
1.818LeuMet: 1.818 ± 0.428
6.494LeuAsn: 6.494 ± 0.547
3.065LeuPro: 3.065 ± 0.331
2.701LeuGln: 2.701 ± 0.472
1.87LeuArg: 1.87 ± 0.331
6.442LeuSer: 6.442 ± 0.448
4.416LeuThr: 4.416 ± 0.514
4.0LeuVal: 4.0 ± 0.484
0.779LeuTrp: 0.779 ± 0.251
3.065LeuTyr: 3.065 ± 0.408
0.0LeuXaa: 0.0 ± 0.0
Met
2.182MetAla: 2.182 ± 0.333
0.156MetCys: 0.156 ± 0.1
0.571MetAsp: 0.571 ± 0.183
1.61MetGlu: 1.61 ± 0.312
0.779MetPhe: 0.779 ± 0.201
0.831MetGly: 0.831 ± 0.236
0.26MetHis: 0.26 ± 0.127
1.455MetIle: 1.455 ± 0.317
2.753MetLys: 2.753 ± 0.506
1.299MetLeu: 1.299 ± 0.233
0.519MetMet: 0.519 ± 0.213
2.078MetAsn: 2.078 ± 0.377
0.468MetPro: 0.468 ± 0.13
0.935MetGln: 0.935 ± 0.283
1.039MetArg: 1.039 ± 0.323
1.455MetSer: 1.455 ± 0.254
1.195MetThr: 1.195 ± 0.286
1.195MetVal: 1.195 ± 0.222
0.208MetTrp: 0.208 ± 0.118
1.403MetTyr: 1.403 ± 0.365
0.0MetXaa: 0.0 ± 0.0
Asn
4.312AsnAla: 4.312 ± 0.672
0.623AsnCys: 0.623 ± 0.19
4.416AsnAsp: 4.416 ± 0.51
4.26AsnGlu: 4.26 ± 0.449
3.117AsnPhe: 3.117 ± 0.529
4.935AsnGly: 4.935 ± 0.463
0.675AsnHis: 0.675 ± 0.152
5.143AsnIle: 5.143 ± 0.512
7.74AsnLys: 7.74 ± 0.965
4.831AsnLeu: 4.831 ± 0.691
1.714AsnMet: 1.714 ± 0.378
5.662AsnAsn: 5.662 ± 0.54
2.39AsnPro: 2.39 ± 0.449
2.961AsnGln: 2.961 ± 0.413
3.584AsnArg: 3.584 ± 0.411
4.468AsnSer: 4.468 ± 0.436
3.636AsnThr: 3.636 ± 0.413
4.26AsnVal: 4.26 ± 0.568
0.519AsnTrp: 0.519 ± 0.186
2.909AsnTyr: 2.909 ± 0.426
0.0AsnXaa: 0.0 ± 0.0
Pro
0.831ProAla: 0.831 ± 0.291
0.104ProCys: 0.104 ± 0.081
1.662ProAsp: 1.662 ± 0.31
2.182ProGlu: 2.182 ± 0.372
1.247ProPhe: 1.247 ± 0.264
1.039ProGly: 1.039 ± 0.231
0.208ProHis: 0.208 ± 0.106
2.234ProIle: 2.234 ± 0.344
1.818ProLys: 1.818 ± 0.386
1.974ProLeu: 1.974 ± 0.374
0.831ProMet: 0.831 ± 0.225
2.13ProAsn: 2.13 ± 0.342
0.623ProPro: 0.623 ± 0.185
0.779ProGln: 0.779 ± 0.221
1.143ProArg: 1.143 ± 0.267
2.597ProSer: 2.597 ± 0.398
2.753ProThr: 2.753 ± 0.352
1.558ProVal: 1.558 ± 0.292
0.052ProTrp: 0.052 ± 0.056
1.143ProTyr: 1.143 ± 0.245
0.0ProXaa: 0.0 ± 0.0
Gln
2.182GlnAla: 2.182 ± 0.462
0.052GlnCys: 0.052 ± 0.064
2.182GlnAsp: 2.182 ± 0.345
2.649GlnGlu: 2.649 ± 0.486
0.831GlnPhe: 0.831 ± 0.183
1.714GlnGly: 1.714 ± 0.466
0.26GlnHis: 0.26 ± 0.106
2.182GlnIle: 2.182 ± 0.312
2.701GlnLys: 2.701 ± 0.566
2.078GlnLeu: 2.078 ± 0.424
1.091GlnMet: 1.091 ± 0.295
1.61GlnAsn: 1.61 ± 0.239
0.779GlnPro: 0.779 ± 0.178
1.039GlnGln: 1.039 ± 0.306
1.091GlnArg: 1.091 ± 0.347
2.857GlnSer: 2.857 ± 0.519
1.506GlnThr: 1.506 ± 0.246
1.818GlnVal: 1.818 ± 0.315
0.364GlnTrp: 0.364 ± 0.153
1.403GlnTyr: 1.403 ± 0.31
0.0GlnXaa: 0.0 ± 0.0
Arg
1.974ArgAla: 1.974 ± 0.317
0.416ArgCys: 0.416 ± 0.172
2.234ArgAsp: 2.234 ± 0.378
2.39ArgGlu: 2.39 ± 0.348
1.558ArgPhe: 1.558 ± 0.259
2.805ArgGly: 2.805 ± 0.472
0.831ArgHis: 0.831 ± 0.26
2.597ArgIle: 2.597 ± 0.331
3.117ArgLys: 3.117 ± 0.477
3.273ArgLeu: 3.273 ± 0.379
0.883ArgMet: 0.883 ± 0.258
1.61ArgAsn: 1.61 ± 0.295
0.987ArgPro: 0.987 ± 0.232
0.675ArgGln: 0.675 ± 0.242
1.506ArgArg: 1.506 ± 0.318
1.818ArgSer: 1.818 ± 0.334
1.195ArgThr: 1.195 ± 0.314
2.494ArgVal: 2.494 ± 0.346
0.519ArgTrp: 0.519 ± 0.177
1.818ArgTyr: 1.818 ± 0.389
0.0ArgXaa: 0.0 ± 0.0
Ser
2.909SerAla: 2.909 ± 0.4
0.519SerCys: 0.519 ± 0.192
5.247SerAsp: 5.247 ± 0.516
5.403SerGlu: 5.403 ± 0.544
3.221SerPhe: 3.221 ± 0.549
5.195SerGly: 5.195 ± 0.713
0.416SerHis: 0.416 ± 0.162
5.766SerIle: 5.766 ± 0.628
7.273SerLys: 7.273 ± 1.029
5.351SerLeu: 5.351 ± 0.529
1.247SerMet: 1.247 ± 0.289
5.818SerAsn: 5.818 ± 0.466
1.818SerPro: 1.818 ± 0.386
1.766SerGln: 1.766 ± 0.233
2.286SerArg: 2.286 ± 0.363
5.247SerSer: 5.247 ± 0.663
3.584SerThr: 3.584 ± 0.612
5.039SerVal: 5.039 ± 0.488
0.779SerTrp: 0.779 ± 0.25
3.584SerTyr: 3.584 ± 0.408
0.0SerXaa: 0.0 ± 0.0
Thr
2.909ThrAla: 2.909 ± 0.517
0.26ThrCys: 0.26 ± 0.113
3.844ThrAsp: 3.844 ± 0.437
4.156ThrGlu: 4.156 ± 0.44
2.234ThrPhe: 2.234 ± 0.396
3.221ThrGly: 3.221 ± 0.399
0.571ThrHis: 0.571 ± 0.203
4.312ThrIle: 4.312 ± 0.676
5.61ThrLys: 5.61 ± 0.716
4.831ThrLeu: 4.831 ± 0.714
0.831ThrMet: 0.831 ± 0.232
3.584ThrAsn: 3.584 ± 0.336
2.805ThrPro: 2.805 ± 0.393
1.714ThrGln: 1.714 ± 0.353
1.455ThrArg: 1.455 ± 0.277
4.364ThrSer: 4.364 ± 0.569
4.727ThrThr: 4.727 ± 0.621
2.494ThrVal: 2.494 ± 0.445
0.571ThrTrp: 0.571 ± 0.18
3.013ThrTyr: 3.013 ± 0.646
0.0ThrXaa: 0.0 ± 0.0
Val
3.273ValAla: 3.273 ± 0.49
0.519ValCys: 0.519 ± 0.16
4.0ValAsp: 4.0 ± 0.422
4.727ValGlu: 4.727 ± 0.516
2.545ValPhe: 2.545 ± 0.396
3.74ValGly: 3.74 ± 0.482
0.571ValHis: 0.571 ± 0.179
5.039ValIle: 5.039 ± 0.597
5.662ValLys: 5.662 ± 0.584
4.935ValLeu: 4.935 ± 0.479
0.883ValMet: 0.883 ± 0.202
4.104ValAsn: 4.104 ± 0.543
1.558ValPro: 1.558 ± 0.357
1.403ValGln: 1.403 ± 0.2
2.13ValArg: 2.13 ± 0.355
4.779ValSer: 4.779 ± 0.535
3.325ValThr: 3.325 ± 0.566
3.948ValVal: 3.948 ± 0.526
0.623ValTrp: 0.623 ± 0.239
2.805ValTyr: 2.805 ± 0.537
0.0ValXaa: 0.0 ± 0.0
Trp
0.416TrpAla: 0.416 ± 0.137
0.052TrpCys: 0.052 ± 0.059
0.468TrpAsp: 0.468 ± 0.185
0.571TrpGlu: 0.571 ± 0.161
0.727TrpPhe: 0.727 ± 0.24
0.571TrpGly: 0.571 ± 0.222
0.052TrpHis: 0.052 ± 0.052
0.416TrpIle: 0.416 ± 0.165
0.727TrpLys: 0.727 ± 0.219
0.779TrpLeu: 0.779 ± 0.176
0.364TrpMet: 0.364 ± 0.148
0.831TrpAsn: 0.831 ± 0.205
0.0TrpPro: 0.0 ± 0.0
0.468TrpGln: 0.468 ± 0.183
0.571TrpArg: 0.571 ± 0.156
0.779TrpSer: 0.779 ± 0.262
0.364TrpThr: 0.364 ± 0.165
0.571TrpVal: 0.571 ± 0.194
0.104TrpTrp: 0.104 ± 0.085
0.312TrpTyr: 0.312 ± 0.173
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.13TyrAla: 2.13 ± 0.272
0.468TyrCys: 0.468 ± 0.171
3.273TyrAsp: 3.273 ± 0.55
2.909TyrGlu: 2.909 ± 0.419
2.857TyrPhe: 2.857 ± 0.563
2.701TyrGly: 2.701 ± 0.582
0.571TyrHis: 0.571 ± 0.181
3.688TyrIle: 3.688 ± 0.483
4.0TyrLys: 4.0 ± 0.462
3.584TyrLeu: 3.584 ± 0.538
0.987TyrMet: 0.987 ± 0.2
3.948TyrAsn: 3.948 ± 0.574
1.143TyrPro: 1.143 ± 0.214
1.299TyrGln: 1.299 ± 0.23
1.299TyrArg: 1.299 ± 0.269
2.909TyrSer: 2.909 ± 0.507
3.065TyrThr: 3.065 ± 0.49
2.494TyrVal: 2.494 ± 0.452
0.519TyrTrp: 0.519 ± 0.162
2.182TyrTyr: 2.182 ± 0.347
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 91 proteins (19251 amino acids)
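In this text form, each table cell appears as the displayed value followed directly by the pair label and "mean ± error" (for example `2.234AlaAla: 2.234 ± 0.602`), with the row label on its own line. A minimal Python sketch for reading such lines back into structured values is shown here; the regular expression and function name are assumptions based only on the layout visible on this page.

```python
import re

# One cell as it appears in this text dump:
# displayed value, first residue, second residue, then "mean ± error".
CELL = re.compile(
    r"^(?P<shown>\d+(?:\.\d+)?)"
    r"(?P<first>[A-Z][a-z]{2})(?P<second>[A-Z][a-z]{2}):\s*"
    r"(?P<mean>\d+(?:\.\d+)?)\s*±\s*(?P<err>\d+(?:\.\d+)?)$"
)


def parse_cell(line):
    """Return (first, second, mean, error) in per mille, or None for non-cell lines."""
    m = CELL.match(line.strip())
    if m is None:
        return None  # e.g. a row label such as "Ala"
    return m["first"], m["second"], float(m["mean"]), float(m["err"])


sample = ["Ala", "2.234AlaAla: 2.234 ± 0.602", "0.571AlaCys: 0.571 ± 0.264"]
cells = [c for c in map(parse_cell, sample) if c is not None]
print(cells)
```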

Note: The error was estimated by bootstrapping (×100) at the protein level.
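Bootstrapping at the protein level can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is taken as the error. This is a minimal Python sketch under assumptions: the function names are hypothetical, the standard deviation is used as the spread measure, and the database's actual procedure beyond "×100 at the protein level" is not described on this page.

```python
import random
from statistics import stdev


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes, e.g. 'KK') in per mille."""
    count = total = 0
    for seq in proteins:
        n = len(seq) - 1  # number of overlapping pairs in this protein
        if n <= 0:
            continue
        total += n
        count += sum(1 for i in range(n) if seq[i:i + 2] == pair)
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the per mille frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return stdev(estimates)


# Toy sequences for illustration only, not data from this proteome.
toy = ["MKKLE", "AKKE", "MSTNL", "KKKA"]
print(pair_per_mille(toy, "KK"), bootstrap_error(toy, "KK"))
```

Resampling proteins rather than individual residues keeps the pairs inside each protein intact, which is one plausible reason for bootstrapping at this level.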

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski