Amino acid dipepetide frequency for Flavobacterium phage vB_FspS_snusmum6-3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.886AlaAla: 0.886 ± 0.395
0.564AlaCys: 0.564 ± 0.217
1.933AlaAsp: 1.933 ± 0.359
3.221AlaGlu: 3.221 ± 0.669
2.658AlaPhe: 2.658 ± 0.376
1.933AlaGly: 1.933 ± 0.378
0.403AlaHis: 0.403 ± 0.193
4.268AlaIle: 4.268 ± 0.547
5.879AlaLys: 5.879 ± 0.735
4.993AlaLeu: 4.993 ± 0.539
1.53AlaMet: 1.53 ± 0.502
4.832AlaAsn: 4.832 ± 0.799
0.886AlaPro: 0.886 ± 0.266
1.53AlaGln: 1.53 ± 0.322
1.369AlaArg: 1.369 ± 0.349
2.819AlaSer: 2.819 ± 0.443
4.107AlaThr: 4.107 ± 0.511
3.382AlaVal: 3.382 ± 0.557
0.725AlaTrp: 0.725 ± 0.224
1.45AlaTyr: 1.45 ± 0.275
0.0AlaXaa: 0.0 ± 0.0
Cys
0.242CysAla: 0.242 ± 0.141
0.242CysCys: 0.242 ± 0.139
0.564CysAsp: 0.564 ± 0.2
1.208CysGlu: 1.208 ± 0.35
0.886CysPhe: 0.886 ± 0.211
1.208CysGly: 1.208 ± 0.408
0.161CysHis: 0.161 ± 0.117
0.805CysIle: 0.805 ± 0.258
0.966CysLys: 0.966 ± 0.335
1.127CysLeu: 1.127 ± 0.337
0.081CysMet: 0.081 ± 0.085
0.564CysAsn: 0.564 ± 0.318
0.483CysPro: 0.483 ± 0.173
0.322CysGln: 0.322 ± 0.18
0.403CysArg: 0.403 ± 0.156
0.483CysSer: 0.483 ± 0.19
0.805CysThr: 0.805 ± 0.214
0.805CysVal: 0.805 ± 0.222
0.081CysTrp: 0.081 ± 0.091
0.403CysTyr: 0.403 ± 0.184
0.0CysXaa: 0.0 ± 0.0
Asp
3.544AspAla: 3.544 ± 0.456
0.886AspCys: 0.886 ± 0.227
2.819AspAsp: 2.819 ± 0.594
4.268AspGlu: 4.268 ± 0.665
4.429AspPhe: 4.429 ± 0.519
2.819AspGly: 2.819 ± 0.485
0.725AspHis: 0.725 ± 0.198
4.429AspIle: 4.429 ± 0.623
4.913AspLys: 4.913 ± 0.606
4.913AspLeu: 4.913 ± 0.623
1.208AspMet: 1.208 ± 0.367
3.785AspAsn: 3.785 ± 0.585
0.483AspPro: 0.483 ± 0.184
0.644AspGln: 0.644 ± 0.307
1.047AspArg: 1.047 ± 0.3
4.59AspSer: 4.59 ± 0.414
3.463AspThr: 3.463 ± 0.581
3.705AspVal: 3.705 ± 0.64
0.483AspTrp: 0.483 ± 0.153
2.98AspTyr: 2.98 ± 0.538
0.0AspXaa: 0.0 ± 0.0
Glu
3.946GluAla: 3.946 ± 0.688
0.725GluCys: 0.725 ± 0.307
2.98GluAsp: 2.98 ± 0.665
3.866GluGlu: 3.866 ± 0.591
3.866GluPhe: 3.866 ± 0.535
1.53GluGly: 1.53 ± 0.303
1.289GluHis: 1.289 ± 0.297
7.49GluIle: 7.49 ± 0.742
6.604GluLys: 6.604 ± 0.821
7.651GluLeu: 7.651 ± 0.876
1.852GluMet: 1.852 ± 0.378
5.718GluAsn: 5.718 ± 0.731
1.772GluPro: 1.772 ± 0.355
3.302GluGln: 3.302 ± 0.486
2.255GluArg: 2.255 ± 0.579
4.188GluSer: 4.188 ± 0.516
3.866GluThr: 3.866 ± 0.63
3.382GluVal: 3.382 ± 0.452
0.403GluTrp: 0.403 ± 0.177
2.899GluTyr: 2.899 ± 0.468
0.0GluXaa: 0.0 ± 0.0
Phe
2.577PheAla: 2.577 ± 0.444
0.483PheCys: 0.483 ± 0.172
4.107PheAsp: 4.107 ± 0.438
2.98PheGlu: 2.98 ± 0.425
2.738PhePhe: 2.738 ± 0.433
3.06PheGly: 3.06 ± 0.501
0.322PheHis: 0.322 ± 0.164
3.785PheIle: 3.785 ± 0.644
4.993PheLys: 4.993 ± 0.585
3.705PheLeu: 3.705 ± 0.549
0.966PheMet: 0.966 ± 0.238
4.59PheAsn: 4.59 ± 0.717
1.208PhePro: 1.208 ± 0.272
2.094PheGln: 2.094 ± 0.412
0.805PheArg: 0.805 ± 0.264
3.705PheSer: 3.705 ± 0.554
4.913PheThr: 4.913 ± 0.755
2.738PheVal: 2.738 ± 0.456
0.322PheTrp: 0.322 ± 0.168
1.772PheTyr: 1.772 ± 0.377
0.0PheXaa: 0.0 ± 0.0
Gly
2.98GlyAla: 2.98 ± 0.652
0.644GlyCys: 0.644 ± 0.219
1.852GlyAsp: 1.852 ± 0.403
2.094GlyGlu: 2.094 ± 0.384
2.658GlyPhe: 2.658 ± 0.595
1.691GlyGly: 1.691 ± 0.39
0.483GlyHis: 0.483 ± 0.224
4.027GlyIle: 4.027 ± 0.412
3.785GlyLys: 3.785 ± 0.542
3.946GlyLeu: 3.946 ± 0.446
0.886GlyMet: 0.886 ± 0.225
4.027GlyAsn: 4.027 ± 0.495
0.081GlyPro: 0.081 ± 0.085
1.772GlyGln: 1.772 ± 0.359
1.772GlyArg: 1.772 ± 0.391
3.06GlySer: 3.06 ± 0.557
3.946GlyThr: 3.946 ± 0.635
2.577GlyVal: 2.577 ± 0.425
0.161GlyTrp: 0.161 ± 0.103
2.336GlyTyr: 2.336 ± 0.4
0.0GlyXaa: 0.0 ± 0.0
His
0.322HisAla: 0.322 ± 0.146
0.403HisCys: 0.403 ± 0.176
0.886HisAsp: 0.886 ± 0.26
0.886HisGlu: 0.886 ± 0.281
1.127HisPhe: 1.127 ± 0.326
0.564HisGly: 0.564 ± 0.222
0.483HisHis: 0.483 ± 0.252
1.047HisIle: 1.047 ± 0.297
1.289HisLys: 1.289 ± 0.344
1.289HisLeu: 1.289 ± 0.295
0.0HisMet: 0.0 ± 0.0
1.208HisAsn: 1.208 ± 0.331
0.403HisPro: 0.403 ± 0.17
0.483HisGln: 0.483 ± 0.183
0.644HisArg: 0.644 ± 0.226
1.127HisSer: 1.127 ± 0.344
1.047HisThr: 1.047 ± 0.229
0.725HisVal: 0.725 ± 0.272
0.081HisTrp: 0.081 ± 0.074
1.047HisTyr: 1.047 ± 0.279
0.0HisXaa: 0.0 ± 0.0
Ile
3.866IleAla: 3.866 ± 0.621
1.289IleCys: 1.289 ± 0.414
6.845IleAsp: 6.845 ± 0.883
7.973IleGlu: 7.973 ± 0.857
3.705IlePhe: 3.705 ± 0.507
3.624IleGly: 3.624 ± 0.738
1.369IleHis: 1.369 ± 0.399
6.684IleIle: 6.684 ± 0.978
8.053IleLys: 8.053 ± 0.778
8.456IleLeu: 8.456 ± 0.972
1.611IleMet: 1.611 ± 0.368
6.523IleAsn: 6.523 ± 0.746
2.013IlePro: 2.013 ± 0.397
3.544IleGln: 3.544 ± 0.595
2.336IleArg: 2.336 ± 0.417
5.557IleSer: 5.557 ± 0.761
5.154IleThr: 5.154 ± 0.804
4.51IleVal: 4.51 ± 0.624
0.644IleTrp: 0.644 ± 0.255
3.141IleTyr: 3.141 ± 0.526
0.0IleXaa: 0.0 ± 0.0
Lys
5.557LysAla: 5.557 ± 0.753
1.047LysCys: 1.047 ± 0.346
5.557LysAsp: 5.557 ± 0.895
7.731LysGlu: 7.731 ± 1.001
3.785LysPhe: 3.785 ± 0.447
4.107LysGly: 4.107 ± 0.447
1.772LysHis: 1.772 ± 0.407
9.1LysIle: 9.1 ± 0.916
8.376LysLys: 8.376 ± 1.058
7.168LysLeu: 7.168 ± 0.825
3.382LysMet: 3.382 ± 0.581
6.523LysAsn: 6.523 ± 0.753
2.577LysPro: 2.577 ± 0.639
4.671LysGln: 4.671 ± 0.726
3.463LysArg: 3.463 ± 0.63
4.752LysSer: 4.752 ± 0.547
5.799LysThr: 5.799 ± 0.841
5.235LysVal: 5.235 ± 0.648
1.127LysTrp: 1.127 ± 0.283
4.913LysTyr: 4.913 ± 0.672
0.0LysXaa: 0.0 ± 0.0
Leu
3.785LeuAla: 3.785 ± 0.664
1.127LeuCys: 1.127 ± 0.36
5.235LeuAsp: 5.235 ± 0.543
6.04LeuGlu: 6.04 ± 0.93
3.866LeuPhe: 3.866 ± 0.491
3.382LeuGly: 3.382 ± 0.555
0.966LeuHis: 0.966 ± 0.297
8.698LeuIle: 8.698 ± 1.064
8.537LeuLys: 8.537 ± 0.781
6.684LeuLeu: 6.684 ± 0.732
2.174LeuMet: 2.174 ± 0.489
8.053LeuAsn: 8.053 ± 0.727
3.463LeuPro: 3.463 ± 0.523
4.107LeuGln: 4.107 ± 0.532
2.658LeuArg: 2.658 ± 0.482
5.476LeuSer: 5.476 ± 0.591
5.96LeuThr: 5.96 ± 0.97
4.429LeuVal: 4.429 ± 0.626
1.45LeuTrp: 1.45 ± 0.368
3.221LeuTyr: 3.221 ± 0.511
0.0LeuXaa: 0.0 ± 0.0
Met
1.611MetAla: 1.611 ± 0.288
0.081MetCys: 0.081 ± 0.084
0.886MetAsp: 0.886 ± 0.277
2.094MetGlu: 2.094 ± 0.429
0.886MetPhe: 0.886 ± 0.281
0.886MetGly: 0.886 ± 0.277
0.483MetHis: 0.483 ± 0.213
1.691MetIle: 1.691 ± 0.377
2.819MetLys: 2.819 ± 0.626
1.852MetLeu: 1.852 ± 0.363
0.161MetMet: 0.161 ± 0.098
1.208MetAsn: 1.208 ± 0.355
1.45MetPro: 1.45 ± 0.37
0.886MetGln: 0.886 ± 0.294
0.966MetArg: 0.966 ± 0.237
1.772MetSer: 1.772 ± 0.365
0.966MetThr: 0.966 ± 0.292
1.127MetVal: 1.127 ± 0.254
0.403MetTrp: 0.403 ± 0.175
0.403MetTyr: 0.403 ± 0.162
0.0MetXaa: 0.0 ± 0.0
Asn
4.429AsnAla: 4.429 ± 0.659
0.805AsnCys: 0.805 ± 0.199
4.59AsnAsp: 4.59 ± 0.59
6.121AsnGlu: 6.121 ± 0.77
3.866AsnPhe: 3.866 ± 0.565
3.866AsnGly: 3.866 ± 0.72
1.047AsnHis: 1.047 ± 0.339
5.396AsnIle: 5.396 ± 0.638
8.859AsnLys: 8.859 ± 1.254
7.168AsnLeu: 7.168 ± 1.035
1.047AsnMet: 1.047 ± 0.285
5.396AsnAsn: 5.396 ± 1.054
2.336AsnPro: 2.336 ± 0.464
2.416AsnGln: 2.416 ± 0.47
2.819AsnArg: 2.819 ± 0.531
4.993AsnSer: 4.993 ± 0.653
4.349AsnThr: 4.349 ± 0.594
5.154AsnVal: 5.154 ± 0.627
0.886AsnTrp: 0.886 ± 0.326
4.349AsnTyr: 4.349 ± 0.424
0.0AsnXaa: 0.0 ± 0.0
Pro
0.886ProAla: 0.886 ± 0.274
0.403ProCys: 0.403 ± 0.15
1.289ProAsp: 1.289 ± 0.263
1.772ProGlu: 1.772 ± 0.354
1.611ProPhe: 1.611 ± 0.362
0.161ProGly: 0.161 ± 0.128
0.161ProHis: 0.161 ± 0.121
1.611ProIle: 1.611 ± 0.315
1.852ProLys: 1.852 ± 0.398
2.738ProLeu: 2.738 ± 0.437
0.564ProMet: 0.564 ± 0.179
2.497ProAsn: 2.497 ± 0.407
0.403ProPro: 0.403 ± 0.182
1.047ProGln: 1.047 ± 0.335
0.322ProArg: 0.322 ± 0.185
2.094ProSer: 2.094 ± 0.438
1.691ProThr: 1.691 ± 0.311
1.047ProVal: 1.047 ± 0.297
0.0ProTrp: 0.0 ± 0.0
1.369ProTyr: 1.369 ± 0.335
0.0ProXaa: 0.0 ± 0.0
Gln
2.094GlnAla: 2.094 ± 0.575
0.081GlnCys: 0.081 ± 0.085
1.208GlnAsp: 1.208 ± 0.336
2.336GlnGlu: 2.336 ± 0.423
1.45GlnPhe: 1.45 ± 0.271
2.416GlnGly: 2.416 ± 0.504
1.047GlnHis: 1.047 ± 0.288
4.349GlnIle: 4.349 ± 0.608
4.268GlnLys: 4.268 ± 0.815
4.027GlnLeu: 4.027 ± 0.533
1.208GlnMet: 1.208 ± 0.349
2.336GlnAsn: 2.336 ± 0.563
0.805GlnPro: 0.805 ± 0.24
1.852GlnGln: 1.852 ± 0.548
1.611GlnArg: 1.611 ± 0.385
2.416GlnSer: 2.416 ± 0.44
2.255GlnThr: 2.255 ± 0.462
1.611GlnVal: 1.611 ± 0.387
0.483GlnTrp: 0.483 ± 0.185
1.45GlnTyr: 1.45 ± 0.31
0.0GlnXaa: 0.0 ± 0.0
Arg
0.886ArgAla: 0.886 ± 0.267
0.483ArgCys: 0.483 ± 0.191
1.369ArgAsp: 1.369 ± 0.313
2.013ArgGlu: 2.013 ± 0.414
1.289ArgPhe: 1.289 ± 0.369
0.966ArgGly: 0.966 ± 0.307
0.564ArgHis: 0.564 ± 0.211
2.577ArgIle: 2.577 ± 0.654
2.497ArgLys: 2.497 ± 0.453
3.785ArgLeu: 3.785 ± 0.502
0.805ArgMet: 0.805 ± 0.209
2.416ArgAsn: 2.416 ± 0.414
0.161ArgPro: 0.161 ± 0.11
0.966ArgGln: 0.966 ± 0.256
1.127ArgArg: 1.127 ± 0.297
2.013ArgSer: 2.013 ± 0.356
2.174ArgThr: 2.174 ± 0.347
2.013ArgVal: 2.013 ± 0.348
0.081ArgTrp: 0.081 ± 0.076
1.772ArgTyr: 1.772 ± 0.415
0.0ArgXaa: 0.0 ± 0.0
Ser
2.174SerAla: 2.174 ± 0.353
0.644SerCys: 0.644 ± 0.249
3.866SerAsp: 3.866 ± 0.681
4.671SerGlu: 4.671 ± 0.73
4.027SerPhe: 4.027 ± 0.55
3.946SerGly: 3.946 ± 0.569
0.805SerHis: 0.805 ± 0.26
6.201SerIle: 6.201 ± 0.704
6.282SerLys: 6.282 ± 0.742
5.396SerLeu: 5.396 ± 0.757
1.127SerMet: 1.127 ± 0.259
5.154SerAsn: 5.154 ± 0.743
1.289SerPro: 1.289 ± 0.28
2.577SerGln: 2.577 ± 0.528
1.45SerArg: 1.45 ± 0.349
3.866SerSer: 3.866 ± 0.59
3.302SerThr: 3.302 ± 0.52
4.349SerVal: 4.349 ± 0.396
0.483SerTrp: 0.483 ± 0.222
1.852SerTyr: 1.852 ± 0.335
0.0SerXaa: 0.0 ± 0.0
Thr
4.349ThrAla: 4.349 ± 0.602
0.725ThrCys: 0.725 ± 0.204
4.349ThrAsp: 4.349 ± 0.55
3.544ThrGlu: 3.544 ± 0.62
3.785ThrPhe: 3.785 ± 0.694
3.463ThrGly: 3.463 ± 0.679
0.966ThrHis: 0.966 ± 0.232
6.201ThrIle: 6.201 ± 0.813
5.799ThrLys: 5.799 ± 0.643
4.671ThrLeu: 4.671 ± 0.649
1.772ThrMet: 1.772 ± 0.415
5.396ThrAsn: 5.396 ± 0.81
1.691ThrPro: 1.691 ± 0.458
2.738ThrGln: 2.738 ± 0.553
1.611ThrArg: 1.611 ± 0.325
3.624ThrSer: 3.624 ± 0.521
5.074ThrThr: 5.074 ± 1.032
1.611ThrVal: 1.611 ± 0.364
0.644ThrTrp: 0.644 ± 0.194
2.336ThrTyr: 2.336 ± 0.435
0.0ThrXaa: 0.0 ± 0.0
Val
3.06ValAla: 3.06 ± 0.503
0.483ValCys: 0.483 ± 0.245
3.141ValAsp: 3.141 ± 0.491
3.221ValGlu: 3.221 ± 0.392
3.06ValPhe: 3.06 ± 0.477
2.819ValGly: 2.819 ± 0.334
1.127ValHis: 1.127 ± 0.334
3.544ValIle: 3.544 ± 0.436
4.752ValLys: 4.752 ± 0.552
4.671ValLeu: 4.671 ± 0.536
1.611ValMet: 1.611 ± 0.353
5.235ValAsn: 5.235 ± 0.741
1.047ValPro: 1.047 ± 0.31
2.336ValGln: 2.336 ± 0.392
1.611ValArg: 1.611 ± 0.339
3.705ValSer: 3.705 ± 0.503
2.899ValThr: 2.899 ± 0.581
2.416ValVal: 2.416 ± 0.604
0.644ValTrp: 0.644 ± 0.226
2.174ValTyr: 2.174 ± 0.294
0.0ValXaa: 0.0 ± 0.0
Trp
0.644TrpAla: 0.644 ± 0.2
0.081TrpCys: 0.081 ± 0.087
0.564TrpAsp: 0.564 ± 0.227
0.805TrpGlu: 0.805 ± 0.242
0.483TrpPhe: 0.483 ± 0.173
0.161TrpGly: 0.161 ± 0.098
0.403TrpHis: 0.403 ± 0.19
1.047TrpIle: 1.047 ± 0.384
0.644TrpLys: 0.644 ± 0.255
0.966TrpLeu: 0.966 ± 0.268
0.0TrpMet: 0.0 ± 0.083
0.966TrpAsn: 0.966 ± 0.341
0.0TrpPro: 0.0 ± 0.0
0.242TrpGln: 0.242 ± 0.149
0.564TrpArg: 0.564 ± 0.235
0.644TrpSer: 0.644 ± 0.315
0.403TrpThr: 0.403 ± 0.163
0.564TrpVal: 0.564 ± 0.215
0.0TrpTrp: 0.0 ± 0.0
0.403TrpTyr: 0.403 ± 0.161
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.53TyrAla: 1.53 ± 0.307
0.644TyrCys: 0.644 ± 0.3
2.094TyrAsp: 2.094 ± 0.39
2.738TyrGlu: 2.738 ± 0.495
1.691TyrPhe: 1.691 ± 0.343
2.174TyrGly: 2.174 ± 0.377
0.483TyrHis: 0.483 ± 0.233
3.946TyrIle: 3.946 ± 0.527
5.476TyrLys: 5.476 ± 0.8
4.107TyrLeu: 4.107 ± 0.59
0.564TyrMet: 0.564 ± 0.238
3.463TyrAsn: 3.463 ± 0.498
0.966TyrPro: 0.966 ± 0.307
1.772TyrGln: 1.772 ± 0.368
1.127TyrArg: 1.127 ± 0.281
2.577TyrSer: 2.577 ± 0.492
2.174TyrThr: 2.174 ± 0.529
2.174TyrVal: 2.174 ± 0.433
0.564TyrTrp: 0.564 ± 0.206
2.013TyrTyr: 2.013 ± 0.555
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 72 proteins (12418 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski