Amino acid dipepetide frequency for Flavobacterium phage vB_FspS_snusmum6-1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.886AlaAla: 0.886 ± 0.391
0.564AlaCys: 0.564 ± 0.229
1.933AlaAsp: 1.933 ± 0.36
3.221AlaGlu: 3.221 ± 0.737
2.658AlaPhe: 2.658 ± 0.513
1.933AlaGly: 1.933 ± 0.381
0.403AlaHis: 0.403 ± 0.203
4.268AlaIle: 4.268 ± 0.584
5.879AlaLys: 5.879 ± 0.741
4.993AlaLeu: 4.993 ± 0.624
1.53AlaMet: 1.53 ± 0.555
4.832AlaAsn: 4.832 ± 0.668
0.886AlaPro: 0.886 ± 0.25
1.53AlaGln: 1.53 ± 0.328
1.369AlaArg: 1.369 ± 0.409
2.819AlaSer: 2.819 ± 0.487
4.107AlaThr: 4.107 ± 0.503
3.382AlaVal: 3.382 ± 0.543
0.725AlaTrp: 0.725 ± 0.191
1.45AlaTyr: 1.45 ± 0.233
0.0AlaXaa: 0.0 ± 0.0
Cys
0.242CysAla: 0.242 ± 0.142
0.242CysCys: 0.242 ± 0.154
0.564CysAsp: 0.564 ± 0.2
1.208CysGlu: 1.208 ± 0.317
0.886CysPhe: 0.886 ± 0.224
1.208CysGly: 1.208 ± 0.377
0.161CysHis: 0.161 ± 0.121
0.805CysIle: 0.805 ± 0.238
0.966CysLys: 0.966 ± 0.344
1.127CysLeu: 1.127 ± 0.302
0.081CysMet: 0.081 ± 0.076
0.564CysAsn: 0.564 ± 0.313
0.483CysPro: 0.483 ± 0.177
0.322CysGln: 0.322 ± 0.186
0.403CysArg: 0.403 ± 0.173
0.483CysSer: 0.483 ± 0.173
0.805CysThr: 0.805 ± 0.189
0.805CysVal: 0.805 ± 0.247
0.081CysTrp: 0.081 ± 0.086
0.403CysTyr: 0.403 ± 0.171
0.0CysXaa: 0.0 ± 0.0
Asp
3.544AspAla: 3.544 ± 0.388
0.886AspCys: 0.886 ± 0.216
2.819AspAsp: 2.819 ± 0.651
4.268AspGlu: 4.268 ± 0.638
4.429AspPhe: 4.429 ± 0.555
2.819AspGly: 2.819 ± 0.506
0.725AspHis: 0.725 ± 0.22
4.429AspIle: 4.429 ± 0.706
4.913AspLys: 4.913 ± 0.765
4.913AspLeu: 4.913 ± 0.624
1.208AspMet: 1.208 ± 0.349
3.785AspAsn: 3.785 ± 0.711
0.483AspPro: 0.483 ± 0.221
0.644AspGln: 0.644 ± 0.267
1.047AspArg: 1.047 ± 0.269
4.59AspSer: 4.59 ± 0.435
3.463AspThr: 3.463 ± 0.648
3.705AspVal: 3.705 ± 0.602
0.483AspTrp: 0.483 ± 0.188
2.98AspTyr: 2.98 ± 0.502
0.0AspXaa: 0.0 ± 0.0
Glu
3.946GluAla: 3.946 ± 0.779
0.725GluCys: 0.725 ± 0.328
2.98GluAsp: 2.98 ± 0.662
3.866GluGlu: 3.866 ± 0.686
3.866GluPhe: 3.866 ± 0.529
1.53GluGly: 1.53 ± 0.297
1.289GluHis: 1.289 ± 0.286
7.49GluIle: 7.49 ± 0.805
6.604GluLys: 6.604 ± 0.815
7.651GluLeu: 7.651 ± 0.836
1.852GluMet: 1.852 ± 0.328
5.718GluAsn: 5.718 ± 0.76
1.772GluPro: 1.772 ± 0.393
3.302GluGln: 3.302 ± 0.386
2.255GluArg: 2.255 ± 0.623
4.188GluSer: 4.188 ± 0.482
3.866GluThr: 3.866 ± 0.57
3.382GluVal: 3.382 ± 0.437
0.403GluTrp: 0.403 ± 0.148
2.899GluTyr: 2.899 ± 0.458
0.0GluXaa: 0.0 ± 0.0
Phe
2.577PheAla: 2.577 ± 0.409
0.483PheCys: 0.483 ± 0.173
4.107PheAsp: 4.107 ± 0.518
2.98PheGlu: 2.98 ± 0.526
2.738PhePhe: 2.738 ± 0.447
3.06PheGly: 3.06 ± 0.498
0.322PheHis: 0.322 ± 0.153
3.785PheIle: 3.785 ± 0.527
4.993PheLys: 4.993 ± 0.563
3.705PheLeu: 3.705 ± 0.664
0.966PheMet: 0.966 ± 0.225
4.59PheAsn: 4.59 ± 0.657
1.208PhePro: 1.208 ± 0.326
2.094PheGln: 2.094 ± 0.483
0.805PheArg: 0.805 ± 0.271
3.705PheSer: 3.705 ± 0.592
4.913PheThr: 4.913 ± 0.722
2.738PheVal: 2.738 ± 0.481
0.322PheTrp: 0.322 ± 0.147
1.772PheTyr: 1.772 ± 0.388
0.0PheXaa: 0.0 ± 0.0
Gly
2.98GlyAla: 2.98 ± 0.725
0.644GlyCys: 0.644 ± 0.238
1.852GlyAsp: 1.852 ± 0.388
2.094GlyGlu: 2.094 ± 0.349
2.658GlyPhe: 2.658 ± 0.563
1.691GlyGly: 1.691 ± 0.391
0.483GlyHis: 0.483 ± 0.226
4.027GlyIle: 4.027 ± 0.444
3.785GlyLys: 3.785 ± 0.615
3.946GlyLeu: 3.946 ± 0.482
0.886GlyMet: 0.886 ± 0.235
4.027GlyAsn: 4.027 ± 0.464
0.081GlyPro: 0.081 ± 0.078
1.772GlyGln: 1.772 ± 0.356
1.772GlyArg: 1.772 ± 0.324
3.06GlySer: 3.06 ± 0.551
3.946GlyThr: 3.946 ± 0.602
2.577GlyVal: 2.577 ± 0.441
0.161GlyTrp: 0.161 ± 0.106
2.336GlyTyr: 2.336 ± 0.408
0.0GlyXaa: 0.0 ± 0.0
His
0.322HisAla: 0.322 ± 0.144
0.403HisCys: 0.403 ± 0.177
0.886HisAsp: 0.886 ± 0.275
0.886HisGlu: 0.886 ± 0.262
1.127HisPhe: 1.127 ± 0.307
0.564HisGly: 0.564 ± 0.179
0.483HisHis: 0.483 ± 0.275
1.047HisIle: 1.047 ± 0.304
1.289HisLys: 1.289 ± 0.358
1.289HisLeu: 1.289 ± 0.333
0.0HisMet: 0.0 ± 0.0
1.208HisAsn: 1.208 ± 0.279
0.403HisPro: 0.403 ± 0.177
0.483HisGln: 0.483 ± 0.219
0.644HisArg: 0.644 ± 0.231
1.127HisSer: 1.127 ± 0.334
1.047HisThr: 1.047 ± 0.255
0.725HisVal: 0.725 ± 0.251
0.081HisTrp: 0.081 ± 0.083
1.047HisTyr: 1.047 ± 0.243
0.0HisXaa: 0.0 ± 0.0
Ile
3.866IleAla: 3.866 ± 0.688
1.289IleCys: 1.289 ± 0.319
6.845IleAsp: 6.845 ± 1.003
7.973IleGlu: 7.973 ± 0.76
3.705IlePhe: 3.705 ± 0.56
3.624IleGly: 3.624 ± 0.611
1.369IleHis: 1.369 ± 0.393
6.684IleIle: 6.684 ± 0.853
8.053IleLys: 8.053 ± 0.73
8.456IleLeu: 8.456 ± 0.879
1.611IleMet: 1.611 ± 0.386
6.523IleAsn: 6.523 ± 0.71
2.013IlePro: 2.013 ± 0.388
3.544IleGln: 3.544 ± 0.679
2.336IleArg: 2.336 ± 0.406
5.557IleSer: 5.557 ± 0.79
5.154IleThr: 5.154 ± 0.617
4.51IleVal: 4.51 ± 0.559
0.644IleTrp: 0.644 ± 0.223
3.141IleTyr: 3.141 ± 0.544
0.0IleXaa: 0.0 ± 0.0
Lys
5.557LysAla: 5.557 ± 0.599
1.047LysCys: 1.047 ± 0.276
5.557LysAsp: 5.557 ± 0.694
7.731LysGlu: 7.731 ± 1.171
3.785LysPhe: 3.785 ± 0.422
4.107LysGly: 4.107 ± 0.572
1.772LysHis: 1.772 ± 0.342
9.1LysIle: 9.1 ± 0.894
8.376LysLys: 8.376 ± 1.03
7.168LysLeu: 7.168 ± 0.854
3.382LysMet: 3.382 ± 0.547
6.523LysAsn: 6.523 ± 0.693
2.577LysPro: 2.577 ± 0.549
4.671LysGln: 4.671 ± 0.789
3.463LysArg: 3.463 ± 0.575
4.671LysSer: 4.671 ± 0.541
5.799LysThr: 5.799 ± 0.778
5.235LysVal: 5.235 ± 0.637
1.127LysTrp: 1.127 ± 0.291
4.993LysTyr: 4.993 ± 0.638
0.0LysXaa: 0.0 ± 0.0
Leu
3.785LeuAla: 3.785 ± 0.64
1.127LeuCys: 1.127 ± 0.374
5.235LeuAsp: 5.235 ± 0.616
6.04LeuGlu: 6.04 ± 0.821
3.866LeuPhe: 3.866 ± 0.522
3.382LeuGly: 3.382 ± 0.573
0.966LeuHis: 0.966 ± 0.312
8.698LeuIle: 8.698 ± 0.937
8.537LeuLys: 8.537 ± 0.826
6.684LeuLeu: 6.684 ± 0.75
2.174LeuMet: 2.174 ± 0.462
8.053LeuAsn: 8.053 ± 0.75
3.463LeuPro: 3.463 ± 0.543
4.107LeuGln: 4.107 ± 0.597
2.658LeuArg: 2.658 ± 0.389
5.476LeuSer: 5.476 ± 0.72
5.96LeuThr: 5.96 ± 0.742
4.429LeuVal: 4.429 ± 0.55
1.45LeuTrp: 1.45 ± 0.353
3.221LeuTyr: 3.221 ± 0.466
0.0LeuXaa: 0.0 ± 0.0
Met
1.611MetAla: 1.611 ± 0.315
0.081MetCys: 0.081 ± 0.084
0.886MetAsp: 0.886 ± 0.28
2.094MetGlu: 2.094 ± 0.422
0.886MetPhe: 0.886 ± 0.268
0.886MetGly: 0.886 ± 0.286
0.483MetHis: 0.483 ± 0.162
1.691MetIle: 1.691 ± 0.352
2.819MetLys: 2.819 ± 0.671
1.852MetLeu: 1.852 ± 0.334
0.161MetMet: 0.161 ± 0.106
1.208MetAsn: 1.208 ± 0.303
1.45MetPro: 1.45 ± 0.276
0.886MetGln: 0.886 ± 0.267
0.966MetArg: 0.966 ± 0.237
1.772MetSer: 1.772 ± 0.343
0.966MetThr: 0.966 ± 0.257
1.127MetVal: 1.127 ± 0.262
0.403MetTrp: 0.403 ± 0.169
0.403MetTyr: 0.403 ± 0.155
0.0MetXaa: 0.0 ± 0.0
Asn
4.429AsnAla: 4.429 ± 0.669
0.805AsnCys: 0.805 ± 0.198
4.59AsnAsp: 4.59 ± 0.666
6.121AsnGlu: 6.121 ± 0.63
3.866AsnPhe: 3.866 ± 0.605
3.866AsnGly: 3.866 ± 0.525
1.047AsnHis: 1.047 ± 0.293
5.396AsnIle: 5.396 ± 0.519
8.859AsnLys: 8.859 ± 1.133
7.168AsnLeu: 7.168 ± 1.017
0.966AsnMet: 0.966 ± 0.27
5.396AsnAsn: 5.396 ± 0.975
2.336AsnPro: 2.336 ± 0.442
2.416AsnGln: 2.416 ± 0.431
2.819AsnArg: 2.819 ± 0.483
4.993AsnSer: 4.993 ± 0.651
4.349AsnThr: 4.349 ± 0.54
5.154AsnVal: 5.154 ± 0.656
0.886AsnTrp: 0.886 ± 0.24
4.349AsnTyr: 4.349 ± 0.419
0.0AsnXaa: 0.0 ± 0.0
Pro
0.886ProAla: 0.886 ± 0.27
0.403ProCys: 0.403 ± 0.168
1.289ProAsp: 1.289 ± 0.266
1.772ProGlu: 1.772 ± 0.382
1.611ProPhe: 1.611 ± 0.324
0.161ProGly: 0.161 ± 0.106
0.161ProHis: 0.161 ± 0.114
1.611ProIle: 1.611 ± 0.374
1.852ProLys: 1.852 ± 0.439
2.738ProLeu: 2.738 ± 0.425
0.564ProMet: 0.564 ± 0.197
2.497ProAsn: 2.497 ± 0.423
0.403ProPro: 0.403 ± 0.184
1.047ProGln: 1.047 ± 0.344
0.322ProArg: 0.322 ± 0.159
2.094ProSer: 2.094 ± 0.443
1.691ProThr: 1.691 ± 0.296
1.047ProVal: 1.047 ± 0.29
0.0ProTrp: 0.0 ± 0.0
1.369ProTyr: 1.369 ± 0.348
0.0ProXaa: 0.0 ± 0.0
Gln
2.094GlnAla: 2.094 ± 0.551
0.081GlnCys: 0.081 ± 0.087
1.208GlnAsp: 1.208 ± 0.305
2.336GlnGlu: 2.336 ± 0.389
1.45GlnPhe: 1.45 ± 0.304
2.416GlnGly: 2.416 ± 0.477
1.047GlnHis: 1.047 ± 0.294
4.349GlnIle: 4.349 ± 0.575
4.268GlnLys: 4.268 ± 0.766
4.027GlnLeu: 4.027 ± 0.5
1.208GlnMet: 1.208 ± 0.375
2.336GlnAsn: 2.336 ± 0.467
0.805GlnPro: 0.805 ± 0.255
1.852GlnGln: 1.852 ± 0.674
1.611GlnArg: 1.611 ± 0.35
2.416GlnSer: 2.416 ± 0.403
2.255GlnThr: 2.255 ± 0.49
1.611GlnVal: 1.611 ± 0.316
0.483GlnTrp: 0.483 ± 0.162
1.45GlnTyr: 1.45 ± 0.332
0.0GlnXaa: 0.0 ± 0.0
Arg
0.886ArgAla: 0.886 ± 0.281
0.483ArgCys: 0.483 ± 0.167
1.369ArgAsp: 1.369 ± 0.289
2.013ArgGlu: 2.013 ± 0.377
1.289ArgPhe: 1.289 ± 0.349
0.966ArgGly: 0.966 ± 0.339
0.564ArgHis: 0.564 ± 0.234
2.577ArgIle: 2.577 ± 0.504
2.497ArgLys: 2.497 ± 0.376
3.785ArgLeu: 3.785 ± 0.532
0.805ArgMet: 0.805 ± 0.209
2.416ArgAsn: 2.416 ± 0.388
0.161ArgPro: 0.161 ± 0.119
0.966ArgGln: 0.966 ± 0.274
1.127ArgArg: 1.127 ± 0.263
2.013ArgSer: 2.013 ± 0.354
2.174ArgThr: 2.174 ± 0.398
2.013ArgVal: 2.013 ± 0.302
0.081ArgTrp: 0.081 ± 0.08
1.772ArgTyr: 1.772 ± 0.387
0.0ArgXaa: 0.0 ± 0.0
Ser
2.174SerAla: 2.174 ± 0.404
0.644SerCys: 0.644 ± 0.23
3.866SerAsp: 3.866 ± 0.756
4.671SerGlu: 4.671 ± 0.754
4.027SerPhe: 4.027 ± 0.627
3.946SerGly: 3.946 ± 0.538
0.805SerHis: 0.805 ± 0.279
6.201SerIle: 6.201 ± 0.547
6.282SerLys: 6.282 ± 0.657
5.396SerLeu: 5.396 ± 0.655
1.127SerMet: 1.127 ± 0.279
5.074SerAsn: 5.074 ± 0.762
1.289SerPro: 1.289 ± 0.25
2.577SerGln: 2.577 ± 0.53
1.45SerArg: 1.45 ± 0.308
3.866SerSer: 3.866 ± 0.655
3.302SerThr: 3.302 ± 0.526
4.349SerVal: 4.349 ± 0.447
0.483SerTrp: 0.483 ± 0.226
1.852SerTyr: 1.852 ± 0.336
0.0SerXaa: 0.0 ± 0.0
Thr
4.349ThrAla: 4.349 ± 0.637
0.725ThrCys: 0.725 ± 0.218
4.349ThrAsp: 4.349 ± 0.513
3.544ThrGlu: 3.544 ± 0.689
3.785ThrPhe: 3.785 ± 0.651
3.463ThrGly: 3.463 ± 0.671
0.966ThrHis: 0.966 ± 0.239
6.201ThrIle: 6.201 ± 0.847
5.799ThrLys: 5.799 ± 0.662
4.671ThrLeu: 4.671 ± 0.619
1.772ThrMet: 1.772 ± 0.425
5.396ThrAsn: 5.396 ± 0.725
1.691ThrPro: 1.691 ± 0.426
2.738ThrGln: 2.738 ± 0.467
1.611ThrArg: 1.611 ± 0.317
3.624ThrSer: 3.624 ± 0.565
5.074ThrThr: 5.074 ± 0.86
1.611ThrVal: 1.611 ± 0.353
0.644ThrTrp: 0.644 ± 0.189
2.336ThrTyr: 2.336 ± 0.514
0.0ThrXaa: 0.0 ± 0.0
Val
3.06ValAla: 3.06 ± 0.455
0.483ValCys: 0.483 ± 0.196
3.141ValAsp: 3.141 ± 0.427
3.221ValGlu: 3.221 ± 0.412
3.06ValPhe: 3.06 ± 0.434
2.819ValGly: 2.819 ± 0.388
1.127ValHis: 1.127 ± 0.314
3.544ValIle: 3.544 ± 0.461
4.752ValLys: 4.752 ± 0.518
4.671ValLeu: 4.671 ± 0.666
1.611ValMet: 1.611 ± 0.374
5.235ValAsn: 5.235 ± 0.771
1.047ValPro: 1.047 ± 0.281
2.336ValGln: 2.336 ± 0.381
1.611ValArg: 1.611 ± 0.332
3.705ValSer: 3.705 ± 0.525
2.899ValThr: 2.899 ± 0.529
2.416ValVal: 2.416 ± 0.547
0.644ValTrp: 0.644 ± 0.198
2.174ValTyr: 2.174 ± 0.283
0.0ValXaa: 0.0 ± 0.0
Trp
0.644TrpAla: 0.644 ± 0.212
0.081TrpCys: 0.081 ± 0.08
0.564TrpAsp: 0.564 ± 0.229
0.805TrpGlu: 0.805 ± 0.227
0.483TrpPhe: 0.483 ± 0.175
0.161TrpGly: 0.161 ± 0.109
0.403TrpHis: 0.403 ± 0.204
1.047TrpIle: 1.047 ± 0.349
0.644TrpLys: 0.644 ± 0.26
0.966TrpLeu: 0.966 ± 0.244
0.081TrpMet: 0.081 ± 0.084
0.966TrpAsn: 0.966 ± 0.274
0.0TrpPro: 0.0 ± 0.0
0.242TrpGln: 0.242 ± 0.131
0.564TrpArg: 0.564 ± 0.204
0.644TrpSer: 0.644 ± 0.255
0.403TrpThr: 0.403 ± 0.154
0.564TrpVal: 0.564 ± 0.197
0.0TrpTrp: 0.0 ± 0.0
0.403TrpTyr: 0.403 ± 0.175
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.53TyrAla: 1.53 ± 0.275
0.644TyrCys: 0.644 ± 0.281
2.094TyrAsp: 2.094 ± 0.421
2.738TyrGlu: 2.738 ± 0.503
1.691TyrPhe: 1.691 ± 0.318
2.174TyrGly: 2.174 ± 0.374
0.483TyrHis: 0.483 ± 0.235
3.946TyrIle: 3.946 ± 0.594
5.476TyrLys: 5.476 ± 0.724
4.107TyrLeu: 4.107 ± 0.589
0.564TyrMet: 0.564 ± 0.217
3.544TyrAsn: 3.544 ± 0.55
0.966TyrPro: 0.966 ± 0.301
1.772TyrGln: 1.772 ± 0.339
1.127TyrArg: 1.127 ± 0.337
2.577TyrSer: 2.577 ± 0.473
2.174TyrThr: 2.174 ± 0.534
2.174TyrVal: 2.174 ± 0.497
0.564TyrTrp: 0.564 ± 0.183
2.013TyrTyr: 2.013 ± 0.456
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 72 proteins (12418 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski