Amino acid dipeptide frequency for Flavobacterium phage vB_FspS_hemulen6-2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
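As an illustration of the counting described above, the following is a minimal sketch, not the database's actual implementation. It assumes one-letter sequences, overlapping pairs counted within each protein, and normalization by the total number of dipeptides; the function name `dipeptide_frequencies` and the toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code; anything else is mapped to X (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs.

    Assumption: frequencies are normalized by the total number of dipeptides.
    """
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X.
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs inside one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Toy input, not taken from the phage proteome.
freqs = dipeptide_frequencies(["MKKLLE", "MKXA"])
print(len(freqs))   # 441 possible dipeptides, including Xaa
print(freqs["MK"])  # 2 of 8 pairs -> 250.0
```

Returning all 441 keys, including those with a zero count, mirrors the table layout, where absent dipeptides are still listed.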

All values are given in per mille (‰) and must therefore be multiplied by 10⁻³ to obtain fractions.
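To make the units concrete, here is a small sketch that converts table values from per mille to fractions and compares them with the uniform expectation of 1/400. The helper `classify` is hypothetical, and treating 1/400 as the threshold for over- and underrepresentation is an assumption based on the description above, not a confirmed detail of how the table is coloured.

```python
PER_MILLE = 1e-3
N_STANDARD_DIPEPTIDES = 20 * 20

# Uniform expectation: 1/400 = 0.0025 = 0.25% = 2.5 per mille.
uniform_expectation = 1.0 / N_STANDARD_DIPEPTIDES


def classify(value_per_mille, expected=uniform_expectation):
    """Label a table value relative to the (assumed) uniform expectation."""
    fraction = value_per_mille * PER_MILLE
    if fraction > expected:
        return "over"    # would be shown in red
    if fraction < expected:
        return "under"   # would be shown in blue
    return "as expected"


# Values copied from the table for this proteome.
for name, value in [("AlaAla", 0.736), ("LysGlu", 9.074), ("GlyPro", 0.0)]:
    print(f"{name}: {value * PER_MILLE:.6f} ({classify(value)})")
```

With these inputs, AlaAla and GlyPro fall below the 2.5‰ baseline, while LysGlu lies above it.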

For more information, see the sequence space article on Wikipedia.

| First residue (rows) / second residue (columns) | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 0.736 ± 0.346 | 0.654 ± 0.233 | 2.125 ± 0.399 | 2.698 ± 0.721 | 2.289 ± 0.486 | 2.125 ± 0.362 | 0.736 ± 0.253 | 4.169 ± 0.479 | 5.804 ± 0.649 | 4.741 ± 0.733 | 1.226 ± 0.427 | 4.823 ± 0.665 | 0.899 ± 0.249 | 1.88 ± 0.375 | 1.635 ± 0.412 | 3.188 ± 0.677 | 4.006 ± 0.573 | 3.188 ± 0.691 | 0.409 ± 0.155 | 1.962 ± 0.401 | 0.0 ± 0.0 |
| Cys | 0.245 ± 0.226 | 0.245 ± 0.172 | 0.817 ± 0.26 | 1.144 ± 0.328 | 0.654 ± 0.244 | 0.981 ± 0.37 | 0.082 ± 0.084 | 0.327 ± 0.164 | 1.063 ± 0.37 | 1.39 ± 0.356 | 0.163 ± 0.111 | 0.409 ± 0.213 | 0.572 ± 0.233 | 0.245 ± 0.14 | 0.245 ± 0.141 | 0.817 ± 0.281 | 0.572 ± 0.203 | 0.736 ± 0.243 | 0.163 ± 0.111 | 0.409 ± 0.188 | 0.0 ± 0.0 |
| Asp | 3.842 ± 0.484 | 1.063 ± 0.251 | 1.798 ± 0.321 | 4.333 ± 0.637 | 3.924 ± 0.569 | 2.779 ± 0.509 | 0.572 ± 0.233 | 3.76 ± 0.494 | 5.886 ± 0.931 | 5.232 ± 0.611 | 1.39 ± 0.396 | 4.578 ± 0.666 | 0.49 ± 0.146 | 0.409 ± 0.193 | 1.471 ± 0.315 | 3.76 ± 0.567 | 3.76 ± 0.586 | 3.025 ± 0.631 | 0.817 ± 0.211 | 3.188 ± 0.607 | 0.0 ± 0.0 |
| Glu | 2.289 ± 0.653 | 0.899 ± 0.32 | 3.433 ± 0.54 | 4.169 ± 0.755 | 4.66 ± 0.76 | 2.044 ± 0.348 | 1.063 ± 0.28 | 7.439 ± 0.783 | 7.357 ± 0.844 | 7.521 ± 0.848 | 1.962 ± 0.39 | 7.03 ± 0.808 | 1.798 ± 0.368 | 3.27 ± 0.506 | 2.207 ± 0.563 | 3.597 ± 0.511 | 4.251 ± 0.553 | 4.251 ± 0.642 | 0.409 ± 0.168 | 3.433 ± 0.581 | 0.0 ± 0.0 |
| Phe | 2.534 ± 0.433 | 0.654 ± 0.222 | 3.597 ± 0.612 | 4.414 ± 0.639 | 2.289 ± 0.356 | 2.861 ± 0.588 | 0.49 ± 0.212 | 3.27 ± 0.577 | 4.251 ± 0.613 | 3.433 ± 0.595 | 1.308 ± 0.214 | 4.905 ± 0.9 | 0.817 ± 0.229 | 1.717 ± 0.443 | 1.39 ± 0.397 | 3.842 ± 0.602 | 4.414 ± 0.674 | 2.534 ± 0.436 | 0.409 ± 0.182 | 1.798 ± 0.368 | 0.0 ± 0.0 |
| Gly | 3.025 ± 0.691 | 0.327 ± 0.151 | 2.861 ± 0.548 | 2.125 ± 0.382 | 2.616 ± 0.525 | 2.371 ± 0.576 | 0.327 ± 0.165 | 4.006 ± 0.48 | 3.188 ± 0.558 | 3.433 ± 0.51 | 1.471 ± 0.35 | 4.823 ± 0.628 | 0.0 ± 0.0 | 1.635 ± 0.338 | 1.635 ± 0.335 | 2.861 ± 0.408 | 4.741 ± 0.757 | 2.698 ± 0.412 | 0.327 ± 0.184 | 2.289 ± 0.493 | 0.0 ± 0.0 |
| His | 0.409 ± 0.183 | 0.245 ± 0.166 | 0.981 ± 0.287 | 0.49 ± 0.192 | 1.063 ± 0.271 | 0.899 ± 0.281 | 0.654 ± 0.289 | 1.226 ± 0.342 | 0.736 ± 0.25 | 1.308 ± 0.351 | 0.082 ± 0.089 | 1.063 ± 0.28 | 0.572 ± 0.244 | 0.49 ± 0.229 | 0.409 ± 0.231 | 1.144 ± 0.366 | 0.736 ± 0.192 | 0.817 ± 0.303 | 0.163 ± 0.174 | 0.572 ± 0.198 | 0.0 ± 0.0 |
| Ile | 4.741 ± 0.706 | 0.981 ± 0.399 | 5.068 ± 0.576 | 8.011 ± 0.954 | 3.433 ± 0.522 | 3.842 ± 0.615 | 0.981 ± 0.344 | 5.313 ± 0.921 | 8.338 ± 0.827 | 6.785 ± 0.747 | 1.144 ± 0.37 | 6.049 ± 0.85 | 2.616 ± 0.499 | 2.371 ± 0.521 | 1.717 ± 0.442 | 5.477 ± 0.75 | 4.987 ± 0.739 | 4.496 ± 0.655 | 0.981 ± 0.271 | 3.188 ± 0.439 | 0.0 ± 0.0 |
| Lys | 5.313 ± 0.695 | 1.308 ± 0.351 | 4.414 ± 0.635 | 9.074 ± 1.207 | 3.352 ± 0.43 | 4.169 ± 0.603 | 1.962 ± 0.484 | 8.093 ± 0.772 | 6.948 ± 0.807 | 7.766 ± 0.849 | 3.27 ± 0.558 | 6.213 ± 0.711 | 2.534 ± 0.408 | 4.741 ± 0.915 | 3.679 ± 0.68 | 5.313 ± 0.567 | 6.213 ± 0.852 | 5.15 ± 0.676 | 1.226 ± 0.37 | 4.087 ± 0.64 | 0.0 ± 0.0 |
| Leu | 4.006 ± 0.704 | 0.49 ± 0.21 | 5.395 ± 0.601 | 6.376 ± 1.079 | 4.087 ± 0.459 | 3.679 ± 0.459 | 0.981 ± 0.266 | 7.112 ± 0.9 | 9.074 ± 0.846 | 6.54 ± 0.79 | 1.962 ± 0.476 | 7.194 ± 0.803 | 3.924 ± 0.674 | 4.169 ± 0.584 | 2.943 ± 0.523 | 5.232 ± 0.691 | 6.049 ± 0.759 | 4.578 ± 0.722 | 0.654 ± 0.243 | 3.679 ± 0.643 | 0.0 ± 0.0 |
| Met | 1.717 ± 0.413 | 0.163 ± 0.137 | 0.736 ± 0.235 | 1.553 ± 0.349 | 1.226 ± 0.315 | 0.817 ± 0.249 | 0.082 ± 0.072 | 1.226 ± 0.314 | 2.779 ± 0.568 | 1.717 ± 0.356 | 0.327 ± 0.162 | 1.553 ± 0.388 | 0.817 ± 0.214 | 1.144 ± 0.276 | 0.817 ± 0.252 | 1.88 ± 0.419 | 1.063 ± 0.287 | 0.899 ± 0.23 | 0.327 ± 0.159 | 0.49 ± 0.202 | 0.0 ± 0.0 |
| Asn | 4.66 ± 0.662 | 0.654 ± 0.226 | 4.905 ± 0.772 | 6.049 ± 0.682 | 4.333 ± 0.689 | 4.006 ± 0.632 | 1.226 ± 0.322 | 5.15 ± 0.616 | 9.074 ± 1.138 | 7.03 ± 0.865 | 0.899 ± 0.279 | 4.496 ± 0.936 | 2.125 ± 0.404 | 3.025 ± 0.407 | 2.207 ± 0.584 | 5.232 ± 0.598 | 4.496 ± 0.552 | 5.559 ± 0.622 | 0.654 ± 0.257 | 4.496 ± 0.569 | 0.0 ± 0.0 |
| Pro | 1.471 ± 0.338 | 0.49 ± 0.179 | 1.144 ± 0.289 | 1.798 ± 0.445 | 1.635 ± 0.33 | 0.0 ± 0.0 | 0.082 ± 0.079 | 1.39 ± 0.31 | 1.717 ± 0.395 | 3.025 ± 0.542 | 0.817 ± 0.255 | 2.289 ± 0.405 | 0.736 ± 0.354 | 1.39 ± 0.331 | 0.409 ± 0.218 | 2.289 ± 0.422 | 1.88 ± 0.423 | 1.308 ± 0.302 | 0.0 ± 0.0 | 1.635 ± 0.439 | 0.0 ± 0.0 |
| Gln | 2.125 ± 0.615 | 0.327 ± 0.183 | 1.144 ± 0.312 | 2.289 ± 0.488 | 1.063 ± 0.244 | 2.289 ± 0.421 | 0.736 ± 0.276 | 4.414 ± 0.655 | 4.006 ± 0.77 | 3.597 ± 0.543 | 0.899 ± 0.333 | 2.371 ± 0.453 | 1.226 ± 0.299 | 2.044 ± 0.791 | 1.962 ± 0.489 | 2.044 ± 0.388 | 2.371 ± 0.55 | 2.044 ± 0.461 | 0.409 ± 0.188 | 1.635 ± 0.373 | 0.0 ± 0.0 |
| Arg | 0.899 ± 0.349 | 0.409 ± 0.205 | 1.471 ± 0.28 | 2.207 ± 0.407 | 1.144 ± 0.327 | 1.063 ± 0.283 | 0.49 ± 0.21 | 2.861 ± 0.68 | 2.698 ± 0.358 | 2.943 ± 0.459 | 0.654 ± 0.194 | 2.698 ± 0.468 | 0.245 ± 0.144 | 0.899 ± 0.331 | 0.899 ± 0.257 | 2.207 ± 0.424 | 2.125 ± 0.374 | 2.289 ± 0.365 | 0.163 ± 0.132 | 1.88 ± 0.406 | 0.0 ± 0.0 |
| Ser | 2.125 ± 0.346 | 0.409 ± 0.2 | 4.169 ± 0.709 | 5.15 ± 0.832 | 4.251 ± 0.692 | 4.087 ± 0.608 | 0.817 ± 0.186 | 5.722 ± 0.617 | 5.886 ± 0.77 | 5.886 ± 0.78 | 1.144 ± 0.259 | 5.15 ± 0.786 | 1.553 ± 0.335 | 2.779 ± 0.508 | 1.88 ± 0.369 | 3.924 ± 0.678 | 2.616 ± 0.452 | 4.741 ± 0.599 | 0.654 ± 0.241 | 1.308 ± 0.296 | 0.0 ± 0.0 |
| Thr | 3.924 ± 0.677 | 0.572 ± 0.221 | 4.741 ± 0.57 | 4.496 ± 0.782 | 3.679 ± 0.558 | 3.433 ± 0.609 | 1.144 ± 0.292 | 6.458 ± 0.868 | 4.823 ± 0.533 | 5.722 ± 0.623 | 1.063 ± 0.373 | 5.068 ± 0.684 | 2.207 ± 0.427 | 3.106 ± 0.533 | 1.39 ± 0.268 | 3.679 ± 0.605 | 5.559 ± 0.915 | 1.717 ± 0.366 | 0.654 ± 0.218 | 2.452 ± 0.481 | 0.0 ± 0.0 |
| Val | 3.025 ± 0.468 | 0.736 ± 0.31 | 3.352 ± 0.525 | 3.025 ± 0.535 | 2.534 ± 0.428 | 3.106 ± 0.411 | 0.654 ± 0.226 | 4.333 ± 0.49 | 5.395 ± 0.705 | 4.905 ± 0.524 | 1.063 ± 0.307 | 5.068 ± 0.715 | 1.144 ± 0.293 | 2.207 ± 0.449 | 1.635 ± 0.411 | 4.741 ± 0.609 | 2.534 ± 0.446 | 3.106 ± 0.69 | 0.817 ± 0.284 | 2.207 ± 0.341 | 0.0 ± 0.0 |
| Trp | 0.49 ± 0.179 | 0.082 ± 0.098 | 0.899 ± 0.291 | 0.654 ± 0.218 | 0.327 ± 0.129 | 0.245 ± 0.152 | 0.327 ± 0.154 | 0.899 ± 0.314 | 0.981 ± 0.352 | 1.226 ± 0.247 | 0.082 ± 0.129 | 0.899 ± 0.318 | 0.0 ± 0.0 | 0.163 ± 0.111 | 0.409 ± 0.241 | 0.572 ± 0.206 | 0.654 ± 0.214 | 0.245 ± 0.129 | 0.0 ± 0.0 | 0.49 ± 0.233 | 0.0 ± 0.0 |
| Tyr | 1.717 ± 0.279 | 0.572 ± 0.198 | 3.025 ± 0.52 | 3.025 ± 0.545 | 2.371 ± 0.505 | 1.962 ± 0.347 | 0.654 ± 0.224 | 3.352 ± 0.499 | 4.823 ± 0.765 | 3.924 ± 0.613 | 0.327 ± 0.159 | 3.597 ± 0.522 | 1.226 ± 0.418 | 1.308 ± 0.3 | 1.308 ± 0.436 | 2.452 ± 0.458 | 2.861 ± 0.512 | 2.207 ± 0.454 | 0.49 ± 0.22 | 2.125 ± 0.526 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 70 proteins (12234 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
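The following sketch shows one way such a protein-level bootstrap could look. It is written under stated assumptions rather than taken from the database's code: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the error is taken as the standard deviation across replicates. The function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide in per mille over all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Estimate the error by resampling whole proteins with replacement.

    Assumption: the reported error is the standard deviation over replicates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Each replicate draws as many proteins as the original proteome has.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


# Toy proteome, not taken from the phage.
toy = ["MKKLLE", "MKTAYK", "GSKKA"]
print(dipeptide_per_mille(toy, "KK"), "±", bootstrap_error(toy, "KK"))
```

Resampling proteins rather than individual dipeptides keeps the pairs from one protein together, so the spread reflects variation between proteins.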

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944

Contact: Lukasz P. Kozlowski