Amino acid dipepetide frequency for Yersinia phage vB_YpM_46

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.146AlaAla: 10.146 ± 2.169
0.913AlaCys: 0.913 ± 0.321
5.58AlaAsp: 5.58 ± 0.832
5.377AlaGlu: 5.377 ± 0.707
3.145AlaPhe: 3.145 ± 0.542
8.421AlaGly: 8.421 ± 1.163
1.623AlaHis: 1.623 ± 0.365
4.363AlaIle: 4.363 ± 0.694
4.769AlaLys: 4.769 ± 0.934
8.827AlaLeu: 8.827 ± 1.418
2.131AlaMet: 2.131 ± 0.426
2.334AlaAsn: 2.334 ± 0.688
4.464AlaPro: 4.464 ± 0.787
4.16AlaGln: 4.16 ± 0.749
4.667AlaArg: 4.667 ± 1.018
7.407AlaSer: 7.407 ± 0.89
6.798AlaThr: 6.798 ± 1.287
7.508AlaVal: 7.508 ± 0.906
1.623AlaTrp: 1.623 ± 0.42
3.145AlaTyr: 3.145 ± 0.708
0.0AlaXaa: 0.0 ± 0.0
Cys
0.812CysAla: 0.812 ± 0.265
0.203CysCys: 0.203 ± 0.151
0.609CysAsp: 0.609 ± 0.264
0.304CysGlu: 0.304 ± 0.175
0.304CysPhe: 0.304 ± 0.254
0.507CysGly: 0.507 ± 0.204
0.101CysHis: 0.101 ± 0.09
0.406CysIle: 0.406 ± 0.261
0.406CysLys: 0.406 ± 0.187
0.406CysLeu: 0.406 ± 0.199
0.203CysMet: 0.203 ± 0.138
0.406CysAsn: 0.406 ± 0.183
0.507CysPro: 0.507 ± 0.2
0.913CysGln: 0.913 ± 0.318
1.015CysArg: 1.015 ± 0.327
0.609CysSer: 0.609 ± 0.247
0.913CysThr: 0.913 ± 0.285
0.609CysVal: 0.609 ± 0.225
0.101CysTrp: 0.101 ± 0.089
0.304CysTyr: 0.304 ± 0.182
0.0CysXaa: 0.0 ± 0.0
Asp
7.102AspAla: 7.102 ± 0.832
0.609AspCys: 0.609 ± 0.278
2.841AspAsp: 2.841 ± 0.619
4.87AspGlu: 4.87 ± 0.987
3.145AspPhe: 3.145 ± 0.789
4.87AspGly: 4.87 ± 0.632
0.304AspHis: 0.304 ± 0.178
4.363AspIle: 4.363 ± 0.717
2.638AspLys: 2.638 ± 0.597
4.261AspLeu: 4.261 ± 0.652
0.609AspMet: 0.609 ± 0.259
1.826AspAsn: 1.826 ± 0.366
2.131AspPro: 2.131 ± 0.512
1.623AspGln: 1.623 ± 0.44
2.232AspArg: 2.232 ± 0.555
2.638AspSer: 2.638 ± 0.461
3.754AspThr: 3.754 ± 0.565
3.247AspVal: 3.247 ± 0.615
0.913AspTrp: 0.913 ± 0.261
2.131AspTyr: 2.131 ± 0.511
0.0AspXaa: 0.0 ± 0.0
Glu
5.377GluAla: 5.377 ± 0.84
0.406GluCys: 0.406 ± 0.207
2.334GluAsp: 2.334 ± 0.456
3.45GluGlu: 3.45 ± 0.59
1.522GluPhe: 1.522 ± 0.408
2.334GluGly: 2.334 ± 0.415
1.218GluHis: 1.218 ± 0.329
3.348GluIle: 3.348 ± 0.631
3.957GluLys: 3.957 ± 0.569
7.711GluLeu: 7.711 ± 0.666
2.334GluMet: 2.334 ± 0.387
3.145GluAsn: 3.145 ± 0.689
2.638GluPro: 2.638 ± 0.654
3.247GluGln: 3.247 ± 0.658
5.175GluArg: 5.175 ± 1.011
4.261GluSer: 4.261 ± 0.735
3.653GluThr: 3.653 ± 0.663
3.754GluVal: 3.754 ± 0.788
1.623GluTrp: 1.623 ± 0.504
2.334GluTyr: 2.334 ± 0.502
0.0GluXaa: 0.0 ± 0.0
Phe
2.942PheAla: 2.942 ± 0.488
0.304PheCys: 0.304 ± 0.192
1.725PheAsp: 1.725 ± 0.338
2.131PheGlu: 2.131 ± 0.468
1.42PhePhe: 1.42 ± 0.394
1.725PheGly: 1.725 ± 0.369
0.609PheHis: 0.609 ± 0.249
1.42PheIle: 1.42 ± 0.349
2.841PheLys: 2.841 ± 0.535
3.247PheLeu: 3.247 ± 0.564
0.812PheMet: 0.812 ± 0.344
1.928PheAsn: 1.928 ± 0.449
1.015PhePro: 1.015 ± 0.361
1.218PheGln: 1.218 ± 0.307
2.131PheArg: 2.131 ± 0.492
2.841PheSer: 2.841 ± 0.672
3.247PheThr: 3.247 ± 0.586
1.42PheVal: 1.42 ± 0.414
0.812PheTrp: 0.812 ± 0.306
1.218PheTyr: 1.218 ± 0.397
0.0PheXaa: 0.0 ± 0.0
Gly
5.377GlyAla: 5.377 ± 0.933
1.015GlyCys: 1.015 ± 0.378
5.073GlyAsp: 5.073 ± 0.744
4.16GlyGlu: 4.16 ± 0.562
2.334GlyPhe: 2.334 ± 0.485
5.175GlyGly: 5.175 ± 1.114
0.71GlyHis: 0.71 ± 0.263
3.856GlyIle: 3.856 ± 0.624
5.682GlyLys: 5.682 ± 0.81
5.073GlyLeu: 5.073 ± 0.816
2.537GlyMet: 2.537 ± 0.617
1.826GlyAsn: 1.826 ± 0.432
0.203GlyPro: 0.203 ± 0.127
2.334GlyGln: 2.334 ± 0.442
4.769GlyArg: 4.769 ± 0.863
3.247GlySer: 3.247 ± 0.559
4.058GlyThr: 4.058 ± 0.974
5.479GlyVal: 5.479 ± 0.738
1.218GlyTrp: 1.218 ± 0.249
2.029GlyTyr: 2.029 ± 0.378
0.0GlyXaa: 0.0 ± 0.0
His
1.826HisAla: 1.826 ± 0.558
0.507HisCys: 0.507 ± 0.194
0.913HisAsp: 0.913 ± 0.36
1.319HisGlu: 1.319 ± 0.452
0.812HisPhe: 0.812 ± 0.441
1.522HisGly: 1.522 ± 0.442
0.609HisHis: 0.609 ± 0.222
1.116HisIle: 1.116 ± 0.314
0.507HisLys: 0.507 ± 0.222
1.522HisLeu: 1.522 ± 0.367
0.406HisMet: 0.406 ± 0.162
1.218HisAsn: 1.218 ± 0.439
1.218HisPro: 1.218 ± 0.334
1.015HisGln: 1.015 ± 0.28
1.015HisArg: 1.015 ± 0.302
1.015HisSer: 1.015 ± 0.362
0.913HisThr: 0.913 ± 0.331
1.116HisVal: 1.116 ± 0.308
0.203HisTrp: 0.203 ± 0.133
0.507HisTyr: 0.507 ± 0.23
0.0HisXaa: 0.0 ± 0.0
Ile
4.87IleAla: 4.87 ± 0.686
0.304IleCys: 0.304 ± 0.181
3.044IleAsp: 3.044 ± 0.511
3.653IleGlu: 3.653 ± 0.631
1.42IlePhe: 1.42 ± 0.467
3.45IleGly: 3.45 ± 0.725
0.812IleHis: 0.812 ± 0.247
2.739IleIle: 2.739 ± 0.436
2.739IleLys: 2.739 ± 0.768
2.739IleLeu: 2.739 ± 0.498
1.319IleMet: 1.319 ± 0.406
3.348IleAsn: 3.348 ± 0.479
2.537IlePro: 2.537 ± 0.522
1.623IleGln: 1.623 ± 0.415
4.566IleArg: 4.566 ± 0.515
4.16IleSer: 4.16 ± 0.612
4.972IleThr: 4.972 ± 0.853
3.348IleVal: 3.348 ± 0.59
0.812IleTrp: 0.812 ± 0.276
1.725IleTyr: 1.725 ± 0.451
0.0IleXaa: 0.0 ± 0.0
Lys
4.769LysAla: 4.769 ± 0.799
0.101LysCys: 0.101 ± 0.107
2.029LysAsp: 2.029 ± 0.463
3.044LysGlu: 3.044 ± 0.544
2.537LysPhe: 2.537 ± 0.636
2.942LysGly: 2.942 ± 0.568
1.319LysHis: 1.319 ± 0.441
2.232LysIle: 2.232 ± 0.492
3.957LysLys: 3.957 ± 0.74
5.885LysLeu: 5.885 ± 0.936
0.913LysMet: 0.913 ± 0.344
3.551LysAsn: 3.551 ± 0.642
2.942LysPro: 2.942 ± 0.607
1.522LysGln: 1.522 ± 0.44
4.464LysArg: 4.464 ± 0.69
2.739LysSer: 2.739 ± 0.577
4.058LysThr: 4.058 ± 0.7
4.16LysVal: 4.16 ± 0.756
0.71LysTrp: 0.71 ± 0.292
2.739LysTyr: 2.739 ± 0.517
0.0LysXaa: 0.0 ± 0.0
Leu
9.74LeuAla: 9.74 ± 0.992
0.507LeuCys: 0.507 ± 0.232
4.464LeuAsp: 4.464 ± 0.753
5.885LeuGlu: 5.885 ± 0.803
3.754LeuPhe: 3.754 ± 0.684
4.058LeuGly: 4.058 ± 0.692
2.029LeuHis: 2.029 ± 0.561
4.667LeuIle: 4.667 ± 0.679
5.276LeuLys: 5.276 ± 0.896
5.783LeuLeu: 5.783 ± 0.945
2.942LeuMet: 2.942 ± 0.563
5.073LeuAsn: 5.073 ± 0.584
3.754LeuPro: 3.754 ± 0.71
2.942LeuGln: 2.942 ± 0.526
4.667LeuArg: 4.667 ± 0.521
7.001LeuSer: 7.001 ± 0.869
6.899LeuThr: 6.899 ± 0.827
4.87LeuVal: 4.87 ± 0.741
0.913LeuTrp: 0.913 ± 0.273
2.841LeuTyr: 2.841 ± 0.582
0.0LeuXaa: 0.0 ± 0.0
Met
3.044MetAla: 3.044 ± 0.445
0.507MetCys: 0.507 ± 0.224
0.71MetAsp: 0.71 ± 0.321
1.319MetGlu: 1.319 ± 0.436
0.913MetPhe: 0.913 ± 0.279
1.116MetGly: 1.116 ± 0.286
0.71MetHis: 0.71 ± 0.297
1.116MetIle: 1.116 ± 0.332
1.116MetLys: 1.116 ± 0.365
2.841MetLeu: 2.841 ± 0.622
1.015MetMet: 1.015 ± 0.285
1.826MetAsn: 1.826 ± 0.467
1.319MetPro: 1.319 ± 0.373
1.116MetGln: 1.116 ± 0.321
1.826MetArg: 1.826 ± 0.439
2.029MetSer: 2.029 ± 0.396
2.739MetThr: 2.739 ± 0.628
1.522MetVal: 1.522 ± 0.35
0.203MetTrp: 0.203 ± 0.135
0.913MetTyr: 0.913 ± 0.207
0.0MetXaa: 0.0 ± 0.0
Asn
3.45AsnAla: 3.45 ± 0.64
0.304AsnCys: 0.304 ± 0.181
2.942AsnAsp: 2.942 ± 0.631
2.232AsnGlu: 2.232 ± 0.466
1.218AsnPhe: 1.218 ± 0.359
4.363AsnGly: 4.363 ± 0.828
0.71AsnHis: 0.71 ± 0.229
2.638AsnIle: 2.638 ± 0.528
2.232AsnLys: 2.232 ± 0.54
2.537AsnLeu: 2.537 ± 0.426
1.015AsnMet: 1.015 ± 0.286
1.725AsnAsn: 1.725 ± 0.251
2.029AsnPro: 2.029 ± 0.57
1.319AsnGln: 1.319 ± 0.384
3.145AsnArg: 3.145 ± 0.592
2.435AsnSer: 2.435 ± 0.725
1.928AsnThr: 1.928 ± 0.401
2.739AsnVal: 2.739 ± 0.524
0.304AsnTrp: 0.304 ± 0.176
2.131AsnTyr: 2.131 ± 0.553
0.0AsnXaa: 0.0 ± 0.0
Pro
4.261ProAla: 4.261 ± 0.7
0.203ProCys: 0.203 ± 0.16
3.247ProAsp: 3.247 ± 0.671
3.145ProGlu: 3.145 ± 0.635
1.319ProPhe: 1.319 ± 0.545
2.435ProGly: 2.435 ± 0.452
1.015ProHis: 1.015 ± 0.369
1.116ProIle: 1.116 ± 0.289
2.435ProLys: 2.435 ± 0.508
3.551ProLeu: 3.551 ± 0.551
0.609ProMet: 0.609 ± 0.239
1.116ProAsn: 1.116 ± 0.417
1.319ProPro: 1.319 ± 0.323
1.42ProGln: 1.42 ± 0.362
2.537ProArg: 2.537 ± 0.686
2.131ProSer: 2.131 ± 0.537
1.725ProThr: 1.725 ± 0.408
4.769ProVal: 4.769 ± 0.784
0.609ProTrp: 0.609 ± 0.244
1.015ProTyr: 1.015 ± 0.274
0.0ProXaa: 0.0 ± 0.0
Gln
3.957GlnAla: 3.957 ± 1.278
0.203GlnCys: 0.203 ± 0.158
2.739GlnAsp: 2.739 ± 0.665
2.537GlnGlu: 2.537 ± 0.518
1.015GlnPhe: 1.015 ± 0.261
1.928GlnGly: 1.928 ± 0.315
0.71GlnHis: 0.71 ± 0.254
2.131GlnIle: 2.131 ± 0.659
2.334GlnLys: 2.334 ± 0.396
4.16GlnLeu: 4.16 ± 0.569
1.015GlnMet: 1.015 ± 0.29
1.218GlnAsn: 1.218 ± 0.278
1.623GlnPro: 1.623 ± 0.428
2.435GlnGln: 2.435 ± 0.583
3.856GlnArg: 3.856 ± 0.792
2.334GlnSer: 2.334 ± 0.491
1.826GlnThr: 1.826 ± 0.377
1.826GlnVal: 1.826 ± 0.407
0.406GlnTrp: 0.406 ± 0.241
0.609GlnTyr: 0.609 ± 0.296
0.0GlnXaa: 0.0 ± 0.0
Arg
5.58ArgAla: 5.58 ± 0.653
0.812ArgCys: 0.812 ± 0.238
3.348ArgAsp: 3.348 ± 0.388
4.363ArgGlu: 4.363 ± 0.593
1.826ArgPhe: 1.826 ± 0.485
3.856ArgGly: 3.856 ± 0.823
1.725ArgHis: 1.725 ± 0.484
3.754ArgIle: 3.754 ± 0.675
3.957ArgLys: 3.957 ± 0.595
6.392ArgLeu: 6.392 ± 0.867
1.623ArgMet: 1.623 ± 0.471
2.638ArgAsn: 2.638 ± 0.565
2.131ArgPro: 2.131 ± 0.436
3.348ArgGln: 3.348 ± 0.734
4.464ArgArg: 4.464 ± 0.712
2.841ArgSer: 2.841 ± 0.572
2.739ArgThr: 2.739 ± 0.485
4.769ArgVal: 4.769 ± 0.905
1.218ArgTrp: 1.218 ± 0.295
2.537ArgTyr: 2.537 ± 0.515
0.0ArgXaa: 0.0 ± 0.0
Ser
6.494SerAla: 6.494 ± 1.014
0.507SerCys: 0.507 ± 0.205
4.16SerAsp: 4.16 ± 0.618
4.566SerGlu: 4.566 ± 0.681
1.928SerPhe: 1.928 ± 0.48
4.16SerGly: 4.16 ± 0.69
1.319SerHis: 1.319 ± 0.629
2.942SerIle: 2.942 ± 0.507
3.247SerLys: 3.247 ± 0.741
6.494SerLeu: 6.494 ± 1.233
1.928SerMet: 1.928 ± 0.419
2.334SerAsn: 2.334 ± 0.534
2.232SerPro: 2.232 ± 0.767
2.334SerGln: 2.334 ± 0.483
3.754SerArg: 3.754 ± 0.705
3.247SerSer: 3.247 ± 1.057
3.856SerThr: 3.856 ± 0.751
4.769SerVal: 4.769 ± 0.793
0.71SerTrp: 0.71 ± 0.176
1.42SerTyr: 1.42 ± 0.352
0.0SerXaa: 0.0 ± 0.0
Thr
7.407ThrAla: 7.407 ± 1.581
0.71ThrCys: 0.71 ± 0.308
4.566ThrAsp: 4.566 ± 0.632
3.044ThrGlu: 3.044 ± 0.593
2.232ThrPhe: 2.232 ± 0.419
6.696ThrGly: 6.696 ± 1.007
0.812ThrHis: 0.812 ± 0.247
3.653ThrIle: 3.653 ± 0.671
3.145ThrLys: 3.145 ± 0.845
6.899ThrLeu: 6.899 ± 1.005
2.232ThrMet: 2.232 ± 0.393
1.623ThrAsn: 1.623 ± 0.448
3.348ThrPro: 3.348 ± 0.541
1.725ThrGln: 1.725 ± 0.515
3.957ThrArg: 3.957 ± 0.612
3.856ThrSer: 3.856 ± 0.514
4.261ThrThr: 4.261 ± 1.024
4.058ThrVal: 4.058 ± 0.602
0.507ThrTrp: 0.507 ± 0.213
1.116ThrTyr: 1.116 ± 0.347
0.0ThrXaa: 0.0 ± 0.0
Val
6.291ValAla: 6.291 ± 1.147
1.218ValCys: 1.218 ± 0.428
3.45ValAsp: 3.45 ± 0.593
4.769ValGlu: 4.769 ± 0.755
2.435ValPhe: 2.435 ± 0.423
3.856ValGly: 3.856 ± 0.717
0.913ValHis: 0.913 ± 0.303
4.566ValIle: 4.566 ± 0.792
4.058ValLys: 4.058 ± 0.554
5.783ValLeu: 5.783 ± 0.959
2.435ValMet: 2.435 ± 0.406
2.638ValAsn: 2.638 ± 0.56
2.232ValPro: 2.232 ± 0.391
2.334ValGln: 2.334 ± 0.508
2.537ValArg: 2.537 ± 0.499
5.175ValSer: 5.175 ± 0.636
5.58ValThr: 5.58 ± 0.938
4.667ValVal: 4.667 ± 0.611
0.507ValTrp: 0.507 ± 0.216
1.42ValTyr: 1.42 ± 0.45
0.0ValXaa: 0.0 ± 0.0
Trp
1.218TrpAla: 1.218 ± 0.298
0.203TrpCys: 0.203 ± 0.129
0.913TrpAsp: 0.913 ± 0.282
1.015TrpGlu: 1.015 ± 0.248
0.507TrpPhe: 0.507 ± 0.228
0.406TrpGly: 0.406 ± 0.221
0.71TrpHis: 0.71 ± 0.231
0.71TrpIle: 0.71 ± 0.29
0.609TrpLys: 0.609 ± 0.292
1.826TrpLeu: 1.826 ± 0.454
0.609TrpMet: 0.609 ± 0.279
0.71TrpAsn: 0.71 ± 0.324
1.116TrpPro: 1.116 ± 0.391
0.406TrpGln: 0.406 ± 0.177
1.319TrpArg: 1.319 ± 0.354
0.609TrpSer: 0.609 ± 0.223
0.406TrpThr: 0.406 ± 0.191
0.507TrpVal: 0.507 ± 0.209
0.507TrpTrp: 0.507 ± 0.195
0.609TrpTyr: 0.609 ± 0.229
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.739TyrAla: 2.739 ± 0.634
0.101TyrCys: 0.101 ± 0.087
1.42TyrAsp: 1.42 ± 0.613
2.638TyrGlu: 2.638 ± 0.528
1.218TyrPhe: 1.218 ± 0.378
2.435TyrGly: 2.435 ± 0.441
1.319TyrHis: 1.319 ± 0.362
3.044TyrIle: 3.044 ± 0.497
0.71TyrLys: 0.71 ± 0.328
2.131TyrLeu: 2.131 ± 0.433
1.116TyrMet: 1.116 ± 0.275
0.913TyrAsn: 0.913 ± 0.271
1.218TyrPro: 1.218 ± 0.401
1.826TyrGln: 1.826 ± 0.476
1.826TyrArg: 1.826 ± 0.455
1.826TyrSer: 1.826 ± 0.413
1.522TyrThr: 1.522 ± 0.479
1.725TyrVal: 1.725 ± 0.428
1.015TyrTrp: 1.015 ± 0.263
0.913TyrTyr: 0.913 ± 0.33
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 41 proteins (9857 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski