Amino acid dipepetide frequency for Yersinia phage YpP-Y

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.974AlaAla: 8.974 ± 1.424
0.366AlaCys: 0.366 ± 0.163
5.494AlaAsp: 5.494 ± 0.689
5.677AlaGlu: 5.677 ± 0.734
2.839AlaPhe: 2.839 ± 0.478
6.776AlaGly: 6.776 ± 0.942
1.923AlaHis: 1.923 ± 0.348
4.945AlaIle: 4.945 ± 0.766
6.776AlaLys: 6.776 ± 0.735
8.424AlaLeu: 8.424 ± 1.11
2.747AlaMet: 2.747 ± 0.578
4.304AlaAsn: 4.304 ± 0.56
2.472AlaPro: 2.472 ± 0.343
3.754AlaGln: 3.754 ± 0.872
5.586AlaArg: 5.586 ± 0.649
5.219AlaSer: 5.219 ± 0.758
3.846AlaThr: 3.846 ± 0.641
4.578AlaVal: 4.578 ± 0.736
1.648AlaTrp: 1.648 ± 0.409
2.106AlaTyr: 2.106 ± 0.532
0.0AlaXaa: 0.0 ± 0.0
Cys
0.641CysAla: 0.641 ± 0.213
0.092CysCys: 0.092 ± 0.108
1.19CysAsp: 1.19 ± 0.441
0.275CysGlu: 0.275 ± 0.144
0.458CysPhe: 0.458 ± 0.206
1.007CysGly: 1.007 ± 0.352
0.824CysHis: 0.824 ± 0.273
0.641CysIle: 0.641 ± 0.31
0.458CysLys: 0.458 ± 0.221
0.549CysLeu: 0.549 ± 0.254
0.092CysMet: 0.092 ± 0.08
0.183CysAsn: 0.183 ± 0.157
0.275CysPro: 0.275 ± 0.152
0.458CysGln: 0.458 ± 0.152
0.549CysArg: 0.549 ± 0.265
0.458CysSer: 0.458 ± 0.204
0.366CysThr: 0.366 ± 0.204
0.824CysVal: 0.824 ± 0.338
0.366CysTrp: 0.366 ± 0.195
0.275CysTyr: 0.275 ± 0.153
0.0CysXaa: 0.0 ± 0.0
Asp
4.487AspAla: 4.487 ± 0.585
0.824AspCys: 0.824 ± 0.292
3.937AspAsp: 3.937 ± 0.61
4.761AspGlu: 4.761 ± 0.511
3.022AspPhe: 3.022 ± 0.443
5.769AspGly: 5.769 ± 0.767
0.916AspHis: 0.916 ± 0.289
3.022AspIle: 3.022 ± 0.568
4.121AspLys: 4.121 ± 0.608
4.487AspLeu: 4.487 ± 0.795
2.289AspMet: 2.289 ± 0.541
3.022AspAsn: 3.022 ± 0.384
3.113AspPro: 3.113 ± 0.673
1.648AspGln: 1.648 ± 0.436
3.48AspArg: 3.48 ± 0.723
4.395AspSer: 4.395 ± 0.748
4.487AspThr: 4.487 ± 0.512
5.219AspVal: 5.219 ± 0.608
0.916AspTrp: 0.916 ± 0.485
1.831AspTyr: 1.831 ± 0.317
0.0AspXaa: 0.0 ± 0.0
Glu
7.234GluAla: 7.234 ± 1.006
0.549GluCys: 0.549 ± 0.247
4.853GluAsp: 4.853 ± 0.729
6.501GluGlu: 6.501 ± 0.947
1.831GluPhe: 1.831 ± 0.391
5.128GluGly: 5.128 ± 0.595
1.374GluHis: 1.374 ± 0.372
3.48GluIle: 3.48 ± 0.611
3.846GluLys: 3.846 ± 0.563
6.501GluLeu: 6.501 ± 0.732
2.014GluMet: 2.014 ± 0.426
2.93GluAsn: 2.93 ± 0.463
1.831GluPro: 1.831 ± 0.441
3.205GluGln: 3.205 ± 0.975
3.571GluArg: 3.571 ± 0.462
4.761GluSer: 4.761 ± 0.718
3.663GluThr: 3.663 ± 0.553
4.304GluVal: 4.304 ± 0.57
1.374GluTrp: 1.374 ± 0.4
3.022GluTyr: 3.022 ± 0.578
0.0GluXaa: 0.0 ± 0.0
Phe
1.923PheAla: 1.923 ± 0.357
0.458PheCys: 0.458 ± 0.195
3.205PheAsp: 3.205 ± 0.577
2.564PheGlu: 2.564 ± 0.506
1.007PhePhe: 1.007 ± 0.304
3.296PheGly: 3.296 ± 0.516
0.916PheHis: 0.916 ± 0.339
1.557PheIle: 1.557 ± 0.44
3.296PheLys: 3.296 ± 0.505
3.296PheLeu: 3.296 ± 0.47
1.374PheMet: 1.374 ± 0.282
1.648PheAsn: 1.648 ± 0.459
1.374PhePro: 1.374 ± 0.455
1.19PheGln: 1.19 ± 0.308
2.198PheArg: 2.198 ± 0.519
1.557PheSer: 1.557 ± 0.452
2.472PheThr: 2.472 ± 0.408
1.831PheVal: 1.831 ± 0.418
0.275PheTrp: 0.275 ± 0.138
1.099PheTyr: 1.099 ± 0.302
0.0PheXaa: 0.0 ± 0.0
Gly
6.227GlyAla: 6.227 ± 0.963
1.007GlyCys: 1.007 ± 0.325
4.853GlyAsp: 4.853 ± 0.726
4.395GlyGlu: 4.395 ± 0.63
3.48GlyPhe: 3.48 ± 0.655
5.494GlyGly: 5.494 ± 0.811
1.282GlyHis: 1.282 ± 0.418
4.945GlyIle: 4.945 ± 0.678
5.402GlyLys: 5.402 ± 0.854
5.677GlyLeu: 5.677 ± 0.994
1.831GlyMet: 1.831 ± 0.375
3.022GlyAsn: 3.022 ± 0.638
1.282GlyPro: 1.282 ± 0.415
3.296GlyGln: 3.296 ± 0.558
4.395GlyArg: 4.395 ± 0.634
4.304GlySer: 4.304 ± 0.856
4.121GlyThr: 4.121 ± 0.571
4.212GlyVal: 4.212 ± 0.618
1.557GlyTrp: 1.557 ± 0.552
3.205GlyTyr: 3.205 ± 0.588
0.0GlyXaa: 0.0 ± 0.0
His
1.374HisAla: 1.374 ± 0.437
0.183HisCys: 0.183 ± 0.114
1.374HisAsp: 1.374 ± 0.334
1.557HisGlu: 1.557 ± 0.438
0.366HisPhe: 0.366 ± 0.143
1.099HisGly: 1.099 ± 0.333
0.458HisHis: 0.458 ± 0.167
2.381HisIle: 2.381 ± 0.599
1.099HisLys: 1.099 ± 0.299
2.106HisLeu: 2.106 ± 0.518
0.549HisMet: 0.549 ± 0.184
0.916HisAsn: 0.916 ± 0.283
0.366HisPro: 0.366 ± 0.162
0.183HisGln: 0.183 ± 0.122
0.824HisArg: 0.824 ± 0.269
1.282HisSer: 1.282 ± 0.28
0.641HisThr: 0.641 ± 0.222
1.465HisVal: 1.465 ± 0.403
0.366HisTrp: 0.366 ± 0.162
0.733HisTyr: 0.733 ± 0.273
0.0HisXaa: 0.0 ± 0.0
Ile
3.846IleAla: 3.846 ± 0.472
0.549IleCys: 0.549 ± 0.18
4.212IleAsp: 4.212 ± 0.56
3.022IleGlu: 3.022 ± 0.461
1.099IlePhe: 1.099 ± 0.271
3.754IleGly: 3.754 ± 0.477
1.465IleHis: 1.465 ± 0.452
3.113IleIle: 3.113 ± 0.536
3.754IleLys: 3.754 ± 0.687
4.121IleLeu: 4.121 ± 0.558
1.19IleMet: 1.19 ± 0.328
2.839IleAsn: 2.839 ± 0.591
2.93IlePro: 2.93 ± 0.377
1.923IleGln: 1.923 ± 0.453
3.937IleArg: 3.937 ± 0.717
2.839IleSer: 2.839 ± 0.419
2.655IleThr: 2.655 ± 0.434
3.113IleVal: 3.113 ± 0.392
0.549IleTrp: 0.549 ± 0.178
2.106IleTyr: 2.106 ± 0.314
0.0IleXaa: 0.0 ± 0.0
Lys
7.234LysAla: 7.234 ± 0.973
0.733LysCys: 0.733 ± 0.28
3.388LysAsp: 3.388 ± 0.474
5.036LysGlu: 5.036 ± 0.889
2.381LysPhe: 2.381 ± 0.416
4.761LysGly: 4.761 ± 0.769
1.557LysHis: 1.557 ± 0.373
2.289LysIle: 2.289 ± 0.532
4.212LysLys: 4.212 ± 0.792
5.769LysLeu: 5.769 ± 0.844
2.014LysMet: 2.014 ± 0.481
2.747LysAsn: 2.747 ± 0.385
2.472LysPro: 2.472 ± 0.584
3.022LysGln: 3.022 ± 0.685
4.212LysArg: 4.212 ± 0.59
4.578LysSer: 4.578 ± 0.745
3.296LysThr: 3.296 ± 0.477
4.853LysVal: 4.853 ± 0.645
1.007LysTrp: 1.007 ± 0.313
2.381LysTyr: 2.381 ± 0.394
0.0LysXaa: 0.0 ± 0.0
Leu
8.241LeuAla: 8.241 ± 0.87
0.458LeuCys: 0.458 ± 0.201
4.945LeuAsp: 4.945 ± 0.531
6.501LeuGlu: 6.501 ± 0.855
2.289LeuPhe: 2.289 ± 0.554
4.761LeuGly: 4.761 ± 0.784
1.282LeuHis: 1.282 ± 0.363
3.571LeuIle: 3.571 ± 0.653
5.769LeuLys: 5.769 ± 0.668
4.853LeuLeu: 4.853 ± 0.773
2.472LeuMet: 2.472 ± 0.493
3.754LeuAsn: 3.754 ± 0.523
2.839LeuPro: 2.839 ± 0.418
4.029LeuGln: 4.029 ± 0.735
6.135LeuArg: 6.135 ± 0.679
4.67LeuSer: 4.67 ± 0.648
4.487LeuThr: 4.487 ± 0.61
4.121LeuVal: 4.121 ± 0.562
1.374LeuTrp: 1.374 ± 0.448
2.839LeuTyr: 2.839 ± 0.458
0.0LeuXaa: 0.0 ± 0.0
Met
3.48MetAla: 3.48 ± 0.52
0.458MetCys: 0.458 ± 0.219
2.106MetAsp: 2.106 ± 0.42
1.923MetGlu: 1.923 ± 0.351
0.916MetPhe: 0.916 ± 0.298
2.381MetGly: 2.381 ± 0.403
0.458MetHis: 0.458 ± 0.21
1.282MetIle: 1.282 ± 0.332
1.19MetLys: 1.19 ± 0.302
3.022MetLeu: 3.022 ± 0.528
0.824MetMet: 0.824 ± 0.314
1.282MetAsn: 1.282 ± 0.375
1.099MetPro: 1.099 ± 0.284
1.19MetGln: 1.19 ± 0.339
1.465MetArg: 1.465 ± 0.329
1.74MetSer: 1.74 ± 0.452
1.648MetThr: 1.648 ± 0.36
2.198MetVal: 2.198 ± 0.453
0.183MetTrp: 0.183 ± 0.132
0.733MetTyr: 0.733 ± 0.278
0.0MetXaa: 0.0 ± 0.0
Asn
3.48AsnAla: 3.48 ± 0.422
0.641AsnCys: 0.641 ± 0.21
2.381AsnAsp: 2.381 ± 0.495
2.747AsnGlu: 2.747 ± 0.471
2.289AsnPhe: 2.289 ± 0.514
4.945AsnGly: 4.945 ± 0.782
0.549AsnHis: 0.549 ± 0.22
2.747AsnIle: 2.747 ± 0.536
2.564AsnLys: 2.564 ± 0.453
3.571AsnLeu: 3.571 ± 0.642
1.19AsnMet: 1.19 ± 0.373
1.831AsnAsn: 1.831 ± 0.473
2.839AsnPro: 2.839 ± 0.482
1.74AsnGln: 1.74 ± 0.383
2.381AsnArg: 2.381 ± 0.559
3.205AsnSer: 3.205 ± 0.849
1.74AsnThr: 1.74 ± 0.449
2.564AsnVal: 2.564 ± 0.47
0.733AsnTrp: 0.733 ± 0.229
1.74AsnTyr: 1.74 ± 0.401
0.0AsnXaa: 0.0 ± 0.0
Pro
3.022ProAla: 3.022 ± 0.465
0.366ProCys: 0.366 ± 0.206
2.747ProAsp: 2.747 ± 0.468
3.571ProGlu: 3.571 ± 0.807
1.282ProPhe: 1.282 ± 0.323
0.641ProGly: 0.641 ± 0.214
0.824ProHis: 0.824 ± 0.22
1.648ProIle: 1.648 ± 0.349
2.93ProLys: 2.93 ± 0.574
2.106ProLeu: 2.106 ± 0.389
1.282ProMet: 1.282 ± 0.351
2.289ProAsn: 2.289 ± 0.601
0.549ProPro: 0.549 ± 0.227
1.099ProGln: 1.099 ± 0.353
1.831ProArg: 1.831 ± 0.362
2.014ProSer: 2.014 ± 0.4
2.198ProThr: 2.198 ± 0.426
1.831ProVal: 1.831 ± 0.344
0.641ProTrp: 0.641 ± 0.213
1.282ProTyr: 1.282 ± 0.354
0.0ProXaa: 0.0 ± 0.0
Gln
3.937GlnAla: 3.937 ± 1.094
0.275GlnCys: 0.275 ± 0.169
1.648GlnAsp: 1.648 ± 0.303
2.747GlnGlu: 2.747 ± 0.559
2.472GlnPhe: 2.472 ± 0.414
2.014GlnGly: 2.014 ± 0.41
0.275GlnHis: 0.275 ± 0.156
1.74GlnIle: 1.74 ± 0.431
2.472GlnLys: 2.472 ± 0.425
4.029GlnLeu: 4.029 ± 0.577
1.374GlnMet: 1.374 ± 0.339
1.007GlnAsn: 1.007 ± 0.388
1.282GlnPro: 1.282 ± 0.449
1.74GlnGln: 1.74 ± 0.343
2.472GlnArg: 2.472 ± 0.588
2.198GlnSer: 2.198 ± 0.621
1.831GlnThr: 1.831 ± 0.484
2.198GlnVal: 2.198 ± 0.546
1.099GlnTrp: 1.099 ± 0.283
1.374GlnTyr: 1.374 ± 0.426
0.0GlnXaa: 0.0 ± 0.0
Arg
5.128ArgAla: 5.128 ± 0.63
0.916ArgCys: 0.916 ± 0.308
4.304ArgAsp: 4.304 ± 0.705
5.494ArgGlu: 5.494 ± 0.706
2.289ArgPhe: 2.289 ± 0.508
3.754ArgGly: 3.754 ± 0.543
1.099ArgHis: 1.099 ± 0.361
3.022ArgIle: 3.022 ± 0.455
4.761ArgLys: 4.761 ± 0.797
4.853ArgLeu: 4.853 ± 0.645
2.014ArgMet: 2.014 ± 0.445
2.93ArgAsn: 2.93 ± 0.605
1.74ArgPro: 1.74 ± 0.299
1.74ArgGln: 1.74 ± 0.432
2.655ArgArg: 2.655 ± 0.483
3.937ArgSer: 3.937 ± 0.558
2.564ArgThr: 2.564 ± 0.37
3.113ArgVal: 3.113 ± 0.613
0.824ArgTrp: 0.824 ± 0.26
1.557ArgTyr: 1.557 ± 0.412
0.0ArgXaa: 0.0 ± 0.0
Ser
5.769SerAla: 5.769 ± 0.795
0.641SerCys: 0.641 ± 0.225
5.311SerAsp: 5.311 ± 0.644
2.93SerGlu: 2.93 ± 0.451
2.381SerPhe: 2.381 ± 0.507
5.677SerGly: 5.677 ± 1.053
1.648SerHis: 1.648 ± 0.452
3.205SerIle: 3.205 ± 0.627
3.388SerLys: 3.388 ± 0.59
3.48SerLeu: 3.48 ± 0.563
1.648SerMet: 1.648 ± 0.478
3.205SerAsn: 3.205 ± 0.516
1.557SerPro: 1.557 ± 0.42
2.198SerGln: 2.198 ± 0.398
3.388SerArg: 3.388 ± 0.69
3.48SerSer: 3.48 ± 0.665
3.846SerThr: 3.846 ± 0.655
4.121SerVal: 4.121 ± 0.513
0.916SerTrp: 0.916 ± 0.255
2.289SerTyr: 2.289 ± 0.477
0.0SerXaa: 0.0 ± 0.0
Thr
4.578ThrAla: 4.578 ± 0.633
0.549ThrCys: 0.549 ± 0.227
3.205ThrAsp: 3.205 ± 0.447
4.121ThrGlu: 4.121 ± 0.658
2.106ThrPhe: 2.106 ± 0.439
5.494ThrGly: 5.494 ± 0.694
0.733ThrHis: 0.733 ± 0.214
3.388ThrIle: 3.388 ± 0.541
4.945ThrLys: 4.945 ± 0.575
4.029ThrLeu: 4.029 ± 0.598
1.648ThrMet: 1.648 ± 0.402
1.831ThrAsn: 1.831 ± 0.399
2.747ThrPro: 2.747 ± 0.581
1.923ThrGln: 1.923 ± 0.384
2.839ThrArg: 2.839 ± 0.477
3.754ThrSer: 3.754 ± 0.718
2.655ThrThr: 2.655 ± 0.475
2.839ThrVal: 2.839 ± 0.554
0.275ThrTrp: 0.275 ± 0.193
1.374ThrTyr: 1.374 ± 0.323
0.0ThrXaa: 0.0 ± 0.0
Val
5.402ValAla: 5.402 ± 0.462
0.366ValCys: 0.366 ± 0.17
3.571ValAsp: 3.571 ± 0.491
4.487ValGlu: 4.487 ± 0.805
2.747ValPhe: 2.747 ± 0.72
4.029ValGly: 4.029 ± 0.6
1.282ValHis: 1.282 ± 0.401
3.205ValIle: 3.205 ± 0.536
4.212ValLys: 4.212 ± 0.782
3.937ValLeu: 3.937 ± 0.65
2.014ValMet: 2.014 ± 0.422
3.113ValAsn: 3.113 ± 0.638
2.381ValPro: 2.381 ± 0.522
1.831ValGln: 1.831 ± 0.412
3.846ValArg: 3.846 ± 0.51
3.663ValSer: 3.663 ± 0.589
4.761ValThr: 4.761 ± 0.598
4.395ValVal: 4.395 ± 0.622
1.19ValTrp: 1.19 ± 0.363
1.923ValTyr: 1.923 ± 0.428
0.0ValXaa: 0.0 ± 0.0
Trp
0.733TrpAla: 0.733 ± 0.223
0.366TrpCys: 0.366 ± 0.207
0.458TrpAsp: 0.458 ± 0.22
1.282TrpGlu: 1.282 ± 0.315
0.458TrpPhe: 0.458 ± 0.203
0.916TrpGly: 0.916 ± 0.338
0.183TrpHis: 0.183 ± 0.158
0.733TrpIle: 0.733 ± 0.279
1.007TrpLys: 1.007 ± 0.269
1.557TrpLeu: 1.557 ± 0.404
0.183TrpMet: 0.183 ± 0.12
1.282TrpAsn: 1.282 ± 0.33
0.183TrpPro: 0.183 ± 0.122
0.641TrpGln: 0.641 ± 0.219
1.099TrpArg: 1.099 ± 0.358
0.916TrpSer: 0.916 ± 0.381
1.557TrpThr: 1.557 ± 0.361
1.74TrpVal: 1.74 ± 0.597
0.275TrpTrp: 0.275 ± 0.158
0.275TrpTyr: 0.275 ± 0.154
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.747TyrAla: 2.747 ± 0.584
0.183TyrCys: 0.183 ± 0.157
2.564TyrAsp: 2.564 ± 0.449
2.381TyrGlu: 2.381 ± 0.514
0.824TyrPhe: 0.824 ± 0.306
2.289TyrGly: 2.289 ± 0.423
0.275TyrHis: 0.275 ± 0.148
2.106TyrIle: 2.106 ± 0.408
1.74TyrLys: 1.74 ± 0.382
3.022TyrLeu: 3.022 ± 0.524
0.733TyrMet: 0.733 ± 0.244
1.74TyrAsn: 1.74 ± 0.321
0.824TyrPro: 0.824 ± 0.303
1.374TyrGln: 1.374 ± 0.398
1.831TyrArg: 1.831 ± 0.444
2.106TyrSer: 2.106 ± 0.377
2.198TyrThr: 2.198 ± 0.402
2.839TyrVal: 2.839 ± 0.514
0.366TyrTrp: 0.366 ± 0.225
0.733TyrTyr: 0.733 ± 0.307
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 46 proteins (10922 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski