Amino acid dipepetide frequency for Listeria phage PSU-VKH-LP019

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.301AlaAla: 4.301 ± 0.984
0.414AlaCys: 0.414 ± 0.213
4.384AlaAsp: 4.384 ± 0.732
5.542AlaGlu: 5.542 ± 0.838
1.82AlaPhe: 1.82 ± 0.417
4.715AlaGly: 4.715 ± 0.913
1.158AlaHis: 1.158 ± 0.339
4.797AlaIle: 4.797 ± 0.569
6.617AlaLys: 6.617 ± 0.733
5.211AlaLeu: 5.211 ± 0.842
2.151AlaMet: 2.151 ± 0.35
3.722AlaAsn: 3.722 ± 0.914
2.233AlaPro: 2.233 ± 0.4
2.73AlaGln: 2.73 ± 0.485
2.068AlaArg: 2.068 ± 0.418
4.797AlaSer: 4.797 ± 0.734
3.639AlaThr: 3.639 ± 0.478
2.812AlaVal: 2.812 ± 0.476
0.827AlaTrp: 0.827 ± 0.257
1.82AlaTyr: 1.82 ± 0.474
0.0AlaXaa: 0.0 ± 0.0
Cys
0.579CysAla: 0.579 ± 0.233
0.0CysCys: 0.0 ± 0.0
0.579CysAsp: 0.579 ± 0.223
0.827CysGlu: 0.827 ± 0.285
0.248CysPhe: 0.248 ± 0.148
1.489CysGly: 1.489 ± 0.463
0.083CysHis: 0.083 ± 0.086
0.165CysIle: 0.165 ± 0.117
0.993CysLys: 0.993 ± 0.292
0.331CysLeu: 0.331 ± 0.188
0.496CysMet: 0.496 ± 0.2
0.083CysAsn: 0.083 ± 0.081
0.662CysPro: 0.662 ± 0.258
0.083CysGln: 0.083 ± 0.081
0.496CysArg: 0.496 ± 0.227
0.579CysSer: 0.579 ± 0.219
0.165CysThr: 0.165 ± 0.126
0.414CysVal: 0.414 ± 0.195
0.083CysTrp: 0.083 ± 0.092
0.496CysTyr: 0.496 ± 0.217
0.0CysXaa: 0.0 ± 0.0
Asp
3.639AspAla: 3.639 ± 0.515
0.579AspCys: 0.579 ± 0.235
3.888AspAsp: 3.888 ± 0.564
5.624AspGlu: 5.624 ± 0.979
2.895AspPhe: 2.895 ± 0.57
4.632AspGly: 4.632 ± 0.579
0.414AspHis: 0.414 ± 0.17
4.218AspIle: 4.218 ± 0.547
6.286AspLys: 6.286 ± 0.892
5.294AspLeu: 5.294 ± 0.634
1.406AspMet: 1.406 ± 0.268
3.391AspAsn: 3.391 ± 0.622
1.075AspPro: 1.075 ± 0.302
0.91AspGln: 0.91 ± 0.27
1.489AspArg: 1.489 ± 0.378
4.136AspSer: 4.136 ± 0.525
2.812AspThr: 2.812 ± 0.557
3.474AspVal: 3.474 ± 0.525
0.993AspTrp: 0.993 ± 0.269
2.399AspTyr: 2.399 ± 0.426
0.0AspXaa: 0.0 ± 0.0
Glu
5.128GluAla: 5.128 ± 0.635
0.827GluCys: 0.827 ± 0.388
3.226GluAsp: 3.226 ± 0.465
7.361GluGlu: 7.361 ± 1.169
3.557GluPhe: 3.557 ± 0.409
3.474GluGly: 3.474 ± 0.592
1.572GluHis: 1.572 ± 0.408
7.196GluIle: 7.196 ± 0.841
6.865GluLys: 6.865 ± 0.898
7.61GluLeu: 7.61 ± 0.924
2.895GluMet: 2.895 ± 0.445
4.632GluAsn: 4.632 ± 0.734
2.316GluPro: 2.316 ± 0.514
4.88GluGln: 4.88 ± 0.618
4.218GluArg: 4.218 ± 0.713
4.053GluSer: 4.053 ± 0.594
4.384GluThr: 4.384 ± 0.639
6.617GluVal: 6.617 ± 0.8
1.406GluTrp: 1.406 ± 0.302
2.978GluTyr: 2.978 ± 0.597
0.0GluXaa: 0.0 ± 0.0
Phe
2.564PheAla: 2.564 ± 0.354
0.579PheCys: 0.579 ± 0.222
3.06PheAsp: 3.06 ± 0.535
3.639PheGlu: 3.639 ± 0.602
1.572PhePhe: 1.572 ± 0.502
2.316PheGly: 2.316 ± 0.426
0.248PheHis: 0.248 ± 0.126
3.309PheIle: 3.309 ± 0.573
3.639PheLys: 3.639 ± 0.427
2.564PheLeu: 2.564 ± 0.503
0.91PheMet: 0.91 ± 0.307
2.895PheAsn: 2.895 ± 0.44
0.744PhePro: 0.744 ± 0.298
0.993PheGln: 0.993 ± 0.313
1.323PheArg: 1.323 ± 0.398
2.151PheSer: 2.151 ± 0.447
2.647PheThr: 2.647 ± 0.547
3.06PheVal: 3.06 ± 0.639
0.496PheTrp: 0.496 ± 0.241
0.91PheTyr: 0.91 ± 0.316
0.0PheXaa: 0.0 ± 0.0
Gly
3.557GlyAla: 3.557 ± 0.606
0.662GlyCys: 0.662 ± 0.308
3.309GlyAsp: 3.309 ± 0.482
3.309GlyGlu: 3.309 ± 0.534
2.564GlyPhe: 2.564 ± 0.48
3.557GlyGly: 3.557 ± 0.513
0.414GlyHis: 0.414 ± 0.207
3.722GlyIle: 3.722 ± 0.574
5.707GlyLys: 5.707 ± 0.608
5.542GlyLeu: 5.542 ± 0.706
1.489GlyMet: 1.489 ± 0.448
2.812GlyAsn: 2.812 ± 0.54
0.993GlyPro: 0.993 ± 0.323
2.068GlyGln: 2.068 ± 0.387
1.985GlyArg: 1.985 ± 0.346
3.391GlySer: 3.391 ± 0.572
3.805GlyThr: 3.805 ± 0.72
4.136GlyVal: 4.136 ± 0.549
1.075GlyTrp: 1.075 ± 0.252
3.226GlyTyr: 3.226 ± 0.484
0.0GlyXaa: 0.0 ± 0.0
His
1.075HisAla: 1.075 ± 0.322
0.165HisCys: 0.165 ± 0.129
0.91HisAsp: 0.91 ± 0.296
1.82HisGlu: 1.82 ± 0.524
0.662HisPhe: 0.662 ± 0.289
0.744HisGly: 0.744 ± 0.24
0.496HisHis: 0.496 ± 0.311
0.827HisIle: 0.827 ± 0.267
0.827HisLys: 0.827 ± 0.242
1.323HisLeu: 1.323 ± 0.35
0.248HisMet: 0.248 ± 0.144
0.579HisAsn: 0.579 ± 0.282
0.331HisPro: 0.331 ± 0.16
0.248HisGln: 0.248 ± 0.147
0.662HisArg: 0.662 ± 0.225
0.827HisSer: 0.827 ± 0.268
0.579HisThr: 0.579 ± 0.314
0.827HisVal: 0.827 ± 0.257
0.165HisTrp: 0.165 ± 0.117
0.414HisTyr: 0.414 ± 0.18
0.0HisXaa: 0.0 ± 0.0
Ile
5.128IleAla: 5.128 ± 0.717
0.579IleCys: 0.579 ± 0.225
5.707IleAsp: 5.707 ± 0.802
6.452IleGlu: 6.452 ± 0.728
3.143IlePhe: 3.143 ± 0.68
3.557IleGly: 3.557 ± 0.486
0.993IleHis: 0.993 ± 0.262
5.045IleIle: 5.045 ± 0.869
6.121IleLys: 6.121 ± 0.684
3.888IleLeu: 3.888 ± 0.593
1.158IleMet: 1.158 ± 0.271
5.873IleAsn: 5.873 ± 0.667
2.151IlePro: 2.151 ± 0.396
3.226IleGln: 3.226 ± 0.701
2.399IleArg: 2.399 ± 0.382
4.549IleSer: 4.549 ± 0.687
4.797IleThr: 4.797 ± 0.757
4.301IleVal: 4.301 ± 0.714
0.744IleTrp: 0.744 ± 0.254
2.73IleTyr: 2.73 ± 0.478
0.0IleXaa: 0.0 ± 0.0
Lys
6.865LysAla: 6.865 ± 1.128
0.496LysCys: 0.496 ± 0.222
4.467LysAsp: 4.467 ± 0.608
8.602LysGlu: 8.602 ± 0.891
2.73LysPhe: 2.73 ± 0.455
4.797LysGly: 4.797 ± 0.608
1.737LysHis: 1.737 ± 0.338
6.865LysIle: 6.865 ± 0.691
7.775LysLys: 7.775 ± 0.893
8.023LysLeu: 8.023 ± 0.807
2.233LysMet: 2.233 ± 0.391
7.113LysAsn: 7.113 ± 0.86
2.481LysPro: 2.481 ± 0.608
4.136LysGln: 4.136 ± 0.587
3.722LysArg: 3.722 ± 0.602
5.211LysSer: 5.211 ± 0.713
6.617LysThr: 6.617 ± 1.069
4.384LysVal: 4.384 ± 0.512
1.158LysTrp: 1.158 ± 0.311
3.805LysTyr: 3.805 ± 0.546
0.0LysXaa: 0.0 ± 0.0
Leu
5.294LeuAla: 5.294 ± 0.68
0.496LeuCys: 0.496 ± 0.267
4.467LeuAsp: 4.467 ± 0.668
6.7LeuGlu: 6.7 ± 0.828
2.978LeuPhe: 2.978 ± 0.645
4.136LeuGly: 4.136 ± 0.529
0.579LeuHis: 0.579 ± 0.241
6.369LeuIle: 6.369 ± 0.901
6.948LeuLys: 6.948 ± 0.593
5.707LeuLeu: 5.707 ± 0.858
1.82LeuMet: 1.82 ± 0.493
6.369LeuAsn: 6.369 ± 0.655
2.316LeuPro: 2.316 ± 0.373
2.895LeuGln: 2.895 ± 0.488
2.481LeuArg: 2.481 ± 0.387
5.707LeuSer: 5.707 ± 0.733
4.88LeuThr: 4.88 ± 0.647
4.136LeuVal: 4.136 ± 0.455
0.496LeuTrp: 0.496 ± 0.214
3.143LeuTyr: 3.143 ± 0.526
0.0LeuXaa: 0.0 ± 0.0
Met
1.489MetAla: 1.489 ± 0.341
0.165MetCys: 0.165 ± 0.105
1.489MetAsp: 1.489 ± 0.324
1.82MetGlu: 1.82 ± 0.359
0.827MetPhe: 0.827 ± 0.241
1.323MetGly: 1.323 ± 0.325
0.248MetHis: 0.248 ± 0.132
1.737MetIle: 1.737 ± 0.447
2.647MetLys: 2.647 ± 0.428
1.489MetLeu: 1.489 ± 0.326
0.827MetMet: 0.827 ± 0.294
1.82MetAsn: 1.82 ± 0.395
0.579MetPro: 0.579 ± 0.192
1.323MetGln: 1.323 ± 0.35
1.075MetArg: 1.075 ± 0.279
2.151MetSer: 2.151 ± 0.454
2.316MetThr: 2.316 ± 0.463
0.827MetVal: 0.827 ± 0.323
0.165MetTrp: 0.165 ± 0.159
0.993MetTyr: 0.993 ± 0.282
0.0MetXaa: 0.0 ± 0.0
Asn
3.888AsnAla: 3.888 ± 0.799
0.827AsnCys: 0.827 ± 0.311
3.722AsnAsp: 3.722 ± 0.455
5.873AsnGlu: 5.873 ± 0.595
2.647AsnPhe: 2.647 ± 0.487
4.053AsnGly: 4.053 ± 0.585
1.323AsnHis: 1.323 ± 0.322
4.136AsnIle: 4.136 ± 0.641
6.203AsnLys: 6.203 ± 0.922
4.467AsnLeu: 4.467 ± 0.622
1.737AsnMet: 1.737 ± 0.395
4.797AsnAsn: 4.797 ± 0.69
2.151AsnPro: 2.151 ± 0.455
2.481AsnGln: 2.481 ± 0.559
2.895AsnArg: 2.895 ± 0.481
4.136AsnSer: 4.136 ± 0.535
2.812AsnThr: 2.812 ± 0.642
3.722AsnVal: 3.722 ± 0.557
0.827AsnTrp: 0.827 ± 0.286
2.564AsnTyr: 2.564 ± 0.508
0.0AsnXaa: 0.0 ± 0.0
Pro
2.068ProAla: 2.068 ± 0.387
0.083ProCys: 0.083 ± 0.09
2.068ProAsp: 2.068 ± 0.462
1.985ProGlu: 1.985 ± 0.448
1.241ProPhe: 1.241 ± 0.349
1.406ProGly: 1.406 ± 0.361
0.248ProHis: 0.248 ± 0.144
1.82ProIle: 1.82 ± 0.301
1.902ProLys: 1.902 ± 0.434
2.068ProLeu: 2.068 ± 0.296
0.248ProMet: 0.248 ± 0.158
1.737ProAsn: 1.737 ± 0.346
1.158ProPro: 1.158 ± 0.413
0.827ProGln: 0.827 ± 0.243
0.91ProArg: 0.91 ± 0.251
1.82ProSer: 1.82 ± 0.295
1.985ProThr: 1.985 ± 0.432
2.068ProVal: 2.068 ± 0.435
0.165ProTrp: 0.165 ± 0.117
0.331ProTyr: 0.331 ± 0.182
0.0ProXaa: 0.0 ± 0.0
Gln
2.647GlnAla: 2.647 ± 0.454
0.414GlnCys: 0.414 ± 0.185
1.406GlnAsp: 1.406 ± 0.322
2.895GlnGlu: 2.895 ± 0.58
1.572GlnPhe: 1.572 ± 0.397
1.489GlnGly: 1.489 ± 0.269
0.414GlnHis: 0.414 ± 0.232
2.978GlnIle: 2.978 ± 0.456
5.542GlnLys: 5.542 ± 0.882
3.143GlnLeu: 3.143 ± 0.669
0.744GlnMet: 0.744 ± 0.233
2.564GlnAsn: 2.564 ± 0.534
0.579GlnPro: 0.579 ± 0.22
2.564GlnGln: 2.564 ± 0.434
1.654GlnArg: 1.654 ± 0.392
2.151GlnSer: 2.151 ± 0.42
2.481GlnThr: 2.481 ± 0.449
2.151GlnVal: 2.151 ± 0.453
0.496GlnTrp: 0.496 ± 0.205
1.406GlnTyr: 1.406 ± 0.296
0.0GlnXaa: 0.0 ± 0.0
Arg
1.985ArgAla: 1.985 ± 0.354
0.165ArgCys: 0.165 ± 0.11
2.73ArgAsp: 2.73 ± 0.581
3.391ArgGlu: 3.391 ± 0.517
1.406ArgPhe: 1.406 ± 0.356
1.737ArgGly: 1.737 ± 0.371
0.662ArgHis: 0.662 ± 0.256
2.481ArgIle: 2.481 ± 0.556
3.639ArgLys: 3.639 ± 0.545
2.812ArgLeu: 2.812 ± 0.609
0.91ArgMet: 0.91 ± 0.317
2.399ArgAsn: 2.399 ± 0.432
0.827ArgPro: 0.827 ± 0.259
0.91ArgGln: 0.91 ± 0.23
1.406ArgArg: 1.406 ± 0.53
2.068ArgSer: 2.068 ± 0.469
2.233ArgThr: 2.233 ± 0.476
2.895ArgVal: 2.895 ± 0.524
0.579ArgTrp: 0.579 ± 0.184
2.068ArgTyr: 2.068 ± 0.479
0.0ArgXaa: 0.0 ± 0.0
Ser
3.722SerAla: 3.722 ± 0.684
0.662SerCys: 0.662 ± 0.271
4.136SerAsp: 4.136 ± 0.689
4.467SerGlu: 4.467 ± 0.506
2.564SerPhe: 2.564 ± 0.579
5.128SerGly: 5.128 ± 0.748
0.827SerHis: 0.827 ± 0.274
5.294SerIle: 5.294 ± 0.676
7.113SerLys: 7.113 ± 1.026
5.294SerLeu: 5.294 ± 0.683
1.572SerMet: 1.572 ± 0.426
3.474SerAsn: 3.474 ± 0.451
0.91SerPro: 0.91 ± 0.19
1.82SerGln: 1.82 ± 0.334
2.233SerArg: 2.233 ± 0.512
3.226SerSer: 3.226 ± 0.517
3.391SerThr: 3.391 ± 0.44
3.805SerVal: 3.805 ± 0.691
0.248SerTrp: 0.248 ± 0.132
2.647SerTyr: 2.647 ± 0.438
0.0SerXaa: 0.0 ± 0.0
Thr
4.632ThrAla: 4.632 ± 0.707
0.165ThrCys: 0.165 ± 0.106
2.812ThrAsp: 2.812 ± 0.405
4.715ThrGlu: 4.715 ± 0.752
2.73ThrPhe: 2.73 ± 0.456
2.895ThrGly: 2.895 ± 0.607
0.496ThrHis: 0.496 ± 0.168
4.549ThrIle: 4.549 ± 0.617
5.955ThrLys: 5.955 ± 0.557
4.715ThrLeu: 4.715 ± 0.54
1.737ThrMet: 1.737 ± 0.41
4.136ThrAsn: 4.136 ± 0.581
2.233ThrPro: 2.233 ± 0.37
2.73ThrGln: 2.73 ± 0.546
1.985ThrArg: 1.985 ± 0.434
3.226ThrSer: 3.226 ± 0.552
3.391ThrThr: 3.391 ± 0.608
3.557ThrVal: 3.557 ± 0.65
0.579ThrTrp: 0.579 ± 0.208
2.316ThrTyr: 2.316 ± 0.457
0.0ThrXaa: 0.0 ± 0.0
Val
4.549ValAla: 4.549 ± 0.558
0.579ValCys: 0.579 ± 0.234
3.97ValAsp: 3.97 ± 0.543
4.88ValGlu: 4.88 ± 0.512
2.647ValPhe: 2.647 ± 0.554
3.226ValGly: 3.226 ± 0.635
0.993ValHis: 0.993 ± 0.385
3.391ValIle: 3.391 ± 0.561
4.963ValLys: 4.963 ± 0.719
3.97ValLeu: 3.97 ± 0.662
1.406ValMet: 1.406 ± 0.439
3.391ValAsn: 3.391 ± 0.505
1.654ValPro: 1.654 ± 0.432
2.481ValGln: 2.481 ± 0.332
2.399ValArg: 2.399 ± 0.387
4.301ValSer: 4.301 ± 0.559
4.632ValThr: 4.632 ± 0.64
3.97ValVal: 3.97 ± 0.639
0.662ValTrp: 0.662 ± 0.215
1.985ValTyr: 1.985 ± 0.446
0.0ValXaa: 0.0 ± 0.0
Trp
0.414TrpAla: 0.414 ± 0.168
0.165TrpCys: 0.165 ± 0.112
1.241TrpAsp: 1.241 ± 0.304
1.075TrpGlu: 1.075 ± 0.378
0.331TrpPhe: 0.331 ± 0.145
0.579TrpGly: 0.579 ± 0.184
0.331TrpHis: 0.331 ± 0.18
0.827TrpIle: 0.827 ± 0.296
0.496TrpLys: 0.496 ± 0.252
1.406TrpLeu: 1.406 ± 0.324
0.414TrpMet: 0.414 ± 0.184
0.496TrpAsn: 0.496 ± 0.21
0.165TrpPro: 0.165 ± 0.119
0.414TrpGln: 0.414 ± 0.264
0.414TrpArg: 0.414 ± 0.181
1.241TrpSer: 1.241 ± 0.413
0.083TrpThr: 0.083 ± 0.082
0.331TrpVal: 0.331 ± 0.202
0.083TrpTrp: 0.083 ± 0.076
0.993TrpTyr: 0.993 ± 0.496
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.233TyrAla: 2.233 ± 0.358
0.827TyrCys: 0.827 ± 0.27
1.985TyrAsp: 1.985 ± 0.46
4.218TyrGlu: 4.218 ± 0.617
1.489TyrPhe: 1.489 ± 0.429
2.068TyrGly: 2.068 ± 0.476
0.496TyrHis: 0.496 ± 0.199
2.812TyrIle: 2.812 ± 0.647
2.978TyrLys: 2.978 ± 0.484
3.06TyrLeu: 3.06 ± 0.845
0.827TyrMet: 0.827 ± 0.222
3.06TyrAsn: 3.06 ± 0.477
0.579TyrPro: 0.579 ± 0.195
1.572TyrGln: 1.572 ± 0.348
1.489TyrArg: 1.489 ± 0.463
2.812TyrSer: 2.812 ± 0.567
1.902TyrThr: 1.902 ± 0.374
2.564TyrVal: 2.564 ± 0.429
0.248TyrTrp: 0.248 ± 0.145
2.151TyrTyr: 2.151 ± 0.422
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 66 proteins (12091 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski