Amino acid dipeptide frequency for Listeria phage PSU-VKH-LP040

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
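As an illustration, the calculation can be sketched in a few lines of Python. This is a minimal sketch, not the code used to build the database: it assumes sequences given as one-letter codes, counts overlapping pairs within each protein (never across protein boundaries), and normalises by the total number of dipeptides. The function name `dipeptide_frequencies` and the toy sequences are hypothetical.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille, pooled over all proteins.

    Assumption: frequencies are normalised by the total number of
    overlapping dipeptides; the database may normalise differently.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs inside one protein; pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from this proteome).
toy_proteins = ["MKKLA", "AKKA"]
freqs = dipeptide_frequencies(toy_proteins)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

In the toy input there are 7 dipeptides in total and KK occurs twice, so it accounts for 2 of 7 pairs.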

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with every amino acid equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
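A short sketch of the unit conversion and of the comparison against the uniform expectation follows. It assumes the 400-dipeptide baseline (1/400 = 0.25% = 2.5‰); the helper names `permille_to_fraction` and `classify` are hypothetical, and whether the page's colouring uses exactly this threshold is an assumption.

```python
# Uniform expectation for 400 equally likely dipeptides, in per mille.
UNIFORM_EXPECTATION_PERMILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %


def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def classify(value_permille, expected=UNIFORM_EXPECTATION_PERMILLE):
    """Label a dipeptide relative to the uniform expectation (assumed threshold)."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "as expected"


# Two values taken from the table: AlaAla and AlaCys.
for name, value in [("AlaAla", 6.815), ("AlaCys", 0.246)]:
    print(name, permille_to_fraction(value), classify(value))
```

With the table values, AlaAla (6.815‰) lies above the 2.5‰ baseline and AlaCys (0.246‰) lies below it.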

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.815 ± 2.417
AlaCys: 0.246 ± 0.159
AlaAsp: 4.927 ± 0.615
AlaGlu: 6.322 ± 0.656
AlaPhe: 2.627 ± 0.833
AlaGly: 5.748 ± 1.327
AlaHis: 0.657 ± 0.283
AlaIle: 6.158 ± 1.071
AlaLys: 8.786 ± 1.633
AlaLeu: 6.979 ± 1.182
AlaMet: 2.217 ± 0.414
AlaAsn: 5.173 ± 0.762
AlaPro: 1.971 ± 0.504
AlaGln: 2.463 ± 0.417
AlaArg: 2.217 ± 0.508
AlaSer: 4.68 ± 0.927
AlaThr: 3.777 ± 0.426
AlaVal: 5.501 ± 1.024
AlaTrp: 0.903 ± 0.264
AlaTyr: 2.627 ± 0.473
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.246 ± 0.141
CysCys: 0.246 ± 0.139
CysAsp: 0.821 ± 0.323
CysGlu: 0.575 ± 0.291
CysPhe: 0.082 ± 0.073
CysGly: 0.985 ± 0.262
CysHis: 0.0 ± 0.0
CysIle: 0.246 ± 0.14
CysLys: 0.411 ± 0.171
CysLeu: 0.575 ± 0.204
CysMet: 0.164 ± 0.115
CysAsn: 0.328 ± 0.202
CysPro: 0.164 ± 0.114
CysGln: 0.246 ± 0.154
CysArg: 0.493 ± 0.251
CysSer: 0.246 ± 0.152
CysThr: 0.082 ± 0.07
CysVal: 0.246 ± 0.153
CysTrp: 0.082 ± 0.071
CysTyr: 0.575 ± 0.252
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.105 ± 0.668
AspCys: 0.411 ± 0.186
AspAsp: 3.613 ± 0.689
AspGlu: 5.337 ± 0.879
AspPhe: 2.874 ± 0.38
AspGly: 4.105 ± 0.642
AspHis: 0.411 ± 0.176
AspIle: 4.927 ± 0.648
AspLys: 5.255 ± 0.571
AspLeu: 5.912 ± 0.597
AspMet: 1.971 ± 0.478
AspAsn: 2.956 ± 0.482
AspPro: 1.478 ± 0.318
AspGln: 0.985 ± 0.254
AspArg: 1.396 ± 0.386
AspSer: 3.859 ± 0.621
AspThr: 2.463 ± 0.387
AspVal: 3.531 ± 0.489
AspTrp: 0.411 ± 0.183
AspTyr: 2.792 ± 0.658
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.994 ± 0.856
GluCys: 0.821 ± 0.26
GluAsp: 3.613 ± 0.672
GluGlu: 5.748 ± 1.192
GluPhe: 3.366 ± 0.554
GluGly: 3.531 ± 0.53
GluHis: 1.396 ± 0.373
GluIle: 5.337 ± 0.768
GluLys: 7.554 ± 0.73
GluLeu: 7.965 ± 0.867
GluMet: 2.71 ± 0.535
GluAsn: 5.665 ± 0.693
GluPro: 1.067 ± 0.348
GluGln: 2.956 ± 0.621
GluArg: 3.366 ± 0.719
GluSer: 4.598 ± 0.505
GluThr: 4.598 ± 0.684
GluVal: 5.009 ± 0.789
GluTrp: 1.232 ± 0.396
GluTyr: 2.792 ± 0.483
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.71 ± 0.357
PheCys: 0.493 ± 0.244
PheAsp: 2.792 ± 0.673
PheGlu: 3.366 ± 0.633
PhePhe: 0.739 ± 0.242
PheGly: 1.806 ± 0.29
PheHis: 0.328 ± 0.134
PheIle: 2.874 ± 0.429
PheLys: 3.613 ± 0.637
PheLeu: 3.449 ± 0.454
PheMet: 0.575 ± 0.202
PheAsn: 2.217 ± 0.42
PhePro: 1.067 ± 0.311
PheGln: 1.232 ± 0.289
PheArg: 1.067 ± 0.323
PheSer: 2.627 ± 0.476
PheThr: 2.463 ± 0.495
PheVal: 3.284 ± 0.391
PheTrp: 0.164 ± 0.111
PheTyr: 1.067 ± 0.382
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.613 ± 0.905
GlyCys: 0.739 ± 0.266
GlyAsp: 3.038 ± 0.632
GlyGlu: 4.023 ± 0.495
GlyPhe: 2.545 ± 0.504
GlyGly: 3.284 ± 0.516
GlyHis: 0.657 ± 0.234
GlyIle: 3.941 ± 0.588
GlyLys: 5.665 ± 0.588
GlyLeu: 5.419 ± 0.83
GlyMet: 1.314 ± 0.384
GlyAsn: 4.105 ± 0.712
GlyPro: 0.246 ± 0.14
GlyGln: 1.56 ± 0.332
GlyArg: 1.888 ± 0.449
GlySer: 4.188 ± 0.957
GlyThr: 3.366 ± 0.771
GlyVal: 4.352 ± 0.611
GlyTrp: 0.493 ± 0.198
GlyTyr: 2.956 ± 0.655
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.739 ± 0.256
HisCys: 0.0 ± 0.0
HisAsp: 0.657 ± 0.242
HisGlu: 0.657 ± 0.256
HisPhe: 0.328 ± 0.189
HisGly: 0.493 ± 0.229
HisHis: 0.493 ± 0.187
HisIle: 1.067 ± 0.453
HisLys: 1.15 ± 0.395
HisLeu: 0.328 ± 0.187
HisMet: 0.0 ± 0.0
HisAsn: 0.903 ± 0.362
HisPro: 0.575 ± 0.245
HisGln: 0.246 ± 0.152
HisArg: 0.411 ± 0.205
HisSer: 0.657 ± 0.215
HisThr: 0.985 ± 0.35
HisVal: 1.15 ± 0.365
HisTrp: 0.246 ± 0.125
HisTyr: 0.739 ± 0.226
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.173 ± 0.699
IleCys: 0.246 ± 0.136
IleAsp: 5.501 ± 0.691
IleGlu: 6.897 ± 0.838
IlePhe: 2.299 ± 0.522
IleGly: 2.135 ± 0.428
IleHis: 0.821 ± 0.266
IleIle: 4.105 ± 0.822
IleLys: 7.39 ± 1.113
IleLeu: 5.091 ± 0.616
IleMet: 1.396 ± 0.41
IleAsn: 4.516 ± 0.507
IlePro: 1.806 ± 0.345
IleGln: 2.545 ± 0.415
IleArg: 1.888 ± 0.478
IleSer: 4.844 ± 0.673
IleThr: 4.188 ± 0.572
IleVal: 4.27 ± 0.591
IleTrp: 0.739 ± 0.209
IleTyr: 2.299 ± 0.524
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.965 ± 1.271
LysCys: 0.493 ± 0.193
LysAsp: 5.255 ± 0.674
LysGlu: 7.554 ± 1.031
LysPhe: 2.874 ± 0.57
LysGly: 5.748 ± 0.665
LysHis: 1.067 ± 0.363
LysIle: 4.434 ± 0.68
LysLys: 8.539 ± 0.998
LysLeu: 7.39 ± 0.635
LysMet: 2.71 ± 0.479
LysAsn: 5.748 ± 0.697
LysPro: 3.12 ± 0.634
LysGln: 4.352 ± 0.614
LysArg: 4.188 ± 0.594
LysSer: 5.912 ± 1.413
LysThr: 6.404 ± 0.616
LysVal: 5.665 ± 0.767
LysTrp: 1.724 ± 0.369
LysTyr: 3.531 ± 0.439
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.882 ± 0.929
LeuCys: 0.575 ± 0.335
LeuAsp: 4.927 ± 0.675
LeuGlu: 7.226 ± 1.097
LeuPhe: 3.038 ± 0.616
LeuGly: 4.105 ± 0.678
LeuHis: 0.657 ± 0.193
LeuIle: 4.927 ± 0.495
LeuLys: 8.129 ± 0.783
LeuLeu: 6.076 ± 0.684
LeuMet: 1.642 ± 0.38
LeuAsn: 5.665 ± 0.578
LeuPro: 2.956 ± 0.409
LeuGln: 2.545 ± 0.488
LeuArg: 3.284 ± 0.638
LeuSer: 5.665 ± 0.614
LeuThr: 5.091 ± 0.504
LeuVal: 4.927 ± 0.561
LeuTrp: 0.328 ± 0.148
LeuTyr: 2.956 ± 0.447
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.971 ± 0.377
MetCys: 0.164 ± 0.107
MetAsp: 1.232 ± 0.334
MetGlu: 1.888 ± 0.461
MetPhe: 0.903 ± 0.289
MetGly: 1.806 ± 0.334
MetHis: 0.082 ± 0.099
MetIle: 1.232 ± 0.377
MetLys: 2.135 ± 0.401
MetLeu: 1.642 ± 0.329
MetMet: 0.657 ± 0.245
MetAsn: 1.642 ± 0.276
MetPro: 0.411 ± 0.146
MetGln: 1.314 ± 0.269
MetArg: 1.314 ± 0.253
MetSer: 1.56 ± 0.293
MetThr: 2.463 ± 0.382
MetVal: 0.985 ± 0.268
MetTrp: 0.328 ± 0.151
MetTyr: 1.15 ± 0.33
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.762 ± 1.043
AsnCys: 0.575 ± 0.252
AsnAsp: 2.874 ± 0.585
AsnGlu: 4.68 ± 0.654
AsnPhe: 1.56 ± 0.314
AsnGly: 5.83 ± 0.711
AsnHis: 0.985 ± 0.276
AsnIle: 4.844 ± 0.603
AsnLys: 5.83 ± 0.834
AsnLeu: 4.598 ± 0.52
AsnMet: 1.314 ± 0.323
AsnAsn: 4.27 ± 0.643
AsnPro: 1.642 ± 0.434
AsnGln: 2.956 ± 0.444
AsnArg: 3.12 ± 0.592
AsnSer: 3.695 ± 0.613
AsnThr: 3.531 ± 0.594
AsnVal: 3.941 ± 0.52
AsnTrp: 0.575 ± 0.245
AsnTyr: 2.381 ± 0.458
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.381 ± 0.549
ProCys: 0.164 ± 0.117
ProAsp: 1.396 ± 0.396
ProGlu: 1.806 ± 0.372
ProPhe: 1.314 ± 0.43
ProGly: 0.657 ± 0.208
ProHis: 0.164 ± 0.124
ProIle: 1.642 ± 0.424
ProLys: 1.56 ± 0.346
ProLeu: 2.381 ± 0.536
ProMet: 0.493 ± 0.203
ProAsn: 1.067 ± 0.394
ProPro: 1.478 ± 0.426
ProGln: 0.575 ± 0.171
ProArg: 0.985 ± 0.342
ProSer: 1.396 ± 0.282
ProThr: 1.56 ± 0.375
ProVal: 2.71 ± 0.488
ProTrp: 0.082 ± 0.072
ProTyr: 0.575 ± 0.244
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.956 ± 0.823
GlnCys: 0.411 ± 0.177
GlnAsp: 2.135 ± 0.468
GlnGlu: 2.71 ± 0.588
GlnPhe: 1.15 ± 0.335
GlnGly: 1.642 ± 0.464
GlnHis: 0.575 ± 0.257
GlnIle: 3.12 ± 0.426
GlnLys: 4.023 ± 0.518
GlnLeu: 3.12 ± 0.498
GlnMet: 0.657 ± 0.256
GlnAsn: 2.381 ± 0.455
GlnPro: 0.575 ± 0.231
GlnGln: 2.299 ± 0.451
GlnArg: 1.232 ± 0.337
GlnSer: 2.135 ± 0.459
GlnThr: 2.053 ± 0.328
GlnVal: 1.724 ± 0.439
GlnTrp: 0.575 ± 0.199
GlnTyr: 1.314 ± 0.28
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.299 ± 0.531
ArgCys: 0.246 ± 0.148
ArgAsp: 1.724 ± 0.446
ArgGlu: 2.792 ± 0.566
ArgPhe: 1.396 ± 0.42
ArgGly: 1.56 ± 0.362
ArgHis: 0.821 ± 0.381
ArgIle: 2.627 ± 0.542
ArgLys: 3.449 ± 0.726
ArgLeu: 3.366 ± 0.601
ArgMet: 1.15 ± 0.297
ArgAsn: 2.135 ± 0.488
ArgPro: 0.575 ± 0.21
ArgGln: 0.903 ± 0.427
ArgArg: 1.396 ± 0.366
ArgSer: 2.217 ± 0.461
ArgThr: 2.217 ± 0.521
ArgVal: 2.545 ± 0.505
ArgTrp: 0.411 ± 0.167
ArgTyr: 1.888 ± 0.432
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.748 ± 1.498
SerCys: 0.082 ± 0.08
SerAsp: 4.434 ± 0.56
SerGlu: 5.173 ± 0.74
SerPhe: 3.531 ± 0.628
SerGly: 4.105 ± 0.717
SerHis: 0.575 ± 0.219
SerIle: 5.748 ± 0.717
SerLys: 4.762 ± 0.602
SerLeu: 5.583 ± 0.699
SerMet: 0.903 ± 0.25
SerAsn: 4.516 ± 0.638
SerPro: 1.15 ± 0.326
SerGln: 2.299 ± 0.352
SerArg: 1.314 ± 0.303
SerSer: 3.12 ± 0.647
SerThr: 3.449 ± 0.625
SerVal: 4.023 ± 0.574
SerTrp: 0.657 ± 0.286
SerTyr: 1.806 ± 0.351
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.748 ± 1.209
ThrCys: 0.493 ± 0.224
ThrAsp: 2.956 ± 0.435
ThrGlu: 4.598 ± 0.613
ThrPhe: 3.12 ± 0.562
ThrGly: 4.023 ± 0.65
ThrHis: 0.903 ± 0.291
ThrIle: 3.859 ± 0.632
ThrLys: 5.501 ± 0.656
ThrLeu: 4.434 ± 0.586
ThrMet: 1.724 ± 0.424
ThrAsn: 3.366 ± 0.533
ThrPro: 1.642 ± 0.356
ThrGln: 1.806 ± 0.416
ThrArg: 1.971 ± 0.551
ThrSer: 3.941 ± 0.6
ThrThr: 3.859 ± 0.449
ThrVal: 3.284 ± 0.525
ThrTrp: 0.657 ± 0.236
ThrTyr: 1.314 ± 0.411
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.994 ± 0.946
ValCys: 0.164 ± 0.1
ValAsp: 4.516 ± 0.726
ValGlu: 4.188 ± 0.529
ValPhe: 2.135 ± 0.432
ValGly: 3.449 ± 0.438
ValHis: 0.657 ± 0.243
ValIle: 4.105 ± 0.536
ValLys: 6.404 ± 0.748
ValLeu: 4.598 ± 0.554
ValMet: 2.135 ± 0.377
ValAsn: 3.777 ± 0.542
ValPro: 1.724 ± 0.428
ValGln: 2.299 ± 0.389
ValArg: 2.71 ± 0.604
ValSer: 4.516 ± 0.677
ValThr: 3.941 ± 0.59
ValVal: 4.434 ± 0.566
ValTrp: 0.411 ± 0.166
ValTyr: 2.627 ± 0.588
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.657 ± 0.203
TrpCys: 0.0 ± 0.0
TrpAsp: 0.739 ± 0.211
TrpGlu: 1.067 ± 0.368
TrpPhe: 0.411 ± 0.171
TrpGly: 0.657 ± 0.215
TrpHis: 0.328 ± 0.157
TrpIle: 0.821 ± 0.342
TrpLys: 0.903 ± 0.337
TrpLeu: 0.657 ± 0.267
TrpMet: 0.246 ± 0.135
TrpAsn: 0.493 ± 0.242
TrpPro: 0.0 ± 0.0
TrpGln: 0.821 ± 0.238
TrpArg: 0.493 ± 0.25
TrpSer: 0.575 ± 0.27
TrpThr: 0.493 ± 0.214
TrpVal: 0.575 ± 0.233
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.739 ± 0.505
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.038 ± 0.418
TyrCys: 0.246 ± 0.157
TyrAsp: 1.642 ± 0.38
TyrGlu: 2.71 ± 0.637
TyrPhe: 1.642 ± 0.43
TyrGly: 1.724 ± 0.398
TyrHis: 0.246 ± 0.146
TyrIle: 2.135 ± 0.392
TyrLys: 3.695 ± 0.737
TyrLeu: 3.038 ± 0.66
TyrMet: 0.821 ± 0.236
TyrAsn: 3.038 ± 0.658
TyrPro: 0.739 ± 0.239
TyrGln: 2.381 ± 0.544
TyrArg: 0.985 ± 0.237
TyrSer: 2.545 ± 0.362
TyrThr: 2.217 ± 0.436
TyrVal: 2.627 ± 0.424
TyrTrp: 0.657 ± 0.258
TyrTyr: 2.217 ± 0.626
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 67 proteins (12180 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
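Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicate values is taken as the error. This is only a sketch under assumptions: the page does not state which spread statistic is reported (the sample standard deviation is used here), and the helper names, the seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter(
        seq[i:i + 2] for seq in proteins for i in range(len(seq) - 1)
    )
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Spread of the frequency over protein-level bootstrap replicates.

    Assumption: the error is the sample standard deviation of the
    replicate frequencies; the database may use another statistic.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy input (hypothetical sequences, not taken from this proteome).
toy_proteins = ["MKKLA", "AKKA", "MAAKL", "KKKK"]
print(dipeptide_permille(toy_proteins, "KK"), bootstrap_error(toy_proteins, "KK"))
```

Resampling proteins rather than individual dipeptides keeps the pairs from one protein together, so the error reflects variation between proteins.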

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski