Amino acid dipepetide frequency for Lactococcus phage vB_Llc_bIBB14s

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.993AlaAla: 0.993 ± 0.433
0.11AlaCys: 0.11 ± 0.125
2.538AlaAsp: 2.538 ± 0.651
4.414AlaGlu: 4.414 ± 0.779
2.979AlaPhe: 2.979 ± 0.814
4.083AlaGly: 4.083 ± 0.755
0.883AlaHis: 0.883 ± 0.246
4.414AlaIle: 4.414 ± 1.125
5.738AlaLys: 5.738 ± 0.932
6.069AlaLeu: 6.069 ± 0.754
2.758AlaMet: 2.758 ± 0.64
4.634AlaAsn: 4.634 ± 0.694
0.883AlaPro: 0.883 ± 0.356
2.317AlaGln: 2.317 ± 0.549
2.317AlaArg: 2.317 ± 0.561
3.2AlaSer: 3.2 ± 0.801
3.752AlaThr: 3.752 ± 0.819
4.855AlaVal: 4.855 ± 1.059
1.876AlaTrp: 1.876 ± 0.514
2.096AlaTyr: 2.096 ± 0.371
0.0AlaXaa: 0.0 ± 0.0
Cys
0.221CysAla: 0.221 ± 0.134
0.0CysCys: 0.0 ± 0.0
0.331CysAsp: 0.331 ± 0.211
0.221CysGlu: 0.221 ± 0.164
0.221CysPhe: 0.221 ± 0.165
0.883CysGly: 0.883 ± 0.38
0.221CysHis: 0.221 ± 0.186
0.772CysIle: 0.772 ± 0.304
0.772CysLys: 0.772 ± 0.378
0.221CysLeu: 0.221 ± 0.156
0.221CysMet: 0.221 ± 0.174
0.552CysAsn: 0.552 ± 0.243
0.221CysPro: 0.221 ± 0.111
0.331CysGln: 0.331 ± 0.179
0.441CysArg: 0.441 ± 0.19
0.331CysSer: 0.331 ± 0.241
0.441CysThr: 0.441 ± 0.221
0.441CysVal: 0.441 ± 0.257
0.11CysTrp: 0.11 ± 0.128
0.441CysTyr: 0.441 ± 0.243
0.0CysXaa: 0.0 ± 0.0
Asp
1.876AspAla: 1.876 ± 0.495
0.221AspCys: 0.221 ± 0.15
2.979AspAsp: 2.979 ± 0.698
3.972AspGlu: 3.972 ± 0.802
3.31AspPhe: 3.31 ± 0.533
2.758AspGly: 2.758 ± 0.562
0.552AspHis: 0.552 ± 0.252
4.745AspIle: 4.745 ± 0.673
5.296AspLys: 5.296 ± 0.631
5.517AspLeu: 5.517 ± 0.919
0.993AspMet: 0.993 ± 0.302
4.634AspAsn: 4.634 ± 0.72
1.655AspPro: 1.655 ± 0.46
0.552AspGln: 0.552 ± 0.376
2.096AspArg: 2.096 ± 0.483
3.089AspSer: 3.089 ± 0.616
5.076AspThr: 5.076 ± 0.727
3.531AspVal: 3.531 ± 0.854
0.662AspTrp: 0.662 ± 0.281
2.758AspTyr: 2.758 ± 0.509
0.0AspXaa: 0.0 ± 0.0
Glu
3.752GluAla: 3.752 ± 0.668
0.331GluCys: 0.331 ± 0.19
3.421GluAsp: 3.421 ± 0.673
5.848GluGlu: 5.848 ± 0.98
3.31GluPhe: 3.31 ± 0.485
1.876GluGly: 1.876 ± 0.35
1.324GluHis: 1.324 ± 0.444
6.841GluIle: 6.841 ± 1.044
6.841GluLys: 6.841 ± 1.138
10.041GluLeu: 10.041 ± 1.464
2.538GluMet: 2.538 ± 0.539
5.186GluAsn: 5.186 ± 0.712
1.765GluPro: 1.765 ± 0.465
4.193GluGln: 4.193 ± 0.8
2.869GluArg: 2.869 ± 0.608
3.972GluSer: 3.972 ± 0.606
4.303GluThr: 4.303 ± 0.601
4.634GluVal: 4.634 ± 0.607
0.993GluTrp: 0.993 ± 0.305
3.421GluTyr: 3.421 ± 0.749
0.0GluXaa: 0.0 ± 0.0
Phe
3.31PheAla: 3.31 ± 0.611
0.221PheCys: 0.221 ± 0.16
3.641PheAsp: 3.641 ± 0.623
2.427PheGlu: 2.427 ± 0.561
2.207PhePhe: 2.207 ± 0.677
2.317PheGly: 2.317 ± 0.483
0.331PheHis: 0.331 ± 0.201
3.089PheIle: 3.089 ± 0.603
4.524PheLys: 4.524 ± 0.622
2.427PheLeu: 2.427 ± 0.449
0.772PheMet: 0.772 ± 0.281
2.869PheAsn: 2.869 ± 0.798
0.662PhePro: 0.662 ± 0.249
0.883PheGln: 0.883 ± 0.355
0.993PheArg: 0.993 ± 0.282
4.524PheSer: 4.524 ± 1.037
2.538PheThr: 2.538 ± 0.412
2.648PheVal: 2.648 ± 0.405
0.441PheTrp: 0.441 ± 0.196
1.214PheTyr: 1.214 ± 0.353
0.0PheXaa: 0.0 ± 0.0
Gly
3.862GlyAla: 3.862 ± 1.121
0.441GlyCys: 0.441 ± 0.215
2.979GlyAsp: 2.979 ± 0.695
4.524GlyGlu: 4.524 ± 0.7
2.538GlyPhe: 2.538 ± 0.841
4.303GlyGly: 4.303 ± 1.156
0.552GlyHis: 0.552 ± 0.271
4.083GlyIle: 4.083 ± 1.418
6.179GlyLys: 6.179 ± 0.645
5.296GlyLeu: 5.296 ± 0.903
0.993GlyMet: 0.993 ± 0.319
3.641GlyAsn: 3.641 ± 0.583
0.331GlyPro: 0.331 ± 0.192
2.207GlyGln: 2.207 ± 0.462
1.765GlyArg: 1.765 ± 0.349
4.303GlySer: 4.303 ± 0.839
2.538GlyThr: 2.538 ± 0.61
5.517GlyVal: 5.517 ± 1.2
1.103GlyTrp: 1.103 ± 0.365
3.531GlyTyr: 3.531 ± 0.72
0.0GlyXaa: 0.0 ± 0.0
His
0.552HisAla: 0.552 ± 0.24
0.662HisCys: 0.662 ± 0.362
0.883HisAsp: 0.883 ± 0.238
0.662HisGlu: 0.662 ± 0.243
0.662HisPhe: 0.662 ± 0.285
1.324HisGly: 1.324 ± 0.35
0.0HisHis: 0.0 ± 0.0
0.772HisIle: 0.772 ± 0.245
1.324HisLys: 1.324 ± 0.452
1.214HisLeu: 1.214 ± 0.456
0.0HisMet: 0.0 ± 0.0
1.876HisAsn: 1.876 ± 0.473
0.11HisPro: 0.11 ± 0.093
0.441HisGln: 0.441 ± 0.208
0.11HisArg: 0.11 ± 0.108
0.221HisSer: 0.221 ± 0.147
0.662HisThr: 0.662 ± 0.261
0.772HisVal: 0.772 ± 0.281
0.331HisTrp: 0.331 ± 0.227
0.662HisTyr: 0.662 ± 0.34
0.0HisXaa: 0.0 ± 0.0
Ile
3.972IleAla: 3.972 ± 0.692
0.331IleCys: 0.331 ± 0.199
4.524IleAsp: 4.524 ± 0.61
6.841IleGlu: 6.841 ± 1.004
2.427IlePhe: 2.427 ± 0.56
3.862IleGly: 3.862 ± 0.954
0.772IleHis: 0.772 ± 0.289
4.193IleIle: 4.193 ± 0.602
7.393IleLys: 7.393 ± 0.868
5.076IleLeu: 5.076 ± 1.015
1.765IleMet: 1.765 ± 0.38
5.186IleAsn: 5.186 ± 0.691
1.545IlePro: 1.545 ± 0.352
2.317IleGln: 2.317 ± 0.428
1.655IleArg: 1.655 ± 0.429
3.641IleSer: 3.641 ± 0.671
5.627IleThr: 5.627 ± 0.676
3.972IleVal: 3.972 ± 0.672
1.324IleTrp: 1.324 ± 0.418
3.31IleTyr: 3.31 ± 0.534
0.0IleXaa: 0.0 ± 0.0
Lys
6.069LysAla: 6.069 ± 1.004
0.993LysCys: 0.993 ± 0.392
4.855LysAsp: 4.855 ± 0.744
9.268LysGlu: 9.268 ± 1.542
2.538LysPhe: 2.538 ± 0.504
5.186LysGly: 5.186 ± 0.827
1.434LysHis: 1.434 ± 0.506
5.848LysIle: 5.848 ± 0.933
7.834LysLys: 7.834 ± 1.059
8.386LysLeu: 8.386 ± 0.912
2.758LysMet: 2.758 ± 0.437
5.738LysAsn: 5.738 ± 0.771
1.324LysPro: 1.324 ± 0.38
3.752LysGln: 3.752 ± 0.593
3.531LysArg: 3.531 ± 0.603
5.407LysSer: 5.407 ± 0.847
5.958LysThr: 5.958 ± 0.728
5.517LysVal: 5.517 ± 0.746
1.655LysTrp: 1.655 ± 0.314
3.752LysTyr: 3.752 ± 0.702
0.0LysXaa: 0.0 ± 0.0
Leu
5.517LeuAla: 5.517 ± 0.755
0.331LeuCys: 0.331 ± 0.191
4.524LeuAsp: 4.524 ± 0.699
6.179LeuGlu: 6.179 ± 0.862
4.083LeuPhe: 4.083 ± 0.707
4.855LeuGly: 4.855 ± 0.664
1.324LeuHis: 1.324 ± 0.393
6.951LeuIle: 6.951 ± 1.029
9.048LeuLys: 9.048 ± 0.963
5.848LeuLeu: 5.848 ± 1.009
1.324LeuMet: 1.324 ± 0.361
5.186LeuAsn: 5.186 ± 0.768
2.869LeuPro: 2.869 ± 0.549
3.2LeuGln: 3.2 ± 0.614
2.538LeuArg: 2.538 ± 0.569
5.186LeuSer: 5.186 ± 0.746
4.965LeuThr: 4.965 ± 0.725
5.738LeuVal: 5.738 ± 0.704
1.545LeuTrp: 1.545 ± 0.396
4.414LeuTyr: 4.414 ± 0.736
0.0LeuXaa: 0.0 ± 0.0
Met
2.427MetAla: 2.427 ± 0.495
0.11MetCys: 0.11 ± 0.137
1.545MetAsp: 1.545 ± 0.433
1.765MetGlu: 1.765 ± 0.549
0.662MetPhe: 0.662 ± 0.245
0.993MetGly: 0.993 ± 0.304
0.221MetHis: 0.221 ± 0.157
2.317MetIle: 2.317 ± 0.57
2.648MetLys: 2.648 ± 0.502
1.545MetLeu: 1.545 ± 0.461
0.221MetMet: 0.221 ± 0.155
1.765MetAsn: 1.765 ± 0.412
0.221MetPro: 0.221 ± 0.177
1.434MetGln: 1.434 ± 0.394
0.552MetArg: 0.552 ± 0.325
1.214MetSer: 1.214 ± 0.333
1.876MetThr: 1.876 ± 0.493
1.434MetVal: 1.434 ± 0.309
0.11MetTrp: 0.11 ± 0.092
1.103MetTyr: 1.103 ± 0.337
0.0MetXaa: 0.0 ± 0.0
Asn
5.296AsnAla: 5.296 ± 0.922
0.11AsnCys: 0.11 ± 0.116
4.083AsnAsp: 4.083 ± 0.667
5.296AsnGlu: 5.296 ± 0.778
2.207AsnPhe: 2.207 ± 0.579
6.51AsnGly: 6.51 ± 0.799
1.103AsnHis: 1.103 ± 0.328
3.641AsnIle: 3.641 ± 0.629
6.179AsnLys: 6.179 ± 1.293
5.517AsnLeu: 5.517 ± 0.787
1.434AsnMet: 1.434 ± 0.312
2.758AsnAsn: 2.758 ± 0.536
1.986AsnPro: 1.986 ± 0.524
1.986AsnGln: 1.986 ± 0.475
2.538AsnArg: 2.538 ± 0.522
5.076AsnSer: 5.076 ± 0.649
4.855AsnThr: 4.855 ± 0.854
3.752AsnVal: 3.752 ± 0.618
0.993AsnTrp: 0.993 ± 0.355
2.317AsnTyr: 2.317 ± 0.635
0.0AsnXaa: 0.0 ± 0.0
Pro
1.214ProAla: 1.214 ± 0.352
0.11ProCys: 0.11 ± 0.127
1.655ProAsp: 1.655 ± 0.473
1.434ProGlu: 1.434 ± 0.312
0.993ProPhe: 0.993 ± 0.241
0.331ProGly: 0.331 ± 0.247
0.11ProHis: 0.11 ± 0.11
1.986ProIle: 1.986 ± 0.485
1.876ProLys: 1.876 ± 0.48
1.986ProLeu: 1.986 ± 0.429
0.662ProMet: 0.662 ± 0.257
2.427ProAsn: 2.427 ± 0.758
0.552ProPro: 0.552 ± 0.284
0.552ProGln: 0.552 ± 0.223
0.441ProArg: 0.441 ± 0.214
1.545ProSer: 1.545 ± 0.521
2.096ProThr: 2.096 ± 0.443
1.324ProVal: 1.324 ± 0.387
0.221ProTrp: 0.221 ± 0.15
0.662ProTyr: 0.662 ± 0.242
0.0ProXaa: 0.0 ± 0.0
Gln
3.31GlnAla: 3.31 ± 0.67
0.331GlnCys: 0.331 ± 0.204
1.986GlnAsp: 1.986 ± 0.523
2.538GlnGlu: 2.538 ± 0.704
1.655GlnPhe: 1.655 ± 0.398
2.427GlnGly: 2.427 ± 0.496
0.441GlnHis: 0.441 ± 0.222
1.876GlnIle: 1.876 ± 0.432
2.648GlnLys: 2.648 ± 0.639
3.31GlnLeu: 3.31 ± 0.561
0.993GlnMet: 0.993 ± 0.282
2.869GlnAsn: 2.869 ± 0.432
1.324GlnPro: 1.324 ± 0.438
1.765GlnGln: 1.765 ± 0.463
1.324GlnArg: 1.324 ± 0.394
2.427GlnSer: 2.427 ± 0.496
1.986GlnThr: 1.986 ± 0.442
2.207GlnVal: 2.207 ± 0.516
0.662GlnTrp: 0.662 ± 0.201
0.993GlnTyr: 0.993 ± 0.293
0.0GlnXaa: 0.0 ± 0.0
Arg
2.317ArgAla: 2.317 ± 0.697
0.441ArgCys: 0.441 ± 0.249
1.765ArgAsp: 1.765 ± 0.451
2.648ArgGlu: 2.648 ± 0.537
0.662ArgPhe: 0.662 ± 0.26
1.765ArgGly: 1.765 ± 0.366
0.662ArgHis: 0.662 ± 0.25
2.207ArgIle: 2.207 ± 0.484
3.862ArgLys: 3.862 ± 0.707
4.303ArgLeu: 4.303 ± 0.742
0.441ArgMet: 0.441 ± 0.244
2.317ArgAsn: 2.317 ± 0.598
0.552ArgPro: 0.552 ± 0.253
1.655ArgGln: 1.655 ± 0.359
1.655ArgArg: 1.655 ± 0.45
1.324ArgSer: 1.324 ± 0.395
1.655ArgThr: 1.655 ± 0.333
1.876ArgVal: 1.876 ± 0.455
0.11ArgTrp: 0.11 ± 0.097
1.986ArgTyr: 1.986 ± 0.423
0.0ArgXaa: 0.0 ± 0.0
Ser
6.069SerAla: 6.069 ± 1.457
0.883SerCys: 0.883 ± 0.335
3.972SerAsp: 3.972 ± 0.819
3.972SerGlu: 3.972 ± 0.626
2.427SerPhe: 2.427 ± 0.497
5.517SerGly: 5.517 ± 1.681
0.772SerHis: 0.772 ± 0.253
3.862SerIle: 3.862 ± 0.688
5.076SerLys: 5.076 ± 0.84
5.186SerLeu: 5.186 ± 0.805
1.655SerMet: 1.655 ± 0.407
3.641SerAsn: 3.641 ± 0.698
0.883SerPro: 0.883 ± 0.274
2.427SerGln: 2.427 ± 0.573
2.648SerArg: 2.648 ± 0.39
5.627SerSer: 5.627 ± 1.132
3.531SerThr: 3.531 ± 0.744
4.083SerVal: 4.083 ± 0.788
0.883SerTrp: 0.883 ± 0.299
2.096SerTyr: 2.096 ± 0.397
0.0SerXaa: 0.0 ± 0.0
Thr
3.862ThrAla: 3.862 ± 0.566
0.331ThrCys: 0.331 ± 0.213
3.2ThrAsp: 3.2 ± 0.73
5.848ThrGlu: 5.848 ± 0.705
2.648ThrPhe: 2.648 ± 0.515
4.083ThrGly: 4.083 ± 0.515
0.331ThrHis: 0.331 ± 0.16
4.303ThrIle: 4.303 ± 0.747
5.296ThrLys: 5.296 ± 0.693
5.848ThrLeu: 5.848 ± 0.768
1.324ThrMet: 1.324 ± 0.332
4.414ThrAsn: 4.414 ± 0.75
2.317ThrPro: 2.317 ± 0.502
2.758ThrGln: 2.758 ± 0.628
1.986ThrArg: 1.986 ± 0.467
4.965ThrSer: 4.965 ± 0.723
3.972ThrThr: 3.972 ± 0.689
3.862ThrVal: 3.862 ± 0.765
0.993ThrTrp: 0.993 ± 0.284
1.765ThrTyr: 1.765 ± 0.366
0.0ThrXaa: 0.0 ± 0.0
Val
4.193ValAla: 4.193 ± 0.754
0.552ValCys: 0.552 ± 0.209
4.303ValAsp: 4.303 ± 0.725
4.855ValGlu: 4.855 ± 0.504
3.089ValPhe: 3.089 ± 0.647
4.193ValGly: 4.193 ± 0.768
0.993ValHis: 0.993 ± 0.325
3.752ValIle: 3.752 ± 0.508
5.186ValLys: 5.186 ± 0.793
2.758ValLeu: 2.758 ± 0.502
1.986ValMet: 1.986 ± 0.359
2.538ValAsn: 2.538 ± 0.505
1.765ValPro: 1.765 ± 0.448
1.986ValGln: 1.986 ± 0.438
3.089ValArg: 3.089 ± 0.733
5.627ValSer: 5.627 ± 1.473
4.745ValThr: 4.745 ± 0.724
3.421ValVal: 3.421 ± 0.641
0.662ValTrp: 0.662 ± 0.275
2.648ValTyr: 2.648 ± 0.586
0.0ValXaa: 0.0 ± 0.0
Trp
0.552TrpAla: 0.552 ± 0.208
0.221TrpCys: 0.221 ± 0.182
0.662TrpAsp: 0.662 ± 0.32
0.883TrpGlu: 0.883 ± 0.295
0.993TrpPhe: 0.993 ± 0.33
0.883TrpGly: 0.883 ± 0.36
0.441TrpHis: 0.441 ± 0.2
0.552TrpIle: 0.552 ± 0.226
0.883TrpLys: 0.883 ± 0.318
1.765TrpLeu: 1.765 ± 0.533
0.441TrpMet: 0.441 ± 0.195
1.655TrpAsn: 1.655 ± 0.413
0.0TrpPro: 0.0 ± 0.0
0.883TrpGln: 0.883 ± 0.273
0.331TrpArg: 0.331 ± 0.264
1.655TrpSer: 1.655 ± 0.341
0.772TrpThr: 0.772 ± 0.266
0.221TrpVal: 0.221 ± 0.143
0.221TrpTrp: 0.221 ± 0.121
1.214TrpTyr: 1.214 ± 0.307
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.545TyrAla: 1.545 ± 0.482
0.662TyrCys: 0.662 ± 0.289
2.427TyrAsp: 2.427 ± 0.622
3.862TyrGlu: 3.862 ± 0.601
2.427TyrPhe: 2.427 ± 0.431
2.538TyrGly: 2.538 ± 0.587
0.772TyrHis: 0.772 ± 0.299
3.531TyrIle: 3.531 ± 0.629
3.31TyrLys: 3.31 ± 0.769
3.31TyrLeu: 3.31 ± 0.674
0.772TyrMet: 0.772 ± 0.265
3.752TyrAsn: 3.752 ± 0.477
1.214TyrPro: 1.214 ± 0.389
1.434TyrGln: 1.434 ± 0.416
1.324TyrArg: 1.324 ± 0.394
1.876TyrSer: 1.876 ± 0.469
2.758TyrThr: 2.758 ± 0.595
2.538TyrVal: 2.538 ± 0.552
0.331TyrTrp: 0.331 ± 0.194
2.317TyrTyr: 2.317 ± 0.547
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 57 proteins (9064 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski