Amino acid dipeptide frequency for Lactobacillus virus phiJL1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if we count Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with every amino acid equally likely, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
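The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used by the database: the function names, the one-letter input format, the toy sequences, and the choice to count pairs within each protein separately are all assumptions.

```python
from collections import Counter
from itertools import product

# One-letter to three-letter codes, in the column order used by the table.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}

def three_letter(residue):
    """Map a one-letter code to three letters; anything else becomes Xaa."""
    return ONE_TO_THREE.get(residue.upper(), "Xaa")

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs."""
    counts = Counter()
    for seq in proteins:
        codes = [three_letter(r) for r in seq]
        # Assumption: pairs are counted inside each protein only,
        # so no dipeptide spans the boundary between two proteins.
        counts.update(a + b for a, b in zip(codes, codes[1:]))
    total = sum(counts.values())
    labels = list(ONE_TO_THREE.values()) + ["Xaa"]
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(labels, repeat=2)
    }

# Hypothetical toy input, not taken from the phiJL1 proteome.
toy = ["MKAAL", "AAXG"]
freqs = dipeptide_permille(toy)

# Uniform expectation over the 400 standard dipeptides: 2.5 per mille (0.25%).
uniform_expectation = 1000.0 / 400
print(len(freqs), freqs["AlaAla"], uniform_expectation)
```

Returning all 441 keys, including those with zero counts, mirrors the table layout, where absent dipeptides are listed as 0.0.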

All values are given in per mille (‰), so they must be multiplied by 10^-3 to obtain fractions.
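As a small worked example of the units, the sketch below converts a few per mille values copied from the table into plain fractions and compares them with the uniform expectation of 2.5‰ (1/400). The helper names are hypothetical, and using the uniform expectation as the threshold is an assumption; the exact rule the page uses for its red and blue marking is not stated here.

```python
# Uniform expectation for 400 equally likely dipeptides, in per mille.
EXPECTED_PERMILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%

def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def label(value, expected=EXPECTED_PERMILLE):
    """Classify a per mille value relative to the expected value."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"

# Values copied from the table (per mille).
sample = {"AlaAla": 7.736, "CysCys": 0.0, "GluGly": 2.483}

for name, value in sample.items():
    print(name, permille_to_fraction(value), label(value))
```

A stricter comparison would also take the reported error (the ± value) into account before calling a dipeptide over- or underrepresented.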

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
7.736AlaAla: 7.736 ± 2.942
0.478AlaCys: 0.478 ± 0.218
5.348AlaAsp: 5.348 ± 0.732
4.107AlaGlu: 4.107 ± 0.633
2.483AlaPhe: 2.483 ± 0.476
6.59AlaGly: 6.59 ± 1.81
1.624AlaHis: 1.624 ± 0.327
7.163AlaIle: 7.163 ± 1.33
7.449AlaLys: 7.449 ± 0.928
6.303AlaLeu: 6.303 ± 1.119
3.534AlaMet: 3.534 ± 0.743
4.775AlaAsn: 4.775 ± 0.913
2.292AlaPro: 2.292 ± 0.558
2.388AlaGln: 2.388 ± 0.499
2.197AlaArg: 2.197 ± 0.423
4.202AlaSer: 4.202 ± 1.418
5.539AlaThr: 5.539 ± 1.256
6.208AlaVal: 6.208 ± 0.856
0.86AlaTrp: 0.86 ± 0.328
2.77AlaTyr: 2.77 ± 0.461
0.0AlaXaa: 0.0 ± 0.0
Cys
0.191CysAla: 0.191 ± 0.122
0.0CysCys: 0.0 ± 0.0
0.191CysAsp: 0.191 ± 0.133
0.764CysGlu: 0.764 ± 0.243
0.287CysPhe: 0.287 ± 0.216
0.669CysGly: 0.669 ± 0.24
0.0CysHis: 0.0 ± 0.0
0.287CysIle: 0.287 ± 0.149
0.287CysLys: 0.287 ± 0.212
0.096CysLeu: 0.096 ± 0.107
0.191CysMet: 0.191 ± 0.131
0.478CysAsn: 0.478 ± 0.21
0.096CysPro: 0.096 ± 0.098
0.096CysGln: 0.096 ± 0.098
0.382CysArg: 0.382 ± 0.175
0.191CysSer: 0.191 ± 0.131
0.191CysThr: 0.191 ± 0.14
0.382CysVal: 0.382 ± 0.227
0.0CysTrp: 0.0 ± 0.0
0.191CysTyr: 0.191 ± 0.124
0.0CysXaa: 0.0 ± 0.0
Asp
5.253AspAla: 5.253 ± 0.589
0.287AspCys: 0.287 ± 0.168
3.82AspAsp: 3.82 ± 0.651
4.871AspGlu: 4.871 ± 0.96
2.388AspPhe: 2.388 ± 0.441
5.157AspGly: 5.157 ± 0.82
1.051AspHis: 1.051 ± 0.228
4.107AspIle: 4.107 ± 0.483
4.584AspLys: 4.584 ± 0.456
6.876AspLeu: 6.876 ± 0.76
1.815AspMet: 1.815 ± 0.414
4.775AspAsn: 4.775 ± 0.895
2.388AspPro: 2.388 ± 0.514
1.528AspGln: 1.528 ± 0.373
1.719AspArg: 1.719 ± 0.471
4.966AspSer: 4.966 ± 0.835
4.202AspThr: 4.202 ± 0.791
4.011AspVal: 4.011 ± 0.71
0.669AspTrp: 0.669 ± 0.278
3.82AspTyr: 3.82 ± 0.744
0.0AspXaa: 0.0 ± 0.0
Glu
4.68GluAla: 4.68 ± 0.759
0.096GluCys: 0.096 ± 0.094
4.489GluAsp: 4.489 ± 0.833
4.298GluGlu: 4.298 ± 0.924
2.77GluPhe: 2.77 ± 0.615
2.483GluGly: 2.483 ± 0.563
0.573GluHis: 0.573 ± 0.239
3.343GluIle: 3.343 ± 0.643
3.629GluLys: 3.629 ± 0.795
6.59GluLeu: 6.59 ± 0.869
1.624GluMet: 1.624 ± 0.328
2.865GluAsn: 2.865 ± 0.754
0.573GluPro: 0.573 ± 0.26
2.101GluGln: 2.101 ± 0.453
2.197GluArg: 2.197 ± 0.44
4.107GluSer: 4.107 ± 0.554
2.388GluThr: 2.388 ± 0.504
2.865GluVal: 2.865 ± 0.598
0.573GluTrp: 0.573 ± 0.205
3.056GluTyr: 3.056 ± 0.686
0.0GluXaa: 0.0 ± 0.0
Phe
2.77PheAla: 2.77 ± 0.453
0.191PheCys: 0.191 ± 0.15
3.152PheAsp: 3.152 ± 0.676
1.91PheGlu: 1.91 ± 0.448
1.146PhePhe: 1.146 ± 0.303
3.343PheGly: 3.343 ± 0.476
0.382PheHis: 0.382 ± 0.174
2.197PheIle: 2.197 ± 0.496
2.674PheLys: 2.674 ± 0.474
2.101PheLeu: 2.101 ± 0.327
0.573PheMet: 0.573 ± 0.229
2.77PheAsn: 2.77 ± 0.459
0.764PhePro: 0.764 ± 0.256
0.86PheGln: 0.86 ± 0.283
1.242PheArg: 1.242 ± 0.411
2.674PheSer: 2.674 ± 0.283
3.725PheThr: 3.725 ± 0.548
2.101PheVal: 2.101 ± 0.361
0.191PheTrp: 0.191 ± 0.12
1.719PheTyr: 1.719 ± 0.545
0.0PheXaa: 0.0 ± 0.0
Gly
3.725GlyAla: 3.725 ± 1.231
0.096GlyCys: 0.096 ± 0.089
4.298GlyAsp: 4.298 ± 0.858
3.247GlyGlu: 3.247 ± 0.57
3.056GlyPhe: 3.056 ± 0.569
3.725GlyGly: 3.725 ± 0.65
0.764GlyHis: 0.764 ± 0.252
4.966GlyIle: 4.966 ± 0.731
6.685GlyLys: 6.685 ± 0.658
5.826GlyLeu: 5.826 ± 0.901
1.719GlyMet: 1.719 ± 0.426
4.584GlyAsn: 4.584 ± 0.862
0.764GlyPro: 0.764 ± 0.387
1.719GlyGln: 1.719 ± 0.337
2.483GlyArg: 2.483 ± 0.588
4.966GlySer: 4.966 ± 1.135
5.826GlyThr: 5.826 ± 1.006
6.494GlyVal: 6.494 ± 0.853
0.669GlyTrp: 0.669 ± 0.27
4.202GlyTyr: 4.202 ± 0.783
0.0GlyXaa: 0.0 ± 0.0
His
0.764HisAla: 0.764 ± 0.31
0.096HisCys: 0.096 ± 0.098
1.051HisAsp: 1.051 ± 0.252
0.669HisGlu: 0.669 ± 0.247
0.764HisPhe: 0.764 ± 0.258
0.86HisGly: 0.86 ± 0.235
0.096HisHis: 0.096 ± 0.089
0.573HisIle: 0.573 ± 0.227
0.955HisLys: 0.955 ± 0.29
0.955HisLeu: 0.955 ± 0.319
0.0HisMet: 0.0 ± 0.0
0.573HisAsn: 0.573 ± 0.215
0.478HisPro: 0.478 ± 0.196
0.573HisGln: 0.573 ± 0.203
0.764HisArg: 0.764 ± 0.253
1.242HisSer: 1.242 ± 0.31
0.955HisThr: 0.955 ± 0.28
1.433HisVal: 1.433 ± 0.411
0.287HisTrp: 0.287 ± 0.123
0.764HisTyr: 0.764 ± 0.358
0.0HisXaa: 0.0 ± 0.0
Ile
4.775IleAla: 4.775 ± 1.165
0.382IleCys: 0.382 ± 0.168
5.157IleAsp: 5.157 ± 0.872
4.775IleGlu: 4.775 ± 0.782
1.624IlePhe: 1.624 ± 0.379
4.871IleGly: 4.871 ± 1.052
1.051IleHis: 1.051 ± 0.307
4.011IleIle: 4.011 ± 0.563
6.303IleLys: 6.303 ± 0.787
2.579IleLeu: 2.579 ± 0.348
2.006IleMet: 2.006 ± 0.361
4.107IleAsn: 4.107 ± 0.67
2.197IlePro: 2.197 ± 0.509
2.006IleGln: 2.006 ± 0.377
2.006IleArg: 2.006 ± 0.435
5.444IleSer: 5.444 ± 0.703
5.539IleThr: 5.539 ± 0.654
4.775IleVal: 4.775 ± 0.736
0.669IleTrp: 0.669 ± 0.2
2.674IleTyr: 2.674 ± 0.456
0.0IleXaa: 0.0 ± 0.0
Lys
6.972LysAla: 6.972 ± 0.812
0.0LysCys: 0.0 ± 0.0
4.011LysAsp: 4.011 ± 0.724
3.629LysGlu: 3.629 ± 0.674
3.916LysPhe: 3.916 ± 0.52
3.438LysGly: 3.438 ± 0.598
1.719LysHis: 1.719 ± 0.523
4.011LysIle: 4.011 ± 0.584
4.011LysLys: 4.011 ± 0.691
5.539LysLeu: 5.539 ± 0.875
1.624LysMet: 1.624 ± 0.383
4.584LysAsn: 4.584 ± 0.565
2.961LysPro: 2.961 ± 0.517
4.011LysGln: 4.011 ± 0.899
2.961LysArg: 2.961 ± 0.615
5.062LysSer: 5.062 ± 0.688
5.348LysThr: 5.348 ± 0.81
4.393LysVal: 4.393 ± 0.644
0.669LysTrp: 0.669 ± 0.203
3.534LysTyr: 3.534 ± 0.603
0.0LysXaa: 0.0 ± 0.0
Leu
5.635LeuAla: 5.635 ± 0.775
0.287LeuCys: 0.287 ± 0.184
5.253LeuAsp: 5.253 ± 0.81
3.629LeuGlu: 3.629 ± 0.723
2.292LeuPhe: 2.292 ± 0.471
4.489LeuGly: 4.489 ± 0.789
0.955LeuHis: 0.955 ± 0.295
4.871LeuIle: 4.871 ± 0.81
6.017LeuLys: 6.017 ± 0.836
5.157LeuLeu: 5.157 ± 0.725
2.292LeuMet: 2.292 ± 0.493
3.725LeuAsn: 3.725 ± 0.485
3.247LeuPro: 3.247 ± 0.568
2.388LeuGln: 2.388 ± 0.512
2.006LeuArg: 2.006 ± 0.518
5.73LeuSer: 5.73 ± 0.76
5.73LeuThr: 5.73 ± 0.818
5.253LeuVal: 5.253 ± 0.731
0.955LeuTrp: 0.955 ± 0.334
2.77LeuTyr: 2.77 ± 0.709
0.0LeuXaa: 0.0 ± 0.0
Met
3.534MetAla: 3.534 ± 0.588
0.382MetCys: 0.382 ± 0.173
1.433MetAsp: 1.433 ± 0.368
0.764MetGlu: 0.764 ± 0.235
1.146MetPhe: 1.146 ± 0.284
1.91MetGly: 1.91 ± 0.43
0.191MetHis: 0.191 ± 0.13
2.006MetIle: 2.006 ± 0.466
2.197MetLys: 2.197 ± 0.491
1.91MetLeu: 1.91 ± 0.51
0.86MetMet: 0.86 ± 0.27
1.433MetAsn: 1.433 ± 0.404
0.478MetPro: 0.478 ± 0.197
1.242MetGln: 1.242 ± 0.299
0.955MetArg: 0.955 ± 0.334
2.292MetSer: 2.292 ± 0.556
1.815MetThr: 1.815 ± 0.326
1.91MetVal: 1.91 ± 0.312
0.191MetTrp: 0.191 ± 0.145
0.382MetTyr: 0.382 ± 0.153
0.0MetXaa: 0.0 ± 0.0
Asn
4.775AsnAla: 4.775 ± 0.787
0.382AsnCys: 0.382 ± 0.154
4.489AsnAsp: 4.489 ± 0.741
2.77AsnGlu: 2.77 ± 0.467
1.146AsnPhe: 1.146 ± 0.251
6.303AsnGly: 6.303 ± 0.877
1.051AsnHis: 1.051 ± 0.308
3.725AsnIle: 3.725 ± 0.607
3.916AsnLys: 3.916 ± 0.527
3.82AsnLeu: 3.82 ± 0.684
1.528AsnMet: 1.528 ± 0.48
4.011AsnAsn: 4.011 ± 1.026
1.91AsnPro: 1.91 ± 0.451
2.77AsnGln: 2.77 ± 0.46
2.292AsnArg: 2.292 ± 0.53
4.966AsnSer: 4.966 ± 0.89
2.961AsnThr: 2.961 ± 0.469
4.202AsnVal: 4.202 ± 0.618
0.669AsnTrp: 0.669 ± 0.247
2.674AsnTyr: 2.674 ± 0.72
0.0AsnXaa: 0.0 ± 0.0
Pro
2.674ProAla: 2.674 ± 0.659
0.287ProCys: 0.287 ± 0.149
2.006ProAsp: 2.006 ± 0.477
2.101ProGlu: 2.101 ± 0.508
1.146ProPhe: 1.146 ± 0.262
1.433ProGly: 1.433 ± 0.332
0.096ProHis: 0.096 ± 0.097
2.483ProIle: 2.483 ± 0.422
1.815ProLys: 1.815 ± 0.522
2.483ProLeu: 2.483 ± 0.469
0.287ProMet: 0.287 ± 0.134
2.006ProAsn: 2.006 ± 0.453
0.287ProPro: 0.287 ± 0.18
1.242ProGln: 1.242 ± 0.371
1.146ProArg: 1.146 ± 0.341
1.815ProSer: 1.815 ± 0.34
1.433ProThr: 1.433 ± 0.315
2.101ProVal: 2.101 ± 0.313
0.382ProTrp: 0.382 ± 0.186
1.624ProTyr: 1.624 ± 0.42
0.0ProXaa: 0.0 ± 0.0
Gln
3.438GlnAla: 3.438 ± 0.745
0.096GlnCys: 0.096 ± 0.094
1.91GlnAsp: 1.91 ± 0.409
1.528GlnGlu: 1.528 ± 0.382
0.669GlnPhe: 0.669 ± 0.259
3.056GlnGly: 3.056 ± 0.413
0.764GlnHis: 0.764 ± 0.316
2.483GlnIle: 2.483 ± 0.352
2.292GlnLys: 2.292 ± 0.391
2.483GlnLeu: 2.483 ± 0.565
0.669GlnMet: 0.669 ± 0.243
1.91GlnAsn: 1.91 ± 0.524
0.764GlnPro: 0.764 ± 0.274
1.528GlnGln: 1.528 ± 0.402
1.433GlnArg: 1.433 ± 0.4
3.152GlnSer: 3.152 ± 0.571
3.534GlnThr: 3.534 ± 0.501
2.483GlnVal: 2.483 ± 0.401
0.191GlnTrp: 0.191 ± 0.121
1.337GlnTyr: 1.337 ± 0.362
0.0GlnXaa: 0.0 ± 0.0
Arg
2.483ArgAla: 2.483 ± 0.403
0.191ArgCys: 0.191 ± 0.196
2.579ArgAsp: 2.579 ± 0.452
2.006ArgGlu: 2.006 ± 0.594
1.433ArgPhe: 1.433 ± 0.429
1.91ArgGly: 1.91 ± 0.337
0.382ArgHis: 0.382 ± 0.174
2.579ArgIle: 2.579 ± 0.528
2.197ArgLys: 2.197 ± 0.581
2.483ArgLeu: 2.483 ± 0.459
0.955ArgMet: 0.955 ± 0.287
1.815ArgAsn: 1.815 ± 0.379
0.86ArgPro: 0.86 ± 0.304
1.242ArgGln: 1.242 ± 0.457
1.433ArgArg: 1.433 ± 0.426
1.528ArgSer: 1.528 ± 0.472
2.006ArgThr: 2.006 ± 0.347
3.725ArgVal: 3.725 ± 0.846
0.573ArgTrp: 0.573 ± 0.224
2.197ArgTyr: 2.197 ± 0.424
0.0ArgXaa: 0.0 ± 0.0
Ser
6.685SerAla: 6.685 ± 1.701
0.096SerCys: 0.096 ± 0.089
5.062SerAsp: 5.062 ± 0.714
4.107SerGlu: 4.107 ± 0.899
2.579SerPhe: 2.579 ± 0.454
6.112SerGly: 6.112 ± 1.263
0.573SerHis: 0.573 ± 0.173
5.635SerIle: 5.635 ± 0.802
3.725SerLys: 3.725 ± 0.677
5.062SerLeu: 5.062 ± 0.557
2.292SerMet: 2.292 ± 0.53
3.725SerAsn: 3.725 ± 0.603
1.815SerPro: 1.815 ± 0.378
3.534SerGln: 3.534 ± 0.547
2.101SerArg: 2.101 ± 0.44
6.59SerSer: 6.59 ± 1.103
5.348SerThr: 5.348 ± 0.758
5.826SerVal: 5.826 ± 0.758
0.669SerTrp: 0.669 ± 0.302
2.77SerTyr: 2.77 ± 0.367
0.0SerXaa: 0.0 ± 0.0
Thr
6.876ThrAla: 6.876 ± 1.17
0.764ThrCys: 0.764 ± 0.247
4.775ThrAsp: 4.775 ± 0.67
3.152ThrGlu: 3.152 ± 0.691
3.247ThrPhe: 3.247 ± 0.535
5.826ThrGly: 5.826 ± 0.894
0.669ThrHis: 0.669 ± 0.246
4.775ThrIle: 4.775 ± 0.849
3.82ThrLys: 3.82 ± 0.586
4.393ThrLeu: 4.393 ± 0.836
2.197ThrMet: 2.197 ± 0.482
3.534ThrAsn: 3.534 ± 0.455
2.865ThrPro: 2.865 ± 0.526
2.579ThrGln: 2.579 ± 0.504
1.91ThrArg: 1.91 ± 0.487
4.966ThrSer: 4.966 ± 0.791
3.916ThrThr: 3.916 ± 0.385
4.966ThrVal: 4.966 ± 0.751
0.478ThrTrp: 0.478 ± 0.186
3.152ThrTyr: 3.152 ± 0.497
0.0ThrXaa: 0.0 ± 0.0
Val
6.876ValAla: 6.876 ± 1.254
0.382ValCys: 0.382 ± 0.192
5.826ValAsp: 5.826 ± 1.07
3.629ValGlu: 3.629 ± 0.581
2.483ValPhe: 2.483 ± 0.435
3.725ValGly: 3.725 ± 0.701
0.955ValHis: 0.955 ± 0.267
5.062ValIle: 5.062 ± 0.686
6.399ValLys: 6.399 ± 0.936
4.298ValLeu: 4.298 ± 0.722
0.669ValMet: 0.669 ± 0.276
5.157ValAsn: 5.157 ± 0.546
2.579ValPro: 2.579 ± 0.48
2.101ValGln: 2.101 ± 0.432
2.388ValArg: 2.388 ± 0.481
5.253ValSer: 5.253 ± 0.858
5.348ValThr: 5.348 ± 0.708
4.107ValVal: 4.107 ± 0.815
0.955ValTrp: 0.955 ± 0.319
2.579ValTyr: 2.579 ± 0.539
0.0ValXaa: 0.0 ± 0.0
Trp
0.86TrpAla: 0.86 ± 0.3
0.191TrpCys: 0.191 ± 0.138
0.478TrpAsp: 0.478 ± 0.211
0.669TrpGlu: 0.669 ± 0.23
0.382TrpPhe: 0.382 ± 0.211
0.764TrpGly: 0.764 ± 0.18
0.287TrpHis: 0.287 ± 0.157
0.764TrpIle: 0.764 ± 0.232
0.191TrpLys: 0.191 ± 0.136
0.573TrpLeu: 0.573 ± 0.276
0.096TrpMet: 0.096 ± 0.091
0.669TrpAsn: 0.669 ± 0.261
0.0TrpPro: 0.0 ± 0.0
0.096TrpGln: 0.096 ± 0.089
0.669TrpArg: 0.669 ± 0.27
1.528TrpSer: 1.528 ± 0.364
0.764TrpThr: 0.764 ± 0.274
0.573TrpVal: 0.573 ± 0.225
0.0TrpTrp: 0.0 ± 0.0
0.478TrpTyr: 0.478 ± 0.242
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.343TyrAla: 3.343 ± 0.51
0.287TyrCys: 0.287 ± 0.155
2.961TyrAsp: 2.961 ± 0.675
2.674TyrGlu: 2.674 ± 0.618
1.624TyrPhe: 1.624 ± 0.418
3.152TyrGly: 3.152 ± 0.562
0.669TyrHis: 0.669 ± 0.222
1.91TyrIle: 1.91 ± 0.419
3.247TyrLys: 3.247 ± 0.609
3.152TyrLeu: 3.152 ± 0.498
2.006TyrMet: 2.006 ± 0.469
2.865TyrAsn: 2.865 ± 0.801
1.528TyrPro: 1.528 ± 0.367
1.719TyrGln: 1.719 ± 0.521
2.292TyrArg: 2.292 ± 0.579
3.629TyrSer: 3.629 ± 0.742
2.292TyrThr: 2.292 ± 0.537
2.961TyrVal: 2.961 ± 0.511
0.382TyrTrp: 0.382 ± 0.193
2.101TyrTyr: 2.101 ± 0.486
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 46 proteins (10472 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
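Bootstrapping at the protein level means resampling whole proteins with replacement and recomputing the statistic for each resample. The sketch below illustrates the idea under stated assumptions: the function names and toy sequences are hypothetical, sequences use one-letter codes, and the error is taken as the standard deviation across replicates, which may differ from what the database actually reports.

```python
import random
import statistics
from collections import Counter

def pair_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide, e.g. "AA", in a protein list."""
    counts = Counter()
    for seq in proteins:
        # Adjacent residue pairs, counted within each protein only.
        counts.update(zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[tuple(pair)] / total if total else 0.0

def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as in the original set, with replacement.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(resample, pair))
    return statistics.stdev(estimates)

# Hypothetical toy proteome, not the 46 phiJL1 proteins.
toy = ["MKAAL", "AAXG", "GGSAA", "MTTK"]
print(pair_permille(toy, "AA"), bootstrap_error(toy, "AA"))
```

Resampling proteins rather than individual dipeptides keeps each protein's internal composition intact, so the estimate reflects variation between proteins.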

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski