Amino acid dipeptide frequency for Lactobacillus phage iA2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility, dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
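The counting described above can be illustrated with a minimal sketch. It is not the database's own code: it assumes overlapping two-residue windows, normalisation by the total number of dipeptides, and one-letter residue codes, and the names `dipeptide_permille`, `permille_to_fraction` and `toy_proteins` are hypothetical.

```python
from collections import Counter

N_RESIDUES = 20  # standard amino acids; use 21 to include Xaa


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over all sequences.

    Assumption: every overlapping pair of adjacent residues is counted,
    and pairs never span two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


# Frequency expected if all 400 dipeptides were equally likely: 2.5 per mille.
uniform_permille = 1000.0 / N_RESIDUES ** 2

# Made-up toy sequences, not taken from the phage proteome.
toy_proteins = ["MAAK", "AKL"]
freqs = dipeptide_permille(toy_proteins)

for dp, f in sorted(freqs.items()):
    label = "over" if f > uniform_permille else "under"
    print(f"{dp}: {f:.1f} per mille ({label}represented vs. uniform)")
```

With real data, `sequences` would hold the 50 protein sequences of the proteome. Dipeptides that never occur are simply absent from the returned dictionary and correspond to the 0.0 entries in the table.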

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second. Each cell is the frequency in ‰ ± the estimated error.

| First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 9.045 ± 1.485 | 0.186 ± 0.143 | 5.875 ± 0.653 | 5.408 ± 0.819 | 2.984 ± 0.602 | 5.222 ± 1.088 | 1.399 ± 0.321 | 5.688 ± 0.951 | 6.621 ± 0.837 | 6.807 ± 0.852 | 2.145 ± 0.587 | 5.502 ± 0.737 | 1.958 ± 0.534 | 3.916 ± 0.53 | 1.958 ± 0.389 | 5.502 ± 0.8 | 5.875 ± 1.054 | 5.688 ± 1.096 | 0.466 ± 0.299 | 3.45 ± 0.497 | 0.0 ± 0.0 |
| Cys | 0.373 ± 0.184 | 0.093 ± 0.089 | 0.186 ± 0.177 | 0.093 ± 0.089 | 0.093 ± 0.089 | 0.186 ± 0.214 | 0.186 ± 0.141 | 0.093 ± 0.086 | 0.186 ± 0.127 | 0.373 ± 0.202 | 0.093 ± 0.107 | 0.0 ± 0.0 | 0.28 ± 0.21 | 0.186 ± 0.138 | 0.373 ± 0.193 | 0.093 ± 0.089 | 0.093 ± 0.077 | 0.0 ± 0.0 | 0.186 ± 0.129 | 0.093 ± 0.098 | 0.0 ± 0.0 |
| Asp | 5.968 ± 0.868 | 0.373 ± 0.205 | 4.756 ± 0.7 | 4.662 ± 0.844 | 2.518 ± 0.568 | 4.849 ± 0.787 | 1.212 ± 0.308 | 4.476 ± 0.524 | 4.662 ± 0.707 | 6.341 ± 0.78 | 1.492 ± 0.295 | 3.264 ± 0.467 | 3.357 ± 0.725 | 3.357 ± 0.618 | 2.704 ± 0.448 | 3.823 ± 0.475 | 2.704 ± 0.497 | 4.01 ± 0.74 | 1.212 ± 0.31 | 2.704 ± 0.542 | 0.0 ± 0.0 |
| Glu | 4.103 ± 0.481 | 0.093 ± 0.095 | 4.662 ± 0.845 | 3.357 ± 0.694 | 3.077 ± 0.553 | 2.145 ± 0.378 | 1.212 ± 0.336 | 3.357 ± 0.612 | 4.103 ± 0.817 | 5.688 ± 1.017 | 1.585 ± 0.479 | 3.264 ± 0.615 | 2.424 ± 0.597 | 3.45 ± 0.644 | 2.704 ± 0.783 | 2.424 ± 0.571 | 3.264 ± 0.547 | 3.823 ± 0.648 | 1.119 ± 0.417 | 1.958 ± 0.372 | 0.0 ± 0.0 |
| Phe | 2.611 ± 0.455 | 0.28 ± 0.203 | 2.891 ± 0.556 | 2.331 ± 0.444 | 1.585 ± 0.461 | 3.264 ± 0.488 | 0.653 ± 0.253 | 1.772 ± 0.327 | 2.424 ± 0.468 | 2.051 ± 0.44 | 0.839 ± 0.311 | 2.145 ± 0.583 | 0.373 ± 0.18 | 1.212 ± 0.31 | 1.305 ± 0.366 | 3.077 ± 0.463 | 2.238 ± 0.453 | 1.772 ± 0.393 | 0.559 ± 0.206 | 0.932 ± 0.31 | 0.0 ± 0.0 |
| Gly | 3.73 ± 0.762 | 0.093 ± 0.089 | 4.756 ± 0.879 | 4.662 ± 0.647 | 2.518 ± 0.364 | 3.823 ± 0.822 | 1.585 ± 0.362 | 5.129 ± 0.603 | 5.315 ± 0.588 | 4.662 ± 0.807 | 1.958 ± 0.402 | 3.357 ± 0.463 | 1.772 ± 0.384 | 2.797 ± 0.434 | 1.958 ± 0.422 | 3.357 ± 0.601 | 4.476 ± 0.643 | 3.45 ± 0.611 | 1.399 ± 0.409 | 3.264 ± 0.55 | 0.0 ± 0.0 |
| His | 1.772 ± 0.402 | 0.093 ± 0.108 | 1.585 ± 0.442 | 1.026 ± 0.387 | 0.839 ± 0.213 | 1.119 ± 0.343 | 0.466 ± 0.197 | 1.492 ± 0.278 | 1.305 ± 0.345 | 1.119 ± 0.282 | 0.466 ± 0.185 | 1.026 ± 0.313 | 0.373 ± 0.159 | 0.746 ± 0.304 | 0.932 ± 0.386 | 1.026 ± 0.303 | 1.026 ± 0.266 | 2.051 ± 0.483 | 0.28 ± 0.161 | 0.186 ± 0.117 | 0.0 ± 0.0 |
| Ile | 5.688 ± 0.61 | 0.186 ± 0.137 | 4.196 ± 0.701 | 2.891 ± 0.518 | 1.305 ± 0.36 | 3.543 ± 0.525 | 1.585 ± 0.38 | 2.145 ± 0.43 | 5.688 ± 0.887 | 3.357 ± 0.753 | 1.492 ± 0.327 | 3.916 ± 0.663 | 2.145 ± 0.406 | 3.264 ± 0.47 | 2.891 ± 0.401 | 3.543 ± 0.59 | 5.502 ± 0.614 | 4.849 ± 0.618 | 1.026 ± 0.301 | 2.238 ± 0.63 | 0.0 ± 0.0 |
| Lys | 6.434 ± 0.706 | 0.093 ± 0.086 | 4.662 ± 0.707 | 3.73 ± 0.62 | 2.145 ± 0.403 | 4.01 ± 0.644 | 1.678 ± 0.359 | 2.984 ± 0.445 | 5.502 ± 1.05 | 6.061 ± 0.633 | 2.331 ± 0.478 | 3.17 ± 0.515 | 2.518 ± 0.555 | 3.17 ± 0.658 | 3.45 ± 0.76 | 5.875 ± 0.725 | 6.341 ± 0.558 | 4.756 ± 0.667 | 0.653 ± 0.264 | 1.399 ± 0.352 | 0.0 ± 0.0 |
| Leu | 6.714 ± 0.683 | 0.466 ± 0.277 | 5.502 ± 0.71 | 5.968 ± 0.899 | 2.145 ± 0.408 | 4.849 ± 0.672 | 1.212 ± 0.497 | 4.103 ± 0.579 | 7.553 ± 0.926 | 6.994 ± 0.955 | 1.865 ± 0.328 | 5.222 ± 0.848 | 2.424 ± 0.497 | 3.357 ± 0.452 | 4.383 ± 0.73 | 6.994 ± 1.01 | 4.383 ± 0.593 | 4.942 ± 0.675 | 0.466 ± 0.239 | 2.611 ± 0.459 | 0.0 ± 0.0 |
| Met | 2.891 ± 0.528 | 0.0 ± 0.0 | 1.492 ± 0.356 | 1.492 ± 0.353 | 0.653 ± 0.205 | 0.932 ± 0.257 | 0.466 ± 0.217 | 1.305 ± 0.414 | 1.958 ± 0.366 | 1.399 ± 0.355 | 0.653 ± 0.335 | 1.958 ± 0.457 | 0.932 ± 0.338 | 0.932 ± 0.287 | 0.839 ± 0.29 | 2.797 ± 0.483 | 1.772 ± 0.33 | 1.585 ± 0.327 | 0.186 ± 0.149 | 1.026 ± 0.215 | 0.0 ± 0.0 |
| Asn | 5.688 ± 0.999 | 0.093 ± 0.107 | 3.077 ± 0.494 | 2.611 ± 0.583 | 1.585 ± 0.453 | 5.688 ± 0.814 | 1.585 ± 0.395 | 3.637 ± 0.537 | 3.357 ± 0.577 | 4.662 ± 0.79 | 1.119 ± 0.314 | 2.984 ± 0.586 | 1.492 ± 0.295 | 2.518 ± 0.476 | 2.891 ± 0.517 | 3.45 ± 0.449 | 3.637 ± 0.635 | 2.891 ± 0.783 | 1.399 ± 0.336 | 1.678 ± 0.404 | 0.0 ± 0.0 |
| Pro | 2.611 ± 0.648 | 0.0 ± 0.0 | 2.984 ± 0.49 | 2.611 ± 0.456 | 1.678 ± 0.411 | 2.238 ± 0.47 | 0.466 ± 0.227 | 2.331 ± 0.502 | 1.772 ± 0.384 | 2.518 ± 0.638 | 0.373 ± 0.136 | 1.865 ± 0.418 | 0.746 ± 0.351 | 0.839 ± 0.26 | 1.492 ± 0.377 | 2.424 ± 0.429 | 2.424 ± 0.521 | 1.772 ± 0.423 | 0.186 ± 0.117 | 0.932 ± 0.254 | 0.0 ± 0.0 |
| Gln | 3.077 ± 0.629 | 0.093 ± 0.089 | 2.238 ± 0.413 | 2.051 ± 0.308 | 1.026 ± 0.247 | 3.357 ± 0.563 | 0.839 ± 0.305 | 3.357 ± 0.49 | 3.077 ± 0.553 | 4.756 ± 0.646 | 1.865 ± 0.422 | 2.051 ± 0.414 | 1.212 ± 0.271 | 2.331 ± 0.438 | 2.145 ± 0.546 | 2.331 ± 0.484 | 4.289 ± 0.657 | 3.73 ± 0.569 | 0.932 ± 0.28 | 1.305 ± 0.272 | 0.0 ± 0.0 |
| Arg | 2.984 ± 0.393 | 0.186 ± 0.128 | 3.45 ± 0.59 | 2.797 ± 0.562 | 1.212 ± 0.357 | 2.424 ± 0.329 | 0.932 ± 0.417 | 3.264 ± 0.579 | 3.077 ± 0.782 | 4.196 ± 0.59 | 0.839 ± 0.298 | 2.611 ± 0.579 | 1.305 ± 0.429 | 1.772 ± 0.506 | 2.238 ± 0.466 | 3.357 ± 0.703 | 2.518 ± 0.44 | 2.424 ± 0.52 | 1.026 ± 0.31 | 1.772 ± 0.478 | 0.0 ± 0.0 |
| Ser | 5.781 ± 1.007 | 0.28 ± 0.143 | 3.73 ± 0.561 | 4.383 ± 0.571 | 2.051 ± 0.316 | 4.756 ± 0.735 | 0.932 ± 0.28 | 5.502 ± 0.805 | 3.916 ± 0.594 | 5.595 ± 0.817 | 1.678 ± 0.396 | 3.45 ± 0.695 | 2.424 ± 0.497 | 3.264 ± 0.446 | 2.891 ± 0.744 | 5.315 ± 0.858 | 4.756 ± 0.935 | 4.569 ± 0.717 | 0.932 ± 0.26 | 1.678 ± 0.464 | 0.0 ± 0.0 |
| Thr | 5.781 ± 1.15 | 0.093 ± 0.086 | 3.823 ± 0.79 | 2.984 ± 0.611 | 3.077 ± 0.481 | 4.476 ± 0.857 | 1.119 ± 0.327 | 4.662 ± 0.483 | 2.891 ± 0.518 | 6.434 ± 0.725 | 1.678 ± 0.365 | 2.891 ± 0.477 | 1.865 ± 0.445 | 3.916 ± 0.817 | 2.518 ± 0.414 | 5.315 ± 0.772 | 3.823 ± 0.741 | 5.222 ± 0.865 | 1.305 ± 0.263 | 2.331 ± 0.375 | 0.0 ± 0.0 |
| Val | 6.341 ± 0.789 | 0.186 ± 0.121 | 4.849 ± 0.65 | 2.424 ± 0.527 | 2.145 ± 0.369 | 4.476 ± 0.693 | 0.746 ± 0.228 | 3.637 ± 0.439 | 4.103 ± 0.577 | 3.543 ± 0.494 | 1.865 ± 0.347 | 3.73 ± 0.446 | 3.077 ± 0.611 | 2.611 ± 0.454 | 3.543 ± 0.594 | 4.476 ± 0.876 | 4.196 ± 0.609 | 5.035 ± 0.862 | 1.026 ± 0.366 | 2.984 ± 0.496 | 0.0 ± 0.0 |
| Trp | 1.212 ± 0.314 | 0.186 ± 0.132 | 0.932 ± 0.213 | 0.653 ± 0.279 | 0.373 ± 0.167 | 0.839 ± 0.283 | 0.0 ± 0.0 | 0.932 ± 0.278 | 0.932 ± 0.286 | 2.424 ± 0.408 | 0.186 ± 0.13 | 1.772 ± 0.71 | 0.466 ± 0.144 | 0.559 ± 0.208 | 0.932 ± 0.354 | 0.746 ± 0.259 | 1.212 ± 0.361 | 0.466 ± 0.184 | 0.373 ± 0.175 | 0.373 ± 0.167 | 0.0 ± 0.0 |
| Tyr | 2.891 ± 0.427 | 0.186 ± 0.132 | 2.797 ± 0.525 | 1.678 ± 0.316 | 1.305 ± 0.29 | 1.865 ± 0.44 | 0.653 ± 0.238 | 1.585 ± 0.428 | 2.518 ± 0.462 | 3.17 ± 0.416 | 0.653 ± 0.195 | 1.865 ± 0.402 | 1.119 ± 0.4 | 1.865 ± 0.449 | 2.238 ± 0.445 | 1.958 ± 0.345 | 1.678 ± 0.444 | 1.958 ± 0.449 | 0.839 ± 0.273 | 1.399 ± 0.456 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 50 proteins (10725 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
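A protein-level bootstrap of this kind can be sketched as follows. This is an illustrative sketch only, not the database's actual procedure: it assumes whole proteins are resampled with replacement, that the frequency of a dipeptide is recomputed for each replicate, and that the reported error is the sample standard deviation over replicates. The function name `bootstrap_error`, the fixed seed, and the toy sequences are hypothetical.

```python
import random
from statistics import stdev


def bootstrap_error(sequences, dipeptide, n_boot=100, seed=0):
    """Estimate the error (in per mille) of one dipeptide frequency.

    Each replicate draws len(sequences) whole proteins with replacement,
    so the resampling unit is the protein rather than the single residue.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(sequences, k=len(sequences))
        hits = 0
        total = 0
        for seq in sample:
            for i in range(len(seq) - 1):
                total += 1
                if seq[i:i + 2] == dipeptide:
                    hits += 1
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the error is the sample standard deviation of the replicates.
    return stdev(estimates)


# Made-up toy sequences, not taken from the phage proteome.
toy_proteins = ["MAAK", "AKL", "KLMAA"]
print(bootstrap_error(toy_proteins, "AK"))
print(bootstrap_error(toy_proteins, "WW"))  # absent dipeptide, error is 0.0
```

A dipeptide that never occurs has the same frequency of zero in every replicate, which is consistent with the 0.0 ± 0.0 entries in the table.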

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski