Amino acid dipepetide frequency for Acinetobacter phage LZ35

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.993AlaAla: 4.993 ± 0.831
0.434AlaCys: 0.434 ± 0.159
3.907AlaAsp: 3.907 ± 0.545
3.835AlaGlu: 3.835 ± 0.443
2.46AlaPhe: 2.46 ± 0.403
4.776AlaGly: 4.776 ± 0.677
1.158AlaHis: 1.158 ± 0.275
5.861AlaIle: 5.861 ± 0.605
5.861AlaLys: 5.861 ± 0.646
6.657AlaLeu: 6.657 ± 0.73
2.243AlaMet: 2.243 ± 0.523
3.98AlaAsn: 3.98 ± 0.601
2.388AlaPro: 2.388 ± 0.455
3.546AlaGln: 3.546 ± 0.582
2.677AlaArg: 2.677 ± 0.44
4.414AlaSer: 4.414 ± 0.62
4.631AlaThr: 4.631 ± 0.707
3.835AlaVal: 3.835 ± 0.548
0.941AlaTrp: 0.941 ± 0.254
2.75AlaTyr: 2.75 ± 0.471
0.0AlaXaa: 0.0 ± 0.0
Cys
0.651CysAla: 0.651 ± 0.246
0.217CysCys: 0.217 ± 0.128
0.868CysAsp: 0.868 ± 0.238
1.013CysGlu: 1.013 ± 0.312
0.724CysPhe: 0.724 ± 0.214
0.724CysGly: 0.724 ± 0.245
0.217CysHis: 0.217 ± 0.138
0.434CysIle: 0.434 ± 0.185
1.302CysLys: 1.302 ± 0.346
1.158CysLeu: 1.158 ± 0.303
0.217CysMet: 0.217 ± 0.127
0.362CysAsn: 0.362 ± 0.173
0.507CysPro: 0.507 ± 0.175
0.217CysGln: 0.217 ± 0.125
0.507CysArg: 0.507 ± 0.204
0.724CysSer: 0.724 ± 0.221
0.434CysThr: 0.434 ± 0.206
0.941CysVal: 0.941 ± 0.269
0.217CysTrp: 0.217 ± 0.129
0.796CysTyr: 0.796 ± 0.263
0.0CysXaa: 0.0 ± 0.0
Asp
4.559AspAla: 4.559 ± 0.583
0.579AspCys: 0.579 ± 0.196
3.907AspAsp: 3.907 ± 0.724
4.559AspGlu: 4.559 ± 0.692
3.039AspPhe: 3.039 ± 0.447
4.993AspGly: 4.993 ± 0.781
0.579AspHis: 0.579 ± 0.219
3.618AspIle: 3.618 ± 0.615
4.703AspLys: 4.703 ± 0.707
4.269AspLeu: 4.269 ± 0.658
1.447AspMet: 1.447 ± 0.284
2.171AspAsn: 2.171 ± 0.383
1.592AspPro: 1.592 ± 0.329
2.822AspGln: 2.822 ± 0.442
2.967AspArg: 2.967 ± 0.48
3.329AspSer: 3.329 ± 0.62
3.184AspThr: 3.184 ± 0.543
4.124AspVal: 4.124 ± 0.584
1.302AspTrp: 1.302 ± 0.272
2.894AspTyr: 2.894 ± 0.429
0.0AspXaa: 0.0 ± 0.0
Glu
5.716GluAla: 5.716 ± 0.65
0.289GluCys: 0.289 ± 0.146
3.763GluAsp: 3.763 ± 0.675
4.631GluGlu: 4.631 ± 0.732
3.763GluPhe: 3.763 ± 0.632
4.052GluGly: 4.052 ± 0.477
1.013GluHis: 1.013 ± 0.244
4.92GluIle: 4.92 ± 0.647
4.631GluLys: 4.631 ± 0.588
5.427GluLeu: 5.427 ± 0.823
1.954GluMet: 1.954 ± 0.377
3.184GluAsn: 3.184 ± 0.442
1.592GluPro: 1.592 ± 0.394
2.967GluGln: 2.967 ± 0.505
1.52GluArg: 1.52 ± 0.362
4.92GluSer: 4.92 ± 0.512
2.026GluThr: 2.026 ± 0.356
4.414GluVal: 4.414 ± 0.577
1.447GluTrp: 1.447 ± 0.372
3.907GluTyr: 3.907 ± 0.483
0.0GluXaa: 0.0 ± 0.0
Phe
3.618PheAla: 3.618 ± 0.546
0.796PheCys: 0.796 ± 0.216
3.401PheAsp: 3.401 ± 0.52
2.46PheGlu: 2.46 ± 0.504
1.158PhePhe: 1.158 ± 0.291
2.822PheGly: 2.822 ± 0.376
0.724PheHis: 0.724 ± 0.253
3.546PheIle: 3.546 ± 0.482
3.039PheLys: 3.039 ± 0.467
2.533PheLeu: 2.533 ± 0.42
1.592PheMet: 1.592 ± 0.316
3.184PheAsn: 3.184 ± 0.536
1.158PhePro: 1.158 ± 0.298
1.375PheGln: 1.375 ± 0.3
1.737PheArg: 1.737 ± 0.336
2.388PheSer: 2.388 ± 0.445
2.894PheThr: 2.894 ± 0.518
2.605PheVal: 2.605 ± 0.412
0.868PheTrp: 0.868 ± 0.27
2.822PheTyr: 2.822 ± 0.455
0.0PheXaa: 0.0 ± 0.0
Gly
5.355GlyAla: 5.355 ± 0.882
0.579GlyCys: 0.579 ± 0.253
3.473GlyAsp: 3.473 ± 0.544
4.631GlyGlu: 4.631 ± 0.43
4.776GlyPhe: 4.776 ± 0.532
4.776GlyGly: 4.776 ± 0.656
1.013GlyHis: 1.013 ± 0.234
4.486GlyIle: 4.486 ± 0.627
4.848GlyLys: 4.848 ± 0.566
5.427GlyLeu: 5.427 ± 0.594
2.243GlyMet: 2.243 ± 0.397
4.124GlyAsn: 4.124 ± 0.491
0.724GlyPro: 0.724 ± 0.282
2.388GlyGln: 2.388 ± 0.389
2.605GlyArg: 2.605 ± 0.401
4.559GlySer: 4.559 ± 0.631
3.184GlyThr: 3.184 ± 0.473
6.223GlyVal: 6.223 ± 0.651
1.302GlyTrp: 1.302 ± 0.3
3.039GlyTyr: 3.039 ± 0.41
0.0GlyXaa: 0.0 ± 0.0
His
1.375HisAla: 1.375 ± 0.315
0.145HisCys: 0.145 ± 0.1
0.796HisAsp: 0.796 ± 0.227
1.52HisGlu: 1.52 ± 0.37
0.289HisPhe: 0.289 ± 0.198
1.375HisGly: 1.375 ± 0.326
0.362HisHis: 0.362 ± 0.148
1.375HisIle: 1.375 ± 0.275
1.085HisLys: 1.085 ± 0.271
1.447HisLeu: 1.447 ± 0.339
0.507HisMet: 0.507 ± 0.213
0.796HisAsn: 0.796 ± 0.23
0.651HisPro: 0.651 ± 0.23
0.724HisGln: 0.724 ± 0.242
0.362HisArg: 0.362 ± 0.17
0.507HisSer: 0.507 ± 0.208
0.579HisThr: 0.579 ± 0.177
0.868HisVal: 0.868 ± 0.277
0.072HisTrp: 0.072 ± 0.069
1.085HisTyr: 1.085 ± 0.311
0.0HisXaa: 0.0 ± 0.0
Ile
4.342IleAla: 4.342 ± 0.637
0.868IleCys: 0.868 ± 0.283
5.137IleAsp: 5.137 ± 0.579
5.933IleGlu: 5.933 ± 0.747
2.243IlePhe: 2.243 ± 0.425
4.197IleGly: 4.197 ± 0.534
1.447IleHis: 1.447 ± 0.386
3.907IleIle: 3.907 ± 0.565
7.019IleLys: 7.019 ± 0.787
4.776IleLeu: 4.776 ± 0.545
1.23IleMet: 1.23 ± 0.269
3.98IleAsn: 3.98 ± 0.507
3.546IlePro: 3.546 ± 0.505
2.243IleGln: 2.243 ± 0.468
2.677IleArg: 2.677 ± 0.362
4.414IleSer: 4.414 ± 0.664
4.486IleThr: 4.486 ± 0.728
4.269IleVal: 4.269 ± 0.54
0.796IleTrp: 0.796 ± 0.232
2.026IleTyr: 2.026 ± 0.353
0.0IleXaa: 0.0 ± 0.0
Lys
6.151LysAla: 6.151 ± 0.898
0.868LysCys: 0.868 ± 0.293
4.124LysAsp: 4.124 ± 0.663
5.644LysGlu: 5.644 ± 0.774
3.039LysPhe: 3.039 ± 0.497
5.355LysGly: 5.355 ± 0.544
1.302LysHis: 1.302 ± 0.31
5.065LysIle: 5.065 ± 0.748
5.282LysLys: 5.282 ± 0.73
6.295LysLeu: 6.295 ± 0.626
2.605LysMet: 2.605 ± 0.436
4.197LysAsn: 4.197 ± 0.59
2.46LysPro: 2.46 ± 0.467
2.677LysGln: 2.677 ± 0.432
3.473LysArg: 3.473 ± 0.505
5.21LysSer: 5.21 ± 0.627
4.631LysThr: 4.631 ± 0.462
5.282LysVal: 5.282 ± 0.677
0.868LysTrp: 0.868 ± 0.24
2.388LysTyr: 2.388 ± 0.447
0.0LysXaa: 0.0 ± 0.0
Leu
6.223LeuAla: 6.223 ± 0.73
0.651LeuCys: 0.651 ± 0.231
5.282LeuAsp: 5.282 ± 0.652
6.368LeuGlu: 6.368 ± 0.671
2.967LeuPhe: 2.967 ± 0.572
5.355LeuGly: 5.355 ± 0.639
1.013LeuHis: 1.013 ± 0.31
5.716LeuIle: 5.716 ± 0.583
6.295LeuLys: 6.295 ± 0.755
5.861LeuLeu: 5.861 ± 0.73
2.315LeuMet: 2.315 ± 0.366
6.223LeuAsn: 6.223 ± 0.592
1.737LeuPro: 1.737 ± 0.378
1.954LeuGln: 1.954 ± 0.373
3.039LeuArg: 3.039 ± 0.451
5.427LeuSer: 5.427 ± 0.577
4.124LeuThr: 4.124 ± 0.511
4.703LeuVal: 4.703 ± 0.481
0.724LeuTrp: 0.724 ± 0.256
2.171LeuTyr: 2.171 ± 0.36
0.0LeuXaa: 0.0 ± 0.0
Met
1.881MetAla: 1.881 ± 0.362
0.579MetCys: 0.579 ± 0.203
1.447MetAsp: 1.447 ± 0.4
1.737MetGlu: 1.737 ± 0.375
1.302MetPhe: 1.302 ± 0.373
1.737MetGly: 1.737 ± 0.478
0.217MetHis: 0.217 ± 0.123
1.809MetIle: 1.809 ± 0.336
1.954MetLys: 1.954 ± 0.37
2.026MetLeu: 2.026 ± 0.365
0.507MetMet: 0.507 ± 0.171
2.46MetAsn: 2.46 ± 0.464
0.941MetPro: 0.941 ± 0.257
1.664MetGln: 1.664 ± 0.369
1.085MetArg: 1.085 ± 0.307
2.894MetSer: 2.894 ± 0.502
2.171MetThr: 2.171 ± 0.419
1.013MetVal: 1.013 ± 0.292
0.289MetTrp: 0.289 ± 0.146
0.579MetTyr: 0.579 ± 0.209
0.0MetXaa: 0.0 ± 0.0
Asn
3.618AsnAla: 3.618 ± 0.489
1.085AsnCys: 1.085 ± 0.514
3.835AsnAsp: 3.835 ± 0.515
4.197AsnGlu: 4.197 ± 0.633
2.388AsnPhe: 2.388 ± 0.473
5.789AsnGly: 5.789 ± 0.962
1.158AsnHis: 1.158 ± 0.264
3.835AsnIle: 3.835 ± 0.6
3.401AsnLys: 3.401 ± 0.538
4.559AsnLeu: 4.559 ± 0.575
1.881AsnMet: 1.881 ± 0.4
3.98AsnAsn: 3.98 ± 0.792
2.605AsnPro: 2.605 ± 0.52
2.46AsnGln: 2.46 ± 0.359
2.026AsnArg: 2.026 ± 0.358
3.546AsnSer: 3.546 ± 0.58
3.835AsnThr: 3.835 ± 0.564
3.184AsnVal: 3.184 ± 0.495
0.434AsnTrp: 0.434 ± 0.178
2.822AsnTyr: 2.822 ± 0.537
0.0AsnXaa: 0.0 ± 0.0
Pro
1.447ProAla: 1.447 ± 0.342
0.362ProCys: 0.362 ± 0.182
2.171ProAsp: 2.171 ± 0.402
2.46ProGlu: 2.46 ± 0.402
1.302ProPhe: 1.302 ± 0.376
0.0ProGly: 0.0 ± 0.0
0.507ProHis: 0.507 ± 0.202
1.954ProIle: 1.954 ± 0.418
2.677ProLys: 2.677 ± 0.572
2.75ProLeu: 2.75 ± 0.466
0.941ProMet: 0.941 ± 0.295
2.533ProAsn: 2.533 ± 0.49
0.579ProPro: 0.579 ± 0.205
1.737ProGln: 1.737 ± 0.373
0.941ProArg: 0.941 ± 0.258
2.315ProSer: 2.315 ± 0.424
1.881ProThr: 1.881 ± 0.337
2.098ProVal: 2.098 ± 0.319
0.072ProTrp: 0.072 ± 0.065
1.809ProTyr: 1.809 ± 0.342
0.0ProXaa: 0.0 ± 0.0
Gln
2.967GlnAla: 2.967 ± 0.508
0.217GlnCys: 0.217 ± 0.117
2.315GlnAsp: 2.315 ± 0.425
2.533GlnGlu: 2.533 ± 0.473
2.026GlnPhe: 2.026 ± 0.483
2.894GlnGly: 2.894 ± 0.525
0.941GlnHis: 0.941 ± 0.283
2.098GlnIle: 2.098 ± 0.444
3.473GlnLys: 3.473 ± 0.483
3.546GlnLeu: 3.546 ± 0.588
0.941GlnMet: 0.941 ± 0.315
2.026GlnAsn: 2.026 ± 0.436
1.013GlnPro: 1.013 ± 0.276
1.737GlnGln: 1.737 ± 0.372
1.592GlnArg: 1.592 ± 0.402
2.026GlnSer: 2.026 ± 0.369
2.098GlnThr: 2.098 ± 0.355
2.171GlnVal: 2.171 ± 0.397
0.941GlnTrp: 0.941 ± 0.266
1.592GlnTyr: 1.592 ± 0.399
0.0GlnXaa: 0.0 ± 0.0
Arg
2.46ArgAla: 2.46 ± 0.453
0.796ArgCys: 0.796 ± 0.263
2.171ArgAsp: 2.171 ± 0.432
2.171ArgGlu: 2.171 ± 0.426
1.954ArgPhe: 1.954 ± 0.397
2.388ArgGly: 2.388 ± 0.476
0.724ArgHis: 0.724 ± 0.207
2.75ArgIle: 2.75 ± 0.402
3.835ArgLys: 3.835 ± 0.54
2.822ArgLeu: 2.822 ± 0.507
0.796ArgMet: 0.796 ± 0.268
1.592ArgAsn: 1.592 ± 0.327
1.23ArgPro: 1.23 ± 0.278
1.23ArgGln: 1.23 ± 0.284
1.809ArgArg: 1.809 ± 0.376
2.677ArgSer: 2.677 ± 0.479
2.026ArgThr: 2.026 ± 0.392
3.039ArgVal: 3.039 ± 0.503
0.724ArgTrp: 0.724 ± 0.23
1.592ArgTyr: 1.592 ± 0.405
0.0ArgXaa: 0.0 ± 0.0
Ser
3.401SerAla: 3.401 ± 0.563
0.868SerCys: 0.868 ± 0.279
3.256SerAsp: 3.256 ± 0.602
3.256SerGlu: 3.256 ± 0.548
2.822SerPhe: 2.822 ± 0.471
5.137SerGly: 5.137 ± 0.576
0.941SerHis: 0.941 ± 0.255
6.223SerIle: 6.223 ± 0.648
5.644SerLys: 5.644 ± 0.82
5.933SerLeu: 5.933 ± 0.613
2.171SerMet: 2.171 ± 0.376
3.401SerAsn: 3.401 ± 0.493
1.592SerPro: 1.592 ± 0.297
2.315SerGln: 2.315 ± 0.403
2.46SerArg: 2.46 ± 0.425
3.618SerSer: 3.618 ± 0.658
3.473SerThr: 3.473 ± 0.627
3.618SerVal: 3.618 ± 0.641
0.868SerTrp: 0.868 ± 0.271
2.243SerTyr: 2.243 ± 0.415
0.0SerXaa: 0.0 ± 0.0
Thr
3.98ThrAla: 3.98 ± 0.573
1.013ThrCys: 1.013 ± 0.271
3.401ThrAsp: 3.401 ± 0.42
1.954ThrGlu: 1.954 ± 0.298
2.026ThrPhe: 2.026 ± 0.33
4.559ThrGly: 4.559 ± 0.523
0.507ThrHis: 0.507 ± 0.178
3.763ThrIle: 3.763 ± 0.613
3.835ThrLys: 3.835 ± 0.661
4.559ThrLeu: 4.559 ± 0.769
1.52ThrMet: 1.52 ± 0.297
2.677ThrAsn: 2.677 ± 0.488
2.388ThrPro: 2.388 ± 0.415
1.954ThrGln: 1.954 ± 0.383
2.098ThrArg: 2.098 ± 0.366
2.388ThrSer: 2.388 ± 0.401
3.763ThrThr: 3.763 ± 0.937
4.92ThrVal: 4.92 ± 0.803
1.302ThrTrp: 1.302 ± 0.319
1.52ThrTyr: 1.52 ± 0.311
0.0ThrXaa: 0.0 ± 0.0
Val
4.631ValAla: 4.631 ± 0.658
0.796ValCys: 0.796 ± 0.277
3.763ValAsp: 3.763 ± 0.393
4.342ValGlu: 4.342 ± 0.732
3.329ValPhe: 3.329 ± 0.514
4.92ValGly: 4.92 ± 0.641
1.085ValHis: 1.085 ± 0.329
4.848ValIle: 4.848 ± 0.68
4.486ValLys: 4.486 ± 0.498
4.052ValLeu: 4.052 ± 0.665
1.809ValMet: 1.809 ± 0.374
5.427ValAsn: 5.427 ± 0.86
1.664ValPro: 1.664 ± 0.277
2.533ValGln: 2.533 ± 0.511
2.243ValArg: 2.243 ± 0.451
3.763ValSer: 3.763 ± 0.508
2.894ValThr: 2.894 ± 0.481
4.631ValVal: 4.631 ± 0.607
0.796ValTrp: 0.796 ± 0.25
2.894ValTyr: 2.894 ± 0.415
0.0ValXaa: 0.0 ± 0.0
Trp
1.375TrpAla: 1.375 ± 0.331
0.362TrpCys: 0.362 ± 0.156
1.013TrpAsp: 1.013 ± 0.269
0.507TrpGlu: 0.507 ± 0.165
0.941TrpPhe: 0.941 ± 0.252
0.724TrpGly: 0.724 ± 0.219
0.289TrpHis: 0.289 ± 0.151
0.868TrpIle: 0.868 ± 0.259
0.941TrpLys: 0.941 ± 0.237
1.23TrpLeu: 1.23 ± 0.259
0.434TrpMet: 0.434 ± 0.186
1.302TrpAsn: 1.302 ± 0.317
0.217TrpPro: 0.217 ± 0.123
0.724TrpGln: 0.724 ± 0.22
0.868TrpArg: 0.868 ± 0.285
1.085TrpSer: 1.085 ± 0.302
0.724TrpThr: 0.724 ± 0.232
0.941TrpVal: 0.941 ± 0.232
0.289TrpTrp: 0.289 ± 0.131
0.072TrpTyr: 0.072 ± 0.072
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.605TyrAla: 2.605 ± 0.385
0.724TyrCys: 0.724 ± 0.324
2.605TyrAsp: 2.605 ± 0.568
2.388TyrGlu: 2.388 ± 0.487
2.171TyrPhe: 2.171 ± 0.45
3.184TyrGly: 3.184 ± 0.587
0.724TyrHis: 0.724 ± 0.236
2.46TyrIle: 2.46 ± 0.451
2.605TyrLys: 2.605 ± 0.475
2.822TyrLeu: 2.822 ± 0.449
0.868TyrMet: 0.868 ± 0.218
3.039TyrAsn: 3.039 ± 0.513
2.026TyrPro: 2.026 ± 0.431
1.881TyrGln: 1.881 ± 0.366
2.098TyrArg: 2.098 ± 0.405
3.039TyrSer: 3.039 ± 0.443
1.085TyrThr: 1.085 ± 0.319
2.026TyrVal: 2.026 ± 0.338
0.651TyrTrp: 0.651 ± 0.233
1.085TyrTyr: 1.085 ± 0.298
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 83 proteins (13821 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski