Amino acid dipeptide frequency for Celeribacter phage P12053L

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each one would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility, dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
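
The counting described above can be sketched in Python. This is a minimal illustration, not the code used to build this page: it assumes that dipeptides are counted as overlapping residue pairs within each protein, that non-standard residues are pooled as X (Xaa), and that "more than expected" means above the uniform value of 1/400. The function names and the toy sequences are hypothetical.

```python
from collections import Counter

# The 20 standard residues in one-letter code; anything else is pooled as "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation over the 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / (len(STANDARD) ** 2)


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X before pairing (assumption).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille):
    """Label a frequency relative to the uniform expectation (assumed threshold)."""
    if freq_permille > EXPECTED_PERMILLE:
        return "over"
    if freq_permille < EXPECTED_PERMILLE:
        return "under"
    return "expected"


if __name__ == "__main__":
    toy = ["MAAAK", "MKAL"]  # made-up sequences, not taken from this proteome
    for pair, freq in sorted(dipeptide_permille(toy).items()):
        print(pair, round(freq, 3), classify(freq))
```

With the values from the table, `classify(12.122)` (AlaAla) gives "over" and `classify(0.08)` (CysCys) gives "under" under this assumed threshold.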

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 12.122 ± 2.979
AlaCys: 0.319 ± 0.175
AlaAsp: 5.981 ± 0.681
AlaGlu: 5.423 ± 0.514
AlaPhe: 2.791 ± 0.408
AlaGly: 6.54 ± 0.899
AlaHis: 0.718 ± 0.258
AlaIle: 4.386 ± 0.594
AlaLys: 6.938 ± 0.806
AlaLeu: 6.619 ± 0.913
AlaMet: 2.233 ± 0.361
AlaAsn: 3.35 ± 0.578
AlaPro: 1.994 ± 0.287
AlaGln: 3.669 ± 0.502
AlaArg: 3.908 ± 0.711
AlaSer: 6.619 ± 1.75
AlaThr: 7.337 ± 1.879
AlaVal: 5.742 ± 0.91
AlaTrp: 1.515 ± 0.376
AlaTyr: 3.031 ± 0.406
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.718 ± 0.271
CysCys: 0.08 ± 0.073
CysAsp: 0.558 ± 0.201
CysGlu: 0.479 ± 0.243
CysPhe: 0.16 ± 0.168
CysGly: 0.558 ± 0.255
CysHis: 0.399 ± 0.238
CysIle: 0.319 ± 0.181
CysLys: 1.196 ± 0.454
CysLeu: 0.877 ± 0.275
CysMet: 0.16 ± 0.12
CysAsn: 0.638 ± 0.258
CysPro: 0.718 ± 0.278
CysGln: 0.479 ± 0.199
CysArg: 0.479 ± 0.196
CysSer: 0.479 ± 0.19
CysThr: 0.16 ± 0.108
CysVal: 0.319 ± 0.204
CysTrp: 0.08 ± 0.069
CysTyr: 0.638 ± 0.218
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.184 ± 0.693
AspCys: 0.718 ± 0.341
AspAsp: 3.669 ± 0.486
AspGlu: 4.307 ± 0.665
AspPhe: 3.669 ± 0.582
AspGly: 5.343 ± 0.594
AspHis: 1.515 ± 0.4
AspIle: 4.705 ± 0.724
AspLys: 3.031 ± 0.489
AspLeu: 5.024 ± 0.513
AspMet: 1.117 ± 0.339
AspAsn: 3.031 ± 0.542
AspPro: 3.908 ± 0.482
AspGln: 1.196 ± 0.294
AspArg: 2.951 ± 0.512
AspSer: 3.748 ± 0.529
AspThr: 5.822 ± 0.718
AspVal: 5.104 ± 0.566
AspTrp: 1.276 ± 0.381
AspTyr: 2.871 ± 0.549
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.779 ± 0.777
GluCys: 0.399 ± 0.183
GluAsp: 5.423 ± 0.825
GluGlu: 5.662 ± 0.655
GluPhe: 2.871 ± 0.391
GluGly: 5.343 ± 0.565
GluHis: 0.877 ± 0.261
GluIle: 3.908 ± 0.552
GluLys: 3.669 ± 0.558
GluLeu: 6.141 ± 0.846
GluMet: 2.153 ± 0.437
GluAsn: 2.712 ± 0.551
GluPro: 1.675 ± 0.42
GluGln: 2.153 ± 0.507
GluArg: 3.988 ± 0.546
GluSer: 2.951 ± 0.446
GluThr: 5.343 ± 0.869
GluVal: 4.865 ± 0.835
GluTrp: 1.117 ± 0.325
GluTyr: 2.712 ± 0.485
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.834 ± 0.447
PheCys: 0.16 ± 0.096
PheAsp: 3.19 ± 0.509
PheGlu: 2.472 ± 0.51
PhePhe: 0.957 ± 0.295
PheGly: 2.712 ± 0.593
PheHis: 0.798 ± 0.193
PheIle: 2.153 ± 0.389
PheLys: 2.871 ± 0.368
PheLeu: 1.994 ± 0.29
PheMet: 1.037 ± 0.287
PheAsn: 2.552 ± 0.43
PhePro: 0.877 ± 0.213
PheGln: 0.798 ± 0.305
PheArg: 1.436 ± 0.371
PheSer: 3.031 ± 0.537
PheThr: 2.632 ± 0.586
PheVal: 1.994 ± 0.412
PheTrp: 0.638 ± 0.201
PheTyr: 1.834 ± 0.464
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.546 ± 0.8
GlyCys: 0.718 ± 0.273
GlyAsp: 4.147 ± 0.541
GlyGlu: 4.626 ± 0.546
GlyPhe: 2.871 ± 0.485
GlyGly: 5.423 ± 0.805
GlyHis: 1.515 ± 0.375
GlyIle: 2.233 ± 0.351
GlyLys: 5.264 ± 0.559
GlyLeu: 6.46 ± 0.962
GlyMet: 2.153 ± 0.387
GlyAsn: 2.951 ± 0.429
GlyPro: 1.356 ± 0.379
GlyGln: 2.632 ± 0.424
GlyArg: 3.35 ± 0.546
GlySer: 5.742 ± 0.784
GlyThr: 5.822 ± 0.861
GlyVal: 4.865 ± 0.618
GlyTrp: 1.356 ± 0.35
GlyTyr: 3.19 ± 0.553
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.037 ± 0.288
HisCys: 0.319 ± 0.181
HisAsp: 0.479 ± 0.234
HisGlu: 0.798 ± 0.253
HisPhe: 0.638 ± 0.245
HisGly: 1.117 ± 0.296
HisHis: 0.16 ± 0.102
HisIle: 0.798 ± 0.253
HisLys: 1.276 ± 0.311
HisLeu: 1.914 ± 0.469
HisMet: 0.479 ± 0.196
HisAsn: 0.877 ± 0.287
HisPro: 0.399 ± 0.254
HisGln: 0.638 ± 0.293
HisArg: 1.117 ± 0.303
HisSer: 0.957 ± 0.207
HisThr: 0.798 ± 0.242
HisVal: 1.436 ± 0.302
HisTrp: 0.16 ± 0.137
HisTyr: 0.558 ± 0.184
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.147 ± 0.541
IleCys: 0.718 ± 0.229
IleAsp: 3.908 ± 0.495
IleGlu: 3.748 ± 0.585
IlePhe: 1.276 ± 0.331
IleGly: 3.11 ± 0.472
IleHis: 0.399 ± 0.213
IleIle: 1.755 ± 0.391
IleLys: 3.429 ± 0.583
IleLeu: 3.19 ± 0.581
IleMet: 1.196 ± 0.225
IleAsn: 2.153 ± 0.438
IlePro: 2.233 ± 0.451
IleGln: 1.356 ± 0.456
IleArg: 2.233 ± 0.502
IleSer: 3.908 ± 0.541
IleThr: 3.35 ± 0.542
IleVal: 3.19 ± 0.475
IleTrp: 0.239 ± 0.159
IleTyr: 2.233 ± 0.459
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.3 ± 0.886
LysCys: 0.319 ± 0.239
LysAsp: 4.945 ± 0.661
LysGlu: 6.141 ± 0.861
LysPhe: 1.914 ± 0.399
LysGly: 4.785 ± 0.842
LysHis: 1.356 ± 0.479
LysIle: 3.031 ± 0.591
LysLys: 4.785 ± 0.571
LysLeu: 5.184 ± 0.645
LysMet: 1.994 ± 0.341
LysAsn: 2.153 ± 0.474
LysPro: 2.313 ± 0.432
LysGln: 2.552 ± 0.488
LysArg: 2.951 ± 0.597
LysSer: 4.227 ± 0.71
LysThr: 4.147 ± 0.511
LysVal: 4.147 ± 0.442
LysTrp: 0.718 ± 0.211
LysTyr: 2.472 ± 0.538
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.981 ± 0.663
LeuCys: 0.718 ± 0.268
LeuAsp: 7.178 ± 0.756
LeuGlu: 6.141 ± 0.686
LeuPhe: 2.791 ± 0.576
LeuGly: 5.742 ± 0.701
LeuHis: 1.276 ± 0.451
LeuIle: 2.712 ± 0.465
LeuLys: 5.583 ± 0.824
LeuLeu: 6.221 ± 0.847
LeuMet: 2.552 ± 0.501
LeuAsn: 4.307 ± 0.557
LeuPro: 2.393 ± 0.536
LeuGln: 3.429 ± 0.584
LeuArg: 3.509 ± 0.422
LeuSer: 6.46 ± 0.777
LeuThr: 5.503 ± 0.585
LeuVal: 4.945 ± 0.54
LeuTrp: 0.877 ± 0.296
LeuTyr: 2.632 ± 0.504
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.791 ± 0.468
MetCys: 0.399 ± 0.194
MetAsp: 1.356 ± 0.354
MetGlu: 1.675 ± 0.377
MetPhe: 0.718 ± 0.226
MetGly: 2.393 ± 0.503
MetHis: 0.319 ± 0.15
MetIle: 0.798 ± 0.235
MetLys: 2.313 ± 0.466
MetLeu: 2.393 ± 0.435
MetMet: 0.638 ± 0.191
MetAsn: 0.957 ± 0.297
MetPro: 0.798 ± 0.276
MetGln: 0.877 ± 0.289
MetArg: 1.117 ± 0.247
MetSer: 2.552 ± 0.449
MetThr: 1.914 ± 0.394
MetVal: 0.798 ± 0.204
MetTrp: 0.319 ± 0.147
MetTyr: 0.638 ± 0.206
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.147 ± 1.145
AsnCys: 0.957 ± 0.326
AsnAsp: 3.19 ± 0.478
AsnGlu: 2.472 ± 0.476
AsnPhe: 1.356 ± 0.428
AsnGly: 3.828 ± 0.628
AsnHis: 0.319 ± 0.138
AsnIle: 3.35 ± 0.559
AsnLys: 3.19 ± 0.506
AsnLeu: 3.589 ± 0.629
AsnMet: 1.037 ± 0.291
AsnAsn: 3.11 ± 0.51
AsnPro: 2.074 ± 0.34
AsnGln: 1.276 ± 0.26
AsnArg: 2.074 ± 0.469
AsnSer: 2.074 ± 0.328
AsnThr: 2.712 ± 0.392
AsnVal: 3.35 ± 0.361
AsnTrp: 0.239 ± 0.159
AsnTyr: 1.755 ± 0.338
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.552 ± 0.359
ProCys: 0.479 ± 0.231
ProAsp: 2.871 ± 0.486
ProGlu: 2.632 ± 0.457
ProPhe: 1.436 ± 0.314
ProGly: 0.239 ± 0.119
ProHis: 0.558 ± 0.215
ProIle: 1.595 ± 0.308
ProLys: 1.834 ± 0.461
ProLeu: 2.153 ± 0.341
ProMet: 0.877 ± 0.231
ProAsn: 1.675 ± 0.335
ProPro: 0.718 ± 0.295
ProGln: 1.834 ± 0.322
ProArg: 1.356 ± 0.32
ProSer: 3.669 ± 0.573
ProThr: 2.472 ± 0.43
ProVal: 2.313 ± 0.444
ProTrp: 0.399 ± 0.177
ProTyr: 0.798 ± 0.211
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.227 ± 0.74
GlnCys: 0.08 ± 0.073
GlnAsp: 1.914 ± 0.377
GlnGlu: 2.552 ± 0.477
GlnPhe: 1.037 ± 0.284
GlnGly: 2.313 ± 0.358
GlnHis: 0.798 ± 0.209
GlnIle: 1.356 ± 0.463
GlnLys: 2.472 ± 0.376
GlnLeu: 3.031 ± 0.447
GlnMet: 1.196 ± 0.28
GlnAsn: 1.515 ± 0.573
GlnPro: 0.877 ± 0.245
GlnGln: 1.755 ± 0.375
GlnArg: 1.675 ± 0.363
GlnSer: 2.153 ± 0.352
GlnThr: 2.074 ± 0.407
GlnVal: 2.791 ± 0.523
GlnTrp: 0.239 ± 0.111
GlnTyr: 1.117 ± 0.388
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.147 ± 0.639
ArgCys: 0.798 ± 0.307
ArgAsp: 3.27 ± 0.476
ArgGlu: 2.951 ± 0.463
ArgPhe: 1.276 ± 0.311
ArgGly: 3.589 ± 0.628
ArgHis: 1.196 ± 0.351
ArgIle: 2.791 ± 0.57
ArgLys: 3.11 ± 0.529
ArgLeu: 3.828 ± 0.645
ArgMet: 1.196 ± 0.318
ArgAsn: 2.074 ± 0.397
ArgPro: 1.117 ± 0.28
ArgGln: 1.755 ± 0.356
ArgArg: 2.153 ± 0.536
ArgSer: 2.712 ± 0.485
ArgThr: 1.914 ± 0.309
ArgVal: 3.828 ± 0.509
ArgTrp: 0.638 ± 0.241
ArgTyr: 1.834 ± 0.295
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.417 ± 1.716
SerCys: 0.718 ± 0.316
SerAsp: 4.147 ± 0.587
SerGlu: 4.945 ± 0.915
SerPhe: 3.11 ± 0.444
SerGly: 5.423 ± 0.964
SerHis: 0.957 ± 0.253
SerIle: 2.951 ± 0.366
SerLys: 4.147 ± 0.642
SerLeu: 5.662 ± 0.747
SerMet: 1.356 ± 0.336
SerAsn: 3.27 ± 0.464
SerPro: 2.552 ± 0.527
SerGln: 2.632 ± 0.522
SerArg: 1.994 ± 0.431
SerSer: 4.785 ± 0.717
SerThr: 4.147 ± 0.848
SerVal: 4.626 ± 0.516
SerTrp: 1.515 ± 0.382
SerTyr: 2.472 ± 0.401
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.018 ± 1.398
ThrCys: 0.399 ± 0.183
ThrAsp: 4.626 ± 0.478
ThrGlu: 4.227 ± 0.52
ThrPhe: 2.871 ± 0.611
ThrGly: 4.626 ± 0.693
ThrHis: 1.196 ± 0.315
ThrIle: 2.791 ± 0.405
ThrLys: 4.147 ± 0.545
ThrLeu: 5.662 ± 0.603
ThrMet: 1.436 ± 0.368
ThrAsn: 3.031 ± 1.033
ThrPro: 2.393 ± 0.354
ThrGln: 2.791 ± 0.521
ThrArg: 3.908 ± 0.508
ThrSer: 5.264 ± 1.613
ThrThr: 3.828 ± 0.754
ThrVal: 4.546 ± 0.597
ThrTrp: 0.798 ± 0.296
ThrTyr: 2.791 ± 0.527
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.38 ± 0.682
ValCys: 0.798 ± 0.309
ValAsp: 4.466 ± 0.713
ValGlu: 5.343 ± 0.672
ValPhe: 2.393 ± 0.436
ValGly: 5.024 ± 0.582
ValHis: 0.798 ± 0.27
ValIle: 3.429 ± 0.569
ValLys: 4.147 ± 0.776
ValLeu: 5.822 ± 0.551
ValMet: 1.356 ± 0.251
ValAsn: 3.35 ± 0.56
ValPro: 2.313 ± 0.413
ValGln: 1.914 ± 0.354
ValArg: 2.552 ± 0.603
ValSer: 4.227 ± 0.784
ValThr: 4.785 ± 0.6
ValVal: 5.503 ± 0.727
ValTrp: 0.718 ± 0.264
ValTyr: 1.914 ± 0.366
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.356 ± 0.28
TrpCys: 0.08 ± 0.075
TrpAsp: 0.399 ± 0.156
TrpGlu: 0.877 ± 0.275
TrpPhe: 0.798 ± 0.295
TrpGly: 0.798 ± 0.302
TrpHis: 0.239 ± 0.17
TrpIle: 0.558 ± 0.281
TrpLys: 0.877 ± 0.307
TrpLeu: 1.515 ± 0.405
TrpMet: 0.479 ± 0.189
TrpAsn: 0.638 ± 0.209
TrpPro: 0.638 ± 0.234
TrpGln: 0.399 ± 0.185
TrpArg: 1.117 ± 0.355
TrpSer: 0.399 ± 0.225
TrpThr: 0.877 ± 0.306
TrpVal: 0.877 ± 0.296
TrpTrp: 0.16 ± 0.106
TrpTyr: 0.718 ± 0.311
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.552 ± 0.492
TyrCys: 0.319 ± 0.189
TyrAsp: 2.472 ± 0.418
TyrGlu: 3.27 ± 0.541
TyrPhe: 1.037 ± 0.303
TyrGly: 2.313 ± 0.316
TyrHis: 0.558 ± 0.212
TyrIle: 2.153 ± 0.375
TyrLys: 2.153 ± 0.393
TyrLeu: 3.669 ± 0.594
TyrMet: 1.037 ± 0.262
TyrAsn: 1.755 ± 0.351
TyrPro: 1.117 ± 0.367
TyrGln: 1.037 ± 0.254
TyrArg: 2.313 ± 0.517
TyrSer: 2.951 ± 0.504
TyrThr: 2.712 ± 0.477
TyrVal: 1.994 ± 0.431
TyrTrp: 0.798 ± 0.291
TyrTyr: 1.994 ± 0.504
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 56 proteins (12540 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
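
A protein-level bootstrap of this kind can be sketched as follows. This is an illustration under stated assumptions, not the procedure used for this page: it assumes that whole proteins are resampled with replacement, that the reported error is the standard deviation of the frequency over the 100 replicates, and that dipeptides are counted as overlapping pairs. The names `pair_counts` and `bootstrap_permille` are hypothetical, and sequences are given in one-letter code.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_permille(proteins, pair, n_boot=100, seed=0):
    """Return (frequency, error) in per mille for one dipeptide.

    Proteins are resampled with replacement; the error is the standard
    deviation of the frequency across the n_boot replicates (assumption).
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]

    def freq(sample):
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        return 1000.0 * hits / total if total else 0.0

    point = freq(per_protein)
    replicates = [
        freq(rng.choices(per_protein, k=len(per_protein)))
        for _ in range(n_boot)
    ]
    return point, statistics.pstdev(replicates)


if __name__ == "__main__":
    toy = ["MAAAK", "MKAL", "AAGA"]  # made-up sequences, not from this proteome
    value, error = bootstrap_permille(toy, "AA")
    print(f"AA: {value:.3f} ± {error:.3f}")
```

Resampling whole proteins rather than individual dipeptides keeps each protein's internal composition intact, so the error reflects variation between proteins.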

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski