Amino acid dipepetide frequency for Acinetobacter phage SH-Ab 15497

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.553AlaAla: 5.553 ± 1.033
0.525AlaCys: 0.525 ± 0.15
5.403AlaAsp: 5.403 ± 0.643
6.529AlaGlu: 6.529 ± 0.677
2.326AlaPhe: 2.326 ± 0.469
5.253AlaGly: 5.253 ± 0.758
1.351AlaHis: 1.351 ± 0.336
3.602AlaIle: 3.602 ± 0.636
6.303AlaLys: 6.303 ± 0.795
7.204AlaLeu: 7.204 ± 0.829
3.002AlaMet: 3.002 ± 0.482
3.677AlaAsn: 3.677 ± 0.555
3.077AlaPro: 3.077 ± 0.486
3.377AlaGln: 3.377 ± 0.606
3.977AlaArg: 3.977 ± 0.605
4.127AlaSer: 4.127 ± 0.577
4.427AlaThr: 4.427 ± 0.786
5.553AlaVal: 5.553 ± 0.716
1.126AlaTrp: 1.126 ± 0.263
3.002AlaTyr: 3.002 ± 0.539
0.0AlaXaa: 0.0 ± 0.0
Cys
0.675CysAla: 0.675 ± 0.23
0.0CysCys: 0.0 ± 0.0
0.675CysAsp: 0.675 ± 0.273
1.051CysGlu: 1.051 ± 0.263
0.225CysPhe: 0.225 ± 0.1
0.45CysGly: 0.45 ± 0.157
0.225CysHis: 0.225 ± 0.113
0.9CysIle: 0.9 ± 0.257
0.825CysLys: 0.825 ± 0.261
0.45CysLeu: 0.45 ± 0.192
0.45CysMet: 0.45 ± 0.214
0.6CysAsn: 0.6 ± 0.211
0.825CysPro: 0.825 ± 0.359
0.3CysGln: 0.3 ± 0.148
0.825CysArg: 0.825 ± 0.285
0.6CysSer: 0.6 ± 0.207
0.825CysThr: 0.825 ± 0.289
0.675CysVal: 0.675 ± 0.234
0.375CysTrp: 0.375 ± 0.185
0.45CysTyr: 0.45 ± 0.175
0.0CysXaa: 0.0 ± 0.0
Asp
4.653AspAla: 4.653 ± 0.754
0.6AspCys: 0.6 ± 0.146
4.277AspAsp: 4.277 ± 0.513
4.502AspGlu: 4.502 ± 0.725
2.176AspPhe: 2.176 ± 0.353
5.103AspGly: 5.103 ± 0.684
1.051AspHis: 1.051 ± 0.21
5.028AspIle: 5.028 ± 0.502
4.352AspLys: 4.352 ± 0.499
6.604AspLeu: 6.604 ± 0.798
1.801AspMet: 1.801 ± 0.341
2.927AspAsn: 2.927 ± 0.503
3.377AspPro: 3.377 ± 0.54
2.927AspGln: 2.927 ± 0.364
2.927AspArg: 2.927 ± 0.479
2.852AspSer: 2.852 ± 0.402
3.377AspThr: 3.377 ± 0.407
3.602AspVal: 3.602 ± 0.461
1.576AspTrp: 1.576 ± 0.302
1.726AspTyr: 1.726 ± 0.412
0.0AspXaa: 0.0 ± 0.0
Glu
5.403GluAla: 5.403 ± 0.673
1.276GluCys: 1.276 ± 0.42
4.427GluAsp: 4.427 ± 0.591
5.028GluGlu: 5.028 ± 0.609
3.827GluPhe: 3.827 ± 0.412
6.379GluGly: 6.379 ± 0.698
1.351GluHis: 1.351 ± 0.371
4.653GluIle: 4.653 ± 0.646
4.277GluLys: 4.277 ± 0.635
6.529GluLeu: 6.529 ± 0.669
2.176GluMet: 2.176 ± 0.353
2.551GluAsn: 2.551 ± 0.399
1.576GluPro: 1.576 ± 0.421
2.852GluGln: 2.852 ± 0.435
3.827GluArg: 3.827 ± 0.833
4.878GluSer: 4.878 ± 0.525
2.701GluThr: 2.701 ± 0.45
3.827GluVal: 3.827 ± 0.54
1.126GluTrp: 1.126 ± 0.244
3.152GluTyr: 3.152 ± 0.561
0.0GluXaa: 0.0 ± 0.0
Phe
2.551PheAla: 2.551 ± 0.336
0.375PheCys: 0.375 ± 0.153
2.852PheAsp: 2.852 ± 0.365
3.077PheGlu: 3.077 ± 0.586
0.75PhePhe: 0.75 ± 0.272
3.002PheGly: 3.002 ± 0.545
0.75PheHis: 0.75 ± 0.209
1.951PheIle: 1.951 ± 0.345
3.077PheLys: 3.077 ± 0.462
2.026PheLeu: 2.026 ± 0.378
0.9PheMet: 0.9 ± 0.257
1.876PheAsn: 1.876 ± 0.431
0.75PhePro: 0.75 ± 0.234
0.825PheGln: 0.825 ± 0.327
1.501PheArg: 1.501 ± 0.28
1.726PheSer: 1.726 ± 0.349
2.701PheThr: 2.701 ± 0.566
2.476PheVal: 2.476 ± 0.419
0.525PheTrp: 0.525 ± 0.219
1.201PheTyr: 1.201 ± 0.273
0.0PheXaa: 0.0 ± 0.0
Gly
6.379GlyAla: 6.379 ± 0.866
0.825GlyCys: 0.825 ± 0.243
4.953GlyAsp: 4.953 ± 0.56
4.728GlyGlu: 4.728 ± 0.596
2.476GlyPhe: 2.476 ± 0.483
6.153GlyGly: 6.153 ± 0.976
1.801GlyHis: 1.801 ± 0.311
4.277GlyIle: 4.277 ± 0.503
4.728GlyLys: 4.728 ± 0.52
4.953GlyLeu: 4.953 ± 0.477
2.251GlyMet: 2.251 ± 0.304
2.551GlyAsn: 2.551 ± 0.425
2.101GlyPro: 2.101 ± 0.386
3.002GlyGln: 3.002 ± 0.347
3.077GlyArg: 3.077 ± 0.428
4.052GlySer: 4.052 ± 0.672
4.427GlyThr: 4.427 ± 0.82
5.403GlyVal: 5.403 ± 0.596
1.051GlyTrp: 1.051 ± 0.304
2.551GlyTyr: 2.551 ± 0.411
0.0GlyXaa: 0.0 ± 0.0
His
1.201HisAla: 1.201 ± 0.33
0.6HisCys: 0.6 ± 0.182
1.051HisAsp: 1.051 ± 0.271
1.276HisGlu: 1.276 ± 0.251
0.6HisPhe: 0.6 ± 0.166
1.876HisGly: 1.876 ± 0.476
0.525HisHis: 0.525 ± 0.251
0.9HisIle: 0.9 ± 0.278
1.201HisLys: 1.201 ± 0.319
2.401HisLeu: 2.401 ± 0.565
0.45HisMet: 0.45 ± 0.166
0.75HisAsn: 0.75 ± 0.287
1.276HisPro: 1.276 ± 0.277
0.9HisGln: 0.9 ± 0.295
0.75HisArg: 0.75 ± 0.247
1.501HisSer: 1.501 ± 0.35
0.825HisThr: 0.825 ± 0.266
2.101HisVal: 2.101 ± 0.38
0.3HisTrp: 0.3 ± 0.131
0.75HisTyr: 0.75 ± 0.217
0.0HisXaa: 0.0 ± 0.0
Ile
5.178IleAla: 5.178 ± 0.592
0.976IleCys: 0.976 ± 0.248
4.502IleAsp: 4.502 ± 0.465
4.427IleGlu: 4.427 ± 0.534
2.326IlePhe: 2.326 ± 0.439
4.052IleGly: 4.052 ± 0.527
1.501IleHis: 1.501 ± 0.387
3.602IleIle: 3.602 ± 0.579
4.277IleLys: 4.277 ± 0.548
3.827IleLeu: 3.827 ± 0.431
2.026IleMet: 2.026 ± 0.4
3.377IleAsn: 3.377 ± 0.391
3.152IlePro: 3.152 ± 0.414
2.476IleGln: 2.476 ± 0.49
3.452IleArg: 3.452 ± 0.581
3.152IleSer: 3.152 ± 0.541
3.902IleThr: 3.902 ± 0.591
3.677IleVal: 3.677 ± 0.576
0.375IleTrp: 0.375 ± 0.141
1.651IleTyr: 1.651 ± 0.298
0.0IleXaa: 0.0 ± 0.0
Lys
6.303LysAla: 6.303 ± 0.742
0.75LysCys: 0.75 ± 0.287
4.127LysAsp: 4.127 ± 0.473
5.103LysGlu: 5.103 ± 0.804
2.852LysPhe: 2.852 ± 0.573
4.202LysGly: 4.202 ± 0.535
1.651LysHis: 1.651 ± 0.406
4.127LysIle: 4.127 ± 0.512
4.277LysLys: 4.277 ± 0.695
5.028LysLeu: 5.028 ± 0.727
2.777LysMet: 2.777 ± 0.432
2.701LysAsn: 2.701 ± 0.425
2.777LysPro: 2.777 ± 0.481
1.726LysGln: 1.726 ± 0.423
2.101LysArg: 2.101 ± 0.367
4.502LysSer: 4.502 ± 0.647
3.452LysThr: 3.452 ± 0.532
4.502LysVal: 4.502 ± 0.453
1.051LysTrp: 1.051 ± 0.249
1.801LysTyr: 1.801 ± 0.372
0.0LysXaa: 0.0 ± 0.0
Leu
6.679LeuAla: 6.679 ± 0.67
0.675LeuCys: 0.675 ± 0.233
5.253LeuAsp: 5.253 ± 0.572
4.728LeuGlu: 4.728 ± 0.734
2.777LeuPhe: 2.777 ± 0.318
5.328LeuGly: 5.328 ± 0.669
1.501LeuHis: 1.501 ± 0.399
4.352LeuIle: 4.352 ± 0.611
5.628LeuLys: 5.628 ± 0.624
4.878LeuLeu: 4.878 ± 0.734
2.401LeuMet: 2.401 ± 0.365
4.502LeuAsn: 4.502 ± 0.564
3.527LeuPro: 3.527 ± 0.512
2.777LeuGln: 2.777 ± 0.513
4.277LeuArg: 4.277 ± 0.501
4.427LeuSer: 4.427 ± 0.618
4.953LeuThr: 4.953 ± 0.621
4.728LeuVal: 4.728 ± 0.479
0.75LeuTrp: 0.75 ± 0.213
2.401LeuTyr: 2.401 ± 0.495
0.0LeuXaa: 0.0 ± 0.0
Met
2.777MetAla: 2.777 ± 0.368
0.375MetCys: 0.375 ± 0.155
1.651MetAsp: 1.651 ± 0.364
2.852MetGlu: 2.852 ± 0.472
0.825MetPhe: 0.825 ± 0.224
2.251MetGly: 2.251 ± 0.393
0.75MetHis: 0.75 ± 0.239
1.801MetIle: 1.801 ± 0.361
1.876MetLys: 1.876 ± 0.356
1.876MetLeu: 1.876 ± 0.305
0.75MetMet: 0.75 ± 0.211
1.576MetAsn: 1.576 ± 0.452
1.051MetPro: 1.051 ± 0.276
1.201MetGln: 1.201 ± 0.338
2.026MetArg: 2.026 ± 0.384
2.701MetSer: 2.701 ± 0.326
2.026MetThr: 2.026 ± 0.328
1.876MetVal: 1.876 ± 0.343
0.225MetTrp: 0.225 ± 0.13
0.825MetTyr: 0.825 ± 0.263
0.0MetXaa: 0.0 ± 0.0
Asn
3.902AsnAla: 3.902 ± 0.672
0.525AsnCys: 0.525 ± 0.183
3.152AsnAsp: 3.152 ± 0.493
2.701AsnGlu: 2.701 ± 0.432
1.126AsnPhe: 1.126 ± 0.264
4.578AsnGly: 4.578 ± 0.553
0.825AsnHis: 0.825 ± 0.223
2.852AsnIle: 2.852 ± 0.602
2.476AsnLys: 2.476 ± 0.352
3.527AsnLeu: 3.527 ± 0.475
0.976AsnMet: 0.976 ± 0.32
1.651AsnAsn: 1.651 ± 0.341
3.152AsnPro: 3.152 ± 0.512
1.951AsnGln: 1.951 ± 0.292
2.476AsnArg: 2.476 ± 0.474
2.401AsnSer: 2.401 ± 0.569
2.251AsnThr: 2.251 ± 0.418
2.626AsnVal: 2.626 ± 0.418
0.976AsnTrp: 0.976 ± 0.283
1.351AsnTyr: 1.351 ± 0.333
0.0AsnXaa: 0.0 ± 0.0
Pro
3.302ProAla: 3.302 ± 0.52
0.525ProCys: 0.525 ± 0.199
2.777ProAsp: 2.777 ± 0.333
3.677ProGlu: 3.677 ± 0.471
1.276ProPhe: 1.276 ± 0.235
0.15ProGly: 0.15 ± 0.096
0.825ProHis: 0.825 ± 0.259
3.152ProIle: 3.152 ± 0.72
2.551ProLys: 2.551 ± 0.558
2.927ProLeu: 2.927 ± 0.591
1.276ProMet: 1.276 ± 0.295
2.551ProAsn: 2.551 ± 0.489
1.726ProPro: 1.726 ± 0.42
1.351ProGln: 1.351 ± 0.299
1.501ProArg: 1.501 ± 0.401
2.927ProSer: 2.927 ± 0.509
2.927ProThr: 2.927 ± 0.373
2.476ProVal: 2.476 ± 0.364
0.675ProTrp: 0.675 ± 0.266
1.876ProTyr: 1.876 ± 0.363
0.0ProXaa: 0.0 ± 0.0
Gln
3.302GlnAla: 3.302 ± 0.58
0.3GlnCys: 0.3 ± 0.162
2.101GlnAsp: 2.101 ± 0.38
2.551GlnGlu: 2.551 ± 0.484
1.426GlnPhe: 1.426 ± 0.349
2.026GlnGly: 2.026 ± 0.402
0.976GlnHis: 0.976 ± 0.244
2.777GlnIle: 2.777 ± 0.566
2.026GlnLys: 2.026 ± 0.311
3.002GlnLeu: 3.002 ± 0.485
1.801GlnMet: 1.801 ± 0.297
1.576GlnAsn: 1.576 ± 0.341
1.576GlnPro: 1.576 ± 0.328
1.576GlnGln: 1.576 ± 0.284
1.801GlnArg: 1.801 ± 0.354
2.176GlnSer: 2.176 ± 0.353
2.101GlnThr: 2.101 ± 0.391
3.152GlnVal: 3.152 ± 0.376
0.6GlnTrp: 0.6 ± 0.223
1.126GlnTyr: 1.126 ± 0.225
0.0GlnXaa: 0.0 ± 0.0
Arg
4.502ArgAla: 4.502 ± 0.581
0.525ArgCys: 0.525 ± 0.204
3.377ArgAsp: 3.377 ± 0.525
3.677ArgGlu: 3.677 ± 0.591
2.026ArgPhe: 2.026 ± 0.383
3.827ArgGly: 3.827 ± 0.53
1.501ArgHis: 1.501 ± 0.443
3.077ArgIle: 3.077 ± 0.416
2.777ArgLys: 2.777 ± 0.433
4.127ArgLeu: 4.127 ± 0.532
1.126ArgMet: 1.126 ± 0.267
2.401ArgAsn: 2.401 ± 0.382
1.876ArgPro: 1.876 ± 0.442
1.801ArgGln: 1.801 ± 0.375
2.326ArgArg: 2.326 ± 0.391
2.626ArgSer: 2.626 ± 0.491
2.551ArgThr: 2.551 ± 0.356
3.602ArgVal: 3.602 ± 0.465
0.825ArgTrp: 0.825 ± 0.328
1.801ArgTyr: 1.801 ± 0.25
0.0ArgXaa: 0.0 ± 0.0
Ser
3.452SerAla: 3.452 ± 0.51
0.375SerCys: 0.375 ± 0.152
3.527SerAsp: 3.527 ± 0.457
3.602SerGlu: 3.602 ± 0.517
2.326SerPhe: 2.326 ± 0.402
5.028SerGly: 5.028 ± 0.617
1.126SerHis: 1.126 ± 0.246
3.677SerIle: 3.677 ± 0.487
3.902SerLys: 3.902 ± 0.631
4.803SerLeu: 4.803 ± 0.613
2.101SerMet: 2.101 ± 0.417
2.401SerAsn: 2.401 ± 0.476
2.176SerPro: 2.176 ± 0.458
2.326SerGln: 2.326 ± 0.297
3.827SerArg: 3.827 ± 0.56
3.302SerSer: 3.302 ± 0.607
3.527SerThr: 3.527 ± 0.494
3.452SerVal: 3.452 ± 0.394
0.976SerTrp: 0.976 ± 0.271
2.326SerTyr: 2.326 ± 0.436
0.0SerXaa: 0.0 ± 0.0
Thr
3.827ThrAla: 3.827 ± 0.585
0.375ThrCys: 0.375 ± 0.183
2.777ThrAsp: 2.777 ± 0.419
4.127ThrGlu: 4.127 ± 0.565
2.026ThrPhe: 2.026 ± 0.312
4.878ThrGly: 4.878 ± 0.617
0.825ThrHis: 0.825 ± 0.237
4.578ThrIle: 4.578 ± 0.512
3.902ThrLys: 3.902 ± 0.544
4.502ThrLeu: 4.502 ± 0.813
1.951ThrMet: 1.951 ± 0.402
2.626ThrAsn: 2.626 ± 0.628
2.326ThrPro: 2.326 ± 0.331
1.876ThrGln: 1.876 ± 0.289
3.077ThrArg: 3.077 ± 0.509
2.852ThrSer: 2.852 ± 0.562
3.677ThrThr: 3.677 ± 0.598
4.202ThrVal: 4.202 ± 0.68
1.351ThrTrp: 1.351 ± 0.292
2.476ThrTyr: 2.476 ± 0.555
0.0ThrXaa: 0.0 ± 0.0
Val
5.928ValAla: 5.928 ± 0.593
0.976ValCys: 0.976 ± 0.327
5.103ValAsp: 5.103 ± 0.497
5.028ValGlu: 5.028 ± 0.536
2.326ValPhe: 2.326 ± 0.401
3.677ValGly: 3.677 ± 0.59
1.126ValHis: 1.126 ± 0.27
4.127ValIle: 4.127 ± 0.6
4.127ValLys: 4.127 ± 0.428
3.902ValLeu: 3.902 ± 0.631
1.801ValMet: 1.801 ± 0.341
2.927ValAsn: 2.927 ± 0.372
2.777ValPro: 2.777 ± 0.411
2.476ValGln: 2.476 ± 0.419
3.677ValArg: 3.677 ± 0.542
3.902ValSer: 3.902 ± 0.526
4.653ValThr: 4.653 ± 0.551
5.028ValVal: 5.028 ± 0.76
1.126ValTrp: 1.126 ± 0.285
2.026ValTyr: 2.026 ± 0.325
0.0ValXaa: 0.0 ± 0.0
Trp
1.351TrpAla: 1.351 ± 0.286
0.225TrpCys: 0.225 ± 0.111
0.976TrpAsp: 0.976 ± 0.363
0.825TrpGlu: 0.825 ± 0.161
0.3TrpPhe: 0.3 ± 0.142
0.825TrpGly: 0.825 ± 0.213
0.45TrpHis: 0.45 ± 0.171
0.825TrpIle: 0.825 ± 0.229
0.675TrpLys: 0.675 ± 0.259
1.351TrpLeu: 1.351 ± 0.317
0.525TrpMet: 0.525 ± 0.193
0.675TrpAsn: 0.675 ± 0.242
0.075TrpPro: 0.075 ± 0.069
1.051TrpGln: 1.051 ± 0.263
0.825TrpArg: 0.825 ± 0.277
1.501TrpSer: 1.501 ± 0.33
0.9TrpThr: 0.9 ± 0.27
1.051TrpVal: 1.051 ± 0.267
0.375TrpTrp: 0.375 ± 0.166
1.201TrpTyr: 1.201 ± 0.253
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.476TyrAla: 2.476 ± 0.583
0.675TyrCys: 0.675 ± 0.223
2.476TyrAsp: 2.476 ± 0.392
2.326TyrGlu: 2.326 ± 0.491
0.9TyrPhe: 0.9 ± 0.237
2.701TyrGly: 2.701 ± 0.445
1.126TyrHis: 1.126 ± 0.36
1.651TyrIle: 1.651 ± 0.282
2.551TyrLys: 2.551 ± 0.454
2.626TyrLeu: 2.626 ± 0.46
0.675TyrMet: 0.675 ± 0.215
1.576TyrAsn: 1.576 ± 0.32
1.276TyrPro: 1.276 ± 0.31
1.201TyrGln: 1.201 ± 0.294
2.101TyrArg: 2.101 ± 0.469
2.026TyrSer: 2.026 ± 0.523
2.026TyrThr: 2.026 ± 0.459
2.701TyrVal: 2.701 ± 0.385
0.6TyrTrp: 0.6 ± 0.184
1.426TyrTyr: 1.426 ± 0.369
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 58 proteins (13327 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski