Amino acid dipepetide frequency for Lactobacillus phage 3-SAC12

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.388AlaAla: 4.388 ± 0.863
0.47AlaCys: 0.47 ± 0.195
5.014AlaAsp: 5.014 ± 0.709
4.388AlaGlu: 4.388 ± 0.625
2.272AlaPhe: 2.272 ± 0.329
6.582AlaGly: 6.582 ± 1.431
0.784AlaHis: 0.784 ± 0.233
5.25AlaIle: 5.25 ± 0.709
7.13AlaLys: 7.13 ± 0.984
5.563AlaLeu: 5.563 ± 0.84
1.88AlaMet: 1.88 ± 0.404
4.936AlaAsn: 4.936 ± 0.593
1.959AlaPro: 1.959 ± 0.442
2.899AlaGln: 2.899 ± 0.564
2.899AlaArg: 2.899 ± 0.561
4.544AlaSer: 4.544 ± 0.799
5.014AlaThr: 5.014 ± 0.882
3.526AlaVal: 3.526 ± 0.45
0.705AlaTrp: 0.705 ± 0.237
2.821AlaTyr: 2.821 ± 0.63
0.0AlaXaa: 0.0 ± 0.0
Cys
0.548CysAla: 0.548 ± 0.23
0.078CysCys: 0.078 ± 0.081
0.548CysAsp: 0.548 ± 0.252
0.313CysGlu: 0.313 ± 0.157
0.313CysPhe: 0.313 ± 0.196
0.47CysGly: 0.47 ± 0.208
0.235CysHis: 0.235 ± 0.14
0.548CysIle: 0.548 ± 0.182
0.313CysLys: 0.313 ± 0.156
0.548CysLeu: 0.548 ± 0.236
0.157CysMet: 0.157 ± 0.112
0.392CysAsn: 0.392 ± 0.208
0.235CysPro: 0.235 ± 0.139
0.078CysGln: 0.078 ± 0.094
0.392CysArg: 0.392 ± 0.198
0.313CysSer: 0.313 ± 0.147
0.078CysThr: 0.078 ± 0.087
0.392CysVal: 0.392 ± 0.202
0.0CysTrp: 0.0 ± 0.0
0.47CysTyr: 0.47 ± 0.22
0.0CysXaa: 0.0 ± 0.0
Asp
2.977AspAla: 2.977 ± 0.579
0.392AspCys: 0.392 ± 0.235
5.72AspAsp: 5.72 ± 0.959
4.936AspGlu: 4.936 ± 0.801
3.134AspPhe: 3.134 ± 0.496
4.153AspGly: 4.153 ± 0.486
0.784AspHis: 0.784 ± 0.243
4.779AspIle: 4.779 ± 0.903
5.406AspLys: 5.406 ± 0.604
6.033AspLeu: 6.033 ± 0.943
1.88AspMet: 1.88 ± 0.346
3.447AspAsn: 3.447 ± 0.479
2.194AspPro: 2.194 ± 0.469
1.097AspGln: 1.097 ± 0.229
1.802AspArg: 1.802 ± 0.37
3.212AspSer: 3.212 ± 0.739
3.134AspThr: 3.134 ± 0.437
3.918AspVal: 3.918 ± 0.488
0.627AspTrp: 0.627 ± 0.262
3.761AspTyr: 3.761 ± 0.614
0.0AspXaa: 0.0 ± 0.0
Glu
3.604GluAla: 3.604 ± 0.519
0.235GluCys: 0.235 ± 0.144
2.742GluAsp: 2.742 ± 0.563
4.858GluGlu: 4.858 ± 1.086
2.429GluPhe: 2.429 ± 0.479
1.959GluGly: 1.959 ± 0.349
1.019GluHis: 1.019 ± 0.312
4.388GluIle: 4.388 ± 0.771
4.701GluLys: 4.701 ± 0.784
6.425GluLeu: 6.425 ± 0.84
2.507GluMet: 2.507 ± 0.434
4.544GluAsn: 4.544 ± 0.684
1.645GluPro: 1.645 ± 0.478
2.272GluGln: 2.272 ± 0.489
2.664GluArg: 2.664 ± 0.631
3.134GluSer: 3.134 ± 0.57
3.918GluThr: 3.918 ± 0.54
4.858GluVal: 4.858 ± 0.619
0.627GluTrp: 0.627 ± 0.383
3.056GluTyr: 3.056 ± 0.504
0.0GluXaa: 0.0 ± 0.0
Phe
2.664PheAla: 2.664 ± 0.406
0.078PheCys: 0.078 ± 0.074
2.586PheAsp: 2.586 ± 0.526
2.272PheGlu: 2.272 ± 0.551
1.332PhePhe: 1.332 ± 0.341
3.291PheGly: 3.291 ± 0.724
0.548PheHis: 0.548 ± 0.16
1.959PheIle: 1.959 ± 0.377
2.586PheLys: 2.586 ± 0.384
2.351PheLeu: 2.351 ± 0.464
0.548PheMet: 0.548 ± 0.237
3.369PheAsn: 3.369 ± 0.555
0.47PhePro: 0.47 ± 0.21
0.627PheGln: 0.627 ± 0.204
1.097PheArg: 1.097 ± 0.294
3.526PheSer: 3.526 ± 0.548
1.88PheThr: 1.88 ± 0.353
2.977PheVal: 2.977 ± 0.538
0.627PheTrp: 0.627 ± 0.252
1.175PheTyr: 1.175 ± 0.381
0.0PheXaa: 0.0 ± 0.0
Gly
4.858GlyAla: 4.858 ± 1.418
0.157GlyCys: 0.157 ± 0.108
3.604GlyAsp: 3.604 ± 0.518
4.153GlyGlu: 4.153 ± 0.585
2.272GlyPhe: 2.272 ± 0.388
5.328GlyGly: 5.328 ± 1.352
1.019GlyHis: 1.019 ± 0.397
5.171GlyIle: 5.171 ± 0.552
6.895GlyLys: 6.895 ± 1.253
4.858GlyLeu: 4.858 ± 0.546
2.194GlyMet: 2.194 ± 0.408
3.526GlyAsn: 3.526 ± 0.625
1.019GlyPro: 1.019 ± 0.292
1.724GlyGln: 1.724 ± 0.274
2.037GlyArg: 2.037 ± 0.453
5.406GlySer: 5.406 ± 0.809
3.996GlyThr: 3.996 ± 0.795
5.093GlyVal: 5.093 ± 0.681
1.254GlyTrp: 1.254 ± 0.42
3.526GlyTyr: 3.526 ± 0.508
0.0GlyXaa: 0.0 ± 0.0
His
1.019HisAla: 1.019 ± 0.341
0.0HisCys: 0.0 ± 0.0
1.254HisAsp: 1.254 ± 0.4
1.097HisGlu: 1.097 ± 0.36
0.705HisPhe: 0.705 ± 0.228
1.41HisGly: 1.41 ± 0.367
0.313HisHis: 0.313 ± 0.229
1.175HisIle: 1.175 ± 0.341
1.489HisLys: 1.489 ± 0.35
0.784HisLeu: 0.784 ± 0.221
0.47HisMet: 0.47 ± 0.219
0.94HisAsn: 0.94 ± 0.316
0.47HisPro: 0.47 ± 0.193
0.548HisGln: 0.548 ± 0.198
0.47HisArg: 0.47 ± 0.228
1.175HisSer: 1.175 ± 0.277
0.94HisThr: 0.94 ± 0.26
1.332HisVal: 1.332 ± 0.345
0.157HisTrp: 0.157 ± 0.094
0.705HisTyr: 0.705 ± 0.209
0.0HisXaa: 0.0 ± 0.0
Ile
5.485IleAla: 5.485 ± 0.711
0.47IleCys: 0.47 ± 0.234
6.033IleAsp: 6.033 ± 0.889
3.839IleGlu: 3.839 ± 0.832
2.586IlePhe: 2.586 ± 0.385
4.231IleGly: 4.231 ± 0.771
1.019IleHis: 1.019 ± 0.271
4.779IleIle: 4.779 ± 0.884
5.798IleLys: 5.798 ± 0.811
3.056IleLeu: 3.056 ± 0.468
1.88IleMet: 1.88 ± 0.391
5.093IleAsn: 5.093 ± 0.657
2.115IlePro: 2.115 ± 0.415
2.351IleGln: 2.351 ± 0.464
2.037IleArg: 2.037 ± 0.528
5.25IleSer: 5.25 ± 0.583
4.779IleThr: 4.779 ± 0.637
5.171IleVal: 5.171 ± 0.618
0.235IleTrp: 0.235 ± 0.168
2.272IleTyr: 2.272 ± 0.447
0.0IleXaa: 0.0 ± 0.0
Lys
7.443LysAla: 7.443 ± 1.195
0.47LysCys: 0.47 ± 0.18
3.526LysAsp: 3.526 ± 0.49
4.779LysGlu: 4.779 ± 0.809
1.802LysPhe: 1.802 ± 0.346
3.996LysGly: 3.996 ± 0.531
1.567LysHis: 1.567 ± 0.333
7.13LysIle: 7.13 ± 0.798
8.697LysLys: 8.697 ± 1.662
6.738LysLeu: 6.738 ± 0.868
2.194LysMet: 2.194 ± 0.472
7.443LysAsn: 7.443 ± 0.754
2.821LysPro: 2.821 ± 0.536
3.918LysGln: 3.918 ± 0.483
4.309LysArg: 4.309 ± 0.581
7.365LysSer: 7.365 ± 0.916
5.72LysThr: 5.72 ± 0.664
4.309LysVal: 4.309 ± 0.641
0.862LysTrp: 0.862 ± 0.236
3.683LysTyr: 3.683 ± 0.636
0.0LysXaa: 0.0 ± 0.0
Leu
5.406LeuAla: 5.406 ± 0.695
0.47LeuCys: 0.47 ± 0.187
5.328LeuAsp: 5.328 ± 0.784
4.936LeuGlu: 4.936 ± 0.619
2.821LeuPhe: 2.821 ± 0.461
4.231LeuGly: 4.231 ± 0.563
1.175LeuHis: 1.175 ± 0.256
5.328LeuIle: 5.328 ± 0.712
6.817LeuLys: 6.817 ± 0.963
3.604LeuLeu: 3.604 ± 0.524
1.567LeuMet: 1.567 ± 0.267
4.936LeuAsn: 4.936 ± 0.929
1.802LeuPro: 1.802 ± 0.332
1.802LeuGln: 1.802 ± 0.355
3.134LeuArg: 3.134 ± 0.59
5.876LeuSer: 5.876 ± 0.844
5.014LeuThr: 5.014 ± 0.872
4.153LeuVal: 4.153 ± 0.708
0.392LeuTrp: 0.392 ± 0.197
2.507LeuTyr: 2.507 ± 0.602
0.0LeuXaa: 0.0 ± 0.0
Met
2.507MetAla: 2.507 ± 0.446
0.313MetCys: 0.313 ± 0.139
0.627MetAsp: 0.627 ± 0.285
1.254MetGlu: 1.254 ± 0.299
1.175MetPhe: 1.175 ± 0.264
2.115MetGly: 2.115 ± 0.44
0.47MetHis: 0.47 ± 0.205
1.802MetIle: 1.802 ± 0.385
2.586MetLys: 2.586 ± 0.415
1.724MetLeu: 1.724 ± 0.341
0.627MetMet: 0.627 ± 0.241
1.802MetAsn: 1.802 ± 0.446
1.489MetPro: 1.489 ± 0.291
1.175MetGln: 1.175 ± 0.258
1.489MetArg: 1.489 ± 0.367
2.037MetSer: 2.037 ± 0.377
1.802MetThr: 1.802 ± 0.375
1.567MetVal: 1.567 ± 0.335
0.078MetTrp: 0.078 ± 0.076
1.019MetTyr: 1.019 ± 0.247
0.0MetXaa: 0.0 ± 0.0
Asn
5.485AsnAla: 5.485 ± 0.588
0.313AsnCys: 0.313 ± 0.145
5.563AsnAsp: 5.563 ± 0.697
3.134AsnGlu: 3.134 ± 0.547
1.88AsnPhe: 1.88 ± 0.352
7.208AsnGly: 7.208 ± 0.757
1.41AsnHis: 1.41 ± 0.316
3.134AsnIle: 3.134 ± 0.504
5.014AsnLys: 5.014 ± 0.682
4.074AsnLeu: 4.074 ± 0.514
1.332AsnMet: 1.332 ± 0.288
4.858AsnAsn: 4.858 ± 0.5
2.742AsnPro: 2.742 ± 0.405
2.586AsnGln: 2.586 ± 0.42
3.212AsnArg: 3.212 ± 0.554
6.346AsnSer: 6.346 ± 1.026
4.074AsnThr: 4.074 ± 0.57
2.977AsnVal: 2.977 ± 0.397
0.705AsnTrp: 0.705 ± 0.249
2.351AsnTyr: 2.351 ± 0.399
0.0AsnXaa: 0.0 ± 0.0
Pro
3.291ProAla: 3.291 ± 0.721
0.078ProCys: 0.078 ± 0.07
2.194ProAsp: 2.194 ± 0.569
2.821ProGlu: 2.821 ± 0.524
0.94ProPhe: 0.94 ± 0.286
0.47ProGly: 0.47 ± 0.173
0.392ProHis: 0.392 ± 0.174
2.037ProIle: 2.037 ± 0.449
2.507ProLys: 2.507 ± 0.532
2.821ProLeu: 2.821 ± 0.406
0.47ProMet: 0.47 ± 0.173
1.567ProAsn: 1.567 ± 0.331
0.47ProPro: 0.47 ± 0.197
0.784ProGln: 0.784 ± 0.225
0.627ProArg: 0.627 ± 0.249
1.567ProSer: 1.567 ± 0.278
2.586ProThr: 2.586 ± 0.404
2.194ProVal: 2.194 ± 0.493
0.392ProTrp: 0.392 ± 0.179
1.332ProTyr: 1.332 ± 0.372
0.0ProXaa: 0.0 ± 0.0
Gln
2.115GlnAla: 2.115 ± 0.481
0.313GlnCys: 0.313 ± 0.166
2.194GlnAsp: 2.194 ± 0.432
2.037GlnGlu: 2.037 ± 0.437
1.254GlnPhe: 1.254 ± 0.365
1.959GlnGly: 1.959 ± 0.44
0.705GlnHis: 0.705 ± 0.218
2.194GlnIle: 2.194 ± 0.325
2.899GlnLys: 2.899 ± 0.405
2.899GlnLeu: 2.899 ± 0.602
1.332GlnMet: 1.332 ± 0.312
2.037GlnAsn: 2.037 ± 0.418
0.784GlnPro: 0.784 ± 0.239
1.41GlnGln: 1.41 ± 0.377
1.724GlnArg: 1.724 ± 0.406
2.586GlnSer: 2.586 ± 0.424
1.645GlnThr: 1.645 ± 0.482
1.959GlnVal: 1.959 ± 0.402
0.548GlnTrp: 0.548 ± 0.216
0.862GlnTyr: 0.862 ± 0.312
0.0GlnXaa: 0.0 ± 0.0
Arg
3.056ArgAla: 3.056 ± 0.521
0.47ArgCys: 0.47 ± 0.204
1.175ArgAsp: 1.175 ± 0.322
2.194ArgGlu: 2.194 ± 0.472
1.645ArgPhe: 1.645 ± 0.436
2.429ArgGly: 2.429 ± 0.407
0.94ArgHis: 0.94 ± 0.293
2.507ArgIle: 2.507 ± 0.454
3.604ArgLys: 3.604 ± 0.558
3.291ArgLeu: 3.291 ± 0.463
1.802ArgMet: 1.802 ± 0.336
2.586ArgAsn: 2.586 ± 0.467
0.862ArgPro: 0.862 ± 0.266
1.489ArgGln: 1.489 ± 0.35
1.645ArgArg: 1.645 ± 0.482
1.645ArgSer: 1.645 ± 0.317
1.254ArgThr: 1.254 ± 0.336
2.742ArgVal: 2.742 ± 0.484
0.627ArgTrp: 0.627 ± 0.239
2.194ArgTyr: 2.194 ± 0.487
0.0ArgXaa: 0.0 ± 0.0
Ser
5.014SerAla: 5.014 ± 0.642
0.47SerCys: 0.47 ± 0.193
4.936SerAsp: 4.936 ± 0.63
4.623SerGlu: 4.623 ± 0.768
2.664SerPhe: 2.664 ± 0.36
6.503SerGly: 6.503 ± 1.37
0.94SerHis: 0.94 ± 0.474
4.858SerIle: 4.858 ± 0.665
5.798SerLys: 5.798 ± 1.009
4.858SerLeu: 4.858 ± 0.58
1.41SerMet: 1.41 ± 0.319
4.466SerAsn: 4.466 ± 0.616
2.586SerPro: 2.586 ± 0.491
2.507SerGln: 2.507 ± 0.488
2.115SerArg: 2.115 ± 0.373
4.466SerSer: 4.466 ± 0.668
3.291SerThr: 3.291 ± 0.662
5.406SerVal: 5.406 ± 0.593
1.175SerTrp: 1.175 ± 0.361
2.586SerTyr: 2.586 ± 0.469
0.0SerXaa: 0.0 ± 0.0
Thr
4.779ThrAla: 4.779 ± 0.631
0.392ThrCys: 0.392 ± 0.197
3.291ThrAsp: 3.291 ± 0.504
3.291ThrGlu: 3.291 ± 0.467
2.977ThrPhe: 2.977 ± 0.477
4.309ThrGly: 4.309 ± 0.63
0.705ThrHis: 0.705 ± 0.286
4.074ThrIle: 4.074 ± 0.557
4.936ThrLys: 4.936 ± 0.574
4.388ThrLeu: 4.388 ± 0.548
2.272ThrMet: 2.272 ± 0.406
3.761ThrAsn: 3.761 ± 0.594
2.664ThrPro: 2.664 ± 0.566
1.645ThrGln: 1.645 ± 0.376
2.272ThrArg: 2.272 ± 0.341
3.996ThrSer: 3.996 ± 0.665
3.761ThrThr: 3.761 ± 0.723
5.014ThrVal: 5.014 ± 0.688
0.627ThrTrp: 0.627 ± 0.238
2.115ThrTyr: 2.115 ± 0.485
0.0ThrXaa: 0.0 ± 0.0
Val
4.231ValAla: 4.231 ± 0.601
0.47ValCys: 0.47 ± 0.202
3.918ValAsp: 3.918 ± 0.581
3.761ValGlu: 3.761 ± 0.663
1.802ValPhe: 1.802 ± 0.435
3.447ValGly: 3.447 ± 0.705
1.332ValHis: 1.332 ± 0.271
4.309ValIle: 4.309 ± 0.635
7.287ValLys: 7.287 ± 0.91
4.231ValLeu: 4.231 ± 0.666
1.645ValMet: 1.645 ± 0.431
5.014ValAsn: 5.014 ± 0.575
1.645ValPro: 1.645 ± 0.371
2.586ValGln: 2.586 ± 0.445
1.959ValArg: 1.959 ± 0.371
4.623ValSer: 4.623 ± 0.655
4.701ValThr: 4.701 ± 0.572
4.309ValVal: 4.309 ± 0.734
0.705ValTrp: 0.705 ± 0.226
3.291ValTyr: 3.291 ± 0.54
0.0ValXaa: 0.0 ± 0.0
Trp
1.019TrpAla: 1.019 ± 0.288
0.313TrpCys: 0.313 ± 0.181
0.392TrpAsp: 0.392 ± 0.182
0.392TrpGlu: 0.392 ± 0.178
0.47TrpPhe: 0.47 ± 0.204
1.019TrpGly: 1.019 ± 0.32
0.0TrpHis: 0.0 ± 0.0
0.784TrpIle: 0.784 ± 0.302
0.627TrpLys: 0.627 ± 0.255
0.94TrpLeu: 0.94 ± 0.335
0.078TrpMet: 0.078 ± 0.079
0.94TrpAsn: 0.94 ± 0.249
0.392TrpPro: 0.392 ± 0.19
0.548TrpGln: 0.548 ± 0.222
0.392TrpArg: 0.392 ± 0.176
0.94TrpSer: 0.94 ± 0.212
0.705TrpThr: 0.705 ± 0.272
0.784TrpVal: 0.784 ± 0.255
0.157TrpTrp: 0.157 ± 0.104
0.078TrpTyr: 0.078 ± 0.087
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.134TyrAla: 3.134 ± 0.392
0.47TyrCys: 0.47 ± 0.256
2.742TyrAsp: 2.742 ± 0.63
2.507TyrGlu: 2.507 ± 0.536
1.489TyrPhe: 1.489 ± 0.314
2.977TyrGly: 2.977 ± 0.43
1.097TyrHis: 1.097 ± 0.295
2.272TyrIle: 2.272 ± 0.506
3.683TyrLys: 3.683 ± 0.637
2.115TyrLeu: 2.115 ± 0.487
1.332TyrMet: 1.332 ± 0.294
2.742TyrAsn: 2.742 ± 0.522
1.175TyrPro: 1.175 ± 0.377
1.41TyrGln: 1.41 ± 0.26
1.959TyrArg: 1.959 ± 0.446
2.664TyrSer: 2.664 ± 0.502
2.977TyrThr: 2.977 ± 0.547
2.586TyrVal: 2.586 ± 0.38
0.47TyrTrp: 0.47 ± 0.214
1.332TyrTyr: 1.332 ± 0.513
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 61 proteins (12764 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski