Amino acid dipepetide frequency for Lactococcus phage 98103

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.25AlaAla: 3.25 ± 0.691
0.451AlaCys: 0.451 ± 0.195
3.611AlaAsp: 3.611 ± 0.598
4.062AlaGlu: 4.062 ± 0.795
2.618AlaPhe: 2.618 ± 0.454
3.701AlaGly: 3.701 ± 0.737
0.632AlaHis: 0.632 ± 0.228
5.687AlaIle: 5.687 ± 0.939
4.514AlaLys: 4.514 ± 0.665
5.868AlaLeu: 5.868 ± 0.693
1.625AlaMet: 1.625 ± 0.282
4.785AlaAsn: 4.785 ± 0.534
1.535AlaPro: 1.535 ± 0.397
3.25AlaGln: 3.25 ± 0.612
2.437AlaArg: 2.437 ± 0.465
3.16AlaSer: 3.16 ± 0.492
3.792AlaThr: 3.792 ± 0.617
4.153AlaVal: 4.153 ± 0.71
1.715AlaTrp: 1.715 ± 0.484
2.076AlaTyr: 2.076 ± 0.388
0.0AlaXaa: 0.0 ± 0.0
Cys
0.09CysAla: 0.09 ± 0.082
0.0CysCys: 0.0 ± 0.0
0.632CysAsp: 0.632 ± 0.238
0.542CysGlu: 0.542 ± 0.206
0.361CysPhe: 0.361 ± 0.169
0.271CysGly: 0.271 ± 0.168
0.09CysHis: 0.09 ± 0.106
0.09CysIle: 0.09 ± 0.096
0.812CysLys: 0.812 ± 0.264
0.451CysLeu: 0.451 ± 0.217
0.181CysMet: 0.181 ± 0.163
0.271CysAsn: 0.271 ± 0.159
0.271CysPro: 0.271 ± 0.161
0.0CysGln: 0.0 ± 0.0
0.181CysArg: 0.181 ± 0.115
0.812CysSer: 0.812 ± 0.249
0.361CysThr: 0.361 ± 0.185
0.451CysVal: 0.451 ± 0.207
0.0CysTrp: 0.0 ± 0.0
0.09CysTyr: 0.09 ± 0.088
0.0CysXaa: 0.0 ± 0.0
Asp
3.069AspAla: 3.069 ± 0.519
0.271AspCys: 0.271 ± 0.138
4.243AspAsp: 4.243 ± 0.686
6.229AspGlu: 6.229 ± 0.91
3.34AspPhe: 3.34 ± 0.576
5.236AspGly: 5.236 ± 0.835
0.542AspHis: 0.542 ± 0.173
4.694AspIle: 4.694 ± 0.628
5.326AspLys: 5.326 ± 0.609
4.153AspLeu: 4.153 ± 0.511
1.535AspMet: 1.535 ± 0.414
2.799AspAsn: 2.799 ± 0.426
0.903AspPro: 0.903 ± 0.375
1.354AspGln: 1.354 ± 0.318
2.347AspArg: 2.347 ± 0.326
4.333AspSer: 4.333 ± 0.56
3.972AspThr: 3.972 ± 0.585
3.792AspVal: 3.792 ± 0.53
1.083AspTrp: 1.083 ± 0.305
2.257AspTyr: 2.257 ± 0.355
0.0AspXaa: 0.0 ± 0.0
Glu
3.521GluAla: 3.521 ± 0.572
0.271GluCys: 0.271 ± 0.143
2.347GluAsp: 2.347 ± 0.528
6.681GluGlu: 6.681 ± 1.314
4.062GluPhe: 4.062 ± 0.45
2.708GluGly: 2.708 ± 0.574
1.083GluHis: 1.083 ± 0.322
4.965GluIle: 4.965 ± 0.519
7.944GluLys: 7.944 ± 1.349
8.486GluLeu: 8.486 ± 1.05
2.167GluMet: 2.167 ± 0.499
3.972GluAsn: 3.972 ± 0.624
2.167GluPro: 2.167 ± 0.586
3.792GluGln: 3.792 ± 0.651
3.069GluArg: 3.069 ± 0.514
3.25GluSer: 3.25 ± 0.583
4.424GluThr: 4.424 ± 0.689
5.507GluVal: 5.507 ± 0.867
1.174GluTrp: 1.174 ± 0.303
3.611GluTyr: 3.611 ± 0.557
0.0GluXaa: 0.0 ± 0.0
Phe
2.708PheAla: 2.708 ± 0.519
0.542PheCys: 0.542 ± 0.201
3.34PheAsp: 3.34 ± 0.418
3.069PheGlu: 3.069 ± 0.637
1.806PhePhe: 1.806 ± 0.43
2.799PheGly: 2.799 ± 0.618
0.361PheHis: 0.361 ± 0.146
2.889PheIle: 2.889 ± 0.501
4.514PheLys: 4.514 ± 0.662
2.889PheLeu: 2.889 ± 0.555
1.444PheMet: 1.444 ± 0.445
3.16PheAsn: 3.16 ± 0.646
0.632PhePro: 0.632 ± 0.225
1.715PheGln: 1.715 ± 0.474
1.083PheArg: 1.083 ± 0.346
3.34PheSer: 3.34 ± 0.483
2.799PheThr: 2.799 ± 0.481
2.257PheVal: 2.257 ± 0.536
0.632PheTrp: 0.632 ± 0.26
1.715PheTyr: 1.715 ± 0.42
0.0PheXaa: 0.0 ± 0.0
Gly
3.431GlyAla: 3.431 ± 0.734
0.451GlyCys: 0.451 ± 0.201
2.979GlyAsp: 2.979 ± 0.584
3.069GlyGlu: 3.069 ± 0.691
2.347GlyPhe: 2.347 ± 0.425
4.424GlyGly: 4.424 ± 0.781
0.542GlyHis: 0.542 ± 0.219
5.417GlyIle: 5.417 ± 0.573
6.049GlyLys: 6.049 ± 0.751
5.778GlyLeu: 5.778 ± 1.244
1.535GlyMet: 1.535 ± 0.483
3.431GlyAsn: 3.431 ± 0.819
0.812GlyPro: 0.812 ± 0.353
2.708GlyGln: 2.708 ± 0.625
2.708GlyArg: 2.708 ± 0.529
4.153GlySer: 4.153 ± 0.674
4.424GlyThr: 4.424 ± 0.638
3.972GlyVal: 3.972 ± 0.714
0.993GlyTrp: 0.993 ± 0.34
3.069GlyTyr: 3.069 ± 0.616
0.0GlyXaa: 0.0 ± 0.0
His
1.174HisAla: 1.174 ± 0.324
0.09HisCys: 0.09 ± 0.079
0.903HisAsp: 0.903 ± 0.219
1.625HisGlu: 1.625 ± 0.393
0.722HisPhe: 0.722 ± 0.257
1.083HisGly: 1.083 ± 0.346
0.271HisHis: 0.271 ± 0.141
0.542HisIle: 0.542 ± 0.264
0.722HisLys: 0.722 ± 0.256
0.812HisLeu: 0.812 ± 0.314
0.271HisMet: 0.271 ± 0.155
0.722HisAsn: 0.722 ± 0.27
0.271HisPro: 0.271 ± 0.142
0.542HisGln: 0.542 ± 0.235
0.542HisArg: 0.542 ± 0.209
0.903HisSer: 0.903 ± 0.31
0.361HisThr: 0.361 ± 0.159
0.722HisVal: 0.722 ± 0.193
0.181HisTrp: 0.181 ± 0.126
0.722HisTyr: 0.722 ± 0.224
0.0HisXaa: 0.0 ± 0.0
Ile
4.875IleAla: 4.875 ± 0.76
0.632IleCys: 0.632 ± 0.246
3.972IleAsp: 3.972 ± 0.625
6.139IleGlu: 6.139 ± 0.834
2.347IlePhe: 2.347 ± 0.477
3.611IleGly: 3.611 ± 0.57
0.903IleHis: 0.903 ± 0.472
3.34IleIle: 3.34 ± 0.656
7.312IleLys: 7.312 ± 0.787
4.243IleLeu: 4.243 ± 0.715
1.354IleMet: 1.354 ± 0.286
5.326IleAsn: 5.326 ± 0.822
1.625IlePro: 1.625 ± 0.353
3.16IleGln: 3.16 ± 0.551
2.437IleArg: 2.437 ± 0.369
4.875IleSer: 4.875 ± 0.676
4.694IleThr: 4.694 ± 0.576
3.34IleVal: 3.34 ± 0.801
0.542IleTrp: 0.542 ± 0.276
2.257IleTyr: 2.257 ± 0.43
0.0IleXaa: 0.0 ± 0.0
Lys
6.5LysAla: 6.5 ± 1.035
0.0LysCys: 0.0 ± 0.0
6.049LysAsp: 6.049 ± 0.694
7.042LysGlu: 7.042 ± 0.961
2.708LysPhe: 2.708 ± 0.486
6.049LysGly: 6.049 ± 0.824
1.896LysHis: 1.896 ± 0.386
6.59LysIle: 6.59 ± 0.679
9.208LysLys: 9.208 ± 1.155
7.493LysLeu: 7.493 ± 1.002
2.799LysMet: 2.799 ± 0.484
6.861LysAsn: 6.861 ± 1.064
2.167LysPro: 2.167 ± 0.389
5.417LysGln: 5.417 ± 0.876
3.521LysArg: 3.521 ± 0.696
5.417LysSer: 5.417 ± 0.781
5.056LysThr: 5.056 ± 0.704
4.604LysVal: 4.604 ± 0.831
0.993LysTrp: 0.993 ± 0.303
3.431LysTyr: 3.431 ± 0.597
0.0LysXaa: 0.0 ± 0.0
Leu
4.424LeuAla: 4.424 ± 0.679
0.812LeuCys: 0.812 ± 0.27
5.778LeuAsp: 5.778 ± 0.625
5.417LeuGlu: 5.417 ± 0.784
2.708LeuPhe: 2.708 ± 0.492
4.604LeuGly: 4.604 ± 0.531
0.722LeuHis: 0.722 ± 0.253
5.236LeuIle: 5.236 ± 0.713
7.674LeuLys: 7.674 ± 1.045
6.229LeuLeu: 6.229 ± 0.827
1.986LeuMet: 1.986 ± 0.432
5.778LeuAsn: 5.778 ± 0.692
2.889LeuPro: 2.889 ± 0.505
3.882LeuGln: 3.882 ± 0.711
2.347LeuArg: 2.347 ± 0.467
6.771LeuSer: 6.771 ± 0.756
5.236LeuThr: 5.236 ± 0.702
3.611LeuVal: 3.611 ± 0.593
1.444LeuTrp: 1.444 ± 0.619
2.889LeuTyr: 2.889 ± 0.606
0.0LeuXaa: 0.0 ± 0.0
Met
2.889MetAla: 2.889 ± 0.523
0.181MetCys: 0.181 ± 0.135
1.264MetAsp: 1.264 ± 0.403
1.715MetGlu: 1.715 ± 0.53
0.722MetPhe: 0.722 ± 0.21
1.174MetGly: 1.174 ± 0.312
0.181MetHis: 0.181 ± 0.127
1.535MetIle: 1.535 ± 0.439
2.167MetLys: 2.167 ± 0.508
2.076MetLeu: 2.076 ± 0.457
0.361MetMet: 0.361 ± 0.194
2.076MetAsn: 2.076 ± 0.404
0.722MetPro: 0.722 ± 0.263
1.264MetGln: 1.264 ± 0.31
1.354MetArg: 1.354 ± 0.373
1.625MetSer: 1.625 ± 0.383
2.799MetThr: 2.799 ± 0.541
0.903MetVal: 0.903 ± 0.306
0.271MetTrp: 0.271 ± 0.146
0.542MetTyr: 0.542 ± 0.236
0.0MetXaa: 0.0 ± 0.0
Asn
4.604AsnAla: 4.604 ± 0.674
0.271AsnCys: 0.271 ± 0.148
3.521AsnAsp: 3.521 ± 0.522
4.243AsnGlu: 4.243 ± 0.622
3.069AsnPhe: 3.069 ± 0.451
5.958AsnGly: 5.958 ± 1.048
0.993AsnHis: 0.993 ± 0.441
3.069AsnIle: 3.069 ± 0.598
5.597AsnLys: 5.597 ± 0.583
5.687AsnLeu: 5.687 ± 0.697
1.535AsnMet: 1.535 ± 0.466
4.785AsnAsn: 4.785 ± 0.7
2.076AsnPro: 2.076 ± 0.415
3.521AsnGln: 3.521 ± 0.727
2.076AsnArg: 2.076 ± 0.325
4.153AsnSer: 4.153 ± 0.839
2.618AsnThr: 2.618 ± 0.524
3.882AsnVal: 3.882 ± 0.582
0.722AsnTrp: 0.722 ± 0.238
2.528AsnTyr: 2.528 ± 0.486
0.0AsnXaa: 0.0 ± 0.0
Pro
1.083ProAla: 1.083 ± 0.321
0.09ProCys: 0.09 ± 0.092
2.076ProAsp: 2.076 ± 0.501
2.347ProGlu: 2.347 ± 0.387
1.083ProPhe: 1.083 ± 0.314
0.812ProGly: 0.812 ± 0.217
0.722ProHis: 0.722 ± 0.196
1.354ProIle: 1.354 ± 0.446
2.618ProLys: 2.618 ± 0.479
2.257ProLeu: 2.257 ± 0.439
0.542ProMet: 0.542 ± 0.203
1.444ProAsn: 1.444 ± 0.399
0.451ProPro: 0.451 ± 0.176
0.993ProGln: 0.993 ± 0.26
0.542ProArg: 0.542 ± 0.26
1.264ProSer: 1.264 ± 0.387
1.535ProThr: 1.535 ± 0.397
1.806ProVal: 1.806 ± 0.402
0.181ProTrp: 0.181 ± 0.126
1.174ProTyr: 1.174 ± 0.328
0.0ProXaa: 0.0 ± 0.0
Gln
4.514GlnAla: 4.514 ± 0.762
0.361GlnCys: 0.361 ± 0.153
1.264GlnAsp: 1.264 ± 0.416
3.611GlnGlu: 3.611 ± 0.491
1.535GlnPhe: 1.535 ± 0.359
2.528GlnGly: 2.528 ± 0.666
0.451GlnHis: 0.451 ± 0.238
3.16GlnIle: 3.16 ± 0.619
3.611GlnLys: 3.611 ± 0.73
3.701GlnLeu: 3.701 ± 0.67
1.625GlnMet: 1.625 ± 0.419
2.979GlnAsn: 2.979 ± 0.526
1.444GlnPro: 1.444 ± 0.426
2.799GlnGln: 2.799 ± 0.599
1.806GlnArg: 1.806 ± 0.44
1.715GlnSer: 1.715 ± 0.541
2.528GlnThr: 2.528 ± 0.489
2.979GlnVal: 2.979 ± 0.566
0.903GlnTrp: 0.903 ± 0.289
1.806GlnTyr: 1.806 ± 0.324
0.0GlnXaa: 0.0 ± 0.0
Arg
2.076ArgAla: 2.076 ± 0.431
0.181ArgCys: 0.181 ± 0.131
2.257ArgAsp: 2.257 ± 0.434
2.708ArgGlu: 2.708 ± 0.48
2.076ArgPhe: 2.076 ± 0.465
2.076ArgGly: 2.076 ± 0.361
0.181ArgHis: 0.181 ± 0.129
2.347ArgIle: 2.347 ± 0.392
4.333ArgLys: 4.333 ± 0.619
3.972ArgLeu: 3.972 ± 0.798
1.264ArgMet: 1.264 ± 0.301
1.986ArgAsn: 1.986 ± 0.396
0.993ArgPro: 0.993 ± 0.38
1.083ArgGln: 1.083 ± 0.264
1.354ArgArg: 1.354 ± 0.343
1.986ArgSer: 1.986 ± 0.364
1.625ArgThr: 1.625 ± 0.336
2.528ArgVal: 2.528 ± 0.42
0.451ArgTrp: 0.451 ± 0.196
1.444ArgTyr: 1.444 ± 0.408
0.0ArgXaa: 0.0 ± 0.0
Ser
4.153SerAla: 4.153 ± 0.887
0.542SerCys: 0.542 ± 0.249
5.868SerAsp: 5.868 ± 0.617
4.514SerGlu: 4.514 ± 0.747
3.611SerPhe: 3.611 ± 0.593
4.514SerGly: 4.514 ± 0.739
1.174SerHis: 1.174 ± 0.308
3.431SerIle: 3.431 ± 0.539
4.604SerLys: 4.604 ± 0.707
3.701SerLeu: 3.701 ± 0.557
1.535SerMet: 1.535 ± 0.385
4.424SerAsn: 4.424 ± 0.554
1.264SerPro: 1.264 ± 0.31
2.799SerGln: 2.799 ± 0.532
2.257SerArg: 2.257 ± 0.294
4.243SerSer: 4.243 ± 0.688
3.701SerThr: 3.701 ± 0.49
4.333SerVal: 4.333 ± 0.487
0.722SerTrp: 0.722 ± 0.205
2.618SerTyr: 2.618 ± 0.401
0.0SerXaa: 0.0 ± 0.0
Thr
4.424ThrAla: 4.424 ± 0.648
0.181ThrCys: 0.181 ± 0.127
3.972ThrAsp: 3.972 ± 0.463
4.424ThrGlu: 4.424 ± 0.751
3.069ThrPhe: 3.069 ± 0.44
4.965ThrGly: 4.965 ± 0.548
0.542ThrHis: 0.542 ± 0.228
4.424ThrIle: 4.424 ± 0.796
5.507ThrLys: 5.507 ± 0.557
4.424ThrLeu: 4.424 ± 0.62
1.174ThrMet: 1.174 ± 0.322
3.16ThrAsn: 3.16 ± 0.537
1.354ThrPro: 1.354 ± 0.342
1.896ThrGln: 1.896 ± 0.373
2.799ThrArg: 2.799 ± 0.545
3.16ThrSer: 3.16 ± 0.512
4.062ThrThr: 4.062 ± 0.514
4.514ThrVal: 4.514 ± 0.688
0.542ThrTrp: 0.542 ± 0.259
1.806ThrTyr: 1.806 ± 0.373
0.0ThrXaa: 0.0 ± 0.0
Val
3.34ValAla: 3.34 ± 0.503
0.271ValCys: 0.271 ± 0.226
4.604ValAsp: 4.604 ± 0.874
4.875ValGlu: 4.875 ± 0.718
2.437ValPhe: 2.437 ± 0.386
3.069ValGly: 3.069 ± 0.533
0.993ValHis: 0.993 ± 0.307
3.882ValIle: 3.882 ± 0.469
6.5ValLys: 6.5 ± 0.969
4.153ValLeu: 4.153 ± 0.507
1.174ValMet: 1.174 ± 0.349
3.882ValAsn: 3.882 ± 0.63
1.354ValPro: 1.354 ± 0.357
1.896ValGln: 1.896 ± 0.582
1.535ValArg: 1.535 ± 0.354
5.236ValSer: 5.236 ± 0.726
4.243ValThr: 4.243 ± 0.676
4.785ValVal: 4.785 ± 0.722
0.812ValTrp: 0.812 ± 0.218
1.715ValTyr: 1.715 ± 0.389
0.0ValXaa: 0.0 ± 0.0
Trp
1.174TrpAla: 1.174 ± 0.28
0.09TrpCys: 0.09 ± 0.096
0.722TrpAsp: 0.722 ± 0.39
0.812TrpGlu: 0.812 ± 0.298
0.812TrpPhe: 0.812 ± 0.24
0.361TrpGly: 0.361 ± 0.183
0.271TrpHis: 0.271 ± 0.147
1.715TrpIle: 1.715 ± 0.299
1.625TrpLys: 1.625 ± 0.395
0.722TrpLeu: 0.722 ± 0.268
0.181TrpMet: 0.181 ± 0.109
1.083TrpAsn: 1.083 ± 0.499
0.09TrpPro: 0.09 ± 0.088
1.174TrpGln: 1.174 ± 0.345
0.722TrpArg: 0.722 ± 0.252
0.542TrpSer: 0.542 ± 0.23
0.722TrpThr: 0.722 ± 0.264
0.722TrpVal: 0.722 ± 0.261
0.361TrpTrp: 0.361 ± 0.176
0.361TrpTyr: 0.361 ± 0.183
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.625TyrAla: 1.625 ± 0.336
0.271TyrCys: 0.271 ± 0.143
2.076TyrAsp: 2.076 ± 0.392
2.076TyrGlu: 2.076 ± 0.464
2.257TyrPhe: 2.257 ± 0.504
2.347TyrGly: 2.347 ± 0.534
0.632TyrHis: 0.632 ± 0.261
2.618TyrIle: 2.618 ± 0.616
3.16TyrLys: 3.16 ± 0.532
3.431TyrLeu: 3.431 ± 0.756
1.444TyrMet: 1.444 ± 0.341
1.986TyrAsn: 1.986 ± 0.437
1.354TyrPro: 1.354 ± 0.341
2.076TyrGln: 2.076 ± 0.436
1.806TyrArg: 1.806 ± 0.463
3.069TyrSer: 3.069 ± 0.5
1.444TyrThr: 1.444 ± 0.399
1.896TyrVal: 1.896 ± 0.371
0.542TyrTrp: 0.542 ± 0.193
1.174TyrTyr: 1.174 ± 0.32
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 58 proteins (11078 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski