Amino acid dipepetide frequency for Lactococcus phage 53801

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.42AlaAla: 4.42 ± 0.647
0.354AlaCys: 0.354 ± 0.167
3.536AlaAsp: 3.536 ± 0.545
4.155AlaGlu: 4.155 ± 0.71
2.829AlaPhe: 2.829 ± 0.449
4.596AlaGly: 4.596 ± 0.858
0.972AlaHis: 0.972 ± 0.342
4.508AlaIle: 4.508 ± 0.69
5.48AlaLys: 5.48 ± 0.747
5.392AlaLeu: 5.392 ± 0.645
2.387AlaMet: 2.387 ± 0.356
4.862AlaAsn: 4.862 ± 0.614
1.591AlaPro: 1.591 ± 0.416
3.094AlaGln: 3.094 ± 0.468
2.033AlaArg: 2.033 ± 0.587
3.713AlaSer: 3.713 ± 0.575
3.713AlaThr: 3.713 ± 0.581
3.005AlaVal: 3.005 ± 0.597
1.414AlaTrp: 1.414 ± 0.263
2.917AlaTyr: 2.917 ± 0.535
0.0AlaXaa: 0.0 ± 0.0
Cys
0.088CysAla: 0.088 ± 0.082
0.088CysCys: 0.088 ± 0.077
0.972CysAsp: 0.972 ± 0.284
0.707CysGlu: 0.707 ± 0.216
0.088CysPhe: 0.088 ± 0.085
0.884CysGly: 0.884 ± 0.333
0.265CysHis: 0.265 ± 0.176
0.354CysIle: 0.354 ± 0.155
0.354CysLys: 0.354 ± 0.188
0.354CysLeu: 0.354 ± 0.187
0.0CysMet: 0.0 ± 0.0
0.265CysAsn: 0.265 ± 0.135
0.619CysPro: 0.619 ± 0.248
0.0CysGln: 0.0 ± 0.0
0.619CysArg: 0.619 ± 0.164
0.796CysSer: 0.796 ± 0.254
0.354CysThr: 0.354 ± 0.188
0.265CysVal: 0.265 ± 0.127
0.0CysTrp: 0.0 ± 0.0
0.265CysTyr: 0.265 ± 0.141
0.0CysXaa: 0.0 ± 0.0
Asp
2.652AspAla: 2.652 ± 0.464
0.354AspCys: 0.354 ± 0.176
4.508AspAsp: 4.508 ± 0.55
5.127AspGlu: 5.127 ± 0.82
3.447AspPhe: 3.447 ± 0.689
5.127AspGly: 5.127 ± 0.824
0.354AspHis: 0.354 ± 0.166
5.48AspIle: 5.48 ± 0.586
5.922AspLys: 5.922 ± 0.83
4.066AspLeu: 4.066 ± 0.537
1.591AspMet: 1.591 ± 0.348
4.243AspAsn: 4.243 ± 0.469
0.796AspPro: 0.796 ± 0.244
1.414AspGln: 1.414 ± 0.287
1.945AspArg: 1.945 ± 0.362
3.713AspSer: 3.713 ± 0.593
4.331AspThr: 4.331 ± 0.589
4.066AspVal: 4.066 ± 0.444
1.503AspTrp: 1.503 ± 0.283
3.182AspTyr: 3.182 ± 0.651
0.0AspXaa: 0.0 ± 0.0
Glu
3.801GluAla: 3.801 ± 0.616
0.354GluCys: 0.354 ± 0.163
2.563GluAsp: 2.563 ± 0.586
5.038GluGlu: 5.038 ± 0.932
3.624GluPhe: 3.624 ± 0.475
3.005GluGly: 3.005 ± 0.536
1.326GluHis: 1.326 ± 0.432
4.95GluIle: 4.95 ± 0.652
5.48GluLys: 5.48 ± 0.945
7.602GluLeu: 7.602 ± 0.897
1.856GluMet: 1.856 ± 0.372
4.066GluAsn: 4.066 ± 0.574
2.298GluPro: 2.298 ± 0.412
3.801GluGln: 3.801 ± 0.609
3.094GluArg: 3.094 ± 0.471
3.447GluSer: 3.447 ± 0.596
3.801GluThr: 3.801 ± 0.583
4.508GluVal: 4.508 ± 0.685
1.238GluTrp: 1.238 ± 0.345
2.121GluTyr: 2.121 ± 0.335
0.0GluXaa: 0.0 ± 0.0
Phe
2.652PheAla: 2.652 ± 0.478
0.972PheCys: 0.972 ± 0.279
4.331PheAsp: 4.331 ± 0.465
3.182PheGlu: 3.182 ± 0.576
1.149PhePhe: 1.149 ± 0.352
2.652PheGly: 2.652 ± 0.495
0.884PheHis: 0.884 ± 0.259
3.536PheIle: 3.536 ± 0.498
4.155PheLys: 4.155 ± 0.626
2.121PheLeu: 2.121 ± 0.412
1.679PheMet: 1.679 ± 0.424
2.21PheAsn: 2.21 ± 0.479
0.707PhePro: 0.707 ± 0.285
2.033PheGln: 2.033 ± 0.445
1.768PheArg: 1.768 ± 0.374
3.624PheSer: 3.624 ± 0.561
2.74PheThr: 2.74 ± 0.522
2.652PheVal: 2.652 ± 0.498
0.354PheTrp: 0.354 ± 0.16
1.503PheTyr: 1.503 ± 0.456
0.0PheXaa: 0.0 ± 0.0
Gly
4.155GlyAla: 4.155 ± 0.782
0.53GlyCys: 0.53 ± 0.176
3.536GlyAsp: 3.536 ± 0.647
3.005GlyGlu: 3.005 ± 0.654
2.298GlyPhe: 2.298 ± 0.399
5.392GlyGly: 5.392 ± 0.982
0.884GlyHis: 0.884 ± 0.251
6.63GlyIle: 6.63 ± 0.952
5.569GlyLys: 5.569 ± 0.666
3.889GlyLeu: 3.889 ± 0.607
2.121GlyMet: 2.121 ± 0.415
3.889GlyAsn: 3.889 ± 0.64
1.061GlyPro: 1.061 ± 0.434
2.121GlyGln: 2.121 ± 0.578
2.387GlyArg: 2.387 ± 0.474
5.48GlySer: 5.48 ± 0.768
5.304GlyThr: 5.304 ± 0.989
5.038GlyVal: 5.038 ± 0.824
0.53GlyTrp: 0.53 ± 0.213
3.801GlyTyr: 3.801 ± 0.513
0.0GlyXaa: 0.0 ± 0.0
His
0.796HisAla: 0.796 ± 0.252
0.088HisCys: 0.088 ± 0.085
0.884HisAsp: 0.884 ± 0.263
1.061HisGlu: 1.061 ± 0.351
0.884HisPhe: 0.884 ± 0.332
0.796HisGly: 0.796 ± 0.248
0.177HisHis: 0.177 ± 0.124
0.707HisIle: 0.707 ± 0.263
1.149HisLys: 1.149 ± 0.221
0.972HisLeu: 0.972 ± 0.301
0.442HisMet: 0.442 ± 0.172
1.061HisAsn: 1.061 ± 0.29
0.265HisPro: 0.265 ± 0.108
0.972HisGln: 0.972 ± 0.274
0.354HisArg: 0.354 ± 0.152
0.884HisSer: 0.884 ± 0.274
0.442HisThr: 0.442 ± 0.178
1.326HisVal: 1.326 ± 0.323
0.177HisTrp: 0.177 ± 0.168
0.796HisTyr: 0.796 ± 0.265
0.0HisXaa: 0.0 ± 0.0
Ile
3.889IleAla: 3.889 ± 0.571
0.707IleCys: 0.707 ± 0.262
4.508IleAsp: 4.508 ± 0.609
5.304IleGlu: 5.304 ± 0.629
2.829IlePhe: 2.829 ± 0.53
4.773IleGly: 4.773 ± 0.597
1.679IleHis: 1.679 ± 0.48
3.713IleIle: 3.713 ± 0.585
6.276IleLys: 6.276 ± 0.767
4.508IleLeu: 4.508 ± 0.612
1.326IleMet: 1.326 ± 0.299
5.392IleAsn: 5.392 ± 0.565
1.591IlePro: 1.591 ± 0.328
3.271IleGln: 3.271 ± 0.492
2.917IleArg: 2.917 ± 0.564
5.038IleSer: 5.038 ± 0.618
4.508IleThr: 4.508 ± 0.546
3.713IleVal: 3.713 ± 0.574
0.972IleTrp: 0.972 ± 0.304
2.563IleTyr: 2.563 ± 0.521
0.0IleXaa: 0.0 ± 0.0
Lys
6.188LysAla: 6.188 ± 1.037
0.265LysCys: 0.265 ± 0.221
5.392LysAsp: 5.392 ± 0.595
5.127LysGlu: 5.127 ± 0.622
3.801LysPhe: 3.801 ± 0.468
4.862LysGly: 4.862 ± 0.567
1.149LysHis: 1.149 ± 0.296
4.596LysIle: 4.596 ± 0.485
6.453LysLys: 6.453 ± 0.911
8.044LysLeu: 8.044 ± 1.02
2.298LysMet: 2.298 ± 0.385
5.569LysAsn: 5.569 ± 0.666
2.121LysPro: 2.121 ± 0.352
4.243LysGln: 4.243 ± 0.585
2.74LysArg: 2.74 ± 0.554
4.95LysSer: 4.95 ± 0.701
5.304LysThr: 5.304 ± 0.667
3.713LysVal: 3.713 ± 0.512
0.884LysTrp: 0.884 ± 0.348
2.917LysTyr: 2.917 ± 0.552
0.0LysXaa: 0.0 ± 0.0
Leu
4.508LeuAla: 4.508 ± 0.594
0.53LeuCys: 0.53 ± 0.17
4.685LeuAsp: 4.685 ± 0.526
4.42LeuGlu: 4.42 ± 0.669
3.271LeuPhe: 3.271 ± 0.53
5.038LeuGly: 5.038 ± 0.635
0.884LeuHis: 0.884 ± 0.235
4.243LeuIle: 4.243 ± 0.671
6.718LeuLys: 6.718 ± 0.904
6.541LeuLeu: 6.541 ± 0.927
1.591LeuMet: 1.591 ± 0.332
5.569LeuAsn: 5.569 ± 0.726
3.005LeuPro: 3.005 ± 0.546
3.801LeuGln: 3.801 ± 0.502
1.945LeuArg: 1.945 ± 0.41
7.69LeuSer: 7.69 ± 0.546
5.834LeuThr: 5.834 ± 0.846
2.917LeuVal: 2.917 ± 0.425
0.796LeuTrp: 0.796 ± 0.301
2.563LeuTyr: 2.563 ± 0.359
0.0LeuXaa: 0.0 ± 0.0
Met
1.856MetAla: 1.856 ± 0.336
0.088MetCys: 0.088 ± 0.106
1.503MetAsp: 1.503 ± 0.307
1.856MetGlu: 1.856 ± 0.407
0.707MetPhe: 0.707 ± 0.216
1.679MetGly: 1.679 ± 0.354
0.265MetHis: 0.265 ± 0.128
1.503MetIle: 1.503 ± 0.309
1.945MetLys: 1.945 ± 0.463
1.061MetLeu: 1.061 ± 0.28
0.442MetMet: 0.442 ± 0.165
2.21MetAsn: 2.21 ± 0.414
0.796MetPro: 0.796 ± 0.278
1.326MetGln: 1.326 ± 0.318
1.061MetArg: 1.061 ± 0.27
2.21MetSer: 2.21 ± 0.382
2.74MetThr: 2.74 ± 0.351
1.061MetVal: 1.061 ± 0.228
0.177MetTrp: 0.177 ± 0.108
0.884MetTyr: 0.884 ± 0.308
0.0MetXaa: 0.0 ± 0.0
Asn
4.243AsnAla: 4.243 ± 0.511
0.53AsnCys: 0.53 ± 0.181
3.359AsnAsp: 3.359 ± 0.429
3.624AsnGlu: 3.624 ± 0.639
2.475AsnPhe: 2.475 ± 0.44
6.099AsnGly: 6.099 ± 1.391
0.53AsnHis: 0.53 ± 0.276
4.773AsnIle: 4.773 ± 0.597
4.508AsnLys: 4.508 ± 0.481
4.596AsnLeu: 4.596 ± 0.52
2.121AsnMet: 2.121 ± 0.447
3.801AsnAsn: 3.801 ± 0.566
2.652AsnPro: 2.652 ± 0.48
3.978AsnGln: 3.978 ± 0.77
2.563AsnArg: 2.563 ± 0.457
3.889AsnSer: 3.889 ± 0.472
3.447AsnThr: 3.447 ± 0.616
3.801AsnVal: 3.801 ± 0.592
0.707AsnTrp: 0.707 ± 0.224
1.856AsnTyr: 1.856 ± 0.322
0.0AsnXaa: 0.0 ± 0.0
Pro
1.238ProAla: 1.238 ± 0.369
0.088ProCys: 0.088 ± 0.093
1.856ProAsp: 1.856 ± 0.446
1.856ProGlu: 1.856 ± 0.285
1.503ProPhe: 1.503 ± 0.335
1.061ProGly: 1.061 ± 0.281
0.619ProHis: 0.619 ± 0.223
1.591ProIle: 1.591 ± 0.395
2.563ProLys: 2.563 ± 0.44
1.856ProLeu: 1.856 ± 0.356
0.265ProMet: 0.265 ± 0.174
2.121ProAsn: 2.121 ± 0.382
0.707ProPro: 0.707 ± 0.189
0.796ProGln: 0.796 ± 0.26
0.972ProArg: 0.972 ± 0.305
1.945ProSer: 1.945 ± 0.369
1.679ProThr: 1.679 ± 0.388
2.121ProVal: 2.121 ± 0.367
0.442ProTrp: 0.442 ± 0.2
0.796ProTyr: 0.796 ± 0.217
0.0ProXaa: 0.0 ± 0.0
Gln
4.331GlnAla: 4.331 ± 0.656
0.177GlnCys: 0.177 ± 0.12
1.856GlnAsp: 1.856 ± 0.388
3.978GlnGlu: 3.978 ± 0.504
1.591GlnPhe: 1.591 ± 0.421
2.21GlnGly: 2.21 ± 0.424
0.707GlnHis: 0.707 ± 0.239
3.094GlnIle: 3.094 ± 0.451
3.359GlnLys: 3.359 ± 0.587
3.978GlnLeu: 3.978 ± 0.672
1.414GlnMet: 1.414 ± 0.342
2.563GlnAsn: 2.563 ± 0.518
1.238GlnPro: 1.238 ± 0.509
2.21GlnGln: 2.21 ± 0.583
1.768GlnArg: 1.768 ± 0.42
2.21GlnSer: 2.21 ± 0.423
2.563GlnThr: 2.563 ± 0.359
3.005GlnVal: 3.005 ± 0.522
0.707GlnTrp: 0.707 ± 0.251
1.768GlnTyr: 1.768 ± 0.335
0.0GlnXaa: 0.0 ± 0.0
Arg
2.21ArgAla: 2.21 ± 0.462
0.442ArgCys: 0.442 ± 0.166
2.21ArgAsp: 2.21 ± 0.355
2.563ArgGlu: 2.563 ± 0.575
1.768ArgPhe: 1.768 ± 0.367
2.563ArgGly: 2.563 ± 0.433
0.265ArgHis: 0.265 ± 0.171
2.563ArgIle: 2.563 ± 0.528
2.652ArgLys: 2.652 ± 0.431
4.243ArgLeu: 4.243 ± 0.795
1.503ArgMet: 1.503 ± 0.367
1.856ArgAsn: 1.856 ± 0.324
0.972ArgPro: 0.972 ± 0.329
1.414ArgGln: 1.414 ± 0.287
1.061ArgArg: 1.061 ± 0.271
1.768ArgSer: 1.768 ± 0.233
2.21ArgThr: 2.21 ± 0.357
2.298ArgVal: 2.298 ± 0.394
0.0ArgTrp: 0.0 ± 0.0
1.679ArgTyr: 1.679 ± 0.434
0.0ArgXaa: 0.0 ± 0.0
Ser
5.038SerAla: 5.038 ± 0.596
0.53SerCys: 0.53 ± 0.241
6.011SerAsp: 6.011 ± 0.572
4.773SerGlu: 4.773 ± 0.631
3.271SerPhe: 3.271 ± 0.441
5.569SerGly: 5.569 ± 0.857
1.414SerHis: 1.414 ± 0.372
4.243SerIle: 4.243 ± 0.552
4.508SerLys: 4.508 ± 0.674
5.569SerLeu: 5.569 ± 0.748
0.707SerMet: 0.707 ± 0.257
4.508SerAsn: 4.508 ± 0.554
0.796SerPro: 0.796 ± 0.287
2.74SerGln: 2.74 ± 0.472
1.945SerArg: 1.945 ± 0.316
6.099SerSer: 6.099 ± 0.865
4.42SerThr: 4.42 ± 0.641
4.42SerVal: 4.42 ± 0.497
0.796SerTrp: 0.796 ± 0.204
2.652SerTyr: 2.652 ± 0.454
0.0SerXaa: 0.0 ± 0.0
Thr
5.127ThrAla: 5.127 ± 0.536
0.442ThrCys: 0.442 ± 0.223
4.596ThrAsp: 4.596 ± 0.596
4.42ThrGlu: 4.42 ± 0.611
3.536ThrPhe: 3.536 ± 0.456
5.215ThrGly: 5.215 ± 0.881
0.796ThrHis: 0.796 ± 0.226
4.95ThrIle: 4.95 ± 0.572
4.862ThrLys: 4.862 ± 0.598
3.536ThrLeu: 3.536 ± 0.537
1.149ThrMet: 1.149 ± 0.336
3.271ThrAsn: 3.271 ± 0.472
1.679ThrPro: 1.679 ± 0.315
2.298ThrGln: 2.298 ± 0.342
2.829ThrArg: 2.829 ± 0.465
3.801ThrSer: 3.801 ± 0.744
4.331ThrThr: 4.331 ± 0.655
4.95ThrVal: 4.95 ± 0.915
1.238ThrTrp: 1.238 ± 0.351
1.856ThrTyr: 1.856 ± 0.494
0.0ThrXaa: 0.0 ± 0.0
Val
4.066ValAla: 4.066 ± 0.674
0.354ValCys: 0.354 ± 0.184
3.978ValAsp: 3.978 ± 0.508
4.685ValGlu: 4.685 ± 0.579
2.917ValPhe: 2.917 ± 0.448
3.094ValGly: 3.094 ± 0.499
0.619ValHis: 0.619 ± 0.239
4.596ValIle: 4.596 ± 0.604
4.508ValLys: 4.508 ± 0.598
4.508ValLeu: 4.508 ± 0.627
1.326ValMet: 1.326 ± 0.361
3.801ValAsn: 3.801 ± 0.557
1.326ValPro: 1.326 ± 0.318
2.652ValGln: 2.652 ± 0.483
1.591ValArg: 1.591 ± 0.373
4.95ValSer: 4.95 ± 0.528
3.447ValThr: 3.447 ± 0.788
3.801ValVal: 3.801 ± 0.445
1.061ValTrp: 1.061 ± 0.267
2.033ValTyr: 2.033 ± 0.462
0.0ValXaa: 0.0 ± 0.0
Trp
1.326TrpAla: 1.326 ± 0.306
0.0TrpCys: 0.0 ± 0.0
0.796TrpAsp: 0.796 ± 0.27
0.884TrpGlu: 0.884 ± 0.295
0.796TrpPhe: 0.796 ± 0.257
0.884TrpGly: 0.884 ± 0.328
0.088TrpHis: 0.088 ± 0.075
0.796TrpIle: 0.796 ± 0.206
1.326TrpLys: 1.326 ± 0.309
0.796TrpLeu: 0.796 ± 0.223
0.265TrpMet: 0.265 ± 0.142
0.619TrpAsn: 0.619 ± 0.219
0.354TrpPro: 0.354 ± 0.165
0.972TrpGln: 0.972 ± 0.298
0.796TrpArg: 0.796 ± 0.258
0.619TrpSer: 0.619 ± 0.256
1.061TrpThr: 1.061 ± 0.422
0.796TrpVal: 0.796 ± 0.256
0.354TrpTrp: 0.354 ± 0.147
0.796TrpTyr: 0.796 ± 0.321
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.387TyrAla: 2.387 ± 0.369
0.442TyrCys: 0.442 ± 0.236
2.74TyrAsp: 2.74 ± 0.596
2.21TyrGlu: 2.21 ± 0.455
2.121TyrPhe: 2.121 ± 0.413
2.121TyrGly: 2.121 ± 0.475
0.354TyrHis: 0.354 ± 0.241
2.563TyrIle: 2.563 ± 0.435
2.829TyrLys: 2.829 ± 0.522
2.917TyrLeu: 2.917 ± 0.521
0.796TyrMet: 0.796 ± 0.299
1.679TyrAsn: 1.679 ± 0.367
1.503TyrPro: 1.503 ± 0.374
1.591TyrGln: 1.591 ± 0.322
1.856TyrArg: 1.856 ± 0.353
3.182TyrSer: 3.182 ± 0.46
2.652TyrThr: 2.652 ± 0.535
1.945TyrVal: 1.945 ± 0.405
0.972TyrTrp: 0.972 ± 0.242
0.972TyrTyr: 0.972 ± 0.265
0.088TyrXaa: 0.088 ± 0.086
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.088XaaIle: 0.088 ± 0.086
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 53 proteins (11314 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski