Amino acid dipeptide frequency for Lactococcus phage 38502

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 amino acid dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones are marked in blue in the table.

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
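As a rough illustration of how such per mille values could be computed, here is a minimal Python sketch. It is not the database's own code: the function name `dipeptide_permille`, the mapping of every non-standard letter to X (Xaa), the counting of overlapping pairs within each protein, and the normalization by the total number of dipeptides are all assumptions made for this example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)

# Uniform baseline mentioned in the text: 1 of 400 dipeptides = 2.5 per mille.
UNIFORM_EXPECTATION = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Hypothetical helper: normalization by the total dipeptide count
    is an assumption, not a documented detail of the source table.
    """
    alphabet = STANDARD + "X"
    # Start every one of the 441 possible pairs at zero so absent ones are reported.
    counts = Counter({a + b: 0 for a, b in product(alphabet, repeat=2)})
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Count overlapping pairs; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return dict.fromkeys(counts, 0.0)
    return {dp: 1000.0 * n / total for dp, n in counts.items()}
```

Calling `dipeptide_permille(sequences)["KK"]` would then give the LysLys value in per mille, which can be compared against `UNIFORM_EXPECTATION` to decide whether a dipeptide is over- or underrepresented.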

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
4.044AlaAla: 4.044 ± 0.827
0.481AlaCys: 0.481 ± 0.218
4.332AlaAsp: 4.332 ± 0.634
3.947AlaGlu: 3.947 ± 0.575
2.118AlaPhe: 2.118 ± 0.412
4.429AlaGly: 4.429 ± 0.886
1.059AlaHis: 1.059 ± 0.283
4.91AlaIle: 4.91 ± 0.797
5.006AlaLys: 5.006 ± 0.726
5.103AlaLeu: 5.103 ± 0.915
1.925AlaMet: 1.925 ± 0.289
3.37AlaAsn: 3.37 ± 0.53
1.925AlaPro: 1.925 ± 0.481
2.599AlaGln: 2.599 ± 0.45
2.022AlaArg: 2.022 ± 0.49
3.466AlaSer: 3.466 ± 0.635
3.658AlaThr: 3.658 ± 0.51
4.236AlaVal: 4.236 ± 0.542
1.059AlaTrp: 1.059 ± 0.325
1.54AlaTyr: 1.54 ± 0.499
0.0AlaXaa: 0.0 ± 0.0
Cys
0.385CysAla: 0.385 ± 0.264
0.193CysCys: 0.193 ± 0.149
0.674CysAsp: 0.674 ± 0.208
0.289CysGlu: 0.289 ± 0.161
0.289CysPhe: 0.289 ± 0.168
0.578CysGly: 0.578 ± 0.208
0.193CysHis: 0.193 ± 0.131
0.289CysIle: 0.289 ± 0.175
0.578CysLys: 0.578 ± 0.287
0.77CysLeu: 0.77 ± 0.294
0.0CysMet: 0.0 ± 0.0
0.578CysAsn: 0.578 ± 0.238
0.096CysPro: 0.096 ± 0.109
0.289CysGln: 0.289 ± 0.159
0.578CysArg: 0.578 ± 0.276
0.77CysSer: 0.77 ± 0.277
0.193CysThr: 0.193 ± 0.119
0.193CysVal: 0.193 ± 0.111
0.193CysTrp: 0.193 ± 0.128
0.289CysTyr: 0.289 ± 0.142
0.0CysXaa: 0.0 ± 0.0
Asp
2.984AspAla: 2.984 ± 0.484
0.481AspCys: 0.481 ± 0.192
4.044AspAsp: 4.044 ± 0.829
4.621AspGlu: 4.621 ± 0.74
3.177AspPhe: 3.177 ± 0.549
5.391AspGly: 5.391 ± 1.161
0.77AspHis: 0.77 ± 0.237
3.947AspIle: 3.947 ± 0.546
5.006AspLys: 5.006 ± 0.787
4.717AspLeu: 4.717 ± 0.738
2.503AspMet: 2.503 ± 0.491
3.658AspAsn: 3.658 ± 0.499
1.733AspPro: 1.733 ± 0.42
0.578AspGln: 0.578 ± 0.209
2.118AspArg: 2.118 ± 0.42
4.429AspSer: 4.429 ± 0.628
2.888AspThr: 2.888 ± 0.583
2.984AspVal: 2.984 ± 0.604
1.444AspTrp: 1.444 ± 0.442
2.214AspTyr: 2.214 ± 0.488
0.0AspXaa: 0.0 ± 0.0
Glu
4.14GluAla: 4.14 ± 0.571
0.578GluCys: 0.578 ± 0.23
2.984GluAsp: 2.984 ± 0.616
4.14GluGlu: 4.14 ± 0.973
2.696GluPhe: 2.696 ± 0.54
2.214GluGly: 2.214 ± 0.386
1.252GluHis: 1.252 ± 0.344
5.103GluIle: 5.103 ± 0.734
7.798GluLys: 7.798 ± 1.322
7.124GluLeu: 7.124 ± 0.901
2.311GluMet: 2.311 ± 0.41
4.429GluAsn: 4.429 ± 0.609
1.925GluPro: 1.925 ± 0.437
3.273GluGln: 3.273 ± 0.519
2.888GluArg: 2.888 ± 0.613
3.081GluSer: 3.081 ± 0.433
3.658GluThr: 3.658 ± 0.611
4.14GluVal: 4.14 ± 0.759
1.059GluTrp: 1.059 ± 0.265
2.022GluTyr: 2.022 ± 0.466
0.0GluXaa: 0.0 ± 0.0
Phe
2.696PheAla: 2.696 ± 0.531
0.193PheCys: 0.193 ± 0.143
2.792PheAsp: 2.792 ± 0.529
3.081PheGlu: 3.081 ± 0.584
1.829PhePhe: 1.829 ± 0.417
2.696PheGly: 2.696 ± 0.564
0.578PheHis: 0.578 ± 0.276
3.273PheIle: 3.273 ± 0.469
3.37PheLys: 3.37 ± 0.562
2.984PheLeu: 2.984 ± 0.446
1.54PheMet: 1.54 ± 0.387
3.466PheAsn: 3.466 ± 0.53
0.963PhePro: 0.963 ± 0.267
1.348PheGln: 1.348 ± 0.552
1.348PheArg: 1.348 ± 0.322
3.37PheSer: 3.37 ± 0.596
3.081PheThr: 3.081 ± 0.701
2.599PheVal: 2.599 ± 0.488
0.193PheTrp: 0.193 ± 0.144
2.503PheTyr: 2.503 ± 0.478
0.0PheXaa: 0.0 ± 0.0
Gly
3.273GlyAla: 3.273 ± 0.563
0.193GlyCys: 0.193 ± 0.135
3.177GlyAsp: 3.177 ± 0.525
3.947GlyGlu: 3.947 ± 0.578
3.466GlyPhe: 3.466 ± 0.686
4.621GlyGly: 4.621 ± 1.056
0.866GlyHis: 0.866 ± 0.351
5.873GlyIle: 5.873 ± 1.153
5.68GlyLys: 5.68 ± 0.623
5.199GlyLeu: 5.199 ± 0.544
1.252GlyMet: 1.252 ± 0.323
3.755GlyAsn: 3.755 ± 0.584
1.155GlyPro: 1.155 ± 0.403
3.37GlyGln: 3.37 ± 0.612
1.733GlyArg: 1.733 ± 0.318
3.851GlySer: 3.851 ± 0.974
6.065GlyThr: 6.065 ± 1.12
4.044GlyVal: 4.044 ± 0.829
0.77GlyTrp: 0.77 ± 0.237
3.466GlyTyr: 3.466 ± 0.553
0.0GlyXaa: 0.0 ± 0.0
His
1.059HisAla: 1.059 ± 0.357
0.193HisCys: 0.193 ± 0.124
0.674HisAsp: 0.674 ± 0.231
1.348HisGlu: 1.348 ± 0.371
0.481HisPhe: 0.481 ± 0.191
0.77HisGly: 0.77 ± 0.286
0.481HisHis: 0.481 ± 0.172
1.059HisIle: 1.059 ± 0.313
1.155HisLys: 1.155 ± 0.299
1.059HisLeu: 1.059 ± 0.361
0.385HisMet: 0.385 ± 0.189
1.059HisAsn: 1.059 ± 0.232
0.578HisPro: 0.578 ± 0.232
0.289HisGln: 0.289 ± 0.181
0.578HisArg: 0.578 ± 0.225
1.059HisSer: 1.059 ± 0.328
0.385HisThr: 0.385 ± 0.195
0.963HisVal: 0.963 ± 0.324
0.289HisTrp: 0.289 ± 0.156
0.674HisTyr: 0.674 ± 0.235
0.0HisXaa: 0.0 ± 0.0
Ile
4.717IleAla: 4.717 ± 0.539
0.578IleCys: 0.578 ± 0.26
4.332IleAsp: 4.332 ± 0.63
5.488IleGlu: 5.488 ± 0.861
2.599IlePhe: 2.599 ± 0.614
4.236IleGly: 4.236 ± 0.649
1.155IleHis: 1.155 ± 0.38
5.391IleIle: 5.391 ± 0.766
7.124IleLys: 7.124 ± 0.992
4.621IleLeu: 4.621 ± 0.651
1.444IleMet: 1.444 ± 0.337
5.295IleAsn: 5.295 ± 0.85
2.311IlePro: 2.311 ± 0.437
2.696IleGln: 2.696 ± 0.545
2.407IleArg: 2.407 ± 0.493
7.221IleSer: 7.221 ± 0.77
4.429IleThr: 4.429 ± 0.878
3.658IleVal: 3.658 ± 0.512
0.481IleTrp: 0.481 ± 0.224
1.925IleTyr: 1.925 ± 0.436
0.0IleXaa: 0.0 ± 0.0
Lys
5.969LysAla: 5.969 ± 0.621
0.674LysCys: 0.674 ± 0.221
5.103LysAsp: 5.103 ± 0.679
6.065LysGlu: 6.065 ± 1.118
3.273LysPhe: 3.273 ± 0.612
4.814LysGly: 4.814 ± 0.651
1.637LysHis: 1.637 ± 0.539
5.776LysIle: 5.776 ± 0.792
9.82LysLys: 9.82 ± 1.459
8.857LysLeu: 8.857 ± 0.986
2.311LysMet: 2.311 ± 0.43
5.488LysAsn: 5.488 ± 0.892
2.022LysPro: 2.022 ± 0.497
3.273LysGln: 3.273 ± 0.507
3.947LysArg: 3.947 ± 0.716
5.295LysSer: 5.295 ± 0.891
5.488LysThr: 5.488 ± 0.68
5.584LysVal: 5.584 ± 0.851
1.348LysTrp: 1.348 ± 0.394
4.621LysTyr: 4.621 ± 0.744
0.0LysXaa: 0.0 ± 0.0
Leu
5.295LeuAla: 5.295 ± 0.905
0.193LeuCys: 0.193 ± 0.125
4.621LeuAsp: 4.621 ± 0.605
5.006LeuGlu: 5.006 ± 0.639
3.755LeuPhe: 3.755 ± 0.508
5.199LeuGly: 5.199 ± 0.921
0.77LeuHis: 0.77 ± 0.327
5.006LeuIle: 5.006 ± 0.588
8.568LeuLys: 8.568 ± 1.314
4.91LeuLeu: 4.91 ± 0.675
2.118LeuMet: 2.118 ± 0.453
4.91LeuAsn: 4.91 ± 0.708
4.14LeuPro: 4.14 ± 0.647
3.755LeuGln: 3.755 ± 0.584
2.503LeuArg: 2.503 ± 0.616
6.547LeuSer: 6.547 ± 0.707
5.584LeuThr: 5.584 ± 0.724
3.562LeuVal: 3.562 ± 0.426
1.155LeuTrp: 1.155 ± 0.367
1.733LeuTyr: 1.733 ± 0.493
0.0LeuXaa: 0.0 ± 0.0
Met
1.059MetAla: 1.059 ± 0.348
0.385MetCys: 0.385 ± 0.198
1.155MetAsp: 1.155 ± 0.332
2.503MetGlu: 2.503 ± 0.377
0.674MetPhe: 0.674 ± 0.265
1.059MetGly: 1.059 ± 0.27
0.193MetHis: 0.193 ± 0.143
1.733MetIle: 1.733 ± 0.436
2.888MetLys: 2.888 ± 0.573
1.54MetLeu: 1.54 ± 0.332
0.578MetMet: 0.578 ± 0.248
1.733MetAsn: 1.733 ± 0.432
0.77MetPro: 0.77 ± 0.293
1.155MetGln: 1.155 ± 0.332
0.77MetArg: 0.77 ± 0.253
2.599MetSer: 2.599 ± 0.63
2.696MetThr: 2.696 ± 0.522
0.866MetVal: 0.866 ± 0.315
0.385MetTrp: 0.385 ± 0.211
0.578MetTyr: 0.578 ± 0.19
0.0MetXaa: 0.0 ± 0.0
Asn
3.466AsnAla: 3.466 ± 0.565
0.578AsnCys: 0.578 ± 0.221
2.984AsnAsp: 2.984 ± 0.56
3.177AsnGlu: 3.177 ± 0.619
3.466AsnPhe: 3.466 ± 0.703
5.488AsnGly: 5.488 ± 1.196
1.252AsnHis: 1.252 ± 0.481
4.044AsnIle: 4.044 ± 0.694
5.006AsnLys: 5.006 ± 0.77
3.851AsnLeu: 3.851 ± 0.544
0.963AsnMet: 0.963 ± 0.356
4.429AsnAsn: 4.429 ± 0.713
2.503AsnPro: 2.503 ± 0.581
3.37AsnGln: 3.37 ± 0.773
2.311AsnArg: 2.311 ± 0.419
3.851AsnSer: 3.851 ± 0.511
3.273AsnThr: 3.273 ± 0.774
4.236AsnVal: 4.236 ± 0.624
0.481AsnTrp: 0.481 ± 0.209
3.081AsnTyr: 3.081 ± 0.843
0.0AsnXaa: 0.0 ± 0.0
Pro
1.252ProAla: 1.252 ± 0.375
0.0ProCys: 0.0 ± 0.0
2.118ProAsp: 2.118 ± 0.51
1.733ProGlu: 1.733 ± 0.406
1.925ProPhe: 1.925 ± 0.374
0.963ProGly: 0.963 ± 0.412
0.289ProHis: 0.289 ± 0.159
2.214ProIle: 2.214 ± 0.394
2.214ProLys: 2.214 ± 0.535
3.658ProLeu: 3.658 ± 0.543
0.77ProMet: 0.77 ± 0.218
1.54ProAsn: 1.54 ± 0.441
0.963ProPro: 0.963 ± 0.27
1.925ProGln: 1.925 ± 0.364
0.674ProArg: 0.674 ± 0.265
2.407ProSer: 2.407 ± 0.451
2.214ProThr: 2.214 ± 0.527
2.214ProVal: 2.214 ± 0.477
0.289ProTrp: 0.289 ± 0.15
1.155ProTyr: 1.155 ± 0.305
0.0ProXaa: 0.0 ± 0.0
Gln
3.081GlnAla: 3.081 ± 0.571
0.193GlnCys: 0.193 ± 0.132
1.733GlnAsp: 1.733 ± 0.422
2.599GlnGlu: 2.599 ± 0.607
2.311GlnPhe: 2.311 ± 0.434
2.407GlnGly: 2.407 ± 0.699
0.77GlnHis: 0.77 ± 0.248
1.829GlnIle: 1.829 ± 0.437
4.236GlnLys: 4.236 ± 0.642
3.177GlnLeu: 3.177 ± 0.439
0.77GlnMet: 0.77 ± 0.247
2.599GlnAsn: 2.599 ± 0.642
1.252GlnPro: 1.252 ± 0.436
3.37GlnGln: 3.37 ± 1.261
2.214GlnArg: 2.214 ± 0.514
3.37GlnSer: 3.37 ± 0.521
2.407GlnThr: 2.407 ± 0.546
1.925GlnVal: 1.925 ± 0.376
0.674GlnTrp: 0.674 ± 0.246
1.54GlnTyr: 1.54 ± 0.364
0.0GlnXaa: 0.0 ± 0.0
Arg
2.118ArgAla: 2.118 ± 0.323
0.963ArgCys: 0.963 ± 0.308
2.214ArgAsp: 2.214 ± 0.443
2.022ArgGlu: 2.022 ± 0.411
1.252ArgPhe: 1.252 ± 0.382
1.637ArgGly: 1.637 ± 0.368
0.193ArgHis: 0.193 ± 0.119
3.466ArgIle: 3.466 ± 0.601
3.658ArgLys: 3.658 ± 0.805
3.658ArgLeu: 3.658 ± 0.533
0.578ArgMet: 0.578 ± 0.232
2.407ArgAsn: 2.407 ± 0.507
0.77ArgPro: 0.77 ± 0.279
1.733ArgGln: 1.733 ± 0.482
1.733ArgArg: 1.733 ± 0.447
1.733ArgSer: 1.733 ± 0.503
1.925ArgThr: 1.925 ± 0.43
1.637ArgVal: 1.637 ± 0.302
0.481ArgTrp: 0.481 ± 0.176
2.214ArgTyr: 2.214 ± 0.473
0.0ArgXaa: 0.0 ± 0.0
Ser
5.488SerAla: 5.488 ± 1.227
0.385SerCys: 0.385 ± 0.251
5.584SerAsp: 5.584 ± 0.727
5.103SerGlu: 5.103 ± 0.613
2.984SerPhe: 2.984 ± 0.536
6.643SerGly: 6.643 ± 1.101
0.866SerHis: 0.866 ± 0.262
3.947SerIle: 3.947 ± 0.686
6.258SerLys: 6.258 ± 0.748
4.717SerLeu: 4.717 ± 0.998
2.214SerMet: 2.214 ± 0.486
3.947SerAsn: 3.947 ± 0.759
1.444SerPro: 1.444 ± 0.325
2.599SerGln: 2.599 ± 0.534
3.37SerArg: 3.37 ± 0.543
4.332SerSer: 4.332 ± 0.616
3.081SerThr: 3.081 ± 0.723
3.947SerVal: 3.947 ± 0.578
0.674SerTrp: 0.674 ± 0.254
2.599SerTyr: 2.599 ± 0.455
0.0SerXaa: 0.0 ± 0.0
Thr
3.658ThrAla: 3.658 ± 0.636
0.481ThrCys: 0.481 ± 0.19
4.14ThrAsp: 4.14 ± 0.578
4.621ThrGlu: 4.621 ± 0.503
2.503ThrPhe: 2.503 ± 0.445
5.199ThrGly: 5.199 ± 0.781
1.155ThrHis: 1.155 ± 0.253
5.969ThrIle: 5.969 ± 0.669
5.006ThrLys: 5.006 ± 0.633
5.103ThrLeu: 5.103 ± 0.625
0.963ThrMet: 0.963 ± 0.337
3.273ThrAsn: 3.273 ± 0.699
2.599ThrPro: 2.599 ± 0.447
2.118ThrGln: 2.118 ± 0.475
1.444ThrArg: 1.444 ± 0.352
3.37ThrSer: 3.37 ± 0.613
4.236ThrThr: 4.236 ± 1.237
5.103ThrVal: 5.103 ± 1.064
0.674ThrTrp: 0.674 ± 0.25
0.963ThrTyr: 0.963 ± 0.279
0.0ThrXaa: 0.0 ± 0.0
Val
3.562ValAla: 3.562 ± 0.577
0.096ValCys: 0.096 ± 0.093
3.755ValAsp: 3.755 ± 0.54
4.429ValGlu: 4.429 ± 0.783
2.696ValPhe: 2.696 ± 0.449
4.429ValGly: 4.429 ± 0.871
0.481ValHis: 0.481 ± 0.198
4.814ValIle: 4.814 ± 0.81
3.947ValLys: 3.947 ± 0.546
4.14ValLeu: 4.14 ± 0.705
0.866ValMet: 0.866 ± 0.227
3.081ValAsn: 3.081 ± 0.588
2.503ValPro: 2.503 ± 0.478
1.829ValGln: 1.829 ± 0.397
1.637ValArg: 1.637 ± 0.386
4.717ValSer: 4.717 ± 0.682
4.717ValThr: 4.717 ± 0.716
4.236ValVal: 4.236 ± 0.579
0.674ValTrp: 0.674 ± 0.173
1.637ValTyr: 1.637 ± 0.357
0.0ValXaa: 0.0 ± 0.0
Trp
0.963TrpAla: 0.963 ± 0.301
0.289TrpCys: 0.289 ± 0.152
0.385TrpAsp: 0.385 ± 0.24
0.77TrpGlu: 0.77 ± 0.269
0.481TrpPhe: 0.481 ± 0.202
0.578TrpGly: 0.578 ± 0.297
0.0TrpHis: 0.0 ± 0.0
1.155TrpIle: 1.155 ± 0.315
1.444TrpLys: 1.444 ± 0.335
1.252TrpLeu: 1.252 ± 0.337
0.385TrpMet: 0.385 ± 0.189
1.059TrpAsn: 1.059 ± 0.305
0.096TrpPro: 0.096 ± 0.101
0.77TrpGln: 0.77 ± 0.314
0.481TrpArg: 0.481 ± 0.182
1.252TrpSer: 1.252 ± 0.303
0.674TrpThr: 0.674 ± 0.252
0.866TrpVal: 0.866 ± 0.254
0.193TrpTrp: 0.193 ± 0.123
0.481TrpTyr: 0.481 ± 0.174
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.022TyrAla: 2.022 ± 0.439
0.289TyrCys: 0.289 ± 0.157
2.888TyrAsp: 2.888 ± 0.499
1.925TyrGlu: 1.925 ± 0.411
1.925TyrPhe: 1.925 ± 0.415
2.407TyrGly: 2.407 ± 0.479
0.578TyrHis: 0.578 ± 0.233
2.311TyrIle: 2.311 ± 0.42
2.599TyrLys: 2.599 ± 0.547
2.984TyrLeu: 2.984 ± 0.63
1.348TyrMet: 1.348 ± 0.355
1.733TyrAsn: 1.733 ± 0.361
0.77TyrPro: 0.77 ± 0.301
2.118TyrGln: 2.118 ± 0.464
1.637TyrArg: 1.637 ± 0.401
3.466TyrSer: 3.466 ± 0.538
1.925TyrThr: 1.925 ± 0.36
1.252TyrVal: 1.252 ± 0.307
1.059TyrTrp: 1.059 ± 0.367
1.54TyrTyr: 1.54 ± 0.386
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 60 proteins (10388 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
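Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resampled set, and the spread of the replicates is taken as the error. This is an illustrative sketch only. The helper names `permille` and `bootstrap_error`, the fixed seed, and the use of the sample standard deviation as the reported error are assumptions; the source states only that 100 bootstrap replicates were drawn at the protein level.

```python
import random
import statistics


def permille(proteins, dipeptide):
    """Frequency of one dipeptide in per mille (overlapping pairs, per protein)."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Assumption: the error is the sample standard deviation across replicates.
    """
    rng = random.Random(seed)  # seeded only to make the sketch reproducible
    values = []
    for _ in range(replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        values.append(permille(sample, dipeptide))
    return statistics.stdev(values)
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins. A dipeptide that never occurs gets a frequency of 0 in every replicate and therefore an error of 0, as in the 0.0 ± 0.0 entries of the table.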

The above dipeptide statistics (among other stats for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski